Query gi|254780546|ref|YP_003064959.1| hypothetical protein CLIBASIA_02165 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 423 No_of_seqs 202 out of 4812 Neff 8.1 Searched_HMMs 39220 Date Mon May 30 01:01:00 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780546.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK10660 tilS tRNA(Ile)-lysidi 100.0 0 0 466.0 31.1 391 14-415 8-415 (433) 2 TIGR02432 lysidine_TilS_N tRNA 100.0 0 0 405.0 16.4 188 21-210 1-204 (204) 3 cd01992 PP-ATPase N-terminal d 100.0 0 0 361.0 18.0 184 21-210 1-185 (185) 4 pfam01171 ATP_bind_3 PP-loop f 100.0 0 0 359.4 16.7 181 21-207 1-182 (182) 5 cd01993 Alpha_ANH_like_II This 100.0 1.7E-40 4.2E-45 305.4 17.1 175 21-198 1-184 (185) 6 COG0037 MesJ tRNA(Ile)-lysidin 100.0 2.2E-39 5.5E-44 297.6 20.7 190 19-215 21-215 (298) 7 PRK10696 C32 tRNA thiolase; Pr 100.0 3.4E-32 8.7E-37 246.7 19.6 194 7-212 28-232 (311) 8 cd01990 Alpha_ANH_like_I This 99.7 2.4E-16 6.2E-21 134.6 10.1 149 22-197 1-157 (202) 9 COG1606 ATP-utilizing enzymes 99.7 2.5E-15 6.4E-20 127.4 12.6 167 1-197 2-175 (269) 10 KOG2840 consensus 99.6 2.7E-14 6.8E-19 120.2 9.5 201 14-215 46-260 (347) 11 PRK00143 trmU tRNA (5-methylam 99.5 2.4E-14 6E-19 120.5 8.0 172 17-200 2-199 (355) 12 cd01998 tRNA_Me_trans tRNA met 99.5 8.9E-14 2.3E-18 116.4 6.9 168 21-200 1-195 (349) 13 pfam03054 tRNA_Me_trans tRNA m 99.5 1.7E-13 4.4E-18 114.5 7.1 168 21-200 2-197 (354) 14 TIGR00268 TIGR00268 conserved 99.4 2.4E-12 6.2E-17 106.3 11.7 164 6-197 2-176 (263) 15 COG0482 TrmU Predicted tRNA(5- 99.4 2.6E-12 6.6E-17 106.1 9.5 172 18-200 2-197 (356) 16 PRK00919 GMP synthase subunit 99.2 1.6E-10 4.1E-15 93.4 11.8 164 4-187 2-177 (306) 17 TIGR00884 guaA_Cterm GMP synth 99.2 8.4E-11 2.1E-15 95.4 9.7 168 8-188 6-188 (319) 18 cd01712 ThiI ThiI is required 99.2 1.5E-10 3.8E-15 93.7 10.7 144 21-186 1-156 (177) 19 cd01713 PAPS_reductase This do 99.2 6.1E-10 1.5E-14 89.3 10.9 149 21-189 1-162 (173) 20 PRK00074 guaA GMP synthase; Re 99.0 3.1E-09 8E-14 84.3 10.6 156 14-187 211-382 (513) 21 pfam01507 PAPS_reduct Phosphoa 99.0 2.4E-09 6.2E-14 85.1 10.1 147 21-190 1-157 (174) 22 cd01997 GMP_synthase_C The C-t 99.0 2.4E-09 6.2E-14 85.1 9.6 152 21-187 1-165 (295) 23 KOG2805 consensus 99.0 8.5E-10 2.2E-14 88.3 7.2 173 16-200 2-205 (377) 24 pfam02568 ThiI Thiamine biosyn 99.0 3.6E-09 9.3E-14 83.8 10.1 151 19-191 3-166 (197) 25 COG0519 GuaA GMP synthase, PP- 99.0 5.2E-09 1.3E-13 82.8 10.7 161 14-188 16-185 (315) 26 COG0603 Predicted PP-loop supe 99.0 6.1E-09 1.5E-13 82.3 10.9 150 19-189 2-181 (222) 27 cd01995 ExsB ExsB is a transcr 99.0 5.5E-09 1.4E-13 82.6 10.2 136 21-190 1-142 (169) 28 PRK11106 queuosine biosynthesi 99.0 4.9E-09 1.3E-13 82.9 9.5 148 20-188 2-182 (231) 29 pfam06508 ExsB ExsB. This fami 98.9 3E-09 7.6E-14 84.5 7.5 102 21-131 1-125 (137) 30 TIGR00420 trmU tRNA (5-methyla 98.9 6.3E-09 1.6E-13 82.2 8.0 168 20-199 1-215 (394) 31 TIGR00364 TIGR00364 exsB prote 98.8 2.6E-08 6.7E-13 77.8 9.1 151 23-185 2-189 (227) 32 PRK08349 hypothetical protein; 98.8 1.6E-07 4E-12 72.3 11.4 142 21-186 2-158 (198) 33 COG0175 CysH 3'-phosphoadenosi 98.8 1.6E-07 4E-12 72.3 10.7 157 5-189 27-197 (261) 34 PRK13820 argininosuccinate syn 98.7 3.1E-07 7.9E-12 70.2 11.6 143 18-189 2-165 (395) 35 PRK00509 argininosuccinate syn 98.7 4.9E-07 1.3E-11 68.8 12.5 141 19-189 2-167 (398) 36 cd01986 Alpha_ANH_like Adenine 98.7 5.8E-08 1.5E-12 75.4 7.0 83 22-140 1-83 (103) 37 PRK05370 argininosuccinate syn 98.7 4.7E-07 1.2E-11 68.9 11.6 152 11-189 3-184 (447) 38 PRK05253 sulfate adenylyltrans 98.7 4.8E-07 1.2E-11 68.9 11.3 169 16-190 23-206 (300) 39 PRK08576 hypothetical protein; 98.7 2.2E-07 5.5E-12 71.3 9.1 170 8-206 225-399 (439) 40 pfam02540 NAD_synthase NAD syn 98.6 1.5E-06 3.8E-11 65.3 12.5 150 7-187 3-169 (243) 41 TIGR03573 WbuX N-acetyl sugar 98.6 7.3E-07 1.9E-11 67.6 10.0 165 20-193 60-259 (343) 42 PRK04527 argininosuccinate syn 98.6 1E-06 2.5E-11 66.6 10.2 147 19-193 3-174 (397) 43 PRK02090 phosphoadenosine phos 98.6 4.4E-06 1.1E-10 62.1 13.5 158 2-189 26-196 (243) 44 KOG2594 consensus 98.5 2.4E-07 6.2E-12 70.9 6.5 172 12-188 55-265 (396) 45 pfam00764 Arginosuc_synth Argi 98.5 1.8E-06 4.5E-11 64.9 9.9 139 23-189 1-161 (389) 46 cd00553 NAD_synthase NAD+ synt 98.5 3.1E-06 7.8E-11 63.2 11.1 140 17-187 21-178 (248) 47 PRK13794 hypothetical protein; 98.5 1.3E-06 3.3E-11 65.8 8.8 169 11-206 235-413 (473) 48 PRK01565 thiamine biosynthesis 98.5 5.3E-06 1.4E-10 61.5 11.7 146 18-186 175-332 (399) 49 PRK12563 sulfate adenylyltrans 98.4 2.8E-06 7.2E-11 63.4 9.7 170 16-190 34-217 (312) 50 PRK13795 hypothetical protein; 98.4 2.4E-06 6.2E-11 63.9 9.0 161 11-206 235-413 (630) 51 KOG1622 consensus 98.4 1.2E-06 3.1E-11 65.9 7.4 163 14-190 225-408 (552) 52 cd01999 Argininosuccinate_Synt 98.4 8E-06 2E-10 60.2 11.2 141 22-189 1-164 (385) 53 PRK08557 hypothetical protein; 98.4 4.3E-06 1.1E-10 62.1 9.6 151 19-206 183-351 (420) 54 PRK13980 NAD synthetase; Provi 98.4 1.4E-05 3.6E-10 58.5 11.9 150 7-187 13-178 (264) 55 cd01991 Asn_Synthase_B_C The C 98.4 2.4E-06 6E-11 64.0 7.9 108 15-130 11-128 (269) 56 COG0137 ArgG Argininosuccinate 98.3 2.4E-05 6.1E-10 56.9 12.7 149 18-191 3-171 (403) 57 cd01996 Alpha_ANH_like_III Thi 98.3 4.3E-06 1.1E-10 62.1 8.9 62 21-91 3-64 (154) 58 PRK00876 nadE NAD synthetase; 98.3 1.9E-05 4.9E-10 57.5 11.7 77 7-90 17-97 (325) 59 pfam00733 Asn_synthase Asparag 98.3 1.8E-05 4.7E-10 57.7 11.2 165 9-211 6-185 (195) 60 PRK06850 hypothetical protein; 98.3 3.1E-05 7.9E-10 56.0 12.3 182 3-191 3-215 (488) 61 COG2117 Predicted subunit of t 98.3 3.6E-06 9.1E-11 62.7 7.3 160 21-211 2-175 (198) 62 TIGR03183 DNA_S_dndC putative 98.3 2.5E-05 6.5E-10 56.7 11.6 170 15-191 9-207 (447) 63 TIGR00032 argG argininosuccina 98.3 4.3E-05 1.1E-09 55.1 12.7 160 21-208 1-193 (420) 64 PRK01269 thiamine biosynthesis 98.2 2.7E-05 6.8E-10 56.5 11.3 146 18-186 176-333 (483) 65 COG0301 ThiI Thiamine biosynth 98.1 6.3E-05 1.6E-09 53.9 11.4 145 18-186 174-332 (383) 66 COG0171 NadE NAD synthase [Coe 98.1 0.00012 3.1E-09 51.9 12.0 156 8-187 11-185 (268) 67 COG3969 Predicted phosphoadeno 98.1 1.4E-05 3.5E-10 58.6 6.9 168 15-188 23-232 (407) 68 PTZ00323 NAD+ synthase; Provis 98.0 0.00025 6.4E-09 49.6 11.7 84 3-89 24-113 (294) 69 cd01984 AANH_like Adenine nucl 98.0 7.7E-06 2E-10 60.3 3.7 69 22-139 1-69 (86) 70 PRK02628 nadE NAD synthetase; 97.9 0.0001 2.6E-09 52.4 9.2 71 17-90 360-431 (678) 71 PRK13981 NAD synthetase; Provi 97.9 0.00017 4.2E-09 50.9 9.1 139 17-186 279-435 (543) 72 TIGR00342 TIGR00342 thiamine b 97.8 0.00017 4.3E-09 50.8 9.0 142 20-185 187-344 (391) 73 COG1365 Predicted ATPase (PP-l 97.8 0.00011 2.9E-09 52.1 7.1 148 20-207 61-216 (255) 74 cd01994 Alpha_ANH_like_IV This 97.7 0.0016 4E-08 44.0 12.6 147 21-199 1-154 (194) 75 PRK08384 thiamine biosynthesis 97.7 0.00094 2.4E-08 45.6 10.5 130 18-185 178-307 (310) 76 TIGR03108 eps_aminotran_1 exos 97.7 0.0002 5.2E-09 50.3 7.0 70 15-92 254-323 (628) 77 COG0367 AsnB Asparagine syntha 97.6 0.00076 1.9E-08 46.2 8.8 106 15-130 226-344 (542) 78 PRK09431 asnB asparagine synth 97.6 0.00037 9.3E-09 48.5 7.1 111 15-130 223-353 (555) 79 TIGR03104 trio_amidotrans aspa 97.5 0.00027 6.9E-09 49.4 5.9 105 15-130 256-375 (589) 80 PRK00768 nadE NAD synthetase; 97.4 0.014 3.5E-07 37.3 14.1 142 17-187 38-202 (274) 81 TIGR01536 asn_synth_AEB aspara 97.4 0.00073 1.9E-08 46.4 6.8 116 15-130 307-444 (646) 82 pfam01902 ATP_bind_4 ATP-bindi 97.3 0.0083 2.1E-07 38.9 11.7 63 21-89 2-66 (219) 83 PTZ00077 asparagine synthetase 97.3 0.0021 5.4E-08 43.1 8.2 74 15-92 233-341 (610) 84 pfam09179 TilS TilS substrate 97.1 0.0036 9.1E-08 41.5 8.1 63 254-318 4-69 (69) 85 TIGR02039 CysD sulfate adenyly 97.1 0.0025 6.3E-08 42.6 7.2 68 20-92 20-87 (295) 86 KOG1706 consensus 97.0 0.0099 2.5E-07 38.3 9.3 162 17-199 3-206 (412) 87 cd01987 USP_OKCHK USP domain i 96.9 0.0038 9.8E-08 41.2 6.9 105 21-146 1-109 (124) 88 TIGR00552 nadE NAD+ synthetase 96.9 0.017 4.4E-07 36.7 10.0 156 4-186 4-188 (286) 89 COG2102 Predicted ATPases of P 96.6 0.067 1.7E-06 32.5 11.2 64 21-91 2-69 (223) 90 KOG0573 consensus 95.6 0.087 2.2E-06 31.7 7.6 38 20-61 251-288 (520) 91 KOG0571 consensus 95.2 0.15 3.8E-06 30.0 7.9 75 14-92 220-295 (543) 92 cd00293 USP_Like Usp: Universa 95.0 0.31 7.8E-06 27.8 9.2 95 21-131 1-106 (130) 93 PRK10490 sensor protein KdpD; 94.4 0.41 1E-05 26.9 8.5 94 18-128 249-346 (895) 94 TIGR00289 TIGR00289 conserved 93.2 0.32 8E-06 27.7 6.0 67 21-94 2-72 (227) 95 cd01989 STK_N The N-terminal d 93.2 0.68 1.7E-05 25.4 7.8 95 21-130 1-115 (146) 96 PRK13011 formyltetrahydrofolat 93.0 0.51 1.3E-05 26.2 6.8 26 173-198 126-151 (287) 97 PRK13010 purU formyltetrahydro 92.7 0.8 2E-05 24.9 8.2 26 173-198 129-154 (289) 98 PRK09982 universal stress prot 92.3 0.88 2.3E-05 24.5 9.0 101 20-140 4-120 (142) 99 COG2205 KdpD Osmosensitive K+ 91.1 1.2 3E-05 23.6 8.8 94 19-129 248-345 (890) 100 PRK10116 universal stress prot 90.7 1.3 3.3E-05 23.4 8.3 96 20-130 4-114 (142) 101 TIGR00434 cysH phosophoadenyly 90.5 1.3 3.3E-05 23.3 8.5 144 16-189 10-183 (226) 102 PRK06027 purU formyltetrahydro 89.1 1.7 4.3E-05 22.6 7.1 24 174-197 125-148 (285) 103 pfam10288 DUF2392 Protein of u 88.3 0.85 2.2E-05 24.7 4.3 35 160-194 28-62 (104) 104 pfam02677 DUF208 Uncharacteriz 87.5 2.1 5.4E-05 21.9 7.4 60 24-90 3-64 (176) 105 LOAD_USPA consensus 84.5 2.9 7.4E-05 20.9 9.4 94 20-129 1-108 (135) 106 cd01988 Na_H_Antiporter_C The 84.4 2.9 7.5E-05 20.8 8.0 95 21-130 1-106 (132) 107 cd01399 GlcN6P_deaminase GlcN6 83.6 3.2 8.1E-05 20.6 7.8 77 9-85 8-88 (232) 108 TIGR00655 PurU formyltetrahydr 80.9 4 0.0001 19.9 7.7 88 20-126 93-182 (294) 109 cd05565 PTS_IIB_lactose PTS_II 80.3 4.1 0.00011 19.8 7.4 68 21-89 2-80 (99) 110 COG0788 PurU Formyltetrahydrof 80.0 4.2 0.00011 19.7 7.3 23 175-197 128-150 (287) 111 PRK09590 celB cellobiose phosp 74.4 6 0.00015 18.7 7.2 75 20-95 2-89 (104) 112 TIGR01949 AroFGH_arch predicte 73.8 6.2 0.00016 18.6 4.3 75 52-129 105-181 (259) 113 cd05564 PTS_IIB_chitobiose_lic 73.0 6.4 0.00016 18.4 6.9 69 21-90 1-80 (96) 114 KOG0189 consensus 72.7 6.5 0.00017 18.4 7.1 159 3-189 32-205 (261) 115 COG1636 Uncharacterized protei 71.4 7 0.00018 18.2 9.2 65 18-89 2-68 (204) 116 PRK06696 uridine kinase; Valid 70.6 7.3 0.00018 18.1 7.0 58 6-66 11-70 (227) 117 PRK10717 cysteine synthase A; 68.7 7.6 0.00019 17.9 3.8 57 16-87 60-116 (334) 118 cd01524 RHOD_Pyr_redox Member 68.6 7.9 0.0002 17.8 5.0 51 5-64 37-87 (90) 119 pfam05889 SLA_LP_auto_ag Solub 65.5 8.7 0.00022 17.5 3.6 18 173-190 173-190 (389) 120 PRK00443 nagB glucosamine-6-ph 65.4 9.1 0.00023 17.4 7.2 90 19-112 32-126 (264) 121 TIGR00424 APS_reduc 5'-adenyly 62.0 10 0.00027 17.0 6.5 161 3-188 104-282 (469) 122 KOG3425 consensus 60.8 11 0.00028 16.8 7.2 83 2-90 8-103 (128) 123 pfam03599 CdhD CO dehydrogenas 60.3 11 0.00028 16.8 7.2 133 11-146 21-169 (384) 124 TIGR00639 PurN phosphoribosylg 60.1 11 0.00029 16.7 7.9 94 21-125 2-101 (215) 125 TIGR00873 gnd 6-phosphoglucona 59.7 6.8 0.00017 18.3 2.2 154 2-185 47-224 (480) 126 PRK07667 uridine kinase; Provi 59.6 11 0.00029 16.7 6.0 41 17-60 10-52 (190) 127 pfam01861 DUF43 Protein of unk 59.6 11 0.00029 16.7 6.3 79 2-96 24-106 (243) 128 KOG1325 consensus 58.6 11 0.00027 16.9 3.0 30 7-36 24-65 (571) 129 cd00946 FBP_aldolase_IIA Class 58.4 12 0.00031 16.5 11.5 138 7-149 27-205 (345) 130 TIGR02057 PAPS_reductase phosp 57.3 12 0.00032 16.4 13.1 175 5-189 14-197 (239) 131 PTZ00142 6-phosphogluconate de 56.3 13 0.00033 16.3 6.5 79 3-91 55-133 (474) 132 TIGR03531 selenium_SpcS O-phos 56.0 13 0.00033 16.3 5.2 23 168-190 218-240 (444) 133 PRK05647 purN phosphoribosylgl 55.6 13 0.00034 16.2 8.1 86 20-125 2-87 (200) 134 COG4586 ABC-type uncharacteriz 52.8 15 0.00037 15.9 3.5 26 244-269 172-197 (325) 135 PRK09589 celA 6-phospho-beta-g 52.3 12 0.00031 16.5 2.5 58 74-131 73-134 (479) 136 KOG2303 consensus 52.1 15 0.00038 15.8 3.3 26 17-42 347-372 (706) 137 TIGR03356 BGL beta-galactosida 51.8 11 0.00029 16.7 2.2 58 74-131 60-120 (427) 138 PRK09593 arb 6-phospho-beta-gl 51.2 13 0.00034 16.2 2.5 57 74-130 79-139 (478) 139 TIGR02407 ectoine_ectB diamino 51.2 16 0.0004 15.7 4.2 23 65-87 212-234 (413) 140 PRK13511 6-phospho-beta-galact 51.0 13 0.00034 16.2 2.5 57 74-130 60-119 (470) 141 cd07025 Peptidase_S66 LD-Carbo 50.4 16 0.00041 15.7 4.3 24 23-46 3-26 (282) 142 PRK09852 cryptic 6-phospho-bet 47.4 17 0.00043 15.5 2.6 58 74-131 77-138 (474) 143 TIGR01859 fruc_bis_ald_ fructo 47.4 18 0.00045 15.3 4.1 153 17-191 38-232 (339) 144 pfam00232 Glyco_hydro_1 Glycos 47.0 15 0.00038 15.9 2.3 57 75-131 65-124 (454) 145 COG0362 Gnd 6-phosphogluconate 46.6 17 0.00042 15.5 2.4 77 3-91 52-130 (473) 146 COG0794 GutQ Predicted sugar p 46.6 18 0.00046 15.3 4.0 22 67-88 99-120 (202) 147 KOG2644 consensus 46.6 18 0.00046 15.3 5.2 22 168-189 208-229 (282) 148 pfam01182 Glucosamine_iso Gluc 46.5 18 0.00046 15.3 8.5 102 9-113 7-113 (236) 149 KOG2316 consensus 45.2 19 0.00048 15.1 12.3 63 21-93 2-76 (277) 150 pfam08497 Radical_SAM_N Radica 45.2 16 0.00042 15.6 2.2 20 70-89 35-54 (298) 151 pfam10309 DUF2414 Protein of u 44.7 16 0.00041 15.6 2.1 34 166-200 10-50 (62) 152 cd01568 QPRTase_NadC Quinolina 43.7 20 0.00051 15.0 3.3 14 73-86 47-61 (269) 153 PRK09287 6-phosphogluconate de 43.7 20 0.00051 15.0 7.1 79 3-91 39-117 (459) 154 PRK00162 glpE thiosulfate sulf 43.0 20 0.00052 14.9 5.1 50 5-62 44-93 (104) 155 TIGR00853 pts-lac PTS system, 41.8 21 0.00054 14.8 6.9 71 15-88 42-125 (142) 156 COG0299 PurN Folate-dependent 41.5 22 0.00055 14.7 7.4 85 21-126 2-87 (200) 157 cd03115 SRP The signal recogni 40.3 22 0.00057 14.6 6.7 59 22-88 4-62 (173) 158 PRK11498 bcsA cellulose syntha 39.1 23 0.0006 14.5 3.1 51 47-104 300-350 (858) 159 PRK13937 phosphoheptose isomer 38.1 24 0.00062 14.4 5.6 51 9-60 32-84 (192) 160 TIGR00693 thiE thiamine-phosph 38.0 24 0.00062 14.4 2.9 23 66-88 51-73 (210) 161 smart00022 PLAc Cytoplasmic ph 37.9 24 0.00062 14.4 3.4 25 7-31 51-87 (549) 162 TIGR01198 pgl 6-phosphoglucono 37.8 24 0.00062 14.3 5.3 105 7-115 14-127 (251) 163 pfam08727 P3A Poliovirus 3A pr 37.4 22 0.00057 14.6 1.9 30 169-202 24-53 (57) 164 cd07203 cPLA2_Fungal_PLB Funga 36.9 25 0.00064 14.3 3.4 14 20-33 63-76 (552) 165 PRK10162 acetyl esterase; Prov 36.4 26 0.00065 14.2 6.6 14 171-184 215-228 (319) 166 PRK13832 plasmid partitioning 35.8 20 0.0005 15.0 1.4 24 179-202 214-240 (518) 167 pfam00448 SRP54 SRP54-type pro 35.7 26 0.00067 14.1 6.3 60 22-89 5-64 (196) 168 pfam01268 FTHFS Formate--tetra 35.1 27 0.00068 14.1 6.1 19 127-145 302-320 (555) 169 PRK04165 acetyl-CoA decarbonyl 35.1 27 0.00068 14.1 7.6 131 13-146 85-227 (454) 170 TIGR01277 thiQ thiamine ABC tr 35.0 24 0.00061 14.4 1.8 115 15-133 20-194 (213) 171 pfam01188 MR_MLE Mandelate rac 35.0 26 0.00066 14.2 1.9 21 177-197 64-84 (98) 172 pfam00926 DHBP_synthase 3,4-di 34.9 27 0.00069 14.0 7.5 105 74-190 57-185 (193) 173 PRK05773 3,4-dihydroxy-2-butan 34.8 27 0.00069 14.0 3.9 20 171-190 189-208 (218) 174 pfam00582 Usp Universal stress 34.6 27 0.0007 14.0 9.4 95 20-130 3-114 (140) 175 COG0363 NagB 6-phosphogluconol 34.5 27 0.0007 14.0 5.3 66 19-85 31-99 (238) 176 cd02651 nuc_hydro_IU_UC_XIUA n 34.4 28 0.0007 14.0 6.0 52 28-86 9-62 (302) 177 cd03557 L-arabinose_isomerase 34.3 28 0.0007 14.0 3.6 42 166-207 209-250 (484) 178 PRK00414 gmhA phosphoheptose i 33.8 28 0.00072 13.9 6.7 50 7-57 32-83 (192) 179 cd00147 cPLA2_like Cytosolic p 33.6 28 0.00072 13.9 3.3 25 7-31 24-53 (438) 180 PTZ00249 variable surface prot 33.6 25 0.00064 14.2 1.7 14 186-199 147-160 (516) 181 pfam10829 DUF2554 Protein of u 33.5 28 0.00072 13.9 3.0 24 52-75 29-53 (76) 182 PRK09762 galactosamine-6-phosp 33.5 28 0.00072 13.9 6.3 74 12-85 20-97 (236) 183 PRK12358 putative 6-phosphoglu 33.4 28 0.00073 13.9 7.3 68 10-79 18-85 (239) 184 PTZ00285 glucosamine-6-phospha 33.1 29 0.00073 13.8 8.3 91 17-111 30-125 (253) 185 cd01582 Homoaconitase Homoacon 33.1 29 0.00073 13.8 4.3 34 52-85 27-63 (363) 186 TIGR00269 TIGR00269 conserved 32.8 11 0.00029 16.7 -0.2 50 165-214 3-52 (106) 187 cd00958 DhnA Class I fructose- 32.7 29 0.00074 13.8 5.8 40 51-90 90-131 (235) 188 pfam01890 CbiG_C Cobalamin syn 31.8 30 0.00077 13.7 4.5 67 21-93 3-70 (121) 189 PRK08813 threonine dehydratase 31.2 31 0.00079 13.6 3.2 23 20-42 81-103 (349) 190 PRK09197 fructose-bisphosphate 30.8 31 0.0008 13.6 7.3 71 17-92 43-137 (349) 191 PRK10264 hydrogenase 1 maturat 30.6 32 0.0008 13.6 3.1 155 8-211 25-191 (195) 192 PRK00955 hypothetical protein; 30.4 32 0.00081 13.5 2.2 108 18-142 64-180 (599) 193 cd03326 MR_like_1 Mandelate ra 30.0 32 0.00082 13.5 2.4 34 176-209 223-256 (385) 194 TIGR03030 CelA cellulose synth 30.0 32 0.00082 13.5 2.5 81 22-104 135-226 (713) 195 PRK05634 nucleosidase; Provisi 29.8 33 0.00083 13.5 3.4 27 169-195 132-163 (188) 196 pfam03205 MobB Molybdopterin g 29.7 33 0.00083 13.5 4.6 37 24-62 3-41 (122) 197 PRK07008 acyl-CoA synthetase; 29.4 33 0.00084 13.4 3.3 47 9-64 52-100 (539) 198 TIGR01735 FGAM_synt phosphorib 29.4 12 0.0003 16.6 -0.6 127 55-209 87-213 (1401) 199 cd03324 rTSbeta_L-fuconate_deh 29.3 33 0.00085 13.4 2.6 33 176-208 258-290 (415) 200 cd03315 MLE_like Muconate lact 28.4 34 0.00087 13.3 1.9 32 177-208 148-179 (265) 201 TIGR02793 nikR nickel-responsi 28.4 30 0.00076 13.8 1.3 21 274-294 18-38 (130) 202 pfam07942 N2227 N2227-like pro 28.1 35 0.00089 13.3 4.0 74 119-203 171-249 (268) 203 PRK08385 nicotinate-nucleotide 27.8 35 0.00089 13.2 2.7 21 70-90 47-67 (279) 204 COG1440 CelA Phosphotransferas 27.4 36 0.00091 13.2 6.7 69 20-89 2-81 (102) 205 COG0108 RibB 3,4-dihydroxy-2-b 27.3 36 0.00091 13.2 6.8 105 74-190 62-190 (203) 206 PRK13520 L-tyrosine decarboxyl 27.2 36 0.00091 13.2 7.4 72 7-91 67-139 (375) 207 pfam01116 F_bP_aldolase Fructo 27.1 36 0.00092 13.2 7.9 98 7-117 27-131 (283) 208 PRK12678 transcription termina 27.1 24 0.00061 14.4 0.6 47 22-82 15-61 (667) 209 cd06381 PBP1_iGluR_delta_like 26.9 36 0.00092 13.1 5.3 72 7-90 15-95 (363) 210 cd03298 ABC_ThiQ_thiamine_tran 26.6 37 0.00094 13.1 2.0 24 16-40 21-45 (211) 211 KOG1907 consensus 26.5 23 0.00059 14.5 0.5 17 21-37 784-800 (1320) 212 COG3111 Periplasmic protein wi 26.4 37 0.00095 13.1 3.2 61 277-341 36-96 (128) 213 pfam01156 IU_nuc_hydro Inosine 26.1 38 0.00096 13.0 6.0 54 21-82 2-56 (303) 214 cd00477 FTHFS Formyltetrahydro 26.0 38 0.00096 13.0 5.6 21 125-145 284-304 (524) 215 cd06368 PBP1_iGluR_non_NMDA_li 25.9 38 0.00096 13.0 5.7 75 7-92 15-97 (324) 216 COG0157 NadC Nicotinate-nucleo 25.7 38 0.00097 13.0 3.3 12 70-81 50-61 (280) 217 cd00453 FTBP_aldolase_II Fruct 25.3 39 0.00099 12.9 7.5 138 7-149 24-203 (340) 218 PRK07313 phosphopantothenoylcy 25.3 39 0.00099 12.9 4.7 35 19-57 2-36 (180) 219 pfam02441 Flavoprotein Flavopr 24.9 39 0.001 12.9 4.9 33 20-56 1-33 (118) 220 pfam02016 Peptidase_S66 LD-car 24.9 39 0.001 12.9 4.3 60 23-86 3-66 (281) 221 CHL00202 argB acetylglutamate 24.8 40 0.001 12.9 7.2 158 2-211 1-165 (284) 222 COG1377 FlhB Flagellar biosynt 24.8 40 0.001 12.9 1.6 15 385-399 308-324 (363) 223 cd03329 MR_like_4 Mandelate ra 24.1 41 0.001 12.8 1.5 31 179-209 210-240 (368) 224 COG0452 Dfp Phosphopantothenoy 24.1 41 0.001 12.8 4.5 35 20-58 5-39 (392) 225 TIGR01136 cysKM cysteine synth 24.0 41 0.001 12.8 5.1 17 298-314 178-194 (315) 226 TIGR01162 purE phosphoribosyla 23.8 41 0.0011 12.7 4.3 51 64-123 7-59 (159) 227 TIGR02322 phosphon_PhnN phosph 23.8 41 0.0011 12.7 3.4 131 22-162 5-166 (183) 228 COG1751 Uncharacterized conser 23.4 42 0.0011 12.7 5.4 60 19-86 28-88 (186) 229 TIGR01121 D_amino_aminoT D-ami 23.3 42 0.0011 12.7 2.0 27 169-195 202-228 (278) 230 TIGR03468 HpnG hopanoid-associ 23.3 42 0.0011 12.7 4.8 28 171-198 131-163 (212) 231 PRK06633 acetyl-CoA acetyltran 23.3 37 0.00093 13.1 1.0 31 51-83 78-108 (392) 232 TIGR01393 lepA GTP-binding pro 23.3 39 0.00099 12.9 1.1 87 300-390 192-297 (598) 233 PRK11706 TDP-4-oxo-6-deoxy-D-g 23.2 42 0.0011 12.7 1.6 25 174-198 135-159 (375) 234 PRK11175 universal stress prot 23.2 42 0.0011 12.7 8.8 145 19-198 3-167 (304) 235 PRK09409 insertion element IS2 23.1 43 0.0011 12.6 5.8 18 172-189 203-220 (301) 236 cd01561 CBS_like CBS_like: Thi 22.8 43 0.0011 12.6 3.2 55 18-87 51-105 (291) 237 cd04191 Glucan_BSP_ModH Glucan 22.6 43 0.0011 12.6 5.5 79 41-125 25-110 (254) 238 pfam00763 THF_DHG_CYH Tetrahyd 22.1 44 0.0011 12.5 4.8 40 50-91 29-68 (117) 239 PRK07261 topology modulation p 22.1 44 0.0011 12.5 5.5 53 22-78 4-57 (171) 240 pfam12268 DUF3612 Protein of u 22.0 45 0.0011 12.5 1.6 26 173-198 133-158 (178) 241 pfam10686 DUF2493 Protein of u 22.0 45 0.0011 12.5 7.2 63 21-92 3-68 (71) 242 cd01444 GlpE_ST GlpE sulfurtra 21.9 45 0.0011 12.5 4.9 35 6-40 43-77 (96) 243 pfam03848 TehB Tellurite resis 21.5 45 0.0012 12.4 8.1 144 5-185 18-163 (192) 244 PRK05702 flhB flagellar biosyn 21.5 46 0.0012 12.4 1.7 23 177-201 105-127 (352) 245 cd06391 PBP1_iGluR_delta_2 N-t 21.5 46 0.0012 12.4 5.1 74 6-91 14-96 (400) 246 COG3962 Acetolactate synthase 21.5 30 0.00076 13.7 0.3 26 171-196 243-268 (617) 247 PRK06115 dihydrolipoamide dehy 21.2 46 0.0012 12.4 6.6 16 175-190 219-234 (466) 248 PRK07116 flavodoxin; Provision 21.1 46 0.0012 12.4 5.0 21 73-93 18-38 (158) 249 cd01523 RHOD_Lact_B Member of 21.0 47 0.0012 12.4 5.9 46 8-62 50-95 (100) 250 TIGR02211 LolD_lipo_ex lipopro 21.0 47 0.0012 12.4 3.2 109 16-128 28-202 (221) 251 PRK09108 type III secretion sy 20.7 47 0.0012 12.3 1.7 24 176-201 99-122 (354) 252 COG1660 Predicted P-loop-conta 20.7 47 0.0012 12.3 5.2 15 22-36 5-19 (286) 253 TIGR03588 PseC UDP-4-keto-6-de 20.7 47 0.0012 12.3 1.7 25 174-198 137-161 (380) 254 cd07985 LPLAT_GPAT Lysophospho 20.3 48 0.0012 12.3 2.6 13 118-130 21-33 (235) 255 PRK12721 secretion system appa 20.3 48 0.0012 12.3 1.8 23 177-201 98-120 (349) 256 PRK13109 flhB flagellar biosyn 20.2 48 0.0012 12.3 1.7 21 179-201 109-129 (358) 257 cd00458 SugarP_isomerase Sugar 20.0 49 0.0012 12.2 6.0 101 10-115 9-113 (169) No 1 >PRK10660 tilS tRNA(Ile)-lysidine synthetase; Provisional Probab=100.00 E-value=0 Score=466.02 Aligned_cols=391 Identities=21% Similarity=0.256 Sum_probs=272.5 Q ss_pred HHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCC Q ss_conf 85279983999966942489999999999986489972999999667798784689999999998718988999975047 Q gi|254780546|r 14 RSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSK 93 (423) Q Consensus 14 ~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~ 93 (423) ++|..+++|+||||||+|||||||+++.+..+. ++++++|+|||||||++|++|++||+++|+++|||+++.+++... T Consensus 8 ~~~~~~~~ilvavSGG~DS~~LL~~L~~l~~~~--~~~~l~~~HvnHgl~~~ad~~~~fv~~~c~~~~ip~~~~~~~v~~ 85 (433) T PRK10660 8 RQLLESRQILVAFSGGLDSTVLLHQLVQWRTQN--PGVTLRAIHIHHGLSPNADSWVKHCEQVCQQWQVPLVVERVQLAQ 85 (433) T ss_pred HHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHC--CCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEECC T ss_conf 646999989999728099999999999999866--898289999718979266999999999999759978999987279 Q ss_pred CCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCHHCCC Q ss_conf 87780378885455554320012344201234566656689999862365444332235764100357968710121042 Q gi|254780546|r 94 PQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWISRPFLRCR 173 (423) Q Consensus 94 ~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~RPLL~~~ 173 (423) .+.++|++||.+||.++.+.+.+. .+++||||+|||+||++||+.||+|+.|+ +||++.. ..+++.++||||.++ T Consensus 86 ~~~~~E~aAR~~RY~~f~~~~~~~--~~l~tAHh~dDQaETvLlrL~RGsG~~GL--~gm~~~r-~~~~~~liRPLL~~~ 160 (433) T PRK10660 86 EGLGIEAAARQARYQAFARTLLPG--EVLVTAQHLDDQCETFLLALKRGSGPAGL--SAMAEVS-PFAGTQLLRPLLART 160 (433) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHC--CEEEECCCCCHHHHHHHHHHHCCCCCCCC--CCCCCCC-CCCCCEEECCCHHHH T ss_conf 997699999999999999987438--87996245651999999998658996445--6775224-478970773744665 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHC--CCCCC Q ss_conf 899999899818932026678642210189997643453100---6677999999999979999999998840--04499 Q gi|254780546|r 174 REDIRSFLLQRNISWCEDPSNTDDRFERVRVRRFVRDIDLHA---LYLKMKKFQDLRVKVNNAVAMLIPKYLT--VHMRS 248 (423) Q Consensus 174 r~~l~~~~~~~~i~wveDpSN~d~~f~R~rlR~~l~~l~~~~---l~~~~~~~~~~~~~l~~~~~~~~~~~~~--~~~~~ 248 (423) |+||++||+++||+|+|||||.|++|+||++|+.+.|.+.+. +.+...+++......+....+++...+. ....+ T Consensus 161 r~eI~~Y~~~~~l~~~eD~SN~d~~y~RN~iR~~vlP~L~~~~P~~~~~l~r~a~l~~e~~~~l~~~~~~~~~~~~~~~~ 240 (433) T PRK10660 161 RGELEQWAQAHGLRWIEDESNQDDRYDRNFLRLRVLPLLQQRWPHFAEATARSAALCAEQEQLLDELLAEDLAHCQTPDG 240 (433) T ss_pred HHHHHHHHHHCCCCCEECCCCCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 99999999984998038999997611259999997899998780699999999999999999999999999998646267 Q ss_pred EEEECHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH---HCCCCCEEECCCEEEEEECCEEEEEECCCCC Q ss_conf 7998477833755789899999999983589999889999999998---6699845610757999889989998453437 Q gi|254780546|r 249 IIAISQDILNIDSTLLFYLLRVSAAICGGQISLPGYRSMERVMLFL---KSGKRGCVSIGRVVIDRRANFLWITRAVRNL 325 (423) Q Consensus 249 ~~~~~~~~~~~~~~~~~r~l~~~l~~~~g~~~~p~~~~l~~ll~~l---~~~~~~~~tl~g~~i~~~~~~l~i~RE~~~~ 325 (423) .. ....+..++...+.++++.|+...+ ..+|+...+.+++..+ ..+..+.+.+++..++++++.+|+....... T Consensus 241 ~l-~~~~l~~l~~~~~~~llr~wl~~~~--~~~p~~~~l~~i~~~~~~~~~d~~~~l~~~~~~vr~y~~~l~~~~~~~~~ 317 (433) T PRK10660 241 TL-QIDPLLAMSDARRAAILRRWLAGQN--APMPSRDALQRIWQEVALAREDASPCLRLGAFEIRRYQSQLWWIKSVAGQ 317 (433) T ss_pred CC-CHHHHHHCCHHHHHHHHHHHHHHCC--CCCCCHHHHHHHHHHHHHHCCCCCCEEEECCEEEEEECCEEEEECCCCCC T ss_conf 23-3376775799999999999999647--99985999999999998516688827998898999987889995256665 Q ss_pred CCCCC----CCCCCEEECCCEEEECCCCCEEE-ECCCCCCCCCCCCCCC-CCCCHHHHHHCCHHHCCCEEECCCCCCCCH Q ss_conf 75523----68982320784175127775035-2167544254201231-135546776271242797661230068881 Q gi|254780546|r 326 PTLIL----YPEETTVWDGRYQFQNLSDSLIQ-IGPQSYRKADVPSGIP-PIIAQRALSSMPSKEGGKPLLAPFSRFMTK 399 (423) Q Consensus 326 ~~~~~----~~~~~~~wDgR~~i~~~~~~~~~-l~~~g~~~~~~~~~~~-~~~~~~~l~~lP~~~~g~~liaP~~r~~~~ 399 (423) ....+ ......+.+|.+.+....+.... ..+.....+.++...+ ....+.+.+.++.++.+.+ ||||.|..-| T Consensus 318 ~~~~~~~~~~~~~l~lp~~~g~~~~~~~~~~~~~~~~~~l~iR~r~~Gd~~~~g~~gsKkLKk~f~d~k-IP~~~R~~iP 396 (433) T PRK10660 318 SETIVPWPTWLQPLELPAGLGSVQLVAGGDIRPPRADEAVSVRFKAPGLLHIVGRNGGRKLKKIWQELG-IPPWLRDTTP 396 (433) T ss_pred CCCCCCCCCCCCCEECCCCCCEEEECCCCCCCCCCCCCCEEEEECCCCCEEECCCCCCHHHHHHHHHCC-CCHHHHCCCC T ss_conf 555434567786266369971799766754256678995799817997578789999805999999849-8989967868 Q ss_pred H---HHHHHHHHHHHHCCC Q ss_conf 1---214888888874557 Q gi|254780546|r 400 F---DLPVAYAFSISFGKV 415 (423) Q Consensus 400 ~---d~~~~~~~~~~~~~~ 415 (423) . +.++++......+.. T Consensus 397 Ll~~~~~ivwv~Gv~vs~~ 415 (433) T PRK10660 397 LLFYGETLIAAAGVFVTQE 415 (433) T ss_pred EEEECCEEEEEECEEECCC T ss_conf 8999999999916568434 No 2 >TIGR02432 lysidine_TilS_N tRNA(Ile)-lysidine synthetase; InterPro: IPR012795 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . The only examples in which the wobble position of a tRNA must discriminate between G and A of mRNA are AUA (Ile) vs. AUG (Met) and UGA (stop) vs. UGG (Trp). In all bacteria, the wobble position of the tRNA(Ile) recognizing AUA is lysidine, a lysine derivative of cytidine. This domain is found, apparently, in all bacteria in a single copy. Eukaryotic sequences appear to be organellar. The domain architecture of this protein is variable; some, including characterised proteins of Escherichia coli and Bacillus subtilis known to be tRNA(Ile)-lysidine synthetase, include a conserved 50-residue domain that many other members lack. This protein belongs to the ATP-binding PP-loop family. It appears in the literature and protein databases as TilS, YacA, and putative cell cycle protein MesJ (a misnomer).; GO: 0000166 nucleotide binding, 0005524 ATP binding, 0016879 ligase activity forming carbon-nitrogen bonds, 0008033 tRNA processing, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=404.97 Aligned_cols=188 Identities=36% Similarity=0.510 Sum_probs=171.0 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCC----CEEEEEEEECCCCCCCHHH-HHHHHHHHHHHCCCCEEEEE-EECCCC Q ss_conf 39999669424899999999999864899----7299999966779878468-99999999987189889999-750478 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFG----KIKFSAISVDHCLRETAKD-EVRYVSDVCSRLRIAHSVVS-WKNSKP 94 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~----~~~l~a~~vdHglr~~s~~-e~~~v~~~~~~lgi~~~~~~-~~~~~~ 94 (423) +|+||||||+|||||||++.++....... .+++.|+||||++|++|++ |+++|+++|++++|++++.+ +++... T Consensus 1 ~~lvAvSGG~DS~aLL~~L~~~~~~~~~~~~~~~~~~~a~h~nH~lR~~s~~~~~~~v~~~c~~~~~~~~~~~~~~~~~~ 80 (204) T TIGR02432 1 KILVAVSGGVDSMALLHLLLKLQPKLKISWQIKDFKLIAAHVNHGLRPESAQEEAEFVQQFCEKLNIPLEIKKLVDVKAL 80 (204) T ss_pred CEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCC T ss_conf 97888628642799999999976632787788750689999617788415899999999999961895699842124110 Q ss_pred ----CCCCHHHHHHHHHHHHHHHCCCCC-CCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC-----CCCE Q ss_conf ----778037888545555432001234-420123456665668999986236544433223576410035-----7968 Q gi|254780546|r 95 ----QTGLMAAAREARYALISEHAKTIN-ATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYD-----LNLW 164 (423) Q Consensus 95 ----~~~~~~~ar~~r~~~~~~~~~~~~-~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~-----~~~~ 164 (423) +.++|++||..||.++.+.+.+.+ +.+|+||||+|||+||++||+.||+|+.|+ +||.+..... .++. T Consensus 81 ~~~~~~~~E~~AR~~RY~~f~~~~~~~~~~~~i~tAHh~dDq~ET~L~rL~RG~~~~Gl--~g~~~~~~~~~~g~~~~~~ 158 (204) T TIGR02432 81 AKGKKKNLEEAAREARYAFFEEIAKKHGKADYILTAHHADDQAETILMRLLRGSGLRGL--SGMPEIRPLGSLGWYKGGQ 158 (204) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHCCCCCHH--HHHCCCCCCCCCCCCCCEE T ss_conf 12247677899999999999999997299408997248322799999987516870001--1101336633244225632 Q ss_pred EECCHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 7101210428999998998189320266786422101899976434 Q gi|254780546|r 165 ISRPFLRCRREDIRSFLLQRNISWCEDPSNTDDRFERVRVRRFVRD 210 (423) Q Consensus 165 i~RPLL~~~r~~l~~~~~~~~i~wveDpSN~d~~f~R~rlR~~l~~ 210 (423) |+||||.++|+||..||++++|+|+|||||.|.+|+|||+|++|.| T Consensus 159 ~~RPLL~~~k~ei~~y~~~~~l~~~eD~tN~~~~y~RNriR~~~~P 204 (204) T TIGR02432 159 IIRPLLNISKSEIEEYLKENGLPYFEDETNQDDKYLRNRIRHELLP 204 (204) T ss_pred EEECCCCCCHHHHHHHHHHCCCCEEECCCCCCCCCCHHHHHHHHCC T ss_conf 7707756778999999996689656578889876205899976169 No 3 >cd01992 PP-ATPase N-terminal domain of predicted ATPase of the PP-loop faimly implicated in cell cycle control [Cell division and chromosome partitioning]. This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This domain has a strongly conserved motif SGGXD at the N terminus. Probab=100.00 E-value=0 Score=361.02 Aligned_cols=184 Identities=40% Similarity=0.591 Sum_probs=169.0 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECC-CCCCCCH Q ss_conf 399996694248999999999998648997299999966779878468999999999871898899997504-7877803 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNS-KPQTGLM 99 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~-~~~~~~~ 99 (423) ||+||||||+|||+||++++++.... +.++.|+|||||+|++|.+|+++|+++|+++||++++..++.. .++.++| T Consensus 1 ki~vavSGG~DS~~Ll~~l~~~~~~~---~~~l~a~hvdh~lr~~s~~~~~~v~~~~~~~~i~~~i~~~~~~~~~~~~~e 77 (185) T cd01992 1 KILVAVSGGPDSMALLHLLSELKPRL---GLRLVAVHVDHGLRPESDEEAAFVADLCAKLGIPLYILVVALAPKPGGNLE 77 (185) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHC---CCCEEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCHH T ss_conf 99999678499999999999999974---994899998189888888999999999998599889999775367899999 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCHHCCCHHHHHH Q ss_conf 78885455554320012344201234566656689999862365444332235764100357968710121042899999 Q gi|254780546|r 100 AAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWISRPFLRCRREDIRS 179 (423) Q Consensus 100 ~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~RPLL~~~r~~l~~ 179 (423) +.||..||..+...|++.++..++||||+|||+||++|++.||+|+.|+ ++|.+. ....++.++||||.++|+||++ T Consensus 78 ~~aR~~Ry~~l~~~~~~~~~~~i~lgHh~dD~~ET~lm~l~rg~~~~gl--~gm~~~-~~~~~~~iiRPLL~~~k~ei~~ 154 (185) T cd01992 78 AAAREARYDFFAEIAKEHGADVLLTAHHADDQAETVLMRLLRGSGLRGL--AGMPAR-IPFGGGRLIRPLLGITRAEIEA 154 (185) T ss_pred HHHHHHHHHHHHHHHHHHCCCCEEECCCHHHHHHHHHHHHHCCCCCCCC--CCCCCC-CCCCCCEEEEHHHHHHHHHHHH T ss_conf 9999999999999998735450420363036899999998718996415--277841-3579952871577853999999 Q ss_pred HHHHHCCCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 8998189320266786422101899976434 Q gi|254780546|r 180 FLLQRNISWCEDPSNTDDRFERVRVRRFVRD 210 (423) Q Consensus 180 ~~~~~~i~wveDpSN~d~~f~R~rlR~~l~~ 210 (423) ||+.++++|++||||.|++|+||++|+.+.| T Consensus 155 ~~~~~~i~~~~D~sN~d~~y~Rn~iR~~iiP 185 (185) T cd01992 155 YLRENGLPWWEDPSNEDPRYTRNRIRHELLP 185 (185) T ss_pred HHHHCCCCEEECCCCCCCCCCHHHHHHHHCC T ss_conf 9998499567990879987639999746179 No 4 >pfam01171 ATP_bind_3 PP-loop family. This family of proteins belongs to the PP-loop superfamily. Probab=100.00 E-value=0 Score=359.36 Aligned_cols=181 Identities=39% Similarity=0.570 Sum_probs=167.2 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECC-CCCCCCH Q ss_conf 399996694248999999999998648997299999966779878468999999999871898899997504-7877803 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNS-KPQTGLM 99 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~-~~~~~~~ 99 (423) ||+||||||+|||+|||+++.+.... ++++.++|||||+|++|++|+++|+++|+++|||+++.+++.. .++.++| T Consensus 1 ki~vavSGG~DS~~Ll~~l~~~~~~~---~~~~~~~hvnh~lr~~s~~~~~~v~~~~~~~~i~~~i~~~~~~~~~~~~~e 77 (182) T pfam01171 1 KILVAVSGGPDSMALLYLLKKLKPKF---GIDLTAAHVDHGLREESDREAQFVKELCRQLNIPLEVLRVDVAKKSGLNLE 77 (182) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHC---CCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHH T ss_conf 99999678599999999999999975---997899998799865405899999999998599759999734766787775 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCHHCCCHHHHHH Q ss_conf 78885455554320012344201234566656689999862365444332235764100357968710121042899999 Q gi|254780546|r 100 AAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWISRPFLRCRREDIRS 179 (423) Q Consensus 100 ~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~RPLL~~~r~~l~~ 179 (423) +.||..||..+...+.+.++..++||||+|||+||++|+++||+|+.|+ ++|.+.. ...+++++||||.++|+||++ T Consensus 78 ~~aR~~Ry~~l~~~a~~~~~~~i~lgHh~DD~~ET~lm~l~rG~~~~gl--~gm~~~~-~~~~~~iiRPLl~~~k~ei~~ 154 (182) T pfam01171 78 EAAREARYDFFEEIAKKNGAEVLLTAHHADDQAETFLMRLLRGSGLAGL--AGIAPVR-PLAGGRIVRPLLKVTKSEIEE 154 (182) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCCCCC--CCCCCCC-CCCCCEEECCCCCCCHHHHHH T ss_conf 7899999999999898617664887434232999999999728981103--3887513-468943870211488999999 Q ss_pred HHHHHCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 8998189320266786422101899976 Q gi|254780546|r 180 FLLQRNISWCEDPSNTDDRFERVRVRRF 207 (423) Q Consensus 180 ~~~~~~i~wveDpSN~d~~f~R~rlR~~ 207 (423) ||+.++++|++||||.|++|+||++|++ T Consensus 155 ~a~~~~l~~~~D~sN~d~~~~Rn~iR~e 182 (182) T pfam01171 155 YLKEHGIPWVEDESNADLRYTRNRIRHE 182 (182) T ss_pred HHHHCCCCEEECCCCCCCCCCHHHHCCC T ss_conf 9998699368990889987643676488 No 5 >cd01993 Alpha_ANH_like_II This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins is predicted to bind ATP. This domainhas a strongly conserved motif SGGKD at the N terminus. Probab=100.00 E-value=1.7e-40 Score=305.39 Aligned_cols=175 Identities=23% Similarity=0.271 Sum_probs=149.7 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCH- Q ss_conf 3999966942489999999999986489972999999667798784689999999998718988999975047877803- Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSKPQTGLM- 99 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~~~~~~~- 99 (423) ||+||||||+|||+||++++.|.+.. ..++++.++|||||+|++|++|+++|+++|+.++|++++..+.......... T Consensus 1 kilva~SGG~DS~~Ll~ll~~~~~~~-~~~~~l~~~hvnh~~r~~s~~~~~~v~~~~~~~~i~~~~~~~~~~~~~~~~~~ 79 (185) T cd01993 1 RILVALSGGKDSLVLLHVLKKLQRRY-PYGFELEALTVDEGIPGYRDESLEVVERLAEELGIELEIVSFKEEYTDDIEVK 79 (185) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHC-CCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCCCHHHH T ss_conf 98999658499999999999988864-88956999995699888889999999998996299359987640367638999 Q ss_pred --------HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCHHC Q ss_conf --------788854555543200123442012345666566899998623654443322357641003579687101210 Q gi|254780546|r 100 --------AAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWISRPFLR 171 (423) Q Consensus 100 --------~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~RPLL~ 171 (423) ..+++.||..+.+.+.+.++..++||||+|||+||++|+++||++..|++ +++......++++++||||. T Consensus 80 ~~~~~~~c~~c~r~Rr~~l~~~~~~~~~~~i~~gHh~dD~~ET~l~~l~rg~~~~~~~--~~~~~~~~~~~i~iiRPLL~ 157 (185) T cd01993 80 KRGGKSPCSLCGVLRRGLLNKIAKELGADKLATGHNLDDEAETLLMNLLRGGILRLMR--PGPILYLDEGDVTRIRPLVY 157 (185) T ss_pred HHHHCCHHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHCCCCHHHCC--CCCCCCCCCCCEEEEECCCC T ss_conf 9864063468999999999999998199889753307689999999998489700056--76633356898479954988 Q ss_pred CCHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 428999998998189320266786422 Q gi|254780546|r 172 CRREDIRSFLLQRNISWCEDPSNTDDR 198 (423) Q Consensus 172 ~~r~~l~~~~~~~~i~wveDpSN~d~~ 198 (423) ++|+||++||++++++|++||||.+.. T Consensus 158 ~~k~ei~~y~~~~~l~~~eD~~~~~~~ 184 (185) T cd01993 158 VREKEIVLYAELNGLPFVEEECPYAGN 184 (185) T ss_pred CCHHHHHHHHHHCCCCEEECCCCCCCC T ss_conf 989999999998799878899988989 No 6 >COG0037 MesJ tRNA(Ile)-lysidine synthase MesJ [Cell cycle control, cell division, chromosome partitioning] Probab=100.00 E-value=2.2e-39 Score=297.61 Aligned_cols=190 Identities=34% Similarity=0.469 Sum_probs=172.3 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCC--- Q ss_conf 98399996694248999999999998648997299999966779878468999999999871898899997504787--- Q gi|254780546|r 19 PAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSKPQ--- 95 (423) Q Consensus 19 ~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~~~--- 95 (423) +.+|+||||||+||++||++++++... +++.++|||||+++.|+.+.+.|+.+|+.+|+++.+.+++..... T Consensus 21 ~~~ilVavSGGkDS~~ll~~L~~l~~~-----~~~~a~~Vd~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~ 95 (298) T COG0037 21 EYKILVAVSGGKDSLALLHLLKELGRR-----IEVEAVHVDHGLRGYSDQEAELVEKLCEKLGIPLIVERVTDDLGRETL 95 (298) T ss_pred CCCEEEEECCCHHHHHHHHHHHHHCCC-----CEEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEEEHHHHHHHCCC T ss_conf 785899937878999999999984225-----738999970898864328999999999964998488620454310024 Q ss_pred --CCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCHHCCC Q ss_conf --780378885455554320012344201234566656689999862365444332235764100357968710121042 Q gi|254780546|r 96 --TGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWISRPFLRCR 173 (423) Q Consensus 96 --~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~RPLL~~~ 173 (423) .++++.||..||..+...+.+.|+.+++||||+|||+||++|++.+|++..+ +.+|++.....++..++|||+.++ T Consensus 96 ~~~~~c~~c~~~R~~~l~~~a~~~g~~~i~tgH~~dD~~et~lm~l~~g~~~~~--l~~~~~~~~~~~~~~~iRPL~~~~ 173 (298) T COG0037 96 DGKSICAACRRLRRGLLYKIAKELGADKIATGHHLDDQAETFLMNLLRGSGLRG--LRGMPPKRPFEGGLLIIRPLLYVR 173 (298) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHCCCCHHH--HHCCCCCCCCCCCCEEECCCCCCC T ss_conf 676787999999999999999985999898567846899999999861752245--640885244578864547574688 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 899999899818932026678642210189997643453100 Q gi|254780546|r 174 REDIRSFLLQRNISWCEDPSNTDDRFERVRVRRFVRDIDLHA 215 (423) Q Consensus 174 r~~l~~~~~~~~i~wveDpSN~d~~f~R~rlR~~l~~l~~~~ 215 (423) ++||+.|+..++++|++|+||.+.+|.||++|+.+.++.... T Consensus 174 ~~ei~~~~~~~~l~~~~d~~n~~~~~~R~~~r~~l~~~~~~~ 215 (298) T COG0037 174 EKEIELYAKEKGLPYIEDESNYDLRYTRNRIREELLPLEEER 215 (298) T ss_pred HHHHHHHHHHCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHC T ss_conf 999999999759987517888763023799999998998748 No 7 >PRK10696 C32 tRNA thiolase; Provisional Probab=100.00 E-value=3.4e-32 Score=246.70 Aligned_cols=194 Identities=22% Similarity=0.305 Sum_probs=164.0 Q ss_pred HHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEE Q ss_conf 99999998527998399996694248999999999998648997299999966779878468999999999871898899 Q gi|254780546|r 7 ESVRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSV 86 (423) Q Consensus 7 ~~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~ 86 (423) .+|..| +|+..++||+||||||+||++||++|..+++..+. +++++|+||||+.... +.+...++|+++|||+++ T Consensus 28 kAI~dy-~MIedGDRVlVglSGGKDS~~LL~iL~~Lq~~api-~FeLvAv~lD~~~pGf---~~~~L~~yl~~lGvp~~i 102 (311) T PRK10696 28 EAIADF-NMIEEGDRIMVCLSGGKDSYTMLEILRNLQQSAPI-NFSLVAVNLDQKQPGF---PEHILPEYLEKLGVEYKI 102 (311) T ss_pred HHHHHH-CCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHCCC-CCEEEEEEECCCCCCC---CHHHHHHHHHHCCCCCEE T ss_conf 999985-87789999999826788899999999999985899-8559999837899998---806889999974997079 Q ss_pred EEEECC-------CCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC- Q ss_conf 997504-------787780378885455554320012344201234566656689999862365444332235764100- Q gi|254780546|r 87 VSWKNS-------KPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTIL- 158 (423) Q Consensus 87 ~~~~~~-------~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~- 158 (423) ...+.. ..+.+.+..|++.|+..++..+++.|+..|++|||+||++||++|++++|+++. +|+|... T Consensus 103 ~~~d~~~iv~~~~~egks~CslCsRlRRg~Ly~~A~e~G~nKIALGHH~DDi~ETfLMNlf~gG~Lk-----tM~Pkl~~ 177 (311) T PRK10696 103 VEENTYGIVKEKIPEGKTTCSLCSRLRRGILYRTATELGATKIALGHHRDDILQTLFLNMFYGGKMK-----GMPPKLMS 177 (311) T ss_pred EEEEHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHHCCCCC-----CCCCEEEC T ss_conf 9840767998754368863789999999999999998699879860650558999999999668763-----67976674 Q ss_pred CCCCCEEECCHHCCCHHHHHHHHHHHCCCCCCCCCCCC---CCCCHHHHHHHHHHHH Q ss_conf 35796871012104289999989981893202667864---2210189997643453 Q gi|254780546|r 159 YDLNLWISRPFLRCRREDIRSFLLQRNISWCEDPSNTD---DRFERVRVRRFVRDID 212 (423) Q Consensus 159 ~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wveDpSN~d---~~f~R~rlR~~l~~l~ 212 (423) ..++.++||||..+++++|.+|++..++|.| |+|.. +.-.|+.+++.|.+.. T Consensus 178 d~g~~~VIRPL~y~~E~di~~~a~~~~fPiI--pcnlcgsq~~lqR~~ik~ml~~~~ 232 (311) T PRK10696 178 DDGKHIVIRPLAYCREKDIERFADAKAFPII--PCNLCGSQPNLQRQVIADMLRDWD 232 (311) T ss_pred CCCCEEEEEECCCCCHHHHHHHHHHCCCCEE--CCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 8997599830420159999999997599874--166778836789999999999999 No 8 >cd01990 Alpha_ANH_like_I This is a subfamily of Adenine nucleotide alpha hydrolases superfamily. Adenine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins probably binds ATP. This domain is about 200 amino acids long with a strongly conserved motif SGGKD at the N terminus. Probab=99.69 E-value=2.4e-16 Score=134.57 Aligned_cols=149 Identities=20% Similarity=0.264 Sum_probs=117.0 Q ss_pred EEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCC---CCCCC Q ss_conf 999966942489999999999986489972999999667798784689999999998718988999975047---87780 Q gi|254780546|r 22 ILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSK---PQTGL 98 (423) Q Consensus 22 i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~---~~~~~ 98 (423) |+||+|||+||+.|++++++... .++.++|+||++-+ ..|.+.+.++|+++||+|.++..+... -..|- T Consensus 1 V~vA~SGGVDSs~la~la~~alG------~~v~aV~~~s~~~~--~~E~e~a~~~a~~~gi~~~~i~~~~l~~~~~~~N~ 72 (202) T cd01990 1 VAVAFSGGVDSTLLLKAAVDALG------DRVLAVTATSPLFP--RRELEEAKRLAKEIGIRHEVIETDELDDPEFAKNP 72 (202) T ss_pred CEEECCCCHHHHHHHHHHHHHHC------CCEEEEEEECCCCC--HHHHHHHHHHHHHCCCCEEEEECCHHHHHHHCCCC T ss_conf 98982385999999999999768------78799997279899--78999999999863985799734111016554597 Q ss_pred HH---HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCHHC--CC Q ss_conf 37---88854555543200123442012345666566899998623654443322357641003579687101210--42 Q gi|254780546|r 99 MA---AAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWISRPFLR--CR 173 (423) Q Consensus 99 ~~---~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~RPLL~--~~ 173 (423) .+ .+....|..+...+.+.+..+++.|.+.||..|. |. |+. .. ....+..||.. ++ T Consensus 73 ~~RCy~CK~~l~~~l~~~a~~~g~~~v~dGtn~dDl~d~------RP----Gl~--A~-------~e~~v~sPL~e~gl~ 133 (202) T cd01990 73 PDRCYLCKKALYEALKEIAEELGLDVVLDGTNADDLGDY------RP----GLK--AL-------RELGVRSPLAEAGLG 133 (202) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCC------CC----CHH--HH-------HHCCCCCCHHHCCCC T ss_conf 556229989999999999996799789415757632355------78----688--99-------876998852651978 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 899999899818932026678642 Q gi|254780546|r 174 REDIRSFLLQRNISWCEDPSNTDD 197 (423) Q Consensus 174 r~~l~~~~~~~~i~wveDpSN~d~ 197 (423) |+|+|+++++.|+++++-||...- T Consensus 134 K~eVR~la~~lgLp~~~kp~~~CL 157 (202) T cd01990 134 KAEIRELARELGLPTWDKPAMACL 157 (202) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCE T ss_conf 899999999839985668998754 No 9 >COG1606 ATP-utilizing enzymes of the PP-loop superfamily [General function prediction only] Probab=99.66 E-value=2.5e-15 Score=127.44 Aligned_cols=167 Identities=20% Similarity=0.288 Sum_probs=130.5 Q ss_pred CCCCHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHC Q ss_conf 97698999999998527998399996694248999999999998648997299999966779878468999999999871 Q gi|254780546|r 1 MFLSPIESVRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRL 80 (423) Q Consensus 1 ~~~~p~~~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~l 80 (423) |.+|-.+.++.+++... +++||+|||+||+.|+.++++.+. -++.|+|||-..-+ ..+.+.+...++++ T Consensus 2 ~~~~Kl~~l~~~ik~~~---kv~vAfSGGvDSslLa~la~~~lG------~~v~AvTv~sP~~p--~~e~e~A~~~A~~i 70 (269) T COG1606 2 MLLSKLERLKKAIKEKK---KVVVAFSGGVDSSLLAKLAKEALG------DNVVAVTVDSPYIP--RREIEEAKNIAKEI 70 (269) T ss_pred CHHHHHHHHHHHHHHCC---EEEEEECCCCCHHHHHHHHHHHHC------CCEEEEEEECCCCC--HHHHHHHHHHHHHH T ss_conf 43799999999986437---399996588427999999999735------64699997168777--66606789999994 Q ss_pred CCCEEEEEEECCC-----CCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 8988999975047-----87780378885455554320012344201234566656689999862365444332235764 Q gi|254780546|r 81 RIAHSVVSWKNSK-----PQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCD 155 (423) Q Consensus 81 gi~~~~~~~~~~~-----~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~ 155 (423) ||.|.++..+... ...+-+-.+..+.|..+...+.+.|.++++-|.+++|--+ -| -|. -+. T Consensus 71 Gi~H~~i~~~~~~~~~~~n~~~rCY~CK~~v~~~l~~~a~~~Gyd~V~dGtNasDl~~------~R----PG~--rA~-- 136 (269) T COG1606 71 GIRHEFIKMNRMDPEFKENPENRCYLCKRAVYSTLVEEAEKRGYDVVADGTNASDLFD------YR----PGL--RAL-- 136 (269) T ss_pred CCCCEEEEHHHCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHCC------CC----CCH--HHH-- T ss_conf 8762465500025233049987355778999999999999739988974775787337------88----302--357-- Q ss_pred CCCCCCCCEEECCH--HCCCHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 10035796871012--1042899999899818932026678642 Q gi|254780546|r 156 TILYDLNLWISRPF--LRCRREDIRSFLLQRNISWCEDPSNTDD 197 (423) Q Consensus 156 ~~~~~~~~~i~RPL--L~~~r~~l~~~~~~~~i~wveDpSN~d~ 197 (423) ...-+.-|| ++++|++|+.+++..|++|+|-||-..- T Consensus 137 -----kE~gi~sPl~e~gitk~eIre~a~~lgl~~~~kp~~aCl 175 (269) T COG1606 137 -----KELGIRSPLAEFGITKKEIREIAKSLGLPTWDKPSMACL 175 (269) T ss_pred -----HHCCCCCHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCC T ss_conf -----760777718883975999999999759984558654332 No 10 >KOG2840 consensus Probab=99.56 E-value=2.7e-14 Score=120.16 Aligned_cols=201 Identities=17% Similarity=0.179 Sum_probs=160.0 Q ss_pred HHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCC Q ss_conf 85279983999966942489999999999986489972999999667798784689999999998718988999975047 Q gi|254780546|r 14 RSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSK 93 (423) Q Consensus 14 ~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~ 93 (423) +.|..+++|+++.|||+||++|+|.+..+-. +..++.++..+.||||++..-++-...|+..-.++|+|..|+.+.... T Consensus 46 ~l~~rge~v~igasGgkdstvlA~v~~~Ln~-r~~~g~~l~Lls~degi~gyrd~sl~avkrn~~~~~lPL~ivs~~dl~ 124 (347) T KOG2840 46 KLFARGERVAIGASGGKDSTVLAYVLDALNE-RHDYGLRLFLLSIDEGIRGYRDDSLEAVKRNGVQYGLPLCIVSYKDLY 124 (347) T ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH-HCCCCCEEEEEECCCCCCCEECCHHHHHHHHHHHCCCCEEEECHHHHH T ss_conf 8667888313255566027799999998403-017884134210100364200429999998614207863884199972 Q ss_pred C--------------CCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC Q ss_conf 8--------------77803788854555543200123442012345666566899998623654443322357641003 Q gi|254780546|r 94 P--------------QTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILY 159 (423) Q Consensus 94 ~--------------~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~ 159 (423) . ..|....+-.-|.+++...+...++..+.|||++||-+||++|+++||-.-.-..+..+.-.... T Consensus 125 ~~~tmd~i~~~i~~~~rn~ctfCgv~RrqaL~~ga~~l~~~~~~tghnaDD~aetvl~n~lrgds~rl~R~~~~~t~~~e 204 (347) T KOG2840 125 GEWTMDEIVSEIGQEIRNNCTFCGVFRRQALDRGADVLGAAELVTGHNADDWAETVLMNLLRGDSARLERLTEITTPSLE 204 (347) T ss_pred CCCHHHHHHHHHHHHHCCCCEEECHHHHHHHHHHCCCCCHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 50208889999866230460451189888887202213414553022306789999999997478875302365567644 Q ss_pred CCCCEEECCHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 57968710121042899999899818932026678642210189997643453100 Q gi|254780546|r 160 DLNLWISRPFLRCRREDIRSFLLQRNISWCEDPSNTDDRFERVRVRRFVRDIDLHA 215 (423) Q Consensus 160 ~~~~~i~RPLL~~~r~~l~~~~~~~~i~wveDpSN~d~~f~R~rlR~~l~~l~~~~ 215 (423) .+.+.-.+||-...+.||.-|+....+.|+.-..|..+.-.|--.|..|+++.... T Consensus 205 ~~~~~r~kplk~~~~keivLya~~~~L~yFs~eCv~ap~a~rg~~r~~lk~le~~~ 260 (347) T KOG2840 205 MGIIPRLKPLKYASEKEIVLYASLSKLRYFSTECVKAPGASRGDARILLKDLERER 260 (347) T ss_pred CCCCCCCCCCCCHHHEEEHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHC T ss_conf 67663334444401402016878888999877640487530789999964653417 No 11 >PRK00143 trmU tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase; Reviewed Probab=99.54 E-value=2.4e-14 Score=120.54 Aligned_cols=172 Identities=16% Similarity=0.157 Sum_probs=119.7 Q ss_pred CCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECC-----CC-CCCHHHHHHHHHHHHHHCCCCEEEEEEE Q ss_conf 79983999966942489999999999986489972999999667-----79-8784689999999998718988999975 Q gi|254780546|r 17 VYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDH-----CL-RETAKDEVRYVSDVCSRLRIAHSVVSWK 90 (423) Q Consensus 17 ~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdH-----gl-r~~s~~e~~~v~~~~~~lgi~~~~~~~~ 90 (423) +...||+||+|||+||++-+++|++- +++|..+|.-- +- ..-|.++...++++|+.|||||+++.+. T Consensus 2 ~sk~rV~VamSGGVDSsVaA~LL~~~-------Gy~V~Gv~m~~w~~~~~~~~C~~~~d~~dA~~va~~LgIp~~v~d~~ 74 (355) T PRK00143 2 PSKKRVVVGMSGGVDSSVAAALLKEQ-------GYDVIGLFMKLWDDDDGSGGCCAEEDIADARAVADKLGIPLYVVDFA 74 (355) T ss_pred CCCCEEEEEECCCHHHHHHHHHHHHC-------CCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECHH T ss_conf 98888999916789999999999977-------99589999988768877789985788999999999869857996869 Q ss_pred C-----------------CCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCC--CC-CCC Q ss_conf 0-----------------478778037888545555432001234420123456665668999986236544--43-322 Q gi|254780546|r 91 N-----------------SKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAE--KG-MGL 150 (423) Q Consensus 91 ~-----------------~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~--~g-~~l 150 (423) . ..|.+.+... +......+.+.+++.++++++|||.+-=.-.+--.+|.++.-. +. --| T Consensus 75 ~~f~~~Vi~~f~~~Y~~G~TPNPcv~CN-~~IKFg~l~~~A~~lgad~iATGHYAri~~~~~~~~L~r~~D~~KDQSYfL 153 (355) T PRK00143 75 KEFWDNVFDYFLDEYKAGRTPNPCVLCN-KEIKFKAFLDYALELGADYIATGHYARIRDEDGRYELLRAVDPNKDQSYFL 153 (355) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCHHCC-CCCCHHHHHHHHHHCCCCEECCCCEEEEEECCCEEEEEECCCCCCCCEEEE T ss_conf 9876788999999997699999733408-723399999999873998423352599991598189986788988720576 Q ss_pred CCCCCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 35764100357968710121042899999899818932026678642210 Q gi|254780546|r 151 SGMCDTILYDLNLWISRPFLRCRREDIRSFLLQRNISWCEDPSNTDDRFE 200 (423) Q Consensus 151 ~~~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wveDpSN~d~~f~ 200 (423) +.+.+... -+++=||-+++|+|+|+.|+++|++..+-|..+|..|- T Consensus 154 ~~l~~~~L----~~~~FPLG~~~K~eVR~iA~~~gL~~a~K~dSq~ICFi 199 (355) T PRK00143 154 YQLTQEQL----AKLLFPLGELTKPEVREIAAEIGLPTAKKKDSTGICFI 199 (355) T ss_pred ECCCHHHH----HHEECCCCCCCHHHHHHHHHHCCCCCCCCCCCCCEEEE T ss_conf 10579887----42075876773589999999758988997777860440 No 12 >cd01998 tRNA_Me_trans tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs. This family of enzyme only presents in bacteria and eukaryote. The archaeal counterpart of this enzyme performs same function, but is completely unrelated in sequence. Probab=99.48 E-value=8.9e-14 Score=116.45 Aligned_cols=168 Identities=18% Similarity=0.237 Sum_probs=117.0 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCC---C---CCCHHHHHHHHHHHHHHCCCCEEEEEEEC--- Q ss_conf 39999669424899999999999864899729999996677---9---87846899999999987189889999750--- Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHC---L---RETAKDEVRYVSDVCSRLRIAHSVVSWKN--- 91 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHg---l---r~~s~~e~~~v~~~~~~lgi~~~~~~~~~--- 91 (423) ||+||+|||+||++-++||++. ++++..+|.-.- - ..-|.++...++++|+.|||||+++.... T Consensus 1 kV~v~mSGGVDSsvaA~LL~~~-------G~~V~gv~m~~w~~~~~~~~C~~~~d~~dA~~va~~LgIp~~v~d~~~ef~ 73 (349) T cd01998 1 KVVVAMSGGVDSSVAAALLKEQ-------GYEVIGVFMKNWDEDDGKGGCCSEEDLKDARRVADQLGIPHYVVNFEKEYW 73 (349) T ss_pred CEEEECCCCHHHHHHHHHHHHC-------CCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECHHHHHH T ss_conf 9899907789999999999877-------995799999967887667898867789999999998699679968099886 Q ss_pred --------------CCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHH-HHHHHHHHCCCCC--C-CCCCCCC Q ss_conf --------------47877803788854555543200123442012345666566-8999986236544--4-3322357 Q gi|254780546|r 92 --------------SKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQL-ETVYMRSQRDYAE--K-GMGLSGM 153 (423) Q Consensus 92 --------------~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~-Et~l~rl~r~sg~--~-g~~l~~~ 153 (423) ..|.+.+... +...+..+.+.+++.++.+++|||.+.=.. ++--.++.++.-. + .--|+.+ T Consensus 74 ~~V~~~f~~~Y~~G~TPNPcv~CN-~~IKF~~l~~~A~~~g~d~iATGHYAri~~~~~~~~~L~r~~D~~KDQSYfL~~l 152 (349) T cd01998 74 EKVFEPFLEEYKKGRTPNPDILCN-KEIKFGALLDYAKKLGADYIATGHYARIEEDNNGRYRLLRGVDPNKDQSYFLSQL 152 (349) T ss_pred HHHHHHHHHHHHCCCCCCCCHHCC-CEEEHHHHHHHHHHCCCCEECCCCEEEEEECCCCCEEEEECCCCCCCCCEEEECC T ss_conf 888999999997489998762118-7351999999998759986413514788975898489987689987820564012 Q ss_pred CCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 64100357968710121042899999899818932026678642210 Q gi|254780546|r 154 CDTILYDLNLWISRPFLRCRREDIRSFLLQRNISWCEDPSNTDDRFE 200 (423) Q Consensus 154 ~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wveDpSN~d~~f~ 200 (423) .+... -+++=||-+++|+|+|+.|++.|++-.+-|..++..|- T Consensus 153 ~~~~L----~~~~FPLG~~~K~eVR~iA~~~gl~~a~K~dSq~ICFi 195 (349) T cd01998 153 SQEQL----SRLIFPLGDLTKPEVREIAKELGLPVAKKKDSQGICFI 195 (349) T ss_pred CHHHH----HHEECCCCCCCHHHHHHHHHHCCCCCCCCCCCCCEEEE T ss_conf 48766----10387887865389999999859998997777760331 No 13 >pfam03054 tRNA_Me_trans tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs. Probab=99.46 E-value=1.7e-13 Score=114.46 Aligned_cols=168 Identities=19% Similarity=0.259 Sum_probs=115.2 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCC-------CCHHHHHHHHHHHHHHCCCCEEEEEEEC-- Q ss_conf 3999966942489999999999986489972999999667798-------7846899999999987189889999750-- Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLR-------ETAKDEVRYVSDVCSRLRIAHSVVSWKN-- 91 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr-------~~s~~e~~~v~~~~~~lgi~~~~~~~~~-- 91 (423) ||+||+|||+||++-++||++- +++|+.+|...--. .-|.++...++++|+.|||||+++.+.. T Consensus 2 kV~VamSGGVDSsVaA~LL~~~-------Gy~V~Gv~m~~w~~~~~~~~~C~~~~d~~dA~~va~~LgIp~~v~d~~~~f 74 (354) T pfam03054 2 KVVVAMSGGVDSSVAAYLLKEQ-------GYEVIGVFMKNWDEEDEFGHGCCSEEDLADAQRVCEQLGIPLYVVNFEKEY 74 (354) T ss_pred EEEEEECCCHHHHHHHHHHHHC-------CCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECHHHHH T ss_conf 6999904789999999999977-------996399999956688766788998899999999999869978997879987 Q ss_pred ---------------CCCCCCCHHHHHHHHHHHHHHHCCC-CCCCCHHHHHHHHHHHHHHHHHHHCCCCC--CC-CCCCC Q ss_conf ---------------4787780378885455554320012-34420123456665668999986236544--43-32235 Q gi|254780546|r 92 ---------------SKPQTGLMAAAREARYALISEHAKT-INATLIMTAHTFDDQLETVYMRSQRDYAE--KG-MGLSG 152 (423) Q Consensus 92 ---------------~~~~~~~~~~ar~~r~~~~~~~~~~-~~~~~l~~ah~~dD~~Et~l~rl~r~sg~--~g-~~l~~ 152 (423) ..|.+.+... +...+..+.+.+.+ .|+++++|||.+-=.-..--.+|.++.-. +. --|+. T Consensus 75 ~~~Vi~~fv~~Y~~G~TPNPcv~CN-~~IKFg~l~~~A~~~lGad~iATGHYAri~~~~~~~~L~r~~D~~KDQSYfL~~ 153 (354) T pfam03054 75 WEKVFEPFLDEYKNGRTPNPDILCN-REIKFGALLDYAKQVLGADYLATGHYARVSLNENESELLRGKDKNKDQSYFLSG 153 (354) T ss_pred HHHHHHHHHHHHHCCCCCCCCHHHC-CCCCHHHHHHHHHHHCCCCCCCCCCEEEEEECCCEEEEEECCCCCCCCCEEEEC T ss_conf 5899999999997699998414418-964389999999986699810447428999049938999899999980178610 Q ss_pred CCCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 764100357968710121042899999899818932026678642210 Q gi|254780546|r 153 MCDTILYDLNLWISRPFLRCRREDIRSFLLQRNISWCEDPSNTDDRFE 200 (423) Q Consensus 153 ~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wveDpSN~d~~f~ 200 (423) +.+... -+++=||-+++|+|+|+.|++.|++..+-|..++..|- T Consensus 154 l~q~~L----~~~lFPLG~~~K~eVR~iA~~~gL~~a~K~dSq~ICFi 197 (354) T pfam03054 154 LSQEQL----EKLLFPLGDLTKEEVRKIAKEAGLPTAKKKDSQGICFI 197 (354) T ss_pred CCHHHH----HHEECCCCCCCHHHHHHHHHHCCCCCCCCCCCCCEEEE T ss_conf 789876----43182876885289999999769988998777870330 No 14 >TIGR00268 TIGR00268 conserved hypothetical protein TIGR00268; InterPro: IPR005232 This family of conserved hypothetical proteins has no known function but is predicted to be part of the wider PP-loop superfamily .. Probab=99.44 E-value=2.4e-12 Score=106.28 Aligned_cols=164 Identities=19% Similarity=0.296 Sum_probs=123.4 Q ss_pred HHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCC-CHH-HHHHHHHHHHHHCCCC Q ss_conf 99999999852799839999669424899999999999864899729999996677987-846-8999999999871898 Q gi|254780546|r 6 IESVRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRE-TAK-DEVRYVSDVCSRLRIA 83 (423) Q Consensus 6 ~~~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~-~s~-~e~~~v~~~~~~lgi~ 83 (423) .+.++.|++.-.+..+|+||+|||+||+.|+.++.+.+ ..++.|+||- +| -|. .|.+..+..++.+||. T Consensus 2 L~~L~~fl~~kf~nk~~~IAySGGvDS~lla~v~~~v~------g~~~lait~~---sP~~sp~~el~~A~~~A~~~g~~ 72 (263) T TIGR00268 2 LENLRNFLKEKFKNKKVLIAYSGGVDSSLLAAVCSDVA------GTEVLAITVV---SPSISPRRELEDAKAIAKEIGVK 72 (263) T ss_pred HHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHH------HHCEEEEEEE---CCCCCCHHHHHHHHHHHHHHCCC T ss_conf 46888898767640816999517465899999998753------1011357876---27757354489999999880830 Q ss_pred EEEEEEECCC-C--CCCCHHHH---HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 8999975047-8--77803788---8545555432001234420123456665668999986236544433223576410 Q gi|254780546|r 84 HSVVSWKNSK-P--QTGLMAAA---REARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTI 157 (423) Q Consensus 84 ~~~~~~~~~~-~--~~~~~~~a---r~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~ 157 (423) |.+++.+... + +.|.++++ ...-+..|...+.+.|..+++-|.+.||..+ .|. |+... T Consensus 73 he~~~~d~~~n~~f~~N~~~RCY~CK~~~~~~L~~~a~~~gy~~V~dGtN~dDL~~------~RP---------G~~A~- 136 (263) T TIGR00268 73 HELVKIDKMANPNFRANVEERCYFCKKKVLSILVKLAEKRGYDVVVDGTNADDLED------HRP---------GLRAV- 136 (263) T ss_pred EEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCC------CCC---------HHHHH- T ss_conf 12110012368500168854441548889999899998639957982346200023------675---------13567- Q ss_pred CCCCCCEEE-CCH--HCCCHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 035796871-012--1042899999899818932026678642 Q gi|254780546|r 158 LYDLNLWIS-RPF--LRCRREDIRSFLLQRNISWCEDPSNTDD 197 (423) Q Consensus 158 ~~~~~~~i~-RPL--L~~~r~~l~~~~~~~~i~wveDpSN~d~ 197 (423) ...... -|+ +.++|.||+..++..|++|.|-||...- T Consensus 137 ---~E~~g~~SP~aef~I~K~eir~ia~~lg~~~~DKP~~~CL 176 (263) T TIGR00268 137 ---KELNGVYSPWAEFGITKKEIREIAKSLGLSFYDKPSEACL 176 (263) T ss_pred ---HHCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCHH T ss_conf ---6607886872002568799999999748898888662102 No 15 >COG0482 TrmU Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain [Translation, ribosomal structure and biogenesis] Probab=99.39 E-value=2.6e-12 Score=106.09 Aligned_cols=172 Identities=16% Similarity=0.188 Sum_probs=121.7 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCC-----CCCHHHHHHHHHHHHHHCCCCEEEEEEECC Q ss_conf 998399996694248999999999998648997299999966779-----878468999999999871898899997504 Q gi|254780546|r 18 YPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCL-----RETAKDEVRYVSDVCSRLRIAHSVVSWKNS 92 (423) Q Consensus 18 ~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHgl-----r~~s~~e~~~v~~~~~~lgi~~~~~~~~~~ 92 (423) ...||+||+|||+||++-+++|++. +++|..+|..-.. ...|.++...+++.|..+|||++++.+... T Consensus 2 ~~~kV~v~mSGGVDSSVaA~lLk~Q-------GyeViGl~m~~~~~~~~~~C~s~~d~~da~~va~~LGIp~~~vdf~~~ 74 (356) T COG0482 2 KKKKVLVGMSGGVDSSVAAYLLKEQ-------GYEVIGLFMKNWDEDGGGGCCSEEDLRDAERVADQLGIPLYVVDFEKE 74 (356) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHC-------CCEEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECHHHH T ss_conf 8767999905777899999999976-------974999999964147888677256789999999980995599846899 Q ss_pred ---------------CCCCCCHHHH-HHHHHHHHHHHCCCCCCCCHHHHHHHHHHH---HHHHHHHHCCCCCCCCCCCCC Q ss_conf ---------------7877803788-854555543200123442012345666566---899998623654443322357 Q gi|254780546|r 93 ---------------KPQTGLMAAA-REARYALISEHAKTINATLIMTAHTFDDQL---ETVYMRSQRDYAEKGMGLSGM 153 (423) Q Consensus 93 ---------------~~~~~~~~~a-r~~r~~~~~~~~~~~~~~~l~~ah~~dD~~---Et~l~rl~r~sg~~g~~l~~~ 153 (423) -.++|-.-.+ +......+.+.+.+.++++++|||.+--.- +..+.|-....-....-|..+ T Consensus 75 y~~~V~~~f~~~Y~~G~TPNPci~CN~~iKF~~~l~~a~~lgad~iATGHYar~~~~~~~~~l~r~~D~~KDQsYfL~~~ 154 (356) T COG0482 75 FWNKVFEYFLAEYKAGKTPNPCILCNKEIKFKALLDYAKELGADYIATGHYARQREDEGIELLLRGVDLNKDQSYFLYAL 154 (356) T ss_pred HHHHHHHHHHHHHHCCCCCCCCHHCCHHHHHHHHHHHHHHCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHEECCC T ss_conf 88888887789985799999651048788999999999975998578744676543776000146787420300020126 Q ss_pred CCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 64100357968710121042899999899818932026678642210 Q gi|254780546|r 154 CDTILYDLNLWISRPFLRCRREDIRSFLLQRNISWCEDPSNTDDRFE 200 (423) Q Consensus 154 ~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wveDpSN~d~~f~ 200 (423) .+... -.++=||-++.|.++|..|.+.|++.-+-+--++..|- T Consensus 155 ~~~ql----~~~lFPlG~l~K~evR~iA~~~gL~~a~KkdS~~ICFi 197 (356) T COG0482 155 SQEQL----ERLLFPLGDLEKLEVRPIAAEKGLPTAKKKDSQGICFI 197 (356) T ss_pred CHHHH----HHCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCEEC T ss_conf 88777----53225677777789999999759976676667762213 No 16 >PRK00919 GMP synthase subunit B; Validated Probab=99.25 E-value=1.6e-10 Score=93.41 Aligned_cols=164 Identities=20% Similarity=0.227 Sum_probs=111.2 Q ss_pred CHHHHHHHHH---HHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCC-CCCHHHHHHHHHHHHHH Q ss_conf 9899999999---8527998399996694248999999999998648997299999966779-87846899999999987 Q gi|254780546|r 4 SPIESVRFFV---RSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCL-RETAKDEVRYVSDVCSR 79 (423) Q Consensus 4 ~p~~~~~~~~---~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHgl-r~~s~~e~~~v~~~~~~ 79 (423) .|..-+...+ +.-.+..++++|+|||+||++.+.|++.... -++++++||||| |.+ |.+.|.+.+++ T Consensus 2 n~~~fi~~~i~~Ir~~Vg~~kvi~~lSGGVDStV~A~Ll~kAig------~~l~~v~VD~GllR~~---E~~~V~~~~~~ 72 (306) T PRK00919 2 DPEKFIEEAIEEIREEVGDGKAIIALSGGVDSSVAAVLAHRAIG------DRLLAVYVDTGLMRKG---ETERIREIFKD 72 (306) T ss_pred CHHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHH------CCEEEEEECCCCCCCC---CHHHHHHHHHH T ss_conf 87999999999999984898599991688479999999998864------2659999868988798---69999999983 Q ss_pred CCCCEEEEEEECC-----CCCCCCH---HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 1898899997504-----7877803---7888545555432001234420123456665668999986236544433223 Q gi|254780546|r 80 LRIAHSVVSWKNS-----KPQTGLM---AAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLS 151 (423) Q Consensus 80 lgi~~~~~~~~~~-----~~~~~~~---~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~ 151 (423) |++..++..... +.-..-| ..-....+..+.+.+++.+..+++.|...-|.+|+ +++++.-.=. T Consensus 73 -~~~l~~vda~~~Fl~~L~gv~DPE~KRKiIG~~Fi~vfe~~~~~~~~~~LaQGTlyPDvIES-------~~~IKSHHNV 144 (306) T PRK00919 73 -GLNLRIVDASDRFLEALKGVTDPEEKRKIIGETFIRVFEEVAKEIGAEYLVQGTIAPDWIES-------EGGIKSHHNV 144 (306) T ss_pred -CCCCEEEEHHHHHHHHHCCCCCHHHCCCHHHHHHHHHHHHHHHHHCCCEEEECEECCCHHHC-------CCCCCCCCCC T ss_conf -58928983499999973799895674602579999999999998298768511073760204-------6764341668 Q ss_pred CCCCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCC Q ss_conf 576410035796871012104289999989981893 Q gi|254780546|r 152 GMCDTILYDLNLWISRPFLRCRREDIRSFLLQRNIS 187 (423) Q Consensus 152 ~~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~ 187 (423) +.-|. .-...++-||-.+-|+|.|+..++.|++ T Consensus 145 ggLp~---~m~~kliEPLr~LfKDEVR~lG~~LGlp 177 (306) T PRK00919 145 GGLPE---GMVLKIVEPLRDLYKDEVREVARALGLP 177 (306) T ss_pred CCCHH---HCCCCCCCHHHHHHHHHHHHHHHHHCCC T ss_conf 87733---2356310318998559999999996978 No 17 >TIGR00884 guaA_Cterm GMP synthase, C-terminal domain; InterPro: IPR001674 The amidotransferase family of enzymes utilises the ammonia derived from the hydrolysis of glutamine for a subsequent chemical reaction catalyzed by the same enzyme. The ammonia intermediate does not dissociate into solution during the chemical transformations . GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. The C-terminal domain is specific to the GMP synthases 6.3.5.2 from EC. In prokaryotes this domain mediates dimerisation. Eukaryotic GMP synthases are monomers. This domain in eukaryotes includes several large insertions that may form globular domains .; GO: 0003922 GMP synthase (glutamine-hydrolyzing) activity, 0005524 ATP binding, 0006164 purine nucleotide biosynthetic process, 0006177 GMP biosynthetic process. Probab=99.23 E-value=8.4e-11 Score=95.43 Aligned_cols=168 Identities=21% Similarity=0.284 Sum_probs=114.0 Q ss_pred HHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHH-HCCCCEEE Q ss_conf 99999985279983999966942489999999999986489972999999667798784689999999998-71898899 Q gi|254780546|r 8 SVRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCS-RLRIAHSV 86 (423) Q Consensus 8 ~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~-~lgi~~~~ 86 (423) +|..- +.-.+..++++|+|||+||++.+.|++... +-+|+++-|||||-- ..|.+.|++... ++|++... T Consensus 6 ~i~~i-r~~VGD~~vi~ALSGGVDSsV~A~L~hrAI------GD~L~~vFVD~GLlR--~gE~E~V~~~F~~~lg~nl~~ 76 (319) T TIGR00884 6 AIEEI-REQVGDAKVIIALSGGVDSSVAAVLLHRAI------GDRLTCVFVDHGLLR--KGEAERVVKTFSDKLGLNLVV 76 (319) T ss_pred HHHHH-HHHCCCCEEEEEECCCHHHHHHHHHHHHHH------CCCEEEEEECCCCCC--CCCHHHHHHHHHHHHCCCCEE T ss_conf 99999-852288468998108815899999998642------276048982278887--663789999987530898278 Q ss_pred EEEECCC--CCCCCHHHHH------HHHHHHHHHHCCCCC----CCCHHHHHHHHHHHHHHHHHHHCCC-CCCCCCCC-C Q ss_conf 9975047--8778037888------545555432001234----4201234566656689999862365-44433223-5 Q gi|254780546|r 87 VSWKNSK--PQTGLMAAAR------EARYALISEHAKTIN----ATLIMTAHTFDDQLETVYMRSQRDY-AEKGMGLS-G 152 (423) Q Consensus 87 ~~~~~~~--~~~~~~~~ar------~~r~~~~~~~~~~~~----~~~l~~ah~~dD~~Et~l~rl~r~s-g~~g~~l~-~ 152 (423) +.-.... .-.|..+... +..-..+.+.+++.+ +.+|+.|.-.-|-+|+.-.....++ .++.-+=- | T Consensus 77 VDA~e~FL~~L~GV~DPE~KRKIIG~~FI~VFE~~A~~~~~~~~a~yL~QGTlYPDvIES~~~kG~~~a~~IKsHHNVGG 156 (319) T TIGR00884 77 VDAKERFLSALKGVTDPEEKRKIIGRVFIEVFEREAEKIGDKKKAEYLVQGTLYPDVIESASSKGTSGAATIKSHHNVGG 156 (319) T ss_pred ECCCHHHHHHCCCCCCCHHHCCHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCEEEECCCCCCCCCCCCCCCCCCCC T ss_conf 76107988752889870341311225667778889985079977999973434474646023047764764563106688 Q ss_pred CCCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCCC Q ss_conf 764100357968710121042899999899818932 Q gi|254780546|r 153 MCDTILYDLNLWISRPFLRCRREDIRSFLLQRNISW 188 (423) Q Consensus 153 ~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~w 188 (423) + +..-.+.++=||=.+-|+|.|...++.|+|- T Consensus 157 L----P~~m~l~LvEPLR~LfKDEVR~lG~~LGlP~ 188 (319) T TIGR00884 157 L----PEDMKLKLVEPLRELFKDEVRKLGKELGLPE 188 (319) T ss_pred C----CCCCEEEEECCCHHCCHHHHHHHHHHCCCCH T ss_conf 8----5426025875402302178999998718867 No 18 >cd01712 ThiI ThiI is required for thiazole synthesis in the thiamine biosynthesis pathway. It belongs to the Adenosine Nucleotide Hydrolysis suoerfamily and predicted to bind to Adenosine nucleotide. Probab=99.23 E-value=1.5e-10 Score=93.66 Aligned_cols=144 Identities=18% Similarity=0.184 Sum_probs=98.3 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCC--HHHHHHHHHHHHHHCCCCEEEEEEECC------ Q ss_conf 399996694248999999999998648997299999966779878--468999999999871898899997504------ Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRET--AKDEVRYVSDVCSRLRIAHSVVSWKNS------ 92 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~--s~~e~~~v~~~~~~lgi~~~~~~~~~~------ 92 (423) |+++++|||.||.+-++++.+ + +++|..+|++-+--.. +.+.+..+.+...+++..+....+... T Consensus 1 kvl~L~SGGiDS~VAa~ll~k----r---G~~V~~l~f~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 73 (177) T cd01712 1 KALALLSGGIDSPVAAWLLMK----R---GIEVDALHFNSGPFTSEKAREKVEDLARKLARYSPGHKLVVIIFTFFVQKE 73 (177) T ss_pred CEEEEECCCCHHHHHHHHHHH----C---CCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHH T ss_conf 989995778169999999998----7---998999999899988889999999999999994899628999453899999 Q ss_pred ----CCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECC Q ss_conf ----7877803788854555543200123442012345666566899998623654443322357641003579687101 Q gi|254780546|r 93 ----KPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWISRP 168 (423) Q Consensus 93 ----~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~RP 168 (423) .+..+-.-.+++.-|+.....+++.|+..++|||...-.+-..+.++. .+ ...-...++|| T Consensus 74 ~~~~g~~~npcv~ckr~m~r~a~~~A~~~ga~~IvTGe~lGQvasqt~~nl~-----------~i----~~~~~~~ilRP 138 (177) T cd01712 74 IYGYGKEKYRCILCKRMMYRIAEKLAEELGADAIVTGESLGQVASQTLENLL-----------VI----SSGTDLPILRP 138 (177) T ss_pred HHHHCCCCCCCEEHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHH-----------HH----HHHHCCCCCCC T ss_conf 9970888898636299999999999998699899866522204476899999-----------99----87425642148 Q ss_pred HHCCCHHHHHHHHHHHCC Q ss_conf 210428999998998189 Q gi|254780546|r 169 FLRCRREDIRSFLLQRNI 186 (423) Q Consensus 169 LL~~~r~~l~~~~~~~~i 186 (423) |++++|+||++++++.|+ T Consensus 139 L~~~dK~EI~~~A~~igt 156 (177) T cd01712 139 LIGFDKEEIIGIARRIGT 156 (177) T ss_pred CCCCCHHHHHHHHHHHCC T ss_conf 889998999999998096 No 19 >cd01713 PAPS_reductase This domain is found in phosphoadenosine phosphosulphate (PAPS) reductase enzymes or PAPS sulphotransferase. PAPS reductase is part of the adenine nucleotide alpha hydrolases superfamily also including N type ATP PPases and ATP sulphurylases. A highly modified version of the P loop, the fingerprint peptide of mononucleotide-binding proteins, is present in the active site of the protein, which appears to be a positively charged cleft containing a number of conserved arginine and lysine residues. Although PAPS reductase has no ATPase activity, it shows a striking similarity to the structure of the ATP pyrophosphatase (ATP PPase) domain of GMP synthetase, indicating that both enzyme families have evolved from a common ancestral nucleotide-binding fold. The enzyme uses thioredoxin as an electron donor for the reduction of PAPS to phospho-adenosine-phosphate (PAP) . It is also found in NodP nodulation protein P from Rhizobium meliloti which has ATP sulphurylase acti Probab=99.15 E-value=6.1e-10 Score=89.34 Aligned_cols=149 Identities=19% Similarity=0.201 Sum_probs=91.2 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCC------- Q ss_conf 3999966942489999999999986489972999999667798784689999999998718988999975047------- Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSK------- 93 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~------- 93 (423) +++|++|||+||+|||||+.+..... ..+.++++|.|. +-.+-.++++++.+.+|++..+....... T Consensus 1 ~~~vsfSGGKDS~vll~L~~~~~~~~----~~~~vvf~DTg~--e~pet~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (173) T cd01713 1 NVVVSFSGGKDSTVLLHLALKALPEL----KPVPVIFLDTGY--EFPETYEFVDRVAERYGLPLVVVRPPDSPAEGLALG 74 (173) T ss_pred CEEEEECCHHHHHHHHHHHHHHCCCC----CCEEEEEECCCC--CCHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHH T ss_conf 95999646499999999999966555----883799968889--898999999999987399079996897289999861 Q ss_pred -CCCCCHHHH-----HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEEC Q ss_conf -877803788-----85455554320012344201234566656689999862365444332235764100357968710 Q gi|254780546|r 94 -PQTGLMAAA-----REARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWISR 167 (423) Q Consensus 94 -~~~~~~~~a-----r~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~R 167 (423) ......... ...+-.-+....++.+....++|--+++-..-..+ ..........+...+. T Consensus 75 ~~~~~~~~~~~~~c~~~~K~~P~~~~~~~~~~~~~~~GiR~~Es~~R~~~--------------~~~~~~~~~~~~~~~~ 140 (173) T cd01713 75 LKGFPLPSPDRRWCCRILKVEPLRRALKELGVVAWITGIRRDESARRALL--------------PVVWTDDGKGGILKVN 140 (173) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHCCC--------------CCCCCCCCCCCEEEEE T ss_conf 35779983226889888807899999983298289994121462444437--------------6234326889989993 Q ss_pred CHHCCCHHHHHHHHHHHCCCCC Q ss_conf 1210428999998998189320 Q gi|254780546|r 168 PFLRCRREDIRSFLLQRNISWC 189 (423) Q Consensus 168 PLL~~~r~~l~~~~~~~~i~wv 189 (423) |+++++.+++-+|+..+++++. T Consensus 141 Pi~~Wt~~dVw~yi~~~~l~~~ 162 (173) T cd01713 141 PLLDWTYEDVWAYLARHGLPYN 162 (173) T ss_pred CHHHCCHHHHHHHHHHCCCCCC T ss_conf 2285999999999998399999 No 20 >PRK00074 guaA GMP synthase; Reviewed Probab=99.04 E-value=3.1e-09 Score=84.30 Aligned_cols=156 Identities=19% Similarity=0.290 Sum_probs=103.4 Q ss_pred HHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCC-CCCCHHHHHHHHHHHH-HHCCCCEEEEEEEC Q ss_conf 852799839999669424899999999999864899729999996677-9878468999999999-87189889999750 Q gi|254780546|r 14 RSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHC-LRETAKDEVRYVSDVC-SRLRIAHSVVSWKN 91 (423) Q Consensus 14 ~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHg-lr~~s~~e~~~v~~~~-~~lgi~~~~~~~~~ 91 (423) +.-.+..+|+.|+|||+||++.+.|++.... -+++.++|||| ||.+ |.+.|.+.. +.+|++..++.... T Consensus 211 k~~Vg~~kVi~~lSGGVDStV~A~Ll~kAIG------d~l~cvfVD~GllRkn---E~~~V~~~~~~~lgl~~~~vdA~~ 281 (513) T PRK00074 211 REQVGDKKVILGLSGGVDSSVAAVLLHKAIG------DQLTCVFVDHGLLRKN---EAEQVMEMFREHFGLNLIHVDASD 281 (513) T ss_pred HHHHCCCEEEEEECCCCCHHHHHHHHHHHHH------HCEEEEEECCCCCCCC---CHHHHHHHHHHHCCCCEEEEEHHH T ss_conf 9983876289972388308999999999851------1248999537711167---299999999886199679922489 Q ss_pred CC--CCCCC--HHHHH----HHHHHHHHHHCCC-CCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC---C--C Q ss_conf 47--87780--37888----5455554320012-344201234566656689999862365444332235764---1--0 Q gi|254780546|r 92 SK--PQTGL--MAAAR----EARYALISEHAKT-INATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCD---T--I 157 (423) Q Consensus 92 ~~--~~~~~--~~~ar----~~r~~~~~~~~~~-~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~---~--~ 157 (423) .. .-.++ .+.-| ....+.+.+.+++ .++.+++.|...-|.+|+.- +++. ...+.. + . T Consensus 282 ~Fl~~L~gv~DPE~KRKIIG~~FI~vfe~~a~~~~~~~~L~QGTlYPDvIES~~-----~~~~----a~~IKsHHNVggl 352 (513) T PRK00074 282 RFLSALAGVTDPEEKRKIIGREFIEVFEEEAKKLGGVKFLAQGTLYPDVIESAS-----GTGK----AATIKSHHNVGGL 352 (513) T ss_pred HHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCEEEECC-----CCCC----CCCCCCCCCCCCC T ss_conf 999973799897996788778999999999975799839964676550688479-----8999----8860487758788 Q ss_pred CCCCCCEEECCHHCCCHHHHHHHHHHHCCC Q ss_conf 035796871012104289999989981893 Q gi|254780546|r 158 LYDLNLWISRPFLRCRREDIRSFLLQRNIS 187 (423) Q Consensus 158 ~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~ 187 (423) +..-...++-||-.+-|+|.|...++.|++ T Consensus 353 p~~~~~~lvEPlr~lfKDEVR~lg~~Lglp 382 (513) T PRK00074 353 PEDMKLKLVEPLRELFKDEVRKLGLELGLP 382 (513) T ss_pred HHHCCCEEEEEHHHHCCHHHHHHHHHHCCC T ss_conf 566487545031756158999999996897 No 21 >pfam01507 PAPS_reduct Phosphoadenosine phosphosulfate reductase family. This domain is found in phosphoadenosine phosphosulfate (PAPS) reductase enzymes or PAPS sulfotransferase. PAPS reductase is part of the adenine nucleotide alpha hydrolases superfamily also including N type ATP PPases and ATP sulphurylases. The enzyme uses thioredoxin as an electron donor for the reduction of PAPS to phospho-adenosine-phosphate (PAP). It is also found in NodP nodulation protein P from Rizobium which has ATP sulfurylase activity (sulfate adenylate transferase). Probab=99.04 E-value=2.4e-09 Score=85.07 Aligned_cols=147 Identities=15% Similarity=0.154 Sum_probs=84.6 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCC-----C Q ss_conf 39999669424899999999999864899729999996677987846899999999987189889999750478-----7 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSKP-----Q 95 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~~-----~ 95 (423) +|+|++|||+||+|||||+.+.. ..+.++++|.|. +-.+-.+++.++.+++|++..+...+.... . T Consensus 1 ~v~vsfSGGKDS~vlL~L~~~~~-------~~~~vvf~Dtg~--efpet~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 71 (174) T pfam01507 1 ELVVSFSGGKDSLVLLHLASKAF-------PPGPVIFIDTGY--EFPETYEFVDELEEKYGLNLKVYRPEDSFAEGINPE 71 (174) T ss_pred CEEEEECCHHHHHHHHHHHHHHC-------CCCCEEEEECCC--CCHHHHHHHHHHHHHHCCEEEEEECCHHHHHHHHHC T ss_conf 99999576099999999999848-------997579997899--868999999999998498189980867798776312 Q ss_pred CCCHHH-----HHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCHH Q ss_conf 780378-----885455554320012344201234566656689999862365444332235764100357968710121 Q gi|254780546|r 96 TGLMAA-----AREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWISRPFL 170 (423) Q Consensus 96 ~~~~~~-----ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~RPLL 170 (423) +..... ++.....-+....++.+...+++|--.++-..-..+ ..........+..-+.|++ T Consensus 72 ~~~~~~~~~~~~~~~K~~p~~~~l~~~~~~~~i~GiR~~Es~~R~~~--------------~~~~~~~~~~~~~~~~PI~ 137 (174) T pfam01507 72 GIPSKLWEDCPCRLRKVEPLKRALKKLDFDAWFTGLRRDESPSRAKL--------------PIVSIDGDFPKVIKVFPLL 137 (174) T ss_pred CCCHHHCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHC--------------CEEEEECCCCCEEEEECHH T ss_conf 79501145452146651799999986599689995100244566407--------------1674305788859995005 Q ss_pred CCCHHHHHHHHHHHCCCCCC Q ss_conf 04289999989981893202 Q gi|254780546|r 171 RCRREDIRSFLLQRNISWCE 190 (423) Q Consensus 171 ~~~r~~l~~~~~~~~i~wve 190 (423) +++.+++-+|+.++|+++.. T Consensus 138 ~Wt~~DVw~yi~~~~lp~np 157 (174) T pfam01507 138 NWTETDVWQYILANNVPYNP 157 (174) T ss_pred HCCHHHHHHHHHHCCCCCCC T ss_conf 29999999999983999995 No 22 >cd01997 GMP_synthase_C The C-terminal domain of GMP synthetase. It contains two subdomains; the ATP pyrophosphatase domain which closes to the N-termial and the dimerization domain at C-terminal end. The ATP-PPase is a twisted, five-stranded parallel beta-sheet sandwiched between helical layers. It has a signature nucleotide-binding motif, or P-loop, at the end of the first-beta strand.The dimerization domain formed by the C-terminal 115 amino acid for prokaryotic proteins. It is adjacent to teh ATP-binding site of the ATP-PPase subdomain. The largest difference between the primary sequence of prokaryotic and eukaryotic GMP synthetase map to the dimerization domain.Eukaryotic GMP synthetase has several large insertions relative to prokaryotes. Probab=99.03 E-value=2.4e-09 Score=85.06 Aligned_cols=152 Identities=22% Similarity=0.284 Sum_probs=99.2 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCC-CCCHHHHHHHHHHHHHH-CCCCEEEEEEECC-----C Q ss_conf 399996694248999999999998648997299999966779-87846899999999987-1898899997504-----7 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCL-RETAKDEVRYVSDVCSR-LRIAHSVVSWKNS-----K 93 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHgl-r~~s~~e~~~v~~~~~~-lgi~~~~~~~~~~-----~ 93 (423) ||++|+|||+||++.+.|++.... -++++++||||| |.+ |.+.|.+.+++ ++++..++..... + T Consensus 1 kVi~~lSGGVDStV~A~Ll~kAig------~~l~~vfVDnGlmRk~---E~e~V~~~~~~~~~~~~~~vdas~~Fl~~L~ 71 (295) T cd01997 1 KVILALSGGVDSTVAAVLLHKAIG------DRLTCVFVDNGLLRKN---EAERVEELFSKLLGINLIVVDASERFLSALK 71 (295) T ss_pred CEEEEECCCHHHHHHHHHHHHHHH------CCEEEEEECCCCCCCC---CHHHHHHHHHHHCCCCEEEECHHHHHHHHHC T ss_conf 989990578289999999998865------0569999858978888---2999999998706997799566999999865 Q ss_pred CCCCCHH---HHHHHHHHHHHHHCCCCC-CCCHHHHHHHHHHHHHHHHHHHCCC--CCCCCCCCCCCCCCCCCCCCEEEC Q ss_conf 8778037---888545555432001234-4201234566656689999862365--444332235764100357968710 Q gi|254780546|r 94 PQTGLMA---AAREARYALISEHAKTIN-ATLIMTAHTFDDQLETVYMRSQRDY--AEKGMGLSGMCDTILYDLNLWISR 167 (423) Q Consensus 94 ~~~~~~~---~ar~~r~~~~~~~~~~~~-~~~l~~ah~~dD~~Et~l~rl~r~s--g~~g~~l~~~~~~~~~~~~~~i~R 167 (423) ....-|+ .-....++.+.+.+++.+ +.+++.|...-|.+|+.-- .++ -++.-.=.+..| ..-.+.++- T Consensus 72 gv~DPE~KRKiIG~~Fi~vf~~~~~~~~~~~~L~QGTlyPDvIES~~~---~~~a~~IKsHHNvggLp---~~~~~kliE 145 (295) T cd01997 72 GVTDPEEKRKIIGETFIEVFEEEAKKLGLAEYLAQGTLYPDVIESGSG---KGSADTIKSHHNVGGLP---EDMKLKLIE 145 (295) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCEEECCCC---CCCCCEEEEECCCCCCC---HHCCCCCCH T ss_conf 877925636565389999999999755886189724302625750788---88875145224567772---322654001 Q ss_pred CHHCCCHHHHHHHHHHHCCC Q ss_conf 12104289999989981893 Q gi|254780546|r 168 PFLRCRREDIRSFLLQRNIS 187 (423) Q Consensus 168 PLL~~~r~~l~~~~~~~~i~ 187 (423) ||-.+-|+|.|+..++.|++ T Consensus 146 PLr~LfKDEVR~lG~~LGlp 165 (295) T cd01997 146 PLRDLFKDEVRELGRELGLP 165 (295) T ss_pred HHHHHHHHHHHHHHHHHCCC T ss_conf 67887459999999986897 No 23 >KOG2805 consensus Probab=99.03 E-value=8.5e-10 Score=88.30 Aligned_cols=173 Identities=17% Similarity=0.214 Sum_probs=112.7 Q ss_pred CCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEE-------CCCCCCCHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 2799839999669424899999999999864899729999996-------677987846899999999987189889999 Q gi|254780546|r 16 LVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISV-------DHCLRETAKDEVRYVSDVCSRLRIAHSVVS 88 (423) Q Consensus 16 l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~v-------dHglr~~s~~e~~~v~~~~~~lgi~~~~~~ 88 (423) -.++++|+||+|||+||.+-++|++.- ++++..++. ++|-+.-...+.+.++.+|+.|+||++.+. T Consensus 2 p~~~~~VvvamSgGVDSsVaa~Ll~~~-------g~~v~gv~M~nWd~~de~~s~cp~e~D~~da~~Vc~~LnI~~~~Vn 74 (377) T KOG2805 2 PEKPDRVVVAMSGGVDSSVAARLLAAR-------GYNVTGVFMKNWDSLDEFGSQCPAERDWKDAKRVCKQLNIPLHQVN 74 (377) T ss_pred CCCCCEEEEEECCCCHHHHHHHHHHHC-------CCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHCCEEEEEE T ss_conf 866654899953771189999999741-------8871699662201222356689811208999999987097027874 Q ss_pred EE-----------------CCCCCCCCHHHHHHHHHH-HHHHHCCCCCCCCHHHHHHHHHHHHHHH---HHHHCCC---C Q ss_conf 75-----------------047877803788854555-5432001234420123456665668999---9862365---4 Q gi|254780546|r 89 WK-----------------NSKPQTGLMAAAREARYA-LISEHAKTINATLIMTAHTFDDQLETVY---MRSQRDY---A 144 (423) Q Consensus 89 ~~-----------------~~~~~~~~~~~ar~~r~~-~~~~~~~~~~~~~l~~ah~~dD~~Et~l---~rl~r~s---g 144 (423) +. +..|.+.+... +...+. +....-...+.++++|||-+--..|... .+++.+. - T Consensus 75 f~kEYW~~Vfs~~L~~Y~~G~TPNPDI~CN-~~IKFg~~~~~a~en~~~d~latGHYAr~~~~~~~~~~~~l~~~~d~~K 153 (377) T KOG2805 75 FVKEYWNDVFSPFLEEYENGRTPNPDILCN-KHIKFGKFFKHAIENLGYDWLATGHYARVVLEDEDNAESHLLISKDMVK 153 (377) T ss_pred EHHHHHHHHHHHHHHHHHCCCCCCCCCCCC-CEEECCHHHHHHHHHCCCCEEEEEEEEEEECCCCCCCCEEEEECCCCCC T ss_conf 379999999999988876589999971344-5142117899998755887377421214403755676235761256667 Q ss_pred CCCCCCCCCCCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 44332235764100357968710121042899999899818932026678642210 Q gi|254780546|r 145 EKGMGLSGMCDTILYDLNLWISRPFLRCRREDIRSFLLQRNISWCEDPSNTDDRFE 200 (423) Q Consensus 145 ~~g~~l~~~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wveDpSN~d~~f~ 200 (423) -.+.-|+.+..... -+++=||-.++|+|+++.+++.|++--|.|--+-..|- T Consensus 154 DQt~FL~~in~~~L----~r~lfPlg~~~K~eVk~lA~~~gf~~aeK~eSqGICFv 205 (377) T KOG2805 154 DQTYFLSTINQTQL----KRLLFPLGCLTKSEVKKLAKQAGFPNAEKPESQGICFV 205 (377) T ss_pred CCEEEEECCCHHHH----HHHHCCCCCCCHHHHHHHHHHCCCCCCCCCCCCEEEEE T ss_conf 73267630369999----86206676567799999998659864568665026885 No 24 >pfam02568 ThiI Thiamine biosynthesis protein (ThiI). ThiI is required for thiazole synthesis, required for thiamine biosynthesis. Probab=99.02 E-value=3.6e-09 Score=83.85 Aligned_cols=151 Identities=17% Similarity=0.158 Sum_probs=101.6 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCC--CHHHHHHHHHHHHHHC----CCCEEEEEEECC Q ss_conf 9839999669424899999999999864899729999996677987--8468999999999871----898899997504 Q gi|254780546|r 19 PAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRE--TAKDEVRYVSDVCSRL----RIAHSVVSWKNS 92 (423) Q Consensus 19 ~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~--~s~~e~~~v~~~~~~l----gi~~~~~~~~~~ 92 (423) ..|+++.+|||.||.+.++++.. + +.+|.++|.+++.-. ++.+-+....+...++ .+++++...... T Consensus 3 ~gk~l~LlSGGiDSpVAa~lmmk----R---G~~V~~vhf~~~p~~~~~~~~k~~~l~~~l~~y~~~~~~~l~vv~~~~~ 75 (197) T pfam02568 3 QGKVLALLSGGIDSPVAAYLMMR----R---GCRVVALHFINEPGTSEEAIEKVRKLAELLAEYGTSCEGKLVVVDFTKV 75 (197) T ss_pred CCCEEEEECCCCHHHHHHHHHHH----C---CCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEECCHHH T ss_conf 77189886687129999999998----7---9979999987999998999999999999999737887544999574999 Q ss_pred -------CCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEE Q ss_conf -------7877803788854555543200123442012345666566899998623654443322357641003579687 Q gi|254780546|r 93 -------KPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWI 165 (423) Q Consensus 93 -------~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i 165 (423) .+.....-.+|+.-|+.....+++.++..++||..++..+-..+-++. . ....-+..+ T Consensus 76 ~~~i~~~~~~~~~cv~cKr~M~r~A~~iA~~~ga~~IVTGEsLGQVaSQTl~nl~-----------~----i~~~~~~pi 140 (197) T pfam02568 76 QKEIIEKAPEKYRCVLCKRCMYRAAEKVAEEEGADALVTGESLGQVASQTLDNLR-----------V----ISAATNLPI 140 (197) T ss_pred HHHHHHCCCCCCEEHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHCCCCHHH-----------H----HHHHHCCCC T ss_conf 9999962898765456999999999999998499899847303121004531069-----------9----987624853 Q ss_pred ECCHHCCCHHHHHHHHHHHCCCCCCC Q ss_conf 10121042899999899818932026 Q gi|254780546|r 166 SRPFLRCRREDIRSFLLQRNISWCED 191 (423) Q Consensus 166 ~RPLL~~~r~~l~~~~~~~~i~wveD 191 (423) +|||+++.|+||++.+++.|.-.+.. T Consensus 141 lRPLig~DK~EIi~~Ar~IGt~~~s~ 166 (197) T pfam02568 141 LRPLIGLDKEEIINLAKEIGTYEISI 166 (197) T ss_pred CCCCCCCCHHHHHHHHHHHCCHHHHC T ss_conf 13434699999999999948677662 No 25 >COG0519 GuaA GMP synthase, PP-ATPase domain/subunit [Nucleotide transport and metabolism] Probab=99.01 E-value=5.2e-09 Score=82.76 Aligned_cols=161 Identities=20% Similarity=0.237 Sum_probs=108.5 Q ss_pred HHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHH-CCCCEEEEEEECC Q ss_conf 852799839999669424899999999999864899729999996677987846899999999987-1898899997504 Q gi|254780546|r 14 RSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSR-LRIAHSVVSWKNS 92 (423) Q Consensus 14 ~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~-lgi~~~~~~~~~~ 92 (423) +.-.+..++++|+|||+||++.+.|++.... -++..+.|||||-- ..|.+.|.+...+ +|++...+.-... T Consensus 16 r~~vg~~kvi~alSGGVDSsv~a~L~~~AiG------d~l~cvfVD~GLlR--~~E~e~V~~~f~~~~~~nl~~VdA~~~ 87 (315) T COG0519 16 REQVGDGKVILALSGGVDSSVAAVLAHRAIG------DQLTCVFVDHGLLR--KGEAEQVVEMFREHLGLNLIVVDAKDR 87 (315) T ss_pred HHHHCCCEEEEEECCCCCHHHHHHHHHHHHH------CCEEEEEECCCCCC--CCCHHHHHHHHHHHCCCCEEEECHHHH T ss_conf 9972885399983387808999999999740------52489996377434--785699999987532984599754988 Q ss_pred C----CCCCCHHHH----HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 7----877803788----85455554320012344201234566656689999862365444332235764100357968 Q gi|254780546|r 93 K----PQTGLMAAA----REARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLW 164 (423) Q Consensus 93 ~----~~~~~~~~a----r~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~ 164 (423) . .+-.-.+.- .+.....+.+.+++.++.+++.|...-|.+|+. .-.+..++.-.=.|. .+..-... T Consensus 88 Fl~~L~GvtDPE~KRKiIG~~Fi~VFe~ea~k~~~~~LaQGTiYpDvIES~---~g~~~~IKSHHNVGG---LP~~m~lk 161 (315) T COG0519 88 FLSALKGVTDPEEKRKIIGREFIEVFEEEAKKLGAEFLAQGTIYPDVIESG---TGKAGTIKSHHNVGG---LPEDMKLK 161 (315) T ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCEEEEC---CCCCCCCCCCCCCCC---CCCCCCEE T ss_conf 998843999889989998799999999999857860687044345046506---888886213562688---95014222 Q ss_pred EECCHHCCCHHHHHHHHHHHCCCC Q ss_conf 710121042899999899818932 Q gi|254780546|r 165 ISRPFLRCRREDIRSFLLQRNISW 188 (423) Q Consensus 165 i~RPLL~~~r~~l~~~~~~~~i~w 188 (423) ++-||-.+-|+|+|...++.|++- T Consensus 162 LvEPLr~LfKDEVR~lg~~LGlp~ 185 (315) T COG0519 162 LVEPLRELFKDEVRELGRELGLPE 185 (315) T ss_pred EEHHHHHHHHHHHHHHHHHHCCCH T ss_conf 307789886899999999859988 No 26 >COG0603 Predicted PP-loop superfamily ATPase [General function prediction only] Probab=99.01 E-value=6.1e-09 Score=82.27 Aligned_cols=150 Identities=16% Similarity=0.195 Sum_probs=103.2 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCC--- Q ss_conf 98399996694248999999999998648997299999966779878468999999999871898899997504787--- Q gi|254780546|r 19 PAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSKPQ--- 95 (423) Q Consensus 19 ~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~~~--- 95 (423) +.|-+|=+|||.||+++|+.+++- ..+|+++|+|+|-|. ..|.+.++.+|+.+||+++++.++....- T Consensus 2 ~~kavvl~SGG~DStt~l~~a~~~-------~~ev~alsfdYGQrh--~~Ele~A~~iak~lgv~~~iid~~~~~~~~~s 72 (222) T COG0603 2 MKKAVVLLSGGLDSTTCLAWAKKE-------GYEVHALTFDYGQRH--RKELEAAKELAKKLGVPHHIIDVDLLGEIGGS 72 (222) T ss_pred CCEEEEECCCCHHHHHHHHHHHHC-------CCEEEEEEEECCCCC--HHHHHHHHHHHHHCCCCEEEECHHHHHHCCCC T ss_conf 834999916881389999999960-------988999975078986--89999999999981998289541687535888 Q ss_pred ---CC---CH------H-------HHHHH-HHHHHHHHCCCCCCCCHHHHHHHHHH------HHHHHHHHHCCCCCCCCC Q ss_conf ---78---03------7-------88854-55554320012344201234566656------689999862365444332 Q gi|254780546|r 96 ---TG---LM------A-------AAREA-RYALISEHAKTINATLIMTAHTFDDQ------LETVYMRSQRDYAEKGMG 149 (423) Q Consensus 96 ---~~---~~------~-------~ar~~-r~~~~~~~~~~~~~~~l~~ah~~dD~------~Et~l~rl~r~sg~~g~~ 149 (423) .+ +. + .+|-. -.......+...++..+.+|-|..|. ...|...+.... . T Consensus 73 aLtd~~~~vp~~~~~~~~~p~t~VP~RN~iflsiA~~~Ae~~g~~~I~~Gv~~~D~sgYPDcrpefi~a~~~~~-----~ 147 (222) T COG0603 73 ALTDDSIDVPKYEFAEEEIPATFVPARNLIFLSIAAAYAEALGADAIIIGVNEEDFSGYPDCRPEFIEALNEAL-----N 147 (222) T ss_pred CCCCCCCCCCCCCCCCCCCCCEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHH-----H T ss_conf 67588863666666666686447606609999999999987599859997532104789988889999999999-----8 Q ss_pred CCCCCCCCCCCCCCE-EECCHHCCCHHHHHHHHHHHCCCCC Q ss_conf 235764100357968-7101210428999998998189320 Q gi|254780546|r 150 LSGMCDTILYDLNLW-ISRPFLRCRREDIRSFLLQRNISWC 189 (423) Q Consensus 150 l~~~~~~~~~~~~~~-i~RPLL~~~r~~l~~~~~~~~i~wv 189 (423) + ++. .++. +.-||...+|++|.....+.|.++- T Consensus 148 l-~~~------~~~~~i~aPl~~l~Ka~iv~l~~elg~~~~ 181 (222) T COG0603 148 L-GTE------KGVRIIHAPLMELTKAEIVKLADELGVPLE 181 (222) T ss_pred H-HCC------CCCCEEECCEEECCHHHHHHHHHHHCCCCH T ss_conf 6-255------786177678331349999998888688510 No 27 >cd01995 ExsB ExsB is a transcription regulator related protein. It is a subfamily of a Adenosine nucleotide binding superfamily of proteins. This protein family is represented by a single member in nearly every completed large ( 1000 genes) prokaryotic genome. In Rhizobium meliloti, a species in which the exo genes make succinoglycan, a symbiotically important exopolysaccharide, exsB is located nearby and affects succinoglycan levels, probably through polar effects on exsA expression or the same polycistronic mRNA. In Arthrobacter viscosus, the homologous gene is designated ALU1 and is associated with an aluminum tolerance phenotype. The function is unknown Probab=98.99 E-value=5.5e-09 Score=82.58 Aligned_cols=136 Identities=16% Similarity=0.163 Sum_probs=94.9 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHH Q ss_conf 39999669424899999999999864899729999996677987846899999999987189889999750478778037 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSKPQTGLMA 100 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~~~~~~~~ 100 (423) |.+|=+|||.||+++|+.+.+ . ..+++++++|+|-|. ..|.+..+.+|+.+|-.-. + |..|.-- T Consensus 1 kavvllSGGlDSt~~l~~~~~----~---g~~v~~l~~dYGQr~--~~E~~~a~~i~~~l~~~~~-----V--P~RN~if 64 (169) T cd01995 1 KAVVLLSGGLDSTTCLAWAKK----E---GYEVHALSFDYGQRH--AKEEEAAKLIAEKLGPSTY-----V--PARNLIF 64 (169) T ss_pred CEEEEECCCHHHHHHHHHHHH----C---CCEEEEEEEECCCCH--HHHHHHHHHHHHHHCCCCE-----E--CCCHHHH T ss_conf 989990787889999999998----4---996999997569854--9999999999999689946-----5--2838999 Q ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH------HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCHHCCCH Q ss_conf 8885455554320012344201234566656------6899998623654443322357641003579687101210428 Q gi|254780546|r 101 AAREARYALISEHAKTINATLIMTAHTFDDQ------LETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWISRPFLRCRR 174 (423) Q Consensus 101 ~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~------~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~RPLL~~~r 174 (423) .......+...++..+.+|-|.+|. ...|+-.+...-. .....+++|.-||+..+| T Consensus 65 ------ls~A~~~A~~~g~~~v~~G~~~~d~~~ypDc~~~F~~a~~~~~~------------~~~~~~v~i~aP~~~~~K 126 (169) T cd01995 65 ------LSIAAAYAEALGAEAIIIGVNAEDYSGYPDCRPEFIEAMNKALN------------LGTENGIKIHAPLIDLSK 126 (169) T ss_pred ------HHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHH------------HCCCCCCEEEECCCCCCH T ss_conf ------99999999982999689998247756899986999999999998------------308798589843335989 Q ss_pred HHHHHHHHHHCCCCCC Q ss_conf 9999989981893202 Q gi|254780546|r 175 EDIRSFLLQRNISWCE 190 (423) Q Consensus 175 ~~l~~~~~~~~i~wve 190 (423) .||...+.+.|++|-. T Consensus 127 ~eiv~~g~~lgv~~~~ 142 (169) T cd01995 127 AEIVRLGGELGVPLEL 142 (169) T ss_pred HHHHHHHHHCCCCHHH T ss_conf 9999999884997998 No 28 >PRK11106 queuosine biosynthesis protein QueC; Provisional Probab=98.98 E-value=4.9e-09 Score=82.91 Aligned_cols=148 Identities=18% Similarity=0.201 Sum_probs=100.5 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCC-CEEEEEEECCC----- Q ss_conf 839999669424899999999999864899729999996677987846899999999987189-88999975047----- Q gi|254780546|r 20 AHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRI-AHSVVSWKNSK----- 93 (423) Q Consensus 20 ~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi-~~~~~~~~~~~----- 93 (423) .|.+|-+|||.||+++|+.+.+ . ..+++++++|+|-|. ..|.+..+++|+.+|+ +|.++..+... T Consensus 2 kkavVLlSGGlDStt~L~~a~~----~---~~~v~alsfdYGQrh--~~El~~A~~ia~~~gv~~h~vidl~~l~~~~~S 72 (231) T PRK11106 2 KRAVVVFSGGQDSTTCLIQALQ----Q---YDEVHCITFDYGQRH--RAEIDVARELALKLGARAHKVLDVTLLNELAVS 72 (231) T ss_pred CEEEEEECCCHHHHHHHHHHHH----C---CCEEEEEEEECCCCC--HHHHHHHHHHHHHCCCCCEEEECCHHHHHHCCC T ss_conf 8089990787899999999998----1---994999987778671--999999999999859840067532777775235 Q ss_pred ---------CCCCCHH--------HHHHH-HHHHHHHHCCCCCCCCHHHHHHHHHH------HHHHHHHHHCCCCCCCCC Q ss_conf ---------8778037--------88854-55554320012344201234566656------689999862365444332 Q gi|254780546|r 94 ---------PQTGLMA--------AAREA-RYALISEHAKTINATLIMTAHTFDDQ------LETVYMRSQRDYAEKGMG 149 (423) Q Consensus 94 ---------~~~~~~~--------~ar~~-r~~~~~~~~~~~~~~~l~~ah~~dD~------~Et~l~rl~r~sg~~g~~ 149 (423) +....++ ..|-. .....+..+...++..+.+|-|..|. -..|+..+...... T Consensus 73 aLt~~~i~vp~~~~~~~~~p~T~VP~RN~ifLsiAaa~Ae~~ga~~I~~G~~~~D~sgYPDCr~eFi~a~~~al~~---- 148 (231) T PRK11106 73 SLTRDSIPVPDYEPEADGLPNTFVPGRNILFLTLAAIYAYQVKAEAVITGVCETDFSGYPDCRDEFVKALNHAVVL---- 148 (231) T ss_pred CCCCCCCCCCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHHH---- T ss_conf 6677787777765444578764671672999999999999859995998046556688998989999999999985---- Q ss_pred CCCCCCCCCCCCCCEEECCHHCCCHHHHHHHHH---HHCCCC Q ss_conf 235764100357968710121042899999899---818932 Q gi|254780546|r 150 LSGMCDTILYDLNLWISRPFLRCRREDIRSFLL---QRNISW 188 (423) Q Consensus 150 l~~~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~---~~~i~w 188 (423) + ....++|.-||++++|+||....+ +.+++| T Consensus 149 --g------~~~~i~i~aPl~~l~K~eiv~l~~~~~~L~~p~ 182 (231) T PRK11106 149 --G------MAKDIRFETPLMWLDKAETWALADYYGQLDLVR 182 (231) T ss_pred --C------CCCCEEEEECCCCCCHHHHHHHHHHCCCCCCCC T ss_conf --1------789819983777788889999998616668650 No 29 >pfam06508 ExsB ExsB. This family includes putative transcriptional regulators from Bacteria and Archaea. Probab=98.95 E-value=3e-09 Score=84.47 Aligned_cols=102 Identities=21% Similarity=0.233 Sum_probs=71.6 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCC---- Q ss_conf 3999966942489999999999986489972999999667798784689999999998718988999975047877---- Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSKPQT---- 96 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~~~~---- 96 (423) |++|.+|||.||+++|+.+++ . ..+++++|+|+|.|. ..|.+..+++|+.+|++|+++..+..+.-. T Consensus 1 Kavvl~SGG~DSt~~l~~a~~----~---~~~v~ait~dYGQ~~--~~Ei~~A~~ia~~l~i~h~vidl~~l~~~~~saL 71 (137) T pfam06508 1 KAVVLLSGGLDSTTCLAWAKK----E---GYEVYALTFDYGQRH--SKELECAKKIAKALGVEHKIVDLDFLKQIGGSAL 71 (137) T ss_pred CEEEEECCCHHHHHHHHHHHH----C---CCEEEEEEECCCCCH--HHHHHHHHHHHHHCCCCCEEECCCHHHHHCCCCC T ss_conf 989991787899999999998----6---996899981478873--9999999999998299750303302444165522 Q ss_pred -----CC-----HH--------HHHHH-HHHHHHHHCCCCCCCCHHHHHHHHHH Q ss_conf -----80-----37--------88854-55554320012344201234566656 Q gi|254780546|r 97 -----GL-----MA--------AAREA-RYALISEHAKTINATLIMTAHTFDDQ 131 (423) Q Consensus 97 -----~~-----~~--------~ar~~-r~~~~~~~~~~~~~~~l~~ah~~dD~ 131 (423) .+ +. ..|-. -.......+...++..+.+|-|.+|. T Consensus 72 ~~~~~~vp~~~~~~~~~~~t~VP~RN~iflsiA~a~A~~~g~~~I~~G~~~~D~ 125 (137) T pfam06508 72 TDDSIEVPDSELESEEIPNTYVPGRNLIFLSIAASYAEAIGANDIFIGVNEEDF 125 (137) T ss_pred CCCCCCCCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHHCCCCEEEEEECCCCC T ss_conf 368855566543456787765636549999999999998699979995655756 No 30 >TIGR00420 trmU tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase; InterPro: IPR004506 tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase (2.1.1.61 from EC) catalyses the addition of 5-methylaminomethyl-2-thiouridylate to tRNAs using S-adenosyl-L-methionine as a substrate and releasing S-adenosyl-L-homocysteine. The enzyme is cytoplasmic and is involved in tRNA processing.; GO: 0004808 tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase activity, 0008033 tRNA processing, 0005737 cytoplasm. Probab=98.91 E-value=6.3e-09 Score=82.15 Aligned_cols=168 Identities=17% Similarity=0.210 Sum_probs=116.4 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEE-----CCC-------------CCCCHHHHHHHHHHHHHHCC Q ss_conf 839999669424899999999999864899729999996-----677-------------98784689999999998718 Q gi|254780546|r 20 AHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISV-----DHC-------------LRETAKDEVRYVSDVCSRLR 81 (423) Q Consensus 20 ~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~v-----dHg-------------lr~~s~~e~~~v~~~~~~lg 81 (423) .+|+|++|||+||++-+++|++- ..+|.-+.. +-. -..-+.++...++++|+.|| T Consensus 1 ~kv~VglSGGVDSSVsA~lL~~q-------g~~v~G~ylk~W~~~~~~d~~GH~~~N~~K~~C~~~~dl~~a~~ic~~LG 73 (394) T TIGR00420 1 KKVIVGLSGGVDSSVSAYLLKQQ-------GYEVVGVYLKNWDEDEKNDILGHKKINENKSGCSSAEDLRDAQAICEKLG 73 (394) T ss_pred CEEEEECCCCHHHHHHHHHHHHC-------CCEEEEEEEEEEECCCCCCCCCHHHHHCCCCCCCHHHHHHHHHHHHHHCC T ss_conf 94899806854789999999714-------96589998776512466664431656246778836887999999998559 Q ss_pred CCEEEEEEE-----------------CCCCCCCCHHHHHHHHHHHHHHHCCCC---CCCCHHHHHHHHHH-HHHHHHH-- Q ss_conf 988999975-----------------047877803788854555543200123---44201234566656-6899998-- Q gi|254780546|r 82 IAHSVVSWK-----------------NSKPQTGLMAAAREARYALISEHAKTI---NATLIMTAHTFDDQ-LETVYMR-- 138 (423) Q Consensus 82 i~~~~~~~~-----------------~~~~~~~~~~~ar~~r~~~~~~~~~~~---~~~~l~~ah~~dD~-~Et~l~r-- 138 (423) ||++++... +..|.+.+... +......+.+.+.+. ++..++|||-+-=+ ...---+ T Consensus 74 i~l~~~nf~~~Y~~~Vf~~~i~~y~~G~TPnPDi~CN-~~iKFG~~~e~~~~~~Gtg~~~~aTGHYAr~~q~~~~~~~~~ 152 (394) T TIGR00420 74 IPLEKVNFQKEYWNKVFEPFIQEYKEGLTPNPDILCN-KLIKFGLLLEYAKQLLGTGNDKIATGHYARIAQEIENKSLFE 152 (394) T ss_pred CCEEEECHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC-CCCCHHHHHHHHHHHCCCCCCEEECCHHHHHHHHHCCCCHHH T ss_conf 8189855078878888899999873677778620047-120037899999985289954344244765443421310144 Q ss_pred HHCCCCC--CC-CCCCCCCCCCCCCCCCEEECCHHCCCHHHHHHHHHHHC--CCCCC-CCCCCCCCC Q ss_conf 6236544--43-32235764100357968710121042899999899818--93202-667864221 Q gi|254780546|r 139 SQRDYAE--KG-MGLSGMCDTILYDLNLWISRPFLRCRREDIRSFLLQRN--ISWCE-DPSNTDDRF 199 (423) Q Consensus 139 l~r~sg~--~g-~~l~~~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~--i~wve-DpSN~d~~f 199 (423) |.++--- +. --|+.+...... .++=||-++.|.++|+.|++.+ ++--| -+--++.+| T Consensus 153 L~~~~D~~KDQSYFL~~l~~~~~~----~~~FPlG~L~K~~vR~iA~~~~PfL~ta~tkKDSqgICF 215 (394) T TIGR00420 153 LLRALDKNKDQSYFLYHLSQEQLA----KLLFPLGELLKPEVRQIAKNAGPFLSTAETKKDSQGICF 215 (394) T ss_pred HHCCCCCCCCCHHHHHHHHHHHHH----HHCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCEE T ss_conf 430467888611737640189888----750530103677899999864863234456886445021 No 31 >TIGR00364 TIGR00364 exsB protein; InterPro: IPR004479 This protein family is represented by a single member in nearly every completed large (> 1000 genes) prokaryotic genome. In Sinorhizobium meliloti, a species in which the exo genes make succinoglycan, a symbiotically important exopolysaccharide, exsB is located nearby and affects succinoglycan levels, probably through polar effects on exsA expression or the same polycistronic mRNA , . In Arthrobacter viscosus, the homologous gene is designated alu1 and is associated with an aluminum tolerance phenotype. When expressed in Escherichia coli, it conferred aliminium tolerance . The entry also contains the gene queC, which is responsible for the conversion of GTP to 7-cyano-7-deazaguanine (preQ0). The biosynthesis of hypermodified tRNA nucleoside queuosine only occurs in eubacteria. It occupies the wobble position for all known tRNAs that are specific for Asp, Asn, His or Tyr .. Probab=98.84 E-value=2.6e-08 Score=77.75 Aligned_cols=151 Identities=19% Similarity=0.218 Sum_probs=103.2 Q ss_pred EEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEE-EEEEECCC-------- Q ss_conf 999669424899999999999864899729999996677987846899999999987189889-99975047-------- Q gi|254780546|r 23 LVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHS-VVSWKNSK-------- 93 (423) Q Consensus 23 ~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~-~~~~~~~~-------- 93 (423) +|-+|||.||++++..+++- +.+++|+++|.|+|-|.. ..|.+..+++|+.|||+++ ++..+..+ T Consensus 2 v~~lSGG~DStT~~~~a~~~-----~GkyeV~a~TF~YGQR~H-~~Ele~A~~ia~~Lgi~~~~~~Dl~~l~~l~~YsaL 75 (227) T TIGR00364 2 VVVLSGGQDSTTVLLIALDE-----GGKYEVHAITFDYGQRAH-SRELESARKIAEALGIRHHFVIDLSLLKQLGKYSAL 75 (227) T ss_pred EEEEECCHHHHHHHHHHHHC-----CCCEEEEEECCCCHHHHH-HHHHHHHHHHHHHHCCCCEEEECHHHHHHHHCCCCC T ss_conf 68743734689999999961-----795079985465013789-999999999999808970786177999855404523 Q ss_pred --------CCCCCHH------HHHH--------HHHHHHHHHCCCCCCCCHHHHHHHHH------HHHHHHHHHHCCCCC Q ss_conf --------8778037------8885--------45555432001234420123456665------668999986236544 Q gi|254780546|r 94 --------PQTGLMA------AARE--------ARYALISEHAKTINATLIMTAHTFDD------QLETVYMRSQRDYAE 145 (423) Q Consensus 94 --------~~~~~~~------~ar~--------~r~~~~~~~~~~~~~~~l~~ah~~dD------~~Et~l~rl~r~sg~ 145 (423) ...+.+. ..+. ....+....+...|+..+.+|-+..| .-+.|+--+..- T Consensus 76 t~~~~~~~~~~~e~d~~~~~E~~p~tfVPgRN~~fl~~a~~yA~~~gA~~v~~G~~~~DfSgYPDCr~EFvka~n~~--- 152 (227) T TIGR00364 76 TDEQEIPEQKSNEEDKEKALETLPNTFVPGRNLIFLSLAASYAEALGAEAVITGVCETDFSGYPDCRDEFVKALNHA--- 152 (227) T ss_pred CCCCCCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCHHHHHHHHHH--- T ss_conf 57875676666532023565311333378047999999999888618425220010003657988868899999999--- Q ss_pred CCCCCCCCCCCCCCCCCCEEECCHHCCCHHHHHHHHHHHC Q ss_conf 4332235764100357968710121042899999899818 Q gi|254780546|r 146 KGMGLSGMCDTILYDLNLWISRPFLRCRREDIRSFLLQRN 185 (423) Q Consensus 146 ~g~~l~~~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~ 185 (423) +.|+.|... .....+++.=||..++|+||-+.+.+.| T Consensus 153 --l~Lgn~~~~-v~i~Piql~aPL~~ltKaeiv~la~elG 189 (227) T TIGR00364 153 --LNLGNMLTP-VEIRPIQLEAPLMDLTKAEIVKLADELG 189 (227) T ss_pred --HHHHCCCCC-CEEEEEEEECCEEECCHHHHHHHHHHHC T ss_conf --997412688-1120246666701269899999988735 No 32 >PRK08349 hypothetical protein; Validated Probab=98.78 E-value=1.6e-07 Score=72.26 Aligned_cols=142 Identities=15% Similarity=0.147 Sum_probs=94.9 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHC-CC---CEEEEEEECC---- Q ss_conf 399996694248999999999998648997299999966779878468999999999871-89---8899997504---- Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRL-RI---AHSVVSWKNS---- 92 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~l-gi---~~~~~~~~~~---- 92 (423) |+++-+|||.||.+-.+++.. + +.++..+|+|.+ +.+.+.+..+.+...++ |- .+++...... T Consensus 2 Kvl~LlSGGiDSPVAa~~mmK----R---G~~V~~lhf~~~--~~~~~kv~~~~~~L~~~~~~~~~~~~iv~~~~~~~~i 72 (198) T PRK08349 2 KVVALLSSGIDSPVAIYLMLS----R---GVEIYPLHFRQD--EKKEHKARELVEILQEIHGGKVKDPVIVDAYEVQGPV 72 (198) T ss_pred EEEEEECCCCCHHHHHHHHHH----C---CCEEEEEEECCC--HHHHHHHHHHHHHHHHHCCCCCCEEEEECCHHHHHHH T ss_conf 499996588438999999997----7---997999986387--7889999999999999708887528997722532899 Q ss_pred -------CCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEE Q ss_conf -------7877803788854555543200123442012345666566899998623654443322357641003579687 Q gi|254780546|r 93 -------KPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWI 165 (423) Q Consensus 93 -------~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i 165 (423) .+.....--.|+.-|+.....+++.++..++||..++..+-..+-++.--+. .-+..+ T Consensus 73 ~~~i~~~~~~~~~~vl~rr~M~riA~~iA~~~g~~aivTGEsLGQVASQTl~NL~~i~~---------------~~~~pV 137 (198) T PRK08349 73 FEKLREIGKEKWTCLFCKYTMYRVAERYAHEIGAKAIVTGDSLGQVASQTLDNLMVIST---------------ATDLPI 137 (198) T ss_pred HHHHHHCCCCCCEEHHHHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHHHHHHHHHHH---------------HHCCCC T ss_conf 99998607765130999999999999999985998898452167888999988999987---------------506766 Q ss_pred ECCHHCCCHHHHHHHHHHHCC Q ss_conf 101210428999998998189 Q gi|254780546|r 166 SRPFLRCRREDIRSFLLQRNI 186 (423) Q Consensus 166 ~RPLL~~~r~~l~~~~~~~~i 186 (423) +|||+++.|+||.+++++.|. T Consensus 138 lRPLig~DK~EII~~Ar~IGT 158 (198) T PRK08349 138 LRPLIGLDKEEIVRIAKEIGT 158 (198) T ss_pred CCCCCCCCHHHHHHHHHHHCC T ss_conf 477667998999999998195 No 33 >COG0175 CysH 3'-phosphoadenosine 5'-phosphosulfate sulfotransferase (PAPS reductase)/FAD synthetase and related enzymes [Amino acid transport and metabolism / Coenzyme metabolism] Probab=98.75 E-value=1.6e-07 Score=72.30 Aligned_cols=157 Identities=17% Similarity=0.244 Sum_probs=89.3 Q ss_pred HHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCE Q ss_conf 89999999985279983999966942489999999999986489972999999667798784689999999998718988 Q gi|254780546|r 5 PIESVRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAH 84 (423) Q Consensus 5 p~~~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~ 84 (423) |...++-....+..+ +++++|||+||+++||++..... .+.++|||.|..- .+-.+++.++.+++|++. T Consensus 27 ~~~i~~~~~~~~~~~--~~~~~S~Gkds~V~l~L~~k~~~-------~~~vif~DTg~~f--~Et~~~~d~~~~~~~~~l 95 (261) T COG0175 27 PIEILRWAAEEFSNP--VVVSFSGGKDSTVLLHLAAKAFP-------DFPVIFLDTGYHF--PETYEFRDRLAEEYGLDL 95 (261) T ss_pred HHHHHHHHHHHCCCC--EEEEECCCHHHHHHHHHHHHHCC-------CCCEEEEECCCCC--HHHHHHHHHHHHHCCCEE T ss_conf 899999999874897--59981274679999999986025-------8718997077757--889999999998729717 Q ss_pred EEEEEECCCCCC-----CCHH--HHH-HH---HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 999975047877-----8037--888-54---555543200123442012345666566899998623654443322357 Q gi|254780546|r 85 SVVSWKNSKPQT-----GLMA--AAR-EA---RYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGM 153 (423) Q Consensus 85 ~~~~~~~~~~~~-----~~~~--~ar-~~---r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~ 153 (423) .+.+-+...... +.-. ..| ++ .-.-+.......+...+++|.=.|+.. .-+.+ T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~r~c~~i~K~~pl~~al~~~~~~a~~~G~Rrdes~----------------~Rak~ 159 (261) T COG0175 96 KVYRPDDEVAEGEKYGGKLWEPSVERWCCDIRKVEPLKRALDEYGFDAWFTGLRRDESP----------------TRAKL 159 (261) T ss_pred EEECCCHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCC----------------CCCCC T ss_conf 98167323443442136679997402332568551399999635862499861204563----------------20248 Q ss_pred CCCCCCCC---CCEEECCHHCCCHHHHHHHHHHHCCCCC Q ss_conf 64100357---9687101210428999998998189320 Q gi|254780546|r 154 CDTILYDL---NLWISRPFLRCRREDIRSFLLQRNISWC 189 (423) Q Consensus 154 ~~~~~~~~---~~~i~RPLL~~~r~~l~~~~~~~~i~wv 189 (423) +...... +..-+=||++++..++-.|...+++++. T Consensus 160 -~~~~~~~~~~~~~rv~Pl~~Wt~~dVw~Yi~~~~lp~n 197 (261) T COG0175 160 -PVVSFDSEFGESIRVNPLADWTELDVWLYILANNLPYN 197 (261) T ss_pred -CEECCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCCCC T ss_conf -64534678768368715320999999999999589988 No 34 >PRK13820 argininosuccinate synthase; Provisional Probab=98.73 E-value=3.1e-07 Score=70.20 Aligned_cols=143 Identities=22% Similarity=0.158 Sum_probs=93.1 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCE-EEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCC--- Q ss_conf 9983999966942489999999999986489972-999999667798784689999999998718988999975047--- Q gi|254780546|r 18 YPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKI-KFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSK--- 93 (423) Q Consensus 18 ~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~-~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~--- 93 (423) .+.||++|+|||.|+++.+..|++- .++ +|+++++|-|- + ++|.+.+++.+..+|++++++...... T Consensus 2 ~~kKVvLAySGGLDTSv~l~wL~e~------yg~~eVi~~~~d~Gq-~--~e~~~~~~~kA~~~G~~~~viD~r~ef~~~ 72 (395) T PRK13820 2 MKKKVVLAYSGGLDTSVCIPLLKEK------YGYDEVITVAVDVGQ-P--EEEIKEAEEKAKKLGVKHYTIDAKEEFAKD 72 (395) T ss_pred CCCEEEEEECCCCHHHHHHHHHHHH------CCCCEEEEEEEECCC-C--HHHHHHHHHHHHHCCCCEEEEECHHHHHHH T ss_conf 9872999968984899999999985------199869999998999-7--677699999999709949998169999999 Q ss_pred -------------CCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHH--HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf -------------8778037888545555432001234420123456--6656689999862365444332235764100 Q gi|254780546|r 94 -------------PQTGLMAAAREARYALISEHAKTINATLIMTAHT--FDDQLETVYMRSQRDYAEKGMGLSGMCDTIL 158 (423) Q Consensus 94 -------------~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~--~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~ 158 (423) .=....+.+|-.-...+.+.+++.++..++-|-+ -+||+--.+ - ++. T Consensus 73 ~i~paI~ana~YegYpL~tslaRplia~~~v~~A~~~ga~aiaHG~TGkGNDQvRFe~-~-~~a---------------- 134 (395) T PRK13820 73 YIFPAIKANALYEGYPLGTALARPLIAKKIVEVAKKEGASAIAHGCTGKGNDQLRFEA-V-FRA---------------- 134 (395) T ss_pred HHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHH-H-HHC---------------- T ss_conf 9999997087326730324104689999999999975982983077789870999999-9-867---------------- Q ss_pred CCCCCEEECCH--HCCCHHHHHHHHHHHCCCCC Q ss_conf 35796871012--10428999998998189320 Q gi|254780546|r 159 YDLNLWISRPF--LRCRREDIRSFLLQRNISWC 189 (423) Q Consensus 159 ~~~~~~i~RPL--L~~~r~~l~~~~~~~~i~wv 189 (423) -++.++-|. ..++|++..+||+++||+.- T Consensus 135 --p~~~iiaP~R~~~~~R~~~i~ya~~~gI~v~ 165 (395) T PRK13820 135 --TDLDVIAPIRELNLTREWEIEYAKEHGIPVP 165 (395) T ss_pred --CCCEEEEEECCHHHHHHHHHHHHHHCCCCCC T ss_conf --8986872104533458999999998599888 No 35 >PRK00509 argininosuccinate synthase; Provisional Probab=98.72 E-value=4.9e-07 Score=68.76 Aligned_cols=141 Identities=18% Similarity=0.145 Sum_probs=94.9 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCC-CEEEEEEECCC---- Q ss_conf 9839999669424899999999999864899729999996677987846899999999987189-88999975047---- Q gi|254780546|r 19 PAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRI-AHSVVSWKNSK---- 93 (423) Q Consensus 19 ~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi-~~~~~~~~~~~---- 93 (423) ..||++|+|||.|+++.+..|++- .+++|+++++|-|- .+|.+.+++.+.++|. +++++...... T Consensus 2 ~kKVvLAySGGLDTSv~l~wL~e~------yg~eVia~~~d~Gq----~~d~~~i~~kA~~~GA~~~~v~D~r~ef~~~~ 71 (398) T PRK00509 2 VKKVVLAYSGGLDTSVIIKWLKEE------YGCEVVAFTADVGQ----GEELEPIREKALASGASEIYVEDLREEFVRDY 71 (398) T ss_pred CCEEEEEECCCCHHHHHHHHHHHH------CCCEEEEEEEECCC----HHHHHHHHHHHHHHCCCEEEEECCHHHHHHHH T ss_conf 854999908984599999999875------39889999997988----77889999999982997899963099999999 Q ss_pred -------------CCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHH----HHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf -------------87780378885455554320012344201234566----6566899998623654443322357641 Q gi|254780546|r 94 -------------PQTGLMAAAREARYALISEHAKTINATLIMTAHTF----DDQLETVYMRSQRDYAEKGMGLSGMCDT 156 (423) Q Consensus 94 -------------~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~----dD~~Et~l~rl~r~sg~~g~~l~~~~~~ 156 (423) .-.-.-+.+|-.-...+.+.+++.++..+ +|-. +||+--.+. + . .+. T Consensus 72 i~p~I~ana~Yeg~YpL~tslaRplia~~~ve~A~~~ga~ai--aHG~TGkGNDQvRFe~~--~-----~-----aL~-- 135 (398) T PRK00509 72 VFPAIRANALYEGKYPLGTALARPLIAKKLVEIARKEGADAV--AHGCTGKGNDQVRFELA--I-----A-----ALA-- 135 (398) T ss_pred HHHHHHCCCCCCCCEEEECCCHHHHHHHHHHHHHHHCCCEEE--EECCCCCCCCHHHHHHH--H-----H-----HHC-- T ss_conf 999996085455854552203188999999999997198599--85455678608999999--9-----9-----859-- Q ss_pred CCCCCCCEEECCHHC---CCHHHHHHHHHHHCCCCC Q ss_conf 003579687101210---428999998998189320 Q gi|254780546|r 157 ILYDLNLWISRPFLR---CRREDIRSFLLQRNISWC 189 (423) Q Consensus 157 ~~~~~~~~i~RPLL~---~~r~~l~~~~~~~~i~wv 189 (423) .++.++-|.-. .+|++..+||+++||+.- T Consensus 136 ----P~l~iiaP~Rd~~~~sR~~~i~ya~~~gIpv~ 167 (398) T PRK00509 136 ----PDLKVIAPWREWDLMSREELIAYAEEHGIPIP 167 (398) T ss_pred ----CCCEEEEEEECCCCCCHHHHHHHHHHCCCCCC T ss_conf ----99768611321341689999999998699888 No 36 >cd01986 Alpha_ANH_like Adenine nucleotide alpha hydrolases superfamily including N type ATP PPases and ATP sulphurylases. The domain forms a apha/beta/apha fold which binds to Adenosine group.. Probab=98.69 E-value=5.8e-08 Score=75.36 Aligned_cols=83 Identities=28% Similarity=0.277 Sum_probs=64.3 Q ss_pred EEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHH Q ss_conf 99996694248999999999998648997299999966779878468999999999871898899997504787780378 Q gi|254780546|r 22 ILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSKPQTGLMAA 101 (423) Q Consensus 22 i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~~~~~~~~~ 101 (423) ++|++|||+||+..+++++++ .+++.++++|||..+......+.+++ T Consensus 1 ~~v~~sgG~ds~~~~~~l~~~-------~~~~~~~~~~~~~~~~~~~~~~~a~~-------------------------- 47 (103) T cd01986 1 VLVAFSGGKDSSVAAALLKKL-------GYQVIAVTVDHGISPRLEDAKEIAKE-------------------------- 47 (103) T ss_pred CEEEECCCCHHHHHHHHHHHC-------CCCEEEEEEECCCCCHHHHHHHHHHH-------------------------- T ss_conf 989742882219999999986-------98628999853766278899999999-------------------------- Q ss_pred HHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 885455554320012344201234566656689999862 Q gi|254780546|r 102 AREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQ 140 (423) Q Consensus 102 ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~ 140 (423) .+...+...+.+.++..+++||..+|...+...... T Consensus 48 ---~~~~~~~~~a~~~~~~~i~~G~~~~~~~~~~~~~~~ 83 (103) T cd01986 48 ---AREEAAKRIAKEKGAETIATGTRRDDVANRALGLTA 83 (103) T ss_pred ---HHHHHHHHHHHHCCCHHHCCCCCHHHHHHHHHHHHH T ss_conf ---999999999886172143047531677887761898 No 37 >PRK05370 argininosuccinate synthase; Validated Probab=98.69 E-value=4.7e-07 Score=68.92 Aligned_cols=152 Identities=14% Similarity=0.178 Sum_probs=93.3 Q ss_pred HHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCE-EEEEE Q ss_conf 99985279983999966942489999999999986489972999999667798784689999999998718988-99997 Q gi|254780546|r 11 FFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAH-SVVSW 89 (423) Q Consensus 11 ~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~-~~~~~ 89 (423) +-+++++.+.||++|+|||.|+++.+..|++ . +.+|+++++|-|--. ..+.+.+++.+.++|..- +++.. T Consensus 3 ~~~~~~~~gkKVvLAYSGGLDTSv~l~wL~e----~---g~eVia~taDvGQ~d--e~d~~~i~~kA~~~GA~k~~v~D~ 73 (447) T PRK05370 3 TILKHLPVGQRVGIAFSGGLDTSAALLWMRQ----K---GAVPYAYTANLGQPD--EDDYDAIPRRAMEYGAENARLIDC 73 (447) T ss_pred HHHHCCCCCCEEEEEECCCCHHHHHHHHHHH----C---CCEEEEEEEECCCCC--CHHHHHHHHHHHHHCCCEEEEECC T ss_conf 3530587888899995898279999999987----5---987999999799986--133899999999829978999717 Q ss_pred ECCCCC--------------------CCCHHHHHHHHHHHHHHHCCCCCCCCHHHHH--HHHHHHHHHHHHHHCCCCCCC Q ss_conf 504787--------------------7803788854555543200123442012345--666566899998623654443 Q gi|254780546|r 90 KNSKPQ--------------------TGLMAAAREARYALISEHAKTINATLIMTAH--TFDDQLETVYMRSQRDYAEKG 147 (423) Q Consensus 90 ~~~~~~--------------------~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah--~~dD~~Et~l~rl~r~sg~~g 147 (423) ....-. .+.-..+|-.-...+.+.+++.++..++-|- --+||+- +.+. + T Consensus 74 reefv~~~i~ai~a~A~y~~~~g~~Y~l~t~laRplia~~lv~~a~~~ga~~iaHGaTGKGNDQVR-----Fe~~----~ 144 (447) T PRK05370 74 RAQLVAEGIAAIQCGAFHISTGGVTYFNTTPLGRAVTGTMLVAAMKEDGVNIWGDGSTYKGNDIER-----FYRY----G 144 (447) T ss_pred HHHHHHHHHHHHHHCCHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCHHH-----HHHH----H T ss_conf 999999999999736233302576445877307889999999999983998995577678743889-----9999----9 Q ss_pred CCCCCCCCCCCCCCCCEEECCHHCC-------CHHHHHHHHHHHCCCCC Q ss_conf 3223576410035796871012104-------28999998998189320 Q gi|254780546|r 148 MGLSGMCDTILYDLNLWISRPFLRC-------RREDIRSFLLQRNISWC 189 (423) Q Consensus 148 ~~l~~~~~~~~~~~~~~i~RPLL~~-------~r~~l~~~~~~~~i~wv 189 (423) .....++.|+-|.... +|+++.+||+++||++- T Consensus 145 ---------~~l~P~lkIiaPwRd~~f~~el~sR~e~i~Ya~~~gIpv~ 184 (447) T PRK05370 145 ---------LLTNPELKIYKPWLDQDFIDELGGRAEMSEFMIACGFDYK 184 (447) T ss_pred ---------HHHCCCCEEECCCCCHHHHHHCCCHHHHHHHHHHCCCCCC T ss_conf ---------8529987697143346666541574999999998499876 No 38 >PRK05253 sulfate adenylyltransferase subunit 2; Provisional Probab=98.68 E-value=4.8e-07 Score=68.87 Aligned_cols=169 Identities=12% Similarity=0.061 Sum_probs=105.5 Q ss_pred CCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECC-CC Q ss_conf 27998399996694248999999999998648997299999966779878468999999999871898899997504-78 Q gi|254780546|r 16 LVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNS-KP 94 (423) Q Consensus 16 l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~-~~ 94 (423) ...-++.++-+|||+||++||||+...-.- ..++ +-.+|||+|.-- .+=.+|..++++++|+.+.+-..+.. +. T Consensus 23 ~a~f~~pvmL~S~GKDS~VmLhLarKAF~p-~~iP--fPllHIDTg~kF--~Emi~fRD~~a~e~gl~LiV~~n~e~i~~ 97 (300) T PRK05253 23 AAEFENPVMLYSIGKDSSVMLHLARKAFYP-GKLP--FPLLHVDTGWKF--PEMIEFRDRRAKELGLELIVHSNPEGIAR 97 (300) T ss_pred HHHHCCCEEEEECCCHHHHHHHHHHHHHCC-CCCC--CCEEEEECCCCH--HHHHHHHHHHHHHHCCCEEEEECHHHHHC T ss_conf 997369679871672599999999998578-8999--634883079981--89999999999984997899757677756 Q ss_pred CCCC-----HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH--CCCCCCC--C----CCCCCCCCCC-CC Q ss_conf 7780-----378885455554320012344201234566656689999862--3654443--3----2235764100-35 Q gi|254780546|r 95 QTGL-----MAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQ--RDYAEKG--M----GLSGMCDTIL-YD 160 (423) Q Consensus 95 ~~~~-----~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~--r~sg~~g--~----~l~~~~~~~~-~~ 160 (423) +.+. ..-.....-..|.....+.+.+.++.|--.|.+.-----|.+ |.....+ . .++.+.+... .+ T Consensus 98 g~~p~~~g~~~~~~~lKT~~L~~al~~~~fda~~gGaRRDEe~sRAKERiFS~R~~~~~Wdpk~QrPElW~lyn~~~~~g 177 (300) T PRK05253 98 GINPFTHGSAKHTNIMKTEGLKQALEKYGFDAAFGGARRDEEKSRAKERIFSFRDEFGQWDPKNQRPELWNLYNGRINKG 177 (300) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCC T ss_conf 99956688222315787899999999648765651012344321212542477564688885669802667635567999 Q ss_pred CCCEEECCHHCCCHHHHHHHHHHHCCCCCC Q ss_conf 796871012104289999989981893202 Q gi|254780546|r 161 LNLWISRPFLRCRREDIRSFLLQRNISWCE 190 (423) Q Consensus 161 ~~~~i~RPLL~~~r~~l~~~~~~~~i~wve 190 (423) ..+++ =||++++.-+|=+|....+|+.+. T Consensus 178 eh~RV-fPlsnWTElDIW~YI~~E~Ipi~~ 206 (300) T PRK05253 178 EHIRV-FPLSNWTELDIWQYIYRENIPIVP 206 (300) T ss_pred CEEEE-ECCCCCHHHHHHHHHHHHCCCCCC T ss_conf 64775-112022577899999984888887 No 39 >PRK08576 hypothetical protein; Provisional Probab=98.66 E-value=2.2e-07 Score=71.30 Aligned_cols=170 Identities=15% Similarity=0.132 Sum_probs=95.7 Q ss_pred HHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 99999985279983999966942489999999999986489972999999667798784689999999998718988999 Q gi|254780546|r 8 SVRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVV 87 (423) Q Consensus 8 ~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~ 87 (423) +-.+|+.++... .|+|.+|||+||+|-|.++.+.. -++.|+.||.|+ |-.+-.++|+.+++++||+.... T Consensus 225 ~s~~fL~~~~~~-~iiVp~SGGKDStA~LlLA~ea~-------~dv~aVfvDTGl--EfPeT~eyVe~va~klGv~lv~a 294 (439) T PRK08576 225 ASIEFLEKFGEY-TIIVPWSGGKDSTAALLLALKAF-------DEVTAVYVDTGY--EMPLTDEYVEKVAEKLGVDLIRA 294 (439) T ss_pred HHHHHHHHCCCC-CEEEECCCCHHHHHHHHHHHHHH-------CCEEEEEEECCC--CCHHHHHHHHHHHHHHCCEEEEE T ss_conf 999999962898-88995688779999999999861-------883699970798--65758999999999859769994 Q ss_pred EEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC-CCC-CCCCC---CCC Q ss_conf 9750478778037888545555432001234420123456665668999986236544433223-576-41003---579 Q gi|254780546|r 88 SWKNSKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLS-GMC-DTILY---DLN 162 (423) Q Consensus 88 ~~~~~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~-~~~-~~~~~---~~~ 162 (423) ..+...+-....-.++..| ||......-+- -...+..+..+ .-|.- ++.+.. .+. +.... .++ T Consensus 295 ~~d~~~~i~~~G~Pt~d~R------WCtk~KL~PL~--~~i~~~~~~l~---~vG~R-~~ES~~R~~~~~v~~rn~~vg~ 362 (439) T PRK08576 295 GVDVPMPIEGYGLPTHENR------WCTKMKVEALY--EAIRELSNPLL---VVGDR-DGESARRRLRPPVRERETDFGK 362 (439) T ss_pred CCCCCCHHCCCCCCCCCCC------CCCCCCCHHHH--HHHHHHCCCEE---EECCC-HHHHHHHHCCCCEEECCCCCCC T ss_conf 7886422104799976663------31020025589--99996489769---95272-5654776118963550477777 Q ss_pred CEEECCHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 68710121042899999899818932026678642210189997 Q gi|254780546|r 163 LWISRPFLRCRREDIRSFLLQRNISWCEDPSNTDDRFERVRVRR 206 (423) Q Consensus 163 ~~i~RPLL~~~r~~l~~~~~~~~i~wveDpSN~d~~f~R~rlR~ 206 (423) ..-+.|+..++..++-.|.-.+|+++ ||=|++-+-|- T Consensus 363 ~~~~~PIk~Wsa~~VwLYI~~~gip~-------NPLY~kGF~RI 399 (439) T PRK08576 363 ILVVMPLKFWSGAMVQLYILMNGIEL-------NPLYYEGFYRL 399 (439) T ss_pred CEEEECHHHCCHHHHHHHHHHCCCCC-------CHHHHCCCCCC T ss_conf 26672343076999999999759999-------84787477324 No 40 >pfam02540 NAD_synthase NAD synthase. NAD synthase (EC:6.3.5.1) is involved in the de novo synthesis of NAD and is induced by stress factors such as heat shock and glucose limitation. Probab=98.62 E-value=1.5e-06 Score=65.34 Aligned_cols=150 Identities=18% Similarity=0.200 Sum_probs=88.4 Q ss_pred HHHHHHHHHC---CCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 9999999852---7998399996694248999999999998648997299999966779878468999999999871898 Q gi|254780546|r 7 ESVRFFVRSL---VYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIA 83 (423) Q Consensus 7 ~~~~~~~~~l---~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~ 83 (423) +.+-.|++.- .+-..++|++|||.||.+.+.++++... +.++.++..-.. ..|.+..+..+.+|+.+||+ T Consensus 3 ~~i~~~l~dyv~k~g~~gvViGlSGGIDSav~a~La~~AlG-----~~~v~~v~mP~~--~ss~~s~~~A~~la~~lgi~ 75 (243) T pfam02540 3 ERLVDFLRDYVQKSGAKGVVLGLSGGIDSAVVAYLAVKALG-----KENVLALIMPSI--NSSEEDVQDALALAENLGIN 75 (243) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHCC-----CCEEEEEECCCC--CCCHHHHHHHHHHHHHHCCC T ss_conf 79999999999981999199978887999999999999559-----742899964777--89888999999999996892 Q ss_pred EEEEEEECCC--------------CCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 8999975047--------------87780378885455554320012344201234566656689999862365444332 Q gi|254780546|r 84 HSVVSWKNSK--------------PQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMG 149 (423) Q Consensus 84 ~~~~~~~~~~--------------~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~ 149 (423) +.++..+..- ..+|+++ +.|-..++..+...+..++-|+. .... T Consensus 76 ~~~i~I~~~~~~~~~~~~~~~~~~a~~Ni~a---R~Rm~~ly~~A~~~~~lVlgT~N-------------------ksE~ 133 (243) T pfam02540 76 YKTIDIKPIVRAFSQLFQPAKDDLAKGNLKA---RIRMIILYAHANKFNRLVLGTGN-------------------KSEL 133 (243) T ss_pred EEEEECHHHHHHHHHHHCCCHHHHHHHHHHH---HHHHHHHHHHHCCCCEEEEECCC-------------------CCCE T ss_conf 6999888999999988532013778745578---88899999862337918981687-------------------0420 Q ss_pred CCCCCCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCC Q ss_conf 23576410035796871012104289999989981893 Q gi|254780546|r 150 LSGMCDTILYDLNLWISRPFLRCRREDIRSFLLQRNIS 187 (423) Q Consensus 150 l~~~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~ 187 (423) ..|.... .+.+.-=+-|+-++.|.|+++.++..|+| T Consensus 134 ~~Gy~Tk--yGD~~~di~Pi~dL~KteV~~la~~l~vP 169 (243) T pfam02540 134 ALGYFTK--YGDGACDIAPIGDLYKTQVYELAKRLNVP 169 (243) T ss_pred EEEEEEE--CCCCCCCHHHHCCCCHHHHHHHHHHHCCC T ss_conf 0543231--27776551541688189999999994988 No 41 >TIGR03573 WbuX N-acetyl sugar amidotransferase. This enzyme has been implicated in the formation of the acetamido moiety (sugar-NC(=NH)CH3) which is found on some exopolysaccharides and is positively charged at neutral pH. The reaction involves ligation of ammonia with a sugar N-acetyl group, displacing water. In E. coli (O145 strain) and Pseudomonas aeruginosa (O12 strain) this gene is known as wbuX and ifnA respectively and likely acts on sialic acid. In Campylobacter jejuni, the gene is known as pseA and acts on pseudaminic acid in the process of flagellin glycosylation. In other Pseudomonas strains and various organisms it is unclear what the identity of the sugar substrate is, and in fact, the phylogenetic tree of this family sports a considerably deep branching suggestive of possible major differences in substrate structure. Nevertheless, the family is characterized by a conserved tetracysteine motif (CxxC.....[GN]xCxxC) possibly indicative of a metal binding site, as well as an Probab=98.58 E-value=7.3e-07 Score=67.56 Aligned_cols=165 Identities=15% Similarity=0.143 Sum_probs=103.4 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECC------- Q ss_conf 8399996694248999999999998648997299999966779878468999999999871898899997504------- Q gi|254780546|r 20 AHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNS------- 92 (423) Q Consensus 20 ~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~------- 92 (423) =..+|+||||+||+-.||+++.- .+++..+||+|+|+.. +.=.+..+++|+.+|++|...+.+.. T Consensus 60 YDCiv~vSGGkDS~y~~~~l~~~------~gl~pL~vt~d~~~~t--~~g~~Ni~~l~~~lgvD~i~~~~n~~~~k~l~k 131 (343) T TIGR03573 60 YDCIIGVSGGKDSTYQAHVLKKK------LGLNPLLVTVDPGWNT--ELGVKNLNNLIKKLGFDLHTITINPETFRKLQR 131 (343) T ss_pred CCEEEECCCCHHHHHHHHHHHHH------HCCCEEEEEECCCCCC--HHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHH T ss_conf 86898688772899999999998------2992599983598779--899999999998369985874699999999999 Q ss_pred ---CCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH-------HHHHHHHHHCCCCCCCCCCCCC--------- Q ss_conf ---787780378885455554320012344201234566656-------6899998623654443322357--------- Q gi|254780546|r 93 ---KPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQ-------LETVYMRSQRDYAEKGMGLSGM--------- 153 (423) Q Consensus 93 ---~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~-------~Et~l~rl~r~sg~~g~~l~~~--------- 153 (423) ..-++..-..-.+-+......|.+.++..++.|++.... ............+.++..+..+ T Consensus 132 ~~~~~~gd~~~~~~~~i~~~~~~iA~k~~IplIi~Gen~~~e~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (343) T TIGR03573 132 AYFKKVGDPEWPQDHAIFASVYQVALKFNIPLIIWGENIAEEYGGDSEEELNPDEWKYNKRGIDAANIKDFSDKGLSERD 211 (343) T ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCCHHHHHHCCCCHHH T ss_conf 99986689469999999999999999819998997567010057755446672467888615552059999978998343 Q ss_pred -CCCC------CCCCCCEEECCH--HCCCHHHHHHHHHHHCCCCCCCCC Q ss_conf -6410------035796871012--104289999989981893202667 Q gi|254780546|r 154 -CDTI------LYDLNLWISRPF--LRCRREDIRSFLLQRNISWCEDPS 193 (423) Q Consensus 154 -~~~~------~~~~~~~i~RPL--L~~~r~~l~~~~~~~~i~wveDpS 193 (423) .+.. ....+++.+.|+ ..+..++...++.+.. .|..+|. T Consensus 212 L~~y~~p~~~~~~~~~i~~i~l~~y~~Wd~~~~~~~~~~~~-gw~~~~~ 259 (343) T TIGR03573 212 LYAYTYPDKEKLQSKGVKVIYLGYYVKWDKKKNYEFIKKRG-GWREGPH 259 (343) T ss_pred CHHCCCCCHHHHHHCCCEEEECCCHHCCCHHHHHHHHHHHC-CCCCCCC T ss_conf 41046989899964495399644000348999999999963-9511888 No 42 >PRK04527 argininosuccinate synthase; Provisional Probab=98.56 E-value=1e-06 Score=66.60 Aligned_cols=147 Identities=18% Similarity=0.175 Sum_probs=90.3 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCC-CEEEEEEECC----- Q ss_conf 9839999669424899999999999864899729999996677987846899999999987189-8899997504----- Q gi|254780546|r 19 PAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRI-AHSVVSWKNS----- 92 (423) Q Consensus 19 ~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi-~~~~~~~~~~----- 92 (423) ..||++|+|||.|+++.+..|++ . +++++++++|-|--. ++|.+.+++.+..+|. +++++..... T Consensus 3 ~kkVvLAySGGLDTSv~l~wL~e----~---g~~Vi~~~ad~G~~~--~e~~~~i~~kA~~~GA~~~~v~D~~~ef~~~~ 73 (397) T PRK04527 3 TKDIVLAFSGGLDTSFCIPYLQE----R---GYAVHTVFADTGGVD--AEERDFIEKRAAELGAASHVTVDGGPAIWEGF 73 (397) T ss_pred CCEEEEEECCCCHHHHHHHHHHH----C---CCCEEEEEEECCCCC--CHHHHHHHHHHHHHCCCEEEEECCHHHHHHHH T ss_conf 87599990897289999999987----5---994799999789986--15157899999981997799972899999999 Q ss_pred -----C------CCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHH--HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC Q ss_conf -----7------8778037888545555432001234420123456--66566899998623654443322357641003 Q gi|254780546|r 93 -----K------PQTGLMAAAREARYALISEHAKTINATLIMTAHT--FDDQLETVYMRSQRDYAEKGMGLSGMCDTILY 159 (423) Q Consensus 93 -----~------~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~--~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~ 159 (423) + ..--+...+|-.-...+.+.+++.++..++-|-+ -+||+--. +.... + T Consensus 74 v~~~i~ana~Yeg~YpL~tsaRplIak~~ve~A~~~ga~~iaHG~TGkGNDQvRFe-~~~~a------l----------- 135 (397) T PRK04527 74 VKPFVWAGEGYQGQYPLLVSDRYLIVDAALKRAEELGTRIIAHGCTGMGNDQVRFD-LAVKA------L----------- 135 (397) T ss_pred HHHHHHHCHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCHHHHH-HHHHH------C----------- T ss_conf 99998606454185556321079999999999997298384047666787025677-68886------3----------- Q ss_pred CCCCEEECCHH------CCCHHHHHHHHHHHCCCCCCCCC Q ss_conf 57968710121------04289999989981893202667 Q gi|254780546|r 160 DLNLWISRPFL------RCRREDIRSFLLQRNISWCEDPS 193 (423) Q Consensus 160 ~~~~~i~RPLL------~~~r~~l~~~~~~~~i~wveDpS 193 (423) .++.++-|.- ..+|++..+||+++||+.-.++. T Consensus 136 -~~~~viAP~R~~~~~~~~~R~~~~~ya~~~gipv~~~~k 174 (397) T PRK04527 136 -GDYQIVAPIREIQKEHTQTRAYEQKYLEERGFGVRAKQK 174 (397) T ss_pred -CCCCCCCCHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCC T ss_conf -676543655653102220299999767864999887778 No 43 >PRK02090 phosphoadenosine phosphosulfate reductase; Provisional Probab=98.56 E-value=4.4e-06 Score=62.06 Aligned_cols=158 Identities=16% Similarity=0.163 Sum_probs=95.8 Q ss_pred CCCHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCC Q ss_conf 76989999999985279983999966942489999999999986489972999999667798784689999999998718 Q gi|254780546|r 2 FLSPIESVRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLR 81 (423) Q Consensus 2 ~~~p~~~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lg 81 (423) ..+|.+.++..+..+ +.+++++-|-|++|++|||++.+. ...+-++.||.|.-. .+--++..++.+.+| T Consensus 26 ~~~p~eil~wa~~~~--~~~~~~tsSFG~es~Vllhli~~~-------~~~ipV~flDTG~~f--~ETy~~~d~l~~~~~ 94 (243) T PRK02090 26 KASAQERLAWALEEF--GGGLALASSFGAESAVLLHLVAQV-------DPDIPVIFLDTGYLF--PETYRFIDQLTERLL 94 (243) T ss_pred CCCHHHHHHHHHHHC--CCCEEEEECCCHHHHHHHHHHHHH-------CCCCCEEEEECCCCC--HHHHHHHHHHHHHHC T ss_conf 799999999999975--997699945867899999999941-------999967986579888--899999999999979 Q ss_pred CCEEEEEEECCC-----CCCCC-----HHHHHHHHH---HHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 988999975047-----87780-----378885455---55432001234420123456665668999986236544433 Q gi|254780546|r 82 IAHSVVSWKNSK-----PQTGL-----MAAAREARY---ALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGM 148 (423) Q Consensus 82 i~~~~~~~~~~~-----~~~~~-----~~~ar~~r~---~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~ 148 (423) ++.++..-+... ..+++ ...-.++.. .-+....... ...+||.-.+. .. T Consensus 95 l~i~~~~P~~~~~~~~~~~g~l~~~~~~d~~~CC~irKvePL~raL~~~--daWitG~Rr~Q-s~--------------- 156 (243) T PRK02090 95 LNLKVYRPDASAAEQEARYGGLWEQSVEDPDECCRIRKVEPLNRALAGL--DAWITGLRREQ-SG--------------- 156 (243) T ss_pred CCEEEECCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHCCHHHHHHHHHC--CEEEECCCHHH-CH--------------- T ss_conf 9779977881499999982997667632489876776502799986017--75874471442-63--------------- Q ss_pred CCCCCCCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCCCC Q ss_conf 22357641003579687101210428999998998189320 Q gi|254780546|r 149 GLSGMCDTILYDLNLWISRPFLRCRREDIRSFLLQRNISWC 189 (423) Q Consensus 149 ~l~~~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wv 189 (423) .-+.+... ...+++.-+-||++++++|+.+|.+++++|+- T Consensus 157 ~Ra~l~~v-e~d~g~~KvnPL~~Ws~~dv~~Yi~~~~LP~n 196 (243) T PRK02090 157 TRAELPVL-EIDRGRFKINPLADWTNEDVWAYLKEHDLPYH 196 (243) T ss_pred HHCCCCCE-EECCCEEEECCCHHCCHHHHHHHHHHCCCCCC T ss_conf 34258734-52399888670011999999999998599988 No 44 >KOG2594 consensus Probab=98.53 E-value=2.4e-07 Score=70.92 Aligned_cols=172 Identities=13% Similarity=0.086 Sum_probs=109.9 Q ss_pred HHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHH----CCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 9985279983999966942489999999999986----489972999999667798784689999999998718988999 Q gi|254780546|r 12 FVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSD----RSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVV 87 (423) Q Consensus 12 ~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~----~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~ 87 (423) +.++......+++++|||+-|++||+.++..+++ +...++++.++.+ ...-.++....+..+..-.+ .+|...- T Consensus 55 ~~~~~~~sk~vLlv~~G~ssSlvlLd~vh~~l~q~k~~~~~~~~tv~v~~~-~~~~~~~~~v~e~lq~l~~~-~~~~~~~ 132 (396) T KOG2594 55 SHKMGTPSKNVLLVFDGGSSSLVLLDFVHLALKQLKNKRLRRDFTVLVLVV-FQEFTDSTAVFEALQELIID-NIEWVRY 132 (396) T ss_pred HHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEE-EEECCCHHHHHHHHHHHHHH-HCCCCCE T ss_conf 343088553469995188404769999999999866653276885489999-88525118889999999984-0555426 Q ss_pred EEE-------------CCCCCCC--------------------CHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH Q ss_conf 975-------------0478778--------------------0378885455554320012344201234566656689 Q gi|254780546|r 88 SWK-------------NSKPQTG--------------------LMAAAREARYALISEHAKTINATLIMTAHTFDDQLET 134 (423) Q Consensus 88 ~~~-------------~~~~~~~--------------------~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et 134 (423) .+. .....+| -++--+.-+-..+...+.+.|+..++.||+.+|-++- T Consensus 133 ~V~~la~~~~~~~~~~~~~~~~n~El~ak~~kl~~~~sv~~~srqDLl~~lk~kll~~vA~~~g~~~i~~g~~~t~la~~ 212 (396) T KOG2594 133 VVSCLAPPEKDNHVVPVESINGNDELIAKDRKLKLSDSVPDDSRQDLLLHLKMKLLQKVAAENGYNRIVLGDSTTDLASH 212 (396) T ss_pred EEEECCCHHHHCCCCCCCCCCCCCEEECCHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHH T ss_conf 88733735764575611003787001132433046445664017899999999999999998599879836735279999 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCC--CCCCCCEEECCHHCCCHHHHHHHHHHHCCCC Q ss_conf 99986236544433223576410--0357968710121042899999899818932 Q gi|254780546|r 135 VYMRSQRDYAEKGMGLSGMCDTI--LYDLNLWISRPFLRCRREDIRSFLLQRNISW 188 (423) Q Consensus 135 ~l~rl~r~sg~~g~~l~~~~~~~--~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~w 188 (423) ++-....|.|.. .+..-... +..+++.++|||-++.+.||..|+...|+++ T Consensus 213 vlt~v~~GRG~s---is~~v~~~d~r~~~d~~llrPLrDl~~~Ei~~y~~l~~l~~ 265 (396) T KOG2594 213 VLTAVVKGRGGS---ISTDVQVVDKRPKGDVKLLRPLRDLLSLEITSYCLLDGLAY 265 (396) T ss_pred HHHHHHHCCCCC---CEEHHHHHCCCCCCCCEEEHHHHHHHHHHHHHHHHHHCCCC T ss_conf 999998366765---10054453346578720000277888999999997606870 No 45 >pfam00764 Arginosuc_synth Arginosuccinate synthase. This family contains a PP-loop motif. Probab=98.49 E-value=1.8e-06 Score=64.86 Aligned_cols=139 Identities=22% Similarity=0.189 Sum_probs=90.6 Q ss_pred EEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCC-CEEEEEEECCC-------- Q ss_conf 999669424899999999999864899729999996677987846899999999987189-88999975047-------- Q gi|254780546|r 23 LVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRI-AHSVVSWKNSK-------- 93 (423) Q Consensus 23 ~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi-~~~~~~~~~~~-------- 93 (423) ++|+|||.|+++.+..|++ . ..+|+++++|-|- ..+|.+.+++.+.++|. +++++...... T Consensus 1 VLAySGGLDTSv~l~wL~e----~---~~eVia~~~d~Gq---~~ed~~~i~~kA~~~GA~~~~v~D~r~ef~~~~i~p~ 70 (389) T pfam00764 1 VLAYSGGLDTSVCIPWLKE----K---YYEVIAVAVDVGQ---GEEDLDEAREKALKLGAVKHYVIDAKEEFVEDYIFPA 70 (389) T ss_pred CEEECCCCHHHHHHHHHHH----C---CCEEEEEEEECCC---CHHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHH T ss_conf 9603676289999999997----0---9928999997999---7787899999999829978999737999999989999 Q ss_pred ---------CCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHH--HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC Q ss_conf ---------8778037888545555432001234420123456--66566899998623654443322357641003579 Q gi|254780546|r 94 ---------PQTGLMAAAREARYALISEHAKTINATLIMTAHT--FDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLN 162 (423) Q Consensus 94 ---------~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~--~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~ 162 (423) .=....+.+|-.-...+.+.+++.++.+++-|-+ -+||+--.+. .+ .+. -+ T Consensus 71 i~ana~Yeg~Y~L~tslaRplia~~~v~~A~~~ga~aiaHG~TGkGNDQvRFe~~--~~----------aL~------P~ 132 (389) T pfam00764 71 IKANALYEGRYPLGTALARPLIAKKLVEAAKKEGADAVAHGCTGKGNDQVRFELS--IR----------SLA------PD 132 (389) T ss_pred HHCCCCCCCCEEECCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHH--HH----------HHC------CC T ss_conf 9668674464785243216699999999987639769943666799745354788--99----------749------99 Q ss_pred CEEECCH--HCCCHHHHHHHHHHHCCCCC Q ss_conf 6871012--10428999998998189320 Q gi|254780546|r 163 LWISRPF--LRCRREDIRSFLLQRNISWC 189 (423) Q Consensus 163 ~~i~RPL--L~~~r~~l~~~~~~~~i~wv 189 (423) +.++-|. ..++|++..+||+++||+.- T Consensus 133 l~iiaP~R~~~~sR~~~i~ya~~~gI~v~ 161 (389) T pfam00764 133 LKVIAPVRDLNLTREEEIEYAKEKGIPIP 161 (389) T ss_pred CEEECCCCCCCCCHHHHHHHHHHCCCCCC T ss_conf 75854524532057999999998599887 No 46 >cd00553 NAD_synthase NAD+ synthase is a homodimer, which catalyzes the final step in de novo nicotinamide adenine dinucleotide (NAD+) biosynthesis, an amide transfer from either ammonia or glutamine to nicotinic acid adenine dinucleotide (NaAD). The conversion of NaAD to NAD+ occurs via an NAD-adenylate intermediate and requires ATP and Mg2+. The intemediate is subsequently cleaved into NAD+ and AMP. In many prokaryotes, such as E. coli , NAD synthetase consists of a single domain and is strictly ammonia dependent. In contrast, eukaryotes and other prokaryotes have an additional N-terminal amidohydrolase domain that prefer glutamine, Interestingly, NAD+ synthases in these prokaryotes, can also utilize ammonia as an amide source . Probab=98.48 E-value=3.1e-06 Score=63.15 Aligned_cols=140 Identities=14% Similarity=0.145 Sum_probs=88.9 Q ss_pred CCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCC--- Q ss_conf 79983999966942489999999999986489972999999667798784689999999998718988999975047--- Q gi|254780546|r 17 VYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSK--- 93 (423) Q Consensus 17 ~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~--- 93 (423) ..-..++|++|||.||.+.+.++.+... +.+++++..-..- .|.+..+....+|+.+|+++.++..+... T Consensus 21 ~g~~~~viGlSGGIDSav~a~La~~Alg-----~~~v~~v~mP~~~--ss~~s~~~A~~la~~lgi~~~~i~I~~~~~~~ 93 (248) T cd00553 21 SGFKGVVLGLSGGIDSALVAALAVRALG-----RENVLALFMPSRY--SSEETREDAKELAEALGIEHVNIDIDPAVEAF 93 (248) T ss_pred CCCCCEEEECCCCHHHHHHHHHHHHHCC-----CCEEEEEECCCCC--CCHHHHHHHHHHHHHHCCCEEEECHHHHHHHH T ss_conf 1999199958888999999999999728-----8759998899978--98758999999999958916871559999999 Q ss_pred ---------------CCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf ---------------87780378885455554320012344201234566656689999862365444332235764100 Q gi|254780546|r 94 ---------------PQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTIL 158 (423) Q Consensus 94 ---------------~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~ 158 (423) ..+|+++ +.|-..+...+...+..++-|+ +-.|.. .|-.- . T Consensus 94 ~~~~~~~~~~~~~~~~~~Niqa---RlRm~~Ly~~An~~~~lVlgTg----NksE~~-----~Gy~T------------k 149 (248) T cd00553 94 LALLGESGGSELEDLALGNIQA---RLRMVILYALANKLGGLVLGTG----NKSELL-----LGYFT------------K 149 (248) T ss_pred HHHHHHHCCCCHHHHHHHHHHH---HHHHHHHHHHHHHCCCEEECCC----CHHHHH-----HCCCE------------E T ss_conf 9986440487055678742487---8889999999974499896287----460787-----57210------------0 Q ss_pred CCCCCEEECCHHCCCHHHHHHHHHHHCCC Q ss_conf 35796871012104289999989981893 Q gi|254780546|r 159 YDLNLWISRPFLRCRREDIRSFLLQRNIS 187 (423) Q Consensus 159 ~~~~~~i~RPLL~~~r~~l~~~~~~~~i~ 187 (423) .+.+.-=+-|+-++.|.++++.++..|+| T Consensus 150 yGD~~~d~~Pi~~L~Kt~V~~la~~l~vp 178 (248) T cd00553 150 YGDGAADINPIGDLYKTQVRELARYLGVP 178 (248) T ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHCCH T ss_conf 07877582213688499999999996888 No 47 >PRK13794 hypothetical protein; Provisional Probab=98.46 E-value=1.3e-06 Score=65.76 Aligned_cols=169 Identities=17% Similarity=0.147 Sum_probs=88.8 Q ss_pred HHHHHCC--CCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 9998527--99839999669424899999999999864899729999996677987846899999999987189889999 Q gi|254780546|r 11 FFVRSLV--YPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVS 88 (423) Q Consensus 11 ~~~~~l~--~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~ 88 (423) .|++... .+..+.|++|||+||+|.|+|+.... . .++.++.+|.|+ |-.+-.++++++++++|++..+.. T Consensus 235 ~fir~~~~~~~~pv~VSfSGGKDS~v~L~La~ka~----~--~~~~vvF~DTgl--EfPeT~e~ve~v~~~~gv~ii~~~ 306 (473) T PRK13794 235 GFMRNTIVKIGKPPSVAYSGGKDSLATLLLALKAF----G--NEFPVLFADTGL--EFPETLENVEDVEELYGLEIIRTS 306 (473) T ss_pred HHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHC----C--CCEEEEEEECCC--CCHHHHHHHHHHHHHHCCCEEEEC T ss_conf 99999998669983997068689999999999860----8--971699960888--767699999999998598789915 Q ss_pred EECC-CCCCCCHHHHHHHHH-------HHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC Q ss_conf 7504-787780378885455-------55432001234420123456665668999986236544433223576410035 Q gi|254780546|r 89 WKNS-KPQTGLMAAAREARY-------ALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYD 160 (423) Q Consensus 89 ~~~~-~~~~~~~~~ar~~r~-------~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~ 160 (423) -... ..-......++..|. .-+....++.....++ ...-+...|++ -|..+..-.. .... T Consensus 307 ~~~fw~~~~~~GpP~rd~RWCc~v~Kl~pl~~~l~~~~~~~~l-~~vG~R~~ES~-~Rak~prv~~----------n~~i 374 (473) T PRK13794 307 SGEFWEKLEEYGPPARDNRWCSEVCKLSPLGKLIDEKYEGECL-SFVGQRKYESF-NRSKKPRIWR----------NPTI 374 (473) T ss_pred CCCHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCE-EEEECCHHHHH-HHHCCCCCCC----------CCCC T ss_conf 6447788886199976660213226378999999974899718-99852168768-7504874123----------7887 Q ss_pred CCCEEECCHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 7968710121042899999899818932026678642210189997 Q gi|254780546|r 161 LNLWISRPFLRCRREDIRSFLLQRNISWCEDPSNTDDRFERVRVRR 206 (423) Q Consensus 161 ~~~~i~RPLL~~~r~~l~~~~~~~~i~wveDpSN~d~~f~R~rlR~ 206 (423) .+....-|+.+++--++--|+-.++++| |+-|++-+-|- T Consensus 375 ~~qi~~~PI~~Wtal~VwLYI~~~~lp~-------NpLY~~Gf~Rv 413 (473) T PRK13794 375 KKQMLAAPILNWTAMHVWIYLFREKAPY-------NVLYEQGFDRI 413 (473) T ss_pred CCCEEEECHHCCHHHHHHHHHHHCCCCC-------CHHHHCCCCCC T ss_conf 6715761431051879999998758998-------85887687545 No 48 >PRK01565 thiamine biosynthesis protein ThiI; Provisional Probab=98.45 E-value=5.3e-06 Score=61.45 Aligned_cols=146 Identities=21% Similarity=0.200 Sum_probs=95.4 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHH---HHHC--CCCEEEEEEECC Q ss_conf 99839999669424899999999999864899729999996677987846899999999---9871--898899997504 Q gi|254780546|r 18 YPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDV---CSRL--RIAHSVVSWKNS 92 (423) Q Consensus 18 ~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~---~~~l--gi~~~~~~~~~~ 92 (423) ...++++=+|||.||-+-.+++-. + +.++..+|++ +---.+.+-.+-|.++ ..++ ++.++++..... T Consensus 175 t~Gk~l~LlSGGIDSPVAa~~mmk----R---G~~v~~v~f~-~~p~t~~~a~~kv~~l~~~L~~y~~~~kl~~v~f~~~ 246 (399) T PRK01565 175 TSGKALLLLSGGIDSPVAGYLAMK----R---GVEIEAVHFH-SPPYTSERAKEKVIDLARILAKYGGRIKLHVVPFTEI 246 (399) T ss_pred CCCCEEEEECCCCCHHHHHHHHHH----C---CCEEEEEEEE-CCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCHHH T ss_conf 778579986178753999999985----5---9879999986-8998778999999999999997288855999663999 Q ss_pred -------CCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEE Q ss_conf -------7877803788854555543200123442012345666566899998623654443322357641003579687 Q gi|254780546|r 93 -------KPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWI 165 (423) Q Consensus 93 -------~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i 165 (423) .+..-..-..|+.-|+.....+++.++..+.||-.+.-.+-..+-++.--+ ..-+..+ T Consensus 247 ~~~i~~~~~~~~~~vl~RR~M~riA~~iA~~~~~~alvTGESLGQVASQTl~Nl~~i~---------------~~~~~pV 311 (399) T PRK01565 247 QEEIKKKVPESYLMTLMRRFMMRIADKIAEKRGALAIVTGESLGQVASQTLESMQAIN---------------AVTNLPV 311 (399) T ss_pred HHHHHHCCCCCEEEHHHHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHHHHHHHHHH---------------HHCCCCC T ss_conf 9999860880038599999999999999998599789824135567675567788887---------------5217754 Q ss_pred ECCHHCCCHHHHHHHHHHHCC Q ss_conf 101210428999998998189 Q gi|254780546|r 166 SRPFLRCRREDIRSFLLQRNI 186 (423) Q Consensus 166 ~RPLL~~~r~~l~~~~~~~~i 186 (423) +|||+++.|+||.+.+++.|- T Consensus 312 lRPLIg~DK~EII~~Ar~IGT 332 (399) T PRK01565 312 LRPLIGMDKEEIIEIAKEIGT 332 (399) T ss_pred CCCCCCCCHHHHHHHHHHCCC T ss_conf 466557898999999987096 No 49 >PRK12563 sulfate adenylyltransferase subunit 2; Provisional Probab=98.43 E-value=2.8e-06 Score=63.42 Aligned_cols=170 Identities=11% Similarity=0.040 Sum_probs=98.8 Q ss_pred CCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECC-CC Q ss_conf 27998399996694248999999999998648997299999966779878468999999999871898899997504-78 Q gi|254780546|r 16 LVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNS-KP 94 (423) Q Consensus 16 l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~-~~ 94 (423) ...-++.++=+|||+||++||||+...-.- ..++ +-++|||+|..- .+-.+|..++++++|+.+.+-..+.. +. T Consensus 34 ~a~f~~pvmL~S~GKDS~VmlhLA~KAF~p-~~iP--fPllHIDTg~kF--~Emi~fRD~~a~~~~l~LiV~~n~~~i~~ 108 (312) T PRK12563 34 VAECSKPVMLYSIGKDSSVMLHLAMKAFRP-TRPP--FPLLHVDTTWKF--REMIDFRDRRAKELGLDLVVHHNPDGIAR 108 (312) T ss_pred HHHHCCCEEEEECCCHHHHHHHHHHHHHCC-CCCC--CCEEEEECCCCH--HHHHHHHHHHHHHHCCEEEEECCHHHHHC T ss_conf 997369679861674299999999997377-8999--735886368875--99999999999981986898328788866 Q ss_pred CCC-----CHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH--CCCC--CCCC----CCCCCCCCCCCCC Q ss_conf 778-----0378885455554320012344201234566656689999862--3654--4433----2235764100357 Q gi|254780546|r 95 QTG-----LMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQ--RDYA--EKGM----GLSGMCDTILYDL 161 (423) Q Consensus 95 ~~~-----~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~--r~sg--~~g~----~l~~~~~~~~~~~ 161 (423) +.+ .........-..+.+...+.+.+.++.|--.|.+------|.+ |..+ ++-. .++.+.+.....+ T Consensus 109 g~~p~~~~~~~~~~~lKT~~L~~ale~~~fDa~~gGaRRDEEksRAKERiFS~Rd~~~~WDPknQrPElWnlyN~~~~~G 188 (312) T PRK12563 109 GIVPFRHGSALHTDVAKTQGLKQALDHHGFDAAIGGARRDEEKSRAKERIFSFRSAFHRWDPKAQRPELWSLYNARLRRG 188 (312) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHHHHCCCCCCC T ss_conf 99967678377766898899999999739887850453213423443403066677889885558835877615567999 Q ss_pred CCEEECCHHCCCHHHHHHHHHHHCCCCCC Q ss_conf 96871012104289999989981893202 Q gi|254780546|r 162 NLWISRPFLRCRREDIRSFLLQRNISWCE 190 (423) Q Consensus 162 ~~~i~RPLL~~~r~~l~~~~~~~~i~wve 190 (423) .-.-+=||++++.-+|-+|....+|+.+. T Consensus 189 eh~RVfPlsNWTElDIW~YI~~E~Ipi~~ 217 (312) T PRK12563 189 ESLRVFPLSNWTELDVWQYIAREKIPLVP 217 (312) T ss_pred CEEEECCCCCCCHHHHHHHHHHHCCCCCC T ss_conf 71442235566178999999982889886 No 50 >PRK13795 hypothetical protein; Provisional Probab=98.41 E-value=2.4e-06 Score=63.88 Aligned_cols=161 Identities=18% Similarity=0.193 Sum_probs=89.8 Q ss_pred HHHHHCCC--CCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 99985279--9839999669424899999999999864899729999996677987846899999999987189889999 Q gi|254780546|r 11 FFVRSLVY--PAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVS 88 (423) Q Consensus 11 ~~~~~l~~--~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~ 88 (423) .|++.... +..+.|++|||+||+|.|+|+.... .++.++.+|.|+ |-.+-.++|+++++++|+++.+.+ T Consensus 235 ~fir~~~~~~~~pv~VsfSGGKDS~v~L~La~ka~-------~~~~~~f~dTgl--EfPeT~e~v~~~~~~~gi~~~~~~ 305 (630) T PRK13795 235 NFIREVAEKYNLPVVVSFSGGKDSLVVLDLAAEAL-------RKFKAFFNNTGL--EFPETVENVKEVAEEYGVELIVAD 305 (630) T ss_pred HHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHC-------CCEEEEEECCCC--CCHHHHHHHHHHHHHHCCCEEEEC T ss_conf 99998787548975995268689999999999854-------982899950776--764699999999998698189965 Q ss_pred EECCC--CCCCCHHHHHHHHHHHHHHHCCCCC----CCCHHHHHH--------HHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 75047--8778037888545555432001234----420123456--------665668999986236544433223576 Q gi|254780546|r 89 WKNSK--PQTGLMAAAREARYALISEHAKTIN----ATLIMTAHT--------FDDQLETVYMRSQRDYAEKGMGLSGMC 154 (423) Q Consensus 89 ~~~~~--~~~~~~~~ar~~r~~~~~~~~~~~~----~~~l~~ah~--------~dD~~Et~l~rl~r~sg~~g~~l~~~~ 154 (423) ..... .-......+|.. .||.... ....+-.+. -+...|.+ -| +..+ T Consensus 306 a~~~fw~~~~~~GpP~rd~------RWCckv~Kl~pl~~~i~~~~~~~~l~~~G~R~~ES~-~R------------~~~~ 366 (630) T PRK13795 306 AGDAFWRAIEKFGPPARDY------RWCCKVCKLGPITRAIKSNFPQGCLSFVGQRKYESF-AR------------AKSP 366 (630) T ss_pred CCHHHHHHHHHCCCCCCCC------CCCCCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHH-HH------------HCCC T ss_conf 6367888888749996556------435543100579999997589962999855246637-66------------0488 Q ss_pred CCCC--CCCCCEEECCHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 4100--357968710121042899999899818932026678642210189997 Q gi|254780546|r 155 DTIL--YDLNLWISRPFLRCRREDIRSFLLQRNISWCEDPSNTDDRFERVRVRR 206 (423) Q Consensus 155 ~~~~--~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wveDpSN~d~~f~R~rlR~ 206 (423) ..+. ...+...+-|+.+++--++--|.-.+++++ |+-|++-+-|- T Consensus 367 rv~~n~~i~~q~~a~PI~~Ws~~~VwLYi~~~~l~~-------NplY~~Gf~Ri 413 (630) T PRK13795 367 RVWRNPWVPNQIGAAPIQDWTALEVWLYIFSRKLPY-------NPLYERGFDRI 413 (630) T ss_pred CCCCCCCCCCEEEEECHHCCCHHHHHHHHHHCCCCC-------CHHHHCCCCCC T ss_conf 614477877726872033173869999998848998-------84787587565 No 51 >KOG1622 consensus Probab=98.40 E-value=1.2e-06 Score=65.95 Aligned_cols=163 Identities=20% Similarity=0.115 Sum_probs=91.0 Q ss_pred HHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCC Q ss_conf 85279983999966942489999999999986489972999999667798784689999999998718988999975047 Q gi|254780546|r 14 RSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSK 93 (423) Q Consensus 14 ~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~ 93 (423) .......++++++|||.||+++..++.+... .-+++++|||-|+-- ..|++.|++....+||+.++....... T Consensus 225 ~k~vG~~~Vl~~vSGgvdStV~a~Ll~~alg-----~~R~~ai~vdNG~mr--k~Ea~~V~~tl~~lgi~i~v~~as~~f 297 (552) T KOG1622 225 RKWVGDYKVLVAVSGGVDSTVCAALLRRALG-----PDRVHAIHVDNGFMR--KKEAEQVEKTLVYLGIPITVVDASETF 297 (552) T ss_pred HHHHCCCCEEEEECCCCHHHHHHHHHHHHHC-----CCCEEEEEECCCCHH--HHHHHHHHHHHHHCCCCEEEEECHHHH T ss_conf 9984644559983488508999999998637-----894489981264010--367788999999738844996215888 Q ss_pred CC-----CCCHH-------HHHHHHHHHHHHHCCCCCC--CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC-- Q ss_conf 87-----78037-------8885455554320012344--20123456665668999986236544433223576410-- Q gi|254780546|r 94 PQ-----TGLMA-------AAREARYALISEHAKTINA--TLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTI-- 157 (423) Q Consensus 94 ~~-----~~~~~-------~ar~~r~~~~~~~~~~~~~--~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~-- 157 (423) -+ .+-+. ..-...-........+.+. .+++.|.-.-|..|... ..|+|. +..+.... T Consensus 298 ~s~L~~~~dPE~KRkiIG~tfikv~~~~~~~l~~k~~~~~~flaQgtL~Pd~ieS~s---~~g~~~----a~tIKThhn~ 370 (552) T KOG1622 298 LSKLKGVTDPEEKRKIIGRTFIKVFDPVASELNKKHGEKESFLAQGTLRPDLIESAS---VYGSGH----AETIKTHHND 370 (552) T ss_pred HHHHCCCCCHHHHCEECCCCEEEECCHHHHHHHHCCCCCCEEEECCCCCCHHHHHCC---CCCCCH----HHHHHCCCCC T ss_conf 876126688788161024200240767888752065760004433501410443134---567734----5442010255 Q ss_pred ----CCC-CCCEEECCHHCCCHHHHHHHHHHHCCCCCC Q ss_conf ----035-796871012104289999989981893202 Q gi|254780546|r 158 ----LYD-LNLWISRPFLRCRREDIRSFLLQRNISWCE 190 (423) Q Consensus 158 ----~~~-~~~~i~RPLL~~~r~~l~~~~~~~~i~wve 190 (423) +.. .-..++-||-++-|+|.|+.-+..|++-.- T Consensus 371 ~~L~r~lrklgK~ieplk~~~kDEvr~lgk~lGlp~~L 408 (552) T KOG1622 371 TGLIRDLRKLGKVIEPLKDFHKDEVRELGKDLGLPESL 408 (552) T ss_pred CHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCHHH T ss_conf 05899999831657426788889998766650872432 No 52 >cd01999 Argininosuccinate_Synthase Argininosuccinate synthase. The Argininosuccinate synthase is a urea cycle enzyme that catalyzes the penultimate step in arginine biosynthesis: the ATP-dependent ligation of citrulline to aspartate to form argininosuccinate, AMP and pyrophosphate . In humans, a defect in the AS gene causes citrullinemia, a genetic disease characterized by severe vomiting spells and mental retardation. AS is a homotetrameric enzyme of chains of about 400 amino-acid residues. An arginine seems to be important for the enzyme's catalytic mechanism. The sequences of AS from various prokaryotes, archaebacteria and eukaryotes show significant similarity Probab=98.39 E-value=8e-06 Score=60.21 Aligned_cols=141 Identities=21% Similarity=0.175 Sum_probs=90.4 Q ss_pred EEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCC-CEEEEEEECCC------- Q ss_conf 9999669424899999999999864899729999996677987846899999999987189-88999975047------- Q gi|254780546|r 22 ILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRI-AHSVVSWKNSK------- 93 (423) Q Consensus 22 i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi-~~~~~~~~~~~------- 93 (423) |++|+|||.|+++.+..|++- . ..+|++++||-|- + +++.+.+++-+.++|. +++++...... T Consensus 1 VvLAySGGLDTSv~l~wL~e~----~--~~eVia~~~d~Gq-~--~e~~~~i~~kA~~~GA~~~~v~D~r~ef~~~~i~p 71 (385) T cd01999 1 VVLAYSGGLDTSVILKWLKEK----G--GYEVIAVTADVGQ-P--EEEIEAIEEKALKLGAKKHVVVDLREEFVEDYIFP 71 (385) T ss_pred CEEEECCCCHHHHHHHHHHHH----C--CCEEEEEEEECCC-C--HHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHHH T ss_conf 979977881799999999974----5--9849999998999-6--67778999999981997899961299999999999 Q ss_pred ----------CCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHH--HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC Q ss_conf ----------8778037888545555432001234420123456--6656689999862365444332235764100357 Q gi|254780546|r 94 ----------PQTGLMAAAREARYALISEHAKTINATLIMTAHT--FDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDL 161 (423) Q Consensus 94 ----------~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~--~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~ 161 (423) .=....+.+|-.-...+.+.+++.++..++-|-+ -+||+--.+. .+ .+. . T Consensus 72 ai~ana~Yeg~Y~L~tslaRplIak~~ve~A~~~ga~aiaHG~TGkGNDQvRFe~~--~~----------aL~------P 133 (385) T cd01999 72 AIQANALYEGTYPLGTALARPLIAKALVEVAKEEGADAVAHGCTGKGNDQVRFELA--FY----------ALN------P 133 (385) T ss_pred HHHHCCCCCCCEECCCCHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCHHHHHHH--HH----------HHC------C T ss_conf 99608643375321350157899999999999809849974566688840689999--99----------859------9 Q ss_pred CCEEECCH---HCCCHHHHHHHHHHHCCCCC Q ss_conf 96871012---10428999998998189320 Q gi|254780546|r 162 NLWISRPF---LRCRREDIRSFLLQRNISWC 189 (423) Q Consensus 162 ~~~i~RPL---L~~~r~~l~~~~~~~~i~wv 189 (423) ++.++-|. =..+|++..+||+++||+.- T Consensus 134 ~l~iiAP~Rd~~~~sR~~~i~ya~~~gi~v~ 164 (385) T cd01999 134 DLKIIAPWRDWEFLSREEEIEYAEEHGIPVP 164 (385) T ss_pred CCEEEEEEECCCCCCHHHHHHHHHHCCCCCC T ss_conf 9758514731444888999999998599988 No 53 >PRK08557 hypothetical protein; Provisional Probab=98.38 E-value=4.3e-06 Score=62.15 Aligned_cols=151 Identities=17% Similarity=0.200 Sum_probs=84.9 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECC---CCC Q ss_conf 98399996694248999999999998648997299999966779878468999999999871898899997504---787 Q gi|254780546|r 19 PAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNS---KPQ 95 (423) Q Consensus 19 ~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~---~~~ 95 (423) +-.|.|++|||+||+|+|+++.... . ++.++.+|.|+ |-.+-.++|+++++.+|++..+..-+.- -.. T Consensus 183 ~~pV~VSfSGGKDS~a~L~La~~a~-----~--~~~vvF~DTGl--EfPeT~eyve~~~~~~~~~l~~~~~~~fw~~~~~ 253 (420) T PRK08557 183 GYAINASFSGGKDSAVSTLLSKEVI-----P--DLDVVFIDTGL--EYPETLNYVKDFAKKYDINLDTVDGDNFWDNLEK 253 (420) T ss_pred CCCEEEEECCCHHHHHHHHHHHHHC-----C--CCEEEEECCCC--CCHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHH T ss_conf 9746996068789999999999854-----8--81699962887--7723899999999980996799654668988775 Q ss_pred CCCHHHHHHHHHHHHHHHCCCCC----CCCHHHHHH---------HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC--CC Q ss_conf 78037888545555432001234----420123456---------6656689999862365444332235764100--35 Q gi|254780546|r 96 TGLMAAAREARYALISEHAKTIN----ATLIMTAHT---------FDDQLETVYMRSQRDYAEKGMGLSGMCDTIL--YD 160 (423) Q Consensus 96 ~~~~~~ar~~r~~~~~~~~~~~~----~~~l~~ah~---------~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~--~~ 160 (423) .+. .++. ..||+... ....+-.+. -+...|++ -| +.....+. .. T Consensus 254 ~Gp--P~rd------~RWCc~v~Kl~Pl~~~i~~~~~~~~vl~~~G~R~~ES~-~R------------~~~~~~~~~~~v 312 (420) T PRK08557 254 EGI--PTKD------NRWCNSACKLMPLKRYLKKKYGNKKVLTIDGSRKYESF-TR------------ANLDYERKSGFI 312 (420) T ss_pred CCC--CCCC------CCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEECHHHHH-HH------------HCCCCCCCCCCC T ss_conf 499--9766------72507431017899999985789875999970145407-66------------058741347776 Q ss_pred CCCEEECCHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 7968710121042899999899818932026678642210189997 Q gi|254780546|r 161 LNLWISRPFLRCRREDIRSFLLQRNISWCEDPSNTDDRFERVRVRR 206 (423) Q Consensus 161 ~~~~i~RPLL~~~r~~l~~~~~~~~i~wveDpSN~d~~f~R~rlR~ 206 (423) ....-+-|+++++--++--|+-.++++| |+-|++-+-|- T Consensus 313 ~~q~~~~PI~~Wsa~~VwLYI~~~~l~~-------NplY~~Gf~Rv 351 (420) T PRK08557 313 DFQTNVFPILDWNSLDIWSYIFSNDILY-------NPMYDKGFERI 351 (420) T ss_pred CCCEEEEHHHCCCHHHHHHHHHHCCCCC-------CHHHHCCCCCC T ss_conf 6650353153260769999999759999-------84886687636 No 54 >PRK13980 NAD synthetase; Provisional Probab=98.36 E-value=1.4e-05 Score=58.45 Aligned_cols=150 Identities=19% Similarity=0.164 Sum_probs=89.8 Q ss_pred HHHHHHHHHC---CCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 9999999852---7998399996694248999999999998648997299999966779878468999999999871898 Q gi|254780546|r 7 ESVRFFVRSL---VYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIA 83 (423) Q Consensus 7 ~~~~~~~~~l---~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~ 83 (423) +.+-.|++.- .+...++|++|||.||.+.+.++.+.. + +-+++++..=. +..|.+..+....+|+.+||+ T Consensus 13 ~~iv~~lrdy~~~~g~kg~VlGlSGGIDSavva~La~~Al----g-~~~v~~v~mP~--~~ss~~s~~dA~~la~~lgi~ 85 (264) T PRK13980 13 EIIVDFIREEVEKAGFKGVVLGLSGGIDSAVVAYLAVKAL----G-KENVLALLMPY--RVSPPEDLEDALLVAERLGIE 85 (264) T ss_pred HHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHC----C-CCCEEEEECCC--CCCCHHHHHHHHHHHHHHCCC T ss_conf 9999999999998099809997986888999999999854----9-66068998989--999887899999999986998 Q ss_pred EEEEEEECCC-------------CCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 8999975047-------------877803788854555543200123442012345666566899998623654443322 Q gi|254780546|r 84 HSVVSWKNSK-------------PQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGL 150 (423) Q Consensus 84 ~~~~~~~~~~-------------~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l 150 (423) +.++..+..- ..+|+++ +.|-..+...+...+..++-|+.- .| +++-.+-- .| T Consensus 86 ~~~i~I~~~~~~~~~~~~~~~~~~~~Niqa---R~Rm~~Ly~~An~~~~lVlgTgNk----sE-~~~Gy~Tk---yG--- 151 (264) T PRK13980 86 YKVIEITPIVDAFFSAVPDADRLRRGNIMA---RTRMVLLYDYANRDNRLVLGTSNK----SE-LLLGYFTK---YG--- 151 (264) T ss_pred EEEEECHHHHHHHHHHCCCCCHHHHHHHHH---HHHHHHHHHHHHHCCCEEECCCCH----HH-HHHHHCCC---CC--- T ss_conf 499827999999998631000668887589---899999999864339776558867----47-98721001---68--- Q ss_pred CCCCCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCC Q ss_conf 3576410035796871012104289999989981893 Q gi|254780546|r 151 SGMCDTILYDLNLWISRPFLRCRREDIRSFLLQRNIS 187 (423) Q Consensus 151 ~~~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~ 187 (423) .+.-=+-|+-++.|.|+++.++..|+| T Consensus 152 ----------D~~~d~~Pi~dL~Kt~V~~La~~l~vP 178 (264) T PRK13980 152 ----------DGAVDINPIGDLYKTQVRELARHLGVP 178 (264) T ss_pred ----------CCCCCHHHHCCCCHHHHHHHHHHHCCC T ss_conf ----------766562452587399999999993996 No 55 >cd01991 Asn_Synthase_B_C The C-terminal domain of Asparagine Synthase B. This domain is always found associated n-terminal amidotransferase domain. Family members that contain this domain catalyse the conversion of aspartate to asparagine. Asparagine synthetase B catalyzes the assembly of asparagine from aspartate, Mg(2+)ATP, and glutamine. The three-dimensional architecture of the N-terminal domain of asparagine synthetase B is similar to that observed for glutamine phosphoribosylpyrophosphate amidotransferase while the molecular motif of the C-domain is reminiscent to that observed for GMP synthetase . Probab=98.36 E-value=2.4e-06 Score=63.96 Aligned_cols=108 Identities=19% Similarity=0.236 Sum_probs=73.9 Q ss_pred HCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCC Q ss_conf 52799839999669424899999999999864899729999996677987846899999999987189889999750478 Q gi|254780546|r 15 SLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSKP 94 (423) Q Consensus 15 ~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~~ 94 (423) .+.....|++.+|||.||++++.+++.... . ++.+.+++++. +..+|..+++.+++.+|++++++..+...- T Consensus 11 ~l~sd~~vg~~LSGGlDSs~ia~~~~~~~~----~--~~~~~s~~~~~--~~~de~~~a~~va~~~~~~~~~i~~~~~~~ 82 (269) T cd01991 11 RLRSDVPVGVLLSGGLDSSLVAALAARLLP----E--PVKTFSIGFGF--EGSDEREYARRVAEHLGTEHHEVEFTPADL 82 (269) T ss_pred HHCCCCCEEEEECCHHHHHHHHHHHHHHCC----C--CCCEEEEECCC--CCCCHHHHHHHHHHHCCCCCEEEECCHHHH T ss_conf 954488568651356999999999998368----9--98479986289--997378999999998098824531434889 Q ss_pred CC----------CCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH Q ss_conf 77----------8037888545555432001234420123456665 Q gi|254780546|r 95 QT----------GLMAAAREARYALISEHAKTINATLIMTAHTFDD 130 (423) Q Consensus 95 ~~----------~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD 130 (423) .. ..........+..+...+.+.++.++++|+-.|. T Consensus 83 ~~~~~~~~~~~~~p~~~~~~~~~~~l~~~a~~~g~~VllsG~GgDE 128 (269) T cd01991 83 LAALPDVIWELDEPFADSSAIPLYLLSRLARKHGIKVVLSGEGADE 128 (269) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCHHH T ss_conf 9999999998639601254789999999998539779996776146 No 56 >COG0137 ArgG Argininosuccinate synthase [Amino acid transport and metabolism] Probab=98.34 E-value=2.4e-05 Score=56.85 Aligned_cols=149 Identities=16% Similarity=0.138 Sum_probs=91.8 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCC-EEEEEEECCCC-- Q ss_conf 998399996694248999999999998648997299999966779878468999999999871898-89999750478-- Q gi|254780546|r 18 YPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIA-HSVVSWKNSKP-- 94 (423) Q Consensus 18 ~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~-~~~~~~~~~~~-- 94 (423) ...||++|+|||.|-++.+..|++ +. +.+++++++|-|- + .+|.+.+++-+.++|.. ++++....+.- T Consensus 3 ~~kkvvLAYSGGLDTSv~i~wL~e----~~--~~eVit~tadvGQ-~--eed~~~i~eKA~~~Ga~~~~viD~reeFv~~ 73 (403) T COG0137 3 KVKKVVLAYSGGLDTSVAIKWLKE----KG--GAEVIAVTADVGQ-P--EEDLDAIREKALELGAEEAYVIDAREEFVED 73 (403) T ss_pred CCCEEEEEECCCCCHHHHHHHHHH----HC--CCEEEEEEEECCC-C--HHHHHHHHHHHHHHCCCEEEEEECHHHHHHH T ss_conf 776799995488239999999997----65--9469999975899-7--5775799999998188528996438999999 Q ss_pred ---------------CCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC Q ss_conf ---------------77803788854555543200123442012345666566899998623654443322357641003 Q gi|254780546|r 95 ---------------QTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILY 159 (423) Q Consensus 95 ---------------~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~ 159 (423) =...-+.||-.-...+.+.+++.|+..++ |-. -|.|-+....-. ..... T Consensus 74 yi~~~i~ana~Yeg~YpL~TalaRPLIak~lVe~A~k~Ga~ava--HGc------------TGKGNDQvRFE~--~~~al 137 (403) T COG0137 74 YIFPAIKANALYEGVYPLGTALARPLIAKKLVEAAKKEGADAVA--HGC------------TGKGNDQVRFEL--AILAL 137 (403) T ss_pred HHHHHHHHHCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEE--ECC------------CCCCCCEEEEEE--EHHHH T ss_conf 99999973051215641454346779999999999971996997--467------------888875354320--04541 Q ss_pred CCCCEEECCH--HCCCHHHHHHHHHHHCCCCCCC Q ss_conf 5796871012--1042899999899818932026 Q gi|254780546|r 160 DLNLWISRPF--LRCRREDIRSFLLQRNISWCED 191 (423) Q Consensus 160 ~~~~~i~RPL--L~~~r~~l~~~~~~~~i~wveD 191 (423) ..++.++-|. ..++|++..+|+.++||+.--| T Consensus 138 ~pdlkiiAP~Rew~~~R~~~i~Ya~~~gipv~~~ 171 (403) T COG0137 138 NPDLKIIAPWREWNLTREEEIEYAEEHGIPVKAT 171 (403) T ss_pred CCCCEEEEEHHHHCCCHHHHHHHHHHCCCCCCCC T ss_conf 8996798563341337699999999849976656 No 57 >cd01996 Alpha_ANH_like_III This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins is predicted to bind ATP. This domain has a strongly conserved motif SGGKD at the N terminus. Probab=98.34 E-value=4.3e-06 Score=62.13 Aligned_cols=62 Identities=19% Similarity=0.313 Sum_probs=49.5 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEC Q ss_conf 39999669424899999999999864899729999996677987846899999999987189889999750 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKN 91 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~ 91 (423) ..+|+||||+||+-.+|.+++-. +++..+||+|+++.. +...+.++++|+. |+++.....+. T Consensus 3 DcIVpvSGGKDS~y~~~~lk~ky------glnpL~Vt~~~~~~t--~~g~~Nl~nl~~~-g~D~~~~~~~~ 64 (154) T cd01996 3 DCIIGVSGGKDSSYALYLLKEKY------GLNPLAVTVDNGFNS--EEAVKNIKNLIKK-GLDLDHLVINP 64 (154) T ss_pred CEEEECCCCHHHHHHHHHHHHHH------CCCEEEEEECCCCCC--HHHHHHHHHHHHC-CCCEEEEECCH T ss_conf 78996888818999999999994------996599996798754--6899999999974-99816993288 No 58 >PRK00876 nadE NAD synthetase; Reviewed Probab=98.31 E-value=1.9e-05 Score=57.53 Aligned_cols=77 Identities=22% Similarity=0.290 Sum_probs=54.5 Q ss_pred HHHHHHHHH-C-CC-CC-EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCC Q ss_conf 999999985-2-79-98-39999669424899999999999864899729999996677987846899999999987189 Q gi|254780546|r 7 ESVRFFVRS-L-VY-PA-HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRI 82 (423) Q Consensus 7 ~~~~~~~~~-l-~~-~~-~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi 82 (423) +.+..|++. + .. .. -++||+|||.||++.+.|+.+-.. +-++.++..-. +..+.+.....+.+|+.+|| T Consensus 17 ~riv~fir~~v~~~~~~kG~VlGlSGGIDSAv~a~Lav~AlG-----~e~V~gl~MP~--~~s~~~s~~~a~~la~~lGi 89 (325) T PRK00876 17 ERIRAFIREQVLGTLKRRGVVLGLSGGIDSSVTLALCVRALG-----KDRVLGLLMPE--RDSSPDSLRLGRMLAESLGV 89 (325) T ss_pred HHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHHHHHC-----CCCEEEEECCC--CCCCHHHHHHHHHHHHHCCC T ss_conf 999999999998516986799968768889999999999748-----55169997886--56887789999999998199 Q ss_pred CEEEEEEE Q ss_conf 88999975 Q gi|254780546|r 83 AHSVVSWK 90 (423) Q Consensus 83 ~~~~~~~~ 90 (423) ++.+..+. T Consensus 90 ~~~~idI~ 97 (325) T PRK00876 90 EYVVEDIS 97 (325) T ss_pred CEEEEECH T ss_conf 35998448 No 59 >pfam00733 Asn_synthase Asparagine synthase. This family is always found associated with pfam00310. Members of this family catalyse the conversion of aspartate to asparagine. Probab=98.29 E-value=1.8e-05 Score=57.67 Aligned_cols=165 Identities=22% Similarity=0.218 Sum_probs=100.4 Q ss_pred HHHHH-HHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 99999-85279983999966942489999999999986489972999999667798784689999999998718988999 Q gi|254780546|r 9 VRFFV-RSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVV 87 (423) Q Consensus 9 ~~~~~-~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~ 87 (423) +.+.+ ..+.....|++.+|||.||++++.++++.. . .++.+++++..- ....|..+++..++.+|++|+.+ T Consensus 6 l~~aV~~~l~sdvpig~~LSGGlDSs~ia~l~~~~~----~--~~i~~~s~~~~~--~~~~E~~~a~~~a~~~~~~~~~v 77 (195) T pfam00733 6 LRDAVKRRLRADVPVGVLLSGGLDSSLIAALAARQS----S--PPLKTFSVGFEG--SDYDEAPYAELVADHLGTDHHEI 77 (195) T ss_pred HHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHC----C--CCCEEEECCCCC--CCCCHHHHHHHHHHHHCCCCEEE T ss_conf 999999983469836753067468999999999853----8--995389557888--99758999999996513351787 Q ss_pred EEECC--------------CCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 97504--------------7877803788854555543200123442012345666566899998623654443322357 Q gi|254780546|r 88 SWKNS--------------KPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGM 153 (423) Q Consensus 88 ~~~~~--------------~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~ 153 (423) .++.. .|..... ....| .+...++ .+..++++|.-.|.. +-|-..+ ..+| T Consensus 78 ~~~~~~~~~~~~~~i~~~~~P~~~~~---~~~~~-~l~k~~~-~~~kV~lsG~GaDEl--------f~GYr~D---r~~m 141 (195) T pfam00733 78 IVTEEELLDALPEVIYHLEEPFGDSS---AIPLY-LLSRLAR-KGVKVVLSGEGADEL--------FGGYRDD---RMSM 141 (195) T ss_pred EECHHHHHHHHHHHHHHHCCCCCCCC---HHHHH-HHHHHHH-CCCEEEEEEECHHHH--------HCCCCCC---CHHH T ss_conf 72559999998888998629877751---78999-9999986-898499971568888--------4798687---0122 Q ss_pred CCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 6410035796871012104289999989981893202667864221018999764345 Q gi|254780546|r 154 CDTILYDLNLWISRPFLRCRREDIRSFLLQRNISWCEDPSNTDDRFERVRVRRFVRDI 211 (423) Q Consensus 154 ~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wveDpSN~d~~f~R~rlR~~l~~l 211 (423) . .++...=|+|+ .++.+||......|.-. |.. ++--+|+.+... T Consensus 142 ~------~gvE~R~Pfld---~~lv~~~~~ip~~~k~~--~~~---~K~iLR~a~~~~ 185 (195) T pfam00733 142 A------HGLEVRVPFLD---HRLVEFALSLPPELKLR--DGE---EKYILREAARGI 185 (195) T ss_pred H------CCCEEECCCCC---HHHHHHHHHCCHHHHCC--CCC---CHHHHHHHHHCC T ss_conf 2------57305566223---79999999499999479--999---889999998671 No 60 >PRK06850 hypothetical protein; Provisional Probab=98.29 E-value=3.1e-05 Score=56.03 Aligned_cols=182 Identities=14% Similarity=0.179 Sum_probs=108.4 Q ss_pred CCHHHHHHHHHHH--CCCCCEEEEEECCCHHHHHHHHHHHHHHHHCC--CCCEEEEEEEECCCCC-CC----HHHHHHHH Q ss_conf 6989999999985--27998399996694248999999999998648--9972999999667798-78----46899999 Q gi|254780546|r 3 LSPIESVRFFVRS--LVYPAHILVAVSGGSDSMGLLIALHSVLSDRS--FGKIKFSAISVDHCLR-ET----AKDEVRYV 73 (423) Q Consensus 3 ~~p~~~~~~~~~~--l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~--~~~~~l~a~~vdHglr-~~----s~~e~~~v 73 (423) ..+++.+.+.++. +......+||+|||+||+|+|+|.-.....-+ ...-.|++++-|.+.- |. -..-.+.. T Consensus 3 ~~~i~~i~~ei~~~Yl~D~rPWvIGySGGKDSTavLqLvw~Al~~Lp~e~r~K~v~VIssDTlVEnPiV~~~v~~sL~~i 82 (488) T PRK06850 3 GEPIEELIEEIQELYCADNRPWVIGYSGGKDSTAVLQLVWNALAGLPPEKRHKPVYVISTDTLVENPVVVAWVNKSLERI 82 (488) T ss_pred HHHHHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHH T ss_conf 13799999999999863799879975877199999999999998378876689889995898756779999999999999 Q ss_pred HHHHHHCCCCEEEEEEECC-------------CCCCCCHHHHHHHH--------HHHHHHHCCCCCCCCHHHHHHHHHHH Q ss_conf 9999871898899997504-------------78778037888545--------55543200123442012345666566 Q gi|254780546|r 74 SDVCSRLRIAHSVVSWKNS-------------KPQTGLMAAAREAR--------YALISEHAKTINATLIMTAHTFDDQL 132 (423) Q Consensus 74 ~~~~~~lgi~~~~~~~~~~-------------~~~~~~~~~ar~~r--------~~~~~~~~~~~~~~~l~~ah~~dD~~ 132 (423) +..+++.|+|..+..+... .|.++- ..|-+- -+++.+...+.|-..+++|.-.+.-. T Consensus 83 ~~~A~~~~LPI~~~~v~P~~~dtFWvnlIGrGYPaP~~--~FRWCTdRLKI~P~~rFI~~~v~~~ge~IlvLGtR~~ES~ 160 (488) T PRK06850 83 KEAAKKQGLPITPHKLTPKVKDTFWVNLIGKGYPAPRR--KFRWCTERLKINPSNDFIRDKVSAFGEVIVVLGTRKAESA 160 (488) T ss_pred HHHHHHCCCCCEEEEECCCCCCCHHHHHCCCCCCCCCC--CCCCCCHHHHCCHHHHHHHHHHHHCCCEEEEEECCHHHHH T ss_conf 99998659982268814886544335420476799987--6654756651484899999998753978999854527779 Q ss_pred H-HHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCCCCCC Q ss_conf 8-9999862365444332235764100357968710121042899999899818932026 Q gi|254780546|r 133 E-TVYMRSQRDYAEKGMGLSGMCDTILYDLNLWISRPFLRCRREDIRSFLLQRNISWCED 191 (423) Q Consensus 133 E-t~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wveD 191 (423) . .-.|......|..- .++ +- ....+..++.|.-.++-+++=.|+....-||=.| T Consensus 161 ~Ra~~m~k~e~~~~r~-~Ls---~h-~~l~n~~vy~PI~~ws~ddVW~yL~~~~nPWg~~ 215 (488) T PRK06850 161 ARAQVMAKHKIGGTRD-RLS---RH-STLPNSFVYTPIEDWSNDDVWKYLLQWENPWGGS 215 (488) T ss_pred HHHHHHHHHHCCCCCC-CCC---CC-CCCCCCEEECCHHHCCCCHHHHHHHHCCCCCCCC T ss_conf 9999999875247301-257---78-8888868971255377337999986189988997 No 61 >COG2117 Predicted subunit of tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain [Translation, ribosomal structure and biogenesis] Probab=98.28 E-value=3.6e-06 Score=62.67 Aligned_cols=160 Identities=19% Similarity=0.281 Sum_probs=98.8 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEC--------- Q ss_conf 39999669424899999999999864899729999996677987846899999999987189889999750--------- Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKN--------- 91 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~--------- 91 (423) ++.|=+|||+||+.-+.+|..+ ++++..+||+.|+-++ -++.++.++.+|+||.+++.+. T Consensus 2 ~v~vLfSGGKDSSLaA~iL~kl-------gyev~LVTvnFGv~d~----~k~A~~tA~~lgF~h~vl~Ldr~ile~A~em 70 (198) T COG2117 2 DVYVLFSGGKDSSLAALILDKL-------GYEVELVTVNFGVLDS----WKYARETAAILGFPHEVLQLDREILEDAVEM 70 (198) T ss_pred CEEEEECCCCCHHHHHHHHHHH-------CCCCEEEEEEECCCCC----HHHHHHHHHHHCCCCCEECCCHHHHHHHHHH T ss_conf 5689844897105899999871-------7871799987334664----0668999998299700001688999999999 Q ss_pred ----CCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEEC Q ss_conf ----4787780378885455554320012344201234566656689999862365444332235764100357968710 Q gi|254780546|r 92 ----SKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWISR 167 (423) Q Consensus 92 ----~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~R 167 (423) ..|..+||-.-+.+ ..++ +.. ....+.-|.-.||.+-+.-.+-.+. .--.-++..++ T Consensus 71 ~iedg~P~~aIq~iH~~a-lE~~---A~r-~~~~iaDGTRRDDrvP~ls~~~~qS--------------LEdR~nv~Yi~ 131 (198) T COG2117 71 IIEDGYPRNAIQYIHEMA-LEAL---ASR-EVDRIADGTRRDDRVPKLSRSEAQS--------------LEDRLNVQYIR 131 (198) T ss_pred HHHCCCCCHHHHHHHHHH-HHHH---HHH-HHHHHCCCCCCCCCCCCCCHHHHHH--------------HHHHCCCEEEC T ss_conf 985389850889999999-9999---877-7888757874456676424988751--------------88751864553 Q ss_pred CHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCH-HHHHHHHHHH Q ss_conf 1210428999998998189320266786422101-8999764345 Q gi|254780546|r 168 PFLRCRREDIRSFLLQRNISWCEDPSNTDDRFER-VRVRRFVRDI 211 (423) Q Consensus 168 PLL~~~r~~l~~~~~~~~i~wveDpSN~d~~f~R-~rlR~~l~~l 211 (423) ||+++.+..|+..+... +..-+-||....+-+= +-+|..|... T Consensus 132 PL~G~G~kti~~Lv~~~-f~~e~~~Se~~~k~DYEaElR~lL~er 175 (198) T COG2117 132 PLLGLGYKTIRRLVSAI-FILEEGPSEKIEKADYEAELRYLLRER 175 (198) T ss_pred CCCCCCHHHHHHHHHHH-EEEECCCCCCCCCCCHHHHHHHHHHHC T ss_conf 03344678999998877-341016655555532699999999972 No 62 >TIGR03183 DNA_S_dndC putative sulfurtransferase DndC. Members of this protein family are the DndC protein from the dnd (degradation during electrophoresis) operon. The dnd phenotype reflects a sulfur-containing modification to DNA. This operon is sparsely and sporadically distributed among bactera; among the first eight examples are members from the Actinobacteria, Firmicutes, Gammaproteobacteria, Cyanobacteria. DndC is suggested to be a sulfurtransferase. Probab=98.27 E-value=2.5e-05 Score=56.67 Aligned_cols=170 Identities=16% Similarity=0.211 Sum_probs=101.3 Q ss_pred HCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCC--CCCEEEEEEEECCCCC-CC----HHHHHHHHHHHHHHCCCCEEEE Q ss_conf 527998399996694248999999999998648--9972999999667798-78----4689999999998718988999 Q gi|254780546|r 15 SLVYPAHILVAVSGGSDSMGLLIALHSVLSDRS--FGKIKFSAISVDHCLR-ET----AKDEVRYVSDVCSRLRIAHSVV 87 (423) Q Consensus 15 ~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~--~~~~~l~a~~vdHglr-~~----s~~e~~~v~~~~~~lgi~~~~~ 87 (423) .+......+||+|||+||+|+|+|.-.....-+ ...-.|++++-|.+.- |. -..-.+..+..+++.++|..+. T Consensus 9 Y~~d~~PWvIGySGGKDSTavLqLvw~Al~~Lp~e~r~K~v~VIssDTlVEnP~V~~~v~~~L~~i~~~a~~~~LPi~~~ 88 (447) T TIGR03183 9 YCADDIPWVIGYSGGKDSTAVLQLVWNALSELPPEQRTKKIHVISTDTLVENPIVAAWVNVSLERMQEAAQKQGLPIEPH 88 (447) T ss_pred HHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEE T ss_conf 85379987997587739999999999999847887576888999799875577999999999999999999749982368 Q ss_pred EEECC-------------CCCCCCHHHHHHHH--------HHHHHHHCCCCCCCCHHHHHHHHHHH-HHHHHHHHCCCCC Q ss_conf 97504-------------78778037888545--------55543200123442012345666566-8999986236544 Q gi|254780546|r 88 SWKNS-------------KPQTGLMAAAREAR--------YALISEHAKTINATLIMTAHTFDDQL-ETVYMRSQRDYAE 145 (423) Q Consensus 88 ~~~~~-------------~~~~~~~~~ar~~r--------~~~~~~~~~~~~~~~l~~ah~~dD~~-Et~l~rl~r~sg~ 145 (423) .+... -|.++.. .|-+- -+++.+...+.|...+++|.-.+.-. -.-.|....+++. T Consensus 89 ~~~P~~~dtFWvnlIGrGYPaP~~~--FRWCTdrLKI~P~~~fi~~~v~~~ge~ilvLGtR~~ES~~Ra~~i~k~e~~~~ 166 (447) T TIGR03183 89 RLTPAIEDTFWVNLIGKGYPAPRQK--FRWCTDRLKISPSNTFIRDKVAANGEVILVLGTRKNESQARAAVMEKHEGGST 166 (447) T ss_pred EECCCCCCCHHHHHCCCCCCCCCCC--CCCCCHHHHCCHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHCCCC T ss_conf 9337854430001034630799987--65255766368289999986521496899995652777999999998644670 Q ss_pred CCCCCCCCCCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCCCCCC Q ss_conf 4332235764100357968710121042899999899818932026 Q gi|254780546|r 146 KGMGLSGMCDTILYDLNLWISRPFLRCRREDIRSFLLQRNISWCED 191 (423) Q Consensus 146 ~g~~l~~~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wveD 191 (423) .. .++ +. ....+..++.|.-.++-+++=.|+....-||=.| T Consensus 167 ~~-~ls---~h-~~l~n~~vy~PI~dws~ddVW~yL~~~~~Pwg~~ 207 (447) T TIGR03183 167 RD-RLS---RN-SSLPNSWVYTPIEDWSNDDVWMYLLQNPNPWGYD 207 (447) T ss_pred HH-HCC---CC-CCCCCCEEECCHHHCCHHHHHHHHCCCCCCCCCC T ss_conf 42-127---68-8888837973067476408999863389988998 No 63 >TIGR00032 argG argininosuccinate synthase; InterPro: IPR001518 Argininosuccinate synthase (6.3.4.5 from EC) (AS) is a urea cycle enzyme that catalyzes the penultimate step in arginine biosynthesis: the ATP-dependent ligation of citrulline to aspartate to form argininosuccinate, AMP and pyrophosphate , . In humans, a defect in the AS gene causes citrullinemia, a genetic disease characterised by severe vomiting spells and mental retardation. AS is a homotetrameric enzyme of chains of about 400 amino-acid residues. An arginine seems to be important for the enzyme's catalytic mechanism. The sequences of AS from various prokaryotes, archaebacteria and eukaryotes show significant similarity.; GO: 0004055 argininosuccinate synthase activity, 0005524 ATP binding, 0006526 arginine biosynthetic process. Probab=98.27 E-value=4.3e-05 Score=55.06 Aligned_cols=160 Identities=16% Similarity=0.192 Sum_probs=97.7 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCE-EEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCC------ Q ss_conf 3999966942489999999999986489972-999999667798784689999999998718988999975047------ Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKI-KFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSK------ 93 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~-~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~------ 93 (423) ||++|+|||.|-++-+.+|.+ + .++ +++++++|=| +|+ .+|...+++.+.++|..-+ ..+|..+ T Consensus 1 KVvLAySGGLDTSv~l~wL~~----k--yG~~~Via~~~dvG-QPD-E~d~~~~~~kA~~~GA~~~-~~iDak~eFv~dy 71 (420) T TIGR00032 1 KVVLAYSGGLDTSVCLKWLEE----K--YGYEEVIAVTADVG-QPD-EEDIDAIEEKALKYGAEKH-YTIDAKEEFVKDY 71 (420) T ss_pred CEEEEEECCHHHHHHHHHHHH----H--HCCCCEEEEEEECC-CCC-HHHHHHHHHHHHHHCCCEE-EEEECHHHHHHHH T ss_conf 968886070259999999886----5--28986079997567-999-8888889999887367102-5772368899999 Q ss_pred ------------CC-----CCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf ------------87-----7803788854555543200123442012345666566899998623654443322357641 Q gi|254780546|r 94 ------------PQ-----TGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDT 156 (423) Q Consensus 94 ------------~~-----~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~ 156 (423) .+ ...-+-||-.-...+-+.+++.|+..+. |-- -|.|-+...--. -. T Consensus 72 ~f~aiqanA~Ye~~GG~~Y~L~TaLaRPlIA~~lVe~Ak~~Ga~Ava--HGC------------TGKGNDQ~RFe~--~~ 135 (420) T TIGR00032 72 LFAAIQANAVYEGTGGLVYPLSTALARPLIAKKLVEVAKKEGAEAVA--HGC------------TGKGNDQVRFER--SI 135 (420) T ss_pred HHHHHHCCCEECCCCCEECCCCCCCHHHHHHHHHHHHHHHCCCEEEE--CCC------------CCCCCCHHHHHH--HH T ss_conf 88876506603178741034766114579999999998466954987--076------------877741357658--88 Q ss_pred CCCCCCCEEECCH--HCCC-----HHHHHHHHHHHCCCCCCCCCCCCCCC--CHHHHHHHH Q ss_conf 0035796871012--1042-----89999989981893202667864221--018999764 Q gi|254780546|r 157 ILYDLNLWISRPF--LRCR-----REDIRSFLLQRNISWCEDPSNTDDRF--ERVRVRRFV 208 (423) Q Consensus 157 ~~~~~~~~i~RPL--L~~~-----r~~l~~~~~~~~i~wveDpSN~d~~f--~R~rlR~~l 208 (423) .....++.++-|. +.++ |+|..+|+.++||++ |++.+-.| |+|-+=..+ T Consensus 136 ~~~~p~LkviaP~R~~~~~e~lgGR~e~~eYa~~~Gip~---p~~~~K~YSiD~Nl~grs~ 193 (420) T TIGR00032 136 RALNPDLKVIAPWRDLNLTEELGGREEEIEYAAQKGIPV---PMTKEKPYSIDENLWGRSI 193 (420) T ss_pred HHHCCCCEEECCCCCHHHHHHHCCCHHHHHHHHHCCCCC---CCCCCCCCCHHHHHHHHHH T ss_conf 752689858747346467764059488999999668887---6546777762366775555 No 64 >PRK01269 thiamine biosynthesis protein ThiI; Provisional Probab=98.25 E-value=2.7e-05 Score=56.49 Aligned_cols=146 Identities=15% Similarity=0.100 Sum_probs=94.4 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHH-HHHHCC----CCEEEEEEECC Q ss_conf 9983999966942489999999999986489972999999667798784689999999-998718----98899997504 Q gi|254780546|r 18 YPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSD-VCSRLR----IAHSVVSWKNS 92 (423) Q Consensus 18 ~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~-~~~~lg----i~~~~~~~~~~ 92 (423) ...++++=+|||-||-+-.+++-. + +.+++.+|+|.|-. .+..-...+.. +.+++| +.++++..... T Consensus 176 tqGkvL~LlSGGiDSPVAa~~mmK----R---G~~v~~l~F~lg~~-~~e~~V~~va~~L~~~~~~~~~vr~~~V~f~~v 247 (483) T PRK01269 176 TQEDVLSLISGGFDSGVASYMLMR----R---GSRVHYCFFNLGGA-AHEIGVKQVAHYLWNRYGSSHRVRFISVDFEPV 247 (483) T ss_pred CCCCEEEEECCCCCCHHHHHHHHH----C---CCEEEEEEECCCCH-HHHHHHHHHHHHHHHHHCCCCCEEEEEECCHHH T ss_conf 778689995278883888999984----5---98799999579986-889999999999999738888708999767999 Q ss_pred -------CCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEE Q ss_conf -------7877803788854555543200123442012345666566899998623654443322357641003579687 Q gi|254780546|r 93 -------KPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWI 165 (423) Q Consensus 93 -------~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i 165 (423) -+..-..-..|+.-|+.....+.+.++..+.||-.+.-.+-..+-++.--+ ..-+..+ T Consensus 248 ~~eI~~~v~~~~~~vv~KR~M~R~A~~iA~~~g~~ALVTGESLGQVASQTL~NL~~i~---------------~~~~~pV 312 (483) T PRK01269 248 VGEILEKVDNGQMGVVLKRMMLRAASKVAERYGIQALVTGEALGQVSSQTLTNLRLID---------------NVTDTLI 312 (483) T ss_pred HHHHHHHCCHHHEEHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHHHH---------------HHCCCCE T ss_conf 9999873876344399999999999999998599889855503456776787889988---------------7407740 Q ss_pred ECCHHCCCHHHHHHHHHHHCC Q ss_conf 101210428999998998189 Q gi|254780546|r 166 SRPFLRCRREDIRSFLLQRNI 186 (423) Q Consensus 166 ~RPLL~~~r~~l~~~~~~~~i 186 (423) +|||+++.|+||.+.+++.|- T Consensus 313 lRPLIg~DK~eII~~Ar~IGT 333 (483) T PRK01269 313 LRPLIAMDKEDIIDLARQIGT 333 (483) T ss_pred ECCCCCCCHHHHHHHHHHHCH T ss_conf 047667988999999998392 No 65 >COG0301 ThiI Thiamine biosynthesis ATP pyrophosphatase [Coenzyme metabolism] Probab=98.14 E-value=6.3e-05 Score=53.85 Aligned_cols=145 Identities=13% Similarity=0.107 Sum_probs=93.5 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHH-HHHHHH----HHHHCCCCEEEEEEE-- Q ss_conf 9983999966942489999999999986489972999999667798784689-999999----998718988999975-- Q gi|254780546|r 18 YPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDE-VRYVSD----VCSRLRIAHSVVSWK-- 90 (423) Q Consensus 18 ~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e-~~~v~~----~~~~lgi~~~~~~~~-- 90 (423) ...++++=+|||.||-+-.+++-. + +.++.++|.+- -|...++ .+.+.. .+...+-.......+ T Consensus 174 t~Gk~l~LlSGGIDSPVA~~l~mk----R---G~~v~~v~f~~--~p~~~~~a~~k~~~l~~~~~~~~~~~~~~~~v~f~ 244 (383) T COG0301 174 TQGKVLLLLSGGIDSPVAAWLMMK----R---GVEVIPVHFGN--PPYTSEKAREKVVALALLRLTSYGGKVRLYVVPFT 244 (383) T ss_pred CCCCEEEEEECCCCHHHHHHHHHH----C---CCEEEEEEECC--CCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHH T ss_conf 677489997078774999999985----6---98779999727--99851889999999986430246872289997629 Q ss_pred -------CCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf -------0478778037888545555432001234420123456665668999986236544433223576410035796 Q gi|254780546|r 91 -------NSKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNL 163 (423) Q Consensus 91 -------~~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~ 163 (423) ...+..-..-..|+.-|+.....+++.|+..+.||-.+...+-..+-++.-- ...-+. T Consensus 245 ~v~~~i~~~~~~~y~~v~~rR~M~riA~~iae~~g~~aIvtGEsLGQVASQTl~nL~~i---------------~~~t~~ 309 (383) T COG0301 245 EVQEEILEKVPESYRCVLLKRMMYRIAEKLAEEFGAKAIVTGESLGQVASQTLENLRVI---------------DSVTNT 309 (383) T ss_pred HHHHHHHHHCCCCCEEHHHHHHHHHHHHHHHHHHCCEEEEECCCCHHHHHHHHHHHHHH---------------HHCCCC T ss_conf 99999986367530204789999999999999839839983686104667678889999---------------815587 Q ss_pred EEECCHHCCCHHHHHHHHHHHCC Q ss_conf 87101210428999998998189 Q gi|254780546|r 164 WISRPFLRCRREDIRSFLLQRNI 186 (423) Q Consensus 164 ~i~RPLL~~~r~~l~~~~~~~~i 186 (423) .++|||+++.|+||.+.+++.|. T Consensus 310 pIlRPLI~~DK~eIi~~Ar~IgT 332 (383) T COG0301 310 PVLRPLIGLDKEEIIEIARRIGT 332 (383) T ss_pred CEECCCCCCCHHHHHHHHHHHCC T ss_conf 42314457998999999998297 No 66 >COG0171 NadE NAD synthase [Coenzyme metabolism] Probab=98.10 E-value=0.00012 Score=51.86 Aligned_cols=156 Identities=17% Similarity=0.147 Sum_probs=95.2 Q ss_pred HHHHHHH-HC--CCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCE Q ss_conf 9999998-52--79983999966942489999999999986489972999999667798784689999999998718988 Q gi|254780546|r 8 SVRFFVR-SL--VYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAH 84 (423) Q Consensus 8 ~~~~~~~-~l--~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~ 84 (423) .+..|++ .+ ..-.+++|++|||.||+.-+.++.+.... +....++.++...++-- +..+.+.++..|+.+|+.. T Consensus 11 ~~v~fl~~~l~~~~~k~~VlGiSGGiDSa~~~~La~~A~~~-~~~~~~~~av~mP~~~~--~~~~~~da~~~~~~lg~~~ 87 (268) T COG0171 11 RLVDFLRDYLKKAGFKGVVLGLSGGIDSALVLALAVRALGK-GDSKENVLAVRLPYGYT--VQADEEDAQDLAEALGIDY 87 (268) T ss_pred HHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHCC-CCCHHHEEEEECCCCCC--CCCCHHHHHHHHHHHCCCE T ss_conf 99999999999739998699766681999999999998565-65124326686788776--5347999999999829966 Q ss_pred EEEEEECC---------CCC-------CCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 99997504---------787-------78037888545555432001234420123456665668999986236544433 Q gi|254780546|r 85 SVVSWKNS---------KPQ-------TGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGM 148 (423) Q Consensus 85 ~~~~~~~~---------~~~-------~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~ 148 (423) .+..+... ... -..++.-.+.|-..++..+...+..++-|+|-.. . ..|-..+ T Consensus 88 ~~i~I~~~v~~~~~~~~~~~~~~~~~~~~~~NikaR~Rm~~lY~~An~~~~lVlGTgn~sE----~-----~~Gy~Tk-- 156 (268) T COG0171 88 KEINIKPAVDAFLKKLLKLFLGIYLEDLALGNIKARLRMVILYAIANKLGGLVLGTGNKSE----L-----ALGYFTK-- 156 (268) T ss_pred EEEECHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCHHH----H-----HCCCEEC-- T ss_conf 9975288899988766665234421127776450999999999998555978975884877----8-----6074432-- Q ss_pred CCCCCCCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCC Q ss_conf 223576410035796871012104289999989981893 Q gi|254780546|r 149 GLSGMCDTILYDLNLWISRPFLRCRREDIRSFLLQRNIS 187 (423) Q Consensus 149 ~l~~~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~ 187 (423) .+.+.-=+-|+-++.|.++.+.++..|++ T Consensus 157 ----------yGDg~~d~~Pi~~L~KtqV~~La~~l~ip 185 (268) T COG0171 157 ----------YGDGAVDINPIADLYKTQVYALARHLGIP 185 (268) T ss_pred ----------CCCCCCCHHHHCCCCHHHHHHHHHHCCCC T ss_conf ----------06764470541487589999999873999 No 67 >COG3969 Predicted phosphoadenosine phosphosulfate sulfotransferase [General function prediction only] Probab=98.08 E-value=1.4e-05 Score=58.56 Aligned_cols=168 Identities=15% Similarity=0.176 Sum_probs=92.6 Q ss_pred HCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHH-CCCCEE----EEEE Q ss_conf 52799839999669424899999999999864899729999996677987846899999999987-189889----9997 Q gi|254780546|r 15 SLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSR-LRIAHS----VVSW 89 (423) Q Consensus 15 ~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~-lgi~~~----~~~~ 89 (423) -++.-++|.|++|||+||.+||||+.++.++.+..+ |.|+|+|.--+- ..-..+|++.... .++--. .+.+ T Consensus 23 if~~f~~VcVSFSGGKDS~lmLhL~~~~ar~~~~~~--i~VlfiD~E~QY--s~TidyV~em~~~~~dv~~~~yWvcLPl 98 (407) T COG3969 23 IFNTFPRVCVSFSGGKDSGLMLHLVAEVARENGRDK--ISVLFIDWEAQY--SCTIDYVQEMRESYHDVIETFYWVCLPL 98 (407) T ss_pred HHHCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCC--EEEEEECCHHHH--HHHHHHHHHHHHCCCCCCCCCEEEEEEH T ss_conf 884388389992378740589999999999819886--579997004555--5689999998730457555405999631 Q ss_pred ----------------ECC--------CCCC-CCHHHH----------HHHHHHHHHHHCC-CCCCCCHHHHHHHHHHHH Q ss_conf ----------------504--------7877-803788----------8545555432001-234420123456665668 Q gi|254780546|r 90 ----------------KNS--------KPQT-GLMAAA----------REARYALISEHAK-TINATLIMTAHTFDDQLE 133 (423) Q Consensus 90 ----------------~~~--------~~~~-~~~~~a----------r~~r~~~~~~~~~-~~~~~~l~~ah~~dD~~E 133 (423) +.. -|.. -+.+.+ -+.....+..+.. +..+..+++|.-+|.-.. T Consensus 99 ~t~na~S~~qp~W~~Wep~~e~~WVR~~P~~~ii~d~~~F~Fyr~~M~feeFv~~F~~Wl~~~~~~ta~LvGiRadESlN 178 (407) T COG3969 99 TTQNALSQYQPEWICWEPGTEVDWVRQPPEQVAITDPAFFPFYRYGMTFEEFVPAFAAWLSQKRPATAVLVGIRADESLN 178 (407) T ss_pred HCCCCHHHCCCEEECCCCCCCCCCCCCCCHHCCCCCCCCCCCEECCCCHHHHHHHHHHHHHCCCCCEEEEEEECCHHHHH T ss_conf 10133221485240479998332003785211005787664000256599999999999833688618998623055578 Q ss_pred HHHHHHHCCCCCCC-CCCCCCCCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCCC Q ss_conf 99998623654443-32235764100357968710121042899999899818932 Q gi|254780546|r 134 TVYMRSQRDYAEKG-MGLSGMCDTILYDLNLWISRPFLRCRREDIRSFLLQRNISW 188 (423) Q Consensus 134 t~l~rl~r~sg~~g-~~l~~~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~w 188 (423) -+.--..+..+-.. -+.+ .......+.++-+=|+-+..-++|=.+-...+..+ T Consensus 179 Rf~ai~~~~k~~~~~~~pW--tt~~~~~~~~~~~yPiYDW~~eDiW~~~Ak~~~~y 232 (407) T COG3969 179 RFNAIARKEKLRFADDKPW--TTRIFPNGHVWTFYPIYDWKVEDIWTANAKFSYAY 232 (407) T ss_pred HHHHHHHHHHCCCCCCCCC--EEEECCCCCEEEEEECCCCHHHHHHHHHHHCCCCC T ss_conf 8998877653124799885--00015887259987412100888999987607742 No 68 >PTZ00323 NAD+ synthase; Provisional Probab=97.97 E-value=0.00025 Score=49.64 Aligned_cols=84 Identities=13% Similarity=0.067 Sum_probs=50.6 Q ss_pred CCHHHHHHHHHHH----C--CCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 6989999999985----2--799839999669424899999999999864899729999996677987846899999999 Q gi|254780546|r 3 LSPIESVRFFVRS----L--VYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDV 76 (423) Q Consensus 3 ~~p~~~~~~~~~~----l--~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~ 76 (423) +.|.+-|++...- + ++-..++|++|||.||...+.++.+-......+..++..+..-. ..|+...+..++. T Consensus 24 ~dp~~eI~~rv~fLrDYv~k~GfkgvVLGlSGGIDSAl~aaLA~~Alg~env~~~r~~gv~~P~---~ss~~s~~~a~~~ 100 (294) T PTZ00323 24 FDPVAWIEMKCAKLNEYMRRCGLKACVTSVSGGIDSAVVLALCSRAMRMPNSPIVRNVGICQPI---CSSAWALARGREN 100 (294) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCC---CCCHHHHHHHHHH T ss_conf 8989999999999999999829985999586369999999999998655457530356776785---4576659879999 Q ss_pred HHHCCCCEEEEEE Q ss_conf 9871898899997 Q gi|254780546|r 77 CSRLRIAHSVVSW 89 (423) Q Consensus 77 ~~~lgi~~~~~~~ 89 (423) ++.+|+...+... T Consensus 101 a~~~g~~~~~~~~ 113 (294) T PTZ00323 101 IAACGATEVVVDQ 113 (294) T ss_pred HHHCCCCEEECCC T ss_conf 9976885232360 No 69 >cd01984 AANH_like Adenine nucleotide alpha hydrolases superfamily including N type ATP PPases, ATP sulphurylases Universal Stress Response protein and electron transfer flavoprotein (ETF). The domain forms a apha/beta/apha fold which binds to Adenosine nucleotide. Probab=97.95 E-value=7.7e-06 Score=60.33 Aligned_cols=69 Identities=32% Similarity=0.289 Sum_probs=54.9 Q ss_pred EEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHH Q ss_conf 99996694248999999999998648997299999966779878468999999999871898899997504787780378 Q gi|254780546|r 22 ILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSKPQTGLMAA 101 (423) Q Consensus 22 i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~~~~~~~~~ 101 (423) |+|++|||.||..++++++++. . ...++.+++++ T Consensus 1 ilv~~Sgg~dS~~~~~la~~~~--~--~~~~~~~~~~~------------------------------------------ 34 (86) T cd01984 1 ILVALSGGLDSSVLLHLAKRLK--S--GGPEVVALVVV------------------------------------------ 34 (86) T ss_pred CEEEECCCCCHHHHHHHHHHHH--C--CCCEEEEEEHH------------------------------------------ T ss_conf 9788617853899999999852--5--99808976389------------------------------------------ Q ss_pred HHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 88545555432001234420123456665668999986 Q gi|254780546|r 102 AREARYALISEHAKTINATLIMTAHTFDDQLETVYMRS 139 (423) Q Consensus 102 ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl 139 (423) .+.+.+.+...+.++..+++|||.+|+.++..+.. T Consensus 35 ---~~~~~~~~~~~~~~~~~i~~G~~~~~~~~~~~~~~ 69 (86) T cd01984 35 ---AFVRILKRLAAEEGADVIILGHNADDVAGRRLGAS 69 (86) T ss_pred ---HHHHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCC T ss_conf ---99999999877608988998677324533101665 No 70 >PRK02628 nadE NAD synthetase; Reviewed Probab=97.93 E-value=0.0001 Score=52.36 Aligned_cols=71 Identities=15% Similarity=0.222 Sum_probs=48.1 Q ss_pred CCCCEEEEEECCCHHHH-HHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEE Q ss_conf 79983999966942489-999999999986489972999999667798784689999999998718988999975 Q gi|254780546|r 17 VYPAHILVAVSGGSDSM-GLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWK 90 (423) Q Consensus 17 ~~~~~i~vAvSGG~DS~-aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~ 90 (423) +.-.+++|++|||.||+ |||.++.. ...-+.+.-+|++++.=- +.-|+.-.....++|+.|||.+..+.++ T Consensus 360 ~g~~~vViGLSGGiDSaLaLLVaa~A-~d~Lg~~r~~V~~vtMPs--~~TS~~S~~dA~~La~~LGi~~~~I~I~ 431 (678) T PRK02628 360 TGIKKVVIGISGGLDSTLALLVAAKA-FDRLGLPRKNILAYTMPG--FGTTDRTKNNAVALMKALGVTAREIDIR 431 (678) T ss_pred CCCCEEEEECCCCHHHHHHHHHHHHH-HHHHCCCHHCEEEEECCC--CCCCHHHHHHHHHHHHHCCCCEEEEEHH T ss_conf 49981899677764479999999999-998489712248997788--7657878999999999729977997629 No 71 >PRK13981 NAD synthetase; Provisional Probab=97.85 E-value=0.00017 Score=50.89 Aligned_cols=139 Identities=14% Similarity=0.118 Sum_probs=82.8 Q ss_pred CCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECC---- Q ss_conf 7998399996694248999999999998648997299999966779878468999999999871898899997504---- Q gi|254780546|r 17 VYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNS---- 92 (423) Q Consensus 17 ~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~---- 92 (423) +.-.+++|++|||.||...+.++.+-+. .-+|++++.=.. ..|+.-.+..+++|+.|||.++++.++.. T Consensus 279 ~gf~~vviGLSGGiDSaL~aaiA~dALG-----~~nV~~v~MPs~--~tS~~s~~dA~~La~~LGi~~~~i~I~~~~~~~ 351 (543) T PRK13981 279 NGFPGVVLGLSGGIDSALVAAIAVDALG-----AENVRAVMMPSR--YTSDESLDDAAALARNLGVRYDIIPIEPAFEAF 351 (543) T ss_pred HCCCEEEEECCCCHHHHHHHHHHHHHHC-----HHHEEEEECCCC--CCCHHHHHHHHHHHHHHCCCEEEEEHHHHHHHH T ss_conf 0897499976788359999999998538-----343788645877--786656999999999959974875149999999 Q ss_pred -----------C---CCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf -----------7---87780378885455554320012344201234566656689999862365444332235764100 Q gi|254780546|r 93 -----------K---PQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTIL 158 (423) Q Consensus 93 -----------~---~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~ 158 (423) . ...|+|+..| -..+...+.+.|.-.+.|| |-.|.- ..-+...|- |+ T Consensus 352 ~~~l~~~f~~~~~dvt~ENiQAR~R---~~iLm~laN~~g~lvl~Tg----nkSE~a----vGy~TlyGD----~~---- 412 (543) T PRK13981 352 EAALAPLFAGTEPDITEENLQSRIR---GTLLMALSNKFGSLVLTTG----NKSEMA----VGYATLYGD----MA---- 412 (543) T ss_pred HHHHHHHHCCCCCCCCHHHHHHHHH---HHHHHHHHHCCCCEEECCC----CHHHHH----HCHHHHCCC----CC---- T ss_conf 9974044168766652234548889---9999998733896796478----672787----474653376----55---- Q ss_pred CCCCCEEECCHHCCCHHHHHHHHHHHCC Q ss_conf 3579687101210428999998998189 Q gi|254780546|r 159 YDLNLWISRPFLRCRREDIRSFLLQRNI 186 (423) Q Consensus 159 ~~~~~~i~RPLL~~~r~~l~~~~~~~~i 186 (423) + -+-|+-+++|.++.+.++-.+- T Consensus 413 --g---~~avi~dv~KT~V~~L~r~~n~ 435 (543) T PRK13981 413 --G---GFAPIKDVYKTLVYRLCRWRNT 435 (543) T ss_pred --C---CCCCCCCCCHHHHHHHHHHHHH T ss_conf --6---7430368659999999999865 No 72 >TIGR00342 TIGR00342 thiamine biosynthesis/tRNA modification protein ThiI; InterPro: IPR003720 Thiamine pyrophosphate (TPP) is synthesized de novo in many bacteria and is a required cofactor for many enzymes in the cell. ThiI is required for thiazole synthesis in the thiamine biosynthesis pathway . Almost all the proteins in this group have an N-terminal THUMP domain (see IPR004114 from INTERPRO).; GO: 0009228 thiamin biosynthetic process. Probab=97.84 E-value=0.00017 Score=50.82 Aligned_cols=142 Identities=18% Similarity=0.131 Sum_probs=96.6 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHH-HHHHHHHHHH---HCCCCEEEE--EEEC-- Q ss_conf 8399996694248999999999998648997299999966779878468-9999999998---718988999--9750-- Q gi|254780546|r 20 AHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKD-EVRYVSDVCS---RLRIAHSVV--SWKN-- 91 (423) Q Consensus 20 ~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~-e~~~v~~~~~---~lgi~~~~~--~~~~-- 91 (423) .+++.=+|||-||=+-.+++=+ + +.+++++|.+- .|...+ -.+.|+.++. ..+..+.+. .++- T Consensus 187 Gkvl~LlSGGiDSPVAaf~~m~----R---Gc~V~~vhf~~--~~~~~~~~~~kv~~la~~~~~~~~~~~~~l~~~~F~~ 257 (391) T TIGR00342 187 GKVLALLSGGIDSPVAAFLAMK----R---GCRVVAVHFFN--EPAASEKAREKVERLANLLSLNETGGSVKLYVVDFTD 257 (391) T ss_pred HHHHHHHCCCCCHHHHHHHHHH----C---CCEEEEEEECC--CCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEECHHH T ss_conf 3632041188351679999966----3---87799997328--8552468999999999988540003799999854389 Q ss_pred -------CCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf -------4787780378885455554320012344201234566656689999862365444332235764100357968 Q gi|254780546|r 92 -------SKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLW 164 (423) Q Consensus 92 -------~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~ 164 (423) ..+..-..-..|+.-|+.....+...+|.+++||-.+.-.+=-.+-++.--. ..-+.. T Consensus 258 ~~~~~~~~~~e~~~~v~~rR~M~~~A~~~ae~~g~~A~VTGe~LGQVASQTL~Nl~vI~---------------~~~~~~ 322 (391) T TIGR00342 258 VQEEIIEIIPEKYTMVLCRRLMLKIASKVAEKEGCLAIVTGESLGQVASQTLENLRVIQ---------------AVVNTP 322 (391) T ss_pred HHHHHHHCCCCCCEEEEHHHHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHH---------------HCCCCC T ss_conf 99999841899878874066799999988886099489976634203777888999997---------------237874 Q ss_pred EEC-CHHCCCHHHHHHHHHHHC Q ss_conf 710-121042899999899818 Q gi|254780546|r 165 ISR-PFLRCRREDIRSFLLQRN 185 (423) Q Consensus 165 i~R-PLL~~~r~~l~~~~~~~~ 185 (423) |+| ||.++.|++|.+.|++.| T Consensus 323 iL~RPLIg~DK~~Ii~~Ak~Ig 344 (391) T TIGR00342 323 ILRRPLIGMDKEEIIELAKEIG 344 (391) T ss_pred EEECCCCCCCHHHHHHHHHHCC T ss_conf 2407755479789999974139 No 73 >COG1365 Predicted ATPase (PP-loop superfamily) [General function prediction only] Probab=97.77 E-value=0.00011 Score=52.07 Aligned_cols=148 Identities=17% Similarity=0.149 Sum_probs=81.7 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECC---CC-- Q ss_conf 8399996694248999999999998648997299999966779878468999999999871898899997504---78-- Q gi|254780546|r 20 AHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNS---KP-- 94 (423) Q Consensus 20 ~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~---~~-- 94 (423) -+++||+|||+||+|-..+++.. .++||.|-----++--..+...|.-+|+.+..+.-+.. +. T Consensus 61 ~kiaVA~SGG~DSsas~iilR~~------------g~~v~p~t~~Lp~~ir~n~~~l~~~lg~~p~yveedl~~i~kGal 128 (255) T COG1365 61 PKIAVAYSGGVDSSASAIILRWA------------GFTVDPGTAILPDHIRRNKEELETLLGEVPEYVEEDLEDIEKGAL 128 (255) T ss_pred CEEEEEECCCCCHHHHHHHHHHH------------CEEECCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHC T ss_conf 43899842786427889998862------------104146321387788678999989972577789989999986320 Q ss_pred --CCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEE-ECCHHC Q ss_conf --77803788854555543200123442012345666566899998623654443322357641003579687-101210 Q gi|254780546|r 95 --QTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWI-SRPFLR 171 (423) Q Consensus 95 --~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i-~RPLL~ 171 (423) .--.+-.+-..-.......+++.++..+++|.-+ +.|........+-+++ +--++. T Consensus 129 nGRfhpCGRCh~~I~~~V~~k~re~di~~vafGDlL---------------------s~G~~svy~eD~i~rlnlPAflA 187 (255) T COG1365 129 NGRFHPCGRCHSMIENAVMDKARELDIDVVAFGDLL---------------------STGYGSVYREDGIFRLNLPAFLA 187 (255) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCC---------------------CCCCCCEECCCCEEEECCHHHHH T ss_conf 587787523789999999999985387089974632---------------------34663212058879972588885 Q ss_pred CCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 428999998998189320266786422101899976 Q gi|254780546|r 172 CRREDIRSFLLQRNISWCEDPSNTDDRFERVRVRRF 207 (423) Q Consensus 172 ~~r~~l~~~~~~~~i~wveDpSN~d~~f~R~rlR~~ 207 (423) .+|.|+++.++.+++.. +.+|-=..+|.. T Consensus 188 ltK~Elr~il~~~~~e~-------~~kygCPll~e~ 216 (255) T COG1365 188 LTKDELRSILKWNGYEL-------EMKYGCPLLREV 216 (255) T ss_pred HCCHHHHHHHHHCCCCC-------HHCCCCCHHHHH T ss_conf 17199999987138430-------011588358999 No 74 >cd01994 Alpha_ANH_like_IV This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins is predicted to bind ATP. This domainhas a strongly conserved motif SGGKD at the N terminus. Probab=97.74 E-value=0.0016 Score=44.00 Aligned_cols=147 Identities=10% Similarity=0.067 Sum_probs=87.2 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCH----HHHHHHHHHHHHHCCCCEEEEEEECCCCCC Q ss_conf 3999966942489999999999986489972999999667798784----689999999998718988999975047877 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETA----KDEVRYVSDVCSRLRIAHSVVSWKNSKPQT 96 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s----~~e~~~v~~~~~~lgi~~~~~~~~~~~~~~ 96 (423) |+++-.|||+||+--++.+.+ .+. .....+.++... .+| .--.+.+...++.+|||+....++..+ T Consensus 1 Kv~~l~SGGKDS~lAl~~a~~----~g~-~v~~L~~~~~~~--~~~~~~H~~~~~l~~~qAealgiPl~~~~~~~~~--- 70 (194) T cd01994 1 KVVALISGGKDSCYALYRALE----EGH-EVVALLNLTPEE--GSSMMYHTVNHELLELQAEAMGIPLIRIEISGEE--- 70 (194) T ss_pred CEEEEECCCHHHHHHHHHHHH----CCC-EEEEEEEEECCC--CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCC--- T ss_conf 989997785999999999998----699-259999996399--8805250557899999999859966999668987--- Q ss_pred CCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCHHCCCHHH Q ss_conf 80378885455554320012344201234566656689999862365444332235764100357968710121042899 Q gi|254780546|r 97 GLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWISRPFLRCRRED 176 (423) Q Consensus 97 ~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~RPLL~~~r~~ 176 (423) +.... .+.......++.++..+++|.-.-+..-+..-+.+... ++..+-||-+....+ T Consensus 71 ---~~~~~-~l~~~L~~~k~~gi~~vv~GdI~s~~qr~~~e~~c~~l------------------gl~~~~PLW~~~~~~ 128 (194) T cd01994 71 ---EDEVE-DLKELLRKLKEEGVDAVVFGAILSEYQRTRVERVCERL------------------GLEPLAPLWGRDQEE 128 (194) T ss_pred ---CHHHH-HHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHC------------------CCEEECHHCCCCHHH T ss_conf ---17999-99999999997599599999633288999999999973------------------988870010799999 Q ss_pred HHHHHHHHCCCCC---CCCCCCCCCC Q ss_conf 9998998189320---2667864221 Q gi|254780546|r 177 IRSFLLQRNISWC---EDPSNTDDRF 199 (423) Q Consensus 177 l~~~~~~~~i~wv---eDpSN~d~~f 199 (423) +..-.-+.|+..+ -|..-.|..| T Consensus 129 ll~e~i~~Gf~aiiv~V~a~~L~~~~ 154 (194) T cd01994 129 LLREMIEAGFKAIIIKVAAEGLDESW 154 (194) T ss_pred HHHHHHHCCCEEEEEEEECCCCCHHH T ss_conf 99999987990999995026899689 No 75 >PRK08384 thiamine biosynthesis protein ThiI; Provisional Probab=97.66 E-value=0.00094 Score=45.57 Aligned_cols=130 Identities=9% Similarity=0.057 Sum_probs=81.2 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCC Q ss_conf 99839999669424899999999999864899729999996677987846899999999987189889999750478778 Q gi|254780546|r 18 YPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSKPQTG 97 (423) Q Consensus 18 ~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~~~~~ 97 (423) ...++++=+|||- |-+-.+++-. + +.++..+|++++ +.+.+-+..+.+...+++......-+.. + . T Consensus 178 s~Gk~l~LlSGGi-SPVAa~~mmK----R---G~~v~~vhf~~~--~~~~~kv~~l~~~L~~y~~~~~~~~~~~-~---~ 243 (310) T PRK08384 178 TQGKMVGLLSDEL-SAVAIFLMMK----R---GVEVIPVHIGMG--EKNLEKVRKLWNQLKKYSYGSKGRLVVV-K---N 243 (310) T ss_pred CCCCEEEEECCCC-CHHHHHHHHH----C---CCEEEEEEECCC--HHHHHHHHHHHHHHHHHCCCCCEEEEEE-C---H T ss_conf 7884899953886-3999999985----6---987999985688--7899999999999998679984699996-6---5 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCHHCCCHHHH Q ss_conf 03788854555543200123442012345666566899998623654443322357641003579687101210428999 Q gi|254780546|r 98 LMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWISRPFLRCRREDI 177 (423) Q Consensus 98 ~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~RPLL~~~r~~l 177 (423) +......+++.++..+.||-.+...+-..+-++.-- ...-+..++|||+++.|+|| T Consensus 244 ---------~~~a~~ia~~~~~~alvTGEsLGQVASQTl~nl~~i---------------~~~~~~PVlRPLIg~DK~EI 299 (310) T PRK08384 244 ---------FERVNKIIRDFGAKGVVKGLRPEQLASETLENIYED---------------SRMFDVPVYYPLIALPDEYI 299 (310) T ss_pred ---------HHHHHHHHHHHCCCEEEECCCHHHHHHHHHHHHHHH---------------HHCCCCCCCCCCCCCCHHHH T ss_conf ---------999999999859989997741255454579889999---------------73048873038568998999 Q ss_pred HHHHHHHC Q ss_conf 99899818 Q gi|254780546|r 178 RSFLLQRN 185 (423) Q Consensus 178 ~~~~~~~~ 185 (423) .+.+++.| T Consensus 300 I~~Ar~IG 307 (310) T PRK08384 300 EKVKEKIG 307 (310) T ss_pred HHHHHHCC T ss_conf 99998638 No 76 >TIGR03108 eps_aminotran_1 exosortase 1 system-associated amidotransferase 1. The predicted protein-sorting transpeptidase that we call exosortase (see TIGR02602) has distinct subclasses that associated with different types of exopolysaccharide production loci. This model represents a distinct clade among a set of amidotransferases largely annotated (not necessarily accurately) as glutatime-hydrolyzing asparagine synthases. Members of this clade are essentially restricted to the characteristic exopolysaccharide (EPS) regions that contain the exosortase 1 genome (xrtA), in genomes that also have numbers of PEP-CTERM domain (TIGR02595) proteins. Probab=97.65 E-value=0.0002 Score=50.27 Aligned_cols=70 Identities=21% Similarity=0.245 Sum_probs=51.5 Q ss_pred HCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECC Q ss_conf 527998399996694248999999999998648997299999966779878468999999999871898899997504 Q gi|254780546|r 15 SLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNS 92 (423) Q Consensus 15 ~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~ 92 (423) .+...-.|++-+|||.||++++.++.... . -.+.+++|... ...-+|..+++++++.+|++|++..++.. T Consensus 254 rl~sDVpvG~~LSGGlDSS~I~a~~~~~~----~--~~i~tfsi~f~--~~~~DE~~~A~~vA~~~g~~h~~~~~~~~ 323 (628) T TIGR03108 254 RMVADVPLGAFLSGGVDSSAVVALMAGLS----D--TPVNTCSIAFD--DPAFDESAYARQVAERYGTNHRVETVDPD 323 (628) T ss_pred HHCCCCCEEEEECCCCCHHHHHHHHHHHC----C--CCCCEEEEECC--CCCCCHHHHHHHHHHHHCCCCEEEECCHH T ss_conf 80578856876468863689999999842----9--98643665348--88754369999999860876479977989 No 77 >COG0367 AsnB Asparagine synthase (glutamine-hydrolyzing) [Amino acid transport and metabolism] Probab=97.56 E-value=0.00076 Score=46.21 Aligned_cols=106 Identities=22% Similarity=0.278 Sum_probs=69.2 Q ss_pred HCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCH-HHHHHHHHHHHHHCCCCEEEEEEECCC Q ss_conf 5279983999966942489999999999986489972999999667798784-689999999998718988999975047 Q gi|254780546|r 15 SLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETA-KDEVRYVSDVCSRLRIAHSVVSWKNSK 93 (423) Q Consensus 15 ~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s-~~e~~~v~~~~~~lgi~~~~~~~~~~~ 93 (423) .+.....+++-+|||.||+.++.+++....... ....+|+. +++ ..|..+.+..++.+|.+|+...+.... T Consensus 226 r~~advpvg~~lSGGlDSS~Iaa~a~~~~~~~~-----~~~fsvg~---~~~~~~D~~~a~~~A~~lg~~h~~~~~~~~e 297 (542) T COG0367 226 RLVADVPVGVFLSGGLDSSLIAAIAAEELGKEG-----KTTFTVGF---EDSDSPDAKYARAVAKFLGTPHHEIILTNEE 297 (542) T ss_pred HHCCCCCEEEEECCCHHHHHHHHHHHHHHCCCC-----CEEEEEEC---CCCCCCHHHHHHHHHHHHCCCCEEEEECHHH T ss_conf 621588789996776229999999998623455-----21589854---8999707999999999859985799617899 Q ss_pred ------------CCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH Q ss_conf ------------8778037888545555432001234420123456665 Q gi|254780546|r 94 ------------PQTGLMAAAREARYALISEHAKTINATLIMTAHTFDD 130 (423) Q Consensus 94 ------------~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD 130 (423) ...+.. .-..-..++...+++.+..++++|.-+|. T Consensus 298 ~~~~~~~vv~~~~~p~~~--~~~~ply~~~~~a~~~g~kVvLSGeGADE 344 (542) T COG0367 298 LLNALPEVVKALDTPGGM--AASIPLYLLSRKARAEGEKVVLSGEGADE 344 (542) T ss_pred HHHHHHHHHHHHCCCCCC--HHHHHHHHHHHHHHHCCCEEEECCCCHHH T ss_conf 999999999870788620--04579999998765338789823874888 No 78 >PRK09431 asnB asparagine synthetase B; Provisional Probab=97.55 E-value=0.00037 Score=48.47 Aligned_cols=111 Identities=19% Similarity=0.241 Sum_probs=66.0 Q ss_pred HCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCC-------CEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 52799839999669424899999999999864899-------72999999667798784689999999998718988999 Q gi|254780546|r 15 SLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFG-------KIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVV 87 (423) Q Consensus 15 ~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~-------~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~ 87 (423) ++.....|++-+|||.||+.++.+++......... ...++..+| |+ ++| .|..+.+.+++.+|..|+.+ T Consensus 223 rL~sDvpvG~~LSGGLDSSlIaala~k~~~~~~~~~~~~~~~~~~l~tFsi--G~-~~s-~D~~~Ar~vA~~lgt~h~ei 298 (555) T PRK09431 223 RLMSDVPYGVLLSGGLDSSLISAIAKKYAARRIEDDERSEAWWPQLHSFAV--GL-EGS-PDLKAAREVADYLGTVHHEI 298 (555) T ss_pred HHCCCCCEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEE--CC-CCC-CHHHHHHHHHHHHCCCCEEE T ss_conf 835688624320577067999999998520244332222234678753660--47-998-14999999999829956599 Q ss_pred EEECCCCCCCCHH-------------HHHHHHHHHHHHHCCCCCCCCHHHHHHHHH Q ss_conf 9750478778037-------------888545555432001234420123456665 Q gi|254780546|r 88 SWKNSKPQTGLMA-------------AAREARYALISEHAKTINATLIMTAHTFDD 130 (423) Q Consensus 88 ~~~~~~~~~~~~~-------------~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD 130 (423) .++...-...++. .+...-| ++....+..++.++++|.-+|. T Consensus 299 ~~t~~d~~~~l~~vi~~lE~~d~~~~~asip~y-llsk~i~~~gvKVvLSGeGADE 353 (555) T PRK09431 299 HFTVQEGLDALRDVIYHLETYDVTTIRASTPMY-LMSRKIKAMGIKMVLSGEGADE 353 (555) T ss_pred EECHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH-HHHHHHHHCCCEEEEECCCHHH T ss_conf 954799999999999874146875025689999-9999999639879992775044 No 79 >TIGR03104 trio_amidotrans asparagine synthase family amidotransferase. Members of this protein family are closely related to several isoforms of asparagine synthetase (glutamine amidotransferase) and typically have been given this name in genome annotation to date. Each is part of a conserved three-gene cassette sparsely distributed across at least twenty different species known so far, including alpha, beta, and gamma Proteobacteria, Mycobacterium, and Prosthecochloris, which is a member of the Chlorobi. The other two members of the cassette are a probable protease and a member of the GNAT family of acetyltransferases. Probab=97.50 E-value=0.00027 Score=49.38 Aligned_cols=105 Identities=22% Similarity=0.260 Sum_probs=65.0 Q ss_pred HCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCC-CHHHHHHHHHHHHHHCCCCEEEEEEECCC Q ss_conf 52799839999669424899999999999864899729999996677987-84689999999998718988999975047 Q gi|254780546|r 15 SLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRE-TAKDEVRYVSDVCSRLRIAHSVVSWKNSK 93 (423) Q Consensus 15 ~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~-~s~~e~~~v~~~~~~lgi~~~~~~~~~~~ 93 (423) ++.....|++-+|||.||++++.++.+.. .. ++..++|...-.+ +..+|..+++.+++.+|.+|+.+.++... T Consensus 256 rl~sDvpvg~~LSGGlDSS~i~a~~~~~~----~~--~i~tfs~~f~~~~~~~~dE~~~a~~va~~~~~~h~~~~~~~~~ 329 (589) T TIGR03104 256 RLVADVPVGVLLSGGLDSSLIVGLLAEAG----VD--GLRTFSIGFEDVGGEKGDEFEYSDIIAERFHTRHHKIRIPNHR 329 (589) T ss_pred HHHCCCCCCEECCCCCCHHHHHHHHHHHC----CC--CCCEEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCHH T ss_conf 97446763000178754389999999956----99--9852788724688776667999999999738785799957178 Q ss_pred --------------CCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH Q ss_conf --------------8778037888545555432001234420123456665 Q gi|254780546|r 94 --------------PQTGLMAAAREARYALISEHAKTINATLIMTAHTFDD 130 (423) Q Consensus 94 --------------~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD 130 (423) |-.+... -.-| .+.+.+++ ...++++|.-.|. T Consensus 330 ~~~~l~~~~~~~deP~~~~~~---~~~~-~l~~~~~~-~~kV~LsG~GaDE 375 (589) T TIGR03104 330 VLPALPEAVAAMSEPMVSHDC---VAFY-LLSEEVSK-HVKVVQSGQGADE 375 (589) T ss_pred HHHHHHHHHHCCCCCCCCCHH---HHHH-HHHHHHCC-CCEEEEEEECCCC T ss_conf 999889988526898768306---8899-99987328-8369995203332 No 80 >PRK00768 nadE NAD synthetase; Reviewed Probab=97.41 E-value=0.014 Score=37.34 Aligned_cols=142 Identities=13% Similarity=0.130 Sum_probs=67.2 Q ss_pred CCCCEEEEEECCCHHHHHHHHHHHHHHHH----CCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECC Q ss_conf 79983999966942489999999999986----48997299999966779878468999999999871898899997504 Q gi|254780546|r 17 VYPAHILVAVSGGSDSMGLLIALHSVLSD----RSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNS 92 (423) Q Consensus 17 ~~~~~i~vAvSGG~DS~aLl~ll~~~~~~----~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~ 92 (423) .+...++|++|||.||.+.+.+++..... .......+.++..-++... +..+++ ..++-++..+. ..++.. T Consensus 38 ~g~~g~VlGLSGGIDSAvta~L~~~Av~al~~~~~~~~~~~iav~mP~~~~~-~~~da~---~~~~~i~~~~~-~~i~I~ 112 (274) T PRK00768 38 SGLKTLVLGISGGQDSTLAGRLAQLAVEELRAETGDASYQFIAVRLPYGVQA-DEDDAQ---DALAFIQPDEV-LTVNIK 112 (274) T ss_pred CCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCC-CHHHHH---HHHHHCCCCEE-EEECCC T ss_conf 4998399818857688999999999999865304766422689768988657-999999---99852276604-886170 Q ss_pred C--------------C-----CCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 7--------------8-----77803788854555543200123442012345666566899998623654443322357 Q gi|254780546|r 93 K--------------P-----QTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGM 153 (423) Q Consensus 93 ~--------------~-----~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~ 153 (423) . . .+|+++ +.|.-.++..+...+..++-|+.-- |. +.|. T Consensus 113 ~~~d~~~~~l~~~~~~~~d~~~~Niqa---RiRM~~LY~~An~~g~LVlGTgNks----E~---------------~vGY 170 (274) T PRK00768 113 PAVDASVAALRAAGIELSDFVKGNIKA---RERMIAQYAIAGARGGLVVGTDHAA----EA---------------ITGF 170 (274) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHH---HHHHHHHHHHHHCCCCEEECCCCHH----HH---------------HCCC T ss_conf 999999999987277511678888579---8999999999827899886488640----44---------------4085 Q ss_pred CCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCC Q ss_conf 6410035796871012104289999989981893 Q gi|254780546|r 154 CDTILYDLNLWISRPFLRCRREDIRSFLLQRNIS 187 (423) Q Consensus 154 ~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~ 187 (423) . ...+.+.-=+-|+-++.|.|+++.++..|+| T Consensus 171 f--TkYGDg~~Di~PI~dL~KteV~~lA~~LgvP 202 (274) T PRK00768 171 F--TKFGDGGADLLPLFGLNKRQGRALLAALGAP 202 (274) T ss_pred E--EEECCCCCCHHHHCCCCHHHHHHHHHHHCCC T ss_conf 0--1026775173763256199999999995979 No 81 >TIGR01536 asn_synth_AEB asparagine synthase (glutamine-hydrolyzing); InterPro: IPR006426 These sequences represent glutamine-hydrolysing asparagine synthase. The group have a poorly conserved C-terminal extension while bacterial members of the family tend to have a long, poorly conserved insert lacking from archaeal and eukaryotic sequences. Multiple isozymes have been demonstrated, such as in Bacillus subtilis .; GO: 0004066 asparagine synthase (glutamine-hydrolyzing) activity, 0006529 asparagine biosynthetic process. Probab=97.38 E-value=0.00073 Score=46.35 Aligned_cols=116 Identities=19% Similarity=0.170 Sum_probs=76.4 Q ss_pred HCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCC-CEEEEEEEECCCCCCCHHHHHH--------HHHHHHHHCCCCEE Q ss_conf 52799839999669424899999999999864899-7299999966779878468999--------99999987189889 Q gi|254780546|r 15 SLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFG-KIKFSAISVDHCLRETAKDEVR--------YVSDVCSRLRIAHS 85 (423) Q Consensus 15 ~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~-~~~l~a~~vdHglr~~s~~e~~--------~v~~~~~~lgi~~~ 85 (423) +|..-..|++=+|||.||+.+.-++.....+.... ...|+..+|.-.-.+..=+|.. +.+++++.+|+.|+ T Consensus 307 rl~aDvpvG~~LSGGlDSS~vaa~a~~~~~~~~~~W~~~~~tFsigf~~~~~~~~Es~l~P~~D~P~A~~vA~~~G~~h~ 386 (646) T TIGR01536 307 RLVADVPVGVLLSGGLDSSLVAAIARREAPSEGVDWKGPVKTFSIGFEGDSKDFDESKLRPDDDQPYARKVADELGTEHH 386 (646) T ss_pred HCCCCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCEECCCCCCCCHHHHHHHHHHHCCCCE T ss_conf 32005606888717167999999999976764331177434788651778765300257899861899999998498324 Q ss_pred EEEEECCC------------CCCCCHHHHHHHHHHHHHHHCCCC-CCCCHHHHHHHHH Q ss_conf 99975047------------877803788854555543200123-4420123456665 Q gi|254780546|r 86 VVSWKNSK------------PQTGLMAAAREARYALISEHAKTI-NATLIMTAHTFDD 130 (423) Q Consensus 86 ~~~~~~~~------------~~~~~~~~ar~~r~~~~~~~~~~~-~~~~l~~ah~~dD 130 (423) .+.++... .+-...+..-..=..++...+++. +..+++.|--+|. T Consensus 387 ~~~~s~~e~~~~l~~v~~~~e~y~p~~~~~~~P~Yl~s~~ar~~Tg~kV~LSGeGaDE 444 (646) T TIGR01536 387 EVLISEEEVLKALPEVIYHLETYDPTAIRASIPLYLLSKLAREDTGVKVVLSGEGADE 444 (646) T ss_pred EEEECHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEECCCCHHH T ss_conf 8984589999998887654248854303112358999999996288389974866355 No 82 >pfam01902 ATP_bind_4 ATP-binding region. This family of proteins probably binds ATP. This domain is about 200 amino acids long with a strongly conserved motif SGGKD at the N terminus.In some members of this family, this domain is associated with pfam01042. Probab=97.33 E-value=0.0083 Score=38.88 Aligned_cols=63 Identities=13% Similarity=0.056 Sum_probs=39.8 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCH--HHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 3999966942489999999999986489972999999667798784--68999999999871898899997 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETA--KDEVRYVSDVCSRLRIAHSVVSW 89 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s--~~e~~~v~~~~~~lgi~~~~~~~ 89 (423) |+++-+|||+||+--++.+.+ . + .....+.++.+.-..-. .--.+.++..++.+|||+..... T Consensus 2 K~~~l~SGGKDS~~Al~~a~~----~-~-~v~~L~t~~~~~~ds~~~H~~~~~l~~~qA~algiPl~~~~~ 66 (219) T pfam01902 2 KVAALYSGGKDSNYALYWALK----E-I-EVPYLVSMKSENKESYMFHEPNLHLTKLLAEALGIPIIKLYT 66 (219) T ss_pred CEEEEECCCHHHHHHHHHHHH----C-C-CCEEEEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEC T ss_conf 399997286999999999987----1-9-837999996379981026157889999999975996899967 No 83 >PTZ00077 asparagine synthetase; Provisional Probab=97.26 E-value=0.0021 Score=43.06 Aligned_cols=74 Identities=19% Similarity=0.210 Sum_probs=51.7 Q ss_pred HCCCCCEEEEEECCCHHHHHHHHHHHHHHHHC-----------------------------------CCCCEEEEEEEEC Q ss_conf 52799839999669424899999999999864-----------------------------------8997299999966 Q gi|254780546|r 15 SLVYPAHILVAVSGGSDSMGLLIALHSVLSDR-----------------------------------SFGKIKFSAISVD 59 (423) Q Consensus 15 ~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~-----------------------------------~~~~~~l~a~~vd 59 (423) +|...-.|++-+|||.||+....+++...+.. ...+.+++.++| T Consensus 233 rL~sDVPvG~lLSGGLDSSlIaAia~k~l~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~l~tFsI- 311 (610) T PTZ00077 233 RLMGDVPFGILLSGGLDSSIIAAILAKHLKIIDGEGGGGIQGGGAAAAHNAAHEANNNNGEDNCGNNNADPCHLKSFSI- 311 (610) T ss_pred HHHCCCCEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEE- T ss_conf 8225672366634770589999999986542011122222244321000111112444432212356667444226886- Q ss_pred CCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECC Q ss_conf 779878468999999999871898899997504 Q gi|254780546|r 60 HCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNS 92 (423) Q Consensus 60 Hglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~ 92 (423) |+ ++| .|....+++++.+|..|+...++.+ T Consensus 312 -Gl-~gs-pDl~~Ar~vA~~lGt~Hhe~~~t~e 341 (610) T PTZ00077 312 -GL-KGS-PDLKAAKEVAEFLGIEHTEFHFTVE 341 (610) T ss_pred -EC-CCC-CHHHHHHHHHHHHCCCCEEEEECHH T ss_conf -25-898-2489999999983883459997779 No 84 >pfam09179 TilS TilS substrate binding domain. This domain is found in the tRNA(Ile) lysidine synthetase (TilS) protein. Probab=97.13 E-value=0.0036 Score=41.49 Aligned_cols=63 Identities=17% Similarity=0.249 Sum_probs=53.6 Q ss_pred HHHHCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH---CCCCCEEECCCEEEEEECCEEEE Q ss_conf 778337557898999999999835899998899999999986---69984561075799988998999 Q gi|254780546|r 254 QDILNIDSTLLFYLLRVSAAICGGQISLPGYRSMERVMLFLK---SGKRGCVSIGRVVIDRRANFLWI 318 (423) Q Consensus 254 ~~~~~~~~~~~~r~l~~~l~~~~g~~~~p~~~~l~~ll~~l~---~~~~~~~tl~g~~i~~~~~~l~i 318 (423) ..+..++.+.+.+++|.|+...| ..+|+.++++++++.+. .|.++.+.++++.++++++.+|+ T Consensus 4 ~~l~~l~~~r~~~lLR~wL~~~g--~~~Ps~~~l~~i~~~l~~a~~d~~p~l~~~~~~irRy~~~Lyl 69 (69) T pfam09179 4 AALAALSPARQRNLLRYWLAQLG--LPMPSQAQLEEILRQLLLAKEDAQPQLPLGGGELRRYRGRLYL 69 (69) T ss_pred HHHHHCCHHHHHHHHHHHHHHCC--CCCCCHHHHHHHHHHHHHCCCCCCEEEEECCEEEEEECCEEEC T ss_conf 88775999999999999999879--9999999999999999837789981899899999983899639 No 85 >TIGR02039 CysD sulfate adenylyltransferase, small subunit; InterPro: IPR011784 Metabolic assimilation of sulphur from inorganic sulphate, requires sulphate activation by coupling to a nucleoside, for the production of high-energy nucleoside phosphosulphates. This pathway appears to be similar in all prokaryotic organisms. Activation is first achieved through reaction of sulphate with ATP by sulphate adenylyltransferase (ATP sulphurylase) to produce 5'-phosphosulphate (APS), coupled by GTP hydrolysis. Subsequently, APS is phosphorylated by an APS kinase to produce 3'-phosphoadenosine-5'-phosphosulphate (PAPS). In Escherichia coli, ATP sulphurylase is a heterodimer composed of two subunits encoded by cysD and cysN, with APS kinase encoded by cysC. These genes are located in a unidirectionally transcribed gene cluster, and have been shown to be required for the synthesis of sulphur-containing amino acids. Homologous to this E. coli activation pathway are nodPQH gene products found among members of the Rhizobiaceae family. These gene products have been shown to exhibit ATP sulphurase and APS kinase activity, yet are involved in Nod factor sulphation, and sulphation of other macromolecules.; GO: 0004781 sulfate adenylyltransferase (ATP) activity, 0019421 sulfate reduction APS pathway. Probab=97.12 E-value=0.0025 Score=42.60 Aligned_cols=68 Identities=16% Similarity=0.205 Sum_probs=48.6 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECC Q ss_conf 8399996694248999999999998648997299999966779878468999999999871898899997504 Q gi|254780546|r 20 AHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNS 92 (423) Q Consensus 20 ~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~ 92 (423) .+=++=+|+|+||++||||+..--. |. .+-.-.+|||-|..- .+-..+=-...++.++.+.+...+.. T Consensus 20 ~~PV~LyS~GKDS~VlLHLA~KaF~--Pg-~~PFplLHvDTgw~f--~E~~~fRD~~~~~~~~~L~v~~~~~~ 87 (295) T TIGR02039 20 ERPVLLYSIGKDSSVLLHLARKAFY--PG-RLPFPLLHVDTGWKF--REMIAFRDELVAKYGLELIVHSNEEG 87 (295) T ss_pred CCCEEEEECCCHHHHHHHHHHHCCC--CC-CCCCCEEEEECCCCH--HHHHHHHHHHHHHHCCEEEEEECHHH T ss_conf 7976898657347999998764058--88-879735774066327--89999999999870976888505023 No 86 >KOG1706 consensus Probab=96.98 E-value=0.0099 Score=38.33 Aligned_cols=162 Identities=17% Similarity=0.123 Sum_probs=81.5 Q ss_pred CCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEE-EEECCCC- Q ss_conf 79983999966942489999999999986489972999999667798784689999999998718988999-9750478- Q gi|254780546|r 17 VYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVV-SWKNSKP- 94 (423) Q Consensus 17 ~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~-~~~~~~~- 94 (423) ....++++|+|||.|-++.|..|++. ++++.+..-|-|- .++-+..++-+-+.|-.-.+. .+..+.- T Consensus 3 ~~~~~vVLAySGgLDTscil~WLkeq-------GyeViay~AnvGQ----~edfe~ar~kAlk~Gakk~~~ed~~~eFve 71 (412) T KOG1706 3 SSKKSVVLAYSGGLDTSCILAWLKEQ-------GYEVIAYLANVGQ----KEDFEEARKKALKSGAKKVVVEDVREEFVE 71 (412) T ss_pred CCCCEEEEEECCCCCCHHHHHHHHHC-------CCEEEEEECCCCC----HHHHHHHHHHHHHCCCEEEEEHHHHHHHHH T ss_conf 77745999953886723435888862-------9658984001562----666899997533127607986131588876 Q ss_pred ---CCCCHHHHHHH-HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHH----HHHCCCCCCCCC--CCCCCCCCCCCCCCE Q ss_conf ---77803788854-55554320012344201234566656689999----862365444332--235764100357968 Q gi|254780546|r 95 ---QTGLMAAAREA-RYALISEHAKTINATLIMTAHTFDDQLETVYM----RSQRDYAEKGMG--LSGMCDTILYDLNLW 164 (423) Q Consensus 95 ---~~~~~~~ar~~-r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~----rl~r~sg~~g~~--l~~~~~~~~~~~~~~ 164 (423) -+.+|+.|-.. +|-+-.. ...-+.+|.+-|.++..=. ....|.|-+... |+- ....-++. T Consensus 72 dfi~Pa~qs~a~YEd~YLLGTS------laRp~ia~~qv~va~~eg~~aVsHGcTGKGNDQvrFELt~----ysl~P~~k 141 (412) T KOG1706 72 DFIWPALQSSALYEDRYLLGTS------LARPVIAKAQVDVAQREGAKAVSHGCTGKGNDQVRFELTF----YSLKPDVK 141 (412) T ss_pred HCCHHHHHHCCHHHCEEEECCC------CCCCHHHHHHHHHHHHCCCEEEECCCCCCCCCCEEEEEEE----ECCCCCCE T ss_conf 0201644311210101342244------4431343335357765175033035566777402466533----11477630 Q ss_pred EECCHH-------CCCHHHHHHHHHHHCCCC-----------------------CCCCCCCCCCC Q ss_conf 710121-------042899999899818932-----------------------02667864221 Q gi|254780546|r 165 ISRPFL-------RCRREDIRSFLLQRNISW-----------------------CEDPSNTDDRF 199 (423) Q Consensus 165 i~RPLL-------~~~r~~l~~~~~~~~i~w-----------------------veDpSN~d~~f 199 (423) ++-|+- .-.|+++.+|++++|||. .|||+|+-++- T Consensus 142 viapwrmp~f~~rf~Gr~Dl~eYakq~giPvpvT~k~pwsmDeNl~HiSyEagiLEdP~nqpp~~ 206 (412) T KOG1706 142 VIAPWRMPEFYERFKGRKDLLEYAKQHGIPVPVTPKNPWSMDENLMHISYEAGILEDPKNQPPKD 206 (412) T ss_pred EECCCCCHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCC T ss_conf 33354546788764271478999986499766568887654411001000035334867799810 No 87 >cd01987 USP_OKCHK USP domain is located between the N-terminal sensor domain and C-terminal catalytic domain of this Osmosensitive K+ channel histidine kinase family. The family of KdpD sensor kinase proteins regulates the kdpFABC operon responsible for potassium transport. The USP domain is homologous to the universal stress protein Usp Usp is a small cytoplasmic bacterial protein whose expression is enhanced when the cell is exposed to stress agents. Usp enhances the rate of cell survival during prolonged exposure to such conditions, and may provide a general "stress endurance" activity. Probab=96.93 E-value=0.0038 Score=41.24 Aligned_cols=105 Identities=17% Similarity=0.188 Sum_probs=75.5 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCC----CCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCC Q ss_conf 399996694248999999999998648997299999966779----8784689999999998718988999975047877 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCL----RETAKDEVRYVSDVCSRLRIAHSVVSWKNSKPQT 96 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHgl----r~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~~~~ 96 (423) ||+||+||...|-.++--..+++.+. +.+++++||+-.- ..+..+..+...+.++++|.+..++.- . T Consensus 1 rILV~vd~s~~s~~lir~a~rlA~~~---~a~l~vl~V~~~~~~~~~~~~~~~l~~~~~la~~lga~~~~~~~--~---- 71 (124) T cd01987 1 RILVCISGGPNAERLIRRAARLADRL---KAPWYVVYVETPRLNRLSEAERRRLAEALRLAEELGAEVVTLPG--D---- 71 (124) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHC---CCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEC--C---- T ss_conf 99999589850799999999999964---99899999955975658979999999999999985998999947--9---- Q ss_pred CCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 80378885455554320012344201234566656689999862365444 Q gi|254780546|r 97 GLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEK 146 (423) Q Consensus 97 ~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~ 146 (423) +. ...+.+.+++.+++.+++|++..-- +.++++||-.+ T Consensus 72 d~--------~~~I~~~A~~~~~t~IVlG~~~~~~----~~~~~~gSv~~ 109 (124) T cd01987 72 DV--------AEAIVEFAREHNVTQIVVGKSRRSR----WRELFRGSLVD 109 (124) T ss_pred CH--------HHHHHHHHHHCCCCEEEECCCCCCH----HHHHHCCCHHH T ss_conf 98--------9999999998499899976898854----68872388999 No 88 >TIGR00552 nadE NAD+ synthetase; InterPro: IPR003694 NAD+ synthase (6.3.5.1 from EC) catalyzes the last step in the biosynthesis of nicotinamide adenine dinucleotide and is induced by stress factors such as heat shock and glucose limitation. The three-dimensional structure of NH3-dependent NAD+ synthetase from Bacillus subtilis, in its free form and in complex with ATP shows that the enzyme consists of a tight homodimer with alpha/beta subunit topology .; GO: 0003952 NAD+ synthase (glutamine-hydrolyzing) activity, 0005524 ATP binding, 0009435 NAD biosynthetic process. Probab=96.91 E-value=0.017 Score=36.65 Aligned_cols=156 Identities=18% Similarity=0.151 Sum_probs=100.1 Q ss_pred CHHHHHHH---HHHHC---C-CCCEEEEEECCCHHH--------HHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHH Q ss_conf 98999999---99852---7-998399996694248--------999999999998648997299999966779878468 Q gi|254780546|r 4 SPIESVRF---FVRSL---V-YPAHILVAVSGGSDS--------MGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKD 68 (423) Q Consensus 4 ~p~~~~~~---~~~~l---~-~~~~i~vAvSGG~DS--------~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~ 68 (423) .+.+-+.+ |+..- . ....+++++|||.|| ++...++.+... ..+.+++.+-|+-. .+.. T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~g~v~GlsGG~ds~l~g~l~~~~v~~~~~~~~~-----~~~~~~~~~p~~~~-~~~~ 77 (286) T TIGR00552 4 KVVEELEDAVDFLRGYVKKSPGAKGVVLGLSGGVDSTLAGKLCQAVVAALCVEALG-----KEQNHALLLPHSNT-TPED 77 (286) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCC-----HHHHEEEEECCCCC-CCHH T ss_conf 37899999898888887632576504673047620356778999999999985046-----11420332146678-8412 Q ss_pred HHHHHHHHHHHCCCCEEEEEEECC--------------CCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH Q ss_conf 999999999871898899997504--------------787780378885455554320012344201234566656689 Q gi|254780546|r 69 EVRYVSDVCSRLRIAHSVVSWKNS--------------KPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLET 134 (423) Q Consensus 69 e~~~v~~~~~~lgi~~~~~~~~~~--------------~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et 134 (423) +.+....+|+.+|++......... .+.....+...+.|...++..+...+...+-|+|-..-.. T Consensus 78 d~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gn~~~r~r~~~~y~~a~~~~~lv~gt~n~~e~~~-- 155 (286) T TIGR00552 78 DVQDALALAEPLGINYKTIDIAPIAASFQAQTETGDPLADFLAEGNLKARLRMALLYAHANKHNLLVLGTGNKSELLL-- 155 (286) T ss_pred HHHHHHHHHHHCCCCCEECCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHH-- T ss_conf 179999988740544211130467788776420344333102332224788999998873101635640461344543-- Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCC Q ss_conf 9998623654443322357641003579687101210428999998998189 Q gi|254780546|r 135 VYMRSQRDYAEKGMGLSGMCDTILYDLNLWISRPFLRCRREDIRSFLLQRNI 186 (423) Q Consensus 135 ~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i 186 (423) |-+ ...+.+.-=+.|+-.+.|.+....++..|+ T Consensus 156 -------Gy~------------t~~Gdg~~d~~p~~~l~k~~~~~l~~~l~~ 188 (286) T TIGR00552 156 -------GYF------------TKYGDGGCDILPLGGLFKTEVYELAKRLGV 188 (286) T ss_pred -------HHH------------HHCCCCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf -------200------------101442000345532237889999998175 No 89 >COG2102 Predicted ATPases of PP-loop superfamily [General function prediction only] Probab=96.60 E-value=0.067 Score=32.47 Aligned_cols=64 Identities=11% Similarity=0.056 Sum_probs=38.1 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCH----HHHHHHHHHHHHHCCCCEEEEEEEC Q ss_conf 3999966942489999999999986489972999999667798784----6899999999987189889999750 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETA----KDEVRYVSDVCSRLRIAHSVVSWKN 91 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s----~~e~~~v~~~~~~lgi~~~~~~~~~ 91 (423) |+++=+|||+||.-.+|.+.+ .+ .+|..+..=+--+++| .--.+.+...++.+|||........ T Consensus 2 k~~aL~SGGKDS~~Al~~a~~----~G---~eV~~Ll~~~p~~~dS~m~H~~n~~~~~~~Ae~~gi~l~~~~~~g 69 (223) T COG2102 2 KVIALYSGGKDSFYALYLALE----EG---HEVVYLLTVKPENGDSYMFHTPNLELAELQAEAMGIPLVTFDTSG 69 (223) T ss_pred CEEEEEECCCHHHHHHHHHHH----CC---CEEEEEEEEECCCCCEEEEECCCHHHHHHHHHHCCCCEEEEECCC T ss_conf 689998167188999999997----59---836899998159987044222256799999874488359983476 No 90 >KOG0573 consensus Probab=95.57 E-value=0.087 Score=31.66 Aligned_cols=38 Identities=29% Similarity=0.399 Sum_probs=28.8 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCC Q ss_conf 839999669424899999999999864899729999996677 Q gi|254780546|r 20 AHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHC 61 (423) Q Consensus 20 ~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHg 61 (423) ++|.|-+|||.|||++++++++....+ .+++ -+.|-.+ T Consensus 251 s~VcVlfSGGvDs~vvA~l~h~~vp~n--e~Id--LINVaF~ 288 (520) T KOG0573 251 SNVCVLFSGGVDSTVVAVLAHYVVPEN--EPID--LINVAFG 288 (520) T ss_pred CCEEEEECCCCHHHHHHHHHHHHCCCC--CCEE--EEEEECC T ss_conf 767999648840899999998606988--8636--8986025 No 91 >KOG0571 consensus Probab=95.25 E-value=0.15 Score=30.02 Aligned_cols=75 Identities=17% Similarity=0.154 Sum_probs=52.5 Q ss_pred HHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCC-CCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECC Q ss_conf 8527998399996694248999999999998648-997299999966779878468999999999871898899997504 Q gi|254780546|r 14 RSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRS-FGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNS 92 (423) Q Consensus 14 ~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~-~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~ 92 (423) ++|-...+++|-+|||.||+..+-++....++.. .-+..++.+.| |+- +|.+ ....+++++-+|.+|+......+ T Consensus 220 KRLM~d~p~GvLLSGGLDSSLvAsia~R~lk~~~~~~~~~lhsFaI--Gle-~SPD-L~aarkVAd~igt~Hhe~~ft~q 295 (543) T KOG0571 220 KRLMTDVPFGVLLSGGLDSSLVASIAARELKKAQAARGSKLHSFAI--GLE-DSPD-LLAARKVADFIGTIHHEHTFTIQ 295 (543) T ss_pred HHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEE--CCC-CCHH-HHHHHHHHHHHCCCCEEEEECHH T ss_conf 9863168615895077318999999999998766412787547885--289-9853-78889999874885238997588 No 92 >cd00293 USP_Like Usp: Universal stress protein family. The universal stress protein Usp is a small cytoplasmic bacterial protein whose expression is enhanced when the cell is exposed to stress agents. Usp enhances the rate of cell survival during prolonged exposure to such conditions, and may provide a general "stress endurance" activity. The crystal structure of Haemophilus influenzae Usp reveals an alpha/beta fold similar to that of the Methanococcus jannaschii MJ0577 protein, which binds ATP, athough Usp lacks ATP-binding activity. Probab=94.98 E-value=0.31 Score=27.79 Aligned_cols=95 Identities=18% Similarity=0.136 Sum_probs=62.3 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCC-----------CHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 39999669424899999999999864899729999996677987-----------8468999999999871898899997 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRE-----------TAKDEVRYVSDVCSRLRIAHSVVSW 89 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~-----------~s~~e~~~v~~~~~~lgi~~~~~~~ 89 (423) +|+||+.|...|..++..+.++.... +.+++++||.-.-.. ++.+..+.....+...|++...... T Consensus 1 ~Ilv~vd~s~~s~~al~~a~~la~~~---~~~l~~v~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 77 (130) T cd00293 1 RILVAVDGSEESERALRWAARLARRL---GAELVLLHVVDPPPSSAAELAELLEEEARALLEALREALAEAGVKVETVVL 77 (130) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHC---CCEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEE T ss_conf 99999898989999999999999872---997999999728865530144678999999999999985427983899999 Q ss_pred ECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH Q ss_conf 504787780378885455554320012344201234566656 Q gi|254780546|r 90 KNSKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQ 131 (423) Q Consensus 90 ~~~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~ 131 (423) .+.. ...+.+.+.+.+++.++.|.|.... T Consensus 78 ~~~~-------------~~~i~~~a~~~~~dliV~G~~~~~~ 106 (130) T cd00293 78 EGDP-------------AEAILEAAEELGADLIVMGSRGRSG 106 (130) T ss_pred ECCH-------------HHHHHHHHHHCCCCEEEEECCCCCC T ss_conf 4661-------------8999887776118899994799975 No 93 >PRK10490 sensor protein KdpD; Provisional Probab=94.45 E-value=0.41 Score=26.90 Aligned_cols=94 Identities=14% Similarity=0.123 Sum_probs=63.8 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCC----CCCCHHHHHHHHHHHHHHCCCCEEEEEEECCC Q ss_conf 99839999669424899999999999864899729999996677----98784689999999998718988999975047 Q gi|254780546|r 18 YPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHC----LRETAKDEVRYVSDVCSRLRIAHSVVSWKNSK 93 (423) Q Consensus 18 ~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHg----lr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~ 93 (423) ..++|+|++||++.|-.|..-.+.+..+.. ...+|+||+.. +..+.........+++++||-...++.-+.. T Consensus 249 ~~e~ilv~~~~~~~~~~lir~~~r~a~~~~---~~~~~~~v~~~~~~~~~~~~~~~~~~~~~LAe~LGa~v~~l~G~dv- 324 (895) T PRK10490 249 TRDAILLCIGHNTGSEKLVRTAARLAARLG---SVWHAVYVETPRLHRLPEKKRRAILSALRLAQELGAETATLSDPAE- 324 (895) T ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHCC---CCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCEEEEEECCCH- T ss_conf 676299998899874699999999998479---9869999964444579877899999999999981998999837988- Q ss_pred CCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHH Q ss_conf 87780378885455554320012344201234566 Q gi|254780546|r 94 PQTGLMAAAREARYALISEHAKTINATLIMTAHTF 128 (423) Q Consensus 94 ~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~ 128 (423) . ..+.+.++..++..++.|... T Consensus 325 -a------------~~il~~Ar~~nvT~IVlGrs~ 346 (895) T PRK10490 325 -E------------KAVVRYAREHNLGKIIIGRPA 346 (895) T ss_pred -H------------HHHHHHHHHCCCCEEEECCCC T ss_conf -9------------999999998399989988878 No 94 >TIGR00289 TIGR00289 conserved hypothetical protein TIGR00289; InterPro: IPR005237 This family of conserved hypothetical proteins has no known function. Homologous proteins related to MJ0570 of Methanococcus jannaschii include both the apparent orthologs in the family, the much longer protein YLR143W from Saccharomyces cerevisiae (Baker's yeast), and second homologous proteins from Archaeoglobus fulgidus and Pyrococcus horikoshii that appear to represent a second orthologous group.. Probab=93.23 E-value=0.32 Score=27.71 Aligned_cols=67 Identities=18% Similarity=0.168 Sum_probs=41.5 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCH----HHHHHHHHHHHHHCCCCEEEEEEECCCC Q ss_conf 3999966942489999999999986489972999999667798784----6899999999987189889999750478 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETA----KDEVRYVSDVCSRLRIAHSVVSWKNSKP 94 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s----~~e~~~v~~~~~~lgi~~~~~~~~~~~~ 94 (423) |++|=+|||+||+--||-+.+ .+..-.-|+-+.=| .++| ---.+...=.++.+|||+.-+...+.+. T Consensus 2 kvAvLySGGKDS~~ALy~al~----eG~eV~~LV~v~se---N~eSyMfH~pN~Hl~~l~AEavGiPL~klyt~GeeE 72 (227) T TIGR00289 2 KVAVLYSGGKDSILALYKALE----EGFEVKYLVGVISE---NEESYMFHVPNVHLTDLVAEAVGIPLIKLYTSGEEE 72 (227) T ss_pred EEEEEECCCHHHHHHHHHHHH----CCCCEEEEEEEECC---CCCCCEEECHHHHHHHHHHHHCCCCHHHHHCCCCCC T ss_conf 068985187136899999986----28854785100028---898620534257899999976176625630278652 No 95 >cd01989 STK_N The N-terminal domain of Eukaryotic Serine Threonine kinases. The Serine Threonine kinases are enzymes that belong to a very extensive family of proteins which share a conserved catalytic core common with both serine/threonine and tyrosine protein kinases. The N-terminal domain is homologous to the USP family which has a ATP binding fold. The N-terminal domain is predicted to be involved in ATP binding. Probab=93.19 E-value=0.68 Score=25.36 Aligned_cols=95 Identities=18% Similarity=0.136 Sum_probs=58.4 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCC--------------------CCHHHHHHHHHHHHHHC Q ss_conf 3999966942489999999999986489972999999667798--------------------78468999999999871 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLR--------------------ETAKDEVRYVSDVCSRL 80 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr--------------------~~s~~e~~~v~~~~~~l 80 (423) .|+|||.|..-|...|..+.+-... .+-.++.+||.--.. .+..+-.....++|++. T Consensus 1 tV~VAVd~s~~S~~AL~WAl~n~~~---~~~~lvLlHV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~C~~~ 77 (146) T cd01989 1 SVAVAVDKDKKSKNALKWALDNLAT---KGQTIVLVHVHPPITSIPSSSGKLEVASAYKQEEDKEAKELLLPYRCFCSRK 77 (146) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHC---CCCEEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9899978988799999999986406---6981999997168876777653066788998888899999999999999875 Q ss_pred CCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH Q ss_conf 89889999750478778037888545555432001234420123456665 Q gi|254780546|r 81 RIAHSVVSWKNSKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDD 130 (423) Q Consensus 81 gi~~~~~~~~~~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD 130 (423) ||...++-.+...+... +.+.....+++.+.+|-+.+- T Consensus 78 ~V~~e~~v~e~~d~~~~------------I~e~v~~~~i~~LVmGs~~~~ 115 (146) T cd01989 78 GVQCEDVVLEDDDVAKA------------IVEYVADHGITKLVMGASSDN 115 (146) T ss_pred CCEEEEEEEECCCHHHH------------HHHHHHHCCCCEEEECCCCCC T ss_conf 97289999834888899------------999999759989998627998 No 96 >PRK13011 formyltetrahydrofolate deformylase; Reviewed Probab=92.99 E-value=0.51 Score=26.24 Aligned_cols=26 Identities=15% Similarity=0.296 Sum_probs=14.9 Q ss_pred CHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 28999998998189320266786422 Q gi|254780546|r 173 RREDIRSFLLQRNISWCEDPSNTDDR 198 (423) Q Consensus 173 ~r~~l~~~~~~~~i~wveDpSN~d~~ 198 (423) ...+++..+..+|+||..=|.|.+.+ T Consensus 126 NH~dL~~lae~~~IPF~~iPv~~~~K 151 (287) T PRK13011 126 NHPDLEPLAAWHGIPFHHFPITPDTK 151 (287) T ss_pred CCHHHHHHHHHCCCCEEEECCCCCCC T ss_conf 98678999997699828837887763 No 97 >PRK13010 purU formyltetrahydrofolate deformylase; Reviewed Probab=92.66 E-value=0.8 Score=24.86 Aligned_cols=26 Identities=19% Similarity=0.457 Sum_probs=18.4 Q ss_pred CHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 28999998998189320266786422 Q gi|254780546|r 173 RREDIRSFLLQRNISWCEDPSNTDDR 198 (423) Q Consensus 173 ~r~~l~~~~~~~~i~wveDpSN~d~~ 198 (423) ...+++..+..+|+||..=|.|.+.+ T Consensus 129 NH~dL~~lae~~gIPF~hipv~~~~k 154 (289) T PRK13010 129 NHPDLQPLAVQHDIPFHHLPVTPDTK 154 (289) T ss_pred CCHHHHHHHHHCCCCEEEECCCCCCH T ss_conf 87778999997799839826877752 No 98 >PRK09982 universal stress protein UspD; Provisional Probab=92.30 E-value=0.88 Score=24.55 Aligned_cols=101 Identities=15% Similarity=0.198 Sum_probs=68.0 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCC---C-----C-------HHHH-HHHHHHHHHHCCCC Q ss_conf 83999966942489999999999986489972999999667798---7-----8-------4689-99999999871898 Q gi|254780546|r 20 AHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLR---E-----T-------AKDE-VRYVSDVCSRLRIA 83 (423) Q Consensus 20 ~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr---~-----~-------s~~e-~~~v~~~~~~lgi~ 83 (423) .+|+|||..-.||-.|+.=+..+++.. +.++.-+|||.... + + ..+| ..+.++++...+.| T Consensus 4 kHILVAVdLs~eS~~Li~KAv~lAk~~---~AklSlIhvd~~~~elY~gl~~~~~~~~~~~~~e~~~~~L~~L~~~~~YP 80 (142) T PRK09982 4 KHIGVAISGNEEDALLVNKALELARHN---DAHLTLIHIDDGLSELYPGIYFPATEDILQLLKNKSDNKLYKLTKNIQWP 80 (142) T ss_pred CEEEEEEECCHHHHHHHHHHHHHHHHC---CCEEEEEEEECCCHHHCCHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 179999855841699999999998760---98399999908846651102053125688999999999999999847998 Q ss_pred EEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 899997504787780378885455554320012344201234566656689999862 Q gi|254780546|r 84 HSVVSWKNSKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQ 140 (423) Q Consensus 84 ~~~~~~~~~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~ 140 (423) -+-.-+- .+.+.+ .+...+++.+++.+..|||+| |+-++. T Consensus 81 I~~~~v~----~Gdl~~--------~L~~~v~~~~iDLlIcGHHh~-----fwskl~ 120 (142) T PRK09982 81 KTKLRIE----RGEMPE--------TLLEIMQKEQCDLLVCGHHHS-----FINRLM 120 (142) T ss_pred CEEEEEE----ECCHHH--------HHHHHHHHHCCCEEEECCCHH-----HHHHHH T ss_conf 2478998----448899--------999999984998899656286-----999998 No 99 >COG2205 KdpD Osmosensitive K+ channel histidine kinase [Signal transduction mechanisms] Probab=91.06 E-value=1.2 Score=23.65 Aligned_cols=94 Identities=19% Similarity=0.150 Sum_probs=63.7 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCC----CHHHHHHHHHHHHHHCCCCEEEEEEECCCC Q ss_conf 9839999669424899999999999864899729999996677987----846899999999987189889999750478 Q gi|254780546|r 19 PAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRE----TAKDEVRYVSDVCSRLRIAHSVVSWKNSKP 94 (423) Q Consensus 19 ~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~----~s~~e~~~v~~~~~~lgi~~~~~~~~~~~~ 94 (423) .++|+|++||+..|-.|.--+..+..+- ....+|+|||-.=.. ++..-.+.+.++|+++|-+..++.-+... T Consensus 248 ~e~ilvcI~~~~~~e~liR~a~RlA~~~---~a~~~av~v~~~~~~~~~~~~~~~l~~~~~Lae~lGae~~~l~~~dv~- 323 (890) T COG2205 248 RERILVCISGSPGSEKLIRRAARLASRL---HAKWTAVYVETPELHRLSEKEARRLHENLRLAEELGAEIVTLYGGDVA- 323 (890) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHHHH---CCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCEEEEEECCCHH- T ss_conf 5618999789986389999999999972---897699998256644443779999999999999838869998377389- Q ss_pred CCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHH Q ss_conf 77803788854555543200123442012345666 Q gi|254780546|r 95 QTGLMAAAREARYALISEHAKTINATLIMTAHTFD 129 (423) Q Consensus 95 ~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~d 129 (423) + .+.+.|+......++.|.+.. T Consensus 324 ----~---------~i~~ya~~~~~TkiViG~~~~ 345 (890) T COG2205 324 ----K---------AIARYAREHNATKIVIGRSRR 345 (890) T ss_pred ----H---------HHHHHHHHCCCEEEEECCCCC T ss_conf ----9---------999999972981599568743 No 100 >PRK10116 universal stress protein UspC; Provisional Probab=90.69 E-value=1.3 Score=23.42 Aligned_cols=96 Identities=17% Similarity=0.158 Sum_probs=62.5 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCC--CC------------HHHH-HHHHHHHHHHCCCCE Q ss_conf 83999966942489999999999986489972999999667798--78------------4689-999999998718988 Q gi|254780546|r 20 AHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLR--ET------------AKDE-VRYVSDVCSRLRIAH 84 (423) Q Consensus 20 ~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr--~~------------s~~e-~~~v~~~~~~lgi~~ 84 (423) .+|+||+-|-.+|-.++.=+..+++... -+|..+||+.-.. .+ ..++ .++++++....|+|- T Consensus 4 khILVAvDlS~~S~~~i~kA~~lA~~~~---AklslihV~~~~~~y~~~~~~~~~d~~~~~~e~a~~~L~~l~~~~~~pi 80 (142) T PRK10116 4 SNILVAVAVTPESQQLLAKAVSIARPVN---GKISLITLASDPEMYNQFAAPMLEDLRSVMQEETQSFLDELIQKADYPI 80 (142) T ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCC---CEEEEEEEECCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 6899994188506999999999999819---9899999935766652102213788999999999999999998749981 Q ss_pred EEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH Q ss_conf 9999750478778037888545555432001234420123456665 Q gi|254780546|r 85 SVVSWKNSKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDD 130 (423) Q Consensus 85 ~~~~~~~~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD 130 (423) ....+....+. ..+.+.+++.+++.+..|||-.+ T Consensus 81 ~~~~v~~G~~~------------~~I~~~a~e~~~DLIV~G~Hg~~ 114 (142) T PRK10116 81 EKTFIAYGELS------------EHILEVCRKHHFDLVICGNHNHS 114 (142) T ss_pred EEEEEEECCHH------------HHHHHHHHHHCCCEEEEECCCCH T ss_conf 58999965889------------99999999969999999059854 No 101 >TIGR00434 cysH phosophoadenylyl-sulfate reductase; InterPro: IPR004511 Phosphoadenosine phosphosulphate reductase (1.8.4.8 from EC), involved in the assimilation of inorganic sulphate, is designated cysH in bacteria and MET16 in Saccharomyces cerevisiae. Synonyms include phosphoadenosine phosphosulphate reductase and PAPS reductase. In a reaction requiring reduced thioredoxin and NADPH, it converts 3'-phosphoadenylylsulphate (PAPS) to sulphite and adenosine 3',5' diphosphate (PAP). A related family of plant enzymes, scoring below the trusted cut-off, differs in having a thioredoxin-like C-terminal domain, not requiring thioredoxin, and in having a preference for 5'-adenylylsulphate (APS) over PAPS.; GO: 0004604 phosphoadenylyl-sulfate reductase (thioredoxin) activity, 0019379 sulfate assimilation phosphoadenylyl sulfate reduction by phosphoadenylyl-sulfate reductase (thioredoxin). Probab=90.54 E-value=1.3 Score=23.33 Aligned_cols=144 Identities=18% Similarity=0.248 Sum_probs=83.2 Q ss_pred CCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEEC---CCCCCCHHHHHHHHHHHHHHCC----CCEEEEE Q ss_conf 27998399996694248999999999998648997299999966---7798784689999999998718----9889999 Q gi|254780546|r 16 LVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVD---HCLRETAKDEVRYVSDVCSRLR----IAHSVVS 88 (423) Q Consensus 16 l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vd---Hglr~~s~~e~~~v~~~~~~lg----i~~~~~~ 88 (423) -.-|..++.|.|=|..-+|+|||+..+.+ ++.++-+| -|+...-+.+ +..++-+++- ...++.+ T Consensus 10 ~~fp~~LV~~~SFGi~g~V~ldL~~ki~~-------~~~~i~lDfiDTG~hF~eTY~--~~~~~~~ry~qPkglni~vy~ 80 (226) T TIGR00434 10 DTFPDELVYSTSFGIEGAVLLDLVSKIKK-------DIPVIFLDFIDTGYHFPETYE--LIDEVKERYPQPKGLNIKVYK 80 (226) T ss_pred HHCCCCCEEHHHHHHHHHHHHHHHHHHCC-------CCCEEEECCHHHCCCCHHHHH--HHHHHHHHHCCCCCCEEEEEC T ss_conf 63586102022678999999999885178-------851465300011267277899--999998761688775067767 Q ss_pred EECCCCCCCCHHHHHHHHHH-HH---------------------HHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 75047877803788854555-54---------------------320012344201234566656689999862365444 Q gi|254780546|r 89 WKNSKPQTGLMAAAREARYA-LI---------------------SEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEK 146 (423) Q Consensus 89 ~~~~~~~~~~~~~ar~~r~~-~~---------------------~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~ 146 (423) +.+.+..+.-.++|. .| .++.++..+.+.++|-=.| | T Consensus 81 -----p~GClS~a~~~~~YGd~LWe~~dVE~~n~y~~lrKVEP~~rALkel~~~AW~~GLRRd-Q--------------- 139 (226) T TIGR00434 81 -----PDGCLSLAEQAAKYGDKLWEQEDVEDPNKYDYLRKVEPLKRALKELEASAWLTGLRRD-Q--------------- 139 (226) T ss_pred -----CCCCCCHHHHHHHHCCHHHCCCCCCCHHHHCHHHCCHHHHHHCCCCCEEEEEEECCCC-C--------------- T ss_conf -----8888797899977163001145778701211000102788750237056323203314-3--------------- Q ss_pred CCCCCCCCCCCC-CCCCCEEECCHHCCCHHHHHHHHHHHCCCCC Q ss_conf 332235764100-3579687101210428999998998189320 Q gi|254780546|r 147 GMGLSGMCDTIL-YDLNLWISRPFLRCRREDIRSFLLQRNISWC 189 (423) Q Consensus 147 g~~l~~~~~~~~-~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wv 189 (423) |..-+.+..+-. ...++.=+=||++++=++|-.|...++++|- T Consensus 140 ~~~R~nl~~~~~d~kf~~~Kv~PlI~Wt~~~v~~Y~~~h~LpYn 183 (226) T TIGR00434 140 GPSRANLKIVNIDEKFGILKVLPLIDWTWKDVYQYLDAHNLPYN 183 (226) T ss_pred CCCCCCCCHHHHHHHCCEEEEECCCCCCHHHHHHHHHHCCCCCC T ss_conf 54321375244362458035404214876899999875579887 No 102 >PRK06027 purU formyltetrahydrofolate deformylase; Reviewed Probab=89.07 E-value=1.7 Score=22.55 Aligned_cols=24 Identities=21% Similarity=0.376 Sum_probs=13.9 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 899999899818932026678642 Q gi|254780546|r 174 REDIRSFLLQRNISWCEDPSNTDD 197 (423) Q Consensus 174 r~~l~~~~~~~~i~wveDpSN~d~ 197 (423) ..+++..+..+|+||..=|.|.++ T Consensus 125 H~dL~~lae~~~IPF~~iPv~~~~ 148 (285) T PRK06027 125 HPDLRSLVESFGIPFHHVPVTKET 148 (285) T ss_pred CHHHHHHHHHCCCCEEEECCCCCC T ss_conf 578999999869982882687665 No 103 >pfam10288 DUF2392 Protein of unknown function (DUF2392). This is a family of proteins conserved from plants to humans. The function is not known. It carries a characteristic GRG sequence motif. Probab=88.31 E-value=0.85 Score=24.68 Aligned_cols=35 Identities=17% Similarity=0.103 Sum_probs=30.0 Q ss_pred CCCCEEECCHHCCCHHHHHHHHHHHCCCCCCCCCC Q ss_conf 57968710121042899999899818932026678 Q gi|254780546|r 160 DLNLWISRPFLRCRREDIRSFLLQRNISWCEDPSN 194 (423) Q Consensus 160 ~~~~~i~RPLL~~~r~~l~~~~~~~~i~wveDpSN 194 (423) .+++.++|||=++.++||..||+..++.-+.-+++ T Consensus 28 ~~~i~~~~PLRD~l~~Ei~~Y~~~~~l~~~~~~~~ 62 (104) T pfam10288 28 GNDIKILRPLRDLLLKEIEAYCSLKNLPPLLINSL 62 (104) T ss_pred CCCCEEEEEHHHHHHHHHHHHHHHHCCCHHCCCCC T ss_conf 88834773758856999999999808740013555 No 104 >pfam02677 DUF208 Uncharacterized BCR, COG1636. Probab=87.50 E-value=2.1 Score=21.88 Aligned_cols=60 Identities=10% Similarity=0.026 Sum_probs=45.0 Q ss_pred EEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHH--HHHHHHHHHHCCCCEEEEEEE Q ss_conf 9966942489999999999986489972999999667798784689--999999998718988999975 Q gi|254780546|r 24 VAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDE--VRYVSDVCSRLRIAHSVVSWK 90 (423) Q Consensus 24 vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e--~~~v~~~~~~lgi~~~~~~~~ 90 (423) +=+==|+||+.-+..|.+. ++++.+.--|+++.|.+..+ .+.++++|+++||+...-.++ T Consensus 3 LH~CCaPCs~~~l~~L~~~-------g~~vt~~FyNPNIhP~~Ey~~R~~~~~~~~~~~~i~~i~~~Yd 64 (176) T pfam02677 3 LHICCAPCSSYPLEYLREY-------GFEVTGFFYNPNIHPYSEYLRRLEEVKRFAEKLGIPLIEGDYD 64 (176) T ss_pred EEECCCCCCHHHHHHHHHC-------CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCEEECCCC T ss_conf 2501878607999999984-------9836998779999978999999999999999859937956888 No 105 >LOAD_USPA consensus Probab=84.54 E-value=2.9 Score=20.88 Aligned_cols=94 Identities=21% Similarity=0.139 Sum_probs=59.9 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCC--------------HHHHHHHHHHHHHHCCCCEE Q ss_conf 8399996694248999999999998648997299999966779878--------------46899999999987189889 Q gi|254780546|r 20 AHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRET--------------AKDEVRYVSDVCSRLRIAHS 85 (423) Q Consensus 20 ~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~--------------s~~e~~~v~~~~~~lgi~~~ 85 (423) .+|+|++-|..+|...+..+.++.... +.+++++||.-.-... +.+..+...+.+...+++.. T Consensus 1 k~Ilv~vd~s~~s~~a~~~A~~la~~~---~a~l~~lhV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (135) T LOAD_USPA 1 KKILVAIDGSPESEKALRWAVDLAKRR---GAELILLHVIPPSVSTAASPALDLALLLEEALKLLEEALELLEEAGVKID 77 (135) T ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHC---CCEEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEE T ss_conf 989999869989999999999999874---99899999962776444444201468999999999999999986399739 Q ss_pred EEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHH Q ss_conf 99975047877803788854555543200123442012345666 Q gi|254780546|r 86 VVSWKNSKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFD 129 (423) Q Consensus 86 ~~~~~~~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~d 129 (423) +.-..+ .+. ..+.+.+.+.+++.++.|+|.. T Consensus 78 ~~i~~g-~~~------------~~I~~~~~~~~~dliVmG~~~~ 108 (135) T LOAD_USPA 78 VEVEEG-SPA------------EAILELAEESNADLIVVGSRGR 108 (135) T ss_pred EEEEEE-CCH------------HHHHHHHHHCCCCEEEEECCCC T ss_conf 999996-428------------9999998605776799946899 No 106 >cd01988 Na_H_Antiporter_C The C-terminal domain of a subfamily of Na+ /H+ antiporter existed in bacteria and archea . Na+/H+ exchange proteins eject protons from cells, effectively eliminating excess acid from actively metabolising cells. Na+ /H+ exchange activity is also crucial for the regulation of cell volume, and for the reabsorption of NaCl across renal, intestinal, and other epithelia. These antiports exchange Na+ for H+ in an electroneutral manner, and this activity is carried out by a family of Na+ /H+ exchangers, or NHEs, which are known to be present in both prokaryotic and eukaryotic cells. These exchangers are highly-regulated (glyco)phosphoproteins, which, based on their primary structure, appear to contain 10-12 membrane-spanning regions (M) at the N-terminus and a large cytoplasmic region at the C-terminus. The transmembrane regions M3-M12 share identity wit h other members of the family. The M6 and M7 regions are highly conserved. Thus, this is thought to be the regio Probab=84.42 E-value=2.9 Score=20.84 Aligned_cols=95 Identities=17% Similarity=0.153 Sum_probs=61.9 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECC---CCC--------CCHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 3999966942489999999999986489972999999667---798--------78468999999999871898899997 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDH---CLR--------ETAKDEVRYVSDVCSRLRIAHSVVSW 89 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdH---glr--------~~s~~e~~~v~~~~~~lgi~~~~~~~ 89 (423) ||+|++.|-.-|..++..+..+.+.. +-+++++||=- ... ..+.+..+.....++..|++...... T Consensus 1 rILVpvd~s~~s~~al~~A~~la~~~---~a~l~~lhVv~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 77 (132) T cd01988 1 RILVPVANPNTARDLLELAAALARAQ---NGEIIPLNVIEVPNHSSPSQLEVNVQRARKLLRQAERIAASLGVPVHTIIR 77 (132) T ss_pred CEEEECCCCHHHHHHHHHHHHHHHHC---CCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 98997289889999999999999876---998999999845876785566789999999999999999876995699999 Q ss_pred ECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH Q ss_conf 50478778037888545555432001234420123456665 Q gi|254780546|r 90 KNSKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDD 130 (423) Q Consensus 90 ~~~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD 130 (423) ....+.. ...+.+++.+++.+..|.|--. T Consensus 78 ~~~~~~~------------~I~~~a~e~~~DlIVmG~~g~~ 106 (132) T cd01988 78 IDHDIAS------------GILRTAKERQADLIIMGWHGST 106 (132) T ss_pred ECCCHHH------------HHHHHHHHHCCCEEEECCCCCC T ss_conf 7799799------------9999999849999998379998 No 107 >cd01399 GlcN6P_deaminase GlcN6P_deaminase: Glucosamine-6-phosphate (GlcN6P) deaminase subfamily; GlcN6P deaminase catalyzes the reversible conversion of GlcN6P to D-fructose-6-phosphate (Fru6P) and ammonium. The reaction is an aldo-keto isomerization coupled with an amination or deamination. It is the last step of the metabolic pathway of N-acetyl-D-glucosamine-6-phosphate (GlcNAc6P). GlcN6P deaminase is a hexameric enzyme that is allosterically activated by GlcNAc6P. Probab=83.57 E-value=3.2 Score=20.60 Aligned_cols=77 Identities=9% Similarity=0.207 Sum_probs=48.0 Q ss_pred HHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECC--CCCCCHHHH-HHHH-HHHHHHCCCCE Q ss_conf 9999985279983999966942489999999999986489972999999667--798784689-9999-99998718988 Q gi|254780546|r 9 VRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDH--CLRETAKDE-VRYV-SDVCSRLRIAH 84 (423) Q Consensus 9 ~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdH--glr~~s~~e-~~~v-~~~~~~lgi~~ 84 (423) .......+....++.||+|||.--..+...|.+........--+++.+.+|- ++.+++.+- ..++ +.+...++|+. T Consensus 8 a~~i~~~i~~~~~~~ialsGG~tP~~~y~~L~~~~~~~~i~w~~v~~f~~DE~~~v~~~~~~Sn~~~~~~~ll~~~~i~~ 87 (232) T cd01399 8 AELIAELIREKPPAVLGLATGSTPLGVYEELIELHKEGGLSFSNVTTFNLDEYVGLPPDHPQSYHYFMRENLFDHIDIKP 87 (232) T ss_pred HHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHHCCCCHHHEEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCCH T ss_conf 99999999977986999899878999999999988634997579389927423478950558899999998642579998 Q ss_pred E Q ss_conf 9 Q gi|254780546|r 85 S 85 (423) Q Consensus 85 ~ 85 (423) . T Consensus 88 ~ 88 (232) T cd01399 88 E 88 (232) T ss_pred H T ss_conf 9 No 108 >TIGR00655 PurU formyltetrahydrofolate deformylase; InterPro: IPR004810 An Escherichia coli gene designated purU has been identified and characterised. The gene codes for a 280-amino-acid protein, PurU (P37051 from SWISSPROT, 3.5.1.10 from EC). PurU is an enzyme that catalyses the hydrolysis of 10-formyltetrahydrofolate (formyl-FH4) to FH4 and formate ,.10-formyltetrahydrofolate + H(2)O = formate +tetrahydrofolate Formyl-FH4 hydrolase generates the formate that is used by purT-encoded 5'-phosphoribosylglycinamide transformylase for step three of de novo purine nucleotide synthesis. Formyl-FH4 hydrolase, a hexamer of 32kDa subunits, is activated by methionine and inhibited by glycine. Heterotropic cooperativity is observed for activation by methionine in the presence of glycine and for inhibition by glycine in the presence of methionine. These results suggest that formyl-FH4 hydrolase is a regulatory enzyme whose main function is to balance the pools of FH4 and C1-FH4 in response to changing growth conditions. The enzyme uses methionine and glycine to sense the pools of C1-FH4 and FH4, respectively. ; GO: 0008864 formyltetrahydrofolate deformylase activity, 0006189 'de novo' IMP biosynthetic process. Probab=80.85 E-value=4 Score=19.93 Aligned_cols=88 Identities=11% Similarity=0.126 Sum_probs=44.9 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCH Q ss_conf 83999966942489999999999986489972999999667798784689999999998718988999975047877803 Q gi|254780546|r 20 AHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSKPQTGLM 99 (423) Q Consensus 20 ~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~~~~~~~ 99 (423) .||++=|| +.+-||+-||.+|. .+.+..+|.+|.=||- ..++..+++||||+.+..+.+. . T Consensus 93 krvai~vs--Ke~HCL~DLL~r~~--~GeL~~ei~~VIsNH~----------~lr~lve~F~iPF~~~p~~~~~-----r 153 (294) T TIGR00655 93 KRVAILVS--KEDHCLGDLLWRWY--SGELDAEIALVISNHE----------DLRELVERFGIPFHYIPATKDT-----R 153 (294) T ss_pred CEEEEEEE--CCHHHHHHHHHHHC--CCCCEEEEEEEECCCH----------HHHHHHHHCCCCEEEECCCCCH-----H T ss_conf 76899840--41203589999873--8982379999864987----------8998866538988986378850-----5 Q ss_pred HHHHHHHHHHHHH--HCCCCCCCCHHHHH Q ss_conf 7888545555432--00123442012345 Q gi|254780546|r 100 AAAREARYALISE--HAKTINATLIMTAH 126 (423) Q Consensus 100 ~~ar~~r~~~~~~--~~~~~~~~~l~~ah 126 (423) ..+......++.+ ...+...+++++|- T Consensus 154 ~e~Ek~~Lell~~yelkh~~~~DlvVLAk 182 (294) T TIGR00655 154 LEHEKKELELLKQYELKHKYQVDLVVLAK 182 (294) T ss_pred HHHHHHHHHHHHCCCCCEECCCCEEEEEH T ss_conf 68999999987403321102754698210 No 109 >cd05565 PTS_IIB_lactose PTS_IIB_lactose: subunit IIB of enzyme II (EII) of the lactose-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS) found in Firmicutes as well as Actinobacteria. In this system, EII is a lactose-specific permease with two cytoplasmic domains (IIA and IIB) and a transmembrane channel IIC domain. The IIC and IIB domains are expressed as a single protein from the lac operon. The IIB domain fold includes a central four-stranded parallel open twisted beta-sheet flanked by alpha-helices on both sides. The seven major PTS systems with this IIB fold include lactose, chitobiose/lichenan, ascorbate, galactitol, mannitol, fructose, and a sensory system with similarity to the bacterial bgl system. Probab=80.30 E-value=4.1 Score=19.80 Aligned_cols=68 Identities=21% Similarity=0.151 Sum_probs=46.8 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCC-----------CCCCHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 39999669424899999999999864899729999996677-----------9878468999999999871898899997 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHC-----------LRETAKDEVRYVSDVCSRLRIAHSVVSW 89 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHg-----------lr~~s~~e~~~v~~~~~~lgi~~~~~~~ 89 (423) ||++-+|||.-|.-|+..+.+..+.++. ++++.|....-- |-|+-..--+..++.|.++|||..+.+- T Consensus 2 kVLl~Ca~GmSSs~la~~m~k~A~~~gi-~~~i~A~~~~~~~d~~~~yDvilLaPQv~~~~~~lk~~ad~~Gi~v~~i~~ 80 (99) T cd05565 2 NVLVLCAGGGTSGLLANALNKGAKERGV-PLEAAAGAYGSHYDMIPDYDLVILAPQMASYYDELKKDTDRLGIKLVTTTG 80 (99) T ss_pred EEEEEECCCCCHHHHHHHHHHHHHHCCC-CEEEEEEECHHHHHHHHCCCEEEECHHHHHHHHHHHHHHHHCCCEEEEECH T ss_conf 0999907988589999999999998199-769995103448988707999999850888899999999983991887083 No 110 >COG0788 PurU Formyltetrahydrofolate hydrolase [Nucleotide transport and metabolism] Probab=79.96 E-value=4.2 Score=19.73 Aligned_cols=23 Identities=17% Similarity=0.468 Sum_probs=14.5 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 99999899818932026678642 Q gi|254780546|r 175 EDIRSFLLQRNISWCEDPSNTDD 197 (423) Q Consensus 175 ~~l~~~~~~~~i~wveDpSN~d~ 197 (423) .+++..+..+||||..=|-|.+. T Consensus 128 ~dl~~~v~~~~IPfhhip~~~~~ 150 (287) T COG0788 128 DDLRPLVERFDIPFHHIPVTKEN 150 (287) T ss_pred HHHHHHHHHCCCCEEECCCCCCC T ss_conf 78899999749986654678775 No 111 >PRK09590 celB cellobiose phosphotransferase system IIB component; Reviewed Probab=74.44 E-value=6 Score=18.68 Aligned_cols=75 Identities=16% Similarity=0.135 Sum_probs=53.9 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCC---C----------CCCHHHHHHHHHHHHHHCCCCEEE Q ss_conf 839999669424899999999999864899729999996677---9----------878468999999999871898899 Q gi|254780546|r 20 AHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHC---L----------RETAKDEVRYVSDVCSRLRIAHSV 86 (423) Q Consensus 20 ~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHg---l----------r~~s~~e~~~v~~~~~~lgi~~~~ 86 (423) .||++.||||.-|+-|..-+.+....++ ...++.|+.+.-+ + .|+-..-....++.+.+.|+|..+ T Consensus 2 kKILLvCaaGMSTSmlv~km~~~A~~~G-~dveI~Av~~~e~~~~i~~~~yDv~LlgPQVr~~~~~~k~~a~~~giPv~v 80 (104) T PRK09590 2 AKALIICAAGMSSSLMAKKTTEFLKGKG-KDIEVDAISATEGGKAIAAAAFDLYLVSPQTKMYFKQFEEAGSKAGKPVVQ 80 (104) T ss_pred CEEEEEECCCCCHHHHHHHHHHHHHHCC-CCEEEEEEEHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHHHHHCCCCEEE T ss_conf 5599996899879999999999999769-836999848889887632268878998876887899999999872997788 Q ss_pred EEEECCCCC Q ss_conf 997504787 Q gi|254780546|r 87 VSWKNSKPQ 95 (423) Q Consensus 87 ~~~~~~~~~ 95 (423) +.-..-.|- T Consensus 81 I~~~~Y~Pl 89 (104) T PRK09590 81 IPPQAYIPI 89 (104) T ss_pred ECCCCCCCC T ss_conf 787464577 No 112 >TIGR01949 AroFGH_arch predicted phospho-2-dehydro-3-deoxyheptonate aldolase; InterPro: IPR010210 This entry represents a clade of primarily archaeal sequences related to fructose-bisphosphate aldolase. The members appear to be phospho-2-dehydro-3-deoxyheptonate aldolases. This enzyme is the first step of the chorismate biosynthesis pathway. Evidence for this assignment is based on gene clustering and phylogenetic profiling. A group of species lack members of the three other types of phospho-2-dehydro-3-deoxyheptonate aldolase (represented by IPR006219 from INTERPRO, IPR002480 from INTERPRO and IPR006268 from INTERPRO), and also apparently lack the well-known forms of step 2 (3-dehydroquinate synthase), but contain all other steps of the pathway: Aquifex, Archaeoglobus, Halobacterium, Methanococcus, Methanopyrus and Methanobacterium. In Aquifex, Archaeoglobus, Halobacterium, Methanopyrus and Methanosarcina the genes are clustered with other genes from the chorismate, phenylalanine, tyrosine and tryptophan biosynthesis pathways. In addition, these genes in Archaeoglobus, Halobacterium, and Methanosarcina are adjacent to 3-dehydroquinate synthase (IPR002812 from INTERPRO) which also has the property of having members only in those species which lack steps 1 and 2. Together these two genes appear to perform the synthesis of 3-dehydroquinate. It is presumed that the substrates and the chemical transformations involved are identical, but this has not yet been proven experimentally.. Probab=73.76 E-value=6.2 Score=18.57 Aligned_cols=75 Identities=11% Similarity=0.097 Sum_probs=49.7 Q ss_pred EEEEEEECCCCCCCHHH--HHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHH Q ss_conf 99999966779878468--9999999998718988999975047877803788854555543200123442012345666 Q gi|254780546|r 52 KFSAISVDHCLRETAKD--EVRYVSDVCSRLRIAHSVVSWKNSKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFD 129 (423) Q Consensus 52 ~l~a~~vdHglr~~s~~--e~~~v~~~~~~lgi~~~~~~~~~~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~d 129 (423) .=+-+|||=|--.|.++ ++..+.+.|.+||||+----+. +++.+++.--..-.+..++...|.|++.+-|.-+=| T Consensus 105 DAVS~HvNvGs~~e~d~~~~lg~vA~~ad~~GvPlLAMmYa---RG~~i~~e~d~~~v~HAaRlg~ElGADvvK~~Y~Gd 181 (259) T TIGR01949 105 DAVSIHVNVGSDTEADQIEDLGDVAEIADDWGVPLLAMMYA---RGPRIDDEVDPEVVAHAARLGEELGADVVKVPYTGD 181 (259) T ss_pred CEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCHHECCCC---CCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCC T ss_conf 86799886489873899999989999886548842011278---868876644767888887653534577665143177 No 113 >cd05564 PTS_IIB_chitobiose_lichenan PTS_IIB_chitobiose_lichenan: subunit IIB of enzyme II (EII) of the N,N-diacetylchitobiose-specific and lichenan-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS). In these systems, EII is either a lichenan- or an N,N-diacetylchitobiose-specific permease with two cytoplasmic domains (IIA and IIB) and a transmembrane channel IIC domain. In the chitobiose system, these subunits are expressed as separate proteins from chbA, chbB, and chbC of the chb operon (formerly the cel (cellulose) operon). In the lichenan system, these subunits are expressed from licA, licB, and licC of the lic operon. The lic operon of Bacillus subtilis is required for the transport and degradation of oligomeric beta-glucosides, which are produced by extracellular enzymes on substrates such as lichenan or barley glucan. The lic operon is transcribed from a gammaA-dependent promoter and is inducible by lichenan, lichenan hydrolysate, and cellobiose. The IIB d Probab=72.97 E-value=6.4 Score=18.44 Aligned_cols=69 Identities=16% Similarity=0.210 Sum_probs=49.4 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCC-----------CCCCHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 39999669424899999999999864899729999996677-----------9878468999999999871898899997 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHC-----------LRETAKDEVRYVSDVCSRLRIAHSVVSW 89 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHg-----------lr~~s~~e~~~v~~~~~~lgi~~~~~~~ 89 (423) ||++.+|||.-|.-|+.-+++.....+. ++++.|+...-- |-|.-......+++.+...|+|..++.- T Consensus 1 kIlL~C~~GmSts~l~~km~~~a~~~~~-~~~i~A~~~~~~~~~~~~~DviLl~PQv~y~~~~i~~~~~~~~ipV~vI~~ 79 (96) T cd05564 1 KILLVCSAGMSTSILVKKMKKAAEKRGI-DAEIEAVPESELEEYIDDADVVLLGPQVRYMLDEVKKKAAEYGIPVAVIDM 79 (96) T ss_pred CEEEECCCCCCHHHHHHHHHHHHHHCCC-CEEEEEEEHHHHHHHHHCCCEEEECCHHHHHHHHHHHHHHHCCCCEEEECH T ss_conf 9899959986599999999999997499-779999648999977415999997727999999999986104992796073 Q ss_pred E Q ss_conf 5 Q gi|254780546|r 90 K 90 (423) Q Consensus 90 ~ 90 (423) . T Consensus 80 ~ 80 (96) T cd05564 80 M 80 (96) T ss_pred H T ss_conf 5 No 114 >KOG0189 consensus Probab=72.74 E-value=6.5 Score=18.41 Aligned_cols=159 Identities=15% Similarity=0.155 Sum_probs=88.2 Q ss_pred CCHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCC- Q ss_conf 6989999999985279983999966942489999999999986489972999999667798784689999999998718- Q gi|254780546|r 3 LSPIESVRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLR- 81 (423) Q Consensus 3 ~~p~~~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lg- 81 (423) +||.|.+...+-.+ |.-+.+|.||-.| ++|..+++.. +. + ..++.+|.+-.- .+-.....++-+++| T Consensus 32 ~sP~eIm~~al~tf--~~~~q~a~~G~~~-lvlid~~~~~----~~-~--~~l~~idT~~~~--PeT~~l~d~VekkY~~ 99 (261) T KOG0189 32 LSPQEIMDWALETF--PNLFQTAASGLEG-LVLIDMLSKT----GR-P--FRLFFIDTLHHF--PETLRLFDAVEKKYGN 99 (261) T ss_pred CCHHHHHHHHHHHH--HHHHHHHHCCCCC-HHHHHHHHHC----CC-C--CEEEEEECCCCC--HHHHHHHHHHHHHCCC T ss_conf 79899999999972--1489998646450-3799999873----99-9--616995334448--5887899999875486 Q ss_pred CCEEEEEEECCC-------------CCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 988999975047-------------8778037888545555432001234420123456665668999986236544433 Q gi|254780546|r 82 IAHSVVSWKNSK-------------PQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGM 148 (423) Q Consensus 82 i~~~~~~~~~~~-------------~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~ 148 (423) |+.++..-+... -..+-++-.|-+..+-+..+.+......+.||.-.+. | |-.+ T Consensus 100 i~I~~~~pd~~e~ea~~~~K~~~~~~E~~~q~~~~l~KV~P~~RA~k~L~v~A~~TGrRksQ-----------~-gtRs- 166 (261) T KOG0189 100 IRIHVYFPDAVEVEALFASKGGFSLWEDDHQEYDRLRKVEPARRAYKGLNVKAVFTGRRKSQ-----------G-GTRS- 166 (261) T ss_pred EEEEEECCHHHHHHHHHHHCCCHHHEECCCHHHHHHHHCCHHHHHHHCCCEEEEEECCCCCC-----------C-CCCC- T ss_conf 57899756067788999860353430067466523332357788861456026895120267-----------8-8656- Q ss_pred CCCCCCCC-CCCCCCCEEECCHHCCCHHHHHHHHHHHCCCCC Q ss_conf 22357641-003579687101210428999998998189320 Q gi|254780546|r 149 GLSGMCDT-ILYDLNLWISRPFLRCRREDIRSFLLQRNISWC 189 (423) Q Consensus 149 ~l~~~~~~-~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wv 189 (423) .++-+ .....++--|-||.+++-.++-.|+..+++|+- T Consensus 167 ---elpiVqvD~~fellK~NPlaN~~~~dV~nyi~t~nVP~N 205 (261) T KOG0189 167 ---ELPIVQVDPVFELLKINPLANWEFNDVWNYIRTNNVPYN 205 (261) T ss_pred ---CCCEEEECCCCCEEEECCCCCCCHHHHHHHHHHCCCCHH T ss_conf ---663487437333345335344318899999986388679 No 115 >COG1636 Uncharacterized protein conserved in bacteria [Function unknown] Probab=71.40 E-value=7 Score=18.20 Aligned_cols=65 Identities=14% Similarity=0.065 Sum_probs=52.7 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHH--HHHHHHHHHHCCCCEEEEEE Q ss_conf 9983999966942489999999999986489972999999667798784689--99999999871898899997 Q gi|254780546|r 18 YPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDE--VRYVSDVCSRLRIAHSVVSW 89 (423) Q Consensus 18 ~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e--~~~v~~~~~~lgi~~~~~~~ 89 (423) +..+|+|-+-=.+||+-.+..+.+. +.++....-|.+++|.+..+ ...+.++|+++||++.--.+ T Consensus 2 ~~~kiLlH~CCAPcs~y~le~l~~~-------~~~i~~yFYNPNIhP~~EY~~R~~e~~~f~~~~~i~~iegdY 68 (204) T COG1636 2 GRPKLLLHSCCAPCSGYVLEKLRDS-------GIKITIYFYNPNIHPLSEYELRKEEVKRFAEKFGINFIEGDY 68 (204) T ss_pred CCCEEEEEEECCCCCHHHHHHHHHC-------CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCEEEECCC T ss_conf 9870789862577718999999965-------866699974899892699999999999999982970553575 No 116 >PRK06696 uridine kinase; Validated Probab=70.56 E-value=7.3 Score=18.08 Aligned_cols=58 Identities=16% Similarity=0.163 Sum_probs=37.9 Q ss_pred HHHHHHHHHHCCCCCEEEEEECCC--HHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCH Q ss_conf 999999998527998399996694--2489999999999986489972999999667798784 Q gi|254780546|r 6 IESVRFFVRSLVYPAHILVAVSGG--SDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETA 66 (423) Q Consensus 6 ~~~~~~~~~~l~~~~~i~vAvSGG--~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s 66 (423) .+.+...+.++....+++||++|+ +-=+.|+..+.+.....+. .+..+|+|-=+++.+ T Consensus 11 ~~~~~~~i~~~~p~rpl~VgIdG~~gSGKTTlA~~La~~L~~~G~---~V~~v~~Ddf~~~~~ 70 (227) T PRK06696 11 VKEIANHILTLNLTRPLRVAIDGITASGKTTFANELAEEIKKRGR---PVIRASIDDFHNPKV 70 (227) T ss_pred HHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHCCC---CEEEECCCCCCCCHH T ss_conf 999999998359998689997789987879999999999974699---489971544347377 No 117 >PRK10717 cysteine synthase A; Provisional Probab=68.68 E-value=7.6 Score=17.95 Aligned_cols=57 Identities=18% Similarity=0.188 Sum_probs=36.5 Q ss_pred CCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 279983999966942489999999999986489972999999667798784689999999998718988999 Q gi|254780546|r 16 LVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVV 87 (423) Q Consensus 16 l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~ 87 (423) ..+|...+|.-|.|-=..+|+++++.+ ++++++ .+..+. .. .-.+.++.+|-+.... T Consensus 60 ~l~~g~~vveaSSGN~g~alA~~a~~~-------G~~~~i-v~p~~~----~~---~k~~~i~~~GAeVi~v 116 (334) T PRK10717 60 LLKPGGTIVEGTAGNTGIGLALVAAAR-------GYKTVI-VMPETQ----SQ---EKKDLLRALGAELVEV 116 (334) T ss_pred CCCCCCEEEEECCCCHHHHHHHHHHHC-------CCCEEE-EECCCC----HH---HHHHHHHHCCCEEEEC T ss_conf 988998899827973899999999980-------996599-956854----59---9999999759839976 No 118 >cd01524 RHOD_Pyr_redox Member of the Rhodanese Homology Domain superfamily. Included in this CD are the Lactococcus lactis NADH oxidase, Bacillus cereus NADH dehydrogenase, and Bacteroides thetaiotaomicron pyridine nucleotide-disulphide oxidoreductase, and similar rhodanese-like domains found C-terminal of the pyridine nucleotide-disulphide oxidoreductase (Pyr-redox) domain and the Pyr-redox dimerization domain. Probab=68.59 E-value=7.9 Score=17.80 Aligned_cols=51 Identities=18% Similarity=0.171 Sum_probs=38.1 Q ss_pred HHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCC Q ss_conf 899999999852799839999669424899999999999864899729999996677987 Q gi|254780546|r 5 PIESVRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRE 64 (423) Q Consensus 5 p~~~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~ 64 (423) |...+...+..+.+..+|+|-+..|.-|......|.+ . +++ +.+++.|+.. T Consensus 37 P~~~l~~~~~~l~k~~~ivv~C~~G~rS~~aa~~L~~----~---G~~--v~~l~GG~~~ 87 (90) T cd01524 37 PLDELRDRLNELPKDKEIIVYCAVGLRGYIAARILTQ----N---GFK--VKNLDGGYKT 87 (90) T ss_pred CHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHH----C---CCC--EEEECCCHHH T ss_conf 5077999999861587099988998059999999998----6---998--8996698887 No 119 >pfam05889 SLA_LP_auto_ag Soluble liver antigen/liver pancreas antigen (SLA/LP autoantigen). This family consists of several eukaryotic and archaeal proteins which are related to the human soluble liver antigen/liver pancreas antigen (SLA/LP autoantigen). Autoantibodies are a hallmark of autoimmune hepatitis, but most are not disease specific. Autoantibodies to soluble liver antigen (SLA) and to liver and pancreas antigen (LP) have been described as disease specific, occurring in about 30% of all patients with autoimmune hepatitis. The function of SLA/LP is unknown, however, it has been suggested that the protein may function as a serine hydroxymethyltransferase and may be an important enzyme in the thus far poorly understood selenocysteine pathway. Some archaeal members are annotated as being pyridoxal phosphate-dependent enzymes. Probab=65.48 E-value=8.7 Score=17.51 Aligned_cols=18 Identities=6% Similarity=0.102 Sum_probs=12.6 Q ss_pred CHHHHHHHHHHHCCCCCC Q ss_conf 289999989981893202 Q gi|254780546|r 173 RREDIRSFLLQRNISWCE 190 (423) Q Consensus 173 ~r~~l~~~~~~~~i~wve 190 (423) .=.+|-..|+++++|.+- T Consensus 173 ~v~~IAklC~~~~IpHvV 190 (389) T pfam05889 173 NVKEIAKICAEYDVPHLV 190 (389) T ss_pred CHHHHHHHHHHCCCCEEE T ss_conf 799999999974997478 No 120 >PRK00443 nagB glucosamine-6-phosphate deaminase; Provisional Probab=65.45 E-value=9.1 Score=17.38 Aligned_cols=90 Identities=12% Similarity=0.202 Sum_probs=51.1 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCC--CCCCHHH-HHHH-HHHHHHHCCCCEEEEEE-ECCC Q ss_conf 9839999669424899999999999864899729999996677--9878468-9999-99999871898899997-5047 Q gi|254780546|r 19 PAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHC--LRETAKD-EVRY-VSDVCSRLRIAHSVVSW-KNSK 93 (423) Q Consensus 19 ~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHg--lr~~s~~-e~~~-v~~~~~~lgi~~~~~~~-~~~~ 93 (423) +..+.+|+|||.-=..+...|.+........--+++++++|-- +.++..+ -..+ -+.+...++||..-+.. ++ T Consensus 32 ~~~~~l~la~G~tp~~~y~~L~~~~~~~~i~w~~v~~f~~DEy~gvp~~~~~S~~~~l~~~ll~~l~i~~~ni~~~~g-- 109 (264) T PRK00443 32 DRPFVLGLATGSTPLGTYKALIELHKAGKIDFSRVTTFNLDEYVGLPKDHPESYRSFMHENFFDHVDIPPENINILNG-- 109 (264) T ss_pred CCCEEEEECCCCCHHHHHHHHHHHHHCCCCCHHHEEEEECEECCCCCCCCCHHHHHHHHHHHHHHCCCCHHHCCCCCC-- T ss_conf 897399968988989999999999861699856859992301268997760779999999988636986987104888-- Q ss_pred CCCCCHHHHHHHHHHHHHH Q ss_conf 8778037888545555432 Q gi|254780546|r 94 PQTGLMAAAREARYALISE 112 (423) Q Consensus 94 ~~~~~~~~ar~~r~~~~~~ 112 (423) ...+.+++| .+|..... T Consensus 110 ~~~d~~~~~--~~ye~~i~ 126 (264) T PRK00443 110 NADDIEAEC--RRYEEKIK 126 (264) T ss_pred CCCCHHHHH--HHHHHHHH T ss_conf 878999999--99999998 No 121 >TIGR00424 APS_reduc 5'-adenylylsulfate reductase, thioredoxin-independent; InterPro: IPR004508 This enzyme, involved in the assimilation of inorganic sulphate, is closely related to the thioredoxin-dependent PAPS reductase of Bacteria (CysH) and Saccharomyces cerevisiae. However, it has its own C-terminal thioredoxin-like domain and is not thioredoxin-dependent. Also, it has a substrate preference for 5'-adenylylsulphate (APS) over 3'-phosphoadenylylsulphate (PAPS) so the pathway does not require an APS kinase (CysC) to convert APS to PAPS. Arabidopsis thaliana appears to have three isozymes, all able to complement Escherichia coli CysH mutants (even in backgrounds lacking thioredoxin or APS kinase) but likely localized to different compartments in Arabidopsis.; GO: 0016671 oxidoreductase activity acting on sulfur group of donors disulfide as acceptor, 0019421 sulfate reduction APS pathway. Probab=62.03 E-value=10 Score=16.95 Aligned_cols=161 Identities=17% Similarity=0.151 Sum_probs=92.1 Q ss_pred CCHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCC-CCCCHHHHHHHHHHHHHHCC Q ss_conf 69899999999852799839999669424899999999999864899729999996677-98784689999999998718 Q gi|254780546|r 3 LSPIESVRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHC-LRETAKDEVRYVSDVCSRLR 81 (423) Q Consensus 3 ~~p~~~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHg-lr~~s~~e~~~v~~~~~~lg 81 (423) .||.|.+.+.+..+ +..|+||+||--|- +|.-.++ -.+. + +.++.+|.| |.|+. -.+...+-+.+| T Consensus 104 asPleimdk~l~k~--G~~iaiafsGaedv-alieya~----ltG~-P--frvfsldtGrlnPet---y~~fd~vek~yG 170 (469) T TIGR00424 104 ASPLEIMDKALEKF--GNDIAIAFSGAEDV-ALIEYAH----LTGR-P--FRVFSLDTGRLNPET---YRLFDAVEKQYG 170 (469) T ss_pred CCCHHHHHHHHHHC--CCCEEEEECCCCHH-HHHHHHH----HCCC-C--EEEEEECCCCCCHHH---HHHHHHHHHHCC T ss_conf 58457889999840--65057664163113-5554565----4487-2--279863056557246---677665643226 Q ss_pred CCEEEEEEECCC-------------CCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 988999975047-------------8778037888545555432001234420123456665668999986236544433 Q gi|254780546|r 82 IAHSVVSWKNSK-------------PQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGM 148 (423) Q Consensus 82 i~~~~~~~~~~~-------------~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~ 148 (423) |.....--|... -..+-|+-+|....+-+....+ |.....||.-.|.---| ... . T Consensus 171 i~ieymfPd~vevq~lvr~kGlfsfyedGhqeCCrvrkvrPlrralk--Gl~aWitGqrkdqsPGt---------rse-i 238 (469) T TIGR00424 171 IRIEYMFPDAVEVQALVRSKGLFSFYEDGHQECCRVRKVRPLRRALK--GLKAWITGQRKDQSPGT---------RSE-I 238 (469) T ss_pred EEEEEECCCHHHHHHHHHHCCCEEEECCCCCCCEEECCCCHHHHHHH--HHHHHHCCCCCCCCCCC---------CCC-C T ss_conf 17897558603334455405621331156511001102324788865--53433124432478886---------233-4 Q ss_pred CCCCCCCCCC----CCCCCEEECCHHCCCHHHHHHHHHHHCCCC Q ss_conf 2235764100----357968710121042899999899818932 Q gi|254780546|r 149 GLSGMCDTIL----YDLNLWISRPFLRCRREDIRSFLLQRNISW 188 (423) Q Consensus 149 ~l~~~~~~~~----~~~~~~i~RPLL~~~r~~l~~~~~~~~i~w 188 (423) -.....|+.- -.+.+.-|-|+-++.-.++-.|++...+|. T Consensus 239 P~vqvdPvfeGldGGvGslvkWnP~anveG~d~Wnflr~mdvPv 282 (469) T TIGR00424 239 PVVQVDPVFEGLDGGVGSLVKWNPVANVEGKDVWNFLRTMDVPV 282 (469) T ss_pred CEEEECCCCCCCCCCCCCCEEECCCCCCCHHHHHHHHHHHCCCH T ss_conf 34763442004567600202204433455147998887622412 No 122 >KOG3425 consensus Probab=60.79 E-value=11 Score=16.81 Aligned_cols=83 Identities=16% Similarity=0.135 Sum_probs=51.2 Q ss_pred CCCHHHHHHHHHHHCCCCCEEEEEECCCHH----------HHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHH Q ss_conf 769899999999852799839999669424----------8999999999998648997299999966779878468999 Q gi|254780546|r 2 FLSPIESVRFFVRSLVYPAHILVAVSGGSD----------SMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVR 71 (423) Q Consensus 2 ~~~p~~~~~~~~~~l~~~~~i~vAvSGG~D----------S~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~ 71 (423) ..+-.+.....++...++++|.|.++|++| -.+..-...+-.+. .+.+++.+||+-|-|+. |-. T Consensus 8 ~~~g~e~~~~~~~~~~n~~~ifvlF~gskd~~tGqSWCPdCV~AEPvi~~alk~---ap~~~~~v~v~VG~rp~---Wk~ 81 (128) T KOG3425 8 LLPGYESFEETLKNVENGKTIFVLFLGSKDDTTGQSWCPDCVAAEPVINEALKH---APEDVHFVHVYVGNRPY---WKD 81 (128) T ss_pred CCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHH---CCCCEEEEEEEECCCCC---CCC T ss_conf 430589999999987278669999965668888983782187763899999973---87761799997068774---148 Q ss_pred HHHHHHHHCCC---CEEEEEEE Q ss_conf 99999987189---88999975 Q gi|254780546|r 72 YVSDVCSRLRI---AHSVVSWK 90 (423) Q Consensus 72 ~v~~~~~~lgi---~~~~~~~~ 90 (423) -...+=.+.++ -.+.++|. T Consensus 82 p~n~FR~d~~~lt~vPTLlrw~ 103 (128) T KOG3425 82 PANPFRKDPGILTAVPTLLRWK 103 (128) T ss_pred CCCCCCCCCCCEEECCEEEEEC T ss_conf 9985434987414033436873 No 123 >pfam03599 CdhD CO dehydrogenase/acetyl-CoA synthase delta subunit. Probab=60.31 E-value=11 Score=16.75 Aligned_cols=133 Identities=17% Similarity=0.133 Sum_probs=80.8 Q ss_pred HHHHHCCCCCEEEEEECCCHHHHHHHHHHH---HHHHH---CCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCE Q ss_conf 999852799839999669424899999999---99986---489972999999667798784689999999998718988 Q gi|254780546|r 11 FFVRSLVYPAHILVAVSGGSDSMGLLIALH---SVLSD---RSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAH 84 (423) Q Consensus 11 ~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~---~~~~~---~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~ 84 (423) ++.+.+..|..+++-||--.|...+--.+. ++... +-+..+.+-.+.|.|- +.+.++-++.|+++++..++|. T Consensus 21 RHektf~npt~ia~dv~D~m~~~ei~a~~~~i~~v~~~~~~rVG~~l~~D~iav~~~-s~dp~~fa~~vk~V~~~~~~pl 99 (384) T pfam03599 21 RFELPFPNPTAITIDVFDNLSPELLKARRERIEDVMFDPKKRVGEKLTLDMIAIRLI-STDPKEFAKAVEKVLQAVDVPL 99 (384) T ss_pred ECCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHCCHHHHHHHHCCCCEEEEEEC-CCCHHHHHHHHHHHHHHCCCCE T ss_conf 466256799826999538877677999999987521322222212202338999907-8999999999999996469988 Q ss_pred EEEEEECC---------CCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHH-HHHHHHHHHHHHHHCCCCCC Q ss_conf 99997504---------7877803788854555543200123442012345-66656689999862365444 Q gi|254780546|r 85 SVVSWKNS---------KPQTGLMAAAREARYALISEHAKTINATLIMTAH-TFDDQLETVYMRSQRDYAEK 146 (423) Q Consensus 85 ~~~~~~~~---------~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah-~~dD~~Et~l~rl~r~sg~~ 146 (423) .....+.. +....+-..|-...|..+.+.+++.++-.++.+- ..+.+.+.. +.+...|++ T Consensus 100 iL~~~dp~vl~aale~~~~~rPLlyaAt~~N~~~m~~lA~~~~~Pv~v~a~~dl~~l~~L~--~~l~~~Gik 169 (384) T pfam03599 100 VIGGSDPEVLKAALEVAEDERPLLYAATLDNYKEIAELALEYKHPVLLWSINDLNELKNLN--RKLLKAGVK 169 (384) T ss_pred EEEECCHHHHHHHHHHHCCCCCEEEECCHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHH--HHHHHCCCC T ss_conf 9980899999999997465897785489989999999999749828997488899999999--999976985 No 124 >TIGR00639 PurN phosphoribosylglycinamide formyltransferase; InterPro: IPR004607 This family describes phosphoribosylglycinamide formyltransferase (GAR transformylase), one of several proteins found in formyltransferase. This enzyme uses formyl tetrahydrofolate as a formyl group donor to produce 5'-phosphoribosyl-N-formylglycinamide. PurT, a different GAR transformylase, uses ATP and formate rather than formyl tetrahydrofolate. Experimental proof includes complementation of Escherichia coli purN mutants by orthologs from vertebrates (where it is a domain of a multifunctional protein), Bacillus subtilis, and Arabidopsis thaliana. In phylogenetic analyses, the member from Saccharomyces cerevisiae shows a long branch length but membership in the family, while the formyltetrahydrofolate deformylases form a closely related outgroup. ; GO: 0004644 phosphoribosylglycinamide formyltransferase activity, 0006189 'de novo' IMP biosynthetic process. Probab=60.06 E-value=11 Score=16.72 Aligned_cols=94 Identities=23% Similarity=0.250 Sum_probs=55.6 Q ss_pred EEEEEECC-CHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCH Q ss_conf 39999669-42489999999999986489972999999667798784689999999998718988999975047877803 Q gi|254780546|r 21 HILVAVSG-GSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSKPQTGLM 99 (423) Q Consensus 21 ~i~vAvSG-G~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~~~~~~~ 99 (423) ||+|=.|| |+-==||..++++- ....+-.+..|..| +| ..++.+.|++.+||+.++.-..-..+-..+ T Consensus 2 ~i~VLiSG~GSNLqAiiDA~~~~---~~~~~A~v~lVisn---~p-----~A~g~~RA~~~~iP~~v~~~~~f~sGiake 70 (215) T TIGR00639 2 RIVVLISGSGSNLQAIIDAIKEG---QGKIDASVVLVISN---KP-----KAYGLERAAQAGIPTKVLSLKDFDSGIAKE 70 (215) T ss_pred EEEEEEECCCCHHHHHHHHHHHC---CCCCCCEEEEEEEC---CC-----CCHHHHHHHHCCCCEEEEECCCCCCCCCCC T ss_conf 36899842871799999998521---27867368999865---86-----626677788758946774216788775310 Q ss_pred -----HHHHHHHHHHHHHHCCCCCCCCHHHH Q ss_conf -----78885455554320012344201234 Q gi|254780546|r 100 -----AAAREARYALISEHAKTINATLIMTA 125 (423) Q Consensus 100 -----~~ar~~r~~~~~~~~~~~~~~~l~~a 125 (423) +.||...=..+....+..+.+.+++| T Consensus 71 Dkaarakar~~~d~~i~~~l~~~~vdlvvlA 101 (215) T TIGR00639 71 DKAARAKAREAFDQAIAEELKAAEVDLVVLA 101 (215) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCEEEEC T ss_conf 1678888789999999999996099699984 No 125 >TIGR00873 gnd 6-phosphogluconate dehydrogenase, decarboxylating; InterPro: IPR006113 6-Phosphogluconate dehydrogenase (1.1.1.44 from EC) (6PGD) is an oxidative carboxylase that catalyses the decarboxylating reduction of 6-phosphogluconate into ribulose 5-phosphate in the presence of NADP. This reaction is a component of the hexose mono-phosphate shunt and pentose phosphate pathways (PPP) , . Prokaryotic and eukaryotic 6PGD are proteins of about 470 amino acids whose sequence are highly conserved . The protein is a homodimer in which the monomers act independently : each contains a large, mainly alpha-helical domain and a smaller beta-alpha-beta domain, containing a mixed parallel and anti-parallel 6-stranded beta sheet . NADP is bound in a cleft in the small domain, the substrate binding in an adjacent pocket . This model does not specify whether the cofactor is NADP only, NAD only, or both.; GO: 0004616 phosphogluconate dehydrogenase (decarboxylating) activity, 0050661 NADP binding, 0006098 pentose-phosphate shunt. Probab=59.72 E-value=6.8 Score=18.28 Aligned_cols=154 Identities=18% Similarity=0.136 Sum_probs=84.8 Q ss_pred CCCHHHHHHHHHHHCCCCCEEEEEECCC--HHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHH Q ss_conf 7698999999998527998399996694--24899999999999864899729999996677987846899999999987 Q gi|254780546|r 2 FLSPIESVRFFVRSLVYPAHILVAVSGG--SDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSR 79 (423) Q Consensus 2 ~~~p~~~~~~~~~~l~~~~~i~vAvSGG--~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~ 79 (423) .+.|..+|..|+++|.+|+||+|-|-=| +|.=++-.-|+-++.. + . +.||=|-+-..| ...=.+-.++ T Consensus 47 ~i~g~~~ieeFV~~Le~PRKImLMVkAG~pVdaD~~I~~L~P~LE~--G---D---iIIDGGNS~Y~D--T~RR~~eL~~ 116 (480) T TIGR00873 47 KIVGAYSIEEFVQSLERPRKIMLMVKAGAPVDADAVINSLLPLLEK--G---D---IIIDGGNSHYKD--TERRYKELKA 116 (480) T ss_pred CCCCCCCHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHHHCCC--C---C---EEECCCCCCCCC--HHHHHHHHHH T ss_conf 5334326799998506887288877538853778999996443589--9---8---887588788466--5789999986 Q ss_pred CCCCEEEEEEECCC----CCCCCHHHHHHHHHHHHH-------HHCC----CCCCCCHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 18988999975047----877803788854555543-------2001----23442012345666566899998623654 Q gi|254780546|r 80 LRIAHSVVSWKNSK----PQTGLMAAAREARYALIS-------EHAK----TINATLIMTAHTFDDQLETVYMRSQRDYA 144 (423) Q Consensus 80 lgi~~~~~~~~~~~----~~~~~~~~ar~~r~~~~~-------~~~~----~~~~~~l~~ah~~dD~~Et~l~rl~r~sg 144 (423) -||-|.=.=+.+-+ .+++++.=-...-|..+. .... +-+|.+|- -.|+| T Consensus 117 ~Gi~FvG~GvSGGEeGAl~GPSiMPGG~~~Ay~~~~PIl~~IAAkv~dPdg~PCC~wiG----------------~~GAG 180 (480) T TIGR00873 117 KGILFVGVGVSGGEEGALKGPSIMPGGSAEAYPLVAPILQKIAAKVDDPDGEPCCTWIG----------------PDGAG 180 (480) T ss_pred CCCCEEEEEEECCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEC----------------CCCCC T ss_conf 49816730132453021378877888798789999678876554216853294224378----------------67984 Q ss_pred CCCCCCCCCCCCCCCCCCC-------EEECCHHCCCHHHHHHHHHHHC Q ss_conf 4433223576410035796-------8710121042899999899818 Q gi|254780546|r 145 EKGMGLSGMCDTILYDLNL-------WISRPFLRCRREDIRSFLLQRN 185 (423) Q Consensus 145 ~~g~~l~~~~~~~~~~~~~-------~i~RPLL~~~r~~l~~~~~~~~ 185 (423) .. ..|----.+++|+ .+.|-.|+++-++|.+.-.+.| T Consensus 181 Hy----VKMVHNGIEYGDMQLI~EAY~~lK~~L~L~~~~i~~vF~~WN 224 (480) T TIGR00873 181 HY----VKMVHNGIEYGDMQLIAEAYDILKEGLGLSNEEIAEVFTEWN 224 (480) T ss_pred CE----EEEEECCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHC T ss_conf 30----532576711777989999999998651888889999985523 No 126 >PRK07667 uridine kinase; Provisional Probab=59.61 E-value=11 Score=16.67 Aligned_cols=41 Identities=17% Similarity=0.301 Sum_probs=22.2 Q ss_pred CCCCEEEEEECCCHHH--HHHHHHHHHHHHHCCCCCEEEEEEEECC Q ss_conf 7998399996694248--9999999999986489972999999667 Q gi|254780546|r 17 VYPAHILVAVSGGSDS--MGLLIALHSVLSDRSFGKIKFSAISVDH 60 (423) Q Consensus 17 ~~~~~i~vAvSGG~DS--~aLl~ll~~~~~~~~~~~~~l~a~~vdH 60 (423) .+..+++||++||+=| +.++.-+++-.... +..+.++|+|- T Consensus 10 ~~~~r~iIgIaG~sgSGKTTla~~L~~~l~~~---~~~v~v~~~Dd 52 (190) T PRK07667 10 HKENRFILGIDGLSRSGKTTFVANLKENMKQE---GIPFHIFHIDD 52 (190) T ss_pred CCCCEEEEEEECCCCCCHHHHHHHHHHHHHHC---CCCEEEEECCC T ss_conf 75986999977989788999999999998665---98379996662 No 127 >pfam01861 DUF43 Protein of unknown function DUF43. This family includes archaebacterial proteins of unknown function. All the members are 350-400 amino acids long. Probab=59.61 E-value=11 Score=16.67 Aligned_cols=79 Identities=18% Similarity=0.222 Sum_probs=55.4 Q ss_pred CCCHHHHHHHHHHHCCC----CCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 76989999999985279----98399996694248999999999998648997299999966779878468999999999 Q gi|254780546|r 2 FLSPIESVRFFVRSLVY----PAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVC 77 (423) Q Consensus 2 ~~~p~~~~~~~~~~l~~----~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~ 77 (423) ..+|+-++++.+-+... +.+|+ |=|--|.+.|+.++. .++-+|+|+-||.- =..|..+.+ T Consensus 24 ~~T~ET~v~Ra~l~~~rgdL~Gk~iL--~vGDDDLtSlA~al~-------~~p~~I~VvDIDeR-------ll~fI~~~A 87 (243) T pfam01861 24 FVTPETTVARVALMYSRGDLEGKEIL--VLGDDDLTSLAAALT-------GLPKRIAVVDIDER-------LIKFIERVA 87 (243) T ss_pred CCCHHHHHHHHHHHHHCCCCCCCEEE--EECCCCHHHHHHHHC-------CCCCEEEEEECCHH-------HHHHHHHHH T ss_conf 31628899999999873885798799--972861878999964-------89854899967589-------999999999 Q ss_pred HHCCCCEEEEEEECCCCCC Q ss_conf 8718988999975047877 Q gi|254780546|r 78 SRLRIAHSVVSWKNSKPQT 96 (423) Q Consensus 78 ~~lgi~~~~~~~~~~~~~~ 96 (423) +++|++.....+|..++-+ T Consensus 88 ~~~gl~i~~~~~DlR~pLP 106 (243) T pfam01861 88 KEEGLKIEAFVHDLRNPLP 106 (243) T ss_pred HHCCCCEEEEEECCCCCCC T ss_conf 9749971688741014899 No 128 >KOG1325 consensus Probab=58.61 E-value=11 Score=16.87 Aligned_cols=30 Identities=23% Similarity=0.144 Sum_probs=19.1 Q ss_pred HHHHHHHHHC------C-----CCCEEEEEECCCHH-HHHHH Q ss_conf 9999999852------7-----99839999669424-89999 Q gi|254780546|r 7 ESVRFFVRSL------V-----YPAHILVAVSGGSD-SMGLL 36 (423) Q Consensus 7 ~~~~~~~~~l------~-----~~~~i~vAvSGG~D-S~aLl 36 (423) ++.++|+..+ . ...+|+||+|||.= +|... T Consensus 24 ~AL~~fL~~~~~~l~~~~~~~~d~P~vaIa~SGGG~RAMl~g 65 (571) T KOG1325 24 EALKSFLSNTTLNLDSDSLVSSDGPVVGIAGSGGGLRAMLSG 65 (571) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCEEEEEECCCCHHHHHHH T ss_conf 999999864445556543335789858998348778988632 No 129 >cd00946 FBP_aldolase_IIA Class II Type A, Fructose-1,6-bisphosphate (FBP) aldolases. The enzyme catalyses the zinc-dependent, reversible aldol condensation of dihydroxyacetone phosphate with glyceraldehyde-3-phosphate to form fructose-1,6-bisphosphate. FBP aldolase is homodimeric and used in gluconeogenesis and glycolysis. The type A and type B Class II FBPA's differ in the presence and absence of distinct indels in the sequence that result in differing loop lengths in the structures. Probab=58.39 E-value=12 Score=16.53 Aligned_cols=138 Identities=9% Similarity=0.038 Sum_probs=67.1 Q ss_pred HHHHHHHHHC-CCCCEEEEEECCC-HH------------------HHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCH Q ss_conf 9999999852-7998399996694-24------------------89999999999986489972999999667798784 Q gi|254780546|r 7 ESVRFFVRSL-VYPAHILVAVSGG-SD------------------SMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETA 66 (423) Q Consensus 7 ~~~~~~~~~l-~~~~~i~vAvSGG-~D------------------S~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s 66 (423) |.+...+..- ...+.|+|.+|-| .. ...++..++.+......| +++|.|||-. +. T Consensus 27 E~~~Ail~AAee~~sPVIiq~S~g~~~y~~g~~~~~~~~~~~v~g~i~~a~~v~~~a~~~~VP----VaLHLDHg~~-~~ 101 (345) T cd00946 27 STINAVLEAARDAKSPIIIQFSNGGAAFYAGKGLKNEKQKASIAGAIAAAHHVRSMAEHYGVP----VVLHTDHCAK-KL 101 (345) T ss_pred HHHHHHHHHHHHHCCCEEEECCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCC----EEEECCCCCC-CC T ss_conf 999999999999788999987775887705655432017788877899999999999876998----8997356753-22 Q ss_pred HHHHH----HHHHHHHHCCCC-EEEEEEECCC--CCCCCHHHHHHHHHHHHHHHCCCCCCCCHH------H--H------ Q ss_conf 68999----999999871898-8999975047--877803788854555543200123442012------3--4------ Q gi|254780546|r 67 KDEVR----YVSDVCSRLRIA-HSVVSWKNSK--PQTGLMAAAREARYALISEHAKTINATLIM------T--A------ 125 (423) Q Consensus 67 ~~e~~----~v~~~~~~lgi~-~~~~~~~~~~--~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~------~--a------ 125 (423) ..|.. .....+...+.+ |.++=+|... -..|++...+...|..-....-+.....+. . . T Consensus 102 ~~~~~~~~~~~~~~~~~~~~~gFtSVMiDgS~~p~eeNi~~TkevVe~Ah~~gvsVEaELG~vGG~EDgv~~~~~~~~~~ 181 (345) T cd00946 102 LPWFDGLLEADEEYFKQHGEPLFSSHMLDLSEEPLEENIEICKKYLERMAKINMWLEMEIGITGGEEDGVDNSGVDNAEL 181 (345) T ss_pred CHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCHHHC T ss_conf 10100057877999996488878457765898999999999999999986639814662034577566766555541212 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 566656689999862365444332 Q gi|254780546|r 126 HTFDDQLETVYMRSQRDYAEKGMG 149 (423) Q Consensus 126 h~~dD~~Et~l~rl~r~sg~~g~~ 149 (423) -+.-++++.+.-++...+|++.+. T Consensus 182 yT~Peea~~f~~~~~~~tgvd~LA 205 (345) T cd00946 182 YTQPEDVWYVYEALSKISPNFSIA 205 (345) T ss_pred CCCHHHHHHHHHHHCCCCCCCEEE T ss_conf 799999999999963677760565 No 130 >TIGR02057 PAPS_reductase phosphoadenosine phosphosulfate reductase; InterPro: IPR011800 Requiring thioredoxin as an electron donor, phosphoadenosine phosphosulphate reductase catalyzes the reduction of 3'-phosphoadenylylsulphate (PAPS) to sulphite and phospho-adenosine-phosphate (PAP). Found in enterobacteria, cyanobacteria, and yeast, PAPS reductase is related to a group of plant (IPR004508 from INTERPRO) and bacterial (IPR011798 from INTERPRO) enzymes preferring 5'-adenylylsulphate (APS) over PAPS as a substrate for reduction to sulphite.; GO: 0004604 phosphoadenylyl-sulfate reductase (thioredoxin) activity, 0019344 cysteine biosynthetic process. Probab=57.31 E-value=12 Score=16.41 Aligned_cols=175 Identities=13% Similarity=0.175 Sum_probs=104.7 Q ss_pred HHHHHHHHHHHCCCCC-EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 8999999998527998-399996694248999999999998648997299999966779878468999999999871898 Q gi|254780546|r 5 PIESVRFFVRSLVYPA-HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIA 83 (423) Q Consensus 5 p~~~~~~~~~~l~~~~-~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~ 83 (423) |.+.|.=.+..| |. .++..-|=|.|.++.+|++....+.+. + .+-++-+|-|-.. .+=-.++.++-.++.-| T Consensus 14 a~~iiAWs~~~~--P~G~Lv~tsaFg~~~lV~~hll~~~~keky-~--~~pvif~DTly~F--~~Tl~l~~~~~~~y~qP 86 (239) T TIGR02057 14 AQEIIAWSIETF--PTGELVQTSAFGIQALVILHLLSSILKEKY-P--DIPVIFIDTLYLF--PQTLELVDELTEKYKQP 86 (239) T ss_pred HHHHHHHHHHHC--CCCCEEECCHHHHHHHHHHHHHHHHHHHHC-C--CCCEEEECCCCCC--HHHHHHHHHHHHHHCCC T ss_conf 457999998755--999745301020478999999999876217-9--9867862111362--78999999999986276 Q ss_pred EEEEEEECCCCCCCCHHHHHHHHHHHHHHHCC------CCCCCCHHHHHHHHHHHHHHHHHH-HCCCCCCCCCCCCCCCC Q ss_conf 89999750478778037888545555432001------234420123456665668999986-23654443322357641 Q gi|254780546|r 84 HSVVSWKNSKPQTGLMAAAREARYALISEHAK------TINATLIMTAHTFDDQLETVYMRS-QRDYAEKGMGLSGMCDT 156 (423) Q Consensus 84 ~~~~~~~~~~~~~~~~~~ar~~r~~~~~~~~~------~~~~~~l~~ah~~dD~~Et~l~rl-~r~sg~~g~~l~~~~~~ 156 (423) - -......+......++.-.++|.-+..... ...+..+==.|=+=+-.++...-. .|.+ .+..-+.|+.. T Consensus 87 k-n~~L~~~~~~~~~s~a~~~AkYG~~l~~KD~~~~~~Y~~~~KvEP~~RAlkEL~~~Aw~tG~Rr~--Q~~~R~nL~v~ 163 (239) T TIGR02057 87 K-NLDLNLYKYDDVESEAAFEAKYGKLLWQKDVEKLEKYDDIAKVEPMQRALKELNAVAWFTGRRRD--QASARANLPVL 163 (239) T ss_pred C-CCCCEEEECCCCCCHHHHHHHHHHHHHCCCHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCHH--HHCCCCCCCCE T ss_conf 5-56330331688887898974420465405722233213434436233556888899987425543--42022368610 Q ss_pred CC-CCCCCEEECCHHCCCHHHHHHHHHHHCCCCC Q ss_conf 00-3579687101210428999998998189320 Q gi|254780546|r 157 IL-YDLNLWISRPFLRCRREDIRSFLLQRNISWC 189 (423) Q Consensus 157 ~~-~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wv 189 (423) -. ..+|.-=+-||++++++++-+|+.+++++|- T Consensus 164 ~~D~~~G~lKv~Pl~~W~~~~~~~Y~~~h~v~Y~ 197 (239) T TIGR02057 164 ELDLQNGILKVNPLIDWTRKQVYQYLAAHKVPYN 197 (239) T ss_pred EEECCCCCCEECCCCCCCHHHHHHHHHHCCCCCC T ss_conf 0001478103202102650227999986788468 No 131 >PTZ00142 6-phosphogluconate dehydrogenase; Provisional Probab=56.31 E-value=13 Score=16.30 Aligned_cols=79 Identities=14% Similarity=0.145 Sum_probs=53.5 Q ss_pred CCHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCC Q ss_conf 69899999999852799839999669424899999999999864899729999996677987846899999999987189 Q gi|254780546|r 3 LSPIESVRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRI 82 (423) Q Consensus 3 ~~p~~~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi 82 (423) +++.+.+..|+.++.+|++|++-|-.|..--..+.-+..++. + --+.||+|-.. -.+.....+.|++.|| T Consensus 55 i~~~~sl~e~v~sl~~pr~Iilmv~aG~~Vd~vi~~L~~~L~--~------GDIIID~GNs~--~~dt~rr~~~l~~kgI 124 (474) T PTZ00142 55 VKGYHSLEELVNSLKRPRRVMLLIKAGEAVDEFIDNILPHLE--K------GDIIIDGGNEW--YNNSERRIKLCKEKGI 124 (474) T ss_pred CCCCCCHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHCCC--C------CCEEEECCCCC--HHHHHHHHHHHHHCCC T ss_conf 646678999997379999899982698259999999985088--9------99898799888--6579999999985799 Q ss_pred CEEEEEEEC Q ss_conf 889999750 Q gi|254780546|r 83 AHSVVSWKN 91 (423) Q Consensus 83 ~~~~~~~~~ 91 (423) .+.-.-+.+ T Consensus 125 ~fld~GVSG 133 (474) T PTZ00142 125 LYIGMGVSG 133 (474) T ss_pred CEECCCCCC T ss_conf 186478884 No 132 >TIGR03531 selenium_SpcS O-phosphoseryl-tRNA(Sec) selenium transferase. In the archaea and eukaryotes, the conversion of the mischarged serine to selenocysteine (Sec) on its tRNA is accomplished in two steps. This enzyme, O-phosphoseryl-tRNA(Sec) selenium transferase, acts second, after a phosphophorylation step catalyzed by a homolog of the bacterial SelA protein. Probab=56.03 E-value=13 Score=16.27 Aligned_cols=23 Identities=17% Similarity=0.106 Sum_probs=12.6 Q ss_pred CHHCCCHHHHHHHHHHHCCCCCC Q ss_conf 12104289999989981893202 Q gi|254780546|r 168 PFLRCRREDIRSFLLQRNISWCE 190 (423) Q Consensus 168 PLL~~~r~~l~~~~~~~~i~wve 190 (423) |=..=.=.+|-.+|++++|+.+- T Consensus 218 PR~~D~i~eiAklC~~~~IpHvV 240 (444) T TIGR03531 218 PRSPDDIEEIAKICAKYDIPHVV 240 (444) T ss_pred CCCCCCHHHHHHHHHHCCCCEEE T ss_conf 99976699999999974998378 No 133 >PRK05647 purN phosphoribosylglycinamide formyltransferase; Reviewed Probab=55.58 E-value=13 Score=16.22 Aligned_cols=86 Identities=21% Similarity=0.227 Sum_probs=49.5 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCH Q ss_conf 83999966942489999999999986489972999999667798784689999999998718988999975047877803 Q gi|254780546|r 20 AHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSKPQTGLM 99 (423) Q Consensus 20 ~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~~~~~~~ 99 (423) .||+|=+||+.-.+ -.++... . .+..+.++..+.-|+ +++ .+-+.|++.|||+.++...... + T Consensus 2 kkiavl~SG~GSNl--~aii~a~-~-~~~l~~~i~~Visn~---~~a-----~~l~~A~~~~Ip~~~i~~~~~~---~-- 64 (200) T PRK05647 2 KRIVVLASGNGSNL--QAIIDAC-A-AGQLPAEVVAVISDR---PDA-----YGLERAEQAGIPTFVLDHKDFP---S-- 64 (200) T ss_pred CEEEEEECCCCHHH--HHHHHHH-H-CCCCCCEEEEEEECC---CCC-----HHHHHHHHCCCCEEEECCCCCC---C-- T ss_conf 78999991580449--9999998-7-599994899999789---753-----6659999759976996776679---9-- Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHH Q ss_conf 78885455554320012344201234 Q gi|254780546|r 100 AAAREARYALISEHAKTINATLIMTA 125 (423) Q Consensus 100 ~~ar~~r~~~~~~~~~~~~~~~l~~a 125 (423) |...-..+.+...+...+.++++ T Consensus 65 ---r~~~d~~i~~~L~~~~vDlIvLA 87 (200) T PRK05647 65 ---REAFDAALVEALDAYEPDLVVLA 87 (200) T ss_pred ---HHHHHHHHHHHHHHCCCCEEEEC T ss_conf ---89999999999986499999998 No 134 >COG4586 ABC-type uncharacterized transport system, ATPase component [General function prediction only] Probab=52.81 E-value=15 Score=15.92 Aligned_cols=26 Identities=8% Similarity=0.202 Sum_probs=14.7 Q ss_pred CCCCCEEEECHHHHCCCHHHHHHHHH Q ss_conf 04499799847783375578989999 Q gi|254780546|r 244 VHMRSIIAISQDILNIDSTLLFYLLR 269 (423) Q Consensus 244 ~~~~~~~~~~~~~~~~~~~~~~r~l~ 269 (423) .++....+++.....++-..+.++.+ T Consensus 172 Lh~p~VLfLDEpTvgLDV~aq~~ir~ 197 (325) T COG4586 172 LHPPKVLFLDEPTVGLDVNAQANIRE 197 (325) T ss_pred CCCCCEEEECCCCCCCCHHHHHHHHH T ss_conf 68983897448745751438999999 No 135 >PRK09589 celA 6-phospho-beta-glucosidase; Reviewed Probab=52.29 E-value=12 Score=16.48 Aligned_cols=58 Identities=10% Similarity=0.011 Sum_probs=43.6 Q ss_pred HHHHHHCCCCEEEEEEECC--CCCCC--CHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH Q ss_conf 9999871898899997504--78778--0378885455554320012344201234566656 Q gi|254780546|r 74 SDVCSRLRIAHSVVSWKNS--KPQTG--LMAAAREARYALISEHAKTINATLIMTAHTFDDQ 131 (423) Q Consensus 74 ~~~~~~lgi~~~~~~~~~~--~~~~~--~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~ 131 (423) -++++++|+.++-..+++. .|.+. .-+.+--..|+.+...+.+.|+.-++|-||-|-- T Consensus 73 i~lm~~lG~~~yRFSIsWsRI~P~g~~~~~N~~gl~~Y~~lId~l~~~GI~P~VTL~H~d~P 134 (479) T PRK09589 73 IQLFAEMGFKCFRTSIAWTRIFPQGDEQEPNEEGLQFYDDLFDECLKQGIEPVVTLSHFEMP 134 (479) T ss_pred HHHHHHCCCCEEEEECCHHHEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCH T ss_conf 99999819997885436766124787788699999999999999998399779972565673 No 136 >KOG2303 consensus Probab=52.10 E-value=15 Score=15.84 Aligned_cols=26 Identities=27% Similarity=0.289 Sum_probs=19.8 Q ss_pred CCCCEEEEEECCCHHHHHHHHHHHHH Q ss_conf 79983999966942489999999999 Q gi|254780546|r 17 VYPAHILVAVSGGSDSMGLLIALHSV 42 (423) Q Consensus 17 ~~~~~i~vAvSGG~DS~aLl~ll~~~ 42 (423) ....-+.+-+|||.||.+.+.+.+.+ T Consensus 347 s~~aGfflPLSGG~DSsatA~iV~sM 372 (706) T KOG2303 347 SGQAGFFLPLSGGVDSSATAAIVYSM 372 (706) T ss_pred CCCCCEEEECCCCCCCHHHHHHHHHH T ss_conf 38874477647886405899999999 No 137 >TIGR03356 BGL beta-galactosidase. Probab=51.76 E-value=11 Score=16.71 Aligned_cols=58 Identities=14% Similarity=0.080 Sum_probs=44.2 Q ss_pred HHHHHHCCCCEEEEEEECC--CCC-CCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH Q ss_conf 9999871898899997504--787-780378885455554320012344201234566656 Q gi|254780546|r 74 SDVCSRLRIAHSVVSWKNS--KPQ-TGLMAAAREARYALISEHAKTINATLIMTAHTFDDQ 131 (423) Q Consensus 74 ~~~~~~lgi~~~~~~~~~~--~~~-~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~ 131 (423) -++++++|+..+-..+++. .|. .+.-+.+-...|+.+.+.+.+.|+.-++|-||-|-- T Consensus 60 i~l~~~lG~~~yRfSI~WsRI~P~g~g~~N~~gl~~Y~~~i~~l~~~GIeP~VTL~Hfd~P 120 (427) T TIGR03356 60 VALMKELGVDAYRFSIAWPRIFPEGTGPVNPKGLDFYDRLVDELLEAGIEPFVTLYHWDLP 120 (427) T ss_pred HHHHHHCCCCEEEEECCHHHEECCCCCCCCHHHHHHHHHHHHHHHHHCCEEEEEEECCCCH T ss_conf 9999982999898325576543089898899999999999999998098789998556782 No 138 >PRK09593 arb 6-phospho-beta-glucosidase; Reviewed Probab=51.23 E-value=13 Score=16.21 Aligned_cols=57 Identities=14% Similarity=0.064 Sum_probs=43.8 Q ss_pred HHHHHHCCCCEEEEEEECC--CCCCCC--HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH Q ss_conf 9999871898899997504--787780--37888545555432001234420123456665 Q gi|254780546|r 74 SDVCSRLRIAHSVVSWKNS--KPQTGL--MAAAREARYALISEHAKTINATLIMTAHTFDD 130 (423) Q Consensus 74 ~~~~~~lgi~~~~~~~~~~--~~~~~~--~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD 130 (423) -++++++|+..+-..+++. .|.++. -+.+-...|+.+.+.+.+.|+.-++|-||-|= T Consensus 79 i~lm~~lG~~~YRFSIsWsRI~P~g~~~~~N~~gv~~Y~~lId~l~~~GI~P~VTL~H~dl 139 (478) T PRK09593 79 LALFAEMGFKTYRMSIAWTRIFPKGDELEPNEAGLQFYEDIFKECHKYGIEPLVTITHFDC 139 (478) T ss_pred HHHHHHHCCCEEEEECCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCC T ss_conf 9999981999886323374534678988869999999999999999839926998417778 No 139 >TIGR02407 ectoine_ectB diaminobutyrate--2-oxoglutarate aminotransferase; InterPro: IPR012773 Members of this family of class III pyridoxal-phosphate-dependent aminotransferases are diaminobutyrate--2-oxoglutarate aminotransferase (2.6.1.76 from EC) that catalyze the first step in ectoine biosynthesis from L-aspartate beta-semialdehyde. This family is readily separated phylogenetically from enzymes with the same substrate and product but involved in other process such as siderophore (Q9Z3R2 from SWISSPROT) or 1,3-diaminopropane (P44951 from SWISSPROT) biosynthesis. Ectoine is a compatible solute particularly effective in conferring salt tolerance.; GO: 0030170 pyridoxal phosphate binding, 0047307 diaminobutyrate-pyruvate transaminase activity, 0019491 ectoine biosynthetic process. Probab=51.19 E-value=16 Score=15.74 Aligned_cols=23 Identities=22% Similarity=0.328 Sum_probs=9.7 Q ss_pred CHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 84689999999998718988999 Q gi|254780546|r 65 TAKDEVRYVSDVCSRLRIAHSVV 87 (423) Q Consensus 65 ~s~~e~~~v~~~~~~lgi~~~~~ 87 (423) .|.+|.+-++++|++++|-+.|= T Consensus 212 As~eWLq~l~~lCr~~DILLIVD 234 (413) T TIGR02407 212 ASKEWLQRLEKLCRRHDILLIVD 234 (413) T ss_pred CCHHHHHHHHHHHHHCCCEEEEE T ss_conf 23557999999998649868773 No 140 >PRK13511 6-phospho-beta-galactosidase; Provisional Probab=50.98 E-value=13 Score=16.20 Aligned_cols=57 Identities=16% Similarity=0.174 Sum_probs=43.5 Q ss_pred HHHHHHCCCCEEEEEEECC--CCC-CCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH Q ss_conf 9999871898899997504--787-78037888545555432001234420123456665 Q gi|254780546|r 74 SDVCSRLRIAHSVVSWKNS--KPQ-TGLMAAAREARYALISEHAKTINATLIMTAHTFDD 130 (423) Q Consensus 74 ~~~~~~lgi~~~~~~~~~~--~~~-~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD 130 (423) -++++++|+.++-..+++. .|. .+.-+.+--..|..+.+.+.+.|+.-++|-||-|- T Consensus 60 i~l~~~lG~~~yRfSi~WsRI~P~g~g~~n~~gl~~Y~~~i~~l~~~GI~P~vTL~H~d~ 119 (470) T PRK13511 60 LKLAEEFGVNGIRISIAWSRIFPDGYGEVNPKGVEYYHKLFAECHKRHVEPFVTLHHFDT 119 (470) T ss_pred HHHHHHCCCCEEEEECCHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEECCCC T ss_conf 999998299978875127545868999859999999999999999819878999838898 No 141 >cd07025 Peptidase_S66 LD-Carboxypeptidase, a serine protease, includes microcin C7 self immunity protein. LD-carboxypeptidase (Muramoyltetrapeptide carboxypeptidase; EC 3.4.17.13; Merops family S66; initially described as Carboxypeptidase II) family also includes the microcin c7 self-immunity protein (MccF) as well as uncharacterized proteins including hypothetical proteins. LD-carboxypeptidase hydrolyzes the amide bond that links the dibasic amino acids to C-terminal D-amino acids. The physiological substrates of LD-carboxypeptidase are tetrapeptide fragments (such as UDP-MurNAc-tetrapeptides) that are produced when bacterial cell walls are degraded; they contain an L-configured residue (L-lysine or meso-diaminopimelic acid residue) as the penultimate residue and D-alanine as the ultimate residue. A possible role of LD-carboxypeptidase is in peptidoglycan recycling whereby the resulting tripeptide (precursor for murein synthesis) can be reconverted into peptidoglycan by attachment o Probab=50.37 E-value=16 Score=15.66 Aligned_cols=24 Identities=25% Similarity=0.069 Sum_probs=11.7 Q ss_pred EEEECCCHHHHHHHHHHHHHHHHC Q ss_conf 999669424899999999999864 Q gi|254780546|r 23 LVAVSGGSDSMGLLIALHSVLSDR 46 (423) Q Consensus 23 ~vAvSGG~DS~aLl~ll~~~~~~~ 46 (423) +||-|++.+....+....++++.. T Consensus 3 ivaPS~~~~~~~~~~~~~~~L~~~ 26 (282) T cd07025 3 IVAPSSPIDEEERLERAIARLESL 26 (282) T ss_pred EECCCCCCCCHHHHHHHHHHHHHC T ss_conf 991899999889999999999968 No 142 >PRK09852 cryptic 6-phospho-beta-glucosidase; Provisional Probab=47.45 E-value=17 Score=15.49 Aligned_cols=58 Identities=19% Similarity=0.074 Sum_probs=43.9 Q ss_pred HHHHHHCCCCEEEEEEECC--CCCCCC--HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH Q ss_conf 9999871898899997504--787780--378885455554320012344201234566656 Q gi|254780546|r 74 SDVCSRLRIAHSVVSWKNS--KPQTGL--MAAAREARYALISEHAKTINATLIMTAHTFDDQ 131 (423) Q Consensus 74 ~~~~~~lgi~~~~~~~~~~--~~~~~~--~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~ 131 (423) -++++++|+.++-..+++. .|.++. -+.+--..|..+.+.+.+.|+.-++|-||-|=- T Consensus 77 i~l~~~lG~~~YRFSIsWsRI~P~g~~~~~N~~gl~~Y~~lId~l~~~GI~P~VTL~H~dlP 138 (474) T PRK09852 77 IALMAEMGFKVFRTSIAWSRLFPQGDELTPNQQGIAFYRSVFEECKKYGIEPLVTLCHFDVP 138 (474) T ss_pred HHHHHHCCCCEEEEECCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCC T ss_conf 99999809997885436766005788788799999999999999998298589982686553 No 143 >TIGR01859 fruc_bis_ald_ fructose-1,6-bisphosphate aldolase, class II; InterPro: IPR011289 Fructose-1,6-bisphosphate (FBP) aldolases reversibly cleave FBP to two triose phosphates, glycerone phophate and D-glyceraldehyde 3-phosphate. They play a key role in both glycolysis (FBP cleavage) and gluconeogenesis (FBP synthesis). These enzymes can be divide into two classes based on their mode of catalysis. Class I FBP aldolases form a Schiff-base intermediate between glycerone phophate and an active site lysine residue, while the class II enzymes use a divalent cation such as Zn(2+) for catalysis . This entry represents class II FBP aldolases found in Gram-positive bacteria, a variety of Gram-negative baceria, and amitochondriate protists. These enzymes are homotetramers where each momomer forms a TIM barrel fold .; GO: 0004332 fructose-bisphosphate aldolase activity, 0008270 zinc ion binding, 0006096 glycolysis, 0030388 fructose 16-bisphosphate metabolic process. Probab=47.36 E-value=18 Score=15.34 Aligned_cols=153 Identities=12% Similarity=0.062 Sum_probs=73.7 Q ss_pred CCCCEEEEEECCCH-------HHHHHH----HHHHHHHHHCCCCCEEEEEEEECCCCCCCHH------------------ Q ss_conf 79983999966942-------489999----9999999864899729999996677987846------------------ Q gi|254780546|r 17 VYPAHILVAVSGGS-------DSMGLL----IALHSVLSDRSFGKIKFSAISVDHCLRETAK------------------ 67 (423) Q Consensus 17 ~~~~~i~vAvSGG~-------DS~aLl----~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~------------------ 67 (423) ...+.|+|++|=|. |-.... -+.+.+..+... +.- +|+|.|||-.-|+- T Consensus 38 e~~SPVI~q~SeGA~KY~GkGd~~~G~~~~~~mV~~L~E~~~~-tVP-VaLHLDHG~s~E~c~~AI~aGfsSVMIDgShl 115 (339) T TIGR01859 38 EERSPVIVQVSEGAIKYMGKGDATLGYKVAVAMVKTLIESMSI-TVP-VALHLDHGSSYESCIKAIKAGFSSVMIDGSHL 115 (339) T ss_pred HHCCCEEEEHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCC-CEE-EEEECCCCCCHHHHHHHHHCCCCEEECCCCCC T ss_conf 7368755312027874048531778899999999999986799-320-68646887648999999964981672024268 Q ss_pred ------HHHHHHHHHHHHCC--CCEEE--EEEECCCCC-CCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf ------89999999998718--98899--997504787-78037888545555432001234420123456665668999 Q gi|254780546|r 68 ------DEVRYVSDVCSRLR--IAHSV--VSWKNSKPQ-TGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVY 136 (423) Q Consensus 68 ------~e~~~v~~~~~~lg--i~~~~--~~~~~~~~~-~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l 136 (423) +.+..|-+++...| |.-.. =++-+..+. .-++..+.-+--.-....+++.+.++|+-|- T Consensus 116 P~EEN~~~Tkkvve~Ah~~GDYvSVEaElG~~gG~ED~i~V~~~~~~ladp~Ea~~fV~eT~~D~LA~aI---------- 185 (339) T TIGR01859 116 PFEENVALTKKVVEIAHAKGDYVSVEAELGTIGGIEDGIVVVEKEGELADPDEAEQFVKETGVDYLAAAI---------- 185 (339) T ss_pred CHHHHHHHHHHHHHHHHHCCCEEEEEEEECEECCEECCEEEEECCCEECCHHHHHHHHHHCCCCEECCCC---------- T ss_conf 7888888889999999851997999998430352762737871563675879999999763877540100---------- Q ss_pred HHHHCCCCCCCCCCCCCCCC-CCCCCCCEEECCHHCCCHHHHHHHHHHHC-CCCCCC Q ss_conf 98623654443322357641-00357968710121042899999899818-932026 Q gi|254780546|r 137 MRSQRDYAEKGMGLSGMCDT-ILYDLNLWISRPFLRCRREDIRSFLLQRN-ISWCED 191 (423) Q Consensus 137 ~rl~r~sg~~g~~l~~~~~~-~~~~~~~~i~RPLL~~~r~~l~~~~~~~~-i~wveD 191 (423) |+-. | ..-.+.. .....++.++=|-|+|. .|...-+.-+ +|-|== T Consensus 186 -----Gt~H-G--a~kykgeqCtrnaDGvLiWp~LdF~--~l~~I~~~~~G~PLVLH 232 (339) T TIGR01859 186 -----GTSH-G--AFKYKGEQCTRNADGVLIWPGLDFE--RLKEIKELTNGIPLVLH 232 (339) T ss_pred -----CCCC-C--CCCCCCCCCCCCCCCEECCCCCCHH--HHHHHHHHHCCCCEEEC T ss_conf -----3502-3--5788866533046723548888978--99999875089766735 No 144 >pfam00232 Glyco_hydro_1 Glycosyl hydrolase family 1. Probab=47.02 E-value=15 Score=15.86 Aligned_cols=57 Identities=14% Similarity=0.093 Sum_probs=44.1 Q ss_pred HHHHHCCCCEEEEEEECC--CCC-CCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH Q ss_conf 999871898899997504--787-780378885455554320012344201234566656 Q gi|254780546|r 75 DVCSRLRIAHSVVSWKNS--KPQ-TGLMAAAREARYALISEHAKTINATLIMTAHTFDDQ 131 (423) Q Consensus 75 ~~~~~lgi~~~~~~~~~~--~~~-~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~ 131 (423) ++++++|+..+-..+++. .|. .+.-+.+--..|+.+.+.+.+.|+.-++|-||-|-- T Consensus 65 ~l~~~lG~~ayRfSIsWsRI~P~g~g~~N~~gl~~Y~~li~~l~~~GIeP~VTL~Hfd~P 124 (454) T pfam00232 65 ALMKELGVTAYRFSISWPRIFPKGEGEINEAGLDYYDRLIDELLAAGIEPYVTLYHWDLP 124 (454) T ss_pred HHHHHCCCCEEEEECCHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCH T ss_conf 999973999898225698754289898598999999999999998499689985277766 No 145 >COG0362 Gnd 6-phosphogluconate dehydrogenase [Carbohydrate transport and metabolism] Probab=46.62 E-value=17 Score=15.55 Aligned_cols=77 Identities=23% Similarity=0.246 Sum_probs=50.7 Q ss_pred CCHHHHHHHHHHHCCCCCEEEEEECCCH--HHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHC Q ss_conf 6989999999985279983999966942--48999999999998648997299999966779878468999999999871 Q gi|254780546|r 3 LSPIESVRFFVRSLVYPAHILVAVSGGS--DSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRL 80 (423) Q Consensus 3 ~~p~~~~~~~~~~l~~~~~i~vAvSGG~--DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~l 80 (423) +.|..++..|+.+|.+|++|++-|--|. |.+ ...+.-++.. -=+.||-|-...-| ...-.+...+. T Consensus 52 i~~~~sieefV~~Le~PRkI~lMVkAG~~VD~~--I~~L~p~Le~--------gDIiIDGGNs~y~D--T~RR~~eL~~~ 119 (473) T COG0362 52 IVPAYSIEEFVASLEKPRKILLMVKAGTPVDAV--IEQLLPLLEK--------GDIIIDGGNSHYKD--TIRRNKELSEK 119 (473) T ss_pred CCCCCCHHHHHHHHCCCCEEEEEEECCCCHHHH--HHHHHHHCCC--------CCEEEECCCCCCCH--HHHHHHHHHHC T ss_conf 456685999998715884699998269927899--9998752578--------89898688767820--78999998863 Q ss_pred CCCEEEEEEEC Q ss_conf 89889999750 Q gi|254780546|r 81 RIAHSVVSWKN 91 (423) Q Consensus 81 gi~~~~~~~~~ 91 (423) ||-|.-.-+.+ T Consensus 120 Gi~FvG~GVSG 130 (473) T COG0362 120 GILFVGMGVSG 130 (473) T ss_pred CCEEEECCCCC T ss_conf 88489516666 No 146 >COG0794 GutQ Predicted sugar phosphate isomerase involved in capsule formation [Cell envelope biogenesis, outer membrane] Probab=46.60 E-value=18 Score=15.26 Aligned_cols=22 Identities=14% Similarity=0.169 Sum_probs=12.6 Q ss_pred HHHHHHHHHHHHHCCCCEEEEE Q ss_conf 6899999999987189889999 Q gi|254780546|r 67 KDEVRYVSDVCSRLRIAHSVVS 88 (423) Q Consensus 67 ~~e~~~v~~~~~~lgi~~~~~~ 88 (423) .+|...+...|++.|++...++ T Consensus 99 T~el~~~~~~aK~~g~~liaiT 120 (202) T COG0794 99 TKELLNLAPKAKRLGAKLIAIT 120 (202) T ss_pred HHHHHHHHHHHHHCCCCEEEEE T ss_conf 7799999999997599489995 No 147 >KOG2644 consensus Probab=46.58 E-value=18 Score=15.26 Aligned_cols=22 Identities=27% Similarity=0.677 Sum_probs=12.0 Q ss_pred CHHCCCHHHHHHHHHHHCCCCC Q ss_conf 1210428999998998189320 Q gi|254780546|r 168 PFLRCRREDIRSFLLQRNISWC 189 (423) Q Consensus 168 PLL~~~r~~l~~~~~~~~i~wv 189 (423) |+|+++=.++-++.+....+++ T Consensus 208 pll~ws~t~vw~~l~~~~~p~c 229 (282) T KOG2644 208 PLLEWSYTDVWDLLREGNLPYC 229 (282) T ss_pred CCCCCHHHHHHHHHHCCCCCEE T ss_conf 6344148789999861797636 No 148 >pfam01182 Glucosamine_iso Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase. Probab=46.53 E-value=18 Score=15.26 Aligned_cols=102 Identities=10% Similarity=0.092 Sum_probs=60.0 Q ss_pred HHHHHHHCCCCCE-EEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECC--CCCCCHHH-HHHH-HHHHHHHCCCC Q ss_conf 9999985279983-999966942489999999999986489972999999667--79878468-9999-99999871898 Q gi|254780546|r 9 VRFFVRSLVYPAH-ILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDH--CLRETAKD-EVRY-VSDVCSRLRIA 83 (423) Q Consensus 9 ~~~~~~~l~~~~~-i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdH--glr~~s~~-e~~~-v~~~~~~lgi~ 83 (423) |++-.+-+++.+| +++|++||.-=+.+...|.+..+.....--+++.++.|- |+.++..+ -..+ -+.+...++|| T Consensus 7 ~~~i~~~~~~~~k~~vlgLatG~TP~~~Y~~L~~~~~~~~i~~~~v~~F~lDEyvgl~~~~p~S~~~~l~~~l~~~i~Ip 86 (236) T pfam01182 7 INRINSFKPTAERPFVLGLPTGSTPLGTYKKLIEAYKAGRVSFKHVVTFNMDEYVGLPPEHPESYHSFMYNNFFNHIDIP 86 (236) T ss_pred HHHHHHCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHCCCCCHHHEEEECCCEECCCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 99999619899998799838987889999999999981799736769983704458897642359999999987355998 Q ss_pred EEEEEEECCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 899997504787780378885455554320 Q gi|254780546|r 84 HSVVSWKNSKPQTGLMAAAREARYALISEH 113 (423) Q Consensus 84 ~~~~~~~~~~~~~~~~~~ar~~r~~~~~~~ 113 (423) ..-+..-. ....+.++. ..+|...... T Consensus 87 ~eni~~~~-g~~~d~~~~--~~~ye~~I~~ 113 (236) T pfam01182 87 PENIHILN-GNAADIDAE--CRNYEEKIKS 113 (236) T ss_pred HHHCCCCC-CCCCCHHHH--HHHHHHHHHH T ss_conf 99846899-998899999--9999999997 No 149 >KOG2316 consensus Probab=45.24 E-value=19 Score=15.12 Aligned_cols=63 Identities=14% Similarity=0.172 Sum_probs=36.1 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCC-C-HHHHHH----------HHHHHHHHCCCCEEEEE Q ss_conf 39999669424899999999999864899729999996677987-8-468999----------99999987189889999 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRE-T-AKDEVR----------YVSDVCSRLRIAHSVVS 88 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~-~-s~~e~~----------~v~~~~~~lgi~~~~~~ 88 (423) |++--+|||+||---++-+.. . +-+++|+- +|+| + +.+|.. -+.-+++-+++|.+..+ T Consensus 2 rvvaLiSGGKDScynmm~cv~----~---gHeiVaLa---nl~p~~d~~delDSyMyQtVGh~~i~lyaecm~lPlyrr~ 71 (277) T KOG2316 2 RVVALISGGKDSCYNMMCCVR----L---GHEIVALA---NLHPKEDESDELDSYMYQTVGHDVIDLYAECMGLPLYRRR 71 (277) T ss_pred CEEEEEECCHHHHHHHHHHHH----C---CCEEEEEE---CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 189997178078999999998----4---87435643---2468755520577889886027789999998468346642 Q ss_pred EECCC Q ss_conf 75047 Q gi|254780546|r 89 WKNSK 93 (423) Q Consensus 89 ~~~~~ 93 (423) ..... T Consensus 72 i~g~s 76 (277) T KOG2316 72 IRGRS 76 (277) T ss_pred CCCCC T ss_conf 14864 No 150 >pfam08497 Radical_SAM_N Radical SAM N-terminal. This domain tends to occur to the N-terminus of the pfam04055 domain in hypothetical bacterial proteins. Probab=45.21 E-value=16 Score=15.57 Aligned_cols=20 Identities=5% Similarity=0.099 Sum_probs=9.7 Q ss_pred HHHHHHHHHHCCCCEEEEEE Q ss_conf 99999999871898899997 Q gi|254780546|r 70 VRYVSDVCSRLRIAHSVVSW 89 (423) Q Consensus 70 ~~~v~~~~~~lgi~~~~~~~ 89 (423) +..+..+.+..|...-|+.. T Consensus 35 ~AiIgR~Le~~GyrVgIIaQ 54 (298) T pfam08497 35 MAIIGRVLEAQGFRVGIIAQ 54 (298) T ss_pred HHHHHHHHHHCCCEEEEEEC T ss_conf 89999999986965989718 No 151 >pfam10309 DUF2414 Protein of unknown function (DUF2414). This is a family of proteins conserved from fungi to mammals. One mouse member is referred to as ELG protein but this is not a homologue of human ELG protein. The function is not known. Probab=44.65 E-value=16 Score=15.61 Aligned_cols=34 Identities=24% Similarity=0.480 Sum_probs=22.9 Q ss_pred ECCHHCCCHHHHHHHHHHH----C---CCCCCCCCCCCCCCC Q ss_conf 1012104289999989981----8---932026678642210 Q gi|254780546|r 166 SRPFLRCRREDIRSFLLQR----N---ISWCEDPSNTDDRFE 200 (423) Q Consensus 166 ~RPLL~~~r~~l~~~~~~~----~---i~wveDpSN~d~~f~ 200 (423) +|=+=.++-++|..|..++ + |.||+|.| .+.-|. T Consensus 10 irGVD~lsT~Dik~y~~~y~~~~~p~~IEWIdDtS-~Nvvf~ 50 (62) T pfam10309 10 IRGVDELSTDDIKAYVKEYFDEESPFRIEWIDDTS-ANIVFK 50 (62) T ss_pred EECCCCCCHHHHHHHHHHHCCCCCCCEEEEECCCC-EEEEEC T ss_conf 97278798799999999975658970489865983-799977 No 152 >cd01568 QPRTase_NadC Quinolinate phosphoribosyl transferase (QAPRTase or QPRTase), also called nicotinate-nucleotide pyrophosphorylase, is involved in the de novo synthesis of NAD in both prokaryotes and eukaryotes. It catalyses the reaction of quinolinic acid (QA) with 5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence of Mg2+ to produce nicotinic acid mononucleotide (NAMN), pyrophosphate and carbon dioxide. QPRTase functions as a homodimer with two active sites, each formed by the C-terminal region of one subunit and the N-terminal region of the other. Probab=43.69 E-value=20 Score=14.96 Aligned_cols=14 Identities=21% Similarity=0.150 Sum_probs=5.7 Q ss_pred HHHHHHHC-CCCEEE Q ss_conf 99999871-898899 Q gi|254780546|r 73 VSDVCSRL-RIAHSV 86 (423) Q Consensus 73 v~~~~~~l-gi~~~~ 86 (423) +..+++.+ |+.... T Consensus 47 ~~~i~~~~~~~~~~~ 61 (269) T cd01568 47 AEEVFELLDGIEVEW 61 (269) T ss_pred HHHHHHHCCCEEEEE T ss_conf 999999859819999 No 153 >PRK09287 6-phosphogluconate dehydrogenase; Validated Probab=43.68 E-value=20 Score=14.96 Aligned_cols=79 Identities=19% Similarity=0.191 Sum_probs=51.7 Q ss_pred CCHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCC Q ss_conf 69899999999852799839999669424899999999999864899729999996677987846899999999987189 Q gi|254780546|r 3 LSPIESVRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRI 82 (423) Q Consensus 3 ~~p~~~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi 82 (423) +.|.+.+..|+.+|.+|++|++-|--|..--..+.-+..++ .+ --+.||.|-.. -.+...-.+.|++.|| T Consensus 39 ~~~~~~l~e~v~sl~~PR~IilmV~aG~~vD~vI~~L~~~L--~~------gDIiIDgGNS~--y~dT~rR~~~l~~kgI 108 (459) T PRK09287 39 IVPAYTLEEFVASLEKPRKILLMVKAGAPVDAVIEQLLPLL--EK------GDIIIDGGNSN--YTDTIRREKELAEKGI 108 (459) T ss_pred CCCCCCHHHHHHHCCCCCEEEEECCCCCCHHHHHHHHHHHC--CC------CCEEEECCCCC--HHHHHHHHHHHHHCCC T ss_conf 50358999999736788879998168842999999998508--89------99898799976--4337999999997599 Q ss_pred CEEEEEEEC Q ss_conf 889999750 Q gi|254780546|r 83 AHSVVSWKN 91 (423) Q Consensus 83 ~~~~~~~~~ 91 (423) .|.-.-+.+ T Consensus 109 ~Flg~GVSG 117 (459) T PRK09287 109 HFIGMGVSG 117 (459) T ss_pred EEECCCCCC T ss_conf 064578888 No 154 >PRK00162 glpE thiosulfate sulfurtransferase; Validated Probab=42.98 E-value=20 Score=14.89 Aligned_cols=50 Identities=22% Similarity=0.319 Sum_probs=29.9 Q ss_pred HHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCC Q ss_conf 8999999998527998399996694248999999999998648997299999966779 Q gi|254780546|r 5 PIESVRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCL 62 (423) Q Consensus 5 p~~~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHgl 62 (423) |.+.+..++..+.....|+|-|-+|..|+..+..|.+ .++ + .+.+++.|+ T Consensus 44 p~~~l~~~~~~~~~d~~ivv~C~~G~rS~~aa~~L~~----~Gf---~-~V~~l~GG~ 93 (104) T PRK00162 44 TNDSLGAFMRDADFDTPVMVMCYHGNSSQGAAQYLLQ----QGF---D-VVYSIDGGF 93 (104) T ss_pred CHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHHHH----CCC---C-CEEEECCHH T ss_conf 9899999997368889889995988189999999997----376---4-389815459 No 155 >TIGR00853 pts-lac PTS system, lactose/cellobiose family IIB component; InterPro: IPR014350 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) , is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC . These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose , , , . According to structural and sequence analyses, the PTS EIIB domain (2.7.1.69 from EC) can be divided in five groups , , : The PTS EIIB type 1 domain, which is found in the Glucose class of PTS, has an average length of about 80 amino acids. It forms a split alpha/beta sandwich composed of an antiparallel sheet (beta 1 to beta 4) and three alpha helices superimposed onto one side of the sheet. The phosphorylation site (Cys) is located at the end of the first beta strand on a protrusion formed by the edge of beta 1 and the reverse turn between beta 1 and beta 2 . The PTS EIIB type 2 domain, which is found in the Mannitol class of PTS, has an average length of about 100 amino acids. It consists of a four stranded parallel beta sheet flanked by two alpha helices (alpha 1 and 3) on one face and helix alpha 2 on the opposite face, with a characteristic Rossmann fold comprising two right-handed beta-alpha-beta motifs. The phosphorylation site (Cys) is located at the N-terminus of the domain, in the first beta strand. The PTS EIIB type 3 domain, which is found in the Lactose class of PTS, has an average length of about 100 amino acids. It is composed of a central four-stranded parallel open twisted beta sheet, which is flanked by three alpha helices on the concave side and two on the convex side of the beta sheet. The phosphorylation site (Cys) is located in the C-terminal end of the first beta strand . The PTS EIIB type 4 domain, which is found in the Mannose class of PTS, has an average length of about 160 amino acids. It has a central core of seven parallel beta strands surrounded by a total of six alpha-helices. Three helices cover the front face, one the back face with the remaining two capping the central beta sheet at the top and bottom. The phosphorylation site (His) is located at the suface exposed loop between strand 1 and helix 1 . The PTS EIIB type 5 domain, which is found in the Sorbitol class of PTS, has an average length of about 190 amino acids. The phosphorylation site (Cys) is located in the N-terminus of the domain. The EIIB type 3 protein is often found downstream of EIIC type 3. This entry hits fewer proteins than IPR013012 from INTERPRO. . Probab=41.81 E-value=21 Score=14.77 Aligned_cols=71 Identities=14% Similarity=0.176 Sum_probs=52.1 Q ss_pred HCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCC-------------CCHHHHHHHHHHHHHHCC Q ss_conf 5279983999966942489999999999986489972999999667798-------------784689999999998718 Q gi|254780546|r 15 SLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLR-------------ETAKDEVRYVSDVCSRLR 81 (423) Q Consensus 15 ~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr-------------~~s~~e~~~v~~~~~~lg 81 (423) ......+|+|=|+||.=|.-|.-.+....+.... +..+.|+- .+.. |+...--...++....+| T Consensus 42 ~~~n~~~iLl~CaaGmstsLLv~~l~k~A~~~~~-~~~i~A~~--~~~~~e~~~~~d~VlLaPQ~~~~~~~lk~~t~~~G 118 (142) T TIGR00853 42 NIMNETNILLLCAAGMSTSLLVNKLNKAAKEYGV-PVKIAAVS--YGAAMEILDDADVVLLAPQVAYMLPDLKKETDKKG 118 (142) T ss_pred HHCCCCEEEEEECCCCCHHHHHHHHHHHHHHCCC-CEEEEEEC--CCHHHHHHHHHCEEEEHHHHHHHHHHHHHHCCCCC T ss_conf 2003211688876973547899999999984599-75888405--76344335350431120326755799998513589 Q ss_pred CCEEEEE Q ss_conf 9889999 Q gi|254780546|r 82 IAHSVVS 88 (423) Q Consensus 82 i~~~~~~ 88 (423) ||..+.. T Consensus 119 ip~~~i~ 125 (142) T TIGR00853 119 IPVEVIN 125 (142) T ss_pred CCEEEEC T ss_conf 7536638 No 156 >COG0299 PurN Folate-dependent phosphoribosylglycinamide formyltransferase PurN [Nucleotide transport and metabolism] Probab=41.53 E-value=22 Score=14.74 Aligned_cols=85 Identities=20% Similarity=0.227 Sum_probs=51.4 Q ss_pred EEEEEECCCHHHH-HHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCH Q ss_conf 3999966942489-999999999986489972999999667798784689999999998718988999975047877803 Q gi|254780546|r 21 HILVAVSGGSDSM-GLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSKPQTGLM 99 (423) Q Consensus 21 ~i~vAvSGG~DS~-aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~~~~~~~ 99 (423) ||+|=.||+.--+ +++..+ . .+..+.++.++..|+. -.++.+.+++.|||..++.-.... T Consensus 2 ki~VlaSG~GSNlqaiida~----~-~~~~~a~i~~Visd~~--------~A~~lerA~~~gIpt~~~~~k~~~------ 62 (200) T COG0299 2 KIAVLASGNGSNLQAIIDAI----K-GGKLDAEIVAVISDKA--------DAYALERAAKAGIPTVVLDRKEFP------ 62 (200) T ss_pred EEEEEEECCCCCHHHHHHHH----H-CCCCCCEEEEEEECCC--------CCHHHHHHHHCCCCEEEECCCCCC------ T ss_conf 49999947965599999987----6-4898817999995797--------778999999849988974445678------ Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHH Q ss_conf 788854555543200123442012345 Q gi|254780546|r 100 AAAREARYALISEHAKTINATLIMTAH 126 (423) Q Consensus 100 ~~ar~~r~~~~~~~~~~~~~~~l~~ah 126 (423) .|.+.-..+.....+.+.+.+++|= T Consensus 63 --~r~~~d~~l~~~l~~~~~dlvvLAG 87 (200) T COG0299 63 --SREAFDRALVEALDEYGPDLVVLAG 87 (200) T ss_pred --CHHHHHHHHHHHHHHCCCCEEEECC T ss_conf --7789999999999852999999866 No 157 >cd03115 SRP The signal recognition particle (SRP) mediates the transport to or across the plasma membrane in bacteria and the endoplasmic reticulum in eukaryotes. SRP recognizes N-terminal sighnal sequences of newly synthesized polypeptides at the ribosome. The SRP-polypeptide complex is then targeted to the membrane by an interaction between SRP and its cognated receptor (SR). In mammals, SRP consists of six protein subunits and a 7SL RNA. One of these subunits is a 54 kd protein (SRP54), which is a GTP-binding protein that interacts with the signal sequence when it emerges from the ribosome. SRP54 is a multidomain protein that consists of an N-terminal domain, followed by a central G (GTPase) domain and a C-terminal M domain. Probab=40.28 E-value=22 Score=14.61 Aligned_cols=59 Identities=12% Similarity=0.108 Sum_probs=44.8 Q ss_pred EEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 9999669424899999999999864899729999996677987846899999999987189889999 Q gi|254780546|r 22 ILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVS 88 (423) Q Consensus 22 i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~ 88 (423) .+|+..|-.-.+.++.+++.+.. + +.++..++.|.- |..+ .+..+.+|+.+|+|++... T Consensus 4 ~lvGptGvGKTTTiaKLA~~~~~-~---~~kV~lit~Dt~-R~gA---~eQL~~~a~~l~v~~~~~~ 62 (173) T cd03115 4 LLVGLQGVGKTTTAAKLALYLKK-K---GKKVLLVAADTY-RPAA---IEQLRVLGEQVGVPVFEEG 62 (173) T ss_pred EEECCCCCCHHHHHHHHHHHHHH-C---CCEEEEEECCCC-CHHH---HHHHHHHHHHCCCEEEECC T ss_conf 99899999889999999999997-6---992899974887-5779---9999999997498599227 No 158 >PRK11498 bcsA cellulose synthase catalytic subunit; Provisional Probab=39.14 E-value=23 Score=14.49 Aligned_cols=51 Identities=12% Similarity=0.072 Sum_probs=35.9 Q ss_pred CCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHH Q ss_conf 8997299999966779878468999999999871898899997504787780378885 Q gi|254780546|r 47 SFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSKPQTGLMAAARE 104 (423) Q Consensus 47 ~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~~~~~~~~~ar~ 104 (423) ..|+-++.+...|-|-|++ ++++|+++|+.+....-..-.+.+|+..+-.. T Consensus 300 DYP~dKl~VyvLDDG~R~e-------~r~lAeelGv~YitR~~N~HAKAGNLNhAL~~ 350 (858) T PRK11498 300 DWPKDKLNIWILDDGGREE-------FRQFAQEVGVGYIARPTHEHAKAGNINNALKY 350 (858) T ss_pred CCCHHHCEEEEECCCCCHH-------HHHHHHHHCCCEEECCCCCCCCCCCHHHHHHH T ss_conf 9980117699932999989-------99999985997673688866642267888864 No 159 >PRK13937 phosphoheptose isomerase; Provisional Probab=38.09 E-value=24 Score=14.38 Aligned_cols=51 Identities=14% Similarity=0.175 Sum_probs=29.9 Q ss_pred HHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCC--CCCEEEEEEEECC Q ss_conf 999998527998399996694248999999999998648--9972999999667 Q gi|254780546|r 9 VRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRS--FGKIKFSAISVDH 60 (423) Q Consensus 9 ~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~--~~~~~l~a~~vdH 60 (423) +....+.+..+.+|.+.=.||+-|.|- |++.++..+-. .+++....++-|. T Consensus 32 a~~i~~~~~~g~kI~~~GNGGSaa~A~-Hfa~dl~~~~~~~r~~lpaisL~~d~ 84 (192) T PRK13937 32 AEALIEALASGGKLLLCGNGGSAADAQ-HIAAELVGRYKKERPALPAIALTTDT 84 (192) T ss_pred HHHHHHHHHCCCEEEEEECCHHHHHHH-HHHHHHHCCCCCCCCCCCEEECCCCH T ss_conf 999999998799899996863188899-99999964135578985357767882 No 160 >TIGR00693 thiE thiamine-phosphate pyrophosphorylase; InterPro: IPR003733 Thiamine monophosphate synthase (TMP) (2.5.1.3 from EC) catalyzes the substitution of the pyrophosphate of 2-methyl-4-amino-5- hydroxymethylpyrimidine pyrophosphate by 4-methyl-5- (beta-hydroxyethyl)thiazole phosphate to yield thiamine phosphate in the thiamine biosynthesis pathway . TENI, a protein from Bacillus subtilis that regulates the production of several extracellular enzymes by reducing alkaline protease production belongs to this group .; GO: 0004789 thiamin-phosphate diphosphorylase activity, 0009228 thiamin biosynthetic process. Probab=37.99 E-value=24 Score=14.37 Aligned_cols=23 Identities=9% Similarity=0.261 Sum_probs=18.6 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 46899999999987189889999 Q gi|254780546|r 66 AKDEVRYVSDVCSRLRIAHSVVS 88 (423) Q Consensus 66 s~~e~~~v~~~~~~lgi~~~~~~ 88 (423) --++++.++++|++++|+|.|-. T Consensus 51 ~~~~A~~l~~lc~~y~~~f~vND 73 (210) T TIGR00693 51 RLELAEKLRELCRKYGVPFIVND 73 (210) T ss_pred HHHHHHHHHHHHHHCCCCEEECC T ss_conf 99999999999987089768828 No 161 >smart00022 PLAc Cytoplasmic phospholipase A2, catalytic subunit. Cytosolic phospholipases A2 hydrolyse arachidonyl phospholipids. Family includes phospholipases B isoforms. Probab=37.89 E-value=24 Score=14.36 Aligned_cols=25 Identities=28% Similarity=0.274 Sum_probs=16.2 Q ss_pred HHHHHHHHHCC------------CCCEEEEEECCCHH Q ss_conf 99999998527------------99839999669424 Q gi|254780546|r 7 ESVRFFVRSLV------------YPAHILVAVSGGSD 31 (423) Q Consensus 7 ~~~~~~~~~l~------------~~~~i~vAvSGG~D 31 (423) ++.++|++..+ .-.+|+||+|||.= T Consensus 51 ~aL~~~L~~~~~~~~d~~~~~~~~~P~Igia~SGGGy 87 (549) T smart00022 51 EAMKSFLGRANSNFLDSSLLNSSDVPVIAIAGSGGGF 87 (549) T ss_pred HHHHHHHHHCCCCCCCHHHCCCCCCCEEEEEECCCCH T ss_conf 9999999754798886543266679859999568769 No 162 >TIGR01198 pgl 6-phosphogluconolactonase; InterPro: IPR005900 6-phosphogluconolactonase (3.1.1.31 from EC) is the enzyme responsible for the hydrolysis of 6-phosphogluconolactone to 6-phosphogluconate, the second step in the pentose phosphate pathway.6-phospho-D-glucono-1,5-lactone + H2O = 6-phospho-D-gluconate. ; GO: 0017057 6-phosphogluconolactonase activity, 0006098 pentose-phosphate shunt. Probab=37.83 E-value=24 Score=14.35 Aligned_cols=105 Identities=18% Similarity=0.171 Sum_probs=65.1 Q ss_pred HHHHHHH-HHCCCCCEEEEEECCCHHHHHHHHHHHHHHH--HCCCCCEEEEEEEECCCCCCCHHHHHH---HHHHHHHHC Q ss_conf 9999999-8527998399996694248999999999998--648997299999966779878468999---999999871 Q gi|254780546|r 7 ESVRFFV-RSLVYPAHILVAVSGGSDSMGLLIALHSVLS--DRSFGKIKFSAISVDHCLRETAKDEVR---YVSDVCSRL 80 (423) Q Consensus 7 ~~~~~~~-~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~--~~~~~~~~l~a~~vdHglr~~s~~e~~---~v~~~~~~l 80 (423) +.|..++ +-|.+..++.||||||.=-.+|+-.|..-.. .-++. +++..=+|--+=|.++.|-- .=..+.++. T Consensus 14 ~~~~~~~~~Al~~~~~~~~alSGGr~p~~L~~~La~~~~~~~~~W~--~~~~~l~DER~VP~~h~dSN~~~~r~~LL~~~ 91 (251) T TIGR01198 14 ERIATKLQKALAERGQFLLALSGGRSPIALLEALAAQEAGSKLDWS--RIHLFLGDERYVPLDHADSNTGLAREALLDRV 91 (251) T ss_pred HHHHHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHHHHCCCCCCCC--CEEEEEEEEEECCCCCCHHHHHHHHHHHHCCC T ss_conf 9999999999997689689982487457899997786343658802--15788632321479880466899999974036 Q ss_pred CCCEEEEEEECCCC---CCCCHHHHHHHHHHHHHHHCC Q ss_conf 89889999750478---778037888545555432001 Q gi|254780546|r 81 RIAHSVVSWKNSKP---QTGLMAAAREARYALISEHAK 115 (423) Q Consensus 81 gi~~~~~~~~~~~~---~~~~~~~ar~~r~~~~~~~~~ 115 (423) .||..-+..-.... +..++..|.. |..-..... T Consensus 92 ~i~~~~i~~~~~~~~~~g~~~e~~A~~--y~~~l~~~~ 127 (251) T TIGR01198 92 AIPASNIHPMPTELSELGADIEEAAEL--YEQELAAAF 127 (251) T ss_pred CCCCCCCCCCCCCCCCCCCCHHHHHHH--HHHHHHHHH T ss_conf 786214776776663226865035899--999988762 No 163 >pfam08727 P3A Poliovirus 3A protein like. This domain is found in positive-strand RNA viruses. The 3A protein is a critical component of the poliovirus replication complex, and is also an inhibitor of host cell ER to Golgi transport. Probab=37.40 E-value=22 Score=14.62 Aligned_cols=30 Identities=20% Similarity=0.526 Sum_probs=20.0 Q ss_pred HHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHH Q ss_conf 2104289999989981893202667864221018 Q gi|254780546|r 169 FLRCRREDIRSFLLQRNISWCEDPSNTDDRFERV 202 (423) Q Consensus 169 LL~~~r~~l~~~~~~~~i~wveDpSN~d~~f~R~ 202 (423) |-.+.-+|+++||+.+| |+- |. .+..-||+ T Consensus 24 L~SVds~EVr~YC~~kg--Wiv-p~-~~~~iEr~ 53 (57) T pfam08727 24 LQSVDSPEVREYCEKKG--WIV-PA-TETTIERN 53 (57) T ss_pred HHHCCCHHHHHHHHHCC--CCC-CC-CCHHHHHH T ss_conf 98658799999999779--444-57-52047766 No 164 >cd07203 cPLA2_Fungal_PLB Fungal Phospholipase B-like; cPLA2 GrpIVA homologs; catalytic domain. Fungal phospholipase B are Group IV cPLA2 homologs. Aspergillus PLA2 is Ca-dependent, yet it does not contain a C2 domain. PLB deacylates both sn-1 and sn-2 chains of phospholipids and are abundantly expressed in fungi. It shows lysophospholipase (lysoPL) and transacylase activities. The active site residues from cPLA2 are also conserved in PLB. Like cPLA2, PLB also has a consensus sequence of Sm-X-Nu-Sm (Sm = small residue, X = any residue and Nu = nucleophile). It includes PLB1 from Schizosaccharomyces pombe, PLB2 from Candida glabrata, and PLB3 from Saccharomyces cerevisiae. PLB1, PLB2, and PLB3 show PLB and lysoPL activities; PLB3 is specific for phosphoinositides. Probab=36.95 E-value=25 Score=14.26 Aligned_cols=14 Identities=43% Similarity=0.513 Sum_probs=11.1 Q ss_pred CEEEEEECCCHHHH Q ss_conf 83999966942489 Q gi|254780546|r 20 AHILVAVSGGSDSM 33 (423) Q Consensus 20 ~~i~vAvSGG~DS~ 33 (423) .+|+||+|||.==. T Consensus 63 P~IgiA~SGGGyRA 76 (552) T cd07203 63 PRIGIAVSGGGYRA 76 (552) T ss_pred CEEEEEECCCCHHH T ss_conf 86999845864999 No 165 >PRK10162 acetyl esterase; Provisional Probab=36.41 E-value=26 Score=14.20 Aligned_cols=14 Identities=7% Similarity=0.135 Sum_probs=7.4 Q ss_pred CCCHHHHHHHHHHH Q ss_conf 04289999989981 Q gi|254780546|r 171 RCRREDIRSFLLQR 184 (423) Q Consensus 171 ~~~r~~l~~~~~~~ 184 (423) .++++.+..|...+ T Consensus 215 ~lt~~~~~~~~~~y 228 (319) T PRK10162 215 GLTQQDLQMYEEAY 228 (319) T ss_pred CCCHHHHHHHHHHH T ss_conf 55899999999985 No 166 >PRK13832 plasmid partitioning protein; Provisional Probab=35.76 E-value=20 Score=15.01 Aligned_cols=24 Identities=29% Similarity=0.642 Sum_probs=19.7 Q ss_pred HHHHHHCCCCCCC---CCCCCCCCCHH Q ss_conf 9899818932026---67864221018 Q gi|254780546|r 179 SFLLQRNISWCED---PSNTDDRFERV 202 (423) Q Consensus 179 ~~~~~~~i~wveD---pSN~d~~f~R~ 202 (423) +.++.+||.|+|| |.++|.+|.-+ T Consensus 214 dlAqayGi~W~EDLFapadeD~RYttd 240 (518) T PRK13832 214 DLAQAYGIVWVEDLFAPADEDSRYTTD 240 (518) T ss_pred HHHHHHCCEEEHHHCCCCCCCCCCCCH T ss_conf 899971855405413776644454322 No 167 >pfam00448 SRP54 SRP54-type protein, GTPase domain. This family includes relatives of the G-domain of the SRP54 family of proteins. Probab=35.70 E-value=26 Score=14.12 Aligned_cols=60 Identities=15% Similarity=0.194 Sum_probs=44.9 Q ss_pred EEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 99996694248999999999998648997299999966779878468999999999871898899997 Q gi|254780546|r 22 ILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSW 89 (423) Q Consensus 22 i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~ 89 (423) .+|+..|=.-.+.++.+++.+.. + +.++..++.|.- |..+ .+..+.+|+.+|+|++.... T Consensus 5 ~lvGptGvGKTTTiaKLAa~~~~-~---~~~V~lit~Dt~-R~gA---~eQL~~ya~~l~v~~~~~~~ 64 (196) T pfam00448 5 LLVGLQGSGKTTTIAKLAAYLKK-Q---GKKVLLVAADTF-RAAA---IEQLKQLAERLGVPVFGSGT 64 (196) T ss_pred EEECCCCCCHHHHHHHHHHHHHH-C---CCEEEEEECCCC-CHHH---HHHHHHHHHHCCCEEEECCC T ss_conf 99899999889999999999997-7---992899975877-6889---99999999863981781487 No 168 >pfam01268 FTHFS Formate--tetrahydrofolate ligase. Probab=35.12 E-value=27 Score=14.06 Aligned_cols=19 Identities=16% Similarity=-0.045 Sum_probs=7.5 Q ss_pred HHHHHHHHHHHHHHCCCCC Q ss_conf 6665668999986236544 Q gi|254780546|r 127 TFDDQLETVYMRSQRDYAE 145 (423) Q Consensus 127 ~~dD~~Et~l~rl~r~sg~ 145 (423) -+|=-+|-|+--.+|.+|+ T Consensus 302 GaDLGaEKF~dIKcr~~gl 320 (555) T pfam01268 302 GADLGAEKFFDIKCRKSGL 320 (555) T ss_pred CCCCCCHHHCCEEHHCCCC T ss_conf 6665623430100220599 No 169 >PRK04165 acetyl-CoA decarbonylase/synthase complex subunit gamma; Provisional Probab=35.11 E-value=27 Score=14.06 Aligned_cols=131 Identities=18% Similarity=0.222 Sum_probs=82.8 Q ss_pred HHHCCCCCEEEEEECCCHHHHHHHHHHHHHH---HHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 9852799839999669424899999999999---8648997299999966779878468999999999871898899997 Q gi|254780546|r 13 VRSLVYPAHILVAVSGGSDSMGLLIALHSVL---SDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSW 89 (423) Q Consensus 13 ~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~---~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~ 89 (423) .+.+..|..++|-||--.|.-.+---++.+. ..+-+..+.+-.+-|-+- +.+...-+..|+..++..++|...... T Consensus 85 ektF~npt~~av~v~D~m~e~ei~~r~~~i~~~~~~rvG~~l~~d~vAvr~~-s~d~~~fa~~vk~va~~~~~pliL~s~ 163 (454) T PRK04165 85 EKTFFNPTGIAVDVSDTMSEEEIDERLKEINNFQFERVGEILKLDAVALRNV-SGDPEKFAKAVKKVAETTELPLILCSF 163 (454) T ss_pred CEEEECCCCEEEEECCCCCHHHHHHHHHHHHHCEEEEECCEEEEEEEEEECC-CCCHHHHHHHHHHHHHHCCCCEEEEEC T ss_conf 1010079826999337777899999999865125677234115208999917-899899999999997216997799838 Q ss_pred ECC---------CCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 504---------787780378885455554320012344201234566656689999862365444 Q gi|254780546|r 90 KNS---------KPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEK 146 (423) Q Consensus 90 ~~~---------~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~ 146 (423) +.. +...-+--.|-...|..+.+.+++.++-..+.|-++++..+.. ..+...|++ T Consensus 164 dp~vl~aaL~~~~~~kPLlyaAt~~N~~~m~~LA~~~~~Pl~V~a~~L~~L~~l~--~~~~~~Gik 227 (454) T PRK04165 164 DPAVLKAALEVVADRRPLLYAATKDNWKEMAELAKEYNCPLVVSAPNLEELKSLV--KKLQAAGIK 227 (454) T ss_pred CHHHHHHHHHHHCCCCCEEEECCHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHH--HHHHHCCCC T ss_conf 9999999999745379755227898899999999874984799779989999999--999976985 No 170 >TIGR01277 thiQ thiamine ABC transporter, ATP-binding protein; InterPro: IPR005968 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Thiamine pyrophosphate (TPP) is a required cofactor synthesized de novo in Salmonella typhimurium. The primary role for TPP is in central metabolism as an electron carrier and nucleophile for such enzymes as pyruvate dehydrogenase (1.2.4.1 from EC), acetolactate synthase (4.1.3.18 from EC), and alpha-ketoglutarate dehydrogenase (1.2.4.2 from EC). Despite its importance in cellular physiology, neither the de novo biosynthetic pathway nor the salvage systems for thiamine are fully understood in any organism. The model describes thiamine ABC transporter, ATP-binding protein, believed to be involved in the specific translocation of thiamine and its phosphoesters across the inner membrane The protein belongs to the larger ABC transport system which consists of at least three components: the inner membrane permease; thiamine binding protein and an ATP-binding subunit. This protein is found so far only in Proteobacteria, and is found in complete genomes only if the ThiB and ThiP subunits are also found. It has been experimentally demonstrated that mutants in the various steps in the de novo synthesis of thiamine and its biologically active form, namely thiamine pyrophosphate can be exogenously supplemented with thiamine, thiamine monophosphate or thiamine pyrophosphate. ; GO: 0005524 ATP binding, 0042626 ATPase activity coupled to transmembrane movement of substances, 0006810 transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=35.03 E-value=24 Score=14.42 Aligned_cols=115 Identities=22% Similarity=0.271 Sum_probs=72.3 Q ss_pred HCCCCCEEE-EEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECC-------------------------------CC Q ss_conf 527998399-9966942489999999999986489972999999667-------------------------------79 Q gi|254780546|r 15 SLVYPAHIL-VAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDH-------------------------------CL 62 (423) Q Consensus 15 ~l~~~~~i~-vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdH-------------------------------gl 62 (423) +++.+++|+ ++=||-.-|| ||-|.+-+.. |..+ +|.+=-+|| || T Consensus 20 ~V~~Ge~VAi~GpSGAGKST-LLnLiAGF~~--PasG-~i~~nd~~~t~~aPy~RPvSMLFQEnNLF~HLTV~~NigLGl 95 (213) T TIGR01277 20 SVEDGERVAILGPSGAGKST-LLNLIAGFLE--PASG-EIKVNDKDHTRLAPYRRPVSMLFQENNLFAHLTVRQNIGLGL 95 (213) T ss_pred CCCCCCEEEEECCCCCCHHH-HHHHHHHHCC--CCCE-EEEECCCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHCCC T ss_conf 13017768887589862788-9877864047--7640-588778012268887777503432210253024888865378 Q ss_pred CCC---HHHHHHHHHHHHHHCCCCEEEEEEECC-------------------------CCCCCCHHHHHHHHHHHHHHHC Q ss_conf 878---468999999999871898899997504-------------------------7877803788854555543200 Q gi|254780546|r 63 RET---AKDEVRYVSDVCSRLRIAHSVVSWKNS-------------------------KPQTGLMAAAREARYALISEHA 114 (423) Q Consensus 63 r~~---s~~e~~~v~~~~~~lgi~~~~~~~~~~-------------------------~~~~~~~~~ar~~r~~~~~~~~ 114 (423) .|. .+.+.+.+.+++...||.-+.-|.... +|-+.+...=|..=..+....| T Consensus 96 ~PgLKLnA~q~ek~~~~A~qvGi~dyl~RLP~~LSGGQrQRVALARClvr~~PIlLLDEPFSALDp~LR~EMLaLv~~lc 175 (213) T TIGR01277 96 KPGLKLNAVQQEKVEDVARQVGIDDYLERLPEELSGGQRQRVALARCLVREKPILLLDEPFSALDPKLREEMLALVKKLC 175 (213) T ss_pred CCCCCCCHHHHHHHHHHHHHCCHHHHHHHCCHHCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHH T ss_conf 86410156778999999973385789872501116733789999886417887300158811226788999999999765 Q ss_pred CCCCCCCHHHHHHHHHHHH Q ss_conf 1234420123456665668 Q gi|254780546|r 115 KTINATLIMTAHTFDDQLE 133 (423) Q Consensus 115 ~~~~~~~l~~ah~~dD~~E 133 (423) .+...+.++..||..|-+. T Consensus 176 ~e~~~TllmVtH~~sda~~ 194 (213) T TIGR01277 176 DEKKLTLLMVTHSLSDAAA 194 (213) T ss_pred HCCCCEEEEEECCHHHHHH T ss_conf 1036468998458899987 No 171 >pfam01188 MR_MLE Mandelate racemase / muconate lactonizing enzyme, C-terminal domain. C-terminal domain is TIM barrel fold, dehydratase-like domain. Manganese is associated with this domain. Probab=34.98 E-value=26 Score=14.16 Aligned_cols=21 Identities=29% Similarity=0.621 Sum_probs=13.2 Q ss_pred HHHHHHHHCCCCCCCCCCCCC Q ss_conf 999899818932026678642 Q gi|254780546|r 177 IRSFLLQRNISWCEDPSNTDD 197 (423) Q Consensus 177 l~~~~~~~~i~wveDpSN~d~ 197 (423) +..-+...++.|+|+|-+.+. T Consensus 64 ~~~~l~~~~~~~~EeP~~~~d 84 (98) T pfam01188 64 LARALEELGLTWIEEPLPADD 84 (98) T ss_pred HHHHHCCCCCEEEECCCCCCC T ss_conf 998732579569979899999 No 172 >pfam00926 DHBP_synthase 3,4-dihydroxy-2-butanone 4-phosphate synthase. 3,4-Dihydroxy-2-butanone 4-phosphate is biosynthesized from ribulose 5-phosphate and serves as the biosynthetic precursor for the xylene ring of riboflavin. Sometimes found as a bifunctional enzyme with pfam00925. Probab=34.94 E-value=27 Score=14.04 Aligned_cols=105 Identities=15% Similarity=0.181 Sum_probs=52.9 Q ss_pred HHHHHHCCCCEE------------EEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHH------------HH Q ss_conf 999987189889------------999750478778037888545555432001234420123456------------66 Q gi|254780546|r 74 SDVCSRLRIAHS------------VVSWKNSKPQTGLMAAAREARYALISEHAKTINATLIMTAHT------------FD 129 (423) Q Consensus 74 ~~~~~~lgi~~~------------~~~~~~~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~------------~d 129 (423) .+.|++||+|+. .+.++.....+++.+.-|..-.+.+++... ...+..--||- .. T Consensus 57 ~~~~~~LgLp~m~~~n~~~~~t~Ftvsvd~~~~~TGISa~DRa~Tir~la~~~~-~~~df~~PGHvfpL~a~~gGvl~R~ 135 (193) T pfam00926 57 EERADRLGLPPMVDRNTDRFGTAFTVSVDAKGTTTGISAADRALTIRALADPNA-FPEDFVRPGHVFPLRAKEGGVLERR 135 (193) T ss_pred HHHHHHCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHCCCC-CCCEEECCCCCCHHHCCCCCEEECC T ss_conf 999976699565555678878875988864888789789999999999848888-8332404896563014369860469 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCCCCC Q ss_conf 5668999986236544433223576410035796871012104289999989981893202 Q gi|254780546|r 130 DQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWISRPFLRCRREDIRSFLLQRNISWCE 190 (423) Q Consensus 130 D~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wve 190 (423) .+.|- ..-|.+.+|..-. +.+...... ++. -.+++++.+|+++++++.++ T Consensus 136 GHTEa-svdL~~lAGl~P~--avi~Eil~~--dG~------~~~~~~~~~fA~~~~lp~is 185 (193) T pfam00926 136 GHTEA-AVDLARLAGLTPA--AVICEILND--DGT------MARLPDLEEFAKEHGLPLIT 185 (193) T ss_pred CHHHH-HHHHHHHCCCCCE--EEEEEEECC--CCC------CCCHHHHHHHHHHCCCCEEE T ss_conf 66899-9999998199963--999998569--988------26889999999983997998 No 173 >PRK05773 3,4-dihydroxy-2-butanone 4-phosphate synthase; Validated Probab=34.84 E-value=27 Score=14.03 Aligned_cols=20 Identities=15% Similarity=0.297 Sum_probs=17.2 Q ss_pred CCCHHHHHHHHHHHCCCCCC Q ss_conf 04289999989981893202 Q gi|254780546|r 171 RCRREDIRSFLLQRNISWCE 190 (423) Q Consensus 171 ~~~r~~l~~~~~~~~i~wve 190 (423) ..++.++..|+++++++.++ T Consensus 189 ~m~~~~~~~fA~~~~l~~is 208 (218) T PRK05773 189 SLSKEEAKRYARELGIPLVE 208 (218) T ss_pred CCCHHHHHHHHHHCCCCEEE T ss_conf 65689999999983997998 No 174 >pfam00582 Usp Universal stress protein family. The universal stress protein UspA is a small cytoplasmic bacterial protein whose expression is enhanced when the cell is exposed to stress agents. UspA enhances the rate of cell survival during prolonged exposure to such conditions, and may provide a general "stress endurance" activity. The crystal structure of Haemophilus influenzae UspA reveals an alpha/beta fold similar to that of the Methanococcus jannaschii MJ0577 protein, which binds ATP, though UspA lacks ATP-binding activity. Probab=34.55 E-value=27 Score=14.00 Aligned_cols=95 Identities=22% Similarity=0.179 Sum_probs=52.7 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCH---------HHHHH--------HHHHHHHHCCC Q ss_conf 83999966942489999999999986489972999999667798784---------68999--------99999987189 Q gi|254780546|r 20 AHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETA---------KDEVR--------YVSDVCSRLRI 82 (423) Q Consensus 20 ~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s---------~~e~~--------~v~~~~~~lgi 82 (423) .+|+|++-|...|...+..+.++.... +.+++++||-.--...+ .++.. .....+..-+. T Consensus 3 k~Ilv~vD~s~~s~~al~~A~~~a~~~---~~~l~llhv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (140) T pfam00582 3 KRILVAVDGSEESERALEWALELAKRR---GAELILLHVIDPEPSGAASEALEEEEEEELEEEEAEAEALAEAAAEAGGV 79 (140) T ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHC---CCEEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 849999859999999999999999864---99799999962887554443330357999999999999999998635798 Q ss_pred CEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH Q ss_conf 889999750478778037888545555432001234420123456665 Q gi|254780546|r 83 AHSVVSWKNSKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDD 130 (423) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD 130 (423) ...+....+ .+. ..+...+.+.+++.++.|.|... T Consensus 80 ~~~~~~~~g-~~~------------~~I~~~~~~~~~dliV~G~~~~~ 114 (140) T pfam00582 80 VVEVVVRGG-DPA------------EAILEVAEEEDADLIVMGSRGRS 114 (140) T ss_pred CEEEEEEEE-CHH------------HHHCEEECCCCCCEEEEECCCCC T ss_conf 679999960-703------------64210000567888999289998 No 175 >COG0363 NagB 6-phosphogluconolactonase/Glucosamine-6-phosphate isomerase/deaminase [Carbohydrate transport and metabolism] Probab=34.45 E-value=27 Score=13.99 Aligned_cols=66 Identities=18% Similarity=0.269 Sum_probs=37.9 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHH--HHHHH-HHHHHCCCCEE Q ss_conf 983999966942489999999999986489972999999667798784689--99999-99987189889 Q gi|254780546|r 19 PAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDE--VRYVS-DVCSRLRIAHS 85 (423) Q Consensus 19 ~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e--~~~v~-~~~~~lgi~~~ 85 (423) ..++.||+|||+==..+...|..... ....--+++.+.+|--.=+.++.+ ..+++ .+.....++-. T Consensus 31 ~~~~~l~LsgGsTP~~~ye~L~~~~~-~~~~w~~v~~f~~DEr~vp~~~~~Sn~~~~~~~l~~~~~~~~~ 99 (238) T COG0363 31 RGRAVLALSGGSTPLALYEALVKLPQ-GQLDWSKVTIFNLDERVVPPDDPESNYGLMRRNLFDHIDIPAE 99 (238) T ss_pred CCCEEEEECCCCCHHHHHHHHHHCCC-CCCCCHHEEEEECCCCCCCCCCCHHHHHHHHHHHHCCCCCCHH T ss_conf 67579997689887999999996325-6898313389916311169988413999999986462567286 No 176 >cd02651 nuc_hydro_IU_UC_XIUA nuc_hydro_IU_UC_XIUA: inosine-uridine preferring, xanthosine-inosine-uridine-adenosine-preferring and, uridine-cytidine preferring nucleoside hydrolases. Nucleoside hydrolases cleave the N-glycosidic bond in nucleosides generating ribose and the respective base. These enzymes vary in their substrate specificity. This group contains proteins similar to nucleoside hydrolases which hydrolyze both pyrimidine and purine ribonucleosides: the inosine-uridine preferring nucleoside hydrolase from Crithidia fasciculata, the inosine-uridine-xanthosine preferring nucleoside hydrolase RihC from Escherichia coli and the xanthosine-inosine-uridine-adenosine-preferring nucleoside hydrolase RihC from Salmonella enterica serovar Typhimurium. This group also contains proteins similar to the pyrimidine-specific uridine-cytidine preferring nucleoside hydrolases URH1 from Saccharomyces cerevisiae, E. coli RihA and E. coli RihB. E. coli RihA is equally efficient with uridine a Probab=34.40 E-value=28 Score=13.98 Aligned_cols=52 Identities=13% Similarity=0.201 Sum_probs=30.6 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCC--CHHHHHHHHHHHHHHCCCCEEE Q ss_conf 9424899999999999864899729999996677987--8468999999999871898899 Q gi|254780546|r 28 GGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRE--TAKDEVRYVSDVCSRLRIAHSV 86 (423) Q Consensus 28 GG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~--~s~~e~~~v~~~~~~lgi~~~~ 86 (423) |.=|.+||++++. . +.++|.++|+-+|--+ .+..-+..+.+.|.+.+||.+. T Consensus 9 G~DDa~Al~~al~-----~--~~iel~gIt~v~GN~~~~~~~~n~~~~l~~~g~~~iPV~~ 62 (302) T cd02651 9 GHDDAVAILLALF-----H--PELDLLGITTVAGNVPLEKTTRNALKLLTLLGRTDVPVAA 62 (302) T ss_pred CHHHHHHHHHHHH-----C--CCCEEEEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCEEE T ss_conf 6799999999977-----8--9986999997978868999999999999984899998763 No 177 >cd03557 L-arabinose_isomerase L-Arabinose isomerase (AI) catalyzes the isomerization of L-arabinose to L-ribulose, the first reaction in its conversion into D-xylulose-5-phosphate, an intermediate in the pentose phosphate pathway, which allows L-arabinose to be used as a carbon source. AI can also convert D-galactose to D-tagatose at elevated temperatures in the presence of divalent metal ions. D-tagatose, rarely found in nature, is of commercial interest as a low-calorie sugar substitute. Probab=34.30 E-value=28 Score=13.97 Aligned_cols=42 Identities=12% Similarity=0.091 Sum_probs=30.3 Q ss_pred ECCHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 101210428999998998189320266786422101899976 Q gi|254780546|r 166 SRPFLRCRREDIRSFLLQRNISWCEDPSNTDDRFERVRVRRF 207 (423) Q Consensus 166 ~RPLL~~~r~~l~~~~~~~~i~wveDpSN~d~~f~R~rlR~~ 207 (423) ..=.-.++.+||.+-..++.-.|--+|+..+....+--+|.+ T Consensus 209 v~~v~~Vsd~ei~al~~ey~~~Y~~~~~~~~~~~~~~~l~~a 250 (484) T cd03557 209 VARVDAVSDSDVDALVDEYEALYDLAPELKDGGERRASLREA 250 (484) T ss_pred HHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHH T ss_conf 999985899999999999986046472224745789999999 No 178 >PRK00414 gmhA phosphoheptose isomerase; Reviewed Probab=33.80 E-value=28 Score=13.92 Aligned_cols=50 Identities=16% Similarity=0.099 Sum_probs=31.4 Q ss_pred HHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHC--CCCCEEEEEEE Q ss_conf 9999999852799839999669424899999999999864--89972999999 Q gi|254780546|r 7 ESVRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDR--SFGKIKFSAIS 57 (423) Q Consensus 7 ~~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~--~~~~~~l~a~~ 57 (423) +++....+.+..+.+|.++=-||+-+.| -|++.++..+- ..+++...+++ T Consensus 32 ~aa~~i~~~~~~g~kI~~cGNGGSaa~A-~Hfa~dl~~~~~~~r~~l~ai~Ls 83 (192) T PRK00414 32 RAAVLIADSFKAGGKVLSCGNGGSHCDA-MHFAEELTGRYRENRPGYPAIAIS 83 (192) T ss_pred HHHHHHHHHHHCCCEEEEEECCHHHHHH-HHHHHHHHCCCCCCCCCCCEEECC T ss_conf 9999999999879979999685889999-999998746314466654325327 No 179 >cd00147 cPLA2_like Cytosolic phospholipase A2, catalytic domain; hydrolyses arachidonyl phospholipids. Catalytic domain of cytosolic phospholipase A2 (PLA2; EC 3.1.1.4) hydrolyzes the sn-2-acyl ester bond of phospholipids to release arachidonic acid. At the active site, cPLA2 contains a serine nucleophile through which the catalytic mechanism is initiated. The active site is partially covered by a solvent-accessible flexible lid. cPLA2 displays interfacial activation as it exists in both "closed lid" and "open lid" forms. Movement of the cPLA2 lid possibly exposes a greater hydrophobic surface and the active site. cPLA2 belongs to the alpha-beta hydrolase family which is identified by a characteristic nucleophile elbow with a consensus sequence of Sm-X-Nu-Sm (Sm = small residue, X = any residue and Nu = nucleophile). Calcium is required for cPLA2 to bind with membranes or phospholipids. Group IV cPLA2 includes six intercellular enzymes: cPLA2alpha, cPLA2beta, cPLA2gamma, cPLA2delta, cP Probab=33.64 E-value=28 Score=13.90 Aligned_cols=25 Identities=20% Similarity=0.158 Sum_probs=15.6 Q ss_pred HHHHHHHHHC-----CCCCEEEEEECCCHH Q ss_conf 9999999852-----799839999669424 Q gi|254780546|r 7 ESVRFFVRSL-----VYPAHILVAVSGGSD 31 (423) Q Consensus 7 ~~~~~~~~~l-----~~~~~i~vAvSGG~D 31 (423) .+.++|++.- .+-.+|+||+|||.= T Consensus 24 ~aL~~~L~~~~~~~~~~~P~Iaia~SGGGy 53 (438) T cd00147 24 KALKKFLGLENDLNPDEVPVIAILGSGGGY 53 (438) T ss_pred HHHHHHHHHCCCCCCCCCCEEEEEECCCCH T ss_conf 999999734046886558758998458548 No 180 >PTZ00249 variable surface protein Vir28; Provisional Probab=33.56 E-value=25 Score=14.25 Aligned_cols=14 Identities=14% Similarity=0.102 Sum_probs=5.6 Q ss_pred CCCCCCCCCCCCCC Q ss_conf 93202667864221 Q gi|254780546|r 186 ISWCEDPSNTDDRF 199 (423) Q Consensus 186 i~wveDpSN~d~~f 199 (423) +....|+++....+ T Consensus 147 lk~~~~~~~~C~~i 160 (516) T PTZ00249 147 LKESKDSSTLCRNI 160 (516) T ss_pred CCCCCCCCCCCCCH T ss_conf 05666677742109 No 181 >pfam10829 DUF2554 Protein of unknown function (DUF2554). This family of proteins with unknown function appears to be restricted to Enterobacteriaceae. Probab=33.55 E-value=28 Score=13.89 Aligned_cols=24 Identities=33% Similarity=0.427 Sum_probs=16.6 Q ss_pred EEEEEEECCCCCCCHHHH-HHHHHH Q ss_conf 999999667798784689-999999 Q gi|254780546|r 52 KFSAISVDHCLRETAKDE-VRYVSD 75 (423) Q Consensus 52 ~l~a~~vdHglr~~s~~e-~~~v~~ 75 (423) -+.+-++||+||.+++-+ ..-+.. T Consensus 29 ~lwvk~~dhqLrHeADSDELR~~aE 53 (76) T pfam10829 29 FVWVKNVDHQLRHEADSDELRAVAE 53 (76) T ss_pred EEEECCHHHHHHHCCCHHHHHHHHH T ss_conf 6865151288773026199999999 No 182 >PRK09762 galactosamine-6-phosphate isomerase; Provisional Probab=33.49 E-value=28 Score=13.88 Aligned_cols=74 Identities=8% Similarity=0.150 Sum_probs=46.5 Q ss_pred HHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECC--CCCCCHHH-HHHHH-HHHHHHCCCCEE Q ss_conf 9985279983999966942489999999999986489972999999667--79878468-99999-999987189889 Q gi|254780546|r 12 FVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDH--CLRETAKD-EVRYV-SDVCSRLRIAHS 85 (423) Q Consensus 12 ~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdH--glr~~s~~-e~~~v-~~~~~~lgi~~~ 85 (423) ....+..+.+..+|+|||.-=..+...|.+........--+++.+++|- |+.++... -..+. +.+...++|+-. T Consensus 20 i~~~i~~k~~~~l~latG~TP~~~Y~~L~~~~~~~~l~~~~v~~f~lDEy~glp~~~~~S~~~~l~~~l~~~v~i~~~ 97 (236) T PRK09762 20 LLAVIRSKPDAVICLATGATPLLTYHYLVEKIHQQQVDVSQLTFVKLDEWVDLPLTMPGTCETFLQQHIVQPLGLRED 97 (236) T ss_pred HHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHHCCCCHHHEEEEECEEECCCCCCCCHHHHHHHHHHHHCCCCCCHH T ss_conf 999998789929995898898999999999997179984668999252754888986042899999972454699799 No 183 >PRK12358 putative 6-phosphogluconolactonase; Provisional Probab=33.41 E-value=28 Score=13.87 Aligned_cols=68 Identities=10% Similarity=0.098 Sum_probs=42.0 Q ss_pred HHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHH Q ss_conf 9999852799839999669424899999999999864899729999996677987846899999999987 Q gi|254780546|r 10 RFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSR 79 (423) Q Consensus 10 ~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~ 79 (423) ......+....+..+|+|||+==..+...|.+..+.. .+--+++.++.|- +-...++....+..+.++ T Consensus 18 ~~i~~~i~~~~~~~l~LatGsTP~~~y~~L~~~~~~~-~~~~~v~~f~lDE-~~~~~~~~~~~~~~~l~~ 85 (239) T PRK12358 18 HHLLGYMSKTKRVNLAITAGSTPKGMYEYLTTLVKGK-AWYDNCHYYNFDE-IPFRGKEGEGVTITNLRN 85 (239) T ss_pred HHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHCC-CCCCEEEEEEEEE-ECCCCCCCHHHHHHHHHH T ss_conf 9999999748994899799879899999999996257-8842179996105-558985553569999999 No 184 >PTZ00285 glucosamine-6-phosphate isomerase; Provisional Probab=33.13 E-value=29 Score=13.84 Aligned_cols=91 Identities=10% Similarity=0.092 Sum_probs=52.5 Q ss_pred CCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCC--CCCCHHHH-HH-HHHHHHHHCCCCEEEE-EEEC Q ss_conf 799839999669424899999999999864899729999996677--98784689-99-9999998718988999-9750 Q gi|254780546|r 17 VYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHC--LRETAKDE-VR-YVSDVCSRLRIAHSVV-SWKN 91 (423) Q Consensus 17 ~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHg--lr~~s~~e-~~-~v~~~~~~lgi~~~~~-~~~~ 91 (423) .....+.||+|||.==..+...|.+........--+++.+++|-- +.++..+- .. .-+.+...++||..-+ ..+. T Consensus 30 ~~~~~~~i~LsgG~tP~~~y~~L~~~~~~~~i~~~~v~~f~~DEy~gl~~~~~~S~~~~~~~~l~~~v~ip~~~i~~~~~ 109 (253) T PTZ00285 30 TSDRPFVLGLPTGSTPLPTYQELIRAYREGRVSFSNVVTFNMDEYVGLPRDHPQSYHYFMKENFFDHVDIKEENRHILNG 109 (253) T ss_pred CCCCCEEEEECCCCCHHHHHHHHHHHHHCCCCCHHHEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCHHHEECCCC T ss_conf 03998299969983499999999999861599857959973632227898884549999999987417997998777899 Q ss_pred CCCCCCCHHHHHHHHHHHHH Q ss_conf 47877803788854555543 Q gi|254780546|r 92 SKPQTGLMAAAREARYALIS 111 (423) Q Consensus 92 ~~~~~~~~~~ar~~r~~~~~ 111 (423) ...+.++.| .+|.... T Consensus 110 --~~~d~~~~~--~~ye~~i 125 (253) T PTZ00285 110 --TAPDLEEEC--RRYEEKI 125 (253) T ss_pred --CCCCHHHHH--HHHHHHH T ss_conf --987999999--9999999 No 185 >cd01582 Homoaconitase Homoaconitase and other uncharacterized proteins of the Aconitase family. Homoaconitase catalytic domain. Homoaconitase and other uncharacterized proteins of the Aconitase family. Homoaconitase is part of an unusual lysine biosynthesis pathway found only in filamentous fungi, in which lysine is synthesized via the alpha-aminoadipate pathway. In this pathway, homoaconitase catalyzes the conversion of cis-homoaconitic acid into homoisocitric acid. The reaction mechanism is believed to be similar to that of other aconitases. Probab=33.10 E-value=29 Score=13.84 Aligned_cols=34 Identities=9% Similarity=0.205 Sum_probs=25.8 Q ss_pred EEEEEEECCCCCCCH---HHHHHHHHHHHHHCCCCEE Q ss_conf 999999667798784---6899999999987189889 Q gi|254780546|r 52 KFSAISVDHCLRETA---KDEVRYVSDVCSRLRIAHS 85 (423) Q Consensus 52 ~l~a~~vdHglr~~s---~~e~~~v~~~~~~lgi~~~ 85 (423) +-.++++||..-..+ .+...+++++|+++||.++ T Consensus 27 ~~~~~~~DH~~p~~~~~~~~~~~~~~~~a~~~gi~~~ 63 (363) T cd01582 27 DQIVMTLDHDVQNKSEKNLKKYKNIESFAKKHGIDFY 63 (363) T ss_pred CCEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCEE T ss_conf 8778971889999973409999999999998299766 No 186 >TIGR00269 TIGR00269 conserved hypothetical protein TIGR00269; InterPro: IPR000541 The following uncharacterised proteins have been shown to share regions of similarities, yeast chromosome VII hypothetical protein YGL211w; Dictyostelium discoideum (Slime mold) protein veg136; and Methanococcus jannaschii hypothetical proteins MJ1157 and MJ1478.. Probab=32.82 E-value=11 Score=16.67 Aligned_cols=50 Identities=16% Similarity=0.322 Sum_probs=44.5 Q ss_pred EECCHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 71012104289999989981893202667864221018999764345310 Q gi|254780546|r 165 ISRPFLRCRREDIRSFLLQRNISWCEDPSNTDDRFERVRVRRFVRDIDLH 214 (423) Q Consensus 165 i~RPLL~~~r~~l~~~~~~~~i~wveDpSN~d~~f~R~rlR~~l~~l~~~ 214 (423) -|.||=.++.+|+.=|+..+++++--|+.--..---|.++|.+|..+.++ T Consensus 3 rIKPL~~iPE~EV~lY~~~~~l~~~~~~CPYS~~S~R~~~~dfL~~le~~ 52 (106) T TIGR00269 3 RIKPLREIPEKEVKLYALLNELKVHLDECPYSSLSVRARIRDFLKDLEEK 52 (106) T ss_pred CCCCCCCCCCCEEEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHC T ss_conf 44688656711044554430451031688889840358899999986303 No 187 >cd00958 DhnA Class I fructose-1,6-bisphosphate (FBP) aldolases of the archaeal type (DhnA homologs) found in bacteria and archaea. Catalysis of the enzymes proceeds via a Schiff-base mechanism like other class I aldolases, although this subfamily is clearly divergent based on sequence similarity to other class I and class II (metal dependent) aldolase subfamilies. Probab=32.71 E-value=29 Score=13.80 Aligned_cols=40 Identities=10% Similarity=0.081 Sum_probs=21.5 Q ss_pred EEEEEEEECCCCCCCHH--HHHHHHHHHHHHCCCCEEEEEEE Q ss_conf 29999996677987846--89999999998718988999975 Q gi|254780546|r 51 IKFSAISVDHCLRETAK--DEVRYVSDVCSRLRIAHSVVSWK 90 (423) Q Consensus 51 ~~l~a~~vdHglr~~s~--~e~~~v~~~~~~lgi~~~~~~~~ 90 (423) ..-+.+||+.|...|.. ++...+.+.|.++|+|..+..+. T Consensus 90 AdaV~~~v~~Gs~~E~~~l~~l~~v~~ea~~~G~Pll~~~yp 131 (235) T cd00958 90 ADAVGVTVYVGSEEEREMLEELARVAAEAHKYGLPLIAWMYP 131 (235) T ss_pred CCEEEEEEECCCCHHHHHHHHHHHHHHHHHHHCCCEEEEEEC T ss_conf 567999986898168999999999999999839978999741 No 188 >pfam01890 CbiG_C Cobalamin synthesis G C-terminus. Members of this family are involved in cobalamin synthesis. A member from Synechocystis sp. PCC 6803 has been designated cbiH but in fact represents a fusion between cbiH and cbiG. As other multi-functional proteins involved in cobalamin biosynthesis catalyse adjacent steps in the pathway, including CysG, CobL (CbiET), CobIJ and CobA-HemD, it is therefore possible that CbiG catalyses a reaction step adjacent to CbiH. In the anaerobic pathway such a step could be the formation of a gamma lactone, which is thought to help to mediate the anaerobic ring contraction process. Within the cobalamin synthesis pathway CbiG catalyses the both the opening of the lactone ring and the extrusion of the two-carbon fragment of cobalt-precorrin-5A from C-20 and its associated methyl group (deacylation) to give cobalt-precorrin-5B. This family is the C-terminal region, and the mid- and N-termival parts are conserved independently in other families. Probab=31.81 E-value=30 Score=13.70 Aligned_cols=67 Identities=10% Similarity=0.152 Sum_probs=47.1 Q ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEE-EEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCC Q ss_conf 399996694248999999999998648997299-9999667798784689999999998718988999975047 Q gi|254780546|r 21 HILVAVSGGSDSMGLLIALHSVLSDRSFGKIKF-SAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSK 93 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l-~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~ 93 (423) -++|++.-|.+.-.+..++.+...+.......+ ...++|-. .+..-..++|+.+|+|+.....+..+ T Consensus 3 v~GiGcrrg~~~~~i~~~i~~~l~~~~l~~~~i~~iasid~K------~~E~gl~~~a~~l~~pl~~~~~e~L~ 70 (121) T pfam01890 3 VLGIGCRRGTSAEEIEAAIEEALAEAGLSPEAVAAIATIDLK------ADEPGLLELAARLGVPLRFFSAEELA 70 (121) T ss_pred EEEECCCCCCCHHHHHHHHHHHHHHCCCCHHHCCEEEEEECC------CCCHHHHHHHHHHCCCEEEECHHHHH T ss_conf 998363899799999999999999859997884056863214------79889999999929986997899996 No 189 >PRK08813 threonine dehydratase; Provisional Probab=31.19 E-value=31 Score=13.63 Aligned_cols=23 Identities=13% Similarity=0.276 Sum_probs=18.5 Q ss_pred CEEEEEECCCHHHHHHHHHHHHH Q ss_conf 83999966942489999999999 Q gi|254780546|r 20 AHILVAVSGGSDSMGLLIALHSV 42 (423) Q Consensus 20 ~~i~vAvSGG~DS~aLl~ll~~~ 42 (423) .+=+|+.|.|--..++++.++.+ T Consensus 81 ~~GVV~aSaGNHaqavA~aA~~~ 103 (349) T PRK08813 81 ERPVICASAGNHAQGVAWSAYRL 103 (349) T ss_pred CCCEEEECCCHHHHHHHHHHHHC T ss_conf 78679988648999999999986 No 190 >PRK09197 fructose-bisphosphate aldolase; Provisional Probab=30.83 E-value=31 Score=13.59 Aligned_cols=71 Identities=13% Similarity=0.084 Sum_probs=35.4 Q ss_pred CCCCEEEEEECCC-HH------------------HHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHH----H Q ss_conf 7998399996694-24------------------89999999999986489972999999667798784689999----9 Q gi|254780546|r 17 VYPAHILVAVSGG-SD------------------SMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRY----V 73 (423) Q Consensus 17 ~~~~~i~vAvSGG-~D------------------S~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~----v 73 (423) ...+.|+|.+|=| .. ..+++..++.+....+.| +++|.|||-... -+|... . T Consensus 43 e~~sPvIiq~S~g~~~y~~G~~v~~~~~~~~v~gai~~a~~v~~~A~~~~VP----VaLHLDH~~~~~-l~~i~~~i~~~ 117 (349) T PRK09197 43 KAKSPVIIQFSNGGAAFIAGKGVKDDVPGAAVLGAIAGAKHVHQVAEHYGVP----VILHTDHCAKKL-LPWIDGLLDAG 117 (349) T ss_pred HHCCCEEEEECCCHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCC----EEEECCCCCCCC-CHHHHHHHHHH T ss_conf 9788999991603566615755321005677664899999999999867998----899788887533-01678999998 Q ss_pred HHHHHHCCCC-EEEEEEECC Q ss_conf 9999871898-899997504 Q gi|254780546|r 74 SDVCSRLRIA-HSVVSWKNS 92 (423) Q Consensus 74 ~~~~~~lgi~-~~~~~~~~~ 92 (423) .+.+..-+-+ |.++=+|.. T Consensus 118 ~~~~~~g~~~~FtSVMiDgS 137 (349) T PRK09197 118 EKHFAAGGKPLFSSHMIDLS 137 (349) T ss_pred HHHHHHCCCCCCCCCEECCC T ss_conf 99998536667864233185 No 191 >PRK10264 hydrogenase 1 maturation protease; Provisional Probab=30.64 E-value=32 Score=13.57 Aligned_cols=155 Identities=15% Similarity=0.166 Sum_probs=84.3 Q ss_pred HHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 99999985279983999966942489999999999986489972999999667798784689999999998718988999 Q gi|254780546|r 8 SVRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVV 87 (423) Q Consensus 8 ~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~ 87 (423) +++...+....|+.|-+ +-||--++-|++.+.++ ..=+.+=.||.|..|+ ++. T Consensus 25 ~v~~L~~~~~~p~~V~l-iDGGT~G~~Ll~~le~~-------d~LIvvDAV~~g~~PG-------------------tv~ 77 (195) T PRK10264 25 VAERLYAHYHWPEEVEI-VDGGTQGLNLLGYVESA-------SHLLILDAIDYGLEPG-------------------TLR 77 (195) T ss_pred HHHHHHHHCCCCCCCEE-EECCCCHHHHHHHHHCC-------CEEEEEECCCCCCCCC-------------------EEE T ss_conf 99999985689988289-84453199999998429-------9799984653799997-------------------599 Q ss_pred EEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEEC Q ss_conf 97504787780378885455554320012344201234566656689999862365444332235764100357968710 Q gi|254780546|r 88 SWKNSKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWISR 167 (423) Q Consensus 88 ~~~~~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~R 167 (423) +++.+.- .. ..+ .-..--|+-.-.|+..+--+.|.-+.-..+.|+.|......++-+ + T Consensus 78 ~~~~dev----p~---------------~~~--~~kmS~Hqvgl~evL~~a~L~G~~P~~~~LiGvqP~~le~~g~~L-t 135 (195) T PRK10264 78 TYAGERI----PA---------------YLS--AKKMSLHQNSFSEVLALADIRGHLPAHIALVGLQPAMLDDYGGSL-S 135 (195) T ss_pred EEECCCC----CH---------------HHC--CCCCCCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCEEECCCCCC-C T ss_conf 9714304----21---------------217--577786404799999999954999874999843102330578988-9 Q ss_pred CH----HCCCHHHHHHHHHHHCCCCCC--------CCCCCCCCCCHHHHHHHHHHH Q ss_conf 12----104289999989981893202--------667864221018999764345 Q gi|254780546|r 168 PF----LRCRREDIRSFLLQRNISWCE--------DPSNTDDRFERVRVRRFVRDI 211 (423) Q Consensus 168 PL----L~~~r~~l~~~~~~~~i~wve--------DpSN~d~~f~R~rlR~~l~~l 211 (423) |- +.---+.+.+.+++.|+.-.- +++=.-.+|+..|+|+.-..+ T Consensus 136 p~V~a~ip~av~~~l~~L~~wGv~~~p~~~~~~l~~~~l~~~~ye~~r~~~~~~~~ 191 (195) T PRK10264 136 ELAREQLPAAEQAALAQLAAWGIVPQPANESRCLNYDCLSMENYEGVRLRQYRMRL 191 (195) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHEEC T ss_conf 89998669999999999998099876677444544777778766467688774010 No 192 >PRK00955 hypothetical protein; Provisional Probab=30.43 E-value=32 Score=13.54 Aligned_cols=108 Identities=19% Similarity=0.177 Sum_probs=48.0 Q ss_pred CCCEEEEEECCC-HHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHC--CCCEEEEEEECCCC Q ss_conf 998399996694-248999999999998648997299999966779878468999999999871--89889999750478 Q gi|254780546|r 18 YPAHILVAVSGG-SDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRL--RIAHSVVSWKNSKP 94 (423) Q Consensus 18 ~~~~i~vAvSGG-~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~l--gi~~~~~~~~~~~~ 94 (423) +..|+..+||+| .|||+-=+-+.. +.+.. .-+-=-=-+|.||+-+- ..++ +.+++. ++|..+ T Consensus 64 G~PrLffgVsaGn~DSMVn~YTa~k--K~R~~---DaYtPgG~~g~RPDRA~-ivYt-n~ir~~f~~vPiii-------- 128 (599) T PRK00955 64 GKPRLFFLVSAGNMDSMVNHYTAAK--KLRSK---DAYSPGGKSGLRPDRAT-IVYC-NKIKEAYPDVPIII-------- 128 (599) T ss_pred CCCCEEEEECCCCHHHHHHHCCCCC--CCCCC---CCCCCCCCCCCCCCCHH-HHHH-HHHHHHCCCCCEEE-------- T ss_conf 8985787764654878887505544--54554---58797997888988036-7999-99999779999896-------- Q ss_pred CCCCHHHHHH-HHHHHHHHHCC-----CCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 7780378885-45555432001-----234420123456665668999986236 Q gi|254780546|r 95 QTGLMAAARE-ARYALISEHAK-----TINATLIMTAHTFDDQLETVYMRSQRD 142 (423) Q Consensus 95 ~~~~~~~ar~-~r~~~~~~~~~-----~~~~~~l~~ah~~dD~~Et~l~rl~r~ 142 (423) ++||+.-|+ +-|.+..+..+ ..+++.++.|---. ++-.+.-++..| T Consensus 129 -GGIEASLRR~aHYDyWsdkvRrSIL~DskADlLvYGMgE~-~i~eiA~~L~~G 180 (599) T PRK00955 129 -GGIEASLRRFAHYDYWSDKVRRSILLDSGADLLIYGMGEK-PIVEIARRLKSG 180 (599) T ss_pred -CCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCHH-HHHHHHHHHHCC T ss_conf -2247877766554566654213321167888776446278-999999999779 No 193 >cd03326 MR_like_1 Mandelate racemase (MR)-like subfamily of the enolase superfamily, subgroup 1. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and conserved catalytic residues, a Lys-X-Lys motif and a conserved histidine-aspartate dyad. This subgroup's function is unknown. Probab=29.97 E-value=32 Score=13.49 Aligned_cols=34 Identities=15% Similarity=0.252 Sum_probs=24.0 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 9999899818932026678642210189997643 Q gi|254780546|r 176 DIRSFLLQRNISWCEDPSNTDDRFERVRVRRFVR 209 (423) Q Consensus 176 ~l~~~~~~~~i~wveDpSN~d~~f~R~rlR~~l~ 209 (423) ++-.-+..+++.|+|||-+.++--.-.++|+.+. T Consensus 223 ~~~~~le~~~l~w~EeP~~~~d~~~~a~l~~~~~ 256 (385) T cd03326 223 AYAKALAPYGLRWYEEPGDPLDYALQAELADHYD 256 (385) T ss_pred HHHHHHHHCCCCEEECCCCCCCHHHHHHHHHHCC T ss_conf 9999855329918978899558999999996389 No 194 >TIGR03030 CelA cellulose synthase catalytic subunit (UDP-forming). Cellulose synthase catalyzes the beta-1,4 polymerization of glucose residues in the formation of cellulose. In bacteria, the substrate is UDP-glucose. The synthase consists of two subunits (or domains in the frequent cases where it is encoded as a single polypeptide), the catalytic domain modelled here and the regulatory domain (pfam03170). The regulatory domain binds the allosteric activator cyclic di-GMP. The protein is membrane-associated and probably assembles into multimers such that the individual cellulose strands can self-assemble into multi-strand fibrils. Probab=29.97 E-value=32 Score=13.49 Aligned_cols=81 Identities=9% Similarity=0.088 Sum_probs=41.7 Q ss_pred EEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCH-----------HHHHHHHHHHHHHCCCCEEEEEEE Q ss_conf 999966942489999999999986489972999999667798784-----------689999999998718988999975 Q gi|254780546|r 22 ILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETA-----------KDEVRYVSDVCSRLRIAHSVVSWK 90 (423) Q Consensus 22 i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s-----------~~e~~~v~~~~~~lgi~~~~~~~~ 90 (423) |.|..=.-+-++.---+..... -..|+.++++...|-|-..+. ...-..++++|+++|+.+....-. T Consensus 135 V~IPtYNEp~~vv~~Tl~aa~~--ldYP~~kl~V~vLDDG~td~~~~~~~~~~~~~~~rR~e~~~la~~lGv~YitR~~n 212 (713) T TIGR03030 135 VFIPTYNEDLEIVATTVLAAKN--MDYPADKFRVWILDDGGTDQKRNDPDPEQAEAAQRREELKEFCRKLGVNYITRPRN 212 (713) T ss_pred EEECCCCCCHHHHHHHHHHHHH--CCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCEEEECCCC T ss_conf 8964899998999999999986--89998546699997989710023430123333441399999999809865537987 Q ss_pred CCCCCCCCHHHHHH Q ss_conf 04787780378885 Q gi|254780546|r 91 NSKPQTGLMAAARE 104 (423) Q Consensus 91 ~~~~~~~~~~~ar~ 104 (423) .-.+.+|+..+-.. T Consensus 213 ~hAKAGNLN~AL~~ 226 (713) T TIGR03030 213 VHAKAGNINNALKH 226 (713) T ss_pred CCCCCHHHHHHHHH T ss_conf 88761678999873 No 195 >PRK05634 nucleosidase; Provisional Probab=29.80 E-value=33 Score=13.47 Aligned_cols=27 Identities=7% Similarity=0.069 Sum_probs=17.4 Q ss_pred HHCCCHHHHHHHHHHHCCCC-----CCCCCCC Q ss_conf 21042899999899818932-----0266786 Q gi|254780546|r 169 FLRCRREDIRSFLLQRNISW-----CEDPSNT 195 (423) Q Consensus 169 LL~~~r~~l~~~~~~~~i~w-----veDpSN~ 195 (423) +.+.--..|-..|..+|+|| |.|--|. T Consensus 132 ~VDME~~AiA~vc~~~~vPf~~~K~ISD~Ade 163 (188) T PRK05634 132 LCDMEGAAVVGVAKHFGIPVTLLKQVSDSADE 163 (188) T ss_pred EEEEHHHHHHHHHHHCCCCEEEEEEEEECCCC T ss_conf 88405889999999829988999997017887 No 196 >pfam03205 MobB Molybdopterin guanine dinucleotide synthesis protein B. This protein contains a P-loop. Probab=29.69 E-value=33 Score=13.46 Aligned_cols=37 Identities=30% Similarity=0.321 Sum_probs=22.0 Q ss_pred EEECCCHHH--HHHHHHHHHHHHHCCCCCEEEEEEEECCCC Q ss_conf 996694248--999999999998648997299999966779 Q gi|254780546|r 24 VAVSGGSDS--MGLLIALHSVLSDRSFGKIKFSAISVDHCL 62 (423) Q Consensus 24 vAvSGG~DS--~aLl~ll~~~~~~~~~~~~~l~a~~vdHgl 62 (423) |.+-|.+|| +.|+..+..++.+.+.. -+..+|.|||- T Consensus 3 v~i~G~~~sGKttl~~~L~~~~~~~g~~--~~~~~~~d~gq 41 (122) T pfam03205 3 VLVVGPKDSGKTTLIRKLLNYLKRRGYR--VAVVKHLDHGQ 41 (122) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHCCCC--EEEEEECCCCC T ss_conf 9999489998999999999999987994--48999899998 No 197 >PRK07008 acyl-CoA synthetase; Validated Probab=29.44 E-value=33 Score=13.43 Aligned_cols=47 Identities=4% Similarity=-0.057 Sum_probs=31.3 Q ss_pred HHHHHH--HCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCC Q ss_conf 999998--52799839999669424899999999999864899729999996677987 Q gi|254780546|r 9 VRFFVR--SLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRE 64 (423) Q Consensus 9 ~~~~~~--~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~ 64 (423) +...|. .+.++++|+|-..=+.+..++.+.+... ..+++.+|..+.+ T Consensus 52 lA~~L~~~Gv~~GdrVai~~~n~~e~~~~~~a~~~~---------Gav~vpl~~~~~~ 100 (539) T PRK07008 52 LAQALAALGVEPGDRVGTLAWNGYRHLEAYYGVSGS---------GAVCHTINPRLFP 100 (539) T ss_pred HHHHHHHCCCCCCCEEEEECCCCHHHHHHHHHHHHH---------CEEEEEECCCCCH T ss_conf 999999769688699999878879999999999984---------9699844899999 No 198 >TIGR01735 FGAM_synt phosphoribosylformylglycinamidine synthase; InterPro: IPR010073 This entry represents a single-molecule form of phosphoribosylformylglycinamidine synthase, also called FGAM synthase, an enzyme of purine de novo biosynthesis. This form is found mostly in eukaryotes and proteobacteria. In Bacillus subtilis PurL (FGAM synthase II) and PurQ (FGAM synthase I) perform the equivalent function; the unrelated small protein PurS is also required and may be a third subunit .; GO: 0004642 phosphoribosylformylglycinamidine synthase activity, 0006189 'de novo' IMP biosynthetic process. Probab=29.37 E-value=12 Score=16.56 Aligned_cols=127 Identities=15% Similarity=0.089 Sum_probs=71.9 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH Q ss_conf 99966779878468999999999871898899997504787780378885455554320012344201234566656689 Q gi|254780546|r 55 AISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNSKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLET 134 (423) Q Consensus 55 a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et 134 (423) .+.|-.-|.- +.-|....-++|+.-|.+- |-|++ |..||.+...+.-...-...+.+.-.|-=.|+ T Consensus 87 l~~V~PRlgt-~spWSska~~i~~~cGL~~-v~RiE------------Rg~ry~l~~~~~l~~~~~~~~aa~LHDRMTe~ 152 (1401) T TIGR01735 87 LLLVGPRLGT-ISPWSSKATSIARNCGLAK-VDRIE------------RGRRYYLEGKEPLSEEQIAQLAALLHDRMTES 152 (1401) T ss_pred EEEECCCCCC-CCCCHHHHHHHHHHCCCCC-CCEEE------------CCEEEEEECCCCCCHHHHHHHHHHHCCCHHHH T ss_conf 7785277876-4851656899997268885-22665------------11578863057857889999998617812354 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 999862365444332235764100357968710121042899999899818932026678642210189997643 Q gi|254780546|r 135 VYMRSQRDYAEKGMGLSGMCDTILYDLNLWISRPFLRCRREDIRSFLLQRNISWCEDPSNTDDRFERVRVRRFVR 209 (423) Q Consensus 135 ~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wveDpSN~d~~f~R~rlR~~l~ 209 (423) ++-......-.. .++ ...... . =++|+--|+.|++.-.+.||..-+| |..|-+-+-|+.|. T Consensus 153 vl~~~~~~~~lf-----~~~-Ep~~l~--~--v~vl~~GR~ALE~AN~elGLAL~~d----e~DYl~~~fr~~L~ 213 (1401) T TIGR01735 153 VLPHEEEAFELF-----SVP-EPLKLT--T--VDVLGGGRKALEKANQELGLALDED----EIDYLTKRFRQELK 213 (1401) T ss_pred HCCCHHHHHCCC-----CCC-CCCCCC--C--CCHHHHHHHHHHHHHHHHCCCCCHH----HHHHHHHHHHHHHC T ss_conf 346434542011-----325-665775--0--0112444899999989853045756----88999999999837 No 199 >cd03324 rTSbeta_L-fuconate_dehydratase Human rTS beta is encoded by the rTS gene which, through alternative RNA splicing, also encodes rTS alpha whose mRNA is complementary to thymidylate synthase mRNA. rTS beta expression is associated with the production of small molecules that appear to mediate the down-regulation of thymidylate synthase protein by a novel intercellular signaling mechanism. A member of this family, from Xanthomonas, has been characterized to be a L-fuconate dehydratase. rTS beta belongs to the enolase superfamily of enzymes, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion. Probab=29.28 E-value=33 Score=13.41 Aligned_cols=33 Identities=21% Similarity=0.409 Sum_probs=21.9 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 999989981893202667864221018999764 Q gi|254780546|r 176 DIRSFLLQRNISWCEDPSNTDDRFERVRVRRFV 208 (423) Q Consensus 176 ~l~~~~~~~~i~wveDpSN~d~~f~R~rlR~~l 208 (423) ++-.-+..+++.|+|||.+.|.-..-.++|+.+ T Consensus 258 ~~~~~le~~~~~w~EEP~~p~d~~~~~~l~~~~ 290 (415) T cd03324 258 EWVKQLAEFKPWWIEEPTSPDDILGHAAIRKAL 290 (415) T ss_pred HHHHHHHHCCCCEEECCCCCCCHHHHHHHHHHH T ss_conf 999875224977898899955778999999873 No 200 >cd03315 MLE_like Muconate lactonizing enzyme (MLE) like subgroup of the enolase superfamily. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and residues that can function as general acid/base catalysts, a Lys-X-Lys motif and another conserved lysine. Despite these conserved residues, the members of the MLE subgroup, like muconate lactonizing enzyme, o-succinylbenzoate synthase (OSBS) and N-acylamino acid racemase (NAAAR), catalyze different reactions. Probab=28.44 E-value=34 Score=13.31 Aligned_cols=32 Identities=22% Similarity=0.233 Sum_probs=20.4 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 99989981893202667864221018999764 Q gi|254780546|r 177 IRSFLLQRNISWCEDPSNTDDRFERVRVRRFV 208 (423) Q Consensus 177 l~~~~~~~~i~wveDpSN~d~~f~R~rlR~~l 208 (423) +-..+.++++.|+|||-+.++--.=.++|+.. T Consensus 148 ~~~~l~~~~i~w~EeP~~~~d~~~~~~L~~~~ 179 (265) T cd03315 148 ALRALEDLGLDYVEQPLPADDLEGRAALARAT 179 (265) T ss_pred HHHHHCCCCCEEEECCCCCCCHHHHHHHHHCC T ss_conf 99864005854884789989999999998519 No 201 >TIGR02793 nikR nickel-responsive transcriptional regulator NikR; InterPro: IPR014160 The members of this entry from Escherichia coli, Pseudomonas putida, and Brucella melitensis, are found associated with a nickel ABC transporter operon that acts to import nickel for use as a cofactor in urease or hydrogenase. These proteins, with characterised nickel-binding and DNA-binding domains, act as nickel-responsive transcriptional regulators. In the larger family of full-length homologues, most others both lack proximity to the nickel ABC transporter operon and form a separate clade. Several of the homologues not in this entry have been shown to bind nickel, copper, or both, and to regulate genes in response to nickel.. Probab=28.42 E-value=30 Score=13.75 Aligned_cols=21 Identities=0% Similarity=-0.146 Sum_probs=16.2 Q ss_pred HHHCCCCCCCHHHHHHHHHHH Q ss_conf 983589999889999999998 Q gi|254780546|r 274 ICGGQISLPGYRSMERVMLFL 294 (423) Q Consensus 274 ~~~g~~~~p~~~~l~~ll~~l 294 (423) .+.-+.|.=|.+.+.+|++.- T Consensus 18 ~~~~~GYqnRSEAiRDLlR~G 38 (130) T TIGR02793 18 LIATRGYQNRSEAIRDLLRAG 38 (130) T ss_pred HHHHCCCCCHHHHHHHHHHHH T ss_conf 987308898017999999988 No 202 >pfam07942 N2227 N2227-like protein. This family features sequences that are similar to a region of hypothetical yeast gene product N2227. This is thought to be expressed during meiosis and may be involved in the defence response to stressful conditions. Probab=28.05 E-value=35 Score=13.27 Aligned_cols=74 Identities=15% Similarity=0.170 Sum_probs=50.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCH-----HCCCHHHHHHHHHHHCCCCCCCCC Q ss_conf 420123456665668999986236544433223576410035796871012-----104289999989981893202667 Q gi|254780546|r 119 ATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWISRPF-----LRCRREDIRSFLLQRNISWCEDPS 193 (423) Q Consensus 119 ~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~RPL-----L~~~r~~l~~~~~~~~i~wveDpS 193 (423) |-.|=||++.-|..||+.--+.-|.-+--.| |.. +-..|+ +.++-+||.+..+..|...+...+ T Consensus 171 ~FFIDTA~Ni~~Yi~tI~~lLkpgG~WIN~G-----PLl------yh~~~~~~~~siELs~eEi~~l~~~~GF~~~~~~~ 239 (268) T pfam07942 171 CFFIDTAHNVLEYIDTIEKILKPGGHWINLG-----PLL------YHFEPLPDEMSIELSLEDIKRLATKRGFKDEKEET 239 (268) T ss_pred EEEEECHHHHHHHHHHHHHHHCCCCEEEECC-----CCC------CCCCCCCCCCCEECCHHHHHHHHHHCCCEEEEEEE T ss_conf 9876646889999999999836698899646-----701------24577788760006899999999856977998883 Q ss_pred CCCCCCCHHH Q ss_conf 8642210189 Q gi|254780546|r 194 NTDDRFERVR 203 (423) Q Consensus 194 N~d~~f~R~r 203 (423) ..+..|..|. T Consensus 240 ~i~~~Y~~d~ 249 (268) T pfam07942 240 GILNGYTTNY 249 (268) T ss_pred ECCCCCCCCH T ss_conf 0246775798 No 203 >PRK08385 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=27.81 E-value=35 Score=13.24 Aligned_cols=21 Identities=14% Similarity=0.128 Sum_probs=11.7 Q ss_pred HHHHHHHHHHCCCCEEEEEEE Q ss_conf 999999998718988999975 Q gi|254780546|r 70 VRYVSDVCSRLRIAHSVVSWK 90 (423) Q Consensus 70 ~~~v~~~~~~lgi~~~~~~~~ 90 (423) ...+..+++.+|+......-+ T Consensus 47 ~~~a~~i~~~~~~~~~~~~~d 67 (279) T PRK08385 47 VEEARALFEHFGVAVEVKKRD 67 (279) T ss_pred HHHHHHHHHHCCCEEEEECCC T ss_conf 999999999749889999288 No 204 >COG1440 CelA Phosphotransferase system cellobiose-specific component IIB [Carbohydrate transport and metabolism] Probab=27.40 E-value=36 Score=13.19 Aligned_cols=69 Identities=13% Similarity=0.211 Sum_probs=47.2 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCC-----------CCCCHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 839999669424899999999999864899729999996677-----------987846899999999987189889999 Q gi|254780546|r 20 AHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHC-----------LRETAKDEVRYVSDVCSRLRIAHSVVS 88 (423) Q Consensus 20 ~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHg-----------lr~~s~~e~~~v~~~~~~lgi~~~~~~ 88 (423) -+|++.||+|.-|+-|..-.......++ ....+.|+..+-. |-|+-..-...+++.+...|||..++. T Consensus 2 k~IlLvC~aGmSTSlLV~Km~~aA~~kg-~~~~I~A~s~~e~~~~~~~~DvvLlGPQv~y~~~~~~~~~~~~giPV~vI~ 80 (102) T COG1440 2 KKILLVCAAGMSTSLLVTKMKKAAESKG-KDVTIEAYSETELSEYIDNADVVLLGPQVRYMLKQLKEAAEEKGIPVEVID 80 (102) T ss_pred CEEEEEECCCCCHHHHHHHHHHHHHHCC-CCEEEEEECHHHHHHHHHCCCEEEECHHHHHHHHHHHHHHCCCCCCEEEEC T ss_conf 4499984588748899999999999579-965999810657887663089899872899889999998431599767857 Q ss_pred E Q ss_conf 7 Q gi|254780546|r 89 W 89 (423) Q Consensus 89 ~ 89 (423) - T Consensus 81 ~ 81 (102) T COG1440 81 M 81 (102) T ss_pred H T ss_conf 8 No 205 >COG0108 RibB 3,4-dihydroxy-2-butanone 4-phosphate synthase [Coenzyme metabolism] Probab=27.31 E-value=36 Score=13.18 Aligned_cols=105 Identities=15% Similarity=0.167 Sum_probs=52.9 Q ss_pred HHHHHHCCCCEEEEE------------EECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHH------------HH Q ss_conf 999987189889999------------750478778037888545555432001234420123456------------66 Q gi|254780546|r 74 SDVCSRLRIAHSVVS------------WKNSKPQTGLMAAAREARYALISEHAKTINATLIMTAHT------------FD 129 (423) Q Consensus 74 ~~~~~~lgi~~~~~~------------~~~~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~------------~d 129 (423) .+.|++||+|+.+-. ++..+.++++.+.-|..-.+.+.+ ......+..--||- .. T Consensus 62 ~e~~~~L~Lp~Mv~~n~~~~~taFtVsVd~~~t~TGISa~DRa~TIr~l~~-~~~~~~df~~PGHVfpL~A~~ggVl~R~ 140 (203) T COG0108 62 EERAKRLGLPPMVDNNTDAHGTAFTVSVDARETTTGISAADRALTIRALAD-PGAKPSDFRRPGHVFPLRAKDGGVLERR 140 (203) T ss_pred HHHHHHCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHC-CCCCHHHCCCCCCEEEEEECCCCEECCC T ss_conf 999985799500246888888764999804237688588899999999865-7888888489997003650668763148 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCCCCC Q ss_conf 5668999986236544433223576410035796871012104289999989981893202 Q gi|254780546|r 130 DQLETVYMRSQRDYAEKGMGLSGMCDTILYDLNLWISRPFLRCRREDIRSFLLQRNISWCE 190 (423) Q Consensus 130 D~~Et~l~rl~r~sg~~g~~l~~~~~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wve 190 (423) -|.| -..-|.+..|...-+ .+.+.... ++. -.++.++..|++++|+++++ T Consensus 141 GHTE-asVdLarlAGl~Pa~--VicEi~~~--dG~------mar~~~~~~fa~~h~l~~it 190 (203) T COG0108 141 GHTE-AAVDLARLAGLKPAG--VICEIMND--DGT------MARLPELEEFAKEHGLPVIT 190 (203) T ss_pred CHHH-HHHHHHHHCCCCCCE--EEEEEECC--CCC------CCCHHHHHHHHHHCCCCEEE T ss_conf 8079-999999983999737--99998679--866------15738999999983996888 No 206 >PRK13520 L-tyrosine decarboxylase; Provisional Probab=27.25 E-value=36 Score=13.17 Aligned_cols=72 Identities=14% Similarity=0.086 Sum_probs=42.8 Q ss_pred HHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHH-HHHHHHHHCCCCEE Q ss_conf 99999998527998399996694248999999999998648997299999966779878468999-99999987189889 Q gi|254780546|r 7 ESVRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVR-YVSDVCSRLRIAHS 85 (423) Q Consensus 7 ~~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~-~v~~~~~~lgi~~~ 85 (423) +.|+-+.+.+..|+..++-.|||..|.-+...+.+ ......+.++++- ++ ++ -+.+.|.=+|++.. T Consensus 67 ~vv~~~a~l~g~p~~~G~~tsGGTes~l~Al~aaR--~~~~~~~p~iv~~--------~~---aH~s~~Kaa~~~gi~~~ 133 (375) T PRK13520 67 EVVRMLGELLHLPDAYGYITSGGTEANIQAVRAAR--NLADAEKPNIVVP--------ES---AHFSFDKAADMLGLELR 133 (375) T ss_pred HHHHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHH--HHCCCCCCEEEEE--------CC---CHHHHHHHHHHCCCEEE T ss_conf 99999999809998808995618999999999997--6268999879996--------73---16999999997495569 Q ss_pred EEEEEC Q ss_conf 999750 Q gi|254780546|r 86 VVSWKN 91 (423) Q Consensus 86 ~~~~~~ 91 (423) .+..+. T Consensus 134 ~vp~d~ 139 (375) T PRK13520 134 RAPLDE 139 (375) T ss_pred EEEECC T ss_conf 940388 No 207 >pfam01116 F_bP_aldolase Fructose-bisphosphate aldolase class-II. Probab=27.12 E-value=36 Score=13.15 Aligned_cols=98 Identities=13% Similarity=0.094 Sum_probs=47.2 Q ss_pred HHHHHHHHHC-CCCCEEEEEECCCHH----HHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCC Q ss_conf 9999999852-799839999669424----89999999999986489972999999667798784689999999998718 Q gi|254780546|r 7 ESVRFFVRSL-VYPAHILVAVSGGSD----SMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLR 81 (423) Q Consensus 7 ~~~~~~~~~l-~~~~~i~vAvSGG~D----S~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lg 81 (423) +.++..+..- ...+.|++.+|-|.= --.+..++..+......| +++|.|||...+ .-..|-+.| T Consensus 27 e~~~avi~AAee~~sPvIlq~s~~~~~~~g~~~~~~~~~~~a~~~~VP----V~lHLDH~~~~e-------~~~~ai~~G 95 (283) T pfam01116 27 ETINAVLEAAEEANSPVIIQVSPGAAKYAGAEALAAMVRAAAEKYSVP----VALHLDHGASFE-------GILEAIEAG 95 (283) T ss_pred HHHHHHHHHHHHHCCCEEEECCCCHHHHCCHHHHHHHHHHHHHHCCCC----EEEECCCCCCHH-------HHHHHHHHC T ss_conf 999999999999699999996844675589999999999999977998----999667779999-------999999819 Q ss_pred CCEEEEEEECC--CCCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 98899997504--7877803788854555543200123 Q gi|254780546|r 82 IAHSVVSWKNS--KPQTGLMAAAREARYALISEHAKTI 117 (423) Q Consensus 82 i~~~~~~~~~~--~~~~~~~~~ar~~r~~~~~~~~~~~ 117 (423) +. .+=+|.. .-..|+....+...|..-....-+. T Consensus 96 ft--SVM~DgS~l~~eeNi~~T~~vv~~Ah~~gv~VEa 131 (283) T pfam01116 96 FS--SVMIDGSHLPFEENIAITKEVVEYAHARGVSVEA 131 (283) T ss_pred CC--EEEEECCCCCHHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 98--6986389799999999999999999873984899 No 208 >PRK12678 transcription termination factor Rho; Provisional Probab=27.09 E-value=24 Score=14.43 Aligned_cols=47 Identities=15% Similarity=0.085 Sum_probs=25.1 Q ss_pred EEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCC Q ss_conf 9999669424899999999999864899729999996677987846899999999987189 Q gi|254780546|r 22 ILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRI 82 (423) Q Consensus 22 i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi 82 (423) ..-+-+||..+|.|.-| +++++..+..+. -+||+ .+.+..+++.-+- T Consensus 15 ~~~~~~~~L~~mkLpEL-q~lA~~lGIkG~--------s~lRK-----~dLI~aI~e~q~~ 61 (667) T PRK12678 15 AARARSGGLAGMVLPEL-RALAKQLGIKGT--------SGMRK-----GELIAAIKEARGA 61 (667) T ss_pred CCCCCCCCHHHHHHHHH-HHHHHHCCCCCC--------CCCCH-----HHHHHHHHHHHCC T ss_conf 46666787143226999-999998599764--------55788-----9999999997457 No 209 >cd06381 PBP1_iGluR_delta_like N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of an orphan family of delta receptors, GluRdelta1 and GluRdelta2. This CD represents the N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of an orphan family of delta receptors, GluRdelta1 and GluRdelta2. While this N-terminal domain belongs to the periplasmic-binding fold type I superfamily, the glutamate-binding domain of the iGluR is structurally homologous to the periplasmic-binding fold type II. The LIVBP-like domain of iGluRs is thought to play a role in the initial assembly of iGluR subunits, but it is not well understood how this domain is arranged and functions in intact iGluR. Although the delta receptors are a member of the ionotropic glutamate receptor family, they cannot be activated by AMPA, kainate, NMDA, glutamate, or any other ligands. Phylogenetic analysis shows that both GluRdelta1 and GluRalpha2 are more homologous to non-NMDA receptors. G Probab=26.95 E-value=36 Score=13.13 Aligned_cols=72 Identities=11% Similarity=0.133 Sum_probs=47.0 Q ss_pred HHHHHHHHHCCCCC------EEEEE---ECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 99999998527998------39999---6694248999999999998648997299999966779878468999999999 Q gi|254780546|r 7 ESVRFFVRSLVYPA------HILVA---VSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVC 77 (423) Q Consensus 7 ~~~~~~~~~l~~~~------~i~vA---vSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~ 77 (423) .+.+..+..++... ++..- +.- .||-.+-..++++..+ .|.|+- -|-+..-+..++.+| T Consensus 15 ~aF~~Av~~~N~~~~~l~~~~l~~~i~~v~~-~dsF~~~~~~C~l~~~------GV~Aif-----Gp~~~~s~~~vqSic 82 (363) T cd06381 15 EVFAVAVIDLNINEQILQTEKITLSISFIDL-NNHFDAVQEACDLMNQ------GILALV-----TSTGCASAIALQSLT 82 (363) T ss_pred HHHHHHHHHHCCCHHHCCCCCCEEEEEECCC-CCCHHHHHHHHHHHHC------CCEEEE-----CCCCHHHHHHHHHHH T ss_conf 9999999985458022356752468887589-9978999999998744------816998-----788666899999875 Q ss_pred HHCCCCEEEEEEE Q ss_conf 8718988999975 Q gi|254780546|r 78 SRLRIAHSVVSWK 90 (423) Q Consensus 78 ~~lgi~~~~~~~~ 90 (423) ..+.|||...+++ T Consensus 83 ~al~iPhi~~~~~ 95 (363) T cd06381 83 DAMHIPHLFIQRG 95 (363) T ss_pred CCCCCCCEEEECC T ss_conf 1578872786326 No 210 >cd03298 ABC_ThiQ_thiamine_transporter ABC-type thiamine tranport system; part of the binding-protein-dependent transport system tbpA-thiPQ for thiamine and TPP. Probably responsible for the translocation of thiamine across the membrane. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=26.57 E-value=37 Score=13.09 Aligned_cols=24 Identities=21% Similarity=0.141 Sum_probs=12.9 Q ss_pred CCCCCE-EEEEECCCHHHHHHHHHHH Q ss_conf 279983-9999669424899999999 Q gi|254780546|r 16 LVYPAH-ILVAVSGGSDSMGLLIALH 40 (423) Q Consensus 16 l~~~~~-i~vAvSGG~DS~aLl~ll~ 40 (423) +.+++- .+++-||-.-|+. +.++. T Consensus 21 i~~Ge~~~ilGpSGsGKSTL-l~li~ 45 (211) T cd03298 21 FAQGEITAIVGPSGSGKSTL-LNLIA 45 (211) T ss_pred ECCCCEEEEECCCCCHHHHH-HHHHH T ss_conf 88998999999999559999-99997 No 211 >KOG1907 consensus Probab=26.53 E-value=23 Score=14.54 Aligned_cols=17 Identities=29% Similarity=0.694 Sum_probs=14.3 Q ss_pred EEEEEECCCHHHHHHHH Q ss_conf 39999669424899999 Q gi|254780546|r 21 HILVAVSGGSDSMGLLI 37 (423) Q Consensus 21 ~i~vAvSGG~DS~aLl~ 37 (423) .+.||.-||+|||.|.. T Consensus 784 eLgiAIdgGKDSlSMa~ 800 (1320) T KOG1907 784 ELGIAIDGGKDSLSMAM 800 (1320) T ss_pred HHCEEECCCCCCHHHHE T ss_conf 81531037852011000 No 212 >COG3111 Periplasmic protein with OB-fold [Function unknown] Probab=26.37 E-value=37 Score=13.06 Aligned_cols=61 Identities=15% Similarity=0.210 Sum_probs=36.7 Q ss_pred CCCCCCCHHHHHHHHHHHHCCCCCEEECCCEEEEEECCEEEEEECCCCCCCCCCCCCCCEEECCC Q ss_conf 58999988999999999866998456107579998899899984534377552368982320784 Q gi|254780546|r 277 GQISLPGYRSMERVMLFLKSGKRGCVSIGRVVIDRRANFLWITRAVRNLPTLILYPEETTVWDGR 341 (423) Q Consensus 277 g~~~~p~~~~l~~ll~~l~~~~~~~~tl~g~~i~~~~~~l~i~RE~~~~~~~~~~~~~~~~wDgR 341 (423) |.+..|. .....+.+.........+++-|-++++-++..|++|....-....+. .-+|+|+ T Consensus 36 gGf~Gp~-~~~~TV~~Ak~~~Dda~V~l~GnIv~qi~~D~y~FrD~sGeI~VeId---d~~w~g~ 96 (128) T COG3111 36 GGFQGPN-AKVTTVDQAKTLHDDAWVSLEGNIVRQIGDDRYVFRDASGEINVDID---DKVWNGQ 96 (128) T ss_pred CCCCCCC-CCEEEHHHHHCCCCCCEEEEEEEEEEEECCCEEEEECCCCCEEEEEC---CCCCCCC T ss_conf 7621788-54268987533025984999756777607836999758961999816---0113897 No 213 >pfam01156 IU_nuc_hydro Inosine-uridine preferring nucleoside hydrolase. Probab=26.10 E-value=38 Score=13.03 Aligned_cols=54 Identities=17% Similarity=0.221 Sum_probs=33.9 Q ss_pred EEEEEECCCH-HHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCC Q ss_conf 3999966942-4899999999999864899729999996677987846899999999987189 Q gi|254780546|r 21 HILVAVSGGS-DSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRI 82 (423) Q Consensus 21 ~i~vAvSGG~-DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi 82 (423) ||++=.=+|. |.+||+.++. . +.++|.++++-+|-- ...+-+..+..+++.+|- T Consensus 2 kvIiDtD~G~DDa~Al~~al~-----~--~~~~l~git~~~Gn~-~~~~~~~n~~~~l~~~g~ 56 (303) T pfam01156 2 KVILDTDPGIDDALALLLALA-----S--PELDLLGITTVAGNV-PLEKTTRNALRLLELLGR 56 (303) T ss_pred CEEEECCCCHHHHHHHHHHHH-----C--CCCEEEEEEEECCCC-CHHHHHHHHHHHHHHHCC T ss_conf 499989997489999999966-----9--998899999925986-899999999999998389 No 214 >cd00477 FTHFS Formyltetrahydrofolate synthetase (FTHFS) catalyzes the ATP-dependent activation of formate ion via its addition to the N10 position of tetrahydrofolate. FTHFS is a highly expressed key enzyme in both the Wood-Ljungdahl pathway of autotrophic CO2 fixation (acetogenesis) and the glycine synthase/reductase pathways of purinolysis. The key physiological role of this enzyme in acetogens is to catalyze the formylation of tetrahydrofolate, an initial step in the reduction of carbon dioxide and other one-carbon precursors to acetate. In purinolytic organisms, the enzymatic reaction is reversed, liberating formate from 10-formyltetrahydrofolate with concurrent production of ATP. Probab=26.05 E-value=38 Score=13.02 Aligned_cols=21 Identities=14% Similarity=-0.073 Sum_probs=9.8 Q ss_pred HHHHHHHHHHHHHHHHCCCCC Q ss_conf 456665668999986236544 Q gi|254780546|r 125 AHTFDDQLETVYMRSQRDYAE 145 (423) Q Consensus 125 ah~~dD~~Et~l~rl~r~sg~ 145 (423) |.-+|=-+|-|+--.+|.+|. T Consensus 284 GFGaDLGaEKF~dIkcr~~gl 304 (524) T cd00477 284 GFGADLGAEKFFNIKCRYSGL 304 (524) T ss_pred CCCCCCCCHHHHCCCCCCCCC T ss_conf 355465605541130014799 No 215 >cd06368 PBP1_iGluR_non_NMDA_like N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the non-NMDA (N-methyl-d-asparate) subtypes of ionotropic glutamate receptors. N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the non-NMDA (N-methyl-d-asparate) subtypes of ionotropic glutamate receptors. While this N-terminal domain belongs to the periplasmic-binding fold type I superfamily, the glutamate-binding domain of the iGluR is structurally homologous to the periplasmic-binding fold type II. The LIVBP-like domain of iGluRs is thought to play a role in the initial assembly of iGluR subunits, but it is not well understood how this domain is arranged and functions in intact iGluR. Glutamate mediates the majority of excitatory synaptic transmission in the central nervous system via two broad classes of ionotropic receptors, characterized by their response to glutamate agonists: N-methyl-d -aspartate (NMDA) and non-NMDA receptors. NMDA receptors Probab=25.94 E-value=38 Score=13.01 Aligned_cols=75 Identities=21% Similarity=0.337 Sum_probs=48.4 Q ss_pred HHHHHHHHHCCCC------CEEEEEE--CCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 9999999852799------8399996--6942489999999999986489972999999667798784689999999998 Q gi|254780546|r 7 ESVRFFVRSLVYP------AHILVAV--SGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCS 78 (423) Q Consensus 7 ~~~~~~~~~l~~~------~~i~vAv--SGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~ 78 (423) .+.+..+..++.. .++..-+ ---.||-.+...++++..+ .+.|+. -|.+...+..++.+|. T Consensus 15 ~Af~~Av~~vN~~~~~l~~~~L~~~~~~~~~~dsf~~~~~~C~ll~~------gV~AI~-----Gp~~~~~~~~v~si~~ 83 (324) T cd06368 15 LAFRFAIDRINTNEEILAKFTLVPDIDELNTNDSFELTNKACDLLSQ------GVAAIF-----GPSSSSSANTVQSICD 83 (324) T ss_pred HHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCHHHHHHHHHHHHHC------CCEEEE-----CCCCHHHHHHHHHHHH T ss_conf 99999999985687757881899999953368768999999998754------958999-----8897568999999863 Q ss_pred HCCCCEEEEEEECC Q ss_conf 71898899997504 Q gi|254780546|r 79 RLRIAHSVVSWKNS 92 (423) Q Consensus 79 ~lgi~~~~~~~~~~ 92 (423) .+.|||...+|+.. T Consensus 84 ~~~vP~i~~~~~~~ 97 (324) T cd06368 84 ALEIPHITTSWSPN 97 (324) T ss_pred HCCCCEEEECCCCC T ss_conf 32685588227888 No 216 >COG0157 NadC Nicotinate-nucleotide pyrophosphorylase [Coenzyme metabolism] Probab=25.67 E-value=38 Score=12.98 Aligned_cols=12 Identities=17% Similarity=0.196 Sum_probs=6.1 Q ss_pred HHHHHHHHHHCC Q ss_conf 999999998718 Q gi|254780546|r 70 VRYVSDVCSRLR 81 (423) Q Consensus 70 ~~~v~~~~~~lg 81 (423) ...+....+.+| T Consensus 50 ~~~a~~~f~~l~ 61 (280) T COG0157 50 LDVAEEVFELLG 61 (280) T ss_pred HHHHHHHHHHHC T ss_conf 799999999809 No 217 >cd00453 FTBP_aldolase_II Fructose/tagarose-bisphosphate aldolase class II. This family includes fructose-1,6-bisphosphate (FBP) and tagarose 1,6-bisphosphate (TBP) aldolases. FBP-aldolase is homodimeric and used in gluconeogenesis and glycolysis; the enzyme controls the condensation of dihydroxyacetone phosphate with glyceraldehyde-3-phosphate to yield fructose-1,6-bisphosphate. TBP-aldolase is tetrameric and produces tagarose-1,6-bisphosphate. There is an absolute requirement for a divalent metal ion, usually zinc, and in addition the enzymes are activated by monovalent cations such as Na+. Although structurally similar, the class I aldolases use a different mechanism and are believed to have an independent evolutionary origin. Probab=25.35 E-value=39 Score=12.94 Aligned_cols=138 Identities=11% Similarity=0.076 Sum_probs=61.8 Q ss_pred HHHHHHHHHC-CCCCEEEEEECCC-H-------------------HHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCC Q ss_conf 9999999852-7998399996694-2-------------------48999999999998648997299999966779878 Q gi|254780546|r 7 ESVRFFVRSL-VYPAHILVAVSGG-S-------------------DSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRET 65 (423) Q Consensus 7 ~~~~~~~~~l-~~~~~i~vAvSGG-~-------------------DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~ 65 (423) |.+...+..- ...+.|+|.+|-| . ....+..+++.+......| +++|.|||-.. T Consensus 24 E~~~AvleAAee~~SPVIlq~s~g~~~~~~g~~~~~~~~~~~~~~g~~~~~~~v~~~A~~~~VP----VaLHlDH~~~~- 98 (340) T cd00453 24 DSINAVLETAAKVKAPVIVQFSNGGASFIAGKGVKSDVPQGAAILGAISGAHHVHQMAEHYGVP----VILHTDHCAKK- 98 (340) T ss_pred HHHHHHHHHHHHHCCCEEEECCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC----EEEECCCCCCC- T ss_conf 9999999999996999899887668988545003321244555542899999999999866998----89976778875- Q ss_pred HHHHHHHH----HHHHHHCCCC-EEEEEEECCCC--CCCCHHHHHHHHHHHHHHHCCCCCCCCHH---HH---------- Q ss_conf 46899999----9999871898-89999750478--77803788854555543200123442012---34---------- Q gi|254780546|r 66 AKDEVRYV----SDVCSRLRIA-HSVVSWKNSKP--QTGLMAAAREARYALISEHAKTINATLIM---TA---------- 125 (423) Q Consensus 66 s~~e~~~v----~~~~~~lgi~-~~~~~~~~~~~--~~~~~~~ar~~r~~~~~~~~~~~~~~~l~---~a---------- 125 (423) +-+|...+ ++.|...+-+ |.++=+|.... ..|++...+...+..-....-+.....+- .+ T Consensus 99 ~l~~i~~~i~aG~~~~~~~~~~~FsSvMiDgS~~p~eENi~~Tkevve~Ah~~gvsVEaELG~iGG~Edgv~~~~~~~~~ 178 (340) T cd00453 99 LLPWIDGLLDAGEKHFAATGKPLFSSHMIDLSEESLQENIEICSKYLERMSKIGMTLEIELGCTGGEEDGVDNSHMDASA 178 (340) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCC T ss_conf 55889999997188886447889845875389899999999999999999984992899621025766676665554334 Q ss_pred -HHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf -566656689999862365444332 Q gi|254780546|r 126 -HTFDDQLETVYMRSQRDYAEKGMG 149 (423) Q Consensus 126 -h~~dD~~Et~l~rl~r~sg~~g~~ 149 (423) -+.-|++..|.-++-.-++++.+. T Consensus 179 ~yTdPeea~~fv~~tG~~~~vD~LA 203 (340) T cd00453 179 LYTQPEDVDYAYTELSKISPRFTIA 203 (340) T ss_pred CCCCHHHHHHHHHHHCCCCCCCEEE T ss_conf 5589899999999866888861577 No 218 >PRK07313 phosphopantothenoylcysteine decarboxylase; Validated Probab=25.31 E-value=39 Score=12.93 Aligned_cols=35 Identities=20% Similarity=0.135 Sum_probs=25.9 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEE Q ss_conf 983999966942489999999999986489972999999 Q gi|254780546|r 19 PAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAIS 57 (423) Q Consensus 19 ~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~ 57 (423) ..||++++|||.-..=...++..+.+. +.+|.++- T Consensus 2 ~K~IllgvtGsIAayK~~~lir~L~k~----g~~V~vi~ 36 (180) T PRK07313 2 TKKILLAVSGSIAAYKAADLTSQLTKI----GYDVTVIM 36 (180) T ss_pred CCEEEEEEECHHHHHHHHHHHHHHHHC----CCEEEEEE T ss_conf 976999995499999999999999988----99599997 No 219 >pfam02441 Flavoprotein Flavoprotein. This family contains diverse flavoprotein enzymes. This family includes epidermin biosynthesis protein, EpiD, which has been shown to be a flavoprotein that binds FMN. This enzyme catalyses the removal of two reducing equivalents from the cysteine residue of the C-terminal meso-lanthionine of epidermin to form a --C==C-- double bond. This family also includes the B chain of dipicolinate synthase a small polar molecule that accumulates to high concentrations in bacterial endospores, and is thought to play a role in spore heat resistance, or the maintenance of heat resistance. dipicolinate synthase catalyses the formation of dipicolinic acid from dihydroxydipicolinic acid. This family also includes phenylacrylic acid decarboxylase (EC:4.1.1.-). Probab=24.88 E-value=39 Score=12.88 Aligned_cols=33 Identities=21% Similarity=0.333 Sum_probs=23.6 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEE Q ss_conf 8399996694248999999999998648997299999 Q gi|254780546|r 20 AHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAI 56 (423) Q Consensus 20 ~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~ 56 (423) .||+||+||++...-...++..+.+ .+.++.++ T Consensus 1 kri~l~itGs~~a~~~~~ll~~L~~----~~~~v~vv 33 (118) T pfam02441 1 KKILLGITGSSAAIKALRLLRELKK----EGAEVRVI 33 (118) T ss_pred CEEEEEEECHHHHHHHHHHHHHHHH----CCCEEEEE T ss_conf 9799999178999999999999987----89979999 No 220 >pfam02016 Peptidase_S66 LD-carboxypeptidase. Muramoyl-tetrapeptide carboxypeptidase hydrolyses a peptide bond between a di-basic amino acid and the C-terminal D-alanine in the tetrapeptide moiety in peptidoglycan. This cleaves the bond between an L- and a D-amino acid. The function of this activity is in murein recycling. This family also includes the microcin c7 self-immunity protein. This family corresponds to Merops family S66. Probab=24.88 E-value=39 Score=12.88 Aligned_cols=60 Identities=13% Similarity=0.065 Sum_probs=26.3 Q ss_pred EEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEE-EE--CCCCCCCHHHH-HHHHHHHHHHCCCCEEE Q ss_conf 9996694248999999999998648997299999-96--67798784689-99999999871898899 Q gi|254780546|r 23 LVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAI-SV--DHCLRETAKDE-VRYVSDVCSRLRIAHSV 86 (423) Q Consensus 23 ~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~-~v--dHglr~~s~~e-~~~v~~~~~~lgi~~~~ 86 (423) +||-|++.|...+-...+.+ +.. +++++.- |+ .|+.-..++++ ++...+...+=.|.... T Consensus 3 iiaPS~~~~~~~~~~~~~~L-~~~---G~~v~~~~~~~~~~~~~ag~d~~Ra~dL~~a~~d~~i~aI~ 66 (281) T pfam02016 3 IVAPSSPLDPERLERAIARL-ESL---GLEVVVGPHALARDGYLAGTDEERAADLHAAFADPEIDAII 66 (281) T ss_pred EEECCCCCCHHHHHHHHHHH-HHC---CCEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEE T ss_conf 99289989999999999999-959---99898787624446764799999999999985089988999 No 221 >CHL00202 argB acetylglutamate kinase; Provisional Probab=24.81 E-value=40 Score=12.87 Aligned_cols=158 Identities=13% Similarity=0.148 Sum_probs=79.6 Q ss_pred CCCHHHHHHHHHHHC-----CCCCEEEEEECCCH--HHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHH Q ss_conf 769899999999852-----79983999966942--48999999999998648997299999966779878468999999 Q gi|254780546|r 2 FLSPIESVRFFVRSL-----VYPAHILVAVSGGS--DSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVS 74 (423) Q Consensus 2 ~~~p~~~~~~~~~~l-----~~~~~i~vAvSGG~--DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~ 74 (423) |++|.|.+.-|...+ ..+.-++|.++|-. |-...-.+++++..-+. .+ +..+.| ||-+++ .. T Consensus 1 m~~~~e~v~~lrea~pYi~~~rgktfVIk~gG~~~~d~~l~~~~~~DialL~~-lG--ik~VlV-HGgg~q-------I~ 69 (284) T CHL00202 1 MLNNDERVQVLSEALPYIQKFRGRIMVIKYGGAAMKNLILKADIIKDILFLSC-IG--LKIVVV-HGGGPE-------IN 69 (284) T ss_pred CCCHHHHHHHHHHHHHHHHHHCCCEEEEEECCEEEECHHHHHHHHHHHHHHHH-CC--CEEEEE-CCCCHH-------HH T ss_conf 98778999999997799999689989999896666375489999999999998-89--979998-899668-------99 Q ss_pred HHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 99987189889999750478778037888545555432001234420123456665668999986236544433223576 Q gi|254780546|r 75 DVCSRLRIAHSVVSWKNSKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMC 154 (423) Q Consensus 75 ~~~~~lgi~~~~~~~~~~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~ 154 (423) +.++++|++.... ++..-+. .+. -... ...++ .++...++..+..+|....++++. T Consensus 70 ~~l~~~gi~~~f~--~G~RvTd-~~~-l~vv---------------~~vl~----g~vn~~lv~~l~~~g~~a~gl~g~- 125 (284) T CHL00202 70 FWLKQLNISPKFW--NGIRVTD-KVT-MEIV---------------EMVLA----GKVNKDLVGSINANGGKAVGLCGK- 125 (284) T ss_pred HHHHHCCCCCCCC--CCCCCCC-HHH-HHHH---------------HHHHH----HHHHHHHHHHHHHCCCCCEEEECC- T ss_conf 9999769984362--8832188-789-9999---------------99987----789999999998569950553125- Q ss_pred CCCCCCCCCEEECCHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 410035796871012104289999989981893202667864221018999764345 Q gi|254780546|r 155 DTILYDLNLWISRPFLRCRREDIRSFLLQRNISWCEDPSNTDDRFERVRVRRFVRDI 211 (423) Q Consensus 155 ~~~~~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wveDpSN~d~~f~R~rlR~~l~~l 211 (423) .+++-..||.- ...+.|+-++..-|+..-+..+-+-..|. T Consensus 126 -----dg~~i~A~~~~------------~~D~g~vG~V~~Vd~~~I~~lL~~g~IpV 165 (284) T CHL00202 126 -----DANLIVARASD------------KKDLGLVGEIQQVDPQLIDMLLEKNYIPV 165 (284) T ss_pred -----CCCEEEEECCC------------CCCCCCCCCCEEECHHHHHHHHHCCCEEE T ss_conf -----57669983589------------87744444412326799999970897599 No 222 >COG1377 FlhB Flagellar biosynthesis pathway, component FlhB [Cell motility and secretion / Intracellular trafficking and secretion] Probab=24.75 E-value=40 Score=12.86 Aligned_cols=15 Identities=27% Similarity=0.740 Sum_probs=6.4 Q ss_pred CCCEEE--CCCCCCCCH Q ss_conf 797661--230068881 Q gi|254780546|r 385 GGKPLL--APFSRFMTK 399 (423) Q Consensus 385 ~g~~li--aP~~r~~~~ 399 (423) +|-.++ +|.+|.+++ T Consensus 308 ~~Ipi~enppLARaLY~ 324 (363) T COG1377 308 HGIPIIENPPLARALYR 324 (363) T ss_pred CCCCEECCHHHHHHHHH T ss_conf 49956418077999997 No 223 >cd03329 MR_like_4 Mandelate racemase (MR)-like subfamily of the enolase superfamily, subgroup 4. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and conserved catalytic residues, a Lys-X-Lys motif and a conserved histidine-aspartate dyad. This subgroup's function is unknown. Probab=24.13 E-value=41 Score=12.78 Aligned_cols=31 Identities=16% Similarity=0.154 Sum_probs=21.3 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 9899818932026678642210189997643 Q gi|254780546|r 179 SFLLQRNISWCEDPSNTDDRFERVRVRRFVR 209 (423) Q Consensus 179 ~~~~~~~i~wveDpSN~d~~f~R~rlR~~l~ 209 (423) ..+.++++-|+|||...+.--.=.++|+.+. T Consensus 210 ~~le~~~~~w~EeP~~~~d~~~~~~l~~~~~ 240 (368) T cd03329 210 RALEELGFFWYEDPLREASISSYRWLAEKLD 240 (368) T ss_pred HHHCCCCEEEEECCCCCCCHHHHHHHHHHCC T ss_conf 6433465157517998447999999996099 No 224 >COG0452 Dfp Phosphopantothenoylcysteine synthetase/decarboxylase [Coenzyme metabolism] Probab=24.09 E-value=41 Score=12.78 Aligned_cols=35 Identities=11% Similarity=0.158 Sum_probs=24.6 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEE Q ss_conf 839999669424899999999999864899729999996 Q gi|254780546|r 20 AHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISV 58 (423) Q Consensus 20 ~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~v 58 (423) .+|+|+||||..+-=.+.+...+.+ . +.++.++-- T Consensus 5 k~ill~v~gsiaayk~~~l~r~L~~-~---ga~v~vvmt 39 (392) T COG0452 5 KRILLGVTGSIAAYKSVELVRLLRR-S---GAEVRVVMT 39 (392) T ss_pred CEEEEEECCCHHHHHHHHHHHHHHH-C---CCEEEEECC T ss_conf 6389994173334467899998762-7---973689706 No 225 >TIGR01136 cysKM cysteine synthases; InterPro: IPR005856 This model discriminates cysteine synthases (both CysK and CysM) from cystathionine beta-synthase, a protein found primarily in eukaryotes and carrying a C-terminal CBS domain lacking from this protein. Bacterial proteins lacking the CBS domain but otherwise showing resemblance to cystathionine beta-synthases and considerable phylogenetic distance from known cysteine synthases were excluded. Cysteine synthase (O-acetylserine (thiol)-lyase) is the enzyme responsible for the formation of cysteine from O-acetyl-serine and hydrogen sulphide with the concomitant release of acetic acid. In bacteria such two forms of the enzyme are known (genes cysK and cysM). CysM differs from CysK in that it can also use thiosulphate instead of sulphide, to produce cysteine thiosulphonate instead of cysteine. ; GO: 0004124 cysteine synthase activity, 0006535 cysteine biosynthetic process from serine. Probab=24.05 E-value=41 Score=12.77 Aligned_cols=17 Identities=12% Similarity=0.128 Sum_probs=5.8 Q ss_pred CCCEEECCCEEEEEECC Q ss_conf 98456107579998899 Q gi|254780546|r 298 KRGCVSIGRVVIDRRAN 314 (423) Q Consensus 298 ~~~~~tl~g~~i~~~~~ 314 (423) +.+++|.-|=.++.+.. T Consensus 178 TGGTItGvgr~LK~~~p 194 (315) T TIGR01136 178 TGGTITGVGRVLKEQNP 194 (315) T ss_pred CCCCHHHHHHHHHHHCC T ss_conf 67511168989863268 No 226 >TIGR01162 purE phosphoribosylaminoimidazole carboxylase, catalytic subunit; InterPro: IPR000031 Phosphoribosylaminoimidazole carboxylase is a fusion protein in plants and fungi, but consists of two non-interacting proteins in bacteria, PurK and PurE. PurK, N5-carboxyaminoimidazole ribonucleotide (N5_CAIR) synthetase, catalyzes the conversion of 5-aminoimidazole ribonucleotide (AIR), ATP, and bicarbonate to N5-CAIR, ADP, and Pi. PurE converts N5-CAIR to CAIR, the sixth step of de novo purine biosynthesis. In the presence of high concentrations of bicarbonate, PurE is reported able to convert AIR to CAIR directly and without ATP. Some members of this family contain two copies of this domain . The crystal structure of PurE indicates a unique quaternary structure that confirms the octameric nature of the enzyme .; GO: 0004638 phosphoribosylaminoimidazole carboxylase activity, 0006189 'de novo' IMP biosynthetic process, 0009320 phosphoribosylaminoimidazole carboxylase complex. Probab=23.82 E-value=41 Score=12.74 Aligned_cols=51 Identities=8% Similarity=0.073 Sum_probs=32.4 Q ss_pred CCHHHH-HHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCC-CCHH Q ss_conf 784689-99999999871898899997504787780378885455554320012344-2012 Q gi|254780546|r 64 ETAKDE-VRYVSDVCSRLRIAHSVVSWKNSKPQTGLMAAAREARYALISEHAKTINA-TLIM 123 (423) Q Consensus 64 ~~s~~e-~~~v~~~~~~lgi~~~~~~~~~~~~~~~~~~~ar~~r~~~~~~~~~~~~~-~~l~ 123 (423) .+||-+ .+...+.|+++||||+..-+..- +++. +...+...+++.|. ..+. T Consensus 7 S~SD~~~m~~a~~~L~~fgi~~e~~V~SAH-RTP~--------~~~~ya~~a~~~G~P~ViI 59 (159) T TIGR01162 7 SDSDLETMKKAAEILEEFGIPYELRVVSAH-RTPE--------LMFEYAKEAEERGIPKVII 59 (159) T ss_pred CCCCHHHHHHHHHHHHHCCCCEEEEEECCC-CCHH--------HHHHHHHHHHHCCCCEEEE T ss_conf 725478999999999855996678986067-7808--------8999999998678997998 No 227 >TIGR02322 phosphon_PhnN phosphonate metabolism protein/1,5-bisphosphokinase (PRPP-forming) PhnN; InterPro: IPR012699 Members of this family resemble PhnN of phosphonate utilization operons, where different such operons confer the ability to use somewhat different profiles of C-P bond-containing compounds , including phosphites as well as phosphonates. PhnN in Escherichia coli shows considerable homology to guanylate kinases (2.7.4.8 from EC), and has actually been shown to act as a ribose 1,5-bisphosphokinase (PRPP forming). This suggests an analogous kinase reaction for phosphonate metabolism, converting 5-phosphoalpha-1-(methylphosphono)ribose to methylphosphono-PRPP.. Probab=23.81 E-value=41 Score=12.74 Aligned_cols=131 Identities=18% Similarity=0.169 Sum_probs=56.8 Q ss_pred EEEEECC-CHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCC--CCCCHHH-H---HHHHHHHHHHCCCCEEEEEEEC--- Q ss_conf 9999669-424899999999999864899729999996677--9878468-9---9999999987189889999750--- Q gi|254780546|r 22 ILVAVSG-GSDSMGLLIALHSVLSDRSFGKIKFSAISVDHC--LRETAKD-E---VRYVSDVCSRLRIAHSVVSWKN--- 91 (423) Q Consensus 22 i~vAvSG-G~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHg--lr~~s~~-e---~~~v~~~~~~lgi~~~~~~~~~--- 91 (423) ++|+=|| ||||+ |-.+.+-.... .+|+.+| - -||-+.. | |--.++|-+.-+=..+.++|+. T Consensus 5 ~vvGPSGaGKDtL--l~~AR~~l~~~----~r~~F~r---RvITR~a~AggEnH~Als~~EF~~~~~~G~FAl~W~AHGL 75 (183) T TIGR02322 5 YVVGPSGAGKDTL--LDYARARLAGD----PRVHFVR---RVITRPADAGGENHDALSTEEFDAREDGGAFALSWQAHGL 75 (183) T ss_pred EEEECCCCCHHHH--HHHHHHHHCCC----CCEEEEE---EEEECCCCCCCCCCCCCCHHHHHHHHHCCCEEEEEECCCC T ss_conf 9970778867789--99999970489----9658831---2774375224667640477899999718981898613674 Q ss_pred -------------------CCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHH--HHHHHHHHHHHHHCCCCCCCCCC Q ss_conf -------------------478778037888545555432001234420123456--66566899998623654443322 Q gi|254780546|r 92 -------------------SKPQTGLMAAAREARYALISEHAKTINATLIMTAHT--FDDQLETVYMRSQRDYAEKGMGL 150 (423) Q Consensus 92 -------------------~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~--~dD~~Et~l~rl~r~sg~~g~~l 150 (423) .+.+-..-..+| .||..+.-.+-...-++|..=-- .=...|.+.-||.|..-..|-.+ T Consensus 76 ~YGiP~eid~wL~~G~~Vv~NGSRa~Lp~ar-~rYp~L~~V~Ita~~dVLa~RL~~RgRE~~~~I~~RL~Rs~~~~~dll 154 (183) T TIGR02322 76 SYGIPVEIDQWLEAGDVVVVNGSRAVLPEAR-QRYPNLLVVNITASPDVLAQRLAARGRESAEEIEERLARSARFAGDLL 154 (183) T ss_pred CCCCCHHHHHHHHCCCEEEEECHHHHHHHHH-HHHHCCEEEEEECCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHH T ss_conf 3578303787875499899975288999998-743246027874581289999997589898899999887753110220 Q ss_pred CCCCCCCCCCCC Q ss_conf 357641003579 Q gi|254780546|r 151 SGMCDTILYDLN 162 (423) Q Consensus 151 ~~~~~~~~~~~~ 162 (423) +...+.....+. T Consensus 155 ~~~~dv~~i~Ns 166 (183) T TIGR02322 155 LEPADVTTIDNS 166 (183) T ss_pred CCCCCEEEEECC T ss_conf 059872798468 No 228 >COG1751 Uncharacterized conserved protein [Function unknown] Probab=23.44 E-value=42 Score=12.69 Aligned_cols=60 Identities=18% Similarity=0.160 Sum_probs=41.8 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHH-HHHHHHHHHHCCCCEEE Q ss_conf 983999966942489999999999986489972999999667798784689-99999999871898899 Q gi|254780546|r 19 PAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDE-VRYVSDVCSRLRIAHSV 86 (423) Q Consensus 19 ~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e-~~~v~~~~~~lgi~~~~ 86 (423) -.+++||-|-|---+-++-++.. .++++++|-+.|.+..-..| ...+++..++.|..... T Consensus 28 ik~~vVAS~tG~tA~k~lemveg--------~lkvVvVthh~Gf~e~g~~e~~~E~~~~L~erGa~v~~ 88 (186) T COG1751 28 IKHIVVASSTGYTALKALEMVEG--------DLKVVVVTHHAGFEEKGTQEMDEEVRKELKERGAKVLT 88 (186) T ss_pred CCEEEEEECCCHHHHHHHHHCCC--------CCEEEEEEEECCCCCCCCEECCHHHHHHHHHCCCEEEE T ss_conf 61599984365789999984426--------73599998322655688520589999999970860043 No 229 >TIGR01121 D_amino_aminoT D-amino acid aminotransferase; InterPro: IPR005784 D-amino acid aminotransferase (2.6.1.21 from EC) catalyzes transamination between various D-amino acids and alpha-keto acids. This enzyme is a homodimer. The pyridoxal phosphate attachment site is a Lys in the motif Cys-Asp-Ile-Lys-Ser-Leu-Asn. Specificity is broad for various D-amino acids, and differs among members of the family .; GO: 0008483 transaminase activity, 0008152 metabolic process. Probab=23.34 E-value=42 Score=12.68 Aligned_cols=27 Identities=19% Similarity=0.165 Sum_probs=19.3 Q ss_pred HHCCCHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 210428999998998189320266786 Q gi|254780546|r 169 FLRCRREDIRSFLLQRNISWCEDPSNT 195 (423) Q Consensus 169 LL~~~r~~l~~~~~~~~i~wveDpSN~ 195 (423) |-++||+.|...|.+.||++-|.|--. T Consensus 202 LnGITR~~i~~~~~~~~~p~~E~~~T~ 228 (278) T TIGR01121 202 LNGITRQVILKLAEENGIPVKEEPFTK 228 (278) T ss_pred CCCCCHHHHHHHHHHCCCCCCCCCCCH T ss_conf 168447899999874689701221256 No 230 >TIGR03468 HpnG hopanoid-associated phosphorylase. The sequences in this family are members of the pfam01048 family of phosphorylases typically acting on nucleotide-sugar substrates. The genes of the family modeled here are generally in the same locus with genes involved in the biosynthesis and elaboration of hopene, the cyclization product of the polyisoprenoid squalene. This gene is adjacent to the genes PhnA-E and squalene-hopene cyclase (which would be HpnF) in Zymomonas mobilis and their association with hopene biosynthesis has been noted in the literature. Extending the gene symbol sequence, we suggest the symbol HpnG for the product of this gene. Hopanoids are known to be components of the plasma membrane and to have polar sugar head groups in Z. mobilis and other species. Probab=23.31 E-value=42 Score=12.68 Aligned_cols=28 Identities=7% Similarity=0.179 Sum_probs=19.4 Q ss_pred CCCHHHHHHHHHHHCCCC-----CCCCCCCCCC Q ss_conf 042899999899818932-----0266786422 Q gi|254780546|r 171 RCRREDIRSFLLQRNISW-----CEDPSNTDDR 198 (423) Q Consensus 171 ~~~r~~l~~~~~~~~i~w-----veDpSN~d~~ 198 (423) +.--.-+-++|..+|+|| |-||.+.+.= T Consensus 131 DMEsaAvA~va~~~giPF~viRaISD~a~~~lP 163 (212) T TIGR03468 131 DMESGAVAAVAAAAGLPFAVIRVISDPADRALP 163 (212) T ss_pred ECHHHHHHHHHHHCCCCEEEEEEEECCCCCCCC T ss_conf 032899999999819978999998536665477 No 231 >PRK06633 acetyl-CoA acetyltransferase; Provisional Probab=23.27 E-value=37 Score=13.11 Aligned_cols=31 Identities=0% Similarity=-0.176 Sum_probs=11.9 Q ss_pred EEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 299999966779878468999999999871898 Q gi|254780546|r 51 IKFSAISVDHCLRETAKDEVRYVSDVCSRLRIA 83 (423) Q Consensus 51 ~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~ 83 (423) .++-+.|||- ...|.-++-......-..|-. T Consensus 78 ~~vp~~TVnr--~C~SGl~Ai~~aa~~I~~G~~ 108 (392) T PRK06633 78 KEVPGYTINK--VCGSGLKSVALAANSIMTGDN 108 (392) T ss_pred CCCCEEEECC--CCHHHHHHHHHHHHHHHCCCC T ss_conf 8887268877--337999999999999967998 No 232 >TIGR01393 lepA GTP-binding protein LepA; InterPro: IPR006297 LepA (GUF1 in Saccaromyces) is a GTP-binding membrane protein related to EF-G and EF-Tu. Two types of phylogenetic tree, rooted by other GTP-binding proteins, suggest that eukaryotic homologs (including GUF1 of yeast) originated within the bacterial LepA family. The function of the proteins in this family are unknown. ; GO: 0005525 GTP binding. Probab=23.27 E-value=39 Score=12.94 Aligned_cols=87 Identities=14% Similarity=0.132 Sum_probs=42.3 Q ss_pred CEEECCCEEEEEECCEEEEEECCCCCCCCCCCCCCCEEE---------C--CCEEEEC-CCCCEEEECCCCCCCCCCCCC Q ss_conf 456107579998899899984534377552368982320---------7--8417512-777503521675442542012 Q gi|254780546|r 300 GCVSIGRVVIDRRANFLWITRAVRNLPTLILYPEETTVW---------D--GRYQFQN-LSDSLIQIGPQSYRKADVPSG 367 (423) Q Consensus 300 ~~~tl~g~~i~~~~~~l~i~RE~~~~~~~~~~~~~~~~w---------D--gR~~i~~-~~~~~~~l~~~g~~~~~~~~~ 367 (423) -+.-.-.+++..|+|.+.+.|=+ .-.+.+|+.+-+ + |-|.+.. .......-|..|+..+...+- T Consensus 192 LkALIFDS~YD~YrGVv~~vRv~----~G~ik~gD~I~~Mstgk~y~V~evG~~~P~~~~~~~~L~aGeVGy~~AgIK~v 267 (598) T TIGR01393 192 LKALIFDSHYDNYRGVVVLVRVF----EGTIKKGDKIRFMSTGKEYEVDEVGVFTPKLEVKTQELSAGEVGYIIAGIKDV 267 (598) T ss_pred CEEEEEEEEECCCCEEEEEEEEE----ECEECCCCEEEEEECCCEEEEEEEEEECCCCCCCCCCEECCCEEEEEEEEEEC T ss_conf 32278843543865089999995----26864698899953487666755003434520146620016305999865310 Q ss_pred CC-------CCCCHHHHHHCCHHHCCCEEE Q ss_conf 31-------135546776271242797661 Q gi|254780546|r 368 IP-------PIIAQRALSSMPSKEGGKPLL 390 (423) Q Consensus 368 ~~-------~~~~~~~l~~lP~~~~g~~li 390 (423) .+ +....++.-.+|....-+++| T Consensus 268 ~D~~VGDTiT~~~~Pa~eplpGF~~~KP~V 297 (598) T TIGR01393 268 SDVKVGDTITSVKNPAKEPLPGFKEVKPMV 297 (598) T ss_pred CCEECCCEEECCCCCCCCCCCCCCCCCCEE T ss_conf 411205445256787376788861257658 No 233 >PRK11706 TDP-4-oxo-6-deoxy-D-glucose transaminase; Provisional Probab=23.20 E-value=42 Score=12.66 Aligned_cols=25 Identities=16% Similarity=0.130 Sum_probs=20.1 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 8999998998189320266786422 Q gi|254780546|r 174 REDIRSFLLQRNISWCEDPSNTDDR 198 (423) Q Consensus 174 r~~l~~~~~~~~i~wveDpSN~d~~ 198 (423) =++|.++|+++||+.|||..-.--. T Consensus 135 ~~~I~~iak~~~i~vIEDaA~a~Gs 159 (375) T PRK11706 135 MDTIMALAKKHNLFVVEDAAQGVMS 159 (375) T ss_pred HHHHHHHHHHCCCEEEECCCCCCCC T ss_conf 8999998853790898344003687 No 234 >PRK11175 universal stress protein UspE; Provisional Probab=23.18 E-value=42 Score=12.66 Aligned_cols=145 Identities=14% Similarity=0.154 Sum_probs=69.5 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCC--------CCHHHH---------HHHHHHHHH--- Q ss_conf 983999966942489999999999986489972999999667798--------784689---------999999998--- Q gi|254780546|r 19 PAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLR--------ETAKDE---------VRYVSDVCS--- 78 (423) Q Consensus 19 ~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr--------~~s~~e---------~~~v~~~~~--- 78 (423) -.||+|++-|..+|..-+.-+..+++..+ . +|+++++-+... ++..++ .+.+++..+ T Consensus 3 fk~ILv~~D~s~~s~~Al~rA~~LA~~~~-A--~l~ll~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 79 (304) T PRK11175 3 YQNILVVIDPNQDDQPALARAVYLAQRSG-A--KITAFLSIYDFSYEMTTLLSPDEREAMRQGVISQRTAWIQEQAKPYL 79 (304) T ss_pred CCEEEEECCCCHHHHHHHHHHHHHHHHHC-C--EEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 78799977997432999999999999859-9--89999996067731013478256999999999999999999999997 Q ss_pred HCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 71898899997504787780378885455554320012344201234566656689999862365444332235764100 Q gi|254780546|r 79 RLRIAHSVVSWKNSKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSGMCDTIL 158 (423) Q Consensus 79 ~lgi~~~~~~~~~~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~~~~~~~ 158 (423) ..|++..+.-....++ +....+.+.+.+++.++.|.|..+.... ++.|+.-. .+....+ T Consensus 80 ~~g~~v~~~v~~~~~p------------~~~Ii~~a~~~~~DLVV~g~~~~~~l~~----~l~gs~~~--~llR~~~--- 138 (304) T PRK11175 80 DAGVPIDIKVVWHNRP------------FEAIIQEVIKGKHDLVVKMTHQHDKLES----VIFTPTDW--HLLRKCP--- 138 (304) T ss_pred HCCCCCEEEEEECCCH------------HHHHHHHHHHCCCCEEEECCCCCCHHHH----HHHCHHHH--HHHHCCC--- T ss_conf 4698710799988987------------9999999997599899987777623443----30051589--9984499--- Q ss_pred CCCCCEEECCHHCCCHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 3579687101210428999998998189320266786422 Q gi|254780546|r 159 YDLNLWISRPFLRCRREDIRSFLLQRNISWCEDPSNTDDR 198 (423) Q Consensus 159 ~~~~~~i~RPLL~~~r~~l~~~~~~~~i~wveDpSN~d~~ 198 (423) .|.|-++...-.. ...+-.--|+|+.++. T Consensus 139 --------~PVllv~~~~~~~---~~~IlvavD~~~~~~~ 167 (304) T PRK11175 139 --------CPVLMVKDHDWPE---GGKILVAVNVASEDPY 167 (304) T ss_pred --------CCEEEEECCCCCC---CCCEEEEECCCCCCHH T ss_conf --------9589972566666---6858999678863255 No 235 >PRK09409 insertion element IS2 transposase InsD; Reviewed Probab=23.06 E-value=43 Score=12.64 Aligned_cols=18 Identities=17% Similarity=0.141 Sum_probs=8.8 Q ss_pred CCHHHHHHHHHHHCCCCC Q ss_conf 428999998998189320 Q gi|254780546|r 172 CRREDIRSFLLQRNISWC 189 (423) Q Consensus 172 ~~r~~l~~~~~~~~i~wv 189 (423) ++-.++.++|+++||.+. T Consensus 203 y~S~~~~~~~~~~gi~~s 220 (301) T PRK09409 203 YRANETRQFARMLGLEPK 220 (301) T ss_pred HHHHHHHHHHHHCCCEEC T ss_conf 489999999998498760 No 236 >cd01561 CBS_like CBS_like: This subgroup includes Cystathionine beta-synthase (CBS) and Cysteine synthase. CBS is a unique heme-containing enzyme that catalyzes a pyridoxal 5'-phosphate (PLP)-dependent condensation of serine and homocysteine to give cystathionine. Deficiency of CBS leads to homocystinuria, an inherited disease of sulfur metabolism characterized by increased levels of the toxic metabolite homocysteine. Cysteine synthase on the other hand catalyzes the last step of cysteine biosynthesis. This subgroup also includes an O-Phosphoserine sulfhydrylase found in hyperthermophilic archaea which produces L-cysteine from sulfide and the more thermostable O-phospho-L-serine. Probab=22.85 E-value=43 Score=12.61 Aligned_cols=55 Identities=11% Similarity=0.211 Sum_probs=27.5 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 9983999966942489999999999986489972999999667798784689999999998718988999 Q gi|254780546|r 18 YPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVV 87 (423) Q Consensus 18 ~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~ 87 (423) .+...+|..|.|-=..+++..++.+ ++++.++ | |+.. ...-.+.++.+|-..... T Consensus 51 ~~~~~vv~aSsGN~g~alA~~a~~~-------Gi~~~I~-~-----p~~~--~~~k~~~i~~~GA~Vv~v 105 (291) T cd01561 51 KPGTTIIEPTSGNTGIGLAMVAAAK-------GYRFIIV-M-----PETM--SEEKRKLLRALGAEVILT 105 (291) T ss_pred CCCCEEEEECCCCHHHHHHHHHHHC-------CCCEEEE-C-----CCCC--HHHHHHHHHHCCCEEEEE T ss_conf 8997899808984799999999980-------9983896-4-----7630--699999999669889997 No 237 >cd04191 Glucan_BSP_ModH Glucan_BSP_ModH catalyzes the elongation of beta-1,2 polyglucose chains of glucan. Periplasmic Glucan Biosynthesis protein ModH is a glucosyltransferase that catalyzes the elongation of beta-1,2 polyglucose chains of glucan, requiring a beta-glucoside as a primer and UDP-glucose as a substrate. Glucans are composed of 5 to 10 units of glucose forming a highly branched structure, where beta-1,2-linked glucose constitutes a linear backbone to which branches are attached by beta-1,6 linkages. In Escherichia coli, glucans are located in the periplasmic space, functioning as regulator of osmolarity. It is synthesized at a maximum when cells are grown in a medium with low osmolarity. It has been shown to span the cytoplasmic membrane. Probab=22.59 E-value=43 Score=12.58 Aligned_cols=79 Identities=14% Similarity=0.186 Sum_probs=39.7 Q ss_pred HHHHHCCCCCEEEEEEEECCCCCCCH-HHHHHHHHHHHHHCCCC---EEEEEEE-CCCCCCCCHHHHHH--HHHHHHHHH Q ss_conf 99986489972999999667798784-68999999999871898---8999975-04787780378885--455554320 Q gi|254780546|r 41 SVLSDRSFGKIKFSAISVDHCLRETA-KDEVRYVSDVCSRLRIA---HSVVSWK-NSKPQTGLMAAARE--ARYALISEH 113 (423) Q Consensus 41 ~~~~~~~~~~~~l~a~~vdHglr~~s-~~e~~~v~~~~~~lgi~---~~~~~~~-~~~~~~~~~~~ar~--~r~~~~~~~ 113 (423) .+....+...+++ ...|-+-+++. .+|..-..++|+++|-. ++..+-+ ...+.+|+.+..+. .+|.++ T Consensus 25 sl~~~~~~~~f~v--~vLsDs~~p~~~~~E~~a~~~l~~~~~~~~~v~Yr~R~~n~g~KAGni~~~l~~~g~~yd~~--- 99 (254) T cd04191 25 SLAKTGLADHFDF--FILSDTRDPDIWLAEEAAWLDLCEELGAQGRIYYRRRRENTGRKAGNIADFCRRWGSRYDYM--- 99 (254) T ss_pred HHHHHCCCCCCEE--EEEECCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHCCCCCCEE--- T ss_conf 9986388565379--99808997578999999999999984788745988568667987377999999539984679--- Q ss_pred CCCCCCCCHHHH Q ss_conf 012344201234 Q gi|254780546|r 114 AKTINATLIMTA 125 (423) Q Consensus 114 ~~~~~~~~l~~a 125 (423) --..++.+..+ T Consensus 100 -~vlDAD~~~~~ 110 (254) T cd04191 100 -VVLDADSLMSG 110 (254) T ss_pred -EEECCCCCCCH T ss_conf -99758889986 No 238 >pfam00763 THF_DHG_CYH Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain. Probab=22.08 E-value=44 Score=12.51 Aligned_cols=40 Identities=18% Similarity=0.146 Sum_probs=32.0 Q ss_pred CEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEC Q ss_conf 729999996677987846899999999987189889999750 Q gi|254780546|r 50 KIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKN 91 (423) Q Consensus 50 ~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~ 91 (423) ...+.++.| |=.+.|......-.+.|+++||.+.+.+++. T Consensus 29 ~P~Laii~v--g~~~~s~~Yv~~k~k~~~~~Gi~~~~~~l~~ 68 (117) T pfam00763 29 TPKLAVILV--GDDPASQVYVRSKRKAAEELGIESELIRLPE 68 (117) T ss_pred CCEEEEEEE--CCCHHHHHHHHHHHHHHHHCCCEEEEEECCC T ss_conf 971899984--8975679999999999997287169987776 No 239 >PRK07261 topology modulation protein; Provisional Probab=22.05 E-value=44 Score=12.51 Aligned_cols=53 Identities=11% Similarity=0.152 Sum_probs=21.4 Q ss_pred EEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHH-HHHHHHHHH Q ss_conf 999966942489999999999986489972999999667798784689-999999998 Q gi|254780546|r 22 ILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDE-VRYVSDVCS 78 (423) Q Consensus 22 i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e-~~~v~~~~~ 78 (423) .+||.||..-|+. +..+...-+.|-+.+=.++...|..+-+.+| .+.++++.+ T Consensus 4 ~IiG~sGsGKSTl----Ar~L~~~~~ip~~~LD~l~w~p~w~~~~~~e~~~~~~~~~~ 57 (171) T PRK07261 4 AIIGYSGSGKSTL----ARFLGQHYNCPVLHLDQLHFSSNWQERDDDDMIADISNFLL 57 (171) T ss_pred EEECCCCCCHHHH----HHHHHHHHCCCEEEECCEEECCCCEECCHHHHHHHHHHHHH T ss_conf 9988999868999----99999987979797022788899988889999999999984 No 240 >pfam12268 DUF3612 Protein of unknown function (DUF3612). This domain family is found in bacteria, and is approximately 180 amino acids in length. The family is found in association with pfam01381. Probab=21.96 E-value=45 Score=12.49 Aligned_cols=26 Identities=27% Similarity=0.408 Sum_probs=19.9 Q ss_pred CHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 28999998998189320266786422 Q gi|254780546|r 173 RREDIRSFLLQRNISWCEDPSNTDDR 198 (423) Q Consensus 173 ~r~~l~~~~~~~~i~wveDpSN~d~~ 198 (423) -|.+|+..++-.+|.||+|--..+.+ T Consensus 133 ik~~L~sVa~ILnInWI~rgie~~Ar 158 (178) T pfam12268 133 IKTDLSSVARILNINWIERGIQEEAR 158 (178) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCE T ss_conf 99999999987610877742368864 No 241 >pfam10686 DUF2493 Protein of unknown function (DUF2493). Members of this family are all Proteobacteria. The function is not known. Probab=21.95 E-value=45 Score=12.49 Aligned_cols=63 Identities=17% Similarity=0.119 Sum_probs=36.2 Q ss_pred EEEEEECCCHH---HHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECC Q ss_conf 39999669424---8999999999998648997299999966779878468999999999871898899997504 Q gi|254780546|r 21 HILVAVSGGSD---SMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAHSVVSWKNS 92 (423) Q Consensus 21 ~i~vAvSGG~D---S~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~~~~~~~~~ 92 (423) ---|+|+||.| .-.+-..|-.+..+.+ .. +.|+-|--..++. ..++|+++.||+....+-++. T Consensus 3 g~~V~~~Ggrd~~D~~~i~~~Ld~~~~~~p----d~--vlihGG~~kGad~---lA~~WA~~~gv~~i~f~~dW~ 68 (71) T pfam10686 3 GTRVAFSGGRDFNDHRLIWDALDKVHARHP----DM--VLLHGGAPKGAER---IAARWARRRGVPQVAFKPDWR 68 (71) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHHHHCC----CE--EEEECCCCCCHHH---HHHHHHHHCCCCEEEECCCHH T ss_conf 998999808986509999999999998689----87--9997798633799---999999986997697685611 No 242 >cd01444 GlpE_ST GlpE sulfurtransferase (ST) and homologs are members of the Rhodanese Homology Domain superfamily. Unlike other rhodanese sulfurtransferases, GlpE is a single domain protein but indications are that it functions as a dimer. The active site contains a catalytically active cysteine. Probab=21.87 E-value=45 Score=12.48 Aligned_cols=35 Identities=23% Similarity=0.194 Sum_probs=18.7 Q ss_pred HHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHH Q ss_conf 99999999852799839999669424899999999 Q gi|254780546|r 6 IESVRFFVRSLVYPAHILVAVSGGSDSMGLLIALH 40 (423) Q Consensus 6 ~~~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~ 40 (423) ...+..++..+.+...|+|-+.+|.-|.-....|. T Consensus 43 ~~~~~~~~~~~~~~~~ivv~C~~G~rs~~aa~~L~ 77 (96) T cd01444 43 EDSLDDWLGDLDRDRPVVVYCYHGNSSAQLAQALR 77 (96) T ss_pred HHHHHHHHHHHCCCCCEEEEECCCHHHHHHHHHHH T ss_conf 66799988985689818998089668999999999 No 243 >pfam03848 TehB Tellurite resistance protein TehB. Probab=21.54 E-value=45 Score=12.44 Aligned_cols=144 Identities=13% Similarity=0.184 Sum_probs=86.6 Q ss_pred HHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCE Q ss_conf 89999999985279983999966942489999999999986489972999999667798784689999999998718988 Q gi|254780546|r 5 PIESVRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDVCSRLRIAH 84 (423) Q Consensus 5 p~~~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~~~~lgi~~ 84 (423) +...|..+.+.+.+..-+-+|+-.|.+|.-|+ . . +++|.|+ |+. ..-.+.+++++++-|++. T Consensus 18 ~HSev~~a~~~i~pgk~LDlgcG~GRNslyLa---~-----~---G~~Vtav--D~n-----~~aL~~l~~ia~~e~l~i 79 (192) T pfam03848 18 THSEVLEAVKTVKPGKALDLGCGQGRNSLFLS---L-----L---GYDVTAV--DHN-----ENSIANLQDIKEKENLDI 79 (192) T ss_pred CCHHHHHHHHHCCCCCEEEECCCCCHHHHHHH---H-----C---CCEEEEE--ECC-----HHHHHHHHHHHHHCCCCC T ss_conf 80999999863799746660478973189998---6-----8---9917999--799-----999999999999709975 Q ss_pred EEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC-CCCCCCCCCCC Q ss_conf 99997504787780378885455554320012344201234566656689999862365444332235-76410035796 Q gi|254780546|r 85 SVVSWKNSKPQTGLMAAAREARYALISEHAKTINATLIMTAHTFDDQLETVYMRSQRDYAEKGMGLSG-MCDTILYDLNL 163 (423) Q Consensus 85 ~~~~~~~~~~~~~~~~~ar~~r~~~~~~~~~~~~~~~l~~ah~~dD~~Et~l~rl~r~sg~~g~~l~~-~~~~~~~~~~~ 163 (423) .+...|... .++++ .|.+.. ..+++-...-+.+..++..+.+..-.+|..+.- .|.. ....- T Consensus 80 ~~~~~Din~--~~~~e-----~YD~Ii--------sTVvfmFL~~~~ip~iI~~mq~~T~pGGynlIv~am~t--~d~pc 142 (192) T pfam03848 80 PTALYDINS--ASIDE-----NYDFIL--------STVVLMFLQAERIPAIIANMQEHTNVGGYNLIVAAMST--ADYPC 142 (192) T ss_pred EEEEECCCC--CCCCC-----CCCEEE--------EEEEEEECCHHHHHHHHHHHHHHCCCCCEEEEEEEECC--CCCCC T ss_conf 268731555--68767-----768798--------88877731867877999999985289988999976145--53789 Q ss_pred EEECCH-HCCCHHHHHHHHHHHC Q ss_conf 871012-1042899999899818 Q gi|254780546|r 164 WISRPF-LRCRREDIRSFLLQRN 185 (423) Q Consensus 164 ~i~RPL-L~~~r~~l~~~~~~~~ 185 (423) . -|+ -.+...||.+|=+... T Consensus 143 ~--~~f~ftfk~gEL~~yy~~We 163 (192) T pfam03848 143 T--VPFSFTFKEGELKRYYQDWE 163 (192) T ss_pred C--CCCCCCCCHHHHHHHHCCCE T ss_conf 9--89887788018999738987 No 244 >PRK05702 flhB flagellar biosynthesis protein FlhB; Reviewed Probab=21.51 E-value=46 Score=12.43 Aligned_cols=23 Identities=13% Similarity=0.232 Sum_probs=12.0 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCH Q ss_conf 9998998189320266786422101 Q gi|254780546|r 177 IRSFLLQRNISWCEDPSNTDDRFER 201 (423) Q Consensus 177 l~~~~~~~~i~wveDpSN~d~~f~R 201 (423) +....-+.|..|. |...-++|+| T Consensus 105 i~~~~~Q~G~~fs--~k~l~pk~~r 127 (352) T PRK05702 105 LLAPILQGGWLFS--GKALKPKFSK 127 (352) T ss_pred HHHHHHHHCCCCC--CCCCCCCHHH T ss_conf 9999985274516--4247888465 No 245 >cd06391 PBP1_iGluR_delta_2 N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the delta2 receptor of an orphan glutamate receptor family. N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the delta2 receptor of an orphan glutamate receptor family. While this N-terminal domain belongs to the periplasmic-binding fold type I superfamily, the glutamate-binding domain of the iGluR is structurally homologous to the periplasmic-binding fold type II. The LIVBP-like domain of iGluRs is thought to play a role in the initial assembly of iGluR subunits, but it is not well understood how this domain is arranged and functions in intact iGluR. Although the delta receptors are a member of the ionotropic glutamate receptor family, they cannot be activated by AMPA, kainate, NMDA, glutamate, or any other ligands. Phylogenetic analysis shows that both GluRdelta1 and GluRalpha2 are closer related to non-NMDA receptors. GluRdelta2 was shown to function as a Probab=21.46 E-value=46 Score=12.43 Aligned_cols=74 Identities=14% Similarity=0.175 Sum_probs=49.0 Q ss_pred HHHHHHHHHHCCCC------CEEE---EEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 99999999852799------8399---99669424899999999999864899729999996677987846899999999 Q gi|254780546|r 6 IESVRFFVRSLVYP------AHIL---VAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRETAKDEVRYVSDV 76 (423) Q Consensus 6 ~~~~~~~~~~l~~~------~~i~---vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~~s~~e~~~v~~~ 76 (423) +.+.+..+..++.. .++. -=+.-. ||-..-..++++... .|.|+. -|.|..-+..++.+ T Consensus 14 e~AFr~Av~~iN~~~~~l~~~~L~~~i~~v~~~-DsF~~~k~~C~ll~~------GV~AIf-----GP~s~~sa~~v~Si 81 (400) T cd06391 14 DEVFRMAVADLNQNNEILQTEKITVSVTFVDGN-NPFQAVQEACELMNQ------GILALV-----SSIGCTSAGSLQSL 81 (400) T ss_pred HHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCC-CCHHHHHHHHHHHHC------CCEEEE-----CCCCHHHHHHHHHH T ss_conf 999999999860583325787106888873899-908999999998743------946999-----99975589999987 Q ss_pred HHHCCCCEEEEEEEC Q ss_conf 987189889999750 Q gi|254780546|r 77 CSRLRIAHSVVSWKN 91 (423) Q Consensus 77 ~~~lgi~~~~~~~~~ 91 (423) |..+.|||....|+. T Consensus 82 cdal~IPhi~~~~~~ 96 (400) T cd06391 82 ADAMHIPHLFIQRST 96 (400) T ss_pred HCCCCCCCEEECCCC T ss_conf 446889807632577 No 246 >COG3962 Acetolactate synthase [Amino acid transport and metabolism] Probab=21.45 E-value=30 Score=13.72 Aligned_cols=26 Identities=23% Similarity=0.323 Sum_probs=18.5 Q ss_pred CCCHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 04289999989981893202667864 Q gi|254780546|r 171 RCRREDIRSFLLQRNISWCEDPSNTD 196 (423) Q Consensus 171 ~~~r~~l~~~~~~~~i~wveDpSN~d 196 (423) .--+++|++|+..+|||.+|--.... T Consensus 243 S~A~~~L~af~E~~~iPv~ETQaGKs 268 (617) T COG3962 243 SGAREALRAFAETHGIPVVETQAGKS 268 (617) T ss_pred CHHHHHHHHHHHHCCCCEEECCCCCC T ss_conf 30799999999864996576357864 No 247 >PRK06115 dihydrolipoamide dehydrogenase; Reviewed Probab=21.22 E-value=46 Score=12.39 Aligned_cols=16 Identities=6% Similarity=0.206 Sum_probs=8.1 Q ss_pred HHHHHHHHHHCCCCCC Q ss_conf 9999989981893202 Q gi|254780546|r 175 EDIRSFLLQRNISWCE 190 (423) Q Consensus 175 ~~l~~~~~~~~i~wve 190 (423) +.+.+.++..|+.+.. T Consensus 219 ~~l~~~l~~~gi~i~~ 234 (466) T PRK06115 219 KTLQKALAKQGMRFKL 234 (466) T ss_pred HHHHHHHHHCCCEEEE T ss_conf 8888899876949995 No 248 >PRK07116 flavodoxin; Provisional Probab=21.08 E-value=46 Score=12.37 Aligned_cols=21 Identities=0% Similarity=-0.208 Sum_probs=9.1 Q ss_pred HHHHHHHCCCCEEEEEEECCC Q ss_conf 999998718988999975047 Q gi|254780546|r 73 VSDVCSRLRIAHSVVSWKNSK 93 (423) Q Consensus 73 v~~~~~~lgi~~~~~~~~~~~ 93 (423) ++.+.+.+|-+..-+.....- T Consensus 18 A~~I~~~~gaDi~eI~~~~~Y 38 (158) T PRK07116 18 AEKLAEVTGGDLYEITPAQPY 38 (158) T ss_pred HHHHHHHHCCCEEEEEECCCC T ss_conf 999999879997999867876 No 249 >cd01523 RHOD_Lact_B Member of the Rhodanese Homology Domain superfamily. This CD includes predicted proteins with rhodanese-like domains found N-terminal of the metallo-beta-lactamase domain. Probab=21.01 E-value=47 Score=12.36 Aligned_cols=46 Identities=11% Similarity=-0.047 Sum_probs=25.9 Q ss_pred HHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCC Q ss_conf 9999998527998399996694248999999999998648997299999966779 Q gi|254780546|r 8 SVRFFVRSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCL 62 (423) Q Consensus 8 ~~~~~~~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHgl 62 (423) .....++.+.+...|+|-|..|.-|...+.+|++ + +++ ++++.-|+ T Consensus 50 ~~~~~~~~lp~d~~ivv~C~~G~rS~~aa~~L~~----~---G~~--v~~L~GG~ 95 (100) T cd01523 50 IEEDILDQLPDDQEVTVICAKEGSSQFVAELLAE----R---GYD--VDYLAGGM 95 (100) T ss_pred HHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHH----C---CCE--EEEECCCH T ss_conf 6888996478998399992896569999999998----7---983--79836627 No 250 >TIGR02211 LolD_lipo_ex lipoprotein releasing system, ATP-binding protein; InterPro: IPR011924 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents LolD, a member of the ABC transporter family. LolD is involved in localization of lipoproteins in some bacteria. It works with a transmembrane protein LolC, which in some species is a paralogous pair LolC and LolE. Depending on the residue immediately following the modified N-terminal Cys residue, the nascent lipoprotein may be carried further by LolA and LolB to the outer membrane, or remain at the inner membrane. Excluded from this entry are homologs from the archaeal genus Methanosarcina .; GO: 0005524 ATP binding, 0006810 transport, 0016020 membrane. Probab=20.98 E-value=47 Score=12.36 Aligned_cols=109 Identities=17% Similarity=0.243 Sum_probs=61.1 Q ss_pred CCCCCE-EEEEECCCHHHHHHHHHHHHHHH------------------------HCCCCCEEEEEEEECCCC-------- Q ss_conf 279983-99996694248999999999998------------------------648997299999966779-------- Q gi|254780546|r 16 LVYPAH-ILVAVSGGSDSMGLLIALHSVLS------------------------DRSFGKIKFSAISVDHCL-------- 62 (423) Q Consensus 16 l~~~~~-i~vAvSGG~DS~aLl~ll~~~~~------------------------~~~~~~~~l~a~~vdHgl-------- 62 (423) +.+++- -+|+=||=.=|| |||+|--+=. ++...++ +.=.||=| T Consensus 28 i~~GE~~~IvG~SGSGKST-LLHlLGGLD~PT~G~v~f~G~~l~~lS~~~~~~LRN~~LGF---iYQFHHLL~dFtaLEN 103 (221) T TIGR02211 28 IGKGEIVAIVGSSGSGKST-LLHLLGGLDNPTSGEVLFNGQSLSKLSSNERAKLRNKKLGF---IYQFHHLLPDFTALEN 103 (221) T ss_pred EECCCEEEEECCCCCCHHH-HHHHHHCCCCCCCCEEEECCCCHHHCCHHHHHHHHHHHCCC---EEEHHHCCCCCCHHHH T ss_conf 3066337987367871689-99987306899631589706323440446788751222584---4432020300002688 Q ss_pred -------CCCHHHH-HHHHHHHHHHCCCCEEEEEEE-------------------------CCCCCCCCHHHHHHHHHHH Q ss_conf -------8784689-999999998718988999975-------------------------0478778037888545555 Q gi|254780546|r 63 -------RETAKDE-VRYVSDVCSRLRIAHSVVSWK-------------------------NSKPQTGLMAAAREARYAL 109 (423) Q Consensus 63 -------r~~s~~e-~~~v~~~~~~lgi~~~~~~~~-------------------------~~~~~~~~~~~ar~~r~~~ 109 (423) +..|..| .+...+..++-|..|.+-+.. .+.|++|+.......-|.+ T Consensus 104 VaMP~LIg~~s~~ea~~~A~~mL~~VgL~~R~~h~PSELSGGERQRvAIARALvN~P~lvlADEPTGNLD~~~a~~iF~L 183 (221) T TIGR02211 104 VAMPLLIGKKSKKEAKERAYEMLEKVGLEHRINHRPSELSGGERQRVAIARALVNQPSLVLADEPTGNLDNNNAKSIFEL 183 (221) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEECCCCCCHHHHHHHHHHHHH T ss_conf 77777535899889999999998860733445557773456337999999986189765861298853237779999999 Q ss_pred HHHHCCCCCCCCHHHHHHH Q ss_conf 4320012344201234566 Q gi|254780546|r 110 ISEHAKTINATLIMTAHTF 128 (423) Q Consensus 110 ~~~~~~~~~~~~l~~ah~~ 128 (423) +.+.-++.+...++..|.. T Consensus 184 ~~eLN~~~~TsflvVTHD~ 202 (221) T TIGR02211 184 MLELNRELNTSFLVVTHDL 202 (221) T ss_pred HHHHHHHCCCEEEEEECCH T ss_conf 9998865391699983475 No 251 >PRK09108 type III secretion system protein HrcU; Validated Probab=20.75 E-value=47 Score=12.33 Aligned_cols=24 Identities=13% Similarity=0.093 Sum_probs=11.0 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCH Q ss_conf 99998998189320266786422101 Q gi|254780546|r 176 DIRSFLLQRNISWCEDPSNTDDRFER 201 (423) Q Consensus 176 ~l~~~~~~~~i~wveDpSN~d~~f~R 201 (423) -+....-+.|..|- +....++|+| T Consensus 99 ~i~~~~~Q~G~~fs--~k~l~pk~~r 122 (354) T PRK09108 99 GILGQAPQTGLNIS--LKPVMPKFDS 122 (354) T ss_pred HHHHHHHCCCCCCC--HHHCCCCHHC T ss_conf 99999840365405--1413676020 No 252 >COG1660 Predicted P-loop-containing kinase [General function prediction only] Probab=20.73 E-value=47 Score=12.32 Aligned_cols=15 Identities=33% Similarity=0.543 Sum_probs=11.8 Q ss_pred EEEEECCCHHHHHHH Q ss_conf 999966942489999 Q gi|254780546|r 22 ILVAVSGGSDSMGLL 36 (423) Q Consensus 22 i~vAvSGG~DS~aLl 36 (423) |+-++||-.-|+||= T Consensus 5 IVTGlSGAGKsvAl~ 19 (286) T COG1660 5 IVTGLSGAGKSVALR 19 (286) T ss_pred EEECCCCCCHHHHHH T ss_conf 995688876889999 No 253 >TIGR03588 PseC UDP-4-keto-6-deoxy-N-acetylglucosamine 4-aminotransferase. This family of enzymes are aminotransferases of the pfam01041 family involved in the biosynthesis of pseudaminic acid. They convert UDP-4-keto-6-deoxy-N-acetylglucosamine into UDP-4-amino-4,6-dideoxy-N-acetylgalactose. Pseudaminic acid has a role in surface polysaccharide in Pseudomonas as well as in the modification of flagellin in Campylobacter and Helicobacter species. Probab=20.68 E-value=47 Score=12.32 Aligned_cols=25 Identities=16% Similarity=0.182 Sum_probs=18.8 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 8999998998189320266786422 Q gi|254780546|r 174 REDIRSFLLQRNISWCEDPSNTDDR 198 (423) Q Consensus 174 r~~l~~~~~~~~i~wveDpSN~d~~ 198 (423) =++|.++|+.+|++.|||.+-.--. T Consensus 137 ~~~i~~ia~~~~i~lIEDaA~a~Ga 161 (380) T TIGR03588 137 MQAIAALAKKHGLKIIEDASHALGA 161 (380) T ss_pred HHHHHHHHHHCCCEEEEECCCHHCC T ss_conf 9999999998698899988001157 No 254 >cd07985 LPLAT_GPAT Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: GPAT. Lysophospholipid acyltransferase (LPLAT) superfamily member: glycerol-3-phosphate 1-acyltransferase (GPAT, PlsB). LPLATs are acyltransferases of de novo and remodeling pathways of glycerophospholipid biosynthesis which catalyze the incorporation of an acyl group from either acylCoAs or acyl-acyl carrier proteins (acylACPs) into acceptors such as glycerol 3-phosphate, dihydroxyacetone phosphate or lyso-phosphatidic acid. This subgroup includes glycerol-3-phosphate 1-acyltransferase (GPAT, PlsB). Probab=20.30 E-value=48 Score=12.26 Aligned_cols=13 Identities=8% Similarity=0.256 Sum_probs=5.5 Q ss_pred CCCCHHHHHHHHH Q ss_conf 4420123456665 Q gi|254780546|r 118 NATLIMTAHTFDD 130 (423) Q Consensus 118 ~~~~l~~ah~~dD 130 (423) |...++++-|+.. T Consensus 21 G~NVvllsNHQtE 33 (235) T cd07985 21 GHNVVLLANHQTE 33 (235) T ss_pred CCCEEEEECCCCC T ss_conf 9968997177675 No 255 >PRK12721 secretion system apparatus protein SsaU; Reviewed Probab=20.25 E-value=48 Score=12.26 Aligned_cols=23 Identities=17% Similarity=0.107 Sum_probs=11.9 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCH Q ss_conf 9998998189320266786422101 Q gi|254780546|r 177 IRSFLLQRNISWCEDPSNTDDRFER 201 (423) Q Consensus 177 l~~~~~~~~i~wveDpSN~d~~f~R 201 (423) +....-..|..|- +...-++|+| T Consensus 98 i~~~~~q~G~~fs--~k~l~pk~~r 120 (349) T PRK12721 98 VGSVILQVGPVLA--TKAIGFKGEK 120 (349) T ss_pred HHHHHHHCCCEEC--HHHCCCCHHH T ss_conf 9999973063507--1414876654 No 256 >PRK13109 flhB flagellar biosynthesis protein FlhB; Reviewed Probab=20.24 E-value=48 Score=12.25 Aligned_cols=21 Identities=19% Similarity=0.262 Sum_probs=9.0 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCH Q ss_conf 98998189320266786422101 Q gi|254780546|r 179 SFLLQRNISWCEDPSNTDDRFER 201 (423) Q Consensus 179 ~~~~~~~i~wveDpSN~d~~f~R 201 (423) ...-+.|..|- +...-|+|+| T Consensus 109 ~~~~Q~G~~fs--~k~l~Pk~~k 129 (358) T PRK13109 109 ASLLQNLPRFV--LDRIQPKWSR 129 (358) T ss_pred HHHHHCCCEEC--CCCCCCCHHH T ss_conf 99860140305--4126887676 No 257 >cd00458 SugarP_isomerase SugarP_isomerase: Sugar Phosphate Isomerase family; includes type A ribose 5-phosphate isomerase (RPI_A), glucosamine-6-phosphate (GlcN6P) deaminase, and 6-phosphogluconolactonase (6PGL). RPI catalyzes the reversible conversion of ribose-5-phosphate to ribulose 5-phosphate, the first step of the non-oxidative branch of the pentose phosphate pathway. GlcN6P deaminase catalyzes the reversible conversion of GlcN6P to D-fructose-6-phosphate (Fru6P) and ammonium, the last step of the metabolic pathway of N-acetyl-D-glucosamine-6-phosphate. 6PGL converts 6-phosphoglucono-1,5-lactone to 6-phosphogluconate, the second step of the oxidative phase of the pentose phosphate pathway. Probab=20.02 E-value=49 Score=12.22 Aligned_cols=101 Identities=12% Similarity=0.007 Sum_probs=57.1 Q ss_pred HHHH-HHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCC-CHHH--HHHHHHHHHHHCCCCEE Q ss_conf 9999-852799839999669424899999999999864899729999996677987-8468--99999999987189889 Q gi|254780546|r 10 RFFV-RSLVYPAHILVAVSGGSDSMGLLIALHSVLSDRSFGKIKFSAISVDHCLRE-TAKD--EVRYVSDVCSRLRIAHS 85 (423) Q Consensus 10 ~~~~-~~l~~~~~i~vAvSGG~DS~aLl~ll~~~~~~~~~~~~~l~a~~vdHglr~-~s~~--e~~~v~~~~~~lgi~~~ 85 (423) ..++ +-++.+.++++++|||+--..+-..|.+..+...+ -+++.+.+|--.-+ +..+ ....-+.+....+||.. T Consensus 9 a~~i~k~i~~K~~~vLGLatGSTP~~~Y~~Li~~~~~~sf--~nV~tFnlDEY~~l~dh~qSy~~~m~e~lf~hIdI~~e 86 (169) T cd00458 9 EDKXEKLLEEKDDMVIGLGTGSTPAYFYKLLGEKLKRGEI--SDIVGFPTDERYVPLDSDQSNFRQAKLLAFEHDIIPAS 86 (169) T ss_pred HHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHCCCC--CEEEEEEEHCCCCCCCCCCCHHHHHHHHHHCCCCCCHH T ss_conf 9999999960778499957998879999999999744780--13899840000479876310999999985412699999 Q ss_pred EEEEECCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 999750478778037888545555432001 Q gi|254780546|r 86 VVSWKNSKPQTGLMAAAREARYALISEHAK 115 (423) Q Consensus 86 ~~~~~~~~~~~~~~~~ar~~r~~~~~~~~~ 115 (423) -+... ..+...+. |.+..|..+...+. T Consensus 87 NIh~p--dG~~p~~~-aac~~ye~~i~~~g 113 (169) T cd00458 87 NVHYV--DTSLPIEK-ACEKYEREILDQVD 113 (169) T ss_pred HCCCC--CCCCCCHH-HHHHHHHHHHHHCC T ss_conf 92589--99999789-99999999998659 Done!