Query gi|254780999|ref|YP_003065412.1| NAD synthetase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 562 No_of_seqs 265 out of 3977 Neff 6.9 Searched_HMMs 39220 Date Mon May 30 03:13:49 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780999.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK13981 NAD synthetase; Provi 100.0 0 0 1282.3 51.8 542 5-556 1-543 (543) 2 PRK02628 nadE NAD synthetase; 100.0 0 0 1122.5 50.1 557 1-561 7-675 (678) 3 KOG2303 consensus 100.0 0 0 586.1 29.1 535 1-546 1-665 (706) 4 PRK13980 NAD synthetase; Provi 100.0 0 0 583.6 21.6 255 271-545 7-264 (264) 5 cd00553 NAD_synthase NAD+ synt 100.0 0 0 542.6 21.0 245 271-529 2-248 (248) 6 pfam02540 NAD_synthase NAD syn 100.0 0 0 526.1 18.9 238 277-533 3-243 (243) 7 PTZ00323 NAD+ synthase; Provis 100.0 0 0 513.9 20.1 248 270-533 25-285 (294) 8 COG0171 NadE NAD synthase [Coe 100.0 0 0 504.5 20.8 253 270-544 3-267 (268) 9 PRK00768 nadE NAD synthetase; 100.0 0 0 472.5 17.7 235 273-537 21-269 (274) 10 cd07570 GAT_Gln-NAD-synth Glut 100.0 0 0 458.3 28.6 247 6-254 1-247 (261) 11 PRK00876 nadE NAD synthetase; 100.0 0 0 457.6 20.0 241 273-531 13-308 (325) 12 cd07586 nitrilase_8 Uncharacte 100.0 0 0 404.8 25.2 246 6-255 1-252 (269) 13 cd07576 R-amidase_like Pseudom 100.0 0 0 391.6 24.2 240 6-253 1-241 (254) 14 cd07585 nitrilase_7 Uncharacte 100.0 0 0 385.1 24.4 241 6-252 1-243 (261) 15 cd07583 nitrilase_5 Uncharacte 100.0 0 0 381.9 23.7 239 6-252 1-241 (253) 16 cd07584 nitrilase_6 Uncharacte 100.0 0 0 380.6 24.6 241 6-253 1-246 (258) 17 TIGR03381 agmatine_aguB N-carb 100.0 0 0 377.2 26.1 246 5-254 1-262 (279) 18 cd07573 CPA N-carbamoylputresc 100.0 0 0 375.6 24.9 246 5-254 1-265 (284) 19 cd07580 nitrilase_2 Uncharacte 100.0 0 0 370.9 24.7 243 6-253 1-251 (268) 20 cd07568 ML_beta-AS_like mammal 100.0 0 0 369.6 25.2 250 2-253 1-266 (287) 21 cd07581 nitrilase_3 Uncharacte 100.0 0 0 367.3 24.3 239 7-252 1-243 (255) 22 cd07572 nit Nit1, Nit 2, and r 100.0 0 0 370.2 21.7 243 6-252 1-253 (265) 23 cd07197 nitrilase Nitrilase su 100.0 0 0 360.5 26.2 238 7-251 1-240 (253) 24 cd07577 Ph0642_like Pyrococcus 100.0 0 0 362.6 24.3 234 6-252 1-241 (259) 25 cd07569 DCase N-carbamyl-D-ami 100.0 0 0 360.1 25.3 252 2-254 1-282 (302) 26 PRK10438 hypothetical protein; 100.0 0 0 353.6 24.4 243 2-258 1-245 (256) 27 cd07578 nitrilase_1_R1 First n 100.0 0 0 353.8 24.2 239 5-253 1-243 (258) 28 cd07575 Xc-1258_like Xanthomon 100.0 0 0 351.5 24.5 236 5-253 1-238 (252) 29 TIGR00552 nadE NAD+ synthetase 100.0 0 0 355.0 18.9 262 271-544 5-284 (286) 30 cd07564 nitrilases_CHs Nitrila 100.0 0 0 342.8 24.1 244 5-254 1-276 (297) 31 COG0388 Predicted amidohydrola 100.0 0 0 339.2 26.0 250 3-256 1-255 (274) 32 cd07565 aliphatic_amidase alip 100.0 0 0 328.8 22.7 238 5-251 1-251 (291) 33 cd07574 nitrilase_Rim1_like Un 100.0 0 0 321.1 27.1 245 5-254 1-265 (280) 34 cd07579 nitrilase_1_R2 Second 100.0 1.4E-45 0 318.0 21.6 233 6-251 1-254 (279) 35 cd07582 nitrilase_4 Uncharacte 100.0 4.5E-44 0 307.5 24.7 243 5-251 6-277 (294) 36 pfam00795 CN_hydrolase Carbon- 100.0 1E-40 2.8E-45 285.5 19.3 170 6-176 1-172 (172) 37 cd07587 ML_beta-AS mammalian-l 100.0 1.9E-36 4.9E-41 257.6 20.6 248 3-252 62-341 (363) 38 cd07566 ScNTA1_like Saccharomy 100.0 5.4E-36 1.4E-40 254.6 20.2 220 6-229 1-270 (295) 39 KOG0807 consensus 100.0 2.2E-33 5.7E-38 237.5 12.4 239 5-250 16-270 (295) 40 cd07571 ALP_N-acyl_transferase 100.0 1.7E-29 4.3E-34 212.1 24.5 224 5-249 1-251 (270) 41 KOG0806 consensus 100.0 6.4E-32 1.6E-36 228.0 11.9 248 2-254 11-277 (298) 42 PRK13287 amiF formamidase; Pro 100.0 1.1E-29 2.7E-34 213.5 21.8 268 3-304 12-291 (333) 43 PRK00302 lnt apolipoprotein N- 100.0 1.1E-25 2.8E-30 187.2 23.0 224 3-248 224-475 (509) 44 PRK13286 amiE acylamide amidoh 99.9 6.2E-25 1.6E-29 182.3 17.7 234 3-249 11-262 (343) 45 KOG0805 consensus 99.9 5.7E-24 1.4E-28 176.0 20.2 238 2-249 15-292 (337) 46 TIGR00546 lnt apolipoprotein N 99.9 1.3E-22 3.2E-27 167.2 15.6 214 2-229 177-441 (441) 47 COG0815 Lnt Apolipoprotein N-a 99.9 1E-21 2.5E-26 161.3 19.6 224 4-248 227-481 (518) 48 cd07567 biotinidase_like bioti 99.9 1.1E-20 2.8E-25 154.5 18.8 231 7-247 3-275 (299) 49 PRK12291 apolipoprotein N-acyl 99.9 3.7E-20 9.4E-25 151.0 19.2 197 3-227 195-415 (420) 50 KOG0808 consensus 99.9 1.5E-20 3.7E-25 153.7 12.5 244 4-251 73-350 (387) 51 cd01990 Alpha_ANH_like_I This 99.5 3.4E-14 8.7E-19 112.0 8.5 161 295-484 1-173 (202) 52 cd01998 tRNA_Me_trans tRNA met 99.4 2.9E-12 7.3E-17 99.5 9.7 147 294-445 1-180 (349) 53 PRK00143 trmU tRNA (5-methylam 99.4 3.2E-12 8.1E-17 99.1 9.7 150 291-445 3-184 (355) 54 COG1606 ATP-utilizing enzymes 99.4 4.8E-12 1.2E-16 97.9 10.2 150 282-446 9-164 (269) 55 pfam03054 tRNA_Me_trans tRNA m 99.3 9.4E-12 2.4E-16 96.1 9.5 147 294-445 2-182 (354) 56 PRK00074 guaA GMP synthase; Re 99.3 8.6E-11 2.2E-15 89.8 11.3 162 289-472 213-393 (513) 57 PRK00919 GMP synthase subunit 99.3 1.1E-10 2.7E-15 89.2 11.0 169 278-472 9-188 (306) 58 TIGR00268 TIGR00268 conserved 99.2 4.4E-11 1.1E-15 91.7 7.5 213 283-523 5-236 (263) 59 COG0482 TrmU Predicted tRNA(5- 99.2 2.5E-10 6.4E-15 86.7 9.6 150 291-445 2-182 (356) 60 TIGR00884 guaA_Cterm GMP synth 99.1 7.5E-10 1.9E-14 83.6 11.0 162 289-472 13-198 (319) 61 cd01997 GMP_synthase_C The C-t 99.1 3.8E-10 9.6E-15 85.6 9.1 157 294-472 1-176 (295) 62 TIGR00420 trmU tRNA (5-methyla 99.1 5.1E-10 1.3E-14 84.7 9.6 146 293-445 1-198 (394) 63 TIGR00364 TIGR00364 exsB prote 99.0 3.5E-09 9E-14 79.2 8.8 150 295-446 1-190 (227) 64 KOG2805 consensus 99.0 2.6E-09 6.6E-14 80.1 8.0 150 291-445 4-190 (377) 65 COG0519 GuaA GMP synthase, PP- 98.8 4.6E-08 1.2E-12 71.9 10.5 187 288-499 17-227 (315) 66 PTZ00077 asparagine synthetase 98.6 4.5E-07 1.2E-11 65.4 9.9 137 273-418 220-399 (610) 67 COG0603 Predicted PP-loop supe 98.6 3.8E-07 9.8E-12 65.9 9.5 150 292-445 2-177 (222) 68 PRK13825 conjugal transfer pro 98.6 5.2E-06 1.3E-10 58.5 15.0 160 31-205 217-386 (389) 69 PRK11106 queuosine biosynthesi 98.5 2.2E-06 5.6E-11 61.0 10.4 150 292-445 1-176 (231) 70 cd01712 ThiI ThiI is required 98.4 3.9E-06 1E-10 59.3 10.0 141 294-446 1-156 (177) 71 pfam02568 ThiI Thiamine biosyn 98.4 4.4E-06 1.1E-10 59.0 10.0 144 293-447 4-162 (197) 72 pfam00733 Asn_synthase Asparag 98.4 8.8E-06 2.2E-10 57.0 11.1 124 281-415 8-134 (195) 73 TIGR03104 trio_amidotrans aspa 98.3 4.9E-06 1.2E-10 58.7 9.3 132 276-417 246-382 (589) 74 COG0037 MesJ tRNA(Ile)-lysidin 98.3 8.7E-06 2.2E-10 57.0 10.3 143 293-445 22-185 (298) 75 KOG1622 consensus 98.3 1.2E-06 3E-11 62.7 5.7 156 288-471 226-415 (552) 76 pfam07186 consensus 98.3 8.6E-05 2.2E-09 50.5 14.9 159 26-205 219-393 (396) 77 TIGR03108 eps_aminotran_1 exos 98.3 7.2E-06 1.8E-10 57.6 9.3 136 274-418 242-377 (628) 78 pfam06508 ExsB ExsB. This fami 98.3 1.4E-06 3.6E-11 62.2 4.9 65 294-360 1-65 (137) 79 cd01991 Asn_Synthase_B_C The C 98.2 7.2E-06 1.8E-10 57.6 8.3 126 282-417 7-135 (269) 80 KOG0571 consensus 98.2 1.7E-05 4.3E-10 55.1 9.8 149 276-439 211-368 (543) 81 PRK09431 asnB asparagine synth 98.2 1.9E-05 4.9E-10 54.8 9.1 135 273-416 210-359 (555) 82 TIGR01536 asn_synth_AEB aspara 98.1 1.6E-05 4E-10 55.4 7.6 122 293-420 312-454 (646) 83 TIGR03573 WbuX N-acetyl sugar 98.1 1E-05 2.6E-10 56.6 6.5 65 294-359 61-125 (343) 84 COG0367 AsnB Asparagine syntha 98.1 2.9E-05 7.3E-10 53.6 8.7 158 275-445 211-376 (542) 85 cd01993 Alpha_ANH_like_II This 98.0 2.4E-05 6.1E-10 54.2 6.9 145 294-445 1-171 (185) 86 pfam01171 ATP_bind_3 PP-loop f 98.0 2.8E-05 7.2E-10 53.7 7.1 137 294-445 1-160 (182) 87 PRK01269 thiamine biosynthesis 97.9 6.4E-05 1.6E-09 51.4 7.4 185 293-494 178-377 (483) 88 PRK01565 thiamine biosynthesis 97.9 0.00017 4.4E-09 48.6 9.4 144 293-446 177-332 (399) 89 cd01996 Alpha_ANH_like_III Thi 97.8 4E-05 1E-09 52.7 5.7 64 294-359 3-66 (154) 90 cd01713 PAPS_reductase This do 97.8 0.00015 3.9E-09 48.9 7.9 149 294-445 1-158 (173) 91 PRK13795 hypothetical protein; 97.8 0.00045 1.1E-08 45.9 10.2 155 282-445 236-399 (630) 92 cd01995 ExsB ExsB is a transcr 97.7 0.0003 7.8E-09 46.9 8.4 52 294-347 1-52 (169) 93 cd01992 PP-ATPase N-terminal d 97.7 0.00019 4.9E-09 48.2 7.3 138 294-445 1-160 (185) 94 PRK08349 hypothetical protein; 97.6 0.00057 1.4E-08 45.2 9.2 144 294-447 2-159 (198) 95 PRK13794 hypothetical protein; 97.6 0.00051 1.3E-08 45.5 8.1 75 280-356 234-308 (473) 96 PRK00509 argininosuccinate syn 97.4 0.0089 2.3E-07 37.4 12.6 179 292-479 2-216 (398) 97 PRK08576 hypothetical protein; 97.3 0.00082 2.1E-08 44.1 6.5 72 283-356 226-297 (439) 98 PRK08557 hypothetical protein; 97.3 0.0015 3.8E-08 42.5 7.4 73 281-355 170-244 (420) 99 PRK10696 C32 tRNA thiolase; Pr 97.2 0.0014 3.5E-08 42.6 6.5 162 275-445 22-204 (311) 100 PRK04527 argininosuccinate syn 97.1 0.017 4.4E-07 35.4 11.6 130 292-433 3-148 (397) 101 pfam01507 PAPS_reduct Phosphoa 97.1 0.0064 1.6E-07 38.3 8.9 143 294-445 1-152 (174) 102 PRK13820 argininosuccinate syn 97.1 0.0015 3.9E-08 42.3 5.7 146 291-448 2-161 (395) 103 COG0301 ThiI Thiamine biosynth 97.0 0.024 6.1E-07 34.5 11.8 168 293-476 176-357 (383) 104 PRK10660 tilS tRNA(Ile)-lysidi 97.0 0.0023 5.7E-08 41.2 6.0 62 293-355 14-81 (433) 105 cd01986 Alpha_ANH_like Adenine 96.8 0.0073 1.9E-07 37.9 7.5 47 295-345 1-47 (103) 106 PRK05370 argininosuccinate syn 96.8 0.018 4.6E-07 35.3 9.2 19 293-311 243-262 (447) 107 TIGR02432 lysidine_TilS_N tRNA 96.5 0.0074 1.9E-07 37.9 5.8 138 294-445 1-179 (204) 108 COG1365 Predicted ATPase (PP-l 96.5 0.013 3.4E-07 36.2 7.0 135 293-445 61-201 (255) 109 TIGR00342 TIGR00342 thiamine b 96.1 0.044 1.1E-06 32.8 7.9 140 293-446 187-345 (391) 110 TIGR00032 argG argininosuccina 95.9 0.07 1.8E-06 31.5 8.3 111 294-405 1-119 (420) 111 cd01999 Argininosuccinate_Synt 95.8 0.16 4E-06 29.2 12.5 132 295-432 1-144 (385) 112 pfam00764 Arginosuc_synth Argi 95.5 0.2 5.2E-06 28.4 11.6 57 297-356 2-59 (389) 113 cd01994 Alpha_ANH_like_IV This 95.1 0.17 4.4E-06 28.9 7.8 62 294-356 1-67 (194) 114 COG0175 CysH 3'-phosphoadenosi 94.8 0.053 1.4E-06 32.3 4.6 145 294-445 41-193 (261) 115 COG0137 ArgG Argininosuccinate 94.6 0.11 2.9E-06 30.1 5.8 40 192-231 105-145 (403) 116 pfam01902 ATP_bind_4 ATP-bindi 93.4 0.31 8E-06 27.2 6.3 61 294-356 2-67 (219) 117 PRK08384 thiamine biosynthesis 93.3 0.46 1.2E-05 26.1 6.9 122 293-445 180-307 (310) 118 KOG0573 consensus 90.8 0.31 7.9E-06 27.3 3.6 29 292-320 250-278 (520) 119 cd01984 AANH_like Adenine nucl 90.4 0.18 4.7E-06 28.7 2.2 33 295-327 1-34 (86) 120 PRK06850 hypothetical protein; 90.0 0.22 5.6E-06 28.3 2.3 29 57-85 66-94 (488) 121 TIGR03183 DNA_S_dndC putative 89.6 0.25 6.3E-06 27.9 2.3 31 56-86 57-87 (447) 122 pfam04273 DUF442 Putative phos 88.9 1.7 4.3E-05 22.4 6.9 94 297-397 9-108 (110) 123 COG3969 Predicted phosphoadeno 88.5 0.79 2E-05 24.6 4.3 11 510-520 387-397 (407) 124 TIGR01390 CycNucDiestase 2',3' 87.4 2.1 5.4E-05 21.8 6.0 220 23-262 31-303 (634) 125 TIGR01214 rmlD dTDP-4-dehydror 87.3 1.2 3E-05 23.5 4.6 68 143-218 35-116 (317) 126 COG2117 Predicted subunit of t 87.1 1.5 3.7E-05 22.9 4.9 60 295-358 3-62 (198) 127 PRK12563 sulfate adenylyltrans 84.5 2 5E-05 22.0 4.5 146 290-445 35-212 (312) 128 PRK05253 sulfate adenylyltrans 80.2 3.5 8.8E-05 20.4 4.5 147 290-445 24-201 (300) 129 COG2102 Predicted ATPases of P 79.7 4.4 0.00011 19.7 7.5 63 294-357 2-69 (223) 130 KOG2840 consensus 79.2 2.7 6.9E-05 21.1 3.7 67 293-360 52-125 (347) 131 COG1091 RfbD dTDP-4-dehydrorha 78.5 4.8 0.00012 19.5 5.1 64 151-220 34-108 (281) 132 TIGR02858 spore_III_AA stage I 70.2 3.9 1E-04 20.1 2.6 55 474-529 206-263 (282) 133 PRK09453 phosphodiesterase; Pr 66.4 6.4 0.00016 18.7 3.0 43 5-54 1-43 (183) 134 TIGR02124 hypE hydrogenase exp 64.9 9.3 0.00024 17.6 3.7 93 319-435 220-316 (345) 135 TIGR01188 drrA daunorubicin re 64.3 9.9 0.00025 17.4 4.1 35 282-348 166-201 (343) 136 PRK02090 phosphoadenosine phos 63.2 10 0.00027 17.3 10.0 143 293-445 42-192 (243) 137 TIGR01405 polC_Gram_pos DNA po 59.1 5.7 0.00014 19.0 1.7 175 281-470 674-875 (1264) 138 pfam02273 Acyl_transf_2 Acyl t 58.8 12 0.00032 16.8 3.6 69 282-360 90-158 (294) 139 TIGR01361 DAHP_synth_Bsub phos 57.9 13 0.00033 16.7 6.3 69 135-203 11-93 (262) 140 PRK13210 putative L-xylulose 5 57.7 13 0.00033 16.7 8.6 12 191-202 57-68 (284) 141 KOG1706 consensus 56.6 13 0.00034 16.5 4.3 19 24-45 19-37 (412) 142 pfam09987 DUF2226 Uncharacteri 56.3 14 0.00035 16.5 6.5 28 89-116 36-63 (252) 143 KOG0259 consensus 56.3 14 0.00035 16.5 3.6 70 160-245 192-266 (447) 144 COG3453 Uncharacterized protei 55.6 14 0.00036 16.4 6.3 108 296-415 9-123 (130) 145 PRK13946 shikimate kinase; Pro 52.5 12 0.0003 16.9 2.4 82 313-406 15-97 (195) 146 TIGR02039 CysD sulfate adenyly 52.0 11 0.00028 17.2 2.2 64 289-354 16-83 (295) 147 PRK09987 dTDP-4-dehydrorhamnos 51.6 16 0.00041 16.0 4.7 65 150-220 37-112 (299) 148 COG4646 DNA methylase [Transcr 51.1 15 0.00037 16.3 2.7 15 525-539 588-602 (637) 149 TIGR01980 sufB FeS assembly pr 50.9 14 0.00035 16.5 2.5 33 213-245 183-227 (469) 150 cd03819 GT1_WavL_like This fam 49.6 17 0.00044 15.8 3.2 15 159-173 70-84 (355) 151 TIGR02346 chap_CCT_theta T-com 49.3 18 0.00045 15.8 2.9 39 300-349 295-333 (554) 152 PRK13604 luxD acyl transferase 47.8 18 0.00047 15.6 3.7 69 282-360 95-163 (305) 153 TIGR02715 amido_AtzE amidohydr 47.4 5.3 0.00014 19.2 0.0 23 303-327 268-292 (454) 154 PRK13209 L-xylulose 5-phosphat 47.4 19 0.00048 15.6 8.6 17 336-352 167-183 (283) 155 TIGR01455 glmM phosphoglucosam 45.8 20 0.00051 15.4 4.1 91 126-230 159-260 (450) 156 PRK09982 universal stress prot 44.9 9.7 0.00025 17.5 1.0 31 390-420 69-103 (142) 157 PRK07106 5-aminoimidazole-4-ca 44.7 21 0.00052 15.3 4.2 23 156-178 125-147 (391) 158 PRK03367 consensus 44.0 21 0.00053 15.3 2.6 36 1-43 1-37 (329) 159 cd03812 GT1_CapH_like This fam 44.0 21 0.00054 15.3 4.8 12 338-349 237-248 (358) 160 cd03823 GT1_ExpE7_like This fa 43.3 22 0.00055 15.2 3.3 12 162-173 91-102 (359) 161 TIGR01182 eda 2-dehydro-3-deox 43.0 22 0.00056 15.2 4.2 16 30-45 74-89 (205) 162 TIGR02778 ligD_pol DNA polymer 42.7 8.6 0.00022 17.8 0.5 60 273-337 144-207 (251) 163 COG2176 PolC DNA polymerase II 42.6 19 0.00048 15.6 2.2 32 287-318 878-913 (1444) 164 TIGR03642 cas_csx13 CRISPR-ass 40.8 24 0.0006 15.0 4.9 53 271-323 69-122 (124) 165 COG5482 Uncharacterized conser 40.1 14 0.00035 16.5 1.2 85 321-415 69-153 (229) 166 pfam01553 Acyltransferase Acyl 39.0 25 0.00064 14.8 4.0 10 166-175 90-99 (132) 167 pfam04356 DUF489 Protein of un 38.9 25 0.00064 14.8 2.8 57 339-395 108-168 (190) 168 TIGR02317 prpB methylisocitrat 38.8 25 0.00064 14.8 4.2 39 160-208 168-207 (287) 169 TIGR01196 edd phosphogluconate 38.8 25 0.00064 14.8 2.7 77 287-377 269-347 (614) 170 cd03795 GT1_like_4 This family 38.7 25 0.00065 14.7 4.1 14 69-82 19-32 (357) 171 cd03802 GT1_AviGT4_like This f 38.2 26 0.00066 14.7 3.1 36 5-43 1-42 (335) 172 PRK00232 pdxA 4-hydroxythreoni 38.1 24 0.00062 14.9 2.2 40 1-45 1-42 (334) 173 pfam01261 AP_endonuc_2 Xylose 37.8 26 0.00067 14.7 9.8 23 22-44 62-84 (201) 174 TIGR02776 NHEJ_ligase_prk DNA 37.6 26 0.00067 14.6 2.4 12 398-409 359-370 (645) 175 PRK08639 threonine dehydratase 37.4 27 0.00068 14.6 4.2 48 282-329 293-344 (418) 176 TIGR03436 acidobact_VWFA VWFA- 36.8 27 0.00069 14.6 5.0 34 273-306 141-178 (296) 177 pfam01488 Shikimate_DH Shikima 35.6 19 0.00049 15.5 1.4 56 291-357 12-68 (134) 178 PRK11907 bifunctional 2',3'-cy 35.4 29 0.00073 14.4 6.8 36 22-57 143-179 (810) 179 TIGR01181 dTDP_gluc_dehyt dTDP 35.2 29 0.00073 14.4 2.2 41 159-211 65-105 (340) 180 PRK06707 amidase; Provisional 35.2 21 0.00053 15.3 1.5 15 158-172 173-187 (536) 181 PRK01909 pdxA 4-hydroxythreoni 35.0 29 0.00074 14.4 2.8 40 1-45 1-44 (329) 182 COG1134 TagH ABC-type polysacc 34.4 30 0.00075 14.3 4.9 17 336-352 107-123 (249) 183 PTZ00249 variable surface prot 34.2 15 0.00037 16.3 0.6 23 11-34 19-41 (516) 184 cd02649 nuc_hydro_CeIAG nuc_hy 33.2 31 0.00079 14.2 5.0 46 282-330 107-154 (306) 185 PRK08662 nicotinate phosphorib 33.2 28 0.00072 14.5 1.9 38 275-312 242-279 (343) 186 TIGR02356 adenyl_thiF thiazole 32.7 31 0.0008 14.1 2.5 57 155-226 103-161 (210) 187 cd01714 ETF_beta The electron 32.6 32 0.00081 14.1 5.2 52 295-347 54-105 (202) 188 PRK00218 hypothetical protein; 31.5 33 0.00084 14.0 2.8 58 338-395 113-174 (207) 189 PRK11340 phosphodiesterase Yae 31.4 33 0.00084 14.0 6.7 76 3-90 47-122 (270) 190 COG5322 Predicted dehydrogenas 31.4 33 0.00084 14.0 5.6 22 292-313 168-189 (351) 191 COG2129 Predicted phosphoester 31.4 33 0.00084 14.0 4.0 38 2-45 1-38 (226) 192 cd03220 ABC_KpsT_Wzt ABC_KpsT_ 31.4 33 0.00084 14.0 5.6 14 298-311 142-155 (224) 193 TIGR03569 NeuB_NnaB N-acetylne 31.2 33 0.00085 14.0 5.3 28 18-45 10-37 (329) 194 TIGR00143 hypF [NiFe] hydrogen 30.8 34 0.00086 13.9 3.8 160 197-368 319-504 (799) 195 COG2272 PnbA Carboxylesterase 30.2 35 0.00088 13.9 6.7 87 282-368 167-265 (491) 196 KOG4201 consensus 30.0 3.9 0.0001 20.0 -2.9 54 123-176 111-168 (289) 197 COG3845 ABC-type uncharacteriz 30.0 35 0.00089 13.8 5.4 19 95-113 205-223 (501) 198 KOG1195 consensus 29.8 35 0.0009 13.8 3.6 30 281-310 313-342 (567) 199 TIGR01139 cysK cysteine syntha 29.6 35 0.0009 13.8 2.9 37 292-329 265-302 (312) 200 cd01018 ZntC Metal binding pro 29.6 35 0.0009 13.8 6.2 63 288-354 185-247 (266) 201 pfam04392 ABC_sub_bind ABC tra 28.7 37 0.00094 13.7 3.8 106 296-406 107-217 (292) 202 TIGR00389 glyS_dimeric glycyl- 28.3 26 0.00067 14.6 1.1 12 489-500 478-489 (606) 203 PRK05077 frsA fermentation/res 27.8 38 0.00097 13.6 3.2 37 296-332 196-236 (414) 204 TIGR00421 ubiX_pad polyprenyl 27.6 38 0.00098 13.6 2.2 26 326-351 146-176 (181) 205 KOG0452 consensus 27.5 38 0.00098 13.6 2.4 158 280-444 487-699 (892) 206 TIGR02440 FadJ fatty oxidation 27.4 39 0.00099 13.5 2.6 46 309-354 429-493 (732) 207 KOG1917 consensus 27.1 23 0.00058 15.1 0.5 13 18-30 51-63 (1125) 208 cd03269 ABC_putative_ATPase Th 26.8 40 0.001 13.5 5.8 21 392-412 172-192 (210) 209 PRK02848 consensus 26.6 40 0.001 13.5 2.4 40 1-45 1-45 (341) 210 TIGR02805 exbB2 tonB-system en 26.5 28 0.0007 14.5 0.9 23 295-317 82-110 (139) 211 COG1535 EntB Isochorismate hyd 26.2 41 0.001 13.4 4.4 80 189-297 61-148 (218) 212 COG2876 AroA 3-deoxy-D-arabino 26.1 41 0.001 13.4 3.8 48 157-204 62-114 (286) 213 PRK11107 hybrid sensory histid 26.0 41 0.001 13.4 6.7 131 11-153 360-492 (920) 214 pfam02677 DUF208 Uncharacteriz 26.0 41 0.001 13.4 4.4 62 142-208 82-144 (176) 215 PRK13546 teichoic acids export 25.7 42 0.0011 13.3 6.2 14 298-311 143-156 (264) 216 PRK06102 hypothetical protein; 25.5 37 0.00095 13.6 1.4 15 158-172 106-120 (452) 217 COG1029 FwdB Formylmethanofura 25.3 42 0.0011 13.3 4.0 17 425-441 342-358 (429) 218 cd04863 MtLigD_Pol_like MtLigD 25.2 30 0.00077 14.2 0.9 26 281-306 134-161 (231) 219 pfam04310 MukB MukB N-terminal 25.1 36 0.00093 13.7 1.3 17 297-313 203-219 (227) 220 pfam10042 DUF2278 Uncharacteri 25.1 42 0.0011 13.3 3.9 21 217-237 175-195 (206) 221 PRK13482 DNA integrity scannin 25.1 16 0.0004 16.1 -0.6 54 146-209 70-123 (352) 222 pfam09587 PGA_cap Bacterial ca 25.0 43 0.0011 13.3 3.4 14 158-171 66-79 (237) 223 PRK09856 fructoselysine 3-epim 25.0 43 0.0011 13.3 7.1 13 284-296 232-244 (276) 224 COG5001 Predicted signal trans 24.7 43 0.0011 13.2 3.7 126 315-445 305-442 (663) 225 cd03796 GT1_PIG-A_like This fa 24.1 44 0.0011 13.2 3.7 15 68-82 18-32 (398) 226 TIGR02176 pyruv_ox_red pyruvat 23.7 45 0.0012 13.1 3.1 52 338-389 551-616 (1194) 227 PRK04507 consensus 23.5 46 0.0012 13.1 2.2 36 1-43 1-37 (323) 228 TIGR01515 branching_enzym 1,4- 23.3 38 0.00097 13.6 1.1 126 194-335 60-202 (608) 229 COG0703 AroK Shikimate kinase 23.1 46 0.0012 13.0 2.7 76 319-406 3-79 (172) 230 PRK13545 tagH teichoic acids e 23.1 46 0.0012 13.0 5.1 69 32-111 156-224 (549) 231 PRK13270 treF trehalase; Provi 22.5 48 0.0012 13.0 2.6 19 350-368 346-364 (549) 232 TIGR01136 cysKM cysteine synth 22.5 48 0.0012 13.0 3.1 39 291-329 265-305 (315) 233 cd03811 GT1_WabH_like This fam 22.4 48 0.0012 12.9 3.9 11 40-50 3-13 (353) 234 pfam06500 DUF1100 Alpha/beta h 22.4 48 0.0012 12.9 3.3 37 296-332 192-232 (411) 235 TIGR01513 NAPRTase_put putativ 22.3 48 0.0012 12.9 3.6 72 276-357 291-371 (523) 236 PRK08305 spoVFB dipicolinate s 22.3 48 0.0012 12.9 2.8 38 140-177 3-43 (195) 237 pfam02065 Melibiase Melibiase. 22.2 48 0.0012 12.9 2.0 61 19-82 53-118 (395) 238 pfam05201 GlutR_N Glutamyl-tRN 22.0 30 0.00075 14.3 0.3 18 396-413 35-52 (150) 239 TIGR02538 type_IV_pilB type IV 21.8 34 0.00088 13.9 0.6 51 393-446 414-467 (577) 240 PRK09922 UDP-D-galactose:(gluc 21.8 49 0.0013 12.9 3.2 40 3-45 1-43 (361) 241 cd02648 nuc_hydro_1 NH_1: A su 21.7 49 0.0013 12.9 5.0 20 283-302 141-160 (367) 242 cd01571 NAPRTase_B Nicotinate 21.5 50 0.0013 12.8 3.2 38 275-312 226-263 (302) 243 cd03268 ABC_BcrA_bacitracin_re 21.5 50 0.0013 12.8 5.7 16 297-312 125-140 (208) 244 TIGR01769 GGGP geranylgeranylg 21.5 50 0.0013 12.8 9.8 137 20-173 40-211 (212) 245 TIGR01919 hisA-trpF bifunction 21.5 50 0.0013 12.8 2.5 32 283-315 67-100 (246) 246 cd01534 4RHOD_Repeat_3 Member 21.4 40 0.001 13.5 0.9 21 285-307 74-94 (95) 247 pfam00128 Alpha-amylase Alpha 21.4 50 0.0013 12.8 5.5 67 17-89 1-73 (314) 248 PRK08091 ribulose-phosphate 3- 21.3 50 0.0013 12.8 1.4 37 281-320 174-210 (235) 249 pfam05820 DUF845 Baculovirus p 21.0 51 0.0013 12.8 2.8 42 105-150 41-86 (119) 250 cd03821 GT1_Bme6_like This fam 20.9 51 0.0013 12.7 4.0 15 162-176 107-121 (375) 251 TIGR02871 spore_ylbJ sporulati 20.8 15 0.00037 16.3 -1.4 45 155-209 18-65 (384) 252 cd04148 RGK RGK subfamily. Th 20.8 52 0.0013 12.7 4.1 35 333-367 123-163 (221) 253 TIGR01983 UbiG ubiquinone bios 20.6 52 0.0013 12.7 6.1 111 283-415 76-206 (275) 254 TIGR02713 allophanate_hyd allo 20.2 43 0.0011 13.2 0.9 16 398-413 425-441 (582) 255 TIGR02691 arsC_pI258_fam arsen 20.1 41 0.001 13.4 0.7 19 402-420 3-23 (129) No 1 >PRK13981 NAD synthetase; Provisional Probab=100.00 E-value=0 Score=1282.31 Aligned_cols=542 Identities=51% Similarity=0.857 Sum_probs=517.6 Q ss_pred EEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCEE Q ss_conf 28999963883688999999999999999985998999166444799844541488999999999999999988769789 Q gi|254780999|r 5 LKIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTHDGGAGI 84 (562) Q Consensus 5 mkIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~~~~i~i 84 (562) ||||++|+|+++||+++|+++|++.+++|+++||||+|||||+||||+|+||+++++|++.+.+++++|++.+++ ++.+ T Consensus 1 mkIalaQ~n~~vGD~~~N~~~i~~~i~~A~~~gadlvvfPEL~ltGY~~~DLl~~~~f~~~~~~al~~la~~~~~-~i~v 79 (543) T PRK13981 1 LRIALAQLNPTVGDIAGNAAKIRAAAAEAADAGADLLLFPELFLSGYPPEDLLLRPAFLAACEAALERLAAATAD-GPAV 79 (543) T ss_pred CEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCHHCCCCHHHHHCCHHHHHHHHHHHHHHHHHCCC-CCEE T ss_conf 989998289977799999999999999999879989992740210799577603889999999999999985389-9889 Q ss_pred EEEEEEEECCEEEEEEEEEECCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 99999988898789999996997999887776235543332100377888761476474562011111110168999997 Q gi|254780999|r 85 VVGFPRQDQEGVLNSVVILDAGNIIAVRDKINLPNYSEFHEKRTFISGYSNDPIVFRDIRLGILICEDIWKNSNICKHLK 164 (562) Q Consensus 85 ivG~p~~~~~~~yNsa~vi~~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~~~~~~g~kiGv~ICeDlw~~~~~~~~la 164 (562) ++|+|.+.++++|||+++|++|+|++.|+|+|||||++|||+|||++|....+|+++|+|+|+.||||+|++..|+..++ T Consensus 80 ivG~p~~~~~~lyN~a~vi~~G~i~~~~~K~~LPnY~vFdE~RyF~~g~~~~~f~~~g~~iGv~ICED~W~~~~p~~~l~ 159 (543) T PRK13981 80 LVGHPWREGGKLYNAAALLDGGEILATYFKQDLPNYGVFDEKRYFAPGPEPGPFELKGVRIGVPICEDIWWPEGPAETLA 159 (543) T ss_pred EEEEEEECCCCEEEEEEEEECCEEEEEEECCCCCCCCCHHHHHHCCCCCCCCEEEECCEEEEEEEECCCCCCCCHHHHHH T ss_conf 99647824990479999999998999983211799850203442146897764998999999972001377876899998 Q ss_pred HCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEECCCCEEEEECCCCCCCEEEEE Q ss_conf 14777599315553335807888788999854337046421356887662104864355589859996167776379999 Q gi|254780999|r 165 KQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDELIFDGASFCFDGQQQLAFQMKHFSEQNFMTE 244 (562) Q Consensus 165 ~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~G~ii~~~~~f~e~~~i~d 244 (562) ++|||+|+|||||||+.+|.+.|..+++.++.+++++++|+|+|||||++||||+|||+|++|+++++++.|+|++.++| T Consensus 160 ~~GadliinlsASp~~~gK~~~R~~l~~~~a~~~~~~~iY~N~vGgqdelVFDG~S~v~~~~G~l~~~~~~F~e~~~~vd 239 (543) T PRK13981 160 EAGAELLLVPNASPYHRGKPDLRESVLRARVRETGLPLVYLNQVGGQDELVFDGASFVLNADGELAAQLPAFEEQIAVVD 239 (543) T ss_pred HCCCCEEEECCCCHHHCCCHHHHHHHHHHHHHHCCCCEEEECCCCCCCCEEECCCCEEECCCCCHHHHCCCCCCCEEEEE T ss_conf 65996999817993463887999999999998669738986268998745761864698799867564267652569998 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEE Q ss_conf 98335212365557555554478999888642202035888764224540575215872368999999841495446797 Q gi|254780999|r 245 WHYDQQLSQWNYMSDDSASTMYIPLQEEEADYNACVLSLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTI 324 (562) Q Consensus 245 ~d~d~~~~~~~~~~~~~~~~~~~p~~~~e~~~~Alvlgl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~ 324 (562) ++..... +... ......+.+.++++|+||++|||||++|+||++|||||||||||||||+||++|||++||++| T Consensus 240 ~~~~~~~--~~~~----~~~~~~~~~~~e~i~~alvlglrDy~~k~gf~~vviGLSGGiDSaL~aaiA~dALG~~nV~~v 313 (543) T PRK13981 240 FDRGDDG--WRCV----EGPIAPELPGEEEDYRALVLGLRDYVRKNGFPGVVLGLSGGIDSALVAAIAVDALGAENVRAV 313 (543) T ss_pred EECCCCC--CCCC----CCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHHCHHHEEEE T ss_conf 7546666--6578----876678987189999999999999998608974999767883599999999985383437886 Q ss_pred ECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECC Q ss_conf 32466665221356999999721354267758999998863132346874310110123304667798887516954436 Q gi|254780999|r 325 MLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLT 404 (562) Q Consensus 325 ~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~ 404 (562) +||+++||++|++||+.||++||+++.+|||+++++++.+++++.|.+...|+|+||+|||+||++||++||++|+|||| T Consensus 314 ~MPs~~tS~~s~~dA~~La~~LGi~~~~i~I~~~~~~~~~~l~~~f~~~~~dvt~ENiQAR~R~~iLm~laN~~g~lvl~ 393 (543) T PRK13981 314 MMPSRYTSDESLDDAAALARNLGVRYDIIPIEPAFEAFEAALAPLFAGTEPDITEENLQSRIRGTLLMALSNKFGSLVLT 393 (543) T ss_pred ECCCCCCCHHHHHHHHHHHHHHCCCEEEEEHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEC T ss_conf 45877786656999999999959974875149999999997404416876665223454888999999987338967964 Q ss_pred CCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCC Q ss_conf 43244775411023666555401017763365899999963103322234444668878828898845389997700068 Q gi|254780999|r 405 TSNKSEISVGYGTLYGDMSGGFNPLKDLYKTQVFQLASWRNSHGITSGLGPLTEVIPPSILEKSPSAELRPHQTDQESLP 484 (562) Q Consensus 405 t~nksE~~~Gy~T~~GD~~g~~~p~~~l~Kt~v~~la~~~n~~~~~~~~~~~~~~ip~~i~~~~psaeL~~~Q~ded~l~ 484 (562) ||||||+|+||||+||||+|+|+||+|||||+||+|+||+|++.... +..++||++|++|||||||+|+|+|||+|| T Consensus 394 TgnkSE~avGy~TlyGD~~g~~avi~dv~KT~V~~L~r~~n~~~~~~---~~~~vip~~ii~k~pSaEL~p~Q~ded~L~ 470 (543) T PRK13981 394 TGNKSEMAVGYATLYGDMAGGFAPIKDVYKTLVYRLCRWRNTNHPGG---PDGEVIPERIITKPPSAELRPNQTDQDSLP 470 (543) T ss_pred CCCHHHHHHCHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC---CCCCCCCHHHCCCCCCCCCCCCCCCCCCCC T ss_conf 78672787474653376556743036865999999999986536456---666756187668898977799997688789 Q ss_pred CHHHHHHHHHHHHHCCCCHHHHH-HCCCHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCHHH Q ss_conf 87889999999982799977987-1899999999999998745668738970247264537864331011024 Q gi|254780999|r 485 PYPILDDIIKRIVENEESFINND-QEYNDETVRYVEHLLYGSEYKRRQAPVGTKITAKSFGRDRLYPISNKFR 556 (562) Q Consensus 485 ~Y~~lD~il~~~i~~~~~~~~~~-~~~~~~~v~~~~~~~~~~~~KR~~~pp~~~is~~sfg~~~r~Pi~~~~~ 556 (562) ||++||+||++|++++.++.++. ++|+++.|++|+++|++|||||+|+|||||||+|+||+||||||+++|| T Consensus 471 pYe~lD~iL~~~ie~~~s~~~i~~~g~~~~~v~~~~~~~~~~e~KR~q~ppg~kvs~rafg~d~R~Pi~~~~~ 543 (543) T PRK13981 471 PYDVLDAILERLVEEEQSVAEIVAAGFDRETVRRVERLLYIAEYKRRQAAPGVKITRRAFGRDRRYPITNRFR 543 (543) T ss_pred CHHHHHHHHHHHHHCCCCHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCCEECCCCCCCCCCCCCCCCCC T ss_conf 8789999999999769999999882899999999999999704442008995587888899887776755579 No 2 >PRK02628 nadE NAD synthetase; Reviewed Probab=100.00 E-value=0 Score=1122.49 Aligned_cols=557 Identities=25% Similarity=0.369 Sum_probs=493.1 Q ss_pred CCCC--EEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHH Q ss_conf 9886--28999963883688999999999999999985998999166444799844541488999999999999999988 Q gi|254780999|r 1 MLKK--LKIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTH 78 (562) Q Consensus 1 M~~~--mkIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~ 78 (562) ||+. +|||++|+++++||+++|.++|++.+++|+++||+|+||||||||||+|+|||++++|++.+.++|++|++.++ T Consensus 7 ~~~~gfirvA~a~~~~~VgD~~~N~~~I~~~i~~A~~~ga~lvvfPEL~ltGY~~eDL~l~~~fl~~~~~aL~~l~~~t~ 86 (678) T PRK02628 7 IYRHGFVRVAAATPKVRVADPAFNAERILALAREAHDDGVALAVFPELSLSGYSCGDLFLQDALLDAVEDALAALVEASA 86 (678) T ss_pred HHHCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHCC T ss_conf 46577127999478987538899999999999999988998998255203389889985799999999999999998578 Q ss_pred HCCCEEEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEECCCCCCCCCCCCCCCCCCC-----------------EEE-- Q ss_conf 7697899999998889878999999699799988777623554333210037788876-----------------147-- Q gi|254780999|r 79 DGGAGIVVGFPRQDQEGVLNSVVILDAGNIIAVRDKINLPNYSEFHEKRTFISGYSND-----------------PIV-- 139 (562) Q Consensus 79 ~~~i~iivG~p~~~~~~~yNsa~vi~~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~-----------------~~~-- 139 (562) +.++.++||+|.+.++++|||+++|.+|+|++.|+|++||||++|||+|||+||.... +|+ T Consensus 87 ~~~~~viVG~P~~~~~~LyN~A~vi~~G~Ilg~~~K~~LPNY~vFdE~R~F~~g~~~~~~~i~~~g~~vPfG~dllF~~~ 166 (678) T PRK02628 87 DLDPVLVVGAPLRVRHRLYNCAVVIHRGRILGVVPKSYLPNYREFYEKRWFAPGDGIRGETIRLCGQEVPFGTDLLFEAE 166 (678) T ss_pred CCCEEEEEECEEECCCCEEEEEEEEECCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCEEEEEC T ss_conf 98859999330620895488999999999999970433879976574636777777886168347822045665256541 Q ss_pred -ECCEEEEECCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCC--CCCC Q ss_conf -6474562011111110168999997147775993155533358078887889998543370464213568876--6210 Q gi|254780999|r 140 -FRDIRLGILICEDIWKNSNICKHLKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQD--ELIF 216 (562) Q Consensus 140 -~~g~kiGv~ICeDlw~~~~~~~~la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d--~lvf 216 (562) +++++||+.||||+|.+..|+..++.+||++|+|||||||+.+|.+.|+.+++.+|++++++++|+|.++||+ ++|| T Consensus 167 ~~~g~~~GveICEDlW~~~~P~~~la~~GA~lI~NlSASp~~~gK~~~R~~lv~~~s~~~~~~yvY~n~G~gEsttDlVF 246 (678) T PRK02628 167 DLPGFVFGVEICEDLWVPIPPSSYAALAGATVLANLSASNITVGKADYRRLLVASQSARCLAAYVYAAAGVGESTTDLAW 246 (678) T ss_pred CCCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEEECCCCHHHCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCEEE T ss_conf 66883899997211488998589998749887971688803348699999999999886288569982357876760699 Q ss_pred CCCCEEECCCCEEEEECCCCCCC--EEEEEEEECCCCCCCCC---------------------------------CCCCC Q ss_conf 48643555898599961677763--79999983352123655---------------------------------57555 Q gi|254780999|r 217 DGASFCFDGQQQLAFQMKHFSEQ--NFMTEWHYDQQLSQWNY---------------------------------MSDDS 261 (562) Q Consensus 217 ~G~S~I~d~~G~ii~~~~~f~e~--~~i~d~d~d~~~~~~~~---------------------------------~~~~~ 261 (562) ||+|+|++ +|++++++++|+++ ++++|+|++.....+.. ..... T Consensus 247 dG~s~I~~-nG~lla~~~~F~~~~~l~~adiDl~~l~~~R~~~~sf~~~~~~~~~~~~~~~~v~~~~~~~~~~~~l~r~v 325 (678) T PRK02628 247 DGQTLIYE-NGELLAESERFPREEQLIIADIDLERLRQERLRNGSFDDNRRHHRNESAPFRTIPFALDPPTGDLGLRRPV 325 (678) T ss_pred ECCEEEEC-CCCEEEECCCCCCCCCEEEEEEEHHHHHHHHHHCCCCCCCCHHCCCCCCCCEEEEEECCCCCCCCCCCCCC T ss_conf 68867872-88167533566667636899830777666664047622210001345566336543115664544433544 Q ss_pred CCCCCCCCCH------HHHHHCCHHHHHHHHHHHCCCCEEEEECCCCCHHH---HHHHHHHHCCCH--HHEEEEECCCCC Q ss_conf 5544789998------88642202035888764224540575215872368---999999841495--446797324666 Q gi|254780999|r 262 ASTMYIPLQE------EEADYNACVLSLRDYVQKNNFHKVIIGLSGGIDSA---LCAAIAVDALGK--ENVQTIMLPYKY 330 (562) Q Consensus 262 ~~~~~~p~~~------~e~~~~Alvlgl~d~~~~~g~~~~viglSGGiDSa---l~a~la~~alg~--~~v~~~~mp~~~ 330 (562) ...|++|..+ .+++|+++|+||+||++|+|++++||||||||||| ||++.|+++||. +||+||+||+++ T Consensus 326 ~~~PfvP~~~~~~~~~~~ei~~~qv~GL~dyl~k~g~~~vViGLSGGiDSaLaLLVaa~A~d~Lg~~r~~V~~vtMPs~~ 405 (678) T PRK02628 326 ERFPFVPSDPERLDQRCYEAYNIQVSGLAQRLRATGIKKVVIGISGGLDSTLALLVAAKAFDRLGLPRKNILAYTMPGFG 405 (678) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHHHCCCHHCEEEEECCCCC T ss_conf 66888888533445549999999999999999864998189967776447999999999999848971224899778876 Q ss_pred CCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCC--CCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCH Q ss_conf 6522135699999972135426775899999886313234687--43101101233046677988875169544364324 Q gi|254780999|r 331 TSPQSLEDAAACAKALGCKYDVLPIHDLVNHFFSLMSQFLQEE--PSGIVAENIQSRIRGNILMALSNHSKAMLLTTSNK 408 (562) Q Consensus 331 ~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~~~~~~~--~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nk 408 (562) ||+.|++||++||++||+++.+|+|++++.++++++++.+.+. .+|+|+||+|||+|+++||++||++|+|||+|||| T Consensus 406 TS~~S~~dA~~La~~LGi~~~~I~I~~a~~~~~~di~~~~~~~~~~~Dvt~ENiQAR~R~~iLM~laNk~g~LVL~TGNk 485 (678) T PRK02628 406 TTDRTKNNAVALMKALGVTAREIDIRPAALQMLKDIGHPFARGEKVYDVTFENVQAGERTQILFRLANQRGGLVIGTGDL 485 (678) T ss_pred CCHHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEECCCC T ss_conf 57878999999999729977997629999999998444222378776602665203556789999745779679736851 Q ss_pred HHHHHHHCCC-CCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCCCCCC-------CCCC Q ss_conf 4775411023-6665554010177633658999999631033222344446688788288988453899-------9770 Q gi|254780999|r 409 SEISVGYGTL-YGDMSGGFNPLKDLYKTQVFQLASWRNSHGITSGLGPLTEVIPPSILEKSPSAELRPH-------QTDQ 480 (562) Q Consensus 409 sE~~~Gy~T~-~GD~~g~~~p~~~l~Kt~v~~la~~~n~~~~~~~~~~~~~~ip~~i~~~~psaeL~~~-------Q~de 480 (562) ||+|+||||+ ||||||+|+||+|||||.||.|+||++++.. ..+....|+.+|+++||||||.|. |+|| T Consensus 486 SElAvGy~Ty~yGD~mg~yavn~~VpKTlV~~L~r~~a~~~~---~~~~~~~vl~~Il~tpiSpEL~P~~~~~~i~QktE 562 (678) T PRK02628 486 SELALGWCTYGVGDHMSHYNVNASVPKTLIQHLIRWVAESGQ---FDEAVSEVLLDILDTPISPELVPPDEEGEIVQKTE 562 (678) T ss_pred CHHHHCEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCC---CCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 468648310477541246244678837899999999986244---45311256898546999914279886677887463 Q ss_pred CCCCCHHHHHHHHHHHHHCCCCHHHHH----------------------HCCCHHHHHHHH----HHHHHHHHHHHHCCC Q ss_conf 006887889999999982799977987----------------------189999999999----999874566873897 Q gi|254780999|r 481 ESLPPYPILDDIIKRIVENEESFINND----------------------QEYNDETVRYVE----HLLYGSEYKRRQAPV 534 (562) Q Consensus 481 d~l~~Y~~lD~il~~~i~~~~~~~~~~----------------------~~~~~~~v~~~~----~~~~~~~~KR~~~pp 534 (562) |.|||||+|||||+++++.+.++..+. .+|++++|++|+ ++|++|||||+|+|| T Consensus 563 d~lgPYel~Df~Ly~~lr~g~~p~ki~~la~~af~~~~~~~~~~~~~~~~~y~~~~i~~wl~~f~~Rff~~QfKRs~~P~ 642 (678) T PRK02628 563 DIVGPYELQDFFLYYFLRYGFRPSKIAFLAWHAWKDAERGNWPGPEDKRNAYDLAEIKKWLEVFLRRFFSSQFKRSALPN 642 (678) T ss_pred CCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCHHHCCC T ss_conf 55787577899999999728988999999999740222245665320024789999999999999998872304200899 Q ss_pred CCEECC-CCCCC--CCCCCCCC---HHHHHHHC Q ss_conf 024726-45378--64331011---02455414 Q gi|254780999|r 535 GTKITA-KSFGR--DRLYPISN---KFRDHISE 561 (562) Q Consensus 535 ~~~is~-~sfg~--~~r~Pi~~---~~~~~~~~ 561 (562) ||||+. -|+++ |||||..+ -|.++|++ T Consensus 643 Gpkv~s~~slSPRgdwrmPsd~~~~~wl~el~~ 675 (678) T PRK02628 643 GPKVGSGGSLSPRGDWRAPSDASAALWLDELER 675 (678) T ss_pred CCEECCCCCCCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 987478866387412358764278999999996 No 3 >KOG2303 consensus Probab=100.00 E-value=0 Score=586.15 Aligned_cols=535 Identities=22% Similarity=0.270 Sum_probs=433.2 Q ss_pred CCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHC Q ss_conf 98862899996388368899999999999999998599899916644479984454148899999999999999998876 Q gi|254780999|r 1 MLKKLKIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTHDG 80 (562) Q Consensus 1 M~~~mkIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~~~ 80 (562) |+++++||.+++|.+.-|+++|.++|++.|++|++.||.+-+-|||-+|||.|+|.|++.+..-++++.+.+|...-... T Consensus 1 m~r~vtvAtc~lNqWAlDFegN~~rI~~Si~eAk~~gA~~RlGPELEi~GYgC~DHf~E~Dt~~HswE~l~~lv~~~~~~ 80 (706) T KOG2303 1 MGRKVTVATCTLNQWALDFEGNMQRILKSIEEAKARGARYRLGPELEITGYGCEDHFLESDTLLHSWEMLAELVESPVTQ 80 (706) T ss_pred CCCEEEEEEECHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECCCCEEECCCCHHHHHCCCHHHHHHHHHHHHHHCCCCCC T ss_conf 98358898841114444133329999999999996498462177335147772775224117887999999997497778 Q ss_pred CCEEEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCE----------------------- Q ss_conf 978999999988898789999996997999887776235543332100377888761----------------------- Q gi|254780999|r 81 GAGIVVGFPRQDQEGVLNSVVILDAGNIIAVRDKINLPNYSEFHEKRTFISGYSNDP----------------------- 137 (562) Q Consensus 81 ~i~iivG~p~~~~~~~yNsa~vi~~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~~----------------------- 137 (562) ++.+.+|+|....+..|||.+++-||+|+.+.+|+-|.|.++|.|.|||+++.+... T Consensus 81 ~il~diGmPv~hr~~ryNCrv~~~n~kil~IRpKm~lanDgnyRE~RwFt~W~~~~~~e~y~lP~~i~~~~~Q~tVPfGd 160 (706) T KOG2303 81 DILCDIGMPVMHRNVRYNCRVLFLNRKILLIRPKMWLANDGNYRESRWFTPWTRPRVTEEYQLPRMIQKHTGQETVPFGD 160 (706) T ss_pred CEEEECCCCHHHHHHHHCCCEEECCCEEEEECCCCEECCCCCCHHHCCCCCCCCCCCCCEEECCHHHHHHHCCEEECCCC T ss_conf 74686387122220220402120277189970420333688702100233346655430233668889870880303551 Q ss_pred --EEECCEEEEECCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCC-CC Q ss_conf --476474562011111110168999997147775993155533358078887889998543370464213568876-62 Q gi|254780999|r 138 --IVFRDIRLGILICEDIWKNSNICKHLKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQD-EL 214 (562) Q Consensus 138 --~~~~g~kiGv~ICeDlw~~~~~~~~la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d-~l 214 (562) +.+.+.-+|..|||++|.|..|...|+++|++|++|-|+|++..+|.+.|.+++...+.+.+..++|+||-|+++ .+ T Consensus 161 avl~~~dt~ig~EiCEEL~tp~sphi~mal~GVei~~NaSGShh~LrK~~~r~~li~~at~k~GGvYlyaNqrGCDG~Rl 240 (706) T KOG2303 161 AVLQTWDTCIGSEICEELWTPRSPHIDMALDGVEIITNASGSHHELRKLNTRVDLILNATSKCGGVYLYANQRGCDGDRL 240 (706) T ss_pred EEEEECCCCHHHHHHHHHCCCCCCCHHHHHCCEEEEECCCCCHHHHHHHHHHHHHHHCCHHHCCEEEEEECCCCCCCCEE T ss_conf 13432231130888998738999633344076388851885278886420346877410331662899613677887614 Q ss_pred CCCCCCEEECCCCEEEEECCCCC-CCE--EEEEEEECCCCCCCC-----------------------------CCCCCCC Q ss_conf 10486435558985999616777-637--999998335212365-----------------------------5575555 Q gi|254780999|r 215 IFDGASFCFDGQQQLAFQMKHFS-EQN--FMTEWHYDQQLSQWN-----------------------------YMSDDSA 262 (562) Q Consensus 215 vf~G~S~I~d~~G~ii~~~~~f~-e~~--~i~d~d~d~~~~~~~-----------------------------~~~~~~~ 262 (562) .|||.|+|+ -+|++++|..+|+ +++ .++.+|++...+-+. ....+.. T Consensus 241 YydGca~Ia-~NG~vlAqg~QFsl~DveVv~atvDle~vrsyR~~~~S~~~~as~~~~~~ri~v~~~ls~~~d~~~~~t~ 319 (706) T KOG2303 241 YYDGCAMIA-MNGSVLAQGSQFSLDDVEVVTATVDLEDVRSYRASISSRGLQASRAVKYPRIHVDFELSQHFDLLATPTE 319 (706) T ss_pred EECCHHHEE-ECCEEEEECCCCCCCCEEEEEEEECHHHHHHHHHHHCCCCCCCCCCCCCCEEEECCEECCCCCCCCCCCC T ss_conf 763611145-4561455123033233278999851889888886650244322135776505643253013554468888 Q ss_pred CCCCCCCCHHHHHHCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHH-------HHCC--CHHH------------- Q ss_conf 54478999888642202035888764224540575215872368999999-------8414--9544------------- Q gi|254780999|r 263 STMYIPLQEEEADYNACVLSLRDYVQKNNFHKVIIGLSGGIDSALCAAIA-------VDAL--GKEN------------- 320 (562) Q Consensus 263 ~~~~~p~~~~e~~~~Alvlgl~d~~~~~g~~~~viglSGGiDSal~a~la-------~~al--g~~~------------- 320 (562) .....-.+++|++--.-...||||++++|..|+++.||||+||+-+|+|. ++|+ |.|. T Consensus 320 p~e~~~hsPeeEia~GPacwlWdyLRRs~~aGfflPLSGG~DSsatA~iV~sMC~~V~~av~~g~eqv~~Dvr~i~~~~~ 399 (706) T KOG2303 320 PIEWKYHSPEEEIALGPACWLWDYLRRSGQAGFFLPLSGGVDSSATAAIVYSMCRQVCKAVQSGDEQVLADVRRIVNDIS 399 (706) T ss_pred CCCCCCCCCHHHHCCCCHHHHHHHHHHCCCCCEEEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCC T ss_conf 75545579689860484488999998638874477647886405899999999999999997473654320687714888 Q ss_pred -------------EEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCC----------CCCCCH Q ss_conf -------------679732466665221356999999721354267758999998863132346----------874310 Q gi|254780999|r 321 -------------VQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVNHFFSLMSQFLQ----------EEPSGI 377 (562) Q Consensus 321 -------------v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~~~~~----------~~~~~~ 377 (562) .+...|.+.+||++|.+.|++||.++|..|..+.|+.+|.+..+.+.-..+ ....++ T Consensus 400 ~~p~dp~~l~nri~~TcyMgSenSS~ETr~rak~La~~igs~H~~i~iD~~vsavl~lF~~vtGk~P~f~~~ggsn~enl 479 (706) T KOG2303 400 YTPTDPADLCNRILYTCYMGSENSSKETRRRAKELANQIGSYHIDLNIDTAVSAVLSLFNLVTGKTPRFKSHGGSNRENL 479 (706) T ss_pred CCCCCHHHHHHHHHHHHEECCCCCCHHHHHHHHHHHHHHCCEEEEEEEHHHHHHHHHHHHHHHCCCCCEECCCCCHHHHH T ss_conf 68889899987666542004656648899999999986351024155059999999999887478752115888606667 Q ss_pred HHHCCCCHHHHHHHHHHHHHC--------CCEECCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 110123304667798887516--------954436432447754110236665554010177633658999999631033 Q gi|254780999|r 378 VAENIQSRIRGNILMALSNHS--------KAMLLTTSNKSEISVGYGTLYGDMSGGFNPLKDLYKTQVFQLASWRNSHGI 449 (562) Q Consensus 378 ~~eN~qaR~R~~~L~~~an~~--------~~lvl~t~nksE~~~Gy~T~~GD~~g~~~p~~~l~Kt~v~~la~~~n~~~~ 449 (562) +++|||||+||++-|-+|... |.||||+.|..|...||-|+|.+++.|++|||++.||+++.+.+|.-+++ T Consensus 480 aLQNiQARiRMvLaylfAqL~~wvr~~~GglLVLGSaNVDE~LrGYLTKYDCSSADINPIGgISK~DLr~Fl~~a~~~~- 558 (706) T KOG2303 480 ALQNIQARIRMVLAYLFAQLLLWVRGRPGGLLVLGSANVDESLRGYLTKYDCSSADINPIGGISKTDLRRFLQYAKEKF- 558 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHC- T ss_conf 6777999999999999998778750787736997247603576511101455624667766724878999999999863- Q ss_pred CCCCCCCCCCCCHHHCCCCCCCCCCC------CCCCCCCCC-CHHHHHHHHHHH-HHCCCCH---HHH----HHCCCH-- Q ss_conf 22234444668878828898845389------997700068-878899999999-8279997---798----718999-- Q gi|254780999|r 450 TSGLGPLTEVIPPSILEKSPSAELRP------HQTDQESLP-PYPILDDIIKRI-VENEESF---INN----DQEYND-- 512 (562) Q Consensus 450 ~~~~~~~~~~ip~~i~~~~psaeL~~------~Q~ded~l~-~Y~~lD~il~~~-i~~~~~~---~~~----~~~~~~-- 512 (562) ..++.+.|++.||||||.| .|+||.++| +|+.|--+=+.- +..-.+. ... ...+++ T Consensus 559 -------~lp~L~~il~a~pTAELePl~~g~~~QtDE~dmGmTY~ELsv~GrlRK~~~cGPysMF~kLl~~W~~klsp~q 631 (706) T KOG2303 559 -------GLPALQSILDAPPTAELEPLTDGDYSQTDEADMGMTYAELSVFGRLRKVSKCGPYSMFCKLLHQWGDKLSPRQ 631 (706) T ss_pred -------CCHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCCHHH T ss_conf -------9568999864998302552556762334446417669998776742043366858999999998567589899 Q ss_pred --HHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCC Q ss_conf --9999999999874566873897024726453786 Q gi|254780999|r 513 --ETVRYVEHLLYGSEYKRRQAPVGTKITAKSFGRD 546 (562) Q Consensus 513 --~~v~~~~~~~~~~~~KR~~~pp~~~is~~sfg~~ 546 (562) |.|++|+..+..|.||-- .-.|.+.+-++++| T Consensus 632 vaEKVk~FF~~Y~iNRHKmT--vlTPsyHAE~Yspe 665 (706) T KOG2303 632 VAEKVKRFFSYYSINRHKMT--VLTPSYHAENYSPE 665 (706) T ss_pred HHHHHHHHHHHHEECCCCCE--ECCCCCCCCCCCCC T ss_conf 99999888754122240111--12643002337976 No 4 >PRK13980 NAD synthetase; Provisional Probab=100.00 E-value=0 Score=583.63 Aligned_cols=255 Identities=39% Similarity=0.574 Sum_probs=244.5 Q ss_pred HHHHHHCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCC Q ss_conf 88864220203588876422454057521587236899999984149544679732466665221356999999721354 Q gi|254780999|r 271 EEEADYNACVLSLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKY 350 (562) Q Consensus 271 ~~e~~~~Alvlgl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~ 350 (562) +.+.+.+.++.+||||++|+|++++||||||||||||||+||++|+|++||+|++||+++||+.|.+||+.+|++||+++ T Consensus 7 d~~~~~~~iv~~lrdy~~~~g~kg~VlGlSGGIDSavva~La~~Alg~~~v~~v~mP~~~ss~~s~~dA~~la~~lgi~~ 86 (264) T PRK13980 7 DYEKVREIIVDFIREEVEKAGFKGVVLGLSGGIDSAVVAYLAVKALGKENVLALLMPYRVSPPEDLEDALLVAERLGIEY 86 (264) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCE T ss_conf 89999999999999999980998099979868889999999998549660689989899998878999999999869984 Q ss_pred CCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 26775899999886313234687431011012330466779888751695443643244775411023666555401017 Q gi|254780999|r 351 DVLPIHDLVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSNKSEISVGYGTLYGDMSGGFNPLK 430 (562) Q Consensus 351 ~~i~I~~~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nksE~~~Gy~T~~GD~~g~~~p~~ 430 (562) .+++|+++++++.+.+.. ..+++.+|+|||+||++||++||++|+||+|||||||+++||||+|||++|+++||+ T Consensus 87 ~~i~I~~~~~~~~~~~~~-----~~~~~~~NiqaR~Rm~~Ly~~An~~~~lVlgTgNksE~~~Gy~TkyGD~~~d~~Pi~ 161 (264) T PRK13980 87 KVIEITPIVDAFFSAVPD-----ADRLRRGNIMARTRMVLLYDYANRDNRLVLGTSNKSELLLGYFTKYGDGAVDINPIG 161 (264) T ss_pred EEEECHHHHHHHHHHCCC-----CCHHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCHHHHHHHHCCCCCCCCCCHHHHC T ss_conf 998279999999986310-----006688875898999999998643397765588674798721001687665624525 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHCCCCHHHHHH- Q ss_conf 763365899999963103322234444668878828898845389997700068-878899999999827999779871- Q gi|254780999|r 431 DLYKTQVFQLASWRNSHGITSGLGPLTEVIPPSILEKSPSAELRPHQTDQESLP-PYPILDDIIKRIVENEESFINNDQ- 508 (562) Q Consensus 431 ~l~Kt~v~~la~~~n~~~~~~~~~~~~~~ip~~i~~~~psaeL~~~Q~ded~l~-~Y~~lD~il~~~i~~~~~~~~~~~- 508 (562) |||||+||+||+|+| ||++|++|||||||||+|+|||+|| ||++||.||+++++++.+..++.. T Consensus 162 dL~Kt~V~~La~~l~--------------vP~~Ii~k~PSa~L~~~Q~DE~~LG~~Y~~lD~iL~~~ie~~~~~~~i~~~ 227 (264) T PRK13980 162 DLYKTQVRELARHLG--------------VPEDIIEKPPSADLWEGQTDEDELGFSYEEIDEILYALFDKKMPREEILAQ 227 (264) T ss_pred CCCHHHHHHHHHHHC--------------CCHHHEECCCCCCCCCCCCCHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 873999999999939--------------966440369997767999987771999999999999999769999999987 Q ss_pred -CCCHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCC Q ss_conf -8999999999999987456687389702472645378 Q gi|254780999|r 509 -EYNDETVRYVEHLLYGSEYKRRQAPVGTKITAKSFGR 545 (562) Q Consensus 509 -~~~~~~v~~~~~~~~~~~~KR~~~pp~~~is~~sfg~ 545 (562) +++++.|++|.+++.+|||||+ +||+||||.|+||. T Consensus 228 ~g~~~~~v~~i~~~~~~se~KR~-~Pp~~kis~~~fg~ 264 (264) T PRK13980 228 LGIDEELVDRVRRRVQRSQHKRR-MPPIAKISGRTIGI 264 (264) T ss_pred CCCCHHHHHHHHHHHHHCCCCCC-CCCCCEECCCCCCC T ss_conf 19899999999999985402337-99943151776789 No 5 >cd00553 NAD_synthase NAD+ synthase is a homodimer, which catalyzes the final step in de novo nicotinamide adenine dinucleotide (NAD+) biosynthesis, an amide transfer from either ammonia or glutamine to nicotinic acid adenine dinucleotide (NaAD). The conversion of NaAD to NAD+ occurs via an NAD-adenylate intermediate and requires ATP and Mg2+. The intemediate is subsequently cleaved into NAD+ and AMP. In many prokaryotes, such as E. coli , NAD synthetase consists of a single domain and is strictly ammonia dependent. In contrast, eukaryotes and other prokaryotes have an additional N-terminal amidohydrolase domain that prefer glutamine, Interestingly, NAD+ synthases in these prokaryotes, can also utilize ammonia as an amide source . Probab=100.00 E-value=0 Score=542.56 Aligned_cols=245 Identities=43% Similarity=0.589 Sum_probs=237.8 Q ss_pred HHHHHHCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCC Q ss_conf 88864220203588876422454057521587236899999984149544679732466665221356999999721354 Q gi|254780999|r 271 EEEADYNACVLSLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKY 350 (562) Q Consensus 271 ~~e~~~~Alvlgl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~ 350 (562) +.++++++++.||+||++|+|++++||||||||||||||+||++|+|++||+|++||+++||+++.++|+.+|++||+++ T Consensus 2 ~~~~~~~~lv~~l~~y~~~~g~~~~viGlSGGIDSav~a~La~~Alg~~~v~~v~mP~~~ss~~s~~~A~~la~~lgi~~ 81 (248) T cd00553 2 DLEEIINALVLFLRDYLRKSGFKGVVLGLSGGIDSALVAALAVRALGRENVLALFMPSRYSSEETREDAKELAEALGIEH 81 (248) T ss_pred CHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHCCCE T ss_conf 68999999999999999981999199958888999999999999728875999889997898758999999999958916 Q ss_pred CCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 26775899999886313234687431011012330466779888751695443643244775411023666555401017 Q gi|254780999|r 351 DVLPIHDLVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSNKSEISVGYGTLYGDMSGGFNPLK 430 (562) Q Consensus 351 ~~i~I~~~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nksE~~~Gy~T~~GD~~g~~~p~~ 430 (562) .+++|+++++++...+.+..+....+++.+|+|||+||.+||++||++|+||+|||||||.++||||+|||++|+++||+ T Consensus 82 ~~i~I~~~~~~~~~~~~~~~~~~~~~~~~~NiqaRlRm~~Ly~~An~~~~lVlgTgNksE~~~Gy~TkyGD~~~d~~Pi~ 161 (248) T cd00553 82 VNIDIDPAVEAFLALLGESGGSELEDLALGNIQARLRMVILYALANKLGGLVLGTGNKSELLLGYFTKYGDGAADINPIG 161 (248) T ss_pred EEECHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCHHHHHHCCCEECCCCCCCCCCCC T ss_conf 87155999999999864404870556787424878889999999974499896287460787572100078775822136 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHCCCCHHHHH-H Q ss_conf 763365899999963103322234444668878828898845389997700068-87889999999982799977987-1 Q gi|254780999|r 431 DLYKTQVFQLASWRNSHGITSGLGPLTEVIPPSILEKSPSAELRPHQTDQESLP-PYPILDDIIKRIVENEESFINND-Q 508 (562) Q Consensus 431 ~l~Kt~v~~la~~~n~~~~~~~~~~~~~~ip~~i~~~~psaeL~~~Q~ded~l~-~Y~~lD~il~~~i~~~~~~~~~~-~ 508 (562) |||||+||+||+|+| ||++|++|||||||||+|+|||+|| ||+.||.||+.+++.+.+..++. . T Consensus 162 ~L~Kt~V~~la~~l~--------------vp~~Ii~k~PSaeL~~~Q~DE~~lg~~Y~~lD~~L~~~~~~~~~~~~~~~~ 227 (248) T cd00553 162 DLYKTQVRELARYLG--------------VPESIIDKPPSAELWPGQTDEDELGMPYEELDQFLYLRLEGGLGPEEILAP 227 (248) T ss_pred CCCHHHHHHHHHHHC--------------CHHHHHCCCCCCCCCCCCCCHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHC T ss_conf 884999999999968--------------889873479996667999898775999999999999999659998999868 Q ss_pred CCCHHHHHHHHHHHHHHHHHH Q ss_conf 899999999999998745668 Q gi|254780999|r 509 EYNDETVRYVEHLLYGSEYKR 529 (562) Q Consensus 509 ~~~~~~v~~~~~~~~~~~~KR 529 (562) ++.++.|++|.+++.+||||| T Consensus 228 ~~~~~~v~~i~~~~~~~~hKR 248 (248) T cd00553 228 GIIEEVVKRVFRLYKKNEHKR 248 (248) T ss_pred CCCHHHHHHHHHHHHHCCCCC T ss_conf 999999999999999717072 No 6 >pfam02540 NAD_synthase NAD synthase. NAD synthase (EC:6.3.5.1) is involved in the de novo synthesis of NAD and is induced by stress factors such as heat shock and glucose limitation. Probab=100.00 E-value=0 Score=526.06 Aligned_cols=238 Identities=43% Similarity=0.610 Sum_probs=228.3 Q ss_pred CCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHH Q ss_conf 20203588876422454057521587236899999984149544679732466665221356999999721354267758 Q gi|254780999|r 277 NACVLSLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIH 356 (562) Q Consensus 277 ~Alvlgl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~ 356 (562) +.++-+|+||++|+|++++||||||||||||+|+||++|+|++||++++||+++|++.+.++|+.+|++||+++.+++|+ T Consensus 3 ~~i~~~l~dyv~k~g~~gvViGlSGGIDSav~a~La~~AlG~~~v~~v~mP~~~ss~~s~~~A~~la~~lgi~~~~i~I~ 82 (243) T pfam02540 3 ERLVDFLRDYVQKSGAKGVVLGLSGGIDSAVVAYLAVKALGKENVLALIMPSINSSEEDVQDALALAENLGINYKTIDIK 82 (243) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEECH T ss_conf 79999999999981999199978887999999999999559742899964777898889999999999968926999888 Q ss_pred HHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHH Q ss_conf 99999886313234687431011012330466779888751695443643244775411023666555401017763365 Q gi|254780999|r 357 DLVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSNKSEISVGYGTLYGDMSGGFNPLKDLYKTQ 436 (562) Q Consensus 357 ~~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nksE~~~Gy~T~~GD~~g~~~p~~~l~Kt~ 436 (562) ++++++...+.+ ...+++.+|+|||+||++||++||++|+||+|||||||+++||+|+|||++++++||+|||||+ T Consensus 83 ~~~~~~~~~~~~----~~~~~a~~Ni~aR~Rm~~ly~~A~~~~~lVlgT~NksE~~~Gy~TkyGD~~~di~Pi~dL~Kte 158 (243) T pfam02540 83 PIVRAFSQLFQP----AKDDLAKGNLKARIRMIILYAHANKFNRLVLGTGNKSELALGYFTKYGDGACDIAPIGDLYKTQ 158 (243) T ss_pred HHHHHHHHHHCC----CHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEEECCCCCCCHHHHCCCCHHH T ss_conf 999999988532----0137787455788889999986233791898168704200543231277765515416881899 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHCCCCHHHHHH--CCCHH Q ss_conf 899999963103322234444668878828898845389997700068-878899999999827999779871--89999 Q gi|254780999|r 437 VFQLASWRNSHGITSGLGPLTEVIPPSILEKSPSAELRPHQTDQESLP-PYPILDDIIKRIVENEESFINNDQ--EYNDE 513 (562) Q Consensus 437 v~~la~~~n~~~~~~~~~~~~~~ip~~i~~~~psaeL~~~Q~ded~l~-~Y~~lD~il~~~i~~~~~~~~~~~--~~~~~ 513 (562) ||+||+|+| ||++|++|+|||||||+|+||++|| ||+.||.||+.+++++.++.++.+ +++++ T Consensus 159 V~~la~~l~--------------vP~~ii~k~Psa~L~~~qtDE~~lg~~Y~~lD~~l~~~~~~~~~~~~i~~~~g~~~~ 224 (243) T pfam02540 159 VYELAKRLN--------------VPERIIKKPPSADLWPGQTDEDELGITYDELDQILKGLIEKKLSPEEIIGDLGLPAE 224 (243) T ss_pred HHHHHHHHC--------------CCHHHHHCCCCCCCCCCCCCHHCCCCCHHHHHHHHHHHHHCCCCHHHHHHHCCCCHH T ss_conf 999999949--------------888898379998888899861003899999999999998759999999987298999 Q ss_pred HHHHHHHHHHHHHHHHHHCC Q ss_conf 99999999987456687389 Q gi|254780999|r 514 TVRYVEHLLYGSEYKRRQAP 533 (562) Q Consensus 514 ~v~~~~~~~~~~~~KR~~~p 533 (562) .|++|.+++++|||||+ +| T Consensus 225 ~v~~i~~~~~~~e~KR~-~P 243 (243) T pfam02540 225 VVERVENLIQKSEHKRR-LP 243 (243) T ss_pred HHHHHHHHHHHHHHCCC-CC T ss_conf 99999999997405529-97 No 7 >PTZ00323 NAD+ synthase; Provisional Probab=100.00 E-value=0 Score=513.89 Aligned_cols=248 Identities=23% Similarity=0.282 Sum_probs=227.6 Q ss_pred CHHHHHHCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHE-----EEEECCCCCCCCCCHHHHHHHHH Q ss_conf 9888642202035888764224540575215872368999999841495446-----79732466665221356999999 Q gi|254780999|r 270 QEEEADYNACVLSLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGKENV-----QTIMLPYKYTSPQSLEDAAACAK 344 (562) Q Consensus 270 ~~~e~~~~Alvlgl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~~~v-----~~~~mp~~~~s~~s~~~a~~la~ 344 (562) ++++++ +.-|-.||||++|+||+++|||||||||||||++||++|||++|+ .+++||. ++|+.+.++|+.+++ T Consensus 25 dp~~eI-~~rv~fLrDYv~k~GfkgvVLGlSGGIDSAl~aaLA~~Alg~env~~~r~~gv~~P~-~ss~~s~~~a~~~a~ 102 (294) T PTZ00323 25 DPVAWI-EMKCAKLNEYMRRCGLKACVTSVSGGIDSAVVLALCSRAMRMPNSPIVRNVGICQPI-CSSAWALARGRENIA 102 (294) T ss_pred CHHHHH-HHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCC-CCCHHHHHHHHHHHH T ss_conf 989999-999999999999829985999586369999999999998655457530356776785-457665987999999 Q ss_pred HHCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHH---HHHHHCC--CEECCCCCHHH-HHHHHCCC Q ss_conf 721354267758999998863132346874310110123304667798---8875169--54436432447-75411023 Q gi|254780999|r 345 ALGCKYDVLPIHDLVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILM---ALSNHSK--AMLLTTSNKSE-ISVGYGTL 418 (562) Q Consensus 345 ~lg~~~~~i~I~~~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~---~~an~~~--~lvl~t~nksE-~~~Gy~T~ 418 (562) ++|+...+++|.+++..+...+...++....|++.||+|||+||.++| +++|+.| +|||||||||| +++||||+ T Consensus 103 ~~g~~~~~~~~~~i~~~~~~~~~~~~g~~~~d~~~gNlqAR~Rm~~~~~la~l~n~~g~~~LVlgTgNkSE~~~vGY~Tk 182 (294) T PTZ00323 103 ACGATEVVVDQTELHKQLSTLVETAVGIDGGDFARGQLRSYMRTPVGYYVAQLLSQEGTPAIVMGTGNMDEDGYLGYFCK 182 (294) T ss_pred HCCCCEEECCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCHHCEEEEE T ss_conf 76885232360777999999999860886104678899999988899999998600488707862897554212004420 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHH Q ss_conf 666555401017763365899999963103322234444668878828898845389997700068-8788999999998 Q gi|254780999|r 419 YGDMSGGFNPLKDLYKTQVFQLASWRNSHGITSGLGPLTEVIPPSILEKSPSAELRPHQTDQESLP-PYPILDDIIKRIV 497 (562) Q Consensus 419 ~GD~~g~~~p~~~l~Kt~v~~la~~~n~~~~~~~~~~~~~~ip~~i~~~~psaeL~~~Q~ded~l~-~Y~~lD~il~~~i 497 (562) |||++|||+||+|||||+||+||+|+| ||++|++|||||||||+|+|||+|| ||++||.+|+.|+ T Consensus 183 YGD~agd~apI~DL~Kt~V~~Lar~Lg--------------iPe~ii~K~PSAeL~~gQ~DedsLg~~Yd~LD~~l~~yv 248 (294) T PTZ00323 183 AGDGVVDVQLISDLHKSEVFLVAEELG--------------VPENTLQAAPSADLWEGQTDEEELGFPYDFVELYTGWYL 248 (294) T ss_pred ECCCCCCCHHHCCCCHHHHHHHHHHCC--------------CCHHHHCCCCCCCCCCCCCCHHHCCCCHHHHHHHHHHHH T ss_conf 347754703304683999999999809--------------989993089897667899708775999899999999998 Q ss_pred HCCCCHHHH-HHCCCHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 279997798-718999999999999987456687389 Q gi|254780999|r 498 ENEESFINN-DQEYNDETVRYVEHLLYGSEYKRRQAP 533 (562) Q Consensus 498 ~~~~~~~~~-~~~~~~~~v~~~~~~~~~~~~KR~~~p 533 (562) +.+.+..++ .+||++|.|++|.+++..+||||||.. T Consensus 249 e~~~~~~~i~~~g~~~e~v~~v~rlv~~~Eykrrq~a 285 (294) T PTZ00323 249 KLSETERQSFLHSLSSEARQQFERYVAACELTHRRNA 285 (294) T ss_pred HCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 6465698999834999999999999999999998752 No 8 >COG0171 NadE NAD synthase [Coenzyme metabolism] Probab=100.00 E-value=0 Score=504.49 Aligned_cols=253 Identities=46% Similarity=0.680 Sum_probs=236.7 Q ss_pred CHHHHHHCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCC----HHHEEEEECCCCCCCCCCHHHHHHHHHH Q ss_conf 988864220203588876422454057521587236899999984149----5446797324666652213569999997 Q gi|254780999|r 270 QEEEADYNACVLSLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALG----KENVQTIMLPYKYTSPQSLEDAAACAKA 345 (562) Q Consensus 270 ~~~e~~~~Alvlgl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg----~~~v~~~~mp~~~~s~~s~~~a~~la~~ 345 (562) .+.++.+++++.+|++|++++|++++|+|||||||||++++||++|+| ++||++++||++++++.+.+||+.+++. T Consensus 3 ~d~~~~~~~~v~fl~~~l~~~~~k~~VlGiSGGiDSa~~~~La~~A~~~~~~~~~~~av~mP~~~~~~~~~~da~~~~~~ 82 (268) T COG0171 3 IDLEEEINRLVDFLRDYLKKAGFKGVVLGLSGGIDSALVLALAVRALGKGDSKENVLAVRLPYGYTVQADEEDAQDLAEA 82 (268) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHCCCCCHHHEEEEECCCCCCCCCCHHHHHHHHHH T ss_conf 67999999999999999997399986997666819999999999985656512432668678877653479999999998 Q ss_pred HCCCCCCCCHHHHHHHHHHHHCCCCCC-CCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCHHHHHHHHCCCCCCCCC Q ss_conf 213542677589999988631323468-7431011012330466779888751695443643244775411023666555 Q gi|254780999|r 346 LGCKYDVLPIHDLVNHFFSLMSQFLQE-EPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSNKSEISVGYGTLYGDMSG 424 (562) Q Consensus 346 lg~~~~~i~I~~~~~~~~~~~~~~~~~-~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nksE~~~Gy~T~~GD~~g 424 (562) ||+.+.+++|.+++.+|...+.+.+.+ ...++++||+|||+||.+||++||++|+||+||||+||+++||||+|||+++ T Consensus 83 lg~~~~~i~I~~~v~~~~~~~~~~~~~~~~~~~~~~NikaR~Rm~~lY~~An~~~~lVlGTgn~sE~~~Gy~TkyGDg~~ 162 (268) T COG0171 83 LGIDYKEINIKPAVDAFLKKLLKLFLGIYLEDLALGNIKARLRMVILYAIANKLGGLVLGTGNKSELALGYFTKYGDGAV 162 (268) T ss_pred HCCCEEEEECHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCHHHHHCCCEECCCCCCC T ss_conf 29966997528889998876666523442112777645099999999999855597897588487786074432067644 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCCCCCC---CCCCCCCC-CHHHHHHHHHHHHHCC Q ss_conf 4010177633658999999631033222344446688788288988453899---97700068-8788999999998279 Q gi|254780999|r 425 GFNPLKDLYKTQVFQLASWRNSHGITSGLGPLTEVIPPSILEKSPSAELRPH---QTDQESLP-PYPILDDIIKRIVENE 500 (562) Q Consensus 425 ~~~p~~~l~Kt~v~~la~~~n~~~~~~~~~~~~~~ip~~i~~~~psaeL~~~---Q~ded~l~-~Y~~lD~il~~~i~~~ 500 (562) ||+||++||||+||+|+++++ ||++|++|||||||||+ |+||+++| ||+.+|.+|+.++++ T Consensus 163 d~~Pi~~L~KtqV~~La~~l~--------------ipe~I~~k~PTAeL~~~~~~q~DE~~lg~~Y~~lD~~L~~~~~~- 227 (268) T COG0171 163 DINPIADLYKTQVYALARHLG--------------IPEEILKKPPTADLWPDEPGQTDEAELGMPYEELDDILYGLLEN- 227 (268) T ss_pred CHHHHCCCCHHHHHHHHHHCC--------------CCHHHHCCCCCCCCCCCCCCCCCHHHHCCCHHHHHHHHHHHHHH- T ss_conf 705414875899999998739--------------99999639999112479888877777499999999999976531- Q ss_pred CCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEC---CCCCC Q ss_conf 997798718999999999999987456687389702472---64537 Q gi|254780999|r 501 ESFINNDQEYNDETVRYVEHLLYGSEYKRRQAPVGTKIT---AKSFG 544 (562) Q Consensus 501 ~~~~~~~~~~~~~~v~~~~~~~~~~~~KR~~~pp~~~is---~~sfg 544 (562) ..+++++.++++.+++..+||||+ +|++|+++ .++|+ T Consensus 228 ------~~~i~~~~~~~i~~~~~~~~~KR~-~p~~~~~~~~~~~~~~ 267 (268) T COG0171 228 ------GQEISEELVKKIERLYKKSEHKRR-LPIGPKITPWKFLSFG 267 (268) T ss_pred ------HCCCCHHHHHHHHHHHHHHHHHHC-CCCCCCCCCCCCCCCC T ss_conf ------112688999999999987776323-9998675422135668 No 9 >PRK00768 nadE NAD synthetase; Reviewed Probab=100.00 E-value=0 Score=472.46 Aligned_cols=235 Identities=26% Similarity=0.350 Sum_probs=206.0 Q ss_pred HHHHCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCC---CHH------HEEEEECCCCCCCCCCHHHHHHHH Q ss_conf 86422020358887642245405752158723689999998414---954------467973246666522135699999 Q gi|254780999|r 273 EADYNACVLSLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDAL---GKE------NVQTIMLPYKYTSPQSLEDAAACA 343 (562) Q Consensus 273 e~~~~Alvlgl~d~~~~~g~~~~viglSGGiDSal~a~la~~al---g~~------~v~~~~mp~~~~s~~s~~~a~~la 343 (562) ++..+..|-+||||++++|++|+||||||||||||||+||+.|+ +++ +++++.|| ++++.+.+||+..+ T Consensus 21 ~~~i~~iv~~Lrdyv~k~g~~g~VlGLSGGIDSAvta~L~~~Av~al~~~~~~~~~~~iav~mP--~~~~~~~~da~~~~ 98 (274) T PRK00768 21 EEEIRRRVDFLKDYLKKSGLKTLVLGISGGQDSTLAGRLAQLAVEELRAETGDASYQFIAVRLP--YGVQADEDDAQDAL 98 (274) T ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECC--CCCCCCHHHHHHHH T ss_conf 9999999999999999849983998188576889999999999998653047664226897689--88657999999998 Q ss_pred HHHCCC-CCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCHHHHHHHHCCCCCCC Q ss_conf 972135-4267758999998863132346874310110123304667798887516954436432447754110236665 Q gi|254780999|r 344 KALGCK-YDVLPIHDLVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSNKSEISVGYGTLYGDM 422 (562) Q Consensus 344 ~~lg~~-~~~i~I~~~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nksE~~~Gy~T~~GD~ 422 (562) +.++.. ..+++|.++++++...+.... ....+++.+|+|||+||++||++||++|+||+|||||||.++||+|+|||+ T Consensus 99 ~~i~~~~~~~i~I~~~~d~~~~~l~~~~-~~~~d~~~~NiqaRiRM~~LY~~An~~g~LVlGTgNksE~~vGYfTkYGDg 177 (274) T PRK00768 99 AFIQPDEVLTVNIKPAVDASVAALRAAG-IELSDFVKGNIKARERMIAQYAIAGARGGLVVGTDHAAEAITGFFTKFGDG 177 (274) T ss_pred HHCCCCEEEEECCCHHHHHHHHHHHHHC-CCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCHHHHHCCCEEEECCC T ss_conf 5227660488617099999999998727-751167888857989999999998278998864886404440850102677 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCCCC---CCCCCCCCCC-CHHHHHHHHHHHHH Q ss_conf 5540101776336589999996310332223444466887882889884538---9997700068-87889999999982 Q gi|254780999|r 423 SGGFNPLKDLYKTQVFQLASWRNSHGITSGLGPLTEVIPPSILEKSPSAELR---PHQTDQESLP-PYPILDDIIKRIVE 498 (562) Q Consensus 423 ~g~~~p~~~l~Kt~v~~la~~~n~~~~~~~~~~~~~~ip~~i~~~~psaeL~---~~Q~ded~l~-~Y~~lD~il~~~i~ 498 (562) ++|++||+|||||+||+||+|+| ||++|++|||||+|| |+|+|||+|| +|+.||.+|+ T Consensus 178 ~~Di~PI~dL~KteV~~lA~~Lg--------------vP~~Ii~k~PSA~Lwe~~pgQtDE~elG~~Y~~lD~~L~---- 239 (274) T PRK00768 178 GADLLPLFGLNKRQGRALLAALG--------------APEHLYEKVPTADLEDDRPGLPDEVALGVTYDQIDDYLE---- 239 (274) T ss_pred CCCHHHHCCCCHHHHHHHHHHHC--------------CCHHHHCCCCCCCCCCCCCCCCCHHHHCCCHHHHHHHHC---- T ss_conf 51737632561999999999959--------------799983689899754458999777775949999999983---- Q ss_pred CCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCE Q ss_conf 799977987189999999999999874566873897024 Q gi|254780999|r 499 NEESFINNDQEYNDETVRYVEHLLYGSEYKRRQAPVGTK 537 (562) Q Consensus 499 ~~~~~~~~~~~~~~~~v~~~~~~~~~~~~KR~~~pp~~~ 537 (562) .++++++.+++|.+++.+|+||| ++||.|- T Consensus 240 --------g~~~~~~~~~~i~~~~~~s~hKR-~~P~~~~ 269 (274) T PRK00768 240 --------GKPVSEEAAETIENWYLKTQHKR-HLPITIF 269 (274) T ss_pred --------CCCCCHHHHHHHHHHHHCCCCCC-CCCCCCC T ss_conf --------89999999999999997254565-7989964 No 10 >cd07570 GAT_Gln-NAD-synth Glutamine aminotransferase (GAT, glutaminase) domain of glutamine-dependent NAD synthetases (class 7 and 8 nitrilases). Glutamine-dependent NAD synthetases are bifunctional enzymes, which have an N-terminal GAT domain and a C-terminal NAD+ synthetase domain. The GAT domain is a glutaminase (EC 3.5.1.2) which hydrolyses L-glutamine to L-glutamate and ammonia. The ammonia is used by the NAD+ synthetase domain in the ATP-dependent amidation of nicotinic acid adenine dinucleotide. Glutamine aminotransferases are categorized depending on their active site residues into different unrelated classes. This class of GAT domain belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this sub Probab=100.00 E-value=0 Score=458.34 Aligned_cols=247 Identities=40% Similarity=0.711 Sum_probs=235.4 Q ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 89999638836889999999999999999859989991664447998445414889999999999999999887697899 Q gi|254780999|r 6 KIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTHDGGAGIV 85 (562) Q Consensus 6 kIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~~~~i~ii 85 (562) |||++|+++.+||+++|++++.+++++|+++||||||||||++|||+|+|+++++.+.+.+.+++++|++.++++++.++ T Consensus 1 kIa~~Q~~~~~gd~~~Nl~~~~~~i~~A~~~ga~livfPEl~ltGY~~~d~~~~~~~~~~~~~~l~~l~~~~~~~~i~iv 80 (261) T cd07570 1 RIALAQLNPTVGDLEGNAEKILEAIREAKAQGADLVVFPELSLTGYPPEDLLLRPDFLEAAEEALEELAAATADLDIAVV 80 (261) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCHHHHHHCHHHHHHHHHHHHHHHHHHHHCCEEEE T ss_conf 99999899987899999999999999999889979998889544799788763656789999999999998865796999 Q ss_pred EEEEEEECCEEEEEEEEEECCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 99999888987899999969979998877762355433321003778887614764745620111111101689999971 Q gi|254780999|r 86 VGFPRQDQEGVLNSVVILDAGNIIAVRDKINLPNYSEFHEKRTFISGYSNDPIVFRDIRLGILICEDIWKNSNICKHLKK 165 (562) Q Consensus 86 vG~p~~~~~~~yNsa~vi~~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~~~~~~g~kiGv~ICeDlw~~~~~~~~la~ 165 (562) +|+|++.++++|||+++|.+|++++.|+|+|||+|++|+|++||++|+...+|+++++|+|++||||+|+|+.+.+.+++ T Consensus 81 ~G~~~~~~~~~yNsa~~i~~G~i~~~y~K~hLp~~~~f~E~~~f~~G~~~~~~~~~~~kiG~~ICyD~~fpe~~~r~la~ 160 (261) T cd07570 81 VGLPLRHDGKLYNAAAVLQNGKILGVVPKQLLPNYGVFDEKRYFTPGDKPDVLFFKGLRIGVEICEDLWVPDPPSAELAL 160 (261) T ss_pred EEEEEEECCEEEEEEEEEECCEEEEEEEEEECCCCCCCCCHHHCCCCCCCCEEEECCEEEEEEEECCCCCCCHHHHHHHH T ss_conf 96789889978888899649928889988916999863602312368877337868968887655146788779999997 Q ss_pred CCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEECCCCEEEEECCCCCCCEEEEEE Q ss_conf 47775993155533358078887889998543370464213568876621048643555898599961677763799999 Q gi|254780999|r 166 QGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDELIFDGASFCFDGQQQLAFQMKHFSEQNFMTEW 245 (562) Q Consensus 166 ~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~G~ii~~~~~f~e~~~i~d~ 245 (562) +|||+|++||||||..+|.+.|+.++++||.||+++++++|++|++++++|.|+|+|+||+|+++++++.++|. ++++ T Consensus 161 ~Gadii~~psA~p~~~~k~~~~~~l~~arA~en~~~vv~~N~vG~~~~~~f~G~S~I~~P~G~vla~~~~~ee~--~a~i 238 (261) T cd07570 161 AGADLILNLSASPFHLGKQDYRRELVSSRSARTGLPYVYVNQVGGQDDLVFDGGSFIADNDGELLAEAPRFEED--LADV 238 (261) T ss_pred CCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCCEEECCEEEEECCCCCEEEECCCCCEE--EEEE T ss_conf 59979996478865568289999999999998298599993557899669836089999997389985999489--9998 Q ss_pred EECCCCCCC Q ss_conf 833521236 Q gi|254780999|r 246 HYDQQLSQW 254 (562) Q Consensus 246 d~d~~~~~~ 254 (562) |+|+....+ T Consensus 239 Dld~~~~~R 247 (261) T cd07570 239 DLDRLRSER 247 (261) T ss_pred EHHHHHHHH T ss_conf 729989998 No 11 >PRK00876 nadE NAD synthetase; Reviewed Probab=100.00 E-value=0 Score=457.63 Aligned_cols=241 Identities=28% Similarity=0.381 Sum_probs=210.4 Q ss_pred HHHHCCHHHHHHHHHHHC-CCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 864220203588876422-4540575215872368999999841495446797324666652213569999997213542 Q gi|254780999|r 273 EADYNACVLSLRDYVQKN-NFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYD 351 (562) Q Consensus 273 e~~~~Alvlgl~d~~~~~-g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~ 351 (562) +..-+..+-.||||+.++ +.+|+||||||||||||+++||++|||++||+|+.||++.|++++.++|+.+|+.||+++. T Consensus 13 ~~e~~riv~fir~~v~~~~~~kG~VlGlSGGIDSAv~a~Lav~AlG~e~V~gl~MP~~~s~~~s~~~a~~la~~lGi~~~ 92 (325) T PRK00876 13 AAEAERIRAFIREQVLGTLKRRGVVLGLSGGIDSSVTLALCVRALGKDRVLGLLMPERDSSPDSLRLGRMLAESLGVEYV 92 (325) T ss_pred HHHHHHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHCCCCEE T ss_conf 99999999999999985169867999687688899999999997485516999788656887789999999998199359 Q ss_pred CCCHHHHHHHHH------HHH---CCC--------------CCC---------------C-----------CCCHHHHCC Q ss_conf 677589999988------631---323--------------468---------------7-----------431011012 Q gi|254780999|r 352 VLPIHDLVNHFF------SLM---SQF--------------LQE---------------E-----------PSGIVAENI 382 (562) Q Consensus 352 ~i~I~~~~~~~~------~~~---~~~--------------~~~---------------~-----------~~~~~~eN~ 382 (562) +++|+++++++- ..+ -+. ++. . ...++.+|+ T Consensus 93 ~idI~~~l~~~~~y~~~d~~i~~~~p~~~~~~~~~i~~~~~l~~~~~~~~~l~~~~~~~~~~~~~l~~~~~l~~~a~~Ni 172 (325) T PRK00876 93 VEDISPALEALGCYERRDEAIRRVVPEYGPGWKSKLVLPNFLEGDLLNVPYLVVQDPGGEVRKKRLPPADYLQIVAATNF 172 (325) T ss_pred EEECHHHHHHHHHHHHHHHHHHHHCCCCCCCHHCCCCCCCHHCCCCCCCHHHHCCCCCCCCCCCCCCHHHHHHHHHHCCH T ss_conf 98448999875431356888875033324200000002311102322310121046421000013550344445553233 Q ss_pred CCHHHHHHHHHHHHHCCCEECCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCH Q ss_conf 33046677988875169544364324477541102366655540101776336589999996310332223444466887 Q gi|254780999|r 383 QSRIRGNILMALSNHSKAMLLTTSNKSEISVGYGTLYGDMSGGFNPLKDLYKTQVFQLASWRNSHGITSGLGPLTEVIPP 462 (562) Q Consensus 383 qaR~R~~~L~~~an~~~~lvl~t~nksE~~~Gy~T~~GD~~g~~~p~~~l~Kt~v~~la~~~n~~~~~~~~~~~~~~ip~ 462 (562) |||+||.+||++||++|++|+||+||||+.+||||||||+++|++||+||||||||+||+|++ ||+ T Consensus 173 KaR~RM~~lY~~A~~~n~lVlGT~NksE~~~GyfTKyGDGa~Di~PI~~LyKtQV~~LA~~Lg--------------vPe 238 (325) T PRK00876 173 KQRTRKMVEYYHADRLNYAVAGTPNRLEYDQGFFVKYGDGAADLKPIAHLYKTQVYALAEYLG--------------VPE 238 (325) T ss_pred HHHHHHHHHHHHHHHCCCEEECCCCHHHHHHCCEEEECCCCCCCCCCCCCCHHHHHHHHHHCC--------------CCH T ss_conf 689999999999987398776588465566351475447887845135775899999999809--------------999 Q ss_pred HHCCCCCCCCCCCCCCCCCCC--C-CHHHHHHHHHHHHHCCCCHHHHHH--CCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 882889884538999770006--8-878899999999827999779871--89999999999999874566873 Q gi|254780999|r 463 SILEKSPSAELRPHQTDQESL--P-PYPILDDIIKRIVENEESFINNDQ--EYNDETVRYVEHLLYGSEYKRRQ 531 (562) Q Consensus 463 ~i~~~~psaeL~~~Q~ded~l--~-~Y~~lD~il~~~i~~~~~~~~~~~--~~~~~~v~~~~~~~~~~~~KR~~ 531 (562) +|++|+|||+||++|+|||++ + ||+.+|.+|+.+ +++.+.+++.+ +.+.|.|++|++.+ +.||+. T Consensus 239 ~Ii~K~PSadLw~~~~~edE~~~g~~Y~~lD~~L~~~-~~~~~~~~~~~~~g~~~e~v~~v~~~i---~~kr~~ 308 (325) T PRK00876 239 EIRRRPPTTDTYSLYQTQEEFYFRLPYDIMDLCLYAQ-NHGVPAEVVAAALGLTPEQVERVWRDI---EAKRRT 308 (325) T ss_pred HHHCCCCCCCCCCCCCCHHHHCCCCCHHHHHHHHHHH-HCCCCHHHHHHHHCCCHHHHHHHHHHH---HHHHHH T ss_conf 9936999813358999724540599889999999987-649999999988398999999999999---987530 No 12 >cd07586 nitrilase_8 Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases). The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). Class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup represents either a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=100.00 E-value=0 Score=404.81 Aligned_cols=246 Identities=28% Similarity=0.446 Sum_probs=219.4 Q ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 89999638836889999999999999999859989991664447998445414889999999999999999887697899 Q gi|254780999|r 6 KIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTHDGGAGIV 85 (562) Q Consensus 6 kIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~~~~i~ii 85 (562) |||++|+++++||+++|++++++++++|+++|||||||||+++|||+++|+..+.. .......+..+++.++ ++.|+ T Consensus 1 kiA~~Q~~~~~gD~~~Nl~~~~~~i~~A~~~ga~livfPE~~~tGy~~~d~~~~~a-~~~~~~~~~~l~~~a~--~~~iv 77 (269) T cd07586 1 RVAIAQIDPVLGDVEENLEKHLEIIETARERGADLVVFPELSLTGYNLGDLVYEVA-MHADDPRLQALAEASG--GICVV 77 (269) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCHHHHHHHHH-CCCCHHHHHHHHHHCC--CCEEE T ss_conf 99999899976899999999999999999881969998889401798567655543-0476799999999879--98399 Q ss_pred EEEEEEE-CCEEEEEEEEEECCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 9999988-898789999996997999887776235543332100377888761476474562011111110168999997 Q gi|254780999|r 86 VGFPRQD-QEGVLNSVVILDAGNIIAVRDKINLPNYSEFHEKRTFISGYSNDPIVFRDIRLGILICEDIWKNSNICKHLK 164 (562) Q Consensus 86 vG~p~~~-~~~~yNsa~vi~~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~~~~~~g~kiGv~ICeDlw~~~~~~~~la 164 (562) +|.+++. ++++||+++++.+|++++.|+|+|||+|+.|+|.+||++|+...+|+++++|+|++||||+|+|+ +++.++ T Consensus 78 ~G~~~~~~~~~~~Ns~~~~~~G~i~~~yrK~hLp~~~~~~E~~~f~~G~~~~v~~~~~~~iG~~ICyD~~fPe-~~r~la 156 (269) T cd07586 78 FGFVEEGRDGRFYNSAAYLEDGRVVHVHRKVYLPTYGLFEEGRYFAPGSHLRAFDTRFGRAGVLICEDAWHPS-LPYLLA 156 (269) T ss_pred EEEEEECCCCCEEEEEEEEECCEEEEEEEEEECCCCCCCCCCCEECCCCCCEEEEECCEEEEEEEECCCCCHH-HHHHHH T ss_conf 9968987999199999999799299997179659999716221113999506899789089987750104769-999999 Q ss_pred HCCCCEEEECCCCCCCCCC-----HHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEECCCCEEEEECCCCCCC Q ss_conf 1477759931555333580-----78887889998543370464213568876621048643555898599961677763 Q gi|254780999|r 165 KQGAEFLFSLNASPYYHNK-----LKKRHEIVTGQISHVHLPIIYVNQVGGQDELIFDGASFCFDGQQQLAFQMKHFSEQ 239 (562) Q Consensus 165 ~~Gadlii~psASp~~~~k-----~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~G~ii~~~~~f~e~ 239 (562) .+|||+|++|+|||+...+ ...|+.++++||.||++|++++|++|.+++..|.|+|+|++|+|+++++++.|+|+ T Consensus 157 ~~Ga~ii~~paa~p~~~~~~~~~~~~~w~~l~~arA~en~~~vv~~N~~G~~~~~~~~G~S~ii~P~G~ila~~~~~~e~ 236 (269) T cd07586 157 LDGADVIFIPANSPARGVGGDFDNEENWETLLKFYAMMNGVYVVFANRVGVEDGVYFWGGSRVVDPDGEVVAEAPLFEED 236 (269) T ss_pred HCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCEEEECCCEEECCCCCEEEECCCCCCE T ss_conf 87998999936886536666446799999999999997097899986622789907956448997998398875899987 Q ss_pred EEEEEEEECCCCCCCC Q ss_conf 7999998335212365 Q gi|254780999|r 240 NFMTEWHYDQQLSQWN 255 (562) Q Consensus 240 ~~i~d~d~d~~~~~~~ 255 (562) ++++|+|++.....+. T Consensus 237 vi~adiDl~~i~~~R~ 252 (269) T cd07586 237 LLVAELDRSAIRRARF 252 (269) T ss_pred EEEEEEEHHHHHHHHH T ss_conf 9999975599999986 No 13 >cd07576 R-amidase_like Pseudomonas sp. MCI3434 R-amidase and related proteins (putative class 13 nitrilases). Pseudomonas sp. MCI3434 R-amidase hydrolyzes (R,S)-piperazine-2-tert-butylcarboxamide to form (R)-piperazine-2-carboxylic acid. It does so with strict R-stereoselectively. Its preferred substrates are carboxamide compounds which have the amino or imino group connected to their beta- or gamma-carbon. This subgroup belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), class 13 represents proteins that at the time were difficult to place in a distinct similarity group. It has been suggested that this subgroup represents a new class. Members of the nitrilase superfamily generally form homomeric compl Probab=100.00 E-value=0 Score=391.55 Aligned_cols=240 Identities=25% Similarity=0.379 Sum_probs=214.9 Q ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 89999638836889999999999999999859989991664447998445414889999999999999999887697899 Q gi|254780999|r 6 KIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTHDGGAGIV 85 (562) Q Consensus 6 kIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~~~~i~ii 85 (562) |||++|+++.+||+++|++++++++++|+++|||||||||+++|||++.|.+.+... ......++.|++.+++++++++ T Consensus 1 kiA~~Q~~~~~gD~~~N~~~~~~~i~~A~~~ga~lvvfPE~~l~Gy~~~~~~~~~a~-~~~~~~~~~l~~~A~~~~i~iv 79 (254) T cd07576 1 RLALYQGPARDGDVAANLARLDEAAARAAAAGADLLVFPELFLTGYNIGDAVARLAE-PADGPALQALRAIARRHGIAIV 79 (254) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCHHHHHHC-CCCCHHHHHHHHHHHHCCCEEE T ss_conf 999998999889999999999999999998869799948886358986612344311-4786999999999997298699 Q ss_pred EEEEEEECCEEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 9999988898789999996-997999887776235543332100377888761476474562011111110168999997 Q gi|254780999|r 86 VGFPRQDQEGVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFISGYSNDPIVFRDIRLGILICEDIWKNSNICKHLK 164 (562) Q Consensus 86 vG~p~~~~~~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~~~~~~g~kiGv~ICeDlw~~~~~~~~la 164 (562) +|++++.++++||++++|. +|+++++|+|+|||++ +|++||.+|+...+|+++++|+|++||||+|+|+ +++.++ T Consensus 80 ~G~~e~~~~~~yNs~~~i~~~G~i~~~y~K~hL~~~---~E~~~f~~G~~~~vf~~~~~~iG~~IC~D~~fpe-~~r~la 155 (254) T cd07576 80 VGYPERAGGAVYNAAVLIDEDGTVLANYRKTHLFGD---SERAAFTPGDRFPVVELRGLRVGLLICYDVEFPE-LVRALA 155 (254) T ss_pred EEEEEEECCEEEEEEEEECCCCEEEEEECCCCCCCC---CCCEEECCCCCCEEEEECCEEEEEEEECCCCCCH-HHHHHH T ss_conf 989997599788999999289839885714446886---6305883898634668898799988861543879-999999 Q ss_pred HCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEECCCCEEEEECCCCCCCEEEEE Q ss_conf 14777599315553335807888788999854337046421356887662104864355589859996167776379999 Q gi|254780999|r 165 KQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDELIFDGASFCFDGQQQLAFQMKHFSEQNFMTE 244 (562) Q Consensus 165 ~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~G~ii~~~~~f~e~~~i~d 244 (562) ++|||+|++|+|++...++. +..++++||.||++|++++|++|.+++++|+|+|+|+||+|+++++++. +|+++++| T Consensus 156 ~~Ga~ii~~psa~~~~~~~~--~~~~~~arA~en~~~vv~~n~~G~~~~~~~~G~S~ii~P~G~ila~~~~-~e~~~~ad 232 (254) T cd07576 156 LAGADLVLVPTALMEPYGFV--ARTLVPARAFENQIFVAYANRCGAEDGLTYVGLSSIAGPDGTVLARAGR-GEALLVAD 232 (254) T ss_pred HCCCCEEEECCCCCCCHHHH--HHHHHHHHHHHCCCEEEEEECCCCCCCEEECCCEEEEECCCEEEEECCC-CCEEEEEE T ss_conf 86997999887888852069--9999999999829949999664158980772876999189829876699-88699999 Q ss_pred EEECCCCCC Q ss_conf 983352123 Q gi|254780999|r 245 WHYDQQLSQ 253 (562) Q Consensus 245 ~d~d~~~~~ 253 (562) +|.+..... T Consensus 233 iD~~~i~~~ 241 (254) T cd07576 233 LDPAALAAA 241 (254) T ss_pred ECHHHHHHH T ss_conf 719999999 No 14 >cd07585 nitrilase_7 Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases). The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). Class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup represents either a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=100.00 E-value=0 Score=385.08 Aligned_cols=241 Identities=20% Similarity=0.266 Sum_probs=214.9 Q ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 89999638836889999999999999999859989991664447998445414889999999999999999887697899 Q gi|254780999|r 6 KIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTHDGGAGIV 85 (562) Q Consensus 6 kIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~~~~i~ii 85 (562) |||++|+++++||+++|++++.+++++|+++|||||||||+++|||++.+......... ....++.|++.++++++.++ T Consensus 1 kial~Q~~~~~gd~~~Nl~~~~~~i~~A~~~ga~lvvfPE~~ltGy~~~~~~~~~~~~~-~~~~~~~l~~~a~~~~i~iv 79 (261) T cd07585 1 RIALVQFEARVGDKARNLAVIARWTRKAAAQGAELVCFPEMCITGYTHVRALSREAEVP-DGPSTQALSDLARRYGLTIL 79 (261) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCHHHHHHHHHCCC-CCHHHHHHHHHHHHHCCEEE T ss_conf 99999898987889999999999999999886929998988302687024566651468-98999999999987596999 Q ss_pred EEEEEEECCEEEEEEEEEECCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 99999888987899999969979998877762355433321003778887614764745620111111101689999971 Q gi|254780999|r 86 VGFPRQDQEGVLNSVVILDAGNIIAVRDKINLPNYSEFHEKRTFISGYSNDPIVFRDIRLGILICEDIWKNSNICKHLKK 165 (562) Q Consensus 86 vG~p~~~~~~~yNsa~vi~~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~~~~~~g~kiGv~ICeDlw~~~~~~~~la~ 165 (562) +|++++.++++||+++++.+|..++.|+|+||+. .|++||++|+...+|++.++|+|++||||+|+|+ +++.+++ T Consensus 80 ~g~~e~~~~~~yNs~~~i~~~g~i~~yrK~hl~~----~E~~~f~~G~~~~vf~~~~~~iG~~IC~D~~fPe-~~r~la~ 154 (261) T cd07585 80 AGLIEKAGDRPYNTYLVCLPDGLVHRYRKLHLFR----REHPYIAAGDEYPVFATPGVRFGILICYDNHFPE-NVRATAL 154 (261) T ss_pred EEEEEECCCCEEEEEEEEECCCEEEEEECCCCCC----CCCCCCCCCCCCEEEEECCEEEEEEECCCCCCCH-HHHHHHH T ss_conf 9679906994899999996996168885377886----5434347998424897689279867613555879-9999998 Q ss_pred CCCCEEEECCCCCCCCCC--HHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEECCCCEEEEECCCCCCCEEEE Q ss_conf 477759931555333580--788878899985433704642135688766210486435558985999616777637999 Q gi|254780999|r 166 QGAEFLFSLNASPYYHNK--LKKRHEIVTGQISHVHLPIIYVNQVGGQDELIFDGASFCFDGQQQLAFQMKHFSEQNFMT 243 (562) Q Consensus 166 ~Gadlii~psASp~~~~k--~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~G~ii~~~~~f~e~~~i~ 243 (562) +||++|++|||||..... ...++.++++||.+|+++++++|++|++++.+|+|+|+|+||+|+++++++.++|+++++ T Consensus 155 ~Ga~ii~~psa~~~~~~~~~~~~~~~~~~aRA~en~~~v~~~N~vG~~~~~~~~G~S~Iv~P~G~il~~~~~~~e~~~~a 234 (261) T cd07585 155 LGAEILFAPHATPGTTSPKGREWWMRWLPARAYDNGVFVAACNGVGRDGGEVFPGGAMILDPYGRVLAETTSGGDGMVVA 234 (261) T ss_pred CCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEECEEEEECCCCCEEEECCCCCCEEEEE T ss_conf 79979998865678777421889999999999972994898456656898157142799989983987338999889999 Q ss_pred EEEECCCCC Q ss_conf 998335212 Q gi|254780999|r 244 EWHYDQQLS 252 (562) Q Consensus 244 d~d~d~~~~ 252 (562) |+|++.... T Consensus 235 diDl~~i~~ 243 (261) T cd07585 235 DLDLDLINT 243 (261) T ss_pred EEEHHHHHH T ss_conf 988599999 No 15 >cd07583 nitrilase_5 Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases). The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). Class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup represents either a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=100.00 E-value=0 Score=381.87 Aligned_cols=239 Identities=24% Similarity=0.365 Sum_probs=213.4 Q ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 89999638836889999999999999999859989991664447998445414889999999999999999887697899 Q gi|254780999|r 6 KIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTHDGGAGIV 85 (562) Q Consensus 6 kIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~~~~i~ii 85 (562) |||++|+++++||+++|+++|++++++|+++||||||||||++|||++.++..... ......++.|++.+++++++++ T Consensus 1 kIA~~Q~~~~~gd~~~Nl~~i~~~i~~A~~~ga~lvvfPE~~ltGY~~~~~~~~a~--~~~~~~~~~l~~~a~~~~i~iv 78 (253) T cd07583 1 KIALIQLDIVWGDPEANIERVESLIEEAAAAGADLIVLPEMWNTGYFLDDLYELAD--EDGGETVSFLSELAKKHGVNIV 78 (253) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHCC--CCCCHHHHHHHHHHHHHCCEEE T ss_conf 99999898987999999999999999999883989991898635899789986333--4782999999999887497999 Q ss_pred EE-EEEEECCEEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCEEEEECCCCCCCHHHHHHHHH Q ss_conf 99-99988898789999996-99799988777623554333210037788876147647456201111111016899999 Q gi|254780999|r 86 VG-FPRQDQEGVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFISGYSNDPIVFRDIRLGILICEDIWKNSNICKHL 163 (562) Q Consensus 86 vG-~p~~~~~~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~~~~~~g~kiGv~ICeDlw~~~~~~~~l 163 (562) +| .+++.++++||++++|. +|+++++|+|+||+++ ++|++||.+|+...+|+++++|+|++||||+|+|+ +++.+ T Consensus 79 ~G~~~e~~~~~~yNsa~~i~~~G~i~~~yrK~hl~~~--~~E~~~f~~G~~~~v~~~~~~~iG~~IC~D~~fPe-~~r~~ 155 (253) T cd07583 79 AGSVAEKEGGKLYNTAYVIDPDGELIATYRKIHLFGL--MGEDKYLTAGDELEVFELDGGKVGLFICYDLRFPE-LFRKL 155 (253) T ss_pred EEEEEEEECCEEEEEEEEECCCCEEEEEEECCCCCCC--CCCCEEEECCCCCCEEEECCEEEEEEEECCCCCHH-HHHHH T ss_conf 9006980099499999999089859999923026997--66153786488641376445278857860333789-99999 Q ss_pred HHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEECCCCEEEEECCCCCCCEEEE Q ss_conf 71477759931555333580788878899985433704642135688766210486435558985999616777637999 Q gi|254780999|r 164 KKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDELIFDGASFCFDGQQQLAFQMKHFSEQNFMT 243 (562) Q Consensus 164 a~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~G~ii~~~~~f~e~~~i~ 243 (562) +.+|||+|++|++||+.. .+.+..++++||.||++|++++|++|.+++..|+|+|+|+||+|+++++++. +|+++++ T Consensus 156 a~~Gadii~~psa~p~~~--~~~w~~~~~arA~en~~~vv~aN~~G~~~~~~~~G~S~Iv~P~G~ila~~~~-~e~~i~a 232 (253) T cd07583 156 ALEGAEILFVPAEWPAAR--IEHWRTLLRARAIENQAFVVACNRVGTDGGNEFGGHSMVIDPWGEVLAEAGE-EEEILTA 232 (253) T ss_pred HHCCCCEEEECCCCCCCC--HHHHHHHHHHHHHHCCCCEEEECCCCCCCCCEECCEEEEECCCCCEEEECCC-CCEEEEE T ss_conf 987997999906999742--2667668999999839949997662158997873448999999869888799-9849999 Q ss_pred EEEECCCCC Q ss_conf 998335212 Q gi|254780999|r 244 EWHYDQQLS 252 (562) Q Consensus 244 d~d~d~~~~ 252 (562) |+|++.... T Consensus 233 diDl~~~~~ 241 (253) T cd07583 233 EIDLEEVAE 241 (253) T ss_pred EECHHHHHH T ss_conf 973999999 No 16 >cd07584 nitrilase_6 Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases). The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). Class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup represents either a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=100.00 E-value=0 Score=380.59 Aligned_cols=241 Identities=26% Similarity=0.337 Sum_probs=212.5 Q ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHH-HHHHH-HHHHHHHHHHHHHCCCE Q ss_conf 89999638836889999999999999999859989991664447998445414889-99999-99999999998876978 Q gi|254780999|r 6 KIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKS-FIQAC-SSAIDTLKSDTHDGGAG 83 (562) Q Consensus 6 kIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~-f~~~~-~~~l~~La~~~~~~~i~ 83 (562) |||++|+++++||+++|++++.+++++|+++||||||||||++|||++.++..+.. +.+.. ...++.|++.++++++. T Consensus 1 riA~~Q~~~~~gD~~~N~~~~~~~i~~A~~~gadlvvfPEl~ltGy~~~~~~~~~~~~a~~~~~~~~~~l~~~a~~~~i~ 80 (258) T cd07584 1 KVALIQMDSVLGDVKANLKKAAELCKEAAAEGADLICFPELATTGYRPDLLGPKLWELSEPIDGPTVRLFSELAKELGVY 80 (258) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCE T ss_conf 99999898987899999999999999999887949995877424799578512489876407849999999998870938 Q ss_pred EEEEEEEEEC--CEEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCEEEEECCCCCCCHHHHHH Q ss_conf 9999999888--98789999996-99799988777623554333210037788876147647456201111111016899 Q gi|254780999|r 84 IVVGFPRQDQ--EGVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFISGYSNDPIVFRDIRLGILICEDIWKNSNIC 160 (562) Q Consensus 84 iivG~p~~~~--~~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~~~~~~g~kiGv~ICeDlw~~~~~~ 160 (562) |++|++++.+ +++||++++|. +|+++++|+|+||+. +|.+||.+|+...+|++.++|+|++||||+|+|+ .. T Consensus 81 iv~g~~e~~~~~~~~yNs~~~i~~~G~i~~~yrK~hl~~----~E~~~f~~G~~~~v~~~~~gkiG~~IC~D~~fPe-~~ 155 (258) T cd07584 81 IVCGFVEKGGVPGKVYNSAVVIDPEGESLGVYRKIHLWG----LEKQYFREGEQYPVFDTPFGKIGVMICYDMGFPE-VA 155 (258) T ss_pred EEEECCEECCCCCCEEEEEEEECCCCCEEEEEECCCCCC----CCCCCCCCCCCCEEEEECCCEEEEEEECCCCCCH-HH T ss_conf 999403144899958888999948983898797576676----5212357897310576068467778871678717-99 Q ss_pred HHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEECCCCEEEEECCCCCCCE Q ss_conf 99971477759931555333580788878899985433704642135688766210486435558985999616777637 Q gi|254780999|r 161 KHLKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDELIFDGASFCFDGQQQLAFQMKHFSEQN 240 (562) Q Consensus 161 ~~la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~G~ii~~~~~f~e~~ 240 (562) +.++.+|||+|++|++||.. +.+.+..++++||.+|+++++++|++|.+++..|+|+|+|++|+|+++++++.|+|++ T Consensus 156 r~la~~Gadii~~psa~~~~--~~~~~~~~~~arA~en~~~vv~~N~vG~~~~~~~~G~S~iv~p~G~~la~~~~~~e~l 233 (258) T cd07584 156 RILTLKGAEVIFCPSAWREQ--DADIWDINLPARALENTVFVAAVNRVGNEGDLVLFGKSKILNPRGQVLAEASEEAEEI 233 (258) T ss_pred HHHHHCCCCEEEECCCCCCC--CCHHHHHHHHHHHHHCCCEEEEEEEECCCCCEEEEEEEEEECCCCEEEEECCCCCCEE T ss_conf 99998699899987888886--4379999999999986994899810115899078311489989990967538999889 Q ss_pred EEEEEEECCCCCC Q ss_conf 9999983352123 Q gi|254780999|r 241 FMTEWHYDQQLSQ 253 (562) Q Consensus 241 ~i~d~d~d~~~~~ 253 (562) +++|+|++..... T Consensus 234 ~~adiDl~~~~~~ 246 (258) T cd07584 234 LYAEIDLDAIADY 246 (258) T ss_pred EEEEECHHHHHHH T ss_conf 9999639999999 No 17 >TIGR03381 agmatine_aguB N-carbamoylputrescine amidase. Members of this family are N-carbamoylputrescine amidase (3.5.1.53). Bacterial genes are designated AguB. The AguAB pathway replaces SpeB for conversion of agmatine to putrescine in two steps rather than one. Probab=100.00 E-value=0 Score=377.25 Aligned_cols=246 Identities=26% Similarity=0.387 Sum_probs=214.3 Q ss_pred EEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHH-----HHHHHHHHHHHH Q ss_conf 289999638836889999999999999999859989991664447998445414889999999-----999999999887 Q gi|254780999|r 5 LKIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACS-----SAIDTLKSDTHD 79 (562) Q Consensus 5 mkIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~-----~~l~~La~~~~~ 79 (562) ||||++|+++. +|+++|++++.+++++|+++||||||||||++|||++.+. .+.+.+... ..+..+++.+++ T Consensus 1 mkvA~vQ~~~~-~d~~~Nl~~~~~~i~~A~~~ga~livfPEl~~tgy~~~~~--~~~~~~~a~~~~~~~~~~~l~~~A~~ 77 (279) T TIGR03381 1 VTVAALQMACS-DDVETNIARAERLVREAAARGAQIILLPELFEGPYFCKDQ--DEDYFALAQPVEGHPAIKRFQALAKE 77 (279) T ss_pred CEEEEEECCCC-CCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCC--CHHHHHHHCCCCCCHHHHHHHHHHHH T ss_conf 98999945489-9999999999999999998849899938876677754455--44478740325798999999999987 Q ss_pred CCCEEEEEEEEEECCEEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCCCCCC-CCEEEECCEEEEECCCCCCCHHH Q ss_conf 6978999999988898789999996-997999887776235543332100377888-76147647456201111111016 Q gi|254780999|r 80 GGAGIVVGFPRQDQEGVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFISGYS-NDPIVFRDIRLGILICEDIWKNS 157 (562) Q Consensus 80 ~~i~iivG~p~~~~~~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~~G~~-~~~~~~~g~kiGv~ICeDlw~~~ 157 (562) +++.+++|++++.++++||+++++. +|+++++|+|+|||++..|+|++||.+|+. ..+|++.++|+|++||||+|+|+ T Consensus 78 ~~i~i~~g~~e~~~~~~yNt~~vi~~~G~i~~~yrK~hlp~~~~~~E~~~f~~G~~~~~v~~~~~~kiG~~ICyD~~fPe 157 (279) T TIGR03381 78 LGVVIPVSFFEKAGNAYYNSLAMIDADGSVLGVYRKSHIPDGPGYQEKFYFRPGDTGFKVWDTRYGRIGVGICWDQWFPE 157 (279) T ss_pred CCEEEEEEEEEEECCEEEEEEEEECCCCCEEEEEEEECCCCCCCEEEEEEECCCCCCCCEEEECCCEEEEEEEECCCCCH T ss_conf 39599994478608823788999989980999998853899996347875148887661598589369876660435748 Q ss_pred HHHHHHHHCCCCEEEECCCC---CCCC--CCHHHHHHHHHHHHHHCCCEEEECCCCCCC----CCCCCCCCCEEECCCCE Q ss_conf 89999971477759931555---3335--807888788999854337046421356887----66210486435558985 Q gi|254780999|r 158 NICKHLKKQGAEFLFSLNAS---PYYH--NKLKKRHEIVTGQISHVHLPIIYVNQVGGQ----DELIFDGASFCFDGQQQ 228 (562) Q Consensus 158 ~~~~~la~~Gadlii~psAS---p~~~--~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~----d~lvf~G~S~I~d~~G~ 228 (562) ..+.++++||++|++|+|+ |+.. .....+..++++||.+|+++++++|++|.+ ++..|.|+|+|++|+|+ T Consensus 158 -~~r~la~~Ga~ii~~Psa~~~~p~~~~~~~~~~w~~~~~arA~en~~~vv~an~~G~~~~~~~~~~f~G~S~Iv~P~G~ 236 (279) T TIGR03381 158 -TARAMALMGAEVLFYPTAIGSEPHDPDLDSRDHWQRVMQGHAAANLVPVVAANRIGTEIGDGGEQTFYGSSFIADHTGE 236 (279) T ss_pred -HHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCEEEECCCCCCCCCCCCCCCCCCCEEEECCCCC T ss_conf -9999998699799988414687776556628999999999999829536510310035788887658498389999999 Q ss_pred EEEECCCCCCCEEEEEEEECCCCCCC Q ss_conf 99961677763799999833521236 Q gi|254780999|r 229 LAFQMKHFSEQNFMTEWHYDQQLSQW 254 (562) Q Consensus 229 ii~~~~~f~e~~~i~d~d~d~~~~~~ 254 (562) ++++++.++|+++++|+|++.....+ T Consensus 237 ila~~~~~~e~~i~a~iDl~~i~~~R 262 (279) T TIGR03381 237 LVAEAGRSEEAVLVATFDLDEIAKQR 262 (279) T ss_pred EEEECCCCCCEEEEEEEEHHHHHHHH T ss_conf 98766899878999997599999999 No 18 >cd07573 CPA N-carbamoylputrescine amidohydrolase (CPA) (class 11 nitrilases). CPA (EC 3.5.1.53, also known as N-carbamoylputrescine amidase and carbamoylputrescine hydrolase) converts N-carbamoylputrescine to putrescine, a step in polyamine biosynthesis in plants and bacteria. This subgroup includes Arabidopsis thaliana CPA, also known as nitrilase-like 1 (NLP1), and Pseudomonas aeruginosa AguB. This subgroup belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 11. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer; P. aeruginosa AugB is a homohexamer, Arabidopsis thaliana NLP1 is a homooctomer. Probab=100.00 E-value=0 Score=375.64 Aligned_cols=246 Identities=24% Similarity=0.342 Sum_probs=211.8 Q ss_pred EEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHH-----HHHHHHHHHHHHH Q ss_conf 28999963883688999999999999999985998999166444799844541488999999-----9999999999887 Q gi|254780999|r 5 LKIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQAC-----SSAIDTLKSDTHD 79 (562) Q Consensus 5 mkIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~-----~~~l~~La~~~~~ 79 (562) ||||++|+++. +|+++|++++.+++++|+++|||||||||+++|||++.+. ...+.+.. ...++.|++.+++ T Consensus 1 mkiA~~Q~~~~-~d~~~Nl~~~~~~i~~A~~~ga~lvvfPE~~~tgy~~~~~--~~~~~~~a~~~~~~~~~~~l~~~A~~ 77 (284) T cd07573 1 VTVALVQMACS-EDPEANLAKAEELVREAAAQGAQIVCLQELFETPYFCQEE--DEDYFDLAEPPIPGPTTARFQALAKE 77 (284) T ss_pred CEEEEEECCCC-CCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCC--CHHHHHHHHHCCCCHHHHHHHHHHHH T ss_conf 98999967189-9999999999999999998859799978883348981214--64789874231799899999999998 Q ss_pred CCCEEEEEEEEEEC-CEEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCCCCCC-CCEEEECCEEEEECCCCCCCHH Q ss_conf 69789999999888-98789999996-997999887776235543332100377888-7614764745620111111101 Q gi|254780999|r 80 GGAGIVVGFPRQDQ-EGVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFISGYS-NDPIVFRDIRLGILICEDIWKN 156 (562) Q Consensus 80 ~~i~iivG~p~~~~-~~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~~G~~-~~~~~~~g~kiGv~ICeDlw~~ 156 (562) ++++|++|++++.+ +.+||++++|+ +|++++.|+|+|||++..|+|++||++|+. ..+|++.++|+|++||||+|+| T Consensus 78 ~~i~i~~g~~e~~~~~~~~Nta~~i~~~G~i~~~yrK~hLp~~~~~~E~~~f~~G~~~~~v~~~~~g~iG~~ICyD~~fP 157 (284) T cd07573 78 LGVVIPVSLFEKRGNGLYYNSAVVIDADGSLLGVYRKMHIPDDPGYYEKFYFTPGDTGFKVFDTRYGRIGVLICWDQWFP 157 (284) T ss_pred HCCEEEECCEEECCCCCEEEEEEEECCCCCEEEEEEEECCCCCCCEEEEEECCCCCCCCEEEEECCCEEEEEEEECCCCC T ss_conf 29189963157769981798899986899446448720106899563664012688652058508855875475000473 Q ss_pred HHHHHHHHHCCCCEEEECCCCCC-------CCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCC----CCCCCCCCEEECC Q ss_conf 68999997147775993155533-------358078887889998543370464213568876----6210486435558 Q gi|254780999|r 157 SNICKHLKKQGAEFLFSLNASPY-------YHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQD----ELIFDGASFCFDG 225 (562) Q Consensus 157 ~~~~~~la~~Gadlii~psASp~-------~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d----~lvf~G~S~I~d~ 225 (562) + .++.++++|||+|++|+|..+ .....+.+..++++||.+|++|++++|++|.++ .+.|.|+|+|++| T Consensus 158 E-~~r~la~~Ga~iil~PsA~g~~~~~~~~~~~~~~~w~~l~~arA~eN~~yvv~~N~vG~~~~~~~~~~f~G~S~Iv~P 236 (284) T cd07573 158 E-AARLMALQGAEILFYPTAIGSEPQEPPEGLDQRDAWQRVQRGHAIANGVPVAAVNRVGVEGDPGSGITFYGSSFIADP 236 (284) T ss_pred H-HHHHHHHCCCCEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCEECCCEEEEEC T ss_conf 9-999999869909997841467766655553369999999999887628827730233357889998407487289938 Q ss_pred CCEEEEECCCCCCCEEEEEEEECCCCCCC Q ss_conf 98599961677763799999833521236 Q gi|254780999|r 226 QQQLAFQMKHFSEQNFMTEWHYDQQLSQW 254 (562) Q Consensus 226 ~G~ii~~~~~f~e~~~i~d~d~d~~~~~~ 254 (562) +|+++++++.++|+++++|+|++.....+ T Consensus 237 ~G~vla~~~~~~e~~i~adiDl~~l~~~R 265 (284) T cd07573 237 FGEILAQASRDEEEILVAEFDLDEIEEVR 265 (284) T ss_pred CCCEEEECCCCCCEEEEEEEEHHHHHHHH T ss_conf 99898734899878999997099999999 No 19 >cd07580 nitrilase_2 Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases). The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). Class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup represents either a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=100.00 E-value=0 Score=370.91 Aligned_cols=243 Identities=21% Similarity=0.241 Sum_probs=208.1 Q ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHH--HHHHHHHHHHHHHCCCE Q ss_conf 8999963883688999999999999999985998999166444799844541488999999--99999999998876978 Q gi|254780999|r 6 KIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQAC--SSAIDTLKSDTHDGGAG 83 (562) Q Consensus 6 kIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~--~~~l~~La~~~~~~~i~ 83 (562) |||++|+++++||+++|++++.+++++|+++||||||||||++|||++.|.-......+.. ...++.|++.+++++++ T Consensus 1 riA~~Q~~~~~gd~~~Nl~k~~~~i~~A~~~ga~lvvfPE~~ltGy~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~~i~ 80 (268) T cd07580 1 RVACVQFDPRVGDLDANLARSIELIREAADAGANLVVLPELANTGYVFESRDEAFALAEEVPDGASTRAWAELAAELGLY 80 (268) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHCCEE T ss_conf 98999898987999999999999999999888919998888411698468899999865427988999999999972949 Q ss_pred EEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEECCCCCCCCCCCCCCCCC-CCEEEECCEEEEECCCCCCCHHHHHHHH Q ss_conf 999999988898789999996997999887776235543332100377888-7614764745620111111101689999 Q gi|254780999|r 84 IVVGFPRQDQEGVLNSVVILDAGNIIAVRDKINLPNYSEFHEKRTFISGYS-NDPIVFRDIRLGILICEDIWKNSNICKH 162 (562) Q Consensus 84 iivG~p~~~~~~~yNsa~vi~~G~il~~y~K~~LP~y~~FdE~r~F~~G~~-~~~~~~~g~kiGv~ICeDlw~~~~~~~~ 162 (562) +++|++++.++++|||+++|.+|.++++|+|+||++ +|++||.+|+. ..+|+++++|+|++||||+|+|+ .++. T Consensus 81 iv~g~~e~~~~~~yNs~~~i~~~g~i~~yrK~hl~~----~E~~~f~~G~~~~~v~~~~~g~iG~~IC~D~~fPe-~~r~ 155 (268) T cd07580 81 IVAGFAERDGDRLYNSAVLVGPDGVIGTYRKAHLWN----EEKLLFEPGDLGLPVFDTPFGRIGVAICYDGWFPE-TFRL 155 (268) T ss_pred EEEEEEECCCCEEEEEEEEECCCCEEEEEECCCCCC----CHHHEEECCCCCCEEEEECCCEEEEEEECCCCCHH-HHHH T ss_conf 998357732996999899963995899995356897----07854857998703898078579998840113709-9999 Q ss_pred HHHCCCCEEEECCCCCCCCCCHH----HHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEECCCCEEEEE-CCCCC Q ss_conf 97147775993155533358078----88788999854337046421356887662104864355589859996-16777 Q gi|254780999|r 163 LKKQGAEFLFSLNASPYYHNKLK----KRHEIVTGQISHVHLPIIYVNQVGGQDELIFDGASFCFDGQQQLAFQ-MKHFS 237 (562) Q Consensus 163 la~~Gadlii~psASp~~~~k~~----~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~G~ii~~-~~~f~ 237 (562) ++.+|||+|++|+++|....... .+..+.++||.+|+++++++|++|.+++..|+|+|+|+||+|+++++ .+.++ T Consensus 156 la~~Ga~ii~~p~~~~~~~~~~~~~~~~~~~l~~arA~en~~~v~~~n~~G~~~~~~~~G~S~IvdP~G~vla~~~~~~~ 235 (268) T cd07580 156 LALQGADIVCVPTNWVPMPRPPEGGPPMANILAMAAAHSNGLFIACADRVGTERGQPFIGQSLIVGPDGWPLAGPASGDE 235 (268) T ss_pred HHHCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCEECCCCEEECCCCCEEEECCCCCC T ss_conf 99869979999266888677531036899999999998779889996776776896564888799999988841068998 Q ss_pred CCEEEEEEEECCCCCC Q ss_conf 6379999983352123 Q gi|254780999|r 238 EQNFMTEWHYDQQLSQ 253 (562) Q Consensus 238 e~~~i~d~d~d~~~~~ 253 (562) |+++++|+|++..... T Consensus 236 e~ll~adiDl~~~~~~ 251 (268) T cd07580 236 EEILLADIDLTAARRK 251 (268) T ss_pred CEEEEEEECHHHHHHH T ss_conf 7899999716999998 No 20 >cd07568 ML_beta-AS_like mammalian-like beta-alanine synthase (beta-AS) and similar proteins (class 5 nitrilases). This family includes mammalian-like beta-AS (EC 3.5.1.6, also known as beta-ureidopropionase or N-carbamoyl-beta-alanine amidohydrolase). This enzyme catalyzes the third and final step in the catabolic pyrimidine catabolic pathway responsible for the degradation of uracil and thymine, the hydrolysis of N-carbamyl-beta-alanine and N-carbamyl-beta-aminoisobutyrate to the beta-amino acids, beta-alanine and beta-aminoisobutyrate respectively. This family belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 5. Members of this superfamily generally form homomeric Probab=100.00 E-value=0 Score=369.58 Aligned_cols=250 Identities=17% Similarity=0.191 Sum_probs=214.7 Q ss_pred CCCEEEEEEECC-------CCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCC-HHHHHHH--HHHHH Q ss_conf 886289999638-------8368899999999999999998599899916644479984454148-8999999--99999 Q gi|254780999|r 2 LKKLKIAIAQLN-------PVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFK-KSFIQAC--SSAID 71 (562) Q Consensus 2 ~~~mkIal~Q~n-------~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~-~~f~~~~--~~~l~ 71 (562) ++++|||++|+. |.++|.++|++|+++++++|+++|||||||||+++|||.+.+.-.. ..+.+.. ...++ T Consensus 1 ~~~~~~~~~q~~~~~~~~aPv~~d~eanl~k~~~li~eAa~~Ga~LivfPE~~~tgy~~~~~~~~~~~~a~~~~~gp~~~ 80 (287) T cd07568 1 SRIVRVGLIQASNVIPTDAPIEKQKEAMIQKHVTMIREAAEAGAQIVCLQEIFYGPYFCAEQDTKWYEFAEEIPNGPTTK 80 (287) T ss_pred CCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCHHHHHHHCCCCCCHHHH T ss_conf 94699999914378898887123399999999999999998859199878886568776544447888617058988999 Q ss_pred HHHHHHHHCCCEEEEEEEEEE-CCEEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCCCCCC-CCEEEECCEEEEEC Q ss_conf 999998876978999999988-898789999996-997999887776235543332100377888-76147647456201 Q gi|254780999|r 72 TLKSDTHDGGAGIVVGFPRQD-QEGVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFISGYS-NDPIVFRDIRLGIL 148 (562) Q Consensus 72 ~La~~~~~~~i~iivG~p~~~-~~~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~~G~~-~~~~~~~g~kiGv~ 148 (562) .|++.++++++++++|+.++. ++.+||++++|. +|++++.|+|+|||+++.|.|++||.+|+. ..+|+++++|+|++ T Consensus 81 ~l~~~Ar~~~i~iv~g~~e~~~~g~lyNt~~~i~~~G~ii~~yrK~hlp~~~~~~E~~~f~~G~~~~~Vf~t~~g~iG~l 160 (287) T cd07568 81 RFAALAKEYNMVLILPIYEKEQGGTLYNTAAVIDADGTYLGKYRKNHIPHVGGFWEKFYFRPGNLGYPVFDTAFGKIGVY 160 (287) T ss_pred HHHHHHHHCCCEEEEEEEEECCCCCEEEEEEEECCCCCEEEEECCCCCCCCCCCCEEEEECCCCCCCEEEEECCEEEEEE T ss_conf 99999998592899635787699968999999999997966880134979998470035768898845999799069886 Q ss_pred CCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCC---CCCCCCCCEEECC Q ss_conf 1111110168999997147775993155533358078887889998543370464213568876---6210486435558 Q gi|254780999|r 149 ICEDIWKNSNICKHLKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQD---ELIFDGASFCFDG 225 (562) Q Consensus 149 ICeDlw~~~~~~~~la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d---~lvf~G~S~I~d~ 225 (562) ||||+|+|+ .++.++.+||++|++|+|++...+.. .+....++||.+|+++++++|++|.++ ...|.|+|+|++| T Consensus 161 ICyD~~fPE-~~r~la~~Ga~ii~~Psa~~~~~~~~-~w~~~~~arA~en~~yv~~~n~~G~e~~~~~~~~~G~S~Ii~P 238 (287) T cd07568 161 ICYDRHFPE-GWRALGLNGAEIVFNPSATVAGLSEY-LWKLEQPAAAVANGYFVGAINRVGTEAPWNIGEFYGSSYFVDP 238 (287) T ss_pred ECCCCCCHH-HHHHHHHCCCCEEEECCCCCCCCCHH-HHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCEEEECC T ss_conf 512325729-99999987997998377777775504-5778988888875936999545566678877856375089889 Q ss_pred CCEEEEECCCCCCCEEEEEEEECCCCCC Q ss_conf 9859996167776379999983352123 Q gi|254780999|r 226 QQQLAFQMKHFSEQNFMTEWHYDQQLSQ 253 (562) Q Consensus 226 ~G~ii~~~~~f~e~~~i~d~d~d~~~~~ 253 (562) +|+++++++.++|+++++|+|++..... T Consensus 239 ~G~via~~~~~~e~ii~adiDl~~~~~~ 266 (287) T cd07568 239 RGQFVASASRDKDELLVAELDLDLIREV 266 (287) T ss_pred CCCEEEECCCCCCEEEEEEECHHHHHHH T ss_conf 9988865379988899999658999999 No 21 >cd07581 nitrilase_3 Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases). The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). Class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup represents either a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=100.00 E-value=0 Score=367.32 Aligned_cols=239 Identities=21% Similarity=0.257 Sum_probs=209.9 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 99996388368899999999999999998599899916644479984454148899999999999999998876978999 Q gi|254780999|r 7 IAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTHDGGAGIVV 86 (562) Q Consensus 7 Ial~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~~~~i~iiv 86 (562) ||++|+++. ||+++|++++++++++|+++|||||||||++++||+..+..+.+.........++.|++.++++++++++ T Consensus 1 IA~~Q~~~~-gd~~~Nl~ki~~~i~~A~~~gadlvvfPE~~l~~~~~~~~~~~~~a~~~~~~~~~~l~~~A~~~~i~iv~ 79 (255) T cd07581 1 VALAQFASS-GDKEENLEKVRRLLAEAAAAGADLVVFPEYTMARFGDGLDDYARVAEPLDGPFVSALARLARELGITVVA 79 (255) T ss_pred CEEEECCCC-CCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHCCCEEEE T ss_conf 489994799-9999999999999999998839899989882005994377888764246869999999999877997998 Q ss_pred EEEEEEC-CEEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCCCCCCCC--EEEECCEEEEECCCCCCCHHHHHHHH Q ss_conf 9999888-98789999996-99799988777623554333210037788876--14764745620111111101689999 Q gi|254780999|r 87 GFPRQDQ-EGVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFISGYSND--PIVFRDIRLGILICEDIWKNSNICKH 162 (562) Q Consensus 87 G~p~~~~-~~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~--~~~~~g~kiGv~ICeDlw~~~~~~~~ 162 (562) |++++.+ +++||++++|. +|++++.|+|+||++...|+|++||.+|+... +|+++++|+|++||||+|+|+ +++. T Consensus 80 G~~e~~~~~~~yNta~~i~~~G~i~~~y~K~hL~~~~~~~E~~~f~~G~~~~~~~~~~~g~~iG~~IC~D~~fPe-~~r~ 158 (255) T cd07581 80 GMFEPAGDGRVYNTLVVVGPDGEIIAVYRKIHLYDAFGFRESDTVAPGDELPPVVFVVGGVKVGLATCYDLRFPE-LARA 158 (255) T ss_pred EEEEECCCCCEEEEEEEEECCCCEEEEEEEEECCCCCCCCEEEEEECCCCCCEEEEEECCEEEEEEEEEECCCCH-HHHH T ss_conf 003788999489999999689965889953425777887623457359987518998299589978877067819-9999 Q ss_pred HHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEECCCCEEEEECCCCCCCEEE Q ss_conf 97147775993155533358078887889998543370464213568876621048643555898599961677763799 Q gi|254780999|r 163 LKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDELIFDGASFCFDGQQQLAFQMKHFSEQNFM 242 (562) Q Consensus 163 la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~G~ii~~~~~f~e~~~i 242 (562) ++++||++|++|++|+....+.+.|..++++||.||++|++++|++|.+ |.|+|+|++|+|+++++++. +|++++ T Consensus 159 la~~Ga~li~~psa~~~~~~~~~~w~~~~~arA~en~~~vv~~n~~G~~----~~G~S~Iv~P~G~ila~~~~-~E~~i~ 233 (255) T cd07581 159 LALAGADVIVVPAAWVAGPGKEEHWETLLRARALENTVYVAAAGQAGPR----GIGRSMVVDPLGVVLADLGE-REGLLV 233 (255) T ss_pred HHHCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCC----CCCEEEEECCCCCEEEECCC-CCCEEE T ss_conf 9987997999877888988759999999999999659879999471799----74159999899749775599-982899 Q ss_pred EEEEECCCCC Q ss_conf 9998335212 Q gi|254780999|r 243 TEWHYDQQLS 252 (562) Q Consensus 243 ~d~d~d~~~~ 252 (562) +|+|++.... T Consensus 234 a~iDl~~~~~ 243 (255) T cd07581 234 ADIDPERVEE 243 (255) T ss_pred EEECHHHHHH T ss_conf 9954999999 No 22 >cd07572 nit Nit1, Nit 2, and related proteins, and the Nit1-like domain of NitFhit (class 10 nitrilases). This subgroup includes mammalian Nit1 and Nit2, the Nit1-like domain of the invertebrate NitFhit, and various uncharacterized bacterial and archaeal Nit-like proteins. Nit1 and Nit2 are candidate tumor suppressor proteins. In NitFhit, the Nit1-like domain is encoded as a fusion protein with the non-homologous tumor suppressor, fragile histidine triad (Fhit). Mammalian Nit1 and Fhit may affect distinct signal pathways, and both may participate in DNA damage-induced apoptosis. Nit1 is a negative regulator in T cells. Overexpression of Nit2 in HeLa cells leads to a suppression of cell growth through cell cycle arrest in G2. These Nit proteins and the Nit1-like domain of NitFhit belong to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in t Probab=100.00 E-value=0 Score=370.15 Aligned_cols=243 Identities=21% Similarity=0.268 Sum_probs=211.1 Q ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 89999638836889999999999999999859989991664447998445414889999999999999999887697899 Q gi|254780999|r 6 KIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTHDGGAGIV 85 (562) Q Consensus 6 kIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~~~~i~ii 85 (562) |||++|+++. +|+++|++++++++++|+++|||||||||++++||................+.++.|++.+++++++++ T Consensus 1 rIA~~Q~~~~-~d~~~Nl~k~~~~i~~Aa~~gadlivfPE~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~A~~~~i~iv 79 (265) T cd07572 1 RVALIQMTST-ADKEANLARAKELIEEAAAQGAKLVVLPECFNYPGGTDAFKLALAEEEGDGPTLQALSELAKEHGIWLV 79 (265) T ss_pred CEEEEECCCC-CCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHCCCCCCHHHHHHHHHHHHHCCEEE T ss_conf 9999968898-899999999999999999885979998865247788603556652454799899999999998098899 Q ss_pred EE-EEEEE--CCEEEEEEEEEE-CCEEEEEEEEEEE-----CCCCCCCCCCCCCCCCCCCEEEECCEEEEECCCCCCCHH Q ss_conf 99-99988--898789999996-9979998877762-----355433321003778887614764745620111111101 Q gi|254780999|r 86 VG-FPRQD--QEGVLNSVVILD-AGNIIAVRDKINL-----PNYSEFHEKRTFISGYSNDPIVFRDIRLGILICEDIWKN 156 (562) Q Consensus 86 vG-~p~~~--~~~~yNsa~vi~-~G~il~~y~K~~L-----P~y~~FdE~r~F~~G~~~~~~~~~g~kiGv~ICeDlw~~ 156 (562) +| ++++. ++++||++++|. +|+++++|+|+|| |++..|+|.+||.+|+...+|+++++|+|++||||+|+| T Consensus 80 ~G~~~e~~~~~g~~yNsa~vi~~~G~i~~~yrK~HLf~~~~p~~~~~~E~~~f~~G~~~~vf~~~~~riG~~ICyD~~fP 159 (265) T cd07572 80 GGSIPERDDDDGKVYNTSLVFDPDGELVARYRKIHLFDVDVPGGISYRESDTLTPGDEVVVVDTPFGKIGLGICYDLRFP 159 (265) T ss_pred EEEEEEEECCCCEEEEEEEEECCCCCEEEEEEEEEEECCCCCCCCCEEEEEEEECCCEEEEECCCCCCCCEEEEECCCCC T ss_conf 52489985789948999999879986887861378603468997623013368779852450227843412676044584 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCC-CCCCCCCCCEEECCCCEEEEECCC Q ss_conf 6899999714777599315553335807888788999854337046421356887-662104864355589859996167 Q gi|254780999|r 157 SNICKHLKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQ-DELIFDGASFCFDGQQQLAFQMKH 235 (562) Q Consensus 157 ~~~~~~la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~-d~lvf~G~S~I~d~~G~ii~~~~~ 235 (562) + +++.++.+|||+|++|+|+++..++.+ +..++++||.||+++++++|++|.+ +...|.|+|+|+||+|+++++++. T Consensus 160 E-~~r~la~~Ga~ii~~Psa~~~~~g~~~-w~~~~~aRA~en~~~v~~~N~~G~~~~~~~~~G~S~I~~P~G~ila~~~~ 237 (265) T cd07572 160 E-LARALARQGADILTVPAAFTMTTGPAH-WELLLRARAIENQCYVVAAAQAGDHEAGRETYGHSMIVDPWGEVLAEAGE 237 (265) T ss_pred H-HHHHHHHCCCCEEEECCCCCCCCCHHH-HHHHHHHHHHHCCCEEEEECCCCCCCCCCEECCCEEEECCCCCEEEECCC T ss_conf 9-899877649969997453577752889-99999999987794698414644589985870335999599988876599 Q ss_pred CCCCEEEEEEEECCCCC Q ss_conf 77637999998335212 Q gi|254780999|r 236 FSEQNFMTEWHYDQQLS 252 (562) Q Consensus 236 f~e~~~i~d~d~d~~~~ 252 (562) +|+++++|+|++.... T Consensus 238 -~E~ll~adiDl~~i~~ 253 (265) T cd07572 238 -GEGVVVAEIDLDRLEE 253 (265) T ss_pred -CCEEEEEEECHHHHHH T ss_conf -9879999964999999 No 23 >cd07197 nitrilase Nitrilase superfamily, including nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes. This superfamily (also known as the C-N hydrolase superfamily) contains hydrolases that break carbon-nitrogen bonds; it includes nitrilases, cyanide dihydratases, aliphatic amidases, N-terminal amidases, beta-ureidopropionases, biotinidases, pantotheinase, N-carbamyl-D-amino acid amidohydrolases, the glutaminase domain of glutamine-dependent NAD+ synthetase, apolipoprotein N-acyltransferases, and N-carbamoylputrescine amidohydrolases, among others. These enzymes depend on a Glu-Lys-Cys catalytic triad, and work through a thiol acylenzyme intermediate. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. These oligomers include dimers, tetramers, hexamers, octamers, tetradecamers, octadecamers, as well as variable length helical arrangements and homo-oligomeric spirals. These proteins have roles in vitamin and Probab=100.00 E-value=0 Score=360.50 Aligned_cols=238 Identities=28% Similarity=0.423 Sum_probs=216.0 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHH-HHHHHHHHHHHHHCCCEEE Q ss_conf 999963883688999999999999999985998999166444799844541488999999-9999999999887697899 Q gi|254780999|r 7 IAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQAC-SSAIDTLKSDTHDGGAGIV 85 (562) Q Consensus 7 Ial~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~-~~~l~~La~~~~~~~i~ii 85 (562) ||++|+++.+||+++|++++++++++|+++|+|||||||+++|||++.+......+.+.. ...++.|++.+++++++++ T Consensus 1 IA~~Q~~~~~~d~~~Nl~~i~~~i~~A~~~gadlvv~PE~~l~Gy~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~~i~i~ 80 (253) T cd07197 1 IAAVQLAPKIGDVEANLAKALRLIKEAAEQGADLIVLPELFLTGYSFESAKEDLDLAEELDGPTLEALAELAKELGIYIV 80 (253) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCHHHHHHCHHHHHHCCCHHHHHHHHHHHHCCCEEE T ss_conf 09999998768899999999999999998859099958873035983565557766543772999999999988097899 Q ss_pred EEEEEEECCEEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 9999988898789999996-997999887776235543332100377888761476474562011111110168999997 Q gi|254780999|r 86 VGFPRQDQEGVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFISGYSNDPIVFRDIRLGILICEDIWKNSNICKHLK 164 (562) Q Consensus 86 vG~p~~~~~~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~~~~~~g~kiGv~ICeDlw~~~~~~~~la 164 (562) +|++++.++++||+++++. +|+++++|+|+|||+ |+|.+||.+|+...+|+++++|+|++||+|+|+|+ +++.++ T Consensus 81 ~G~~e~~~~~~yNs~~~i~~~G~~i~~y~K~~l~~---~~E~~~~~~G~~~~v~~~~~~~ig~~IC~D~~~pe-~~r~~~ 156 (253) T cd07197 81 AGIAEKDGDKLYNTAVVIDPDGEIIGKYRKIHLFD---FGERRYFSPGDEFPVFDTPGGKIGLLICYDLRFPE-LARELA 156 (253) T ss_pred EEEEEEECCEEEEEEEEECCCCCEEEEEEECCCCC---CCHHHCCCCCCCCEEEEECCEEEEEEEECCCCCHH-HHHHHH T ss_conf 73689439988898999879986898887569999---41420337999754776299689866720535779-999999 Q ss_pred HCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEECCCCEEEEECCCCCCCEEEEE Q ss_conf 14777599315553335807888788999854337046421356887662104864355589859996167776379999 Q gi|254780999|r 165 KQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDELIFDGASFCFDGQQQLAFQMKHFSEQNFMTE 244 (562) Q Consensus 165 ~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~G~ii~~~~~f~e~~~i~d 244 (562) .+|||+|++|+|+|+.. ...+..++++||.+|+++++++|++|.+++..|.|+|+|++|+|+++++++. .|+++++| T Consensus 157 ~~Ga~lil~psa~~~~~--~~~~~~~~~arA~en~~~vv~~n~vG~~~~~~~~G~S~Ii~P~G~~l~~~~~-~e~~i~a~ 233 (253) T cd07197 157 LKGADIILVPAAWPTAR--REHWELLLRARAIENGVYVVAANRVGEEGGLEFAGGSMIVDPDGEVLAEASE-EEGILVAE 233 (253) T ss_pred HCCCCEEEEECCCCCCC--CHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCC-CCEEEEEE T ss_conf 75999999926789987--2145677888888549779997663368996044657999599829863599-98299999 Q ss_pred EEECCCC Q ss_conf 9833521 Q gi|254780999|r 245 WHYDQQL 251 (562) Q Consensus 245 ~d~d~~~ 251 (562) +|++... T Consensus 234 idl~~~~ 240 (253) T cd07197 234 LDLDELR 240 (253) T ss_pred ECHHHHH T ss_conf 7099999 No 24 >cd07577 Ph0642_like Pyrococcus horikoshii Ph0642 and related proteins, members of the nitrilase superfamily (putative class 13 nitrilases). Uncharacterized subgroup of the nitrilase superfamily. This superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. Pyrococcus horikoshii Ph0642 is a hypothetical protein belonging to this subgroup. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). This subgroup was classified as belonging to class 13, which represents proteins that at the time were difficult to place in a distinct similarity group. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=100.00 E-value=0 Score=362.65 Aligned_cols=234 Identities=23% Similarity=0.298 Sum_probs=200.1 Q ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHH--HHHHHHHHHHHHHCCCE Q ss_conf 8999963883688999999999999999985998999166444799844541488999999--99999999998876978 Q gi|254780999|r 6 KIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQAC--SSAIDTLKSDTHDGGAG 83 (562) Q Consensus 6 kIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~--~~~l~~La~~~~~~~i~ 83 (562) |||++|+++.+||+++|++++.+++++| +|||||||||++|||+..+.-....+.+.. ...++.|++.+++++++ T Consensus 1 kIA~~Q~~~~~gd~~~Nl~k~~~~i~~a---~adlvvfPEl~ltGY~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~~i~ 77 (259) T cd07577 1 KVGYVQFNPKFGEVEKNLKKVESLIKGV---EADLIVLPELFNTGYAFTSKEEVASLAESIPDGPTTRFLQELARETGAY 77 (259) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHC---CCCEEECCCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHCCCE T ss_conf 9999989898899999999999999975---9989997886102688899899998742536988999999999972958 Q ss_pred EEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEECCCCCCCCCCCCCCCCC-CCEEEECCEEEEECCCCCCCHHHHHHHH Q ss_conf 999999988898789999996997999887776235543332100377888-7614764745620111111101689999 Q gi|254780999|r 84 IVVGFPRQDQEGVLNSVVILDAGNIIAVRDKINLPNYSEFHEKRTFISGYS-NDPIVFRDIRLGILICEDIWKNSNICKH 162 (562) Q Consensus 84 iivG~p~~~~~~~yNsa~vi~~G~il~~y~K~~LP~y~~FdE~r~F~~G~~-~~~~~~~g~kiGv~ICeDlw~~~~~~~~ 162 (562) +++|+|++.++++||+++++.+|.+++.|+|+||. |+|++||.+|+. ..+|+++++|+|++||||+|+|+ +++. T Consensus 78 iv~G~~e~~~~~~yNs~~~i~~~g~i~~y~K~hl~----~~E~~~f~~G~~~~~v~~~~~~kiG~~IC~D~~fPe-~~r~ 152 (259) T cd07577 78 IVAGLPERDGDKFYNSAVVVGPEGYIGIYRKTHLF----YEEKLFFEPGDTGFRVFDIGDIRIGVMICFDWYFPE-AART 152 (259) T ss_pred EEEEEEEEECCEEEEEEEEECCCCEEEEEECCCCC----CCCCEEEECCCCCCEEEEECCEEEEEEEEHHCCCHH-HHHH T ss_conf 99979997799899999998199338999654799----752128827997504997799898645782117719-9999 Q ss_pred HHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCC----CCCCCCCCCCEEECCCCEEEEECCCCCC Q ss_conf 971477759931555333580788878899985433704642135688----7662104864355589859996167776 Q gi|254780999|r 163 LKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGG----QDELIFDGASFCFDGQQQLAFQMKHFSE 238 (562) Q Consensus 163 la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg----~d~lvf~G~S~I~d~~G~ii~~~~~f~e 238 (562) ++.+|||+|++|++++.. .+...+++||.+|+++++++|++|. .+.+.|+|+|+|++|+|+++++++.++| T Consensus 153 la~~Ga~ii~~ps~~~~~-----~~~~~~~arA~en~~~vv~~n~vG~~~~~~~~~~f~G~S~i~~P~G~ila~~~~~~E 227 (259) T cd07577 153 LALKGADIIAHPANLVLP-----YCPKAMPIRALENRVFTITANRIGTEERGGETLRFIGKSQITSPKGEVLARAPEDGE 227 (259) T ss_pred HHHCCCCEEEEEECCCCH-----HHHHHHHHHHHHCCCEEEEEEEEECCCCCCCCCEECCCEEEECCCCCEEEECCCCCC T ss_conf 998799899990046872-----159999999997498699997555157898873475894999899989887799988 Q ss_pred CEEEEEEEECCCCC Q ss_conf 37999998335212 Q gi|254780999|r 239 QNFMTEWHYDQQLS 252 (562) Q Consensus 239 ~~~i~d~d~d~~~~ 252 (562) +++++|+|++.... T Consensus 228 ~~l~adiDl~~~~~ 241 (259) T cd07577 228 EVLVAEIDPRLARD 241 (259) T ss_pred EEEEEEEEHHHHHH T ss_conf 89999970699898 No 25 >cd07569 DCase N-carbamyl-D-amino acid amidohydrolase (DCase, class 6 nitrilases). DCase hydrolyses N-carbamyl-D-amino acids to produce D-amino acids. It is an important biocatalyst in the pharmaceutical industry, producing useful D-amino acids for example in the preparation of beta-lactam antibiotics. This subgroup belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 6. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Agrobacterium radiobacter DCase forms a tetramer (dimer of dimers). Some DCases may form trimers. Probab=100.00 E-value=0 Score=360.09 Aligned_cols=252 Identities=18% Similarity=0.242 Sum_probs=210.2 Q ss_pred CCCEEEEEEECCCCCC--CHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHH----HHHH--HHHHHHH Q ss_conf 8862899996388368--899999999999999998599899916644479984454148899----9999--9999999 Q gi|254780999|r 2 LKKLKIAIAQLNPVVG--DIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSF----IQAC--SSAIDTL 73 (562) Q Consensus 2 ~~~mkIal~Q~n~~~g--d~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f----~~~~--~~~l~~L 73 (562) .+|||||++|++|+.. |.++|++|++++|++|+++||+|||||||++|||.|..+...... .+.. ...++.| T Consensus 1 ~~~~rvA~~Q~~p~~~~~~~e~nl~k~~~~i~~Aa~~GA~LvvfPE~~~tgy~p~~~~~~~~~~~~~~e~~~~gp~~~~l 80 (302) T cd07569 1 SRQVILAAAQMGPIARAETRESVVARLIALLEEAASRGAQLVVFPELALTTFFPRWYFPDEAELDSFFETEMPNPETQPL 80 (302) T ss_pred CCEEEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHH T ss_conf 92789999900267786459999999999999999880999987899766887443458999999766431899899999 Q ss_pred HHHHHHCCCEEEEEEEEEEC----CEEEEEEEEEE-CCEEEEEEEEEEECCCCCCC--------CCCCCCCCCC-CCEEE Q ss_conf 99988769789999999888----98789999996-99799988777623554333--------2100377888-76147 Q gi|254780999|r 74 KSDTHDGGAGIVVGFPRQDQ----EGVLNSVVILD-AGNIIAVRDKINLPNYSEFH--------EKRTFISGYS-NDPIV 139 (562) Q Consensus 74 a~~~~~~~i~iivG~p~~~~----~~~yNsa~vi~-~G~il~~y~K~~LP~y~~Fd--------E~r~F~~G~~-~~~~~ 139 (562) ++.++++++.+++|++++.. +++||++++|. +|++++.|+|+|||++.+|+ |++||.+|+. ..+|+ T Consensus 81 ~~~A~~~~i~v~~G~~e~~~~~~~~~~yNt~~~i~~~G~il~~yrK~HLp~~~~~~~~~~~~~~E~~~f~~Gd~~~~V~~ 160 (302) T cd07569 81 FDRAKELGIGFYLGYAELTEDGGVKRRFNTSILVDKSGKIVGKYRKVHLPGHKEPEPYRPFQHLEKRYFEPGDLGFPVFR 160 (302) T ss_pred HHHHHHCCCEEEECCCCCCCCCCCCCEEEEEEEECCCCCEEEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCCEEEE T ss_conf 99999849899965641345567864277899998998499888701178865545787644331467658987640476 Q ss_pred ECCEEEEECCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCC----CCCHHHH----HHHHHHHHHHCCCEEEECCCCCCC Q ss_conf 64745620111111101689999971477759931555333----5807888----788999854337046421356887 Q gi|254780999|r 140 FRDIRLGILICEDIWKNSNICKHLKKQGAEFLFSLNASPYY----HNKLKKR----HEIVTGQISHVHLPIIYVNQVGGQ 211 (562) Q Consensus 140 ~~g~kiGv~ICeDlw~~~~~~~~la~~Gadlii~psASp~~----~~k~~~R----~~~~~~~a~e~~~~vvy~N~vGg~ 211 (562) +.++|+|++||||+|+|+ ..+.++.+||+||+++..+|.. ......| .-.+++||.+|+++++++|++|.+ T Consensus 161 t~~griG~~ICyD~~fPE-~~r~la~~GA~li~~~~~~p~~~~~~~~~~~~~~~~~~l~~~arA~en~~~vv~~n~~G~~ 239 (302) T cd07569 161 VPGGIMGMCICNDRRWPE-TWRVMGLQGVELVLLGYNTPTHNPPAPEHDHLRLFHNLLSMQAGAYQNGTWVVAAAKAGME 239 (302) T ss_pred ECCEEEEEEEECCCCCHH-HHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCC T ss_conf 388379998604656749-9999997799599962677666888664356789999999863255347717850456677 Q ss_pred CCCCCCCCCEEECCCCEEEEECCCCCCCEEEEEEEECCCCCCC Q ss_conf 6621048643555898599961677763799999833521236 Q gi|254780999|r 212 DELIFDGASFCFDGQQQLAFQMKHFSEQNFMTEWHYDQQLSQW 254 (562) Q Consensus 212 d~lvf~G~S~I~d~~G~ii~~~~~f~e~~~i~d~d~d~~~~~~ 254 (562) +...|.|+|+|+||+|+++++++..+|+++++|+|++.....+ T Consensus 240 ~~~~~~G~S~Ii~P~G~vlaea~~~~e~ll~AdlDl~~~~~~R 282 (302) T cd07569 240 DGCDLIGGSCIVAPTGEIVAQATTLEDEVIVADCDLDLCREGR 282 (302) T ss_pred CCCEEECCCEEECCCCCEEEECCCCCCEEEEEEECHHHHHHHH T ss_conf 8973763418998999987604889986999995699999998 No 26 >PRK10438 hypothetical protein; Provisional Probab=100.00 E-value=0 Score=353.63 Aligned_cols=243 Identities=17% Similarity=0.229 Sum_probs=204.7 Q ss_pred CCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 88628999963883688999999999999999985998999166444799844541488999999999999999988769 Q gi|254780999|r 2 LKKLKIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTHDGG 81 (562) Q Consensus 2 ~~~mkIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~~~~ 81 (562) |++||||++|+++..+|+++|++++.+++++| +|+|||||||||+|||..++... . . ...+.++.+.+.+++.+ T Consensus 1 M~~mkial~Q~~~~~~D~~~Nl~~~~~~i~~a--~gadlivlPElf~tGy~~~~~~~--~-~-~~~~~~~~l~~~A~~~~ 74 (256) T PRK10438 1 MSGLKITLLQQPLVWMDGPANLRHFDRQLEGI--TGRDVIVLPEMFTTGFAMEAAAS--S-L-PQDDVVNWMTAKAQQTN 74 (256) T ss_pred CCCCEEEEEECCCCCCCHHHHHHHHHHHHHHC--CCCCEEECCCCCCCCCCCHHHHH--H-H-CCCHHHHHHHHHHHHCC T ss_conf 99778999956676379999999999999745--49999990875567887215566--4-2-17589999999998789 Q ss_pred CEEEEEEEEEECCEEEEEEEEEEC-CEEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCEEEEECCCCCCCHHHHHH Q ss_conf 789999999888987899999969-9799988777623554333210037788876147647456201111111016899 Q gi|254780999|r 82 AGIVVGFPRQDQEGVLNSVVILDA-GNIIAVRDKINLPNYSEFHEKRTFISGYSNDPIVFRDIRLGILICEDIWKNSNIC 160 (562) Q Consensus 82 i~iivG~p~~~~~~~yNsa~vi~~-G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~~~~~~g~kiGv~ICeDlw~~~~~~ 160 (562) +.|+.+.+++.+++.||+++++.+ |+ ++.|+|+|| ++.++|.+||.+|+...+|+++++|+|++||||+|||+ .+ T Consensus 75 ~~i~g~~~~~~~~~~~N~~~~i~~~G~-v~~YrK~HL--f~~~~E~~~f~~G~~~~v~~~~g~~iG~~ICYDlrFPe-~~ 150 (256) T PRK10438 75 ALIAGSVALQTESGAVNRFLLVEPGGT-VHFYDKRHL--FRMADEHLHYKAGNARVIVEWRGWRILPLVCYDLRFPV-WS 150 (256) T ss_pred EEEEEEEEEECCCCCEEEEEEEECCCE-EEEECCEEC--CCCCCCEEEECCCCCCEEEEECCCEEEEEEEECCCCHH-HH T ss_conf 099999989729972899999928966-988522405--89888201587999707999789137998862346849-99 Q ss_pred HHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCC-CCCCCCCCEEECCCCEEEEECCCCCCC Q ss_conf 9997147775993155533358078887889998543370464213568876-621048643555898599961677763 Q gi|254780999|r 161 KHLKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQD-ELIFDGASFCFDGQQQLAFQMKHFSEQ 239 (562) Q Consensus 161 ~~la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d-~lvf~G~S~I~d~~G~ii~~~~~f~e~ 239 (562) +. ++|++++++|+++|... ...+..++++||.||+++++++|++|.++ .+.|.|+|+|+||+|+++++++..++. T Consensus 151 r~--l~~~~~~~~~a~~p~~~--~~hW~~llrARAiENq~fv~~~N~~G~~~~g~~~~G~S~ivdP~G~vla~a~~~e~~ 226 (256) T PRK10438 151 RN--LNDYDLALYVANWPAPR--SLHWQTLLTARAIENQAYVAGCNRVGSDGNGHHYRGDSRIINPQGEIIATADPHQAT 226 (256) T ss_pred HH--HCCCCEEEEECCCCCCC--CHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCEEEECEEEECCCCCEEEECCCCCCE T ss_conf 96--47597999966787641--178999999999865828999714445899988841309994997053647899978 Q ss_pred EEEEEEEECCCCCCCCCCC Q ss_conf 7999998335212365557 Q gi|254780999|r 240 NFMTEWHYDQQLSQWNYMS 258 (562) Q Consensus 240 ~~i~d~d~d~~~~~~~~~~ 258 (562) ++++++|.+.....+...+ T Consensus 227 ~i~a~id~~~v~~~R~~~P 245 (256) T PRK10438 227 RIDAELSLVALQEYREKFP 245 (256) T ss_pred EEEEEECHHHHHHHHHHCC T ss_conf 9999951999999998787 No 27 >cd07578 nitrilase_1_R1 First nitrilase domain of an uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases). Members of this subgroup have two nitrilase domains. This is the first of those two domains. The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). Class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup represents either a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=100.00 E-value=0 Score=353.80 Aligned_cols=239 Identities=21% Similarity=0.222 Sum_probs=204.2 Q ss_pred EEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHH-HHHHHHHHHHHHHCCCE Q ss_conf 28999963883688999999999999999985998999166444799844541488999999-99999999998876978 Q gi|254780999|r 5 LKIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQAC-SSAIDTLKSDTHDGGAG 83 (562) Q Consensus 5 mkIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~-~~~l~~La~~~~~~~i~ 83 (562) .|||++|+++.+||+++|++++.+++++|+++||+|||||||++|||+..|.-....+.+.. ....+.+++.+++++++ T Consensus 1 fkvA~vQ~~~~~gd~~~Nl~~~~~~i~~A~~~ga~lvvfPEl~~tGy~~~~~~~~~~~~e~~~g~~~~~l~~~A~~~~i~ 80 (258) T cd07578 1 YKAAAIQFEPEMGEKERNIERLLALCEEAARAGARLIVTPEMATTGYCWYDRAEIAPFVEPIPGPTTARFAELAREHDCY 80 (258) T ss_pred CEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHCCCC T ss_conf 98999968887889999999999999999988697999898963479866988989877648988999999999873992 Q ss_pred EEEEEEEEE--CCEEEEEEEEEECCEEEEEEEEEEECCCCCCCCCCCCCCCCC-CCEEEECCEEEEECCCCCCCHHHHHH Q ss_conf 999999988--898789999996997999887776235543332100377888-76147647456201111111016899 Q gi|254780999|r 84 IVVGFPRQD--QEGVLNSVVILDAGNIIAVRDKINLPNYSEFHEKRTFISGYS-NDPIVFRDIRLGILICEDIWKNSNIC 160 (562) Q Consensus 84 iivG~p~~~--~~~~yNsa~vi~~G~il~~y~K~~LP~y~~FdE~r~F~~G~~-~~~~~~~g~kiGv~ICeDlw~~~~~~ 160 (562) +++|+++.. ++.+||+++++.++.++++|+|+|+. +.|++||.+|+. ..+|++.++|+|++||||+|+|+ ++ T Consensus 81 iv~G~~e~~~~~g~~yNt~~~i~~~G~i~~yrK~h~~----~~E~~~~~~G~~~~~vf~t~~gkiG~~ICyD~~fpE-~~ 155 (258) T cd07578 81 IVVGLPEVDSRSGIYYNSAVLIGPSGVIGRHRKTHPY----ISEPKWAADGDLGHQVFDTEIGRIALLICMDIHFFE-TA 155 (258) T ss_pred EEEEEEEEECCCCCEEEEEEEECCCCEEEEEECCCCC----CCCCCCCCCCCCCCEEEEECCCCEEEEEECCCCCHH-HH T ss_conf 7976546887999599999998399589998345776----342311048997623899589637767874653679-99 Q ss_pred HHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEECCCCEEEEECCCCCCCE Q ss_conf 99971477759931555333580788878899985433704642135688766210486435558985999616777637 Q gi|254780999|r 161 KHLKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDELIFDGASFCFDGQQQLAFQMKHFSEQN 240 (562) Q Consensus 161 ~~la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~G~ii~~~~~f~e~~ 240 (562) |.++++|||+|++|++++........ ..+||.||+++++++|++|.+++..|.|+|+|+||+|++++++.. +|++ T Consensus 156 R~la~~Ga~ii~~psa~~~~~~~~~~----~~arA~eN~~~vv~~n~~G~~~~~~~~G~S~Iv~P~G~vla~~~~-~e~i 230 (258) T cd07578 156 RLLALGGADVICHISNWLAERTPAPY----WINRAFENGCYLIESNRWGLERGVQFSGGSCIIEPDGTIQASIDS-GDGV 230 (258) T ss_pred HHHHHCCCCEEEECCCCCCCCCHHHH----HHHHHHHCCCEEEEECCCCCCCCEEEECCEEEECCCCCEEECCCC-CCEE T ss_conf 99998799899986678887872799----999999769879997743267880783323999899878852799-9869 Q ss_pred EEEEEEECCCCCC Q ss_conf 9999983352123 Q gi|254780999|r 241 FMTEWHYDQQLSQ 253 (562) Q Consensus 241 ~i~d~d~d~~~~~ 253 (562) +++|+|++..... T Consensus 231 ~~adiDl~~~~~~ 243 (258) T cd07578 231 ALGEIDLDRARHR 243 (258) T ss_pred EEEEEEHHHHHHH T ss_conf 9999607998998 No 28 >cd07575 Xc-1258_like Xanthomonas campestris XC1258 and related proteins, members of the nitrilase superfamily (putative class 13 nitrilases). Uncharacterized subgroup belonging to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup either represents a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. XC1258 is a homotetramer. Probab=100.00 E-value=0 Score=351.47 Aligned_cols=236 Identities=17% Similarity=0.249 Sum_probs=201.4 Q ss_pred EEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCEE Q ss_conf 28999963883688999999999999999985998999166444799844541488999999999999999988769789 Q gi|254780999|r 5 LKIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTHDGGAGI 84 (562) Q Consensus 5 mkIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~~~~i~i 84 (562) ||||++|+++.+||+++|++++.+++++|+ +||||||||||++|||++....+... .....++.|++.++++++.| T Consensus 1 mkial~Q~~~~~~d~e~Nl~~~~~~i~~a~-~~adlivfPEl~~tGy~~~~~~~ae~---~~~~~~~~l~~~A~~~~i~i 76 (252) T cd07575 1 LKIALIQTDLVWEDPEANLAHFEEKIEQLK-EKTDLIVLPEMFTTGFSMNAEALAEP---MNGPTLQWMKAQAKKKGAAI 76 (252) T ss_pred CEEEEEECCCCCCCHHHHHHHHHHHHHHHH-CCCCEEECCCCCCCCCCCCHHHHCCC---CCCHHHHHHHHHHHHCCCEE T ss_conf 989999667887999999999999999986-69999999886556899896884635---78889999999997569889 Q ss_pred EEEEEEEECCEEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCEEEEECCCCCCCHHHHHHHHH Q ss_conf 99999988898789999996-99799988777623554333210037788876147647456201111111016899999 Q gi|254780999|r 85 VVGFPRQDQEGVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFISGYSNDPIVFRDIRLGILICEDIWKNSNICKHL 163 (562) Q Consensus 85 ivG~p~~~~~~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~~~~~~g~kiGv~ICeDlw~~~~~~~~l 163 (562) +.|+++++++++||+++++. +|++ ..|+|+||+ +.++|.+||++|+...+|+++++|+|++||||+|||+ .+|. T Consensus 77 ~g~~~~~~~~~~yNta~~i~~~G~i-~~YrK~hL~--~~~~E~~~f~~G~~~~v~~~~g~riGl~ICyDl~FPe-~~R~- 151 (252) T cd07575 77 TGSLIIKEGGKYYNRLYFVTPDGEV-YHYDKRHLF--RMAGEHKVYTAGNERVIVEYKGWKILLQVCYDLRFPV-WSRN- 151 (252) T ss_pred EEEEEEEECCEEEEEEEEEECCCEE-EEEEEEECC--CCCCCCCCCCCCCCCEEEEECCEEEEEEEECCCCCHH-HHHH- T ss_conf 9999998799589999999199669-998027669--9888660003898657998589667778864657679-9875- Q ss_pred HHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCC-CCCCCCCCCEEECCCCEEEEECCCCCCCEEE Q ss_conf 714777599315553335807888788999854337046421356887-6621048643555898599961677763799 Q gi|254780999|r 164 KKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQ-DELIFDGASFCFDGQQQLAFQMKHFSEQNFM 242 (562) Q Consensus 164 a~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~-d~lvf~G~S~I~d~~G~ii~~~~~f~e~~~i 242 (562) ..|+|++++|+++|.. +.+.++.++++||.||+++++++|++|.+ ++..|.|+|+|+||+|+++++++. +|++++ T Consensus 152 -~~~~dil~~~a~~P~~--~~~~w~~l~~aRA~eN~~~vv~~N~~G~~~~~~~~~G~S~Iv~P~G~ila~a~~-~e~~~~ 227 (252) T cd07575 152 -TNDYDLLLYVANWPAP--RRAAWDTLLKARAIENQAYVIGVNRVGTDGNGLEYSGDSAVIDPLGEPLAEAEE-DEGVLT 227 (252) T ss_pred -HCCCCEEEEEECCCCC--CHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCEEEEEEEEECCCCCEEEECCC-CCEEEE T ss_conf -3478747987148763--168999999999997497699965666689998884247999999778797799-985999 Q ss_pred EEEEECCCCCC Q ss_conf 99983352123 Q gi|254780999|r 243 TEWHYDQQLSQ 253 (562) Q Consensus 243 ~d~d~d~~~~~ 253 (562) +++|.+..... T Consensus 228 a~iD~~~i~~~ 238 (252) T cd07575 228 ATLDKEALQEF 238 (252) T ss_pred EEECHHHHHHH T ss_conf 99658999999 No 29 >TIGR00552 nadE NAD+ synthetase; InterPro: IPR003694 NAD+ synthase (6.3.5.1 from EC) catalyzes the last step in the biosynthesis of nicotinamide adenine dinucleotide and is induced by stress factors such as heat shock and glucose limitation. The three-dimensional structure of NH3-dependent NAD+ synthetase from Bacillus subtilis, in its free form and in complex with ATP shows that the enzyme consists of a tight homodimer with alpha/beta subunit topology .; GO: 0003952 NAD+ synthase (glutamine-hydrolyzing) activity, 0005524 ATP binding, 0009435 NAD biosynthetic process. Probab=100.00 E-value=0 Score=354.97 Aligned_cols=262 Identities=35% Similarity=0.485 Sum_probs=226.4 Q ss_pred HHHHHHCCHHHHHHHHHHHC-CCCEEEEECCCCCHH--------HHHHHHHHHCCCHHHEEEEECCCCCC-CCCCHHHHH Q ss_conf 88864220203588876422-454057521587236--------89999998414954467973246666-522135699 Q gi|254780999|r 271 EEEADYNACVLSLRDYVQKN-NFHKVIIGLSGGIDS--------ALCAAIAVDALGKENVQTIMLPYKYT-SPQSLEDAA 340 (562) Q Consensus 271 ~~e~~~~Alvlgl~d~~~~~-g~~~~viglSGGiDS--------al~a~la~~alg~~~v~~~~mp~~~~-s~~s~~~a~ 340 (562) ..+++++++. .+++|+.+. |++++|+|+|||+|| ++++++|++++|+++++++.||+..+ ++.+.++|. T Consensus 5 ~~~~~~~~~~-~~~~~~~~~~~~~g~v~GlsGG~ds~l~g~l~~~~v~~~~~~~~~~~~~~~~~~p~~~~~~~~d~~~a~ 83 (286) T TIGR00552 5 VVEELEDAVD-FLRGYVKKSPGAKGVVLGLSGGVDSTLAGKLCQAVVAALCVEALGKEQNHALLLPHSNTTPEDDVQDAL 83 (286) T ss_pred HHHHHHHHHH-HHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCHHHHEEEEECCCCCCCHHHHHHHH T ss_conf 7899999898-888887632576504673047620356778999999999985046114203321466788412179999 Q ss_pred HHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCHHHHHHHHCCCCC Q ss_conf 99997213542677589999988631323468743101101233046677988875169544364324477541102366 Q gi|254780999|r 341 ACAKALGCKYDVLPIHDLVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSNKSEISVGYGTLYG 420 (562) Q Consensus 341 ~la~~lg~~~~~i~I~~~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nksE~~~Gy~T~~G 420 (562) .+++.+|+.+..++|++....+...... .......++.+|++||+||.++|++||+++++|+||||++|+.+||+|+|| T Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~gn~~~r~r~~~~y~~a~~~~~lv~gt~n~~e~~~Gy~t~~G 162 (286) T TIGR00552 84 ALAEPLGINYKTIDIAPIAASFQAQTET-GDPLADFLAEGNLKARLRMALLYAHANKHNLLVLGTGNKSELLLGYFTKYG 162 (286) T ss_pred HHHHHCCCCCEECCCHHHHHHHHHHHCC-CCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHCC T ss_conf 9887405442111304677887764203-443331023322247889999988731016356404613445432001014 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCCCCCC---CCCCCCCC-CHHHHHHHHHHH Q ss_conf 65554010177633658999999631033222344446688788288988453899---97700068-878899999999 Q gi|254780999|r 421 DMSGGFNPLKDLYKTQVFQLASWRNSHGITSGLGPLTEVIPPSILEKSPSAELRPH---QTDQESLP-PYPILDDIIKRI 496 (562) Q Consensus 421 D~~g~~~p~~~l~Kt~v~~la~~~n~~~~~~~~~~~~~~ip~~i~~~~psaeL~~~---Q~ded~l~-~Y~~lD~il~~~ 496 (562) |++++++|+++|+|++||+|++|+|.. ..++|+.+++|||||+||++ |+||.++| +|+.+|.+|..+ T Consensus 163 dg~~d~~p~~~l~k~~~~~l~~~l~~~---------~~~~P~~~~~k~P~a~l~~g~p~q~de~~~g~~y~~ld~~l~~~ 233 (286) T TIGR00552 163 DGGCDILPLGGLFKTEVYELAKRLGVE---------QTIIPEEIIEKPPTADLWPGKPSQTDETELGIPYDELDDYLKGL 233 (286) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHCCC---------CCCCCHHHHCCCCCCCCCCCCCCCCCHHHHCCCHHHHHHHHHHH T ss_conf 420003455322378899999981753---------10146667337887554678887765445057777777887766 Q ss_pred HHCCCCH---HHH-HHCCCHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCC Q ss_conf 8279997---798-71899999999999998745668738970247264537 Q gi|254780999|r 497 VENEESF---INN-DQEYNDETVRYVEHLLYGSEYKRRQAPVGTKITAKSFG 544 (562) Q Consensus 497 i~~~~~~---~~~-~~~~~~~~v~~~~~~~~~~~~KR~~~pp~~~is~~sfg 544 (562) .+...+. .+. ...+. +.++++..++.++||||+|++|+.....+.|+ T Consensus 234 ~~~~~~~p~d~~~~~~~~~-~~~~~~~~~~~~~~hk~~~~~P~~~~~~~~~~ 284 (286) T TIGR00552 234 EELSQPKPIDQEVLATLLP-ELVKRIESLVQKSEHKREQALPPVPTIFDLFW 284 (286) T ss_pred HHHCCCCCHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHC T ss_conf 6412677514687862016-89999999887555555423786113545305 No 30 >cd07564 nitrilases_CHs Nitrilases, cyanide hydratase (CH)s, and similar proteins (class 1 nitrilases). Nitrilases (nitrile aminohydrolases, EC:3.5.5.1) hydrolyze nitriles (RCN) to ammonia and the corresponding carboxylic acid. Most nitrilases prefer aromatic nitriles, some prefer arylacetonitriles and others aliphatic nitriles. This group includes the nitrilase cyanide dihydratase (CDH), which hydrolyzes inorganic cyanide (HCN) to produce formate. It also includes cyanide hydratase (CH), which hydrolyzes HCN to formamide. This group includes four Arabidopsis thaliana nitrilases (Ath)NIT1-4. AthNIT1-3 have a strong substrate preference for phenylpropionitrile (PPN) and other nitriles which may originate from the breakdown of glucosinolates. The product of PPN hydrolysis, phenylacetic acid has auxin activity. AthNIT1-3 can also convert indoacetonitrile to indole-3-acetic acid (IAA, auxin), but with a lower affinity and velocity. From their expression patterns, it has been speculated that Probab=100.00 E-value=0 Score=342.78 Aligned_cols=244 Identities=22% Similarity=0.279 Sum_probs=202.3 Q ss_pred EEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHC--------CHHHHHHH----HHHHHH Q ss_conf 289999638836889999999999999999859989991664447998445414--------88999999----999999 Q gi|254780999|r 5 LKIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVF--------KKSFIQAC----SSAIDT 72 (562) Q Consensus 5 mkIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~--------~~~f~~~~----~~~l~~ 72 (562) ||||++|+++.+||+++|++++.+++++|+++||||||||||++||||....+. ...+.+.+ ...++. T Consensus 1 mkVA~vQ~~~~~~d~~~Nl~~~~~~i~~Aa~~ga~lvvfPE~~~tGYp~~~~~~~~~~~~~~~~~~~~~a~~~~g~~~~~ 80 (297) T cd07564 1 VKVAAVQAAPVFLDLAATVEKACRLIEEAAANGAQLVVFPEAFIPGYPYWIWFGAPAEGRELFARYYENSVEVDGPELER 80 (297) T ss_pred CEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHCCCCCCHHHHH T ss_conf 98999957787799999999999999999988995999589986578724431684003799999997464699989999 Q ss_pred HHHHHHHCCCEEEEEEEEEECCEEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCCCCC--CCCEEEECCEEEEECC Q ss_conf 99998876978999999988898789999996-99799988777623554333210037788--8761476474562011 Q gi|254780999|r 73 LKSDTHDGGAGIVVGFPRQDQEGVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFISGY--SNDPIVFRDIRLGILI 149 (562) Q Consensus 73 La~~~~~~~i~iivG~p~~~~~~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~~G~--~~~~~~~~g~kiGv~I 149 (562) |++.+++++++|++|++++.++++||++++|. +|+++++|+|+| |+| .|+.+|.+|+ ...+|++.++|+|++| T Consensus 81 l~~~A~~~~i~iv~G~~E~~~~~~yNtav~i~~~G~i~~~yrKi~-p~~---~E~~~f~~G~g~~~~v~~t~~griG~~I 156 (297) T cd07564 81 LAEAARENGIYVVLGVSERDGGTLYNTQLLIDPDGELLGKHRKLK-PTH---AERLVWGQGDGSGLRVVDTPIGRLGALI 156 (297) T ss_pred HHHHHHHCCCEEEEEEEECCCCCEEEEEEEECCCCCEEEEECCCC-CCC---CCEEEECCCCCCCCEEECCCCEEEEEEE T ss_conf 999999829599984166369956867999879995898965767-786---6224663578877457525871688786 Q ss_pred CCCCCHHHHHHHHHHHCCCCEEEE--CCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCC---------------C Q ss_conf 111110168999997147775993--15553335807888788999854337046421356887---------------6 Q gi|254780999|r 150 CEDIWKNSNICKHLKKQGAEFLFS--LNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQ---------------D 212 (562) Q Consensus 150 CeDlw~~~~~~~~la~~Gadlii~--psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~---------------d 212 (562) |||+|+|+ ..+.|+.+|++|++. |++.+...++ +.+..++++||.||+++++++|++|.+ + T Consensus 157 CyD~~fP~-~~~~l~~~ga~i~va~~P~~~~~~~~~-~~w~~l~rarAiEnq~fVva~n~~g~~~~~~~~~~~~~~~~~~ 234 (297) T cd07564 157 CWENYMPL-ARYALYAQGEQIHVAPWPDFSPYYLSR-EAWLAASRHYALEGRCFVLSACQVVTEEDIPADCEDDEEADPL 234 (297) T ss_pred CCCCHHHH-HHHHHHHCCCEEEEECCCCCCCCCCCH-HHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCCC T ss_conf 25410699-999998589809997378644677878-9999999999985595899965313633332003565445567 Q ss_pred CCCCCCCCEEECCCCEEEEECCCCCCCEEEEEEEECCCCCCC Q ss_conf 621048643555898599961677763799999833521236 Q gi|254780999|r 213 ELIFDGASFCFDGQQQLAFQMKHFSEQNFMTEWHYDQQLSQW 254 (562) Q Consensus 213 ~lvf~G~S~I~d~~G~ii~~~~~f~e~~~i~d~d~d~~~~~~ 254 (562) +..|.|+|+|+||+|++++++...+|+++++|+|++.....+ T Consensus 235 ~~~~~G~S~Ii~P~G~vla~~~~~~e~ii~adiDl~~i~~~R 276 (297) T cd07564 235 EVLGGGGSAIVGPDGEVLAGPLPDEEGILYADIDLDDIVEAK 276 (297) T ss_pred CCCCCCEEEEECCCCCEEECCCCCCCEEEEEEECHHHHHHHH T ss_conf 764375589998999786564799974999995699999999 No 31 >COG0388 Predicted amidohydrolase [General function prediction only] Probab=100.00 E-value=0 Score=339.18 Aligned_cols=250 Identities=25% Similarity=0.388 Sum_probs=218.1 Q ss_pred CCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHH-HHCCHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 86289999638836889999999999999999859989991664447998445-41488999999999999999988769 Q gi|254780999|r 3 KKLKIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPED-LVFKKSFIQACSSAIDTLKSDTHDGG 81 (562) Q Consensus 3 ~~mkIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~D-l~~~~~f~~~~~~~l~~La~~~~~~~ 81 (562) ..||||++|+++..+|+++|++++++++++|+++|||||||||+++|||+++| ++++..........+..+.+.++ .. T Consensus 1 ~~~riA~~Q~~~~~~d~~~N~~~~~~~i~~A~~~ga~LvvfPEl~~tgy~~~~~~~~~~~~~~~~~~~~~~l~~~a~-~~ 79 (274) T COG0388 1 SMMRVAAAQMAPKAGDPAENLARILRLIREAAARGADLVVFPELFLTGYPCEDDLFLEEAAAEAGEETLEFLAALAE-EG 79 (274) T ss_pred CCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHH-HC T ss_conf 94799998036666897999999999999998769989993784355798412567665430356289999999887-69 Q ss_pred CEEEEEEEEEECCEEEEEEEEE-ECCEEEEEEEEEEECCCCCCCCCCCCCCCCCCC-EEEECCEEEEECCCCCCCHHHHH Q ss_conf 7899999998889878999999-699799988777623554333210037788876-14764745620111111101689 Q gi|254780999|r 82 AGIVVGFPRQDQEGVLNSVVIL-DAGNIIAVRDKINLPNYSEFHEKRTFISGYSND-PIVFRDIRLGILICEDIWKNSNI 159 (562) Q Consensus 82 i~iivG~p~~~~~~~yNsa~vi-~~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~-~~~~~g~kiGv~ICeDlw~~~~~ 159 (562) ..+++|.+.......||+++++ .+|+++++|+|+|||++ .|+|+++|.+|+... +|++.++|+|+.||||+|+|+ . T Consensus 80 ~~~ivg~~~~~~~~~~~~~~~i~~~G~ii~~y~K~hl~~~-~~~e~~~~~~G~~~~~v~~~~~~kiG~~IC~D~~fPe-~ 157 (274) T COG0388 80 GVIIVGGPLPEREKLYNNAALIDPDGEILGKYRKLHLFDA-FYEERRFFTPGDEGVVVFETDGGKIGLLICYDLRFPE-L 157 (274) T ss_pred CEEEEEEEEEECCCCEEEEEEECCCCEEEEEEEEECCCCC-CCCHHHCCCCCCCCCEEEEECCEEEEEEECCCCCCCH-H T ss_conf 8899985066134416789998689869999865827987-5224313147876650696688578999814547837-8 Q ss_pred HHHH-HHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCC-CCCCCCCEEECCCCEEEEECCCCC Q ss_conf 9999-71477759931555333580788878899985433704642135688766-210486435558985999616777 Q gi|254780999|r 160 CKHL-KKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDE-LIFDGASFCFDGQQQLAFQMKHFS 237 (562) Q Consensus 160 ~~~l-a~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~-lvf~G~S~I~d~~G~ii~~~~~f~ 237 (562) .+.+ +.+||++|++|+|+++..+ .+.|..++++||.+|+++++++|++|.+++ ..|+|+|+|+||+|++++++..++ T Consensus 158 ~~~~~~~~Gaeii~~p~a~~~~~~-~~~w~~l~~arA~en~~~vi~~n~~g~~~~~~~~~G~S~i~~p~G~v~~~~~~~~ 236 (274) T COG0388 158 ARRLLALGGAELLLVPAAWPAERG-LDHWEVLLRARAIENQVYVLAANRAGFDGAGLEFCGHSAIIDPDGEVLAEAGEEE 236 (274) T ss_pred HHHHHHHCCCEEEEECCCCCCCCC-HHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCEECCEEEEECCCCCEEEECCCCC T ss_conf 999886269659998887755433-6789999975278536269996676776763255110499848997787437888 Q ss_pred CCEEEEEEEECCCCCCCCC Q ss_conf 6379999983352123655 Q gi|254780999|r 238 EQNFMTEWHYDQQLSQWNY 256 (562) Q Consensus 238 e~~~i~d~d~d~~~~~~~~ 256 (562) |+++++|+|.+.....+.. T Consensus 237 e~~~~~~id~~~~~~~r~~ 255 (274) T COG0388 237 EGVLLADIDLAELAEVRRK 255 (274) T ss_pred CCEEEEEECHHHHHHHHCC T ss_conf 8689999746677553045 No 32 >cd07565 aliphatic_amidase aliphatic amidases (class 2 nitrilases). Aliphatic amidases catalyze the hydrolysis of short-chain aliphatic amides to form ammonia and the corresponding organic acid. This group includes Pseudomonas aeruginosa (Pa) AmiE, the amidase from Geobacillus pallidus RAPc8 (RAPc8 amidase), and Helicobacter pylori (Hp) AmiE and AmiF. PaAimE and HpAmiE hydrolyze various very short aliphatic amides, including propionamide, acetamide and acrylamide. HpAmiF is a formamidase which specifically hydrolyzes formamide. These proteins belong to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 2. Members of this superfamily generally form homomeric complexes, the basic Probab=100.00 E-value=0 Score=328.80 Aligned_cols=238 Identities=15% Similarity=0.133 Sum_probs=195.2 Q ss_pred EEEEEEECCC----CCCCHHHHHHHHHHHHHHHHH--CCCCEEEECCCCCCCCCHHHHHCCHHHHHH-HHHHHHHHHHHH Q ss_conf 2899996388----368899999999999999998--599899916644479984454148899999-999999999998 Q gi|254780999|r 5 LKIAIAQLNP----VVGDIAGNIAKARRAREEANR--QGMDLILFTELFISGYPPEDLVFKKSFIQA-CSSAIDTLKSDT 77 (562) Q Consensus 5 mkIal~Q~n~----~~gd~~~N~~~i~~~i~~A~~--~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~-~~~~l~~La~~~ 77 (562) ++||++|+.+ ...|++.|+++|.+.+++|++ .|||||||||+++|||.+.... ...+.+. -...++.|++.+ T Consensus 1 v~va~vQ~~~~~~~~~~dv~~n~~~i~~~i~~a~a~~~GadLIVlPE~~~tG~~~~~~~-~~~~a~~~~Gp~~~~l~~~A 79 (291) T cd07565 1 VGVAVVQYKVPVLHTKEEVLENAERIADMVEGTKRGLPGMDLIVFPEYSTQGLMYDKWT-MDETACTVPGPETDIFAEAC 79 (291) T ss_pred CEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCCCCHHH-HHHHHHCCCCHHHHHHHHHH T ss_conf 96999963488889989999999999999999973499986998888655789977678-88754229987999999999 Q ss_pred HHCCCEEEEEEEEEE---CCEEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCCCCCCCC-EEEE-CCEEEEECCCC Q ss_conf 876978999999988---898789999996-99799988777623554333210037788876-1476-47456201111 Q gi|254780999|r 78 HDGGAGIVVGFPRQD---QEGVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFISGYSND-PIVF-RDIRLGILICE 151 (562) Q Consensus 78 ~~~~i~iivG~p~~~---~~~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~-~~~~-~g~kiGv~ICe 151 (562) +++++++++|.+++. ++.+||++++|+ +|+++++|||+|+... +.+| .+|+... +++. .|+|+|++||| T Consensus 80 ~~~~v~iv~g~~E~~~~~~~~~yNtavvid~~G~ii~~YRKiHl~~~----~e~~-~pGd~~~~v~~t~~G~kiGl~ICy 154 (291) T cd07565 80 KEAKVWGVFSIMERNPDHGKNPYNTAIIIDDQGEIVLKYRKLHPWVP----IEPW-YPGDLGTPVCEGPKGSKIALIICH 154 (291) T ss_pred HHCCCEEEEEEEEECCCCCCCEEEEEEEECCCCCEEEEECCCCCCCC----CCCC-CCCCCCCEEEECCCCCEEEEEEEC T ss_conf 86692899986876478789768899999899839989864168887----7784-468887547876898689999863 Q ss_pred CCCHHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEECCCCEEEE Q ss_conf 11101689999971477759931555333580788878899985433704642135688766210486435558985999 Q gi|254780999|r 152 DIWKNSNICKHLKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDELIFDGASFCFDGQQQLAF 231 (562) Q Consensus 152 Dlw~~~~~~~~la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~G~ii~ 231 (562) |+|||+ .+|.++++|||+|++|++++... .+.+..+.++||.+|+++++++|++|.++.+.|.|+|+|+||+|++++ T Consensus 155 D~~FPE-~~R~lal~GAeli~~Psa~~~~~--~~~W~~l~rARAieN~~yvv~~Nr~G~e~~~~f~G~S~IvdP~G~vla 231 (291) T cd07565 155 DGMYPE-IARECAYKGAELIIRIQGYMYPA--KDQWIITNKANAWCNLMYTASVNLAGFDGVFSYFGESMIVNFDGRTLG 231 (291) T ss_pred CCCCHH-HHHHHHHCCCEEEEECCCCCCCC--HHHHHHHHHHHHHHHCCEEEEECCCCCCCCEEEEEEEEEECCCCCEEE T ss_conf 424289-99999978994999888789885--178999999999970978999856646898477446699999988877 Q ss_pred ECCCCCCCEEEEEEEECCCC Q ss_conf 61677763799999833521 Q gi|254780999|r 232 QMKHFSEQNFMTEWHYDQQL 251 (562) Q Consensus 232 ~~~~f~e~~~i~d~d~d~~~ 251 (562) ++..-+|+++++|+|++... T Consensus 232 ~a~~~~eei~~AeiDl~~~~ 251 (291) T cd07565 232 EGGREPDEIVTAELSPSLVR 251 (291) T ss_pred ECCCCCCEEEEEEECHHHHH T ss_conf 75799987999998789999 No 33 >cd07574 nitrilase_Rim1_like Uncharacterized subgroup of the nitrilase superfamily; some members of this subgroup have an N-terminal RimI domain (class 12 nitrilases). Some members of this subgroup are implicated in post-translational modification, as they contain an N-terminal GCN5-related N-acetyltransferase (GNAT) protein RimI family domain. The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 12. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=100.00 E-value=0 Score=321.07 Aligned_cols=245 Identities=18% Similarity=0.210 Sum_probs=196.8 Q ss_pred EEEEEEECCC-CCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCC---HHHHHCCHHHHHH----HHHHHHHHHHH Q ss_conf 2899996388-36889999999999999999859989991664447998---4454148899999----99999999999 Q gi|254780999|r 5 LKIAIAQLNP-VVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYP---PEDLVFKKSFIQA----CSSAIDTLKSD 76 (562) Q Consensus 5 mkIal~Q~n~-~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~---~~Dl~~~~~f~~~----~~~~l~~La~~ 76 (562) ||||++|++. ..+|++.|++++.+++++|+++|||||||||++.+||. +.+.-........ ....++.+++. T Consensus 1 ~rvA~vQ~~~~~~~d~e~nl~~~~~~i~~Aa~~GA~lvvfPE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~l 80 (280) T cd07574 1 VRVAAAQYPLRRYASFEEFAAKVEYWVAEAAGYGADLLVFPEYFTMELLSLLPEAIDGLDEAIRALAALTPDYVALFSEL 80 (280) T ss_pred CEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHH T ss_conf 98999957898999999999999999999998859699958875463401476210028999764242486999999999 Q ss_pred HHHCCCEEEEE-EEEEECCEEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCEEEEECCCCCCC Q ss_conf 88769789999-99988898789999996-99799988777623554333210037788876147647456201111111 Q gi|254780999|r 77 THDGGAGIVVG-FPRQDQEGVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFISGYSNDPIVFRDIRLGILICEDIW 154 (562) Q Consensus 77 ~~~~~i~iivG-~p~~~~~~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~~~~~~g~kiGv~ICeDlw 154 (562) +++++++|+.| ++++.++++||++++|+ +|+ ++.|+|+||+.+. .|..+|.+|+...+|+++++|+|++||||+| T Consensus 81 A~~~~i~iv~G~~~~~~~~~~yNta~vi~~~G~-i~~~rK~hl~~~e--~~~~~~~~G~~~~v~~~~~griG~~ICyD~~ 157 (280) T cd07574 81 ARKYGINIIAGSMPVREDGRLYNRAYLFGPDGT-IGHQDKLHMTPFE--REEWGISGGDKLKVFDTDLGKIGILICYDSE 157 (280) T ss_pred HHHHCCEEEEEEEEEEECCEEEEEEEEECCCCC-EEEEEEEECCCCC--CEEEEECCCCCCEEEEECCEEEEEEEECCCC T ss_conf 998296999936999769989999999958981-9999864059986--3155226999557999799279877861002 Q ss_pred HHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCC----CCCCCCCCEEECCC---- Q ss_conf 0168999997147775993155533358078887889998543370464213568876----62104864355589---- Q gi|254780999|r 155 KNSNICKHLKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQD----ELIFDGASFCFDGQ---- 226 (562) Q Consensus 155 ~~~~~~~~la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d----~lvf~G~S~I~d~~---- 226 (562) +|+ .++.++++|||+|++|+++|...+....+ ...++||.||+++++++|++|.+. ...|.|+|+|++|. T Consensus 158 fPe-~~r~la~~Ga~ii~~Psa~~~~~~~~~~~-~~~~ARAieN~~yvv~~n~vG~~~~~~~~~~~~G~S~I~~P~~~~~ 235 (280) T cd07574 158 FPE-LARALAEAGADLLLVPSCTDTRAGYWRVR-IGAQARALENQCYVVQSGTVGNAPWSPAVDVNYGQAAVYTPCDFGF 235 (280) T ss_pred CHH-HHHHHHHCCCCEEEECCCCCCCCHHHHHH-HHHHHHHHHCEEEEEEECCCCCCCCCCCCCEECCEEEEECCCCCCC T ss_conf 569-99999987996999899888755589999-9999998858079999256247888787654412239988866767 Q ss_pred --CEEEEECCCCCCCEEEEEEEECCCCCCC Q ss_conf --8599961677763799999833521236 Q gi|254780999|r 227 --QQLAFQMKHFSEQNFMTEWHYDQQLSQW 254 (562) Q Consensus 227 --G~ii~~~~~f~e~~~i~d~d~d~~~~~~ 254 (562) |.++++++..+|+++++|+|++.....+ T Consensus 236 ~~~gvla~~~~~~E~il~AdiDl~~v~~~R 265 (280) T cd07574 236 PEDGILAEGEPNTEGWLIADLDLEALRRLR 265 (280) T ss_pred CCCCEEEECCCCCCEEEEEEEEHHHHHHHH T ss_conf 899989712799988999997469999999 No 34 >cd07579 nitrilase_1_R2 Second nitrilase domain of an uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases). Members of this subgroup have two nitrilase domains. This is the second of those two domains. The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). Class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup represents either a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=100.00 E-value=1.4e-45 Score=318.01 Aligned_cols=233 Identities=21% Similarity=0.227 Sum_probs=191.3 Q ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 89999638836889999999999999999859989991664447998445414889999999999999999887697899 Q gi|254780999|r 6 KIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTHDGGAGIV 85 (562) Q Consensus 6 kIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~~~~i~ii 85 (562) |||++|+++. +|++.|++++.+++++|+++||+||||||+++|||+...... .......++.|++.+++++++|+ T Consensus 1 rvA~~Q~~~~-~d~~~Nl~~~~~~i~~Aa~~ga~lvvlPE~~~tg~~~~~~~a----~~~~g~~~~~l~~lA~~~~i~iv 75 (279) T cd07579 1 RIAVAQFAPT-PDIAGNLATIDRLAAEAKATGAELVVFPELALTGLDDPASEA----ESDTGPAVSALRRLARRLRLYLV 75 (279) T ss_pred CEEEEECCCC-CCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCHHHHC----CCCCCHHHHHHHHHHHHCCEEEE T ss_conf 9999946589-999999999999999999879959997999411599547743----55998899999999997591999 Q ss_pred EEEEEEECCEEEEEEEEEEC-CEEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 99999888987899999969-97999887776235543332100377888761476474562011111110168999997 Q gi|254780999|r 86 VGFPRQDQEGVLNSVVILDA-GNIIAVRDKINLPNYSEFHEKRTFISGYSNDPIVFRDIRLGILICEDIWKNSNICKHLK 164 (562) Q Consensus 86 vG~p~~~~~~~yNsa~vi~~-G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~~~~~~g~kiGv~ICeDlw~~~~~~~~la 164 (562) +|++++.++++||++++|.+ | +++.|+|+||.. .|++||++|+...+|++.++|+|++||||+|||+ .++.++ T Consensus 76 ~G~~e~~~~~~yNs~~~i~~~G-~i~~YrK~HL~~----~E~~~~~~G~~~~v~~t~~griGl~ICyD~rFPE-~~R~la 149 (279) T cd07579 76 AGFAEADGDGLYNSAVLVGPEG-LVGTYRKTHLIE----PERSWATPGDTWPVYDLPLGRVGLLIGHDALFPE-AGRVLA 149 (279) T ss_pred EEEEECCCCEEEEEEEEECCCC-CEEEEEECCCCC----CCCEEECCCCCCEEEEECCEEEEEEEECCCCHHH-HHHHHH T ss_conf 9899945993788899985997-678997025897----5441133898055999899889988880201289-999999 Q ss_pred HCCCCEEEECCCCC--C---------------CCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEECCCC Q ss_conf 14777599315553--3---------------358078887889998543370464213568876621048643555898 Q gi|254780999|r 165 KQGAEFLFSLNASP--Y---------------YHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDELIFDGASFCFDGQQ 227 (562) Q Consensus 165 ~~Gadlii~psASp--~---------------~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~G 227 (562) ++|||+|++|+|+. + ..++....+.++++||.||+++++++|++|.+++ |.|+|+|++|+| T Consensus 150 ~~GA~il~~PaA~~~~~~~~~~~~~~~~~~~~~~g~~~~hw~llraRAieN~~~v~~an~~~~~~~--~~G~s~i~gp~~ 227 (279) T cd07579 150 LRGCDLLACPAAIAIPFVGAHAGTSVPQPYPIPTGADPTHWHLARVRAGENNVYFAFANVPDPARG--YTGWSGVFGPDT 227 (279) T ss_pred HCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEECCCCCCCC--CCCEEEEEECCC T ss_conf 879939997663466755333453233455544465347999999888646759999435467678--663469996575 Q ss_pred EEEEEC---CCCCCCEEEEEEEECCCC Q ss_conf 599961---677763799999833521 Q gi|254780999|r 228 QLAFQM---KHFSEQNFMTEWHYDQQL 251 (562) Q Consensus 228 ~ii~~~---~~f~e~~~i~d~d~d~~~ 251 (562) .+.... ..-+|+++++|+|.+... T Consensus 228 ~~~p~~~~~~~~~egi~~A~iDl~~~~ 254 (279) T cd07579 228 FAFPRQEAAIGDEEGIAWALIDTSNLD 254 (279) T ss_pred CCCCCEEEECCCCCCEEEEEECHHHHH T ss_conf 543531350388772799997589974 No 35 >cd07582 nitrilase_4 Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases). The nitrilase superfamily is comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13). Class 13 represents proteins that at the time were difficult to place in a distinct similarity group; this subgroup represents either a new class or one that was included previously in class 13. Members of this superfamily generally form homomeric complexes, the basic building block of which is a homodimer. Probab=100.00 E-value=4.5e-44 Score=307.46 Aligned_cols=243 Identities=18% Similarity=0.144 Sum_probs=190.6 Q ss_pred EEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCC-----CEEEECCCCCCCCCHHHHHCCHHHHHHH----HHHHHHHHH Q ss_conf 2899996388368899999999999999998599-----8999166444799844541488999999----999999999 Q gi|254780999|r 5 LKIAIAQLNPVVGDIAGNIAKARRAREEANRQGM-----DLILFTELFISGYPPEDLVFKKSFIQAC----SSAIDTLKS 75 (562) Q Consensus 5 mkIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~ga-----dLvvfPEl~LtGY~~~Dl~~~~~f~~~~----~~~l~~La~ 75 (562) +..++.|++.+ .|+++|++++.++|++|+++|| |||||||+++|||+..+......+.+.. ...++.|++ T Consensus 6 ~~~~~~~~~~~-~D~~aNl~~~~~~I~~Aa~~ga~~~~~~LvvfPE~~~tGy~~~~~~~~~~~~~~a~~~~Gp~~~~l~~ 84 (294) T cd07582 6 LQPTCEAAEDR-ADILANIDRINEQIDAAVGFSGPGLPVRLVVLPEYALQGFPMGEPREVWQFDKAAIDIPGPETEALGE 84 (294) T ss_pred EEEEEEEECCH-HHHHHHHHHHHHHHHHHHHCCCCCCCCCEEECCCCCCCCCCCCCCHHHHHHHHHCCCCCCHHHHHHHH T ss_conf 98666520371-00999999999999999976875469968999898554798776224477886355799989999999 Q ss_pred HHHHCCCEEEEEEEEEEC---CEEEEEEEEEE-CCEEEEEEEEEEECCCC-------CCCCCCCCCCCCCC----CEEEE Q ss_conf 988769789999999888---98789999996-99799988777623554-------33321003778887----61476 Q gi|254780999|r 76 DTHDGGAGIVVGFPRQDQ---EGVLNSVVILD-AGNIIAVRDKINLPNYS-------EFHEKRTFISGYSN----DPIVF 140 (562) Q Consensus 76 ~~~~~~i~iivG~p~~~~---~~~yNsa~vi~-~G~il~~y~K~~LP~y~-------~FdE~r~F~~G~~~----~~~~~ 140 (562) .+++++++|++|+.++.+ +++||++++|. +|++++.|+|+|++.+. +|++ .++.+|... .++++ T Consensus 85 ~Ak~~~i~iv~G~~Er~~~~~g~~yNta~li~~~G~i~~~yrKihl~~~~~~~~~~~~~~~-~~~~~g~~~~~~f~V~dt 163 (294) T cd07582 85 KAKELNVYIAANAYERDPDFPGLYFNTAFIIDPSGEIILRYRKMNSLAAEGSPSPHDVWDE-YIEVYGYGLDALFPVADT 163 (294) T ss_pred HHHHCCEEEEEEEEEECCCCCCCEEEEEEEECCCCCEEEEECCCCCCCCCCCCCCCCCCEE-EEEEECCCCCCCCEEEEC T ss_conf 9998795999756885488899589999999999879999836677854567876644327-899844789865556753 Q ss_pred CCEEEEECCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCC----CCC Q ss_conf 4745620111111101689999971477759931555333580788878899985433704642135688766----210 Q gi|254780999|r 141 RDIRLGILICEDIWKNSNICKHLKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDE----LIF 216 (562) Q Consensus 141 ~g~kiGv~ICeDlw~~~~~~~~la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~----lvf 216 (562) ..+|+|++||||+|||+ ..+.++.+|||+|++|+++....+ ...++.+.++||.||+|+++.+|++|-++. ..| T Consensus 164 ~~GriG~~ICyD~~fPE-~~R~la~~GA~vi~~paa~~~~~~-~~~w~~~~raRAieN~~yVv~~n~~g~~~~~~~~~~~ 241 (294) T cd07582 164 EIGNLGCLACEEGLYPE-VARGLAMNGAEVLLRSSSEVPSVE-LDPWEIANRARALENLAYVVSANSGGIYGSPYPADSF 241 (294) T ss_pred CCCEEEEEEECCCCCHH-HHHHHHHCCCCEEEECCCCCCCCC-HHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCEE T ss_conf 66608899704302489-999999889979998975899864-7799999999999739779998053457877877466 Q ss_pred CCCCEEECCCCEEEEECCCCC-CCEEEEEEEECCCC Q ss_conf 486435558985999616777-63799999833521 Q gi|254780999|r 217 DGASFCFDGQQQLAFQMKHFS-EQNFMTEWHYDQQL 251 (562) Q Consensus 217 ~G~S~I~d~~G~ii~~~~~f~-e~~~i~d~d~d~~~ 251 (562) .|+|+|+||+|+++++++..+ |.++++|+|++... T Consensus 242 ~G~S~IvdP~G~vla~a~~~~~e~iv~AeiDl~~~~ 277 (294) T cd07582 242 GGGSMIVDYKGRVLAEAGYGPGSMVAGAEIDIEALR 277 (294) T ss_pred EEEEEEECCCCCEEEECCCCCCCEEEEEEECHHHHH T ss_conf 003699879998888888998778999997289999 No 36 >pfam00795 CN_hydrolase Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6. Nitrilase-related proteins generally have a conserved E-K-C catalytic triad, and are multimeric alpha-beta-beta-alpha sandwich proteins. Probab=100.00 E-value=1e-40 Score=285.52 Aligned_cols=170 Identities=31% Similarity=0.469 Sum_probs=155.7 Q ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 89999638836889999999999999999859989991664447998445414889999999999999999887697899 Q gi|254780999|r 6 KIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTHDGGAGIV 85 (562) Q Consensus 6 kIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~~~~i~ii 85 (562) |||++|+++..||+++|++++++++++|.++|+|||||||+++|||++.+........+.....++.|++.+++++++++ T Consensus 1 kiA~~Q~~~~~~d~~~Nl~~~~~~i~~A~~~gadlvv~PE~~l~gy~~~~~~~~~~~~~~~~~~~~~l~~~a~~~~i~i~ 80 (172) T pfam00795 1 RVALVQLPPSAFDLEANLQKLLELIEEAARQGADLIVFPELFIPGYAHGAAEYLELAEAIPGEVLQALSALARKNGITVV 80 (172) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHCCCEEE T ss_conf 99999899876899999999999999999879989995641246787206777776454683999999999988596999 Q ss_pred EEEEEEECCEEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCCCCCC-CCEEEECCEEEEECCCCCCCHHHHHHHHH Q ss_conf 9999988898789999996-997999887776235543332100377888-76147647456201111111016899999 Q gi|254780999|r 86 VGFPRQDQEGVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFISGYS-NDPIVFRDIRLGILICEDIWKNSNICKHL 163 (562) Q Consensus 86 vG~p~~~~~~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~~G~~-~~~~~~~g~kiGv~ICeDlw~~~~~~~~l 163 (562) +|++++.++++||++++|. +|+++++|+|+||++++.|+|.+||.+|.. ..+|+++++|+|++||||+|+|+ .++.+ T Consensus 81 ~G~~~~~~~~~yNs~~~i~~~G~i~~~Y~K~hL~~~~~~~E~~~~~~G~~~~~v~~~~~~riG~~ICyD~~fpe-~~r~l 159 (172) T pfam00795 81 AGIPERDGGGLYNTLVLIDPDGELLGKYRKRHLVPFGEWVERPLFGPGGATFPVFDTPVGKLGLLICYEIRFPE-LARML 159 (172) T ss_pred EEEEEEECCEEEEEEEEECCCCEEEEEEEEECCCCCCCEEEEEECCCCCCCCCEECCCCEEEEEEEECCCCCHH-HHHHH T ss_conf 94689528888888899969986999993375899764127875157887440104698899999860006579-99999 Q ss_pred HHCCCCEEEECCC Q ss_conf 7147775993155 Q gi|254780999|r 164 KKQGAEFLFSLNA 176 (562) Q Consensus 164 a~~Gadlii~psA 176 (562) +.+|||+|++||| T Consensus 160 a~~Ga~ii~~PsA 172 (172) T pfam00795 160 ALKGAEILANPSA 172 (172) T ss_pred HHCCCCEEEECCC T ss_conf 9889969996999 No 37 >cd07587 ML_beta-AS mammalian-like beta-alanine synthase (beta-AS) and similar proteins (class 5 nitrilases). This subgroup includes mammalian-like beta-AS (EC 3.5.1.6, also known as beta-ureidopropionase or N-carbamoyl-beta-alanine amidohydrolase). This enzyme catalyzes the third and final step in the catabolic pyrimidine catabolic pathway responsible for the degradation of uracil and thymine, the hydrolysis of N-carbamyl-beta-alanine and N-carbamyl-beta-aminoisobutyrate to the beta-amino acids, beta-alanine and beta-aminoisobutyrate respectively. This subgroup belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 5. Members of this superfamily generally form homomeric Probab=100.00 E-value=1.9e-36 Score=257.55 Aligned_cols=248 Identities=15% Similarity=0.171 Sum_probs=199.2 Q ss_pred CCEEEEEEECCCCC-------CCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCC-CHH-HHHCCHHHHHHHH--HHHH Q ss_conf 86289999638836-------88999999999999999985998999166444799-844-5414889999999--9999 Q gi|254780999|r 3 KKLKIAIAQLNPVV-------GDIAGNIAKARRAREEANRQGMDLILFTELFISGY-PPE-DLVFKKSFIQACS--SAID 71 (562) Q Consensus 3 ~~mkIal~Q~n~~~-------gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY-~~~-Dl~~~~~f~~~~~--~~l~ 71 (562) +.+||+++|-...+ -..++-.+||.++++.|+..|+.+|||.|.+-..| +|- .-+---.|.+.+. .... T Consensus 62 r~vrvg~iqn~i~~pt~~p~~~q~~ai~~~i~~~i~~Aa~~gvni~c~qe~w~mpfafctrek~pw~efae~a~~Gptt~ 141 (363) T cd07587 62 RIVRVGLIQNKIVLPTTAPIAEQREAIHDRIKKIIEAAAMAGVNIICFQEAWTMPFAFCTREKLPWCEFAESAEDGPTTK 141 (363) T ss_pred CEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEHHHCCCCEECCCCCCCHHHHCCCCCCCCHHH T ss_conf 47899787404559998668999999999999999999975987998400105872111357776022126677793699 Q ss_pred HHHHHHHHCCCEEEEEEEEEEC---CEEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCCCCCC-CCEEEECCEEEE Q ss_conf 9999988769789999999888---98789999996-997999887776235543332100377888-761476474562 Q gi|254780999|r 72 TLKSDTHDGGAGIVVGFPRQDQ---EGVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFISGYS-NDPIVFRDIRLG 146 (562) Q Consensus 72 ~La~~~~~~~i~iivG~p~~~~---~~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~~G~~-~~~~~~~g~kiG 146 (562) .+++.++++++.|+....+++. +.+||+++||+ +|+++++|||+|+|....|.|+-||.||+. ..+|++.-.|+| T Consensus 142 ~~~~~A~k~~mViv~pi~Erd~~~~~~~~NTavVId~dG~~lGkyRK~HIP~~~~f~EkfYf~pGn~G~~Vf~T~~GkIG 221 (363) T cd07587 142 FCQELAKKYNMVIVSPILERDEEHGDTIWNTAVVISNSGNVLGKSRKNHIPRVGDFNESTYYMEGNTGHPVFETQFGKIA 221 (363) T ss_pred HHHHHHHHCCEEEEEECCCCCCCCCCEEEEEEEEECCCCCEEEEEECCCCCCCCCCCEEECCCCCCCCCCCEECCCCEEE T ss_conf 99999998496999842003344587578668999789866422211338999995531213688789875543641166 Q ss_pred ECCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCC---C------C--- Q ss_conf 011111110168999997147775993155533358078887889998543370464213568876---6------2--- Q gi|254780999|r 147 ILICEDIWKNSNICKHLKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQD---E------L--- 214 (562) Q Consensus 147 v~ICeDlw~~~~~~~~la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d---~------l--- 214 (562) +.||||.|+|+ -++.++++||+|+++|||.--.. ....+....+.+|..|++|++.+|+||-|. + . T Consensus 222 V~IC~DqwfPe-~aR~~aL~GAEIvf~PtAt~g~~-se~~W~~eqr~hAiAN~~~v~aiNRVG~E~~pne~tsGdg~~a~ 299 (363) T cd07587 222 VNICYGRHHPL-NWLMYGLNGAEIVFNPSATVGAL-SEPMWPIEARNAAIANSYFTVGINRVGTEVFPNEFTSGDGKPAH 299 (363) T ss_pred EEEECCCCCHH-HHHHHHHCCCEEEECCCCCCCCC-CHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCC T ss_conf 45631543748-89986416864997545545652-36789998899999708837982355625676544567775343 Q ss_pred ----CCCCCCEEECCCCEEEEECCCCCCCEEEEEEEECCCCC Q ss_conf ----10486435558985999616777637999998335212 Q gi|254780999|r 215 ----IFDGASFCFDGQQQLAFQMKHFSEQNFMTEWHYDQQLS 252 (562) Q Consensus 215 ----vf~G~S~I~d~~G~ii~~~~~f~e~~~i~d~d~d~~~~ 252 (562) -|.|+|||.+|+|+.+.+++.-+|+++++++|++.... T Consensus 300 ~d~g~FYGSSfia~P~G~~~~~lsr~~eglLV~eiDL~~~rq 341 (363) T cd07587 300 KDFGHFYGSSYVAAPDGSRTPGLSRTRDGLLVAELDLNLCRQ 341 (363) T ss_pred CCCCCCCCCCEEECCCCCCCCCCCCCCCCEEEEEECHHHHHH T ss_conf 557642252147889987645778887636999933689999 No 38 >cd07566 ScNTA1_like Saccharomyces cerevisiae N-terminal amidase NTA1, and related proteins (class 3 nitrilases). Saccharomyces cerevisiae NTA1 functions in the N-end rule protein degradation pathway. It specifically deaminates the N-terminal asparagine and glutamine residues of substrates of this pathway, to aspartate and glutamate respectively, these latter are the destabilizing residues. This subgroup belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 3. Probab=100.00 E-value=5.4e-36 Score=254.62 Aligned_cols=220 Identities=18% Similarity=0.319 Sum_probs=162.5 Q ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHHHHH----CCCCEEEECCCCCCCCCHHHHHCCHHHHHHHH--HHHHHHHHHHHH Q ss_conf 899996388368899999999999999998----59989991664447998445414889999999--999999999887 Q gi|254780999|r 6 KIAIAQLNPVVGDIAGNIAKARRAREEANR----QGMDLILFTELFISGYPPEDLVFKKSFIQACS--SAIDTLKSDTHD 79 (562) Q Consensus 6 kIal~Q~n~~~gd~~~N~~~i~~~i~~A~~----~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~--~~l~~La~~~~~ 79 (562) |||++|++|.+||+++|++|+.++++++.. .|+||||||||++|||...++-.-..+++... .......+.+++ T Consensus 1 RIA~lQ~~P~~GdV~~Nl~ra~~ll~~~~~~~~~~~~dLlVlPEl~~TGY~f~~~~ei~~~~e~~~~G~s~~~~~~~A~k 80 (295) T cd07566 1 RIACLQLNPQIGQVEENLSRAWELLDKTKKRAKLKKPDILVLPELALTGYNFHSLEHIKPYLEPTTSGPSFEWAREVAKK 80 (295) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCHHHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 97999728847877888999999999998653125999898687355688989978988765025887189999999997 Q ss_pred CCCEEEEEEEEEECC---EEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCCCCCCCC----------------EEE Q ss_conf 697899999998889---8789999996-99799988777623554333210037788876----------------147 Q gi|254780999|r 80 GGAGIVVGFPRQDQE---GVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFISGYSND----------------PIV 139 (562) Q Consensus 80 ~~i~iivG~p~~~~~---~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~----------------~~~ 139 (562) +++++++|+|++.++ ++|||+++|. +|++++.|||+||-.. ++.|...|.... ... T Consensus 81 ~~~~vvvGypE~~~~~~~~~YNSa~vv~p~G~~l~~YRK~hL~~t----de~Wa~~g~~~~~~~~~~~~~~~~~~~~~~~ 156 (295) T cd07566 81 FNCHVVIGYPEKVDESSPKLYNSALVVDPEGEVVFNYRKSFLYYT----DEEWGCEENPGGFQTFPLPFAKDDDFDGGSV 156 (295) T ss_pred CCEEEEEEEEEECCCCCCCEEEEEEEECCCCCEEEEEECCCCCCC----CHHHHCCCCCCCCCEECCCCCCCCCCCCCCC T ss_conf 791899973032588776234347998789866115611136555----5023235788875101035566666676433 Q ss_pred ECCEEEEECCCCCC--------CHHHHHHHHHHHCCCCEEEECCCCCCCCC--------------CHHHHHH-HHH-HHH Q ss_conf 64745620111111--------10168999997147775993155533358--------------0788878-899-985 Q gi|254780999|r 140 FRDIRLGILICEDI--------WKNSNICKHLKKQGAEFLFSLNASPYYHN--------------KLKKRHE-IVT-GQI 195 (562) Q Consensus 140 ~~g~kiGv~ICeDl--------w~~~~~~~~la~~Gadlii~psASp~~~~--------------k~~~R~~-~~~-~~a 195 (562) ...+++|+.||.|+ |+..++++..+.+|++||++|+||-.... ....+.+ +.. ..+ T Consensus 157 ~~~~~~~~gICmDiNPYkFeaP~~~~Efa~~~~~~~a~lii~~maWl~~~s~~~~~~~~~~p~~~t~~yW~~R~~pl~~~ 236 (295) T cd07566 157 DVTLKTSIGICMDLNPYKFEAPFTDFEFATHVLDNGTELIICPMAWLHSLSPTELTVLPQEPDTETVSYWLQRFEPLRAE 236 (295) T ss_pred CCCEEEEEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCHHHCC T ss_conf 57402699998348852356896288999999866998999977556778831013677886113899999973433227 Q ss_pred HHCCCEEEECCCCCCCCCCCCCCCCEEECCCCEE Q ss_conf 4337046421356887662104864355589859 Q gi|254780999|r 196 SHVHLPIIYVNQVGGQDELIFDGASFCFDGQQQL 229 (562) Q Consensus 196 ~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~G~i 229 (562) .++++.++++|++|-+++..|.|+|+|+.-...+ T Consensus 237 ~~~~~~vv~aNR~G~E~~~~f~G~S~v~~~~~~~ 270 (295) T cd07566 237 PLEGTQVVFCNRIGTENDTLYAGSSAVIGIYDGL 270 (295) T ss_pred CCCCCEEEEEECCCCCCCEEECCCCEEEEECCCC T ss_conf 6666189997012254883451553688401589 No 39 >KOG0807 consensus Probab=100.00 E-value=2.2e-33 Score=237.50 Aligned_cols=239 Identities=20% Similarity=0.286 Sum_probs=190.9 Q ss_pred EEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCC-CCCCCHHHHHCCHHHHH-HHHHHHHHHHHHHHHCCC Q ss_conf 2899996388368899999999999999998599899916644-47998445414889999-999999999999887697 Q gi|254780999|r 5 LKIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELF-ISGYPPEDLVFKKSFIQ-ACSSAIDTLKSDTHDGGA 82 (562) Q Consensus 5 mkIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~-LtGY~~~Dl~~~~~f~~-~~~~~l~~La~~~~~~~i 82 (562) -+||++|++.. +|+..|++..++++++|+.+||+++.|||.| .-|-.+.. .-.+.+ ...+.+++..+.+++.+| T Consensus 16 ~~vAv~Qm~S~-~Dl~kNl~~~keLi~eA~~k~A~~iflPE~~dFi~~n~~e---si~Lae~l~~k~m~~y~elar~~nI 91 (295) T KOG0807 16 KRVAVAQMTSS-NDLTKNLATCKELISEAAQKGAKLIFLPEAFDFIGQNPLE---SIELAEPLDGKFMEQYRELARSHNI 91 (295) T ss_pred CEEEEEEECCC-HHHHHHHHHHHHHHHHHHHCCCCEEECCHHHHHHCCCCCC---CEECCCCCCHHHHHHHHHHHHHCCE T ss_conf 10477752165-6788889999999999997399799732133233279622---1322256684899999998774482 Q ss_pred EEEEEEE-EEE---CCEEEEEEEEEE-CCEEEEEEEEEEE-----CCCCCCCCCCCCCCCCCCC-EEEECCEEEEECCCC Q ss_conf 8999999-988---898789999996-9979998877762-----3554333210037788876-147647456201111 Q gi|254780999|r 83 GIVVGFP-RQD---QEGVLNSVVILD-AGNIIAVRDKINL-----PNYSEFHEKRTFISGYSND-PIVFRDIRLGILICE 151 (562) Q Consensus 83 ~iivG~p-~~~---~~~~yNsa~vi~-~G~il~~y~K~~L-----P~y~~FdE~r~F~~G~~~~-~~~~~g~kiGv~ICe 151 (562) ++.+|.. ++. +++++|+-++|+ +|+|...|+|.|| |+.....|...-.||.... |++..-+|+|..||| T Consensus 92 wlSlgg~~~r~~~~~~k~~N~hl~id~~G~i~a~Y~KlHLFDVeipg~~~lkES~~t~pG~~i~~pv~tP~GklGlaICY 171 (295) T KOG0807 92 WLSLGGHHERSDDGNQKLRNTHLLIDSKGEIRAEYQKLHLFDVEIPGGPRLKESNTTQPGTAIESPVDTPLGKLGLAICY 171 (295) T ss_pred EEEECCCCCCCCCCCCEEEEEEEEECCCCHHHHHHHHHCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEE T ss_conf 58730435777300130343579986776689997652056763599961013467678865588567862201102354 Q ss_pred CCCHHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCC--CCCCCCCEEECCCCEE Q ss_conf 11101689999971477759931555333580788878899985433704642135688766--2104864355589859 Q gi|254780999|r 152 DIWKNSNICKHLKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDE--LIFDGASFCFDGQQQL 229 (562) Q Consensus 152 Dlw~~~~~~~~la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~--lvf~G~S~I~d~~G~i 229 (562) |++||+ ++..++..||+||..|||+....|+.+ |.-++++||.|++|||+.+.|+|-.++ -.| |+|||+||+|.+ T Consensus 172 DiRFpE-~sl~LR~~gA~iLtyPSAFT~~TG~AH-WEiLlRARAietQCYVvaaaQ~G~HneKR~Sy-GhSMiVDPWGtV 248 (295) T KOG0807 172 DIRFPE-LSLKLRKMGAQILTYPSAFTIKTGEAH-WEILLRARAIETQCYVVAAAQVGKHNEKRESY-GHSMIVDPWGTV 248 (295) T ss_pred ECCCCH-HHHHHHHCCCCEEECCCHHHHCCCHHH-HHHHHHHHHHHCCEEEEEHHHCCCCCCHHHCC-CCCEEECCHHHH T ss_conf 225736-889998729827963515431344889-99999988764021898565426656022304-864687654555 Q ss_pred EEECCCCC-CCEEEEEEEECCC Q ss_conf 99616777-6379999983352 Q gi|254780999|r 230 AFQMKHFS-EQNFMTEWHYDQQ 250 (562) Q Consensus 230 i~~~~~f~-e~~~i~d~d~d~~ 250 (562) +++++.-+ .++.++|+|+..- T Consensus 249 va~~se~~~~~l~~AdiDlsll 270 (295) T KOG0807 249 VARCSERTGPGLILADIDLSLL 270 (295) T ss_pred HEECCCCCCCCEEEEECCHHHH T ss_conf 3026877899658997568999 No 40 >cd07571 ALP_N-acyl_transferase Apolipoprotein N-acyl transferase (class 9 nitrilases). ALP N-acyl transferase (Lnt), is an essential membrane-bound enzyme in gram-negative bacteria, which catalyzes the N-acylation of apolipoproteins, the final step in lipoprotein maturation. This is a reverse amidase (i.e. condensation) reaction. This subgroup belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 9. Probab=99.98 E-value=1.7e-29 Score=212.15 Aligned_cols=224 Identities=19% Similarity=0.298 Sum_probs=178.3 Q ss_pred EEEEEEECCCCC------CCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHH Q ss_conf 289999638836------88999999999999999985998999166444799844541488999999999999999988 Q gi|254780999|r 5 LKIAIAQLNPVV------GDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTH 78 (562) Q Consensus 5 mkIal~Q~n~~~------gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~ 78 (562) +||+++|.|... .....++++..++.+++.++++|||||||.+++.+... ....++.+...++ T Consensus 1 ~~V~lVQ~Ni~~~~k~~~~~~~~~~~~~~~ls~~~~~~~~dlIVwPEta~p~~~~~-----------~~~~~~~~~~~~~ 69 (270) T cd07571 1 LRVALVQGNIPQDEKWDPEQRQATLDRYLDLTRELADEKPDLVVWPETALPFDLQR-----------DPDALARLARAAR 69 (270) T ss_pred CEEEEEECCCCHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCHHH-----------CHHHHHHHHHHHH T ss_conf 98999907998145128546999999999999877617999999489466765111-----------8899999999987 Q ss_pred HCCCEEEEEEEEEECC--EEEEEEEEEE-CCEEEEEEEEEEECCCCCC---------------CCCCCCCCCCCCCEEEE Q ss_conf 7697899999998889--8789999996-9979998877762355433---------------32100377888761476 Q gi|254780999|r 79 DGGAGIVVGFPRQDQE--GVLNSVVILD-AGNIIAVRDKINLPNYSEF---------------HEKRTFISGYSNDPIVF 140 (562) Q Consensus 79 ~~~i~iivG~p~~~~~--~~yNsa~vi~-~G~il~~y~K~~LP~y~~F---------------dE~r~F~~G~~~~~~~~ 140 (562) +.+..+++|.+..+++ +.||+++++. +|++..+|+|+||-.+++| .+...|.+|+...++.+ T Consensus 70 ~~~~~li~G~~~~~~~~~~~yNs~~~~~~~g~~~~~Y~K~hLVPFGEyiP~~~~~~~~~~~~~~~~~~~~~G~~~~~~~~ 149 (270) T cd07571 70 AVGAPLLTGAPRREPGGGRYYNSALLLDPGGGILGRYDKHHLVPFGEYVPLRDLLRFLGLLFDLPMGDFSPGTGPQPLLL 149 (270) T ss_pred HCCCEEEEEEEEEECCCCCEEEEEEEECCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCEEC T ss_conf 55997999554565698867889999989998420784332206541366177777788752466586314999864652 Q ss_pred CC-EEEEECCCCCCCHHHHHHHHHHHCCCCEEEECCC-CCCCC-CCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCC Q ss_conf 47-4562011111110168999997147775993155-53335-807888788999854337046421356887662104 Q gi|254780999|r 141 RD-IRLGILICEDIWKNSNICKHLKKQGAEFLFSLNA-SPYYH-NKLKKRHEIVTGQISHVHLPIIYVNQVGGQDELIFD 217 (562) Q Consensus 141 ~g-~kiGv~ICeDlw~~~~~~~~la~~Gadlii~psA-Sp~~~-~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~ 217 (562) .+ .++|++||||..+++ ..+.++.+||++|+|++- +.|.. ....+...+.+.||.|++.|++.++.. T Consensus 150 ~~~~~~~~~ICYE~~f~~-~~r~~~~~ga~~lv~~tNdaWfg~s~~~~Qhl~~ar~RAiE~~r~~vr~~nt--------- 219 (270) T cd07571 150 GGGVRVGPLICYESIFPE-LVRDAVRQGADLLVNITNDAWFGDSAGPYQHLAMARLRAIETGRPLVRAANT--------- 219 (270) T ss_pred CCCCEEEEEEEEHHHCHH-HHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEECCC--------- T ss_conf 798258889975343649-9999864487099997366456788417999999898899829963986696--------- Q ss_pred CCCEEECCCCEEEEECCCCCCCEEEEEEEECC Q ss_conf 86435558985999616777637999998335 Q gi|254780999|r 218 GASFCFDGQQQLAFQMKHFSEQNFMTEWHYDQ 249 (562) Q Consensus 218 G~S~I~d~~G~ii~~~~~f~e~~~i~d~d~d~ 249 (562) |-|.++||.|+++.+.+.|+++++..++.... T Consensus 220 GiSa~Id~~G~i~~~~~~~~~~~l~~~v~~~~ 251 (270) T cd07571 220 GISAVIDPDGRIVARLPLFEAGVLVAEVPLRT 251 (270) T ss_pred EEEEEECCCCCEEEECCCCCCEEEEEEEECCC T ss_conf 26899999987888747998279999950588 No 41 >KOG0806 consensus Probab=99.98 E-value=6.4e-32 Score=227.97 Aligned_cols=248 Identities=17% Similarity=0.146 Sum_probs=202.4 Q ss_pred CCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHH------HHHHHHHH Q ss_conf 886289999638836889999999999999999859989991664447998445414889999999------99999999 Q gi|254780999|r 2 LKKLKIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACS------SAIDTLKS 75 (562) Q Consensus 2 ~~~mkIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~------~~l~~La~ 75 (562) +.++++|++|+....++...|++.....+++|+++|++||||||++++||...+-+. .+++... .....+.. T Consensus 11 ~~~~~~a~vq~~~~l~~~~~Ni~~~~~~i~~aa~~g~~iIv~PE~~~~gy~~~~sf~--py~E~i~~~~~~~ps~~~ls~ 88 (298) T KOG0806 11 LPNATEALVSLEEALLLMNENIDILEKAVKEAAKQGAKIIVFPEDGLYGYNFTESFY--PYLEDIPDPGCRDPSRQGLSE 88 (298) T ss_pred CCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCCCCCCCCC--CHHHHCCCCCCCCHHHHHHHH T ss_conf 534533221012211233465899999999998649809987313256524565312--144308996668836887688 Q ss_pred HHHHCCCEEEEEEEEEEC--CEEEEEEEEEE-CCEEEEEEEEEEECCCCC-----CCCCCCCCCCCCCCEEEECCEEEEE Q ss_conf 988769789999999888--98789999996-997999887776235543-----3321003778887614764745620 Q gi|254780999|r 76 DTHDGGAGIVVGFPRQDQ--EGVLNSVVILD-AGNIIAVRDKINLPNYSE-----FHEKRTFISGYSNDPIVFRDIRLGI 147 (562) Q Consensus 76 ~~~~~~i~iivG~p~~~~--~~~yNsa~vi~-~G~il~~y~K~~LP~y~~-----FdE~r~F~~G~~~~~~~~~g~kiGv 147 (562) .++++.+.++.|.++... ++.||+..++. +|+.++.|+|+||++.++ |.|...|.+|....++....+|||+ T Consensus 89 va~~~~~~~i~g~i~~~~~~~k~yns~~~~~~~g~l~~~yrk~hlFD~d~~~~~ry~e~~~~~~g~~f~~~~~~~gkfGi 168 (298) T KOG0806 89 VAERLSCYIIGGSIEEEALGDKLYNSCADSSCPGDGLAKYRKNHLFDTDGPGVIRYRESHLLSPGDQFTVVDTSYGKFGI 168 (298) T ss_pred HHHHCEEEEECCCCHHHCCCCCCCCCCCCCCCCCCHHHEEEEEEEECCCCCCCEEEEEEECCCCCCCCCCCCCCCCCEEE T ss_conf 87744589965761431146644575122588805112452347733678662022321013776667744578871579 Q ss_pred CCCCCCCHHHHHHHHHHHCCCCEEEECCCCC---CCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCC-CCCCCCC-CCEE Q ss_conf 1111111016899999714777599315553---335807888788999854337046421356887-6621048-6435 Q gi|254780999|r 148 LICEDIWKNSNICKHLKKQGAEFLFSLNASP---YYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQ-DELIFDG-ASFC 222 (562) Q Consensus 148 ~ICeDlw~~~~~~~~la~~Gadlii~psASp---~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~-d~lvf~G-~S~I 222 (562) .||+|+|+++ +++.++++||++|+.|.|+. -..+. ..+..+.++|+-.+.+++..+|..+.. ..+...| +|.+ T Consensus 169 ~IC~Di~F~d-~A~~~~~~g~~~ivyPtaw~~~~l~~~~-~hw~~~~~~~a~~n~~~v~~~s~~~~~s~~y~~~gshs~~ 246 (298) T KOG0806 169 FICFDIRFYD-PAMILVKDGADLIVYPTAWNNELLSAVP-LHWALLMRARANDNAANVHAPSPARTGSGIYAPRGSHSIM 246 (298) T ss_pred EEEECCCCCC-HHHHHHHCCCCEEEECCHHHHHCCCCCC-HHHHHHHHCCCCCCEEEEECCCCCCCCCEEEECCCCCEEE T ss_conf 9983145523-6799997188489936067400146651-5799997477552203321157676786024158763220 Q ss_pred ECCCCEEEEECCCCCCCEEEEEEEECCCCCCC Q ss_conf 55898599961677763799999833521236 Q gi|254780999|r 223 FDGQQQLAFQMKHFSEQNFMTEWHYDQQLSQW 254 (562) Q Consensus 223 ~d~~G~ii~~~~~f~e~~~i~d~d~d~~~~~~ 254 (562) .+|.|++++.... ++.+...++|.+...+.+ T Consensus 247 ~~p~gkvl~a~~~-~~e~~~a~~d~~~~~~~r 277 (298) T KOG0806 247 VNPTGKVLAAAVE-KEEIIYADVDPSAIASRR 277 (298) T ss_pred CCCCCEEEEECCC-CCCCCCCCCCHHHHHHHH T ss_conf 2885247630257-776312447989988886 No 42 >PRK13287 amiF formamidase; Provisional Probab=99.97 E-value=1.1e-29 Score=213.47 Aligned_cols=268 Identities=15% Similarity=0.149 Sum_probs=202.2 Q ss_pred CCEEEEEEECCC----CCCCHHHHHHHHHHHHHHHHHC--CCCEEEECCCCCCCCCHHHHHCCHHHHHH-HHHHHHHHHH Q ss_conf 862899996388----3688999999999999999985--99899916644479984454148899999-9999999999 Q gi|254780999|r 3 KKLKIAIAQLNP----VVGDIAGNIAKARRAREEANRQ--GMDLILFTELFISGYPPEDLVFKKSFIQA-CSSAIDTLKS 75 (562) Q Consensus 3 ~~mkIal~Q~n~----~~gd~~~N~~~i~~~i~~A~~~--gadLvvfPEl~LtGY~~~Dl~~~~~f~~~-~~~~l~~La~ 75 (562) +.+-||++|... +..|+..|+++|.+++..++.. |.|||||||.++.|....-. ...++... -....+.+++ T Consensus 12 d~v~v~~vqy~~P~~~t~~ev~~n~~~I~~~v~~~k~G~PgmDLvVFPEYSt~G~~~d~~-~~~e~~~~ipGpet~~f~~ 90 (333) T PRK13287 12 EGVLVGLIQYPVPVVNSREDIDKQIEQIIKTVHKTKAGYPGMDLIVFPEYSTQGLNTKKW-TTEEFLCTVDGPEVDAFAQ 90 (333) T ss_pred CCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCCCCCCHH-HHHHHHHCCCCHHHHHHHH T ss_conf 856999995278866899999999999999999975689997489802413466678734-5577761499778999999 Q ss_pred HHHHCCCEEEEEEEEEECC--EEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCCCCCC-CCEEEEC-CEEEEECCC Q ss_conf 9887697899999998889--8789999996-997999887776235543332100377888-7614764-745620111 Q gi|254780999|r 76 DTHDGGAGIVVGFPRQDQE--GVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFISGYS-NDPIVFR-DIRLGILIC 150 (562) Q Consensus 76 ~~~~~~i~iivG~p~~~~~--~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~~G~~-~~~~~~~-g~kiGv~IC 150 (562) .+++.+++.+++..++..+ +.||++++|+ +|+|+.+|||+|.- +=.|. | .||+. ..+++.. |.|+|+.|| T Consensus 91 acr~~~vwgvfSi~E~np~~~~pYNT~iLIn~~Geiv~kYRKi~PW---vPiE~-W-~PGd~~~pV~dgPkG~KI~liIC 165 (333) T PRK13287 91 ACKENKVWGVFSIMERNPDGNEPYNTAIIIDDQGEIILKYRKLHPW---VPVEP-W-EPGDLGIPVCDGPGGSKLAVCIC 165 (333) T ss_pred HHHHCCEEEEEEEEEECCCCCCCCEEEEEECCCCCEEEEEEECCCC---CCCCC-C-CCCCCCCEEEECCCCCEEEEEEE T ss_conf 9997794899998887689998630289988998188899822565---56777-3-58986515654688757899997 Q ss_pred CCCCHHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEECCCCEEE Q ss_conf 11110168999997147775993155533358078887889998543370464213568876621048643555898599 Q gi|254780999|r 151 EDIWKNSNICKHLKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDELIFDGASFCFDGQQQLA 230 (562) Q Consensus 151 eDlw~~~~~~~~la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~G~ii 230 (562) +|.|+|| ++|.++.+||+||+-+++ |..-..+++.-..++.|-.|.++++.+|.+|.++.+.|-|+|+|++-+|+++ T Consensus 166 ~DG~fPE-iaRe~A~kGAEviiR~~G--Ym~p~~~qw~~t~k~~A~~N~~yva~vn~~G~Dgv~~yfG~s~i~~fDG~~L 242 (333) T PRK13287 166 HDGMFPE-MAREAAYKGANVMIRISG--YSTQVNDQWILTNRSNAWQNLMYTASVNLAGYDGVFYYFGEGQVCNFDGTTL 242 (333) T ss_pred CCCCCHH-HHHHHHHCCCEEEEEECC--CCCCHHHHHHHHHHHHHHHHCCEEEEEEECCCCCCEEEECCCEEECCCCCEE T ss_conf 5899818-999987468879998357--6561356799999998886255799975025676414404616966788682 Q ss_pred EECCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCHHHHHHHHHHHCCCCEEEEECCCCCH Q ss_conf 96167776379999983352123655575555544789998886422020358887642245405752158723 Q gi|254780999|r 231 FQMKHFSEQNFMTEWHYDQQLSQWNYMSDDSASTMYIPLQEEEADYNACVLSLRDYVQKNNFHKVIIGLSGGID 304 (562) Q Consensus 231 ~~~~~f~e~~~i~d~d~d~~~~~~~~~~~~~~~~~~~p~~~~e~~~~Alvlgl~d~~~~~g~~~~viglSGGiD 304 (562) .++..-+.++..+++..+.... .+.. -++..++-+-+-+|++ |+.||.- T Consensus 243 ~~~~~~~~~i~~Ael~~~~ir~-------------------aR~~-----~~~eN~ly~l~HRGYt-~v~gG~~ 291 (333) T PRK13287 243 VQGHRNPWEIVTAEVRPDLADE-------------------ARLG-----WGLENNIYNLGHRGYV-AVPGGAK 291 (333) T ss_pred EECCCCCCCEEEEEECHHHHHH-------------------HHHC-----CCHHHHHHHHCCCCEE-CCCCCCC T ss_conf 1047888824999966999999-------------------9864-----1011013441245400-4658623 No 43 >PRK00302 lnt apolipoprotein N-acyltransferase; Reviewed Probab=99.95 E-value=1.1e-25 Score=187.16 Aligned_cols=224 Identities=20% Similarity=0.289 Sum_probs=167.1 Q ss_pred CCEEEEEEECCCCCC------CHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 862899996388368------89999999999999999859989991664447998445414889999999999999999 Q gi|254780999|r 3 KKLKIAIAQLNPVVG------DIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSD 76 (562) Q Consensus 3 ~~mkIal~Q~n~~~g------d~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~ 76 (562) ++++|+++|.|.... +.+.++++..++.+++ .+++|+||+||.+++-+. +...+..+.+.+. T Consensus 224 ~~l~V~iVQpNi~q~~k~~~~~~~~~~~~~~~lt~~~-~~~~dlIvWPEta~p~~~-----------~~~~~~~~~i~~~ 291 (509) T PRK00302 224 EPLKVALVQGNIPQDLKWDPAGREATLEKYLDLSEPG-LGPADLIIWPETAIPFLE-----------ENPPPFLAAIDAL 291 (509) T ss_pred CCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCC-CCCCCEEEEECCCCCCCH-----------HHCHHHHHHHHHH T ss_conf 8549999927988100247345999999999976403-688868996376676641-----------1158999999999 Q ss_pred HHHCCCEEEEEEEEEECC---EEEEEEEEEE-CCEEEEEEEEEEECCCCCCCC---------------CCCCCCCCCCC- Q ss_conf 887697899999998889---8789999996-997999887776235543332---------------10037788876- Q gi|254780999|r 77 THDGGAGIVVGFPRQDQE---GVLNSVVILD-AGNIIAVRDKINLPNYSEFHE---------------KRTFISGYSND- 136 (562) Q Consensus 77 ~~~~~i~iivG~p~~~~~---~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE---------------~r~F~~G~~~~- 136 (562) +++.+..+++|.+..+++ +.|||+++++ +|++..+|+|+||-.+|||-- ..-|++|++.. T Consensus 292 ~~~~~~~ll~G~~~~~~~~~~~~yNs~~~~~~~g~~~~~YdK~~LVPFGEyiP~~~~~~~~~~~~~~~~~~~~~G~~~~~ 371 (509) T PRK00302 292 LREKGSPLITGAVGADKGGQGRYYNSIYVLGPNGEILGRYDKHHLVPFGEYVPLESLLRPLVPFFALPMGDFSRGPGVQP 371 (509) T ss_pred HHHCCCEEEEEEEECCCCCCCCEEEEEEEECCCCCCCCEECCEEEECCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCC T ss_conf 87569849998511004887667889999879997101663111026632267598888899984566466778988776 Q ss_pred EEEECCEEEEECCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCC--CCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCC Q ss_conf 14764745620111111101689999971477759931555333--5807888788999854337046421356887662 Q gi|254780999|r 137 PIVFRDIRLGILICEDIWKNSNICKHLKKQGAEFLFSLNASPYY--HNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDEL 214 (562) Q Consensus 137 ~~~~~g~kiGv~ICeDlw~~~~~~~~la~~Gadlii~psASp~~--~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~l 214 (562) ++..++.++|++||||.-+++ ..+..+.+|||+|+|++.--|. .....+...+.+-||.|++.|++-+... T Consensus 372 ~~~~~~~~~~~~ICyE~~f~~-~~r~~~~~ga~~lvn~TND~Wfg~s~~~~QHl~~ar~RAiE~~r~~vraant------ 444 (509) T PRK00302 372 PLLAKGFKLAPLICYEIIFPE-LVRRYKAQGADLLLNITNDAWFGDSIGPYQHFQMARMRALELGRPLIRATNT------ 444 (509) T ss_pred CCCCCCCEEEEEEEEECCCHH-HHHHHHHCCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEECCC------ T ss_conf 535788347778886320679-9997641588589991576545798558999999999998547887998186------ Q ss_pred CCCCCCEEECCCCEEEEECCCCCCCEEEEEEEEC Q ss_conf 1048643555898599961677763799999833 Q gi|254780999|r 215 IFDGASFCFDGQQQLAFQMKHFSEQNFMTEWHYD 248 (562) Q Consensus 215 vf~G~S~I~d~~G~ii~~~~~f~e~~~i~d~d~d 248 (562) |-|.++||.|+++.+.+.|+++++..++... T Consensus 445 ---GiSa~Id~~G~i~~~~~~~~~~~l~~~vp~~ 475 (509) T PRK00302 445 ---GITAVIDPLGRIIAQLPLFTEGVLDGTVPPT 475 (509) T ss_pred ---EEEEEECCCCCEEEECCCCCCEEEEEEECCC T ss_conf ---4689999998888763899736999993178 No 44 >PRK13286 amiE acylamide amidohydrolase; Provisional Probab=99.94 E-value=6.2e-25 Score=182.26 Aligned_cols=234 Identities=13% Similarity=0.136 Sum_probs=180.7 Q ss_pred CCEEEEEEECCC----CCCCHHHHHHHHHHHHHHHHHC--CCCEEEECCCCCCCCCHHHHHCCHHH-HHHH----HHHHH Q ss_conf 862899996388----3688999999999999999985--99899916644479984454148899-9999----99999 Q gi|254780999|r 3 KKLKIAIAQLNP----VVGDIAGNIAKARRAREEANRQ--GMDLILFTELFISGYPPEDLVFKKSF-IQAC----SSAID 71 (562) Q Consensus 3 ~~mkIal~Q~n~----~~gd~~~N~~~i~~~i~~A~~~--gadLvvfPEl~LtGY~~~Dl~~~~~f-~~~~----~~~l~ 71 (562) +.+-||++|... +..|+..|.++|.+++..++.. |.|||||||+++.|-.-. +.- .+.. ....+ T Consensus 11 d~v~vavvqy~~P~~~t~~ev~~n~~~I~~mv~~~k~G~PgmDLiVFPEYSt~Gi~yd-----~~~~~e~a~tipGpEt~ 85 (343) T PRK13286 11 DTVGVAVVNYKMPRLHTKAEVLENARKIADMIVGMKQGLPGMDLVIFPEYSTHGIMYD-----RQEMYDTASTIPGEETA 85 (343) T ss_pred CCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCEEECCCCCCCCCCCC-----HHHHHHHHHCCCCHHHH T ss_conf 8559999952787668999999999999999999756999974797433003666678-----57888656359977899 Q ss_pred HHHHHHHHCCCEEEEEEE-EEE----CCEEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEE-CCEE Q ss_conf 999998876978999999-988----898789999996-997999887776235543332100377888761476-4745 Q gi|254780999|r 72 TLKSDTHDGGAGIVVGFP-RQD----QEGVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFISGYSNDPIVF-RDIR 144 (562) Q Consensus 72 ~La~~~~~~~i~iivG~p-~~~----~~~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~~~~~-~g~k 144 (562) .+++.+++.+++-++..- +++ +...||+.++|+ +|+|+.+|||+|. .+=-|. +.||+...+.+- +|.| T Consensus 86 ~f~~ACr~~~vWGvFSi~ge~he~~P~k~PyNt~ilInd~GEIv~kYRKi~P---WvPiE~--W~PGd~~~V~eGPkG~K 160 (343) T PRK13286 86 IFAAACRKAKVWGVFSLTGERHEEHPRKAPYNTLILINDKGEIVQKYRKIMP---WCPIEG--WYPGDCTYVSEGPKGLK 160 (343) T ss_pred HHHHHHHHCCEEEEEEEECCCCCCCCCCCCCCEEEEECCCCHHHHHHHHCCC---CCCCCC--CCCCCCCEEEECCCCCE T ss_conf 9999998679589999622332338888985238998798612121010567---665568--65786207831788757 Q ss_pred EEECCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEEC Q ss_conf 62011111110168999997147775993155533358078887889998543370464213568876621048643555 Q gi|254780999|r 145 LGILICEDIWKNSNICKHLKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDELIFDGASFCFD 224 (562) Q Consensus 145 iGv~ICeDlw~~~~~~~~la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d 224 (562) +++.||.|.-+|+ .+|..+.+||+|++-+++ |..-..+++.-..++.|-.|.++++.+|..|.++.+.|-|+|+|++ T Consensus 161 isliIC~DG~yPE-i~RecA~KGAeliiR~~G--YmyP~~~qw~~tnk~~A~~N~~Yva~vn~aG~Dgvy~yfG~s~i~~ 237 (343) T PRK13286 161 ISLIICDDGNYPE-IWRDCAMKGAELIVRCQG--YMYPAKEQQVLIAKAMAWANNCYVAVANAAGFDGVYSYFGHSAIIG 237 (343) T ss_pred EEEEEECCCCCHH-HHHHHHHCCCEEEEEECC--CCCHHHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCCCCEEEEC T ss_conf 8899966998868-999987468879997335--6661777899988888875131577774035787511358706966 Q ss_pred CCCEEEEECCCCCCCEEEEEEEECC Q ss_conf 8985999616777637999998335 Q gi|254780999|r 225 GQQQLAFQMKHFSEQNFMTEWHYDQ 249 (562) Q Consensus 225 ~~G~ii~~~~~f~e~~~i~d~d~d~ 249 (562) -+|+++.++..-+.++..+++..+. T Consensus 238 fDG~tL~e~g~~~~~i~~Ael~~~~ 262 (343) T PRK13286 238 FDGRTLGECGEEEMGIQYAQLSKSL 262 (343) T ss_pred CCCCEEEECCCCCCCEEEEEECHHH T ss_conf 8987742047887723899965999 No 45 >KOG0805 consensus Probab=99.93 E-value=5.7e-24 Score=176.01 Aligned_cols=238 Identities=18% Similarity=0.215 Sum_probs=188.9 Q ss_pred CCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHH----------CCHHHHHHHHH--- Q ss_conf 88628999963883688999999999999999985998999166444799844541----------48899999999--- Q gi|254780999|r 2 LKKLKIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLV----------FKKSFIQACSS--- 68 (562) Q Consensus 2 ~~~mkIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~----------~~~~f~~~~~~--- 68 (562) |...|++++|....++|....++|++.++.+|+++||.||+|||-++-|||-+.-| -+..|...... T Consensus 15 ~s~~~v~ivQ~~t~~~dtpaTL~K~~~~~~Eaa~~Ga~LV~fPEAfiGGYPrg~~Fg~~~G~r~~eGR~ef~kY~a~AIe 94 (337) T KOG0805 15 SSIVRVTIVQASTVYNDTPATLDKAEKYIVEAASKGAELVLFPEAFIGGYPRGFRFGLAVGVRNEEGRDEFRKYHASAIE 94 (337) T ss_pred CCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEHHHHCCCCCCCEEEEEEEECCHHHHHHHHHHHHHHHC T ss_conf 55247999972667889877899999999998607854998116860688777503588850362317999999887750 Q ss_pred ----HHHHHHHHHHHCCCEEEEEEEEEECCEEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCC--CCCCCCEEEEC Q ss_conf ----999999998876978999999988898789999996-99799988777623554333210037--78887614764 Q gi|254780999|r 69 ----AIDTLKSDTHDGGAGIVVGFPRQDQEGVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFI--SGYSNDPIVFR 141 (562) Q Consensus 69 ----~l~~La~~~~~~~i~iivG~p~~~~~~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~--~G~~~~~~~~~ 141 (562) .+++|+..+++.++.+++|.+++++..+|.++++++ .|..++++||. +|+-. |.-.+- -|...++|++. T Consensus 95 v~gpEv~~l~~la~~~~v~lv~G~iEreg~TLYCt~~f~~p~g~~lGKHRKl-mPTal---ERciWGqGDGSTiPV~dT~ 170 (337) T KOG0805 95 VPGPEVERLAELAKKNNVYLVMGAIEREGYTLYCTVLFFSPQGQFLGKHRKL-MPTAL---ERCIWGQGDGSTIPVYDTP 170 (337) T ss_pred CCCHHHHHHHHHHHCCCEEEEEEEEECCCCEEEEEEEEECCCCCCCCCCCCC-CCCHH---HHEEECCCCCCCCCEEECC T ss_conf 7984799999986147758999898405617999999987877611000000-64144---5505505888422023045 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCC------------ Q ss_conf 74562011111110168999997147775993155533358078887889998543370464213568------------ Q gi|254780999|r 142 DIRLGILICEDIWKNSNICKHLKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVG------------ 209 (562) Q Consensus 142 g~kiGv~ICeDlw~~~~~~~~la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vG------------ 209 (562) -.|||-.||++-+.|. ....|-.+|++|.+.|.+ .....|..-++..|.|-+|+++.++|-- T Consensus 171 iGKIG~AICWEN~MPl-~R~alY~KgieIycAPT~-----D~r~~w~~sM~~IAlEG~cFvlSA~QF~k~~d~p~~peyl 244 (337) T KOG0805 171 IGKIGAAICWENRMPL-YRTALYAKGIEIYCAPTA-----DGRKEWQSSMLHIALEGGCFVLSACQFCKRKDFPDHPDYL 244 (337) T ss_pred CCCCCEEEECCCCCHH-HHHHHHHCCCEEEECCCC-----CCCHHHHHHHHHHEECCCEEEEEHHHHCCCCCCCCCCHHH T ss_conf 5400101211133679-999987558579962677-----8618899864660102746999821204312589970442 Q ss_pred --------CCCCCCCCCCCEEECCCCEEEEECCCCCCCEEEEEEEECC Q ss_conf --------8766210486435558985999616777637999998335 Q gi|254780999|r 210 --------GQDELIFDGASFCFDGQQQLAFQMKHFSEQNFMTEWHYDQ 249 (562) Q Consensus 210 --------g~d~lvf~G~S~I~d~~G~ii~~~~~f~e~~~i~d~d~d~ 249 (562) +.|..+-.|+|.|++|-|++++.-..-+|.++++|+|++- T Consensus 245 ~~~~~~~k~pD~vv~~GGSviI~PlG~VlagP~~~~EgL~tadldl~d 292 (337) T KOG0805 245 FTDWYDDKEPDSVVSQGGSVIISPLGQVLAGPNFESEGLITADLDLGD 292 (337) T ss_pred CCCCHHHCCCCCCEECCCCEEECCCCCEECCCCCCCCCEEEEECCCHH T ss_conf 262012028886234697388755423322688676603887306026 No 46 >TIGR00546 lnt apolipoprotein N-acyltransferase; InterPro: IPR004563 Apolipoprotein N-acyltransferase 2.3.1 from EC transfers the acyl group to lipoproteins and is involved in lipoprotein biosynthesis. It is an integral membrane protein.; GO: 0016410 N-acyltransferase activity, 0042158 lipoprotein biosynthetic process, 0016021 integral to membrane. Probab=99.90 E-value=1.3e-22 Score=167.21 Aligned_cols=214 Identities=21% Similarity=0.318 Sum_probs=162.0 Q ss_pred CCCEEEEEEECCCCCC------CHHHHHHHHHHHHHHHHHC-------CCCEEEECCCCCCCCCHHHHHCCHHHHHH--- Q ss_conf 8862899996388368------8999999999999999985-------99899916644479984454148899999--- Q gi|254780999|r 2 LKKLKIAIAQLNPVVG------DIAGNIAKARRAREEANRQ-------GMDLILFTELFISGYPPEDLVFKKSFIQA--- 65 (562) Q Consensus 2 ~~~mkIal~Q~n~~~g------d~~~N~~~i~~~i~~A~~~-------gadLvvfPEl~LtGY~~~Dl~~~~~f~~~--- 65 (562) .+++||+++|.|.-.- .++.+++.+.++++++.++ +.||||+||.+++=+. . ..+.-.++ T Consensus 177 ~~~~~ValvQ~~~p~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dlvvwPEta~~~~l-~---~~~~~~~~~Q~ 252 (441) T TIGR00546 177 EPTLKVALVQPNIPQDLKFDKEGLEAILELLTSLTKQAVEKVHAGLYEKPDLVVWPETALPFDL-E---NSPDKLHESQS 252 (441) T ss_pred CCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEECCHHHCCCC-C---CCCHHCCCCCC T ss_conf 7326899986888800134110179999999999885225667677787327884615332366-6---43211033673 Q ss_pred HHHHHHHHHHHHHHCCCEEEEEEEEEECC-------EEEEEEEEEEC-C----EEEEEEEEEEE-------C--CCCCC- Q ss_conf 99999999999887697899999998889-------87899999969-9----79998877762-------3--55433- Q gi|254780999|r 66 CSSAIDTLKSDTHDGGAGIVVGFPRQDQE-------GVLNSVVILDA-G----NIIAVRDKINL-------P--NYSEF- 123 (562) Q Consensus 66 ~~~~l~~La~~~~~~~i~iivG~p~~~~~-------~~yNsa~vi~~-G----~il~~y~K~~L-------P--~y~~F- 123 (562) ..+.++.+.....+.++.+++|++...++ ..|||++++++ | +++.+|+|+|| | =...| T Consensus 253 ~f~~~~~~~~~~~~~~~p~~~G~~~~~~~~~~~~~~~~yNs~~~~~~~~~a~~~~~~~Y~K~~LVPFGEy~P~~~~~~f~ 332 (441) T TIGR00546 253 AFDLLKALKALVLSKGIPILIGAPDADPKGQKGEEYEYYNSVYLLDPGGKAVLEVVQRYDKVKLVPFGEYIPFGLKFVFK 332 (441) T ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCEEEEECCCCCCCCCHHHHCCCEEECCCCCCCCCCCHHHHH T ss_conf 78999999999874098489963010378888874311125899848862003505353874543562326776333444 Q ss_pred ---------CCCCCCCCCCCCCEEEECC--EEEEECCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCC--CHHHHHHH Q ss_conf ---------3210037788876147647--4562011111110168999997147775993155533358--07888788 Q gi|254780999|r 124 ---------HEKRTFISGYSNDPIVFRD--IRLGILICEDIWKNSNICKHLKKQGAEFLFSLNASPYYHN--KLKKRHEI 190 (562) Q Consensus 124 ---------dE~r~F~~G~~~~~~~~~g--~kiGv~ICeDlw~~~~~~~~la~~Gadlii~psASp~~~~--k~~~R~~~ 190 (562) .+.+.|++|....++...+ +++++.||||.-+++ ..|..+++||++|+|+|.--|... ...+...+ T Consensus 333 ~~~~~~~~~~~~~~~~~G~~~~v~~~~~~~~~~~~~ICYE~~Fp~-~~r~~~~~Ga~~L~~~tNDaWf~~~~~~~QH~~~ 411 (441) T TIGR00546 333 WLSKLFFLASEQESFARGNGPQVLKLPGGAVKLAPLICYESIFPE-LVRALARQGAELLVNLTNDAWFGDSSGPWQHFAL 411 (441) T ss_pred HHHHHHHHCCCCCCHHCCCCCCCCCCCCCCEEEEEEEEHHHHHHH-HHHHHCCCCCEEEEEEECCCCCCCCHHHHHHHHH T ss_conf 776543200222331127887653447760277753115541168-8987425883289974005643666167889999 Q ss_pred HHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEECCCCEE Q ss_conf 999854337046421356887662104864355589859 Q gi|254780999|r 191 VTGQISHVHLPIIYVNQVGGQDELIFDGASFCFDGQQQL 229 (562) Q Consensus 191 ~~~~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~G~i 229 (562) .+-||.|++.+++-++.. |-|.++||.|++ T Consensus 412 ~~~RA~E~~~~~~Ra~Nt---------G~sa~i~p~G~~ 441 (441) T TIGR00546 412 ARFRAIENGRPLVRAANT---------GISAVIDPRGRI 441 (441) T ss_pred HHHHHHHCCCEEEEEECC---------CCEEEEECCCCC T ss_conf 888876448548999606---------625898067889 No 47 >COG0815 Lnt Apolipoprotein N-acyltransferase [Cell envelope biogenesis, outer membrane] Probab=99.90 E-value=1e-21 Score=161.31 Aligned_cols=224 Identities=19% Similarity=0.276 Sum_probs=157.7 Q ss_pred CEEEEEEECCCC---CCCHHHHHHHHHH---HHHHHH--HCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 628999963883---6889999999999---999999--85998999166444799844541488999999999999999 Q gi|254780999|r 4 KLKIAIAQLNPV---VGDIAGNIAKARR---AREEAN--RQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKS 75 (562) Q Consensus 4 ~mkIal~Q~n~~---~gd~~~N~~~i~~---~i~~A~--~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~ 75 (562) .++|+++|.|.. ..|.++-...+.. ....+. ..++|+||+||.+++ +... +..+...++.+ T Consensus 227 ~~~V~lvQ~nI~q~lk~~~~~~~~~~~~~~~~~~~~~~~~~~~dlVIwPEtA~p-~~~~----------~~~~~~~~~~~ 295 (518) T COG0815 227 TLTVALVQGNIPQDLKWDADALARLIAGYLEEEFLAAVDKQKPDLVVWPETALP-FDLT----------RHPDALARLAE 295 (518) T ss_pred CEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEECCCCCCC-CCHH----------HCCHHHHHHHH T ss_conf 547999647885002678888988887654333100123789888983675555-5411----------13127889999 Q ss_pred HHHHCCCEEEEEEEEE--ECCE--EEEEEEEEEC-CEEEEEEEEEEECCCCCCC-------------C--CCCCCCCCCC Q ss_conf 9887697899999998--8898--7899999969-9799988777623554333-------------2--1003778887 Q gi|254780999|r 76 DTHDGGAGIVVGFPRQ--DQEG--VLNSVVILDA-GNIIAVRDKINLPNYSEFH-------------E--KRTFISGYSN 135 (562) Q Consensus 76 ~~~~~~i~iivG~p~~--~~~~--~yNsa~vi~~-G~il~~y~K~~LP~y~~Fd-------------E--~r~F~~G~~~ 135 (562) .....++.+++|.... .+++ .|||++++++ |++..+|+|+||-..|||- . ...|.+|... T Consensus 296 ~~~~~~~~~iiG~~~~~~~~~~~~yyNSv~~~~~~~~~~~~ydK~~LVPFGEYiP~~~~l~~~~~~~~~~~~~f~~G~~~ 375 (518) T COG0815 296 ALQRVGAPLLIGTDVDGPAPGGGIYYNSVLVLDPGGEGVYRYDKVHLVPFGEYIPFPELLRPLYFFLNLPMSDFSRGPGP 375 (518) T ss_pred HHHCCCCCEEEEECCCCCCCCCCCEEEEEEEECCCCCCCCEECCEEEECCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC T ss_conf 98604996899603445688875312479996689875550043443077057560777667777634552345579988 Q ss_pred CEEEECC-EEEEECCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCCC--HHHHHHHHHHHHHHCCCEEEECCCCCCCC Q ss_conf 6147647-45620111111101689999971477759931555333580--78887889998543370464213568876 Q gi|254780999|r 136 DPIVFRD-IRLGILICEDIWKNSNICKHLKKQGAEFLFSLNASPYYHNK--LKKRHEIVTGQISHVHLPIIYVNQVGGQD 212 (562) Q Consensus 136 ~~~~~~g-~kiGv~ICeDlw~~~~~~~~la~~Gadlii~psASp~~~~k--~~~R~~~~~~~a~e~~~~vvy~N~vGg~d 212 (562) .++.+.+ .|+++.||||.-+++ ..|....+|+|+|+|+|.--|.-.. ..+...+.+.||.|++.|++-+... T Consensus 376 ~v~~~~~~~~~~~~ICYE~~F~~-~~r~~~~qga~~Lin~SNDAWf~~s~~p~QH~~~a~~RAiE~grp~iRAtNt---- 450 (518) T COG0815 376 QVLLLAGGPKIAPLICYEAIFPE-LVRASARQGAELLLNLSNDAWFGGSWGPYQHFQQARVRAVELGRPLVRATNT---- 450 (518) T ss_pred CCCCCCCCCEEEEEEEEEHHCHH-HHHHHHCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCC---- T ss_conf 76113888445025664223339-8887541776289975565345999338899999999988608728997178---- Q ss_pred CCCCCCCCEEECCCCEEEEECCCCCCCEEEEEEEEC Q ss_conf 621048643555898599961677763799999833 Q gi|254780999|r 213 ELIFDGASFCFDGQQQLAFQMKHFSEQNFMTEWHYD 248 (562) Q Consensus 213 ~lvf~G~S~I~d~~G~ii~~~~~f~e~~~i~d~d~d 248 (562) |-|.|+||.|+++++.+.|+.+++..++... T Consensus 451 -----GiSavIdp~Gri~~~l~~~~~~~l~~~v~~~ 481 (518) T COG0815 451 -----GISAVIDPRGRILAQLPYFTRGVLDATVPLK 481 (518) T ss_pred -----CEEEEECCCCCEEEECCCCCCCEEEEECCCC T ss_conf -----4058999999766425787741023311345 No 48 >cd07567 biotinidase_like biotinidase and vanins (class 4 nitrilases). These secondary amidases participate in vitamin recycling. Biotinidase (EC 3.5.1.12) has both a hydrolase and a transferase activity. It hydrolyzes free biocytin or small biotinyl-peptides produced during the proteolytic degradation of biotin-dependent carboxylases, to release free biotin (vitamin H), and it can transfer biotin to acceptor molecules such as histones. Biotinidase deficiency in humans is an autosomal recessive disorder characterized by neurological and cutaneous symptoms. This subgroup includes the three human vanins, vanin1-3. Vanins are ectoenzymes, Vanin-1, and -2 are membrane associated, vanin-3 is secreted. They are pantotheinases (EC 3.5.1.92, pantetheine hydrolase), which convert pantetheine, to pantothenic acid (vitamin B5) and cysteamine (2-aminoethanethiol, a potent anti-oxidant). They are potential targets for therapeutic intervention in inflammatory disorders. Vanin-1 deficient mice lacking Probab=99.88 E-value=1.1e-20 Score=154.47 Aligned_cols=231 Identities=17% Similarity=0.149 Sum_probs=160.2 Q ss_pred EEEEECCCCC-CC------HHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHH----HCC-------------HHH Q ss_conf 9999638836-88------99999999999999998599899916644479984454----148-------------899 Q gi|254780999|r 7 IAIAQLNPVV-GD------IAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDL----VFK-------------KSF 62 (562) Q Consensus 7 Ial~Q~n~~~-gd------~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl----~~~-------------~~f 62 (562) .|++-..+.. ++ .+.|++.-+++|++|+++|+|||||||..|+|+..... +++ +.. T Consensus 3 AaVvE~~p~~~~~~~a~~~~~~Nl~~y~~~i~~a~~~~adIIVFPE~gl~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 82 (299) T cd07567 3 AAVVEHHPILSPDPDALQIMEKNLDIYEEIIKSAAKQGADIIVFPEDGLTGFIFTRFVIYPFLEDVPDPEVNWNPCLDPD 82 (299) T ss_pred EEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCC T ss_conf 99998725678997899999999999999999776559989988674446776577423521023799655666656844 Q ss_pred HHHHHHHHHHHHHHHHHCCCEEEEEEEEEEC------------CEEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCC Q ss_conf 9999999999999988769789999999888------------98789999996-9979998877762355433321003 Q gi|254780999|r 63 IQACSSAIDTLKSDTHDGGAGIVVGFPRQDQ------------EGVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTF 129 (562) Q Consensus 63 ~~~~~~~l~~La~~~~~~~i~iivG~p~~~~------------~~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F 129 (562) .......+.+|.-.+++..+++++++.++.. -..||+.++++ +|+++++|||.||- .|..+- T Consensus 83 ~~~~~~~l~~LSC~Ar~~~~YvV~Nl~ek~~C~~~~~~Cp~dg~~~yNTnVVFdr~G~vVaRYrK~nLf-----~E~~~~ 157 (299) T cd07567 83 RFDYTEVLQRLSCAARENSIYVVANLGEKQPCDSSDPHCPPDGRYQYNTNVVFDRDGTLIARYRKYNLF-----GEPGFD 157 (299) T ss_pred CCCHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCCCCCCCCCCCEEEECEEEECCCCCEEEEEECCCCC-----CCCCCC T ss_conf 455189999877776417869999476454477888899755527860307875888378653022214-----664557 Q ss_pred CCCC-CCCEEEECCE-EEEECCCCCCCHHHHHHHHHHHC-CCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECC Q ss_conf 7788-8761476474-56201111111016899999714-7775993155533358078887889998543370464213 Q gi|254780999|r 130 ISGY-SNDPIVFRDI-RLGILICEDIWKNSNICKHLKKQ-GAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVN 206 (562) Q Consensus 130 ~~G~-~~~~~~~~g~-kiGv~ICeDlw~~~~~~~~la~~-Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N 206 (562) .+.. +..+|++..+ |||+.||.|+-|.+ |+..|..+ |++=++.|++|--..- .-.-.+.-++.|..+++.++.+| T Consensus 158 ~~~~pe~~tF~TdFGV~FG~FiCFDiLF~~-PA~~Lv~~~~V~~iv~pt~W~s~lP-fLtAvq~q~aWA~~~~VNlLAA~ 235 (299) T cd07567 158 VPPEPEIVTFDTDFGVTFGIFTCFDILFKE-PALELVKKLGVDDIVFPTAWFSELP-FLTAVQIQQAWAYANGVNLLAAN 235 (299) T ss_pred CCCCCCCEEEECCCCCEEEEEEEEEECCCC-HHHHHHHHCCCCEEEECHHHHCCCC-CHHHHHHHHHHHHHCCCCEEECC T ss_conf 889875137866876345269974114556-1799998639885994667744571-06788899999987597089236 Q ss_pred CCCCCCCCCCCCCCEEECCC-CEEEEECCCC-CCCEEEEEEEE Q ss_conf 56887662104864355589-8599961677-76379999983 Q gi|254780999|r 207 QVGGQDELIFDGASFCFDGQ-QQLAFQMKHF-SEQNFMTEWHY 247 (562) Q Consensus 207 ~vGg~d~lvf~G~S~I~d~~-G~ii~~~~~f-~e~~~i~d~d~ 247 (562) .-- ...=.+||-|+.+. |.+......- +..++++++.. T Consensus 236 ~~~---p~~~~tGSGIY~g~~g~~~~~~~~~~~~kLlva~vpk 275 (299) T cd07567 236 YNN---PSAGMTGSGIYAGRSGALVYHYDNEPGGKLLVAEVPK 275 (299) T ss_pred CCC---CCCCCCCCCEECCCCCCEEEEECCCCCCEEEEEECCC T ss_conf 889---8778864656579985379983699974599997788 No 49 >PRK12291 apolipoprotein N-acyltransferase; Reviewed Probab=99.87 E-value=3.7e-20 Score=151.05 Aligned_cols=197 Identities=15% Similarity=0.156 Sum_probs=145.6 Q ss_pred CCEEEEEEECCCCCC------CHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 862899996388368------89999999999999999859989991664447998445414889999999999999999 Q gi|254780999|r 3 KKLKIAIAQLNPVVG------DIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSD 76 (562) Q Consensus 3 ~~mkIal~Q~n~~~g------d~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~ 76 (562) +.++|+++|.|.... ..+.+.++..+.++++.+.++||||+||.+++.|... ....++.+.+. T Consensus 195 ~~~~V~lVQ~NI~q~~Kw~~~~~~~~~~~~~~l~~~~~~~~~dLiIwPEta~P~~~~~-----------~~~~~~~l~~~ 263 (420) T PRK12291 195 LPLNIELVNTNISQDLKWDKEYLKSIINENLKEIDKAIDEKKTLIVLPESAFPLFLNN-----------SPFLLDELKEL 263 (420) T ss_pred CCEEEEEECCCCCHHHHHCHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHH-----------CHHHHHHHHHH T ss_conf 8717999818989677519545999999999987532147998899657444666740-----------89999999852 Q ss_pred HHHCCCEEEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEECCCCCCC----------------CCCCCCCCCCCCEEEE Q ss_conf 887697899999998889878999999699799988777623554333----------------2100377888761476 Q gi|254780999|r 77 THDGGAGIVVGFPRQDQEGVLNSVVILDAGNIIAVRDKINLPNYSEFH----------------EKRTFISGYSNDPIVF 140 (562) Q Consensus 77 ~~~~~i~iivG~p~~~~~~~yNsa~vi~~G~il~~y~K~~LP~y~~Fd----------------E~r~F~~G~~~~~~~~ 140 (562) + .++.+++|.+..++++.|||++++++|+ ..+|+|+||-..|||- ...-|++|+....+.+ T Consensus 264 ~--~~~~ii~G~~~~e~~~~yNS~~~~~~~~-~~~YdK~~LVPFGEyiPf~~~l~~~~~~~~~~~~~~f~~G~~~~~~~~ 340 (420) T PRK12291 264 S--YKIAIIAGALRYENEHIYNSTYLFSKGN-VQIADKVILVPFGEEIPLPEFLKDPINKLFFGGASDFSKASNFSDFTL 340 (420) T ss_pred C--CCCEEEECCEECCCCCEEEEEEEECCCC-CCEEEEEECCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEE T ss_conf 6--7982997624424784566889966887-124744301265357657999999999971677666889999985547 Q ss_pred CCEEEEECCCCCCCHHHHHHHHHHHCCCCEEEECCC--CCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCC Q ss_conf 474562011111110168999997147775993155--533358078887889998543370464213568876621048 Q gi|254780999|r 141 RDIRLGILICEDIWKNSNICKHLKKQGAEFLFSLNA--SPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDELIFDG 218 (562) Q Consensus 141 ~g~kiGv~ICeDlw~~~~~~~~la~~Gadlii~psA--Sp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G 218 (562) ++.+++.+||||.-+++ ...+++|+++|+|- |--...-..+..++.+.||.|++.|++-+.. .| T Consensus 341 ~~~~~~~~ICYE~~~~~-----~~~~~~~~lvniSNDaWFg~S~gP~Qh~~~a~~RAiE~g~~i~rsaN---------~G 406 (420) T PRK12291 341 DGVKFRNAICYEATSEE-----LYEGPSKIIIAISNNAWFVPSIEPTLQKLLLKYYARKYGKTIYHAAN---------GS 406 (420) T ss_pred CCEEEEEEEEEEECCHH-----HHCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECC---------CE T ss_conf 89179989847766734-----43378988999510432589626999999999999970998899838---------80 Q ss_pred CCEEECCCC Q ss_conf 643555898 Q gi|254780999|r 219 ASFCFDGQQ 227 (562) Q Consensus 219 ~S~I~d~~G 227 (562) -|.|+||+. T Consensus 407 iSaiI~Pk~ 415 (420) T PRK12291 407 PSEIITPKK 415 (420) T ss_pred EEEEECHHH T ss_conf 349998244 No 50 >KOG0808 consensus Probab=99.85 E-value=1.5e-20 Score=153.69 Aligned_cols=244 Identities=15% Similarity=0.189 Sum_probs=181.7 Q ss_pred CEEEEEEECCCC-------CCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCC-CH-HHHHCCHHHHHHHH--HHHHH Q ss_conf 628999963883-------688999999999999999985998999166444799-84-45414889999999--99999 Q gi|254780999|r 4 KLKIAIAQLNPV-------VGDIAGNIAKARRAREEANRQGMDLILFTELFISGY-PP-EDLVFKKSFIQACS--SAIDT 72 (562) Q Consensus 4 ~mkIal~Q~n~~-------~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY-~~-~Dl~~~~~f~~~~~--~~l~~ 72 (562) -+|+++.|-... .....+-.+++...++.|+..|+.+++|.|-+-..+ +| ...+--..|.+.+. ..... T Consensus 73 ~vrvgliqn~i~lpttapv~eq~~aih~r~kaiieaaa~agvniiclqeawtmpfafctrerlpwtefaesv~~gptt~f 152 (387) T KOG0808 73 VVRVGLIQNSIALPTTAPVSEQTRAIHDRLKAIIEAAAVAGVNIICLQEAWTMPFAFCTRERLPWTEFAESVDTGPTTKF 152 (387) T ss_pred EEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEHHHCCCHHHHCCCCCCHHHHCCCCCCCCHHHH T ss_conf 79986520445688887377888999999999999987548657974012247413330344760543042444835799 Q ss_pred HHHHHHHCCCEEEEEEEEEE---CCEEEEEEEEEE-CCEEEEEEEEEEECCCCCCCCCCCCCCCCC-CCEEEECCEEEEE Q ss_conf 99998876978999999988---898789999996-997999887776235543332100377888-7614764745620 Q gi|254780999|r 73 LKSDTHDGGAGIVVGFPRQD---QEGVLNSVVILD-AGNIIAVRDKINLPNYSEFHEKRTFISGYS-NDPIVFRDIRLGI 147 (562) Q Consensus 73 La~~~~~~~i~iivG~p~~~---~~~~yNsa~vi~-~G~il~~y~K~~LP~y~~FdE~r~F~~G~~-~~~~~~~g~kiGv 147 (562) +.+.++++++.|+-...+++ ++.+.|++++|. +|.++++.||.|+|.-|.|.|..|+..|+- .++|++..+||++ T Consensus 153 lqklakkhdmvivspilerd~ehgdvlwntavvisn~g~vigk~rknhiprvgdfnestyymeg~lghpvfet~fgriav 232 (387) T KOG0808 153 LQKLAKKHDMVIVSPILERDIEHGDVLWNTAVVISNNGNVIGKHRKNHIPRVGDFNESTYYMEGDLGHPVFETVFGRIAV 232 (387) T ss_pred HHHHHHHCCEEEEEHHHHCCCCCCCEEEEEEEEECCCCCEECCCCCCCCCCCCCCCCCEEEEECCCCCCEEEEECCEEEE T ss_conf 99988657869971013145434724554469982687533321235698535667522675057798300221033788 Q ss_pred CCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHH--HHHHHHHHHHCCCEEEECCCCCCC----------C--- Q ss_conf 1111111016899999714777599315553335807888--788999854337046421356887----------6--- Q gi|254780999|r 148 LICEDIWKNSNICKHLKKQGAEFLFSLNASPYYHNKLKKR--HEIVTGQISHVHLPIIYVNQVGGQ----------D--- 212 (562) Q Consensus 148 ~ICeDlw~~~~~~~~la~~Gadlii~psASp~~~~k~~~R--~~~~~~~a~e~~~~vvy~N~vGg~----------d--- 212 (562) .|||---+|. -+-.+..+||+||+||||. .|....- ---.+..|+.|+++....|+||.+ + T Consensus 233 nicygrhhpl-nwlmy~lngaeiifnpsat---vgalseplwpiearnaaianh~ft~~inrvgtevfpneftsgdgkpa 308 (387) T KOG0808 233 NICYGRHHPL-NWLMYGLNGAEIIFNPSAT---VGALSEPLWPIEARNAAIANHYFTGSINRVGTEVFPNEFTSGDGKPA 308 (387) T ss_pred EEECCCCCCH-HHHHHHCCCCEEEECCCCC---CCCCCCCCCCHHHHHHHHHHCEEEEEECCCCCCCCCCCCCCCCCCCC T ss_conf 8622677735-6565414673599777532---00004766760231256542247876313434107886668999734 Q ss_pred --C-CCCCCCCEEECCCCEEEEECCCCCCCEEEEEEEECCCC Q ss_conf --6-21048643555898599961677763799999833521 Q gi|254780999|r 213 --E-LIFDGASFCFDGQQQLAFQMKHFSEQNFMTEWHYDQQL 251 (562) Q Consensus 213 --~-lvf~G~S~I~d~~G~ii~~~~~f~e~~~i~d~d~d~~~ 251 (562) + --|.|+|.+..|++.-.-.+++..++++++++|+.... T Consensus 309 h~dfghfygssy~aapd~srtp~lsr~rdgllia~ldlnlcr 350 (387) T KOG0808 309 HNDFGHFYGSSYFAAPDASRTPSLSRYRDGLLIADLDLNLCR 350 (387) T ss_pred CCCCCCCCCCEEEECCCCCCCCCCCCCCCCEEEEECCHHHHH T ss_conf 343540114301326887778650000365477614468899 No 51 >cd01990 Alpha_ANH_like_I This is a subfamily of Adenine nucleotide alpha hydrolases superfamily. Adenine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins probably binds ATP. This domain is about 200 amino acids long with a strongly conserved motif SGGKD at the N terminus. Probab=99.54 E-value=3.4e-14 Score=112.01 Aligned_cols=161 Identities=27% Similarity=0.348 Sum_probs=111.8 Q ss_pred EEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCC Q ss_conf 57521587236899999984149544679732466665221356999999721354267758999998863132346874 Q gi|254780999|r 295 VIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVNHFFSLMSQFLQEEP 374 (562) Q Consensus 295 ~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~~~~~~~~ 374 (562) |+|++|||+||+++|++|.+++| +++++|+.-+...++...+.|+.+|+.+|+++.+++.+.+.... +...+ T Consensus 1 V~vA~SGGVDSs~la~la~~alG-~~v~aV~~~s~~~~~~E~e~a~~~a~~~gi~~~~i~~~~l~~~~-------~~~N~ 72 (202) T cd01990 1 VAVAFSGGVDSTLLLKAAVDALG-DRVLAVTATSPLFPRRELEEAKRLAKEIGIRHEVIETDELDDPE-------FAKNP 72 (202) T ss_pred CEEECCCCHHHHHHHHHHHHHHC-CCEEEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCHHHHHH-------HCCCC T ss_conf 98982385999999999999768-78799997279899789999999998639857997341110165-------54597 Q ss_pred CCHHHHCCCCHHHHHHHHHHHHHCCC--EECCCCCHHHH---HHHHCCCCCCCCCCCCCCCC--CCHHHHHHHHHHHHHC Q ss_conf 31011012330466779888751695--44364324477---54110236665554010177--6336589999996310 Q gi|254780999|r 375 SGIVAENIQSRIRGNILMALSNHSKA--MLLTTSNKSEI---SVGYGTLYGDMSGGFNPLKD--LYKTQVFQLASWRNSH 447 (562) Q Consensus 375 ~~~~~eN~qaR~R~~~L~~~an~~~~--lvl~t~nksE~---~~Gy~T~~GD~~g~~~p~~~--l~Kt~v~~la~~~n~~ 447 (562) .+..+- =-+.....+...|.++|. ++-|| |.++. --|.-.. ...|-.+|+.+ ++|.+|++|+++++ T Consensus 73 ~~RCy~--CK~~l~~~l~~~a~~~g~~~v~dGt-n~dDl~d~RPGl~A~--~e~~v~sPL~e~gl~K~eVR~la~~lg-- 145 (202) T cd01990 73 PDRCYL--CKKALYEALKEIAEELGLDVVLDGT-NADDLGDYRPGLKAL--RELGVRSPLAEAGLGKAEIRELARELG-- 145 (202) T ss_pred HHHHHH--HHHHHHHHHHHHHHHCCCCEEEECC-CCCCCCCCCCCHHHH--HHCCCCCCHHHCCCCHHHHHHHHHHCC-- T ss_conf 556229--9899999999999967997894157-576323557868899--876998852651978899999999839-- Q ss_pred CCCCCCCCCCCCCCHHHCCCCCCCCCC---C-C-CCCCCCCC Q ss_conf 332223444466887882889884538---9-9-97700068 Q gi|254780999|r 448 GITSGLGPLTEVIPPSILEKSPSAELR---P-H-QTDQESLP 484 (562) Q Consensus 448 ~~~~~~~~~~~~ip~~i~~~~psaeL~---~-~-Q~ded~l~ 484 (562) +| +.++||+|-|. | + ..+++.|- T Consensus 146 ------------Lp--~~~kp~~~CLasRip~g~~it~e~L~ 173 (202) T cd01990 146 ------------LP--TWDKPAMACLASRIPYGTEITEERLK 173 (202) T ss_pred ------------CC--CCCCCCCCCEEEECCCCCCCCHHHHH T ss_conf ------------98--56689987546665788848999999 No 52 >cd01998 tRNA_Me_trans tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs. This family of enzyme only presents in bacteria and eukaryote. The archaeal counterpart of this enzyme performs same function, but is completely unrelated in sequence. Probab=99.39 E-value=2.9e-12 Score=99.45 Aligned_cols=147 Identities=27% Similarity=0.358 Sum_probs=96.8 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCC--------CCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHH--H Q ss_conf 0575215872368999999841495446797324666--------65221356999999721354267758999998--8 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKY--------TSPQSLEDAAACAKALGCKYDVLPIHDLVNHF--F 363 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~--------~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~--~ 363 (562) +|+||||||+|||++|+|..++ |-+ |.|++|---. +++....||+.+|+.|||+|.++++++.+... . T Consensus 1 kV~v~mSGGVDSsvaA~LL~~~-G~~-V~gv~m~~w~~~~~~~~C~~~~d~~dA~~va~~LgIp~~v~d~~~ef~~~V~~ 78 (349) T cd01998 1 KVVVAMSGGVDSSVAAALLKEQ-GYE-VIGVFMKNWDEDDGKGGCCSEEDLKDARRVADQLGIPHYVVNFEKEYWEKVFE 78 (349) T ss_pred CEEEECCCCHHHHHHHHHHHHC-CCC-EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECHHHHHHHHHHH T ss_conf 9899907789999999999877-995-79999996788766789886778999999999869967996809988688899 Q ss_pred HHHCCCCCCCCC--CHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCHHHHHH---H-HCCCCC-C--------------- Q ss_conf 631323468743--10110123304667798887516954436432447754---1-102366-6--------------- Q gi|254780999|r 364 SLMSQFLQEEPS--GIVAENIQSRIRGNILMALSNHSKAMLLTTSNKSEISV---G-YGTLYG-D--------------- 421 (562) Q Consensus 364 ~~~~~~~~~~~~--~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nksE~~~---G-y~T~~G-D--------------- 421 (562) ..+.....+..+ ++.= | .++.--.|+..|.+.|.=-+.||.=.-... | |.-+-| | T Consensus 79 ~f~~~Y~~G~TPNPcv~C-N--~~IKF~~l~~~A~~~g~d~iATGHYAri~~~~~~~~~L~r~~D~~KDQSYfL~~l~~~ 155 (349) T cd01998 79 PFLEEYKKGRTPNPDILC-N--KEIKFGALLDYAKKLGADYIATGHYARIEEDNNGRYRLLRGVDPNKDQSYFLSQLSQE 155 (349) T ss_pred HHHHHHHCCCCCCCCHHC-C--CEEEHHHHHHHHHHCCCCEECCCCEEEEEECCCCCEEEEECCCCCCCCCEEEECCCHH T ss_conf 999999748999876211-8--7351999999998759986413514788975898489987689987820564012487 Q ss_pred -CCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf -555401017763365899999963 Q gi|254780999|r 422 -MSGGFNPLKDLYKTQVFQLASWRN 445 (562) Q Consensus 422 -~~g~~~p~~~l~Kt~v~~la~~~n 445 (562) ..-.+-|+|+++|++|+++|+.++ T Consensus 156 ~L~~~~FPLG~~~K~eVR~iA~~~g 180 (349) T cd01998 156 QLSRLIFPLGDLTKPEVREIAKELG 180 (349) T ss_pred HHHHEECCCCCCCHHHHHHHHHHCC T ss_conf 6610387887865389999999859 No 53 >PRK00143 trmU tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase; Reviewed Probab=99.39 E-value=3.2e-12 Score=99.15 Aligned_cols=150 Identities=21% Similarity=0.278 Sum_probs=99.0 Q ss_pred CCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCC--------CCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHH Q ss_conf 454057521587236899999984149544679732466--------665221356999999721354267758999998 Q gi|254780999|r 291 NFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYK--------YTSPQSLEDAAACAKALGCKYDVLPIHDLVNHF 362 (562) Q Consensus 291 g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~--------~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~ 362 (562) .-++|+||+|||+|||++|+|..++ |- +|.|++|-.- -.+.....||+.+|+.|||+|.+++.++.+... T Consensus 3 sk~rV~VamSGGVDSsVaA~LL~~~-Gy-~V~Gv~m~~w~~~~~~~~C~~~~d~~dA~~va~~LgIp~~v~d~~~~f~~~ 80 (355) T PRK00143 3 SKKRVVVGMSGGVDSSVAAALLKEQ-GY-DVIGLFMKLWDDDDGSGGCCAEEDIADARAVADKLGIPLYVVDFAKEFWDN 80 (355) T ss_pred CCCEEEEEECCCHHHHHHHHHHHHC-CC-CEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECHHHHHHHH T ss_conf 8888999916789999999999977-99-589999988768877789985788999999999869857996869987678 Q ss_pred --HHHHCCCCCCCCC--CHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCHHHHHH--HHCCC-CC-C----C-------- Q ss_conf --8631323468743--10110123304667798887516954436432447754--11023-66-6----5-------- Q gi|254780999|r 363 --FSLMSQFLQEEPS--GIVAENIQSRIRGNILMALSNHSKAMLLTTSNKSEISV--GYGTL-YG-D----M-------- 422 (562) Q Consensus 363 --~~~~~~~~~~~~~--~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nksE~~~--Gy~T~-~G-D----~-------- 422 (562) ...+.....+..+ ++. =| .++.--.|+..|.+.|.=.+.||.=.-..- |-..+ -| | . T Consensus 81 Vi~~f~~~Y~~G~TPNPcv~-CN--~~IKFg~l~~~A~~lgad~iATGHYAri~~~~~~~~L~r~~D~~KDQSYfL~~l~ 157 (355) T PRK00143 81 VFDYFLDEYKAGRTPNPCVL-CN--KEIKFKAFLDYALELGADYIATGHYARIRDEDGRYELLRAVDPNKDQSYFLYQLT 157 (355) T ss_pred HHHHHHHHHHCCCCCCCCHH-CC--CCCCHHHHHHHHHHCCCCEECCCCEEEEEECCCEEEEEECCCCCCCCEEEEECCC T ss_conf 89999999976999997334-08--7233999999998739984233525999915981899867889887205761057 Q ss_pred ----CCCCCCCCCCCHHHHHHHHHHHH Q ss_conf ----55401017763365899999963 Q gi|254780999|r 423 ----SGGFNPLKDLYKTQVFQLASWRN 445 (562) Q Consensus 423 ----~g~~~p~~~l~Kt~v~~la~~~n 445 (562) .-.+-|+|+++|++|+++|+-++ T Consensus 158 ~~~L~~~~FPLG~~~K~eVR~iA~~~g 184 (355) T PRK00143 158 QEQLAKLLFPLGELTKPEVREIAAEIG 184 (355) T ss_pred HHHHHHEECCCCCCCHHHHHHHHHHCC T ss_conf 988742075876773589999999758 No 54 >COG1606 ATP-utilizing enzymes of the PP-loop superfamily [General function prediction only] Probab=99.38 E-value=4.8e-12 Score=97.95 Aligned_cols=150 Identities=23% Similarity=0.371 Sum_probs=111.3 Q ss_pred HHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHH Q ss_conf 58887642245405752158723689999998414954467973246666522135699999972135426775899999 Q gi|254780999|r 282 SLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVNH 361 (562) Q Consensus 282 gl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~ 361 (562) .|+.|++..+ +++|+-|||+||+++|.+|++|+| +||.+++.-+.+....+.++|+..|+.+|++|..+++..+- T Consensus 9 ~l~~~ik~~~--kv~vAfSGGvDSslLa~la~~~lG-~~v~AvTv~sP~~p~~e~e~A~~~A~~iGi~H~~i~~~~~~-- 83 (269) T COG1606 9 RLKKAIKEKK--KVVVAFSGGVDSSLLAKLAKEALG-DNVVAVTVDSPYIPRREIEEAKNIAKEIGIRHEFIKMNRMD-- 83 (269) T ss_pred HHHHHHHHCC--EEEEEECCCCCHHHHHHHHHHHHC-CCEEEEEEECCCCCHHHHHHHHHHHHHHCCCCEEEEHHHCC-- T ss_conf 9999986437--399996588427999999999735-64699997168777666067899999948762465500025-- Q ss_pred HHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCC-EECCCCCHHHHH---HHHCCCCCCCCCCCCCCC--CCCHH Q ss_conf 886313234687431011012330466779888751695-443643244775---411023666555401017--76336 Q gi|254780999|r 362 FFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSKA-MLLTTSNKSEIS---VGYGTLYGDMSGGFNPLK--DLYKT 435 (562) Q Consensus 362 ~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~-lvl~t~nksE~~---~Gy~T~~GD~~g~~~p~~--~l~Kt 435 (562) +.+.+.+.+.. ++=-|..-..|-..|-+.|+ .|+.-.|.|++- =||-..+-. |-.+|+. ++.|. T Consensus 84 ------~~~~~n~~~rC--Y~CK~~v~~~l~~~a~~~Gyd~V~dGtNasDl~~~RPG~rA~kE~--gi~sPl~e~gitk~ 153 (269) T COG1606 84 ------PEFKENPENRC--YLCKRAVYSTLVEEAEKRGYDVVADGTNASDLFDYRPGLRALKEL--GIRSPLAEFGITKK 153 (269) T ss_pred ------HHHCCCCCCCC--HHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHCCCCCCHHHHHHC--CCCCHHHHHCCCHH T ss_conf ------23304998735--577899999999999973998897477578733788302357760--77771888397599 Q ss_pred HHHHHHHHHHH Q ss_conf 58999999631 Q gi|254780999|r 436 QVFQLASWRNS 446 (562) Q Consensus 436 ~v~~la~~~n~ 446 (562) +++++|+.+|- T Consensus 154 eIre~a~~lgl 164 (269) T COG1606 154 EIREIAKSLGL 164 (269) T ss_pred HHHHHHHHCCC T ss_conf 99999997599 No 55 >pfam03054 tRNA_Me_trans tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs. Probab=99.34 E-value=9.4e-12 Score=96.05 Aligned_cols=147 Identities=24% Similarity=0.312 Sum_probs=93.8 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCC---------CCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHH-- Q ss_conf 0575215872368999999841495446797324666---------65221356999999721354267758999998-- Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKY---------TSPQSLEDAAACAKALGCKYDVLPIHDLVNHF-- 362 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~---------~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~-- 362 (562) +|+||+|||+||||+|+|..++ |- .|.|++|---. .++....||+..|+.|||+|.+++.++.+... T Consensus 2 kV~VamSGGVDSsVaA~LL~~~-Gy-~V~Gv~m~~w~~~~~~~~~C~~~~d~~dA~~va~~LgIp~~v~d~~~~f~~~Vi 79 (354) T pfam03054 2 KVVVAMSGGVDSSVAAYLLKEQ-GY-EVIGVFMKNWDEEDEFGHGCCSEEDLADAQRVCEQLGIPLYVVNFEKEYWEKVF 79 (354) T ss_pred EEEEEECCCHHHHHHHHHHHHC-CC-CEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECHHHHHHHHHH T ss_conf 6999904789999999999977-99-639999995668876678899889999999999986997899787998758999 Q ss_pred HHHHCCCCCCCCC--CHHHHCCCCHHHHHHHHHHHHH-CCCEECCCCCHHHHHHH---HCCCCC-C-------------- Q ss_conf 8631323468743--1011012330466779888751-69544364324477541---102366-6-------------- Q gi|254780999|r 363 FSLMSQFLQEEPS--GIVAENIQSRIRGNILMALSNH-SKAMLLTTSNKSEISVG---YGTLYG-D-------------- 421 (562) Q Consensus 363 ~~~~~~~~~~~~~--~~~~eN~qaR~R~~~L~~~an~-~~~lvl~t~nksE~~~G---y~T~~G-D-------------- 421 (562) ...+.....+..+ ++. =| .++.--.|+..|.+ .|.=.+.||.=.-...+ |.-+-| | T Consensus 80 ~~fv~~Y~~G~TPNPcv~-CN--~~IKFg~l~~~A~~~lGad~iATGHYAri~~~~~~~~L~r~~D~~KDQSYfL~~l~q 156 (354) T pfam03054 80 EPFLDEYKNGRTPNPDIL-CN--REIKFGALLDYAKQVLGADYLATGHYARVSLNENESELLRGKDKNKDQSYFLSGLSQ 156 (354) T ss_pred HHHHHHHHCCCCCCCCHH-HC--CCCCHHHHHHHHHHHCCCCCCCCCCEEEEEECCCEEEEEECCCCCCCCCEEEECCCH T ss_conf 999999976999984144-18--964389999999986699810447428999049938999899999980178610789 Q ss_pred --CCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf --555401017763365899999963 Q gi|254780999|r 422 --MSGGFNPLKDLYKTQVFQLASWRN 445 (562) Q Consensus 422 --~~g~~~p~~~l~Kt~v~~la~~~n 445 (562) ..-.+-|+|+++|++|+++|+-++ T Consensus 157 ~~L~~~lFPLG~~~K~eVR~iA~~~g 182 (354) T pfam03054 157 EQLEKLLFPLGDLTKEEVRKIAKEAG 182 (354) T ss_pred HHHHHEECCCCCCCHHHHHHHHHHCC T ss_conf 87643182876885289999999769 No 56 >PRK00074 guaA GMP synthase; Reviewed Probab=99.27 E-value=8.6e-11 Score=89.78 Aligned_cols=162 Identities=20% Similarity=0.282 Sum_probs=96.0 Q ss_pred HCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCC-CCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHC Q ss_conf 224540575215872368999999841495446797324666652-2135699999972135426775899999886313 Q gi|254780999|r 289 KNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSP-QSLEDAAACAKALGCKYDVLPIHDLVNHFFSLMS 367 (562) Q Consensus 289 ~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~-~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~ 367 (562) +-|-++|++|||||+||+|+|+|..+|+| +|++|+..-.-.=.+ ++.+--+.+.+.+|++...++=. +.|++.|. T Consensus 213 ~Vg~~kVi~~lSGGVDStV~A~Ll~kAIG-d~l~cvfVD~GllRknE~~~V~~~~~~~lgl~~~~vdA~---~~Fl~~L~ 288 (513) T PRK00074 213 QVGDKKVILGLSGGVDSSVAAVLLHKAIG-DQLTCVFVDHGLLRKNEAEQVMEMFREHFGLNLIHVDAS---DRFLSALA 288 (513) T ss_pred HHCCCEEEEEECCCCCHHHHHHHHHHHHH-HCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCEEEEEHH---HHHHHHHC T ss_conf 83876289972388308999999999851-124899953771116729999999988619967992248---99999737 Q ss_pred CCCCCCCCCHHHHCCCCHHHHHHHHHHHHHC---CCEECCCC--CHHHHHHH------HCC------CCCCCC-CCCCCC Q ss_conf 2346874310110123304667798887516---95443643--24477541------102------366655-540101 Q gi|254780999|r 368 QFLQEEPSGIVAENIQSRIRGNILMALSNHS---KAMLLTTS--NKSEISVG------YGT------LYGDMS-GGFNPL 429 (562) Q Consensus 368 ~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~---~~lvl~t~--nksE~~~G------y~T------~~GD~~-g~~~p~ 429 (562) ...+.+..--.-++.= ..+.-..|.+. .+|+-||- |--|.+-| --| +.-++. .-+.|+ T Consensus 289 gv~DPE~KRKIIG~~F----I~vfe~~a~~~~~~~~L~QGTlYPDvIES~~~~~~a~~IKsHHNVgglp~~~~~~lvEPl 364 (513) T PRK00074 289 GVTDPEEKRKIIGREF----IEVFEEEAKKLGGVKFLAQGTLYPDVIESASGTGKAATIKSHHNVGGLPEDMKLKLVEPL 364 (513) T ss_pred CCCCHHHHHHHHHHHH----HHHHHHHHHHCCCCCEEECCCCCCCEEEECCCCCCCCCCCCCCCCCCCHHHCCCEEEEEH T ss_conf 9989799678877899----999999997579983996467655068847989998860487758788566487545031 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCC Q ss_conf 7763365899999963103322234444668878828898845 Q gi|254780999|r 430 KDLYKTQVFQLASWRNSHGITSGLGPLTEVIPPSILEKSPSAE 472 (562) Q Consensus 430 ~~l~Kt~v~~la~~~n~~~~~~~~~~~~~~ip~~i~~~~psae 472 (562) .+|+|-+||.|.+.++ +|++++.+.|=|. T Consensus 365 r~lfKDEVR~lg~~Lg--------------lp~~~~~RhPFPG 393 (513) T PRK00074 365 RELFKDEVRKLGLELG--------------LPEEIVYRHPFPG 393 (513) T ss_pred HHHCCHHHHHHHHHHC--------------CCHHHHCCCCCCC T ss_conf 7561589999999968--------------9777602489999 No 57 >PRK00919 GMP synthase subunit B; Validated Probab=99.25 E-value=1.1e-10 Score=89.17 Aligned_cols=169 Identities=21% Similarity=0.283 Sum_probs=95.3 Q ss_pred CHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHH Q ss_conf 02035888764224540575215872368999999841495446797324666652213569999997213542677589 Q gi|254780999|r 278 ACVLSLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHD 357 (562) Q Consensus 278 Alvlgl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~ 357 (562) ..+--+|+ +-|-++|+++||||+||+|+|+|..+|+| +|++|+..-.-.-.....+.-....+. |++...++ T Consensus 9 ~~i~~Ir~---~Vg~~kvi~~lSGGVDStV~A~Ll~kAig-~~l~~v~VD~GllR~~E~~~V~~~~~~-~~~l~~vd--- 80 (306) T PRK00919 9 EAIEEIRE---EVGDGKAIIALSGGVDSSVAAVLAHRAIG-DRLLAVYVDTGLMRKGETERIREIFKD-GLNLRIVD--- 80 (306) T ss_pred HHHHHHHH---HHCCCCEEEEECCCCHHHHHHHHHHHHHH-CCEEEEEECCCCCCCCCHHHHHHHHHH-CCCCEEEE--- T ss_conf 99999999---84898599991688479999999998864-265999986898879869999999983-58928983--- Q ss_pred HHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCC--CEECCCC--CHHHHHHHHCCCCCC------CC-CCC Q ss_conf 999988631323468743101101233046677988875169--5443643--244775411023666------55-540 Q gi|254780999|r 358 LVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSK--AMLLTTS--NKSEISVGYGTLYGD------MS-GGF 426 (562) Q Consensus 358 ~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~--~lvl~t~--nksE~~~Gy~T~~GD------~~-g~~ 426 (562) +-+.|++.|....+.+..-..-++.=- .+.-..+.+.| +|+-||- +--|.+-+--|-+-+ +. --+ T Consensus 81 a~~~Fl~~L~gv~DPE~KRKiIG~~Fi----~vfe~~~~~~~~~~LaQGTlyPDvIES~~~IKSHHNVggLp~~m~~kli 156 (306) T PRK00919 81 ASDRFLEALKGVTDPEEKRKIIGETFI----RVFEEVAKEIGAEYLVQGTIAPDWIESEGGIKSHHNVGGLPEGMVLKIV 156 (306) T ss_pred HHHHHHHHHCCCCCHHHCCCHHHHHHH----HHHHHHHHHHCCCEEEECEECCCHHHCCCCCCCCCCCCCCHHHCCCCCC T ss_conf 499999973799895674602579999----9999999982987685110737602046764341668877332356310 Q ss_pred CCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCC Q ss_conf 1017763365899999963103322234444668878828898845 Q gi|254780999|r 427 NPLKDLYKTQVFQLASWRNSHGITSGLGPLTEVIPPSILEKSPSAE 472 (562) Q Consensus 427 ~p~~~l~Kt~v~~la~~~n~~~~~~~~~~~~~~ip~~i~~~~psae 472 (562) .|+.+|+|-+|++|.+.++ +|++++.+.|=|. T Consensus 157 EPLr~LfKDEVR~lG~~LG--------------lp~~iv~RhPFPG 188 (306) T PRK00919 157 EPLRDLYKDEVREVARALG--------------LPEEISERMPFPG 188 (306) T ss_pred CHHHHHHHHHHHHHHHHHC--------------CCHHHCCCCCCCC T ss_conf 3189985599999999969--------------7888704589999 No 58 >TIGR00268 TIGR00268 conserved hypothetical protein TIGR00268; InterPro: IPR005232 This family of conserved hypothetical proteins has no known function but is predicted to be part of the wider PP-loop superfamily .. Probab=99.22 E-value=4.4e-11 Score=91.69 Aligned_cols=213 Identities=23% Similarity=0.342 Sum_probs=135.2 Q ss_pred HHHHHHH-CCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCC-CCHHHHHHHHHHHCCCCCCCCHHHHH- Q ss_conf 8887642-24540575215872368999999841495446797324666652-21356999999721354267758999- Q gi|254780999|r 283 LRDYVQK-NNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSP-QSLEDAAACAKALGCKYDVLPIHDLV- 359 (562) Q Consensus 283 l~d~~~~-~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~-~s~~~a~~la~~lg~~~~~i~I~~~~- 359 (562) |+.++.. -.-++++|.-||||||+++|++|.+.+|. +++||+.-|+.-|+ .+++||+..|+.+|+.|..|.|+.+. T Consensus 5 L~~fl~~kf~nk~~~IAySGGvDS~lla~v~~~v~g~-~~lait~~sP~~sp~~el~~A~~~A~~~g~~he~~~~d~~~n 83 (263) T TIGR00268 5 LRNFLKEKFKNKKVLIAYSGGVDSSLLAAVCSDVAGT-EVLAITVVSPSISPRRELEDAKAIAKEIGVKHELVKIDKMAN 83 (263) T ss_pred HHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHH-CEEEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCC T ss_conf 8889876764081699951746589999999875310-113578762775735448999999988083012110012368 Q ss_pred HHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCC-EECCCCCHHHHH---HHHCCCCCCCCCCCCCCCC--CC Q ss_conf 99886313234687431011012330466779888751695-443643244775---4110236665554010177--63 Q gi|254780999|r 360 NHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSKA-MLLTTSNKSEIS---VGYGTLYGDMSGGFNPLKD--LY 433 (562) Q Consensus 360 ~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~-lvl~t~nksE~~---~Gy~T~~GD~~g~~~p~~~--l~ 433 (562) ..| .....+..| ==+ +.=-..|...|.+.|+ .|+-=.|-|.+. =||-.. -.+.|.++|+++ ++ T Consensus 84 ~~f--------~~N~~~RCY-~CK-~~~~~~L~~~a~~~gy~~V~dGtN~dDL~~~RPG~~A~-~E~~g~~SP~aef~I~ 152 (263) T TIGR00268 84 PNF--------RANVEERCY-FCK-KKVLSILVKLAEKRGYDVVVDGTNADDLEDHRPGLRAV-KELNGVYSPWAEFGIT 152 (263) T ss_pred CCC--------CCCCCCCCH-HHH-HHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCHHHHH-HHCCCCCCCCCCCCCC T ss_conf 500--------168854441-548-88999989999863995798234620002367513567-6607886872002568 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCCCCC----CC-CCCCCCCCHHHHH-HHHHHHHHCCC---CHH Q ss_conf 365899999963103322234444668878828898845389----99-7700068878899-99999982799---977 Q gi|254780999|r 434 KTQVFQLASWRNSHGITSGLGPLTEVIPPSILEKSPSAELRP----HQ-TDQESLPPYPILD-DIIKRIVENEE---SFI 504 (562) Q Consensus 434 Kt~v~~la~~~n~~~~~~~~~~~~~~ip~~i~~~~psaeL~~----~Q-~ded~l~~Y~~lD-~il~~~i~~~~---~~~ 504 (562) |.+|++.|+-+|- ++.+||..|=|.- +. -|+..|---+.+. .+|+.+..+.- +-. T Consensus 153 K~eir~ia~~lg~----------------~~~DKP~~~CL~sR~p~g~EI~~ekL~~v~eaE~~~l~~~g~~qvRVR~y~ 216 (263) T TIGR00268 153 KKEIREIAKSLGL----------------SFYDKPSEACLASRFPFGEEIDVEKLKKVDEAEEIVLRRAGVKQVRVRNYD 216 (263) T ss_pred HHHHHHHHHHCCC----------------CCCCCCCCCHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEECCC T ss_conf 7999999997488----------------988886621021047687510489877699999999998356605765267 Q ss_pred HHH-HCCCHHHHHHHHHHHH Q ss_conf 987-1899999999999998 Q gi|254780999|r 505 NND-QEYNDETVRYVEHLLY 523 (562) Q Consensus 505 ~~~-~~~~~~~v~~~~~~~~ 523 (562) .++ =+..+|.+.++++-.. T Consensus 217 nlAviE~~~~~l~kl~~~~~ 236 (263) T TIGR00268 217 NLAVIEVAEDELSKLLNEAE 236 (263) T ss_pred CEEEEEECHHHHHHHHHHHH T ss_conf 65799628588999986313 No 59 >COG0482 TrmU Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain [Translation, ribosomal structure and biogenesis] Probab=99.17 E-value=2.5e-10 Score=86.73 Aligned_cols=150 Identities=23% Similarity=0.300 Sum_probs=93.1 Q ss_pred CCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCC-------CCCCCHHHHHHHHHHHCCCCCCCCHHHHHHH-- Q ss_conf 4540575215872368999999841495446797324666-------6522135699999972135426775899999-- Q gi|254780999|r 291 NFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKY-------TSPQSLEDAAACAKALGCKYDVLPIHDLVNH-- 361 (562) Q Consensus 291 g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~-------~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~-- 361 (562) ...+|++|+|||+||||+|.|..+. |- +|.|++|-.-. +|.....||+..|+.|||.+.++++++-+.. T Consensus 2 ~~~kV~v~mSGGVDSSVaA~lLk~Q-Gy-eViGl~m~~~~~~~~~~C~s~~d~~da~~va~~LGIp~~~vdf~~~y~~~V 79 (356) T COG0482 2 KKKKVLVGMSGGVDSSVAAYLLKEQ-GY-EVIGLFMKNWDEDGGGGCCSEEDLRDAERVADQLGIPLYVVDFEKEFWNKV 79 (356) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHC-CC-EEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECHHHHHHHHH T ss_conf 8767999905777899999999976-97-499999996414788867725678999999998099559984689988888 Q ss_pred HHHHHCCCCCCCCC--CHHHHCCCCHHHHHHHHHHHHHCCCEECCCCC---------HHHHHHH--------HCCCCC-- Q ss_conf 88631323468743--10110123304667798887516954436432---------4477541--------102366-- Q gi|254780999|r 362 FFSLMSQFLQEEPS--GIVAENIQSRIRGNILMALSNHSKAMLLTTSN---------KSEISVG--------YGTLYG-- 420 (562) Q Consensus 362 ~~~~~~~~~~~~~~--~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~n---------ksE~~~G--------y~T~~G-- 420 (562) +...+.....+..+ ++.= |=. +-=-.+.+.|.+.|+=.+.||. +.++..| |+-... T Consensus 80 ~~~f~~~Y~~G~TPNPci~C-N~~--iKF~~~l~~a~~lgad~iATGHYar~~~~~~~~~l~r~~D~~KDQsYfL~~~~~ 156 (356) T COG0482 80 FEYFLAEYKAGKTPNPCILC-NKE--IKFKALLDYAKELGADYIATGHYARQREDEGIELLLRGVDLNKDQSYFLYALSQ 156 (356) T ss_pred HHHHHHHHHCCCCCCCCHHC-CHH--HHHHHHHHHHHHCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHEECCCCH T ss_conf 88778998579999965104-878--899999999997599857874467654377600014678742030002012688 Q ss_pred C-CCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 6-555401017763365899999963 Q gi|254780999|r 421 D-MSGGFNPLKDLYKTQVFQLASWRN 445 (562) Q Consensus 421 D-~~g~~~p~~~l~Kt~v~~la~~~n 445 (562) + ..--+-|+|++.|++||.+|...+ T Consensus 157 ~ql~~~lFPlG~l~K~evR~iA~~~g 182 (356) T COG0482 157 EQLERLLFPLGDLEKLEVRPIAAEKG 182 (356) T ss_pred HHHHHCCCCCCCCCHHHHHHHHHHCC T ss_conf 77753225677777789999999759 No 60 >TIGR00884 guaA_Cterm GMP synthase, C-terminal domain; InterPro: IPR001674 The amidotransferase family of enzymes utilises the ammonia derived from the hydrolysis of glutamine for a subsequent chemical reaction catalyzed by the same enzyme. The ammonia intermediate does not dissociate into solution during the chemical transformations . GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. The C-terminal domain is specific to the GMP synthases 6.3.5.2 from EC. In prokaryotes this domain mediates dimerisation. Eukaryotic GMP synthases are monomers. This domain in eukaryotes includes several large insertions that may form globular domains .; GO: 0003922 GMP synthase (glutamine-hydrolyzing) activity, 0005524 ATP binding, 0006164 purine nucleotide biosynthetic process, 0006177 GMP biosynthetic process. Probab=99.14 E-value=7.5e-10 Score=83.64 Aligned_cols=162 Identities=21% Similarity=0.289 Sum_probs=101.0 Q ss_pred HCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCC-CHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHC Q ss_conf 2245405752158723689999998414954467973246666522-135699999972135426775899999886313 Q gi|254780999|r 289 KNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQ-SLEDAAACAKALGCKYDVLPIHDLVNHFFSLMS 367 (562) Q Consensus 289 ~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~-s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~ 367 (562) +-|=.+++++||||+|||++|.|+.+|+| +|++||..=+-.=-.. .-+=-+.+-+.||++...|| +-+.|++.|+ T Consensus 13 ~VGD~~vi~ALSGGVDSsV~A~L~hrAIG-D~L~~vFVD~GLlR~gE~E~V~~~F~~~lg~nl~~VD---A~e~FL~~L~ 88 (319) T TIGR00884 13 QVGDAKVIIALSGGVDSSVAAVLLHRAIG-DRLTCVFVDHGLLRKGEAERVVKTFSDKLGLNLVVVD---AKERFLSALK 88 (319) T ss_pred HCCCCEEEEEECCCHHHHHHHHHHHHHHC-CCEEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEC---CCHHHHHHCC T ss_conf 22884689981088158999999986422-7604898227888766378999998753089827876---1079887528 Q ss_pred CCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCC------CEECCCC--CHHHHH-H-HH---CCCCCCC-CC--------- Q ss_conf 23468743101101233046677988875169------5443643--244775-4-11---0236665-55--------- Q gi|254780999|r 368 QFLQEEPSGIVAENIQSRIRGNILMALSNHSK------AMLLTTS--NKSEIS-V-GY---GTLYGDM-SG--------- 424 (562) Q Consensus 368 ~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~------~lvl~t~--nksE~~-~-Gy---~T~~GD~-~g--------- 424 (562) -..+.+..=-- ==|.=-...=..|++.+ +|+-||- |--|.. . |- .|.=--| .| T Consensus 89 GV~DPE~KRKI----IG~~FI~VFE~~A~~~~~~~~a~yL~QGTlYPDvIES~~~kG~~~a~~IKsHHNVGGLP~~m~l~ 164 (319) T TIGR00884 89 GVTDPEEKRKI----IGRVFIEVFEREAEKIGDKKKAEYLVQGTLYPDVIESASSKGTSGAATIKSHHNVGGLPEDMKLK 164 (319) T ss_pred CCCCCHHHCCH----HHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCEEE T ss_conf 89870341311----22566777888998507997799997343447464602304776476456310668885426025 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCC Q ss_conf 401017763365899999963103322234444668878828898845 Q gi|254780999|r 425 GFNPLKDLYKTQVFQLASWRNSHGITSGLGPLTEVIPPSILEKSPSAE 472 (562) Q Consensus 425 ~~~p~~~l~Kt~v~~la~~~n~~~~~~~~~~~~~~ip~~i~~~~psae 472 (562) =+.|+-.|||=+||.|++.++ +|++|+.+.|=|. T Consensus 165 LvEPLR~LfKDEVR~lG~~LG--------------lP~ei~~R~PFPG 198 (319) T TIGR00884 165 LVEPLRELFKDEVRKLGKELG--------------LPEEIVWRHPFPG 198 (319) T ss_pred EECCCHHCCHHHHHHHHHHCC--------------CCHHHHCCCCCCC T ss_conf 875402302178999998718--------------8679964783686 No 61 >cd01997 GMP_synthase_C The C-terminal domain of GMP synthetase. It contains two subdomains; the ATP pyrophosphatase domain which closes to the N-termial and the dimerization domain at C-terminal end. The ATP-PPase is a twisted, five-stranded parallel beta-sheet sandwiched between helical layers. It has a signature nucleotide-binding motif, or P-loop, at the end of the first-beta strand.The dimerization domain formed by the C-terminal 115 amino acid for prokaryotic proteins. It is adjacent to teh ATP-binding site of the ATP-PPase subdomain. The largest difference between the primary sequence of prokaryotic and eukaryotic GMP synthetase map to the dimerization domain.Eukaryotic GMP synthetase has several large insertions relative to prokaryotes. Probab=99.13 E-value=3.8e-10 Score=85.58 Aligned_cols=157 Identities=23% Similarity=0.319 Sum_probs=92.1 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHH-HHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCC Q ss_conf 057521587236899999984149544679732466665221356-9999997213542677589999988631323468 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLED-AAACAKALGCKYDVLPIHDLVNHFFSLMSQFLQE 372 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~-a~~la~~lg~~~~~i~I~~~~~~~~~~~~~~~~~ 372 (562) +|++++|||+||+++|+|..+|+| ++++|+..-.-+=.....+. -+.+.+.++++...++-. +.|++.|....+. T Consensus 1 kVi~~lSGGVDStV~A~Ll~kAig-~~l~~vfVDnGlmRk~E~e~V~~~~~~~~~~~~~~vdas---~~Fl~~L~gv~DP 76 (295) T cd01997 1 KVILALSGGVDSTVAAVLLHKAIG-DRLTCVFVDNGLLRKNEAERVEELFSKLLGINLIVVDAS---ERFLSALKGVTDP 76 (295) T ss_pred CEEEEECCCHHHHHHHHHHHHHHH-CCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCEEEECHH---HHHHHHHCCCCCH T ss_conf 989990578289999999998865-056999985897888829999999987069977995669---9999986587792 Q ss_pred CCCCHHHHCCCCHHHHHHHHHHHHHCC---CEECCCC--CHHHHHHHH------CCCC------CCCC-CCCCCCCCCCH Q ss_conf 743101101233046677988875169---5443643--244775411------0236------6655-54010177633 Q gi|254780999|r 373 EPSGIVAENIQSRIRGNILMALSNHSK---AMLLTTS--NKSEISVGY------GTLY------GDMS-GGFNPLKDLYK 434 (562) Q Consensus 373 ~~~~~~~eN~qaR~R~~~L~~~an~~~---~lvl~t~--nksE~~~Gy------~T~~------GD~~-g~~~p~~~l~K 434 (562) +..-..-++. -..+.-..|.+.+ +|+-||- +--|.+-+- -|-+ -++. .-+.|+.+|+| T Consensus 77 E~KRKiIG~~----Fi~vf~~~~~~~~~~~~L~QGTlyPDvIES~~~~~~a~~IKsHHNvggLp~~~~~kliEPLr~LfK 152 (295) T cd01997 77 EEKRKIIGET----FIEVFEEEAKKLGLAEYLAQGTLYPDVIESGSGKGSADTIKSHHNVGGLPEDMKLKLIEPLRDLFK 152 (295) T ss_pred HHHHHHHHHH----HHHHHHHHHHHCCCCCEEEECCCCCCEEECCCCCCCCCEEEEECCCCCCCHHCCCCCCHHHHHHHH T ss_conf 5636565389----999999999755886189724302625750788888751452245677723226540016788745 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCC Q ss_conf 65899999963103322234444668878828898845 Q gi|254780999|r 435 TQVFQLASWRNSHGITSGLGPLTEVIPPSILEKSPSAE 472 (562) Q Consensus 435 t~v~~la~~~n~~~~~~~~~~~~~~ip~~i~~~~psae 472 (562) -+|++|.+.++ +|++++.+.|=|. T Consensus 153 DEVR~lG~~LG--------------lp~~iv~RhPFPG 176 (295) T cd01997 153 DEVRELGRELG--------------LPEEIVERHPFPG 176 (295) T ss_pred HHHHHHHHHHC--------------CCHHHEEECCCCC T ss_conf 99999999868--------------9788822079999 No 62 >TIGR00420 trmU tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase; InterPro: IPR004506 tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase (2.1.1.61 from EC) catalyses the addition of 5-methylaminomethyl-2-thiouridylate to tRNAs using S-adenosyl-L-methionine as a substrate and releasing S-adenosyl-L-homocysteine. The enzyme is cytoplasmic and is involved in tRNA processing.; GO: 0004808 tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase activity, 0008033 tRNA processing, 0005737 cytoplasm. Probab=99.13 E-value=5.1e-10 Score=84.70 Aligned_cols=146 Identities=25% Similarity=0.347 Sum_probs=88.5 Q ss_pred CEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCC--------------------CCCCCCCHHHHHHHHHHHCCCCCC Q ss_conf 405752158723689999998414954467973246--------------------666522135699999972135426 Q gi|254780999|r 293 HKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPY--------------------KYTSPQSLEDAAACAKALGCKYDV 352 (562) Q Consensus 293 ~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~--------------------~~~s~~s~~~a~~la~~lg~~~~~ 352 (562) ++|+||||||+||||+|++...- |. +|.||+|-. .=++.+...+|++.|+.|||+++. T Consensus 1 ~kv~VglSGGVDSSVsA~lL~~q-g~-~v~G~ylk~W~~~~~~d~~GH~~~N~~K~~C~~~~dl~~a~~ic~~LGi~l~~ 78 (394) T TIGR00420 1 KKVIVGLSGGVDSSVSAYLLKQQ-GY-EVVGVYLKNWDEDEKNDILGHKKINENKSGCSSAEDLRDAQAICEKLGIPLEK 78 (394) T ss_pred CEEEEECCCCHHHHHHHHHHHHC-CC-EEEEEEEEEEECCCCCCCCCHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCEEE T ss_conf 94899806854789999999714-96-58999877651246666443165624677883688799999999855981898 Q ss_pred CCHHHHHHH--HHHHHCCCCCCCC--CCH-HHHCCCCHHH-HHHHHHHHHH-C--CCEECCCCCHHHHH-H--------- Q ss_conf 775899999--8863132346874--310-1101233046-6779888751-6--95443643244775-4--------- Q gi|254780999|r 353 LPIHDLVNH--FFSLMSQFLQEEP--SGI-VAENIQSRIR-GNILMALSNH-S--KAMLLTTSNKSEIS-V--------- 413 (562) Q Consensus 353 i~I~~~~~~--~~~~~~~~~~~~~--~~~-~~eN~qaR~R-~~~L~~~an~-~--~~lvl~t~nksE~~-~--------- 413 (562) ++....+.. +...|...-.+.. +|+ . |= ++- |. +..++++ . |.=.+.||.=.-.. . T Consensus 79 ~nf~~~Y~~~Vf~~~i~~y~~G~TPnPDi~C--N~--~iKFG~-~~e~~~~~~Gtg~~~~aTGHYAr~~q~~~~~~~~~L 153 (394) T TIGR00420 79 VNFQKEYWNKVFEPFIQEYKEGLTPNPDILC--NK--LIKFGL-LLEYAKQLLGTGNDKIATGHYARIAQEIENKSLFEL 153 (394) T ss_pred ECHHHHHHHHHHHHHHHHHHCCCCCCCCCCC--CC--CCCHHH-HHHHHHHHCCCCCCEEECCHHHHHHHHHCCCCHHHH T ss_conf 5507887888889999987367777862004--71--200378-999999852899543442447654434213101444 Q ss_pred --H--------HCCC--C-CCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf --1--------1023--6-66555401017763365899999963 Q gi|254780999|r 414 --G--------YGTL--Y-GDMSGGFNPLKDLYKTQVFQLASWRN 445 (562) Q Consensus 414 --G--------y~T~--~-GD~~g~~~p~~~l~Kt~v~~la~~~n 445 (562) + ||=. - -=++.-+=|+|+|.|++||.||+..+ T Consensus 154 ~~~~D~~KDQSYFL~~l~~~~~~~~~FPlG~L~K~~vR~iA~~~~ 198 (394) T TIGR00420 154 LRALDKNKDQSYFLYHLSQEQLAKLLFPLGELLKPEVRQIAKNAG 198 (394) T ss_pred HCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCC T ss_conf 304678886117376401898887505301036778999998648 No 63 >TIGR00364 TIGR00364 exsB protein; InterPro: IPR004479 This protein family is represented by a single member in nearly every completed large (> 1000 genes) prokaryotic genome. In Sinorhizobium meliloti, a species in which the exo genes make succinoglycan, a symbiotically important exopolysaccharide, exsB is located nearby and affects succinoglycan levels, probably through polar effects on exsA expression or the same polycistronic mRNA , . In Arthrobacter viscosus, the homologous gene is designated alu1 and is associated with an aluminum tolerance phenotype. When expressed in Escherichia coli, it conferred aliminium tolerance . The entry also contains the gene queC, which is responsible for the conversion of GTP to 7-cyano-7-deazaguanine (preQ0). The biosynthesis of hypermodified tRNA nucleoside queuosine only occurs in eubacteria. It occupies the wobble position for all known tRNAs that are specific for Asp, Asn, His or Tyr .. Probab=98.98 E-value=3.5e-09 Score=79.23 Aligned_cols=150 Identities=25% Similarity=0.259 Sum_probs=97.5 Q ss_pred EEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCC-CCCCHHHHHHHHHHHCCCCC-CCCHHHHHHHHH-HHHCCCCC Q ss_conf 5752158723689999998414954467973246666-52213569999997213542-677589999988-63132346 Q gi|254780999|r 295 VIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYT-SPQSLEDAAACAKALGCKYD-VLPIHDLVNHFF-SLMSQFLQ 371 (562) Q Consensus 295 ~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~-s~~s~~~a~~la~~lg~~~~-~i~I~~~~~~~~-~~~~~~~~ 371 (562) .|+.+|||+|||+++++|.+..|+++|++|+--.-=- -..-++-|+.+|+.|||+++ .||++-+-+-.. +.|-..-. T Consensus 1 Av~~lSGG~DStT~~~~a~~~~GkyeV~a~TF~YGQR~H~~Ele~A~~ia~~Lgi~~~~~~Dl~~l~~l~~YsaLt~~~~ 80 (227) T TIGR00364 1 AVVVLSGGQDSTTVLLIALDEGGKYEVHAITFDYGQRAHSRELESARKIAEALGIRHHFVIDLSLLKQLGKYSALTDEQE 80 (227) T ss_pred CEEEEECCHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHCCCCEEEECHHHHHHHHCCCCCCCCCC T ss_conf 96874373468999999996179507998546501378999999999999980897078617799985540452357875 Q ss_pred C-----CCC--CHHHHCCCCH---HH----HHHHHHHHHHCCC--EECC---------CCCHHHHH------HHHC-CCC Q ss_conf 8-----743--1011012330---46----6779888751695--4436---------43244775------4110-236 Q gi|254780999|r 372 E-----EPS--GIVAENIQSR---IR----GNILMALSNHSKA--MLLT---------TSNKSEIS------VGYG-TLY 419 (562) Q Consensus 372 ~-----~~~--~~~~eN~qaR---~R----~~~L~~~an~~~~--lvl~---------t~nksE~~------~Gy~-T~~ 419 (562) . ... +-++|-+.+= =| ..+..-+|.+.|+ +.+| +.|+.|+. +-+. +.+ T Consensus 81 ~~~~~~~e~d~~~~~E~~p~tfVPgRN~~fl~~a~~yA~~~gA~~v~~G~~~~DfSgYPDCr~EFvka~n~~l~Lgn~~~ 160 (227) T TIGR00364 81 IPEQKSNEEDKEKALETLPNTFVPGRNLIFLSLAASYAEALGAEAVITGVCETDFSGYPDCRDEFVKALNHALNLGNMLT 160 (227) T ss_pred CCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 67666653202356531133337804799999999988861842522001000365798886889999999999741268 Q ss_pred CCCCCC-----CCCCCCCCHHHHHHHHHHHHH Q ss_conf 665554-----010177633658999999631 Q gi|254780999|r 420 GDMSGG-----FNPLKDLYKTQVFQLASWRNS 446 (562) Q Consensus 420 GD~~g~-----~~p~~~l~Kt~v~~la~~~n~ 446 (562) + .-- .+|+-+++|.+.++||..+++ T Consensus 161 ~--v~i~Piql~aPL~~ltKaeiv~la~elGk 190 (227) T TIGR00364 161 P--VEIRPIQLEAPLMDLTKAEIVKLADELGK 190 (227) T ss_pred C--CEEEEEEEECCEEECCHHHHHHHHHHHCC T ss_conf 8--11202466667012698999999887351 No 64 >KOG2805 consensus Probab=98.98 E-value=2.6e-09 Score=80.09 Aligned_cols=150 Identities=18% Similarity=0.219 Sum_probs=84.2 Q ss_pred CCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECC---------CCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHH-- Q ss_conf 4540575215872368999999841495446797324---------66665221356999999721354267758999-- Q gi|254780999|r 291 NFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLP---------YKYTSPQSLEDAAACAKALGCKYDVLPIHDLV-- 359 (562) Q Consensus 291 g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp---------~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~-- 359 (562) .+..|||++|||+|||++|.|.+.+ | -||.||.|- +.-..+....||+..|+.|+|.++.++...-. T Consensus 4 ~~~~VvvamSgGVDSsVaa~Ll~~~-g-~~v~gv~M~nWd~~de~~s~cp~e~D~~da~~Vc~~LnI~~~~Vnf~kEYW~ 81 (377) T KOG2805 4 KPDRVVVAMSGGVDSSVAARLLAAR-G-YNVTGVFMKNWDSLDEFGSQCPAERDWKDAKRVCKQLNIPLHQVNFVKEYWN 81 (377) T ss_pred CCCEEEEEECCCCHHHHHHHHHHHC-C-CCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHCCEEEEEEEHHHHHH T ss_conf 6654899953771189999999741-8-8716996622012223566898112089999999870970278743799999 Q ss_pred ---HHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCHHHHHHHHCC-----------------CC Q ss_conf ---9988631323468743101101233046677988875169544364324477541102-----------------36 Q gi|254780999|r 360 ---NHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSNKSEISVGYGT-----------------LY 419 (562) Q Consensus 360 ---~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nksE~~~Gy~T-----------------~~ 419 (562) +.++..-.... .+.+|+.- |-+-..-...-++. +..|+=-|+||.-.-..++-.+ ++ T Consensus 82 ~Vfs~~L~~Y~~G~-TPNPDI~C-N~~IKFg~~~~~a~-en~~~d~latGHYAr~~~~~~~~~~~~l~~~~d~~KDQt~F 158 (377) T KOG2805 82 DVFSPFLEEYENGR-TPNPDILC-NKHIKFGKFFKHAI-ENLGYDWLATGHYARVVLEDEDNAESHLLISKDMVKDQTYF 158 (377) T ss_pred HHHHHHHHHHHCCC-CCCCCCCC-CCEEECCHHHHHHH-HHCCCCEEEEEEEEEEECCCCCCCCEEEEECCCCCCCCEEE T ss_conf 99999988876589-99997134-45142117899998-75588737742121440375567623576125666773267 Q ss_pred CC------CCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 66------555401017763365899999963 Q gi|254780999|r 420 GD------MSGGFNPLKDLYKTQVFQLASWRN 445 (562) Q Consensus 420 GD------~~g~~~p~~~l~Kt~v~~la~~~n 445 (562) =. .-..+-|++++.|.+|++||++.+ T Consensus 159 L~~in~~~L~r~lfPlg~~~K~eVk~lA~~~g 190 (377) T KOG2805 159 LSTINQTQLKRLLFPLGCLTKSEVKKLAKQAG 190 (377) T ss_pred EECCCHHHHHHHHCCCCCCCHHHHHHHHHHCC T ss_conf 63036999986206676567799999998659 No 65 >COG0519 GuaA GMP synthase, PP-ATPase domain/subunit [Nucleotide transport and metabolism] Probab=98.84 E-value=4.6e-08 Score=71.91 Aligned_cols=187 Identities=22% Similarity=0.299 Sum_probs=106.7 Q ss_pred HHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCC-CHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH Q ss_conf 42245405752158723689999998414954467973246666522-13569999997213542677589999988631 Q gi|254780999|r 288 QKNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQ-SLEDAAACAKALGCKYDVLPIHDLVNHFFSLM 366 (562) Q Consensus 288 ~~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~-s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~ 366 (562) ++-|-.++++|||||.||+++|+|+.+|+| +++.|++.-...-... ..+-.+.+.+.+|++...++ +-+.|.+.+ T Consensus 17 ~~vg~~kvi~alSGGVDSsv~a~L~~~AiG-d~l~cvfVD~GLlR~~E~e~V~~~f~~~~~~nl~~Vd---A~~~Fl~~L 92 (315) T COG0519 17 EQVGDGKVILALSGGVDSSVAAVLAHRAIG-DQLTCVFVDHGLLRKGEAEQVVEMFREHLGLNLIVVD---AKDRFLSAL 92 (315) T ss_pred HHHCCCEEEEEECCCCCHHHHHHHHHHHHH-CCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCEEEEC---HHHHHHHHH T ss_conf 972885399983387808999999999740-5248999637743478569999998753298459975---498899884 Q ss_pred CCCCCCCCCCHHHHCCCCHHHHH---HHHHHHHHCC--CEECCCC--CHHHHHHHH-CC---------CCCCC-CCCCCC Q ss_conf 32346874310110123304667---7988875169--5443643--244775411-02---------36665-554010 Q gi|254780999|r 367 SQFLQEEPSGIVAENIQSRIRGN---ILMALSNHSK--AMLLTTS--NKSEISVGY-GT---------LYGDM-SGGFNP 428 (562) Q Consensus 367 ~~~~~~~~~~~~~eN~qaR~R~~---~L~~~an~~~--~lvl~t~--nksE~~~Gy-~T---------~~GD~-~g~~~p 428 (562) ....+.+.. +-.+.-. ..-..|.+.+ +|+-||- +--|...|- .| +.-|| ..-+.| T Consensus 93 ~GvtDPE~K-------RKiIG~~Fi~VFe~ea~k~~~~~LaQGTiYpDvIES~~g~~~~IKSHHNVGGLP~~m~lkLvEP 165 (315) T COG0519 93 KGVTDPEEK-------RKIIGREFIEVFEEEAKKLGAEFLAQGTIYPDVIESGTGKAGTIKSHHNVGGLPEDMKLKLVEP 165 (315) T ss_pred CCCCCHHHH-------HHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCEEEEHH T ss_conf 399988998-------9998799999999999857860687044345046506888886213562688950142223077 Q ss_pred CCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCC--CC---CCCCCCCCCCCHHHHHHHHHHHHHC Q ss_conf 17763365899999963103322234444668878828898845--38---9997700068878899999999827 Q gi|254780999|r 429 LKDLYKTQVFQLASWRNSHGITSGLGPLTEVIPPSILEKSPSAE--LR---PHQTDQESLPPYPILDDIIKRIVEN 499 (562) Q Consensus 429 ~~~l~Kt~v~~la~~~n~~~~~~~~~~~~~~ip~~i~~~~psae--L~---~~Q~ded~l~~Y~~lD~il~~~i~~ 499 (562) +-+|+|-+|+.|+++++ +|++++.+.|=|. |. .+-.+++.|---..-|.|+..-+++ T Consensus 166 Lr~LfKDEVR~lg~~LG--------------lp~~iv~RhPFPGPGLaiRilGevt~Ekl~ilR~Ad~I~~eEl~~ 227 (315) T COG0519 166 LRELFKDEVRELGRELG--------------LPEEIVYRHPFPGPGLAVRILGEVTREKLEILREADAIVEEELRK 227 (315) T ss_pred HHHHHHHHHHHHHHHHC--------------CCHHHHCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 89886899999999859--------------988884048999998678962544799999998876999999986 No 66 >PTZ00077 asparagine synthetase; Provisional Probab=98.62 E-value=4.5e-07 Score=65.42 Aligned_cols=137 Identities=23% Similarity=0.251 Sum_probs=80.4 Q ss_pred HHHHCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCH---------------------------------- Q ss_conf 8642202035888764224540575215872368999999841495---------------------------------- Q gi|254780999|r 273 EADYNACVLSLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGK---------------------------------- 318 (562) Q Consensus 273 e~~~~Alvlgl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~---------------------------------- 318 (562) +++...|.-+++.++.. - --+-+=|||||||+++++|+++-+.+ T Consensus 220 ~~lr~~L~~AV~~rL~s-D-VPvG~lLSGGLDSSlIaAia~k~l~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (610) T PTZ00077 220 EEIRIHLEKAVIKRLMG-D-VPFGILLSGGLDSSIIAAILAKHLKIIDGEGGGGIQGGGAAAAHNAAHEANNNNGEDNCG 297 (610) T ss_pred HHHHHHHHHHHHHHHHC-C-CCEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 99999999999998225-6-723666347705899999999865420111222222443210001111124444322123 Q ss_pred ------HHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCC--HHHHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHH Q ss_conf ------446797324666652213569999997213542677--589999988631323468743101101233046677 Q gi|254780999|r 319 ------ENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLP--IHDLVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNI 390 (562) Q Consensus 319 ------~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~--I~~~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~ 390 (562) .+++.+...-+.+ + ....|+..|+.||..|+++. ++..++++.+.+-+. +..|++ .++|-+-|-+ T Consensus 298 ~~~a~~~~l~tFsIGl~gs-p-Dl~~Ar~vA~~lGt~Hhe~~~t~ee~ldal~dvIy~l---Etyd~t--tirAs~pmYL 370 (610) T PTZ00077 298 NNNADPCHLKSFSIGLKGS-P-DLKAAKEVAEFLGIEHTEFHFTVEEGIDSLHDVIYHI---ETYDIT--TIRASTPMYI 370 (610) T ss_pred CCCCCCCCCEEEEEECCCC-C-HHHHHHHHHHHHCCCCEEEEECHHHHHHHHHHHHHHC---CCCCCC--CCCHHHHHHH T ss_conf 5666744422688625898-2-4899999999838834599977799987757665212---566755--4210358999 Q ss_pred HHHHHHHCCC-EECCCCCHHHHHHHHCCC Q ss_conf 9888751695-443643244775411023 Q gi|254780999|r 391 LMALSNHSKA-MLLTTSNKSEISVGYGTL 418 (562) Q Consensus 391 L~~~an~~~~-lvl~t~nksE~~~Gy~T~ 418 (562) |.-.+...|- +||+----.|+--||--+ T Consensus 371 lsr~ik~~gvKVvLSGEGADElFgGYlyf 399 (610) T PTZ00077 371 LSRLIKSSCVKMVLSGEGADEIFGGYLYF 399 (610) T ss_pred HHHHHHHCCCEEEECCCCCHHHHCCCHHH T ss_conf 99999864988995388744653464477 No 67 >COG0603 Predicted PP-loop superfamily ATPase [General function prediction only] Probab=98.62 E-value=3.8e-07 Score=65.91 Aligned_cols=150 Identities=25% Similarity=0.308 Sum_probs=91.8 Q ss_pred CCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHH-HHHCCC- Q ss_conf 540575215872368999999841495446797324666652213569999997213542677589999988-631323- Q gi|254780999|r 292 FHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVNHFF-SLMSQF- 369 (562) Q Consensus 292 ~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~-~~~~~~- 369 (562) .++.||-+|||+||+.+++.|.+.. +.|++++.-.-.-....++-|+.+|+.||+++++|+++-.-+ +. +.+... T Consensus 2 ~~kavvl~SGG~DStt~l~~a~~~~--~ev~alsfdYGQrh~~Ele~A~~iak~lgv~~~iid~~~~~~-~~~saLtd~~ 78 (222) T COG0603 2 MKKAVVLLSGGLDSTTCLAWAKKEG--YEVHALTFDYGQRHRKELEAAKELAKKLGVPHHIIDVDLLGE-IGGSALTDDS 78 (222) T ss_pred CCEEEEECCCCHHHHHHHHHHHHCC--CEEEEEEEECCCCCHHHHHHHHHHHHHCCCCEEEECHHHHHH-CCCCCCCCCC T ss_conf 8349999168813899999999609--889999750789868999999999998199828954168753-5888675888 Q ss_pred CCCCCCCHHHHCCC---CHHHHHHH----HHHHHHCC--CEECC---------CCCHHHH------HHHHCCCCCCCCCC Q ss_conf 46874310110123---30466779----88875169--54436---------4324477------54110236665554 Q gi|254780999|r 370 LQEEPSGIVAENIQ---SRIRGNIL----MALSNHSK--AMLLT---------TSNKSEI------SVGYGTLYGDMSGG 425 (562) Q Consensus 370 ~~~~~~~~~~eN~q---aR~R~~~L----~~~an~~~--~lvl~---------t~nksE~------~~Gy~T~~GD~~g~ 425 (562) ..-+....+.+.+. -=-|-.++ ..+|-..| .+++| ..++-|+ ++-++|..|=-. - T Consensus 79 ~~vp~~~~~~~~~p~t~VP~RN~iflsiA~~~Ae~~g~~~I~~Gv~~~D~sgYPDcrpefi~a~~~~~~l~~~~~~~~-i 157 (222) T COG0603 79 IDVPKYEFAEEEIPATFVPARNLIFLSIAAAYAEALGADAIIIGVNEEDFSGYPDCRPEFIEALNEALNLGTEKGVRI-I 157 (222) T ss_pred CCCCCCCCCCCCCCCEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCE-E T ss_conf 636666666666864476066099999999999875998599975321047899888899999999998625578617-7 Q ss_pred CCCCCCCCHHHHHHHHHHHH Q ss_conf 01017763365899999963 Q gi|254780999|r 426 FNPLKDLYKTQVFQLASWRN 445 (562) Q Consensus 426 ~~p~~~l~Kt~v~~la~~~n 445 (562) .+|+..+.|.++..++..++ T Consensus 158 ~aPl~~l~Ka~iv~l~~elg 177 (222) T COG0603 158 HAPLMELTKAEIVKLADELG 177 (222) T ss_pred ECCEEECCHHHHHHHHHHHC T ss_conf 67833134999999888868 No 68 >PRK13825 conjugal transfer protein TraB; Provisional Probab=98.61 E-value=5.2e-06 Score=58.48 Aligned_cols=160 Identities=14% Similarity=0.175 Sum_probs=95.4 Q ss_pred HHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCEEEEEEEEEECC-EEE Q ss_conf 9999859989991664447998445414889999999999999999887697899999998889878999999699-799 Q gi|254780999|r 31 EEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTHDGGAGIVVGFPRQDQEGVLNSVVILDAG-NII 109 (562) Q Consensus 31 ~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~~~~i~iivG~p~~~~~~~yNsa~vi~~G-~il 109 (562) +.+...|+++|||||..+.=+.|. .+...+. ..+..+..+++|....+.++..|+.+.|..+ ..+ T Consensus 217 ~~~~~~g~~vvvLPEs~lg~Wtp~--------~~~lw~~------~l~~~~~tvi~Ga~v~~~~Gydn~lv~i~~~~~ri 282 (389) T PRK13825 217 RAAARAGARVVVLPESALGFWTPT--------TERLWRE------SLRGADATVIAGAAVVDPGGYDNVLVAISAGGGRI 282 (389) T ss_pred HHHCCCCCEEEEECCHHCCCCCCH--------HHHHHHH------HHCCCCCEEEEEEEECCCCCCCCEEEEECCCCCEE T ss_conf 873479973999463211568713--------8999999------74689928998236758998744479971897268 Q ss_pred EEEEEEEECCC------CCCCCCCCCCC-CCCCCEEEECCEEEEECCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCC Q ss_conf 98877762355------43332100377-888761476474562011111110168999997147775993155533358 Q gi|254780999|r 110 AVRDKINLPNY------SEFHEKRTFIS-GYSNDPIVFRDIRLGILICEDIWKNSNICKHLKKQGAEFLFSLNASPYYHN 182 (562) Q Consensus 110 ~~y~K~~LP~y------~~FdE~r~F~~-G~~~~~~~~~g~kiGv~ICeDlw~~~~~~~~la~~Gadlii~psASp~~~~ 182 (562) ....++..|-- ..+.+.-.|.. -....+.+++|.|++.+||||.--.= |.-.......|+|+.++..=++.+ T Consensus 283 ly~~RmPVP~sMWqPW~~~~g~~g~~~a~~f~npv~~i~G~r~a~lICYEqll~w-p~l~sml~~Pdviva~~N~Wwt~g 361 (389) T PRK13825 283 LYRERMPVPVSMWQPWLPWTGESGGARAHFFANPVVTVDGRRVAPLICYEQLIVW-PVLQSMLHAPDVIVAVGNGWWTKG 361 (389) T ss_pred EECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCEEEEEEEEHHHHHHH-HHHHHHHCCCCEEEECCCCCCCCC T ss_conf 5036776651036677522489997200124586364568068775418787888-999987429988997588661689 Q ss_pred CHHHH--HHHHHHHHHHCCCEEEEC Q ss_conf 07888--788999854337046421 Q gi|254780999|r 183 KLKKR--HEIVTGQISHVHLPIIYV 205 (562) Q Consensus 183 k~~~R--~~~~~~~a~e~~~~vvy~ 205 (562) ..... +.-.++-++=.++|++.+ T Consensus 362 Tsi~aIQras~~awarLfg~Plv~A 386 (389) T PRK13825 362 TSIVAIQRASAEAWARLFGVPLVRA 386 (389) T ss_pred CCHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 9709999999999999837844876 No 69 >PRK11106 queuosine biosynthesis protein QueC; Provisional Probab=98.49 E-value=2.2e-06 Score=60.96 Aligned_cols=150 Identities=21% Similarity=0.255 Sum_probs=83.5 Q ss_pred CCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCC-CCCCCHHHHHHHHHHHHCCC- Q ss_conf 5405752158723689999998414954467973246666522135699999972135-42677589999988631323- Q gi|254780999|r 292 FHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCK-YDVLPIHDLVNHFFSLMSQF- 369 (562) Q Consensus 292 ~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~-~~~i~I~~~~~~~~~~~~~~- 369 (562) .+++||=+|||+||+++++.|.+.. ++|+++..-.---...-++.|+.+|+.+|+. |.+++|+-+-+-..+.+... T Consensus 1 MkkavVLlSGGlDStt~L~~a~~~~--~~v~alsfdYGQrh~~El~~A~~ia~~~gv~~h~vidl~~l~~~~~SaLt~~~ 78 (231) T PRK11106 1 MKRAVVVFSGGQDSTTCLIQALQQY--DEVHCITFDYGQRHRAEIDVARELALKLGARAHKVLDVTLLNELAVSSLTRDS 78 (231) T ss_pred CCEEEEEECCCHHHHHHHHHHHHCC--CEEEEEEEECCCCCHHHHHHHHHHHHHCCCCCEEEECCHHHHHHCCCCCCCCC T ss_conf 9808999078789999999999819--94999987778671999999999999859840067532777775235667778 Q ss_pred CCCCCCCHHHHCCCC---HHHHHHHH----HHHHHCC--CEECCCC---------CHHHH------HHHHCCCCCCCCCC Q ss_conf 468743101101233---04667798----8875169--5443643---------24477------54110236665554 Q gi|254780999|r 370 LQEEPSGIVAENIQS---RIRGNILM----ALSNHSK--AMLLTTS---------NKSEI------SVGYGTLYGDMSGG 425 (562) Q Consensus 370 ~~~~~~~~~~eN~qa---R~R~~~L~----~~an~~~--~lvl~t~---------nksE~------~~Gy~T~~GD~~g~ 425 (562) ...+......+++.+ =-|-.++. ++|-..| .+++|.- ++.|+ ++.+.|. ...-- T Consensus 79 i~vp~~~~~~~~~p~T~VP~RN~ifLsiAaa~Ae~~ga~~I~~G~~~~D~sgYPDCr~eFi~a~~~al~~g~~--~~i~i 156 (231) T PRK11106 79 IPVPDYEPEADGLPNTFVPGRNILFLTLAAIYAYQVKAEAVITGVCETDFSGYPDCRDEFVKALNHAVVLGMA--KDIRF 156 (231) T ss_pred CCCCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCC--CCEEE T ss_conf 7777765444578764671672999999999999859995998046556688998989999999999985178--98199 Q ss_pred CCCCCCCCHHHHHHHHHHHH Q ss_conf 01017763365899999963 Q gi|254780999|r 426 FNPLKDLYKTQVFQLASWRN 445 (562) Q Consensus 426 ~~p~~~l~Kt~v~~la~~~n 445 (562) .+|+.++.|.++..|++..+ T Consensus 157 ~aPl~~l~K~eiv~l~~~~~ 176 (231) T PRK11106 157 ETPLMWLDKAETWALADYYG 176 (231) T ss_pred EECCCCCCHHHHHHHHHHCC T ss_conf 83777788889999998616 No 70 >cd01712 ThiI ThiI is required for thiazole synthesis in the thiamine biosynthesis pathway. It belongs to the Adenosine Nucleotide Hydrolysis suoerfamily and predicted to bind to Adenosine nucleotide. Probab=98.40 E-value=3.9e-06 Score=59.29 Aligned_cols=141 Identities=21% Similarity=0.173 Sum_probs=79.2 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCC-CCCCCCCHHHHHHHHHHHC-----CCCCCCCHHHHHHHHHHHHC Q ss_conf 05752158723689999998414954467973246-6665221356999999721-----35426775899999886313 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPY-KYTSPQSLEDAAACAKALG-----CKYDVLPIHDLVNHFFSLMS 367 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~-~~~s~~s~~~a~~la~~lg-----~~~~~i~I~~~~~~~~~~~~ 367 (562) ++++.+||||||+|+|.+..+. |- .|.++.|-+ .++++.+.+.+..+++.+. .....++....+ +....+ T Consensus 1 kvl~L~SGGiDS~VAa~ll~kr-G~-~V~~l~f~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~--~~~~~~ 76 (177) T cd01712 1 KALALLSGGIDSPVAAWLLMKR-GI-EVDALHFNSGPFTSEKAREKVEDLARKLARYSPGHKLVVIIFTFFV--QKEIYG 76 (177) T ss_pred CEEEEECCCCHHHHHHHHHHHC-CC-EEEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCHHH--HHHHHH T ss_conf 9899957781699999999987-99-8999999899988889999999999999994899628999453899--999997 Q ss_pred CCCCCCCCCHHHHCCCCHHH---HHHHHHHHHHCCCEECCCCCH-----HHHHHH-HCCCCCCCCCCCCCCCCCCHHHHH Q ss_conf 23468743101101233046---677988875169544364324-----477541-102366655540101776336589 Q gi|254780999|r 368 QFLQEEPSGIVAENIQSRIR---GNILMALSNHSKAMLLTTSNK-----SEISVG-YGTLYGDMSGGFNPLKDLYKTQVF 438 (562) Q Consensus 368 ~~~~~~~~~~~~eN~qaR~R---~~~L~~~an~~~~lvl~t~nk-----sE~~~G-y~T~~GD~~g~~~p~~~l~Kt~v~ 438 (562) . +... |-.--.| .-+...+|.+.|+-.+-||.- |.-.-. +.+-.+-..-=+-|+++..|.++. T Consensus 77 ~---g~~~-----npcv~ckr~m~r~a~~~A~~~ga~~IvTGe~lGQvasqt~~nl~~i~~~~~~~ilRPL~~~dK~EI~ 148 (177) T cd01712 77 Y---GKEK-----YRCILCKRMMYRIAEKLAEELGADAIVTGESLGQVASQTLENLLVISSGTDLPILRPLIGFDKEEII 148 (177) T ss_pred H---CCCC-----CCCEEHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHH T ss_conf 0---8888-----9863629999999999999869989986652220447689999999874256421488899989999 Q ss_pred HHHHHHHH Q ss_conf 99999631 Q gi|254780999|r 439 QLASWRNS 446 (562) Q Consensus 439 ~la~~~n~ 446 (562) ++|+-++. T Consensus 149 ~~A~~igt 156 (177) T cd01712 149 GIARRIGT 156 (177) T ss_pred HHHHHHCC T ss_conf 99998096 No 71 >pfam02568 ThiI Thiamine biosynthesis protein (ThiI). ThiI is required for thiazole synthesis, required for thiamine biosynthesis. Probab=98.39 E-value=4.4e-06 Score=58.96 Aligned_cols=144 Identities=19% Similarity=0.150 Sum_probs=92.8 Q ss_pred CEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEEC-CCCCCCCCCHHHHHHHHHHH-------CCCCCCCCHHHHHHHHHH Q ss_conf 4057521587236899999984149544679732-46666522135699999972-------135426775899999886 Q gi|254780999|r 293 HKVIIGLSGGIDSALCAAIAVDALGKENVQTIML-PYKYTSPQSLEDAAACAKAL-------GCKYDVLPIHDLVNHFFS 364 (562) Q Consensus 293 ~~~viglSGGiDSal~a~la~~alg~~~v~~~~m-p~~~~s~~s~~~a~~la~~l-------g~~~~~i~I~~~~~~~~~ 364 (562) .++++-+||||||.|++.+..+. |- +|+++.+ ...++++.+.+.++.+++.| .+++..+|..+....... T Consensus 4 gk~l~LlSGGiDSpVAa~lmmkR-G~-~V~~vhf~~~p~~~~~~~~k~~~l~~~l~~y~~~~~~~l~vv~~~~~~~~i~~ 81 (197) T pfam02568 4 GKVLALLSGGIDSPVAAYLMMRR-GC-RVVALHFINEPGTSEEAIEKVRKLAELLAEYGTSCEGKLVVVDFTKVQKEIIE 81 (197) T ss_pred CCEEEEECCCCHHHHHHHHHHHC-CC-EEEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEECCHHHHHHHHH T ss_conf 71898866871299999999987-99-79999987999998999999999999999737887544999574999999996 Q ss_pred HHCCCCCCCCCCHHHHCCCCHHHH-HHHHHHHHHCCCEECCCCCH-HHHH-HH----HCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 313234687431011012330466-77988875169544364324-4775-41----10236665554010177633658 Q gi|254780999|r 365 LMSQFLQEEPSGIVAENIQSRIRG-NILMALSNHSKAMLLTTSNK-SEIS-VG----YGTLYGDMSGGFNPLKDLYKTQV 437 (562) Q Consensus 365 ~~~~~~~~~~~~~~~eN~qaR~R~-~~L~~~an~~~~lvl~t~nk-sE~~-~G----y~T~~GD~~g~~~p~~~l~Kt~v 437 (562) ...+. .. .+--|--| -+...+|.+.|+..+-||.- ...| .- +.+-.+-..-=+-|+.++.|.++ T Consensus 82 ~~~~~----~~-----cv~cKr~M~r~A~~iA~~~ga~~IVTGEsLGQVaSQTl~nl~~i~~~~~~pilRPLig~DK~EI 152 (197) T pfam02568 82 KAPEK----YR-----CVLCKRCMYRAAEKVAEEEGADALVTGESLGQVASQTLDNLRVISAATNLPILRPLIGLDKEEI 152 (197) T ss_pred CCCCC----CE-----EHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHCCCCHHHHHHHHHCCCCCCCCCCCCHHHH T ss_conf 28987----65-----4569999999999999984998998473031210045310699987624853134346999999 Q ss_pred HHHHHHHHHC Q ss_conf 9999996310 Q gi|254780999|r 438 FQLASWRNSH 447 (562) Q Consensus 438 ~~la~~~n~~ 447 (562) .++||-++.. T Consensus 153 i~~Ar~IGt~ 162 (197) T pfam02568 153 INLAKEIGTY 162 (197) T ss_pred HHHHHHHCCH T ss_conf 9999994867 No 72 >pfam00733 Asn_synthase Asparagine synthase. This family is always found associated with pfam00310. Members of this family catalyse the conversion of aspartate to asparagine. Probab=98.37 E-value=8.8e-06 Score=57.00 Aligned_cols=124 Identities=22% Similarity=0.266 Sum_probs=70.8 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHH--H Q ss_conf 35888764224540575215872368999999841495446797324666652213569999997213542677589--9 Q gi|254780999|r 281 LSLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHD--L 358 (562) Q Consensus 281 lgl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~--~ 358 (562) -+++.+++ .-..+-+-|||||||+++++++.+-. ..++.+++.-...+.......|+.+|+.+|++|..+.++. . T Consensus 8 ~aV~~~l~--sdvpig~~LSGGlDSs~ia~l~~~~~-~~~i~~~s~~~~~~~~~E~~~a~~~a~~~~~~~~~v~~~~~~~ 84 (195) T pfam00733 8 DAVKRRLR--ADVPVGVLLSGGLDSSLIAALAARQS-SPPLKTFSVGFEGSDYDEAPYAELVADHLGTDHHEIIVTEEEL 84 (195) T ss_pred HHHHHHHC--CCCCEEEECCCCHHHHHHHHHHHHHC-CCCCEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEEECHHHH T ss_conf 99999834--69836753067468999999999853-8995389557888997589999999965133517877255999 Q ss_pred HHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCC-EECCCCCHHHHHHHH Q ss_conf 999886313234687431011012330466779888751695-443643244775411 Q gi|254780999|r 359 VNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSKA-MLLTTSNKSEISVGY 415 (562) Q Consensus 359 ~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~-lvl~t~nksE~~~Gy 415 (562) .+.+...+.. .+.+..+ .+-+=..+++..+.+ |. .+|+-.--.|+.-|| T Consensus 85 ~~~~~~~i~~-~~~P~~~------~~~~~~~~l~k~~~~-~~kV~lsG~GaDElf~GY 134 (195) T pfam00733 85 LDALPEVIYH-LEEPFGD------SSAIPLYLLSRLARK-GVKVVLSGEGADELFGGY 134 (195) T ss_pred HHHHHHHHHH-HCCCCCC------CCHHHHHHHHHHHHC-CCEEEEEEECHHHHHCCC T ss_conf 9998888998-6298777------517899999999868-984999715688884798 No 73 >TIGR03104 trio_amidotrans asparagine synthase family amidotransferase. Members of this protein family are closely related to several isoforms of asparagine synthetase (glutamine amidotransferase) and typically have been given this name in genome annotation to date. Each is part of a conserved three-gene cassette sparsely distributed across at least twenty different species known so far, including alpha, beta, and gamma Proteobacteria, Mycobacterium, and Prosthecochloris, which is a member of the Chlorobi. The other two members of the cassette are a probable protease and a member of the GNAT family of acetyltransferases. Probab=98.35 E-value=4.9e-06 Score=58.70 Aligned_cols=132 Identities=15% Similarity=0.163 Sum_probs=75.4 Q ss_pred HCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCC---CCCCHHHHHHHHHHHCCCCCC Q ss_conf 22020358887642245405752158723689999998414954467973246666---522135699999972135426 Q gi|254780999|r 276 YNACVLSLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYT---SPQSLEDAAACAKALGCKYDV 352 (562) Q Consensus 276 ~~Alvlgl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~---s~~s~~~a~~la~~lg~~~~~ 352 (562) .+.+.-+++.+++. --.+-+-|||||||+++++++.+.. ..++.+++.-...+ .......|+.+|+.+|++|++ T Consensus 246 ~~~l~~sV~~rl~s--Dvpvg~~LSGGlDSS~i~a~~~~~~-~~~i~tfs~~f~~~~~~~~dE~~~a~~va~~~~~~h~~ 322 (589) T TIGR03104 246 LEALRLAVKRRLVA--DVPVGVLLSGGLDSSLIVGLLAEAG-VDGLRTFSIGFEDVGGEKGDEFEYSDIIAERFHTRHHK 322 (589) T ss_pred HHHHHHHHHHHHHC--CCCCCEECCCCCCHHHHHHHHHHHC-CCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEE T ss_conf 99999999999744--6763000178754389999999956-99985278872468877666799999999973878579 Q ss_pred CCHH--HHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCHHHHHHHHCC Q ss_conf 7758--9999988631323468743101101233046677988875169544364324477541102 Q gi|254780999|r 353 LPIH--DLVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSNKSEISVGYGT 417 (562) Q Consensus 353 i~I~--~~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nksE~~~Gy~T 417 (562) +.|+ ...+.+...+.. ++.+..+. +-+-..+|+..+.+.--++|+-.--.|+--||.- T Consensus 323 ~~~~~~~~~~~l~~~~~~-~deP~~~~------~~~~~~~l~~~~~~~~kV~LsG~GaDElfgGY~~ 382 (589) T TIGR03104 323 IRIPNHRVLPALPEAVAA-MSEPMVSH------DCVAFYLLSEEVSKHVKVVQSGQGADEVFGGYHW 382 (589) T ss_pred EEECCHHHHHHHHHHHHC-CCCCCCCC------HHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCC T ss_conf 995717899988998852-68987683------0688999998732883699952033323479754 No 74 >COG0037 MesJ tRNA(Ile)-lysidine synthase MesJ [Cell cycle control, cell division, chromosome partitioning] Probab=98.33 E-value=8.7e-06 Score=57.02 Aligned_cols=143 Identities=17% Similarity=0.139 Sum_probs=91.3 Q ss_pred CEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCC--CCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCC Q ss_conf 405752158723689999998414954467973246666--522135699999972135426775899999886313234 Q gi|254780999|r 293 HKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYT--SPQSLEDAAACAKALGCKYDVLPIHDLVNHFFSLMSQFL 370 (562) Q Consensus 293 ~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~--s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~~~~ 370 (562) .+++||+|||-||++.+.+..+..+.-++.++..-.... ++......+.+|+.+|+.+.+..++..+..... T Consensus 22 ~~ilVavSGGkDS~~ll~~L~~l~~~~~~~a~~Vd~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~------ 95 (298) T COG0037 22 YKILVAVSGGKDSLALLHLLKELGRRIEVEAVHVDHGLRGYSDQEAELVEKLCEKLGIPLIVERVTDDLGRETL------ 95 (298) T ss_pred CCEEEEECCCHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEEEHHHHHHHCCC------ T ss_conf 85899937878999999999984225738999970898864328999999999964998488620454310024------ Q ss_pred CCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCC-CHHHHH----------------HHHCCCCCCCCC--CCCCCCC Q ss_conf 687431011012330466779888751695443643-244775----------------411023666555--4010177 Q gi|254780999|r 371 QEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTS-NKSEIS----------------VGYGTLYGDMSG--GFNPLKD 431 (562) Q Consensus 371 ~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~-nksE~~----------------~Gy~T~~GD~~g--~~~p~~~ 431 (562) . +...+..-.+.|-.+|...|...|.=++-|| |.++.+ .|...+..-..+ -+-|+-. T Consensus 96 ~----~~~~c~~c~~~R~~~l~~~a~~~g~~~i~tgH~~dD~~et~lm~l~~g~~~~~l~~~~~~~~~~~~~~~iRPL~~ 171 (298) T COG0037 96 D----GKSICAACRRLRRGLLYKIAKELGADKIATGHHLDDQAETFLMNLLRGSGLRGLRGMPPKRPFEGGLLIIRPLLY 171 (298) T ss_pred C----CCCHHHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHCCCCHHHHHCCCCCCCCCCCCEEECCCCC T ss_conf 6----767879999999999999999859998985678468999999998617522456408852445788645475746 Q ss_pred CCHHHHHHHHHHHH Q ss_conf 63365899999963 Q gi|254780999|r 432 LYKTQVFQLASWRN 445 (562) Q Consensus 432 l~Kt~v~~la~~~n 445 (562) +++.++..++...+ T Consensus 172 ~~~~ei~~~~~~~~ 185 (298) T COG0037 172 VREKEIELYAKEKG 185 (298) T ss_pred CCHHHHHHHHHHCC T ss_conf 88999999999759 No 75 >KOG1622 consensus Probab=98.32 E-value=1.2e-06 Score=62.71 Aligned_cols=156 Identities=24% Similarity=0.310 Sum_probs=91.3 Q ss_pred HHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCC-CCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH Q ss_conf 422454057521587236899999984149544679732466665-2213569999997213542677589999988631 Q gi|254780999|r 288 QKNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTS-PQSLEDAAACAKALGCKYDVLPIHDLVNHFFSLM 366 (562) Q Consensus 288 ~~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s-~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~ 366 (562) +.-|..+|++.+|||.||+|+++|...|+|+++++++..---+-. .++-.--+.|.+ ||+....++-...+ .+.+ T Consensus 226 k~vG~~~Vl~~vSGgvdStV~a~Ll~~alg~~R~~ai~vdNG~mrk~Ea~~V~~tl~~-lgi~i~v~~as~~f---~s~L 301 (552) T KOG1622 226 KWVGDYKVLVAVSGGVDSTVCAALLRRALGPDRVHAIHVDNGFMRKKEAEQVEKTLVY-LGIPITVVDASETF---LSKL 301 (552) T ss_pred HHHCCCCEEEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHH-CCCCEEEEECHHHH---HHHH T ss_conf 9846445599834885089999999986378944899812640103677889999997-38844996215888---8761 Q ss_pred CCCCCCCCC----CHHH--------------------HCCCCHHHHHHHHHHHHHCCCEECCCCCHHHHHHHHCCCC--C Q ss_conf 323468743----1011--------------------0123304667798887516954436432447754110236--6 Q gi|254780999|r 367 SQFLQEEPS----GIVA--------------------ENIQSRIRGNILMALSNHSKAMLLTTSNKSEISVGYGTLY--G 420 (562) Q Consensus 367 ~~~~~~~~~----~~~~--------------------eN~qaR~R~~~L~~~an~~~~lvl~t~nksE~~~Gy~T~~--G 420 (562) ....+.+.. +.+. =++|+-+|-- +.+-+.+.|+| ++|.. -|-+ | T Consensus 302 ~~~~dPE~KRkiIG~tfikv~~~~~~~l~~k~~~~~~flaQgtL~Pd------~ieS~s~~g~~-~a~tI---KThhn~~ 371 (552) T KOG1622 302 KGVTDPEEKRKIIGRTFIKVFDPVASELNKKHGEKESFLAQGTLRPD------LIESASVYGSG-HAETI---KTHHNDT 371 (552) T ss_pred CCCCCHHHHCEECCCCEEEECCHHHHHHHHCCCCCCEEEECCCCCCH------HHHHCCCCCCC-HHHHH---HCCCCCC T ss_conf 26688788161024200240767888752065760004433501410------44313456773-45442---0102550 Q ss_pred ----CC---CCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCC Q ss_conf ----65---5540101776336589999996310332223444466887882889884 Q gi|254780999|r 421 ----DM---SGGFNPLKDLYKTQVFQLASWRNSHGITSGLGPLTEVIPPSILEKSPSA 471 (562) Q Consensus 421 ----D~---~g~~~p~~~l~Kt~v~~la~~~n~~~~~~~~~~~~~~ip~~i~~~~psa 471 (562) |+ +--+.|+.|+.|-+|+.|-+-++ +|+.++.+.|=+ T Consensus 372 ~L~r~lrklgK~ieplk~~~kDEvr~lgk~lG--------------lp~~Lv~rhPfp 415 (552) T KOG1622 372 GLIRDLRKLGKVIEPLKDFHKDEVRELGKDLG--------------LPESLVPRHPFP 415 (552) T ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCC--------------CCHHHHCCCCCC T ss_conf 58999998316574267888899987666508--------------724320258999 No 76 >pfam07186 consensus Probab=98.30 E-value=8.6e-05 Score=50.53 Aligned_cols=159 Identities=13% Similarity=0.096 Sum_probs=92.4 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCEEEEEEEEEEC Q ss_conf 99999999985998999166444799844541488999999999999999988769789999999888987899999969 Q gi|254780999|r 26 ARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTHDGGAGIVVGFPRQDQEGVLNSVVILDA 105 (562) Q Consensus 26 i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~~~~i~iivG~p~~~~~~~yNsa~vi~~ 105 (562) +....+.+..+|++++|+||-.+.=+.|. .+..++ +..+..+..+++|....+..+.-|+.+.+.. T Consensus 219 li~tv~~~~~~g~rvvvLPEs~lg~Wtp~--------~~~lW~------~~l~~~~~Tvi~Ga~v~~~~GydnvlV~i~~ 284 (396) T pfam07186 219 LIATVRRAAGEEARVVVLPESALGFWTPT--------VERLWR------DGLRGADVTVIAGAAVIDAGGYDNVMVTISA 284 (396) T ss_pred HHHHHHHHCCCCCEEEECCCHHHCCCCCC--------HHHHHH------HHHCCCCCEEEEEEEECCCCCCEEEEEEECC T ss_conf 99999975169986999052342267740--------688989------8735799379985443599870358999718 Q ss_pred --CEEEEEEEEEE-ECC-----------CCCCCCCCCCCCCCCCCEEEECCEEEEECCCCCCCHHHHHHHHHHHCCCCEE Q ss_conf --97999887776-235-----------5433321003778887614764745620111111101689999971477759 Q gi|254780999|r 106 --GNIIAVRDKIN-LPN-----------YSEFHEKRTFISGYSNDPIVFRDIRLGILICEDIWKNSNICKHLKKQGAEFL 171 (562) Q Consensus 106 --G~il~~y~K~~-LP~-----------y~~FdE~r~F~~G~~~~~~~~~g~kiGv~ICeDlw~~~~~~~~la~~Gadli 171 (562) ++++ |+... .|- ....-...||. ..+++++|.|++.+||||.-..- |.-......-|+| T Consensus 285 ~~~ril--y~~RmPVPvsMWqPW~~~~g~~g~a~a~~f~----npvv~i~g~rva~lICYEqLl~w-P~Lqsml~~PDvI 357 (396) T pfam07186 285 DEARIL--YRERMPVPVSMWQPWLSWTGQGGGARAHFFA----NPFAEVAGSRNAPLICYEQLIVW-PVLQSMLYSPDAI 357 (396) T ss_pred CCCEEE--EECCCCCCHHHCCCCCCCCCCCCCCCCCCCC----CCEEEECCEEEEEEEEHHHHHHH-HHHHHHHCCCCEE T ss_conf 983376--2047876422035664335898874245356----97489768068887658888878-9999875199889 Q ss_pred EECCCCCCCCCCHHHH--HHHHHHHHHHCCCEEEEC Q ss_conf 9315553335807888--788999854337046421 Q gi|254780999|r 172 FSLNASPYYHNKLKKR--HEIVTGQISHVHLPIIYV 205 (562) Q Consensus 172 i~psASp~~~~k~~~R--~~~~~~~a~e~~~~vvy~ 205 (562) +.+...=++.+..... ....++-++=.++|++-+ T Consensus 358 Va~~N~Wwt~gTsi~aIQrasv~AwaRLfg~Pvv~A 393 (396) T pfam07186 358 VAVGNGWWTEGTSIVAIQRASVTAWAKLFGKPVVIA 393 (396) T ss_pred EECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 983787745898619999999999999837844887 No 77 >TIGR03108 eps_aminotran_1 exosortase 1 system-associated amidotransferase 1. The predicted protein-sorting transpeptidase that we call exosortase (see TIGR02602) has distinct subclasses that associated with different types of exopolysaccharide production loci. This model represents a distinct clade among a set of amidotransferases largely annotated (not necessarily accurately) as glutatime-hydrolyzing asparagine synthases. Members of this clade are essentially restricted to the characteristic exopolysaccharide (EPS) regions that contain the exosortase 1 genome (xrtA), in genomes that also have numbers of PEP-CTERM domain (TIGR02595) proteins. Probab=98.30 E-value=7.2e-06 Score=57.56 Aligned_cols=136 Identities=15% Similarity=0.117 Sum_probs=77.2 Q ss_pred HHHCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCC Q ss_conf 64220203588876422454057521587236899999984149544679732466665221356999999721354267 Q gi|254780999|r 274 ADYNACVLSLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVL 353 (562) Q Consensus 274 ~~~~Alvlgl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i 353 (562) ++.+.+.-+++.+++. --.|=+=|||||||+++++++++.. .+++.+++.....+..+....|+.+|+.+|++|+++ T Consensus 242 ~~~~ll~~sV~~rl~s--DVpvG~~LSGGlDSS~I~a~~~~~~-~~~i~tfsi~f~~~~~DE~~~A~~vA~~~g~~h~~~ 318 (628) T TIGR03108 242 ELIERLREAVRSRMVA--DVPLGAFLSGGVDSSAVVALMAGLS-DTPVNTCSIAFDDPAFDESAYARQVAERYGTNHRVE 318 (628) T ss_pred HHHHHHHHHHHHHHCC--CCCEEEEECCCCCHHHHHHHHHHHC-CCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCCEEE T ss_conf 9999999999998057--8856876468863689999999842-998643665348887543699999998608764799 Q ss_pred CHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCHHHHHHHHCCC Q ss_conf 75899999886313234687431011012330466779888751695443643244775411023 Q gi|254780999|r 354 PIHDLVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSNKSEISVGYGTL 418 (562) Q Consensus 354 ~I~~~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nksE~~~Gy~T~ 418 (562) .+++.--.....+....+.+..|. +-+=+-.|+-.|.+.--++|+=.--.|+--||..+ T Consensus 319 ~~~~~~~~~~~~~~~~~deP~~d~------s~i~~~~l~~~a~~~vkV~LsG~GgDElF~GY~~y 377 (628) T TIGR03108 319 TVDPDDFSLVDRLAGLYDEPFADS------SALPTYRVCELARKRVTVALSGDGGDELFAGYRRY 377 (628) T ss_pred ECCHHHHHHHHHHHHHHCCCCCCH------HHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCHHH T ss_conf 779899987999999846886667------89999999999864974999667754011585566 No 78 >pfam06508 ExsB ExsB. This family includes putative transcriptional regulators from Bacteria and Archaea. Probab=98.25 E-value=1.4e-06 Score=62.18 Aligned_cols=65 Identities=34% Similarity=0.472 Sum_probs=51.3 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHH Q ss_conf 0575215872368999999841495446797324666652213569999997213542677589999 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVN 360 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~ 360 (562) ++|+-+|||+||+++++++.+. | .+|++++.-.--.....++.|+.+|+.||++|.+|+|..+-+ T Consensus 1 Kavvl~SGG~DSt~~l~~a~~~-~-~~v~ait~dYGQ~~~~Ei~~A~~ia~~l~i~h~vidl~~l~~ 65 (137) T pfam06508 1 KAVVLLSGGLDSTTCLAWAKKE-G-YEVYALTFDYGQRHSKELECAKKIAKALGVEHKIVDLDFLKQ 65 (137) T ss_pred CEEEEECCCHHHHHHHHHHHHC-C-CEEEEEEECCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHH T ss_conf 9899917878999999999986-9-968999814788739999999999998299750303302444 No 79 >cd01991 Asn_Synthase_B_C The C-terminal domain of Asparagine Synthase B. This domain is always found associated n-terminal amidotransferase domain. Family members that contain this domain catalyse the conversion of aspartate to asparagine. Asparagine synthetase B catalyzes the assembly of asparagine from aspartate, Mg(2+)ATP, and glutamine. The three-dimensional architecture of the N-terminal domain of asparagine synthetase B is similar to that observed for glutamine phosphoribosylpyrophosphate amidotransferase while the molecular motif of the C-domain is reminiscent to that observed for GMP synthetase . Probab=98.24 E-value=7.2e-06 Score=57.56 Aligned_cols=126 Identities=24% Similarity=0.304 Sum_probs=71.1 Q ss_pred HHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHH--HH Q ss_conf 5888764224540575215872368999999841495446797324666652213569999997213542677589--99 Q gi|254780999|r 282 SLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHD--LV 359 (562) Q Consensus 282 gl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~--~~ 359 (562) +++++++ .-..|-+-||||+||+++|+++.+-. ..++.++++-......+....|+.+|+.+|++++.+.+++ .. T Consensus 7 sv~~~l~--sd~~vg~~LSGGlDSs~ia~~~~~~~-~~~~~~~s~~~~~~~~de~~~a~~va~~~~~~~~~i~~~~~~~~ 83 (269) T cd01991 7 AVRRRLR--SDVPVGVLLSGGLDSSLVAALAARLL-PEPVKTFSIGFGFEGSDEREYARRVAEHLGTEHHEVEFTPADLL 83 (269) T ss_pred HHHHHHC--CCCCEEEEECCHHHHHHHHHHHHHHC-CCCCCEEEEECCCCCCCHHHHHHHHHHHCCCCCEEEECCHHHHH T ss_conf 9999954--48856865135699999999999836-89984799862899973789999999980988245314348899 Q ss_pred HHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCC-HHHHHHHHCC Q ss_conf 998863132346874310110123304667798887516954436432-4477541102 Q gi|254780999|r 360 NHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSN-KSEISVGYGT 417 (562) Q Consensus 360 ~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~n-ksE~~~Gy~T 417 (562) +.+.+.+.. .+.+..+ .+. .-.-.++..|.+.|.=|+=||+ -.|+.-||.. T Consensus 84 ~~~~~~~~~-~~~p~~~---~~~---~~~~~l~~~a~~~g~~VllsG~GgDElf~Gy~~ 135 (269) T cd01991 84 AALPDVIWE-LDEPFAD---SSA---IPLYLLSRLARKHGIKVVLSGEGADELFGGYPR 135 (269) T ss_pred HHHHHHHHH-HCCCCHH---HHH---HHHHHHHHHHHHCCCEEEEECCCHHHHHCCCHH T ss_conf 999999998-6396012---547---899999999985397799967761464138588 No 80 >KOG0571 consensus Probab=98.22 E-value=1.7e-05 Score=55.14 Aligned_cols=149 Identities=20% Similarity=0.247 Sum_probs=88.2 Q ss_pred HCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCH------HHEEEEECCCCCCCCCCHHHHHHHHHHHCCC Q ss_conf 2202035888764224540575215872368999999841495------4467973246666522135699999972135 Q gi|254780999|r 276 YNACVLSLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGK------ENVQTIMLPYKYTSPQSLEDAAACAKALGCK 349 (562) Q Consensus 276 ~~Alvlgl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~------~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~ 349 (562) -+++.-+++.++.... .+=+=||||+||+|+|+||..-+-. ..++.+...-. .|+ .+.+|++.|+.||+. T Consensus 211 r~~~~~aV~KRLM~d~--p~GvLLSGGLDSSLvAsia~R~lk~~~~~~~~~lhsFaIGle-~SP-DL~aarkVAd~igt~ 286 (543) T KOG0571 211 RHTLEKAVRKRLMTDV--PFGVLLSGGLDSSLVASIAARELKKAQAARGSKLHSFAIGLE-DSP-DLLAARKVADFIGTI 286 (543) T ss_pred HHHHHHHHHHHHHCCC--CEEEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCC-CCH-HHHHHHHHHHHHCCC T ss_conf 9999999999863168--615895077318999999999998766412787547885289-985-378889999874885 Q ss_pred CCCCC--HHHHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCC-EECCCCCHHHHHHHHCCCCCCCCCCC Q ss_conf 42677--5899999886313234687431011012330466779888751695-44364324477541102366655540 Q gi|254780999|r 350 YDVLP--IHDLVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSKA-MLLTTSNKSEISVGYGTLYGDMSGGF 426 (562) Q Consensus 350 ~~~i~--I~~~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~-lvl~t~nksE~~~Gy~T~~GD~~g~~ 426 (562) |++.- +++-.++....+=+ -+..|+| -|+|-+-|-+|.-.-.+.|- +||+----.|+--||--+. = T Consensus 287 Hhe~~ft~qegidal~eVI~h---LETYDvt--tIRastpmyLlsr~Ikk~gvkmvlSGEGsDEifggYlYfh------~ 355 (543) T KOG0571 287 HHEHTFTIQEGIDALDEVIYH---LETYDVT--TIRASTPMYLLSRKIKKLGVKMVLSGEGSDEIFGGYLYFH------K 355 (543) T ss_pred CEEEEECHHHHHHHHHHHHEE---EECCCCC--EEECCCCHHHHHHHHHHCCEEEEEECCCCHHHHCCEEEEE------C T ss_conf 238997588877788877424---1013441--5861774678999997655289981577134421121100------3 Q ss_pred CCCCCCCHHHHHH Q ss_conf 1017763365899 Q gi|254780999|r 427 NPLKDLYKTQVFQ 439 (562) Q Consensus 427 ~p~~~l~Kt~v~~ 439 (562) +|.+-=..++.-. T Consensus 356 APs~~~fh~E~~r 368 (543) T KOG0571 356 APSAEEFHEESVR 368 (543) T ss_pred CCCHHHHHHHHHH T ss_conf 8897898899999 No 81 >PRK09431 asnB asparagine synthetase B; Provisional Probab=98.17 E-value=1.9e-05 Score=54.78 Aligned_cols=135 Identities=17% Similarity=0.188 Sum_probs=76.0 Q ss_pred HHHHCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCH------------HHEEEEECCCCCCCCCCHHHHH Q ss_conf 8642202035888764224540575215872368999999841495------------4467973246666522135699 Q gi|254780999|r 273 EADYNACVLSLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGK------------ENVQTIMLPYKYTSPQSLEDAA 340 (562) Q Consensus 273 e~~~~Alvlgl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~------------~~v~~~~mp~~~~s~~s~~~a~ 340 (562) +++.+.|.-+++.++.. --.+-+=|||||||+++++||.+-... .+++++...-..+ + ....|+ T Consensus 210 ~~lr~lL~~AV~~rL~s--DvpvG~~LSGGLDSSlIaala~k~~~~~~~~~~~~~~~~~~l~tFsiG~~~s-~-D~~~Ar 285 (555) T PRK09431 210 NELREALEAAVKKRLMS--DVPYGVLLSGGLDSSLISAIAKKYAARRIEDDERSEAWWPQLHSFAVGLEGS-P-DLKAAR 285 (555) T ss_pred HHHHHHHHHHHHHHHCC--CCCEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCCC-C-HHHHHH T ss_conf 99999999999998356--8862432057706799999999852024433222223467875366047998-1-499999 Q ss_pred HHHHHHCCCCCCCCHHH--HHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCC-EECCCCCHHHHHHHHC Q ss_conf 99997213542677589--9999886313234687431011012330466779888751695-4436432447754110 Q gi|254780999|r 341 ACAKALGCKYDVLPIHD--LVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSKA-MLLTTSNKSEISVGYG 416 (562) Q Consensus 341 ~la~~lg~~~~~i~I~~--~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~-lvl~t~nksE~~~Gy~ 416 (562) ..|+.+|+.|+++.++. ..+.+.+.+.+. + ..|.+ .+.|-+-+-+|.-.+...|- +||+=---.|+--||- T Consensus 286 ~vA~~lgt~h~ei~~t~~d~~~~l~~vi~~l-E--~~d~~--~~~asip~yllsk~i~~~gvKVvLSGeGADELFgGY~ 359 (555) T PRK09431 286 EVADYLGTVHHEIHFTVQEGLDALRDVIYHL-E--TYDVT--TIRASTPMYLMSRKIKAMGIKMVLSGEGADELFGGYL 359 (555) T ss_pred HHHHHHCCCCEEEEECHHHHHHHHHHHHHHH-H--CCCCC--CHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHCCCH T ss_conf 9999829956599954799999999999874-1--46875--0256899999999999639879992775044104877 No 82 >TIGR01536 asn_synth_AEB asparagine synthase (glutamine-hydrolyzing); InterPro: IPR006426 These sequences represent glutamine-hydrolysing asparagine synthase. The group have a poorly conserved C-terminal extension while bacterial members of the family tend to have a long, poorly conserved insert lacking from archaeal and eukaryotic sequences. Multiple isozymes have been demonstrated, such as in Bacillus subtilis .; GO: 0004066 asparagine synthase (glutamine-hydrolyzing) activity, 0006529 asparagine biosynthetic process. Probab=98.10 E-value=1.6e-05 Score=55.35 Aligned_cols=122 Identities=23% Similarity=0.286 Sum_probs=76.5 Q ss_pred CEEEEECCCCCHHHHHHHHHHHCCCH------HHEEEEECCCCCCCCCCHH-----------HHHHHHHHHCCCCCCCCH Q ss_conf 40575215872368999999841495------4467973246666522135-----------699999972135426775 Q gi|254780999|r 293 HKVIIGLSGGIDSALCAAIAVDALGK------ENVQTIMLPYKYTSPQSLE-----------DAAACAKALGCKYDVLPI 355 (562) Q Consensus 293 ~~~viglSGGiDSal~a~la~~alg~------~~v~~~~mp~~~~s~~s~~-----------~a~~la~~lg~~~~~i~I 355 (562) .-|-+=||||||||++|+||..-++. ..|+.+...-...+ ...+ -|+++|+.||++|+++.| T Consensus 312 vpvG~~LSGGlDSS~vaa~a~~~~~~~~~~W~~~~~tFsigf~~~~-~~~~Es~l~P~~D~P~A~~vA~~~G~~h~~~~~ 390 (646) T TIGR01536 312 VPVGVLLSGGLDSSLVAAIARREAPSEGVDWKGPVKTFSIGFEGDS-KDFDESKLRPDDDQPYARKVADELGTEHHEVLI 390 (646) T ss_pred CCEEEEEECHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCC-CCCEECCCCCCCCHHHHHHHHHHHCCCCEEEEE T ss_conf 6068887171679999999999767643311774347886517787-653002578998618999999984983248984 Q ss_pred H--HHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHC-CCEECCCCC-HHHHHHHHCCCCC Q ss_conf 8--999998863132346874310110123304667798887516-954436432-4477541102366 Q gi|254780999|r 356 H--DLVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHS-KAMLLTTSN-KSEISVGYGTLYG 420 (562) Q Consensus 356 ~--~~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~-~~lvl~t~n-ksE~~~Gy~T~~G 420 (562) + +.++.+.+.+-+ ++ ..+.+ ++.|=+=+-+++-++.+. |--|+=+|. -.|+-=||--+.+ T Consensus 391 s~~e~~~~l~~v~~~-~e--~y~p~--~~~~~~P~Yl~s~~ar~~Tg~kV~LSGeGaDElFgGY~~f~~ 454 (646) T TIGR01536 391 SEEEVLKALPEVIYH-LE--TYDPT--AIRASIPLYLLSKLAREDTGVKVVLSGEGADELFGGYEYFRE 454 (646) T ss_pred CHHHHHHHHHHHHHH-HC--CCCCC--CCCHHHHHHHHHHHHHHCCCCEEEECCCCHHHHHCCHHHHCC T ss_conf 589999998887654-24--88543--031123589999999962883899748663555304244214 No 83 >TIGR03573 WbuX N-acetyl sugar amidotransferase. This enzyme has been implicated in the formation of the acetamido moiety (sugar-NC(=NH)CH3) which is found on some exopolysaccharides and is positively charged at neutral pH. The reaction involves ligation of ammonia with a sugar N-acetyl group, displacing water. In E. coli (O145 strain) and Pseudomonas aeruginosa (O12 strain) this gene is known as wbuX and ifnA respectively and likely acts on sialic acid. In Campylobacter jejuni, the gene is known as pseA and acts on pseudaminic acid in the process of flagellin glycosylation. In other Pseudomonas strains and various organisms it is unclear what the identity of the sugar substrate is, and in fact, the phylogenetic tree of this family sports a considerably deep branching suggestive of possible major differences in substrate structure. Nevertheless, the family is characterized by a conserved tetracysteine motif (CxxC.....[GN]xCxxC) possibly indicative of a metal binding site, as well as an Probab=98.09 E-value=1e-05 Score=56.64 Aligned_cols=65 Identities=25% Similarity=0.345 Sum_probs=57.5 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHH Q ss_conf 057521587236899999984149544679732466665221356999999721354267758999 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLV 359 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~ 359 (562) .+|||+|||-||+-+|.+.+.-+| -|+++|+.-..+.++....+-..+++.||+.+..+.++.-+ T Consensus 61 DCiv~vSGGkDS~y~~~~l~~~~g-l~pL~vt~d~~~~t~~g~~Ni~~l~~~lgvD~i~~~~n~~~ 125 (343) T TIGR03573 61 DCIIGVSGGKDSTYQAHVLKKKLG-LNPLLVTVDPGWNTELGVKNLNNLIKKLGFDLHTITINPET 125 (343) T ss_pred CEEEECCCCHHHHHHHHHHHHHHC-CCEEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEECCCHHH T ss_conf 689868877289999999999829-92599983598779899999999998369985874699999 No 84 >COG0367 AsnB Asparagine synthase (glutamine-hydrolyzing) [Amino acid transport and metabolism] Probab=98.09 E-value=2.9e-05 Score=53.65 Aligned_cols=158 Identities=25% Similarity=0.208 Sum_probs=90.8 Q ss_pred HHCCHHHHHHHHHHHCC--CCEEEEECCCCCHHHHHHHHHHHCCCHHHE--EEEECCCCCCCCCCHHHHHHHHHHHCCCC Q ss_conf 42202035888764224--540575215872368999999841495446--79732466665221356999999721354 Q gi|254780999|r 275 DYNACVLSLRDYVQKNN--FHKVIIGLSGGIDSALCAAIAVDALGKENV--QTIMLPYKYTSPQSLEDAAACAKALGCKY 350 (562) Q Consensus 275 ~~~Alvlgl~d~~~~~g--~~~~viglSGGiDSal~a~la~~alg~~~v--~~~~mp~~~~s~~s~~~a~~la~~lg~~~ 350 (562) ..+-+...|++-|.+.- -..|-+=+|||+|||++|+++....+..-. ..+-++. +......-|+..|+.||..| T Consensus 211 ~~~~l~~~l~~sV~~r~~advpvg~~lSGGlDSS~Iaa~a~~~~~~~~~~~fsvg~~~--~~~~D~~~a~~~A~~lg~~h 288 (542) T COG0367 211 LAEHLRSLLEDAVKRRLVADVPVGVFLSGGLDSSLIAAIAAEELGKEGKTTFTVGFED--SDSPDAKYARAVAKFLGTPH 288 (542) T ss_pred HHHHHHHHHHHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHHHCCCCCEEEEEECCC--CCCCHHHHHHHHHHHHCCCC T ss_conf 8899999999999866215887899967762299999999986234552158985489--99707999999999859985 Q ss_pred CCCCHH--HHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCC-CEECCCCCHHHHHHHHC-CCCCCCCCCC Q ss_conf 267758--9999988631323468743101101233046677988875169-54436432447754110-2366655540 Q gi|254780999|r 351 DVLPIH--DLVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSK-AMLLTTSNKSEISVGYG-TLYGDMSGGF 426 (562) Q Consensus 351 ~~i~I~--~~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~-~lvl~t~nksE~~~Gy~-T~~GD~~g~~ 426 (562) +++.+. +....+...+.. .+. ++. +.+-+=+-+++..+.+.| -+||+=---.|+--||. .. .... T Consensus 289 ~~~~~~~~e~~~~~~~vv~~-~~~--p~~----~~~~~ply~~~~~a~~~g~kVvLSGeGADElFgGY~~~~----~~~~ 357 (542) T COG0367 289 HEIILTNEELLNALPEVVKA-LDT--PGG----MAASIPLYLLSRKARAEGEKVVLSGEGADELFGGYPPYS----RFAP 357 (542) T ss_pred EEEEECHHHHHHHHHHHHHH-HCC--CCC----CHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHCCCCHHH----HHCC T ss_conf 79961789999999999987-078--862----004579999998765338789823874888754983554----2045 Q ss_pred CCCCCCCHHHHHHHHHHHH Q ss_conf 1017763365899999963 Q gi|254780999|r 427 NPLKDLYKTQVFQLASWRN 445 (562) Q Consensus 427 ~p~~~l~Kt~v~~la~~~n 445 (562) .+..-+-++....+.++.. T Consensus 358 ~~~~~~~~~~~~~~~~~~~ 376 (542) T COG0367 358 GPEELLNEALRRALALIDY 376 (542) T ss_pred CCHHHHHHHHHHHHHHHHH T ss_conf 6355668999876555654 No 85 >cd01993 Alpha_ANH_like_II This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins is predicted to bind ATP. This domainhas a strongly conserved motif SGGKD at the N terminus. Probab=98.00 E-value=2.4e-05 Score=54.18 Aligned_cols=145 Identities=18% Similarity=0.266 Sum_probs=81.0 Q ss_pred EEEEECCCCCHHHHHHHHHHHC---CC-HHHEEEEECCC--CCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHC Q ss_conf 0575215872368999999841---49-54467973246--666522135699999972135426775899999886313 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDA---LG-KENVQTIMLPY--KYTSPQSLEDAAACAKALGCKYDVLPIHDLVNHFFSLMS 367 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~a---lg-~~~v~~~~mp~--~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~ 367 (562) +++||+|||.||.+.+.+..+- .+ .-++.++..=. +..|+...+..+.+|+.+++++..+.+........... T Consensus 1 kilva~SGG~DS~~Ll~ll~~~~~~~~~~~~l~~~hvnh~~r~~s~~~~~~v~~~~~~~~i~~~~~~~~~~~~~~~~~~- 79 (185) T cd01993 1 RILVALSGGKDSLVLLHVLKKLQRRYPYGFELEALTVDEGIPGYRDESLEVVERLAEELGIELEIVSFKEEYTDDIEVK- 79 (185) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCCCHHHH- T ss_conf 9899965849999999999998886488956999995699888889999999998996299359987640367638999- Q ss_pred CCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCC----HHHHHH-----H--HCCC------CCCCCCC---CC Q ss_conf 2346874310110123304667798887516954436432----447754-----1--1023------6665554---01 Q gi|254780999|r 368 QFLQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSN----KSEISV-----G--YGTL------YGDMSGG---FN 427 (562) Q Consensus 368 ~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~n----ksE~~~-----G--y~T~------~GD~~g~---~~ 427 (562) .........+-+|+|-..|+.++.+.|.-+|-||. ..|..+ | ..++ ..+ .++ += T Consensus 80 -----~~~~~~~c~~c~r~Rr~~l~~~~~~~~~~~i~~gHh~dD~~ET~l~~l~rg~~~~~~~~~~~~~~~-~~~i~iiR 153 (185) T cd01993 80 -----KRGGKSPCSLCGVLRRGLLNKIAKELGADKLATGHNLDDEAETLLMNLLRGGILRLMRPGPILYLD-EGDVTRIR 153 (185) T ss_pred -----HHHHCCHHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHCCCCHHHCCCCCCCCCC-CCCEEEEE T ss_conf -----986406346899999999999999819988975330768999999999848970005676633356-89847995 Q ss_pred CCCCCCHHHHHHHHHHHH Q ss_conf 017763365899999963 Q gi|254780999|r 428 PLKDLYKTQVFQLASWRN 445 (562) Q Consensus 428 p~~~l~Kt~v~~la~~~n 445 (562) |+-+++|.++++.|+..+ T Consensus 154 PLL~~~k~ei~~y~~~~~ 171 (185) T cd01993 154 PLVYVREKEIVLYAELNG 171 (185) T ss_pred CCCCCCHHHHHHHHHHCC T ss_conf 498898999999999879 No 86 >pfam01171 ATP_bind_3 PP-loop family. This family of proteins belongs to the PP-loop superfamily. Probab=97.99 E-value=2.8e-05 Score=53.70 Aligned_cols=137 Identities=18% Similarity=0.249 Sum_probs=77.0 Q ss_pred EEEEECCCCCHHHHHHHHHHHC---CCHHHEEEEEC--CCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 0575215872368999999841---49544679732--466665221356999999721354267758999998863132 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDA---LGKENVQTIML--PYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVNHFFSLMSQ 368 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~a---lg~~~v~~~~m--p~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~~ 368 (562) +++||+|||.||.+.+.+..+. .+ -++.++.. .-+..|.......+..|+.+|+++.+..++.. . T Consensus 1 ki~vavSGG~DS~~Ll~~l~~~~~~~~-~~~~~~hvnh~lr~~s~~~~~~v~~~~~~~~i~~~i~~~~~~---------~ 70 (182) T pfam01171 1 KILVAVSGGPDSMALLYLLKKLKPKFG-IDLTAAHVDHGLREESDREAQFVKELCRQLNIPLEVLRVDVA---------K 70 (182) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHCC-CCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECC---------C T ss_conf 999996785999999999999999759-978999987998654058999999999985997599997347---------6 Q ss_pred CCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCC----CHHHHHH-----H--HCCCCC-------CCCCCCCCCC Q ss_conf 34687431011012330466779888751695443643----2447754-----1--102366-------6555401017 Q gi|254780999|r 369 FLQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTS----NKSEISV-----G--YGTLYG-------DMSGGFNPLK 430 (562) Q Consensus 369 ~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~----nksE~~~-----G--y~T~~G-------D~~g~~~p~~ 430 (562) ......|..--+.|=..|..+|.+.|.-++-|| |..|..+ | ...+.| +..--+=|+- T Consensus 71 -----~~~~~~e~~aR~~Ry~~l~~~a~~~~~~~i~lgHh~DD~~ET~lm~l~rG~~~~gl~gm~~~~~~~~~~iiRPLl 145 (182) T pfam01171 71 -----KSGLNLEEAAREARYDFFEEIAKKNGAEVLLTAHHADDQAETFLMRLLRGSGLAGLAGIAPVRPLAGGRIVRPLL 145 (182) T ss_pred -----CCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEECCCC T ss_conf -----678777578999999999998986176648874342329999999997289811033887513468943870211 Q ss_pred CCCHHHHHHHHHHHH Q ss_conf 763365899999963 Q gi|254780999|r 431 DLYKTQVFQLASWRN 445 (562) Q Consensus 431 ~l~Kt~v~~la~~~n 445 (562) .++|.+++++|+..+ T Consensus 146 ~~~k~ei~~~a~~~~ 160 (182) T pfam01171 146 KVTKSEIEEYLKEHG 160 (182) T ss_pred CCCHHHHHHHHHHCC T ss_conf 488999999999869 No 87 >PRK01269 thiamine biosynthesis protein ThiI; Provisional Probab=97.88 E-value=6.4e-05 Score=51.36 Aligned_cols=185 Identities=16% Similarity=0.223 Sum_probs=103.4 Q ss_pred CEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEEC--CCCCCCCCCHHHHHHHHHHHC----CCCCCCCHHHHHHHHHHHH Q ss_conf 4057521587236899999984149544679732--466665221356999999721----3542677589999988631 Q gi|254780999|r 293 HKVIIGLSGGIDSALCAAIAVDALGKENVQTIML--PYKYTSPQSLEDAAACAKALG----CKYDVLPIHDLVNHFFSLM 366 (562) Q Consensus 293 ~~~viglSGGiDSal~a~la~~alg~~~v~~~~m--p~~~~s~~s~~~a~~la~~lg----~~~~~i~I~~~~~~~~~~~ 366 (562) .+++.=+||||||.|++-+.-+. |. +|+.+.. ++..+...-++-|+.|++..| +++..+|.++++....... T Consensus 178 GkvL~LlSGGiDSPVAa~~mmKR-G~-~v~~l~F~lg~~~~e~~V~~va~~L~~~~~~~~~vr~~~V~f~~v~~eI~~~v 255 (483) T PRK01269 178 EDVLSLISGGFDSGVASYMLMRR-GS-RVHYCFFNLGGAAHEIGVKQVAHYLWNRYGSSHRVRFISVDFEPVVGEILEKV 255 (483) T ss_pred CCEEEEECCCCCCHHHHHHHHHC-CC-EEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHHHHC T ss_conf 86899952788838889999845-98-79999957998688999999999999973888870899976799999998738 Q ss_pred CCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCH-----HHHHHHHCCCC-CCCCCCCCCCCCCCHHHHHHH Q ss_conf 323468743101101233046677988875169544364324-----47754110236-665554010177633658999 Q gi|254780999|r 367 SQFLQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSNK-----SEISVGYGTLY-GDMSGGFNPLKDLYKTQVFQL 440 (562) Q Consensus 367 ~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nk-----sE~~~Gy~T~~-GD~~g~~~p~~~l~Kt~v~~l 440 (562) .+ ....+. .=|.=+-+--.+|.+.|+..|-||.- |.-.-.-.+.. +-..-=+-|+-...|.++-++ T Consensus 256 ~~----~~~~vv----~KR~M~R~A~~iA~~~g~~ALVTGESLGQVASQTL~NL~~i~~~~~~pVlRPLIg~DK~eII~~ 327 (483) T PRK01269 256 DN----GQMGVV----LKRMMLRAASKVAERYGIQALVTGEALGQVSSQTLTNLRLIDNVTDTLILRPLIAMDKEDIIDL 327 (483) T ss_pred CH----HHEEHH----HHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHHHHHHCCCCEECCCCCCCHHHHHHH T ss_conf 76----344399----9999999999999985998898555034567767878899887407740047667988999999 Q ss_pred HHHHHHCCCCCCCCCCCCCCCH--HHCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHH Q ss_conf 9996310332223444466887--8828898845389997700068-8788999999 Q gi|254780999|r 441 ASWRNSHGITSGLGPLTEVIPP--SILEKSPSAELRPHQTDQESLP-PYPILDDIIK 494 (562) Q Consensus 441 a~~~n~~~~~~~~~~~~~~ip~--~i~~~~psaeL~~~Q~ded~l~-~Y~~lD~il~ 494 (562) ||-+++.-.. +-+|| .++.+.|+.--.+....+++-- ++++||..++ T Consensus 328 Ar~IGTye~s-------~~~pEyCgvis~~P~~~a~~~~ie~ee~~fd~~vl~~~v~ 377 (483) T PRK01269 328 ARQIGTEDFA-------KTMPEYCGVISKKPTVKAVKEKIEAEEKKFDFAILDRVVE 377 (483) T ss_pred HHHHCHHHHH-------CCCCCCEEECCCCCCCCCCHHHHHHHHHHCCHHHHHHHHH T ss_conf 9983928776-------0368650512689836558999899987199999999998 No 88 >PRK01565 thiamine biosynthesis protein ThiI; Provisional Probab=97.87 E-value=0.00017 Score=48.56 Aligned_cols=144 Identities=19% Similarity=0.241 Sum_probs=89.7 Q ss_pred CEEEEECCCCCHHHHHHHHHHHCCCHHHEEEE-ECCCCCCCCCCHHHHHHHHHHH-----CCCCCCCCHHHHHHHHHHHH Q ss_conf 40575215872368999999841495446797-3246666522135699999972-----13542677589999988631 Q gi|254780999|r 293 HKVIIGLSGGIDSALCAAIAVDALGKENVQTI-MLPYKYTSPQSLEDAAACAKAL-----GCKYDVLPIHDLVNHFFSLM 366 (562) Q Consensus 293 ~~~viglSGGiDSal~a~la~~alg~~~v~~~-~mp~~~~s~~s~~~a~~la~~l-----g~~~~~i~I~~~~~~~~~~~ 366 (562) .++++=|||||||.|++-+..+. |- .|..+ +....++++.+++-++.|++.| ++++.++|.+++........ T Consensus 177 Gk~l~LlSGGIDSPVAa~~mmkR-G~-~v~~v~f~~~p~t~~~a~~kv~~l~~~L~~y~~~~kl~~v~f~~~~~~i~~~~ 254 (399) T PRK01565 177 GKALLLLSGGIDSPVAGYLAMKR-GV-EIEAVHFHSPPYTSERAKEKVIDLARILAKYGGRIKLHVVPFTEIQEEIKKKV 254 (399) T ss_pred CCEEEEECCCCCHHHHHHHHHHC-CC-EEEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHCC T ss_conf 85799861787539999999855-98-79999986899877899999999999999728885599966399999998608 Q ss_pred CCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCH-----HHHHHH-HCCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 323468743101101233046677988875169544364324-----477541-10236665554010177633658999 Q gi|254780999|r 367 SQFLQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSNK-----SEISVG-YGTLYGDMSGGFNPLKDLYKTQVFQL 440 (562) Q Consensus 367 ~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nk-----sE~~~G-y~T~~GD~~g~~~p~~~l~Kt~v~~l 440 (562) .+ ...-+. .=|.=+-+--.+|.+.|+..|-||.- |.-.-. +++-.+-..--+-|+-+..|.++-++ T Consensus 255 ~~----~~~~vl----~RR~M~riA~~iA~~~~~~alvTGESLGQVASQTl~Nl~~i~~~~~~pVlRPLIg~DK~EII~~ 326 (399) T PRK01565 255 PE----SYLMTL----MRRFMMRIADKIAEKRGALAIVTGESLGQVASQTLESMQAINAVTNLPVLRPLIGMDKEEIIEI 326 (399) T ss_pred CC----CEEEHH----HHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHH T ss_conf 80----038599----9999999999999985997898241355676755677888875217754466557898999999 Q ss_pred HHHHHH Q ss_conf 999631 Q gi|254780999|r 441 ASWRNS 446 (562) Q Consensus 441 a~~~n~ 446 (562) ||-++. T Consensus 327 Ar~IGT 332 (399) T PRK01565 327 AKEIGT 332 (399) T ss_pred HHHCCC T ss_conf 987096 No 89 >cd01996 Alpha_ANH_like_III This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins is predicted to bind ATP. This domain has a strongly conserved motif SGGKD at the N terminus. Probab=97.84 E-value=4e-05 Score=52.71 Aligned_cols=64 Identities=25% Similarity=0.359 Sum_probs=53.2 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHH Q ss_conf 057521587236899999984149544679732466665221356999999721354267758999 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLV 359 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~ 359 (562) .++||+|||-||+.+|.++++-.|- |+++|+....+.++...++.+.+++. |+.+..+.+++-. T Consensus 3 DcIVpvSGGKDS~y~~~~lk~kygl-npL~Vt~~~~~~t~~g~~Nl~nl~~~-g~D~~~~~~~~~~ 66 (154) T cd01996 3 DCIIGVSGGKDSSYALYLLKEKYGL-NPLAVTVDNGFNSEEAVKNIKNLIKK-GLDLDHLVINPEE 66 (154) T ss_pred CEEEECCCCHHHHHHHHHHHHHHCC-CEEEEEECCCCCCHHHHHHHHHHHHC-CCCEEEEECCHHH T ss_conf 7899688881899999999999499-65999967987546899999999974-9981699328889 No 90 >cd01713 PAPS_reductase This domain is found in phosphoadenosine phosphosulphate (PAPS) reductase enzymes or PAPS sulphotransferase. PAPS reductase is part of the adenine nucleotide alpha hydrolases superfamily also including N type ATP PPases and ATP sulphurylases. A highly modified version of the P loop, the fingerprint peptide of mononucleotide-binding proteins, is present in the active site of the protein, which appears to be a positively charged cleft containing a number of conserved arginine and lysine residues. Although PAPS reductase has no ATPase activity, it shows a striking similarity to the structure of the ATP pyrophosphatase (ATP PPase) domain of GMP synthetase, indicating that both enzyme families have evolved from a common ancestral nucleotide-binding fold. The enzyme uses thioredoxin as an electron donor for the reduction of PAPS to phospho-adenosine-phosphate (PAP) . It is also found in NodP nodulation protein P from Rhizobium meliloti which has ATP sulphurylase acti Probab=97.77 E-value=0.00015 Score=48.86 Aligned_cols=149 Identities=19% Similarity=0.135 Sum_probs=82.1 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCCH-HHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCC Q ss_conf 0575215872368999999841495-446797324666652213569999997213542677589999988631323468 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDALGK-ENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVNHFFSLMSQFLQE 372 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~alg~-~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~~~~~~ 372 (562) .++++.|||-||++++.|+.++..+ .++..+..-+.+.-+++.+-.+++++.+|.+...+................... T Consensus 1 ~~~vsfSGGKDS~vll~L~~~~~~~~~~~~vvf~DTg~e~pet~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (173) T cd01713 1 NVVVSFSGGKDSTVLLHLALKALPELKPVPVIFLDTGYEFPETYEFVDRVAERYGLPLVVVRPPDSPAEGLALGLKGFPL 80 (173) T ss_pred CEEEEECCHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHCCCCCC T ss_conf 95999646499999999999966555883799968889898999999999987399079996897289999861357799 Q ss_pred CCCCHHHHCCCCHHH-HHHHHHHHHHCCCEECCCCC-HHHHHHHH------CCCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 743101101233046-67798887516954436432-44775411------02366655540101776336589999996 Q gi|254780999|r 373 EPSGIVAENIQSRIR-GNILMALSNHSKAMLLTTSN-KSEISVGY------GTLYGDMSGGFNPLKDLYKTQVFQLASWR 444 (562) Q Consensus 373 ~~~~~~~eN~qaR~R-~~~L~~~an~~~~lvl~t~n-ksE~~~Gy------~T~~GD~~g~~~p~~~l~Kt~v~~la~~~ 444 (562) ...+... --...+ .++-.++.+ .+.-+.-+|- ++|..--. .....+..--++||.|-...+|+...+.. T Consensus 81 ~~~~~~~--c~~~~K~~P~~~~~~~-~~~~~~~~GiR~~Es~~R~~~~~~~~~~~~~~~~~~~Pi~~Wt~~dVw~yi~~~ 157 (173) T cd01713 81 PSPDRRW--CCRILKVEPLRRALKE-LGVVAWITGIRRDESARRALLPVVWTDDGKGGILKVNPLLDWTYEDVWAYLARH 157 (173) T ss_pred CCCCHHH--HHHHHHHHHHHHHHHH-CCCCEEEEECCCCCCHHHCCCCCCCCCCCCCCEEEEECHHHCCHHHHHHHHHHC T ss_conf 8322688--9888807899999983-298289994121462444437623432688998999322859999999999983 Q ss_pred H Q ss_conf 3 Q gi|254780999|r 445 N 445 (562) Q Consensus 445 n 445 (562) + T Consensus 158 ~ 158 (173) T cd01713 158 G 158 (173) T ss_pred C T ss_conf 9 No 91 >PRK13795 hypothetical protein; Provisional Probab=97.77 E-value=0.00045 Score=45.85 Aligned_cols=155 Identities=19% Similarity=0.187 Sum_probs=86.7 Q ss_pred HHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHH Q ss_conf 58887642245405752158723689999998414954467973246666522135699999972135426775899999 Q gi|254780999|r 282 SLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVNH 361 (562) Q Consensus 282 gl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~ 361 (562) .+|+-..+.+. .+.|..|||=||++++.||.+|++. +.++..-+-.--++|.+.++..++.+|+++.+.+-...+-. T Consensus 236 fir~~~~~~~~-pv~VsfSGGKDS~v~L~La~ka~~~--~~~~f~dTglEfPeT~e~v~~~~~~~gi~~~~~~a~~~fw~ 312 (630) T PRK13795 236 FIREVAEKYNL-PVVVSFSGGKDSLVVLDLAAEALRK--FKAFFNNTGLEFPETVENVKEVAEEYGVELIVADAGDAFWR 312 (630) T ss_pred HHHHHHHHCCC-CEEEECCCCHHHHHHHHHHHHHCCC--EEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEECCCHHHHH T ss_conf 99987875489-7599526868999999999985498--28999507767646999999999986981899656367888 Q ss_pred HHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCC-CEECCCCCH-HHHHHHH-------CCCCCCCCCCCCCCCCC Q ss_conf 88631323468743101101233046677988875169-544364324-4775411-------02366655540101776 Q gi|254780999|r 362 FFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSK-AMLLTTSNK-SEISVGY-------GTLYGDMSGGFNPLKDL 432 (562) Q Consensus 362 ~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~-~lvl~t~nk-sE~~~Gy-------~T~~GD~~g~~~p~~~l 432 (562) ....+++. ..|.--=. ...=-+++.-++-+..+ +.+.=.|++ .|...-. .++.+- .-..+||-+= T Consensus 313 ~~~~~GpP----~rd~RWCc-kv~Kl~pl~~~i~~~~~~~~l~~~G~R~~ES~~R~~~~rv~~n~~i~~-q~~a~PI~~W 386 (630) T PRK13795 313 AIEKFGPP----ARDYRWCC-KVCKLGPITRAIKSNFPQGCLSFVGQRKYESFARAKSPRVWRNPWVPN-QIGAAPIQDW 386 (630) T ss_pred HHHHCCCC----CCCCCCCC-CCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHCCCCCCCCCCCCC-EEEEECHHCC T ss_conf 88874999----65564355-431005799999975899629998552466376604886144778777-2687203317 Q ss_pred CHHHHHHHHHHHH Q ss_conf 3365899999963 Q gi|254780999|r 433 YKTQVFQLASWRN 445 (562) Q Consensus 433 ~Kt~v~~la~~~n 445 (562) +-.+|+-..-+.+ T Consensus 387 s~~~VwLYi~~~~ 399 (630) T PRK13795 387 TALEVWLYIFSRK 399 (630) T ss_pred CHHHHHHHHHHCC T ss_conf 3869999998848 No 92 >cd01995 ExsB ExsB is a transcription regulator related protein. It is a subfamily of a Adenosine nucleotide binding superfamily of proteins. This protein family is represented by a single member in nearly every completed large ( 1000 genes) prokaryotic genome. In Rhizobium meliloti, a species in which the exo genes make succinoglycan, a symbiotically important exopolysaccharide, exsB is located nearby and affects succinoglycan levels, probably through polar effects on exsA expression or the same polycistronic mRNA. In Arthrobacter viscosus, the homologous gene is designated ALU1 and is associated with an aluminum tolerance phenotype. The function is unknown Probab=97.70 E-value=0.0003 Score=46.94 Aligned_cols=52 Identities=33% Similarity=0.436 Sum_probs=38.4 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHC Q ss_conf 057521587236899999984149544679732466665221356999999721 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALG 347 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg 347 (562) ++||=+|||+||++.++.+.+.- ..|++++.-.---.....+.|+.+|+.|| T Consensus 1 kavvllSGGlDSt~~l~~~~~~g--~~v~~l~~dYGQr~~~E~~~a~~i~~~l~ 52 (169) T cd01995 1 KAVVLLSGGLDSTTCLAWAKKEG--YEVHALSFDYGQRHAKEEEAAKLIAEKLG 52 (169) T ss_pred CEEEEECCCHHHHHHHHHHHHCC--CEEEEEEEECCCCHHHHHHHHHHHHHHHC T ss_conf 98999078788999999999849--96999997569854999999999999968 No 93 >cd01992 PP-ATPase N-terminal domain of predicted ATPase of the PP-loop faimly implicated in cell cycle control [Cell division and chromosome partitioning]. This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This domain has a strongly conserved motif SGGXD at the N terminus. Probab=97.69 E-value=0.00019 Score=48.21 Aligned_cols=138 Identities=17% Similarity=0.173 Sum_probs=74.1 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCC--HHHEEEEECCCCC--CCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCC Q ss_conf 057521587236899999984149--5446797324666--652213569999997213542677589999988631323 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDALG--KENVQTIMLPYKY--TSPQSLEDAAACAKALGCKYDVLPIHDLVNHFFSLMSQF 369 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~alg--~~~v~~~~mp~~~--~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~~~ 369 (562) +++||+|||.||.+.+.++.+.-. .-++.++..=... .|....+..+..|+.+|+++.+..+.. T Consensus 1 ki~vavSGG~DS~~Ll~~l~~~~~~~~~~l~a~hvdh~lr~~s~~~~~~v~~~~~~~~i~~~i~~~~~------------ 68 (185) T cd01992 1 KILVAVSGGPDSMALLHLLSELKPRLGLRLVAVHVDHGLRPESDEEAAFVADLCAKLGIPLYILVVAL------------ 68 (185) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEE------------ T ss_conf 99999678499999999999999974994899998189888888999999999998599889999775------------ Q ss_pred CCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCC----HHHHHH-------HHCCCCC-------CCCCCCCCCCC Q ss_conf 46874310110123304667798887516954436432----447754-------1102366-------65554010177 Q gi|254780999|r 370 LQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSN----KSEISV-------GYGTLYG-------DMSGGFNPLKD 431 (562) Q Consensus 370 ~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~n----ksE~~~-------Gy~T~~G-------D~~g~~~p~~~ 431 (562) . .......|+.--+.|-..+..++.+.|.-.+-||. ..|-.+ |...+-| +..--+=|+-+ T Consensus 69 -~-~~~~~~~e~~aR~~Ry~~l~~~~~~~~~~~i~lgHh~dD~~ET~lm~l~rg~~~~gl~gm~~~~~~~~~~iiRPLL~ 146 (185) T cd01992 69 -A-PKPGGNLEAAAREARYDFFAEIAKEHGADVLLTAHHADDQAETVLMRLLRGSGLRGLAGMPARIPFGGGRLIRPLLG 146 (185) T ss_pred -C-CCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEECCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEEHHHH T ss_conf -3-67899999999999999999999873545042036303689999999871899641527784135799528715778 Q ss_pred CCHHHHHHHHHHHH Q ss_conf 63365899999963 Q gi|254780999|r 432 LYKTQVFQLASWRN 445 (562) Q Consensus 432 l~Kt~v~~la~~~n 445 (562) ++|.++++.|+..+ T Consensus 147 ~~k~ei~~~~~~~~ 160 (185) T cd01992 147 ITRAEIEAYLRENG 160 (185) T ss_pred HHHHHHHHHHHHCC T ss_conf 53999999999849 No 94 >PRK08349 hypothetical protein; Validated Probab=97.65 E-value=0.00057 Score=45.17 Aligned_cols=144 Identities=15% Similarity=0.143 Sum_probs=75.8 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHH----HHHCCC---CCCCCHHHHHHHHHHHH Q ss_conf 05752158723689999998414954467973246666522135699999----972135---42677589999988631 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACA----KALGCK---YDVLPIHDLVNHFFSLM 366 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la----~~lg~~---~~~i~I~~~~~~~~~~~ 366 (562) +++.=+||||||.|+|.+..+. |- +|.++.+ .+++.+.+.+..++ +..|-. +..++.......+...+ T Consensus 2 Kvl~LlSGGiDSPVAa~~mmKR-G~-~V~~lhf---~~~~~~~~kv~~~~~~L~~~~~~~~~~~~iv~~~~~~~~i~~~i 76 (198) T PRK08349 2 KVVALLSSGIDSPVAIYLMLSR-GV-EIYPLHF---RQDEKKEHKARELVEILQEIHGGKVKDPVIVDAYEVQGPVFEKL 76 (198) T ss_pred EEEEEECCCCCHHHHHHHHHHC-CC-EEEEEEE---CCCHHHHHHHHHHHHHHHHHCCCCCCEEEEECCHHHHHHHHHHH T ss_conf 4999965884389999999977-99-7999986---38778899999999999997088875289977225328999999 Q ss_pred CCCCCCCCCCHHHHCCCCHHHH-HHHHHHHHHCCCEECCCCCH-----HHHHHH-HCCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 3234687431011012330466-77988875169544364324-----477541-1023666555401017763365899 Q gi|254780999|r 367 SQFLQEEPSGIVAENIQSRIRG-NILMALSNHSKAMLLTTSNK-----SEISVG-YGTLYGDMSGGFNPLKDLYKTQVFQ 439 (562) Q Consensus 367 ~~~~~~~~~~~~~eN~qaR~R~-~~L~~~an~~~~lvl~t~nk-----sE~~~G-y~T~~GD~~g~~~p~~~l~Kt~v~~ 439 (562) ......... .+--|--| -+--.+|.+.|+..+-||.- |.-.-. +.+-.+-..--+-|+-+..|.++-+ T Consensus 77 ~~~~~~~~~-----~vl~rr~M~riA~~iA~~~g~~aivTGEsLGQVASQTl~NL~~i~~~~~~pVlRPLig~DK~EII~ 151 (198) T PRK08349 77 REIGKEKWT-----CLFCKYTMYRVAERYAHEIGAKAIVTGDSLGQVASQTLDNLMVISTATDLPILRPLIGLDKEEIVR 151 (198) T ss_pred HHCCCCCCE-----EHHHHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHH T ss_conf 860776513-----099999999999999998599889845216788899998899998750676647766799899999 Q ss_pred HHHHHHHC Q ss_conf 99996310 Q gi|254780999|r 440 LASWRNSH 447 (562) Q Consensus 440 la~~~n~~ 447 (562) +||-++.. T Consensus 152 ~Ar~IGTy 159 (198) T PRK08349 152 IAKEIGTF 159 (198) T ss_pred HHHHHCCH T ss_conf 99981955 No 95 >PRK13794 hypothetical protein; Provisional Probab=97.57 E-value=0.00051 Score=45.46 Aligned_cols=75 Identities=20% Similarity=0.236 Sum_probs=56.9 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHH Q ss_conf 03588876422454057521587236899999984149544679732466665221356999999721354267758 Q gi|254780999|r 280 VLSLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIH 356 (562) Q Consensus 280 vlgl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~ 356 (562) +-.+|+-+.+.+. .+.|.-|||=||++++.||.+|+|++ +.++..-+-.--++|.+..+..++.+|++.....=. T Consensus 234 v~fir~~~~~~~~-pv~VSfSGGKDS~v~L~La~ka~~~~-~~vvF~DTglEfPeT~e~ve~v~~~~gv~ii~~~~~ 308 (473) T PRK13794 234 IGFMRNTIVKIGK-PPSVAYSGGKDSLATLLLALKAFGNE-FPVLFADTGLEFPETLENVEDVEELYGLEIIRTSSG 308 (473) T ss_pred HHHHHHHHHHCCC-CEEEEECCCHHHHHHHHHHHHHCCCC-EEEEEEECCCCCHHHHHHHHHHHHHHCCCEEEECCC T ss_conf 9999999986699-83997068689999999999860897-169996088876769999999999859878991564 No 96 >PRK00509 argininosuccinate synthase; Provisional Probab=97.40 E-value=0.0089 Score=37.36 Aligned_cols=179 Identities=16% Similarity=0.157 Sum_probs=96.3 Q ss_pred CCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCC-CCCCCCHHHHHH-HHH-HHHC- Q ss_conf 540575215872368999999841495446797324666652213569999997213-542677589999-988-6313- Q gi|254780999|r 292 FHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGC-KYDVLPIHDLVN-HFF-SLMS- 367 (562) Q Consensus 292 ~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~-~~~~i~I~~~~~-~~~-~~~~- 367 (562) .++||+.-|||+|+++++....+-.|. .|++++.-. .+ .+..+..++-|..+|. ++.++|....+- .+. ..+. T Consensus 2 ~kKVvLAySGGLDTSv~l~wL~e~yg~-eVia~~~d~-Gq-~~d~~~i~~kA~~~GA~~~~v~D~r~ef~~~~i~p~I~a 78 (398) T PRK00509 2 VKKVVLAYSGGLDTSVIIKWLKEEYGC-EVVAFTADV-GQ-GEELEPIREKALASGASEIYVEDLREEFVRDYVFPAIRA 78 (398) T ss_pred CCEEEEEECCCCHHHHHHHHHHHHCCC-EEEEEEEEC-CC-HHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHC T ss_conf 854999908984599999999875398-899999979-88-778899999999829978999630999999999999960 Q ss_pred CCCCCCCCCHHHHCCC--CHHH-HHHHHHHHHHCCCEEC-----CCCC-HHHHHHHHCCCCCCCCCCCCCCCCC---CHH Q ss_conf 2346874310110123--3046-6779888751695443-----6432-4477541102366655540101776---336 Q gi|254780999|r 368 QFLQEEPSGIVAENIQ--SRIR-GNILMALSNHSKAMLL-----TTSN-KSEISVGYGTLYGDMSGGFNPLKDL---YKT 435 (562) Q Consensus 368 ~~~~~~~~~~~~eN~q--aR~R-~~~L~~~an~~~~lvl-----~t~n-ksE~~~Gy~T~~GD~~g~~~p~~~l---~Kt 435 (562) +.+ ..-.+-..- ||-- .--+..+|.+.|+-.+ |-|| .--+-+++.++.-+ .--++|+-|. ..+ T Consensus 79 na~----Yeg~YpL~tslaRplia~~~ve~A~~~ga~aiaHG~TGkGNDQvRFe~~~~aL~P~-l~iiaP~Rd~~~~sR~ 153 (398) T PRK00509 79 NAL----YEGKYPLGTALARPLIAKKLVEIARKEGADAVAHGCTGKGNDQVRFELAIAALAPD-LKVIAPWREWDLMSRE 153 (398) T ss_pred CCC----CCCCEEEECCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHCCC-CEEEEEEECCCCCCHH T ss_conf 854----55854552203188999999999997198599854556786089999999985999-7686113213416899 Q ss_pred HHHHHHHHHHHCCCCCCC--------------------CCCCCCCCHHHCCCCCCCCCCCCCCC Q ss_conf 589999996310332223--------------------44446688788288988453899977 Q gi|254780999|r 436 QVFQLASWRNSHGITSGL--------------------GPLTEVIPPSILEKSPSAELRPHQTD 479 (562) Q Consensus 436 ~v~~la~~~n~~~~~~~~--------------------~~~~~~ip~~i~~~~psaeL~~~Q~d 479 (562) +.-+.|+-.+=..+.... .| ....|+++.....+++..|+..+ T Consensus 154 ~~i~ya~~~gIpv~~~~~~~yS~D~NLwg~S~Egg~Ledp-~~~ppe~~~~~t~~p~~ap~~pe 216 (398) T PRK00509 154 ELIAYAEEHGIPIPVTKKSPYSIDANLWHRSIEGGVLEDP-WNEPPEDVYEWTVSPEDAPDEPE 216 (398) T ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCC-CCCCCHHHHHCCCCHHHCCCCCE T ss_conf 9999999869988877788730256613210156754686-53450657535699778899975 No 97 >PRK08576 hypothetical protein; Provisional Probab=97.31 E-value=0.00082 Score=44.13 Aligned_cols=72 Identities=22% Similarity=0.299 Sum_probs=57.5 Q ss_pred HHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHH Q ss_conf 88876422454057521587236899999984149544679732466665221356999999721354267758 Q gi|254780999|r 283 LRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIH 356 (562) Q Consensus 283 l~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~ 356 (562) -.+++++.+-..++|..|||=||+.++.||.+|++ +|.+|..-+-.--++|.+..+.+++.+|++....+.+ T Consensus 226 s~~fL~~~~~~~iiVp~SGGKDStA~LlLA~ea~~--dv~aVfvDTGlEfPeT~eyVe~va~klGv~lv~a~~d 297 (439) T PRK08576 226 SIEFLEKFGEYTIIVPWSGGKDSTAALLLALKAFD--EVTAVYVDTGYEMPLTDEYVEKVAEKLGVDLIRAGVD 297 (439) T ss_pred HHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHC--CEEEEEEECCCCCHHHHHHHHHHHHHHCCEEEEECCC T ss_conf 99999962898889956887799999999998618--8369997079865758999999999859769994788 No 98 >PRK08557 hypothetical protein; Provisional Probab=97.28 E-value=0.0015 Score=42.46 Aligned_cols=73 Identities=22% Similarity=0.334 Sum_probs=54.8 Q ss_pred HHHHHHHHHCCCC--EEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCH Q ss_conf 3588876422454--05752158723689999998414954467973246666522135699999972135426775 Q gi|254780999|r 281 LSLRDYVQKNNFH--KVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPI 355 (562) Q Consensus 281 lgl~d~~~~~g~~--~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I 355 (562) -.++....+..-+ -|.|..|||=||++++.||.+|++. +..+..-+-.--++|.+..+..++.+|++..++.= T Consensus 170 ~fir~~~~k~~~~~~pV~VSfSGGKDS~a~L~La~~a~~~--~~vvF~DTGlEfPeT~eyve~~~~~~~~~l~~~~~ 244 (420) T PRK08557 170 EILQDYVEKYKRKGYAINASFSGGKDSAVSTLLSKEVIPD--LDVVFIDTGLEYPETLNYVKDFAKKYDINLDTVDG 244 (420) T ss_pred HHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHHHHHCCC--CEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEECC T ss_conf 9999999971667974699606878999999999985488--16999628877723899999999980996799654 No 99 >PRK10696 C32 tRNA thiolase; Provisional Probab=97.18 E-value=0.0014 Score=42.65 Aligned_cols=162 Identities=17% Similarity=0.201 Sum_probs=86.5 Q ss_pred HHCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHC---CC-HHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCC Q ss_conf 42202035888764224540575215872368999999841---49-544679732466665221356999999721354 Q gi|254780999|r 275 DYNACVLSLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDA---LG-KENVQTIMLPYKYTSPQSLEDAAACAKALGCKY 350 (562) Q Consensus 275 ~~~Alvlgl~d~~~~~g~~~~viglSGGiDSal~a~la~~a---lg-~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~ 350 (562) +.+...-++.||=-=..-.+|+||||||-||...+.+...- .+ .=.+.++++-.....-. .+--...++.||+++ T Consensus 22 l~r~vgkAI~dy~MIedGDRVlVglSGGKDS~~LL~iL~~Lq~~api~FeLvAv~lD~~~pGf~-~~~L~~yl~~lGvp~ 100 (311) T PRK10696 22 LRRNVGEAIADFNMIEEGDRIMVCLSGGKDSYTMLEILRNLQQSAPINFSLVAVNLDQKQPGFP-EHILPEYLEKLGVEY 100 (311) T ss_pred HHHHHHHHHHHHCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCC-HHHHHHHHHHCCCCC T ss_conf 9999999999858778999999982678889999999999998589985599998378999988-068899999749970 Q ss_pred CCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCHHH-HHHH--HCCCCCCC----- Q ss_conf 267758999998863132346874310110123304667798887516954436432447-7541--10236665----- Q gi|254780999|r 351 DVLPIHDLVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSNKSE-ISVG--YGTLYGDM----- 422 (562) Q Consensus 351 ~~i~I~~~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nksE-~~~G--y~T~~GD~----- 422 (562) ..+.-+ ...-....+ +.+.+...+=||+|=-+||..|.+.|+=.+..|.--+ .+-- ...+||-. T Consensus 101 ~i~~~d-~~~iv~~~~-------~egks~CslCsRlRRg~Ly~~A~e~G~nKIALGHH~DDi~ETfLMNlf~gG~LktM~ 172 (311) T PRK10696 101 KIVEEN-TYGIVKEKI-------PEGKTTCSLCSRLRRGILYRTATELGATKIALGHHRDDILQTLFLNMFYGGKMKGMP 172 (311) T ss_pred EEEEEE-HHHHHHHHC-------CCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHHCCCCCCCC T ss_conf 799840-767998754-------368863789999999999999998699879860650558999999999668763679 Q ss_pred ------CCC---CCCCCCCCHHHHHHHHHHHH Q ss_conf ------554---01017763365899999963 Q gi|254780999|r 423 ------SGG---FNPLKDLYKTQVFQLASWRN 445 (562) Q Consensus 423 ------~g~---~~p~~~l~Kt~v~~la~~~n 445 (562) -|+ +-|+.-+.-.++..+|+..+ T Consensus 173 Pkl~~d~g~~~VIRPL~y~~E~di~~~a~~~~ 204 (311) T PRK10696 173 PKLMSDDGKHIVIRPLAYCREKDIERFADAKA 204 (311) T ss_pred CEEECCCCCEEEEEECCCCCHHHHHHHHHHCC T ss_conf 76674899759983042015999999999759 No 100 >PRK04527 argininosuccinate synthase; Provisional Probab=97.12 E-value=0.017 Score=35.45 Aligned_cols=130 Identities=15% Similarity=0.231 Sum_probs=63.6 Q ss_pred CCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCC-CCCCCCHHHH-HHHHHH-HHCC Q ss_conf 540575215872368999999841495446797324666652213569999997213-5426775899-999886-3132 Q gi|254780999|r 292 FHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGC-KYDVLPIHDL-VNHFFS-LMSQ 368 (562) Q Consensus 292 ~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~-~~~~i~I~~~-~~~~~~-~~~~ 368 (562) .++||+.-|||+|+|+++....+- |-+ |++++.-.-.-.++..+..++-|..+|. ++.++|.... ++.+.. .+. T Consensus 3 ~kkVvLAySGGLDTSv~l~wL~e~-g~~-Vi~~~ad~G~~~~e~~~~i~~kA~~~GA~~~~v~D~~~ef~~~~v~~~i~- 79 (397) T PRK04527 3 TKDIVLAFSGGLDTSFCIPYLQER-GYA-VHTVFADTGGVDAEERDFIEKRAAELGAASHVTVDGGPAIWEGFVKPFVW- 79 (397) T ss_pred CCEEEEEECCCCHHHHHHHHHHHC-CCC-EEEEEEECCCCCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHH- T ss_conf 875999908972899999999875-994-79999978998615157899999981997799972899999999999986- Q ss_pred CCCCCCCCHHHHC----CCCHHHHH---HHHHHHHHCCCEECCC-----CCHH-HHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 3468743101101----23304667---7988875169544364-----3244-775411023666555401017763 Q gi|254780999|r 369 FLQEEPSGIVAEN----IQSRIRGN---ILMALSNHSKAMLLTT-----SNKS-EISVGYGTLYGDMSGGFNPLKDLY 433 (562) Q Consensus 369 ~~~~~~~~~~~eN----~qaR~R~~---~L~~~an~~~~lvl~t-----~nks-E~~~Gy~T~~GD~~g~~~p~~~l~ 433 (562) .+..||| .-| -|-. .+..+|.+.|.-.++- ||-- -+-+++.++ +| ..-++|+-+.. T Consensus 80 ------ana~Yeg~YpL~ts-aRplIak~~ve~A~~~ga~~iaHG~TGkGNDQvRFe~~~~al-~~-~~viAP~R~~~ 148 (397) T PRK04527 80 ------AGEGYQGQYPLLVS-DRYLIVDAALKRAEELGTRIIAHGCTGMGNDQVRFDLAVKAL-GD-YQIVAPIREIQ 148 (397) T ss_pred ------HCHHHCCCCCCCCC-HHHHHHHHHHHHHHHCCCEEEECCCCCCCCCHHHHHHHHHHC-CC-CCCCCCHHHHH T ss_conf ------06454185556321-079999999999997298384047666787025677688863-67-65436556531 No 101 >pfam01507 PAPS_reduct Phosphoadenosine phosphosulfate reductase family. This domain is found in phosphoadenosine phosphosulfate (PAPS) reductase enzymes or PAPS sulfotransferase. PAPS reductase is part of the adenine nucleotide alpha hydrolases superfamily also including N type ATP PPases and ATP sulphurylases. The enzyme uses thioredoxin as an electron donor for the reduction of PAPS to phospho-adenosine-phosphate (PAP). It is also found in NodP nodulation protein P from Rizobium which has ATP sulfurylase activity (sulfate adenylate transferase). Probab=97.06 E-value=0.0064 Score=38.29 Aligned_cols=143 Identities=19% Similarity=0.184 Sum_probs=77.8 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCC Q ss_conf 05752158723689999998414954467973246666522135699999972135426775899999886313234687 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVNHFFSLMSQFLQEE 373 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~~~~~~~ 373 (562) +++|+.|||-||++++.|+.++..+ +..+.+-+.+.-++|.+-.+.+++.+|++..+..-+.........-+. . T Consensus 1 ~v~vsfSGGKDS~vlL~L~~~~~~~--~~vvf~Dtg~efpet~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~ 74 (174) T pfam01507 1 ELVVSFSGGKDSLVLLHLASKAFPP--GPVIFIDTGYEFPETYEFVDELEEKYGLNLKVYRPEDSFAEGINPEGI----P 74 (174) T ss_pred CEEEEECCHHHHHHHHHHHHHHCCC--CCEEEEECCCCCHHHHHHHHHHHHHHCCEEEEEECCHHHHHHHHHCCC----C T ss_conf 9999957609999999999984899--757999789986899999999999849818998086779877631279----5 Q ss_pred CCCHHHHCCCCHHH--HHHHHHHHHHCCCEECCCCCHHH-HHHH----HCCCCCCCCC--CCCCCCCCCHHHHHHHHHHH Q ss_conf 43101101233046--67798887516954436432447-7541----1023666555--40101776336589999996 Q gi|254780999|r 374 PSGIVAENIQSRIR--GNILMALSNHSKAMLLTTSNKSE-ISVG----YGTLYGDMSG--GFNPLKDLYKTQVFQLASWR 444 (562) Q Consensus 374 ~~~~~~eN~qaR~R--~~~L~~~an~~~~lvl~t~nksE-~~~G----y~T~~GD~~g--~~~p~~~l~Kt~v~~la~~~ 444 (562) ..... .. -.+++ ..+.-++. .++.-++=+|-+.+ ..-. +.-..++..+ -+.||.|-+..+|+...+.. T Consensus 75 ~~~~~-~~-~~~~~K~~p~~~~l~-~~~~~~~i~GiR~~Es~~R~~~~~~~~~~~~~~~~~~~PI~~Wt~~DVw~yi~~~ 151 (174) T pfam01507 75 SKLWE-DC-PCRLRKVEPLKRALK-KLDFDAWFTGLRRDESPSRAKLPIVSIDGDFPKVIKVFPLLNWTETDVWQYILAN 151 (174) T ss_pred HHHCC-CC-CHHHHHHHHHHHHHH-HCCCCEEEEEEECCCHHHHHHCCEEEEECCCCCEEEEECHHHCCHHHHHHHHHHC T ss_conf 01145-45-214665179999998-6599689995100244566407167430578885999500529999999999983 Q ss_pred H Q ss_conf 3 Q gi|254780999|r 445 N 445 (562) Q Consensus 445 n 445 (562) + T Consensus 152 ~ 152 (174) T pfam01507 152 N 152 (174) T ss_pred C T ss_conf 9 No 102 >PRK13820 argininosuccinate synthase; Provisional Probab=97.06 E-value=0.0015 Score=42.33 Aligned_cols=146 Identities=20% Similarity=0.295 Sum_probs=85.2 Q ss_pred CCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHH-HH-HHHHCC Q ss_conf 4540575215872368999999841495446797324666652213569999997213542677589999-98-863132 Q gi|254780999|r 291 NFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVN-HF-FSLMSQ 368 (562) Q Consensus 291 g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~-~~-~~~~~~ 368 (562) ..++||+.-|||+|+++++....+-.|-+.|++++.-. ...++..+.+++-|..+|+++.++|....+- .+ ...+.. T Consensus 2 ~~kKVvLAySGGLDTSv~l~wL~e~yg~~eVi~~~~d~-Gq~~e~~~~~~~kA~~~G~~~~viD~r~ef~~~~i~paI~a 80 (395) T PRK13820 2 MKKKVVLAYSGGLDTSVCIPLLKEKYGYDEVITVAVDV-GQPEEEIKEAEEKAKKLGVKHYTIDAKEEFAKDYIFPAIKA 80 (395) T ss_pred CCCEEEEEECCCCHHHHHHHHHHHHCCCCEEEEEEEEC-CCCHHHHHHHHHHHHHCCCCEEEEECHHHHHHHHHHHHHHC T ss_conf 98729999689848999999999851998699999989-99767769999999970994999816999999999999970 Q ss_pred -CCCCCCCCHHHHCCCCHHHH---HHHHHHHHHCCCEEC-----CCCC---HHHHHHHHCCCCCCCCCCCCCCCCCCHHH Q ss_conf -34687431011012330466---779888751695443-----6432---44775411023666555401017763365 Q gi|254780999|r 369 -FLQEEPSGIVAENIQSRIRG---NILMALSNHSKAMLL-----TTSN---KSEISVGYGTLYGDMSGGFNPLKDLYKTQ 436 (562) Q Consensus 369 -~~~~~~~~~~~eN~qaR~R~---~~L~~~an~~~~lvl-----~t~n---ksE~~~Gy~T~~GD~~g~~~p~~~l~Kt~ 436 (562) ..- .+ |-..-|=-|- .-+..+|.+.|.-.+ |.|| +-|.+. - .-| ..-++|+-+..-+ T Consensus 81 na~Y---eg--YpL~tslaRplia~~~v~~A~~~ga~aiaHG~TGkGNDQvRFe~~~--~--ap~-~~iiaP~R~~~~~- 149 (395) T PRK13820 81 NALY---EG--YPLGTALARPLIAKKIVEVAKKEGASAIAHGCTGKGNDQLRFEAVF--R--ATD-LDVIAPIRELNLT- 149 (395) T ss_pred CCCC---CC--CCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHH--H--CCC-CEEEEEECCHHHH- T ss_conf 8732---67--3032410468999999999997598298307778987099999998--6--789-8687210453345- Q ss_pred HHHHHHHHHHCC Q ss_conf 899999963103 Q gi|254780999|r 437 VFQLASWRNSHG 448 (562) Q Consensus 437 v~~la~~~n~~~ 448 (562) =-++..|..++. T Consensus 150 R~~~i~ya~~~g 161 (395) T PRK13820 150 REWEIEYAKEHG 161 (395) T ss_pred HHHHHHHHHHCC T ss_conf 899999999859 No 103 >COG0301 ThiI Thiamine biosynthesis ATP pyrophosphatase [Coenzyme metabolism] Probab=97.05 E-value=0.024 Score=34.52 Aligned_cols=168 Identities=20% Similarity=0.229 Sum_probs=96.3 Q ss_pred CEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEEC-CCCCCCCCCHHHHHHHH-HHHC-----CCCCCCCHHHHHHHHHHH Q ss_conf 4057521587236899999984149544679732-46666522135699999-9721-----354267758999998863 Q gi|254780999|r 293 HKVIIGLSGGIDSALCAAIAVDALGKENVQTIML-PYKYTSPQSLEDAAACA-KALG-----CKYDVLPIHDLVNHFFSL 365 (562) Q Consensus 293 ~~~viglSGGiDSal~a~la~~alg~~~v~~~~m-p~~~~s~~s~~~a~~la-~~lg-----~~~~~i~I~~~~~~~~~~ 365 (562) .+++.=|||||||.|++-++-+. |- .|+.+.. -..++++.....+..|+ +.+. +.+..+|-.++.+.+... T Consensus 176 Gk~l~LlSGGIDSPVA~~l~mkR-G~-~v~~v~f~~~p~~~~~a~~k~~~l~~~~~~~~~~~~~~~~v~f~~v~~~i~~~ 253 (383) T COG0301 176 GKVLLLLSGGIDSPVAAWLMMKR-GV-EVIPVHFGNPPYTSEKAREKVVALALLRLTSYGGKVRLYVVPFTEVQEEILEK 253 (383) T ss_pred CCEEEEEECCCCHHHHHHHHHHC-CC-EEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHH T ss_conf 74899970787749999999856-98-77999972799851889999999986430246872289997629999999863 Q ss_pred HCCCCCCCCCCHHHHCCCCHHHHH-HHHHHHHHCCCEECCCCCHH-----HHHHH-HCCCCCCCCCCCCCCCCCCHHHHH Q ss_conf 132346874310110123304667-79888751695443643244-----77541-102366655540101776336589 Q gi|254780999|r 366 MSQFLQEEPSGIVAENIQSRIRGN-ILMALSNHSKAMLLTTSNKS-----EISVG-YGTLYGDMSGGFNPLKDLYKTQVF 438 (562) Q Consensus 366 ~~~~~~~~~~~~~~eN~qaR~R~~-~L~~~an~~~~lvl~t~nks-----E~~~G-y~T~~GD~~g~~~p~~~l~Kt~v~ 438 (562) ..+ ... ++--|-.|. +--.+|...|+..+-||.-- ..+-. +..-.+-..-=+-|+-.+.|.++- T Consensus 254 ~~~----~y~-----~v~~rR~M~riA~~iae~~g~~aIvtGEsLGQVASQTl~nL~~i~~~t~~pIlRPLI~~DK~eIi 324 (383) T COG0301 254 VPE----SYR-----CVLLKRMMYRIAEKLAEEFGAKAIVTGESLGQVASQTLENLRVIDSVTNTPVLRPLIGLDKEEII 324 (383) T ss_pred CCC----CCE-----EHHHHHHHHHHHHHHHHHHCCEEEEECCCCHHHHHHHHHHHHHHHHCCCCCEECCCCCCCHHHHH T ss_conf 675----302-----04789999999999999839839983686104667678889999815587423144579989999 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCCCCCC Q ss_conf 99999631033222344446688788288988453899 Q gi|254780999|r 439 QLASWRNSHGITSGLGPLTEVIPPSILEKSPSAELRPH 476 (562) Q Consensus 439 ~la~~~n~~~~~~~~~~~~~~ip~~i~~~~psaeL~~~ 476 (562) ++||-++..-. +..+. =.--.+-.||.+.-+|. T Consensus 325 ~~Ar~IgT~ei--Si~p~---e~cc~~f~p~~p~t~~~ 357 (383) T COG0301 325 EIARRIGTYEI--SIEPP---EDCCVIFAPPTPKTKPK 357 (383) T ss_pred HHHHHHCCHHH--HCCCC---CCCCCCCCCCCCCCCCC T ss_conf 99998297443--13578---88857637998745752 No 104 >PRK10660 tilS tRNA(Ile)-lysidine synthetase; Provisional Probab=96.98 E-value=0.0023 Score=41.25 Aligned_cols=62 Identities=15% Similarity=0.200 Sum_probs=32.7 Q ss_pred CEEEEECCCCCHHHHHHHHHH---HCCCHHHEEEEECCCCCCCCCCHHHH---HHHHHHHCCCCCCCCH Q ss_conf 405752158723689999998---41495446797324666652213569---9999972135426775 Q gi|254780999|r 293 HKVIIGLSGGIDSALCAAIAV---DALGKENVQTIMLPYKYTSPQSLEDA---AACAKALGCKYDVLPI 355 (562) Q Consensus 293 ~~~viglSGGiDSal~a~la~---~alg~~~v~~~~mp~~~~s~~s~~~a---~~la~~lg~~~~~i~I 355 (562) ++++||+|||.||.+.+.+.. +.+..-+++++-.=+.-. +++.+++ +.+|+.+|+.+....+ T Consensus 14 ~~ilvavSGG~DS~~LL~~L~~l~~~~~~~~l~~~HvnHgl~-~~ad~~~~fv~~~c~~~~ip~~~~~~ 81 (433) T PRK10660 14 RQILVAFSGGLDSTVLLHQLVQWRTQNPGVTLRAIHIHHGLS-PNADSWVKHCEQVCQQWQVPLVVERV 81 (433) T ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCC-CCHHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 989999728099999999999999866898289999718979-26699999999999975997899998 No 105 >cd01986 Alpha_ANH_like Adenine nucleotide alpha hydrolases superfamily including N type ATP PPases and ATP sulphurylases. The domain forms a apha/beta/apha fold which binds to Adenosine group.. Probab=96.81 E-value=0.0073 Score=37.91 Aligned_cols=47 Identities=32% Similarity=0.383 Sum_probs=33.8 Q ss_pred EEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHH Q ss_conf 575215872368999999841495446797324666652213569999997 Q gi|254780999|r 295 VIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKA 345 (562) Q Consensus 295 ~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~ 345 (562) ++|.+|||+||++++.++.++.. ++.+++....... ..++++..|+. T Consensus 1 ~~v~~sgG~ds~~~~~~l~~~~~--~~~~~~~~~~~~~--~~~~~~~~a~~ 47 (103) T cd01986 1 VLVAFSGGKDSSVAAALLKKLGY--QVIAVTVDHGISP--RLEDAKEIAKE 47 (103) T ss_pred CEEEECCCCHHHHHHHHHHHCCC--CEEEEEEECCCCC--HHHHHHHHHHH T ss_conf 98974288221999999998698--6289998537662--78899999999 No 106 >PRK05370 argininosuccinate synthase; Validated Probab=96.76 E-value=0.018 Score=35.31 Aligned_cols=19 Identities=11% Similarity=0.212 Sum_probs=8.4 Q ss_pred CEEEEECCCC-CHHHHHHHH Q ss_conf 4057521587-236899999 Q gi|254780999|r 293 HKVIIGLSGG-IDSALCAAI 311 (562) Q Consensus 293 ~~~viglSGG-iDSal~a~l 311 (562) +|.-|.|-|- +.|.+-+.. T Consensus 243 ~G~PValnG~~~~spveli~ 262 (447) T PRK05370 243 QGQPVALNGKTFSDPVELML 262 (447) T ss_pred CEEEEEECCEECCCHHHHHH T ss_conf 30778989986789999999 No 107 >TIGR02432 lysidine_TilS_N tRNA(Ile)-lysidine synthetase; InterPro: IPR012795 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . The only examples in which the wobble position of a tRNA must discriminate between G and A of mRNA are AUA (Ile) vs. AUG (Met) and UGA (stop) vs. UGG (Trp). In all bacteria, the wobble position of the tRNA(Ile) recognizing AUA is lysidine, a lysine derivative of cytidine. This domain is found, apparently, in all bacteria in a single copy. Eukaryotic sequences appear to be organellar. The domain architecture of this protein is variable; some, including characterised proteins of Escherichia coli and Bacillus subtilis known to be tRNA(Ile)-lysidine synthetase, include a conserved 50-residue domain that many other members lack. This protein belongs to the ATP-binding PP-loop family. It appears in the literature and protein databases as TilS, YacA, and putative cell cycle protein MesJ (a misnomer).; GO: 0000166 nucleotide binding, 0005524 ATP binding, 0016879 ligase activity forming carbon-nitrogen bonds, 0008033 tRNA processing, 0005737 cytoplasm. Probab=96.51 E-value=0.0074 Score=37.89 Aligned_cols=138 Identities=15% Similarity=0.182 Sum_probs=78.0 Q ss_pred EEEEECCCCCHHHHHHHHHHHC---CCHH------HEEEEECCC--CCCCCC-CHHHHHHHHHHHCCCCCCCC-HHHHHH Q ss_conf 0575215872368999999841---4954------467973246--666522-13569999997213542677-589999 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDA---LGKE------NVQTIMLPY--KYTSPQ-SLEDAAACAKALGCKYDVLP-IHDLVN 360 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~a---lg~~------~v~~~~mp~--~~~s~~-s~~~a~~la~~lg~~~~~i~-I~~~~~ 360 (562) +++|++|||.||.+.+.+..+- ++.+ ++.++..=. +.+|.. ..+..+..|+.+++.+.+.. ++-.-. T Consensus 1 ~~lvAvSGG~DS~aLL~~L~~~~~~~~~~~~~~~~~~~a~h~nH~lR~~s~~~~~~~v~~~c~~~~~~~~~~~~~~~~~~ 80 (204) T TIGR02432 1 KILVAVSGGVDSMALLHLLLKLQPKLKISWQIKDFKLIAAHVNHGLRPESAQEEAEFVQQFCEKLNIPLEIKKLVDVKAL 80 (204) T ss_pred CEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCC T ss_conf 97888628642799999999976632787788750689999617788415899999999999961895699842124110 Q ss_pred HHHHHHCCCCCCCCCCHHHHCCCC---HHHHHHHHHHHHHCC-CEECCCCC----HHHHH----------HH-----HCC Q ss_conf 988631323468743101101233---046677988875169-54436432----44775----------41-----102 Q gi|254780999|r 361 HFFSLMSQFLQEEPSGIVAENIQS---RIRGNILMALSNHSK-AMLLTTSN----KSEIS----------VG-----YGT 417 (562) Q Consensus 361 ~~~~~~~~~~~~~~~~~~~eN~qa---R~R~~~L~~~an~~~-~lvl~t~n----ksE~~----------~G-----y~T 417 (562) + ... . .|+++ +.|=..+..++.++| .=+|.|+. ..|-. .| +.. T Consensus 81 ~---------~~~--~---~~~E~~AR~~RY~~f~~~~~~~~~~~~i~tAHh~dDq~ET~L~rL~RG~~~~Gl~g~~~~~ 146 (204) T TIGR02432 81 A---------KGK--K---KNLEEAAREARYAFFEEIAKKHGKADYILTAHHADDQAETILMRLLRGSGLRGLSGMPEIR 146 (204) T ss_pred C---------CCC--C---CCHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHCCCCCHHHHHCCCC T ss_conf 1---------224--7---6778999999999999999972994089972483227999999875168700011101336 Q ss_pred CCCC-----CCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 3666-----555401017763365899999963 Q gi|254780999|r 418 LYGD-----MSGGFNPLKDLYKTQVFQLASWRN 445 (562) Q Consensus 418 ~~GD-----~~g~~~p~~~l~Kt~v~~la~~~n 445 (562) ..|. ..==+=|+=.++|.++...|+-.+ T Consensus 147 ~~~~~g~~~~~~~~RPLL~~~k~ei~~y~~~~~ 179 (204) T TIGR02432 147 PLGSLGWYKGGQIIRPLLNISKSEIEEYLKENG 179 (204) T ss_pred CCCCCCCCCCEEEEECCCCCCHHHHHHHHHHCC T ss_conf 633244225632770775677899999999668 No 108 >COG1365 Predicted ATPase (PP-loop superfamily) [General function prediction only] Probab=96.49 E-value=0.013 Score=36.19 Aligned_cols=135 Identities=20% Similarity=0.251 Sum_probs=81.8 Q ss_pred CEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCC-CCCCHHHHHHHHHHHCCCCCCCC--HHHHHHHHHHHHCCC Q ss_conf 405752158723689999998414954467973246666-52213569999997213542677--589999988631323 Q gi|254780999|r 293 HKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYT-SPQSLEDAAACAKALGCKYDVLP--IHDLVNHFFSLMSQF 369 (562) Q Consensus 293 ~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~-s~~s~~~a~~la~~lg~~~~~i~--I~~~~~~~~~~~~~~ 369 (562) .+++|..|||.|||-++.++..+- ...-|+.+- .++-..+++.+++-||+...-+. .+.+.+..+. . T Consensus 61 ~kiaVA~SGG~DSsas~iilR~~g------~~v~p~t~~Lp~~ir~n~~~l~~~lg~~p~yveedl~~i~kGaln---G- 130 (255) T COG1365 61 PKIAVAYSGGVDSSASAIILRWAG------FTVDPGTAILPDHIRRNKEELETLLGEVPEYVEEDLEDIEKGALN---G- 130 (255) T ss_pred CEEEEEECCCCCHHHHHHHHHHHC------EEECCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCC---C- T ss_conf 438998427864278899988621------041463213877886789999899725777899899999863205---8- Q ss_pred CCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCHHHHHHHHCCCCC-CCCCCCC--CCCCCCHHHHHHHHHHHH Q ss_conf 468743101101233046677988875169544364324477541102366-6555401--017763365899999963 Q gi|254780999|r 370 LQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSNKSEISVGYGTLYG-DMSGGFN--PLKDLYKTQVFQLASWRN 445 (562) Q Consensus 370 ~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nksE~~~Gy~T~~G-D~~g~~~--p~~~l~Kt~v~~la~~~n 445 (562) ...+.+.... .=+...|+.+...+.=++.+|+.- ++||...|- |..=-++ ..=.++|-+++.++.|-. T Consensus 131 -RfhpCGRCh~-----~I~~~V~~k~re~di~~vafGDlL--s~G~~svy~eD~i~rlnlPAflAltK~Elr~il~~~~ 201 (255) T COG1365 131 -RFHPCGRCHS-----MIENAVMDKARELDIDVVAFGDLL--STGYGSVYREDGIFRLNLPAFLALTKDELRSILKWNG 201 (255) T ss_pred -CCCCCCHHHH-----HHHHHHHHHHHHCCCEEEEECCCC--CCCCCCEECCCCEEEECCHHHHHHCCHHHHHHHHHCC T ss_conf -7787523789-----999999999985387089974632--3466321205887997258888517199999987138 No 109 >TIGR00342 TIGR00342 thiamine biosynthesis/tRNA modification protein ThiI; InterPro: IPR003720 Thiamine pyrophosphate (TPP) is synthesized de novo in many bacteria and is a required cofactor for many enzymes in the cell. ThiI is required for thiazole synthesis in the thiamine biosynthesis pathway . Almost all the proteins in this group have an N-terminal THUMP domain (see IPR004114 from INTERPRO).; GO: 0009228 thiamin biosynthetic process. Probab=96.11 E-value=0.044 Score=32.80 Aligned_cols=140 Identities=21% Similarity=0.241 Sum_probs=84.8 Q ss_pred CEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECC-CCCCCCCCHHHHHHHHHHH-------CCCCCCCCHHHHHHHHHH Q ss_conf 40575215872368999999841495446797324-6666522135699999972-------135426775899999886 Q gi|254780999|r 293 HKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLP-YKYTSPQSLEDAAACAKAL-------GCKYDVLPIHDLVNHFFS 364 (562) Q Consensus 293 ~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp-~~~~s~~s~~~a~~la~~l-------g~~~~~i~I~~~~~~~~~ 364 (562) .+|+.=|||||||-|+|.++-+. | =+|.+|.|- ..++++.+.+--..||+.+ .+++..+|.++.++...+ T Consensus 187 Gkvl~LlSGGiDSPVAaf~~m~R-G-c~V~~vhf~~~~~~~~~~~~kv~~la~~~~~~~~~~~~~l~~~~F~~~~~~~~~ 264 (391) T TIGR00342 187 GKVLALLSGGIDSPVAAFLAMKR-G-CRVVAVHFFNEPAASEKAREKVERLANLLSLNETGGSVKLYVVDFTDVQEEIIE 264 (391) T ss_pred HHHHHHHCCCCCHHHHHHHHHHC-C-CEEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEECHHHHHHHHHH T ss_conf 36320411883516799999663-8-779999732885524689999999999885400037999998543899999984 Q ss_pred HHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCHHHHHHH----------HCCCCCCCCCCCC-CCCCCC Q ss_conf 31323468743101101233046677988875169544364324477541----------1023666555401-017763 Q gi|254780999|r 365 LMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSNKSEISVG----------YGTLYGDMSGGFN-PLKDLY 433 (562) Q Consensus 365 ~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nksE~~~G----------y~T~~GD~~g~~~-p~~~l~ 433 (562) ...+- ..-+.. =|.=+.+--.+|...|+..|=||+ +|| +.+-.+=...=+. |+=++. T Consensus 265 ~~~e~----~~~v~~----rR~M~~~A~~~ae~~g~~A~VTGe----~LGQVASQTL~Nl~vI~~~~~~~iL~RPLIg~D 332 (391) T TIGR00342 265 IIPEK----YTMVLC----RRLMLKIASKVAEKEGCLAIVTGE----SLGQVASQTLENLRVIQAVVNTPILRRPLIGMD 332 (391) T ss_pred CCCCC----CEEEEH----HHHHHHHHHHHHHHCCCCEEEECC----CCCCHHHHHHHHHHHHHHCCCCCEEECCCCCCC T ss_conf 18998----788740----667999999888860994899766----342037778889999972378742407755479 Q ss_pred HHHHHHHHHHHHH Q ss_conf 3658999999631 Q gi|254780999|r 434 KTQVFQLASWRNS 446 (562) Q Consensus 434 Kt~v~~la~~~n~ 446 (562) |+++-+|||.+++ T Consensus 333 K~~Ii~~Ak~IgT 345 (391) T TIGR00342 333 KEEIIELAKEIGT 345 (391) T ss_pred HHHHHHHHHHCCC T ss_conf 7899999741396 No 110 >TIGR00032 argG argininosuccinate synthase; InterPro: IPR001518 Argininosuccinate synthase (6.3.4.5 from EC) (AS) is a urea cycle enzyme that catalyzes the penultimate step in arginine biosynthesis: the ATP-dependent ligation of citrulline to aspartate to form argininosuccinate, AMP and pyrophosphate , . In humans, a defect in the AS gene causes citrullinemia, a genetic disease characterised by severe vomiting spells and mental retardation. AS is a homotetrameric enzyme of chains of about 400 amino-acid residues. An arginine seems to be important for the enzyme's catalytic mechanism. The sequences of AS from various prokaryotes, archaebacteria and eukaryotes show significant similarity.; GO: 0004055 argininosuccinate synthase activity, 0005524 ATP binding, 0006526 arginine biosynthetic process. Probab=95.94 E-value=0.07 Score=31.51 Aligned_cols=111 Identities=19% Similarity=0.298 Sum_probs=76.7 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCC-CCCHHHHHHHHHHHCC-CCCCCCHHH-HHHH-HHHHHC-C Q ss_conf 057521587236899999984149544679732466665-2213569999997213-542677589-9999-886313-2 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTS-PQSLEDAAACAKALGC-KYDVLPIHD-LVNH-FFSLMS-Q 368 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s-~~s~~~a~~la~~lg~-~~~~i~I~~-~~~~-~~~~~~-~ 368 (562) +|||.-|||+|.|+++.+.-+--|-++|++++.= -... ++..+.+++-|..+|. ++.+||=.+ .++. .+..|+ + T Consensus 1 KVvLAySGGLDTSv~l~wL~~kyG~~~Via~~~d-vGQPDE~d~~~~~~kA~~~GA~~~~~iDak~eFv~dy~f~aiqan 79 (420) T TIGR00032 1 KVVLAYSGGLDTSVCLKWLEEKYGYEEVIAVTAD-VGQPDEEDIDAIEEKALKYGAEKHYTIDAKEEFVKDYLFAAIQAN 79 (420) T ss_pred CEEEEEECCHHHHHHHHHHHHHHCCCCEEEEEEE-CCCCCHHHHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHCC T ss_conf 9688860702599999998865289860799975-679998888889999887367102577236889999988876506 Q ss_pred CCCCCCCCHHHHCCCCHHHHH---HHHHHHHHCCCEECCC Q ss_conf 346874310110123304667---7988875169544364 Q gi|254780999|r 369 FLQEEPSGIVAENIQSRIRGN---ILMALSNHSKAMLLTT 405 (562) Q Consensus 369 ~~~~~~~~~~~eN~qaR~R~~---~L~~~an~~~~lvl~t 405 (562) .+-...-+++|-|--|=-|-+ .|-.+|-+.|.-.++- T Consensus 80 A~Ye~~GG~~Y~L~TaLaRPlIA~~lVe~Ak~~Ga~AvaH 119 (420) T TIGR00032 80 AVYEGTGGLVYPLSTALARPLIAKKLVEVAKKEGAEAVAH 119 (420) T ss_pred CEECCCCCEECCCCCCCHHHHHHHHHHHHHHHCCCEEEEC T ss_conf 6031787410347661145799999999984669549870 No 111 >cd01999 Argininosuccinate_Synthase Argininosuccinate synthase. The Argininosuccinate synthase is a urea cycle enzyme that catalyzes the penultimate step in arginine biosynthesis: the ATP-dependent ligation of citrulline to aspartate to form argininosuccinate, AMP and pyrophosphate . In humans, a defect in the AS gene causes citrullinemia, a genetic disease characterized by severe vomiting spells and mental retardation. AS is a homotetrameric enzyme of chains of about 400 amino-acid residues. An arginine seems to be important for the enzyme's catalytic mechanism. The sequences of AS from various prokaryotes, archaebacteria and eukaryotes show significant similarity Probab=95.78 E-value=0.16 Score=29.16 Aligned_cols=132 Identities=17% Similarity=0.202 Sum_probs=67.1 Q ss_pred EEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCC-CCCCCCHHHHHHH--HHHHHCCCCC Q ss_conf 575215872368999999841495446797324666652213569999997213-5426775899999--8863132346 Q gi|254780999|r 295 VIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGC-KYDVLPIHDLVNH--FFSLMSQFLQ 371 (562) Q Consensus 295 ~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~-~~~~i~I~~~~~~--~~~~~~~~~~ 371 (562) ||+.-|||+|+++++-...+--+. .|++++.-. ...++..+.+++-|..+|. ++.++|....+-. ....+.. T Consensus 1 VvLAySGGLDTSv~l~wL~e~~~~-eVia~~~d~-Gq~~e~~~~i~~kA~~~GA~~~~v~D~r~ef~~~~i~pai~a--- 75 (385) T cd01999 1 VVLAYSGGLDTSVILKWLKEKGGY-EVIAVTADV-GQPEEEIEAIEEKALKLGAKKHVVVDLREEFVEDYIFPAIQA--- 75 (385) T ss_pred CEEEECCCCHHHHHHHHHHHHCCC-EEEEEEEEC-CCCHHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHHHHHHH--- T ss_conf 979977881799999999974598-499999989-996677789999999819978999612999999999999960--- Q ss_pred CCCCCHHHHCCC--CHHH-HHHHHHHHHHCCCEECCC-----CC-HHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 874310110123--3046-677988875169544364-----32-4477541102366655540101776 Q gi|254780999|r 372 EEPSGIVAENIQ--SRIR-GNILMALSNHSKAMLLTT-----SN-KSEISVGYGTLYGDMSGGFNPLKDL 432 (562) Q Consensus 372 ~~~~~~~~eN~q--aR~R-~~~L~~~an~~~~lvl~t-----~n-ksE~~~Gy~T~~GD~~g~~~p~~~l 432 (562) +-...-.|-..- ||-- ...+..+|.+.|+-.++- || .--+-+++.+++.|. .-++|+-+. T Consensus 76 na~Yeg~Y~L~tslaRplIak~~ve~A~~~ga~aiaHG~TGkGNDQvRFe~~~~aL~P~l-~iiAP~Rd~ 144 (385) T cd01999 76 NALYEGTYPLGTALARPLIAKALVEVAKEEGADAVAHGCTGKGNDQVRFELAFYALNPDL-KIIAPWRDW 144 (385) T ss_pred CCCCCCCEECCCCHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCHHHHHHHHHHHCCCC-EEEEEEECC T ss_conf 864337532135015789999999999980984997456668884068999999859997-585147314 No 112 >pfam00764 Arginosuc_synth Arginosuccinate synthase. This family contains a PP-loop motif. Probab=95.47 E-value=0.2 Score=28.45 Aligned_cols=57 Identities=23% Similarity=0.349 Sum_probs=20.9 Q ss_pred EECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCC-CCCCCCHH Q ss_conf 5215872368999999841495446797324666652213569999997213-54267758 Q gi|254780999|r 297 IGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGC-KYDVLPIH 356 (562) Q Consensus 297 iglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~-~~~~i~I~ 356 (562) |.-|||+|+++++.-..+- |. .|++++.-. ....+..+.+++-|..+|. ++.++|.. T Consensus 2 LAySGGLDTSv~l~wL~e~-~~-eVia~~~d~-Gq~~ed~~~i~~kA~~~GA~~~~v~D~r 59 (389) T pfam00764 2 LAYSGGLDTSVCIPWLKEK-YY-EVIAVAVDV-GQGEEDLDEAREKALKLGAVKHYVIDAK 59 (389) T ss_pred EEECCCCHHHHHHHHHHHC-CC-EEEEEEEEC-CCCHHHHHHHHHHHHHHCCCEEEEECCH T ss_conf 6036762899999999970-99-289999979-9977878999999998299789997379 No 113 >cd01994 Alpha_ANH_like_IV This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins is predicted to bind ATP. This domainhas a strongly conserved motif SGGKD at the N terminus. Probab=95.05 E-value=0.17 Score=28.90 Aligned_cols=62 Identities=29% Similarity=0.268 Sum_probs=42.3 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECC-CCCCCC--C--CHHHHHHHHHHHCCCCCCCCHH Q ss_conf 0575215872368999999841495446797324-666652--2--1356999999721354267758 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDALGKENVQTIMLP-YKYTSP--Q--SLEDAAACAKALGCKYDVLPIH 356 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~alg~~~v~~~~mp-~~~~s~--~--s~~~a~~la~~lg~~~~~i~I~ 356 (562) ++++-.|||=||++++..|.+. |-+-+..++|. ....+- + ..+-.+..|+.+|+.+..++|. T Consensus 1 Kv~~l~SGGKDS~lAl~~a~~~-g~~v~~L~~~~~~~~~~~~~H~~~~~l~~~qAealgiPl~~~~~~ 67 (194) T cd01994 1 KVVALISGGKDSCYALYRALEE-GHEVVALLNLTPEEGSSMMYHTVNHELLELQAEAMGIPLIRIEIS 67 (194) T ss_pred CEEEEECCCHHHHHHHHHHHHC-CCEEEEEEEEECCCCCEEECCCCCHHHHHHHHHHCCCCEEEEECC T ss_conf 9899977859999999999986-992599999963998805250557899999999859966999668 No 114 >COG0175 CysH 3'-phosphoadenosine 5'-phosphosulfate sulfotransferase (PAPS reductase)/FAD synthetase and related enzymes [Amino acid transport and metabolism / Coenzyme metabolism] Probab=94.80 E-value=0.053 Score=32.26 Aligned_cols=145 Identities=23% Similarity=0.188 Sum_probs=81.2 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCC Q ss_conf 05752158723689999998414954467973246666522135699999972135426775899999886313234687 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVNHFFSLMSQFLQEE 373 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~~~~~~~ 373 (562) .+++.-|||.||+|++.|+.+++.. +..+..=+.+-=+++++-...+++.+|+...+..-+..+..- ......+..+ T Consensus 41 ~~~~~~S~Gkds~V~l~L~~k~~~~--~~vif~DTg~~f~Et~~~~d~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~ 117 (261) T COG0175 41 PVVVSFSGGKDSTVLLHLAAKAFPD--FPVIFLDTGYHFPETYEFRDRLAEEYGLDLKVYRPDDEVAEG-EKYGGKLWEP 117 (261) T ss_pred CEEEEECCCHHHHHHHHHHHHHCCC--CCEEEEECCCCCHHHHHHHHHHHHHCCCEEEEECCCHHHHHH-HHCCCCCCCC T ss_conf 7599812746799999999860258--718997077757889999999998729717981673234434-4213667999 Q ss_pred CCCHHHHCCCCHHHHH--HHHHHHHHCCCEECCCCCHHHHHHH--HCCCCCCCC----CCCCCCCCCCHHHHHHHHHHHH Q ss_conf 4310110123304667--7988875169544364324477541--102366655----5401017763365899999963 Q gi|254780999|r 374 PSGIVAENIQSRIRGN--ILMALSNHSKAMLLTTSNKSEISVG--YGTLYGDMS----GGFNPLKDLYKTQVFQLASWRN 445 (562) Q Consensus 374 ~~~~~~eN~qaR~R~~--~L~~~an~~~~lvl~t~nksE~~~G--y~T~~GD~~----g~~~p~~~l~Kt~v~~la~~~n 445 (562) ..+. -=-.+|-+ +--++........++=--++|...- .--...|.. --++||.|-+-.+|+......| T Consensus 118 ~~~r----~c~~i~K~~pl~~al~~~~~~a~~~G~Rrdes~~Rak~~~~~~~~~~~~~~rv~Pl~~Wt~~dVw~Yi~~~~ 193 (261) T COG0175 118 SVER----WCCDIRKVEPLKRALDEYGFDAWFTGLRRDESPTRAKLPVVSFDSEFGESIRVNPLADWTELDVWLYILANN 193 (261) T ss_pred CCCH----HHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCEECCCCCCCCCEEECCHHCCCHHHHHHHHHHHC T ss_conf 7402----332568551399999635862499861204563202486453467876836871532099999999999958 No 115 >COG0137 ArgG Argininosuccinate synthase [Amino acid transport and metabolism] Probab=94.59 E-value=0.11 Score=30.14 Aligned_cols=40 Identities=13% Similarity=0.177 Sum_probs=17.8 Q ss_pred HHHHHHCCCEEEECCCCC-CCCCCCCCCCCEEECCCCEEEE Q ss_conf 998543370464213568-8766210486435558985999 Q gi|254780999|r 192 TGQISHVHLPIIYVNQVG-GQDELIFDGASFCFDGQQQLAF 231 (562) Q Consensus 192 ~~~a~e~~~~vvy~N~vG-g~d~lvf~G~S~I~d~~G~ii~ 231 (562) ...|.+.++..+.--+.| |+|..-|+-.-...+|+=++++ T Consensus 105 Ve~A~k~Ga~avaHGcTGKGNDQvRFE~~~~al~pdlkiiA 145 (403) T COG0137 105 VEAAKKEGADAVAHGCTGKGNDQVRFELAILALNPDLKIIA 145 (403) T ss_pred HHHHHHCCCCEEEECCCCCCCCEEEEEEEHHHHCCCCEEEE T ss_conf 99999719969974678888753543200454189967985 No 116 >pfam01902 ATP_bind_4 ATP-binding region. This family of proteins probably binds ATP. This domain is about 200 amino acids long with a strongly conserved motif SGGKD at the N terminus.In some members of this family, this domain is associated with pfam01042. Probab=93.43 E-value=0.31 Score=27.23 Aligned_cols=61 Identities=28% Similarity=0.121 Sum_probs=38.4 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCCHHHE-EEEECCCCCCCC--C--CHHHHHHHHHHHCCCCCCCCHH Q ss_conf 0575215872368999999841495446-797324666652--2--1356999999721354267758 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDALGKENV-QTIMLPYKYTSP--Q--SLEDAAACAKALGCKYDVLPIH 356 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~alg~~~v-~~~~mp~~~~s~--~--s~~~a~~la~~lg~~~~~i~I~ 356 (562) ++++-.|||=||++++..|.+. -+-+ +..+.|...-|- + ..+-.+..|+.+|+++..+.+. T Consensus 2 K~~~l~SGGKDS~~Al~~a~~~--~~v~~L~t~~~~~~ds~~~H~~~~~l~~~qA~algiPl~~~~~~ 67 (219) T pfam01902 2 KVAALYSGGKDSNYALYWALKE--IEVPYLVSMKSENKESYMFHEPNLHLTKLLAEALGIPIIKLYTK 67 (219) T ss_pred CEEEEECCCHHHHHHHHHHHHC--CCCEEEEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECC T ss_conf 3999972869999999999871--98379999963799810261578899999999759968999679 No 117 >PRK08384 thiamine biosynthesis protein ThiI; Provisional Probab=93.26 E-value=0.46 Score=26.12 Aligned_cols=122 Identities=16% Similarity=0.077 Sum_probs=66.0 Q ss_pred CEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCC Q ss_conf 40575215872368999999841495446797324666652213569999997213542677589999988631323468 Q gi|254780999|r 293 HKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVNHFFSLMSQFLQE 372 (562) Q Consensus 293 ~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~~~~~~ 372 (562) .++++=+|||| |.|++-+..+. |- .|+.+. +++++.+.+-++.+++.|.- +. .+. T Consensus 180 Gk~l~LlSGGi-SPVAa~~mmKR-G~-~v~~vh---f~~~~~~~~kv~~l~~~L~~-y~------------------~~~ 234 (310) T PRK08384 180 GKMVGLLSDEL-SAVAIFLMMKR-GV-EVIPVH---IGMGEKNLEKVRKLWNQLKK-YS------------------YGS 234 (310) T ss_pred CCEEEEECCCC-CHHHHHHHHHC-CC-EEEEEE---ECCCHHHHHHHHHHHHHHHH-HC------------------CCC T ss_conf 84899953886-39999999856-98-799998---56887899999999999998-67------------------998 Q ss_pred CCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCH-----HHHHHH-HCCCCCCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 743101101233046677988875169544364324-----477541-1023666555401017763365899999963 Q gi|254780999|r 373 EPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSNK-----SEISVG-YGTLYGDMSGGFNPLKDLYKTQVFQLASWRN 445 (562) Q Consensus 373 ~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nk-----sE~~~G-y~T~~GD~~g~~~p~~~l~Kt~v~~la~~~n 445 (562) ...-+..++. .....+|.+.|+.-|-||.- |.-.-. +++-.+-..-=+-|+-...|.++-++||-++ T Consensus 235 ~~~~~~~~~~------~~a~~ia~~~~~~alvTGEsLGQVASQTl~nl~~i~~~~~~PVlRPLIg~DK~EII~~Ar~IG 307 (310) T PRK08384 235 KGRLVVVKNF------ERVNKIIRDFGAKGVVKGLRPEQLASETLENIYEDSRMFDVPVYYPLIALPDEYIEKVKEKIG 307 (310) T ss_pred CEEEEEECHH------HHHHHHHHHHCCCEEEECCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHCC T ss_conf 4699996659------999999998599899977412554545798899997304887303856899899999998638 No 118 >KOG0573 consensus Probab=90.80 E-value=0.31 Score=27.27 Aligned_cols=29 Identities=28% Similarity=0.389 Sum_probs=24.5 Q ss_pred CCEEEEECCCCCHHHHHHHHHHHCCCHHH Q ss_conf 54057521587236899999984149544 Q gi|254780999|r 292 FHKVIIGLSGGIDSALCAAIAVDALGKEN 320 (562) Q Consensus 292 ~~~~viglSGGiDSal~a~la~~alg~~~ 320 (562) -..|.|=.|||+||+++|+++.+-++..- T Consensus 250 ~s~VcVlfSGGvDs~vvA~l~h~~vp~ne 278 (520) T KOG0573 250 ESNVCVLFSGGVDSTVVAVLAHYVVPENE 278 (520) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHCCCCC T ss_conf 57679996488408999999986069888 No 119 >cd01984 AANH_like Adenine nucleotide alpha hydrolases superfamily including N type ATP PPases, ATP sulphurylases Universal Stress Response protein and electron transfer flavoprotein (ETF). The domain forms a apha/beta/apha fold which binds to Adenosine nucleotide. Probab=90.44 E-value=0.18 Score=28.74 Aligned_cols=33 Identities=27% Similarity=0.513 Sum_probs=23.2 Q ss_pred EEEECCCCCHHHHHHHHHHHCC-CHHHEEEEECC Q ss_conf 5752158723689999998414-95446797324 Q gi|254780999|r 295 VIIGLSGGIDSALCAAIAVDAL-GKENVQTIMLP 327 (562) Q Consensus 295 ~viglSGGiDSal~a~la~~al-g~~~v~~~~mp 327 (562) +++++|||+||++++.++.+.- +...++.+.+. T Consensus 1 ilv~~Sgg~dS~~~~~la~~~~~~~~~~~~~~~~ 34 (86) T cd01984 1 ILVALSGGLDSSVLLHLAKRLKSGGPEVVALVVV 34 (86) T ss_pred CEEEECCCCCHHHHHHHHHHHHCCCCEEEEEEHH T ss_conf 9788617853899999999852599808976389 No 120 >PRK06850 hypothetical protein; Provisional Probab=90.04 E-value=0.22 Score=28.26 Aligned_cols=29 Identities=14% Similarity=0.231 Sum_probs=14.5 Q ss_pred HCCHHHHHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 14889999999999999999887697899 Q gi|254780999|r 57 VFKKSFIQACSSAIDTLKSDTHDGGAGIV 85 (562) Q Consensus 57 ~~~~~f~~~~~~~l~~La~~~~~~~i~ii 85 (562) +..|-....+...++.|...+++.++.|. T Consensus 66 VEnPiV~~~v~~sL~~i~~~A~~~~LPI~ 94 (488) T PRK06850 66 VENPVVVAWVNKSLERIKEAAKKQGLPIT 94 (488) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHCCCCCE T ss_conf 56779999999999999999986599822 No 121 >TIGR03183 DNA_S_dndC putative sulfurtransferase DndC. Members of this protein family are the DndC protein from the dnd (degradation during electrophoresis) operon. The dnd phenotype reflects a sulfur-containing modification to DNA. This operon is sparsely and sporadically distributed among bactera; among the first eight examples are members from the Actinobacteria, Firmicutes, Gammaproteobacteria, Cyanobacteria. DndC is suggested to be a sulfurtransferase. Probab=89.58 E-value=0.25 Score=27.90 Aligned_cols=31 Identities=13% Similarity=0.145 Sum_probs=20.9 Q ss_pred HHCCHHHHHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 4148899999999999999998876978999 Q gi|254780999|r 56 LVFKKSFIQACSSAIDTLKSDTHDGGAGIVV 86 (562) Q Consensus 56 l~~~~~f~~~~~~~l~~La~~~~~~~i~iiv 86 (562) ++..|-....+...++.|...+++.++.|.+ T Consensus 57 lVEnP~V~~~v~~~L~~i~~~a~~~~LPi~~ 87 (447) T TIGR03183 57 LVENPIVAAWVNVSLERMQEAAQKQGLPIEP 87 (447) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHCCCCCEE T ss_conf 7557799999999999999999974998236 No 122 >pfam04273 DUF442 Putative phosphatase (DUF442). Although this domain is uncharacterized it seems likely that it performs a phosphatase function. Probab=88.91 E-value=1.7 Score=22.42 Aligned_cols=94 Identities=23% Similarity=0.175 Sum_probs=57.0 Q ss_pred EECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHH------HHHHHHHHHHCCCC Q ss_conf 521587236899999984149544679732466665221356999999721354267758------99999886313234 Q gi|254780999|r 297 IGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIH------DLVNHFFSLMSQFL 370 (562) Q Consensus 297 iglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~------~~~~~~~~~~~~~~ 370 (562) +.+||=|.-.=...|+ +.|=+-|++..-......+-+..+-+..|+.+|++|..+|+. +.++.|...+... T Consensus 9 ~~vs~Qi~~~di~~la--~~GfktIInnRPd~E~~~qp~~~~~~~~a~~~Gl~y~~iPv~~~~~t~~~v~~f~~~l~~~- 85 (110) T pfam04273 9 LSVSPQIQPDDIAAAA--RAGFRSVINNRPDGEEPGQPSNAAEQAAARAAGLAYRFIPVISGQITEADVEAFQRALAAA- 85 (110) T ss_pred EEECCCCCHHHHHHHH--HCCCCEEEECCCCCCCCCCCCHHHHHHHHHHCCCEEEEEECCCCCCCHHHHHHHHHHHHHC- T ss_conf 7575998999999999--8598388533888777899888999999998399799964477898999999999999858- Q ss_pred CCCCCCHHHHCCCCHHHHHHHHHHHHH Q ss_conf 687431011012330466779888751 Q gi|254780999|r 371 QEEPSGIVAENIQSRIRGNILMALSNH 397 (562) Q Consensus 371 ~~~~~~~~~eN~qaR~R~~~L~~~an~ 397 (562) ..+... -=++..|...||++|.. T Consensus 86 ~~Pvl~----~CrSG~Rs~~lwala~a 108 (110) T pfam04273 86 EGPVLA----HCRSGTRALNLYALSQA 108 (110) T ss_pred CCCEEE----ECCCCHHHHHHHHHHHC T ss_conf 998999----88998779999999855 No 123 >COG3969 Predicted phosphoadenosine phosphosulfate sulfotransferase [General function prediction only] Probab=88.50 E-value=0.79 Score=24.62 Aligned_cols=11 Identities=0% Similarity=-0.106 Sum_probs=4.0 Q ss_pred CCHHHHHHHHH Q ss_conf 99999999999 Q gi|254780999|r 510 YNDETVRYVEH 520 (562) Q Consensus 510 ~~~~~v~~~~~ 520 (562) ..++..++... T Consensus 387 ~tk~~~qr~~~ 397 (407) T COG3969 387 PTKPEYQRYKE 397 (407) T ss_pred CCCHHHHHHHH T ss_conf 78178999999 No 124 >TIGR01390 CycNucDiestase 2',3'-cyclic-nucleotide 2'-phosphodiesterase; InterPro: IPR006294 These sequences represent 2',3'-cyclic-nucleotide 2'-phosphodiesterase; it is a bifunctional enzyme localized to the periplasm of Gram-negative bacteria. 2',3'-cyclic-nucleotide 2'-phosphodiesters are intermediates formed during the hydrolysis of RNA by the ribonuclease I, which is also found in the periplasm, and other enzymes of the RNAse T2 family. Bacteria are unable to transport 2',3'-cyclic-nucleotides into the cytoplasm. 2',3'-cyclic-nucleotide 2'-phosphodiesterase contains 2 active sites which catalyze the reactions that convert the 2',3'-cyclic-nucleotide into a 3'-nucleotide, which is then converted into nucleic acid and phosphate. Both final products can be transported into the cytoplasm. Thus, it has been suggested that 2',3'-cyclic-nucleotide 2'-phosphodiesterase has a scavenging function. Experimental evidence indicates that 2',3'-cyclic-nucleotide 2'-phosphodiesterase enables Yersinia enterocolitica O:8 to grow on 23-cAMP as a sole source of carbon and energy . ; GO: 0008663 2'3'-cyclic-nucleotide 2'-phosphodiesterase activity, 0009117 nucleotide metabolic process. Probab=87.39 E-value=2.1 Score=21.80 Aligned_cols=220 Identities=15% Similarity=0.239 Sum_probs=102.0 Q ss_pred HHHHHHHHHHHHHCCCCEEEEC-CCCCCCCCHHHHHCC-HHH----HHHHHH-------------------HHHHHHHHH Q ss_conf 9999999999998599899916-644479984454148-899----999999-------------------999999998 Q gi|254780999|r 23 IAKARRAREEANRQGMDLILFT-ELFISGYPPEDLVFK-KSF----IQACSS-------------------AIDTLKSDT 77 (562) Q Consensus 23 ~~~i~~~i~~A~~~gadLvvfP-El~LtGY~~~Dl~~~-~~f----~~~~~~-------------------~l~~La~~~ 77 (562) +.+.-.++++|.++.-.=|+.= ==-+.|-|.+|..-+ ..+ +....+ -|+.|.+.. T Consensus 31 l~~~A~L~~~AR~E~~NsvL~DNGDLIQG~Pl~DY~ara~g~k~G~~~P~~~al~~l~Y~vg~lGNHEFNYGL~fL~~~~ 110 (634) T TIGR01390 31 LTRAATLIKQARAEVKNSVLVDNGDLIQGSPLADYMARAKGLKAGEVHPVYKALNLLKYDVGNLGNHEFNYGLEFLKEAI 110 (634) T ss_pred HHHHHHHHHHHHHHHHCEEEEECCCCCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHCCCCEEECCCCHHHHHHHHHHHHH T ss_conf 57888999997532104277636752148844656764154346874569998864186323016731331068999999 Q ss_pred HHCCCEEEEEE-EEEEC-CEE---EEEEEEEECCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCEEE-EECCCC Q ss_conf 87697899999-99888-987---8999999699799988777623554333210037788876147647456-201111 Q gi|254780999|r 78 HDGGAGIVVGF-PRQDQ-EGV---LNSVVILDAGNIIAVRDKINLPNYSEFHEKRTFISGYSNDPIVFRDIRL-GILICE 151 (562) Q Consensus 78 ~~~~i~iivG~-p~~~~-~~~---yNsa~vi~~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~~~~~~g~ki-Gv~ICe 151 (562) +.-...+|-.. ....+ ++. +=+-++|++-++...--|.|.-.=|.- -|. .+.+..++--.+ |=.... T Consensus 111 ~~A~~P~v~aN~~~d~~~~~pg~~~FTPY~I~~~~v~D~~G~~~~l~vG~I----G~v---pPQI~~WDK~NL~GKv~~~ 183 (634) T TIGR01390 111 AAAKFPIVNANTVVDAGTGQPGANLFTPYLIKEKSVVDTDGKKHTLKVGYI----GLV---PPQIMVWDKANLDGKVTTA 183 (634) T ss_pred HHCCCCEEECCCEEECCCCCCCCCCCCCEEEECCEEEECCCCEEEEEEEEE----EEE---CCCEEEECCCCCCCCEEEH T ss_conf 727897763252464467887654337645631247506886468888789----732---8701300111267526863 Q ss_pred CCCHH-HHHHHHHHHCCCCEEEECC-----CCCCCCCCHHHHHHHHHHHHHHCCCEEE----ECCCC-CCCCCCCCCCCC Q ss_conf 11101-6899999714777599315-----5533358078887889998543370464----21356-887662104864 Q gi|254780999|r 152 DIWKN-SNICKHLKKQGAEFLFSLN-----ASPYYHNKLKKRHEIVTGQISHVHLPII----YVNQV-GGQDELIFDGAS 220 (562) Q Consensus 152 Dlw~~-~~~~~~la~~Gadlii~ps-----ASp~~~~k~~~R~~~~~~~a~e~~~~vv----y~N~v-Gg~d~lvf~G~S 220 (562) |+-.. ....-+|+.+|||+|+.+. +-||..+ .||.++++ .++.+ -|++-.+|.|.. T Consensus 184 DI~~Ta~K~vPEMk~~GAD~~Va~aHsG~s~d~Y~~g-------------~ENs~~~L~~vpgidav~fGHsH~~FPgk~ 250 (634) T TIGR01390 184 DIVETARKYVPEMKAKGADIVVALAHSGISADPYQEG-------------AENSVYYLTKVPGIDAVLFGHSHAVFPGKD 250 (634) T ss_pred HHHHHHHHCCHHHHHCCCCEEEEECCCCCCCCCCCCC-------------CCCEEEHHHCCCCEEEEEECCCCCCCCCCC T ss_conf 2688887315477736895999800688886644355-------------530001000378310898566133478800 Q ss_pred EEECCC---------CEEEEECCCCCCCEEEEEEEECCC--CCCCCCCCCCCC Q ss_conf 355589---------859996167776379999983352--123655575555 Q gi|254780999|r 221 FCFDGQ---------QQLAFQMKHFSEQNFMTEWHYDQQ--LSQWNYMSDDSA 262 (562) Q Consensus 221 ~I~d~~---------G~ii~~~~~f~e~~~i~d~d~d~~--~~~~~~~~~~~~ 262 (562) |.--|+ |.-..-+-.+..++-++|+.+... ..+|......+. T Consensus 251 f~~~pgADI~~GT~~G~p~vM~G~Wg~~~gVvDL~L~~~r~~g~W~v~~~k~~ 303 (634) T TIGR01390 251 FAKIPGADITKGTINGVPAVMAGYWGNHLGVVDLQLEEDRKSGKWTVTSAKAE 303 (634) T ss_pred CCCCCCCCCCCCEECCCCCCCCCCCCCCEEEEECCCCCCCCCCCEEEECCCCC T ss_conf 11588774435512451011277545740378312011011475378617421 No 125 >TIGR01214 rmlD dTDP-4-dehydrorhamnose reductase; InterPro: IPR005913 dTDP-4-dehydrorhamnose reductase (1.1.1.133 from EC) catalyzes the last of 4 steps in making dTDP-rhamnose, a precursor of LPS molecules such as core antigen and O-antigen.dTDP-6-deoxy-L-mannose + NADP+ = dTDP-4-dehydro-6-deoxy-L-mannose + NADPH ; GO: 0008831 dTDP-4-dehydrorhamnose reductase activity, 0045226 extracellular polysaccharide biosynthetic process. Probab=87.31 E-value=1.2 Score=23.47 Aligned_cols=68 Identities=13% Similarity=0.223 Sum_probs=47.8 Q ss_pred EEEEECCC--CCCCHHHHHHHHHHHCCCCEEEECCCCCCC-CCCHH------HHHH-----HHHHHHHHCCCEEEECCCC Q ss_conf 45620111--111101689999971477759931555333-58078------8878-----8999854337046421356 Q gi|254780999|r 143 IRLGILIC--EDIWKNSNICKHLKKQGAEFLFSLNASPYY-HNKLK------KRHE-----IVTGQISHVHLPIIYVNQV 208 (562) Q Consensus 143 ~kiGv~IC--eDlw~~~~~~~~la~~Gadlii~psASp~~-~~k~~------~R~~-----~~~~~a~e~~~~vvy~N~v 208 (562) ++.| .=| .|+-.|+...+.+.....|+|||-.| |+ ..+.+ .+.+ .+...|.+.+.++||++ T Consensus 35 ~~~~-~~~~~~Dl~dP~~l~~~~r~~~Pd~vvntAA--YT~VD~AE~~~~~AyavNa~A~~~lA~~A~~~Ga~~vh~S-- 109 (317) T TIGR01214 35 LKLA-ARWSQLDLTDPEALEELLRAIRPDAVVNTAA--YTDVDGAESDPEKAYAVNALAPQNLARAAARVGARLVHIS-- 109 (317) T ss_pred CCHH-HHHHHHCCCCHHHHHHHHHHHCCCEEEECCH--HCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEE-- T ss_conf 6113-3654406224688999998528753762301--1010000377778765740789999999986691599986-- Q ss_pred CCCCCCCCCC Q ss_conf 8876621048 Q gi|254780999|r 209 GGQDELIFDG 218 (562) Q Consensus 209 Gg~d~lvf~G 218 (562) .|+|||| T Consensus 110 ---TDYVFDG 116 (317) T TIGR01214 110 ---TDYVFDG 116 (317) T ss_pred ---ECEEECC T ss_conf ---3423447 No 126 >COG2117 Predicted subunit of tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain [Translation, ribosomal structure and biogenesis] Probab=87.11 E-value=1.5 Score=22.87 Aligned_cols=60 Identities=33% Similarity=0.391 Sum_probs=40.1 Q ss_pred EEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHH Q ss_conf 5752158723689999998414954467973246666522135699999972135426775899 Q gi|254780999|r 295 VIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDL 358 (562) Q Consensus 295 ~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~ 358 (562) +-+=-|||-||+|+|.|.. -||-+ |..|+.- |---.+-.-|++.|+.||.+|+++.+..- T Consensus 3 v~vLfSGGKDSSLaA~iL~-klgye-v~LVTvn--FGv~d~~k~A~~tA~~lgF~h~vl~Ldr~ 62 (198) T COG2117 3 VYVLFSGGKDSSLAALILD-KLGYE-VELVTVN--FGVLDSWKYARETAAILGFPHEVLQLDRE 62 (198) T ss_pred EEEEECCCCCHHHHHHHHH-HHCCC-CEEEEEE--ECCCCCHHHHHHHHHHHCCCCCEECCCHH T ss_conf 6898448971058999998-71787-1799987--33466406689999982997000016889 No 127 >PRK12563 sulfate adenylyltransferase subunit 2; Provisional Probab=84.51 E-value=2 Score=22.02 Aligned_cols=146 Identities=15% Similarity=0.122 Sum_probs=78.8 Q ss_pred CCCCEEEEECCCCCHHHHHHHHHHHCCCHHHE--EEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHC Q ss_conf 24540575215872368999999841495446--7973246666522135699999972135426775899999886313 Q gi|254780999|r 290 NNFHKVIIGLSGGIDSALCAAIAVDALGKENV--QTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVNHFFSLMS 367 (562) Q Consensus 290 ~g~~~~viglSGGiDSal~a~la~~alg~~~v--~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~ 367 (562) ..|++.|+=-|||=||+|.+-||.+|+-|.++ -.+-.-+-+.=++.+.--..+++..|+++.+..-++.++ ..+. T Consensus 35 a~f~~pvmL~S~GKDS~VmlhLA~KAF~p~~iPfPllHIDTg~kF~Emi~fRD~~a~~~~l~LiV~~n~~~i~---~g~~ 111 (312) T PRK12563 35 AECSKPVMLYSIGKDSSVMLHLAMKAFRPTRPPFPLLHVDTTWKFREMIDFRDRRAKELGLDLVVHHNPDGIA---RGIV 111 (312) T ss_pred HHHCCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHCCEEEEECCHHHHH---CCCC T ss_conf 9736967986167429999999999737789997358863688759999999999998198689832878886---6999 Q ss_pred CCCCCCCCCH-HHHCCCCHHHHHHHHHHHHHCCC-EECCCCCHHH-HHHH-----------------------HCCCCCC Q ss_conf 2346874310-11012330466779888751695-4436432447-7541-----------------------1023666 Q gi|254780999|r 368 QFLQEEPSGI-VAENIQSRIRGNILMALSNHSKA-MLLTTSNKSE-ISVG-----------------------YGTLYGD 421 (562) Q Consensus 368 ~~~~~~~~~~-~~eN~qaR~R~~~L~~~an~~~~-lvl~t~nksE-~~~G-----------------------y~T~~GD 421 (562) + ..++. +. ...+.+.-|...=.+++. .++|-+-+.| -+-. |..++|. T Consensus 112 p----~~~~~~~~---~~~lKT~~L~~ale~~~fDa~~gGaRRDEEksRAKERiFS~Rd~~~~WDPknQrPElWnlyN~~ 184 (312) T PRK12563 112 P----FRHGSALH---TDVAKTQGLKQALDHHGFDAAIGGARRDEEKSRAKERIFSFRSAFHRWDPKAQRPELWSLYNAR 184 (312) T ss_pred C----CCCCCHHH---HHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHHHHCCC T ss_conf 6----76783777---6689889999999973988785045321342344340306667788988555883587761556 Q ss_pred CCC----CCCCCCCCCHHHHHHHHHHHH Q ss_conf 555----401017763365899999963 Q gi|254780999|r 422 MSG----GFNPLKDLYKTQVFQLASWRN 445 (562) Q Consensus 422 ~~g----~~~p~~~l~Kt~v~~la~~~n 445 (562) ..- -+-||.+-+-.+||+..+--| T Consensus 185 ~~~Geh~RVfPlsNWTElDIW~YI~~E~ 212 (312) T PRK12563 185 LRRGESLRVFPLSNWTELDVWQYIAREK 212 (312) T ss_pred CCCCCEEEECCCCCCCHHHHHHHHHHHC T ss_conf 7999714422355661789999999828 No 128 >PRK05253 sulfate adenylyltransferase subunit 2; Provisional Probab=80.25 E-value=3.5 Score=20.41 Aligned_cols=147 Identities=15% Similarity=0.143 Sum_probs=79.4 Q ss_pred CCCCEEEEECCCCCHHHHHHHHHHHCCCHHHE--EEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHC Q ss_conf 24540575215872368999999841495446--7973246666522135699999972135426775899999886313 Q gi|254780999|r 290 NNFHKVIIGLSGGIDSALCAAIAVDALGKENV--QTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVNHFFSLMS 367 (562) Q Consensus 290 ~g~~~~viglSGGiDSal~a~la~~alg~~~v--~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~ 367 (562) ..|+..|+=-|||=||+|.+.||.+|+=|-++ -.+-.-+.+.=++.++--..+++..|+++.+-.-...++. .+. T Consensus 24 a~f~~pvmL~S~GKDS~VmLhLarKAF~p~~iPfPllHIDTg~kF~Emi~fRD~~a~e~gl~LiV~~n~e~i~~---g~~ 100 (300) T PRK05253 24 AEFENPVMLYSIGKDSSVMLHLARKAFYPGKLPFPLLHVDTGWKFPEMIEFRDRRAKELGLELIVHSNPEGIAR---GIN 100 (300) T ss_pred HHHCCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCEEEEECHHHHHC---CCC T ss_conf 97369679871672599999999998578899963488307998189999999999984997899757677756---999 Q ss_pred CCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCC-EECCCCCHHHH-HHH-----------------------HCCCCCCC Q ss_conf 234687431011012330466779888751695-44364324477-541-----------------------10236665 Q gi|254780999|r 368 QFLQEEPSGIVAENIQSRIRGNILMALSNHSKA-MLLTTSNKSEI-SVG-----------------------YGTLYGDM 422 (562) Q Consensus 368 ~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~-lvl~t~nksE~-~~G-----------------------y~T~~GD~ 422 (562) + ++... .+ -...+.+.-|...-.+++. .++|-.-+.|- +-. |..++|.. T Consensus 101 p-~~~g~--~~---~~~~lKT~~L~~al~~~~fda~~gGaRRDEe~sRAKERiFS~R~~~~~Wdpk~QrPElW~lyn~~~ 174 (300) T PRK05253 101 P-FTHGS--AK---HTNIMKTEGLKQALEKYGFDAAFGGARRDEEKSRAKERIFSFRDEFGQWDPKNQRPELWNLYNGRI 174 (300) T ss_pred C-CCCCC--HH---HHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCC T ss_conf 5-66882--22---315787899999999648765651012344321212542477564688885669802667635567 Q ss_pred CC----CCCCCCCCCHHHHHHHHHHHH Q ss_conf 55----401017763365899999963 Q gi|254780999|r 423 SG----GFNPLKDLYKTQVFQLASWRN 445 (562) Q Consensus 423 ~g----~~~p~~~l~Kt~v~~la~~~n 445 (562) .- -+.||.+-+-.+|++..+--| T Consensus 175 ~~geh~RVfPlsnWTElDIW~YI~~E~ 201 (300) T PRK05253 175 NKGEHIRVFPLSNWTELDIWQYIYREN 201 (300) T ss_pred CCCCEEEEECCCCCHHHHHHHHHHHHC T ss_conf 999647751120225778999999848 No 129 >COG2102 Predicted ATPases of PP-loop superfamily [General function prediction only] Probab=79.73 E-value=4.4 Score=19.74 Aligned_cols=63 Identities=30% Similarity=0.271 Sum_probs=40.5 Q ss_pred EEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCC-C--C--CCHHHHHHHHHHHCCCCCCCCHHH Q ss_conf 05752158723689999998414954467973246666-5--2--213569999997213542677589 Q gi|254780999|r 294 KVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYT-S--P--QSLEDAAACAKALGCKYDVLPIHD 357 (562) Q Consensus 294 ~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~-s--~--~s~~~a~~la~~lg~~~~~i~I~~ 357 (562) +++.=.|||=||..++..|.+. |-+=+..++|..... | - -..+-+..+|+.+|+++....... T Consensus 2 k~~aL~SGGKDS~~Al~~a~~~-G~eV~~Ll~~~p~~~dS~m~H~~n~~~~~~~Ae~~gi~l~~~~~~g 69 (223) T COG2102 2 KVIALYSGGKDSFYALYLALEE-GHEVVYLLTVKPENGDSYMFHTPNLELAELQAEAMGIPLVTFDTSG 69 (223) T ss_pred CEEEEEECCCHHHHHHHHHHHC-CCEEEEEEEEECCCCCEEEEECCCHHHHHHHHHHCCCCEEEEECCC T ss_conf 6899981671889999999975-9836899998159987044222256799999874488359983476 No 130 >KOG2840 consensus Probab=79.19 E-value=2.7 Score=21.08 Aligned_cols=67 Identities=28% Similarity=0.347 Sum_probs=45.0 Q ss_pred CEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEEC-------CCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHH Q ss_conf 4057521587236899999984149544679732-------4666652213569999997213542677589999 Q gi|254780999|r 293 HKVIIGLSGGIDSALCAAIAVDALGKENVQTIML-------PYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVN 360 (562) Q Consensus 293 ~~~viglSGGiDSal~a~la~~alg~~~v~~~~m-------p~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~ 360 (562) +.|-+|-|||-||+|.|-. .++|-.....++.+ ...+--+.+....+......|+...++.-+.++. T Consensus 52 e~v~igasGgkdstvlA~v-~~~Ln~r~~~g~~l~Lls~degi~gyrd~sl~avkrn~~~~~lPL~ivs~~dl~~ 125 (347) T KOG2840 52 ERVAIGASGGKDSTVLAYV-LDALNERHDYGLRLFLLSIDEGIRGYRDDSLEAVKRNGVQYGLPLCIVSYKDLYG 125 (347) T ss_pred CCCCCCCCCCHHHHHHHHH-HHHHHHHCCCCCEEEEEECCCCCCCEECCHHHHHHHHHHHCCCCEEEECHHHHHC T ss_conf 8313255566027799999-9984030178841342101003642004299999986142078638841999725 No 131 >COG1091 RfbD dTDP-4-dehydrorhamnose reductase [Cell envelope biogenesis, outer membrane] Probab=78.45 E-value=4.8 Score=19.49 Aligned_cols=64 Identities=14% Similarity=0.233 Sum_probs=42.7 Q ss_pred CCCCHHHHHHHHHHHCCCCEEEECCCCCCCCCCHH------HHH-----HHHHHHHHHCCCEEEECCCCCCCCCCCCCCC Q ss_conf 11110168999997147775993155533358078------887-----8899985433704642135688766210486 Q gi|254780999|r 151 EDIWKNSNICKHLKKQGAEFLFSLNASPYYHNKLK------KRH-----EIVTGQISHVHLPIIYVNQVGGQDELIFDGA 219 (562) Q Consensus 151 eDlw~~~~~~~~la~~Gadlii~psASp~~~~k~~------~R~-----~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G~ 219 (562) .|++.+..+.+.......|+|+|-.|-+ ...+.+ ... ..+...+.+.+.++|+.+ .++||||. T Consensus 34 ~Ditd~~~v~~~i~~~~PDvVIn~AAyt-~vD~aE~~~e~A~~vNa~~~~~lA~aa~~~ga~lVhiS-----TDyVFDG~ 107 (281) T COG1091 34 LDITDPDAVLEVIRETRPDVVINAAAYT-AVDKAESEPELAFAVNATGAENLARAAAEVGARLVHIS-----TDYVFDGE 107 (281) T ss_pred CCCCCHHHHHHHHHHHCCCEEEECCCCC-CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCEEEEEE-----CCEEECCC T ss_conf 5556858999999861999899873203-65413389899777677999999999997197699963-----44574389 Q ss_pred C Q ss_conf 4 Q gi|254780999|r 220 S 220 (562) Q Consensus 220 S 220 (562) - T Consensus 108 ~ 108 (281) T COG1091 108 K 108 (281) T ss_pred C T ss_conf 8 No 132 >TIGR02858 spore_III_AA stage III sporulation protein AA; InterPro: IPR014217 Proteins in this entry include the stage III sporulation protein AA that is encoded by one of several genes in the spoIIIA locus. This protein is only found in species that are capable of endospore formation.. Probab=70.19 E-value=3.9 Score=20.05 Aligned_cols=55 Identities=13% Similarity=0.212 Sum_probs=25.2 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHCCCHHHHHH---HHHHHHHHHHHH Q ss_conf 89997700068878899999999827999779871899999999---999998745668 Q gi|254780999|r 474 RPHQTDQESLPPYPILDDIIKRIVENEESFINNDQEYNDETVRY---VEHLLYGSEYKR 529 (562) Q Consensus 474 ~~~Q~ded~l~~Y~~lD~il~~~i~~~~~~~~~~~~~~~~~v~~---~~~~~~~~~~KR 529 (562) .|+=-=-|++|.+|=.+.+++++ .-+-++...+++.+-+++++ +.+++...-|-| T Consensus 206 SP~Viv~DEIGr~ED~~Al~eA~-naGV~~I~TaHg~~~~Dl~kRP~fk~l~e~~~Fer 263 (282) T TIGR02858 206 SPDVIVVDEIGREEDVEALLEAL-NAGVSVIATAHGRDLEDLKKRPVFKELLEQKAFER 263 (282) T ss_pred CCCEEEEECCCCHHHHHHHHHHH-CCCCEEEEEECCCCHHHHHCCHHHHHHHHHCCCEE T ss_conf 98579981488953389999986-16756887640488126650766799997243148 No 133 >PRK09453 phosphodiesterase; Provisional Probab=66.39 E-value=6.4 Score=18.68 Aligned_cols=43 Identities=23% Similarity=0.353 Sum_probs=28.1 Q ss_pred EEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHH Q ss_conf 28999963883688999999999999999985998999166444799844 Q gi|254780999|r 5 LKIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPE 54 (562) Q Consensus 5 mkIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~ 54 (562) |||++.. |..+|+....+.++...+.++|+++.-== +.+|.|. T Consensus 1 MkI~vIS------DtHgn~~ale~vl~~~~~~~vD~iih~GD-iv~~Gp~ 43 (183) T PRK09453 1 MKLGFAS------DIHGSLPATEKALELFKQSGADWLVHLGD-VLYHGPR 43 (183) T ss_pred CEEEEEE------ECCCCHHHHHHHHHHHHHCCCCEEEECCC-CCCCCCC T ss_conf 9899998------67899699999999987559989999986-2555777 No 134 >TIGR02124 hypE hydrogenase expression/formation protein HypE; InterPro: IPR011854 This family contains HypE (or HupE), a protein required for expression of catalytically active hydrogenase in many systems. It appears to be an accessory protein involved in maturation rather than a regulatory protein involved in expression. HypE shows considerable homology to the thiamine-monophosphate kinase ThiL (IPR006283 from INTERPRO) and other enzymes.. Probab=64.93 E-value=9.3 Score=17.60 Aligned_cols=93 Identities=14% Similarity=0.180 Sum_probs=55.4 Q ss_pred HHEEEEECCCCCCCCCCHH-HHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHH Q ss_conf 4467973246666522135-699999972135426775899999886313234687431011012330466779888751 Q gi|254780999|r 319 ENVQTIMLPYKYTSPQSLE-DAAACAKALGCKYDVLPIHDLVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNH 397 (562) Q Consensus 319 ~~v~~~~mp~~~~s~~s~~-~a~~la~~lg~~~~~i~I~~~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~ 397 (562) ..|++..=|+|.==...++ -|++.-..+-++...||+.+.|+.....|+- ++ +++||. T Consensus 220 ~~v~~~RD~TRGGLA~vLNE~A~~sg~~i~l~E~~iPV~eeV~gaCE~LGl----dP-----------------l~lANE 278 (345) T TIGR02124 220 PAVHAMRDATRGGLAAVLNEIAESSGVGIELEEEKIPVKEEVKGACELLGL----DP-----------------LYLANE 278 (345) T ss_pred CCCEEEECCCCCCHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHCC----CH-----------------HHHHCC T ss_conf 142376167855047899999996199279985247883789999986170----32-----------------542047 Q ss_pred CCCEECCCCCHHHHHHHHCCC--CCCCCCCCC-CCCCCCHH Q ss_conf 695443643244775411023--666555401-01776336 Q gi|254780999|r 398 SKAMLLTTSNKSEISVGYGTL--YGDMSGGFN-PLKDLYKT 435 (562) Q Consensus 398 ~~~lvl~t~nksE~~~Gy~T~--~GD~~g~~~-p~~~l~Kt 435 (562) =..++.=..+..|-+|.-.=. ||- +=+ .||.+..+ T Consensus 279 G~~v~~V~~E~A~~vLe~lk~hp~G~---~A~YiIG~V~e~ 316 (345) T TIGR02124 279 GKLVLAVPPEAAEKVLEILKSHPLGK---DAAYIIGEVVEK 316 (345) T ss_pred CEEEEEECHHHHHHHHHHHHHCCCCC---CCCEEEEEEEEC T ss_conf 62899828377999999986077643---321563014737 No 135 >TIGR01188 drrA daunorubicin resistance ABC transporter, ATP-binding protein; InterPro: IPR005894 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family contains the daunorubicin resistance ABC transporter, ATP binding subunit in bacteria and archaea. This model is restricted in its scope to preferentially recognize the ATP binding subunit associated with effux of the drug, daunorubicin. In other words it functions as an ATP dependent antiporter. . Probab=64.34 E-value=9.9 Score=17.41 Aligned_cols=35 Identities=20% Similarity=0.344 Sum_probs=23.9 Q ss_pred HHHHHHHHCCCC-EEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCC Q ss_conf 588876422454-0575215872368999999841495446797324666652213569999997213 Q gi|254780999|r 282 SLRDYVQKNNFH-KVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGC 348 (562) Q Consensus 282 gl~d~~~~~g~~-~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~ 348 (562) .+|||+++..-. ++. |.+-+ +-.++|.+||..++| T Consensus 166 ~iWd~i~~lk~~~g~T---------------------------ilLTT-----hYmeEAd~L~Driai 201 (343) T TIGR01188 166 AIWDYIEALKKEEGVT---------------------------ILLTT-----HYMEEADKLCDRIAI 201 (343) T ss_pred HHHHHHHHHHHCCCCE---------------------------EEEEC-----CCHHHHHHHCCEEEE T ss_conf 9999999987407969---------------------------99743-----786999823688788 No 136 >PRK02090 phosphoadenosine phosphosulfate reductase; Provisional Probab=63.15 E-value=10 Score=17.27 Aligned_cols=143 Identities=16% Similarity=0.117 Sum_probs=81.4 Q ss_pred CEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCH-HHHHHHHHHHHCCCCC Q ss_conf 405752158723689999998414954467973246666522135699999972135426775-8999998863132346 Q gi|254780999|r 293 HKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPI-HDLVNHFFSLMSQFLQ 371 (562) Q Consensus 293 ~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I-~~~~~~~~~~~~~~~~ 371 (562) .++++.-|-|.+|+|.+-|+.+.-. .+.-+.+=+-+.-++|++-+..+.+.+|++.+++.= ...+..... .+.... T Consensus 42 ~~~~~tsSFG~es~Vllhli~~~~~--~ipV~flDTG~~f~ETy~~~d~l~~~~~l~i~~~~P~~~~~~~~~~-~g~l~~ 118 (243) T PRK02090 42 GGLALASSFGAESAVLLHLVAQVDP--DIPVIFLDTGYLFPETYRFIDQLTERLLLNLKVYRPDASAAEQEAR-YGGLWE 118 (243) T ss_pred CCEEEEECCCHHHHHHHHHHHHHCC--CCCEEEEECCCCCHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHH-HCCCCC T ss_conf 9769994586789999999994199--9967986579888899999999999979977997788149999998-299766 Q ss_pred CCCCCHHHHCCCCHHHH--HHHHHHHHHCCCEECCCCCHHH-H----HHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 87431011012330466--7798887516954436432447-7----541102366655540101776336589999996 Q gi|254780999|r 372 EEPSGIVAENIQSRIRG--NILMALSNHSKAMLLTTSNKSE-I----SVGYGTLYGDMSGGFNPLKDLYKTQVFQLASWR 444 (562) Q Consensus 372 ~~~~~~~~eN~qaR~R~--~~L~~~an~~~~lvl~t~nksE-~----~~Gy~T~~GD~~g~~~p~~~l~Kt~v~~la~~~ 444 (562) ....|. .-=-++|- ++--+++.. ...+ ||-|.+ . .+...-..+ ..--++|+.+-++.+|+...+-. T Consensus 119 ~~~~d~---~~CC~irKvePL~raL~~~-daWi--tG~Rr~Qs~~Ra~l~~ve~d~-g~~KvnPL~~Ws~~dv~~Yi~~~ 191 (243) T PRK02090 119 QSVEDP---DECCRIRKVEPLNRALAGL-DAWI--TGLRREQSGTRAELPVLEIDR-GRFKINPLADWTNEDVWAYLKEH 191 (243) T ss_pred CCCCCH---HHHHHHHCCHHHHHHHHHC-CEEE--ECCCHHHCHHHCCCCCEEECC-CEEEECCCHHCCHHHHHHHHHHC T ss_conf 763248---9876776502799986017-7587--447144263342587345239-98886700119999999999985 Q ss_pred H Q ss_conf 3 Q gi|254780999|r 445 N 445 (562) Q Consensus 445 n 445 (562) | T Consensus 192 ~ 192 (243) T PRK02090 192 D 192 (243) T ss_pred C T ss_conf 9 No 137 >TIGR01405 polC_Gram_pos DNA polymerase III, alpha subunit, Gram-positive type; InterPro: IPR006308 These are the polypeptide chains of DNA polymerase III. Full-length homologs of this protein are restricted to the Gram-positive lineages, including the Mycoplasmas. This protein is designated alpha chain and given the gene symbol polC, but is not a full-length homolog of other polC genes. The N-terminal region of about 200 amino acids is rich in low-complexity sequence and poorly alignable. ; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication, 0005737 cytoplasm. Probab=59.10 E-value=5.7 Score=19.01 Aligned_cols=175 Identities=17% Similarity=0.233 Sum_probs=99.1 Q ss_pred HHHHHH-HHHCCCCEEEEECCCCCHHHHHHHHH----HHCCCHHHEEEEECCCCCCCCCCHHHHH----HHHHHHC--CC Q ss_conf 358887-64224540575215872368999999----8414954467973246666522135699----9999721--35 Q gi|254780999|r 281 LSLRDY-VQKNNFHKVIIGLSGGIDSALCAAIA----VDALGKENVQTIMLPYKYTSPQSLEDAA----ACAKALG--CK 349 (562) Q Consensus 281 lgl~d~-~~~~g~~~~viglSGGiDSal~a~la----~~alg~~~v~~~~mp~~~~s~~s~~~a~----~la~~lg--~~ 349 (562) ..+.+. |+||--.|++||--|-|-||+||.|. |--|.|.=+..=+--+.+..+.+..+.- +-|..+| .. T Consensus 674 YlIS~~LV~KS~~DGYlVGSRGSVGSSlVAtl~gITEVNPL~PHY~Cp~Cky~Ef~~D~~~~~GfDLp~K~CP~Cgak~p 753 (1264) T TIGR01405 674 YLISQLLVKKSLQDGYLVGSRGSVGSSLVATLTGITEVNPLPPHYLCPNCKYSEFVTDGSVGSGFDLPDKDCPKCGAKAP 753 (1264) T ss_pred HHHHHHHHHHHHCCCCEECCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 89999988874158846637743114578542166115757687508787355300378778877685788888887776 Q ss_pred C----CCCCHHHHHHHHHHHHCCCCCCCCCCHHH---HCCCCHHHHH--HHHHHHHHCCCEECCC-CCHHHHHHHHCCCC Q ss_conf 4----26775899999886313234687431011---0123304667--7988875169544364-32447754110236 Q gi|254780999|r 350 Y----DVLPIHDLVNHFFSLMSQFLQEEPSGIVA---ENIQSRIRGN--ILMALSNHSKAMLLTT-SNKSEISVGYGTLY 419 (562) Q Consensus 350 ~----~~i~I~~~~~~~~~~~~~~~~~~~~~~~~---eN~qaR~R~~--~L~~~an~~~~lvl~t-~nksE~~~Gy~T~~ 419 (562) + +.|| ++.| ||- -+...+|+.+ +--|||.--- -|+--.-.+.+--+|| .-|| |.||-=.| T Consensus 754 l~kDG~~IP----FETF---LGF-~GDKVPDIDLNFSGEYQ~~AH~~vk~lFGe~~~fRAGTIgTvAeKT--AyGYVK~Y 823 (1264) T TIGR01405 754 LKKDGQDIP----FETF---LGF-KGDKVPDIDLNFSGEYQAKAHNYVKELFGEDHTFRAGTIGTVAEKT--AYGYVKKY 823 (1264) T ss_pred CCCCCCCCC----CCCC---CCC-CCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHH--HHHHHHHH T ss_conf 345736998----5001---677-8884686446787224778889999860897223146515355342--21025666 Q ss_pred CC---CCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCC-CCH--HHCCCCCC Q ss_conf 66---55540101776336589999996310332223444466-887--88288988 Q gi|254780999|r 420 GD---MSGGFNPLKDLYKTQVFQLASWRNSHGITSGLGPLTEV-IPP--SILEKSPS 470 (562) Q Consensus 420 GD---~~g~~~p~~~l~Kt~v~~la~~~n~~~~~~~~~~~~~~-ip~--~i~~~~ps 470 (562) -. ..+ ..+.+.+|-.|++=+-..-.+.+-=|.+-+ ||. +|.+-.|- T Consensus 824 ~en~~~~~-----~~~r~aei~rl~~~~tGvKRTTGQHPGGIiivPk~~dvyDFtP~ 875 (1264) T TIGR01405 824 FENIIDQG-----KHYRDAEIERLVQKCTGVKRTTGQHPGGIIIVPKEMDVYDFTPV 875 (1264) T ss_pred HHHHHHCC-----CCCCCHHHHHHHCCCCCCCCCCCCCCCCEEEECCCEEEEECCCC T ss_conf 77754227-----65550467876505632221578688765887687016521777 No 138 >pfam02273 Acyl_transf_2 Acyl transferase. This bacterial family of Acyl transferases (or myristoyl-acp-specific thioesterases) catalyse the first step in the bioluminescent fatty acid reductase system. Probab=58.79 E-value=12 Score=16.78 Aligned_cols=69 Identities=28% Similarity=0.423 Sum_probs=43.1 Q ss_pred HHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHH Q ss_conf 5888764224540575215872368999999841495446797324666652213569999997213542677589999 Q gi|254780999|r 282 SLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVN 360 (562) Q Consensus 282 gl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~ 360 (562) -+-|.++..|...+ |=|-|+|.|-||.+....-|+ .+ -.|.-.-.+.-..|-++||..|...||+.+-+ T Consensus 90 ~VidWLk~rgi~~l-----GlIAaSLSARIAY~v~~ei~l-sf----LiTAVGVvnLR~TLEkal~~Dyl~lpi~~lPe 158 (294) T pfam02273 90 TVIDWLKTRGINNI-----GLIASSLSARIAYEVISEINL-SF----LITAVGVVNLRDTLEKALGFDYLSLPIDELPE 158 (294) T ss_pred HHHHHHHHCCCCHH-----HHHHHHHHHHHHHHHHHHCCH-HE----EEEEEEEEEHHHHHHHHHCCCHHCCCHHHCCC T ss_conf 99999986597333-----445776778999998742132-11----66654555379999998487521189446965 No 139 >TIGR01361 DAHP_synth_Bsub phospho-2-dehydro-3-deoxyheptonate aldolase; InterPro: IPR006268 These sequences are one of at least three types of phospho-2-dehydro-3-deoxyheptonate aldolase (DAHP synthase). This enzyme catalyzes the first of 7 steps in the biosynthesis of chorismate, that last common precursor of all three aromatic amino acids and of PABA, ubiquinone and menaquinone. Some members of this family, including an experimentally characterised member from Bacillus subtilis, are bifunctional, with a chorismate mutase domain N-terminal to this region. The member of this family from Synechocystis PCC 6803, CcmA, was shown to be essential for carboxysome formation. However, no other candidate for this enzyme is present in that species, chorismate biosynthesis does occur, other species having this protein lack carboxysomes but appear to make chorismate, and a requirement of CcmA for carboxysome formation does not prohibit a role in chorismate biosynthesis.; GO: 0016832 aldehyde-lyase activity, 0009073 aromatic amino acid family biosynthetic process. Probab=57.86 E-value=13 Score=16.68 Aligned_cols=69 Identities=17% Similarity=0.312 Sum_probs=40.4 Q ss_pred CCEEEECCEEEEE-------CCC--CCCCHHHHHHHHHHHCCCCEEE----ECCCCCCCCC-CHHHHHHHHHHHHHHCCC Q ss_conf 7614764745620-------111--1111016899999714777599----3155533358-078887889998543370 Q gi|254780999|r 135 NDPIVFRDIRLGI-------LIC--EDIWKNSNICKHLKKQGAEFLF----SLNASPYYHN-KLKKRHEIVTGQISHVHL 200 (562) Q Consensus 135 ~~~~~~~g~kiGv-------~IC--eDlw~~~~~~~~la~~Gadlii----~psASp~~~~-k~~~R~~~~~~~a~e~~~ 200 (562) ..+.++++++||= -=| |+.-.-...++..+..||++|= =|--|||.+. ....-...++..+.+.+. T Consensus 11 ~tv~~v~~v~IG~G~~~~iAGPCsvEs~eq~~~~A~~vk~~Ga~~LRGGAfKPRTSPYsFQGlg~~gl~~l~~A~~~~GL 90 (262) T TIGR01361 11 KTVVDVKGVKIGEGSPIVIAGPCSVESEEQIVETARFVKEAGAKILRGGAFKPRTSPYSFQGLGEEGLKLLRRAADETGL 90 (262) T ss_pred CCEEECCCEEECCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 61577185775384789986487748879999999999866740430663488888841247418999999999986099 Q ss_pred EEE Q ss_conf 464 Q gi|254780999|r 201 PII 203 (562) Q Consensus 201 ~vv 203 (562) ++| T Consensus 91 ~~v 93 (262) T TIGR01361 91 PVV 93 (262) T ss_pred EEE T ss_conf 489 No 140 >PRK13210 putative L-xylulose 5-phosphate 3-epimerase; Reviewed Probab=57.72 E-value=13 Score=16.67 Aligned_cols=12 Identities=17% Similarity=0.182 Sum_probs=5.2 Q ss_pred HHHHHHHCCCEE Q ss_conf 999854337046 Q gi|254780999|r 191 VTGQISHVHLPI 202 (562) Q Consensus 191 ~~~~a~e~~~~v 202 (562) ++..+.++++++ T Consensus 57 i~~~~~~~gl~I 68 (284) T PRK13210 57 LVKAIYETGVRI 68 (284) T ss_pred HHHHHHHCCCEE T ss_conf 999999829835 No 141 >KOG1706 consensus Probab=56.60 E-value=13 Score=16.55 Aligned_cols=19 Identities=16% Similarity=0.183 Sum_probs=11.6 Q ss_pred HHHHHHHHHHHHCCCCEEEECC Q ss_conf 9999999999985998999166 Q gi|254780999|r 24 AKARRAREEANRQGMDLILFTE 45 (562) Q Consensus 24 ~~i~~~i~~A~~~gadLvvfPE 45 (562) .-|+.++ +++|.|+|+|-. T Consensus 19 scil~WL---keqGyeViay~A 37 (412) T KOG1706 19 SCILAWL---KEQGYEVIAYLA 37 (412) T ss_pred HHHHHHH---HHCCCEEEEEEC T ss_conf 3435888---862965898400 No 142 >pfam09987 DUF2226 Uncharacterized protein conserved in archaea (DUF2226). This domain, found in various hypothetical archaeal proteins, has no known function. Probab=56.34 E-value=14 Score=16.52 Aligned_cols=28 Identities=14% Similarity=0.335 Sum_probs=24.5 Q ss_pred EEEECCEEEEEEEEEECCEEEEEEEEEE Q ss_conf 9988898789999996997999887776 Q gi|254780999|r 89 PRQDQEGVLNSVVILDAGNIIAVRDKIN 116 (562) Q Consensus 89 p~~~~~~~yNsa~vi~~G~il~~y~K~~ 116 (562) ..+.++++|.+.+++.+|+|++.|-+.. T Consensus 36 ~~k~g~~L~e~~i~VenGKIVg~y~~~~ 63 (252) T pfam09987 36 TTRDGEDLYDAFIIVENGKVVGAYLGRV 63 (252) T ss_pred EECCCCCCCEEEEEEECCEEEEEEEEEC T ss_conf 9636872012589998896999998522 No 143 >KOG0259 consensus Probab=56.34 E-value=14 Score=16.52 Aligned_cols=70 Identities=13% Similarity=0.161 Sum_probs=33.8 Q ss_pred HHHHHHCC--CCEEEECC---CCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEECCCCEEEEECC Q ss_conf 99997147--77599315---55333580788878899985433704642135688766210486435558985999616 Q gi|254780999|r 160 CKHLKKQG--AEFLFSLN---ASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDELIFDGASFCFDGQQQLAFQMK 234 (562) Q Consensus 160 ~~~la~~G--adlii~ps---ASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~G~ii~~~~ 234 (562) .+.++..+ |=+++||+ ++-|+. ...+-+.+.|.+.+..++.=- -++..+|.+.-|+ |-|+ T Consensus 192 veal~DENT~AivviNP~NPcGnVys~----~HL~kiae~A~klgi~vIaDE---VY~~~vfg~~pfv--pmg~------ 256 (447) T KOG0259 192 VEALADENTVAIVVINPNNPCGNVYSE----DHLKKIAETAKKLGIMVIADE---VYGHTVFGDKPFV--PMGK------ 256 (447) T ss_pred HHHHHCCCEEEEEEECCCCCCCCCCCH----HHHHHHHHHHHHHCCEEEEHH---HCCEEECCCCCCC--CHHH------ T ss_conf 997644670699996799998651519----999999999998497697611---1231331799730--0133------ Q ss_pred CCCCCEEEEEE Q ss_conf 77763799999 Q gi|254780999|r 235 HFSEQNFMTEW 245 (562) Q Consensus 235 ~f~e~~~i~d~ 245 (562) |+..+-+.++ T Consensus 257 -fssiVPVitl 266 (447) T KOG0259 257 -FSSIVPVITL 266 (447) T ss_pred -CCCCCCEEEE T ss_conf -0565736861 No 144 >COG3453 Uncharacterized protein conserved in bacteria [Function unknown] Probab=55.64 E-value=14 Score=16.45 Aligned_cols=108 Identities=24% Similarity=0.253 Sum_probs=62.3 Q ss_pred EEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHH------HHHHHHHHHHCCC Q ss_conf 7521587236899999984149544679732466665221356999999721354267758------9999988631323 Q gi|254780999|r 296 IIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIH------DLVNHFFSLMSQF 369 (562) Q Consensus 296 viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~------~~~~~~~~~~~~~ 369 (562) .+.+||-|.-+=...|+ ++|=+-|++-.--....++-+-..-++-++..|+.+..||.. .-|+.|...+.. T Consensus 9 ~lsVsgQi~~~D~~~ia--a~GFksiI~nRPDgEe~~QP~~~~i~~aa~~aGl~y~~iPV~~~~iT~~dV~~f~~Al~e- 85 (130) T COG3453 9 RLSVSGQISPADIASIA--ALGFKSIICNRPDGEEPGQPGFAAIAAAAEAAGLTYTHIPVTGGGITEADVEAFQRALDE- 85 (130) T ss_pred CEEECCCCCHHHHHHHH--HHCCCEECCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHH- T ss_conf 34236989888999999--704221001699877889997499999999669825876347998799999999999997- Q ss_pred CCCCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCCCHHH-HHHHH Q ss_conf 46874310110123304667798887516954436432447-75411 Q gi|254780999|r 370 LQEEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTSNKSE-ISVGY 415 (562) Q Consensus 370 ~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~nksE-~~~Gy 415 (562) +++++-. --++..|.+.||.+....++|- ++| .++|+ T Consensus 86 aegPVla----yCrsGtRs~~ly~~~~~~~gm~-----~de~~a~g~ 123 (130) T COG3453 86 AEGPVLA----YCRSGTRSLNLYGLGELDGGMS-----RDEIEALGQ 123 (130) T ss_pred CCCCEEE----EECCCCHHHHHHHHHHHHCCCC-----HHHHHHHHH T ss_conf 0898786----5457706889999999866998-----899999888 No 145 >PRK13946 shikimate kinase; Provisional Probab=52.46 E-value=12 Score=16.93 Aligned_cols=82 Identities=17% Similarity=0.099 Sum_probs=44.1 Q ss_pred HHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHH-HHHCCCCCCCCCCHHHHCCCCHHHHHHH Q ss_conf 841495446797324666652213569999997213542677589999988-6313234687431011012330466779 Q gi|254780999|r 313 VDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVNHFF-SLMSQFLQEEPSGIVAENIQSRIRGNIL 391 (562) Q Consensus 313 ~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~-~~~~~~~~~~~~~~~~eN~qaR~R~~~L 391 (562) .-+|+.+||.-+-||+-.-| .-++.||+.||..+...| ..++... ..+...|..... +..+...+-. | T Consensus 15 ~~~l~kknIvLIG~mGsGKS----tvGk~LA~~L~~~fiD~D--~~IE~~~g~sI~eIF~~~GE----~~FR~~E~~~-l 83 (195) T PRK13946 15 RAALGKRTVVLVGLMGAGKS----TVGRRLATMLGLPFLDAD--TEIERAARMTIPEIFATYGE----PEFRDLERRV-I 83 (195) T ss_pred HHHHCCCCEEEECCCCCCHH----HHHHHHHHHHCCCEEECH--HHHHHHHCCCHHHHHHHHCH----HHHHHHHHHH-H T ss_conf 99858995899899999889----999999999797989885--99999809989999998697----9999989999-9 Q ss_pred HHHHHHCCCEECCCC Q ss_conf 888751695443643 Q gi|254780999|r 392 MALSNHSKAMLLTTS 406 (562) Q Consensus 392 ~~~an~~~~lvl~t~ 406 (562) ..+.+. +..|++|| T Consensus 84 ~~l~~~-~~~VIstG 97 (195) T PRK13946 84 ARLLKG-GPLVLATG 97 (195) T ss_pred HHHHCC-CCEEEECC T ss_conf 998648-98799758 No 146 >TIGR02039 CysD sulfate adenylyltransferase, small subunit; InterPro: IPR011784 Metabolic assimilation of sulphur from inorganic sulphate, requires sulphate activation by coupling to a nucleoside, for the production of high-energy nucleoside phosphosulphates. This pathway appears to be similar in all prokaryotic organisms. Activation is first achieved through reaction of sulphate with ATP by sulphate adenylyltransferase (ATP sulphurylase) to produce 5'-phosphosulphate (APS), coupled by GTP hydrolysis. Subsequently, APS is phosphorylated by an APS kinase to produce 3'-phosphoadenosine-5'-phosphosulphate (PAPS). In Escherichia coli, ATP sulphurylase is a heterodimer composed of two subunits encoded by cysD and cysN, with APS kinase encoded by cysC. These genes are located in a unidirectionally transcribed gene cluster, and have been shown to be required for the synthesis of sulphur-containing amino acids. Homologous to this E. coli activation pathway are nodPQH gene products found among members of the Rhizobiaceae family. These gene products have been shown to exhibit ATP sulphurase and APS kinase activity, yet are involved in Nod factor sulphation, and sulphation of other macromolecules.; GO: 0004781 sulfate adenylyltransferase (ATP) activity, 0019421 sulfate reduction APS pathway. Probab=52.04 E-value=11 Score=17.16 Aligned_cols=64 Identities=14% Similarity=0.101 Sum_probs=38.7 Q ss_pred HCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHE----EEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 224540575215872368999999841495446----797324666652213569999997213542677 Q gi|254780999|r 289 KNNFHKVIIGLSGGIDSALCAAIAVDALGKENV----QTIMLPYKYTSPQSLEDAAACAKALGCKYDVLP 354 (562) Q Consensus 289 ~~g~~~~viglSGGiDSal~a~la~~alg~~~v----~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~ 354 (562) ...|++=|+=-|+|=||+|.+.||.+|+=|-++ +-|= +.+.=.+-..---.+++..++.+.+.. T Consensus 16 aa~F~~PV~LyS~GKDS~VlLHLA~KaF~Pg~~PFplLHvD--Tgw~f~E~~~fRD~~~~~~~~~L~v~~ 83 (295) T TIGR02039 16 AAEFERPVLLYSIGKDSSVLLHLARKAFYPGRLPFPLLHVD--TGWKFREMIAFRDELVAKYGLELIVHS 83 (295) T ss_pred HHCCCCCEEEEECCCHHHHHHHHHHHCCCCCCCCCCEEEEE--CCCCHHHHHHHHHHHHHHHCCEEEEEE T ss_conf 63037976898657347999998764058888797357740--663278999999999987097688850 No 147 >PRK09987 dTDP-4-dehydrorhamnose reductase; Provisional Probab=51.64 E-value=16 Score=16.03 Aligned_cols=65 Identities=14% Similarity=0.271 Sum_probs=41.6 Q ss_pred CCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCCCH------HHH-----HHHHHHHHHHCCCEEEECCCCCCCCCCCCCC Q ss_conf 11111016899999714777599315553335807------888-----7889998543370464213568876621048 Q gi|254780999|r 150 CEDIWKNSNICKHLKKQGAEFLFSLNASPYYHNKL------KKR-----HEIVTGQISHVHLPIIYVNQVGGQDELIFDG 218 (562) Q Consensus 150 CeDlw~~~~~~~~la~~Gadlii~psASp~~~~k~------~~R-----~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G 218 (562) |-|+-.+..+...+.....|+|+|..|-.- ..+- ..+ -..+...+.+.++++|+.. .+.|||| T Consensus 37 ~~Dit~~~~v~~~~~~~~Pd~IIN~aA~T~-VD~~E~~~~~a~~vN~~~~~~La~~~~~~~~~lIhiS-----TD~VFdG 110 (299) T PRK09987 37 CGDFSNPEGVAETVRKIRPDVIVNAAAHTA-VDKAESEPEFAQLLNATSVEAIAKAANEVGAWVVHYS-----TDYVFPG 110 (299) T ss_pred CCCCCCHHHHHHHHHHCCCCEEEECHHHCC-HHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEC-----CCEEECC T ss_conf 367899999999999659999998831016-3665248999999888999999999997398599963-----2116068 Q ss_pred CC Q ss_conf 64 Q gi|254780999|r 219 AS 220 (562) Q Consensus 219 ~S 220 (562) .. T Consensus 111 ~~ 112 (299) T PRK09987 111 TG 112 (299) T ss_pred CC T ss_conf 99 No 148 >COG4646 DNA methylase [Transcription / DNA replication, recombination, and repair] Probab=51.13 E-value=15 Score=16.33 Aligned_cols=15 Identities=27% Similarity=0.419 Sum_probs=10.8 Q ss_pred HHHHHHHCCCCCEEC Q ss_conf 456687389702472 Q gi|254780999|r 525 SEYKRRQAPVGTKIT 539 (562) Q Consensus 525 ~~~KR~~~pp~~~is 539 (562) ..|||-|+.|+|++. T Consensus 588 ~a~~ryqy~~~~~ik 602 (637) T COG4646 588 RAFKRYQYVLSPRIK 602 (637) T ss_pred HHHHCCCCCCCCHHH T ss_conf 455403557761556 No 149 >TIGR01980 sufB FeS assembly protein SufB; InterPro: IPR010231 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] . FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins , . It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly . The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA . SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA , acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins . Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen . This entry represents SufB, which is part of the SUF system and forms a complex with SufBCD.; GO: 0005515 protein binding, 0016226 iron-sulfur cluster assembly. Probab=50.91 E-value=14 Score=16.52 Aligned_cols=33 Identities=12% Similarity=0.222 Sum_probs=24.0 Q ss_pred CCCCCCCCEEECCCCEEEE------------ECCCCCCCEEEEEE Q ss_conf 6210486435558985999------------61677763799999 Q gi|254780999|r 213 ELIFDGASFCFDGQQQLAF------------QMKHFSEQNFMTEW 245 (562) Q Consensus 213 ~lvf~G~S~I~d~~G~ii~------------~~~~f~e~~~i~d~ 245 (562) ..||.|||||+=|.|.-+- ...+||-.++|+|= T Consensus 183 ~AVWSGGsFvYVPkGV~v~~PLqtYFRIN~~~~GQFErTLiIade 227 (469) T TIGR01980 183 GAVWSGGSFVYVPKGVRVDMPLQTYFRINSENTGQFERTLIIADE 227 (469) T ss_pred HHEEECCEEEEECCCCEECCCCCCCEECCCCCCCCCEEEEEEEEE T ss_conf 211027857984789866663000000284778863258899951 No 150 >cd03819 GT1_WavL_like This family is most closely related to the GT1 family of glycosyltransferases. WavL in Vibrio cholerae has been shown to be involved in the biosynthesis of the lipopolysaccharide core. Probab=49.59 E-value=17 Score=15.83 Aligned_cols=15 Identities=0% Similarity=0.341 Sum_probs=7.8 Q ss_pred HHHHHHHCCCCEEEE Q ss_conf 999997147775993 Q gi|254780999|r 159 ICKHLKKQGAEFLFS 173 (562) Q Consensus 159 ~~~~la~~Gadlii~ 173 (562) ..+.+.....|++.. T Consensus 70 l~~~l~~~~~Divh~ 84 (355) T cd03819 70 LRRLIREEKVDIVHA 84 (355) T ss_pred HHHHHHHHCCCEEEE T ss_conf 999999969989997 No 151 >TIGR02346 chap_CCT_theta T-complex protein 1, theta subunit; InterPro: IPR012721 Members of this eukaryotic family are part of the group II chaperonin complex called CCT (chaperonin containing TCP-1) or TRiC. The archaeal equivalent group II chaperonin is often called the thermosome. Both are somewhat related to the group I chaperonin of bacterial, GroEL/GroES. This family consists exclusively of the CCT theta chain (part of a paralogous family) from animals, plants, fungi, and other eukaryotes.; GO: 0005524 ATP binding, 0051082 unfolded protein binding, 0006457 protein folding. Probab=49.30 E-value=18 Score=15.80 Aligned_cols=39 Identities=21% Similarity=0.400 Sum_probs=16.0 Q ss_pred CCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCC Q ss_conf 58723689999998414954467973246666522135699999972135 Q gi|254780999|r 300 SGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCK 349 (562) Q Consensus 300 SGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~ 349 (562) +||-=|=+++..+ -+-++.-+-.||+| +=++||+.+|.. T Consensus 295 ~gG~v~d~AlHy~----~~Y~im~~k~~SKF-------ELrRlCk~~gA~ 333 (554) T TIGR02346 295 TGGSVGDMALHYV----EKYNIMVLKIPSKF-------ELRRLCKTVGAT 333 (554) T ss_pred ECCCHHHHHHHHH----HHCCEEEEEECCCH-------HHHHHHHHHCCC T ss_conf 4684889999878----15582699977826-------588999880777 No 152 >PRK13604 luxD acyl transferase; Provisional Probab=47.80 E-value=18 Score=15.65 Aligned_cols=69 Identities=23% Similarity=0.378 Sum_probs=43.0 Q ss_pred HHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHH Q ss_conf 5888764224540575215872368999999841495446797324666652213569999997213542677589999 Q gi|254780999|r 282 SLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVN 360 (562) Q Consensus 282 gl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~ 360 (562) .+-|.++..|.... |=|-++|.|-||.+....-++ .+ -.|.-.-.+.-..|-++||..|...||+.+-+ T Consensus 95 ~VidWLk~rgi~~i-----GlIAaSLSARIAY~v~~eidl-sf----LiTAVGVvnLR~TLEkal~yDyl~lpi~~lPe 163 (305) T PRK13604 95 TVVDWLKTRGIQNI-----GLIAASLSARIAYEVINEIDL-SF----LITAVGVVNLRDTLEKALKYDYLQLPINELPE 163 (305) T ss_pred HHHHHHHHCCCCHH-----HHHHHHHHHHHHHHHHHHCCH-HE----EEEEEEEEEHHHHHHHHHCCCHHCCCHHHCCC T ss_conf 99999986597423-----446776788999998742123-20----45543555379999998487531089446965 No 153 >TIGR02715 amido_AtzE amidohydrolase, AtzE family; InterPro: IPR014087 Members of this protein family are aminohydrolases related to, but distinct from, glutamyl-tRNA(Gln) amidotransferase subunit A. The best-characterised member is from Pseudomonas sp., which hydrolyses ammonia from the three-nitrogen compound biuret to yield allophanate. Allophanate is also an intermediate in urea degradation by the urea carboxylase/allophanate hydrolase pathway, an alternative to urease.. Probab=47.44 E-value=5.3 Score=19.18 Aligned_cols=23 Identities=35% Similarity=0.577 Sum_probs=9.3 Q ss_pred CHHHHHHHHH--HHCCCHHHEEEEECC Q ss_conf 2368999999--841495446797324 Q gi|254780999|r 303 IDSALCAAIA--VDALGKENVQTIMLP 327 (562) Q Consensus 303 iDSal~a~la--~~alg~~~v~~~~mp 327 (562) .|+.+.|++- ++|||..++ |+|| T Consensus 268 ~~pea~AAv~~vA~AL~a~~~--V~lP 292 (454) T TIGR02715 268 ADPEALAAVGRVAKALGATEI--VELP 292 (454) T ss_pred CCHHHHHHHHHHHHHHCCCCE--EECC T ss_conf 668899999999996188626--6461 No 154 >PRK13209 L-xylulose 5-phosphate 3-epimerase; Reviewed Probab=47.42 E-value=19 Score=15.61 Aligned_cols=17 Identities=18% Similarity=0.149 Sum_probs=7.7 Q ss_pred HHHHHHHHHHHCCCCCC Q ss_conf 35699999972135426 Q gi|254780999|r 336 LEDAAACAKALGCKYDV 352 (562) Q Consensus 336 ~~~a~~la~~lg~~~~~ 352 (562) .+.+..+++.++..... T Consensus 167 ~~~~~~~i~~v~sP~l~ 183 (283) T PRK13209 167 ISKALGYAHYLNSPWFQ 183 (283) T ss_pred HHHHHHHHHHCCCCCEE T ss_conf 99999999966997289 No 155 >TIGR01455 glmM phosphoglucosamine mutase; InterPro: IPR006352 This family describes GlmM, phosphoglucosamine mutase, also designated MrsA and YhbF in Escherichia coli , UreC in Helicobacter pylori , and femR315 or FemD in Staphlococcus aureus . It converts glucosamine-6-phosphate to glucosamine-1-phosphate as part of the pathway toward UDP-N-acetylglucosamine for peptidoglycan and lipopolysaccharides.. Probab=45.75 E-value=20 Score=15.45 Aligned_cols=91 Identities=13% Similarity=0.191 Sum_probs=53.1 Q ss_pred CCC--CCCCCCCCEEEECCEEEEECCCCCC-CHHHHHHHHHHHCCCCEEEE---CCCCC--CCCCCHHHHHHHHHHHHHH Q ss_conf 100--3778887614764745620111111-10168999997147775993---15553--3358078887889998543 Q gi|254780999|r 126 KRT--FISGYSNDPIVFRDIRLGILICEDI-WKNSNICKHLKKQGAEFLFS---LNASP--YYHNKLKKRHEIVTGQISH 197 (562) Q Consensus 126 ~r~--F~~G~~~~~~~~~g~kiGv~ICeDl-w~~~~~~~~la~~Gadlii~---psASp--~~~~k~~~R~~~~~~~a~e 197 (562) .|| |....-+..+.+.|.|+-+= |--- -+.-.| ..+.+-||++|.. |+++. -..|..+ .+.++...+| T Consensus 159 ~rY~~F~~~t~P~~~~L~GLk~VlD-CAnGAay~~AP-~~F~~LGAeViAi~~~PdG~NIN~~cGSTH--~~~Lq~~V~E 234 (450) T TIGR01455 159 GRYIEFLKSTLPRGLSLSGLKVVLD-CANGAAYKVAP-KVFRELGAEVIAIGVEPDGLNINDGCGSTH--LEALQKAVRE 234 (450) T ss_pred HHHHHHHHHHCCCCCCCCCCEEEEE-CCCCHHHHHHH-HHHHHCCCEEEEECCCCCCCCCCCCCCCCC--HHHHHHHHHH T ss_conf 9999999984689988787568764-10423689999-999971986999806888632244568677--7889988402 Q ss_pred CCCEEEECCCCCCCCCCCCCCCC---EEECCCCEEE Q ss_conf 37046421356887662104864---3555898599 Q gi|254780999|r 198 VHLPIIYVNQVGGQDELIFDGAS---FCFDGQQQLA 230 (562) Q Consensus 198 ~~~~vvy~N~vGg~d~lvf~G~S---~I~d~~G~ii 230 (562) .+.-+ .+-|||.+ +++|.+|+++ T Consensus 235 ~~AdL----------GiAfDGDaDR~~~VD~~G~~V 260 (450) T TIGR01455 235 SGADL----------GIAFDGDADRVLAVDENGDIV 260 (450) T ss_pred CCCCC----------CEEECCCCCEEEEECCCCCCC T ss_conf 03440----------100128845289880665954 No 156 >PRK09982 universal stress protein UspD; Provisional Probab=44.93 E-value=9.7 Score=17.49 Aligned_cols=31 Identities=10% Similarity=0.005 Sum_probs=15.5 Q ss_pred HHHHHHHHCCC----EECCCCCHHHHHHHHCCCCC Q ss_conf 79888751695----44364324477541102366 Q gi|254780999|r 390 ILMALSNHSKA----MLLTTSNKSEISVGYGTLYG 420 (562) Q Consensus 390 ~L~~~an~~~~----lvl~t~nksE~~~Gy~T~~G 420 (562) .|..++.+.++ ..+++|+.++-....++.++ T Consensus 69 ~L~~L~~~~~YPI~~~~v~~Gdl~~~L~~~v~~~~ 103 (142) T PRK09982 69 KLYKLTKNIQWPKTKLRIERGEMPETLLEIMQKEQ 103 (142) T ss_pred HHHHHHHHCCCCCEEEEEEECCHHHHHHHHHHHHC T ss_conf 99999984799824789984488999999999849 No 157 >PRK07106 5-aminoimidazole-4-carboxamide ribonucleotide transformylase; Provisional Probab=44.71 E-value=21 Score=15.34 Aligned_cols=23 Identities=9% Similarity=0.247 Sum_probs=15.5 Q ss_pred HHHHHHHHHHCCCCEEEECCCCC Q ss_conf 16899999714777599315553 Q gi|254780999|r 156 NSNICKHLKKQGAEFLFSLNASP 178 (562) Q Consensus 156 ~~~~~~~la~~Gadlii~psASp 178 (562) +...++.+...=.|+|+.|+=++ T Consensus 125 d~~~A~~I~~~F~EvIIAP~f~~ 147 (391) T PRK07106 125 DEATARILNREVSDGIIAPGYTP 147 (391) T ss_pred CHHHHHHHHHHHHHHHCCCCCCH T ss_conf 89999999986576410578898 No 158 >PRK03367 consensus Probab=44.01 E-value=21 Score=15.33 Aligned_cols=36 Identities=19% Similarity=0.294 Sum_probs=20.4 Q ss_pred CCCCEEEEEEECCCCCCCHHHHH-HHHHHHHHHHHHCCCCEEEE Q ss_conf 98862899996388368899999-99999999999859989991 Q gi|254780999|r 1 MLKKLKIAIAQLNPVVGDIAGNI-AKARRAREEANRQGMDLILF 43 (562) Q Consensus 1 M~~~mkIal~Q~n~~~gd~~~N~-~~i~~~i~~A~~~gadLvvf 43 (562) ||++-||++. .||+.+-= +-+.++..+ ...++++++ T Consensus 1 mm~~~~IaIT-----~GDPaGIGPEIi~kl~~~--~~~~~~iv~ 37 (329) T PRK03367 1 MVKNQRVVIT-----PGEPAGIGPDLVVQLAQR--EWPVELVVC 37 (329) T ss_pred CCCCCCEEEE-----CCCCCHHHHHHHHHHHHC--CCCCCEEEE T ss_conf 9989978998-----788753189999999700--689888999 No 159 >cd03812 GT1_CapH_like This family is most closely related to the GT1 family of glycosyltransferases. capH in Staphylococcus aureus has been shown to be required for the biosynthesis of the type 1 capsular polysaccharide (CP1). Probab=44.01 E-value=21 Score=15.27 Aligned_cols=12 Identities=25% Similarity=0.332 Sum_probs=5.7 Q ss_pred HHHHHHHHHCCC Q ss_conf 699999972135 Q gi|254780999|r 338 DAAACAKALGCK 349 (562) Q Consensus 338 ~a~~la~~lg~~ 349 (562) .-+++++.+|++ T Consensus 237 ~l~~~i~~~~l~ 248 (358) T cd03812 237 EIKKKVKELGLE 248 (358) T ss_pred HHHHHHHHCCCC T ss_conf 899999982987 No 160 >cd03823 GT1_ExpE7_like This family is most closely related to the GT1 family of glycosyltransferases. ExpE7 in Sinorhizobium meliloti has been shown to be involved in the biosynthesis of galactoglucans (exopolysaccharide II). Probab=43.27 E-value=22 Score=15.20 Aligned_cols=12 Identities=8% Similarity=0.063 Sum_probs=5.1 Q ss_pred HHHHCCCCEEEE Q ss_conf 997147775993 Q gi|254780999|r 162 HLKKQGAEFLFS 173 (562) Q Consensus 162 ~la~~Gadlii~ 173 (562) .+.....|++.. T Consensus 91 ~~~~~~pDivh~ 102 (359) T cd03823 91 LLEDFRPDVVHF 102 (359) T ss_pred HHHHCCCCEEEE T ss_conf 998749999998 No 161 >TIGR01182 eda 2-dehydro-3-deoxyphosphogluconate aldolase/4-hydroxy-2-oxoglutarate aldolase; InterPro: IPR000887 4-Hydroxy-2-oxoglutarate aldolase (4.1.3.16 from EC) (KHG-aldolase) catalyzes the interconversion of 4-hydroxy-2-oxoglutarate into pyruvate and glyoxylate. Phospho-2-dehydro-3-deoxygluconate aldolase (4.1.2.14 from EC) (KDPG-aldolase) catalyzes the interconversion of 6-phospho-2-dehydro-3-deoxy-D-gluconate into pyruvate and glyceraldehyde 3-phosphate. These two enzymes are structurally and functionally related . They are both homotrimeric proteins of approximately 220 amino-acid residues. They are class I aldolases whose catalytic mechanism involves the formation of a Schiff-base intermediate between the substrate and the epsilon-amino group of a lysine residue. In both enzymes, an arginine is required for catalytic activity.; GO: 0003824 catalytic activity, 0008152 metabolic process. Probab=42.98 E-value=22 Score=15.17 Aligned_cols=16 Identities=19% Similarity=0.218 Sum_probs=6.4 Q ss_pred HHHHHHCCCCEEEECC Q ss_conf 9999985998999166 Q gi|254780999|r 30 REEANRQGMDLILFTE 45 (562) Q Consensus 30 i~~A~~~gadLvvfPE 45 (562) +++|.+.|||.+|-|= T Consensus 74 ~~~A~~AGA~F~vSPG 89 (205) T TIGR01182 74 LRQAVAAGAQFIVSPG 89 (205) T ss_pred HHHHHHCCCCEEECCC T ss_conf 9999970895787697 No 162 >TIGR02778 ligD_pol DNA polymerase LigD, polymerase domain; InterPro: IPR014145 DNA repair of double-stranded breaks by non-homologous end joining (NHEJ) is accomplished by a two-protein system that is present in a minority of prokaryotes. One component is the Ku protein (see IPR009187 from INTERPRO), which binds DNA ends. The other is a DNA ligase, a protein that is a multidomain polypeptide in most of those bacteria that have NheJ, a permuted polypeptide in Mycobacterium tuberculosis and a few other species, and the product of tandem genes in some other bacteria. This region represents the C-terminal polymerase domain.. Probab=42.73 E-value=8.6 Score=17.81 Aligned_cols=60 Identities=18% Similarity=0.187 Sum_probs=33.8 Q ss_pred HHHHCCHHHHHHHHHHHCCCCE--EEEECCC--CCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHH Q ss_conf 8642202035888764224540--5752158--723689999998414954467973246666522135 Q gi|254780999|r 273 EADYNACVLSLRDYVQKNNFHK--VIIGLSG--GIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLE 337 (562) Q Consensus 273 e~~~~Alvlgl~d~~~~~g~~~--~viglSG--GiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~ 337 (562) ..+++ -+||.-|++-||-|| |+|.|.= ..+.-=+-..|. ++ ++ -++-.||.+||++.++. T Consensus 144 r~~L~--~lgL~sFvKTSGGKGLHv~vPl~~a~~~~w~~~~~Fa~-~~-A~-~l~~~~P~~ft~~m~K~ 207 (251) T TIGR02778 144 RELLD--ELGLESFVKTSGGKGLHVYVPLRPANTLSWEEVKDFAK-AL-AK-ALAQQAPDRFTAEMSKK 207 (251) T ss_pred HHHHH--HHCCCCCCEECCCCEEEEEEEECCCCCCCHHHHHHHHH-HH-HH-HHHHHCCCHHHHHHHHH T ss_conf 99998--71235644115897038987221588988789999999-99-99-99986882456676576 No 163 >COG2176 PolC DNA polymerase III, alpha subunit (gram-positive type) [DNA replication, recombination, and repair] Probab=42.62 E-value=19 Score=15.57 Aligned_cols=32 Identities=28% Similarity=0.416 Sum_probs=20.9 Q ss_pred HHHCCCCEEEEECCCCCHHHHHHHHH----HHCCCH Q ss_conf 64224540575215872368999999----841495 Q gi|254780999|r 287 VQKNNFHKVIIGLSGGIDSALCAAIA----VDALGK 318 (562) Q Consensus 287 ~~~~g~~~~viglSGGiDSal~a~la----~~alg~ 318 (562) |+||.-.|++||--|-+-||+||.+. |..|.| T Consensus 878 VkkS~~dGYlVGSRGSVGSSlVAtm~gITEVNPL~P 913 (1444) T COG2176 878 VKKSLDDGYLVGSRGSVGSSLVATMIGITEVNPLPP 913 (1444) T ss_pred HHHHCCCCEEECCCCCCHHHHHHHHHCCCCCCCCCC T ss_conf 986114770765787504889987634412478996 No 164 >TIGR03642 cas_csx13 CRISPR-associated protein, Csx13 family. This model describes a protein N-terminal protein sequence domain strictly associated with CRISPR and CRISPR-associated protein systems. This model and TIGR02584 identify two separate clades from a larger homology domain family, both CRISPR-associated, while other homologs are found that may not be. Members are found in bacteria that include Pelotomaculum thermopropionicum SI, Thermoanaerobacter tengcongensis MB4, and Roseiflexus sp. RS-1, and in archaea that include Thermoplasma volcanium, Picrophilus torridus, and Methanospirillum hungatei. The molecular function is unknown. Probab=40.85 E-value=24 Score=14.96 Aligned_cols=53 Identities=13% Similarity=0.191 Sum_probs=40.2 Q ss_pred HHHHHHCCHHHHHHHHHHHCCCCEEEEECCCCCHH-HHHHHHHHHCCCHHHEEE Q ss_conf 88864220203588876422454057521587236-899999984149544679 Q gi|254780999|r 271 EEEADYNACVLSLRDYVQKNNFHKVIIGLSGGIDS-ALCAAIAVDALGKENVQT 323 (562) Q Consensus 271 ~~e~~~~Alvlgl~d~~~~~g~~~~viglSGGiDS-al~a~la~~alg~~~v~~ 323 (562) +.+..++.++..++++.++.+.+++.+-+|||--. ++.+++++.-+|.++|.- T Consensus 69 d~~~~~~~i~~~i~~~r~~~~~~ri~l~IsGGRK~Ms~~l~l~aql~~~DrvyH 122 (124) T TIGR03642 69 DILTFMSIAAKEVKKERENYGCERIIVNISGGRKIMTIILALYAQLLFEDEVYH 122 (124) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHCCCCEEE T ss_conf 999999999999999996588733899952873999999999999858761166 No 165 >COG5482 Uncharacterized conserved protein [Function unknown] Probab=40.14 E-value=14 Score=16.52 Aligned_cols=85 Identities=13% Similarity=0.068 Sum_probs=47.0 Q ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHCCC Q ss_conf 67973246666522135699999972135426775899999886313234687431011012330466779888751695 Q gi|254780999|r 321 VQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMALSNHSKA 400 (562) Q Consensus 321 v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~~~~ 400 (562) -++..||.+.--.+|..+-..||+.||+-..-++-..-+.-....+ .+.+.+ | ++-|.-+.-.+-+.+|- T Consensus 69 wiAar~sakgkgresd~r~r~LcRRLg~G~lgvs~~g~v~ilv~~v------~PepR~--N--~krRskl~~EhqrRrgd 138 (229) T COG5482 69 WIAARVSAKGKGRESDKRYRDLCRRLGLGMLGVSDVGDVSILVGSV------TPEPRT--N--PKRRSKLMREHQRRRGD 138 (229) T ss_pred HHHHCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCEEEEEEEC------CCCCCC--C--HHHHHHHHHHHHHHCCC T ss_conf 6421024344652557899999998567647775259635898626------878866--8--56788999999872289 Q ss_pred EECCCCCHHHHHHHH Q ss_conf 443643244775411 Q gi|254780999|r 401 MLLTTSNKSEISVGY 415 (562) Q Consensus 401 lvl~t~nksE~~~Gy 415 (562) .++|-+-+.-+++.| T Consensus 139 p~vGGs~r~pimTAY 153 (229) T COG5482 139 PAVGGSTRAPIMTAY 153 (229) T ss_pred CCCCCCCCCHHHHHH T ss_conf 876773443799999 No 166 >pfam01553 Acyltransferase Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene. Probab=39.01 E-value=25 Score=14.78 Aligned_cols=10 Identities=10% Similarity=0.066 Sum_probs=3.6 Q ss_pred CCCCEEEECC Q ss_conf 4777599315 Q gi|254780999|r 166 QGAEFLFSLN 175 (562) Q Consensus 166 ~Gadlii~ps 175 (562) +|-.+++-|- T Consensus 90 ~g~~i~iFPE 99 (132) T pfam01553 90 AGKLVVIFPE 99 (132) T ss_pred CCCEEEEECC T ss_conf 9988999388 No 167 >pfam04356 DUF489 Protein of unknown function (DUF489). Protein of unknown function, cotranscribed with purB in Escherichia coli, but with function unrelated to purine biosynthesis. Probab=38.93 E-value=25 Score=14.77 Aligned_cols=57 Identities=11% Similarity=0.173 Sum_probs=29.8 Q ss_pred HHHHHHHHCCCCCCC--CHHHHHHHHHHHHCCCC--CCCCCCHHHHCCCCHHHHHHHHHHH Q ss_conf 999999721354267--75899999886313234--6874310110123304667798887 Q gi|254780999|r 339 AAACAKALGCKYDVL--PIHDLVNHFFSLMSQFL--QEEPSGIVAENIQSRIRGNILMALS 395 (562) Q Consensus 339 a~~la~~lg~~~~~i--~I~~~~~~~~~~~~~~~--~~~~~~~~~eN~qaR~R~~~L~~~a 395 (562) +..-++.++..+..+ .+..+.....+.+++-+ .+++.-+..++.|+|+|+.+|..+= T Consensus 108 ~~~q~~~f~~~~~~via~lA~iY~dtiS~l~pRI~V~G~~~~Lq~~~~~~kIRALLLAGIR 168 (190) T pfam04356 108 IERQLEHFDLTHENVIANLASIYSDTISTLGPRIQVTGSPEHLQQPQNQAKIRALLLAGIR 168 (190) T ss_pred HHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCHHHHCCCCHHHHHHHHHHHHHH T ss_conf 9999985443519999999999999754479816984878880795269999999999999 No 168 >TIGR02317 prpB methylisocitrate lyase; InterPro: IPR012695 Propionate is the second most abundant low molecular mass carbon source found in soil, being generated by the degradation of several amino acids, fementation of crabohydrates and the oxidation of odd-chain fatty acids. Many different organisms are capable of metabolising propionate and several distinct pathways for utilising this compound exist, all of which begin with its activation to propionyl-CoA . Many bacteria metabolise propionate to pyruvate by the 2-methylcitric acid cycle. In the first step, catalysed by methycitrate synthase, oxaloacetate and propionyl-CoA are condensed to form 2-methylcitrate and CoA. The 2-methylcitrate is subsequently converted to 2-methylisocitrate by the combined actions of a dehydratase and an aconitase. In the final step, 2-methylisocitrate lyase cleaves a C-C bond within this compound to form succinate and pyruvate. The succinate is further oxidised to oxaloacetate which re-enters the cycle, while the pyruvate can be used further for either energy generation or biomass synthesis. Proteins in this entry are mostly 2-methylisocitrate lyases. Some of them have been annotated (incorrectly it seems) as the related protein carboxyphosphoenolpyruvate phosphomutase, which is involved in synthesizing the antibiotic bialaphos in Streptomyces hygroscopicus. Structural studies indicate that these enzymes belong to the isocitrate lyase family, binding a Mg(2+) cofactor and forming a TIM barrel fold , . As with other enzymes in this family, substrate-binding induces a conformational change that prevents solvent access to the active site and which is important for catalysis. . Probab=38.80 E-value=25 Score=14.76 Aligned_cols=39 Identities=23% Similarity=0.361 Sum_probs=21.1 Q ss_pred HHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCC-CEEEECCCC Q ss_conf 9999714777599315553335807888788999854337-046421356 Q gi|254780999|r 160 CKHLKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVH-LPIIYVNQV 208 (562) Q Consensus 160 ~~~la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~-~~vvy~N~v 208 (562) +..+.+.|||+|| |..+... +.++..+..-. +|++ +|.. T Consensus 168 A~aYvEAGAD~IF-----pEAL~~~----~eFr~Fa~~vk~vPLL-ANMT 207 (287) T TIGR02317 168 AKAYVEAGADMIF-----PEALESE----EEFREFAKAVKDVPLL-ANMT 207 (287) T ss_pred HHHHHHHCCCCCH-----HHHHHHH----HHHHHHHHHCCCCCEE-CCCC T ss_conf 9867872615313-----8785328----8889999866898423-0112 No 169 >TIGR01196 edd phosphogluconate dehydratase; InterPro: IPR004786 Two dehydratases, dihydroxy-acid dehydratase (gene ilvD or ILV3) and 6-phosphogluconate dehydratase (gene edd) have been shown to be evolutionary related . Dihydroxy-acid dehydratase catalyzes the fourth step in the biosynthesis of isoleucine and valine, the dehydratation of 2,3-dihydroxy-isovaleic acid into alpha-ketoisovaleric acid. 6-Phosphogluconate dehydratase catalyzes the first step in the Entner-Doudoroff pathway, the dehydratation of 6-phospho-D-gluconate into 6-phospho-2-dehydro-3-deoxy-D-gluconate. Another protein containing this signature is the Escherichia coli hypothetical protein yjhG. The N-terminal part of the proteins contains a cysteine that could be involved in the binding of a 2Fe-2S iron-sulphur cluster . This family defines the 6-phosphogluconate dehydratases.; GO: 0004456 phosphogluconate dehydratase activity, 0009255 Entner-Doudoroff pathway. Probab=38.80 E-value=25 Score=14.76 Aligned_cols=77 Identities=14% Similarity=0.170 Sum_probs=41.1 Q ss_pred HHHCCCCEEEEECCCCCH--HHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHH Q ss_conf 642245405752158723--689999998414954467973246666522135699999972135426775899999886 Q gi|254780999|r 287 VQKNNFHKVIIGLSGGID--SALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVNHFFS 364 (562) Q Consensus 287 ~~~~g~~~~viglSGGiD--Sal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~ 364 (562) .+-++-.+-++++--=|| |=|-+.++-.| +-.|+.+|.-. -++|++-||-+.==|+++ +..... T Consensus 269 ~r~t~~ng~~~P~gemiDEKsivN~~vgL~A------------TGGSTNHTmHL-vAmARAAGi~LnWdDfse-lS~~vP 334 (614) T TIGR01196 269 ARLTAKNGEVLPLGEMIDEKSIVNAVVGLLA------------TGGSTNHTMHL-VAMARAAGIILNWDDFSE-LSDVVP 334 (614) T ss_pred HHHCCCCCCEEEHHHHHHHHHHHHHHHHHHH------------CCCCHHHHHHH-HHHHHHCCCEECCCCHHH-HHHHHH T ss_conf 8521688824112000015778889999862------------48863578999-999975691305123655-510533 Q ss_pred HHCCCCCCCCCCH Q ss_conf 3132346874310 Q gi|254780999|r 365 LMSQFLQEEPSGI 377 (562) Q Consensus 365 ~~~~~~~~~~~~~ 377 (562) .|......-..|| T Consensus 335 LLARvYPNG~ADv 347 (614) T TIGR01196 335 LLARVYPNGKADV 347 (614) T ss_pred HHHEECCCCCCCH T ss_conf 4503267984222 No 170 >cd03795 GT1_like_4 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP-linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=38.71 E-value=25 Score=14.75 Aligned_cols=14 Identities=21% Similarity=0.242 Sum_probs=5.2 Q ss_pred HHHHHHHHHHHCCC Q ss_conf 99999999887697 Q gi|254780999|r 69 AIDTLKSDTHDGGA 82 (562) Q Consensus 69 ~l~~La~~~~~~~i 82 (562) .+.+|++...+.|. T Consensus 19 ~~~~L~~~L~~~Gh 32 (357) T cd03795 19 VIRDLAEGLAARGI 32 (357) T ss_pred HHHHHHHHHHHCCC T ss_conf 99999999997799 No 171 >cd03802 GT1_AviGT4_like This family is most closely related to the GT1 family of glycosyltransferases. aviGT4 in Streptomyces viridochromogenes has been shown to be involved in biosynthesis of oligosaccharide antibiotic avilamycin A. Inactivation of aviGT4 resulted in a mutant that accumulated a novel avilamycin derivative lacking the terminal eurekanate residue. Probab=38.24 E-value=26 Score=14.70 Aligned_cols=36 Identities=19% Similarity=0.288 Sum_probs=21.1 Q ss_pred EEEEEEECCC------CCCCHHHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 2899996388------36889999999999999999859989991 Q gi|254780999|r 5 LKIAIAQLNP------VVGDIAGNIAKARRAREEANRQGMDLILF 43 (562) Q Consensus 5 mkIal~Q~n~------~~gd~~~N~~~i~~~i~~A~~~gadLvvf 43 (562) |||+++.... ..|-.+. .+.++++...+.|-++.|+ T Consensus 1 MkI~~v~~~~~p~pP~~~GG~e~---~~~~La~~L~~~Gh~V~v~ 42 (335) T cd03802 1 MRIALVAPPREPVPPPAYGGTER---VVAALTEGLVARGHEVTLF 42 (335) T ss_pred CEEEEECCCCCCCCCCCCCCHHH---HHHHHHHHHHHCCCEEEEE T ss_conf 98699888400369999897999---9999999999769989999 No 172 >PRK00232 pdxA 4-hydroxythreonine-4-phosphate dehydrogenase; Reviewed Probab=38.05 E-value=24 Score=14.88 Aligned_cols=40 Identities=28% Similarity=0.332 Sum_probs=23.5 Q ss_pred CCCCEEEEEEECCCCCCCHHHH-HHHHHHHHHHH-HHCCCCEEEECC Q ss_conf 9886289999638836889999-99999999999-985998999166 Q gi|254780999|r 1 MLKKLKIAIAQLNPVVGDIAGN-IAKARRAREEA-NRQGMDLILFTE 45 (562) Q Consensus 1 M~~~mkIal~Q~n~~~gd~~~N-~~~i~~~i~~A-~~~gadLvvfPE 45 (562) ||.|-+|++. .||+.+- -+-+.+.+.+. ..+.+.++++-. T Consensus 1 mm~Kp~IaIT-----~GDPaGIGpEIilKal~~~~~~~~~~~viigd 42 (334) T PRK00232 1 MMMKPRIAIT-----PGDPAGIGPELIAKLLAQPDVRWPAHLVVIAD 42 (334) T ss_pred CCCCCCEEEE-----CCCCCHHHHHHHHHHHHCCCCCCCCCEEEEEC T ss_conf 9999828996-----88863538999999984813048988899988 No 173 >pfam01261 AP_endonuc_2 Xylose isomerase-like TIM barrel. This TIM alpha/beta barrel structure is found in xylose isomerase and in endonuclease IV (EC:3.1.21.2). This domain is also found in the N termini of bacterial myo-inositol catabolism proteins. These are involved in the myo-inositol catabolism pathway, and is required for growth on myo-inositol in Rhizobium leguminosarum bv. viciae. Probab=37.84 E-value=26 Score=14.66 Aligned_cols=23 Identities=22% Similarity=0.319 Sum_probs=9.5 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEC Q ss_conf 99999999999998599899916 Q gi|254780999|r 22 NIAKARRAREEANRQGMDLILFT 44 (562) Q Consensus 22 N~~~i~~~i~~A~~~gadLvvfP 44 (562) .++.+.+.++.|+.-|+..++++ T Consensus 62 ~~~~~~~~l~~a~~lG~~~i~~~ 84 (201) T pfam01261 62 ALEALKRAIELAAALGAKVVVVH 84 (201) T ss_pred HHHHHHHHHHHHHHCCCCEEEEC T ss_conf 99999999999997399589982 No 174 >TIGR02776 NHEJ_ligase_prk DNA ligase D; InterPro: IPR014143 Members of this entry are DNA ligases involved in the repair of DNA double-stranded breaks by non-homologous end joining (NheJ). The system of the bacterial Ku protein (IPR009187 from INTERPRO) plus this DNA ligase is seen in about 200f bacterial genomes to date and at least one archaeon (Archeoglobus fulgidus). This entry describes a central and C-terminal domain. These two domains may be permuted, as in genus Mycobacterium, or divided into tandem ORFs. An additional N-terminal 3 -phosphoesterase (PE) domain (IPR014144 from INTERPRO) is present in some members of this ligase. Most examples of genes for this ligase are adjacent to the gene for Ku.. Probab=37.64 E-value=26 Score=14.64 Aligned_cols=12 Identities=17% Similarity=0.232 Sum_probs=4.9 Q ss_pred CCCEECCCCCHH Q ss_conf 695443643244 Q gi|254780999|r 398 SKAMLLTTSNKS 409 (562) Q Consensus 398 ~~~lvl~t~nks 409 (562) ...+.++.-+|- T Consensus 359 ~~~v~ithPDk~ 370 (645) T TIGR02776 359 VAGVRITHPDKV 370 (645) T ss_pred CCCEEEECCCEE T ss_conf 672175177502 No 175 >PRK08639 threonine dehydratase; Validated Probab=37.40 E-value=27 Score=14.61 Aligned_cols=48 Identities=23% Similarity=0.331 Sum_probs=26.9 Q ss_pred HHHHHHHHCCCCEEEEECCCC-CHHHHHHHHHHHCC---CHHHEEEEECCCC Q ss_conf 588876422454057521587-23689999998414---9544679732466 Q gi|254780999|r 282 SLRDYVQKNNFHKVIIGLSGG-IDSALCAAIAVDAL---GKENVQTIMLPYK 329 (562) Q Consensus 282 gl~d~~~~~g~~~~viglSGG-iDSal~a~la~~al---g~~~v~~~~mp~~ 329 (562) ||..+..+..-++||+=+||| +|.....-+.-.+| |++..+.+.+|.+ T Consensus 293 al~~~~~~~~gk~Vv~ilsGgNiD~~~l~~i~er~l~~~gr~~~~~v~~pd~ 344 (418) T PRK08639 293 ALDEYKDEIKGKTVVCVISGGNNDIERMPEIKERSLRYEGLKHYFIVNFPQR 344 (418) T ss_pred HHHHCHHHHCCCEEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCC T ss_conf 9984615526980999981588898899999999878629869999978999 No 176 >TIGR03436 acidobact_VWFA VWFA-related Acidobacterial domain. Members of this family are bacterial domains that include a region related to the von Willebrand factor type A (VWFA) domain (pfam00092). These domains are restricted to, and have undergone a large paralogous family expansion in, the Acidobacteria, including Solibacter usitatus and Acidobacterium capsulatum ATCC 51196. Probab=36.80 E-value=27 Score=14.55 Aligned_cols=34 Identities=26% Similarity=0.271 Sum_probs=24.0 Q ss_pred HHHHCCHHHHHHHHHHHCC----CCEEEEECCCCCHHH Q ss_conf 8642202035888764224----540575215872368 Q gi|254780999|r 273 EADYNACVLSLRDYVQKNN----FHKVIIGLSGGIDSA 306 (562) Q Consensus 273 e~~~~Alvlgl~d~~~~~g----~~~~viglSGGiDSa 306 (562) -.+|+|+.++..+.+.+.. -++++|=+|.|.|.+ T Consensus 141 tal~dAi~laa~~~~~~~~~~~~gRK~li~iSdG~d~~ 178 (296) T TIGR03436 141 TALYDAITLAALQQLANALAGIPGRKALIVISDGEDNS 178 (296) T ss_pred CCHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCC T ss_conf 10278899999999875404798867999992698863 No 177 >pfam01488 Shikimate_DH Shikimate / quinate 5-dehydrogenase. This family contains both shikimate and quinate dehydrogenases. Shikimate 5-dehydrogenase catalyses the conversion of shikimate to 5-dehydroshikimate. This reaction is part of the shikimate pathway which is involved in the biosynthesis of aromatic amino acids. Quinate 5-dehydrogenase catalyses the conversion of quinate to 5-dehydroquinate. This reaction is part of the quinate pathway where quinic acid is exploited as a source of carbon in prokaryotes and microbial eukaryotes. Both the shikimate and quinate pathways share two common pathway metabolites 3-dehydroquinate and dehydroshikimate. Probab=35.60 E-value=19 Score=15.51 Aligned_cols=56 Identities=20% Similarity=0.400 Sum_probs=31.8 Q ss_pred CCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCC-CCCCCHHH Q ss_conf 45405752158723689999998414954467973246666522135699999972135-42677589 Q gi|254780999|r 291 NFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCK-YDVLPIHD 357 (562) Q Consensus 291 g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~-~~~i~I~~ 357 (562) +.+-+|||- |++-+.++..|... |..+|.-+ ..|.+.|+.+|+.+|-. ...++.+. T Consensus 12 ~~~vlVIGa-G~~~~~~~~~L~~~--g~~~i~v~--------nRt~~ka~~la~~~~~~~~~~~~~~~ 68 (134) T pfam01488 12 GKKVLLIGA-GEMARLAAKHLLSK--GAKKITIA--------NRTLEKAKELAEEFGGEEVEALPLDE 68 (134) T ss_pred CCEEEEECC-CHHHHHHHHHHHHC--CCCEEEEE--------CCCHHHHHHHHHHCCCCCEEEEECHH T ss_conf 898999996-09999999999975--99889995--------47578999999984997258985135 No 178 >PRK11907 bifunctional 2',3'-cyclic nucleotide 2'-phosphodiesterase/3'-nucleotidase precursor protein; Reviewed Probab=35.44 E-value=29 Score=14.41 Aligned_cols=36 Identities=25% Similarity=0.284 Sum_probs=26.6 Q ss_pred HHHHHHHHHHHHHHCCCCEEEECC-CCCCCCCHHHHH Q ss_conf 999999999999985998999166-444799844541 Q gi|254780999|r 22 NIAKARRAREEANRQGMDLILFTE-LFISGYPPEDLV 57 (562) Q Consensus 22 N~~~i~~~i~~A~~~gadLvvfPE-l~LtGY~~~Dl~ 57 (562) -+.|.-.+|++|+++...-+++=- -.|.|-|.+|+. T Consensus 143 GLaRtAtLI~~aR~E~~NslLvDNGD~iQGsPLgDY~ 179 (810) T PRK11907 143 GLAKTAVLIEEAKKENPNTVLVDNGDTIQGTPLGTYK 179 (810) T ss_pred CHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCHHHH T ss_conf 4789999999998636874998478401677205467 No 179 >TIGR01181 dTDP_gluc_dehyt dTDP-glucose 4,6-dehydratase; InterPro: IPR005888 The conversion of dTDP-glucose into dTDP-4-keto-6-deoxyglucose by Escherichia coli dTDP-glucose 4,6-dehydratase) takes place in the active site in three steps: dehydrogenation to dTDP-4-ketoglucose, dehydration to dTDP-4-ketoglucose-5,6-ene, and rereduction of C6 to the methyl group. The 4,6-dehydratase makes use of tightly bound NAD^+ as the coenzyme for transiently oxidizing the substrate, activating it for the dehydration step . This and other 4,6-dehydratases catalyze the first committed step in all 6-deoxysugar biosynthetic pathways described to date. Numerous 6-deoxysugars are used in bacterial lipopolysaccharide production as well as in the biosynthesis of a diverse array of secondary metabolites.; GO: 0008460 dTDP-glucose 46-dehydratase activity, 0009225 nucleotide-sugar metabolic process. Probab=35.24 E-value=29 Score=14.39 Aligned_cols=41 Identities=17% Similarity=0.148 Sum_probs=20.0 Q ss_pred HHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCC Q ss_conf 99999714777599315553335807888788999854337046421356887 Q gi|254780999|r 159 ICKHLKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQ 211 (562) Q Consensus 159 ~~~~la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~ 211 (562) +.+.+.+.-+|.++|..|--+ ..||+..=.+++-+|-+|-+ T Consensus 65 v~~~~~e~~~D~VvhFAAESH------------VDRSI~~P~~F~~TNv~GT~ 105 (340) T TIGR01181 65 VDRLLKEHDPDAVVHFAAESH------------VDRSISGPEAFIETNVVGTY 105 (340) T ss_pred HHHHHHHCCCCEEEECCCCCH------------HHHCCCCCHHHHHCCHHHHH T ss_conf 988840017677886222052------------33301454114440337889 No 180 >PRK06707 amidase; Provisional Probab=35.17 E-value=21 Score=15.32 Aligned_cols=15 Identities=47% Similarity=0.512 Sum_probs=9.5 Q ss_pred HHHHHHHHCCCCEEE Q ss_conf 899999714777599 Q gi|254780999|r 158 NICKHLKKQGAEFLF 172 (562) Q Consensus 158 ~~~~~la~~Gadlii 172 (562) .+.+.|...||=+|= T Consensus 173 ~vV~rLr~AGAIilG 187 (536) T PRK06707 173 TIVKQLKEEGAFVLG 187 (536) T ss_pred HHHHHHHHCCCEEEE T ss_conf 999999978987987 No 181 >PRK01909 pdxA 4-hydroxythreonine-4-phosphate dehydrogenase; Validated Probab=35.04 E-value=29 Score=14.37 Aligned_cols=40 Identities=25% Similarity=0.326 Sum_probs=21.6 Q ss_pred CC-CCEEEEEEECCCCCCCHHHH-HHHHHHHHHHHHH--CCCCEEEECC Q ss_conf 98-86289999638836889999-9999999999998--5998999166 Q gi|254780999|r 1 ML-KKLKIAIAQLNPVVGDIAGN-IAKARRAREEANR--QGMDLILFTE 45 (562) Q Consensus 1 M~-~~mkIal~Q~n~~~gd~~~N-~~~i~~~i~~A~~--~gadLvvfPE 45 (562) |+ +++||++.. ||+.+- -+-+.+.+.+... ..+.++|+-. T Consensus 1 m~~~~lrIaIT~-----GDPaGIGPEIilKal~~~~~~~~~~~~vviGd 44 (329) T PRK01909 1 MRPQPLQIAITT-----GEPAGVGPELTVQALADAATRWPDARFTVLGD 44 (329) T ss_pred CCCCCCEEEEEC-----CCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEC T ss_conf 999997699975-----88850179999999986575468988899979 No 182 >COG1134 TagH ABC-type polysaccharide/polyol phosphate transport system, ATPase component [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane] Probab=34.41 E-value=30 Score=14.31 Aligned_cols=17 Identities=18% Similarity=0.018 Sum_probs=7.7 Q ss_pred HHHHHHHHHHHCCCCCC Q ss_conf 35699999972135426 Q gi|254780999|r 336 LEDAAACAKALGCKYDV 352 (562) Q Consensus 336 ~~~a~~la~~lg~~~~~ 352 (562) .++.+..+.-+|....+ T Consensus 107 reNi~l~~~~lG~~~~e 123 (249) T COG1134 107 RENIYLRGLILGLTRKE 123 (249) T ss_pred HHHHHHHHHHHCCCHHH T ss_conf 87899999984666999 No 183 >PTZ00249 variable surface protein Vir28; Provisional Probab=34.22 E-value=15 Score=16.32 Aligned_cols=23 Identities=13% Similarity=0.047 Sum_probs=13.1 Q ss_pred ECCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 638836889999999999999999 Q gi|254780999|r 11 QLNPVVGDIAGNIAKARRAREEAN 34 (562) Q Consensus 11 Q~n~~~gd~~~N~~~i~~~i~~A~ 34 (562) ++++++.| +.|++++...+..+. T Consensus 19 ~F~~~~~~-~~n~~~~~~~l~~~~ 41 (516) T PTZ00249 19 SFDYTPTD-EFNISFVNALIKDIR 41 (516) T ss_pred HCCCCCCC-CHHHHHHHHHHHHHH T ss_conf 63788773-100899999997640 No 184 >cd02649 nuc_hydro_CeIAG nuc_hydro_CeIAG: Nucleoside hydrolases similar to the inosine-adenosine-guanosine-preferring nucleoside hydrolase from Caenorhabditis elegans. Nucleoside hydrolases cleave the N-glycosidic bond in nucleosides generating ribose and the respective base. These enzymes vary in their substrate specificity. This group contains eukaryotic, bacterial and archeal proteins similar to the purine-preferring nucleoside hydrolase (IAG-NH) from C. elegans and the salivary purine nucleosidase from Aedes aegypti. C. elegans IAG-NH exhibits a high affinity for the substrate analogue p-nitrophenylriboside (p-NPR). Probab=33.23 E-value=31 Score=14.19 Aligned_cols=46 Identities=13% Similarity=0.206 Sum_probs=17.2 Q ss_pred HHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHC-CCHHHEE-EEECCCCC Q ss_conf 5888764224540575215872368999999841-4954467-97324666 Q gi|254780999|r 282 SLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDA-LGKENVQ-TIMLPYKY 330 (562) Q Consensus 282 gl~d~~~~~g~~~~viglSGGiDSal~a~la~~a-lg~~~v~-~~~mp~~~ 330 (562) .+.|-++++. ..+.| +.=|==+-+.+++..+- + +++|. -+.|.+-+ T Consensus 107 ~i~~~~~~~p-~eiti-va~GPlTNlAlal~~~P~~-~~~i~~iviMGGa~ 154 (306) T cd02649 107 AIIRLVREYP-GEITL-VALGPLTNLALAYRLDPSL-PQKIKRLYIMGGNR 154 (306) T ss_pred HHHHHHHHCC-CCEEE-EECCCHHHHHHHHHHCHHH-HHHCCEEEEEECCC T ss_conf 9999998589-97799-9767648999999979677-85378289981556 No 185 >PRK08662 nicotinate phosphoribosyltransferase; Reviewed Probab=33.17 E-value=28 Score=14.45 Aligned_cols=38 Identities=21% Similarity=0.313 Sum_probs=29.7 Q ss_pred HHCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHH Q ss_conf 42202035888764224540575215872368999999 Q gi|254780999|r 275 DYNACVLSLRDYVQKNNFHKVIIGLSGGIDSALCAAIA 312 (562) Q Consensus 275 ~~~Alvlgl~d~~~~~g~~~~viglSGGiDSal~a~la 312 (562) ....++--+|..+...|++.+-|=+|||+|---...+. T Consensus 242 d~~~l~~~vR~~LD~~G~~~vkIv~Sgglde~~I~~l~ 279 (343) T PRK08662 242 NFRKIVEEVRWTLDLHGYSHVKIFVSGGLDEEDIRELR 279 (343) T ss_pred CHHHHHHHHHHHHHHCCCCCCEEEEECCCCHHHHHHHH T ss_conf 77999999999854557998289994899999999998 No 186 >TIGR02356 adenyl_thiF thiazole biosynthesis adenylyltransferase ThiF; InterPro: IPR012731 Members of the HesA/MoeB/ThiF family of proteins (IPR000594 from INTERPRO) include a number of members encoded in the midst of thiamine biosynthetic operons. This mix of known and putative ThiF proteins shows a deep split in phylogenetic trees. The Escherichia coli ThiF and MoeB proteins are seemingly more closely related than the E. coli ThiF and Campylobacter (for example) ThiF. This entry represents the more widely distributed clade of ThiF proteins as found in E. coli.. Probab=32.72 E-value=31 Score=14.13 Aligned_cols=57 Identities=19% Similarity=0.438 Sum_probs=38.3 Q ss_pred HHHHHHHHHHHCCCC--EEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEECCC Q ss_conf 016899999714777--599315553335807888788999854337046421356887662104864355589 Q gi|254780999|r 155 KNSNICKHLKKQGAE--FLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVGGQDELIFDGASFCFDGQ 226 (562) Q Consensus 155 ~~~~~~~~la~~Gad--lii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~ 226 (562) .+++.. ..-+.+| |++= +|. -...| .++...+...+.|+||+..+| |.|..+++++. T Consensus 103 t~~~~~--~~i~~~DPdlVlD--c~D----Nf~tr-yliNdaC~~~~~PlI~aavvG------~~G~~~~f~~~ 161 (210) T TIGR02356 103 TAENLE--LLIENVDPDLVLD--CTD----NFATR-YLINDACQALGIPLISAAVVG------FEGQVMVFDPG 161 (210) T ss_pred CHHHHH--HHHHCCCCCEEEE--CCC----CHHHH-HHHHHHHHHHCCCEEEEEEEE------ECCEEEEEECC T ss_conf 779999--9962389968961--566----87788-999999998479869998876------35778998468 No 187 >cd01714 ETF_beta The electron transfer flavoprotein (ETF) serves as a specific electron acceptor for various mitochondrial dehydrogenases. ETF transfers electrons to the main respiratory chain via ETF-ubiquinone oxidoreductase. ETF is an heterodimer that consists of an alpha and a beta subunit which binds one molecule of FAD per dimer . A similar system also exists in some bacteria. The homologous pair of proteins (FixA/FixB) are essential for nitrogen fixation. The beta subunit protein is distantly related to and forms a heterodimer with the alpha subunit. Probab=32.59 E-value=32 Score=14.12 Aligned_cols=52 Identities=21% Similarity=0.062 Sum_probs=22.6 Q ss_pred EEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHC Q ss_conf 57521587236899999984149544679732466665221356999999721 Q gi|254780999|r 295 VIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALG 347 (562) Q Consensus 295 ~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg 347 (562) -|..+|=|=+++-...--+-|+|.++.+-+.-+... ..++..-|+.|++.+. T Consensus 54 ~VtvlsvGp~~a~~~Lr~alAmGaD~ai~i~d~~~~-~~D~~~tA~~La~~i~ 105 (202) T cd01714 54 EVTVVSMGPPQAEEALREALAMGADRAILVSDRAFA-GADTLATAKALAAAIK 105 (202) T ss_pred EEEEEEECCHHHHHHHHHHHHHCCCEEEEECCCCCC-CCCHHHHHHHHHHHHH T ss_conf 899999374788999999997089835998065556-6698999999999998 No 188 >PRK00218 hypothetical protein; Reviewed Probab=31.48 E-value=33 Score=14.00 Aligned_cols=58 Identities=10% Similarity=0.184 Sum_probs=30.8 Q ss_pred HHHHHHHHHCCCCCCC--CHHHHHHHHHHHHCCCC--CCCCCCHHHHCCCCHHHHHHHHHHH Q ss_conf 6999999721354267--75899999886313234--6874310110123304667798887 Q gi|254780999|r 338 DAAACAKALGCKYDVL--PIHDLVNHFFSLMSQFL--QEEPSGIVAENIQSRIRGNILMALS 395 (562) Q Consensus 338 ~a~~la~~lg~~~~~i--~I~~~~~~~~~~~~~~~--~~~~~~~~~eN~qaR~R~~~L~~~a 395 (562) .+..-++.++..+..+ .+..+.....+.+++-+ .+++.-+..++.|+|+|+.+|..+= T Consensus 113 ~~~~q~~~f~~~~~~via~la~iY~dtIS~l~pRI~V~G~~~~L~~~~~a~kIRALLLAGIR 174 (207) T PRK00218 113 QIQRQLEHFDLLHENVIANLASIYSDTISPLGPRIQVTGNPAYLQQPQVQAKIRALLLAGIR 174 (207) T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCHHHHCCCCHHHHHHHHHHHHHH T ss_conf 99999985453629999999999999754469826884878880795269999999999999 No 189 >PRK11340 phosphodiesterase YaeI; Provisional Probab=31.44 E-value=33 Score=14.00 Aligned_cols=76 Identities=16% Similarity=-0.011 Sum_probs=39.7 Q ss_pred CCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 86289999638836889999999999999999859989991664447998445414889999999999999999887697 Q gi|254780999|r 3 KKLKIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTHDGGA 82 (562) Q Consensus 3 ~~mkIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~~~~i 82 (562) +.+||+... +...|.+ .+.+++.+.++...+.++|+|++.==.+.. ..+...+.. .+.+....+..++ T Consensus 47 ~~lrI~~lS-DlH~g~~-~~~~~l~~~v~~~n~~~PDlVv~tGD~vd~-------~~~~~~~~~---~~~l~~L~~~~gv 114 (270) T PRK11340 47 APFKILFLA-DLHYSRF-VPLSLISDAIALGIEQKPDLILLGGDYVLF-------DMPLNFSAF---SDVLSPLAECAPT 114 (270) T ss_pred CCCEEEEEE-CCCCCCC-CCHHHHHHHHHHHHHCCCCEEEECCCEECC-------CCCHHHHHH---HHHHHHCCCCCCE T ss_conf 995899992-5668898-898999999999871299999986763447-------772107999---9998740133885 Q ss_pred EEEEEEEE Q ss_conf 89999999 Q gi|254780999|r 83 GIVVGFPR 90 (562) Q Consensus 83 ~iivG~p~ 90 (562) +.+.|.-+ T Consensus 115 yaVlGNHD 122 (270) T PRK11340 115 FACFGNHD 122 (270) T ss_pred EEECCCCC T ss_conf 99737864 No 190 >COG5322 Predicted dehydrogenase [General function prediction only] Probab=31.43 E-value=33 Score=13.99 Aligned_cols=22 Identities=32% Similarity=0.415 Sum_probs=16.9 Q ss_pred CCEEEEECCCCCHHHHHHHHHH Q ss_conf 5405752158723689999998 Q gi|254780999|r 292 FHKVIIGLSGGIDSALCAAIAV 313 (562) Q Consensus 292 ~~~~viglSGGiDSal~a~la~ 313 (562) ..-.|+|-.|.|-|+++-.|+. T Consensus 168 atvaivGa~G~Ia~~Iar~la~ 189 (351) T COG5322 168 ATVAIVGATGDIASAIARWLAP 189 (351) T ss_pred CEEEEECCCCHHHHHHHHHHCC T ss_conf 8578724785489999987634 No 191 >COG2129 Predicted phosphoesterases, related to the Icc protein [General function prediction only] Probab=31.40 E-value=33 Score=13.99 Aligned_cols=38 Identities=24% Similarity=0.375 Sum_probs=30.4 Q ss_pred CCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECC Q ss_conf 88628999963883688999999999999999985998999166 Q gi|254780999|r 2 LKKLKIAIAQLNPVVGDIAGNIAKARRAREEANRQGMDLILFTE 45 (562) Q Consensus 2 ~~~mkIal~Q~n~~~gd~~~N~~~i~~~i~~A~~~gadLvvfPE 45 (562) +.+|||.... |+.++..+..+.+..|+...+|++|.-= T Consensus 1 ~~~mkil~vt------DlHg~~~~~~k~~~~~~~~~~D~lviaG 38 (226) T COG2129 1 IKKMKILAVT------DLHGSEDSLKKLLNAAADIRADLLVIAG 38 (226) T ss_pred CCCCEEEEEE------CCCCCHHHHHHHHHHHHHCCCCEEEEEC T ss_conf 9740599984------0333357899999987603688899953 No 192 >cd03220 ABC_KpsT_Wzt ABC_KpsT_Wzt The KpsT/Wzt ABC transporter subfamily is involved in extracellular polysaccharide export. Among the variety of membrane-linked or extracellular polysaccharides excreted by bacteria, only capsular polysaccharides, lipopolysaccharides, and teichoic acids have been shown to be exported by ABC transporters. A typical system is made of a conserved integral membrane and an ABC. In addition to these proteins, capsular polysaccharide exporter systems require two 'accessory' proteins to perform their function: a periplasmic (E.coli) or a lipid-anchored outer membrane protein called OMA (Neisseria meningitidis and Haemophilus influenzae) and a cytoplasmic membrane protein MPA2. Probab=31.36 E-value=33 Score=13.99 Aligned_cols=14 Identities=36% Similarity=0.482 Sum_probs=6.6 Q ss_pred ECCCCCHHHHHHHH Q ss_conf 21587236899999 Q gi|254780999|r 298 GLSGGIDSALCAAI 311 (562) Q Consensus 298 glSGGiDSal~a~l 311 (562) -+|||.---|..|. T Consensus 142 ~LSgG~kqRl~iA~ 155 (224) T cd03220 142 TYSSGMKARLAFAI 155 (224) T ss_pred CCCHHHHHHHHHHH T ss_conf 46999999999999 No 193 >TIGR03569 NeuB_NnaB N-acetylneuraminate synthase. This family is a subset of the pfam03102 and is believed to include only authentic NeuB N-acetylneuraminate (sialic acid) synthase enzymes. The majority of the genes identified by this model are observed adjacent to both the NeuA and NeuC genes which together effect the biosynthesis of CMP-N-acetylneuraminate from UDP-N-acetylglucosamine. Probab=31.19 E-value=33 Score=13.97 Aligned_cols=28 Identities=21% Similarity=0.416 Sum_probs=24.3 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCEEEECC Q ss_conf 8999999999999999985998999166 Q gi|254780999|r 18 DIAGNIAKARRAREEANRQGMDLILFTE 45 (562) Q Consensus 18 d~~~N~~~i~~~i~~A~~~gadLvvfPE 45 (562) +-.+|++.+.++|+.|++.|||.|=|.= T Consensus 10 NH~Gdl~~Ak~LI~~A~~sGadaVKFQ~ 37 (329) T TIGR03569 10 NHNGSLELAKKLVDAAAEAGADAVKFQT 37 (329) T ss_pred CCCCCHHHHHHHHHHHHHHCCCEEEEEC T ss_conf 7678099999999999994969999307 No 194 >TIGR00143 hypF [NiFe] hydrogenase maturation protein HypF; InterPro: IPR004421 The large subunit of [NiFe]-hydrogenase, as well as other nickel metalloenzymes, is synthesized as a precursor devoid of the metalloenzyme active site. This precursor then undergoes a complex post-translational maturation process that requires a number of accessory proteins , , . Members of the HypF family are accessory proteins involved in hydrogenase maturation. They contain the following domains: acylphosphatase, zinc fingers (2 repeats), a YrdC-like domain, and a C-terminal domain with a putative O-carbamoyltransferase motif. The presence of CO and CN- ligands of the active site iron atoms is essential for [NiFe]-hydrogenase enzyme activity . Both ligands have been suggested to originate from carbamoylphosphate , which is required for maturation of [NiFe]-hydrogenases . Escherichia coli HypF interacts with carbamoylphosphate as a substrate and releases inorganic phosphate . In addition, HypF also cleaves ATP into AMP and pyrophosphate in the presence of carbamoylphosphate. This, and the fact that HypF catalyzes a carbamoylphosphate-dependent pyrophosphate ATP exchange reaction, suggest that the protein catalyzes the activation of carbamoylphosphate . The mechanism of action of HypF, as well as of its individual domains, is not yet clear. Mutations in any of the three major signature motifs, the acylphosphatase, the zinc fingers, and the O-carbamoyltransferase motif, can block carbamoylphosphate phosphatase activity. This indicates an integrated cooperativity between these domains in the cleavage reaction . The N-terminal acylphosphatase (ACP) domain is thought to support the conversion of carbamoylphosphate into CO and CN- , . Biochemical results demonstrating its ACP activity are not available , . ACPs are small enzymes that specifically catalyze the hydrolysis of carboxylphosphate bonds in acylphosphates, including carbamoylphosphate . Zinc fingers have been implicated in bivalent cation binding or as part of a chaperone domain interacting with the large subunit precursor, but experimental studies on such a function are lacking thus far. The YrdC-like domain is present in protein families with regulatory functions (IPR012200 from INTERPRO, IPR010923 from INTERPRO) and has been implicated in RNA binding . It is not clear what function it may have in members of the HypF family. A C-terminal domain is distantly related to peptidase M22, but contains a conserved O-carbamoyltransferase motif required for the carbamoylphosphate phosphatase activity . The function of this domain is not clear. Nomenclature note: the following names are used as synonyms of HypF: HupY in Azotobacter chroococcum, HupN in Rhizobium leguminosarum, HydA in Escherichia coli. In other organisms, these names are used to designate various "hydrogenase cluster" proteins unrelated to the members of this family. ; GO: 0030528 transcription regulator activity. Probab=30.80 E-value=34 Score=13.93 Aligned_cols=160 Identities=11% Similarity=0.103 Sum_probs=69.4 Q ss_pred HCCCEEEE--CCCCCCC---CCC-CCCCCCEEEC----CCCEEEEECCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 33704642--1356887---662-1048643555----898599961677763799999833521236555755555447 Q gi|254780999|r 197 HVHLPIIY--VNQVGGQ---DEL-IFDGASFCFD----GQQQLAFQMKHFSEQNFMTEWHYDQQLSQWNYMSDDSASTMY 266 (562) Q Consensus 197 e~~~~vvy--~N~vGg~---d~l-vf~G~S~I~d----~~G~ii~~~~~f~e~~~i~d~d~d~~~~~~~~~~~~~~~~~~ 266 (562) +...|+|. +|.-|-. |+. +.+.-++|+| .+-+|+.+. +|..|-.++-....-++.+ .-.++... T Consensus 319 ~~~~p~VmTSaNl~G~P~~idn~~~l~~L~gIaDgFL~HnR~Iv~R~----DDSVV~~V~g~~~~lRRsR--GyaP~p~~ 392 (799) T TIGR00143 319 LLAKPLVMTSANLPGEPMAIDNEEILEKLQGIADGFLVHNRRIVNRV----DDSVVKEVAGELLFLRRSR--GYAPEPLK 392 (799) T ss_pred HCCCCEEEECCCCCCCCCCCCHHHHHHHHCCCCCCEECCCCCCCCCC----CCCEEEEECCCCCHHHHHC--CCCCCCCC T ss_conf 22777166026788988865779998861224150000361005858----7642764478210234304--76886000 Q ss_pred CCCCHHHHHHCCHHHHHHHHH----HHCCCCEEEEECC-CCCHHHHHH-HHHHHCC---------CHHH-EEEEECCCCC Q ss_conf 899988864220203588876----4224540575215-872368999-9998414---------9544-6797324666 Q gi|254780999|r 267 IPLQEEEADYNACVLSLRDYV----QKNNFHKVIIGLS-GGIDSALCA-AIAVDAL---------GKEN-VQTIMLPYKY 330 (562) Q Consensus 267 ~p~~~~e~~~~Alvlgl~d~~----~~~g~~~~viglS-GGiDSal~a-~la~~al---------g~~~-v~~~~mp~~~ 330 (562) +|........++|++|=.-+. -|.|.+=++|.-- |.+-+.=+. .--.+|+ -+-. |.+=+=|... T Consensus 393 lP~~s~~~~~~~LalGa~l~~t~~llk~~~si~~lSQh~Gdls~~~t~n~~fkeA~~~~~~~~d~~~~~~iv~D~HP~y~ 472 (799) T TIGR00143 393 LPKRSVKNQKKILALGAELKNTFALLKGGQSIAYLSQHIGDLSKYETYNKFFKEAVNFLLRLYDFEPIDVIVCDLHPQYN 472 (799) T ss_pred CCCCCCCCCCCEEEECCCCCCEEEEEECCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCC T ss_conf 78654578865798777542238998668348997355256556303568899999999974189850488864888831 Q ss_pred CCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 65221356999999721354267758999998863132 Q gi|254780999|r 331 TSPQSLEDAAACAKALGCKYDVLPIHDLVNHFFSLMSQ 368 (562) Q Consensus 331 ~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~~ 368 (562) |+ +-|+++++.+++++..| .+-+.+.++.+.+ T Consensus 473 st----~~Ae~l~~~fs~~~~~V--QHHhAHalA~~ae 504 (799) T TIGR00143 473 ST----QYAEELAKKFSLPVLRV--QHHHAHALAVMAE 504 (799) T ss_pred CH----HHHHHHHHHHCCCCCCH--HHHHHHHHHHHHH T ss_conf 07----89999998718750100--1478999999986 No 195 >COG2272 PnbA Carboxylesterase type B [Lipid metabolism] Probab=30.15 E-value=35 Score=13.86 Aligned_cols=87 Identities=25% Similarity=0.264 Sum_probs=46.8 Q ss_pred HHHHHHHHCCCC--E-EEEECCCCCHHHHHHHHHHHCCCH-HHEEEEECC--CCCCCCCCHHHHHHHHHHHCCC------ Q ss_conf 588876422454--0-575215872368999999841495-446797324--6666522135699999972135------ Q gi|254780999|r 282 SLRDYVQKNNFH--K-VIIGLSGGIDSALCAAIAVDALGK-ENVQTIMLP--YKYTSPQSLEDAAACAKALGCK------ 349 (562) Q Consensus 282 gl~d~~~~~g~~--~-~viglSGGiDSal~a~la~~alg~-~~v~~~~mp--~~~~s~~s~~~a~~la~~lg~~------ 349 (562) .++|....-|-. . -|.|-|.|-=|++++...=.|=|- .+.+...=+ ...+.+.+.+-|..+++.||+. T Consensus 167 WV~~NIe~FGGDp~NVTl~GeSAGa~si~~Lla~P~AkGLF~rAi~~Sg~~~~~~s~~~A~~~a~~f~~~lG~~~~~~~~ 246 (491) T COG2272 167 WVRDNIEAFGGDPQNVTLFGESAGAASILTLLAVPSAKGLFHRAIALSGAASRVTSREEAREKAAAFARALGIPEATLDK 246 (491) T ss_pred HHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 99998998489963147862051079998866276501789999985788776676799999999999984897778987 Q ss_pred CCCCCHHHHHHHHHHHHCC Q ss_conf 4267758999998863132 Q gi|254780999|r 350 YDVLPIHDLVNHFFSLMSQ 368 (562) Q Consensus 350 ~~~i~I~~~~~~~~~~~~~ 368 (562) +..++-+.+++.....+.. T Consensus 247 L~~~~~~~L~~~~~~~~~~ 265 (491) T COG2272 247 LRALSAEDLVKARLPLIGR 265 (491) T ss_pred HHCCCHHHHHHHHHHHCCC T ss_conf 7367998987543121225 No 196 >KOG4201 consensus Probab=30.02 E-value=3.9 Score=20.03 Aligned_cols=54 Identities=13% Similarity=0.174 Sum_probs=27.8 Q ss_pred CCCCCCCCCCCCCCE-EE-ECCEEEE--ECCCCCCCHHHHHHHHHHHCCCCEEEECCC Q ss_conf 332100377888761-47-6474562--011111110168999997147775993155 Q gi|254780999|r 123 FHEKRTFISGYSNDP-IV-FRDIRLG--ILICEDIWKNSNICKHLKKQGAEFLFSLNA 176 (562) Q Consensus 123 FdE~r~F~~G~~~~~-~~-~~g~kiG--v~ICeDlw~~~~~~~~la~~Gadlii~psA 176 (562) ..|..||..+...-. +. .-|+++- .+.|-|.-++....-....+|||-++-+.+ T Consensus 111 LTe~k~FkGsledL~~irk~~~~k~p~~~lL~KeFivd~~QI~~aR~~GADaVLLIva 168 (289) T KOG4201 111 LTEPKWFKGSLEDLVAIRKIAGVKCPPKCLLRKEFIVDPYQIYEARLKGADAVLLIVA 168 (289) T ss_pred ECCCHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHCCCHHHHHHHHHCCCCEEEHHHH T ss_conf 2370453053878999999716768817676788735799999998638851311898 No 197 >COG3845 ABC-type uncharacterized transport systems, ATPase components [General function prediction only] Probab=29.99 E-value=35 Score=13.84 Aligned_cols=19 Identities=21% Similarity=0.541 Sum_probs=10.2 Q ss_pred EEEEEEEEEECCEEEEEEE Q ss_conf 8789999996997999887 Q gi|254780999|r 95 GVLNSVVILDAGNIIAVRD 113 (562) Q Consensus 95 ~~yNsa~vi~~G~il~~y~ 113 (562) .+-+.+.|+..|++++.+. T Consensus 205 ~iaDrvTVLR~Gkvvgt~~ 223 (501) T COG3845 205 AIADRVTVLRRGKVVGTVD 223 (501) T ss_pred HHHCEEEEEECCEEEEEEC T ss_conf 7637369985774886635 No 198 >KOG1195 consensus Probab=29.79 E-value=35 Score=13.82 Aligned_cols=30 Identities=17% Similarity=0.330 Sum_probs=17.9 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCHHHHHHH Q ss_conf 358887642245405752158723689999 Q gi|254780999|r 281 LSLRDYVQKNNFHKVIIGLSGGIDSALCAA 310 (562) Q Consensus 281 lgl~d~~~~~g~~~~viglSGGiDSal~a~ 310 (562) -++-|++.+-.|.+.+--+|-|-|-...+. T Consensus 313 aaal~Rl~~y~fdr~iYVvd~~Q~~hf~~l 342 (567) T KOG1195 313 AAALDRLEKYSFDRLIYVVDSQQKDHFCQL 342 (567) T ss_pred HHHHHHHHHCCCCEEEEEECHHHHHHHHHH T ss_conf 999999874484428998356878899999 No 199 >TIGR01139 cysK cysteine synthase A; InterPro: IPR005859 This model discriminates cysteine synthase A (CysK) and cysteine synthase B (CysM) from cystathionine beta-synthase, a protein found primarily in eukaryotes and carrying a C-terminal CBS domain lacking from this protein. Bacterial proteins lacking the CBS domain but otherwise showing resemblance to cystathionine beta-synthases and considerable phylogenetic distance from known cysteine synthases were excluded. Cysteine synthase (O-acetylserine (thiol)-lyase, 2.5.1.47 from EC) is the enzyme responsible for the formation of cysteine from O-acetyl-serine and hydrogen sulphide with the concomitant release of acetic acid. In bacteria such two forms of the enzyme are known (genes cysK and cysM). CysK differs from CysM in that it can also use sulphide instead of thiosulphate, to produce cysteine instead of cysteine thiosulphonate. ; GO: 0004124 cysteine synthase activity, 0006535 cysteine biosynthetic process from serine. Probab=29.65 E-value=35 Score=13.80 Aligned_cols=37 Identities=22% Similarity=0.412 Sum_probs=26.3 Q ss_pred CCEEEEECCCCCHHHHHHHHHHHCCC-HHHEEEEECCCC Q ss_conf 54057521587236899999984149-544679732466 Q gi|254780999|r 292 FHKVIIGLSGGIDSALCAAIAVDALG-KENVQTIMLPYK 329 (562) Q Consensus 292 ~~~~viglSGGiDSal~a~la~~alg-~~~v~~~~mp~~ 329 (562) -+|+++|+|.|.-=+-++-||.+ ++ ++++.-+++|+. T Consensus 265 eEGiL~GISSGA~vaAAl~~Ak~-~~n~dK~iV~i~Ps~ 302 (312) T TIGR01139 265 EEGILVGISSGAAVAAALKLAKR-LENPDKLIVVILPST 302 (312) T ss_pred HCCEEECCCHHHHHHHHHHHHHH-HCCCCCEEEEEECCC T ss_conf 67801015578999999999873-248898899982789 No 200 >cd01018 ZntC Metal binding protein ZntC. These proteins are predicted to function as initial receptors in ABC transport of metal ions. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a long alpha helix and bind their specific ligands in the cleft between these domains. In addition, many of these proteins possess a metal-binding histidine-rich motif (repetitive HDH sequence). Probab=29.58 E-value=35 Score=13.79 Aligned_cols=63 Identities=16% Similarity=0.249 Sum_probs=31.8 Q ss_pred HHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 4224540575215872368999999841495446797324666652213569999997213542677 Q gi|254780999|r 288 QKNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLP 354 (562) Q Consensus 288 ~~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~ 354 (562) +.-|++-+.+..+|.=-|+--.+-..+.+-..+|.+|..-..+++ ..++.+|+.+|++..+++ T Consensus 185 ~~ygl~~~~i~~~~~eps~~~l~~l~~~ik~~~v~~If~E~~~~~----k~~~~ia~etg~kv~~ld 247 (266) T cd01018 185 RDYGLTQIPIEEEGKEPSPADLKRLIDLAKEKGVRVVFVQPQFST----KSAEAIAREIGAKVVTID 247 (266) T ss_pred HHCCCEEEECCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCC----HHHHHHHHHHCCCEEEEC T ss_conf 867987986058888989999999999999839989997089990----999999997099379967 No 201 >pfam04392 ABC_sub_bind ABC transporter substrate binding protein. This family contains many hypothetical proteins and some ABC transporter substrate binding proteins. Probab=28.65 E-value=37 Score=13.69 Aligned_cols=106 Identities=18% Similarity=0.156 Sum_probs=53.9 Q ss_pred EEECCCCCHHHHHHHHHHHCCCH-HHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHH--HHHHHHCCCCC- Q ss_conf 75215872368999999841495-446797324666652213569999997213542677589999--98863132346- Q gi|254780999|r 296 IIGLSGGIDSALCAAIAVDALGK-ENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVN--HFFSLMSQFLQ- 371 (562) Q Consensus 296 viglSGGiDSal~a~la~~alg~-~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~--~~~~~~~~~~~- 371 (562) +-|+|--.+..-...+..+.++. ++|--+.-|+.-++....+..+..|+.+|++....++...-+ .....+....+ T Consensus 107 vTGvs~~~~~~~~l~ll~~l~P~~k~igviyn~~e~~s~~~~~~~~~~a~~~gi~l~~~~v~~~~ei~~a~~~l~~~~Da 186 (292) T pfam04392 107 VTGVSDLVDVEQTIELIKKLLPNVKSIGVYYSPSEANSVSLVEEIKKYAKKSGIKVVEASVPSSNDVPSAMSSMAGKVDA 186 (292) T ss_pred EEEEECCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHCCCCE T ss_conf 67852774799999999986889858999957998657999999999999769989999668866799999974328988 Q ss_pred -CCCCCHHHHCCCCHHHHHHHHHHHHHCCCEECCCC Q ss_conf -87431011012330466779888751695443643 Q gi|254780999|r 372 -EEPSGIVAENIQSRIRGNILMALSNHSKAMLLTTS 406 (562) Q Consensus 372 -~~~~~~~~eN~qaR~R~~~L~~~an~~~~lvl~t~ 406 (562) .-..|- .+.+ .+..+...+++.+--+.+.. T Consensus 187 l~i~~d~---~v~s--~~~~i~~~a~~~kiPv~~~~ 217 (292) T pfam04392 187 IFIPTDN---LIAS--AFTAVLQEANKAKIPVITSD 217 (292) T ss_pred EEEECCC---CHHH--HHHHHHHHHHHCCCCEEECC T ss_conf 9993781---0788--99999999997499989577 No 202 >TIGR00389 glyS_dimeric glycyl-tRNA synthetase; InterPro: IPR002315 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . In eubacteria, glycyl-tRNA synthetase (6.1.1.14 from EC) is an alpha2/beta2 tetramer composed of 2 different subunits , , . In some eubacteria, in archaea and eukaryota, glycyl-tRNA synthetase is an alpha2 dimer, this family. It belongs to class IIc and is one of the most complex synthetases. What is most interesting is the lack of similarity between the two types: divergence at the sequence level is so great that it is impossible to infer descent from common genes. The alpha (see IPR002310 from INTERPRO) and beta subunits (see IPR002311 from INTERPRO) also lack significant sequence similarity. However, they are translated from a single mRNA , and a single chain glycyl-tRNA synthetase from Chlamydia trachomatis has been found to have significant similarity with both domains, suggesting divergence from a single polypeptide chain . The sequence and crystal structure of the homodimeric glycyl-tRNA synthetase from Thermus thermophilus, shows that each monomer consists of an active site strongly resembling that of the aspartyl and seryl enzymes, a C-terminal anticodon recognition domain of 100 residues and a third domain unusually inserted between motifs 1 and 2 almost certainly interacting with the acceptor arm of tRNA(Gly). The C-terminal domain has a novel five-stranded parallel-antiparallel beta-sheet structure with three surrounding helices. The active site residues most probably responsible for substrate recognition, in particular in the Gly binding pocket, can be identified by inference from aspartyl-tRNA synthetase due to the conserved nature of the class II active site , .; GO: 0000166 nucleotide binding, 0004820 glycine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006426 glycyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=28.31 E-value=26 Score=14.63 Aligned_cols=12 Identities=33% Similarity=0.625 Sum_probs=7.1 Q ss_pred HHHHHHHHHHCC Q ss_conf 999999998279 Q gi|254780999|r 489 LDDIIKRIVENE 500 (562) Q Consensus 489 lD~il~~~i~~~ 500 (562) ||.|.|+++... T Consensus 478 i~Ri~Y~~l~hs 489 (606) T TIGR00389 478 IDRIIYSLLEHS 489 (606) T ss_pred HHHHHHHHHHHH T ss_conf 378999999874 No 203 >PRK05077 frsA fermentation/respiration switch protein; Reviewed Probab=27.81 E-value=38 Score=13.60 Aligned_cols=37 Identities=27% Similarity=0.283 Sum_probs=23.5 Q ss_pred EEECCCCCHHHHH--HHHHHHCCCHHHE--EEEECCCCCCC Q ss_conf 7521587236899--9999841495446--79732466665 Q gi|254780999|r 296 IIGLSGGIDSALC--AAIAVDALGKENV--QTIMLPYKYTS 332 (562) Q Consensus 296 viglSGGiDSal~--a~la~~alg~~~v--~~~~mp~~~~s 332 (562) +|=+|||+||-=+ --|.-+-|.|.++ +.+-|||..-| T Consensus 196 tViv~GgLDsLQ~D~~rLF~dylaP~giAMLTlDMPSvG~S 236 (414) T PRK05077 196 TVLMCGGLDSLQTDYYRLFRDYLAPRGIAMLTIDMPSVGFS 236 (414) T ss_pred EEEEECCHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCC T ss_conf 59995672477888999999852646733797037877653 No 204 >TIGR00421 ubiX_pad polyprenyl P-hydroxybenzoate and phenylacrylic acid decarboxylases; InterPro: IPR004507 This family contains flavoproteins, which are aromatic acid decarboxylases. An example is the Saccharomyces cerevisiae gene, PAD1 that encodes phenylacrylic acid decarboxylase. Mutations of this gene are viable and confer resistance to cinnamic acid. ; GO: 0016831 carboxy-lyase activity. Probab=27.58 E-value=38 Score=13.57 Aligned_cols=26 Identities=35% Similarity=0.580 Sum_probs=19.5 Q ss_pred CCCCCCCCCCHHHH-----HHHHHHHCCCCC Q ss_conf 24666652213569-----999997213542 Q gi|254780999|r 326 LPYKYTSPQSLEDA-----AACAKALGCKYD 351 (562) Q Consensus 326 mp~~~~s~~s~~~a-----~~la~~lg~~~~ 351 (562) ||++||-+.|++|= -...+.|||+.. T Consensus 146 ~PaFY~rPkS~~Dl~~~~VgR~LD~lGI~~d 176 (181) T TIGR00421 146 VPAFYNRPKSVEDLIDFIVGRVLDQLGIEND 176 (181) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCC T ss_conf 5544478988788986779877776244378 No 205 >KOG0452 consensus Probab=27.51 E-value=38 Score=13.56 Aligned_cols=158 Identities=22% Similarity=0.296 Sum_probs=76.2 Q ss_pred HHHHHHHHHHCCCCE------EEEECCCCCHHHHHHHHHHHCC-------CHHHEEEEECCC-C--CCCCCCHHHHHHHH Q ss_conf 035888764224540------5752158723689999998414-------954467973246-6--66522135699999 Q gi|254780999|r 280 VLSLRDYVQKNNFHK------VIIGLSGGIDSALCAAIAVDAL-------GKENVQTIMLPY-K--YTSPQSLEDAAACA 343 (562) Q Consensus 280 vlgl~d~~~~~g~~~------~viglSGGiDSal~a~la~~al-------g~~~v~~~~mp~-~--~~s~~s~~~a~~la 343 (562) ..|+..|+.|.||.= -.||=||-+|-+++.+|--.-| |..|.-|-.-|. + |-..--+--|.++| T Consensus 487 ~SGv~pyL~klGF~IvGYGC~TCiGNsgpl~e~V~~ai~~ndlV~~gvLSGNrNFEGRvhp~tRANYLASPpLvvaYaia 566 (892) T KOG0452 487 ESGVLPYLEKLGFDIVGYGCMTCIGNSGPLDEAVVNAIEQNDLVAAGVLSGNRNFEGRVHPNTRANYLASPPLVVAYAIA 566 (892) T ss_pred HCCCHHHHHHCCCEEECCCCCEECCCCCCCCHHHHHHHHCCCEEEEEEEECCCCCCCCCCCCCHHHHCCCCHHHHHHHHC T ss_conf 43536679861942653322121168887887898753338848998840687766511664100001486477664442 Q ss_pred HHHCCCCCCCCHH-----------------HHHHHHHHH--HCCCCCCCCCCHHHHCCCCH--------HHHHHHHHHHH Q ss_conf 9721354267758-----------------999998863--13234687431011012330--------46677988875 Q gi|254780999|r 344 KALGCKYDVLPIH-----------------DLVNHFFSL--MSQFLQEEPSGIVAENIQSR--------IRGNILMALSN 396 (562) Q Consensus 344 ~~lg~~~~~i~I~-----------------~~~~~~~~~--~~~~~~~~~~~~~~eN~qaR--------~R~~~L~~~an 396 (562) -...|.+..-||. +-+..++.. +...|. -++++|+-. .-...||.+.. T Consensus 567 GtV~IDfe~eplg~~~~Gk~vfl~DIWPtr~Ev~~ve~~~Vip~mFk-----~~y~~I~~gn~~Wn~L~~p~~~Ly~Wd~ 641 (892) T KOG0452 567 GTVNIDFETEPLGVDPDGKNVFLRDIWPTREEVAEVEEEHVIPSMFK-----EVYEKIELGNPDWNQLEVPSSKLYPWDP 641 (892) T ss_pred CEEECCEECCCCCCCCCCCEEEEEECCCCHHHHHHHHHHCCCHHHHH-----HHHHHHHHCCHHHHHCCCCCCCEECCCC T ss_conf 51350100255431899876887514787799999988600289999-----9999886167114420277663021588 Q ss_pred H---------CCCEECCCCCHHHHHHHHCCCC-CCCC--CCCCCCCCCCHHHHHHHHHHH Q ss_conf 1---------6954436432447754110236-6655--540101776336589999996 Q gi|254780999|r 397 H---------SKAMLLTTSNKSEISVGYGTLY-GDMS--GGFNPLKDLYKTQVFQLASWR 444 (562) Q Consensus 397 ~---------~~~lvl~t~nksE~~~Gy~T~~-GD~~--g~~~p~~~l~Kt~v~~la~~~ 444 (562) . +.+|....--..+.--.||-++ ||+. -.++|-|++.|+-- -|||+ T Consensus 642 ~STYI~~ppfF~~mT~~~p~~~~i~~A~~LLnlGDSvTTDHISPAGsI~r~Sp--AAr~L 699 (892) T KOG0452 642 KSTYIKEPPFFEGMTRDLPGPQSIEDAYCLLNLGDSVTTDHISPAGSIARTSP--AARYL 699 (892) T ss_pred CCCEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCH--HHHHH T ss_conf 87511078301131237998310101168886067433256487753133788--88888 No 206 >TIGR02440 FadJ fatty oxidation complex, alpha subunit FadJ; InterPro: IPR012802 Members of this family represent the alpha subunit of the multifunctional enzyme complex of the fatty acid degradation cycle, which plays a minor role in aerobic beta-oxidation of fatty acids. The FadJI complex is necessary for anaerobic growth on short-chain acids with nitrate as an electron acceptor. Activities include: enoyl-CoA hydratase (4.2.1.17 from EC), 3-hydroxyacyl-CoA dehydrogenase (1.1.1.35 from EC), and 3-hydroxybutyryl-CoA epimerase (5.1.2.3 from EC). A representative is the Escherichia coli FadJ protein (also named YfcX) (P77399 from SWISSPROT). This entry excludes FadB of IPR012799 from INTERPRO.; GO: 0003857 3-hydroxyacyl-CoA dehydrogenase activity, 0004300 enoyl-CoA hydratase activity, 0008692 3-hydroxybutyryl-CoA epimerase activity, 0051287 NAD binding, 0006635 fatty acid beta-oxidation, 0016507 fatty acid beta-oxidation multienzyme complex. Probab=27.38 E-value=39 Score=13.55 Aligned_cols=46 Identities=30% Similarity=0.294 Sum_probs=34.2 Q ss_pred HHHHHHCCCHHHEEEE-------------ECCCCCC------CCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 9999841495446797-------------3246666------52213569999997213542677 Q gi|254780999|r 309 AAIAVDALGKENVQTI-------------MLPYKYT------SPQSLEDAAACAKALGCKYDVLP 354 (562) Q Consensus 309 a~la~~alg~~~v~~~-------------~mp~~~~------s~~s~~~a~~la~~lg~~~~~i~ 354 (562) --||+.|-.||||+|+ ..|+..| |++|.--.-+||+-.|=+-.++. T Consensus 429 ~qIAa~A~RPE~ViGLHYFSPVeKMPLVEvIpH~~TenGsatS~~TIATtValA~KQGKTpIVV~ 493 (732) T TIGR02440 429 GQIAAAAERPENVIGLHYFSPVEKMPLVEVIPHAKTENGSATSEETIATTVALAKKQGKTPIVVA 493 (732) T ss_pred HHHHHHCCCCCCEECCEECCCCCCCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEE T ss_conf 89998404975212030007767588557607887676666764579999999850699527873 No 207 >KOG1917 consensus Probab=27.07 E-value=23 Score=15.08 Aligned_cols=13 Identities=23% Similarity=0.209 Sum_probs=5.9 Q ss_pred CHHHHHHHHHHHH Q ss_conf 8999999999999 Q gi|254780999|r 18 DIAGNIAKARRAR 30 (562) Q Consensus 18 d~~~N~~~i~~~i 30 (562) ..++|.+-|.+.+ T Consensus 51 ~~eg~ik~iv~~f 63 (1125) T KOG1917 51 HLEGAIKFIVRKF 63 (1125) T ss_pred CHHHHHHHHHHHH T ss_conf 2002799999961 No 208 >cd03269 ABC_putative_ATPase This subfamily is involved in drug resistance, nodulation, lipid transport, and bacteriocin and lantibiotic immunity. In eubacteria and archaea, the typical organization consists of one ABC and one or two IMs. Eukaryote systems of the ABCA subfamily display ABC domains strongly similar to this family. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=26.77 E-value=40 Score=13.48 Aligned_cols=21 Identities=19% Similarity=0.308 Sum_probs=8.6 Q ss_pred HHHHHHCCCEECCCCCHHHHH Q ss_conf 888751695443643244775 Q gi|254780999|r 392 MALSNHSKAMLLTTSNKSEIS 412 (562) Q Consensus 392 ~~~an~~~~lvl~t~nksE~~ 412 (562) -.++.+-..++++|-+..|.. T Consensus 172 ~~~~~~g~til~ssH~l~e~~ 192 (210) T cd03269 172 RELARAGKTVILSTHQMELVE 192 (210) T ss_pred HHHHHCCCEEEEECCCHHHHH T ss_conf 999968989999888489999 No 209 >PRK02848 consensus Probab=26.57 E-value=40 Score=13.45 Aligned_cols=40 Identities=23% Similarity=0.348 Sum_probs=19.7 Q ss_pred CCCCE---EEEEEECCCCCCCHHHH-HHHHHHHHHH-HHHCCCCEEEECC Q ss_conf 98862---89999638836889999-9999999999-9985998999166 Q gi|254780999|r 1 MLKKL---KIAIAQLNPVVGDIAGN-IAKARRAREE-ANRQGMDLILFTE 45 (562) Q Consensus 1 M~~~m---kIal~Q~n~~~gd~~~N-~~~i~~~i~~-A~~~gadLvvfPE 45 (562) ||.+. +||+ +.||+.+- .+-+.+.+.+ .....+.++++-- T Consensus 1 mm~~~~KP~IaI-----T~GDPaGIGpEIilKal~~~~~~~~~~~viigd 45 (341) T PRK02848 1 MMTAAAKPVIAL-----TLGDPAGIGPELIAKLLADPDVREKANIVLVGD 45 (341) T ss_pred CCCCCCCCEEEE-----ECCCCCHHHHHHHHHHHHCHHHCCCCCEEEEEC T ss_conf 998899998999-----347874338999999984743406998899989 No 210 >TIGR02805 exbB2 tonB-system energizer ExbB; InterPro: IPR014172 Members of this protein family appear to be the ExbB protein of an ExbBD proton-transporting membrane complex that, by means of TonB, energizes transport by TonB-dependent receptors. Note that this family represents one of at least two distinct groups TolQ homologues designated ExbB - see also IPR014164 from INTERPRO. Each group associates with a distinct group of ExbD proteins, and a single species may have two ExbB/ExbD/TonB systems.. Probab=26.49 E-value=28 Score=14.51 Aligned_cols=23 Identities=43% Similarity=0.647 Sum_probs=15.2 Q ss_pred EEEECCCC-CHHHHH-----HHHHHHCCC Q ss_conf 57521587-236899-----999984149 Q gi|254780999|r 295 VIIGLSGG-IDSALC-----AAIAVDALG 317 (562) Q Consensus 295 ~viglSGG-iDSal~-----a~la~~alg 317 (562) +=+|..|| ||...+ +||-+-||| T Consensus 82 Y~~G~gggdID~~~i~~gLSLALKATALG 110 (139) T TIGR02805 82 YEMGLGGGDIDVKAILVGLSLALKATALG 110 (139) T ss_pred HHHCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 86325667645115455612899999977 No 211 >COG1535 EntB Isochorismate hydrolase [Secondary metabolites biosynthesis, transport, and catabolism] Probab=26.23 E-value=41 Score=13.41 Aligned_cols=80 Identities=20% Similarity=0.348 Sum_probs=47.5 Q ss_pred HHHHHHHHHCCCEEEECCCCCCCCC----CC--CCCCCEEECCCCE-EEEE-CCCCCCCEEEEEEEECCCCCCCCCCCCC Q ss_conf 8899985433704642135688766----21--0486435558985-9996-1677763799999833521236555755 Q gi|254780999|r 189 EIVTGQISHVHLPIIYVNQVGGQDE----LI--FDGASFCFDGQQQ-LAFQ-MKHFSEQNFMTEWHYDQQLSQWNYMSDD 260 (562) Q Consensus 189 ~~~~~~a~e~~~~vvy~N~vGg~d~----lv--f~G~S~I~d~~G~-ii~~-~~~f~e~~~i~d~d~d~~~~~~~~~~~~ 260 (562) ..++..+.++++||+|.-+.|.++. +. |+|--+-..|..+ ++.+ +|. ..+.+++.|..+. T Consensus 61 ~~Lr~~~~~~giPVvyTaqp~~qs~~draLL~d~WGpgl~~~p~~~~vv~~l~P~-~~D~vL~kwrYsA----------- 128 (218) T COG1535 61 AKLRIWCKQAGIPVVYTAQPGEQSPEDRALLKDFWGPGLTASPEQQKVVDELAPG-ADDTVLTKWRYSA----------- 128 (218) T ss_pred HHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCC-CCCEEEEEEEHHH----------- T ss_conf 9999999982996799806776997898999973289888886666667750899-8853775411366----------- Q ss_pred CCCCCCCCCCHHHHHHCCHHHHHHHHHHHCCCCEEEE Q ss_conf 5554478999888642202035888764224540575 Q gi|254780999|r 261 SASTMYIPLQEEEADYNACVLSLRDYVQKNNFHKVII 297 (562) Q Consensus 261 ~~~~~~~p~~~~e~~~~Alvlgl~d~~~~~g~~~~vi 297 (562) .+ ..-|-+.+++.|-...+| T Consensus 129 --------------F~---~s~Llq~lr~~grdQLII 148 (218) T COG1535 129 --------------FH---RSPLLQMLREKGRDQLII 148 (218) T ss_pred --------------HH---CCHHHHHHHHCCCCCEEE T ss_conf --------------60---582999998728772897 No 212 >COG2876 AroA 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase [Amino acid transport and metabolism] Probab=26.08 E-value=41 Score=13.40 Aligned_cols=48 Identities=21% Similarity=0.358 Sum_probs=34.9 Q ss_pred HHHHHHHHHCCCCEEE----ECCCCCCCCCCH-HHHHHHHHHHHHHCCCEEEE Q ss_conf 6899999714777599----315553335807-88878899985433704642 Q gi|254780999|r 157 SNICKHLKKQGAEFLF----SLNASPYYHNKL-KKRHEIVTGQISHVHLPIIY 204 (562) Q Consensus 157 ~~~~~~la~~Gadlii----~psASp~~~~k~-~~R~~~~~~~a~e~~~~vvy 204 (562) ...+...+..||+++- -|..|||.+.-. ..+...++..+.+++.+++- T Consensus 62 ~~~A~~vk~~Ga~~lRGgafKPRTSPYsFQGlge~gL~~l~~a~~~~Gl~vvt 114 (286) T COG2876 62 RETAESVKAAGAKALRGGAFKPRTSPYSFQGLGEEGLKLLKRAADETGLPVVT 114 (286) T ss_pred HHHHHHHHHCCHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEE T ss_conf 99999998736223137767888995333665778899999988872990588 No 213 >PRK11107 hybrid sensory histidine kinase BarA; Provisional Probab=25.98 E-value=41 Score=13.38 Aligned_cols=131 Identities=17% Similarity=0.188 Sum_probs=60.7 Q ss_pred ECCCCCCCHHHHHHHHHHHHH-HHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHH-HHHHHHHHHHHHHCCCEEEEEE Q ss_conf 638836889999999999999-99985998999166444799844541488999999-9999999999887697899999 Q gi|254780999|r 11 QLNPVVGDIAGNIAKARRARE-EANRQGMDLILFTELFISGYPPEDLVFKKSFIQAC-SSAIDTLKSDTHDGGAGIVVGF 88 (562) Q Consensus 11 Q~n~~~gd~~~N~~~i~~~i~-~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~-~~~l~~La~~~~~~~i~iivG~ 88 (562) +++..+.|+..-++.+...+. .|.++|.++.+.-+-.++-+-.+| +.-++++ ..-+..-.+++.+..+.+-+-. T Consensus 360 ~le~~~f~l~~~~~~v~~~~~~~a~~k~l~l~~~~~~~~p~~v~gD----~~Rl~QIl~NLl~NAiKfT~~G~I~v~v~~ 435 (920) T PRK11107 360 ILENIPFSLRETLDEVVTLLAHSAHEKGLELTLNIDPDVPDNVIGD----PLRLQQIITNLVGNAIKFTESGNIDILVEL 435 (920) T ss_pred EEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCEEEC----HHHHHHHHHHHHHHHHHCCCCCCEEEEEEE T ss_conf 6740167899999999999999999739879999789997306538----999999999999999971889937999999 Q ss_pred EEEECCEEEEEEEEEECCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCEEEEECCCCCC Q ss_conf 99888987899999969979998877762355433321003778887614764745620111111 Q gi|254780999|r 89 PRQDQEGVLNSVVILDAGNIIAVRDKINLPNYSEFHEKRTFISGYSNDPIVFRDIRLGILICEDI 153 (562) Q Consensus 89 p~~~~~~~yNsa~vi~~G~il~~y~K~~LP~y~~FdE~r~F~~G~~~~~~~~~g~kiGv~ICeDl 153 (562) ....++...-...|-+.|- |+-. ..++ ..|+ -|...+....=.++|.-+|+.||..+ T Consensus 436 ~~~~~~~~~l~~~V~DTGi--GI~~-~~~~--~iF~---~F~q~d~s~~r~~gGtGLGLaI~k~L 492 (920) T PRK11107 436 RALSDEKVQLEVQIRDTGI--GISE-RQQS--QLFQ---AFRQADASISRRHGGTGLGLVITQKL 492 (920) T ss_pred EECCCCEEEEEEEEEEECC--CCCH-HHHH--HHHC---CCCCCCCCCCCCCCCCCCHHHHHHHH T ss_conf 9647984899999999705--8699-9999--7854---20156765445679877429999999 No 214 >pfam02677 DUF208 Uncharacterized BCR, COG1636. Probab=25.96 E-value=41 Score=13.38 Aligned_cols=62 Identities=19% Similarity=0.260 Sum_probs=38.5 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHCCCCEEE-ECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCC Q ss_conf 7456201111111016899999714777599-315553335807888788999854337046421356 Q gi|254780999|r 142 DIRLGILICEDIWKNSNICKHLKKQGAEFLF-SLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQV 208 (562) Q Consensus 142 g~kiGv~ICeDlw~~~~~~~~la~~Gadlii-~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~v 208 (562) |.|. ..|+|++... .++..+..|-|.+- .++-||+. +.+.-.+.....+.+.++.++|.+-- T Consensus 82 g~RC--~~Cy~~RL~~-tA~~A~e~gfd~ftTTL~~Sp~k--~~~~I~~iG~~~~~~~gi~f~~~DfR 144 (176) T pfam02677 82 GKRC--TYCYDMRLEE-TAQYAKEHGFDAFTTTLLISPYK--NHEQINEIGERAAKEYGVEFLYSDFR 144 (176) T ss_pred CCHH--HHHHHHHHHH-HHHHHHHCCCCEEECCCCCCCCC--CHHHHHHHHHHHHHHHCCCEEECCCC T ss_conf 8715--9999999999-99999985999997646177764--79999999999999979978500047 No 215 >PRK13546 teichoic acids export protein ATP-binding subunit; Provisional Probab=25.66 E-value=42 Score=13.35 Aligned_cols=14 Identities=29% Similarity=0.377 Sum_probs=6.6 Q ss_pred ECCCCCHHHHHHHH Q ss_conf 21587236899999 Q gi|254780999|r 298 GLSGGIDSALCAAI 311 (562) Q Consensus 298 glSGGiDSal~a~l 311 (562) -+|||.---|..|. T Consensus 143 ~LSgGqkqrl~lA~ 156 (264) T PRK13546 143 KYSSGMRAKLGFSI 156 (264) T ss_pred CCCHHHHHHHHHHH T ss_conf 47999999999999 No 216 >PRK06102 hypothetical protein; Provisional Probab=25.50 E-value=37 Score=13.64 Aligned_cols=15 Identities=20% Similarity=0.149 Sum_probs=8.9 Q ss_pred HHHHHHHHCCCCEEE Q ss_conf 899999714777599 Q gi|254780999|r 158 NICKHLKKQGAEFLF 172 (562) Q Consensus 158 ~~~~~la~~Gadlii 172 (562) .+.+.|...||-++- T Consensus 106 ~vV~rLr~aGaiilG 120 (452) T PRK06102 106 AVVALLARAGMVSLG 120 (452) T ss_pred HHHHHHHHCCCEEEE T ss_conf 999999978997997 No 217 >COG1029 FwdB Formylmethanofuran dehydrogenase subunit B [Energy production and conversion] Probab=25.29 E-value=42 Score=13.30 Aligned_cols=17 Identities=24% Similarity=0.229 Sum_probs=10.3 Q ss_pred CCCCCCCCCHHHHHHHH Q ss_conf 40101776336589999 Q gi|254780999|r 425 GFNPLKDLYKTQVFQLA 441 (562) Q Consensus 425 ~~~p~~~l~Kt~v~~la 441 (562) +..|.+++|+.-+..|+ T Consensus 342 ~sDp~ah~P~~~~~~l~ 358 (429) T COG1029 342 ASDPGAHFPRDAVEHLA 358 (429) T ss_pred ECCCCCCCHHHHHHHHH T ss_conf 53841127199998762 No 218 >cd04863 MtLigD_Pol_like MtLigD_Pol_like: Polymerase (Pol) domain of bacterial LigD proteins similar to Mycobacterium tuberculosis (Mt)LigD. The LigD Pol domain belongs to the archaeal/eukaryal primase (AEP) superfamily. In prokaryotes, LigD along with Ku is required for non-homologous end joining (NHEJ)-mediated repair of DNA double-strand breaks (DSB). NHEJ-mediated DNA DSB repair is error-prone. MtLigD is monomeric and contains an N-terminal Pol domain, a central phosphoesterase module, and a C-terminal ligase domain. It has been suggested that LigD Pol contributes to NHEJ-mediated DNA DSB repair in vivo, by filling in short 5'-overhangs with ribonucleotides; the filled in termini would then be sealed by the associated LigD ligase domain, resulting in short stretches of RNA incorporated into the genomic DNA. The MtLigD Pol domain is stimulated by manganese, is error-prone, and prefers adding rNTPs to dNTPs in vitro. The MtLigD Pol domain has been shown to prefer DNA gapped substrates Probab=25.23 E-value=30 Score=14.25 Aligned_cols=26 Identities=19% Similarity=0.370 Sum_probs=13.6 Q ss_pred HHHHHHHHHCCCCEE--EEECCCCCHHH Q ss_conf 358887642245405--75215872368 Q gi|254780999|r 281 LSLRDYVQKNNFHKV--IIGLSGGIDSA 306 (562) Q Consensus 281 lgl~d~~~~~g~~~~--viglSGGiDSa 306 (562) +||..|++-||.+|+ ++.|.++.|.. T Consensus 134 lgL~sf~KTSG~kGlHV~vPl~~~~~~~ 161 (231) T cd04863 134 LGLASFPKTSGSKGLHLYVPLDGPVSSD 161 (231) T ss_pred CCCCEEEEECCCCEEEEEEECCCCCCHH T ss_conf 6995446706998679999789999989 No 219 >pfam04310 MukB MukB N-terminal. This family represents the N-terminal region of MukB, one of a group of bacterial proteins essential for the movement of nucleoids from mid-cell towards the cell quarters (i.e. chromosome partitioning). The structure of the N-terminal domain consists of an antiparallel six-stranded beta sheet surrounded by one helix on one side and by five helices on the other side. It contains an exposed Walker A loop in an unexpected helix-loop-helix motif (in other proteins, Walker A motifs generally adopt a P loop conformation as part of a strand-loop-helix motif embedded in a conserved topology of alternating helices and (parallel) beta strands). Probab=25.14 E-value=36 Score=13.73 Aligned_cols=17 Identities=35% Similarity=0.513 Sum_probs=9.0 Q ss_pred EECCCCCHHHHHHHHHH Q ss_conf 52158723689999998 Q gi|254780999|r 297 IGLSGGIDSALCAAIAV 313 (562) Q Consensus 297 iglSGGiDSal~a~la~ 313 (562) -.++||||||++--|.. T Consensus 203 aslyGgissaItrsLrd 219 (227) T pfam04310 203 ASLYGGISSAITRSLRD 219 (227) T ss_pred HHHHCCHHHHHHHHHHH T ss_conf 98716726888888998 No 220 >pfam10042 DUF2278 Uncharacterized conserved protein (DUF2278). Members of this family of hypothetical bacterial proteins have no known function. Probab=25.11 E-value=42 Score=13.28 Aligned_cols=21 Identities=29% Similarity=0.267 Sum_probs=7.8 Q ss_pred CCCCEEECCCCEEEEECCCCC Q ss_conf 486435558985999616777 Q gi|254780999|r 217 DGASFCFDGQQQLAFQMKHFS 237 (562) Q Consensus 217 ~G~S~I~d~~G~ii~~~~~f~ 237 (562) ||+=+|--++|+..+-.-.|+ T Consensus 175 DGall~~~~d~~w~a~FlaFq 195 (206) T pfam10042 175 DGALLVHFSDGQWAAVFLAFA 195 (206) T ss_pred CCEEEEECCCCCEEEEEEHHH T ss_conf 664999927992887662033 No 221 >PRK13482 DNA integrity scanning protein DisA; Provisional Probab=25.08 E-value=16 Score=16.10 Aligned_cols=54 Identities=22% Similarity=0.228 Sum_probs=34.4 Q ss_pred EECCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCC Q ss_conf 2011111110168999997147775993155533358078887889998543370464213568 Q gi|254780999|r 146 GILICEDIWKNSNICKHLKKQGAEFLFSLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVG 209 (562) Q Consensus 146 Gv~ICeDlw~~~~~~~~la~~Gadlii~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vG 209 (562) ++.+.+|.+ +.+. |++-++|+.|-. ....-.||......|+.++.+||.+++-- T Consensus 70 AIVls~d~~------kIl~---ANvqL~PD~sIp-T~ETGtRHRTAeRvAkQTg~~VIaVSqrr 123 (352) T PRK13482 70 AIVLSSDGS------KILR---ANVQLVPDPSIP-TSETGTRHRTAERVAKQTGYPVIAVSQRR 123 (352) T ss_pred CEEECCCHH------HHHH---HHHEECCCCCCC-CCCCCCCHHHHHHHHHHHCCEEEEEECCC T ss_conf 189677540------6877---321006999998-88776205679999998699289996234 No 222 >pfam09587 PGA_cap Bacterial capsule synthesis protein PGA_cap. This protein is a putative poly-gamma-glutamate capsule biosynthesis protein found in bacteria. Poly-gamma-glutamate is a natural polymer that may be involved in virulence and may help bacteria survive in high salt concentrations. It is a surface-associated protein. Probab=25.04 E-value=43 Score=13.27 Aligned_cols=14 Identities=21% Similarity=0.347 Sum_probs=6.5 Q ss_pred HHHHHHHHCCCCEE Q ss_conf 89999971477759 Q gi|254780999|r 158 NICKHLKKQGAEFL 171 (562) Q Consensus 158 ~~~~~la~~Gadli 171 (562) .....|+..|.|++ T Consensus 66 ~~~~~L~~~G~d~v 79 (237) T pfam09587 66 ENAEALKAAGFDVV 79 (237) T ss_pred HHHHHHHHCCCCEE T ss_conf 99999998499999 No 223 >PRK09856 fructoselysine 3-epimerase; Provisional Probab=25.00 E-value=43 Score=13.27 Aligned_cols=13 Identities=0% Similarity=-0.003 Sum_probs=6.5 Q ss_pred HHHHHHCCCCEEE Q ss_conf 8876422454057 Q gi|254780999|r 284 RDYVQKNNFHKVI 296 (562) Q Consensus 284 ~d~~~~~g~~~~v 296 (562) ...++..|++|+| T Consensus 232 l~~L~~~gY~G~v 244 (276) T PRK09856 232 MRDIIDRGYEGYC 244 (276) T ss_pred HHHHHHCCCCCEE T ss_conf 9999985998159 No 224 >COG5001 Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain [Signal transduction mechanisms] Probab=24.67 E-value=43 Score=13.23 Aligned_cols=126 Identities=18% Similarity=0.221 Sum_probs=55.8 Q ss_pred CCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHHH Q ss_conf 14954467973246666522135699999972135426775899999886313234687431011012330466779888 Q gi|254780999|r 315 ALGKENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVNHFFSLMSQFLQEEPSGIVAENIQSRIRGNILMAL 394 (562) Q Consensus 315 alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~ 394 (562) .||-+. .++..|...-...-.-.|..+|+.|+..|..-..+-- .-.+++-. ..+...-+.||+-+|---.+--+- T Consensus 305 RLGGDE-Fa~i~p~~~dDA~rva~a~al~e~L~ApY~l~~~~v~---vgASiGia-~fP~~a~t~EqLf~RADyALYhAK 379 (663) T COG5001 305 RLGGDE-FALIIPALEDDALRVAGARALCESLQAPYDLRGVRVQ---VGASIGIA-PFPSGADTSEQLFERADYALYHAK 379 (663) T ss_pred HHCCCE-EEEECCCCCCHHHHHCCHHHHHHHHCCCCCCCCCEEE---EEEEEEEE-ECCCCCCCHHHHHHHHHHHHHHHH T ss_conf 706871-4897357896788750469999985797346886278---51124453-268766529999976668999887 Q ss_pred HHHCCCEECCCCC---------HHHHHHHHCCCCCCCCCCCCCCCCCCHH---HHHHHHHHHH Q ss_conf 7516954436432---------4477541102366655540101776336---5899999963 Q gi|254780999|r 395 SNHSKAMLLTTSN---------KSEISVGYGTLYGDMSGGFNPLKDLYKT---QVFQLASWRN 445 (562) Q Consensus 395 an~~~~lvl~t~n---------ksE~~~Gy~T~~GD~~g~~~p~~~l~Kt---~v~~la~~~n 445 (562) .|..|.-||=... .-|-++--.-+--.+...|.||-|+-.. -...|+||.. T Consensus 380 ~~gkG~AvlFda~hE~~I~~~A~vE~~Lr~adl~~elsV~fQPiVDi~~gkt~a~EALaRWhs 442 (663) T COG5001 380 QNGKGAAVLFDARHEAAIRDMAVVEQALRSADLEQELSVHFQPIVDIVSGKTIALEALARWHS 442 (663) T ss_pred HCCCCCEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHEEECEEEEEECCCCHHHHHHHHHHCC T ss_conf 528985687433578888899999999875545542402110113410562289999886438 No 225 >cd03796 GT1_PIG-A_like This family is most closely related to the GT1 family of glycosyltransferases. Phosphatidylinositol glycan-class A (PIG-A), an X-linked gene in humans, is necessary for the synthesis of N-acetylglucosaminyl-phosphatidylinositol, a very early intermediate in glycosyl phosphatidylinositol (GPI)-anchor biosynthesis. The GPI-anchor is an important cellular structure that facilitates the attachment of many proteins to cell surfaces. Somatic mutations in PIG-A have been associated with Paroxysmal Nocturnal Hemoglobinuria (PNH), an acquired hematological disorder. Probab=24.05 E-value=44 Score=13.15 Aligned_cols=15 Identities=20% Similarity=0.109 Sum_probs=5.7 Q ss_pred HHHHHHHHHHHHCCC Q ss_conf 999999999887697 Q gi|254780999|r 68 SAIDTLKSDTHDGGA 82 (562) Q Consensus 68 ~~l~~La~~~~~~~i 82 (562) ..+..|++...+.|. T Consensus 18 ~~v~~La~~L~~~Gh 32 (398) T cd03796 18 THIYQLSQCLIKRGH 32 (398) T ss_pred HHHHHHHHHHHHCCC T ss_conf 999999999997699 No 226 >TIGR02176 pyruv_ox_red pyruvate:ferredoxin (flavodoxin) oxidoreductase; InterPro: IPR011895 The oxidative decarboxylation of pyruvate to acetyl-CoA, a central step in energy metabolism, can occur by two different mechanisms . In mitochondria and aerobic bacteria this reaction is catalysed by the multienzyme complex pyruvate dehydrogenase using NAD as electron acceptor. In anaerobic organisms, however, this reaction is reversibly catalysed by a single enzyme using either ferrodoxin or flavodoxin as the electron acceptor. Pyruvate:ferrodoxin/flavodoxin reductases (PFORs) in this entry occur in both obligately and facultatively anaerobic bacteria and also some eukaryotic microorganisms. These proteins are single-chain enzymes containing a thiamin pyrophosphate cofactor for the cleavage of carbon-carbon bonds next to a carbonyl group, and iron-sulphur clusters for electron transfer. The Desulfovibrio africanus enzyme is currently the only PFOR whose three dimensional structure is known , . It is a homodimer where each subunit contains one thiamin pyrophosphate cofactor and two ferrodoxin-like 4Fe-S clusters and an atypical 4Fe-S cluster. Each monomer is composed of seven domains - domains I, II and VI make intersubunit contacts, while domains III, IV and V are located at the suface of the dimer, and domain VII forms a long arm extending over the other subunit. The cofactor is bound at the interface of domains I and VI and is proximal to the atypical 4Fe-S bound by domain VI, while the ferrodoxin-like 4Fe-S clusters are bound by domain V. Comparison of this enzyme with the multi-chain PFORs shows a correspondance between the domains in this enzyme and the subunits of the multi-chain enzymes.; GO: 0005506 iron ion binding, 0016903 oxidoreductase activity acting on the aldehyde or oxo group of donors, 0006118 electron transport. Probab=23.70 E-value=45 Score=13.11 Aligned_cols=52 Identities=21% Similarity=0.272 Sum_probs=42.7 Q ss_pred HHHHHHHHHCC--------------CCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHH Q ss_conf 69999997213--------------54267758999998863132346874310110123304667 Q gi|254780999|r 338 DAAACAKALGC--------------KYDVLPIHDLVNHFFSLMSQFLQEEPSGIVAENIQSRIRGN 389 (562) Q Consensus 338 ~a~~la~~lg~--------------~~~~i~I~~~~~~~~~~~~~~~~~~~~~~~~eN~qaR~R~~ 389 (562) ||-++|+..|. -..+||.+.+|+.+-+.|.+..+.+...|...|++|=-+++ T Consensus 551 dA~kiA~~~GLG~RIN~iMQtAFFKla~vlP~e~Av~~lK~~I~ksYGKKG~~vV~~N~~AvD~~v 616 (1194) T TIGR02176 551 DAVKIAQEVGLGNRINTIMQTAFFKLAGVLPFEKAVDLLKKSIEKSYGKKGEEVVQKNIKAVDKAV 616 (1194) T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH T ss_conf 688898770788448999998899871579878999998899885048977788987799997773 No 227 >PRK04507 consensus Probab=23.50 E-value=46 Score=13.08 Aligned_cols=36 Identities=28% Similarity=0.412 Sum_probs=16.8 Q ss_pred CCCCEEEEEEECCCCCCCHHHH-HHHHHHHHHHHHHCCCCEEEE Q ss_conf 9886289999638836889999-999999999999859989991 Q gi|254780999|r 1 MLKKLKIAIAQLNPVVGDIAGN-IAKARRAREEANRQGMDLILF 43 (562) Q Consensus 1 M~~~mkIal~Q~n~~~gd~~~N-~~~i~~~i~~A~~~gadLvvf 43 (562) ||++ +||+ +.||+.+- -+-+.++..+. ...+.++++ T Consensus 1 M~~P-~IaI-----T~GDPaGIGPEIilK~~~~~-~~~~~~vvi 37 (323) T PRK04507 1 MMVP-SLAL-----VPGEPAGIGPELCVRLAQQP-RSDAHLIAY 37 (323) T ss_pred CCCC-EEEE-----ECCCCCHHHHHHHHHHHHCC-CCCCCEEEE T ss_conf 9998-3899-----15886266999999998666-458998999 No 228 >TIGR01515 branching_enzym 1,4-alpha-glucan branching enzyme; InterPro: IPR006407 This entry represents the core region of the glycogen branching enzymes, which are responsible for the transfer of chains of approximately seven alpha(1,4)-linked glucosyl residues to other similar chains (in new alpha-(1,6) linkages) in the biosynthesis of glycogen . The branching enzyme is responsible for the degree of alpha(1,6) branch linkages found in polysaccharides .; GO: 0003844 14-alpha-glucan branching enzyme activity, 0005978 glycogen biosynthetic process. Probab=23.31 E-value=38 Score=13.61 Aligned_cols=126 Identities=10% Similarity=0.195 Sum_probs=72.7 Q ss_pred HHHHCCCEEEECCCCCCCCCCCCCCCCEEECCCCEEEEECCCCC--------CCEEEEEEEECCCCCCCCCCCCCCCCCC Q ss_conf 85433704642135688766210486435558985999616777--------6379999983352123655575555544 Q gi|254780999|r 194 QISHVHLPIIYVNQVGGQDELIFDGASFCFDGQQQLAFQMKHFS--------EQNFMTEWHYDQQLSQWNYMSDDSASTM 265 (562) Q Consensus 194 ~a~e~~~~vvy~N~vGg~d~lvf~G~S~I~d~~G~ii~~~~~f~--------e~~~i~d~d~d~~~~~~~~~~~~~~~~~ 265 (562) +-.++++|-++.=-+|.-..+=| =|++++|.+..++.-+. ...++++++-|..-...+.....-..+. T Consensus 60 ~~~~~GvWeLFiPg~~~g~~YKY----Ei~~~~G~~~lkaDPyA~y~evRP~tASlv~~l~~d~~W~~~r~~~~~~~~Pv 135 (608) T TIGR01515 60 KRNDNGVWELFIPGVGEGELYKY----EIVDANGEIRLKADPYAFYAEVRPETASLVADLEQDEEWLEKRKAKDPYEKPV 135 (608) T ss_pred CCCCCCEEEECCCCCCCCCCCEE----EEECCCCCEEECCCCCCCCCCCCCCCEEEEECCHHHHHHHHHHHHCCCCCCCE T ss_conf 02787225511776546862204----66768987564367530124689874025525853589999852102200882 Q ss_pred CC----CCCHHHHHHCCHHHHHHHHHHHCCCCEEE-EECCCCCHHHHHHHHHHHCCCH---HHEEEEECCC-CCCCCCC Q ss_conf 78----99988864220203588876422454057-5215872368999999841495---4467973246-6665221 Q gi|254780999|r 266 YI----PLQEEEADYNACVLSLRDYVQKNNFHKVI-IGLSGGIDSALCAAIAVDALGK---ENVQTIMLPY-KYTSPQS 335 (562) Q Consensus 266 ~~----p~~~~e~~~~Alvlgl~d~~~~~g~~~~v-iglSGGiDSal~a~la~~alg~---~~v~~~~mp~-~~~s~~s 335 (562) .+ -.|...-.|+-|+--|--|+++.||..+= ++|. =.=.+. =.|.|+.-|+ |+-+++. T Consensus 136 svYElHLGSW~~lSY~eL~~~L~pYvK~~GFTHiELLPv~------------EHPFDGSWGYQ~TGyYAPTsRfG~P~D 202 (608) T TIGR01515 136 SVYELHLGSWRKLSYRELADQLIPYVKELGFTHIELLPVA------------EHPFDGSWGYQVTGYYAPTSRFGTPDD 202 (608) T ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCCC------------CCCCCCCCCCCCCCCCCCCCCCCCHHH T ss_conf 8998400005421278886310157875087125414664------------278878655357755278888877456 No 229 >COG0703 AroK Shikimate kinase [Amino acid transport and metabolism] Probab=23.13 E-value=46 Score=13.04 Aligned_cols=76 Identities=17% Similarity=0.072 Sum_probs=38.6 Q ss_pred HHEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHH-HHHCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHH Q ss_conf 446797324666652213569999997213542677589999988-6313234687431011012330466779888751 Q gi|254780999|r 319 ENVQTIMLPYKYTSPQSLEDAAACAKALGCKYDVLPIHDLVNHFF-SLMSQFLQEEPSGIVAENIQSRIRGNILMALSNH 397 (562) Q Consensus 319 ~~v~~~~mp~~~~s~~s~~~a~~la~~lg~~~~~i~I~~~~~~~~-~~~~~~~~~~~~~~~~eN~qaR~R~~~L~~~an~ 397 (562) .|+.-|-||+-.-| .-++.||+.|+..+...| ...+... .+++..|... -++..+.+.+..+.-.. +. T Consensus 3 ~~IvLiG~mGaGKS----TIGr~LAk~L~~~F~D~D--~~Ie~~~g~sI~eIF~~~----GE~~FR~~E~~vl~~l~-~~ 71 (172) T COG0703 3 MNIVLIGFMGAGKS----TIGRALAKALNLPFIDTD--QEIEKRTGMSIAEIFEEE----GEEGFRRLETEVLKELL-EE 71 (172) T ss_pred CCEEEECCCCCCHH----HHHHHHHHHCCCCCCCCH--HHHHHHHCCCHHHHHHHH----HHHHHHHHHHHHHHHHH-HC T ss_conf 61899717999776----899999998199802246--999999796999999998----28999999999999986-26 Q ss_pred CCCEECCCC Q ss_conf 695443643 Q gi|254780999|r 398 SKAMLLTTS 406 (562) Q Consensus 398 ~~~lvl~t~ 406 (562) . ..|++|| T Consensus 72 ~-~~ViaTG 79 (172) T COG0703 72 D-NAVIATG 79 (172) T ss_pred C-CEEEECC T ss_conf 9-9599779 No 230 >PRK13545 tagH teichoic acids export protein ATP-binding subunit; Provisional Probab=23.05 E-value=46 Score=13.03 Aligned_cols=69 Identities=13% Similarity=0.290 Sum_probs=37.7 Q ss_pred HHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCEEEEEEEEEECCEEEEE Q ss_conf 99985998999166444799844541488999999999999999988769789999999888987899999969979998 Q gi|254780999|r 32 EANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTHDGGAGIVVGFPRQDQEGVLNSVVILDAGNIIAV 111 (562) Q Consensus 32 ~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~~~~i~iivG~p~~~~~~~yNsa~vi~~G~il~~ 111 (562) .|....+|+++.-|-.-.| ...|.+.|.+.+.++.+ ...+.++|.+-...=..+-+.+.+++.|++... T Consensus 156 IA~~~dPDILIIDEaLSVG--------D~~F~~Kc~~rm~ef~e---~gkTIvfVSHsl~~Vk~~C~R~iWLe~G~vr~~ 224 (549) T PRK13545 156 ISVHINPDILVIDEALSVG--------DQTFTKKCLDKMNEFKE---QGKTIFFISHSLSQVKSFCTKALWLHYGQVKEY 224 (549) T ss_pred HHHHCCCCEEEEECCCCCC--------CHHHHHHHHHHHHHHHH---CCCEEEEEECCHHHHHHHCCEEEEEECCEEEEE T ss_conf 9982499999994620057--------89999999999999997---898899995888999985731063438667874 No 231 >PRK13270 treF trehalase; Provisional Probab=22.52 E-value=48 Score=12.96 Aligned_cols=19 Identities=5% Similarity=0.177 Sum_probs=9.9 Q ss_pred CCCCCHHHHHHHHHHHHCC Q ss_conf 4267758999998863132 Q gi|254780999|r 350 YDVLPIHDLVNHFFSLMSQ 368 (562) Q Consensus 350 ~~~i~I~~~~~~~~~~~~~ 368 (562) +.-|+++.++-....+|+. T Consensus 346 iiPVDLNalLy~~E~~lA~ 364 (549) T PRK13270 346 FIPIDLNAFLFKLESAIAN 364 (549) T ss_pred CCEECHHHHHHHHHHHHHH T ss_conf 0414489999999999999 No 232 >TIGR01136 cysKM cysteine synthases; InterPro: IPR005856 This model discriminates cysteine synthases (both CysK and CysM) from cystathionine beta-synthase, a protein found primarily in eukaryotes and carrying a C-terminal CBS domain lacking from this protein. Bacterial proteins lacking the CBS domain but otherwise showing resemblance to cystathionine beta-synthases and considerable phylogenetic distance from known cysteine synthases were excluded. Cysteine synthase (O-acetylserine (thiol)-lyase) is the enzyme responsible for the formation of cysteine from O-acetyl-serine and hydrogen sulphide with the concomitant release of acetic acid. In bacteria such two forms of the enzyme are known (genes cysK and cysM). CysM differs from CysK in that it can also use thiosulphate instead of sulphide, to produce cysteine thiosulphonate instead of cysteine. ; GO: 0004124 cysteine synthase activity, 0006535 cysteine biosynthetic process from serine. Probab=22.49 E-value=48 Score=12.96 Aligned_cols=39 Identities=23% Similarity=0.336 Sum_probs=27.4 Q ss_pred CCCEEEEECCCCCHHHHHHHHHHHCC--CHHHEEEEECCCC Q ss_conf 45405752158723689999998414--9544679732466 Q gi|254780999|r 291 NFHKVIIGLSGGIDSALCAAIAVDAL--GKENVQTIMLPYK 329 (562) Q Consensus 291 g~~~~viglSGGiDSal~a~la~~al--g~~~v~~~~mp~~ 329 (562) .-+|+++|+|.|--=+-++-||.+.- |++.+..+++|+. T Consensus 265 ~~EGIl~GiSSGAavaAAl~~A~~~~~~G~~k~iV~i~pd~ 305 (315) T TIGR01136 265 REEGILVGISSGAAVAAALQVAKRLENKGPGKVIVAILPDT 305 (315) T ss_pred HHCCEEEECCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCC T ss_conf 74680662156899999999997322478896699996689 No 233 >cd03811 GT1_WabH_like This family is most closely related to the GT1 family of glycosyltransferases. WabH in Klebsiella pneumoniae has been shown to transfer a GlcNAc residue from UDP-GlcNAc onto the acceptor GalUA residue in the cellular outer core. Probab=22.40 E-value=48 Score=12.94 Aligned_cols=11 Identities=27% Similarity=0.286 Sum_probs=4.6 Q ss_pred EEEECCCCCCC Q ss_conf 99916644479 Q gi|254780999|r 40 LILFTELFISG 50 (562) Q Consensus 40 LvvfPEl~LtG 50 (562) +.+.|-+...| T Consensus 3 l~v~~~~~~GG 13 (353) T cd03811 3 LFVIPSLGGGG 13 (353) T ss_pred EEEECCCCCCH T ss_conf 99969999915 No 234 >pfam06500 DUF1100 Alpha/beta hydrolase of unknown function (DUF1100). This family consists of several hypothetical bacterial proteins of unknown function. Members of this family have an alpha/beta hydrolase fold. Probab=22.35 E-value=48 Score=12.94 Aligned_cols=37 Identities=27% Similarity=0.301 Sum_probs=23.3 Q ss_pred EEECCCCCHHHHH--HHHHHHCCCHHHE--EEEECCCCCCC Q ss_conf 7521587236899--9999841495446--79732466665 Q gi|254780999|r 296 IIGLSGGIDSALC--AAIAVDALGKENV--QTIMLPYKYTS 332 (562) Q Consensus 296 viglSGGiDSal~--a~la~~alg~~~v--~~~~mp~~~~s 332 (562) +|=+|||+||-=+ --|.-+-|.|.++ +.+-|||..-| T Consensus 192 tViv~ggLDsLQ~D~~rLF~dyLaP~giAMLTlDMPSvG~S 232 (411) T pfam06500 192 VVLVSAGLDSLQTDMWRLFRDYLAPRDIAMLTIDMPSVGAS 232 (411) T ss_pred EEEEECCHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCC T ss_conf 69995772577888999999853745733797047877754 No 235 >TIGR01513 NAPRTase_put putative nicotinate phosphoribosyltransferase; InterPro: IPR006405 A deep split separates two related families of proteins, one of which includes experimentally characterised examples of nicotinate phosphoribosyltransferase, the first enzyme of NAD salvage biosynthesis. This entry represents the other family. Members have a different (longer) spacing of several key motifs and have an additional C-terminal domain of up to 100 residues. One argument suggesting that this family represents the same enzyme is that no species has a member of both families. Another is that the gene encoding this protein is located near other NAD salvage biosynthesis genes in Nostoc and in at least four different Gram-positive bacteria. NAD and NADP are ubiquitous in life. Most members of this family are from Gram-positive bacteria. An additional set of mutually closely related archaeal sequences score between the trusted and noise cut-offs. . Probab=22.35 E-value=48 Score=12.94 Aligned_cols=72 Identities=18% Similarity=0.203 Sum_probs=47.5 Q ss_pred HCCHHHHHHHHHHHCC-------CCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHH--HH Q ss_conf 2202035888764224-------54057521587236899999984149544679732466665221356999999--72 Q gi|254780999|r 276 YNACVLSLRDYVQKNN-------FHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAK--AL 346 (562) Q Consensus 276 ~~Alvlgl~d~~~~~g-------~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~--~l 346 (562) ...|...+|..+...| ++.+=|=+|||+|=...+.|...- .|=++.||=-- -.|+.. + = .| T Consensus 291 L~~LSk~~R~~LD~AGaqfqvPWL~~vkI~VSn~LDE~~I~~L~~~g-apID~~GVGT~-LvTa~~----~----PGG~l 360 (523) T TIGR01513 291 LAYLSKKVRKELDEAGAQFQVPWLEEVKIVVSNDLDENSIAALKAEG-APIDVYGVGTS-LVTASD----A----PGGAL 360 (523) T ss_pred HHHHHHHHHHHHHHHCHHCCCCCHHCCEEEEECCCCHHHHHHHHHCC-CCEEEEEECCE-EEECCC----C----CCCCC T ss_conf 99999999999986111115860037548996740478999999658-94538884678-898478----8----88500 Q ss_pred CCCCCCCCHHH Q ss_conf 13542677589 Q gi|254780999|r 347 GCKYDVLPIHD 357 (562) Q Consensus 347 g~~~~~i~I~~ 357 (562) ++-|+.+.++. T Consensus 361 ~~VYKlva~~g 371 (523) T TIGR01513 361 SGVYKLVAYEG 371 (523) T ss_pred CCEEEEEEECC T ss_conf 21378888637 No 236 >PRK08305 spoVFB dipicolinate synthase subunit B; Reviewed Probab=22.28 E-value=48 Score=12.93 Aligned_cols=38 Identities=16% Similarity=0.159 Sum_probs=25.8 Q ss_pred ECCEEEEECCCCCCCHH---HHHHHHHHHCCCCEEEECCCC Q ss_conf 64745620111111101---689999971477759931555 Q gi|254780999|r 140 FRDIRLGILICEDIWKN---SNICKHLKKQGAEFLFSLNAS 177 (562) Q Consensus 140 ~~g~kiGv~ICeDlw~~---~~~~~~la~~Gadlii~psAS 177 (562) ..|.|||+.||.-+-.. ....+.|.+.||++..+.|.+ T Consensus 3 L~gK~I~lgvTGSiaay~kv~~~ir~L~~~GA~V~~ImS~~ 43 (195) T PRK08305 3 LKGKRIGFGLTGSHCTYEEVMPQIEKLVAEGAEVRPIVSYT 43 (195) T ss_pred CCCCEEEEEECCHHHHHHHHHHHHHHHHHCCCEEEEEECHH T ss_conf 68988999967279989999999999998799699997544 No 237 >pfam02065 Melibiase Melibiase. Probab=22.18 E-value=48 Score=12.92 Aligned_cols=61 Identities=18% Similarity=0.256 Sum_probs=29.9 Q ss_pred HHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCC-----CHHHHHCCHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 999999999999999985998999166444799-----8445414889999999999999999887697 Q gi|254780999|r 19 IAGNIAKARRAREEANRQGMDLILFTELFISGY-----PPEDLVFKKSFIQACSSAIDTLKSDTHDGGA 82 (562) Q Consensus 19 ~~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY-----~~~Dl~~~~~f~~~~~~~l~~La~~~~~~~i 82 (562) ++-|.+++++++++|++-|+++.|+=--+..+= ..+|+.-.+.- -..-|..|+...++.|+ T Consensus 53 Fd~~e~~l~~la~~Aa~lGvE~FVlDDGWFg~R~~D~~gLGDW~v~~~k---fP~GL~~l~d~v~~~GM 118 (395) T pfam02065 53 FDFNEDKLLDLADEAADLGIELFVLDDGWFGGRNDDNSSLGDWFVNEQK---FPNGLKPLADHVHSLGM 118 (395) T ss_pred CCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCCCCHHH---CCCCHHHHHHHHHHCCC T ss_conf 2589999999999999739869998178778999988777381616500---58987999999997398 No 238 >pfam05201 GlutR_N Glutamyl-tRNAGlu reductase, N-terminal domain. Probab=22.03 E-value=30 Score=14.31 Aligned_cols=18 Identities=39% Similarity=0.519 Sum_probs=14.3 Q ss_pred HHCCCEECCCCCHHHHHH Q ss_conf 516954436432447754 Q gi|254780999|r 396 NHSKAMLLTTSNKSEISV 413 (562) Q Consensus 396 n~~~~lvl~t~nksE~~~ 413 (562) .-.+.++|+|=|++|+.+ T Consensus 35 ~i~e~viLsTCNRtEiY~ 52 (150) T pfam05201 35 GVSEAVILSTCNRVEIYA 52 (150) T ss_pred CCCEEEEEEECCCEEEEE T ss_conf 975168986047279999 No 239 >TIGR02538 type_IV_pilB type IV-A pilus assembly ATPase PilB; InterPro: IPR013374 This model describes a protein involved in type IV pilus biogenesis designated PilB in Pseudomonas aeruginosa but PilF in Neisseria gonorrhoeae; the more common usage, reflected here, is PilB. This protein is an ATPase involved in protein export for pilin assembly, and is closely related to GspE (IPR013369 from INTERPRO) of type II secretion systems (also referred to as the main terminal branch of the general secretion pathway). Note that type IV pilus systems are often divided into type IV-A and IV-B, with the latter group including bundle-forming pilus, mannose-sensitive hemagglutinin, etc. Members of this family are found in type IV-A systems.; GO: 0005524 ATP binding, 0008565 protein transporter activity, 0009297 pilus biogenesis. Probab=21.82 E-value=34 Score=13.88 Aligned_cols=51 Identities=14% Similarity=0.084 Sum_probs=27.3 Q ss_pred HHHHHCCCEECCCCCHHHHH---HHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 88751695443643244775---4110236665554010177633658999999631 Q gi|254780999|r 393 ALSNHSKAMLLTTSNKSEIS---VGYGTLYGDMSGGFNPLKDLYKTQVFQLASWRNS 446 (562) Q Consensus 393 ~~an~~~~lvl~t~nksE~~---~Gy~T~~GD~~g~~~p~~~l~Kt~v~~la~~~n~ 446 (562) =-|.+.|||||+|-+==.++ .--.-+ +...|+.=++|.---=-.|||-+.. T Consensus 414 iKAAqTGHLVlSTLHTNdAp~Tl~RL~NM---GiapFNIASSV~LI~AQRLARRLCs 467 (577) T TIGR02538 414 IKAAQTGHLVLSTLHTNDAPETLARLVNM---GIAPFNIASSVNLIMAQRLARRLCS 467 (577) T ss_pred HHHHCCCCCEECCCCCCCHHHHHHHHHHC---CCHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 99840487210100016858999999753---8413799999999999975524031 No 240 >PRK09922 UDP-D-galactose:(glucosyl)lipopolysaccharide-1,6-D-galactosyltransferase; Provisional Probab=21.79 E-value=49 Score=12.87 Aligned_cols=40 Identities=20% Similarity=0.299 Sum_probs=21.9 Q ss_pred CCEEEEEEECCCC-CCCHHHHHHHHHHHHHHHHHC--CCCEEEECC Q ss_conf 8628999963883-688999999999999999985--998999166 Q gi|254780999|r 3 KKLKIAIAQLNPV-VGDIAGNIAKARRAREEANRQ--GMDLILFTE 45 (562) Q Consensus 3 ~~mkIal~Q~n~~-~gd~~~N~~~i~~~i~~A~~~--gadLvvfPE 45 (562) ++|||+....+.. .|-.|.-+-.+.+.. .+. +.+++++.. T Consensus 1 ~~MKIlfi~~~l~~~GGaErvl~~La~~L---~~~~~~~~v~~~~~ 43 (361) T PRK09922 1 KKMKIAFIGEAVSGFGGMETVISNVINTF---ENSKINCEMFFFCR 43 (361) T ss_pred CCEEEEEECCCCCCCCCHHHHHHHHHHHH---HHHCCCEEEEEEEC T ss_conf 97099999999999880499999999999---87199879999934 No 241 >cd02648 nuc_hydro_1 NH_1: A subgroup of nucleoside hydrolases. This group contains fungal proteins similar to nucleoside hydrolases. Nucleoside hydrolases cleave the N-glycosidic bond in nucleosides generating ribose and the respective base. These enzymes vary in their substrate specificity. Probab=21.73 E-value=49 Score=12.86 Aligned_cols=20 Identities=25% Similarity=0.330 Sum_probs=8.1 Q ss_pred HHHHHHHCCCCEEEEECCCC Q ss_conf 88876422454057521587 Q gi|254780999|r 283 LRDYVQKNNFHKVIIGLSGG 302 (562) Q Consensus 283 l~d~~~~~g~~~~viglSGG 302 (562) +.|-++++.-+.+.|=--|- T Consensus 141 ii~~l~~~p~~~Vtlva~GP 160 (367) T cd02648 141 ILDILREEPDHTVTIAALGP 160 (367) T ss_pred HHHHHHHCCCCEEEEEECCC T ss_conf 99999967998189995786 No 242 >cd01571 NAPRTase_B Nicotinate phosphoribosyltransferase (NAPRTase), subgroup B. Nicotinate phosphoribosyltransferase catalyses the formation of NAMN and PPi from 5-phosphoribosy -1-pyrophosphate (PRPP) and nicotinic acid, this is the first, and also rate limiting, reaction in the NAD salvage synthesis. This salvage pathway serves to recycle NAD degradation products. Probab=21.53 E-value=50 Score=12.83 Aligned_cols=38 Identities=18% Similarity=0.255 Sum_probs=30.3 Q ss_pred HHCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHH Q ss_conf 42202035888764224540575215872368999999 Q gi|254780999|r 275 DYNACVLSLRDYVQKNNFHKVIIGLSGGIDSALCAAIA 312 (562) Q Consensus 275 ~~~Alvlgl~d~~~~~g~~~~viglSGGiDSal~a~la 312 (562) ....++.-+|..+.+.|++.+-|-+|||||=.-...++ T Consensus 226 ~~~~l~~~vR~~LD~~G~~~vkI~aSggl~e~~I~~l~ 263 (302) T cd01571 226 VFRYLIREVRWALDIRGYKHVKIFVSGGLDEEDIKELE 263 (302) T ss_pred CHHHHHHHHHHHHHHCCCCCCEEEEECCCCHHHHHHHH T ss_conf 79999999999997669887489996999999999998 No 243 >cd03268 ABC_BcrA_bacitracin_resist The BcrA subfamily represents ABC transporters involved in peptide antibiotic resistance. Bacitracin is a dodecapeptide antibiotic produced by B. licheniformis and B. subtilis. The synthesis of bacitracin is non-ribosomally catalyzed by a multienzyme complex BcrABC. Bacitracin has potent antibiotic activity against gram-positive bacteria. The inhibition of peptidoglycan biosynthesis is the best characterized bacterial effect of bacitracin. The bacitracin resistance of B. licheniformis is mediated by the ABC transporter Bcr which is composed of two identical BcrA ATP-binding subunits and one each of the integral membrane proteins, BcrB and BcrC. B. subtilis cells carrying bcr genes on high-copy number plasmids develop collateral detergent sensitivity, a similar phenomenon in human cells with overexpressed multi-drug resistance P-glycoprotein. Probab=21.50 E-value=50 Score=12.83 Aligned_cols=16 Identities=38% Similarity=0.465 Sum_probs=7.3 Q ss_pred EECCCCCHHHHHHHHH Q ss_conf 5215872368999999 Q gi|254780999|r 297 IGLSGGIDSALCAAIA 312 (562) Q Consensus 297 iglSGGiDSal~a~la 312 (562) =.+|||.---|..|.| T Consensus 125 ~~LS~G~kqrl~la~a 140 (208) T cd03268 125 KGFSLGMKQRLGIALA 140 (208) T ss_pred HHCCHHHHHHHHHHHH T ss_conf 5699999999999999 No 244 >TIGR01769 GGGP geranylgeranylglyceryl phosphate synthase; InterPro: IPR010946 This entry represents geranylgeranylglyceryl phosphate synthase which catalyses the first committed step in the synthesis of ether-linked membrane lipids in archaea .. Probab=21.49 E-value=50 Score=12.83 Aligned_cols=137 Identities=14% Similarity=0.229 Sum_probs=76.3 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEEE-------E Q ss_conf 999999999999999859989991664447998445414889999999999999999887697899999998-------8 Q gi|254780999|r 20 AGNIAKARRAREEANRQGMDLILFTELFISGYPPEDLVFKKSFIQACSSAIDTLKSDTHDGGAGIVVGFPRQ-------D 92 (562) Q Consensus 20 ~~N~~~i~~~i~~A~~~gadLvvfPEl~LtGY~~~Dl~~~~~f~~~~~~~l~~La~~~~~~~i~iivG~p~~-------~ 92 (562) +.|++.....+++ .-..-+|+||=- -+.+.+..+++-.+.-. ...+.+.+||++.+ . T Consensus 40 e~~ld~~v~~ik~--~~~~p~iLFPG~-------------~~~Ls~~ADAvlFmSlL-NS~d~yfivGaqi~aA~~i~~~ 103 (212) T TIGR01769 40 EDKLDQVVKKIKE--ITNLPVILFPGS-------------VNLLSRKADAVLFMSLL-NSADTYFIVGAQIKAAVLIKKL 103 (212) T ss_pred HHHHHHHHHHHHH--CCCCCEEECCCC-------------HHHHHHHHHHHHHHHHH-CCCCCCEEECCHHHHHHHHHHH T ss_conf 5478899998640--157866882788-------------45788886598888865-3699736741515789999872 Q ss_pred CCEE--EEEEEE-EECCE--EEEEEEEEEECCCCCCC--------------CCCCCCCCCCCC-EEEE------CC-E-E Q ss_conf 8987--899999-96997--99988777623554333--------------210037788876-1476------47-4-5 Q gi|254780999|r 93 QEGV--LNSVVI-LDAGN--IIAVRDKINLPNYSEFH--------------EKRTFISGYSND-PIVF------RD-I-R 144 (562) Q Consensus 93 ~~~~--yNsa~v-i~~G~--il~~y~K~~LP~y~~Fd--------------E~r~F~~G~~~~-~~~~------~g-~-k 144 (562) +.++ --+|++ +.+|+ -+++.-|..+=.|..++ ..-||++|..-. |+.= +. . + T Consensus 104 ~PrlE~Ip~AY~iv~~G~ktavG~vG~~~~Ip~~~~~~A~~Y~LAA~Y~G~~~vYLEAGSGAs~Pv~~e~i~~~k~~~~~ 183 (212) T TIGR01769 104 QPRLEVIPTAYLIVGPGGKTAVGYVGKARLIPYDKPEIAAAYALAAKYLGFKLVYLEAGSGASEPVSPETISLVKKKISS 183 (212) T ss_pred CCCCCCCCEEEEEECCCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCC T ss_conf 78856365226887588740446520012589986668999999998741351213105786667866799999985489 Q ss_pred EEECCCCCCCHHHHHHHHHHHCCCCEEEE Q ss_conf 62011111110168999997147775993 Q gi|254780999|r 145 LGILICEDIWKNSNICKHLKKQGAEFLFS 173 (562) Q Consensus 145 iGv~ICeDlw~~~~~~~~la~~Gadlii~ 173 (562) +-+-+.-=++.+| .+..+++.|||+|+. T Consensus 184 I~LIVGGGIr~~E-iA~~~v~aGAd~IVT 211 (212) T TIGR01769 184 IPLIVGGGIRSPE-IALKIVLAGADVIVT 211 (212) T ss_pred CCEEECCCCCCHH-HHHHHHHHCCCEEEC T ss_conf 7277527758889-999999708982634 No 245 >TIGR01919 hisA-trpF bifunctional HisA/TrpF protein; InterPro: IPR010188 This entry represents a bifunctional protein possessing both hisA (1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase) and trpF (N-(5'phosphoribosyl)anthranilate isomerase) activities . Thus, it is involved in both the histidine and tryptophan biosynthetic pathways. Enzymes with this property have been described only in the Actinobacteria (High-GC Gram-positive). The enzyme is closely related to the monofunctional HisA proteins and in Actinobacteria, the classical monofunctional TrpF is generally absent.; GO: 0003949 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino]imidazole-4-carboxamide isomerase activity, 0004640 phosphoribosylanthranilate isomerase activity, 0000105 histidine biosynthetic process, 0000162 tryptophan biosynthetic process, 0005737 cytoplasm. Probab=21.49 E-value=50 Score=12.83 Aligned_cols=32 Identities=47% Similarity=0.606 Sum_probs=21.5 Q ss_pred HHHHHHHCCCCEEEEECCCCC--HHHHHHHHHHHC Q ss_conf 888764224540575215872--368999999841 Q gi|254780999|r 283 LRDYVQKNNFHKVIIGLSGGI--DSALCAAIAVDA 315 (562) Q Consensus 283 l~d~~~~~g~~~~viglSGGi--DSal~a~la~~a 315 (562) |++-+.|-+ ..|-|.||||| |.+|.+||+.=| T Consensus 67 l~EiVg~Ld-drV~vELsGGiRDD~SL~~AL~tGa 100 (246) T TIGR01919 67 LAEIVGKLD-DRVDVELSGGIRDDESLEAALATGA 100 (246) T ss_pred HHHHHHHHC-CEEEEEEECCCCCHHHHHHHHHHCC T ss_conf 999986307-8788985068556789999998077 No 246 >cd01534 4RHOD_Repeat_3 Member of the Rhodanese Homology Domain superfamily, repeat 3. This CD includes putative rhodanese-related sulfurtransferases which contain 4 copies of the Rhodanese Homology Domain. This CD aligns the 3rd repeat which does not contain the putative catalytic Cys residue. Probab=21.41 E-value=40 Score=13.49 Aligned_cols=21 Identities=24% Similarity=0.578 Sum_probs=13.7 Q ss_pred HHHHHCCCCEEEEECCCCCHHHH Q ss_conf 87642245405752158723689 Q gi|254780999|r 285 DYVQKNNFHKVIIGLSGGIDSAL 307 (562) Q Consensus 285 d~~~~~g~~~~viglSGGiDSal 307 (562) .++++.|++ |.-|.||+++.| T Consensus 74 ~~L~~~G~~--v~~L~GG~~A~~ 94 (95) T cd01534 74 SWLAQMGWE--VYVLEGGLAAAL 94 (95) T ss_pred HHHHHCCCC--EEEECCCCCCCC T ss_conf 999986997--699578612353 No 247 >pfam00128 Alpha-amylase Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta strand 3 and alpha helix 3, and a carboxyl-terminal Greek key beta-barrel domain. Probab=21.41 E-value=50 Score=12.82 Aligned_cols=67 Identities=24% Similarity=0.407 Sum_probs=45.8 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCC-----CCCHHHHHC-CHHHHHHHHHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 889999999999999999859989991664447-----998445414-8899999999999999998876978999999 Q gi|254780999|r 17 GDIAGNIAKARRAREEANRQGMDLILFTELFIS-----GYPPEDLVF-KKSFIQACSSAIDTLKSDTHDGGAGIVVGFP 89 (562) Q Consensus 17 gd~~~N~~~i~~~i~~A~~~gadLvvfPEl~Lt-----GY~~~Dl~~-~~~f~~~~~~~l~~La~~~~~~~i~iivG~p 89 (562) ||+.+-++|+- ++ ++-|++.|-++=.+-+ ||.+.|++. .+.| -..+.+..|.+.+++.+|.|++=.+ T Consensus 1 Gd~~gi~~kLd-yl---~~LGv~~i~l~P~~~~~~~~~GY~~~d~~~vd~~~--Gt~~e~~~Lv~~~H~~Gi~VilD~V 73 (314) T pfam00128 1 GDLQGIIDKLD-YL---KDLGVTAIWLSPIFDSPQSYHGYDITDYYKIDPHF--GTMDDFKELIDKAHERGIKVILDLV 73 (314) T ss_pred CCHHHHHHHHH-HH---HHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCC--CCHHHHHHHHHHHHHCCCEEEECEE T ss_conf 98889998248-99---97599989979671799999996835678768232--9999999999998466988860312 No 248 >PRK08091 ribulose-phosphate 3-epimerase; Validated Probab=21.26 E-value=50 Score=12.80 Aligned_cols=37 Identities=11% Similarity=0.084 Sum_probs=23.2 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCHHH Q ss_conf 3588876422454057521587236899999984149544 Q gi|254780999|r 281 LSLRDYVQKNNFHKVIIGLSGGIDSALCAAIAVDALGKEN 320 (562) Q Consensus 281 lgl~d~~~~~g~~~~viglSGGiDSal~a~la~~alg~~~ 320 (562) .-|+++..+.+.. +.|.+.|||.-. ++..+++| |.+. T Consensus 174 ~~l~~~~~~~~~~-~~I~VDGGI~~~-ti~~~~~a-Gad~ 210 (235) T PRK08091 174 AQLLCLLGDKREG-KLIVIDGSMTQD-QLPSLIAQ-GIDW 210 (235) T ss_pred HHHHHHHHHCCCC-CEEEEECCCCHH-HHHHHHHC-CCCE T ss_conf 9999999964999-159984898988-89999983-9999 No 249 >pfam05820 DUF845 Baculovirus protein of unknown function (DUF845). This family consists of several highly related Baculovirus proteins of unknown function. Probab=21.02 E-value=51 Score=12.76 Aligned_cols=42 Identities=19% Similarity=0.316 Sum_probs=28.7 Q ss_pred CCEEE-EEEEEEEECCCCCCCCCCCCCCCCCCCEEEE---CCEEEEECCC Q ss_conf 99799-9887776235543332100377888761476---4745620111 Q gi|254780999|r 105 AGNII-AVRDKINLPNYSEFHEKRTFISGYSNDPIVF---RDIRLGILIC 150 (562) Q Consensus 105 ~G~il-~~y~K~~LP~y~~FdE~r~F~~G~~~~~~~~---~g~kiGv~IC 150 (562) .|.++ +.|.|+.+.|.-.|. |.||.++.+|.. +|.-+.++|+ T Consensus 41 ~g~~LaHYyA~i~~sngy~FE----fHPGSQPrTFQ~vh~dg~~i~~~vl 86 (119) T pfam05820 41 GGALLAHYYAQIDLSNGYSFE----FHPGSQPRTFQTVHTDGTLIKVLVL 86 (119) T ss_pred CCCHHHHEEEEEEECCCCEEE----ECCCCCCCCCEEECCCCCEEEEEEE T ss_conf 610563002778832893287----7799998633554369919998875 No 250 >cd03821 GT1_Bme6_like This family is most closely related to the GT1 family of glycosyltransferases. Bme6 in Brucella melitensis has been shown to be involved in the biosynthesis of a polysaccharide. Probab=20.88 E-value=51 Score=12.75 Aligned_cols=15 Identities=20% Similarity=0.359 Sum_probs=8.5 Q ss_pred HHHHCCCCEEEECCC Q ss_conf 997147775993155 Q gi|254780999|r 162 HLKKQGAEFLFSLNA 176 (562) Q Consensus 162 ~la~~Gadlii~psA 176 (562) .....+.-+++.+.. T Consensus 107 ~~~~~~~p~i~~~h~ 121 (375) T cd03821 107 AARKYGIPYVVSPHG 121 (375) T ss_pred HHHHCCCCEEEEECC T ss_conf 999849969999799 No 251 >TIGR02871 spore_ylbJ sporulation integral membrane protein YlbJ; InterPro: IPR014226 Members of this entry include YlbJ, from Bacillus subtilis. They are found exclusively in the Firmicutes (low-GC Gram-positive bacteria) and are known from studies in B. subtilis to be part of the sigma-E regulon . Mutation leads to a sporulation defect, confirming that members of this entry are involved in sporulation. The protein appears to be universal among endospore-forming bacteria, and in Symbiobacterium thermophilum IAM 14863, there are two ORFs encoding YlbJ that are distant from each other.. Probab=20.85 E-value=15 Score=16.31 Aligned_cols=45 Identities=11% Similarity=0.192 Sum_probs=25.3 Q ss_pred HHHHHHHHHHHCCCCEEE---ECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCC Q ss_conf 016899999714777599---3155533358078887889998543370464213568 Q gi|254780999|r 155 KNSNICKHLKKQGAEFLF---SLNASPYYHNKLKKRHEIVTGQISHVHLPIIYVNQVG 209 (562) Q Consensus 155 ~~~~~~~~la~~Gadlii---~psASp~~~~k~~~R~~~~~~~a~e~~~~vvy~N~vG 209 (562) +|. .+-+.+.+|+++.+ -||=.||-. +..--...+..-+|.+.-| T Consensus 18 ~P~-~~~eAs~~G~~Lff~~VfPsLfPFfI---------~seLli~~g~v~~~g~lLe 65 (384) T TIGR02871 18 NPK-ASIEASLSGAKLFFYVVFPSLFPFFI---------ISELLINLGGVSFYGKLLE 65 (384) T ss_pred CHH-HHHHHHHHHHHHHHHHHHHHHCHHHH---------HHHHHHHCCCEEEEHHHHH T ss_conf 718-89999875478989888865341899---------9999984793586154424 No 252 >cd04148 RGK RGK subfamily. The RGK (Rem, Rem2, Rad, Gem/Kir) subfamily of Ras GTPases are expressed in a tissue-specific manner and are dynamically regulated by transcriptional and posttranscriptional mechanisms in response to environmental cues. RGK proteins bind to the beta subunit of L-type calcium channels, causing functional down-regulation of these voltage-dependent calcium channels, and either termination of calcium-dependent secretion or modulation of electrical conduction and contractile function. Inhibition of L-type calcium channels by Rem2 may provide a mechanism for modulating calcium-triggered exocytosis in hormone-secreting cells, and has been proposed to influence the secretion of insulin in pancreatic beta cells. RGK proteins also interact with and inhibit the Rho/Rho kinase pathway to modulate remodeling of the cytoskeleton. Two characteristics of RGK proteins cited in the literature are N-terminal and C-terminal extensions beyond the GTPase domain typical of Ra Probab=20.77 E-value=52 Score=12.73 Aligned_cols=35 Identities=20% Similarity=0.471 Sum_probs=22.9 Q ss_pred CCCHHHHHHHHHHHCCCCCCCC------HHHHHHHHHHHHC Q ss_conf 2213569999997213542677------5899999886313 Q gi|254780999|r 333 PQSLEDAAACAKALGCKYDVLP------IHDLVNHFFSLMS 367 (562) Q Consensus 333 ~~s~~~a~~la~~lg~~~~~i~------I~~~~~~~~~~~~ 367 (562) .-|.+.++++|+.+|+.|.+.+ |+++++....++. T Consensus 123 ~Vs~eEg~~~A~~~~~~F~EtSAk~~~NV~elF~~lvrqIr 163 (221) T cd04148 123 EVSVQEGRACAVVFDCKFIETSAGLQHNVDELLEGIVRQIR 163 (221) T ss_pred CCCHHHHHHHHHHCCCEEEECCCCCCCCHHHHHHHHHHHHH T ss_conf 99999999999985998999457999498999999999998 No 253 >TIGR01983 UbiG ubiquinone biosynthesis O-methyltransferase; InterPro: IPR010233 This entry represents an O-methyltransferase, which is believed to act at two points in the ubiquinone biosynthetic pathway in bacteria (UbiG) and mitochondria (COQ3) , . A separate methylase (MenG/UbiE) catalyses the single C-methylation step. The most commonly used names for genes in this family do not indicate whether this gene is an O-methyl, or C-methyl transferase.; GO: 0008425 2-polyprenyl-6-methoxy-14-benzoquinone methyltransferase activity, 0006744 ubiquinone biosynthetic process. Probab=20.63 E-value=52 Score=12.71 Aligned_cols=111 Identities=22% Similarity=0.314 Sum_probs=64.4 Q ss_pred HHHHHH-HCCCCEEEEECCCCCHHHHHHHHHHHCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCC--------CCCCC Q ss_conf 888764-224540575215872368999999841495446797324666652213569999997213--------54267 Q gi|254780999|r 283 LRDYVQ-KNNFHKVIIGLSGGIDSALCAAIAVDALGKENVQTIMLPYKYTSPQSLEDAAACAKALGC--------KYDVL 353 (562) Q Consensus 283 l~d~~~-~~g~~~~viglSGGiDSal~a~la~~alg~~~v~~~~mp~~~~s~~s~~~a~~la~~lg~--------~~~~i 353 (562) .+|-+. .+|.+=+=||==||| |+-.|| -+|...|.|| ..|+..++-|+..|+.=++ +|..- T Consensus 76 ~~~~~~~~~G~~vLDVGCGGGl---LsE~lA--R~Ga~~V~GI-----D~~~~~i~~Ak~Ha~~~~~~~~~~K~i~Y~~~ 145 (275) T TIGR01983 76 IRDELLKDSGLRVLDVGCGGGL---LSEPLA--RLGANKVTGI-----DASEENIEVAKLHAKKDPLQEALDKKIEYRCT 145 (275) T ss_pred HHHHHHCCCCCEEEEECCCHHH---HHHHHH--HCCCCEEEEE-----ECHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 7763411789779984278578---889997--5588425775-----21177999999888733402331111454443 Q ss_pred CHHHHHHHHHHHHCCCCCCCCCCHHH--------HCCCCHHHHHHHHHHHH---HCCCEECCCCCHHHHHHHH Q ss_conf 75899999886313234687431011--------01233046677988875---1695443643244775411 Q gi|254780999|r 354 PIHDLVNHFFSLMSQFLQEEPSGIVA--------ENIQSRIRGNILMALSN---HSKAMLLTTSNKSEISVGY 415 (562) Q Consensus 354 ~I~~~~~~~~~~~~~~~~~~~~~~~~--------eN~qaR~R~~~L~~~an---~~~~lvl~t~nksE~~~Gy 415 (562) .|+.+.+.. ......|+.. .|+|+ ++-..|+ -.|-|+++|=||+=.+-.+ T Consensus 146 ~~E~l~~~~-------h~~~~FD~V~~mEvlEHV~dp~~-----f~~~c~~llkPgG~lF~STINRt~kS~~~ 206 (275) T TIGR01983 146 SVEELAEEQ-------HTKKSFDVVTCMEVLEHVPDPQA-----FIKACAQLLKPGGILFFSTINRTPKSYLL 206 (275) T ss_pred HHHHHHCCC-------CCCCCCCEEEEEEEEECCCCHHH-----HHHHHHHHCCCCCCEEEECCCHHHHHHHH T ss_conf 078873055-------78415733764320000278889-----99999985089984897300021899999 No 254 >TIGR02713 allophanate_hyd allophanate hydrolase; InterPro: IPR014085 Allophanate hydrolase catalyses the second reaction in an ATP-dependent two-step degradation of urea to ammonia and C02, following the action of the biotin-containing urea carboxylase. The yeast enzyme, a fusion of allophanate hydrolase to urea carboxylase, is designated urea amidolyase.. Probab=20.17 E-value=43 Score=13.22 Aligned_cols=16 Identities=19% Similarity=0.092 Sum_probs=7.7 Q ss_pred CCCEECC-CCCHHHHHH Q ss_conf 6954436-432447754 Q gi|254780999|r 398 SKAMLLT-TSNKSEISV 413 (562) Q Consensus 398 ~~~lvl~-t~nksE~~~ 413 (562) .|.+-+| ..++++-+. T Consensus 425 ~Gal~~G~~~~~~~~~~ 441 (582) T TIGR02713 425 LGALPLGTATGESLPAA 441 (582) T ss_pred CCCCCCCCCCCCCCCCC T ss_conf 67655886557420014 No 255 >TIGR02691 arsC_pI258_fam arsenate reductase (thioredoxin); InterPro: IPR014064 This family describes the well-studied thioredoxin-dependent arsenate reductase of Staphylococcus aureus plasmid pI258 and other mechanistically similar arsenate reductases. The mechanism involves an intramolecular disulphide bond cascade, and aligned members of this family have four absolutely conserved Cys residues. This group of arsenate reductases belongs to the low-molecular weight protein-tyrosine phosphatase family (IPR000106 from INTERPRO), as does a group of glutathione/glutaredoxin type arsenate reductases (IPR014062 from INTERPRO). At least two other, non-homologous groups of arsenate reductases involved in arsenical resistance are also known. This enzyme reduces arsenate to arsenite, which may be more toxic but which is more easily exported.. Probab=20.09 E-value=41 Score=13.37 Aligned_cols=19 Identities=37% Similarity=0.688 Sum_probs=11.1 Q ss_pred ECCCCC--HHHHHHHHCCCCC Q ss_conf 436432--4477541102366 Q gi|254780999|r 402 LLTTSN--KSEISVGYGTLYG 420 (562) Q Consensus 402 vl~t~n--ksE~~~Gy~T~~G 420 (562) +|+||| ||..|=||.-.|. T Consensus 3 FlCTGNSCRSQmAEG~gk~~L 23 (129) T TIGR02691 3 FLCTGNSCRSQMAEGWGKKYL 23 (129) T ss_pred EEECCCCCHHHHHHHHHHHHC T ss_conf 230275305577799998746 Done!