Query gi|254780653|ref|YP_003065066.1| phosphoglycerate kinase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 400 No_of_seqs 131 out of 2640 Neff 5.9 Searched_HMMs 39220 Date Sun May 29 21:04:06 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780653.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK13962 bifunctional phosphog 100.0 0 0 1078.6 33.8 395 2-398 1-396 (653) 2 PRK00073 pgk phosphoglycerate 100.0 0 0 1065.8 34.0 390 4-398 1-391 (391) 3 cd00318 Phosphoglycerate_kinas 100.0 0 0 1053.7 33.7 390 7-398 1-397 (397) 4 PTZ00005 phosphoglycerate kina 100.0 0 0 1026.4 35.0 397 1-399 5-419 (419) 5 COG0126 Pgk 3-phosphoglycerate 100.0 0 0 1024.6 31.1 389 1-399 1-393 (395) 6 pfam00162 PGK Phosphoglycerate 100.0 0 0 1020.9 32.5 378 6-388 1-383 (383) 7 KOG1367 consensus 100.0 0 0 909.8 21.7 394 3-399 5-415 (416) 8 PRK09206 pyruvate kinase; Prov 95.9 0.17 4.3E-06 30.1 10.1 308 42-371 16-396 (470) 9 PRK05826 pyruvate kinase; Prov 93.4 0.7 1.8E-05 26.1 10.9 308 42-371 18-399 (461) 10 PRK03803 murD UDP-N-acetylmura 89.2 1.9 4.9E-05 23.3 7.1 60 169-231 92-152 (448) 11 COG2511 GatE Archaeal Glu-tRNA 88.7 1.4 3.7E-05 24.1 5.7 192 14-219 170-402 (631) 12 PRK08125 bifunctional UDP-gluc 87.7 2.4 6.1E-05 22.7 6.9 64 43-111 13-77 (660) 13 PRK00005 fmt methionyl-tRNA fo 86.2 2.9 7.3E-05 22.1 7.0 69 40-110 10-79 (309) 14 PRK00683 murD UDP-N-acetylmura 83.4 3.8 9.8E-05 21.3 6.3 62 170-231 83-145 (418) 15 PRK00192 mannosyl-3-phosphogly 83.3 1.8 4.6E-05 23.4 4.0 53 17-82 6-59 (275) 16 COG0223 Fmt Methionyl-tRNA for 82.9 4 0.0001 21.2 6.1 77 41-150 12-90 (307) 17 TIGR01825 gly_Cac_T_rel pyrido 81.3 1 2.6E-05 25.0 2.1 79 181-266 255-354 (392) 18 PRK06988 putative formyltransf 79.5 5.3 0.00013 20.4 6.6 66 40-110 12-78 (313) 19 PRK02006 murD UDP-N-acetylmura 76.9 6.3 0.00016 20.0 6.6 59 170-228 97-162 (501) 20 PRK00771 signal recognition pa 76.9 6.3 0.00016 20.0 10.2 191 16-235 36-264 (433) 21 PRK10867 signal recognition pa 76.7 6.3 0.00016 19.9 10.7 192 16-235 35-270 (453) 22 cd06556 ICL_KPHMT Members of t 76.0 6.5 0.00017 19.8 4.9 31 311-341 204-236 (240) 23 TIGR00873 gnd 6-phosphoglucona 75.5 6.8 0.00017 19.7 5.2 63 146-221 36-100 (480) 24 PRK06719 precorrin-2 dehydroge 75.1 4.7 0.00012 20.8 3.9 43 1-60 1-43 (157) 25 pfam00224 PK Pyruvate kinase, 75.0 7 0.00018 19.6 4.9 103 42-149 16-133 (348) 26 COG0469 PykF Pyruvate kinase [ 73.2 5.8 0.00015 20.2 4.0 306 42-369 19-399 (477) 27 PRK02472 murD UDP-N-acetylmura 72.3 8.2 0.00021 19.2 6.6 34 11-59 5-38 (450) 28 PRK04308 murD UDP-N-acetylmura 71.5 8.5 0.00022 19.1 6.3 62 169-230 90-152 (445) 29 TIGR01697 PNPH-PUNA-XAPA inosi 71.1 5 0.00013 20.6 3.3 123 52-183 53-194 (266) 30 PRK06354 pyruvate kinase; Prov 69.6 8.3 0.00021 19.2 4.2 308 42-371 22-402 (589) 31 TIGR02460 osmo_MPGsynth mannos 69.1 8.2 0.00021 19.2 4.0 210 9-260 50-290 (394) 32 COG0703 AroK Shikimate kinase 67.9 5.7 0.00014 20.2 3.0 30 54-94 3-32 (172) 33 PRK06247 pyruvate kinase; Prov 67.9 10 0.00026 18.6 4.5 302 42-371 19-395 (477) 34 pfam08645 PNK3P Polynucleotide 66.8 11 0.00027 18.5 4.6 23 41-63 33-55 (158) 35 TIGR02584 cas_NE0113 CRISPR-as 66.3 9 0.00023 18.9 3.8 41 334-374 119-168 (224) 36 pfam02548 Pantoate_transf Keto 66.1 7.1 0.00018 19.6 3.2 133 175-340 95-241 (261) 37 PRK03918 chromosome segregatio 66.0 9.7 0.00025 18.7 3.9 10 333-342 833-842 (882) 38 PRK00311 panB 3-methyl-2-oxobu 65.8 9.7 0.00025 18.7 3.9 135 173-340 94-242 (266) 39 PRK06124 gluconate 5-dehydroge 63.5 12 0.00031 18.1 6.3 67 1-92 1-67 (259) 40 PRK10530 phosphotransferase; P 61.7 10 0.00026 18.6 3.4 46 30-83 11-58 (272) 41 TIGR02094 more_P_ylases alpha- 61.6 7.1 0.00018 19.6 2.6 60 153-212 399-463 (618) 42 cd06557 KPHMT-like Ketopantoat 61.5 11 0.00027 18.5 3.4 134 174-337 90-234 (254) 43 TIGR01064 pyruv_kin pyruvate k 59.9 11 0.00028 18.4 3.2 305 42-371 33-432 (513) 44 TIGR00088 trmD tRNA (guanine-N 59.1 14 0.00035 17.7 3.7 46 21-67 48-95 (247) 45 TIGR02463 MPGP_rel mannosyl-3- 58.8 8.1 0.00021 19.2 2.5 67 29-121 6-75 (224) 46 PTZ00066 pyruvate kinase; Prov 58.8 14 0.00035 17.8 3.6 309 42-371 52-437 (513) 47 TIGR01163 rpe ribulose-phospha 58.6 15 0.00038 17.5 5.5 159 35-272 10-180 (216) 48 PRK06718 precorrin-2 dehydroge 58.2 14 0.00036 17.7 3.6 35 11-60 6-40 (202) 49 KOG2007 consensus 56.7 7.1 0.00018 19.6 1.9 97 53-154 62-188 (586) 50 TIGR01179 galE UDP-glucose 4-e 56.7 16 0.00041 17.3 4.5 241 44-345 14-285 (341) 51 PRK13776 formimidoylglutamase; 56.5 14 0.00037 17.6 3.4 39 298-339 263-304 (311) 52 COG0066 LeuD 3-isopropylmalate 56.4 3.5 8.9E-05 21.6 0.2 55 114-174 41-104 (191) 53 cd04299 GT1_Glycogen_Phosphory 55.5 11 0.00028 18.3 2.7 62 152-213 474-539 (778) 54 cd04240 AAK_UC AAK_UC: Unchara 53.5 18 0.00046 17.0 7.4 171 187-392 23-200 (203) 55 PRK01710 murD UDP-N-acetylmura 52.0 19 0.00048 16.9 5.9 41 3-60 4-44 (458) 56 TIGR00705 SppA_67K signal pept 51.8 19 0.00049 16.8 4.6 51 310-360 320-376 (614) 57 PRK05057 aroK shikimate kinase 51.0 17 0.00043 17.2 3.0 36 52-96 3-38 (172) 58 COG4100 Cystathionine beta-lya 49.4 19 0.00048 16.9 3.1 80 13-103 127-216 (416) 59 TIGR01678 FAD_lactone_ox sugar 48.7 18 0.00046 17.0 2.9 69 252-321 337-415 (505) 60 COG1518 CRISPR-associated prot 47.7 7.5 0.00019 19.5 0.8 188 13-242 17-221 (327) 61 pfam02836 Glyco_hydro_2_C Glyc 46.6 23 0.00059 16.3 3.3 146 12-193 9-157 (297) 62 cd03280 ABC_MutS2 MutS2 homolo 45.9 18 0.00045 17.0 2.5 34 172-205 91-128 (200) 63 TIGR02076 pyrH_arch uridylate 45.7 24 0.0006 16.2 3.6 32 179-210 26-59 (232) 64 cd04256 AAK_P5CS_ProBA AAK_P5C 45.3 24 0.00061 16.2 5.4 50 14-63 7-57 (284) 65 PRK05429 gamma-glutamyl kinase 45.2 24 0.00062 16.2 5.4 211 14-239 7-259 (372) 66 PRK09954 hypothetical protein; 44.9 24 0.00062 16.2 5.8 29 78-107 99-131 (362) 67 PRK08339 short chain dehydroge 44.7 25 0.00063 16.1 7.2 67 1-97 1-68 (263) 68 PRK13946 shikimate kinase; Pro 44.5 25 0.00063 16.1 3.2 32 52-94 19-50 (195) 69 TIGR00099 Cof-subfamily Cof-li 44.3 25 0.00063 16.1 3.7 49 30-82 7-58 (270) 70 PTZ00063 histone deacetylase; 43.9 25 0.00064 16.1 5.3 18 41-58 114-131 (439) 71 COG0419 SbcC ATPase involved i 43.6 26 0.00065 16.0 4.2 11 313-323 843-853 (908) 72 TIGR01317 GOGAT_sm_gam glutama 43.3 11 0.00028 18.4 1.1 79 176-275 128-226 (517) 73 PRK03669 mannosyl-3-phosphogly 43.0 26 0.00066 16.0 3.7 63 16-95 6-71 (271) 74 PRK01158 phosphoglycolate phos 42.8 24 0.00061 16.2 2.8 23 41-65 24-46 (226) 75 PRK07847 amidophosphoribosyltr 42.4 27 0.00068 15.9 3.8 129 103-261 209-371 (489) 76 PRK06197 short chain dehydroge 42.3 27 0.00068 15.9 4.5 125 6-176 7-134 (306) 77 PRK03815 murD UDP-N-acetylmura 41.9 27 0.00069 15.9 6.2 38 191-229 91-128 (401) 78 cd00288 Pyruvate_Kinase Pyruva 41.4 28 0.0007 15.8 4.5 304 42-367 16-397 (480) 79 PRK08305 spoVFB dipicolinate s 41.0 28 0.00071 15.8 5.2 67 12-91 2-70 (195) 80 cd03243 ABC_MutS_homologs The 40.8 22 0.00056 16.5 2.3 17 172-188 91-107 (202) 81 cd00079 HELICc Helicase superf 40.7 28 0.00072 15.7 6.4 69 34-112 9-78 (131) 82 PRK13539 cytochrome c biogenes 40.4 29 0.00073 15.7 4.3 61 188-264 121-181 (206) 83 TIGR02700 flavo_MJ0208 archaeo 40.4 5.9 0.00015 20.1 -0.6 50 211-269 83-133 (237) 84 TIGR01292 TRX_reduct thioredox 40.3 27 0.00068 15.9 2.7 125 144-305 111-256 (321) 85 pfam05445 Pox_ser-thr_kin Poxv 39.6 12 0.00029 18.2 0.7 66 125-206 196-262 (434) 86 COG2071 Predicted glutamine am 39.5 30 0.00075 15.6 3.2 17 206-222 53-69 (243) 87 TIGR01700 PNPH purine nucleosi 39.0 30 0.00077 15.6 3.5 94 52-152 49-151 (259) 88 PRK08227 aldolase; Validated 38.5 15 0.00038 17.6 1.1 50 179-242 210-259 (291) 89 PRK10976 putative sugar phosph 38.2 31 0.00079 15.5 3.3 45 30-82 10-56 (266) 90 smart00534 MUTSac ATPase domai 37.5 31 0.0008 15.4 2.7 13 174-186 63-75 (185) 91 PRK12314 gamma-glutamyl kinase 37.0 32 0.00082 15.4 7.1 52 14-65 8-63 (265) 92 PRK07534 methionine synthase I 36.8 32 0.00083 15.4 5.0 16 47-62 52-67 (335) 93 pfam02641 DUF190 Uncharacteriz 36.7 22 0.00057 16.4 1.8 32 21-52 59-90 (101) 94 PRK10416 cell division protein 36.7 33 0.00083 15.3 11.3 77 186-264 291-382 (499) 95 KOG4105 consensus 36.5 11 0.00027 18.5 0.2 19 140-158 7-25 (141) 96 cd00209 DHFR Dihydrofolate red 36.2 32 0.00083 15.4 2.6 25 191-221 93-117 (158) 97 PRK00131 aroK shikimate kinase 35.6 34 0.00086 15.2 4.2 32 52-94 3-34 (175) 98 PRK09604 putative DNA-binding/ 35.5 34 0.00087 15.2 3.8 58 37-98 50-114 (335) 99 TIGR01752 flav_long flavodoxin 34.9 35 0.00089 15.2 4.2 15 5-20 78-92 (176) 100 TIGR01923 menE O-succinylbenzo 34.8 11 0.00028 18.4 -0.0 81 132-226 208-298 (490) 101 PRK07424 bifunctional sterol d 34.6 35 0.0009 15.1 3.1 40 12-65 177-216 (410) 102 TIGR01196 edd phosphogluconate 34.5 30 0.00077 15.5 2.2 38 56-95 479-517 (614) 103 TIGR03590 PseG pseudaminic aci 34.5 35 0.0009 15.1 11.5 40 17-62 2-41 (280) 104 cd04254 AAK_UMPK-PyrH-Ec UMP k 34.1 36 0.00091 15.1 5.3 45 16-60 1-46 (231) 105 PRK10769 folA dihydrofolate re 33.8 35 0.00088 15.2 2.4 26 190-221 89-114 (159) 106 pfam11004 DUF2843 Protein of u 33.5 29 0.00073 15.7 2.0 25 98-122 8-32 (281) 107 COG0626 MetC Cystathionine bet 33.5 37 0.00093 15.0 5.2 69 41-119 89-158 (396) 108 TIGR01487 SPP-like SPP-like hy 33.5 37 0.00093 15.0 5.4 74 29-128 8-88 (223) 109 TIGR00338 serB phosphoserine p 33.2 37 0.00094 15.0 4.0 43 41-95 90-134 (223) 110 PRK13542 consensus 33.1 37 0.00095 15.0 4.4 15 13-27 28-42 (224) 111 KOG0875 consensus 33.0 23 0.00059 16.3 1.4 40 139-178 70-110 (264) 112 PRK07272 amidophosphoribosyltr 32.8 38 0.00096 14.9 4.3 128 104-261 217-379 (484) 113 PRK01438 murD UDP-N-acetylmura 32.6 38 0.00096 14.9 5.7 61 170-230 101-164 (481) 114 KOG3087 consensus 32.5 15 0.00039 17.4 0.5 50 138-189 172-225 (229) 115 TIGR01318 gltD_gamma_fam gluta 32.5 20 0.00052 16.6 1.1 140 181-360 137-305 (480) 116 cd07101 ALDH_SSADH2_GabD2 Myco 31.9 39 0.00099 14.8 4.8 51 214-264 262-326 (454) 117 COG1509 KamA Lysine 2,3-aminom 31.8 39 0.00099 14.8 4.5 40 178-217 148-190 (369) 118 pfam00946 Paramyx_RNA_pol Para 31.7 39 0.001 14.8 3.4 44 313-358 672-718 (1161) 119 PRK10640 rhaB rhamnulokinase; 31.7 25 0.00064 16.1 1.4 68 328-395 356-431 (471) 120 PRK01045 ispH 4-hydroxy-3-meth 31.4 40 0.001 14.8 5.0 23 43-65 103-125 (304) 121 PRK00694 4-hydroxy-3-methylbut 31.4 40 0.001 14.8 6.2 241 1-257 1-293 (606) 122 COG1648 CysG Siroheme synthase 30.8 40 0.001 14.7 3.8 42 1-60 1-42 (210) 123 TIGR03641 cas1_HMARI CRISPR-as 30.5 28 0.00072 15.7 1.5 46 26-71 29-75 (322) 124 TIGR00612 ispG_gcpE 4-hydroxy- 30.5 22 0.00057 16.4 1.0 271 34-341 34-374 (633) 125 COG0334 GdhA Glutamate dehydro 30.5 41 0.001 14.7 3.4 26 191-218 209-234 (411) 126 TIGR02180 GRX_euk Glutaredoxin 30.5 40 0.001 14.8 2.3 51 316-367 24-80 (85) 127 PRK13402 gamma-glutamyl kinase 30.4 41 0.001 14.7 5.8 212 12-239 2-252 (363) 128 COG2453 CDC14 Predicted protei 30.1 42 0.0011 14.7 3.7 33 26-58 78-110 (180) 129 KOG0584 consensus 30.1 37 0.00095 15.0 2.1 10 156-165 202-211 (632) 130 PRK13306 ulaD 3-keto-L-gulonat 29.8 42 0.0011 14.6 5.0 94 130-258 119-214 (216) 131 TIGR01249 pro_imino_pep_1 prol 29.5 17 0.00042 17.2 0.2 30 119-148 204-236 (310) 132 pfam05496 RuvB_N Holliday junc 29.1 43 0.0011 14.5 5.0 24 62-91 56-79 (234) 133 TIGR01393 lepA GTP-binding pro 28.9 11 0.00029 18.3 -0.8 63 121-188 301-371 (598) 134 PRK09344 phosphoenolpyruvate c 28.7 21 0.00053 16.6 0.6 49 108-161 71-120 (525) 135 cd03286 ABC_MSH6_euk MutS6 hom 28.7 37 0.00095 15.0 1.9 22 173-194 93-114 (218) 136 pfam00186 DHFR_1 Dihydrofolate 28.4 42 0.0011 14.6 2.1 25 191-221 91-115 (159) 137 PRK04690 murD UDP-N-acetylmura 28.2 45 0.0011 14.4 6.9 38 11-63 4-41 (468) 138 PRK06935 2-deoxy-D-gluconate 3 28.2 45 0.0011 14.4 5.1 40 7-61 8-47 (258) 139 PRK07631 amidophosphoribosyltr 27.9 45 0.0012 14.4 4.4 15 104-118 215-229 (475) 140 PRK10637 cysG siroheme synthas 27.9 45 0.0012 14.4 3.7 36 10-60 7-42 (457) 141 PRK13772 formimidoylglutamase; 27.4 46 0.0012 14.4 4.1 40 298-340 265-307 (313) 142 TIGR01826 CofD_related conserv 27.4 46 0.0012 14.3 5.1 50 43-95 211-261 (331) 143 COG0362 Gnd 6-phosphogluconate 27.3 47 0.0012 14.3 3.9 150 68-230 51-206 (473) 144 PRK08341 amidophosphoribosyltr 27.1 47 0.0012 14.3 3.2 15 105-119 205-219 (442) 145 COG0396 sufC Cysteine desulfur 27.0 47 0.0012 14.3 3.0 49 4-54 3-54 (251) 146 TIGR02534 mucon_cyclo muconate 26.9 47 0.0012 14.3 5.5 190 24-245 116-330 (369) 147 KOG0730 consensus 26.8 47 0.0012 14.3 3.4 35 309-352 465-499 (693) 148 cd04729 NanE N-acetylmannosami 26.6 48 0.0012 14.3 4.0 114 24-144 59-180 (219) 149 TIGR01527 arch_NMN_Atrans nico 26.3 48 0.0012 14.2 2.2 20 204-223 16-35 (171) 150 cd01563 Thr-synth_1 Threonine 26.2 49 0.0012 14.2 7.3 111 13-146 35-146 (324) 151 PRK07349 amidophosphoribosyltr 26.1 49 0.0012 14.2 3.2 129 103-261 238-401 (495) 152 pfam02401 LYTB LytB protein. T 26.0 49 0.0012 14.2 5.1 60 329-388 191-254 (280) 153 PRK05337 beta-hexosaminidase; 26.0 49 0.0012 14.2 7.8 14 168-181 101-114 (336) 154 TIGR01027 proB glutamate 5-kin 25.7 50 0.0013 14.2 5.5 202 16-240 1-262 (379) 155 TIGR01098 3A0109s03R phosphona 25.5 39 0.001 14.8 1.5 47 73-124 68-118 (299) 156 KOG0572 consensus 25.5 50 0.0013 14.1 3.5 26 194-219 268-297 (474) 157 pfam00488 MutS_V MutS domain V 25.3 51 0.0013 14.1 2.1 22 173-194 105-126 (234) 158 pfam06941 NT5C 5' nucleotidase 25.3 51 0.0013 14.1 4.5 44 39-86 75-118 (191) 159 TIGR03640 cas1_DVULG CRISPR-as 25.2 51 0.0013 14.1 2.8 62 13-74 17-83 (340) 160 cd03115 SRP The signal recogni 25.2 51 0.0013 14.1 6.9 133 62-220 6-147 (173) 161 cd03284 ABC_MutS1 MutS1 homolo 25.1 51 0.0013 14.1 2.5 23 173-195 93-115 (216) 162 COG4678 Muramidase (phage lamb 24.9 38 0.00096 14.9 1.3 42 161-202 18-66 (180) 163 PRK13540 cytochrome c biogenes 24.7 52 0.0013 14.0 4.3 16 12-27 10-25 (200) 164 PRK00625 shikimate kinase; Pro 24.7 18 0.00047 17.0 -0.3 53 175-231 62-114 (173) 165 pfam09916 DUF2145 Uncharacteri 24.5 31 0.00079 15.5 0.9 27 48-74 2-28 (200) 166 PRK11366 puuD gamma-glutamyl-g 24.5 52 0.0013 14.0 2.9 22 176-198 96-117 (254) 167 PRK05733 single-stranded DNA-b 24.3 52 0.0013 14.0 1.9 72 54-125 6-89 (172) 168 cd03238 ABC_UvrA The excision 24.2 53 0.0013 14.0 4.2 26 122-147 88-115 (176) 169 PRK13480 3'-5' exoribonuclease 24.2 53 0.0013 14.0 3.2 17 167-183 155-171 (314) 170 COG0313 Predicted methyltransf 24.1 53 0.0014 14.0 8.6 150 38-200 62-228 (275) 171 TIGR01316 gltA glutamate synth 24.0 53 0.0014 13.9 3.1 79 131-223 156-239 (462) 172 COG0737 UshA 5'-nucleotidase/2 24.0 53 0.0014 13.9 3.5 30 38-67 188-218 (517) 173 PRK05329 anaerobic glycerol-3- 23.8 41 0.0011 14.7 1.4 28 37-64 82-118 (425) 174 pfam03808 Glyco_tran_WecB Glyc 23.6 54 0.0014 13.9 5.7 140 210-390 2-145 (172) 175 TIGR01497 kdpB K+-transporting 23.6 43 0.0011 14.5 1.4 135 138-278 233-378 (675) 176 PRK06751 single-stranded DNA-b 23.4 43 0.0011 14.6 1.4 64 54-117 2-73 (172) 177 PRK13948 shikimate kinase; Pro 23.4 19 0.00047 16.9 -0.5 29 55-94 12-40 (182) 178 COG3962 Acetolactate synthase 23.4 55 0.0014 13.9 3.2 41 174-214 216-256 (617) 179 PRK00421 murC UDP-N-acetylmura 23.3 55 0.0014 13.9 7.4 58 138-196 155-218 (459) 180 PTZ00340 O-sialoglycoprotein e 23.2 55 0.0014 13.9 3.8 14 186-199 129-142 (348) 181 TIGR02707 butyr_kinase butyrat 23.1 29 0.00075 15.6 0.5 195 13-224 17-305 (353) 182 PRK08945 short chain dehydroge 23.1 56 0.0014 13.8 6.9 64 1-89 1-64 (245) 183 PRK13774 formimidoylglutamase; 23.1 56 0.0014 13.8 3.7 43 168-213 97-143 (311) 184 PRK11031 guanosine pentaphosph 23.0 47 0.0012 14.3 1.5 73 15-95 35-113 (494) 185 pfam10652 DUF2480 Protein of u 23.0 48 0.0012 14.3 1.6 31 191-221 123-157 (167) 186 PRK09246 amidophosphoribosyltr 22.9 56 0.0014 13.8 4.1 15 103-117 218-232 (503) 187 PRK07772 single-stranded DNA-b 22.8 46 0.0012 14.4 1.5 64 54-117 5-80 (183) 188 PRK12702 mannosyl-3-phosphogly 22.6 57 0.0014 13.8 2.7 66 30-121 9-76 (302) 189 pfam01867 Cas_Cas1 CRISPR asso 22.6 51 0.0013 14.1 1.6 61 13-73 13-78 (319) 190 cd07090 ALDH_F9_TMBADH NAD+-de 22.5 36 0.00092 15.1 0.9 50 214-263 261-324 (457) 191 COG3010 NanE Putative N-acetyl 22.4 57 0.0015 13.7 4.4 114 20-145 61-184 (229) 192 PRK08883 ribulose-phosphate 3- 22.2 58 0.0015 13.7 5.6 79 180-271 99-177 (220) 193 PRK06196 oxidoreductase; Provi 22.1 58 0.0015 13.7 4.0 39 7-60 19-57 (316) 194 pfam06838 Alum_res Aluminium r 21.8 59 0.0015 13.7 3.5 19 218-236 284-302 (405) 195 COG4531 ZnuA ABC-type Zn2+ tra 21.5 57 0.0014 13.8 1.7 43 295-346 240-282 (318) 196 COG0541 Ffh Signal recognition 21.4 60 0.0015 13.6 11.0 191 16-235 35-269 (451) 197 TIGR00078 nadC nicotinate-nucl 21.3 45 0.0012 14.4 1.2 134 189-377 120-261 (276) 198 cd06542 GH18_EndoS-like Endo-b 21.3 60 0.0015 13.6 6.3 106 41-152 51-180 (255) 199 COG4122 Predicted O-methyltran 20.8 33 0.00084 15.3 0.4 56 201-256 45-108 (219) 200 COG2256 MGS1 ATPase related to 20.8 62 0.0016 13.5 2.7 55 32-95 25-81 (436) 201 TIGR01511 ATPase-IB1_Cu copper 20.7 62 0.0016 13.5 6.0 15 112-126 326-340 (545) 202 cd03287 ABC_MSH3_euk MutS3 hom 20.6 62 0.0016 13.5 2.3 13 173-185 94-106 (222) 203 pfam01293 PEPCK_ATP Phosphoeno 20.6 36 0.00092 15.0 0.6 30 132-162 79-108 (451) 204 PRK01722 formimidoylglutamase; 20.5 63 0.0016 13.5 3.3 32 180-214 110-141 (320) 205 TIGR03639 cas1_NMENI CRISPR-as 20.5 32 0.00081 15.4 0.2 59 13-71 14-77 (278) 206 PRK02048 4-hydroxy-3-methylbut 20.3 63 0.0016 13.5 9.5 235 17-265 11-297 (613) 207 PRK06739 pyruvate kinase; Vali 20.3 63 0.0016 13.5 4.9 95 42-147 15-121 (352) 208 TIGR00576 dut dUTP diphosphata 20.2 39 0.00098 14.9 0.6 67 259-330 41-110 (151) 209 TIGR02874 spore_ytfJ sporulati 20.1 64 0.0016 13.5 1.8 46 263-312 85-131 (132) 210 PTZ00311 phosphoenolpyruvate c 20.1 36 0.00093 15.0 0.5 45 118-162 92-137 (543) 211 PRK00439 leuD 3-isopropylmalat 20.1 28 0.00072 15.8 -0.1 49 129-178 39-94 (163) 212 TIGR01704 MTA/SAH-Nsdase MTA/S 20.0 64 0.0016 13.4 4.0 142 53-205 21-203 (229) 213 PRK03731 aroL shikimate kinase 20.0 24 0.00062 16.2 -0.5 33 55-96 4-36 (172) No 1 >PRK13962 bifunctional phosphoglycerate kinase/triosephosphate isomerase; Provisional Probab=100.00 E-value=0 Score=1078.55 Aligned_cols=395 Identities=42% Similarity=0.735 Sum_probs=388.0 Q ss_pred CCCCCCHHHHCCCCCEEEEEEECCCCC-CCCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHH Q ss_conf 888771010015996899982016713-1857772589999889899999889809999137788898473322478888 Q gi|254780653|r 2 SRLRTMNDLRDIRGLRCLLRVDWNVPF-IDGKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVS 80 (400) Q Consensus 2 ~~~~ti~d~~d~~gk~VlvRvD~NvPi-~~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~ 80 (400) |+.|||+|+ |++|||||+|||||||+ .||+|+|++||++++|||++|+++||||||+||||||+|++++++||+||++ T Consensus 1 m~~ktl~D~-d~~gK~VllRvDlNVPi~~~g~I~Dd~RI~~~lpTI~~ll~~gakvvi~SHlGRPkG~~d~~~SL~pva~ 79 (653) T PRK13962 1 MEKRTIDDI-DLKGKKVFVRVDFNVPLDEDGNITDDTRIRAALPTIKYALDRGAKVILLSHLGRPKGEVDPKYSLAPVAK 79 (653) T ss_pred CCCCCHHHC-CCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHH T ss_conf 997832001-7689989999255687589790897889999999999999788979998068899999898768599999 Q ss_pred HHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCHHHHCCCCCCCHHHHHHHHCCCCCEEEECCHHHHCCCCCCEEE Q ss_conf 76431013354566631025566553136464898242430343210002466531243642475440440113763330 Q gi|254780653|r 81 IAESILHKNILFVNDCIGSTLSQSIASLSEGGIILAENVRFYSEEERNDPDFVRMLSRNGDFYINDAFSVSHRAHASITG 160 (400) Q Consensus 81 ~l~~~l~~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~~n~~~f~k~La~laDiyVnDAF~~aHR~haS~~g 160 (400) +|+++|+++|.|++||+|+.+++++++|++|+|+||||+|||+||++||++|+++||+++|+|||||||||||+|||++| T Consensus 80 ~L~~~l~~~V~f~~d~~g~~~~~~i~~l~~G~I~LLENvRF~~~E~~nd~~fa~~LA~l~DiyVnDAFg~aHR~haS~~g 159 (653) T PRK13962 80 RLSELLGKEVAFADDVIGEEVRKAVEQLKEGDVLLLENVRFEKGETKNDPEFAKRLASLADIYVNDAFGTAHRAHASVVG 159 (653) T ss_pred HHHHHHCCCCEECCCCCCHHHHHHHHCCCCCCEEEEEECCCCCCHHCCCHHHHHHHHHHCCEEEECCCHHHHHCCCCCCC T ss_conf 99999789728577678889999996379995899863686831102899999999864789862331033421554115 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCCEEEECHHHHHHHHHHHCCCCCCCCC Q ss_conf 23321000332556577765331016678718999607802557889987730688886205789999983014343333 Q gi|254780653|r 161 LSHLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKKVDKLVIGGGMANSFLVAQGMGVGRSLC 240 (400) Q Consensus 161 i~~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~D~iiigG~lantfL~A~G~~IG~Sl~ 240 (400) ||+++|+|||+|||+|+++|++++++|+||+++||||||+||||++|++|++++|+|++||+||||||+|+|++||+|++ T Consensus 160 i~~~lps~aG~L~ekEl~~L~k~l~~p~rP~~aIiGGAKIsdKI~vi~~Ll~kvD~IiiGGgmAntFL~A~G~~IGkSl~ 239 (653) T PRK13962 160 IALFLPSVAGFLMEKEIEFLRKATENPERPYVVILGGAKVSDKIGVITNLLEKVDKILIGGAMAFTFLKALGYEVGKSLV 239 (653) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHHHCCEEEECHHHHHHHHHHCCCCCCCCCC T ss_conf 35568441328899999999999629898739998078720589999999987899998229999999977997786654 Q ss_pred CCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHHCCCCEEEEECC Q ss_conf 10000135688998753024554303541134323233331025321201334576520035677764057767999766 Q gi|254780653|r 241 QRDFSDNVHQIAWEAKRSACEIIVPRDVVVAREMKTGIPTQVVSAQSVPLDSIILDVGFKTVEYIKQVIAQARTVMWNGP 320 (400) Q Consensus 241 e~~~~~~ak~i~~~a~~~~~~I~lP~D~~v~~~~~~~~~~~~~~~~~i~~~~~i~DIGp~Ti~~~~~~I~~AktI~WNGP 320 (400) |+++++.++++++++++++++|+||+|++|+++++.+.++.+++.++||++++++||||+|++.|+++|++|+||||||| T Consensus 240 E~~~~~~a~~il~~a~~~~~~I~lP~D~vv~~~~~~~~~~~~~~~~~Ip~~~~ilDIGp~Ti~~~~~~I~~a~TI~WNGP 319 (653) T PRK13962 240 EEDKIDAAREILAKAEEKGVELVLPVDAVAAQKFEPDAEKKVVPIDAIPEDWMGLDIGPETVELFKEKLSDAKTIVWNGP 319 (653) T ss_pred CHHHHHHHHHHHHHHHHCCCEECCCCEEEEECCCCCCCCEEEEEHHHCCCCCEEECCCHHHHHHHHHHHHHCCEEEEECC T ss_conf 43579999999999997298653763699743457888647844551899897746889999999999977999999789 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHCCCCCCCCEEECCHHHHHHHHCCCCCHHHHHHCC Q ss_conf 253227436588999999998611148858997480469989984887686468750799899874899868988407 Q gi|254780653|r 321 LGVFEIEPFDRATVEVAHYVAKLTKERRIISIAGGGDTITALAHAGISNEFTYVSTAGGAFLEWLEGKDLPGIVALSG 398 (400) Q Consensus 321 ~GvfE~~~F~~GT~~l~~~ia~~~~~~~~~sivGGGdT~aai~~~g~~~~~~hvSTgGGA~L~~L~G~~LPgi~aL~~ 398 (400) ||+||+++|+.||.++++++++++ .+++|||+|||||++|++++|+.++|+||||||||||+||||++||||+||.- T Consensus 320 ~GvFE~~~F~~GT~~ia~aia~~~-~~~~~sIvGGGDT~aAi~~~g~~~~fshvSTGGGA~Le~LeGk~LPGi~aL~d 396 (653) T PRK13962 320 MGVFEFDAFAAGTEAVAEAIAEVT-EKGAITIIGGGDSAAAVEKLGFESKFSHVSTGGGASLEFLEGKELPGIQSLED 396 (653) T ss_pred CCHHCCCCHHHHHHHHHHHHHHHH-CCCCEEEECCHHHHHHHHHCCCCCCCCEEECCHHHHHHHHCCCCCCCHHHHHC T ss_conf 645038737899999999998642-58998998827999999965986797568467899999977998775877745 No 2 >PRK00073 pgk phosphoglycerate kinase; Provisional Probab=100.00 E-value=0 Score=1065.84 Aligned_cols=390 Identities=48% Similarity=0.808 Sum_probs=383.2 Q ss_pred CCCCHHHHCCCCCEEEEEEECCCCCC-CCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHH Q ss_conf 87710100159968999820167131-85777258999988989999988980999913778889847332247888876 Q gi|254780653|r 4 LRTMNDLRDIRGLRCLLRVDWNVPFI-DGKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIA 82 (400) Q Consensus 4 ~~ti~d~~d~~gk~VlvRvD~NvPi~-~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l 82 (400) ||||+|+ |++||+||||||||||++ +|+|+|++||++++|||++|+++||||||+||||||+|++++++||+||+++| T Consensus 1 ~ktl~d~-~~~gK~VllRvD~NvPi~~~g~I~Dd~RI~~~lpTI~~l~~~gakvvi~SHlGRPkg~~~~~~SL~pva~~L 79 (391) T PRK00073 1 KKTLDDL-DLKGKRVLVRVDFNVPLKDNGKITDDTRIRAALPTIKYLLEKGAKVILLSHLGRPKGEGDEEFSLAPVAKRL 79 (391) T ss_pred CCCHHHC-CCCCCEEEEEECCCCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHH T ss_conf 9855564-868998999923578718999478848899989999999978998999936889999989656879999999 Q ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCHHHHCCCCCCCHHHHHHHHCCCCCEEEECCHHHHCCCCCCEEECC Q ss_conf 43101335456663102556655313646489824243034321000246653124364247544044011376333023 Q gi|254780653|r 83 ESILHKNILFVNDCIGSTLSQSIASLSEGGIILAENVRFYSEEERNDPDFVRMLSRNGDFYINDAFSVSHRAHASITGLS 162 (400) Q Consensus 83 ~~~l~~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~~n~~~f~k~La~laDiyVnDAF~~aHR~haS~~gi~ 162 (400) +++|+++|.|++||+|+.+++++++|++|+|+||||+|||+||++||++|+++||++||+|||||||||||+|||++|+| T Consensus 80 ~~~L~~~V~f~~d~~g~~~~~~i~~l~~G~I~LLEN~RF~~~E~~nd~~fak~La~l~DiyVNDAF~~sHR~haS~~gi~ 159 (391) T PRK00073 80 SELLGKPVKFVDDCIGEEAREAIAALKDGEVLLLENVRFNKGEEKNDPELAKKLASLGDVFVNDAFGTAHRAHASTVGIA 159 (391) T ss_pred HHHHCCCEEECCCCCCHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHCCEEEECCHHHHHHCCCCCCCHH T ss_conf 99861170404677887899999717899799870003566544389999999875488998443046564265200411 Q ss_pred CCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCCEEEECHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 32100033255657776533101667871899960780255788998773068888620578999998301434333310 Q gi|254780653|r 163 HLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKKVDKLVIGGGMANSFLVAQGMGVGRSLCQR 242 (400) Q Consensus 163 ~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~D~iiigG~lantfL~A~G~~IG~Sl~e~ 242 (400) +++|||||++||+|+++|++++++|+||+++||||||+||||++|++|++++|+|++||+||||||+|+|++||+|++|+ T Consensus 160 ~~lps~aG~l~ekEi~~L~~~l~~p~~P~~aIiGGaKisdKi~~i~~l~~k~D~iiigG~mantfl~a~G~~IG~sl~E~ 239 (391) T PRK00073 160 KFLPAAAGFLMEKELDALGKALENPERPFVAILGGAKVSDKIGVLENLLKKVDKLIIGGGMANTFLKAQGYNVGKSLVEE 239 (391) T ss_pred HHCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHCCCCCCCCCCCH T ss_conf 20411201899999999998743888872899827858878999999998588888772899999998398478654566 Q ss_pred CCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 00013568899875302455430354113432323333102532120133457652003567776405776799976625 Q gi|254780653|r 243 DFSDNVHQIAWEAKRSACEIIVPRDVVVAREMKTGIPTQVVSAQSVPLDSIILDVGFKTVEYIKQVIAQARTVMWNGPLG 322 (400) Q Consensus 243 ~~~~~ak~i~~~a~~~~~~I~lP~D~~v~~~~~~~~~~~~~~~~~i~~~~~i~DIGp~Ti~~~~~~I~~AktI~WNGP~G 322 (400) ++++.|+++++++++++++|++|+|++|++++..+.++++++++++|++|+++||||+|++.|+++|++|+||||||||| T Consensus 240 ~~i~~a~~il~~a~~~~~~I~lP~D~~~~~~~~~~~~~~~~~~~~i~~~~~i~DIG~~Ti~~~~~~I~~aktI~wNGP~G 319 (391) T PRK00073 240 DLIDTAKELLEKAKEKGVKIPLPVDVVVAKEFSADAEATVVPVDEIPDDWMILDIGPKTIELFAEIIKDAKTIVWNGPMG 319 (391) T ss_pred HHHHHHHHHHHHHHHCCCEEECCCEEEEEECCCCCCCEEEEECCCCCCCCEEEECCHHHHHHHHHHHHHCCEEEEECCCC T ss_conf 78999999999987438827665047762024788752686634567666798717899999999986689899978815 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHCCCCCCCCEEECCHHHHHHHHCCCCCHHHHHHCC Q ss_conf 3227436588999999998611148858997480469989984887686468750799899874899868988407 Q gi|254780653|r 323 VFEIEPFDRATVEVAHYVAKLTKERRIISIAGGGDTITALAHAGISNEFTYVSTAGGAFLEWLEGKDLPGIVALSG 398 (400) Q Consensus 323 vfE~~~F~~GT~~l~~~ia~~~~~~~~~sivGGGdT~aai~~~g~~~~~~hvSTgGGA~L~~L~G~~LPgi~aL~~ 398 (400) +||+++|+.||.++++++++ +++|||||||||++|++++|+.++|+||||||||||+||+|++||||+||++ T Consensus 320 vfE~~~F~~GT~~i~~aia~----~~~~sivGGGdT~aai~~~g~~~~~~hvSTgGGA~Le~L~G~~LPgi~aL~~ 391 (391) T PRK00073 320 VFEFENFAKGTKAVAKAIAE----STAFSIAGGGDTAAAVEKLGLADKFSHISTGGGASLEFLEGKELPGVAALEE 391 (391) T ss_pred CCCCCCHHHHHHHHHHHHHH----CCCEEEECCCHHHHHHHHCCCCCCCCEEECCHHHHHHHHCCCCCCHHHHHCC T ss_conf 43476241799999999986----7997998772899999974987798279278799999977999651565139 No 3 >cd00318 Phosphoglycerate_kinase Phosphoglycerate kinase (PGK) is a monomeric enzyme which catalyzes the transfer of the high-energy phosphate group of 1,3-bisphosphoglycerate to ADP, forming ATP and 3-phosphoglycerate. This reaction represents the first of the two substrate-level phosphorylation events in the glycolytic pathway. Substrate-level phosphorylation is defined as production of ATP by a process, which is catalyzed by water-soluble enzymes in the cytosol; not involving membranes and ion gradients. Probab=100.00 E-value=0 Score=1053.75 Aligned_cols=390 Identities=46% Similarity=0.759 Sum_probs=381.7 Q ss_pred CHHHHCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHC Q ss_conf 10100159968999820167131857772589999889899999889809999137788898473322478888764310 Q gi|254780653|r 7 MNDLRDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESIL 86 (400) Q Consensus 7 i~d~~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l 86 (400) |+|+ |++|||||||||||||+++|+|+|++||++++|||++|+++||||||+||||||+|++++++||+||+++|+++| T Consensus 1 l~d~-d~~gK~VllRvD~NvPi~~g~I~Dd~RI~~~lpTI~~ll~~gakvvi~SHlGRP~g~~~~~~SL~~va~~L~~~L 79 (397) T cd00318 1 IDDL-DLKGKRVLVRVDFNVPVDDGKITDDTRIRAALPTIKYLLEQGAKVVLLSHLGRPKGEPNEKYSLAPVAKALSELL 79 (397) T ss_pred CCCC-CCCCCEEEEEECCCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 9865-768999999953577762996897688999999999999788989998258999998797568799999999985 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCCEEECCHHHHCCCCCCC-------HHHHHHHHCCCCCEEEECCHHHHCCCCCCEE Q ss_conf 133545666310255665531364648982424303432100-------0246653124364247544044011376333 Q gi|254780653|r 87 HKNILFVNDCIGSTLSQSIASLSEGGIILAENVRFYSEEERN-------DPDFVRMLSRNGDFYINDAFSVSHRAHASIT 159 (400) Q Consensus 87 ~~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~~n-------~~~f~k~La~laDiyVnDAF~~aHR~haS~~ 159 (400) +++|.|++||+|+.+++++++|++|+|+||||+|||+||++| +++|+|+||++||+|||||||||||+|||++ T Consensus 80 ~~~V~f~~d~~g~~~~~~i~~l~~GeI~LLENvRF~~~E~~n~~~~~~~d~~fak~La~l~DiyVNDAFg~aHR~haS~~ 159 (397) T cd00318 80 GQPVTFANDCVGPEAEEAVEALKPGDVLLLENVRFYPEEEGKRDDDKEADEEFAKKLASLGDVYVNDAFGTAHRAHASMV 159 (397) T ss_pred HHHHCCCCCCCCHHHHHHHHCCCCCCEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCEEEECCHHHHHHHCCCCC T ss_conf 24212477777867999985278885898323101534201565320015999999875478987421225556253301 Q ss_pred ECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCCEEEECHHHHHHHHHHHCCCCCCCC Q ss_conf 02332100033255657776533101667871899960780255788998773068888620578999998301434333 Q gi|254780653|r 160 GLSHLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKKVDKLVIGGGMANSFLVAQGMGVGRSL 239 (400) Q Consensus 160 gi~~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~D~iiigG~lantfL~A~G~~IG~Sl 239 (400) |||+++|||||++||+|+++|++++++|+||+++||||||+||||++|++|++++|+|++||+||||||+|+|++||+|+ T Consensus 160 gi~~~lps~aG~l~ekEl~~L~~~l~~p~rP~~aIiGGaKisdKi~vi~~l~~k~D~iiiGG~mantFL~A~G~~iG~sl 239 (397) T cd00318 160 GIALLLPSAAGFLMEKELKYLAKALENPERPFVAILGGAKVSDKIQVIENLLDKVDYLIIGGGMAFTFLKAQGMDIGKSL 239 (397) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHHHHHCCEEEECHHHHHHHHHHCCCCCCCCC T ss_conf 50003741136999999999998851888974999728862157999999998478899760999999997699778655 Q ss_pred CCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHHCCCCEEEEEC Q ss_conf 31000013568899875302455430354113432323333102532120133457652003567776405776799976 Q gi|254780653|r 240 CQRDFSDNVHQIAWEAKRSACEIIVPRDVVVAREMKTGIPTQVVSAQSVPLDSIILDVGFKTVEYIKQVIAQARTVMWNG 319 (400) Q Consensus 240 ~e~~~~~~ak~i~~~a~~~~~~I~lP~D~~v~~~~~~~~~~~~~~~~~i~~~~~i~DIGp~Ti~~~~~~I~~AktI~WNG 319 (400) +|+++++.++++++++++++++|++|+|++|+.+++...++.+++.+++|++++++||||+|++.|+++|++|+|||||| T Consensus 240 ~e~~~~~~a~~il~~a~~~~~~I~lP~D~~v~~~~~~~~~~~~~~~~~i~~~~~ilDIG~~Ti~~~~~~I~~aktI~wNG 319 (397) T cd00318 240 FEEDGIELAKSLLEKAKAKGVKIVLPVDVVVADKFKADANTKVVTDDGIPDGWMGLDIGPKTIELFAEVIRKAKTIVWNG 319 (397) T ss_pred CCHHHHHHHHHHHHHHHHCCCEEECCCEEEEECCCCCCCCEEEEECCCCCCCCEECCCCHHHHHHHHHHHHHCCEEEEEC T ss_conf 67577999999999987549848687569994366788870686600266565640148999999999871188899989 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHCCCCCCCCEEECCHHHHHHHHCCCCCHHHHHHCC Q ss_conf 6253227436588999999998611148858997480469989984887686468750799899874899868988407 Q gi|254780653|r 320 PLGVFEIEPFDRATVEVAHYVAKLTKERRIISIAGGGDTITALAHAGISNEFTYVSTAGGAFLEWLEGKDLPGIVALSG 398 (400) Q Consensus 320 P~GvfE~~~F~~GT~~l~~~ia~~~~~~~~~sivGGGdT~aai~~~g~~~~~~hvSTgGGA~L~~L~G~~LPgi~aL~~ 398 (400) |||+||+++|+.||.++++++++.++ +++|||||||||++|++++|+.++|+||||||||||+||||++||||+||+. T Consensus 320 P~GvfE~~~F~~GT~~i~~~ia~~~~-~~~~sivGGGdT~aai~~~g~~~~~~hvSTgGGA~Le~LeGk~LPgieaL~~ 397 (397) T cd00318 320 PMGVFEFPAFAKGTKAIADAIAAATK-AGAFSIIGGGDTAAAAEKFGLADKISHVSTGGGASLELLEGKELPGVAALEE 397 (397) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHCC-CCCEEEEECHHHHHHHHHCCCCCCCEEEECCHHHHHHHHCCCCCCCCHHCCC T ss_conf 81423556163899999999998455-8998999586999999975986796079378399999976999650000149 No 4 >PTZ00005 phosphoglycerate kinase; Provisional Probab=100.00 E-value=0 Score=1026.40 Aligned_cols=397 Identities=42% Similarity=0.691 Sum_probs=379.7 Q ss_pred CCCCCCCHHHHCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCE-EEEEECCCCCCCCCCCCCCHHHHH Q ss_conf 9888771010015996899982016713185777258999988989999988980-999913778889847332247888 Q gi|254780653|r 1 MSRLRTMNDLRDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAK-VVIFSHLGRPQSKSDKDCSLFKVV 79 (400) Q Consensus 1 M~~~~ti~d~~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~ak-vii~SH~GRPkg~~~~~~Sl~~v~ 79 (400) |.+-.||+|+.|++||+||||||||||++||+|+|++||++++|||++|+++||| |||+||||||+|+.++++||+||+ T Consensus 5 ~~~~ltldd~~d~~gK~VllRvD~NvPi~~g~I~Dd~RI~a~lpTI~~l~~~gak~Vvl~SH~GRP~g~~~~~~SL~pva 84 (419) T PTZ00005 5 LSNKLGLDDVDDLKGKRVLMRVDFNVPIKNGVIKDTTRIKATIPTIKYLLKNGAKSIVLMSHLGRPDGRRVEKYSLKPVV 84 (419) T ss_pred CCCCCCHHHCCCCCCCEEEEEECCCCCCCCCEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCHHHHH T ss_conf 03777342213468998999934677753995897788999999999999879988999678889999779665839999 Q ss_pred HHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCHHHHCCCCCCCH---------------HHHHHHHCCCCCEEE Q ss_conf 87643101335456663102556655313646489824243034321000---------------246653124364247 Q gi|254780653|r 80 SIAESILHKNILFVNDCIGSTLSQSIASLSEGGIILAENVRFYSEEERND---------------PDFVRMLSRNGDFYI 144 (400) Q Consensus 80 ~~l~~~l~~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~~n~---------------~~f~k~La~laDiyV 144 (400) ++|+++|+++|.|++|++|+.+++++++|++|+|+||||+|||+||+.|+ ..|+++||++||+|| T Consensus 85 ~~L~~~L~~~V~f~~d~~g~~a~~~i~~l~~G~I~LLENvRF~~eEe~~~~~~~~~~~k~~~~~~~~fa~~La~laDvyV 164 (419) T PTZ00005 85 EELEKLLGKEVKFLSDCVGEEVEKECANPEEGSVILLENLRFHIEEEGKGVDAEGDKVKADDEEVQAFRNSLTKLGDVYV 164 (419) T ss_pred HHHHHHHCCCCCCCCCCCCHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHCCEEE T ss_conf 99999868666456554588999999638888589971155253212455432012102686889999999865367987 Q ss_pred ECCHHHHCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCCEEEECHHHH Q ss_conf 54404401137633302332100033255657776533101667871899960780255788998773068888620578 Q gi|254780653|r 145 NDAFSVSHRAHASITGLSHLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKKVDKLVIGGGMA 224 (400) Q Consensus 145 nDAF~~aHR~haS~~gi~~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~D~iiigG~la 224 (400) |||||||||+|||++|+ +++|+|||++||+|+++|++++++|+||+++||||||+||||++|++|++++|+|++||+|| T Consensus 165 nDAF~~aHR~haS~vgi-~~~~~~aG~l~ekEl~~L~~~l~~p~~P~~aIlGGaKisdKi~vi~~ll~k~D~iiigG~ma 243 (419) T PTZ00005 165 NDAFGTAHRAHSSMVGV-NLPQKVAGFLMKKELEYFAKALENPQRPFLAILGGAKVKDKIQLIKNLLDKVDEMIIGGGMA 243 (419) T ss_pred ECCCCCCCCCCCCEECC-CCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHHHCCEEEECCHHH T ss_conf 43201012245621031-56834132899999999999964988883899648873427999998997278699772799 Q ss_pred HHHHHH-HCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCC-CCCCEECCCCCCCCCCCHHH Q ss_conf 999998-3014343333100001356889987530245543035411343232333310-25321201334576520035 Q gi|254780653|r 225 NSFLVA-QGMGVGRSLCQRDFSDNVHQIAWEAKRSACEIIVPRDVVVAREMKTGIPTQV-VSAQSVPLDSIILDVGFKTV 302 (400) Q Consensus 225 ntfL~A-~G~~IG~Sl~e~~~~~~ak~i~~~a~~~~~~I~lP~D~~v~~~~~~~~~~~~-~~~~~i~~~~~i~DIGp~Ti 302 (400) ||||+| +|++||+|++|+++++.++++++++++++++|++|+||+|+++++.+.+.+. .+.+++|++++++||||+|+ T Consensus 244 ntfl~a~~G~~iG~sl~e~~~~~~~~~i~~~a~~~~~kI~lP~D~vv~~~~~~~~~~~~~~~~~~ip~~~~ilDIG~~Ti 323 (419) T PTZ00005 244 YTFKKVLNNMPIGSSLFDEEGAKIVKEIMEKAKEKNVKILLPVDFVVADKFSNDANIKVVTDEEGIPDGWMGLDCGPKSI 323 (419) T ss_pred HHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEECCCEEEEEECCCCCCCCEEEECCCCCCCCCCCCCCCCHHHH T ss_conf 99999966998775543445679999999998863996657568998524577787037342146886440026576679 Q ss_pred HHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHCCCCCCCCEEECCHHHHH Q ss_conf 67776405776799976625322743658899999999861114885899748046998998488768646875079989 Q gi|254780653|r 303 EYIKQVIAQARTVMWNGPLGVFEIEPFDRATVEVAHYVAKLTKERRIISIAGGGDTITALAHAGISNEFTYVSTAGGAFL 382 (400) Q Consensus 303 ~~~~~~I~~AktI~WNGP~GvfE~~~F~~GT~~l~~~ia~~~~~~~~~sivGGGdT~aai~~~g~~~~~~hvSTgGGA~L 382 (400) +.|+++|++|+|||||||||+||+++|+.||.++++++++.++ +++|||+|||||++|++++|+.++|+|||||||||| T Consensus 324 ~~~~~~I~~aktI~wNGP~G~fE~~~F~~GT~~i~~~ia~~~~-~~~~sivGGGdT~aai~~~g~~~~~~hvSTGGGA~L 402 (419) T PTZ00005 324 ENFKDVILTAKTIVWNGPLGVFEFPNFAKGSIEVLDLVVEATK-NGAITIIGGGDTASLAEKTGKANKVSHVSTGGGASL 402 (419) T ss_pred HHHHHHHHHCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCC-CCCEEEEECHHHHHHHHHCCCCCCCEEECCCHHHHH T ss_conf 9999998558989998982445556064899999999997520-799899927699999997597578207807769999 Q ss_pred HHHCCCCCHHHHHHCCC Q ss_conf 98748998689884079 Q gi|254780653|r 383 EWLEGKDLPGIVALSGH 399 (400) Q Consensus 383 ~~L~G~~LPgi~aL~~~ 399 (400) +||+|++||||+||++. T Consensus 403 ~~L~G~~LPgi~aL~~k 419 (419) T PTZ00005 403 ELLEGKELPGVVALSSK 419 (419) T ss_pred HHHCCCCCCHHHHHCCC T ss_conf 99769986346673059 No 5 >COG0126 Pgk 3-phosphoglycerate kinase [Carbohydrate transport and metabolism] Probab=100.00 E-value=0 Score=1024.58 Aligned_cols=389 Identities=49% Similarity=0.790 Sum_probs=375.3 Q ss_pred CCCCCCCHHHHCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHH Q ss_conf 98887710100159968999820167131857772589999889899999889809999137788898473322478888 Q gi|254780653|r 1 MSRLRTMNDLRDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVS 80 (400) Q Consensus 1 M~~~~ti~d~~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~ 80 (400) |+..+++.|. +++|||||||||||||+++|+|+||+||++++|||+||+++||+|||+||||||++ +++++||+||++ T Consensus 1 ~~~~~~~~d~-~~~gK~VlvRvD~NvP~~dG~I~dd~RI~a~lpTIk~l~~~ga~Vvl~SHlGRPk~-~~~~~SL~pva~ 78 (395) T COG0126 1 MMMKKTILDL-DLAGKRVLVRVDFNVPVDDGKITDDTRIRAALPTIKYLLEKGAKVVLLSHLGRPKE-YSDKTSLEPVAK 78 (395) T ss_pred CCCCCHHHHH-CCCCCEEEEEECCCCCCCCCEECCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCCC-CCCCCCHHHHHH T ss_conf 9752111332-14698899994357866698167718798716899999958995999824789899-987644799999 Q ss_pred HHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCHHHHCCCCCCCHHH----HHHHHCCCCCEEEECCHHHHCCCCC Q ss_conf 764310133545666310255665531364648982424303432100024----6653124364247544044011376 Q gi|254780653|r 81 IAESILHKNILFVNDCIGSTLSQSIASLSEGGIILAENVRFYSEEERNDPD----FVRMLSRNGDFYINDAFSVSHRAHA 156 (400) Q Consensus 81 ~l~~~l~~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~~n~~~----f~k~La~laDiyVnDAF~~aHR~ha 156 (400) +|+++|+++|.|++||+|+.+++++++|++|+|+||||+|||++|++|+++ |+|+||+|+|+|||||||||||+|| T Consensus 79 ~Ls~ll~~~V~f~~d~~g~~a~~~v~~l~~GevlLLEN~RF~~~E~~~d~~~~~~l~k~la~l~DvfVnDAFgtAHRaha 158 (395) T COG0126 79 RLSELLGKEVKFVDDCVGPEARQAVAELKDGEVLLLENVRFYSEEEKNDEEARTELVKKLASLGDVFVNDAFGTAHRAHA 158 (395) T ss_pred HHHHHCCCCEEECCCCCCHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHCCEEEECHHHHHHHHCC T ss_conf 99986299618544446879999996168986799863331565568625568999999986469887424577776254 Q ss_pred CEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCCEEEECHHHHHHHHHHHCCCCC Q ss_conf 33302332100033255657776533101667871899960780255788998773068888620578999998301434 Q gi|254780653|r 157 SITGLSHLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKKVDKLVIGGGMANSFLVAQGMGVG 236 (400) Q Consensus 157 S~~gi~~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~D~iiigG~lantfL~A~G~~IG 236 (400) |++||++++|||||+|||+|+++|.+++++|+||+++|+||||+||||.+|+||++++|+|++||+||||||+|+|++|| T Consensus 159 S~~g~~~~lps~aG~LmekEl~~L~k~l~~p~rP~vaIlGGaKVsdki~vienLl~kaD~liigGgma~tFl~A~G~~vG 238 (395) T COG0126 159 STVGFAKFLPSAAGFLMEKELDALGKALENPERPFVAILGGAKVSDKIGVIENLLKKADKLIIGGGMANTFLKAQGYDVG 238 (395) T ss_pred CHHHHHHHCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHHHCCEEEECCHHHHHHHHHHCCCCC T ss_conf 31246664425451889999999999863999865999606500008999999997517588652689999998366501 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHHCCCCEEE Q ss_conf 33331000013568899875302455430354113432323333102532120133457652003567776405776799 Q gi|254780653|r 237 RSLCQRDFSDNVHQIAWEAKRSACEIIVPRDVVVAREMKTGIPTQVVSAQSVPLDSIILDVGFKTVEYIKQVIAQARTVM 316 (400) Q Consensus 237 ~Sl~e~~~~~~ak~i~~~a~~~~~~I~lP~D~~v~~~~~~~~~~~~~~~~~i~~~~~i~DIGp~Ti~~~~~~I~~AktI~ 316 (400) +|++|.+..+.|++|+++++. +|++|+|++|+.++..+.+...++ +||++|+++||||+|++.|+++|++|+||+ T Consensus 239 ~sl~E~~~~~~Ak~ll~k~~~---~I~lPvD~~v~~~f~~~~~~~~~~--~i~~~~~~lDIGp~Ti~~~~~~i~~Aktiv 313 (395) T COG0126 239 KSLVEFDLIDGAKELLEKAKD---KIVLPVDVVVAKEFSRDAPATVKL--EIPDDLMILDIGPKTIELFAEIIKGAKTIV 313 (395) T ss_pred HHHHHHHHHHHHHHHHHHHCC---CEECCCCEEECCCCCCCCCCCCCC--CCCCCCCCCCCCHHHHHHHHHHHHHCCEEE T ss_conf 678889888899999997178---177762058705666666543346--788876131568899999999984588999 Q ss_pred EECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHCCCCCCCCEEECCHHHHHHHHCCCCCHHHHHH Q ss_conf 97662532274365889999999986111488589974804699899848876864687507998998748998689884 Q gi|254780653|r 317 WNGPLGVFEIEPFDRATVEVAHYVAKLTKERRIISIAGGGDTITALAHAGISNEFTYVSTAGGAFLEWLEGKDLPGIVAL 396 (400) Q Consensus 317 WNGP~GvfE~~~F~~GT~~l~~~ia~~~~~~~~~sivGGGdT~aai~~~g~~~~~~hvSTgGGA~L~~L~G~~LPgi~aL 396 (400) |||||||||+++|+.||.++++++++. +++|||+|||||++|++++|+.++|+||||||||+|+||||+.||||++| T Consensus 314 wNGP~GVfE~~~Fa~GT~~v~~aia~~---~~a~SiiGGGdt~aAi~~~G~~d~~shISTGGGAsLe~leGk~LPgv~aL 390 (395) T COG0126 314 WNGPMGVFEFENFAKGTEEVAKAIAKS---SGAFSIIGGGDTAAAIDKLGLADKISHISTGGGASLEFLEGKELPGVEAL 390 (395) T ss_pred EECCCCCEECCHHHHHHHHHHHHHHHC---CCCEEEECCCHHHHHHHHCCCCCCCCEEECCCHHHHHHHCCCCCCHHHHH T ss_conf 828963112540545699999999844---79759978837999999739644675683070788898638976469987 Q ss_pred CCC Q ss_conf 079 Q gi|254780653|r 397 SGH 399 (400) Q Consensus 397 ~~~ 399 (400) ++. T Consensus 391 ~~~ 393 (395) T COG0126 391 EES 393 (395) T ss_pred HHC T ss_conf 631 No 6 >pfam00162 PGK Phosphoglycerate kinase. Probab=100.00 E-value=0 Score=1020.91 Aligned_cols=378 Identities=49% Similarity=0.801 Sum_probs=371.6 Q ss_pred CCHHHHCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 71010015996899982016713185777258999988989999988980999913778889847332247888876431 Q gi|254780653|r 6 TMNDLRDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESI 85 (400) Q Consensus 6 ti~d~~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~ 85 (400) ||+|+ |++|||||||+|||||++||+|+|++||++++|||++|+++||||||+||||||++++++++||+||+++|+++ T Consensus 1 tl~d~-~~~gK~VllRvD~NvPi~~g~I~Dd~RI~~~~pTI~~Ll~~gakvvi~SH~GRPk~~~~~~~SL~~va~~L~~~ 79 (383) T pfam00162 1 TLDDL-DLKGKRVLVRVDFNVPLDDGKITDDTRIRAALPTIKYLLEKGAKVVLLSHLGRPKGGPKDKYSLKPVAKRLSEL 79 (383) T ss_pred CCCCC-CCCCCEEEEEECCCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHH T ss_conf 93114-66899999993257775499189768899999999999978997999956789999989764779999999998 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCHHHHCCCCC----CCHHHHHHHHCCCCCEEEECCHHHHCCCCCCEEEC Q ss_conf 01335456663102556655313646489824243034321----00024665312436424754404401137633302 Q gi|254780653|r 86 LHKNILFVNDCIGSTLSQSIASLSEGGIILAENVRFYSEEE----RNDPDFVRMLSRNGDFYINDAFSVSHRAHASITGL 161 (400) Q Consensus 86 l~~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~----~n~~~f~k~La~laDiyVnDAF~~aHR~haS~~gi 161 (400) |+++|.|++||+|+.+++++++|++|+|+||||+|||+||+ +|+++|+++||++||+|||||||||||+|||++|+ T Consensus 80 L~~~V~f~~d~~g~~~~~~i~~l~~G~I~LLEN~RF~~~E~~~~~~n~~~fak~La~l~DiyVnDAF~~sHR~haS~~gi 159 (383) T pfam00162 80 LGKPVKFVDDCIGPEAEAAIAALKPGEVLLLENVRFYAEEESKAEKNDPEFAKKLASLGDIYVNDAFGTAHRAHASTVGV 159 (383) T ss_pred HCCCCEECCCCCCHHHHHHHHHCCCCCEEEEHHHCCCCCCCCCCCCCCHHHHHHHHHHCCEEECCCHHHHHHCCCCHHHH T ss_conf 66771515777777899999717899699870103452236554113699999997536898505304556516303335 Q ss_pred CCCC-CCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCCEEEECHHHHHHHHHHHCCCCCCCCC Q ss_conf 3321-000332556577765331016678718999607802557889987730688886205789999983014343333 Q gi|254780653|r 162 SHLL-PSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKKVDKLVIGGGMANSFLVAQGMGVGRSLC 240 (400) Q Consensus 162 ~~~l-ps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~D~iiigG~lantfL~A~G~~IG~Sl~ 240 (400) |+++ |||||++||+|+++|++++++|+||+++||||||+||||++|++|++++|+|++||+||||||+|+|++||+|++ T Consensus 160 ~~~l~ps~aG~l~ekEl~~L~~~l~~p~kP~~aIiGGaKisdKi~~l~~L~~k~D~iiigG~mantFL~A~G~~iG~sl~ 239 (383) T pfam00162 160 AKVLPPSAAGFLMEKELEALSKALENPERPFVAILGGAKVSDKIKVIENLLDKVDKLLIGGGMANTFLKAQGYNIGKSLV 239 (383) T ss_pred HHHCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHCCCCCCCCCC T ss_conf 75468232664299999999988528888808997368601379999999985877897728999999984981787457 Q ss_pred CCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHHCCCCEEEEECC Q ss_conf 10000135688998753024554303541134323233331025321201334576520035677764057767999766 Q gi|254780653|r 241 QRDFSDNVHQIAWEAKRSACEIIVPRDVVVAREMKTGIPTQVVSAQSVPLDSIILDVGFKTVEYIKQVIAQARTVMWNGP 320 (400) Q Consensus 241 e~~~~~~ak~i~~~a~~~~~~I~lP~D~~v~~~~~~~~~~~~~~~~~i~~~~~i~DIGp~Ti~~~~~~I~~AktI~WNGP 320 (400) |+++++.|+++++++++++++|+||+||+|+++++.+.++++++++++|++|+++||||+|++.|+++|++|+||||||| T Consensus 240 e~~~i~~a~~il~~a~~~~~~I~lP~D~vv~~~~~~~~~~~~~~~~~i~~~~~ilDIG~~Ti~~~~~~I~~aktI~wNGP 319 (383) T pfam00162 240 EEDLIETAKELLEKAKEKGVKIVLPVDVVVADEFSADAETKVVDVDEIPDGWMGLDIGPKTIELFAEVIKDAKTVVWNGP 319 (383) T ss_pred CHHHHHHHHHHHHHHHHCCCEEECCCEEEEEECCCCCCCEEEEECCCCCCCCEEEECCHHHHHHHHHHHHHCCEEEEECC T ss_conf 86789999999999874388587871267620347898626720445666667886078999999999987899999788 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHCCCCCCCCEEECCHHHHHHHHCCC Q ss_conf 25322743658899999999861114885899748046998998488768646875079989987489 Q gi|254780653|r 321 LGVFEIEPFDRATVEVAHYVAKLTKERRIISIAGGGDTITALAHAGISNEFTYVSTAGGAFLEWLEGK 388 (400) Q Consensus 321 ~GvfE~~~F~~GT~~l~~~ia~~~~~~~~~sivGGGdT~aai~~~g~~~~~~hvSTgGGA~L~~L~G~ 388 (400) ||+||+++|+.||.++++++++ +++|||+|||||++|++++|+.++|+||||||||||+||||+ T Consensus 320 ~GvfE~~~F~~GT~~i~~~ia~----~~~~sivGGGdT~aai~~~g~~~~~~~vSTgGGA~Le~LeGk 383 (383) T pfam00162 320 MGVFEFENFAKGTKAVAKAIAE----SGAFSIVGGGDTAAAVEKAGLADKFSHISTGGGASLELLEGK 383 (383) T ss_pred CCCCCCCCHHHHHHHHHHHHHH----CCCEEEECCHHHHHHHHHCCCCCCCEEEECCHHHHHHHHCCC T ss_conf 2544366264899999999996----799899958089999997598679608967879989997499 No 7 >KOG1367 consensus Probab=100.00 E-value=0 Score=909.84 Aligned_cols=394 Identities=41% Similarity=0.677 Sum_probs=379.9 Q ss_pred CCCCCHHHHCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCE-EEEEECCCCCCCCCCCCCCHHHHHHH Q ss_conf 88771010015996899982016713185777258999988989999988980-99991377888984733224788887 Q gi|254780653|r 3 RLRTMNDLRDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAK-VVIFSHLGRPQSKSDKDCSLFKVVSI 81 (400) Q Consensus 3 ~~~ti~d~~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~ak-vii~SH~GRPkg~~~~~~Sl~~v~~~ 81 (400) ++.||+++ |++|||||+|||||||+++++|+|++||.+++|||+|++++++| |||+||+|||+|.+.++|||+|++.. T Consensus 5 ~klti~~~-dl~GKrVf~RVDfNVPl~d~~Itnn~RI~aalPtIk~~l~~~~k~VvL~SHLGRP~G~~~~kySL~Pva~e 83 (416) T KOG1367 5 SKLTIDNL-DLKGKRVFIRVDFNVPLKDNKITNNQRIVAALPTIKYLLSNGAKSVVLMSHLGRPKGVRNKKYSLAPVAPE 83 (416) T ss_pred CCCCCCCC-CCCCCEEEEEEECCCCCCCCEECCCCEEEECCCHHHHHHHCCCCEEEEHHHCCCCCCCCCCCCCCCCHHHH T ss_conf 32451012-44686799998516634577112551222113178999857984898741058999988865243414999 Q ss_pred HHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCHHHHCCCCCCCHH--------------HHHHHHCCCCCEEEECC Q ss_conf 6431013354566631025566553136464898242430343210002--------------46653124364247544 Q gi|254780653|r 82 AESILHKNILFVNDCIGSTLSQSIASLSEGGIILAENVRFYSEEERNDP--------------DFVRMLSRNGDFYINDA 147 (400) Q Consensus 82 l~~~l~~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~~n~~--------------~f~k~La~laDiyVnDA 147 (400) |+.+|+++|.|.+||+|++++.++++..+|.++||||+|||.+||.+.+ +|.+.|++|+|+||||| T Consensus 84 Lk~lLg~~v~flddCvg~eVe~a~~~p~~G~viLLENlRfy~eEEg~~~~~~~~~~a~~~~v~~fr~~l~~l~DvyVnDA 163 (416) T KOG1367 84 LKSLLGKEVVFLDDCVGPEVEKAVASPAPGSVILLENLRFYVEEEGKGKDDSGKKVADPAKVKEFRASLASLGDVYVNDA 163 (416) T ss_pred HHHHHCCCEEEECCCCCHHHHHHHCCCCCCCEEEEECCEEEHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCEEEECC T ss_conf 99884765363012455789998538899838985032301244147865444523787889999999874115786400 Q ss_pred HHHHCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCCEEEECHHHHHHH Q ss_conf 04401137633302332100033255657776533101667871899960780255788998773068888620578999 Q gi|254780653|r 148 FSVSHRAHASITGLSHLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKKVDKLVIGGGMANSF 227 (400) Q Consensus 148 F~~aHR~haS~~gi~~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~D~iiigG~lantf 227 (400) |||+||+|+|++|+.... +++||||+||++++.+++++|.||+++|+||+|++|||++|+||++|++.++||||||||| T Consensus 164 FGtaHRahsSm~g~~~~~-~~aGfLl~KEL~yf~kalenp~rPFlaIlGGaKVadKIqlIenLldkv~~liigGGMaftF 242 (416) T KOG1367 164 FGTAHRAHSSMVGVGLPQ-SAAGFLLQKELDYFAKALENPVRPFLAILGGAKVADKIQLIENLLDKVNELIIGGGMAFTF 242 (416) T ss_pred CCCCHHCCCCCCCCCCCH-HHHHHHHHHHHHHHHHHHCCCCCCHHHHHCCCHHHHHHHHHHHHHHHCCEEEECCCEEEHH T ss_conf 020000123211468730-2245778878999999870877633544447416667999999986415288748500119 Q ss_pred HHHH-CCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCC-EECCCCCCCCCCCHHHHHH Q ss_conf 9983-0143433331000013568899875302455430354113432323333102532-1201334576520035677 Q gi|254780653|r 228 LVAQ-GMGVGRSLCQRDFSDNVHQIAWEAKRSACEIIVPRDVVVAREMKTGIPTQVVSAQ-SVPLDSIILDVGFKTVEYI 305 (400) Q Consensus 228 L~A~-G~~IG~Sl~e~~~~~~ak~i~~~a~~~~~~I~lP~D~~v~~~~~~~~~~~~~~~~-~i~~~~~i~DIGp~Ti~~~ 305 (400) ++++ |.+||+|++|++..+.+++++++|++++++|+||+||++++.|..+++...++.. .||++||.+||||+|++.| T Consensus 243 lKvl~~~eiG~Sl~de~g~e~v~~l~~kak~~~v~i~lPvDfv~adkf~~da~s~~~ta~~gIp~g~mgLD~GPes~k~f 322 (416) T KOG1367 243 LKVLNGMEIGKSLFDEEGAEIVKDLMEKAKAKGVRILLPVDFVIADKFAEDANSKQVTAEEGIPDGWMGLDIGPESIKMF 322 (416) T ss_pred HHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEEEEECCCCCCCCCCEECCCCCCCCCCCCCCCCHHHHHHH T ss_conf 99857762212332034679999999999875957982000465111067655223112358998744346586789999 Q ss_pred HHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHCCCCCCCCEEECCHHHHHHHH Q ss_conf 76405776799976625322743658899999999861114885899748046998998488768646875079989987 Q gi|254780653|r 306 KQVIAQARTVMWNGPLGVFEIEPFDRATVEVAHYVAKLTKERRIISIAGGGDTITALAHAGISNEFTYVSTAGGAFLEWL 385 (400) Q Consensus 306 ~~~I~~AktI~WNGP~GvfE~~~F~~GT~~l~~~ia~~~~~~~~~sivGGGdT~aai~~~g~~~~~~hvSTgGGA~L~~L 385 (400) .+.|..||||+||||+|+||++.|+.||++++.++++++. .++.||+|||||++|++++|..|+++||||||||+||+| T Consensus 323 a~~v~~aKtIvWNGP~GvfE~~~Fa~GTeal~d~~v~~t~-~G~~tiiGGGDTata~~k~g~~dk~ShVSTGGGasLeLL 401 (416) T KOG1367 323 AEAVATAKTIVWNGPPGVFEFEKFAAGTEALMDALVKLTG-KGVTTIIGGGDTATACKKFGTEDKVSHVSTGGGASLELL 401 (416) T ss_pred HHHHHHHHEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHC-CCCEEEECCCCHHHHHHHHCCCCCEEEEECCCCEEHHHH T ss_conf 9997665078853897510046542047999999998744-885799768737999998486440246604885551542 Q ss_pred CCCCCHHHHHHCCC Q ss_conf 48998689884079 Q gi|254780653|r 386 EGKDLPGIVALSGH 399 (400) Q Consensus 386 ~G~~LPgi~aL~~~ 399 (400) ||+.|||+.+|+.. T Consensus 402 eGK~LPGv~aLs~~ 415 (416) T KOG1367 402 EGKVLPGVDALSEA 415 (416) T ss_pred CCCCCCCHHHHCCC T ss_conf 69867666651359 No 8 >PRK09206 pyruvate kinase; Provisional Probab=95.86 E-value=0.17 Score=30.06 Aligned_cols=308 Identities=14% Similarity=0.182 Sum_probs=146.3 Q ss_pred HHHHHHHHHHCCCEEEE--EECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCC-EEECCH Q ss_conf 88989999988980999--913778889847332247888876431013354566631025566553136464-898242 Q gi|254780653|r 42 VVPTILELVEKKAKVVI--FSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGSTLSQSIASLSEGG-IILAEN 118 (400) Q Consensus 42 ~~pTI~~l~~~~akvii--~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~~~i~~l~~g~-ilLLEN 118 (400) ...+|+.|+++|..+.= +||- -+ +.....-...+.+++.++++|...-|.-|++.+. .++++|+ +.|-++ T Consensus 16 ~~e~L~~li~aG~~v~RiN~SHg-~~----e~h~~~i~~iR~~~~~~~~~v~Il~Dl~GPkIR~--g~l~~~~~i~l~~G 88 (470) T PRK09206 16 SEEMLTKLLDAGMNVMRLNFSHG-DY----AEHGQRIQNLRNVMSKTGKKAAILLDTKGPEIRT--MKLEGGNDVSLKAG 88 (470) T ss_pred CHHHHHHHHHCCCCEEEEECCCC-CH----HHHHHHHHHHHHHHHHCCCCCEEEEECCCCCEEE--EECCCCCEEEECCC T ss_conf 99999999986998999979999-99----9999999999999997399827999788980489--85379973586289 Q ss_pred HHH--CCCCC---------CCHHHHHHHHCCCCCEEEECCHHHH-------------------CCCCCCE--EECCCCCC Q ss_conf 430--34321---------0002466531243642475440440-------------------1137633--30233210 Q gi|254780653|r 119 VRF--YSEEE---------RNDPDFVRMLSRNGDFYINDAFSVS-------------------HRAHASI--TGLSHLLP 166 (400) Q Consensus 119 ~Rf--~~~E~---------~n~~~f~k~La~laDiyVnDAF~~a-------------------HR~haS~--~gi~~~lp 166 (400) -+| ..++. -+.++|.+.+..--.+|+.|..-.- =+++-.+ -|...-+| T Consensus 89 ~~~~l~~~~~~~g~~~~i~v~~~~l~~~v~~Gd~I~idDG~i~l~V~~v~~~~i~~~V~~gG~L~s~KgvnlP~~~~~lp 168 (470) T PRK09206 89 QTFTFTTDKSVVGNNEIVAVTYEGFTTDLSVGNTVLVDDGLIGMEVTAITGNEVICKVLNNGDLGENKGVNLPGVSIALP 168 (470) T ss_pred CEEEEECCCCCCCCCCEEEECCHHHHHHCCCCCEEEEECCEEEEEEEEEECCEEEEEEEECCEECCCCCCCCCCCCCCCC T ss_conf 88999568764787778974635658557889989984896899999970997999997483854886423344457867 Q ss_pred C------------------CCCHHH---HHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHH--HHHHHHCCCEEEECHHH Q ss_conf 0------------------033255---657776533101667871899960780255788--99877306888862057 Q gi|254780653|r 167 S------------------YIGRAM---QKELSMLESCFSESKKPLVAIVGGSKVSTKITL--LINLVKKVDKLVIGGGM 223 (400) Q Consensus 167 s------------------~aG~l~---ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~--i~~L~~k~D~iiigG~l 223 (400) + |.+..| .+.+..+.+++.....+-+-|+ |||+++-.+ |+..++.+|.|++.=| T Consensus 169 ~ltekD~~di~f~~~~~vD~IalSFVrsa~DV~~lr~~l~~~~~~~~~II--AKIE~~~av~NldeIi~~sDgIMIARG- 245 (470) T PRK09206 169 ALAEKDKQDLIFGCEQGVDFVAASFIRKRSDVLEIREHLKAHGGENIQII--SKIENQEGLNNFDEILEASDGIMVARG- 245 (470) T ss_pred CCCHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHHCCCCCEEE--EEECCHHHHHHHHHHHHHCCEEEEECC- T ss_conf 88988999999999859999998587987889999999997269973499--984688888739999986779999358- Q ss_pred HHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCC--CCCCCC-EECCC-CCCC---- Q ss_conf 899999830143433331000013568899875302455430354113432323333--102532-12013-3457---- Q gi|254780653|r 224 ANSFLVAQGMGVGRSLCQRDFSDNVHQIAWEAKRSACEIIVPRDVVVAREMKTGIPT--QVVSAQ-SVPLD-SIIL---- 295 (400) Q Consensus 224 antfL~A~G~~IG~Sl~e~~~~~~ak~i~~~a~~~~~~I~lP~D~~v~~~~~~~~~~--~~~~~~-~i~~~-~~i~---- 295 (400) -.|.+++- ++.--.-|.|+.++.+.+.+.++-+...-+= ..+..|+ ++.++. .+-++ +-+| T Consensus 246 ------DLgvei~~----e~vp~~Qk~II~~c~~~gKPvivATqmLeSM-i~~p~PTRAEvsDVaNAV~dG~DavMLS~E 314 (470) T PRK09206 246 ------DLGVEIPV----EEVIFAQKMMIEKCNRARKVVITATQMLDSM-IKNPRPTRAEAGDVANAILDGTDAVMLSGE 314 (470) T ss_pred ------CCCCCCCH----HHHHHHHHHHHHHHHHCCCCEEEEECCHHHH-CCCCCCCHHHHHHHHHHHHHCCCEEEECCC T ss_conf ------65331688----8858999999999997399889993777874-489998657687899999847678997463 Q ss_pred ----CCCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEE---CCCHHHHHHHHCCCC Q ss_conf ----6520035677764057767999766253227436588999999998611148858997---480469989984887 Q gi|254780653|r 296 ----DVGFKTVEYIKQVIAQARTVMWNGPLGVFEIEPFDRATVEVAHYVAKLTKERRIISIA---GGGDTITALAHAGIS 368 (400) Q Consensus 296 ----DIGp~Ti~~~~~~I~~AktI~WNGP~GvfE~~~F~~GT~~l~~~ia~~~~~~~~~siv---GGGdT~aai~~~g~~ 368 (400) .--.++++.++.++.++...+|.- ........-..-+.+++.+........++..|+ -.|.|+..+.++--. T Consensus 315 TA~G~yPv~~V~~m~~I~~~aE~~~~~~-~~~~~~~~~~~~~~aia~~a~~~a~~~~a~aIv~~T~sG~ta~~is~~RP~ 393 (470) T PRK09206 315 SAKGKYPLEAVTIMATICERTDRVMNSR-LESNNDNRKLRITEAVCRGAVETAEKLDAPLIVVATQGGKSARSVRKYFPD 393 (470) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHH-HHHCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHCCCC T ss_conf 2477787999999999999998777653-100135777888999999999999865998999985873899999703999 Q ss_pred CCC Q ss_conf 686 Q gi|254780653|r 369 NEF 371 (400) Q Consensus 369 ~~~ 371 (400) -.+ T Consensus 394 ~pI 396 (470) T PRK09206 394 ATI 396 (470) T ss_pred CCE T ss_conf 988 No 9 >PRK05826 pyruvate kinase; Provisional Probab=93.44 E-value=0.7 Score=26.11 Aligned_cols=308 Identities=15% Similarity=0.215 Sum_probs=155.4 Q ss_pred HHHHHHHHHHCCCEEEE--EECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCHH Q ss_conf 88989999988980999--9137788898473322478888764310133545666310255665531364648982424 Q gi|254780653|r 42 VVPTILELVEKKAKVVI--FSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGSTLSQSIASLSEGGIILAENV 119 (400) Q Consensus 42 ~~pTI~~l~~~~akvii--~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~ 119 (400) ...+|+.|+++|..+.= +||-.. +.....-...+.+++-++++|...-|.-|++.+. ..++++.+.|-++- T Consensus 18 ~~e~l~~li~aG~~v~RiN~SHg~~-----e~~~~~i~~iR~~~~~~~~~i~Il~Dl~GpkIRt--g~~~~~~i~l~~G~ 90 (461) T PRK05826 18 SPENLEKLIEAGVNVVRLNFSHGSH-----EEHGKRAALVREIAAKLGRPVAILLDLKGPKIRV--GKFKDGKVELKIGD 90 (461) T ss_pred CHHHHHHHHHCCCCEEEEECCCCCH-----HHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEE--EECCCCCEEEECCC T ss_conf 9899999998799999998899999-----9999999999999997399706999888983689--86479838970599 Q ss_pred H--HCCC-CC--------CCHHHHHHHHCCCCCEEEECCHHH---------------------HCCCCCCEEECCCCCCC Q ss_conf 3--0343-21--------000246653124364247544044---------------------01137633302332100 Q gi|254780653|r 120 R--FYSE-EE--------RNDPDFVRMLSRNGDFYINDAFSV---------------------SHRAHASITGLSHLLPS 167 (400) Q Consensus 120 R--f~~~-E~--------~n~~~f~k~La~laDiyVnDAF~~---------------------aHR~haS~~gi~~~lps 167 (400) + |... +. -+.++|.+.+..--.+|+.|..=. .-|..-.+-+..--+|+ T Consensus 91 ~v~i~~~~~~~g~~~~i~v~~~~l~~~v~~Gd~I~idDG~i~l~V~~~~~~~i~~~V~~~G~L~s~Kgvnlp~~~~~lp~ 170 (461) T PRK05826 91 KFTLDTDQKEEGDQERVGLDYKDLPKDVKPGDILLLDDGKLQLKVVEVDGDEVETEVLNGGPLSNNKGINIPGGGLSLPA 170 (461) T ss_pred EEEEEECCCCCCCCCEEEECCHHHHHHCCCCCEEEEECCEEEEEEEECCCCEEEEEECCCEEEECCCCCCCCCCCCCCCC T ss_conf 89998278778976889843078686578899899827859999996169869999746809846877542344357677 Q ss_pred ------------------CCCHHH---HHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHH--HHHHHHCCCEEEEC-HHH Q ss_conf ------------------033255---657776533101667871899960780255788--99877306888862-057 Q gi|254780653|r 168 ------------------YIGRAM---QKELSMLESCFSESKKPLVAIVGGSKVSTKITL--LINLVKKVDKLVIG-GGM 223 (400) Q Consensus 168 ------------------~aG~l~---ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~--i~~L~~k~D~iiig-G~l 223 (400) +.+..| .+.+..+.+++.....+-+-|+ |||+++-.+ |+..++.+|.|++. |- T Consensus 171 lTekD~~di~fa~~~~vD~ialSFVrsa~DV~~lr~~l~~~~~~~~~II--aKIE~~~al~NldeIi~~sDgIMIARGD- 247 (461) T PRK05826 171 LTEKDKADIKFAAEQGVDYIAVSFVRNAEDVEEARRLLREAGNCDAKII--AKIERAEAVDNLDEIIEASDGIMVARGD- 247 (461) T ss_pred CCHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHCCCCCCEEE--EEECCHHHHHHHHHHHHHCCEEEEECCC- T ss_conf 9866899999999769999974688986789999999997299984489--9836889987399999865899997785- Q ss_pred HHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCC--CCCCCCCE-ECCC-CCCC---- Q ss_conf 89999983014343333100001356889987530245543035411343232333--31025321-2013-3457---- Q gi|254780653|r 224 ANSFLVAQGMGVGRSLCQRDFSDNVHQIAWEAKRSACEIIVPRDVVVAREMKTGIP--TQVVSAQS-VPLD-SIIL---- 295 (400) Q Consensus 224 antfL~A~G~~IG~Sl~e~~~~~~ak~i~~~a~~~~~~I~lP~D~~v~~~~~~~~~--~~~~~~~~-i~~~-~~i~---- 295 (400) .|.+++- ++.-..-|.|+.++.+.+.++++-+...-+= ..+..| .++.++.. +-++ +-++ T Consensus 248 -------LgvEi~~----e~vp~~Qk~Ii~~c~~~gkpvivATqmLeSM-~~~p~PTRAEv~DVanAv~dG~DavmLS~E 315 (461) T PRK05826 248 -------LGVEIPD----AEVPGLQKKIIRKAREAGKPVITATQMLESM-IENPRPTRAEVSDVANAVLDGTDAVMLSGE 315 (461) T ss_pred -------CCCCCCH----HHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-CCCCCCCHHHHHHHHHHHHHCCCEEEECHH T ss_conf -------3032687----6749999999998875499899976568976-079999727787899999836878997432 Q ss_pred ----CCCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCC-CCEEEE---CCCHHHHHHHHCCC Q ss_conf ----6520035677764057767999766253227436588999999998611148-858997---48046998998488 Q gi|254780653|r 296 ----DVGFKTVEYIKQVIAQARTVMWNGPLGVFEIEPFDRATVEVAHYVAKLTKER-RIISIA---GGGDTITALAHAGI 367 (400) Q Consensus 296 ----DIGp~Ti~~~~~~I~~AktI~WNGP~GvfE~~~F~~GT~~l~~~ia~~~~~~-~~~siv---GGGdT~aai~~~g~ 367 (400) .--.++++.++.++.+|..-+|+---.....+....-+..++...+...... ++-.|+ --|.|+..+.++-- T Consensus 316 TA~G~yPv~aV~~m~~I~~~aE~~~~~~~~~~~~~~~~~~~~~~~~~~av~~a~~~~~a~aIv~~T~sG~ta~~is~~RP 395 (461) T PRK05826 316 TAAGKYPVEAVEAMARICKGAEKEFLNVSKHRLDSTFFDRIDEAIAMSAMYAANHLMGVKAIVALTESGRTARLMSRFRP 395 (461) T ss_pred HCCCCCHHHHHHHHHHHHHHHHHCHHHHCCHHCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHCC T ss_conf 23787889999999999999972323310000012458981999999999998612498789998798189999995497 Q ss_pred CCCC Q ss_conf 7686 Q gi|254780653|r 368 SNEF 371 (400) Q Consensus 368 ~~~~ 371 (400) .-.+ T Consensus 396 ~~pI 399 (461) T PRK05826 396 GVPI 399 (461) T ss_pred CCCE T ss_conf 9989 No 10 >PRK03803 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=89.19 E-value=1.9 Score=23.27 Aligned_cols=60 Identities=25% Similarity=0.447 Sum_probs=36.9 Q ss_pred CCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCC-CEEEECHHHHHHHHHHH Q ss_conf 3325565777653310166787189996078025578899877306-88886205789999983 Q gi|254780653|r 169 IGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKKV-DKLVIGGGMANSFLVAQ 231 (400) Q Consensus 169 aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~-D~iiigG~lantfL~A~ 231 (400) .|.-+..|++-+.+. .+.|.++|-|---=+|.-.++.++++.. -+..+||-++..++... T Consensus 92 ~~i~i~~e~el~~~~---~~~~~IaVTGTnGKTTTtsli~~iL~~~g~~~~~gGNiG~p~~~~~ 152 (448) T PRK03803 92 AGIKISGDIELFARA---AKAPIVAITGSNAKSTVTTLVGEMAKAAGKRVAVGGNLGTPALDLL 152 (448) T ss_pred CCCCEECHHHHHHHH---CCCCEEEECCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHH T ss_conf 599683199998621---7898899858998388999999999865983799403276136652 No 11 >COG2511 GatE Archaeal Glu-tRNAGln amidotransferase subunit E (contains GAD domain) [Translation, ribosomal structure and biogenesis] Probab=88.69 E-value=1.4 Score=24.07 Aligned_cols=192 Identities=18% Similarity=0.267 Sum_probs=99.3 Q ss_pred CCCEEEEEEE-CCCCCC----CCEECCHHHHHHHHHHHHHHHHCCCEEE----------EEEC-CC-CCCCCCCCC---- Q ss_conf 9968999820-167131----8577725899998898999998898099----------9913-77-888984733---- Q gi|254780653|r 14 RGLRCLLRVD-WNVPFI----DGKVADVTRIERVVPTILELVEKKAKVV----------IFSH-LG-RPQSKSDKD---- 72 (400) Q Consensus 14 ~gk~VlvRvD-~NvPi~----~g~I~d~~RI~~~~pTI~~l~~~~akvi----------i~SH-~G-RPkg~~~~~---- 72 (400) .++.|.-|+| +-.|+. .--|.+..--+.+..-|-+++..-.|+- =+|- -| |.+-|...+ T Consensus 170 ~~~~VvYsLDRLGIPLiEIsT~PdI~sp~~A~evA~~IG~~lrsTGkvKrglgTIRQDvNvSI~~GarvEIKGVq~LdlI 249 (631) T COG2511 170 KGDTVVYSLDRLGIPLIEISTEPDIRSPEQAREVAERIGYILRSTGKVKRGLGTIRQDVNVSIKGGARVEIKGVQDLDLI 249 (631) T ss_pred CCCEEEEEECCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCEEEECCCCHHHHH T ss_conf 58738997213687448842688889979999999998799998355123676400000255338716886274056667 Q ss_pred -----------CCHHHHHHHHHHH----CCCCCCCCCCCCHHHHHHHH-HHCCCCCEEECCHHHHCCC----CCCCHHHH Q ss_conf -----------2247888876431----01335456663102556655-3136464898242430343----21000246 Q gi|254780653|r 73 -----------CSLFKVVSIAESI----LHKNILFVNDCIGSTLSQSI-ASLSEGGIILAENVRFYSE----EERNDPDF 132 (400) Q Consensus 73 -----------~Sl~~v~~~l~~~----l~~~V~fi~d~~g~~~~~~i-~~l~~g~ilLLEN~Rf~~~----E~~n~~~f 132 (400) .+|..+.+.|.+. ...++.-+.|.+-.+-...| +.++.|.-++-=-++=|.| |..+.--| T Consensus 250 ~~vv~yEv~RQ~nLlkirdeL~~RG~~~~~~~~~DVTdlF~~TkSkiI~~~l~~gG~V~A~~l~~f~GlvG~EiqpGrRl 329 (631) T COG2511 250 PDVVEYEVERQLNLLKIRDELKERGGADVSEEVFDVTDLFANTKSKIIRSGLKKGGKVYAVLLPGFAGLVGREIQPGRRL 329 (631) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHCCCCHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCC T ss_conf 89999999999889999999997355300144452898865573188887542487699997477565223002577410 Q ss_pred HHHHCCCCCEEEECCHHHHCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHH Q ss_conf 65312436424754404401137633302332100033255657776533101667871899960780255788998773 Q gi|254780653|r 133 VRMLSRNGDFYINDAFSVSHRAHASITGLSHLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVK 212 (400) Q Consensus 133 ~k~La~laDiyVnDAF~~aHR~haS~~gi~~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~ 212 (400) ...+|.++-.| -.=|.-|-+ -||.|- +-|.|++.+...+....--.++++.|-- +.--..++..++ T Consensus 330 GtE~AD~ar~~--GvgGlfHsD---------ELP~YG--ITeeEV~~v~~~lga~~~Davvlva~~~-~~~~~a~~~V~~ 395 (631) T COG2511 330 GTEFADYARAY--GVGGLFHSD---------ELPGYG--ITEEEVEAVRDALGAGEEDAVVLVAGEE-ERAKRALEAVIE 395 (631) T ss_pred CHHHHHHHHHH--CCCCCCCCC---------CCCCCC--CCHHHHHHHHHHHCCCCCCEEEEEECCH-HHHHHHHHHHHH T ss_conf 35789999982--777510056---------675678--7999999999984799777489970778-899999999999 Q ss_pred CCCEEEE Q ss_conf 0688886 Q gi|254780653|r 213 KVDKLVI 219 (400) Q Consensus 213 k~D~iii 219 (400) ++...+. T Consensus 396 Ra~~ai~ 402 (631) T COG2511 396 RAKEAIE 402 (631) T ss_pred HHHHHHC T ss_conf 9999871 No 12 >PRK08125 bifunctional UDP-glucuronic acid decarboxylase/UDP-4-amino-4-deoxy-L-arabinose formyltransferase; Validated Probab=87.73 E-value=2.4 Score=22.66 Aligned_cols=64 Identities=20% Similarity=0.195 Sum_probs=41.4 Q ss_pred HHHHHHHHHCCCEEE-EEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCC Q ss_conf 898999998898099-991377888984733224788887643101335456663102556655313646 Q gi|254780653|r 43 VPTILELVEKKAKVV-IFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGSTLSQSIASLSEG 111 (400) Q Consensus 43 ~pTI~~l~~~~akvi-i~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~~~i~~l~~g 111 (400) ..+++.|+++|..|+ +.+|-.||..+ -...|.+ +.-. ..|.||....+.-.++..+.+++++|. T Consensus 13 ~~~L~aLleaG~eV~aVvThpD~Pgek-~~~~sVk---~~A~-e~gIPV~qP~~i~~pe~ie~L~~l~PD 77 (660) T PRK08125 13 CVGIEALLEAGYEIAAVFTHTDNPGEN-TFFGSVA---RLAA-ELGIPVYAPEDVNHPLWVERIAELAPD 77 (660) T ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCC-CCCCCHH---HHHH-HCCCCEECCCCCCCHHHHHHHHHCCCC T ss_conf 999999997899169998679999877-6888499---9999-749967647889988999999844999 No 13 >PRK00005 fmt methionyl-tRNA formyltransferase; Reviewed Probab=86.22 E-value=2.9 Score=22.13 Aligned_cols=69 Identities=20% Similarity=0.262 Sum_probs=39.5 Q ss_pred HHHHHHHHHHHHCCCEEE-EEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCC Q ss_conf 998898999998898099-99137788898473322478888764310133545666310255665531364 Q gi|254780653|r 40 ERVVPTILELVEKKAKVV-IFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGSTLSQSIASLSE 110 (400) Q Consensus 40 ~~~~pTI~~l~~~~akvi-i~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~~~i~~l~~ 110 (400) .-++++++.|++.+..++ ++|+-.||.|... +....|+..+.++ .|.+|....+.-.++..+.++++++ T Consensus 10 ~fs~~~L~~L~~~~~~i~~Vvt~pdk~~gR~~-k~~~~~v~~~a~~-~~i~~~~p~~~~~~~~~~~l~~~~~ 79 (309) T PRK00005 10 EFAVPSLKALLESGHEVVAVVTQPDRPAGRGK-KLTPSPVKQLALE-HGIPVLQPESLRDPEFLAELAALNA 79 (309) T ss_pred HHHHHHHHHHHHCCCCEEEEEECCCCCCCCCC-CCCCCHHHHHHHH-CCCCEEECCCCCCHHHHHHHHHCCC T ss_conf 89999999999789947999929998668999-7989989999998-6993883577999999999984199 No 14 >PRK00683 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=83.41 E-value=3.8 Score=21.33 Aligned_cols=62 Identities=15% Similarity=0.194 Sum_probs=33.8 Q ss_pred CHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCC-EEEECHHHHHHHHHHH Q ss_conf 3255657776533101667871899960780255788998773068-8886205789999983 Q gi|254780653|r 170 GRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKKVD-KLVIGGGMANSFLVAQ 231 (400) Q Consensus 170 G~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~D-~iiigG~lantfL~A~ 231 (400) |.-+-.|++-..+...-.+.|.++|-|--==+|.-.++.++++++. ..++||-++..++.+. T Consensus 83 ~i~i~seiel~~~~~~~~~~~~IaITGTNGKTTTt~ll~~iL~~~g~~~~~~GNIG~p~l~~~ 145 (418) T PRK00683 83 KIPVVTDIQLAFQTPEFQRYPSLGITGSTGKTTTILFLEHLLRTLGIPAFAMGNIGLPILDGM 145 (418) T ss_pred CCCCCCHHHHHHHHHHHCCCCEEEEECCCCCEEHHHHHHHHHHHCCCCCEEEECCCHHHHHHH T ss_conf 997540799997424414798799976898660999999999867998188803465777643 No 15 >PRK00192 mannosyl-3-phosphoglycerate phosphatase; Reviewed Probab=83.29 E-value=1.8 Score=23.43 Aligned_cols=53 Identities=23% Similarity=0.266 Sum_probs=35.3 Q ss_pred EEEEEEECCCCCCCCEECCH-HHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHH Q ss_conf 89998201671318577725-8999988989999988980999913778889847332247888876 Q gi|254780653|r 17 RCLLRVDWNVPFIDGKVADV-TRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIA 82 (400) Q Consensus 17 ~VlvRvD~NvPi~~g~I~d~-~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l 82 (400) +-||=.| .||+++|+ .++.....+|+.|.++|-++|+.| |||- .+..++.+.| T Consensus 6 ~~LI~tD-----lDGTLL~~~~~~~~~~~ai~~l~~~Gi~vviaT--GR~~------~~~~~~~~~l 59 (275) T PRK00192 6 KLLVFTD-----LDGTLLDHTYDWEPAKPTLKALKEKGIPVIPCT--SKTA------AEVEVLRAEL 59 (275) T ss_pred CEEEEEE-----CCCCCCCCCCCCCHHHHHHHHHHHCCCEEEEEC--CCCH------HHHHHHHHHH T ss_conf 4699994-----857784899983689999999997899999968--9889------9999999995 No 16 >COG0223 Fmt Methionyl-tRNA formyltransferase [Translation, ribosomal structure and biogenesis] Probab=82.91 E-value=4 Score=21.20 Aligned_cols=77 Identities=22% Similarity=0.407 Sum_probs=50.5 Q ss_pred HHHHHHHHHHHCCCEEE-EEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCHH Q ss_conf 98898999998898099-99137788898473322478888764310133545666310255665531364648982424 Q gi|254780653|r 41 RVVPTILELVEKKAKVV-IFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGSTLSQSIASLSEGGIILAENV 119 (400) Q Consensus 41 ~~~pTI~~l~~~~akvi-i~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~ 119 (400) -+.|+++.|++.+-.|+ +.+|-.||.|. ..+...-|+...-.++ |+ T Consensus 12 fa~~~L~~L~~~~~eivaV~Tqpdkp~gR-~~~l~~spVk~~A~~~--------------------------------~i 58 (307) T COG0223 12 FAVPSLEALIEAGHEIVAVVTQPDKPAGR-GKKLTPSPVKRLALEL--------------------------------GI 58 (307) T ss_pred HHHHHHHHHHHCCCCEEEEEECCCCCCCC-CCCCCCCHHHHHHHHC--------------------------------CC T ss_conf 54999999982898048999489976678-7858888389999974--------------------------------99 Q ss_pred HHCCCCCCCHHHHHHHHCCC-CCEEEECCHHH Q ss_conf 30343210002466531243-64247544044 Q gi|254780653|r 120 RFYSEEERNDPDFVRMLSRN-GDFYINDAFSV 150 (400) Q Consensus 120 Rf~~~E~~n~~~f~k~La~l-aDiyVnDAF~~ 150 (400) -+|.-|.-+++++...++.+ +|+.|-=|||- T Consensus 59 pv~qP~~l~~~e~~~~l~~l~~D~ivvvayG~ 90 (307) T COG0223 59 PVFQPEKLNDPEFLEELAALDPDLIVVVAYGQ 90 (307) T ss_pred CEECCCCCCCHHHHHHHHCCCCCEEEEEEHHH T ss_conf 55664447868999998603999999974222 No 17 >TIGR01825 gly_Cac_T_rel pyridoxal phosphate-dependent acyltransferase, putative; InterPro: IPR010962 Pyridoxal phosphate is the active form of vitamin B6 (pyridoxine or pyridoxal). PLP is a versatile catalyst, acting as a coenzyme in a multitude of reactions, including decarboxylation, deamination and transamination , , . PLP-dependent enzymes are primarily involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and in the synthesis or catabolism of neurotransmitters; pyridoxal phosphate can also inhibit DNA polymerases and several steroid receptors . Inadequate levels of pyridoxal phosphate in the brain can cause neurological dysfunction, particularly epilepsy . PLP enzymes exist in their resting state as a Schiff base, the aldehyde group of PLP forming a linkage with the epsilon-amino group of an active site lysine residue on the enzyme. The alpha-amino group of the substrate displaces the lysine epsilon-amino group, in the process forming a new aldimine with the substrate. This aldimine is the common central intermediate for all PLP-catalyzed reactions, enzymatic and non-enzymatic . This entry contains a number of enzyme families: Serine palmitoyltransferase Glycine C-acetyltransferase (2-amino-3-ketobutyrate coenzyme A ligase) 5-aminolevulinic acid synthase 8-amino-7-oxononanoate synthase All transfer the R-group (acetyl, succinyl, or 6-carboxyhexanoyl) from coenzyme A to an amino acid (Gly, Gly, Ala, respectively), with release of CO2 for the latter two reactions.; GO: 0016740 transferase activity, 0030170 pyridoxal phosphate binding. Probab=81.30 E-value=1 Score=25.00 Aligned_cols=79 Identities=19% Similarity=0.199 Sum_probs=38.9 Q ss_pred HHHCCCCCCCEEEEEEECCHHH----HHHHHHHHHHC---CCEEEECHHHHHH-HHHH----HCCCCCCCC--------C Q ss_conf 3310166787189996078025----57889987730---6888862057899-9998----301434333--------3 Q gi|254780653|r 181 ESCFSESKKPLVAIVGGSKVST----KITLLINLVKK---VDKLVIGGGMANS-FLVA----QGMGVGRSL--------C 240 (400) Q Consensus 181 ~~~l~~~~~P~v~IiGGaKisd----Ki~~i~~L~~k---~D~iiigG~lant-fL~A----~G~~IG~Sl--------~ 240 (400) -..|.+..||+++ .-|.-+- -+..++.|.+- ++++|= || ||++ .||+.|+|- . T Consensus 255 idyL~~~aRPFLF--Sta~~Pa~~~A~~~Av~~l~~~~~l~~kLW~-----Nt~ylk~~L~~LGydl~~S~tPItPv~~G 327 (392) T TIGR01825 255 IDYLKARARPFLF--STALPPAVVAALIEAVDVLQESPELMEKLWD-----NTRYLKAGLSKLGYDLGESETPITPVLIG 327 (392) T ss_pred HHHHHHCCCCCCC--CCCCCHHHHHHHHHHHHHHCCCCCHHHHHHH-----HHHHHHHHHHHHCCCCCCCCCCCCCEEEC T ss_conf 9999862898601--2578807899999999987079314777678-----89999987676164114766861245876 Q ss_pred CCC-CCHHHHHHHHHHHHCCCCCCCEE Q ss_conf 100-00135688998753024554303 Q gi|254780653|r 241 QRD-FSDNVHQIAWEAKRSACEIIVPR 266 (400) Q Consensus 241 e~~-~~~~ak~i~~~a~~~~~~I~lP~ 266 (400) |+. --+..++|++++-=+-.-|.+|+ T Consensus 328 de~~A~efS~rL~~~yGvFa~sivFPT 354 (392) T TIGR01825 328 DEKVAQEFSRRLLDEYGVFAQSIVFPT 354 (392) T ss_pred CCHHHHHHHHHHHHHCCEEEEEEECCC T ss_conf 515689999987775290899842165 No 18 >PRK06988 putative formyltransferase; Provisional Probab=79.51 E-value=5.3 Score=20.45 Aligned_cols=66 Identities=15% Similarity=0.166 Sum_probs=38.0 Q ss_pred HHHHHHHHHHHHCCCEEE-EEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCC Q ss_conf 998898999998898099-99137788898473322478888764310133545666310255665531364 Q gi|254780653|r 40 ERVVPTILELVEKKAKVV-IFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGSTLSQSIASLSE 110 (400) Q Consensus 40 ~~~~pTI~~l~~~~akvi-i~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~~~i~~l~~ 110 (400) +-++|+++.|++.|..|+ ++|+-.||.++ ....|+.++-.+ +|.+|....+.-.++..+.++++++ T Consensus 12 ~fav~~L~~L~~~~~~v~~VvTqpD~p~~~----~~~~~Vk~~A~~-~~ipv~~p~~~~~~e~~~~l~~~~~ 78 (313) T PRK06988 12 NVGVRCLQVLLARGVDVALVVTHEDNPTEN----IWFGSVAQVAAE-HGIPVITPADPNDPELRAAVAAAAP 78 (313) T ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCCCC----CCCCHHHHHHHH-CCCCEECCCCCCCHHHHHHHHHCCC T ss_conf 899999999997899679998998997668----899829999998-6995877898999999999985399 No 19 >PRK02006 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=76.89 E-value=6.3 Score=19.96 Aligned_cols=59 Identities=20% Similarity=0.333 Sum_probs=28.6 Q ss_pred CHHHHHHHHHHHHHCCC------CCCCEEEEEEECCHHHHHHHHHHHHHCC-CEEEECHHHHHHHH Q ss_conf 32556577765331016------6787189996078025578899877306-88886205789999 Q gi|254780653|r 170 GRAMQKELSMLESCFSE------SKKPLVAIVGGSKVSTKITLLINLVKKV-DKLVIGGGMANSFL 228 (400) Q Consensus 170 G~l~ekEi~~L~~~l~~------~~~P~v~IiGGaKisdKi~~i~~L~~k~-D~iiigG~lantfL 228 (400) |.-+..|++-+.+.+.. ...+.++|=|-.-=+|.-.++.++++.+ -...+||-++..++ T Consensus 97 gi~i~~eiel~~~~~~~~~~~~~~~~~iIaVTGTnGKTTTt~li~~iL~~~g~~~~~~GNIG~p~l 162 (501) T PRK02006 97 GIPVWGELELFAQALAALGASRGYAPKVLAITGTNGKTTTTSLTGLLCERAGKKVAVAGNISPAAL 162 (501) T ss_pred CCCEEEHHHHHHHHHHHCCCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHCCCCEEEECCCCCHHH T ss_conf 995876899999887630222356874899938996687999999999976997465256461026 No 20 >PRK00771 signal recognition particle protein Srp54; Provisional Probab=76.87 E-value=6.3 Score=19.95 Aligned_cols=191 Identities=17% Similarity=0.255 Sum_probs=116.9 Q ss_pred CEEEEEEECCCCCCC------------CEEC---CH--HHHHHHHHHHHHHHHC------CCEEEEE-ECCCCCCCCCCC Q ss_conf 689998201671318------------5777---25--8999988989999988------9809999-137788898473 Q gi|254780653|r 16 LRCLLRVDWNVPFID------------GKVA---DV--TRIERVVPTILELVEK------KAKVVIF-SHLGRPQSKSDK 71 (400) Q Consensus 16 k~VlvRvD~NvPi~~------------g~I~---d~--~RI~~~~pTI~~l~~~------~akvii~-SH~GRPkg~~~~ 71 (400) ++.|+..|.|+++.. .++. +. .-++.....+-.++.. ...||++ .-||- | + T Consensus 36 r~ALLeADV~~~vvk~f~~~vk~k~~g~~v~~~~~p~q~iikiv~~eL~~llg~~~~~~~kP~Vim~vGlqGs--G---K 110 (433) T PRK00771 36 QRALLQADVNVKLVMNLSKEIERRALEEEPPKGMSKKEHIIKIVYEELVKLLGEEAEILLKPQTILLVGLQGS--G---K 110 (433) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCC--C---H T ss_conf 9999965678799999999999997146245789989999999999999984967656689858999737889--7---8 Q ss_pred CCCHHHHHHHHHHHCCCCCCCCC-CCCHHHHHHHHHHCCCCCEEECCHHHHCCCCCCCHHH-HHH---HHCCCCCEEEEC Q ss_conf 32247888876431013354566-6310255665531364648982424303432100024-665---312436424754 Q gi|254780653|r 72 DCSLFKVVSIAESILHKNILFVN-DCIGSTLSQSIASLSEGGIILAENVRFYSEEERNDPD-FVR---MLSRNGDFYIND 146 (400) Q Consensus 72 ~~Sl~~v~~~l~~~l~~~V~fi~-d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~~n~~~-f~k---~La~laDiyVnD 146 (400) ..|....|.|+++. |++|..+. |++-+.+.++++.+... -|+.||..+...||. .++ .-+.-.|+-+-| T Consensus 111 TTT~aKLA~~~kk~-g~kv~lvaaDt~RpaA~eQL~~la~~-----~~v~~~~~~~~~dp~~i~~~a~~~~k~~DvviiD 184 (433) T PRK00771 111 TTTAAKLARYFQKK-GLKVGVICADTWRPGAYEQLKQLCEK-----INVPFYGDPKEKDAVKIVKEGLEKLKKVDVIIVD 184 (433) T ss_pred HHHHHHHHHHHHHC-CCCEEEEECCCCCHHHHHHHHHHHHH-----CCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEE T ss_conf 99999999999977-99467850678836899999999986-----3887317889999999999999984569889997 Q ss_pred CHHHHCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHC--CCEEEEC---- Q ss_conf 4044011376333023321000332556577765331016678718999607802557889987730--6888862---- Q gi|254780653|r 147 AFSVSHRAHASITGLSHLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKK--VDKLVIG---- 220 (400) Q Consensus 147 AF~~aHR~haS~~gi~~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k--~D~iiig---- 220 (400) - |-|-|-. --+-.|+..+.+++ +|.. .+.|+-+.==-+-+...+.+-+. .+.++++ T Consensus 185 T---AGRl~~d-------------~~Lm~El~~i~~~~-~P~e-~llV~Da~~GQ~a~~~a~~F~~~~~i~gvIlTKlDg 246 (433) T PRK00771 185 T---AGRHKLE-------------KDLIEEMKQIKEIT-KPDE-VILVIDATIGQQASEQAKAFKEAVGIGGIIITKLDG 246 (433) T ss_pred C---CCCCCCC-------------HHHHHHHHHHHHHH-CCCE-EEEEEECCCCHHHHHHHHHHHHHCCCCEEEEECCCC T ss_conf 7---6521040-------------99999999998775-7976-899865442267899999998753887379972567 Q ss_pred ---HHHHHHHHHHHCCCC Q ss_conf ---057899999830143 Q gi|254780653|r 221 ---GGMANSFLVAQGMGV 235 (400) Q Consensus 221 ---G~lantfL~A~G~~I 235 (400) ||.|-..-.+.|..| T Consensus 247 darGGaaLSi~~~t~~PI 264 (433) T PRK00771 247 TAKGGGALSAVAETGAPI 264 (433) T ss_pred CCCCCHHHHHHHHHCCCC T ss_conf 887305421898878995 No 21 >PRK10867 signal recognition particle protein; Provisional Probab=76.74 E-value=6.3 Score=19.93 Aligned_cols=192 Identities=19% Similarity=0.252 Sum_probs=123.4 Q ss_pred CEEEEEEECCCCCCC------------CEE----C-CHHHHHHHHHHHHHHHHCC----------CEEEEE-ECCCCCCC Q ss_conf 689998201671318------------577----7-2589999889899999889----------809999-13778889 Q gi|254780653|r 16 LRCLLRVDWNVPFID------------GKV----A-DVTRIERVVPTILELVEKK----------AKVVIF-SHLGRPQS 67 (400) Q Consensus 16 k~VlvRvD~NvPi~~------------g~I----~-d~~RI~~~~pTI~~l~~~~----------akvii~-SH~GRPkg 67 (400) ++-|+..|.|+++.. .++ + ++.-++..-..+-.++... ..||++ .-||- | T Consensus 35 r~ALLeADV~~~vvk~~i~~vke~~~g~~v~~~l~p~q~i~kiv~~eL~~lLg~~~~~l~~~~~~p~VIm~vGLqGs--G 112 (453) T PRK10867 35 RMALLEADVALPVVREFINRVKEKAVGHEVNKSLTPGQEFVKIVRNELVAAMGEENQTLNLAAQPPAVVLMAGLQGA--G 112 (453) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEECCCCC--C T ss_conf 99999756887999999999999963351357898899999999999999858887666337899969999746888--5 Q ss_pred CCCCCCCHHHHHHHHHHHCCCCCCCC-CCCCHHHHHHHHHHCCCCCEEECCHHHHCCCCCCCHHH-HHH-----HHCCCC Q ss_conf 84733224788887643101335456-66310255665531364648982424303432100024-665-----312436 Q gi|254780653|r 68 KSDKDCSLFKVVSIAESILHKNILFV-NDCIGSTLSQSIASLSEGGIILAENVRFYSEEERNDPD-FVR-----MLSRNG 140 (400) Q Consensus 68 ~~~~~~Sl~~v~~~l~~~l~~~V~fi-~d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~~n~~~-f~k-----~La~la 140 (400) +..|....|.|+.+.-+++|..+ -|++-+.+.++++.+... -++.||..+...||. .++ .-.... T Consensus 113 ---KTTT~aKLA~~lk~k~~k~vllvaaDt~RpaA~eQL~~la~~-----~~v~~~~~~~~~dp~~ia~~a~~~ak~~~~ 184 (453) T PRK10867 113 ---KTTSVGKLGKFLREKHKKKVLVVSADVYRPAAIKQLETLAEQ-----VGVDFFPSDVGQKPVDIVNAALKEAKLKFY 184 (453) T ss_pred ---HHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHH-----CCCCEECCCCCCCHHHHHHHHHHHHHHCCC T ss_conf ---185899999999973898379855887705899999999985-----198043678899889999999999997799 Q ss_pred CEEEECCHHHHCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHC--CCEEE Q ss_conf 4247544044011376333023321000332556577765331016678718999607802557889987730--68888 Q gi|254780653|r 141 DFYINDAFSVSHRAHASITGLSHLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKK--VDKLV 218 (400) Q Consensus 141 DiyVnDAF~~aHR~haS~~gi~~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k--~D~ii 218 (400) |+-+-|- |-|-|-. --+-.|+..+.+++ +|.. .+.|+-+.-=-+-+...+.+-+. .+.++ T Consensus 185 DvvivDT---AGRl~~d-------------~~Lm~El~~i~~~~-~P~e-~llV~Da~~GQ~a~~~a~~F~~~~~~~gvI 246 (453) T PRK10867 185 DVLLVDT---AGRLHVD-------------EAMMDEIKQVHASI-NPVE-TLFVVDAMTGQDAANTAKAFNEALPLTGVV 246 (453) T ss_pred CEEEEEC---CCCHHCC-------------HHHHHHHHHHHHHH-CCCC-EEEEEECCCCHHHHHHHHHHHHHCCCCEEE T ss_conf 9999978---7601210-------------88899999998763-7871-379743223566899999999855987078 Q ss_pred EC-------HHHHHHHHHHHCCCC Q ss_conf 62-------057899999830143 Q gi|254780653|r 219 IG-------GGMANSFLVAQGMGV 235 (400) Q Consensus 219 ig-------G~lantfL~A~G~~I 235 (400) ++ ||-|-..-.+.|..| T Consensus 247 lTKlDgdarGG~alS~~~~t~~PI 270 (453) T PRK10867 247 LTKVDGDARGGAALSIRHITGKPI 270 (453) T ss_pred EECCCCCCCCHHHHHHHHHHCCCC T ss_conf 750467876138989999978696 No 22 >cd06556 ICL_KPHMT Members of the ICL/PEPM_KPHMT enzyme superfamily catalyze the formation and cleavage of either P-C or C-C bonds. Typical members are phosphoenolpyruvate mutase (PEPM), phosphonopyruvate hydrolase (PPH), carboxyPEP mutase (CPEP mutase), oxaloacetate hydrolase (OAH), isocitrate lyase (ICL), 2-methylisocitrate lyase (MICL), and ketopantoate hydroxymethyltransferase (KPHMT). Probab=75.96 E-value=6.5 Score=19.83 Aligned_cols=31 Identities=6% Similarity=0.094 Sum_probs=20.8 Q ss_pred CCCEEEEECCCCCCCC--CCHHHHHHHHHHHHH Q ss_conf 7767999766253227--436588999999998 Q gi|254780653|r 311 QARTVMWNGPLGVFEI--EPFDRATVEVAHYVA 341 (400) Q Consensus 311 ~AktI~WNGP~GvfE~--~~F~~GT~~l~~~ia 341 (400) ..+..+|+.-+|+++. |.|.+--..+...|. T Consensus 204 DGQvLV~~DllG~~~~~~PkFvK~y~~~~~~i~ 236 (240) T cd06556 204 DGQFLVLADAFGITGGHIPKFAKNFHAETGDIR 236 (240) T ss_pred CEEEEEHHHHCCCCCCCCCCHHHHHHCCHHHHH T ss_conf 667772664448999999834988625376899 No 23 >TIGR00873 gnd 6-phosphogluconate dehydrogenase, decarboxylating; InterPro: IPR006113 6-Phosphogluconate dehydrogenase (1.1.1.44 from EC) (6PGD) is an oxidative carboxylase that catalyses the decarboxylating reduction of 6-phosphogluconate into ribulose 5-phosphate in the presence of NADP. This reaction is a component of the hexose mono-phosphate shunt and pentose phosphate pathways (PPP) , . Prokaryotic and eukaryotic 6PGD are proteins of about 470 amino acids whose sequence are highly conserved . The protein is a homodimer in which the monomers act independently : each contains a large, mainly alpha-helical domain and a smaller beta-alpha-beta domain, containing a mixed parallel and anti-parallel 6-stranded beta sheet . NADP is bound in a cleft in the small domain, the substrate binding in an adjacent pocket . This model does not specify whether the cofactor is NADP only, NAD only, or both.; GO: 0004616 phosphogluconate dehydrogenase (decarboxylating) activity, 0050661 NADP binding, 0006098 pentose-phosphate shunt. Probab=75.46 E-value=6.8 Score=19.71 Aligned_cols=63 Identities=22% Similarity=0.372 Sum_probs=51.4 Q ss_pred CCHHHHCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCH--HHHHHHHHHHHHCCCEEEECH Q ss_conf 4404401137633302332100033255657776533101667871899960780--255788998773068888620 Q gi|254780653|r 146 DAFSVSHRAHASITGLSHLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKV--STKITLLINLVKKVDKLVIGG 221 (400) Q Consensus 146 DAF~~aHR~haS~~gi~~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKi--sdKi~~i~~L~~k~D~iiigG 221 (400) |.|=.-|-+|-+++|+ ++|+-+=..|+.|+|=++.|=.|+=| +.-|.-|.-++++-|-||=|| T Consensus 36 d~fl~~~~~gk~i~g~-------------~~ieeFV~~Le~PRKImLMVkAG~pVdaD~~I~~L~P~LE~GDiIIDGG 100 (480) T TIGR00873 36 DEFLKERAKGKKIVGA-------------YSIEEFVQSLERPRKIMLMVKAGAPVDADAVINSLLPLLEKGDIIIDGG 100 (480) T ss_pred HHHHHCCCCCCCCCCC-------------CCHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHHHCCCCCEEECCC T ss_conf 9998603788853343-------------2679999850688728887753885377899999644358998887588 No 24 >PRK06719 precorrin-2 dehydrogenase; Validated Probab=75.05 E-value=4.7 Score=20.78 Aligned_cols=43 Identities=16% Similarity=0.223 Sum_probs=33.3 Q ss_pred CCCCCCCHHHHCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 988877101001599689998201671318577725899998898999998898099991 Q gi|254780653|r 1 MSRLRTMNDLRDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFS 60 (400) Q Consensus 1 M~~~~ti~d~~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~S 60 (400) |++|--+ +-|++||+||| +..|++ +..-++.|++.||+|.++| T Consensus 1 m~~~~Pi--fl~l~~k~vLV-------vGGG~V--------A~rK~~~Ll~~ga~VtVvs 43 (157) T PRK06719 1 MYNMYPL--MFNLHNKVVVI-------IGGGKI--------AYRKASGLKDTGAFVTVVS 43 (157) T ss_pred CCCCCEE--EEECCCCEEEE-------ECCCHH--------HHHHHHHHHHCCCEEEEEC T ss_conf 9964306--89759987999-------889899--------9999999987879699999 No 25 >pfam00224 PK Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains. Probab=75.05 E-value=7 Score=19.65 Aligned_cols=103 Identities=19% Similarity=0.243 Sum_probs=59.4 Q ss_pred HHHHHHHHHHCCCEEEE--EECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHH-------HHCCCCC Q ss_conf 88989999988980999--91377888984733224788887643101335456663102556655-------3136464 Q gi|254780653|r 42 VVPTILELVEKKAKVVI--FSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGSTLSQSI-------ASLSEGG 112 (400) Q Consensus 42 ~~pTI~~l~~~~akvii--~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~~~i-------~~l~~g~ 112 (400) ...+|+.|++.|..+.= +||-.. +.....-...+..++.++++|...-|.-|++.+..- -.++.|+ T Consensus 16 ~~~~l~~li~~G~dv~RiN~SHg~~-----e~~~~~i~~iR~~~~~~~~~v~Il~Dl~GpkIR~g~~~~~~~~i~l~~G~ 90 (348) T pfam00224 16 SVENLEKLIKAGMNVARMNFSHGSH-----EYHQSRIDNVREAEEKLGGLVAIALDTKGPEIRTGLFKDGKKDIELKAGD 90 (348) T ss_pred CHHHHHHHHHCCCCEEEEECCCCCH-----HHHHHHHHHHHHHHHHHCCCEEEEEECCCCCEEEEECCCCCEEEEECCCC T ss_conf 9999999998699999997999999-----99999999999999981998079983689953555226885038966998 Q ss_pred EEECCHHHHCC--CCC----CCHHHHHHHHCCCCCEEEECCHH Q ss_conf 89824243034--321----00024665312436424754404 Q gi|254780653|r 113 IILAENVRFYS--EEE----RNDPDFVRMLSRNGDFYINDAFS 149 (400) Q Consensus 113 ilLLEN~Rf~~--~E~----~n~~~f~k~La~laDiyVnDAF~ 149 (400) .+.+..-.-+. ++. -+.+++.+.+..---+|++|.-- T Consensus 91 ~v~~~~~~~~~~~~~~~~i~v~~~~l~~~vk~Gd~I~idDG~i 133 (348) T pfam00224 91 KFLVSTDPAYKGAGDKEKIYVDYKNLTKDVSPGGIILVDDGVL 133 (348) T ss_pred EEEEECCCCCCCCCCCCEEECCCHHHHHHCCCCCEEEECCCEE T ss_conf 9999647302578986889737177698758998898438818 No 26 >COG0469 PykF Pyruvate kinase [Carbohydrate transport and metabolism] Probab=73.20 E-value=5.8 Score=20.19 Aligned_cols=306 Identities=16% Similarity=0.192 Sum_probs=158.1 Q ss_pred HHHHHHHHHHCCCEEEEE--ECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCH- Q ss_conf 889899999889809999--13778889847332247888876431013354566631025566553136464898242- Q gi|254780653|r 42 VVPTILELVEKKAKVVIF--SHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGSTLSQSIASLSEGGIILAEN- 118 (400) Q Consensus 42 ~~pTI~~l~~~~akvii~--SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN- 118 (400) +-+.|+.|+++|..|+=+ ||-. .++.-.....++..++.+|++|...-|.-||+.+. ..+++|.+-|-.. T Consensus 19 s~e~l~~li~aG~nV~RlNfSHG~-----~e~h~~~i~~vR~~~~~~~~~vaIl~DlkGPkIR~--g~~~~~~~~l~~G~ 91 (477) T COG0469 19 SEEMLEKLIEAGMNVVRLNFSHGD-----HEEHKKRIDNVREAAEKLGRPVAILLDLKGPKIRT--GKFKGGAVELEKGE 91 (477) T ss_pred CHHHHHHHHHCCCCEEEEECCCCC-----HHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCEE--EECCCCCEEEECCC T ss_conf 989999999705868988568898-----68999999999999998399169997598984304--75378848962698 Q ss_pred -HHHCCCCC----------CCHHHHHHHHCCCCCEEEECCHHH------H-----CC-----CCCCEEEC--CCCCCCCC Q ss_conf -43034321----------000246653124364247544044------0-----11-----37633302--33210003 Q gi|254780653|r 119 -VRFYSEEE----------RNDPDFVRMLSRNGDFYINDAFSV------S-----HR-----AHASITGL--SHLLPSYI 169 (400) Q Consensus 119 -~Rf~~~E~----------~n~~~f~k~La~laDiyVnDAF~~------a-----HR-----~haS~~gi--~~~lps~a 169 (400) ++|...+. -+.+.|.+.+..--.+++.|+-=- . -| .-.|--|+ |..-+ -. T Consensus 92 ~~~~~~~~~~~~~~~~~v~v~y~~l~~dV~~G~~iLlDDG~i~l~V~~v~~~~v~~~v~n~G~l~~~KgvN~pg~~l-~~ 170 (477) T COG0469 92 KFTLTTDDKVGEGDEERVSVDYKDLAKDVKPGDRILLDDGKIELRVVEVDGDAVITRVLNGGVLSSNKGVNLPGVDL-SL 170 (477) T ss_pred EEEEECCCCCCCCCCCEEECCHHHHHHHCCCCCEEEEECCEEEEEEEEEECCEEEEEEEECCCCCCCCCEECCCCCC-CC T ss_conf 89996344456887757842578878653889889996890489999960887999998397956887145589877-98 Q ss_pred CHHHHHHHHHH-------------------------HHHCCCCCCCEEEEEEECCHHHH--HHHHHHHHHCCCEEEECHH Q ss_conf 32556577765-------------------------33101667871899960780255--7889987730688886205 Q gi|254780653|r 170 GRAMQKELSML-------------------------ESCFSESKKPLVAIVGGSKVSTK--ITLLINLVKKVDKLVIGGG 222 (400) Q Consensus 170 G~l~ekEi~~L-------------------------~~~l~~~~~P~v~IiGGaKisdK--i~~i~~L~~k~D~iiigG~ 222 (400) ..+=||..+.| .+++.....+-+.|+ |||..+ +.-|+..++..|.|++.=| T Consensus 171 palteKD~~dl~f~~~~gvD~vA~SFVr~~~Dv~~~R~~l~~~~~~~~~ii--aKIE~~eav~NldeIi~~SDGIMVARG 248 (477) T COG0469 171 PALTEKDKEDLKFGLEQGVDFVALSFVRNAEDVEEVREILAETGGRDVKII--AKIENQEAVDNLDEIIEASDGIMVARG 248 (477) T ss_pred CCCCCCCHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHCCCCCEEE--EEECCHHHHHHHHHHHHHCCCEEEEEC T ss_conf 878834788899798659988999666888899999999998379872499--950587777579999986383299865 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCC--CCCCCC-EECC-CCCCCCCC Q ss_conf 7899999830143433331000013568899875302455430354113432323333--102532-1201-33457652 Q gi|254780653|r 223 MANSFLVAQGMGVGRSLCQRDFSDNVHQIAWEAKRSACEIIVPRDVVVAREMKTGIPT--QVVSAQ-SVPL-DSIILDVG 298 (400) Q Consensus 223 lantfL~A~G~~IG~Sl~e~~~~~~ak~i~~~a~~~~~~I~lP~D~~v~~~~~~~~~~--~~~~~~-~i~~-~~~i~DIG 298 (400) = .|.+||- ++..-.-|.+++.+.+.|...+.-+-..-+-. .+..+. ++.++. .+-+ .+-.+.-| T Consensus 249 D-------LGVEip~----e~Vp~~QK~iI~~~~~~gkpVItATQMLeSMi-~np~PTRAEvsDVanAvlDGtDAvMLS~ 316 (477) T COG0469 249 D-------LGVEIPL----EEVPIIQKRIIRKARRAGKPVITATQMLESMI-ENPRPTRAEVSDVANAVLDGTDAVMLSG 316 (477) T ss_pred C-------CCCCCCH----HHHHHHHHHHHHHHHHCCCCEEEEECCHHHHH-HCCCCCCHHHHHHHHHHHHCCCEEEECH T ss_conf 6-------5053477----87369999999999875996099706388884-0899871005688899872774356032 Q ss_pred --------CHHHHHHHHHHCCCCEEEE-ECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEE---CCCHHHHHHHHCC Q ss_conf --------0035677764057767999-766253227436588999999998611148858997---4804699899848 Q gi|254780653|r 299 --------FKTVEYIKQVIAQARTVMW-NGPLGVFEIEPFDRATVEVAHYVAKLTKERRIISIA---GGGDTITALAHAG 366 (400) Q Consensus 299 --------p~Ti~~~~~~I~~AktI~W-NGP~GvfE~~~F~~GT~~l~~~ia~~~~~~~~~siv---GGGdT~aai~~~g 366 (400) -++++.+..+...+..-+- +.-+.....+.+...+..++.+.......-++.-|| --|.|+-.+.++. T Consensus 317 ETA~G~yPveaV~~M~~I~~~aE~~~~~~~~~~~~~~~~~~~~~e~ia~aa~~~a~~l~~k~iv~~T~sG~ta~~isk~R 396 (477) T COG0469 317 ETAAGKYPVEAVATMARIAKEAEKELPDNQLLRFRVDPPDSSITEAIALAAVDIAEKLDAKAIVTLTESGRTARLLSKYR 396 (477) T ss_pred HHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHCCC T ss_conf 02068787999999999999874154223333320255446389999999999987457758999758877999996279 Q ss_pred CCC Q ss_conf 876 Q gi|254780653|r 367 ISN 369 (400) Q Consensus 367 ~~~ 369 (400) -.. T Consensus 397 p~~ 399 (477) T COG0469 397 PEA 399 (477) T ss_pred CCC T ss_conf 999 No 27 >PRK02472 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=72.26 E-value=8.2 Score=19.22 Aligned_cols=34 Identities=12% Similarity=0.017 Sum_probs=21.2 Q ss_pred HCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 0159968999820167131857772589999889899999889809999 Q gi|254780653|r 11 RDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIF 59 (400) Q Consensus 11 ~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~ 59 (400) .+++||+|+| +-=| ..-+...++|.++|++|++. T Consensus 5 ~~~~~k~i~v-------iGlG--------~sG~s~a~~L~~~G~~V~~~ 38 (450) T PRK02472 5 TTFQNKKVLV-------LGLA--------KSGYAAAKLLHKLGANVTVN 38 (450) T ss_pred CCCCCCEEEE-------EEEC--------HHHHHHHHHHHHCCCEEEEE T ss_conf 0358998999-------9778--------99999999999886989998 No 28 >PRK04308 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=71.53 E-value=8.5 Score=19.11 Aligned_cols=62 Identities=23% Similarity=0.326 Sum_probs=44.3 Q ss_pred CCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCC-EEEECHHHHHHHHHH Q ss_conf 33255657776533101667871899960780255788998773068-888620578999998 Q gi|254780653|r 169 IGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKKVD-KLVIGGGMANSFLVA 230 (400) Q Consensus 169 aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~D-~iiigG~lantfL~A 230 (400) .|.-+..|++-+.+.+.....|.++|=|--==+|.-.++.++++++. ...++|.++..++.+ T Consensus 90 ~gi~i~~~~el~~~~~~~~~~~~IaVTGTnGKTTTtsli~~iL~~~g~~~~~~GNiG~p~~~~ 152 (445) T PRK04308 90 NGGRVLGDIELLADIVNRRGDKVIAITGSNGKTTVTSLVGYLCIKCGLDTVIAGNIGTPVLEA 152 (445) T ss_pred CCCCEECHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHH T ss_conf 599571599999999841699579994899837799999999997599629985037521111 No 29 >TIGR01697 PNPH-PUNA-XAPA inosine guanosine and xanthosine phosphorylase family; InterPro: IPR011268 This entry describes a subset of the phosphorylase family. The entry excludes the methylthioadenosine phosphorylases (MTAP, IPR010044 from INTERPRO), which are believed to play a specific role in the recycling of methionine from methylthioadenosine. This entry consists of three clades of purine phosphorylases based on a neighbour-joining tree using the MTAP family as an out group. The highest-branching clade (IPR011269 from INTERPRO) consists of a group of sequences from both Gram-positive and Gram-negative bacteria which have been shown to act as purine nucleotide phosphorylases but whose physiological substrate and role in vivo remain unknown . Of the two remaining clades, one is xanthosine phosphorylase (XAPA, IPR010943 from INTERPRO); it is limited to certain gammaproteobacteria and constitutes a special purine phosphorylase found in a specialised operon for xanthosine catabolism . The enzyme also acts on the same purines (inosine and guanosine) as the other characterised members of this subfamily, but is only induced when xanthosine must be degraded. The remaining and largest clade consists of purine nucleotide phosphorylases (PNPH, IPR011270 from INTERPRO) from metazoa and bacteria which act primarily on guanosine and inosine, and do not act on adenosine. Sequences from Clostridium and Thermotoga fall between these last two clades and are uncharacterised with respect to substrate range. ; GO: 0004731 purine-nucleoside phosphorylase activity, 0006139 nucleobase nucleoside nucleotide and nucleic acid metabolic process. Probab=71.06 E-value=5 Score=20.60 Aligned_cols=123 Identities=20% Similarity=0.275 Sum_probs=78.1 Q ss_pred CCC--EEEEEECCCCCCCCCCCC-CCHHHHH--HHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCHHHHCCCC- Q ss_conf 898--099991377888984733-2247888--8764310133545666310255665531364648982424303432- Q gi|254780653|r 52 KKA--KVVIFSHLGRPQSKSDKD-CSLFKVV--SIAESILHKNILFVNDCIGSTLSQSIASLSEGGIILAENVRFYSEE- 125 (400) Q Consensus 52 ~~a--kvii~SH~GRPkg~~~~~-~Sl~~v~--~~l~~~l~~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E- 125 (400) .|- +|++| |||- -+=+. +|+..+. =++=++||.++.++..--|--= .++++||++++---=++.+- T Consensus 53 gg~rvpV~~m--~GRf--H~YEGn~d~~~vt~PvR~mk~lGv~~lvvTNAAGg~n----~~~~~GdLm~i~DHIN~~~~a 124 (266) T TIGR01697 53 GGKRVPVVVM--QGRF--HYYEGNYDMAAVTFPVRVMKLLGVEILVVTNAAGGLN----ADFKPGDLMIIKDHINLPGLA 124 (266) T ss_pred CCEEEEEEEE--ECCC--CCCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCC----CCCCCCCEEEEECHHHCCCCC T ss_conf 5711468885--0772--3567896666776899999866920899974661206----788985379971121144000 Q ss_pred --C----CCHHHHHHHHCCCCCEEEECCHHHHCCCCCCEEECCCCCC--CC---CCHHHHH--HHHHHHHH Q ss_conf --1----0002466531243642475440440113763330233210--00---3325565--77765331 Q gi|254780653|r 126 --E----RNDPDFVRMLSRNGDFYINDAFSVSHRAHASITGLSHLLP--SY---IGRAMQK--ELSMLESC 183 (400) Q Consensus 126 --~----~n~~~f~k~La~laDiyVnDAF~~aHR~haS~~gi~~~lp--s~---aG~l~ek--Ei~~L~~~ 183 (400) . .|+++|..++-++.|.|=-| |-..=++-|+-.++|+.+. .| .|+.+|. |++.+..+ T Consensus 125 g~nPLvGpN~~~FG~RFp~l~~AYD~~-lr~~~~~~A~~~~~~~~l~EGVYv~~~GP~YETPAE~rm~~~l 194 (266) T TIGR01697 125 GFNPLVGPNDDEFGTRFPDLSNAYDKE-LRALAKDVAKELGVPRLLKEGVYVMVSGPSYETPAEIRMLRIL 194 (266) T ss_pred CCCCCCCCCCCCCCCCCCCCCCHHHHH-HHHHHHHHHHHCCCCCEEECCEEEEECCCCCCCHHHHHHHHHH T ss_conf 478786988855578998766456799-9999999998568982565268885086595267899999984 No 30 >PRK06354 pyruvate kinase; Provisional Probab=69.57 E-value=8.3 Score=19.16 Aligned_cols=308 Identities=13% Similarity=0.188 Sum_probs=146.5 Q ss_pred HHHHHHHHHHCCCEEEE--EECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCHH Q ss_conf 88989999988980999--9137788898473322478888764310133545666310255665531364648982424 Q gi|254780653|r 42 VVPTILELVEKKAKVVI--FSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGSTLSQSIASLSEGGIILAENV 119 (400) Q Consensus 42 ~~pTI~~l~~~~akvii--~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~ 119 (400) ...+|+.|+++|..+.= +||-.. +.........+.+++.++++|...-|.-||+.+. ..+++|.|.|-++- T Consensus 22 s~e~l~~Li~aGmnV~RlNfSHg~~-----e~h~~~i~~iR~~~~~~~~~vaIl~Dl~GPkIR~--G~~~~~~i~L~~G~ 94 (589) T PRK06354 22 SPEKLRQLIEAGATTARLNFSHGDH-----EEHGARIKNIREASKELGKTVGILQDLQGPKIRL--GRFEDGPIELKTGD 94 (589) T ss_pred CHHHHHHHHHCCCCEEEEECCCCCH-----HHHHHHHHHHHHHHHHCCCCCEEEEECCCCEEEE--EECCCCCEEECCCC T ss_conf 9999999998699999998898999-----9999999999999998099817999788980388--75279828964799 Q ss_pred HH--CCCCCC--------CHHHHHHHHCCCCCEEEECCHHHHC-----C----CCCCE-----------EECCC---CCC Q ss_conf 30--343210--------0024665312436424754404401-----1----37633-----------30233---210 Q gi|254780653|r 120 RF--YSEEER--------NDPDFVRMLSRNGDFYINDAFSVSH-----R----AHASI-----------TGLSH---LLP 166 (400) Q Consensus 120 Rf--~~~E~~--------n~~~f~k~La~laDiyVnDAF~~aH-----R----~haS~-----------~gi~~---~lp 166 (400) +| ..++.. +.++|.+.+..-..++++|..=... + -++-+ +-+|. -+| T Consensus 95 ~~~lt~~~~~g~~~~i~v~y~~l~~~v~~Gd~IlidDG~I~l~V~~v~~~~~~i~c~V~~gG~L~~~KGVNlPg~~l~lp 174 (589) T PRK06354 95 PFILTSREVLGTQEKFSVTYDGLADEVPEGSRILLDDGLVEMEVEEVDKAAGELHCKVLVGGVLSNNKGVNFPGVSLSLP 174 (589) T ss_pred EEEEECCCCCCCCCEEECCHHHHHHHCCCCCEEEECCCEEEEEEEEEECCCCEEEEEEEECCEECCCCCCCCCCCCCCCC T ss_conf 99996588689778785366898976689988995488089999999678998999998793971898241799877877 Q ss_pred C------------------CCCHH---HHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHH--HHHHHHHCCCEEEECHHH Q ss_conf 0------------------03325---565777653310166787189996078025578--899877306888862057 Q gi|254780653|r 167 S------------------YIGRA---MQKELSMLESCFSESKKPLVAIVGGSKVSTKIT--LLINLVKKVDKLVIGGGM 223 (400) Q Consensus 167 s------------------~aG~l---~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~--~i~~L~~k~D~iiigG~l 223 (400) + |.... =.+.+..+.+++++..+ -+-|+ |||+.+-. =|+..++.+|.|++.=| T Consensus 175 ~lTeKD~~dl~f~~~~~vD~VAlSFVrsa~DV~~ir~~l~~~g~-~~~II--AKIE~~eav~NldeIi~~sDgIMVARG- 250 (589) T PRK06354 175 ALTEKDREDLIFGLEQGVDWIALSFVRNPSDVLEIKELIEHNGK-SIPVI--AKIEKQEAIDNIDAILELCDGVMVARG- 250 (589) T ss_pred CCCCCHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHCCC-CCEEE--EEECCHHHHHHHHHHHHHCCEEEEECC- T ss_conf 89820289999999749998998278998999999999986698-63499--974579999869999874769999548- Q ss_pred HHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCC--CCCCCC-EECCC-CCCCCCC- Q ss_conf 899999830143433331000013568899875302455430354113432323333--102532-12013-3457652- Q gi|254780653|r 224 ANSFLVAQGMGVGRSLCQRDFSDNVHQIAWEAKRSACEIIVPRDVVVAREMKTGIPT--QVVSAQ-SVPLD-SIILDVG- 298 (400) Q Consensus 224 antfL~A~G~~IG~Sl~e~~~~~~ak~i~~~a~~~~~~I~lP~D~~v~~~~~~~~~~--~~~~~~-~i~~~-~~i~DIG- 298 (400) -.|.+++- ++.--.-|+|+.++.+.+.++++-+...-+= ..+..|. +..++. .+-++ +-+|=-| T Consensus 251 ------DLGvEip~----e~VP~~QK~II~~c~~~gKPVI~ATQMLeSM-i~np~PTRAEvsDVANAV~DGtDAVMLSgE 319 (589) T PRK06354 251 ------DLGVEIPA----EEVPLLQKRLIKKANSLGIPVITATQMLDSM-QRNPRPTRAEVSDVANAILDGTDAVMLSNE 319 (589) T ss_pred ------CCCCCCCH----HHHHHHHHHHHHHHHHCCCCEEEEECCHHHH-CCCCCCCHHHHHHHHHHHHHCCCEEEECCC T ss_conf ------63202588----8847999999999997399789981557875-379998715467798999707878997462 Q ss_pred -------CHHHHHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEE---CCCHHHHHHHHCCCC Q ss_conf -------0035677764057767999766253227436588999999998611148858997---480469989984887 Q gi|254780653|r 299 -------FKTVEYIKQVIAQARTVMWNGPLGVFEIEPFDRATVEVAHYVAKLTKERRIISIA---GGGDTITALAHAGIS 368 (400) Q Consensus 299 -------p~Ti~~~~~~I~~AktI~WNGP~GvfE~~~F~~GT~~l~~~ia~~~~~~~~~siv---GGGdT~aai~~~g~~ 368 (400) -++++.+..++..++.-+|.-..--........=|.+++.+..+...+-++-.|+ --|.|+..+.+|--. T Consensus 320 TA~G~yPveaV~~M~~I~~~~E~~~~~~~~~~~~~~~~~~i~~aIa~aa~~~A~~l~a~aIv~~T~SG~TAr~vSk~RP~ 399 (589) T PRK06354 320 TAVGDYPVEAVQTMATIAVRIERDLPYKDIFSKRPEFTTTITNAISQAVSHIARQLDAAAIVTLTKSGATARNVSKFRPK 399 (589) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCC T ss_conf 24786789999999999999861701001012125668987899999999999866998999988981899999954989 Q ss_pred CCC Q ss_conf 686 Q gi|254780653|r 369 NEF 371 (400) Q Consensus 369 ~~~ 371 (400) -.+ T Consensus 400 ~pI 402 (589) T PRK06354 400 TPI 402 (589) T ss_pred CCE T ss_conf 989 No 31 >TIGR02460 osmo_MPGsynth mannosyl-3-phosphoglycerate synthase; InterPro: IPR012812 This family consists of examples of mannosyl-3-phosphoglycerate synthase (MPGS), which together with mannosyl-3-phosphoglycerate phosphatase (MPGP), comprises a two-step pathway for mannosylglycerate biosynthesis. Mannosylglycerate is a compatible solute that tends to be restricted to extreme thermophiles of archaea and bacteria. Note that in Rhodothermus marinus, this pathway is one of two; the other is condensation of GDP-mannose with D-glycerate by mannosylglycerate synthase.; GO: 0050504 mannosyl-3-phosphoglycerate synthase activity, 0051479 mannosylglycerate biosynthetic process, 0005737 cytoplasm. Probab=69.09 E-value=8.2 Score=19.21 Aligned_cols=210 Identities=16% Similarity=0.257 Sum_probs=108.1 Q ss_pred HHHCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHH-HHHHHHHHCC Q ss_conf 100159968999820167131857772589999889899999889809999137788898473322478-8887643101 Q gi|254780653|r 9 DLRDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFK-VVSIAESILH 87 (400) Q Consensus 9 d~~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~-v~~~l~~~l~ 87 (400) |+.|+-.++++| ||++|.++.=-.-.-+++| +.|-|||.|-=-|- . .+.|+++- .++..+.+++ T Consensus 50 ~L~d~~~~~aiv-----vP~knE~L~llegVL~~IP-------~e~li~vvSnS~Re--~-vdRF~~EveL~r~f~~~~~ 114 (394) T TIGR02460 50 ELEDVLEKTAIV-----VPVKNEKLKLLEGVLSGIP-------HECLIIVVSNSKRE--P-VDRFKMEVELVREFSRLTE 114 (394) T ss_pred HHHHHHHCCEEE-----EECCCCCHHHHHHHHHCCC-------CCCCEEEEECCCCC--C-CCHHHHHHHHHHHHHHHHC T ss_conf 799885085788-----7111474143311342278-------83768888667489--9-8566789999999998618 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCCCCEEECCHHHHCCCCCCCHHH---HHHHHCCC-CCEEEECCHHHHCCCCCCEEECCC Q ss_conf 33545666310255665531364648982424303432100024---66531243-642475440440113763330233 Q gi|254780653|r 88 KNILFVNDCIGSTLSQSIASLSEGGIILAENVRFYSEEERNDPD---FVRMLSRN-GDFYINDAFSVSHRAHASITGLSH 163 (400) Q Consensus 88 ~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~~n~~~---f~k~La~l-aDiyVnDAF~~aHR~haS~~gi~~ 163 (400) ++|.-+.. .+|..-++.++.+=-+ +|=||-|-=+| .-| ++=-||++ +--||= | +.-.+ T Consensus 115 ~~i~i~hQ-~DP~LA~Al~~~~y~~-~~~~~G~vRsG----KaEGMllGl~lA~~~G~~YvG--F----------vDsDN 176 (394) T TIGR02460 115 RKILIIHQ-KDPALAEALKEVGYES-ILGESGRVRSG----KAEGMLLGLLLAAALGAEYVG--F----------VDSDN 176 (394) T ss_pred CCEEEEEC-CCHHHHHHHHCCCCCC-CCCCCCCEECC----HHHHHHHHHHHHHHHCCCEEE--E----------EECCC T ss_conf 95189972-8878999973388867-55798856435----043899999999986387003--6----------74067 Q ss_pred CCC-C--------CCCHHHHHHHHHHHHHCCCCCCCEEEEEE-E------CCHHHHH-HHHHHHHHC---CC-EEEECH- Q ss_conf 210-0--------03325565777653310166787189996-0------7802557-889987730---68-888620- Q gi|254780653|r 164 LLP-S--------YIGRAMQKELSMLESCFSESKKPLVAIVG-G------SKVSTKI-TLLINLVKK---VD-KLVIGG- 221 (400) Q Consensus 164 ~lp-s--------~aG~l~ekEi~~L~~~l~~~~~P~v~IiG-G------aKisdKi-~~i~~L~~k---~D-~iiigG- 221 (400) |.| | ||||+|.+-==+.=++. =+-||.++ =+ + .-||+.= ..|+.|++. .. .|+.+| T Consensus 177 Y~PGAv~EYv~~YaAgf~mA~s~YsmvRi~-Wr~KPKl~-~~e~l~f~k~GRvSe~~N~~LN~l~S~~~~fEt~ii~TGN 254 (394) T TIGR02460 177 YFPGAVNEYVKIYAAGFLMATSPYSMVRIV-WRYKPKLT-EDEDLYFRKWGRVSEITNKYLNLLISEVTGFETDIIKTGN 254 (394) T ss_pred CCCCCHHHHHHHHHHHHHHCCCCCCCEEEE-CCCCCEEE-CCCCEEECCCCCCCCCHHHHHHHHHHHCCEEEEEEEECCC T ss_conf 877425688999998642026864313765-27887363-5871463331100240257899887510201000133067 Q ss_pred --HHHHHHHHHHCCCC--CCCCCCCCCCHHHHHHHHHHHHCCC Q ss_conf --57899999830143--4333310000135688998753024 Q gi|254780653|r 222 --GMANSFLVAQGMGV--GRSLCQRDFSDNVHQIAWEAKRSAC 260 (400) Q Consensus 222 --~lantfL~A~G~~I--G~Sl~e~~~~~~ak~i~~~a~~~~~ 260 (400) ==|.|.-.|.-.+- |-| +| --+++...+++|. T Consensus 255 AGEhAlt~~LA~~Ln~sSGYa-vE------pyelv~~LE~fG~ 290 (394) T TIGR02460 255 AGEHALTMKLAEELNYSSGYA-VE------PYELVYILEKFGG 290 (394) T ss_pred CHHHHHHHHHHHHCCCCCCCC-CC------HHHHHHHHHHHCC T ss_conf 156889899984277757761-00------6899999987246 No 32 >COG0703 AroK Shikimate kinase [Amino acid transport and metabolism] Probab=67.88 E-value=5.7 Score=20.24 Aligned_cols=30 Identities=17% Similarity=0.402 Sum_probs=16.4 Q ss_pred CEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCC Q ss_conf 80999913778889847332247888876431013354566 Q gi|254780653|r 54 AKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVN 94 (400) Q Consensus 54 akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~ 94 (400) .+++|+.-.|- || .| +.+.|.+.|+.+ |+| T Consensus 3 ~~IvLiG~mGa--GK----ST---IGr~LAk~L~~~--F~D 32 (172) T COG0703 3 MNIVLIGFMGA--GK----ST---IGRALAKALNLP--FID 32 (172) T ss_pred CCEEEECCCCC--CH----HH---HHHHHHHHCCCC--CCC T ss_conf 61899717999--77----68---999999981998--022 No 33 >PRK06247 pyruvate kinase; Provisional Probab=67.86 E-value=10 Score=18.62 Aligned_cols=302 Identities=16% Similarity=0.234 Sum_probs=138.9 Q ss_pred HHHHHHHHHHCCCEEEE--EECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHH-----HHHCCCCCEE Q ss_conf 88989999988980999--9137788898473322478888764310133545666310255665-----5313646489 Q gi|254780653|r 42 VVPTILELVEKKAKVVI--FSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGSTLSQS-----IASLSEGGII 114 (400) Q Consensus 42 ~~pTI~~l~~~~akvii--~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~~~-----i~~l~~g~il 114 (400) ...+|+.|+++|..+.= +||-. + +.....-...+.+++-++++|.-.-|.-|++.+.. --.++.|+.+ T Consensus 19 ~~e~l~~li~aG~~v~RiN~SHg~-~----e~~~~~i~~iR~~~~~~~~~i~Il~Dl~GpkIR~g~~~~~~i~l~~G~~v 93 (477) T PRK06247 19 SEDMIRKLVEAGADVFRLNFSHGD-H----DDHRELYKRIREVEDELGRPIGILADLQGPKLRLGRFADGKVQLANGQTF 93 (477) T ss_pred CHHHHHHHHHCCCCEEEEECCCCC-H----HHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEEECCCCCEEECCCCEE T ss_conf 999999999879999999889899-9----99999999999999971997079997889836888537985896489999 Q ss_pred ECCHHHHCCCCCC--------CHHHHHHHHCCCCCEEEECCHHH---------------------HCCCCCCEEECCCCC Q ss_conf 8242430343210--------00246653124364247544044---------------------011376333023321 Q gi|254780653|r 115 LAENVRFYSEEER--------NDPDFVRMLSRNGDFYINDAFSV---------------------SHRAHASITGLSHLL 165 (400) Q Consensus 115 LLEN~Rf~~~E~~--------n~~~f~k~La~laDiyVnDAF~~---------------------aHR~haS~~gi~~~l 165 (400) -| ..++.. +.++|.+.+..--.+|+.|.-=. ..|..-.+-+..--+ T Consensus 94 ~l-----~~~~~~~~~~~i~v~~~~l~~~v~~Gd~I~idDG~i~l~V~~~~~~~i~~~V~~gG~L~s~KgvnlP~~~l~l 168 (477) T PRK06247 94 RL-----DVDDAPGDHDRVSLPHPEILAALKPGDRLLVDDGKVRLVVEACDGDDVVCRVVEGGPVSDRKGVSLPGTVLPV 168 (477) T ss_pred EE-----ECCCCCCCCCEEEECCHHHHHHCCCCCEEEEECCCEEEEEEECCCCEEEEEEEECCEECCCCCCCCCCCCCCC T ss_conf 99-----6677789766677435777975689988999279559999971698699999628597388712046875575 Q ss_pred CC------------------CCCHHH---HHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHH--HHHHHHCCCEEEECHH Q ss_conf 00------------------033255---657776533101667871899960780255788--9987730688886205 Q gi|254780653|r 166 PS------------------YIGRAM---QKELSMLESCFSESKKPLVAIVGGSKVSTKITL--LINLVKKVDKLVIGGG 222 (400) Q Consensus 166 ps------------------~aG~l~---ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~--i~~L~~k~D~iiigG~ 222 (400) |+ +.+..| .+.+..+.+++. ++ +-|+ |||+++-.+ ++..++.+|.|++.=| T Consensus 169 p~lTekD~~di~~a~~~~vD~ialSFVrsa~DV~~vr~~l~--~~--~~II--aKIE~~~av~NldeIi~~sDgIMVARG 242 (477) T PRK06247 169 SALTEKDRADLEFALELGVDWVALSFVQRPEDVEEVRKVIG--GR--VPVM--AKIEKPQAVDRLEAIVEASDAIMVARG 242 (477) T ss_pred CCCCHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHCC--CC--CCEE--EEECCHHHHHHHHHHHHHHCEEEEECC T ss_conf 57897799999989975999999757698789999998658--76--5089--996388787748999986177899658 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCC--CCCCCCC-EECCC-CCCC--- Q ss_conf 789999983014343333100001356889987530245543035411343232333--3102532-12013-3457--- Q gi|254780653|r 223 MANSFLVAQGMGVGRSLCQRDFSDNVHQIAWEAKRSACEIIVPRDVVVAREMKTGIP--TQVVSAQ-SVPLD-SIIL--- 295 (400) Q Consensus 223 lantfL~A~G~~IG~Sl~e~~~~~~ak~i~~~a~~~~~~I~lP~D~~v~~~~~~~~~--~~~~~~~-~i~~~-~~i~--- 295 (400) -.|.+++- ++.--.-|.++.++.+.+.+.++-+...-+= ..+..| .++.++. .+-++ +-++ T Consensus 243 -------DLgvEi~~----e~vp~~Qk~ii~~~~~~gKpvivATqmLeSM-~~~p~PTRAEv~DVanAv~dG~DavmLs~ 310 (477) T PRK06247 243 -------DLGVEVPL----ESVPLIQKRIIRMAREAGKPVVVATQMLESM-IESPVPTRAEVSDVATAVLEGADAVMLSA 310 (477) T ss_pred -------CCCCCCCH----HHHHHHHHHHHHHHHHCCCEEEEECCHHHHH-CCCCCCCHHHHHHHHHHHHHCCCEEEECC T ss_conf -------65455798----8979999999999986299699971508874-37999865878789999983787899861 Q ss_pred -----CCCCHHHHHHHHHHCCCCEEEEECCCCCCC-CCCHHHHHHHHHHHHHHHHCCCCCEEEE---CCCHHHHHHHHCC Q ss_conf -----652003567776405776799976625322-7436588999999998611148858997---4804699899848 Q gi|254780653|r 296 -----DVGFKTVEYIKQVIAQARTVMWNGPLGVFE-IEPFDRATVEVAHYVAKLTKERRIISIA---GGGDTITALAHAG 366 (400) Q Consensus 296 -----DIGp~Ti~~~~~~I~~AktI~WNGP~GvfE-~~~F~~GT~~l~~~ia~~~~~~~~~siv---GGGdT~aai~~~g 366 (400) .---++++.++.++..+..-.+-...-.+. ......-+..++.+........++-.|+ --|-|+..+.++- T Consensus 311 ETA~G~yPv~~V~~m~~I~~~aE~~~~~~~~~~~~~~~~~~~~~~~ia~~a~~~a~~~~a~~Iv~~T~sG~ta~~is~~R 390 (477) T PRK06247 311 ETASGKYPVEAVRTMARIIRQVERDPPYTHVPRAQRPQPEATGADAISYAARDIAETLDLAALVAYTSSGDTALRAARER 390 (477) T ss_pred HHCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHC T ss_conf 00489778999999999999986384134444302688778999999999999986568988999858957999999549 Q ss_pred CCCCC Q ss_conf 87686 Q gi|254780653|r 367 ISNEF 371 (400) Q Consensus 367 ~~~~~ 371 (400) -.-.+ T Consensus 391 p~~pI 395 (477) T PRK06247 391 PPLPI 395 (477) T ss_pred CCCCE T ss_conf 89988 No 34 >pfam08645 PNK3P Polynucleotide kinase 3 phosphatase. Polynucleotide kinase 3 phosphatases play a role in the repair of single breaks in DNA induced by DNA-damaging agents such as gamma radiation and camptothecin. Probab=66.80 E-value=11 Score=18.48 Aligned_cols=23 Identities=35% Similarity=0.495 Sum_probs=20.6 Q ss_pred HHHHHHHHHHHCCCEEEEEECCC Q ss_conf 98898999998898099991377 Q gi|254780653|r 41 RVVPTILELVEKKAKVVIFSHLG 63 (400) Q Consensus 41 ~~~pTI~~l~~~~akvii~SH~G 63 (400) .+.+.|+.|.++|-++||+|.|+ T Consensus 33 ~V~~~L~~L~~~gY~iVIvTNQ~ 55 (158) T pfam08645 33 SVPEKLKKLHEDGYKIVIFTNQG 55 (158) T ss_pred CHHHHHHHHHHCCCEEEEEECCC T ss_conf 88999999998898799994775 No 35 >TIGR02584 cas_NE0113 CRISPR-associated protein, NE0113 family; InterPro: IPR013413 Members of this minor CRISPR-associated (Cas) protein family are encoded in cas gene clusters in Vibrio vulnificus YJ016, Nitrosomonas europaea ATCC 19718, Mannheimia succiniciproducens MBEL55E, and Verrucomicrobium spinosum.. Probab=66.29 E-value=9 Score=18.94 Aligned_cols=41 Identities=32% Similarity=0.382 Sum_probs=29.1 Q ss_pred HHHHHHHHHHHCCCCC---EEEECCCHHHH-----HHHHCCC-CCCCCEE Q ss_conf 9999999861114885---89974804699-----8998488-7686468 Q gi|254780653|r 334 VEVAHYVAKLTKERRI---ISIAGGGDTIT-----ALAHAGI-SNEFTYV 374 (400) Q Consensus 334 ~~l~~~ia~~~~~~~~---~sivGGGdT~a-----ai~~~g~-~~~~~hv 374 (400) .-|++.+++++...+. -||.||-=|.. |+..||- .|+.+|| T Consensus 119 ~~I~~~va~l~~~~d~~lh~SlAGGRKTMGfyLG~Al~Lfar~~D~l~HV 168 (224) T TIGR02584 119 EFIVKLVAKLTAAKDAVLHASLAGGRKTMGFYLGYALSLFAREQDRLSHV 168 (224) T ss_pred HHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHHHCCCCCEEEEE T ss_conf 89999998753304838998606872578899999999736888706888 No 36 >pfam02548 Pantoate_transf Ketopantoate hydroxymethyltransferase. Ketopantoate hydroxymethyltransferase (EC:2.1.2.11) is the first enzyme in the pantothenate biosynthesis pathway. Probab=66.11 E-value=7.1 Score=19.62 Aligned_cols=133 Identities=20% Similarity=0.335 Sum_probs=57.9 Q ss_pred HHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHC----CCEEEECHHHHHHHHHHHCCCC-CCCCCCCCCCHHHH Q ss_conf 577765331016678718999607802557889987730----6888862057899999830143-43333100001356 Q gi|254780653|r 175 KELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKK----VDKLVIGGGMANSFLVAQGMGV-GRSLCQRDFSDNVH 249 (400) Q Consensus 175 kEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k----~D~iiigG~lantfL~A~G~~I-G~Sl~e~~~~~~ak 249 (400) .-+++-.+++.+..--.|-+=||.. -++.++.|.+. +-|| |.+-...-+.-||.+ |+ ..+.++ T Consensus 95 ~a~~nA~rlmke~GadaVKlEgg~~---~~~~I~~l~~~GIPV~gHi---GL~PQ~~~~~GG~r~qGk------~~~ea~ 162 (261) T pfam02548 95 QALRNAARLMKEAGADAVKLEGGAE---MADTIKALVDRGIPVMGHI---GLTPQSVNQLGGYKVQGR------TEEEAE 162 (261) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCC---HHHHHHHHHHCCCCEEEEE---CCCCHHCCCCCCCCCCCC------CHHHHH T ss_conf 9999999999854999899788853---5899999998899765342---367401123688510259------999999 Q ss_pred HHHHHHH---HCCCCCC----CEEEEECCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 8899875---3024554----30354113432323333102532120133457652003567776405776799976625 Q gi|254780653|r 250 QIAWEAK---RSACEII----VPRDVVVAREMKTGIPTQVVSAQSVPLDSIILDVGFKTVEYIKQVIAQARTVMWNGPLG 322 (400) Q Consensus 250 ~i~~~a~---~~~~~I~----lP~D~~v~~~~~~~~~~~~~~~~~i~~~~~i~DIGp~Ti~~~~~~I~~AktI~WNGP~G 322 (400) +|++.++ +-|+--+ +|.+++ + ++...-.||. +..--||.+ ..+..+|+--+| T Consensus 163 ~l~~dA~~le~AGa~~ivlE~vp~~la--~--------~It~~~~IPt--IGIGAG~~c---------DGQvLV~~DlLG 221 (261) T pfam02548 163 QLLEDAKALEEAGAFALVLECVPAELA--K--------EITEKLSIPT--IGIGAGPGC---------DGQVLVLHDMLG 221 (261) T ss_pred HHHHHHHHHHHCCCCEEEEECCHHHHH--H--------HHHHCCCCCE--EEECCCCCC---------CEEEEEHHHHCC T ss_conf 999999999846874899966709999--9--------9996489989--972668988---------568884765408 Q ss_pred CCC--CCCHHHHHHHHHHHH Q ss_conf 322--743658899999999 Q gi|254780653|r 323 VFE--IEPFDRATVEVAHYV 340 (400) Q Consensus 323 vfE--~~~F~~GT~~l~~~i 340 (400) +++ .|+|.+--..+.+.+ T Consensus 222 ~~~~~~Pkf~k~y~~~~~~~ 241 (261) T pfam02548 222 LFSGFKPKFVKRYADLAGVI 241 (261) T ss_pred CCCCCCCCHHHHHHHHHHHH T ss_conf 99898995598886379999 No 37 >PRK03918 chromosome segregation protein; Provisional Probab=65.97 E-value=9.7 Score=18.73 Aligned_cols=10 Identities=0% Similarity=0.165 Sum_probs=4.1 Q ss_pred HHHHHHHHHH Q ss_conf 9999999986 Q gi|254780653|r 333 TVEVAHYVAK 342 (400) Q Consensus 333 T~~l~~~ia~ 342 (400) ...+++++.. T Consensus 833 ~~~~~~~L~~ 842 (882) T PRK03918 833 RRKLVDIMER 842 (882) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 38 >PRK00311 panB 3-methyl-2-oxobutanoate hydroxymethyltransferase; Reviewed Probab=65.81 E-value=9.7 Score=18.74 Aligned_cols=135 Identities=21% Similarity=0.356 Sum_probs=64.7 Q ss_pred HHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHC----CCEEEECHHHHHHHHHHHCCCC-CCCCCCCCCCHH Q ss_conf 56577765331016678718999607802557889987730----6888862057899999830143-433331000013 Q gi|254780653|r 173 MQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKK----VDKLVIGGGMANSFLVAQGMGV-GRSLCQRDFSDN 247 (400) Q Consensus 173 ~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k----~D~iiigG~lantfL~A~G~~I-G~Sl~e~~~~~~ 247 (400) .+.-+++-.+++.+..--.|-+=||..+ ++.++.|.+. +-|| |.+-....+.-||.+ |++ .+. T Consensus 94 ~~~a~~nA~rl~ke~gadaVKlEgg~~~---~~~i~~l~~~GIPV~gHi---GL~PQ~~~~~GG~r~qGkt------~~e 161 (266) T PRK00311 94 PEQALRNAGRLMKEAGAHAVKLEGGEEL---AETIRRLVERGIPVMGHL---GLTPQSVNVLGGYKVQGRD------EEA 161 (266) T ss_pred HHHHHHHHHHHHHHCCCCEEEECCCCCH---HHHHHHHHHCCCCEEEEE---CCCCCCCCCCCCCCCCCCC------HHH T ss_conf 9999999999998569998997888336---799999998799766541---5765001136886022599------999 Q ss_pred HHHHHHHHH---HCCCCCC----CEEEEECCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHHCCCCEEEEECC Q ss_conf 568899875---3024554----303541134323233331025321201334576520035677764057767999766 Q gi|254780653|r 248 VHQIAWEAK---RSACEII----VPRDVVVAREMKTGIPTQVVSAQSVPLDSIILDVGFKTVEYIKQVIAQARTVMWNGP 320 (400) Q Consensus 248 ak~i~~~a~---~~~~~I~----lP~D~~v~~~~~~~~~~~~~~~~~i~~~~~i~DIGp~Ti~~~~~~I~~AktI~WNGP 320 (400) +++|++.|+ +-|+--+ +|.+++ . ++...-.||. ++.--||.+ ..+..+|+-- T Consensus 162 a~~l~~dA~~le~AGaf~ivlE~Vp~~la--~--------~It~~l~IPt--IGIGAG~~c---------DGQvLV~~Dl 220 (266) T PRK00311 162 AEQLLEDAKALEEAGAFALVLECVPAELA--K--------EITEALSIPT--IGIGAGPDC---------DGQVLVMHDM 220 (266) T ss_pred HHHHHHHHHHHHHCCCEEEEEECCCHHHH--H--------HHHHCCCCCE--EEECCCCCC---------CEEEEEHHHH T ss_conf 99999999999847954999955869999--9--------9996489988--970558998---------5688866343 Q ss_pred CCCCC--CCCHHHHHHHHHHHH Q ss_conf 25322--743658899999999 Q gi|254780653|r 321 LGVFE--IEPFDRATVEVAHYV 340 (400) Q Consensus 321 ~GvfE--~~~F~~GT~~l~~~i 340 (400) +|++. .|.|.+--..+.+.+ T Consensus 221 LG~~~~~~PkFvk~y~~~~~~~ 242 (266) T PRK00311 221 LGLFSGFKPKFVKRYADLGGSI 242 (266) T ss_pred CCCCCCCCCCHHHHHHHHHHHH T ss_conf 3899898995899987279999 No 39 >PRK06124 gluconate 5-dehydrogenase; Provisional Probab=63.51 E-value=12 Score=18.08 Aligned_cols=67 Identities=16% Similarity=0.225 Sum_probs=41.4 Q ss_pred CCCCCCCHHHHCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHH Q ss_conf 98887710100159968999820167131857772589999889899999889809999137788898473322478888 Q gi|254780653|r 1 MSRLRTMNDLRDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVS 80 (400) Q Consensus 1 M~~~~ti~d~~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~ 80 (400) |+.|.-++.+ ||+||++||- | -.+-|= ..+-+.|.+.|++|++. +|- .-.++..++ T Consensus 1 ~~~~~~~~~~-~L~gK~alIT---------G---gs~GIG--~~ia~~la~~Ga~V~i~---~r~------~~~~~~~~~ 56 (259) T PRK06124 1 MPSMSYLQRF-SLAGQVALVT---------G---SARGLG--LEIARALAEAGAHVLVN---GRN------AARVEAAVA 56 (259) T ss_pred CCCCCCHHHC-CCCCCEEEEE---------C---CCCHHH--HHHHHHHHHCCCEEEEE---ECC------HHHHHHHHH T ss_conf 9986723640-9999989992---------8---674899--99999999879999999---698------899999999 Q ss_pred HHHHHCCCCCCC Q ss_conf 764310133545 Q gi|254780653|r 81 IAESILHKNILF 92 (400) Q Consensus 81 ~l~~~l~~~V~f 92 (400) .+++. +.++.+ T Consensus 57 ~l~~~-g~~~~~ 67 (259) T PRK06124 57 ALRAA-GGAAEA 67 (259) T ss_pred HHHHC-CCCEEE T ss_conf 99965-995899 No 40 >PRK10530 phosphotransferase; Provisional Probab=61.67 E-value=10 Score=18.60 Aligned_cols=46 Identities=13% Similarity=0.254 Sum_probs=29.2 Q ss_pred CCEECCHH-H-HHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 85777258-9-999889899999889809999137788898473322478888764 Q gi|254780653|r 30 DGKVADVT-R-IERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAE 83 (400) Q Consensus 30 ~g~I~d~~-R-I~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~ 83 (400) ||+++++. + -.+.+.+|+.+.++|-++++.| ||| ..++.|+.+.+. T Consensus 11 DGTLl~~~~~i~~~~~~ai~~l~~~Gi~~~iaT--GR~------~~~~~~~~~~l~ 58 (272) T PRK10530 11 DGTLLTPKKTILPSSLEALARAREAGYQLIIVT--GRH------HVAIHPFYQALA 58 (272) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEC--CCC------HHHHHHHHHHHC T ss_conf 736358989599999999999997899999995--998------788899999809 No 41 >TIGR02094 more_P_ylases alpha-glucan phosphorylases; InterPro: IPR011834 This family consists of known phosphorylases, and homologues believed to share the function of using inorganic phosphate to cleave an alpha 1,4 linkage between the terminal glucose residue and the rest of the polymer (maltodextrin, glycogen, etc.). The name of the glucose storage polymer substrate, and therefore the name of this enzyme, depends on the chain lengths and branching patterns. A number of the members of this family have been shown to operate on small maltodextrins, as may be obtained by utilization of exogenous sources. This family represents a distinct clade from the related families: Glycogen/starch/alpha-glucan phosphorylase and Glycosyl transferase, family 35.; GO: 0004645 phosphorylase activity, 0030170 pyridoxal phosphate binding, 0005975 carbohydrate metabolic process. Probab=61.63 E-value=7.1 Score=19.59 Aligned_cols=60 Identities=13% Similarity=0.359 Sum_probs=43.7 Q ss_pred CCCCCEEECCCCCCCCC-CHHHHHHHHHHHHHCCC-CCCCEEEEEEE-CCHHHHHH--HHHHHHH Q ss_conf 13763330233210003-32556577765331016-67871899960-78025578--8998773 Q gi|254780653|r 153 RAHASITGLSHLLPSYI-GRAMQKELSMLESCFSE-SKKPLVAIVGG-SKVSTKIT--LLINLVK 212 (400) Q Consensus 153 R~haS~~gi~~~lps~a-G~l~ekEi~~L~~~l~~-~~~P~v~IiGG-aKisdKi~--~i~~L~~ 212 (400) -.-+=|.||++-.-.|= .-|+-+.++-|.+++++ ++||+-.|++| |--.|-.+ .|+.+.+ T Consensus 399 ~p~~LtIGFARRfAtYKRa~Llf~D~eRL~~il~~n~~RPVQ~vFAGKAHP~D~~GK~~Iq~i~~ 463 (618) T TIGR02094 399 SPDVLTIGFARRFATYKRATLLFRDLERLKRILNNNPERPVQIVFAGKAHPADEAGKELIQRIVE 463 (618) T ss_pred CCCCCEECHHHCCCCCCCHHHHCCCHHHHHHHHCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHH T ss_conf 78852522122276002613220777899998356888856889715567997545899999999 No 42 >cd06557 KPHMT-like Ketopantoate hydroxymethyltransferase (KPHMT) is the first enzyme in the pantothenate biosynthesis pathway. Ketopantoate hydroxymethyltransferase (KPHMT) catalyzes the first committed step in the biosynthesis of pantothenate (vitamin B5), which is a precursor to coenzyme A and is required for penicillin biosynthesis. Probab=61.46 E-value=11 Score=18.51 Aligned_cols=134 Identities=18% Similarity=0.290 Sum_probs=63.7 Q ss_pred HHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHC-CCEEEECHHHHHHHHHHHCCCC-CCCCCCCCCCHHHHHH Q ss_conf 6577765331016678718999607802557889987730-6888862057899999830143-4333310000135688 Q gi|254780653|r 174 QKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKK-VDKLVIGGGMANSFLVAQGMGV-GRSLCQRDFSDNVHQI 251 (400) Q Consensus 174 ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k-~D~iiigG~lantfL~A~G~~I-G~Sl~e~~~~~~ak~i 251 (400) +.-+++-.+++++..--.|-+=||.. ..+.++.|.+. .--+-=-|..-....+..||.+ |++ .+.++++ T Consensus 90 ~~A~~nA~rl~ke~GadaVKlEgg~~---~~~~i~~L~~~GIPV~gHiGLtPQ~~~~~Gg~k~qGk~------~~ea~~l 160 (254) T cd06557 90 EQALRNAARLMKEAGADAVKLEGGAE---VAETIRALVDAGIPVMGHIGLTPQSVNQLGGYKVQGKT------EEEAERL 160 (254) T ss_pred HHHHHHHHHHHHHCCCCEEEECCCCC---HHHHHHHHHHCCCCEEEEECCCCHHCCCCCCCEEECCC------HHHHHHH T ss_conf 99999999999855999899798833---89999999987997665415774011216875322499------9999999 Q ss_pred HHHHH---HCCCCCC----CEEEEECCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCC Q ss_conf 99875---3024554----3035411343232333310253212013345765200356777640577679997662532 Q gi|254780653|r 252 AWEAK---RSACEII----VPRDVVVAREMKTGIPTQVVSAQSVPLDSIILDVGFKTVEYIKQVIAQARTVMWNGPLGVF 324 (400) Q Consensus 252 ~~~a~---~~~~~I~----lP~D~~v~~~~~~~~~~~~~~~~~i~~~~~i~DIGp~Ti~~~~~~I~~AktI~WNGP~Gvf 324 (400) ++.|+ +-|+--+ +|.+++ .++...-.||. ++.--||.+ ..+..+|.--+|++ T Consensus 161 ~~dA~~le~AGaf~ivlE~vp~~la----------~~It~~~~IPt--IGIGAG~~c---------DGQvLV~~DlLG~~ 219 (254) T cd06557 161 LEDALALEEAGAFALVLECVPAELA----------KEITEALSIPT--IGIGAGPDC---------DGQVLVWHDMLGLS 219 (254) T ss_pred HHHHHHHHHCCCEEEEEECCHHHHH----------HHHHHCCCCCE--EECCCCCCC---------CEEEEEHHHHHCCC T ss_conf 9999999847955999956709999----------99985789888--945678998---------70378474642789 Q ss_pred CC--CCHHHHHHHHH Q ss_conf 27--43658899999 Q gi|254780653|r 325 EI--EPFDRATVEVA 337 (400) Q Consensus 325 E~--~~F~~GT~~l~ 337 (400) +. |.|.+--..+. T Consensus 220 ~~~~PkFvK~y~~~~ 234 (254) T cd06557 220 PGFKPKFVKRYADLG 234 (254) T ss_pred CCCCCCHHHHHHHHH T ss_conf 898996398886379 No 43 >TIGR01064 pyruv_kin pyruvate kinase; InterPro: IPR001697 Pyruvate kinase (2.7.1.40 from EC) (PK) catalyses the final step in glycolysis , the conversion of phosphoenolpyruvate to pyruvate with concomitant phosphorylation of ADP to ATP:ADP + phosphoenolpyruvate = ATP + pyruvate The enzyme, which is found in all living organisms, requires both magnesium and potassium ions for its activity. In vertebrates, there are four tissue-specific isozymes: L (liver), R (red cells), M1 (muscle, heart and brain), and M2 (early foetal tissue). In plants, PK exists as cytoplasmic and plastid isozymes, while most bacteria and lower eukaryotes have one form, except in certain bacteria, such as Escherichia coli, that have two isozymes. All isozymes appear to be tetramers of identical subunits of ~500 residues. PK helps control the rate of glycolysis, along with phosphofructokinase (IPR000023 from INTERPRO) and hexokinase (IPR001312 from INTERPRO). PK possesses allosteric sites for numerous effectors, yet the isozymes respond differently, in keeping with their different tissue distributions . The activity of L-type (liver) PK is increased by fructose-1,6-bisphosphate (F1,6BP) and lowered by ATP and alanine (gluconeogenic precursor), therefore when glucose levels are high, glycolysis is promoted, and when levels are low, gluconeogenesis is promoted. L-type PK is also hormonally regulated, being activated by insulin and inhibited by glucagon, which covalently modifies the PK enzyme. M1-type (muscle, brain) PK is inhibited by ATP, but F1,6BP and alanine have no effect, which correlates with the function of muscle and brain, as opposed to the liver. The structure of several pyruvate kinases from various organisms have been determined , . The protein comprises three-four domains: a small N-terminal helical domain (absent in bacterial PK), a beta/alpha-barrel domain, a beta-barrel domain (inserted within the beta/alpha-barrel domain), and a 3-layer alpha/beta/alpha sandwich domain. ; GO: 0000287 magnesium ion binding, 0004743 pyruvate kinase activity, 0030955 potassium ion binding, 0006096 glycolysis. Probab=59.94 E-value=11 Score=18.43 Aligned_cols=305 Identities=13% Similarity=0.192 Sum_probs=174.1 Q ss_pred HHHHHHHHHHCCCEEEEE--ECCCCCCCCCCCCCCHHHHHHHHHHHCC-CCCCCCCCCCHHHHHHHHHHCCC-CCEEECC Q ss_conf 889899999889809999--1377888984733224788887643101-33545666310255665531364-6489824 Q gi|254780653|r 42 VVPTILELVEKKAKVVIF--SHLGRPQSKSDKDCSLFKVVSIAESILH-KNILFVNDCIGSTLSQSIASLSE-GGIILAE 117 (400) Q Consensus 42 ~~pTI~~l~~~~akvii~--SH~GRPkg~~~~~~Sl~~v~~~l~~~l~-~~V~fi~d~~g~~~~~~i~~l~~-g~ilLLE 117 (400) +---|. ++++|-.|+=| ||- .++..-+.-..++..++.++ ++|.-.-|+-||+.+. ..+++ |++-|-+ T Consensus 33 AYAniE-iikaGmnVaRlNFSHG-----~~E~h~~~i~~vR~~~~~~~~~~vaI~lDtkGPeIR~--g~~~~~~~~~l~~ 104 (513) T TIGR01064 33 AYANIE-IIKAGMNVARLNFSHG-----SHEEHGKRIENVREASEKLGGRPVAILLDTKGPEIRT--GEIKGDGEVKLKK 104 (513) T ss_pred HHHHHH-HHHHCCCEEEECCCCC-----CHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEE--EECCCCCCEEECC T ss_conf 775455-5320886355426798-----9889999999999999971995089997268877998--7417887568627 Q ss_pred HHHH-------CCCCC------CCHHH---HHHHHCCCCCEEEECCHHH----------------------HCCCCCCEE Q ss_conf 2430-------34321------00024---6653124364247544044----------------------011376333 Q gi|254780653|r 118 NVRF-------YSEEE------RNDPD---FVRMLSRNGDFYINDAFSV----------------------SHRAHASIT 159 (400) Q Consensus 118 N~Rf-------~~~E~------~n~~~---f~k~La~laDiyVnDAF~~----------------------aHR~haS~~ 159 (400) .=.+ |.+|- -+.+. |.+....-..|||.|.-=. .-|+-=++= T Consensus 105 G~~v~~~~~~~~~~~~~~~~~~v~Y~~gYnl~~~v~~G~~iLvDDG~i~L~V~~~~~~~~v~~~v~ngG~l~~~KGvNlP 184 (513) T TIGR01064 105 GDKVILTTDVKYKGEGSEEKVSVDYKGGYNLVKDVVEGDKILVDDGKISLVVVSVEGDKTVICEVLNGGTLKSKKGVNLP 184 (513) T ss_pred CCEEEEECCHHHHCCCCCCCEEEECCCCCCCCCCCCCCCEEEEECCEEEEEEEEECCCCEEEEEEEECCEECCCCCEECC T ss_conf 88799825713420347786687167865565402688889876677999998862885689999968687178715558 Q ss_pred ECCCCCCCC-------CCHH--------------HHHHHHHHHHHCCCCCCCEEEEEEECCHHH--HHHH--HHHHHHCC Q ss_conf 023321000-------3325--------------565777653310166787189996078025--5788--99877306 Q gi|254780653|r 160 GLSHLLPSY-------IGRA--------------MQKELSMLESCFSESKKPLVAIVGGSKVST--KITL--LINLVKKV 214 (400) Q Consensus 160 gi~~~lps~-------aG~l--------------~ekEi~~L~~~l~~~~~P~v~IiGGaKisd--Ki~~--i~~L~~k~ 214 (400) |..--||+- .=|- -...++.+.++|.....|-+-|| |||.. -+.= ++..++-. T Consensus 185 G~~~~LP~lsEKD~~Dl~Fg~~~gvD~va~SFvR~~~DV~~~R~~L~~~G~~~~~II--aKIE~~~gv~N~a~deI~~aS 262 (513) T TIGR01064 185 GADVDLPALSEKDKKDLKFGVEQGVDFVAASFVRTAEDVEEVREVLEEKGGKDVKII--AKIENQEGVDNDAIDEIAEAS 262 (513) T ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHCCCCCCEEE--EEECCCCCCCHHHHHHHHHHH T ss_conf 872015667988899999998729788997278887569999999997088986488--763684335706799999872 Q ss_pred CEEEECHHHHHHHHHHH---CCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCC--CCCCCC-EE Q ss_conf 88886205789999983---0143433331000013568899875302455430354113432323333--102532-12 Q gi|254780653|r 215 DKLVIGGGMANSFLVAQ---GMGVGRSLCQRDFSDNVHQIAWEAKRSACEIIVPRDVVVAREMKTGIPT--QVVSAQ-SV 288 (400) Q Consensus 215 D~iiigG~lantfL~A~---G~~IG~Sl~e~~~~~~ak~i~~~a~~~~~~I~lP~D~~v~~~~~~~~~~--~~~~~~-~i 288 (400) |.|++ |+ |.+|+- | +..-.-|.++++|...+..+++-+-..-+=. .+..|+ +..++. .| T Consensus 263 DGiMV----------ARGDLGvEip~---e-eVp~~QK~~I~~cn~~gk~VItATQMLdSMi-~Np~PTRAEVsDVANAi 327 (513) T TIGR01064 263 DGIMV----------ARGDLGVEIPA---E-EVPILQKKLIRKCNRAGKPVITATQMLDSMI-KNPRPTRAEVSDVANAI 327 (513) T ss_pred CCEEE----------EECCCCEECCH---H-HHHHHHHHHHHHHHHHCCEEEEEECCHHHCC-CCCCCCCCEEEEEEEEE T ss_conf 95289----------76567025472---4-7999999999999850992799833245500-68889411132235553 Q ss_pred CCC-CCCCCCC--------CHHHHHHHHHHCCCCEEEE-ECCCCC--CCCCCHHHHHHHHHHHHHHHHCC-------CCC Q ss_conf 013-3457652--------0035677764057767999-766253--22743658899999999861114-------885 Q gi|254780653|r 289 PLD-SIILDVG--------FKTVEYIKQVIAQARTVMW-NGPLGV--FEIEPFDRATVEVAHYVAKLTKE-------RRI 349 (400) Q Consensus 289 ~~~-~~i~DIG--------p~Ti~~~~~~I~~AktI~W-NGP~Gv--fE~~~F~~GT~~l~~~ia~~~~~-------~~~ 349 (400) -++ +-+|==| -++++.++.+...|+..+- .--+.- ...+.++.=+.+|+.++...... -++ T Consensus 328 LDGtDAvMLSGETA~G~YP~eAV~~M~~Ia~~aE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~L~ns~~~ 407 (513) T TIGR01064 328 LDGTDAVMLSGETAKGKYPVEAVQMMAKIAKEAEKALAYLTNFNDRKNSTLKPSTITEAIALSAVEAAEKLDKLVNSLDA 407 (513) T ss_pred ECCCCEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 06776131010113477808999999999999986333343021345203477760778989999999864122100155 Q ss_pred EEEE---CCCHHHHHHHHCCCCCCC Q ss_conf 8997---480469989984887686 Q gi|254780653|r 350 ISIA---GGGDTITALAHAGISNEF 371 (400) Q Consensus 350 ~siv---GGGdT~aai~~~g~~~~~ 371 (400) .-|| -.|.|+-.+.++--.-.+ T Consensus 408 kaivv~T~~G~ta~~~S~~rp~~pi 432 (513) T TIGR01064 408 KAIVVPTESGRTARLLSKYRPSAPI 432 (513) T ss_pred CEEEEECCCCCHHHHHHHHCCCCCE T ss_conf 2689814898478889752779838 No 44 >TIGR00088 trmD tRNA (guanine-N1)-methyltransferase; InterPro: IPR002649 In transfer RNA many different modified nucleosides are found, especially in the anticodon region. tRNA (guanine-N1-)-methyltransferase 2.1.1.31 from EC is one of several nucleases operating together with the tRNA-modifying enzymes before the formation of the mature tRNA. It catalyses the reaction:S-adenosyl-L-methionine + tRNA -> S-adenosyl-L-homocysteine + tRNA containing N1-methylguanine methylating guanosine(G) to N1-methylguanine (1-methylguanosine (m1G)) at position 37 of tRNAs that read CUN (leucine), CCN(proline), and CGG (arginine) codons. The presence of m1G improves the cellular growth rate and the polypeptide steptime and also prevents the tRNA from shifting the reading frame . The mechanism of the trmD3-induced frameshift involving mutant tRNA(Pro) and tRNA(Leu) species has been investigated . It has been suggested that the conformation of the anticodon loop may be a major determining element for the formation of m1G37 in vivo . ; GO: 0003723 RNA binding, 0008175 tRNA methyltransferase activity, 0008033 tRNA processing. Probab=59.06 E-value=14 Score=17.74 Aligned_cols=46 Identities=24% Similarity=0.320 Sum_probs=35.4 Q ss_pred EEECCCCCC--CCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCC Q ss_conf 820167131--85777258999988989999988980999913778889 Q gi|254780653|r 21 RVDWNVPFI--DGKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQS 67 (400) Q Consensus 21 RvD~NvPi~--~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg 67 (400) ||| .-|.- .|-|++..=|..|+.+.+..-.++|+||++|=+|++.. T Consensus 48 tvD-D~pYGGG~GMvLk~ePi~~Al~~~~~~~~~~~~villsP~G~~~~ 95 (247) T TIGR00088 48 TVD-DRPYGGGAGMVLKPEPIFDALESVKAAKKEGAKVILLSPQGEKFD 95 (247) T ss_pred CCC-CCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCC T ss_conf 256-753378976504867899999852104325887999788996426 No 45 >TIGR02463 MPGP_rel mannosyl-3-phosphoglycerate phosphatase homolog; InterPro: IPR012815 This family consists of members of the HAD superfamily, subfamily IIB. All members are closely related to mannosyl-3-phosphoglycerate phosphatase, the second enzyme in a two-step pathway for biosynthesis of mannosylglycerate, a compatible solute present in some thermophiles and in Dehalococcoides ethenogenes. However, members of this family are separable in a neighbour-joining tree constructed from a multiple sequence alignment and are found only in mesophiles that lack the companion mannosyl-3-phosphoglycerate synthase (IPR012812 from INTERPRO). Members of this family are likely to act on a compound related to yet distinct from mannosyl-3-phosphoglycerate.; GO: 0050531 mannosyl-3-phosphoglycerate phosphatase activity, 0005737 cytoplasm. Probab=58.83 E-value=8.1 Score=19.22 Aligned_cols=67 Identities=21% Similarity=0.267 Sum_probs=50.2 Q ss_pred CCCEECC--HHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCC-CCCCCCCCHHHHHHHH Q ss_conf 1857772--589999889899999889809999137788898473322478888764310133-5456663102556655 Q gi|254780653|r 29 IDGKVAD--VTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKN-ILFVNDCIGSTLSQSI 105 (400) Q Consensus 29 ~~g~I~d--~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~-V~fi~d~~g~~~~~~i 105 (400) -||..+| .|+...+.+||+.|-+.+--||+.| .|--. =++.|++-||.. --||- T Consensus 6 lDGTLlD~H~Ydw~PA~~~l~~L~e~~iPvI~Ct------SKTAA------Ev~~lr~~L~L~~~PyIv----------- 62 (224) T TIGR02463 6 LDGTLLDSHTYDWQPAAPWLTRLQEAGIPVILCT------SKTAA------EVEALRKALGLTGDPYIV----------- 62 (224) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEECC------CCCHH------HHHHHHHHHCCCCCCEEE----------- T ss_conf 6755257767654677999999997589764258------73078------999999984889898177----------- Q ss_pred HHCCCCCEEECCHHHH Q ss_conf 3136464898242430 Q gi|254780653|r 106 ASLSEGGIILAENVRF 121 (400) Q Consensus 106 ~~l~~g~ilLLEN~Rf 121 (400) ++|-++-+|...- T Consensus 63 ---ENGa~I~g~~~~~ 75 (224) T TIGR02463 63 ---ENGAAIHGETLWR 75 (224) T ss_pred ---ECCEEEECCCCCC T ss_conf ---6460567752145 No 46 >PTZ00066 pyruvate kinase; Provisional Probab=58.77 E-value=14 Score=17.76 Aligned_cols=309 Identities=15% Similarity=0.178 Sum_probs=144.1 Q ss_pred HHHHHHHHHHCCCEEEE--EECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCC-EEECCH Q ss_conf 88989999988980999--913778889847332247888876431013354566631025566553136464-898242 Q gi|254780653|r 42 VVPTILELVEKKAKVVI--FSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGSTLSQSIASLSEGG-IILAEN 118 (400) Q Consensus 42 ~~pTI~~l~~~~akvii--~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~~~i~~l~~g~-ilLLEN 118 (400) ...+|+.|+++|..+.= +||-.. + +-...+..+.+..++..+++|...-|.-|++.+.. .+++++ |.|-++ T Consensus 52 ~~e~L~~li~aG~nv~RlNfSHg~~-e---~h~~~i~~iR~a~~~~~~~~v~Il~Dl~GPkIR~G--~l~~~~~i~l~~G 125 (513) T PTZ00066 52 NVETLVQLIDAGMNICRFNFSHGDH-E---SHKKTLNNVREAQKSRPNANIGLMLDTKGPEIRTG--FLKNHKPITLKEG 125 (513) T ss_pred CHHHHHHHHHCCCCEEEEECCCCCH-H---HHHHHHHHHHHHHHHCCCCCEEEEEECCCCEEEEE--ECCCCCEEEECCC T ss_conf 9999999998699999998899999-9---99999999999998659996699998889825888--7489964785389 Q ss_pred HHH--CCC-CC--------CCHHHHHHHHCCCCCEEEECCHHH-------------------HCCCCCCE--EECCCCCC Q ss_conf 430--343-21--------000246653124364247544044-------------------01137633--30233210 Q gi|254780653|r 119 VRF--YSE-EE--------RNDPDFVRMLSRNGDFYINDAFSV-------------------SHRAHASI--TGLSHLLP 166 (400) Q Consensus 119 ~Rf--~~~-E~--------~n~~~f~k~La~laDiyVnDAF~~-------------------aHR~haS~--~gi~~~lp 166 (400) -++ ..+ +. -+.++|.+.+..--.+|+.|.-=. -=+++-++ -|..-.+| T Consensus 126 ~~v~i~~~~~~~g~~~~I~v~~~~l~~~v~~Gd~IlidDG~I~l~V~~v~~~~v~~~V~~gG~L~s~KgVnlP~~~l~lp 205 (513) T PTZ00066 126 STLKITTDYTILGDETCISCSYKKLPQSVKVGNIILIADGSLSCEVLEVHDDHIVVKVLNSATIGERKNMNLPGVKVELP 205 (513) T ss_pred CEEEEECCCCCCCCCCEEEECCCCCHHHCCCCCEEEEECCEEEEEEEEECCCEEEEEEEECCEEECCCCEECCCCCCCCC T ss_conf 99999748865798787972586577567889889997997999999822977999997483974786433589866755 Q ss_pred C-------------------CCCHH---HHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHH--HHHHHHCCCEEEECHH Q ss_conf 0-------------------03325---5657776533101667871899960780255788--9987730688886205 Q gi|254780653|r 167 S-------------------YIGRA---MQKELSMLESCFSESKKPLVAIVGGSKVSTKITL--LINLVKKVDKLVIGGG 222 (400) Q Consensus 167 s-------------------~aG~l---~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~--i~~L~~k~D~iiigG~ 222 (400) + +.+.. =.+.+..+.+++.... +.+-|+ |||+++-.+ |+..++.+|.|++.=| T Consensus 206 ~lTekD~~dil~fa~~~~vD~IalSFVrs~~DV~~~r~~l~~~g-~~~~II--aKIE~~~av~NldeIi~~sDgIMVARG 282 (513) T PTZ00066 206 VIGEKDKNDILNFAIPMNCDFIALSFIQSADDVRLCRNLLGERG-KHIKII--PKIENIEGLINFDKILAESDGIMVARG 282 (513) T ss_pred CCCCCCHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHCC-CCCEEE--EEECCHHHHHCHHHHHHHCCEEEEECC T ss_conf 47636889999999875999999867799899999999999759-864589--984376556589999985898999568 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCC--CCCCCCC-EECCC-CCCC--- Q ss_conf 789999983014343333100001356889987530245543035411343232333--3102532-12013-3457--- Q gi|254780653|r 223 MANSFLVAQGMGVGRSLCQRDFSDNVHQIAWEAKRSACEIIVPRDVVVAREMKTGIP--TQVVSAQ-SVPLD-SIIL--- 295 (400) Q Consensus 223 lantfL~A~G~~IG~Sl~e~~~~~~ak~i~~~a~~~~~~I~lP~D~~v~~~~~~~~~--~~~~~~~-~i~~~-~~i~--- 295 (400) -.|.+++- ++.--.-|+|+.++.+.|.+.++-+...-+= ..+..| .++.++. .+-++ +-+| T Consensus 283 -------DLgvEip~----e~vp~~QK~II~~c~~~gKPVIvATqmLeSM-i~np~PTRAEvsDVaNAV~DGaDavMLSg 350 (513) T PTZ00066 283 -------DLGMEIPP----EKVFLAQKLMISKCNLQGKPIITATQMLESM-IKNPRPTRAESTDVANAVLDGSDCVMLSG 350 (513) T ss_pred -------CCCCCCCH----HHHHHHHHHHHHHHHHCCCCEEEECHHHHHH-CCCCCCCHHHHHHHHHHHHHCCCEEEECC T ss_conf -------54242698----8868999999999997599699974257774-04999871667789999984677788735 Q ss_pred -----CCCCHHHHHHHHHHCCCCEEEEECCCC---CCCCCCHHHHHHHHHHHHHHHHCCCCCEEEE---CCCHHHHHHHH Q ss_conf -----652003567776405776799976625---3227436588999999998611148858997---48046998998 Q gi|254780653|r 296 -----DVGFKTVEYIKQVIAQARTVMWNGPLG---VFEIEPFDRATVEVAHYVAKLTKERRIISIA---GGGDTITALAH 364 (400) Q Consensus 296 -----DIGp~Ti~~~~~~I~~AktI~WNGP~G---vfE~~~F~~GT~~l~~~ia~~~~~~~~~siv---GGGdT~aai~~ 364 (400) .--.++++.+.+++.+|+..+|....- ..+.+....-+.+++.+......+.++..|+ --|.|+..+.+ T Consensus 351 ETA~GkyPveaV~~m~~I~~~aE~~~~~~~~~~~~~~~~~~~~~~~~aia~~av~~A~~~~akaIv~~T~sG~ta~~iS~ 430 (513) T PTZ00066 351 ETAGGKFPVEAVTIMSKLCFEAEACIDYRLLYQSLVLAISTPVSVQEAIARSAVELAEDIEAKLIIALTETGYTARLISK 430 (513) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHH T ss_conf 22477798999999999999987041066677665422688898799999999999973799899998799789999996 Q ss_pred CCCCCCC Q ss_conf 4887686 Q gi|254780653|r 365 AGISNEF 371 (400) Q Consensus 365 ~g~~~~~ 371 (400) +--.-.+ T Consensus 431 ~RP~~pI 437 (513) T PTZ00066 431 YRPSCTI 437 (513) T ss_pred HCCCCCE T ss_conf 5989999 No 47 >TIGR01163 rpe ribulose-phosphate 3-epimerase; InterPro: IPR000056 Ribulose-phosphate 3-epimerase (5.1.3.1 from EC) (also known as pentose-5-phosphate 3-epimerase or PPE) is the enzyme that converts D-ribulose 5-phosphate into D-xylulose 5-phosphate in Calvin's reductive pentose phosphate cycle. In Alcaligenes eutrophus two copies of the gene coding for PPE are known , one is chromosomally encoded P40117 from SWISSPROT, the other one is on a plasmid Q04539 from SWISSPROT. PPE has been found in a wide range of bacteria, archaebacteria, fungi and plants. All the proteins have from 209 to 241 amino acid residues. The enzyme has a TIM barrel structure.; GO: 0004750 ribulose-phosphate 3-epimerase activity, 0005975 carbohydrate metabolic process. Probab=58.60 E-value=15 Score=17.54 Aligned_cols=159 Identities=14% Similarity=0.205 Sum_probs=95.8 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEE---EEECCCCCCCCCCCCCCHHH-HHHHHHHH-CCCCCCCCCCCCHHHHHHHHHHCC Q ss_conf 25899998898999998898099---99137788898473322478-88876431-013354566631025566553136 Q gi|254780653|r 35 DVTRIERVVPTILELVEKKAKVV---IFSHLGRPQSKSDKDCSLFK-VVSIAESI-LHKNILFVNDCIGSTLSQSIASLS 109 (400) Q Consensus 35 d~~RI~~~~pTI~~l~~~~akvi---i~SH~GRPkg~~~~~~Sl~~-v~~~l~~~-l~~~V~fi~d~~g~~~~~~i~~l~ 109 (400) |..|+..- |+.+.++||-.+ +| .|.|.|..|..| |++.|.++ ...++. T Consensus 10 Df~rLgee---~~~v~~AGaD~iH~DVM------DGHFVPNlT~Gp~v~~~~r~~g~~~P~D------------------ 62 (216) T TIGR01163 10 DFARLGEE---VKAVEEAGADLIHVDVM------DGHFVPNLTFGPPVLEALRKYGTKLPID------------------ 62 (216) T ss_pred CHHHHHHH---HHHHHHCCCCEEEEEEC------CCCCCCCCCCCHHHHHHHHHHCCCCCEE------------------ T ss_conf 47779999---99999669978998624------7971771002778999887407952126------------------ Q ss_pred CCCEEECCHHHHCCCCCCCHHHHHHHHCCCCCEEEECCHHHHCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCC Q ss_conf 46489824243034321000246653124364247544044011376333023321000332556577765331016678 Q gi|254780653|r 110 EGGIILAENVRFYSEEERNDPDFVRMLSRNGDFYINDAFSVSHRAHASITGLSHLLPSYIGRAMQKELSMLESCFSESKK 189 (400) Q Consensus 110 ~g~ilLLEN~Rf~~~E~~n~~~f~k~La~laDiyVnDAF~~aHR~haS~~gi~~~lps~aG~l~ekEi~~L~~~l~~~~~ 189 (400) ==||.||. ..|++.+++.+=-||- -|-. | -.+.. +.|..+-+...+ T Consensus 63 --VHLMv~~p----------d~~~~~Fa~aGA~~I~-----vH~E------------a--~~h~~---R~l~~Ik~~G~~ 108 (216) T TIGR01163 63 --VHLMVENP----------DRYIEDFAEAGADIIT-----VHAE------------A--TEHIH---RLLQLIKELGAK 108 (216) T ss_pred --EEECCCCH----------HHHHHHHHHHCCCEEE-----EECC------------C--CCCHH---HHHHHHHHCCCC T ss_conf --63035785----------7778899970899899-----8437------------7--62679---999999971897 Q ss_pred CEEEEEEECCHHHHHHHHHHHHHCCCEEEECHHHHHHHHHHHCCCC-----CCCCCCCCCCHHHHHHHHHHH--HCCCCC Q ss_conf 7189996078025578899877306888862057899999830143-----433331000013568899875--302455 Q gi|254780653|r 190 PLVAIVGGSKVSTKITLLINLVKKVDKLVIGGGMANSFLVAQGMGV-----GRSLCQRDFSDNVHQIAWEAK--RSACEI 262 (400) Q Consensus 190 P~v~IiGGaKisdKi~~i~~L~~k~D~iiigG~lantfL~A~G~~I-----G~Sl~e~~~~~~ak~i~~~a~--~~~~~I 262 (400) |-+++ +-.|-+..|+++++.||.|++ +.+ |+++... .++..+++.+..+ +.+..+ T Consensus 109 AG~v~----NP~TPl~~~~~~L~~~D~VLl-------------MSVnPGFgGQkFIP~-~~~Kir~~R~~id~~~~~~~~ 170 (216) T TIGR01163 109 AGIVL----NPATPLEALEYVLEDVDLVLL-------------MSVNPGFGGQKFIPE-TLEKIRELRKMIDKLELGLSI 170 (216) T ss_pred EEEEE----CCCCCHHHHHHHHHHCCEEEE-------------EEEECCCCCCCCHHH-HHHHHHHHHHHHHHHCCCCCE T ss_conf 06886----799998789989876298998-------------876079988411057-899999999999860279955 Q ss_pred CCEEEEECCC Q ss_conf 4303541134 Q gi|254780653|r 263 IVPRDVVVAR 272 (400) Q Consensus 263 ~lP~D~~v~~ 272 (400) .+-+|-=+.. T Consensus 171 ~ieVDGGv~~ 180 (216) T TIGR01163 171 LIEVDGGVNE 180 (216) T ss_pred EEEECCCCCH T ss_conf 8997179897 No 48 >PRK06718 precorrin-2 dehydrogenase; Reviewed Probab=58.22 E-value=14 Score=17.69 Aligned_cols=35 Identities=23% Similarity=0.337 Sum_probs=29.5 Q ss_pred HCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 01599689998201671318577725899998898999998898099991 Q gi|254780653|r 11 RDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFS 60 (400) Q Consensus 11 ~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~S 60 (400) -|++||+||| +..|++ +..-++.|++.||+|.++| T Consensus 6 l~l~gk~vLV-------VGGG~v--------A~rK~~~Ll~~gA~VtVvs 40 (202) T PRK06718 6 IDLSNKRVVI-------VGGGKV--------AGRRAITLLKYGAHITVIS 40 (202) T ss_pred EECCCCEEEE-------ECCCHH--------HHHHHHHHHHCCCEEEEEC T ss_conf 9828986999-------889899--------9999999986899699986 No 49 >KOG2007 consensus Probab=56.71 E-value=7.1 Score=19.59 Aligned_cols=97 Identities=22% Similarity=0.298 Sum_probs=57.3 Q ss_pred CCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCC--HHHHHHHHH--HCCCCCEEECCHH-HHCCCCCC Q ss_conf 980999913778889847332247888876431013354566631--025566553--1364648982424-30343210 Q gi|254780653|r 53 KAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCI--GSTLSQSIA--SLSEGGIILAENV-RFYSEEER 127 (400) Q Consensus 53 ~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~--g~~~~~~i~--~l~~g~ilLLEN~-Rf~~~E~~ 127 (400) |..|-=.||+|-- ....|+--+.++|+.+++.+|.|+.... +++.....+ .|.. ..+.|++. |+|.+|=. T Consensus 62 GpTvYD~SHmGHA----rsYVsfDIlrRiL~dyfgy~V~~vmNiTDVDDKII~RAr~~~m~e-~~~~l~~~F~~~e~eF~ 136 (586) T KOG2007 62 GPTVYDSSHMGHA----RSYVSFDILRRILRDYFGYDVTFVMNITDVDDKIIKRARKEEMGE-KPLSLSERFCYYEEEFL 136 (586) T ss_pred CCCCCCHHHHHHH----HHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHCC-CHHHHHHHHHHHHHHHH T ss_conf 6801341221014----666229999999999728636999435520489998765523023-41218998888899999 Q ss_pred CH-------------------H---HHHHHHCCCCCEEEECC---HHHHCCC Q ss_conf 00-------------------2---46653124364247544---0440113 Q gi|254780653|r 128 ND-------------------P---DFVRMLSRNGDFYINDA---FSVSHRA 154 (400) Q Consensus 128 n~-------------------~---~f~k~La~laDiyVnDA---F~~aHR~ 154 (400) .| | .|.+++-..+-.|+-|. |.+.-.. T Consensus 137 ~DM~~LnvLpPtv~tRVSeyvp~II~fIqkIIdnG~aY~~dGsVYFdv~kf~ 188 (586) T KOG2007 137 QDMAALNVLPPTVQTRVSEYVPQIIKFIQKIIDNGYAYAVDGSVYFDVDKFP 188 (586) T ss_pred HHHHHHCCCCCCCCCHHHHCHHHHHHHHHHHHHCCCEEEECCCEEEECCCCC T ss_conf 8999808899853014542447899999999847853652780898644465 No 50 >TIGR01179 galE UDP-glucose 4-epimerase; InterPro: IPR005886 Synonym: UDP-galactose 4-epimerase UDP-glucose 4-epimerase (5.1.3.2 from EC) interconverts UDP-glucose and UDP-galactose which are precursors of glucose- and galactose-containing exopolysaccharides (EPS). A set of related proteins, some of which are tentatively identified as UDP-glucose-4-epimerase in Thermotoga maritima, Bacillus halodurans, and several archaea, but deeply branched from this set and lacking experimental evidence, are excluded and described by a separate model. ; GO: 0003978 UDP-glucose 4-epimerase activity, 0006012 galactose metabolic process. Probab=56.69 E-value=16 Score=17.33 Aligned_cols=241 Identities=18% Similarity=0.271 Sum_probs=132.7 Q ss_pred HHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCC-CCCHHHHHHHHHH----CCCCCEEEC-- Q ss_conf 989999988980999913778889847332247888876431013354566-6310255665531----364648982-- Q gi|254780653|r 44 PTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVN-DCIGSTLSQSIAS----LSEGGIILA-- 116 (400) Q Consensus 44 pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~-d~~g~~~~~~i~~----l~~g~ilLL-- 116 (400) -|.+.|+++|..|||+==|. .+-.-+.++++++.|+++.|+. |.-.....+++.. -.+=|-++- T Consensus 14 Ht~~~Ll~~G~ev~vlDNLs---------~G~~~~l~~~~~~~G~~~~fv~gDL~D~~~l~~~f~kqql~~~idAViHFA 84 (341) T TIGR01179 14 HTVRQLLESGYEVVVLDNLS---------NGSAEALKRGEEITGKEVTFVEGDLRDRELLRRVFEKQQLEHKIDAVIHFA 84 (341) T ss_pred HHHHHHHHCCCEEEEEECCC---------CCCHHHHCCCCEECCCCCEEEEECCHHHHHHHHHHHHHHHCCCCCEEEEEC T ss_conf 88788763597289981578---------884887500234148532058717515799999987743116754675201 Q ss_pred ----------CHHHHCCCCCCCHHHHHHHHCCCC-CEEEECCHHHHCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCC Q ss_conf ----------424303432100024665312436-424754404401137633302332100033255657776533101 Q gi|254780653|r 117 ----------ENVRFYSEEERNDPDFVRMLSRNG-DFYINDAFSVSHRAHASITGLSHLLPSYIGRAMQKELSMLESCFS 185 (400) Q Consensus 117 ----------EN~Rf~~~E~~n~~~f~k~La~la-DiyVnDAF~~aHR~haS~~gi~~~lps~aG~l~ekEi~~L~~~l~ 185 (400) +=+|+|.-=..|...|.+.....+ -=|| ||.+ |-++|.|.-.| .-|. ..+. T Consensus 85 g~~~VgESv~~Pl~YY~NNv~nTl~L~~~m~~~GV~~~i---FSSs----AaVYG~p~~~P-----i~E~------~pl~ 146 (341) T TIGR01179 85 GLIAVGESVQKPLKYYRNNVVNTLNLLEAMQETGVKKFI---FSSS----AAVYGEPESIP-----ISED------SPLG 146 (341) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEE---ECCC----CEEECCCCCCC-----CCCC------CCCC T ss_conf 121252557524544000468999999999981897415---3042----14507788555-----0222------5677 Q ss_pred CCCCCEEEEEEECCHHHHHHHHHHHHH---CCCEEEECHHHHHHHHHHH--------CCCCCCCCCCCCCC-HHHHHHHH Q ss_conf 667871899960780255788998773---0688886205789999983--------01434333310000-13568899 Q gi|254780653|r 186 ESKKPLVAIVGGSKVSTKITLLINLVK---KVDKLVIGGGMANSFLVAQ--------GMGVGRSLCQRDFS-DNVHQIAW 253 (400) Q Consensus 186 ~~~~P~v~IiGGaKisdKi~~i~~L~~---k~D~iiigG~lantfL~A~--------G~~IG~Sl~e~~~~-~~ak~i~~ 253 (400) .|..|+ |=+|. |.|.++. ++|.= .-.+.|++. ||.||..- .+.- ...+-+.+ T Consensus 147 ~PinPY----G~sKl-----M~E~iL~D~~~a~~~-----~~~v~LRYFNv~GA~p~GY~iGe~~--~~~tNhLip~~~~ 210 (341) T TIGR01179 147 DPINPY----GRSKL-----MVERILRDLSKADPD-----LSYVILRYFNVAGADPEGYEIGEDP--PGITNHLIPYACQ 210 (341) T ss_pred CCCCCC----CCCHH-----HHHHHHHHHHHHCCC-----CCEEEECCCCCCCCCCCCCCCCCCC--CCCCCCHHHHHHH T ss_conf 874866----55668-----899999999873876-----7799850578514488877236685--2029418999999 Q ss_pred HHHHCCCCC-CCEEEEECCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCHHHH Q ss_conf 875302455-4303541134323233331025321201334576520035677764057767999766253227436588 Q gi|254780653|r 254 EAKRSACEI-IVPRDVVVAREMKTGIPTQVVSAQSVPLDSIILDVGFKTVEYIKQVIAQARTVMWNGPLGVFEIEPFDRA 332 (400) Q Consensus 254 ~a~~~~~~I-~lP~D~~v~~~~~~~~~~~~~~~~~i~~~~~i~DIGp~Ti~~~~~~I~~AktI~WNGP~GvfE~~~F~~G 332 (400) -|.-+..++ ++=.|+-. .+-.-|-|-=-..|+-.-=+..+.-.-.+.++=+||==.| ..|| T Consensus 211 ~A~G~~~~l~IFGtDYPT------------~DGTcvRDYIHV~DLA~AH~~Al~~L~~g~~s~~~NlG~G----~G~S-- 272 (341) T TIGR01179 211 VAVGKRDKLTIFGTDYPT------------PDGTCVRDYIHVMDLAEAHLAALEYLENGGESHVYNLGYG----QGFS-- 272 (341) T ss_pred HHCCCCCCCEECCCCCCC------------CCCCEEEEEEECCCHHHHHHHHHHHHHCCCCEEEEECCCC----CCCC-- T ss_conf 844899731362487876------------7987653002002077789999999860796369862467----5410-- Q ss_pred HHHHHHHHHHHHC Q ss_conf 9999999986111 Q gi|254780653|r 333 TVEVAHYVAKLTK 345 (400) Q Consensus 333 T~~l~~~ia~~~~ 345 (400) =++++++..+.+. T Consensus 273 V~EVi~a~~~vsG 285 (341) T TIGR01179 273 VLEVIEAFKKVSG 285 (341) T ss_pred HHHHHHHHHHHCC T ss_conf 9999999866109 No 51 >PRK13776 formimidoylglutamase; Provisional Probab=56.46 E-value=14 Score=17.62 Aligned_cols=39 Identities=15% Similarity=0.174 Sum_probs=19.4 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCCCCCC-CCHHH--HHHHHHHH Q ss_conf 20035677764057767999766253227-43658--89999999 Q gi|254780653|r 298 GFKTVEYIKQVIAQARTVMWNGPLGVFEI-EPFDR--ATVEVAHY 339 (400) Q Consensus 298 Gp~Ti~~~~~~I~~AktI~WNGP~GvfE~-~~F~~--GT~~l~~~ 339 (400) -+..+..+.+.+..+..+. -|-+.|. |.|+. -|..++.- T Consensus 263 ~~~~~~~l~~~~~~~~kv~---~~DivEvnP~~D~d~~Ta~lAA~ 304 (311) T PRK13776 263 EMSVVEHLVRRAKASGKLR---LADIAELNPQLDSDQRTARIAAR 304 (311) T ss_pred CHHHHHHHHHHHHCCCCEE---EEEEEEECCCCCCCCCHHHHHHH T ss_conf 9999999999985159877---99989768888897719999999 No 52 >COG0066 LeuD 3-isopropylmalate dehydratase small subunit [Amino acid transport and metabolism] Probab=56.38 E-value=3.5 Score=21.60 Aligned_cols=55 Identities=31% Similarity=0.536 Sum_probs=39.2 Q ss_pred EECCHHHHCCCCCCCHHHHHHHHCCC--CCEEE-ECCHHH-HCCCCCCE----EECCCC-CCCCCCHHHH Q ss_conf 98242430343210002466531243--64247-544044-01137633----302332-1000332556 Q gi|254780653|r 114 ILAENVRFYSEEERNDPDFVRMLSRN--GDFYI-NDAFSV-SHRAHASI----TGLSHL-LPSYIGRAMQ 174 (400) Q Consensus 114 lLLEN~Rf~~~E~~n~~~f~k~La~l--aDiyV-nDAF~~-aHR~haS~----~gi~~~-lps~aG~l~e 174 (400) .+.+..||.+ |+|+.+...+ |||-| .+-||| |-|.||-. +||.-+ .||+|-.-.. T Consensus 41 ~~f~~~r~~~------PdF~~n~~~yq~g~IlVag~NFGcGSSREHApwALk~~Gi~~VIA~SFAdIFy~ 104 (191) T COG0066 41 HLFEDWRYLD------PDFVLNVPPYQGGDILVAGENFGCGSSREHAPWALKDYGIRAVIAPSFADIFYR 104 (191) T ss_pred CCCCCCCCCC------CCHHHCCCCCCCCCEEEECCCCCCCCCHHHHHHHHHHCCEEEEEECCHHHHHHH T ss_conf 1531010249------643215886678648996688778852778999999759249993629999870 No 53 >cd04299 GT1_Glycogen_Phosphorylase_like This family is most closely related to the oligosaccharide phosphorylase domain family and other unidentified sequences. Oligosaccharide phosphorylase catalyzes the breakdown of oligosaccharides into glucose-1-phosphate units. They are important allosteric enzymes in carbohydrate metabolism. The members of this family are found in bacteria and Archaea. Probab=55.51 E-value=11 Score=18.35 Aligned_cols=62 Identities=13% Similarity=0.301 Sum_probs=45.9 Q ss_pred CCCCCCEEECCCCCCCCC-CHHHHHHHHHHHHHCCCCCCCEEEEEEE-CCHHHH--HHHHHHHHHC Q ss_conf 113763330233210003-3255657776533101667871899960-780255--7889987730 Q gi|254780653|r 152 HRAHASITGLSHLLPSYI-GRAMQKELSMLESCFSESKKPLVAIVGG-SKVSTK--ITLLINLVKK 213 (400) Q Consensus 152 HR~haS~~gi~~~lps~a-G~l~ekEi~~L~~~l~~~~~P~v~IiGG-aKisdK--i~~i~~L~~k 213 (400) -...+=|.||++-...|= .-|+-+.++-|.+++.+|++|+-.|+.| |.-.|. ..+|+.+.+. T Consensus 474 ldp~~LTIGFARRfAtYKRa~Llf~D~eRL~~il~~~~rPVQiIfAGKAHP~D~~Gk~lI~~i~~~ 539 (778) T cd04299 474 LDPNVLTIGFARRFATYKRATLLLRDPERLKRLLNDPERPVQFIFAGKAHPADEPGKELIQEIVEF 539 (778) T ss_pred CCCCCCEEEEHHHCCCCCCHHHHCCCHHHHHHHHCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHH T ss_conf 686521341122056667602220899999998639997659999378899876679999999998 No 54 >cd04240 AAK_UC AAK_UC: Uncharacterized (UC) amino acid kinase-like proteins found mainly in archaea and a few bacteria. Sequences in this CD are members of the Amino Acid Kinase (AAK) superfamily. Probab=53.50 E-value=18 Score=17.00 Aligned_cols=171 Identities=16% Similarity=0.208 Sum_probs=96.0 Q ss_pred CCCCEEEEEEECCHHHHHHHHHHHHH----CCCEEEECHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCC Q ss_conf 67871899960780255788998773----06888862057899999830143433331000013568899875302455 Q gi|254780653|r 187 SKKPLVAIVGGSKVSTKITLLINLVK----KVDKLVIGGGMANSFLVAQGMGVGRSLCQRDFSDNVHQIAWEAKRSACEI 262 (400) Q Consensus 187 ~~~P~v~IiGGaKisdKi~~i~~L~~----k~D~iiigG~lantfL~A~G~~IG~Sl~e~~~~~~ak~i~~~a~~~~~~I 262 (400) ..++++.|.||.++.|.+.-+..-.. -++.+-+-...-|.++.+. +...+ ..+...++.+..++.+..| T Consensus 23 ~~~~v~IVpGGG~fAd~vR~~~~~~~~~d~~aH~mAi~Am~~~a~ll~~---~~~~~----~~~~~~~~~~~~~~~~~~v 95 (203) T cd04240 23 SGGGVVIVPGGGPFADVVRRYQERKGLSDAAAHWMAILAMEQYGYLLAD---LEPRL----VARTLAELTDVLERGKIAI 95 (203) T ss_pred CCCCEEEEECCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH---HCCHH----HCCCHHHHHHHHHCCCCEE T ss_conf 5997899979808999999999982998789999999999999999996---01310----1046477999973389158 Q ss_pred CCEEEEECCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHH-HHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 430354113432323333102532120133457652003567-7764057767999766253227436588999999998 Q gi|254780653|r 263 IVPRDVVVAREMKTGIPTQVVSAQSVPLDSIILDVGFKTVEY-IKQVIAQARTVMWNGPLGVFEIEPFDRATVEVAHYVA 341 (400) Q Consensus 263 ~lP~D~~v~~~~~~~~~~~~~~~~~i~~~~~i~DIGp~Ti~~-~~~~I~~AktI~WNGP~GvfE~~~F~~GT~~l~~~ia 341 (400) ++|.+.+... +.+|.+ +|+-..|+.. +.+.+...+-|+..-.-|+|... -++.+.++ T Consensus 96 ~lP~~~l~~~-------------~~lp~s---WdvtsDs~a~~~A~~l~a~~li~~~~vdGiy~~~------~kl~~~i~ 153 (203) T cd04240 96 LLPYRLLLDT-------------DPLPHS---WEVTSDSIAAWLAKKLGAKRLVIVTDVDGIYEKD------GKLVNEIA 153 (203) T ss_pred EECHHHHCCC-------------CCCCCC---CCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCC------HHHHHHCC T ss_conf 7115875158-------------988878---7766189999999975988589985677551875------05541158 Q ss_pred HHHCCCCCEEEECCCHHHHHHHHCCCCCCCCEEECCH--HHHHHHHCCCCCHH Q ss_conf 6111488589974804699899848876864687507--99899874899868 Q gi|254780653|r 342 KLTKERRIISIAGGGDTITALAHAGISNEFTYVSTAG--GAFLEWLEGKDLPG 392 (400) Q Consensus 342 ~~~~~~~~~sivGGGdT~aai~~~g~~~~~~hvSTgG--GA~L~~L~G~~LPg 392 (400) .... ...++|- .-....+.++++.- ||-.|. .-++.+|.|+.-+| T Consensus 154 ~~~l--~~~~~VD-~~~~~~~~~~~i~~---~VvnG~~p~~i~~~l~g~~~~G 200 (203) T cd04240 154 AAEL--LGETSVD-PAFPRLLTKYGIRC---YVVNGDDPERVLAALRGREGVG 200 (203) T ss_pred HHHH--CCCCCHH-HHHHHHHHHCCCEE---EEECCCCHHHHHHHHCCCCCCC T ss_conf 9886--4887410-77899999769639---9987998799999986898798 No 55 >PRK01710 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=52.03 E-value=19 Score=16.85 Aligned_cols=41 Identities=29% Similarity=0.433 Sum_probs=26.3 Q ss_pred CCCCCHHHHCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 8877101001599689998201671318577725899998898999998898099991 Q gi|254780653|r 3 RLRTMNDLRDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFS 60 (400) Q Consensus 3 ~~~ti~d~~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~S 60 (400) .|+.+.|+ ++||+|+| +-+ |. .-....++|.++|++|++.- T Consensus 4 ~~~~~~~~--l~~kkv~i-----~Gl--G~--------sG~a~a~~L~~~g~~v~~~D 44 (458) T PRK01710 4 DFNEFKDF--IKNKKVAV-----VGI--GV--------SNIPLIKFLVKLGAKVTAFD 44 (458) T ss_pred HHHHHHHH--HCCCEEEE-----EEE--CH--------HHHHHHHHHHHCCCEEEEEE T ss_conf 48999999--78996999-----978--78--------89999999997889799998 No 56 >TIGR00705 SppA_67K signal peptide peptidase SppA, 67K type; InterPro: IPR004634 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of serine peptidases belong to MEROPS peptidase family S49 (protease IV family, clan S-). The predicted active site serine for members of this family occurs in a transmembrane domain. Signal peptides of secretory proteins seem to serve at least two important biological functions. First, they are required for protein targeting to and translocation across membranes, such as the eubacterial plasma membrane and the endoplasmic reticular membrane of eukaryotes. Second, in addition to their role as determinants for protein targeting and translocation, certain signal peptides have a signalling function. During or shortly after pre-protein translocation, the signal peptide is removed by signal peptidases. The integral membrane protein, SppA (protease IV), of Escherichia coli was shown experimentally to degrade signal peptides. The member of this family from Bacillus subtilis has only been shown to be required for efficient processing of pre-proteins under conditions of hyper-secretion . These enzymes have a molecular mass around 67 kDa and a duplication such that the N-terminal half shares extensive homology with the C-terminal half and was shown in E. coli to form homotetramers. E. coli SohB, which is most closely homologous to the C-terminal duplication of SppA, is predicted to perform a similar function of small peptide degradation, but in the periplasm. Many prokaryotes have a single SppA/SohB homolog that may perform the function of either or both. ; GO: 0009003 signal peptidase activity, 0006465 signal peptide processing, 0016021 integral to membrane. Probab=51.78 E-value=19 Score=16.83 Aligned_cols=51 Identities=16% Similarity=0.128 Sum_probs=38.1 Q ss_pred CCCCEEEEECCCCCCCCCCHH-HHHHHHHHHHHHHHCCCCCEEEE-----CCCHHHH Q ss_conf 577679997662532274365-88999999998611148858997-----4804699 Q gi|254780653|r 310 AQARTVMWNGPLGVFEIEPFD-RATVEVAHYVAKLTKERRIISIA-----GGGDTIT 360 (400) Q Consensus 310 ~~AktI~WNGP~GvfE~~~F~-~GT~~l~~~ia~~~~~~~~~siv-----GGGdT~a 360 (400) .+-..|+.+||.=-=|...-- .|.-.+++.+-.+..+.+...|| +||.-.| T Consensus 320 ~~iaiv~~~G~I~~G~~t~~nG~ggD~~a~lLr~a~~D~~iKAvvLRinSPGGsv~A 376 (614) T TIGR00705 320 DKIAIVYLEGSIVDGRDTEENGVGGDTVAALLRKARSDPDIKAVVLRINSPGGSVFA 376 (614) T ss_pred CCEEEEEEECCEECCCCCCCCCCHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCEEH T ss_conf 816999700642357756678600367999999870799812899886389863428 No 57 >PRK05057 aroK shikimate kinase I; Reviewed Probab=51.01 E-value=17 Score=17.20 Aligned_cols=36 Identities=11% Similarity=0.179 Sum_probs=21.9 Q ss_pred CCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 898099991377888984733224788887643101335456663 Q gi|254780653|r 52 KKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDC 96 (400) Q Consensus 52 ~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~ 96 (400) .+..|+|+...|- || .+ +.+.|++.|+.+..=.|+. T Consensus 3 ~kknI~LiG~mGs--GK----st---vgk~LA~~l~~~fiD~D~~ 38 (172) T PRK05057 3 EKRNIFLVGPMGA--GK----ST---IGRQLAQQLNMEFYDSDQE 38 (172) T ss_pred CCCCEEEECCCCC--CH----HH---HHHHHHHHHCCCEEECHHH T ss_conf 8882898899999--88----99---9999999969996878099 No 58 >COG4100 Cystathionine beta-lyase family protein involved in aluminum resistance [Inorganic ion transport and metabolism] Probab=49.40 E-value=19 Score=16.90 Aligned_cols=80 Identities=20% Similarity=0.392 Sum_probs=38.7 Q ss_pred CCCCEEEEEEECC-----CCCC-CCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCC-CCCCCCCHHHHHHHHHHH Q ss_conf 5996899982016-----7131-85777258999988989999988980999913778889-847332247888876431 Q gi|254780653|r 13 IRGLRCLLRVDWN-----VPFI-DGKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQS-KSDKDCSLFKVVSIAESI 85 (400) Q Consensus 13 ~~gk~VlvRvD~N-----vPi~-~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg-~~~~~~Sl~~v~~~l~~~ 85 (400) ++|+.+---.||+ ||+. +|+|- ++|++.-+...-|+|.+- |.+| ...+.+++..+.+...-. T Consensus 127 ~rg~~~gSL~dfgi~Y~~v~Lt~~gkiD--------~~~v~~~i~~~tkli~IQ---RS~GY~~RpS~~I~eI~~~i~~v 195 (416) T COG4100 127 LRGEGQGSLKDFGIKYKAVPLTADGKID--------IQAVKTAISDRTKLIGIQ---RSKGYAWRPSLSIAEIEEMITFV 195 (416) T ss_pred CCCCCCCCHHHHCCCEEECCCCCCCCCC--------HHHHHHHCCCCCEEEEEE---ECCCCCCCCCCCHHHHHHHHHHH T ss_conf 2777854278738550021356688233--------899987508552599997---32576778864299999999999 Q ss_pred ---CCCCCCCCCCCCHHHHHH Q ss_conf ---013354566631025566 Q gi|254780653|r 86 ---LHKNILFVNDCIGSTLSQ 103 (400) Q Consensus 86 ---l~~~V~fi~d~~g~~~~~ 103 (400) -..-|.|+|.|.|+.+++ T Consensus 196 k~inpn~ivFVDNCYGEFvE~ 216 (416) T COG4100 196 KEINPNVIVFVDNCYGEFVEE 216 (416) T ss_pred HHCCCCEEEEEECCCHHHHHC T ss_conf 844997799980563655411 No 59 >TIGR01678 FAD_lactone_ox sugar 1,4-lactone oxidases; InterPro: IPR010031 This entry identifies a family of at least two different sugar 1,4 lactone oxidases, both involved in synthesising ascorbic acid or a derivative. These include L-gulonolactone oxidase (1.1.3.8 from EC) from rat and D-arabinono-1,4-lactone oxidase (1.1.3.37 from EC) from Saccharomyces cerevisiae. Members are proposed to have the cofactor FAD covalently bound at a site specified by IPR006093 from INTERPRO.; GO: 0016899 oxidoreductase activity acting on the CH-OH group of donors oxygen as acceptor, 0009058 biosynthetic process. Probab=48.66 E-value=18 Score=16.99 Aligned_cols=69 Identities=19% Similarity=0.091 Sum_probs=31.6 Q ss_pred HHHHHHCCCCCCCEEEEECCCCCCCCCC-----CCCCCCCEECC--CCCCCCCCCHHHHHHHHHHCCC---CEEEEECCC Q ss_conf 9987530245543035411343232333-----31025321201--3345765200356777640577---679997662 Q gi|254780653|r 252 AWEAKRSACEIIVPRDVVVAREMKTGIP-----TQVVSAQSVPL--DSIILDVGFKTVEYIKQVIAQA---RTVMWNGPL 321 (400) Q Consensus 252 ~~~a~~~~~~I~lP~D~~v~~~~~~~~~-----~~~~~~~~i~~--~~~i~DIGp~Ti~~~~~~I~~A---ktI~WNGP~ 321 (400) .+.+.+.+.=.++|+.|-|....-.+.+ +...+..-+-. -...+|--|.-+. ++...+.. =|.+.|--| T Consensus 337 ~~~a~~~~~y~H~P~EVR~~~~T~~dePldtskrtntsPGPvyGnvCrPfldntPsh~~-lsp~~~v~nsqltcYinai~ 415 (505) T TIGR01678 337 EDDAKKKEVYVHYPVEVRVSRGTLEDEPLDTSKRTNTSPGPVYGNVCRPFLDNTPSHCL-LSPLFKVDNSQLTCYINAIM 415 (505) T ss_pred HHHHHCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCEECCCCCCHHCCCCCEEE-ECCCCCCCCCCEEEEEEEEE T ss_conf 83032230289731489624866886656422125688985003534400117875167-63334577431368898888 No 60 >COG1518 CRISPR-associated protein Cas1 [Defense mechanisms] Probab=47.71 E-value=7.5 Score=19.46 Aligned_cols=188 Identities=13% Similarity=0.130 Sum_probs=94.0 Q ss_pred CCCCEEEEEE----ECCCCCCC-CEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCC Q ss_conf 5996899982----01671318-577725899998898999998898099991377888984733224788887643101 Q gi|254780653|r 13 IRGLRCLLRV----DWNVPFID-GKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILH 87 (400) Q Consensus 13 ~~gk~VlvRv----D~NvPi~~-g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~ 87 (400) +++.++.+-- =.++|+.+ ..|.=-++..-+-+-++++.++|--|+...+.|+|-|.+-+..+.......+.+... T Consensus 17 ~~~~~l~~~~~~~~~~~ipi~~v~~I~i~~~~siSs~av~~la~~gI~i~f~~~~G~~~g~~~p~~~~~~~~~~~~Q~~~ 96 (327) T COG1518 17 RKDNTLVIKNKEGGKRKIPLEEVDGIVLFGGTSISSAALRLLAKRGIPIVFFDQYGFYLGRLSPREGLGSGLLRLKQYEA 96 (327) T ss_pred EECCEEEEEECCCCEEEECHHHEEEEEEECCCCCCHHHHHHHHHCCCEEEEECCCCCEEEEEECCCCCCCHHHHHHHHHH T ss_conf 97898999976884687575572689994687407999999997698699988998587998064346639999999999 Q ss_pred CCCCCCCC-----CCHHHHHHHHHHCCCCCEEECCHHHHCCCCCCCHHHHHHHHCCCCCEEEECCHHHHCCCCCCEEECC Q ss_conf 33545666-----3102556655313646489824243034321000246653124364247544044011376333023 Q gi|254780653|r 88 KNILFVND-----CIGSTLSQSIASLSEGGIILAENVRFYSEEERNDPDFVRMLSRNGDFYINDAFSVSHRAHASITGLS 162 (400) Q Consensus 88 ~~V~fi~d-----~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~~n~~~f~k~La~laDiyVnDAF~~aHR~haS~~gi~ 162 (400) +.++ .-.+-+..++.+ |..++|.+..+.+-....+..|..+.. T Consensus 97 ----~~~~~krl~lAk~~i~~ki~N-------~~~~~r~~~~~~~~~~~~~~~l~~l~~--------------------- 144 (327) T COG1518 97 ----YLDDEKRLELAKKFIEGKILN-------LQRFLRRYEGRIEELDEISDYLERLAN--------------------- 144 (327) T ss_pred ----HCCHHHHHHHHHHHHHHHHHH-------HHHHHHHHHCCCCHHHHHHHHHHHHHC--------------------- T ss_conf ----726678899999999999862-------999999853022114689999998651--------------------- Q ss_pred CCCCCCCCHHHHHHHH-------HHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCCEEEECHHHHHHHHHHHCCCC Q ss_conf 3210003325565777-------653310166787189996078025578899877306888862057899999830143 Q gi|254780653|r 163 HLLPSYIGRAMQKELS-------MLESCFSESKKPLVAIVGGSKVSTKITLLINLVKKVDKLVIGGGMANSFLVAQGMGV 235 (400) Q Consensus 163 ~~lps~aG~l~ekEi~-------~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~D~iiigG~lantfL~A~G~~I 235 (400) ....+.+|..|=+ ++..++- +. -.+-|-.+ -+..+.++.+++-.-.++-+ ...+.+...|.+= T Consensus 145 ---~~~~~~l~g~Eg~aa~~Yy~~~~~~~~--~~--~~f~~R~r-rpp~D~vNa~Ls~GyslLy~--~v~~ai~~~GL~P 214 (327) T COG1518 145 ---ANSIEELMGIEGAAAKLYYKALDELLP--KE--FGFEGRNR-RPPLDPVNALLSYGYSLLYA--TVLSAIVITGLDP 214 (327) T ss_pred ---CCCHHHHHHHHHHHHHHHHHHHHHHCC--CC--CCCCCCCC-CCCCCHHHHHHHHHHHHHHH--HHHHHHHHHCCCC T ss_conf ---576577878999999999987766454--32--46577777-99858889999778999999--9999999827763 Q ss_pred CCCCCCC Q ss_conf 4333310 Q gi|254780653|r 236 GRSLCQR 242 (400) Q Consensus 236 G~Sl~e~ 242 (400) .-+...+ T Consensus 215 ~iG~lH~ 221 (327) T COG1518 215 YIGFLHE 221 (327) T ss_pred CCCCCCC T ss_conf 0222047 No 61 >pfam02836 Glyco_hydro_2_C Glycosyl hydrolases family 2, TIM barrel domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities. Probab=46.58 E-value=23 Score=16.31 Aligned_cols=146 Identities=15% Similarity=0.160 Sum_probs=75.9 Q ss_pred CCCCCEEEEEEEC---CCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCC Q ss_conf 1599689998201---6713185777258999988989999988980999913778889847332247888876431013 Q gi|254780653|r 12 DIRGLRCLLRVDW---NVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHK 88 (400) Q Consensus 12 d~~gk~VlvRvD~---NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~ 88 (400) =|+||.|.+|-=- --| ..|...++..+++ -|+.+-+.|+..|=.||- |. ++. +--.|+.+.=++ T Consensus 9 ~lNGkp~~l~Gvnrh~~~~-~~G~a~~~~~~~~---Di~l~K~~g~N~vR~sHy--p~---~~~--~~~~cD~~Gl~V-- 75 (297) T pfam02836 9 LLNGKPLYFRGVNRHEDHP-RRGRAFDEDDMVK---DIQLMKQLNINAVRTSHY--PN---APE--WYQLCDEYGLYV-- 75 (297) T ss_pred EECCEEEEEEEEECCCCCC-CCCCCCCHHHHHH---HHHHHHHCCCCEEEECCC--CC---CHH--HHHHHHHCCCEE-- T ss_conf 9999998875147575786-5663189999999---999999769998981589--99---999--999998779099-- Q ss_pred CCCCCCCCCHHHHHHHHHHCCCCCEEECCHHHHCCCCCCCHHHHHHHHCCCCCEEEECCHHHHCCCCCCEEECCCCCCCC Q ss_conf 35456663102556655313646489824243034321000246653124364247544044011376333023321000 Q gi|254780653|r 89 NILFVNDCIGSTLSQSIASLSEGGIILAENVRFYSEEERNDPDFVRMLSRNGDFYINDAFSVSHRAHASITGLSHLLPSY 168 (400) Q Consensus 89 ~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~~n~~~f~k~La~laDiyVnDAF~~aHR~haS~~gi~~~lps~ 168 (400) +++.-.+.. |. -.-..+.......+++|.+....-. -..=.-+|+|||++.-.-.-++. T Consensus 76 ----~~E~~~~~~---------~~---~~~~~~~~~~~~~~p~~~~~~~~~~-----~emv~rd~NHPSIi~Ws~~NE~~ 134 (297) T pfam02836 76 ----IDETNLETH---------GL---WQKFGNSWEPSWNNPEWLKAHLQRA-----REMVQRDKNHPSIIIWSLGNESG 134 (297) T ss_pred ----EEECCCCCC---------CC---HHHCCCCCCCCCCCHHHHHHHHHHH-----HHHHHHCCCCCEEEEEECCCCCC T ss_conf ----993574433---------40---4332665665567989999999999-----99998636997289983575777 Q ss_pred CCHHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 3325565777653310166787189 Q gi|254780653|r 169 IGRAMQKELSMLESCFSESKKPLVA 193 (400) Q Consensus 169 aG~l~ekEi~~L~~~l~~~~~P~v~ 193 (400) .|..+++ +..+-+.+ +|.||+.. T Consensus 135 ~~~~~~~-~~~~~k~~-DptRpv~~ 157 (297) T pfam02836 135 AGENIKA-MYKATKEL-DPTRPVHY 157 (297) T ss_pred CCHHHHH-HHHHHHHH-CCCCCEEE T ss_conf 2178999-99999976-89997773 No 62 >cd03280 ABC_MutS2 MutS2 homologs in bacteria and eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family also possess a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, a Probab=45.91 E-value=18 Score=17.04 Aligned_cols=34 Identities=3% Similarity=0.171 Sum_probs=17.2 Q ss_pred HHHHHHHHHHHHCCCCCCCEEEEE----EECCHHHHHH Q ss_conf 556577765331016678718999----6078025578 Q gi|254780653|r 172 AMQKELSMLESCFSESKKPLVAIV----GGSKVSTKIT 205 (400) Q Consensus 172 l~ekEi~~L~~~l~~~~~P~v~Ii----GGaKisdKi~ 205 (400) .|..|+..+..+++......++++ -|....|.+. T Consensus 91 ~F~~E~~~~~~il~~~~~~sLvliDE~~~gTn~~eg~~ 128 (200) T cd03280 91 TFSSHMKNIARILQHADPDSLVLLDELGSGTDPVEGAA 128 (200) T ss_pred HHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHH T ss_conf 99999999999998588888797556668988789999 No 63 >TIGR02076 pyrH_arch uridylate kinase, putative; InterPro: IPR011818 Uridylate kinases (also known as UMP kinases) are key enzymes in the synthesis of nucleoside triphosphates. They catalyse the reversible transfer of the gamma-phosphoryl group from an ATP donor to UMP, yielding UDP, which is the starting point for the synthesis of all other pyrimidine nucleotides. The eukaryotic enzyme has a dual specificity, phosphorylating both UMP and CMP, while the bacterial enzyme is specific to UMP. The bacterial enzyme shows no sequence similarity to the eukaryotic enzyme or other nucleoside monophosphate kinases, but rather appears to be part of the amino acid kinase family. It is dependent on magnesium for activity and is activated by GTP and repressed by UTP , . In many bacterial genomes, the gene tends to be located immediately downstream of elongation factor T and upstream of ribosome recycling factor. A related protein family, believed to be equivalent in function is found in the archaea and in spirochetes. Structurally, the bacterial and archaeal proteins are homohexamers centred around a hollow nucleus and organised as a trimer of dimers , . Each monomer within the protein forms the amino acid kinase fold and can be divided into an N-terminal region which binds UMP and mediates intersubunit interactions within the dimer, and a C-terminal region which binds ATP and contains a mobile loop covering the active site. Inhibition of enzyme activity by UTP appears to be due to competition for the binding site for UMP, not allosteric inhibition as was previously suspected. This entry represents the archaeal and spirochete proteins.; GO: 0009041 uridylate kinase activity, 0006221 pyrimidine nucleotide biosynthetic process. Probab=45.71 E-value=24 Score=16.23 Aligned_cols=32 Identities=28% Similarity=0.387 Sum_probs=19.2 Q ss_pred HHHHHCCC-CCCCEEEEEEECCHH-HHHHHHHHH Q ss_conf 65331016-678718999607802-557889987 Q gi|254780653|r 179 MLESCFSE-SKKPLVAIVGGSKVS-TKITLLINL 210 (400) Q Consensus 179 ~L~~~l~~-~~~P~v~IiGGaKis-dKi~~i~~L 210 (400) .|.++..+ .++-..+|.||.|+. +.|.+.+.| T Consensus 26 ~l~~~~~~Gek~~v~vVvGGG~~AR~YI~~aR~l 59 (232) T TIGR02076 26 ILRKLSDEGEKHKVGVVVGGGKTAREYIGVAREL 59 (232) T ss_pred HHHHHHHCCCCEEEEEEECCCHHHHHHHHHHHHH T ss_conf 9999984499468998878876588999999860 No 64 >cd04256 AAK_P5CS_ProBA AAK_P5CS_ProBA: Glutamate-5-kinase (G5K) domain of the bifunctional delta 1-pyrroline-5-carboxylate synthetase (P5CS), composed of an N-terminal G5K (ProB) and a C-terminal glutamyl 5- phosphate reductase (G5PR, ProA), the first and second enzyme catalyzing proline (and, in mammals, ornithine) biosynthesis. G5K transfers the terminal phosphoryl group of ATP to the gamma-carboxyl group of glutamate, and is subject to feedback allosteric inhibition by proline or ornithine. In plants, proline plays an important role as an osmoprotectant and, in mammals, ornithine biosynthesis is crucial for proper ammonia detoxification, since a G5K mutation has been shown to cause human hyperammonaemia. Probab=45.31 E-value=24 Score=16.19 Aligned_cols=50 Identities=16% Similarity=0.199 Sum_probs=40.4 Q ss_pred CCCEEEEEEECCCCCC-CCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 9968999820167131-8577725899998898999998898099991377 Q gi|254780653|r 14 RGLRCLLRVDWNVPFI-DGKVADVTRIERVVPTILELVEKKAKVVIFSHLG 63 (400) Q Consensus 14 ~gk~VlvRvD~NvPi~-~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~G 63 (400) +-|||.|.+-=++=.. ++.-.+..||......|..|.++|..|||.||=+ T Consensus 7 ~~krIVIKiGss~lt~~d~~~~~~~~i~~l~~~I~~L~~~G~evvlVsSGA 57 (284) T cd04256 7 HAKRIVVKLGSAVVTREDECGLALGRLASIVEQVSELQSQGREVILVTSGA 57 (284) T ss_pred CCCEEEEEECHHHEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCH T ss_conf 487899995702237999997589999999999999997899699990684 No 65 >PRK05429 gamma-glutamyl kinase; Provisional Probab=45.17 E-value=24 Score=16.17 Aligned_cols=211 Identities=20% Similarity=0.300 Sum_probs=99.8 Q ss_pred CCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEEC---------CCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 996899982016713185777258999988989999988980999913---------77888984733224788887643 Q gi|254780653|r 14 RGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFSH---------LGRPQSKSDKDCSLFKVVSIAES 84 (400) Q Consensus 14 ~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH---------~GRPkg~~~~~~Sl~~v~~~l~~ 84 (400) +-||+.|.+-=++=..+..-.|..||......|..|.++|..|||.|. +|.++. . ...+.++.+....+ T Consensus 7 ~~krIVIKiGSs~Lt~~~g~l~~~~i~~l~~~I~~L~~~G~evilVSSGAIa~G~~~L~~~~r-p-~~l~~~QA~AAvGQ 84 (372) T PRK05429 7 DARRIVVKVGSSLLTDDGGGLDRARIAELARQIAALRAAGHEVVLVSSGAVAAGRSRLGLPKR-P-KTLAEKQAAAAVGQ 84 (372) T ss_pred CCCEEEEECCCCEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHCCCCCC-C-CCHHHHHHHHHHCH T ss_conf 287899991743368988888899999999999999978999999884079862776099988-5-63778889987156 Q ss_pred ---------HC---CC---CCCCCCCCCH-----HHHHHHHHHCCC-CCE-EECCHHHHCCCCC--CCHHHHHHHHCCC- Q ss_conf ---------10---13---3545666310-----255665531364-648-9824243034321--0002466531243- Q gi|254780653|r 85 ---------IL---HK---NILFVNDCIG-----STLSQSIASLSE-GGI-ILAENVRFYSEEE--RNDPDFVRMLSRN- 139 (400) Q Consensus 85 ---------~l---~~---~V~fi~d~~g-----~~~~~~i~~l~~-g~i-lLLEN~Rf~~~E~--~n~~~f~k~La~l- 139 (400) .+ +. +|..-.+-+. ..+++.+..|-. |=| +.=||=---.+|. ..+..++-..|.+ T Consensus 85 ~~Lm~~Y~~~f~~~~~~~aQiLlT~~D~~~r~rylN~r~tl~~Ll~~gvIPIINENDtVat~Ei~fGDND~LaA~VA~li 164 (372) T PRK05429 85 SRLMQAYEELFARYGITVAQILLTRDDLSDRERYLNARATLRTLLELGVVPIINENDTVATDEIKFGDNDTLSALVANLV 164 (372) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCHHHHCCHHHHHHHHHHHHHHHHCCCEEECCCCCCCEECCCCCCCCCHHHHHHHHHH T ss_conf 99999999999863985578862665640458999999999999877960001477740005555578515882687773 Q ss_pred -CCEEEE----CC-HHHHCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHC Q ss_conf -642475----44-044011376333023321000332556577765331016678718999607802557889987730 Q gi|254780653|r 140 -GDFYIN----DA-FSVSHRAHASITGLSHLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKK 213 (400) Q Consensus 140 -aDiyVn----DA-F~~aHR~haS~~gi~~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k 213 (400) ||..|. |. |..--|.++.- +..|..- -+..|+..+. ..... -+|-.-+.+||...+..... T Consensus 165 ~ADlLiiLTdvdGly~~~P~~~~~A----~~I~~v~--~i~~~i~~~~---~~~~s----~~GtGGM~tKl~AA~~a~~~ 231 (372) T PRK05429 165 EADLLILLTDIDGLYTADPRKNPDA----KLIPEVE--AITDELEAMA---GGAGS----GLGTGGMATKLEAARIATRA 231 (372) T ss_pred CCCEEEEEECCCCCCCCCCCCCCCC----CEEEEEC--CCCHHHHHHH---CCCCC----CCCCCCHHHHHHHHHHHHHC T ss_conf 8877999745675335899768654----0441205--7877899871---68877----88778437799999999977 Q ss_pred -CCEEEECHHHHHHHHHHH-CCCCCCCC Q ss_conf -688886205789999983-01434333 Q gi|254780653|r 214 -VDKLVIGGGMANSFLVAQ-GMGVGRSL 239 (400) Q Consensus 214 -~D~iiigG~lantfL~A~-G~~IG~Sl 239 (400) ++.+|.-|---+.+.... |-++|.-+ T Consensus 232 Gi~~iIa~G~~~~~l~~i~~g~~~GT~f 259 (372) T PRK05429 232 GIPVVIASGRKPDVLLRLLAGEAVGTLF 259 (372) T ss_pred CCCEEECCCCCCCHHHHHHCCCCCEEEE T ss_conf 9989992499965899997799726999 No 66 >PRK09954 hypothetical protein; Provisional Probab=44.95 E-value=24 Score=16.15 Aligned_cols=29 Identities=17% Similarity=0.291 Sum_probs=16.7 Q ss_pred HHHHHHHHCCCCCCCCC----CCCHHHHHHHHHH Q ss_conf 88876431013354566----6310255665531 Q gi|254780653|r 78 VVSIAESILHKNILFVN----DCIGSTLSQSIAS 107 (400) Q Consensus 78 v~~~l~~~l~~~V~fi~----d~~g~~~~~~i~~ 107 (400) +|.-|.+ ||.+|.|+. |..|+...+..++ T Consensus 99 iA~nLar-LG~~v~lIs~VG~D~~G~~ll~~l~~ 131 (362) T PRK09954 99 IAHNLAL-LGRDVHLLSAIGDDFYGETLLEETRR 131 (362) T ss_pred HHHHHHH-CCCCEEEEEEECCCHHHHHHHHHHHH T ss_conf 9999997-69976999996385879999999998 No 67 >PRK08339 short chain dehydrogenase; Provisional Probab=44.67 E-value=25 Score=16.12 Aligned_cols=67 Identities=19% Similarity=0.226 Sum_probs=40.5 Q ss_pred CCCCCCCHHHHCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHH Q ss_conf 98887710100159968999820167131857772589999889899999889809999137788898473322478888 Q gi|254780653|r 1 MSRLRTMNDLRDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVS 80 (400) Q Consensus 1 M~~~~ti~d~~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~ 80 (400) ||+| ||+||++||- -- .+-|= ..+-+.|.+.|++|++.+ |. .-.+...++ T Consensus 1 mm~l-------~L~gK~alIT-------G~-----s~GIG--~aiA~~la~~Ga~V~i~~---r~------~~~l~~~~~ 50 (263) T PRK08339 1 MLKI-------DLSGKLAFTT-------AS-----SKGIG--FGVARVLARAGADVIILS---RN------EENLKRAKE 50 (263) T ss_pred CCCC-------CCCCCEEEEE-------CC-----CCHHH--HHHHHHHHHCCCEEEEEE---CC------HHHHHHHHH T ss_conf 9741-------7899989991-------62-----60999--999999998699999997---98------899999999 Q ss_pred HHHHHCCCCCCCCC-CCC Q ss_conf 76431013354566-631 Q gi|254780653|r 81 IAESILHKNILFVN-DCI 97 (400) Q Consensus 81 ~l~~~l~~~V~fi~-d~~ 97 (400) .+.+.-+.++.++. |.. T Consensus 51 ~l~~~~~~~~~~~~~D~~ 68 (263) T PRK08339 51 KIKSESDVEVHYIVADLT 68 (263) T ss_pred HHHHHCCCCEEEEECCCC T ss_conf 998504985799984899 No 68 >PRK13946 shikimate kinase; Provisional Probab=44.46 E-value=25 Score=16.10 Aligned_cols=32 Identities=25% Similarity=0.363 Sum_probs=20.5 Q ss_pred CCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCC Q ss_conf 8980999913778889847332247888876431013354566 Q gi|254780653|r 52 KKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVN 94 (400) Q Consensus 52 ~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~ 94 (400) ++-.|+|+...|- || .+ +.+.|++.|+.+ |+| T Consensus 19 ~kknIvLIG~mGs--GK----St---vGk~LA~~L~~~--fiD 50 (195) T PRK13946 19 GKRTVVLVGLMGA--GK----ST---VGRRLATMLGLP--FLD 50 (195) T ss_pred CCCCEEEECCCCC--CH----HH---HHHHHHHHHCCC--EEE T ss_conf 8995899899999--88----99---999999997979--898 No 69 >TIGR00099 Cof-subfamily Cof-like hydrolase; InterPro: IPR000150 The Haloacid Dehydrogenase (HAD) superfamily includes phosphatases, phosphonatases, P-type ATPases, beta-phosphoglucomutases, phosphomannomutases, and dehalogenases, which are involved in a variety of cellular processes ranging from amino acid biosynthesis to detoxification . Proteins in this entry are mostly uncharacterised, though they form a distinct subgroup within the HAD superfamily. Members are found almost exclusively in bacteria and many species contain several paralogs, for example Escherichia coli contains a total of six proteins from this entry. Sequence similarities suggest that these enzymes are phosphatases which work on phosphorylated sugars. The YbiV (P75792 from SWISSPROT) protein from E. coli has been experimentally characterised . This enzyme catalyzes the hydrolysis of sugar phosphate to sugar and inorganic phosphate. It has a wide substrate specificity, catalyzing the hydrolysis of ribose-5-phosphate and glucose-6-phosphate most efficiently, but it is not known if these are the real substrates in vivo. The protein appears to be a monomer that contains two domains, an alpha-beta hydrolase domain that forms a Rossman fold, and an alpha-beta domain. The active site is found in a negatively charged cavity found at the interface between the two domains. ; GO: 0016787 hydrolase activity, 0008152 metabolic process. Probab=44.30 E-value=25 Score=16.09 Aligned_cols=49 Identities=27% Similarity=0.335 Sum_probs=31.0 Q ss_pred CCEECC--HHHHH-HHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHH Q ss_conf 857772--58999-988989999988980999913778889847332247888876 Q gi|254780653|r 30 DGKVAD--VTRIE-RVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIA 82 (400) Q Consensus 30 ~g~I~d--~~RI~-~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l 82 (400) ||+.++ +.+|. +++.+|+.+.++|-+++|.| |||--. ....+.++...+ T Consensus 7 DGTLL~~~~~~i~~~~~~al~~l~~~G~~~~laT--GR~~~g--~~~~~~~~~~~~ 58 (270) T TIGR00099 7 DGTLLNKDDKEISPSTKEALAKLREKGIKVVLAT--GRPLAG--VYKELKEILKEL 58 (270) T ss_pred CCEEECCCCCCCCHHHHHHHHHHHHCCCEEEEEE--CCCCCC--HHHHHHHHHHHH T ss_conf 8503559987518889999999996698899981--886434--789999988850 No 70 >PTZ00063 histone deacetylase; Provisional Probab=43.94 E-value=25 Score=16.05 Aligned_cols=18 Identities=11% Similarity=0.242 Sum_probs=8.4 Q ss_pred HHHHHHHHHHHCCCEEEE Q ss_conf 988989999988980999 Q gi|254780653|r 41 RVVPTILELVEKKAKVVI 58 (400) Q Consensus 41 ~~~pTI~~l~~~~akvii 58 (400) +++..-..|.+..+.+.+ T Consensus 114 gSl~AA~~L~~g~adiai 131 (439) T PTZ00063 114 ASIDGAHKLNNQQADICV 131 (439) T ss_pred HHHHHHHHHHCCCCCEEE T ss_conf 599999998556867898 No 71 >COG0419 SbcC ATPase involved in DNA repair [DNA replication, recombination, and repair] Probab=43.59 E-value=26 Score=16.02 Aligned_cols=11 Identities=18% Similarity=0.588 Sum_probs=5.3 Q ss_pred CEEEEECCCCC Q ss_conf 67999766253 Q gi|254780653|r 313 RTVMWNGPLGV 323 (400) Q Consensus 313 ktI~WNGP~Gv 323 (400) ...|.--|.|. T Consensus 843 ~~l~LDEpf~~ 853 (908) T COG0419 843 ELLFLDEPFGT 853 (908) T ss_pred CEEEEECCCCC T ss_conf 82334289887 No 72 >TIGR01317 GOGAT_sm_gam glutamate synthases, NADH/NADPH, small subunit; InterPro: IPR006005 One pathway for the assimilation of ammonia and glutamate biosynthesis involves glutamate synthase (1.4.1.13 from EC) which transfers the amide group of glutamine to 2-oxoglutarate to yield two molecules of glutamate.2 L-glutamate + NADP^+ = L-glutamine + 2-oxoglutarate + NADPH + H^+. This describes a family in several archaeal and deeply branched bacterial lineages of a homotetrameric form of the NADPH-dependent or NADH-dependent glutamate synthase (1.4.1.13 from EC and 1.4.1.14 from EC respectively) small subunit. There is no corresponding large subunit. ; GO: 0016639 oxidoreductase activity acting on the CH-NH2 group of donors NAD or NADP as acceptor, 0006537 glutamate biosynthetic process. Probab=43.26 E-value=11 Score=18.41 Aligned_cols=79 Identities=24% Similarity=0.393 Sum_probs=43.0 Q ss_pred HHHHHHHHCCC----C----CC--CEEEEEEECCHHHHHHHHHHHH---------HCCCEEEECHHHHHHHHHHHCCCCC Q ss_conf 77765331016----6----78--7189996078025578899877---------3068888620578999998301434 Q gi|254780653|r 176 ELSMLESCFSE----S----KK--PLVAIVGGSKVSTKITLLINLV---------KKVDKLVIGGGMANSFLVAQGMGVG 236 (400) Q Consensus 176 Ei~~L~~~l~~----~----~~--P~v~IiGGaKisdKi~~i~~L~---------~k~D~iiigG~lantfL~A~G~~IG 236 (400) |+..+++..++ | +| -.|||+|=- .-=|-.-.+|. +|.|++ ||.|- |-|. T Consensus 128 E~~IIdkgf~EGWv~P~PP~krTGkkVAVVGSG--PAGLAaA~qLnrAGH~VTVfER~DR~--GGLL~--------YGIP 195 (517) T TIGR01317 128 ELEIIDKGFQEGWVQPRPPKKRTGKKVAVVGSG--PAGLAAADQLNRAGHTVTVFEREDRV--GGLLR--------YGIP 195 (517) T ss_pred HHHHHHHHHCCCCEECCCCCCCCCCEEEEECCC--HHHHHHHHHHHHCCCCEEEEECCCCC--CCCCC--------CCCC T ss_conf 666564131178630468874478668997567--57999999985358838997436788--86302--------4888 Q ss_pred CCCCCCC-CCHHHHHHHHHHHHCCCCCCCEEEEECCCCCC Q ss_conf 3333100-00135688998753024554303541134323 Q gi|254780653|r 237 RSLCQRD-FSDNVHQIAWEAKRSACEIIVPRDVVVAREMK 275 (400) Q Consensus 237 ~Sl~e~~-~~~~ak~i~~~a~~~~~~I~lP~D~~v~~~~~ 275 (400) .-.+|+. .++ ++| +.++.-| +||++..+.. T Consensus 196 nmKLdK~e~v~--RRi-~~l~aEG------~~FvtnteiG 226 (517) T TIGR01317 196 NMKLDKEEIVD--RRI-DLLEAEG------VDFVTNTEIG 226 (517) T ss_pred CCCCCHHHHHH--HHH-HHHHHCC------CCCCCCCCCC T ss_conf 74337388999--999-9987478------4201783004 No 73 >PRK03669 mannosyl-3-phosphoglycerate phosphatase; Reviewed Probab=42.98 E-value=26 Score=15.96 Aligned_cols=63 Identities=21% Similarity=0.321 Sum_probs=41.7 Q ss_pred CEEEEEEECCCCCCCCEECC--HHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCC-CCC Q ss_conf 68999820167131857772--589999889899999889809999137788898473322478888764310133-545 Q gi|254780653|r 16 LRCLLRVDWNVPFIDGKVAD--VTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKN-ILF 92 (400) Q Consensus 16 k~VlvRvD~NvPi~~g~I~d--~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~-V~f 92 (400) +.++|=-| -||..+| ++....+.|.|+.|.++|-.||+.| +|. ...+ ..+.+-||.. ..| T Consensus 6 ~~~lIfTD-----LDGTLLdh~~y~~~~a~~~i~~L~~~~IpiI~~S------SKT--~~Ei----~~l~~~Lgl~~~Pf 68 (271) T PRK03669 6 QPLLIFTD-----LDGTLLDSHTYDWQPAAPWLTRLREAQVPVILCS------SKT--AAEM----LYLQQTLGLQGLPL 68 (271) T ss_pred CCEEEEEC-----CCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEC------CCC--HHHH----HHHHHHCCCCCCCE T ss_conf 98899987-----8245648788780689999999998799699967------887--9999----99999809999988 Q ss_pred CCC Q ss_conf 666 Q gi|254780653|r 93 VND 95 (400) Q Consensus 93 i~d 95 (400) |-+ T Consensus 69 I~E 71 (271) T PRK03669 69 IAE 71 (271) T ss_pred EEE T ss_conf 982 No 74 >PRK01158 phosphoglycolate phosphatase; Provisional Probab=42.82 E-value=24 Score=16.19 Aligned_cols=23 Identities=17% Similarity=0.276 Sum_probs=19.9 Q ss_pred HHHHHHHHHHHCCCEEEEEECCCCC Q ss_conf 9889899999889809999137788 Q gi|254780653|r 41 RVVPTILELVEKKAKVVIFSHLGRP 65 (400) Q Consensus 41 ~~~pTI~~l~~~~akvii~SH~GRP 65 (400) +.+..|+.+.++|-++++.| ||| T Consensus 24 ~~~~ai~~l~~~Gi~v~iaT--GR~ 46 (226) T PRK01158 24 KAVEAIRKAEKLGVPVILVT--GNI 46 (226) T ss_pred HHHHHHHHHHHCCCEEEEEC--CCC T ss_conf 99999999998899899989--997 No 75 >PRK07847 amidophosphoribosyltransferase; Provisional Probab=42.38 E-value=27 Score=15.90 Aligned_cols=129 Identities=13% Similarity=0.232 Sum_probs=67.6 Q ss_pred HHHHHCCCCCEEECCHH-----HHCCCCCC-----------------------CHHHHHHHHCCCCCEEEECCHHHHCCC Q ss_conf 65531364648982424-----30343210-----------------------002466531243642475440440113 Q gi|254780653|r 103 QSIASLSEGGIILAENV-----RFYSEEER-----------------------NDPDFVRMLSRNGDFYINDAFSVSHRA 154 (400) Q Consensus 103 ~~i~~l~~g~ilLLEN~-----Rf~~~E~~-----------------------n~~~f~k~La~laDiyVnDAF~~aHR~ 154 (400) +.+.+++|||++.+..- +|.+.+.+ -..++.+.||.-.++ || T Consensus 209 e~irdV~PGEiv~I~~~G~~s~~~~~~~~~~C~FEyIYFaRpdS~~~g~~Vy~~R~~~G~~LA~e~~~---~~------- 278 (489) T PRK07847 209 SFVREIEPGELLAIDADGVRSTRFAEPTPKGCVFEYVYLARPDSTIAGRSVHAARVEIGRRLAREHPV---EA------- 278 (489) T ss_pred EEEEECCCCEEEEECCCCEEEEECCCCCCCCCEEEEHHHCCCCCCCCCEEHHHHHHHHHHHHHHHCCC---CC------- T ss_conf 59981699708998189617985268887541565024318971147818999999999998743788---89------- Q ss_pred CCCEEECCCC-CCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECC-----HHHHHHHHHHHHHCCCEEEECHHHHHHHH Q ss_conf 7633302332-10003325565777653310166787189996078-----02557889987730688886205789999 Q gi|254780653|r 155 HASITGLSHL-LPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSK-----VSTKITLLINLVKKVDKLVIGGGMANSFL 228 (400) Q Consensus 155 haS~~gi~~~-lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaK-----isdKi~~i~~L~~k~D~iiigG~lantfL 228 (400) --++++|.- .|++.|+.-+.-+..-.-++.++--..++|.-.-+ +.-|+..++.+++- .+|++ T Consensus 279 -D~Vi~VPdSg~~aA~gyA~~sgip~~~gliKn~yvgRTFI~p~q~~R~~~v~~Kln~i~~~i~g-K~vvl--------- 347 (489) T PRK07847 279 -DLVIPVPESGTPAAVGYAQESGIPFGQGLVKNAYVGRTFIQPSQTIRQLGIRLKLNPLREVIRG-KRLVV--------- 347 (489) T ss_pred -CEEEECCCCHHHHHHHHHHHCCCCHHHCEEECCCCCCCEECCCHHHHHHCCCEEEECCHHEECC-CEEEE--------- T ss_conf -9798569971999999999739954442001461474013786656663651477402641779-87999--------- Q ss_pred HHHCCCCCCCCCCCCCCHHHHHHHHHHHHCCCC Q ss_conf 983014343333100001356889987530245 Q gi|254780653|r 229 VAQGMGVGRSLCQRDFSDNVHQIAWEAKRSACE 261 (400) Q Consensus 229 ~A~G~~IG~Sl~e~~~~~~ak~i~~~a~~~~~~ 261 (400) |-.|++.-. +.+.|++++++.|.+ T Consensus 348 ------VDDSIVRGt---T~k~iv~~Lr~aGAk 371 (489) T PRK07847 348 ------VDDSIVRGN---TQRALVRMLREAGAA 371 (489) T ss_pred ------ECCCCCCCC---CHHHHHHHHHHCCCC T ss_conf ------847835667---799999999976998 No 76 >PRK06197 short chain dehydrogenase; Provisional Probab=42.28 E-value=27 Score=15.89 Aligned_cols=125 Identities=18% Similarity=0.139 Sum_probs=60.6 Q ss_pred CCHHHHCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 71010015996899982016713185777258999988989999988980999913778889847332247888876431 Q gi|254780653|r 6 TMNDLRDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESI 85 (400) Q Consensus 6 ti~d~~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~ 85 (400) |-.|+.|+.||+|+|-- -+. =| -..|-+.|.++|++||+.. |-..+. +..++.+.+. T Consensus 7 ~~~~ipDL~GK~~lITG------a~s------GI--G~~~A~~La~~ga~Vil~~---R~~~k~------~~a~~~i~~~ 63 (306) T PRK06197 7 TAADIPDQSGRVAVVTG------ANT------GL--GYETAAALAAKGAHVVLAV---RNLDKG------NAAAARITAA 63 (306) T ss_pred CHHHCCCCCCCEEEECC------CCC------HH--HHHHHHHHHHCCCEEEEEE---CCHHHH------HHHHHHHHHH T ss_conf 86568898999999916------895------99--9999999997849899997---989999------9999999976 Q ss_pred C-CCCCCCCCCCCHHHHHHHHHHCCCCCEEECCHHHHCCCCCCCHHHHHHHHCCCCCEEEECCHHHHCCCCCCEEECCCC Q ss_conf 0-133545666310255665531364648982424303432100024665312436424754404401137633302332 Q gi|254780653|r 86 L-HKNILFVNDCIGSTLSQSIASLSEGGIILAENVRFYSEEERNDPDFVRMLSRNGDFYINDAFSVSHRAHASITGLSHL 164 (400) Q Consensus 86 l-~~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~~n~~~f~k~La~laDiyVnDAF~~aHR~haS~~gi~~~ 164 (400) . +.+|.|+. -|.-=|+.+|-+.+|... -.+--|+.||.|=-......-+--|+... T Consensus 64 ~~~~~i~~~~----------------lDLssl~sV~~~a~~~~~-------~~~~lDvLinNAGi~~~~~~~T~dG~E~~ 120 (306) T PRK06197 64 HPGADVTLQE----------------LDLASLASVRAAADALRA-------AYPRIDLLINNAGVMYTPKQTTADGFELQ 120 (306) T ss_pred CCCCCEEEEE----------------CCCHHHHHHHHHHHHHHH-------CCCCCCEEEECCCCCCCCCCCCCCCHHHH T ss_conf 8998579997----------------664307789999999996-------18987689977844568872267653333 Q ss_pred CC-CCCC-HHHHHH Q ss_conf 10-0033-255657 Q gi|254780653|r 165 LP-SYIG-RAMQKE 176 (400) Q Consensus 165 lp-s~aG-~l~ekE 176 (400) .- .+.| |++-+. T Consensus 121 f~vN~lghflLt~l 134 (306) T PRK06197 121 FGTNHLGHFALTGL 134 (306) T ss_pred HHHHHHHHHHHHHH T ss_conf 33313688888887 No 77 >PRK03815 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=41.89 E-value=27 Score=15.85 Aligned_cols=38 Identities=21% Similarity=0.347 Sum_probs=19.9 Q ss_pred EEEEEEECCHHHHHHHHHHHHHCCCEEEECHHHHHHHHH Q ss_conf 189996078025578899877306888862057899999 Q gi|254780653|r 191 LVAIVGGSKVSTKITLLINLVKKVDKLVIGGGMANSFLV 229 (400) Q Consensus 191 ~v~IiGGaKisdKi~~i~~L~~k~D~iiigG~lantfL~ 229 (400) .++|-|---=+|.-.++.+++++.. ..+||=++..++. T Consensus 91 ~I~ITGTNGKsTtt~li~~iL~~~g-~~~~GNIG~Pl~~ 128 (401) T PRK03815 91 SIWISGTNGKTTTTQMLTHLLEDFG-AVSGGNIGTPLAE 128 (401) T ss_pred CEEEECCCCHHHHHHHHHHHHHHCC-CCEECCCCHHHHH T ss_conf 4899847777899999999998669-8130416643886 No 78 >cd00288 Pyruvate_Kinase Pyruvate kinase (PK): Large allosteric enzyme that regulates glycolysis through binding of the substrate, phosphoenolpyruvate, and one or more allosteric effectors. Like other allosteric enzymes, PK has a high substrate affinity R state and a low affinity T state. PK exists as several different isozymes, depending on organism and tissue type. In mammals, there are four PK isozymes: R, found in red blood cells, L, found in liver, M1, found in skeletal muscle, and M2, found in kidney, adipose tissue, and lung. PK forms a homotetramer, with each subunit containing three domains. The T state to R state transition of PK is more complex than in most allosteric enzymes, involving a concerted rotation of all 3 domains of each monomer in the homotetramer. Probab=41.38 E-value=28 Score=15.80 Aligned_cols=304 Identities=14% Similarity=0.175 Sum_probs=142.5 Q ss_pred HHHHHHHHHHCCCEEEE--EECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCC-CEEECCH Q ss_conf 88989999988980999--91377888984733224788887643101335456663102556655313646-4898242 Q gi|254780653|r 42 VVPTILELVEKKAKVVI--FSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGSTLSQSIASLSEG-GIILAEN 118 (400) Q Consensus 42 ~~pTI~~l~~~~akvii--~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~~~i~~l~~g-~ilLLEN 118 (400) ...+|+.|+++|..+.= +||-. + +.....-...+.+++.++++|...-|.-|++.+.. .+++| ++.|-++ T Consensus 16 ~~~~l~~li~aG~~v~RiN~SHg~-~----e~~~~~i~~iR~~~~~~~~~i~Il~Dl~GpkiR~g--~~~~~~~i~l~~G 88 (480) T cd00288 16 SVENLKKLIKAGMNVARMNFSHGS-H----EYHQSRIDNVREAAEKTGGPVAIALDTKGPEIRTG--LFKGGKDISLKAG 88 (480) T ss_pred CHHHHHHHHHCCCCEEEEECCCCC-H----HHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEE--ECCCCCEEEECCC T ss_conf 999999999879999999899999-9----99999999999999971998079997889835898--5489963795489 Q ss_pred HHH--CCC--CC--------CCHHHHHHHHCCCCCEEEECCHHHH---CCCCCC-EE-----E----------CCCC--- Q ss_conf 430--343--21--------0002466531243642475440440---113763-33-----0----------2332--- Q gi|254780653|r 119 VRF--YSE--EE--------RNDPDFVRMLSRNGDFYINDAFSVS---HRAHAS-IT-----G----------LSHL--- 164 (400) Q Consensus 119 ~Rf--~~~--E~--------~n~~~f~k~La~laDiyVnDAF~~a---HR~haS-~~-----g----------i~~~--- 164 (400) -++ ... +. -+-++|.+.+..--.+|+.|..=.. -..... +. | +|.. T Consensus 89 ~~v~l~~~~~~~~~~~~~i~v~~~~l~~~v~~Gd~I~idDG~i~l~V~~~~~~~~i~~~v~~gG~l~s~Kgvn~p~~~~~ 168 (480) T cd00288 89 DKFLVTTDPAAKKGTKEKIYVDYKNLTKDVSPGNTILVDDGLLSLKVLSKDDDKTLVCEVLNGGVLGSRKGVNLPGTDVD 168 (480) T ss_pred CEEEEEECCCCCCCCCCEEECCHHHHHHHCCCCCEEEEECCEEEEEEEEECCCCEEEEEEEECCEECCCCCCCCCCCCCC T ss_conf 99999856876789888896572888976589988999478389999998589769999975839738863236888678 Q ss_pred CCC------------------CCCHHH---HHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHH--HHHHHHCCCEEEECH Q ss_conf 100------------------033255---657776533101667871899960780255788--998773068888620 Q gi|254780653|r 165 LPS------------------YIGRAM---QKELSMLESCFSESKKPLVAIVGGSKVSTKITL--LINLVKKVDKLVIGG 221 (400) Q Consensus 165 lps------------------~aG~l~---ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~--i~~L~~k~D~iiigG 221 (400) +|. +.++.| .+.+..+.+++.+.. +-+-|+ |||.++-.+ |+..++.+|.|++.= T Consensus 169 lp~lTekD~~di~~a~~~~vD~valSFVr~~~Dv~~lr~~l~~~g-~~~~Ii--aKIE~~~al~nl~eIi~~sDgIMIAR 245 (480) T cd00288 169 LPALSEKDKADLRFGVEQGVDMIFASFVRKASDVLEIREVLGEKG-KDIKII--AKIENQEGVNNFDEILEASDGIMVAR 245 (480) T ss_pred CCCCCHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHCC-CCCEEE--EEECCHHHHHCHHHHHHHCCEEEEEC T ss_conf 667987789999868875999999888898999999999999748-866699--98527766647999998538899977 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCC--CCCCCC-EECCC-CCCC-- Q ss_conf 57899999830143433331000013568899875302455430354113432323333--102532-12013-3457-- Q gi|254780653|r 222 GMANSFLVAQGMGVGRSLCQRDFSDNVHQIAWEAKRSACEIIVPRDVVVAREMKTGIPT--QVVSAQ-SVPLD-SIIL-- 295 (400) Q Consensus 222 ~lantfL~A~G~~IG~Sl~e~~~~~~ak~i~~~a~~~~~~I~lP~D~~v~~~~~~~~~~--~~~~~~-~i~~~-~~i~-- 295 (400) | -.|.+++- ++.--.-|.|++++.+.+.+.++-+...-+- ..+..|+ ++.++. .+-++ +-+| T Consensus 246 G-------DLgvEi~~----e~vp~~Qk~Ii~~c~~~gKPvIvATqmLeSM-i~~p~PTRAEv~DVanAv~dG~DavMLs 313 (480) T cd00288 246 G-------DLGVEIPA----EEVFLAQKMLIAKCNLAGKPVITATQMLESM-IYNPRPTRAEVSDVANAVLDGTDCVMLS 313 (480) T ss_pred C-------CCCCCCCH----HHHHHHHHHHHHHHHHCCCEEEEECCHHHHH-HCCCCCCEEEHHHHHHHHHHCCCEEEEC T ss_conf 8-------65564798----8989999999999998399599967368876-2389985210245888987458679975 Q ss_pred ------CCCCHHHHHHHHHHCCCCEEEEECCCC---CCCCCCHHHHHHHHHHHHHHHHCCCCCEEEE---CCCHHHHHHH Q ss_conf ------652003567776405776799976625---3227436588999999998611148858997---4804699899 Q gi|254780653|r 296 ------DVGFKTVEYIKQVIAQARTVMWNGPLG---VFEIEPFDRATVEVAHYVAKLTKERRIISIA---GGGDTITALA 363 (400) Q Consensus 296 ------DIGp~Ti~~~~~~I~~AktI~WNGP~G---vfE~~~F~~GT~~l~~~ia~~~~~~~~~siv---GGGdT~aai~ 363 (400) .--.++++.+..++..+...+|.-+.- ..+.+.-..-+.+++.+........++..|+ --|.|+..+. T Consensus 314 ~ETa~G~yPv~~V~~m~~I~~~aE~~~~~~~~~~~~~~~~~~~~~~~~~ia~~a~~~a~~~~a~aIv~~T~sG~tA~~is 393 (480) T cd00288 314 GETAKGKYPVEAVKAMARICLEAEKALSHRVLFNEMRRLTPRPTSTTEAVAMSAVRAAFELGAKAIVVLTTSGRTARLVS 393 (480) T ss_pred HHHCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHH T ss_conf 32227879899999999999999735204544445431257899989999999999998469989999879858999999 Q ss_pred HCCC Q ss_conf 8488 Q gi|254780653|r 364 HAGI 367 (400) Q Consensus 364 ~~g~ 367 (400) ++-- T Consensus 394 ~~RP 397 (480) T cd00288 394 KYRP 397 (480) T ss_pred HHCC T ss_conf 6597 No 79 >PRK08305 spoVFB dipicolinate synthase subunit B; Reviewed Probab=40.96 E-value=28 Score=15.76 Aligned_cols=67 Identities=24% Similarity=0.265 Sum_probs=42.3 Q ss_pred CCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEE-EEEECCCCCCCCCCCC-CCHHHHHHHHHHHCCCC Q ss_conf 159968999820167131857772589999889899999889809-9991377888984733-22478888764310133 Q gi|254780653|r 12 DIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKV-VIFSHLGRPQSKSDKD-CSLFKVVSIAESILHKN 89 (400) Q Consensus 12 d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akv-ii~SH~GRPkg~~~~~-~Sl~~v~~~l~~~l~~~ 89 (400) +++||+|++-+ .|-|. -.+++++-++.|.++||.| +++|+--. ..+.. -.-....+.++++.|.+ T Consensus 2 ~L~gK~I~lgv-------TGSia---ay~kv~~~ir~L~~~GA~V~~ImS~~a~---~~~Trfg~~~~~~~~le~iTg~~ 68 (195) T PRK08305 2 SLKGKRIGFGL-------TGSHC---TYEEVMPQIEKLVAEGAEVRPIVSYTVQ---TTDTRFGEGEEWIKKIEEITGFK 68 (195) T ss_pred CCCCCEEEEEE-------CCHHH---HHHHHHHHHHHHHHCCCEEEEEECHHHH---HCCCCCCCHHHHHHHHHHHHCCC T ss_conf 76898899996-------72799---8999999999999879969999754477---50566752788999999985995 Q ss_pred CC Q ss_conf 54 Q gi|254780653|r 90 IL 91 (400) Q Consensus 90 V~ 91 (400) +. T Consensus 69 vi 70 (195) T PRK08305 69 VI 70 (195) T ss_pred CE T ss_conf 20 No 80 >cd03243 ABC_MutS_homologs The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family also possess a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, and recognition of specific DNA stru Probab=40.75 E-value=22 Score=16.46 Aligned_cols=17 Identities=24% Similarity=0.221 Sum_probs=9.9 Q ss_pred HHHHHHHHHHHHCCCCC Q ss_conf 55657776533101667 Q gi|254780653|r 172 AMQKELSMLESCFSESK 188 (400) Q Consensus 172 l~ekEi~~L~~~l~~~~ 188 (400) .|+.|+..++.+++... T Consensus 91 ~F~~e~~~~~~i~~~~~ 107 (202) T cd03243 91 TFMAELLELKEILSLAT 107 (202) T ss_pred HHHHHHHHHHHHHHHCC T ss_conf 49999999999998677 No 81 >cd00079 HELICc Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process Probab=40.65 E-value=28 Score=15.73 Aligned_cols=69 Identities=14% Similarity=0.202 Sum_probs=38.8 Q ss_pred CCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCC-CCHHHHHHHHHHCCCCC Q ss_conf 72589999889899999889809999137788898473322478888764310133545666-31025566553136464 Q gi|254780653|r 34 ADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVND-CIGSTLSQSIASLSEGG 112 (400) Q Consensus 34 ~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d-~~g~~~~~~i~~l~~g~ 112 (400) .++.|+...+.-|+...+++.|+++.++. .-.+..++.+|.+ .+.++..+.- ....+-...++.+++|+ T Consensus 9 ~~~~K~~~l~~~i~~~~~~~~kviIF~~~---------~~~~~~l~~~L~~-~~~~~~~~~~~~~~~~R~~~~~~F~~~~ 78 (131) T cd00079 9 VEDEKLEALLELLKEHLKKGGKVLIFCPS---------KKMLDELAELLRK-PGIKVAALHGDGSQEEREEVLKDFREGE 78 (131) T ss_pred CCHHHHHHHHHHHHHHHHCCCEEEEEECC---------HHHHHHHHHHHHH-CCCCEEEEECCCCHHHHHHHHHHHHHHC T ss_conf 86699999999999999789909999788---------9999999999955-8998999989999999999999977540 No 82 >PRK13539 cytochrome c biogenesis protein CcmA; Provisional Probab=40.43 E-value=29 Score=15.71 Aligned_cols=61 Identities=11% Similarity=0.097 Sum_probs=32.6 Q ss_pred CCCEEEEEEECCHHHHHHHHHHHHHCCCEEEECHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCC Q ss_conf 78718999607802557889987730688886205789999983014343333100001356889987530245543 Q gi|254780653|r 188 KKPLVAIVGGSKVSTKITLLINLVKKVDKLVIGGGMANSFLVAQGMGVGRSLCQRDFSDNVHQIAWEAKRSACEIIV 264 (400) Q Consensus 188 ~~P~v~IiGGaKisdKi~~i~~L~~k~D~iiigG~lantfL~A~G~~IG~Sl~e~~~~~~ak~i~~~a~~~~~~I~l 264 (400) ++|.--.=||-| -|+.+...|+...+-+++-= ..+-.+....+...+++....+.+.-|++ T Consensus 121 ~~~~~~LSgG~k--qrv~la~al~~~p~vllLDE--------------PtsgLD~~~~~~v~~~i~~~~~~g~tiIi 181 (206) T PRK13539 121 HLPFGYLSAGQK--RRVALARLLVSNRPIWLLDE--------------PTAALDSASQALFAELIRAHLAQGGIVIA 181 (206) T ss_pred CCCHHHCCHHHH--HHHHHHHHHHHCCCEEEEEC--------------CCCCCCHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 788124999999--99999999986989899979--------------97778999999999999999958999999 No 83 >TIGR02700 flavo_MJ0208 archaeoflavoprotein, MJ0208 family; InterPro: IPR014073 This entry describes one of two paralogous families of archaeal flavoprotein. The other, described by IPR014072 from INTERPRO and typified by the partially characterised AF1518 of ArchaArchaeoglobus fulgidus, is a homodimeric FMN-containing flavoprotein that accepts electrons from ferredoxin and can transfer them to various oxidoreductases. The function of this protein family is unknown.. Probab=40.38 E-value=5.9 Score=20.10 Aligned_cols=50 Identities=24% Similarity=0.383 Sum_probs=35.8 Q ss_pred HHCCCEEEECHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHCCCC-CCCEEEEE Q ss_conf 730688886205789999983014343333100001356889987530245-54303541 Q gi|254780653|r 211 VKKVDKLVIGGGMANSFLVAQGMGVGRSLCQRDFSDNVHQIAWEAKRSACE-IIVPRDVV 269 (400) Q Consensus 211 ~~k~D~iiigG~lantfL~A~G~~IG~Sl~e~~~~~~ak~i~~~a~~~~~~-I~lP~D~~ 269 (400) +.|.|.+++.=+-+||--|-. |.|..||+.+... .|-|-.++ +++|+|+- T Consensus 83 LgkYD~LivsPAT~NTVaKia-~GIADsLVTNAva--------QAgKG~Vpv~vVPvDye 133 (237) T TIGR02700 83 LGKYDLLIVSPATANTVAKIA-YGIADSLVTNAVA--------QAGKGDVPVLVVPVDYE 133 (237) T ss_pred CCCCCEEEECCCCCCCHHHEE-ECCHHHHHHHHHH--------HCCCCCCCEEEEEEECC T ss_conf 453265787585102001100-0210035676775--------22377543588875214 No 84 >TIGR01292 TRX_reduct thioredoxin-disulfide reductase; InterPro: IPR005982 Reactive oxygen species (ROS) are known mediators of intracellular signaling cascades. Excessive production of ROS may, however, lead to oxidative stress, loss of cell function, and ultimately apoptosis or necrosis. A balance between oxidant and antioxidant intracellular systems is hence vital for cell function, regulation, and adaptation to diverse growth conditions. Thioredoxin reductase in conjunction with thioredoxin is a ubiquitous oxidoreductase system with antioxidant and redox regulatory roles. Thioredoxin reductase (1.8.1.9 from EC) reduces oxidised thioredoxin in the presence of NADPH. Reduced thioredoxin serves as an electron donor for thioredoxin peroxidase which consequently reduces H_2O_2 to H_2O. In mammals, extracellular forms of Trx also have cytokine-like effects. Mammalian TrxR has a highly reactive active site selenocysteine residue resulting in a profound reductive capacity, reducing several substrates in addition to Trx .; GO: 0004791 thioredoxin-disulfide reductase activity, 0019430 removal of superoxide radicals, 0005737 cytoplasm. Probab=40.34 E-value=27 Score=15.90 Aligned_cols=125 Identities=17% Similarity=0.235 Sum_probs=69.2 Q ss_pred EECCHHHHCCCCCCEE-ECCCCCC----CCC---C--HHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHC Q ss_conf 7544044011376333-0233210----003---3--2556577765331016678718999607802557889987730 Q gi|254780653|r 144 INDAFSVSHRAHASIT-GLSHLLP----SYI---G--RAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKK 213 (400) Q Consensus 144 VnDAF~~aHR~haS~~-gi~~~lp----s~a---G--~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k 213 (400) |-=|=|++||. -.+- |=..|+- .|| | +++ ..|+ |+|+||-. +-+.=-.+|.+- T Consensus 111 vIiAtGa~~r~-lg~~kGE~ef~GrGVSyCA~CDGA~~ff------------k~K~--V~VvGGGd--sA~eEA~yL~~~ 173 (321) T TIGR01292 111 VIIATGAEARK-LGIPKGEDEFLGRGVSYCATCDGASPFF------------KNKE--VAVVGGGD--SALEEALYLTRI 173 (321) T ss_pred EEEECCCHHHH-CCCCCCHHHHCCCCEEEEEHHCCCCHHH------------CCCE--EEEECCCC--HHHHHHHHHHHC T ss_conf 99913871542-3788664553268666722310540120------------5988--99987982--488888999853 Q ss_pred CCEEEECHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHC---CCCCCCEEEEECCCCCCC--CCC-CCCC---- Q ss_conf 688886205789999983014343333100001356889987530---245543035411343232--333-3102---- Q gi|254780653|r 214 VDKLVIGGGMANSFLVAQGMGVGRSLCQRDFSDNVHQIAWEAKRS---ACEIIVPRDVVVAREMKT--GIP-TQVV---- 283 (400) Q Consensus 214 ~D~iiigG~lantfL~A~G~~IG~Sl~e~~~~~~ak~i~~~a~~~---~~~I~lP~D~~v~~~~~~--~~~-~~~~---- 283 (400) ++++.+.= ..|...-.+-++++++++ ++.++++.-+.--. .++ ..+ .++. T Consensus 174 a~kV~lvH-------------------RRd~fRA~k~~~~r~~~~~eG~I~~l~n~~v~Ei~-G~~~~~V~~v~i~N~~t 233 (321) T TIGR01292 174 AKKVTLVH-------------------RRDKFRAEKILLDRLKKNPEGKIEFLWNSTVEEIV-GDNSKKVESVKIKNTVT 233 (321) T ss_pred CCEEEEEE-------------------CCCCCCCCHHHHHHHHHCCCCCEEEECCCEEEEEE-CCCCEEEEEEEEEECCC T ss_conf 87679997-------------------79863632899989983789878996486899996-17831553358996238 Q ss_pred -CCCEECCCCCCCCCCCHHHHHH Q ss_conf -5321201334576520035677 Q gi|254780653|r 284 -SAQSVPLDSIILDVGFKTVEYI 305 (400) Q Consensus 284 -~~~~i~~~~~i~DIGp~Ti~~~ 305 (400) ...+++-+..++=||-+....+ T Consensus 234 ~e~~~l~vdGvF~aIG~~P~t~~ 256 (321) T TIGR01292 234 GEEEELKVDGVFIAIGHEPNTEL 256 (321) T ss_pred CEEEEEEECEEEEEECCCCCHHH T ss_conf 84889984447898412032057 No 85 >pfam05445 Pox_ser-thr_kin Poxvirus serine/threonine protein kinase. Probab=39.62 E-value=12 Score=18.24 Aligned_cols=66 Identities=18% Similarity=0.208 Sum_probs=30.4 Q ss_pred CCCCHHHHHHHHCCCCCEEEECCHHHHCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHH-HH Q ss_conf 21000246653124364247544044011376333023321000332556577765331016678718999607802-55 Q gi|254780653|r 125 EERNDPDFVRMLSRNGDFYINDAFSVSHRAHASITGLSHLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVS-TK 203 (400) Q Consensus 125 E~~n~~~f~k~La~laDiyVnDAF~~aHR~haS~~gi~~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKis-dK 203 (400) |.+.+..|+|-++-.--.||.+-|-+- +....-. .-+|+|-.+ -..-.|.-+.|.-=|+.| || T Consensus 196 ~~K~~~~Fvkl~SyFYp~~v~sniNVI-----------Nnf~~mi-hffe~ekra----n~~y~rGnIIIfPLA~cSadk 259 (434) T pfam05445 196 ERKDSEKFVKLFSYFYPKVVKSNINVI-----------NNFSHMI-HFFEHEKRA----NYSYDRGNIIIFPLARCSADK 259 (434) T ss_pred HHCCHHHHHHHHHHHHHHHHHHHHHHH-----------HHHHHHH-HHHHHHHHC----CCCCCCCCEEEEEHHHCCHHH T ss_conf 741447799999988599997513665-----------2138999-998641311----553136868998634244200 Q ss_pred HHH Q ss_conf 788 Q gi|254780653|r 204 ITL 206 (400) Q Consensus 204 i~~ 206 (400) +.. T Consensus 260 v~~ 262 (434) T pfam05445 260 LTL 262 (434) T ss_pred CCH T ss_conf 778 No 86 >COG2071 Predicted glutamine amidotransferases [General function prediction only] Probab=39.46 E-value=30 Score=15.61 Aligned_cols=17 Identities=24% Similarity=0.432 Sum_probs=9.2 Q ss_pred HHHHHHHCCCEEEECHH Q ss_conf 89987730688886205 Q gi|254780653|r 206 LLINLVKKVDKLVIGGG 222 (400) Q Consensus 206 ~i~~L~~k~D~iiigG~ 222 (400) .+.++++.+|.|++.|+ T Consensus 53 ~~~~~l~~iDgliltGg 69 (243) T COG2071 53 DARQYLDLIDGLILTGG 69 (243) T ss_pred HHHHHHHHCCEEEECCC T ss_conf 89999865367895289 No 87 >TIGR01700 PNPH purine nucleoside phosphorylase I, inosine and guanosine-specific; InterPro: IPR011270 This entry represents a family of bacterial and metazoan purine phosphorylases acting primarily on inosine and guanosine and not acting on adenosine. PNP-I refers to the nomenclature from Bacillus stearothermophilus where PHP-II refers to the nucleotidase acting on adenosine as the primary substrate. The bacterial enzymes (PUNA) are typified by the Bacillus PupG protein which is involved in the metabolism of nucleosides as a carbon source. Several metazoan enzymes (PNPH) are well characterised including the human and bovine enzymes which have been crystallised. ; GO: 0004731 purine-nucleoside phosphorylase activity, 0006139 nucleobase nucleoside nucleotide and nucleic acid metabolic process. Probab=38.98 E-value=30 Score=15.57 Aligned_cols=94 Identities=18% Similarity=0.178 Sum_probs=59.9 Q ss_pred CCCEEEEEECCCCCCCCCCCCCCHHHHH--HHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCHHH---HCCCCC Q ss_conf 8980999913778889847332247888--87643101335456663102556655313646489824243---034321 Q gi|254780653|r 52 KKAKVVIFSHLGRPQSKSDKDCSLFKVV--SIAESILHKNILFVNDCIGSTLSQSIASLSEGGIILAENVR---FYSEEE 126 (400) Q Consensus 52 ~~akvii~SH~GRPkg~~~~~~Sl~~v~--~~l~~~l~~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~R---f~~~E~ 126 (400) .|-+||.| |||= =-|+=.||+..+. =++-|+||.+..+|..--|--= .+++.||+.|.--.= +.+||. T Consensus 49 ~G~~Vv~m--QGRF-H~YEGny~~~~VTFPVRV~k~LGve~l~VtNAAGGiN----~~F~~GDlmlI~DHIN~~~~~G~N 121 (259) T TIGR01700 49 GGKPVVAM--QGRF-HLYEGNYDMAKVTFPVRVMKLLGVEALVVTNAAGGIN----TEFKVGDLMLIRDHINLMGLAGFN 121 (259) T ss_pred CCEEEEEE--CCCC-CCCCCCCCHHHHCHHHHHHHHCCCCEEEEECCCCCCC----CCCCCCCEEEECCHHHCCCCCCCC T ss_conf 79679983--1886-5667887766604356777772712588713554527----776677778971112164416788 Q ss_pred ----CCHHHHHHHHCCCCCEEEECCHHHHC Q ss_conf ----00024665312436424754404401 Q gi|254780653|r 127 ----RNDPDFVRMLSRNGDFYINDAFSVSH 152 (400) Q Consensus 127 ----~n~~~f~k~La~laDiyVnDAF~~aH 152 (400) =|++.|.-+.-+.-|+|=-|=-.-|| T Consensus 122 PL~GPN~~~FG~RFPdms~AYD~~~~~kA~ 151 (259) T TIGR01700 122 PLIGPNDERFGARFPDMSDAYDKELRQKAK 151 (259) T ss_pred CCCCCCCCCCCCCCCCCCCCCCHHHHHHHH T ss_conf 886868876787877466312768899999 No 88 >PRK08227 aldolase; Validated Probab=38.54 E-value=15 Score=17.56 Aligned_cols=50 Identities=26% Similarity=0.446 Sum_probs=32.3 Q ss_pred HHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCCEEEECHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 6533101667871899960780255788998773068888620578999998301434333310 Q gi|254780653|r 179 MLESCFSESKKPLVAIVGGSKVSTKITLLINLVKKVDKLVIGGGMANSFLVAQGMGVGRSLCQR 242 (400) Q Consensus 179 ~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~D~iiigG~lantfL~A~G~~IG~Sl~e~ 242 (400) .+.++.....-|. .|.||.|.+|. .+| +.+...+-.| |.|.-+|+..++. T Consensus 210 ~f~~Vv~a~pvPV-liaGG~k~~d~-e~L----~~v~~a~~aG--------a~Gv~~GRNVfQ~ 259 (291) T PRK08227 210 GFERITAGCPVPI-VIAGGKKLPER-DAL----EMCYQAIDQG--------ASGVDMGRNIFQS 259 (291) T ss_pred HHHHHHHCCCCCE-EEECCCCCCHH-HHH----HHHHHHHHCC--------CCEEEECCHHHCC T ss_conf 5999996489978-99679989869-999----9999999769--------9368724002358 No 89 >PRK10976 putative sugar phosphatase; Provisional Probab=38.17 E-value=31 Score=15.48 Aligned_cols=45 Identities=16% Similarity=0.178 Sum_probs=27.9 Q ss_pred CCEECC-HHHH-HHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHH Q ss_conf 857772-5899-9988989999988980999913778889847332247888876 Q gi|254780653|r 30 DGKVAD-VTRI-ERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIA 82 (400) Q Consensus 30 ~g~I~d-~~RI-~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l 82 (400) ||++++ +.+| .+...+|+.+.++|-++++.| ||| ..++.++.+.+ T Consensus 10 DGTLl~~~~~is~~~~~ai~~l~~~Gi~~viAT--GR~------~~~~~~~~~~l 56 (266) T PRK10976 10 DGTLLSPDHTLSPYAKETLKLLTARGIHFVFAT--GRH------HVDVGQIRDNL 56 (266) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEC--CCC------HHHHHHHHHHC T ss_conf 646558879719999999999997899999997--999------79999999973 No 90 >smart00534 MUTSac ATPase domain of DNA mismatch repair MUTS family. Probab=37.52 E-value=31 Score=15.43 Aligned_cols=13 Identities=8% Similarity=0.087 Sum_probs=5.1 Q ss_pred HHHHHHHHHHCCC Q ss_conf 6577765331016 Q gi|254780653|r 174 QKELSMLESCFSE 186 (400) Q Consensus 174 ekEi~~L~~~l~~ 186 (400) ..|+..+..++++ T Consensus 63 ~~e~~~~~~il~~ 75 (185) T smart00534 63 MVEMKETANILKN 75 (185) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999983 No 91 >PRK12314 gamma-glutamyl kinase; Provisional Probab=36.96 E-value=32 Score=15.36 Aligned_cols=52 Identities=23% Similarity=0.326 Sum_probs=40.1 Q ss_pred CCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEEC----CCCC Q ss_conf 996899982016713185777258999988989999988980999913----7788 Q gi|254780653|r 14 RGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFSH----LGRP 65 (400) Q Consensus 14 ~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH----~GRP 65 (400) +-|||.|.+-=++=..+..=.|..||......|..+.++|..+||.|+ .|++ T Consensus 8 ~~krIViKiG~s~lt~~~g~~~~~~i~~la~~I~~l~~~G~~vvlVsSGav~~G~~ 63 (265) T PRK12314 8 SAKRIVIKVGSSSLSYENGKINLERIEQLVFVISDLMNKGKEVILVSSGAIGAGLT 63 (265) T ss_pred CCCEEEEEECHHHEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCHHHCCCC T ss_conf 08789999661126789998599999999999999997899899986374431500 No 92 >PRK07534 methionine synthase I; Validated Probab=36.84 E-value=32 Score=15.35 Aligned_cols=16 Identities=6% Similarity=0.191 Sum_probs=7.6 Q ss_pred HHHHHCCCEEEEEECC Q ss_conf 9999889809999137 Q gi|254780653|r 47 LELVEKKAKVVIFSHL 62 (400) Q Consensus 47 ~~l~~~~akvii~SH~ 62 (400) +.-+++||.||...-+ T Consensus 52 ~dyi~AGAdVI~TNTy 67 (335) T PRK07534 52 QGAVDAGSDIFLTNSF 67 (335) T ss_pred HHHHHHCCCEEEECCC T ss_conf 9999965999980776 No 93 >pfam02641 DUF190 Uncharacterized ACR, COG1993. Probab=36.73 E-value=22 Score=16.39 Aligned_cols=32 Identities=22% Similarity=0.383 Sum_probs=14.8 Q ss_pred EEECCCCCCCCEECCHHHHHHHHHHHHHHHHC Q ss_conf 82016713185777258999988989999988 Q gi|254780653|r 21 RVDWNVPFIDGKVADVTRIERVVPTILELVEK 52 (400) Q Consensus 21 RvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~ 52 (400) |..-|.|+.=.-|-.+.||++.+|.++.+++. T Consensus 59 ~ls~dlPvvIe~VD~~eki~~~lp~l~~~~~~ 90 (101) T pfam02641 59 QLSRRLPVVIEIVDTEEKIARFLDEIKELIKN 90 (101) T ss_pred HHCCCCCEEEEEEECHHHHHHHHHHHHHHCCC T ss_conf 42699998999993999999999999976468 No 94 >PRK10416 cell division protein FtsY; Provisional Probab=36.69 E-value=33 Score=15.34 Aligned_cols=77 Identities=16% Similarity=0.217 Sum_probs=33.0 Q ss_pred CCCCCEEEEEEE----CCHHHHHHHHHHHHHCCCEEEECH-------HHHHHHHHHHCCCCCCCCCC----CCCCHHHHH Q ss_conf 667871899960----780255788998773068888620-------57899999830143433331----000013568 Q gi|254780653|r 186 ESKKPLVAIVGG----SKVSTKITLLINLVKKVDKLVIGG-------GMANSFLVAQGMGVGRSLCQ----RDFSDNVHQ 250 (400) Q Consensus 186 ~~~~P~v~IiGG----aKisdKi~~i~~L~~k~D~iiigG-------~lantfL~A~G~~IG~Sl~e----~~~~~~ak~ 250 (400) +.++|+|.++-| .|-.+--++-..|-++--+++++. ++-..=.|+. .+|-.++. .+....+.+ T Consensus 291 ~~~~P~VIl~vGvNG~GKTTTigKLA~~~~~~gkkVllaA~DTfRaAAieQL~~w~~--r~~v~vi~~~~g~Dpa~V~~d 368 (499) T PRK10416 291 EGKTPFVILMVGVNGVGKTTTIGKLARQFEQQGKSVMLAAGDTFRAAAVEQLQVWGQ--RNNIPVIAQHTGADSASVIFD 368 (499) T ss_pred CCCCCEEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHC--CCCCEEEECCCCCCHHHHHHH T ss_conf 899987999974787878989999999999779953788406675689999999842--457369836899997999999 Q ss_pred HHHHHHHCCCCCCC Q ss_conf 89987530245543 Q gi|254780653|r 251 IAWEAKRSACEIIV 264 (400) Q Consensus 251 i~~~a~~~~~~I~l 264 (400) .++.++.++..+++ T Consensus 369 ai~~a~~~~~Dvvi 382 (499) T PRK10416 369 AIQAAKARNVDVLI 382 (499) T ss_pred HHHHHHHCCCCEEE T ss_conf 99999972999899 No 95 >KOG4105 consensus Probab=36.49 E-value=11 Score=18.46 Aligned_cols=19 Identities=32% Similarity=0.496 Sum_probs=13.1 Q ss_pred CCEEEECCHHHHCCCCCCE Q ss_conf 6424754404401137633 Q gi|254780653|r 140 GDFYINDAFSVSHRAHASI 158 (400) Q Consensus 140 aDiyVnDAF~~aHR~haS~ 158 (400) +-+---|.||+|||-|+-- T Consensus 7 a~~~Rr~sFSA~HRLhS~~ 25 (141) T KOG4105 7 ATMERRDSFSAAHRLHSEK 25 (141) T ss_pred EEEEEEHHHHHHHHHCCCC T ss_conf 5666520345777644401 No 96 >cd00209 DHFR Dihydrofolate reductase (DHFR). Reduces 7,8-dihydrofolate to 5,6,7,8-tetrahydrofolate with NADPH as a cofactor. This is an essential step in the biosynthesis of deoxythymidine phosphate since 5,6,7,8-tetrahydrofolate is required to regenerate 5,10-methylenetetrahydrofolate which is then utilized by thymidylate synthase. Inhibition of DHFR interrupts thymidilate synthesis and DNA replication, inhibitors of DHFR (such as Methotrexate) are used in cancer chemotherapy. 5,6,7,8-tetrahydrofolate also is involved in glycine, serine, and threonine metabolism and aminoacyl-tRNA biosynthesis. Probab=36.16 E-value=32 Score=15.35 Aligned_cols=25 Identities=16% Similarity=0.406 Sum_probs=18.6 Q ss_pred EEEEEEECCHHHHHHHHHHHHHCCCEEEECH Q ss_conf 1899960780255788998773068888620 Q gi|254780653|r 191 LVAIVGGSKVSTKITLLINLVKKVDKLVIGG 221 (400) Q Consensus 191 ~v~IiGGaKisdKi~~i~~L~~k~D~iiigG 221 (400) -+.|+||+++ .+.+++.+|.+++.= T Consensus 93 ~i~IiGG~~i------Y~~~l~~~d~i~lT~ 117 (158) T cd00209 93 EIFVIGGAEI------YKQALPYADRLYLTR 117 (158) T ss_pred EEEEECHHHH------HHHHHHHCCEEEEEE T ss_conf 4999578999------999998619799998 No 97 >PRK00131 aroK shikimate kinase; Reviewed Probab=35.63 E-value=34 Score=15.23 Aligned_cols=32 Identities=19% Similarity=0.409 Sum_probs=20.8 Q ss_pred CCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCC Q ss_conf 8980999913778889847332247888876431013354566 Q gi|254780653|r 52 KKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVN 94 (400) Q Consensus 52 ~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~ 94 (400) ++..|+|+...|- || .+ +.+.|++.|+.+ |+| T Consensus 3 ~~~nI~liG~~Gs--GK----Tt---vgk~LA~~L~~~--fiD 34 (175) T PRK00131 3 KGPNIVLIGMMGA--GK----ST---IGRLLAKRLGYE--FID 34 (175) T ss_pred CCCEEEEECCCCC--CH----HH---HHHHHHHHHCCC--CCC T ss_conf 9980898889999--98----99---999999995969--023 No 98 >PRK09604 putative DNA-binding/iron metalloprotein/AP endonuclease; Reviewed Probab=35.47 E-value=34 Score=15.21 Aligned_cols=58 Identities=17% Similarity=0.213 Sum_probs=27.4 Q ss_pred HHHHHHHHHHHHHHHC-CC---EEEEEECCCCCCCCCCCCCCHH---HHHHHHHHHCCCCCCCCCCCCH Q ss_conf 8999988989999988-98---0999913778889847332247---8888764310133545666310 Q gi|254780653|r 37 TRIERVVPTILELVEK-KA---KVVIFSHLGRPQSKSDKDCSLF---KVVSIAESILHKNILFVNDCIG 98 (400) Q Consensus 37 ~RI~~~~pTI~~l~~~-~a---kvii~SH~GRPkg~~~~~~Sl~---~v~~~l~~~l~~~V~fi~d~~g 98 (400) .-.+...+.++..+++ +- .+-.++==-+| | .- -+|. ..++.|+..++.|+.-++...+ T Consensus 50 ~H~~~i~~lv~~al~~a~i~~~~id~IAvT~gP-G-L~--g~L~VG~~~Ak~La~~~~~Pli~VnH~~g 114 (335) T PRK09604 50 HHVEALPPLLEEALKEAGLSLEDIDAIAVTAGP-G-LV--GALLVGATAAKALALALNKPLIGVNHLEG 114 (335) T ss_pred HHHHHHHHHHHHHHHHCCCCHHHCCEEEEECCC-C-CC--HHHHHHHHHHHHHHHHCCCCEEECCHHHH T ss_conf 999999999999998659998789789994799-9-61--13999999999999980998241125777 No 99 >TIGR01752 flav_long flavodoxin; InterPro: IPR010086 Flavodoxins are small redox-active proteins with a flavin mononucleotide (FMN) prosthetic group. They function as electron transfer agents in a variety of microbial metabolic processes, including nitrogen fixation by nitrogenase , sulphite reduction , and light-dependent NADP+ reduction during photosynthesis . This entry describes the long chain type, typical for nitrogen fixation but associated with pyruvate formate-lyase activation and cobalamin-dependent methionine synthase activity in Escherichia coli .; GO: 0010181 FMN binding, 0006118 electron transport. Probab=34.86 E-value=35 Score=15.15 Aligned_cols=15 Identities=27% Similarity=0.612 Sum_probs=7.2 Q ss_pred CCCHHHHCCCCCEEEE Q ss_conf 7710100159968999 Q gi|254780653|r 5 RTMNDLRDIRGLRCLL 20 (400) Q Consensus 5 ~ti~d~~d~~gk~Vlv 20 (400) ++|+++ ||+||+|=+ T Consensus 78 ~~L~~~-dl~GK~VAl 92 (176) T TIGR01752 78 SDLEEL-DLKGKTVAL 92 (176) T ss_pred HHHHHC-CCCCCEEEE T ss_conf 544202-358876899 No 100 >TIGR01923 menE O-succinylbenzoate-CoA ligase; InterPro: IPR010192 This entry represents the enzyme O-succinylbenzoate-CoA ligase (MenE), which is involved in the fourth step of the menaquinone (vitamin K2) biosynthesis pathway. In bacteria, menaquinone is used during fumarate reduction in anaerobic respiration. In green sulphur bacteria and heliobacteria, menaquinones are thought to be used as a loosely bound secondary electron acceptor in the photosynthetic reaction centre. In cyanobacteria, the product of the menaquinone pathway is phylloquinone (2-methyl-3-phytyl-1,4-naphthoquinone), a molecule used exclusively as an electron transfer cofactor in Photosystem (PS) I. .; GO: 0008756 o-succinylbenzoate-CoA ligase activity, 0009234 menaquinone biosynthetic process. Probab=34.77 E-value=11 Score=18.41 Aligned_cols=81 Identities=20% Similarity=0.312 Sum_probs=44.1 Q ss_pred HHHHHCCCCCEEEECCHHH-------HC--CCCCCEEECCCCCCCCCCHHHH-HHHHHHHHHCCCCCCCEEEEEEECCHH Q ss_conf 6653124364247544044-------01--1376333023321000332556-577765331016678718999607802 Q gi|254780653|r 132 FVRMLSRNGDFYINDAFSV-------SH--RAHASITGLSHLLPSYIGRAMQ-KELSMLESCFSESKKPLVAIVGGSKVS 201 (400) Q Consensus 132 f~k~La~laDiyVnDAF~~-------aH--R~haS~~gi~~~lps~aG~l~e-kEi~~L~~~l~~~~~P~v~IiGGaKis 201 (400) +-|-|..-+-++|+|.|+. +| =.|+|.| |..|..++--.-. ++...|. ...|||++|+ T Consensus 208 l~r~l~~G~tl~~~~~~~~~~l~~~i~~~~vth~SLV--PT~L~rll~~~~~~~~p~~lr----------~iLLGG~~ip 275 (490) T TIGR01923 208 LFRWLLEGATLKVVDKFAAEQLLESIANEKVTHISLV--PTQLKRLLDESGSNEEPESLR----------KILLGGSAIP 275 (490) T ss_pred HHHHHHCCEEEEECCCCCHHHHHHHHHCCCCEEEECC--HHHHHHHHHHCCCCCCHHHHH----------HHHHCCCCCC T ss_conf 9998754747886583018999998507960378720--789999997168986557888----------9870784258 Q ss_pred HHHHHHHHHHHCCCEEEECHHHHHH Q ss_conf 5578899877306888862057899 Q gi|254780653|r 202 TKITLLINLVKKVDKLVIGGGMANS 226 (400) Q Consensus 202 dKi~~i~~L~~k~D~iiigG~lant 226 (400) .++|+.=.++-=-++.|=||--+ T Consensus 276 --~~Li~~a~~~g~p~Y~~YGMTE~ 298 (490) T TIGR01923 276 --APLIEEAQEKGLPIYLSYGMTET 298 (490) T ss_pred --HHHHHHHHHCCCCEEECCCCCEE T ss_conf --78999999738966505763010 No 101 >PRK07424 bifunctional sterol desaturase/short chain dehydrogenase; Validated Probab=34.61 E-value=35 Score=15.13 Aligned_cols=40 Identities=18% Similarity=0.246 Sum_probs=29.4 Q ss_pred CCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCC Q ss_conf 159968999820167131857772589999889899999889809999137788 Q gi|254780653|r 12 DIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFSHLGRP 65 (400) Q Consensus 12 d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRP 65 (400) .++||+|-|-- -.|.+ --.+++++..+|||||.+||..+| T Consensus 177 SLKGKTV~VTG------ASG~L--------G~aL~k~l~~~GAKVIalTs~~~~ 216 (410) T PRK07424 177 SLKGKTVAVTG------ASGTL--------GQALLKELHQQGAKVIALTSNSDK 216 (410) T ss_pred ECCCCEEEEEC------CCCHH--------HHHHHHHHHHCCCEEEEEECCCCC T ss_conf 01686799954------77377--------899999999779989999358986 No 102 >TIGR01196 edd phosphogluconate dehydratase; InterPro: IPR004786 Two dehydratases, dihydroxy-acid dehydratase (gene ilvD or ILV3) and 6-phosphogluconate dehydratase (gene edd) have been shown to be evolutionary related . Dihydroxy-acid dehydratase catalyzes the fourth step in the biosynthesis of isoleucine and valine, the dehydratation of 2,3-dihydroxy-isovaleic acid into alpha-ketoisovaleric acid. 6-Phosphogluconate dehydratase catalyzes the first step in the Entner-Doudoroff pathway, the dehydratation of 6-phospho-D-gluconate into 6-phospho-2-dehydro-3-deoxy-D-gluconate. Another protein containing this signature is the Escherichia coli hypothetical protein yjhG. The N-terminal part of the proteins contains a cysteine that could be involved in the binding of a 2Fe-2S iron-sulphur cluster . This family defines the 6-phosphogluconate dehydratases.; GO: 0004456 phosphogluconate dehydratase activity, 0009255 Entner-Doudoroff pathway. Probab=34.49 E-value=30 Score=15.53 Aligned_cols=38 Identities=16% Similarity=0.286 Sum_probs=16.0 Q ss_pred EEEEECCCCCCCCCCCCC-CHHHHHHHHHHHCCCCCCCCCC Q ss_conf 999913778889847332-2478888764310133545666 Q gi|254780653|r 56 VVIFSHLGRPQSKSDKDC-SLFKVVSIAESILHKNILFVND 95 (400) Q Consensus 56 vii~SH~GRPkg~~~~~~-Sl~~v~~~l~~~l~~~V~fi~d 95 (400) |+++=||| ||....|++ -|-|.--.|+.. +.||..+.| T Consensus 479 VaVVRfQG-PkANGMPELHKLTp~LG~LQDR-GfKVALvTD 517 (614) T TIGR01196 479 VAVVRFQG-PKANGMPELHKLTPPLGVLQDR-GFKVALVTD 517 (614) T ss_pred EEEECCCC-CCCCCCCCCHHCCCCCHHHHCC-CCEEEEEEC T ss_conf 88845787-8887485712015300042408-625999866 No 103 >TIGR03590 PseG pseudaminic acid biosynthesis-associated protein PseG. This protein is found in association with enzymes involved in the biosynthesis of pseudaminic acid, a component of polysaccharide in certain Pseudomonas strains as well as a modification of flagellin in Campylobacter and Hellicobacter. The role of this protein is unclear, although it may participate in N-acetylation in conjunction with, or in the absence of PseH (TIGR03585) as it often scores above the trusted cutoff to pfam00583 representing a family of acetyltransferases. Probab=34.46 E-value=35 Score=15.11 Aligned_cols=40 Identities=25% Similarity=0.265 Sum_probs=30.1 Q ss_pred EEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEECC Q ss_conf 8999820167131857772589999889899999889809999137 Q gi|254780653|r 17 RCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFSHL 62 (400) Q Consensus 17 ~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~ 62 (400) +|++|+|-+.-+.-| -+.+++---+.|.++|..+..+++- T Consensus 2 kI~fr~d~~~~iG~G------H~~RclaLA~~l~~~g~~v~f~~~~ 41 (280) T TIGR03590 2 RILFRADASSEIGLG------HVMRCLTLARELRKRGAEVAFACKT 41 (280) T ss_pred EEEEEEECCCCEEHH------HHHHHHHHHHHHHHCCCEEEEEEEC T ss_conf 799999678991320------8999999999999889949999927 No 104 >cd04254 AAK_UMPK-PyrH-Ec UMP kinase (UMPK)-Ec, the microbial/chloroplast uridine monophosphate kinase (uridylate kinase) enzyme that catalyzes UMP phosphorylation and plays a key role in pyrimidine nucleotide biosynthesis; regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway). The UMP kinase of E. coli (Ec) is known to function as a homohexamer, with GTP and UTP being allosteric effectors. Like other related enzymes (carbamate kinase, aspartokinase, and N-acetylglutamate kinase) the E. coli and most bacterial and chloroplast UMPKs (this CD) have a conserved, N-terminal, lysine residue proposed to function in the catalysis of the phosphoryl group transfer, whereas most archaeal UMPKs appear to lack this residue and the Pyrococcus furiosus structure has an additional Mg ion bound to the ATP molecule which is proposed to function as the catalysis instead. Members of this CD be Probab=34.06 E-value=36 Score=15.07 Aligned_cols=45 Identities=24% Similarity=0.270 Sum_probs=34.0 Q ss_pred CEEEEEEECCCCC-CCCEECCHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 6899982016713-18577725899998898999998898099991 Q gi|254780653|r 16 LRCLLRVDWNVPF-IDGKVADVTRIERVVPTILELVEKKAKVVIFS 60 (400) Q Consensus 16 k~VlvRvD~NvPi-~~g~I~d~~RI~~~~pTI~~l~~~~akvii~S 60 (400) |||.+.+-=.+=. +++.-.|..||++...-|..+.++|.+|+|.. T Consensus 1 kRIViKiGGs~L~~~~~~~ld~~~I~~la~~I~~l~~~G~eVviVv 46 (231) T cd04254 1 KRVLLKLSGEALAGENGFGIDPEVLNRIAREIKEVVDLGVEVAIVV 46 (231) T ss_pred CEEEEEECHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 9599995166607899988599999999999999997798699998 No 105 >PRK10769 folA dihydrofolate reductase; Provisional Probab=33.83 E-value=35 Score=15.17 Aligned_cols=26 Identities=27% Similarity=0.576 Sum_probs=19.5 Q ss_pred CEEEEEEECCHHHHHHHHHHHHHCCCEEEECH Q ss_conf 71899960780255788998773068888620 Q gi|254780653|r 190 PLVAIVGGSKVSTKITLLINLVKKVDKLVIGG 221 (400) Q Consensus 190 P~v~IiGGaKisdKi~~i~~L~~k~D~iiigG 221 (400) +-+.|+||++| -+.+++.+|++.++= T Consensus 89 ~~i~IiGG~~I------Y~~~l~~~d~l~lT~ 114 (159) T PRK10769 89 PEIMVIGGGRV------YEQFLPKAQKLYLTH 114 (159) T ss_pred CEEEEECCHHH------HHHHHHHCCEEEEEE T ss_conf 80999739999------999997789999999 No 106 >pfam11004 DUF2843 Protein of unknown function (DUF2843). This bacterial family of proteins has no known function. Probab=33.51 E-value=29 Score=15.69 Aligned_cols=25 Identities=16% Similarity=0.299 Sum_probs=20.3 Q ss_pred HHHHHHHHHHCCCCCEEECCHHHHC Q ss_conf 0255665531364648982424303 Q gi|254780653|r 98 GSTLSQSIASLSEGGIILAENVRFY 122 (400) Q Consensus 98 g~~~~~~i~~l~~g~ilLLEN~Rf~ 122 (400) .+...+.++.|+.|.||++++++|- T Consensus 8 ~~~~~~~~~~LE~G~VL~fp~l~F~ 32 (281) T pfam11004 8 PEPQRAAIRALEQGKVLYFPKLGFP 32 (281) T ss_pred HHHHHHHHHHHHCCCEEEECCCCCC T ss_conf 6789999999747977986798888 No 107 >COG0626 MetC Cystathionine beta-lyases/cystathionine gamma-synthases [Amino acid transport and metabolism] Probab=33.49 E-value=37 Score=15.01 Aligned_cols=69 Identities=14% Similarity=0.240 Sum_probs=43.9 Q ss_pred HHHHH-HHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCHH Q ss_conf 98898-99999889809999137788898473322478888764310133545666310255665531364648982424 Q gi|254780653|r 41 RVVPT-ILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGSTLSQSIASLSEGGIILAENV 119 (400) Q Consensus 41 ~~~pT-I~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~ 119 (400) +++.| +-.+++.|.+|++.... .-+...++..+-+..|.++.|+++...+...+++.. ++-.++++|-- T Consensus 89 aAI~~~~l~ll~~GD~vl~~~~~---------YG~t~~~~~~~l~~~gi~~~~~d~~~~~~~~~~~~~-~~tk~v~lEtP 158 (396) T COG0626 89 AAISTALLALLKAGDHVLLPDDL---------YGGTYRLFEKILQKFGVEVTFVDPGDDEALEAAIKE-PNTKLVFLETP 158 (396) T ss_pred HHHHHHHHHHCCCCCEEEECCCC---------CCHHHHHHHHHHHHCCEEEEEECCCCHHHHHHHHCC-CCCEEEEEECC T ss_conf 99999999835799989845776---------432999999999856859999789975889987446-68349999679 No 108 >TIGR01487 SPP-like SPP-like hydrolase, Archaeal; InterPro: IPR006382 This group of Archaeal sequences is most closely related to the sucrose-phosphate phosphatases (SPP) from plants and cyanobacteria. If it should be shown that the proteins in this family possess sucrose-phosphatase activity then SPP would represent a broad family of functionally related proteins.. Probab=33.48 E-value=37 Score=15.01 Aligned_cols=74 Identities=23% Similarity=0.345 Sum_probs=51.6 Q ss_pred CCCEECC--H-HHHH-HHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHH Q ss_conf 1857772--5-8999-9889899999889809999137788898473322478888764310133545666310255665 Q gi|254780653|r 29 IDGKVAD--V-TRIE-RVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGSTLSQS 104 (400) Q Consensus 29 ~~g~I~d--~-~RI~-~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~~~ 104 (400) .||++|| . ++|. +++.-|+.+-++|-+|+|+| |.. .|+++.|..+|+..=.|+ T Consensus 8 ~DGTLT~~~~~~~I~~~A~e~iRk~~k~GI~V~Lvt------GN~------~~f~~al~~~i~~~G~v~----------- 64 (223) T TIGR01487 8 IDGTLTDEEKNRLISLRAIEAIRKAEKKGIKVSLVT------GNV------VPFARALAVLIGVSGPVV----------- 64 (223) T ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEE------CCH------HHHHHHHHHHHCCCCCEE----------- T ss_conf 676425444463108889999986752783699980------882------799888887506888689----------- Q ss_pred HHHCCCCCEEECC---HHHHCCCCCCC Q ss_conf 5313646489824---24303432100 Q gi|254780653|r 105 IASLSEGGIILAE---NVRFYSEEERN 128 (400) Q Consensus 105 i~~l~~g~ilLLE---N~Rf~~~E~~n 128 (400) .++|.+++-+ .-||+..++++ T Consensus 65 ---~ENGg~~~~~~~~~~~~~~~~~~~ 88 (223) T TIGR01487 65 ---AENGGVVFFNDDSEDIFLSNEEKE 88 (223) T ss_pred ---EECCCEEEECCCCEEEEEECCCHH T ss_conf ---626838997579803532023214 No 109 >TIGR00338 serB phosphoserine phosphatase SerB; InterPro: IPR004469 Phosphoserine phosphatase (SerB), (3.1.3.3 from EC), also known as O-phosphoserine phosphohydrolase, is involved in both serine and glycine biosynthesis. It catalyzes the reaction 3-phospho-serine + H2O = L-serine + phosphate, which is the last step in the biosynthesis of serine from carbohydrates. The reaction proceeds via the formation of a phosphoryl-enzyme intermediate. It acts as a homodimer, and requires magnesium as a cofactor.; GO: 0004647 phosphoserine phosphatase activity, 0006564 L-serine biosynthetic process. Probab=33.21 E-value=37 Score=14.98 Aligned_cols=43 Identities=23% Similarity=0.174 Sum_probs=31.7 Q ss_pred HHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCC--CCCCC Q ss_conf 98898999998898099991377888984733224788887643101335--45666 Q gi|254780653|r 41 RVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKNI--LFVND 95 (400) Q Consensus 41 ~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V--~fi~d 95 (400) -+..|++.|-++|.||.++| |.|+ .+++++.+.|+.+= .|.+. T Consensus 90 G~~Elv~~L~~~gykVav~S------GGF~------~~a~~~k~~L~LD~tn~faN~ 134 (223) T TIGR00338 90 GAEELVKTLKEKGYKVAVIS------GGFD------LFAEHLKDKLGLDYTNAFANR 134 (223) T ss_pred CHHHHHHHHHHCCEEEEEEE------CCHH------HHHHHHHHHHCCCHHHHHHCE T ss_conf 87899999988690799985------7768------889888998578864721020 No 110 >PRK13542 consensus Probab=33.13 E-value=37 Score=14.97 Aligned_cols=15 Identities=13% Similarity=0.184 Sum_probs=8.1 Q ss_pred CCCCEEEEEEECCCC Q ss_conf 599689998201671 Q gi|254780653|r 13 IRGLRCLLRVDWNVP 27 (400) Q Consensus 13 ~~gk~VlvRvD~NvP 27 (400) ..+|++|=.+||.+. T Consensus 28 ~g~~~il~~isl~i~ 42 (224) T PRK13542 28 RGGRAVFRGIDISLA 42 (224) T ss_pred ECCEEEEECCEEEEC T ss_conf 999998846167875 No 111 >KOG0875 consensus Probab=33.01 E-value=23 Score=16.27 Aligned_cols=40 Identities=20% Similarity=0.331 Sum_probs=34.9 Q ss_pred CCCEEEECCHHHHCCCCCCEEECCCCCCCC-CCHHHHHHHH Q ss_conf 364247544044011376333023321000-3325565777 Q gi|254780653|r 139 NGDFYINDAFSVSHRAHASITGLSHLLPSY-IGRAMQKELS 178 (400) Q Consensus 139 laDiyVnDAF~~aHR~haS~~gi~~~lps~-aG~l~ekEi~ 178 (400) -+|++|--|.+-.|+.-++.+|.+++.-+| .|+|+.+.+- T Consensus 70 ~gd~v~~~a~s~elpkyg~~~GLtNyaAay~TglLLarR~l 110 (264) T KOG0875 70 EGDVIVRAAYAHELPKYGVKVGLTNYAAAYCTGLLLACRLL 110 (264) T ss_pred CCEEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 32399972013346655555464016888766789999999 No 112 >PRK07272 amidophosphoribosyltransferase; Provisional Probab=32.77 E-value=38 Score=14.93 Aligned_cols=128 Identities=16% Similarity=0.251 Sum_probs=61.9 Q ss_pred HHHHCCCCCEEECCH--HHH--CCCCCC-------------------------CHHHHHHHHCCCCCEEEECCHHHHCCC Q ss_conf 553136464898242--430--343210-------------------------002466531243642475440440113 Q gi|254780653|r 104 SIASLSEGGIILAEN--VRF--YSEEER-------------------------NDPDFVRMLSRNGDFYINDAFSVSHRA 154 (400) Q Consensus 104 ~i~~l~~g~ilLLEN--~Rf--~~~E~~-------------------------n~~~f~k~La~laDiyVnDAF~~aHR~ 154 (400) .+.+++|||++.+.. +++ +.++.+ -..++.+.||.-.++ || T Consensus 217 ~irdv~PGEiv~i~~~g~~~~~~~~~~~~~~C~FEyIYFaRpdS~i~g~~Vy~~R~~~G~~LA~e~~~---~~------- 286 (484) T PRK07272 217 FVRDVQPGEVVVIDDDGIQVDSYTDETQLAICSMEYIYFARPDSTIHGVNVHTARKRMGKRLAQEFPV---DA------- 286 (484) T ss_pred EEEECCCCEEEEEECCCEEEEECCCCCCCCCCEEEEEECCCCCCEECCEEHHHHHHHHHHHHHHHCCC---CC------- T ss_conf 99843897699995796589962799988745344211167981387868999999999999835787---78------- Q ss_pred CCCEEECCCC-CCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECC-----HHHHHHHHHHHHHCCCEEEECHHHHHHHH Q ss_conf 7633302332-10003325565777653310166787189996078-----02557889987730688886205789999 Q gi|254780653|r 155 HASITGLSHL-LPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSK-----VSTKITLLINLVKKVDKLVIGGGMANSFL 228 (400) Q Consensus 155 haS~~gi~~~-lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaK-----isdKi~~i~~L~~k~D~iiigG~lantfL 228 (400) --++++|.- .|++.|+.-+.-+..-.-++.+.--..++|.-+-+ +.-|+..++.+++-- +|++ T Consensus 287 -DvVi~VPdSg~~aA~Gya~~~gip~~~glikn~yvgRTFI~p~q~~R~~~v~~Kln~~~~~i~gK-~vvl--------- 355 (484) T PRK07272 287 -DIVIGVPNSSLSAASGYAEESGLPYEMGLVKNQYVARTFIQPTQELREQGVRMKLSAVSGVVKGK-RVVM--------- 355 (484) T ss_pred -CCCCCCCCCHHHHHHHHHHHCCCCHHCCEEEECCCCCCCCCCCHHHHHHHHHHHCCCCHHHHCCC-EEEE--------- T ss_conf -72500788538999999987299511051450311345789967888873634010254553797-5999--------- Q ss_pred HHHCCCCCCCCCCCCCCHHHHHHHHHHHHCCCC Q ss_conf 983014343333100001356889987530245 Q gi|254780653|r 229 VAQGMGVGRSLCQRDFSDNVHQIAWEAKRSACE 261 (400) Q Consensus 229 ~A~G~~IG~Sl~e~~~~~~ak~i~~~a~~~~~~ 261 (400) |-.|++.-. +.++|++.+++-|.+ T Consensus 356 ------VDDSIVRGt---T~k~iv~~Lr~aGAk 379 (484) T PRK07272 356 ------VDDSIVRGT---TSRRIVKLLKEAGAK 379 (484) T ss_pred ------EECCCCCCC---CHHHHHHHHHHCCCC T ss_conf ------825411267---599999999977998 No 113 >PRK01438 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=32.64 E-value=38 Score=14.92 Aligned_cols=61 Identities=15% Similarity=0.263 Sum_probs=33.3 Q ss_pred CHHHHHHHHHHHHHCC--CCCCCEEEEEEECCHHHHHHHHHHHHHCCC-EEEECHHHHHHHHHH Q ss_conf 3255657776533101--667871899960780255788998773068-888620578999998 Q gi|254780653|r 170 GRAMQKELSMLESCFS--ESKKPLVAIVGGSKVSTKITLLINLVKKVD-KLVIGGGMANSFLVA 230 (400) Q Consensus 170 G~l~ekEi~~L~~~l~--~~~~P~v~IiGGaKisdKi~~i~~L~~k~D-~iiigG~lantfL~A 230 (400) |.-+-.|++-+.+++. ....|.++|=|=-==+|.-.++.++++.+. ...++|.++..++.+ T Consensus 101 gi~i~~eiel~~~~~~~~~~~~~~IaVTGTnGKTTTtsli~~iL~~~g~~~~~~GNIG~p~l~~ 164 (481) T PRK01438 101 GIPVWGDVELAWRLRDPDGTPAPWLAVTGTNGKTTTVQMLASILRAAGLRAAAVGNIGVPVLDA 164 (481) T ss_pred CCCEECHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEEECCHHHHHH T ss_conf 9938618999998752136888789993899746099999999996699707996256777764 No 114 >KOG3087 consensus Probab=32.54 E-value=15 Score=17.42 Aligned_cols=50 Identities=24% Similarity=0.422 Sum_probs=33.1 Q ss_pred CCCCEEEEC-CHHHHCCCCCCEEECCCCCCCCC--CHHHHHHHHHHHHH-CCCCCC Q ss_conf 436424754-40440113763330233210003--32556577765331-016678 Q gi|254780653|r 138 RNGDFYIND-AFSVSHRAHASITGLSHLLPSYI--GRAMQKELSMLESC-FSESKK 189 (400) Q Consensus 138 ~laDiyVnD-AF~~aHR~haS~~gi~~~lps~a--G~l~ekEi~~L~~~-l~~~~~ 189 (400) +-.|+||.+ |+.+.|-+|+++ |-.|+.+|. |--.+.-+.-|+.+ +...|| T Consensus 172 KgVDLYVlErA~lsTh~~~~~~--fe~~~~~Y~~~~K~~~avlkk~eeVRlRGRKR 225 (229) T KOG3087 172 KGVDLYVLERAVLSTHDKKALL--FEGFLEGYRASGKQSKAVLKKLEEVRLRGRKR 225 (229) T ss_pred CCCEEEEEHHHHHCCCCCCHHH--HHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCC T ss_conf 5631232467673157650889--99999999850221599998888875114420 No 115 >TIGR01318 gltD_gamma_fam glutamate synthase, small subunit; InterPro: IPR006006 One pathway for the assimilation of ammonia and glutamate biosynthesis involves glutamate synthase (1.4.1.13 from EC) which transfers the amide group of glutamine to 2-oxoglutarate to yield two molecules of glutamate.2 L-glutamate + NADP^+ = L-glutamine + 2-oxoglutarate + NADPH + H^+. This model describes both glutatate synthase small subunit and closely related paralogs of unknown function from a number of gamma and alpha subdivision Proteobacteria, including Escherichia coli. ; GO: 0016639 oxidoreductase activity acting on the CH-NH2 group of donors NAD or NADP as acceptor, 0006537 glutamate biosynthetic process. Probab=32.51 E-value=20 Score=16.64 Aligned_cols=140 Identities=21% Similarity=0.284 Sum_probs=80.8 Q ss_pred HHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHC------CCE-EEECHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 331016678718999607802557889987730------688-8862057899999830143433331000013568899 Q gi|254780653|r 181 ESCFSESKKPLVAIVGGSKVSTKITLLINLVKK------VDK-LVIGGGMANSFLVAQGMGVGRSLCQRDFSDNVHQIAW 253 (400) Q Consensus 181 ~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k------~D~-iiigG~lantfL~A~G~~IG~Sl~e~~~~~~ak~i~~ 253 (400) +++....+| |||||+.= -=|.--+-|... .|+ =-|||.|- +-|..-..|+..+..=++|++ T Consensus 137 S~V~~~g~r--VAviGAGP--AGLaCAD~L~RaGV~v~VfDRhP~iGGLLt--------FGIPsFKLdK~V~~~Rr~if~ 204 (480) T TIGR01318 137 SHVKPTGKR--VAVIGAGP--AGLACADILARAGVQVVVFDRHPEIGGLLT--------FGIPSFKLDKAVLSRRREIFT 204 (480) T ss_pred CCCCCCCCE--EEEECCCC--CHHHHHHHHHHCCCEEEEEECCCCCCCCCC--------CCCCCHHHHHHHHHHHHHHHH T ss_conf 773007827--89977886--025799987517855999747703076013--------688851102789999999997 Q ss_pred HHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCCEECCCC--CC---------------CCCCCHHHHHHHHHHCCCCEEE Q ss_conf 875302455430354113432323333102532120133--45---------------7652003567776405776799 Q gi|254780653|r 254 EAKRSACEIIVPRDVVVAREMKTGIPTQVVSAQSVPLDS--II---------------LDVGFKTVEYIKQVIAQARTVM 316 (400) Q Consensus 254 ~a~~~~~~I~lP~D~~v~~~~~~~~~~~~~~~~~i~~~~--~i---------------~DIGp~Ti~~~~~~I~~AktI~ 316 (400) . -|+++.|-+. |.+ .++++++-.++ -+ =|. |.=+..+.=.|.+.|.+. T Consensus 205 ~---MGi~F~Ln~E--vGr---------D~~l~~LLe~YDAVFlGvGTY~~~~ggLP~eDa-~GV~~ALPFLianTr~lm 269 (480) T TIGR01318 205 A---MGIEFKLNTE--VGR---------DISLDDLLEDYDAVFLGVGTYRSMRGGLPGEDA-PGVLKALPFLIANTRQLM 269 (480) T ss_pred H---CCCEEECCCE--EEC---------CCCHHHHHHHCCEEEEECCCCCCCCCCCCCCCC-CCHHHHHHHHHHCCCEEC T ss_conf 5---8927865816--503---------255544431148489611433431287788774-216664247766152123 Q ss_pred EECC-----CCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCHHHH Q ss_conf 9766-----2532274365889999999986111488589974804699 Q gi|254780653|r 317 WNGP-----LGVFEIEPFDRATVEVAHYVAKLTKERRIISIAGGGDTIT 360 (400) Q Consensus 317 WNGP-----~GvfE~~~F~~GT~~l~~~ia~~~~~~~~~sivGGGdT~a 360 (400) =++| .|.-+..+| ++ . +-+-..|.|||||.+ T Consensus 270 Gl~eyGrPiaGw~~~~P~----------~~-~--~Gk~VVVLGGGDTaM 305 (480) T TIGR01318 270 GLPEYGRPIAGWEPEEPL----------ID-V--EGKRVVVLGGGDTAM 305 (480) T ss_pred CCCCCCCCCCCCCCCCCC----------CC-C--CCCEEEEECCCCCHH T ss_conf 788877744677766774----------23-4--776689858887525 No 116 >cd07101 ALDH_SSADH2_GabD2 Mycobacterium tuberculosis succinate-semialdehyde dehydrogenase 2-like. Succinate-semialdehyde dehydrogenase 2 (SSADH2) and similar proteins are in this CD. SSADH1 (GabD1, EC=1.2.1.16) catalyzes the NADP(+)-dependent oxidation of succinate semialdehyde to succinate. SSADH activity in Mycobacterium tuberculosis is encoded by both gabD1 (Rv0234c) and gabD2 (Rv1731), however ,the Vmax of GabD1 was shown to be much higher than that of GabD2, and GabD2 (SSADH2) is likely to serve physiologically as a dehydrogenase for a different aldehyde(s). Probab=31.95 E-value=39 Score=14.85 Aligned_cols=51 Identities=14% Similarity=0.285 Sum_probs=26.4 Q ss_pred CCEEEECHHHHHHHHHH-----HCCCCCC---------CCCCCCCCHHHHHHHHHHHHCCCCCCC Q ss_conf 68888620578999998-----3014343---------333100001356889987530245543 Q gi|254780653|r 214 VDKLVIGGGMANSFLVA-----QGMGVGR---------SLCQRDFSDNVHQIAWEAKRSACEIIV 264 (400) Q Consensus 214 ~D~iiigG~lantfL~A-----~G~~IG~---------Sl~e~~~~~~ak~i~~~a~~~~~~I~l 264 (400) .+++++-..++..|+.. +.+.+|. .++.+...+.++++++.+.+.|.++++ T Consensus 262 ~~ri~V~~~i~d~f~~~l~~~~~~l~vG~p~d~~~~~Gpli~~~~~~~i~~~i~~A~~~Ga~~~~ 326 (454) T cd07101 262 IERIYVHESVYDEFVRRFVARTRALRLGAALDYGPDMGSLISQAQLDRVTAHVDDAVAKGATVLA 326 (454) T ss_pred CCEEEEECCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 62899961228999999998775245789754346638877999999999999999977999997 No 117 >COG1509 KamA Lysine 2,3-aminomutase [Amino acid transport and metabolism] Probab=31.81 E-value=39 Score=14.83 Aligned_cols=40 Identities=20% Similarity=0.222 Sum_probs=23.3 Q ss_pred HHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHC---CCEE Q ss_conf 765331016678718999607802557889987730---6888 Q gi|254780653|r 178 SMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKK---VDKL 217 (400) Q Consensus 178 ~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k---~D~i 217 (400) .+|+-+-+.|.=-=|.+=||--++=+-.-|+.|+++ .+++ T Consensus 148 ~al~YIa~hPeI~eVllSGGDPL~ls~~~L~~ll~~L~~IpHv 190 (369) T COG1509 148 KALDYIAAHPEIREVLLSGGDPLSLSDKKLEWLLKRLRAIPHV 190 (369) T ss_pred HHHHHHHCCCHHHEEEECCCCCCCCCHHHHHHHHHHHHCCCCE T ss_conf 9999997395165177407875636889999999987548964 No 118 >pfam00946 Paramyx_RNA_pol Paramyxovirus RNA dependent RNA polymerase. Paramyxoviridae, like other non-segmented negative strand RNA viruses, have an RNA-dependent RNA polymerase composed of two subunits, a large protein L and a phosphoprotein P. This is a protein family of the L protein. The L protein confers the RNA polymerase activity on the complex. The P protein acts as a transcription factor. Probab=31.71 E-value=39 Score=14.82 Aligned_cols=44 Identities=23% Similarity=0.218 Sum_probs=27.1 Q ss_pred CEEEEECCCCCCCCCC---HHHHHHHHHHHHHHHHCCCCCEEEECCCHH Q ss_conf 6799976625322743---658899999999861114885899748046 Q gi|254780653|r 313 RTVMWNGPLGVFEIEP---FDRATVEVAHYVAKLTKERRIISIAGGGDT 358 (400) Q Consensus 313 ktI~WNGP~GvfE~~~---F~~GT~~l~~~ia~~~~~~~~~sivGGGdT 358 (400) .-++|+||+|=.|==. ..-=|..++..+|..+ +--++++++||- T Consensus 672 ~~~~~~g~~GGiEGLrQK~WTi~si~~I~~~a~~~--~~~v~~l~QGDN 718 (1161) T pfam00946 672 GDIFIVGPMGGIEGLCQKLWTIISILVIDLVAVET--NTRVKSLVQGDN 718 (1161) T ss_pred CCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHH--CCEEEEECCCCC T ss_conf 88689558853456673132999999999999997--967999715787 No 119 >PRK10640 rhaB rhamnulokinase; Provisional Probab=31.68 E-value=25 Score=16.06 Aligned_cols=68 Identities=24% Similarity=0.254 Sum_probs=39.0 Q ss_pred CHHHHHHHHHHHHHHHHCCC-CCEEEECCCHHHHHHHHC-----CCCCCC--CEEECCHHHHHHHHCCCCCHHHHH Q ss_conf 36588999999998611148-858997480469989984-----887686--468750799899874899868988 Q gi|254780653|r 328 PFDRATVEVAHYVAKLTKER-RIISIAGGGDTITALAHA-----GISNEF--TYVSTAGGAFLEWLEGKDLPGIVA 395 (400) Q Consensus 328 ~F~~GT~~l~~~ia~~~~~~-~~~sivGGGdT~aai~~~-----g~~~~~--~hvSTgGGA~L~~L~G~~LPgi~a 395 (400) .-+...+.+++.+.+.+..+ ....++|||--.....++ |..=.- ..-+.-|-+++.++.-..++-++. T Consensus 356 slA~~~r~~l~~le~~tG~~~~~l~vvGGGsrn~ll~Q~~ADi~g~pVv~g~~EaTalGn~~~q~~a~G~~~~l~e 431 (471) T PRK10640 356 SLALLYADVLHELAQLRGEPFSQLHIVGGGCQNALLNQLCADACGIRVIAGPVEASTLGNIGIQLMTLDELNNVDD 431 (471) T ss_pred HHHHHHHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHHCCCEECCCCHHHHHHHHHHHHHHCCCCCCHHH T ss_conf 9999999999999996299967699978576659999998998299665287058799999999997698389999 No 120 >PRK01045 ispH 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Reviewed Probab=31.41 E-value=40 Score=14.79 Aligned_cols=23 Identities=13% Similarity=0.436 Sum_probs=9.4 Q ss_pred HHHHHHHHHCCCEEEEEECCCCC Q ss_conf 89899999889809999137788 Q gi|254780653|r 43 VPTILELVEKKAKVVIFSHLGRP 65 (400) Q Consensus 43 ~pTI~~l~~~~akvii~SH~GRP 65 (400) -..++.+.++|..+|++.+-+-| T Consensus 103 ~~~v~~~~~~Gy~iviiG~~~Hp 125 (304) T PRK01045 103 HKEVARMSREGYEIILIGHKGHP 125 (304) T ss_pred HHHHHHHHHCCCEEEEECCCCCH T ss_conf 99999998589889998178988 No 121 >PRK00694 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase; Validated Probab=31.39 E-value=40 Score=14.79 Aligned_cols=241 Identities=15% Similarity=0.145 Sum_probs=128.4 Q ss_pred CCCCCCCHHHHCCCCCEEEEEEECCCCCC-CCEE-------CCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCC Q ss_conf 98887710100159968999820167131-8577-------725899998898999998898099991377888984733 Q gi|254780653|r 1 MSRLRTMNDLRDIRGLRCLLRVDWNVPFI-DGKV-------ADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKD 72 (400) Q Consensus 1 M~~~~ti~d~~d~~gk~VlvRvD~NvPi~-~g~I-------~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~ 72 (400) ||--+++... .+-|+--|+|- |||+. +.-| +|+.-+++++.-|+.|.++||-+|=+|-++.. .- T Consensus 1 ~~~~~~~~~~--~Rr~Tr~V~vG-~v~iGg~~Pi~vQSMt~t~T~D~~atv~Qi~~l~~aGceiVRvtvp~~~-----~A 72 (606) T PRK00694 1 MMATPCIQNA--FRRKTHSVRIG-NLFVGSEHSIKIQSMTTTATTDVDGTVRQICALQEHGCDIVRVTVQGIK-----EA 72 (606) T ss_pred CCCCCCCCCC--CCCCCEEEEEC-CCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCHH-----HH T ss_conf 9767654555--35576368985-9665799967556468998636899999999999859998998779999-----99 Q ss_pred CCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCHHHHCCCCC-----------CCHHHHHHHHCCCCC Q ss_conf 224788887643101335456663102556655313646489824243034321-----------000246653124364 Q gi|254780653|r 73 CSLFKVVSIAESILHKNILFVNDCIGSTLSQSIASLSEGGIILAENVRFYSEEE-----------RNDPDFVRMLSRNGD 141 (400) Q Consensus 73 ~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~-----------~n~~~f~k~La~laD 141 (400) -.|..+.+.|.+ -+.+|..+.|++... +.+.... ++ .|.+|-++|-- -.|.++++.+...-+ T Consensus 73 ~al~~I~~~l~~-~~~~iPlvADIHF~~-~~a~~a~-~~----~~kiRINPGN~~~~~~~f~~~~~~deey~~~~~~~~e 145 (606) T PRK00694 73 QACEHIKERLIQ-QGISIPLVADIHFFP-QAAMHVA-DF----VDKVRINPGNYVDKRNMFTGKIYTDEQYAHSLLRLEE 145 (606) T ss_pred HHHHHHHHHHHH-CCCCCCEEEECCCCH-HHHHHHH-HH----CCCEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 869999999863-699999886668888-9999999-75----2838888997666644444333317777777776888 Q ss_pred EE---EEC--CHHHHCC---CCCCEEE--CCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHH Q ss_conf 24---754--4044011---3763330--233210003325565777653310166787189996078025578899877 Q gi|254780653|r 142 FY---IND--AFSVSHR---AHASITG--LSHLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLV 211 (400) Q Consensus 142 iy---VnD--AF~~aHR---~haS~~g--i~~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~ 211 (400) -| |.- ..+++-| +|-|+-. ..+|=... =-++|.-++++.-+-+..=.-++.=+=-|.+-.-+..-+-|. T Consensus 146 ~~~~~v~~~k~~~~~iRIGvN~GSL~~~i~~~yG~t~-~~mveSale~~~i~e~~~f~diviS~KaSn~~~~v~ayrll~ 224 (606) T PRK00694 146 KFSPLVEKCKRLGKAMRIGVNHGSLSERIMQRYGDTI-EGMVYSALEYIEVCEKLDYRDVVFSMKSSNPKVMVAAYRQLA 224 (606) T ss_pred HHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHCCCCH-HHHHHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHH T ss_conf 9999999999849987873386766899996049948-999999999999999789980899986487899999999999 Q ss_pred HCCC-----E-EEEC-------------H--HHHHHHHHHHCCCCCCCCCCC--CCCHHHHHHHHHHHH Q ss_conf 3068-----8-8862-------------0--578999998301434333310--000135688998753 Q gi|254780653|r 212 KKVD-----K-LVIG-------------G--GMANSFLVAQGMGVGRSLCQR--DFSDNVHQIAWEAKR 257 (400) Q Consensus 212 ~k~D-----~-iiig-------------G--~lantfL~A~G~~IG~Sl~e~--~~~~~ak~i~~~a~~ 257 (400) +++| + +=+| - |++.....--|=.|--||-|+ ..+..+++|+....+ T Consensus 225 ~~~~~egm~YPLHlGvTEAG~~~~G~iKSa~GiG~LL~~GiGDTiRVSLt~dP~~Ei~v~~~l~~~~~~ 293 (606) T PRK00694 225 KDLDARGWLYPLHLGVTEAGMGTDGIIKSAVGIGTLLSEGLGDTIRCSLTGCPTTEIPVCISLLKHTTI 293 (606) T ss_pred HHHHHCCCCCCEEEEECCCCCCCCCEEHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHH T ss_conf 998734687661662404668767521278999999971687438985578946661899999998887 No 122 >COG1648 CysG Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain) [Coenzyme metabolism] Probab=30.84 E-value=40 Score=14.73 Aligned_cols=42 Identities=26% Similarity=0.352 Sum_probs=30.8 Q ss_pred CCCCCCCHHHHCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 988877101001599689998201671318577725899998898999998898099991 Q gi|254780653|r 1 MSRLRTMNDLRDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFS 60 (400) Q Consensus 1 M~~~~ti~d~~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~S 60 (400) |+.|+- +.+++||+|+| +..|++ +.--++.+++.||++++.| T Consensus 1 ~~~lPl---~~~l~~k~Vlv-------vGgG~v--------a~rKa~~ll~~ga~v~Vvs 42 (210) T COG1648 1 MNYLPL---FLDLEGKKVLV-------VGGGSV--------ALRKARLLLKAGADVTVVS 42 (210) T ss_pred CCCCCE---EEECCCCEEEE-------ECCCHH--------HHHHHHHHHHCCCEEEEEC T ss_conf 985026---99819977999-------899899--------9999999974699799987 No 123 >TIGR03641 cas1_HMARI CRISPR-associated protein Cas1, HMARI/TNEAP subtype. It describes Cas1 subgroup that includes Cas1 proteins of the related HMARI and TNEAP subtypes of CRISPR/Cas system. Probab=30.55 E-value=28 Score=15.74 Aligned_cols=46 Identities=11% Similarity=0.093 Sum_probs=34.7 Q ss_pred CCCCC-CEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCC Q ss_conf 71318-57772589999889899999889809999137788898473 Q gi|254780653|r 26 VPFID-GKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDK 71 (400) Q Consensus 26 vPi~~-g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~ 71 (400) +|+.+ ..|.=...+.-+.+.|+.++++|-.|+.+++.|+|.|...+ T Consensus 29 iPl~~i~~Ivi~g~v~ist~~l~~l~~~gI~v~f~~~~G~~~g~l~p 75 (322) T TIGR03641 29 IPVENIDEIYVFGEVSLNSKALSFLSKKGIPIHFFNYYGYYSGSFYP 75 (322) T ss_pred ECHHHCCEEEEECCCCCCHHHHHHHHHCCCEEEEECCCCCEEEEEEC T ss_conf 37577688999848845899999999879949998999988799942 No 124 >TIGR00612 ispG_gcpE 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase; InterPro: IPR004588 This protein previously of unknown biochemical function is essential in Escherichia coli. It has now been characterised as 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase, which converts 2C-methyl-D-erythritol 2,4-cyclodiphosphate (ME-2,4CPP) into 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate in the sixth step of nonmevalonate terpenoid biosynthesis. The family is restricted to bacteria, where it is widely but not universally distributed. No homology can be detected between this family and other proteins.; GO: 0016114 terpenoid biosynthetic process. Probab=30.54 E-value=22 Score=16.41 Aligned_cols=271 Identities=16% Similarity=0.179 Sum_probs=144.1 Q ss_pred CCHH--HHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHH--HHHHHHHHCC Q ss_conf 7258--99998898999998898099991377888984733224788887643101335456663102--5566553136 Q gi|254780653|r 34 ADVT--RIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGS--TLSQSIASLS 109 (400) Q Consensus 34 ~d~~--RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~--~~~~~i~~l~ 109 (400) +++. =|++++.-|+.|-+.||-||=+|=.++ ..---+..+++.|+. |+.+|-.|-|.+-. +...+...+. T Consensus 34 t~T~GaD~~~tv~qi~~L~e~GcdIVRvtVp~~-----e~a~A~~~I~~rL~a-l~~~vPLVADiHf~Ghk~~~~~~~~~ 107 (633) T TIGR00612 34 TDTLGADVDATVEQIRALEEAGCDIVRVTVPDK-----ESAEALEEIKERLQA-LGLNVPLVADIHFAGHKYKKAALAVA 107 (633) T ss_pred CCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCH-----HHHHHHHHHHHHHHH-CCCCCCEEEEECCCCCCHHHHHHHHH T ss_conf 450131089999999999971994899960884-----679999999999986-26888748873025661468999986 Q ss_pred CCCEEECCHHHHCCCC-----------CCCHHHHHHHHCCCCCEE-----------------EECCHHHHC----CCCCC Q ss_conf 4648982424303432-----------100024665312436424-----------------754404401----13763 Q gi|254780653|r 110 EGGIILAENVRFYSEE-----------ERNDPDFVRMLSRNGDFY-----------------INDAFSVSH----RAHAS 157 (400) Q Consensus 110 ~g~ilLLEN~Rf~~~E-----------~~n~~~f~k~La~laDiy-----------------VnDAF~~aH----R~haS 157 (400) .+ +.-+|-+||- .-.+.+||+.+-++-+.+ || -|.-- |.-.. T Consensus 108 ~~----v~kvRINPGNig~k~~~~~~~~Y~~~~yA~~~~~~~e~v~~~Ve~ar~~G~a~RIGVN--~GSL~~~~~~~ll~ 181 (633) T TIGR00612 108 KG----VAKVRINPGNIGFKKKDAKRKIYSDAKYAEILLKVEERVRDVVEKARRHGKAMRIGVN--HGSLEQVLKRRLLE 181 (633) T ss_pred HC----CCEEEECCCCCCCCCCHHHCCCCCHHHHHHHHCCHHHHHHHHHHHHHHCCCCEEEECC--CCCCHHHHHHHHHH T ss_conf 11----7825437876434670120045642578886300467889999999857896875035--67660899999999 Q ss_pred EEECC-CCCCC---CCCHHHHHHHHHHHHHCC-CCC-CCEEEEEEECCHHHHHHHHHHHHHCCC-----EEEEC------ Q ss_conf 33023-32100---033255657776533101-667-871899960780255788998773068-----88862------ Q gi|254780653|r 158 ITGLS-HLLPS---YIGRAMQKELSMLESCFS-ESK-KPLVAIVGGSKVSTKITLLINLVKKVD-----KLVIG------ 220 (400) Q Consensus 158 ~~gi~-~~lps---~aG~l~ekEi~~L~~~l~-~~~-~P~v~IiGGaKisdKi~~i~~L~~k~D-----~iiig------ 220 (400) .||=- .-+.. ++==++|.=|+....+=+ +.. --++.=+=+|.+.+-+.+=+.|.++.| .=|== T Consensus 182 ~Yg~~~~~~~~qev~~EA~V~SAl~~a~~~e~~~f~rd~~v~S~KaS~v~~~vaAy~~lA~~~D~~G~~~PLHLGvTEAG 261 (633) T TIGR00612 182 KYGDKGFSLTAQEVTAEAMVQSALESAEILEKLDFRRDNVVLSMKASDVAETVAAYRLLAERSDELGMLYPLHLGVTEAG 261 (633) T ss_pred HHCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCHHHHHHHHHHHHHHHHHCCCCCCCCCCCEECC T ss_conf 71546832174315878999999999999985089446079998418767899999999999865035252425412028 Q ss_pred --------HHHHHHHHHHHCC--CCCCCCCCCC---CCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCCE Q ss_conf --------0578999998301--4343333100---00135688998753024554303541134323233331025321 Q gi|254780653|r 221 --------GGMANSFLVAQGM--GVGRSLCQRD---FSDNVHQIAWEAKRSACEIIVPRDVVVAREMKTGIPTQVVSAQS 287 (400) Q Consensus 221 --------G~lantfL~A~G~--~IG~Sl~e~~---~~~~ak~i~~~a~~~~~~I~lP~D~~v~~~~~~~~~~~~~~~~~ 287 (400) =.+|.-.|.++|+ .|--||-+.- ...+++-.-... +..+|+| + T Consensus 262 ~g~~GivKS~~gig~LL~~GIGDTiRvSLt~~PnGdr~~Ev~va~sll--rhtkiyl----------------------d 317 (633) T TIGR00612 262 MGVKGIVKSSVGIGILLAEGIGDTIRVSLTDDPNGDRVKEVPVARSLL--RHTKIYL----------------------D 317 (633) T ss_pred CCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCEEHHHHHHHHH--HHHHEEE----------------------E T ss_conf 888734566789999998458957998537888748220116666776--5310023----------------------0 Q ss_pred ECCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCCCC----HHHHHHHHHHHHH Q ss_conf 20133457652003567776405776799976625322743----6588999999998 Q gi|254780653|r 288 VPLDSIILDVGFKTVEYIKQVIAQARTVMWNGPLGVFEIEP----FDRATVEVAHYVA 341 (400) Q Consensus 288 i~~~~~i~DIGp~Ti~~~~~~I~~AktI~WNGP~GvfE~~~----F~~GT~~l~~~ia 341 (400) +|.....|-+ -.+-...+..=+-|||-+|--..|+|-.=. -+.--.+|++.+- T Consensus 318 lPekknPfsl-qhsenfvsaaekPakttlWGdvyGvflklyPhhltdftPeellehlG 374 (633) T TIGR00612 318 LPEKKNPFSL-QHSENFVSAAEKPAKTTLWGDVYGVFLKLYPHHLTDFTPEELLEHLG 374 (633) T ss_pred CCCCCCCCCC-HHHHHHHHHCCCCCCCEECHHHHEEHHHHCCHHHCCCCHHHHHHHHC T ss_conf 6666686310-11123454215642200100130001011211321478679998717 No 125 >COG0334 GdhA Glutamate dehydrogenase/leucine dehydrogenase [Amino acid transport and metabolism] Probab=30.53 E-value=41 Score=14.70 Aligned_cols=26 Identities=27% Similarity=0.123 Sum_probs=11.2 Q ss_pred EEEEEEECCHHHHHHHHHHHHHCCCEEE Q ss_conf 1899960780255788998773068888 Q gi|254780653|r 191 LVAIVGGSKVSTKITLLINLVKKVDKLV 218 (400) Q Consensus 191 ~v~IiGGaKisdKi~~i~~L~~k~D~ii 218 (400) .|+|-|=..+. .-+.+.|.+.--+|+ T Consensus 209 rVaVQG~GNVg--~~aa~~l~~~GAkvv 234 (411) T COG0334 209 RVAVQGFGNVG--QYAAEKLHELGAKVV 234 (411) T ss_pred EEEEECCCHHH--HHHHHHHHHCCCEEE T ss_conf 89997762899--999999997499799 No 126 >TIGR02180 GRX_euk Glutaredoxin; InterPro: IPR011899 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This entry is found in eukaryotic glutaredoxins and includes sequences from fungi, plants and metazoans as well as viruses .; GO: 0006118 electron transport, 0045454 cell redox homeostasis. Probab=30.47 E-value=40 Score=14.76 Aligned_cols=51 Identities=24% Similarity=0.453 Sum_probs=33.5 Q ss_pred EEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCC---CE---EEECCCHHHHHHHHCCC Q ss_conf 997662532274365889999999986111488---58---99748046998998488 Q gi|254780653|r 316 MWNGPLGVFEIEPFDRATVEVAHYVAKLTKERR---II---SIAGGGDTITALAHAGI 367 (400) Q Consensus 316 ~WNGP~GvfE~~~F~~GT~~l~~~ia~~~~~~~---~~---sivGGGdT~aai~~~g~ 367 (400) |=.+++.+.|.+..+.| .+|=.++.+.|..+- .| .=|||.|...++.+-|- T Consensus 24 ~~~~~~~v~ElD~~~~g-~~~Q~~L~~~TG~~TVP~iFi~g~~iGG~~D~~~~~~~G~ 80 (85) T TIGR02180 24 VKPAKYKVVELDQLSNG-SEIQDYLKEITGQRTVPNIFINGKFIGGCSDLLALDKSGK 80 (85) T ss_pred CCCCCCEEEEECCCCCH-HHHHHHHHHHCCCCCCCCEEECCEEECCCHHHHHHHCCCC T ss_conf 98676406770257885-7899999984489238826568857678245565412520 No 127 >PRK13402 gamma-glutamyl kinase; Provisional Probab=30.39 E-value=41 Score=14.68 Aligned_cols=212 Identities=17% Similarity=0.205 Sum_probs=100.1 Q ss_pred CCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEEC---------CCCCCCCCCCCCCHHHHHHHH Q ss_conf 15996899982016713185777258999988989999988980999913---------778889847332247888876 Q gi|254780653|r 12 DIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFSH---------LGRPQSKSDKDCSLFKVVSIA 82 (400) Q Consensus 12 d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH---------~GRPkg~~~~~~Sl~~v~~~l 82 (400) |-.-|||.|.+-=++=..+..-.|..||..-...|..|.++|..|||.|. +|.++ .+..+.++.+... T Consensus 2 ~~~~kRIVIKiGSs~lt~~~g~l~~~~i~~l~~~I~~L~~~G~evilVSSGAia~G~~~L~~~~---rp~l~~kQA~AAv 78 (363) T PRK13402 2 DSPWKRIVVKVGSALITPHKQGCSSHYLLGIAQFIVYLRAQGHQVVLVSSGSVAAGYHKLGFID---RPSVPEKKAMAAA 78 (363) T ss_pred CCCCCEEEEEECCCEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHCCCCC---CCCHHHHHHHHHH T ss_conf 9988669999670106899998789999999999999997899899988787996477609976---8975788899985 Q ss_pred H---------HHCCCC---C-----CCCCCCCHHHHHHHHHHC-CCCCE-EECCHHHHCCCCC--CCHHHHHHHHCCC-- Q ss_conf 4---------310133---5-----456663102556655313-64648-9824243034321--0002466531243-- Q gi|254780653|r 83 E---------SILHKN---I-----LFVNDCIGSTLSQSIASL-SEGGI-ILAENVRFYSEEE--RNDPDFVRMLSRN-- 139 (400) Q Consensus 83 ~---------~~l~~~---V-----~fi~d~~g~~~~~~i~~l-~~g~i-lLLEN~Rf~~~E~--~n~~~f~k~La~l-- 139 (400) . +..+.. | .|.+.-.-..+++.+..| +.|=| +.=||=---.+|. ..+..++-..|.+ T Consensus 79 GQ~~Lm~~Y~~~F~~~~AQiLLT~~D~~~r~rylN~r~Tl~~Ll~~gvIPIINENDtVat~Ei~fGDND~LaA~VA~li~ 158 (363) T PRK13402 79 GQGDMMATWSKLFDFPAAQLLLTHGDLRDRERYISIRNTINVLLEHGILPIINENDAVTTDKLKVGDNDNLSAMVAALAD 158 (363) T ss_pred HHHHHHHHHHHHHCCHHHHHHCCHHHHCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEECCCCCHHHHHHHHHCC T ss_conf 39999999999847357677188878616478899999999999689422115888501001211575079999998718 Q ss_pred CCEEEE----CCH-HHHCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHC- Q ss_conf 642475----440-44011376333023321000332556577765331016678718999607802557889987730- Q gi|254780653|r 140 GDFYIN----DAF-SVSHRAHASITGLSHLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKK- 213 (400) Q Consensus 140 aDiyVn----DAF-~~aHR~haS~~gi~~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k- 213 (400) ||..|. |.. ..--|.|+.- +.+|..- -+..|+..+.. .... -.=-|| +.+||...+-.... T Consensus 159 ADlLilLTDvdGLYd~dP~~~~~A----klI~~V~--~i~~~i~~~a~---~~~s--~~GtGG--M~TKl~AA~~a~~~G 225 (363) T PRK13402 159 ADTLIICSDVDGLYDQNPRTNPDA----KLIKEVT--EINAEIYAMAG---GAGS--EVGTGG--MRTKIQAAKKAISHG 225 (363) T ss_pred CCEEEEECCCCCCCCCCCCCCCCC----CEEEEEC--CCCHHHHHHHC---CCCC--CCCCCC--HHHHHHHHHHHHHCC T ss_conf 888999416684447998778766----4367851--56677898716---8778--888783--478999999999779 Q ss_pred CCEEEECHHHHHHHHH-HHCCCCCCCC Q ss_conf 6888862057899999-8301434333 Q gi|254780653|r 214 VDKLVIGGGMANSFLV-AQGMGVGRSL 239 (400) Q Consensus 214 ~D~iiigG~lantfL~-A~G~~IG~Sl 239 (400) +..+|.-|.-.+.+.. ..|..+|.-+ T Consensus 226 i~~~Ia~G~~~~~i~~i~~g~~~GT~F 252 (363) T PRK13402 226 IETFIINGFTADIFNQLLKGQNPGTLF 252 (363) T ss_pred CCEEECCCCCCCHHHHHHCCCCCCEEE T ss_conf 969981699974799997089862499 No 128 >COG2453 CDC14 Predicted protein-tyrosine phosphatase [Signal transduction mechanisms] Probab=30.13 E-value=42 Score=14.65 Aligned_cols=33 Identities=36% Similarity=0.673 Sum_probs=17.6 Q ss_pred CCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 713185777258999988989999988980999 Q gi|254780653|r 26 VPFIDGKVADVTRIERVVPTILELVEKKAKVVI 58 (400) Q Consensus 26 vPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii 58 (400) .|+.|+..-|...++..++-|.+..++|.||++ T Consensus 78 ~~~~D~~~p~~~~l~~~v~~i~~~~~~g~kVvV 110 (180) T COG2453 78 LPILDGTVPDLEDLDKIVDFIEEALSKGKKVVV 110 (180) T ss_pred EEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 103677788999999999999999844986999 No 129 >KOG0584 consensus Probab=30.09 E-value=37 Score=14.97 Aligned_cols=10 Identities=20% Similarity=0.507 Sum_probs=6.6 Q ss_pred CCEEECCCCC Q ss_conf 6333023321 Q gi|254780653|r 156 ASITGLSHLL 165 (400) Q Consensus 156 aS~~gi~~~l 165 (400) +|++|-|.|+ T Consensus 202 ksvIGTPEFM 211 (632) T KOG0584 202 KSVIGTPEFM 211 (632) T ss_pred CEECCCCCCC T ss_conf 4122683534 No 130 >PRK13306 ulaD 3-keto-L-gulonate-6-phosphate decarboxylase; Provisional Probab=29.83 E-value=42 Score=14.62 Aligned_cols=94 Identities=13% Similarity=0.129 Sum_probs=53.8 Q ss_pred HHHHHHHCCCC-CEEEECCHHHHCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHH Q ss_conf 24665312436-42475440440113763330233210003325565777653310166787189996078025578899 Q gi|254780653|r 130 PDFVRMLSRNG-DFYINDAFSVSHRAHASITGLSHLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLI 208 (400) Q Consensus 130 ~~f~k~La~la-DiyVnDAF~~aHR~haS~~gi~~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~ 208 (400) .+.++.|..++ |.. +.||+.-+-. ...-.-.++++.+.++... ...++|-||.+.++ ++.+ T Consensus 119 ~e~a~~~~~lgv~~~------i~H~~~D~~~--------~g~~~~~~~~~~ik~l~~~--~~~vaVaGGI~~~~-~~~~- 180 (216) T PRK13306 119 WEQAQQWRDAGIQQV------IYHRSRDAEL--------AGVAWGEKDLNKVKKLSDM--GFKVSVTGGIVPED-LKLF- 180 (216) T ss_pred HHHHHHHHHCCCCEE------EEECCHHHHH--------CCCCCCHHHHHHHHHHHCC--CCEEEECCCCCHHH-HHHH- T ss_conf 888999997699878------8760322442--------4678887789999997636--98299859989899-9998- Q ss_pred HHHH-CCCEEEECHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 8773-0688886205789999983014343333100001356889987530 Q gi|254780653|r 209 NLVK-KVDKLVIGGGMANSFLVAQGMGVGRSLCQRDFSDNVHQIAWEAKRS 258 (400) Q Consensus 209 ~L~~-k~D~iiigG~lantfL~A~G~~IG~Sl~e~~~~~~ak~i~~~a~~~ 258 (400) .+ .+|.+++|+++.+ -+|..+.++++.++.+++ T Consensus 181 --~~~~~~ivIVGraIt~---------------a~dP~~aA~~i~~~I~~~ 214 (216) T PRK13306 181 --KGIPIKTFIAGRAIRG---------------AKDPAEAARAFKDEIAKY 214 (216) T ss_pred --HCCCCCEEEECCCCCC---------------CCCHHHHHHHHHHHHHHH T ss_conf --6279989998852358---------------999999999999999986 No 131 >TIGR01249 pro_imino_pep_1 proline iminopeptidase; InterPro: IPR005944 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of serine peptidase belong to MEROPS peptidase family S33 (clan SC). They are proline iminopeptidase (Prolyl aminopeptidase, 3.4.11.5 from EC), which catalyzes the removal of the N-terminal proline from peptides. This family represents one of two related families of proline iminopeptidase containing the alpha/beta fold. The fine specificities of the various members, including both the range of short peptides from which proline can be removed and whether other amino acids such as alanine can be also removed, may vary among members. ; GO: 0016804 prolyl aminopeptidase activity, 0005737 cytoplasm. Probab=29.52 E-value=17 Score=17.22 Aligned_cols=30 Identities=27% Similarity=0.466 Sum_probs=22.6 Q ss_pred HHHCCCCCC--CHHHHHHHHCCC-CCEEEECCH Q ss_conf 430343210--002466531243-642475440 Q gi|254780653|r 119 VRFYSEEER--NDPDFVRMLSRN-GDFYINDAF 148 (400) Q Consensus 119 ~Rf~~~E~~--n~~~f~k~La~l-aDiyVnDAF 148 (400) +||..++.. .|+.|+..+|.+ ..+|||-.| T Consensus 204 L~~~~~~~~~aed~~~~la~ArlEnHYfVNkgF 236 (310) T TIGR01249 204 LRPENEIVSTAEDAKFSLALARLENHYFVNKGF 236 (310) T ss_pred HCCCHHHHHCCCCHHHHHHHHHHHHHHHHCCCC T ss_conf 134714341114778999998775102204640 No 132 >pfam05496 RuvB_N Holliday junction DNA helicase ruvB N-terminus. The RuvB protein makes up part of the RuvABC revolvasome which catalyses the resolution of Holliday junctions that arise during genetic recombination and DNA repair. Branch migration is catalysed by the RuvB protein that is targeted to the Holliday junction by the structure specific RuvA protein. This family contains the N-terminal region of the protein. Probab=29.13 E-value=43 Score=14.54 Aligned_cols=24 Identities=25% Similarity=0.318 Sum_probs=11.1 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHCCCCCC Q ss_conf 778889847332247888876431013354 Q gi|254780653|r 62 LGRPQSKSDKDCSLFKVVSIAESILHKNIL 91 (400) Q Consensus 62 ~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~ 91 (400) +|-| |-. +.||. ..+.+-++.+.. T Consensus 56 ~GPP-G~G--KTTlA---riiAk~~~~~~~ 79 (234) T pfam05496 56 YGPP-GLG--KTTLA---NIIANEMGVNIR 79 (234) T ss_pred ECCC-CCC--HHHHH---HHHHHHHCCCEE T ss_conf 7899-998--88999---999984087537 No 133 >TIGR01393 lepA GTP-binding protein LepA; InterPro: IPR006297 LepA (GUF1 in Saccaromyces) is a GTP-binding membrane protein related to EF-G and EF-Tu. Two types of phylogenetic tree, rooted by other GTP-binding proteins, suggest that eukaryotic homologs (including GUF1 of yeast) originated within the bacterial LepA family. The function of the proteins in this family are unknown. ; GO: 0005525 GTP binding. Probab=28.91 E-value=11 Score=18.32 Aligned_cols=63 Identities=16% Similarity=0.296 Sum_probs=31.2 Q ss_pred HCCCCCCCHHHHHHHHCCCCCEEEECCHHHHCCCCCCEEECCCCCCCCCCHH--------HHHHHHHHHHHCCCCC Q ss_conf 0343210002466531243642475440440113763330233210003325--------5657776533101667 Q gi|254780653|r 121 FYSEEERNDPDFVRMLSRNGDFYINDAFSVSHRAHASITGLSHLLPSYIGRA--------MQKELSMLESCFSESK 188 (400) Q Consensus 121 f~~~E~~n~~~f~k~La~laDiyVnDAF~~aHR~haS~~gi~~~lps~aG~l--------~ekEi~~L~~~l~~~~ 188 (400) +||-+.+.-++|-+.|-+|. .|||==+--...+.=-||- |===|.|+| +|+|-+ |+-+++.|. T Consensus 301 lYPid~~~Ye~LrdALeKL~---LNDAsL~yE~E~S~ALGFG-FRCGFLGLLHmEiiqERLeREFn-ldlI~TAP~ 371 (598) T TIGR01393 301 LYPIDTEDYEDLRDALEKLK---LNDASLTYEPESSPALGFG-FRCGFLGLLHMEIIQERLEREFN-LDLITTAPS 371 (598) T ss_pred CCCCCCHHHHHHHHHHHHHH---HHHHHHHCCCCCCCCCCCC-EEECCCCCHHHHHHHHHHHHHCC-CEEEEECCC T ss_conf 25888034689999975554---4025421022363003740-43326663368999876544308-706872781 No 134 >PRK09344 phosphoenolpyruvate carboxykinase; Provisional Probab=28.72 E-value=21 Score=16.57 Aligned_cols=49 Identities=14% Similarity=0.188 Sum_probs=29.9 Q ss_pred CCCCCEEECCHHHHCCCCCCC-HHHHHHHHCCCCCEEEECCHHHHCCCCCCEEEC Q ss_conf 364648982424303432100-024665312436424754404401137633302 Q gi|254780653|r 108 LSEGGIILAENVRFYSEEERN-DPDFVRMLSRNGDFYINDAFSVSHRAHASITGL 161 (400) Q Consensus 108 l~~g~ilLLEN~Rf~~~E~~n-~~~f~k~La~laDiyVnDAF~~aHR~haS~~gi 161 (400) ...|++ |..|-++.-.. -....+.| +--|+||.|+|..+|+.|.=-+-+ T Consensus 71 IwWG~v----N~pis~e~Fd~L~~kv~~yL-~~kdlyV~D~~aGadp~~rl~VRV 120 (525) T PRK09344 71 IWWGDV----NKPISPEKFDALKQKVLAYL-SGKDLFVVDGFAGADPDYRLPVRV 120 (525) T ss_pred EEECCC----CCCCCHHHHHHHHHHHHHHH-HCCCEEEEEEEECCCCCCCCCEEE T ss_conf 534787----76889999999999999998-279769997564068432411799 No 135 >cd03286 ABC_MSH6_euk MutS6 homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding c Probab=28.67 E-value=37 Score=14.97 Aligned_cols=22 Identities=18% Similarity=0.170 Sum_probs=10.5 Q ss_pred HHHHHHHHHHHCCCCCCCEEEE Q ss_conf 5657776533101667871899 Q gi|254780653|r 173 MQKELSMLESCFSESKKPLVAI 194 (400) Q Consensus 173 ~ekEi~~L~~~l~~~~~P~v~I 194 (400) |..|+..+..+++...+..+++ T Consensus 93 F~~e~~~~~~il~~~~~~sLvl 114 (218) T cd03286 93 FMVELSETANILRHATPDSLVI 114 (218) T ss_pred HHHHHHHHHHHHHHCCCCCCEE T ss_conf 9999999999998679985010 No 136 >pfam00186 DHFR_1 Dihydrofolate reductase. Probab=28.42 E-value=42 Score=14.62 Aligned_cols=25 Identities=20% Similarity=0.396 Sum_probs=18.4 Q ss_pred EEEEEEECCHHHHHHHHHHHHHCCCEEEECH Q ss_conf 1899960780255788998773068888620 Q gi|254780653|r 191 LVAIVGGSKVSTKITLLINLVKKVDKLVIGG 221 (400) Q Consensus 191 ~v~IiGGaKisdKi~~i~~L~~k~D~iiigG 221 (400) -+.|+||+.+= +.+++.+|++.+.= T Consensus 91 ~i~ViGG~~iY------~~~l~~~~~i~lT~ 115 (159) T pfam00186 91 EIFIIGGAEIY------KQALPYADRLYLTE 115 (159) T ss_pred CEEEECHHHHH------HHHHHHCCEEEEEE T ss_conf 39998869999------99998789999999 No 137 >PRK04690 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=28.21 E-value=45 Score=14.44 Aligned_cols=38 Identities=13% Similarity=0.106 Sum_probs=23.5 Q ss_pred HCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 01599689998201671318577725899998898999998898099991377 Q gi|254780653|r 11 RDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFSHLG 63 (400) Q Consensus 11 ~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~G 63 (400) .+++||+|+| +-+ | +.-..+.++|.++|+++++..... T Consensus 4 ~~~~gkkv~V-----~Gl--G--------~sG~aaa~~L~~~g~~~~v~~~~~ 41 (468) T PRK04690 4 SQLEGKRVAL-----WGW--G--------REGRAAYRALRAQLPAQPLTVFCN 41 (468) T ss_pred HHCCCCEEEE-----EEE--C--------HHHHHHHHHHHHCCCEEEEEECCC T ss_conf 4439797999-----834--7--------879999999996699049997288 No 138 >PRK06935 2-deoxy-D-gluconate 3-dehydrogenase; Provisional Probab=28.17 E-value=45 Score=14.44 Aligned_cols=40 Identities=20% Similarity=0.390 Sum_probs=26.8 Q ss_pred CHHHHCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEEC Q ss_conf 1010015996899982016713185777258999988989999988980999913 Q gi|254780653|r 7 MNDLRDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFSH 61 (400) Q Consensus 7 i~d~~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH 61 (400) ++-+ ||+||++||- | -..-|=+ .+-+.|.++||+|++.++ T Consensus 8 m~~f-~L~gKvalVT---------G---as~GIG~--aiA~~la~~Ga~Vvi~~~ 47 (258) T PRK06935 8 MDFF-SLKGKVAIVT---------G---GNTGLGQ--GYAVALAKAGADIIITTH 47 (258) T ss_pred HHHC-CCCCCEEEEE---------C---CCCHHHH--HHHHHHHHCCCEEEEECC T ss_conf 5431-9999989994---------8---5758999--999999987999999729 No 139 >PRK07631 amidophosphoribosyltransferase; Provisional Probab=27.93 E-value=45 Score=14.41 Aligned_cols=15 Identities=7% Similarity=0.412 Sum_probs=10.5 Q ss_pred HHHHCCCCCEEECCH Q ss_conf 553136464898242 Q gi|254780653|r 104 SIASLSEGGIILAEN 118 (400) Q Consensus 104 ~i~~l~~g~ilLLEN 118 (400) .+.+++|||++.+.. T Consensus 215 ~irdv~PGEiv~I~~ 229 (475) T PRK07631 215 YLREVEPGELLIIND 229 (475) T ss_pred EEEECCCCEEEEEEC T ss_conf 998169966999978 No 140 >PRK10637 cysG siroheme synthase; Provisional Probab=27.92 E-value=45 Score=14.41 Aligned_cols=36 Identities=19% Similarity=0.343 Sum_probs=29.4 Q ss_pred HHCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 001599689998201671318577725899998898999998898099991 Q gi|254780653|r 10 LRDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFS 60 (400) Q Consensus 10 ~~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~S 60 (400) +-|++||+||| +-.|++ +..-++.|++.||+|.++| T Consensus 7 fl~L~gk~vLV-------VGGG~v--------A~rK~~~Ll~agA~VtVia 42 (457) T PRK10637 7 FCQLRDRDCLI-------VGGGDV--------AERKARLLLDAGARLTVNA 42 (457) T ss_pred EEECCCCEEEE-------ECCCHH--------HHHHHHHHHHCCCEEEEEC T ss_conf 68738986999-------899899--------9999999987898799989 No 141 >PRK13772 formimidoylglutamase; Provisional Probab=27.43 E-value=46 Score=14.35 Aligned_cols=40 Identities=20% Similarity=0.252 Sum_probs=22.8 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCCCCCC-CCHHH--HHHHHHHHH Q ss_conf 20035677764057767999766253227-43658--899999999 Q gi|254780653|r 298 GFKTVEYIKQVIAQARTVMWNGPLGVFEI-EPFDR--ATVEVAHYV 340 (400) Q Consensus 298 Gp~Ti~~~~~~I~~AktI~WNGP~GvfE~-~~F~~--GT~~l~~~i 340 (400) .+..+..+.+.+..+..+.- |-+.|. |.|+. -|..++.-+ T Consensus 265 ~~~~~~~l~~~i~~~~kl~~---~DivEvnP~~D~d~~TA~lAA~l 307 (313) T PRK13772 265 PLPVVEEIATLVRRSGKLRV---ADLAEYNPQYDRDHRGARVAARL 307 (313) T ss_pred CHHHHHHHHHHHHHCCCEEE---EEEEEECCCCCCCCCHHHHHHHH T ss_conf 99999999999863499789---99898698889988699999999 No 142 >TIGR01826 CofD_related conserved hypothetical protein; InterPro: IPR010119 This entry represents a subfamily of conserved hypothetical proteins that forms a sister group to the family of CofD, (IPR010115 from INTERPRO), LPPG:Fo 2-phospho-L-lactate transferase, an enzyme of cytochrome F420 biosynthesis.. Probab=27.35 E-value=46 Score=14.35 Aligned_cols=50 Identities=28% Similarity=0.254 Sum_probs=24.2 Q ss_pred HHHH-HHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCC Q ss_conf 8989-9999889809999137788898473322478888764310133545666 Q gi|254780653|r 43 VPTI-LELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVND 95 (400) Q Consensus 43 ~pTI-~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d 95 (400) +|-| +.|.+..|++|.+.=+.+=-| .-..||+..|.++|.+++|++ |||. T Consensus 211 vp~i~~A~~~s~A~kvYV~NlmtqpG-ET~~~~~~dHv~~l~~h~G~~--~iD~ 261 (331) T TIGR01826 211 VPEIAEALRESKAPKVYVCNLMTQPG-ETDGFSVEDHVKALHRHLGKP--FIDV 261 (331) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCCC-CCCCCCHHHHHHHHHHHCCCC--CCCE T ss_conf 67899999728998999733566888-664772899999999866898--3447 No 143 >COG0362 Gnd 6-phosphogluconate dehydrogenase [Carbohydrate transport and metabolism] Probab=27.28 E-value=47 Score=14.34 Aligned_cols=150 Identities=18% Similarity=0.257 Sum_probs=78.0 Q ss_pred CCCCCCCHHHHHHHHHHHCCCCCCCC-CC--CCHHHHHHHHHHCCCCCEEECCHHHHCCCCCCCHHHHHHHHCCCCCEEE Q ss_conf 84733224788887643101335456-66--3102556655313646489824243034321000246653124364247 Q gi|254780653|r 68 KSDKDCSLFKVVSIAESILHKNILFV-ND--CIGSTLSQSIASLSEGGIILAENVRFYSEEERNDPDFVRMLSRNGDFYI 144 (400) Q Consensus 68 ~~~~~~Sl~~v~~~l~~~l~~~V~fi-~d--~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~~n~~~f~k~La~laDiyV 144 (400) +..+.+|++-.+..|++ -|+|.+- .- .++...++..--|.+|+|+.=--=-+|+.-.. --+.|+.-+=.|| T Consensus 51 ~i~~~~sieefV~~Le~--PRkI~lMVkAG~~VD~~I~~L~p~Le~gDIiIDGGNs~y~DT~R----R~~eL~~~Gi~Fv 124 (473) T COG0362 51 NIVPAYSIEEFVASLEK--PRKILLMVKAGTPVDAVIEQLLPLLEKGDIIIDGGNSHYKDTIR----RNKELSEKGILFV 124 (473) T ss_pred CCCCCCCHHHHHHHHCC--CCEEEEEEECCCCHHHHHHHHHHHCCCCCEEEECCCCCCCHHHH----HHHHHHHCCCEEE T ss_conf 64566859999987158--84699998269927899999875257889898688767820789----9999886388489 Q ss_pred ECCHHHHCCCCCCEEECCCCCCCC---CCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCCEEEECH Q ss_conf 544044011376333023321000---33255657776533101667871899960780255788998773068888620 Q gi|254780653|r 145 NDAFSVSHRAHASITGLSHLLPSY---IGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKKVDKLVIGG 221 (400) Q Consensus 145 nDAF~~aHR~haS~~gi~~~lps~---aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~D~iiigG 221 (400) - -|+|--..--..| |..+|-+ |=.+++.=++.+..-. ...||+.-+|-.-.---.+++.|=++-.|.=+|+- T Consensus 125 G--~GVSGGEeGA~~G-PSiMpGG~~eay~~v~pil~~IaAk~--~g~pCc~~iG~~GAGHfVKmVHNGIEYgDMQlIaE 199 (473) T COG0362 125 G--MGVSGGEEGARHG-PSIMPGGQKEAYELVAPILTKIAAKV--DGEPCCTWIGPDGAGHFVKMVHNGIEYGDMQLIAE 199 (473) T ss_pred E--CCCCCCCCCCCCC-CCCCCCCCHHHHHHHHHHHHHHHHHC--CCCCCEEEECCCCCCCEEEEEECCCHHHHHHHHHH T ss_conf 5--1666652100249-87699999889999999999987533--99985253789997752210145714888999999 Q ss_pred HHHHHHHHH Q ss_conf 578999998 Q gi|254780653|r 222 GMANSFLVA 230 (400) Q Consensus 222 ~lantfL~A 230 (400) ++.||+- T Consensus 200 --~Y~ilk~ 206 (473) T COG0362 200 --AYDILKD 206 (473) T ss_pred --HHHHHHH T ss_conf --9999997 No 144 >PRK08341 amidophosphoribosyltransferase; Provisional Probab=27.13 E-value=47 Score=14.32 Aligned_cols=15 Identities=13% Similarity=0.326 Sum_probs=11.4 Q ss_pred HHHCCCCCEEECCHH Q ss_conf 531364648982424 Q gi|254780653|r 105 IASLSEGGIILAENV 119 (400) Q Consensus 105 i~~l~~g~ilLLEN~ 119 (400) +.+++|||++...+- T Consensus 205 irdv~PGEii~i~~~ 219 (442) T PRK08341 205 IRDVRPGEVFVVSEG 219 (442) T ss_pred EEECCCCEEEEECCC T ss_conf 897369848997599 No 145 >COG0396 sufC Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones] Probab=26.95 E-value=47 Score=14.30 Aligned_cols=49 Identities=18% Similarity=0.268 Sum_probs=22.6 Q ss_pred CCCCHHHH-CCCCC-EEEEEEECCCCCCCCEECCHH-HHHHHHHHHHHHHHCCC Q ss_conf 87710100-15996-899982016713185777258-99998898999998898 Q gi|254780653|r 4 LRTMNDLR-DIRGL-RCLLRVDWNVPFIDGKVADVT-RIERVVPTILELVEKKA 54 (400) Q Consensus 4 ~~ti~d~~-d~~gk-~VlvRvD~NvPi~~g~I~d~~-RI~~~~pTI~~l~~~~a 54 (400) |-.|.|+. .+.+| .+|=.|++.|+ .|+|..-- -==+-..|+-+.+--.. T Consensus 3 ~L~I~dLhv~v~~~keILkgvnL~v~--~GEvhaiMGPNGsGKSTLa~~i~G~p 54 (251) T COG0396 3 MLEIKDLHVEVEGKKEILKGVNLTVK--EGEVHAIMGPNGSGKSTLAYTIMGHP 54 (251) T ss_pred EEEEEEEEEEECCCHHHHCCCCEEEC--CCCEEEEECCCCCCHHHHHHHHHCCC T ss_conf 37995469995685210037414675--98289998899878899999972899 No 146 >TIGR02534 mucon_cyclo muconate and chloromuconate cycloisomerases; InterPro: IPR013370 This family consists of muconate cycloisomerase (5.5.1.1 from EC) and chloromuconate cycloisomerase (5.5.1.7 from EC), enzymes that often overlap in specificity. It does not include more distantly related proteins such as mandelate racemase (5.1.2.2 from EC).; GO: 0018849 muconate cycloisomerase activity, 0018850 chloromuconate cycloisomerase activity, 0030145 manganese ion binding. Probab=26.91 E-value=47 Score=14.30 Aligned_cols=190 Identities=16% Similarity=0.172 Sum_probs=126.5 Q ss_pred CCCCCCC---CEECC------------HH-HHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCC Q ss_conf 1671318---57772------------58-99998898999998898099991377888984733224788887643101 Q gi|254780653|r 24 WNVPFID---GKVAD------------VT-RIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILH 87 (400) Q Consensus 24 ~NvPi~~---g~I~d------------~~-RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~ 87 (400) ||||+.+ |.|-| .. -|.++-.-|..=.++-.|.=+.|.- |. ++--..-.+++.|....+ T Consensus 116 LgVPV~eLLGG~~Rd~vdV~WtLasgdtd~dI~EAe~~~E~~rh~~FKLKiG~~~--~a---~DV~hV~~I~~ALg~r~~ 190 (369) T TIGR02534 116 LGVPVSELLGGAVRDSVDVLWTLASGDTDRDIEEAEELLEEKRHRSFKLKIGARD--LA---DDVAHVVAIKKALGDRAS 190 (369) T ss_pred CCCCHHHHCCCCEECCCCHHHHHCCCCHHHHHHHHHHHHHHHCCCCEEEECCCCC--HH---HHHHHHHHHHHHHCCCEE T ss_conf 7872344278741144540242115753478999999999726882154437888--65---779999999997299528 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCCCCEEECCHHHHCCCCCCCHHHHHHHHCCCCCEEEE---------CCHHHHCCCCCCE Q ss_conf 3354566631025566553136464898242430343210002466531243642475---------4404401137633 Q gi|254780653|r 88 KNILFVNDCIGSTLSQSIASLSEGGIILAENVRFYSEEERNDPDFVRMLSRNGDFYIN---------DAFSVSHRAHASI 158 (400) Q Consensus 88 ~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~~n~~~f~k~La~laDiyVn---------DAF~~aHR~haS~ 158 (400) ..|.--.-|-..++...+..|.+|.|=|.|-= ==-+|-.-+ ++|+.+++|=|| |||..+-|+-|=+ T Consensus 191 VRVDVN~aW~e~~A~~~~~~LadaGv~LIEQP----tpae~~~~L-arLt~r~~VpIMADE~v~gP~dAl~~a~~~aADV 265 (369) T TIGR02534 191 VRVDVNAAWDELEALKYLPQLADAGVELIEQP----TPAEQREAL-ARLTRRFNVPIMADESVTGPADALEIAKASAADV 265 (369) T ss_pred EEEECCCCCCHHHHHHHHHHHHHCCHHHHCCC----CCHHHHHHH-HHHHHHCCCCEEECCCCCCHHHHHHHHHHHCCCE T ss_conf 99720366787899999888885070000488----962048999-9998433885430445688278999999825977 Q ss_pred EECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCCEEEECHHHHHHHHHHHCCCCCCC Q ss_conf 30233210003325565777653310166787189996078025578899877306888862057899999830143433 Q gi|254780653|r 159 TGLSHLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKKVDKLVIGGGMANSFLVAQGMGVGRS 238 (400) Q Consensus 159 ~gi~~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~D~iiigG~lantfL~A~G~~IG~S 238 (400) ..+ ....|-|+.=.+++-++-++-. ++.-||.-+.=-|+-+=++ ..|-.....+-|.. T Consensus 266 fAl--K~~ksGGL~~~~~~AAIA~AAG------i~lyGgTmLEGpiGtiASa--------------h~f~~~~~l~~GTE 323 (369) T TIGR02534 266 FAL--KIAKSGGLLESKKVAAIAKAAG------IALYGGTMLEGPIGTIASA--------------HFFAAFPELEFGTE 323 (369) T ss_pred EEE--EHHCCCCHHHHHHHHHHHHHCC------CCCCCCCCCCCHHHHHHHH--------------HHHHCCCCCCCCCC T ss_conf 874--0001441578999999997648------3101650031026899999--------------99841356664641 Q ss_pred CCCCCCC Q ss_conf 3310000 Q gi|254780653|r 239 LCQRDFS 245 (400) Q Consensus 239 l~e~~~~ 245 (400) |+=+-.+ T Consensus 324 LFGPLLL 330 (369) T TIGR02534 324 LFGPLLL 330 (369) T ss_pred CCCCCCC T ss_conf 2463111 No 147 >KOG0730 consensus Probab=26.81 E-value=47 Score=14.28 Aligned_cols=35 Identities=23% Similarity=0.375 Sum_probs=25.1 Q ss_pred HCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEE Q ss_conf 05776799976625322743658899999999861114885899 Q gi|254780653|r 309 IAQARTVMWNGPLGVFEIEPFDRATVEVAHYVAKLTKERRIISI 352 (400) Q Consensus 309 I~~AktI~WNGP~GvfE~~~F~~GT~~l~~~ia~~~~~~~~~si 352 (400) |...+-|+.-||.|. |-.-+++++|+..+ -+..+| T Consensus 465 i~ppkGVLlyGPPGC--------~KT~lAkalAne~~-~nFlsv 499 (693) T KOG0730 465 ISPPKGVLLYGPPGC--------GKTLLAKALANEAG-MNFLSV 499 (693) T ss_pred CCCCCEEEEECCCCC--------CHHHHHHHHHHHHC-CCEEEC T ss_conf 788754777789986--------24789999864635-872641 No 148 >cd04729 NanE N-acetylmannosamine-6-phosphate epimerase (NanE) converts N-acetylmannosamine-6-phosphate to N-acetylglucosamine-6-phosphate. This reaction is part of the pathway that allows the usage of sialic acid as a carbohydrate source. Sialic acids are a family of related sugars that are found as a component of glycoproteins, gangliosides, and other sialoglycoconjugates. Probab=26.60 E-value=48 Score=14.26 Aligned_cols=114 Identities=17% Similarity=0.239 Sum_probs=56.6 Q ss_pred CCCCCCC--CEEC--CHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHH Q ss_conf 1671318--5777--25899998898999998898099991377888984733224788887643101335456663102 Q gi|254780653|r 24 WNVPFID--GKVA--DVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGS 99 (400) Q Consensus 24 ~NvPi~~--g~I~--d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~ 99 (400) .++|+.. .+-. .+-+|...+.-++.|.+.|+-+|-+---.||.- + ..|+....+.+.+..++. +--||- T Consensus 59 v~lPIIGi~K~~~~~s~VyITPt~~ev~~l~~aGadiIA~DaT~R~RP--~-g~~l~~~i~~i~~~~~~l--~MAD~s-- 131 (219) T cd04729 59 VDLPIIGLIKRDYPDSEVYITPTIEEVDALAAAGADIIALDATDRPRP--D-GETLAELIKRIHEEYNCL--LMADIS-- 131 (219) T ss_pred CCCCEEEEEECCCCCCCEEECCCHHHHHHHHHCCCCEEEEECCCCCCC--C-CCCHHHHHHHHHHHHCCE--EEEECC-- T ss_conf 899889999568899984566889999999985999999946788798--9-978999999999986977--887548-- Q ss_pred HHHHHHHHCCCCCEEECCHHHHCCCCC----CCHHHHHHHHCCCCCEEE Q ss_conf 556655313646489824243034321----000246653124364247 Q gi|254780653|r 100 TLSQSIASLSEGGIILAENVRFYSEEE----RNDPDFVRMLSRNGDFYI 144 (400) Q Consensus 100 ~~~~~i~~l~~g~ilLLEN~Rf~~~E~----~n~~~f~k~La~laDiyV 144 (400) ..++.+...+-|-=++=--+.=|.++. .+|-+|.++|++..+++| T Consensus 132 t~ee~~~A~~~G~D~vgTTL~GYT~~t~~~~~PD~~lv~~l~~~~~~pv 180 (219) T cd04729 132 TLEEALNAAKLGFDIIGTTLSGYTEETAKTEDPDFELLKELRKALGIPV 180 (219) T ss_pred CHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCE T ss_conf 8999999998499899702145677878899987899999999759939 No 149 >TIGR01527 arch_NMN_Atrans nicotinamide-nucleotide adenylyltransferase; InterPro: IPR006418 This family of archaeal proteins exhibits NAD salvage biosynthesis enzyme nicotinamide-nucleotide adenylyltransferase (2.7.7.1 from EC) activity. In some cases, the enzyme was tested and found also to have the activity of nicotinate-nucleotide adenylyltransferase (2.7.7.18 from EC), an enzyme of NAD de novo biosynthesis, although with a higher Km. In some archaeal species, a number of proteins which are uncharacterised with respect to activity, are also present.; GO: 0000309 nicotinamide-nucleotide adenylyltransferase activity, 0009435 NAD biosynthetic process, 0005737 cytoplasm. Probab=26.34 E-value=48 Score=14.23 Aligned_cols=20 Identities=30% Similarity=0.679 Sum_probs=16.9 Q ss_pred HHHHHHHHHCCCEEEECHHH Q ss_conf 78899877306888862057 Q gi|254780653|r 204 ITLLINLVKKVDKLVIGGGM 223 (400) Q Consensus 204 i~~i~~L~~k~D~iiigG~l 223 (400) +.||++.+++||.|+||=|= T Consensus 16 ~~Vik~~l~~VDELiiGiGS 35 (171) T TIGR01527 16 LEVIKKILEEVDELIIGIGS 35 (171) T ss_pred HHHHHHHHHHCCEEEEEEEC T ss_conf 79999887354777987404 No 150 >cd01563 Thr-synth_1 Threonine synthase is a pyridoxal phosphate (PLP) dependent enzyme that catalyses the last reaction in the synthesis of threonine from aspartate. It proceeds by converting O-phospho-L-homoserine (OPH) into threonine and inorganic phosphate. In plants, OPH is an intermediate between the methionine and threonine/isoleucine pathways. Thus threonine synthase competes for OPH with cystathionine-gamma-synthase, the first enzyme in the methionine pathway. These enzymes are in general dimers. Members of this CD, Thr-synth_1, are widely distributed in bacteria, archaea and higher plants. Probab=26.19 E-value=49 Score=14.21 Aligned_cols=111 Identities=13% Similarity=0.122 Sum_probs=63.5 Q ss_pred CCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCC-CC Q ss_conf 59968999820167131857772589999889899999889809999137788898473322478888764310133-54 Q gi|254780653|r 13 IRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKN-IL 91 (400) Q Consensus 13 ~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~-V~ 91 (400) +-+.+|++..+.--| +--++.+.+.-.|....++|.+.|+.+--| +--.++.- ....+|.+ +. T Consensus 35 lg~~~i~~K~E~~np------tGSfK~RgA~~~i~~a~~~g~~~vv~~SsG------N~G~alA~----~a~~~G~k~~I 98 (324) T cd01563 35 LGGKNLYVKDEGLNP------TGSFKDRGMTVAVSKAKELGVKAVACASTG------NTSASLAA----YAARAGIKCVV 98 (324) T ss_pred CCCCEEEEEECCCCC------CCCCHHHHHHHHHHHHHHCCCCEEEEECCH------HHHHHHHH----HHHHCCCCEEE T ss_conf 099879998636897------778699999999999997599979997833------99999999----99985996576 Q ss_pred CCCCCCHHHHHHHHHHCCCCCEEECCHHHHCCCCCCCHHHHHHHHCCCCCEEEEC Q ss_conf 5666310255665531364648982424303432100024665312436424754 Q gi|254780653|r 92 FVNDCIGSTLSQSIASLSEGGIILAENVRFYSEEERNDPDFVRMLSRNGDFYIND 146 (400) Q Consensus 92 fi~d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~~n~~~f~k~La~laDiyVnD 146 (400) |+++...+.-...++.+ -.+|++.+.- .....+.+++++.--..|++. T Consensus 99 v~p~~~~~~k~~~i~~~-GAeVv~v~~~------~~~~~~~a~~~~~~~~~~~~~ 146 (324) T cd01563 99 FLPAGKALGKLAQALAY-GATVLAVEGN------FDDALRLVRELAEENWIYLSN 146 (324) T ss_pred ECCCCCHHHHHHHHHHC-CCEEEECCCC------HHHHHHHHHHHHHHCCCEECC T ss_conf 55764389999999955-9989975899------999999999977745831104 No 151 >PRK07349 amidophosphoribosyltransferase; Provisional Probab=26.07 E-value=49 Score=14.20 Aligned_cols=129 Identities=13% Similarity=0.229 Sum_probs=65.0 Q ss_pred HHHHHCCCCCEEECCHHH-----HCCCCCC------------------------CHHHHHHHHCCCCCEEEECCHHHHCC Q ss_conf 655313646489824243-----0343210------------------------00246653124364247544044011 Q gi|254780653|r 103 QSIASLSEGGIILAENVR-----FYSEEER------------------------NDPDFVRMLSRNGDFYINDAFSVSHR 153 (400) Q Consensus 103 ~~i~~l~~g~ilLLEN~R-----f~~~E~~------------------------n~~~f~k~La~laDiyVnDAF~~aHR 153 (400) +.+.+++|||++.+.+-- +.+.+.. -..++.++||.-.++ || T Consensus 238 e~iRdv~PGEiv~I~~~g~~s~~~~~~~~~~~C~FEyIYFaRPDS~i~g~~Vy~~R~~~G~~LA~e~~v---~~------ 308 (495) T PRK07349 238 DYLRDVEPGELVWITEGGLESFRWAEKPQRKLCVFEMIYFARPDSRMHGESLYSYRQRLGQQLAKESPV---DA------ 308 (495) T ss_pred EEEECCCCCEEEEEECCCEEEEEECCCCCCCCCCCHHHHHCCCCCEECCCCHHHHHHHHHHHHHCCCCC---CC------ T ss_conf 588638998599981796368861588876543011222047751247818999999999987260688---88------ Q ss_pred CCCCEEECCCC-CCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECC-----HHHHHHHHHHHHHCCCEEEECHHHHHHH Q ss_conf 37633302332-10003325565777653310166787189996078-----0255788998773068888620578999 Q gi|254780653|r 154 AHASITGLSHL-LPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSK-----VSTKITLLINLVKKVDKLVIGGGMANSF 227 (400) Q Consensus 154 ~haS~~gi~~~-lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaK-----isdKi~~i~~L~~k~D~iiigG~lantf 227 (400) -=++++|.- .|++.|+.-+.-+-.-.-++.|.--..++|..+-+ +.-|+..++.+++-- +|++ T Consensus 309 --DiVi~VPdSg~~aA~gya~~sgipy~~glikn~YvgRTFI~p~q~~R~~~vr~Kln~~~~~i~gK-~vvl-------- 377 (495) T PRK07349 309 --DLVIGVPDSGIPAAIGFSQASGIPYAEGLIKNRYVGRTFIQPTQAMRESGIRMKLNPLKDVLAGK-RIII-------- 377 (495) T ss_pred --CEEEECCCCHHHHHHHHHHHCCCCCHHHHHHCCCCCCCCCCCCHHHHHHCCCEEEEECHHHHCCC-EEEE-------- T ss_conf --63897899629999999997399712201204544655518758887632013201367885898-5999-------- Q ss_pred HHHHCCCCCCCCCCCCCCHHHHHHHHHHHHCCCC Q ss_conf 9983014343333100001356889987530245 Q gi|254780653|r 228 LVAQGMGVGRSLCQRDFSDNVHQIAWEAKRSACE 261 (400) Q Consensus 228 L~A~G~~IG~Sl~e~~~~~~ak~i~~~a~~~~~~ 261 (400) |-.|++.-. +.++|++.+++.|.+ T Consensus 378 -------vDDSIVRGt---T~k~iv~~Lr~aGAk 401 (495) T PRK07349 378 -------VDDSIVRGT---TSRKIVKALRDAGAT 401 (495) T ss_pred -------ECCCCCCCC---CHHHHHHHHHHCCCC T ss_conf -------826605574---699999999976998 No 152 >pfam02401 LYTB LytB protein. The mevalonate-independent 2-C-methyl-D-erythritol 4-phosphate (MEP) pathway for isoprenoid biosynthesis is essential in many eubacteria, plants, and the malaria parasite. The LytB gene is involved in the trunk line of the MEP pathway. Probab=26.04 E-value=49 Score=14.20 Aligned_cols=60 Identities=22% Similarity=0.261 Sum_probs=23.8 Q ss_pred HHHHHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHCCC----CCCCCEEECCHHHHHHHHCCC Q ss_conf 658899999999861114885899748046998998488----768646875079989987489 Q gi|254780653|r 329 FDRATVEVAHYVAKLTKERRIISIAGGGDTITALAHAGI----SNEFTYVSTAGGAFLEWLEGK 388 (400) Q Consensus 329 F~~GT~~l~~~ia~~~~~~~~~sivGGGdT~aai~~~g~----~~~~~hvSTgGGA~L~~L~G~ 388 (400) .+.-|..=-+++.++.+..++.-||||-+++---+.+.+ ..+..||.|..---.++|.+. T Consensus 191 IC~aT~~RQ~a~~~lA~~vD~miVVGg~nSsNT~~L~eia~~~~~~t~~Ie~~~el~~~~l~~~ 254 (280) T pfam02401 191 ICYATQNRQEAVRELAPEVDVVLVVGGKNSSNSNRLAEIAKEHGVPAYLIDDPEEIQPEWLKGV 254 (280) T ss_pred CCHHHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHHCCCEEEECCHHHCCHHHHCCC T ss_conf 2275888999999777549899995799995489999999987999799698276999894799 No 153 >PRK05337 beta-hexosaminidase; Provisional Probab=26.01 E-value=49 Score=14.19 Aligned_cols=14 Identities=29% Similarity=0.202 Sum_probs=9.5 Q ss_pred CCCHHHHHHHHHHH Q ss_conf 03325565777653 Q gi|254780653|r 168 YIGRAMQKELSMLE 181 (400) Q Consensus 168 ~aG~l~ekEi~~L~ 181 (400) ..|.+|..|+.++. T Consensus 101 ~~g~~~a~eL~a~G 114 (336) T PRK05337 101 EAGWLMAAELRACG 114 (336) T ss_pred HHHHHHHHHHHHCC T ss_conf 99999999999809 No 154 >TIGR01027 proB glutamate 5-kinase; InterPro: IPR005715 L-glutamate 5-phosphotransferase, (gamma-glutamyl kinase, proB, 2.7.2.11 from EC), catalyzes the first step in proline biosynthesisATP + L-glutamate = ADP + L-glutamate 5-phosphate. the product of which rapidly cyclises to 5-oxoproline and phosphate. Bacterial ProB proteins hits the full length of this model, but the ProB-like domain of delta 1-pyrroline-5-carboxylate synthetase does not hit the C-terminal 100 residues. ; GO: 0004349 glutamate 5-kinase activity, 0006561 proline biosynthetic process, 0005737 cytoplasm. Probab=25.71 E-value=50 Score=14.16 Aligned_cols=202 Identities=18% Similarity=0.232 Sum_probs=104.6 Q ss_pred CEEEEEEECCCCCC-CCE-ECCHHHHHHHHHHHHHHHHCCCEEEEEE---------CCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 68999820167131-857-7725899998898999998898099991---------377888984733224788887643 Q gi|254780653|r 16 LRCLLRVDWNVPFI-DGK-VADVTRIERVVPTILELVEKKAKVVIFS---------HLGRPQSKSDKDCSLFKVVSIAES 84 (400) Q Consensus 16 k~VlvRvD~NvPi~-~g~-I~d~~RI~~~~pTI~~l~~~~akvii~S---------H~GRPkg~~~~~~Sl~~v~~~l~~ 84 (400) |||-|.+==|+=.. .|. -.|..||.....+|..|.++|.+|||+| +||=|+ ..+++..++-..... T Consensus 1 ~riVvK~GsS~Lt~~~g~h~l~~~~i~~lv~~v~~L~~~Gh~vviVSSGA~AaG~~~LG~~~--rP~~la~KQAlAAVG- 77 (379) T TIGR01027 1 KRIVVKVGSSSLTGSSGSHELDRSRIAELVEQVAALHAAGHEVVIVSSGAVAAGFEALGLPE--RPKTLAEKQALAAVG- 77 (379) T ss_pred CEEEEEEECCEEECCCCCCEECHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHCCCCC--CCCCHHHHHHHHHCC- T ss_conf 96899970321117888741377999999999999986599899981675886234455699--986256788787314- Q ss_pred HCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCH-----H--HHC-CCC--------------CCCH------------H Q ss_conf 1013354566631025566553136464898242-----4--303-432--------------1000------------2 Q gi|254780653|r 85 ILHKNILFVNDCIGSTLSQSIASLSEGGIILAEN-----V--RFY-SEE--------------ERND------------P 130 (400) Q Consensus 85 ~l~~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN-----~--Rf~-~~E--------------~~n~------------~ 130 (400) ++..+..|-... ..-.++.|+|||==| - ||- ... .+|| + T Consensus 78 ----Q~~Lm~~y~~~F---~~Yg~~~aQiLLTr~D~~~R~eeRy~NAr~TL~~Ll~~gvvPIiNENDTVaveEi~fGDND 150 (379) T TIGR01027 78 ----QVRLMQLYENLF---SSYGLKVAQILLTRADFSKREEERYLNARNTLEALLELGVVPIINENDTVAVEEIKFGDND 150 (379) T ss_pred ----HHHHHHHHHHHH---HHCCCCHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCEEEEEEEECCCC T ss_conf ----568999999999---7548822243405798863378999999999999986594789867742244000546781 Q ss_pred HHHHHHCCC--CCEE-----EECCHHHHCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCC-CEE-EEEEECCHH Q ss_conf 466531243--6424-----7544044011376333023321000332556577765331016678-718-999607802 Q gi|254780653|r 131 DFVRMLSRN--GDFY-----INDAFSVSHRAHASITGLSHLLPSYIGRAMQKELSMLESCFSESKK-PLV-AIVGGSKVS 201 (400) Q Consensus 131 ~f~k~La~l--aDiy-----VnDAF~~aHR~haS~~gi~~~lps~aG~l~ekEi~~L~~~l~~~~~-P~v-~IiGGaKis 201 (400) .++-..|.| ||.- |.==|.+==|.||-=.=||.+ +|++.....+. ... +.- .==|| +. T Consensus 151 ~LSAlvA~Lv~AD~L~LLTD~dGLYd~dPR~nPdA~~I~~V----------~~~~~~~~~~a-G~S~g~~~~GTGG--M~ 217 (379) T TIGR01027 151 TLSALVAILVGADLLVLLTDVDGLYDADPRTNPDAKLIPVV----------EEITDLLLGVA-GDSSGSSKVGTGG--MR 217 (379) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEECCCC----------CCCCHHHHEEC-CCCCCCCCCCCCH--HH T ss_conf 69999999997528977617553306874537851453412----------45332341111-4667788756646--78 Q ss_pred HHHHHHHHHHHC-CCEEEECHHHHHHHH-HHH----CCCCCCCCC Q ss_conf 557889987730-688886205789999-983----014343333 Q gi|254780653|r 202 TKITLLINLVKK-VDKLVIGGGMANSFL-VAQ----GMGVGRSLC 240 (400) Q Consensus 202 dKi~~i~~L~~k-~D~iiigG~lantfL-~A~----G~~IG~Sl~ 240 (400) +||..-+-=..- +..+|.-|.--.... ..+ |-++|.-+. T Consensus 218 sK~~AA~~A~~~Gv~v~i~~g~~P~~i~~~~~hhfyg~~~GT~F~ 262 (379) T TIGR01027 218 SKIEAARLATRAGVPVIIASGSKPEKIADALEHHFYGAPVGTVFE 262 (379) T ss_pred HHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHCCCCCCCCEEC T ss_conf 999999999868995899448897899999997403778774433 No 155 >TIGR01098 3A0109s03R phosphonate ABC transporter, periplasmic phosphonate-binding protein; InterPro: IPR005770 Bacterial binding protein-dependent transport systems , are multicomponent systems typically composed of a periplasmic substrate-binding protein, one or two reciprocally homologous integral inner-membrane proteins and one or two peripheral membrane ATP-binding proteins that couple energy to the active transport system. The integral inner-membrane proteins translocate the substrate across the membrane. It has been shown , that most of these proteins contain a conserved region located about 80 to 100 residues from their C-terminal extremity. This region seems to be located in a cytoplasmic loop between two transmembrane domains. Apart from the conserved region, the sequence of these proteins is quite divergent, and they have a variable number of transmembrane helices. This is a family of periplasmic proteins which are part of the transport system for alkylphosphonate uptake. ; GO: 0015604 phosphonate transmembrane transporter activity, 0015716 phosphonate transport, 0030288 outer membrane-bounded periplasmic space. Probab=25.52 E-value=39 Score=14.82 Aligned_cols=47 Identities=15% Similarity=0.067 Sum_probs=29.9 Q ss_pred CCHHHHHHHHHHHCCCCCCCCC--CCCHHHHHHHHHHCCCC--CEEECCHHHHCCC Q ss_conf 2247888876431013354566--63102556655313646--4898242430343 Q gi|254780653|r 73 CSLFKVVSIAESILHKNILFVN--DCIGSTLSQSIASLSEG--GIILAENVRFYSE 124 (400) Q Consensus 73 ~Sl~~v~~~l~~~l~~~V~fi~--d~~g~~~~~~i~~l~~g--~ilLLEN~Rf~~~ 124 (400) -.-.|+.++|+|.||.||.+.. |+ ...|+.|+.| +|-.+-+.=|..- T Consensus 68 ~~~~pL~~~Lek~Lg~kV~~~~~~dY-----~a~IEamr~g~~D~a~~g~~sY~~A 118 (299) T TIGR01098 68 RRWEPLKDYLEKKLGIKVELFVATDY-----SAVIEAMRFGRVDIAWFGPSSYVLA 118 (299) T ss_pred HHHHHHHHHHHHHCCCCEEEEEECCC-----CCEEEEECCCCEEEEEECCCHHHEE T ss_conf 76689999888755886799861683-----1033423268078983478512322 No 156 >KOG0572 consensus Probab=25.50 E-value=50 Score=14.13 Aligned_cols=26 Identities=15% Similarity=0.172 Sum_probs=10.8 Q ss_pred EEEECCHHHHHHHHHHHHH----CCCEEEE Q ss_conf 9960780255788998773----0688886 Q gi|254780653|r 194 IVGGSKVSTKITLLINLVK----KVDKLVI 219 (400) Q Consensus 194 IiGGaKisdKi~~i~~L~~----k~D~iii 219 (400) +-|-+=....+..=+.|.. .+|-++- T Consensus 268 ~eG~sVY~~R~~~G~~LA~e~P~d~DvVi~ 297 (474) T KOG0572 268 FEGQSVYTVRLQCGEQLATEAPVDADVVIP 297 (474) T ss_pred ECCCCHHHHHHHHHHHHHHCCCCCCCEEEE T ss_conf 237114899998866762158744555885 No 157 >pfam00488 MutS_V MutS domain V. This domain is found in proteins of the MutS family (DNA mismatch repair proteins) and is found associated with pfam01624, pfam05188, pfam05192 and pfam05190. The mutS family of proteins is named after the Salmonella typhimurium MutS protein involved in mismatch repair; other members of the family included the eukaryotic MSH 1,2,3, 4,5 and 6 proteins. These have various roles in DNA repair and recombination. Human MSH has been implicated in non-polyposis colorectal carcinoma (HNPCC) and is a mismatch binding protein. The aligned region corresponds with domain V of Thermus aquaticus MutS, which contains a Walker A motif, and is structurally similar to the ATPase domain of ABC transporters. Probab=25.29 E-value=51 Score=14.11 Aligned_cols=22 Identities=9% Similarity=0.113 Sum_probs=11.4 Q ss_pred HHHHHHHHHHHCCCCCCCEEEE Q ss_conf 5657776533101667871899 Q gi|254780653|r 173 MQKELSMLESCFSESKKPLVAI 194 (400) Q Consensus 173 ~ekEi~~L~~~l~~~~~P~v~I 194 (400) |..|+..+..++++..+..+++ T Consensus 105 F~~e~~~~~~il~~~~~~sLvl 126 (234) T pfam00488 105 FMVEMLETANILHNATDKSLVI 126 (234) T ss_pred HHHHHHHHHHHHHHCCCCCEEC T ss_conf 9999999999997388773220 No 158 >pfam06941 NT5C 5' nucleotidase, deoxy (Pyrimidine), cytosolic type C protein (NT5C). This family consists of several 5' nucleotidase, deoxy (Pyrimidine), cytosolic type C (NT5C) proteins. 5'(3')-Deoxyribonucleotidase is a ubiquitous enzyme in mammalian cells whose physiological function is not known. Probab=25.27 E-value=51 Score=14.10 Aligned_cols=44 Identities=16% Similarity=0.202 Sum_probs=18.9 Q ss_pred HHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHC Q ss_conf 999889899999889809999137788898473322478888764310 Q gi|254780653|r 39 IERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESIL 86 (400) Q Consensus 39 I~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l 86 (400) |..++.+++.|.+.|..++++++ ||... ...|...-..+|++++ T Consensus 75 i~gA~e~l~~L~~~g~~~~~Its--~~~~~--~~~~~~~k~~WL~~~~ 118 (191) T pfam06941 75 IPGAREALRELADEGFTIVFITT--SPFQH--YDHAVGEKYAWVDQHL 118 (191) T ss_pred CCCHHHHHHHHHHCCCCEEEEEC--CCCCC--CCCCHHHHHHHHHHHC T ss_conf 81099999999977995899978--88757--6633889999999865 No 159 >TIGR03640 cas1_DVULG CRISPR-associated protein Cas1, DVULG subtype. The CRISPR-associated protein Cas1 is virtually universal to CRISPR systems. CRISPR, an acronym for Clustered Regularly Interspaced Short Palindromic Repeats, is prokaryotic immunity system for foreign DNA, mostly from phage. CRISPR systems belong to different subtypes, distinguished by both nature of the repeats, the makeup of the cohort of associated Cas proteins, and by molecular phylogeny within the more universal Cas proteins such as this one. This model is of type EXCEPTION and provides more specific information than the EQUIVALOG model TIGR00287. It describes the Cas1 protein particular to the DVULG subtype of CRISPR/Cas system. Probab=25.24 E-value=51 Score=14.10 Aligned_cols=62 Identities=16% Similarity=0.187 Sum_probs=41.2 Q ss_pred CCCCEEEEEEE----CCCCCCC-CEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCC Q ss_conf 59968999820----1671318-57772589999889899999889809999137788898473322 Q gi|254780653|r 13 IRGLRCLLRVD----WNVPFID-GKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCS 74 (400) Q Consensus 13 ~~gk~VlvRvD----~NvPi~~-g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~S 74 (400) .+|.+..|+.+ .-+|+.+ ..|.=...+.-+.+.|++|+++|-.|+.+++.||+.|......| T Consensus 17 k~~~~l~v~~~~~~~~~iPi~~I~~I~i~g~v~ist~~l~~l~~~~I~v~f~~~~G~~~g~~~~~~~ 83 (340) T TIGR03640 17 KDGETVVVEVEGEKKARVPLHHLGGIVCFGNVGLSPFLMGRCAEDGISLVFLTENGRFLARVEGPVS 83 (340) T ss_pred EECCEEEEEECCCEEEEECHHHCCEEEEECCCCCCHHHHHHHHHCCCEEEEECCCCCEEEEEECCCC T ss_conf 9999999998993887875788688999768726899999999879839998899977678424788 No 160 >cd03115 SRP The signal recognition particle (SRP) mediates the transport to or across the plasma membrane in bacteria and the endoplasmic reticulum in eukaryotes. SRP recognizes N-terminal sighnal sequences of newly synthesized polypeptides at the ribosome. The SRP-polypeptide complex is then targeted to the membrane by an interaction between SRP and its cognated receptor (SR). In mammals, SRP consists of six protein subunits and a 7SL RNA. One of these subunits is a 54 kd protein (SRP54), which is a GTP-binding protein that interacts with the signal sequence when it emerges from the ribosome. SRP54 is a multidomain protein that consists of an N-terminal domain, followed by a central G (GTPase) domain and a C-terminal M domain. Probab=25.21 E-value=51 Score=14.10 Aligned_cols=133 Identities=17% Similarity=0.218 Sum_probs=81.0 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCC-CCCHHHHHHHHHHCCCCCEEECCHHHHCCCCCCCHHH-HHH---HH Q ss_conf 778889847332247888876431013354566-6310255665531364648982424303432100024-665---31 Q gi|254780653|r 62 LGRPQSKSDKDCSLFKVVSIAESILHKNILFVN-DCIGSTLSQSIASLSEGGIILAENVRFYSEEERNDPD-FVR---ML 136 (400) Q Consensus 62 ~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~-d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~~n~~~-f~k---~L 136 (400) .|-| |- -+..++..++.++.+. |++|.++. |+.-.-+.++.+.+.. .-++.|+..+...++. ..+ .. T Consensus 6 vGpt-Gv-GKTTTiaKLA~~~~~~-~~kV~lit~Dt~R~gA~eQL~~~a~-----~l~v~~~~~~~~~~~~~~~~~~~~~ 77 (173) T cd03115 6 VGLQ-GV-GKTTTAAKLALYLKKK-GKKVLLVAADTYRPAAIEQLRVLGE-----QVGVPVFEEGEGKDPVSIAKRAIEH 77 (173) T ss_pred ECCC-CC-CHHHHHHHHHHHHHHC-CCEEEEEECCCCCHHHHHHHHHHHH-----HCCCEEEECCCCCCHHHHHHHHHHH T ss_conf 8999-99-8899999999999976-9928999748875779999999999-----7498599227755879999999999 Q ss_pred --CCCCCEEEECCHHHHCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHH-- Q ss_conf --2436424754404401137633302332100033255657776533101667871899960780255788998773-- Q gi|254780653|r 137 --SRNGDFYINDAFSVSHRAHASITGLSHLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVK-- 212 (400) Q Consensus 137 --a~laDiyVnDAF~~aHR~haS~~gi~~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~-- 212 (400) .+-.|+-.-|--|.+|++. .+-+|++.+.... .|...++ ++-+..=.+-+..++.+.+ T Consensus 78 ~~~~~~D~IlIDTaGr~~~d~----------------~~~~el~~l~~~~-~p~~~~L-Vl~a~~~~~~~~~~~~f~~~~ 139 (173) T cd03115 78 AREENFDVVIVDTAGRLQIDE----------------NLMEELKKIKRVV-KPDEVLL-VVDAMTGQDAVNQAKAFNEAL 139 (173) T ss_pred HHHCCCCEEEEECCCCCCCCH----------------HHHHHHHHHHHHH-CCCCCEE-ECCCCCHHHHHHHHHHHHHCC T ss_conf 875689989997888787999----------------9999999998644-8972157-424655065899999987427 Q ss_pred CCCEEEEC Q ss_conf 06888862 Q gi|254780653|r 213 KVDKLVIG 220 (400) Q Consensus 213 k~D~iiig 220 (400) .++.+++. T Consensus 140 ~~~~~I~T 147 (173) T cd03115 140 GITGVILT 147 (173) T ss_pred CCCEEEEE T ss_conf 99789997 No 161 >cd03284 ABC_MutS1 MutS1 homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clam Probab=25.12 E-value=51 Score=14.09 Aligned_cols=23 Identities=4% Similarity=0.154 Sum_probs=11.8 Q ss_pred HHHHHHHHHHHCCCCCCCEEEEE Q ss_conf 56577765331016678718999 Q gi|254780653|r 173 MQKELSMLESCFSESKKPLVAIV 195 (400) Q Consensus 173 ~ekEi~~L~~~l~~~~~P~v~Ii 195 (400) |..|+..+..++++..+..++++ T Consensus 93 F~~e~~~~~~il~~a~~~sLvli 115 (216) T cd03284 93 FMVEMVETANILNNATERSLVLL 115 (216) T ss_pred HHHHHHHHHHHHHHCCCCCEEEE T ss_conf 89999999999984877615633 No 162 >COG4678 Muramidase (phage lambda lysozyme) [Carbohydrate transport and metabolism] Probab=24.86 E-value=38 Score=14.92 Aligned_cols=42 Identities=21% Similarity=0.356 Sum_probs=26.6 Q ss_pred CCCCCCCCCCHHHHHH---HHHHHHHC----CCCCCCEEEEEEECCHHH Q ss_conf 2332100033255657---77653310----166787189996078025 Q gi|254780653|r 161 LSHLLPSYIGRAMQKE---LSMLESCF----SESKKPLVAIVGGSKVST 202 (400) Q Consensus 161 i~~~lps~aG~l~ekE---i~~L~~~l----~~~~~P~v~IiGGaKisd 202 (400) .|.+.|+..+++|+-= +.++.+-. .+.++|+..++||--+-| T Consensus 18 ~~~~~~~l~~~~~~~~~p~l~a~m~Ti~~se~~~~~pY~vLvgg~~f~D 66 (180) T COG4678 18 MPNFNPSLPPLVMKGGDPMLRALMRTISASEPNRNRPYDVLVGGQLFTD 66 (180) T ss_pred CCCCCCCCHHHHHHCCCHHHHHHHHHCCCCCCCCCCCCEEEECCCEECH T ss_conf 6664643304555404789999986012235789998248874833031 No 163 >PRK13540 cytochrome c biogenesis protein CcmA; Provisional Probab=24.73 E-value=52 Score=14.04 Aligned_cols=16 Identities=19% Similarity=0.468 Sum_probs=10.3 Q ss_pred CCCCCEEEEEEECCCC Q ss_conf 1599689998201671 Q gi|254780653|r 12 DIRGLRCLLRVDWNVP 27 (400) Q Consensus 12 d~~gk~VlvRvD~NvP 27 (400) .+.++.||=.++|.+| T Consensus 10 ~y~~~~vl~~is~~i~ 25 (200) T PRK13540 10 DYHDQPLLQQISFHLP 25 (200) T ss_pred EECCEEEEECEEEEEC T ss_conf 9999998812278987 No 164 >PRK00625 shikimate kinase; Provisional Probab=24.66 E-value=18 Score=16.96 Aligned_cols=53 Identities=17% Similarity=0.151 Sum_probs=23.3 Q ss_pred HHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCCEEEECHHHHHHHHHHH Q ss_conf 577765331016678718999607802557889987730688886205789999983 Q gi|254780653|r 175 KELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKKVDKLVIGGGMANSFLVAQ 231 (400) Q Consensus 175 kEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~D~iiigG~lantfL~A~ 231 (400) .|-+.|..+. ..|.|.=.||.=+-.. ...+.|-++.--|++---....+-.-+ T Consensus 62 ~E~~~l~~l~---~~~~VIstGGG~v~~~-~n~~~Lk~~g~vV~L~~~~e~i~~Rl~ 114 (173) T PRK00625 62 EEALALESLP---VIPSIVALGGGTLMHE-ESYDHIRNRGLLVLLSLPIATIYQRLQ 114 (173) T ss_pred HHHHHHHHHC---CCCCEEECCCCCCCCH-HHHHHHHHCCEEEEEECCHHHHHHHHH T ss_conf 9999999732---4896897488501789-999999968999998399999999982 No 165 >pfam09916 DUF2145 Uncharacterized protein conserved in bacteria (DUF2145). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=24.53 E-value=31 Score=15.46 Aligned_cols=27 Identities=26% Similarity=0.347 Sum_probs=21.1 Q ss_pred HHHHCCCEEEEEECCCCCCCCCCCCCC Q ss_conf 999889809999137788898473322 Q gi|254780653|r 48 ELVEKKAKVVIFSHLGRPQSKSDKDCS 74 (400) Q Consensus 48 ~l~~~~akvii~SH~GRPkg~~~~~~S 74 (400) +|-.+|++|+|++-.|||-.++.-.|| T Consensus 2 ~l~a~Ga~vvllaR~Gqdls~~GiryS 28 (200) T pfam09916 2 ALAARGARVVLLARSGQDLSDYGLRYS 28 (200) T ss_pred CHHHCCCEEEEEEECCCCHHHCCCEEE T ss_conf 443278849999806999799197465 No 166 >PRK11366 puuD gamma-glutamyl-gamma-aminobutyrate hydrolase; Provisional Probab=24.47 E-value=52 Score=14.01 Aligned_cols=22 Identities=18% Similarity=0.534 Sum_probs=10.7 Q ss_pred HHHHHHHHCCCCCCCEEEEEEEC Q ss_conf 77765331016678718999607 Q gi|254780653|r 176 ELSMLESCFSESKKPLVAIVGGS 198 (400) Q Consensus 176 Ei~~L~~~l~~~~~P~v~IiGGa 198 (400) |+.-+..+++ .+.|+++|=.|. T Consensus 96 E~~L~~~A~~-~~~PiLGICRG~ 117 (254) T PRK11366 96 SMALINAALE-RRIPIFAICRGL 117 (254) T ss_pred HHHHHHHHHH-CCCCEEEECHHH T ss_conf 9999999998-599979860355 No 167 >PRK05733 single-stranded DNA-binding protein; Provisional Probab=24.27 E-value=52 Score=14.04 Aligned_cols=72 Identities=14% Similarity=0.148 Sum_probs=37.3 Q ss_pred CEEEEEECCCCC-------CCCCCCCCCHHHHHHHHHHHCCCC---CCCCC-CCCHHHHHHHHHHCCCCCEEECC-HHHH Q ss_conf 809999137788-------898473322478888764310133---54566-63102556655313646489824-2430 Q gi|254780653|r 54 AKVVIFSHLGRP-------QSKSDKDCSLFKVVSIAESILHKN---ILFVN-DCIGSTLSQSIASLSEGGIILAE-NVRF 121 (400) Q Consensus 54 akvii~SH~GRP-------kg~~~~~~Sl~~v~~~l~~~l~~~---V~fi~-d~~g~~~~~~i~~l~~g~ilLLE-N~Rf 121 (400) .||+|+.++||- .|.....+||.---.+..+--+.. ..|++ .+.+..++.....|+.|.-+++| -||. T Consensus 6 NkV~LiG~lg~DPElr~t~~G~~v~~fslAt~~~~~~~~~ge~~e~T~w~~v~~wgk~AE~~~~yl~KG~~V~VeGrL~t 85 (172) T PRK05733 6 NKVILVGTCGQDPEVRYLPNGNAVTNLSLATSEQWTDKQSGQKVERTEWHRVSLFGKVAEIAGEYLRKGSQVYIEGKLQT 85 (172) T ss_pred CEEEEEEECCCCCEEEECCCCCEEEEEEEEEECCCCCCCCCCEECCCCEEEEEEECHHHHHHHHHCCCCCEEEEEEEEEE T ss_conf 28999755577963888899987999999972550016787583123228998734578888865389998999988884 Q ss_pred CCCC Q ss_conf 3432 Q gi|254780653|r 122 YSEE 125 (400) Q Consensus 122 ~~~E 125 (400) ..-| T Consensus 86 r~we 89 (172) T PRK05733 86 REWE 89 (172) T ss_pred ECHH T ss_conf 0475 No 168 >cd03238 ABC_UvrA The excision repair protein UvrA; Nucleotide excision repair in eubacteria is a process that repairs DNA damage by the removal of a 12-13-mer oligonucleotide containing the lesion. Recognition and cleavage of the damaged DNA is a multistep ATP-dependent reaction that requires the UvrA, UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases, with UvrA having two ATP binding sites, which have the characteristic signature of the family of ABC proteins, and UvrB having one ATP binding site that is structurally related to that of helicases. Probab=24.24 E-value=53 Score=13.98 Aligned_cols=26 Identities=19% Similarity=0.174 Sum_probs=14.3 Q ss_pred CCCCCCCHHHHHHHHCCCCC--EEEECC Q ss_conf 34321000246653124364--247544 Q gi|254780653|r 122 YSEEERNDPDFVRMLSRNGD--FYINDA 147 (400) Q Consensus 122 ~~~E~~n~~~f~k~La~laD--iyVnDA 147 (400) .++-++-.-.+|+.|+.-.+ +++.|= T Consensus 88 LSGGqkQRvaiAraL~~~p~~~ililDE 115 (176) T cd03238 88 LSGGELQRVKLASELFSEPPGTLFILDE 115 (176) T ss_pred CCHHHHHHHHHHHHHHHCCCCCEEEECC T ss_conf 8999999999999998689986899717 No 169 >PRK13480 3'-5' exoribonuclease YhaM; Provisional Probab=24.24 E-value=53 Score=13.98 Aligned_cols=17 Identities=18% Similarity=0.276 Sum_probs=12.3 Q ss_pred CCCCHHHHHHHHHHHHH Q ss_conf 00332556577765331 Q gi|254780653|r 167 SYIGRAMQKELSMLESC 183 (400) Q Consensus 167 s~aG~l~ekEi~~L~~~ 183 (400) +|.|=|+|.-++.+.-+ T Consensus 155 ay~gGLLEHtlsv~~la 171 (314) T PRK13480 155 EFVSGLAYHVVSMLRLA 171 (314) T ss_pred CCCCHHHHHHHHHHHHH T ss_conf 33226999999999999 No 170 >COG0313 Predicted methyltransferases [General function prediction only] Probab=24.09 E-value=53 Score=13.96 Aligned_cols=150 Identities=19% Similarity=0.281 Sum_probs=81.4 Q ss_pred HHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHH---HHHHCCCCCEE Q ss_conf 999988989999988980999913778889847332247888876431013354566631025566---55313646489 Q gi|254780653|r 38 RIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGSTLSQ---SIASLSEGGII 114 (400) Q Consensus 38 RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~~---~i~~l~~g~il 114 (400) -=....+-|-..+++|.++-++|.=|-| .-.||-+.|-.-+.. .|.+|.-++ |+.+.- ....+....-+ T Consensus 62 ne~~~~~~li~~l~~g~~valVSDAG~P-~ISDPG~~LV~~a~~----~gi~V~~lP---G~sA~~tAL~~SGl~~~~F~ 133 (275) T COG0313 62 NEKEKLPKLIPLLKKGKSVALVSDAGTP-LISDPGYELVRAARE----AGIRVVPLP---GPSALITALSASGLPSQRFL 133 (275) T ss_pred CHHHHHHHHHHHHHCCCEEEEEECCCCC-CCCCCCHHHHHHHHH----CCCCEEECC---CCCHHHHHHHHCCCCCCCEE T ss_conf 6778799999998669848998058997-666850999999998----699189668---83499999997589999726 Q ss_pred ECCHHHHCCCCCCCHHHHHHHHCCCCCEEEECCHHHHCCCCCCEEECCCCCCCCCCHHHHHHH------------HHHHH Q ss_conf 824243034321000246653124364247544044011376333023321000332556577------------76533 Q gi|254780653|r 115 LAENVRFYSEEERNDPDFVRMLSRNGDFYINDAFSVSHRAHASITGLSHLLPSYIGRAMQKEL------------SMLES 182 (400) Q Consensus 115 LLEN~Rf~~~E~~n~~~f~k~La~laDiyVnDAF~~aHR~haS~~gi~~~lps~aG~l~ekEi------------~~L~~ 182 (400) .. -|.+...+.....-+.|+.-....|. |-+.||..++...+...+++-.=..+.+|+ ..+.. T Consensus 134 F~---GFLP~k~~~R~~~l~~l~~~~~t~If--yEsphRl~~tL~d~~~~~g~~r~v~v~RELTK~~Ee~~~g~~~e~~~ 208 (275) T COG0313 134 FE---GFLPRKSKERRKRLEALANEPRTLIF--YESPHRLLATLEDIVEVLGSDREVVVARELTKLFEEIYRGTLSELIE 208 (275) T ss_pred EE---CCCCCCCCHHHHHHHHHHHCCCEEEE--EECCHHHHHHHHHHHHHCCCCCEEEEEEECCCCHHHEECCCHHHHHH T ss_conf 84---11799832789999998745773999--96636589999999997188965999873353113134163999999 Q ss_pred HCC-C-CCCCEEEEEEECCH Q ss_conf 101-6-67871899960780 Q gi|254780653|r 183 CFS-E-SKKPLVAIVGGSKV 200 (400) Q Consensus 183 ~l~-~-~~~P~v~IiGGaKi 200 (400) .+. + .+.++|.|+.|..- T Consensus 209 ~~~~~~~KGE~vlvv~~~~~ 228 (275) T COG0313 209 WLEEDTLKGEFVLVVEGKNK 228 (275) T ss_pred HHHCCCCCCCEEEEEECCCC T ss_conf 86405776548999947864 No 171 >TIGR01316 gltA glutamate synthase (NADPH), homotetrameric; InterPro: IPR006004 One pathway for the assimilation of ammonia and glutamate biosynthesis involves homotetrameric glutamate synthase (1.4.1.13 from EC) which transfers the amide group of glutamine to 2-oxoglutarate to yield two molecules of glutamate .2 L-glutamate + NADP^+ = L-glutamine + 2-oxoglutarate + NADPH + H^+. . Probab=23.98 E-value=53 Score=13.95 Aligned_cols=79 Identities=25% Similarity=0.429 Sum_probs=55.1 Q ss_pred HHHHHHCCCC-CEEEECCHHHHCCCCCCE-EECCCC-CCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHH Q ss_conf 4665312436-424754404401137633-302332-1000332556577765331016678718999607802557889 Q gi|254780653|r 131 DFVRMLSRNG-DFYINDAFSVSHRAHASI-TGLSHL-LPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLL 207 (400) Q Consensus 131 ~f~k~La~la-DiyVnDAF~~aHR~haS~-~gi~~~-lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i 207 (400) .-|..||+.+ .+-|.+|. |+.=-=+ ||||.| ||.-. ++.||+.|+++=-...- =.++| |==.| T Consensus 156 ~cA~elAk~Gh~VtvfEAL---hKPGGV~~YGIPefRLpKei---~~~E~k~LkklGv~fr~--~~lvG------kt~TL 221 (462) T TIGR01316 156 ACASELAKKGHEVTVFEAL---HKPGGVLAYGIPEFRLPKEI---VETEVKKLKKLGVKFRT--DYLVG------KTVTL 221 (462) T ss_pred HHHHHHHHCCCEEEEEEEC---CCCCCEEEECCCCCCCCHHH---HHHHHHHHHHCCEEEEE--CCEEC------CCHHH T ss_conf 8899997479869999714---89985675368885487578---89888876326637994--43750------51128 Q ss_pred HHHHHCC--CEEEECHHH Q ss_conf 9877306--888862057 Q gi|254780653|r 208 INLVKKV--DKLVIGGGM 223 (400) Q Consensus 208 ~~L~~k~--D~iiigG~l 223 (400) +.|.++. |=++||=|- T Consensus 222 ~eL~~~YGfDAVFIgtGA 239 (462) T TIGR01316 222 EELLEKYGFDAVFIGTGA 239 (462) T ss_pred HHHHHHCCCCEEEEEECC T ss_conf 888875197079995068 No 172 >COG0737 UshA 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases [Nucleotide transport and metabolism] Probab=23.96 E-value=53 Score=13.95 Aligned_cols=30 Identities=30% Similarity=0.357 Sum_probs=23.4 Q ss_pred HHHHHHHHHHHHHHCCC-EEEEEECCCCCCC Q ss_conf 99998898999998898-0999913778889 Q gi|254780653|r 38 RIERVVPTILELVEKKA-KVVIFSHLGRPQS 67 (400) Q Consensus 38 RI~~~~pTI~~l~~~~a-kvii~SH~GRPkg 67 (400) -+.++-..+..|-++++ .||++||.|.+.. T Consensus 188 ~~e~~~~~i~elk~~~vD~iI~LsH~G~~~d 218 (517) T COG0737 188 PIEAAKKYIPELKGEGVDVIIALSHLGIEDD 218 (517) T ss_pred HHHHHHHHHHHHHHCCCCEEEECCCCCCCCH T ss_conf 9999999999987357989999126786633 No 173 >PRK05329 anaerobic glycerol-3-phosphate dehydrogenase subunit B; Validated Probab=23.81 E-value=41 Score=14.66 Aligned_cols=28 Identities=14% Similarity=0.253 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHHHHHCCC---------EEEEEECCCC Q ss_conf 899998898999998898---------0999913778 Q gi|254780653|r 37 TRIERVVPTILELVEKKA---------KVVIFSHLGR 64 (400) Q Consensus 37 ~RI~~~~pTI~~l~~~~a---------kvii~SH~GR 64 (400) .++.+.+.-.+.++++.. ..-++|++|- T Consensus 82 ~~v~~~~~~~~~ll~~~gl~l~g~~~~N~~~~TplGt 118 (425) T PRK05329 82 DAVLELLQQFQALLEAAGLPLVGSVEQNHLRVTPLGT 118 (425) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCEECCCCCCC T ss_conf 9999999999999997198643787666452356555 No 174 >pfam03808 Glyco_tran_WecB Glycosyl transferase WecB/TagA/CpsF family. Probab=23.63 E-value=54 Score=13.91 Aligned_cols=140 Identities=11% Similarity=0.029 Sum_probs=68.7 Q ss_pred HHHCCCEEEECHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCCEEC Q ss_conf 77306888862057899999830143433331000013568899875302455430354113432323333102532120 Q gi|254780653|r 210 LVKKVDKLVIGGGMANSFLVAQGMGVGRSLCQRDFSDNVHQIAWEAKRSACEIIVPRDVVVAREMKTGIPTQVVSAQSVP 289 (400) Q Consensus 210 L~~k~D~iiigG~lantfL~A~G~~IG~Sl~e~~~~~~ak~i~~~a~~~~~~I~lP~D~~v~~~~~~~~~~~~~~~~~i~ 289 (400) .+..+|.++.-|.-.-..++-.|..+... -...|....+++.+.+++.+|++ .-+++ T Consensus 2 ~~~~ad~~~~DG~~i~~~~k~~g~~~~~r---v~G~dl~~~ll~~~~~~~~~v~l----lG~~~---------------- 58 (172) T pfam03808 2 ALNSADLVLPDGIGVVWAARLLGRPLPER---VAGTDLIPALLERAAERGKRVFL----LGGKP---------------- 58 (172) T ss_pred HHHHCCEEEECCHHHHHHHHHCCCCCCCC---CCHHHHHHHHHHHHHHCCCEEEE----EECCH---------------- T ss_conf 46539989309799999999859999751---68599999999999864983899----80888---------------- Q ss_pred CCCCCCCCCCHHHHHHHHHHCCCCEE-EEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCHHHH---HHHHC Q ss_conf 13345765200356777640577679-997662532274365889999999986111488589974804699---89984 Q gi|254780653|r 290 LDSIILDVGFKTVEYIKQVIAQARTV-MWNGPLGVFEIEPFDRATVEVAHYVAKLTKERRIISIAGGGDTIT---ALAHA 365 (400) Q Consensus 290 ~~~~i~DIGp~Ti~~~~~~I~~AktI-~WNGP~GvfE~~~F~~GT~~l~~~ia~~~~~~~~~sivGGGdT~a---ai~~~ 365 (400) ++-.+..+.+.+.-.+-+.+ +.+||++-- --.++++.|.+. +--+.+||=|---+ +.+.. T Consensus 59 ------~~~~~~~~~l~~~yP~l~i~g~~~g~f~~~-------e~~~i~~~I~~~---~~div~vglG~PkQE~~~~~~~ 122 (172) T pfam03808 59 ------GVLEKAAARLRARYPGLRIVGTHDGYFSPE-------EEEAIIEAINAS---GPDLLFVGLGAPKQEKWIARNR 122 (172) T ss_pred ------HHHHHHHHHHHHHCCCCEEEEEECCCCCHH-------HHHHHHHHHHHC---CCCEEEEECCCCHHHHHHHHHH T ss_conf ------999999999998879955999879999868-------999999999845---9999999569817799999999 Q ss_pred CCCCCCCEEECCHHHHHHHHCCCCC Q ss_conf 8876864687507998998748998 Q gi|254780653|r 366 GISNEFTYVSTAGGAFLEWLEGKDL 390 (400) Q Consensus 366 g~~~~~~hvSTgGGA~L~~L~G~~L 390 (400) ...+. . +.-|-||+++|++|+.= T Consensus 123 ~~~~~-~-v~~~vGa~~d~~aG~~~ 145 (172) T pfam03808 123 ARLPV-P-VFIGVGGSFDFLAGTVK 145 (172) T ss_pred HHCCC-C-EEEECCCHHHHHHCCCC T ss_conf 87799-8-79852401246637856 No 175 >TIGR01497 kdpB K+-transporting ATPase, B subunit; InterPro: IPR006391 These sequences describe the P-type ATPase subunit of the complex responsible for translocating potassium ions across biological membranes in microbes. In Escherichia coli and other species, this complex consists of the proteins KdpA, KdpB, KdpC and KdpF. KdpB is the ATPase subunit, while KdpA is the potassium-ion translocating subunit . The function of KdpC is unclear, although it has been suggested to couple the ATPase subunit to the ion-translocating subunit , while KdpF serves to stabilize the complex . The potassium P-type ATPases have been characterised as Type IA based on a phylogenetic analysis which places this clade closest to the heavy-metal translocating ATPases (Type IB) . Others place this clade closer to the Na+/K+ antiporter type (Type IIC) based on physical characteristics . ; GO: 0008556 potassium-transporting ATPase activity, 0006813 potassium ion transport, 0016021 integral to membrane. Probab=23.61 E-value=43 Score=14.55 Aligned_cols=135 Identities=16% Similarity=0.159 Sum_probs=71.2 Q ss_pred CCCCEEEECCHHHHCCCCCCEEECC-CCCCCC-CCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCC Q ss_conf 4364247544044011376333023-321000-33255657776533101667871899960780255788998773068 Q gi|254780653|r 138 RNGDFYINDAFSVSHRAHASITGLS-HLLPSY-IGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKKVD 215 (400) Q Consensus 138 ~laDiyVnDAF~~aHR~haS~~gi~-~~lps~-aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~D 215 (400) -++-.|++=+|+-..=+-+-.+..- ...|.- -|+|=.==+.-++|++.-+ ++|.=|=| ++--=++=.-+++|-= T Consensus 233 v~atL~pfa~y~g~~i~v~~LvALLVcLIPTTIGGLLSAIGiAGMdRv~~fN---VIAtSGrA-VEAcGDVdtl~LDKTG 308 (675) T TIGR01497 233 VVATLYPFAAYAGVKISVSVLVALLVCLIPTTIGGLLSAIGIAGMDRVLRFN---VIATSGRA-VEACGDVDTLVLDKTG 308 (675) T ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC---EEEECCCE-EEECCCCCEEEECCCC T ss_conf 9998755654268633088999999874000033688886542355673055---24524640-2412872347764657 Q ss_pred EEEECHHHHHHHHHHHCCCCC--------CCCCCCCCCHHHHHHHHHHHHCCCCCCC-EEEEECCCCCCCCC Q ss_conf 888620578999998301434--------3333100001356889987530245543-03541134323233 Q gi|254780653|r 216 KLVIGGGMANSFLVAQGMGVG--------RSLCQRDFSDNVHQIAWEAKRSACEIIV-PRDVVVAREMKTGI 278 (400) Q Consensus 216 ~iiigG~lantfL~A~G~~IG--------~Sl~e~~~~~~ak~i~~~a~~~~~~I~l-P~D~~v~~~~~~~~ 278 (400) .|=+|==+|.-|.-++|++.- -|+.+ .-.+=|.|.+++++.+.++-. |...+--.+|.... T Consensus 309 TITlGNRlA~eFIp~~G~d~~~lad~a~~aSl~D--dTPEGkSIV~Lakqlgir~d~~~~~~a~fveFtAqt 378 (675) T TIGR01497 309 TITLGNRLAEEFIPVQGVDKKKLADVAVLASLAD--DTPEGKSIVELAKQLGIRIDDKKSESAEFVEFTAQT 378 (675) T ss_pred CCCCCCHHHHHCCCCCCCCHHHHHHHHHHHCCCC--CCCCCHHHHHHHHHHCCCCCCCCCCEEEEECCCCCC T ss_conf 4033211053344558834888985887601135--898836888888780861014745323210132333 No 176 >PRK06751 single-stranded DNA-binding protein; Provisional Probab=23.45 E-value=43 Score=14.56 Aligned_cols=64 Identities=13% Similarity=0.135 Sum_probs=31.7 Q ss_pred CEEEEEECCCCC-------CCCCCCCCCHHHHHHHHHHHCCCCCCCCC-CCCHHHHHHHHHHCCCCCEEECC Q ss_conf 809999137788-------89847332247888876431013354566-63102556655313646489824 Q gi|254780653|r 54 AKVVIFSHLGRP-------QSKSDKDCSLFKVVSIAESILHKNILFVN-DCIGSTLSQSIASLSEGGIILAE 117 (400) Q Consensus 54 akvii~SH~GRP-------kg~~~~~~Sl~~v~~~l~~~l~~~V~fi~-d~~g~~~~~~i~~l~~g~ilLLE 117 (400) .||+|+-++||. .|.....+||.---.+-.+-=.++..|++ -..+..++...+.|+.|..+++| T Consensus 2 NkV~LiGrL~~DPElr~t~~G~~v~~f~lAv~r~~~~~~g~~~td~~~vv~wgk~AE~~~~yl~KG~~V~Ve 73 (172) T PRK06751 2 NRVILVGRLTKDPDLRYTPNGVAVATFTLAVNRAFANQQGEREADFINCVIWRKQAENVANYLKKGSLAGVD 73 (172) T ss_pred CEEEEEEECCCCCEEEECCCCCEEEEEEEEECCCEECCCCCEEEEEEEEEEECHHHHHHHHHHCCCCEEEEE T ss_conf 779999876878758888999879999999667602678976778999997288899999875789989999 No 177 >PRK13948 shikimate kinase; Provisional Probab=23.42 E-value=19 Score=16.90 Aligned_cols=29 Identities=21% Similarity=0.236 Sum_probs=18.4 Q ss_pred EEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCC Q ss_conf 0999913778889847332247888876431013354566 Q gi|254780653|r 55 KVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVN 94 (400) Q Consensus 55 kvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~ 94 (400) .|+|+...|- || .+ +.+.|++.|+.+ |+| T Consensus 12 ~IvLIG~mGs--GK----St---iGk~LA~~l~~~--fiD 40 (182) T PRK13948 12 FVALAGFMGT--GK----SR---IGWELSRALALH--FVD 40 (182) T ss_pred CEEEECCCCC--CH----HH---HHHHHHHHHCCC--EEE T ss_conf 1898899999--88----99---999999996959--888 No 178 >COG3962 Acetolactate synthase [Amino acid transport and metabolism] Probab=23.42 E-value=55 Score=13.88 Aligned_cols=41 Identities=24% Similarity=0.300 Sum_probs=33.6 Q ss_pred HHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCC Q ss_conf 65777653310166787189996078025578899877306 Q gi|254780653|r 174 QKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKKV 214 (400) Q Consensus 174 ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~ 214 (400) ++|+...-.+++..+||++..=||.+.|+--..+..+.+.. T Consensus 216 ~~eL~~A~~lik~ak~PlIvaGGGv~YS~A~~~L~af~E~~ 256 (617) T COG3962 216 ERELADAAALIKSAKKPLIVAGGGVLYSGAREALRAFAETH 256 (617) T ss_pred HHHHHHHHHHHHHCCCCEEEECCCEEECHHHHHHHHHHHHC T ss_conf 79999999999835797799668654430799999999864 No 179 >PRK00421 murC UDP-N-acetylmuramate--L-alanine ligase; Provisional Probab=23.28 E-value=55 Score=13.86 Aligned_cols=58 Identities=12% Similarity=0.192 Sum_probs=24.2 Q ss_pred CCCCEEEECC--HHHH-CCCCCCEEECCCCCCC---CCCHHHHHHHHHHHHHCCCCCCCEEEEEE Q ss_conf 4364247544--0440-1137633302332100---03325565777653310166787189996 Q gi|254780653|r 138 RNGDFYINDA--FSVS-HRAHASITGLSHLLPS---YIGRAMQKELSMLESCFSESKKPLVAIVG 196 (400) Q Consensus 138 ~laDiyVnDA--F~~a-HR~haS~~gi~~~lps---~aG~l~ekEi~~L~~~l~~~~~P~v~IiG 196 (400) .-.|+||-++ |..| +.-+|.+.-+++.-|- +-| -|+.=.++..+++++-+..-.+|+- T Consensus 155 ~~~~~~V~E~dE~d~sf~~~~P~iavitNI~~DHLD~h~-s~e~Y~~ak~~~~~~~~~~g~~ViN 218 (459) T PRK00421 155 GSSDYFVAEADESDRSFLKLHPDIAVVTNIDADHLDYYG-DFEDLKDAFQEFAHNLPFYGALVAC 218 (459) T ss_pred CCCCEEEEEECCCCCCHHHCCCCEEEEECCCHHHHHHCC-CHHHHHHHHHHHHHCCCCCCCEECC T ss_conf 889889999535414364458989999179877865601-8999999999999719954721146 No 180 >PTZ00340 O-sialoglycoprotein endopeptidase; Provisional Probab=23.22 E-value=55 Score=13.86 Aligned_cols=14 Identities=36% Similarity=0.617 Sum_probs=7.9 Q ss_pred CCCCCEEEEEEECC Q ss_conf 66787189996078 Q gi|254780653|r 186 ESKKPLVAIVGGSK 199 (400) Q Consensus 186 ~~~~P~v~IiGGaK 199 (400) ..+.|++.++.|.. T Consensus 129 ~~~~P~~LlVSGGH 142 (348) T PTZ00340 129 GSENPVVLYVSGGN 142 (348) T ss_pred CCCCCEEEEECCCC T ss_conf 98785699986897 No 181 >TIGR02707 butyr_kinase butyrate kinase; InterPro: IPR011245 This group represents bacterial butyrate kinase, an enzyme that facilitates the formation of butyryl-CoA by phosphorylating butyrate in the presence of ATP to form butyryl phosphate . The final steps in butyrate synthesis by anaerobic bacteria can occur via butyrate kinase and phosphotransbutyrylase or via butyryl-CoA:acetate CoA-transferase, the latter providing the dominant route for butyrate formation in human colonic bacteria .; GO: 0005524 ATP binding, 0047761 butyrate kinase activity, 0016310 phosphorylation, 0005737 cytoplasm. Probab=23.13 E-value=29 Score=15.63 Aligned_cols=195 Identities=22% Similarity=0.311 Sum_probs=88.7 Q ss_pred CCCCEEEEEEECCCCCCC----CEECCH--HHHHHHHHHHHHHHHCCCEEE-EEECCCC-----C--CCCCCCCC----- Q ss_conf 599689998201671318----577725--899998898999998898099-9913778-----8--89847332----- Q gi|254780653|r 13 IRGLRCLLRVDWNVPFID----GKVADV--TRIERVVPTILELVEKKAKVV-IFSHLGR-----P--QSKSDKDC----- 73 (400) Q Consensus 13 ~~gk~VlvRvD~NvPi~~----g~I~d~--~RI~~~~pTI~~l~~~~akvi-i~SH~GR-----P--kg~~~~~~----- 73 (400) ++|.+++.+-++-=-++. .+|.|. +|-+..+ +.|-++|-++. +-+=.|| | .|-+.-+- T Consensus 17 f~de~~~~~~tlrH~~eEL~~f~~v~dQfeFR~~~i~---~~L~e~Gi~~~~l~AvVgRGGLLkPi~GGTY~Vn~~MleD 93 (353) T TIGR02707 17 FEDEKPLFEETLRHSVEELAKFKNVIDQFEFRKQVIL---EVLEEKGINISKLDAVVGRGGLLKPIEGGTYLVNEKMLED 93 (353) T ss_pred EECCEEEEEECCCCCHHHHHCCCCEEECHHHHHHHHH---HHHHHCCCCHHHEEEEEECCCCCCCCCCCEEEECHHHHHH T ss_conf 6578067864462587873056640211268999999---9987408871243157972770233389604776556888 Q ss_pred -----------CHH-HHHHHHHHHCC-CCCCCCCCCCHHHH--HHHHHHCCCCCEEECCHHHHCCCCCCCHHHHHHHHCC Q ss_conf -----------247-88887643101-33545666310255--6655313646489824243034321000246653124 Q gi|254780653|r 74 -----------SLF-KVVSIAESILH-KNILFVNDCIGSTL--SQSIASLSEGGIILAENVRFYSEEERNDPDFVRMLSR 138 (400) Q Consensus 74 -----------Sl~-~v~~~l~~~l~-~~V~fi~d~~g~~~--~~~i~~l~~g~ilLLEN~Rf~~~E~~n~~~f~k~La~ 138 (400) -|. -++..|.+-++ .|.+-+|-..=++- ...|.-++. +=--.=||+ -|-..-||++|+ T Consensus 94 Lk~~~~GeHASNLGaIIA~~lA~~~~diPafIVDPVVVDEledvARISG~P~----i~RkSIfHA---LNqKAvARr~A~ 166 (353) T TIGR02707 94 LKEGKRGEHASNLGAIIARELADELNDIPAFIVDPVVVDELEDVARISGLPE----IERKSIFHA---LNQKAVARRIAK 166 (353) T ss_pred HHHCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCEECCCCCHHHHCCCCC----CCCEEHHHH---HHHHHHHHHHHH T ss_conf 8510588784336899999987651894358826833505510132138898----641104444---338899999999 Q ss_pred -CCCEE--------------------------EECC------HHHHCCCCCCEEECCCCCC-CCCCHHHHHHHHHHHHHC Q ss_conf -36424--------------------------7544------0440113763330233210-003325565777653310 Q gi|254780653|r 139 -NGDFY--------------------------INDA------FSVSHRAHASITGLSHLLP-SYIGRAMQKELSMLESCF 184 (400) Q Consensus 139 -laDiy--------------------------VnDA------F~~aHR~haS~~gi~~~lp-s~aG~l~ekEi~~L~~~l 184 (400) ++--| ||+| || =-|+ -|+ =...+.. ||.|-.=-.| ++|-+ T Consensus 167 e~gK~YE~~N~ivaHlGGGISvaAH~~Gr~vDVNNALdGeGPFS-PERs-G~L-P~~dlv~lCySGkyT~~E---~kKkI 240 (353) T TIGR02707 167 ELGKRYEEMNLIVAHLGGGISVAAHRKGRVVDVNNALDGEGPFS-PERS-GTL-PLGDLVDLCYSGKYTKEE---MKKKI 240 (353) T ss_pred HCCCCCCCCCEEEEECCCCCEEEEECCCCEEEEECCCCCCCCCC-CCCC-CCC-CHHHHHHHHHCCCHHHHH---HHHHH T ss_conf 72896004454899828870341454861799734778432938-7545-656-588899987276103889---99985 Q ss_pred CCCCCCEEEEEEECCHHHHHHHHHH-----------H---------------HHCCCEEEECHHHH Q ss_conf 1667871899960780255788998-----------7---------------73068888620578 Q gi|254780653|r 185 SESKKPLVAIVGGSKVSTKITLLIN-----------L---------------VKKVDKLVIGGGMA 224 (400) Q Consensus 185 ~~~~~P~v~IiGGaKisdKi~~i~~-----------L---------------~~k~D~iiigG~la 224 (400) ..+--+++-||=.-+..=...|++ | --+||-|+++||+| T Consensus 241 -~G~GGl~ayLGTnD~reV~~~I~~GD~kA~~ildAMaYQiAKeIG~mavVL~G~VDAI~LTGGlA 305 (353) T TIGR02707 241 -VGKGGLVAYLGTNDAREVEKKIEAGDEKAKLILDAMAYQIAKEIGKMAVVLKGKVDAIILTGGLA 305 (353) T ss_pred -CCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCCEEEEEECCCHH T ss_conf -26745332545588889999997310889999986545875677425040037266898546013 No 182 >PRK08945 short chain dehydrogenase; Provisional Probab=23.11 E-value=56 Score=13.84 Aligned_cols=64 Identities=22% Similarity=0.206 Sum_probs=39.3 Q ss_pred CCCCCCCHHHHCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHH Q ss_conf 98887710100159968999820167131857772589999889899999889809999137788898473322478888 Q gi|254780653|r 1 MSRLRTMNDLRDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVS 80 (400) Q Consensus 1 M~~~~ti~d~~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~ 80 (400) ||..+--.|. |+||+||| .- -..-|=+ .+-+.|.++|++|++. +|. .-.++..++ T Consensus 1 ~~~~~p~~~~--L~gK~~lI-------TG-----as~GIG~--aiA~~la~~Ga~Vil~---~r~------~~~l~~~~~ 55 (245) T PRK08945 1 MMHYQPKPDL--LKDRIILV-------TG-----AGDGIGR--EAALTYARHGATVILL---GRT------EEKLEAVYD 55 (245) T ss_pred CCCCCCCCCC--CCCCEEEE-------EC-----CCHHHHH--HHHHHHHHCCCEEEEE---ECC------HHHHHHHHH T ss_conf 9999999867--89798999-------48-----8618999--9999999879989999---698------899999999 Q ss_pred HHHHHCCCC Q ss_conf 764310133 Q gi|254780653|r 81 IAESILHKN 89 (400) Q Consensus 81 ~l~~~l~~~ 89 (400) .+++.-+.. T Consensus 56 el~~~~~~~ 64 (245) T PRK08945 56 EIEAAGGPQ 64 (245) T ss_pred HHHHCCCCC T ss_conf 999747984 No 183 >PRK13774 formimidoylglutamase; Provisional Probab=23.10 E-value=56 Score=13.84 Aligned_cols=43 Identities=7% Similarity=0.152 Sum_probs=22.5 Q ss_pred CCCHHHHHHHHHH----HHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHC Q ss_conf 0332556577765----331016678718999607802557889987730 Q gi|254780653|r 168 YIGRAMQKELSML----ESCFSESKKPLVAIVGGSKVSTKITLLINLVKK 213 (400) Q Consensus 168 ~aG~l~ekEi~~L----~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k 213 (400) |-+..+|.-.+.+ .+++.....|. +|||.. +-..+.+..+.+. T Consensus 97 ~~~~~Le~aq~~l~~~v~~~l~~g~~pi--vLGGdH-~ia~~~~~g~~~~ 143 (311) T PRK13774 97 HDHEELIDTQKEFAMLAAKSIANHRQTF--LLGGGH-DIAYAQYLATRKV 143 (311) T ss_pred CCCCCHHHHHHHHHHHHHHHHHCCCCEE--EECCCC-HHHHHHHHHHHHH T ss_conf 5972399999999999999997699528--988860-7888989999852 No 184 >PRK11031 guanosine pentaphosphate phosphohydrolase; Provisional Probab=23.05 E-value=47 Score=14.32 Aligned_cols=73 Identities=14% Similarity=0.160 Sum_probs=33.7 Q ss_pred CCEEEEEEECCCCCCCCEECCHHHHHHHHHHHH---HHHHC-CCE-E-EEEECCCCCCCCCCCCCCHHHHHHHHHHHCCC Q ss_conf 968999820167131857772589999889899---99988-980-9-99913778889847332247888876431013 Q gi|254780653|r 15 GLRCLLRVDWNVPFIDGKVADVTRIERVVPTIL---ELVEK-KAK-V-VIFSHLGRPQSKSDKDCSLFKVVSIAESILHK 88 (400) Q Consensus 15 gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~---~l~~~-~ak-v-ii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~ 88 (400) .-+.-+|.--.+. .+|.+ ++.-+++++.+++ .+++. ++. + ++.+.==|- -++- .-+.+..++.+|. T Consensus 35 ~~k~~vRLg~gl~-~~g~L-s~ea~~r~l~~L~~F~~~l~~~~~~~vraVATsAlR~-A~N~-----~~Fl~~v~~~~G~ 106 (494) T PRK11031 35 RIKRKVRLAAGLN-SDNAL-SNEAMERGWQCLRLFAERLQDIPPSQIRVVATATLRL-AVNA-----DEFIAKAQEILGC 106 (494) T ss_pred HEEEEECCCCCCC-CCCCC-CHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEHHHHC-CCCH-----HHHHHHHHHHHCC T ss_conf 2478951767754-36994-9999999999999999999757997799996699885-9699-----9999999999699 Q ss_pred CCCCCCC Q ss_conf 3545666 Q gi|254780653|r 89 NILFVND 95 (400) Q Consensus 89 ~V~fi~d 95 (400) +|.-++. T Consensus 107 ~IeVIsG 113 (494) T PRK11031 107 PVQVISG 113 (494) T ss_pred CEEEECH T ss_conf 8799770 No 185 >pfam10652 DUF2480 Protein of unknown function (DUF2480). All the members of this family are uncharacterized proteins, but the environment in which they are found on the bacterial genome suggests a function as a glucose-6-phosphate isomerase (EC 5.3.1.9). This could not, however, be confirmed. Probab=23.01 E-value=48 Score=14.26 Aligned_cols=31 Identities=23% Similarity=0.227 Sum_probs=16.7 Q ss_pred EEEEEEECCHH----HHHHHHHHHHHCCCEEEECH Q ss_conf 18999607802----55788998773068888620 Q gi|254780653|r 191 LVAIVGGSKVS----TKITLLINLVKKVDKLVIGG 221 (400) Q Consensus 191 ~v~IiGGaKis----dKi~~i~~L~~k~D~iiigG 221 (400) .|.|=|.++.+ --+.++..|...|.+|+.|- T Consensus 123 ~ViIKGCs~~~vp~~aY~~l~~kL~pvaksimyGE 157 (167) T pfam10652 123 PVIIKGCSDKPVPESAYVLLTQKLQPVAKSIMYGE 157 (167) T ss_pred EEEEECCCCCCCCHHHHHHHHHHHHHHHHHHEECC T ss_conf 59998899998858999999999999867850188 No 186 >PRK09246 amidophosphoribosyltransferase; Provisional Probab=22.90 E-value=56 Score=13.82 Aligned_cols=15 Identities=13% Similarity=0.277 Sum_probs=10.1 Q ss_pred HHHHHCCCCCEEECC Q ss_conf 655313646489824 Q gi|254780653|r 103 QSIASLSEGGIILAE 117 (400) Q Consensus 103 ~~i~~l~~g~ilLLE 117 (400) +.+.+++|||++.+. T Consensus 218 ~~irdv~PGEiv~I~ 232 (503) T PRK09246 218 EFVRDVAPGEAIYIT 232 (503) T ss_pred CEEEECCCCEEEEEE T ss_conf 289954898799995 No 187 >PRK07772 single-stranded DNA-binding protein; Provisional Probab=22.83 E-value=46 Score=14.38 Aligned_cols=64 Identities=11% Similarity=0.198 Sum_probs=37.4 Q ss_pred CEEEEEECCCC-C------CCCCCCCCCHHHHHHHHHHHCC----CCCCCCC-CCCHHHHHHHHHHCCCCCEEECC Q ss_conf 80999913778-8------8984733224788887643101----3354566-63102556655313646489824 Q gi|254780653|r 54 AKVVIFSHLGR-P------QSKSDKDCSLFKVVSIAESILH----KNILFVN-DCIGSTLSQSIASLSEGGIILAE 117 (400) Q Consensus 54 akvii~SH~GR-P------kg~~~~~~Sl~~v~~~l~~~l~----~~V~fi~-d~~g~~~~~~i~~l~~g~ilLLE 117 (400) .+|+|+-++|| | .|.....+++.---.+..+--+ .+..|++ .+.+..++...+.|+.|.-+++| T Consensus 5 ~~V~lvGnL~~DPe~r~t~~G~~v~~f~lA~~~r~~d~~~g~~~d~~t~~~~v~~wgklAe~~~~~l~KG~~V~V~ 80 (183) T PRK07772 5 TTITVVGNLTADPELRFTPSGAAVANFTVASTPRTYDRQTNEWKDGEALFLRCSIWRQAAENVAESLTKGMRVIVT 80 (183) T ss_pred CEEEEEEECCCCCEEEECCCCCEEEEEEEEECCCEEECCCCCEECCCEEEEEEEEEHHHHHHHHHHCCCCCEEEEE T ss_conf 8899987568797088899999699999985585272688847716247999998179999988744699899999 No 188 >PRK12702 mannosyl-3-phosphoglycerate phosphatase; Reviewed Probab=22.58 E-value=57 Score=13.78 Aligned_cols=66 Identities=17% Similarity=0.268 Sum_probs=42.3 Q ss_pred CCEECC--HHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 857772--589999889899999889809999137788898473322478888764310133545666310255665531 Q gi|254780653|r 30 DGKVAD--VTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGSTLSQSIAS 107 (400) Q Consensus 30 ~g~I~d--~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~~~i~~ 107 (400) ||..+| ++....+.|+|+.|-+++-.||+.|. |-- . =...|.+-|+..-.||- T Consensus 9 DGTLLDh~~ys~~~A~~~L~~L~~~~IPvI~~SS------KT~--a----Ev~~lr~~L~l~~PfIv------------- 63 (302) T PRK12702 9 DGSLLDLEFNSYGAARQALAALERRSIPLVLYSL------RTR--A----QLEHLCRQLRLEHPFIC------------- 63 (302) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECC------CCH--H----HHHHHHHHCCCCCCEEE------------- T ss_conf 6545388877726799999999987996797464------449--9----99999997399999898------------- Q ss_pred CCCCCEEECCHHHH Q ss_conf 36464898242430 Q gi|254780653|r 108 LSEGGIILAENVRF 121 (400) Q Consensus 108 l~~g~ilLLEN~Rf 121 (400) ++|..+.+..--| T Consensus 64 -ENGaaI~~P~~~f 76 (302) T PRK12702 64 -EDGSAIYVPEHYF 76 (302) T ss_pred -ECCCEEECCCCCC T ss_conf -0797886033336 No 189 >pfam01867 Cas_Cas1 CRISPR associated protein Cas1. Clustered regularly interspaced short palindromic repeats (CRISPRs) are a family of DNA direct repeats found in many prokaryotic genomes. This family of proteins corresponds to Cas1, a CRISPR-associated protein. Cas1 may be involved in linking DNA segments to CRISPR. Probab=22.57 E-value=51 Score=14.09 Aligned_cols=61 Identities=15% Similarity=0.175 Sum_probs=41.1 Q ss_pred CCCCEEEEEEE----CCCCCCC-CEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCC Q ss_conf 59968999820----1671318-5777258999988989999988980999913778889847332 Q gi|254780653|r 13 IRGLRCLLRVD----WNVPFID-GKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDC 73 (400) Q Consensus 13 ~~gk~VlvRvD----~NvPi~~-g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~ 73 (400) .++.++.|+.+ .-+|+.+ ..|.=...+.-+-..|..|.++|-.|+.++.-|+|.|...+.. T Consensus 13 ~~~~~l~v~~~~~~~~~iPl~~i~~i~i~g~v~ist~~l~~l~~~gI~v~f~~~~G~~~g~l~p~~ 78 (319) T pfam01867 13 KKGNTLVVENKGEKKAEIPLEDIDEIVIFGGVSISTAALRLLAENGIPVHFFDYYGRYLGRLYPEE 78 (319) T ss_pred EECCEEEEEECCCEEEEECHHHCCEEEEECCCEECHHHHHHHHHCCCEEEEECCCCCEEEEEECCC T ss_conf 989999999899678897178878899976963889999999987991999899998889996788 No 190 >cd07090 ALDH_F9_TMBADH NAD+-dependent 4-trimethylaminobutyraldehyde dehydrogenase, ALDH family 9A1. NAD+-dependent, 4-trimethylaminobutyraldehyde dehydrogenase (TMABADH, EC=1.2.1.47), also known as aldehyde dehydrogenase family 9 member A1 (ALDH9A1) in humans, is a cytosolic tetramer which catalyzes the oxidation of gamma-aminobutyraldehyde involved in 4-aminobutyric acid (GABA) biosynthesis and also oxidizes betaine aldehyde (gamma-trimethylaminobutyraldehyde) which is involved in carnitine biosynthesis. Probab=22.46 E-value=36 Score=15.05 Aligned_cols=50 Identities=12% Similarity=0.276 Sum_probs=21.7 Q ss_pred CCEEEECHHHHHHHHH-----HHCCCCCCC---------CCCCCCCHHHHHHHHHHHHCCCCCC Q ss_conf 6888862057899999-----830143433---------3310000135688998753024554 Q gi|254780653|r 214 VDKLVIGGGMANSFLV-----AQGMGVGRS---------LCQRDFSDNVHQIAWEAKRSACEII 263 (400) Q Consensus 214 ~D~iiigG~lantfL~-----A~G~~IG~S---------l~e~~~~~~ak~i~~~a~~~~~~I~ 263 (400) .+++++--.++..|+. ++.+.+|.. ++.+...+.++++++.+...|.+++ T Consensus 261 ~~rv~V~~si~d~f~~~l~~~~~~l~vG~p~~~~~~~GPli~~~~~~~v~~~i~~a~~~Ga~i~ 324 (457) T cd07090 261 GTRVFVQRSIKDEFTERLVERTKKIRIGDPLDEDTQMGALISEEHLEKVLGYIESAKQEGAKVL 324 (457) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEE T ss_conf 8763000437999999999654217657965555741773399999999999999985699999 No 191 >COG3010 NanE Putative N-acetylmannosamine-6-phosphate epimerase [Carbohydrate transport and metabolism] Probab=22.35 E-value=57 Score=13.75 Aligned_cols=114 Identities=20% Similarity=0.262 Sum_probs=55.7 Q ss_pred EEEECCCCCCC--CEEC--CHHHHHHHHHHHHHHHHCCCEEEEEEC--CCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 98201671318--5777--258999988989999988980999913--77888984733224788887643101335456 Q gi|254780653|r 20 LRVDWNVPFID--GKVA--DVTRIERVVPTILELVEKKAKVVIFSH--LGRPQSKSDKDCSLFKVVSIAESILHKNILFV 93 (400) Q Consensus 20 vRvD~NvPi~~--g~I~--d~~RI~~~~pTI~~l~~~~akvii~SH--~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi 93 (400) +|.-.+||+.. .+=. .+-||...+.-++.|.+.|+-||-+-- .-||.| ++..+..+ .++.|+ .+- T Consensus 61 i~~~v~vPIIGIiKrd~~~s~v~ITptlkeVd~L~~~Ga~IIA~DaT~R~RP~~------~~~~~i~~-~k~~~~--l~M 131 (229) T COG3010 61 IRAVVDVPIIGIIKRDYPDSPVRITPTLKEVDALAEAGADIIAFDATDRPRPDG------DLEELIAR-IKYPGQ--LAM 131 (229) T ss_pred HHHHCCCCEEEEEECCCCCCCCEECCCHHHHHHHHHCCCCEEEEECCCCCCCCC------HHHHHHHH-HHCCCC--EEE T ss_conf 986178876888805899999355661899999997799099962556879843------59999997-335794--787 Q ss_pred CCCCHHHHHHHHHHCCCCCEEECCHHHHCCCC----CCCHHHHHHHHCCCCCEEEE Q ss_conf 66310255665531364648982424303432----10002466531243642475 Q gi|254780653|r 94 NDCIGSTLSQSIASLSEGGIILAENVRFYSEE----ERNDPDFVRMLSRNGDFYIN 145 (400) Q Consensus 94 ~d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E----~~n~~~f~k~La~laDiyVn 145 (400) .||... ++.....+-|==+.=--++=|.++ ..+|-.|.|.|+. ++.||. T Consensus 132 AD~St~--ee~l~a~~~G~D~IGTTLsGYT~~~~~~~~pDf~lvk~l~~-~~~~vI 184 (229) T COG3010 132 ADCSTF--EEGLNAHKLGFDIIGTTLSGYTGYTEKPTEPDFQLVKQLSD-AGCRVI 184 (229) T ss_pred ECCCCH--HHHHHHHHCCCCEEECCCCCCCCCCCCCCCCCHHHHHHHHH-CCCEEE T ss_conf 325988--88888997399678224201468998778972899999986-799399 No 192 >PRK08883 ribulose-phosphate 3-epimerase; Provisional Probab=22.20 E-value=58 Score=13.73 Aligned_cols=79 Identities=20% Similarity=0.208 Sum_probs=50.0 Q ss_pred HHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCCEEEECHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 53310166787189996078025578899877306888862057899999830143433331000013568899875302 Q gi|254780653|r 180 LESCFSESKKPLVAIVGGSKVSTKITLLINLVKKVDKLVIGGGMANSFLVAQGMGVGRSLCQRDFSDNVHQIAWEAKRSA 259 (400) Q Consensus 180 L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~D~iiigG~lantfL~A~G~~IG~Sl~e~~~~~~ak~i~~~a~~~~ 259 (400) ++.+-+...++-+|+ +.+|.+..++.+++.+|.+++=.. --|+. |+++. +..++.++++.+..++++ T Consensus 99 i~~Ik~~g~k~Glal----nP~T~~~~l~~~l~~~D~VLvMtV-------~PGf~-GQ~f~-~~~l~Ki~~l~~~~~~~~ 165 (220) T PRK08883 99 LQLIKEHGCQAGVVL----NPATPLAHLEYIMDKVDLILLMSV-------NPGFG-GQSFI-PHTLDKLRAVRKMIDASG 165 (220) T ss_pred HHHHHHCCCCEEEEE----CCCCCHHHHHHHHHHCCEEEEEEE-------CCCCC-CCCCC-HHHHHHHHHHHHHHHHCC T ss_conf 999998599668884----799987999999974697999874-------58988-75455-779999999999887449 Q ss_pred CCCCCEEEEECC Q ss_conf 455430354113 Q gi|254780653|r 260 CEIIVPRDVVVA 271 (400) Q Consensus 260 ~~I~lP~D~~v~ 271 (400) .++.+-+|--+. T Consensus 166 ~~~~I~VDGGI~ 177 (220) T PRK08883 166 RDIRLEIDGGVK 177 (220) T ss_pred CCCEEEEECCCC T ss_conf 980799989878 No 193 >PRK06196 oxidoreductase; Provisional Probab=22.12 E-value=58 Score=13.72 Aligned_cols=39 Identities=21% Similarity=0.222 Sum_probs=28.2 Q ss_pred CHHHHCCCCCEEEEEEECCCCCCCCEECCHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 101001599689998201671318577725899998898999998898099991 Q gi|254780653|r 7 MNDLRDIRGLRCLLRVDWNVPFIDGKVADVTRIERVVPTILELVEKKAKVVIFS 60 (400) Q Consensus 7 i~d~~d~~gk~VlvRvD~NvPi~~g~I~d~~RI~~~~pTI~~l~~~~akvii~S 60 (400) +.|+ |++||+|+|-- -+.-| -..|-+.|.++|++||+.. T Consensus 19 ~~~~-dL~GK~~vITG------a~sGI--------G~~tA~~La~~Ga~Vil~~ 57 (316) T PRK06196 19 LAGL-DLSGKTAIVTG------GYSGL--------GLETTRALAQAGAHVVVPA 57 (316) T ss_pred HCCC-CCCCCEEEECC------CCCHH--------HHHHHHHHHHCCCEEEEEE T ss_conf 5287-99999899917------99679--------9999999997899899994 No 194 >pfam06838 Alum_res Aluminium resistance protein. This family represents the aluminium resistance protein, which confers resistance to aluminium in bacteria. Probab=21.83 E-value=59 Score=13.68 Aligned_cols=19 Identities=21% Similarity=0.214 Sum_probs=9.5 Q ss_pred EECHHHHHHHHHHHCCCCC Q ss_conf 8620578999998301434 Q gi|254780653|r 218 VIGGGMANSFLVAQGMGVG 236 (400) Q Consensus 218 iigG~lantfL~A~G~~IG 236 (400) +-|..++-.++...|+++- T Consensus 284 lKga~~~A~~~e~lG~~v~ 302 (405) T pfam06838 284 LKGAHFTARVLELLGFPVL 302 (405) T ss_pred HHHHHHHHHHHHHCCCCCC T ss_conf 9999999999998499468 No 195 >COG4531 ZnuA ABC-type Zn2+ transport system, periplasmic component/surface adhesin [Inorganic ion transport and metabolism] Probab=21.49 E-value=57 Score=13.78 Aligned_cols=43 Identities=23% Similarity=0.290 Sum_probs=30.2 Q ss_pred CCCCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCC Q ss_conf 7652003567776405776799976625322743658899999999861114 Q gi|254780653|r 295 LDVGFKTVEYIKQVIAQARTVMWNGPLGVFEIEPFDRATVEVAHYVAKLTKE 346 (400) Q Consensus 295 ~DIGp~Ti~~~~~~I~~AktI~WNGP~GvfE~~~F~~GT~~l~~~ia~~~~~ 346 (400) .--|.+++..+++.|++.+..- +|-.|+|. -++++.+++-|+- T Consensus 240 v~PGA~rl~~Ir~~l~e~~a~C------vFaEPQF~---Pkvve~v~~GT~v 282 (318) T COG4531 240 VQPGAKRLAEIRTQLKEQKATC------VFAEPQFR---PKVVETVAEGTSV 282 (318) T ss_pred CCCCHHHHHHHHHHHHHHCCCE------EECCCCCC---HHHHHHHHCCCCC T ss_conf 4743789999999999727967------72189876---1899998547754 No 196 >COG0541 Ffh Signal recognition particle GTPase [Intracellular trafficking and secretion] Probab=21.38 E-value=60 Score=13.62 Aligned_cols=191 Identities=19% Similarity=0.286 Sum_probs=116.9 Q ss_pred CEEEEEEECCCCCCC------------CEECC-----HHHHHHHHHHHHHHHH----------CCCEEEEE-ECCCCCCC Q ss_conf 689998201671318------------57772-----5899998898999998----------89809999-13778889 Q gi|254780653|r 16 LRCLLRVDWNVPFID------------GKVAD-----VTRIERVVPTILELVE----------KKAKVVIF-SHLGRPQS 67 (400) Q Consensus 16 k~VlvRvD~NvPi~~------------g~I~d-----~~RI~~~~pTI~~l~~----------~~akvii~-SH~GRPkg 67 (400) ++-|+-.|.|+|+.- .++.- ++=|+..-.-+-.++- +...+|++ .-||- | T Consensus 35 r~ALLeADVnl~vVk~fi~~ikera~g~ev~~~l~p~q~~iKiV~eELv~llG~~~~~l~l~~~~P~vImmvGLQGs--G 112 (451) T COG0541 35 RRALLEADVNLKVVKDFIKRIKERALGEEVPKGLTPGQQFIKIVYEELVKLLGGENSELNLAKKPPTVILMVGLQGS--G 112 (451) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEECCCC--C T ss_conf 99999644468999999999999861466788899899999999999999848887665037899858999815679--7 Q ss_pred CCCCCCCHHHHHHHHHHHCCCCCCCCC-CCCHHHHHHHHHHCCCCCEEECCHHHHCCCCCCCHHH-HHH-----HHCCCC Q ss_conf 847332247888876431013354566-6310255665531364648982424303432100024-665-----312436 Q gi|254780653|r 68 KSDKDCSLFKVVSIAESILHKNILFVN-DCIGSTLSQSIASLSEGGIILAENVRFYSEEERNDPD-FVR-----MLSRNG 140 (400) Q Consensus 68 ~~~~~~Sl~~v~~~l~~~l~~~V~fi~-d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~~n~~~-f~k-----~La~la 140 (400) +..|....+.||.+ -+++|..+. |++-+.+.++.+.|..- =++-||..+...+|. .++ .-.... T Consensus 113 ---KTTt~~KLA~~lkk-~~~kvllVaaD~~RpAA~eQL~~La~q-----~~v~~f~~~~~~~Pv~Iak~al~~ak~~~~ 183 (451) T COG0541 113 ---KTTTAGKLAKYLKK-KGKKVLLVAADTYRPAAIEQLKQLAEQ-----VGVPFFGSGTEKDPVEIAKAALEKAKEEGY 183 (451) T ss_pred ---HHHHHHHHHHHHHH-CCCCEEEEECCCCCHHHHHHHHHHHHH-----CCCCEECCCCCCCHHHHHHHHHHHHHHCCC T ss_conf ---48689999999997-499458985056786899999999986-----098531677889979999999999997499 Q ss_pred CEEEECCHHHHCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHC--CCEEE Q ss_conf 4247544044011376333023321000332556577765331016678718999607802557889987730--68888 Q gi|254780653|r 141 DFYINDAFSVSHRAHASITGLSHLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKK--VDKLV 218 (400) Q Consensus 141 DiyVnDAF~~aHR~haS~~gi~~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k--~D~ii 218 (400) |+-+-|- |-|-|- ---+-.|+..+..++ +|.. .+.|+-..==-|-.+.-+.+-+. ...++ T Consensus 184 DvvIvDT---AGRl~i-------------de~Lm~El~~Ik~~~-~P~E-~llVvDam~GQdA~~~A~aF~e~l~itGvI 245 (451) T COG0541 184 DVVIVDT---AGRLHI-------------DEELMDELKEIKEVI-NPDE-TLLVVDAMIGQDAVNTAKAFNEALGITGVI 245 (451) T ss_pred CEEEEEC---CCCCCC-------------HHHHHHHHHHHHHHC-CCCE-EEEEEECCCCHHHHHHHHHHHHHCCCCEEE T ss_conf 8899968---873303-------------099999999998553-9874-899876444567899999986626986499 Q ss_pred EC-------HHHHHHHHHHHCCCC Q ss_conf 62-------057899999830143 Q gi|254780653|r 219 IG-------GGMANSFLVAQGMGV 235 (400) Q Consensus 219 ig-------G~lantfL~A~G~~I 235 (400) ++ ||-|-..-...|..| T Consensus 246 lTKlDGdaRGGaALS~~~~tg~PI 269 (451) T COG0541 246 LTKLDGDARGGAALSARAITGKPI 269 (451) T ss_pred EECCCCCCCCHHHHHHHHHHCCCE T ss_conf 971467876228885699878985 No 197 >TIGR00078 nadC nicotinate-nucleotide pyrophosphorylase; InterPro: IPR004393 Nicotinate-nucleotide pyrophosphorylase (2.4.2.19 from EC), also known as quinolinate phosphoribosyltransferase (decarboxylating), catalyses the conversion of nicotinate D-ribonucleotide, pyrophosphate and carbon dioxide into pyridine-2,3-dicarboxylate and 5-phospho-alpha-D-ribose 1-diphosphate. This enzyme is a type II phosphoribosyltransferase which provides the de novo source of nicotinate mononucleotide (NAMN) for NAD biosynthesis in both prokaryotes and eukaryotes , . Structural studies have shown that the active form of this enzyme is a homodimer with an unsual fold , , . The N-terminal forms a mixed alpha/beta domain, while the C-terminal region forms a multi-stranded, open alpha/beta barrel. The active site is found at the C-terminal ends of the beta strands of the alpha/beta barrel, and is bordered by the N-terminal domain of the second subunit of the dimer. It contains several conserved charged residues thought to be important determinants of substrate binding and catalysis. ; GO: 0004514 nicotinate-nucleotide diphosphorylase (carboxylating) activity, 0019363 pyridine nucleotide biosynthetic process. Probab=21.34 E-value=45 Score=14.41 Aligned_cols=134 Identities=19% Similarity=0.272 Sum_probs=66.0 Q ss_pred CCEEEEEEECCHHHHHHHHHHHHHCCCEEEECHHHHHHHHHHHCCCCCCC---CCCCCCCHHHHHH---HHHHHHCCCCC Q ss_conf 87189996078025578899877306888862057899999830143433---3310000135688---99875302455 Q gi|254780653|r 189 KPLVAIVGGSKVSTKITLLINLVKKVDKLVIGGGMANSFLVAQGMGVGRS---LCQRDFSDNVHQI---AWEAKRSACEI 262 (400) Q Consensus 189 ~P~v~IiGGaKisdKi~~i~~L~~k~D~iiigG~lantfL~A~G~~IG~S---l~e~~~~~~ak~i---~~~a~~~~~~I 262 (400) +|.+-|.|==|---=++++++ . =+.+|||. .|..|-| ++.+|.+..+-.+ ++.+++ .+.. T Consensus 120 ~~~~~i~dTRKTTPGLR~~EK---y--AVr~GGG~--------nHR~~LsDavmiKDNHi~~~Gs~~~Av~~aR~-~~~~ 185 (276) T TIGR00078 120 NPKVRIADTRKTTPGLRLLEK---Y--AVRVGGGD--------NHRLGLSDAVMIKDNHIAAAGSIEKAVKRARA-ALPF 185 (276) T ss_pred CCCEEEEEEECCCCCCCCEEE---E--EEEECCCC--------CCCCCCCCEEEEEECEEECCCCHHHHHHHHHH-HCCC T ss_conf 993389985257874301023---1--35535885--------78688877389750206606898999999998-4899 Q ss_pred CCEEEEECCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHH-CCCCEEEEECCCCCCCCCCHH-HHHHHHHHHH Q ss_conf 43035411343232333310253212013345765200356777640-577679997662532274365-8899999999 Q gi|254780653|r 263 IVPRDVVVAREMKTGIPTQVVSAQSVPLDSIILDVGFKTVEYIKQVI-AQARTVMWNGPLGVFEIEPFD-RATVEVAHYV 340 (400) Q Consensus 263 ~lP~D~~v~~~~~~~~~~~~~~~~~i~~~~~i~DIGp~Ti~~~~~~I-~~AktI~WNGP~GvfE~~~F~-~GT~~l~~~i 340 (400) .++++|=| +|.+...+.+ .+|..|+ .++|+ ..++..++.+ T Consensus 186 ~~kiEVEV-----------------------------enlE~a~eA~~AGADiIm---------LDNm~p~~~~~av~~~ 227 (276) T TIGR00078 186 AKKIEVEV-----------------------------ENLEEAEEAAEAGADIIM---------LDNMKPEEIKEAVELL 227 (276) T ss_pred CCEEEEEE-----------------------------CCHHHHHHHHHCCCCEEE---------CCCCCHHHHHHHHHHH T ss_conf 80799862-----------------------------898999999970995998---------0698947999999999 Q ss_pred HHHHCCCCCEEEECCCHHHHHHHHCCCCCCCCEEECC Q ss_conf 8611148858997480469989984887686468750 Q gi|254780653|r 341 AKLTKERRIISIAGGGDTITALAHAGISNEFTYVSTA 377 (400) Q Consensus 341 a~~~~~~~~~sivGGGdT~aai~~~g~~~~~~hvSTg 377 (400) ... ...+..=+-||=|..=+..|-... ..|||.| T Consensus 228 ~~~--~p~~~~EaSGGitl~n~~~ya~~g-VD~IS~G 261 (276) T TIGR00078 228 KGR--NPNVLVEASGGITLDNIEEYAETG-VDVISSG 261 (276) T ss_pred HHC--CCEEEEEEECCCCHHHHHHHHHCC-CCEEECC T ss_conf 702--990899983699878999984089-7588327 No 198 >cd06542 GH18_EndoS-like Endo-beta-N-acetylglucosaminidases are bacterial chitinases that hydrolyze the chitin core of various asparagine (N)-linked glycans and glycoproteins. The endo-beta-N-acetylglucosaminidases have a glycosyl hydrolase family 18 (GH18) catalytic domain. Some members also have an additional C-terminal glycosyl hydrolase family 20 (GH20) domain while others have an N-terminal domain of unknown function (pfam08522). Members of this family include endo-beta-N-acetylglucosaminidase S (EndoS) from Streptococcus pyogenes, EndoF1, EndoF2, EndoF3, and EndoH from Flavobacterium meningosepticum, and EndoE from Enterococcus faecalis. EndoS is a secreted endoglycosidase from Streptococcus pyogenes that specifically hydrolyzes the glycan on human IgG between two core N-acetylglucosamine residues. EndoE is a secreted endoglycosidase, encoded by the ndoE gene in Enterococcus faecalis, that hydrolyzes the glycan on human RNase B. Probab=21.33 E-value=60 Score=13.61 Aligned_cols=106 Identities=13% Similarity=0.046 Sum_probs=47.3 Q ss_pred HHHHHH-HHHHHCCCEEE---EEECCCCCCCCCCCCCCHHHHHHHHHHHC------CCCCCCCCCCCH---------HH- Q ss_conf 988989-99998898099---99137788898473322478888764310------133545666310---------25- Q gi|254780653|r 41 RVVPTI-LELVEKKAKVV---IFSHLGRPQSKSDKDCSLFKVVSIAESIL------HKNILFVNDCIG---------ST- 100 (400) Q Consensus 41 ~~~pTI-~~l~~~~akvi---i~SH~GRPkg~~~~~~Sl~~v~~~l~~~l------~~~V~fi~d~~g---------~~- 100 (400) ....+. +.+-.+|-||+ +..|+|---......-+.+..|+.|...+ |.++.+-+...+ +. T Consensus 51 ~~~~~~~~~l~~kGtKv~~~~~~~~~g~g~~~~~~~~~~~~yA~~l~d~v~~yglDG~D~D~E~~~~~~~~~~~~~~~~~ 130 (255) T cd06542 51 TNKETYIRPLQAKGTKVLLSILGNHLGAGFANNLSDAAAKAYAKAIVDTVDKYGLDGVDFDDEYSGYGKNGTSQPSNEAF 130 (255) T ss_pred HHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCCHHHH T ss_conf 77877788898657389999832244566678888999999999999999982998667636656677888777735899 Q ss_pred ---HHHHHHHCCC-CCEEECCHHHHCCCCCCCHHHHHHHHCCCCCEEEECCHHHHC Q ss_conf ---5665531364-648982424303432100024665312436424754404401 Q gi|254780653|r 101 ---LSQSIASLSE-GGIILAENVRFYSEEERNDPDFVRMLSRNGDFYINDAFSVSH 152 (400) Q Consensus 101 ---~~~~i~~l~~-g~ilLLEN~Rf~~~E~~n~~~f~k~La~laDiyVnDAF~~aH 152 (400) +++.-+.+.| +.+|.+ .++-.......+.+++++|++|+.|++.+. T Consensus 131 ~~~v~eL~k~~GP~~Kll~i------d~~~~~~~~~~~~~~~y~DY~i~Q~Ygs~~ 180 (255) T cd06542 131 VRLIKELRKYMGPTDKLLTI------DGYGQALSNDGEEVSPYVDYVIYQYYGSSS 180 (255) T ss_pred HHHHHHHHHHCCCCCCEEEE------ECCCCCCCCCCCCHHHHCCEEEEECCCCCC T ss_conf 99999999763998827999------587753344653301106789986447998 No 199 >COG4122 Predicted O-methyltransferase [General function prediction only] Probab=20.77 E-value=33 Score=15.29 Aligned_cols=56 Identities=21% Similarity=0.118 Sum_probs=28.0 Q ss_pred HHHHHHHHHHHHC--C-CEEEECHHHHHHHH-HHHCCC-CC---CCCCCCCCCHHHHHHHHHHH Q ss_conf 2557889987730--6-88886205789999-983014-34---33331000013568899875 Q gi|254780653|r 201 STKITLLINLVKK--V-DKLVIGGGMANSFL-VAQGMG-VG---RSLCQRDFSDNVHQIAWEAK 256 (400) Q Consensus 201 sdKi~~i~~L~~k--~-D~iiigG~lantfL-~A~G~~-IG---~Sl~e~~~~~~ak~i~~~a~ 256 (400) ...-..|..|++. . --+=||.+.++.=+ .|.+.. =| .--.+++..+.|++.++++- T Consensus 45 ~e~g~~L~~L~~~~~~k~iLEiGT~~GySal~mA~~l~~~g~l~tiE~~~e~~~~A~~n~~~ag 108 (219) T COG4122 45 PETGALLRLLARLSGPKRILEIGTAIGYSALWMALALPDDGRLTTIERDEERAEIARENLAEAG 108 (219) T ss_pred HHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHCCCCCEEEEEECCHHHHHHHHHHHHHCC T ss_conf 1489999999973498649996352379999999638889769997079899999999999759 No 200 >COG2256 MGS1 ATPase related to the helicase subunit of the Holliday junction resolvase [DNA replication, recombination, and repair] Probab=20.75 E-value=62 Score=13.54 Aligned_cols=55 Identities=9% Similarity=0.155 Sum_probs=29.2 Q ss_pred EECCHHHHHHHHHHHHHHHHCCC--EEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCC Q ss_conf 77725899998898999998898--09999137788898473322478888764310133545666 Q gi|254780653|r 32 KVADVTRIERVVPTILELVEKKA--KVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVND 95 (400) Q Consensus 32 ~I~d~~RI~~~~pTI~~l~~~~a--kvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d 95 (400) ++.-...+..--..|...++.+. +.|+ +|-|.- . +.| ++..+...++.+...++- T Consensus 25 e~vGQ~HLlg~~~~lrr~v~~~~l~SmIl---~GPPG~-G--KTT---lA~liA~~~~~~f~~~sA 81 (436) T COG2256 25 EVVGQEHLLGEGKPLRRAVEAGHLHSMIL---WGPPGT-G--KTT---LARLIAGTTNAAFEALSA 81 (436) T ss_pred HHCCHHHHHCCCCHHHHHHHCCCCCEEEE---ECCCCC-C--HHH---HHHHHHHHHCCCEEEECC T ss_conf 85571866189943899996499860577---789998-8--889---999998761776699515 No 201 >TIGR01511 ATPase-IB1_Cu copper-translocating P-type ATPase; InterPro: IPR006403 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. P-ATPases (sometime known as E1-E2 ATPases) (3.6.3.- from EC) are found in bacteria and in a number of eukaryotic plasma membranes and organelles . P-ATPases function to transport a variety of different compounds, including ions and phospholipids, across a membrane using ATP hydrolysis for energy. There are many different classes of P-ATPases, each of which transports a specific type of ion: H^+, Na^+, K^+, Mg^2+, Ca^2+, Ag^+ and Ag^2+, Zn^2+, Co^2+, Pb^2+, Ni^2+, Cd^2+, Cu^+ and Cu^2+. P-ATPases can be composed of one or two polypeptides, and can usually assume two main conformations called E1 and E2. This entry represents P-type ATPases that specialise in cation and copper transport; those transporting copper (3.6.3.4 from EC) have been classified as belonging to the IB subfamily , , . These proteins are involved in a variety of processes in both prokaryotes and eukaryotes. Some of these P-ATPases import copper into cells, such as in Arabidopsis sp., which require copper to create functional hormone receptors for ethylene signalling. Other P-ATPases are involved in copper export out of cells, such as occurs with the efflux of hepatic copper into the bile in eukaryotes. In bacteria, some of these P-ATPases play a role in osmotic adaptation. Certain copper ATPases have been implicated in both Wilson and Menkes diseases . More information about this protein can be found at Protein of the Month: ATP Synthases .; GO: 0004008 copper-exporting ATPase activity, 0005524 ATP binding, 0046872 metal ion binding, 0046873 metal ion transmembrane transporter activity, 0030001 metal ion transport, 0016020 membrane. Probab=20.74 E-value=62 Score=13.54 Aligned_cols=15 Identities=20% Similarity=0.171 Sum_probs=8.4 Q ss_pred CEEECCHHHHCCCCC Q ss_conf 489824243034321 Q gi|254780653|r 112 GIILAENVRFYSEEE 126 (400) Q Consensus 112 ~ilLLEN~Rf~~~E~ 126 (400) ..+.|=|-+|+.++. T Consensus 326 ~~~~~Gn~~~~~~~~ 340 (545) T TIGR01511 326 HKIQLGNEKLLGENG 340 (545) T ss_pred EEEEEECHHHHHHCC T ss_conf 489987679961104 No 202 >cd03287 ABC_MSH3_euk MutS3 homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding c Probab=20.64 E-value=62 Score=13.52 Aligned_cols=13 Identities=31% Similarity=0.220 Sum_probs=5.1 Q ss_pred HHHHHHHHHHHCC Q ss_conf 5657776533101 Q gi|254780653|r 173 MQKELSMLESCFS 185 (400) Q Consensus 173 ~ekEi~~L~~~l~ 185 (400) |..|+..+..+++ T Consensus 94 F~~e~~~~~~il~ 106 (222) T cd03287 94 FMVELSETSHILS 106 (222) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999998 No 203 >pfam01293 PEPCK_ATP Phosphoenolpyruvate carboxykinase. Probab=20.63 E-value=36 Score=15.04 Aligned_cols=30 Identities=17% Similarity=0.235 Sum_probs=19.8 Q ss_pred HHHHHCCCCCEEEECCHHHHCCCCCCEEECC Q ss_conf 6653124364247544044011376333023 Q gi|254780653|r 132 FVRMLSRNGDFYINDAFSVSHRAHASITGLS 162 (400) Q Consensus 132 f~k~La~laDiyVnDAF~~aHR~haS~~gi~ 162 (400) ....|. --|+||.|+|..|+.+|.--+-+- T Consensus 79 v~~yl~-~k~lyV~D~~aGad~~~rl~VRvi 108 (451) T pfam01293 79 VLDYLS-GKDLFVVDLFAGADPDYRLKVRVV 108 (451) T ss_pred HHHHHH-CCCEEEEEEEECCCHHHCEEEEEE T ss_conf 999970-796699851206683234307997 No 204 >PRK01722 formimidoylglutamase; Provisional Probab=20.54 E-value=63 Score=13.51 Aligned_cols=32 Identities=9% Similarity=0.377 Sum_probs=20.1 Q ss_pred HHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCC Q ss_conf 53310166787189996078025578899877306 Q gi|254780653|r 180 LESCFSESKKPLVAIVGGSKVSTKITLLINLVKKV 214 (400) Q Consensus 180 L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~ 214 (400) ..++++.... ..+|||.. +-.++.++.+.+.. T Consensus 110 v~~ll~~g~~--pIvLGGdH-sia~g~~~gl~~~~ 141 (320) T PRK01722 110 VSHCLKANPR--TIVLGGGH-EIAFGSFKGVADAF 141 (320) T ss_pred HHHHHHCCCE--EEEECCCC-HHHHHHHHHHHHHC T ss_conf 9999967986--89984764-25778799999865 No 205 >TIGR03639 cas1_NMENI CRISPR-associated protein Cas1, NMENI subtype. The CRISPR-associated protein Cas1 is virtually universal to CRISPR systems. CRISPR, an acronym for Clustered Regularly Interspaced Short Palindromic Repeats, is a prokaryotic immunity system for foreign DNA, mostly from phage. CRISPR systems belong to different subtypes, distinguished by both nature of the repeats, the makeup of the cohort of associated Cas proteins, and by molecular phylogeny within the more universal Cas proteins such as this one. This model is of type EXCEPTION and provides more specific information than the EQUIVALOG model TIGR00287. It describes the Cas1 variant of the NMENI subtype of CRISPR/Cas system. Probab=20.47 E-value=32 Score=15.41 Aligned_cols=59 Identities=10% Similarity=0.144 Sum_probs=41.3 Q ss_pred CCCCEEEEEEE---CCCCCCC--CEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCC Q ss_conf 59968999820---1671318--57772589999889899999889809999137788898473 Q gi|254780653|r 13 IRGLRCLLRVD---WNVPFID--GKVADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDK 71 (400) Q Consensus 13 ~~gk~VlvRvD---~NvPi~~--g~I~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~ 71 (400) +++.+..++-| .-+|+.+ .-+.....+.-+.+.+.+|.++|-.++.+++-|+|.|..-+ T Consensus 14 ~k~n~Lvi~~~~~~~~iPl~~I~~Ivi~~~~v~iss~ll~~l~~~~I~v~f~~~~g~~~g~~~p 77 (278) T TIGR03639 14 LKLNQLVVKKDGEEITLPLEDIDVILIENPQVTISSALLSALAENNIALIFCDEKHLPVGQLLP 77 (278) T ss_pred EECCEEEEEECCEEEEECHHHCCEEEECCCCEEECHHHHHHHHHCCCEEEEECCCCCEEEEEEC T ss_conf 9899899998991899876884889996897998799999999879979999899988899954 No 206 >PRK02048 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase; Provisional Probab=20.28 E-value=63 Score=13.47 Aligned_cols=235 Identities=14% Similarity=0.158 Sum_probs=120.5 Q ss_pred EEEEEEECCCCCC-CCEE-------CCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCC Q ss_conf 8999820167131-8577-------7258999988989999988980999913778889847332247888876431013 Q gi|254780653|r 17 RCLLRVDWNVPFI-DGKV-------ADVTRIERVVPTILELVEKKAKVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHK 88 (400) Q Consensus 17 ~VlvRvD~NvPi~-~g~I-------~d~~RI~~~~pTI~~l~~~~akvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~ 88 (400) +-=|+|- ||++. +.-| +|+.-+++++.-|+.|.++||-+|=+|-++. ..--.|+.+.+.|.+. +. T Consensus 11 Tr~V~vG-~v~iGg~~Pi~vQSMt~t~T~d~~atv~Qi~~l~~aGceiVRvtvp~~-----~~a~al~~I~~~l~~~-~~ 83 (613) T PRK02048 11 TSEVNIG-ATPLGGPNPIRIQSMTNTSTMDTEACVAQAKRIIDAGGEYVRLTTQGV-----REAENLMNINIGLRSQ-GY 83 (613) T ss_pred CEEEEEC-CCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCH-----HHHHHHHHHHHHHHHC-CC T ss_conf 5478986-965579996766646999861589999999999985999899888999-----9998499999998627-99 Q ss_pred CCCCCCCCCHHHHHHHHHHCCCCCEEECCHHHHCCCCC-----------CCHHHHHHHHCCCCCEEE-----ECCHHHHC Q ss_conf 35456663102556655313646489824243034321-----------000246653124364247-----54404401 Q gi|254780653|r 89 NILFVNDCIGSTLSQSIASLSEGGIILAENVRFYSEEE-----------RNDPDFVRMLSRNGDFYI-----NDAFSVSH 152 (400) Q Consensus 89 ~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~Rf~~~E~-----------~n~~~f~k~La~laDiyV-----nDAF~~aH 152 (400) +|..+.|.+... +-++...+. .|.+|-++|-- ..++++...+-..-+-|+ --..+++- T Consensus 84 ~iPlvADIHF~~-~~A~~a~~~-----~~kvRINPGN~~~~~~~~~~~~~~~~~~~~~~~~i~e~~~p~v~~~k~~~~~i 157 (613) T PRK02048 84 MVPLVADVHFNP-KVADVAAQY-----AEKVRINPGNYVDPGRTFKKLEYTDEEYAAEIQKIRERFVPFLNICKENHTAI 157 (613) T ss_pred CCCEEEECCCCH-HHHHHHHHH-----CCCEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCE T ss_conf 898797327887-999999985-----38687889977666555332222067788888989998899999999769977 Q ss_pred C---CCCCEEE--CCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHHHHHHHHHCCC-----E-EEEC- Q ss_conf 1---3763330--2332100033255657776533101667871899960780255788998773068-----8-8862- Q gi|254780653|r 153 R---AHASITG--LSHLLPSYIGRAMQKELSMLESCFSESKKPLVAIVGGSKVSTKITLLINLVKKVD-----K-LVIG- 220 (400) Q Consensus 153 R---~haS~~g--i~~~lps~aG~l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~~i~~L~~k~D-----~-iiig- 220 (400) | +|-|+-. ..+|=++-.| ++|.-++++.-+-+..=.-++.=+=-|.+..-+..-+-|.++.| + +=+| T Consensus 158 RIGvN~GSL~~ri~~~yg~tp~~-mveSAle~~~i~e~~~f~div~S~KaSn~~v~v~Ayrll~~~~~~~~~~yPlHlGv 236 (613) T PRK02048 158 RIGVNHGSLSDRIMSRYGDTPEG-MVESCMEFLRICVEEHFTDVVISIKASNTVVMVKTVRLLVSVMEAEGMHFPLHLGV 236 (613) T ss_pred EEECCCCCCCHHHHHHCCCCHHH-HHHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEE T ss_conf 98548676698889624998799-99999999999986799818999433528999999999999986336886605401 Q ss_pred ------------H--HHHHHHHHHHCCCCCCCCCCC--CCCHHHHHHHHHHHHCCCCCCCE Q ss_conf ------------0--578999998301434333310--00013568899875302455430 Q gi|254780653|r 221 ------------G--GMANSFLVAQGMGVGRSLCQR--DFSDNVHQIAWEAKRSACEIIVP 265 (400) Q Consensus 221 ------------G--~lantfL~A~G~~IG~Sl~e~--~~~~~ak~i~~~a~~~~~~I~lP 265 (400) - |++.....--|-.|--||.++ ..+..+++++...+++.....+| T Consensus 237 TEAG~~~~g~iKSa~gig~LL~~GiGDTiRvSlt~~p~~Ev~v~~~Lv~~~~~r~~~~~ip 297 (613) T PRK02048 237 TEAGDGEDGRIKSALGIGALLADGLGDTIRVSLSEDPEAEIPVARKLVDYILSRENHPYIP 297 (613) T ss_pred CCCCCCCCCEEHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 0467887840168999999997357220899668894556089999999997641478876 No 207 >PRK06739 pyruvate kinase; Validated Probab=20.25 E-value=63 Score=13.47 Aligned_cols=95 Identities=18% Similarity=0.259 Sum_probs=46.4 Q ss_pred HHHHHHHHHHCCCEEE--EEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCHH Q ss_conf 8898999998898099--99137788898473322478888764310133545666310255665531364648982424 Q gi|254780653|r 42 VVPTILELVEKKAKVV--IFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDCIGSTLSQSIASLSEGGIILAENV 119 (400) Q Consensus 42 ~~pTI~~l~~~~akvi--i~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~~~i~~l~~g~ilLLEN~ 119 (400) ...+|+.|+++|..+. =+||- -+ + ......+.+++ ++.++...-|.-|++.+.. .++.+.|.|-++- T Consensus 15 ~~e~l~~li~aGvnv~RiN~SHg-~~----e---~~~~~i~~ir~-~~~~i~Il~Dl~GpkiR~g--~~~~~~i~l~~g~ 83 (352) T PRK06739 15 NKETLAQLINNGMKIVRLNLSHG-TH----E---SHKDIIRLVKS-LDDSIKILGDVQGPKIRLG--EIKGEQITLQAGD 83 (352) T ss_pred CHHHHHHHHHCCCCEEEEECCCC-CH----H---HHHHHHHHHHH-HCCCCCEEEECCCCEEEEE--ECCCCCEEECCCC T ss_conf 99999999987998999989999-99----9---99999999997-4754606770889825888--7169817934899 Q ss_pred HH--CCCCC--------CCHHHHHHHHCCCCCEEEECC Q ss_conf 30--34321--------000246653124364247544 Q gi|254780653|r 120 RF--YSEEE--------RNDPDFVRMLSRNGDFYINDA 147 (400) Q Consensus 120 Rf--~~~E~--------~n~~~f~k~La~laDiyVnDA 147 (400) +| ...+. -+.+.+.+.+..---+|+.|. T Consensus 84 ~v~l~~~~~~~~~~~i~v~~~~l~~~vk~Gd~I~idDG 121 (352) T PRK06739 84 SFILHTQPVTGSSTEASVDYEGIANDVKVGSRILMNDG 121 (352) T ss_pred EEEEECCCCCCCCCEEECCHHHHHHHCCCCCEEEEECC T ss_conf 89984566578776674468998864778988998089 No 208 >TIGR00576 dut dUTP diphosphatase; InterPro: IPR008181 Synonym(s): dUTP diphosphatase, Deoxyuridine-triphosphatase. The essential enzyme dUTP pyrophosphatase (3.6.1.23 from EC) is specific for dUTP and is critical for the fidelity of DNA replication and repair. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate, simultaneously reducing dUTP levels and providing the dUMP for dTTP biosynthesis. dUTPase decreases the intracellular concentration of dUPT so that uracil cannot be incorporated into DNA . The crystal structure of human dUTPase reveals that each subunit of the dUTPase trimer folds into an eight-stranded jelly-roll beta barrel, with the C-terminal beta strands interchanged among the subunits. The structure is similar to that of the Escherichia coli enzyme, despite low sequence homology between the two enzymes . Other enzymes like deoxycytidine triphosphate deaminase (dCTP) (3.5.4.13 from EC) that specifically bind uridine also belong to this group suggesting that the signature may recognise a putative uridine-binding motif. Some retroviruses encode dUTPases. Retroviral dUTPase is synthesised as part of POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, dUTPase and RNase H. ; GO: 0016787 hydrolase activity, 0046080 dUTP metabolic process. Probab=20.23 E-value=39 Score=14.86 Aligned_cols=67 Identities=15% Similarity=0.222 Sum_probs=48.4 Q ss_pred CCCCCCEEEEECCCCCCCCCCCCCCCCCEECCC-CCCCCCCCHHHHH-HHHHHCCCCEEEEECC-CCCCCCCCHH Q ss_conf 245543035411343232333310253212013-3457652003567-7764057767999766-2532274365 Q gi|254780653|r 259 ACEIIVPRDVVVAREMKTGIPTQVVSAQSVPLD-SIILDVGFKTVEY-IKQVIAQARTVMWNGP-LGVFEIEPFD 330 (400) Q Consensus 259 ~~~I~lP~D~~v~~~~~~~~~~~~~~~~~i~~~-~~i~DIGp~Ti~~-~~~~I~~AktI~WNGP-~GvfE~~~F~ 330 (400) +.+.++|+|+++.-. .....++.+-+.+... ...+|.||-||.. |...|+ .||||== -=-|+.+.+. T Consensus 41 ~~~~Lv~TG~~~~lP--~g~~g~v~PRSGLA~K~gv~i~n~~GvID~DYRGE~k---V~L~N~gp~~~f~v~~Gd 110 (151) T TIGR00576 41 GERALVPTGIAIELP--EGYYGRVAPRSGLALKHGVTIDNSPGVIDADYRGEIK---VILINLGPKEDFTVKKGD 110 (151) T ss_pred CCEEEECCCEEEECC--CCCEEEEEECCCCCEEEEEEEECCCCEEECCCCCCEE---EEEEECCCCCCEEECCCC T ss_conf 984875244299848--9948997304413302468996678178178720268---999858999867885797 No 209 >TIGR02874 spore_ytfJ sporulation protein YtfJ; InterPro: IPR014229 Proteins in this entry, exemplified by YtfJ of Bacillus subtilis, are encoded by bacterial genomes if, and only if, the species is capable of endospore formation. YtfJ was confirmed in spores of B. subtilis; it appears to be expressed in the forespore under control of SigF .. Probab=20.14 E-value=64 Score=13.45 Aligned_cols=46 Identities=15% Similarity=0.229 Sum_probs=38.0 Q ss_pred CCEEEEECCCCCCCCCCCCCCCCCEEC-CCCCCCCCCCHHHHHHHHHHCCC Q ss_conf 430354113432323333102532120-13345765200356777640577 Q gi|254780653|r 263 IVPRDVVVAREMKTGIPTQVVSAQSVP-LDSIILDVGFKTVEYIKQVIAQA 312 (400) Q Consensus 263 ~lP~D~~v~~~~~~~~~~~~~~~~~i~-~~~~i~DIGp~Ti~~~~~~I~~A 312 (400) +-|+-|.|-.+ ...|..++++-. .-+.+.|.=|+.++.+++..++. T Consensus 85 ~~PvAFLvig~----~~iRLL~vd~~~H~~ek~iD~~Pql~e~~k~~~~~~ 131 (132) T TIGR02874 85 VKPVAFLVIGS----DQIRLLPVDQNTHLYEKLIDLVPQLLEKIKDLFSKN 131 (132) T ss_pred ECCEEEEEECC----CCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 13158998778----836875306765504666511478999999873037 No 210 >PTZ00311 phosphoenolpyruvate carboxykinase; Provisional Probab=20.06 E-value=36 Score=15.02 Aligned_cols=45 Identities=9% Similarity=0.118 Sum_probs=27.1 Q ss_pred HHHHCCCCCCC-HHHHHHHHCCCCCEEEECCHHHHCCCCCCEEECC Q ss_conf 24303432100-0246653124364247544044011376333023 Q gi|254780653|r 118 NVRFYSEEERN-DPDFVRMLSRNGDFYINDAFSVSHRAHASITGLS 162 (400) Q Consensus 118 N~Rf~~~E~~n-~~~f~k~La~laDiyVnDAF~~aHR~haS~~gi~ 162 (400) |..|-++--.. -......|.+.=.+||.|+|..|+..|.=-+-+- T Consensus 92 N~p~s~e~Fd~L~~kv~~yL~~kd~lyV~D~~aGAD~~~rl~VRVI 137 (543) T PTZ00311 92 NIPLSEESFEKVKKRAIDYLNSRERLFVIDGYAGWDERYRLKVRVI 137 (543) T ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEEECCCHHHEEEEEEE T ss_conf 7778999999999999999834896899843530785322338999 No 211 >PRK00439 leuD 3-isopropylmalate dehydratase small subunit; Reviewed Probab=20.05 E-value=28 Score=15.75 Aligned_cols=49 Identities=27% Similarity=0.600 Sum_probs=27.2 Q ss_pred HHHHHHHHCCCCCEEE-ECCHHH-HCCCCCCE----EECCCCC-CCCCCHHHHHHHH Q ss_conf 0246653124364247-544044-01137633----3023321-0003325565777 Q gi|254780653|r 129 DPDFVRMLSRNGDFYI-NDAFSV-SHRAHASI----TGLSHLL-PSYIGRAMQKELS 178 (400) Q Consensus 129 ~~~f~k~La~laDiyV-nDAF~~-aHR~haS~----~gi~~~l-ps~aG~l~ekEi~ 178 (400) +|+|+++.. -+|+-| -..||| |.|.||-. .||.-++ +|++......-++ T Consensus 39 ~p~f~~~~~-~gdIiVaG~NFG~GSSRE~A~~al~~~Gi~~VIA~SFa~If~rN~in 94 (163) T PRK00439 39 DPEFAKKVK-PGDIIVAGKNFGCGSSREHAPLALKAAGVSAVIAKSFARIFFRNAIN 94 (163) T ss_pred CCCCCCCCC-CCCEEEECCCCCCCCCHHHHHHHHHHCCCCEEEEHHHHHHHHHHHHH T ss_conf 864001466-87589968851688867999999998598799852378999832887 No 212 >TIGR01704 MTA/SAH-Nsdase MTA/SAH nucleosidase; InterPro: IPR010049 This entry represents the enzyme 5'-methylthioadenosine/S-adenosylhomocysteine nucleosidase which acts on its two substrates at the same active site. This enzyme is involved in the recycling of the components of S-adenosylmethionine after it has donated one of its two non-ribose sulphur ligands to an acceptor. In the case of 5'-methylthioadenosine this represents the first step of the methionine salvage pathway in bacteria , ,]. This enzyme is widely distributed in bacteria. ; GO: 0008782 adenosylhomocysteine nucleosidase activity, 0008930 methylthioadenosine nucleosidase activity, 0009164 nucleoside catabolic process, 0019509 methionine salvage. Probab=20.04 E-value=64 Score=13.44 Aligned_cols=142 Identities=18% Similarity=0.208 Sum_probs=86.6 Q ss_pred CCEEEEEECCCCCCCCCC------CCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHH--HHHHHCCCCCEEECCHHHHCCC Q ss_conf 980999913778889847------33224788887643101335456663102556--6553136464898242430343 Q gi|254780653|r 53 KAKVVIFSHLGRPQSKSD------KDCSLFKVVSIAESILHKNILFVNDCIGSTLS--QSIASLSEGGIILAENVRFYSE 124 (400) Q Consensus 53 ~akvii~SH~GRPkg~~~------~~~Sl~~v~~~l~~~l~~~V~fi~d~~g~~~~--~~i~~l~~g~ilLLEN~Rf~~~ 124 (400) ++|.+-++|.-=.-|+++ -+.-..++...|+.-|=.+ .+-||++-++-. =--..|+-|||+.=+.+|+|+- T Consensus 21 N~~T~~~~~CE~~TG~~~GTEV~LLKSGIGKV~AA~~~TLLL~-~~KPD~~INTGSAGGl~~TL~VGD~V~S~~~R~HD~ 99 (229) T TIGR01704 21 NAKTISLAGCEIYTGELEGTEVILLKSGIGKVAAALSATLLLD-RYKPDVVINTGSAGGLAHTLKVGDVVVSDDVRYHDV 99 (229) T ss_pred HHCCEEECCCEEECCCCCCEEEEEEECCCHHHHHHHHHHHHHH-CCCCCEEEECCCCCCCCCCCCCCCEEEECCCEECCC T ss_conf 0011022152122164467078886057237899988888875-079976985887544233132056787167402143 Q ss_pred CC---------------------------------CCHHHHHHHHCCCCCEEEECCHHHHCCCCCCEEECCCCCCCCCCH Q ss_conf 21---------------------------------000246653124364247544044011376333023321000332 Q gi|254780653|r 125 EE---------------------------------RNDPDFVRMLSRNGDFYINDAFSVSHRAHASITGLSHLLPSYIGR 171 (400) Q Consensus 125 E~---------------------------------~n~~~f~k~La~laDiyVnDAF~~aHR~haS~~gi~~~lps~aG~ 171 (400) .. -|--.-+|=+---+|-|+||.-+.+.=-| +|-..|| T Consensus 100 DVTAF~YEYGQ~PG~PA~y~AD~KLI~~~E~~~~EL~~i~~~~G~I~~GD~F~N~~~~~~~~R~-------~F~~~~A-- 170 (229) T TIGR01704 100 DVTAFDYEYGQLPGLPAAYKADDKLIAIAEEAVAELDLINVVKGLIVSGDAFINDSKRLEKVRA-------RFSDLIA-- 170 (229) T ss_pred CEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCEEECCCHHHHHHHH-------HHHHHHH-- T ss_conf 0132055447888885123333689999999987652343320225625200067324888852-------0024534-- Q ss_pred HHHHHHHHHHHHCCCCCCCEEEEEEECCHHHHHH Q ss_conf 5565777653310166787189996078025578 Q gi|254780653|r 172 AMQKELSMLESCFSESKKPLVAIVGGSKVSTKIT 205 (400) Q Consensus 172 l~ekEi~~L~~~l~~~~~P~v~IiGGaKisdKi~ 205 (400) +|=|=.+.-++=-+.+-|+|+|=.=|-|.||=. T Consensus 171 -VEMEA~A~A~VCH~F~~PFVV~RA~SD~A~~~S 203 (229) T TIGR01704 171 -VEMEAAAVAQVCHNFKVPFVVVRALSDVADKES 203 (229) T ss_pred -HHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCC T ss_conf -676788997752058998689973000027746 No 213 >PRK03731 aroL shikimate kinase II; Reviewed Probab=20.01 E-value=24 Score=16.18 Aligned_cols=33 Identities=12% Similarity=0.030 Sum_probs=17.2 Q ss_pred EEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 099991377888984733224788887643101335456663 Q gi|254780653|r 55 KVVIFSHLGRPQSKSDKDCSLFKVVSIAESILHKNILFVNDC 96 (400) Q Consensus 55 kvii~SH~GRPkg~~~~~~Sl~~v~~~l~~~l~~~V~fi~d~ 96 (400) .|+|+...|- || .+ +.+.|++.|+.+..=.|+. T Consensus 4 ~I~LiG~mGs--GK----st---iGk~LA~~L~~~fiD~D~~ 36 (172) T PRK03731 4 PLFLVGPRGC--GK----TT---VGMALAQALGYRFVDTDLW 36 (172) T ss_pred CEEEECCCCC--CH----HH---HHHHHHHHHCCCEEECCHH T ss_conf 8899889999--88----99---9999999859997978699 Done!