Query gi|254780889|ref|YP_003065302.1| inosine 5'-monophosphate dehydrogenase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 493 No_of_seqs 261 out of 3061 Neff 5.6 Searched_HMMs 39220 Date Mon May 30 02:47:26 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780889.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01302 IMP_dehydrog inosine 100.0 0 0 1397.2 36.4 448 13-460 1-476 (476) 2 PRK05567 inositol-5'-monophosp 100.0 0 0 1327.1 41.8 484 1-491 1-485 (486) 3 PTZ00314 inosine-5'-monophosph 100.0 0 0 1295.2 42.0 471 10-483 13-492 (499) 4 PRK07107 inositol-5-monophosph 100.0 0 0 1274.0 40.9 476 1-486 1-497 (497) 5 PRK07807 inositol-5-monophosph 100.0 0 0 1263.4 40.6 468 2-478 1-475 (479) 6 pfam00478 IMPDH IMP dehydrogen 100.0 0 0 1242.0 41.3 463 12-478 1-467 (467) 7 KOG2550 consensus 100.0 0 0 1050.0 30.3 469 11-482 27-502 (503) 8 PRK06843 inositol-5-monophosph 100.0 0 0 1020.5 33.4 401 5-481 1-401 (404) 9 cd00381 IMPDH IMPDH: The catal 100.0 0 0 835.2 31.3 325 13-467 1-325 (325) 10 PRK05096 guanosine 5'-monophos 100.0 0 0 782.8 31.8 336 2-479 1-345 (347) 11 PRK08649 inositol-5-monophosph 100.0 0 0 766.3 21.5 334 3-466 4-366 (368) 12 PRK05458 guanosine 5'-monophos 100.0 0 0 737.3 29.6 318 12-476 3-325 (326) 13 TIGR01303 IMP_DH_rel_1 IMP deh 100.0 0 0 730.3 26.3 469 3-479 1-475 (476) 14 TIGR01305 GMP_reduct_1 guanosi 100.0 0 0 462.1 19.3 324 12-470 6-338 (343) 15 TIGR01304 IMP_DH_rel_2 IMP deh 100.0 0 0 438.1 18.5 333 4-460 3-371 (376) 16 TIGR01306 GMP_reduct_2 guanosi 100.0 0 0 384.0 16.8 310 14-470 2-316 (321) 17 COG0516 GuaB IMP dehydrogenase 100.0 2.6E-31 6.6E-36 244.0 4.6 91 1-91 2-92 (170) 18 cd04602 CBS_pair_IMPDH_2 This 99.9 2.1E-23 5.3E-28 187.3 10.1 110 99-208 1-114 (114) 19 cd04730 NPD_like 2-Nitropropan 99.9 1.8E-20 4.6E-25 166.3 17.7 196 45-377 1-197 (236) 20 cd04596 CBS_pair_DRTGG_assoc T 99.9 1.1E-21 2.8E-26 175.0 10.0 108 99-208 1-108 (108) 21 cd04594 CBS_pair_EriC_assoc_ar 99.9 1.7E-21 4.3E-26 173.7 9.6 103 101-208 2-104 (104) 22 cd04635 CBS_pair_22 The CBS do 99.9 2.2E-21 5.6E-26 172.8 9.5 108 100-208 1-122 (122) 23 cd04610 CBS_pair_ParBc_assoc T 99.9 2.8E-21 7.2E-26 172.0 10.0 106 100-208 2-107 (107) 24 cd04601 CBS_pair_IMPDH This cd 99.9 2.8E-21 7.1E-26 172.1 9.8 109 99-208 1-110 (110) 25 cd04617 CBS_pair_4 The CBS dom 99.9 2.6E-21 6.6E-26 172.3 9.3 107 101-208 2-118 (118) 26 cd04620 CBS_pair_7 The CBS dom 99.8 6.1E-21 1.6E-25 169.6 9.9 108 100-208 1-115 (115) 27 cd04632 CBS_pair_19 The CBS do 99.8 4.9E-21 1.3E-25 170.3 9.4 108 100-208 1-128 (128) 28 cd04583 CBS_pair_ABC_OpuCA_ass 99.8 5.4E-21 1.4E-25 170.0 9.3 109 99-208 1-109 (109) 29 cd04638 CBS_pair_25 The CBS do 99.8 7.6E-21 1.9E-25 168.9 9.8 106 100-208 1-106 (106) 30 cd04619 CBS_pair_6 The CBS dom 99.8 8.7E-21 2.2E-25 168.5 9.6 106 102-208 3-114 (114) 31 cd04599 CBS_pair_GGDEF_assoc2 99.8 1.1E-20 2.8E-25 167.8 9.5 104 100-207 1-104 (105) 32 cd04629 CBS_pair_16 The CBS do 99.8 1.3E-20 3.3E-25 167.3 9.6 107 100-208 1-114 (114) 33 cd04631 CBS_pair_18 The CBS do 99.8 1.2E-20 3.2E-25 167.4 9.3 109 100-208 1-125 (125) 34 cd04621 CBS_pair_8 The CBS dom 99.8 1.4E-20 3.7E-25 167.0 9.7 106 100-207 1-134 (135) 35 cd04600 CBS_pair_HPP_assoc Thi 99.8 1.6E-20 4.2E-25 166.5 9.4 108 100-208 2-124 (124) 36 cd04587 CBS_pair_CAP-ED_DUF294 99.8 2.1E-20 5.4E-25 165.8 9.5 107 100-208 1-113 (113) 37 cd04615 CBS_pair_2 The CBS dom 99.8 2.5E-20 6.4E-25 165.2 9.6 107 101-208 2-113 (113) 38 cd04605 CBS_pair_MET2_assoc Th 99.8 2.1E-20 5.4E-25 165.8 8.5 108 100-208 2-110 (110) 39 cd04584 CBS_pair_ACT_assoc Thi 99.8 3.6E-20 9.2E-25 164.1 9.6 107 100-208 1-121 (121) 40 cd04637 CBS_pair_24 The CBS do 99.8 3.8E-20 9.6E-25 164.0 9.5 106 100-207 1-121 (122) 41 cd04636 CBS_pair_23 The CBS do 99.8 3.8E-20 9.7E-25 163.9 9.5 107 100-208 1-132 (132) 42 cd04582 CBS_pair_ABC_OpuCA_ass 99.8 4.4E-20 1.1E-24 163.5 9.8 105 100-207 1-105 (106) 43 cd04588 CBS_pair_CAP-ED_DUF294 99.8 4.9E-20 1.3E-24 163.1 10.0 107 100-208 1-110 (110) 44 cd04585 CBS_pair_ACT_assoc2 Th 99.8 3.1E-20 7.8E-25 164.6 8.9 107 100-208 1-122 (122) 45 cd04624 CBS_pair_11 The CBS do 99.8 5.2E-20 1.3E-24 163.0 9.8 107 100-207 1-111 (112) 46 cd04622 CBS_pair_9 The CBS dom 99.8 4.1E-20 1.1E-24 163.7 9.1 107 100-208 1-113 (113) 47 cd04586 CBS_pair_BON_assoc Thi 99.8 5E-20 1.3E-24 163.1 9.5 107 100-208 2-135 (135) 48 cd04593 CBS_pair_EriC_assoc_ba 99.8 5.9E-20 1.5E-24 162.6 9.8 107 101-208 2-115 (115) 49 cd04612 CBS_pair_SpoIVFB_EriC_ 99.8 1.2E-19 3E-24 160.5 10.0 107 100-208 1-111 (111) 50 pfam03060 NPD 2-nitropropane d 99.8 2.1E-18 5.2E-23 151.5 16.3 233 36-379 1-235 (330) 51 cd04643 CBS_pair_30 The CBS do 99.8 8.8E-20 2.3E-24 161.3 9.2 106 100-208 1-116 (116) 52 cd04639 CBS_pair_26 The CBS do 99.8 1.3E-19 3.2E-24 160.2 9.8 107 100-207 1-110 (111) 53 cd04613 CBS_pair_SpoIVFB_EriC_ 99.8 1.3E-19 3.3E-24 160.1 9.8 108 100-208 1-114 (114) 54 cd04625 CBS_pair_12 The CBS do 99.8 9.6E-20 2.4E-24 161.1 8.9 105 101-208 2-112 (112) 55 cd04623 CBS_pair_10 The CBS do 99.8 1.1E-19 2.7E-24 160.8 8.9 107 100-208 1-113 (113) 56 cd04626 CBS_pair_13 The CBS do 99.8 1.7E-19 4.4E-24 159.2 9.6 107 100-208 1-111 (111) 57 cd04642 CBS_pair_29 The CBS do 99.8 2.2E-19 5.6E-24 158.5 10.0 108 100-208 1-126 (126) 58 cd04607 CBS_pair_NTP_transfera 99.8 1.6E-19 4.1E-24 159.4 9.3 107 101-208 3-113 (113) 59 cd04634 CBS_pair_21 The CBS do 99.8 1.7E-19 4.3E-24 159.3 9.3 105 100-207 1-142 (143) 60 cd04803 CBS_pair_15 The CBS do 99.8 1.8E-19 4.5E-24 159.2 9.4 106 102-208 3-122 (122) 61 cd04611 CBS_pair_PAS_GGDEF_DUF 99.8 2E-19 5E-24 158.8 9.2 107 100-208 1-111 (111) 62 cd04630 CBS_pair_17 The CBS do 99.8 1.9E-19 4.8E-24 158.9 9.0 108 100-208 1-114 (114) 63 cd04595 CBS_pair_DHH_polyA_Pol 99.8 2.8E-19 7.2E-24 157.7 9.7 105 101-208 3-110 (110) 64 cd04641 CBS_pair_28 The CBS do 99.8 3.5E-19 9E-24 157.0 9.6 107 100-207 1-119 (120) 65 cd04614 CBS_pair_1 The CBS dom 99.8 2.5E-19 6.4E-24 158.0 8.8 95 100-207 1-95 (96) 66 cd04633 CBS_pair_20 The CBS do 99.8 5.2E-19 1.3E-23 155.8 9.7 105 100-207 1-120 (121) 67 cd04604 CBS_pair_KpsF_GutQ_ass 99.8 5.1E-19 1.3E-23 155.8 9.3 107 101-208 3-114 (114) 68 cd04802 CBS_pair_3 The CBS dom 99.8 5.3E-19 1.4E-23 155.7 8.9 105 100-207 1-111 (112) 69 PRK05437 isopentenyl pyrophosp 99.8 1.1E-17 2.8E-22 146.3 15.4 147 260-462 173-335 (351) 70 cd04800 CBS_pair_CAP-ED_DUF294 99.8 1.1E-18 2.8E-23 153.4 8.9 106 100-208 1-111 (111) 71 cd04589 CBS_pair_CAP-ED_DUF294 99.8 3.1E-18 7.9E-23 150.2 9.6 105 100-207 1-110 (111) 72 cd04801 CBS_pair_M50_like This 99.8 3.8E-18 9.8E-23 149.6 9.8 108 100-208 1-114 (114) 73 cd04627 CBS_pair_14 The CBS do 99.8 5.8E-18 1.5E-22 148.3 9.7 105 102-206 3-121 (123) 74 pfam01070 FMN_dh FMN-dependent 99.8 7.6E-17 1.9E-21 140.2 15.2 167 232-461 125-297 (301) 75 TIGR03151 enACPred_II putative 99.8 1.3E-16 3.2E-21 138.7 16.3 200 36-377 1-202 (307) 76 cd04603 CBS_pair_KefB_assoc Th 99.8 7.2E-18 1.8E-22 147.6 9.8 107 101-208 2-111 (111) 77 COG2524 Predicted transcriptio 99.7 6.2E-18 1.6E-22 148.1 8.3 114 94-209 174-290 (294) 78 cd04609 CBS_pair_PALP_assoc2 T 99.7 1.3E-17 3.4E-22 145.7 9.2 104 101-208 2-110 (110) 79 cd02811 IDI-2_FMN Isopentenyl- 99.7 3.4E-16 8.8E-21 135.6 15.6 145 260-460 165-326 (326) 80 cd04640 CBS_pair_27 The CBS do 99.7 1.8E-17 4.5E-22 144.8 8.9 107 101-208 2-126 (126) 81 COG4109 Predicted transcriptio 99.7 1.6E-17 4E-22 145.2 8.4 151 44-211 155-307 (432) 82 cd02809 alpha_hydroxyacid_oxid 99.7 9E-16 2.3E-20 132.6 14.2 164 233-459 133-298 (299) 83 cd04606 CBS_pair_Mg_transporte 99.7 8.5E-17 2.2E-21 139.9 8.1 103 104-208 1-108 (109) 84 COG2905 Predicted signal-trans 99.7 1.7E-16 4.3E-21 137.8 9.3 157 89-250 149-312 (610) 85 cd04598 CBS_pair_GGDEF_assoc T 99.7 1.8E-16 4.7E-21 137.5 7.8 106 101-208 2-119 (119) 86 cd02922 FCB2_FMN Flavocytochro 99.7 4.6E-14 1.2E-18 120.3 18.6 296 15-459 35-342 (344) 87 PRK11543 gutQ D-arabinose 5-ph 99.6 4.7E-16 1.2E-20 134.6 7.7 146 56-209 161-319 (321) 88 cd04737 LOX_like_FMN L-Lactate 99.6 1.3E-13 3.2E-18 117.2 19.0 296 14-460 42-348 (351) 89 cd04736 MDH_FMN Mandelate dehy 99.6 1E-13 2.6E-18 117.8 17.9 309 14-459 34-360 (361) 90 cd04590 CBS_pair_CorC_HlyC_ass 99.6 3.7E-15 9.4E-20 128.2 9.0 107 100-207 1-110 (111) 91 cd04591 CBS_pair_EriC_assoc_eu 99.6 2.9E-15 7.4E-20 128.9 8.2 102 100-208 2-105 (105) 92 COG3448 CBS-domain-containing 99.6 3.8E-15 9.7E-20 128.1 7.7 149 58-208 204-370 (382) 93 PRK11197 lldD L-lactate dehydr 99.6 4.7E-13 1.2E-17 113.0 18.4 323 14-460 40-372 (381) 94 cd04608 CBS_pair_PALP_assoc Th 99.6 4.4E-15 1.1E-19 127.6 7.8 108 100-208 2-123 (124) 95 PRK01862 putative voltage-gate 99.6 1.7E-14 4.3E-19 123.4 10.4 113 95-208 459-578 (583) 96 PRK10892 D-arabinose 5-phospha 99.6 7.6E-15 1.9E-19 125.9 8.4 147 56-209 166-324 (326) 97 TIGR02151 IPP_isom_2 isopenten 99.6 1.4E-13 3.7E-18 116.8 14.2 281 12-465 22-347 (349) 98 cd03332 LMO_FMN L-Lactate 2-mo 99.6 7E-13 1.8E-17 111.8 16.8 312 14-460 55-380 (383) 99 cd02205 CBS_pair The CBS domai 99.5 1.2E-13 3E-18 117.4 9.1 108 100-208 1-113 (113) 100 COG1304 idi Isopentenyl diphos 99.5 4.3E-12 1.1E-16 106.2 15.9 143 261-463 206-348 (360) 101 TIGR03520 GldE gliding motilit 99.4 2.2E-12 5.6E-17 108.3 11.2 144 57-209 159-310 (408) 102 COG0516 GuaB IMP dehydrogenase 99.4 9.1E-14 2.3E-18 118.2 3.3 169 262-481 2-170 (170) 103 cd04592 CBS_pair_EriC_assoc_eu 99.4 1.8E-12 4.7E-17 108.8 7.9 93 101-194 2-119 (133) 104 COG2070 Dioxygenases related t 99.4 1.9E-11 4.8E-16 101.6 11.9 217 35-377 3-225 (336) 105 COG3620 Predicted transcriptio 99.3 2.9E-12 7.5E-17 107.3 7.0 111 94-209 67-183 (187) 106 COG0517 FOG: CBS domain [Gener 99.3 8.1E-12 2.1E-16 104.2 8.9 105 100-206 7-117 (117) 107 COG2239 MgtE Mg/Co/Ni transpor 99.3 2.4E-11 6.2E-16 100.8 9.2 201 78-289 114-335 (451) 108 PRK08318 dihydropyrimidine deh 99.2 1.4E-08 3.5E-13 81.0 21.4 173 230-463 110-320 (413) 109 PRK11573 hypothetical protein; 99.2 1.1E-10 2.8E-15 96.0 10.5 150 53-209 150-310 (413) 110 TIGR01186 proV glycine betaine 99.2 2.9E-11 7.3E-16 100.3 7.1 168 42-210 179-362 (372) 111 PRK07565 dihydroorotate dehydr 99.2 7.1E-09 1.8E-13 83.1 19.2 179 231-470 113-313 (333) 112 cd04739 DHOD_like Dihydroorota 99.2 2.2E-09 5.7E-14 86.7 16.7 174 231-465 111-306 (325) 113 TIGR00393 kpsF sugar isomerase 99.2 1.9E-11 4.7E-16 101.6 6.0 120 77-200 133-271 (272) 114 PRK10070 glycine betaine trans 99.2 5.7E-11 1.4E-15 98.1 8.3 124 85-210 262-393 (400) 115 COG1253 TlyC Hemolysins and re 99.2 2.4E-10 6.2E-15 93.6 10.1 128 82-209 191-327 (429) 116 KOG0538 consensus 99.1 9E-10 2.3E-14 89.5 10.5 300 16-462 40-352 (363) 117 KOG1764 consensus 99.0 6.1E-10 1.6E-14 90.7 5.7 108 101-209 238-358 (381) 118 cd04597 CBS_pair_DRTGG_assoc2 98.9 1E-09 2.6E-14 89.1 5.1 61 148-208 53-113 (113) 119 PRK05567 inositol-5'-monophosp 98.9 6.4E-08 1.6E-12 76.2 13.7 241 96-365 37-297 (486) 120 PTZ00314 inosine-5'-monophosph 98.9 4.9E-08 1.3E-12 77.0 12.8 49 158-206 100-151 (499) 121 cd04729 NanE N-acetylmannosami 98.9 1.7E-08 4.4E-13 80.4 10.4 128 234-370 81-211 (219) 122 PRK07807 inositol-5-monophosph 98.9 4.6E-08 1.2E-12 77.3 12.5 188 158-365 94-296 (479) 123 PRK07107 inositol-5-monophosph 98.9 2E-08 5.2E-13 79.8 10.5 187 158-365 102-312 (497) 124 PRK01130 N-acetylmannosamine-6 98.8 4.4E-08 1.1E-12 77.4 10.3 127 234-370 77-208 (222) 125 pfam04131 NanE Putative N-acet 98.8 5.1E-08 1.3E-12 76.9 10.1 125 234-370 53-178 (192) 126 TIGR01302 IMP_dehydrog inosine 98.8 3.4E-07 8.6E-12 71.1 13.9 242 96-365 30-308 (476) 127 cd02808 GltS_FMN Glutamate syn 98.8 5.2E-08 1.3E-12 76.9 9.4 190 242-460 180-386 (392) 128 pfam00478 IMPDH IMP dehydrogen 98.8 3.8E-07 9.7E-12 70.7 12.9 187 97-302 148-355 (467) 129 cd04740 DHOD_1B_like Dihydroor 98.7 6.7E-07 1.7E-11 68.9 13.2 169 232-462 102-296 (296) 130 KOG0475 consensus 98.7 6.7E-08 1.7E-12 76.1 7.6 171 35-210 488-695 (696) 131 PRK07259 dihydroorotate dehydr 98.7 1E-06 2.6E-11 67.7 13.4 169 232-462 104-299 (301) 132 TIGR01137 cysta_beta cystathio 98.7 9.8E-08 2.5E-12 74.9 8.1 114 93-208 367-525 (527) 133 cd04722 TIM_phosphate_binding 98.7 3.6E-07 9.1E-12 70.9 10.9 123 237-366 76-200 (200) 134 TIGR02708 L_lactate_ox L-lacta 98.7 1.7E-07 4.2E-12 73.3 8.9 129 265-460 221-356 (368) 135 cd04641 CBS_pair_28 The CBS do 98.7 8.2E-08 2.1E-12 75.5 7.1 49 161-209 1-49 (120) 136 COG0069 GltB Glutamate synthas 98.7 1.3E-07 3.3E-12 74.0 7.9 180 260-467 289-482 (485) 137 COG3448 CBS-domain-containing 98.6 5.1E-08 1.3E-12 77.0 4.9 113 152-264 244-364 (382) 138 pfam01180 DHO_dh Dihydroorotat 98.6 5.5E-06 1.4E-10 62.3 15.3 135 232-368 106-271 (290) 139 cd04743 NPD_PKS 2-Nitropropane 98.6 1.1E-06 2.8E-11 67.3 11.5 48 46-93 2-51 (320) 140 cd04635 CBS_pair_22 The CBS do 98.6 1.9E-07 4.9E-12 72.8 7.4 49 161-209 1-49 (122) 141 cd04627 CBS_pair_14 The CBS do 98.6 8.5E-08 2.2E-12 75.4 5.5 97 41-147 22-122 (123) 142 cd04621 CBS_pair_8 The CBS dom 98.6 2.2E-07 5.6E-12 72.4 7.6 51 161-211 1-51 (135) 143 cd04643 CBS_pair_30 The CBS do 98.6 1.9E-07 4.8E-12 72.9 7.2 50 161-210 1-50 (116) 144 pfam01645 Glu_synthase Conserv 98.6 1.2E-07 3E-12 74.3 6.2 207 215-452 145-366 (367) 145 cd04597 CBS_pair_DRTGG_assoc2 98.6 6.1E-08 1.6E-12 76.4 4.3 52 94-146 60-111 (113) 146 cd04632 CBS_pair_19 The CBS do 98.6 7.6E-08 1.9E-12 75.7 4.6 51 161-211 1-51 (128) 147 smart00116 CBS Domain in cysta 98.6 7.3E-08 1.8E-12 75.8 4.4 46 163-208 2-47 (49) 148 cd04618 CBS_pair_5 The CBS dom 98.6 4.6E-07 1.2E-11 70.1 8.4 94 102-207 3-97 (98) 149 cd04613 CBS_pair_SpoIVFB_EriC_ 98.6 4.4E-07 1.1E-11 70.3 8.3 49 161-209 1-49 (114) 150 cd04605 CBS_pair_MET2_assoc Th 98.5 2.9E-07 7.5E-12 71.5 7.0 51 160-210 1-51 (110) 151 cd04619 CBS_pair_6 The CBS dom 98.5 1.2E-07 3.2E-12 74.2 5.0 82 47-145 27-111 (114) 152 cd04584 CBS_pair_ACT_assoc Thi 98.5 3.2E-07 8.3E-12 71.2 7.1 49 161-209 1-49 (121) 153 cd04637 CBS_pair_24 The CBS do 98.5 1.4E-07 3.6E-12 73.7 5.2 91 46-145 26-119 (122) 154 cd04600 CBS_pair_HPP_assoc Thi 98.5 1E-07 2.6E-12 74.8 4.4 50 160-209 1-50 (124) 155 cd04582 CBS_pair_ABC_OpuCA_ass 98.5 6.3E-07 1.6E-11 69.1 8.3 49 161-209 1-49 (106) 156 cd04640 CBS_pair_27 The CBS do 98.5 3.3E-07 8.5E-12 71.1 6.7 48 162-209 2-49 (126) 157 cd04594 CBS_pair_EriC_assoc_ar 98.5 4.1E-07 1E-11 70.4 7.1 95 48-146 3-102 (104) 158 KOG0474 consensus 98.5 1.7E-07 4.3E-12 73.2 5.1 171 32-209 521-746 (762) 159 cd04614 CBS_pair_1 The CBS dom 98.5 7.7E-07 2E-11 68.5 8.3 48 161-208 1-48 (96) 160 cd04629 CBS_pair_16 The CBS do 98.5 6.2E-07 1.6E-11 69.2 7.7 49 161-209 1-49 (114) 161 cd04615 CBS_pair_2 The CBS dom 98.5 3.2E-07 8.1E-12 71.3 6.1 47 162-208 2-48 (113) 162 cd04586 CBS_pair_BON_assoc Thi 98.5 1.8E-07 4.6E-12 73.0 4.8 52 160-211 1-52 (135) 163 cd04624 CBS_pair_11 The CBS do 98.5 9E-07 2.3E-11 68.0 8.2 50 161-210 1-50 (112) 164 cd04638 CBS_pair_25 The CBS do 98.5 4.8E-07 1.2E-11 69.9 6.7 48 161-208 1-48 (106) 165 cd04803 CBS_pair_15 The CBS do 98.5 7.7E-07 2E-11 68.5 7.6 50 161-210 1-50 (122) 166 cd04636 CBS_pair_23 The CBS do 98.5 9.5E-07 2.4E-11 67.8 8.1 50 161-210 1-50 (132) 167 smart00116 CBS Domain in cysta 98.5 2.4E-07 6.1E-12 72.1 5.0 45 101-146 1-45 (49) 168 COG4535 CorC Putative Mg2+ and 98.5 3.9E-07 1E-11 70.6 6.0 117 92-209 67-188 (293) 169 cd04617 CBS_pair_4 The CBS dom 98.5 2E-07 5.2E-12 72.7 4.5 47 162-208 2-48 (118) 170 PRK06857 consensus 98.5 5.9E-06 1.5E-10 62.2 12.0 131 171-364 1-131 (209) 171 KOG1764 consensus 98.5 8E-07 2.1E-11 68.4 7.5 112 99-210 160-286 (381) 172 cd04642 CBS_pair_29 The CBS do 98.4 7.2E-07 1.8E-11 68.7 7.1 48 161-208 1-48 (126) 173 cd04583 CBS_pair_ABC_OpuCA_ass 98.4 1.3E-06 3.4E-11 66.8 8.4 49 161-209 2-50 (109) 174 cd04593 CBS_pair_EriC_assoc_ba 98.4 1.3E-06 3.2E-11 67.0 8.2 48 162-209 2-49 (115) 175 COG4536 CorB Putative Mg2+ and 98.4 4.2E-07 1.1E-11 70.4 5.6 112 102-213 212-327 (423) 176 COG2524 Predicted transcriptio 98.4 9.1E-07 2.3E-11 68.0 7.3 115 135-253 144-273 (294) 177 cd04604 CBS_pair_KpsF_GutQ_ass 98.4 1.4E-06 3.6E-11 66.6 8.2 49 96-145 63-111 (114) 178 cd04608 CBS_pair_PALP_assoc Th 98.4 3.6E-07 9.1E-12 70.9 4.7 54 161-214 2-55 (124) 179 cd04602 CBS_pair_IMPDH_2 This 98.4 2E-06 5E-11 65.6 8.3 47 161-207 2-51 (114) 180 cd04631 CBS_pair_18 The CBS do 98.4 1.5E-06 3.9E-11 66.4 7.6 49 161-209 1-50 (125) 181 cd04639 CBS_pair_26 The CBS do 98.4 1.5E-06 3.9E-11 66.4 7.5 48 162-209 2-49 (111) 182 PRK07455 keto-hydroxyglutarate 98.4 1.1E-05 2.9E-10 60.2 11.5 108 232-364 25-132 (210) 183 cd04633 CBS_pair_20 The CBS do 98.4 2.2E-06 5.5E-11 65.3 7.8 48 161-209 1-48 (121) 184 cd04585 CBS_pair_ACT_assoc2 Th 98.3 2E-06 5.1E-11 65.5 7.3 48 161-209 1-48 (122) 185 PRK05718 keto-hydroxyglutarate 98.3 1.3E-05 3.4E-10 59.6 11.4 108 232-364 27-134 (212) 186 cd04610 CBS_pair_ParBc_assoc T 98.3 2.1E-06 5.3E-11 65.4 7.2 48 160-208 1-48 (107) 187 cd02810 DHOD_DHPD_FMN Dihydroo 98.3 4.4E-06 1.1E-10 63.0 8.9 137 231-368 110-275 (289) 188 PRK08104 consensus 98.3 1.5E-05 3.9E-10 59.2 11.6 108 232-364 27-134 (212) 189 PRK06015 keto-hydroxyglutarate 98.3 1.8E-05 4.6E-10 58.7 11.7 108 232-364 27-134 (212) 190 cd04588 CBS_pair_CAP-ED_DUF294 98.3 1.3E-06 3.4E-11 66.8 5.9 47 161-208 1-47 (110) 191 cd04634 CBS_pair_21 The CBS do 98.3 6.9E-07 1.7E-11 68.8 4.3 49 161-210 1-49 (143) 192 cd04734 OYE_like_3_FMN Old yel 98.3 5.8E-06 1.5E-10 62.2 8.8 141 233-373 142-323 (343) 193 cd04599 CBS_pair_GGDEF_assoc2 98.3 4.3E-06 1.1E-10 63.1 8.1 46 98-145 57-102 (105) 194 cd04623 CBS_pair_10 The CBS do 98.3 9.2E-07 2.4E-11 67.9 4.6 49 161-209 1-49 (113) 195 cd04612 CBS_pair_SpoIVFB_EriC_ 98.3 2.4E-06 6.2E-11 64.9 6.8 48 161-209 1-48 (111) 196 cd04626 CBS_pair_13 The CBS do 98.3 2.3E-06 5.9E-11 65.1 6.4 47 97-145 62-108 (111) 197 cd04625 CBS_pair_12 The CBS do 98.3 1.1E-06 2.8E-11 67.4 4.6 73 56-146 38-110 (112) 198 cd04607 CBS_pair_NTP_transfera 98.3 3.9E-06 1E-10 63.4 7.3 46 163-208 4-49 (113) 199 cd04609 CBS_pair_PALP_assoc2 T 98.3 3.4E-06 8.7E-11 63.9 6.8 48 162-210 2-49 (110) 200 cd04595 CBS_pair_DHH_polyA_Pol 98.2 7.3E-06 1.9E-10 61.5 8.4 50 160-210 1-50 (110) 201 PRK08782 consensus 98.2 2.1E-05 5.5E-10 58.1 10.7 108 232-364 29-136 (219) 202 cd02911 arch_FMN Archeal FMN-b 98.2 0.00018 4.7E-09 51.4 15.4 35 48-82 1-36 (233) 203 PRK03220 consensus 98.2 1.7E-05 4.2E-10 58.9 10.0 186 171-378 32-245 (257) 204 cd04587 CBS_pair_CAP-ED_DUF294 98.2 1.8E-06 4.6E-11 65.9 5.0 49 96-145 62-110 (113) 205 cd04601 CBS_pair_IMPDH This cd 98.2 1.7E-06 4.3E-11 66.0 4.9 48 161-208 2-49 (110) 206 PRK06552 keto-hydroxyglutarate 98.2 3.8E-05 9.8E-10 56.3 11.8 108 232-364 25-135 (209) 207 cd04620 CBS_pair_7 The CBS dom 98.2 1.6E-06 4E-11 66.3 4.6 49 161-209 1-49 (115) 208 cd04596 CBS_pair_DRTGG_assoc T 98.2 1.6E-06 4E-11 66.2 4.5 48 161-208 2-49 (108) 209 cd04630 CBS_pair_17 The CBS do 98.2 3.9E-06 1E-10 63.4 6.5 87 41-146 22-112 (114) 210 PRK01033 imidazole glycerol ph 98.2 1.5E-05 3.9E-10 59.2 9.3 176 171-369 31-230 (253) 211 cd02930 DCR_FMN 2,4-dienoyl-Co 98.2 1.2E-05 3.1E-10 59.9 8.7 138 233-373 138-314 (353) 212 cd04592 CBS_pair_EriC_assoc_eu 98.2 1.8E-06 4.5E-11 65.9 4.4 48 161-208 1-48 (133) 213 pfam00571 CBS CBS domain. CBS 98.2 2E-06 5E-11 65.6 4.6 55 155-209 1-55 (57) 214 PRK08904 consensus 98.2 2.7E-05 6.9E-10 57.4 10.3 108 232-364 22-129 (207) 215 cd00452 KDPG_aldolase KDPG and 98.2 2.1E-05 5.3E-10 58.2 9.7 108 232-364 16-123 (190) 216 cd04622 CBS_pair_9 The CBS dom 98.2 2.6E-06 6.8E-11 64.6 4.8 49 96-145 62-110 (113) 217 PRK13597 imidazole glycerol ph 98.2 5.5E-05 1.4E-09 55.2 11.3 182 171-375 32-235 (252) 218 cd04603 CBS_pair_KefB_assoc Th 98.1 2.8E-06 7.1E-11 64.5 4.5 47 162-208 2-48 (111) 219 cd04589 CBS_pair_CAP-ED_DUF294 98.1 1.3E-05 3.2E-10 59.8 7.7 48 97-146 62-109 (111) 220 cd04801 CBS_pair_M50_like This 98.1 1.2E-05 3.1E-10 59.9 7.4 48 161-208 1-49 (114) 221 PRK02506 dihydroorotate dehydr 98.1 0.00019 4.8E-09 51.4 13.4 172 233-465 106-308 (308) 222 cd04606 CBS_pair_Mg_transporte 98.1 3.2E-06 8.3E-11 64.0 4.3 50 96-146 57-106 (109) 223 pfam01081 Aldolase KDPG and KH 98.1 3.7E-05 9.4E-10 56.4 9.7 108 232-364 20-127 (196) 224 PRK11543 gutQ D-arabinose 5-ph 98.1 7E-06 1.8E-10 61.6 6.0 156 14-210 87-256 (321) 225 COG3010 NanE Putative N-acetyl 98.1 4E-05 1E-09 56.2 9.8 129 234-373 87-217 (229) 226 TIGR03572 WbuZ glycosyl amidat 98.1 7.8E-05 2E-09 54.1 11.3 177 171-369 31-231 (232) 227 TIGR01182 eda 2-dehydro-3-deox 98.1 3.3E-05 8.4E-10 56.8 9.3 106 232-364 20-128 (205) 228 cd02931 ER_like_FMN Enoate red 98.1 2.9E-05 7.4E-10 57.2 8.9 139 233-373 151-343 (382) 229 PRK09140 2-dehydro-3-deoxy-6-p 98.1 2.3E-05 5.7E-10 58.0 8.3 109 231-364 21-130 (206) 230 cd04611 CBS_pair_PAS_GGDEF_DUF 98.1 4.5E-06 1.1E-10 63.0 4.5 47 98-145 62-108 (111) 231 cd04731 HisF The cyclase subun 98.1 5.5E-05 1.4E-09 55.2 10.0 179 171-371 28-229 (243) 232 cd04802 CBS_pair_3 The CBS dom 98.1 5.1E-06 1.3E-10 62.6 4.7 46 98-145 64-109 (112) 233 PRK05211 consensus 98.1 6.2E-05 1.6E-09 54.8 9.9 177 171-369 22-226 (248) 234 COG0167 PyrD Dihydroorotate de 98.1 0.00044 1.1E-08 48.7 14.3 170 232-462 109-307 (310) 235 cd04735 OYE_like_4_FMN Old yel 98.0 3.6E-05 9.1E-10 56.5 8.6 136 233-373 145-321 (353) 236 PRK01659 consensus 98.0 9E-05 2.3E-09 53.6 10.5 184 171-377 31-239 (252) 237 cd04800 CBS_pair_CAP-ED_DUF294 98.0 2E-05 5.2E-10 58.3 7.0 47 161-208 1-47 (111) 238 PRK07114 keto-hydroxyglutarate 98.0 0.00019 4.9E-09 51.3 12.0 110 233-364 29-139 (223) 239 PRK02083 imidazole glycerol ph 98.0 8.8E-05 2.3E-09 53.7 10.2 185 171-377 31-239 (253) 240 PRK00830 consensus 98.0 9.3E-05 2.4E-09 53.6 10.1 186 170-377 34-259 (273) 241 PRK01862 putative voltage-gate 98.0 8.4E-06 2.1E-10 61.0 4.6 59 151-209 454-512 (583) 242 cd02932 OYE_YqiM_FMN Old yello 98.0 6.8E-05 1.7E-09 54.5 9.3 133 233-370 155-325 (336) 243 PRK02621 consensus 98.0 8.1E-05 2.1E-09 54.0 9.5 184 171-377 31-240 (254) 244 COG0800 Eda 2-keto-3-deoxy-6-p 98.0 3.6E-05 9.2E-10 56.5 7.6 108 232-364 25-132 (211) 245 pfam00724 Oxidored_FMN NADH:fl 98.0 6.9E-05 1.8E-09 54.5 9.0 138 233-370 144-321 (336) 246 cd02803 OYE_like_FMN_family Ol 98.0 6.9E-05 1.8E-09 54.5 8.8 137 233-370 142-316 (327) 247 cd02940 DHPD_FMN Dihydropyrimi 98.0 7E-05 1.8E-09 54.5 8.8 136 232-369 113-285 (299) 248 PRK02747 consensus 97.9 0.00011 2.7E-09 53.1 9.4 183 171-376 31-241 (257) 249 PRK13587 1-(5-phosphoribosyl)- 97.9 0.00039 1E-08 49.1 12.1 177 170-368 31-224 (234) 250 pfam00977 His_biosynth Histidi 97.9 0.00032 8.1E-09 49.7 11.4 174 171-368 30-223 (229) 251 PRK13523 NADPH dehydrogenase N 97.9 9.7E-05 2.5E-09 53.4 8.8 134 233-373 143-313 (337) 252 cd02929 TMADH_HD_FMN Trimethyl 97.9 0.00025 6.4E-09 50.5 10.7 138 233-373 151-327 (370) 253 PRK02145 consensus 97.9 0.00012 3.2E-09 52.7 9.0 185 171-377 32-243 (257) 254 cd04591 CBS_pair_EriC_assoc_eu 97.9 2E-05 5.1E-10 58.4 4.9 48 97-146 56-103 (105) 255 PRK13585 1-(5-phosphoribosyl)- 97.9 0.00023 5.9E-09 50.7 10.1 176 170-368 32-224 (240) 256 cd04732 HisA HisA. Phosphorib 97.9 0.00036 9.2E-09 49.3 11.0 176 171-368 30-222 (234) 257 cd04598 CBS_pair_GGDEF_assoc T 97.8 2.2E-05 5.7E-10 58.0 4.6 50 161-211 1-51 (119) 258 cd04590 CBS_pair_CorC_HlyC_ass 97.8 6.1E-05 1.6E-09 54.9 6.8 49 162-210 2-51 (111) 259 PRK08255 salicylyl-CoA 5-hydro 97.8 0.00036 9.2E-09 49.3 10.7 131 233-368 557-725 (770) 260 cd04733 OYE_like_2_FMN Old yel 97.8 0.00017 4.3E-09 51.7 8.9 135 233-369 150-326 (338) 261 PRK04281 consensus 97.8 0.00036 9.3E-09 49.3 10.6 183 171-376 31-239 (254) 262 pfam01207 Dus Dihydrouridine s 97.8 0.00076 1.9E-08 47.0 12.0 100 259-367 108-215 (309) 263 cd04723 HisA_HisF Phosphoribos 97.8 0.00095 2.4E-08 46.3 12.3 178 171-369 36-222 (233) 264 PRK00748 1-(5-phosphoribosyl)- 97.8 0.0008 2E-08 46.8 11.7 175 170-368 29-224 (241) 265 PRK10415 tRNA-dihydrouridine s 97.7 0.0029 7.5E-08 42.8 13.5 73 286-367 153-226 (321) 266 PRK10892 D-arabinose 5-phospha 97.7 4.2E-05 1.1E-09 56.0 3.8 156 14-210 92-261 (326) 267 cd02801 DUS_like_FMN Dihydrour 97.6 0.0025 6.3E-08 43.3 12.4 99 259-367 109-215 (231) 268 cd02933 OYE_like_FMN Old yello 97.6 0.00045 1.1E-08 48.7 8.5 127 233-370 153-319 (338) 269 cd02205 CBS_pair The CBS domai 97.6 9.3E-05 2.4E-09 53.6 4.6 51 161-211 1-51 (113) 270 COG0042 tRNA-dihydrouridine sy 97.6 0.0089 2.3E-07 39.3 15.0 31 41-71 5-35 (323) 271 PRK04128 1-(5-phosphoribosyl)- 97.6 0.0005 1.3E-08 48.3 8.1 170 170-368 31-214 (228) 272 pfam00571 CBS CBS domain. CBS 97.6 0.00013 3.4E-09 52.5 5.1 50 96-146 3-52 (57) 273 PRK10550 tRNA-dihydrouridine s 97.6 0.0077 2E-07 39.8 14.1 30 48-77 2-31 (312) 274 PRK10070 glycine betaine trans 97.5 0.0018 4.5E-08 44.4 10.3 97 114-210 207-334 (400) 275 COG2905 Predicted signal-trans 97.5 8.5E-05 2.2E-09 53.9 3.3 57 153-209 149-205 (610) 276 PRK10415 tRNA-dihydrouridine s 97.5 0.0019 5E-08 44.1 9.9 201 41-302 4-224 (321) 277 PRK13586 1-(5-phosphoribosyl)- 97.4 0.0036 9.1E-08 42.2 11.1 172 171-368 30-220 (231) 278 PRK07028 bifunctional hexulose 97.4 0.0034 8.7E-08 42.3 10.6 160 232-400 118-315 (429) 279 cd00945 Aldolase_Class_I Class 97.4 0.008 2E-07 39.7 12.3 118 234-362 67-198 (201) 280 KOG2335 consensus 97.4 0.0029 7.3E-08 42.9 9.7 157 232-397 85-274 (358) 281 COG3620 Predicted transcriptio 97.4 0.00012 3E-09 52.8 2.6 56 154-210 66-121 (187) 282 KOG2550 consensus 97.3 0.0036 9.1E-08 42.2 10.0 232 45-300 115-381 (503) 283 COG0107 HisF Imidazoleglycerol 97.3 0.0015 3.8E-08 44.9 8.1 37 333-369 196-233 (256) 284 cd02911 arch_FMN Archeal FMN-b 97.3 0.0024 6.1E-08 43.4 9.0 121 229-366 81-221 (233) 285 PRK09426 methylmalonyl-CoA mut 97.3 0.0021 5.2E-08 43.9 8.5 12 15-26 30-41 (715) 286 TIGR01037 pyrD_sub1_fam dihydr 97.3 0.00027 6.9E-09 50.2 4.0 104 260-368 148-273 (308) 287 pfam01791 DeoC DeoC/LacD famil 97.3 0.0036 9.1E-08 42.2 9.4 114 238-361 80-219 (231) 288 COG4109 Predicted transcriptio 97.3 0.00041 1E-08 49.0 4.6 95 153-253 188-288 (432) 289 cd04738 DHOD_2_like Dihydrooro 97.3 0.003 7.6E-08 42.7 9.0 130 232-368 148-312 (327) 290 PRK06552 keto-hydroxyglutarate 97.3 0.0066 1.7E-07 40.3 10.6 122 220-369 66-187 (209) 291 pfam01081 Aldolase KDPG and KH 97.2 0.0049 1.2E-07 41.2 9.8 123 221-370 59-181 (196) 292 COG0106 HisA Phosphoribosylfor 97.2 0.011 2.8E-07 38.7 11.6 176 170-368 31-224 (241) 293 COG1902 NemA NADH:flavin oxido 97.2 0.0088 2.3E-07 39.4 11.0 135 234-372 151-325 (363) 294 COG0042 tRNA-dihydrouridine sy 97.2 0.016 4.1E-07 37.5 12.2 126 234-367 81-230 (323) 295 COG3010 NanE Putative N-acetyl 97.2 0.015 3.7E-07 37.8 11.9 92 194-302 110-209 (229) 296 PRK05286 dihydroorotate dehydr 97.2 0.0081 2.1E-07 39.6 10.2 136 232-368 154-317 (336) 297 PRK10605 N-ethylmaleimide redu 97.2 0.0087 2.2E-07 39.4 10.4 131 233-375 160-331 (362) 298 COG1253 TlyC Hemolysins and re 97.2 0.0027 6.9E-08 43.1 7.8 76 137-212 186-268 (429) 299 cd04741 DHOD_1A_like Dihydroor 97.1 0.0038 9.8E-08 42.0 8.4 133 236-369 110-276 (294) 300 PTZ00170 D-ribulose-5-phosphat 97.1 0.015 3.8E-07 37.7 11.3 125 235-372 76-205 (224) 301 PRK12330 oxaloacetate decarbox 97.1 0.0079 2E-07 39.7 9.9 135 169-314 96-244 (499) 302 PRK13117 consensus 97.1 0.0082 2.1E-07 39.6 9.9 181 151-373 54-242 (268) 303 COG0159 TrpA Tryptophan syntha 97.1 0.024 6.2E-07 36.2 12.1 183 150-373 53-241 (265) 304 PRK07028 bifunctional hexulose 97.1 0.029 7.4E-07 35.7 12.4 121 237-372 73-197 (429) 305 CHL00200 trpA tryptophan synth 97.1 0.033 8.3E-07 35.3 14.7 169 163-372 67-238 (263) 306 TIGR00007 TIGR00007 phosphorib 97.1 0.013 3.3E-07 38.2 10.5 178 169-368 27-231 (241) 307 PRK07455 keto-hydroxyglutarate 97.1 0.0098 2.5E-07 39.0 9.9 122 221-369 64-185 (210) 308 COG0517 FOG: CBS domain [Gener 97.0 0.0015 3.8E-08 44.9 5.3 53 157-210 3-55 (117) 309 PRK13116 consensus 97.0 0.038 9.8E-07 34.8 13.1 180 151-373 54-243 (278) 310 KOG2335 consensus 97.0 0.0025 6.3E-08 43.4 6.2 98 233-335 156-266 (358) 311 pfam03060 NPD 2-nitropropane d 97.0 0.019 4.7E-07 37.0 10.6 67 235-302 147-221 (330) 312 TIGR03520 GldE gliding motilit 97.0 0.0008 2E-08 46.8 3.6 61 149-209 187-250 (408) 313 PRK06512 thiamine-phosphate py 97.0 0.0061 1.5E-07 40.5 8.0 99 263-371 98-198 (221) 314 TIGR03415 ABC_choXWV_ATP choli 97.0 0.0009 2.3E-08 46.5 3.7 51 238-291 176-231 (382) 315 cd04738 DHOD_2_like Dihydrooro 96.9 0.043 1.1E-06 34.5 16.8 44 259-302 264-309 (327) 316 PRK06857 consensus 96.9 0.045 1.2E-06 34.3 12.2 123 221-370 63-185 (209) 317 PRK08091 ribulose-phosphate 3- 96.9 0.0068 1.7E-07 40.2 7.7 132 236-377 82-225 (235) 318 PRK10550 tRNA-dihydrouridine s 96.9 0.042 1.1E-06 34.5 11.7 99 259-366 117-225 (312) 319 cd04722 TIM_phosphate_binding 96.9 0.049 1.2E-06 34.0 17.0 65 237-302 128-199 (200) 320 COG2239 MgtE Mg/Co/Ni transpor 96.9 0.001 2.6E-08 46.1 3.4 61 150-210 129-194 (451) 321 PRK13125 trpA tryptophan synth 96.8 0.036 9.2E-07 35.0 11.1 127 236-373 95-225 (247) 322 PRK11815 tRNA-dihydrouridine s 96.8 0.0088 2.3E-07 39.4 7.9 196 44-302 8-233 (333) 323 PRK00043 thiE thiamine-phospha 96.8 0.01 2.6E-07 38.9 8.2 100 265-371 93-192 (210) 324 cd00429 RPE Ribulose-5-phospha 96.8 0.055 1.4E-06 33.7 13.3 124 235-373 70-202 (211) 325 PRK08745 ribulose-phosphate 3- 96.8 0.055 1.4E-06 33.7 12.8 122 236-371 76-205 (223) 326 PRK13125 trpA tryptophan synth 96.8 0.056 1.4E-06 33.6 12.2 68 235-302 145-217 (247) 327 PRK13131 consensus 96.8 0.021 5.2E-07 36.7 9.6 127 236-373 106-235 (257) 328 PRK05581 ribulose-phosphate 3- 96.8 0.057 1.5E-06 33.6 12.9 125 236-372 75-205 (220) 329 pfam00834 Ribul_P_3_epim Ribul 96.8 0.057 1.5E-06 33.6 12.2 120 236-369 71-198 (201) 330 PRK08104 consensus 96.7 0.025 6.4E-07 36.1 9.7 123 221-370 66-188 (212) 331 pfam04481 DUF561 Protein of un 96.7 0.016 4.1E-07 37.5 8.6 133 235-371 71-220 (243) 332 TIGR00393 kpsF sugar isomerase 96.7 0.0017 4.4E-08 44.4 3.7 156 14-208 45-212 (272) 333 PRK13127 consensus 96.7 0.026 6.6E-07 36.0 9.6 170 163-373 63-235 (262) 334 TIGR01303 IMP_DH_rel_1 IMP deh 96.7 0.022 5.7E-07 36.5 9.3 370 19-455 69-474 (476) 335 pfam01207 Dus Dihydrouridine s 96.7 0.0099 2.5E-07 39.0 7.3 192 51-302 2-213 (309) 336 PRK08904 consensus 96.7 0.017 4.4E-07 37.3 8.5 122 221-369 61-182 (207) 337 TIGR00400 mgtE magnesium trans 96.7 0.0034 8.7E-08 42.3 4.9 111 98-210 141-257 (460) 338 PRK00366 ispG 4-hydroxy-3-meth 96.6 0.029 7.5E-07 35.6 9.4 211 42-305 21-242 (367) 339 cd00452 KDPG_aldolase KDPG and 96.6 0.072 1.8E-06 32.8 11.7 122 220-369 54-175 (190) 340 pfam00290 Trp_syntA Tryptophan 96.6 0.029 7.4E-07 35.7 9.4 179 151-373 46-233 (258) 341 cd04747 OYE_like_5_FMN Old yel 96.6 0.013 3.4E-07 38.1 7.5 127 234-369 146-332 (361) 342 PRK02615 thiamine-phosphate py 96.6 0.021 5.3E-07 36.7 8.5 118 228-371 151-271 (345) 343 pfam05690 ThiG Thiazole biosyn 96.6 0.017 4.4E-07 37.3 8.1 82 282-373 130-211 (246) 344 pfam04309 G3P_antiterm Glycero 96.6 0.026 6.7E-07 36.0 9.0 36 329-364 132-167 (174) 345 PRK12581 oxaloacetate decarbox 96.6 0.038 9.6E-07 34.9 9.7 82 234-316 165-252 (468) 346 cd04729 NanE N-acetylmannosami 96.6 0.035 9E-07 35.1 9.6 71 226-302 128-206 (219) 347 cd02071 MM_CoA_mut_B12_BD meth 96.6 0.026 6.5E-07 36.0 8.8 71 234-307 39-111 (122) 348 PRK09140 2-dehydro-3-deoxy-6-p 96.6 0.045 1.1E-06 34.3 10.0 122 221-369 62-183 (206) 349 PRK02083 imidazole glycerol ph 96.6 0.043 1.1E-06 34.4 9.9 70 232-302 30-103 (253) 350 PRK11573 hypothetical protein; 96.5 0.0039 1E-07 41.9 4.6 60 150-209 184-246 (413) 351 PRK00507 deoxyribose-phosphate 96.5 0.083 2.1E-06 32.4 11.2 114 236-361 78-204 (221) 352 cd02940 DHPD_FMN Dihydropyrimi 96.5 0.019 4.8E-07 37.0 7.9 70 233-302 181-281 (299) 353 PRK04169 geranylgeranylglycery 96.5 0.084 2.1E-06 32.4 11.8 69 234-302 142-214 (229) 354 pfam04551 GcpE GcpE protein. I 96.5 0.031 7.9E-07 35.4 8.9 266 42-375 10-303 (345) 355 PRK01130 N-acetylmannosamine-6 96.5 0.04 1E-06 34.7 9.5 70 226-302 124-203 (222) 356 cd04728 ThiG Thiazole synthase 96.5 0.02 5.2E-07 36.8 7.9 82 282-373 131-212 (248) 357 TIGR00737 nifR3_yhdG putative 96.5 0.0079 2E-07 39.7 5.7 210 41-302 2-238 (336) 358 PRK00208 thiG thiazole synthas 96.5 0.022 5.5E-07 36.6 7.9 83 282-374 132-214 (256) 359 PRK07114 keto-hydroxyglutarate 96.4 0.019 4.9E-07 36.9 7.6 123 220-369 70-193 (223) 360 PRK10558 alpha-dehydro-beta-de 96.4 0.063 1.6E-06 33.2 10.2 105 208-333 3-111 (256) 361 PRK00748 1-(5-phosphoribosyl)- 96.4 0.059 1.5E-06 33.5 10.0 70 232-302 29-102 (241) 362 PRK02747 consensus 96.4 0.062 1.6E-06 33.3 10.0 70 232-302 30-103 (257) 363 PRK07695 transcriptional regul 96.4 0.061 1.5E-06 33.4 9.9 118 237-371 65-183 (202) 364 PRK13139 consensus 96.3 0.051 1.3E-06 33.9 9.3 180 151-373 53-239 (254) 365 cd04742 NPD_FabD 2-Nitropropan 96.3 0.042 1.1E-06 34.5 8.8 87 39-126 6-99 (418) 366 COG0821 gcpE 1-hydroxy-2-methy 96.3 0.076 1.9E-06 32.6 10.1 211 43-306 16-237 (361) 367 PRK12331 oxaloacetate decarbox 96.3 0.059 1.5E-06 33.4 9.6 82 234-316 156-243 (463) 368 PRK08508 biotin synthase; Prov 96.3 0.098 2.5E-06 31.9 10.6 138 221-369 90-254 (279) 369 PRK09282 pyruvate carboxylase 96.3 0.057 1.5E-06 33.5 9.3 134 171-316 98-244 (580) 370 PRK13121 consensus 96.3 0.052 1.3E-06 33.8 9.0 181 150-373 53-241 (265) 371 PRK13111 trpA tryptophan synth 96.3 0.081 2.1E-06 32.5 10.0 169 162-373 60-232 (256) 372 PRK13112 consensus 96.3 0.056 1.4E-06 33.6 9.2 181 151-373 55-242 (279) 373 PRK13126 consensus 96.3 0.11 2.9E-06 31.4 10.7 130 234-373 84-216 (237) 374 KOG2118 consensus 96.3 0.022 5.5E-07 36.6 7.0 147 57-205 168-324 (498) 375 CHL00162 thiG thiamin biosynth 96.3 0.033 8.5E-07 35.2 7.9 87 277-373 140-226 (267) 376 PRK11750 gltB glutamate syntha 96.3 0.052 1.3E-06 33.9 8.9 143 291-470 1015-1176(1483) 377 PRK13134 consensus 96.3 0.062 1.6E-06 33.3 9.3 173 163-377 71-246 (257) 378 PRK13585 1-(5-phosphoribosyl)- 96.2 0.083 2.1E-06 32.4 9.9 97 184-302 3-104 (240) 379 TIGR01137 cysta_beta cystathio 96.2 0.0054 1.4E-07 40.9 3.8 143 60-216 265-437 (527) 380 cd02801 DUS_like_FMN Dihydrour 96.2 0.028 7E-07 35.8 7.4 67 233-303 139-214 (231) 381 PRK03512 thiamine-phosphate py 96.2 0.067 1.7E-06 33.1 9.3 100 265-371 92-192 (211) 382 PRK12999 pyruvate carboxylase; 96.2 0.088 2.2E-06 32.2 9.9 138 167-316 625-781 (1147) 383 PRK06015 keto-hydroxyglutarate 96.2 0.11 2.8E-06 31.5 10.4 123 221-370 66-188 (212) 384 PRK13132 consensus 96.2 0.081 2.1E-06 32.5 9.6 178 151-373 48-231 (246) 385 PRK08883 ribulose-phosphate 3- 96.1 0.13 3.4E-06 30.9 12.9 124 236-371 72-201 (220) 386 CHL00200 trpA tryptophan synth 96.1 0.088 2.2E-06 32.2 9.5 68 234-302 156-231 (263) 387 PRK13124 consensus 96.1 0.073 1.9E-06 32.8 9.0 179 151-373 46-231 (257) 388 PRK13129 consensus 96.1 0.12 3.1E-06 31.2 10.1 170 162-373 70-243 (267) 389 PRK13118 consensus 96.1 0.06 1.5E-06 33.4 8.5 181 151-373 54-241 (269) 390 PRK13136 consensus 96.1 0.11 2.8E-06 31.5 9.8 169 162-373 63-234 (253) 391 PRK02145 consensus 96.0 0.095 2.4E-06 32.0 9.3 70 232-302 31-104 (257) 392 COG0159 TrpA Tryptophan syntha 96.0 0.15 3.8E-06 30.6 13.6 67 234-302 159-233 (265) 393 PRK13135 consensus 96.0 0.099 2.5E-06 31.8 9.4 181 151-373 54-240 (267) 394 cd04724 Tryptophan_synthase_al 96.0 0.097 2.5E-06 31.9 9.3 170 162-373 51-223 (242) 395 pfam04131 NanE Putative N-acet 96.0 0.027 6.8E-07 35.9 6.4 70 226-302 97-173 (192) 396 PRK13115 consensus 96.0 0.13 3.2E-06 31.1 9.8 181 151-373 61-246 (269) 397 PRK11840 bifunctional sulfur c 96.0 0.054 1.4E-06 33.7 7.9 126 234-373 152-287 (327) 398 cd02812 PcrB_like PcrB_like pr 96.0 0.12 3.1E-06 31.2 9.7 57 244-302 147-204 (219) 399 TIGR01186 proV glycine betaine 95.9 0.014 3.5E-07 38.0 4.7 140 103-255 185-346 (372) 400 TIGR03572 WbuZ glycosyl amidat 95.9 0.12 3.1E-06 31.2 9.4 70 232-302 30-103 (232) 401 PRK04281 consensus 95.9 0.01 2.5E-07 39.0 3.8 70 232-302 30-103 (254) 402 cd04730 NPD_like 2-Nitropropan 95.9 0.043 1.1E-06 34.5 7.0 66 236-302 113-185 (236) 403 PRK13597 imidazole glycerol ph 95.9 0.13 3.4E-06 30.9 9.6 70 232-302 31-104 (252) 404 cd00564 TMP_TenI Thiamine mono 95.9 0.17 4.4E-06 30.1 10.8 100 265-371 85-184 (196) 405 PRK13140 consensus 95.8 0.18 4.5E-06 30.0 14.9 180 151-372 51-238 (257) 406 PRK08005 ribulose-phosphate 3- 95.8 0.19 4.8E-06 29.8 13.5 174 171-372 14-198 (210) 407 COG1646 Predicted phosphate-bi 95.7 0.19 4.9E-06 29.7 11.9 67 234-302 151-220 (240) 408 COG4175 ProV ABC-type proline/ 95.7 0.013 3.3E-07 38.2 3.9 18 238-256 176-193 (386) 409 pfam00290 Trp_syntA Tryptophan 95.7 0.2 5E-06 29.7 10.9 67 234-302 151-225 (258) 410 PRK13126 consensus 95.7 0.2 5.2E-06 29.6 11.0 79 234-316 135-218 (237) 411 PRK06512 thiamine-phosphate py 95.6 0.21 5.5E-06 29.4 10.3 37 263-300 154-190 (221) 412 PRK13114 consensus 95.6 0.22 5.5E-06 29.4 11.9 180 151-373 50-237 (266) 413 PRK05718 keto-hydroxyglutarate 95.6 0.18 4.6E-06 30.0 9.3 123 220-369 65-187 (212) 414 PRK02621 consensus 95.6 0.11 2.8E-06 31.5 8.1 70 232-302 30-103 (254) 415 pfam04476 DUF556 Protein of un 95.6 0.11 2.7E-06 31.6 8.1 148 229-399 6-173 (235) 416 cd04726 KGPDC_HPS 3-Keto-L-gul 95.5 0.081 2.1E-06 32.5 7.4 122 238-372 70-193 (202) 417 COG2070 Dioxygenases related t 95.5 0.047 1.2E-06 34.2 6.2 66 237-302 139-213 (336) 418 PRK13138 consensus 95.5 0.18 4.6E-06 30.0 9.1 182 151-373 50-239 (264) 419 cd00959 DeoC 2-deoxyribose-5-p 95.5 0.22 5.7E-06 29.3 11.5 116 236-362 73-200 (203) 420 PRK13305 sgbH 3-keto-L-gulonat 95.5 0.23 5.8E-06 29.3 13.1 132 230-373 65-199 (220) 421 PRK08782 consensus 95.5 0.23 5.8E-06 29.2 12.6 122 221-369 68-189 (219) 422 pfam00218 IGPS Indole-3-glycer 95.5 0.2 5E-06 29.7 9.1 120 242-377 128-249 (254) 423 pfam02581 TMP-TENI Thiamine mo 95.4 0.15 3.8E-06 30.6 8.4 89 266-362 86-174 (180) 424 PRK13523 NADPH dehydrogenase N 95.4 0.037 9.4E-07 34.9 5.3 88 232-336 227-323 (337) 425 PRK08255 salicylyl-CoA 5-hydro 95.4 0.098 2.5E-06 31.9 7.4 89 232-337 643-741 (770) 426 PRK03220 consensus 95.4 0.13 3.2E-06 31.1 7.9 70 232-302 31-104 (257) 427 PRK11815 tRNA-dihydrouridine s 95.4 0.26 6.5E-06 28.9 10.7 98 259-367 119-235 (333) 428 PRK06256 biotin synthase; Vali 95.3 0.26 6.6E-06 28.9 9.3 185 170-369 93-303 (325) 429 KOG4175 consensus 95.3 0.26 6.7E-06 28.8 16.7 68 234-302 160-235 (268) 430 PRK13119 consensus 95.3 0.27 6.8E-06 28.8 10.1 169 162-373 66-239 (261) 431 PRK08508 biotin synthase; Prov 95.3 0.27 6.8E-06 28.7 10.1 82 221-302 152-250 (279) 432 cd04618 CBS_pair_5 The CBS dom 95.3 0.015 3.7E-07 37.8 2.8 42 103-144 53-94 (98) 433 COG0800 Eda 2-keto-3-deoxy-6-p 95.2 0.094 2.4E-06 32.0 6.9 122 221-369 64-185 (211) 434 PRK00830 consensus 95.2 0.16 4.1E-06 30.4 8.0 70 232-302 34-107 (273) 435 PRK07259 dihydroorotate dehydr 95.2 0.071 1.8E-06 32.9 6.2 76 226-302 162-263 (301) 436 PRK02261 methylaspartate mutas 95.2 0.18 4.5E-06 30.0 8.2 71 234-307 43-121 (137) 437 cd02812 PcrB_like PcrB_like pr 95.2 0.051 1.3E-06 33.9 5.3 33 336-368 175-207 (219) 438 PRK00507 deoxyribose-phosphate 95.2 0.3 7.6E-06 28.4 9.9 68 235-306 138-210 (221) 439 TIGR01235 pyruv_carbox pyruvat 95.1 0.3 7.6E-06 28.4 9.9 216 80-316 537-799 (1169) 440 cd04740 DHOD_1B_like Dihydroor 95.1 0.068 1.7E-06 33.0 5.9 68 233-301 167-259 (296) 441 PRK13123 consensus 95.1 0.24 6.1E-06 29.1 8.6 181 151-374 52-237 (256) 442 TIGR01769 GGGP geranylgeranylg 95.1 0.04 1E-06 34.7 4.6 96 184-299 107-209 (212) 443 PRK04169 geranylgeranylglycery 95.1 0.12 3.1E-06 31.2 7.0 34 336-369 185-218 (229) 444 PRK00278 trpC indole-3-glycero 95.1 0.2 5E-06 29.7 8.1 122 242-377 130-251 (261) 445 PRK13127 consensus 95.0 0.32 8.1E-06 28.2 15.8 78 234-316 152-237 (262) 446 COG0167 PyrD Dihydroorotate de 95.0 0.23 5.9E-06 29.2 8.4 79 224-302 164-270 (310) 447 TIGR01163 rpe ribulose-phospha 95.0 0.15 3.8E-06 30.5 7.4 121 237-371 73-203 (216) 448 cd01573 modD_like ModD; Quinol 95.0 0.15 3.9E-06 30.5 7.4 64 234-299 192-255 (272) 449 PRK05848 nicotinate-nucleotide 95.0 0.16 4E-06 30.4 7.5 66 234-302 190-256 (272) 450 PRK09722 allulose-6-phosphate 94.9 0.34 8.6E-06 28.0 13.1 134 220-373 62-204 (227) 451 PRK13129 consensus 94.9 0.34 8.6E-06 28.0 11.3 68 234-302 160-235 (267) 452 PRK08645 bifunctional homocyst 94.9 0.34 8.8E-06 28.0 9.3 141 220-368 323-503 (608) 453 PRK02227 hypothetical protein; 94.9 0.2 5.2E-06 29.6 7.8 147 229-399 6-173 (239) 454 COG0352 ThiE Thiamine monophos 94.9 0.23 5.9E-06 29.2 8.1 98 265-370 94-191 (211) 455 COG2185 Sbm Methylmalonyl-CoA 94.9 0.22 5.7E-06 29.3 8.0 69 235-306 53-123 (143) 456 COG2022 ThiG Uncharacterized e 94.9 0.064 1.6E-06 33.2 5.2 124 237-374 87-220 (262) 457 cd00331 IGPS Indole-3-glycerol 94.9 0.25 6.5E-06 28.9 8.2 123 241-377 90-212 (217) 458 TIGR03151 enACPred_II putative 94.8 0.16 4.1E-06 30.3 7.2 66 236-302 120-190 (307) 459 TIGR01496 DHPS dihydropteroate 94.8 0.15 3.8E-06 30.5 6.9 68 234-302 157-253 (268) 460 PRK07565 dihydroorotate dehydr 94.8 0.13 3.4E-06 30.9 6.6 69 233-302 178-268 (333) 461 pfam01729 QRPTase_C Quinolinat 94.7 0.29 7.3E-06 28.5 8.2 67 233-302 88-155 (169) 462 KOG0474 consensus 94.7 0.045 1.1E-06 34.3 4.0 57 152-208 581-642 (762) 463 pfam01884 PcrB PcrB family. Th 94.6 0.4 1E-05 27.5 11.1 65 237-302 143-211 (231) 464 pfam02581 TMP-TENI Thiamine mo 94.6 0.23 6E-06 29.1 7.5 113 232-370 12-127 (180) 465 PRK00208 thiG thiazole synthas 94.6 0.4 1E-05 27.5 8.7 92 167-302 108-205 (256) 466 COG0502 BioB Biotin synthase a 94.6 0.4 1E-05 27.5 8.8 181 170-366 87-295 (335) 467 PRK13113 consensus 94.6 0.41 1E-05 27.4 10.7 180 151-373 54-241 (263) 468 TIGR03128 RuMP_HxlA 3-hexulose 94.6 0.41 1E-05 27.4 9.9 120 237-372 68-193 (206) 469 cd02810 DHOD_DHPD_FMN Dihydroo 94.6 0.082 2.1E-06 32.4 5.1 69 233-301 177-271 (289) 470 pfam00977 His_biosynth Histidi 94.5 0.42 1.1E-05 27.3 10.1 69 232-301 29-101 (229) 471 TIGR01163 rpe ribulose-phospha 94.5 0.26 6.6E-06 28.8 7.6 26 274-299 169-194 (216) 472 PRK01659 consensus 94.5 0.31 8E-06 28.3 7.9 70 232-302 30-103 (252) 473 PRK00311 panB 3-methyl-2-oxobu 94.5 0.43 1.1E-05 27.3 10.7 33 234-270 97-130 (266) 474 PRK13586 1-(5-phosphoribosyl)- 94.5 0.08 2E-06 32.5 4.9 69 232-302 29-101 (231) 475 COG5016 Pyruvate/oxaloacetate 94.5 0.14 3.7E-06 30.7 6.1 131 169-313 97-242 (472) 476 PRK06739 pyruvate kinase; Vali 94.4 0.16 4E-06 30.4 6.3 124 231-365 164-305 (352) 477 cd04739 DHOD_like Dihydroorota 94.4 0.19 4.9E-06 29.7 6.7 76 226-302 168-266 (325) 478 PRK13135 consensus 94.4 0.46 1.2E-05 27.1 14.3 67 234-302 158-232 (267) 479 COG0269 SgbH 3-hexulose-6-phos 94.3 0.34 8.6E-06 28.0 7.8 34 337-371 165-198 (217) 480 cd04728 ThiG Thiazole synthase 94.3 0.47 1.2E-05 27.0 8.8 93 166-302 106-204 (248) 481 PRK13131 consensus 94.3 0.47 1.2E-05 27.0 11.2 68 234-302 152-227 (257) 482 PRK05211 consensus 94.3 0.48 1.2E-05 26.9 9.1 89 232-338 21-114 (248) 483 cd04724 Tryptophan_synthase_al 94.2 0.48 1.2E-05 26.9 14.1 67 234-302 141-215 (242) 484 PRK07428 nicotinate-nucleotide 94.2 0.39 9.9E-06 27.6 7.9 38 262-299 180-217 (285) 485 TIGR03415 ABC_choXWV_ATP choli 94.2 0.076 1.9E-06 32.7 4.3 16 193-208 240-255 (382) 486 KOG0369 consensus 94.2 0.49 1.3E-05 26.8 9.8 140 168-318 652-810 (1176) 487 pfam05690 ThiG Thiazole biosyn 94.2 0.5 1.3E-05 26.8 9.4 92 167-302 106-203 (246) 488 TIGR01108 oadA oxaloacetate de 94.1 0.21 5.4E-06 29.5 6.4 210 108-380 3-243 (616) 489 cd06556 ICL_KPHMT Members of t 94.1 0.51 1.3E-05 26.7 10.0 13 239-251 163-175 (240) 490 TIGR03239 GarL 2-dehydro-3-deo 94.1 0.52 1.3E-05 26.7 10.9 94 219-333 9-104 (249) 491 PRK13120 consensus 94.1 0.52 1.3E-05 26.7 9.0 185 150-376 57-248 (285) 492 PRK13140 consensus 94.1 0.52 1.3E-05 26.7 10.2 91 172-302 133-231 (257) 493 COG0134 TrpC Indole-3-glycerol 94.1 0.5 1.3E-05 26.8 8.2 115 243-373 127-244 (254) 494 PRK13305 sgbH 3-keto-L-gulonat 94.0 0.33 8.4E-06 28.1 7.2 67 234-302 118-191 (220) 495 TIGR00078 nadC nicotinate-nucl 94.0 0.31 7.8E-06 28.3 7.0 66 234-302 195-261 (276) 496 cd02930 DCR_FMN 2,4-dienoyl-Co 93.9 0.21 5.2E-06 29.6 6.1 84 232-332 224-320 (353) 497 PRK13122 consensus 93.9 0.55 1.4E-05 26.5 10.0 203 109-373 13-222 (242) 498 COG2513 PrpB PEP phosphonomuta 93.8 0.57 1.5E-05 26.4 11.1 179 58-302 45-234 (289) 499 PTZ00300 pyruvate kinase; Prov 93.8 0.58 1.5E-05 26.3 10.5 125 231-366 146-287 (454) 500 PRK06096 molybdenum transport 93.8 0.55 1.4E-05 26.5 8.0 73 228-302 192-264 (284) No 1 >TIGR01302 IMP_dehydrog inosine-5'-monophosphate dehydrogenase; InterPro: IPR005990 Synonyms: Inosine-5'-monophosphate dehydrogenase, Inosinic acid dehydrogenase IMP dehydrogenase (1.1.1.205 from EC,IMPDH) catalyzes the rate-limiting reaction of de novo GTP biosynthesis, the NAD-dependent reduction of IMP into XMP .Inosine 5-phosphate + NAD+ + H2O = xanthosine 5-phosphate + NADH IMP dehydrogenase is associated with cell proliferation and is a possible target for cancer chemotherapy. Mammalian and bacterial IMPDHs are tetramers of identical chains. There are two IMP dehydrogenase isozymes in humans . IMP dehydrogenase nearly always contains a long insertion that has two CBS domains within it and adopts a TIM barrel structure.; GO: 0003938 IMP dehydrogenase activity, 0006177 GMP biosynthetic process. Probab=100.00 E-value=0 Score=1397.23 Aligned_cols=448 Identities=57% Similarity=0.926 Sum_probs=441.0 Q ss_pred CCCCHHEEECCCCCCCCCCCEEEEEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHH Q ss_conf 44600057527643247542253343038816003178337852368889999997798799978989899999999874 Q gi|254780889|r 13 ALTFDDVLLRPEFSNVLPRDIDISTRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQVK 92 (493) Q Consensus 13 ~lTfDDVllvP~~s~v~~~~vdl~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~kVK 92 (493) |||||||||+|+||+|.|.||||+||||++|.|+|||+|||||||||++|||||||+|||||||||||||+|++|||||| T Consensus 1 glTFDDVLl~P~~~~v~p~dvdlstr~t~~i~L~iP~~SspMDTVTE~~MAiAMA~~GGiGVIH~N~~~E~Qae~V~~VK 80 (476) T TIGR01302 1 GLTFDDVLLLPGFIDVEPDDVDLSTRITKNIKLNIPILSSPMDTVTESRMAIAMAREGGIGVIHRNMSIERQAEEVKRVK 80 (476) T ss_pred CCCCCEEEECCCCCEECCCCEEEEEEECCCEEEEEEEEECCCCHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHH T ss_conf 97402157678531232542478743214306754068589882459999999985499479944799899999988752 Q ss_pred HHCCCEEC--CCEEEECCCCHHHHHHHHHHHCCCCEEEECC--CCE----EEEEEEEH---HHHCCCCCCCCC--CCCCC Q ss_conf 22388152--8789946543034565667605041678668--860----27887235---450014344672--44002 Q gi|254780889|r 93 KFESGMVV--NPVTISPYATLADALALMKKYSISGIPVVES--DVG----KLVGILTN---RDVRFASNAQQA--VGELM 159 (493) Q Consensus 93 r~e~gmI~--dPVti~pd~TI~ea~~lm~~~~is~iPVVD~--~~g----kLiGIVT~---rDir~~~~~~~~--V~eiM 159 (493) |+|++||. ||+|+.|+.||.++.++|.+++|+++|||++ +.+ ||+||||. ||++|..+..++ |+++| T Consensus 81 r~e~g~i~redp~t~~P~~tv~~~~~l~~~~gisG~PVv~~G~~~g~ktGKLvGIiT~sqWrD~~f~~~~~~~daV~~~M 160 (476) T TIGR01302 81 RAENGIISREDPVTISPETTVEDVLELMERKGISGIPVVEDGKDGGPKTGKLVGIITKSQWRDVRFVKDKGKKDAVSEVM 160 (476) T ss_pred HHCCCEEEECCCEEECCCCHHHHHHHHCCCCCCCEEEEEECCCCCCCCEEEEEEEEECCCEEECCCCCCCCCCCCEEECC T ss_conf 32066065148868479851899997322157654578836889897100699998377225411010468833010112 Q ss_pred C--CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCC---CCCCEEEEEEEECCCHH Q ss_conf 6--786325730358999999986553306457479978688980210001017422326---56876555665137015 Q gi|254780889|r 160 T--RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKD---SKGRLRVAAAVSVAKDI 234 (493) Q Consensus 160 t--~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D---~~grL~VgAAIg~~~d~ 234 (493) | ++++|+++++++++|+++|.+||+++|||||++++|+||||++||.|++.||+|+|| .+||||||||||+++++ T Consensus 161 T~~~~~iT~~e~i~~e~A~~~L~~~r~ekLpvVd~~~~lVgLiT~~Di~~~~~~P~A~kd~vG~~GrL~VgAAvg~r~~D 240 (476) T TIGR01302 161 TPREELITVPEGIDLEEALKVLHKHRIEKLPVVDKDGELVGLITVKDIVKRREFPHASKDTVGENGRLIVGAAVGTREDD 240 (476) T ss_pred CCCCCCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEEHHHHHHHHHCCCCCCCCCCCCCEEEEEEEECCCCCC T ss_conf 03764348416777899999988608650478827898899986447889863888778874888608999884689861 Q ss_pred HHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHH Q ss_conf 67899987313666998248411388999999999857987245146469999999985487589961027876653100 Q gi|254780889|r 235 ADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVV 314 (493) Q Consensus 235 ~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~ 314 (493) .||+.+|++||||+||||+|||||.++|++|||+|+.||+++||||||+|+++|++|++||||+||||||||||||||+| T Consensus 241 ~~R~~~L~~AGvDv~viDsshGhs~~vl~~ik~~k~~Yp~~~iiaGNVaT~~~a~~LI~AgADg~rVGiGpGSICTTr~V 320 (476) T TIGR01302 241 LERAEALVEAGVDVIVIDSSHGHSIYVLDSIKKIKKTYPDLDIIAGNVATAEQAKALIDAGADGLRVGIGPGSICTTRIV 320 (476) T ss_pred HHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHCCCCEEEECCCCCCCCEEEEE T ss_conf 89999999659658998166545378999999998638805799434411788988985288878983688981100156 Q ss_pred CCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCC Q ss_conf 01574310037889998622698199726543777621336714878993230101216776448736805888733200 Q gi|254780889|r 315 TGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGS 394 (493) Q Consensus 315 ~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S 394 (493) +||||||+|||++|+++|+++|+||||||||||||||||||||||||||||||||||+|||||++++||++||.|||||| T Consensus 321 ~gVGvPQ~TAv~~Va~~A~~~Gi~VIADGGIr~SGDivKAlAaGA~aVMlGslLAGT~EsPGe~~~~nGrryK~YRGMGS 400 (476) T TIGR01302 321 AGVGVPQITAVYDVAEYAAQSGIPVIADGGIRYSGDIVKALAAGADAVMLGSLLAGTTESPGEYYIINGRRYKSYRGMGS 400 (476) T ss_pred EECCCHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHCCCEEHHCCCCCCCCCCCCCEEEECCEEEEEECCCCH T ss_conf 51276268899999999972799099837756255899999816772202342101633887269966878987528530 Q ss_pred HHHHH---HCCHHHHCCCCCC-------CCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 68875---2212221012333-------433100454026621057589989999999998555157587768973 Q gi|254780889|r 395 VAAME---RGSSARYSQDGVT-------DVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQK 460 (493) Q Consensus 395 ~~A~~---~~~~~ry~~~~~~-------~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~ 460 (493) ++||+ +||.+|||+++.. ...++|||||||.|||||||.++|+||++||||||+|+|++||+|||+ T Consensus 401 ~gAM~~~g~gS~~RY~~~~~~neytDen~~~~~VpeGVeG~VpyKGsv~~~l~ql~gGL~~gmg~~G~~~i~~l~~ 476 (476) T TIGR01302 401 LGAMENKGKGSSDRYLQEENKNEYTDENEDKKFVPEGVEGAVPYKGSVLELLPQLVGGLKSGMGYVGARSIDELRE 476 (476) T ss_pred HHHHCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHCC T ss_conf 8886478876665533577624345661012407765067417665556677788899996343314255798809 No 2 >PRK05567 inositol-5'-monophosphate dehydrogenase; Reviewed Probab=100.00 E-value=0 Score=1327.14 Aligned_cols=484 Identities=62% Similarity=0.990 Sum_probs=473.5 Q ss_pred CCCCCCCCCCCCCCCCHHEEECCCCCCCCCCCEEEEEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCC Q ss_conf 92101667667644600057527643247542253343038816003178337852368889999997798799978989 Q gi|254780889|r 1 MARIIENNVGGVALTFDDVLLRPEFSNVLPRDIDISTRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFS 80 (493) Q Consensus 1 m~~~~~~~~~~~~lTfDDVllvP~~s~v~~~~vdl~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~s 80 (493) ||++.. ++||||||||+|+||++.|++|||+|+||++|+|+|||||||||||||++|||+||++||||||||||| T Consensus 1 ~~~~~~-----~~LTfDDVlLvP~~s~v~p~dVdl~t~lt~~i~L~iPivSS~MDTVTe~~MAiamA~~GGlGVIHrn~s 75 (486) T PRK05567 1 MLRIAK-----EALTFDDVLLVPAHSEVLPNDVDLSTQLTKNIRLNIPLLSAAMDTVTEARLAIAMAQEGGIGVIHKNMS 75 (486) T ss_pred CCCHHC-----CCCCCCCEEECCCCCCCCCCCCEEEEECCCCCCCCCCEEECCCCCCCCHHHHHHHHHCCCEEEECCCCC T ss_conf 942112-----565803378867877767672252036248913288778678766578999999998898799989999 Q ss_pred HHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCC Q ss_conf 89999999987422388152878994654303456566760504167866886027887235450014344672440026 Q gi|254780889|r 81 PSEQVAQVHQVKKFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMT 160 (493) Q Consensus 81 ie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt 160 (493) ||+|+++|++|||||+||+.||++++|+.|+.|+.+++.+++++++||||++ ++|+||||.||++|.++.+++|+++|| T Consensus 76 ie~Q~~~V~~VKr~e~g~i~~P~tl~P~~tv~d~~~l~~~~~~sg~PVv~~~-~kL~GiiT~rD~~f~~~~~~~V~~vMT 154 (486) T PRK05567 76 IEEQAEEVRKVKRSESGVVTDPVTVTPDTTLAEALALMARYGISGVPVVDEE-GKLVGIITNRDVRFETDLSQPVSEVMT 154 (486) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCEEECCC-CCEEEEEECCCEECCCCCCCCHHHHHC T ss_conf 9999999999975306713798676898889999999997287861487679-947888614201100267765467534 Q ss_pred C-CCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHH Q ss_conf 7-863257303589999999865533064574799786889802100010174223265687655566513701567899 Q gi|254780889|r 161 R-NLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVG 239 (493) Q Consensus 161 ~-~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~ 239 (493) + +++|+++++++++|.++|.++|+++|||||++|+|+|+||++|++|++.||+|++|++|||+||||||+++++.||++ T Consensus 155 ~~~lvt~~~~is~~eA~~~l~~~kieklPvVd~~g~L~Glit~kDi~k~~~~P~A~~D~~grL~VgAAVg~~~~~~eRa~ 234 (486) T PRK05567 155 KERLVTVPEGTTLEEALELLHEHRIEKLLVVDDNGRLKGLITVKDIEKAEEFPNACKDEQGRLRVGAAVGVGADNEERAE 234 (486) T ss_pred CCCEEEECCCCCHHHHHHHHHHHHHCCCCEECCCCCEEEEEEHHHHHHHHHCCCCCCCCCCCEEEEEEECCCCCHHHHHH T ss_conf 57328925889999999999973130342774689688887766777652088512366688899999626801899999 Q ss_pred HHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCC Q ss_conf 98731366699824841138899999999985798724514646999999998548758996102787665310001574 Q gi|254780889|r 240 PLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGC 319 (493) Q Consensus 240 ~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~ 319 (493) +|++||||+||||+||||+.+++++++++|+.||+++||+|||+|++++++|++||||+||||||||||||||+++|+|+ T Consensus 235 ~Lv~AGvDvivIDtAhGhs~~vi~~ik~ik~~~~~v~viaGNv~T~~~a~~L~~aGaD~vkVGiG~GsiCtTr~v~GvGv 314 (486) T PRK05567 235 ALVKAGVDVLVVDTAHGHSEGVLDRVREIKAKYPDVQIIAGNVATAEAARALIEAGADAVKVGIGPGSICTTRIVAGVGV 314 (486) T ss_pred HHHHCCCCEEEEECCCCCHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCC T ss_conf 99976998899504452157789999999740787736875120199999999729876996566886651343247786 Q ss_pred CCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHH Q ss_conf 31003788999862269819972654377762133671487899323010121677644873680588873320068875 Q gi|254780889|r 320 PQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAME 399 (493) Q Consensus 320 pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~ 399 (493) ||+|||++|+++++++++|||||||||+||||+||||||||+|||||+||||+||||++++++|++||.||||||++||+ T Consensus 315 Pq~tAv~~~a~~a~~~~v~iIADGGi~~sGdi~KAla~GAd~VMlGs~lAgt~EsPG~~~~~~G~~~K~YRGMgS~~Am~ 394 (486) T PRK05567 315 PQITAIADAAEAAKKTGIPVIADGGIRYSGDIAKALAAGASAVMLGSMLAGTEEAPGEVELYQGRSYKSYRGMGSLGAMS 394 (486) T ss_pred CHHHHHHHHHHHHHHCCCEEEECCCCCCCCHHHHHHHCCCCEEEECCHHCCCCCCCCCEEEECCEEEEEEECCCCHHHHH T ss_conf 46999999999998659779964883543579999865898898661214776799726847896999971566599886 Q ss_pred HCCHHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCEEEEECHHHHCCCCCCC Q ss_conf 22122210123334331004540266210575899899999999985551575877689737599999553412388863 Q gi|254780889|r 400 RGSSARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQKKANFIRVSVAGLRESHVHD 479 (493) Q Consensus 400 ~~~~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~~~~f~~~t~ag~~E~~~Hd 479 (493) +||.+||||+. ....|+|||||||+||||||+.++|+||.|||||+|+|+||+||+|||+|++|+++|+||++|||||| T Consensus 395 ~gs~~Ry~q~~-~~~~k~v~EGveg~vpykG~v~~~i~~l~gGlrs~m~y~Ga~~i~el~~ka~fv~iT~ag~~Es~pHd 473 (486) T PRK05567 395 KGSSDRYFQSV-NAADKLVPEGIEGRVPYKGPLSEIIHQLMGGLRSGMGYTGAATIEELREKAEFVRITGAGLRESHVHD 473 (486) T ss_pred CCCCCCCCCCC-CCCCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHCCEEEEECCCCCCCCCCCC T ss_conf 03422100022-43455157734798605866899999987777540437787769999856979999902133468975 Q ss_pred EEEEEECCCCCC Q ss_conf 489610788655 Q gi|254780889|r 480 VKITRESPNYSE 491 (493) Q Consensus 480 i~~~~~~~ny~~ 491 (493) |+||+|||||.- T Consensus 474 v~i~~e~pny~~ 485 (486) T PRK05567 474 VQITKEAPNYRV 485 (486) T ss_pred EEEECCCCCCCC T ss_conf 467331789879 No 3 >PTZ00314 inosine-5'-monophosphate dehydrogenase; Provisional Probab=100.00 E-value=0 Score=1295.17 Aligned_cols=471 Identities=39% Similarity=0.681 Sum_probs=461.6 Q ss_pred CCCCCCCHHEEECCCCCCCCCCCEEEEEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHH Q ss_conf 67644600057527643247542253343038816003178337852368889999997798799978989899999999 Q gi|254780889|r 10 GGVALTFDDVLLRPEFSNVLPRDIDISTRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVH 89 (493) Q Consensus 10 ~~~~lTfDDVllvP~~s~v~~~~vdl~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~ 89 (493) ...+||||||||+|+||++.|++|||+|+||++|+|+|||||||||||||++|||+|||+|||||||||||||+|++||+ T Consensus 13 ~~~~lTfDDVlLvP~~s~v~p~~Vdl~t~lt~~i~L~iPivSs~MDTVTe~~MAiamA~~GGiGVIHrn~sie~Q~~~V~ 92 (499) T PTZ00314 13 GDKGLTYDDFIILPGYIDFSADDVDLSGQLTKNIRLHIPIVSSPMDTVTEHKMAIAMALMGGIGVIHNNCTVERQVEEVK 92 (499) T ss_pred CCCCCCCCCEEECCCCCCCCHHHEEEEEECCCCCEECCCEEECCCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHH T ss_conf 89861816568867888778504174103248912078779678876568999999997798699879999999999999 Q ss_pred HHHHHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCC--EEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 87422388152878994654303456566760504167866886--0278872354500143446724400267863257 Q gi|254780889|r 90 QVKKFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDV--GKLVGILTNRDVRFASNAQQAVGELMTRNLITVK 167 (493) Q Consensus 90 kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~--gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~ 167 (493) +|||||+|||.||++++|++|+.|++++|.+++|+++||+|++. +||+||||.||++|.++.+++|+++||++++|++ T Consensus 93 ~VKr~e~g~i~dPit~~p~~ti~d~~~l~~~~~~sg~pVv~~g~~~gkL~GIvT~rD~~f~~d~~~~v~~vMt~~lvt~~ 172 (499) T PTZ00314 93 KVKRFENGFIMDPKSLSPEHTVSDVIEIKDKKGFSGIPITEDGRPGGKLLGIVTSKDIDFVKDKSTPVSEIMTTDLVVGR 172 (499) T ss_pred HHHHHCCCCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCCEEEEEEECCCEECCCCCCCCHHHHHCCCCEEEC T ss_conf 98763066206982738986299999756751886367986388688689998362023230677588886156726725 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCC Q ss_conf 30358999999986553306457479978688980210001017422326568765556651370156789998731366 Q gi|254780889|r 168 KTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVD 247 (493) Q Consensus 168 ~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvD 247 (493) ++++++||.++|.++|+++|||||++|+|+|++|++|++|++.||+|++|++|||+||||||+++++.||+++|++|||| T Consensus 173 ~~isl~eA~~ll~~~kieklpvVd~~g~L~glit~kDi~k~~~~P~a~~D~~grL~VgAAVg~~~~~~eRa~~Lv~aGvD 252 (499) T PTZ00314 173 YPITLEDANDVLNRSRKGVLPIVNDNGELVALVSRSDAVKNRDYPHASKDENKQLLVGAAISTREEDKERAAALIDAGVD 252 (499) T ss_pred CCCCHHHHHHHHHHHHHCCCEEECCCCCEEEEEECCHHHHHHHCCCHHHHCCCCEEEEEEECCCCCHHHHHHHHHHCCCC T ss_conf 99999999999986033123066578958998630348775338712220138789999947880489999999986998 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHH Q ss_conf 69982484113889999999998579872451464699999999854875899610278766531000157431003788 Q gi|254780889|r 248 LVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMS 327 (493) Q Consensus 248 viviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~ 327 (493) +||||+||||+.++++++++||+.||+++||+|||+|++++++|++||||+||||||||||||||+++|+|+||+|||++ T Consensus 253 vlvIDtAhGhs~~v~~~ik~ik~~~p~v~vIaGNVaT~~~a~~Li~aGAD~vkVGiGpGSiCTTR~v~GvGvPq~tAv~~ 332 (499) T PTZ00314 253 VLVLDSSQGNSIYQIDFIKWIKSTYPHLEVIAGNVVTQDQAKNLIDAGADGIRIGMGSGSICTTQEVCAVGRPQATAVYK 332 (499) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCHHHHHHH T ss_conf 99981688772789999999885279884676433109999999974998799753588551046434667860567999 Q ss_pred HHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHH--CCHHH Q ss_conf 9998622698199726543777621336714878993230101216776448736805888733200688752--21222 Q gi|254780889|r 328 VVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMER--GSSAR 405 (493) Q Consensus 328 ~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~--~~~~r 405 (493) |+++++++++|||||||||+||||+||||||||+||||||||||+||||++++++|++||.||||||++||++ ++++| T Consensus 333 ~a~~a~~~gvpiIADGGIr~sGDi~KAlAaGAd~VMlGsllAGt~EsPGe~~~~~G~~yK~YRGMgS~~Am~~~~gs~~r 412 (499) T PTZ00314 333 VARYAHSRGVPCIADGGIRSSGDIVKALALGASCVMLGSMLAGTEETPGEYFFKNGVRLKKYRGMGSLEAMSQGKGSGSR 412 (499) T ss_pred HHHHHCCCCCEEEECCCCCCHHHHHHHHHCCCCEEEECCCCCCCCCCCCCEEEECCEEEEEEEECCHHHHHHHCCCCCCH T ss_conf 99986449985991478464318999987289878608410476779972899999999999710439999744466300 Q ss_pred HCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHC-----CEEEEECHHHHCCCCCCCE Q ss_conf 10123334331004540266210575899899999999985551575877689737-----5999995534123888634 Q gi|254780889|r 406 YSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQKK-----ANFIRVSVAGLRESHVHDV 480 (493) Q Consensus 406 y~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~~-----~~f~~~t~ag~~E~~~Hdi 480 (493) ||++. ..++|||||||+||||||+.++|+||+|||||+|+|+||+||+|||+| ++|+++|+||++||||||| T Consensus 413 y~~~~---~~~~v~EGveg~vp~kG~v~~~l~ql~gGlrs~m~Y~Ga~~i~el~~k~~~g~~~f~~~T~ag~~Es~~Hdi 489 (499) T PTZ00314 413 YLSEE---EKIQVAQGVSGSVVDKGSVLKLIPYLTKGVKHGAQDIGEISIDALREKLYSGQVRFERRSPNAQVEGGVHSL 489 (499) T ss_pred HHHHH---CCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCCEEEEECHHHHHCCCCCCC T ss_conf 11210---367477767798666887899999998587640417687869999864126772699986111001558763 Q ss_pred EEE Q ss_conf 896 Q gi|254780889|r 481 KIT 483 (493) Q Consensus 481 ~~~ 483 (493) +.- T Consensus 490 ~~~ 492 (499) T PTZ00314 490 HSY 492 (499) T ss_pred EEE T ss_conf 011 No 4 >PRK07107 inositol-5-monophosphate dehydrogenase; Validated Probab=100.00 E-value=0 Score=1274.00 Aligned_cols=476 Identities=33% Similarity=0.519 Sum_probs=454.0 Q ss_pred CCCCCCCCCCCCCCCCHHEEECCCCC--CCCCCCEEEEEEECC-------CCEECCCEEEECCCCCCCHHHHHHHHHCCC Q ss_conf 92101667667644600057527643--247542253343038-------816003178337852368889999997798 Q gi|254780889|r 1 MARIIENNVGGVALTFDDVLLRPEFS--NVLPRDIDISTRIAK-------DFTLNLPIMSAAMDQVTDSRLAIAMAQAGG 71 (493) Q Consensus 1 m~~~~~~~~~~~~lTfDDVllvP~~s--~v~~~~vdl~t~lt~-------~i~l~iPivSs~MDTVTe~~MAiamA~~GG 71 (493) |+.++.+ .+||||||||+|+|| +++|++|||+|+||| +|+|||||||||||||||++|||+|||+|| T Consensus 1 m~~~~~~----~~~TfDDVlLvP~~S~v~~lp~~VdlsT~lTrf~~~~~~~I~LniPiVSAaMDTVTE~~MAIamA~~GG 76 (497) T PRK07107 1 MAFYFEE----PSRTFGEYLLVPGLSTKECVPQNVSLKTPLVKFKKGEESAITLNIPLVSAIMQSVSDDNMAIALAREGG 76 (497) T ss_pred CCCCCCC----CCCCCCCEEECCCCCCCCCCCCEEEECCCEEECCCCCCCCEEECCCEEECCCCCCCCHHHHHHHHHCCC T ss_conf 9611467----764513368755887777683603702200321567778447568878647753050999999997799 Q ss_pred EEEECCCCCHHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCC--CEEEEEEEEHHHHCCCC Q ss_conf 7999789898999999998742238815287899465430345656676050416786688--60278872354500143 Q gi|254780889|r 72 LGVIHRNFSPSEQVAQVHQVKKFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESD--VGKLVGILTNRDVRFAS 149 (493) Q Consensus 72 iGvIHrn~sie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~--~gkLiGIVT~rDir~~~ 149 (493) |||||||||||+|++||++||||++|||.||+|++|++|+.|++++|.+++|+++||||++ .|||+||||.||++|.. T Consensus 77 iGVIH~Nmsie~Qa~~V~~VKr~esg~I~dPvti~p~~Tv~da~~l~~k~~~sg~PVvd~g~~~gkLvGIvT~RD~rf~~ 156 (497) T PRK07107 77 LSFIFGSQSIESEAAMVRRVKNHKAGFVISDSNLTPDNTLADVLDLKERTGHSTIAVTEDGTANGKLLGIVTSRDYRVSR 156 (497) T ss_pred EEEECCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCEEEEEEECCEEECCC T ss_conf 79987999999999999999563567668981758986499999988871996234510378598799998464011113 Q ss_pred -CCCCCCCCCCC--CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEE Q ss_conf -44672440026--786325730358999999986553306457479978688980210001017422326568765556 Q gi|254780889|r 150 -NAQQAVGELMT--RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAA 226 (493) Q Consensus 150 -~~~~~V~eiMt--~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgA 226 (493) +.+++|+++|| ++++|++++++++||.++|.+||+++|||||++|+|+|++|++||.+++.||+|++|++|||+||| T Consensus 157 ~~~~~~V~~vMT~~~~Lvt~~~g~sl~eA~~lL~~~kiekLpvVd~~g~L~gLiT~kDi~k~~~~P~a~~D~~grL~VgA 236 (497) T PRK07107 157 MSLDTKVKDFMTPFEKLVVAKVGITLKEANNIIWDHKLNTLPIIDKNQHLVYMVFRKDYDSHKENPLELLDSSKRYVVGA 236 (497) T ss_pred CCCCCCHHHHCCCCCCEEECCCCCCHHHHHHHHHHCCCCCEEEECCCCCEEEEEEHHHHHHHHHCCCCCCCCCCCEEEEE T ss_conf 67777665640676542784688999999999986344430278379958999983788886439631137678888999 Q ss_pred EEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCC-CCCEECCCCCCHHHHHHHHHCCCCEEEEEECC Q ss_conf 65137015678999873136669982484113889999999998579-87245146469999999985487589961027 Q gi|254780889|r 227 AVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFP-SLLVMAGNIATAEGALALIDAGADIIKVGIGP 305 (493) Q Consensus 227 AIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~-~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~ 305 (493) |||++ |..||+++|++||+|+||||+||||+.+++++++++|+.|| .++|+||||+|+|++++|++||||+||||||| T Consensus 237 AIg~~-d~~eRa~~Lv~aGvD~lviD~AhGhs~~v~~~ik~ik~~~~~~~~i~aGNVaT~~~~~~L~~aGad~vkVGiGp 315 (497) T PRK07107 237 GINTR-DYEERVPALVEAGADVLCIDSSDGYSEWQKRTLDYIKEKYGDTVKVGAGNVVDRDGFLYLAEAGADFVKVGIGG 315 (497) T ss_pred ECCCC-CHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCHHCHHHHHHHHHCCCCEEEECCCC T ss_conf 63777-89999999998599999803435352999999999998669876341452126999999998089868971159 Q ss_pred CCCCCCCHHCCCCCCCCHHHHHHHHHHHCC------CCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEE Q ss_conf 876653100015743100378899986226------98199726543777621336714878993230101216776448 Q gi|254780889|r 306 GSICTTRVVTGVGCPQLSAIMSVVEVAERA------GVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIF 379 (493) Q Consensus 306 Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~------~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~ 379 (493) |||||||+++|+|+||+|||++|+++++++ ++|||||||||+||||+||||+|||+||||||||||+|||||++ T Consensus 316 GSiCtTr~v~gvG~pQ~sAv~~~a~~~~~~~~~~g~~vpiiADGGi~~sGDi~KAlaaGA~~VMlGsllAGt~EsPGe~~ 395 (497) T PRK07107 316 GSICITREQKGIGRGQATALIDVAKARDEYFERTGVYIPICSDGGIVHDYHMTLALAMGADFIMLGRYFARFDESPTNKV 395 (497) T ss_pred CCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCHHHHHHHCCCCEEEECCCCCCCCCCCCCEE T ss_conf 96621130125677348899999999888777416763287178756554599998538988998811057778997379 Q ss_pred EECCEEEEEEECCCCHHHHHHCCHHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 73680588873320068875221222101233343310045402662105758998999999999855515758776897 Q gi|254780889|r 380 LYQGRSFKSYRGMGSVAAMERGSSARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQ 459 (493) Q Consensus 380 ~~~g~~~K~yrGm~S~~A~~~~~~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~ 459 (493) +++|++||.||||||.+|| +++||++.... ...+||||||+||||||+.++++||+|||||+|+|+||+||+||| T Consensus 396 ~~~G~~~K~YrGMgS~~A~---~~~ry~~~~~~--~~~~~EGveg~VpykG~v~~~i~~l~gGlrs~m~y~Ga~~i~el~ 470 (497) T PRK07107 396 NINGNYMKEYWGEGSNRAR---NWQRYDLGGDK--KLSFEEGVDSYVPYAGKLKDNVAITLSKVRSTMCNCGALSIPELQ 470 (497) T ss_pred EECCEEEEEEECCCCHHHH---CCCHHHCCCCC--EECCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHH T ss_conf 7899698987666756665---10101025632--001479758986578768999999987887725476777699997 Q ss_pred HCCEEEEECHHHHCCCCCCCEEEEEEC Q ss_conf 375999995534123888634896107 Q gi|254780889|r 460 KKANFIRVSVAGLRESHVHDVKITRES 486 (493) Q Consensus 460 ~~~~f~~~t~ag~~E~~~Hdi~~~~~~ 486 (493) +|++|+++|+||++|||||||.||+|+ T Consensus 471 ~ka~f~~iT~ag~~Es~~Hdv~i~~~s 497 (497) T PRK07107 471 QKAKITLVSSTSIVEGGAHDVVLKDAS 497 (497) T ss_pred HCCEEEEECHHHHHCCCCCEEEEECCC T ss_conf 489999997332100478706973689 No 5 >PRK07807 inositol-5-monophosphate dehydrogenase; Validated Probab=100.00 E-value=0 Score=1263.43 Aligned_cols=468 Identities=37% Similarity=0.581 Sum_probs=448.5 Q ss_pred CCCCCCCCCCCCCCCHHEEECCCCCCCCCC-CEEEEEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCC Q ss_conf 210166766764460005752764324754-2253343038816003178337852368889999997798799978989 Q gi|254780889|r 2 ARIIENNVGGVALTFDDVLLRPEFSNVLPR-DIDISTRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFS 80 (493) Q Consensus 2 ~~~~~~~~~~~~lTfDDVllvP~~s~v~~~-~vdl~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~s 80 (493) |+|++++.+.++||||||||+|+||+|+|+ +|+|+| ++++.|+|||||||||||||++|||+||++||||||||||| T Consensus 1 m~~~~~~~~p~~LTFDDVLLvP~~S~v~~~~dV~l~t--t~~i~l~iPivSAaMDTVTE~~MAIamA~~GGiGvIH~Nms 78 (479) T PRK07807 1 MRFLDGHRPPYDLTYNDVFLVPSRSDVASRFDVDLST--ADGTGTTIPVVVANMTAVAGRRMAETVARRGGLVVLPQDLP 78 (479) T ss_pred CCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCEEECC--CCCCCCCCCEEECCCCCCCCHHHHHHHHHCCCEEEECCCCC T ss_conf 9775789999888834677756787788867526103--78988878768458754474999999997798599879999 Q ss_pred HHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCC Q ss_conf 89999999987422388152878994654303456566760504167866886027887235450014344672440026 Q gi|254780889|r 81 PSEQVAQVHQVKKFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMT 160 (493) Q Consensus 81 ie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt 160 (493) ||+|++||++||++ ++++.||+|++|++|+.|++++|.+++|+++|||+++ ++|+||||+||++|. +..++|+++|| T Consensus 79 ie~Qa~~V~kVK~~-~~~i~~Pvti~pd~tv~d~~~l~~~~~~sg~pVv~~~-gkLvGIvT~RDir~~-d~~~~v~~vMT 155 (479) T PRK07807 79 IDAVAETVAWVKSR-DLVFDTPVTLAPDDTVSDALALIHKRAHGAVVVVDEE-GRPVGLVTEADCRGV-DRFTRVGDVMS 155 (479) T ss_pred HHHHHHHHHHHHCC-CCEECCCEEECCCCCHHHHHHHHHHHCCCCCCEECCC-CCEEEEEECHHHCCC-CCCCCHHHHHC T ss_conf 99999999997113-7743699897898719999999998378887414679-947889821341147-77775888625 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHH Q ss_conf 78632573035899999998655330645747997868898021000101742232656876555665137015678999 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGP 240 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~ 240 (493) ++++|++++++++||.++|.+||+++|||||++++|+|+||++||+|++.||+|+ |++|||+||||||+++|++||+++ T Consensus 156 ~~lvt~~~g~sl~eA~~lL~~~kiekLpvVd~~~~L~gLiT~kDi~k~~~~~~a~-D~~grL~VgAAVGv~~d~~eR~~a 234 (479) T PRK07807 156 TDLVTLPAGTDPRKAFDLLEAAPVKVAPVVDADGRLAGVLTRTGALRATIYTPAV-DAAGRLRVAAAVGINGDVAAKARA 234 (479) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCCCEEECCCCEEEEEEEEEHHHHCCCCCCCC-CHHHCEEEEEEECCCCCHHHHHHH T ss_conf 7736613667879999999753523113775699299999961164544688777-822345678872578458999999 Q ss_pred HHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCC Q ss_conf 87313666998248411388999999999857987245146469999999985487589961027876653100015743 Q gi|254780889|r 241 LFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCP 320 (493) Q Consensus 241 LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~p 320 (493) |++||||+||||+|||||.+++++++++|+.||+++||||||+|+|++++|++||||+||||||||||||||+++|+|+| T Consensus 235 Lv~AGvDvlvIDtAHGhS~~vi~~vk~iK~~~p~~~viaGNvaT~~~a~~Li~aGad~ikvGiG~GSiCtTr~v~gvG~p 314 (479) T PRK07807 235 LLEAGVDVLVIDTAHGHQEKMLEAIRAVRALDPGVPLVAGNVVTAEGTRDLVEAGADIVKVGVGPGAMCTTRMMTGVGRP 314 (479) T ss_pred HHHCCCCEEEEECCCCCHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHCCCCEEECCCCCCCCEECCCCCCCCCC T ss_conf 99769989997545766489999999998408988578743202999999997399976315557832434632377886 Q ss_pred CCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEE-ECCEEEEEEECCCCHHHHH Q ss_conf 100378899986226981997265437776213367148789932301012167764487-3680588873320068875 Q gi|254780889|r 321 QLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFL-YQGRSFKSYRGMGSVAAME 399 (493) Q Consensus 321 q~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~-~~g~~~K~yrGm~S~~A~~ 399 (493) |+|||++|+++++++++|||||||||+||||+||||+|||+||||||||||+|||||+++ ++|++||.||||||++||+ T Consensus 315 q~tAi~~~a~~a~~~gvpiIADGGIr~sGdi~KAla~GA~~VMlGsllAGt~EsPGe~~~~~~Gr~yK~YRGMGS~~Am~ 394 (479) T PRK07807 315 QFSAVLECAAAARELGAHVWADGGVRHPRDVALALAAGASNVMIGSWFAGTYESPGDLMRDRDGRPYKESFGMASARAVA 394 (479) T ss_pred HHHHHHHHHHHHHHCCCCEEECCCCCCCCHHHHHHHCCCCEEEECCCCCCCCCCCCEEEEEECCEEEEEECCCCCHHHHH T ss_conf 09999999999875699789458725346799998728987888830157777996068806998978832667999996 Q ss_pred HCCH-HHHCCCCCCCCCCEECCCCEE----EEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCEEEEECHHHHCC Q ss_conf 2212-221012333433100454026----62105758998999999999855515758776897375999995534123 Q gi|254780889|r 400 RGSS-ARYSQDGVTDVLKLVPEGIEG----RVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQKKANFIRVSVAGLRE 474 (493) Q Consensus 400 ~~~~-~ry~~~~~~~~~k~v~eGv~~----~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~~~~f~~~t~ag~~E 474 (493) +++. ++|+++ ...++||||||+ .||||||+.++|+||+|||||+|+|+|++||+|||+|++|+++|+||++| T Consensus 395 ~~~~~~~~~~~---~~~~~v~EGve~~~~~~vP~kG~v~~vi~ql~gGlrs~mgy~Ga~~i~el~~ka~fv~iT~aG~~E 471 (479) T PRK07807 395 ARTAGDSAFDR---ARKALFEEGISTSRMYLDPDRPGVEDLLDHITSGVRSSCTYAGAATLAEFHERAVVGVQSAAGYAE 471 (479) T ss_pred CCCCCCCCHHH---HCCCCCCCCCCCCEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHCCEEEEECCCCCCC T ss_conf 06666431112---114644576502205558878758999999988888613376877699998578799988552556 Q ss_pred CCCC Q ss_conf 8886 Q gi|254780889|r 475 SHVH 478 (493) Q Consensus 475 ~~~H 478 (493) |||- T Consensus 472 shP~ 475 (479) T PRK07807 472 GKPL 475 (479) T ss_pred CCCC T ss_conf 7899 No 6 >pfam00478 IMPDH IMP dehydrogenase / GMP reductase domain. This family is involved in biosynthesis of guanosine nucleotide. Members of this family contain a TIM barrel structure. In the inosine monophosphate dehydrogenases 2 CBS domains pfam00571 are inserted in the TIM barrel. This family is a member of the common phosphate binding site TIM barrel family. Probab=100.00 E-value=0 Score=1241.96 Aligned_cols=463 Identities=55% Similarity=0.884 Sum_probs=449.1 Q ss_pred CCCCCHHEEECCCCCCCCCC-CEEEEEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHH Q ss_conf 64460005752764324754-22533430388160031783378523688899999977987999789898999999998 Q gi|254780889|r 12 VALTFDDVLLRPEFSNVLPR-DIDISTRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQ 90 (493) Q Consensus 12 ~~lTfDDVllvP~~s~v~~~-~vdl~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~k 90 (493) ++||||||||+|+||.+.+| +|||+|+||++|.|+|||||||||||||++|||+||++|||||||||||||+|+++|++ T Consensus 1 ~~LTfDDVlLvP~~S~~~sr~dVdl~t~lt~~i~l~iPivSA~MDTVTe~~MAiamA~~GGiGVIHrn~sie~Q~~~V~~ 80 (467) T pfam00478 1 KGLTFDDVLLLPGRSVLPSRSDVDLSTKLTRNITLNIPLVSANMDTVTESRMAIAMAREGGIGVIHKNMSIEEQAEEVRK 80 (467) T ss_pred CCCCCCCEEECCCCCCCCCCCCEEEEEECCCCCCCCCCEEECCCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHH T ss_conf 98772327883686678980126844653599505998896799985889999999988988999689999999999999 Q ss_pred HHHHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCC-CCCCCCCCCCCCCCCC Q ss_conf 742238815287899465430345656676050416786688602788723545001434467-2440026786325730 Q gi|254780889|r 91 VKKFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQ-AVGELMTRNLITVKKT 169 (493) Q Consensus 91 VKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~-~V~eiMt~~litv~~~ 169 (493) |||++++|+.||++++|+.|+.++.++|.+++++++||+|++ +.+.|+++.+|.++...... .....|+++++|++++ T Consensus 81 VK~~e~g~i~~Piti~p~~ti~~~~~l~~~~~~sg~pVv~~~-~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~lvt~~~~ 159 (467) T pfam00478 81 VKRFESGFITDPVTVSPDTTVAEALELKERYGISGVPVTEDG-KLLGGLVGIRTSRDIDFLSKVSVVMTMTEDLVTAPEG 159 (467) T ss_pred HHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEEEECCC-CEEEEEEEEECCCCCCCCCCCCEEECCCCCEEEECCC T ss_conf 974336821787536987659999999897287832682079-7468998776244334454320110135542772488 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEE Q ss_conf 35899999998655330645747997868898021000101742232656876555665137015678999873136669 Q gi|254780889|r 170 VNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLV 249 (493) Q Consensus 170 ~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDvi 249 (493) ++++||.++|.+||+++|||||++++|+|+||++|++|++.||+|++|++|||+||||||+++++.||+++|++||+|+| T Consensus 160 ~~~~ea~~~l~~~kieklpvvd~~g~L~glit~kDi~k~~~~P~a~~D~~grL~VgAAVG~~~~~~eRa~~Lv~aGvDvi 239 (467) T pfam00478 160 ITLEEANEILHEHKIEKLPIVDDDGELVGLITRKDIEKARDYPNASKDAQGRLLVGAAVGTRDDDLERAEALVEAGVDVI 239 (467) T ss_pred CCHHHHHHHHHHHHHHHCCEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCEEEEEEECCCHHHHHHHHHHHHCCCCEE T ss_conf 99899999999755641544467883788887434677420785222656777999980678659999999987699889 Q ss_pred EEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHH Q ss_conf 98248411388999999999857987245146469999999985487589961027876653100015743100378899 Q gi|254780889|r 250 VVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVV 329 (493) Q Consensus 250 viD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~ 329 (493) |||+||||+.+++++++++|+.||+++||||||+|+|++++|++||||+||||||||||||||+++|+|+||+|||++|+ T Consensus 240 vIDtAhGhs~~vi~~ik~ik~~~p~~~iIaGNVaT~e~a~~Li~aGAD~vKVGiGpGSiCTTR~v~GvG~PQ~tAv~~~a 319 (467) T pfam00478 240 VIDSAHGHSEYVLEMIKWIKKKYPDLDVIAGNVVTAEAARELIDAGADAVKVGIGPGSICTTREVAGVGRPQLTAVYEVA 319 (467) T ss_pred EEECCCCCCHHHHHHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 97344544188999999987407877378510058999999997077757755668865656420366775087999999 Q ss_pred HHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHH--CCHHHHC Q ss_conf 98622698199726543777621336714878993230101216776448736805888733200688752--2122210 Q gi|254780889|r 330 EVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMER--GSSARYS 407 (493) Q Consensus 330 ~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~--~~~~ry~ 407 (493) ++++++++|||||||||+||||+||||||||+||||||||||+||||++++++|++||.||||||.+||++ |+++||| T Consensus 320 ~~a~~~~vpiIADGGi~~sGDi~KAlaaGAd~VMlGsllAGt~EsPG~~~~~~G~~yK~yrGMgS~~Am~~~~g~~~rY~ 399 (467) T pfam00478 320 DAARKLGVPVIADGGIRYSGDIAKALAAGASAVMLGSLLAGTEESPGEYIERNGRRYKEYRGMGSLGAMEKHKGSKDRYF 399 (467) T ss_pred HHHHHCCCEEEECCCCCCHHHHHHHHHCCCCEEEECHHHCCCCCCCCEEEEECCEEEEEEECCCCHHHHHHCCCCCHHHH T ss_conf 99865698799447623304899998728988987722257777995079589989898745687999974257601122 Q ss_pred CCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCEEEEECHHHHCCCCCC Q ss_conf 12333433100454026621057589989999999998555157587768973759999955341238886 Q gi|254780889|r 408 QDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQKKANFIRVSVAGLRESHVH 478 (493) Q Consensus 408 ~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~~~~f~~~t~ag~~E~~~H 478 (493) |+. ..|+||||+|+.||||||++++|+||+|||||+|+|+||+||+|||+|++|+|+|+||++||||| T Consensus 400 ~~~---~~~~v~eGveg~vpykG~v~~~i~~l~gGlrs~m~Y~Ga~~i~el~~~a~fvrit~ag~~Es~~H 467 (467) T pfam00478 400 QAG---DKKGVPEGVEGRVPYKGSVKDTIHQLLGGLRSSMGYVGAASLKELREKAFFVRVTSAGLREGHVH 467 (467) T ss_pred CCC---CCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHCCEEEEECCCCCCCCCCC T ss_conf 134---53557885189735777689999999888766221768776999975898999982313347799 No 7 >KOG2550 consensus Probab=100.00 E-value=0 Score=1049.97 Aligned_cols=469 Identities=37% Similarity=0.636 Sum_probs=456.9 Q ss_pred CCCCCCHHEEECCCCCCCCCCCEEEEEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHH Q ss_conf 76446000575276432475422533430388160031783378523688899999977987999789898999999998 Q gi|254780889|r 11 GVALTFDDVLLRPEFSNVLPRDIDISTRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQ 90 (493) Q Consensus 11 ~~~lTfDDVllvP~~s~v~~~~vdl~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~k 90 (493) +.+|||||||++|+|++|.+++|||+|+|||+|+|++|+||||||||||++|||+||++||||+||+||+||+||.+|++ T Consensus 27 ~~~LtynDfliLPg~idF~s~eVsL~t~ltr~itl~tP~vsSpMDTVtes~MAiaMAl~ggIg~IHhNctpe~QA~~v~~ 106 (503) T KOG2550 27 KIGLTYNDFLILPGFIDFASDEVSLQTKLTRNITLNTPLVSSPMDTVTESEMAIAMALLGGIGFIHHNCTPEDQADMVRR 106 (503) T ss_pred CCCCCCCCEEECCCCCCCCCCCCCCCHHHHHCCCCCCCEECCCCCCCCHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHH T ss_conf 45765243463255345543312000023215533574012677531267899999863782156437887887788888 Q ss_pred HHHHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCC--EEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 7422388152878994654303456566760504167866886--02788723545001434467244002678632573 Q gi|254780889|r 91 VKKFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDV--GKLVGILTNRDVRFASNAQQAVGELMTRNLITVKK 168 (493) Q Consensus 91 VKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~--gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~ 168 (493) ||+|+++++.||+.++|+.|+.++++...+++|+++||++++. +||+||||.||+.|..+..+++.++|++++++.+. T Consensus 107 vK~~~~g~~~dp~v~sp~~tvg~v~~~k~~~gF~g~pvTe~g~~~~KLvG~vtsrdi~f~~~~~~~~~~vmt~~~~~~~~ 186 (503) T KOG2550 107 VKNYENGFINNPIVISPTTTVGEVKEAKEKHGFSGIPVTEDGKRGSKLVGIITSRDIQFLEDNSLLVSDVMTKNPVTGAQ 186 (503) T ss_pred HHHHHCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHCCCCHHHHHHCCCCCCCCC T ss_conf 88752254568620377543105565206566335421157754651577774202345534551124330344301444 Q ss_pred CCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCE Q ss_conf 03589999999865533064574799786889802100010174223265687655566513701567899987313666 Q gi|254780889|r 169 TVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDL 248 (493) Q Consensus 169 ~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDv 248 (493) +++++++.+++.++|..+|||||++|+|+.+++++|++|++.||.|++|++++|+||||||+++++++|...|++||+|+ T Consensus 187 gi~l~~~neiL~~~kkGkl~iv~~~gelva~~~rtDl~k~~~yPlask~~~kqll~gAaiGTre~dK~rl~ll~~aGvdv 266 (503) T KOG2550 187 GITLKEANEILKKIKKGKLPVVDDKGELVAMLSRTDLMKNRDYPLASKDSTKQLLCGAAIGTRDDDKERLDLLVQAGVDV 266 (503) T ss_pred CCCHHHHHHHHHHHHCCCCCEECCCCCEEEEEEHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHCCCCE T ss_conf 46677889998763148652343677623343334566502787555675413565101366630167788866348868 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHH Q ss_conf 99824841138899999999985798724514646999999998548758996102787665310001574310037889 Q gi|254780889|r 249 VVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSV 328 (493) Q Consensus 249 iviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~ 328 (493) +|+|+++|+|.+++++||++|+.||+++||+|||+|.++++.||++|||++|||||+|||||||+|+++|+||.||||+| T Consensus 267 viLDSSqGnS~~qiemik~iK~~yP~l~ViaGNVVT~~qa~nLI~aGaDgLrVGMGsGSiCiTqevma~GrpQ~TAVy~v 346 (503) T KOG2550 267 VILDSSQGNSIYQLEMIKYIKETYPDLQIIAGNVVTKEQAANLIAAGADGLRVGMGSGSICITQKVMACGRPQGTAVYKV 346 (503) T ss_pred EEEECCCCCCHHHHHHHHHHHHHCCCCEEECCCEEEHHHHHHHHHCCCCEEEECCCCCCEEEECEEEECCCCCCCCHHHH T ss_conf 99966888504579999999866888634316553388899998736760575255675054530123267762003269 Q ss_pred HHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHHHHCC Q ss_conf 99862269819972654377762133671487899323010121677644873680588873320068875221222101 Q gi|254780889|r 329 VEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSARYSQ 408 (493) Q Consensus 329 ~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ry~~ 408 (493) ++.|+.+++||||||||+++|||+|||++||++||||+|||||+|+|||+++.+|+++|.||||||+.||+++|..|||. T Consensus 347 a~~A~q~gvpviADGGi~~~Ghi~KAl~lGAstVMmG~lLAgtTEapGeyf~~~g~r~KkyrGMGSl~AM~~~s~~rY~~ 426 (503) T KOG2550 347 AEFANQFGVPCIADGGIQNVGHVVKALGLGASTVMMGGLLAGTTEAPGEYFFRDGVRLKKYRGMGSLDAMESSSQKRYFS 426 (503) T ss_pred HHHHHHCCCCEECCCCCCCCHHHHHHHHCCCHHHHCCCHHCCCCCCCCCEEEECCEEEEEECCCCHHHHHHHHHHHCCCC T ss_conf 99997649965506875873177888753850631041101023588614742473432011765588775120110036 Q ss_pred CCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHC-----CEEEEECHHHHCCCCCCCEEE Q ss_conf 23334331004540266210575899899999999985551575877689737-----599999553412388863489 Q gi|254780889|r 409 DGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQKK-----ANFIRVSVAGLRESHVHDVKI 482 (493) Q Consensus 409 ~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~~-----~~f~~~t~ag~~E~~~Hdi~~ 482 (493) +.. .. .++|||++.|+||||+..+++++.+|||++|+|.|+++|.+|+++ .+|..+|.++|.|+++|+++. T Consensus 427 e~d--kv-kiAQGVsg~v~dKGsv~kfipyl~~giqh~cqdiGa~sL~~l~~~~~~~~vrfe~rt~~Aq~Eggvh~l~S 502 (503) T KOG2550 427 EVD--KV-KIAQGVSGSVQDKGSVQKFIPYLLAGIQHSCQDIGARSLKELREMMYSGEVRFEKRTMSAQIEGGVHGLHS 502 (503) T ss_pred CCC--EE-EECCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEEEECCCCEECCCCCCCCC T ss_conf 654--47-64158279863475255537899988876533650777999998754565899750332211157667777 No 8 >PRK06843 inositol-5-monophosphate dehydrogenase; Validated Probab=100.00 E-value=0 Score=1020.50 Aligned_cols=401 Identities=48% Similarity=0.815 Sum_probs=366.0 Q ss_pred CCCCCCCCCCCCHHEEECCCCCCCCCCCEEEEEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCCHHHH Q ss_conf 16676676446000575276432475422533430388160031783378523688899999977987999789898999 Q gi|254780889|r 5 IENNVGGVALTFDDVLLRPEFSNVLPRDIDISTRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQ 84 (493) Q Consensus 5 ~~~~~~~~~lTfDDVllvP~~s~v~~~~vdl~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~sie~q 84 (493) |.+++-.++||||||+|+|+||++.|++|||+|+|++|+.|+|||||||||||||++|||+||++||||||||||+||+| T Consensus 1 ~~~k~~r~altfDDVlLvP~~stv~p~dVdlst~lt~~i~l~iPivSs~MDTVte~~mAiama~~GGlGViHrn~~ie~q 80 (404) T PRK06843 1 MTNKIIKEALTFDDVSLIPRKSSILPSEVSLKTQLTKNISLNIPFLSSAMDTVTESQMAIAIAKEGGIGIIHKNMSIEAQ 80 (404) T ss_pred CCCCCCCCCCCCCCEEECCCCCCCCHHHCEEEEEEECCEECCCCEEECCCCCCCCHHHHHHHHHCCCEEEECCCCCHHHH T ss_conf 98541033688320798588887776670676897288165998784688777889999999988988999188999999 Q ss_pred HHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCC Q ss_conf 99999874223881528789946543034565667605041678668860278872354500143446724400267863 Q gi|254780889|r 85 VAQVHQVKKFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLI 164 (493) Q Consensus 85 a~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~li 164 (493) +++|++||+|+...+ +... +|.. ....++++ T Consensus 81 ~~~v~~Vk~~~~~~~---~~~~------------------------------------~d~~------~~~~~~~~---- 111 (404) T PRK06843 81 KKEIEKVKTYKFQKT---INTN------------------------------------KDTN------EQKTKMLT---- 111 (404) T ss_pred HHHHHHHHHCCCCCE---EECC------------------------------------CCCC------HHHHHHHH---- T ss_conf 999988741124641---2026------------------------------------5432------02456665---- Q ss_pred CCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHC Q ss_conf 25730358999999986553306457479978688980210001017422326568765556651370156789998731 Q gi|254780889|r 165 TVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDV 244 (493) Q Consensus 165 tv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~Lvea 244 (493) +...+.+.+ ..+|+.+.+.||++++|.++||+||||||+..++.+|+.+|++| T Consensus 112 ----------~~~~l~~~~-----------------~~~~~~~~~~~p~a~~d~~~rl~VgAAVg~~~d~~era~~Lv~A 164 (404) T PRK06843 112 ----------AKQHLEESK-----------------IYKNAEHKEDFPNACKDLNSKLRVGAAVSIDIDTIERVEELVKA 164 (404) T ss_pred ----------HHHHHHHHH-----------------HHHHHHHHHHHHHHHHHHHCCEEEEEEECCCHHHHHHHHHHHHC T ss_conf ----------876522324-----------------44416766441344554324676899954685289999999976 Q ss_pred CCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHH Q ss_conf 36669982484113889999999998579872451464699999999854875899610278766531000157431003 Q gi|254780889|r 245 NVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSA 324 (493) Q Consensus 245 GvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~a 324 (493) |+|+||||+||||+.+++++++++|+.||+++||+|||+|++++++|+++|||+|||||||||+||||.++|+|+||+|| T Consensus 165 GvD~lvID~AhGhs~~~~e~ik~ik~~~p~v~VIaGNVaT~~~a~~Li~aGAD~VkVGiGpGsiCTTr~v~GvGvPq~tA 244 (404) T PRK06843 165 HVDILVIDSAHGHSTRIIELVKTIKNKYPNLDLIAGNIVTKEAALDLINVGADCLKVGIGPGSICTTRIVAGVGVPQITA 244 (404) T ss_pred CCCEEEEECCCCCHHHHHHHHHHHHHHCCCCCEEECCCCCHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCCHHHH T ss_conf 99999996887521789999999997679961663030579999999981989999565478772566545868748999 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHH Q ss_conf 78899986226981997265437776213367148789932301012167764487368058887332006887522122 Q gi|254780889|r 325 IMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSA 404 (493) Q Consensus 325 v~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ 404 (493) |++|+++++++++|||||||||+||||+||||||||+|||||+||||+||||++++++|++||.||||||++||++|+++ T Consensus 245 i~d~~~~a~~~~v~IIADGGI~~sGDi~KAlA~GAdaVMlGs~lAgt~EaPG~~~~~~G~~~K~yrGMgS~~Am~~g~~~ 324 (404) T PRK06843 245 ICDVYEVCKNTNICIIADGGIRFSGDVVKAIAAGADSVMIGNLFAGTKESPSEEIIYNGKKFKSYVGMGSISAMKRGSKS 324 (404) T ss_pred HHHHHHHHHCCCCCEEECCCCCCCCHHHHHHHCCCCEEEECCHHCCCCCCCCCEEEECCEEEEEEECCCCHHHHHCCCCC T ss_conf 99999996057997883687465327999997189888867131367669973785789487887347539998644710 Q ss_pred HHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCEEEEECHHHHCCCCCCCEE Q ss_conf 21012333433100454026621057589989999999998555157587768973759999955341238886348 Q gi|254780889|r 405 RYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQKKANFIRVSVAGLRESHVHDVK 481 (493) Q Consensus 405 ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~~~~f~~~t~ag~~E~~~Hdi~ 481 (493) ||++.+.....+++|||||+.||||||+.++|+||+|||||+|+|+|++||+|||+|++|+++|+||++|||||||. T Consensus 325 ry~~~~~~~~~~~~~eGv~~~vp~~G~v~~~~~~l~gglrs~m~y~Ga~~i~el~~~~~fv~vt~ag~~Es~~Hdv~ 401 (404) T PRK06843 325 RYFQLENNEPKKLVPEGIEGMVPYSGKLKDILTQLKGGLMSGMGYLGAATISDLKINSKFVKISHSSLKESHPHDVF 401 (404) T ss_pred CCCCCCCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHCCEEEEECCCCCCCCCCCCCC T ss_conf 00233456433546785679617888889999999989987062857775999974998999980003125788776 No 9 >cd00381 IMPDH IMPDH: The catalytic domain of the inosine monophosphate dehydrogenase. IMPDH catalyzes the NAD-dependent oxidation of inosine 5'-monophosphate (IMP) to xanthosine 5' monophosphate (XMP). It is a rate-limiting step in the de novo synthesis of the guanine nucleotides. There is often a CBS domain inserted in the middle of this domain, which is proposed to play a regulatory role. IMPDH is a key enzyme in the regulation of cell proliferation and differentiation. It has been identified as an attractive target for developing chemotherapeutic agents. Probab=100.00 E-value=0 Score=835.16 Aligned_cols=325 Identities=57% Similarity=0.932 Sum_probs=314.2 Q ss_pred CCCCHHEEECCCCCCCCCCCEEEEEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHH Q ss_conf 44600057527643247542253343038816003178337852368889999997798799978989899999999874 Q gi|254780889|r 13 ALTFDDVLLRPEFSNVLPRDIDISTRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQVK 92 (493) Q Consensus 13 ~lTfDDVllvP~~s~v~~~~vdl~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~kVK 92 (493) +||||||||+|+||++.|++|||+|++++++.|++||+||||||||+.+||++|+++|||||||||+++|+|+++++++| T Consensus 1 gl~fdDVllvP~~s~~~r~~Vdl~~~~~~~~~l~iPIissnMDtV~~~~mA~~la~~Gglgvlhr~~~~e~~~~~v~~vk 80 (325) T cd00381 1 GLTFDDVLLVPGYSTVLPSEVDLSTKLTKNITLNIPLVSAPMDTVTESEMAIAMARLGGIGVIHRNMSIEEQAEEVRKVK 80 (325) T ss_pred CCCCCCEEEECCCCCCCHHHCEEEEEECCCCCCCCCEEECCCCCCCCHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHH T ss_conf 98831178807888788889267688418814489888678887588999999997799689943588899999999750 Q ss_pred HHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 22388152878994654303456566760504167866886027887235450014344672440026786325730358 Q gi|254780889|r 93 KFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNL 172 (493) Q Consensus 93 r~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l 172 (493) T Consensus 81 -------------------------------------------------------------------------------- 80 (325) T cd00381 81 -------------------------------------------------------------------------------- 80 (325) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred HHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEE Q ss_conf 99999998655330645747997868898021000101742232656876555665137015678999873136669982 Q gi|254780889|r 173 ENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVD 252 (493) Q Consensus 173 ~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviviD 252 (493) ++++|+||||+++++.+|+.+|+++|+|+|||| T Consensus 81 -----------------------------------------------~~~~v~aaig~~~~~~~r~~~l~~ag~d~i~ID 113 (325) T cd00381 81 -----------------------------------------------GRLLVGAAVGTREDDKERAEALVEAGVDVIVID 113 (325) T ss_pred -----------------------------------------------CCEEEEEEECCCHHHHHHHHHHHHCCCCEEEEE T ss_conf -----------------------------------------------476999997668628999999997699899987 Q ss_pred CCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHH Q ss_conf 48411388999999999857987245146469999999985487589961027876653100015743100378899986 Q gi|254780889|r 253 TAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVA 332 (493) Q Consensus 253 ~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~ 332 (493) +||||+.+++++++++|+.||+++||||||+|+|++++|++||||+|||||||||+||||.+||+|+||+|||+||++++ T Consensus 114 vAhG~~~~~~~~ik~ir~~~p~~~IiaGNV~T~e~a~~L~~~GaD~vkVGiG~GS~CtTr~~tGvG~Pq~sai~~~a~~~ 193 (325) T cd00381 114 SAHGHSVYVIEMIKFIKKKYPNVDVIAGNVVTAEAARDLIDAGADGVKVGIGPGSICTTRIVTGVGVPQATAVADVAAAA 193 (325) T ss_pred CHHCCCHHHHHHHHHHHHHCCCCCEEECCCCCHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 00034588999999999768997568645668999999986699899975757777666010178874588999999976 Q ss_pred HCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHHHHCCCCCC Q ss_conf 22698199726543777621336714878993230101216776448736805888733200688752212221012333 Q gi|254780889|r 333 ERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSARYSQDGVT 412 (493) Q Consensus 333 ~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ry~~~~~~ 412 (493) +++++|||||||||++|||+||||+|||+|||||+||||+||||++++++|++||.||||+|.+||++++.+||+.+. T Consensus 194 ~~~~v~iiaDGGi~~~Gdi~KAla~GAd~VMlG~~lAg~~EsPG~~~~~~g~~~k~y~Gm~S~~a~~~~~~~~~~~~~-- 271 (325) T cd00381 194 RDYGVPVIADGGIRTSGDIVKALAAGADAVMLGSLLAGTDESPGEYIEINGKRYKEYRGMGSLGAMKKGGGDRYFGEE-- 271 (325) T ss_pred HCCCCCEEECCCCCCHHHHHHHHHCCCCEEEECCHHCCCCCCCCCEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCC-- T ss_conf 344985894487331078888875288789846210466668961587638278899787654331657765434555-- Q ss_pred CCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCEEEEE Q ss_conf 4331004540266210575899899999999985551575877689737599999 Q gi|254780889|r 413 DVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQKKANFIRV 467 (493) Q Consensus 413 ~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~~~~f~~~ 467 (493) ..+.++||+|+.||||||++++++||.|||||+|+|+|++||+|||++++|+|+ T Consensus 272 -~~~~~~eG~~~~v~~~g~v~~~~~~l~gglrs~m~y~Ga~~l~e~~~~~~fv~v 325 (325) T cd00381 272 -AKKLVPEGVEGIVPYKGSVKDVLPQLVGGLRSSMGYCGAKSLKELQEKARFVRI 325 (325) T ss_pred -CCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHCCEEEEC T ss_conf -510148963899867886788999999999898732586739999757879969 No 10 >PRK05096 guanosine 5'-monophosphate oxidoreductase; Provisional Probab=100.00 E-value=0 Score=782.83 Aligned_cols=336 Identities=36% Similarity=0.610 Sum_probs=307.0 Q ss_pred CCCCCCCCCCCCCCCHHEEECCCCCCCCCC-CEEEEEEEC----CCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEEC Q ss_conf 210166766764460005752764324754-225334303----881600317833785236888999999779879997 Q gi|254780889|r 2 ARIIENNVGGVALTFDDVLLRPEFSNVLPR-DIDISTRIA----KDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIH 76 (493) Q Consensus 2 ~~~~~~~~~~~~lTfDDVllvP~~s~v~~~-~vdl~t~lt----~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIH 76 (493) ||+.++. .||||||||+|+||++.|| ||||+++|+ ++..+.|||+||||||||+.+||++|+++||+++|| T Consensus 1 M~I~~~i----kLdFdDVLLvPkrS~i~SRseV~l~~~~~f~~s~~~~~gIPIIaAnMDTV~~~~MA~~L~k~Ggl~vLH 76 (347) T PRK05096 1 MRIEEDL----KLGFKDVLIRPKRSTLKSRSEVELERQFTFKHSGQSWSGVPIIAANMDTVGTFEMAKALASFDILTAVH 76 (347) T ss_pred CCCCCCC----CCCCCCEEEECCCCCCCCHHHEEEEEEEEECCCCCCCCCCCEEECCCCCCCCHHHHHHHHHCCCEEEEE T ss_conf 9455677----688733688567677778302366788853357774468857967888728589999999879858984 Q ss_pred CCCCHHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCC Q ss_conf 89898999999998742238815287899465430345656676050416786688602788723545001434467244 Q gi|254780889|r 77 RNFSPSEQVAQVHQVKKFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVG 156 (493) Q Consensus 77 rn~sie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~ 156 (493) ||+++|+|++++++++. +..+ T Consensus 77 R~~~~ee~~~~~~~~~~-------------------------------------~~~~---------------------- 97 (347) T PRK05096 77 KHYSVEEWAAFINNSSA-------------------------------------DVLK---------------------- 97 (347) T ss_pred CCCCHHHHHHHHHHCCC-------------------------------------CCCC---------------------- T ss_conf 37989999999852143-------------------------------------4467---------------------- Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHH Q ss_conf 00267863257303589999999865533064574799786889802100010174223265687655566513701567 Q gi|254780889|r 157 ELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIAD 236 (493) Q Consensus 157 eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~e 236 (493) .+.+++|+.+++.+ T Consensus 98 ------------------------------------------------------------------~v~vsiGi~~~d~~ 111 (347) T PRK05096 98 ------------------------------------------------------------------HVMVSTGTSDADFE 111 (347) T ss_pred ------------------------------------------------------------------EEEEEEECCHHHHH T ss_conf ------------------------------------------------------------------38999917878999 Q ss_pred HHHHHHH--CCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHH Q ss_conf 8999873--13666998248411388999999999857987245146469999999985487589961027876653100 Q gi|254780889|r 237 RVGPLFD--VNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVV 314 (493) Q Consensus 237 Ra~~Lve--aGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~ 314 (493) |+.++++ +++|+||||+||||+.+++++++++|+.+|+.+||||||+|+|++++|++||||+|||||||||+||||.+ T Consensus 112 r~~~i~~~~~~~~~i~iDvA~G~~~~~~~~i~~ik~~~~~~~iiaGNvaT~e~~~~L~~~GaD~vkVGIG~Gs~CtTR~~ 191 (347) T PRK05096 112 KTKQILALSPALNFICIDVANGYSEHFVQFVAKAREAWPDKTICAGNVVTGEMVEELILSGADIVKVGIGPGSVCTTRVK 191 (347) T ss_pred HHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCEEECCCHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCC T ss_conf 99999952899898999779862088999999999878998088143123999999997378899976778754304522 Q ss_pred CCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCC Q ss_conf 01574310037889998622698199726543777621336714878993230101216776448736805888733200 Q gi|254780889|r 315 TGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGS 394 (493) Q Consensus 315 ~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S 394 (493) ||+|+||+|||+||+++++.+++||||||||+++|||+|||+||||+||+|||||||+||||+++..+|++||.|||||| T Consensus 192 tGvG~Pq~sai~~c~~~~~~~~~~iiaDGGi~~~gDi~KAla~GAd~VM~G~~lAg~~EspG~~~~~~g~~~k~y~Gm~S 271 (347) T PRK05096 192 TGVGYPQLSAVIECADAAHGLGGMIVSDGGCTVPGDVAKAFGGGADFVMLGGMLAGHEESGGEVVEENGEKFMLFYGMSS 271 (347) T ss_pred CCCCCHHHHHHHHHHHHHHCCCCCEEECCCCCCHHHHHHHHHCCCCEEEECCCCCCCCCCCCEEEEECCEEEEEEECCCH T ss_conf 35673037899999998605799489568847504799998738988986731037777996189589979999957671 Q ss_pred HHHHHH--CCHHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCEEEEECHHHH Q ss_conf 688752--212221012333433100454026621057589989999999998555157587768973759999955341 Q gi|254780889|r 395 VAAMER--GSSARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQKKANFIRVSVAGL 472 (493) Q Consensus 395 ~~A~~~--~~~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~~~~f~~~t~ag~ 472 (493) .+||++ ++.++| .+|||+|+.||||||++++|.||.|||||+|+|+||++|+|||+|++|+|+|. T Consensus 272 ~~a~~~~~g~~~~~----------~~~EG~~~~vp~kG~v~~~l~~l~gglrS~m~Y~Ga~~l~el~~~a~fv~vt~--- 338 (347) T PRK05096 272 ESAMDRHVGGVAEY----------RAAEGKTVKLPLRGPVENTARDILGGLRSACTYVGASRLKELTKRTTFIRVQE--- 338 (347) T ss_pred HHHHHHHCCCCCCC----------CCCCCEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHCCEEEEECC--- T ss_conf 98887643887754----------26885089972788989999999989877332758775999975898999913--- Q ss_pred CCCCCCC Q ss_conf 2388863 Q gi|254780889|r 473 RESHVHD 479 (493) Q Consensus 473 ~E~~~Hd 479 (493) .++++|+ T Consensus 339 q~n~~~~ 345 (347) T PRK05096 339 QENRVFN 345 (347) T ss_pred CHHCCCC T ss_conf 0000346 No 11 >PRK08649 inositol-5-monophosphate dehydrogenase; Validated Probab=100.00 E-value=0 Score=766.27 Aligned_cols=334 Identities=31% Similarity=0.487 Sum_probs=276.0 Q ss_pred CCCCCCCCCCCCCCHHEEECCCCCCCCCCCEEEEEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEEC------ Q ss_conf 10166766764460005752764324754225334303881600317833785236888999999779879997------ Q gi|254780889|r 3 RIIENNVGGVALTFDDVLLRPEFSNVLPRDIDISTRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIH------ 76 (493) Q Consensus 3 ~~~~~~~~~~~lTfDDVllvP~~s~v~~~~vdl~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIH------ 76 (493) .+..+++...+||||||||+|+||+++|++||++|+|+ +|+|||||+|||||||||++|||+|||+||||||| T Consensus 4 ~~~~~k~~~~altFDDVLLvP~~s~v~p~dvd~st~l~-~i~L~iPivSAaMDTVTE~~MAIamA~~GGiGVIH~~~i~~ 82 (368) T PRK08649 4 EIGRGKTARRAYGLDEIAIVPSRRTRDPEDVSTAWQID-AYRFEIPIIASPMDAVVSPETAIELGRLGGLGVLNLEGLWT 82 (368) T ss_pred EECCCCEECCCCCCCCEEECCCCCCCCHHHCEEEEEEC-CEEECCCEECCCCCCCCCHHHHHHHHHCCCCEEEECCCCEE T ss_conf 50488143002672537975787755875662218876-88878857768876667899999999879968993221231 Q ss_pred CCCCHHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCC Q ss_conf 89898999999998742238815287899465430345656676050416786688602788723545001434467244 Q gi|254780889|r 77 RNFSPSEQVAQVHQVKKFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVG 156 (493) Q Consensus 77 rn~sie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~ 156 (493) |+.+++++.+++..+++. ++..++.+ +...|+.+ T Consensus 83 R~~~~~~~~~~i~~~~~~------------------~~~~~~~~--i~~~pi~~-------------------------- 116 (368) T PRK08649 83 RYEDPEPVLDEIASVGKD------------------AATRLMQE--LYAEPIKP-------------------------- 116 (368) T ss_pred ECCCHHHHHHHHHCCCHH------------------HHHHHHHH--HHHCCCCH-------------------------- T ss_conf 028878888988703288------------------88999999--87546728-------------------------- Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHH Q ss_conf 00267863257303589999999865533064574799786889802100010174223265687655566513701567 Q gi|254780889|r 157 ELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIAD 236 (493) Q Consensus 157 eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~e 236 (493) + +..+++..+ +| ++ .++|+..+..+..+ T Consensus 117 -----~----------------li~~ri~~~----------------------------k~-~g--~~~a~~~~~~~~~~ 144 (368) T PRK08649 117 -----E----------------LIGKRIAEI----------------------------RD-AG--VIAAVSLSPQNAQK 144 (368) T ss_pred -----H----------------HHHHHHHHH----------------------------HH-CC--CEEEEEECCHHHHH T ss_conf -----8----------------999899987----------------------------64-28--57999962463899 Q ss_pred HHHHHHHCCCCEEEE-----ECCCC-HHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCC Q ss_conf 899987313666998-----24841-138899999999985798724514646999999998548758996102787665 Q gi|254780889|r 237 RVGPLFDVNVDLVVV-----DTAHG-HSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICT 310 (493) Q Consensus 237 Ra~~LveaGvDvivi-----D~ahG-h~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~Ct 310 (493) |+++|++||+|+||| |++|+ |+..++++.+++++. +++||+|||+|+|++++|++||||+|||||||||||| T Consensus 145 ~~~~Lv~aGvDvlvId~~vvd~aH~~~~~~~l~~~~~~~~~--~v~vIaGNVaT~e~a~~Li~aGADaVKVGIGpGSICT 222 (368) T PRK08649 145 LGPTVVEAGADLFVIQGTVVSAEHVSEGGEPLNLKEFIYEL--DVPVVVGGCVTYTTALHLMRTGAAGVLVGIGPGAACT 222 (368) T ss_pred HHHHHHHCCCCEEEEEEEEEEEHHHCCCCHHHHHHHHHCCC--CCCEEEECCCCHHHHHHHHHCCCCEEEECCCCCCCCC T ss_conf 99999974998899841475402220320356566431237--9878973446999999999779989994566887756 Q ss_pred CCHHCCCCCCCCHHHHHHHHHHHCCC-------CEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECC Q ss_conf 31000157431003788999862269-------81997265437776213367148789932301012167764487368 Q gi|254780889|r 311 TRVVTGVGCPQLSAIMSVVEVAERAG-------VAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQG 383 (493) Q Consensus 311 Tr~~~g~g~pq~~av~~~~~~~~~~~-------~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g 383 (493) ||+++|+|+||+|||++|++++++|. +|||||||||+||||+||||||||+||||||||||+||||+. T Consensus 223 TRvVaGvGvPQltAI~~~a~a~~~y~~~~~g~~VpvIADGGIr~sGDi~KAlaaGA~~VMlGsllAGt~EsPG~~----- 297 (368) T PRK08649 223 SRGVLGIGVPMATAIADCAAARRDYLDETGGRYVHVIADGGIVTSGDICKAIACGADAVMLGSPLARAAEAPGRG----- 297 (368) T ss_pred CCCEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCHHHHHHHCCCCEEEECCCCCCCCCCCCCC----- T ss_conf 634012572169999999999886556526854648956885864189999872899898773104766689876----- Q ss_pred EEEEEEECCCCHHHHHHCCHHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHH----------HHHHHHHHHHHHCCCC Q ss_conf 058887332006887522122210123334331004540266210575899899----------9999999855515758 Q gi|254780889|r 384 RSFKSYRGMGSVAAMERGSSARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLH----------QMSGGLKSSMGYVGAS 453 (493) Q Consensus 384 ~~~K~yrGm~S~~A~~~~~~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~----------~~~~glrs~~~y~G~~ 453 (493) .||||+|.. +.+|||+|++|||||+++++++ ||+|||||+|+|+||+ T Consensus 298 ----~~~gm~~~~-------------------~~~peG~~~~v~~~g~~~~vi~g~~~~~d~v~qlvGGLrs~MgY~Ga~ 354 (368) T PRK08649 298 ----FHWGMAAPH-------------------PVLPRGTRIQVGTTGSLEQILFGPADLPDGTHNLLGALRRSMATLGYS 354 (368) T ss_pred ----CCCCCCCCC-------------------CCCCCCEEEECCCCCCHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCC T ss_conf ----443334677-------------------668996388557665065612477767533888677998863023747 Q ss_pred CHHHHHHCCEEEE Q ss_conf 7768973759999 Q gi|254780889|r 454 NIEEFQKKANFIR 466 (493) Q Consensus 454 ~i~e~~~~~~f~~ 466 (493) ||+||| |++|+. T Consensus 355 ~i~elq-ka~~v~ 366 (368) T PRK08649 355 DLKEFQ-KVEVVV 366 (368) T ss_pred CHHHHH-CCCEEE T ss_conf 088873-323694 No 12 >PRK05458 guanosine 5'-monophosphate oxidoreductase; Provisional Probab=100.00 E-value=0 Score=737.27 Aligned_cols=318 Identities=35% Similarity=0.581 Sum_probs=292.3 Q ss_pred CCCCCHHEEECCCCCCCCCC-CEEEEEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHH Q ss_conf 64460005752764324754-22533430388160031783378523688899999977987999789898999999998 Q gi|254780889|r 12 VALTFDDVLLRPEFSNVLPR-DIDISTRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQ 90 (493) Q Consensus 12 ~~lTfDDVllvP~~s~v~~~-~vdl~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~k 90 (493) ++|+||||||+|+||++.|| +||++++|+ +..+++||+|||||||++.+||++|+++||+|+|||+ .+|+|++.+++ T Consensus 3 ~aldFdDVLLvPkrS~i~SRseVd~s~~~~-~~~~~iPIiaAnMDTV~~~~mA~~l~k~GglgvLHR~-~~e~~~~~~~~ 80 (326) T PRK05458 3 KVFDYEDIQLIPNKCIVNSRSECDTSVTLG-PRTFKLPVVPANMQTIIDEKIAEWLAENGYFYIMHRF-DPEARIPFIKD 80 (326) T ss_pred CCCCCCCEEEECCCCCCCCHHHCEEEEEEC-CCCCCCCEEECCCCCCCCHHHHHHHHHCCCEEEEEEC-CHHHHHHHHHH T ss_conf 657866589856878778878843679856-8522788885898974888999999978986899855-88999999985 Q ss_pred HHHHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 74223881528789946543034565667605041678668860278872354500143446724400267863257303 Q gi|254780889|r 91 VKKFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTV 170 (493) Q Consensus 91 VKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~ 170 (493) ++. . T Consensus 81 ~~~--------------------------------------~-------------------------------------- 84 (326) T PRK05458 81 MHE--------------------------------------R-------------------------------------- 84 (326) T ss_pred CCC--------------------------------------C-------------------------------------- T ss_conf 232--------------------------------------3-------------------------------------- Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCC--CE Q ss_conf 5899999998655330645747997868898021000101742232656876555665137015678999873136--66 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNV--DL 248 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGv--Dv 248 (493) .|.+++++|+.+++.+|+.+|+++++ |+ T Consensus 85 --------------------------------------------------~~~~~iSvGi~~~~~~~i~~l~~~~~~~~~ 114 (326) T PRK05458 85 --------------------------------------------------GLIASISVGVKDDEYDFIDQLAAEGLTPEY 114 (326) T ss_pred --------------------------------------------------CCEEEEEECCCHHHHHHHHHHHHCCCCCCE T ss_conf --------------------------------------------------727999947998999999999856999777 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCC--CCHHHH Q ss_conf 998248411388999999999857987245146469999999985487589961027876653100015743--100378 Q gi|254780889|r 249 VVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCP--QLSAIM 326 (493) Q Consensus 249 iviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~p--q~~av~ 326 (493) ||||+||||+..++++++++|+.||+++||+|||+|+|++++|++||||+|||||||||+||||.+||+|+| |+||++ T Consensus 115 i~iDvAhG~~~~~~~~i~~ik~~~~~~~iiaGNVaT~e~~~~L~~~Gad~VkVGIG~Gs~CTTR~~tGvG~p~~q~sai~ 194 (326) T PRK05458 115 ITIDIAHGHSDSVINMIKHIKKHLPETFVIAGNVGTPEAVRELENAGADATKVGIGPGKVCITKIKTGFGTGGWQLAALR 194 (326) T ss_pred EEEECCCCCHHHHHHHHHHHHHHCCCCCEEECCCCCHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCHHHHHHHH T ss_conf 99980564428999999999987899839965431899999999749999996777987520350135477589999999 Q ss_pred HHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHHHH Q ss_conf 89998622698199726543777621336714878993230101216776448736805888733200688752212221 Q gi|254780889|r 327 SVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSARY 406 (493) Q Consensus 327 ~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ry 406 (493) +|+++++ .|||||||||++|||+||||+|||+||+||+||||+||||++++.+|++||.||||+|.. .+++ T Consensus 195 ~ca~~~~---~~iiaDGGi~~~GDi~KAla~GAd~VM~G~~~Ag~~EspG~~~~~~g~~~k~y~Gmas~~--~~~~---- 265 (326) T PRK05458 195 WCAKAAR---KPIIADGGIRTHGDIAKSIRFGATMVMIGSLFAGHEESPGETVEIDGKLYKEYFGSASEF--QKGE---- 265 (326) T ss_pred HHHHHHC---CCEEEECCCCCCCHHHHHHHCCCCEEEECCHHCCCCCCCCCEEEECCEEEEEEECCCCHH--HCCC---- T ss_conf 9999727---977973685874789999864898898671223777799717978997999987467753--4788---- Q ss_pred CCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCEEEEECHHHHCCCC Q ss_conf 0123334331004540266210575899899999999985551575877689737599999553412388 Q gi|254780889|r 407 SQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQKKANFIRVSVAGLRESH 476 (493) Q Consensus 407 ~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~~~~f~~~t~ag~~E~~ 476 (493) +..+||+|+.||||||++++++||+|||||+|+|+|+++|+||| |++|+++|++++.|+. T Consensus 266 ---------~~~~EG~~~~vp~kG~v~~~~~~l~gglrS~m~Y~Ga~~i~el~-k~~~v~~~n~~~n~~~ 325 (326) T PRK05458 266 ---------YKNVEGKKILVPHKGSLKDTLTEMQQDLQSSISYAGGKDLDAIR-KVDYVIVKNSIFNGDK 325 (326) T ss_pred ---------CCCCCCEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHC-CCCEEEEECCCCCCCC T ss_conf ---------45788748987368888999999988877633375888688961-8889999388547778 No 13 >TIGR01303 IMP_DH_rel_1 IMP dehydrogenase family protein; InterPro: IPR005991 This family of proteins, often annotated as a putative IMP dehydrogenase, are related to IMP dehydrogenase and GMP reductase and restricted to the high GC Gram-positive bacteria.. Probab=100.00 E-value=0 Score=730.35 Aligned_cols=469 Identities=37% Similarity=0.559 Sum_probs=437.8 Q ss_pred CCCCCCCCCCCCCCHHEEECCCCCCCCCC-CEEEEEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCCH Q ss_conf 10166766764460005752764324754-22533430388160031783378523688899999977987999789898 Q gi|254780889|r 3 RIIENNVGGVALTFDDVLLRPEFSNVLPR-DIDISTRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFSP 81 (493) Q Consensus 3 ~~~~~~~~~~~lTfDDVllvP~~s~v~~~-~vdl~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~si 81 (493) ||+++..|.+.|||+||+++|.+|++.+| +|||+|. ++....||+|.+||.+|.+.+||.++||+|||-||++++++ T Consensus 1 rfl~~~~P~y~lty~dvf~vP~rs~v~sr~~vdl~~~--dG~GttiP~vvanmta~aGrrmaet~arrGG~~~~Pqd~P~ 78 (476) T TIGR01303 1 RFLNDSQPSYDLTYDDVFMVPSRSEVGSRLDVDLSTA--DGTGTTIPLVVANMTAVAGRRMAETVARRGGLVILPQDVPI 78 (476) T ss_pred CCCCCCCCCCCEEECEEEEECCCHHHCCCCEEECCCC--CCCCCEEHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCH T ss_conf 9667789862100110687125200002110210367--89864301100013454213577777514886896266777 Q ss_pred HHHHHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCC Q ss_conf 99999999874223881528789946543034565667605041678668860278872354500143446724400267 Q gi|254780889|r 82 SEQVAQVHQVKKFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTR 161 (493) Q Consensus 82 e~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~ 161 (493) +-..+.|..||.. +.+++.|+++.|+.++.++..++.++.|..-.|++++ ++++|+|++.|+...+.+ +.+.++|.+ T Consensus 79 ~~~~~~~~~vk~~-~~vldtP~~~~P~~~v~~a~~l~~krah~~~~v~~~~-~~P~G~v~~~~~~G~d~f-t~~~~~~~~ 155 (476) T TIGR01303 79 EVVKETVAFVKSR-DLVLDTPITLAPHDTVADALALIHKRAHGAAVVVEED-GKPVGVVTDKDLEGVDRF-TQVEEVMST 155 (476) T ss_pred HHHHHHHHHHHHC-CEEECCCEEECCCHHHHHHHHHHHHHCCCEEEEEECC-CCEEEEEECCCCCCHHHH-HHHHHHHHH T ss_conf 8999999876410-2022374165462338999998753114317898169-812678851531000456-788877754 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHH Q ss_conf 86325730358999999986553306457479978688980210001017422326568765556651370156789998 Q gi|254780889|r 162 NLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPL 241 (493) Q Consensus 162 ~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~L 241 (493) ++++++.+.+.++++.+|.++..+..|++|.+|+|.|++|++..+|...|.+|. |++||||++||||+++|...|++.| T Consensus 156 ~~~~~~~~~~P~~~f~~l~~~~~~~a~~~~~dG~l~G~l~r~Ga~ra~~y~Pa~-d~~Grlr~~aa~GinGd~~~~~~~l 234 (476) T TIGR01303 156 DLVTLPADLEPREAFDLLEEASRKLAPVVDADGSLAGILTRTGALRATLYTPAV-DAAGRLRIGAAVGINGDVEGKAKAL 234 (476) T ss_pred HHEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCEEECCCCCC-CCCCCEEEEEEECCCCCHHHHHHHH T ss_conf 320203678888999998741012011110577255544114300000015552-5677467766520244323789999 Q ss_pred HHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCC Q ss_conf 73136669982484113889999999998579872451464699999999854875899610278766531000157431 Q gi|254780889|r 242 FDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQ 321 (493) Q Consensus 242 veaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq 321 (493) .+||+|+|+||+||||+..+++.+|.++..-..+|+++|||++.+|.++|+++||+.||||+|||++||||..||+|+|| T Consensus 235 ~~aG~d~lv~dtahGhq~~~~~a~k~~~~ld~~~P~~aGn~v~a~G~rdl~~aGa~~~kvGvGPGamCttrm~tGvGrPq 314 (476) T TIGR01303 235 LDAGVDVLVIDTAHGHQEKMISAVKAVRALDLRVPIVAGNVVSAEGVRDLVEAGANIIKVGVGPGAMCTTRMMTGVGRPQ 314 (476) T ss_pred HHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEEECCEEECCCHHHHHHCCCCEEEECCCCCCHHHHHHHHCCCCCH T ss_conf 86688589983465406899999999986045586542422411004888744761898646886023444430578713 Q ss_pred CHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEE-ECCEEEEEEECCCCHHHHHH Q ss_conf 00378899986226981997265437776213367148789932301012167764487-36805888733200688752 Q gi|254780889|r 322 LSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFL-YQGRSFKSYRGMGSVAAMER 400 (493) Q Consensus 322 ~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~-~~g~~~K~yrGm~S~~A~~~ 400 (493) ++|++||+..++++|.+||||||||.+.|++.|||+||+.||+||||+||.||||++.. .+++.||+.|||+|..|.+. T Consensus 315 fsavleCa~~a~~~G~h~WadGG~r~PrdvalalaaGasnvm~GsWf~GtyesPGdl~~~~~~~~ykes~Gmas~rav~~ 394 (476) T TIGR01303 315 FSAVLECAAEARKLGKHVWADGGVRHPRDVALALAAGASNVMVGSWFAGTYESPGDLRRDADGRAYKESFGMASKRAVEA 394 (476) T ss_pred HHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHH T ss_conf 78998988999860772640688676377777665064302441100355478510111004740134531456888864 Q ss_pred CCHHHHCCCCCCCCCCEECCCCEEEE----ECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCEEEEECHHHHCCCC Q ss_conf 21222101233343310045402662----10575899899999999985551575877689737599999553412388 Q gi|254780889|r 401 GSSARYSQDGVTDVLKLVPEGIEGRV----PYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQKKANFIRVSVAGLRESH 476 (493) Q Consensus 401 ~~~~ry~~~~~~~~~k~v~eGv~~~v----~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~~~~f~~~t~ag~~E~~ 476 (493) ........ ...++-+++||++..- +.+|.++++++++..|+||+++|.|+.+|++|.+++..-.+|.+|+.|++ T Consensus 395 r~~~~~~~--~~ar~alfeeGist~r~~~d~~~~Gvedl~d~i~~Gvrs~~ty~Ga~~~~~~~e~a~vG~qs~~GyaeG~ 472 (476) T TIGR01303 395 RTSKEEAF--DRARKALFEEGISTSRMFLDPARGGVEDLIDAIISGVRSSCTYAGADSLEELTERAVVGVQSAAGYAEGK 472 (476) T ss_pred HHCCCHHH--HHHHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHHCHHHHEEECCHHHHHHHHHCEEEEEECCCCCCCCC T ss_conf 20001068--8999988751642122565057787789999875021230012033568865201145642146533788 Q ss_pred CCC Q ss_conf 863 Q gi|254780889|r 477 VHD 479 (493) Q Consensus 477 ~Hd 479 (493) |.. T Consensus 473 P~~ 475 (476) T TIGR01303 473 PLA 475 (476) T ss_pred CCC T ss_conf 788 No 14 >TIGR01305 GMP_reduct_1 guanosine monophosphate reductase; InterPro: IPR005993 Guanosine monophosphate reductase (1.7.1.7 from EC) catalyzes the irreversible and NADPH-dependent reductive deamination of GMP into IMP.NADPH + guanosine 5-phosphate = NADP+ + inosine 5-phosphate + NH3 It converts nucleobase, nucleoside and nucleotide derivatives of G to A nucleotides, and maintains intracellular balance of A and G nucleotides. A deep split separates two families of GMP reductase. This family includes both eukaryotic and some proteobacterial sequences. ; GO: 0003920 GMP reductase activity, 0009117 nucleotide metabolic process. Probab=100.00 E-value=0 Score=462.06 Aligned_cols=324 Identities=38% Similarity=0.652 Sum_probs=285.2 Q ss_pred CCCCCHHEEECCCCCCCCCC-CEEEEEEEC----CCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHH Q ss_conf 64460005752764324754-225334303----8816003178337852368889999997798799978989899999 Q gi|254780889|r 12 VALTFDDVLLRPEFSNVLPR-DIDISTRIA----KDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQVA 86 (493) Q Consensus 12 ~~lTfDDVllvP~~s~v~~~-~vdl~t~lt----~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa~ 86 (493) .-|.|-||||.|++|...+| +|+|.-.++ +...-.+|||.||||||---+||.+|+..-=+.-||+..++++ T Consensus 6 ~kl~fkdvl~rPkrs~lksr~~v~l~r~~~f~~s~~~~~G~P~~~~nmdtvGtf~ma~~l~~~~~~t~~hkhys~~~--- 82 (343) T TIGR01305 6 LKLDFKDVLLRPKRSTLKSRADVELEREFTFRNSKQTYSGVPIIAANMDTVGTFEMAAALASHKILTAIHKHYSVDE--- 82 (343) T ss_pred CCCCCEEEEECCCCCCCCCCCCCEECCEEEEECCCCEECCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCHHH--- T ss_conf 25562014641661001121100001002331146600354267633542105789998752256646665303688--- Q ss_pred HHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCC Q ss_conf 99987422388152878994654303456566760504167866886027887235450014344672440026786325 Q gi|254780889|r 87 QVHQVKKFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITV 166 (493) Q Consensus 87 ~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv 166 (493) ||.+-+ +... ..+..+ T Consensus 83 ----W~~f~~----------------------------------~~~~------------------~~l~~~-------- 98 (343) T TIGR01305 83 ----WKAFAA----------------------------------SASP------------------DVLKNV-------- 98 (343) T ss_pred ----HHHHHH----------------------------------CCCH------------------HHHHHH-------- T ss_conf ----999872----------------------------------1334------------------576543-------- Q ss_pred CCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHC-- Q ss_conf 730358999999986553306457479978688980210001017422326568765556651370156789998731-- Q gi|254780889|r 167 KKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDV-- 244 (493) Q Consensus 167 ~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~Lvea-- 244 (493) .-.-|..+++.++...+++. T Consensus 99 ----------------------------------------------------------~~s~G~~~~d~~k~~~~~~~~P 120 (343) T TIGR01305 99 ----------------------------------------------------------AVSSGSSDNDLEKLKSILEEVP 120 (343) T ss_pred ----------------------------------------------------------HHHCCCCHHHHHHHHHHHHHCC T ss_conf ----------------------------------------------------------3305885446899999997267 Q ss_pred CCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHH Q ss_conf 36669982484113889999999998579872451464699999999854875899610278766531000157431003 Q gi|254780889|r 245 NVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSA 324 (493) Q Consensus 245 GvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~a 324 (493) -+..||+|+||||++.+++.++.+|+.||.-.|++|||+|.|-.+.|+-.|||.|||||||||+||||..+|+|+|||+| T Consensus 121 ~~~~~C~dvanGyse~fv~f~~~~r~~~P~~ti~aGnvvtGem~eelilsGadi~kvG~GPGsvCttr~k~GvGyPqlsa 200 (343) T TIGR01305 121 QLKFICLDVANGYSEHFVEFVKKVREAYPKKTIMAGNVVTGEMVEELILSGADIVKVGIGPGSVCTTRKKTGVGYPQLSA 200 (343) T ss_pred CCEEEEEEECCCCHHHHHHHHHHHHHHCCCCEEEECCCEEHHHHHHHHHCCCCEEEECCCCCCEECCCCCCCCCCHHHHH T ss_conf 70379986035621789999999997365330431341004677877734763799635887501122004567513667 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHH--CC Q ss_conf 7889998622698199726543777621336714878993230101216776448736805888733200688752--21 Q gi|254780889|r 325 IMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMER--GS 402 (493) Q Consensus 325 v~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~--~~ 402 (493) |.||+++++..+-.|++|||+.+|||++||+++|||+||+|+||||++|+.|++++.||+++|-|+||.|-.||++ |+ T Consensus 201 v~eCad~ahGl~G~~~sdGGC~~PGdvakaf~~gadfvm~GG~~~Gh~~~~G~~~e~nG~k~~lfyGmss~~am~~h~G~ 280 (343) T TIGR01305 201 VIECADAAHGLKGHIVSDGGCTTPGDVAKAFGAGADFVMLGGLLAGHEESGGEVVEENGKKFKLFYGMSSDTAMKKHAGG 280 (343) T ss_pred HHHHHHHCCCCCCEEEECCCCCCCHHHHHHHHCCCCEEEECCCCCCCCCCCCHHHHCCCCEEEEEECCCHHHHHHHCCCC T ss_conf 65432101575524872688898126788773477767652201153210114522068658887515526777531575 Q ss_pred HHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCEEEEECHH Q ss_conf 22210123334331004540266210575899899999999985551575877689737599999553 Q gi|254780889|r 403 SARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQKKANFIRVSVA 470 (493) Q Consensus 403 ~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~~~~f~~~t~a 470 (493) -..|. .+||..-.+||||++++.+.++.|||||+|+|+|+..|+||.+++.|+|++.. T Consensus 281 vaeyr----------a~eGktv~~P~~G~v~~t~~dilGG~rs~Cty~Ga~~lkel~~r~tfirv~~q 338 (343) T TIGR01305 281 VAEYR----------ASEGKTVKVPYRGDVENTVRDILGGLRSACTYVGAKKLKELAKRATFIRVTQQ 338 (343) T ss_pred CCEEE----------CCCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCEEEEEEECC T ss_conf 10011----------26885787214577467999873004455566547888876410014687502 No 15 >TIGR01304 IMP_DH_rel_2 IMP dehydrogenase family protein; InterPro: IPR005992 This family of proteins, often annotated as a putative IMP dehydrogenase, are related to IMP dehydrogenase and GMP reductase. Most species with a member of this family belong to the high GC Gram-positive bacteria.. Probab=100.00 E-value=0 Score=438.12 Aligned_cols=333 Identities=31% Similarity=0.495 Sum_probs=253.4 Q ss_pred CCCCCCCCCCCCCHHEEECCCCCCCCCCCEEEEEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEEC------C Q ss_conf 0166766764460005752764324754225334303881600317833785236888999999779879997------8 Q gi|254780889|r 4 IIENNVGGVALTFDDVLLRPEFSNVLPRDIDISTRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIH------R 77 (493) Q Consensus 4 ~~~~~~~~~~lTfDDVllvP~~s~v~~~~vdl~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIH------r 77 (493) +..++.....|.|||+.+||++.+..|.+||+.+++ +-+++.|||+.+|||.|..++|||.|.++||||||- | T Consensus 3 IG~Gk~ARRaY~ldeIavVpsrRTrdpk~v~t~W~i-DAy~felP~~a~p~Davvsp~~ai~lg~lGgLGV~N~EGL~tR 81 (376) T TIGR01304 3 IGRGKTARRAYALDEIAVVPSRRTRDPKDVDTAWQI-DAYRFELPFLAHPMDAVVSPEFAIELGELGGLGVLNLEGLWTR 81 (376) T ss_pred CCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCC-CEEEECCCCCCCCCCCCCCHHHHHHHHHCCCCEEEECCHHHHH T ss_conf 168885220125554555387434870002100102-3123246501166544247699999872254315411023111 Q ss_pred CCCHHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCC Q ss_conf 98989999999987422388152878994654303456566760504167866886027887235450014344672440 Q gi|254780889|r 78 NFSPSEQVAQVHQVKKFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGE 157 (493) Q Consensus 78 n~sie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~e 157 (493) +.++++...+|..+... || --+++.++|.|- ..-|+ ++ T Consensus 82 h~D~~~~ld~i~~~~~~------~P-------~~~~a~R~LQEL--yAaPl---------------------------~~ 119 (376) T TIGR01304 82 HEDPEPLLDKIAEADKE------DP-------DQAEATRLLQEL--YAAPL---------------------------KP 119 (376) T ss_pred HCCHHHHHHHHHHHHHC------CC-------CHHHHHHHHHHH--HHCCC---------------------------CH T ss_conf 13778999999987515------88-------478998888998--63679---------------------------86 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHH Q ss_conf 02678632573035899999998655330645747997868898021000101742232656876555665137015678 Q gi|254780889|r 158 LMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADR 237 (493) Q Consensus 158 iMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eR 237 (493) + =+.+..+..+ +.|.+ +++++++- +..+- T Consensus 120 ----e--------Ll~~ri~~vr-----------~aG~i---------------------------~Av~lsPq-~~~~~ 148 (376) T TIGR01304 120 ----E--------LLGKRIAEVR-----------DAGVI---------------------------TAVRLSPQ-NASKL 148 (376) T ss_pred ----H--------HHHHHHHHHH-----------HCCCE---------------------------EEEEECCH-HHHHH T ss_conf ----4--------7899999997-----------26848---------------------------99986653-16788 Q ss_pred HHHHHHCCCCEEEEE-----CCCCHHHHHHHHHH--HHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEE-ECCCCCC Q ss_conf 999873136669982-----48411388999999--999857987245146469999999985487589961-0278766 Q gi|254780889|r 238 VGPLFDVNVDLVVVD-----TAHGHSQKVLDAVV--QIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVG-IGPGSIC 309 (493) Q Consensus 238 a~~LveaGvDviviD-----~ahGh~~~~~~~i~--~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvG-ig~Gs~C 309 (493) +..+++||+|+|||. .+|=.++ -.|.++ ++-+++ ++|||+|||.||+.+++||++||-+|.|| +|||..| T Consensus 149 a~~vv~AG~DLLvIqgT~vSaehv~~e-~~E~LnLk~fi~eL-DvPVv~Ggv~~Y~~ALhLMRtGAagvlVGfgG~ga~~ 226 (376) T TIGR01304 149 APVVVEAGADLLVIQGTVVSAEHVSSE-SGEPLNLKKFIQEL-DVPVVAGGVVTYTTALHLMRTGAAGVLVGFGGPGAAT 226 (376) T ss_pred HHHHHHHCCCEEEEHHHHHHHEEECCC-CCCCHHHHHHHHHC-CCCEEECCCCCHHHHHHHHHHCCEEEEECCCCCCCCC T ss_conf 899997173004200123201004688-88721488897548-9887883853088999986301137886457887342 Q ss_pred CCCHHCCCCCCCCHHHHHHHHHHHCC-------CCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEEC Q ss_conf 53100015743100378899986226-------98199726543777621336714878993230101216776448736 Q gi|254780889|r 310 TTRVVTGVGCPQLSAIMSVVEVAERA-------GVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQ 382 (493) Q Consensus 310 tTr~~~g~g~pq~~av~~~~~~~~~~-------~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~ 382 (493) |||.+.|+.|||.|||.|||-+.++| +|||||||||.+|||++||||||||+|||||.||...|+||+ T Consensus 227 T~~~vLG~~VpmATAiAD~AAARrDYLdEtGGRYVHviADG~i~~sGd~~KAIACGADAV~lGSPLArA~eA~G~----- 301 (376) T TIGR01304 227 TTREVLGIEVPMATAIADVAAARRDYLDETGGRYVHVIADGGIETSGDLAKAIACGADAVVLGSPLARAAEAPGR----- 301 (376) T ss_pred CCCCCCEECCCCHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCHHHHEECCCCHHHHCCCHHHHHHCCCC----- T ss_conf 466534210672678999997301133306893377886287055463001001377602007802566304788----- Q ss_pred CEEEEEEECCCCHHHHHHCCHHHHCCCCCCCCCCEECCCCEEEEE-CCC----CHHHHHH----------HHHHHHHHHH Q ss_conf 805888733200688752212221012333433100454026621-057----5899899----------9999999855 Q gi|254780889|r 383 GRSFKSYRGMGSVAAMERGSSARYSQDGVTDVLKLVPEGIEGRVP-YKG----PIASVLH----------QMSGGLKSSM 447 (493) Q Consensus 383 g~~~K~yrGm~S~~A~~~~~~~ry~~~~~~~~~k~v~eGv~~~v~-~~G----~~~~~~~----------~~~~glrs~~ 447 (493) | -||||.... ..+|-|....+- .-| +++++|. +|.||||.+| T Consensus 302 G----~fWg~~a~~-------------------p~LPRG~~~~~Gv~~Ge~~Ptl~~iL~GPs~~p~G~~Nl~GaLkram 358 (376) T TIGR01304 302 G----FFWGMAAAH-------------------PELPRGKKVDVGVTVGEEAPTLEEILVGPSKLPDGSLNLLGALKRAM 358 (376) T ss_pred C----CCCCCCCCC-------------------CCCCCCCEEEEEEECCCCCCCHHHHCCCCCCCCCCCCCHHHHHHHHH T ss_conf 8----617752587-------------------87888737872002178888779873898988788543157899988 Q ss_pred HHCCCCCHHHHHH Q ss_conf 5157587768973 Q gi|254780889|r 448 GYVGASNIEEFQK 460 (493) Q Consensus 448 ~y~G~~~i~e~~~ 460 (493) ..+|+++|+|||+ T Consensus 359 A~~Gy~dLKefQ~ 371 (376) T TIGR01304 359 AKLGYKDLKEFQK 371 (376) T ss_pred HHCCHHHCCCCCE T ss_conf 7325221420141 No 16 >TIGR01306 GMP_reduct_2 guanosine monophosphate reductase; InterPro: IPR005994 Guanosine monophosphate reductase (1.7.1.7 from EC) catalyzes the irreversible and NADPH-dependent reductive deamination of GMP into IMP.NADPH + guanosine 5-phosphate = NADP+ + inosine 5-phosphate + NH3 It converts nucleobase, nucleoside and nucleotide derivatives of G to A nucleotides, and maintains intracellular balance of A and G nucleotides. A deep split separates two families of GMP reductase. This family is found in a variety of bacterial lineages. . Probab=100.00 E-value=0 Score=384.02 Aligned_cols=310 Identities=35% Similarity=0.595 Sum_probs=267.9 Q ss_pred CCCHHEEECCCCCCCCCC-CEEEEEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHH Q ss_conf 460005752764324754-2253343038816003178337852368889999997798799978989899999999874 Q gi|254780889|r 14 LTFDDVLLRPEFSNVLPR-DIDISTRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQVK 92 (493) Q Consensus 14 lTfDDVllvP~~s~v~~~-~vdl~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~kVK 92 (493) +.|.|+-|+|..-=+.+| +.|++.+|. .-.+++|+|-+||.||-+.++|..+|..|=.=|.||+-. |.....+++.. T Consensus 2 fdyediqliP~kCi~~srs~~dt~~~lG-~~~fklPv~Panmqt~~~e~~a~~la~~~yfy~mhrf~~-~~r~~fik~m~ 79 (321) T TIGR01306 2 FDYEDIQLIPNKCIVESRSECDTSVRLG-KKKFKLPVVPANMQTIIDEKLAKSLAENGYFYIMHRFDE-EARIPFIKDMQ 79 (321) T ss_pred CCCHHHEECCCCEEEECCCCCCCEEEEC-CEEEECCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECCC-HHHHHHHHHHH T ss_conf 6500110123410240022444046665-612101112236788888999999851695799981470-11268999887 Q ss_pred HHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 22388152878994654303456566760504167866886027887235450014344672440026786325730358 Q gi|254780889|r 93 KFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNL 172 (493) Q Consensus 93 r~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l 172 (493) +.| T Consensus 80 ---------------------------------------~~G-------------------------------------- 82 (321) T TIGR01306 80 ---------------------------------------ERG-------------------------------------- 82 (321) T ss_pred ---------------------------------------HCC-------------------------------------- T ss_conf ---------------------------------------478-------------------------------------- Q ss_pred HHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCC--CCEEE Q ss_conf 9999999865533064574799786889802100010174223265687655566513701567899987313--66699 Q gi|254780889|r 173 ENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVN--VDLVV 250 (493) Q Consensus 173 ~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaG--vDviv 250 (493) |...-.||+.....+-+..|.+.- -..+. T Consensus 83 -------------------------------------------------l~~sisvGvk~~ey~f~~~l~~~~l~Pe~~t 113 (321) T TIGR01306 83 -------------------------------------------------LFASISVGVKKAEYEFVEKLAEEKLIPEYIT 113 (321) T ss_pred -------------------------------------------------CEEEEEECCHHHHHHHHHHHHHCCCCCCEEE T ss_conf -------------------------------------------------5466520200356899999874267861578 Q ss_pred EECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCC--CCHHHHHH Q ss_conf 8248411388999999999857987245146469999999985487589961027876653100015743--10037889 Q gi|254780889|r 251 VDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCP--QLSAIMSV 328 (493) Q Consensus 251 iD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~p--q~~av~~~ 328 (493) ||+||||+..++++|+++|+.+|+.++|+|||.|+|+.+.|-.||||+-|||||||.+|.|++.||+|.+ ||.|+..| T Consensus 114 idiahGh~~~vi~mi~h~k~~~P~~fviaGnvGtPe~vrelenaGadatkvGiGPG~vCitk~ktGfGtGGWqlaal~~C 193 (321) T TIGR01306 114 IDIAHGHSNSVIEMIKHIKTHLPDSFVIAGNVGTPEAVRELENAGADATKVGIGPGKVCITKIKTGFGTGGWQLAALRWC 193 (321) T ss_pred EEEECCCHHHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHCCCCCEEECCCCCCEEEEEEECCCCCCHHHHHHHHHH T ss_conf 87403633789999999987488416875467882556766533764113224787368986402557615899999998 Q ss_pred HHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHHHHCC Q ss_conf 99862269819972654377762133671487899323010121677644873680588873320068875221222101 Q gi|254780889|r 329 VEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSARYSQ 408 (493) Q Consensus 329 ~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ry~~ 408 (493) +++++ .|||||||||+.|||+|.+-.||+.||+|||||++.||||++++.+|+.||+|+|.+| .|.+ T Consensus 194 ~kaa~---kP~iadGGirthGdiaksirfGa~mvmiGslfa~h~esPG~t~e~dG~~~key~Gsas----------e~~k 260 (321) T TIGR01306 194 AKAAR---KPIIADGGIRTHGDIAKSIRFGASMVMIGSLFAGHEESPGKTVEVDGKLYKEYFGSAS----------EFQK 260 (321) T ss_pred HHHHC---CCEEECCCCCCCHHHHHHHHHCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCH----------HCCC T ss_conf 88636---8703158523300344555531043123345420246875121320256675505400----------0246 Q ss_pred CCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCEEEEECHH Q ss_conf 23334331004540266210575899899999999985551575877689737599999553 Q gi|254780889|r 409 DGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQKKANFIRVSVA 470 (493) Q Consensus 409 ~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~~~~f~~~t~a 470 (493) .+ .|.| ||..-.++.||++.+.|.++...|||+++|+|...|..+| +..|+.+.++ T Consensus 261 Ge----~knv-eGkk~~~~~kG~~~dtl~em~qdlqssisyaGG~~l~~~r-~vdyviv~ns 316 (321) T TIGR01306 261 GE----HKNV-EGKKMLVEHKGSLKDTLKEMQQDLQSSISYAGGKDLDSLR-TVDYVIVKNS 316 (321) T ss_pred CC----CCCC-CCEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHH-CCCEEEEEEE T ss_conf 65----3344-6406886036736889999987765442111540134441-0035788510 No 17 >COG0516 GuaB IMP dehydrogenase/GMP reductase [Nucleotide transport and metabolism] Probab=99.97 E-value=2.6e-31 Score=244.02 Aligned_cols=91 Identities=40% Similarity=0.632 Sum_probs=83.0 Q ss_pred CCCCCCCCCCCCCCCCHHEEECCCCCCCCCCCEEEEEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCC Q ss_conf 92101667667644600057527643247542253343038816003178337852368889999997798799978989 Q gi|254780889|r 1 MARIIENNVGGVALTFDDVLLRPEFSNVLPRDIDISTRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFS 80 (493) Q Consensus 1 m~~~~~~~~~~~~lTfDDVllvP~~s~v~~~~vdl~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~s 80 (493) |+++...++...+||||||+|+|++|++.|++++|+|+++++|.|+||++||+||||||++|||+||+.|||||||+||+ T Consensus 2 ~~~~~~~k~~~~~ltfddVll~p~~s~v~p~~~~vkt~i~~~i~l~iP~vSA~MDtVtea~mAI~ma~~GGIGVih~nm~ 81 (170) T COG0516 2 LMQVLKIKKARPGLTFDDVLLLPAASDVAPAGVDVKTGLGPGIGVNIPQVSAAMDTVTEARMAIAMARDGGIGVMHGNML 81 (170) T ss_pred HHHHHHHHHCCCCEEECCEEECHHHHHCCCCCCEEEECCCCCCCCCCCEEEHHHHHHHHHHHHHHHHHCCCEEEEECCCC T ss_conf 34766544226874574487533345403378826754359940285503226789998988799998698799968857 Q ss_pred HHHHHHHHHHH Q ss_conf 89999999987 Q gi|254780889|r 81 PSEQVAQVHQV 91 (493) Q Consensus 81 ie~qa~~V~kV 91 (493) +++|+.++.++ T Consensus 82 ~~e~~~~v~~v 92 (170) T COG0516 82 AEESPGEYLYQ 92 (170) T ss_pred HHHCCCCEEEE T ss_conf 75689706998 No 18 >cd04602 CBS_pair_IMPDH_2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the inosine 5' monophosphate dehydrogenase (IMPDH) protein. IMPDH is an essential enzyme that catalyzes the first step unique to GTP synthesis, playing a key role in the regulation of cell proliferation and differentiation. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in IMPDH have been associated with retinitis pigmentos Probab=99.90 E-value=2.1e-23 Score=187.35 Aligned_cols=110 Identities=36% Similarity=0.643 Sum_probs=103.2 Q ss_pred ECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCC--CEEEEEEEEHHHHCCCCCCCCCCCCCCCC--CCCCCCCCCCHHH Q ss_conf 5287899465430345656676050416786688--60278872354500143446724400267--8632573035899 Q gi|254780889|r 99 VVNPVTISPYATLADALALMKKYSISGIPVVESD--VGKLVGILTNRDVRFASNAQQAVGELMTR--NLITVKKTVNLEN 174 (493) Q Consensus 99 I~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~--~gkLiGIVT~rDir~~~~~~~~V~eiMt~--~litv~~~~~l~e 174 (493) |+||+|++|++|+.||+++|.+++++++||+|++ .+||+||||.+|+++....+.+++++|++ .+++.++++++++ T Consensus 1 i~dPvtv~p~~tv~ea~~lm~~~~~~~~pVvd~g~~~~klvGIvT~rDi~~~~~~~~~v~~iMt~~~~~i~~~~~~~l~e 80 (114) T cd04602 1 ITDPSVLSPDHTVADVLEIKEKKGFSGIPVTEDGKSGGKLLGIVTSRDIDFLTDSETPLSEVMTPREVLVVAPTGITLEE 80 (114) T ss_pred CCCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCCCCEEEEEEEHHHHHHCCCCCCCCCCEEECCCEEEEECCCCCHHH T ss_conf 98899979989999999999971987799996577689699999978952350767656351652766899479878999 Q ss_pred HHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 9999986553306457479978688980210001 Q gi|254780889|r 175 AKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 175 A~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) |.++|.++++++|||||++|+|+||||++||+|+ T Consensus 81 a~~~m~~~~i~~LPVVd~~g~LvGiIT~~Di~k~ 114 (114) T cd04602 81 ANEILRESKKGKLPIVNDDGELVALVTRSDLKKN 114 (114) T ss_pred HHHHHHHCCCCEEEEECCCCEEEEEEECHHHCCC T ss_conf 9999997496678199789969999992440369 No 19 >cd04730 NPD_like 2-Nitropropane dioxygenase (NPD), one of the nitroalkane oxidizing enzyme families, catalyzes oxidative denitrification of nitroalkanes to their corresponding carbonyl compounds and nitrites. NDP is a member of the NAD(P)H-dependent flavin oxidoreductase family that reduce a range of alternative electron acceptors. Most use FAD/FMN as a cofactor and NAD(P)H as electron donor. Some contain 4Fe-4S cluster to transfer electron from FAD to FMN. Probab=99.87 E-value=1.8e-20 Score=166.26 Aligned_cols=196 Identities=24% Similarity=0.382 Sum_probs=144.1 Q ss_pred ECCCEEEECCCCCCCHHHHHHHHHCCCEEEE-CCCCCHHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHCC Q ss_conf 0031783378523688899999977987999-789898999999998742238815287899465430345656676050 Q gi|254780889|r 45 LNLPIMSAAMDQVTDSRLAIAMAQAGGLGVI-HRNFSPSEQVAQVHQVKKFESGMVVNPVTISPYATLADALALMKKYSI 123 (493) Q Consensus 45 l~iPivSs~MDTVTe~~MAiamA~~GGiGvI-Hrn~sie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~i 123 (493) ++.||+.+||+.+++++||.|+++.||+|+| +.++++|+.-++++++|..- T Consensus 1 i~~PIi~a~M~~vs~~~LaaAvs~aGglG~l~~~~~~~~~l~~~i~~~~~~~---------------------------- 52 (236) T cd04730 1 IRYPIIQAPMAGVSTPELAAAVSNAGGLGFIGAGYLTPEALRAEIRKIRALT---------------------------- 52 (236) T ss_pred CCCCEECCCCCCCCCHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHHHHHC---------------------------- T ss_conf 9748687887787869999999968985585788899999999999999746---------------------------- Q ss_pred CCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEEC Q ss_conf 41678668860278872354500143446724400267863257303589999999865533064574799786889802 Q gi|254780889|r 124 SGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVK 203 (493) Q Consensus 124 s~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~ 203 (493) +++.|+ +++.-.. T Consensus 53 ----------~~pfgv----------------------nl~~~~~----------------------------------- 65 (236) T cd04730 53 ----------DKPFGV----------------------NLLVPSS----------------------------------- 65 (236) T ss_pred ----------CCCEEE----------------------CCCCCCC----------------------------------- T ss_conf ----------997244----------------------3324677----------------------------------- Q ss_pred CCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCC Q ss_conf 10001017422326568765556651370156789998731366699824841138899999999985798724514646 Q gi|254780889|r 204 DIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIA 283 (493) Q Consensus 204 DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~ 283 (493) .++..+.++.+++.+++++.. +.|+... .++++|+. .+. +...|. T Consensus 66 ---------------------------~~~~~~~~~~~~~~~v~~v~~--~~g~p~~---~v~~l~~~--g~~-v~~~v~ 110 (236) T cd04730 66 ---------------------------NPDFEALLEVALEEGVPVVSF--SFGPPAE---VVERLKAA--GIK-VIPTVT 110 (236) T ss_pred ---------------------------CCCHHHHHHHHHHCCCCEEEE--CCCCCHH---HHHHHHHC--CCE-EEEECC T ss_conf ---------------------------636899999999769999998--7989789---99999982--998-999589 Q ss_pred CHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEE Q ss_conf 99999999854875899610278766531000157431003788999862269819972654377762133671487899 Q gi|254780889|r 284 TAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVM 363 (493) Q Consensus 284 t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM 363 (493) |.+.++.+.++|+|+|-+ .|+.+-.-+. ....+.++.+.++.+. .++|||+.|||.+..|+.+||++|||.|| T Consensus 111 s~~~A~~a~~~GaD~iv~-qG~eAGGH~g---~~~~~~~~lv~~v~~~---~~ipviaAGGI~~g~~i~aal~lGA~gV~ 183 (236) T cd04730 111 SVEEARKAEAAGADALVA-QGAEAGGHRG---TFDIGTFALVPEVRDA---VDIPVIAAGGIADGRGIAAALALGADGVQ 183 (236) T ss_pred CHHHHHHHHHCCCCEEEE-ECCCCCCCCC---CCCCCHHHHHHHHHHH---HCCCEEEECCCCCHHHHHHHHHHCCCEEE T ss_conf 899999999818998999-7777777889---8755567799999998---29868965462778999999980897999 Q ss_pred ECCHHHCCCCCCCC Q ss_conf 32301012167764 Q gi|254780889|r 364 IGSLLAGTDESPGD 377 (493) Q Consensus 364 ~G~~~agt~Espg~ 377 (493) ||+.|..|.|||-. T Consensus 184 ~GTrfl~t~Es~~~ 197 (236) T cd04730 184 MGTRFLATEESGAS 197 (236) T ss_pred ECCHHHHCCCCCCC T ss_conf 55385708454799 No 20 >cd04596 CBS_pair_DRTGG_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a DRTGG domain upstream. The function of the DRTGG domain, named after its conserved residues, is unknown. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.87 E-value=1.1e-21 Score=174.98 Aligned_cols=108 Identities=29% Similarity=0.415 Sum_probs=101.4 Q ss_pred ECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 52878994654303456566760504167866886027887235450014344672440026786325730358999999 Q gi|254780889|r 99 VVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNLENAKAL 178 (493) Q Consensus 99 I~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l~eA~~i 178 (493) |.||++++|++|+.+|+++|.+++|+++||+|++ ++|+|+||.+|+... ..+.+++++|+++++++++++++.+|+++ T Consensus 1 ~~dv~~l~p~~tv~~a~~lm~~~~~~~~PVvd~~-~~lvGivt~~Di~~~-~~~~~v~~iM~~~~~~v~~~~~~~~~~~~ 78 (108) T cd04596 1 LEDTGYLTTTDTVKDWHELNKETGHSRFPVVDEK-NKVVGIVTSKDVAGK-DPDTTIEKVMTKNPITVNPKTSVASVAHM 78 (108) T ss_pred CCCCEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CEEEEEEEHHHHHHC-CCCCCHHHHEECCCCCCCCCCCHHHHHHH T ss_conf 9988898999979999999998599889999689-909999997997517-98872678500476205999889999999 Q ss_pred HHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 986553306457479978688980210001 Q gi|254780889|r 179 LHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 179 M~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) |.+++++++||||++|+|+|+||++|++|+ T Consensus 79 m~~~~~~~lPVVde~~~lvGiIT~~Dil~A 108 (108) T cd04596 79 MIWEGIEMLPVVDDNKKLLGIISRQDVLKA 108 (108) T ss_pred HHHHCCCEEEEECCCCEEEEEEEHHHHHCC T ss_conf 998498778999169959999997996319 No 21 >cd04594 CBS_pair_EriC_assoc_archaea This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the EriC CIC-type chloride channels in archaea. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS do Probab=99.86 E-value=1.7e-21 Score=173.66 Aligned_cols=103 Identities=23% Similarity=0.366 Sum_probs=96.0 Q ss_pred CCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 87899465430345656676050416786688602788723545001434467244002678632573035899999998 Q gi|254780889|r 101 NPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNLENAKALLH 180 (493) Q Consensus 101 dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~ 180 (493) +|+|++|++|+.+|.++|.+++++++||+|+ +||+||+|+||+... ...+|.++|+++++++++++++++|+++|. T Consensus 2 ~pVtV~pd~tV~eA~~lM~~~~i~~lPVvd~--~klvGIvt~rDi~~~--~~~~v~d~Mt~~v~tv~p~~~l~ea~~lM~ 77 (104) T cd04594 2 KDIKVKDYDKVYEAKRIMIENDLLSLPVVDY--NKFLGAVYLKDIENA--TYGDVVDYIVRGIPYVRLTSTAEEAWEVMM 77 (104) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHHHC--CCCCHHHEEECCCEEECCCCCHHHHHHHHH T ss_conf 9879899993999999999839887889989--999999997996524--699844547639888999897999999999 Q ss_pred HCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 6553306457479978688980210001 Q gi|254780889|r 181 QHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 181 ~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) ++++++||||| +|+|+|+||++||+++ T Consensus 78 ~~~i~~lPVvd-~gklvGIIT~~Dil~A 104 (104) T cd04594 78 KNKTRWCPVVD-DGKFKGIVTLDSILDA 104 (104) T ss_pred HCCCCEEEEEE-CCEEEEEEEHHHHHCC T ss_conf 86978789998-9999999996895378 No 22 >cd04635 CBS_pair_22 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.86 E-value=2.2e-21 Score=172.80 Aligned_cols=108 Identities=28% Similarity=0.518 Sum_probs=98.1 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC-C-------------CCCCCCCCCCCCCCCCCC Q ss_conf 28789946543034565667605041678668860278872354500-1-------------434467244002678632 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR-F-------------ASNAQQAVGELMTRNLIT 165 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir-~-------------~~~~~~~V~eiMt~~lit 165 (493) +||+|++|++|+.+|+++|.+++++++||+|++ |+|+|+||.+|+. + ......+++++|+++++| T Consensus 1 r~vitv~~d~~l~~a~~~m~~~~i~~lPVVd~~-g~lvGiit~~Dll~~~~~~~~~~~~~~~~~~~~~~v~~iM~~~~it 79 (122) T cd04635 1 REPVTCTPDDPVSKVWDLMLESGFTGLPVVQKA-GELIGIITRRDIIRAGSVRTSVEDQQRTQTKASPTVEKIMSTPVYS 79 (122) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECC-CCEEEEEEHHHHHHHHHHCCCCHHHHHCCCCCCCCHHHHCCCCCEE T ss_conf 999996892979999999997199489999189-8299999989987655400120034431453468999968689879 Q ss_pred CCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 5730358999999986553306457479978688980210001 Q gi|254780889|r 166 VKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 166 v~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +++++++.+|.++|.+++++++||||++|+|+|+||++||+|+ T Consensus 80 v~~~~~l~~a~~~m~~~~i~~lpVvd~~g~lvGiIt~~DilrA 122 (122) T cd04635 80 VTPDDSIATAVELMLEHDIGRLPVVNEKDQLVGIVDRHDVLKA 122 (122) T ss_pred ECCCCCHHHHHHHHHHHCCCEEEEECCCCEEEEEEEHHHHHCC T ss_conf 9896939999998898099889899379989999997996369 No 23 >cd04610 CBS_pair_ParBc_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a ParBc (ParB-like nuclease) domain downstream. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.86 E-value=2.8e-21 Score=172.05 Aligned_cols=106 Identities=33% Similarity=0.582 Sum_probs=100.2 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 28789946543034565667605041678668860278872354500143446724400267863257303589999999 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNLENAKALL 179 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM 179 (493) +||+|++|++|+.||+++|.+++++++||+|+ ++++||||.+|+.+. +.+.+|+++|+++++++++++++.+|.++| T Consensus 2 kdvvtv~~d~tv~ea~~~m~~~~~~~~pVvd~--~~lvGivt~~Dll~~-~~~~~v~~iM~~~~~tv~~~~~l~~a~~~m 78 (107) T cd04610 2 RDVITVSPDNTVKDVIKLIKETGHDGFPVVDN--GKVVGIVSARDLLGK-DPDETVEEIMSKDLVVAVPEMDIMDAARVM 78 (107) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHCCC-CCCCCHHHHCCCCCCCCCCCCCHHHHHHHH T ss_conf 88099899295999999999759978999989--999998876453159-987726786057860169998699999999 Q ss_pred HHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 86553306457479978688980210001 Q gi|254780889|r 180 HQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 180 ~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) .++++++|||||++|+|+|+||++|++|+ T Consensus 79 ~~~~~~~lpVvde~g~lvGiiT~~Dilra 107 (107) T cd04610 79 FRTGISKLPVVDENNNLVGIITNTDVIRS 107 (107) T ss_pred HHHCCCEEEEECCCCEEEEEEEHHHHHCC T ss_conf 98299689699239989999994784439 No 24 >cd04601 CBS_pair_IMPDH This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the inosine 5' monophosphate dehydrogenase (IMPDH) protein. IMPDH is an essential enzyme that catalyzes the first step unique to GTP synthesis, playing a key role in the regulation of cell proliferation and differentiation. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in IMPDH have been associated with retinitis pigmentosa. Probab=99.86 E-value=2.8e-21 Score=172.09 Aligned_cols=109 Identities=58% Similarity=0.901 Sum_probs=102.5 Q ss_pred ECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCC-CCCCCCCCCCHHHHHH Q ss_conf 528789946543034565667605041678668860278872354500143446724400267-8632573035899999 Q gi|254780889|r 99 VVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTR-NLITVKKTVNLENAKA 177 (493) Q Consensus 99 I~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~-~litv~~~~~l~eA~~ 177 (493) |+||+|++|++|+.||+++|.+++++++||+|++ ++|+|++|.+|+++..+.+.+++++|++ +++++++++++.+|++ T Consensus 1 irdvvtv~~~~sl~~a~~~m~~~~~~~~pVvd~~-~~l~Giit~~Dl~~~~~~~~~v~~iM~~~~~~~~~~~~~~~~~~~ 79 (110) T cd04601 1 IRDPITVSPDATVAEALELMAEYGISGLPVVDDD-GKLVGIVTNRDLRFETDLDKPVSEVMTPENLLTTVEGTSLEEALE 79 (110) T ss_pred CCCCEEECCCCCHHHHHHHHHHHCCCEEEEEECC-CEEEEEEEHHHHHHHCCCCCCCEEEEECCCEEEECCCCCHHHHHH T ss_conf 9889897993969999999998499889999489-969878877677650576786035770575087047687999999 Q ss_pred HHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 9986553306457479978688980210001 Q gi|254780889|r 178 LLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 178 iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +|.+++++++||||++|+|+|+||++||+|+ T Consensus 80 ~m~~~~i~~lpVvd~~~~lvGiIt~~Dilk~ 110 (110) T cd04601 80 LLHEHKIEKLPVVDDEGKLKGLITVKDIEKR 110 (110) T ss_pred HHHHCCCEEEEEECCCCEEEEEEEHHHHCCC T ss_conf 9986685199999889999999995984379 No 25 >cd04617 CBS_pair_4 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.85 E-value=2.6e-21 Score=172.29 Aligned_cols=107 Identities=29% Similarity=0.428 Sum_probs=96.8 Q ss_pred CCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC-----CCCCCCCCCCCCCC--CCCCCCCCCCHH Q ss_conf 87899465430345656676050416786688602788723545001-----43446724400267--863257303589 Q gi|254780889|r 101 NPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF-----ASNAQQAVGELMTR--NLITVKKTVNLE 173 (493) Q Consensus 101 dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~-----~~~~~~~V~eiMt~--~litv~~~~~l~ 173 (493) -|+|++|++|+.||.++|.+++++++||||++ |+|+||+|+||++. ....+.||+++||+ +++|+.+++++. T Consensus 2 pPvtv~~~~si~eA~~~M~~~~ig~l~VVD~~-~~LvGIiT~rDl~r~~~~~~~~~~~pV~~iMT~~P~vvt~~pd~~v~ 80 (118) T cd04617 2 PPVVVRENTSVYDAIVTLFLEDVGSLFVVDED-GDLVGVVSRKDLLKASIGGADLQKVPVGVIMTRMPNITTTTPEESVL 80 (118) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CCEEEEEEHHHHHHHHHCCCCCCCCCHHHHHCCCCCEEEECCCCCHH T ss_conf 79798996999999999998399879998499-95999987899999997489846782999857799719978999399 Q ss_pred HHHHHHHHCCCCEEEEECCCC---CEEEEEEECCCHHC Q ss_conf 999999865533064574799---78688980210001 Q gi|254780889|r 174 NAKALLHQHRIEKLLVVDDDG---CCIGLITVKDIERS 208 (493) Q Consensus 174 eA~~iM~~~ki~~LPVVDe~g---~LvGIIT~~DIlk~ 208 (493) +|+++|.+|++++|||||+++ +++|+||.+||+|. T Consensus 81 ~A~~~M~~~~i~~LPVVd~~~~~~~vvGiIT~~Di~kl 118 (118) T cd04617 81 EAAKKLIEHQVDSLPVVEKVDEGLEVIGRITKTNITKL 118 (118) T ss_pred HHHHHHHHCCCCEECEEECCCCCEEEEEEEECCCEECC T ss_conf 99999998299985679439974089999977515049 No 26 >cd04620 CBS_pair_7 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.85 E-value=6.1e-21 Score=169.62 Aligned_cols=108 Identities=35% Similarity=0.565 Sum_probs=97.2 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC-----CCCCCCCCCCCCCCCCCCCCCCCC--H Q ss_conf 287899465430345656676050416786688602788723545001-----434467244002678632573035--8 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF-----ASNAQQAVGELMTRNLITVKKTVN--L 172 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~-----~~~~~~~V~eiMt~~litv~~~~~--l 172 (493) +||+|++|++|+.||+++|.+++++++||++++ ++++||+|++|++. ....+.+++++|+++++|+.++.. + T Consensus 1 r~pitv~pd~tv~ea~~~M~~~~~~~~~vV~~~-~~lvGIvT~~Di~r~~~~~~~~~~~~v~~vMt~~~itv~~~~~~~i 79 (115) T cd04620 1 RHPLTVTPDTPVADAIALMSQQGDSSCVLVVEK-GRLLGIFTERDIVRLTAIGKDLSDLPIGEVMTQPVVTLQESEIQDI 79 (115) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHHHHCCCCCCCCCHHHHCCCCCEEECCCCCCCH T ss_conf 969996991999999999998399469999379-9299999869999999859981348988963689679838997269 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 999999986553306457479978688980210001 Q gi|254780889|r 173 ENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 173 ~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) .+|+++|.++++++|||||++|+|+|+||.+||+|. T Consensus 80 ~~a~~~M~~~~i~~lpVvd~~g~lvGiit~~Dilkv 115 (115) T cd04620 80 FTALSLFRQHQIRHLPVLDDQGQLIGLVTAESIRQV 115 (115) T ss_pred HHHHHHHHHCCEEEEEEECCCCEEEEEEEHHHCCCC T ss_conf 999999875595289999579979999992562329 No 27 >cd04632 CBS_pair_19 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.85 E-value=4.9e-21 Score=170.31 Aligned_cols=108 Identities=27% Similarity=0.446 Sum_probs=97.9 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC--CC----------------CCCCCCCCCCCCC Q ss_conf 28789946543034565667605041678668860278872354500--14----------------3446724400267 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR--FA----------------SNAQQAVGELMTR 161 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir--~~----------------~~~~~~V~eiMt~ 161 (493) .|++|++|++|+.+|+++|.+++++++||+|++ |+|+||||.+|+. +. ...+.+|+++|++ T Consensus 1 kdvvTv~~~~~l~~a~~lm~~~~i~~lPVvd~~-g~lvGiiT~~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~V~~iM~~ 79 (128) T cd04632 1 EDVITVREDDSVGKAINVLREHGISRLPVVDDN-GKLTGIVTRHDIVDFVVRDRDKARTGDRSGEKERMLDLPVYDAMSS 79 (128) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHHHHCCCHHCCHHHHCCHHHHHHCCCHHHHCCC T ss_conf 998698995929999999998399779999689-9789999889999988532101011221001223406799996679 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCCCEEEEE--CCCCCEEEEEEECCCHHC Q ss_conf 86325730358999999986553306457--479978688980210001 Q gi|254780889|r 162 NLITVKKTVNLENAKALLHQHRIEKLLVV--DDDGCCIGLITVKDIERS 208 (493) Q Consensus 162 ~litv~~~~~l~eA~~iM~~~ki~~LPVV--De~g~LvGIIT~~DIlk~ 208 (493) +++|+++++++.+|.++|.+++++++||| |++|+|+||||++||+|+ T Consensus 80 ~vitv~~~~~i~~a~~~M~~~~i~~lpVv~~dd~~~lvGIiT~~DilrA 128 (128) T cd04632 80 PVITASPNDSVRDAVDRMLENDDSSVVVVTPDDDTKVVGILTKKDVLRA 128 (128) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEEEECCCCEEEEEEEHHHHHCC T ss_conf 9879999293999999999779978966847899989999996996469 No 28 >cd04583 CBS_pair_ABC_OpuCA_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the ABC transporter OpuCA. OpuCA is the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment but the function of the CBS domains in OpuCA remains unknown. In the related ABC transporter, OpuA, the tandem CBS domains have been shown to function as sensors for ionic strength, whereby they control the transport activity through an electronic switching mechanism. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. They are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyz Probab=99.85 E-value=5.4e-21 Score=170.04 Aligned_cols=109 Identities=31% Similarity=0.516 Sum_probs=103.2 Q ss_pred ECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 52878994654303456566760504167866886027887235450014344672440026786325730358999999 Q gi|254780889|r 99 VVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNLENAKAL 178 (493) Q Consensus 99 I~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l~eA~~i 178 (493) ++||+|++|++|+.||+++|.+++++++||+|++ |+++|++|.+|++.......+++++|+++++++++++++.+|.++ T Consensus 1 ~r~pitv~~~~tv~eal~~m~~~~~~~l~Vvd~~-~~lvGivt~~di~~~~~~~~~v~~im~~~~~tv~~~~~~~~a~~~ 79 (109) T cd04583 1 IKNPVTITPDRTLAEAIKLMRDKKVDSLLVVDKD-NKLLGIVSLESLEQAYKEAKSLEDIMLEDVFTVQPDASLRDVLGL 79 (109) T ss_pred CCCCEEECCCCCHHHHHHHHHHHCCCEEEEEECC-CEEEEEEEHHHHHHCCCCCCEEEECCEEEEEEECCCCCHHHHHHH T ss_conf 9989997993979999999997399789999589-989999981043430058967875126202897489999999999 Q ss_pred HHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 986553306457479978688980210001 Q gi|254780889|r 179 LHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 179 M~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) |.+++++++||||++|+|+|+||++|+++. T Consensus 80 m~~~~~~~lPVVd~~~~lvGiiT~~dll~t 109 (109) T cd04583 80 VLKRGPKYVPVVDEDGKLVGLITRSSLVDT 109 (109) T ss_pred HHHCCCCEEEEECCCCEEEEEEEHHHHHCC T ss_conf 985598388899649999999996882376 No 29 >cd04638 CBS_pair_25 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.85 E-value=7.6e-21 Score=168.94 Aligned_cols=106 Identities=38% Similarity=0.592 Sum_probs=98.1 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 28789946543034565667605041678668860278872354500143446724400267863257303589999999 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNLENAKALL 179 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM 179 (493) .||+|++|++|+.||+++|.+++++++||+|++ |+++|++|++|+.. ...+.++.++|+++++++++++++.+|.++| T Consensus 1 kdvvtv~~~~tv~ea~~lm~~~~i~~lpVvd~~-~~lvGivT~~Dl~~-~~~~~~v~~iMt~~~~tv~~~~~~~~a~~~m 78 (106) T cd04638 1 ENVVYVTLPGTRDDVLELLKEYKVSGVPVVKKS-GELVGIITRKDLLR-NPEEEQLALLMTRDPPTVSPDDDVKEAAKLM 78 (106) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CEEEEEEEHHHHHH-CCCCCHHHHHHCCCCEECCCCCCHHHHHHHH T ss_conf 998999994969999999997699989999389-99999989789644-8750244675358973679949199999999 Q ss_pred HHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 86553306457479978688980210001 Q gi|254780889|r 180 HQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 180 ~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) .+++++++|||| +|+|+|+||++||+|+ T Consensus 79 ~~~~i~~lpVvd-~~~lvGiIt~~DilkA 106 (106) T cd04638 79 VENNIRRVPVVD-DGKLVGIVTVADIVRA 106 (106) T ss_pred HHCCCCEEEEEE-CCEEEEEEEHHHHHCC T ss_conf 876975999999-9999999997995359 No 30 >cd04619 CBS_pair_6 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.84 E-value=8.7e-21 Score=168.53 Aligned_cols=106 Identities=25% Similarity=0.381 Sum_probs=97.5 Q ss_pred CEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC------CCCCCCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 7899465430345656676050416786688602788723545001------4344672440026786325730358999 Q gi|254780889|r 102 PVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF------ASNAQQAVGELMTRNLITVKKTVNLENA 175 (493) Q Consensus 102 PVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~------~~~~~~~V~eiMt~~litv~~~~~l~eA 175 (493) .++++|++|+.||.++|.+++++++||+|++ |+++||+|++|+.. ..+.+.||+++||++++|+++++++.+| T Consensus 3 vitV~~~~ti~eA~~~M~~~~ig~l~Vvd~~-g~lvGIiTe~Di~r~~~~~~~~~~~~~V~~vMt~~vitv~~~~~i~~a 81 (114) T cd04619 3 LAKIDVNATLQRAAKILGEPGIDLVVVCDPH-GKLAGVLTKTDVVRQMGRCGGPGCTAPVENVMTRAVVSCRPGDLLHDV 81 (114) T ss_pred EEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHHHHHCCCCCCCCCHHHHCCCCCEEECCCCCHHHH T ss_conf 1298986939999999987499889999899-959999994999999873079866789999615898899998909999 Q ss_pred HHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 999986553306457479978688980210001 Q gi|254780889|r 176 KALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 176 ~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) .++|.+++++++||||++++++|+||++|++|. T Consensus 82 ~~~M~~~~i~~lPVVD~~~~~vGiit~~Dilk~ 114 (114) T cd04619 82 WQVMKQRGLKNIPVVDENARPLGVLNARDALKV 114 (114) T ss_pred HHHHHHCCCCEEEEEECCCEEEEEEEHHHEECC T ss_conf 998887698588999079959999993545429 No 31 >cd04599 CBS_pair_GGDEF_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the GGDEF (DiGuanylate-Cyclase (DGC)) domain. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.84 E-value=1.1e-20 Score=167.78 Aligned_cols=104 Identities=41% Similarity=0.659 Sum_probs=97.1 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 28789946543034565667605041678668860278872354500143446724400267863257303589999999 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNLENAKALL 179 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM 179 (493) +||+|++|++|+.||+++|.+++++++||+++ ++++||||++|++.. ..+.++.++|+++++|+++++++.+|.++| T Consensus 1 rd~vtv~p~~tv~ea~~lM~~~~i~~lpVv~~--~~lvGivT~~Dl~~~-~~~~~v~~iMt~~~itv~~~~~l~~a~~~M 77 (105) T cd04599 1 RDPITIDPLDSVGRAARLMEKHRIGGLPVVED--GKLVGIITSRDVRRA-HPNRLVADAMTREVVTISPEASLLEAKRLM 77 (105) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEECHHHHCC-CCCCCHHHHCCCCCEEECCCCCHHHHHHHH T ss_conf 99899689498999999998639988999999--999899981053127-888796885027989997949199999977 Q ss_pred HHCCCCEEEEECCCCCEEEEEEECCCHH Q ss_conf 8655330645747997868898021000 Q gi|254780889|r 180 HQHRIEKLLVVDDDGCCIGLITVKDIER 207 (493) Q Consensus 180 ~~~ki~~LPVVDe~g~LvGIIT~~DIlk 207 (493) .++++++|||+| +|+|+|+||++||.| T Consensus 78 ~~~~i~~lpVvd-~g~lvGiiT~~Divr 104 (105) T cd04599 78 EEKKIERLPVLR-ERKLVGIITKGTIAL 104 (105) T ss_pred HHCCCCEEEEEE-CCEEEEEEECHHHCC T ss_conf 985997999998-999999999458615 No 32 >cd04629 CBS_pair_16 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.84 E-value=1.3e-20 Score=167.29 Aligned_cols=107 Identities=32% Similarity=0.523 Sum_probs=98.0 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC-C------CCCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 28789946543034565667605041678668860278872354500-1------4344672440026786325730358 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR-F------ASNAQQAVGELMTRNLITVKKTVNL 172 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir-~------~~~~~~~V~eiMt~~litv~~~~~l 172 (493) +||+|++|++|+.||+++|.+++++++||+|++ |+++|++|++|+. . ......+|+++|+++++++++++++ T Consensus 1 r~pvtv~~~~tl~~a~~~m~~~~~~~~pVvd~~-~~lvGiit~~Dll~~~~~~~~~~~~~~~v~~iM~~~~~tv~~~~~~ 79 (114) T cd04629 1 RNPVTFTPDMSVTEAVEKLLKSKISGGPVVDDN-GNLVGFLSEQDCLKQLLESSYHCDGVATVRDIMTTEVLTVSPDDSI 79 (114) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CEEEEEEEHHHHHHHHHHHHCCCCCCCEEHHHCCCCCEEEECCCCH T ss_conf 989997992999999999997199789999489-9299999678999999775405887659188514688799579999 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 999999986553306457479978688980210001 Q gi|254780889|r 173 ENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 173 ~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) .+|.++|.+++++++||+| +|+|+|+||++||+|+ T Consensus 80 ~~a~~~m~~~~~~~lpVvd-~g~lvGiIt~~DilrA 114 (114) T cd04629 80 VDLAQLMLKAKPKRYPVVD-DGKLVGQISRRDVLRA 114 (114) T ss_pred HHHHHHHHHCCCCEEEEEE-CCEEEEEEEHHHHHCC T ss_conf 9999998866983999998-9999999997996339 No 33 >cd04631 CBS_pair_18 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.84 E-value=1.2e-20 Score=167.42 Aligned_cols=109 Identities=33% Similarity=0.595 Sum_probs=100.0 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC-CC---------------CCCCCCCCCCCCCCC Q ss_conf 28789946543034565667605041678668860278872354500-14---------------344672440026786 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR-FA---------------SNAQQAVGELMTRNL 163 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir-~~---------------~~~~~~V~eiMt~~l 163 (493) +||+|++|++|+.||+++|.+++++++||++++.++++|++|.+|+. +. ...+.++.++|++++ T Consensus 1 r~vvtv~~~~tl~ea~~~m~~~~v~~~PVvd~~~~~lvGiit~~Di~~~~~~~~~~~~~~~~~~~~~~~~~v~~im~~~v 80 (125) T cd04631 1 KDVVTVPPTTPIMEAAKIMVRNGFRRLPVVDEGTGKLVGIITATDILKYLGGGEKFNKIKTGNGLEAINEPVRSIMTRNV 80 (125) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEHHHHHHHHHCCCCCCHHHHCCCHHHCCCCHHHHCCCCC T ss_conf 99779799094999999999839988999979999599999999999987335431012101102321789999566898 Q ss_pred CCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 325730358999999986553306457479978688980210001 Q gi|254780889|r 164 ITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 164 itv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +++++++++.+|.++|.+++++++||||++|+++|+||++||+|+ T Consensus 81 ~tv~~~~~l~~a~~~m~~~~i~~lpVvd~~g~~~GiIt~~DilrA 125 (125) T cd04631 81 ITITPDDSIKDAAELMLEKRVGGLPVVDDDGKLVGIVTERDLLKA 125 (125) T ss_pred EEECCCCCHHHHHHHHHHCCCCEEEEEECCCEEEEEEEHHHHHCC T ss_conf 899799998999999986395099999589939999988997369 No 34 >cd04621 CBS_pair_8 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.84 E-value=1.4e-20 Score=166.96 Aligned_cols=106 Identities=32% Similarity=0.488 Sum_probs=95.0 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCC----------------------------CCC Q ss_conf 2878994654303456566760504167866886027887235450014----------------------------344 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFA----------------------------SNA 151 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~----------------------------~~~ 151 (493) .|++|++|++|+.||+++|.+++++++||+|++ |+|+||||.+|+... ... T Consensus 1 kdVitv~p~~tl~ea~~~m~~~~i~~lpVvd~~-g~lvGivT~~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (135) T cd04621 1 EDIATVHPEHSLLHVVDEMEKNGVGRVIVVDDN-GKPVGVITYRDLAFAEFEDNERGLPKKSIKMKRKAGQKRYRYVKEV 79 (135) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHHHHCCCCCCCHHHHHHHHCCCHHHHHHHCCCC T ss_conf 986898994979999999987499779999599-9399999879988666314544310233333101002332101446 Q ss_pred CCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHH Q ss_conf 67244002678632573035899999998655330645747997868898021000 Q gi|254780889|r 152 QQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIER 207 (493) Q Consensus 152 ~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk 207 (493) ..+|+++|+++++|+++++++.+|.++|.++++++||||| +|+|+|+||++||+| T Consensus 80 ~~~v~diMt~~vitv~~d~~l~~a~~~M~~~~i~~lPVvd-~~~lvGiIt~~Dilr 134 (135) T cd04621 80 PLVAEDIMTEEIITVSPNDDVVDAAKLMLEANISGLPVVD-NDNIVGVITKTDICR 134 (135) T ss_pred CCCHHHHCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEEE-CCEEEEEEEHHHHCC T ss_conf 8899893889988998989399999999971998899998-999999998588116 No 35 >cd04600 CBS_pair_HPP_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the HPP motif domain. These proteins are integral membrane proteins with four transmembrane spanning helices. The function of these proteins is uncertain, but they are thought to be transporters. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.84 E-value=1.6e-20 Score=166.55 Aligned_cols=108 Identities=31% Similarity=0.536 Sum_probs=98.7 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC---------------CCCCCCCCCCCCCCCCC Q ss_conf 287899465430345656676050416786688602788723545001---------------43446724400267863 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF---------------ASNAQQAVGELMTRNLI 164 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~---------------~~~~~~~V~eiMt~~li 164 (493) +||+|++|++|+.||+++|.+++++++||+|++ |+++|++|.+|+.. ......+|+++|+++++ T Consensus 2 rdvvtv~~~~tl~~a~~~m~~~~~~~lpVvd~~-g~lvGiit~~Dll~~~~~~~~~~~~~~~~~~~~~~~v~~iM~~~~~ 80 (124) T cd04600 2 RDVVTVTPDTSLEEAWALLRRHRIKALPVVDGD-RRLVGIVTQRDLLRHARPDGRRPLRGRLRGRDKPETVGDIMSPPVV 80 (124) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECC-CCEEEEEEHHHHHHHHHCCCCCCCHHHHHCCCCCCCHHHHCCCCCE T ss_conf 896998993979999999997299789999569-8088885699999877504653101222105578898995779986 Q ss_pred CCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 25730358999999986553306457479978688980210001 Q gi|254780889|r 165 TVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 165 tv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) ++++++++.++.++|.+++++++||||++|+|+|+||++||+|+ T Consensus 81 ~v~~~~~l~~~~~~m~~~~~~~lpVvd~~~~lvGiIt~~Dil~A 124 (124) T cd04600 81 TVRPDTPIAELVPLLADGGHHHVPVVDEDRRLVGIVTQTDLIAA 124 (124) T ss_pred EECCCCCHHHHHHHHHHCCCEEEEEEECCCEEEEEEEHHHHHCC T ss_conf 99898979999999997793499999579989999978997469 No 36 >cd04587 CBS_pair_CAP-ED_DUF294_PBI_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with either the CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain or the PB1 (Phox and Bem1p) domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. The PB1 domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pai Probab=99.83 E-value=2.1e-20 Score=165.75 Aligned_cols=107 Identities=30% Similarity=0.488 Sum_probs=98.3 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC------CCCCCCCCCCCCCCCCCCCCCCCCHH Q ss_conf 287899465430345656676050416786688602788723545001------43446724400267863257303589 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF------ASNAQQAVGELMTRNLITVKKTVNLE 173 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~------~~~~~~~V~eiMt~~litv~~~~~l~ 173 (493) ++|+|++|++|+.||+++|.+++++++||+|+ ++++||+|++|+.. .+..+.+|+++|+++++++++++++. T Consensus 1 t~Pitv~p~~ti~ea~~~M~~~~i~~~~V~d~--~~l~GivT~~Dl~~~~~~~~~~~~~~~v~~iMt~~~~tv~~~~~l~ 78 (113) T cd04587 1 TKPATVSPTTTVQEAAKLMREKRVSCVLVMDG--NKLVGIFTSKDIALRVVAQGLDPESTLVERVMTPNPVCATSDTPVL 78 (113) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEEHHHHHHHHHCCCCHHHCEEEEEECCCCEEEECCCCHH T ss_conf 98989899499999999999729988999989--9999999845987899873998320898866616846990799999 Q ss_pred HHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 99999986553306457479978688980210001 Q gi|254780889|r 174 NAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 174 eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +|.++|.+++++++||+|++|+++|++|++||+|+ T Consensus 79 ~a~~~m~~~~i~~lpVvd~~~~~vGiit~~Dil~A 113 (113) T cd04587 79 EALHLMVQGKFRHLPVVDKSGQVVGLLDVTKLTHA 113 (113) T ss_pred HHHHHHHHCCCEEEEEEECCCEEEEEEEHHHHHCC T ss_conf 99999987794199999269989999984885278 No 37 >cd04615 CBS_pair_2 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.83 E-value=2.5e-20 Score=165.25 Aligned_cols=107 Identities=31% Similarity=0.438 Sum_probs=98.6 Q ss_pred CCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC--CCC---CCCCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 8789946543034565667605041678668860278872354500--143---44672440026786325730358999 Q gi|254780889|r 101 NPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR--FAS---NAQQAVGELMTRNLITVKKTVNLENA 175 (493) Q Consensus 101 dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir--~~~---~~~~~V~eiMt~~litv~~~~~l~eA 175 (493) .|+|++|++|+.+|++.|.+++++++||+|++ ++++||+|.+|+. +.. ..+.+|+++|+++++|+++++++.+| T Consensus 2 ~P~tv~~d~~l~~A~~~m~~~~~~~~pVvd~~-~~lvGivT~~Di~~~~~~~~~~~~~~v~~iMt~~~~ti~~~~~l~~a 80 (113) T cd04615 2 KPSCVVLNTDIARAVAEMYTSGSRALPVVDDK-KRLVGIITRYDVLSYALESEELKDAKVREVMNSPVITIDANDSIAKA 80 (113) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHHHHHCCCCCCCCHHHHCCCCCEEEECCCCHHHH T ss_conf 59598996999999999997399789999489-97999999599999998079824589999275697899089939999 Q ss_pred HHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 999986553306457479978688980210001 Q gi|254780889|r 176 KALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 176 ~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +++|.+++++++||||++|+++|+||++||+|. T Consensus 81 ~~~m~~~~i~~lpVvd~~g~lvGiit~~Dilr~ 113 (113) T cd04615 81 RWLMSNNNISRLPVLDDKGKVGGIVTEDDILRK 113 (113) T ss_pred HHHHHHHCCEEEEEECCCCEEEEEEEHHHHHCC T ss_conf 999997499489999799939999996883269 No 38 >cd04605 CBS_pair_MET2_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the MET2 domain. Met2 is a key enzyme in the biosynthesis of methionine. It encodes a homoserine transacetylase involved in converting homoserine to O-acetyl homoserine. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.83 E-value=2.1e-20 Score=165.78 Aligned_cols=108 Identities=37% Similarity=0.552 Sum_probs=100.3 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC-CCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 28789946543034565667605041678668860278872354500-14344672440026786325730358999999 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR-FASNAQQAVGELMTRNLITVKKTVNLENAKAL 178 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir-~~~~~~~~V~eiMt~~litv~~~~~l~eA~~i 178 (493) ++|++++|++|+.||+++|.+++++++||+|++ |+++|++|.+|+. .......+++++|+++++++++++++.+|+++ T Consensus 2 r~vvtv~~~~tv~~a~~~m~~~~~~~lpVvd~~-~~~vGiit~~Di~~~~~~~~~~v~~im~~~~~~~~~~~~~~~a~~~ 80 (110) T cd04605 2 RPVVTISEDASIKEAAKLMIEENINHLPVVDED-GRLVGIVTSWDISKAVARDKKSVEDIMTRNVITATPDEPIDVAARK 80 (110) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEECHHHHHHHHCCCCCHHHCEEECHHHHHHHHHHHHHHHH T ss_conf 899898993999999999997299889999799-9199999622788898707976336043001201465589999999 Q ss_pred HHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 986553306457479978688980210001 Q gi|254780889|r 179 LHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 179 M~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) |.+++++++||||++|+|+|++|++||+|. T Consensus 81 m~~~~i~~lPVvD~~~~lvGiiT~~Di~kl 110 (110) T cd04605 81 MERHNISALPVVDAENRVIGIITSEDISKL 110 (110) T ss_pred HHHCCCEEEEEECCCCEEEEEEEHHHHCCC T ss_conf 998396199999189989999993772119 No 39 >cd04584 CBS_pair_ACT_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the acetoin utilization proteins in bacteria. Acetoin is a product of fermentative metabolism in many prokaryotic and eukaryotic microorganisms. They produce acetoin as an external carbon storage compound and then later reuse it as a carbon and energy source during their stationary phase and sporulation. In addition these CBS domains are associated with a downstream ACT domain, which is linked to a wide range of metabolic enzymes that are regulated by amino acid concentration. Pairs of ACT domains bind specifically to a particular amino acid leading to regulation of the linked enzyme. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The in Probab=99.83 E-value=3.6e-20 Score=164.11 Aligned_cols=107 Identities=44% Similarity=0.664 Sum_probs=97.9 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCC--------------CCCCCCCCCCCCCCCCC Q ss_conf 2878994654303456566760504167866886027887235450014--------------34467244002678632 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFA--------------SNAQQAVGELMTRNLIT 165 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~--------------~~~~~~V~eiMt~~lit 165 (493) +||+|++|++|+.||+++|.+++++++||+|++ |+++|++|.+|++.. .....+|+++|++++++ T Consensus 1 r~vvtv~~~~sl~~a~~lm~~~~~~~lpVvd~~-~~lvGiit~~Di~~~~~~~~~~~~~~~~~~~~~~~V~~iM~~~~~t 79 (121) T cd04584 1 RDVVTITPTTTIAEALELMREHKIRHLPVVDEE-GRLVGIVTDRDLRDASPSPFTTLSEHELYLLLKMPVKEIMTKDVIT 79 (121) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHHHCCCCCHHHHHHCCCCCCCCHHHHCCCCCCE T ss_conf 998896892989999999997299889999089-9399996589987522012200244420251468999966799858 Q ss_pred CCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 5730358999999986553306457479978688980210001 Q gi|254780889|r 166 VKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 166 v~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +++++++.+|.++|.+++++++||+| +|+|+|+||++||+|. T Consensus 80 v~~~~~l~~a~~~m~~~~i~~lpVvd-~~~lvGivt~~Dilr~ 121 (121) T cd04584 80 VHPLDTVEEAALLMREHRIGCLPVVE-DGRLVGIITETDLLRT 121 (121) T ss_pred ECCCCCHHHHHHHHHHCCCCEEEEEE-CCEEEEEEEHHHHHCC T ss_conf 98999599999998866961999999-9999999982260066 No 40 >cd04637 CBS_pair_24 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.83 E-value=3.8e-20 Score=163.98 Aligned_cols=106 Identities=31% Similarity=0.453 Sum_probs=96.6 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC-CC--------------CCCCCCCCCCCCCCCC Q ss_conf 28789946543034565667605041678668860278872354500-14--------------3446724400267863 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR-FA--------------SNAQQAVGELMTRNLI 164 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir-~~--------------~~~~~~V~eiMt~~li 164 (493) ++++|++|++|+.+|+++|.+++++++||+|+ |+|+|+||++|+. .. .....+|+++|+++++ T Consensus 1 T~viTv~p~~tl~~a~~~m~~~~i~~lpVvd~--~~lvGivT~~Di~~~~~~~~~~~~~~~~~~~~~~~~v~~iM~~~~i 78 (122) T cd04637 1 TRVVTVEMDDRLEEVREIFEKHKFHHLLVVED--NELVGVISDRDYLKAISPFLGTAGETEKDLATLNRRAHQIMTRDPI 78 (122) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHHHHHHCCCCCHHHHCCCHHHCCCHHHHCCCCCE T ss_conf 97499889696999999998749988999989--9999998889999875002453102211212306899994638983 Q ss_pred CCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHH Q ss_conf 2573035899999998655330645747997868898021000 Q gi|254780889|r 165 TVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIER 207 (493) Q Consensus 165 tv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk 207 (493) ++++++++.+|.++|.+++++++||||++|+|+|+||++||+| T Consensus 79 tv~~~~~l~~a~~~m~~~~i~~lPVvd~~~~lvGiIT~~Diik 121 (122) T cd04637 79 TVSPDTPVDEASKLLLENSISCLPVVDENGQLIGIITWKDLLK 121 (122) T ss_pred EECCCCCHHHHHHHHHHCCCEEEEEECCCCEEEEEEEHHHHHC T ss_conf 9999984999999999749308999968998999999688327 No 41 >cd04636 CBS_pair_23 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.83 E-value=3.8e-20 Score=163.95 Aligned_cols=107 Identities=34% Similarity=0.568 Sum_probs=96.9 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCC-------------------------CCCCCC Q ss_conf 2878994654303456566760504167866886027887235450014-------------------------344672 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFA-------------------------SNAQQA 154 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~-------------------------~~~~~~ 154 (493) +||+|++|++|+.||+++|.+++++++||+|++ |+|+|++|.+|+... .....+ T Consensus 1 rdvvtv~~~~tl~ea~~~m~~~~~~~lpVvd~~-~~lvGiit~~Dll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (132) T cd04636 1 KDVITVKKDDTLRDVVEILLTGKISGVPVVDNE-GRVVGIVSEGDLIRKIYKGKGLFYVTLLYSVIFLDESKIKKLLGKK 79 (132) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHCCCCCCC T ss_conf 998796891979999999997299789999599-9399998889999998626762000000000000233310203899 Q ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 440026786325730358999999986553306457479978688980210001 Q gi|254780889|r 155 VGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 155 V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) |+++|+++++|+.+++++.+|.++|.+++++++|||| +|+|+|+||++||+|+ T Consensus 80 v~diMt~~~~tv~~~~~l~~a~~~m~~~~i~~lpVvd-~~~lvGiIt~~Dilra 132 (132) T cd04636 80 VEEIMTKKVITVDEDTTIEDVARIMSKKNIKRLPVVD-DGKLVGIISRGDIIRS 132 (132) T ss_pred HHHHCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEEE-CCEEEEEEEHHHHHCC T ss_conf 9996789987999979499999999962987899998-9999999996882349 No 42 >cd04582 CBS_pair_ABC_OpuCA_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the ABC transporter OpuCA. OpuCA is the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment but the function of the CBS domains in OpuCA remains unknown. In the related ABC transporter, OpuA, the tandem CBS domains have been shown to function as sensors for ionic strength, whereby they control the transport activity through an electronic switching mechanism. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. They are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzi Probab=99.83 E-value=4.4e-20 Score=163.48 Aligned_cols=105 Identities=29% Similarity=0.462 Sum_probs=98.3 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 28789946543034565667605041678668860278872354500143446724400267863257303589999999 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNLENAKALL 179 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM 179 (493) ++|+|++|++|+.||+++|.+++++++||||++ |+|+|+||.+|+... ...++.++|++.++|+++++++.+|+++| T Consensus 1 t~pvtv~pd~~l~eA~~lm~~~~~~~l~VVd~~-g~l~Givt~~Dl~~~--~~~~v~d~m~~~~~tv~~d~~l~~a~~~M 77 (106) T cd04582 1 TEPITVRPDDPLSDALGLMDDSDLRALTVVDAD-GQPLGFVTRREAARA--SGGCCGDHAEPFKVTVSVDDDLRIVLSRM 77 (106) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHH--CCCCHHHCCCCCCEEECCCCCHHHHHHHH T ss_conf 998389998939999999997399879999799-978999899998763--08953451025766997999999999999 Q ss_pred HHCCCCEEEEECCCCCEEEEEEECCCHH Q ss_conf 8655330645747997868898021000 Q gi|254780889|r 180 HQHRIEKLLVVDDDGCCIGLITVKDIER 207 (493) Q Consensus 180 ~~~ki~~LPVVDe~g~LvGIIT~~DIlk 207 (493) .+++++++||||++|+|+|+||++||++ T Consensus 78 ~~~~i~~lPVVD~~grlvGivT~~Di~~ 105 (106) T cd04582 78 FAHDMSWLPCVDEDGRYVGEVTQRSIAD 105 (106) T ss_pred HHCCCCEEEEECCCCEEEEEEEHHHHHC T ss_conf 9729876258989990999998688425 No 43 >cd04588 CBS_pair_CAP-ED_DUF294_assoc_arch This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the archaeal CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. Probab=99.83 E-value=4.9e-20 Score=163.12 Aligned_cols=107 Identities=38% Similarity=0.599 Sum_probs=98.6 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC---CCCCCCCCCCCCCCCCCCCCCCCCHHHHH Q ss_conf 287899465430345656676050416786688602788723545001---43446724400267863257303589999 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF---ASNAQQAVGELMTRNLITVKKTVNLENAK 176 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~---~~~~~~~V~eiMt~~litv~~~~~l~eA~ 176 (493) ++++|++|++|+.||+++|.+++++++||+|+ ++++||+|.+|+.. ......+|+++|+++++++++++++.+|. T Consensus 1 r~vvtv~p~~sl~ea~~~m~~~~i~~~pVvd~--~~lvGiit~~Di~~~~~~~~~~~~V~~~M~~~~~tv~~d~~l~~~~ 78 (110) T cd04588 1 KPLITLNPNATLREAARLFNTHHIHGAPVVDD--GKLVGIVTLSDIAHAIARGLELAKVKDVMTKDVITIDEDEQLYDAI 78 (110) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHHHHHHCCCCCCHHHHHCCCCCCEECCCCCHHHHH T ss_conf 99479689498999999999709988999989--9999999668989998737977803452048972985899699999 Q ss_pred HHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 99986553306457479978688980210001 Q gi|254780889|r 177 ALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 177 ~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) ++|.+++++++||+|++|+++||||++|++|+ T Consensus 79 ~~m~~~~v~~l~Vvd~~~~lvGIiT~~Dllka 110 (110) T cd04588 79 RLMNKHNVGRLIVTDDEGRPVGIITRTDILRS 110 (110) T ss_pred HHHHHCCCCEEEEEECCCEEEEEEEHHHHHCC T ss_conf 99987799789999669999999982483468 No 44 >cd04585 CBS_pair_ACT_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the acetoin utilization proteins in bacteria. Acetoin is a product of fermentative metabolism in many prokaryotic and eukaryotic microorganisms. They produce acetoin as an external carbon storage compound and then later reuse it as a carbon and energy source during their stationary phase and sporulation. In addition these CBS domains are associated with a downstream ACT domain, which is linked to a wide range of metabolic enzymes that are regulated by amino acid concentration. Pairs of ACT domains bind specifically to a particular amino acid leading to regulation of the linked enzyme. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The i Probab=99.83 E-value=3.1e-20 Score=164.60 Aligned_cols=107 Identities=45% Similarity=0.709 Sum_probs=97.8 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCC---------------CCCCCCCCCCCCCCC Q ss_conf 28789946543034565667605041678668860278872354500143---------------446724400267863 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFAS---------------NAQQAVGELMTRNLI 164 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~---------------~~~~~V~eiMt~~li 164 (493) +||+|++|++|+.||+++|.+++++++||+|+ |+++|++|.+|++..- ....+|.++|+++++ T Consensus 1 rdpvti~~~~tl~ea~~~m~~~~~~~lpVvd~--~~lvGiit~~Di~~~~~~~~~~~~~~~~~~~~~~~~v~~im~~~~~ 78 (122) T cd04585 1 KNPITVTPDTSLMEALKLMKENSIRRLPVVDR--GKLVGIVTDRDLKLASPSKATTLDIWELYYLLSKIKVSDIMTRDPI 78 (122) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHHHHHCCCCCCCHHHHHCCCCCCCHHHHCCCCCE T ss_conf 99899689296999999999739988999989--9999999889999877504655333554212038899995778988 Q ss_pred CCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 25730358999999986553306457479978688980210001 Q gi|254780889|r 165 TVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 165 tv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) ++++++++.+|.++|.+++++++||||++|+++|+||++||+|+ T Consensus 79 tv~~~~~l~~a~~~m~~~~i~~lpVvd~~g~lvGiiT~~DilkA 122 (122) T cd04585 79 TVSPDASVEEAAELMLERKISGLPVVDDQGRLVGIITESDLFRA 122 (122) T ss_pred EEECCCCHHHHHHHHHHCCCCEEEEEECCCEEEEEEEHHHHHCC T ss_conf 99388669999999987598789999589989999987897469 No 45 >cd04624 CBS_pair_11 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.82 E-value=5.2e-20 Score=162.97 Aligned_cols=107 Identities=32% Similarity=0.544 Sum_probs=99.5 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC----CCCCCCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 287899465430345656676050416786688602788723545001----4344672440026786325730358999 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF----ASNAQQAVGELMTRNLITVKKTVNLENA 175 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~----~~~~~~~V~eiMt~~litv~~~~~l~eA 175 (493) ++|+|++|++|+.||+++|.+++++++||+|++ ++++|++|++|+.. ..+.+.+++++|+++++++++++++.+| T Consensus 1 r~vvtv~~~~tl~ea~~~m~~~~~~~~~Vvd~~-~~lvGiiT~~Di~~~~~~~~~~~~~v~~~m~~~~~tv~~~~~l~~a 79 (112) T cd04624 1 RPVVTVDPDTSIREAAKLMAEENVGSVVVVDPD-ERPIGIVTERDIVRAVAAGIDLDTPVSEIMTRDLVTVDPDEPVAEA 79 (112) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEECCHHHHHHHCCCCCCCCEEEEECCCCEEECCCCCHHHH T ss_conf 997897893959999999986299889999399-8299999820458898648743442045651671678799989999 Q ss_pred HHHHHHCCCCEEEEECCCCCEEEEEEECCCHH Q ss_conf 99998655330645747997868898021000 Q gi|254780889|r 176 KALLHQHRIEKLLVVDDDGCCIGLITVKDIER 207 (493) Q Consensus 176 ~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk 207 (493) +++|.+++++++||||++|+++|+||++||+| T Consensus 80 ~~~m~~~~~~~l~Vvd~~g~lvGiit~~Dilk 111 (112) T cd04624 80 AKLMRKNNIRHHLVVDKGGELVGVISIRDLVR 111 (112) T ss_pred HHHHHHCCCEEEEEECCCCEEEEEEEHHHHCC T ss_conf 99999749839989989999999998077256 No 46 >cd04622 CBS_pair_9 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.82 E-value=4.1e-20 Score=163.68 Aligned_cols=107 Identities=36% Similarity=0.687 Sum_probs=97.9 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHH--CC----CCCCCCCCCCCCCCCCCCCCCCCCHH Q ss_conf 2878994654303456566760504167866886027887235450--01----43446724400267863257303589 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDV--RF----ASNAQQAVGELMTRNLITVKKTVNLE 173 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDi--r~----~~~~~~~V~eiMt~~litv~~~~~l~ 173 (493) +||+|++|++|+.||+++|.+++++++||+|+ ++++|++|++|+ +. ....+.+|+++|+++++++++++++. T Consensus 1 r~~vti~~~~tl~~a~~~m~~~~~~~v~V~d~--~~~vGiiT~~Di~~~~~~~~~~~~~~~v~~iM~~~~i~v~~~~~l~ 78 (113) T cd04622 1 RDVVTVSPDDTIREAARLMREHDVGALPVCEN--DRLVGIVTDRDIVVRAVAEGRDPDTTTVGDVMTRGVVTVTEDDDVD 78 (113) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEECHHHHHHHHHCCCCHHHCCHHHHHCCCCEEEECCCCHH T ss_conf 99779689398999999999729988999989--9999999632788999973998200776786415767998899879 Q ss_pred HHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 99999986553306457479978688980210001 Q gi|254780889|r 174 NAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 174 eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +|.++|.+++++++||+|++++++|+||++||+|+ T Consensus 79 ~a~~~m~~~~i~~lpVvd~~~~~vGiIt~~Dilra 113 (113) T cd04622 79 EAARLMREHQVRRLPVVDDDGRLVGIVSLGDLARA 113 (113) T ss_pred HHHHHHHHCCCCEEEEEECCCEEEEEEEHHHHHCC T ss_conf 99999985796589999089989999997995477 No 47 >cd04586 CBS_pair_BON_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the BON (bacterial OsmY and nodulation domain) domain. BON is a putative phospholipid-binding domain found in a family of osmotic shock protection proteins. It is also found in some secretins and a group of potential haemolysins. Its likely function is attachment to phospholipid membranes. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.82 E-value=5e-20 Score=163.06 Aligned_cols=107 Identities=36% Similarity=0.611 Sum_probs=96.6 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC---------------------------CCCCC Q ss_conf 287899465430345656676050416786688602788723545001---------------------------43446 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF---------------------------ASNAQ 152 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~---------------------------~~~~~ 152 (493) +||+|++|++|+.||+++|.+++++++||+|++ |+++|++|.+|+.. ..... T Consensus 2 ~~vitv~~~~tl~~a~~~m~~~~i~~lpVvd~~-~~lvGiit~~Dil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (135) T cd04586 2 TDVVTVSPETSVAEAARLMLDNHISGLPVVDDD-GRLVGIVSEGDLLRRAELGTERRRARWLDLLAGAEELAAAFVRSHG 80 (135) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECC-CEEEEEEEHHHHHHHHHCCCCCCHHHHHHHHCCCHHHHHHHCCCCC T ss_conf 999899993939999999997399679999089-9799999818976445415654114555430231034554304479 Q ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 72440026786325730358999999986553306457479978688980210001 Q gi|254780889|r 153 QAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 153 ~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) .+|+++|+++++++++++++.+|.++|.+++++++|||| +|+++|+||++||+|+ T Consensus 81 ~~V~~iM~~~~~~v~~~~~l~~~~~~m~~~~i~~lpVvd-~~~lvGiIt~~DilkA 135 (135) T cd04586 81 RKVADVMTRPVVTVGEDTPLAEVAELMEEHRIKRVPVVR-GGRLVGIVSRADLLRA 135 (135) T ss_pred CCHHHHCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEEE-CCEEEEEEEHHHHHCC T ss_conf 898995789987995979599999999975960999999-9999999997996459 No 48 >cd04593 CBS_pair_EriC_assoc_bac_arch This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in bacteria and archaea. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS d Probab=99.82 E-value=5.9e-20 Score=162.57 Aligned_cols=107 Identities=28% Similarity=0.365 Sum_probs=97.8 Q ss_pred CCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC-----CCCCCCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 87899465430345656676050416786688602788723545001-----4344672440026786325730358999 Q gi|254780889|r 101 NPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF-----ASNAQQAVGELMTRNLITVKKTVNLENA 175 (493) Q Consensus 101 dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~-----~~~~~~~V~eiMt~~litv~~~~~l~eA 175 (493) .|+|++|++|+.||+++|.+++++++||+|++ |+|+||||.+|++- ......++.++|+++++|+++++++.+| T Consensus 2 pP~tV~~~~tv~ea~~~m~~~~~~~~pVvD~~-g~lvGIvT~~Dl~r~~~~~~~~~~~~v~~vmt~~~~tv~pd~~l~~a 80 (115) T cd04593 2 PPPVLSATTPLREAAEQLIESKHGSALVVDRD-GGVVGIITLPDLLRALEADEAGEPSAVDEVATPPLLTVHPDEPLAHA 80 (115) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHHHHCCCCCCCCEEEEEECCCCEEECCCCHHHHH T ss_conf 78388991999999999987499899999499-98899988999999997388866779454303896689999949999 Q ss_pred HHHHHHCCCCEEEEECCC--CCEEEEEEECCCHHC Q ss_conf 999986553306457479--978688980210001 Q gi|254780889|r 176 KALLHQHRIEKLLVVDDD--GCCIGLITVKDIERS 208 (493) Q Consensus 176 ~~iM~~~ki~~LPVVDe~--g~LvGIIT~~DIlk~ 208 (493) +++|.++++++|||||++ |+++|++|++||+++ T Consensus 81 l~~M~~~~i~~LPVVd~~~~~~lvGiit~~Dil~A 115 (115) T cd04593 81 LDRMASRGLRQLPVVDRGNPGQVLGLLTRENVLLA 115 (115) T ss_pred HHHHHHCCCCEEEEEECCCCCEEEEEEEHHHHHCC T ss_conf 99999869988999978999989999997996569 No 49 >cd04612 CBS_pair_SpoIVFB_EriC_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with either the SpoIVFB domain (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) or the chloride channel protein EriC. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A ), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. EriC is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase an Probab=99.81 E-value=1.2e-19 Score=160.47 Aligned_cols=107 Identities=36% Similarity=0.519 Sum_probs=98.4 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCC----CCCCCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 2878994654303456566760504167866886027887235450014----344672440026786325730358999 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFA----SNAQQAVGELMTRNLITVKKTVNLENA 175 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~----~~~~~~V~eiMt~~litv~~~~~l~eA 175 (493) .||+|++|++|+.||+++|.+++++++||+|+ |+++|+||.+|++.. .....++.++|+++++++++++++.+| T Consensus 1 Pdvvtv~p~~tl~~a~~~m~~~~~~~lpVvd~--~~l~Givt~~di~~~~~~~~~~~~~v~~im~~~~~~v~~~~~l~~a 78 (111) T cd04612 1 PDVVTVPVDLTVDEVLALMFGERHRGYPVVDD--GRLVGIVTLADIRRVPAEGREATVLVGDVMTRDPVTASPDETLRDA 78 (111) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCCCCHHHH T ss_conf 98499999295999999999719948999999--9999999899999877514777756999970796052999979999 Q ss_pred HHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 999986553306457479978688980210001 Q gi|254780889|r 176 KALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 176 ~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +++|.+++++++||||++|+++|+||++||+|+ T Consensus 79 ~~~m~~~~~~~lpVvd~~g~l~GiiT~~DilrA 111 (111) T cd04612 79 LKRMAERDIGRLPVVDDSGRLVGIVSRSDLLRA 111 (111) T ss_pred HHHHHHCCCCEEEEEECCCEEEEEEEHHHHHCC T ss_conf 999997698199999689989999986886349 No 50 >pfam03060 NPD 2-nitropropane dioxygenase. Members of this family catalyse the denitrification of a number of nitroalkanes using either FAD or FMN as a cofactor. Probab=99.81 E-value=2.1e-18 Score=151.51 Aligned_cols=233 Identities=24% Similarity=0.370 Sum_probs=157.1 Q ss_pred EEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECC-CCCHHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHH Q ss_conf 343038816003178337852368889999997798799978-9898999999998742238815287899465430345 Q gi|254780889|r 36 STRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHR-NFSPSEQVAQVHQVKKFESGMVVNPVTISPYATLADA 114 (493) Q Consensus 36 ~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHr-n~sie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea 114 (493) +|+||+-+.++.||+.+||..|+.++||.|.++.||+|+|+. .+++|+..++++++|+.- ..|+-+. T Consensus 1 kt~~t~~lgi~~PIiqapM~~vs~~~LaaAVs~AGglG~i~~~~~~~e~l~~~i~~~~~~t----~~pfgvn-------- 68 (330) T pfam03060 1 RTRFTDLLKIKYPIVQPPMGGISTPELAAAVSEAGGLGVLGAGYLTPDRLEKEIRKVKELT----DKPFGVN-------- 68 (330) T ss_pred CCHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHHHHHH----CCCCCCC-------- T ss_conf 9666687589828798887767869999999967988861277699999999999999864----8876313-------- Q ss_pred HHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 65667605041678668860278872354500143446724400267863257303589999999865533064574799 Q gi|254780889|r 115 LALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDG 194 (493) Q Consensus 115 ~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g 194 (493) ++...+ ... +...............+..+. T Consensus 69 ------------------------l~~~~~------~~~--------~~~~~~~~~~~~~~~~~~~~~------------ 98 (330) T pfam03060 69 ------------------------LFLPQP------AEG--------DDFALDPEEHLNAALKLALEY------------ 98 (330) T ss_pred ------------------------CCCCCC------CCC--------HHHHHHHHHHHHHHHHHHHHC------------ T ss_conf ------------------------337885------431--------456666775013678888751------------ Q ss_pred CEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCC Q ss_conf 78688980210001017422326568765556651370156789998731366699824841138899999999985798 Q gi|254780889|r 195 CCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPS 274 (493) Q Consensus 195 ~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~ 274 (493) + .++..+....+.+.+++++..- -|... -+.++++|+. . T Consensus 99 ----------------------------------~-~~~~~~~~~~~~~~~~~~v~~~--~G~p~--~~~v~~~~~~--G 137 (330) T pfam03060 99 ----------------------------------G-VPDYGDDDDSLKDAKPKVVSFG--FGLPP--EDVIERLKES--G 137 (330) T ss_pred ----------------------------------C-CCCHHHHHHHHHHCCCCEEEEC--CCCCH--HHHHHHHHHC--C T ss_conf ----------------------------------6-9837999999997499999989--98982--7999999987--9 Q ss_pred CCEECCCCCCHHHHHHHHHCCCCEEEE-EECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEE Q ss_conf 724514646999999998548758996-1027876653100015743100378899986226981997265437776213 Q gi|254780889|r 275 LLVMAGNIATAEGALALIDAGADIIKV-GIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAK 353 (493) Q Consensus 275 ~~ii~GNv~t~~~~~~l~~~Gad~ikv-Gig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~k 353 (493) +. +.+.|.|.+.++.+.++|+|+|-+ |...|..+-+.. +.+.+..+.++.++ .++||||-|||.+..+++. T Consensus 138 ~~-v~~~v~s~~~A~~a~~~G~D~iV~qG~eAGGH~G~~~----~~~~~~L~~~v~~~---~~iPvIaAGGI~dg~~iaa 209 (330) T pfam03060 138 TK-VIPTVSSAKEARKAEAAGADAVVAQGPEAGGHRGTEV----GTGTFLLVPTVVDA---VDIPVIAAGGIADGRGIAA 209 (330) T ss_pred CE-EEEECCCHHHHHHHHHCCCCEEEEECCCCCCCCCCCC----CCCHHHHHHHHHHH---CCCCEEEECCCCCHHHHHH T ss_conf 98-9998189999999998199989996676677788877----73077778999987---1697785266289999999 Q ss_pred EEEECCCEEEECCHHHCCCCCCCCEE Q ss_conf 36714878993230101216776448 Q gi|254780889|r 354 AIAAGSACVMIGSLLAGTDESPGDIF 379 (493) Q Consensus 354 Ala~GA~~VM~G~~~agt~Espg~~~ 379 (493) ||++||+.|+||+-|..|.||+..-. T Consensus 210 alalGA~gV~mGTrFlat~Es~~~~~ 235 (330) T pfam03060 210 ALALGAEGVQIGTRFLATKEATAPPV 235 (330) T ss_pred HHHCCCCEEEECCEEEECCCCCCCHH T ss_conf 99679989997130011577789999 No 51 >cd04643 CBS_pair_30 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.81 E-value=8.8e-20 Score=161.31 Aligned_cols=106 Identities=29% Similarity=0.452 Sum_probs=95.3 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC----------CCCCCCCCCCCCCCCCCCCCCC Q ss_conf 287899465430345656676050416786688602788723545001----------4344672440026786325730 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF----------ASNAQQAVGELMTRNLITVKKT 169 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~----------~~~~~~~V~eiMt~~litv~~~ 169 (493) .++++++|++|+.+|+++|.+++++++||+|++ |+|+|+||.+|+.. ....+.+|+++|++++.++.++ T Consensus 1 k~Vvtv~~~~tl~~A~~~m~~~~i~~lPVvd~~-gklvGiit~~Di~~~~~~~~~~~~~~~~~~~V~~iM~~~~~tv~~~ 79 (116) T cd04643 1 EEVAYVQDTNTLRHALLVLTKHGYSAIPVLDKE-GKYVGTISLTDILWKLKGLENLDLERLVDLKVIDVMNTDVPVIIDD 79 (116) T ss_pred CCEEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHHHHCCCCCCHHHHCCCCHHHHHCCCCEEEECC T ss_conf 976999994929999999997499879898699-9499998899999987436532202311668779515698899079 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 358999999986553306457479978688980210001 Q gi|254780889|r 170 VNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 170 ~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +++.++.++|.++++ +||||++|+|+||||++||+|+ T Consensus 80 ~~i~~~~~~m~~~~~--lpVVD~~~~lvGIITr~DilkA 116 (116) T cd04643 80 ADIEEILHLLIDQPF--LPVVDDDGIFIGIITRREILKA 116 (116) T ss_pred CCHHHHHHHHHHCCE--EEEEECCCEEEEEEEHHHHHCC T ss_conf 999999999987898--9999459999999992795579 No 52 >cd04639 CBS_pair_26 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.81 E-value=1.3e-19 Score=160.20 Aligned_cols=107 Identities=31% Similarity=0.455 Sum_probs=98.6 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC---CCCCCCCCCCCCCCCCCCCCCCCCHHHHH Q ss_conf 287899465430345656676050416786688602788723545001---43446724400267863257303589999 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF---ASNAQQAVGELMTRNLITVKKTVNLENAK 176 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~---~~~~~~~V~eiMt~~litv~~~~~l~eA~ 176 (493) ++.+|++|++|+.+|+++|.+++++++||+|++ |+++|+||.+|+.. ....+.+|+++|+++++++++++++.+|+ T Consensus 1 T~~~Tv~p~~tl~~a~~~~~~~~~~~~PVvd~~-g~lvGivt~~di~~~l~~~~~~~~V~~vM~~~v~tv~~~~~l~~~~ 79 (111) T cd04639 1 THFETLSPADTLDDAADALLATTQHEFPVVDGD-GHLVGLLTRDDLIRALAEGGPDAPVRGVMRRDFPTVSPSATLDAVL 79 (111) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECC-CCEEEEEEHHHHHHHHHCCCCCCCHHHHCCCCCEEECCCCCHHHHH T ss_conf 988792998979999999997199789999389-9889999899999998726999705763546987998999499999 Q ss_pred HHHHHCCCCEEEEECCCCCEEEEEEECCCHH Q ss_conf 9998655330645747997868898021000 Q gi|254780889|r 177 ALLHQHRIEKLLVVDDDGCCIGLITVKDIER 207 (493) Q Consensus 177 ~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk 207 (493) ++|.+++++++||||++|+|+|+||++||++ T Consensus 80 ~~m~~~~~~~lpVVd~~g~lvGivT~~Di~E 110 (111) T cd04639 80 RLMQQGGAPAVPVVDGSGRLVGLVTLENVGE 110 (111) T ss_pred HHHHHCCCCEEEEECCCCEEEEEEEHHHHHC T ss_conf 9999589988989957992999999788544 No 53 >cd04613 CBS_pair_SpoIVFB_EriC_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with either the SpoIVFB domain (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) or the chloride channel protein EriC. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A ), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. EriC is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase a Probab=99.81 E-value=1.3e-19 Score=160.14 Aligned_cols=108 Identities=29% Similarity=0.423 Sum_probs=98.1 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCC-----CCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 2878994654303456566760504167866886027887235450014-----34467244002678632573035899 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFA-----SNAQQAVGELMTRNLITVKKTVNLEN 174 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~-----~~~~~~V~eiMt~~litv~~~~~l~e 174 (493) +||+|++|++|+.||+++|.+++++++||+|++ |+|+|++|.+|++.. .....+++++|+++++++++++++.+ T Consensus 1 rdvvtv~~~~tl~ea~~~m~~~~~~~~pVvd~~-~~lvGiit~~Dl~~~l~~~~~~~~~~v~~im~~~~~~v~~~~~l~~ 79 (114) T cd04613 1 RDVVTIPEDTPLNELLDVIAHSPENNFPVVDDD-GRLVGIVSLDDIREILFDPSLYDLVVASDIMTKPPVVVYPEDSLED 79 (114) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHHHHHCCCCCCCEEEHHCCCCCEEEECCCCHHH T ss_conf 997695890989999999986199789999289-9499999999999999705665572931203629659913678899 Q ss_pred HHHHHHHCCCCEEEEECC-CCCEEEEEEECCCHHC Q ss_conf 999998655330645747-9978688980210001 Q gi|254780889|r 175 AKALLHQHRIEKLLVVDD-DGCCIGLITVKDIERS 208 (493) Q Consensus 175 A~~iM~~~ki~~LPVVDe-~g~LvGIIT~~DIlk~ 208 (493) |+++|.+++++++||||+ +|+++|+||++||+++ T Consensus 80 a~~~m~~~~~~~lPVVd~~~g~lvGiit~~Dil~A 114 (114) T cd04613 80 ALKKFEDSDYEQLPVVDDDPGKLLGILSRSDLLSA 114 (114) T ss_pred HHHHHHHCCCEEEEEEECCCCEEEEEEEHHHHHCC T ss_conf 99999975975999998999989999998995359 No 54 >cd04625 CBS_pair_12 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.81 E-value=9.6e-20 Score=161.05 Aligned_cols=105 Identities=27% Similarity=0.460 Sum_probs=95.3 Q ss_pred CCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC------CCCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 87899465430345656676050416786688602788723545001------434467244002678632573035899 Q gi|254780889|r 101 NPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF------ASNAQQAVGELMTRNLITVKKTVNLEN 174 (493) Q Consensus 101 dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~------~~~~~~~V~eiMt~~litv~~~~~l~e 174 (493) +.+|++|++|+.||+++|.+++++++||+++ |+|+||+|+||+.. ....+.+|+++|+++++++++++++.+ T Consensus 2 ~V~Tv~pd~~l~eA~~~M~~~~i~~lvV~~~--g~lvGIiT~rDi~~~~~~~~~~~~~~~V~~iMt~~~~tv~~~~~i~~ 79 (112) T cd04625 2 TIYTVAPETLLSEAVATMAEQDLGSLVVMER--GELVGLLTFREVLQAMAQHGAGVLDTTVRAIMNPEPIVASPDDSIDE 79 (112) T ss_pred CEEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHHHHHHCCCCCCCCCHHHHEECCCEEECCCCHHHH T ss_conf 4799899597999999998749987999579--99999998799999999709980117088833489779989996999 Q ss_pred HHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 9999986553306457479978688980210001 Q gi|254780889|r 175 AKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 175 A~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) |.++|.++++++|||+| +|+|+|+||++||+|+ T Consensus 80 a~~~M~~~~i~~lpVvd-~g~lvGiit~~Di~rA 112 (112) T cd04625 80 VRRLMVERHLRYLPVLD-GGTLLGVISFHDVAKA 112 (112) T ss_pred HHHHHHHCCCEEEEEEE-CCEEEEEEEHHHHHCC T ss_conf 99976866976999999-9999999997996359 No 55 >cd04623 CBS_pair_10 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.81 E-value=1.1e-19 Score=160.77 Aligned_cols=107 Identities=37% Similarity=0.607 Sum_probs=97.7 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC-C-----CCCCCCCCCCCCCCCCCCCCCCCCHH Q ss_conf 28789946543034565667605041678668860278872354500-1-----43446724400267863257303589 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR-F-----ASNAQQAVGELMTRNLITVKKTVNLE 173 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir-~-----~~~~~~~V~eiMt~~litv~~~~~l~ 173 (493) +||++++|++|+.||+++|.+++++++||+|++ ++++|++|++|+. . ....+.+|+++|+++++++.+++++. T Consensus 1 r~vvti~~~~tl~ea~~~m~~~~~~~l~Vvd~~-~~l~Giit~~Di~~~~~~~~~~~~~~~V~~iM~~~~~~v~~~~~l~ 79 (113) T cd04623 1 RDVITVRPDATVAEAAKLMAEKNIGAVVVVDDG-GRLVGIFSERDIVRKVALRGASALDTPVSEIMTRNVITVTPDDTVD 79 (113) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHHHHHCCCCHHHCHHHHHCCCEEEEEECCCCHH T ss_conf 997696893979999999997399889999089-8299999768999999864998466401231211028991699199 Q ss_pred HHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 99999986553306457479978688980210001 Q gi|254780889|r 174 NAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 174 eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +|+++|.+++++++||+| +|+++|+||++|++|+ T Consensus 80 ~a~~~m~~~~i~~lpV~d-~~~lvGiit~~Dllra 113 (113) T cd04623 80 EAMALMTERRFRHLPVVD-GGKLVGIVSIGDVVKA 113 (113) T ss_pred HHHHHHHHCCCEEEEEEE-CCEEEEEEEHHHHHCC T ss_conf 999999976925999999-9999999997996469 No 56 >cd04626 CBS_pair_13 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.81 E-value=1.7e-19 Score=159.21 Aligned_cols=107 Identities=28% Similarity=0.391 Sum_probs=97.6 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC----CCCCCCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 287899465430345656676050416786688602788723545001----4344672440026786325730358999 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF----ASNAQQAVGELMTRNLITVKKTVNLENA 175 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~----~~~~~~~V~eiMt~~litv~~~~~l~eA 175 (493) +|.+|++|++|+.||+++|.+++++++||+|++ ++|+||+|.+|++. ....+.+|+++|+++++|++++.++.+| T Consensus 1 tdv~tv~~~~tl~eA~~~m~~~~~~~~~Vvd~~-~~l~GIiT~~Di~~~~~~~~~~~~~V~~iMt~~~itv~~~~~~~~a 79 (111) T cd04626 1 TDFPTIDEDASIREALHEMLKYNTNEIIVKDNE-EKLKGVVTFTDILDLDLFESFLEKKVFNIVSQDVFYVNEEDTIDEA 79 (111) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CEEEEEEEHHHHHHHHHCCCCCCCCHHHHCCCCCEEECCCCCHHHH T ss_conf 997399962939999999998499889999699-9799999949999998526877888899606998799997939999 Q ss_pred HHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 999986553306457479978688980210001 Q gi|254780889|r 176 KALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 176 ~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +++|.++++++|||+| +|+|+|+||++||++. T Consensus 80 ~~~M~~~~i~~LPVvd-~~klvGiit~~Dil~k 111 (111) T cd04626 80 LDIMREKQIGRLPVVD-DNKLIGVVRTKDILDK 111 (111) T ss_pred HHHHHHHCCCEEEEEE-CCEEEEEEEHHHHCCC T ss_conf 9999980998899999-9999999996981488 No 57 >cd04642 CBS_pair_29 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.81 E-value=2.2e-19 Score=158.49 Aligned_cols=108 Identities=32% Similarity=0.499 Sum_probs=95.6 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCC------------------CCCCCCCCCCCCC Q ss_conf 2878994654303456566760504167866886027887235450014------------------3446724400267 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFA------------------SNAQQAVGELMTR 161 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~------------------~~~~~~V~eiMt~ 161 (493) ++.+|+.|++|+.||+++|.+++++++||+|++ |+|+|++|.+|+++. ......+..+|++ T Consensus 1 s~Vitv~~~~~l~eA~~lm~~~~i~~lPVvd~~-g~lvGiis~~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (126) T cd04642 1 SKVVSIDSDERVLDAFKLMRKNNISGLPVVDEK-GKLIGNISASDLKGLLLSPDDLLLYRTITFKELSEKFTDSDGVKSR 79 (126) T ss_pred CCEEEECCCCCHHHHHHHHHHCCCCEEEEEECC-CEEEEEEEHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCEEEECCCC T ss_conf 957999995999999999998098789999289-9099999989976555144303444455554203344234202447 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 86325730358999999986553306457479978688980210001 Q gi|254780889|r 162 NLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 162 ~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +++|+++++++.++.++|.+++++++||||++|+|+|+||++||+|. T Consensus 80 ~~itv~~~~~l~~~~~~m~~~~i~~lpVVD~~~~lvGIIs~~Dil~~ 126 (126) T cd04642 80 PLITCTPSSTLKEVITKLVANKVHRVWVVDEEGKPIGVITLTDIISI 126 (126) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHCC T ss_conf 99899996939999999998697499999899959999992894219 No 58 >cd04607 CBS_pair_NTP_transferase_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain associated with the NTP (Nucleotidyl transferase) domain downstream. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.81 E-value=1.6e-19 Score=159.44 Aligned_cols=107 Identities=32% Similarity=0.514 Sum_probs=99.2 Q ss_pred CCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC----CCCCCCCCCCCCCCCCCCCCCCCCHHHHH Q ss_conf 87899465430345656676050416786688602788723545001----43446724400267863257303589999 Q gi|254780889|r 101 NPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF----ASNAQQAVGELMTRNLITVKKTVNLENAK 176 (493) Q Consensus 101 dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~----~~~~~~~V~eiMt~~litv~~~~~l~eA~ 176 (493) +|++++|++|+.||++.|.+++++++||+|++ ++|+|++|++|++- ..+.+.+|+++|+++++++.++.++.+|. T Consensus 3 kpi~v~~~~tl~ea~~~m~~~~~~~~~Vvd~~-~~l~GiiT~~Di~r~l~~~~~~~~~v~~im~~~~~~i~~~~~~~~a~ 81 (113) T cd04607 3 KQLLVSPDASILDALRKIDKNALRIVLVVDEN-GRLLGTVTDGDIRRALLKGLSLDDPVSEVMNRNPITAKVGSSREEIL 81 (113) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEECHHHHHHHHCCCCCCCCHHHHHCCEEEEEECCCCHHHHH T ss_conf 68896994989999999987599789999799-95999998727766876289845615675212129998799899999 Q ss_pred HHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 99986553306457479978688980210001 Q gi|254780889|r 177 ALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 177 ~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) ++|.+++++++||||++|+++|++|++|+++. T Consensus 82 ~~m~~~~i~~lPVvd~~~~lvGiit~~Dll~~ 113 (113) T cd04607 82 ALMRERSIRHLPILDEEGRVVGLATLDDLLSK 113 (113) T ss_pred HHHHHCCCCEEEEECCCCEEEEEEEHHHHHCC T ss_conf 99987797499999789949999997993086 No 59 >cd04634 CBS_pair_21 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.81 E-value=1.7e-19 Score=159.27 Aligned_cols=105 Identities=36% Similarity=0.632 Sum_probs=94.7 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC-C------------------------------ Q ss_conf 287899465430345656676050416786688602788723545001-4------------------------------ Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF-A------------------------------ 148 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~-~------------------------------ 148 (493) +||+|++|++|+.+|+++|.+++++++||+|+ |+++|+||++|+.. . T Consensus 1 kdvvTv~~d~~l~ea~~~m~~~~i~~lPVvd~--~~lvGivT~~Dil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (143) T cd04634 1 KNPITCNADDTISDAARLLRENKISGAPVLDG--GKLVGIVSESDILKLLVTHDPSGNLWLPSPLELIELPLREFINWEE 78 (143) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 99889589398999999999749977999979--9999999869999887516755432356602332111232222667 Q ss_pred ------CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHH Q ss_conf ------34467244002678632573035899999998655330645747997868898021000 Q gi|254780889|r 149 ------SNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIER 207 (493) Q Consensus 149 ------~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk 207 (493) .....+++++|+++++++++++++.+|.++|.+++++++||+| +|+|+||||++|++| T Consensus 79 ~~~~~~~~~~~~v~diM~~~~itv~~~~~l~~a~~~m~~~~i~~lPVvd-~~~lvGIItr~Dilk 142 (143) T cd04634 79 TKRALTDAGKMKVRDIMTKKVITISPDASIEDAAELMVRHKIKRLPVVE-DGRLVGIVTRGDIIE 142 (143) T ss_pred HHHHHHHHCCCCHHHHCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEEE-CCEEEEEEEHHHCCC T ss_conf 7766543227898995888987998989699999999973988899998-999999999577308 No 60 >cd04803 CBS_pair_15 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.81 E-value=1.8e-19 Score=159.16 Aligned_cols=106 Identities=40% Similarity=0.571 Sum_probs=96.2 Q ss_pred CEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC-C-------------CCCCCCCCCCCCCCCCCCCC Q ss_conf 789946543034565667605041678668860278872354500-1-------------43446724400267863257 Q gi|254780889|r 102 PVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR-F-------------ASNAQQAVGELMTRNLITVK 167 (493) Q Consensus 102 PVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir-~-------------~~~~~~~V~eiMt~~litv~ 167 (493) ++|++|++|+.||+++|.+++++++||+|++ |+++|++|.+|+. . ....+.+|+++|++++++++ T Consensus 3 vvtv~~~~tl~~a~~~m~~~~i~~lpVvd~~-~~lvGiit~~Dil~~~~~~~~~~~~~~~~~~~~~~v~~im~~~~~tv~ 81 (122) T cd04803 3 VVTLSEDDSLADAEELMREHRIRHLPVVNED-GKLVGLLTQRDLLRAALSSLSDNGEESLTKERDVPVAEVMKTDVLTVT 81 (122) T ss_pred CEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHHHHHCCCCCCHHHCCCCCCCCHHHHCCCCCEEEC T ss_conf 1999896999999999997499789999089-868889659999999984376420121025358899996689986998 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 30358999999986553306457479978688980210001 Q gi|254780889|r 168 KTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 168 ~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +++++.+|.++|.+++++++||||++|+|+|+||++||+|. T Consensus 82 ~~~~l~~~~~~m~~~~~~~lpVvd~~~~lvGiiT~~Dilkl 122 (122) T cd04803 82 PDTPLREAAEIMVENKIGCLPVVDDKGTLVGIITRSDFLRL 122 (122) T ss_pred CCCCHHHHHHHHHHCCCCEEEEEECCCEEEEEEEHHHEEEC T ss_conf 99989999999985685099999479989999995155649 No 61 >cd04611 CBS_pair_PAS_GGDEF_DUF1_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with a PAS domain, a GGDEF (DiGuanylate-Cyclase (DGC) domain, and a DUF1 domain downstream. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CB Probab=99.80 E-value=2e-19 Score=158.84 Aligned_cols=107 Identities=37% Similarity=0.598 Sum_probs=98.6 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC----CCCCCCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 287899465430345656676050416786688602788723545001----4344672440026786325730358999 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF----ASNAQQAVGELMTRNLITVKKTVNLENA 175 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~----~~~~~~~V~eiMt~~litv~~~~~l~eA 175 (493) ++.+|++|++|+.||+++|.+++++++||+++ ++++|++|.+|+.. ..+.+.+|+++|+++++++++++++.+| T Consensus 1 t~vvTv~p~~tv~ea~~~m~~~~i~~~~V~~~--~~~vGiiT~~Di~~~~~~~~~~~~~v~~iM~~~~~tv~~~~~~~~a 78 (111) T cd04611 1 TQILTCPPDTSLAEAASRMRERRISSIVVVDD--GRPLGIVTERDILRLLASGPDLQTPVGEVMSSPLLTVPADTSLYDA 78 (111) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEEHHHHHHHHHCCCCCCCHHHHHCCCCEEEECCCCHHHH T ss_conf 97199889697999999999719988999979--9999999855989987424887788899710683399789999999 Q ss_pred HHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 999986553306457479978688980210001 Q gi|254780889|r 176 KALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 176 ~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +++|.+++++++||+|++|+++|+||++||+|+ T Consensus 79 ~~~m~~~~i~~lpVvd~~~~~vGiiT~~Dilr~ 111 (111) T cd04611 79 RQLMREHGIRHLVVVDDDGELLGLLSQTDLLQN 111 (111) T ss_pred HHHHHHHCCEEEEEECCCCEEEEEEEHHHHHCC T ss_conf 999997187399999469989999985684278 No 62 >cd04630 CBS_pair_17 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.80 E-value=1.9e-19 Score=158.92 Aligned_cols=108 Identities=32% Similarity=0.423 Sum_probs=97.8 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC------CCCCCCCCCCCCCCCCCCCCCCCCCHH Q ss_conf 28789946543034565667605041678668860278872354500------143446724400267863257303589 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR------FASNAQQAVGELMTRNLITVKKTVNLE 173 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir------~~~~~~~~V~eiMt~~litv~~~~~l~ 173 (493) .||+|++|++|+.||+++|.+++++++||++++.++++||+|++|+. .....+.+|+++|+++++|+++++++. T Consensus 1 pdvitv~~~~tv~eA~~~M~~~~i~~v~V~~~~~~~~vGiiT~~Di~~~~~~~~~~~~~~~V~~vMt~~v~tv~~d~~~~ 80 (114) T cd04630 1 PNVVTIDGLATVAEALQLMKEHGVSSLVVEKRRESDAYGIVTMRDILKKVVAEGRDPDRVNVYEIMTKPLISVSPDMDIK 80 (114) T ss_pred CCEEEECCCCCHHHHHHHHHHHCCCEEEEEECCCCCEEEEEEEHHHHHHHHHCCCCCCCCCHHHHCCCCEEEECCCCCHH T ss_conf 98499998293999999999829988999978996089999818989999846789444498682703549999949499 Q ss_pred HHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 99999986553306457479978688980210001 Q gi|254780889|r 174 NAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 174 eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +|.++|.+++++++||+| +|+++|+||++||+++ T Consensus 81 ~a~~~m~~~~i~~lpVvd-~~~lvGiit~~Dil~A 114 (114) T cd04630 81 YCARLMERTNIRRAPVVE-NNELIGIISLTDIFLA 114 (114) T ss_pred HHHHHHHHCCCCEEEEEE-CCEEEEEEEHHHHHCC T ss_conf 999999977972999999-9999999976896479 No 63 >cd04595 CBS_pair_DHH_polyA_Pol_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with an upstream DHH domain which performs a phosphoesterase function and a downstream polyA polymerase domain. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.80 E-value=2.8e-19 Score=157.67 Aligned_cols=105 Identities=32% Similarity=0.576 Sum_probs=95.6 Q ss_pred CCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC---CCCCCCCCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 87899465430345656676050416786688602788723545001---434467244002678632573035899999 Q gi|254780889|r 101 NPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF---ASNAQQAVGELMTRNLITVKKTVNLENAKA 177 (493) Q Consensus 101 dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~---~~~~~~~V~eiMt~~litv~~~~~l~eA~~ 177 (493) .++|++|++|+.||.++|.+++++++||+|+ |+|+|++|++|+.. ....+.+|.++|+++++++++++++.+|.+ T Consensus 3 pV~ti~p~~tv~ea~~~m~~~~i~~lpVvd~--~~lvGivT~~Di~~~~~~~~~~~~v~~iM~~~~~tv~~~~~i~~a~~ 80 (110) T cd04595 3 PVKTVRPEATIEEARELLLRYGHTALPVVEG--GRVVGIISRRDVEKALRHGLGHAPVKDYMSTDVVTVPPDTPLSEVQE 80 (110) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHHHHHHCCCCCCCCEEEEECCEEEECCCCCHHHHHH T ss_conf 9899799599999999999759988999989--99999997689887765376788121032536688669993999999 Q ss_pred HHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 9986553306457479978688980210001 Q gi|254780889|r 178 LLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 178 iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +|.+++++++||+| +|+|+|+||++||+|. T Consensus 81 ~m~~~~~~~lpVvd-~g~lvGiit~~Dilks 110 (110) T cd04595 81 LMVEHDIGRVPVVE-DGRLVGIVTRTDLLRT 110 (110) T ss_pred HHHHCCCEEEEEEE-CCEEEEEEEHHHEECC T ss_conf 88974913999998-9999999984156157 No 64 >cd04641 CBS_pair_28 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.80 E-value=3.5e-19 Score=157.01 Aligned_cols=107 Identities=28% Similarity=0.414 Sum_probs=92.9 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCC------CCCCCCCCC------CCCCCCCCCC Q ss_conf 2878994654303456566760504167866886027887235450014------344672440------0267863257 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFA------SNAQQAVGE------LMTRNLITVK 167 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~------~~~~~~V~e------iMt~~litv~ 167 (493) +|++|++|++++.||+++|.+++++++||||++ |+|+|++|++|+... .....++.+ .|+.+++|++ T Consensus 1 r~Vitv~pd~~l~da~~lm~~~~is~lPVVD~~-g~lvGiis~~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~t~~ 79 (120) T cd04641 1 KNIATARPDTPLIDVLDMLVERRVSALPIVDEN-GKVVDVYSRFDVINLAKEGAYNNLDLTVGEALERRSQDFEGVRTCS 79 (120) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHHHHCCCCCCCCCCHHHHHHCCCCCCCCCEEEC T ss_conf 997898998999999999998098669998789-9698997599999887515544344208776421555547856989 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHH Q ss_conf 3035899999998655330645747997868898021000 Q gi|254780889|r 168 KTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIER 207 (493) Q Consensus 168 ~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk 207 (493) +++++.+|.++|.+++++++||||++|+|+|+||++||+| T Consensus 80 ~~~~l~~~~~~m~~~~i~rlpVVD~~~rlvGiIs~~Dilr 119 (120) T cd04641 80 PDDCLRTIFDLIVKARVHRLVVVDENKRVEGIISLSDILQ 119 (120) T ss_pred CCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHC T ss_conf 9997999999999849857989989997999998688238 No 65 >cd04614 CBS_pair_1 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.80 E-value=2.5e-19 Score=158.04 Aligned_cols=95 Identities=26% Similarity=0.473 Sum_probs=89.7 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 28789946543034565667605041678668860278872354500143446724400267863257303589999999 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNLENAKALL 179 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM 179 (493) +|++|++|++++.+|+++|.+++++++||+|++ |+|+||+|++|+. |..+++|++++++++||.++| T Consensus 1 rdv~tv~~~~~v~~a~~~m~~~~i~~lpVvd~~-g~lvGIiT~rDli------------~~~~~itv~~~~~v~eaa~lM 67 (96) T cd04614 1 RNVPTVWEETPLPVAVRIMELANVKALPVLDDD-GKLSGIITERDLI------------AKSEVVTATKRTTVSECAQKM 67 (96) T ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEECHHHC------------CCCCCEEECCCCCHHHHHHHH T ss_conf 996390896989999999997299879999899-9499999888882------------577887979989099999999 Q ss_pred HHCCCCEEEEECCCCCEEEEEEECCCHH Q ss_conf 8655330645747997868898021000 Q gi|254780889|r 180 HQHRIEKLLVVDDDGCCIGLITVKDIER 207 (493) Q Consensus 180 ~~~ki~~LPVVDe~g~LvGIIT~~DIlk 207 (493) .+|++++|||||++|+|+|+||.+||+| T Consensus 68 ~~~~I~~LPVvd~~~~lvGiit~~Dllk 95 (96) T cd04614 68 KRNRIEQIPIINGNDKLIGLLRDHDLLK 95 (96) T ss_pred HHCCCCEEEEECCCCCEEEEEEHHHCCC T ss_conf 9859988628989991999988798048 No 66 >cd04633 CBS_pair_20 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.79 E-value=5.2e-19 Score=155.82 Aligned_cols=105 Identities=36% Similarity=0.549 Sum_probs=94.6 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC-C--------------CCCCCCCCCCCCCCCC Q ss_conf 287899465430345656676050416786688602788723545001-4--------------3446724400267863 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF-A--------------SNAQQAVGELMTRNLI 164 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~-~--------------~~~~~~V~eiMt~~li 164 (493) ++++|++|++|+.||+++|.+++++++||+|+ ++++|++|++|+.. . ...+.+|+++|+++++ T Consensus 1 r~vvtv~~~~tl~ea~~~m~~~~i~~lpVvd~--~~~vGiiT~~Di~~~l~~~~~~~~~~~~~~~~~~~~V~~iM~~~~~ 78 (121) T cd04633 1 KPVITVSPDDRVSHARRLMLDHDISRLPVIEG--GKLVGIVTEKDIADALRSFRPLVRDRHQERRIRNLPVSDIMTRPVI 78 (121) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHHHHHHHCCCCCCCHHHCCCCCCCEEECCCCCCCE T ss_conf 99479799398999999999709988999989--9988787599999999744653320110245136685533437988 Q ss_pred CCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHH Q ss_conf 2573035899999998655330645747997868898021000 Q gi|254780889|r 165 TVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIER 207 (493) Q Consensus 165 tv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk 207 (493) ++++++++.+|.++|.+++++++||+|+ |+++|+||++||+| T Consensus 79 tv~~~~~l~~~~~~m~~~~~~~lpVvd~-~~lvGIIT~~Dilr 120 (121) T cd04633 79 TIEPDTSVSDVASLMLENNIGGLPVVDD-GKLVGIVTRTDILR 120 (121) T ss_pred EEECCCCHHHHHHHHHHHCCEEEEEEEC-CEEEEEEEHHHHHC T ss_conf 9909898999999989719859999989-99999998346028 No 67 >cd04604 CBS_pair_KpsF_GutQ_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with KpsF/GutQ domains in the API [A5P (D-arabinose 5-phosphate) isomerase] protein. These APIs catalyze the conversion of the pentose pathway intermediate D-ribulose 5-phosphate into A5P, a precursor of 3-deoxy-D-manno-octulosonate, which is an integral carbohydrate component of various glycolipids coating the surface of the outer membrane of Gram-negative bacteria, including lipopolysaccharide and many group 2 K-antigen capsules. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other funct Probab=99.79 E-value=5.1e-19 Score=155.84 Aligned_cols=107 Identities=34% Similarity=0.497 Sum_probs=97.3 Q ss_pred CCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC-----CCCCCCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 87899465430345656676050416786688602788723545001-----4344672440026786325730358999 Q gi|254780889|r 101 NPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF-----ASNAQQAVGELMTRNLITVKKTVNLENA 175 (493) Q Consensus 101 dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~-----~~~~~~~V~eiMt~~litv~~~~~l~eA 175 (493) +.-.++|++|+.||+++|.+++++++||+|++ |+++||+|++|++- ....+.+++++|+++++|++++.++.+| T Consensus 3 ~vp~V~~~~si~ea~~~m~~~~~~~v~Vvd~~-~~lvGiiT~~Dl~r~~~~~~~~~~~~v~~iM~~~~~tv~~~~~i~~a 81 (114) T cd04604 3 ALPLVSPDTSLKDALLEMSRKGLGMTAVVDED-GRLVGIFTDGDLRRALEKGLDILTLPVADVMTRNPKTIDPDALAAEA 81 (114) T ss_pred CCCEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHHHHCCCCCCCCCHHHHHCCCCEEEECCCCHHHH T ss_conf 47788994939999999997699789999799-93999998999999987166645679747435675899389886999 Q ss_pred HHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 999986553306457479978688980210001 Q gi|254780889|r 176 KALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 176 ~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +++|.+++++++||+|++|+++|+||++||+|+ T Consensus 82 ~~~m~~~~i~~lpVvd~~~~~vGiit~~DllkA 114 (114) T cd04604 82 LELMEENKITALPVVDDNGRPVGVLHIHDLLRA 114 (114) T ss_pred HHHHHHCCEEEEEEECCCCEEEEEEEHHHHHCC T ss_conf 999987795799999899989999988995469 No 68 >cd04802 CBS_pair_3 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.79 E-value=5.3e-19 Score=155.71 Aligned_cols=105 Identities=35% Similarity=0.529 Sum_probs=95.8 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC------CCCCCCCCCCCCCCCCCCCCCCCCCHH Q ss_conf 28789946543034565667605041678668860278872354500------143446724400267863257303589 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR------FASNAQQAVGELMTRNLITVKKTVNLE 173 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir------~~~~~~~~V~eiMt~~litv~~~~~l~ 173 (493) +||+|++|++|+.||.++|.+++++++||+|+ ++++|++|++|+. .....+.+|+++|+++++++++++++. T Consensus 1 r~vvti~~~~tl~~a~~~m~~~~i~~l~V~d~--~~~vGivt~~Di~~~~~~~~~~~~~~~v~~iM~~~~~tv~~~~~l~ 78 (112) T cd04802 1 KNVITVDPDTTVYEAANIMTENNIGRLIVVDN--EKPVGIITERDLVKKVVSRNLKPREVPVGEVMSTPLITIDPNASLN 78 (112) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEECHHHHHHHHCCCCCCCCCHHHHHCCCCEEECCCCHHH T ss_conf 99789689198999999999749987999989--9999999821878888754798123899997234855994798699 Q ss_pred HHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHH Q ss_conf 9999998655330645747997868898021000 Q gi|254780889|r 174 NAKALLHQHRIEKLLVVDDDGCCIGLITVKDIER 207 (493) Q Consensus 174 eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk 207 (493) +|.++|.+++++++||+| +|+|+|+||++||+. T Consensus 79 ~a~~~m~~~~~~~lpVvd-~~~lvGiIt~~Dil~ 111 (112) T cd04802 79 EAAKLMAKHGIKRLPVVD-DDELVGIVTTTDIVM 111 (112) T ss_pred HHHHHHHHHCCCEEEEEE-CCEEEEEEECHHHCC T ss_conf 999987871985899998-999999998067105 No 69 >PRK05437 isopentenyl pyrophosphate isomerase; Provisional Probab=99.79 E-value=1.1e-17 Score=146.33 Aligned_cols=147 Identities=29% Similarity=0.423 Sum_probs=116.0 Q ss_pred HHHHHHHHHHHHCCCCCEECCCC---CCHHHHHHHHHCCCCEEEEEECCCC-----CCCCCHH--------CCCCCCCCH Q ss_conf 89999999998579872451464---6999999998548758996102787-----6653100--------015743100 Q gi|254780889|r 260 KVLDAVVQIKKNFPSLLVMAGNI---ATAEGALALIDAGADIIKVGIGPGS-----ICTTRVV--------TGVGCPQLS 323 (493) Q Consensus 260 ~~~~~i~~~k~~~~~~~ii~GNv---~t~~~~~~l~~~Gad~ikvGig~Gs-----~CtTr~~--------~g~g~pq~~ 323 (493) ..++.|+++++..+ +|||+=-| -+++.++.|+++|+++|-|+ |.|. |...|.. ...|.|+.. T Consensus 173 ~~l~~I~~i~~~~~-vPVIvKeVG~Gis~e~a~~l~~~Gv~~IdVs-g~GGTnf~~IE~~R~~~~~~~~~~~~wGipT~~ 250 (351) T PRK05437 173 GWLDRIAEIVSALP-VPVIVKEVGFGISKETAKRLADAGVKAIDVA-GAGGTSWAAVENYRAKQRRLASYFADWGIPTAQ 250 (351) T ss_pred HHHHHHHHHHHHCC-CCEEEEECCCCCCHHHHHHHHHCCCCEEEEC-CCCCCCHHHHHHHHHCCHHHHHHHHHCCCCHHH T ss_conf 99999999998679-9889852157889999999996799999957-998855799998871021245777734866899 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCH Q ss_conf 37889998622698199726543777621336714878993230101216776448736805888733200688752212 Q gi|254780889|r 324 AIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSS 403 (493) Q Consensus 324 av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~ 403 (493) ++.++..+. .+++||||||||+.-||+||||+||++|.+++.|=.... T Consensus 251 sL~e~~~~~--~~~~iiasGGIR~glDi~KaLaLGA~~vgia~p~L~~l~------------------------------ 298 (351) T PRK05437 251 SLLEARSAL--PDLPIIASGGIRTGLDIAKALALGADAVGLAGPFLKAAL------------------------------ 298 (351) T ss_pred HHHHHHHHC--CCCEEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHHHHH------------------------------ T ss_conf 999999747--998299627878789999999955107775899999998------------------------------ Q ss_pred HHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCC Q ss_conf 22101233343310045402662105758998999999999855515758776897375 Q gi|254780889|r 404 ARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQKKA 462 (493) Q Consensus 404 ~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~~~ 462 (493) .+|.| .+..++..+..-||..|.-+|++||.||++.- T Consensus 299 ---------------~~g~e-------~~~~~l~~~~~elk~~M~L~G~~~i~eL~~~~ 335 (351) T PRK05437 299 ---------------EGGVE-------AVIEEIEGWIEELKTAMFLTGAKNIAELRQVP 335 (351) T ss_pred ---------------HCCHH-------HHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCC T ss_conf ---------------56999-------99999999999999999986899899981799 No 70 >cd04800 CBS_pair_CAP-ED_DUF294_PBI_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with either the CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain or the PB1 (Phox and Bem1p) domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. The PB1 domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pa Probab=99.78 E-value=1.1e-18 Score=153.44 Aligned_cols=106 Identities=38% Similarity=0.544 Sum_probs=97.1 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC-----CCCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 287899465430345656676050416786688602788723545001-----434467244002678632573035899 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF-----ASNAQQAVGELMTRNLITVKKTVNLEN 174 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~-----~~~~~~~V~eiMt~~litv~~~~~l~e 174 (493) ++|+|++|++|+.||.++|.+++++++||+++ ++++|++|++|+.. ..+.+.+|+++|+++++++++++++.+ T Consensus 1 r~~vtv~~~~tv~ea~~~m~~~~i~~v~V~~~--~~~vGiit~~Di~~~~~~~~~~~~~~v~~im~~~~~tv~~~~~l~~ 78 (111) T cd04800 1 RPPVTCSPDTTIREAARLMTEHRVSSLLVVDD--GRLVGIVTDRDLRNRVVAEGLDPDTPVSEVMTAPPITIPPDATVFE 78 (111) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEECHHHHHHHHCCCCCCCCHHHHCCCCCEEEECCCCHHH T ss_conf 99989789197999999999729998999989--9999999955788998716798566588851268179988991999 Q ss_pred HHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 9999986553306457479978688980210001 Q gi|254780889|r 175 AKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 175 A~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) |.++|.+++++++||+| +|+++|+||++||+|. T Consensus 79 a~~~m~~~~i~~lpVvd-~~~lvGiit~~Dilk~ 111 (111) T cd04800 79 ALLLMLERGIHHLPVVD-DGRLVGVISATDLLRL 111 (111) T ss_pred HHHHHHHCCCCEEEEEE-CCEEEEEEEHHHEECC T ss_conf 99999853995899998-9999999985457363 No 71 >cd04589 CBS_pair_CAP-ED_DUF294_assoc_bac This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the bacterial CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or Probab=99.76 E-value=3.1e-18 Score=150.24 Aligned_cols=105 Identities=29% Similarity=0.414 Sum_probs=94.8 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC-----CCCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 287899465430345656676050416786688602788723545001-----434467244002678632573035899 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF-----ASNAQQAVGELMTRNLITVKKTVNLEN 174 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~-----~~~~~~~V~eiMt~~litv~~~~~l~e 174 (493) +.|++++|++|+.||.++|.++++++++|++++ +++||+|++|++. ....+.+|+++|+++++|+++++++.+ T Consensus 1 R~pvtv~~~~ti~ea~~~M~~~~i~~~vV~~~~--~~~GIvT~~Dl~~~~~~~~~~~~~~V~~vms~~~~ti~~~~~~~~ 78 (111) T cd04589 1 RPPLIVDASTSIRDAARLMREHGADALLVRDGD--PRLGIVTRTDLLDAVLLDGLPSSTPVGEIATFPLITVDPDDFLFN 78 (111) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECC--EEEEEEEHHHHHHHHHCCCCCCCCCHHHHCCCCCEEECCCCCHHH T ss_conf 999899992989999999997099889997699--279999979989999829996788999928799989999792999 Q ss_pred HHHHHHHCCCCEEEEECCCCCEEEEEEECCCHH Q ss_conf 999998655330645747997868898021000 Q gi|254780889|r 175 AKALLHQHRIEKLLVVDDDGCCIGLITVKDIER 207 (493) Q Consensus 175 A~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk 207 (493) |.++|.+++++++||+| +++++|+||.+||++ T Consensus 79 a~~~M~~~~i~~lpV~d-~~~~vGivt~~Dil~ 110 (111) T cd04589 79 ALLLMTRHRIHRVVVRE-GGEVVGVLEQTDLLS 110 (111) T ss_pred HHHHHHHCCCEEEEEEE-CCEEEEEEEHHHHHC T ss_conf 99999987980889998-999999999688228 No 72 >cd04801 CBS_pair_M50_like This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the metalloprotease peptidase M50. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.76 E-value=3.8e-18 Score=149.56 Aligned_cols=108 Identities=31% Similarity=0.413 Sum_probs=97.6 Q ss_pred CCCEEEECCCCHHHHHHH-HHHHCCCCEEEECCCCEEEEEEEEHHHHCC---CCCCCCCCCCCCC--CCCCCCCCCCCHH Q ss_conf 287899465430345656-676050416786688602788723545001---4344672440026--7863257303589 Q gi|254780889|r 100 VNPVTISPYATLADALAL-MKKYSISGIPVVESDVGKLVGILTNRDVRF---ASNAQQAVGELMT--RNLITVKKTVNLE 173 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~l-m~~~~is~iPVVD~~~gkLiGIVT~rDir~---~~~~~~~V~eiMt--~~litv~~~~~l~ 173 (493) +|+.|++|+.|+.|+.+. +.+++++++||+|++ |+|+||||.+|++. ......++.++|+ +++.++.+++++. T Consensus 1 Rdv~tv~~~~tl~e~~~~~~~~~~~~~~pVvd~~-g~l~Givt~~dl~~~~~~~~~~~~v~~v~~~~~~~~~v~~~~~l~ 79 (114) T cd04801 1 RDFPTVPAHLTLREFVREYVLGSNQRRFVVVDNE-GRYVGIISLADLRAIPTSQWAQTTVIQVMTPAAKLVTVLSEESLA 79 (114) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCC-CEEEEEEEHHHHHHHHHHCCCCCCHHHHCCCCCCCEEECCCCCHH T ss_conf 9955849999599999999850796689998789-979999999998754540256884778210277875899999799 Q ss_pred HHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 99999986553306457479978688980210001 Q gi|254780889|r 174 NAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 174 eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +|+++|.++++++|||||++|+++|+||++||+|+ T Consensus 80 ~al~~m~~~~~~~LpVVd~~g~lvGiit~~Dilrr 114 (114) T cd04801 80 EVLKLLEEQGLDELAVVEDSGQVIGLITEADLLRR 114 (114) T ss_pred HHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHCC T ss_conf 99999997698678899479909999997884368 No 73 >cd04627 CBS_pair_14 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.75 E-value=5.8e-18 Score=148.30 Aligned_cols=105 Identities=27% Similarity=0.426 Sum_probs=92.4 Q ss_pred CEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC-CC----CC---------CCCCCCCCCCCCCCCCC Q ss_conf 789946543034565667605041678668860278872354500-14----34---------46724400267863257 Q gi|254780889|r 102 PVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR-FA----SN---------AQQAVGELMTRNLITVK 167 (493) Q Consensus 102 PVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir-~~----~~---------~~~~V~eiMt~~litv~ 167 (493) .++++|++|+.+|+++|.+++++++||+|++.|+++||+|.+|+. +. .. .......+|+++++|++ T Consensus 3 fi~v~~~~sl~~a~~lm~~~~i~rlpVvd~~~g~lvGIiT~~dil~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~viti~ 82 (123) T cd04627 3 FIPVPSTASLFQAIEILGSGGIHRVAVTEEESGEVIGILSQRRLVEFLWENARSFPGLDPLYPIPLRDLTIGTSDVISIN 82 (123) T ss_pred CEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEHHHHHHHHHHCCCCCCCHHHHHCCCHHHHHCCCCCCEEEC T ss_conf 64848989299999999749977799995899949999997999999986376566144541120023323767717989 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCH Q ss_conf 303589999999865533064574799786889802100 Q gi|254780889|r 168 KTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIE 206 (493) Q Consensus 168 ~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIl 206 (493) ++.++.+|+++|.+++++.+||||++|+|+|++|.+|+- T Consensus 83 ~d~~l~~A~~lM~~~~i~~lpVVD~~g~lvGiiS~~Dlr 121 (123) T cd04627 83 GDQPLIDALHLMHNEGISSVAVVDNQGNLIGNISVTDVR 121 (123) T ss_pred CCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEEHHHEE T ss_conf 999599999999980988786985999699998925422 No 74 >pfam01070 FMN_dh FMN-dependent dehydrogenase. Probab=99.75 E-value=7.6e-17 Score=140.25 Aligned_cols=167 Identities=24% Similarity=0.397 Sum_probs=128.0 Q ss_pred CHHHHHHHHHHHCCCCEEEE--ECCC-CHH---HHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECC Q ss_conf 01567899987313666998--2484-113---88999999999857987245146469999999985487589961027 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVV--DTAH-GHS---QKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGP 305 (493) Q Consensus 232 ~d~~eRa~~LveaGvDvivi--D~ah-Gh~---~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~ 305 (493) .+..+|++ ++|+++|++ |++. |.. .+..+.|++|++..+ .|+|.=.|-++|-++.+.++|+|+|-|-=-+ T Consensus 125 ~~~i~ra~---~ag~~al~ltvD~~~~g~r~~d~r~~~~i~~l~~~~~-~PvivKGI~s~eDA~~a~~~Gv~~I~VSnHG 200 (301) T pfam01070 125 EDLLERAE---AAGYKALVLTVDTPVLGNRERDLRNGDDLAWLRDQWK-GPLVLKGILSPEDAKRAVEAGVDGIVVSNHG 200 (301) T ss_pred HHHHHHHH---HCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHCC-CCEEEECCCCHHHHHHHHHCCCCEEEECCCC T ss_conf 99999999---7499979997268765778532043999999998669-9889982899999999998599999964998 Q ss_pred CCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEE Q ss_conf 87665310001574310037889998622698199726543777621336714878993230101216776448736805 Q gi|254780889|r 306 GSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRS 385 (493) Q Consensus 306 Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~ 385 (493) | | +.-.+.|.+.++.+++++... ++|||+|||||+.-||+||||+|||+|++|..|- T Consensus 201 G-----R-qlD~~~~t~~~L~eI~~~v~~-~~~i~~DGGIR~G~DV~KAlALGA~~V~iGRp~l---------------- 257 (301) T pfam01070 201 G-----R-QLDGAPATIDALPEIVAAVGG-RIPVLVDGGIRRGTDVLKALALGADAVLLGRPFL---------------- 257 (301) T ss_pred C-----C-CCCCCCCHHHHHHHHHHHHCC-CEEEEECCCCCCCHHHHHHHHCCCCEEEECHHHH---------------- T ss_conf 5-----4-468886799999999998567-7489963874762689999980898665568999---------------- Q ss_pred EEEEECCCCHHHHHHCCHHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHC Q ss_conf 8887332006887522122210123334331004540266210575899899999999985551575877689737 Q gi|254780889|r 386 FKSYRGMGSVAAMERGSSARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQKK 461 (493) Q Consensus 386 ~K~yrGm~S~~A~~~~~~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~~ 461 (493) ||.+ ..|.+| +..++..|..-|+.+|..+|++||.|+... T Consensus 258 ----~ala-------------------------~~G~~G-------v~~~l~~l~~El~~~M~l~G~~~i~~l~~~ 297 (301) T pfam01070 258 ----YGLA-------------------------AGGEAG-------VAHALEILRDELERTMALLGCTSIADLTPS 297 (301) T ss_pred ----HHHH-------------------------HCCHHH-------HHHHHHHHHHHHHHHHHHHCCCCHHHCCHH T ss_conf ----9999-------------------------657999-------999999999999999998589997895998 No 75 >TIGR03151 enACPred_II putative enoyl-(acyl-carrier-protein) reductase II. This oxidoreductase of the 2-nitropropane dioxygenase family (pfam03060) is commonly found in apparent operons with genes involved in fatty acid biosynthesis. Furthermore, this genomic context generally includes the fabG 3-oxoacyl-[ACP] reductase and lacks the fabI enoyl-[ACP] reductase. Probab=99.75 E-value=1.3e-16 Score=138.71 Aligned_cols=200 Identities=27% Similarity=0.415 Sum_probs=140.8 Q ss_pred EEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECC-CCCHHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHH Q ss_conf 343038816003178337852368889999997798799978-9898999999998742238815287899465430345 Q gi|254780889|r 36 STRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHR-NFSPSEQVAQVHQVKKFESGMVVNPVTISPYATLADA 114 (493) Q Consensus 36 ~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHr-n~sie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea 114 (493) .|+||+-+.++.|||++||..|+.++||.++++.||+|+|.- +++.|+-.++++++|..- +.|+- T Consensus 1 ~T~lt~~lgi~~PIiqapM~~vs~~~La~AVs~aGglG~l~~~~~~~e~l~~~i~~~~~~t----d~P~g---------- 66 (307) T TIGR03151 1 KTRLCDLLGIEYPIFQGGMAWVATGSLAAAVSNAGGLGIIGAGNAPPDVVRKEIRKVKELT----DKPFG---------- 66 (307) T ss_pred CCHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHHHHHC----CCCCE---------- T ss_conf 9767697689949787887777878999999808984166788899999999999999852----79860---------- Q ss_pred HHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 65667605041678668860278872354500143446724400267863257303589999999865533064574799 Q gi|254780889|r 115 LALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDG 194 (493) Q Consensus 115 ~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g 194 (493) ++++. . T Consensus 67 ----------------------vnl~~------------------------~---------------------------- 72 (307) T TIGR03151 67 ----------------------VNIML------------------------L---------------------------- 72 (307) T ss_pred ----------------------EEEEE------------------------C---------------------------- T ss_conf ----------------------43332------------------------3---------------------------- Q ss_pred CEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCC Q ss_conf 78688980210001017422326568765556651370156789998731366699824841138899999999985798 Q gi|254780889|r 195 CCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPS 274 (493) Q Consensus 195 ~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~ 274 (493) .++..+.+..+++.++.++. .+.|+... .++++|+. . T Consensus 73 ------------------------------------~~~~~~~~~~~~e~~v~vv~--~~~G~p~~---~~~~~~~~--g 109 (307) T TIGR03151 73 ------------------------------------SPFVDELVDLVIEEKVPVVT--TGAGNPGK---YIPRLKEN--G 109 (307) T ss_pred ------------------------------------CCCHHHHHHHHHHHCCCCEE--ECCCCCHH---HHHHHHHC--C T ss_conf ------------------------------------88899999999860898247--27999689---99999985--9 Q ss_pred CCEECCCCCCHHHHHHHHHCCCCEEEE-EECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEE Q ss_conf 724514646999999998548758996-1027876653100015743100378899986226981997265437776213 Q gi|254780889|r 275 LLVMAGNIATAEGALALIDAGADIIKV-GIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAK 353 (493) Q Consensus 275 ~~ii~GNv~t~~~~~~l~~~Gad~ikv-Gig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~k 353 (493) +.++ ..|.+...++.+.++|+|+|-+ |...|.. .++++...-+.++++..++||||-|||.+..+++. T Consensus 110 ~~v~-~~v~s~~~A~~a~~~G~D~iV~qG~EAGGH----------~G~~~~~~Lvp~v~d~~~iPViAAGGI~dgr~iaa 178 (307) T TIGR03151 110 VKVI-PVVASVALAKRMEKAGADAVIAEGMESGGH----------IGELTTMALVPQVVDAVSIPVIAAGGIADGRGMAA 178 (307) T ss_pred CEEE-EEECCHHHHHHHHHCCCCEEEEECCCCCCC----------CCCCCHHHHHHHHHHHCCCCEEEECCCCCCHHHHH T ss_conf 9799-981899999999964999999745544687----------78643787799998504686576411336588999 Q ss_pred EEEECCCEEEECCHHHCCCCCCCC Q ss_conf 367148789932301012167764 Q gi|254780889|r 354 AIAAGSACVMIGSLLAGTDESPGD 377 (493) Q Consensus 354 Ala~GA~~VM~G~~~agt~Espg~ 377 (493) ||++||+.|.||+-|..|.||+.. T Consensus 179 alalGA~gV~mGTrFlat~Es~a~ 202 (307) T TIGR03151 179 AFALGAEAVQMGTRFLCAKECNVH 202 (307) T ss_pred HHHCCCCEEEECHHHHCCCCCCCC T ss_conf 997188478744197718566899 No 76 >cd04603 CBS_pair_KefB_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the KefB (Kef-type K+ transport systems) domain which is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.75 E-value=7.2e-18 Score=147.58 Aligned_cols=107 Identities=18% Similarity=0.256 Sum_probs=99.2 Q ss_pred CCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCC---CCCCCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 87899465430345656676050416786688602788723545001434---467244002678632573035899999 Q gi|254780889|r 101 NPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASN---AQQAVGELMTRNLITVKKTVNLENAKA 177 (493) Q Consensus 101 dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~---~~~~V~eiMt~~litv~~~~~l~eA~~ 177 (493) .|++++|++|+.+|+++|.+++++++||||++ +|++|++|.+|+..... ...+|+++|..++.++.+++++.+|++ T Consensus 2 ~pvtv~~~~~L~~a~~~~~e~~~~~i~VVD~~-~k~vG~it~~Dll~~~~~~~~~~~V~d~m~~~v~~i~~~~~v~dalr 80 (111) T cd04603 2 QTVSVNCENPLREAIKMINELGARAVVVVDEE-NKVLGQVTLSDLLEIGPNDYETLKVCEVYIVPVPIVYCDSKVTDLLR 80 (111) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHHCHHHHHCCCHHHHHCCCCCCCCCCCCHHHHHH T ss_conf 44686599959999999987699768998899-96889988999864384455324387751567746389976899999 Q ss_pred HHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 9986553306457479978688980210001 Q gi|254780889|r 178 LLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 178 iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +|.++++.++||||++|+++|+||.+||+|. T Consensus 81 ~~~~~~~~~l~VVD~~grlvGiI~~rdlLr~ 111 (111) T cd04603 81 IFRETEPPVVAVVDKEGKLVGTIYERELLRF 111 (111) T ss_pred HHHHCCCCEEEEECCCCCEEEEEEHHHHHCC T ss_conf 9874499889999389979999962551239 No 77 >COG2524 Predicted transcriptional regulator, contains C-terminal CBS domains [Transcription] Probab=99.74 E-value=6.2e-18 Score=148.06 Aligned_cols=114 Identities=32% Similarity=0.510 Sum_probs=105.2 Q ss_pred HCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC---CCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 23881528789946543034565667605041678668860278872354500---143446724400267863257303 Q gi|254780889|r 94 FESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR---FASNAQQAVGELMTRNLITVKKTV 170 (493) Q Consensus 94 ~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir---~~~~~~~~V~eiMt~~litv~~~~ 170 (493) ..+.|..+|+++.|++|+.|+..++.+++++|.||+|++ +++||+|..|+. +.-+.+.+|+++|+++++|+++|+ T Consensus 174 V~~~~s~~~i~v~~d~tl~eaak~f~~~~i~GaPVvd~d--k~vGiit~~dI~~aia~g~~~~kV~~~M~k~vitI~eDe 251 (294) T COG2524 174 VKNLMSKKLITVRPDDTLREAAKLFYEKGIRGAPVVDDD--KIVGIITLSDIAKAIANGNLDAKVSDYMRKNVITINEDE 251 (294) T ss_pred HHHHCCCCCEEECCCCCHHHHHHHHHHCCCCCCCEECCC--CEEEEEEHHHHHHHHHCCCCCCCHHHHHCCCCCEECCCH T ss_conf 666266784676688639999999997286678510289--458999889999999768965438887505770475730 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 589999999865533064574799786889802100010 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) ++.||.++|..+++++|.|+|.+|+.+|+||++|||+.. T Consensus 252 ~i~dAir~M~~~nVGRLlV~ds~gkpvGiITrTDIL~~i 290 (294) T COG2524 252 DIYDAIRLMNKNNVGRLLVTDSNGKPVGIITRTDILTRI 290 (294) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHH T ss_conf 499999998763864699984699678788557888886 No 78 >cd04609 CBS_pair_PALP_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the pyridoxal-phosphate (PALP) dependent enzyme domain upstream. The vitamin B6 complex comprises pyridoxine, pyridoxal, and pyridoxamine, as well as the 5'-phosphate esters of pyridoxal (PALP) and pyridoxamine, the last two being the biologically active coenzyme derivatives. The members of the PALP family are principally involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and other amine-containing compounds. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a pote Probab=99.74 E-value=1.3e-17 Score=145.70 Aligned_cols=104 Identities=30% Similarity=0.499 Sum_probs=93.1 Q ss_pred CCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCC-----CCCCCCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 87899465430345656676050416786688602788723545001-----4344672440026786325730358999 Q gi|254780889|r 101 NPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRF-----ASNAQQAVGELMTRNLITVKKTVNLENA 175 (493) Q Consensus 101 dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~-----~~~~~~~V~eiMt~~litv~~~~~l~eA 175 (493) |++|++|++|+.+|+++|.+++++++||+|+ ++++|++|.+|+.. ....+.+|+++|++++.+++++++++++ T Consensus 2 dvvtv~p~~tl~~a~~~m~~~~i~~lpVvd~--~~lvGiit~~Di~~~~~~~~~~~~~~v~~iM~~~~~~v~~~~~l~~~ 79 (110) T cd04609 2 DVVSVAPDDTVSQAIERMREYGVSQLPVVDD--GRVVGSIDESDLLDALIEGKAKFSLPVREVMGEPLPTVDPDAPIEEL 79 (110) T ss_pred CEEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHHHHHCCCCCCCCEEEEEEECCCEEECCCCCHHHH T ss_conf 8069999497999999999819988999989--99999999999999986387646876984567594597898949999 Q ss_pred HHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 999986553306457479978688980210001 Q gi|254780889|r 176 KALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 176 ~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) .++|.+++ .+||||++|+++|+||++||+|. T Consensus 80 ~~~~~~~~--~~~vV~~~g~lvGIvT~~Dil~~ 110 (110) T cd04609 80 SELLDRGN--VAVVVDEGGKFVGIITRADLLKY 110 (110) T ss_pred HHHHHHCC--CCEEEECCCEEEEEEEHHHHHCC T ss_conf 99887679--82499619999999983685294 No 79 >cd02811 IDI-2_FMN Isopentenyl-diphosphate:dimethylallyl diphosphate isomerase type 2 (IDI-2) FMN-binding domain. Two types of IDIs have been characterized at present. The long known IDI-1 is only dependent on divalent metals for activity, whereas IDI-2 requires a metal, FMN and NADPH. IDI-2 catalyzes the interconversion of isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP) in the mevalonate pathway. Probab=99.73 E-value=3.4e-16 Score=135.56 Aligned_cols=145 Identities=27% Similarity=0.393 Sum_probs=113.5 Q ss_pred HHHHHHHHHHHHCCCCCEECCCC---CCHHHHHHHHHCCCCEEEEEECCC----CCCCCCHH----------CCCCCCCC Q ss_conf 89999999998579872451464---699999999854875899610278----76653100----------01574310 Q gi|254780889|r 260 KVLDAVVQIKKNFPSLLVMAGNI---ATAEGALALIDAGADIIKVGIGPG----SICTTRVV----------TGVGCPQL 322 (493) Q Consensus 260 ~~~~~i~~~k~~~~~~~ii~GNv---~t~~~~~~l~~~Gad~ikvGig~G----s~CtTr~~----------~g~g~pq~ 322 (493) ..++.|+++++..+ +|||+=-| -|+|.++.|.++|+++|-|+=-+| .|=..|.. ...|.|++ T Consensus 165 ~~~~~I~~l~~~~~-vPVIvKeVG~Gis~eda~~l~~~Gv~~IdVSghGGTnf~~IE~~R~~d~~~~~~~~l~dwGi~T~ 243 (326) T cd02811 165 GWLERIEELVKALS-VPVIVKEVGFGISRETAKRLADAGVKAIDVAGAGGTSWARVENYRAKDSDQRLAEYFADWGIPTA 243 (326) T ss_pred HHHHHHHHHHHHCC-CCEEEEECCCCCCHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHCCCCCCHHHHHHHHHCCCCHH T ss_conf 89999999998479-98588524789999999999967999999789999753665310156731337889886285569 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCC Q ss_conf 03788999862269819972654377762133671487899323010121677644873680588873320068875221 Q gi|254780889|r 323 SAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGS 402 (493) Q Consensus 323 ~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~ 402 (493) .++.++..+.. ++|||+|||||+.-|++||||+||++|++|+.|-.... T Consensus 244 ~sL~e~~~~~~--~~~iiasGGir~g~Dv~KalaLGA~~vgiar~~L~~~~----------------------------- 292 (326) T cd02811 244 ASLLEVRSALP--DLPLIASGGIRNGLDIAKALALGADLVGMAGPFLKAAL----------------------------- 292 (326) T ss_pred HHHHHHHHHCC--CCEEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHHHHH----------------------------- T ss_conf 99999997389--98199868878779999999955533652799999985----------------------------- Q ss_pred HHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 2221012333433100454026621057589989999999998555157587768973 Q gi|254780889|r 403 SARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQK 460 (493) Q Consensus 403 ~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~ 460 (493) +|.+ .+..++..+..-||..|..+|++||+||++ T Consensus 293 -----------------~G~~-------~v~~~l~~~~~el~~~M~l~G~~~i~eLr~ 326 (326) T cd02811 293 -----------------EGEE-------AVIETIEQIIEELRTAMFLTGAKNLAELKQ 326 (326) T ss_pred -----------------CCHH-------HHHHHHHHHHHHHHHHHHHHCCCCHHHHCC T ss_conf -----------------4899-------999999999999999999868998899748 No 80 >cd04640 CBS_pair_27 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.73 E-value=1.8e-17 Score=144.81 Aligned_cols=107 Identities=26% Similarity=0.404 Sum_probs=91.8 Q ss_pred CCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCC-----------CCCCCCCCCCCCCCC--CCCC Q ss_conf 878994654303456566760504167866886027887235450014-----------344672440026786--3257 Q gi|254780889|r 101 NPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFA-----------SNAQQAVGELMTRNL--ITVK 167 (493) Q Consensus 101 dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~-----------~~~~~~V~eiMt~~l--itv~ 167 (493) +|+++.|++|+.||+++|.+++++++||+|++ ++++||||.+|+++. ...+.+|+++||... .++. T Consensus 2 ~Pvtv~~dtti~eA~~~M~~~~v~~l~VvD~~-~~lvGIiT~~Dl~~~~~~~~~~~~~~~~~~l~V~dvMt~~~~~~a~~ 80 (126) T cd04640 2 KPIVIPADTSIDEALELMIKHGVRLLLVVDSD-DNFIGVITAVDLLGEEPIKRIQEGGISRSELTVADVMTPKEDLKALD 80 (126) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CCEEEEEECCCCCCCHHHHHHHHCCCCHHHCCHHHCCCCCCCEEEEC T ss_conf 89696899959999999998397679898799-97999997630012156888864489825505313355452046302 Q ss_pred ----CCCCHHHHHHHHHHCCCCEEEEECCCC-CEEEEEEECCCHHC Q ss_conf ----303589999999865533064574799-78688980210001 Q gi|254780889|r 168 ----KTVNLENAKALLHQHRIEKLLVVDDDG-CCIGLITVKDIERS 208 (493) Q Consensus 168 ----~~~~l~eA~~iM~~~ki~~LPVVDe~g-~LvGIIT~~DIlk~ 208 (493) ++.++.++...|.+++++++||||+++ ++.|+||.+||.|. T Consensus 81 ~~~~~~a~v~d~~~~m~~~~~~hllVvd~~~~~l~GiiS~~DI~R~ 126 (126) T cd04640 81 LEELENASVGDVVETLKASGRQHALVVDREHHQIRGIISTSDIARQ 126 (126) T ss_pred CCCCCCCCHHHHHHHHHHCCCEEEEEEECCCCEEEEEEEHHHHCCC T ss_conf 2000348599999999983980874998999889899987994269 No 81 >COG4109 Predicted transcriptional regulator containing CBS domains [Transcription] Probab=99.73 E-value=1.6e-17 Score=145.18 Aligned_cols=151 Identities=28% Similarity=0.365 Sum_probs=128.5 Q ss_pred EECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHCCCEE--CCCEEEECCCCHHHHHHHHHHH Q ss_conf 60031783378523688899999977987999789898999999998742238815--2878994654303456566760 Q gi|254780889|r 44 TLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQVKKFESGMV--VNPVTISPYATLADALALMKKY 121 (493) Q Consensus 44 ~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~kVKr~e~gmI--~dPVti~pd~TI~ea~~lm~~~ 121 (493) .+.+|++|+.-||.|-+.| |.=|. .|+-|... +.-.++.|+ +++.++.+++++.+..++..+. T Consensus 155 e~~lPvlstsYDTFTVAtm-IN~Al--------~n~lIKkd------I~~Vedi~~P~~~~~yL~~~d~v~d~~~l~~kt 219 (432) T COG4109 155 EKGLPVLSTSYDTFTVATM-INKAL--------SNQLIKKD------IITVEDIMTPLEDTSYLRETDTVEDWLDLVEKT 219 (432) T ss_pred CCCCCEEEECCCCEEHHHH-HHHHH--------HHHHHHHH------EEEHHHHCCCCCCCEECCCCCCHHHHHHHHHHC T ss_conf 1488448704521309999-98998--------77666443------255877446443432324645099999999971 Q ss_pred CCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEE Q ss_conf 50416786688602788723545001434467244002678632573035899999998655330645747997868898 Q gi|254780889|r 122 SISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLIT 201 (493) Q Consensus 122 ~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT 201 (493) ++++|||+|.. .+++|+||.||+.. .++++++..+||++|+|+.+.+++..+.++|....++.+||||++.+|+|+|| T Consensus 220 ~~sRfPVvn~~-~kvvGvVt~rDv~~-~~~~t~ieKVMtknp~tv~~~tsVAsvaq~MiwE~iem~PVv~~n~~llGiit 297 (432) T COG4109 220 GHSRFPVVNRS-MKVVGVVTMRDVLD-KKPSTTIEKVMTKNPITVRAKTSVASVAQMMIWEGIEMLPVVDSNNTLLGIIT 297 (432) T ss_pred CCCCCCEECCC-CEEEEEEEEHHHHC-CCCCCCHHHHHCCCCEEECCCCHHHHHHHHHHHCCCEEEEEECCCCEEEEEEE T ss_conf 98734133144-26999998346444-79986177774269755425226889988887645526568837861898988 Q ss_pred ECCCHHCCCC Q ss_conf 0210001017 Q gi|254780889|r 202 VKDIERSQLN 211 (493) Q Consensus 202 ~~DIlk~~~~ 211 (493) |+|++|+.++ T Consensus 298 R~dvlk~lq~ 307 (432) T COG4109 298 RQDVLKSLQM 307 (432) T ss_pred HHHHHHHHHH T ss_conf 9999999987 No 82 >cd02809 alpha_hydroxyacid_oxid_FMN Family of homologous FMN-dependent alpha-hydroxyacid oxidizing enzymes. This family occurs in both prokaryotes and eukaryotes. Members of this family include flavocytochrome b2 (FCB2), glycolate oxidase (GOX), lactate monooxygenase (LMO), mandelate dehydrogenase (MDH), and long chain hydroxyacid oxidase (LCHAO). In green plants, glycolate oxidase is one of the key enzymes in photorespiration where it oxidizes glycolate to glyoxylate. LMO catalyzes the oxidation of L-lactate to acetate and carbon dioxide. MDH oxidizes (S)-mandelate to phenylglyoxalate. It is an enzyme in the mandelate pathway that occurs in several strains of Pseudomonas which converts (R)-mandelate to benzoate. Probab=99.70 E-value=9e-16 Score=132.56 Aligned_cols=164 Identities=26% Similarity=0.396 Sum_probs=127.4 Q ss_pred HHHHHHHHHHHCCCCEEEE--ECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCC Q ss_conf 1567899987313666998--24841138899999999985798724514646999999998548758996102787665 Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVV--DTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICT 310 (493) Q Consensus 233 d~~eRa~~LveaGvDvivi--D~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~Ct 310 (493) +-.+|++ ++|+.+|++ |++..-...-.+-+++||+..+ .|+|.=.|-|+|-|+...++|+|+|-|-=-+|- T Consensus 133 ~li~rA~---~aG~~al~lTvD~p~~g~R~~w~~i~~l~~~~~-~p~i~KGi~~~~DA~~a~~~G~dgI~VSNHGGR--- 205 (299) T cd02809 133 DLLRRAE---AAGYKALVLTVDTPVLGRRLTWDDLAWLRSQWK-GPLILKGILTPEDALRAVDAGADGIVVSNHGGR--- 205 (299) T ss_pred HHHHHHH---HCCCCEEEEECCCCCCCCCCCHHHHHHHHHHCC-CCEEEECCCCHHHHHHHHHCCCCEEEECCCCCC--- T ss_conf 9999999---859998999705898788799999999998669-987997278899999999859988997288733--- Q ss_pred CCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEE Q ss_conf 31000157431003788999862269819972654377762133671487899323010121677644873680588873 Q gi|254780889|r 311 TRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYR 390 (493) Q Consensus 311 Tr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yr 390 (493) +.-...+.+.++.+++++... +++|+.|||||...||+||||+||++||+|..|. | T Consensus 206 ---qlD~~p~~i~~L~~i~~~v~~-~~~v~~DgGvR~G~Dv~KAlaLGA~~V~iGRp~l--------------------~ 261 (299) T cd02809 206 ---QLDGAPATIDALPEIVAAVGG-RIEVLLDGGIRRGTDVLKALALGADAVLIGRPFL--------------------Y 261 (299) T ss_pred ---CCCCCCCHHHHHHHHHHHHCC-CEEEEECCCCCCHHHHHHHHHCCCCEEEECHHHH--------------------H T ss_conf ---368887789999999998546-7289971884753689999976998898778999--------------------9 Q ss_pred CCCCHHHHHHCCHHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 320068875221222101233343310045402662105758998999999999855515758776897 Q gi|254780889|r 391 GMGSVAAMERGSSARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQ 459 (493) Q Consensus 391 Gm~S~~A~~~~~~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~ 459 (493) |.+ ..|.+| ++..+..|..-|+..|..+|+.+|.|+. T Consensus 262 ~l~-------------------------~~G~~G-------v~~~~~~l~~El~~~M~l~G~~~i~~l~ 298 (299) T cd02809 262 GLA-------------------------AGGEAG-------VAHVLEILRDELERAMALLGCASLADLD 298 (299) T ss_pred HHH-------------------------HCCHHH-------HHHHHHHHHHHHHHHHHHHCCCCHHHCC T ss_conf 988-------------------------544999-------9999999999999999984899877779 No 83 >cd04606 CBS_pair_Mg_transporter This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain in the magnesium transporter, MgtE. MgtE and its homologs are found in eubacteria, archaebacteria, and eukaryota. Members of this family transport Mg2+ or other divalent cations into the cell via two highly conserved aspartates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.69 E-value=8.5e-17 Score=139.90 Aligned_cols=103 Identities=27% Similarity=0.490 Sum_probs=93.8 Q ss_pred EEECCCCHHHHHHHHHHHC-----CCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 9946543034565667605-----04167866886027887235450014344672440026786325730358999999 Q gi|254780889|r 104 TISPYATLADALALMKKYS-----ISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNLENAKAL 178 (493) Q Consensus 104 ti~pd~TI~ea~~lm~~~~-----is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l~eA~~i 178 (493) +++|+.|++||++.|+++. ++.+||+|++ |+|+|+||.+|+.. .+.+.+++++|++++.++++++++.+|.++ T Consensus 1 ~~~~~~TV~eai~~lr~~~~~~~~~~~i~Vvd~~-~~l~G~vt~~dll~-~~~~~~v~~iM~~~~~~v~~~~~~~~a~~~ 78 (109) T cd04606 1 AVREDWTVGEALEYLRRNADDPETIYYIYVVDEE-GRLLGVVSLRDLLL-ADPDTPVSDIMDTDVISVSADDDQEEVARL 78 (109) T ss_pred CCCCCCCHHHHHHHHHHCCCCCCEEEEEEEECCC-CEEEEEEEHHHHHH-CCCCCCHHHHHCCCEEECCCCCCHHHHHHH T ss_conf 9699889999999998518885617799999799-97999998677224-188871666300213633799989999999 Q ss_pred HHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 986553306457479978688980210001 Q gi|254780889|r 179 LHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 179 M~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) |.+++++++||||++|+|+|+||+.|++.. T Consensus 79 m~~~~~~~lPVVd~~~~lvGiIt~~Di~~~ 108 (109) T cd04606 79 FEKYDLLALPVVDEEGRLVGIITVDDVIDV 108 (109) T ss_pred HHHHCCCEEEEECCCCEEEEEEEHHHHHHH T ss_conf 998527604689889979999996896844 No 84 >COG2905 Predicted signal-transduction protein containing cAMP-binding and CBS domains [Signal transduction mechanisms] Probab=99.69 E-value=1.7e-16 Score=137.77 Aligned_cols=157 Identities=29% Similarity=0.377 Sum_probs=117.6 Q ss_pred HHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC-----CCCCCCCCCCCCCCCCC Q ss_conf 9874223881528789946543034565667605041678668860278872354500-----14344672440026786 Q gi|254780889|r 89 HQVKKFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR-----FASNAQQAVGELMTRNL 163 (493) Q Consensus 89 ~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir-----~~~~~~~~V~eiMt~~l 163 (493) .||++ .++..|++++|..+|.+|.+.|.+++++++.+++++ ++++||||++|++ ...+.+.+|+++||.++ T Consensus 149 trv~~---~~~~~~~~v~~~~~i~~aa~km~~~gv~s~v~l~~~-~~~~GIvT~~dl~~~v~~~g~~~~~~V~evmT~p~ 224 (610) T COG2905 149 TRVGE---VKTLPAVTVSPQASIQDAARKMKDEGVSSLVVLDDS-GPLLGIVTRKDLRSRVIADGRSKTQKVSEVMTSPV 224 (610) T ss_pred HHHHH---HHCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCC-CCCCCEEEHHHHHHHHHHCCCCCCCCHHHHHCCCC T ss_conf 98777---741787526865767999999996088728998379-87433242277789998658974233545414684 Q ss_pred CCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCC-CCC-CCCCCCCCEEEEEEEECCCHHHHHHHHH Q ss_conf 32573035899999998655330645747997868898021000101-742-2326568765556651370156789998 Q gi|254780889|r 164 ITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQL-NPN-ATKDSKGRLRVAAAVSVAKDIADRVGPL 241 (493) Q Consensus 164 itv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~-~p~-A~~D~~grL~VgAAIg~~~d~~eRa~~L 241 (493) +++.+++.+.||+-+|.+++|+++||. ++|+.+|++|.+||++... ++. ..++....=-+...=.+.+.-.+++..| T Consensus 225 ~svd~~~~~feAml~m~r~~I~hl~V~-e~gq~~Gilt~~dIl~l~s~~~~vl~~~i~~a~sI~el~~~~e~~~~~~s~L 303 (610) T COG2905 225 ISVDRGDFLFEAMLMMLRNRIKHLPVT-EDGQPLGILTLTDILRLFSQNSIVLVKRIARASSIEELANTAELLDDSLSTL 303 (610) T ss_pred EEECCCCHHHHHHHHHHHHCCCEEEEE-CCCEEEEEEEHHHHHHHHCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 454476569999999998077442361-1890467766999997617785453177554401999999999999999999 Q ss_pred HHCCCCEEE Q ss_conf 731366699 Q gi|254780889|r 242 FDVNVDLVV 250 (493) Q Consensus 242 veaGvDviv 250 (493) +.-|++... T Consensus 304 ~~~gv~~~~ 312 (610) T COG2905 304 VSRGVRTEF 312 (610) T ss_pred HHCCCCHHH T ss_conf 857997899 No 85 >cd04598 CBS_pair_GGDEF_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the GGDEF (DiGuanylate-Cyclase (DGC)) domain. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.67 E-value=1.8e-16 Score=137.49 Aligned_cols=106 Identities=26% Similarity=0.471 Sum_probs=94.8 Q ss_pred CCEEEECCCCHHHHHHHHHHH-CCCCEEEECCCCEEEEEEEEHHHHC------CCCC--CCCCCCCCCCCCCCCCCCCCC Q ss_conf 878994654303456566760-5041678668860278872354500------1434--467244002678632573035 Q gi|254780889|r 101 NPVTISPYATLADALALMKKY-SISGIPVVESDVGKLVGILTNRDVR------FASN--AQQAVGELMTRNLITVKKTVN 171 (493) Q Consensus 101 dPVti~pd~TI~ea~~lm~~~-~is~iPVVD~~~gkLiGIVT~rDir------~~~~--~~~~V~eiMt~~litv~~~~~ 171 (493) .+.|++|++|+.|+.++|.++ +++++||+|+ ++++|+||++|+. |..+ .+.||+++|+++++++.++++ T Consensus 2 pv~tv~pd~t~~ev~~~f~~~~~i~~lpVvd~--~r~vGiisr~dl~~~~~~~~g~~l~~~~pV~~~M~~~p~~v~~~~~ 79 (119) T cd04598 2 PAPTVSPDTTVNDVLERFERDPDLSALAVVDD--GRPVGLIMREALMELLSTPYGRALYGKKPVSEVMDPDPLIVEADTP 79 (119) T ss_pred CCCCCCCCCCHHHHHHHHHHCCCCCEEEEEEC--CEEEEEEEHHHHHHHHCCCCCHHHHCCCCHHHHCCCCCEEECCCCC T ss_conf 98574999849999999987899676899879--9889999899999987074322330389589971689879989896 Q ss_pred HHHHHHHHHHCCCCEEE---EECCCCCEEEEEEECCCHHC Q ss_conf 89999999865533064---57479978688980210001 Q gi|254780889|r 172 LENAKALLHQHRIEKLL---VVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 172 l~eA~~iM~~~ki~~LP---VVDe~g~LvGIIT~~DIlk~ 208 (493) +.+|.++|.++..+++| ||+++|+++|++|.+|++|. T Consensus 80 i~~~~~~~~~~~~~~l~~~ivV~~~g~~~Givt~~DLlr~ 119 (119) T cd04598 80 LEEVSRLATGRDSQNLYDGFIVTEEGRYLGIGTVKDLLRQ 119 (119) T ss_pred HHHHHHHHHHCCCCCCCCCCEEEECCEEEEEEEHHHHHCC T ss_conf 9999999985693325788589579989999898994079 No 86 >cd02922 FCB2_FMN Flavocytochrome b2 (FCB2) FMN-binding domain. FCB2 (AKA L-lactate:cytochrome c oxidoreductase) is a respiratory enzyme located in the intermembrane space of fungal mitochondria which catalyzes the oxidation of L-lactate to pyruvate. FCB2 also participates in a short electron-transport chain involving cytochrome c and cytochrome oxidase which ultimately directs the reducing equivalents gained from L-lactate oxidation to oxygen, yielding one molecule of ATP for every L-lactate molecule consumed. FCB2 is composed of 2 domains: a C-terminal flavin-binding domain, which includes the active site for lacate oxidation, and an N-terminal b2-cytochrome domain, required for efficient cytochrome c reduction. FCB2 is a homotetramer and contains two noncovalently bound cofactors, FMN and heme per subunit. Probab=99.65 E-value=4.6e-14 Score=120.27 Aligned_cols=296 Identities=20% Similarity=0.301 Sum_probs=187.5 Q ss_pred CCHHEEECCCC-CCCCCCCEEEEEEECCCCEECCCEEEECCCCC------CCHHHHHHHHHCCCEEEE--CCCCCHHHHH Q ss_conf 60005752764-32475422533430388160031783378523------688899999977987999--7898989999 Q gi|254780889|r 15 TFDDVLLRPEF-SNVLPRDIDISTRIAKDFTLNLPIMSAAMDQV------TDSRLAIAMAQAGGLGVI--HRNFSPSEQV 85 (493) Q Consensus 15 TfDDVllvP~~-s~v~~~~vdl~t~lt~~i~l~iPivSs~MDTV------Te~~MAiamA~~GGiGvI--Hrn~sie~qa 85 (493) -||.+.|+|.. .++. ++|++|.|= +.++..||+-|||--- -|..+|.+-++.|=.-++ .-++|+|+-+ T Consensus 35 Af~~~~l~Pr~L~dvs--~~dtst~l~-G~~~~~P~~iAP~g~~~l~hp~gE~a~AraA~~~gi~~~lSt~ss~slEdVa 111 (344) T cd02922 35 AFQRIRFRPRVLRDVE--KVDTSTTIL-GHKVSLPFFISPAALAKLAHPDGELNLARAAGKHGILQMISTNASCSLEEIV 111 (344) T ss_pred HHHHCCEECCCCCCCC--CCCCCEEEC-CEECCCCEEECHHHHHHHCCCCHHHHHHHHHHHCCCCEEEECCCCCCHHHHH T ss_conf 9984767653324888--898855689-8336775156647776432884569999999974886574057778889999 Q ss_pred HHHHHHHHHCCCEECCCEEEECCC-CHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCC Q ss_conf 999987422388152878994654-3034565667605041678668860278872354500143446724400267863 Q gi|254780889|r 86 AQVHQVKKFESGMVVNPVTISPYA-TLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLI 164 (493) Q Consensus 86 ~~V~kVKr~e~gmI~dPVti~pd~-TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~li 164 (493) +... .. ...... +.+..|. -..+.++..++.++..+.++-|- -..| -.++|+|+.- ..+ T Consensus 112 ~a~~---~~-~~~wfQ-LY~~~dr~~~~~li~RA~~aG~~alvlTvD~--p~~G-~R~rd~r~~~--~~~---------- 171 (344) T cd02922 112 DARP---PD-QPLFFQ-LYVNKDRTKTEELLKRAEKLGAKAIFLTVDA--PVLG-KRERDERLKA--EEA---------- 171 (344) T ss_pred HHCC---CC-CCEEEE-EECCCCHHHHHHHHHHHHHCCCCEEEEECCC--CCCC-CCHHHHHCCC--CCC---------- T ss_conf 8656---89-866999-8247767999999999998699889995678--8877-5226665077--778---------- Q ss_pred CCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHC Q ss_conf 25730358999999986553306457479978688980210001017422326568765556651370156789998731 Q gi|254780889|r 165 TVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDV 244 (493) Q Consensus 165 tv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~Lvea 244 (493) .... +....+.. ......+.. .. T Consensus 172 ---~~~~----------------------------------------~~~~~~~~-----------~~~~~~~~~---~~ 194 (344) T cd02922 172 ---VSDG----------------------------------------PAGKKTKA-----------KGGGAGRAM---SG 194 (344) T ss_pred ---CCCC----------------------------------------CCCCCCCC-----------CCCHHHHHH---HH T ss_conf ---8766----------------------------------------54333344-----------663166777---75 Q ss_pred CCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHH Q ss_conf 36669982484113889999999998579872451464699999999854875899610278766531000157431003 Q gi|254780889|r 245 NVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSA 324 (493) Q Consensus 245 GvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~a 324 (493) .+|-+ ..-+-++|||+..+ .|+|.=.|-|+|-|+...+.|+|+|-|-=-+| | +.-.+.+.+.+ T Consensus 195 -----~~~~~-----~tw~di~~lr~~~~-~plivKGIl~~~DA~~A~~~G~dgIiVSNHGG-----R-qLD~~~~~i~~ 257 (344) T cd02922 195 -----FIDPT-----LTWDDIKWLRKHTK-LPIVLKGVQTVEDAVLAAEYGVDGIVLSNHGG-----R-QLDTAPAPIEV 257 (344) T ss_pred -----CCCCC-----CCHHHHHHHHHHCC-CCEEECCCCCHHHHHHHHHCCCCEEEEECCCC-----C-CCCCCCCHHHH T ss_conf -----04888-----99999999998669-97010025779999999965998899718862-----1-25788318999 Q ss_pred HHHHHHHHH--CCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCC Q ss_conf 788999862--269819972654377762133671487899323010121677644873680588873320068875221 Q gi|254780889|r 325 IMSVVEVAE--RAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGS 402 (493) Q Consensus 325 v~~~~~~~~--~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~ 402 (493) +-++.+.+. ..++||+.|||||..-||+||||+||++||+|..|. ||.+- T Consensus 258 Lp~I~~~~~av~~~~~v~~DgGiR~G~DV~KAlALGA~aV~iGRp~l--------------------~gla~-------- 309 (344) T cd02922 258 LLEIRKHCPEVFDKIEVYVDGGVRRGTDVLKALCLGAKAVGLGRPFL--------------------YALSA-------- 309 (344) T ss_pred HHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHCCCCEEEECHHHH--------------------HHHHH-------- T ss_conf 89999988985887089971885757899999976999897678999--------------------99884-------- Q ss_pred HHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 222101233343310045402662105758998999999999855515758776897 Q gi|254780889|r 403 SARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQ 459 (493) Q Consensus 403 ~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~ 459 (493) .|.+| |+.+|.-|..-|+.+|.-+|+.+|.|+. T Consensus 310 -----------------~G~~G-------v~~~l~il~~El~~~M~l~G~~si~~l~ 342 (344) T cd02922 310 -----------------YGEEG-------VEKAIQILKDEIETTMRLLGVTSLDQLG 342 (344) T ss_pred -----------------CCHHH-------HHHHHHHHHHHHHHHHHHHCCCCHHHCC T ss_conf -----------------43999-------9999999999999999985899888749 No 87 >PRK11543 gutQ D-arabinose 5-phosphate isomerase; Provisional Probab=99.65 E-value=4.7e-16 Score=134.59 Aligned_cols=146 Identities=23% Similarity=0.318 Sum_probs=112.2 Q ss_pred CCCCHHHHHHHHHCCCEEEECCCCCHHHHHH-----HH-HHH-HHHCCCEEC--CCEEEECCCCHHHHHHHHHHHCCCCE Q ss_conf 2368889999997798799978989899999-----99-987-422388152--87899465430345656676050416 Q gi|254780889|r 56 QVTDSRLAIAMAQAGGLGVIHRNFSPSEQVA-----QV-HQV-KKFESGMVV--NPVTISPYATLADALALMKKYSISGI 126 (493) Q Consensus 56 TVTe~~MAiamA~~GGiGvIHrn~sie~qa~-----~V-~kV-Kr~e~gmI~--dPVti~pd~TI~ea~~lm~~~~is~i 126 (493) -+-+..+|+++..+ |+.+.|+=+. .. ++. .+.++.|.. +.-.+.+++++.|++..|.+.+++.+ T Consensus 161 lalGDALAv~lm~~-------r~F~~~dFa~~HPgG~LGk~Ll~~V~dlM~~~~~lP~v~~~~~l~dai~~ms~k~lG~v 233 (321) T PRK11543 161 LMMGDALAMAVMQA-------RGFNEEDFARSHPAGALGARLLNKVHHLMRRGDAIPQVALTASVMDAMLELSRTGLGLV 233 (321) T ss_pred HHHHHHHHHHHHHH-------HCCCHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHCCCCCCEE T ss_conf 99988999999999-------59998789760989888899999999997335578989999879999999664886179 Q ss_pred EEECCCCEEEEEEEEHHHHCC----CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEE Q ss_conf 786688602788723545001----4344672440026786325730358999999986553306457479978688980 Q gi|254780889|r 127 PVVESDVGKLVGILTNRDVRF----ASNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITV 202 (493) Q Consensus 127 PVVD~~~gkLiGIVT~rDir~----~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~ 202 (493) .|+|++ ++|+|++|+.|+|- ..+.+.+++++||++|+++.+++...+|+++|.+++|..|||||++++++|++.. T Consensus 234 ~Vvd~~-~kL~GIITDGDLRR~l~k~~~L~~~v~~vMT~nP~tI~~d~la~eAl~iM~~~kI~~LpVvd~~~k~vGiIhi 312 (321) T PRK11543 234 AVCDAQ-QQVKGVFTDGDLRRWLVGGGALTTPVNEAMTVGGTTLQAQSRAIDAKEILMKRKITAAPVVDENGKLTGAINL 312 (321) T ss_pred EEECCC-CCEEEEEECHHHHHHHHCCCCHHHHHHHHHCCCCCEECCCCHHHHHHHHHHHCCCCEEEEECCCCEEEEEEEH T ss_conf 983377-5068887443899998637856768999848999578998719999999998798689998589959999638 Q ss_pred CCCHHCC Q ss_conf 2100010 Q gi|254780889|r 203 KDIERSQ 209 (493) Q Consensus 203 ~DIlk~~ 209 (493) +|+++.- T Consensus 313 HDLl~~G 319 (321) T PRK11543 313 QDFYQAG 319 (321) T ss_pred HHHHHCC T ss_conf 9786768 No 88 >cd04737 LOX_like_FMN L-Lactate oxidase (LOX) FMN-binding domain. LOX is a member of the family of FMN-containing alpha-hydroxyacid oxidases and catalyzes the oxidation of l-lactate using molecular oxygen to generate pyruvate and H2O2. This family occurs in both prokaryotes and eukaryotes. Members of this family include flavocytochrome b2 (FCB2), glycolate oxidase (GOX), lactate monooxygenase (LMO), mandelate dehydrogenase (MDH), and long chain hydroxyacid oxidase (LCHAO). Probab=99.63 E-value=1.3e-13 Score=117.18 Aligned_cols=296 Identities=24% Similarity=0.263 Sum_probs=188.5 Q ss_pred CCCHHEEECCCC-CCCCCCCEEEEEEECCCCEECCCEEEECCCC------CCCHHHHHHHHHCCCEEEEC--CCCCHHHH Q ss_conf 460005752764-3247542253343038816003178337852------36888999999779879997--89898999 Q gi|254780889|r 14 LTFDDVLLRPEF-SNVLPRDIDISTRIAKDFTLNLPIMSAAMDQ------VTDSRLAIAMAQAGGLGVIH--RNFSPSEQ 84 (493) Q Consensus 14 lTfDDVllvP~~-s~v~~~~vdl~t~lt~~i~l~iPivSs~MDT------VTe~~MAiamA~~GGiGvIH--rn~sie~q 84 (493) -.|+.+.|.|.. .++. ++|++|.+= +.++..|+.-|||-. --|..+|-+-++.|-+-++- -+.|+|+- T Consensus 42 ~af~~~~l~PrvL~dv~--~~d~~t~ll-G~~~~~P~~iaP~g~~~l~hp~gE~~~AraA~~~gi~~~lSt~s~~s~Eei 118 (351) T cd04737 42 RAFNHKQIVPRVLQGVE--SPDTSTELL-GIKLKTPIIMAPIAAHGLAHATGEVATARGMAEVGSLFSISTYSNTSLEEI 118 (351) T ss_pred HHHHHCCEECCCCCCCC--CCCCCEEEC-CCCCCCCEEECHHHHHHHCCCCHHHHHHHHHHHCCCCEEECCCCCCCHHHH T ss_conf 99984717553345877--798843578-802577626553887404468478999999997598634056777789999 Q ss_pred HHHHHHHHHHCCC-EECCCEEEECCCC-HHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCC Q ss_conf 9999987422388-1528789946543-0345656676050416786688602788723545001434467244002678 Q gi|254780889|r 85 VAQVHQVKKFESG-MVVNPVTISPYAT-LADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRN 162 (493) Q Consensus 85 a~~V~kVKr~e~g-mI~dPVti~pd~T-I~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~ 162 (493) ++.- ..+ ... -+.+..|.. ..+.++..++.++..+.++-|- +..=-.+||++.. +..| T Consensus 119 a~a~------~~~~~wf-QLY~~~dr~~~~~li~RA~~aG~~alvlTVD~---p~~g~Rerd~r~~--~~~p-------- 178 (351) T cd04737 119 AKAS------NGGPKWF-QLYMSKDDGFNRSLLDRAKAAGAKAIILTADA---TVGGNREADIRNK--FQFP-------- 178 (351) T ss_pred HHHC------CCCCEEE-EEECCCCHHHHHHHHHHHHHCCCCEEEEEECC---CCCCCCHHHHHCC--CCCC-------- T ss_conf 9746------7997089-97135887999999999998699989996317---8878627788629--9889-------- Q ss_pred CCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHH Q ss_conf 63257303589999999865533064574799786889802100010174223265687655566513701567899987 Q gi|254780889|r 163 LITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLF 242 (493) Q Consensus 163 litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~Lv 242 (493) .. +|.. ..+...+. . +... ....... T Consensus 179 -----~~-----------------~~~~------------------------~~~~~~~~---~--~~~~---~~~~~~~ 204 (351) T cd04737 179 -----FG-----------------MPNL------------------------NHFSEGTG---K--GKGI---SEIYAAA 204 (351) T ss_pred -----CC-----------------CCCC------------------------CCCCCCCC---C--CCCH---HHHHHHH T ss_conf -----99-----------------8722------------------------34467775---5--5556---8898863 Q ss_pred HCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCC Q ss_conf 31366699824841138899999999985798724514646999999998548758996102787665310001574310 Q gi|254780889|r 243 DVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQL 322 (493) Q Consensus 243 eaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~ 322 (493) .. ..--+-++|||+.. +.|+|.-.|-|+|-|+...++|+|+|-|-=-+|- +.-...+.+ T Consensus 205 ~~--------------~~~w~di~~lr~~~-~lplilKGI~~~eDA~~A~~~G~dgIvVSNHGGR------QLD~~p~~i 263 (351) T cd04737 205 KQ--------------KLSPADIEFIAKIS-GLPVIVKGIQSPEDADVAINAGADGIWVSNHGGR------QLDGGPASF 263 (351) T ss_pred CC--------------CCCHHHHHHHHHHC-CCCEEECCCCCHHHHHHHHHCCCCEEEECCCCCC------CCCCCHHHH T ss_conf 25--------------79989999999864-9985323667799999998749988997787512------356760478 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCC Q ss_conf 03788999862269819972654377762133671487899323010121677644873680588873320068875221 Q gi|254780889|r 323 SAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGS 402 (493) Q Consensus 323 ~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~ 402 (493) .++-+++++... .++|+.|||||..-||+||||+||++||+|..|- ||++- T Consensus 264 ~~LpeI~~av~~-~~~V~~DgGIR~G~DV~KALALGA~aV~iGRp~l--------------------~glaa-------- 314 (351) T cd04737 264 DSLPEIAEAVNH-RVPIIFDSGVRRGEHVFKALASGADAVAVGRPVL--------------------YGLAL-------- 314 (351) T ss_pred HHHHHHHHHHCC-CCEEEECCCCCCHHHHHHHHHCCCCEEEECHHHH--------------------HHHHH-------- T ss_conf 899999998668-9649976986746899999976998897578999--------------------98871-------- Q ss_pred HHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 2221012333433100454026621057589989999999998555157587768973 Q gi|254780889|r 403 SARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQK 460 (493) Q Consensus 403 ~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~ 460 (493) -|.+| |+.++.-|..-|+.+|..+|+.+|.|+.. T Consensus 315 -----------------~G~~G-------V~~~l~iL~~El~~~M~l~G~~si~dl~~ 348 (351) T cd04737 315 -----------------GGAQG-------VASVLEHLNKELKIVMQLAGTRTIEDVKR 348 (351) T ss_pred -----------------CCHHH-------HHHHHHHHHHHHHHHHHHHCCCCHHHCCC T ss_conf -----------------33899-------99999999999999999868999888490 No 89 >cd04736 MDH_FMN Mandelate dehydrogenase (MDH)-like FMN-binding domain. MDH is part of a widespread family of homologous FMN-dependent a-hydroxy acid oxidizing enzymes that oxidizes (S)-mandelate to phenylglyoxalate. MDH is an enzyme in the mandelate pathway that occurs in several strains of Pseudomonas which converts (R)-mandelate to benzoate. This family occurs in both prokaryotes and eukaryotes. Members of this family include flavocytochrome b2 (FCB2), glycolate oxidase (GOX), lactate monooxygenase (LMO), mandelate dehydrogenase (MDH), and long chain hydroxyacid oxidase (LCHAO). Probab=99.63 E-value=1e-13 Score=117.77 Aligned_cols=309 Identities=19% Similarity=0.237 Sum_probs=187.9 Q ss_pred CCCHHEEECCCCC-CCCCCCEEEEEEECCCCEECCCEEEECCCC------CCCHHHHHHHHHCCCEEEEC--CCCCHHHH Q ss_conf 4600057527643-247542253343038816003178337852------36888999999779879997--89898999 Q gi|254780889|r 14 LTFDDVLLRPEFS-NVLPRDIDISTRIAKDFTLNLPIMSAAMDQ------VTDSRLAIAMAQAGGLGVIH--RNFSPSEQ 84 (493) Q Consensus 14 lTfDDVllvP~~s-~v~~~~vdl~t~lt~~i~l~iPivSs~MDT------VTe~~MAiamA~~GGiGvIH--rn~sie~q 84 (493) -.|+.+.|+|..- ++ .++|++|.+= +.++..|+.-|||-- =-|..+|-+-++.|=.=++- -++|+|+- T Consensus 34 ~af~~~~l~PrvL~dv--~~~d~st~ll-G~~~~~P~~iaP~g~~~l~hp~gE~a~AraA~~~gi~~~lSt~ss~s~EeV 110 (361) T cd04736 34 DAFDRWRFIPRRLVDV--SKRDISASLF-GKVWSAPLVIAPTGLNGAFWPNGDLALARAAAKAGIPFVLSTASNMSIEDV 110 (361) T ss_pred HHHHHCCEECCCCCCC--CCCCCCCCCC-CCCCCCCEEECCHHHHHHHCCCCHHHHHHHHHHCCCCEEECCCCCCCHHHH T ss_conf 9998475766223588--7899763158-840578547876357766088842999999998798789679999999999 Q ss_pred HHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCC Q ss_conf 99999874223881528789946543034565667605041678668860278872354500143446724400267863 Q gi|254780889|r 85 VAQVHQVKKFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLI 164 (493) Q Consensus 85 a~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~li 164 (493) ++... ...... +.+...+-..+.++.-++-++..+.++-|- -+.| ..+||+|+. + T Consensus 111 a~~~~------g~~wfQ-LY~~~r~~~~~li~RA~~aG~~alvlTvD~--pv~G-~Rerd~rng--f------------- 165 (361) T cd04736 111 ARQAD------GDLWFQ-LYVVHRELAELLVKRALAAGYTTLVLTTDV--AVNG-YRERDLRNG--F------------- 165 (361) T ss_pred HHHCC------CCEEEE-EEECCHHHHHHHHHHHHHCCCCEEEEECCC--CCCC-CCCHHHHCC--C------------- T ss_conf 86259------984799-887287999999999998599868995078--8878-883543225--6------------- Q ss_pred CCCCCCCHHHHHHHH---------HHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHH Q ss_conf 257303589999999---------86553306457479978688980210001017422326568765556651370156 Q gi|254780889|r 165 TVKKTVNLENAKALL---------HQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIA 235 (493) Q Consensus 165 tv~~~~~l~eA~~iM---------~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~ 235 (493) ++|...+...+.+.+ .++. .|-. . .+. ..+.. + .. T Consensus 166 ~~P~~~~~~~~~~~~~~P~w~~~~~~~g---~p~~------------~------~~~--~~~~~---------~----~~ 209 (361) T cd04736 166 AIPFRYTPRVLLDGILHPRWLLRFLRNG---MPQL------------A------NFA--SDDAI---------D----VE 209 (361) T ss_pred CCCCCCCHHHHHHHCCCCHHHHHHHHCC---CCCC------------C------CCC--CCCCC---------C----CH T ss_conf 7886556778877515938899765027---7310------------2------346--77777---------7----05 Q ss_pred HHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHC Q ss_conf 78999873136669982484113889999999998579872451464699999999854875899610278766531000 Q gi|254780889|r 236 DRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVT 315 (493) Q Consensus 236 eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~ 315 (493) ..+. ..... +|-+- --+-++|||+..+ .++|.=.|-|+|-|+..++.|+|+|-|-=-+|- +. T Consensus 210 ~~~~-~~~~~-----~~~~~-----tW~di~wlr~~w~-~plilKGI~~~eDA~~A~~~G~dgIiVSNHGGR------QL 271 (361) T cd04736 210 VQAA-LMSRQ-----MDASF-----NWQDLRWLRDLWP-HKLLVKGIVTAEDAKRCIELGADGVILSNHGGR------QL 271 (361) T ss_pred HHHH-HHHHC-----CCCCC-----CHHHHHHHHHHCC-CCEEEECCCCHHHHHHHHHCCCCEEEECCCCCC------CC T ss_conf 7889-98843-----68899-----9999999998669-974552148999999998769999997588633------35 Q ss_pred CCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCH Q ss_conf 15743100378899986226981997265437776213367148789932301012167764487368058887332006 Q gi|254780889|r 316 GVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSV 395 (493) Q Consensus 316 g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~ 395 (493) -...+.+.++-+++++. +.+|+-|||||..-||+||||+||++||+|..|. ||++- T Consensus 272 D~a~~~id~Lp~I~~av---~~~V~~DgGIRrG~DV~KALALGA~aV~iGRp~l--------------------ygLaa- 327 (361) T cd04736 272 DDAIAPIEALAEIVAAT---YKPVLIDSGIRRGSDIVKALALGANAVLLGRATL--------------------YGLAA- 327 (361) T ss_pred CCCCCHHHHHHHHHHHH---CCEEEEECCCCCHHHHHHHHHCCCCEEEECHHHH--------------------HHHHH- T ss_conf 77741477799999971---9949994898878999999977999898778999--------------------98771- Q ss_pred HHHHHCCHHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 8875221222101233343310045402662105758998999999999855515758776897 Q gi|254780889|r 396 AAMERGSSARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQ 459 (493) Q Consensus 396 ~A~~~~~~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~ 459 (493) .|.+| |+.++.-|..-|+.+|.-+|+.+|.|+. T Consensus 328 ------------------------~G~~G-------V~~~l~iL~~El~~~M~l~G~~sv~el~ 360 (361) T cd04736 328 ------------------------RGEAG-------VSEVLRLLKEEIDRTLALIGCPDIASLT 360 (361) T ss_pred ------------------------CCHHH-------HHHHHHHHHHHHHHHHHHHCCCCHHHCC T ss_conf ------------------------10999-------9999999999999999985899867769 No 90 >cd04590 CBS_pair_CorC_HlyC_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the CorC_HlyC domain. CorC_HlyC is a transporter associated domain. This small domain is found in Na+/H+ antiporters, in proteins involved in magnesium and cobalt efflux, and in association with some proteins of unknown function. The function of the CorC_HlyC domain is uncertain but it might be involved in modulating transport of ion substrates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, Probab=99.61 E-value=3.7e-15 Score=128.17 Aligned_cols=107 Identities=24% Similarity=0.406 Sum_probs=93.9 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCC--CC-CCCCCCCCCCCCCCCCCCCCHHHHH Q ss_conf 28789946543034565667605041678668860278872354500143--44-6724400267863257303589999 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFAS--NA-QQAVGELMTRNLITVKKTVNLENAK 176 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~--~~-~~~V~eiMt~~litv~~~~~l~eA~ 176 (493) ++++++.+++|+.++.+++.+++|+++||++++.++++|+++.+|+.... +. ...+.++| ++++++++++++.+++ T Consensus 1 ~~v~~i~~~~s~~e~~~~~~~~~~sr~PV~~~~~~~ivGil~~kDll~~~~~~~~~~~~~~~~-~~~~~v~~~~~l~~~l 79 (111) T cd04590 1 TDIVALDADDTLEEILELIAESGHSRFPVYDGDLDNIIGVVHVKDLLRALAEGEEDLDLRDLL-RPPLFVPESTPLDDLL 79 (111) T ss_pred CCEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEEHHHHHHHHCCCCCCCHHHHC-CCCEECCHHHHHHHHH T ss_conf 968999699999999999997799779999547887789999114456665278764077640-4422225566588999 Q ss_pred HHHHHCCCCEEEEECCCCCEEEEEEECCCHH Q ss_conf 9998655330645747997868898021000 Q gi|254780889|r 177 ALLHQHRIEKLLVVDDDGCCIGLITVKDIER 207 (493) Q Consensus 177 ~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk 207 (493) +.|.+++.+.++|+|+.|+++||||..||+. T Consensus 80 ~~~~~~~~~~a~VvDe~G~~~GivT~~DilE 110 (111) T cd04590 80 EEMRKERSHMAIVVDEYGGTAGLVTLEDILE 110 (111) T ss_pred HHHHHCCCEEEEEECCCCCEEEEEEHHHHHC T ss_conf 8435549679999969999999998048537 No 91 >cd04591 CBS_pair_EriC_assoc_euk_bac This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes and bacteria. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS Probab=99.60 E-value=2.9e-15 Score=128.93 Aligned_cols=102 Identities=29% Similarity=0.430 Sum_probs=92.9 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCC--CEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 287899465430345656676050416786688--602788723545001434467244002678632573035899999 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESD--VGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNLENAKA 177 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~--~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l~eA~~ 177 (493) .+++++.++.|+.++.+++.+++++++||+++. .++++|+++.+|+.+. +..+|++.|+++++++++++|.+ T Consensus 2 ~~vv~~~~~~tv~~~~~ll~~~~~~g~PVv~~~~~~~~lvG~i~~~dl~~~------l~~~m~~~p~tv~~~~~l~~~~~ 75 (105) T cd04591 2 PLVVLLPEGMTVEDLESLLSTTSHNGFPVVDSTEESPRLVGYILRSQLVVA------LKNYIDPSPFTVSPRTSLEKVHQ 75 (105) T ss_pred CCEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCEEEEEEEHHHHHHH------HHHHCCCCCEEECCCCCHHHHHH T ss_conf 968993899899999999997799758588368878889999999999999------99758799839899995999999 Q ss_pred HHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 9986553306457479978688980210001 Q gi|254780889|r 178 LLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 178 iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +|.+.+++++||+| +|+|+|+||++|++|+ T Consensus 76 lf~~~g~r~l~Vv~-~g~lvGiITr~Dl~~a 105 (105) T cd04591 76 LFRKLGLRHLLVVD-EGRLVGIITRKDLLKA 105 (105) T ss_pred HHHHHCCCEEEEEE-CCEEEEEEEHHHHHCC T ss_conf 99994995889957-9999999998995349 No 92 >COG3448 CBS-domain-containing membrane protein [Signal transduction mechanisms] Probab=99.59 E-value=3.8e-15 Score=128.07 Aligned_cols=149 Identities=22% Similarity=0.314 Sum_probs=117.1 Q ss_pred CCHHHHHHHHHCCCEEEECCCCCHHHHHHHHH--H-HHHH-----CCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEE Q ss_conf 68889999997798799978989899999999--8-7422-----38815287899465430345656676050416786 Q gi|254780889|r 58 TDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVH--Q-VKKF-----ESGMVVNPVTISPYATLADALALMKKYSISGIPVV 129 (493) Q Consensus 58 Te~~MAiamA~~GGiGvIHrn~sie~qa~~V~--k-VKr~-----e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVV 129 (493) +..+.-.++.++|-.==|-|+- +|.-..++. . ..+. .+.|-+|++++++++++.+|.+++.+|++..+||+ T Consensus 204 s~~Dld~aL~~~~E~lDIdrdd-Le~llr~~elqa~~R~~~~LtcadIMSrdVvtv~~~ts~dhA~~ll~~H~ikaLPV~ 282 (382) T COG3448 204 SSEDLDAALQRLGETLDIDRDD-LERLLRETELQALRRRMGELTCADIMSRDVVTVSTDTSIDHARKLLQEHRIKALPVL 282 (382) T ss_pred CHHHHHHHHHHCCCEECCCHHH-HHHHHHHHHHHHHHHHHCCCCHHHHCCCCCEECCCCCCHHHHHHHHHHCCCCCCCCC T ss_conf 8788999999628501578899-999999999999999856430888637453501776775799999997686504532 Q ss_pred CCCCEEEEEEEEHHHHCCC----------CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEE Q ss_conf 6886027887235450014----------344672440026786325730358999999986553306457479978688 Q gi|254780889|r 130 ESDVGKLVGILTNRDVRFA----------SNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGL 199 (493) Q Consensus 130 D~~~gkLiGIVT~rDir~~----------~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGI 199 (493) |++ .+|+||||.+|+... .-....|+.+|+.+..|+.++++..|+.-++.+.+.+++||+|++|+++|| T Consensus 283 d~~-~rl~GiVt~~dl~~~a~~~p~qrlr~~~~~~vk~imt~~v~tv~pdtpa~~lvp~lad~g~H~lpvld~~g~lvGI 361 (382) T COG3448 283 DEH-RRLVGIVTQRDLLKHARPSPFQRLRFLRPPTVKGIMTTPVVTVRPDTPAVELVPRLADEGLHALPVLDAAGKLVGI 361 (382) T ss_pred CCC-CCEEEEEEHHHHHHCCCCCHHHHHHCCCCCCCCCCCCCCCEEECCCCCHHHHHHHHHCCCCCEEEEECCCCCEEEE T ss_conf 666-6100324488776016856677753159986344345763353589708888887613773136688688838888 Q ss_pred EEECCCHHC Q ss_conf 980210001 Q gi|254780889|r 200 ITVKDIERS 208 (493) Q Consensus 200 IT~~DIlk~ 208 (493) ||.+|++-. T Consensus 362 vsQtDliaa 370 (382) T COG3448 362 VSQTDLIAA 370 (382) T ss_pred EEHHHHHHH T ss_conf 627889999 No 93 >PRK11197 lldD L-lactate dehydrogenase; Provisional Probab=99.59 E-value=4.7e-13 Score=113.04 Aligned_cols=323 Identities=19% Similarity=0.219 Sum_probs=191.1 Q ss_pred CCCHHEEECCCC-CCCCCCCEEEEEEECCCCEECCCEEEECCCC------CCCHHHHHHHHHCCCEEEEC--CCCCHHHH Q ss_conf 460005752764-3247542253343038816003178337852------36888999999779879997--89898999 Q gi|254780889|r 14 LTFDDVLLRPEF-SNVLPRDIDISTRIAKDFTLNLPIMSAAMDQ------VTDSRLAIAMAQAGGLGVIH--RNFSPSEQ 84 (493) Q Consensus 14 lTfDDVllvP~~-s~v~~~~vdl~t~lt~~i~l~iPivSs~MDT------VTe~~MAiamA~~GGiGvIH--rn~sie~q 84 (493) -.|+.+.|+|.. .++ .++|++|.+= +-++..||.-|||-- =-|..+|-+=++.|=.=++- -+.|+|+- T Consensus 40 ~af~~~~l~PRvL~dv--s~~dtst~ll-G~~~~~P~~iaP~g~~~l~hp~gE~a~ArAA~~~gi~~~lSt~ss~slEev 116 (381) T PRK11197 40 EDLADIALRQRVLKNM--SDLSLETELF-GETLAMPVALAPVGLTGMYARRGEVQAARAADAKGIPFTLSTVSVCPIEEV 116 (381) T ss_pred HHHHHCCEECCCCCCC--CCCCCCEEEC-CCCCCCCEEECHHHHHCCCCCCHHHHHHHHHHHHCCCEEECCCCCCCHHHH T ss_conf 9998461755014687--7788863478-830677734675777416789757999999997077178327765679999 Q ss_pred HHHHHHHHHHCCCEECCCEEEECCCC-HHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 99999874223881528789946543-03456566760504167866886027887235450014344672440026786 Q gi|254780889|r 85 VAQVHQVKKFESGMVVNPVTISPYAT-LADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNL 163 (493) Q Consensus 85 a~~V~kVKr~e~gmI~dPVti~pd~T-I~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~l 163 (493) ++.. ...... -+.+..|.. ..+.++..++-++..+.|+-|- -..| -.++|+|+. T Consensus 117 a~a~------~~~~Wf-QLY~~~Dr~~~~~ll~RA~~aG~~alvlTVD~--pv~g-~R~rd~rn~--------------- 171 (381) T PRK11197 117 APAI------KRPMWF-QLYVLRDRGFMRNALERAKAAGCSTLVFTVDM--PVPG-ARYRDAHSG--------------- 171 (381) T ss_pred HHHC------CCCEEE-EEEECCCHHHHHHHHHHHHHCCCCEEEEECCC--CCCC-CCHHHHCCC--------------- T ss_conf 8635------897389-98413888999999999998499879980788--8778-665543067--------------- Q ss_pred CCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHH Q ss_conf 32573035899999998655330645747997868898021000101742232656876555665137015678999873 Q gi|254780889|r 164 ITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFD 243 (493) Q Consensus 164 itv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~Lve 243 (493) ++.++. ++..+.+.+.+-+-. +-+ .-.++. ..+.+.. . ..+...+. .+..... . T Consensus 172 ~~~p~~-~~~~~~~~~~~P~w~-~~~-~~~g~p------------~~~~~~~--~----~~~~~~~~-~~~~~~~----~ 225 (381) T PRK11197 172 MSGPNA-AMRRYLQAVTHPQWA-WDV-GLNGRP------------HDLGNIS--A----YLGKPTGL-EDYIGWL----G 225 (381) T ss_pred CCCCCC-HHHHHHHHHCCCHHH-HHH-HCCCCC------------CCCCCCC--C----CCCCCCCH-HHHHHHH----H T ss_conf 778981-287899886481787-763-344788------------6544310--0----13776558-8899998----7 Q ss_pred CCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCH Q ss_conf 13666998248411388999999999857987245146469999999985487589961027876653100015743100 Q gi|254780889|r 244 VNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLS 323 (493) Q Consensus 244 aGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~ 323 (493) ... |-+ ..-+-++|||+..+ .+++.=.|-++|-|+...+.|+|+|-|-=-+|--. -...+.+. T Consensus 226 ~~~-----d~~-----ltW~di~wlr~~w~-~plvlKGIl~~eDA~~A~~~G~dgIiVSNHGGRQL------D~apa~i~ 288 (381) T PRK11197 226 NNF-----DPS-----ISWKDLEWIREFWD-GPMVIKGILDPEDARDAVRFGADGIVVSNHGGRQL------DGVLSSAR 288 (381) T ss_pred HCC-----CCC-----CCHHHHHHHHHHCC-CCEEEECCCCHHHHHHHHHCCCCEEEEECCCCCCC------CCCCCHHH T ss_conf 505-----888-----99999999998729-97678525889999999966998899957763215------67844899 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCH Q ss_conf 37889998622698199726543777621336714878993230101216776448736805888733200688752212 Q gi|254780889|r 324 AIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSS 403 (493) Q Consensus 324 av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~ 403 (493) ++-+++++... .++|+-|||||..-||+||||+||++||+|..|. ||.+- T Consensus 289 ~LpeI~~aV~~-~~~V~~DgGiRrG~DV~KALALGA~aV~vGRp~l--------------------ygLaa--------- 338 (381) T PRK11197 289 ALPAIADAVKG-DIAILADSGIRNGLDVVRMIALGADTVLLGRAFV--------------------YALAA--------- 338 (381) T ss_pred HHHHHHHHHCC-CCEEEEECCCCCHHHHHHHHHCCCCEEEECHHHH--------------------HHHHH--------- T ss_conf 99999998678-9739996897866899999976998897675999--------------------98771--------- Q ss_pred HHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 221012333433100454026621057589989999999998555157587768973 Q gi|254780889|r 404 ARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQK 460 (493) Q Consensus 404 ~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~ 460 (493) -|.+| ++.++.-|..-|+.+|.-+|+.+|.|+.. T Consensus 339 ----------------~G~~G-------V~~~l~iL~~El~~~M~l~G~~~i~~l~~ 372 (381) T PRK11197 339 ----------------AGQAG-------VANLLDLIEKEMRVAMTLTGAKSISEITR 372 (381) T ss_pred ----------------CCHHH-------HHHHHHHHHHHHHHHHHHHCCCCHHHCCH T ss_conf ----------------33889-------99999999999999999858999678799 No 94 >cd04608 CBS_pair_PALP_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the pyridoxal-phosphate (PALP) dependent enzyme domain upstream. The vitamin B6 complex comprises pyridoxine, pyridoxal, and pyridoxamine, as well as the 5'-phosphate esters of pyridoxal (PALP) and pyridoxamine, the last two being the biologically active coenzyme derivatives. The members of the PALP family are principally involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and other amine-containing compounds. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a poten Probab=99.59 E-value=4.4e-15 Score=127.62 Aligned_cols=108 Identities=21% Similarity=0.339 Sum_probs=93.3 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC-----CCCCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 28789946543034565667605041678668860278872354500-----1434467244002678632573035899 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR-----FASNAQQAVGELMTRNLITVKKTVNLEN 174 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir-----~~~~~~~~V~eiMt~~litv~~~~~l~e 174 (493) ..|+|++|++|+++|+++|.+++|+.+||+|++ |+++|+||++|+. .....+.+|+++|.+++.++..++++.. T Consensus 2 ~~~vtv~p~~tv~~a~~~m~~~~i~~lPVvd~~-g~~~G~vt~~~ll~~~~~~~~~~~~~V~~vm~~~~~~v~~~~~l~~ 80 (124) T cd04608 2 KAPVTVLPTVTCAEAIEILKEKGFDQLPVVDES-GKILGMVTLGNLLSSLSSGKVQPSDPVSKALYKQFKRVNKNDTLGK 80 (124) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CCEEEEEEHHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCCCCCHHH T ss_conf 999798998909999999998499856898799-9688888799999999738999998289974465651479985567 Q ss_pred HH---------HHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 99---------99986553306457479978688980210001 Q gi|254780889|r 175 AK---------ALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 175 A~---------~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +. -.+.+++...+++++++|+++||+|+.|++.. T Consensus 81 ls~~l~~~~~~lvv~~~~~~~~~v~~~~~k~vGIvTr~DLL~y 123 (124) T cd04608 81 LSRILETDAFLLVFFEQISSAAIGKEKQEKPIGIVTKIDLLSY 123 (124) T ss_pred HHCCCCCCHHHHHHHHHHCCCCCCCCCCCEEEEEEEHHHHHHH T ss_conf 3103673278999998602567627779989999867997703 No 95 >PRK01862 putative voltage-gated ClC-type chloride channel ClcB; Provisional Probab=99.58 E-value=1.7e-14 Score=123.40 Aligned_cols=113 Identities=14% Similarity=0.239 Sum_probs=99.6 Q ss_pred CCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCC---C--CCCCCCCCCCCCCCCCCCCC Q ss_conf 388152878994654303456566760504167866886027887235450014---3--44672440026786325730 Q gi|254780889|r 95 ESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFA---S--NAQQAVGELMTRNLITVKKT 169 (493) Q Consensus 95 e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~---~--~~~~~V~eiMt~~litv~~~ 169 (493) ++.|-+|+.++++++|+.|+.+++.+.++..+||+|++ |+++|+|+.+|++.. + .....+.++|++++.++.++ T Consensus 459 ~dvm~~~~~~v~~~~~l~e~~~~~~~~~~~~~~Vvd~~-g~l~Giv~l~dl~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 537 (583) T PRK01862 459 RELIQPAQTVVPLTASVADMTRVFLEYPVKYLYVTDDD-GRFRGAVALKDITSDLLDKRDTTDKTAADYAHTPFPLLTPD 537 (583) T ss_pred HHHCCCCCCEECCCCCHHHHHHHHHHCCCEEEEEECCC-CEEEEEEEHHHHHHHHHCCCCCCCCCHHHHHCCCCCEECCC T ss_conf 99638788234999989999999984797079998899-97999978999887751635556460999817999578899 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCC--CCEEEEEEECCCHHC Q ss_conf 358999999986553306457479--978688980210001 Q gi|254780889|r 170 VNLENAKALLHQHRIEKLLVVDDD--GCCIGLITVKDIERS 208 (493) Q Consensus 170 ~~l~eA~~iM~~~ki~~LPVVDe~--g~LvGIIT~~DIlk~ 208 (493) +++.+|+++|.+...+++||||++ ++++|++|++|++++ T Consensus 538 ~sL~~al~~f~~~~~~~LPVVd~~~~~~lvGiit~~dil~a 578 (583) T PRK01862 538 MPLRDALEHFMAFQGERLPVVESEASPTLAGVVYKTSLLDA 578 (583) T ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEEHHHHHHH T ss_conf 98999999999669987789955999879999778999999 No 96 >PRK10892 D-arabinose 5-phosphate isomerase; Provisional Probab=99.58 E-value=7.6e-15 Score=125.92 Aligned_cols=147 Identities=22% Similarity=0.318 Sum_probs=109.1 Q ss_pred CCCCHHHHHHHHHCCCE-----EEECCCCCHHHHHHHHHHHHHHCCCEEC-C-CEEEECCCCHHHHHHHHHHHCCCCEEE Q ss_conf 23688899999977987-----9997898989999999987422388152-8-789946543034565667605041678 Q gi|254780889|r 56 QVTDSRLAIAMAQAGGL-----GVIHRNFSPSEQVAQVHQVKKFESGMVV-N-PVTISPYATLADALALMKKYSISGIPV 128 (493) Q Consensus 56 TVTe~~MAiamA~~GGi-----GvIHrn~sie~qa~~V~kVKr~e~gmI~-d-PVti~pd~TI~ea~~lm~~~~is~iPV 128 (493) -+-+..+|+++..+=|. +..|-.=++=.+. ..+.++.|.. + .-.+.+++++.|++..|.+.+++.+.| T Consensus 166 lalgDala~~lm~~rgF~~~dFa~~HPgG~LGk~L-----l~~V~dlM~~~~~lP~v~~~~~l~dai~~mt~k~lG~v~V 240 (326) T PRK10892 166 LVMGDALAVALLKARGFTAEDFALSHPGGALGRKL-----LLRVSDIMHTGDEIPHVKKTASLRDALLEITRKNLGMTVI 240 (326) T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHCCCCCHHHHHH-----HHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEE T ss_conf 99998999999998499986897609897777888-----8789998513765885377602999999874589349999 Q ss_pred ECCCCEEEEEEEEHHHHCCC-----CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEEC Q ss_conf 66886027887235450014-----3446724400267863257303589999999865533064574799786889802 Q gi|254780889|r 129 VESDVGKLVGILTNRDVRFA-----SNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVK 203 (493) Q Consensus 129 VD~~~gkLiGIVT~rDir~~-----~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~ 203 (493) +|++ ++|+||+|+.|+|-. +-.+.+++++||++|.++.+++...||+++|.+++|..|||+|+ ++++|++..+ T Consensus 241 vd~~-gkL~GIiTDGDLRR~l~~~~~i~~~~~~diMT~nP~tI~~d~la~eAL~iMe~~kIt~L~Vvd~-~k~vGIihih 318 (326) T PRK10892 241 CDDN-MMIEGIFTDGDLRRVFDMGVDLRQLSIADVMTPGGIRVRPGILAVDALNLMQSRHITSVLVADG-DQLLGVLHMH 318 (326) T ss_pred ECCC-CCEEEEEECHHHHHHHHHCCCCCCCCHHHHHCCCCCEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHH T ss_conf 8589-9689998626899998706883128799971899967899886999999998639818999889-9999997679 Q ss_pred CCHHCC Q ss_conf 100010 Q gi|254780889|r 204 DIERSQ 209 (493) Q Consensus 204 DIlk~~ 209 (493) |+++.- T Consensus 319 Dll~~G 324 (326) T PRK10892 319 DLLRAG 324 (326) T ss_pred HHHHCC T ss_conf 786768 No 97 >TIGR02151 IPP_isom_2 isopentenyl-diphosphate delta-isomerase, type 2; InterPro: IPR011179 This group represents the bacterial and archaeal isopentenyl-diphosphate delta-isomerase (IPP isomerase). IPP isomerase catalyses the interconversion of isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP), and is a key enzyme in the biosynthesis of isoprenoids via the mevalonate pathway. The bacterial and archaeal IPP isomerase (type 2 enzyme) differs from that found in eukaryotes (type 1 enzyme), and requires NADPH, magnesium, and FMN for activity , .; GO: 0004452 isopentenyl-diphosphate delta-isomerase activity, 0010181 FMN binding, 0008299 isoprenoid biosynthetic process, 0005737 cytoplasm. Probab=99.57 E-value=1.4e-13 Score=116.76 Aligned_cols=281 Identities=25% Similarity=0.348 Sum_probs=179.3 Q ss_pred CCCCCHHEEECCCC-CCCCCCCEEEEEEECCCCEECCCEEEECCCCCCCH-HHHHH-----HHHCCCEEEECCCCCHHHH Q ss_conf 64460005752764-32475422533430388160031783378523688-89999-----9977987999789898999 Q gi|254780889|r 12 VALTFDDVLLRPEF-SNVLPRDIDISTRIAKDFTLNLPIMSAAMDQVTDS-RLAIA-----MAQAGGLGVIHRNFSPSEQ 84 (493) Q Consensus 12 ~~lTfDDVllvP~~-s~v~~~~vdl~t~lt~~i~l~iPivSs~MDTVTe~-~MAia-----mA~~GGiGvIHrn~sie~q 84 (493) ....||||-|++.- .++.-.||||+|.|- +..++-||.=++|+==|+. .-.|. .|++=||++- T Consensus 22 ~~~~f~d~~liH~aLPe~~~~~idl~t~fl-G~~~~~P~~I~aMTGG~~~~a~~IN~~LA~aA~e~gi~mg--------- 91 (349) T TIGR02151 22 VSTGFDDIKLIHNALPEINLDEIDLTTEFL-GKRLKAPFLINAMTGGSEEKAGKINRKLARAAEELGIPMG--------- 91 (349) T ss_pred CCCCCCEEEECCCCCCCCCHHHCCCEEEEC-CCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCEE--------- T ss_conf 444541113434568767753626424446-8221167676145577367888998999999998198154--------- Q ss_pred HHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCC Q ss_conf 99999874223881528789946543034565667605041678668860278872354500143446724400267863 Q gi|254780889|r 85 VAQVHQVKKFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLI 164 (493) Q Consensus 85 a~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~li 164 (493) --++.-++.+| .+.+-.+..++..... T Consensus 92 -------vGSqraal~~P-------~~~~tF~~vR~~aP~~--------------------------------------- 118 (349) T TIGR02151 92 -------VGSQRAALKDP-------EVAETFEVVREEAPNG--------------------------------------- 118 (349) T ss_pred -------ECHHHHHHCCC-------HHHHHHHHHHHHCCCC--------------------------------------- T ss_conf -------30022211271-------2466699999767983--------------------------------------- Q ss_pred CCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEEC------CCHHHHHH Q ss_conf 257303589999999865533064574799786889802100010174223265687655566513------70156789 Q gi|254780889|r 165 TVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSV------AKDIADRV 238 (493) Q Consensus 165 tv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~------~~d~~eRa 238 (493) |++ |+ +||.--. ..+...|+ T Consensus 119 -----------------------~l~-----------------------AN--------~GA~q~~~~~~~~g~~~~~~a 144 (349) T TIGR02151 119 -----------------------PLI-----------------------AN--------IGAPQLVEGGKKYGVEEAQEA 144 (349) T ss_pred -----------------------CEE-----------------------EC--------CCHHHHHCCCCCCCHHHHHHH T ss_conf -----------------------378-----------------------71--------787887406534488999999 Q ss_pred HHHHHCCCCEEEE--ECCC------C---HHHHHHHHHHHHHHHCCCCCEEC---CCCCCHHHHHHHHHCCCCEEEEEEC Q ss_conf 9987313666998--2484------1---13889999999998579872451---4646999999998548758996102 Q gi|254780889|r 239 GPLFDVNVDLVVV--DTAH------G---HSQKVLDAVVQIKKNFPSLLVMA---GNIATAEGALALIDAGADIIKVGIG 304 (493) Q Consensus 239 ~~LveaGvDvivi--D~ah------G---h~~~~~~~i~~~k~~~~~~~ii~---GNv~t~~~~~~l~~~Gad~ikvGig 304 (493) -.+++ +|+|.| -++| | ++..+++.|+.+.+.+ ++|||+ |+=-+.+.++.|.++|+++|=|+=- T Consensus 145 id~i~--AdAL~iHlN~~QE~vqpEGDr~F~~G~l~~i~~~~~~~-~vPVIvKEvG~G~S~e~a~~L~~~Gv~aiDv~G~ 221 (349) T TIGR02151 145 IDMIE--ADALAIHLNVLQELVQPEGDRNFSKGWLEKIAEICSAV-SVPVIVKEVGFGISKEVAKLLADAGVKAIDVSGA 221 (349) T ss_pred HHHHH--HHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHC-CCCEEEEECCCCCCHHHHHHHHHCCEEEEEECCC T ss_conf 98751--01335543233025579997015653899999999652-8987998215799889999998789008870787 Q ss_pred CC--------------CC--C--CCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECC Q ss_conf 78--------------76--6--531000157431003788999862269819972654377762133671487899323 Q gi|254780889|r 305 PG--------------SI--C--TTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGS 366 (493) Q Consensus 305 ~G--------------s~--C--tTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~ 366 (493) +| +- - .-.--...|.|++.++.+|.... ..+.||||-||+|++=|++||||+||++| T Consensus 222 GGTswa~vE~~Rr~~~~~~~~~r~a~~f~~WGipT~~sL~~~~~~~-~~~~~~iASGG~r~GlD~AKAlALGA~~~---- 296 (349) T TIGR02151 222 GGTSWAAVENYRRAKESNQKYERLASAFSDWGIPTAASLLEVRSID-APDAPLIASGGLRTGLDVAKALALGADAV---- 296 (349) T ss_pred CCCCHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHH-CCCCCEEEECCCCCHHHHHHHHHHHHHHH---- T ss_conf 6755999998875157523578887777414886689999986421-24773688467778889999999621188---- Q ss_pred HHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHH Q ss_conf 01012167764487368058887332006887522122210123334331004540266210575899899999999985 Q gi|254780889|r 367 LLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSS 446 (493) Q Consensus 367 ~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~ 446 (493) ||| |-+.... ..+|+|. +..+|.++..-||.. T Consensus 297 ------------------------G~A-----------~~~L~~~------~~~g~e~-------~~~~~~~~~~eLk~~ 328 (349) T TIGR02151 297 ------------------------GMA-----------RPLLKAA------LDEGEEA-------VIEEIELIIEELKVA 328 (349) T ss_pred ------------------------HHH-----------HHHHHHH------HCCCHHH-------HHHHHHHHHHHHHHH T ss_conf ------------------------888-----------9999988------5269889-------999999999999999 Q ss_pred HHHCCCCCHHHHHHCCEEE Q ss_conf 5515758776897375999 Q gi|254780889|r 447 MGYVGASNIEEFQKKANFI 465 (493) Q Consensus 447 ~~y~G~~~i~e~~~~~~f~ 465 (493) |.-+||+||+||+..-.++ T Consensus 329 mfl~G~~~i~EL~~~~~v~ 347 (349) T TIGR02151 329 MFLTGAKNIKELKKVPLVI 347 (349) T ss_pred HHHHCCCCHHHHCCCCEEE T ss_conf 9871798879861787154 No 98 >cd03332 LMO_FMN L-Lactate 2-monooxygenase (LMO) FMN-binding domain. LMO is a FMN-containing enzyme that catalyzes the conversion of L-lactate and oxygen to acetate, carbon dioxide, and water. LMO is a member of the family of alpha-hydroxy acid oxidases. It is thought to be a homooctamer with two- and four- fold axes in the center of the octamer. Probab=99.56 E-value=7e-13 Score=111.82 Aligned_cols=312 Identities=20% Similarity=0.243 Sum_probs=190.2 Q ss_pred CCCHHEEECCCCC-CCCCCCEEEEEEECCCCEECCCEEEECCC------CCCCHHHHHHHHHCCCEEEEC--CCCCHHHH Q ss_conf 4600057527643-24754225334303881600317833785------236888999999779879997--89898999 Q gi|254780889|r 14 LTFDDVLLRPEFS-NVLPRDIDISTRIAKDFTLNLPIMSAAMD------QVTDSRLAIAMAQAGGLGVIH--RNFSPSEQ 84 (493) Q Consensus 14 lTfDDVllvP~~s-~v~~~~vdl~t~lt~~i~l~iPivSs~MD------TVTe~~MAiamA~~GGiGvIH--rn~sie~q 84 (493) ..|+.+.|+|..- ++ .++|++|.+= +.++..|+.-|||- .=-|..+|-+-++.|=.=++- -++|+|+- T Consensus 55 ~af~~~~l~PRvL~dv--~~~d~~t~ll-G~~~~~P~~iaP~g~~~l~hp~gE~~~AraA~~~g~~~~lSt~ss~slEev 131 (383) T cd03332 55 DAFSRWRIVPRMLRGV--TERDLSVELF-GRTLAAPLLLAPIGVQELFHPDAELATARAAAELGVPYILSTASSSSIEDV 131 (383) T ss_pred HHHHHCCEECCCCCCC--CCCCCCEEEC-CEECCCCEEECHHHHHHCCCCCHHHHHHHHHHHHCCCEECCCCCCCCHHHH T ss_conf 9998557767113588--8888864579-815677738877877441489778999999998358622057767889999 Q ss_pred HHHHHHHHHHCCC-EECCCEEEECCCCH-HHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCC---CCCCCCC Q ss_conf 9999987422388-15287899465430-34565667605041678668860278872354500143446---7244002 Q gi|254780889|r 85 VAQVHQVKKFESG-MVVNPVTISPYATL-ADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQ---QAVGELM 159 (493) Q Consensus 85 a~~V~kVKr~e~g-mI~dPVti~pd~TI-~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~---~~V~eiM 159 (493) ++.. .++ ... -..+..|..+ .+.++..++.++..+.++-|- -..| -.+||+|+.--+. ..+.+.. T Consensus 132 a~~~------~~~~~wf-QLY~~~Dr~~~~~ll~RA~~aG~~aLvlTVD~--Pv~G-~Rerd~r~g~~P~~~~~~~~~~~ 201 (383) T cd03332 132 AAAA------GDAPRWF-QLYWPKDDDLTESLLRRAEKAGYRVLVVTLDT--WSLG-WRPRDLDLGYLPFLRGIGIANYF 201 (383) T ss_pred HHHC------CCCCEEE-EEECCCCHHHHHHHHHHHHHCCCCEEEEECCC--CCCC-CCHHHHHCCCCCCCCCHHHHHHC T ss_conf 9866------8996399-99515888999999999997389779992268--6668-76545532688643303677754 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHH Q ss_conf 67863257303589999999865533064574799786889802100010174223265687655566513701567899 Q gi|254780889|r 160 TRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVG 239 (493) Q Consensus 160 t~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~ 239 (493) +++ . -...+.. ...+.... . ........+. T Consensus 202 ~~p---------~--~~~~~~~--------------------------~~~~~~~~---~---------~~~~~~~~~~- 231 (383) T cd03332 202 SDP---------V--FRKKLAE--------------------------PVGEDPEA---P---------PPMEAAVARF- 231 (383) T ss_pred CCH---------H--HHHHHCC--------------------------CCCCCCCC---C---------CCCHHHHHHH- T ss_conf 788---------9--9997325--------------------------67765456---7---------7501459999- Q ss_pred HHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCC Q ss_conf 98731366699824841138899999999985798724514646999999998548758996102787665310001574 Q gi|254780889|r 240 PLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGC 319 (493) Q Consensus 240 ~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~ 319 (493) ... ..|-+. .-+-++|+|+..+ .++|.=.|-++|-|+...+.|||+|-|-=-+|- +.-... T Consensus 232 --~~~-----~~d~~l-----tW~di~wlr~~w~-~plilKGI~~~eDA~~A~~~G~dgIiVSNHGGR------QLD~ap 292 (383) T cd03332 232 --VSV-----FSGPSL-----TWEDLAFLREWTD-LPIVLKGILHPDDARRAVEAGVDGVVVSNHGGR------QVDGSI 292 (383) T ss_pred --HHH-----CCCCCC-----CHHHHHHHHHHCC-CCEEECCCCCHHHHHHHHHCCCCEEEEECCCCC------CCCCCC T ss_conf --985-----378889-----9899999998769-985323568999999999759988998078634------467883 Q ss_pred CCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHH Q ss_conf 31003788999862269819972654377762133671487899323010121677644873680588873320068875 Q gi|254780889|r 320 PQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAME 399 (493) Q Consensus 320 pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~ 399 (493) +.+.++-+++++... .++|+.|||||..-||+||||+||++||+|..|- ||.+- T Consensus 293 a~i~~LpeI~~aV~~-~~~V~~DgGIRrG~DV~KAlALGA~~V~iGRp~l--------------------~glaa----- 346 (383) T cd03332 293 AALDALPEIVEAVGD-RLTVLFDSGVRTGADIMKALALGAKAVLIGRPYA--------------------YGLAL----- 346 (383) T ss_pred CHHHHHHHHHHHHCC-CCEEEEECCCCCHHHHHHHHHCCCCEEEECHHHH--------------------HHHHH----- T ss_conf 278999999998479-9849997997867999999976999898778999--------------------98772----- Q ss_pred HCCHHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 2212221012333433100454026621057589989999999998555157587768973 Q gi|254780889|r 400 RGSSARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQK 460 (493) Q Consensus 400 ~~~~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~ 460 (493) .|-+| |+.++.-|..-|+.+|.-+|+.+|.|+.. T Consensus 347 --------------------~G~~G-------V~~~l~iL~~El~~~M~l~G~~si~el~~ 380 (383) T cd03332 347 --------------------GGEDG-------VEHVLRNLLAELDLTMGLAGIRSIAELTR 380 (383) T ss_pred --------------------CCHHH-------HHHHHHHHHHHHHHHHHHHCCCCHHHCCC T ss_conf --------------------31999-------99999999999999999858999778592 No 99 >cd02205 CBS_pair The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic generali Probab=99.50 E-value=1.2e-13 Score=117.36 Aligned_cols=108 Identities=46% Similarity=0.714 Sum_probs=96.4 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCC-----CCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 2878994654303456566760504167866886027887235450014-----34467244002678632573035899 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFA-----SNAQQAVGELMTRNLITVKKTVNLEN 174 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~-----~~~~~~V~eiMt~~litv~~~~~l~e 174 (493) .+++++.+++|+.+|+++|.+++++++||+|++ ++++|++|.+|+... .....++.++|.++++++.+++++.+ T Consensus 1 ~~~~~v~~~~~l~~a~~~m~~~~~~~~~Vvd~~-~~~~G~it~~dl~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~ 79 (113) T cd02205 1 RDVVTVSPDDTVAEALRLMLEHGISGLPVVDDD-GRLVGIVTERDLLRALAEGGLDPLVTVGDVMTRDVVTVSPDTSLEE 79 (113) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCC-CCEEEEEEHHHHHHHHHHCCCCCCCCEEEEEECCCEEEEECCHHHH T ss_conf 997796892989999999998398099999399-9599999889989887631574103304567505468973211779 Q ss_pred HHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 9999986553306457479978688980210001 Q gi|254780889|r 175 AKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 175 A~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) ++.+|.+++..++||+|++++++|++|..|+++. T Consensus 80 ~~~~~~~~~~~~~~Vvd~~~~~iG~vs~~di~~~ 113 (113) T cd02205 80 AAELMLEHGIRRLPVVDDEGRLVGIVTRSDILRA 113 (113) T ss_pred HHHHHHHCCCEEEEEEECCCEEEEEEEHHHHHCC T ss_conf 9999998497299999669979999988997469 No 100 >COG1304 idi Isopentenyl diphosphate isomerase (BS_ypgA, MTH48 and related proteins) [Coenzyme transport and metabolism] Probab=99.48 E-value=4.3e-12 Score=106.16 Aligned_cols=143 Identities=29% Similarity=0.348 Sum_probs=115.1 Q ss_pred HHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEE Q ss_conf 99999999985798724514646999999998548758996102787665310001574310037889998622698199 Q gi|254780889|r 261 VLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIV 340 (493) Q Consensus 261 ~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~ii 340 (493) ..+.+.+|++... .+++.=.|.+++-+....+.|+|+|-|==..| .+.--|.|.+.++.+++++.... ++|| T Consensus 206 ~ked~~~i~~~~~-~~lv~kGV~~~~D~~~a~~tg~~~I~vsnhgg------rqlD~g~st~~~L~ei~~av~~~-~~vi 277 (360) T COG1304 206 SKEDGAGISKEWA-GPLVLKGILAPEDAAGAGGTGADGIEVSNHGG------RQLDWGISTADSLPEIVEAVGDR-IEVI 277 (360) T ss_pred CHHHHHHHHHHCC-CCHHHHCCCCHHHHHHHCCCCCEEEEEECCCC------CCCCCCCCHHHHHHHHHHHHCCC-EEEE T ss_conf 3777767877507-75877478978888763368822899976787------40257877699999999971887-1799 Q ss_pred EECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHHHHCCCCCCCCCCEECC Q ss_conf 72654377762133671487899323010121677644873680588873320068875221222101233343310045 Q gi|254780889|r 341 ADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSARYSQDGVTDVLKLVPE 420 (493) Q Consensus 341 adGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ry~~~~~~~~~k~v~e 420 (493) +|||||+.-|++||||+||++||+|..|--... .. T Consensus 278 ~dGGiR~G~Dv~KAlALGA~~v~igrp~L~~l~---------------------------------------------~~ 312 (360) T COG1304 278 ADGGIRSGLDVAKALALGADAVGIGRPFLYGLA---------------------------------------------AG 312 (360) T ss_pred ECCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHH---------------------------------------------HC T ss_conf 638878778999999937765452599999998---------------------------------------------55 Q ss_pred CCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCE Q ss_conf 4026621057589989999999998555157587768973759 Q gi|254780889|r 421 GIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQKKAN 463 (493) Q Consensus 421 Gv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~~~~ 463 (493) |.+| |...|..+...|+.+|.-+|++||+||+.... T Consensus 313 g~~G-------V~~~le~~~~El~~~M~L~G~~~i~el~~~~l 348 (360) T COG1304 313 GEAG-------VERVLEIIRKELKIAMALTGAKNIEELKRVPL 348 (360) T ss_pred CHHH-------HHHHHHHHHHHHHHHHHHCCCCCHHHHCCCCE T ss_conf 6878-------99999999999999997428881999655736 No 101 >TIGR03520 GldE gliding motility-associated protein GldE. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldC is a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. GldE was discovered because of its adjacency to GldD in F. johnsonii. Overexpression of GldE partially supresses the effects of a GldB point mutant suggesting that GldB and GldE interact. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Not all Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility and in fact some do not appear to express the gliding phenotype. Probab=99.43 E-value=2.2e-12 Score=108.27 Aligned_cols=144 Identities=17% Similarity=0.260 Sum_probs=116.3 Q ss_pred CCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHCC-----CEEC--CCEEEECCCCHHHHHHHHHHHCCCCEEEE Q ss_conf 3688899999977987999789898999999998742238-----8152--87899465430345656676050416786 Q gi|254780889|r 57 VTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQVKKFES-----GMVV--NPVTISPYATLADALALMKKYSISGIPVV 129 (493) Q Consensus 57 VTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~kVKr~e~-----gmI~--dPVti~pd~TI~ea~~lm~~~~is~iPVV 129 (493) ..|-+..+.++..+|+ + +++.+++.++-++.+ .|+. +.++++.++|+.++++.+.+.+||++||. T Consensus 159 ~eel~~~i~~~~~~g~--~------~eE~~~i~~vl~l~~~~v~eiMtPr~~v~~l~~~~~~~e~~~~~~~~~~SR~PV~ 230 (408) T TIGR03520 159 VDQLSQALELTDEEDT--T------KEEQKILQGIVSFGNTDTKQVMRPRLDIFALDIETSFSEIIPKIIENGYSRIPVY 230 (408) T ss_pred HHHHHHHHHHHHCCCC--C------HHHHHHHHHHHCCCCCEEHHCCCCHHHEEEEECCCHHHHHHHHHHHCCCCCCEEE T ss_conf 9999999985403487--7------8999999998275998903423545795884064438999999985798722266 Q ss_pred CCCCEEEEEEEEHHHHC-CCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 68860278872354500-14344672440026786325730358999999986553306457479978688980210001 Q gi|254780889|r 130 ESDVGKLVGILTNRDVR-FASNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 130 D~~~gkLiGIVT~rDir-~~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +++...++|++..+|+. .....+.++.++| +++..+|+..++.++++.|++++.+.+.|||+.|...|+||..||+.. T Consensus 231 ~~~~D~iiGiv~~kDll~~~~~~~~~~~~~~-r~~~~Vpe~~~~~~lL~~~~~~~~~~AiVvDEyG~~~GiVTlEDilEe 309 (408) T TIGR03520 231 KETIDNITGVLYIKDLLPHLNKKNFDWQSLL-REPYFVPENKKLDDLLRDFQEKKNHLAIVVDEYGGTSGLVTLEDIIEE 309 (408) T ss_pred CCCCCCEEEEEEHHHHHHHHHCCCCCHHHHH-HHCCCCCCCCCHHHHHHHHHHHCCEEEEEECCCCCEEEEEEHHHHHHH T ss_conf 5898761799779899878736999889986-321489997788999999986387489999899987899888998999 Q ss_pred C Q ss_conf 0 Q gi|254780889|r 209 Q 209 (493) Q Consensus 209 ~ 209 (493) . T Consensus 310 i 310 (408) T TIGR03520 310 I 310 (408) T ss_pred H T ss_conf 9 No 102 >COG0516 GuaB IMP dehydrogenase/GMP reductase [Nucleotide transport and metabolism] Probab=99.42 E-value=9.1e-14 Score=118.20 Aligned_cols=169 Identities=40% Similarity=0.647 Sum_probs=155.9 Q ss_pred HHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEE Q ss_conf 99999999857987245146469999999985487589961027876653100015743100378899986226981997 Q gi|254780889|r 262 LDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVA 341 (493) Q Consensus 262 ~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iia 341 (493) +..++++|..++. +.+|||.+.+++.+++.+| |.+|+|||+| .|+|+||+||+++|+..++.. +++++ T Consensus 2 ~~~~~~~k~~~~~--ltfddVll~p~~s~v~p~~-~~vkt~i~~~--------i~l~iP~vSA~MDtVtea~mA-I~ma~ 69 (170) T COG0516 2 LMQVLKIKKARPG--LTFDDVLLLPAASDVAPAG-VDVKTGLGPG--------IGVNIPQVSAAMDTVTEARMA-IAMAR 69 (170) T ss_pred HHHHHHHHHCCCC--EEECCEEECHHHHHCCCCC-CEEEECCCCC--------CCCCCCEEEHHHHHHHHHHHH-HHHHH T ss_conf 3476654422687--4574487533345403378-8267543599--------402855032267899989887-99998 Q ss_pred ECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHHHHCCCCCCCCCCEECCC Q ss_conf 26543777621336714878993230101216776448736805888733200688752212221012333433100454 Q gi|254780889|r 342 DGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSARYSQDGVTDVLKLVPEG 421 (493) Q Consensus 342 dGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ry~~~~~~~~~k~v~eG 421 (493) |||| | ||.|+||+ +|+|+++... | |||+|..+|. ||++ .+++++| T Consensus 70 ~GGI---G------------Vih~nm~~--~e~~~~v~~v-----~--~~~~~~~~~~-----~~~~------~~~~~~~ 114 (170) T COG0516 70 DGGI---G------------VMHGNMLA--EESPGEYLYQ-----K--RGMGSIDAMQ-----RYFS------SVLVAQG 114 (170) T ss_pred CCCE---E------------EEECCCCH--HHCCCCEEEE-----E--ECCCCCCCCC-----HHHH------HCCCEEC T ss_conf 6987---9------------99688577--5689706998-----3--0567665441-----2201------0353013 Q ss_pred CEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCEEEEECHHHHCCCCCCCEE Q ss_conf 026621057589989999999998555157587768973759999955341238886348 Q gi|254780889|r 422 IEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQKKANFIRVSVAGLRESHVHDVK 481 (493) Q Consensus 422 v~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~~~~f~~~t~ag~~E~~~Hdi~ 481 (493) +++.+ |+|++.+.+.++.++++++|+|+|+.++.+++.++.|++++.+ |+++|++. T Consensus 115 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 170 (170) T COG0516 115 VSGVV-DKGSIKKFIPYLYGGLQSSCQYIGCRSLTLLKENVRFEFRTAS---EGGVHNLH 170 (170) T ss_pred CCCCC-CCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHCCCCCEEEEECC---CCCCCCCC T ss_conf 04122-2333102345555667776554264012232004425788513---35666669 No 103 >cd04592 CBS_pair_EriC_assoc_euk This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually Probab=99.38 E-value=1.8e-12 Score=108.81 Aligned_cols=93 Identities=33% Similarity=0.386 Sum_probs=76.9 Q ss_pred CCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC-CCC---------------CCCCCCCCCCC---- Q ss_conf 8789946543034565667605041678668860278872354500-143---------------44672440026---- Q gi|254780889|r 101 NPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR-FAS---------------NAQQAVGELMT---- 160 (493) Q Consensus 101 dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir-~~~---------------~~~~~V~eiMt---- 160 (493) .-++++|++|+.||+++|.+++++++||||++ ++|+||||..||+ |.. ....++..+.+ T Consensus 2 ~~vtV~p~ttv~EA~~lM~~~~~~~~~VVD~~-~~L~GIvt~gDi~R~~~~~~~~~~~~~~~~~~~~~~~vs~v~t~~~~ 80 (133) T cd04592 2 KYIKVSPTTTLKEALNLMLDEKQSCVLVVDSD-DFLEGILTLGDIQRFLFTNKTTRVQPEDETKQTNTCLVSSVCTKGIS 80 (133) T ss_pred CEEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CCEEEEEEHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEEEEEEEE T ss_conf 45998999859999999998188657998389-97899978799999987623564563200012444323345642036 Q ss_pred -----CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCC Q ss_conf -----7863257303589999999865533064574799 Q gi|254780889|r 161 -----RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDG 194 (493) Q Consensus 161 -----~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g 194 (493) +.++|+.+++++.+|.++|.++++++||||++.+ T Consensus 81 ~~~~~~~~~tv~pd~~l~~a~~lM~~~~i~~LPVV~~~~ 119 (133) T cd04592 81 YGGQECGLWTCTPDTDLTTAKKLMEAKGVKQLPVVKRGV 119 (133) T ss_pred ECCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCEEECCC T ss_conf 255110038978999999999999986988587670776 No 104 >COG2070 Dioxygenases related to 2-nitropropane dioxygenase [General function prediction only] Probab=99.35 E-value=1.9e-11 Score=101.55 Aligned_cols=217 Identities=25% Similarity=0.370 Sum_probs=134.4 Q ss_pred EEEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHH-HHHHHHHHHCCCEECCCEEEECCCCHHH Q ss_conf 334303881600317833785236888999999779879997898989999-9999874223881528789946543034 Q gi|254780889|r 35 ISTRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQV-AQVHQVKKFESGMVVNPVTISPYATLAD 113 (493) Q Consensus 35 l~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa-~~V~kVKr~e~gmI~dPVti~pd~TI~e 113 (493) ++++++....+..||+-+||--|+..++|.+.++.||+|+|--|....++. +++++++-. ...|. T Consensus 3 ~~~~~~~~~~i~~PIiq~gM~~vs~~~LA~Avs~aGglG~ia~~~~~~e~l~~~i~~~~~~----~~~p~---------- 68 (336) T COG2070 3 LSTRFILLLGIKYPIIQGGMAGVSTPELAAAVSNAGGLGIIASGGLPAEQLRAEIRKIRAL----TDKPF---------- 68 (336) T ss_pred CCCHHHHCCCCCCCEECCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHH----HCCCC---------- T ss_conf 6202230247656655378723486999999982789774331677768899999999986----34842---------- Q ss_pred HHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCC Q ss_conf 56566760504167866886027887235450014344672440026786325730358999999986553306457479 Q gi|254780889|r 114 ALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDD 193 (493) Q Consensus 114 a~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~ 193 (493) ..+. |.... +.+ T Consensus 69 ---------------~~~~--------------f~~~~------------------------------------~~v--- 80 (336) T COG2070 69 ---------------VANN--------------FGSAP------------------------------------APV--- 80 (336) T ss_pred ---------------CCCC--------------CCCCC------------------------------------CCC--- T ss_conf ---------------0135--------------45555------------------------------------553--- Q ss_pred CCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHC-CCCEEEEECCCCH-HHHHHHHHHHHHHH Q ss_conf 978688980210001017422326568765556651370156789998731-3666998248411-38899999999985 Q gi|254780889|r 194 GCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDV-NVDLVVVDTAHGH-SQKVLDAVVQIKKN 271 (493) Q Consensus 194 g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~Lvea-GvDviviD~ahGh-~~~~~~~i~~~k~~ 271 (493) .+..+.-. .....+.+..+++. ++-++.. +-|. ....++.++ . T Consensus 81 -~~~~l~~~----------------------------~~~~~~~~~~ii~~~~vpvv~~--~~g~~~~~~i~~~~---~- 125 (336) T COG2070 81 -NVNILVAR----------------------------RNAAEAGVDAIIEGAGVPVVST--SFGAPPAEFVARLK---A- 125 (336) T ss_pred -CCEEEECC----------------------------CCCHHHHHHHHHHCCCCCEEEE--CCCCCCHHHHHHHH---H- T ss_conf -11035346----------------------------5534563122564289767971--58999588999999---7- Q ss_pred CCCCCEECCCCCCHHHHHHHHHCCCCEEE-EEE-CCCCCCCCCHHCCC-CCCCCHHHHHHHHHHHCCCCEEEEECCCCCH Q ss_conf 79872451464699999999854875899-610-27876653100015-7431003788999862269819972654377 Q gi|254780889|r 272 FPSLLVMAGNIATAEGALALIDAGADIIK-VGI-GPGSICTTRVVTGV-GCPQLSAIMSVVEVAERAGVAIVADGGIRFS 348 (493) Q Consensus 272 ~~~~~ii~GNv~t~~~~~~l~~~Gad~ik-vGi-g~Gs~CtTr~~~g~-g~pq~~av~~~~~~~~~~~~~iiadGGi~~~ 348 (493) + .+ .+.-+|.|...++...++|+|++- +|. .+|..+. .. .+..++.+-++++..+. +|+||-|||-.. T Consensus 126 ~-g~-~v~~~v~~~~~A~~~~~~G~d~vI~~g~eAGGH~g~-----~~~~~~t~~Lv~ev~~~~~~--iPViAAGGI~dg 196 (336) T COG2070 126 A-GI-KVIHSVITVREALKAERAGADAVIAQGAEAGGHRGG-----VDLEVSTFALVPEVVDAVDG--IPVIAAGGIADG 196 (336) T ss_pred C-CC-EEEEEECCHHHHHHHHHCCCCEEEECCCCCCCCCCC-----CCCCCCHHHHHHHHHHHHCC--CCEEEECCCCCH T ss_conf 4-98-589850889999999817998899437767786899-----88773188899999998548--978987686886 Q ss_pred HHHEEEEEECCCEEEECCHHHCCCCCCCC Q ss_conf 76213367148789932301012167764 Q gi|254780889|r 349 GDIAKAIAAGSACVMIGSLLAGTDESPGD 377 (493) Q Consensus 349 gdi~kAla~GA~~VM~G~~~agt~Espg~ 377 (493) .++..||++||+.|-||+.|..|.|++.. T Consensus 197 ~~i~AAlalGA~gVq~GT~Fl~t~Ea~a~ 225 (336) T COG2070 197 RGIAAALALGADGVQMGTRFLATKEADAS 225 (336) T ss_pred HHHHHHHHHCCHHHHHHHHHHCCCCCCCC T ss_conf 99999998441685541254214036888 No 105 >COG3620 Predicted transcriptional regulator with C-terminal CBS domains [Transcription] Probab=99.34 E-value=2.9e-12 Score=107.34 Aligned_cols=111 Identities=32% Similarity=0.469 Sum_probs=95.0 Q ss_pred HCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC--C----CCCCCCCCCCCCCCCCCCCC Q ss_conf 23881528789946543034565667605041678668860278872354500--1----43446724400267863257 Q gi|254780889|r 94 FESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR--F----ASNAQQAVGELMTRNLITVK 167 (493) Q Consensus 94 ~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir--~----~~~~~~~V~eiMt~~litv~ 167 (493) ....|-++.+.++|++++.++.++|++++||.+||+++ ++++|-+|++|+- . ..-.+..|+++|...+.++. T Consensus 67 a~~iM~spvv~v~pdDsi~~vv~lM~~~g~SQlPVi~~--~k~VGsItE~~iv~~~le~~e~i~~~~vr~vM~e~fP~Vs 144 (187) T COG3620 67 AKTIMHSPVVSVSPDDSISDVVNLMRDKGISQLPVIEE--DKVVGSITENDIVRALLEGMESIRSLRVREVMGEPFPTVS 144 (187) T ss_pred HHHHCCCCEEEECCHHHHHHHHHHHHHCCCCCCCEEEC--CEEEEEECHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCC T ss_conf 76653587058772466999999999759752754108--8645331498999997304221211218988558887679 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 303589999999865533064574799786889802100010 Q gi|254780889|r 168 KTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 168 ~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) ++++++....++..|. .+.|+ ++|+++||||+.||+|.. T Consensus 145 ~~~~l~vI~~LL~~~~--AVlV~-e~G~~vGIITk~DI~k~~ 183 (187) T COG3620 145 PDESLNVISQLLEEHP--AVLVV-ENGKVVGIITKADIMKLL 183 (187) T ss_pred CCCCHHHHHHHHHHCC--EEEEE-ECCCEEEEEEHHHHHHHH T ss_conf 9998799999984198--38998-689368887599999987 No 106 >COG0517 FOG: CBS domain [General function prediction only] Probab=99.33 E-value=8.1e-12 Score=104.21 Aligned_cols=105 Identities=42% Similarity=0.663 Sum_probs=94.0 Q ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCC--CCCC--CCCCCCCCCCCCCCCCCCHHHH Q ss_conf 28789946543034565667605041678668860278872354500143--4467--2440026786325730358999 Q gi|254780889|r 100 VNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFAS--NAQQ--AVGELMTRNLITVKKTVNLENA 175 (493) Q Consensus 100 ~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~--~~~~--~V~eiMt~~litv~~~~~l~eA 175 (493) .+|+++.|+.++.++..+|.+++++.+||++.+ +++|++|.+|+...- .... ++.++|+++++++.++.++.++ T Consensus 7 ~~~~~v~~~~~~~~a~~~m~~~~~~~~~v~~~~--~l~Giit~~di~~~~~~~~~~~~~v~~i~~~~~~~~~~~~~~~~~ 84 (117) T COG0517 7 KDVITVKPDTSVRDALLLMSENGVSAVPVVDDG--KLVGIITERDILRALAAGGKRLLPVKEVMTKPVVTVDPDTPLEEA 84 (117) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEEECC--EEEEEEEHHHHHHHHHCCCCCCCHHHHHHCCCCEEECCCCCHHHH T ss_conf 688089899819999999987493389974399--999999989998766416655331788625884677799889999 Q ss_pred HHHHHH-CCCCEEEEECCCC-CEEEEEEECCCH Q ss_conf 999986-5533064574799-786889802100 Q gi|254780889|r 176 KALLHQ-HRIEKLLVVDDDG-CCIGLITVKDIE 206 (493) Q Consensus 176 ~~iM~~-~ki~~LPVVDe~g-~LvGIIT~~DIl 206 (493) .+.|.+ +++..+||+|+++ +++|++|++|++ T Consensus 85 ~~~m~~~~~~~~lpVv~~~~~~lvGivt~~di~ 117 (117) T COG0517 85 LELMVERHKIRRLPVVDDDGGKLVGIITLSDIL 117 (117) T ss_pred HHHHHHHCCCCEEEEEECCCCEEEEEEEHHHCC T ss_conf 999987267762569988988799898878749 No 107 >COG2239 MgtE Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism] Probab=99.28 E-value=2.4e-11 Score=100.78 Aligned_cols=201 Identities=26% Similarity=0.369 Sum_probs=132.7 Q ss_pred CCCHHHHHHHHHHHHHHCC-----CEECCCEEEECCCCHHHHHHHHHHHC-----CCCEEEECCCCEEEEEEEEHHHHCC Q ss_conf 9898999999998742238-----81528789946543034565667605-----0416786688602788723545001 Q gi|254780889|r 78 NFSPSEQVAQVHQVKKFES-----GMVVNPVTISPYATLADALALMKKYS-----ISGIPVVESDVGKLVGILTNRDVRF 147 (493) Q Consensus 78 n~sie~qa~~V~kVKr~e~-----gmI~dPVti~pd~TI~ea~~lm~~~~-----is~iPVVD~~~gkLiGIVT~rDir~ 147 (493) .|+. ++.+.++..-.|.. -|..+-+++.++.|+.+++..+++.+ +..+.|+|++ +||.|+++-|++.. T Consensus 114 ~l~~-~~r~~v~~~l~y~e~taG~~Mt~e~v~l~~~~Tv~~al~~ir~~~~~~e~~~~lyVvD~~-~~L~Gvvsl~~Ll~ 191 (451) T COG2239 114 LLDP-EERARVRQLLSYPEDTAGRIMTTEFVTLPEDVTVDEALDRIRERAEDAETIYYLYVVDEK-GKLLGVVSLRDLLT 191 (451) T ss_pred HCCH-HHHHHHHHHCCCCHHHHHCCCEEEEEEECCCCCHHHHHHHHHHHCCCCCCEEEEEEECCC-CCEEEEEEHHHHHC T ss_conf 1899-999999986589834420001302587206758999999999723675662379998776-64677756999864 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEE- Q ss_conf 4344672440026786325730358999999986553306457479978688980210001017422326568765556- Q gi|254780889|r 148 ASNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAA- 226 (493) Q Consensus 148 ~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgA- 226 (493) .+.+.+++++|.+++++++.+++.+++.+++.++++-.+||||++++|+|++|..|++...+- .++.|-.. -|+ T Consensus 192 -a~~~~~i~~im~~~~~~V~~~~dqeevA~~~~~ydl~a~PVVd~~~~LiG~itiDDiidvi~e-Ea~eDi~~---~~G~ 266 (451) T COG2239 192 -AEPDELLKDLMEDDVVSVLADDDQEEVARLFEKYDLLAVPVVDEDNRLIGIITIDDIIDVIEE-EATEDILR---MAGP 266 (451) T ss_pred -CCCHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHCCEECCEECCCCCEEEEEEHHHHHHHHHH-HHHHHHHH---HCCC T ss_conf -895768998724356243655787999999998287015357789846325549999999999-98999998---5399 Q ss_pred ------EEECCCHHHHHHHHH----HHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHH Q ss_conf ------651370156789998----731366699824841138899999999985798724514646999999 Q gi|254780889|r 227 ------AVSVAKDIADRVGPL----FDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGAL 289 (493) Q Consensus 227 ------AIg~~~d~~eRa~~L----veaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~ 289 (493) ..++..-.+.|+.-| +.+..-.-+| ++-+..++.+-.+.-..|-+-=++||+.|+-.+. T Consensus 267 ~~~~~~~~~v~~~~k~R~~WLlvll~~~~~ta~vi----~~Fe~~l~~~vaLa~fiPlv~g~gGN~GsQ~~tv 335 (451) T COG2239 267 LDVPDLFASVLKLVKKRIPWLLVLLVTATLTASVI----GLFEDTLEQLVALAAFIPLVAGMGGNAGTQAATV 335 (451) T ss_pred CCCCHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHEEECCCCCHHHHHHHH T ss_conf 88820006899987510799999999898899999----9999779999999988601006887699999999 No 108 >PRK08318 dihydropyrimidine dehydrogenase; Validated Probab=99.24 E-value=1.4e-08 Score=80.99 Aligned_cols=173 Identities=23% Similarity=0.253 Sum_probs=105.3 Q ss_pred CCCH-HHHHHHHHHHCCCCEEEEEC--CCC------------HHHHHHHHHHHHHHHCCCCCEEC---CCCCC-HHHHHH Q ss_conf 3701-56789998731366699824--841------------13889999999998579872451---46469-999999 Q gi|254780889|r 230 VAKD-IADRVGPLFDVNVDLVVVDT--AHG------------HSQKVLDAVVQIKKNFPSLLVMA---GNIAT-AEGALA 290 (493) Q Consensus 230 ~~~d-~~eRa~~LveaGvDviviD~--ahG------------h~~~~~~~i~~~k~~~~~~~ii~---GNv~t-~~~~~~ 290 (493) ..++ +.+-++.+-++|+|+|-+.. -|| .-+.+.+.++++|+.. ++||++ .|+.. .+-|+. T Consensus 110 ~~~e~w~~la~~~e~~GaDalELNiSCPn~~~~~~~G~~~gq~pe~v~~i~~~Vk~~~-~iPV~vKLsPnvtdi~~iA~a 188 (413) T PRK08318 110 CNEEEWKEIAPLVEETGADGIELNFGCPHGMSERGMGSAVGQVPELVEMVTRWVKRGS-RLPVIAKLTPNITDIREPARA 188 (413) T ss_pred CCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCHHHCCCHHHHHHHHHHHHHHC-CCCEEEEECCCCCCHHHHHHH T ss_conf 7889999999986651887799955567766666555110579999999999988506-885699828997528999999 Q ss_pred HHHCCCCEEEEEECCCC---CCC-CCH-------HCCCC-------CC-CCHHHHHHHHHHHCCCCEEEEECCCCCHHHH Q ss_conf 98548758996102787---665-310-------00157-------43-1003788999862269819972654377762 Q gi|254780889|r 291 LIDAGADIIKVGIGPGS---ICT-TRV-------VTGVG-------CP-QLSAIMSVVEVAERAGVAIVADGGIRFSGDI 351 (493) Q Consensus 291 l~~~Gad~ikvGig~Gs---~Ct-Tr~-------~~g~g-------~p-q~~av~~~~~~~~~~~~~iiadGGi~~~gdi 351 (493) +.++|||+|-.-=-=++ +=. |+. .+++| .| -+-.|+++++.....++|||+-|||.+..|+ T Consensus 189 a~~aGADgv~liNTi~~~~~iDid~~~~~p~i~~~~~~GGlSG~aikPiALr~V~~i~~~~~~~~ipIiG~GGI~s~~Da 268 (413) T PRK08318 189 AKRGGADAVSLINTINSITGVDLDRMIPMPIVNGKSSHGGYCGPAVKPIALNMVAEIARDPETRGLPISGIGGIETWRDA 268 (413) T ss_pred HHHCCCCEEEEEECCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCHHHH T ss_conf 99769988999814786553202235530210677776766645676999999999986346788377975685989999 Q ss_pred EEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHHHHCCCCCCCCCCEECCCCEEEEECCCC Q ss_conf 13367148789932301012167764487368058887332006887522122210123334331004540266210575 Q gi|254780889|r 352 AKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSARYSQDGVTDVLKLVPEGIEGRVPYKGP 431 (493) Q Consensus 352 ~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ry~~~~~~~~~k~v~eGv~~~v~~~G~ 431 (493) ++-|.|||++|.++|.+- ++|. T Consensus 269 ~e~ilaGAsaVQv~Ta~~-------------------~~G~--------------------------------------- 290 (413) T PRK08318 269 AEFILLGAGTVQVCTAAM-------------------QYGF--------------------------------------- 290 (413) T ss_pred HHHHHHCCCCEEEEEHHH-------------------HCCC--------------------------------------- T ss_conf 999982789216751014-------------------3384--------------------------------------- Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCE Q ss_conf 89989999999998555157587768973759 Q gi|254780889|r 432 IASVLHQMSGGLKSSMGYVGASNIEEFQKKAN 463 (493) Q Consensus 432 ~~~~~~~~~~glrs~~~y~G~~~i~e~~~~~~ 463 (493) +++..+..||+.=|.--|..+|+||+.++. T Consensus 291 --~ii~~i~~gL~~~m~~~G~~si~d~~G~a~ 320 (413) T PRK08318 291 --RIVEDMISGLSHYMDEKGFASLEDMVGLAV 320 (413) T ss_pred --HHHHHHHHHHHHHHHHCCCCCHHHHHCCCC T ss_conf --489999999999999809974888725265 No 109 >PRK11573 hypothetical protein; Provisional Probab=99.24 E-value=1.1e-10 Score=96.05 Aligned_cols=150 Identities=15% Similarity=0.180 Sum_probs=113.5 Q ss_pred CCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHCCC-----EEC--CCEEEECCCCHHHHHHHHHHHCCCC Q ss_conf 785236888999999779879997898989999999987422388-----152--8789946543034565667605041 Q gi|254780889|r 53 AMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQVKKFESG-----MVV--NPVTISPYATLADALALMKKYSISG 125 (493) Q Consensus 53 ~MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~kVKr~e~g-----mI~--dPVti~pd~TI~ea~~lm~~~~is~ 125 (493) .-+.+|+.+...-+....| .+ -+++.+++.++-++.+- |+. +.++++.++|+.++++.+.+.+||+ T Consensus 150 ~~~~~t~eEl~~lv~~~~~------~l-~~~e~~mi~~il~l~~~~v~eiMtPR~~i~~l~~~~~~~e~~~~~~~~~~SR 222 (413) T PRK11573 150 VSGSLSKEELRTIVHESRS------QI-SRRNQDMLLSVLDLEKVTVDDIMVPRNEIIGIDINDDWKSILRQLTHSPHGR 222 (413) T ss_pred CCCCCCHHHHHHHHHHCCC------CC-CHHHHHHHHHHHCCCCCEEHEEEEECCCEEEEECCCCHHHHHHHHHHCCCCE T ss_conf 4553689999976641026------67-8799999998725189890118620022689747899999999866338754 Q ss_pred EEEECCCCEEEEEEEEHHHHCC-C-CCCCCCCCCCCC--CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEE Q ss_conf 6786688602788723545001-4-344672440026--78632573035899999998655330645747997868898 Q gi|254780889|r 126 IPVVESDVGKLVGILTNRDVRF-A-SNAQQAVGELMT--RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLIT 201 (493) Q Consensus 126 iPVVD~~~gkLiGIVT~rDir~-~-~~~~~~V~eiMt--~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT 201 (493) +||.+++...++|++..+|+.. . ...+.....++. +++..+|+..++.++++.|.+++.+.+.|||+.|...|+|| T Consensus 223 ~PVy~~~~D~iiGiv~~kdl~~~~~~~~~~~~~~l~~~~~~~~fVpe~~~~~~lL~~~~~~~~h~AiVvDEyGg~~GiVT 302 (413) T PRK11573 223 IVLYRDSLDDAISMLRVREAWRLMTEKKEFTKETMLRAADEIYFVPEGTPLSTQLVKFQRNKKKVGLVVNEYGDIQGLVT 302 (413) T ss_pred EEEECCCCCCEEEEEEEHHHHHHHHCCCCCCHHHHHHHCCCCEEECCCCHHHHHHHHHHHHCCEEEEEECCCCCEEEEEE T ss_conf 25854788865999982577877642666887888751478779689890999999988428637999948998889954 Q ss_pred ECCCHHCC Q ss_conf 02100010 Q gi|254780889|r 202 VKDIERSQ 209 (493) Q Consensus 202 ~~DIlk~~ 209 (493) ..||+... T Consensus 303 lEDilEei 310 (413) T PRK11573 303 VEDILEEI 310 (413) T ss_pred HHHHHHHH T ss_conf 99999998 No 110 >TIGR01186 proV glycine betaine/L-proline transport ATP binding subunit; InterPro: IPR005892 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family comprises the glycine betaine/L-proline ATP binding subunit in bacteria and its equivalents in archaea. This transport system belong to the larger ATP-Binding Cassette (ABC) transporter superfamily. The characteristic feature of these transporters is the obligatory coupling of ATP hydrolysis to substrate translocation. Functionally, this transport system is involved in osmoregulation. Under conditions of stress, the organism recruits this transport system to accumulate glycine betaine and other solutes which offer osmo-protection. It has been demonstrated that glycine betaine uptake is accompanied by symport with sodium ions. The locus has been named variously as proU or opuA. A gene library from L. lactis functionally complements an Escherichia coli proU mutant. The complementing locus is similar to a opuA locus in Bacillus subtlis. This clarifies the differences in nomenclature.; GO: 0005524 ATP binding, 0015171 amino acid transmembrane transporter activity, 0006865 amino acid transport, 0016020 membrane. Probab=99.23 E-value=2.9e-11 Score=100.27 Aligned_cols=168 Identities=18% Similarity=0.279 Sum_probs=125.0 Q ss_pred CCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEEC------CCCCHHHHHHHHHH----------HHHHCCCEECCCEEE Q ss_conf 81600317833785236888999999779879997------89898999999998----------742238815287899 Q gi|254780889|r 42 DFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIH------RNFSPSEQVAQVHQ----------VKKFESGMVVNPVTI 105 (493) Q Consensus 42 ~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIH------rn~sie~qa~~V~k----------VKr~e~gmI~dPVti 105 (493) +..=.|=|||..||=----.==|++-+.|-|--|- +|=+=|-.-+-+++ +++-..-+-..||+. T Consensus 179 ~~~kTIvFitHDlDEA~rigDRIvilk~GeiVQvGTPdeIL~NPaneyVe~F~~~~dl~qv~~P~~~~i~~~~~~~~i~~ 258 (372) T TIGR01186 179 TLQKTIVFITHDLDEALRIGDRIVILKAGEIVQVGTPDEILRNPANEYVEEFIGKVDLSQVLSPDAERIAKRMNTVPITK 258 (372) T ss_pred HHCCEEEEEECCHHHHHHHHHHHHHEECCCEEEECCCHHHHCCCCHHHHHHHHCCCEEEEECCCCHHHHHHHCCCCCEEE T ss_conf 60980899941767887675132011068678842846874288067999873751157523888899996527732686 Q ss_pred ECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCC Q ss_conf 46543034565667605041678668860278872354500143446724400267863257303589999999865533 Q gi|254780889|r 106 SPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIE 185 (493) Q Consensus 106 ~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~ 185 (493) +|+.+.+.|++||++.+++++.|||.+ ++|.|+|.-.++.-.......|.++...++.|+..++.|.|.++...+..+. T Consensus 259 t~~~gp~~Al~Lm~~~~~~s~yvv~~~-~~l~G~v~~~~~~~a~~~~~~~~~~l~~~~~tV~~~t~L~e~~~~v~~~~~~ 337 (372) T TIGR01186 259 TADKGPRSALKLMRDERVDSLYVVDRQ-NKLVGVVDVESIKQARKKAQSLQDVLIDDILTVDEGTLLRELLRKVLKAGIK 337 (372) T ss_pred CCCCCHHHHHHHHHHCCCCEEEEEECC-CEEEEEEEHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCC T ss_conf 589887899999986597179999728-5488778588999888787789998620011105876048999998507995 Q ss_pred EEEEECCCCCEEEEEEECCCHHCCC Q ss_conf 0645747997868898021000101 Q gi|254780889|r 186 KLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 186 ~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) ++|||||+.+++||||+.-++.... T Consensus 338 YvpVVDE~~~~~GIv~r~~L~~~l~ 362 (372) T TIGR01186 338 YVPVVDEDQRLVGIVTRASLVDALY 362 (372) T ss_pred EEEEECCCCCEEEEEEHHHHHHHHH T ss_conf 6666614564586764778999871 No 111 >PRK07565 dihydroorotate dehydrogenase 2; Reviewed Probab=99.23 E-value=7.1e-09 Score=83.10 Aligned_cols=179 Identities=23% Similarity=0.247 Sum_probs=118.6 Q ss_pred CCHHHHHHHHHHHCCCCEEEEECCCCHH----------HHHHHHHHHHHHHCCCCCEEC---CCCCC-HHHHHHHHHCCC Q ss_conf 7015678999873136669982484113----------889999999998579872451---46469-999999985487 Q gi|254780889|r 231 AKDIADRVGPLFDVNVDLVVVDTAHGHS----------QKVLDAVVQIKKNFPSLLVMA---GNIAT-AEGALALIDAGA 296 (493) Q Consensus 231 ~~d~~eRa~~LveaGvDviviD~ahGh~----------~~~~~~i~~~k~~~~~~~ii~---GNv~t-~~~~~~l~~~Ga 296 (493) .+++.+-++.+-++|+|+|-+.+++-|. ....+.++++|+.. ++||++ .|+.. .+-++.+.++|| T Consensus 113 ~ee~~~~a~~~e~~gadaiElNis~~~~~~~~~~~~~~~~~~~iv~~V~~~~-~~Pv~vKLsPn~tdi~~iA~aa~~~Ga 191 (333) T PRK07565 113 AGGWVDYARQIEEAGADALELNIYYLPTDPDISGAEVEQRYLDILRAVKSAV-SIPVAVKLSPYFSNLANMAKRLDAAGA 191 (333) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHC-CCCEEEECCCCCCHHHHHHHHHHHCCC T ss_conf 8999999999976499889997667798865444650788999999998646-885687359982109999999997499 Q ss_pred CEEEE-EECCC-CCCC--CCHHCC--CCCCCCH--HHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHH Q ss_conf 58996-10278-7665--310001--5743100--378899986226981997265437776213367148789932301 Q gi|254780889|r 297 DIIKV-GIGPG-SICT--TRVVTG--VGCPQLS--AIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLL 368 (493) Q Consensus 297 d~ikv-Gig~G-s~Ct--Tr~~~g--~g~pq~~--av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~ 368 (493) |+|-+ .--+| .|.+ .+...+ .+-|... ++..++++....++|||.-|||.+..|+++-|.|||++|.+|+-+ T Consensus 192 dgv~~iNT~~~~~Id~e~~~~~~~~~lSgp~~~~~alr~v~~v~~~~~ipIiG~GGI~sg~DaiE~ilAGAsaVQv~Ta~ 271 (333) T PRK07565 192 DGLVLFNRFYQPDIDLETLEVVPGLVLSTPAELRLPLRWIAILSGRVGADLAATTGVHDAEDVIKMLLAGADVVMIASAL 271 (333) T ss_pred CEEEEECCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHHHHHHCCCCCCEEEECCCCCHHHHHHHHHCCCCHHEEEHHH T ss_conf 88998436665633155443736866677431207889999996046989888889598999999998098863362236 Q ss_pred HCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHH Q ss_conf 01216776448736805888733200688752212221012333433100454026621057589989999999998555 Q gi|254780889|r 369 AGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMG 448 (493) Q Consensus 369 agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~ 448 (493) - ++|. +++..+..+|++=|. T Consensus 272 ~-------------------~~G~-----------------------------------------~v~~~i~~eL~~~m~ 291 (333) T PRK07565 272 L-------------------RHGP-----------------------------------------DYIGTILAGLEDWME 291 (333) T ss_pred H-------------------HHCC-----------------------------------------HHHHHHHHHHHHHHH T ss_conf 6-------------------5372-----------------------------------------799999999999999 Q ss_pred HCCCCCHHHHHHCCEEEEECHH Q ss_conf 1575877689737599999553 Q gi|254780889|r 449 YVGASNIEEFQKKANFIRVSVA 470 (493) Q Consensus 449 y~G~~~i~e~~~~~~f~~~t~a 470 (493) --|.++|.|++.|.-.--++.+ T Consensus 292 ~~G~~si~e~~G~l~~~~~~~~ 313 (333) T PRK07565 292 RHGYESLSQFRGSMSQKNVPDP 313 (333) T ss_pred HCCCCCHHHHCCCCCCCCCCCH T ss_conf 8399989996172365679983 No 112 >cd04739 DHOD_like Dihydroorotate dehydrogenase (DHOD) like proteins. DHOD catalyzes the oxidation of (S)-dihydroorotate to orotate. This is the fourth step and the only redox reaction in the de novo biosynthesis of UMP, the precursor of all pyrimidine nucleotides. DHOD requires FMN as co-factor. DHOD divides into class 1 and class 2 based on their amino acid sequences and cellular location. Members of class 1 are cytosolic enzymes and multimers while class 2 enzymes are membrane associated and monomeric. The class 1 enzymes can be further divided into subtypes 1A and 1B which are homodimers and heterotetrameric proteins, respectively. This subgroup has the conserved FMN binding site, but lacks some catalytic residues and may therefore be inactive. Probab=99.23 E-value=2.2e-09 Score=86.67 Aligned_cols=174 Identities=21% Similarity=0.226 Sum_probs=117.2 Q ss_pred CCHHHHHHHHHHHCCCCEEEEECCCCH----------HHHHHHHHHHHHHHCCCCCEEC---CCCCC-HHHHHHHHHCCC Q ss_conf 701567899987313666998248411----------3889999999998579872451---46469-999999985487 Q gi|254780889|r 231 AKDIADRVGPLFDVNVDLVVVDTAHGH----------SQKVLDAVVQIKKNFPSLLVMA---GNIAT-AEGALALIDAGA 296 (493) Q Consensus 231 ~~d~~eRa~~LveaGvDviviD~ahGh----------~~~~~~~i~~~k~~~~~~~ii~---GNv~t-~~~~~~l~~~Ga 296 (493) .+++.+-++.+-++|+|+|-+.+++-| .....+.++++|+.. ++||++ .|+.. .+-++.+.++|| T Consensus 111 ~ee~~~~a~~~~~~gad~lElNls~~~~~~~~~~~~~~~~~~~iv~~Vk~~~-~~Pv~vKLsP~~~di~~ia~aa~~~GA 189 (325) T cd04739 111 AGGWVDYARQIEEAGADALELNIYALPTDPDISGAEVEQRYLDILRAVKSAV-TIPVAVKLSPFFSALAHMAKQLDAAGA 189 (325) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHCC-CCCEEEECCCCCCHHHHHHHHHHHCCC T ss_conf 8999999999976499879996566788855442106889999999998607-886699539983009999999997599 Q ss_pred CEEEE-EECCC---CCCCCCHHCCC--CCCCC--HHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHH Q ss_conf 58996-10278---76653100015--74310--0378899986226981997265437776213367148789932301 Q gi|254780889|r 297 DIIKV-GIGPG---SICTTRVVTGV--GCPQL--SAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLL 368 (493) Q Consensus 297 d~ikv-Gig~G---s~CtTr~~~g~--g~pq~--~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~ 368 (493) |++-+ .--+| .+.+-+.+.+. +-|-. -++..++.+....++|||+-|||.+..|+++-|.|||++|.+|+.+ T Consensus 190 dgi~liNT~~~~~id~~~~~~~~~~~lSg~~~~~~alr~v~~~~~~~~ipIiG~GGI~s~~Da~e~ilAGAsaVQv~TA~ 269 (325) T cd04739 190 DGLVLFNRFYQPDIDLETLEVVPNLLLSSPAEIRLPLRWIAILSGRVKASLAASGGVHDAEDVVKYLLAGADVVMTTSAL 269 (325) T ss_pred CEEEEECCCCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHCCCCCCEEEECCCCCHHHHHHHHHCCCCHHHEEHHH T ss_conf 88997357665642167641536877457530068899999996468989888889598999999998098876143234 Q ss_pred HCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHH Q ss_conf 01216776448736805888733200688752212221012333433100454026621057589989999999998555 Q gi|254780889|r 369 AGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMG 448 (493) Q Consensus 369 agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~ 448 (493) - |+|. +++..+..+|..-|. T Consensus 270 ~-------------------~~G~-----------------------------------------~i~~~i~~eL~~~m~ 289 (325) T cd04739 270 L-------------------RHGP-----------------------------------------DYIGTLLAGLEAWME 289 (325) T ss_pred H-------------------HHCC-----------------------------------------HHHHHHHHHHHHHHH T ss_conf 6-------------------4183-----------------------------------------799999999999999 Q ss_pred HCCCCCHHHHHHCCEEE Q ss_conf 15758776897375999 Q gi|254780889|r 449 YVGASNIEEFQKKANFI 465 (493) Q Consensus 449 y~G~~~i~e~~~~~~f~ 465 (493) --|.++|+|++.|...- T Consensus 290 ~~G~~si~e~~Gkl~~~ 306 (325) T cd04739 290 EHGYESVQQLRGSMSQK 306 (325) T ss_pred HCCCCCHHHHCCCCCCC T ss_conf 83999799962734545 No 113 >TIGR00393 kpsF sugar isomerase, KpsF/GutQ family; InterPro: IPR004800 This is a family of closely related proteins with the phosphosugar-binding domain SIS (Sugar ISomerase) followed by two copies of the CBS (named after Cystathionine Beta Synthase) domain. The group includes GutQ, a protein of the glucitol operon and KpsF, a virulence factor involved in capsular polysialic acid biosynthesis in some pathogenic strains of Escherichia coli.; GO: 0016853 isomerase activity, 0005975 carbohydrate metabolic process. Probab=99.22 E-value=1.9e-11 Score=101.61 Aligned_cols=120 Identities=28% Similarity=0.416 Sum_probs=101.8 Q ss_pred CCCCHHHHHH----------HHHHHHHHCCCEEC-CCE-EEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHH Q ss_conf 8989899999----------99987422388152-878-99465430345656676050416786688602788723545 Q gi|254780889|r 77 RNFSPSEQVA----------QVHQVKKFESGMVV-NPV-TISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRD 144 (493) Q Consensus 77 rn~sie~qa~----------~V~kVKr~e~gmI~-dPV-ti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rD 144 (493) ||.+.|+=|. -.-||| +.|-+ |-+ .+.|.+++.||+-.|.|.+.+...|+|++ .++.||+|+.| T Consensus 133 rnF~~eDFA~~HPGG~LG~kLL~kV~---dlm~t~d~lP~v~~tas~~DAL~e~~~~~LG~~~v~~~~-~~~~Gv~tDGD 208 (272) T TIGR00393 133 RNFSEEDFAKFHPGGALGRKLLVKVK---DLMQTDDELPLVAPTASFKDALLEMSRKRLGLAVVCDEN-EQLKGVFTDGD 208 (272) T ss_pred CCCCHHHHHCCCCCHHHHHHHHHHHH---HHHCCCCCCCCEECCCCCEEEEEEEECCCCCEEEEEECC-CCEEEEEECCH T ss_conf 58884244114870466678888767---764066567822236772021023313786179997155-52467871465 Q ss_pred HCC------CCCCCCCCCCCCCCCCCCCC-CCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEE Q ss_conf 001------43446724400267863257-303589999999865533064574799786889 Q gi|254780889|r 145 VRF------ASNAQQAVGELMTRNLITVK-KTVNLENAKALLHQHRIEKLLVVDDDGCCIGLI 200 (493) Q Consensus 145 ir~------~~~~~~~V~eiMt~~litv~-~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGII 200 (493) +|- ...++.+|+++||..+.++. ++.-+.||.++|.++||+.++|||++|+|+|++ T Consensus 209 ~RR~l~~~g~~~l~~~v~~~mT~~p~~~~n~~~~l~~A~~~l~~~kI~~~~vvdd~nkl~G~~ 271 (272) T TIGR00393 209 LRRVLALLGGGALKKEVKDFMTLGPKTLKNSDELLVEALEFLKKRKITSLVVVDDENKLLGVL 271 (272) T ss_pred HHHHHHHHCCHHCCCCHHHHCCCCCEEEECHHHHHHHHHHHHHHCCCEEEEEECCCCEEEEEE T ss_conf 999998816600023122210688813401456899999998727942899983897178753 No 114 >PRK10070 glycine betaine transporter ATP-binding subunit; Provisional Probab=99.22 E-value=5.7e-11 Score=98.14 Aligned_cols=124 Identities=19% Similarity=0.174 Sum_probs=100.6 Q ss_pred HHHHHHHH-----HHCCCEECCCEEE---ECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCC Q ss_conf 99999874-----2238815287899---465430345656676050416786688602788723545001434467244 Q gi|254780889|r 85 VAQVHQVK-----KFESGMVVNPVTI---SPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVG 156 (493) Q Consensus 85 a~~V~kVK-----r~e~gmI~dPVti---~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~ 156 (493) ++.|+.|+ +.++.|..+|.++ .|.....+|++.|.+++.+.+.|+|++ ++++|+|+..+++........+. T Consensus 262 ~~F~~~v~~~~~~~a~~i~~~~~~~~~~~~~~~~~~~al~~m~~~~~~~~~vvd~~-~~~~G~v~~~~~~~~~~~~~~~~ 340 (400) T PRK10070 262 RTFFRGVDISQVFSAKDIARRTPNGLIRKTPGFGPRSALKLLQDEDREYGYVIERG-NKFVGAVSIDSLKAALTQQQGLD 340 (400) T ss_pred HHHHCCCCHHHCEEHHHHHCCCCCCEEECCCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHHHHCCCCHH T ss_conf 98755477877023989624687521314888699999999985598679998699-80889988999997763377636 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCC Q ss_conf 002678632573035899999998655330645747997868898021000101 Q gi|254780889|r 157 ELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 157 eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) ++|.+++.++++++++.+++..|.++... +||||++|+|+|+||+..++.... T Consensus 341 ~~~~~~~~~v~~~~~l~~~~~~~~~~~~~-~~Vvd~~~~l~G~it~~~ll~~L~ 393 (400) T PRK10070 341 AALIDAPLAVDAQTPLSELLSHVGQAPCA-VPVVDEDQQYVGIISKGMLLRALD 393 (400) T ss_pred HHHHCCCCCCCCCCCHHHHHHHHHHCCCC-EEEECCCCCEEEEEEHHHHHHHHH T ss_conf 67505884239999899999999728996-389879991999998799999987 No 115 >COG1253 TlyC Hemolysins and related proteins containing CBS domains [General function prediction only] Probab=99.18 E-value=2.4e-10 Score=93.58 Aligned_cols=128 Identities=23% Similarity=0.320 Sum_probs=102.6 Q ss_pred HHHHHHHHHHHHHCC-----CEEC--CCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCC-- Q ss_conf 999999998742238-----8152--8789946543034565667605041678668860278872354500143446-- Q gi|254780889|r 82 SEQVAQVHQVKKFES-----GMVV--NPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQ-- 152 (493) Q Consensus 82 e~qa~~V~kVKr~e~-----gmI~--dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~-- 152 (493) +++.+++.+|-+..+ .|+. +.+.+..+.++.++.+.+.+++||++||.+++...++|++..+|+....... T Consensus 191 ~~E~~mi~~v~~l~~~~v~eiMtPR~~i~~l~~~~~~~~~~~~~~~~~~SR~PV~~~~~D~iiGiv~~Kdll~~~~~~~~ 270 (429) T COG1253 191 EEEREMINNVLDLDDRTVREIMTPRTDIVALDLTDTVEELIELILESGHSRIPVYDGDLDNIIGIVHVKDLLRALLDGQS 270 (429) T ss_pred HHHHHHHHHHHCCCCCCCCEEEEECCCEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEHHHHHHHHHCCCC T ss_conf 89999999650213756261760315579941899999999999758986587885799837888689999976431564 Q ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 724400267863257303589999999865533064574799786889802100010 Q gi|254780889|r 153 QAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 153 ~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) .+....--+++..+++.+++.++++.|++++.+.+.|+|+.|...|++|..||+... T Consensus 271 ~~~~~~~~~~~~~Vpet~~~~~lL~~~r~~~~hmAiVvDEyG~~~GlVTleDIiEeI 327 (429) T COG1253 271 DLDLRVLVRPPLFVPETLSLSDLLEEFREERTHMAIVVDEYGGVEGLVTLEDIIEEI 327 (429) T ss_pred CCCCCCCCCCCEEECCCCCHHHHHHHHHHHCCEEEEEEECCCCEEEECCHHHHHHHH T ss_conf 444200257871778989799999998863874999993789843326589999998 No 116 >KOG0538 consensus Probab=99.12 E-value=9e-10 Score=89.51 Aligned_cols=300 Identities=21% Similarity=0.312 Sum_probs=189.4 Q ss_pred CHHEEECCCCC-CCCCCCEEEEEEECCCCEECCCEEEEC-----CCCC-CCHHHHHHHHHCCCEEEEC--CCCCHHHHHH Q ss_conf 00057527643-247542253343038816003178337-----8523-6888999999779879997--8989899999 Q gi|254780889|r 16 FDDVLLRPEFS-NVLPRDIDISTRIAKDFTLNLPIMSAA-----MDQV-TDSRLAIAMAQAGGLGVIH--RNFSPSEQVA 86 (493) Q Consensus 16 fDDVllvP~~s-~v~~~~vdl~t~lt~~i~l~iPivSs~-----MDTV-Te~~MAiamA~~GGiGvIH--rn~sie~qa~ 86 (493) |.-.+|.|.+- ++ +.+|+||.+- +-+.+.||.-|| |-.. -|...|-+-++.|-+=+|- --+|+|+-++ T Consensus 40 F~ri~~rPr~L~dV--~~iD~sTtvl-G~~i~~Pi~iapTa~qkma~pdGE~~taraa~~~~~~~i~Ss~at~S~EdI~~ 116 (363) T KOG0538 40 FRRILFRPRILRDV--SKIDTSTTVL-GQKISAPIMIAPTAMQKMAHPDGELATARAAQAAGTIMILSSWATCSVEDIAS 116 (363) T ss_pred HHHHHCCCHHHEEC--CCCCCCEEEC-CCCCCCEEEECCHHHHHCCCCCCCHHHHHHHHHCCCCEEEECCHHCCHHHHHH T ss_conf 87550142143002--5355410331-40126416875166766048862278889886569858973101078999985 Q ss_pred HHHHHHHHCCCEECCCEEEECCCCHHH-HHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCC Q ss_conf 999874223881528789946543034-5656676050416786688602788723545001434467244002678632 Q gi|254780889|r 87 QVHQVKKFESGMVVNPVTISPYATLAD-ALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLIT 165 (493) Q Consensus 87 ~V~kVKr~e~gmI~dPVti~pd~TI~e-a~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~lit 165 (493) -..++...=-..+.+|..|.+ .++..++.+|..+.|+-+- -.+|- .+.|+++ -+. T Consensus 117 ------aap~~~rwfQLYvykdr~It~~Lv~raEk~GfkAlvlTvDt--P~lG~-R~~D~~n---------------~f~ 172 (363) T KOG0538 117 ------AAPPGIRWFQLYVYKDRDITEQLVKRAEKAGFKALVLTVDT--PRLGR-RESDIKN---------------KFS 172 (363) T ss_pred ------HCCCCCEEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEECC--CCCCC-CHHHHHH---------------CCC T ss_conf ------18877379999853744689999999997296699998346--11267-6044440---------------256 Q ss_pred CCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCC Q ss_conf 57303589999999865533064574799786889802100010174223265687655566513701567899987313 Q gi|254780889|r 166 VKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVN 245 (493) Q Consensus 166 v~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaG 245 (493) .+.+. +++++.- +..+ - +.+. -++| T Consensus 173 lp~~l------------------------------~lknfe~----~~~~-----------~--v~~~--------~~sg 197 (363) T KOG0538 173 LPKNL------------------------------TLKNFEG----LKLT-----------E--VEEA--------GDSG 197 (363) T ss_pred CCCCC------------------------------CCCCCCC----CCCC-----------C--CCCC--------CCHH T ss_conf 87421------------------------------0026555----6655-----------6--7866--------3134 Q ss_pred CC---EEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCC Q ss_conf 66---699824841138899999999985798724514646999999998548758996102787665310001574310 Q gi|254780889|r 246 VD---LVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQL 322 (493) Q Consensus 246 vD---viviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~ 322 (493) .. .-.+|.++-+. -|+|+++.. ++||++-.|-|.|-|+.-.++|+++|.|-=-+|- +.-.....+ T Consensus 198 ~~~~~~~~id~Sl~W~-----Di~wLr~~T-~lPIvvKGilt~eDA~~Ave~G~~GIIVSNHGgR------QlD~vpAtI 265 (363) T KOG0538 198 LAAYVSSQIDPSLSWK-----DIKWLRSIT-KLPIVVKGVLTGEDARKAVEAGVAGIIVSNHGGR------QLDYVPATI 265 (363) T ss_pred HHHHHHCCCCCCCCHH-----HHHHHHHCC-CCCEEEEEECCCHHHHHHHHHCCCEEEEECCCCC------CCCCCCCHH T ss_conf 6666423788777742-----469998527-5876998311438799999808865998578753------257664118 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCC Q ss_conf 03788999862269819972654377762133671487899323010121677644873680588873320068875221 Q gi|254780889|r 323 SAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGS 402 (493) Q Consensus 323 ~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~ 402 (493) +++-|+.++.... +||.-|||+|..-||.||||+||.+|.+|..+ -||.+- T Consensus 266 ~~L~Evv~aV~~r-i~V~lDGGVR~G~DVlKALALGAk~VfiGRP~--------------------v~gLA~-------- 316 (363) T KOG0538 266 EALPEVVKAVEGR-IPVFLDGGVRRGTDVLKALALGAKGVFIGRPI--------------------VWGLAA-------- 316 (363) T ss_pred HHHHHHHHHHCCC-EEEEEECCCCCCHHHHHHHHCCCCEEEECCCC--------------------EEEECC-------- T ss_conf 8799999986285-47997267335427999985167368856721--------------------020002-------- Q ss_pred HHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCC Q ss_conf 222101233343310045402662105758998999999999855515758776897375 Q gi|254780889|r 403 SARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQKKA 462 (493) Q Consensus 403 ~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~~~ 462 (493) +|..| |+++|.-|..-+..+|.-.||+|+.|+..+- T Consensus 317 -----------------~Ge~G-------V~~vl~iL~~efe~tmaLsGc~sv~ei~~~~ 352 (363) T KOG0538 317 -----------------KGEAG-------VKKVLDILRDEFELTMALSGCRSVKEITRNH 352 (363) T ss_pred -----------------CCCHH-------HHHHHHHHHHHHHHHHHHHCCCCHHHCCCCC T ss_conf -----------------56032-------9999999999999999984786065407452 No 117 >KOG1764 consensus Probab=99.01 E-value=6.1e-10 Score=90.74 Aligned_cols=108 Identities=32% Similarity=0.490 Sum_probs=56.3 Q ss_pred CCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCC------CCCCC-CCCCC------CCCCCCCC Q ss_conf 8789946543034565667605041678668860278872354500143------44672-44002------67863257 Q gi|254780889|r 101 NPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFAS------NAQQA-VGELM------TRNLITVK 167 (493) Q Consensus 101 dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~------~~~~~-V~eiM------t~~litv~ 167 (493) +..++.+++++.+|++.|.+++++++||||.. |+++|+++..|+.+.. ..+.. +++.- ...++++. T Consensus 238 ~i~~i~~~~~v~~al~~m~~~~is~lpvV~~~-g~~v~~~s~~Dv~~l~~~~~~~~~~~~~l~~~~~~~~~~~~~vvtc~ 316 (381) T KOG1764 238 NIASISEDTPVIEALKIMSERRISALPVVDEN-GKKVGNYSRFDVIHLAREGTYNNLDLSCLSEALSHRPIRFEGVVTCR 316 (381) T ss_pred HHHHHCCCCCHHHHHHHHHHCCCCCCCEECCC-CCEECCEEEEHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCCCCEEEE T ss_conf 77775178708999999987045744347577-85532156200121564277677773242135331554568737984 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 303589999999865533064574799786889802100010 Q gi|254780889|r 168 KTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 168 ~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) +..++.+++..|..++.+++.|||++|.|+|++|..||+... T Consensus 317 ~~ssL~~vi~~lv~~~vHRl~VVd~~~~l~GvvSLsDil~~l 358 (381) T KOG1764 317 PTSTLAEVIDKLVAHRVHRLWVVDEDGVLVGVISLSDILSYL 358 (381) T ss_pred ECCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHH T ss_conf 377499999999873860599986898679875589999999 No 118 >cd04597 CBS_pair_DRTGG_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a DRTGG domain upstream. The function of the DRTGG domain, named after its conserved residues, is unknown. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=98.95 E-value=1e-09 Score=89.13 Aligned_cols=61 Identities=33% Similarity=0.543 Sum_probs=56.2 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 4344672440026786325730358999999986553306457479978688980210001 Q gi|254780889|r 148 ASNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 148 ~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) ..+-..+|+|+|+++++++++++++.+|+++|.+++++.|||||++|+|+||||.+||-++ T Consensus 53 i~~v~~qV~DIm~~~~vti~~~~sl~eA~~lM~e~~i~~LPVVD~~g~l~GiIT~~DIAe~ 113 (113) T cd04597 53 LADVHPRVRDVINRKPVTARPNDPLREALNLMHEHNIRTLPVVDDDGTPAGIITLLDLAEK 113 (113) T ss_pred CCCCCCHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHCC T ss_conf 2156505666313788341999829999999998499878698899939999877874357 No 119 >PRK05567 inositol-5'-monophosphate dehydrogenase; Reviewed Probab=98.93 E-value=6.4e-08 Score=76.24 Aligned_cols=241 Identities=22% Similarity=0.349 Sum_probs=135.5 Q ss_pred CCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCC---CCCCCCCCCCCCCCCCH Q ss_conf 881528789946543034565667605041678668860278872354500143446724---40026786325730358 Q gi|254780889|r 96 SGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAV---GELMTRNLITVKKTVNL 172 (493) Q Consensus 96 ~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V---~eiMt~~litv~~~~~l 172 (493) +.-+.-|+.-+|=+|+.|..-.+.--..+++-|+..+. ++......-.+| +..|..+|++++++.++ T Consensus 37 ~i~L~iPivSS~MDTVTe~~MAiamA~~GGlGVIHrn~----------sie~Q~~~V~~VKr~e~g~i~~P~tl~P~~tv 106 (486) T PRK05567 37 NIRLNIPLLSAAMDTVTEARLAIAMAQEGGIGVIHKNM----------SIEEQAEEVRKVKRSESGVVTDPVTVTPDTTL 106 (486) T ss_pred CCCCCCCEEECCCCCCCCHHHHHHHHHCCCEEEECCCC----------CHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCH T ss_conf 91328877867876657899999999889879998999----------99999999999975306713798676898889 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCC--CCCCEEEEEEEECCCHHHHHHHHHHHCCCC-EE Q ss_conf 999999986553306457479978688980210001017422326--568765556651370156789998731366-69 Q gi|254780889|r 173 ENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKD--SKGRLRVAAAVSVAKDIADRVGPLFDVNVD-LV 249 (493) Q Consensus 173 ~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D--~~grL~VgAAIg~~~d~~eRa~~LveaGvD-vi 249 (493) .|+.++|.++++.-+||+|++++|+||+|.+|+.-...+-...++ ....|.. +.-++..+. --+.|.+.-.+ +. T Consensus 107 ~d~~~l~~~~~~sg~PVv~~~~kL~GiiT~rD~~f~~~~~~~V~~vMT~~~lvt-~~~~is~~e--A~~~l~~~kieklP 183 (486) T PRK05567 107 AEALALMARYGISGVPVVDEEGKLVGIITNRDVRFETDLSQPVSEVMTKERLVT-VPEGTTLEE--ALELLHEHRIEKLL 183 (486) T ss_pred HHHHHHHHHHCCCCCEEECCCCCEEEEEECCCEECCCCCCCCHHHHHCCCCEEE-ECCCCCHHH--HHHHHHHHHHCCCC T ss_conf 999999997287861487679947888614201100267765467534573289-258899999--99999973130342 Q ss_pred EEECCCCHHHHHHHH--HHHHHHHCCCC-------CEECCCCC----CHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCC Q ss_conf 982484113889999--99999857987-------24514646----999999998548758996102787665310001 Q gi|254780889|r 250 VVDTAHGHSQKVLDA--VVQIKKNFPSL-------LVMAGNIA----TAEGALALIDAGADIIKVGIGPGSICTTRVVTG 316 (493) Q Consensus 250 viD~ahGh~~~~~~~--i~~~k~~~~~~-------~ii~GNv~----t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g 316 (493) ++|- +|+-...+.. +.+. +.||+- ..++.-|. +.|-++.|.++|+|.+-+-.--| T Consensus 184 vVd~-~g~L~Glit~kDi~k~-~~~P~A~~D~~grL~VgAAVg~~~~~~eRa~~Lv~AGvDvivIDtAhG---------- 251 (486) T PRK05567 184 VVDD-NGRLKGLITVKDIEKA-EEFPNACKDEQGRLRVGAAVGVGADNEERAEALVKAGVDVLVVDTAHG---------- 251 (486) T ss_pred EECC-CCCEEEEEEHHHHHHH-HHCCCCCCCCCCCEEEEEEECCCCCHHHHHHHHHHCCCCEEEEECCCC---------- T ss_conf 7746-8968888776677765-208851236668889999962680189999999976998899504452---------- Q ss_pred CCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEE-EECCCEEEEC Q ss_conf 574310037889998622698199726543777621336-7148789932 Q gi|254780889|r 317 VGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAI-AAGSACVMIG 365 (493) Q Consensus 317 ~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAl-a~GA~~VM~G 365 (493) .-.-++.++.+..+.. .++|||+ |.+- ..+.++.| -+|||+|-+| T Consensus 252 hs~~vi~~ik~ik~~~--~~v~via-GNv~-T~~~a~~L~~aGaD~vkVG 297 (486) T PRK05567 252 HSEGVLDRVREIKAKY--PDVQIIA-GNVA-TAEAARALIEAGADAVKVG 297 (486) T ss_pred CHHHHHHHHHHHHHCC--CCCCEEE-EEEC-HHHHHHHHHHCCCCEEEEC T ss_conf 1577899999997407--8773687-5120-1999999997298769965 No 120 >PTZ00314 inosine-5'-monophosphate dehydrogenase; Provisional Probab=98.92 E-value=4.9e-08 Score=77.04 Aligned_cols=49 Identities=14% Similarity=0.295 Sum_probs=29.4 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECC---CCCEEEEEEECCCH Q ss_conf 02678632573035899999998655330645747---99786889802100 Q gi|254780889|r 158 LMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDD---DGCCIGLITVKDIE 206 (493) Q Consensus 158 iMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe---~g~LvGIIT~~DIl 206 (493) .|..+|+|+.++.++.++.++|.++++.-+||+|+ +++|+||+|.+|+. T Consensus 100 g~i~dPit~~p~~ti~d~~~l~~~~~~sg~pVv~~g~~~gkL~GIvT~rD~~ 151 (499) T PTZ00314 100 GFIMDPKSLSPEHTVSDVIEIKDKKGFSGIPITEDGRPGGKLLGIVTSKDID 151 (499) T ss_pred CCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCCEEEEEEECCCEE T ss_conf 6206982738986299999756751886367986388688689998362023 No 121 >cd04729 NanE N-acetylmannosamine-6-phosphate epimerase (NanE) converts N-acetylmannosamine-6-phosphate to N-acetylglucosamine-6-phosphate. This reaction is part of the pathway that allows the usage of sialic acid as a carbohydrate source. Sialic acids are a family of related sugars that are found as a component of glycoproteins, gangliosides, and other sialoglycoconjugates. Probab=98.92 E-value=1.7e-08 Score=80.35 Aligned_cols=128 Identities=23% Similarity=0.423 Sum_probs=96.9 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCH---HHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCC Q ss_conf 567899987313666998248411---38899999999985798724514646999999998548758996102787665 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVDTAHGH---SQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICT 310 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD~ahGh---~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~Ct 310 (493) ..+-+.+|+++|+|+|.+|.-+-. -....+.++++|+++ +..+| ..++|.|-+..-.++|+|.|---. .| - T Consensus 81 t~~ev~~l~~aGadiIA~DaT~R~RP~g~~l~~~i~~i~~~~-~~l~M-AD~st~ee~~~A~~~G~D~vgTTL-~G---Y 154 (219) T cd04729 81 TIEEVDALAAAGADIIALDATDRPRPDGETLAELIKRIHEEY-NCLLM-ADISTLEEALNAAKLGFDIIGTTL-SG---Y 154 (219) T ss_pred CHHHHHHHHHCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHH-CCEEE-EECCCHHHHHHHHHCCCCEEECCC-CC---C T ss_conf 899999999859999999467887989978999999999986-97788-754889999999984998997021-45---6 Q ss_pred CCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHC Q ss_conf 310001574310037889998622698199726543777621336714878993230101 Q gi|254780889|r 311 TRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAG 370 (493) Q Consensus 311 Tr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~ag 370 (493) |......--|-+..+.++++. .++||||.|+|.++-+..|||.+||++|.+||..-. T Consensus 155 T~~t~~~~~PD~~lv~~l~~~---~~~pvIaEGri~tPe~a~~a~~~GA~aVVVGsAITr 211 (219) T cd04729 155 TEETAKTEDPDFELLKELRKA---LGIPVIAEGRINSPEQAAKALELGADAVVVGSAITR 211 (219) T ss_pred CCCCCCCCCCCHHHHHHHHHH---CCCCEEEECCCCCHHHHHHHHHCCCCEEEECCCCCC T ss_conf 778788999878999999997---599399706989999999999839989998954388 No 122 >PRK07807 inositol-5-monophosphate dehydrogenase; Validated Probab=98.91 E-value=4.6e-08 Score=77.29 Aligned_cols=188 Identities=26% Similarity=0.334 Sum_probs=85.0 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCC-CCCEEEEEEEECCCHHHH Q ss_conf 0267863257303589999999865533064574799786889802100010174223265-687655566513701567 Q gi|254780889|r 158 LMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDS-KGRLRVAAAVSVAKDIAD 236 (493) Q Consensus 158 iMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~-~grL~VgAAIg~~~d~~e 236 (493) .+..+|+|+.++.++.|+.++|.++++.-+||||++|+|+||||.+|+.....+. ...+- ..+| |-|-.|+..+ | T Consensus 94 ~~i~~Pvti~pd~tv~d~~~l~~~~~~sg~pVv~~~gkLvGIvT~RDir~~d~~~-~v~~vMT~~l-vt~~~g~sl~--e 169 (479) T PRK07807 94 LVFDTPVTLAPDDTVSDALALIHKRAHGAVVVVDEEGRPVGLVTEADCRGVDRFT-RVGDVMSTDL-VTLPAGTDPR--K 169 (479) T ss_pred CEECCCEEECCCCCHHHHHHHHHHHCCCCCCEECCCCCEEEEEECHHHCCCCCCC-CHHHHHCCCC-EEECCCCCHH--H T ss_conf 7436998978987199999999983788874146799478898213411477777-5888625773-6613667879--9 Q ss_pred HHHHHHHCCCC-EEEEECCCCHHHHHH---HHHHHHHHHCCC------CCEECCCCCC----HHHHHHHHHCCCCEEEEE Q ss_conf 89998731366-699824841138899---999999985798------7245146469----999999985487589961 Q gi|254780889|r 237 RVGPLFDVNVD-LVVVDTAHGHSQKVL---DAVVQIKKNFPS------LLVMAGNIAT----AEGALALIDAGADIIKVG 302 (493) Q Consensus 237 Ra~~LveaGvD-viviD~ahGh~~~~~---~~i~~~k~~~~~------~~ii~GNv~t----~~~~~~l~~~Gad~ikvG 302 (493) -...|-+.-.. +.++|- +|+-...+ +..| ++.||. ...++.-|.+ .|-++.|+++|+|.+-|- T Consensus 170 A~~lL~~~kiekLpvVd~-~~~L~gLiT~kDi~k--~~~~~~a~D~~grL~VgAAVGv~~d~~eR~~aLv~AGvDvlvID 246 (479) T PRK07807 170 AFDLLEAAPVKVAPVVDA-DGRLAGVLTRTGALR--ATIYTPAVDAAGRLRVAAAVGINGDVAAKARALLEAGVDVLVID 246 (479) T ss_pred HHHHHHHCCCCCCEEECC-CCEEEEEEEEEHHHH--CCCCCCCCCHHHCEEEEEEECCCCCHHHHHHHHHHCCCCEEEEE T ss_conf 999997535231137756-992999999611645--44688777822345678872578458999999997699899975 Q ss_pred ECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEEC Q ss_conf 027876653100015743100378899986226981997265437776213367148789932 Q gi|254780889|r 303 IGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIG 365 (493) Q Consensus 303 ig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G 365 (493) ..-| .-.-++.++.+..+ ...+++||| |-+-+.--..--+-+|||++-+| T Consensus 247 tAHG----------hS~~vi~~vk~iK~--~~p~~~via-GNvaT~~~a~~Li~aGad~ikvG 296 (479) T PRK07807 247 TAHG----------HQEKMLEAIRAVRA--LDPGVPLVA-GNVVTAEGTRDLVEAGADIVKVG 296 (479) T ss_pred CCCC----------CHHHHHHHHHHHHH--HCCCCCEEE-EEHHHHHHHHHHHHCCCCEEECC T ss_conf 4576----------64899999999984--089885787-43202999999997399976315 No 123 >PRK07107 inositol-5-monophosphate dehydrogenase; Validated Probab=98.91 E-value=2e-08 Score=79.80 Aligned_cols=187 Identities=22% Similarity=0.289 Sum_probs=88.9 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECC---CCCEEEEEEECCCHHCCCCCCC-CCC---CCCCEEEEEEEEC Q ss_conf 02678632573035899999998655330645747---9978688980210001017422-326---5687655566513 Q gi|254780889|r 158 LMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDD---DGCCIGLITVKDIERSQLNPNA-TKD---SKGRLRVAAAVSV 230 (493) Q Consensus 158 iMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe---~g~LvGIIT~~DIlk~~~~p~A-~~D---~~grL~VgAAIg~ 230 (493) -|..+|+|++++.++.|+.++|.++++..+||+|+ +|+|+||||.+|+--.+..++. ..| ...+|.. +-.|+ T Consensus 102 g~I~dPvti~p~~Tv~da~~l~~k~~~sg~PVvd~g~~~gkLvGIvT~RD~rf~~~~~~~~V~~vMT~~~~Lvt-~~~g~ 180 (497) T PRK07107 102 GFVISDSNLTPDNTLADVLDLKERTGHSTIAVTEDGTANGKLLGIVTSRDYRVSRMSLDTKVKDFMTPFEKLVV-AKVGI 180 (497) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCEEEEEEECCEEECCCCCCCCCHHHHCCCCCCEEE-CCCCC T ss_conf 76689817589864999999888719962345103785987999984640111136777766564067654278-46889 Q ss_pred CCHHHHHHH-HHHHCCCC-EEEEECCCCHHHHHHHHHHHHH--HHCCC-------CCEECCCCCC---HHHHHHHHHCCC Q ss_conf 701567899-98731366-6998248411388999999999--85798-------7245146469---999999985487 Q gi|254780889|r 231 AKDIADRVG-PLFDVNVD-LVVVDTAHGHSQKVLDAVVQIK--KNFPS-------LLVMAGNIAT---AEGALALIDAGA 296 (493) Q Consensus 231 ~~d~~eRa~-~LveaGvD-viviD~ahGh~~~~~~~i~~~k--~~~~~-------~~ii~GNv~t---~~~~~~l~~~Ga 296 (493) . .+-|. .|-+...+ +.++|- +|+-..++ +.|.|+ +.||. ...++.-|.| .|-+..|+++|+ T Consensus 181 s---l~eA~~lL~~~kiekLpvVd~-~g~L~gLi-T~kDi~k~~~~P~a~~D~~grL~VgAAIg~~d~~eRa~~Lv~aGv 255 (497) T PRK07107 181 T---LKEANNIIWDHKLNTLPIIDK-NQHLVYMV-FRKDYDSHKENPLELLDSSKRYVVGAGINTRDYEERVPALVEAGA 255 (497) T ss_pred C---HHHHHHHHHHCCCCCEEEECC-CCCEEEEE-EHHHHHHHHHCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHCCC T ss_conf 9---999999998634443027837-99589999-837888864396311376788889996377789999999998599 Q ss_pred CEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCC--CCEEEEECCCCCHHHHEEEE-EECCCEEEEC Q ss_conf 589961027876653100015743100378899986226--98199726543777621336-7148789932 Q gi|254780889|r 297 DIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERA--GVAIVADGGIRFSGDIAKAI-AAGSACVMIG 365 (493) Q Consensus 297 d~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~--~~~iiadGGi~~~gdi~kAl-a~GA~~VM~G 365 (493) |.+-|-..-|. -.-|+.++....+ .+ .++||| |-+-+ .+-+++| .+|||+|-+| T Consensus 256 D~lviD~AhGh----------s~~v~~~ik~ik~---~~~~~~~i~a-GNVaT-~~~~~~L~~aGad~vkVG 312 (497) T PRK07107 256 DVLCIDSSDGY----------SEWQKRTLDYIKE---KYGDTVKVGA-GNVVD-RDGFLYLAEAGADFVKVG 312 (497) T ss_pred CEEEECCCCCC----------HHHHHHHHHHHHH---HCCCCCCCCC-CCHHC-HHHHHHHHHCCCCEEEEC T ss_conf 99980343535----------2999999999998---6698763414-52126-999999998089868971 No 124 >PRK01130 N-acetylmannosamine-6-phosphate 2-epimerase; Provisional Probab=98.84 E-value=4.4e-08 Score=77.43 Aligned_cols=127 Identities=24% Similarity=0.375 Sum_probs=95.5 Q ss_pred HHHHHHHHHHCCCCEEEEECCCC---HHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCC Q ss_conf 56789998731366699824841---138899999999985798724514646999999998548758996102787665 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVDTAHG---HSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICT 310 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD~ahG---h~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~Ct 310 (493) ..+-+.+|+++|+|+|.+|.-.- +.....+.++.+|+.| +..+| ..++|.|-+..-.++|+|.|---. -.- T Consensus 77 t~~ev~~l~~aGadiIA~DaT~R~RP~g~~~~~~i~~i~~~~-~~l~M-AD~st~eea~~A~~~G~D~V~TTL----sGY 150 (222) T PRK01130 77 TLKEVDALAAAGADIIALDATLRPRPDGETLAELVKRIKEKP-GQLLM-ADCSTLEEGLAAAKLGFDFIGTTL----SGY 150 (222) T ss_pred CHHHHHHHHHCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHH-CCEEE-EECCCHHHHHHHHHCCCCEEECCC----CCC T ss_conf 699999999869999998467898989968999999999982-98789-854889999999984999997233----456 Q ss_pred CCHHCC--CCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHC Q ss_conf 310001--574310037889998622698199726543777621336714878993230101 Q gi|254780889|r 311 TRVVTG--VGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAG 370 (493) Q Consensus 311 Tr~~~g--~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~ag 370 (493) |....+ .--|-+..+.++++ .++|+||.|+|.++-+..|||.+||++|-.||..-. T Consensus 151 T~~t~~~~~~~pD~~lv~~l~~----~~~pvIaEGri~tPe~a~~al~~GA~aVvVGsAITr 208 (222) T PRK01130 151 TEYTEGETPEEPDFALLKELLK----AGCPVIAEGRINTPEQAKKALELGAHAVVVGSAITR 208 (222) T ss_pred CCCCCCCCCCCCCHHHHHHHHH----CCCCEEEECCCCCHHHHHHHHHCCCCEEEECCCCCC T ss_conf 7676778789986999999995----899899747989999999999849989998975479 No 125 >pfam04131 NanE Putative N-acetylmannosamine-6-phosphate epimerase. This family represents a putative ManNAc-6-P-to-GlcNAc-6P epimerase in the N-acetylmannosamine (ManNAc) utilisation pathway found mainly in pathogenic bacteria. Probab=98.82 E-value=5.1e-08 Score=76.92 Aligned_cols=125 Identities=22% Similarity=0.314 Sum_probs=94.4 Q ss_pred HHHHHHHHHHCCCCEEEEECCC-CHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCC Q ss_conf 5678999873136669982484-113889999999998579872451464699999999854875899610278766531 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVDTAH-GHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTR 312 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD~ah-Gh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr 312 (493) ..+-+.+|+++|+|++.+|.-+ -.....-+.++.+|+.+ ..+ --.+.|.|-++.-.++|+|.|---.- |- |. T Consensus 53 t~~ev~~l~~aGadiIA~DaT~R~RP~~~~~lv~~i~~~~--~l~-MAD~st~eea~~A~~~G~D~I~TTL~-GY---T~ 125 (192) T pfam04131 53 TMKDIDELANAGADIIALDGTDRPRPVDIESFIKRIKEKG--QLA-MADCSTFEEGLNAHKLGVDIVGTTLS-GY---TG 125 (192) T ss_pred CHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHC--CEE-EEECCCHHHHHHHHHCCCCEEECCCC-CC---CC T ss_conf 8999999998599999984678989758999999999819--988-99749999999999859999982325-57---89 Q ss_pred HHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHC Q ss_conf 0001574310037889998622698199726543777621336714878993230101 Q gi|254780889|r 313 VVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAG 370 (493) Q Consensus 313 ~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~ag 370 (493) ...+. -|-+..+.+.++ .++||||.|+|.++-+..+||.+||++|-+||-.-. T Consensus 126 ~t~~~-~pD~~ll~~l~~----~~~pvIaEGri~tPe~a~~a~~~GA~aVVVGsAITr 178 (192) T pfam04131 126 GSNPA-EPDFQLVKTLSE----AGCFVIAEGRYNTPELAKKAIEIGADAVTVGSAITR 178 (192) T ss_pred CCCCC-CCCHHHHHHHHH----CCCCEEEECCCCCHHHHHHHHHCCCCEEEECCCCCC T ss_conf 99999-997899999986----899399857989999999999839989998965379 No 126 >TIGR01302 IMP_dehydrog inosine-5'-monophosphate dehydrogenase; InterPro: IPR005990 Synonyms: Inosine-5'-monophosphate dehydrogenase, Inosinic acid dehydrogenase IMP dehydrogenase (1.1.1.205 from EC,IMPDH) catalyzes the rate-limiting reaction of de novo GTP biosynthesis, the NAD-dependent reduction of IMP into XMP .Inosine 5-phosphate + NAD+ + H2O = xanthosine 5-phosphate + NADH IMP dehydrogenase is associated with cell proliferation and is a possible target for cancer chemotherapy. Mammalian and bacterial IMPDHs are tetramers of identical chains. There are two IMP dehydrogenase isozymes in humans . IMP dehydrogenase nearly always contains a long insertion that has two CBS domains within it and adopts a TIM barrel structure.; GO: 0003938 IMP dehydrogenase activity, 0006177 GMP biosynthetic process. Probab=98.81 E-value=3.4e-07 Score=71.06 Aligned_cols=242 Identities=22% Similarity=0.343 Sum_probs=154.5 Q ss_pred CCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCC--CCCCCCCCCCHH Q ss_conf 881528789946543034565667605041678668860278872354500143446724400267--863257303589 Q gi|254780889|r 96 SGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTR--NLITVKKTVNLE 173 (493) Q Consensus 96 ~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~--~litv~~~~~l~ 173 (493) +.-+.-|+.=+|=+||.|..=-+.--..+++=|+..|.+ .++.+.-.+ .=++.+..+.. +|+|+.|+.++. T Consensus 30 ~i~L~iP~~SspMDTVTE~~MAiAMA~~GGiGVIH~N~~------~E~Qae~V~-~VKr~e~g~i~redp~t~~P~~tv~ 102 (476) T TIGR01302 30 NIKLNIPILSSPMDTVTESRMAIAMAREGGIGVIHRNMS------IERQAEEVK-RVKRAENGIISREDPVTISPETTVE 102 (476) T ss_pred CEEEEEEEEECCCCHHHHHHHHHHHHHCCCCEEEECCCC------HHHHHHHHH-HHHHHCCCEEEECCCEEECCCCHHH T ss_conf 306754068589882459999999985499479944799------899999988-7523206606514886847985189 Q ss_pred HHHHHHHHCCCCEEEEECC--CC-----CEEEEEEE---CCCHHCCCCCC--CCCC---CCCCEEEEEEEECCCHHHHHH Q ss_conf 9999998655330645747--99-----78688980---21000101742--2326---568765556651370156789 Q gi|254780889|r 174 NAKALLHQHRIEKLLVVDD--DG-----CCIGLITV---KDIERSQLNPN--ATKD---SKGRLRVAAAVSVAKDIADRV 238 (493) Q Consensus 174 eA~~iM~~~ki~~LPVVDe--~g-----~LvGIIT~---~DIlk~~~~p~--A~~D---~~grL~VgAAIg~~~d~~eRa 238 (493) ++.++|.++++.-+|||++ .+ +|+||||. +|+.-..+.-. |..+ .+.+|.+ +..++ ..|+| T Consensus 103 ~~~~l~~~~gisG~PVv~~G~~~g~ktGKLvGIiT~sqWrD~~f~~~~~~~daV~~~MT~~~~~iT-~~e~i---~~e~A 178 (476) T TIGR01302 103 DVLELMERKGISGIPVVEDGKDGGPKTGKLVGIITKSQWRDVRFVKDKGKKDAVSEVMTPREELIT-VPEGI---DLEEA 178 (476) T ss_pred HHHHHCCCCCCCEEEEEECCCCCCCCEEEEEEEEECCCEEECCCCCCCCCCCCEEECCCCCCCCEE-ECCCC---CHHHH T ss_conf 999732215765457883688989710069999837722541101046883301011203764348-41677---78999 Q ss_pred HHH-HHCCCC-EEEEECCCCHHHHHH---HHHHHHHHHCC----C---C--CEECCC-CC----CHHHHHHHHHCCCCEE Q ss_conf 998-731366-699824841138899---99999998579----8---7--245146-46----9999999985487589 Q gi|254780889|r 239 GPL-FDVNVD-LVVVDTAHGHSQKVL---DAVVQIKKNFP----S---L--LVMAGN-IA----TAEGALALIDAGADII 299 (493) Q Consensus 239 ~~L-veaGvD-viviD~ahGh~~~~~---~~i~~~k~~~~----~---~--~ii~GN-v~----t~~~~~~l~~~Gad~i 299 (493) ..+ -+.=.+ +.+||- ++|..+++ |..| ++.|| + . .+++|- |. +.|=+..|.+||+|.| T Consensus 179 ~~~L~~~r~ekLpvVd~-~~~lVgLiT~~Di~~--~~~~P~A~kd~vG~~GrL~VgAAvg~r~~D~~R~~~L~~AGvDv~ 255 (476) T TIGR01302 179 LKVLHKHRIEKLPVVDK-DGELVGLITVKDIVK--RREFPHASKDTVGENGRLIVGAAVGTREDDLERAEALVEAGVDVI 255 (476) T ss_pred HHHHHHHCCCEEEEECC-CCCEEEEEEHHHHHH--HHHCCCCCCCCCCCCCEEEEEEEECCCCCCHHHHHHHHHCCCCEE T ss_conf 99988608650478827-898899986447889--863888778874888608999884689861899999996596589 Q ss_pred EEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEE-ECCCEEEEC Q ss_conf 961027876653100015743100378899986226981997265437776213367-148789932 Q gi|254780889|r 300 KVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIA-AGSACVMIG 365 (493) Q Consensus 300 kvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla-~GA~~VM~G 365 (493) -+=-.-|.=| -|+..|.+..+. .-+++||| |---.-+-+++|. ||||.|=+| T Consensus 256 viDsshGhs~----------~vl~~ik~~k~~--Yp~~~iia--GNVaT~~~a~~LI~AgADg~rVG 308 (476) T TIGR01302 256 VIDSSHGHSI----------YVLDSIKKIKKT--YPDLDIIA--GNVATAEQAKALIDAGADGLRVG 308 (476) T ss_pred EEECCCCCCH----------HHHHHHHHHHHH--CCEEEEEE--CCCCCHHHHHHHHHCCCCEEEEC T ss_conf 9816654537----------899999999863--88057994--34411788988985288878983 No 127 >cd02808 GltS_FMN Glutamate synthase (GltS) FMN-binding domain. GltS is a complex iron-sulfur flavoprotein that catalyzes the reductive synthesis of L-glutamate from 2-oxoglutarate and L-glutamine via intramolecular channelling of ammonia, a reaction in the plant, yeast and bacterial pathway for ammonia assimilation. It is a multifunctional enzyme that functions through three distinct active centers, carrying out L-glutamine hydrolysis, conversion of 2-oxoglutarate into L-glutamate, and electron uptake from an electron donor. Probab=98.79 E-value=5.2e-08 Score=76.88 Aligned_cols=190 Identities=22% Similarity=0.330 Sum_probs=114.0 Q ss_pred HHCCCCEEEEECCC---CHHHHHHHHHHHHHHHCCCCC----EECCCCCCHHHHHHHH-HCCCCEEEE-EECCCCCCCC- Q ss_conf 73136669982484---113889999999998579872----4514646999999998-548758996-1027876653- Q gi|254780889|r 242 FDVNVDLVVVDTAH---GHSQKVLDAVVQIKKNFPSLL----VMAGNIATAEGALALI-DAGADIIKV-GIGPGSICTT- 311 (493) Q Consensus 242 veaGvDviviD~ah---Gh~~~~~~~i~~~k~~~~~~~----ii~GNv~t~~~~~~l~-~~Gad~ikv-Gig~Gs~CtT- 311 (493) ++.|.|++-= ..| -..+.+.+.|.+||+..+..| +.+|+ -.+-...++ .+++|.|-| |-++|.--.- T Consensus 180 ~~~G~d~iSP-~~h~~i~s~edl~~~I~~LR~~~~~kpVgvKl~~g~--~~~~l~~~~~~~~~DfItIDG~eGGTGAaP~ 256 (392) T cd02808 180 IPPGVDLISP-PPHHDIYSIEDLAQLIEDLREATGGKPIGVKLVAGH--GEGDIAAGVAAAGADFITIDGAEGGTGAAPL 256 (392) T ss_pred CCCCCCCCCC-CCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCC--CHHHHHHHHHHCCCCEEEEECCCCCCCCCHH T ss_conf 9999887799-765666999999999999997289980799968889--8899999999647999996079987541429 Q ss_pred CHHCCCCCCCCHHHHHHHHHHHCCC----CEEEEECCCCCHHHHEEEEEECCCEEEECCH--HH-CCCCCCCCEEEECCE Q ss_conf 1000157431003788999862269----8199726543777621336714878993230--10-121677644873680 Q gi|254780889|r 312 RVVTGVGCPQLSAIMSVVEVAERAG----VAIVADGGIRFSGDIAKAIAAGSACVMIGSL--LA-GTDESPGDIFLYQGR 384 (493) Q Consensus 312 r~~~g~g~pq~~av~~~~~~~~~~~----~~iiadGGi~~~gdi~kAla~GA~~VM~G~~--~a-gt~Espg~~~~~~g~ 384 (493) ...--+|.|...++..+.+++.+.+ +.+||+||+++++|++||||+|||+|.+|+. || ||.-+- . -..++ T Consensus 257 ~~~d~~GlP~~~~l~~~~~~L~~~glr~~i~liasGgl~t~~di~kalaLGAD~v~~a~~~m~alGCiq~~-~--C~~~~ 333 (392) T cd02808 257 TFIDHVGLPTELGLARAHQALVKNGLRDRVSLIASGGLRTGADVAKALALGADAVGIGTAALIALGCIQAR-K--CHTNT 333 (392) T ss_pred HHHHCCCCCHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCCHHHHCHHHHHHHHCHHHH-H--HCCCC T ss_conf 99974997389999999999997699676379963885747899999986756451056999987479976-5--05998 Q ss_pred EEEEEECCCCHHHHHHCCHHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 5888733200688752212221012333433100454026621057589989999999998555157587768973 Q gi|254780889|r 385 SFKSYRGMGSVAAMERGSSARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQK 460 (493) Q Consensus 385 ~~K~yrGm~S~~A~~~~~~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~ 460 (493) -=.|-+...- ++ +. ...++++-+ .|.+++..+...++.-+.-+|.+++.+|.. T Consensus 334 ---CP~GiaTqd~-------~l-~~-----~l~~~~~a~-------rv~ny~~~~~~el~~~~~a~G~~~~~~l~r 386 (392) T cd02808 334 ---CPVGVATQDP-------EL-RR-----RLDVEGKAE-------RVANYLKSLAEELRELAAALGKRSLELLGR 386 (392) T ss_pred ---CCCCCCCCCH-------HH-HC-----CCCCCCHHH-------HHHHHHHHHHHHHHHHHHHHCCCCHHHCCH T ss_conf ---9961113798-------88-41-----468654699-------999999999999999999857999478898 No 128 >pfam00478 IMPDH IMP dehydrogenase / GMP reductase domain. This family is involved in biosynthesis of guanosine nucleotide. Members of this family contain a TIM barrel structure. In the inosine monophosphate dehydrogenases 2 CBS domains pfam00571 are inserted in the TIM barrel. This family is a member of the common phosphate binding site TIM barrel family. Probab=98.76 E-value=3.8e-07 Score=70.68 Aligned_cols=187 Identities=25% Similarity=0.388 Sum_probs=99.8 Q ss_pred CEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCC----CCCCCH Q ss_conf 8152878994654303456566760504167866886027887235450014344672440026786325----730358 Q gi|254780889|r 97 GMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITV----KKTVNL 172 (493) Q Consensus 97 gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv----~~~~~l 172 (493) -+..+++|..++.+..||.++|.++++..+||||++ ++|.|++|.+|+......-.+-.+-- ..+.+. ..+..+ T Consensus 148 ~~~~~lvt~~~~~~~~ea~~~l~~~kieklpvvd~~-g~L~glit~kDi~k~~~~P~a~~D~~-grL~VgAAVG~~~~~~ 225 (467) T pfam00478 148 TMTEDLVTAPEGITLEEANEILHEHKIEKLPIVDDD-GELVGLITRKDIEKARDYPNASKDAQ-GRLLVGAAVGTRDDDL 225 (467) T ss_pred CCCCCEEEECCCCCHHHHHHHHHHHHHHHCCEECCC-CCEEEEEEHHHHHHHHCCCCCCCCCC-CCEEEEEEECCCHHHH T ss_conf 135542772488998999999997556415444678-83788887434677420785222656-7779999806786599 Q ss_pred HHHHHHHHHCCCCEEEEEC-CCCCEEEEEEECCCHHC--CCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEE Q ss_conf 9999999865533064574-79978688980210001--01742232656876555665137015678999873136669 Q gi|254780889|r 173 ENAKALLHQHRIEKLLVVD-DDGCCIGLITVKDIERS--QLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLV 249 (493) Q Consensus 173 ~eA~~iM~~~ki~~LPVVD-e~g~LvGIIT~~DIlk~--~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDvi 249 (493) +.+ +.|.+.+.+.+ ||| .+|+-.+++ |.+|. ..||+- . .++.-|. +.|-++.|++||||.+ T Consensus 226 eRa-~~Lv~aGvDvi-vIDtAhGhs~~vi---~~ik~ik~~~p~~------~-iIaGNVa----T~e~a~~Li~aGAD~v 289 (467) T pfam00478 226 ERA-EALVEAGVDVI-VIDSAHGHSEYVL---EMIKWIKKKYPDL------D-VIAGNVV----TAEAARELIDAGADAV 289 (467) T ss_pred HHH-HHHHHCCCCEE-EEECCCCCCHHHH---HHHHHHHHCCCCC------C-EEEEEEC----CHHHHHHHHHHCCCEE T ss_conf 999-99987699889-9734454418899---9999987407877------3-7851005----8999999997077757 Q ss_pred EEECCCC----------HHHHHHHHHHHH---HHHCCCCCEEC-CCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 9824841----------138899999999---98579872451-46469999999985487589961 Q gi|254780889|r 250 VVDTAHG----------HSQKVLDAVVQI---KKNFPSLLVMA-GNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 250 viD~ahG----------h~~~~~~~i~~~---k~~~~~~~ii~-GNv~t~~~~~~l~~~Gad~ikvG 302 (493) -|=+--| .-.-++.++-+. .+.+ .++||| |.+-++--.-.-+-+|||+|=+| T Consensus 290 KVGiGpGSiCTTR~v~GvG~PQ~tAv~~~a~~a~~~-~vpiIADGGi~~sGDi~KAlaaGAd~VMlG 355 (467) T pfam00478 290 KVGIGPGSICTTREVAGVGRPQLTAVYEVADAARKL-GVPVIADGGIRYSGDIAKALAAGASAVMLG 355 (467) T ss_pred EECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHC-CCEEEECCCCCCHHHHHHHHHCCCCEEEEC T ss_conf 755668865656420366775087999999998656-987994476233048999987289889877 No 129 >cd04740 DHOD_1B_like Dihydroorotate dehydrogenase (DHOD) class 1B FMN-binding domain. DHOD catalyzes the oxidation of (S)-dihydroorotate to orotate. This is the fourth step and the only redox reaction in the de novo biosynthesis of UMP, the precursor of all pyrimidine nucleotides. DHOD requires FMN as co-factor. DHOD divides into class 1 and class 2 based on their amino acid sequences and cellular location. Members of class 1 are cytosolic enzymes and multimers while class 2 enzymes are membrane associated and monomeric. The class 1 enzymes can be further divided into subtypes 1A and 1B which are homodimers and heterotetrameric proteins, respectively. Probab=98.72 E-value=6.7e-07 Score=68.92 Aligned_cols=169 Identities=24% Similarity=0.306 Sum_probs=109.2 Q ss_pred CHHHHHHHHHHHCCCCEEEEECC--C----CH-----HHHHHHHHHHHHHHCCCCCEEC---CCCCC-HHHHHHHHHCCC Q ss_conf 01567899987313666998248--4----11-----3889999999998579872451---46469-999999985487 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTA--H----GH-----SQKVLDAVVQIKKNFPSLLVMA---GNIAT-AEGALALIDAGA 296 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~a--h----Gh-----~~~~~~~i~~~k~~~~~~~ii~---GNv~t-~~~~~~l~~~Ga 296 (493) +|+.+-++.+-++|+|++.+.++ | |. .+...+.++++|+.. ++||++ .|+.. .+-++.+.++|+ T Consensus 102 ~d~~~~~~~~~~~gad~ielNiScPNt~~~g~~~~~~~~~~~~i~~~vk~~~-~~Pi~vKlsP~~~~i~~ia~~~~~~g~ 180 (296) T cd04740 102 EEFVEVAEKLADAGADAIELNISCPNVKGGGMAFGTDPEAVAEIVKAVKKAT-DVPVIVKLTPNVTDIVEIARAAEEAGA 180 (296) T ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHCCCHHHHHHHHHHHHHCC-CCCEEEECCCCCCHHHHHHHHHHHCCC T ss_conf 8999999998864898899978899867636775749999999999998604-896699718980009999999997699 Q ss_pred CEEEEEECCCC--CC--CCCHHCCCCCCCCH-------HHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEEC Q ss_conf 58996102787--66--53100015743100-------378899986226981997265437776213367148789932 Q gi|254780889|r 297 DIIKVGIGPGS--IC--TTRVVTGVGCPQLS-------AIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIG 365 (493) Q Consensus 297 d~ikvGig~Gs--~C--tTr~~~g~g~pq~~-------av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G 365 (493) |+|-+-=--.+ +- |-+.+.+-+.+-+| ++..+++..+..++|||+-|||.+..|+.+.|.||||+|+++ T Consensus 181 dgiv~~NT~~~~~id~~~~~p~l~~~~GGlSG~~l~~~al~~v~~~~~~~~ipIig~GGI~s~~da~e~i~aGAs~VQi~ 260 (296) T cd04740 181 DGLTLINTLKGMAIDIETRKPILGNVTGGLSGPAIKPIALRMVYQVYKAVEIPIIGVGGIASGEDALEFLMAGASAVQVG 260 (296) T ss_pred CEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHCCCCHHHHH T ss_conf 88999746787663644467552455787686778899999999998545888797579799999999998399888723 Q ss_pred CHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHHH Q ss_conf 30101216776448736805888733200688752212221012333433100454026621057589989999999998 Q gi|254780889|r 366 SLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLKS 445 (493) Q Consensus 366 ~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glrs 445 (493) +.+ |+|. .++.++..+|.. T Consensus 261 Tai--------------------~~Gp-----------------------------------------~~i~~i~~~L~~ 279 (296) T cd04740 261 TAN--------------------FVDP-----------------------------------------EAFKEIIEGLEA 279 (296) T ss_pred HHH--------------------HCCH-----------------------------------------HHHHHHHHHHHH T ss_conf 667--------------------4292-----------------------------------------799999999999 Q ss_pred HHHHCCCCCHHHHHHCC Q ss_conf 55515758776897375 Q gi|254780889|r 446 SMGYVGASNIEEFQKKA 462 (493) Q Consensus 446 ~~~y~G~~~i~e~~~~~ 462 (493) -|.--|..+|.|++..| T Consensus 280 ~l~~~G~~si~e~~G~a 296 (296) T cd04740 280 YLDEEGIKSIEELVGLA 296 (296) T ss_pred HHHHCCCCCHHHHCCCC T ss_conf 99983999899950359 No 130 >KOG0475 consensus Probab=98.70 E-value=6.7e-08 Score=76.10 Aligned_cols=171 Identities=25% Similarity=0.340 Sum_probs=121.8 Q ss_pred EEEEECCCCEECCCEEEECCC------CCCCHHHHHHHHHCCCEEEEC-CCCCHHHHHHHHHHHHHHCCCEECCCEEEEC Q ss_conf 334303881600317833785------236888999999779879997-8989899999999874223881528789946 Q gi|254780889|r 35 ISTRIAKDFTLNLPIMSAAMD------QVTDSRLAIAMAQAGGLGVIH-RNFSPSEQVAQVHQVKKFESGMVVNPVTISP 107 (493) Q Consensus 35 l~t~lt~~i~l~iPivSs~MD------TVTe~~MAiamA~~GGiGvIH-rn~sie~qa~~V~kVKr~e~gmI~dPVti~p 107 (493) +--.||-.+..-+|++.+.|- +......+.++-++-|.-.+. ++...+.-+..+-+..|+++- -+++.. T Consensus 488 ImFELTG~l~~IlPLm~av~~SkwI~d~~gk~gIyda~I~~ng~P~l~~k~e~~~t~~~~v~~p~~~~~~----L~~i~~ 563 (696) T KOG0475 488 IMFELTGALNYILPLMLAVMISKWIGDGLGKTGIYDAHIELNGYPFLDSKSEFSSTLAIPVMEPCRSESC----LIVITQ 563 (696) T ss_pred EEEECCCCCEEHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHCHHCCCHH----HEECCC T ss_conf 8654047600079899999999988732120027888888619777653221122033443131017622----212055 Q ss_pred C-CCHHHHHHHHHHHCCCCEEEE-CCCCEEEEEEEEHHHHC-------------------CCCC---------CCCCCCC Q ss_conf 5-430345656676050416786-68860278872354500-------------------1434---------4672440 Q gi|254780889|r 108 Y-ATLADALALMKKYSISGIPVV-ESDVGKLVGILTNRDVR-------------------FASN---------AQQAVGE 157 (493) Q Consensus 108 d-~TI~ea~~lm~~~~is~iPVV-D~~~gkLiGIVT~rDir-------------------~~~~---------~~~~V~e 157 (493) + .|+.|...+|++..++++||| ++++++++|++..||+. |.++ ..-..++ T Consensus 564 ~s~tl~~le~~~~~t~~sgfpvvl~~~sq~lvGfv~rr~l~~~i~~ar~~q~~~~~~~~~f~~~~~~~~~~~~~~~~lk~ 643 (696) T KOG0475 564 DSMTLEDLESLMEDTDFSGFPVVLSEDSQRLVGFVLRRNLFLAILNARKIQSFIVTTSIYFNDPSPSAVAGIPSRLDLKD 643 (696) T ss_pred CCEEHHHHHHHHHHCCCCCCEEEECCCCCEEEEEECHHHHHHHHHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCEE T ss_conf 65168999888862442785279715642267887157888988653005654131113367887434589998757632 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCC Q ss_conf 02678632573035899999998655330645747997868898021000101 Q gi|254780889|r 158 LMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 158 iMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) +|.-.++|+...++.+.+.+++.+-.++... |+++|+|+|+||.+|++++.. T Consensus 644 il~~tp~tv~d~tp~~~v~~~F~~lg~~~~~-v~~~G~l~Giitkkd~l~~~r 695 (696) T KOG0475 644 ILDMTPFTVTDLTPMETVVDLFRKLGLRQIL-VTKNGILLGIITKKDCLRHTR 695 (696) T ss_pred ECCCCCCCCCCCCCHHHHHHHHHHHCCEEEE-ECCCCEEEEEEEHHHHHHHHC T ss_conf 0468760343458489999999861725899-815982676653489888621 No 131 >PRK07259 dihydroorotate dehydrogenase 1B; Reviewed Probab=98.69 E-value=1e-06 Score=67.66 Aligned_cols=169 Identities=24% Similarity=0.284 Sum_probs=108.2 Q ss_pred CHHHHHHHHHHHC-CCCEEEEECC--CC--------HH-HHHHHHHHHHHHHCCCCCEE---CCCCCC-HHHHHHHHHCC Q ss_conf 0156789998731-3666998248--41--------13-88999999999857987245---146469-99999998548 Q gi|254780889|r 232 KDIADRVGPLFDV-NVDLVVVDTA--HG--------HS-QKVLDAVVQIKKNFPSLLVM---AGNIAT-AEGALALIDAG 295 (493) Q Consensus 232 ~d~~eRa~~Lvea-GvDviviD~a--hG--------h~-~~~~~~i~~~k~~~~~~~ii---~GNv~t-~~~~~~l~~~G 295 (493) +|+.+-++.+.++ |+|++.+.++ |. .. +...+.++.+|+.. ++||+ ..|+.. .+-+..+.++| T Consensus 104 ~d~~~~~~~l~~~~~ad~ielNiScPn~~~~g~~~~~~~~~l~~i~~~v~~~~-~~Pv~vKlsP~~~~i~~ia~~~~~~g 182 (301) T PRK07259 104 EEYVEVAEKLSKAPNVDAIELNISCPNVKHGGMAFGTDPELAYEVVKAVKSVS-KVPVIVKLTPNVTDIVEIAKAAEEAG 182 (301) T ss_pred HHHHHHHHHHHCCCCCCEEEEECCCCCCCCHHHHCCCCHHHHHHHHHHHHHHC-CCCEEEEECCCHHHHHHHHHHHHHCC T ss_conf 89999999864556888899965478888526660879999999999998734-89779980787121999999999759 Q ss_pred CCEEEEEEC-CC---CCCCCCHHCCCCCCCCH-------HHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEE Q ss_conf 758996102-78---76653100015743100-------37889998622698199726543777621336714878993 Q gi|254780889|r 296 ADIIKVGIG-PG---SICTTRVVTGVGCPQLS-------AIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMI 364 (493) Q Consensus 296 ad~ikvGig-~G---s~CtTr~~~g~g~pq~~-------av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~ 364 (493) +|+|-+.=- +| .+.|.+.+.+-+.+-+| ++..+++.++..++|||+-|||.+.-|+.+.|.|||++|++ T Consensus 183 adgvv~~Nt~~~~~id~~~~~p~~~~~~GGlSG~~l~~~al~~v~~~~~~~~ipIig~GGI~s~~da~e~i~aGAs~VQv 262 (301) T PRK07259 183 ADGLSLINTLKGMAIDIKTRKPILANVTGGLSGPAIKPIALRMVYQVAKAVDIPIIGMGGISTAEDAIEFMMAGASAVQV 262 (301) T ss_pred CCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHCCCCHHHH T ss_conf 98899956776765323567743357888634733518999999999851698889767979999999999839879872 Q ss_pred CCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHHHH Q ss_conf 23010121677644873680588873320068875221222101233343310045402662105758998999999999 Q gi|254780889|r 365 GSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGGLK 444 (493) Q Consensus 365 G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~glr 444 (493) ++-+ |+|. .++.++..+|. T Consensus 263 ~Tav--------------------~~Gp-----------------------------------------~~~~~i~~~L~ 281 (301) T PRK07259 263 GTAN--------------------FIDP-----------------------------------------YAFPEIIEGLE 281 (301) T ss_pred HHHH--------------------HCCC-----------------------------------------HHHHHHHHHHH T ss_conf 1233--------------------1490-----------------------------------------69999999999 Q ss_pred HHHHHCCCCCHHHHHHCC Q ss_conf 855515758776897375 Q gi|254780889|r 445 SSMGYVGASNIEEFQKKA 462 (493) Q Consensus 445 s~~~y~G~~~i~e~~~~~ 462 (493) .-|.--|.++|.|++.-| T Consensus 282 ~~l~~~G~~si~e~~G~a 299 (301) T PRK07259 282 AYLDEEGIKSIEEIIGIA 299 (301) T ss_pred HHHHHCCCCCHHHHHCHH T ss_conf 999984999899971813 No 132 >TIGR01137 cysta_beta cystathionine beta-synthase; InterPro: IPR005857 This model discriminates cystathionine beta-synthase, a protein found primarily in eukaryotes and carrying a C-terminal CBS domain from cysteine synthases. Cysteine synthase (O-acetylserine (thiol)-lyase) is the enzyme responsible for the formation of cysteine from O-acetyl-serine and hydrogen sulphide with the concomitant release of acetic acid - the function of many of these enzymes is unproven.; GO: 0004122 cystathionine beta-synthase activity, 0019343 cysteine biosynthetic process via cystathionine, 0005737 cytoplasm. Probab=98.69 E-value=9.8e-08 Score=74.91 Aligned_cols=114 Identities=25% Similarity=0.489 Sum_probs=93.5 Q ss_pred HHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECC-------CCEEEEEEEEHHHHC---C--CCCCCCCCC---- Q ss_conf 223881528789946543034565667605041678668-------860278872354500---1--434467244---- Q gi|254780889|r 93 KFESGMVVNPVTISPYATLADALALMKKYSISGIPVVES-------DVGKLVGILTNRDVR---F--ASNAQQAVG---- 156 (493) Q Consensus 93 r~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~-------~~gkLiGIVT~rDir---~--~~~~~~~V~---- 156 (493) |-+++-+..||+++|+.|+.+|+++++++||+-+|||+. +.++++|.|+.+.+. + -...+.+|+ T Consensus 367 ~v~~l~L~~~v~~~~t~tv~~ai~iLre~GfdQ~PVv~pga~~P~veag~v~G~v~l~~lL~~l~~~~a~~~D~v~Gkyl 446 (527) T TIGR01137 367 RVKDLHLPALVTVHPTETVGDAIEILREYGFDQLPVVKPGAAEPDVEAGKVLGSVTLRELLSALFAKKAKLEDAVSGKYL 446 (527) T ss_pred EEECCCCCCCEEECCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHCCCCCCCCCCCCCH T ss_conf 64156688734546764089999999974873076207788889723534788742677899998424675544343000 Q ss_pred --CCCC---------------CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCC------------EEEEEEECCCHH Q ss_conf --0026---------------78632573035899999998655330645747997------------868898021000 Q gi|254780889|r 157 --ELMT---------------RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGC------------CIGLITVKDIER 207 (493) Q Consensus 157 --eiMt---------------~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~------------LvGIIT~~DIlk 207 (493) ..|. +++.-+..++.+.++.+.|..+ +.+.|+++... ++||||+-|+|. T Consensus 447 dfk~~~~Fndvssynenk~~~~~~~~i~~~~~l~~L~~~le~~--~~a~V~~e~~pyhstGkssqrq~~~gvVT~~DLL~ 524 (527) T TIGR01137 447 DFKVMSRFNDVSSYNENKSKKKKFIQIGEGEKLADLSKFLEKN--SSAIVVEEGKPYHSTGKSSQRQMVIGVVTKIDLLS 524 (527) T ss_pred HHHHHHHHCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHCCC--CCEEEEECCCCCCCCCCCCCCEEEEEEEEHHHHHH T ss_conf 3444532101000101024676765446775189999985169--61899707823212565424327898644122443 Q ss_pred C Q ss_conf 1 Q gi|254780889|r 208 S 208 (493) Q Consensus 208 ~ 208 (493) - T Consensus 525 f 525 (527) T TIGR01137 525 F 525 (527) T ss_pred H T ss_conf 1 No 133 >cd04722 TIM_phosphate_binding TIM barrel proteins share a structurally conserved phosphate binding motif and in general share an eight beta/alpha closed barrel structure. Specific for this family is the conserved phosphate binding site at the edges of strands 7 and 8. The phosphate comes either from the substrate, as in the case of inosine monophosphate dehydrogenase (IMPDH), or from ribulose-5-phosphate 3-epimerase (RPE) or from cofactors, like FMN. Probab=98.69 E-value=3.6e-07 Score=70.87 Aligned_cols=123 Identities=25% Similarity=0.288 Sum_probs=88.0 Q ss_pred HHHHHHHCCCCEEEEECCCCHH-HHHHHHHHHHHHHCCCCCEECCCCCCHH-HHHHHHHCCCCEEEEEECCCCCCCCCHH Q ss_conf 8999873136669982484113-8899999999985798724514646999-9999985487589961027876653100 Q gi|254780889|r 237 RVGPLFDVNVDLVVVDTAHGHS-QKVLDAVVQIKKNFPSLLVMAGNIATAE-GALALIDAGADIIKVGIGPGSICTTRVV 314 (493) Q Consensus 237 Ra~~LveaGvDviviD~ahGh~-~~~~~~i~~~k~~~~~~~ii~GNv~t~~-~~~~l~~~Gad~ikvGig~Gs~CtTr~~ 314 (493) .+..+.++|+|.+.+..++++. ....+.++++|+.++++.++.--..+.+ .+..+.+.|+|.+-+... .|++... T Consensus 76 ~~~~~~~~g~d~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~vi~~~~~~~~~~~~~a~~~g~~~v~~~~~---~~~~~~~ 152 (200) T cd04722 76 AAAAARAAGADGVEIHGAVGYLAREDLELIRELREAVPDVKVVVKLSPTGELAAAAAEEAGVDEVGLGNG---GGGGGGR 152 (200) T ss_pred HHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCEEEECCC---CCCCCCC T ss_conf 9999998399989978999654300689999999844896499968999999999999809979997087---4678887 Q ss_pred CCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECC Q ss_conf 0157431003788999862269819972654377762133671487899323 Q gi|254780889|r 315 TGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGS 366 (493) Q Consensus 315 ~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~ 366 (493) ... .+ ............++|||++|||.++-|+.+|+++|||.|-+|| T Consensus 153 ~~~-~~---~~~~~~~~~~~~~ipvi~~gGi~~~~~~~~~~~~gAdgv~vGs 200 (200) T cd04722 153 DAV-PI---ADLLLILAKRGSKVPVIAGGGINDPEDAAEALALGADGVIVGS 200 (200) T ss_pred CCC-CH---HHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHCCCCEEEECC T ss_conf 666-11---6899999998579998997587999999999985998898188 No 134 >TIGR02708 L_lactate_ox L-lactate oxidase; InterPro: IPR014080 Members of this entry oxidize L-lactate to pyruvate, reducing molecular oxygen to hydrogen peroxide. The enzyme is known in Aerococcus viridans, Streptococcus iniae, and some strains of Streptococcus pyogenes where it appears to contribute to virulence.. Probab=98.67 E-value=1.7e-07 Score=73.29 Aligned_cols=129 Identities=30% Similarity=0.404 Sum_probs=103.5 Q ss_pred HHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCH-------HHHHHHHHHHCCCC Q ss_conf 99999857987245146469999999985487589961027876653100015743100-------37889998622698 Q gi|254780889|r 265 VVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLS-------AIMSVVEVAERAGV 337 (493) Q Consensus 265 i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~-------av~~~~~~~~~~~~ 337 (493) |+.| ..|..+||-+=.+-++|-+..-++|||-+|= ||..|-.|+. .+.++|+...++ | T Consensus 221 iE~I-A~ySGLPVyVKG~Q~~ED~~~al~AGASGIW-------------V~NHG~RQl~~~PaaFD~L~~vAE~V~~r-V 285 (368) T TIGR02708 221 IEEI-AAYSGLPVYVKGPQCPEDADRALKAGASGIW-------------VTNHGGRQLDGGPAAFDSLQEVAEAVDKR-V 285 (368) T ss_pred HHHH-HHHCCCCEEEECCCCHHHHHHHHHCCCCEEE-------------EECCCCCCCCCCCCCCHHHHHHHHHHCCC-C T ss_conf 8999-7217983686078886689999972886257-------------60477502367875200069999985285-5 Q ss_pred EEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHHHHCCCCCCCCCCE Q ss_conf 19972654377762133671487899323010121677644873680588873320068875221222101233343310 Q gi|254780889|r 338 AIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSARYSQDGVTDVLKL 417 (493) Q Consensus 338 ~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ry~~~~~~~~~k~ 417 (493) ||+=|-|||-.-||.||||-|||.|-||... .+|.|= |.+ T Consensus 286 PIVFDSGvRRG~Hv~KALASGAD~VAlGRPv--------------------~yGLAl------GG~-------------- 325 (368) T TIGR02708 286 PIVFDSGVRRGQHVFKALASGADLVALGRPV--------------------IYGLAL------GGS-------------- 325 (368) T ss_pred CEEEECCCCCHHHHHHHHHCCCCEEECCCHH--------------------HHHHHH------CCH-------------- T ss_conf 6685088432578999872356443013235--------------------666550------102-------------- Q ss_pred ECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 0454026621057589989999999998555157587768973 Q gi|254780889|r 418 VPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQK 460 (493) Q Consensus 418 v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~ 460 (493) - -+..++.+|..-|+.-|+-+|..||++++. T Consensus 326 -----~-------G~~~V~~~l~~~L~~VMQL~G~Q~i~D~K~ 356 (368) T TIGR02708 326 -----V-------GARQVLEYLNKELKRVMQLTGTQTIEDVKG 356 (368) T ss_pred -----H-------HHHHHHHHHHHHHHHHHHHCCCCCHHHHHC T ss_conf -----2-------189999999988777764138751565321 No 135 >cd04641 CBS_pair_28 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.66 E-value=8.2e-08 Score=75.47 Aligned_cols=49 Identities=20% Similarity=0.365 Sum_probs=45.7 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 7863257303589999999865533064574799786889802100010 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) ++++|+++++++.||+++|.+++++.+||||++|+|+|++|.+|+++.. T Consensus 1 r~Vitv~pd~~l~da~~lm~~~~is~lPVVD~~g~lvGiis~~Dl~~~~ 49 (120) T cd04641 1 KNIATARPDTPLIDVLDMLVERRVSALPIVDENGKVVDVYSRFDVINLA 49 (120) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHH T ss_conf 9978989989999999999980986699987899698997599999887 No 136 >COG0069 GltB Glutamate synthase domain 2 [Amino acid transport and metabolism] Probab=98.66 E-value=1.3e-07 Score=74.03 Aligned_cols=180 Identities=22% Similarity=0.323 Sum_probs=113.0 Q ss_pred HHHHHHHHHHHHCCC----CCEECCCCCCHHHHHHHHHCCCCEEEE-EECCCCCCCCCH-HCCCCCCCCHHHHHHHHHHH Q ss_conf 899999999985798----724514646999999998548758996-102787665310-00157431003788999862 Q gi|254780889|r 260 KVLDAVVQIKKNFPS----LLVMAGNIATAEGALALIDAGADIIKV-GIGPGSICTTRV-VTGVGCPQLSAIMSVVEVAE 333 (493) Q Consensus 260 ~~~~~i~~~k~~~~~----~~ii~GNv~t~~~~~~l~~~Gad~ikv-Gig~Gs~CtTr~-~~g~g~pq~~av~~~~~~~~ 333 (493) -....|+.||+.-|. +.+++++.+-.-++- ..+++||.|-| |..+|---+-.. .--+|.|-..++.+..+.+. T Consensus 289 DLaqlI~dLk~~~~~~~I~VKlva~~~v~~iaag-vakA~AD~I~IdG~~GGTGAsP~~~~~~~GiP~e~glae~~q~L~ 367 (485) T COG0069 289 DLAQLIKDLKEANPWAKISVKLVAEHGVGTIAAG-VAKAGADVITIDGADGGTGASPLTSIDHAGIPWELGLAETHQTLV 367 (485) T ss_pred HHHHHHHHHHHCCCCCEEEEEEECCCCHHHHHHH-HHHCCCCEEEECCCCCCCCCCCHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 9999999999618897069999256556778766-640469889975899867878565763687079988999999999 Q ss_pred CCC----CEEEEECCCCCHHHHEEEEEECCCEEEECCH--HH-CCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHHHH Q ss_conf 269----8199726543777621336714878993230--10-1216776448736805888733200688752212221 Q gi|254780889|r 334 RAG----VAIVADGGIRFSGDIAKAIAAGSACVMIGSL--LA-GTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSARY 406 (493) Q Consensus 334 ~~~----~~iiadGGi~~~gdi~kAla~GA~~VM~G~~--~a-gt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ry 406 (493) ..+ +.|++|||+++.-|++||+++|||+|=.|.. +| ||--. .. -+-|+ -=+|-+.-.- T Consensus 368 ~~glRd~v~l~~~Ggl~Tg~DVaka~aLGAd~v~~gTa~lia~GCim~--r~-CH~~t---Cp~GIaTqdp--------- 432 (485) T COG0069 368 LNGLRDKVKLIADGGLRTGADVAKAAALGADAVGFGTAALVALGCIMC--RV-CHTGT---CPVGIATQDP--------- 432 (485) T ss_pred HCCCCCEEEEEECCCCCCHHHHHHHHHHCCCHHHHCHHHHHHHHHHHH--HH-CCCCC---CCCEEEECCH--------- T ss_conf 759864169994287067899999997086455202199998533866--44-15899---9750340588--------- Q ss_pred CCCCCCCCCCE-ECCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCEEEEE Q ss_conf 01233343310-04540266210575899899999999985551575877689737599999 Q gi|254780889|r 407 SQDGVTDVLKL-VPEGIEGRVPYKGPIASVLHQMSGGLKSSMGYVGASNIEEFQKKANFIRV 467 (493) Q Consensus 407 ~~~~~~~~~k~-v~eGv~~~v~~~G~~~~~~~~~~~glrs~~~y~G~~~i~e~~~~~~f~~~ 467 (493) .-++++ +....+ -|.+++..+...+|.=|.-+|-.+|.||..+..+.+. T Consensus 433 -----~Lrkrl~~~~~~~-------~v~N~~~~~a~e~rella~lG~~~l~el~g~~d~L~~ 482 (485) T COG0069 433 -----ELRKRLDVEGKPE-------RVINYFTFVAEELRELLAALGKRSLSELIGRTDLLRT 482 (485) T ss_pred -----HHHHHCCCCCCHH-------HHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCHHHHHC T ss_conf -----8876337441089-------9999999999999999998678999997454676640 No 137 >COG3448 CBS-domain-containing membrane protein [Signal transduction mechanisms] Probab=98.62 E-value=5.1e-08 Score=76.97 Aligned_cols=113 Identities=23% Similarity=0.384 Sum_probs=80.6 Q ss_pred CCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCC-CC----C-CCCCCCCEEEE Q ss_conf 672440026786325730358999999986553306457479978688980210001017-42----2-32656876555 Q gi|254780889|r 152 QQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLN-PN----A-TKDSKGRLRVA 225 (493) Q Consensus 152 ~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~-p~----A-~~D~~grL~Vg 225 (493) +..-.+||+++++|+++++++.+|.++|.+|+++.|||+|++.+|+|++|++|++++... |. + .-|.-+-.+-- T Consensus 244 ~LtcadIMSrdVvtv~~~ts~dhA~~ll~~H~ikaLPV~d~~~rl~GiVt~~dl~~~a~~~p~qrlr~~~~~~vk~imt~ 323 (382) T COG3448 244 ELTCADIMSRDVVTVSTDTSIDHARKLLQEHRIKALPVLDEHRRLVGIVTQRDLLKHARPSPFQRLRFLRPPTVKGIMTT 323 (382) T ss_pred CCCHHHHCCCCCEECCCCCCHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEHHHHHHCCCCCHHHHHHCCCCCCCCCCCCC T ss_conf 43088863745350177677579999999768650453266661003244887760168566777531599863443457 Q ss_pred EEEECCCH--HHHHHHHHHHCCCCEEEEECCCCHHHHHHHH Q ss_conf 66513701--5678999873136669982484113889999 Q gi|254780889|r 226 AAVSVAKD--IADRVGPLFDVNVDLVVVDTAHGHSQKVLDA 264 (493) Q Consensus 226 AAIg~~~d--~~eRa~~LveaGvDviviD~ahGh~~~~~~~ 264 (493) .++-+..| ..|-+..|.+.|--.+.|=-++|+....+.. T Consensus 324 ~v~tv~pdtpa~~lvp~lad~g~H~lpvld~~g~lvGIvsQ 364 (382) T COG3448 324 PVVTVRPDTPAVELVPRLADEGLHALPVLDAAGKLVGIVSQ 364 (382) T ss_pred CCEEECCCCCHHHHHHHHHCCCCCEEEEECCCCCEEEEEEH T ss_conf 63353589708888887613773136688688838888627 No 138 >pfam01180 DHO_dh Dihydroorotate dehydrogenase. Probab=98.62 E-value=5.5e-06 Score=62.34 Aligned_cols=135 Identities=21% Similarity=0.231 Sum_probs=81.6 Q ss_pred CHHHHHHHHHHHCCCCEEEEECC--C---C----HHHHHH-HHHHHHHHHCCCCCEEC---CCCCCHHHHH-HHHHCCCC Q ss_conf 01567899987313666998248--4---1----138899-99999998579872451---4646999999-99854875 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTA--H---G----HSQKVL-DAVVQIKKNFPSLLVMA---GNIATAEGAL-ALIDAGAD 297 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~a--h---G----h~~~~~-~~i~~~k~~~~~~~ii~---GNv~t~~~~~-~l~~~Gad 297 (493) +|+.+-++.+ +..+|++.+.++ | | ...... +.++++|+. .++|+++ .|+...+... ....+|+| T Consensus 106 ~d~~~~~~~~-~~~ad~iElNiScPn~~~~~~~~~~~~~~~~i~~~v~~~-~~~Pv~vKlsp~~~~~~~~~~a~~~~~a~ 183 (290) T pfam01180 106 EDYVEVARKI-GPFADYLELNVSCPNTPGLRALQTDPELAAILLKVVKEV-SKVPVLVKLAPDLTDIVIIDIADVAAGED 183 (290) T ss_pred HHHHHHHHHH-HHHCCEEEEEEECCCCCCCHHHCCCHHHHHHHHHHHHHC-CCCCEEEEECCCCCCHHHHHHHHHHCCCC T ss_conf 9999999999-743588999985368876133404298999999998750-47873898389877468999999971837 Q ss_pred EEEEE-------ECCC-CCCCCCHHCCCCCCCCH-------HHHHHHHHHHCC--CCEEEEECCCCCHHHHEEEEEECCC Q ss_conf 89961-------0278-76653100015743100-------378899986226--9819972654377762133671487 Q gi|254780889|r 298 IIKVG-------IGPG-SICTTRVVTGVGCPQLS-------AIMSVVEVAERA--GVAIVADGGIRFSGDIAKAIAAGSA 360 (493) Q Consensus 298 ~ikvG-------ig~G-s~CtTr~~~g~g~pq~~-------av~~~~~~~~~~--~~~iiadGGi~~~gdi~kAla~GA~ 360 (493) ++..= .++. -+.+.+.+.+.+.+-+| ++..+++.++.. ++|||+-|||.+.-|+.+-|.+||| T Consensus 184 gv~gi~~~nt~~~~~~~d~~~~~~~~~~~~GGlSG~~i~~~al~~v~~~~~~~~~~ipIig~GGI~~~~da~e~i~aGA~ 263 (290) T pfam01180 184 GLTGINATNTTVRGMRIDLKTESPILANGTGGLSGPAIKPIALKVIRELYQRVGPEIPIIGVGGIFTGEDALEKILAGAS 263 (290) T ss_pred CEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCHHHHHHHHHCCCC T ss_conf 76899965873465555555666312567888576067899999999999970899749998894999999999983997 Q ss_pred EEEECCHH Q ss_conf 89932301 Q gi|254780889|r 361 CVMIGSLL 368 (493) Q Consensus 361 ~VM~G~~~ 368 (493) +|.+++-| T Consensus 264 ~VQv~Tal 271 (290) T pfam01180 264 AVQIGTAL 271 (290) T ss_pred HHHHHHHH T ss_conf 99985899 No 139 >cd04743 NPD_PKS 2-Nitropropane dioxygenase (NPD)-like domain, associated with polyketide synthases (PKS). NPD is part of the nitroalkaneoxidizing enzyme family, that catalyzes oxidative denitrification of nitroalkanes to their corresponding carbonyl compounds and nitrites. NDPs are members of the NAD(P)H-dependent flavin oxidoreductase family that reduce a range of alternative electron acceptors. Most use FAD/FMN as a cofactor and NAD(P)H as electron donor. Some contain 4Fe-4S cluster to transfer electron from FAD to FMN. Probab=98.60 E-value=1.1e-06 Score=67.33 Aligned_cols=48 Identities=27% Similarity=0.465 Sum_probs=43.0 Q ss_pred CCCEEEECCCCCCC-HHHHHHHHHCCCEEEEC-CCCCHHHHHHHHHHHHH Q ss_conf 03178337852368-88999999779879997-89898999999998742 Q gi|254780889|r 46 NLPIMSAAMDQVTD-SRLAIAMAQAGGLGVIH-RNFSPSEQVAQVHQVKK 93 (493) Q Consensus 46 ~iPivSs~MDTVTe-~~MAiamA~~GGiGvIH-rn~sie~qa~~V~kVKr 93 (493) +-||+-.+|-.|.+ +++|.|.+.-||||+|- -+|+.|+--++++++|. T Consensus 2 rYPIiQGgMa~vsd~a~LAAAVSnAGGLGiIa~~~~~~e~lr~eI~k~r~ 51 (320) T cd04743 2 RYPIVQGPMTRVSDVAEFAVAVAEGGGLPFIALALMRGEQVKALLEETAE 51 (320) T ss_pred CCCEECCCCHHHCCCHHHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHH T ss_conf 88876676114237088999998187477743378998999999999999 No 140 >cd04635 CBS_pair_22 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.60 E-value=1.9e-07 Score=72.84 Aligned_cols=49 Identities=29% Similarity=0.355 Sum_probs=44.2 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 7863257303589999999865533064574799786889802100010 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) ++++|+++++++.+|.++|.+++++.+||||++|+++|+||.+|+++.. T Consensus 1 r~vitv~~d~~l~~a~~~m~~~~i~~lPVVd~~g~lvGiit~~Dll~~~ 49 (122) T cd04635 1 REPVTCTPDDPVSKVWDLMLESGFTGLPVVQKAGELIGIITRRDIIRAG 49 (122) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCCEEEEEEHHHHHHHH T ss_conf 9999968929799999999971994899991898299999989987655 No 141 >cd04627 CBS_pair_14 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.59 E-value=8.5e-08 Score=75.36 Aligned_cols=97 Identities=20% Similarity=0.207 Sum_probs=51.8 Q ss_pred CCCEECCCEEEEC----CCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHHHH Q ss_conf 8816003178337----852368889999997798799978989899999999874223881528789946543034565 Q gi|254780889|r 41 KDFTLNLPIMSAA----MDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQVKKFESGMVVNPVTISPYATLADALA 116 (493) Q Consensus 41 ~~i~l~iPivSs~----MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~ 116 (493) .+++ .+|++-+. -.-||+++.--.++.....--. .+.... +..+....+..+|+|+.|++++.||++ T Consensus 22 ~~i~-rlpVvd~~~g~lvGIiT~~dil~~l~~~~~~~~~-----~~~~~~---~~~~~~~~~~~~viti~~d~~l~~A~~ 92 (123) T cd04627 22 GGIH-RVAVTEEESGEVIGILSQRRLVEFLWENARSFPG-----LDPLYP---IPLRDLTIGTSDVISINGDQPLIDALH 92 (123) T ss_pred CCCC-EEEEEECCCCCEEEEEEHHHHHHHHHHCCCCCCC-----HHHHHC---CCHHHHHCCCCCCEEECCCCCHHHHHH T ss_conf 9977-7999958999499999979999999863765661-----445411---200233237677179899995999999 Q ss_pred HHHHHCCCCEEEECCCCEEEEEEEEHHHHCC Q ss_conf 6676050416786688602788723545001 Q gi|254780889|r 117 LMKKYSISGIPVVESDVGKLVGILTNRDVRF 147 (493) Q Consensus 117 lm~~~~is~iPVVD~~~gkLiGIVT~rDir~ 147 (493) +|.+++++++||||++ |+|+|++|.+|+++ T Consensus 93 lM~~~~i~~lpVVD~~-g~lvGiiS~~Dlr~ 122 (123) T cd04627 93 LMHNEGISSVAVVDNQ-GNLIGNISVTDVRL 122 (123) T ss_pred HHHHHCCCEEEEECCC-CCEEEEEEHHHEEC T ss_conf 9998098878698599-96999989254221 No 142 >cd04621 CBS_pair_8 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=98.59 E-value=2.2e-07 Score=72.40 Aligned_cols=51 Identities=24% Similarity=0.460 Sum_probs=43.8 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCC Q ss_conf 786325730358999999986553306457479978688980210001017 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLN 211 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~ 211 (493) ++++|+++++++.+|.++|.+++++.+||||++|+++|+||.+|+++.... T Consensus 1 kdVitv~p~~tl~ea~~~m~~~~i~~lpVvd~~g~lvGivT~~Dl~~~~~~ 51 (135) T cd04621 1 EDIATVHPEHSLLHVVDEMEKNGVGRVIVVDDNGKPVGVITYRDLAFAEFE 51 (135) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHHHC T ss_conf 986898994979999999987499779999599939999987998866631 No 143 >cd04643 CBS_pair_30 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.59 E-value=1.9e-07 Score=72.89 Aligned_cols=50 Identities=26% Similarity=0.373 Sum_probs=46.5 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCC Q ss_conf 78632573035899999998655330645747997868898021000101 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) ++++|+.+++++.+|+++|.+++++.+||||++|+|+|+||.+|+++... T Consensus 1 k~Vvtv~~~~tl~~A~~~m~~~~i~~lPVvd~~gklvGiit~~Di~~~~~ 50 (116) T cd04643 1 EEVAYVQDTNTLRHALLVLTKHGYSAIPVLDKEGKYVGTISLTDILWKLK 50 (116) T ss_pred CCEEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHHH T ss_conf 97699999492999999999749987989869994999988999999874 No 144 >pfam01645 Glu_synthase Conserved region in glutamate synthase. This family represents a region of the glutamate synthase protein. This region is expressed as a separate subunit in the glutamate synthase alpha subunit from archaebacteria, or part of a large multidomain enzyme in other organisms. The aligned region of these proteins contains a putative FMN binding site and Fe-S cluster. Probab=98.59 E-value=1.2e-07 Score=74.34 Aligned_cols=207 Identities=22% Similarity=0.256 Sum_probs=118.0 Q ss_pred CCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCC-H-HHHHHHHHHHHHHHCCCC----CEECCCCCCHHHH Q ss_conf 326568765556651370156789998731366699824841-1-388999999999857987----2451464699999 Q gi|254780889|r 215 TKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHG-H-SQKVLDAVVQIKKNFPSL----LVMAGNIATAEGA 288 (493) Q Consensus 215 ~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahG-h-~~~~~~~i~~~k~~~~~~----~ii~GNv~t~~~~ 288 (493) .|+..|-.+-|.-|. + . -....-++.|+|++-----|. + -+-..+.|.+||+..+.. .+.+|+..... + T Consensus 145 AKPG~GG~Lpg~KVt--~-e-IA~~R~~~~G~d~iSP~~h~di~s~edL~~~I~~Lr~~~~~~PVgvKl~~~~~~~~i-a 219 (367) T pfam01645 145 AKPGEGGHLPGEKVS--P-E-IARIRGSPPGVGLISPPPHHDIYSIEDLAQLIYDLKEINPKAPISVKLVSGHGVGTI-A 219 (367) T ss_pred CCCCCCCCCCHHHCC--H-H-HHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHH-H T ss_conf 788978837735448--9-9-996808999987678633478899999999999998417899459998147768999-9 Q ss_pred HHHHHCCCCEEEEEE-CCCCCCCCC-HHCCCCCCCCHHHHHHHHHHHCCC----CEEEEECCCCCHHHHEEEEEECCCEE Q ss_conf 999854875899610-278766531-000157431003788999862269----81997265437776213367148789 Q gi|254780889|r 289 LALIDAGADIIKVGI-GPGSICTTR-VVTGVGCPQLSAIMSVVEVAERAG----VAIVADGGIRFSGDIAKAIAAGSACV 362 (493) Q Consensus 289 ~~l~~~Gad~ikvGi-g~Gs~CtTr-~~~g~g~pq~~av~~~~~~~~~~~----~~iiadGGi~~~gdi~kAla~GA~~V 362 (493) .-+.++|+|.|-|-= .+|..-... ..--+|.|...++.++.++..+.+ |.+||+||++++.|++||+++|||+| T Consensus 220 ~~~aka~~D~I~IdG~eGGTGAaP~~~~d~~GlP~~~~L~~~~~~L~~~glR~~V~liasGgl~t~~Dv~ka~aLGAD~v 299 (367) T pfam01645 220 AGVAKAGADIILIDGHDGGTGASPKTSIKHAGLPWELALAEVHQTLVENGLRDRVSLIADGGLRTGADVAKAAALGADAV 299 (367) T ss_pred HHHHCCCCCEEEECCCCCCCCCCCHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHHHHHHHHCCCHH T ss_conf 98753678889971789867755488997442469999999999998706757649997699788899999998565354 Q ss_pred EECC--HHH-CCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHH Q ss_conf 9323--010-1216776448736805888733200688752212221012333433100454026621057589989999 Q gi|254780889|r 363 MIGS--LLA-GTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQM 439 (493) Q Consensus 363 M~G~--~~a-gt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~ 439 (493) .+|. ||| ||.-+- .-+.| +-=+|.++-.... ..|+. ++++.+ .|.+++..+ T Consensus 300 ~~gt~~m~AlGCi~~~---~Ch~~---~CP~GIaTqdp~l---~~~l~----------~~~~~~-------rv~nf~~~~ 353 (367) T pfam01645 300 YIGTAALIALGCIMCR---VCHTN---TCPVGVATQDPEL---RKRLD----------FEGAPE-------RVVNYFRFL 353 (367) T ss_pred HHHHHHHHHHHCHHHH---HHCCC---CCCCCCCCCCHHH---HHCCC----------CCCHHH-------HHHHHHHHH T ss_conf 2347999986489866---52699---8998422579888---73269----------664799-------999999999 Q ss_pred HHHHHHHHHHCCC Q ss_conf 9999985551575 Q gi|254780889|r 440 SGGLKSSMGYVGA 452 (493) Q Consensus 440 ~~glrs~~~y~G~ 452 (493) ...+|.-+.-+|- T Consensus 354 ~~e~~~i~~a~G~ 366 (367) T pfam01645 354 AEEVRELLAALGF 366 (367) T ss_pred HHHHHHHHHHHCC T ss_conf 9999999998588 No 145 >cd04597 CBS_pair_DRTGG_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a DRTGG domain upstream. The function of the DRTGG domain, named after its conserved residues, is unknown. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=98.57 E-value=6.1e-08 Score=76.38 Aligned_cols=52 Identities=37% Similarity=0.547 Sum_probs=47.4 Q ss_pred HCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC Q ss_conf 23881528789946543034565667605041678668860278872354500 Q gi|254780889|r 94 FESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR 146 (493) Q Consensus 94 ~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir 146 (493) .++.|+++|++++|++|+.+|+++|.+++++++||||++ |+|+||||.+||- T Consensus 60 V~DIm~~~~vti~~~~sl~eA~~lM~e~~i~~LPVVD~~-g~l~GiIT~~DIA 111 (113) T cd04597 60 VRDVINRKPVTARPNDPLREALNLMHEHNIRTLPVVDDD-GTPAGIITLLDLA 111 (113) T ss_pred HHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHH T ss_conf 666313788341999829999999998499878698899-9399998778743 No 146 >cd04632 CBS_pair_19 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.56 E-value=7.6e-08 Score=75.69 Aligned_cols=51 Identities=33% Similarity=0.509 Sum_probs=45.8 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCC Q ss_conf 786325730358999999986553306457479978688980210001017 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLN 211 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~ 211 (493) ++++|+++++++.+|+++|.+++++++||||++|+|+|+||.+|+++.... T Consensus 1 kdvvTv~~~~~l~~a~~lm~~~~i~~lPVvd~~g~lvGiiT~~Dl~~~~~~ 51 (128) T cd04632 1 EDVITVREDDSVGKAINVLREHGISRLPVVDDNGKLTGIVTRHDIVDFVVR 51 (128) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHHHC T ss_conf 998698995929999999998399779999689978999988999998853 No 147 >smart00116 CBS Domain in cystathionine beta-synthase and other proteins. Domain present in all 3 forms of cellular life. Present in two copies in inosine monophosphate dehydrogenase, of which one is disordered in the crystal structure [3]. A number of disease states are associated with CBS-containing proteins including homocystinuria, Becker's and Thomsen disease. Probab=98.56 E-value=7.3e-08 Score=75.85 Aligned_cols=46 Identities=39% Similarity=0.668 Sum_probs=26.2 Q ss_pred CCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 6325730358999999986553306457479978688980210001 Q gi|254780889|r 163 LITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 163 litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) ++|+++++++.+|.++|.+++++++||||++|+|+|++|.+||++. T Consensus 2 ~itv~~~~~v~~a~~~m~~~~i~~lPVVd~~~~lvGiit~~Dil~~ 47 (49) T smart00116 2 VVTVSPDTTLEEALELLREHGIRRLPVVDEEGRLVGIVTRRDIIKA 47 (49) T ss_pred CEEECCCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEEHHHHHHH T ss_conf 8699987929999999998099857699899919999887999986 No 148 >cd04618 CBS_pair_5 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=98.56 E-value=4.6e-07 Score=70.07 Aligned_cols=94 Identities=23% Similarity=0.357 Sum_probs=82.7 Q ss_pred CEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 78994654303456566760504167866886027887235450014344672440026786325730358999999986 Q gi|254780889|r 102 PVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNLENAKALLHQ 181 (493) Q Consensus 102 PVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~ 181 (493) .+.+.-+.++..|...+.++++...|+-|...++..|++|..|+-..-. ++++.++.++.||.+.|.+ T Consensus 3 liV~D~~l~vk~Af~al~~ngi~~aplWds~~~~fvGmLt~~DfI~il~------------lv~i~P~~sL~da~~~l~~ 70 (98) T cd04618 3 LVVFDTKLPVKKAFNALVENGIRSAPLWDSRKQQFVGMLTITDFILILR------------LVSIHPERSLFDAALLLLK 70 (98) T ss_pred EEEEECCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEEHHHHHHHHH------------HEECCCCCHHHHHHHHHHH T ss_conf 9999489758999999998692699987288767888864799998876------------2444876129999999986 Q ss_pred CCCCEEEEECCC-CCEEEEEEECCCHH Q ss_conf 553306457479-97868898021000 Q gi|254780889|r 182 HRIEKLLVVDDD-GCCIGLITVKDIER 207 (493) Q Consensus 182 ~ki~~LPVVDe~-g~LvGIIT~~DIlk 207 (493) ++++++||+|.+ |...-++|-+.+++ T Consensus 71 ~~ihrlPvid~~~~~~~~ilt~~~il~ 97 (98) T cd04618 71 NKIHRLPVIDPSTGTGLYILTSRRILK 97 (98) T ss_pred CCCCCCCCCCCCCCCEEEEEEEEHEEC T ss_conf 787505216479996589996302015 No 149 >cd04613 CBS_pair_SpoIVFB_EriC_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with either the SpoIVFB domain (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) or the chloride channel protein EriC. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A ), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. EriC is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase a Probab=98.55 E-value=4.4e-07 Score=70.27 Aligned_cols=49 Identities=24% Similarity=0.485 Sum_probs=42.7 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 7863257303589999999865533064574799786889802100010 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) |+++|+++++++.+|.++|.+++++.+||||++|+|+|++|.+|+++.. T Consensus 1 rdvvtv~~~~tl~ea~~~m~~~~~~~~pVvd~~~~lvGiit~~Dl~~~l 49 (114) T cd04613 1 RDVVTIPEDTPLNELLDVIAHSPENNFPVVDDDGRLVGIVSLDDIREIL 49 (114) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHH T ss_conf 9976958909899999999861997899992899499999999999999 No 150 >cd04605 CBS_pair_MET2_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the MET2 domain. Met2 is a key enzyme in the biosynthesis of methionine. It encodes a homoserine transacetylase involved in converting homoserine to O-acetyl homoserine. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=98.54 E-value=2.9e-07 Score=71.49 Aligned_cols=51 Identities=29% Similarity=0.551 Sum_probs=46.8 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCC Q ss_conf 678632573035899999998655330645747997868898021000101 Q gi|254780889|r 160 TRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 160 t~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) |++++|++++.++.+|.++|.+++++.+||+|++|+++|++|.+|+.+... T Consensus 1 tr~vvtv~~~~tv~~a~~~m~~~~~~~lpVvd~~~~~vGiit~~Di~~~~~ 51 (110) T cd04605 1 SRPVVTISEDASIKEAAKLMIEENINHLPVVDEDGRLVGIVTSWDISKAVA 51 (110) T ss_pred CCCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEECHHHHHHHH T ss_conf 989989899399999999999729988999979991999996227888987 No 151 >cd04619 CBS_pair_6 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=98.54 E-value=1.2e-07 Score=74.19 Aligned_cols=82 Identities=26% Similarity=0.426 Sum_probs=58.2 Q ss_pred CCEEEE---CCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHCC Q ss_conf 317833---78523688899999977987999789898999999998742238815287899465430345656676050 Q gi|254780889|r 47 LPIMSA---AMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQVKKFESGMVVNPVTISPYATLADALALMKKYSI 123 (493) Q Consensus 47 iPivSs---~MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~i 123 (493) +|++-. .---+||.+..-++++.++... ..+ .++.|.++|+|+.|++++.+|+++|.++++ T Consensus 27 l~Vvd~~g~lvGIiTe~Di~r~~~~~~~~~~---~~~-------------V~~vMt~~vitv~~~~~i~~a~~~M~~~~i 90 (114) T cd04619 27 VVVCDPHGKLAGVLTKTDVVRQMGRCGGPGC---TAP-------------VENVMTRAVVSCRPGDLLHDVWQVMKQRGL 90 (114) T ss_pred EEEECCCCEEEEEEEHHHHHHHHHHCCCCCC---CCC-------------HHHHCCCCCEEECCCCCHHHHHHHHHHCCC T ss_conf 9999899959999994999999873079866---789-------------999615898899998909999998887698 Q ss_pred CCEEEECCCCEEEEEEEEHHHH Q ss_conf 4167866886027887235450 Q gi|254780889|r 124 SGIPVVESDVGKLVGILTNRDV 145 (493) Q Consensus 124 s~iPVVD~~~gkLiGIVT~rDi 145 (493) +++||+|++ ++++|++|.+|+ T Consensus 91 ~~lPVVD~~-~~~vGiit~~Di 111 (114) T cd04619 91 KNIPVVDEN-ARPLGVLNARDA 111 (114) T ss_pred CEEEEEECC-CEEEEEEEHHHE T ss_conf 588999079-959999993545 No 152 >cd04584 CBS_pair_ACT_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the acetoin utilization proteins in bacteria. Acetoin is a product of fermentative metabolism in many prokaryotic and eukaryotic microorganisms. They produce acetoin as an external carbon storage compound and then later reuse it as a carbon and energy source during their stationary phase and sporulation. In addition these CBS domains are associated with a downstream ACT domain, which is linked to a wide range of metabolic enzymes that are regulated by amino acid concentration. Pairs of ACT domains bind specifically to a particular amino acid leading to regulation of the linked enzyme. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The in Probab=98.53 E-value=3.2e-07 Score=71.18 Aligned_cols=49 Identities=31% Similarity=0.647 Sum_probs=44.4 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 7863257303589999999865533064574799786889802100010 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) |+++|+++++++.+|+++|.+++++.+||||++|+++|++|.+|+++.. T Consensus 1 r~vvtv~~~~sl~~a~~lm~~~~~~~lpVvd~~~~lvGiit~~Di~~~~ 49 (121) T cd04584 1 RDVVTITPTTTIAEALELMREHKIRHLPVVDEEGRLVGIVTDRDLRDAS 49 (121) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHH T ss_conf 9988968929899999999972998899990899399996589987522 No 153 >cd04637 CBS_pair_24 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.53 E-value=1.4e-07 Score=73.74 Aligned_cols=91 Identities=25% Similarity=0.437 Sum_probs=55.8 Q ss_pred CCCEEEE--CCCCCCCHHHHHHHHHCCCEEE-ECCCCCHHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHC Q ss_conf 0317833--7852368889999997798799-978989899999999874223881528789946543034565667605 Q gi|254780889|r 46 NLPIMSA--AMDQVTDSRLAIAMAQAGGLGV-IHRNFSPSEQVAQVHQVKKFESGMVVNPVTISPYATLADALALMKKYS 122 (493) Q Consensus 46 ~iPivSs--~MDTVTe~~MAiamA~~GGiGv-IHrn~sie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~ 122 (493) .+|++-. -.--||+.++.-+++...|..- .+++.. .. -++.++.|..+|++++|++++.+|+++|.+++ T Consensus 26 ~lpVvd~~~lvGivT~~Di~~~~~~~~~~~~~~~~~~~-------~~-~~~v~~iM~~~~itv~~~~~l~~a~~~m~~~~ 97 (122) T cd04637 26 HLLVVEDNELVGVISDRDYLKAISPFLGTAGETEKDLA-------TL-NRRAHQIMTRDPITVSPDTPVDEASKLLLENS 97 (122) T ss_pred EEEEEECCEEEEEEEHHHHHHHHHHCCCCCHHHHCCCH-------HH-CCCHHHHCCCCCEEECCCCCHHHHHHHHHHCC T ss_conf 89999899999998889999875002453102211212-------30-68999946389839999984999999999749 Q ss_pred CCCEEEECCCCEEEEEEEEHHHH Q ss_conf 04167866886027887235450 Q gi|254780889|r 123 ISGIPVVESDVGKLVGILTNRDV 145 (493) Q Consensus 123 is~iPVVD~~~gkLiGIVT~rDi 145 (493) ++++||+|++ ++|+||||.+|+ T Consensus 98 i~~lPVvd~~-~~lvGiIT~~Di 119 (122) T cd04637 98 ISCLPVVDEN-GQLIGIITWKDL 119 (122) T ss_pred CEEEEEECCC-CEEEEEEEHHHH T ss_conf 3089999689-989999996883 No 154 >cd04600 CBS_pair_HPP_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the HPP motif domain. These proteins are integral membrane proteins with four transmembrane spanning helices. The function of these proteins is uncertain, but they are thought to be transporters. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=98.53 E-value=1e-07 Score=74.78 Aligned_cols=50 Identities=42% Similarity=0.644 Sum_probs=47.3 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 67863257303589999999865533064574799786889802100010 Q gi|254780889|r 160 TRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 160 t~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) |++++|+++++++.+|.++|.+++++.+||||++|+++|++|.+|+++.. T Consensus 1 Trdvvtv~~~~tl~~a~~~m~~~~~~~lpVvd~~g~lvGiit~~Dll~~~ 50 (124) T cd04600 1 SRDVVTVTPDTSLEEAWALLRRHRIKALPVVDGDRRLVGIVTQRDLLRHA 50 (124) T ss_pred CCCCEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCEEEEEEHHHHHHHH T ss_conf 98969989939799999999972997899995698088885699999877 No 155 >cd04582 CBS_pair_ABC_OpuCA_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the ABC transporter OpuCA. OpuCA is the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment but the function of the CBS domains in OpuCA remains unknown. In the related ABC transporter, OpuA, the tandem CBS domains have been shown to function as sensors for ionic strength, whereby they control the transport activity through an electronic switching mechanism. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. They are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzi Probab=98.52 E-value=6.3e-07 Score=69.11 Aligned_cols=49 Identities=31% Similarity=0.474 Sum_probs=37.0 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 7863257303589999999865533064574799786889802100010 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) ++|+|+++++++.+|.++|.+++++++||||++|+++|+||..|+.+.. T Consensus 1 t~pvtv~pd~~l~eA~~lm~~~~~~~l~VVd~~g~l~Givt~~Dl~~~~ 49 (106) T cd04582 1 TEPITVRPDDPLSDALGLMDDSDLRALTVVDADGQPLGFVTRREAARAS 49 (106) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHC T ss_conf 9983899989399999999973998799997999789998999987630 No 156 >cd04640 CBS_pair_27 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.51 E-value=3.3e-07 Score=71.11 Aligned_cols=48 Identities=31% Similarity=0.486 Sum_probs=45.1 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 863257303589999999865533064574799786889802100010 Q gi|254780889|r 162 NLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 162 ~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) +|+|+++++++.||+++|.+++++.+||||++++++||||.+|+.... T Consensus 2 ~Pvtv~~dtti~eA~~~M~~~~v~~l~VvD~~~~lvGIiT~~Dl~~~~ 49 (126) T cd04640 2 KPIVIPADTSIDEALELMIKHGVRLLLVVDSDDNFIGVITAVDLLGEE 49 (126) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEECCCCCCCH T ss_conf 896968999599999999983976798987999799999763001215 No 157 >cd04594 CBS_pair_EriC_assoc_archaea This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the EriC CIC-type chloride channels in archaea. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS do Probab=98.51 E-value=4.1e-07 Score=70.45 Aligned_cols=95 Identities=20% Similarity=0.163 Sum_probs=51.3 Q ss_pred CEEEECCCCCCCHHHHHHHHHCCCEEEECCCC-----CHHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHC Q ss_conf 17833785236888999999779879997898-----9899999999874223881528789946543034565667605 Q gi|254780889|r 48 PIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNF-----SPSEQVAQVHQVKKFESGMVVNPVTISPYATLADALALMKKYS 122 (493) Q Consensus 48 PivSs~MDTVTe~~MAiamA~~GGiGvIHrn~-----sie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~ 122 (493) |+--+|=|||.+..--..--..|++-|+..+- |..+-.+ ++-...+..|..++++++|++++.||+++|.+++ T Consensus 3 pVtV~pd~tV~eA~~lM~~~~i~~lPVvd~~klvGIvt~rDi~~--~~~~~v~d~Mt~~v~tv~p~~~l~ea~~lM~~~~ 80 (104) T cd04594 3 DIKVKDYDKVYEAKRIMIENDLLSLPVVDYNKFLGAVYLKDIEN--ATYGDVVDYIVRGIPYVRLTSTAEEAWEVMMKNK 80 (104) T ss_pred CEEECCCCCHHHHHHHHHHHCCCEEEEEECCEEEEEEEHHHHHH--CCCCCHHHEEECCCEEECCCCCHHHHHHHHHHCC T ss_conf 87989999399999999983988788998999999999799652--4699844547639888999897999999999869 Q ss_pred CCCEEEECCCCEEEEEEEEHHHHC Q ss_conf 041678668860278872354500 Q gi|254780889|r 123 ISGIPVVESDVGKLVGILTNRDVR 146 (493) Q Consensus 123 is~iPVVD~~~gkLiGIVT~rDir 146 (493) ++.+||+|+ |+|+||||.+|+. T Consensus 81 i~~lPVvd~--gklvGIIT~~Dil 102 (104) T cd04594 81 TRWCPVVDD--GKFKGIVTLDSIL 102 (104) T ss_pred CCEEEEEEC--CEEEEEEEHHHHH T ss_conf 787899989--9999999968953 No 158 >KOG0474 consensus Probab=98.51 E-value=1.7e-07 Score=73.21 Aligned_cols=171 Identities=22% Similarity=0.291 Sum_probs=116.6 Q ss_pred CEEEEE---EECCCCEECCCEEEECCCCCCCHHH-----HHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHCCCEECCCE Q ss_conf 225334---3038816003178337852368889-----99999779879997898989999999987422388152878 Q gi|254780889|r 32 DIDIST---RIAKDFTLNLPIMSAAMDQVTDSRL-----AIAMAQAGGLGVIHRNFSPSEQVAQVHQVKKFESGMVVNPV 103 (493) Q Consensus 32 ~vdl~t---~lt~~i~l~iPivSs~MDTVTe~~M-----AiamA~~GGiGvIHrn~sie~qa~~V~kVKr~e~gmI~dPV 103 (493) .|+|.. .+| |+.+-.||+-.=|-+=|-.++ =-...++-|+-.++-|-.+.- ++.+ ....|-.+.+ T Consensus 521 TvSL~VIl~E~T-n~~~~lPiMlvLliaK~VGD~FNegiYd~~i~LkgvP~Le~~pe~~m-----r~L~-a~ev~~~pvi 593 (762) T KOG0474 521 TVSLCVILLELT-NNLLLLPIMLVLLIAKTVGDSFNEGIYDIIIQLKGVPFLEWEPEPYM-----RNLT-AGEVMSKPVI 593 (762) T ss_pred EEEEEHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHH-----HHHH-HHHHCCCCEE T ss_conf 844202788763-14666689999999999876641146778522067753357884676-----5311-7664268858 Q ss_pred EEECCCCHHHHHHHHHHHCCCCEEEECCCC----EEEEEEEEHHHH--------------------------------CC Q ss_conf 994654303456566760504167866886----027887235450--------------------------------01 Q gi|254780889|r 104 TISPYATLADALALMKKYSISGIPVVESDV----GKLVGILTNRDV--------------------------------RF 147 (493) Q Consensus 104 ti~pd~TI~ea~~lm~~~~is~iPVVD~~~----gkLiGIVT~rDi--------------------------------r~ 147 (493) ++.+-+.+....++++...|.+|||||+.. ++|.|++-...+ |+ T Consensus 594 ~l~~~ekV~~Iv~vLk~t~HngFPVvd~~~~~~~~~l~GlILRshl~vlL~~~~f~~~~~~~~~~~~~~~~~~~d~a~r~ 673 (762) T KOG0474 594 CLNRVEKVAVIVDVLKSTNHNGFPVVDEPPSNEAGRLHGLILRSHLLVLLKKRVFVEESRSTFDLPVRRKFTFRDFAKRE 673 (762) T ss_pred EEECHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHCCCHHHHHHCC T ss_conf 87322109999999873476887523578876102366888999999998744542057665676043037877764417 Q ss_pred -----------CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf -----------43446724400267863257303589999999865533064574799786889802100010 Q gi|254780889|r 148 -----------ASNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 148 -----------~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) ..+...++..+|...|.|+++++++..++.++..-.+++|.||++.++++|++||+|+.+-+ T Consensus 674 ~~i~dv~lt~~e~~~yvDl~p~~n~sPytV~~~mSl~k~~~lFR~lGLRhLlVv~~~~~~~gilTR~D~~~~~ 746 (762) T KOG0474 674 PSIEDVHLTSEEMEMYVDLHPFMNPSPYTVPETMSLAKAFILFRQLGLRHLLVVPKTNRVVGILTRKDLARYR 746 (762) T ss_pred CCHHHHHCCHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCEEEEEECCCCCEEEEEEHHHHHHHH T ss_conf 8556554166768643055334699996547431057899999973501589851778636677566434677 No 159 >cd04614 CBS_pair_1 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=98.50 E-value=7.7e-07 Score=68.48 Aligned_cols=48 Identities=33% Similarity=0.432 Sum_probs=29.6 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 786325730358999999986553306457479978688980210001 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) ++++|+.+++++.+|.++|.+++++.+||+|++|+|+|++|.+|+.+. T Consensus 1 rdv~tv~~~~~v~~a~~~m~~~~i~~lpVvd~~g~lvGIiT~rDli~~ 48 (96) T cd04614 1 RNVPTVWEETPLPVAVRIMELANVKALPVLDDDGKLSGIITERDLIAK 48 (96) T ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEECHHHCCC T ss_conf 996390896989999999997299879999899949999988888257 No 160 >cd04629 CBS_pair_16 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.49 E-value=6.2e-07 Score=69.16 Aligned_cols=49 Identities=27% Similarity=0.389 Sum_probs=42.7 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 7863257303589999999865533064574799786889802100010 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) |+|+|+++++++.+|.++|.+++++.+||||++|+++|++|.+|+++.. T Consensus 1 r~pvtv~~~~tl~~a~~~m~~~~~~~~pVvd~~~~lvGiit~~Dll~~~ 49 (114) T cd04629 1 RNPVTFTPDMSVTEAVEKLLKSKISGGPVVDDNGNLVGFLSEQDCLKQL 49 (114) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCEEEEEEEHHHHHHHH T ss_conf 9899979929999999999971997899994899299999678999999 No 161 >cd04615 CBS_pair_2 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=98.49 E-value=3.2e-07 Score=71.26 Aligned_cols=47 Identities=26% Similarity=0.221 Sum_probs=27.9 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 86325730358999999986553306457479978688980210001 Q gi|254780889|r 162 NLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 162 ~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +|+|+++++++.+|.+.|.+++++.+||||++|+++|++|..|+++. T Consensus 2 ~P~tv~~d~~l~~A~~~m~~~~~~~~pVvd~~~~lvGivT~~Di~~~ 48 (113) T cd04615 2 KPSCVVLNTDIARAVAEMYTSGSRALPVVDDKKRLVGIITRYDVLSY 48 (113) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHH T ss_conf 59598996999999999997399789999489979999995999999 No 162 >cd04586 CBS_pair_BON_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the BON (bacterial OsmY and nodulation domain) domain. BON is a putative phospholipid-binding domain found in a family of osmotic shock protection proteins. It is also found in some secretins and a group of potential haemolysins. Its likely function is attachment to phospholipid membranes. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=98.48 E-value=1.8e-07 Score=72.98 Aligned_cols=52 Identities=31% Similarity=0.470 Sum_probs=48.4 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCC Q ss_conf 6786325730358999999986553306457479978688980210001017 Q gi|254780889|r 160 TRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLN 211 (493) Q Consensus 160 t~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~ 211 (493) |++++|+++++++.+|.++|.+++++.+||+|++|+++|+||..|+++.... T Consensus 1 T~~vitv~~~~tl~~a~~~m~~~~i~~lpVvd~~~~lvGiit~~Dil~~~~~ 52 (135) T cd04586 1 TTDVVTVSPETSVAEAARLMLDNHISGLPVVDDDGRLVGIVSEGDLLRRAEL 52 (135) T ss_pred CCCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCEEEEEEEHHHHHHHHHC T ss_conf 9999899993939999999997399679999089979999981897644541 No 163 >cd04624 CBS_pair_11 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.48 E-value=9e-07 Score=67.99 Aligned_cols=50 Identities=30% Similarity=0.498 Sum_probs=43.8 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCC Q ss_conf 78632573035899999998655330645747997868898021000101 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) |+++|+++++++.+|.++|.+++++.+||+|++++++|++|.+|+++... T Consensus 1 r~vvtv~~~~tl~ea~~~m~~~~~~~~~Vvd~~~~lvGiiT~~Di~~~~~ 50 (112) T cd04624 1 RPVVTVDPDTSIREAAKLMAEENVGSVVVVDPDERPIGIVTERDIVRAVA 50 (112) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEECCHHHHHHH T ss_conf 99789789395999999998629988999939982999998204588986 No 164 >cd04638 CBS_pair_25 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.47 E-value=4.8e-07 Score=69.95 Aligned_cols=48 Identities=27% Similarity=0.418 Sum_probs=41.5 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 786325730358999999986553306457479978688980210001 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) ++++|+++++++.||.++|.+++++.+||||++|+++|++|.+|+++. T Consensus 1 kdvvtv~~~~tv~ea~~lm~~~~i~~lpVvd~~~~lvGivT~~Dl~~~ 48 (106) T cd04638 1 ENVVYVTLPGTRDDVLELLKEYKVSGVPVVKKSGELVGIITRKDLLRN 48 (106) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCEEEEEEEHHHHHHC T ss_conf 998999994969999999997699989999389999999897896448 No 165 >cd04803 CBS_pair_15 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.47 E-value=7.7e-07 Score=68.48 Aligned_cols=50 Identities=36% Similarity=0.646 Sum_probs=44.3 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCC Q ss_conf 78632573035899999998655330645747997868898021000101 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) ++++|+++++++.+|.++|.+++++++||+|++|+++|++|.+|+++... T Consensus 1 sPvvtv~~~~tl~~a~~~m~~~~i~~lpVvd~~~~lvGiit~~Dil~~~~ 50 (122) T cd04803 1 SPVVTLSEDDSLADAEELMREHRIRHLPVVNEDGKLVGLLTQRDLLRAAL 50 (122) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHHH T ss_conf 99199989699999999999749978999908986888965999999998 No 166 >cd04636 CBS_pair_23 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.46 E-value=9.5e-07 Score=67.82 Aligned_cols=50 Identities=30% Similarity=0.514 Sum_probs=45.2 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCC Q ss_conf 78632573035899999998655330645747997868898021000101 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) ++++|+++++++.+|+++|.+++++.+||||++|+++|++|..|+++... T Consensus 1 rdvvtv~~~~tl~ea~~~m~~~~~~~lpVvd~~~~lvGiit~~Dll~~~~ 50 (132) T cd04636 1 KDVITVKKDDTLRDVVEILLTGKISGVPVVDNEGRVVGIVSEGDLIRKIY 50 (132) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHHH T ss_conf 99879689197999999999729978999959993999988899999986 No 167 >smart00116 CBS Domain in cystathionine beta-synthase and other proteins. Domain present in all 3 forms of cellular life. Present in two copies in inosine monophosphate dehydrogenase, of which one is disordered in the crystal structure [3]. A number of disease states are associated with CBS-containing proteins including homocystinuria, Becker's and Thomsen disease. Probab=98.46 E-value=2.4e-07 Score=72.12 Aligned_cols=45 Identities=47% Similarity=0.808 Sum_probs=42.3 Q ss_pred CCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC Q ss_conf 8789946543034565667605041678668860278872354500 Q gi|254780889|r 101 NPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR 146 (493) Q Consensus 101 dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir 146 (493) ||+|++|++++.||+++|.+++++++||||++ |+|+||+|.+|+. T Consensus 1 ~~itv~~~~~v~~a~~~m~~~~i~~lPVVd~~-~~lvGiit~~Dil 45 (49) T smart00116 1 DVVTVSPDTTLEEALELLREHGIRRLPVVDEE-GRLVGIVTRRDII 45 (49) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CCEEEEEEHHHHH T ss_conf 98699987929999999998099857699899-9199998879999 No 168 >COG4535 CorC Putative Mg2+ and Co2+ transporter CorC [Inorganic ion transport and metabolism] Probab=98.46 E-value=3.9e-07 Score=70.60 Aligned_cols=117 Identities=23% Similarity=0.371 Sum_probs=98.1 Q ss_pred HHHCCCEECC--CEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHH-CCCCCCC--CCCCCCCCCCCCCC Q ss_conf 4223881528--78994654303456566760504167866886027887235450-0143446--72440026786325 Q gi|254780889|r 92 KKFESGMVVN--PVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDV-RFASNAQ--QAVGELMTRNLITV 166 (493) Q Consensus 92 Kr~e~gmI~d--PVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDi-r~~~~~~--~~V~eiMt~~litv 166 (493) .+.++.||.- -+++..++++.+.+..+.+..||+|||+.++...+.||+-.+|+ +|..... ..++++. ++.+.+ T Consensus 67 l~vrDiMIPRSQM~~l~~~~~l~~~l~~iiesaHSRfPVi~edkD~v~GIL~AKDLL~~~~~~~~~F~i~~lL-RPav~V 145 (293) T COG4535 67 LRVRDIMIPRSQMITLKRNQTLDECLDVIIESAHSRFPVISEDKDHVEGILLAKDLLPFMRSDAEPFDIKELL-RPAVVV 145 (293) T ss_pred HHHHHHCCCHHHHEECCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHC-CCCEEC T ss_conf 5476611227782221013799999999998511558711477001344545887887740786546388861-552434 Q ss_pred CCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 7303589999999865533064574799786889802100010 Q gi|254780889|r 167 KKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 167 ~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) |++-.+.-.++-+..++.+.+.|||+.|-+.|++|..||+... T Consensus 146 PESKrvd~lLkeFR~~RnHMAIViDEfGgVsGLVTIEDiLEqI 188 (293) T COG4535 146 PESKRVDRLLKEFRSQRNHMAIVIDEFGGVSGLVTIEDILEQI 188 (293) T ss_pred CCCHHHHHHHHHHHHHCCCEEEEEECCCCEEEEEEHHHHHHHH T ss_conf 6612699999999850472699982257720167799999998 No 169 >cd04617 CBS_pair_4 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=98.46 E-value=2e-07 Score=72.65 Aligned_cols=47 Identities=26% Similarity=0.472 Sum_probs=32.8 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 86325730358999999986553306457479978688980210001 Q gi|254780889|r 162 NLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 162 ~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +|+|+++++++.||.++|.+++++.+||||++|+|+||+|.+|++|. T Consensus 2 pPvtv~~~~si~eA~~~M~~~~ig~l~VVD~~~~LvGIiT~rDl~r~ 48 (118) T cd04617 2 PPVVVRENTSVYDAIVTLFLEDVGSLFVVDEDGDLVGVVSRKDLLKA 48 (118) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEEHHHHHHH T ss_conf 79798996999999999998399879998499959999878999999 No 170 >PRK06857 consensus Probab=98.46 E-value=5.9e-06 Score=62.16 Aligned_cols=131 Identities=23% Similarity=0.383 Sum_probs=97.6 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEE Q ss_conf 58999999986553306457479978688980210001017422326568765556651370156789998731366699 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVV 250 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviv 250 (493) ++++.++.|.++++ +||+. ..| .++..+-+++|+++|..++= T Consensus 1 ~M~~ii~~l~~~~i--ipVir----------~~~--------------------------~~~a~~~~~al~~gGi~~iE 42 (209) T PRK06857 1 TMEQIIEKLRALKV--VPVIA----------IDD--------------------------AEDILPLAKVLAENGLPVAE 42 (209) T ss_pred CHHHHHHHHHHCCE--EEEEE----------CCC--------------------------HHHHHHHHHHHHHCCCCEEE T ss_conf 97999999997997--99997----------599--------------------------99999999999987998899 Q ss_pred EECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHH Q ss_conf 82484113889999999998579872451464699999999854875899610278766531000157431003788999 Q gi|254780889|r 251 VDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVE 330 (493) Q Consensus 251 iD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~ 330 (493) |-. -+...++.|+.+++.+|++.|=+|-|-|.++++..+++||+.+ +.|| + ..+..+ T Consensus 43 iTl---rt~~a~~~I~~l~~~~p~~~vGaGTV~~~e~~~~a~~aGA~Fi---VSP~---~--------------~~~v~~ 99 (209) T PRK06857 43 ITF---RSAAAAEAIRLLREAYPDMLIGAGTVLTPEQVDAAKEAGADFI---VSPG---F--------------NPNTVK 99 (209) T ss_pred EEC---CCCCHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHCCCCEE---ECCC---C--------------CHHHHH T ss_conf 958---9932999999999758994899993767999999998399999---9089---9--------------999999 Q ss_pred HHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEE Q ss_conf 8622698199726543777621336714878993 Q gi|254780889|r 331 VAERAGVAIVADGGIRFSGDIAKAIAAGSACVMI 364 (493) Q Consensus 331 ~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~ 364 (493) .++++++|.|- |+-++.+|.+|+.+|++.|=+ T Consensus 100 ~a~~~~i~~iP--Gv~TpsEi~~A~~~G~~~vKl 131 (209) T PRK06857 100 YCQQLNIPIVP--GVNNPSLVEQALEMGLTTLKF 131 (209) T ss_pred HHHHCCCCEEC--CCCCHHHHHHHHHCCCCEEEE T ss_conf 99974996547--879999999999879998997 No 171 >KOG1764 consensus Probab=98.45 E-value=8e-07 Score=68.35 Aligned_cols=112 Identities=28% Similarity=0.452 Sum_probs=90.5 Q ss_pred ECCC-EEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHH-CCCC----------CCCCCCCCC---CCCCC Q ss_conf 5287-8994654303456566760504167866886027887235450-0143----------446724400---26786 Q gi|254780889|r 99 VVNP-VTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDV-RFAS----------NAQQAVGEL---MTRNL 163 (493) Q Consensus 99 I~dP-Vti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDi-r~~~----------~~~~~V~ei---Mt~~l 163 (493) ...| +.+.|..++.++...+.++++..+||.|.+.+.++.++|.+-+ .+.. -...++.+. +..++ T Consensus 160 ~~~~~~~i~p~~s~l~~~~~l~~~~~~rvpv~d~~~~~v~~ilt~~rIl~~l~~~~~~~~~~~~l~~s~~dl~ig~~~~i 239 (381) T KOG1764 160 LKPPFVSISPESSLLDAVLLLIKSRIHRVPVIDPETGEVLYILTQRRILKFLWLNGRLLPLPSLLSKSLSDLGIGTWSNI 239 (381) T ss_pred CCCCCEEECCCHHHHHHHHHHCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHCCHHHHHHHHHHHH T ss_conf 46774130786889997764235772475233656554211120999999998514564422455188766404778777 Q ss_pred CCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCC Q ss_conf 32573035899999998655330645747997868898021000101 Q gi|254780889|r 164 ITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 164 itv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) ..+.+++++.+|+++|.++++..+||||++|+++|.+++.|+..... T Consensus 240 ~~i~~~~~v~~al~~m~~~~is~lpvV~~~g~~v~~~s~~Dv~~l~~ 286 (381) T KOG1764 240 ASISEDTPVIEALKIMSERRISALPVVDENGKKVGNYSRFDVIHLAR 286 (381) T ss_pred HHHCCCCCHHHHHHHHHHCCCCCCCEECCCCCEECCEEEEHHHHHHH T ss_conf 77517870899999998704574434757785532156200121564 No 172 >cd04642 CBS_pair_29 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.45 E-value=7.2e-07 Score=68.68 Aligned_cols=48 Identities=25% Similarity=0.419 Sum_probs=43.2 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 786325730358999999986553306457479978688980210001 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) .+++|+++++++.||+++|.+++++.+||||++|+|+|++|.+|+... T Consensus 1 s~Vitv~~~~~l~eA~~lm~~~~i~~lPVvd~~g~lvGiis~~Dl~~~ 48 (126) T cd04642 1 SKVVSIDSDERVLDAFKLMRKNNISGLPVVDEKGKLIGNISASDLKGL 48 (126) T ss_pred CCEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCEEEEEEEHHHHHHH T ss_conf 957999995999999999998098789999289909999998997655 No 173 >cd04583 CBS_pair_ABC_OpuCA_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the ABC transporter OpuCA. OpuCA is the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment but the function of the CBS domains in OpuCA remains unknown. In the related ABC transporter, OpuA, the tandem CBS domains have been shown to function as sensors for ionic strength, whereby they control the transport activity through an electronic switching mechanism. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. They are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyz Probab=98.45 E-value=1.3e-06 Score=66.76 Aligned_cols=49 Identities=27% Similarity=0.556 Sum_probs=43.9 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 7863257303589999999865533064574799786889802100010 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) ++++|++++.++.+|+++|.+++++.+||||++|+++|++|.+|+.+.. T Consensus 2 r~pitv~~~~tv~eal~~m~~~~~~~l~Vvd~~~~lvGivt~~di~~~~ 50 (109) T cd04583 2 KNPVTITPDRTLAEAIKLMRDKKVDSLLVVDKDNKLLGIVSLESLEQAY 50 (109) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCEEEEEEEHHHHHHCC T ss_conf 9899979939799999999973997899995899899999810434300 No 174 >cd04593 CBS_pair_EriC_assoc_bac_arch This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in bacteria and archaea. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS d Probab=98.44 E-value=1.3e-06 Score=66.97 Aligned_cols=48 Identities=31% Similarity=0.407 Sum_probs=37.5 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 863257303589999999865533064574799786889802100010 Q gi|254780889|r 162 NLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 162 ~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) +++|+++++++.||.++|.+++++.+||||++|+|+||+|.+|+++.. T Consensus 2 pP~tV~~~~tv~ea~~~m~~~~~~~~pVvD~~g~lvGIvT~~Dl~r~~ 49 (115) T cd04593 2 PPPVLSATTPLREAAEQLIESKHGSALVVDRDGGVVGIITLPDLLRAL 49 (115) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHH T ss_conf 783889919999999999874998999994999889998899999999 No 175 >COG4536 CorB Putative Mg2+ and Co2+ transporter CorB [Inorganic ion transport and metabolism] Probab=98.43 E-value=4.2e-07 Score=70.36 Aligned_cols=112 Identities=19% Similarity=0.297 Sum_probs=92.5 Q ss_pred CEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHH-CCCCCCC-CCCCCCCC--CCCCCCCCCCCHHHHHH Q ss_conf 78994654303456566760504167866886027887235450-0143446-72440026--78632573035899999 Q gi|254780889|r 102 PVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDV-RFASNAQ-QAVGELMT--RNLITVKKTVNLENAKA 177 (493) Q Consensus 102 PVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDi-r~~~~~~-~~V~eiMt--~~litv~~~~~l~eA~~ 177 (493) ...+.-|++..+..+.+....|+++|+..++-...+|++..||+ +.....+ ..-.+++. .+|..+|+++++.+-+. T Consensus 212 i~~id~d~~~e~iv~ql~~s~HtRiplyr~~~DnIiGvlh~r~llr~l~e~~~~~k~d~~~~a~epyFVPe~Tpl~~QL~ 291 (423) T COG4536 212 IIGIDIDDPWEEIVRQLLHSPHTRIPLYRDDLDNIIGVLHVRDLLRLLNEKNEFTKEDILRAADEPYFVPEGTPLSDQLV 291 (423) T ss_pred EEEECCCCCHHHHHHHHHHCCCCCCEEECCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCEECCCCCCHHHHHH T ss_conf 15534899899999999628877400144874675334319999999634586407579987448745689992899999 Q ss_pred HHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCC Q ss_conf 998655330645747997868898021000101742 Q gi|254780889|r 178 LLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPN 213 (493) Q Consensus 178 iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~ 213 (493) .|.++|.+...||||.|.+.|++|..||+.-.-... T Consensus 292 ~F~~~k~hialVVDEYG~i~GLVTLEDIlEEIVGdf 327 (423) T COG4536 292 AFQRNKKHIALVVDEYGDIQGLVTLEDILEEIVGDF 327 (423) T ss_pred HHHHHCCEEEEEEECCCCEEEEEEHHHHHHHHHCCC T ss_conf 999725127999824676776646999999985433 No 176 >COG2524 Predicted transcriptional regulator, contains C-terminal CBS domains [Transcription] Probab=98.43 E-value=9.1e-07 Score=67.96 Aligned_cols=115 Identities=31% Similarity=0.433 Sum_probs=77.0 Q ss_pred EEEEEEEHHHH----------CCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECC Q ss_conf 27887235450----------01434467244002678632573035899999998655330645747997868898021 Q gi|254780889|r 135 KLVGILTNRDV----------RFASNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKD 204 (493) Q Consensus 135 kLiGIVT~rDi----------r~~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~D 204 (493) .+.|-|..+|= +...-+..+|+++|+++++++.+++++.||.++|.+++++.+||+|++ +++|++|.+| T Consensus 144 vv~G~V~g~Dd~~~~ilidi~~m~siPk~~V~~~~s~~~i~v~~d~tl~eaak~f~~~~i~GaPVvd~d-k~vGiit~~d 222 (294) T COG2524 144 VVEGKVIGRDDTANEILIDISKMVSIPKEKVKNLMSKKLITVRPDDTLREAAKLFYEKGIRGAPVVDDD-KIVGIITLSD 222 (294) T ss_pred EEEEEEECCCCCCCEEEEEEEEEEECCCCHHHHHCCCCCEEECCCCCHHHHHHHHHHCCCCCCCEECCC-CEEEEEEHHH T ss_conf 998699624566776999874326457302666266784676688639999999997286678510289-4589998899 Q ss_pred CHHCCCC--CCCCCCCCCCEEEEEEEECCCH--HHHHHHHHHHCCCC-EEEEEC Q ss_conf 0001017--4223265687655566513701--56789998731366-699824 Q gi|254780889|r 205 IERSQLN--PNATKDSKGRLRVAAAVSVAKD--IADRVGPLFDVNVD-LVVVDT 253 (493) Q Consensus 205 Ilk~~~~--p~A~~D~~grL~VgAAIg~~~d--~~eRa~~LveaGvD-viviD~ 253 (493) |.++... +++..++-=|- --+.+++| -.|-.+.+...++- ++|+|+ T Consensus 223 I~~aia~g~~~~kV~~~M~k---~vitI~eDe~i~dAir~M~~~nVGRLlV~ds 273 (294) T COG2524 223 IAKAIANGNLDAKVSDYMRK---NVITINEDEDIYDAIRLMNKNNVGRLLVTDS 273 (294) T ss_pred HHHHHHCCCCCCCHHHHHCC---CCCEECCCHHHHHHHHHHHHCCCCEEEEECC T ss_conf 99999768965438887505---7704757304999999987638646999846 No 177 >cd04604 CBS_pair_KpsF_GutQ_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with KpsF/GutQ domains in the API [A5P (D-arabinose 5-phosphate) isomerase] protein. These APIs catalyze the conversion of the pentose pathway intermediate D-ribulose 5-phosphate into A5P, a precursor of 3-deoxy-D-manno-octulosonate, which is an integral carbohydrate component of various glycolipids coating the surface of the outer membrane of Gram-negative bacteria, including lipopolysaccharide and many group 2 K-antigen capsules. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other funct Probab=98.43 E-value=1.4e-06 Score=66.56 Aligned_cols=49 Identities=43% Similarity=0.670 Sum_probs=34.0 Q ss_pred CCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHH Q ss_conf 88152878994654303456566760504167866886027887235450 Q gi|254780889|r 96 SGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDV 145 (493) Q Consensus 96 ~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDi 145 (493) +.|..+|+++.|++++.+++++|.+++++.+||+|++ ++++|+||.+|+ T Consensus 63 ~iM~~~~~tv~~~~~i~~a~~~m~~~~i~~lpVvd~~-~~~vGiit~~Dl 111 (114) T cd04604 63 DVMTRNPKTIDPDALAAEALELMEENKITALPVVDDN-GRPVGVLHIHDL 111 (114) T ss_pred HHHCCCCEEEECCCCHHHHHHHHHHCCEEEEEEECCC-CEEEEEEEHHHH T ss_conf 7435675899389886999999987795799999899-989999988995 No 178 >cd04608 CBS_pair_PALP_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the pyridoxal-phosphate (PALP) dependent enzyme domain upstream. The vitamin B6 complex comprises pyridoxine, pyridoxal, and pyridoxamine, as well as the 5'-phosphate esters of pyridoxal (PALP) and pyridoxamine, the last two being the biologically active coenzyme derivatives. The members of the PALP family are principally involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and other amine-containing compounds. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a poten Probab=98.41 E-value=3.6e-07 Score=70.87 Aligned_cols=54 Identities=26% Similarity=0.401 Sum_probs=48.9 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCC Q ss_conf 786325730358999999986553306457479978688980210001017422 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNA 214 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A 214 (493) ++++|+++++++.+|.++|.+++++.+||||++|+++|++|..|++++..-..+ T Consensus 2 ~~~vtv~p~~tv~~a~~~m~~~~i~~lPVvd~~g~~~G~vt~~~ll~~~~~~~~ 55 (124) T cd04608 2 KAPVTVLPTVTCAEAIEILKEKGFDQLPVVDESGKILGMVTLGNLLSSLSSGKV 55 (124) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEEHHHHHHHHHCCCC T ss_conf 999798998909999999998499856898799968888879999999973899 No 179 >cd04602 CBS_pair_IMPDH_2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the inosine 5' monophosphate dehydrogenase (IMPDH) protein. IMPDH is an essential enzyme that catalyzes the first step unique to GTP synthesis, playing a key role in the regulation of cell proliferation and differentiation. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in IMPDH have been associated with retinitis pigmentos Probab=98.40 E-value=2e-06 Score=65.57 Aligned_cols=47 Identities=15% Similarity=0.309 Sum_probs=38.8 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECC---CCCEEEEEEECCCHH Q ss_conf 78632573035899999998655330645747---997868898021000 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDD---DGCCIGLITVKDIER 207 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe---~g~LvGIIT~~DIlk 207 (493) ++|+|+++++++.||.++|.+++++.+||||+ +|+|+|++|.+|+.. T Consensus 2 ~dPvtv~p~~tv~ea~~lm~~~~~~~~pVvd~g~~~~klvGIvT~rDi~~ 51 (114) T cd04602 2 TDPSVLSPDHTVADVLEIKEKKGFSGIPVTEDGKSGGKLLGIVTSRDIDF 51 (114) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCCCCEEEEEEEHHHHHH T ss_conf 88999799899999999999719877999965776896999999789523 No 180 >cd04631 CBS_pair_18 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.39 E-value=1.5e-06 Score=66.38 Aligned_cols=49 Identities=29% Similarity=0.487 Sum_probs=41.7 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEECCCHHCC Q ss_conf 786325730358999999986553306457479-9786889802100010 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDD-GCCIGLITVKDIERSQ 209 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~-g~LvGIIT~~DIlk~~ 209 (493) |+++|+++++++.+|+++|.+++++.+||||++ |+++|++|.+|+++.. T Consensus 1 r~vvtv~~~~tl~ea~~~m~~~~v~~~PVvd~~~~~lvGiit~~Di~~~~ 50 (125) T cd04631 1 KDVVTVPPTTPIMEAAKIMVRNGFRRLPVVDEGTGKLVGIITATDILKYL 50 (125) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEHHHHHHHH T ss_conf 99779799094999999999839988999979999599999999999987 No 181 >cd04639 CBS_pair_26 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.39 E-value=1.5e-06 Score=66.38 Aligned_cols=48 Identities=29% Similarity=0.413 Sum_probs=37.4 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 863257303589999999865533064574799786889802100010 Q gi|254780889|r 162 NLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 162 ~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) ++.|+++++++.+|.++|.+++++.+||||++|+++|++|.+|+++.. T Consensus 2 ~~~Tv~p~~tl~~a~~~~~~~~~~~~PVvd~~g~lvGivt~~di~~~l 49 (111) T cd04639 2 HFETLSPADTLDDAADALLATTQHEFPVVDGDGHLVGLLTRDDLIRAL 49 (111) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCCEEEEEEHHHHHHHH T ss_conf 887929989799999999971997899993899889999899999998 No 182 >PRK07455 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional Probab=98.36 E-value=1.1e-05 Score=60.15 Aligned_cols=108 Identities=20% Similarity=0.336 Sum_probs=83.2 Q ss_pred CHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCC Q ss_conf 01567899987313666998248411388999999999857987245146469999999985487589961027876653 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTT 311 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtT 311 (493) ++..+-+++|+++|..++=|-.- +...++.|+.+++.||++.|=+|-|-|.|.++..+++||+.+ +.|| ++ T Consensus 25 ~~a~~~~~al~~gGi~~iEiTl~---t~~a~~~I~~l~~~~p~~~iGaGTV~~~e~~~~a~~aGA~Fi---VSP~---~~ 95 (210) T PRK07455 25 ELGLQMAEAVAAGGMRLIEITWN---SDQPAELISQLREKLPECIIGTGTLLTLEDLEEAIAAGAQFC---FTPH---VD 95 (210) T ss_pred HHHHHHHHHHHHCCCCEEEEECC---CCCHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHCCCCEE---ECCC---CC T ss_conf 99999999999879988999689---988999999999878996898881878999999998699999---8688---88 Q ss_pred CHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEE Q ss_conf 10001574310037889998622698199726543777621336714878993 Q gi|254780889|r 312 RVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMI 364 (493) Q Consensus 312 r~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~ 364 (493) .+..+.++++++|.|- |+-++.+|.+|+.+|++.|=+ T Consensus 96 --------------~~vi~~a~~~~i~~iP--Gv~TpsEi~~A~~~G~~~vKl 132 (210) T PRK07455 96 --------------LELIQAAVAADIPIIP--GALTPTEIVTAWQAGASCVKV 132 (210) T ss_pred --------------HHHHHHHHHCCCCEEC--CCCCHHHHHHHHHCCCCEEEE T ss_conf --------------9999999982997658--869999999999869984775 No 183 >cd04633 CBS_pair_20 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.36 E-value=2.2e-06 Score=65.26 Aligned_cols=48 Identities=31% Similarity=0.536 Sum_probs=30.0 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 7863257303589999999865533064574799786889802100010 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) ++++|+++++++.+|.++|.+++++++||+|+ |+++|++|.+|+++.. T Consensus 1 r~vvtv~~~~tl~ea~~~m~~~~i~~lpVvd~-~~~vGiiT~~Di~~~l 48 (121) T cd04633 1 KPVITVSPDDRVSHARRLMLDHDISRLPVIEG-GKLVGIVTEKDIADAL 48 (121) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC-CEEEEEEEHHHHHHHH T ss_conf 99479799398999999999709988999989-9988787599999999 No 184 >cd04585 CBS_pair_ACT_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the acetoin utilization proteins in bacteria. Acetoin is a product of fermentative metabolism in many prokaryotic and eukaryotic microorganisms. They produce acetoin as an external carbon storage compound and then later reuse it as a carbon and energy source during their stationary phase and sporulation. In addition these CBS domains are associated with a downstream ACT domain, which is linked to a wide range of metabolic enzymes that are regulated by amino acid concentration. Pairs of ACT domains bind specifically to a particular amino acid leading to regulation of the linked enzyme. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The i Probab=98.35 E-value=2e-06 Score=65.52 Aligned_cols=48 Identities=33% Similarity=0.524 Sum_probs=34.3 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 7863257303589999999865533064574799786889802100010 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) ++++|+++++++.+|.++|.+++++.+||+| +|+++|++|.+|+++.. T Consensus 1 rdpvti~~~~tl~ea~~~m~~~~~~~lpVvd-~~~lvGiit~~Di~~~~ 48 (122) T cd04585 1 KNPITVTPDTSLMEALKLMKENSIRRLPVVD-RGKLVGIVTDRDLKLAS 48 (122) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEE-CCEEEEEEEHHHHHHHH T ss_conf 9989968929699999999973998899998-99999999889999877 No 185 >PRK05718 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional Probab=98.34 E-value=1.3e-05 Score=59.63 Aligned_cols=108 Identities=20% Similarity=0.379 Sum_probs=87.0 Q ss_pred CHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCC Q ss_conf 01567899987313666998248411388999999999857987245146469999999985487589961027876653 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTT 311 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtT 311 (493) ++..+.+++|+++|..++=|-. .+....+.|+.+++.||++.|=+|-|-|.+.++..+++||+.+ +-|| + T Consensus 27 ~~a~~i~~al~~gGi~~iEiTl---~tp~a~~~I~~l~~~~p~~~vGaGTV~~~e~~~~a~~aGA~Fi---VSP~---~- 96 (212) T PRK05718 27 EDAVPLAKALVAGGLPVLEVTL---RTPAALEAIRAIRKEVPEALIGAGTVLNPEQLAQAIEAGAQFI---VSPG---L- 96 (212) T ss_pred HHHHHHHHHHHHCCCCEEEEEC---CCCHHHHHHHHHHHHCCCCEEEEEEECCHHHHHHHHHCCCCEE---ECCC---C- T ss_conf 9999999999987997899957---8961999999999758981796533134889999998499899---8489---9- Q ss_pred CHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEE Q ss_conf 10001574310037889998622698199726543777621336714878993 Q gi|254780889|r 312 RVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMI 364 (493) Q Consensus 312 r~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~ 364 (493) -.+..+.++++++|.|- |+-++.+|.+|+.+||+.|=+ T Consensus 97 -------------~~~v~~~a~~~~i~~iP--Gv~TpsEi~~A~~~G~~~vK~ 134 (212) T PRK05718 97 -------------TPPLLKACQDGPIPLIP--GVNTPSELMLAMELGLRTFKF 134 (212) T ss_pred -------------CHHHHHHHHHCCCCEEC--CCCCHHHHHHHHHCCCCEEEE T ss_conf -------------89999999981997657--869999999999879998997 No 186 >cd04610 CBS_pair_ParBc_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a ParBc (ParB-like nuclease) domain downstream. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=98.34 E-value=2.1e-06 Score=65.38 Aligned_cols=48 Identities=25% Similarity=0.486 Sum_probs=35.8 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 6786325730358999999986553306457479978688980210001 Q gi|254780889|r 160 TRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 160 t~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) |++++|++++.++.+|.++|.+++++.+||+| +|+++|++|.+|++.. T Consensus 1 Tkdvvtv~~d~tv~ea~~~m~~~~~~~~pVvd-~~~lvGivt~~Dll~~ 48 (107) T cd04610 1 TRDVITVSPDNTVKDVIKLIKETGHDGFPVVD-NGKVVGIVSARDLLGK 48 (107) T ss_pred CCCCEEECCCCCHHHHHHHHHHHCCCEEEEEE-CCEEEEEEEHHHHCCC T ss_conf 98809989929599999999975997899998-9999998876453159 No 187 >cd02810 DHOD_DHPD_FMN Dihydroorotate dehydrogenase (DHOD) and Dihydropyrimidine dehydrogenase (DHPD) FMN-binding domain. DHOD catalyzes the oxidation of (S)-dihydroorotate to orotate. This is the fourth step and the only redox reaction in the de novo biosynthesis of UMP, the precursor of all pyrimidine nucleotides. DHOD requires FMN as co-factor. DHOD divides into class 1 and class 2 based on their amino acid sequences and cellular location. Members of class 1 are cytosolic enzymes and multimers while class 2 enzymes are membrane associated and monomeric. The class 1 enzymes can be further divided into subtypes 1A and 1B which are homodimers and heterotetrameric proteins, respectively. DHPD catalyzes the first step in pyrimidine degradation: the NADPH-dependent reduction of uracil and thymine to the corresponding 5,6-dihydropyrimidines. DHPD contains two FAD, two FMN and eight [4Fe-4S] clusters, arranged in two electron transfer chains that pass its homodimeric interface twice. Two of Probab=98.33 E-value=4.4e-06 Score=63.04 Aligned_cols=137 Identities=20% Similarity=0.262 Sum_probs=86.3 Q ss_pred CCHHHHHHHHHHHCCCCEEEEECC--C---C----H-HHHHHHHHHHHHHHCCCCCEE---CCCCCC---HHHHHHHHHC Q ss_conf 701567899987313666998248--4---1----1-388999999999857987245---146469---9999999854 Q gi|254780889|r 231 AKDIADRVGPLFDVNVDLVVVDTA--H---G----H-SQKVLDAVVQIKKNFPSLLVM---AGNIAT---AEGALALIDA 294 (493) Q Consensus 231 ~~d~~eRa~~LveaGvDviviD~a--h---G----h-~~~~~~~i~~~k~~~~~~~ii---~GNv~t---~~~~~~l~~~ 294 (493) .+++.+-++.+.++|+|+|.+.++ | + + .....+.++++|+.. ++||+ ..|... .+-++.+.++ T Consensus 110 ~~~~~~~a~~~~~~gad~lElNiScPn~~~~~~~~~~~~~~~~i~~~v~~~~-~~Pv~vKLsp~~~~~~~~~ia~~~~~~ 188 (289) T cd02810 110 KEDYVELARKIERAGAKALELNLSCPNVGGGRQLGQDPEAVANLLKAVKAAV-DIPLLVKLSPYFDLEDIVELAKAAERA 188 (289) T ss_pred HHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCC-CCCEEEECCCCCCHHHHHHHHHHHHHC T ss_conf 7899999999998479848998403675655320149999999999998602-687488427887616899999999975 Q ss_pred CCCEEEEEE--CCCCCCCCC--HH-----CCC-CCCCCH-HHHHHHHHHHC--CCCEEEEECCCCCHHHHEEEEEECCCE Q ss_conf 875899610--278766531--00-----015-743100-37889998622--698199726543777621336714878 Q gi|254780889|r 295 GADIIKVGI--GPGSICTTR--VV-----TGV-GCPQLS-AIMSVVEVAER--AGVAIVADGGIRFSGDIAKAIAAGSAC 361 (493) Q Consensus 295 Gad~ikvGi--g~Gs~CtTr--~~-----~g~-g~pq~~-av~~~~~~~~~--~~~~iiadGGi~~~gdi~kAla~GA~~ 361 (493) |+|++-+.= .+-.+.... .. -|. |.|-.. ++..+++..+. .++|||+-|||.+.-|+.+.|.+|||+ T Consensus 189 ga~gv~~~Nt~~~~~~~~~~~~~~~~~~~gGlSG~~i~~~al~~v~~~~~~~~~~i~Iig~GGI~~~~da~e~i~aGA~~ 268 (289) T cd02810 189 GADGLTAINTISGRVVDLKTVGPGPKRGTGGLSGAPIRPLALRWVARLAARLQLDIPIIGVGGIDSGEDVLEMLMAGASA 268 (289) T ss_pred CCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHCCCCH T ss_conf 99689996787765554444554456776523662778899999999999749996099989939999999999849979 Q ss_pred EEECCHH Q ss_conf 9932301 Q gi|254780889|r 362 VMIGSLL 368 (493) Q Consensus 362 VM~G~~~ 368 (493) |++++-| T Consensus 269 Vqv~Tal 275 (289) T cd02810 269 VQVATAL 275 (289) T ss_pred HHHHHHH T ss_conf 9998999 No 188 >PRK08104 consensus Probab=98.33 E-value=1.5e-05 Score=59.17 Aligned_cols=108 Identities=19% Similarity=0.318 Sum_probs=87.2 Q ss_pred CHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCC Q ss_conf 01567899987313666998248411388999999999857987245146469999999985487589961027876653 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTT 311 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtT 311 (493) ++..+-+++|+++|..++=|-. -+...++.|+.+++.||++.|=+|-|-|.|.++..+++||+.+ +.|| + T Consensus 27 ~~a~~la~al~~gGi~~iEiTl---rt~~a~~~I~~l~~~~p~~~vGaGTV~~~e~~~~ai~aGA~Fi---VSP~---~- 96 (212) T PRK08104 27 EHAVPLAKALVAGGVRVLEVTL---RTPCALEAIRAIAKEVPEAIVGAGTVLNPQQLAEVTEAGAQFA---ISPG---L- 96 (212) T ss_pred HHHHHHHHHHHHCCCCEEEEEC---CCCHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHCCCCEE---ECCC---C- T ss_conf 9999999999987998899968---8814999999999868985685420267999999998599999---8489---9- Q ss_pred CHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEE Q ss_conf 10001574310037889998622698199726543777621336714878993 Q gi|254780889|r 312 RVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMI 364 (493) Q Consensus 312 r~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~ 364 (493) -.+..++++++++|.|- |+-++-+|.+|+.+|++.|=+ T Consensus 97 -------------~~~v~~~a~~~~i~~iP--Gv~TpsEi~~A~~~G~~~vKl 134 (212) T PRK08104 97 -------------TEELLKAATEGTIPLIP--GISTVSELMLGMDYGLTEFKF 134 (212) T ss_pred -------------CHHHHHHHHHCCCCEEC--CCCCHHHHHHHHHCCCCEEEE T ss_conf -------------99999999982997656--769999999999879997997 No 189 >PRK06015 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional Probab=98.32 E-value=1.8e-05 Score=58.70 Aligned_cols=108 Identities=18% Similarity=0.265 Sum_probs=87.3 Q ss_pred CHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCC Q ss_conf 01567899987313666998248411388999999999857987245146469999999985487589961027876653 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTT 311 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtT 311 (493) ++..+-+++|+++|..++=|-. -+...++.|+.++++||++.|=+|-|-|.|.++..+++||+.+ +-|| + T Consensus 27 ~~a~~~~~al~~gGi~~iEITl---rt~~a~~~I~~l~~~~p~~~vGaGTVl~~e~~~~a~~aGA~Fi---VSP~---~- 96 (212) T PRK06015 27 EHAVPLARALARGGLPAIEITL---RTPAALDAIRAVAAEVEEAIVGAGTILNAKQFEDAAKAGSRFI---VSPG---T- 96 (212) T ss_pred HHHHHHHHHHHHCCCCEEEEEC---CCCCHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHCCCCEE---ECCC---C- T ss_conf 9999999999987998899968---9951999999999869996795421156999999998499899---8589---9- Q ss_pred CHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEE Q ss_conf 10001574310037889998622698199726543777621336714878993 Q gi|254780889|r 312 RVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMI 364 (493) Q Consensus 312 r~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~ 364 (493) -.+..+.++++++|.|- |+-++-+|.+|+.+|++.|=+ T Consensus 97 -------------~~~v~~~a~~~~i~~iP--Gv~TpsEi~~A~~~G~~~vKl 134 (212) T PRK06015 97 -------------TQELLAAANDSDVPLLP--GAITPSEVMALREEGYTVLKF 134 (212) T ss_pred -------------CHHHHHHHHHCCCCEEC--CCCCHHHHHHHHHCCCCEEEE T ss_conf -------------99999999983997737--869999999999879998997 No 190 >cd04588 CBS_pair_CAP-ED_DUF294_assoc_arch This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the archaeal CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. Probab=98.32 E-value=1.3e-06 Score=66.77 Aligned_cols=47 Identities=36% Similarity=0.600 Sum_probs=24.5 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 786325730358999999986553306457479978688980210001 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) |+++|+++++++.||.++|.+++++.+||+| +++++|++|.+|+++. T Consensus 1 r~vvtv~p~~sl~ea~~~m~~~~i~~~pVvd-~~~lvGiit~~Di~~~ 47 (110) T cd04588 1 KPLITLNPNATLREAARLFNTHHIHGAPVVD-DGKLVGIVTLSDIAHA 47 (110) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEE-CCEEEEEEEHHHHHHH T ss_conf 9947968949899999999970998899998-9999999966898999 No 191 >cd04634 CBS_pair_21 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.31 E-value=6.9e-07 Score=68.85 Aligned_cols=49 Identities=27% Similarity=0.432 Sum_probs=45.4 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCC Q ss_conf 78632573035899999998655330645747997868898021000101 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) |+++|+.+++++.+|.++|.+++++.+||+|+ |+++|++|.+|+++... T Consensus 1 kdvvTv~~d~~l~ea~~~m~~~~i~~lPVvd~-~~lvGivT~~Dil~~~~ 49 (143) T cd04634 1 KNPITCNADDTISDAARLLRENKISGAPVLDG-GKLVGIVSESDILKLLV 49 (143) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC-CEEEEEEEHHHHHHHHH T ss_conf 99889589398999999999749977999979-99999998699998875 No 192 >cd04734 OYE_like_3_FMN Old yellow enzyme (OYE)-related FMN binding domain, group 3. Each monomer of OYE contains FMN as a non-covalently bound cofactor, uses NADPH as a reducing agent with oxygens, quinones, and alpha,beta-unsaturated aldehydes and ketones, and can act as electron acceptors in the catalytic reaction. Other members of OYE family include trimethylamine dehydrogenase, 2,4-dienoyl-CoA reductase, enoate reductase, pentaerythriol tetranitrate reductase, xenobiotic reductase, and morphinone reductase. One member of this subgroup, the Sinorhizobium meliloti stachydrine utilization protein stcD, has been idenified as a putative N-methylproline demethylase. Probab=98.30 E-value=5.8e-06 Score=62.21 Aligned_cols=141 Identities=20% Similarity=0.248 Sum_probs=97.1 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCH------------------------HHHHHHHHHHHHHHCCCCCEECCC------- Q ss_conf 1567899987313666998248411------------------------388999999999857987245146------- Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVVDTAHGH------------------------SQKVLDAVVQIKKNFPSLLVMAGN------- 281 (493) Q Consensus 233 d~~eRa~~LveaGvDviviD~ahGh------------------------~~~~~~~i~~~k~~~~~~~ii~GN------- 281 (493) ++.+-|+...+||.|.|-|-.|||| .+..++.++.+|+..+.-.+|.=- T Consensus 142 ~f~~AA~~A~~AGfDgVEIH~ahGYLl~qFlSp~~N~RtDeYGGs~eNR~Rf~~EIi~~Ir~~vg~~f~i~~Ris~~~~~ 221 (343) T cd04734 142 AFADAARRCQAGGLDGVELQAAHGHLIDQFLSPLTNRRTDEYGGSLENRMRFLLEVLAAVRAAVGPDFIVGIRISGDEDT 221 (343) T ss_pred HHHHHHHHHHHCCCCEEEECCCCCCHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCC T ss_conf 99999999997399889844577746998469855899676798889998999999999999819877615886762356 Q ss_pred --CCCH----HHHHHHHHCC-CCEEEEEECCCC--CCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHE Q ss_conf --4699----9999998548-758996102787--665310001574310037889998622698199726543777621 Q gi|254780889|r 282 --IATA----EGALALIDAG-ADIIKVGIGPGS--ICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIA 352 (493) Q Consensus 282 --v~t~----~~~~~l~~~G-ad~ikvGig~Gs--~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~ 352 (493) --|. +-++.|.++| .|.+-|..|... .+.+........|+..-+..+....+..++|||+-||++++-++- T Consensus 222 ~~g~~~~e~~~~~~~l~~~G~vD~l~vs~g~~~~~~~~~~~~p~~~~~~g~~~~~a~~ik~~~~~pvi~~G~i~~~~~ae 301 (343) T cd04734 222 EGGLSPDEALEIAARLAAEGLIDYVNVSAGSYYTLLGLAHVVPSMGMPPGPFLPLAARIKQAVDLPVFHAGRIRDPAEAE 301 (343) T ss_pred CCCCCHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCCHHHCCCCCCCCCCCCHHHHHHHHHHCCCCEEEECCCCCHHHHH T ss_conf 89899899999999999669976899656754332221100687667764348899999997298599979989999999 Q ss_pred EEEEEC-CCEEEECCHHHCCCC Q ss_conf 336714-878993230101216 Q gi|254780889|r 353 KAIAAG-SACVMIGSLLAGTDE 373 (493) Q Consensus 353 kAla~G-A~~VM~G~~~agt~E 373 (493) ++|+-| +|.|.+|..|----+ T Consensus 302 ~~l~~g~~D~V~~gR~~ladPd 323 (343) T cd04734 302 QALAAGHADMVGMTRAHIADPH 323 (343) T ss_pred HHHHCCCCEEHHHHHHHHHCCC T ss_conf 9998799621697899997811 No 193 >cd04599 CBS_pair_GGDEF_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the GGDEF (DiGuanylate-Cyclase (DGC)) domain. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=98.29 E-value=4.3e-06 Score=63.13 Aligned_cols=46 Identities=43% Similarity=0.609 Sum_probs=22.6 Q ss_pred EECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHH Q ss_conf 152878994654303456566760504167866886027887235450 Q gi|254780889|r 98 MVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDV 145 (493) Q Consensus 98 mI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDi 145 (493) |..+|+|++|++++.+|.++|.+++++.+||+|+ |+|+||||.+|+ T Consensus 57 Mt~~~itv~~~~~l~~a~~~M~~~~i~~lpVvd~--g~lvGiiT~~Di 102 (105) T cd04599 57 MTREVVTISPEASLLEAKRLMEEKKIERLPVLRE--RKLVGIITKGTI 102 (105) T ss_pred CCCCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEECHHH T ss_conf 0279899979491999999779859979999989--999999994586 No 194 >cd04623 CBS_pair_10 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.29 E-value=9.2e-07 Score=67.92 Aligned_cols=49 Identities=33% Similarity=0.584 Sum_probs=36.6 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 7863257303589999999865533064574799786889802100010 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) |+++|+++++++.+|.++|.+++++.+||+|++++++|++|.+|+++.. T Consensus 1 r~vvti~~~~tl~ea~~~m~~~~~~~l~Vvd~~~~l~Giit~~Di~~~~ 49 (113) T cd04623 1 RDVITVRPDATVAEAAKLMAEKNIGAVVVVDDGGRLVGIFSERDIVRKV 49 (113) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHH T ss_conf 9976968939799999999973998899990898299999768999999 No 195 >cd04612 CBS_pair_SpoIVFB_EriC_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with either the SpoIVFB domain (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) or the chloride channel protein EriC. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A ), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. EriC is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase an Probab=98.29 E-value=2.4e-06 Score=64.89 Aligned_cols=48 Identities=31% Similarity=0.520 Sum_probs=40.0 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 7863257303589999999865533064574799786889802100010 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) .+++|+++++++.+|.++|.+++++.+||+| +|+++|++|.+|+++.. T Consensus 1 Pdvvtv~p~~tl~~a~~~m~~~~~~~lpVvd-~~~l~Givt~~di~~~~ 48 (111) T cd04612 1 PDVVTVPVDLTVDEVLALMFGERHRGYPVVD-DGRLVGIVTLADIRRVP 48 (111) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEE-CCEEEEEEEHHHHHHHH T ss_conf 9849999929599999999971994899999-99999999899999877 No 196 >cd04626 CBS_pair_13 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.28 E-value=2.3e-06 Score=65.09 Aligned_cols=47 Identities=26% Similarity=0.521 Sum_probs=33.9 Q ss_pred CEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHH Q ss_conf 8152878994654303456566760504167866886027887235450 Q gi|254780889|r 97 GMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDV 145 (493) Q Consensus 97 gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDi 145 (493) .|..+|+|++|++++.+|+++|.+++++++||+|+ +||+||||.+|+ T Consensus 62 iMt~~~itv~~~~~~~~a~~~M~~~~i~~LPVvd~--~klvGiit~~Di 108 (111) T cd04626 62 IVSQDVFYVNEEDTIDEALDIMREKQIGRLPVVDD--NKLIGVVRTKDI 108 (111) T ss_pred HCCCCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHH T ss_conf 60699879999793999999999809988999999--999999996981 No 197 >cd04625 CBS_pair_12 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.27 E-value=1.1e-06 Score=67.39 Aligned_cols=73 Identities=27% Similarity=0.416 Sum_probs=54.3 Q ss_pred CCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEE Q ss_conf 23688899999977987999789898999999998742238815287899465430345656676050416786688602 Q gi|254780889|r 56 QVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQVKKFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGK 135 (493) Q Consensus 56 TVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gk 135 (493) -||+.++--+++..|. ..++ .+| +..|..+|++++|++++.+|.++|.+++++.+||+|+ |+ T Consensus 38 IiT~rDi~~~~~~~~~--------~~~~-----~~V---~~iMt~~~~tv~~~~~i~~a~~~M~~~~i~~lpVvd~--g~ 99 (112) T cd04625 38 LLTFREVLQAMAQHGA--------GVLD-----TTV---RAIMNPEPIVASPDDSIDEVRRLMVERHLRYLPVLDG--GT 99 (112) T ss_pred EEEHHHHHHHHHHCCC--------CCCC-----CCH---HHHEECCCEEECCCCHHHHHHHHHHHCCCEEEEEEEC--CE T ss_conf 9987999999997099--------8011-----708---8833489779989996999999768669769999999--99 Q ss_pred EEEEEEHHHHC Q ss_conf 78872354500 Q gi|254780889|r 136 LVGILTNRDVR 146 (493) Q Consensus 136 LiGIVT~rDir 146 (493) |+|+||.+|+. T Consensus 100 lvGiit~~Di~ 110 (112) T cd04625 100 LLGVISFHDVA 110 (112) T ss_pred EEEEEEHHHHH T ss_conf 99999979963 No 198 >cd04607 CBS_pair_NTP_transferase_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain associated with the NTP (Nucleotidyl transferase) domain downstream. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=98.26 E-value=3.9e-06 Score=63.40 Aligned_cols=46 Identities=26% Similarity=0.434 Sum_probs=24.6 Q ss_pred CCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 6325730358999999986553306457479978688980210001 Q gi|254780889|r 163 LITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 163 litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) ++++++++++.+|.+.|.+++++.+||+|++++++|++|..|+.|. T Consensus 4 pi~v~~~~tl~ea~~~m~~~~~~~~~Vvd~~~~l~GiiT~~Di~r~ 49 (113) T cd04607 4 QLLVSPDASILDALRKIDKNALRIVLVVDENGRLLGTVTDGDIRRA 49 (113) T ss_pred CEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEECHHHHHH T ss_conf 8896994989999999987599789999799959999987277668 No 199 >cd04609 CBS_pair_PALP_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the pyridoxal-phosphate (PALP) dependent enzyme domain upstream. The vitamin B6 complex comprises pyridoxine, pyridoxal, and pyridoxamine, as well as the 5'-phosphate esters of pyridoxal (PALP) and pyridoxamine, the last two being the biologically active coenzyme derivatives. The members of the PALP family are principally involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and other amine-containing compounds. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a pote Probab=98.25 E-value=3.4e-06 Score=63.87 Aligned_cols=48 Identities=23% Similarity=0.382 Sum_probs=44.2 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCC Q ss_conf 8632573035899999998655330645747997868898021000101 Q gi|254780889|r 162 NLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 162 ~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) +++|+++++++.+|+++|.+++++.+||+|+ ++++|++|.+|+++... T Consensus 2 dvvtv~p~~tl~~a~~~m~~~~i~~lpVvd~-~~lvGiit~~Di~~~~~ 49 (110) T cd04609 2 DVVSVAPDDTVSQAIERMREYGVSQLPVVDD-GRVVGSIDESDLLDALI 49 (110) T ss_pred CEEEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHHHHH T ss_conf 8069999497999999999819988999989-99999999999999986 No 200 >cd04595 CBS_pair_DHH_polyA_Pol_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with an upstream DHH domain which performs a phosphoesterase function and a downstream polyA polymerase domain. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=98.24 E-value=7.3e-06 Score=61.47 Aligned_cols=50 Identities=28% Similarity=0.458 Sum_probs=37.6 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCC Q ss_conf 678632573035899999998655330645747997868898021000101 Q gi|254780889|r 160 TRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 160 t~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) +++++|+++++++.||.++|.+++++.+||+|+ |+|+|++|.+|+.+... T Consensus 1 sspV~ti~p~~tv~ea~~~m~~~~i~~lpVvd~-~~lvGivT~~Di~~~~~ 50 (110) T cd04595 1 SSPVKTVRPEATIEEARELLLRYGHTALPVVEG-GRVVGIISRRDVEKALR 50 (110) T ss_pred CCCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC-CEEEEEEEHHHHHHHHH T ss_conf 989899799599999999999759988999989-99999997689887765 No 201 >PRK08782 consensus Probab=98.24 E-value=2.1e-05 Score=58.12 Aligned_cols=108 Identities=16% Similarity=0.331 Sum_probs=87.2 Q ss_pred CHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCC Q ss_conf 01567899987313666998248411388999999999857987245146469999999985487589961027876653 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTT 311 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtT 311 (493) ++..+-+++|+++|+.++=|-. -+...++.|+.+++.+|++.+=+|-|-|.|.++..+++||+.+ +.|| T Consensus 29 ~~a~~~~eal~~gGi~~iEiTl---rt~~a~~~i~~l~~~~p~~~vGaGTV~~~e~~~~a~~aGA~Fi---VSP~----- 97 (219) T PRK08782 29 DQARRVADALLEGGLPAIELTL---RTPVAIEALAMLKRELPNIVIGAGTVLSERQLRQSVDAGADFL---VTPG----- 97 (219) T ss_pred HHHHHHHHHHHHCCCCEEEEEC---CCCHHHHHHHHHHHHCCCCEEEEEEECCHHHHHHHHHCCCCEE---ECCC----- T ss_conf 9999999999987998799967---9933999999999868994799997058999999998499899---8789----- Q ss_pred CHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEE Q ss_conf 10001574310037889998622698199726543777621336714878993 Q gi|254780889|r 312 RVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMI 364 (493) Q Consensus 312 r~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~ 364 (493) + -.+..++++++++|.|- |+-++.+|..|+.+|++.+=+ T Consensus 98 ----------~--~~~v~~~a~~~~i~~iP--Gv~TpSEi~~A~~~G~~~vKl 136 (219) T PRK08782 98 ----------T--PAPLARLLADAPIPAVP--GAATPTELLTLMGLGFRVCKL 136 (219) T ss_pred ----------C--CHHHHHHHHHCCCCEEC--CCCCHHHHHHHHHCCCCEEEE T ss_conf ----------9--79999999981997647--859999999999879998997 No 202 >cd02911 arch_FMN Archeal FMN-binding domain. This family of archaeal proteins are part of the NAD(P)H-dependent flavin oxidoreductase (oxidored) FMN-binding family that reduce a range of alternative electron acceptors. Most use FAD/FMN as a cofactor and NAD(P)H as electron donor. Some contain 4Fe-4S cluster to transfer electron from FAD to FMN. The specific function of this group is unknown. Probab=98.24 E-value=0.00018 Score=51.45 Aligned_cols=35 Identities=26% Similarity=0.367 Sum_probs=28.8 Q ss_pred CEEEECCCCCCCHHHHHHHHHCCCEEEE-CCCCCHH Q ss_conf 1783378523688899999977987999-7898989 Q gi|254780889|r 48 PIMSAAMDQVTDSRLAIAMAQAGGLGVI-HRNFSPS 82 (493) Q Consensus 48 PivSs~MDTVTe~~MAiamA~~GGiGvI-Hrn~sie 82 (493) |++=|||--||.......+++.+|++++ ..+.+-+ T Consensus 1 PvvLAPMAGVTD~pFrr~~~~~~g~~~~gg~~~~~~ 36 (233) T cd02911 1 PVALASMAGITDGDFCRKRADHAGLVFLGGYNLDER 36 (233) T ss_pred CEEEECCCCCCCHHHHHHHHHHCCCEEEECCCCHHH T ss_conf 948857899958999999998379579721220189 No 203 >PRK03220 consensus Probab=98.23 E-value=1.7e-05 Score=58.92 Aligned_cols=186 Identities=22% Similarity=0.242 Sum_probs=118.1 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEE Q ss_conf 58999999986553306457479978688980210001017422326568765556651370156789998731366699 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVV 250 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviv 250 (493) +..+..+.+.+.+...+-+||=++..-|--...++++... +.-.-.+-+|.. ++ ..|.++.|+++|+|-++ T Consensus 32 dP~~~a~~~~~~G~d~lhivDld~a~~g~~~n~~~I~~i~-----~~~~~pi~vGGG--Ir--s~e~~~~ll~~GadkVv 102 (257) T PRK03220 32 DPVELAAVYDAEGADELTFLDVTASSSGRATMLDVVRRTA-----EQVFIPLTVGGG--VR--TVEDVDSLLRAGADKVS 102 (257) T ss_pred CHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHH-----HCCCCCEEEECC--CC--CHHHHHHHHHCCCCEEE T ss_conf 9999999999869998999908887567630799999998-----506964898478--58--79999999981975087 Q ss_pred EECCCCHHHHHHHHHHHHHHHCCCCCEE------CC-----------CCCC-----------HHHHHHHHHCCCCEEEEE Q ss_conf 8248411388999999999857987245------14-----------6469-----------999999985487589961 Q gi|254780889|r 251 VDTAHGHSQKVLDAVVQIKKNFPSLLVM------AG-----------NIAT-----------AEGALALIDAGADIIKVG 302 (493) Q Consensus 251 iD~ahGh~~~~~~~i~~~k~~~~~~~ii------~G-----------Nv~t-----------~~~~~~l~~~Gad~ikvG 302 (493) |-++--.+ .+.++.+.+.|++-.|+ -+ .|.| .+-++.+.+.|+.-+.+- T Consensus 103 igs~a~~~---p~~~~~~~~~fG~q~Iv~siD~k~~~~~~~~~~~g~~v~~~g~~~~t~~~~~~~i~~~~~~g~geil~t 179 (257) T PRK03220 103 VNTAAIAR---PELLAELARRFGSQCIVLSVDARRVPVGSAPTPSGFEVTTHGGRRGTGIDAVEWAARGAELGVGEILLN 179 (257) T ss_pred CHHHHHHC---CHHHHHHHHHCCCEEEEEEEEEEECCCCCCCCCCCEEEEECCCEEECCCCHHHHHHHHHHCCCCEEEEE T ss_conf 20667759---477789998709866999999886256774346874999728826028759999999862698889999 Q ss_pred ECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCE Q ss_conf 0278766531000157431003788999862269819972654377762133671487899323010121677644 Q gi|254780889|r 303 IGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDI 378 (493) Q Consensus 303 ig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~ 378 (493) ...|.=|.-| |.+..+...+ +..++|+||-||+...-|+.+++..|++.|.+|+.|--.+-+|.+. T Consensus 180 ------dI~rDGt~~G-~d~~l~~~i~---~~~~~piIasGGv~s~~di~~l~~~g~~gv~~g~a~~~~~~s~~~~ 245 (257) T PRK03220 180 ------SMDADGTKAG-FDLEMLRAVR---AAVTVPVIASGGAGAVEHFAPAVAAGADAVLAASVFHFGELTIGQV 245 (257) T ss_pred ------EECCCCCCCC-CCHHHHHHHH---HHCCCCEEEECCCCCHHHHHHHHHCCCCEEEEHHHHHCCCCCHHHH T ss_conf ------8868660237-8969999999---7489998998789999999999978997998746878899889999 No 204 >cd04587 CBS_pair_CAP-ED_DUF294_PBI_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with either the CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain or the PB1 (Phox and Bem1p) domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. The PB1 domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pai Probab=98.23 E-value=1.8e-06 Score=65.85 Aligned_cols=49 Identities=31% Similarity=0.490 Sum_probs=31.7 Q ss_pred CCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHH Q ss_conf 88152878994654303456566760504167866886027887235450 Q gi|254780889|r 96 SGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDV 145 (493) Q Consensus 96 ~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDi 145 (493) +.|..+|+++.|++++.+|+++|.+++++.+||+|++ |+++||+|.+|+ T Consensus 62 ~iMt~~~~tv~~~~~l~~a~~~m~~~~i~~lpVvd~~-~~~vGiit~~Di 110 (113) T cd04587 62 RVMTPNPVCATSDTPVLEALHLMVQGKFRHLPVVDKS-GQVVGLLDVTKL 110 (113) T ss_pred EEECCCCEEEECCCCHHHHHHHHHHCCCEEEEEEECC-CEEEEEEEHHHH T ss_conf 6661684699079999999999987794199999269-989999984885 No 205 >cd04601 CBS_pair_IMPDH This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the inosine 5' monophosphate dehydrogenase (IMPDH) protein. IMPDH is an essential enzyme that catalyzes the first step unique to GTP synthesis, playing a key role in the regulation of cell proliferation and differentiation. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in IMPDH have been associated with retinitis pigmentosa. Probab=98.23 E-value=1.7e-06 Score=66.04 Aligned_cols=48 Identities=35% Similarity=0.512 Sum_probs=44.7 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 786325730358999999986553306457479978688980210001 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) ++++|+++++++.+|.++|.+++++.+||||++++|+|++|.+|+... T Consensus 2 rdvvtv~~~~sl~~a~~~m~~~~~~~~pVvd~~~~l~Giit~~Dl~~~ 49 (110) T cd04601 2 RDPITVSPDATVAEALELMAEYGISGLPVVDDDGKLVGIVTNRDLRFE 49 (110) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCEEEEEEEHHHHHHH T ss_conf 889897993969999999998499889999489969878877677650 No 206 >PRK06552 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional Probab=98.23 E-value=3.8e-05 Score=56.31 Aligned_cols=108 Identities=19% Similarity=0.287 Sum_probs=70.2 Q ss_pred CHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHC---CCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCC Q ss_conf 01567899987313666998248411388999999999857---987245146469999999985487589961027876 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNF---PSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSI 308 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~---~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~ 308 (493) ++..+.+++|+++|..+|=|-. .+....+.++.+++.| |++.|=+|-|-|.+.++..+++||+.+ +-|| T Consensus 25 ~~a~~~~~al~~gGi~~iEITl---~tp~a~~~i~~l~~~~~~~p~~~iGaGTV~~~e~~~~a~~aGA~Fi---VSP~-- 96 (209) T PRK06552 25 EEALKISLAVIKGGIKAIEVTY---TNPFASEVIKELVERYKDDPEVLIGAGTVLDAVTARQAILAGAQFI---VSPS-- 96 (209) T ss_pred HHHHHHHHHHHHCCCCEEEEEC---CCCCHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHCCCCEE---ECCC-- T ss_conf 9999999999987998899967---8975999999999981779981898872748999999998599889---7699-- Q ss_pred CCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEE Q ss_conf 65310001574310037889998622698199726543777621336714878993 Q gi|254780889|r 309 CTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMI 364 (493) Q Consensus 309 CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~ 364 (493) ++ .+..+.++++++|.|- |+-++.+|.+|+.+||+.|=+ T Consensus 97 -~~--------------~~v~~~a~~~~i~~iP--G~~TpsEi~~A~~~Ga~~vKl 135 (209) T PRK06552 97 -FN--------------RETAKICNRYQIPYLP--GCMTVTEIVTALEAGVDIVKL 135 (209) T ss_pred -CC--------------HHHHHHHHHCCCCEEC--CCCCHHHHHHHHHCCCCEEEE T ss_conf -98--------------9999999985996417--979999999999869995885 No 207 >cd04620 CBS_pair_7 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=98.23 E-value=1.6e-06 Score=66.29 Aligned_cols=49 Identities=29% Similarity=0.397 Sum_probs=39.5 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 7863257303589999999865533064574799786889802100010 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) |+|+|+++++++.||.++|.++++..+||||++++++|++|.+|+.|.. T Consensus 1 r~pitv~pd~tv~ea~~~M~~~~~~~~~vV~~~~~lvGIvT~~Di~r~~ 49 (115) T cd04620 1 RHPLTVTPDTPVADAIALMSQQGDSSCVLVVEKGRLLGIFTERDIVRLT 49 (115) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHH T ss_conf 9699969919999999999983994699993799299999869999999 No 208 >cd04596 CBS_pair_DRTGG_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a DRTGG domain upstream. The function of the DRTGG domain, named after its conserved residues, is unknown. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=98.22 E-value=1.6e-06 Score=66.25 Aligned_cols=48 Identities=19% Similarity=0.287 Sum_probs=30.0 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 786325730358999999986553306457479978688980210001 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) .++++++++.++.+|.++|.+++++++||||++++|+|++|.+|+++. T Consensus 2 ~dv~~l~p~~tv~~a~~lm~~~~~~~~PVvd~~~~lvGivt~~Di~~~ 49 (108) T cd04596 2 EDTGYLTTTDTVKDWHELNKETGHSRFPVVDEKNKVVGIVTSKDVAGK 49 (108) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCEEEEEEEHHHHHHC T ss_conf 988898999979999999998599889999689909999997997517 No 209 >cd04630 CBS_pair_17 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.22 E-value=3.9e-06 Score=63.40 Aligned_cols=87 Identities=22% Similarity=0.309 Sum_probs=58.0 Q ss_pred CCCEECCCEEEECCC----CCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHHHH Q ss_conf 881600317833785----2368889999997798799978989899999999874223881528789946543034565 Q gi|254780889|r 41 KDFTLNLPIMSAAMD----QVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQVKKFESGMVVNPVTISPYATLADALA 116 (493) Q Consensus 41 ~~i~l~iPivSs~MD----TVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~ 116 (493) ++++ .+|++..+-. -+|+.++..++...|. ++++ . +....|..++++++|++++.++++ T Consensus 22 ~~i~-~v~V~~~~~~~~vGiiT~~Di~~~~~~~~~--------~~~~-----~---~V~~vMt~~v~tv~~d~~~~~a~~ 84 (114) T cd04630 22 HGVS-SLVVEKRRESDAYGIVTMRDILKKVVAEGR--------DPDR-----V---NVYEIMTKPLISVSPDMDIKYCAR 84 (114) T ss_pred HCCC-EEEEEECCCCCEEEEEEEHHHHHHHHHCCC--------CCCC-----C---CHHHHCCCCEEEECCCCCHHHHHH T ss_conf 2998-899997899608999981898999984678--------9444-----4---986827035499999494999999 Q ss_pred HHHHHCCCCEEEECCCCEEEEEEEEHHHHC Q ss_conf 667605041678668860278872354500 Q gi|254780889|r 117 LMKKYSISGIPVVESDVGKLVGILTNRDVR 146 (493) Q Consensus 117 lm~~~~is~iPVVD~~~gkLiGIVT~rDir 146 (493) +|.+++++++||+|+ |+++||||.+|+. T Consensus 85 ~m~~~~i~~lpVvd~--~~lvGiit~~Dil 112 (114) T cd04630 85 LMERTNIRRAPVVEN--NELIGIISLTDIF 112 (114) T ss_pred HHHHCCCCEEEEEEC--CEEEEEEEHHHHH T ss_conf 999779729999999--9999999768964 No 210 >PRK01033 imidazole glycerol phosphate synthase subunit HisF; Provisional Probab=98.20 E-value=1.5e-05 Score=59.17 Aligned_cols=176 Identities=21% Similarity=0.219 Sum_probs=109.7 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEE Q ss_conf 58999999986553306457479978688980210001017422326568765556651370156789998731366699 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVV 250 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviv 250 (493) +..+..+.+.+.+...+-|||=++..-|--...++++... +...-.+-+|.-| + ..|.++.|+++|+|-++ T Consensus 31 dP~~~ak~f~~~GadelhivDld~a~~g~~~n~~~I~~I~-----~~~~ipi~vGGGI--r--s~e~~~~ll~~GadkVi 101 (253) T PRK01033 31 DPINAVRIFNEKEADELIVLDIDASRKGREPNYELIENLA-----SECFMPLCYGGGI--K--TVEQAKRIFSLGVEKVS 101 (253) T ss_pred CHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHH-----HHCCCCEEEECCC--C--CHHHHHHHHHCCCCEEE T ss_conf 9999999999879998999947454248801699999999-----8769988986881--2--16888999867986699 Q ss_pred EECCCCHHHHHHHHHHHHHHHCCC------CCEECC-----CCC-------C----HHHHHHHHHCCCCEEEEEECCCCC Q ss_conf 824841138899999999985798------724514-----646-------9----999999985487589961027876 Q gi|254780889|r 251 VDTAHGHSQKVLDAVVQIKKNFPS------LLVMAG-----NIA-------T----AEGALALIDAGADIIKVGIGPGSI 308 (493) Q Consensus 251 iD~ahGh~~~~~~~i~~~k~~~~~------~~ii~G-----Nv~-------t----~~~~~~l~~~Gad~ikvGig~Gs~ 308 (493) +.++-- .-.+.++.+-+.|++ +++--| .|. | .+-+..+.+.|+.-+. T Consensus 102 igs~a~---~~p~~i~~~~~~fG~q~IvvsiD~k~~~~~~~~v~~~g~~~~t~~~~~~~~~~~~~~g~geil-------- 170 (253) T PRK01033 102 ISTAAL---EDPKLITEAAEIYGSQSVVVSIDVKKRLFGRYDVYTHNGTKKTGLDPVEFAKQAEELGAGEIV-------- 170 (253) T ss_pred ECCHHH---HCCHHHHHHHHHCCCCCEEEEEEEECCCCCCEEEEEECCCCCCCCCHHHHHHHHHHCCCCEEE-------- T ss_conf 998786---374165789987799769999998248778347898679536785589999998746977999-------- Q ss_pred CCCCHHCCCCC-CCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEE-EECCCEEEECCHHH Q ss_conf 65310001574-310037889998622698199726543777621336-71487899323010 Q gi|254780889|r 309 CTTRVVTGVGC-PQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAI-AAGSACVMIGSLLA 369 (493) Q Consensus 309 CtTr~~~g~g~-pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAl-a~GA~~VM~G~~~a 369 (493) ||-=..-|..- |-+..+.+.+ +..++|+||-||+...-|+.++| ..|+++|..||+|- T Consensus 171 ~TdI~rDGt~~G~d~~l~~~i~---~~~~ipiIasGGi~s~~di~~l~~~~~v~gv~~gs~F~ 230 (253) T PRK01033 171 LNSIDRDGVMKGYDLELIKKIS---SAVKIPVTALGGAGSLDDIADLIQEAGASAAAAGSLFV 230 (253) T ss_pred EEEECCCCCCCCCCHHHHHHHH---HHCCCCEEEECCCCCHHHHHHHHHHCCCEEEEEEEEEE T ss_conf 9878488976687999999999---87899999978989999999999867973997831689 No 211 >cd02930 DCR_FMN 2,4-dienoyl-CoA reductase (DCR) FMN-binding domain. DCR in E. coli is an iron-sulfur flavoenzyme which contains FMN, FAD, and a 4Fe-4S cluster. It is also a monomer, unlike that of its eukaryotic counterparts which form homotetramers and lack the flavin and iron-sulfur cofactors. Metabolism of unsaturated fatty acids requires auxiliary enzymes in addition to those used in b-oxidation. After a given number of cycles through the b-oxidation pathway, those unsaturated fatty acyl-CoAs with double bonds at even-numbered carbon positions contain 2-trans, 4-cis double bonds that can not be modified by enoyl-CoA hydratase. DCR utilizes NADPH to remove the C4-C5 double bond. DCR can catalyze the reduction of both natural fatty acids with cis double bonds, as well as substrates containing trans double bonds. The reaction is initiated by hybrid transfer from NADPH to FAD, which in turn transfers electrons, one at a time, to FMN via the 4Fe-4S cluster. The fully reduced FMN provi Probab=98.20 E-value=1.2e-05 Score=59.92 Aligned_cols=138 Identities=19% Similarity=0.234 Sum_probs=94.7 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCHH------------------------HHHHHHHHHHHHHC-CCCCEE--------- Q ss_conf 15678999873136669982484113------------------------88999999999857-987245--------- Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVVDTAHGHS------------------------QKVLDAVVQIKKNF-PSLLVM--------- 278 (493) Q Consensus 233 d~~eRa~~LveaGvDviviD~ahGh~------------------------~~~~~~i~~~k~~~-~~~~ii--------- 278 (493) ++.+-|+...+||.|.|-|.-||||- +..++.++.+|+.. ++.+|. T Consensus 138 ~f~~AA~rA~~AGfDgVEIH~ahGyLl~qFLSp~~N~RtDeYGGs~eNR~Rf~~Eiv~aVr~~vg~d~~v~~Ris~~d~~ 217 (353) T cd02930 138 DFARCAALAREAGYDGVEIMGSEGYLINQFLAPRTNKRTDEWGGSFENRMRFPVEIVRAVRAAVGEDFIIIYRLSMLDLV 217 (353) T ss_pred HHHHHHHHHHHCCCCEEEECCCCCHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCC T ss_conf 99999999998299989962567614877338754788574579878887999999999999709987499973601268 Q ss_pred CCCCCCHHH----HHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEE Q ss_conf 146469999----9999854875899610278766531000157431003788999862269819972654377762133 Q gi|254780889|r 279 AGNIATAEG----ALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKA 354 (493) Q Consensus 279 ~GNv~t~~~----~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kA 354 (493) .|.. |.|- +..|.++|.|.+-|+.|.-+.= ........|.-.-...+..+.+..++|||+-|||+++-.+-++ T Consensus 218 ~~G~-~~~e~~~~~~~l~~~GvD~i~vs~G~~~~~--~~~~~~~~p~g~~~~~a~~ir~~~~~Pvi~~G~i~~p~~ae~~ 294 (353) T cd02930 218 EGGS-TWEEVVALAKALEAAGADILNTGIGWHEAR--VPTIATSVPRGAFAWATAKLKRAVDIPVIASNRINTPEVAERL 294 (353) T ss_pred CCCC-CHHHHHHHHHHHHHCCCCEEEECCCCCCCC--CCCCCCCCCCHHHHHHHHHHHHHCCCCEEECCCCCCHHHHHHH T ss_conf 9998-999999999999981999999637744466--8753345772366999999887548348965997989999999 Q ss_pred EEEC-CCEEEECCHHHCCCC Q ss_conf 6714-878993230101216 Q gi|254780889|r 355 IAAG-SACVMIGSLLAGTDE 373 (493) Q Consensus 355 la~G-A~~VM~G~~~agt~E 373 (493) |+-| ||+|.||..|----+ T Consensus 295 l~~g~aD~V~~gR~liadPd 314 (353) T cd02930 295 LADGDADMVSMARPFLADPD 314 (353) T ss_pred HHCCCCCHHHHHHHHHHCCH T ss_conf 98799624784099876936 No 212 >cd04592 CBS_pair_EriC_assoc_euk This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually Probab=98.20 E-value=1.8e-06 Score=65.93 Aligned_cols=48 Identities=33% Similarity=0.462 Sum_probs=45.0 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 786325730358999999986553306457479978688980210001 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) ++++++++++++.||.++|.+++++.+||||++++|+||+|..||.+. T Consensus 1 t~~vtV~p~ttv~EA~~lM~~~~~~~~~VVD~~~~L~GIvt~gDi~R~ 48 (133) T cd04592 1 TKYIKVSPTTTLKEALNLMLDEKQSCVLVVDSDDFLEGILTLGDIQRF 48 (133) T ss_pred CCEEEECCCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEEHHHHHHH T ss_conf 945998999859999999998188657998389978999787999999 No 213 >pfam00571 CBS CBS domain. CBS domains are small intracellular modules that pair together to form a stable globular domain. This family represents a single CBS domain. Pairs of these domains have been termed a Bateman domain. CBS domains have been shown to bind ligands with an adenosyl group such as AMP, ATP and S-AdoMet. CBS domains are found attached to a wide range of other protein domains suggesting that CBS domains may play a regulatory role making proteins sensitive to adenosyl carrying ligands. The region containing the CBS domains in Cystathionine-beta synthase is involved in regulation by S-AdoMet. CBS domain pairs from AMPK bind AMP or ATP. The CBS domains from IMPDH and the chloride channel CLC2 bind ATP. Probab=98.20 E-value=2e-06 Score=65.57 Aligned_cols=55 Identities=36% Similarity=0.642 Sum_probs=49.2 Q ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 4400267863257303589999999865533064574799786889802100010 Q gi|254780889|r 155 VGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 155 V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) +.++|+++++++++++++.++.+.|.+++.+++||+|++++++|++|++|+++.. T Consensus 1 v~~im~~~~~~v~~~~~~~~~~~~~~~~~~~~~~Vvd~~~~~~Gvit~~dl~~~~ 55 (57) T pfam00571 1 VKDIMTPDVVTVPPDTSLEEALELMRENGISRLPVVDEDGKLVGIVTLRDLLRAL 55 (57) T ss_pred CCCCCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHH T ss_conf 9422779998997909099999999853995799992799499999789999986 No 214 >PRK08904 consensus Probab=98.19 E-value=2.7e-05 Score=57.41 Aligned_cols=108 Identities=25% Similarity=0.394 Sum_probs=86.3 Q ss_pred CHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCC Q ss_conf 01567899987313666998248411388999999999857987245146469999999985487589961027876653 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTT 311 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtT 311 (493) ++..+-+++|+++|..++=|-. -+...++.|+.++++||++.|=+|-|-|.+.++..+++||+.+ +.|| + T Consensus 22 ~~a~~~a~al~~~Gi~~iEiTl---rtp~a~~~i~~l~~~~p~~~vGaGTVl~~e~~~~a~~aGA~Fi---VSP~---~- 91 (207) T PRK08904 22 STAVDLSRALVEGGIPTLEITL---RTPVGLDAIRLIAKEVPNAIVGAGTVTNPEQLKAVEDAGAVFA---ISPG---L- 91 (207) T ss_pred HHHHHHHHHHHHCCCCEEEEEC---CCCHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHCCCCEE---ECCC---C- T ss_conf 9999999999987998899957---9913999999999868987685531368999999998499999---8489---9- Q ss_pred CHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEE Q ss_conf 10001574310037889998622698199726543777621336714878993 Q gi|254780889|r 312 RVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMI 364 (493) Q Consensus 312 r~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~ 364 (493) ..+..+.+++.++|.|- |+-++.+|.+|+.+|++.|=+ T Consensus 92 -------------~~~v~~~a~~~~i~~iP--Gv~TpsEi~~A~~~G~~~vK~ 129 (207) T PRK08904 92 -------------HESLAKAGHNSGIPLIP--GVATPGEIQLALEHGIDTLKL 129 (207) T ss_pred -------------CHHHHHHHHHCCCCEEC--CCCCHHHHHHHHHCCCCEEEE T ss_conf -------------89999999983997657--869999999999879998997 No 215 >cd00452 KDPG_aldolase KDPG and KHG aldolase. This family belongs to the class I adolases whose reaction mechanism involves Schiff base formation between a substrate carbonyl and lysine residue in the active site. 2-keto-3-deoxy-6-phosphogluconate (KDPG) aldolase, is best known for its role in the Entner-Doudoroff pathway of bacteria, where it catalyzes the reversible cleavage of KDPG to pyruvate and glyceraldehyde-3-phosphate. 2-keto-4-hydroxyglutarate (KHG) aldolase, which has enzymatic specificity toward glyoxylate, forming KHG in the presence of pyruvate, and is capable of regulating glyoxylate levels in the glyoxylate bypass, an alternate pathway when bacteria are grown on acetate carbon sources. Probab=98.19 E-value=2.1e-05 Score=58.23 Aligned_cols=108 Identities=30% Similarity=0.485 Sum_probs=88.7 Q ss_pred CHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCC Q ss_conf 01567899987313666998248411388999999999857987245146469999999985487589961027876653 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTT 311 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtT 311 (493) ++..+.+++|+++|+.++=|-.. +...++.++.+++.||++.|=+|-|-|.+.++..+++||+.+ +.|| + T Consensus 16 ~~a~~~~~al~~~Gi~~iEitl~---t~~a~~~i~~l~~~~~~~~iGaGTV~~~~~~~~a~~aGa~Fi---vsP~---~- 85 (190) T cd00452 16 EDALALAEALIEGGIRAIEITLR---TPGALEAIRALRKEFPEALIGAGTVLTPEQADAAIAAGAQFI---VSPG---L- 85 (190) T ss_pred HHHHHHHHHHHHCCCCEEEEECC---CCHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHCCCCEE---ECCC---C- T ss_conf 99999999999869988999678---802999999999868980896523477999999998599899---7377---9- Q ss_pred CHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEE Q ss_conf 10001574310037889998622698199726543777621336714878993 Q gi|254780889|r 312 RVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMI 364 (493) Q Consensus 312 r~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~ 364 (493) -.+..+.++++++|.|- |+-++.++.+|+.+|++.|.+ T Consensus 86 -------------~~~v~~~a~~~~~~~iP--Gv~TpsEi~~A~~~G~~~vK~ 123 (190) T cd00452 86 -------------DPEVVKAANRAGIPLLP--GVATPTEIMQALELGADIVKL 123 (190) T ss_pred -------------CHHHHHHHHHCCCCEEC--CCCCHHHHHHHHHCCCCEEEE T ss_conf -------------99999999982996657--879999999999879998998 No 216 >cd04622 CBS_pair_9 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=98.17 E-value=2.6e-06 Score=64.64 Aligned_cols=49 Identities=33% Similarity=0.546 Sum_probs=33.8 Q ss_pred CCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHH Q ss_conf 88152878994654303456566760504167866886027887235450 Q gi|254780889|r 96 SGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDV 145 (493) Q Consensus 96 ~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDi 145 (493) ..|..+|+++.|++++.+|+++|.+++++.+||+|++ ++++|+||.+|+ T Consensus 62 ~iM~~~~i~v~~~~~l~~a~~~m~~~~i~~lpVvd~~-~~~vGiIt~~Di 110 (113) T cd04622 62 DVMTRGVVTVTEDDDVDEAARLMREHQVRRLPVVDDD-GRLVGIVSLGDL 110 (113) T ss_pred HHHCCCCEEEECCCCHHHHHHHHHHCCCCEEEEEECC-CEEEEEEEHHHH T ss_conf 8641576799889987999999985796589999089-989999997995 No 217 >PRK13597 imidazole glycerol phosphate synthase subunit HisF; Provisional Probab=98.16 E-value=5.5e-05 Score=55.18 Aligned_cols=182 Identities=20% Similarity=0.187 Sum_probs=112.2 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEE Q ss_conf 58999999986553306457479978688980210001017422326568765556651370156789998731366699 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVV 250 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviv 250 (493) +..+..+.+.+.+...|-+||=++..-|--...++++... +...-.+-||.-|- ..|.++.|+++|+|-++ T Consensus 32 dP~~~a~~~~~~Gad~lhlvDld~a~~~~~~n~~~I~~i~-----~~~~vpiqvGGGIr----s~e~~~~ll~~GadkVi 102 (252) T PRK13597 32 DPVEAARAYDEAGADELVFLDISATHEERAILLDVVARVA-----ERVFIPLTVGGGVR----SLEDARKLLLSGADKVS 102 (252) T ss_pred CHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHH-----HHCCCCEEEECCCC----CHHHHHHHHHCCCCEEE T ss_conf 9999999999869999999956466668663799999998-----62698289847713----08999999856987798 Q ss_pred EECCCCHHHHHHHHHHHHHHHCCCCCEEC-----------------CCCCC----HHHHHHHHHCCCCEEEEEECCCCCC Q ss_conf 82484113889999999998579872451-----------------46469----9999999854875899610278766 Q gi|254780889|r 251 VDTAHGHSQKVLDAVVQIKKNFPSLLVMA-----------------GNIAT----AEGALALIDAGADIIKVGIGPGSIC 309 (493) Q Consensus 251 iD~ahGh~~~~~~~i~~~k~~~~~~~ii~-----------------GNv~t----~~~~~~l~~~Gad~ikvGig~Gs~C 309 (493) |-++--.+ -+.++.+.+.|++-.|++ |...| .+-+..+.+.|+.-+. | T Consensus 103 igS~a~~n---p~~i~~~~~~fG~q~Iv~~iD~~~~~~~~~v~~~~~~~~~~~~~~d~~~~~~~~G~geil--------~ 171 (252) T PRK13597 103 VNSAAVRR---PELIRELADHFGAQAVVLAIDARWRGDFPEVHVAGGRVPTGLHAVEWAVKGVELGAGEIL--------L 171 (252) T ss_pred ECHHHHHC---CHHHHHHHHHCCCCCEEEEEEEEECCCCCEEEECCCEEECCCCHHHHHHHHHHHCCCEEE--------E T ss_conf 32666749---378999998749965299998886189741675387275697699999999964899999--------9 Q ss_pred CCCHHCCCCC-CCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCC Q ss_conf 5310001574-31003788999862269819972654377762133671487899323010121677 Q gi|254780889|r 310 TTRVVTGVGC-PQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESP 375 (493) Q Consensus 310 tTr~~~g~g~-pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Esp 375 (493) |-=..-|... |-+..+... ....++|+||-||+...-|+.+++.+|++.|-+|+.|--..-++ T Consensus 172 tdI~rDGt~~G~d~~l~~~i---~~~~~~pvIasGGv~s~~dl~~l~~~g~~gvi~G~al~~~~~s~ 235 (252) T PRK13597 172 TSMDRDGTKEGYDLRLTRMV---AEAVGVPVIASGGAGRMEHFLEAFQAGAEAALAASVFHFGEIPI 235 (252) T ss_pred EEECCCCCCCCCCHHHHHHH---HHCCCCCEEEECCCCCHHHHHHHHHCCCCEEEEHHHHHCCCCCH T ss_conf 75737684447695999999---85079989997898999999999878996998712767799999 No 218 >cd04603 CBS_pair_KefB_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the KefB (Kef-type K+ transport systems) domain which is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=98.15 E-value=2.8e-06 Score=64.49 Aligned_cols=47 Identities=19% Similarity=0.348 Sum_probs=43.3 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 86325730358999999986553306457479978688980210001 Q gi|254780889|r 162 NLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 162 ~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) .++++++++++.+|+++|.+++++.+||||++++++|+||.+|+++. T Consensus 2 ~pvtv~~~~~L~~a~~~~~e~~~~~i~VVD~~~k~vG~it~~Dll~~ 48 (111) T cd04603 2 QTVSVNCENPLREAIKMINELGARAVVVVDEENKVLGQVTLSDLLEI 48 (111) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHH T ss_conf 44686599959999999987699768998899968899889998643 No 219 >cd04589 CBS_pair_CAP-ED_DUF294_assoc_bac This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the bacterial CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or Probab=98.14 E-value=1.3e-05 Score=59.77 Aligned_cols=48 Identities=31% Similarity=0.431 Sum_probs=38.3 Q ss_pred CEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC Q ss_conf 81528789946543034565667605041678668860278872354500 Q gi|254780889|r 97 GMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR 146 (493) Q Consensus 97 gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir 146 (493) .|..+|+|+.|++++.+|+++|.+++++.+||+|+ ++++|+||.+|+. T Consensus 62 vms~~~~ti~~~~~~~~a~~~M~~~~i~~lpV~d~--~~~vGivt~~Dil 109 (111) T cd04589 62 IATFPLITVDPDDFLFNALLLMTRHRIHRVVVREG--GEVVGVLEQTDLL 109 (111) T ss_pred HCCCCCEEECCCCCHHHHHHHHHHCCCEEEEEEEC--CEEEEEEEHHHHH T ss_conf 28799989999792999999999879808899989--9999999968822 No 220 >cd04801 CBS_pair_M50_like This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the metalloprotease peptidase M50. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=98.12 E-value=1.2e-05 Score=59.87 Aligned_cols=48 Identities=23% Similarity=0.420 Sum_probs=36.4 Q ss_pred CCCCCCCCCCCHHHHHHH-HHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 786325730358999999-986553306457479978688980210001 Q gi|254780889|r 161 RNLITVKKTVNLENAKAL-LHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 161 ~~litv~~~~~l~eA~~i-M~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +++.|++++.++.|+.+. +.+++.+.+||||++|+|+|++|..|+.+. T Consensus 1 Rdv~tv~~~~tl~e~~~~~~~~~~~~~~pVvd~~g~l~Givt~~dl~~~ 49 (114) T cd04801 1 RDFPTVPAHLTLREFVREYVLGSNQRRFVVVDNEGRYVGIISLADLRAI 49 (114) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCCEEEEEEEHHHHHHH T ss_conf 9955849999599999999850796689998789979999999998754 No 221 >PRK02506 dihydroorotate dehydrogenase 1A; Reviewed Probab=98.12 E-value=0.00019 Score=51.37 Aligned_cols=172 Identities=18% Similarity=0.312 Sum_probs=97.9 Q ss_pred HHHHHHHHHHHCCC-CEEEEEC--CC--C------HHHHHHHHHHHHHHHCCCCCEE---CCCCCC---HHHHHHHHHCC Q ss_conf 15678999873136-6699824--84--1------1388999999999857987245---146469---99999998548 Q gi|254780889|r 233 DIADRVGPLFDVNV-DLVVVDT--AH--G------HSQKVLDAVVQIKKNFPSLLVM---AGNIAT---AEGALALIDAG 295 (493) Q Consensus 233 d~~eRa~~LveaGv-DviviD~--ah--G------h~~~~~~~i~~~k~~~~~~~ii---~GNv~t---~~~~~~l~~~G 295 (493) ++.+-++.+.++++ |++-+.+ .| | +.+.+.+.++.+++... .|++ ..++.. .+.+..+-+.+ T Consensus 106 e~~~~~~~~~~~~~~~~ielNiScPNt~g~~~~~~d~~~~~~il~~v~~~~~-~Pi~vKlsP~~~~~~~~~~a~~~~~~~ 184 (308) T PRK02506 106 ETHTILKKIQASDFEGLVELNLSCPNVPGKPQIAYDFETTDQILTEVFTYFT-KPLGVKLPPYFDIVHFDQAAAIFNKYP 184 (308) T ss_pred HHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHCCCHHHHHHHHHHHHHHHH-CCCCCCCCCCCCHHHHHHHHHHHCCCC T ss_conf 7788899987547542554633378851055552289999999999998750-333455898777676999999856156 Q ss_pred CCEEEE--EECCCCCCCCCHH-----C---CC-CCC-CCHHHHHHHHHHHC--CCCEEEEECCCCCHHHHEEEEEECCCE Q ss_conf 758996--1027876653100-----0---15-743-10037889998622--698199726543777621336714878 Q gi|254780889|r 296 ADIIKV--GIGPGSICTTRVV-----T---GV-GCP-QLSAIMSVVEVAER--AGVAIVADGGIRFSGDIAKAIAAGSAC 361 (493) Q Consensus 296 ad~ikv--Gig~Gs~CtTr~~-----~---g~-g~p-q~~av~~~~~~~~~--~~~~iiadGGi~~~gdi~kAla~GA~~ 361 (493) ++++-. .+++|-...++.. . |. |.| .-.++..+++..+. .++|||+-|||.+.-|+.+-|.|||++ T Consensus 185 ~~~i~~~nt~~~~~~i~~~~~~~~~~~~~GGlSG~~l~~~al~~v~~~~~~~~~~i~IIg~GGI~s~~Da~e~i~aGAs~ 264 (308) T PRK02506 185 LAFVNCVNSIGNGLVIEDESVVIKPKNGFGGIGGDYIKPTALANVHAFYQRLKPSIQIIGTGGVKTGRDAFEHILCGASM 264 (308) T ss_pred CCEEEEEECCCCCCEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHCCCCC T ss_conf 53798870235662013775101567887887761133799999999999838996389866707899999999819872 Q ss_pred EEECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHH Q ss_conf 99323010121677644873680588873320068875221222101233343310045402662105758998999999 Q gi|254780889|r 362 VMIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSG 441 (493) Q Consensus 362 VM~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~ 441 (493) |.+++-|. |.|. .++..+.. T Consensus 265 VQv~Tal~-------------------~~Gp-----------------------------------------~~~~~I~~ 284 (308) T PRK02506 265 VQVGTALH-------------------KEGP-----------------------------------------AIFERITK 284 (308) T ss_pred EEEEEEEE-------------------EECC-----------------------------------------HHHHHHHH T ss_conf 06842220-------------------4594-----------------------------------------79999999 Q ss_pred HHHHHHHHCCCCCHHHHHHCCEEE Q ss_conf 999855515758776897375999 Q gi|254780889|r 442 GLKSSMGYVGASNIEEFQKKANFI 465 (493) Q Consensus 442 glrs~~~y~G~~~i~e~~~~~~f~ 465 (493) +|+.-|.--|..+|.|++.+.+|. T Consensus 285 ~L~~~l~~~G~~si~d~~G~~~~~ 308 (308) T PRK02506 285 ELKAIMVEKGYQSLEDFRGKLKYL 308 (308) T ss_pred HHHHHHHHCCCCCHHHHCCEEECC T ss_conf 999999984999889965442019 No 222 >cd04606 CBS_pair_Mg_transporter This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain in the magnesium transporter, MgtE. MgtE and its homologs are found in eubacteria, archaebacteria, and eukaryota. Members of this family transport Mg2+ or other divalent cations into the cell via two highly conserved aspartates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=98.12 E-value=3.2e-06 Score=64.01 Aligned_cols=50 Identities=32% Similarity=0.550 Sum_probs=43.8 Q ss_pred CCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC Q ss_conf 881528789946543034565667605041678668860278872354500 Q gi|254780889|r 96 SGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR 146 (493) Q Consensus 96 ~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir 146 (493) +.|..+|+++.|++++.+|.++|.+++++.+||+|++ |+|+|+||..|+. T Consensus 57 ~iM~~~~~~v~~~~~~~~a~~~m~~~~~~~lPVVd~~-~~lvGiIt~~Di~ 106 (109) T cd04606 57 DIMDTDVISVSADDDQEEVARLFEKYDLLALPVVDEE-GRLVGIITVDDVI 106 (109) T ss_pred HHHCCCEEECCCCCCHHHHHHHHHHHCCCEEEEECCC-CEEEEEEEHHHHH T ss_conf 6300213633799989999999998527604689889-9799999968968 No 223 >pfam01081 Aldolase KDPG and KHG aldolase. This family includes the following members: 4-hydroxy-2-oxoglutarate aldolase (KHG-aldolase) Phospho-2-dehydro-3-deoxygluconate aldolase (KDPG-aldolase) Probab=98.11 E-value=3.7e-05 Score=56.42 Aligned_cols=108 Identities=21% Similarity=0.338 Sum_probs=76.9 Q ss_pred CHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCC Q ss_conf 01567899987313666998248411388999999999857987245146469999999985487589961027876653 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTT 311 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtT 311 (493) ++..+.+++|+++|+.++=|-. .+...++.|+.+++++|++.|=+|-|-|.|.++..+++||+.+ +.|| + T Consensus 20 ~~a~~~~~al~~~Gi~~iEiTl---~t~~a~~~I~~l~~~~p~~~iGaGTV~~~e~~~~a~~aGA~Fi---vSP~---~- 89 (196) T pfam01081 20 EDALPLAEALAAGGIRVLEVTL---RTPCALDAIRLLRKNRPDALVGAGTVLNAQQLAEAAEAGAQFV---VSPG---L- 89 (196) T ss_pred HHHHHHHHHHHHCCCCEEEEEC---CCHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHCCCCEE---ECCC---C- T ss_conf 9999999999987998899947---9827999999999649996799983768999999997499999---9787---6- Q ss_pred CHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEE Q ss_conf 10001574310037889998622698199726543777621336714878993 Q gi|254780889|r 312 RVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMI 364 (493) Q Consensus 312 r~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~ 364 (493) -.+..++++++++|.|- |+-++.+|.+|+.+|++.|=+ T Consensus 90 -------------~~~v~~~a~~~~i~~iP--Gv~TpsEi~~A~~~G~~~vKl 127 (196) T pfam01081 90 -------------TADLLKHAVDVKIPLIP--GVSTPSEIMLGLDLGLTRFKF 127 (196) T ss_pred -------------HHHHHHHHHHCCCCEEC--CCCCHHHHHHHHHCCCCEEEE T ss_conf -------------39999999973996637--859999999999879998997 No 224 >PRK11543 gutQ D-arabinose 5-phosphate isomerase; Provisional Probab=98.11 E-value=7e-06 Score=61.61 Aligned_cols=156 Identities=21% Similarity=0.247 Sum_probs=101.7 Q ss_pred CCCHHEEECCCCCCCCCCCEEEEEEECCCCEECCCEEE--ECCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHH Q ss_conf 46000575276432475422533430388160031783--3785236888999999779879997898989999999987 Q gi|254780889|r 14 LTFDDVLLRPEFSNVLPRDIDISTRIAKDFTLNLPIMS--AAMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQV 91 (493) Q Consensus 14 lTfDDVllvP~~s~v~~~~vdl~t~lt~~i~l~iPivS--s~MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~kV 91 (493) ++=+|+++.=++|--...-+++--. .++ .++|+++ ++-|. . +|+.--+-+. ++.+++ T Consensus 87 i~~~D~~i~~S~SG~t~El~~~~~~-~k~--~~~~ii~it~~~~S----~----Lak~sd~~l~---~~~~~E------- 145 (321) T PRK11543 87 IESRDVMLFISYSGGAKELDLIIPR-LED--KSIALLAMTGKPTS----P----LGLAAKAVLD---ISVERE------- 145 (321) T ss_pred CCCCCEEEEEECCCCCHHHHHHHHH-HHH--CCCCEEEEECCCCC----H----HHHHCCEEEE---CCCCCC------- T ss_conf 8789989999589881778877278-876--69868999789999----7----6882694897---266566------- Q ss_pred HHHCCCEECCCEEEECCCC------HHHH--HHHHHHHCCCCEEEECCCC--EEEEEEEEHHHHCCCCCCCCCCCCCCCC Q ss_conf 4223881528789946543------0345--6566760504167866886--0278872354500143446724400267 Q gi|254780889|r 92 KKFESGMVVNPVTISPYAT------LADA--LALMKKYSISGIPVVESDV--GKLVGILTNRDVRFASNAQQAVGELMTR 161 (493) Q Consensus 92 Kr~e~gmI~dPVti~pd~T------I~ea--~~lm~~~~is~iPVVD~~~--gkLiGIVT~rDir~~~~~~~~V~eiMt~ 161 (493) -+|..+-|.+| +.|| ..+|.+++|+. ++. .++-|-+- +.+..+|+++|.+ T Consensus 146 --------acp~~LaPTtStt~~lalGDALAv~lm~~r~F~~-----~dFa~~HPgG~LG-------k~Ll~~V~dlM~~ 205 (321) T PRK11543 146 --------ACPMHLAPTSSTVNTLMMGDALAMAVMQARGFNE-----EDFARSHPAGALG-------ARLLNKVHHLMRR 205 (321) T ss_pred --------CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCH-----HHHHHCCCCCHHH-------HHHHHHHHHHHHC T ss_conf --------6656777508999999998899999999959998-----7897609898888-------9999999999733 Q ss_pred --CCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCC Q ss_conf --8632573035899999998655330645747997868898021000101 Q gi|254780889|r 162 --NLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 162 --~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) ++..+++++++.||...|.+.+...+.|||++++|+|+||-.|+-|... T Consensus 206 ~~~lP~v~~~~~l~dai~~ms~k~lG~v~Vvd~~~kL~GIITDGDLRR~l~ 256 (321) T PRK11543 206 GDAIPQVALTASVMDAMLELSRTGLGLVAVCDAQQQVKGVFTDGDLRRWLV 256 (321) T ss_pred CCCCCCCCCCCCHHHHHHHHCCCCCCEEEEECCCCCEEEEEECHHHHHHHH T ss_conf 557898999987999999966488617998337750688874438999986 No 225 >COG3010 NanE Putative N-acetylmannosamine-6-phosphate epimerase [Carbohydrate transport and metabolism] Probab=98.11 E-value=4e-05 Score=56.20 Aligned_cols=129 Identities=23% Similarity=0.365 Sum_probs=90.7 Q ss_pred HHHHHHHHHHCCCCEEEEECCC-CHHH-HHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCC Q ss_conf 5678999873136669982484-1138-8999999999857987245146469999999985487589961027876653 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVDTAH-GHSQ-KVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTT 311 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD~ah-Gh~~-~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtT 311 (493) +.+-+.+|+++|+++|.+|.-- ---. ..-+.+++ .++|+..+|| .+.|.|-..+-.++|+|.|---+ |--|. T Consensus 87 tlkeVd~L~~~Ga~IIA~DaT~R~RP~~~~~~~i~~--~k~~~~l~MA-D~St~ee~l~a~~~G~D~IGTTL---sGYT~ 160 (229) T COG3010 87 TLKEVDALAEAGADIIAFDATDRPRPDGDLEELIAR--IKYPGQLAMA-DCSTFEEGLNAHKLGFDIIGTTL---SGYTG 160 (229) T ss_pred CHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHH--HHCCCCEEEE-CCCCHHHHHHHHHCCCCEEECCC---CCCCC T ss_conf 189999999779909996255687984359999997--3357947873-25988888889973996782242---01468 Q ss_pred CHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 10001574310037889998622698199726543777621336714878993230101216 Q gi|254780889|r 312 RVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 312 r~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) .-. -.--|-+..+.++.+ .+.++||.|.+.++-...||+-.||++|-.||..-.-.| T Consensus 161 ~~~-~~~~pDf~lvk~l~~----~~~~vIAEGr~~tP~~Ak~a~~~Ga~aVvVGsAITRp~~ 217 (229) T COG3010 161 YTE-KPTEPDFQLVKQLSD----AGCRVIAEGRYNTPEQAKKAIEIGADAVVVGSAITRPEE 217 (229) T ss_pred CCC-CCCCCCHHHHHHHHH----CCCEEEEECCCCCHHHHHHHHHHCCEEEEECCCCCCHHH T ss_conf 998-778972899999986----799399517879999999999718808998743378799 No 226 >TIGR03572 WbuZ glycosyl amidation-associated protein WbuZ. This clade of sequences is highly similar to the HisF protein, but generally represents the second HisF homolog in the genome where the other is an authentic HisF observed in the context of a complete histidine biosynthesis operon. The similarity between these WbuZ sequences and true HisFs is such that often the closest match by BLAST of a WbuZ is a HisF. Only by making a multiple sequence alignment is the homology relationship among the WbuZ sequences made apparent. WbuZ genes are invariably observed in the presence of a homolog of the HisH protein (designated WbuY) and a proposed N-acetyl sugar amidotransferase designated in WbuX in E. coli, IfnA in P. aeriginosa and PseA in C. jejuni. Similarly, this trio of genes is invariably found in the context of saccharide biosynthesis loci. It has been shown that the WbuYZ homologs are not essential components of the activity expressed by WbuX, leading to the proposal that these to pr Probab=98.11 E-value=7.8e-05 Score=54.08 Aligned_cols=177 Identities=21% Similarity=0.250 Sum_probs=109.0 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEE Q ss_conf 58999999986553306457479978688980210001017422326568765556651370156789998731366699 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVV 250 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviv 250 (493) +..++.+.+.+.....+-+||=++..-|---..++++.. ++...-.+-+|.-|- ..|.++.|++.|+|-++ T Consensus 31 dP~~~ak~~~~~g~d~lhivDld~a~~~~~~n~~~I~~i-----~~~~~ipi~vGGGIr----s~e~~~~ll~~GadkVi 101 (232) T TIGR03572 31 DPVNAARIYNAKGADELIVLDIDASKRGREPLFELISNL-----AEECFMPLTVGGGIR----SLEDAKKLLSLGADKVS 101 (232) T ss_pred CHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHH-----HHHCCCCEEEEECEE----EHHHHHHHHHCCCCEEE T ss_conf 999999999986999999996876434882179999999-----997298589971330----38999999976996899 Q ss_pred EECCCCHHHHHHHHHHHHHHHCCCCCEECC------------CCCC-----------HHHHHHHHHCCCCEEEEEECCCC Q ss_conf 824841138899999999985798724514------------6469-----------99999998548758996102787 Q gi|254780889|r 251 VDTAHGHSQKVLDAVVQIKKNFPSLLVMAG------------NIAT-----------AEGALALIDAGADIIKVGIGPGS 307 (493) Q Consensus 251 iD~ahGh~~~~~~~i~~~k~~~~~~~ii~G------------Nv~t-----------~~~~~~l~~~Gad~ikvGig~Gs 307 (493) +.++--- -.+.++.+-+.|++.-|+++ .|.| .|-...+.+.|+-.+.+- T Consensus 102 igs~a~~---~p~~~~~~~~~~G~q~ivvsiD~k~~~~~~~~~v~~~g~~~~~~~~~~~~i~~~~~~g~geii~t----- 173 (232) T TIGR03572 102 INTAALE---NPDLIEEAARRFGSQCVVVSIDVKKELDGSDYKVYSDNGRRATGRDPVEWAREAEQLGAGEILLN----- 173 (232) T ss_pred ECHHHHH---CCHHHHHHHHHCCCCCEEEEEEEECCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHCCCCEEEEE----- T ss_conf 3454521---93577899998699458999998416778727999667763579879999999873599899998----- Q ss_pred CCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEE-EECCCEEEECCHHH Q ss_conf 665310001574310037889998622698199726543777621336-71487899323010 Q gi|254780889|r 308 ICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAI-AAGSACVMIGSLLA 369 (493) Q Consensus 308 ~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAl-a~GA~~VM~G~~~a 369 (493) ...|.=+.-| |.+..+.+. +...++|+||-||+++.-|+.+++ ..|.++|..|++|- T Consensus 174 -dI~~DG~~~G-~d~~l~~~i---~~~~~~piiasGGi~~~~di~~l~~~~~~~gv~~gs~f~ 231 (232) T TIGR03572 174 -SIDRDGTMKG-YDLELIKTV---SDAVSIPVIALGGAGSLDDLVEVALEAGASAVAAASLFH 231 (232) T ss_pred -EECCCCCCCC-CCHHHHHHH---HHHCCCCEEEECCCCCHHHHHHHHHHCCCEEEEEEECCC T ss_conf -8857685676-899999999---986899999988989999999999858981999721144 No 227 >TIGR01182 eda 2-dehydro-3-deoxyphosphogluconate aldolase/4-hydroxy-2-oxoglutarate aldolase; InterPro: IPR000887 4-Hydroxy-2-oxoglutarate aldolase (4.1.3.16 from EC) (KHG-aldolase) catalyzes the interconversion of 4-hydroxy-2-oxoglutarate into pyruvate and glyoxylate. Phospho-2-dehydro-3-deoxygluconate aldolase (4.1.2.14 from EC) (KDPG-aldolase) catalyzes the interconversion of 6-phospho-2-dehydro-3-deoxy-D-gluconate into pyruvate and glyceraldehyde 3-phosphate. These two enzymes are structurally and functionally related . They are both homotrimeric proteins of approximately 220 amino-acid residues. They are class I aldolases whose catalytic mechanism involves the formation of a Schiff-base intermediate between the substrate and the epsilon-amino group of a lysine residue. In both enzymes, an arginine is required for catalytic activity.; GO: 0003824 catalytic activity, 0008152 metabolic process. Probab=98.11 E-value=3.3e-05 Score=56.79 Aligned_cols=106 Identities=24% Similarity=0.415 Sum_probs=90.3 Q ss_pred CHHHHHHHHHHHCCCCEEEE--ECCCCHHHHHHHHHHHHHHHCC-CCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCC Q ss_conf 01567899987313666998--2484113889999999998579-87245146469999999985487589961027876 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVV--DTAHGHSQKVLDAVVQIKKNFP-SLLVMAGNIATAEGALALIDAGADIIKVGIGPGSI 308 (493) Q Consensus 232 ~d~~eRa~~LveaGvDvivi--D~ahGh~~~~~~~i~~~k~~~~-~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~ 308 (493) ++..+-|++|+|.|+-++=| -+.+ ..|.|+.|++++| ++-|=||=|-|.|++++-.+||||.+ |-||.- T Consensus 20 ~~A~~lA~aL~egG~~~~EvTlRT~~-----A~~aI~~l~~~~P~~~~iGAGTVL~~~Q~~~A~~AGA~F~---vSPG~~ 91 (205) T TIGR01182 20 EDALPLAKALIEGGLRVLEVTLRTPV-----ALEAIRALRKEVPKDALIGAGTVLNPEQLRQAVAAGAQFI---VSPGLT 91 (205) T ss_pred HHHHHHHHHHHHCCCEEEEEEECCCC-----HHHHHHHHHHHCCCCCEECCCCCCCHHHHHHHHHCCCCEE---ECCCCC T ss_conf 77789999998679808988514721-----6899999997282334871676489899999997089578---769788 Q ss_pred CCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEE Q ss_conf 65310001574310037889998622698199726543777621336714878993 Q gi|254780889|r 309 CTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMI 364 (493) Q Consensus 309 CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~ 364 (493) -+.++.+++.++|+|= |+-++..|.+||-.|.+.+=+ T Consensus 92 -----------------p~l~~~~~~~~~P~iP--GV~tpsEi~~Al~~G~~~lKl 128 (205) T TIGR01182 92 -----------------PELAKHAKDKGIPIIP--GVATPSEIMLALELGITALKL 128 (205) T ss_pred -----------------HHHHHHHHHCCCCEEC--CCCCHHHHHHHHHCCCCEEEE T ss_conf -----------------8999998508881217--776878999998757746521 No 228 >cd02931 ER_like_FMN Enoate reductase (ER)-like FMN-binding domain. Enoate reductase catalyzes the NADH-dependent reduction of carbon-carbon double bonds of several molecules, including nonactivated 2-enoates, alpha,beta-unsaturated aldehydes, cyclic ketones, and methylketones. ERs are similar to 2,4-dienoyl-CoA reductase from E. coli and to the old yellow enzyme from Saccharomyces cerevisiae. Probab=98.10 E-value=2.9e-05 Score=57.17 Aligned_cols=139 Identities=17% Similarity=0.177 Sum_probs=92.6 Q ss_pred HHHHHHHHHHHCCCCEEEEECCC-CHH------------------------HHHHHHHHHHHHHCC-CCCEEC------- Q ss_conf 15678999873136669982484-113------------------------889999999998579-872451------- Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVVDTAH-GHS------------------------QKVLDAVVQIKKNFP-SLLVMA------- 279 (493) Q Consensus 233 d~~eRa~~LveaGvDviviD~ah-Gh~------------------------~~~~~~i~~~k~~~~-~~~ii~------- 279 (493) ++.+-|+...+||.|.|-|--|| ||- +..++.++.+|+..+ +.+|.. T Consensus 151 ~f~~AA~rA~~AGfDgVEIH~ah~GyLl~qFlSp~~N~RtDeYGGSlenR~Rf~~Evi~aVR~~vg~d~~v~~R~s~~~~ 230 (382) T cd02931 151 KFGESAVIAKEAGFDGVEIHAVHEGYLLDQFTISLFNKRTDKYGGSLENRLRFAIEIVEEIKARCGEDFPVSLRYSVKSY 230 (382) T ss_pred HHHHHHHHHHHCCCCEEEECCCCCCHHHHHHCCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHHCCCCEEEEEECCCCC T ss_conf 99999999998499989962453035899854873589886458987885618999999999970988738999656334 Q ss_pred ------CCCC---------C----HHHHHHHHHCCCCEEEEEECCCC-CCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEE Q ss_conf ------4646---------9----99999998548758996102787-66531000157431003788999862269819 Q gi|254780889|r 280 ------GNIA---------T----AEGALALIDAGADIIKVGIGPGS-ICTTRVVTGVGCPQLSAIMSVVEVAERAGVAI 339 (493) Q Consensus 280 ------GNv~---------t----~~~~~~l~~~Gad~ikvGig~Gs-~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~i 339 (493) |..- + .+.++.|.++|.|.+-|-.|.-. ..-+ ......++-.-+..+..+.+..++|| T Consensus 231 ~~~~~~g~~~~~~~~~~g~~l~e~~~~~~~l~~~G~D~l~vs~g~~~~~~~~--~~~~~~~~g~~~~~a~~ik~~~~iPv 308 (382) T cd02931 231 IKDLRQGALPGEEFQEKGRDLEEGLKAAKILEEAGYDALDVDAGSYDAWYWN--HPPMYQKKGMYLPYCKALKEVVDVPV 308 (382) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCHHCC--CCCCCCCCCCCHHHHHHHHHHCCCCE T ss_conf 5665457885777888763599999999999983988896477742110103--79754676314899999998739988 Q ss_pred EEECCCCCHHHHEEEEEEC-CCEEEECCHHHCCCC Q ss_conf 9726543777621336714-878993230101216 Q gi|254780889|r 340 VADGGIRFSGDIAKAIAAG-SACVMIGSLLAGTDE 373 (493) Q Consensus 340 iadGGi~~~gdi~kAla~G-A~~VM~G~~~agt~E 373 (493) |+-|||+++-++-++|+-| ||.|.||..|-.--+ T Consensus 309 i~~G~i~~p~~ae~~l~~g~aD~V~~gR~~iadP~ 343 (382) T cd02931 309 IMAGRMEDPELASEAINEGIADMISLGRPLLADPD 343 (382) T ss_pred EEECCCCCHHHHHHHHHCCCCCHHHHHHHHHHCCC T ss_conf 99689699999999998699654362289886935 No 229 >PRK09140 2-dehydro-3-deoxy-6-phosphogalactonate aldolase; Reviewed Probab=98.09 E-value=2.3e-05 Score=57.97 Aligned_cols=109 Identities=21% Similarity=0.379 Sum_probs=83.3 Q ss_pred CCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCC-CCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCC Q ss_conf 70156789998731366699824841138899999999985798-72451464699999999854875899610278766 Q gi|254780889|r 231 AKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPS-LLVMAGNIATAEGALALIDAGADIIKVGIGPGSIC 309 (493) Q Consensus 231 ~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~-~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~C 309 (493) .++..+.+++|+++|..++=|-. .+-..++.|+.++++||+ +.|=+|-|-|.|.++..+++||+.+ +.|+ T Consensus 21 ~~~a~~~~~al~~~Gi~~iEVTl---~tp~a~~~I~~l~~~~~~~~~iGAGTVlt~e~~~~ai~aGA~Fi---VSP~--- 91 (206) T PRK09140 21 PDEALAHVGALIEAGFRAIEIPL---NSPDPFDSIAALVKALGDDALIGAGTVLSPEQVDRLADAGGRLI---VTPN--- 91 (206) T ss_pred HHHHHHHHHHHHHCCCCEEEEEC---CCCCHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHCCCCEE---ECCC--- T ss_conf 99999999999986998899917---99769999999999679865998620467999999998599999---9999--- Q ss_pred CCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEE Q ss_conf 5310001574310037889998622698199726543777621336714878993 Q gi|254780889|r 310 TTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMI 364 (493) Q Consensus 310 tTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~ 364 (493) ++ .+..++++++++|.|- |+-++.++..|+.+||+.|=+ T Consensus 92 ~~--------------~~vi~~a~~~~i~~iP--G~~TPsEi~~A~~~Ga~~vKl 130 (206) T PRK09140 92 ID--------------PEVIRRAVAYGMTVMP--GVATPTEAFAALRAGADALKL 130 (206) T ss_pred CC--------------HHHHHHHHHCCCCCCC--CCCCHHHHHHHHHCCCCEEEE T ss_conf 98--------------9999999982996527--859999999999859871565 No 230 >cd04611 CBS_pair_PAS_GGDEF_DUF1_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with a PAS domain, a GGDEF (DiGuanylate-Cyclase (DGC) domain, and a DUF1 domain downstream. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CB Probab=98.08 E-value=4.5e-06 Score=63.01 Aligned_cols=47 Identities=34% Similarity=0.547 Sum_probs=28.6 Q ss_pred EECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHH Q ss_conf 152878994654303456566760504167866886027887235450 Q gi|254780889|r 98 MVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDV 145 (493) Q Consensus 98 mI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDi 145 (493) |..+|+++.|++++.+|+++|.+++++.+||+|++ |+++|+||.+|+ T Consensus 62 M~~~~~tv~~~~~~~~a~~~m~~~~i~~lpVvd~~-~~~vGiiT~~Di 108 (111) T cd04611 62 MSSPLLTVPADTSLYDARQLMREHGIRHLVVVDDD-GELLGLLSQTDL 108 (111) T ss_pred HCCCCEEEECCCCHHHHHHHHHHHCCEEEEEECCC-CEEEEEEEHHHH T ss_conf 10683399789999999999997187399999469-989999985684 No 231 >cd04731 HisF The cyclase subunit of imidazoleglycerol phosphate synthase (HisF). Imidazole glycerol phosphate synthase (IGPS) catalyzes the fifth step of histidine biosynthesis, the formation of the imidazole ring. IGPS converts N1-(5'-phosphoribulosyl)-formimino-5-aminoimidazole-4-carboxamide ribonucleotide (PRFAR) to imidazole glycerol phosphate (ImGP) and 5'-(5-aminoimidazole-4-carboxamide) ribonucleotide (AICAR). This conversion involves two tightly coupled reactions in distinct active sites of IGPS. The two catalytic domains can be fused, like in fungi and plants, or peformed by a heterodimer (HisH-glutaminase and HisF-cyclase), like in bacteria. Probab=98.08 E-value=5.5e-05 Score=55.17 Aligned_cols=179 Identities=18% Similarity=0.207 Sum_probs=110.9 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEE Q ss_conf 58999999986553306457479978688980210001017422326568765556651370156789998731366699 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVV 250 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviv 250 (493) +..+..+.+.+.....+-+||=++...|--...++++.. ++.-.-.+-+|..|- ..+.++.++++|+|-++ T Consensus 28 dP~~~a~~~~~~gadelhivDld~a~~g~~~n~~~i~~i-----~~~~~~pi~vGGGIr----s~~~~~~~l~~GadkVv 98 (243) T cd04731 28 DPVELAKRYNEQGADELVFLDITASSEGRETMLDVVERV-----AEEVFIPLTVGGGIR----SLEDARRLLRAGADKVS 98 (243) T ss_pred CHHHHHHHHHHCCCCEEEEECCCCCCCCCCCHHHHHHHH-----HHHCCCCEEEEEEEE----EHHHHHHHHHCCCCEEE T ss_conf 999999999986999999970673203770079999999-----986798689985066----47999999977997899 Q ss_pred EECCCCHHHHHHHHHHHHHHHCCC------CCEE-----CCCCCC-----------HHHHHHHHHCCCCEEEEEECCCCC Q ss_conf 824841138899999999985798------7245-----146469-----------999999985487589961027876 Q gi|254780889|r 251 VDTAHGHSQKVLDAVVQIKKNFPS------LLVM-----AGNIAT-----------AEGALALIDAGADIIKVGIGPGSI 308 (493) Q Consensus 251 iD~ahGh~~~~~~~i~~~k~~~~~------~~ii-----~GNv~t-----------~~~~~~l~~~Gad~ikvGig~Gs~ 308 (493) |.++--.+ .+.++++.+.|++ +++. -+.+.+ .+-...+.+.|+.-+.+- T Consensus 99 igs~~~~n---~~~~~~~~~~~Gsq~Iv~siD~k~~~~~~~~v~~~~~~~~~~~~~~~~i~~~~~~G~geil~t------ 169 (243) T cd04731 99 INSAAVEN---PELIREIAKRFGSQCVVVSIDAKRRGDGGYEVYTHGGRKPTGLDAVEWAKEVEELGAGEILLT------ 169 (243) T ss_pred ECCCCCCC---CCHHHHHHHHCCCCCEEEEEEEEECCCCCEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEE------ T ss_conf 89844237---714357887569930999999765378962898469844126789999999984698789998------ Q ss_pred CCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEE-ECCCEEEECCHHHCC Q ss_conf 653100015743100378899986226981997265437776213367-148789932301012 Q gi|254780889|r 309 CTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIA-AGSACVMIGSLLAGT 371 (493) Q Consensus 309 CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla-~GA~~VM~G~~~agt 371 (493) +..|.=|..| |.+..+...+ ...++|||+-||+++.-|+.++|- .|++.|.+|++|--. T Consensus 170 dI~~DGt~~G-~d~~l~~~i~---~~~~~piI~sGGi~~~~di~~~l~~~~~~gv~~g~~~~~~ 229 (243) T cd04731 170 SMDRDGTKKG-YDLELIRAVS---SAVNIPVIASGGAGKPEHFVEAFEEGGADAALAASIFHFG 229 (243) T ss_pred EECCCCCCCC-CCHHHHHHHH---HHCCCCEEEECCCCCHHHHHHHHHHCCCEEEEEHHHHHCC T ss_conf 7257685665-7999999999---8689999998899999999999987898299882276769 No 232 >cd04802 CBS_pair_3 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=98.08 E-value=5.1e-06 Score=62.62 Aligned_cols=46 Identities=43% Similarity=0.667 Sum_probs=24.9 Q ss_pred EECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHH Q ss_conf 152878994654303456566760504167866886027887235450 Q gi|254780889|r 98 MVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDV 145 (493) Q Consensus 98 mI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDi 145 (493) |..+|++++|++++.+|+++|.+++++++||+|+ |+++||||..|+ T Consensus 64 M~~~~~tv~~~~~l~~a~~~m~~~~~~~lpVvd~--~~lvGiIt~~Di 109 (112) T cd04802 64 MSTPLITIDPNASLNEAAKLMAKHGIKRLPVVDD--DELVGIVTTTDI 109 (112) T ss_pred HCCCCEEECCCCHHHHHHHHHHHHCCCEEEEEEC--CEEEEEEECHHH T ss_conf 2348559947986999999878719858999989--999999980671 No 233 >PRK05211 consensus Probab=98.06 E-value=6.2e-05 Score=54.81 Aligned_cols=177 Identities=13% Similarity=0.136 Sum_probs=110.4 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEE Q ss_conf 58999999986553306457479978688980210001017422326568765556651370156789998731366699 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVV 250 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviv 250 (493) +..++.+.+.+.+.+.|-+||=++..-|--...++++.... --.=-|-||.-| + ..+.++.|+++|||=++ T Consensus 22 DP~~~ak~~~~~gadelhivDld~a~~g~~~n~~~I~~i~~-----~~~~Pl~vGGGI--r--s~~~i~~ll~~GadkVi 92 (248) T PRK05211 22 DIVPLAKRYAEEGADELVFYDITASSDGRVVDKSWVSRVAE-----VIDIPFCVAGGI--K--SVEDAREILSFGADKIS 92 (248) T ss_pred CHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHH-----HCCCCEEEECCC--C--CHHHHHHHHHCCCCEEE T ss_conf 99999999998699989999786776787214999999997-----679858962780--1--38999999987998899 Q ss_pred EECCCCHHHHHHHHHHHHHHHCCCCCEEC----------CCCC-------------C----HHHHHHHHHCCCCEEEEEE Q ss_conf 82484113889999999998579872451----------4646-------------9----9999999854875899610 Q gi|254780889|r 251 VDTAHGHSQKVLDAVVQIKKNFPSLLVMA----------GNIA-------------T----AEGALALIDAGADIIKVGI 303 (493) Q Consensus 251 iD~ahGh~~~~~~~i~~~k~~~~~~~ii~----------GNv~-------------t----~~~~~~l~~~Gad~ikvGi 303 (493) |-++-- .--+.++++-+.|++..|++ ++.. | .|-++.+.+.|+--|.+- T Consensus 93 igs~a~---~np~li~~~~~~fG~q~IvvsiD~~~~~~~~~~~v~~~~g~~~~~~~t~~~~~d~i~~~~~~G~geIl~t- 168 (248) T PRK05211 93 INSPAL---ADPTLITRLADRFGVQCIVVGIDSWFDAETGKYQVYQYTGDESRTKATQWETLDWVKEVQKRGAGEIVLN- 168 (248) T ss_pred ECCHHH---HCCHHHHHHHHHCCCCEEEEEEEEECCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEE- T ss_conf 897676---1961899999857993699999710255578579998258656530477369999999997598669998- Q ss_pred CCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEE-EECCCEEEECCHHH Q ss_conf 2787665310001574310037889998622698199726543777621336-71487899323010 Q gi|254780889|r 304 GPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAI-AAGSACVMIGSLLA 369 (493) Q Consensus 304 g~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAl-a~GA~~VM~G~~~a 369 (493) +..|.=|.-| |.+..+...+ +..++||||-||+.+.-|+.++| -+|++.|.+|++|- T Consensus 169 -----~IdrDG~~~G-~dl~l~~~i~---~~~~iPvIasGGv~s~~di~~~~~~~~~~gvi~gs~~~ 226 (248) T PRK05211 169 -----MMNQDGVRNG-YDLAQLKKVR---AICHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFH 226 (248) T ss_pred -----EECCCCCCCC-CCHHHHHHHH---HHCCCCEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHH T ss_conf -----9878997278-8999999999---74699999988889999999999867984133048888 No 234 >COG0167 PyrD Dihydroorotate dehydrogenase [Nucleotide transport and metabolism] Probab=98.06 E-value=0.00044 Score=48.71 Aligned_cols=170 Identities=26% Similarity=0.331 Sum_probs=105.2 Q ss_pred CHHHHHHHHHHHCC-CCEEEEECCCCH----------HHHHHHHHHHHHHHCCCCCEE---CCCCCC-HHHHHHHHHCCC Q ss_conf 01567899987313-666998248411----------388999999999857987245---146469-999999985487 Q gi|254780889|r 232 KDIADRVGPLFDVN-VDLVVVDTAHGH----------SQKVLDAVVQIKKNFPSLLVM---AGNIAT-AEGALALIDAGA 296 (493) Q Consensus 232 ~d~~eRa~~LveaG-vDviviD~ahGh----------~~~~~~~i~~~k~~~~~~~ii---~GNv~t-~~~~~~l~~~Ga 296 (493) +++.+=+..+-+++ +|.+.+.++-=| -+.+-+.++++|+... +||. +.|+.. .+.|+.+.++|+ T Consensus 109 ~~~~d~~~~~~~~~~ad~ielNiScPnt~g~~~l~~~~e~l~~l~~~vk~~~~-~Pv~vKl~P~~~di~~iA~~~~~~g~ 187 (310) T COG0167 109 EAWADYARLLEEAGDADAIELNISCPNTPGGRALGQDPELLEKLLEAVKAATK-VPVFVKLAPNITDIDEIAKAAEEAGA 187 (310) T ss_pred HHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHCCCHHHHHHHHHHHHHCCC-CCEEEEECCCHHHHHHHHHHHHHCCC T ss_conf 88999999997507788799985389997746654399999999999986356-86599938888999999999997498 Q ss_pred CEEEEE--ECCCC-CCCCC-------HHCCCCCCCCH--HHHHHHHHHHCCC--CEEEEECCCCCHHHHEEEEEECCCEE Q ss_conf 589961--02787-66531-------00015743100--3788999862269--81997265437776213367148789 Q gi|254780889|r 297 DIIKVG--IGPGS-ICTTR-------VVTGVGCPQLS--AIMSVVEVAERAG--VAIVADGGIRFSGDIAKAIAAGSACV 362 (493) Q Consensus 297 d~ikvG--ig~Gs-~CtTr-------~~~g~g~pq~~--av~~~~~~~~~~~--~~iiadGGi~~~gdi~kAla~GA~~V 362 (493) |++-+- +.+|. +-... ..-|..=|++. |+.-.++..+..+ +|||.-|||.+..|..+-|.|||++| T Consensus 188 Dgl~~~NT~~~~~~i~~~~~~~~~~~~~GGLSG~~ikp~al~~v~~l~~~~~~~ipIIGvGGI~s~~DA~E~i~aGA~~v 267 (310) T COG0167 188 DGLIAINTTKSGMKIDLETKKPVLANETGGLSGPPLKPIALRVVAELYKRLGGDIPIIGVGGIETGEDALEFILAGASAV 267 (310) T ss_pred CEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCEEEECCCCCHHHHHHHHHCCCCHH T ss_conf 58999700366553012345556676777757510027899999999984289974898468696999999998297564 Q ss_pred EECCHHHCCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHHHHCCCCCCCCCCEECCCCEEEEECCCCHHHHHHHHHHH Q ss_conf 93230101216776448736805888733200688752212221012333433100454026621057589989999999 Q gi|254780889|r 363 MIGSLLAGTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSARYSQDGVTDVLKLVPEGIEGRVPYKGPIASVLHQMSGG 442 (493) Q Consensus 363 M~G~~~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ry~~~~~~~~~k~v~eGv~~~v~~~G~~~~~~~~~~~g 442 (493) .+|+-| .|+|- .++.++..+ T Consensus 268 Qv~Tal-------------------~~~Gp-----------------------------------------~i~~~I~~~ 287 (310) T COG0167 268 QVGTAL-------------------IYKGP-----------------------------------------GIVKEIIKG 287 (310) T ss_pred EEEEEE-------------------EEECC-----------------------------------------HHHHHHHHH T ss_conf 041121-------------------02085-----------------------------------------099999999 Q ss_pred HHHHHHHCCCCCHHHHHHCC Q ss_conf 99855515758776897375 Q gi|254780889|r 443 LKSSMGYVGASNIEEFQKKA 462 (493) Q Consensus 443 lrs~~~y~G~~~i~e~~~~~ 462 (493) |..=|-.-|..+|+|++..+ T Consensus 288 l~~~l~~~g~~si~d~~G~~ 307 (310) T COG0167 288 LARWLEEKGFESIQDIIGSA 307 (310) T ss_pred HHHHHHHCCCCCHHHHHCHH T ss_conf 99999981998799984533 No 235 >cd04735 OYE_like_4_FMN Old yellow enzyme (OYE)-related FMN binding domain, group 4. Each monomer of OYE contains FMN as a non-covalently bound cofactor, uses NADPH as a reducing agent with oxygens, quinones, and alpha,beta-unsaturated aldehydes and ketones, and can act as electron acceptors in the catalytic reaction. Other members of OYE family include trimethylamine dehydrogenase, 2,4-dienoyl-CoA reductase, enoate reductase, pentaerythriol tetranitrate reductase, xenobiotic reductase, and morphinone reductase. Probab=98.05 E-value=3.6e-05 Score=56.52 Aligned_cols=136 Identities=19% Similarity=0.221 Sum_probs=87.1 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCHH------------------------HHHHHHHHHHHHHCC-----CCCE------ Q ss_conf 15678999873136669982484113------------------------889999999998579-----8724------ Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVVDTAHGHS------------------------QKVLDAVVQIKKNFP-----SLLV------ 277 (493) Q Consensus 233 d~~eRa~~LveaGvDviviD~ahGh~------------------------~~~~~~i~~~k~~~~-----~~~i------ 277 (493) ++.+-|+...+||.|.+-|--||||- +..++.++.+|+... +.+| T Consensus 145 ~f~~AA~rA~~AGfDgVEiH~ahGYLl~qFlSp~~N~RtDeYGGs~eNR~Rf~~Eii~aIr~~vg~~~~~df~vgvRis~ 224 (353) T cd04735 145 AFGEATRRAIEAGFDGVEIHGANGYLIQQFFSPHSNRRTDEWGGSLENRMRFPLAVVKAVQEVIDKHADKDFILGYRFSP 224 (353) T ss_pred HHHHHHHHHHHCCCCEEEECCCCCHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCH T ss_conf 99999999998399989975465759998539988998473679889998899999999999854005897336751586 Q ss_pred ---ECCCCC---CHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHH Q ss_conf ---514646---99999999854875899610278766531000157431003788999862269819972654377762 Q gi|254780889|r 278 ---MAGNIA---TAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDI 351 (493) Q Consensus 278 ---i~GNv~---t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi 351 (493) .-|... +.+-++.|.++|+|.+-|..|. ...... ...+......+.........+|||+-|+|+++-.+ T Consensus 225 ~e~~~~G~~~~e~~~~~~~l~~~gvD~l~vs~g~--~~~~~~---~~~~~~~~~~~~~~~~~~~~iPvi~~G~i~~~~~a 299 (353) T cd04735 225 EEPEEPGIRMEDTLALVDKLADKGLDYLHISLWD--FDRKSR---RGRDDNQTIMELVKERIAGRLPLIAVGSINTPDDA 299 (353) T ss_pred HHCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCC--CCCCCC---CCCCCCHHHHHHHHHHHCCCCCEEEECCCCCHHHH T ss_conf 5414799999999999999984799889960377--667776---67775355899999996789809998999989999 Q ss_pred EEEEEECCCEEEECCHHHCCCC Q ss_conf 1336714878993230101216 Q gi|254780889|r 352 AKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 352 ~kAla~GA~~VM~G~~~agt~E 373 (493) -++|+-|||.|.||..|-.--+ T Consensus 300 e~~l~~gaD~V~~gR~liadPd 321 (353) T cd04735 300 LEALETGADLVAIGRGLLVDPD 321 (353) T ss_pred HHHHHCCCCHHHHHHHHHHCHH T ss_conf 9999869982998699997931 No 236 >PRK01659 consensus Probab=98.04 E-value=9e-05 Score=53.65 Aligned_cols=184 Identities=16% Similarity=0.176 Sum_probs=110.9 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEE Q ss_conf 58999999986553306457479978688980210001017422326568765556651370156789998731366699 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVV 250 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviv 250 (493) +..++.+.+.+.+...+-+||=++..-|--...++++... +.-.-.+-+|.-| + ..|.++.|+++|+|-++ T Consensus 31 DP~~~ak~~~~~Gad~ihivDld~a~~g~~~n~~~I~~i~-----~~~~ipi~vGGGI--r--s~e~~~~~l~~GadkVi 101 (252) T PRK01659 31 DPVEIAAAYNEAGADELVFLDITATHEGRKTMVDVVEKVA-----AKVFIPLTVGGGI--S--SVKDMKRLLRAGADKVS 101 (252) T ss_pred CHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHH-----HHCCCCEEEECCE--E--CHHHHHHHHHCCCCEEE T ss_conf 9999999999879999999946766568864899999999-----7569747996332--0--06888898744885598 Q ss_pred EECCCCHHHHHHHHHHHHHHHCCCCCEECC------------CCC-----------CHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 824841138899999999985798724514------------646-----------999999998548758996102787 Q gi|254780889|r 251 VDTAHGHSQKVLDAVVQIKKNFPSLLVMAG------------NIA-----------TAEGALALIDAGADIIKVGIGPGS 307 (493) Q Consensus 251 iD~ahGh~~~~~~~i~~~k~~~~~~~ii~G------------Nv~-----------t~~~~~~l~~~Gad~ikvGig~Gs 307 (493) |.++-- .-.+.++.+.+.|++-.|+++ .+. -.+-.+.+.+.|+.-+.+ T Consensus 102 igs~a~---~n~~~i~~~~~~~G~q~IvvsiD~k~~~~~~~~~i~~~g~~~~~~~~~~~~i~~~~~~g~geil~------ 172 (252) T PRK01659 102 INSAAV---LRPELITEGADHFGSQCIVVAIDAKYDAEAGKWNVYTHGGRVDTGLDAIAWAKEAVRLGAGEILL------ 172 (252) T ss_pred ECHHHH---HCHHHHHHHHHHCCCEEEEEEEEEEEECCCCCEEEEECCCCCCCCCCHHHHHHHHHHCCCCEEEE------ T ss_conf 317775---29153214676468632699999897056886899968995767777999999999769977999------ Q ss_pred CCCCCHHCCCCC-CCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEEC-CCEEEECCHHHCCCCCCCC Q ss_conf 665310001574-310037889998622698199726543777621336714-8789932301012167764 Q gi|254780889|r 308 ICTTRVVTGVGC-PQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAG-SACVMIGSLLAGTDESPGD 377 (493) Q Consensus 308 ~CtTr~~~g~g~-pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~G-A~~VM~G~~~agt~Espg~ 377 (493) |-=..-|... |.+..+.+. ++..++|+||-||+.+.-|+.+++..+ +++|..|++|--.+-++.+ T Consensus 173 --tdI~rDG~~~G~dl~l~~~i---~~~~~~PiIasGGi~~~~di~~l~~~~~v~gv~~g~~~~~~~~sl~e 239 (252) T PRK01659 173 --TSMDADGTKNGFDLRLTKAI---SEAVSVPVIASGGAGNADHMVEVFQKTTADAALAASIFHYGETSIKE 239 (252) T ss_pred --EEECCCCCCCCCCHHHHHHH---HHHCCCCEEEEECCCCHHHHHHHHHHCCCCEEEEHHHHHCCCCCHHH T ss_conf --98814585476898999999---98689999999179999999999974898265575477779999999 No 237 >cd04800 CBS_pair_CAP-ED_DUF294_PBI_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with either the CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain or the PB1 (Phox and Bem1p) domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. The PB1 domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pa Probab=98.03 E-value=2e-05 Score=58.27 Aligned_cols=47 Identities=34% Similarity=0.624 Sum_probs=26.8 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf 786325730358999999986553306457479978688980210001 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) ++++|+++++++.+|.++|.+++++.+||+| +++++|++|.+|+++. T Consensus 1 r~~vtv~~~~tv~ea~~~m~~~~i~~v~V~~-~~~~vGiit~~Di~~~ 47 (111) T cd04800 1 RPPVTCSPDTTIREAARLMTEHRVSSLLVVD-DGRLVGIVTDRDLRNR 47 (111) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEE-CCEEEEEEEECHHHHH T ss_conf 9998978919799999999972999899998-9999999995578899 No 238 >PRK07114 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional Probab=98.03 E-value=0.00019 Score=51.31 Aligned_cols=110 Identities=22% Similarity=0.286 Sum_probs=62.2 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCHHHHHHH-HHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCC Q ss_conf 1567899987313666998248411388999-999999857987245146469999999985487589961027876653 Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVVDTAHGHSQKVLD-AVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTT 311 (493) Q Consensus 233 d~~eRa~~LveaGvDviviD~ahGh~~~~~~-~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtT 311 (493) +..+-+++|+++|+.++=|-.-.-....+++ .++..++.+|++.|=+|-|-|.+.++..+++||+.+ +.|| ++ T Consensus 29 ~a~~~a~aL~~gGi~~iEiTlrt~~a~~~i~~l~~~~~~~~p~~~iGaGTVl~~~~~~~a~~aGA~Fi---VSP~---~~ 102 (223) T PRK07114 29 VAKKVVKACYDGGVRAFEFTNRGDFAHEVFGELVKYAAKECPEMILGVGSIVDAATAALYIQLGANFV---VGPL---FN 102 (223) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCEE---ECCC---CC T ss_conf 99999999998899889995889658999999999998668980896551889999999998599899---9999---99 Q ss_pred CHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEE Q ss_conf 10001574310037889998622698199726543777621336714878993 Q gi|254780889|r 312 RVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMI 364 (493) Q Consensus 312 r~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~ 364 (493) .+..+++.++++|.|- |+-++.+|..|+.+|++.|=+ T Consensus 103 --------------~~v~~~~~~~~~~~iP--Gv~TptEi~~A~~~G~~~vK~ 139 (223) T PRK07114 103 --------------EDIAKVCNRRKIPYSP--GCGSVSEIGFAEELGCEIVKI 139 (223) T ss_pred --------------HHHHHHHHHCCCCCCC--CCCCHHHHHHHHHCCCCEEEE T ss_conf --------------9999999983997537--319999999999879997988 No 239 >PRK02083 imidazole glycerol phosphate synthase subunit HisF; Provisional Probab=98.03 E-value=8.8e-05 Score=53.71 Aligned_cols=185 Identities=18% Similarity=0.207 Sum_probs=111.1 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEE Q ss_conf 58999999986553306457479978688980210001017422326568765556651370156789998731366699 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVV 250 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviv 250 (493) +..+..+.+.+.....+-++|=++..-|--...++++.... .-.-.+-+|..|- ..|.++.|+++|+|-++ T Consensus 31 dP~~~a~~~~~~gadel~ivDld~s~~~~~~~~~~I~~i~~-----~~~~pi~vGGGIr----s~e~~~~ll~~GadkVv 101 (253) T PRK02083 31 DPVELAKRYDEEGADELVFLDITASSEGRDTMKDVVERVAE-----QVFIPLTVGGGIR----SVEDARRLLRAGADKVS 101 (253) T ss_pred CHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHH-----HCCCCEEEECCCC----CHHHHHHHHHCCCCEEE T ss_conf 99999999998799989999562664577417999999998-----6398778517621----38987689877987899 Q ss_pred EECCCCHHHHHHHHHHHHHHHCCCCCEEC------C------CCCC-----------HHHHHHHHHCCCCEEEEEECCCC Q ss_conf 82484113889999999998579872451------4------6469-----------99999998548758996102787 Q gi|254780889|r 251 VDTAHGHSQKVLDAVVQIKKNFPSLLVMA------G------NIAT-----------AEGALALIDAGADIIKVGIGPGS 307 (493) Q Consensus 251 iD~ahGh~~~~~~~i~~~k~~~~~~~ii~------G------Nv~t-----------~~~~~~l~~~Gad~ikvGig~Gs 307 (493) |.++--. -.+.++.+.+.|++.-|++ + .|.| .+-.+.+.+.|+.-+.+- T Consensus 102 igs~a~~---~p~~i~~~~~~~G~q~Iv~siD~~~~~~~~~~~v~~~~~~~~~~~~~~~~i~~~~~~g~geil~t----- 173 (253) T PRK02083 102 INSAAVA---DPELITELADRFGSQCIVVAIDAKRDGEPGRWEVFTHGGRKPTGIDAVEWAKEVQELGAGEILLT----- 173 (253) T ss_pred ECCHHHH---CCCHHHHHHHHCCCEEEEEEEEEEECCCCCEEEEEECCCCCCCCCCHHHHHHHHHHCCCCEEEEE----- T ss_conf 9984653---85355788974698359999998873768718999807841255239999999875698789999----- Q ss_pred CCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEE-EECCCEEEECCHHHCCCCCCCC Q ss_conf 665310001574310037889998622698199726543777621336-7148789932301012167764 Q gi|254780889|r 308 ICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAI-AAGSACVMIGSLLAGTDESPGD 377 (493) Q Consensus 308 ~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAl-a~GA~~VM~G~~~agt~Espg~ 377 (493) +..|.=+..| |.+..+.+. ++..++||||-||+.+..|+.+|| -.|.+.|-+|++|--...++.+ T Consensus 174 -dI~rDG~~~G-~d~~l~~~i---~~~~~iPiI~sGGv~s~~di~~~l~~~~i~gv~~G~~~~~~~~sl~~ 239 (253) T PRK02083 174 -SMDQDGTKNG-YDLELTRAV---RDAVSVPVIASGGAGNLEHFAEAFTEGGADAALAASVFHFGEITIGE 239 (253) T ss_pred -EECCCCCCCC-CCHHHHHHH---HHHCCCCEEEECCCCCHHHHHHHHHHCCCEEEEEHHHHHCCCCCHHH T ss_conf -8855586678-899999999---97579999998899999999999986798099871277769999999 No 240 >PRK00830 consensus Probab=98.01 E-value=9.3e-05 Score=53.56 Aligned_cols=186 Identities=16% Similarity=0.185 Sum_probs=111.8 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEE Q ss_conf 35899999998655330645747997868898021000101742232656876555665137015678999873136669 Q gi|254780889|r 170 VNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLV 249 (493) Q Consensus 170 ~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDvi 249 (493) -+..++.+.+.+.+.+.|-+||=++..-|--...++++... +--.-.+-||..|- ..|.++.|+++|+|=+ T Consensus 34 gdP~~~ak~~~~~gadelhivDld~a~~g~~~~~~~I~~i~-----~~~~~pi~vGGGIr----s~e~~~~ll~~GadkV 104 (273) T PRK00830 34 GDPVELAKRYYEDGADELVFLDITASHEGRATMIDVIERTA-----EEVFIPLTVGGGIR----SIEDIRQILRAGADKV 104 (273) T ss_pred CCHHHHHHHHHHCCCCEEEEEEEECCCCCCCCHHHHHHHHH-----HHCCCCEEEECCEE----ECCCHHHHHHCCCCEE T ss_conf 89999999999879998999953246468842799999999-----86699589608843----7732899997698639 Q ss_pred EEECCCCHHHHHHHHHHHHHHHCCCCCEECCC----------------------------C-----------CCHHHHHH Q ss_conf 98248411388999999999857987245146----------------------------4-----------69999999 Q gi|254780889|r 250 VVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGN----------------------------I-----------ATAEGALA 290 (493) Q Consensus 250 viD~ahGh~~~~~~~i~~~k~~~~~~~ii~GN----------------------------v-----------~t~~~~~~ 290 (493) +|-++-- .--+.++++-+.|++-.|+++= | ..-+-++. T Consensus 105 vIgS~a~---~np~~v~~~~~~fGsq~IvvsiD~k~~~~~~~~~~~~~~~~~~g~~~w~~v~~~g~~~~t~~~~~~~~~~ 181 (273) T PRK00830 105 SVNTAAV---KNPEFIREASDIFGSQCIVVAIDCKRNYNPKDNPDKTIVELEDGTCAWYEVVIYGGREFTGIDAVQWAKK 181 (273) T ss_pred ECHHHHH---HCCHHHHHHHHHCCCCEEEEEEEEECCCCCCCCCCCEEEECCCCCCCEEEEEEECCCCCCCCCHHHHHHH T ss_conf 8379898---5907789999876990599999843376654567621454047874228999707803378679999999 Q ss_pred HHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEEC-CCEEEECCHHH Q ss_conf 98548758996102787665310001574310037889998622698199726543777621336714-87899323010 Q gi|254780889|r 291 LIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAG-SACVMIGSLLA 369 (493) Q Consensus 291 l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~G-A~~VM~G~~~a 369 (493) +.+.|+--+.+- +..|.=|.-| |-+..+.+ .+...++||||-||+.+.-|+.+++-.+ +++|..|++|- T Consensus 182 ~~~~G~geil~t------dI~rDGt~~G-~d~~l~~~---i~~~~~iPvIasGGv~~~~di~~~~~~~~~~~v~~gs~f~ 251 (273) T PRK00830 182 VEELGAGEILLT------SMDRDGTKDG-YDIPITKK---ISEEVDIPVIASGGVGNPEHIYEGFSDGKADAALAASIFH 251 (273) T ss_pred HHHCCCCEEEEE------EECCCCCCCC-CCHHHHHH---HHHCCCCCEEEECCCCCHHHHHHHHHHCCCCEEEEEHHHH T ss_conf 986498868887------8757796568-89699999---9863799889988999999999999838986887700566 Q ss_pred CCCCCCCC Q ss_conf 12167764 Q gi|254780889|r 370 GTDESPGD 377 (493) Q Consensus 370 gt~Espg~ 377 (493) -.+-+|.+ T Consensus 252 ~~~~si~e 259 (273) T PRK00830 252 FNEYSIRE 259 (273) T ss_pred CCCCCHHH T ss_conf 69979999 No 241 >PRK01862 putative voltage-gated ClC-type chloride channel ClcB; Provisional Probab=98.00 E-value=8.4e-06 Score=61.03 Aligned_cols=59 Identities=24% Similarity=0.342 Sum_probs=54.5 Q ss_pred CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 46724400267863257303589999999865533064574799786889802100010 Q gi|254780889|r 151 AQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 151 ~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) .+.+|+++|.++..++++++++.|+.+.|.+++.+.+||||++|+++|+++++|+.+.. T Consensus 454 ~~~~v~dvm~~~~~~v~~~~~l~e~~~~~~~~~~~~~~Vvd~~g~l~Giv~l~dl~~~~ 512 (583) T PRK01862 454 RTTQMRELIQPAQTVVPLTASVADMTRVFLEYPVKYLYVTDDDGRFRGAVALKDITSDL 512 (583) T ss_pred HHCCHHHHCCCCCCEECCCCCHHHHHHHHHHCCCEEEEEECCCCEEEEEEEHHHHHHHH T ss_conf 50809996387882349999899999999847970799988999799997899988775 No 242 >cd02932 OYE_YqiM_FMN Old yellow enzyme (OYE) YqjM-like FMN binding domain. YqjM is involved in the oxidative stress response of Bacillus subtilis. Like the other OYE members, each monomer of YqjM contains FMN as a non-covalently bound cofactor and uses NADPH as a reducing agent. The YqjM enzyme exists as a homotetramer that is assembled as a dimer of catalytically dependent dimers, while other OYE members exist only as monomers or dimers. Moreover, the protein displays a shared active site architecture where an arginine finger at the COOH terminus of one monomer extends into the active site of the adjacent monomer and is directly involved in substrate recognition. Another remarkable difference in the binding of the ligand in YqjM is represented by the contribution of the NH2-terminal tyrosine instead of a COOH-terminal tyrosine in OYE and its homologs. Probab=98.00 E-value=6.8e-05 Score=54.53 Aligned_cols=133 Identities=22% Similarity=0.349 Sum_probs=88.3 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCHH------------------------HHHHHHHHHHHHHCCC-CCEE--------- Q ss_conf 15678999873136669982484113------------------------8899999999985798-7245--------- Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVVDTAHGHS------------------------QKVLDAVVQIKKNFPS-LLVM--------- 278 (493) Q Consensus 233 d~~eRa~~LveaGvDviviD~ahGh~------------------------~~~~~~i~~~k~~~~~-~~ii--------- 278 (493) ++.+-|+...+||.|.+-|-.||||- +..++.++.+|+..+. .+|. T Consensus 155 ~f~~AA~rA~~AGfDGVEIH~ahGYLl~qFLsp~~N~RtDeYGGs~enR~Rf~~Eii~aVr~~vg~d~~vgvRis~~d~~ 234 (336) T cd02932 155 AFVAAARRAVEAGFDVIEIHAAHGYLLHQFLSPLSNKRTDEYGGSLENRMRFLLEVVDAVRAVWPEDKPLFVRISATDWV 234 (336) T ss_pred HHHHHHHHHHHCCCCEEEECCCCCHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCC T ss_conf 99999999998399999863137479998369411677786799789998899999999999839988706896452357 Q ss_pred CCCCC---CHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEE Q ss_conf 14646---999999998548758996102787665310001574310037889998622698199726543777621336 Q gi|254780889|r 279 AGNIA---TAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAI 355 (493) Q Consensus 279 ~GNv~---t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAl 355 (493) -|... +.+-++.|.++|+|.+-|-.|.-... .+...+ |.+. +.-+....+..++|||+-|+|+.+-.+-++| T Consensus 235 ~~g~~~~e~~~~a~~l~~~gvd~i~vs~G~~~~~-~~~~~~---~~~~-~~~a~~ik~~~~ipvi~~G~i~~p~~ae~~l 309 (336) T cd02932 235 EGGWDLEDSVELAKALKELGVDLIDVSSGGNSPA-QKIPVG---PGYQ-VPFAERIRQEAGIPVIAVGLITDPEQAEAIL 309 (336) T ss_pred CCCCCHHHHHHHHHHHHHCCCCEEEECCCCCCCC-CCCCCC---CCCC-HHHHHHHHHHCCCCEEEECCCCCHHHHHHHH T ss_conf 8998999999999999975997899558987766-667778---6426-7999999987898399979989999999999 Q ss_pred EEC-CCEEEECCHHHC Q ss_conf 714-878993230101 Q gi|254780889|r 356 AAG-SACVMIGSLLAG 370 (493) Q Consensus 356 a~G-A~~VM~G~~~ag 370 (493) ..| ||.|.+|..|-. T Consensus 310 ~~G~~DlV~~gR~~ia 325 (336) T cd02932 310 ESGRADLVALGRELLR 325 (336) T ss_pred HCCCCEEHHHHHHHHH T ss_conf 8799400686799997 No 243 >PRK02621 consensus Probab=97.99 E-value=8.1e-05 Score=53.97 Aligned_cols=184 Identities=21% Similarity=0.212 Sum_probs=112.4 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEE Q ss_conf 58999999986553306457479978688980210001017422326568765556651370156789998731366699 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVV 250 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviv 250 (493) +..+..+.+.+.+...+-+||=++..-|--...++++... ..-.=-+-+|..|- ..+.++.|+++|||-++ T Consensus 31 dP~~~ak~~~~~gad~lhivDld~a~~~~~~~~~~I~~i~-----~~~~ipi~vGGGIr----s~e~~~~ll~~GadkVi 101 (254) T PRK02621 31 DPVELACRYSQAGADELVFLDITATHEGRATLIDVVYRTA-----EQVFIPLTVGGGIS----SLEGIKELLRAGADKVS 101 (254) T ss_pred CHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHH-----HHCCCCEEEECCEE----EHHHHHHHHHCCCCEEE T ss_conf 9999999999859999999826676567542899999999-----86798589963353----57999999974999899 Q ss_pred EECCCCHHHHHHHHHHHHHHHCCCC------CEECCCCCC------------------HHHHHHHHHCCCCEEEEEECCC Q ss_conf 8248411388999999999857987------245146469------------------9999999854875899610278 Q gi|254780889|r 251 VDTAHGHSQKVLDAVVQIKKNFPSL------LVMAGNIAT------------------AEGALALIDAGADIIKVGIGPG 306 (493) Q Consensus 251 iD~ahGh~~~~~~~i~~~k~~~~~~------~ii~GNv~t------------------~~~~~~l~~~Gad~ikvGig~G 306 (493) |.++--.+ .+.++++.+.|++- ++.-+++.+ .+-++.+.+.|+.-+.+ T Consensus 102 i~s~a~~n---p~~~~~~~~~fG~q~Iv~siD~k~~~~~~~gw~~~~~~~~~~~~~~~~~~~~~~~~~g~geil~----- 173 (254) T PRK02621 102 LNSAAVRD---PDLVRQASDRFGSQCIVVAIDARRRKPDNPGWDVYVRGGRENTGLDAVEWAEEVAERGAGEILL----- 173 (254) T ss_pred ECCHHHHC---CCHHHHHHHHCCCCCEEEEEEEECCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHCCCEEEE----- T ss_conf 98867647---3544556875698433999995535347886289966884557767999998877628896999----- Q ss_pred CCCCCCHHCCCCC-CCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEEC-CCEEEECCHHHCCCCCCCC Q ss_conf 7665310001574-310037889998622698199726543777621336714-8789932301012167764 Q gi|254780889|r 307 SICTTRVVTGVGC-PQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAG-SACVMIGSLLAGTDESPGD 377 (493) Q Consensus 307 s~CtTr~~~g~g~-pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~G-A~~VM~G~~~agt~Espg~ 377 (493) |-=..-|..- |.+..+... ++..++|+|+-||+.+.-|+.++|..+ ++.|.+|++|--.+-++.+ T Consensus 174 ---tdI~~DGt~~G~d~~l~~~i---~~~~~iPvi~sGGi~s~edi~~~l~~~~v~gvivG~al~~~~~~l~e 240 (254) T PRK02621 174 ---TSMDGDGTQAGYDLELTRAI---AEAVEIPVIASGGAGCCDHIAEALTEGKAEAALLASLLHYGQLTIAE 240 (254) T ss_pred ---EEECCCCCCCCCCHHHHHHH---HHHCCCCEEEECCCCCHHHHHHHHHHCCCEEEEEHHHHHCCCCCHHH T ss_conf ---88804797576886999999---97179979997799999999999985898198775787889999999 No 244 >COG0800 Eda 2-keto-3-deoxy-6-phosphogluconate aldolase [Carbohydrate transport and metabolism] Probab=97.98 E-value=3.6e-05 Score=56.49 Aligned_cols=108 Identities=24% Similarity=0.433 Sum_probs=86.0 Q ss_pred CHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCC Q ss_conf 01567899987313666998248411388999999999857987245146469999999985487589961027876653 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTT 311 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtT 311 (493) ++...-+.+|++.|.++|=|-. .+....+.|+.+++.||+.-|=||-|-+++++++++++||+.+ +.|| ++ T Consensus 25 e~a~~~a~Ali~gGi~~IEITl---~sp~a~e~I~~l~~~~p~~lIGAGTVL~~~q~~~a~~aGa~fi---VsP~---~~ 95 (211) T COG0800 25 EEALPLAKALIEGGIPAIEITL---RTPAALEAIRALAKEFPEALIGAGTVLNPEQARQAIAAGAQFI---VSPG---LN 95 (211) T ss_pred HHHHHHHHHHHHCCCCEEEEEC---CCCCHHHHHHHHHHHCCCCEECCCCCCCHHHHHHHHHCCCCEE---ECCC---CC T ss_conf 9999999999976987699964---7987899999999867465882455669999999998599789---8999---99 Q ss_pred CHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEE Q ss_conf 10001574310037889998622698199726543777621336714878993 Q gi|254780889|r 312 RVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMI 364 (493) Q Consensus 312 r~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~ 364 (493) .+..+.+.++++|+| =|+-++..+.+|+-+|++.+=+ T Consensus 96 --------------~ev~~~a~~~~ip~~--PG~~TptEi~~Ale~G~~~lK~ 132 (211) T COG0800 96 --------------PEVAKAANRYGIPYI--PGVATPTEIMAALELGASALKF 132 (211) T ss_pred --------------HHHHHHHHHCCCCCC--CCCCCHHHHHHHHHCCHHHEEE T ss_conf --------------999999986799636--8879989999999807224564 No 245 >pfam00724 Oxidored_FMN NADH:flavin oxidoreductase / NADH oxidase family. Probab=97.98 E-value=6.9e-05 Score=54.50 Aligned_cols=138 Identities=16% Similarity=0.129 Sum_probs=91.3 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCH------------------------HHHHHHHHHHHHHHCCC-CCE---------E Q ss_conf 1567899987313666998248411------------------------38899999999985798-724---------5 Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVVDTAHGH------------------------SQKVLDAVVQIKKNFPS-LLV---------M 278 (493) Q Consensus 233 d~~eRa~~LveaGvDviviD~ahGh------------------------~~~~~~~i~~~k~~~~~-~~i---------i 278 (493) ++.+-|+...+||.|-|-|-.|||| .+..++.++.+|+..+. .+| . T Consensus 144 ~f~~AA~rA~~AGfDGVEIh~ahGyLl~QFLSp~~N~RtDeYGGslENR~Rf~~eIi~aIR~~vg~d~~i~vRis~~d~~ 223 (336) T pfam00724 144 EFVQAAKRAVEAGFDGVEIHGAHGYLLDQFLSPGSNKRTDEYGGSIENRARFPLEVVDAVKEAVGEDRPIGYRLSPDDVF 223 (336) T ss_pred HHHHHHHHHHHCCCCEEEECHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCC T ss_conf 99999999998299989961426789998628765889776788988975489999999999728776642674652246 Q ss_pred CCCCCCHHHH----HHHHHCCCCEEEEEECCCCCCCCCHHCCC-CCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEE Q ss_conf 1464699999----99985487589961027876653100015-743100378899986226981997265437776213 Q gi|254780889|r 279 AGNIATAEGA----LALIDAGADIIKVGIGPGSICTTRVVTGV-GCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAK 353 (493) Q Consensus 279 ~GNv~t~~~~----~~l~~~Gad~ikvGig~Gs~CtTr~~~g~-g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~k 353 (493) .+...+.|.. +.|.++|.|.+-+-.|...-.......+. ..+.......+..+....++|||+-|+++.+-.+.+ T Consensus 224 ~~g~~~~e~~~~~~~~~~~~g~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~pvi~~G~i~~~~~ae~ 303 (336) T pfam00724 224 EGGLTGAETLAQFAYAAGELGVRVLDGTRLAYIHAIEPRVTGPFPVETGQQVENNEFIKKVWKGPVITVGRINDPEFAAE 303 (336) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCEEECCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCCHHHHHH T ss_conf 89988426899999999983877564276622320244335787756312478999999876985999699998999999 Q ss_pred EEEEC-CCEEEECCHHHC Q ss_conf 36714-878993230101 Q gi|254780889|r 354 AIAAG-SACVMIGSLLAG 370 (493) Q Consensus 354 Ala~G-A~~VM~G~~~ag 370 (493) +|+.| ||.|++|..|-- T Consensus 304 ~l~~g~~D~V~~gR~~ia 321 (336) T pfam00724 304 IVEEGRADLVAMGRPFLA 321 (336) T ss_pred HHHCCCCEEHHHHHHHHH T ss_conf 998799443686699997 No 246 >cd02803 OYE_like_FMN_family Old yellow enzyme (OYE)-like FMN binding domain. OYE was the first flavin-dependent enzyme identified, however its true physiological role remains elusive to this day. Each monomer of OYE contains FMN as a non-covalently bound cofactor, uses NADPH as a reducing agent with oxygens, quinones, and alpha,beta-unsaturated aldehydes and ketones, and can act as electron acceptors in the catalytic reaction. Members of OYE family include trimethylamine dehydrogenase, 2,4-dienoyl-CoA reductase, enoate reductase, pentaerythriol tetranitrate reductase, xenobiotic reductase, and morphinone reductase. Probab=97.97 E-value=6.9e-05 Score=54.46 Aligned_cols=137 Identities=23% Similarity=0.227 Sum_probs=89.1 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCHH------------------------HHHHHHHHHHHHHCCCC-CEEC-------- Q ss_conf 15678999873136669982484113------------------------88999999999857987-2451-------- Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVVDTAHGHS------------------------QKVLDAVVQIKKNFPSL-LVMA-------- 279 (493) Q Consensus 233 d~~eRa~~LveaGvDviviD~ahGh~------------------------~~~~~~i~~~k~~~~~~-~ii~-------- 279 (493) ++.+-|+...+||.|.+-|-.||||- +..++.++.+|+..+.- +|.. T Consensus 142 ~f~~AA~~A~~AGfDgVEIh~ahGyLl~qFlSp~~N~RtD~YGGs~eNR~Rf~~eii~air~~vg~df~vgvRls~~d~~ 221 (327) T cd02803 142 DFAAAARRAKEAGFDGVEIHGAHGYLLSQFLSPYTNKRTDEYGGSLENRARFLLEIVAAVREAVGPDFPVGVRLSADDFV 221 (327) T ss_pred HHHHHHHHHHHCCCCEEEECCCCCHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEECHHHCC T ss_conf 99999999998499989983576618887217546987777888989998999999999999739887617997702126 Q ss_pred CCCCC----HHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEE Q ss_conf 46469----99999998548758996102787665310001574310037889998622698199726543777621336 Q gi|254780889|r 280 GNIAT----AEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAI 355 (493) Q Consensus 280 GNv~t----~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAl 355 (493) -+--| .+-++.|.++|+|.|-|-.|.-.- ..........|...-+..+..+.+..++|+|+-|+|++.-++.++| T Consensus 222 ~~g~~~~e~~~~~~~l~~~gvd~i~vs~g~~~~-~~~~~~~~~~~~~~~~~~~~~ik~~~~~pvi~~G~i~~~~~a~~~l 300 (327) T cd02803 222 PGGLTLEEAIEIAKALEEAGVDALHVSGGSYES-PPPIIPPPYVPEGYFLELAEKIKKAVKIPVIAVGGIRDPEVAEEIL 300 (327) T ss_pred CCCCCHHHHHHHHHHHHHCCCCEEEECCCCCCC-CCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEECCCCCHHHHHHHH T ss_conf 899998999999999985599989977784566-7544678777752238999999997698199989989999999999 Q ss_pred EEC-CCEEEECCHHHC Q ss_conf 714-878993230101 Q gi|254780889|r 356 AAG-SACVMIGSLLAG 370 (493) Q Consensus 356 a~G-A~~VM~G~~~ag 370 (493) +.| +|.|++|..|-. T Consensus 301 ~~g~~D~V~~gR~~ia 316 (327) T cd02803 301 AEGKADLVALGRALLA 316 (327) T ss_pred HCCCCCHHHHHHHHHH T ss_conf 8899312586699997 No 247 >cd02940 DHPD_FMN Dihydropyrimidine dehydrogenase (DHPD) FMN-binding domain. DHPD catalyzes the first step in pyrimidine degradation: the NADPH-dependent reduction of uracil and thymine to the corresponding 5,6-dihydropyrimidines. DHPD contains two FAD, two FMN, and eight [4Fe-4S] clusters, arranged in two electron transfer chains that pass the dimer interface twice. Two of the Fe-S clusters show a hitherto unobserved coordination involving a glutamine residue. Probab=97.96 E-value=7e-05 Score=54.45 Aligned_cols=136 Identities=21% Similarity=0.288 Sum_probs=81.6 Q ss_pred CHHHHHHHHHHHCCCCEEEEECC--CC-----------HHHHHH-HHHHHHHHHCCCCCEEC---CCCCC-HHHHHHHHH Q ss_conf 01567899987313666998248--41-----------138899-99999998579872451---46469-999999985 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTA--HG-----------HSQKVL-DAVVQIKKNFPSLLVMA---GNIAT-AEGALALID 293 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~a--hG-----------h~~~~~-~~i~~~k~~~~~~~ii~---GNv~t-~~~~~~l~~ 293 (493) +|+.+-++.+-++|+|+|.+.++ |. +....+ +.++.+|+.. ++|+++ .|+.. .+-++.+.+ T Consensus 113 ~~~~~~a~~~~~~gad~lElNiScPN~~~~~~~g~~~~~~~~~l~~i~~~v~~~~-~~Pi~vKLsP~~~~i~~ia~~~~~ 191 (299) T cd02940 113 EDWTELAKLVEEAGADALELNFSCPHGMPERGMGAAVGQDPELVEEICRWVREAV-KIPVIAKLTPNITDIREIARAAKE 191 (299) T ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHC-CCCEEEECCCCHHHHHHHHHHHHH T ss_conf 8999999999871888899826788987612345552449999999999998624-786489628871549999999998 Q ss_pred CCCCEEEEEECC--C--CCC--CCCHHC------C----CCCCCCH-HHHHHHHHHHC--CCCEEEEECCCCCHHHHEEE Q ss_conf 487589961027--8--766--531000------1----5743100-37889998622--69819972654377762133 Q gi|254780889|r 294 AGADIIKVGIGP--G--SIC--TTRVVT------G----VGCPQLS-AIMSVVEVAER--AGVAIVADGGIRFSGDIAKA 354 (493) Q Consensus 294 ~Gad~ikvGig~--G--s~C--tTr~~~------g----~g~pq~~-av~~~~~~~~~--~~~~iiadGGi~~~gdi~kA 354 (493) +|+|++-. +-- + .+. +.+... + .|.|-.. ++..+++..+. -++|||+-|||.+.-|+.+. T Consensus 192 ~gadgiv~-~Nt~~~~~~i~~d~~~~~~~~~~~~~~GGlSG~~l~~~al~~v~~i~~~~~~~i~Iig~GGI~s~~Da~e~ 270 (299) T cd02940 192 GGADGVSA-INTVNSLMGVDLDGTPPAPGVEGKTTYGGYSGPAVKPIALRAVSQIARAPEPGLPISGIGGIESWEDAAEF 270 (299) T ss_pred CCCCEEEE-ECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCHHHHHHH T ss_conf 59989999-76667756544223566656456777784558788999999999999964899778998995999999999 Q ss_pred EEECCCEEEECCHHH Q ss_conf 671487899323010 Q gi|254780889|r 355 IAAGSACVMIGSLLA 369 (493) Q Consensus 355 la~GA~~VM~G~~~a 369 (493) |.|||+.|++++-|- T Consensus 271 i~aGAs~Vqv~Tal~ 285 (299) T cd02940 271 LLLGASVVQVCTAVM 285 (299) T ss_pred HHCCCCHHHHHHHHH T ss_conf 984998999989999 No 248 >PRK02747 consensus Probab=97.95 E-value=0.00011 Score=53.13 Aligned_cols=183 Identities=18% Similarity=0.189 Sum_probs=112.4 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEE Q ss_conf 58999999986553306457479978688980210001017422326568765556651370156789998731366699 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVV 250 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviv 250 (493) +..+..+.+.+.+.+.|-+||=++..-|--...++++... .--.=.+-+|.-|- ..|.++.|+++|+|-++ T Consensus 31 dP~~~ak~~~~~Gadelh~vDl~~a~~~~~~~~~lI~~i~-----~~~~ipi~vGGGIr----s~e~~~~ll~~GadkVi 101 (257) T PRK02747 31 DPVEAARAYDAAGADELCFLDITASHENRGTMLDVVARTA-----EQCFMPLTVGGGVR----TVDDIRKLLLAGADKVS 101 (257) T ss_pred CHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHH-----HHCCCCEEEECCCC----CHHHHHHHHHCCCCEEE T ss_conf 9999999999869998999947677567552899999999-----86699889848820----73887899876996898 Q ss_pred EECCCCHHHHHHHHHHHHHHHCCCCCEECC---------------CCCC-----------HHHHHHHHHCCCCEEEEEEC Q ss_conf 824841138899999999985798724514---------------6469-----------99999998548758996102 Q gi|254780889|r 251 VDTAHGHSQKVLDAVVQIKKNFPSLLVMAG---------------NIAT-----------AEGALALIDAGADIIKVGIG 304 (493) Q Consensus 251 iD~ahGh~~~~~~~i~~~k~~~~~~~ii~G---------------Nv~t-----------~~~~~~l~~~Gad~ikvGig 304 (493) |.++--. -.+.++.+.+.|++-.|+++ .+.| .+-++.+.+.|+.-+.+ T Consensus 102 igs~a~~---np~l~~~~~~~fG~q~Iv~siD~k~~~~~~~~~~~~i~~~~~~~~t~~~~~~~~~~~~~~G~geil~--- 175 (257) T PRK02747 102 INSAAVA---RPEFVAEAADKFGSQCIVVAIDAKRVSPAGENDRWEIFTHGGRKPTGIDAVEFAQKVVSLGAGEILL--- 175 (257) T ss_pred ECHHHHH---CCHHHHHHHHHCCCEEEEEEEEEEECCCCCCCCCEEEEECCCCEECCCHHHHHHHHHHHCCCCEEEE--- T ss_conf 3444654---8347777887559657999998775157677873899988984634303999999999709988999--- Q ss_pred CCCCCCCCHHCCCCC-CCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEE-CCCEEEECCHHHCCCCCCC Q ss_conf 787665310001574-31003788999862269819972654377762133671-4878993230101216776 Q gi|254780889|r 305 PGSICTTRVVTGVGC-PQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAA-GSACVMIGSLLAGTDESPG 376 (493) Q Consensus 305 ~Gs~CtTr~~~g~g~-pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~-GA~~VM~G~~~agt~Espg 376 (493) |-=..-|..- |.+..+.+.++ ..++|+||-||+.+..|+.+++-- ++++|..|++|--.+-++. T Consensus 176 -----tdI~rDG~~~G~dl~l~~~i~~---~~~~pvIasGGv~~~~di~~~~~~~~~~av~~g~~~~~~~~~l~ 241 (257) T PRK02747 176 -----TSMDRDGTKAGFDLPLTRAIAD---AVRVPVIASGGVGTLDHLVEGVRDGHATAVLAASIFHFGTYTIG 241 (257) T ss_pred -----EEECCCCCCCCCCHHHHHHHHH---CCCCCEEEECCCCCHHHHHHHHHHCCCCEEEEHHHHHCCCCCHH T ss_conf -----9883557326788699999986---07998999779999999999998389849988326776998999 No 249 >PRK13587 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Provisional Probab=97.93 E-value=0.00039 Score=49.06 Aligned_cols=177 Identities=18% Similarity=0.234 Sum_probs=112.1 Q ss_pred CCHHHHHHHHHH-CCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCE Q ss_conf 358999999986-5533064574799786889802100010174223265687655566513701567899987313666 Q gi|254780889|r 170 VNLENAKALLHQ-HRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDL 248 (493) Q Consensus 170 ~~l~eA~~iM~~-~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDv 248 (493) .+..++.+...+ ...+.+-|||=++-.-|--...++++.... -..-.+-||.-| + ..+.++.++++|+|- T Consensus 31 ~~~~~~~~~~~~~~Ga~~lHvVDLdgA~~g~~~n~~~I~~i~~-----~~~~~iqvGGGI--R--s~e~i~~~l~~G~~r 101 (234) T PRK13587 31 RSAEESIAYYSQFECVNRIHIVDLIGAKAQHAREFDYIKSLRR-----LTTKDIEVGGGI--R--TKSQIMDYFAAGINY 101 (234) T ss_pred CCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHH-----HCCCCEEEECCC--C--CHHHHHHHHHCCCCE T ss_conf 9999999999983899889999787646897437999999984-----379867984654--7--599999999768999 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHCCC-----CCEECCCCCC-----------HHHHHHHHHCCCCEEEEEECCCCCCCCC Q ss_conf 99824841138899999999985798-----7245146469-----------9999999854875899610278766531 Q gi|254780889|r 249 VVVDTAHGHSQKVLDAVVQIKKNFPS-----LLVMAGNIAT-----------AEGALALIDAGADIIKVGIGPGSICTTR 312 (493) Q Consensus 249 iviD~ahGh~~~~~~~i~~~k~~~~~-----~~ii~GNv~t-----------~~~~~~l~~~Gad~ikvGig~Gs~CtTr 312 (493) ++|-++--.+ .+.++++-+.||+ +++..|.|++ .+-++.+.+.|+..+.+- +|..-. T Consensus 102 ViigT~a~~~---~~~l~~~~~~f~~~Ivv~iD~~~~~v~~~GW~~~s~~~~~d~~~~~~~~g~~~il~T----dI~rDG 174 (234) T PRK13587 102 CIVGTKGIQD---TDWLKEMAHTFPGRIYLSVDAYGEDIKVNGWEEDTELNLFSFVRQLSDIPLGGIIYT----DIAKDG 174 (234) T ss_pred EEECCCCCCC---HHHHHHHHHHCCCCEEEEEECCCCEEEECCCCEECCCCHHHHHHHHHHCCCCEEEEE----CCCCCC T ss_conf 9988813028---699999998666776871202385454457514258679999999974398789984----026657 Q ss_pred HHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHH Q ss_conf 00015743100378899986226981997265437776213367148789932301 Q gi|254780889|r 313 VVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLL 368 (493) Q Consensus 313 ~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~ 368 (493) ...| |-+..+.+.+ +..++||||-|||++..|+.+.-.+|.+.|-+|..| T Consensus 175 tl~G---~n~el~~~i~---~~~~~pvIaSGGv~sl~Di~~L~~~gv~GvIvGkAl 224 (234) T PRK13587 175 KMSG---PNFELTGQLV---KATTIPVIASGGIRHQQDIQRLASLNVHAAIIGKAA 224 (234) T ss_pred CCCC---CCHHHHHHHH---HHCCCCEEEECCCCCHHHHHHHHHCCCCEEEEEEEC T ss_conf 4557---9999999999---767999999899899999999998899899999750 No 250 >pfam00977 His_biosynth Histidine biosynthesis protein. Proteins involved in steps 4 and 6 of the histidine biosynthesis pathway are contained in this family. Histidine is formed by several complex and distinct biochemical reactions catalysed by eight enzymes. The enzymes in this Pfam entry are called His6 and His7 in eukaryotes and HisA and HisF in prokaryotes. The structure of HisA is known to be a TIM barrel fold. In some archaeal HisA proteins the TIM barrel is composed of two tandem repeats of a half barrel. This family belong to the common phosphate binding site TIM barrel family. Probab=97.92 E-value=0.00032 Score=49.72 Aligned_cols=174 Identities=22% Similarity=0.268 Sum_probs=102.8 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEE Q ss_conf 58999999986553306457479978688980210001017422326568765556651370156789998731366699 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVV 250 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviv 250 (493) +..+..+.+.+.+...+-|+|=++-+-|--...++++.. ++--.-.+-+|.-|. ..+.++.+++.|+|-++ T Consensus 30 dP~~~a~~~~~~g~d~i~ivDLda~~~~~~~n~~~i~~i-----~~~~~~pi~vgGGIr----s~e~~~~~l~~Ga~kvv 100 (229) T pfam00977 30 DPVELAKRYEEEGADELHFVDLDAAKEGRPVNLDLIEEI-----AEEVFIPVQVGGGIR----SLEDAERLLSAGADKVI 100 (229) T ss_pred CHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHH-----HHHCCCCEEEECCEE----EHHHHHHHHHCCCCEEE T ss_conf 999999999987999899996866302681069999999-----986698789964561----18999999976998999 Q ss_pred EECCCCHHHHHHHHHHHHHHHCCC------CCEEC-CCCCCH-----------HHHHHHHHCCCCEEEEEECCCCCCCC- Q ss_conf 824841138899999999985798------72451-464699-----------99999985487589961027876653- Q gi|254780889|r 251 VDTAHGHSQKVLDAVVQIKKNFPS------LLVMA-GNIATA-----------EGALALIDAGADIIKVGIGPGSICTT- 311 (493) Q Consensus 251 iD~ahGh~~~~~~~i~~~k~~~~~------~~ii~-GNv~t~-----------~~~~~l~~~Gad~ikvGig~Gs~CtT- 311 (493) +-++--.+ .+.++.+-+.|++ +++.- |.+.+. |-.+.+.+.|+-.+ +||. T Consensus 101 igs~~~~~---~~~~~~~~~~~g~q~iv~siD~k~~~~v~~~~~~~~~~~~~~~~i~~~~~~g~~ei--------i~tdi 169 (229) T pfam00977 101 IGTAAVKN---PELIKEAAEKFGSQCIVVAIDAKRDGKVAINGWREETGIDAVEWAKKLEELGAGEI--------LLTDI 169 (229) T ss_pred ECCCHHHC---HHHHHHHHHHCCCCCEEEEEEECCCCEEEEECCCCCCCCCHHHHHHHHHHCCCCEE--------EEEEE T ss_conf 58604309---37899999980986479999871451799806433567443344567765167506--------88775 Q ss_pred -CHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHH Q ss_conf -100015743100378899986226981997265437776213367148789932301 Q gi|254780889|r 312 -RVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLL 368 (493) Q Consensus 312 -r~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~ 368 (493) |+=|+-| |-+..+... ++..++|+|+-||+++..|+.++..+|++.|.+|+.| T Consensus 170 ~~dGt~~G-~d~~l~~~i---~~~~~~pii~~GGv~~~~di~~l~~~g~~gvivg~al 223 (229) T pfam00977 170 DRDGTLSG-PDLELTREL---AEAVNIPVIASGGVGSLEDLKELFSEGVDGVIAGSAL 223 (229) T ss_pred CCCCCCCC-CCHHHHHHH---HHHCCCCEEEECCCCCHHHHHHHHHCCCCEEEEHHHH T ss_conf 04275666-899999999---9768998999858999999999998799899985786 No 251 >PRK13523 NADPH dehydrogenase NamA; Provisional Probab=97.92 E-value=9.7e-05 Score=53.43 Aligned_cols=134 Identities=16% Similarity=0.247 Sum_probs=87.5 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCH------------------------HHHHHHHHHHHHHHCCCCCEE---------C Q ss_conf 1567899987313666998248411------------------------388999999999857987245---------1 Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVVDTAHGH------------------------SQKVLDAVVQIKKNFPSLLVM---------A 279 (493) Q Consensus 233 d~~eRa~~LveaGvDviviD~ahGh------------------------~~~~~~~i~~~k~~~~~~~ii---------~ 279 (493) ++.+-|+...+||.|.|-|-.|||| ....++.++.+|+..+. +++ - T Consensus 143 ~f~~AA~rA~~AGfDgVEIH~ahGYLl~QFlSp~~N~RtD~YGGs~eNR~Rf~lEii~aVr~~~~~-~v~vRis~~d~~~ 221 (337) T PRK13523 143 AFKQAAKRAKEAGFDVIEIHGAHGYLINEFLSPLSNKRTDEYGGSPENRYRFLREIIDAVKEVWDG-PLFVRISASDYHP 221 (337) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCHHHHHHCCHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCC-CEEEEECCCCCCC T ss_conf 999999999984999899813543589984792324895855888899988999999999986588-6399933655578 Q ss_pred CCCCC---HHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEE Q ss_conf 46469---999999985487589961027876653100015743100378899986226981997265437776213367 Q gi|254780889|r 280 GNIAT---AEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIA 356 (493) Q Consensus 280 GNv~t---~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla 356 (493) |.... .+-++.|.++|+|.+-|..| +........ .|-.. +.....+.+..++|||+-|+|+.+-.+-++|+ T Consensus 222 gG~~~~d~~~~~~~l~~~GvD~i~vs~G--~~~~~~~~~---~~g~~-~~~a~~ik~~~~ipvi~vG~i~~~~~ae~~l~ 295 (337) T PRK13523 222 DGLTVQDYVQYAKWMKEQGVDLIDVSSG--AVVPARIDV---YPGYQ-VPFAEHIKEHANIATGAVGLITTGAQAEEILN 295 (337) T ss_pred CCCCHHHHHHHHHHHHHCCCCEEEECCC--CCCCCCCCC---CCCCC-HHHHHHHHHHCCCCEEEECCCCCHHHHHHHHH T ss_conf 9989899999999999749998995788--554776777---87533-48999999876970999838699999999998 Q ss_pred EC-CCEEEECCHHHCCCC Q ss_conf 14-878993230101216 Q gi|254780889|r 357 AG-SACVMIGSLLAGTDE 373 (493) Q Consensus 357 ~G-A~~VM~G~~~agt~E 373 (493) .| ||.|+||..|-.--+ T Consensus 296 ~G~aD~V~~gR~~iadPd 313 (337) T PRK13523 296 NNRADLIFIGRELLRNPY 313 (337) T ss_pred CCCCCHHHHHHHHHHCCC T ss_conf 799479998999998910 No 252 >cd02929 TMADH_HD_FMN Trimethylamine dehydrogenase (TMADH) and histamine dehydrogenase (HD) FMN-binding domain. TMADH is an iron-sulfur flavoprotein that catalyzes the oxidative demethylation of trimethylamine to form dimethylamine and formaldehyde. The protein forms a symetrical dimer with each subunit containing one 4Fe-4S cluster and one FMN cofactor. It contains a unique flavin, in the form of a 6-S-cysteinyl FMN which is bent by ~25 degrees along the N5-N10 axis of the flavin isoalloxazine ring. This modification of the conformation of the flavin is thought to facilitate catalysis.The closely related histamine dehydrogenase catalyzes oxidative deamination of histamine. Probab=97.90 E-value=0.00025 Score=50.48 Aligned_cols=138 Identities=15% Similarity=0.113 Sum_probs=93.6 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCH------------------------HHHHHHHHHHHHHHCCC-CCEE--------- Q ss_conf 1567899987313666998248411------------------------38899999999985798-7245--------- Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVVDTAHGH------------------------SQKVLDAVVQIKKNFPS-LLVM--------- 278 (493) Q Consensus 233 d~~eRa~~LveaGvDviviD~ahGh------------------------~~~~~~~i~~~k~~~~~-~~ii--------- 278 (493) ++.+-|....+||.|.|-|--|||| ....++.++.+|+..+. .+|. T Consensus 151 ~f~~AA~rA~~AGfDgVEIHaahGYLl~qFlSp~~N~RtDeYGGS~enR~Rf~~Eii~aVr~~vg~df~i~~R~s~~~~~ 230 (370) T cd02929 151 WYVDAALRARDAGFDIVYVYAAHGYLPLQFLLPRYNKRTDEYGGSLENRARFWRETLEDTKDAVGDDCAVATRFSVDELI 230 (370) T ss_pred HHHHHHHHHHHCCCCEEEECCCCCHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCC T ss_conf 99999999998598989977113559997347745787774689889998999999999999719987599998941256 Q ss_pred -CCCCCCHHHHH---HHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEE Q ss_conf -14646999999---99854875899610278766531000157431003788999862269819972654377762133 Q gi|254780889|r 279 -AGNIATAEGAL---ALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKA 354 (493) Q Consensus 279 -~GNv~t~~~~~---~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kA 354 (493) .+...+.+-.. .+.+...|.+-|.+|..+-+.. .....|+..-+.-+..+.+..++|||+-|+|+.+-.+.++ T Consensus 231 ~~~g~~~~~~~~~~~~~~~~~~d~~~vs~g~~~~~~~---~~~~~~~g~~~~~~~~ik~~~~~Pvi~vG~i~~p~~ae~~ 307 (370) T cd02929 231 GPGGIESEGEGVEFVEMLDELPDLWDVNVGDWANDGE---DSRFYPEGHQEPYIKFVKQVTSKPVVGVGRFTSPDKMVEV 307 (370) T ss_pred CCCCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCC---CCCCCCCCCCHHHHHHHHHHCCCCEEEECCCCCHHHHHHH T ss_conf 8899988899999999973657979988555566567---7776786436599999998608808997897999999999 Q ss_pred EEEC-CCEEEECCHHHCCCC Q ss_conf 6714-878993230101216 Q gi|254780889|r 355 IAAG-SACVMIGSLLAGTDE 373 (493) Q Consensus 355 la~G-A~~VM~G~~~agt~E 373 (493) |+.| ||+|.||..|-.--+ T Consensus 308 l~~G~aD~V~~gR~llaDPd 327 (370) T cd02929 308 VKSGILDLIGAARPSIADPF 327 (370) T ss_pred HHCCCCCHHHHHHHHHHCCH T ss_conf 98799426453479876953 No 253 >PRK02145 consensus Probab=97.89 E-value=0.00012 Score=52.67 Aligned_cols=185 Identities=18% Similarity=0.209 Sum_probs=109.9 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEE Q ss_conf 58999999986553306457479978688980210001017422326568765556651370156789998731366699 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVV 250 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviv 250 (493) +..++.+.+.+...+.+-+||=++..-|--...++++.. ++.-.--+-+|..|- ..+.++.|+++|||-++ T Consensus 32 dP~~~a~~~~~~GadelhivDld~a~~~~~~~~~~I~~i-----~~~~~iPi~vGGGIr----s~e~~~~ll~~GadkVi 102 (257) T PRK02145 32 DPVEIARRYDEQGADELTFLDITATSDGRDLILPIIEAV-----ASQVFIPLTVGGGVR----AVEDVRRLLNAGADKVS 102 (257) T ss_pred CHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHH-----HHHCCCCEEEECCCC----CHHHHHHHHHCCCCEEE T ss_conf 999999999987999899997888766754089999999-----965687489627730----46889999981998898 Q ss_pred EECCCCHHHHHHHHHHHHHHHCCCCCEECC---------------CCCC-----------HHHHHHHHHCCCCEEEEEEC Q ss_conf 824841138899999999985798724514---------------6469-----------99999998548758996102 Q gi|254780889|r 251 VDTAHGHSQKVLDAVVQIKKNFPSLLVMAG---------------NIAT-----------AEGALALIDAGADIIKVGIG 304 (493) Q Consensus 251 iD~ahGh~~~~~~~i~~~k~~~~~~~ii~G---------------Nv~t-----------~~~~~~l~~~Gad~ikvGig 304 (493) +.++--. --+.++.+-+.|++-.|+++ +|.| .+-++.+.+.|+.-+.+- T Consensus 103 i~s~a~~---np~~v~~~~~~fG~q~Iv~siD~k~~~~~~~~~~~~v~~~~~~~~t~~~~~~~~~~~~~~G~geil~t-- 177 (257) T PRK02145 103 MNSSAVA---NPQLVRDAADKYGSQCIVVAIDAKRVSADGEPPRWEVFTHGGRKATGLDAVEWARKMAELGAGEILLT-- 177 (257) T ss_pred EHHHHHH---CCCHHHHHHHHCCCCCEEEEEEEEECCCCCCCCEEEEEECCCEECCCCCHHHHHHHHHHCCCCEEEEE-- T ss_conf 4155665---93022457876698344999998733677777508999778714367745576568876187868999-- Q ss_pred CCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEEC-CCEEEECCHHHCCCCCCCC Q ss_conf 787665310001574310037889998622698199726543777621336714-8789932301012167764 Q gi|254780889|r 305 PGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAG-SACVMIGSLLAGTDESPGD 377 (493) Q Consensus 305 ~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~G-A~~VM~G~~~agt~Espg~ 377 (493) +..|.=+..| |-+..+.+.. +..++|+||-||+.+..|+.+++-.| +++|-.|++|--.+-++.| T Consensus 178 ----dI~rDG~~~G-~dl~l~~~i~---~~~~ipvIasGGi~s~~di~~~~~~~~~~av~~g~~~~~~~~~i~e 243 (257) T PRK02145 178 ----SMDRDGTKSG-FDLALTRAVS---DAVPVPVIASGGVGSLQHLADGITEGHADAVLAASIFHYGEHTVGE 243 (257) T ss_pred ----EECCCCCCCC-CCHHHHHHHH---HCCCCCEEEECCCCCHHHHHHHHHHCCCCEEEEHHHHHCCCCCHHH T ss_conf ----9847787788-8979999998---6269989998689999999999980898487653267779989999 No 254 >cd04591 CBS_pair_EriC_assoc_euk_bac This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes and bacteria. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS Probab=97.89 E-value=2e-05 Score=58.37 Aligned_cols=48 Identities=33% Similarity=0.582 Sum_probs=44.4 Q ss_pred CEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC Q ss_conf 81528789946543034565667605041678668860278872354500 Q gi|254780889|r 97 GMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR 146 (493) Q Consensus 97 gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir 146 (493) .|-.+|++++|++++.++.++|.+.+++.+||+++ |+|+||||.+|+. T Consensus 56 ~m~~~p~tv~~~~~l~~~~~lf~~~g~r~l~Vv~~--g~lvGiITr~Dl~ 103 (105) T cd04591 56 YIDPSPFTVSPRTSLEKVHQLFRKLGLRHLLVVDE--GRLVGIITRKDLL 103 (105) T ss_pred HCCCCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHH T ss_conf 58799839899995999999999949958899579--9999999989953 No 255 >PRK13585 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Provisional Probab=97.87 E-value=0.00023 Score=50.69 Aligned_cols=176 Identities=22% Similarity=0.272 Sum_probs=105.7 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEE Q ss_conf 35899999998655330645747997868898021000101742232656876555665137015678999873136669 Q gi|254780889|r 170 VNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLV 249 (493) Q Consensus 170 ~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDvi 249 (493) +|+ +..+.+.+.+.+.+-|||=++-+-|--.-.++++.... --.-.+-+|.-|- ..+.++.++++|+|-+ T Consensus 32 dP~-~~a~~~~~~Ga~~lhivDLd~a~~g~~~n~~~I~~i~~-----~~~~pi~vGGGIr----s~~~i~~~l~~Ga~kv 101 (240) T PRK13585 32 DPV-EVAKRWVDAGAKTLHLVDLDGAFEGSRKNADIIEKIVE-----ATDVSIQLGGGIR----SVEDAASLLDLGVDRV 101 (240) T ss_pred CHH-HHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHH-----HCCCCEEEECCCC----CHHHHHHHHHCCCCEE T ss_conf 999-99999998799979999897721189444999999997-----3797789978858----7999999997699899 Q ss_pred EEECCCCHHHHHHHHHHHHHHHCC------CCCEECCCCCCH-----------HHHHHHHHCCCCEEEEEECCCCCCCCC Q ss_conf 982484113889999999998579------872451464699-----------999999854875899610278766531 Q gi|254780889|r 250 VVDTAHGHSQKVLDAVVQIKKNFP------SLLVMAGNIATA-----------EGALALIDAGADIIKVGIGPGSICTTR 312 (493) Q Consensus 250 viD~ahGh~~~~~~~i~~~k~~~~------~~~ii~GNv~t~-----------~~~~~l~~~Gad~ikvGig~Gs~CtTr 312 (493) ++-++--.+. +.++.+-+.|. ++++-.|.+.+. |-++.+.+.|+.-+.+- +..| T Consensus 102 vigs~~~~~~---~~~~~i~~~~G~~~ivvsiD~k~~~v~~~gw~~~~~~~~~e~~~~~~~~g~~eii~t------dI~~ 172 (240) T PRK13585 102 ILGTAAIENP---ELVRELSDEFGSERVMVSLDAKDGEVVIEGWTESTGKDPVEWAQRFEELGAGSILFT------NVDV 172 (240) T ss_pred EECCCCHHCC---HHHHHHHHHHCCCEEEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEE------EECC T ss_conf 9398113184---288999987397217999993065023247656788635577788886387358986------4233 Q ss_pred HHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHH Q ss_conf 00015743100378899986226981997265437776213367148789932301 Q gi|254780889|r 313 VVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLL 368 (493) Q Consensus 313 ~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~ 368 (493) .=|.-| |-+..+.+.. +..++|+|+-||+++..|+.+.-.+|++.|-+|+.| T Consensus 173 dGt~~G-~d~~~~~~i~---~~~~~pviasGGv~s~~di~~l~~~g~~gvivG~Al 224 (240) T PRK13585 173 EGLLQG-VNPEPVRELV---DSVDIPVIASGGVTSLDDVKALKEAGAAGVVVGSAL 224 (240) T ss_pred HHHHCC-CCHHHHHHHH---HHCCCCEEEECCCCCHHHHHHHHHCCCCEEEEEHHH T ss_conf 223257-8989999999---868999999889999999999997899789987687 No 256 >cd04732 HisA HisA. Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase catalyzes the fourth step in histidine biosynthesis, an isomerisation of the aminoaldose moiety of ProFAR to the aminoketose of PRFAR (N-(5'-phospho-D-1'-ribulosylformimino)-5-amino-1-(5''-phospho-ribosyl)-4-imidazolecarboxamide). In bacteria and archaea, ProFAR isomerase is encoded by the HisA gene. Probab=97.86 E-value=0.00036 Score=49.34 Aligned_cols=176 Identities=24% Similarity=0.268 Sum_probs=105.2 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEE Q ss_conf 58999999986553306457479978688980210001017422326568765556651370156789998731366699 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVV 250 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviv 250 (493) +..++.+.+.+.....+-|+|=++-+-|----.++++... +-..-.+-+|. |++ ..+.++.|++.|+|-++ T Consensus 30 dP~~~a~~~~~~g~d~l~i~DLdaa~~~~~~n~~~I~~I~-----~~~~~pi~vGG--GIr--s~~~~~~l~~~Ga~kvv 100 (234) T cd04732 30 DPVEVAKKWEEAGAKWLHVVDLDGAKGGEPVNLELIEEIV-----KAVGIPVQVGG--GIR--SLEDIERLLDLGVSRVI 100 (234) T ss_pred CHHHHHHHHHHCCCCEEEEEECCCHHHCCCCHHHHHHHHH-----HHCCCCEEECC--CCC--CHHHHHHHHHCCCCEEE T ss_conf 9999999999869998999967530308911599999999-----76795689737--717--59999999864887189 Q ss_pred EECCCCHHHHHHHHHHHHHHHCCC------CCEECCCCCC-----------HHHHHHHHHCCCCEEEEEECCCCCCCCCH Q ss_conf 824841138899999999985798------7245146469-----------99999998548758996102787665310 Q gi|254780889|r 251 VDTAHGHSQKVLDAVVQIKKNFPS------LLVMAGNIAT-----------AEGALALIDAGADIIKVGIGPGSICTTRV 313 (493) Q Consensus 251 iD~ahGh~~~~~~~i~~~k~~~~~------~~ii~GNv~t-----------~~~~~~l~~~Gad~ikvGig~Gs~CtTr~ 313 (493) +-+.-- .-.+.++++-+.|++ +++--|++.+ .+-.+.+.+.|+..+.+- +..|+ T Consensus 101 i~s~~~---~~~~~~~~~~~~~G~q~iv~slD~k~~~~~~~~~~~~~~~~~~~~i~~~~~~g~geiilt------~i~~d 171 (234) T cd04732 101 IGTAAV---KNPELVKELLKEYGGERIVVGLDAKDGKVATKGWLETSEVSLEELAKRFEELGVKAIIYT------DISRD 171 (234) T ss_pred ECCCHH---HCHHHHHHHHHHCCCCCEEEEEEEECCCHHCCCCCEEECCCHHHHHHHHHHCCCCEEEEE------EECCC T ss_conf 714011---082789999998297646999997512000168640013516999999974586469987------64256 Q ss_pred HCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHH Q ss_conf 0015743100378899986226981997265437776213367148789932301 Q gi|254780889|r 314 VTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLL 368 (493) Q Consensus 314 ~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~ 368 (493) =|+-| |-+..+... .+..++|+|+-||+++..|+-+...+|++.|.+|+.| T Consensus 172 Gt~~G-~d~~ll~~i---~~~~~~p~i~~GGv~s~~di~~l~~~g~~gvivgsAl 222 (234) T cd04732 172 GTLSG-PNFELYKEL---AAATGIPVIASGGVSSLDDIKALKELGVAGVIVGKAL 222 (234) T ss_pred CCCCC-CCHHHHHHH---HHHCCCCEEEEECCCCHHHHHHHHHCCCCEEEEEHHH T ss_conf 65356-899999999---8657998999818999999999997799899998898 No 257 >cd04598 CBS_pair_GGDEF_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the GGDEF (DiGuanylate-Cyclase (DGC)) domain. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.85 E-value=2.2e-05 Score=58.00 Aligned_cols=50 Identities=24% Similarity=0.304 Sum_probs=42.9 Q ss_pred CCCCCCCCCCCHHHHHHHHHHC-CCCEEEEECCCCCEEEEEEECCCHHCCCC Q ss_conf 7863257303589999999865-53306457479978688980210001017 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQH-RIEKLLVVDDDGCCIGLITVKDIERSQLN 211 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~-ki~~LPVVDe~g~LvGIIT~~DIlk~~~~ 211 (493) +++.|+++++++.+|.++|.++ .+..+|||| +|+++|+||+.|+++.... T Consensus 1 rpv~tv~pd~t~~ev~~~f~~~~~i~~lpVvd-~~r~vGiisr~dl~~~~~~ 51 (119) T cd04598 1 RPAPTVSPDTTVNDVLERFERDPDLSALAVVD-DGRPVGLIMREALMELLST 51 (119) T ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCCEEEEEE-CCEEEEEEEHHHHHHHHCC T ss_conf 99857499984999999998789967689987-9988999989999998707 No 258 >cd04590 CBS_pair_CorC_HlyC_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the CorC_HlyC domain. CorC_HlyC is a transporter associated domain. This small domain is found in Na+/H+ antiporters, in proteins involved in magnesium and cobalt efflux, and in association with some proteins of unknown function. The function of the CorC_HlyC domain is uncertain but it might be involved in modulating transport of ion substrates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, Probab=97.85 E-value=6.1e-05 Score=54.86 Aligned_cols=49 Identities=24% Similarity=0.404 Sum_probs=39.3 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEECCCHHCCC Q ss_conf 86325730358999999986553306457479-97868898021000101 Q gi|254780889|r 162 NLITVKKTVNLENAKALLHQHRIEKLLVVDDD-GCCIGLITVKDIERSQL 210 (493) Q Consensus 162 ~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~-g~LvGIIT~~DIlk~~~ 210 (493) ++++++.+.+++++.++|.+++..++||++++ ++++|+++.+|+++... T Consensus 2 ~v~~i~~~~s~~e~~~~~~~~~~sr~PV~~~~~~~ivGil~~kDll~~~~ 51 (111) T cd04590 2 DIVALDADDTLEEILELIAESGHSRFPVYDGDLDNIIGVVHVKDLLRALA 51 (111) T ss_pred CEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEEHHHHHHHH T ss_conf 68999699999999999997799779999547887789999114456665 No 259 >PRK08255 salicylyl-CoA 5-hydroxylase; Reviewed Probab=97.85 E-value=0.00036 Score=49.32 Aligned_cols=131 Identities=22% Similarity=0.234 Sum_probs=91.1 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCHH------------------------HHHHHHHHHHHHHCCC-CCEEC-------- Q ss_conf 15678999873136669982484113------------------------8899999999985798-72451-------- Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVVDTAHGHS------------------------QKVLDAVVQIKKNFPS-LLVMA-------- 279 (493) Q Consensus 233 d~~eRa~~LveaGvDviviD~ahGh~------------------------~~~~~~i~~~k~~~~~-~~ii~-------- 279 (493) ++.+-++.-++||-|+|=|-.||||- ...++.++.+|+..|. .|+.+ T Consensus 557 ~F~~AA~rA~~AGFD~IEiH~AHGYLl~qFLSPlsN~RtDeYGGsleNR~Rf~lEV~~aVR~~~p~~~Pl~vRiSatDw~ 636 (770) T PRK08255 557 QFVAATRRAAEAGFDWLELHCAHGYLLSSFISPLTNQRTDEYGGSLENRLRYPLEVFRAVRAVWPADKPMSVRISAHDWV 636 (770) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCCHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCC T ss_conf 99999999998399989995234555887538644677543578888777889999999998678988669998510256 Q ss_pred CCCCCHH----HHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEE Q ss_conf 4646999----999998548758996102787665310001574310037889998622698199726543777621336 Q gi|254780889|r 280 GNIATAE----GALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAI 355 (493) Q Consensus 280 GNv~t~~----~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAl 355 (493) .+=-|.| -++.|.++|+|.|-|-.|.-+... +.+.|-+ ++ +--+..+.++.++|+||-|.|..+-..-..| T Consensus 637 ~gG~t~edsv~la~~l~~~GvD~IdvSsGg~~~~~-~p~~g~~---yQ-vpfA~~Ir~e~~i~t~AVG~I~~p~~Ae~Il 711 (770) T PRK08255 637 EGGNTPDDAVEIARAFKAAGADMIDVSSGQVSKDE-KPVYGRM---YQ-TPFADRIRNEAGIATIAVGAISEADHVNSII 711 (770) T ss_pred CCCCCHHHHHHHHHHHHHCCCCEEEECCCCCCCCC-CCCCCCC---CC-HHHHHHHHHHCCCCEEEECCCCCHHHHHHHH T ss_conf 89999999999999999749989995788888667-7888876---56-6999999987599789961889999999999 Q ss_pred EEC-CCEEEECCHH Q ss_conf 714-8789932301 Q gi|254780889|r 356 AAG-SACVMIGSLL 368 (493) Q Consensus 356 a~G-A~~VM~G~~~ 368 (493) +.| ||.|+||.-| T Consensus 712 ~~GrADlValgR~~ 725 (770) T PRK08255 712 AAGRADLCALARPH 725 (770) T ss_pred HCCCCCEEEECHHH T ss_conf 76998875247776 No 260 >cd04733 OYE_like_2_FMN Old yellow enzyme (OYE)-related FMN binding domain, group 2. Each monomer of OYE contains FMN as a non-covalently bound cofactor, uses NADPH as a reducing agent with oxygens, quinones, and alpha,beta-unsaturated aldehydes and ketones, and can act as electron acceptors in the catalytic reaction. Other members of OYE family include trimethylamine dehydrogenase, 2,4-dienoyl-CoA reductase, enoate reductase, pentaerythriol tetranitrate reductase, xenobiotic reductase, and morphinone reductase. Probab=97.84 E-value=0.00017 Score=51.72 Aligned_cols=135 Identities=19% Similarity=0.235 Sum_probs=86.4 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCH------------------------HHHHHHHHHHHHHHCC-CCCEE--------- Q ss_conf 1567899987313666998248411------------------------3889999999998579-87245--------- Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVVDTAHGH------------------------SQKVLDAVVQIKKNFP-SLLVM--------- 278 (493) Q Consensus 233 d~~eRa~~LveaGvDviviD~ahGh------------------------~~~~~~~i~~~k~~~~-~~~ii--------- 278 (493) ++.+-|+...+||.|.+-|-.|||| .+..++.++.+|+..+ +.+|. T Consensus 150 ~f~~AA~rA~~AGfDgVEiH~ahGYLl~qFlS~~~N~RtDeYGGS~ENR~Rf~lEii~avr~~vg~d~~v~~Ris~~d~~ 229 (338) T cd04733 150 RFAHAARLAQEAGFDGVQIHAAHGYLLSQFLSPLTNKRTDEYGGSLENRARLLLEIYDAIRAAVGPGFPVGIKLNSADFQ 229 (338) T ss_pred HHHHHHHHHHHCCCCEEEECCCCCHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECHHHHC T ss_conf 99999999998399989982365548998629876899685798988998899999999999719988699984535424 Q ss_pred CCCCCC---HHHHHHHHHCCCCEEEEEECCC---C-CCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHH Q ss_conf 146469---9999999854875899610278---7-66531000157431003788999862269819972654377762 Q gi|254780889|r 279 AGNIAT---AEGALALIDAGADIIKVGIGPG---S-ICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDI 351 (493) Q Consensus 279 ~GNv~t---~~~~~~l~~~Gad~ikvGig~G---s-~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi 351 (493) -|-... .+-++.|.++|.|.+-|-.|.= . .......+ ..+.-.-+--+..+.+..++|||+-|+|++.-.+ T Consensus 230 ~~G~~~~d~~~~~~~l~~~GvD~i~vs~G~~~~~~~~~~~~~~~--~~~~~~~~~~a~~ik~~~~~Pvi~~G~i~~~~~a 307 (338) T cd04733 230 RGGFTEEDALEVVEALEEAGVDLVELSGGTYESPAMAGAKKEST--IAREAYFLEFAEKIRKVTKTPLMVTGGFRTRAAM 307 (338) T ss_pred CCCCCHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCC--CCCCCCHHHHHHHHHHHCCCCEEEECCCCCHHHH T ss_conf 79999899999999998769988994688545732247765444--5675105999999999849979998998999999 Q ss_pred EEEEEEC-CCEEEECCHHH Q ss_conf 1336714-87899323010 Q gi|254780889|r 352 AKAIAAG-SACVMIGSLLA 369 (493) Q Consensus 352 ~kAla~G-A~~VM~G~~~a 369 (493) .++|.-| ||.|.+|..|- T Consensus 308 e~~l~~g~~DlV~~gR~~i 326 (338) T cd04733 308 EQALASGAVDGIGLARPLA 326 (338) T ss_pred HHHHHCCCCEEHHHHHHHH T ss_conf 9999879951089889999 No 261 >PRK04281 consensus Probab=97.84 E-value=0.00036 Score=49.31 Aligned_cols=183 Identities=19% Similarity=0.178 Sum_probs=111.9 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEE Q ss_conf 58999999986553306457479978688980210001017422326568765556651370156789998731366699 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVV 250 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviv 250 (493) +..++.+.+.+.+...+-+||=++..-|--...++++... +--.-.+-+|..|- ..|.++.|+++|+|-++ T Consensus 31 dP~~~ak~~~~~GadelhivDld~a~~~~~~~~~~I~~i~-----~~~~vpi~vGGGIr----s~e~~~~ll~~GadkVi 101 (254) T PRK04281 31 DPVEAAKRYNGEGADELTFLDITASSDNRDTILHIIEEVA-----GQVFIPLTVGGGVR----TVADIRRLLNAGADKVS 101 (254) T ss_pred CHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHH-----HHCCCCEEEECCEE----ECHHHHHHHHCCCCEEE T ss_conf 9999999999869999999968898777530899999998-----50796289977754----51889999976998899 Q ss_pred EECCCCHHHHHHHHHHHHHHHCCCCCEECC------C-------C-----------CCHHHHHHHHHCCCCEEEEEECCC Q ss_conf 824841138899999999985798724514------6-------4-----------699999999854875899610278 Q gi|254780889|r 251 VDTAHGHSQKVLDAVVQIKKNFPSLLVMAG------N-------I-----------ATAEGALALIDAGADIIKVGIGPG 306 (493) Q Consensus 251 iD~ahGh~~~~~~~i~~~k~~~~~~~ii~G------N-------v-----------~t~~~~~~l~~~Gad~ikvGig~G 306 (493) |.++--.+ .+.++.+-+.|++-.|+++ + + ...|-+..+.+.|+.-+. T Consensus 102 igs~a~~n---p~~l~~~~~~fG~q~Iv~siD~k~~~~~~~~~~i~~~g~~~~t~~~~~~~~~~~~~~g~geil------ 172 (254) T PRK04281 102 INTAAVTR---PDLIDEAAGFFGSQAIVAAVDAKAVNPENTRWEIFTHGGRNPTGLDAVEWAVEMQKRGAGEIL------ 172 (254) T ss_pred ECHHHHHC---CHHHHHHHHHCCCEEEEEEEEEEEECCCCCCEEEEECCCCCCCCCCHHHHHHHHHHCCCCEEE------ T ss_conf 77767649---267676787559821799998885024688459997588647754499999998752998999------ Q ss_pred CCCCCCHHCCCCC-CCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEEC-CCEEEECCHHHCCCCCCC Q ss_conf 7665310001574-310037889998622698199726543777621336714-878993230101216776 Q gi|254780889|r 307 SICTTRVVTGVGC-PQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAG-SACVMIGSLLAGTDESPG 376 (493) Q Consensus 307 s~CtTr~~~g~g~-pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~G-A~~VM~G~~~agt~Espg 376 (493) ||-=..-|... |-+..+...+ +..++||||-||+.+.-|+.+++..| +++|-.|++|--..-++. T Consensus 173 --~tdI~rDGt~~G~d~~l~~~i~---~~~~iPvIasGGv~~~~di~~~~~~~~~~~v~~g~~~~~~~~sl~ 239 (254) T PRK04281 173 --LTGMDRDGTKQGFNLPLTRAVA---EAVDIPVIASGGVGNVRHLIEGITEGKADAVLAAGIFHFGEIAIR 239 (254) T ss_pred --EEEECCCCCCCCCCHHHHHHHH---HHCCCCEEEECCCCCHHHHHHHHHHCCCCEEEEHHHHHCCCCCHH T ss_conf --9888578876876869999998---616998999789899999999998089888976437777998999 No 262 >pfam01207 Dus Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrouridines are found in the D loop of t-RNAs. The role of dihydrouridine in tRNA is currently unknown, but may increase conformational flexibility of the tRNA. It is likely that different family members have different substrate specificities, which may overlap. Dus 1 from Saccharomyces cerevisiae acts on pre-tRNA-Phe, while Dus 2 acts on pre-tRNA-Tyr and pre-tRNA-Leu. Dus 1 is active as a single subunit, requiring NADPH or NADH, and is stimulated by the presence of FAD. Some family members may be targeted to the mitochondria and even have a role in mitochondria. Probab=97.82 E-value=0.00076 Score=47.02 Aligned_cols=100 Identities=26% Similarity=0.320 Sum_probs=61.9 Q ss_pred HHHHHHHHHHHHHCC---CCCEECC----CCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHH Q ss_conf 889999999998579---8724514----646999999998548758996102787665310001574310037889998 Q gi|254780889|r 259 QKVLDAVVQIKKNFP---SLLVMAG----NIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEV 331 (493) Q Consensus 259 ~~~~~~i~~~k~~~~---~~~ii~G----Nv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~ 331 (493) ..+.+.++.+++..+ ++.+=.| +..+.+-++.|.++|++.+-|- +-|+.+---|......+.++.+. T Consensus 108 ~~~~~iv~a~~~~~~~PVtvK~RlG~d~~~~~~~~~~~~l~~~G~~~itvH------~Rt~~q~~~g~a~w~~i~~~k~~ 181 (309) T pfam01207 108 DLVAQIVKAVVKAVDIPVTVKIRIGWDESHENAVEIARRVEDAGAQALTVH------GRTRAQNYEGPADWDAIKQVKQA 181 (309) T ss_pred HHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEEE------CCCHHHCCCCCCCHHHHHHHHHH T ss_conf 899999999997558854675433788763889999999984688879996------76324026786541899999985 Q ss_pred HHCCCCEEEEECCCCCHHHHEEEEE-ECCCEEEECCH Q ss_conf 6226981997265437776213367-14878993230 Q gi|254780889|r 332 AERAGVAIVADGGIRFSGDIAKAIA-AGSACVMIGSL 367 (493) Q Consensus 332 ~~~~~~~iiadGGi~~~gdi~kAla-~GA~~VM~G~~ 367 (493) ..+|||+-|+|.+.-|+.+.+. -|+|.||+|.- T Consensus 182 ---~~ipvi~NGdi~~~~d~~~~l~~tg~dgvMigRg 215 (309) T pfam01207 182 ---VSIPVIANGDITDAEDAQRCLSYTGADGVMIGRG 215 (309) T ss_pred ---CCCCEEEECCCCCHHHHHHHHHHHCCCEEEECHH T ss_conf ---8982898089488999999986109999998489 No 263 >cd04723 HisA_HisF Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase (HisA) and the cyclase subunit of imidazoleglycerol phosphate synthase (HisF). The ProFAR isomerase catalyzes the fourth step in histidine biosynthesis, an isomerisation of the aminoaldose moiety of ProFAR to the aminoketose of PRFAR (N-(5'-phospho-D-1'-ribulosylformimino)-5-amino-1-(5''-phospho-ribosyl)-4-imidazolecarboxamide). In bacteria and archaea, ProFAR isomerase is encoded by the HisA gene. The Imidazole glycerol phosphate synthase (IGPS) catalyzes the fifth step of histidine biosynthesis, the formation of the imidazole ring. IGPS converts N1-(5'-phosphoribulosyl)-formimino-5-aminoimidazole-4-carboxamide ribonucleotide (PRFAR) to imidazole glycerol phosphate (ImGP) and 5'-(5-aminoimidazole-4-carboxamide) ribonucleotide (AICAR). This conversion involves two tightly coupled reactions in distinct active sites of IGPS. The two catalytic domains can be fused, like in fungi and pl Probab=97.80 E-value=0.00095 Score=46.32 Aligned_cols=178 Identities=18% Similarity=0.199 Sum_probs=102.8 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEE Q ss_conf 58999999986553306457479978688980210001017422326568765556651370156789998731366699 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVV 250 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviv 250 (493) +..++.+.+.+.+.+.+-|+|=+. ..|--...++++.. ++--.-.+-+|..|. ..+.++.+.+.|+|-++ T Consensus 36 dP~~~a~~~~~~ga~~lhivDLda-~~g~~~n~~~I~~i-----~~~~~~pi~vGGGIr----s~~~~~~~l~~Gadkvv 105 (233) T cd04723 36 DPLDVARAYKELGFRGLYIADLDA-IMGRGDNDEAIREL-----AAAWPLGLWVDGGIR----SLENAQEWLKRGASRVI 105 (233) T ss_pred CHHHHHHHHHHCCCCEEEEEECCC-CCCCCCHHHHHHHH-----HHHCCCCEEEECCCC----CHHHHHHHHHCCCCEEE T ss_conf 999999999987989899997865-46997539999999-----987899889970227----69999999860720152 Q ss_pred EECCCCHHHHHHHHHHHHHHH--CCCCCEECCCCCCH------HHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCC-CC Q ss_conf 824841138899999999985--79872451464699------9999998548758996102787665310001574-31 Q gi|254780889|r 251 VDTAHGHSQKVLDAVVQIKKN--FPSLLVMAGNIATA------EGALALIDAGADIIKVGIGPGSICTTRVVTGVGC-PQ 321 (493) Q Consensus 251 iD~ahGh~~~~~~~i~~~k~~--~~~~~ii~GNv~t~------~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~-pq 321 (493) |-++--.+....+.++..-.. .-++++.-|++.+. +...+.++..+. .=++|.=..-|.+- |. T Consensus 106 igs~~~~~~~~~~~~~~~g~~~ivvslD~k~~~~~~~~~~~~~~~~~~~~~~~~~--------eii~t~Id~dGt~~G~d 177 (233) T cd04723 106 VGTETLPSDDDEDRLAALGEQRLVLSLDFRGGQLLKPTDFIGPEELLRRLAKWPE--------ELIVLDIDRVGSGQGPD 177 (233) T ss_pred CCCHHCCCHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCCHHHHHHHHHHHCC--------EEEEEEECCCCCCCCCC T ss_conf 4510049989999999978999899999989978724643489999999996589--------59998643446567779 Q ss_pred CHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHH Q ss_conf 003788999862269819972654377762133671487899323010 Q gi|254780889|r 322 LSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLA 369 (493) Q Consensus 322 ~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~a 369 (493) +..+.... ....+|+|+-||+++..|+-+...+|++.|-+|+.|- T Consensus 178 ~~l~~~i~---~~~~~pvi~sGGv~s~~di~~l~~~g~~gvivg~alh 222 (233) T cd04723 178 LELLERLA---ARADIPVIAAGGVRSVEDLELLKKLGASGALVASALH 222 (233) T ss_pred HHHHHHHH---HHCCCCEEEECCCCCHHHHHHHHHCCCCEEEEEHHHH T ss_conf 99999999---8689989998899999999999978998999863977 No 264 >PRK00748 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Validated Probab=97.78 E-value=0.0008 Score=46.84 Aligned_cols=175 Identities=23% Similarity=0.261 Sum_probs=106.0 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEE Q ss_conf 35899999998655330645747997868898021000101742232656876555665137015678999873136669 Q gi|254780889|r 170 VNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLV 249 (493) Q Consensus 170 ~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDvi 249 (493) .+..++.+.+.+.....+-|||=++..-|--...++++... +--.-.+-+|.-| + ..|.++.++++|+|-+ T Consensus 29 ~dP~~~A~~~~~~Ga~~lhvvDLd~A~~g~~~n~~~I~~i~-----~~~~~pi~vGGGI--r--s~e~~~~~l~~GadkV 99 (241) T PRK00748 29 DDPLAQAQAWQDQGAEWLHLVDLDGAFAGRPVNLELIEAIV-----AAVDIPVQLGGGI--R--DLETVEAYLDAGVARV 99 (241) T ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHH-----HHCCCCEEEECCC--C--CHHHHHHHHHCCCCEE T ss_conf 89999999999879998999978542028820799999999-----8679999982770--7--4999999997697758 Q ss_pred EEECCCCHHHHHHHHHHHHHHHCCCC-----CEECCCCCC-----------HHHHHHHHHCCCCEEEEEECCCCCCCC-- Q ss_conf 98248411388999999999857987-----245146469-----------999999985487589961027876653-- Q gi|254780889|r 250 VVDTAHGHSQKVLDAVVQIKKNFPSL-----LVMAGNIAT-----------AEGALALIDAGADIIKVGIGPGSICTT-- 311 (493) Q Consensus 250 viD~ahGh~~~~~~~i~~~k~~~~~~-----~ii~GNv~t-----------~~~~~~l~~~Gad~ikvGig~Gs~CtT-- 311 (493) +|-++--.+ .+.++.+-+.|++. ++--|.|++ .|-.+.+.+.|+..+. ||. T Consensus 100 vigS~a~~n---~~~i~~~~~~~g~~ivvsiD~k~~~v~~~gw~~~t~~~~~~~i~~~~~~G~~eii--------~tdI~ 168 (241) T PRK00748 100 IIGTAAVKN---PELVKEACKKFPGRIVVGLDARDGKVATRGWQEVSGVDLEDLAKRFEDAGVAAII--------YTDIS 168 (241) T ss_pred EECCHHHHC---HHHHHHHHHHCCCCEEEEEEECCCEEECCCCCCCCCCCHHHHHHHHHHCCCCEEE--------EEEEE T ss_conf 864710339---6899999862355579999821665401575546797489999999855875699--------98870 Q ss_pred CHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEE---CCCEEEECCHH Q ss_conf 1000157431003788999862269819972654377762133671---48789932301 Q gi|254780889|r 312 RVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAA---GSACVMIGSLL 368 (493) Q Consensus 312 r~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~---GA~~VM~G~~~ 368 (493) |.=|.-| |-+..+.+.. +..++|+||-||+++.-|+.+.-.+ |.+.|-+|+.| T Consensus 169 ~DGt~~G-~d~~l~~~i~---~~~~ipviasGGv~s~~Di~~L~~~~~~gv~gviiG~Al 224 (241) T PRK00748 169 RDGTLSG-PNVELTRELA---AATPIPVIASGGVSSLDDIRALKALGPEGVEGVIVGKAL 224 (241) T ss_pred CCCCCCC-CCHHHHHHHH---HHCCCCEEEECCCCCHHHHHHHHHCCCCCCCEEEEEHHH T ss_conf 5685476-8999999999---868998999889999999999986031792489987898 No 265 >PRK10415 tRNA-dihydrouridine synthase B; Provisional Probab=97.70 E-value=0.0029 Score=42.80 Aligned_cols=73 Identities=22% Similarity=0.300 Sum_probs=50.0 Q ss_pred HHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEE-ECCCEEEE Q ss_conf 99999985487589961027876653100015743100378899986226981997265437776213367-14878993 Q gi|254780889|r 286 EGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIA-AGSACVMI 364 (493) Q Consensus 286 ~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla-~GA~~VM~ 364 (493) +-++.+.++|++.|-|- +-||.+.=-|.+....|.++.+ ...+|||+-|+|.+.-|+-+.+. .|+|.||+ T Consensus 153 ~~~~~~e~aG~~~itvH------gRT~~q~y~g~adw~~i~~vk~---~~~iPvi~NGDI~~~~da~~~l~~tg~dgvMi 223 (321) T PRK10415 153 EIAQLAEDCGIQALTIH------GRTRACLFNGEAEYDSIRAVKQ---KVSIPVIANGDITDPLKARAVLDYTGADALMI 223 (321) T ss_pred HHHHHHHHCCCCEEEEE------HHHHHHHHCCCCCHHHHHHHHH---CCCCCEEECCCCCCHHHHHHHHHHHCCCEEEE T ss_conf 99999985698899997------2213443169987799999985---47997896589199999999998629999997 Q ss_pred CCH Q ss_conf 230 Q gi|254780889|r 365 GSL 367 (493) Q Consensus 365 G~~ 367 (493) |.- T Consensus 224 gRg 226 (321) T PRK10415 224 GRA 226 (321) T ss_pred CHH T ss_conf 566 No 266 >PRK10892 D-arabinose 5-phosphate isomerase; Provisional Probab=97.68 E-value=4.2e-05 Score=56.01 Aligned_cols=156 Identities=22% Similarity=0.328 Sum_probs=100.5 Q ss_pred CCCHHEEECCCCCCCCCCCEEEEEEECCCCEECCCEEE--ECCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHH Q ss_conf 46000575276432475422533430388160031783--3785236888999999779879997898989999999987 Q gi|254780889|r 14 LTFDDVLLRPEFSNVLPRDIDISTRIAKDFTLNLPIMS--AAMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQV 91 (493) Q Consensus 14 lTfDDVllvP~~s~v~~~~vdl~t~lt~~i~l~iPivS--s~MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~kV 91 (493) ++=+|+++.=++|.-...-+++- ++.++. ++|+++ ++-+ +. +|+.--+-+. ++.+++ T Consensus 92 i~~~D~~i~~S~SG~t~El~~ll-~~~~~~--~~~iI~it~~~~----S~----l~~~sd~~l~---~~~~~E------- 150 (326) T PRK10892 92 VTPQDVVIAISNSGESSEILALI-PVLKRL--HVPLICITGRPE----SS----MARAADVHLC---VKVPKE------- 150 (326) T ss_pred CCCCCEEEEEECCCCCHHHHHHH-HHHHHC--CCCEEEEECCCC----CH----HHHHCCEEEE---CCCCCC------- T ss_conf 77899899995898757787641-787766--985899956999----96----6880887896---578744------- Q ss_pred HHHCCCEECCCEEEECCCC------HHHH--HHHHHHHCCCCEEEECCCC--EEEEEEEEHHHHCCCCCCCCCCCCCCCC Q ss_conf 4223881528789946543------0345--6566760504167866886--0278872354500143446724400267 Q gi|254780889|r 92 KKFESGMVVNPVTISPYAT------LADA--LALMKKYSISGIPVVESDV--GKLVGILTNRDVRFASNAQQAVGELMTR 161 (493) Q Consensus 92 Kr~e~gmI~dPVti~pd~T------I~ea--~~lm~~~~is~iPVVD~~~--gkLiGIVT~rDir~~~~~~~~V~eiMt~ 161 (493) -+|.-+-|.+| +.|| ..+|.+++|+. ++. .++-|-+ .+.+..+|+++|.+ T Consensus 151 --------acp~~laPTtStt~~lalgDala~~lm~~rgF~~-----~dFa~~HPgG~L-------Gk~Ll~~V~dlM~~ 210 (326) T PRK10892 151 --------ACPLGLAPTSSTTATLVMGDALAVALLKARGFTA-----EDFALSHPGGAL-------GRKLLLRVSDIMHT 210 (326) T ss_pred --------CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCH-----HHHHHCCCCCHH-------HHHHHHHHHHHHHC T ss_conf --------6656777518999999999899999999849998-----689760989777-------78888789998513 Q ss_pred --CCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCC Q ss_conf --8632573035899999998655330645747997868898021000101 Q gi|254780889|r 162 --NLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 162 --~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) ++..+++++++.|+...|.+.+...+.|+|++++|+|++|-.|+-|..+ T Consensus 211 ~~~lP~v~~~~~l~dai~~mt~k~lG~v~Vvd~~gkL~GIiTDGDLRR~l~ 261 (326) T PRK10892 211 GDEIPHVKKTASLRDALLEITRKNLGMTVICDDNMMIEGIFTDGDLRRVFD 261 (326) T ss_pred CCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCEEEEEECHHHHHHHH T ss_conf 765885377602999999874589349999858996899986268999987 No 267 >cd02801 DUS_like_FMN Dihydrouridine synthase-like (DUS-like) FMN-binding domain. Members of this family catalyze the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archaea. Most dihydrouridines are found in the D loop of t-RNAs. The role of dihydrouridine in tRNA is currently unknown, but may increase conformational flexibility of the tRNA. It is likely that different family members have different substrate specificities, which may overlap. 1VHN, a putative flavin oxidoreductase, has high sequence similarity to DUS. The enzymatic mechanism of 1VHN is not known at the present. Probab=97.64 E-value=0.0025 Score=43.32 Aligned_cols=99 Identities=29% Similarity=0.339 Sum_probs=57.1 Q ss_pred HHHHHHHHHHHHHCCCCCE----ECC--CC-CCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHH Q ss_conf 8899999999985798724----514--64-6999999998548758996102787665310001574310037889998 Q gi|254780889|r 259 QKVLDAVVQIKKNFPSLLV----MAG--NI-ATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEV 331 (493) Q Consensus 259 ~~~~~~i~~~k~~~~~~~i----i~G--Nv-~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~ 331 (493) +.+.+.++.+++..+ ++| =.| +- .+.+-++.|.++|++.+-|- +-|+.+--.|.+....+.++.+ T Consensus 109 ~~v~~iv~~~~~~~~-ipVsvKiRlg~~~~~~~~~~~~~l~~~G~~~ltvH------~Rt~~q~~~~~a~~e~i~~~~~- 180 (231) T cd02801 109 ELVAEIVRAVREAVP-IPVTVKIRLGWDDEEETLELAKALEDAGASALTVH------GRTREQRYSGPADWDYIAEIKE- 180 (231) T ss_pred HHHHHHHHHHHHHCC-CCEEEEEEECCCCHHHHHHHHHHHHHCCCCEEEEE------CCCHHHCCCCCCCHHHHHHHHH- T ss_conf 899999999997569-94799997077863479999999997699899983------5614414677622699999986- Q ss_pred HHCCCCEEEEECCCCCHHHHEEEEEE-CCCEEEECCH Q ss_conf 62269819972654377762133671-4878993230 Q gi|254780889|r 332 AERAGVAIVADGGIRFSGDIAKAIAA-GSACVMIGSL 367 (493) Q Consensus 332 ~~~~~~~iiadGGi~~~gdi~kAla~-GA~~VM~G~~ 367 (493) ...+|||+-|+|.+.-|+-+.+.. |+|.||+|.- T Consensus 181 --~~~ipvi~NGdI~s~~d~~~~~~~tg~dgvMigRg 215 (231) T cd02801 181 --AVSIPVIANGDIFSLEDALRCLEQTGVDGVMIGRG 215 (231) T ss_pred --CCCCEEEEECCCCCHHHHHHHHHHHCCCEEEECHH T ss_conf --59977998389099999999998509999998788 No 268 >cd02933 OYE_like_FMN Old yellow enzyme (OYE)-like FMN binding domain. OYE was the first flavin-dependent enzyme identified, however its true physiological role remains elusive to this day. Each monomer of OYE contains FMN as a non-covalently bound cofactor, uses NADPH as a reducing agent with oxygens, quinones, and alpha,beta-unsaturated aldehydes and ketones, and can act as electron acceptors in the catalytic reaction. Members of OYE family include 12-oxophytodienoate reductase, pentaerythritol tetranitrate reductase, morphinone reductase, and related enzymes. Probab=97.64 E-value=0.00045 Score=48.67 Aligned_cols=127 Identities=16% Similarity=0.114 Sum_probs=83.5 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCH------------------------HHHHHHHHHHHHHHCCCCCEEC--------- Q ss_conf 1567899987313666998248411------------------------3889999999998579872451--------- Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVVDTAHGH------------------------SQKVLDAVVQIKKNFPSLLVMA--------- 279 (493) Q Consensus 233 d~~eRa~~LveaGvDviviD~ahGh------------------------~~~~~~~i~~~k~~~~~~~ii~--------- 279 (493) ++.+-|....+||.|.|-|--|||| ....++.++.+|+..+.-.|.. T Consensus 153 ~f~~aA~ra~~AGfDgVEiHaaHGyLl~qFLSp~~N~RtDeYGGS~eNR~Rf~~Eii~aIR~~vg~d~i~vRlSp~~~~~ 232 (338) T cd02933 153 DFRQAARNAIEAGFDGVEIHGANGYLIDQFLRDGSNKRTDEYGGSIENRARFLLEVVDAVAEAIGADRVGIRLSPFGTFN 232 (338) T ss_pred HHHHHHHHHHHCCCCEEEEECCCHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCC T ss_conf 99999999998399999982244068998538532689897899989998999999999999729870899965766768 Q ss_pred CCCC--CH----HHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEE Q ss_conf 4646--99----99999985487589961027876653100015743100378899986226981997265437776213 Q gi|254780889|r 280 GNIA--TA----EGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAK 353 (493) Q Consensus 280 GNv~--t~----~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~k 353 (493) |... +. +-++.|.++|.|.+-|..+ +.... +.......+....+..++|||+-||| ++-.+.+ T Consensus 233 g~~~~~~~~~~~~~~~~l~~~gid~~~v~~~--~~~~~--------~~~~~~~~~~~ir~~~~~pvi~~G~i-~~~~ae~ 301 (338) T cd02933 233 DMGDSDPEATFSYLAKELNKRGLAYLHLVEP--RVAGN--------PEDQPPDFLDFLRKAFKGPLIAAGGY-DAESAEA 301 (338) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCCEEEEECC--CCCCC--------CCCCCHHHHHHHHHHCCCCEEEECCC-CHHHHHH T ss_conf 8788877999999999999859988997268--77777--------77765779999999869979996998-9999999 Q ss_pred EEEEC-CCEEEECCHHHC Q ss_conf 36714-878993230101 Q gi|254780889|r 354 AIAAG-SACVMIGSLLAG 370 (493) Q Consensus 354 Ala~G-A~~VM~G~~~ag 370 (493) +|+.| ||.|.||..|=. T Consensus 302 ~l~~G~~D~V~~gR~lia 319 (338) T cd02933 302 ALADGKADLVAFGRPFIA 319 (338) T ss_pred HHHCCCCCHHHHHHHHHH T ss_conf 998799603685299987 No 269 >cd02205 CBS_pair The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic generali Probab=97.60 E-value=9.3e-05 Score=53.56 Aligned_cols=51 Identities=35% Similarity=0.556 Sum_probs=45.5 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCC Q ss_conf 786325730358999999986553306457479978688980210001017 Q gi|254780889|r 161 RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLN 211 (493) Q Consensus 161 ~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~ 211 (493) ++++++++++++.+|.+.|.+++++.+||+|++|+++|++|.+|+++.... T Consensus 1 ~~~~~v~~~~~l~~a~~~m~~~~~~~~~Vvd~~~~~~G~it~~dl~~~~~~ 51 (113) T cd02205 1 RDVVTVSPDDTVAEALRLMLEHGISGLPVVDDDGRLVGIVTERDLLRALAE 51 (113) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHHH T ss_conf 997796892989999999998398099999399959999988998988763 No 270 >COG0042 tRNA-dihydrouridine synthase [Translation, ribosomal structure and biogenesis] Probab=97.59 E-value=0.0089 Score=39.34 Aligned_cols=31 Identities=23% Similarity=0.186 Sum_probs=23.6 Q ss_pred CCCEECCCEEEECCCCCCCHHHHHHHHHCCC Q ss_conf 8816003178337852368889999997798 Q gi|254780889|r 41 KDFTLNLPIMSAAMDQVTDSRLAIAMAQAGG 71 (493) Q Consensus 41 ~~i~l~iPivSs~MDTVTe~~MAiamA~~GG 71 (493) ..+.+.-+++-|||-.||+..+-.-..+.|+ T Consensus 5 ~~~~~~~~~~lAPM~gvtd~~fR~l~~~~ga 35 (323) T COG0042 5 GLIELRNRVILAPMAGVTDLPFRRLARELGA 35 (323) T ss_pred CCCCCCCCEEEECCCCCCCHHHHHHHHHHCC T ss_conf 5555678778834889866899999999588 No 271 >PRK04128 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Provisional Probab=97.58 E-value=0.0005 Score=48.33 Aligned_cols=170 Identities=22% Similarity=0.318 Sum_probs=103.0 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEE Q ss_conf 35899999998655330645747997868898021000101742232656876555665137015678999873136669 Q gi|254780889|r 170 VNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLV 249 (493) Q Consensus 170 ~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDvi 249 (493) +|++-|.. +.+. ...+-+||=++..-|--...++++... +...-.+-+|..| + ..|.++.+.+.|+|-+ T Consensus 31 dP~~~A~~-~~~~-a~~lhivDLd~a~~g~~~n~~~I~~i~-----~~~~~piqvGGGI--r--s~e~i~~~l~~Ga~kV 99 (228) T PRK04128 31 DPVEIALR-FSEY-VDKIHVVDLDGAFEGKPKNLDVVKNII-----EETGLKVQVGGGF--R--TYESIKDAYEIGVENV 99 (228) T ss_pred CHHHHHHH-HHHC-CCEEEEEECCCCCCCCCCHHHHHHHHH-----HHCCCEEEECCCC--C--CHHHHHHHHHCCCCEE T ss_conf 99999999-9966-998999988030149832699999998-----6549628973860--7--7999999996899769 Q ss_pred EEECCCCHHHHHHHHHHHHHHHCCC----CCEECCCCCC----------HHHHHHHHHCCCCEEEEEECCCCCCCCCHHC Q ss_conf 9824841138899999999985798----7245146469----------9999999854875899610278766531000 Q gi|254780889|r 250 VVDTAHGHSQKVLDAVVQIKKNFPS----LLVMAGNIAT----------AEGALALIDAGADIIKVGIGPGSICTTRVVT 315 (493) Q Consensus 250 viD~ahGh~~~~~~~i~~~k~~~~~----~~ii~GNv~t----------~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~ 315 (493) ++-++-- + .+.++++.+.|++ +++-.|.+++ .+.+..+.+.++..+ +||.=..- T Consensus 100 iigt~a~-~---~~~l~~~~~~~~~ivvslD~k~~~v~~~gw~~~~~~~~~~~~~~~~~~~~~i--------i~tdI~~d 167 (228) T PRK04128 100 IIGTKAF-D---IEFLEKITSEFNGITVSLDVKGGRIATKGWLEESSIKVEDAYKMLRNYVNRF--------IYTSIERD 167 (228) T ss_pred EECCCCC-C---HHHHHHHHHHCCCCCEEEECCCCEEECCCCEECCCCCHHHHHHHHHHHCCCE--------EEEEECCC T ss_conf 8145125-8---0899999972647636887137868526848889988999999998638453--------76312654 Q ss_pred CCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHH Q ss_conf 15743100378899986226981997265437776213367148789932301 Q gi|254780889|r 316 GVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLL 368 (493) Q Consensus 316 g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~ 368 (493) | +++-+-...+. ...+|+||-||+++..|+.+...+|++.|-+|+.| T Consensus 168 G----t~~G~~~~~~~--~~~~~iiasGGv~s~~Dl~~l~~~g~~gvivG~Al 214 (228) T PRK04128 168 G----TLTGIENIERF--WGDEEFIYAGGVSSIEDVKKLAEIGFSGAIIGKAL 214 (228) T ss_pred C----CCCCHHHHHHH--CCCCCEEEECCCCCHHHHHHHHHCCCCEEEEEHHH T ss_conf 3----00388999986--16896898789899999999996799899998538 No 272 >pfam00571 CBS CBS domain. CBS domains are small intracellular modules that pair together to form a stable globular domain. This family represents a single CBS domain. Pairs of these domains have been termed a Bateman domain. CBS domains have been shown to bind ligands with an adenosyl group such as AMP, ATP and S-AdoMet. CBS domains are found attached to a wide range of other protein domains suggesting that CBS domains may play a regulatory role making proteins sensitive to adenosyl carrying ligands. The region containing the CBS domains in Cystathionine-beta synthase is involved in regulation by S-AdoMet. CBS domain pairs from AMPK bind AMP or ATP. The CBS domains from IMPDH and the chloride channel CLC2 bind ATP. Probab=97.57 E-value=0.00013 Score=52.45 Aligned_cols=50 Identities=48% Similarity=0.717 Sum_probs=44.8 Q ss_pred CCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC Q ss_conf 881528789946543034565667605041678668860278872354500 Q gi|254780889|r 96 SGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR 146 (493) Q Consensus 96 ~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir 146 (493) ..|..+++++++++++.++++.|.+++++.+||+|++ ++++|++|.+|+. T Consensus 3 ~im~~~~~~v~~~~~~~~~~~~~~~~~~~~~~Vvd~~-~~~~Gvit~~dl~ 52 (57) T pfam00571 3 DIMTPDVVTVPPDTSLEEALELMRENGISRLPVVDED-GKLVGIVTLRDLL 52 (57) T ss_pred CCCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHH T ss_conf 2277999899790909999999985399579999279-9499999789999 No 273 >PRK10550 tRNA-dihydrouridine synthase C; Provisional Probab=97.57 E-value=0.0077 Score=39.81 Aligned_cols=30 Identities=17% Similarity=0.222 Sum_probs=24.7 Q ss_pred CEEEECCCCCCCHHHHHHHHHCCCEEEECC Q ss_conf 178337852368889999997798799978 Q gi|254780889|r 48 PIMSAAMDQVTDSRLAIAMAQAGGLGVIHR 77 (493) Q Consensus 48 PivSs~MDTVTe~~MAiamA~~GGiGvIHr 77 (493) .++-|||-.||...+=..+.+.||.++..- T Consensus 2 rl~LAPMaGvTd~~fR~l~~~~g~~dl~~T 31 (312) T PRK10550 2 RVLLAPMEGVLDSLVRELLTEVNDYDLCIT 31 (312) T ss_pred EEEEECCCCCCCHHHHHHHHHHCCCCEEEE T ss_conf 099756788877999999999689988996 No 274 >PRK10070 glycine betaine transporter ATP-binding subunit; Provisional Probab=97.53 E-value=0.0018 Score=44.37 Aligned_cols=97 Identities=13% Similarity=0.168 Sum_probs=67.1 Q ss_pred HHHHHHHHCCCCEEEECCC--------------CEEEEEEEEHHHH----------CCCCCCC----CCCCCCCCCCCCC Q ss_conf 5656676050416786688--------------6027887235450----------0143446----7244002678632 Q gi|254780889|r 114 ALALMKKYSISGIPVVESD--------------VGKLVGILTNRDV----------RFASNAQ----QAVGELMTRNLIT 165 (493) Q Consensus 114 a~~lm~~~~is~iPVVD~~--------------~gkLiGIVT~rDi----------r~~~~~~----~~V~eiMt~~lit 165 (493) ..++..+.+.+.+.|+.+- .|+++-+=|-.++ +|..+.+ .+++++|.+++.+ T Consensus 207 l~~L~~~~~~TiifVTHDl~eA~~laDRIaVM~~G~Ivq~GtpeeI~~~Pa~~yV~~F~~~v~~~~~~~a~~i~~~~~~~ 286 (400) T PRK10070 207 LVKLQAKHQRTIVFISHDLDEAMRIGDRIAIMQNGEVVQVGTPDEILNNPANDYVRTFFRGVDISQVFSAKDIARRTPNG 286 (400) T ss_pred HHHHHHHHCCEEEEECCCHHHHHHHCCEEEEEECCEEEEECCHHHHHHCCCCHHHHHHHCCCCHHHCEEHHHHHCCCCCC T ss_conf 99999985999999899999999869999999898899972889998679986899875547787702398962468752 Q ss_pred C---CCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCC Q ss_conf 5---73035899999998655330645747997868898021000101 Q gi|254780889|r 166 V---KKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 166 v---~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) + .+.....+|++.|.+++...+.|+|++++++|+++..|+.+... T Consensus 287 ~~~~~~~~~~~~al~~m~~~~~~~~~vvd~~~~~~G~v~~~~~~~~~~ 334 (400) T PRK10070 287 LIRKTPGFGPRSALKLLQDEDREYGYVIERGNKFVGAVSIDSLKAALT 334 (400) T ss_pred EEECCCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHHH T ss_conf 131488869999999998559867999869980889988999997763 No 275 >COG2905 Predicted signal-transduction protein containing cAMP-binding and CBS domains [Signal transduction mechanisms] Probab=97.50 E-value=8.5e-05 Score=53.85 Aligned_cols=57 Identities=28% Similarity=0.481 Sum_probs=39.8 Q ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCC Q ss_conf 724400267863257303589999999865533064574799786889802100010 Q gi|254780889|r 153 QAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQ 209 (493) Q Consensus 153 ~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~ 209 (493) ++|.++++.+++++++..++.+|.+.|.+++.+.+.++|+++.++||+|++|+.+.. T Consensus 149 trv~~~~~~~~~~v~~~~~i~~aa~km~~~gv~s~v~l~~~~~~~GIvT~~dl~~~v 205 (610) T COG2905 149 TRVGEVKTLPAVTVSPQASIQDAARKMKDEGVSSLVVLDDSGPLLGIVTRKDLRSRV 205 (610) T ss_pred HHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCCEEEHHHHHHHH T ss_conf 987777417875268657679999999960887289983798743324227778999 No 276 >PRK10415 tRNA-dihydrouridine synthase B; Provisional Probab=97.46 E-value=0.0019 Score=44.08 Aligned_cols=201 Identities=18% Similarity=0.239 Sum_probs=103.2 Q ss_pred CCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCC-HHHHHHHHHHH-HHHCCCEECCC--EEEE--CCCCHHHH Q ss_conf 8816003178337852368889999997798799978989-89999999987-42238815287--8994--65430345 Q gi|254780889|r 41 KDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFS-PSEQVAQVHQV-KKFESGMVVNP--VTIS--PYATLADA 114 (493) Q Consensus 41 ~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~s-ie~qa~~V~kV-Kr~e~gmI~dP--Vti~--pd~TI~ea 114 (493) -++.|+-|++-|||-.||...+= .|.+.-|-++..--|- .+.....-.+- .+..+.--..| +-+. .-..+.++ T Consensus 4 g~~~~~~~l~lAPMagvtd~~FR-~l~~~~Ga~l~~TEmv~a~~~~~~~~~~~~~~~~~~~~~~~~vQl~G~dp~~~a~A 82 (321) T PRK10415 4 GQYQLRNRLIAAPMAGITDRPFR-TLCYEMGAGLTVSEMMSSNPQVWESDKSRLRMVHVDEPGIRTVQIAGSDPKEMADA 82 (321) T ss_pred CCEECCCCEEECCCCCCCCHHHH-HHHHHHCCCEEEECCEEECHHHHCCHHHHHHHCCCCCCCCCEEEECCCCHHHHHHH T ss_conf 98844898897357899489999-99999883999987587127773384889863046788980599726999999999 Q ss_pred HHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCC--CCCCCCCCHHHHHHHHHHCCCCEEEEECC Q ss_conf 6566760504167866886027887235450014344672440026786--32573035899999998655330645747 Q gi|254780889|r 115 LALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNL--ITVKKTVNLENAKALLHQHRIEKLLVVDD 192 (493) Q Consensus 115 ~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~l--itv~~~~~l~eA~~iM~~~ki~~LPVVDe 192 (493) .++..+.+++.+ |= +...|++.+..+.- .-.....-+.+..+.|.+. -.+||. T Consensus 83 a~~~~~~g~~~I---Di------------------N~GCP~~kV~k~g~GsaLl~~p~~~~~iv~a~~~a--~~iPVT-- 137 (321) T PRK10415 83 ARINVESGAQII---DI------------------NMGCPAKKVNRKLAGSALLQYPDLVKSILTEVVNA--VDVPVT-- 137 (321) T ss_pred HHHHHHCCCCEE---EE------------------ECCCCHHHHCCCCCEEEHHCCHHHHHHHHHHHHHC--CCCCEE-- T ss_conf 998876499989---43------------------18999899707983650633989999999999734--487469-- Q ss_pred CCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECC---CHHHHHHHHHHHCCCCEEEEECCCCHHHHH-------H Q ss_conf 997868898021000101742232656876555665137---015678999873136669982484113889-------9 Q gi|254780889|r 193 DGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVA---KDIADRVGPLFDVNVDLVVVDTAHGHSQKV-------L 262 (493) Q Consensus 193 ~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~---~d~~eRa~~LveaGvDviviD~ahGh~~~~-------~ 262 (493) + |. |. |.. .+..+=+..+.++|+++|.| ||-+..+ - T Consensus 138 ------v-------Ki--------------Rl----G~~~~~~~~~~~~~~~e~aG~~~itv---HgRT~~q~y~g~adw 183 (321) T PRK10415 138 ------L-------KI--------------RT----GWAPEHRNCEEIAQLAEDCGIQALTI---HGRTRACLFNGEAEY 183 (321) T ss_pred ------E-------EE--------------EC----CCCCCHHHHHHHHHHHHHCCCCEEEE---EHHHHHHHHCCCCCH T ss_conf ------9-------98--------------46----88852243999999998569889999---722134431699877 Q ss_pred HHHHHHHHHCCCCCEEC-CCCCCHHHHHHHHH-CCCCEEEEE Q ss_conf 99999998579872451-46469999999985-487589961 Q gi|254780889|r 263 DAVVQIKKNFPSLLVMA-GNIATAEGALALID-AGADIIKVG 302 (493) Q Consensus 263 ~~i~~~k~~~~~~~ii~-GNv~t~~~~~~l~~-~Gad~ikvG 302 (493) +.|+.+|+.+ ++|||+ |+|-|++-++..++ .|+|+|-+| T Consensus 184 ~~i~~vk~~~-~iPvi~NGDI~~~~da~~~l~~tg~dgvMig 224 (321) T PRK10415 184 DSIRAVKQKV-SIPVIANGDITDPLKARAVLDYTGADALMIG 224 (321) T ss_pred HHHHHHHHCC-CCCEEECCCCCCHHHHHHHHHHHCCCEEEEC T ss_conf 9999998547-9978965891999999999986299999975 No 277 >PRK13586 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Provisional Probab=97.45 E-value=0.0036 Score=42.18 Aligned_cols=172 Identities=15% Similarity=0.073 Sum_probs=90.9 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEE Q ss_conf 58999999986553306457479978688980210001017422326568765556651370156789998731366699 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVV 250 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviv 250 (493) +.-++.+.+.+...+.+-+||=++- .|-=...++++... +-..-.+-+|- |++ ..+.++.++++|+|-++ T Consensus 30 dP~~~a~~~~~~Ga~~lhvvDLdaa-~g~~~N~~~I~~i~-----~~~~~piqvGG--GIr--s~e~~~~~l~~Ga~kVi 99 (231) T PRK13586 30 DPLKIAEELYNEGYDSIHVVDLDAA-EGKGDNEEYIKRIC-----KIGFSWIQVGG--GIR--DVEKAERLLSYDCSAIV 99 (231) T ss_pred CHHHHHHHHHHCCCCEEEEEECCCC-CCCCCHHHHHHHHH-----HHCCCCEEEEC--CCC--CHHHHHHHHHCCCCEEE T ss_conf 9999999999879998999967156-89984399999999-----74598579856--717--69999999977998899 Q ss_pred EECCCCHHHHHHHHHHHHHHHCCC------CCEEC-CCCCC----------HHHHHHHHHCCCCEEEEEECCCCCCCC-- Q ss_conf 824841138899999999985798------72451-46469----------999999985487589961027876653-- Q gi|254780889|r 251 VDTAHGHSQKVLDAVVQIKKNFPS------LLVMA-GNIAT----------AEGALALIDAGADIIKVGIGPGSICTT-- 311 (493) Q Consensus 251 iD~ahGh~~~~~~~i~~~k~~~~~------~~ii~-GNv~t----------~~~~~~l~~~Gad~ikvGig~Gs~CtT-- 311 (493) |-++--.+. +.++.+-+.|+. +++-- |.+.+ .+-...+.+.|+-.+. ||. T Consensus 100 igS~a~~np---~~~~~~~~~~G~~~iv~siD~~~~~~v~~~Gw~~~~~~~~~~i~~~~~~g~~~ii--------~TdI~ 168 (231) T PRK13586 100 MSTLPFTNP---DTFRRIVAGIGENRVLVSVDYDDRKYVLIKGWKEKSMKVEDAISHVNSLESLGVI--------FTYVC 168 (231) T ss_pred ECHHHHHCH---HHHHHHHHHHCCCCEEEEEEECCCCEEEEECCCCCCCCHHHHHHHHHHCCCCEEE--------EEEEC T ss_conf 768887695---9999999984996689999975896899848726886699999999975998899--------97645 Q ss_pred CHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHH Q ss_conf 100015743100378899986226981997265437776213367148789932301 Q gi|254780889|r 312 RVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLL 368 (493) Q Consensus 312 r~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~ 368 (493) |.=|.-| |-+-..... ......|++| ||+++-.|+.+...+|++.|-+|+.| T Consensus 169 ~DGt~~G-~d~~l~~~i---~~~~~~~i~a-GGi~s~~Di~~L~~~G~~gaivG~Al 220 (231) T PRK13586 169 NEGTKNG-IDNNVKRYV---KLVKGEKEYA-GGIGSIQDLQKLKKMGFDYAIVGMSF 220 (231) T ss_pred CHHCCCC-CCHHHHHHH---HHCCCCCEEE-CCCCCHHHHHHHHHCCCCEEEEEHHH T ss_conf 1120368-998999999---8718995998-68899999999986799889999788 No 278 >PRK07028 bifunctional hexulose-6-phosphate synthase/ribonuclease regulator; Validated Probab=97.42 E-value=0.0034 Score=42.31 Aligned_cols=160 Identities=23% Similarity=0.366 Sum_probs=88.7 Q ss_pred CHHHHHHHHHHHCCCCEEEEEC-----CCCHHHHHHHHHHHHHHHCCCCCE-ECCCCCCHHHHHHHHHCCCCEEEEEECC Q ss_conf 0156789998731366699824-----841138899999999985798724-5146469999999985487589961027 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDT-----AHGHSQKVLDAVVQIKKNFPSLLV-MAGNIATAEGALALIDAGADIIKVGIGP 305 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~-----ahGh~~~~~~~i~~~k~~~~~~~i-i~GNv~t~~~~~~l~~~Gad~ikvGig~ 305 (493) +|-.+|++.|.+.|||+|++-+ +.|. .-++.++.+++.++ ++| +||.+ .++.+...+.+|||.+-|| T Consensus 118 ~d~~~ra~el~~lGvd~I~vH~G~D~Q~~g~--~p~~~l~~v~~~~~-~~vAVAGGi-~~~t~~~~v~~GAdIvIVG--- 190 (429) T PRK07028 118 PDPVKRAVELEELGVDIINVHVGIDQQMLGK--DPLELLKKVSEEVS-IPIAAAGGL-DAETAVKAVEAGADIVIVG--- 190 (429) T ss_pred CCHHHHHHHHHHCCCCEEEEEEEECHHHCCC--CHHHHHHHHHHHCC-CEEEEECCC-CHHHHHHHHHCCCCEEEEC--- T ss_conf 9889999999970998899976233553179--84999999997559-718996687-8776999997599899989--- Q ss_pred CCCCCCCHHCCCCC--------C-------CCHHHHHHHHHHHCCCCEEEEE---------------CCCCCHHHHEEEE Q ss_conf 87665310001574--------3-------1003788999862269819972---------------6543777621336 Q gi|254780889|r 306 GSICTTRVVTGVGC--------P-------QLSAIMSVVEVAERAGVAIVAD---------------GGIRFSGDIAKAI 355 (493) Q Consensus 306 Gs~CtTr~~~g~g~--------p-------q~~av~~~~~~~~~~~~~iiad---------------GGi~~~gdi~kAl 355 (493) |+|--.+.+++.-+ | +-+.=.|..+.......|=|+| +|+|..|-.+--- T Consensus 191 gaI~~a~dp~~aAr~ir~ai~~~~~~~~~~~~~~~~~~~~~~~~vs~~n~sdamhr~~~~~~~~p~~~~~k~~G~avTV~ 270 (429) T PRK07028 191 GNIYKSADVTGAARDIREALDSPPVPTDKFKKSLDEEIRELFKEVSTSNISDAMHRKGAMKGLKPLVRGLKMVGPAVTVQ 270 (429) T ss_pred CCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHCCCCCCEECCCCCEEEEEEEEEE T ss_conf 40057999799999999997376767652002238999999985188761156664045578544246863466679999 Q ss_pred EECCCEEEECCHHHCCCCCCCCEEE-ECCEEEEEEEC-CCCHHHHHH Q ss_conf 7148789932301012167764487-36805888733-200688752 Q gi|254780889|r 356 AAGSACVMIGSLLAGTDESPGDIFL-YQGRSFKSYRG-MGSVAAMER 400 (493) Q Consensus 356 a~GA~~VM~G~~~agt~Espg~~~~-~~g~~~K~yrG-m~S~~A~~~ 400 (493) ...-|+.|.=..+ -.-.||+++. .++..-...|| +.|..|+.+ T Consensus 271 t~~gD~~~~~~Ai--D~A~PGDViVId~~G~d~A~WGeLmt~aA~~r 315 (429) T PRK07028 271 TFAGDWAKPVEAI--DVAEPGDVLVIDNCSSDIAPWGGLATLSCLNK 315 (429) T ss_pred ECCCCCCCHHHHH--HCCCCCCEEEEECCCCCCEEEHHHHHHHHHHC T ss_conf 5688740567786--24799978999679997513089999999987 No 279 >cd00945 Aldolase_Class_I Class I aldolases. The class I aldolases use an active-site lysine which stablilzes a reaction intermediates via Schiff base formation, and have TIM beta/alpha barrel fold. The members of this family include 2-keto-3-deoxy-6-phosphogluconate (KDPG) and 2-keto-4-hydroxyglutarate (KHG) aldolases, transaldolase, dihydrodipicolinate synthase sub-family, Type I 3-dehydroquinate dehydratase, DeoC and DhnA proteins, and metal-independent fructose-1,6-bisphosphate aldolase. Although structurally similar, the class II aldolases use a different mechanism and are believed to have an independent evolutionary origin. Probab=97.40 E-value=0.008 Score=39.69 Aligned_cols=118 Identities=27% Similarity=0.368 Sum_probs=77.4 Q ss_pred HHHHHHHHHHCCCCEE--EEECCC---CHHHHHHHHHHHHHHHC-CCCC--EEC--CCCCC----HHHHHHHHHCCCCEE Q ss_conf 5678999873136669--982484---11388999999999857-9872--451--46469----999999985487589 Q gi|254780889|r 234 IADRVGPLFDVNVDLV--VVDTAH---GHSQKVLDAVVQIKKNF-PSLL--VMA--GNIAT----AEGALALIDAGADII 299 (493) Q Consensus 234 ~~eRa~~LveaGvDvi--viD~ah---Gh~~~~~~~i~~~k~~~-~~~~--ii~--GNv~t----~~~~~~l~~~Gad~i 299 (493) -...++..++.|+|-| +++..+ |....+.+-++.+.+.- ..++ +|. +..-+ ...++..+++|||.| T Consensus 67 k~~e~~~ai~~GAdeid~v~~~~~~~~~~~~~~~~ei~~v~~~~~~g~~~kvi~e~~~l~~~~~i~~a~~~~~~~Gadfv 146 (201) T cd00945 67 KVAEVEEAIDLGADEIDVVINIGSLKEGDWEEVLEEIAAVVEAADGGLPLKVILETRGLKTADEIAKAARIAAEAGADFI 146 (201) T ss_pred HHHHHHHHHHHCCCEEEEECCHHHHHCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCEE T ss_conf 99999999990999899740567775668899999999999973579837999616778999999999999998099879 Q ss_pred EEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEE Q ss_conf 961027876653100015743100378899986226981997265437776213367148789 Q gi|254780889|r 300 KVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACV 362 (493) Q Consensus 300 kvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~V 362 (493) |...|-+..+.| +..+....+.+. ..++|.+-||||+.-+....+.+||+.+ T Consensus 147 KtstG~~~~~at----------~~~v~~m~~~~~-~~~~vk~sGGi~~~~~a~~~l~aGa~~i 198 (201) T cd00945 147 KTSTGFGGGGAT----------VEDVKLMKEAVG-GRVGVKAAGGIKTLEDALAAIEAGADGI 198 (201) T ss_pred EECCCCCCCCCC----------HHHHHHHHHHHC-CCCEEECCCCCCCHHHHHHHHHHCCCEE T ss_conf 855887889899----------999999999828-7863863589799999999998286535 No 280 >KOG2335 consensus Probab=97.36 E-value=0.0029 Score=42.87 Aligned_cols=157 Identities=23% Similarity=0.280 Sum_probs=102.1 Q ss_pred CHHHHHHHHHHHCCCCEEEEEC-------CCC--------HHHHHHHHHHHHHHHCCCCCE-----ECC-CCCCHHHHHH Q ss_conf 0156789998731366699824-------841--------138899999999985798724-----514-6469999999 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDT-------AHG--------HSQKVLDAVVQIKKNFPSLLV-----MAG-NIATAEGALA 290 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~-------ahG--------h~~~~~~~i~~~k~~~~~~~i-----i~G-Nv~t~~~~~~ 290 (493) .+....|..|++..+|.|-+.. -+| .-+.+-++++.++..++. +| |.. .=-|.+=++- T Consensus 85 p~~ll~Aa~lv~~y~D~idlNcGCPq~~a~~g~yGa~L~~~~eLv~e~V~~v~~~l~~-pVs~KIRI~~d~~kTvd~ak~ 163 (358) T KOG2335 85 PENLLKAARLVQPYCDGIDLNCGCPQKVAKRGGYGAFLMDNPELVGEMVSAVRANLNV-PVSVKIRIFVDLEKTVDYAKM 163 (358) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCHHHHHCCCCCCEECCCHHHHHHHHHHHHHHCCC-CEEEEEEECCCHHHHHHHHHH T ss_conf 8999999998653347204158998788843772600023889999999999852599-869999855767878999999 Q ss_pred HHHCCCCEEEEEECCCCCCCCCHHCC--CCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEE-ECCCEEEECC- Q ss_conf 98548758996102787665310001--5743100378899986226981997265437776213367-1487899323- Q gi|254780889|r 291 LIDAGADIIKVGIGPGSICTTRVVTG--VGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIA-AGSACVMIGS- 366 (493) Q Consensus 291 l~~~Gad~ikvGig~Gs~CtTr~~~g--~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla-~GA~~VM~G~- 366 (493) +.+||++-+-|- | -||..-| .+.--..+++...+...+ +|+||-|+|.+..|+--.+- -|||.||.|. T Consensus 164 ~e~aG~~~ltVH---G---Rtr~~kg~~~~pad~~~i~~v~~~~~~--ipviaNGnI~~~~d~~~~~~~tG~dGVM~arg 235 (358) T KOG2335 164 LEDAGVSLLTVH---G---RTREQKGLKTGPADWEAIKAVRENVPD--IPVIANGNILSLEDVERCLKYTGADGVMSARG 235 (358) T ss_pred HHHCCCCEEEEE---C---CCHHHCCCCCCCCCHHHHHHHHHHCCC--CCEEEECCCCCHHHHHHHHHHHCCCEEEECCH T ss_conf 986798689993---6---557762888887677999999974767--70895088576899999999758746886000 Q ss_pred ------HH--HCCCCCCCCEEEECCEEEEEEECCCCHHH Q ss_conf ------01--01216776448736805888733200688 Q gi|254780889|r 367 ------LL--AGTDESPGDIFLYQGRSFKSYRGMGSVAA 397 (493) Q Consensus 367 ------~~--agt~Espg~~~~~~g~~~K~yrGm~S~~A 397 (493) +| +|..+.|.+.+.......++|.|-.|..+ T Consensus 236 lL~NPa~F~~~~~~~~~~~~~~~~l~~~~e~~g~~~~~~ 274 (358) T KOG2335 236 LLYNPALFLTAGYGPTPWGCVEEYLDIAREFGGLSSFSL 274 (358) T ss_pred HHCCCHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHH T ss_conf 003841202688777878899999999997279723667 No 281 >COG3620 Predicted transcriptional regulator with C-terminal CBS domains [Transcription] Probab=97.36 E-value=0.00012 Score=52.84 Aligned_cols=56 Identities=23% Similarity=0.408 Sum_probs=51.4 Q ss_pred CCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCC Q ss_conf 244002678632573035899999998655330645747997868898021000101 Q gi|254780889|r 154 AVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 154 ~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) ..+.+|+++++.+.+++++.++..+|.+++++.+||+++ ++++|-||-.||.+... T Consensus 66 ta~~iM~spvv~v~pdDsi~~vv~lM~~~g~SQlPVi~~-~k~VGsItE~~iv~~~l 121 (187) T COG3620 66 TAKTIMHSPVVSVSPDDSISDVVNLMRDKGISQLPVIEE-DKVVGSITENDIVRALL 121 (187) T ss_pred EHHHHCCCCEEEECCHHHHHHHHHHHHHCCCCCCCEEEC-CEEEEEECHHHHHHHHH T ss_conf 576653587058772466999999999759752754108-86453314989999973 No 282 >KOG2550 consensus Probab=97.34 E-value=0.0036 Score=42.20 Aligned_cols=232 Identities=21% Similarity=0.301 Sum_probs=129.8 Q ss_pred ECCCEEEECCCCCCCHHHHH--------HHHH---CCC--EEEECCCCCHHHHHHHH-HHHHHHCCCEECCCEEEECCCC Q ss_conf 00317833785236888999--------9997---798--79997898989999999-9874223881528789946543 Q gi|254780889|r 45 LNLPIMSAAMDQVTDSRLAI--------AMAQ---AGG--LGVIHRNFSPSEQVAQV-HQVKKFESGMVVNPVTISPYAT 110 (493) Q Consensus 45 l~iPivSs~MDTVTe~~MAi--------amA~---~GG--iGvIHrn~sie~qa~~V-~kVKr~e~gmI~dPVti~pd~T 110 (493) .+-|+|-||-.||.+..=|. .+.. .|+ +|+|- ..+.+-+ ...+-+...|..+|++.+...+ T Consensus 115 ~~dp~v~sp~~tvg~v~~~k~~~gF~g~pvTe~g~~~~KLvG~vt-----srdi~f~~~~~~~~~~vmt~~~~~~~~gi~ 189 (503) T KOG2550 115 INNPIVISPTTTVGEVKEAKEKHGFSGIPVTEDGKRGSKLVGIIT-----SRDIQFLEDNSLLVSDVMTKNPVTGAQGIT 189 (503) T ss_pred CCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEE-----HHHHHHHHCCCCHHHHHHCCCCCCCCCCCC T ss_conf 568620377543105565206566335421157754651577774-----202345534551124330344301444466 Q ss_pred HHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCC-CCCCCCCC---CCCCCHHHHHHHHHHCCCCE Q ss_conf 034565667605041678668860278872354500143446724400-26786325---73035899999998655330 Q gi|254780889|r 111 LADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGEL-MTRNLITV---KKTVNLENAKALLHQHRIEK 186 (493) Q Consensus 111 I~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~ei-Mt~~litv---~~~~~l~eA~~iM~~~ki~~ 186 (493) +.++-+++.+.+-+-+|||+++ +.|+-+++..|+.-.+ +-|...- -++.+.+. ...+.=.+-++++.+..... T Consensus 190 l~~~neiL~~~kkGkl~iv~~~-gelva~~~rtDl~k~~--~yPlask~~~kqll~gAaiGTre~dK~rl~ll~~aGvdv 266 (503) T KOG2550 190 LKEANEILKKIKKGKLPVVDDK-GELVAMLSRTDLMKNR--DYPLASKDSTKQLLCGAAIGTRDDDKERLDLLVQAGVDV 266 (503) T ss_pred HHHHHHHHHHHHCCCCCEECCC-CCEEEEEEHHHHHHHC--CCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHCCCCE T ss_conf 7788999876314865234367-7623343334566502--787555675413565101366630167788866348868 Q ss_pred EEEECCCCCEEEEEEECCCHH--CCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEE------------ Q ss_conf 645747997868898021000--101742232656876555665137015678999873136669982------------ Q gi|254780889|r 187 LLVVDDDGCCIGLITVKDIER--SQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVD------------ 252 (493) Q Consensus 187 LPVVDe~g~LvGIIT~~DIlk--~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviviD------------ 252 (493) +.+=..+|.-+--+. .+| .+.||.-. .+|. +--..+-++.|+.||||-|-|- T Consensus 267 viLDSSqGnS~~qie---mik~iK~~yP~l~-------ViaG----NVVT~~qa~nLI~aGaDgLrVGMGsGSiCiTqev 332 (503) T KOG2550 267 VILDSSQGNSIYQLE---MIKYIKETYPDLQ-------IIAG----NVVTKEQAANLIAAGADGLRVGMGSGSICITQKV 332 (503) T ss_pred EEEECCCCCCHHHHH---HHHHHHHHCCCCE-------EECC----CEEEHHHHHHHHHCCCCEEEECCCCCCEEEECEE T ss_conf 999668885045799---9999986688863-------4316----5533888999987367605752556750545301 Q ss_pred CCCCHH--HHHHHHHHHHHHHCCCCCEEC-CCCCCHHHHHHHHHCCCCEEE Q ss_conf 484113--889999999998579872451-464699999999854875899 Q gi|254780889|r 253 TAHGHS--QKVLDAVVQIKKNFPSLLVMA-GNIATAEGALALIDAGADIIK 300 (493) Q Consensus 253 ~ahGh~--~~~~~~i~~~k~~~~~~~ii~-GNv~t~~~~~~l~~~Gad~ik 300 (493) .|-|.- ..+.+... ....| .+|+|+ |.+-+..-...-...||+.|- T Consensus 333 ma~GrpQ~TAVy~va~-~A~q~-gvpviADGGi~~~Ghi~KAl~lGAstVM 381 (503) T KOG2550 333 MACGRPQGTAVYKVAE-FANQF-GVPCIADGGIQNVGHVVKALGLGASTVM 381 (503) T ss_pred EECCCCCCCCHHHHHH-HHHHC-CCCEECCCCCCCCHHHHHHHHCCCHHHH T ss_conf 2326776200326999-99764-9965506875873177888753850631 No 283 >COG0107 HisF Imidazoleglycerol-phosphate synthase [Amino acid transport and metabolism] Probab=97.34 E-value=0.0015 Score=44.91 Aligned_cols=37 Identities=19% Similarity=0.425 Sum_probs=27.4 Q ss_pred HCCCCEEEEECCCCCHHHHEEEEEEC-CCEEEECCHHH Q ss_conf 22698199726543777621336714-87899323010 Q gi|254780889|r 333 ERAGVAIVADGGIRFSGDIAKAIAAG-SACVMIGSLLA 369 (493) Q Consensus 333 ~~~~~~iiadGGi~~~gdi~kAla~G-A~~VM~G~~~a 369 (493) ....+|+||-||.-+..|+.-|+..| ||++...|+|- T Consensus 196 ~~v~iPvIASGGaG~~ehf~eaf~~~~adAaLAAsiFH 233 (256) T COG0107 196 EAVNIPVIASGGAGKPEHFVEAFTEGKADAALAASIFH 233 (256) T ss_pred HHCCCCEEECCCCCCHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 64887889118989688999999815700887644331 No 284 >cd02911 arch_FMN Archeal FMN-binding domain. This family of archaeal proteins are part of the NAD(P)H-dependent flavin oxidoreductase (oxidored) FMN-binding family that reduce a range of alternative electron acceptors. Most use FAD/FMN as a cofactor and NAD(P)H as electron donor. Some contain 4Fe-4S cluster to transfer electron from FAD to FMN. The specific function of this group is unknown. Probab=97.33 E-value=0.0024 Score=43.45 Aligned_cols=121 Identities=19% Similarity=0.291 Sum_probs=68.6 Q ss_pred ECCCHHHHHHHHHHHCCCCEEEEE--------CCCC-------HHHHHHHHHHHHHHHCCCCCE----ECC-CCCCHHHH Q ss_conf 137015678999873136669982--------4841-------138899999999985798724----514-64699999 Q gi|254780889|r 229 SVAKDIADRVGPLFDVNVDLVVVD--------TAHG-------HSQKVLDAVVQIKKNFPSLLV----MAG-NIATAEGA 288 (493) Q Consensus 229 g~~~d~~eRa~~LveaGvDviviD--------~ahG-------h~~~~~~~i~~~k~~~~~~~i----i~G-Nv~t~~~~ 288 (493) |...+...++..+++...|+|=|. +.+| +-..+.+.++.+|+. ++|| =.| +.-+.+-+ T Consensus 81 g~~~e~~~~Aa~~~~~~~d~IDiN~GCP~~kV~~~g~GsaLl~dp~~~~~iv~avk~~--~~PVtvKiR~G~d~~~~~~a 158 (233) T cd02911 81 SSSLEPLLNAAALVAKNAAILEINAHCRQPEMVEAGAGEALLKDPERLSEFIKALKET--GVPVSVKIRAGVDVDDEELA 158 (233) T ss_pred CCCHHHHHHHHHHHHCCCCEEEEECCCCCHHHCCCCCHHHHCCCHHHHHHHHHHHHHH--CCCEEEEEECCCCCCHHHHH T ss_conf 6999999999999743699999979999289837975377738989999999999853--89842798569998889999 Q ss_pred HHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECC Q ss_conf 999854875899610278766531000157431003788999862269819972654377762133671487899323 Q gi|254780889|r 289 LALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGS 366 (493) Q Consensus 289 ~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~ 366 (493) +.+.++|+|++-|- ++.. + |..-...+. +. ...+|||+.|+|.+.-|.-+.|..|+|.||+|. T Consensus 159 ~~~e~aG~~~l~v~--------~~~~-~-~~ad~~~I~---~~--~~~i~VigNGDI~s~eda~~~~~~G~DgVMIgR 221 (233) T cd02911 159 RLIEKAGADIIHVD--------AMDP-G-NHADLKKIR---DI--STELFIIGNNSVTTIESAKEMFSYGADMVSVAR 221 (233) T ss_pred HHHHHHCCCEEEEE--------CCCC-C-CHHHHHHHH---HH--CCCCEEEEECCCCCHHHHHHHHHHCCCEEEECH T ss_conf 99998396079943--------2077-8-508999999---86--379879980896999999999985999999738 No 285 >PRK09426 methylmalonyl-CoA mutase; Reviewed Probab=97.31 E-value=0.0021 Score=43.91 Aligned_cols=12 Identities=17% Similarity=0.459 Sum_probs=5.2 Q ss_pred CCHHEEECCCCC Q ss_conf 600057527643 Q gi|254780889|r 15 TFDDVLLRPEFS 26 (493) Q Consensus 15 TfDDVllvP~~s 26 (493) |||.+-+-|-|+ T Consensus 30 T~eGi~ikPlYt 41 (715) T PRK09426 30 TPEGIDVKPLYT 41 (715) T ss_pred CCCCCEECCCCC T ss_conf 888884057747 No 286 >TIGR01037 pyrD_sub1_fam dihydroorotate dehydrogenase family protein; InterPro: IPR005720 Dihydroorotate dehydrogenase (1.3.3.1 from EC) (DHOdehase) catalyzes the fourth step in the de novo biosynthesis of pyrimidine, the conversion of dihydroorotate into orotate. DHOdehase is a ubiquitous FAD flavoprotein. In bacteria (gene pyrD), DHOdease is located on the inner side of the cytosolic membrane. In some yeasts, such as in Saccharomyces cerevisiae (gene URA1, subfamily 2), it is a cytosolic protein while in other eukaryotes it is found in the mitochondria . This describes dihydroorotate dehydrogenases subfamily 1 that includes a number of uncharacterised proteins and a domain of dihydropyrimidine dehydrogenase.; GO: 0004158 dihydroorotate oxidase activity, 0006207 'de novo' pyrimidine base biosynthetic process, 0005737 cytoplasm. Probab=97.31 E-value=0.00027 Score=50.25 Aligned_cols=104 Identities=27% Similarity=0.386 Sum_probs=64.9 Q ss_pred HHHHHHHHHHHHCCCCCEEC---CCCCC-HHHHHHHHHCCCCEEEE--EECCCCCCCCCHHCCCCCCCCH---------- Q ss_conf 89999999998579872451---46469-99999998548758996--1027876653100015743100---------- Q gi|254780889|r 260 KVLDAVVQIKKNFPSLLVMA---GNIAT-AEGALALIDAGADIIKV--GIGPGSICTTRVVTGVGCPQLS---------- 323 (493) Q Consensus 260 ~~~~~i~~~k~~~~~~~ii~---GNv~t-~~~~~~l~~~Gad~ikv--Gig~Gs~CtTr~~~g~g~pq~~---------- 323 (493) ...+.++.+|+.. ++||.+ .||.+ -|-|+.+.++|||++-. -+.|| =++--=.++|=|+ T Consensus 148 l~~~vv~avK~~~-d~Pv~aKLsPNV~Di~eiA~a~eeaGaDGlt~INTl~PG----MkIDI~~~kPiLaNk~GGlSGPA 222 (308) T TIGR01037 148 LSADVVKAVKDKV-DVPVFAKLSPNVTDITEIAKAAEEAGADGLTLINTLRPG----MKIDIKAKKPILANKTGGLSGPA 222 (308) T ss_pred HHHHHHHHHHCCC-CCCEEEECCCCHHHHHHHHHHHHHCCCCEEEEEHHHHHH----HHHHHHCCCCCCEECCCCCCCCC T ss_conf 9999999983000-786578648656689999888753277616400120346----77734207870000458850750 Q ss_pred ----HHHHHHHHHHCCC--CEEEEECCCCCHHHHEEEEEECCCEEEECCHH Q ss_conf ----3788999862269--81997265437776213367148789932301 Q gi|254780889|r 324 ----AIMSVVEVAERAG--VAIVADGGIRFSGDIAKAIAAGSACVMIGSLL 368 (493) Q Consensus 324 ----av~~~~~~~~~~~--~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~ 368 (493) ||.-+.+..+..+ +|||-=|||++--|.+-=|-|||++|=+|+-- T Consensus 223 IKPiA~r~VYdly~~~ddriPIiGvGGi~~~eDA~Efl~AGAsAVQvGtAv 273 (308) T TIGR01037 223 IKPIAVRMVYDLYKEVDDRIPIIGVGGITSFEDALEFLMAGASAVQVGTAV 273 (308) T ss_pred CCCEEEEEHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCCEEEEEEEE T ss_conf 142212100004777378234686327455899999998522022000222 No 287 >pfam01791 DeoC DeoC/LacD family aldolase. This family includes diverse aldolase enzymes. This family includes the enzyme deoxyribose-phosphate aldolase EC:4.1.2.4, which is involved in nucleotide metabolism. The family also includes a group of related bacterial proteins of unknown function. The family also includes tagatose 1,6-diphosphate aldolase (EC:4.1.2.40) is part of the tagatose-6-phosphate pathway of galactose-6-phosphate degradation. Probab=97.28 E-value=0.0036 Score=42.18 Aligned_cols=114 Identities=25% Similarity=0.307 Sum_probs=75.3 Q ss_pred HHHHHHCCCCEE--EEEC---CCCHHHHHHHHHHHHHHHCC--CCCEECC---------CCCC----HHHHHHHHHCCCC Q ss_conf 999873136669--9824---84113889999999998579--8724514---------6469----9999999854875 Q gi|254780889|r 238 VGPLFDVNVDLV--VVDT---AHGHSQKVLDAVVQIKKNFP--SLLVMAG---------NIAT----AEGALALIDAGAD 297 (493) Q Consensus 238 a~~LveaGvDvi--viD~---ahGh~~~~~~~i~~~k~~~~--~~~ii~G---------Nv~t----~~~~~~l~~~Gad 297 (493) ++..++.|+|.| ++.. -.|+...+++-++.+.+.-. .+++|.= +-.+ ..+++--+++||| T Consensus 80 v~~Ai~~GAdeid~vi~~~~~~~~~~~~~~~~i~~v~~~~~~~~lkvIiE~~~~~~~~~~~~~~~~i~~a~ria~e~GaD 159 (231) T pfam01791 80 AEEAIALGADEVDMVVNIGADGSEDEEQVLAEIAAVKEACHGEGLKVILEAYLRGEEIKDKVDPHLVAKATRVGAEAGAD 159 (231) T ss_pred HHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEEEECCCHHCCCCCHHHHHHHHHHHHHHCCC T ss_conf 99999879988999972465789559999999999998631048708999851572100326899999999999995999 Q ss_pred EEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCC------CCHHHHEEEEEECCCE Q ss_conf 899610278766531000157431003788999862269819972654------3777621336714878 Q gi|254780889|r 298 IIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGI------RFSGDIAKAIAAGSAC 361 (493) Q Consensus 298 ~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi------~~~gdi~kAla~GA~~ 361 (493) .||+.-|-+...+ +...+....+......+||.+-||| |+--+...++.+||+. T Consensus 160 ~vKtstg~~~~ga----------t~~~v~~~~~~~~~~~~~Vk~sGGi~~~~~~~~l~~a~~~i~aGA~~ 219 (231) T pfam01791 160 FVKTSTGFGERGA----------TEEDVQIFKEVVGGAPPGVKASGGIKEKDFLRTVEDAIALIEAGADR 219 (231) T ss_pred EEEECCCCCCCCC----------CHHHHHHHHHHHCCCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCCH T ss_conf 8981578788877----------88899999998568787489933868643789999999999869981 No 288 >COG4109 Predicted transcriptional regulator containing CBS domains [Transcription] Probab=97.27 E-value=0.00041 Score=48.96 Aligned_cols=95 Identities=21% Similarity=0.329 Sum_probs=70.3 Q ss_pred CCCCCCCC--CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCC---CCCCEEEEEE Q ss_conf 72440026--786325730358999999986553306457479978688980210001017422326---5687655566 Q gi|254780889|r 153 QAVGELMT--RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKD---SKGRLRVAAA 227 (493) Q Consensus 153 ~~V~eiMt--~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D---~~grL~VgAA 227 (493) ..|.++|+ .+..+..+++.+++..++-.+.+..+.||||+..+++|++|.+|+...... .+.+ .+.-+.|++- T Consensus 188 ~~Vedi~~P~~~~~yL~~~d~v~d~~~l~~kt~~sRfPVvn~~~kvvGvVt~rDv~~~~~~--t~ieKVMtknp~tv~~~ 265 (432) T COG4109 188 ITVEDIMTPLEDTSYLRETDTVEDWLDLVEKTGHSRFPVVNRSMKVVGVVTMRDVLDKKPS--TTIEKVMTKNPITVRAK 265 (432) T ss_pred EEHHHHCCCCCCCEECCCCCCHHHHHHHHHHCCCCCCCEECCCCEEEEEEEEHHHHCCCCC--CCHHHHHCCCCEEECCC T ss_conf 5587744644343232464509999999997198734133144269999983464447998--61777742697554252 Q ss_pred EECCCHHHHHHHHHHHCCCCEE-EEEC Q ss_conf 5137015678999873136669-9824 Q gi|254780889|r 228 VSVAKDIADRVGPLFDVNVDLV-VVDT 253 (493) Q Consensus 228 Ig~~~d~~eRa~~LveaGvDvi-viD~ 253 (493) -+ ...-+..++-.|.+++ |+|. T Consensus 266 ts----VAsvaq~MiwE~iem~PVv~~ 288 (432) T COG4109 266 TS----VASVAQMMIWEGIEMLPVVDS 288 (432) T ss_pred CH----HHHHHHHHHHCCCEEEEEECC T ss_conf 26----889988887645526568837 No 289 >cd04738 DHOD_2_like Dihydroorotate dehydrogenase (DHOD) class 2. DHOD catalyzes the oxidation of (S)-dihydroorotate to orotate. This is the fourth step and the only redox reaction in the de novo biosynthesis of UMP, the precursor of all pyrimidine nucleotides. DHOD requires FMN as co-factor. DHOD divides into class 1 and class 2 based on their amino acid sequences, their cellular location and their natural electron acceptor used to reoxidize the flavin group. Members of class 1 are cytosolic enzymes and multimers, while class 2 enzymes are membrane associated, monomeric and use respiratory quinones as their physiological electron acceptors. Probab=97.27 E-value=0.003 Score=42.75 Aligned_cols=130 Identities=21% Similarity=0.246 Sum_probs=75.0 Q ss_pred CHHHHHHHHHHHCCCCEEEEECCC----C-----HHHHHHHHHHHHHHHC----CCCCEEC---CCCCC---HHHHHHHH Q ss_conf 015678999873136669982484----1-----1388999999999857----9872451---46469---99999998 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTAH----G-----HSQKVLDAVVQIKKNF----PSLLVMA---GNIAT---AEGALALI 292 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~ah----G-----h~~~~~~~i~~~k~~~----~~~~ii~---GNv~t---~~~~~~l~ 292 (493) +|+.+.+..| ..-+|.++|.++- | ..+.+.+.++.+++.. ..+||++ .|+.. .+.++.+. T Consensus 148 ~Dy~~~~~~l-~~~aDy~~iNiSsPNt~glr~lq~~~~l~~ll~~v~~~~~~~~~~~Pi~vKlsPD~~~~~i~~i~~~~~ 226 (327) T cd04738 148 EDYVIGVRKL-GPYADYLVVNVSSPNTPGLRDLQGKEALRELLTAVKEERNKLGKKVPLLVKIAPDLSDEELEDIADVAL 226 (327) T ss_pred HHHHHHHHHH-CCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHHHHH T ss_conf 9999999985-355778999546889845100268899999999999999853778866998179976678999999999 Q ss_pred HCCCCEEEEEECCCCCCCCCHH-------CC------CCCCCCHHHHH-HHHHHHCC--CCEEEEECCCCCHHHHEEEEE Q ss_conf 5487589961027876653100-------01------57431003788-99986226--981997265437776213367 Q gi|254780889|r 293 DAGADIIKVGIGPGSICTTRVV-------TG------VGCPQLSAIMS-VVEVAERA--GVAIVADGGIRFSGDIAKAIA 356 (493) Q Consensus 293 ~~Gad~ikvGig~Gs~CtTr~~-------~g------~g~pq~~av~~-~~~~~~~~--~~~iiadGGi~~~gdi~kAla 356 (493) +.|+|+|-+.= ||-.. -. .|.|-...-.+ +..+.+.. .+|||+-|||.+..|+..-|. T Consensus 227 ~~g~dGvi~tN------Tt~~r~~~~~~~~~~~~GGlSG~pl~~~s~~~v~~v~~~~~~~~pIIgvGGI~s~~Da~e~i~ 300 (327) T cd04738 227 EHGVDGIIATN------TTISRPGLLRSPLANETGGLSGAPLKERSTEVLRELYKLTGGKIPIIGVGGISSGEDAYEKIR 300 (327) T ss_pred HCCCCEEEEEC------CCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCHHHHHHHHH T ss_conf 73997899958------855542124565556668636406789999999999997499981999889799999999998 Q ss_pred ECCCEEEECCHH Q ss_conf 148789932301 Q gi|254780889|r 357 AGSACVMIGSLL 368 (493) Q Consensus 357 ~GA~~VM~G~~~ 368 (493) ||||.|++++-| T Consensus 301 aGAslVQiyT~l 312 (327) T cd04738 301 AGASLVQLYTGL 312 (327) T ss_pred CCCCHHHHHHHH T ss_conf 699699876898 No 290 >PRK06552 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional Probab=97.26 E-value=0.0066 Score=40.29 Aligned_cols=122 Identities=22% Similarity=0.290 Sum_probs=83.0 Q ss_pred CCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEE Q ss_conf 87655566513701567899987313666998248411388999999999857987245146469999999985487589 Q gi|254780889|r 220 GRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADII 299 (493) Q Consensus 220 grL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~i 299 (493) .++.+||.-=. +.+-++..+++|++++|- -|.+..+++..+.. +++.+ ..+.|+.-....+++|+|.+ T Consensus 66 p~~~iGaGTV~---~~e~~~~a~~aGA~FiVS---P~~~~~v~~~a~~~-----~i~~i-PG~~TpsEi~~A~~~Ga~~v 133 (209) T PRK06552 66 PEVLIGAGTVL---DAVTARQAILAGAQFIVS---PSFNRETAKICNRY-----QIPYL-PGCMTVTEIVTALEAGVDIV 133 (209) T ss_pred CCEEEEEECCC---CHHHHHHHHHCCCCEEEC---CCCCHHHHHHHHHC-----CCCEE-CCCCCHHHHHHHHHCCCCEE T ss_conf 98189887274---899999999859988976---99989999999985-----99641-79799999999998699958 Q ss_pred EEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHH Q ss_conf 9610278766531000157431003788999862269819972654377762133671487899323010 Q gi|254780889|r 300 KVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLA 369 (493) Q Consensus 300 kvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~a 369 (493) |+ =|.+.. |..-+.++..- -.++++++-|||. ..++..-|.+|+.+|=+||-|- T Consensus 134 Kl--FPA~~~--------G~~yikal~~p-----~p~~~~~ptGGV~-~~N~~~~l~aG~~~vgvGs~l~ 187 (209) T PRK06552 134 KL--FPGSTV--------GPSFISAIKGP-----LPQVNIMVTGGVS-LDNVKDWFAAGADAVGIGGELN 187 (209) T ss_pred EE--CCHHHC--------CHHHHHHHHCC-----CCCCCEEECCCCC-HHHHHHHHHCCCCEEEECHHHC T ss_conf 85--833324--------89999998664-----8999288638999-8889999987998899865770 No 291 >pfam01081 Aldolase KDPG and KHG aldolase. This family includes the following members: 4-hydroxy-2-oxoglutarate aldolase (KHG-aldolase) Phospho-2-dehydro-3-deoxygluconate aldolase (KDPG-aldolase) Probab=97.24 E-value=0.0049 Score=41.21 Aligned_cols=123 Identities=20% Similarity=0.302 Sum_probs=81.4 Q ss_pred CEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEE Q ss_conf 76555665137015678999873136669982484113889999999998579872451464699999999854875899 Q gi|254780889|r 221 RLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIK 300 (493) Q Consensus 221 rL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ik 300 (493) .+++||.-=+ +.+-++..+++|+++++- -+.+..+++..+.. +++.+ ..|.|+.-+...+++|++.+| T Consensus 59 ~~~iGaGTV~---~~e~~~~a~~aGA~FivS---P~~~~~v~~~a~~~-----~i~~i-PGv~TpsEi~~A~~~G~~~vK 126 (196) T pfam01081 59 DALVGAGTVL---NAQQLAEAAEAGAQFVVS---PGLTADLLKHAVDV-----KIPLI-PGVSTPSEIMLGLDLGLTRFK 126 (196) T ss_pred CCEEEEEECC---CHHHHHHHHHCCCCEEEC---CCCHHHHHHHHHHC-----CCCEE-CCCCCHHHHHHHHHCCCCEEE T ss_conf 9679998376---899999999749999997---87639999999973-----99663-785999999999987999899 Q ss_pred EEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHC Q ss_conf 6102787665310001574310037889998622698199726543777621336714878993230101 Q gi|254780889|r 301 VGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAG 370 (493) Q Consensus 301 vGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~ag 370 (493) += |.+.. | |...+.++.. . -.+++.++-|||.-. ++..-|.+|+.+|-+||.|.- T Consensus 127 lF--PA~~~------G-g~~~lkal~~---p--~p~~~f~ptGGv~~~-N~~~yl~~g~v~~~~GS~l~~ 181 (196) T pfam01081 127 FF--PAEAS------G-GVPAIKALAG---P--FPQVRFCPTGGIHPA-NVRDYLALPNILCVGGSWLVP 181 (196) T ss_pred EC--CCHHC------C-CHHHHHHHHC---C--CCCCEEEEECCCCHH-HHHHHHHCCCEEEEECHHHCC T ss_conf 78--73101------8-4999999857---7--999869980798988-899999689869998936489 No 292 >COG0106 HisA Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase [Amino acid transport and metabolism] Probab=97.24 E-value=0.011 Score=38.67 Aligned_cols=176 Identities=23% Similarity=0.282 Sum_probs=113.0 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEE Q ss_conf 35899999998655330645747997868898021000101742232656876555665137015678999873136669 Q gi|254780889|r 170 VNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLV 249 (493) Q Consensus 170 ~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDvi 249 (493) .+..+..+.+.+.....+-+||=+|-..|---..++++... +-....+-||.- ++ +.+.++.|.++|++-+ T Consensus 31 ~~P~~~a~~~~~~Ga~~lHlVDLdgA~~g~~~n~~~i~~i~-----~~~~~~vQvGGG--IR--s~~~v~~ll~~G~~rV 101 (241) T COG0106 31 DDPLEVAKKWSDQGAEWLHLVDLDGAKAGGPRNLEAIKEIL-----EATDVPVQVGGG--IR--SLEDVEALLDAGVARV 101 (241) T ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHHHHH-----HHCCCCEEEECC--CC--CHHHHHHHHHCCCCEE T ss_conf 99899999999809958988626632158755499999999-----857997784087--67--8999999998799889 Q ss_pred EEECCCCHHHHHHHHHHHHHHHCCCCCEE-----CCCCCC-----------HHHHHHHHHCCCCEEEEEECCCCCCCCCH Q ss_conf 98248411388999999999857987245-----146469-----------99999998548758996102787665310 Q gi|254780889|r 250 VVDTAHGHSQKVLDAVVQIKKNFPSLLVM-----AGNIAT-----------AEGALALIDAGADIIKVGIGPGSICTTRV 313 (493) Q Consensus 250 viD~ahGh~~~~~~~i~~~k~~~~~~~ii-----~GNv~t-----------~~~~~~l~~~Gad~ikvGig~Gs~CtTr~ 313 (493) ++-++- ...-+.++.+.+.|++-.++ -|-+++ -+-++.+.+.|+..+ +||-=. T Consensus 102 iiGt~a---v~~p~~v~~~~~~~g~rivv~lD~r~g~vav~GW~e~s~~~~~~l~~~~~~~g~~~i--------i~TdI~ 170 (241) T COG0106 102 IIGTAA---VKNPDLVKELCEEYGDRIVVALDARDGKVAVSGWQEDSGVELEELAKRLEEVGLAHI--------LYTDIS 170 (241) T ss_pred EEECCE---ECCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHCCCCCHHHHHHHHHHCCCCEE--------EEEECC T ss_conf 980312---169999999999859828999971488532046101256789999999985787769--------998514 Q ss_pred HCCCC-CCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEE-CCCEEEECCHH Q ss_conf 00157-431003788999862269819972654377762133671-48789932301 Q gi|254780889|r 314 VTGVG-CPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAA-GSACVMIGSLL 368 (493) Q Consensus 314 ~~g~g-~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~-GA~~VM~G~~~ 368 (493) +-|.- =|-+..+.+.++ ..++|+||-||++.--||-.+-.. |..-|-+|+.| T Consensus 171 ~DGtl~G~n~~l~~~l~~---~~~ipviaSGGv~s~~Di~~l~~~~G~~GvIvG~AL 224 (241) T COG0106 171 RDGTLSGPNVDLVKELAE---AVDIPVIASGGVSSLDDIKALKELSGVEGVIVGRAL 224 (241) T ss_pred CCCCCCCCCHHHHHHHHH---HHCCCEEEECCCCCHHHHHHHHHCCCCCEEEEEHHH T ss_conf 466457778799999999---827678986686879999999855797289986689 No 293 >COG1902 NemA NADH:flavin oxidoreductases, Old Yellow Enzyme family [Energy production and conversion] Probab=97.22 E-value=0.0088 Score=39.37 Aligned_cols=135 Identities=24% Similarity=0.258 Sum_probs=85.4 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCHH------------------------HHHHHHHHHHHHHCCCCC-EEC--------- Q ss_conf 5678999873136669982484113------------------------889999999998579872-451--------- Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVDTAHGHS------------------------QKVLDAVVQIKKNFPSLL-VMA--------- 279 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD~ahGh~------------------------~~~~~~i~~~k~~~~~~~-ii~--------- 279 (493) +.+-|+.-.+||-|.+=|--||||- ...++.++.+|+..+.-. |.. T Consensus 151 f~~AA~rA~~AGFDgVEIH~AhGYLi~qFlsp~tN~RtD~YGGSlENR~Rf~~EVv~aVr~~vg~~~~vg~Rls~~d~~~ 230 (363) T COG1902 151 FARAARRAKEAGFDGVEIHGAHGYLLSQFLSPLTNKRTDEYGGSLENRARFLLEVVDAVREAVGADFPVGVRLSPDDFFD 230 (363) T ss_pred HHHHHHHHHHCCCCEEEEEECCCHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCC T ss_conf 99999999983999899840444499985587557777766885899988999999999997298866999977454677 Q ss_pred --CCC--CCHHHHHHHHHCC-CCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEE Q ss_conf --464--6999999998548-75899610278766531000157431003788999862269819972654377762133 Q gi|254780889|r 280 --GNI--ATAEGALALIDAG-ADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKA 354 (493) Q Consensus 280 --GNv--~t~~~~~~l~~~G-ad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kA 354 (493) |.- ...+.++.|.+.| .|.+-|--|.-..--+....+ .-+|+. .........++|+|+-|+|+.....-.+ T Consensus 231 ~~g~~~~e~~~la~~L~~~G~~d~i~vs~~~~~~~~~~~~~~-~~~~~~---~a~~i~~~~~~pvi~~G~i~~~~~Ae~~ 306 (363) T COG1902 231 GGGLTIEEAVELAKALEEAGLVDYIHVSEGGYERGGTITVSG-PGYQVE---FAARIKKAVRIPVIAVGGINDPEQAEEI 306 (363) T ss_pred CCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCC-CCHHHH---HHHHHHHHCCCCEEEECCCCCHHHHHHH T ss_conf 888889999999999985588447996036445788744466-412478---9999988607877986897999999999 Q ss_pred EEEC-CCEEEECCHHHCCC Q ss_conf 6714-87899323010121 Q gi|254780889|r 355 IAAG-SACVMIGSLLAGTD 372 (493) Q Consensus 355 la~G-A~~VM~G~~~agt~ 372 (493) |+-| ||.|=+|..|---- T Consensus 307 l~~g~aDlVa~gR~~ladP 325 (363) T COG1902 307 LASGRADLVAMGRPFLADP 325 (363) T ss_pred HHCCCCCEEEECHHHHCCC T ss_conf 9829988887263665092 No 294 >COG0042 tRNA-dihydrouridine synthase [Translation, ribosomal structure and biogenesis] Probab=97.22 E-value=0.016 Score=37.47 Aligned_cols=126 Identities=24% Similarity=0.304 Sum_probs=87.3 Q ss_pred HHHHHHHHHHCCCCEEEEEC-------CC-C-------HHHHHHHHHHHHHHHCCCCCEE----CCC----CCCHHHHHH Q ss_conf 56789998731366699824-------84-1-------1388999999999857987245----146----469999999 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVDT-------AH-G-------HSQKVLDAVVQIKKNFPSLLVM----AGN----IATAEGALA 290 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD~-------ah-G-------h~~~~~~~i~~~k~~~~~~~ii----~GN----v~t~~~~~~ 290 (493) ..+-++.+.+.|+|.|=|.. .+ | ....+.+.++.+++..+++||= .|- ..+.+-++. T Consensus 81 laeaA~~~~~~g~~~IDlN~GCP~~~V~~~g~Ga~Ll~~p~lv~~iv~a~~~av~~iPVTVKiRlG~d~~~~~~~~ia~~ 160 (323) T COG0042 81 LAEAAKIAEELGADIIDLNCGCPSPKVVKGGAGAALLKNPELLAEIVKAMVEAVGDIPVTVKIRLGWDDDDILALEIARI 160 (323) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCHHHHCCCCCHHHCCCHHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHCCHHHHHHH T ss_conf 99999999866999898768999289808984477717989999999999985388874999857878002009999999 Q ss_pred HHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEE-CCCEEEECCH Q ss_conf 9854875899610278766531000157431003788999862269819972654377762133671-4878993230 Q gi|254780889|r 291 LIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAA-GSACVMIGSL 367 (493) Q Consensus 291 l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~-GA~~VM~G~~ 367 (493) +.++|++.+-|- +-||..-+-|-.-.-.|++..+.... +|||+-|+|++.-|.-.-|.- |+|.||+|.- T Consensus 161 ~~~~G~~~ltVH------gRtr~~~y~~~a~~~~I~~vk~~~~~--ipvi~NGdI~s~~~a~~~l~~tg~DgVMigRg 230 (323) T COG0042 161 LEDAGADALTVH------GRTRAQGYLGPADWDYIKELKEAVPS--IPVIANGDIKSLEDAKEMLEYTGADGVMIGRG 230 (323) T ss_pred HHHCCCCEEEEE------CCCHHHCCCCCCCHHHHHHHHHHCCC--CEEEECCCCCCHHHHHHHHHHHCCCEEEECHH T ss_conf 996798789995------56676468986487999999986799--75985799499999999998418987997435 No 295 >COG3010 NanE Putative N-acetylmannosamine-6-phosphate epimerase [Carbohydrate transport and metabolism] Probab=97.21 E-value=0.015 Score=37.82 Aligned_cols=92 Identities=24% Similarity=0.267 Sum_probs=62.8 Q ss_pred CCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHH-------HHHHHHH Q ss_conf 978688980210001017422326568765556651370156789998731366699824841138-------8999999 Q gi|254780889|r 194 GCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQ-------KVLDAVV 266 (493) Q Consensus 194 g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~-------~~~~~i~ 266 (493) .|..| ++.++++...||+ +|. =|-+++.+|.. ...++|+|++=- +-+||.. .-.+.+| T Consensus 110 ~RP~~--~~~~~i~~~k~~~-------~l~-MAD~St~ee~l----~a~~~G~D~IGT-TLsGYT~~~~~~~~pDf~lvk 174 (229) T COG3010 110 PRPDG--DLEELIARIKYPG-------QLA-MADCSTFEEGL----NAHKLGFDIIGT-TLSGYTGYTEKPTEPDFQLVK 174 (229) T ss_pred CCCCC--HHHHHHHHHHCCC-------CEE-EECCCCHHHHH----HHHHCCCCEEEC-CCCCCCCCCCCCCCCCHHHHH T ss_conf 79843--5999999733579-------478-73259888888----899739967822-420146899877897289999 Q ss_pred HHHHHCCCCCEEC-CCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 9998579872451-46469999999985487589961 Q gi|254780889|r 267 QIKKNFPSLLVMA-GNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 267 ~~k~~~~~~~ii~-GNv~t~~~~~~l~~~Gad~ikvG 302 (493) .+.+ ...++|| |+..|++.+..-++.||++|-|| T Consensus 175 ~l~~--~~~~vIAEGr~~tP~~Ak~a~~~Ga~aVvVG 209 (229) T COG3010 175 QLSD--AGCRVIAEGRYNTPEQAKKAIEIGADAVVVG 209 (229) T ss_pred HHHH--CCCEEEEECCCCCHHHHHHHHHHCCEEEEEC T ss_conf 9986--7993995178799999999997188089987 No 296 >PRK05286 dihydroorotate dehydrogenase 2; Reviewed Probab=97.16 E-value=0.0081 Score=39.63 Aligned_cols=136 Identities=19% Similarity=0.224 Sum_probs=74.3 Q ss_pred CHHHHHHHHHHHCCCCEEEEECCCC--------HH-HHHHHHHHHHH---HHC-CCCCE---ECCCCCCH---HHHHHHH Q ss_conf 0156789998731366699824841--------13-88999999999---857-98724---51464699---9999998 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTAHG--------HS-QKVLDAVVQIK---KNF-PSLLV---MAGNIATA---EGALALI 292 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~ahG--------h~-~~~~~~i~~~k---~~~-~~~~i---i~GNv~t~---~~~~~l~ 292 (493) +|+.+-++.|.+ .+|.++|.++-= +. ....+.++.++ +.. ..+|| +..++... +.+.... T Consensus 154 ~Dy~~~~~~l~~-~aDy~~INiSsPNT~glr~lq~~~~L~~ll~~v~~~~~~~~~~~PI~vKisPDl~~~~l~~i~~~~~ 232 (336) T PRK05286 154 DDYLICLRKLYP-YADYFTVNISSPNTPGLRDLQAGEALDELLAALKEAQAELGKYVPLLLKIAPDLSDEELDDIADLAL 232 (336) T ss_pred HHHHHHHHHHHC-CCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCHHHHHHHHHHHH T ss_conf 999999999826-3778999756899865200046699999999999999843788864883288888789999999999 Q ss_pred HCCCCEEEEEEC---CCCCCC--CCHHCC-CCCCCCHHHHHH-HHHHHCC--CCEEEEECCCCCHHHHEEEEEECCCEEE Q ss_conf 548758996102---787665--310001-574310037889-9986226--9819972654377762133671487899 Q gi|254780889|r 293 DAGADIIKVGIG---PGSICT--TRVVTG-VGCPQLSAIMSV-VEVAERA--GVAIVADGGIRFSGDIAKAIAAGSACVM 363 (493) Q Consensus 293 ~~Gad~ikvGig---~Gs~Ct--Tr~~~g-~g~pq~~av~~~-~~~~~~~--~~~iiadGGi~~~gdi~kAla~GA~~VM 363 (493) +.|+|++-+.=- --+.+. ....-| .|.|-..--.+| ..+.+.. .+|||+-|||.+..|+..-|.||||.|+ T Consensus 233 ~~~idGii~tNTt~~r~~l~~~~~~~~GGLSG~pl~~~s~~~v~~~~~~~~~~~pIIgvGGI~s~~da~~~i~aGAslVQ 312 (336) T PRK05286 233 EHGIDGIIATNTTLDRSGLEGPNAEEAGGLSGRPLFERSTEVIRRLYKELGGRLPIIGVGGIDSAEDAYEKIRAGASLVQ 312 (336) T ss_pred HHCCCEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCHHHHHHHHHCCCCHHH T ss_conf 81986899958867664456655566687464067899999999999973999709998998999999999986996887 Q ss_pred ECCHH Q ss_conf 32301 Q gi|254780889|r 364 IGSLL 368 (493) Q Consensus 364 ~G~~~ 368 (493) +.+-| T Consensus 313 lyTgl 317 (336) T PRK05286 313 IYSGL 317 (336) T ss_pred HHHHH T ss_conf 41678 No 297 >PRK10605 N-ethylmaleimide reductase; Provisional Probab=97.16 E-value=0.0087 Score=39.42 Aligned_cols=131 Identities=18% Similarity=0.149 Sum_probs=83.3 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCH------------------------HHHHHHHHHHHHHHCCCCCEEC--------- Q ss_conf 1567899987313666998248411------------------------3889999999998579872451--------- Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVVDTAHGH------------------------SQKVLDAVVQIKKNFPSLLVMA--------- 279 (493) Q Consensus 233 d~~eRa~~LveaGvDviviD~ahGh------------------------~~~~~~~i~~~k~~~~~~~ii~--------- 279 (493) ++.+-|....+||.|.|-|-.|||| ....+|.++.+|+..+.-.|.. T Consensus 160 ~f~~AA~rA~~AGfDgVEIH~aHGYLl~qFLSp~tN~RtDeYGGSleNR~Rf~~Eii~aVr~avg~d~v~vRis~~~~~~ 239 (362) T PRK10605 160 DFRQAIANAREAGFDLVELHSAHGYLLHQFLSPSSNHRTDQYGGSVENRARLVLEVVDAGIAEWGADRIGIRISPLGTFN 239 (362) T ss_pred HHHHHHHHHHHCCCCEEEECCCCCHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCC T ss_conf 99999999998399989970247369997389766789887899789998899999999999739873799972665656 Q ss_pred --CCCCC-HH----HHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHE Q ss_conf --46469-99----999998548758996102787665310001574310037889998622698199726543777621 Q gi|254780889|r 280 --GNIAT-AE----GALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIA 352 (493) Q Consensus 280 --GNv~t-~~----~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~ 352 (493) .+-.+ .+ -+..|.++|.|.+-+-... ..-+.|..... +..+.+..++|||+-|++ ++-..- T Consensus 240 ~~~~g~~~~~~~~~~~~~l~~~gv~~l~~s~p~---------~~~~~~~~~~~--~~~ik~~v~~PVi~~G~~-tpe~Ae 307 (362) T PRK10605 240 NVDNGPNEEADALYLIEQLGKRGIAYLHMSEPD---------WAGGEPYSDAF--REKVRARFHGPIIGAGAY-TAEKAE 307 (362) T ss_pred CCCCCCCHHHHHHHHHHHHHHCCCEEEEECCCC---------CCCCCCCCHHH--HHHHHHHCCCCEEECCCC-CHHHHH T ss_conf 665788768999999999986297499972685---------56898751999--999998769978976899-999999 Q ss_pred EEEEEC-CCEEEECCHHHCCCCCC Q ss_conf 336714-87899323010121677 Q gi|254780889|r 353 KAIAAG-SACVMIGSLLAGTDESP 375 (493) Q Consensus 353 kAla~G-A~~VM~G~~~agt~Esp 375 (493) ++|+.| ||.|+||..|=.--+=| T Consensus 308 ~~l~~G~aDlV~~gR~llADPd~~ 331 (362) T PRK10605 308 TLIGKGLIDAVAFGRDYIANPDLV 331 (362) T ss_pred HHHHCCCCCEEEEHHHHHHCCCHH T ss_conf 999889987981006877594589 No 298 >COG1253 TlyC Hemolysins and related proteins containing CBS domains [General function prediction only] Probab=97.16 E-value=0.0027 Score=43.07 Aligned_cols=76 Identities=25% Similarity=0.389 Sum_probs=60.7 Q ss_pred EEEEEHHHHC----CCCCCCCCCCCCCC--CCCCCCCCCCCHHHHHHHHHHCCCCEEEEEC-CCCCEEEEEEECCCHHCC Q ss_conf 8872354500----14344672440026--7863257303589999999865533064574-799786889802100010 Q gi|254780889|r 137 VGILTNRDVR----FASNAQQAVGELMT--RNLITVKKTVNLENAKALLHQHRIEKLLVVD-DDGCCIGLITVKDIERSQ 209 (493) Q Consensus 137 iGIVT~rDir----~~~~~~~~V~eiMt--~~litv~~~~~l~eA~~iM~~~ki~~LPVVD-e~g~LvGIIT~~DIlk~~ 209 (493) .|.+....-+ ..+-.+++|+++|| .+++.++.+.+.+++.+.+.+++..++||.+ ..++++|++..+|+++.. T Consensus 186 ~g~~~~~E~~mi~~v~~l~~~~v~eiMtPR~~i~~l~~~~~~~~~~~~~~~~~~SR~PV~~~~~D~iiGiv~~Kdll~~~ 265 (429) T COG1253 186 EGVLEEEEREMINNVLDLDDRTVREIMTPRTDIVALDLTDTVEELIELILESGHSRIPVYDGDLDNIIGIVHVKDLLRAL 265 (429) T ss_pred CCCCCHHHHHHHHHHHCCCCCCCCEEEEECCCEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEHHHHHHHH T ss_conf 68628899999996502137562617603155799418999999999997589865878857998378886899999764 Q ss_pred CCC Q ss_conf 174 Q gi|254780889|r 210 LNP 212 (493) Q Consensus 210 ~~p 212 (493) ... T Consensus 266 ~~~ 268 (429) T COG1253 266 LDG 268 (429) T ss_pred HCC T ss_conf 315 No 299 >cd04741 DHOD_1A_like Dihydroorotate dehydrogenase (DHOD) class 1A FMN-binding domain. DHOD catalyzes the oxidation of (S)-dihydroorotate to orotate. This is the fourth step and the only redox reaction in the de novo biosynthesis of UMP, the precursor of all pyrimidine nucleotides. DHOD requires FMN as co-factor. DHOD divides into class 1 and class 2 based on their amino acid sequences and cellular location. Members of class 1 are cytosolic enzymes and multimers while class 2 enzymes are membrane associated and monomeric. The class 1 enzymes can be further divided into subtypes 1A and 1B which are homodimers and heterotetrameric proteins, respectively. Probab=97.14 E-value=0.0038 Score=41.97 Aligned_cols=133 Identities=17% Similarity=0.205 Sum_probs=72.2 Q ss_pred HHHHHHHHCCCCEEEEECC--C-------CHH-HHHHHHHHHHHHHCCCCCEE--CCCCCCHHHHHHHH---HCCCC--- Q ss_conf 7899987313666998248--4-------113-88999999999857987245--14646999999998---54875--- Q gi|254780889|r 236 DRVGPLFDVNVDLVVVDTA--H-------GHS-QKVLDAVVQIKKNFPSLLVM--AGNIATAEGALALI---DAGAD--- 297 (493) Q Consensus 236 eRa~~LveaGvDviviD~a--h-------Gh~-~~~~~~i~~~k~~~~~~~ii--~GNv~t~~~~~~l~---~~Gad--- 297 (493) .++....++++|+|.+.++ | ++. +...+.++++|+.. ++||+ .....+.+....++ .++++ T Consensus 110 ~~~~~~~~~~aD~ielNiScPn~~g~~~~~~~~~~~~~~~~~v~~~~-~~Pv~vKlsP~~~~~~~~~~~~~~~~~~~g~~ 188 (294) T cd04741 110 KKIAAHQKQFPLAMELNLSCPNVPGKPPPAYDFDATLEYLTAVKAAY-SIPVGVKTPPYTDPAQFDTLAEALNAFACPIS 188 (294) T ss_pred HHHHHHHCCCCCEEEEEECCCCCCCCHHHCCCHHHHHHHHHHHHHCC-CCCEEEECCCCCCHHHHHHHHHHHHCCCCCEE T ss_conf 99986522556479997037898873100139999999999998415-78559972898887899999999865788747 Q ss_pred EEEE-E-ECCCCCCC-CCHHC----CCCCCCCH-------HHHHHHHHHHC--CCCEEEEECCCCCHHHHEEEEEECCCE Q ss_conf 8996-1-02787665-31000----15743100-------37889998622--698199726543777621336714878 Q gi|254780889|r 298 IIKV-G-IGPGSICT-TRVVT----GVGCPQLS-------AIMSVVEVAER--AGVAIVADGGIRFSGDIAKAIAAGSAC 361 (493) Q Consensus 298 ~ikv-G-ig~Gs~Ct-Tr~~~----g~g~pq~~-------av~~~~~~~~~--~~~~iiadGGi~~~gdi~kAla~GA~~ 361 (493) ++-. . +++|-.-. ++... ..+++-+| ++..+++..+. .++|||+-|||.+.-|..+-|.|||+. T Consensus 189 ~i~~~nt~~~~l~id~~~~~~~~~~~~~~GGlSG~~l~p~al~~v~~~~~~~~~~i~Iig~GGI~s~~da~e~i~aGAs~ 268 (294) T cd04741 189 FITATNTLGNGLVLDPERETVVLKPKTGFGGLAGAYLHPLALGNVRTFRRLLPSEIQIIGVGGVLDGRGAFRMRLAGASA 268 (294) T ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEECCCCCHHHHHHHHHCCCCH T ss_conf 99988036776333577656433455666666785215899999999999749998799989979999999999839979 Q ss_pred EEECCHHH Q ss_conf 99323010 Q gi|254780889|r 362 VMIGSLLA 369 (493) Q Consensus 362 VM~G~~~a 369 (493) |++++-|- T Consensus 269 VQv~Tal~ 276 (294) T cd04741 269 VQVGTALG 276 (294) T ss_pred HHHHHHHH T ss_conf 99979999 No 300 >PTZ00170 D-ribulose-5-phosphate 3-epimerase; Provisional Probab=97.13 E-value=0.015 Score=37.74 Aligned_cols=125 Identities=20% Similarity=0.247 Sum_probs=77.8 Q ss_pred HHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCH-HHHHHHHHC-CCCEEE-EEECCCCCCCC Q ss_conf 678999873136669982484113889999999998579872451464699-999999854-875899-61027876653 Q gi|254780889|r 235 ADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATA-EGALALIDA-GADIIK-VGIGPGSICTT 311 (493) Q Consensus 235 ~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~-~~~~~l~~~-Gad~ik-vGig~Gs~CtT 311 (493) ..-++.++++|+|.|.+-.-. .....+.++++|+.--. .-+|=|-.|+ +....+++. =.|.|. .++-||-. T Consensus 76 ~~~i~~~~~~gad~I~~H~E~--~~~~~~~i~~ik~~g~k-~GlAlnP~T~i~~l~~~l~~~~iD~VLlMsV~PGf~--- 149 (224) T PTZ00170 76 ERWVDSFAKAGASQFTFHIEA--TEDPKAVARKIRAAGMQ-VGVALKPKTPAEELFPLIDAGLVDMVLVMTVEPGFG--- 149 (224) T ss_pred HHHHHHHHHCCCCEEEECCCC--CCCHHHHHHHHHHHCCC-EEEEECCCCCHHHHHHHHHHCCCCEEEEEEECCCCC--- T ss_conf 887999986289679985001--33999999999971476-455607999879999997114457899985569987--- Q ss_pred CHHCCCCCCCCHH-HHHHHHHHHC-CCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCC Q ss_conf 1000157431003-7889998622-69819972654377762133671487899323010121 Q gi|254780889|r 312 RVVTGVGCPQLSA-IMSVVEVAER-AGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTD 372 (493) Q Consensus 312 r~~~g~g~pq~~a-v~~~~~~~~~-~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~ 372 (493) |-+.... +....+..+. .+..|..||||.. ..+.+...+|||.+-.||.+-+.+ T Consensus 150 ------GQ~Fi~~~l~KI~~lr~~~~~~~I~VDGGIn~-~ti~~l~~aGad~~V~GSaiF~~~ 205 (224) T PTZ00170 150 ------GQSFMHDMMPKVRQLRQRYPHLNIQVDGGINP-DTIDLAAEAGANVIVAGTSIFKAN 205 (224) T ss_pred ------CCCCCHHHHHHHHHHHHCCCCCEEEEECCCCH-HHHHHHHHCCCCEEEECHHHHCCC T ss_conf ------62145889999999985489975999589998-999999986999999785886799 No 301 >PRK12330 oxaloacetate decarboxylase; Provisional Probab=97.12 E-value=0.0079 Score=39.71 Aligned_cols=135 Identities=20% Similarity=0.227 Sum_probs=81.4 Q ss_pred CCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEEC-------CCHHHHHHHHH Q ss_conf 03589999999865533064574799786889802100010174223265687655566513-------70156789998 Q gi|254780889|r 169 TVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSV-------AKDIADRVGPL 241 (493) Q Consensus 169 ~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~-------~~d~~eRa~~L 241 (493) +.-++...+...+|++...-|.|.-+.+-.+...- .+.+. .|.. +-++|.- -+-+.+.|+.| T Consensus 96 ddvv~~fv~~~~~~GidifRiFDaLNdv~Nm~~ai---------~~vk~-~G~~-~q~~i~yt~sPvht~~yy~~~ak~l 164 (499) T PRK12330 96 DEVVDRFVEKSAENGMDVFRVFDALNDLRNLETSI---------KAVKK-TGKH-AQGTICYTVSPIHTTEGFVEQAKRL 164 (499) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHH---------HHHHH-CCCE-EEEEEEEECCCCCCHHHHHHHHHHH T ss_conf 79999999999976998899724444577789999---------99997-1885-8999996058877899999999999 Q ss_pred HHCCCCEEEE-ECCCC-HHHHHHHHHHHHHHHC-CCCCEECCCCCCHH----HHHHHHHCCCCEEEEEECCCCCCCCCHH Q ss_conf 7313666998-24841-1388999999999857-98724514646999----9999985487589961027876653100 Q gi|254780889|r 242 FDVNVDLVVV-DTAHG-HSQKVLDAVVQIKKNF-PSLLVMAGNIATAE----GALALIDAGADIIKVGIGPGSICTTRVV 314 (493) Q Consensus 242 veaGvDvivi-D~ahG-h~~~~~~~i~~~k~~~-~~~~ii~GNv~t~~----~~~~l~~~Gad~ikvGig~Gs~CtTr~~ 314 (493) ++.|+|.||| |.|-- --..+.+.++.||+.+ |+++|-.-.=+|.- ....-++||+|+|-+.|.|=|-.|++.- T Consensus 165 ~~~G~d~i~IKDmAGll~P~~a~~LV~~lk~~~g~d~pI~~HtH~T~G~~~~~~l~AieAGvDivD~A~~~~s~gtsqp~ 244 (499) T PRK12330 165 LDMGCDSICIKDMAALLKPQPAYDLVKGIKEACGPDTRVHVHAHATTGVTLVSLMKAIEAGVDVVDTAISSMSLGPGHNP 244 (499) T ss_pred HHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCEEEECCCCCCHHHHHHHHHHHCCCCEECCCCCCCCCCCCCCC T ss_conf 97599989984753467889999999999986389983798517887469999999998499887244543237988997 No 302 >PRK13117 consensus Probab=97.11 E-value=0.0082 Score=39.58 Aligned_cols=181 Identities=18% Similarity=0.180 Sum_probs=111.4 Q ss_pred CCCCCCCCC---CCCCCCCCCCCCHHHHHHHHHHCCC--CEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEE Q ss_conf 467244002---6786325730358999999986553--30645747997868898021000101742232656876555 Q gi|254780889|r 151 AQQAVGELM---TRNLITVKKTVNLENAKALLHQHRI--EKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVA 225 (493) Q Consensus 151 ~~~~V~eiM---t~~litv~~~~~l~eA~~iM~~~ki--~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~Vg 225 (493) ++.|+.|-- ......+..+.++.+.++++.+-+. ..+|+| +.|....... T Consensus 54 fSDP~ADGpvIq~A~~rAL~~G~~~~~~~~~~~~ir~~~~~~piv--------lM~Y~N~i~~----------------- 108 (268) T PRK13117 54 FSDPLADGPTIQNANLRALAAGVTPAQCFELLAKIRAKYPTIPIG--------LLLYANLVFA----------------- 108 (268) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCEE--------EEECCCHHHH----------------- T ss_conf 888565579999999999845996999999998850047898779--------9732628987----------------- Q ss_pred EEEECCCHHHHHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHH-HCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEE Q ss_conf 6651370156789998731366-69982484113889999999998-579872451464699999999854875899610 Q gi|254780889|r 226 AAVSVAKDIADRVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKK-NFPSLLVMAGNIATAEGALALIDAGADIIKVGI 303 (493) Q Consensus 226 AAIg~~~d~~eRa~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~-~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGi 303 (493) .....-+..+.++|+| +|+.|..+-.+... .+..++ ...-+++++ --.+.+-.+.+.+..-..|=+-- T Consensus 109 ------~G~e~F~~~~~~aGvdGvIipDLP~eE~~~~---~~~~~~~gl~~I~lv~-Ptt~~~Ri~~i~~~a~GFiY~vs 178 (268) T PRK13117 109 ------NGIDNFYARCAEAGVDSVLIADVPVEESAPF---RQAAKKHGIAPIFICP-PNADDDTLRQIASLGRGYTYLLS 178 (268) T ss_pred ------CCHHHHHHHHHHCCCCEEEECCCCHHHHHHH---HHHHHHCCCCEEEEEC-CCCCHHHHHHHHHHCCCEEEEEE T ss_conf ------1799999999976987798579997885899---9999867983799847-99999999999974798599983 Q ss_pred CCCCCCCCCHHCCCCCCCCHHHHHHHHHH-HCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 27876653100015743100378899986-22698199726543777621336714878993230101216 Q gi|254780889|r 304 GPGSICTTRVVTGVGCPQLSAIMSVVEVA-ERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 304 g~Gs~CtTr~~~g~g~pq~~av~~~~~~~-~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) -.| +||..-+....+.+.-+.. +..++||..-=||+++.|+.+++..+||.|-+||.|-..-| T Consensus 179 ~~G-------vTG~~~~~~~~~~~~i~~ik~~t~~Pv~vGFGIs~~e~v~~~~~~~aDGvIVGSaiV~~i~ 242 (268) T PRK13117 179 RAG-------VTGAENKAAAPLNHLVEKLKEYNAPPPLQGFGISEPEQVKAAIKAGAAGAISGSAIVKIIE 242 (268) T ss_pred CCC-------CCCCCCCCCHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHCCCCEEEECHHHHHHHH T ss_conf 677-------7889866627799999999964799869983789999999998638998998789999998 No 303 >COG0159 TrpA Tryptophan synthase alpha chain [Amino acid transport and metabolism] Probab=97.09 E-value=0.024 Score=36.22 Aligned_cols=183 Identities=21% Similarity=0.222 Sum_probs=113.1 Q ss_pred CCCCCCCCCCC---CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEE Q ss_conf 44672440026---786325730358999999986553306457479978688980210001017422326568765556 Q gi|254780889|r 150 NAQQAVGELMT---RNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAA 226 (493) Q Consensus 150 ~~~~~V~eiMt---~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgA 226 (493) .++.|+.|--+ ....-+..+.+++++++++.+-+.. ..+..+++.|....+-+ T Consensus 53 PfSDPvADGP~Iq~A~~rAL~~g~t~~~~lel~~~~r~~------~~~~Pivlm~Y~Npi~~------------------ 108 (265) T COG0159 53 PFSDPVADGPTIQAAHLRALAAGVTLEDTLELVEEIRAK------GVKVPIVLMTYYNPIFN------------------ 108 (265) T ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC------CCCCCEEEEEECCHHHH------------------ T ss_conf 888867668899998999997799889999999999861------89998899870118877------------------ Q ss_pred EEECCCHHHHH-HHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEEC Q ss_conf 65137015678-9998731366-699824841138899999999985798724514646999999998548758996102 Q gi|254780889|r 227 AVSVAKDIADR-VGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIG 304 (493) Q Consensus 227 AIg~~~d~~eR-a~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig 304 (493) -..++ .+.+.++|+| +|+.|.-.-++....+..+ +....-+++++.| .+.+--+.+.+..-..+-..-= T Consensus 109 ------~Gie~F~~~~~~~GvdGlivpDLP~ee~~~~~~~~~--~~gi~~I~lvaPt-t~~~rl~~i~~~a~GFiY~vs~ 179 (265) T COG0159 109 ------YGIEKFLRRAKEAGVDGLLVPDLPPEESDELLKAAE--KHGIDPIFLVAPT-TPDERLKKIAEAASGFIYYVSR 179 (265) T ss_pred ------HHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHH--HCCCCEEEEECCC-CCHHHHHHHHHHCCCCEEEEEC T ss_conf ------359999999997599879857898667778999999--7698679886999-9989999999747985899966 Q ss_pred CCCCCCCCHHCCCCCCCCHHHHHHHHHHH-CCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 78766531000157431003788999862-2698199726543777621336714878993230101216 Q gi|254780889|r 305 PGSICTTRVVTGVGCPQLSAIMSVVEVAE-RAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 305 ~Gs~CtTr~~~g~g~pq~~av~~~~~~~~-~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) .| +||+-.+....+.+.-+..+ -.++||..-=||+++.++.+.+.+ ||-|-.||-|-..-| T Consensus 180 ~G-------vTG~~~~~~~~~~~~v~~vr~~~~~Pv~vGFGIs~~e~~~~v~~~-ADGVIVGSAiV~~i~ 241 (265) T COG0159 180 MG-------VTGARNPVSADVKELVKRVRKYTDVPVLVGFGISSPEQAAQVAEA-ADGVIVGSAIVKIIE 241 (265) T ss_pred CC-------CCCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHH-CCEEEECHHHHHHHH T ss_conf 66-------667776530469999999997448973874486999999999976-885797399999999 No 304 >PRK07028 bifunctional hexulose-6-phosphate synthase/ribonuclease regulator; Validated Probab=97.08 E-value=0.029 Score=35.68 Aligned_cols=121 Identities=29% Similarity=0.408 Sum_probs=80.9 Q ss_pred HHHHHHHCCCCEEEEE-CCCCHHHHHHHHHHHHHHHCCCCCEECCCCCC---HHHHHHHHHCCCCEEEEEECCCCCCCCC Q ss_conf 8999873136669982-48411388999999999857987245146469---9999999854875899610278766531 Q gi|254780889|r 237 RVGPLFDVNVDLVVVD-TAHGHSQKVLDAVVQIKKNFPSLLVMAGNIAT---AEGALALIDAGADIIKVGIGPGSICTTR 312 (493) Q Consensus 237 Ra~~LveaGvDviviD-~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t---~~~~~~l~~~Gad~ikvGig~Gs~CtTr 312 (493) -++...+||+|+++|= .|+ .....+.++.-|+. ...+++-.+.. .+-++.|-+.|+|.|-+-.|-- T Consensus 73 Ea~~A~~AGADivtVlG~a~--d~TI~~aV~aA~k~--G~~v~vDlI~v~d~~~ra~el~~lGvd~I~vH~G~D------ 142 (429) T PRK07028 73 EVEMAAKAGADVVCILGVAD--DSTIADAVRAARKY--GVLVMADLINVPDPVKRAVELEELGVDIINVHVGID------ 142 (429) T ss_pred HHHHHHHCCCCEEEEECCCC--HHHHHHHHHHHHHC--CCEEEEEECCCCCHHHHHHHHHHCCCCEEEEEEEEC------ T ss_conf 99999876998899945788--36999999999970--988999855899889999999970998899976233------ Q ss_pred HHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCC Q ss_conf 000157431003788999862269819972654377762133671487899323010121 Q gi|254780889|r 313 VVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTD 372 (493) Q Consensus 313 ~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~ 372 (493) +...|...+..+.+ .++..++||=.-|||+ +..+.+++.+|||-|-.|+...++. T Consensus 143 -~Q~~g~~p~~~l~~---v~~~~~~~vAVAGGi~-~~t~~~~v~~GAdIvIVGgaI~~a~ 197 (429) T PRK07028 143 -QQMLGKDPLELLKK---VSEEVSIPIAAAGGLD-AETAVKAVEAGADIVIVGGNIYKSA 197 (429) T ss_pred -HHHCCCCHHHHHHH---HHHHCCCEEEEECCCC-HHHHHHHHHCCCCEEEECCCCCCCC T ss_conf -55317984999999---9975597189966878-7769999975998999894005799 No 305 >CHL00200 trpA tryptophan synthase alpha subunit; Provisional Probab=97.07 E-value=0.033 Score=35.29 Aligned_cols=169 Identities=18% Similarity=0.172 Sum_probs=96.4 Q ss_pred CCCCCCCCCHHHHHHHHHHCCCC-EEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHH Q ss_conf 63257303589999999865533-06457479978688980210001017422326568765556651370156789998 Q gi|254780889|r 163 LITVKKTVNLENAKALLHQHRIE-KLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPL 241 (493) Q Consensus 163 litv~~~~~l~eA~~iM~~~ki~-~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~L 241 (493) -..+..+.++.+.++++.+.+.+ ..|+| +.|....+-+ .| ...-+..+ T Consensus 67 ~~AL~~G~~~~~~~~~v~~~r~~~~~Piv--------lMtY~N~i~~-------------------yG----~e~F~~~~ 115 (263) T CHL00200 67 NRALKQGINLNKILSILSEVNGEIKAPIV--------IFTYYNPVLH-------------------YG----INKFIKKI 115 (263) T ss_pred HHHHHCCCCHHHHHHHHHHHHHCCCCCEE--------EEEEHHHHHH-------------------CC----HHHHHHHH T ss_conf 99997798777899999998606799889--------9862068887-------------------38----89999999 Q ss_pred HHCCCC-EEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCC Q ss_conf 731366-6998248411388999999999857987245146469999999985487589961027876653100015743 Q gi|254780889|r 242 FDVNVD-LVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCP 320 (493) Q Consensus 242 veaGvD-viviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~p 320 (493) .++|+| +|+.|...-.+....+..+ +..+.-+++++.+ .+.+-.+.+.+..-..|=+= +..-+||..-. T Consensus 116 ~~~GvdGlIipDLP~eE~~~~~~~~~--~~gl~~I~lvaPt-t~~~Ri~~i~~~a~GFiY~v-------s~~GvTG~~~~ 185 (263) T CHL00200 116 SQAGVKGLIIPDLPYEESDYLISVCN--LYNIELILLIAPT-SSKSRIQKIARAAPGCIYLV-------STTGVTGLKTE 185 (263) T ss_pred HHCCCCEEECCCCCHHHHHHHHHHHH--HCCCCEEEEECCC-CCHHHHHHHHHHCCCCEEEE-------ECCCCCCCCCC T ss_conf 98499868747999788899999998--5586216664789-96999999997289808985-------33655687544 Q ss_pred CCHHHHHHH-HHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCC Q ss_conf 100378899-9862269819972654377762133671487899323010121 Q gi|254780889|r 321 QLSAIMSVV-EVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTD 372 (493) Q Consensus 321 q~~av~~~~-~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~ 372 (493) --..+.+.- +..+..+.||..-=||+++-|+.+...+|||.|-+||.|-..- T Consensus 186 ~~~~l~~~i~~ik~~t~~Pv~vGFGIs~~e~v~~~~~~~aDGvIVGSaiV~~i 238 (263) T CHL00200 186 LDKKLKKLIETIKKMTNKPIILGFGISTSEQIKQIKGWNINGIVIGSACVQIL 238 (263) T ss_pred CHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHCCCCEEEECHHHHHHH T ss_conf 51879999999997369984873587999999999745999999878999999 No 306 >TIGR00007 TIGR00007 phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase; InterPro: IPR006063 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase (5.3.1.16 from EC), also known as Phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase or HisA, catalyses the fourth step in histidine biosynthesis. HisA from Lactococcus lactis was found to be inactive . The putative HisA from Thermotoga maritima, is a conspicuous outlier to the set of all other HisA, including experimental HisA from the bacterium E. coli and the Archaeaon Methanococcus voltae. Neighbor joining shows HisA from Thermotoga maritima to be within the HisA family (with HisF as an outgroup) but with a long branch. ; GO: 0003949 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino]imidazole-4-carboxamide isomerase activity, 0000105 histidine biosynthetic process. Probab=97.07 E-value=0.013 Score=38.22 Aligned_cols=178 Identities=23% Similarity=0.287 Sum_probs=120.2 Q ss_pred CCCHHHHHHHHHH-CCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCC Q ss_conf 0358999999986-553306457479978688980210001017422326568765556651370156789998731366 Q gi|254780889|r 169 TVNLENAKALLHQ-HRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVD 247 (493) Q Consensus 169 ~~~l~eA~~iM~~-~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvD 247 (493) ..++.++.+.+.+ .....|=|||=+|-+-|=....++++.... .-+.+.=||- |++ +.|.++.|+++|++ T Consensus 27 ~d~P~~~A~~~~~~~GA~~iHvVDLDGA~~g~~~N~~~i~~I~~-----~~~~~vQvGG--GIR--s~e~v~~ll~~Gv~ 97 (241) T TIGR00007 27 SDDPVEAAKKWEEFQGAKRIHVVDLDGALEGGPVNLEVIKKIVE-----ELGVPVQVGG--GIR--SLEDVEKLLDLGVD 97 (241) T ss_pred ECCHHHHHHHHHHCCCCCEEEEEECCHHHCCCCCHHHHHHHHHH-----HCCCCEEECC--CCC--CHHHHHHHHHCCCC T ss_conf 08989999999841697159998451000686200789999998-----6185179817--516--88999999973985 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHCC-C-----CCEECC-----CCCC-----------HHHHHHHHHCC-CCEEEEEEC Q ss_conf 69982484113889999999998579-8-----724514-----6469-----------99999998548-758996102 Q gi|254780889|r 248 LVVVDTAHGHSQKVLDAVVQIKKNFP-S-----LLVMAG-----NIAT-----------AEGALALIDAG-ADIIKVGIG 304 (493) Q Consensus 248 viviD~ahGh~~~~~~~i~~~k~~~~-~-----~~ii~G-----Nv~t-----------~~~~~~l~~~G-ad~ikvGig 304 (493) -++|=|+=--+ -+.++++-++|+ + ++.--| -|++ .+.++.|.+.| +..| T Consensus 98 RVI~GT~A~~~---~~~v~~~~~~~g~~~i~V~lD~~~g~~G~~~V~v~GW~E~s~~~~~~~~~~~~~~G~~~~i----- 169 (241) T TIGR00007 98 RVIIGTAAVEN---PDLVKELLKEYGPERIVVSLDARDGEEGVKEVAVKGWKEKSEVSLEELAKRLEELGELEGI----- 169 (241) T ss_pred EEEEEEEEECC---HHHHHHHHHHHCCCCEEEEEEEECCCCCEEEEEEEEEECCCCCCHHHHHHHHHHCCCCCEE----- T ss_conf 79973322108---6999999998489965999863148875178887404113562799999998515863368----- Q ss_pred CCCCCCCCHHCCCCC-CCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEE--ECCCEEEECCHH Q ss_conf 787665310001574-3100378899986226981997265437776213367--148789932301 Q gi|254780889|r 305 PGSICTTRVVTGVGC-PQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIA--AGSACVMIGSLL 368 (493) Q Consensus 305 ~Gs~CtTr~~~g~g~-pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla--~GA~~VM~G~~~ 368 (493) |||-=..=|--- |-+....+.++. -.++|+||-|||+..-|+-++=. +|..-|-+|+.| T Consensus 170 ---i~TdI~~DGtl~G~n~~~~~~~~~~--~~~~~viaSGGv~s~~D~~~L~~~~~G~~GvIvGkAL 231 (241) T TIGR00007 170 ---IYTDISRDGTLSGPNFELTKELVKA--LVNVPVIASGGVSSIDDLRALKEIELGVYGVIVGKAL 231 (241) T ss_pred ---EEEEEECCCCEECCCCCHHHHHHHH--HCCCEEEEECCCCCHHHHHHHHHCCCCCCEEEEEEEE T ss_conf ---9975200672007873288999987--3584189942657889999999715983279986211 No 307 >PRK07455 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional Probab=97.06 E-value=0.0098 Score=39.05 Aligned_cols=122 Identities=15% Similarity=0.221 Sum_probs=79.6 Q ss_pred CEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEE Q ss_conf 76555665137015678999873136669982484113889999999998579872451464699999999854875899 Q gi|254780889|r 221 RLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIK 300 (493) Q Consensus 221 rL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ik 300 (493) .+++||.-=. +.|-+++.+++|++++|- -+.+..+++..+.. +++.+ ..|.|+.-+...+++|+|.+| T Consensus 64 ~~~iGaGTV~---~~e~~~~a~~aGA~FiVS---P~~~~~vi~~a~~~-----~i~~i-PGv~TpsEi~~A~~~G~~~vK 131 (210) T PRK07455 64 ECIIGTGTLL---TLEDLEEAIAAGAQFCFT---PHVDLELIQAAVAA-----DIPII-PGALTPTEIVTAWQAGASCVK 131 (210) T ss_pred CCEEEEEECC---CHHHHHHHHHCCCCEEEC---CCCCHHHHHHHHHC-----CCCEE-CCCCCHHHHHHHHHCCCCEEE T ss_conf 9689888187---899999999869999986---88889999999982-----99765-886999999999986998477 Q ss_pred EEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHH Q ss_conf 610278766531000157431003788999862269819972654377762133671487899323010 Q gi|254780889|r 301 VGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLA 369 (493) Q Consensus 301 vGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~a 369 (493) += |.+.. - |...+.++.. . -.++++++-|||. ..++..-|.+|+.+|=+||.|. T Consensus 132 lF--PA~~~-----G--G~~ylkal~~---p--~p~i~~~ptGGV~-~~n~~~yl~ag~~~vg~Gs~l~ 185 (210) T PRK07455 132 VF--PVQAV-----G--GADYIKSLQG---P--LGHIPLIPTGGVT-LENAQAFIQAGAIAVGLSSQLF 185 (210) T ss_pred EC--CCHHC-----C--CHHHHHHHHC---C--CCCCCEEECCCCC-HHHHHHHHHCCCEEEEECHHHC T ss_conf 50--51320-----6--7999999865---4--8999388789989-8889999968997999884618 No 308 >COG0517 FOG: CBS domain [General function prediction only] Probab=97.01 E-value=0.0015 Score=44.91 Aligned_cols=53 Identities=34% Similarity=0.551 Sum_probs=34.5 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCC Q ss_conf 002678632573035899999998655330645747997868898021000101 Q gi|254780889|r 157 ELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 157 eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) ++|.++++++.++.++.+|...|.++++..+||++.. +++|++|..|+++... T Consensus 3 ~~~~~~~~~v~~~~~~~~a~~~m~~~~~~~~~v~~~~-~l~Giit~~di~~~~~ 55 (117) T COG0517 3 DIMTKDVITVKPDTSVRDALLLMSENGVSAVPVVDDG-KLVGIITERDILRALA 55 (117) T ss_pred CCCCCCCEEECCCCCHHHHHHHHHHCCCEEEEEEECC-EEEEEEEHHHHHHHHH T ss_conf 5442688089899819999999987493389974399-9999999899987664 No 309 >PRK13116 consensus Probab=96.99 E-value=0.038 Score=34.78 Aligned_cols=180 Identities=14% Similarity=0.172 Sum_probs=103.6 Q ss_pred CCCCCCCC---CCCCCCCCCCCCCHHHHHHHHHHCCC--CEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEE Q ss_conf 46724400---26786325730358999999986553--30645747997868898021000101742232656876555 Q gi|254780889|r 151 AQQAVGEL---MTRNLITVKKTVNLENAKALLHQHRI--EKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVA 225 (493) Q Consensus 151 ~~~~V~ei---Mt~~litv~~~~~l~eA~~iM~~~ki--~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~Vg 225 (493) ++.|+.|- +.-....+..++++.++++++.+-|. ..+|+| +.|....+-+ T Consensus 54 FSDP~ADGPvIQ~A~~rAL~~G~~~~~~~~~v~~ir~~~~~~Piv--------lM~Y~N~i~~----------------- 108 (278) T PRK13116 54 FSDPVADGPTVAESHLRALDGGATVDSALEQIKRVRAAYPEVPIG--------MLIYGNVPFT----------------- 108 (278) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCEE--------EEECCCHHHH----------------- T ss_conf 888566689999999999976986789999999840358987689--------9805728877----------------- Q ss_pred EEEECCCHHHHHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHHH-CCCCCEECCCCCCHHH-HHHHHHCCCCEEEEE Q ss_conf 6651370156789998731366-699824841138899999999985-7987245146469999-999985487589961 Q gi|254780889|r 226 AAVSVAKDIADRVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKKN-FPSLLVMAGNIATAEG-ALALIDAGADIIKVG 302 (493) Q Consensus 226 AAIg~~~d~~eRa~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~~-~~~~~ii~GNv~t~~~-~~~l~~~Gad~ikvG 302 (493) .....-+..+.++|+| +|+.|...-.+.. ..+..++. ..-+++++. .|.+. .+.+.+..-..|-+- T Consensus 109 ------~G~e~F~~~~~~aGvdGlIipDLP~eE~~~---~~~~~~~~~i~~I~l~~p--tt~~~ri~~I~~~s~GFiY~V 177 (278) T PRK13116 109 ------RGLDRFYQEFAEAGADSILLPDVPVREGAP---FSAAAAAAGIDPIYIAPA--NASEKTLEGVSAASKGYIYAI 177 (278) T ss_pred ------CCHHHHHHHHHHCCCCEEEECCCCHHHHHH---HHHHHHHCCCCEEEEECC--CCCHHHHHHHHHHCCCEEEEE T ss_conf ------279999999977697589946999788899---999998657666999379--995999999997189739998 Q ss_pred ECCCCCCCCCHHCCCCCCCCH-HHHH-HHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 027876653100015743100-3788-9998622698199726543777621336714878993230101216 Q gi|254780889|r 303 IGPGSICTTRVVTGVGCPQLS-AIMS-VVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 303 ig~Gs~CtTr~~~g~g~pq~~-av~~-~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) +..-+||......+ .+.+ ..+..+..++||..-=||+++-|+.+++..+||-|-+||.|-..-| T Consensus 178 -------S~~GvTG~~~~~~~~~l~~~i~~ik~~t~~Pv~vGFGIs~~e~v~~~~~~~aDGVIVGSAiVk~Ie 243 (278) T PRK13116 178 -------SRDGVTGTERESSTDGLSAVVDNIKKFDGAPILLGFGISSPQHVADAIAAGASGAITGSAITKIIA 243 (278) T ss_pred -------ECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHCCCCEEEECHHHHHHHH T ss_conf -------635222688666678999999999845799879981679899999998668999998779999998 No 310 >KOG2335 consensus Probab=96.99 E-value=0.0025 Score=43.36 Aligned_cols=98 Identities=24% Similarity=0.400 Sum_probs=68.9 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCHHHHHH---------HHHHHHHHHCCCCCEEC-CCCCCHHHHHHHHH-CCCCEEEE Q ss_conf 156789998731366699824841138899---------99999998579872451-46469999999985-48758996 Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVVDTAHGHSQKVL---------DAVVQIKKNFPSLLVMA-GNIATAEGALALID-AGADIIKV 301 (493) Q Consensus 233 d~~eRa~~LveaGvDviviD~ahGh~~~~~---------~~i~~~k~~~~~~~ii~-GNv~t~~~~~~l~~-~Gad~ikv 301 (493) ++.+-+..+.+||++.|.| ||-...+. +.|+.+|+.+|++++++ |||-+.+-+...++ .|||+|-+ T Consensus 156 kTvd~ak~~e~aG~~~ltV---HGRtr~~kg~~~~pad~~~i~~v~~~~~~ipviaNGnI~~~~d~~~~~~~tG~dGVM~ 232 (358) T KOG2335 156 KTVDYAKMLEDAGVSLLTV---HGRTREQKGLKTGPADWEAIKAVRENVPDIPVIANGNILSLEDVERCLKYTGADGVMS 232 (358) T ss_pred HHHHHHHHHHHCCCCEEEE---ECCCHHHCCCCCCCCCHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHHCCCEEEE T ss_conf 7899999998679868999---3655776288888767799999997476770895088576899999999758746886 Q ss_pred EECCCCCCCCCHH--CCCCCCCCHHHHHHHHHHHCC Q ss_conf 1027876653100--015743100378899986226 Q gi|254780889|r 302 GIGPGSICTTRVV--TGVGCPQLSAIMSVVEVAERA 335 (493) Q Consensus 302 Gig~Gs~CtTr~~--~g~g~pq~~av~~~~~~~~~~ 335 (493) |= |..---..- +|.+.+....+.+--+.+.++ T Consensus 233 ar--glL~NPa~F~~~~~~~~~~~~~~~~l~~~~e~ 266 (358) T KOG2335 233 AR--GLLYNPALFLTAGYGPTPWGCVEEYLDIAREF 266 (358) T ss_pred CC--HHHCCCHHHCCCCCCCCHHHHHHHHHHHHHHC T ss_conf 00--00038412026887778788999999999972 No 311 >pfam03060 NPD 2-nitropropane dioxygenase. Members of this family catalyse the denitrification of a number of nitroalkanes using either FAD or FMN as a cofactor. Probab=96.97 E-value=0.019 Score=37.04 Aligned_cols=67 Identities=27% Similarity=0.359 Sum_probs=51.1 Q ss_pred HHHHHHHHHCCCCEEEEEC--CCCHH-----HHHHHHHHHHHHHCCCCCEE-CCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 6789998731366699824--84113-----88999999999857987245-146469999999985487589961 Q gi|254780889|r 235 ADRVGPLFDVNVDLVVVDT--AHGHS-----QKVLDAVVQIKKNFPSLLVM-AGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 235 ~eRa~~LveaGvDviviD~--ahGh~-----~~~~~~i~~~k~~~~~~~ii-~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) .+-+..+.++|+|+||+.- |-||. ......+..+++.. ++||| +|.++|.++....+..|||++-+| T Consensus 147 ~~~A~~a~~~G~D~iV~qG~eAGGH~G~~~~~~~~~L~~~v~~~~-~iPvIaAGGI~dg~~iaaalalGA~gV~mG 221 (330) T pfam03060 147 AKEARKAEAAGADAVVAQGPEAGGHRGTEVGTGTFLLVPTVVDAV-DIPVIAAGGIADGRGIAAALALGAEGVQIG 221 (330) T ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHC-CCCEEEECCCCCHHHHHHHHHCCCCEEEEC T ss_conf 999999998199989996676677788877730777789999871-697785266289999999996799899971 No 312 >TIGR03520 GldE gliding motility-associated protein GldE. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldC is a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. GldE was discovered because of its adjacency to GldD in F. johnsonii. Overexpression of GldE partially supresses the effects of a GldB point mutant suggesting that GldB and GldE interact. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Not all Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility and in fact some do not appear to express the gliding phenotype. Probab=96.96 E-value=0.0008 Score=46.85 Aligned_cols=61 Identities=10% Similarity=0.201 Sum_probs=54.2 Q ss_pred CCCCCCCCCCCCC--CCCCCCCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEECCCHHCC Q ss_conf 3446724400267--86325730358999999986553306457479-9786889802100010 Q gi|254780889|r 149 SNAQQAVGELMTR--NLITVKKTVNLENAKALLHQHRIEKLLVVDDD-GCCIGLITVKDIERSQ 209 (493) Q Consensus 149 ~~~~~~V~eiMt~--~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~-g~LvGIIT~~DIlk~~ 209 (493) +-.+.+|+++|++ ++++++.+.+++++.+.+.+++..++||.+++ ++++|++..+|+++.. T Consensus 187 ~l~~~~v~eiMtPr~~v~~l~~~~~~~e~~~~~~~~~~SR~PV~~~~~D~iiGiv~~kDll~~~ 250 (408) T TIGR03520 187 SFGNTDTKQVMRPRLDIFALDIETSFSEIIPKIIENGYSRIPVYKETIDNITGVLYIKDLLPHL 250 (408) T ss_pred CCCCCEEHHCCCCHHHEEEEECCCHHHHHHHHHHHCCCCCCEEECCCCCCEEEEEEHHHHHHHH T ss_conf 7599890342354579588406443899999998579872226658987617997798998787 No 313 >PRK06512 thiamine-phosphate pyrophosphorylase; Provisional Probab=96.95 E-value=0.0061 Score=40.54 Aligned_cols=99 Identities=18% Similarity=0.270 Sum_probs=64.8 Q ss_pred HHHHHHHHHC-CCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCC-CCHHHHHHHHHHHCCCCEEE Q ss_conf 9999999857-987245146469999999985487589961027876653100015743-10037889998622698199 Q gi|254780889|r 263 DAVVQIKKNF-PSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCP-QLSAIMSVVEVAERAGVAIV 340 (493) Q Consensus 263 ~~i~~~k~~~-~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~p-q~~av~~~~~~~~~~~~~ii 340 (493) ..++..|+.+ |+..|-+++..+.+.+....++|||.| ++|| +-+| ...-..| .+..+.+.+ ....+|++ T Consensus 98 ~~~~~aR~~lg~~~IIG~~~~~s~~~A~~A~e~GADYv--~fG~--~~~~--~k~~a~~~~l~~l~~~~---~~~~iP~V 168 (221) T PRK06512 98 AALAEAIEKHAPKMIVGFGNLRDRHGAMEVGELQPDYL--FFGK--LGAD--NKPEAHPRNLSLAEWWA---EMIEIPCI 168 (221) T ss_pred CCHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCEE--EECC--CCCC--CCCCCCCCCHHHHHHHH---HCCCCCEE T ss_conf 31999999847886786405788999999997399857--6578--7888--89988754258999999---74799989 Q ss_pred EECCCCCHHHHEEEEEECCCEEEECCHHHCC Q ss_conf 7265437776213367148789932301012 Q gi|254780889|r 341 ADGGIRFSGDIAKAIAAGSACVMIGSLLAGT 371 (493) Q Consensus 341 adGGi~~~gdi~kAla~GA~~VM~G~~~agt 371 (493) |=|||. ..++..-+.+|||+|.+.|..-.. T Consensus 169 AIGGIt-~~n~~~v~~aGad~vAVisaI~~a 198 (221) T PRK06512 169 VQAGSD-LASIVEVAETGAEFVALGRAVFDA 198 (221) T ss_pred EECCCC-HHHHHHHHHHCCCEEEEHHHHHCC T ss_conf 982789-999999998199899885996089 No 314 >TIGR03415 ABC_choXWV_ATP choline ABC transporter, ATP-binding protein. Members of this protein family are the ATP-binding subunit of a three-protein transporter. This family belongs, more broadly, to the family of proline and glycine-betaine transporters, but members have been identified by direct characterization and by bioinformatic means as choline transporters. Many species have several closely-related members of this family, probably with variable abilities to act additionally on related quaternary amines. Probab=96.95 E-value=0.0009 Score=46.49 Aligned_cols=51 Identities=10% Similarity=0.333 Sum_probs=25.6 Q ss_pred HHHHHHCCCCEEEEECCCCH-----HHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHH Q ss_conf 99987313666998248411-----3889999999998579872451464699999999 Q gi|254780889|r 238 VGPLFDVNVDLVVVDTAHGH-----SQKVLDAVVQIKKNFPSLLVMAGNIATAEGALAL 291 (493) Q Consensus 238 a~~LveaGvDviviD~ahGh-----~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l 291 (493) |++|+ ..-++|+.|=.-+. ...+.+.++.+.+.+...-|++ -...+-|..| T Consensus 176 ARALa-~~P~iLLmDEPfsaLD~~~r~~l~~~l~~L~~~~~~TiifV--THD~~EA~~l 231 (382) T TIGR03415 176 ARAFA-MDADILLMDEPFSALDPLIRTQLQDELLELQAKLNKTIIFV--SHDLDEALKI 231 (382) T ss_pred HHHHH-CCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCEEEEE--CCCHHHHHHH T ss_conf 99986-38998997088765599999999999999999869989998--7999999986 No 315 >cd04738 DHOD_2_like Dihydroorotate dehydrogenase (DHOD) class 2. DHOD catalyzes the oxidation of (S)-dihydroorotate to orotate. This is the fourth step and the only redox reaction in the de novo biosynthesis of UMP, the precursor of all pyrimidine nucleotides. DHOD requires FMN as co-factor. DHOD divides into class 1 and class 2 based on their amino acid sequences, their cellular location and their natural electron acceptor used to reoxidize the flavin group. Members of class 1 are cytosolic enzymes and multimers, while class 2 enzymes are membrane associated, monomeric and use respiratory quinones as their physiological electron acceptors. Probab=96.94 E-value=0.043 Score=34.46 Aligned_cols=44 Identities=20% Similarity=0.373 Sum_probs=35.9 Q ss_pred HHHHHHHHHHHHHCC-CCCEEC-CCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 889999999998579-872451-46469999999985487589961 Q gi|254780889|r 259 QKVLDAVVQIKKNFP-SLLVMA-GNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 259 ~~~~~~i~~~k~~~~-~~~ii~-GNv~t~~~~~~l~~~Gad~ikvG 302 (493) ....++++++++.++ +++||+ |.|.|++-+.+.+.+|||.|-++ T Consensus 264 ~~s~~~v~~v~~~~~~~~pIIgvGGI~s~~Da~e~i~aGAslVQiy 309 (327) T cd04738 264 ERSTEVLRELYKLTGGKIPIIGVGGISSGEDAYEKIRAGASLVQLY 309 (327) T ss_pred HHHHHHHHHHHHHCCCCCCEEEECCCCCHHHHHHHHHCCCCHHHHH T ss_conf 9999999999997499981999889799999999998699699876 No 316 >PRK06857 consensus Probab=96.90 E-value=0.045 Score=34.27 Aligned_cols=123 Identities=15% Similarity=0.272 Sum_probs=80.5 Q ss_pred CEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEE Q ss_conf 76555665137015678999873136669982484113889999999998579872451464699999999854875899 Q gi|254780889|r 221 RLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIK 300 (493) Q Consensus 221 rL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ik 300 (493) .+.|||.-=. +.+-+++.+++|++++|- -+.+..+++..+.. +++.+ ..+.|+.-.....++|+|.+| T Consensus 63 ~~~vGaGTV~---~~e~~~~a~~aGA~FiVS---P~~~~~v~~~a~~~-----~i~~i-PGv~TpsEi~~A~~~G~~~vK 130 (209) T PRK06857 63 DMLIGAGTVL---TPEQVDAAKEAGADFIVS---PGFNPNTVKYCQQL-----NIPIV-PGVNNPSLVEQALEMGLTTLK 130 (209) T ss_pred CCEEEEEECC---CHHHHHHHHHCCCCEEEC---CCCCHHHHHHHHHC-----CCCEE-CCCCCHHHHHHHHHCCCCEEE T ss_conf 9489999376---799999999839999990---89999999999974-----99654-787999999999987999899 Q ss_pred EEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHC Q ss_conf 6102787665310001574310037889998622698199726543777621336714878993230101 Q gi|254780889|r 301 VGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAG 370 (493) Q Consensus 301 vGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~ag 370 (493) +==. +.+ | |..-+.++..- -.++++++-|||.- -++..-|++|+-++..||++.- T Consensus 131 lFPA--~~~------g-G~~~lkal~~p-----~p~~~~~ptGGV~~-~N~~~yl~~~~v~~~gGS~l~~ 185 (209) T PRK06857 131 FFPA--EAS------G-GVNMLKALLAP-----YPNLQIMPTGGINP-SNIKDYLAIPNVVACGGTWMVP 185 (209) T ss_pred ECCC--CCC------C-CHHHHHHHHCC-----CCCCEEEECCCCCH-HHHHHHHCCCCEEEEECHHHCC T ss_conf 7866--212------6-69999998653-----89980996489888-7899998599889998936589 No 317 >PRK08091 ribulose-phosphate 3-epimerase; Validated Probab=96.88 E-value=0.0068 Score=40.20 Aligned_cols=132 Identities=14% Similarity=0.169 Sum_probs=73.2 Q ss_pred HHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCC-----CCCE---ECCCCCC-HHHHHHHHHCCCCEEE-EEECC Q ss_conf 78999873136669982484113889999999998579-----8724---5146469-9999999854875899-61027 Q gi|254780889|r 236 DRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFP-----SLLV---MAGNIAT-AEGALALIDAGADIIK-VGIGP 305 (493) Q Consensus 236 eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~-----~~~i---i~GNv~t-~~~~~~l~~~Gad~ik-vGig~ 305 (493) .-+..++++|+|.+++-.-- .....+.++.+|+... ...+ +|=|-.| .+....+++. +|.|. ..+-| T Consensus 82 ~~i~~~~~aGad~it~H~Ea--~~~~~~~i~~i~~~~~~~~~~~~~~~~GlAlnP~Tpve~l~~~L~~-vD~VLvMtV~P 158 (235) T PRK08091 82 TVAKACVKAGAHCITLQAEG--DIHLHHTLSWLGQQTVPVIGGEMPVLRGISLCPATPLDVLIPYLSD-VDVIQLLTLDP 158 (235) T ss_pred HHHHHHHHCCCCEEEECCCC--CCCHHHHHHHHHHHCCHHHHHHCCCCCEEEECCCCCHHHHHHHHHH-CCEEEEEEECC T ss_conf 99999997599899975455--5588999999998342022222075013897999988999998705-39999987668 Q ss_pred CCCCCCCHHCCCCCCCCHHHHHHHHHHHCC--CCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCC Q ss_conf 876653100015743100378899986226--981997265437776213367148789932301012167764 Q gi|254780889|r 306 GSICTTRVVTGVGCPQLSAIMSVVEVAERA--GVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGD 377 (493) Q Consensus 306 Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~--~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~ 377 (493) |-.=-. +--..+.-|.+.++...+. +..|..||||... .+.+...+|||.+-.||.+=+. +.|.+ T Consensus 159 GfgGQ~-----fi~~~l~KI~~l~~~~~~~~~~~~I~VDGGI~~~-ti~~~~~aGad~~V~GS~iF~~-~d~~e 225 (235) T PRK08091 159 RYGSKM-----RSSDLHERVAQLLCLLGDKREGKLIVIDGSMTQD-QLPSLIAQGIDWVVSGSALFSD-DRLVE 225 (235) T ss_pred CCCCCC-----CCHHHHHHHHHHHHHHHHCCCCCEEEEECCCCHH-HHHHHHHCCCCEEEECHHHCCC-CCHHH T ss_conf 988886-----7878999999999999964999159984898988-8999998399999978243379-99999 No 318 >PRK10550 tRNA-dihydrouridine synthase C; Provisional Probab=96.87 E-value=0.042 Score=34.52 Aligned_cols=99 Identities=23% Similarity=0.217 Sum_probs=54.5 Q ss_pred HHHHHHHHHHHHHCC-CCCE----ECCC---CCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCC-CCHHHHHHH Q ss_conf 889999999998579-8724----5146---469999999985487589961027876653100015743-100378899 Q gi|254780889|r 259 QKVLDAVVQIKKNFP-SLLV----MAGN---IATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCP-QLSAIMSVV 329 (493) Q Consensus 259 ~~~~~~i~~~k~~~~-~~~i----i~GN---v~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~p-q~~av~~~~ 329 (493) +.+.+.++.+++..| ++|| =.|- -.+.|-++.+.++|++.|-|- .-||.+-=-|-+ -...+.++. T Consensus 117 ~~~~~iv~a~~~~v~~~iPVtvK~RlG~~~~~~~~e~~~~~~~~G~~~ltvH------~RT~~q~y~~~~~dw~~i~~~~ 190 (312) T PRK10550 117 ELIYQGAKAMREAVPAHLPVTVKVRLGWDSGDRKFEIADAVQQAGATELVVH------GRTKEQGYRAEHIDWQAIGEIR 190 (312) T ss_pred HHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCHHHHHHHHHHHCCCCEEEEE------CCCHHHCCCCCCCCHHHHHHHH T ss_conf 7999999999974587899547753589986319999999997399879990------5526535899834899999999 Q ss_pred HHHHCCCCEEEEECCCCCHHHHEEEEE-ECCCEEEECC Q ss_conf 986226981997265437776213367-1487899323 Q gi|254780889|r 330 EVAERAGVAIVADGGIRFSGDIAKAIA-AGSACVMIGS 366 (493) Q Consensus 330 ~~~~~~~~~iiadGGi~~~gdi~kAla-~GA~~VM~G~ 366 (493) +. ..+|||+-|+|.+.-|..+.+. -|+|.||+|. T Consensus 191 ~~---~~iPvi~NGdI~s~~d~~~~~~~tg~dgvMiGR 225 (312) T PRK10550 191 QR---LTIPVIANGEIWDWQSAQQCMAISGCDAVMIGR 225 (312) T ss_pred HH---CCCCEEEECCCCCHHHHHHHHHHHCCCEEEECH T ss_conf 74---899899707959999999998714899999658 No 319 >cd04722 TIM_phosphate_binding TIM barrel proteins share a structurally conserved phosphate binding motif and in general share an eight beta/alpha closed barrel structure. Specific for this family is the conserved phosphate binding site at the edges of strands 7 and 8. The phosphate comes either from the substrate, as in the case of inosine monophosphate dehydrogenase (IMPDH), or from ribulose-5-phosphate 3-epimerase (RPE) or from cofactors, like FMN. Probab=96.86 E-value=0.049 Score=34.03 Aligned_cols=65 Identities=23% Similarity=0.239 Sum_probs=45.6 Q ss_pred HHHHHHHCCCCEEEEECCCC--H----HHHHHHHHHHHHHHCCCCCEE-CCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 89998731366699824841--1----388999999999857987245-146469999999985487589961 Q gi|254780889|r 237 RVGPLFDVNVDLVVVDTAHG--H----SQKVLDAVVQIKKNFPSLLVM-AGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 237 Ra~~LveaGvDviviD~ahG--h----~~~~~~~i~~~k~~~~~~~ii-~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) .+..+.+.|+|.+.++..-+ | .......+..+++.. ++||+ .|.+.|++.+...+..|||++-|| T Consensus 128 ~~~~a~~~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ipvi~~gGi~~~~~~~~~~~~gAdgv~vG 199 (200) T cd04722 128 AAAAAEEAGVDEVGLGNGGGGGGGRDAVPIADLLLILAKRGS-KVPVIAGGGINDPEDAAEALALGADGVIVG 199 (200) T ss_pred HHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHHHHHHC-CCCEEEECCCCCHHHHHHHHHCCCCEEEEC T ss_conf 999999809979997087467888766611689999999857-999899758799999999998599889818 No 320 >COG2239 MgtE Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism] Probab=96.86 E-value=0.001 Score=46.09 Aligned_cols=61 Identities=23% Similarity=0.480 Sum_probs=53.1 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHC-----CCCEEEEECCCCCEEEEEEECCCHHCCC Q ss_conf 446724400267863257303589999999865-----5330645747997868898021000101 Q gi|254780889|r 150 NAQQAVGELMTRNLITVKKTVNLENAKALLHQH-----RIEKLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 150 ~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~-----ki~~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) -.+..+..+|+.+.++++++.+..++...+.+. .+..+.|||++++|+|+++.++++.... T Consensus 129 y~e~taG~~Mt~e~v~l~~~~Tv~~al~~ir~~~~~~e~~~~lyVvD~~~~L~Gvvsl~~Ll~a~~ 194 (451) T COG2239 129 YPEDTAGRIMTTEFVTLPEDVTVDEALDRIRERAEDAETIYYLYVVDEKGKLLGVVSLRDLLTAEP 194 (451) T ss_pred CCHHHHHCCCEEEEEEECCCCCHHHHHHHHHHHCCCCCCEEEEEEECCCCCEEEEEEHHHHHCCCC T ss_conf 983442000130258720675899999999972367566237999877664677756999864895 No 321 >PRK13125 trpA tryptophan synthase subunit alpha; Provisional Probab=96.83 E-value=0.036 Score=34.97 Aligned_cols=127 Identities=20% Similarity=0.210 Sum_probs=83.3 Q ss_pred HHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHHH-CCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCH Q ss_conf 789998731366-699824841138899999999985-798724514646999999998548758996102787665310 Q gi|254780889|r 236 DRVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKKN-FPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRV 313 (493) Q Consensus 236 eRa~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~~-~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~ 313 (493) +-+..+.++|+| +|+.|...-+.+..-+..+..|+. +.-+++++.+ .+.+-.+.+.+..-..+=++ .+- T Consensus 95 ~F~~~~~~~GvdGvIipDLP~e~~ee~~~~~~~~~~~gl~~I~lvsPt-t~~~ri~~i~~~s~gFvY~~--------~~g 165 (247) T PRK13125 95 DLLNTAKEVGARGVLFPDLLIDFPDELEKYVELIRRYGLAPVFFTSPK-FPDRLIRRLSKLSPLFIYLG--------LYP 165 (247) T ss_pred HHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEECCC-CCHHHHHHHHHHCCCEEEEE--------EEC T ss_conf 999999985997588338887546789999999997698469995799-81999999998689779999--------443 Q ss_pred HCCCCCCCCHHHHHHHHHHHC-C-CCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 001574310037889998622-6-98199726543777621336714878993230101216 Q gi|254780889|r 314 VTGVGCPQLSAIMSVVEVAER-A-GVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 314 ~~g~g~pq~~av~~~~~~~~~-~-~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) +||.-.|.- +.+.-+..++ . ++||..-=||+++.|+.+++..|||-|-+||.|-..-| T Consensus 166 vTG~~~~~~--~~~~i~~ik~~~~~~Pv~vGFGI~t~e~v~~~~~~~aDGvIVGSaiVk~i~ 225 (247) T PRK13125 166 ATGVELPVY--VERNIKRVRELVGDVYLVAGFAIDSPEDAAKALSAGADGVVVGTAFIRRLE 225 (247) T ss_pred CCCCCCCCC--HHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHCCCCEEEECHHHHHHHH T ss_conf 678877325--999999999856999858832879999999998558999998789999999 No 322 >PRK11815 tRNA-dihydrouridine synthase A; Provisional Probab=96.81 E-value=0.0088 Score=39.37 Aligned_cols=196 Identities=17% Similarity=0.219 Sum_probs=89.0 Q ss_pred EECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCC-CHHHHHH--HHHHHHHHCCCEECCCEEEE----CCCCHHHHHH Q ss_conf 600317833785236888999999779879997898-9899999--99987422388152878994----6543034565 Q gi|254780889|r 44 TLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNF-SPSEQVA--QVHQVKKFESGMVVNPVTIS----PYATLADALA 116 (493) Q Consensus 44 ~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~-sie~qa~--~V~kVKr~e~gmI~dPVti~----pd~TI~ea~~ 116 (493) .++-++.-|||--||....=.-+-+.|.=+++---| +.+.-.. .-+..+.. --..|+.+. .-..+.+|.+ T Consensus 8 ~~~~~~~lAPM~gvTD~~fR~l~R~~~~~~l~yTEMvsa~al~~~~~~~~l~~~---~~E~Pv~vQl~G~dp~~la~Aa~ 84 (333) T PRK11815 8 ILPRRFSVAPMMDWTDRHCRYFHRLLSRHALLYTEMVTTGAIIHGDRERLLAFD---PEEHPVALQLGGSDPADLAEAAK 84 (333) T ss_pred CCCCCEEEECCCCCCCHHHHHHHHHHCCCCEEEECCEEEHHHHCCCHHHHHHCC---CCCCCEEEEECCCCHHHHHHHHH T ss_conf 168855863779980799999999977997798699851466617988885069---87798799974799999999999 Q ss_pred HHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCC--CCCCCCCCCHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 6676050416786688602788723545001434467244002678--63257303589999999865533064574799 Q gi|254780889|r 117 LMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRN--LITVKKTVNLENAKALLHQHRIEKLLVVDDDG 194 (493) Q Consensus 117 lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~--litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g 194 (493) +..++++..+ |=+ ...|+..+.... -.-.....-+.+..+.|.+. -.+||. T Consensus 85 i~~~~g~d~I---DlN------------------~GCP~~kV~~g~~Ga~Lm~~p~~v~~iv~a~~~a--~~~PVT---- 137 (333) T PRK11815 85 LAEDWGYDEI---NLN------------------VGCPSDRVQNGRFGACLMAEPELVADCVKAMKDA--VSIPVT---- 137 (333) T ss_pred HHHHCCCCEE---CCC------------------CCCCHHHHHCCCEEHHHHCCHHHHHHHHHHHHHH--CCCCEE---- T ss_conf 9987398853---523------------------8998688732780178707999999999999873--488535---- Q ss_pred CEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCC-----HHHHHHHHHHHCCCCEEEEECCCCHHH-------H-- Q ss_conf 78688980210001017422326568765556651370-----156789998731366699824841138-------8-- Q gi|254780889|r 195 CCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAK-----DIADRVGPLFDVNVDLVVVDTAHGHSQ-------K-- 260 (493) Q Consensus 195 ~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~-----d~~eRa~~LveaGvDviviD~ahGh~~-------~-- 260 (493) + |. |. |..+ ...+-++.+.++|++.|.| ||-.. . T Consensus 138 ----v-------K~--------------Rl----G~d~~d~~~~l~~f~~~~~~aG~~~i~v---H~R~a~l~Glspk~n 185 (333) T PRK11815 138 ----V-------KH--------------RI----GIDDQDSYEFLCDFVDTVAEAGCDRFIV---HARKAWLKGLSPKEN 185 (333) T ss_pred ----E-------EE--------------EC----CCCCCCHHHHHHHHHHHHHHCCCCEEEE---EEHHHHHCCCCHHHH T ss_conf ----7-------86--------------31----6777752899999999999759988999---602787726787775 Q ss_pred ------HHHHHHHHHHHCCCCCEEC-CCCCCHHHHHHHHHCCCCEEEEE Q ss_conf ------9999999998579872451-46469999999985487589961 Q gi|254780889|r 261 ------VLDAVVQIKKNFPSLLVMA-GNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 261 ------~~~~i~~~k~~~~~~~ii~-GNv~t~~~~~~l~~~Gad~ikvG 302 (493) --+.+.++|+.+|+++||+ |+|-|.+.+...++. +|+|-+| T Consensus 186 R~ippl~~~~v~~lk~~~p~ipvi~NGdI~s~~~~~~~l~~-~DGVMiG 233 (333) T PRK11815 186 REIPPLDYDRVYRLKRDFPHLTIEINGGIKTLEEAKEHLQH-VDGVMIG 233 (333) T ss_pred CCCCCHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHC-CCEEEEH T ss_conf 05873048999999976678718845996999999999855-9962114 No 323 >PRK00043 thiE thiamine-phosphate pyrophosphorylase; Reviewed Probab=96.81 E-value=0.01 Score=38.91 Aligned_cols=100 Identities=26% Similarity=0.267 Sum_probs=61.7 Q ss_pred HHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECC Q ss_conf 99999857987245146469999999985487589961027876653100015743100378899986226981997265 Q gi|254780889|r 265 VVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGG 344 (493) Q Consensus 265 i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGG 344 (493) ++.+|+.++.-.+++-.+.+.+.+......|+|.+-+| ++.-|.-..+..-|+- +....+..+...+||+|=|| T Consensus 93 ~~~~r~~l~~~~iiG~S~h~~~e~~~A~~~gaDYi~~G----pvf~T~tK~~~~~~~g--~~~l~~~~~~~~iPvvAIGG 166 (210) T PRK00043 93 VADARAILGPDAIIGVSTHTLEEAAAAAAAGADYVGVG----PIFPTPTKKDAKPAVG--LELLREAREAIDIPIVAIGG 166 (210) T ss_pred HHHHHHHCCCCCEEEEECCCHHHHHHHHHHCCCEEEEC----CCCCCCCCCCCCCCCC--HHHHHHHHHHCCCCEEEECC T ss_conf 99999751988789984799999999988289838874----5214798888877789--99999999847999899808 Q ss_pred CCCHHHHEEEEEECCCEEEECCHHHCC Q ss_conf 437776213367148789932301012 Q gi|254780889|r 345 IRFSGDIAKAIAAGSACVMIGSLLAGT 371 (493) Q Consensus 345 i~~~gdi~kAla~GA~~VM~G~~~agt 371 (493) | +.-++...+.+||+.+-+-|-+-+. T Consensus 167 I-~~~ni~~~~~~Ga~giAvis~I~~a 192 (210) T PRK00043 167 I-TPENAAEVLEAGADGVAVVSAITAA 192 (210) T ss_pred C-CHHHHHHHHHCCCCEEEEEHHHHCC T ss_conf 8-9999999998099999970897769 No 324 >cd00429 RPE Ribulose-5-phosphate 3-epimerase (RPE). This enzyme catalyses the interconversion of D-ribulose 5-phosphate (Ru5P) into D-xylulose 5-phosphate, as part of the Calvin cycle (reductive pentose phosphate pathway) in chloroplasts and in the oxidative pentose phosphate pathway. In the Calvin cycle Ru5P is phosphorylated by phosphoribulose kinase to ribulose-1,5-bisphosphate, which in turn is used by RubisCO (ribulose-1,5-bisphosphate carboxylase/oxygenase) to incorporate CO2 as the central step in carbohydrate synthesis. Probab=96.80 E-value=0.055 Score=33.69 Aligned_cols=124 Identities=23% Similarity=0.328 Sum_probs=76.9 Q ss_pred HHHHHHHHHCCCCEEEEECCCCHH-HHHHHHHHHHHHHCCCCCEECCCCCC-HHHHHHHHHCCCCEEEE-EECCCCCCCC Q ss_conf 678999873136669982484113-88999999999857987245146469-99999998548758996-1027876653 Q gi|254780889|r 235 ADRVGPLFDVNVDLVVVDTAHGHS-QKVLDAVVQIKKNFPSLLVMAGNIAT-AEGALALIDAGADIIKV-GIGPGSICTT 311 (493) Q Consensus 235 ~eRa~~LveaGvDviviD~ahGh~-~~~~~~i~~~k~~~~~~~ii~GNv~t-~~~~~~l~~~Gad~ikv-Gig~Gs~CtT 311 (493) ..-+..++++|+|.|++ |--+ ....+.++++|+.--..- +|=|-.| .+....+++. +|.|-+ ++-||- T Consensus 70 ~~~i~~~~~~g~d~I~~---H~E~~~~~~~~i~~ik~~g~~~G-lal~p~T~~~~l~~~l~~-~D~vliMtV~PGf---- 140 (211) T cd00429 70 ERYIEAFAKAGADIITF---HAEATDHLHRTIQLIKELGMKAG-VALNPGTPVEVLEPYLDE-VDLVLVMSVNPGF---- 140 (211) T ss_pred HHHHHHHHHHCCCEEEE---CCCCCCCHHHHHHHHHHCCCCCE-EEECCCCCHHHHHHHHHH-HCEEEEEEECCCC---- T ss_conf 77699999709988998---64322089999999997398723-575489998999999975-1522798746887---- Q ss_pred CHHCCCCCCCCH----HHHHHHHHHH--CCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 100015743100----3788999862--2698199726543777621336714878993230101216 Q gi|254780889|r 312 RVVTGVGCPQLS----AIMSVVEVAE--RAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 312 r~~~g~g~pq~~----av~~~~~~~~--~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) . |-+... -+.+.++... .++..|..||||+. ..+.+...+|||.+-+||.+-+.+. T Consensus 141 ---~--GQ~f~~~~~~ki~~l~~~~~~~~~~~~I~vDGGI~~-~~i~~l~~~Gad~~V~GS~iF~~~d 202 (211) T cd00429 141 ---G--GQKFIPEVLEKIRKLRELIPENNLNLLIEVDGGINL-ETIPLLAEAGADVLVAGSALFGSDD 202 (211) T ss_pred ---C--CCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCH-HHHHHHHHCCCCEEEECHHHHCCCC T ss_conf ---8--875456799999999999986499859999678598-9999999859999997937758999 No 325 >PRK08745 ribulose-phosphate 3-epimerase; Provisional Probab=96.80 E-value=0.055 Score=33.69 Aligned_cols=122 Identities=19% Similarity=0.233 Sum_probs=74.1 Q ss_pred HHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCH-HHHHHHHHCCCCEE-EEEECCCCCCCCCH Q ss_conf 78999873136669982484113889999999998579872451464699-99999985487589-96102787665310 Q gi|254780889|r 236 DRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATA-EGALALIDAGADII-KVGIGPGSICTTRV 313 (493) Q Consensus 236 eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~-~~~~~l~~~Gad~i-kvGig~Gs~CtTr~ 313 (493) .-++.++++|+|.|.+-.-. +....+.++++|+.-- ..-+|=|-.|+ +..+.+++. .|.| ..++-||- T Consensus 76 ~~i~~~~~aGad~i~~H~Ea--~~~~~~~i~~ik~~g~-k~GlalnP~T~~~~l~~~l~~-~D~VliMtV~PGf------ 145 (223) T PRK08745 76 RIVPDFADAGATTISFHPEA--SRHVHRTIQLIKSHGC-QAGLVLNPATPVDILDWVLPE-LDLVLVMSVNPGF------ 145 (223) T ss_pred HHHHHHHHCCCCEEEEEECC--CCCHHHHHHHHHHCCC-CEEEEECCCCCHHHHHHHHHH-CCEEEEEEECCCC------ T ss_conf 99999997399789996064--4299999999998398-446774699987999998864-7989998756998------ Q ss_pred HCCCCCCC----CHHHHHHHHHHHC--CCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCC Q ss_conf 00157431----0037889998622--6981997265437776213367148789932301012 Q gi|254780889|r 314 VTGVGCPQ----LSAIMSVVEVAER--AGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGT 371 (493) Q Consensus 314 ~~g~g~pq----~~av~~~~~~~~~--~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt 371 (493) . |-+. +.-|.+.++...+ .+..|..||||+. ..+.+...+|||.+-.||.+=+. T Consensus 146 -~--GQ~f~~~~l~KI~~l~~~~~~~~~~~~I~VDGGI~~-~ti~~l~~aGad~~V~GSaiF~~ 205 (223) T PRK08745 146 -G--GQAFIPSALDKLRAIRKKIDALGKPIRLEIDGGVKA-DNIGAIAAAGADTFVAGSAIFNA 205 (223) T ss_pred -C--CCCCCHHHHHHHHHHHHHHHHCCCCCEEEEECCCCH-HHHHHHHHCCCCEEEECHHHHCC T ss_conf -8--754568899999999999986499945999788798-99999998699999974177579 No 326 >PRK13125 trpA tryptophan synthase subunit alpha; Provisional Probab=96.78 E-value=0.056 Score=33.63 Aligned_cols=68 Identities=22% Similarity=0.303 Sum_probs=45.8 Q ss_pred HHHHHHHHHCCCCEEEEE--CCCC--HHHHHHHHHHHHHHHCCCCCEECC-CCCCHHHHHHHHHCCCCEEEEE Q ss_conf 678999873136669982--4841--138899999999985798724514-6469999999985487589961 Q gi|254780889|r 235 ADRVGPLFDVNVDLVVVD--TAHG--HSQKVLDAVVQIKKNFPSLLVMAG-NIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 235 ~eRa~~LveaGvDviviD--~ahG--h~~~~~~~i~~~k~~~~~~~ii~G-Nv~t~~~~~~l~~~Gad~ikvG 302 (493) .+|.+.+.+..-.++-+- -.-| ......+.++++|+..+++||.+| .|.|++.++.+.++|||++-|| T Consensus 145 ~~ri~~i~~~s~gFvY~~~~gvTG~~~~~~~~~~i~~ik~~~~~~Pv~vGFGI~t~e~v~~~~~~~aDGvIVG 217 (247) T PRK13125 145 DRLIRRLSKLSPLFIYLGLYPATGVELPVYVERNIKRVRELVGDVYLVAGFAIDSPEDAAKALSAGADGVVVG 217 (247) T ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHCCCCEEEEC T ss_conf 9999999986897799994436788773259999999998569998588328799999999985589999987 No 327 >PRK13131 consensus Probab=96.78 E-value=0.021 Score=36.73 Aligned_cols=127 Identities=17% Similarity=0.178 Sum_probs=82.9 Q ss_pred HHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHHH-CCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCH Q ss_conf 789998731366-699824841138899999999985-798724514646999999998548758996102787665310 Q gi|254780889|r 236 DRVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKKN-FPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRV 313 (493) Q Consensus 236 eRa~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~~-~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~ 313 (493) .-+..+.++|+| +|+.|...-.+.. ..+..++. ..-+++++.+ .+.+-.+.+.+..-..|=+---.| T Consensus 106 ~F~~~~~~~GvdGvIipDLP~eE~~~---~~~~~~~~~l~~I~lvaPt-t~~~Ri~~i~~~s~GFiY~vs~~G------- 174 (257) T PRK13131 106 GFYAQAKECGVDSVLIADMPLIEKEL---VIKSAQKHQIKQIFIASPN-ASVKDLEQVATHSQGYIYTLARSG------- 174 (257) T ss_pred HHHHHHHHCCCCCEECCCCCHHHHHH---HHHHHHHCCCCEEEEECCC-CCHHHHHHHHHCCCCEEEEEECCC------- T ss_conf 99999986599856558999678899---9999997798479972899-988999999835897499984576------- Q ss_pred HCCCCCCCCHHHHHHH-HHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 0015743100378899-98622698199726543777621336714878993230101216 Q gi|254780889|r 314 VTGVGCPQLSAIMSVV-EVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 314 ~~g~g~pq~~av~~~~-~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) +||.--..-..+.+.- +..+..++||..-=||+++-|+.++...|||.|-+||.|-..-| T Consensus 175 vTG~~~~~~~~~~~~i~~ik~~t~~Pv~vGFGIs~~e~v~~~~~~gaDGvIVGSaiV~~I~ 235 (257) T PRK13131 175 VTGASHTLENDASAIIKTLKTFSPTPALLGFGISKKEHITNAKGMGADGVICGSALVKIIE 235 (257) T ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHCCCCEEEECHHHHHHHH T ss_conf 7798643407699999999966899879980579889999998559999998789999998 No 328 >PRK05581 ribulose-phosphate 3-epimerase; Validated Probab=96.77 E-value=0.057 Score=33.56 Aligned_cols=125 Identities=21% Similarity=0.256 Sum_probs=76.1 Q ss_pred HHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCC-HHHHHHHHHCCCCEE-EEEECCCCCCCCCH Q ss_conf 7899987313666998248411388999999999857987245146469-999999985487589-96102787665310 Q gi|254780889|r 236 DRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIAT-AEGALALIDAGADII-KVGIGPGSICTTRV 313 (493) Q Consensus 236 eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t-~~~~~~l~~~Gad~i-kvGig~Gs~CtTr~ 313 (493) .-...++++|+|.|++-.- ......+.++++|+.--..- +|=|-.| .+..+.+++. +|.| -..+-||- T Consensus 75 ~~i~~~~~~g~d~I~~H~E--a~~~~~~~i~~ik~~g~k~G-lalnp~T~~~~l~~~l~~-iD~VlvMtV~PGf------ 144 (220) T PRK05581 75 RYVPDFAKAGADIITFHVE--ASEHIHRLLQLIKEAGIKAG-LVLNPATPLEYLEYVLPL-LDLVLLMSVNPGF------ 144 (220) T ss_pred HHHHHHHHCCCCEEEECCC--CCCCHHHHHHHHHHCCCCEE-EEECCCCCHHHHHHHHHH-HCEEEEEEECCCC------ T ss_conf 8799999739988998167--50279999999997499704-676699998999999874-1525899865887------ Q ss_pred HCCCC--CCCCHHHHHHHHHHHC--CCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCC Q ss_conf 00157--4310037889998622--69819972654377762133671487899323010121 Q gi|254780889|r 314 VTGVG--CPQLSAIMSVVEVAER--AGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTD 372 (493) Q Consensus 314 ~~g~g--~pq~~av~~~~~~~~~--~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~ 372 (493) .|-. -..+.-+.+.++...+ ++..|..||||+.. .+.+...+|||.+-+||.+-+.. T Consensus 145 -~GQ~f~~~~l~ki~~l~~~~~~~~~~~~I~VDGGIn~~-~i~~l~~~Gad~~V~GS~iF~~~ 205 (220) T PRK05581 145 -GGQKFIPEVLEKIREVRKLIDERGLDILIEVDGGVNAE-NIKECAEAGADVFVAGSAVFGAP 205 (220) T ss_pred -CCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCHH-HHHHHHHCCCCEEEECHHHHCCC T ss_conf -87645566999999999999845997559997898989-99999977999999794885799 No 329 >pfam00834 Ribul_P_3_epim Ribulose-phosphate 3 epimerase family. This enzyme catalyses the conversion of D-ribulose 5-phosphate into D-xylulose 5-phosphate. Probab=96.77 E-value=0.057 Score=33.56 Aligned_cols=120 Identities=21% Similarity=0.309 Sum_probs=74.0 Q ss_pred HHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCC-HHHHHHHHHCCCCEE-EEEECCCCCCCCCH Q ss_conf 7899987313666998248411388999999999857987245146469-999999985487589-96102787665310 Q gi|254780889|r 236 DRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIAT-AEGALALIDAGADII-KVGIGPGSICTTRV 313 (493) Q Consensus 236 eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t-~~~~~~l~~~Gad~i-kvGig~Gs~CtTr~ 313 (493) .-+..++++|+|.+.+-.- ......+.++++|+.--. .=+|=|-.| .+..+.+++. .|.| -.++-||- T Consensus 71 ~~i~~~~~~g~d~i~~H~E--~~~~~~~~i~~ik~~g~k-~GlAlnP~T~~~~l~~~l~~-iD~VLvMtV~PGf------ 140 (201) T pfam00834 71 RIIPDFAEAGADIISFHAE--ASDHPHRTIQLIKEAGAK-AGLVLNPATPLDAIEYLLDD-LDLVLLMSVNPGF------ 140 (201) T ss_pred HHHHHHHHCCCCEEEECHH--HHHCHHHHHHHHHHCCCE-EEEEECCCCCCHHHHHHHHH-CCEEEEEEECCCC------ T ss_conf 6399998739988997544--413799999999864972-68885699860288876742-7989998866898------ Q ss_pred HCCCCCC----CCHHHHHHHHHHHC--CCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHH Q ss_conf 0015743----10037889998622--69819972654377762133671487899323010 Q gi|254780889|r 314 VTGVGCP----QLSAIMSVVEVAER--AGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLA 369 (493) Q Consensus 314 ~~g~g~p----q~~av~~~~~~~~~--~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~a 369 (493) . |-+ .+.-|.+.++...+ ++..|..||||+.. .+.+...+|||.+-+||.+= T Consensus 141 -~--GQ~f~~~~l~KI~~lr~~~~~~~~~~~I~vDGGIn~~-ti~~l~~~Gad~~V~GSaiF 198 (201) T pfam00834 141 -G--GQSFIPSVLPKIRKVRKMIDEGGLDTLIEVDGGVNLD-NIPQIAEAGADVLVAGSAVF 198 (201) T ss_pred -C--CCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCHH-HHHHHHHCCCCEEEECCEEC T ss_conf -8--7645677999999999999826998079998988899-99999987999999780024 No 330 >PRK08104 consensus Probab=96.72 E-value=0.025 Score=36.09 Aligned_cols=123 Identities=19% Similarity=0.243 Sum_probs=79.8 Q ss_pred CEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEE Q ss_conf 76555665137015678999873136669982484113889999999998579872451464699999999854875899 Q gi|254780889|r 221 RLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIK 300 (493) Q Consensus 221 rL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ik 300 (493) .+.+||.-=. +.|-++..+++|++++|- -|.+..+++..+.. +++.+ ..|.|+.-....+++|+|.+| T Consensus 66 ~~~vGaGTV~---~~e~~~~ai~aGA~FiVS---P~~~~~v~~~a~~~-----~i~~i-PGv~TpsEi~~A~~~G~~~vK 133 (212) T PRK08104 66 EAIVGAGTVL---NPQQLAEVTEAGAQFAIS---PGLTEELLKAATEG-----TIPLI-PGISTVSELMLGMDYGLTEFK 133 (212) T ss_pred CCEEEEEECC---CHHHHHHHHHCCCCEEEC---CCCCHHHHHHHHHC-----CCCEE-CCCCCHHHHHHHHHCCCCEEE T ss_conf 8568542026---799999999859999984---89999999999982-----99765-676999999999987999799 Q ss_pred EEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHC Q ss_conf 6102787665310001574310037889998622698199726543777621336714878993230101 Q gi|254780889|r 301 VGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAG 370 (493) Q Consensus 301 vGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~ag 370 (493) += |.+.+ -|...+.++.. . -.++++++-|||. .-++..-|++|+-.+..||+++- T Consensus 134 lF--PA~~~-------gG~~~lkal~~---p--~p~~~f~ptGGV~-~~N~~~yl~~~~v~~vgGS~l~~ 188 (212) T PRK08104 134 FF--PAEAN-------GGVKALQAISG---P--FSQIRFCPTGGIT-PANYRDYLALKSVLCIGGSWLVP 188 (212) T ss_pred EC--CCCCC-------CCHHHHHHHHC---C--CCCCEEEECCCCC-HHHHHHHHHCCCEEEEECHHHCC T ss_conf 78--76213-------74999999855---5--8998189648989-88999998079879998835389 No 331 >pfam04481 DUF561 Protein of unknown function (DUF561). Protein of unknown function found in a cyanobacterium, and the chloroplasts of algae. Probab=96.72 E-value=0.016 Score=37.52 Aligned_cols=133 Identities=18% Similarity=0.244 Sum_probs=91.0 Q ss_pred HHHHHHHHHCCCCEEEE---EC--CCC---HHHHHHHHHHHHHHHCCCCCEEC--CCCCCH----HHHHHHHHCCCCEEE Q ss_conf 67899987313666998---24--841---13889999999998579872451--464699----999999854875899 Q gi|254780889|r 235 ADRVGPLFDVNVDLVVV---DT--AHG---HSQKVLDAVVQIKKNFPSLLVMA--GNIATA----EGALALIDAGADIIK 300 (493) Q Consensus 235 ~eRa~~LveaGvDvivi---D~--ahG---h~~~~~~~i~~~k~~~~~~~ii~--GNv~t~----~~~~~l~~~Gad~ik 300 (493) .+.....++||+|++=| |+ ++| ..+.+++..++.|+.+|++++-+ .-+-.. +-+.+|.++|+|.|. T Consensus 71 p~~f~~aV~AGA~lvEIGNfDsFY~qGr~f~a~eVL~Lt~~Tr~LLP~~~LsVTVPHiL~ld~Qv~LA~~L~~~GaDiIQ 150 (243) T pfam04481 71 PELLYEAVLAGADLVEIGNFDSFYKQGRVLSVCEIIALVKETRKLLPHTPLCVTIPHILKLKEQINLAKQLESLGIDLIQ 150 (243) T ss_pred HHHHHHHHHHCCCEEEECCHHHHHHCCCEECHHHHHHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHHCCCEEE T ss_conf 78889999827878986453647654766449999999999997689984477457635678999999999981887787 Q ss_pred EEECCCCCCCCCHHCC---CCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCC Q ss_conf 6102787665310001---5743100378899986226981997265437776213367148789932301012 Q gi|254780889|r 301 VGIGPGSICTTRVVTG---VGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGT 371 (493) Q Consensus 301 vGig~Gs~CtTr~~~g---~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt 371 (493) --=|.-|.-+..-+.| --.|.+.+.++..+ ...+|||.--|+... -.--|+++||+.|=+||....- T Consensus 151 TEGgtss~p~~~g~~glIekaapTLAaay~IS~---~v~vPVlcASGlS~v-T~PmAiaaGAsGVGVGSavn~L 220 (243) T pfam04481 151 TEGKITSISKNHCVNDLIEKSASTLASTYEISK---HVQLPVICASGLSDV-TVPLAFSYGASGIGIGSAVSKL 220 (243) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH---CCCCCEEECCCCCHH-HHHHHHHCCCCCCCHHHHHHHC T ss_conf 289877788884257779887588999999986---178766754676421-4788997487710065776500 No 332 >TIGR00393 kpsF sugar isomerase, KpsF/GutQ family; InterPro: IPR004800 This is a family of closely related proteins with the phosphosugar-binding domain SIS (Sugar ISomerase) followed by two copies of the CBS (named after Cystathionine Beta Synthase) domain. The group includes GutQ, a protein of the glucitol operon and KpsF, a virulence factor involved in capsular polysialic acid biosynthesis in some pathogenic strains of Escherichia coli.; GO: 0016853 isomerase activity, 0005975 carbohydrate metabolic process. Probab=96.71 E-value=0.0017 Score=44.43 Aligned_cols=156 Identities=24% Similarity=0.302 Sum_probs=106.2 Q ss_pred CCCHHEEECCCCCCCCCCCEEEEEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHH Q ss_conf 46000575276432475422533430388160031783378523688899999977987999789898999999998742 Q gi|254780889|r 14 LTFDDVLLRPEFSNVLPRDIDISTRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQVKK 93 (493) Q Consensus 14 lTfDDVllvP~~s~v~~~~vdl~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~kVKr 93 (493) ++=.||.|.=+||-=..+--.|=.+| |+ +.+||+ ++|+..- -.|||.+-.- =..++|++| T Consensus 45 v~~~Dvvl~ISYSGEs~e~~~Lip~L-k~--~~~~li-----a~T~~p~-S~La~~Ad~~---L~i~v~kEa-------- 104 (272) T TIGR00393 45 VEPRDVVLLISYSGESKELDKLIPSL-KR--LSVKLI-----AVTGKPN-SSLARAADYV---LDIKVEKEA-------- 104 (272) T ss_pred CCCCCEEEEEECCCCHHHHHHHHHHH-HH--CCCEEE-----EECCCCC-CCHHHCCCEE---EEEEEEEEC-------- T ss_conf 58674799987688757787753577-75--597089-----9728889-7245306545---766775302-------- Q ss_pred HCCCEECCCEEEECCCC------HHHH--HHHHHHHCCCCEEEECCCC--EEEEEEEEHHHHCCCCCCCCCCCCCCCCC- Q ss_conf 23881528789946543------0345--6566760504167866886--02788723545001434467244002678- Q gi|254780889|r 94 FESGMVVNPVTISPYAT------LADA--LALMKKYSISGIPVVESDV--GKLVGILTNRDVRFASNAQQAVGELMTRN- 162 (493) Q Consensus 94 ~e~gmI~dPVti~pd~T------I~ea--~~lm~~~~is~iPVVD~~~--gkLiGIVT~rDir~~~~~~~~V~eiMt~~- 162 (493) .|+-+-|.+| +.|| ..+|+-++|+. +|. -++-|-+=.| +=.+|+++|.+. T Consensus 105 -------CP~~lAPTsSt~~TL~lGDaLa~al~~ArnF~~-----eDFA~~HPGG~LG~k-------LL~kV~dlm~t~d 165 (272) T TIGR00393 105 -------CPLNLAPTSSTTVTLALGDALAVALMKARNFSE-----EDFAKFHPGGALGRK-------LLVKVKDLMQTDD 165 (272) T ss_pred -------CCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCH-----HHHHCCCCCHHHHHH-------HHHHHHHHHCCCC T ss_conf -------789897356899999988999999985258884-----244114870466678-------8887677640665 Q ss_pred -CCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHC Q ss_conf -6325730358999999986553306457479978688980210001 Q gi|254780889|r 163 -LITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERS 208 (493) Q Consensus 163 -litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~ 208 (493) +..+.+.+++.||+-.|.+.+.....|+|++.+|.||+|=-||-|. T Consensus 166 ~lP~v~~tas~~DAL~e~~~~~LG~~~v~~~~~~~~Gv~tDGD~RR~ 212 (272) T TIGR00393 166 ELPLVAPTASFKDALLEMSRKRLGLAVVCDENEQLKGVFTDGDLRRV 212 (272) T ss_pred CCCCEECCCCCEEEEEEEECCCCCEEEEEECCCCEEEEEECCHHHHH T ss_conf 67822236772021023313786179997155524678714659999 No 333 >PRK13127 consensus Probab=96.70 E-value=0.026 Score=36.00 Aligned_cols=170 Identities=16% Similarity=0.179 Sum_probs=100.7 Q ss_pred CCCCCCCCCHHHHHHHHHHCCC-CEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHH Q ss_conf 6325730358999999986553-306457479978688980210001017422326568765556651370156789998 Q gi|254780889|r 163 LITVKKTVNLENAKALLHQHRI-EKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPL 241 (493) Q Consensus 163 litv~~~~~l~eA~~iM~~~ki-~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~L 241 (493) ...+..+.++.+.++++.+-|. ...|+| +.|....+-+ ....+-+..+ T Consensus 63 ~rAL~~G~~~~~~~~~~~~~r~~~~~piv--------lM~Y~N~i~~-----------------------~G~e~F~~~~ 111 (262) T PRK13127 63 VRALSAGMKIDKYFELVKELRVDSSVPLV--------LMTYYNPVYR-----------------------YGVEKFVKKA 111 (262) T ss_pred HHHHHCCCCHHHHHHHHHHHHCCCCCCEE--------EEECCHHHHH-----------------------CCHHHHHHHH T ss_conf 99997699799999999997456998879--------9966138876-----------------------0899999999 Q ss_pred HHCCCC-EEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCC Q ss_conf 731366-6998248411388999999999857987245146469999999985487589961027876653100015743 Q gi|254780889|r 242 FDVNVD-LVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCP 320 (493) Q Consensus 242 veaGvD-viviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~p 320 (493) .++|+| +|+.|...-.+....+..+ +..+.-+++++.+- +.+-.+.+.+..-..|=+= +..-+||..-. T Consensus 112 ~~~GvdGlIipDLP~eE~~~~~~~~~--~~gi~~I~lvaPtt-~~~Ri~~i~~~a~gFiY~v-------s~~GvTG~~~~ 181 (262) T PRK13127 112 AEAGVSGLIIPDLPVEEATDLREACK--KHGLDLVFLVAPTT-PEERLKRIDEASSGFVYLV-------SRLGVTGARED 181 (262) T ss_pred HHCCCCEEEECCCCHHHHHHHHHHHH--HCCCEEEEEECCCC-CHHHHHHHHHCCCCEEEEE-------ECCCCCCCCCC T ss_conf 87599769966999789999999998--55832799858999-8999999984389818998-------43555687655 Q ss_pred CCHHHHH-HHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 1003788-9998622698199726543777621336714878993230101216 Q gi|254780889|r 321 QLSAIMS-VVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 321 q~~av~~-~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) --..+.+ +.+..+..+.||..-=||+++-|+.+....|||.|-+||.|-..-| T Consensus 182 ~~~~~~~~i~~ik~~t~~Pv~vGFGI~~~e~v~~~~~~~aDGvIVGSaiv~~i~ 235 (262) T PRK13127 182 VEEATFDLLKRARTTCKNKIAVGFGISKGEHAEELLDAGADGVIVGSALVDIIA 235 (262) T ss_pred CHHHHHHHHHHHHHCCCCCEEEEEEECCHHHHHHHHHCCCCEEEECHHHHHHHH T ss_conf 528899999999961799848993348899999998649999998789999999 No 334 >TIGR01303 IMP_DH_rel_1 IMP dehydrogenase family protein; InterPro: IPR005991 This family of proteins, often annotated as a putative IMP dehydrogenase, are related to IMP dehydrogenase and GMP reductase and restricted to the high GC Gram-positive bacteria.. Probab=96.70 E-value=0.022 Score=36.47 Aligned_cols=370 Identities=21% Similarity=0.280 Sum_probs=203.8 Q ss_pred EEECCCCCCCCCCCEEEEEEECCCCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEE-CCC-CC----HHHHHHHHHHHH Q ss_conf 575276432475422533430388160031783378523688899999977987999-789-89----899999999874 Q gi|254780889|r 19 VLLRPEFSNVLPRDIDISTRIAKDFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVI-HRN-FS----PSEQVAQVHQVK 92 (493) Q Consensus 19 VllvP~~s~v~~~~vdl~t~lt~~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvI-Hrn-~s----ie~qa~~V~kVK 92 (493) +.++||-..+.--.--++.--++..-|..||+-+|-+||....- .---|.-+.+|+ ..+ .+ -+...+-+.+.. T Consensus 69 ~~~~Pqd~P~~~~~~~~~~vk~~~~vldtP~~~~P~~~v~~a~~-l~~krah~~~~v~~~~~~P~G~v~~~~~~G~d~ft 147 (476) T TIGR01303 69 LVILPQDVPIEVVKETVAFVKSRDLVLDTPITLAPHDTVADALA-LIHKRAHGAAVVVEEDGKPVGVVTDKDLEGVDRFT 147 (476) T ss_pred EEEECCCCCHHHHHHHHHHHHHCCEEECCCEEECCCHHHHHHHH-HHHHHCCCEEEEEECCCCEEEEEECCCCCCHHHHH T ss_conf 68962667778999999876410202237416546233899999-87531143178981698126788515310004567 Q ss_pred HHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHH-HCCCCCCCCCCCCCCCC--CCCCCCCC Q ss_conf 2238815287899465430345656676050416786688602788723545-00143446724400267--86325730 Q gi|254780889|r 93 KFESGMVVNPVTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRD-VRFASNAQQAVGELMTR--NLITVKKT 169 (493) Q Consensus 93 r~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rD-ir~~~~~~~~V~eiMt~--~litv~~~ 169 (493) +.+..+..+-+++..+..-.++..++.++...-.|+++.+ |+|.||+|..- +|. ...+|.-+---+ =-..+--+ T Consensus 148 ~~~~~~~~~~~~~~~~~~P~~~f~~l~~~~~~~a~~~~~d-G~l~G~l~r~Ga~ra--~~y~Pa~d~~Grlr~~aa~Gin 224 (476) T TIGR01303 148 QVEEVMSTDLVTLPADLEPREAFDLLEEASRKLAPVVDAD-GSLAGILTRTGALRA--TLYTPAVDAAGRLRIGAAVGIN 224 (476) T ss_pred HHHHHHHHHHEEECCCCCHHHHHHHHHHHHHHHHHHHHCC-CCEEEEEECCCCCEE--ECCCCCCCCCCCEEEEEEECCC T ss_conf 8887775432020367888899999874101201111057-725554411430000--0015552567746776652024 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEE Q ss_conf 35899999998655330645747997868898021000101742232656876555665137015678999873136669 Q gi|254780889|r 170 VNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLV 249 (493) Q Consensus 170 ~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDvi 249 (493) -++..-.+.|.+.....|.|=-.+|+-.-++..--.++. +|.+=-+..|--|+. +-.+.|++||+|++ T Consensus 225 Gd~~~~~~~l~~aG~d~lv~dtahGhq~~~~~a~k~~~~-------ld~~~P~~aGn~v~a-----~G~rdl~~aGa~~~ 292 (476) T TIGR01303 225 GDVEGKAKALLDAGVDVLVIDTAHGHQEKMISAVKAVRA-------LDLRVPIVAGNVVSA-----EGVRDLVEAGANII 292 (476) T ss_pred CCHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHH-------HCCCCCEEECCEEEC-----CCHHHHHHCCCCEE T ss_conf 432378999986688589983465406899999999986-------045586542422411-----00488874476189 Q ss_pred EEECCCCH--------------HHHHHHHHHHHHHHCCCCCEEC-CCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHH Q ss_conf 98248411--------------3889999999998579872451-46469999999985487589961027876653100 Q gi|254780889|r 250 VVDTAHGH--------------SQKVLDAVVQIKKNFPSLLVMA-GNIATAEGALALIDAGADIIKVGIGPGSICTTRVV 314 (493) Q Consensus 250 viD~ahGh--------------~~~~~~~i~~~k~~~~~~~ii~-GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~ 314 (493) -+-+--|. -..+++.....|+. .-.||+ |.|-.+.-..--+.+||.-+-|| T Consensus 293 kvGvGPGamCttrm~tGvGrPqfsavleCa~~a~~~--G~h~WadGG~r~PrdvalalaaGasnvm~G------------ 358 (476) T TIGR01303 293 KVGVGPGAMCTTRMMTGVGRPQFSAVLECAAEARKL--GKHVWADGGVRHPRDVALALAAGASNVMVG------------ 358 (476) T ss_pred EECCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHC--CCEEEECCCCCCHHHHHHHHHHCCCCEEEE------------ T ss_conf 864688602344443057871378998988999860--772640688676377777665064302441------------ Q ss_pred CCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCCE----E-EECCEEEEEE Q ss_conf 0157431003788999862269819972654377762133671487899323010121677644----8-7368058887 Q gi|254780889|r 315 TGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGDI----F-LYQGRSFKSY 389 (493) Q Consensus 315 ~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~~----~-~~~g~~~K~y 389 (493) -|--|-...+||+-.-- +--| .+||=|-- + .++.+ ..- T Consensus 359 ------------------------sWf~GtyesPGdl~~~~----~~~~-------ykes~Gmas~rav~~r~~~--~~~ 401 (476) T TIGR01303 359 ------------------------SWFAGTYESPGDLRRDA----DGRA-------YKESFGMASKRAVEARTSK--EEA 401 (476) T ss_pred ------------------------CCCCCCCCCCCCHHHHC----CCCH-------HHHHHHHHHHHHHHHHHCC--CHH T ss_conf ------------------------10035547851011100----4740-------1345314568888642000--106 Q ss_pred ECCCCHHHHHHC-CHHHHCCCCCC----CCCCEECCCCEEEEECCC--CHHHHHHHHHHHHHHHHHHCCCCCH Q ss_conf 332006887522-12221012333----433100454026621057--5899899999999985551575877 Q gi|254780889|r 390 RGMGSVAAMERG-SSARYSQDGVT----DVLKLVPEGIEGRVPYKG--PIASVLHQMSGGLKSSMGYVGASNI 455 (493) Q Consensus 390 rGm~S~~A~~~~-~~~ry~~~~~~----~~~k~v~eGv~~~v~~~G--~~~~~~~~~~~glrs~~~y~G~~~i 455 (493) |--+-..-.+.| |..|.+.+... +....+-.||.....|.| +++++..+-+-|+||+-+|+-.+-+ T Consensus 402 ~~~ar~alfeeGist~r~~~d~~~~Gvedl~d~i~~Gvrs~~ty~Ga~~~~~~~e~a~vG~qs~~GyaeG~P~ 474 (476) T TIGR01303 402 FDRARKALFEEGISTSRMFLDPARGGVEDLIDAIISGVRSSCTYAGADSLEELTERAVVGVQSAAGYAEGKPL 474 (476) T ss_pred HHHHHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHHCHHHHEEECCHHHHHHHHHCEEEEEECCCCCCCCCCC T ss_conf 8899998875164212256505778778999987502123001203356886520114564214653378878 No 335 >pfam01207 Dus Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrouridines are found in the D loop of t-RNAs. The role of dihydrouridine in tRNA is currently unknown, but may increase conformational flexibility of the tRNA. It is likely that different family members have different substrate specificities, which may overlap. Dus 1 from Saccharomyces cerevisiae acts on pre-tRNA-Phe, while Dus 2 acts on pre-tRNA-Tyr and pre-tRNA-Leu. Dus 1 is active as a single subunit, requiring NADPH or NADH, and is stimulated by the presence of FAD. Some family members may be targeted to the mitochondria and even have a role in mitochondria. Probab=96.68 E-value=0.0099 Score=39.01 Aligned_cols=192 Identities=19% Similarity=0.244 Sum_probs=100.8 Q ss_pred EECCCCCCCHHHHHHHHHCCCEEEECCCC-CHHHHHHH----HHHHHHHCCCEECCCEEE----ECCCCHHHHHHHHHHH Q ss_conf 33785236888999999779879997898-98999999----998742238815287899----4654303456566760 Q gi|254780889|r 51 SAAMDQVTDSRLAIAMAQAGGLGVIHRNF-SPSEQVAQ----VHQVKKFESGMVVNPVTI----SPYATLADALALMKKY 121 (493) Q Consensus 51 Ss~MDTVTe~~MAiamA~~GGiGvIHrn~-sie~qa~~----V~kVKr~e~gmI~dPVti----~pd~TI~ea~~lm~~~ 121 (493) =|||-.||...+=.-..+.|+-.+..--| +.+..... .+.....+ -..|+.+ +....+.++.+++.++ T Consensus 2 LAPM~g~td~~fR~l~~~~g~~~l~~TEmv~a~~l~~~~~~~~~~~~~~~---~e~P~~~Ql~G~dp~~~~~aa~~~~~~ 78 (309) T pfam01207 2 LAPMAGVTDLPFRRLVREYGAGDLVVTEMVTAKAQLRPEKQRELMLPELE---EPTPLAVQLGGSDPALLAEAAKLVADL 78 (309) T ss_pred CCCCCCCCCHHHHHHHHHHCCCCEEEECCEEEHHHHCCCHHHHHHCCCCC---CCCCEEEEEECCCHHHHHHHHHHHHHC T ss_conf 27889990799999999979592999798997135438875887420076---789728999369999999999998863 Q ss_pred CCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCC--CCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEE Q ss_conf 504167866886027887235450014344672440026786--325730358999999986553306457479978688 Q gi|254780889|r 122 SISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNL--ITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGL 199 (493) Q Consensus 122 ~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~l--itv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGI 199 (493) ++..+ |=+ ...|+..+..+.. .-.....-+.+..+-|.+. -.+||-= T Consensus 79 g~d~I---DlN------------------~GCP~~~v~~~g~GsaLl~~p~~~~~iv~a~~~~--~~~PVtv-------- 127 (309) T pfam01207 79 GADII---DIN------------------MGCPAKKVTRGGAGAALLRDPDLVAQIVKAVVKA--VDIPVTV-------- 127 (309) T ss_pred CCCEE---EEE------------------CCCCHHHHCCCCCCEEHHHCHHHHHHHHHHHHHH--CCCCEEE-------- T ss_conf 99989---651------------------8999999878997762541778999999999975--5885467-------- Q ss_pred EEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHH-------HHHHHHHHHHC Q ss_conf 98021000101742232656876555665137015678999873136669982484113889-------99999999857 Q gi|254780889|r 200 ITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKV-------LDAVVQIKKNF 272 (493) Q Consensus 200 IT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~-------~~~i~~~k~~~ 272 (493) |.+ .|-- ....+..+-+..|.++|++.|.| ||-+..+ .+.++.+|+.+ T Consensus 128 -------K~R--------------lG~d-~~~~~~~~~~~~l~~~G~~~itv---H~Rt~~q~~~g~a~w~~i~~~k~~~ 182 (309) T pfam01207 128 -------KIR--------------IGWD-ESHENAVEIARRVEDAGAQALTV---HGRTRAQNYEGPADWDAIKQVKQAV 182 (309) T ss_pred -------EEE--------------CCCC-CCHHHHHHHHHHHHHCCCCEEEE---ECCCHHHCCCCCCCHHHHHHHHHHC T ss_conf -------543--------------3788-76388999999998468887999---6763240267865418999999858 Q ss_pred CCCCEEC-CCCCCHHHHHHHHH-CCCCEEEEE Q ss_conf 9872451-46469999999985-487589961 Q gi|254780889|r 273 PSLLVMA-GNIATAEGALALID-AGADIIKVG 302 (493) Q Consensus 273 ~~~~ii~-GNv~t~~~~~~l~~-~Gad~ikvG 302 (493) + +|||+ |+|-|++-++.+++ .|+|+|-+| T Consensus 183 ~-ipvi~NGdi~~~~d~~~~l~~tg~dgvMig 213 (309) T pfam01207 183 S-IPVIANGDITDAEDAQRCLSYTGADGVMIG 213 (309) T ss_pred C-CCEEEECCCCCHHHHHHHHHHHCCCEEEEC T ss_conf 9-828980894889999999861099999984 No 336 >PRK08904 consensus Probab=96.67 E-value=0.017 Score=37.29 Aligned_cols=122 Identities=20% Similarity=0.259 Sum_probs=79.6 Q ss_pred CEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEE Q ss_conf 76555665137015678999873136669982484113889999999998579872451464699999999854875899 Q gi|254780889|r 221 RLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIK 300 (493) Q Consensus 221 rL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ik 300 (493) .+.|||.-=. +.+.++..+++|++++|- -|.+..+++..+.. +++.+ ..+.|+.-+....++|+|.+| T Consensus 61 ~~~vGaGTVl---~~e~~~~a~~aGA~FiVS---P~~~~~v~~~a~~~-----~i~~i-PGv~TpsEi~~A~~~G~~~vK 128 (207) T PRK08904 61 NAIVGAGTVT---NPEQLKAVEDAGAVFAIS---PGLHESLAKAGHNS-----GIPLI-PGVATPGEIQLALEHGIDTLK 128 (207) T ss_pred CCEEEEEECC---CHHHHHHHHHCCCCEEEC---CCCCHHHHHHHHHC-----CCCEE-CCCCCHHHHHHHHHCCCCEEE T ss_conf 8768553136---899999999849999984---89989999999983-----99765-786999999999987999899 Q ss_pred EEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHH Q ss_conf 610278766531000157431003788999862269819972654377762133671487899323010 Q gi|254780889|r 301 VGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLA 369 (493) Q Consensus 301 vGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~a 369 (493) += |.+.+ | |...+.++..- -.+++++.-|||. ..++..-|++|+-++..||++. T Consensus 129 ~F--PA~~~------G-G~~~lkal~~p-----fp~i~~~pTGGV~-~~N~~~yl~~~~v~~vgGS~l~ 182 (207) T PRK08904 129 LF--PAEVV------G-GKAMLKALYGP-----YADVRFCPTGGIS-LATAPEYLALPNVLCVGGSWLT 182 (207) T ss_pred EC--CCHHC------C-CHHHHHHHHCC-----CCCCEEEECCCCC-HHHHHHHHHCCCEEEEECHHHC T ss_conf 77--62220------8-89999987465-----9998088658989-8789999818984999881438 No 337 >TIGR00400 mgtE magnesium transporter; InterPro: IPR006669 This is the MgtE family of magnesium transporters. All the archaebacterial and eukaryotic examples have two copies of the integral membrane region. This suggests that the eubacterial examples may act as dimers. Members of this family probably transport Mg^2+ or other divalent cations into the cell. The alignment contains two highly conserved aspartates that may be involved in cation binding.; GO: 0008324 cation transmembrane transporter activity, 0006812 cation transport. Probab=96.67 E-value=0.0034 Score=42.34 Aligned_cols=111 Identities=24% Similarity=0.371 Sum_probs=89.6 Q ss_pred EECCCEEEECCCCHHHHHHHHHH-----HCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 15287899465430345656676-----0504167866886027887235450014344672440026786325730358 Q gi|254780889|r 98 MVVNPVTISPYATLADALALMKK-----YSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNL 172 (493) Q Consensus 98 mI~dPVti~pd~TI~ea~~lm~~-----~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l 172 (493) +...-+.+..+.++.+++..++. ..+..+-++++. .++.|++.-+|+-.. ..+..+.++|..+.+.+....+- T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~d~ 218 (460) T TIGR00400 141 LTTEYVWLKEDYTVGDALDYLRRVAETAEDIYTLYVTNES-KRLTGVLSLRDLILA-KPEEYLGDLLFPDGVSVDGLNDE 218 (460) T ss_pred EEEEEEECCCCCHHHHHHHHHHHHHHHHHHHEEEEEECCC-HHHHHHHHHHHHHHC-CCHHHHHHHHHHCCEEEEECCCC T ss_conf 0000011121001578899888876554210245763341-022000112333303-50567766542100135204540 Q ss_pred HH-HHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCC Q ss_conf 99-999998655330645747997868898021000101 Q gi|254780889|r 173 EN-AKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQL 210 (493) Q Consensus 173 ~e-A~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~ 210 (493) ++ +...+.++..-.+|++|++++++|+++..|++.... T Consensus 219 ~~~~~~~~~~~d~~~~p~~~~~~~~~g~~~~dd~~~~~~ 257 (460) T TIGR00400 219 EEDVARLIEKYDFLAVPVVDNEGRLVGIVTVDDILDVLE 257 (460) T ss_pred HHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHH T ss_conf 467898876402102001316771687542346889887 No 338 >PRK00366 ispG 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase; Reviewed Probab=96.62 E-value=0.029 Score=35.63 Aligned_cols=211 Identities=20% Similarity=0.276 Sum_probs=96.3 Q ss_pred CCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHH Q ss_conf 81600317833785236888999999779879997898989999999987422388152878994654303456566760 Q gi|254780889|r 42 DFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQVKKFESGMVVNPVTISPYATLADALALMKKY 121 (493) Q Consensus 42 ~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~ 121 (493) .|.-+-|++--.|.+.--.+ +|.-++|+.+....-.-+++ ++ -|+..-++++..+.+. T Consensus 21 ~IGG~~PI~VQSMt~t~T~D-------------------v~at~~Qi~~L~~aGceiVR--va-vp~~~~a~al~~I~~~ 78 (367) T PRK00366 21 PIGGDAPIVVQSMTNTDTAD-------------------VEATVAQIKRLARAGCEIVR--VA-VPDMEAAAALPEIKKQ 78 (367) T ss_pred EECCCCCEEEEECCCCCCHH-------------------HHHHHHHHHHHHHCCCCEEE--EC-CCCHHHHHHHHHHHHH T ss_conf 04799950257568997443-------------------89999999999984999899--84-5999999969999984 Q ss_pred CCCCEEEECCCCEEEEEEEEHHHHCCC-CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEE Q ss_conf 504167866886027887235450014-3446724400267863257303589999999865533064574799786889 Q gi|254780889|r 122 SISGIPVVESDVGKLVGILTNRDVRFA-SNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLI 200 (493) Q Consensus 122 ~is~iPVVD~~~gkLiGIVT~rDir~~-~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGII 200 (493) +.+|++-|- ++ |.+.+ ...+.-+..+ .=+|=-+.....+.++.+.-.++++--=.=| ..|.| T Consensus 79 --~~iPlvADI--HF-------~~~lAl~a~~~g~~ki-RINPGNig~~e~~~~vv~~ak~~~ipIRIGv-N~GSL---- 141 (367) T PRK00366 79 --LPVPLVADI--HF-------DYRLALKAAEAGADAL-RINPGNIGRDERFKEVVEAAKDYGIPIRIGV-NAGSL---- 141 (367) T ss_pred --CCCCEECCC--CC-------CHHHHHHHHHHHHHHE-EECCCCCCCHHHHHHHHHHHHHCCCCEEECC-CCCCC---- T ss_conf --789703125--41-------4899999998641113-1378756864889999999998499766446-77876---- Q ss_pred EECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHC--C-CC-- Q ss_conf 802100010174223265687655566513701567899987313666998248411388999999999857--9-87-- Q gi|254780889|r 201 TVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNF--P-SL-- 275 (493) Q Consensus 201 T~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~--~-~~-- 275 (493) -+|++..-..|.+- +.-+...+.++.|.+-|-+-|++-.-...-..+++.-+.+.+.+ | ++ T Consensus 142 -~~~ll~kyg~~~~~-------------amveSAl~~~~~~e~~~f~~iviS~KaS~v~~~v~ayr~la~~~dyPLHLGv 207 (367) T PRK00366 142 -EKDLLEKYGEPTPE-------------ALVESALRHAKILEELGFDDIKISVKASDVQDLIAAYRLLAKRCDYPLHLGV 207 (367) T ss_pred -CHHHHHHCCCCCHH-------------HHHHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHCCCCCEEEE T ss_conf -79999870898679-------------9999999999999978985599998728699999999999985798834514 Q ss_pred ----CEECCCCCCHHHHHHHHHCCC-CEEEEEECC Q ss_conf ----245146469999999985487-589961027 Q gi|254780889|r 276 ----LVMAGNIATAEGALALIDAGA-DIIKVGIGP 305 (493) Q Consensus 276 ----~ii~GNv~t~~~~~~l~~~Ga-d~ikvGig~ 305 (493) ...-|-|-+.-|.-.|..-|- |.|||-.-+ T Consensus 208 TEAG~~~~G~IKSsvgig~LL~~GIGDTIRVSLT~ 242 (367) T PRK00366 208 TEAGMGFKGTVKSAAGLGVLLQEGIGDTIRVSLTA 242 (367) T ss_pred CCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEECCC T ss_conf 26789978331406778999975996428985688 No 339 >cd00452 KDPG_aldolase KDPG and KHG aldolase. This family belongs to the class I adolases whose reaction mechanism involves Schiff base formation between a substrate carbonyl and lysine residue in the active site. 2-keto-3-deoxy-6-phosphogluconate (KDPG) aldolase, is best known for its role in the Entner-Doudoroff pathway of bacteria, where it catalyzes the reversible cleavage of KDPG to pyruvate and glyceraldehyde-3-phosphate. 2-keto-4-hydroxyglutarate (KHG) aldolase, which has enzymatic specificity toward glyoxylate, forming KHG in the presence of pyruvate, and is capable of regulating glyoxylate levels in the glyoxylate bypass, an alternate pathway when bacteria are grown on acetate carbon sources. Probab=96.62 E-value=0.072 Score=32.81 Aligned_cols=122 Identities=22% Similarity=0.354 Sum_probs=80.8 Q ss_pred CCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEE Q ss_conf 87655566513701567899987313666998248411388999999999857987245146469999999985487589 Q gi|254780889|r 220 GRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADII 299 (493) Q Consensus 220 grL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~i 299 (493) +.+.+||.-=. +.+-++..+++|++++|- -|.+..+++..+ +. +++.+ ..+.|+.-+...+++|+|.+ T Consensus 54 ~~~~iGaGTV~---~~~~~~~a~~aGa~Fivs---P~~~~~v~~~a~---~~--~~~~i-PGv~TpsEi~~A~~~G~~~v 121 (190) T cd00452 54 PEALIGAGTVL---TPEQADAAIAAGAQFIVS---PGLDPEVVKAAN---RA--GIPLL-PGVATPTEIMQALELGADIV 121 (190) T ss_pred CCCEEEECCCC---CHHHHHHHHHCCCCEEEC---CCCCHHHHHHHH---HC--CCCEE-CCCCCHHHHHHHHHCCCCEE T ss_conf 98089652347---799999999859989973---779999999999---82--99665-78799999999998799989 Q ss_pred EEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHH Q ss_conf 9610278766531000157431003788999862269819972654377762133671487899323010 Q gi|254780889|r 300 KVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLA 369 (493) Q Consensus 300 kvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~a 369 (493) |+ =|.+.. |..-+.++..- -.++++++-|||.- .++..-|.+|+.+|=+|+.|- T Consensus 122 K~--FPa~~~--------G~~~lkal~~p-----fp~~~~~ptGGI~~-~N~~~yl~~gv~avG~g~~l~ 175 (190) T cd00452 122 KL--FPAEAV--------GPAYIKALKGP-----FPQVRFMPTGGVSL-DNAAEWLAAGVVAVGGGSLLP 175 (190) T ss_pred EE--CCCCCC--------CHHHHHHHHCC-----CCCCCEEEECCCCH-HHHHHHHHCCCEEEEECHHCC T ss_conf 98--955114--------99999998554-----89993899679998-889999968998999541258 No 340 >pfam00290 Trp_syntA Tryptophan synthase alpha chain. Probab=96.61 E-value=0.029 Score=35.66 Aligned_cols=179 Identities=19% Similarity=0.235 Sum_probs=106.5 Q ss_pred CCCCCCCC---CCCCCCCCCCCCCHHHHHHHHHHCCC--CEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEE Q ss_conf 46724400---26786325730358999999986553--30645747997868898021000101742232656876555 Q gi|254780889|r 151 AQQAVGEL---MTRNLITVKKTVNLENAKALLHQHRI--EKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVA 225 (493) Q Consensus 151 ~~~~V~ei---Mt~~litv~~~~~l~eA~~iM~~~ki--~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~Vg 225 (493) ++.|+.|- +......+..+.++.+.++++.+-|. ...|++ +.|.-....+ T Consensus 46 FSDP~ADGpvIq~A~~~AL~~G~~~~~~~~~~~~~r~~~~~~piv--------lM~Y~N~i~~----------------- 100 (258) T pfam00290 46 FSDPLADGPTIQRANLRALAGGMTLDQTLELVEEIRNKGTSVPIV--------LMTYYNPVLN----------------- 100 (258) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCEE--------EEEECHHHHH----------------- T ss_conf 888766589999999999986996999999999855128998889--------9852088987----------------- Q ss_pred EEEECCCHHHHH-HHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHH-HCCCCCEECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 665137015678-9998731366-69982484113889999999998-57987245146469999999985487589961 Q gi|254780889|r 226 AAVSVAKDIADR-VGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKK-NFPSLLVMAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 226 AAIg~~~d~~eR-a~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~-~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) -..|+ +..+.++|+| +|+.|...-.+... .+.+++ .+.-+++++.+ .+.+-.+.+.+..-..|=+- T Consensus 101 -------~G~e~F~~~~~~~GvdGvIipDLP~eE~~~~---~~~~~~~~l~~I~lvsPt-t~~~Ri~~i~~~s~gFiY~v 169 (258) T pfam00290 101 -------YGIERFYAQAAEAGVDGLIIPDLPPEEADPL---REAAEKHGIDLIFLVAPT-TSDERLKTISEAASGFVYLV 169 (258) T ss_pred -------CCHHHHHHHHHHCCCCEEECCCCCHHHHHHH---HHHHHHCCCCEEEEECCC-CCHHHHHHHHHHCCCEEEEE T ss_conf -------2999999999975997787079998899999---999984584358884588-81999999996089808998 Q ss_pred ECCCCCCCCCHHCCCCCCCCHHHHHHHHHH-HCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 027876653100015743100378899986-22698199726543777621336714878993230101216 Q gi|254780889|r 303 IGPGSICTTRVVTGVGCPQLSAIMSVVEVA-ERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 303 ig~Gs~CtTr~~~g~g~pq~~av~~~~~~~-~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) --.| +||..-.....+.+.-+.. +..++||..-=||+++.|+- ++..+||.|-+||-|-..-| T Consensus 170 s~~G-------vTG~~~~~~~~~~~~i~~ik~~t~~Pv~vGFGIs~~e~v~-~~~~~aDGvIVGSaiv~~i~ 233 (258) T pfam00290 170 SRAG-------VTGARNAFNAQLDELVERLKKYTNVPVAVGFGISTPEHVK-KIAAGADGVIVGSAIVDIIE 233 (258) T ss_pred ECCC-------CCCCCCCCCHHHHHHHHHHHHCCCCCEEEEECCCCHHHHH-HHHCCCCEEEECHHHHHHHH T ss_conf 5344-------5676555638899999999860699848994579999999-99815999998499999999 No 341 >cd04747 OYE_like_5_FMN Old yellow enzyme (OYE)-related FMN binding domain, group 5. Each monomer of OYE contains FMN as a non-covalently bound cofactor, uses NADPH as a reducing agent with oxygens, quinones, and alpha,beta-unsaturated aldehydes and ketones, and can act as electron acceptors in the catalytic reaction. Other members of OYE family include trimethylamine dehydrogenase, 2,4-dienoyl-CoA reductase, enoate reductase, pentaerythriol tetranitrate reductase, xenobiotic reductase, and morphinone reductase. Probab=96.60 E-value=0.013 Score=38.11 Aligned_cols=127 Identities=17% Similarity=0.233 Sum_probs=72.5 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCHH------------------------HHHHHHHHHHHHHCC-CCCEEC------C-- Q ss_conf 5678999873136669982484113------------------------889999999998579-872451------4-- Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVDTAHGHS------------------------QKVLDAVVQIKKNFP-SLLVMA------G-- 280 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD~ahGh~------------------------~~~~~~i~~~k~~~~-~~~ii~------G-- 280 (493) +..-|+...+||.|.+-|-.||||- ...++.++.+|+... +.+|.. + T Consensus 146 f~~AA~rA~~AGfDGVEIH~ahGyLl~qFLSp~~N~RtDeYGGSlENR~Rf~~Eii~aVr~~vg~df~vg~Ris~~~~~~ 225 (361) T cd04747 146 FARAAADARRLGFDGIELHGAHGYLIDQFFWAGTNRRADGYGGSLAARSRFAAEVVKAIRAAVGPDFPIILRFSQWKQQD 225 (361) T ss_pred HHHHHHHHHHCCCCEEEECCCCCHHHHHHCCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCC T ss_conf 99999999983999899510446589983587438898878998798473699999999997299875999967765767 Q ss_pred ---C-CCCHHH----HHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCH---- Q ss_conf ---6-469999----9999854875899610278766531000157431003788999862269819972654377---- Q gi|254780889|r 281 ---N-IATAEG----ALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFS---- 348 (493) Q Consensus 281 ---N-v~t~~~----~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~---- 348 (493) . -.+.|. +..|.++|.|.+-+-.|. . ..-...|.. +. -+....+..++|+|+-|++... T Consensus 226 ~~~~~~~~~ee~~~~~~~l~~~GvD~i~~s~~~--~-~~p~~~~~~---~~---~~~~~k~~~~~p~~~~g~~~~~~~~~ 296 (361) T cd04747 226 YTARLADTPDELEALLAPLVDAGVDIFHCSTRR--F-WEPEFEGSE---LN---LAGWTKKLTGLPTITVGSVGLDGDFI 296 (361) T ss_pred CCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCC--C-CCCCCCCCC---CC---HHHHHHHHCCCCEEEECCCHHHHHHH T ss_conf 665567999999999999997799989841344--5-676567774---44---88998886289758644410356777 Q ss_pred --------------HHHEEEEEEC-CCEEEECCHHH Q ss_conf --------------7621336714-87899323010 Q gi|254780889|r 349 --------------GDIAKAIAAG-SACVMIGSLLA 369 (493) Q Consensus 349 --------------gdi~kAla~G-A~~VM~G~~~a 369 (493) -++.++|+.| ||+|.||..|- T Consensus 297 ~~~~~~~~~~~~~~e~ae~~l~~G~~D~V~~gR~ll 332 (361) T cd04747 297 GAFAGDEGASPASLDRLLERLERGEFDLVAVGRALL 332 (361) T ss_pred HHHHCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 775124544777999999999869941389759999 No 342 >PRK02615 thiamine-phosphate pyrophosphorylase; Provisional Probab=96.60 E-value=0.021 Score=36.72 Aligned_cols=118 Identities=19% Similarity=0.303 Sum_probs=66.0 Q ss_pred EECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHH---HHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEEC Q ss_conf 51370156789998731366699824841138899999---999985798724514646999999998548758996102 Q gi|254780889|r 228 VSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAV---VQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIG 304 (493) Q Consensus 228 Ig~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i---~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig 304 (493) .++.++..+.+++.+++||++|-.---+......++.- +.+.+.|...+||=-.+ .--...|||+|-+|-. T Consensus 151 t~~~~~l~~~Ve~AL~gGv~ivQlR~K~~~~~~~l~~A~~l~~Lc~~y~a~fIINDrv------DlAlav~ADGVHLGQ~ 224 (345) T PRK02615 151 TSPSEDLLEVVEAALKAGVKLVQYRDKTGDDRERLEEAKQLKELCKRYGALFIVNDRV------DIALAVDADGVHLGQE 224 (345) T ss_pred ECCCCCHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHCCEEEEECCH------HHHHHCCCCEEEECCC T ss_conf 6896349999999997599889830589999999999999999999959948981969------9999749987755888 Q ss_pred CCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCC Q ss_conf 7876653100015743100378899986226981997265437776213367148789932301012 Q gi|254780889|r 305 PGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGT 371 (493) Q Consensus 305 ~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt 371 (493) .-++-..|. -.|-.-|----+.+.-++.+|...|||++-+|.+|.-. T Consensus 225 Dlpi~~aR~--------------------llG~~~iIG~S~h~~ee~~~A~~~gaDYig~Gpvf~T~ 271 (345) T PRK02615 225 DLPLAVARQ--------------------LLGPEKIIGRSTTNPEELAKAIAEGADYIGVGPVFPTP 271 (345) T ss_pred CCCHHHHHH--------------------HCCCCCEEEECCCCHHHHHHHHHCCCCEEEECCEEECC T ss_conf 789999998--------------------73999189961799999999986399979988774258 No 343 >pfam05690 ThiG Thiazole biosynthesis protein ThiG. This family consists of several bacterial thiazole biosynthesis protein G sequences. ThiG, together with ThiF and ThiH, is proposed to be involved in the synthesis of 4-methyl-5-(b-hydroxyethyl)thiazole (THZ) which is an intermediate in the thiazole production pathway. Probab=96.59 E-value=0.017 Score=37.28 Aligned_cols=82 Identities=23% Similarity=0.345 Sum_probs=58.0 Q ss_pred CCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCE Q ss_conf 46999999998548758996102787665310001574310037889998622698199726543777621336714878 Q gi|254780889|r 282 IATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSAC 361 (493) Q Consensus 282 v~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~ 361 (493) -.++--++.|.+.|+.+|- =-||.- =+|-|.-....+.-. ..+.++|||-|.||..+.|.++|.-+|+|+ T Consensus 130 ~~D~v~akrLed~Gc~avM---PlgsPI----GSg~Gl~n~~~l~~i---~e~~~vPvIVDAGiG~pS~Aa~aMElG~Da 199 (246) T pfam05690 130 TDDPVLARRLEEAGCAAVM---PLGAPI----GSGLGLRNPENLRII---IEEADVPVIVDAGIGTPSDAAQAMELGADA 199 (246) T ss_pred CCCHHHHHHHHHCCCEEEE---ECCCCC----CCCCCCCCHHHHHHH---HHHCCCCEEEECCCCCHHHHHHHHHCCCCE T ss_conf 7998999999875984986---224401----368886899999999---996799889848989678899999745677 Q ss_pred EEECCHHHCCCC Q ss_conf 993230101216 Q gi|254780889|r 362 VMIGSLLAGTDE 373 (493) Q Consensus 362 VM~G~~~agt~E 373 (493) |++-+..|.... T Consensus 200 VLvNTAIA~A~d 211 (246) T pfam05690 200 VLLNTAIARAKD 211 (246) T ss_pred EEHHHHHHCCCC T ss_conf 773067773799 No 344 >pfam04309 G3P_antiterm Glycerol-3-phosphate responsive antiterminator. Intracellular glycerol is usually converted to glycerol-3-phosphate in an ATP-requiring phosphorylation reaction catalysed by glycerol kinase (GlpK) glycerol-3-phosphate activates the antiterminator GlpP. Probab=96.59 E-value=0.026 Score=35.99 Aligned_cols=36 Identities=33% Similarity=0.552 Sum_probs=30.9 Q ss_pred HHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEE Q ss_conf 998622698199726543777621336714878993 Q gi|254780889|r 329 VEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMI 364 (493) Q Consensus 329 ~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~ 364 (493) .+..++.+.||||-|=|++.-|+..||.+||++|.- T Consensus 132 ~~i~~~~~~PiIAGGLI~~~edv~~aL~aGA~aVST 167 (174) T pfam04309 132 QEITERTNIPIIAGGLIRTEEEVREALKAGAVAVST 167 (174) T ss_pred HHHHHHCCCCEEEECCCCCHHHHHHHHHCCCEEEEC T ss_conf 999974799999767838899999999849969987 No 345 >PRK12581 oxaloacetate decarboxylase; Provisional Probab=96.57 E-value=0.038 Score=34.85 Aligned_cols=82 Identities=27% Similarity=0.288 Sum_probs=58.8 Q ss_pred HHHHHHHHHHCCCCEEEE-ECCCC-HHHHHHHHHHHHHHHCCCCCEECCCCCC--HHH--HHHHHHCCCCEEEEEECCCC Q ss_conf 567899987313666998-24841-1388999999999857987245146469--999--99998548758996102787 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVV-DTAHG-HSQKVLDAVVQIKKNFPSLLVMAGNIAT--AEG--ALALIDAGADIIKVGIGPGS 307 (493) Q Consensus 234 ~~eRa~~LveaGvDvivi-D~ahG-h~~~~~~~i~~~k~~~~~~~ii~GNv~t--~~~--~~~l~~~Gad~ikvGig~Gs 307 (493) +.+.+++|+++|+|.|+| |.|-- .-..+-+.++.||+ .+++||-.-.=.| -.. ...-++||||+|-+.|.|=| T Consensus 165 y~~~ak~l~~~Gad~I~iKDmaGlL~P~~a~~LV~~lK~-~~~iPI~~HtH~t~Gla~a~~laAieAGaDiVD~Ai~~~s 243 (468) T PRK12581 165 YLSLVKELVEMGADSICIKDMAGILTPKAAKELVSGIKA-MTNLPLIVHTHATSGISQMTYLAAVEAGADRIDTALSPFS 243 (468) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHH-CCCCCEEEEECCCCCHHHHHHHHHHHCCCCEEEECCCCCC T ss_conf 999999999739998998478777688999999999983-6798659982588754999999999819999974464535 Q ss_pred CCCCCHHCC Q ss_conf 665310001 Q gi|254780889|r 308 ICTTRVVTG 316 (493) Q Consensus 308 ~CtTr~~~g 316 (493) -+|.+.-++ T Consensus 244 ~gtSqP~~~ 252 (468) T PRK12581 244 EGTSQPATE 252 (468) T ss_pred CCCCCCCHH T ss_conf 798886699 No 346 >cd04729 NanE N-acetylmannosamine-6-phosphate epimerase (NanE) converts N-acetylmannosamine-6-phosphate to N-acetylglucosamine-6-phosphate. This reaction is part of the pathway that allows the usage of sialic acid as a carbohydrate source. Sialic acids are a family of related sugars that are found as a component of glycoproteins, gangliosides, and other sialoglycoconjugates. Probab=96.57 E-value=0.035 Score=35.05 Aligned_cols=71 Identities=23% Similarity=0.388 Sum_probs=53.1 Q ss_pred EEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHH-------HHHHHHHHHHHHCCCCCEEC-CCCCCHHHHHHHHHCCCC Q ss_conf 6651370156789998731366699824841138-------89999999998579872451-464699999999854875 Q gi|254780889|r 226 AAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQ-------KVLDAVVQIKKNFPSLLVMA-GNIATAEGALALIDAGAD 297 (493) Q Consensus 226 AAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~-------~~~~~i~~~k~~~~~~~ii~-GNv~t~~~~~~l~~~Gad 297 (493) |-+++.+| +..-.++|+|++--- -.||.. .=++.++.+.+.+ .+++|+ |++.|++.+...+++||+ T Consensus 128 AD~st~ee----~~~A~~~G~D~vgTT-L~GYT~~t~~~~~PD~~lv~~l~~~~-~~pvIaEGri~tPe~a~~a~~~GA~ 201 (219) T cd04729 128 ADISTLEE----ALNAAKLGFDIIGTT-LSGYTEETAKTEDPDFELLKELRKAL-GIPVIAEGRINSPEQAAKALELGAD 201 (219) T ss_pred EECCCHHH----HHHHHHCCCCEEECC-CCCCCCCCCCCCCCCHHHHHHHHHHC-CCCEEEECCCCCHHHHHHHHHCCCC T ss_conf 75488999----999998499899702-14567787889998789999999975-9939970698999999999983998 Q ss_pred EEEEE Q ss_conf 89961 Q gi|254780889|r 298 IIKVG 302 (493) Q Consensus 298 ~ikvG 302 (493) +|-|| T Consensus 202 aVVVG 206 (219) T cd04729 202 AVVVG 206 (219) T ss_pred EEEEC T ss_conf 99989 No 347 >cd02071 MM_CoA_mut_B12_BD methylmalonyl CoA mutase B12 binding domain. This domain binds to B12 (adenosylcobamide), which initiates the conversion of succinyl CoA and methylmalonyl CoA by forming an adenosyl radical, which then undergoes a rearrangement exchanging a hydrogen atom with a group attached to a neighboring carbon atom. This family is present in both mammals and bacteria. Bacterial members are heterodimers and involved in the fermentation of pyruvate to propionate. Mammalian members are homodimers and responsible for the conversion of odd-chain fatty acids and branched-chain amino acids via propionyl CoA to succinyl CoA for further degradation. Probab=96.56 E-value=0.026 Score=36.04 Aligned_cols=71 Identities=23% Similarity=0.375 Sum_probs=59.0 Q ss_pred HHHHHHHHHHCCCCEEEEEC-CCCHHHHHHHHHHHHHHH-CCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 56789998731366699824-841138899999999985-798724514646999999998548758996102787 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVDT-AHGHSQKVLDAVVQIKKN-FPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGS 307 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD~-ahGh~~~~~~~i~~~k~~-~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs 307 (493) -.+=+++..+..+|+|.+-+ +-+|...+-++++.+++. ..++++++|.+...+-...|.+.|++.+ .|||+ T Consensus 39 ~eeiv~~A~~e~ad~IglSsL~g~h~~~~~~l~~~L~e~G~~di~v~vGG~Ip~~d~~~l~~~Gv~~v---f~pgt 111 (122) T cd02071 39 PEEIVEAAIQEDVDVIGLSSLSGGHMTLFPEVIELLRELGAGDILVVGGGIIPPEDYELLKEMGVAEI---FGPGT 111 (122) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHCCCCCCEEEEECCCCHHHHHHHHHCCCCEE---ECCCC T ss_conf 99999999973998999964655447899999999997699984699945649899999997799889---89588 No 348 >PRK09140 2-dehydro-3-deoxy-6-phosphogalactonate aldolase; Reviewed Probab=96.56 E-value=0.045 Score=34.33 Aligned_cols=122 Identities=30% Similarity=0.390 Sum_probs=79.4 Q ss_pred CEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEE Q ss_conf 76555665137015678999873136669982484113889999999998579872451464699999999854875899 Q gi|254780889|r 221 RLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIK 300 (493) Q Consensus 221 rL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ik 300 (493) +++|||.-=++ .|.+++.+++|+++++- -+.+..+++..++. +++.+ -.+.|+.-....+++|+|.+| T Consensus 62 ~~~iGAGTVlt---~e~~~~ai~aGA~FiVS---P~~~~~vi~~a~~~-----~i~~i-PG~~TPsEi~~A~~~Ga~~vK 129 (206) T PRK09140 62 DALIGAGTVLS---PEQVDRLADAGGRLIVT---PNIDPEVIRRAVAY-----GMTVM-PGVATPTEAFAALRAGADALK 129 (206) T ss_pred CEEEEEEECCC---HHHHHHHHHCCCCEEEC---CCCCHHHHHHHHHC-----CCCCC-CCCCCHHHHHHHHHCCCCEEE T ss_conf 65998620467---99999999859999999---99989999999982-----99652-785999999999985987156 Q ss_pred EEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHH Q ss_conf 610278766531000157431003788999862269819972654377762133671487899323010 Q gi|254780889|r 301 VGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLA 369 (493) Q Consensus 301 vGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~a 369 (493) += |.+.. |..-+.++... -..++++++-|||. ..++..-|.+||.+|=+||.|- T Consensus 130 lF--PA~~~--------Gp~~ikal~~p----~P~~~~~~ptGGV~-~~N~~~~l~aGa~avG~Gs~L~ 183 (206) T PRK09140 130 LF--PASQL--------GPAGIKALRAV----LPPDVPVFAVGGVT-PENLAPYLAAGAAGFGLGSALY 183 (206) T ss_pred EC--CHHCC--------CHHHHHHHHCC----CCCCCEEEECCCCC-HHHHHHHHHCCCCEEEECHHCC T ss_conf 57--51105--------99999998643----89999899537988-8889999986991999606515 No 349 >PRK02083 imidazole glycerol phosphate synthase subunit HisF; Provisional Probab=96.56 E-value=0.043 Score=34.43 Aligned_cols=70 Identities=23% Similarity=0.336 Sum_probs=51.0 Q ss_pred CHHHHHHHHHHHCCCC-EEEEEC--CCCHHHHHHHHHHHHHHHCCCCCE-ECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 0156789998731366-699824--841138899999999985798724-5146469999999985487589961 Q gi|254780889|r 232 KDIADRVGPLFDVNVD-LVVVDT--AHGHSQKVLDAVVQIKKNFPSLLV-MAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 232 ~d~~eRa~~LveaGvD-viviD~--ahGh~~~~~~~i~~~k~~~~~~~i-i~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) +|-.+-|+...+.|+| ++++|. |........+.++++.+.. .+|+ ++|.|-|.|.++.|.++|||-|-+| T Consensus 30 gdP~~~a~~~~~~gadel~ivDld~s~~~~~~~~~~I~~i~~~~-~~pi~vGGGIrs~e~~~~ll~~GadkVvig 103 (253) T PRK02083 30 GDPVELAKRYDEEGADELVFLDITASSEGRDTMKDVVERVAEQV-FIPLTVGGGIRSVEDARRLLRAGADKVSIN 103 (253) T ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHC-CCCEEEECCCCCHHHHHHHHHCCCCEEEEC T ss_conf 89999999999879998999956266457741799999999863-987785176213898768987798789999 No 350 >PRK11573 hypothetical protein; Provisional Probab=96.55 E-value=0.0039 Score=41.90 Aligned_cols=60 Identities=13% Similarity=0.233 Sum_probs=53.1 Q ss_pred CCCCCCCCCCC--CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEECCCHHCC Q ss_conf 44672440026--786325730358999999986553306457479-9786889802100010 Q gi|254780889|r 150 NAQQAVGELMT--RNLITVKKTVNLENAKALLHQHRIEKLLVVDDD-GCCIGLITVKDIERSQ 209 (493) Q Consensus 150 ~~~~~V~eiMt--~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~-g~LvGIIT~~DIlk~~ 209 (493) -.+.+|+++|+ .++++++.+.++.++.+.+.+++..++||.+++ ++++|++..+|+++.. T Consensus 184 l~~~~v~eiMtPR~~i~~l~~~~~~~e~~~~~~~~~~SR~PVy~~~~D~iiGiv~~kdl~~~~ 246 (413) T PRK11573 184 LEKVTVDDIMVPRNEIIGIDINDDWKSILRQLTHSPHGRIVLYRDSLDDAISMLRVREAWRLM 246 (413) T ss_pred CCCCEEHEEEEECCCEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEEHHHHHHH T ss_conf 189890118620022689747899999999866338754258547888659999825778776 No 351 >PRK00507 deoxyribose-phosphate aldolase; Provisional Probab=96.52 E-value=0.083 Score=32.38 Aligned_cols=114 Identities=23% Similarity=0.242 Sum_probs=74.4 Q ss_pred HHHHHHHHCCCCEE--EEEC---CCCHHHHHHHHHHHHHHHCCC--CCEEC--CCCCCHH----HHHHHHHCCCCEEEEE Q ss_conf 78999873136669--9824---841138899999999985798--72451--4646999----9999985487589961 Q gi|254780889|r 236 DRVGPLFDVNVDLV--VVDT---AHGHSQKVLDAVVQIKKNFPS--LLVMA--GNIATAE----GALALIDAGADIIKVG 302 (493) Q Consensus 236 eRa~~LveaGvDvi--viD~---ahGh~~~~~~~i~~~k~~~~~--~~ii~--GNv~t~~----~~~~l~~~Gad~ikvG 302 (493) .-++..++.|+|=| |++. -.|+...+.+-++.+++..+. +.+|. |-. |.+ ..+..+++|||.||-. T Consensus 78 ~E~~~ai~~GAdEiD~Vin~~~~~~g~~~~v~~ei~~v~~~~~~~~lKVIlEt~~L-t~~ei~~a~~~~~~aGadfvKTS 156 (221) T PRK00507 78 FEAKDAIANGADEIDMVINIGALKSGDWDAVEADIRAVVEAAGGKVLKVIIETCLL-TDEEKVKACEICKEAGADFVKTS 156 (221) T ss_pred HHHHHHHHCCCCEEEEECCHHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCC-CHHHHHHHHHHHHHHCCCEEEEC T ss_conf 99999998599877740259999758488999999999987276736999744659-99999999999998297878605 Q ss_pred ECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCE Q ss_conf 02787665310001574310037889998622698199726543777621336714878 Q gi|254780889|r 303 IGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSAC 361 (493) Q Consensus 303 ig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~ 361 (493) -|=+....| +-.|.-..+... ..+.|=|-||||+.-+..+-|.+||+- T Consensus 157 TGf~~~gat----------~e~v~~m~~~~~-~~~giKasGGIrt~~~a~~~l~aGa~r 204 (221) T PRK00507 157 TGFSTGGAT----------VEDVKLMRETVG-PRVGVKASGGIRTLEDALAMIEAGATR 204 (221) T ss_pred CCCCCCCCC----------HHHHHHHHHHHC-CCCEEECCCCCCCHHHHHHHHHHCCHH T ss_conf 887889989----------999999999728-786386778989999999999827513 No 352 >cd02940 DHPD_FMN Dihydropyrimidine dehydrogenase (DHPD) FMN-binding domain. DHPD catalyzes the first step in pyrimidine degradation: the NADPH-dependent reduction of uracil and thymine to the corresponding 5,6-dihydropyrimidines. DHPD contains two FAD, two FMN, and eight [4Fe-4S] clusters, arranged in two electron transfer chains that pass the dimer interface twice. Two of the Fe-S clusters show a hitherto unobserved coordination involving a glutamine residue. Probab=96.51 E-value=0.019 Score=37.01 Aligned_cols=70 Identities=30% Similarity=0.342 Sum_probs=48.1 Q ss_pred HHHHHHHHHHHCCCCEEEE-EC-----------C------------CCHH-----HHHHHHHHHHHHHC-CCCCEEC-CC Q ss_conf 1567899987313666998-24-----------8------------4113-----88999999999857-9872451-46 Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVV-DT-----------A------------HGHS-----QKVLDAVVQIKKNF-PSLLVMA-GN 281 (493) Q Consensus 233 d~~eRa~~LveaGvDvivi-D~-----------a------------hGh~-----~~~~~~i~~~k~~~-~~~~ii~-GN 281 (493) |..+-+.++.++|+|.++. .+ . -|.| ...++.++++++.. ++++||+ |. T Consensus 181 ~i~~ia~~~~~~gadgiv~~Nt~~~~~~i~~d~~~~~~~~~~~~~~GGlSG~~l~~~al~~v~~i~~~~~~~i~Iig~GG 260 (299) T cd02940 181 DIREIARAAKEGGADGVSAINTVNSLMGVDLDGTPPAPGVEGKTTYGGYSGPAVKPIALRAVSQIARAPEPGLPISGIGG 260 (299) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEECC T ss_conf 49999999998599899997666775654422356665645677778455878899999999999996489977899899 Q ss_pred CCCHHHHHHHHHCCCCEEEEE Q ss_conf 469999999985487589961 Q gi|254780889|r 282 IATAEGALALIDAGADIIKVG 302 (493) Q Consensus 282 v~t~~~~~~l~~~Gad~ikvG 302 (493) |.+++-+...+.+||+.|-++ T Consensus 261 I~s~~Da~e~i~aGAs~Vqv~ 281 (299) T cd02940 261 IESWEDAAEFLLLGASVVQVC 281 (299) T ss_pred CCCHHHHHHHHHCCCCHHHHH T ss_conf 599999999998499899998 No 353 >PRK04169 geranylgeranylglyceryl phosphate synthase-like protein; Reviewed Probab=96.51 E-value=0.084 Score=32.35 Aligned_cols=69 Identities=26% Similarity=0.365 Sum_probs=50.9 Q ss_pred HHHHHHHHHH--CCCCEEEEECCCCHHHHH-HHHHHHHHHHCCCCC-EECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 5678999873--136669982484113889-999999998579872-45146469999999985487589961 Q gi|254780889|r 234 IADRVGPLFD--VNVDLVVVDTAHGHSQKV-LDAVVQIKKNFPSLL-VMAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 234 ~~eRa~~Lve--aGvDviviD~ahGh~~~~-~~~i~~~k~~~~~~~-ii~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) +.-.+.+|.+ -|--++-++-.-|+..-+ .++++.+|+...+++ +++|.+-|++.++.+.++|||.|-|| T Consensus 142 ~i~a~~alA~~~~g~~~iYLE~gsga~~pv~~e~V~~v~~~l~~~~LivGGGIrs~e~a~~~~~aGAD~IVvG 214 (229) T PRK04169 142 DIAAYAALAAEYLGMPIVYLEYGGGAGDPVPPEMVKAVKKALTDTPLIVGGGIRSPEQAREMAKAGADTIVVG 214 (229) T ss_pred HHHHHHHHHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHCCCCEEEEC T ss_conf 9999999999983980899965888899789999999997378987899289699999999997699999988 No 354 >pfam04551 GcpE GcpE protein. In a variety of organisms, including plants and several eubacteria, isoprenoids are synthesized by the mevalonate-independent 2-C-methyl-D-erythritol 4-phosphate (MEP) pathway. Although different enzymes of this pathway have been described, the terminal biosynthetic steps of the MEP pathway have not been fully elucidated. GcpE gene of Escherichia coli is involved in this pathway. Probab=96.51 E-value=0.031 Score=35.45 Aligned_cols=266 Identities=20% Similarity=0.259 Sum_probs=121.7 Q ss_pred CCEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHH Q ss_conf 81600317833785236888999999779879997898989999999987422388152878994654303456566760 Q gi|254780889|r 42 DFTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQVKKFESGMVVNPVTISPYATLADALALMKKY 121 (493) Q Consensus 42 ~i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~ 121 (493) .|.-+-||.--.|.+.--.+ ++.-++|+.+....-.-+++ .--|+..-++++..+.+. T Consensus 10 ~IGG~~PI~VQSMt~t~T~D-------------------v~atv~Qi~~L~~aGceiVR---vavp~~~~a~al~~I~~~ 67 (345) T pfam04551 10 PIGGDAPISVQSMTNTDTRD-------------------VEATVAQIKRLEEAGCDIVR---VAVPDMEAAEALKEIKKQ 67 (345) T ss_pred EECCCCCEEEEECCCCCCCC-------------------HHHHHHHHHHHHHCCCCEEE---ECCCCHHHHHHHHHHHHH T ss_conf 26798957667568998620-------------------89999999999985999899---887999999849999985 Q ss_pred CCCCEEEECCCCEEEEEEEEHHHHCCC-CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEE Q ss_conf 504167866886027887235450014-3446724400267863257303589999999865533064574799786889 Q gi|254780889|r 122 SISGIPVVESDVGKLVGILTNRDVRFA-SNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLI 200 (493) Q Consensus 122 ~is~iPVVD~~~gkLiGIVT~rDir~~-~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGII 200 (493) +.+|++-|- + .|.+.+ ...+.-+..+ .=+|=-+.....+.++.+.-.++++--=.=| ..|.| T Consensus 68 --~~iPlVADI--H-------F~~~lAl~a~~~g~~ki-RINPGNig~~~~~~~vv~~ak~~~~pIRIGv-N~GSL---- 130 (345) T pfam04551 68 --SPIPLVADI--H-------FDYRLALEAIEAGVDKI-RINPGNIGRREKVKEVVEAAKERGIPIRIGV-NSGSL---- 130 (345) T ss_pred --CCCCCEEEC--C-------CCHHHHHHHHHHHHCCC-CCCCCCCCCHHHHHHHHHHHHHCCCCEEEEC-CCCCC---- T ss_conf --899714004--2-------48999999998622015-6799774867878999999998399789832-67776---- Q ss_pred EECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCC---- Q ss_conf 8021000101742232656876555665137015678999873136669982484113889999999998579872---- Q gi|254780889|r 201 TVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLL---- 276 (493) Q Consensus 201 T~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~---- 276 (493) .+|+++.-..|.+- +.-+...+-++.|-+-|-+-|++-.-...-..+++.-+.+.+.++ -| T Consensus 131 -~~~~l~ky~~~~~~-------------amveSAl~~~~~~e~~~f~~iviS~KsS~v~~~i~ayr~la~~~d-yPLHLG 195 (345) T pfam04551 131 -EKRILEKYGGPTPE-------------AMVESALEHVRILEELGFDDIVISLKASDVLVMIEAYRLLAEKTD-YPLHLG 195 (345) T ss_pred -CHHHHHHCCCCCHH-------------HHHHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHCC-CCCEEC T ss_conf -88999870898789-------------999999999999987899868999971868899999999997579-883447 Q ss_pred ------EECCCCCCHHHHHHHHHCCC-CEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHH-----HCCCCEEEEECC Q ss_conf ------45146469999999985487-589961027876653100015743100378899986-----226981997265 Q gi|254780889|r 277 ------VMAGNIATAEGALALIDAGA-DIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVA-----ERAGVAIVADGG 344 (493) Q Consensus 277 ------ii~GNv~t~~~~~~l~~~Ga-d~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~-----~~~~~~iiadGG 344 (493) ..-|-|-+.-|.-.|..-|- |.|||-.-.- | .--|.-|.+.. +.+++.+|+==+ T Consensus 196 vTEAG~~~~G~IKSs~gig~LL~~GIGDTIRVSLt~d-------------P-~~EV~v~~~IL~sl~lR~~g~~iISCPt 261 (345) T pfam04551 196 VTEAGTGEDGTIKSAIGIGALLAEGIGDTIRVSLTED-------------P-VEEVKVAFEILQSLGLRKRGVEIISCPT 261 (345) T ss_pred CCCCCCCCCCEEHHHHHHHHHHHCCCCCEEEEECCCC-------------C-HHHHHHHHHHHHHCCCCCCCCEEEECCC T ss_conf 7477799881155898899999718865399967899-------------5-4789999999997287757864875887 Q ss_pred C-CCHHHHE-------E---EEEECCCEEEECCHHHCCCCCC Q ss_conf 4-3777621-------3---3671487899323010121677 Q gi|254780889|r 345 I-RFSGDIA-------K---AIAAGSACVMIGSLLAGTDESP 375 (493) Q Consensus 345 i-~~~gdi~-------k---Ala~GA~~VM~G~~~agt~Esp 375 (493) + |+.-|+. + .+-.+=.--.||....|--|+- T Consensus 262 CGR~~~dl~~~a~~ve~~l~~~~~~l~vAVMGCvVNGPGEak 303 (345) T pfam04551 262 CGRTLFDLIKVAKEVEERLSHLKKPLKVAVMGCVVNGPGEAK 303 (345) T ss_pred CCCCCCCHHHHHHHHHHHHHCCCCCCEEEEECEEECCCCCCC T ss_conf 466115599999999999737998988999780431876444 No 355 >PRK01130 N-acetylmannosamine-6-phosphate 2-epimerase; Provisional Probab=96.50 E-value=0.04 Score=34.70 Aligned_cols=70 Identities=23% Similarity=0.266 Sum_probs=52.1 Q ss_pred EEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHH---------HHHHHHHHHHHHCCCCCEEC-CCCCCHHHHHHHHHCC Q ss_conf 6651370156789998731366699824841138---------89999999998579872451-4646999999998548 Q gi|254780889|r 226 AAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQ---------KVLDAVVQIKKNFPSLLVMA-GNIATAEGALALIDAG 295 (493) Q Consensus 226 AAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~---------~~~~~i~~~k~~~~~~~ii~-GNv~t~~~~~~l~~~G 295 (493) |-|++.+ -+....++|+|++--- -.||.. .-++.++.+.+. ++++|+ |++.|++.+..-+++| T Consensus 124 AD~st~e----ea~~A~~~G~D~V~TT-LsGYT~~t~~~~~~~pD~~lv~~l~~~--~~pvIaEGri~tPe~a~~al~~G 196 (222) T PRK01130 124 ADCSTLE----EGLAAAKLGFDFIGTT-LSGYTEYTEGETPEEPDFALLKELLKA--GCPVIAEGRINTPEQAKKALELG 196 (222) T ss_pred EECCCHH----HHHHHHHCCCCEEECC-CCCCCCCCCCCCCCCCCHHHHHHHHHC--CCCEEEECCCCCHHHHHHHHHCC T ss_conf 8548899----9999998499999723-345676767787899869999999958--99899747989999999999849 Q ss_pred CCEEEEE Q ss_conf 7589961 Q gi|254780889|r 296 ADIIKVG 302 (493) Q Consensus 296 ad~ikvG 302 (493) |++|-|| T Consensus 197 A~aVvVG 203 (222) T PRK01130 197 AHAVVVG 203 (222) T ss_pred CCEEEEC T ss_conf 9899989 No 356 >cd04728 ThiG Thiazole synthase (ThiG) is the tetrameric enzyme that is involved in the formation of the thiazole moiety of thiamin pyrophosphate, an essential ubiquitous cofactor that plays an important role in carbohydrate and amino acid metabolism. ThiG catalyzes the formation of thiazole from 1-deoxy-D-xylulose 5-phosphate (DXP) and dehydroglycine, with the help of the sulfur carrier protein ThiS that carries the sulfur needed for thiazole assembly on its carboxy terminus (ThiS-COSH). Probab=96.49 E-value=0.02 Score=36.78 Aligned_cols=82 Identities=27% Similarity=0.343 Sum_probs=58.0 Q ss_pred CCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCE Q ss_conf 46999999998548758996102787665310001574310037889998622698199726543777621336714878 Q gi|254780889|r 282 IATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSAC 361 (493) Q Consensus 282 v~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~ 361 (493) -.++--++.|.++|+.+|-- -||.- =+|-|.-.-..+. ...++.++|||-|.||..+.|.+.|.-+|+|+ T Consensus 131 ~~D~v~akrLe~~Gc~avMP---lgsPI----GSg~Gl~n~~~l~---~i~e~~~vPvIVDAGiG~pS~Aa~aMElG~da 200 (248) T cd04728 131 TDDPVLAKRLEDAGCAAVMP---LGSPI----GSGQGLLNPYNLR---IIIERADVPVIVDAGIGTPSDAAQAMELGADA 200 (248) T ss_pred CCCHHHHHHHHHCCCEEEEE---CCCCC----CCCCCCCCHHHHH---HHHHHCCCCEEEECCCCCHHHHHHHHHCCCCE T ss_conf 88999999999749534520---45643----4798879999999---99984799889847999756789998726553 Q ss_pred EEECCHHHCCCC Q ss_conf 993230101216 Q gi|254780889|r 362 VMIGSLLAGTDE 373 (493) Q Consensus 362 VM~G~~~agt~E 373 (493) |++-+..|.... T Consensus 201 VL~NTAIA~A~d 212 (248) T cd04728 201 VLLNTAIAKAKD 212 (248) T ss_pred EEHHHHHHCCCC T ss_conf 345468771699 No 357 >TIGR00737 nifR3_yhdG putative TIM-barrel protein, nifR3 family; InterPro: IPR004652 This family represents one branch of COG0042 from COG (Predicted TIM-barrel enzymes, possibly dehydrogenases, nifR3 family) and includes NifR3 itself, from Rhodobacter capsulatus. Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes , and also in some archaea. Most dihydrouridines are found in the D loop of t-RNAs. The role of dihydrouridine in tRNA is currently unknown, but may increase conformational flexibility of the tRNA. ; GO: 0016491 oxidoreductase activity, 0050660 FAD binding, 0008033 tRNA processing. Probab=96.47 E-value=0.0079 Score=39.72 Aligned_cols=210 Identities=18% Similarity=0.252 Sum_probs=103.5 Q ss_pred CCCEECCCEEEECCCCCCCHHH---HHHHH--HCCCE-EEECCCC-CHHHHHHHH-HHHHHHCCCEECC--CE----EEE Q ss_conf 8816003178337852368889---99999--77987-9997898-989999999-9874223881528--78----994 Q gi|254780889|r 41 KDFTLNLPIMSAAMDQVTDSRL---AIAMA--QAGGL-GVIHRNF-SPSEQVAQV-HQVKKFESGMVVN--PV----TIS 106 (493) Q Consensus 41 ~~i~l~iPivSs~MDTVTe~~M---AiamA--~~GGi-GvIHrn~-sie~qa~~V-~kVKr~e~gmI~d--PV----ti~ 106 (493) .+|.|+-|+|-|||=-||.... +..+. +.|-+ |..--=| |-+.+...- .+-++- --+..| |+ .=+ T Consensus 2 G~~~L~s~V~~APmAGvtD~~FR~l~~~~~~skvGtvagL~~~EMvs~~~~~~~~r~~~~~~-~~~~~~~~~~~~Ql~Gs 80 (336) T TIGR00737 2 GNIQLKSRVVLAPMAGVTDLAFRRLVAEYGQSKVGTVAGLTVSEMVSSEAIVYKSRERTKKL-LDIDEDETPISVQLFGS 80 (336) T ss_pred CCCCCCCCEEECCCCCCCCHHHHHHHHHHCCCCCCCCCCCCEECCCCCHHHHHHHHHHHHHH-HHHCCCCCCCEEEEECC T ss_conf 87223676564367787671789999985214433124100222004537886235557765-32125888547876478 Q ss_pred CCCCHHHHHH-HHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCCCCCC--CCCCCCCCCHHHHHHHHHHC- Q ss_conf 6543034565-6676050416786688602788723545001434467244002678--63257303589999999865- Q gi|254780889|r 107 PYATLADALA-LMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGELMTRN--LITVKKTVNLENAKALLHQH- 182 (493) Q Consensus 107 pd~TI~ea~~-lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~--litv~~~~~l~eA~~iM~~~- 182 (493) .-+++.||.+ +..+.+...+ |=| ...|+..|--+. =--.....-+.++++.+.+. T Consensus 81 ~P~~~aeAAk~i~~~~ga~~I---DiN------------------~GCP~~Kitk~~aGsaLl~~p~~~~~iv~~vV~AV 139 (336) T TIGR00737 81 DPDTMAEAAKLINEELGADII---DIN------------------MGCPVPKITKKGAGSALLRDPDLIGKIVKAVVDAV 139 (336) T ss_pred CHHHHHHHHHHHHHCCCCCEE---EEC------------------CCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 826899999998530589888---536------------------76548842167635432358689999999999875 Q ss_pred CCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEE---CCCCHHH Q ss_conf 5330645747997868898021000101742232656876555665137015678999873136669982---4841138 Q gi|254780889|r 183 RIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVD---TAHGHSQ 259 (493) Q Consensus 183 ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviviD---~ahGh~~ 259 (493) +-..+||. +|.|.. -|++ +-+..|-++.+.++|+-+|.|- -|++|+- T Consensus 140 ~~~~iPVT---------------VK~R~G----WD~~-----------h~n~~~~a~~a~~~Ga~Av~lHGRTRaQ~Y~G 189 (336) T TIGR00737 140 DAQDIPVT---------------VKIRIG----WDDA-----------HINAVEAARIAEDAGAQAVTLHGRTRAQGYEG 189 (336) T ss_pred CCCCCCEE---------------EEEECC----CCCC-----------CCCHHHHHHHHHHCCCCEEEHHHHHHCCCCCC T ss_conf 18766516---------------655156----3624-----------48889999999872400021110000015788 Q ss_pred HH-HHHHHHHHHHCC-C--CCEEC-CCCCCHHHHHHHHH-CCCCEEEEE Q ss_conf 89-999999998579-8--72451-46469999999985-487589961 Q gi|254780889|r 260 KV-LDAVVQIKKNFP-S--LLVMA-GNIATAEGALALID-AGADIIKVG 302 (493) Q Consensus 260 ~~-~~~i~~~k~~~~-~--~~ii~-GNv~t~~~~~~l~~-~Gad~ikvG 302 (493) .. -|.|+.+|+... . +|||+ |.|-+++.|+..++ .|||+|-|| T Consensus 190 ~A~wd~I~~vKq~v~~~GeiPVigNGDi~~~~~A~~~L~~TG~DGvm~g 238 (336) T TIGR00737 190 EANWDIIARVKQAVRKEGEIPVIGNGDIFSPEDAKAMLETTGADGVMIG 238 (336) T ss_pred CCCHHHHHHHHHHHHCCCCCCEECCCCCCCHHHHHHHHHHCCCCEEEEE T ss_conf 7606899999999716875332227742467899999863788689850 No 358 >PRK00208 thiG thiazole synthase; Reviewed Probab=96.47 E-value=0.022 Score=36.57 Aligned_cols=83 Identities=23% Similarity=0.318 Sum_probs=58.0 Q ss_pred CCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCE Q ss_conf 46999999998548758996102787665310001574310037889998622698199726543777621336714878 Q gi|254780889|r 282 IATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSAC 361 (493) Q Consensus 282 v~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~ 361 (493) -.++--++.|.++|+.+|- =-||.- =+|-|.-.-..+ ....++.++|||-|.||-.+.|.+.|.-+|+|+ T Consensus 132 ~~D~v~akrLe~~Gc~avM---PlgsPI----GSg~Gl~n~~~l---~~i~e~~~vPvIVDAGiG~pS~Aa~AMElG~Da 201 (256) T PRK00208 132 TDDPVLAKRLEEAGCAAVM---PLGAPI----GSGLGLLNPYNL---RIIIEQADVPVIVDAGIGTPSDAAQAMELGADA 201 (256) T ss_pred CCCHHHHHHHHHCCCEEEE---ECCCCC----CCCCCCCCHHHH---HHHHHHCCCCEEEECCCCCHHHHHHHHHCCCCE T ss_conf 8898999999974953452---045643----479887999999---999986799889857889766789998625543 Q ss_pred EEECCHHHCCCCC Q ss_conf 9932301012167 Q gi|254780889|r 362 VMIGSLLAGTDES 374 (493) Q Consensus 362 VM~G~~~agt~Es 374 (493) |++-+..|....- T Consensus 202 VL~NTAIA~A~dP 214 (256) T PRK00208 202 VLLNTAIAVAGDP 214 (256) T ss_pred EEHHHHHHCCCCH T ss_conf 2355687726998 No 359 >PRK07114 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional Probab=96.45 E-value=0.019 Score=36.94 Aligned_cols=123 Identities=18% Similarity=0.251 Sum_probs=81.8 Q ss_pred CCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEE Q ss_conf 87655566513701567899987313666998248411388999999999857987245146469999999985487589 Q gi|254780889|r 220 GRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADII 299 (493) Q Consensus 220 grL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~i 299 (493) ..+.+||.-=.. .+.++..+++|++++|- -|.+..+++..+.. +++. ...+.|+.-....+++|++.+ T Consensus 70 p~~~iGaGTVl~---~~~~~~a~~aGA~FiVS---P~~~~~v~~~~~~~-----~~~~-iPGv~TptEi~~A~~~G~~~v 137 (223) T PRK07114 70 PEMILGVGSIVD---AATAALYIQLGANFVVG---PLFNEDIAKVCNRR-----KIPY-SPGCGSVSEIGFAEELGCEIV 137 (223) T ss_pred CCCEEEEECCCC---HHHHHHHHHCCCCEEEC---CCCCHHHHHHHHHC-----CCCC-CCCCCCHHHHHHHHHCCCCEE T ss_conf 980896551889---99999999859989999---99999999999983-----9975-373199999999998799979 Q ss_pred EEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCC-CHHHHEEEEEECCCEEEECCHHH Q ss_conf 96102787665310001574310037889998622698199726543-77762133671487899323010 Q gi|254780889|r 300 KVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIR-FSGDIAKAIAAGSACVMIGSLLA 369 (493) Q Consensus 300 kvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~-~~gdi~kAla~GA~~VM~G~~~a 369 (493) |+= |++. .|...+.++..- -.+++++.-|||. +-..+..-|.+|+.+|=+||+|. T Consensus 138 K~F--Pa~~--------~G~~~lkal~~p-----~p~~~~~PtGGV~ps~~N~~~~l~ag~~~vG~GS~l~ 193 (223) T PRK07114 138 KIF--PGDV--------YGPEFVKAIKGP-----MPWTSIMPTGGVEPTEENLKSWFKAGATCVGMGSKLF 193 (223) T ss_pred EEC--CCCC--------CCHHHHHHHHCC-----CCCCCEEECCCCCCCHHHHHHHHHCCCEEEEECHHHC T ss_conf 889--7323--------599999998464-----9999688799988735509999968997999884638 No 360 >PRK10558 alpha-dehydro-beta-deoxy-D-glucarate aldolase; Provisional Probab=96.43 E-value=0.063 Score=33.22 Aligned_cols=105 Identities=14% Similarity=0.125 Sum_probs=62.4 Q ss_pred CCCCCCCCCC--CCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHH-HH-HHHHHHHHHHCCCCCEECCCCC Q ss_conf 1017422326--568765556651370156789998731366699824841138-89-9999999985798724514646 Q gi|254780889|r 208 SQLNPNATKD--SKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQ-KV-LDAVVQIKKNFPSLLVMAGNIA 283 (493) Q Consensus 208 ~~~~p~A~~D--~~grL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~-~~-~~~i~~~k~~~~~~~ii~GNv~ 283 (493) +..|||..|. .+|+..+|.-+.+.. -.=++.+..+|.|+++||.-||... .. ..++..++. ....+++==--. T Consensus 3 ~~~~pN~lk~~L~~g~~~~G~~~~~~s--p~~~Ei~a~~G~Dfv~iD~EHg~~~~~~l~~~i~a~~~-~~~~~lVRvp~~ 79 (256) T PRK10558 3 NDVFPNKFKAALAAKQVQIGCWSALSN--PITTEVLGLAGFDWLVLDGEHAPNDVSTFIPQLMALKG-SASAPVVRVPTN 79 (256) T ss_pred CCCCCHHHHHHHHCCCCEEEEEECCCC--HHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHHHH-CCCCCEEECCCC T ss_conf 865780999999779977998857899--89999997289899998377899999999999999971-799748967889 Q ss_pred CHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHH Q ss_conf 99999999854875899610278766531000157431003788999862 Q gi|254780889|r 284 TAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAE 333 (493) Q Consensus 284 t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~ 333 (493) ++...+.++|+||++|- +|+.....++.++.+ T Consensus 80 ~~~~i~r~LD~Ga~Gvi------------------vP~V~s~eea~~~V~ 111 (256) T PRK10558 80 EPVIIKRLLDIGFYNFL------------------IPFVETAEEARRAVA 111 (256) T ss_pred CHHHHHHHHCCCCCEEE------------------ECCCCCHHHHHHHHH T ss_conf 88999999707987556------------------147699999999998 No 361 >PRK00748 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Validated Probab=96.42 E-value=0.059 Score=33.45 Aligned_cols=70 Identities=24% Similarity=0.304 Sum_probs=48.9 Q ss_pred CHHHHHHHHHHHCCCC-EEEEEC--CCCHHHHHHHHHHHHHHHCCCCCE-ECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 0156789998731366-699824--841138899999999985798724-5146469999999985487589961 Q gi|254780889|r 232 KDIADRVGPLFDVNVD-LVVVDT--AHGHSQKVLDAVVQIKKNFPSLLV-MAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 232 ~d~~eRa~~LveaGvD-viviD~--ahGh~~~~~~~i~~~k~~~~~~~i-i~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) .|-.+.|+...+.|+| +.++|. |-.....-.+.++.+.+... +|+ ++|.+-|.|.++.++++|||-|-+| T Consensus 29 ~dP~~~A~~~~~~Ga~~lhvvDLd~A~~g~~~n~~~I~~i~~~~~-~pi~vGGGIrs~e~~~~~l~~GadkVvig 102 (241) T PRK00748 29 DDPLAQAQAWQDQGAEWLHLVDLDGAFAGRPVNLELIEAIVAAVD-IPVQLGGGIRDLETVEAYLDAGVARVIIG 102 (241) T ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHCC-CCEEEECCCCCHHHHHHHHHCCCCEEEEC T ss_conf 899999999998799989999785420288207999999998679-99998277074999999997697758864 No 362 >PRK02747 consensus Probab=96.41 E-value=0.062 Score=33.31 Aligned_cols=70 Identities=20% Similarity=0.305 Sum_probs=53.4 Q ss_pred CHHHHHHHHHHHCCCCEE-EEEC--CCCHHHHHHHHHHHHHHHCCCCC-EECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 015678999873136669-9824--84113889999999998579872-45146469999999985487589961 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLV-VVDT--AHGHSQKVLDAVVQIKKNFPSLL-VMAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 232 ~d~~eRa~~LveaGvDvi-viD~--ahGh~~~~~~~i~~~k~~~~~~~-ii~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) +|-.+-++...+.|+|-| ++|. |........+.++.+.+... +| -++|.|-+.|.++.|.++|||-|-+| T Consensus 30 gdP~~~ak~~~~~Gadelh~vDl~~a~~~~~~~~~lI~~i~~~~~-ipi~vGGGIrs~e~~~~ll~~GadkViig 103 (257) T PRK02747 30 GDPVEAARAYDAAGADELCFLDITASHENRGTMLDVVARTAEQCF-MPLTVGGGVRTVDDIRKLLLAGADKVSIN 103 (257) T ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHCC-CCEEEECCCCCHHHHHHHHHCCCCEEEEC T ss_conf 899999999998699989999476775675528999999998669-98898488207388789987699689834 No 363 >PRK07695 transcriptional regulator TenI; Provisional Probab=96.40 E-value=0.061 Score=33.37 Aligned_cols=118 Identities=24% Similarity=0.364 Sum_probs=74.5 Q ss_pred HHHHHHHCCCCEEEEECCCCHHHHHHH-HHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHC Q ss_conf 899987313666998248411388999-9999998579872451464699999999854875899610278766531000 Q gi|254780889|r 237 RVGPLFDVNVDLVVVDTAHGHSQKVLD-AVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVT 315 (493) Q Consensus 237 Ra~~LveaGvDviviD~ahGh~~~~~~-~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~ 315 (493) |+....+.++|-+=+ .+-+ .++..|+.+|+ .+|+.-+.+.+.+......|||.+- +| .+-.|.-.. T Consensus 65 ~~dlA~~~~adGVHL--------Gq~D~~~~~~R~~~~~-~~IG~S~h~~~e~~~a~~~gaDYi~--~G--pif~T~tK~ 131 (202) T PRK07695 65 RVDIALLLNIHRVQL--------GYRSFDVKSVREKFPY-LHVGYSVHSLEEAIQAEKNGADYVV--YG--HVFPTDCKK 131 (202) T ss_pred CHHHHHHCCCCEEEE--------CHHHCCHHHHHHHCCC-CEEEEECCCHHHHHHHHHCCCCEEE--EC--CCCCCCCCC T ss_conf 199998849998921--------8212199999987799-8999957999999999776999699--72--541268888 Q ss_pred CCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCC Q ss_conf 15743100378899986226981997265437776213367148789932301012 Q gi|254780889|r 316 GVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGT 371 (493) Q Consensus 316 g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt 371 (493) +. |.+ .+....+.++...+|++|=|||. ..++...+.+||+.+=+-|.+-+. T Consensus 132 ~~--~~~-G~~~l~~~~~~~~iPvvAIGGI~-~~ni~~v~~~Ga~giAvis~I~~a 183 (202) T PRK07695 132 GV--PAR-GLEELSEIARALSIPVIAIGGIT-PENTRDVLAAGVSGIAVMSGIFSS 183 (202) T ss_pred CC--CCC-CHHHHHHHHHHCCCCEEEECCCC-HHHHHHHHHCCCCEEEEHHHHHCC T ss_conf 98--878-99999999986799989986989-999999998299999971897769 No 364 >PRK13139 consensus Probab=96.34 E-value=0.051 Score=33.89 Aligned_cols=180 Identities=13% Similarity=0.144 Sum_probs=105.5 Q ss_pred CCCCCCCC---CCCCCCCCCCCCCHHHHHHHHHHCCC-CEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEE Q ss_conf 46724400---26786325730358999999986553-306457479978688980210001017422326568765556 Q gi|254780889|r 151 AQQAVGEL---MTRNLITVKKTVNLENAKALLHQHRI-EKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAA 226 (493) Q Consensus 151 ~~~~V~ei---Mt~~litv~~~~~l~eA~~iM~~~ki-~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgA 226 (493) ++.|+.|- +...-..+..+.++.++++++.+-+. ...|+| +.|....+-+. T Consensus 53 FSDP~ADGpvIq~A~~~AL~~G~~~~~~~~~~~~~~~~~~~piv--------lM~Y~N~i~~~----------------- 107 (254) T PRK13139 53 FSEPMADGPVILKANQAALAGGFKVRECFDFAREVVAAFNIPFL--------FMTYYNILFKY----------------- 107 (254) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCEE--------EEEEHHHHHHC----------------- T ss_conf 88866658999999999997699799999999999724897689--------99525999870----------------- Q ss_pred EEECCCHHHHHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHH-HCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEEC Q ss_conf 651370156789998731366-69982484113889999999998-5798724514646999999998548758996102 Q gi|254780889|r 227 AVSVAKDIADRVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKK-NFPSLLVMAGNIATAEGALALIDAGADIIKVGIG 304 (493) Q Consensus 227 AIg~~~d~~eRa~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~-~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig 304 (493) | ..+-+..+.++|+| +|+.|...-.+.... +..++ .+.-+++++.+. +.+-.+.+.+..-..|=+=- T Consensus 108 --G----~e~F~~~~~~~Gv~GvIipDLP~eE~~~~~---~~~~~~gl~~I~lvaPtt-~~~Ri~~i~~~a~gFiY~vs- 176 (254) T PRK13139 108 --G----VERFIDEVADIGVKGLIVPDLPPEQAQDYI---AQCRAKGMAPIGIYAPTS-TDERMGKIAAAADGFIYCVA- 176 (254) T ss_pred --C----HHHHHHHHHHCCCCEEECCCCCHHHHHHHH---HHHHHCCCCEEEEECCCC-CHHHHHHHHHCCCCEEEEEE- T ss_conf --9----999999999759985864799978899999---999846975799945899-98999999851698699996- Q ss_pred CCCCCCCCHHCCCCCCCCHHHHHHHH-HHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 78766531000157431003788999-8622698199726543777621336714878993230101216 Q gi|254780889|r 305 PGSICTTRVVTGVGCPQLSAIMSVVE-VAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 305 ~Gs~CtTr~~~g~g~pq~~av~~~~~-~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) ..-+||.....-..+.+.-+ ..+..++||..-=||+++-|+ |.+..+||.|-+||.|-..-| T Consensus 177 ------~~GvTG~~~~~~~~~~~~i~~ik~~t~~Pv~vGFGI~~~e~v-~~~~~~aDGvIVGSaiVk~ie 239 (254) T PRK13139 177 ------RRGVTGSKTSFDEHVGAFLHRCRAATPLPLAVGFGVKSAADV-DYLKGKADIAVVGSQAIRLFD 239 (254) T ss_pred ------CCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHH-HHHHCCCCEEEECHHHHHHHH T ss_conf ------666679886645889999999985589987997377999999-999716999998889999999 No 365 >cd04742 NPD_FabD 2-Nitropropane dioxygenase (NPD)-like domain, associated with the (acyl-carrier-protein) S-malonyltransferase FabD. NPD is part of the nitroalkaneoxidizing enzyme family, that catalyzes oxidative denitrification of nitroalkanes to their corresponding carbonyl compounds and nitrites. NDPs are members of the NAD(P)H-dependent flavin oxidoreductase family that reduce a range of alternative electron acceptors. Most use FAD/FMN as a cofactor and NAD(P)H as electron donor. Some contain 4Fe-4S cluster to transfer electron from FAD to FMN. Probab=96.34 E-value=0.042 Score=34.53 Aligned_cols=87 Identities=21% Similarity=0.337 Sum_probs=57.1 Q ss_pred ECCCCEECCCEEEECCC-CCCCHHHHHHHHHCCCEEEEC-CCCCHHHHHHHHHHHHHHC-CC--EECCCEEEECCCCHHH Q ss_conf 03881600317833785-236888999999779879997-8989899999999874223-88--1528789946543034 Q gi|254780889|r 39 IAKDFTLNLPIMSAAMD-QVTDSRLAIAMAQAGGLGVIH-RNFSPSEQVAQVHQVKKFE-SG--MVVNPVTISPYATLAD 113 (493) Q Consensus 39 lt~~i~l~iPivSs~MD-TVTe~~MAiamA~~GGiGvIH-rn~sie~qa~~V~kVKr~e-~g--mI~dPVti~pd~TI~e 113 (493) |.+...++-|-++-.|- -++..+|.++|++.|-||+.- -.+++++..+-|.++++.- ++ +=-| +.-+|+++-.| T Consensus 6 F~~~ygvryaY~aGaMa~GIaS~~lViam~~aG~lgffGa~GL~~~~ve~ai~~i~~~L~~~~pyg~N-Lihsp~~p~~E 84 (418) T cd04742 6 FKEDYGLRYAYVAGAMARGIASAELVVAMGKAGMLGFFGAGGLPLDEVEQAIERIQAALGNGEPYGVN-LIHSPDEPELE 84 (418) T ss_pred HHHHHCCCCEECCCCCCCCCCCHHHHHHHHHCCCEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCEEE-ECCCCCCHHHH T ss_conf 89984887300024232664769999999857730455678999799999999999856899984687-33799965889 Q ss_pred --HHHHHHHHCCCCE Q ss_conf --5656676050416 Q gi|254780889|r 114 --ALALMKKYSISGI 126 (493) Q Consensus 114 --a~~lm~~~~is~i 126 (493) ..++..+++++.+ T Consensus 85 ~~~vdL~L~~gVr~V 99 (418) T cd04742 85 EGLVDLFLRHGVRVV 99 (418) T ss_pred HHHHHHHHHCCCCEE T ss_conf 999999997699889 No 366 >COG0821 gcpE 1-hydroxy-2-methyl-2-(e)-butenyl 4-diphosphate synthase [Lipid metabolism] Probab=96.34 E-value=0.076 Score=32.64 Aligned_cols=211 Identities=19% Similarity=0.241 Sum_probs=95.2 Q ss_pred CEECCCEEEECCCCCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHC Q ss_conf 16003178337852368889999997798799978989899999999874223881528789946543034565667605 Q gi|254780889|r 43 FTLNLPIMSAAMDQVTDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQVKKFESGMVVNPVTISPYATLADALALMKKYS 122 (493) Q Consensus 43 i~l~iPivSs~MDTVTe~~MAiamA~~GGiGvIHrn~sie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~ 122 (493) +.-.-||+.-.|.+---. +++.-+++|++..+.-.-+++ .--|+.--++|+..+.++- T Consensus 16 vGgdaPI~VQSMTnT~T~-------------------Dv~aTv~QI~~L~~aG~dIVR---vtv~~~e~A~A~~~Ik~~~ 73 (361) T COG0821 16 VGGDAPIVVQSMTNTDTA-------------------DVEATVAQIKALERAGCDIVR---VTVPDMEAAEALKEIKQRL 73 (361) T ss_pred ECCCCCEEEEECCCCCCC-------------------CHHHHHHHHHHHHHCCCCEEE---EECCCHHHHHHHHHHHHHC T ss_conf 669995688861577730-------------------179999999999981897899---9369978999999999847 Q ss_pred CCCEEEECCCCEEEEEEEEHHHHCCC-CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEE Q ss_conf 04167866886027887235450014-34467244002678632573035899999998655330645747997868898 Q gi|254780889|r 123 ISGIPVVESDVGKLVGILTNRDVRFA-SNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLIT 201 (493) Q Consensus 123 is~iPVVD~~~gkLiGIVT~rDir~~-~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT 201 (493) .+|+|-|= + .|.++. ...+..+..+- =+|=-+.....+.++.+.-.++++-.=.=|| .|.| T Consensus 74 --~vPLVaDi--H-------f~~rla~~~~~~g~~k~R-INPGNig~~~~v~~vVe~Ak~~g~piRIGVN-~GSL----- 135 (361) T COG0821 74 --NVPLVADI--H-------FDYRLALEAAECGVDKVR-INPGNIGFKDRVREVVEAAKDKGIPIRIGVN-AGSL----- 135 (361) T ss_pred --CCCEEEEE--E-------CCHHHHHHHHHCCCCEEE-ECCCCCCCHHHHHHHHHHHHHCCCCEEEECC-CCCH----- T ss_conf --99879873--0-------568999975642742698-7786567367799999999975998799526-6861----- Q ss_pred ECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHC--C-CC--- Q ss_conf 02100010174223265687655566513701567899987313666998248411388999999999857--9-87--- Q gi|254780889|r 202 VKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNF--P-SL--- 275 (493) Q Consensus 202 ~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~--~-~~--- 275 (493) -++++..-.+|-+ + +.-+...+-++.|.+-|-+=|.+.+-+..-..+++.-+.+.+.. | ++ T Consensus 136 ek~~~~ky~~pt~--e-----------alveSAl~~a~~~e~l~f~~i~iS~K~Sdv~~~v~aYr~lA~~~dyPLHLGvT 202 (361) T COG0821 136 EKRLLEKYGGPTP--E-----------ALVESALEHAELLEELGFDDIKVSVKASDVQLMVAAYRLLAKRCDYPLHLGVT 202 (361) T ss_pred HHHHHHHHCCCCH--H-----------HHHHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCEE T ss_conf 6999998547987--8-----------99999999999999779986799987387899999999999866877201410 Q ss_pred ---CEECCCCCCHHHHHHHHHCCC-CEEEEEECCC Q ss_conf ---245146469999999985487-5899610278 Q gi|254780889|r 276 ---LVMAGNIATAEGALALIDAGA-DIIKVGIGPG 306 (493) Q Consensus 276 ---~ii~GNv~t~~~~~~l~~~Ga-d~ikvGig~G 306 (493) ....|-|.+.-+.-.|..-|- |.|||-+-+- T Consensus 203 EAG~~~~G~VkSa~alg~LL~eGIGDTIRVSLt~~ 237 (361) T COG0821 203 EAGMGFKGIVKSAAALGALLSEGIGDTIRVSLTAD 237 (361) T ss_pred CCCCCCCCEEHHHHHHHHHHHHCCCCEEEEECCCC T ss_conf 35676450007789999999725776599965888 No 367 >PRK12331 oxaloacetate decarboxylase; Provisional Probab=96.34 E-value=0.059 Score=33.43 Aligned_cols=82 Identities=23% Similarity=0.250 Sum_probs=58.5 Q ss_pred HHHHHHHHHHCCCCEEEE-ECCCC-HHHHHHHHHHHHHHHCCCCCEEC--CCCCCHHHH--HHHHHCCCCEEEEEECCCC Q ss_conf 567899987313666998-24841-13889999999998579872451--464699999--9998548758996102787 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVV-DTAHG-HSQKVLDAVVQIKKNFPSLLVMA--GNIATAEGA--LALIDAGADIIKVGIGPGS 307 (493) Q Consensus 234 ~~eRa~~LveaGvDvivi-D~ahG-h~~~~~~~i~~~k~~~~~~~ii~--GNv~t~~~~--~~l~~~Gad~ikvGig~Gs 307 (493) +.+.++.|.++|+|.|+| |+|-- --..+-+.|+.||+.+ +++|-. -|-.--..+ ..-++||||.|-+.|.|=| T Consensus 156 yv~~a~~l~~~Gad~I~ikD~aGll~P~~~~eLV~aLk~~~-~lpI~~HtH~t~Gla~an~laAieAGaDivD~a~~~~s 234 (463) T PRK12331 156 FVKLAKEMQEIGADSICIKDMAGILTPYVAYELVKCIKENV-TVPLEVHTHATSGIAEMTYLKAIEAGADIIDTAISPFA 234 (463) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHC-CCCEEEEECCCCCHHHHHHHHHHHCCCCEEEECCCCCC T ss_conf 99999999964998899867867768899999999999744-98569983688757999999999849999962353546 Q ss_pred CCCCCHHCC Q ss_conf 665310001 Q gi|254780889|r 308 ICTTRVVTG 316 (493) Q Consensus 308 ~CtTr~~~g 316 (493) -.|.+.-++ T Consensus 235 ~gtsqP~~~ 243 (463) T PRK12331 235 GGTSQPATE 243 (463) T ss_pred CCCCCCCHH T ss_conf 797898799 No 368 >PRK08508 biotin synthase; Provisional Probab=96.32 E-value=0.098 Score=31.86 Aligned_cols=138 Identities=18% Similarity=0.246 Sum_probs=84.5 Q ss_pred CEEEEEEEECCCHHHHHHHHHHHCCCCEEE--EECC----------CCHHHHHHHHHHHHHHHCCCCCEECCCCC----C Q ss_conf 765556651370156789998731366699--8248----------41138899999999985798724514646----9 Q gi|254780889|r 221 RLRVAAAVSVAKDIADRVGPLFDVNVDLVV--VDTA----------HGHSQKVLDAVVQIKKNFPSLLVMAGNIA----T 284 (493) Q Consensus 221 rL~VgAAIg~~~d~~eRa~~LveaGvDviv--iD~a----------hGh~~~~~~~i~~~k~~~~~~~ii~GNv~----t 284 (493) .+-+.|.+|.. +.|.+++|-+||+|-.. +++| |.+.+ =+++++++|+. .+.+..|.+. | T Consensus 90 ~l~~c~slG~l--~~e~~~~LkeAGvdrY~hNlETs~~~y~~I~tThty~d-Rl~tl~~~k~a--Gl~vCsGgIiGlGEt 164 (279) T PRK08508 90 GLHLIACNGMA--SVEQLKELKKAGIFSYNHNLETSKEFFPKICTTHSWEE-RFQTCLNAKEA--GLGLCSGGIFGLGES 164 (279) T ss_pred CEEEEEECCCC--CHHHHHHHHHCCCCEECCCCCCCHHHHCCCCCCCCHHH-HHHHHHHHHHC--CCEEECCCEEECCCC T ss_conf 93576117857--99999999983971230766767687576589988899-99999999981--994867854478999 Q ss_pred HHH----HHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHC--C---CCEEEEECCC-CCHHHHE-E Q ss_conf 999----99998548758996102787665310001574310037889998622--6---9819972654-3777621-3 Q gi|254780889|r 285 AEG----ALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAER--A---GVAIVADGGI-RFSGDIA-K 353 (493) Q Consensus 285 ~~~----~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~--~---~~~iiadGGi-~~~gdi~-k 353 (493) .|. +..|-+-++|-|-+++=- -+--|.. ..|+++. .+|-+...- . +..|-.-||- .+.+|.- + T Consensus 165 ~edrve~a~~L~eL~~dsVPIN~li-Pi~GTPL----e~~~l~~-~e~lr~iAl~RlilP~a~Ir~agGRe~~l~~~q~~ 238 (279) T PRK08508 165 WEDRISMLKSLASLSPHSTPINFFI-PNPALPL----DTPTLSA-DEALEIVRLAKEALPNARLMVAGGREVVFGERQYE 238 (279) T ss_pred HHHHHHHHHHHHHCCCCEECCCCCC-CCCCCCC----CCCCCCH-HHHHHHHHHHHHHCCCCEEEECCCHHHHCHHHHHH T ss_conf 8999999999983899875156765-8999988----8899999-99999999999978987656246524455636999 Q ss_pred EEEECCCEEEECCHHH Q ss_conf 3671487899323010 Q gi|254780889|r 354 AIAAGSACVMIGSLLA 369 (493) Q Consensus 354 Ala~GA~~VM~G~~~a 369 (493) +|.+||+.+|.|.+|- T Consensus 239 ~~~aGaN~i~~G~yLT 254 (279) T PRK08508 239 IFEAGANAIVIGDYLT 254 (279) T ss_pred HHHHCCCEEEECCCCC T ss_conf 9984684688866527 No 369 >PRK09282 pyruvate carboxylase subunit B; Validated Probab=96.30 E-value=0.057 Score=33.54 Aligned_cols=134 Identities=22% Similarity=0.191 Sum_probs=79.9 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECC-------CHHHHHHHHHHH Q ss_conf 5899999998655330645747997868898021000101742232656876555665137-------015678999873 Q gi|254780889|r 171 NLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVA-------KDIADRVGPLFD 243 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~-------~d~~eRa~~Lve 243 (493) -++...+...++++...-|.|.=+.+-.+... -.+.+. .|.. +-++|.-. +-+.+.|+.|++ T Consensus 98 vv~~fv~~~~~~GidifRiFD~LNdv~nm~~~---------~~~v~~-~G~~-~e~~i~yt~sP~ht~~yy~~~a~~l~~ 166 (580) T PRK09282 98 VVEKFVEKAAENGIDVFRIFDALNDVRNMETA---------IKAVKK-VGAH-AQGTISYTTSPVHTLETWVDLAKQLEE 166 (580) T ss_pred HHHHHHHHHHHCCCCEEEEEECCCCHHHHHHH---------HHHHHH-CCCE-EEEEEEEECCCCCCHHHHHHHHHHHHH T ss_conf 99999999997699789974012325778999---------999984-5987-999999715887638999999999997 Q ss_pred CCCCEEEE-ECCCC-HHHHHHHHHHHHHHHCCCCCEECCCCCCHHH----HHHHHHCCCCEEEEEECCCCCCCCCHHCC Q ss_conf 13666998-24841-1388999999999857987245146469999----99998548758996102787665310001 Q gi|254780889|r 244 VNVDLVVV-DTAHG-HSQKVLDAVVQIKKNFPSLLVMAGNIATAEG----ALALIDAGADIIKVGIGPGSICTTRVVTG 316 (493) Q Consensus 244 aGvDvivi-D~ahG-h~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~----~~~l~~~Gad~ikvGig~Gs~CtTr~~~g 316 (493) .|+|.|+| |.|-- .-..+-+.++.||+.+ ++||-.-.=.|.-. ...-++||||+|-+.+.|=|-.|.+.-++ T Consensus 167 ~G~d~i~iKDmaGll~P~~a~~LV~alk~~~-~lpI~~HtH~t~G~~~a~~l~A~eAGvdivD~a~~~~s~gtsqP~~~ 244 (580) T PRK09282 167 MGCDSICIKDMAGLLTPYAAYELVSALKKEV-DLPVHLHCHATTGLATMTYLKAVEAGVDILDTAISSMSMGTSHPPTE 244 (580) T ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHC-CCCEEEEECCCCCHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCHH T ss_conf 4998899726655789899999999999861-98169982477647999999999848886700135534887898799 No 370 >PRK13121 consensus Probab=96.28 E-value=0.052 Score=33.85 Aligned_cols=181 Identities=17% Similarity=0.195 Sum_probs=106.0 Q ss_pred CCCCCCCCCC---CCCCCCCCCCCCHHHHHHHHHHCCC--CEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEE Q ss_conf 4467244002---6786325730358999999986553--3064574799786889802100010174223265687655 Q gi|254780889|r 150 NAQQAVGELM---TRNLITVKKTVNLENAKALLHQHRI--EKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRV 224 (493) Q Consensus 150 ~~~~~V~eiM---t~~litv~~~~~l~eA~~iM~~~ki--~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~V 224 (493) .++.|+.|-. ......+..+.++.+.++++.+-|. ...|+| +.|....+.+. T Consensus 53 PfSDP~ADGPvIq~A~~rAL~~G~~~~~~~~~~~~~r~~~~~~Piv--------lM~Y~N~i~~y--------------- 109 (265) T PRK13121 53 PFSDPMADGPVIQRASERALAHGVSLRQVLAMVKEFRETNQTTPVV--------LMGYANPIERM--------------- 109 (265) T ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCEE--------EEEHHHHHHHH--------------- T ss_conf 8899776589999999999977998467799999831037999989--------86214599997--------------- Q ss_pred EEEEECCCHHHHHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHH-HCCCCCEECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 56651370156789998731366-69982484113889999999998-57987245146469999999985487589961 Q gi|254780889|r 225 AAAVSVAKDIADRVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKK-NFPSLLVMAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 225 gAAIg~~~d~~eRa~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~-~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) ...+-+..+.++|+| +|+.|...-.+.... +.+++ .+.-+++++.+- +.+-.+.+.+..-..|=+= T Consensus 110 --------G~e~F~~~~~~aGvdGlIipDLP~eE~~~~~---~~~~~~gl~~I~lvaPtt-~~~Ri~~i~~~~~gFiY~V 177 (265) T PRK13121 110 --------GYDAFAAAARAAGVDGVLVVDYPPEECEEFA---AKMRAAGIDPIFLLAPTS-TDERIAAVARVASGYVYYV 177 (265) T ss_pred --------HHHHHHHHHHHCCCCEEECCCCCHHHHHHHH---HHHHHCCCCEEEEECCCC-CHHHHHHHHHHCCCEEEEE T ss_conf --------1999999998729873434899989999999---999865996689958999-8999999996289809997 Q ss_pred ECCCCCCCCCHHCCCCCCCCHHHH-HHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 027876653100015743100378-89998622698199726543777621336714878993230101216 Q gi|254780889|r 303 IGPGSICTTRVVTGVGCPQLSAIM-SVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 303 ig~Gs~CtTr~~~g~g~pq~~av~-~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) +..-+||..-....-+. .+.+..+..++||..-=||+++-|+ +++..+||.|-+||.|-..-| T Consensus 178 -------s~~GvTG~~~~~~~~~~~~i~~ik~~t~~Pv~vGFGIs~~e~~-~~v~~~ADGvIVGSaiV~~i~ 241 (265) T PRK13121 178 -------SLKGVTGAATLDVSSVAAKLPAIRSHVPLPVGVGFGIRDAATA-RAVAEVADAVVIGSRLVQLIE 241 (265) T ss_pred -------ECCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHH-HHHHHCCCEEEECHHHHHHHH T ss_conf -------5555667775662889999999985479985997688989999-999811999998489999998 No 371 >PRK13111 trpA tryptophan synthase subunit alpha; Provisional Probab=96.28 E-value=0.081 Score=32.47 Aligned_cols=169 Identities=19% Similarity=0.239 Sum_probs=99.4 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCC-CEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHH Q ss_conf 86325730358999999986553-30645747997868898021000101742232656876555665137015678999 Q gi|254780889|r 162 NLITVKKTVNLENAKALLHQHRI-EKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGP 240 (493) Q Consensus 162 ~litv~~~~~l~eA~~iM~~~ki-~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~ 240 (493) ....+..++++.+.++++.+-|. ...|++ +.|....+-+ .| ..+-+.. T Consensus 60 ~~~AL~~G~~~~~~f~~~~~~r~~~~~piv--------lM~Y~N~i~~-------------------~G----~e~F~~~ 108 (256) T PRK13111 60 SLRALAAGVTLADVLELLREIRAKPTIPIV--------LMTYYNPIFQ-------------------YG----VEAFAAD 108 (256) T ss_pred HHHHHHCCCCHHHHHHHHHHHHCCCCCCEE--------EEEECCHHHH-------------------HC----HHHHHHH T ss_conf 999997799699999999998606899889--------9850308987-------------------09----9999999 Q ss_pred HHHCCCC-EEEEECCCCHHHHHHHHHHHHHH-HCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCC Q ss_conf 8731366-69982484113889999999998-579872451464699999999854875899610278766531000157 Q gi|254780889|r 241 LFDVNVD-LVVVDTAHGHSQKVLDAVVQIKK-NFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVG 318 (493) Q Consensus 241 LveaGvD-viviD~ahGh~~~~~~~i~~~k~-~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g 318 (493) +.++|+| +|+.|..--++... .+..++ ...-+++++.+. +.+-.+.+.+..-..|=+= +..-+||.. T Consensus 109 ~~~~GvdGvIipDLP~eE~~~~---~~~~~~~gi~~I~lvaPtt-~~~Ri~~i~~~s~gfiY~v-------s~~GvTG~~ 177 (256) T PRK13111 109 AAEAGVDGLIIPDLPPEEAEEF---RAAAKKHGIDLIFLVAPTT-TDERLKKIASHASGFVYYV-------SRAGVTGAR 177 (256) T ss_pred HHHCCCCEEEECCCCHHHHHHH---HHHHHHCCCEEEEEECCCC-CHHHHHHHHHHCCCEEEEE-------ECCCCCCCC T ss_conf 9975997798169997888999---9999975980899969999-8899999996269859998-------567767887 Q ss_pred CCCCHHHHHHHHHHH-CCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 431003788999862-2698199726543777621336714878993230101216 Q gi|254780889|r 319 CPQLSAIMSVVEVAE-RAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 319 ~pq~~av~~~~~~~~-~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) -+....+.+.-+..+ ..++||..-=||+++.|+.+ +..+||.|-+||.|-..-| T Consensus 178 ~~~~~~~~~~i~~ik~~t~~Pi~vGFGIs~~e~v~~-~~~~aDGvIVGSaiv~~i~ 232 (256) T PRK13111 178 SADAADVADLLARLKAHTDLPVAVGFGISTPEQAAA-IAEGADGVIVGSALVKIIE 232 (256) T ss_pred CCCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHH-HHCCCCEEEECHHHHHHHH T ss_conf 666288999999998706897588528899999999-9745999998689999998 No 372 >PRK13112 consensus Probab=96.28 E-value=0.056 Score=33.60 Aligned_cols=181 Identities=15% Similarity=0.164 Sum_probs=106.9 Q ss_pred CCCCCCCC---CCCCCCCCCCCCCHHHHHHHHHHCCC--CEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEE Q ss_conf 46724400---26786325730358999999986553--30645747997868898021000101742232656876555 Q gi|254780889|r 151 AQQAVGEL---MTRNLITVKKTVNLENAKALLHQHRI--EKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVA 225 (493) Q Consensus 151 ~~~~V~ei---Mt~~litv~~~~~l~eA~~iM~~~ki--~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~Vg 225 (493) ++.|+.|- ..-....+..+.++.++++++.+-+. ...|++ +.|.-..+.+. T Consensus 55 FSDPvADGPvIQ~A~~rAL~~G~~~~~~~~~~~~ir~~~~~~Piv--------lM~Y~N~i~~~---------------- 110 (279) T PRK13112 55 FSDPMADGPAIQAAGLRALKAGQTLAKTLYLAREFRKDDDTTPIV--------LMGYYNPIYIY---------------- 110 (279) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCEE--------EEEECHHHHHH---------------- T ss_conf 898666579999999999976996889999999851348998879--------98512499884---------------- Q ss_pred EEEECCCHHHHHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEEC Q ss_conf 6651370156789998731366-699824841138899999999985798724514646999999998548758996102 Q gi|254780889|r 226 AAVSVAKDIADRVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIG 304 (493) Q Consensus 226 AAIg~~~d~~eRa~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig 304 (493) | ...-+..+.++|+| +|+.|...-.+....+..+ +....-+++++.+. +.+-.+.+.+.....|-+= T Consensus 111 ---G----~e~F~~~~~~aGvdGvIipDLP~eE~~~~~~~~~--~~~i~~I~lvaPtt-~~eRi~~i~~~s~GFiY~V-- 178 (279) T PRK13112 111 ---G----VERFLTDAKAAGVDGLIVVDLPPEMDAELCIPAM--KAGINFIRLATPTT-DDKRLPKVLANTSGFVYYV-- 178 (279) T ss_pred ---C----HHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHH--HCCCCEEEEECCCC-CHHHHHHHHHCCCCCEEEE-- T ss_conf ---7----9999999997399879846999788899999998--57834699825899-8999999985278808998-- Q ss_pred CCCCCCCCHHCCCCCCCCHHHHHHHH-HHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 78766531000157431003788999-8622698199726543777621336714878993230101216 Q gi|254780889|r 305 PGSICTTRVVTGVGCPQLSAIMSVVE-VAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 305 ~Gs~CtTr~~~g~g~pq~~av~~~~~-~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) +..-+||...+....+.+--+ ..+..++||..-=||+++-| +|++..+||-|-+||.|-..-| T Consensus 179 -----s~~GvTG~~~~~~~~~~~~i~~ik~~t~~Pv~vGFGIs~~e~-~~~~~~~aDGvIVGSAiVk~Ie 242 (279) T PRK13112 179 -----SMTGITGSALADTSAVGEAVARIKRHTDLPVCVGFGVKTPEQ-ARAIAAHADGVVVGTAIVNALA 242 (279) T ss_pred -----ECCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEEECCCCHHH-HHHHHCCCCEEEECHHHHHHHH T ss_conf -----356666766456488999999999717898767835699999-9999725999998779999998 No 373 >PRK13126 consensus Probab=96.27 E-value=0.11 Score=31.40 Aligned_cols=130 Identities=16% Similarity=0.140 Sum_probs=77.8 Q ss_pred HHHHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHHH-CCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCC Q ss_conf 56789998731366-699824841138899999999985-7987245146469999999985487589961027876653 Q gi|254780889|r 234 IADRVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKKN-FPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTT 311 (493) Q Consensus 234 ~~eRa~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~~-~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtT 311 (493) ..+....+.++|+| +|+.|.-.-+.+...+.++..++. ...+++++ --.+.+-.+.+.+..-..+=|+ . T Consensus 84 ~~~f~~~~~~aGvdGlIipDLP~e~~ee~~~~~~~~~~~gl~~I~lv~-ptt~~~ri~~i~~~s~gfiYvs--------~ 154 (237) T PRK13126 84 PAELFETAAEVGARGVLAPDLLIDFPGDLERYLELSREYGLAPSFFIP-SKFPHRLLRRLASLEPDFIYLG--------L 154 (237) T ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHHHHHCCCCEEEEEC-CCCHHHHHHHHHHHCCCEEEEE--------E T ss_conf 999999998749973883688877817789999999976997799738-9983999999998589879998--------6 Q ss_pred CHHCCCCCCCCHHHHHHHHHHHCC-CCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 100015743100378899986226-98199726543777621336714878993230101216 Q gi|254780889|r 312 RVVTGVGCPQLSAIMSVVEVAERA-GVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 312 r~~~g~g~pq~~av~~~~~~~~~~-~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) +-+||.-.|. +.-.......+.. ++|+..-=||+++-|+.+++..|||-|-+||.|-..-| T Consensus 155 ~gvTG~~~~~-~~~~~i~~ir~~~~~~Pv~vGFGI~t~e~~~~~~~~~aDGvIVGSaiV~~i~ 216 (237) T PRK13126 155 YAATGIELPV-YVERNVKTIRGLAGDVYLVAGFAIDSPEKAARLVKAGADGVVVGTAFMRRLS 216 (237) T ss_pred ECCCCCCCHH-HHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHCCCCEEEECHHHHHHHH T ss_conf 5266764156-7999999999857899779994539999999998648999998789999999 No 374 >KOG2118 consensus Probab=96.26 E-value=0.022 Score=36.59 Aligned_cols=147 Identities=16% Similarity=0.139 Sum_probs=100.8 Q ss_pred CCCHHHHHHHHHCCCE-E-EECCCCCHHHHHHHHHHHHHHCCCE--ECCCEEEECCCCHH-HHHHHHHHHCCCCEEEECC Q ss_conf 3688899999977987-9-9978989899999999874223881--52878994654303-4565667605041678668 Q gi|254780889|r 57 VTDSRLAIAMAQAGGL-G-VIHRNFSPSEQVAQVHQVKKFESGM--VVNPVTISPYATLA-DALALMKKYSISGIPVVES 131 (493) Q Consensus 57 VTe~~MAiamA~~GGi-G-vIHrn~sie~qa~~V~kVKr~e~gm--I~dPVti~pd~TI~-ea~~lm~~~~is~iPVVD~ 131 (493) ..+..+...+-..=+. | ..|...+|-.-+ ..-..|..++.| |.++..+..+..+. +......+++++.+||.+. T Consensus 168 ~~~l~~lv~~~~~e~~~g~~~~~e~~ii~g~-l~l~ek~~~evmtpi~~~f~l~~n~~l~~~~~~~i~~~g~sripv~~~ 246 (498) T KOG2118 168 RASLLALVQLVGNEAGKGDLTYDELTIITGA-LELTEKLVGEVMTPIEDVFALDANTKLDRETVGEIVKHGYSRIPVYEQ 246 (498) T ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH-HHHHHHHHHHHCCCHHHHHHHHHCCCCCHHHHHHHHHCCCCEEEECCC T ss_conf 2779999998750013575524577887536-899999988730614443001100210178888887658760000366 Q ss_pred CC-EEEEEEEEHHHHCCC----CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCC Q ss_conf 86-027887235450014----344672440026786325730358999999986553306457479978688980210 Q gi|254780889|r 132 DV-GKLVGILTNRDVRFA----SNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDI 205 (493) Q Consensus 132 ~~-gkLiGIVT~rDir~~----~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DI 205 (493) +. .++.+.+....+++. .....++.+.|-.++.-+++++++.+.++.+.+.+.+.+.|++ ...-.+++|..|+ T Consensus 247 ~~~~~i~~~L~~~~~~~~~~~~~~~~~~v~~~~~~~l~~vp~~~~~~~~l~~~~~~~~H~~~v~~-~~~~~~~~~l~~~ 324 (498) T KOG2118 247 EPKNKIGGLLVMNLLRLLQVEVPLEPLPVSESALLRLPLVPENMPLLDLLNEFQKGKSHMAVVRN-GHVDIFVLTLEDL 324 (498) T ss_pred CCCCEEHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEC-CCCCEEEEECCCH T ss_conf 65200004443312332043345663003455412545688743088889998731213688824-8762444641414 No 375 >CHL00162 thiG thiamin biosynthesis protein G; Validated Probab=96.26 E-value=0.033 Score=35.25 Aligned_cols=87 Identities=23% Similarity=0.295 Sum_probs=59.8 Q ss_pred EECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEE Q ss_conf 45146469999999985487589961027876653100015743100378899986226981997265437776213367 Q gi|254780889|r 277 VMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIA 356 (493) Q Consensus 277 ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla 356 (493) |..=--.++--++.|.++|+.+|-- -||.- =+|-|.-.-..+. ...+..++|||-|.||-.+.|.+.|.- T Consensus 140 VlpY~~dD~v~akrLe~~Gc~avMP---lgsPI----GSg~Gl~n~~~l~---~i~e~~~vPvIVDAGiG~pSdAa~aME 209 (267) T CHL00162 140 VLPYINADPVLAKQLEDIGCATVMP---LGSPI----GSGQGLQNLLNLQ---IIIENAKIPVIIDAGIGTPSEASQAME 209 (267) T ss_pred EEEECCCCHHHHHHHHHCCCEEEEE---CCCCC----CCCCCCCCHHHHH---HHHHCCCCCEEEECCCCCHHHHHHHHH T ss_conf 9895489989999998659868863---45512----3688758999999---999648998899689896788899997 Q ss_pred ECCCEEEECCHHHCCCC Q ss_conf 14878993230101216 Q gi|254780889|r 357 AGSACVMIGSLLAGTDE 373 (493) Q Consensus 357 ~GA~~VM~G~~~agt~E 373 (493) +|+|+|++-+..|.... T Consensus 210 lG~DaVL~NTAIA~A~d 226 (267) T CHL00162 210 LGASGVLLNTAVAKAKN 226 (267) T ss_pred CCCCEEEECHHHHCCCC T ss_conf 46777870167671699 No 376 >PRK11750 gltB glutamate synthase subunit alpha; Provisional Probab=96.26 E-value=0.052 Score=33.85 Aligned_cols=143 Identities=19% Similarity=0.278 Sum_probs=86.5 Q ss_pred HHHCCCCEEEEE-ECCC--CCCCCCHHCCCCCCCCHHHHHHHHHHHCC----CCEEEEECCCCCHHHHEEEEEECCCEEE Q ss_conf 985487589961-0278--76653100015743100378899986226----9819972654377762133671487899 Q gi|254780889|r 291 LIDAGADIIKVG-IGPG--SICTTRVVTGVGCPQLSAIMSVVEVAERA----GVAIVADGGIRFSGDIAKAIAAGSACVM 363 (493) Q Consensus 291 l~~~Gad~ikvG-ig~G--s~CtTr~~~g~g~pq~~av~~~~~~~~~~----~~~iiadGGi~~~gdi~kAla~GA~~VM 363 (493) .++||||.|-+- --+| ..=-| .+.-.|.|-=--+.|.-++.-.. .|-+++|||+|+..||++|--+||+-.= T Consensus 1015 VAKA~AD~I~ISG~dGGTGAsp~t-SikhaGlPwElGlaEthq~L~~n~LR~rV~l~~DGglkTGrDVviaalLGAeefg 1093 (1483) T PRK11750 1015 VAKAYADLITISGYDGGTGASPLT-SVKYAGSPWELGLAETHQALVANGLRHKIRLQVDGGLKTGLDVIKAAILGAESFG 1093 (1483) T ss_pred HHHCCCCEEEECCCCCCCCCCCHH-HHCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHHHHCCHHHC T ss_conf 120478889981688987767323-4203888654438999999997375463899966985616899999872605543 Q ss_pred ECCHH--H-CCCCCCCCEEEECCEEEEEEECCCCHHHHHHCCHHHHCCCCCCCCCCEECCCCEEEEE------CCCCHH- Q ss_conf 32301--0-1216776448736805888733200688752212221012333433100454026621------057589- Q gi|254780889|r 364 IGSLL--A-GTDESPGDIFLYQGRSFKSYRGMGSVAAMERGSSARYSQDGVTDVLKLVPEGIEGRVP------YKGPIA- 433 (493) Q Consensus 364 ~G~~~--a-gt~Espg~~~~~~g~~~K~yrGm~S~~A~~~~~~~ry~~~~~~~~~k~v~eGv~~~v~------~~G~~~- 433 (493) .|..- | ||. |.+ --+ ...-|-||..+-| |+|..+ T Consensus 1094 fgT~~Lia~GCi-------------------------M~R-----~CH------lntCpvGIATQ~~~Lr~~~f~G~pe~ 1137 (1483) T PRK11750 1094 FGTAPMVALGCK-------------------------YLR-----ICH------LNNCATGVATQDEKLRKNHYHGLPEM 1137 (1483) T ss_pred CCHHHHHHHHHH-------------------------HHH-----HHC------CCCCCCCCCCCCHHHHHHHCCCCHHH T ss_conf 442789984408-------------------------878-----642------58898740158988886342898899 Q ss_pred --HHHHHHHHHHHHHHHHCCCCCHHHHHHCCEEEEECHH Q ss_conf --9899999999985551575877689737599999553 Q gi|254780889|r 434 --SVLHQMSGGLKSSMGYVGASNIEEFQKKANFIRVSVA 470 (493) Q Consensus 434 --~~~~~~~~glrs~~~y~G~~~i~e~~~~~~f~~~t~a 470 (493) +++..+..-+|-=|...|.++|.|+..++.+.++... T Consensus 1138 vvn~f~~vAeevReilA~LG~rsl~e~iGr~dlL~~~~~ 1176 (1483) T PRK11750 1138 VMNYFEFIAEETREWMAQLGVRSLEDLIGRTDLLEELEG 1176 (1483) T ss_pred HHHHHHHHHHHHHHHHHHHCHHHHHHHHCCCCCCCCCCC T ss_conf 999999999999999999684159998486340001567 No 377 >PRK13134 consensus Probab=96.26 E-value=0.062 Score=33.28 Aligned_cols=173 Identities=13% Similarity=0.102 Sum_probs=100.9 Q ss_pred CCCCCCCCCHHHHHHHHHHCCCC-EEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHH Q ss_conf 63257303589999999865533-06457479978688980210001017422326568765556651370156789998 Q gi|254780889|r 163 LITVKKTVNLENAKALLHQHRIE-KLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPL 241 (493) Q Consensus 163 litv~~~~~l~eA~~iM~~~ki~-~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~L 241 (493) ...+..+.++.+.++.+.+.+.+ ..|+| +.|.-..+-+ ....+-+..+ T Consensus 71 ~rAL~~G~~~~~~~~~~~~~~~~~~~piv--------lMtY~N~i~~-----------------------yG~e~F~~~~ 119 (257) T PRK13134 71 QRALESGVTLRWIMDGLAARKGRLRAGLV--------LMGYLNPFMQ-----------------------YGFERFVRDA 119 (257) T ss_pred HHHHHCCCCHHHHHHHHHHHHCCCCCCEE--------EEECCHHHHH-----------------------HHHHHHHHHH T ss_conf 99996799878999999987446899989--------9853459997-----------------------4689999999 Q ss_pred HHCCCC-EEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCC Q ss_conf 731366-6998248411388999999999857987245146469999999985487589961027876653100015743 Q gi|254780889|r 242 FDVNVD-LVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCP 320 (493) Q Consensus 242 veaGvD-viviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~p 320 (493) .++|+| +|+.|..--.+....+.++ +..+.-+++++.+- +.+-.+.+.+..-..|=+= +..-+||.-.. T Consensus 120 ~~aGvdGvIipDLP~eE~~~~~~~~~--~~gi~~I~lvaPtt-~~~Ri~~i~~~s~gFIY~v-------s~~GvTG~~~~ 189 (257) T PRK13134 120 ADAGVAGCIIPDLPLDEDADLRALLA--ARGMDLIALVGPNT-GEGRMREYAAVASGYVYVV-------SVMGTTGVRDG 189 (257) T ss_pred HHCCCCEEEECCCCHHHHHHHHHHHH--HCCCCEEEECCCCC-CHHHHHHHHHHCCCEEEEE-------ECCCCCCCCCC T ss_conf 86798759946999778899999999--75982699638999-9999999996288808998-------43556687645 Q ss_pred CCHHHHHHHHHH-HCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCC Q ss_conf 100378899986-226981997265437776213367148789932301012167764 Q gi|254780889|r 321 QLSAIMSVVEVA-ERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGD 377 (493) Q Consensus 321 q~~av~~~~~~~-~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~ 377 (493) --..+.+.-+.. +-.++||..-=||+++-|+ +++..+||.|-+||.|-..-|.-|+ T Consensus 190 ~~~~~~~~i~~ik~~t~~Pv~vGFGIs~~e~v-~~~~~~aDGvIVGSaiVk~i~~~gd 246 (257) T PRK13134 190 LPVEVADTLARARQCFSIPVALGFGISRPAQL-EGLSHPPDAVIFGSALLRHLDAGGD 246 (257) T ss_pred CHHHHHHHHHHHHHHCCCCEEEECCCCCHHHH-HHHHCCCCEEEECHHHHHHHHHCCC T ss_conf 52889999999997069987998067999999-9997039999987999999997898 No 378 >PRK13585 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Provisional Probab=96.25 E-value=0.083 Score=32.39 Aligned_cols=97 Identities=26% Similarity=0.358 Sum_probs=59.7 Q ss_pred CCEEEEEC-CCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCC-EEEEEC--CCCHHH Q ss_conf 33064574-79978688980210001017422326568765556651370156789998731366-699824--841138 Q gi|254780889|r 184 IEKLLVVD-DDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVD-LVVVDT--AHGHSQ 259 (493) Q Consensus 184 i~~LPVVD-e~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvD-viviD~--ahGh~~ 259 (493) .+-+|++| .+|+++-++.-+ ..+.+ +.+|-.+-|+.+.+.|+| +.++|. |-.... T Consensus 3 m~IiP~IDl~~Gk~Vrl~~G~-~~~~~--------------------~~~dP~~~a~~~~~~Ga~~lhivDLd~a~~g~~ 61 (240) T PRK13585 3 FEVIPAVDMLGGKCVQLVQGE-PGTET--------------------SYGDPVEVAKRWVDAGAKTLHLVDLDGAFEGSR 61 (240) T ss_pred EEEEEEEEEECCEEEEEECCC-CCCCE--------------------ECCCHHHHHHHHHHCCCCEEEEEECCCCCCCCC T ss_conf 899989998899189714610-17863--------------------668999999999987999799998977211894 Q ss_pred HHHHHHHHHHHHCCCCCE-ECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 899999999985798724-5146469999999985487589961 Q gi|254780889|r 260 KVLDAVVQIKKNFPSLLV-MAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 260 ~~~~~i~~~k~~~~~~~i-i~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) .-.+.|+.+.+.+ .+|+ ++|.+-|.+.++.+.++|||-+-+| T Consensus 62 ~n~~~I~~i~~~~-~~pi~vGGGIrs~~~i~~~l~~Ga~kvvig 104 (240) T PRK13585 62 KNADIIEKIVEAT-DVSIQLGGGIRSVEDAASLLDLGVDRVILG 104 (240) T ss_pred CHHHHHHHHHHHC-CCCEEEECCCCCHHHHHHHHHCCCCEEEEC T ss_conf 4499999999737-977899788587999999997699899939 No 379 >TIGR01137 cysta_beta cystathionine beta-synthase; InterPro: IPR005857 This model discriminates cystathionine beta-synthase, a protein found primarily in eukaryotes and carrying a C-terminal CBS domain from cysteine synthases. Cysteine synthase (O-acetylserine (thiol)-lyase) is the enzyme responsible for the formation of cysteine from O-acetyl-serine and hydrogen sulphide with the concomitant release of acetic acid - the function of many of these enzymes is unproven.; GO: 0004122 cystathionine beta-synthase activity, 0019343 cysteine biosynthetic process via cystathionine, 0005737 cytoplasm. Probab=96.24 E-value=0.0054 Score=40.91 Aligned_cols=143 Identities=22% Similarity=0.283 Sum_probs=87.4 Q ss_pred HHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHC---CCEECCC-EEEECCCCHHHHH-H-----HHHHHCCCCEEEE Q ss_conf 889999997798799978989899999999874223---8815287-8994654303456-5-----6676050416786 Q gi|254780889|r 60 SRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQVKKFE---SGMVVNP-VTISPYATLADAL-A-----LMKKYSISGIPVV 129 (493) Q Consensus 60 ~~MAiamA~~GGiGvIHrn~sie~qa~~V~kVKr~e---~gmI~dP-Vti~pd~TI~ea~-~-----lm~~~~is~iPVV 129 (493) -.||--|+++=||=|= =.-=..+|..||-.+ .+-=.+. |.|-|| +++--+ + =|.+++| + T Consensus 265 F~maRrLi~eEGlLvG-----GS~GsAvvaal~~A~~hP~l~~~~~~VvlLPD-S~R~YmtK~~nD~Wl~~~Gf-----l 333 (527) T TIGR01137 265 FKMARRLIKEEGLLVG-----GSSGSAVVAALKVAEDHPELKEDQVIVVLLPD-SIRNYMTKFLNDEWLLDNGF-----L 333 (527) T ss_pred HHHHHHHHHHCCEEEE-----CCHHHHHHHHHHHHHHCCCCCCCCEEEEEECC-CCCHHHHHEECHHHHHHCCC-----C T ss_conf 9999999874472640-----43035899999998726788888789998468-73000100104378864587-----5 Q ss_pred CCCCEEEEE---EEEHHHHC--------CCC-CCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEC------ Q ss_conf 688602788---72354500--------143-446724400267863257303589999999865533064574------ Q gi|254780889|r 130 ESDVGKLVG---ILTNRDVR--------FAS-NAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVD------ 191 (493) Q Consensus 130 D~~~gkLiG---IVT~rDir--------~~~-~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVD------ 191 (493) ++ ||-| ..|..|.. +.+ -....+++.--+.++++.+++++.+|.++|++++++.+|||+ T Consensus 334 ~~---rldg~~e~~~~~~~~~leasttky~~~~~~~~v~~l~L~~~v~~~~t~tv~~ai~iLre~GfdQ~PVv~pga~~P 410 (527) T TIGR01137 334 DD---RLDGSTESLTVKDVLKLEASTTKYEDVFGNARVKDLHLPALVTVHPTETVGDAIEILREYGFDQLPVVKPGAAEP 410 (527) T ss_pred CC---CCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCCCCEEECCCCHHHHHHHHHHHCCCCCCCCCCCCCCCC T ss_conf 10---115776134544455543332225564256364156688734546764089999999974873076207788889 Q ss_pred --CCCCEEEEEEECCCHHCCCCCCCCC Q ss_conf --7997868898021000101742232 Q gi|254780889|r 192 --DDGCCIGLITVKDIERSQLNPNATK 216 (493) Q Consensus 192 --e~g~LvGIIT~~DIlk~~~~p~A~~ 216 (493) +.++++|.++...+|+...-..|.. T Consensus 411 ~veag~v~G~v~l~~lL~~l~~~~a~~ 437 (527) T TIGR01137 411 DVEAGKVLGSVTLRELLSALFAKKAKL 437 (527) T ss_pred CCCCCEEEEEEEHHHHHHHHHHCCCCC T ss_conf 723534788742677899998424675 No 380 >cd02801 DUS_like_FMN Dihydrouridine synthase-like (DUS-like) FMN-binding domain. Members of this family catalyze the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archaea. Most dihydrouridines are found in the D loop of t-RNAs. The role of dihydrouridine in tRNA is currently unknown, but may increase conformational flexibility of the tRNA. It is likely that different family members have different substrate specificities, which may overlap. 1VHN, a putative flavin oxidoreductase, has high sequence similarity to DUS. The enzymatic mechanism of 1VHN is not known at the present. Probab=96.23 E-value=0.028 Score=35.82 Aligned_cols=67 Identities=28% Similarity=0.445 Sum_probs=51.3 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCHHHHH-------HHHHHHHHHHCCCCCEEC-CCCCCHHHHHHHHH-CCCCEEEEEE Q ss_conf 15678999873136669982484113889-------999999998579872451-46469999999985-4875899610 Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVVDTAHGHSQKV-------LDAVVQIKKNFPSLLVMA-GNIATAEGALALID-AGADIIKVGI 303 (493) Q Consensus 233 d~~eRa~~LveaGvDviviD~ahGh~~~~-------~~~i~~~k~~~~~~~ii~-GNv~t~~~~~~l~~-~Gad~ikvGi 303 (493) +..+-+..|.++|++.|.| ||-+..+ .+.++.+|+ ++++++|+ |+|-|++.++.+.+ .|+|+|-+|= T Consensus 139 ~~~~~~~~l~~~G~~~ltv---H~Rt~~q~~~~~a~~e~i~~~~~-~~~ipvi~NGdI~s~~d~~~~~~~tg~dgvMigR 214 (231) T cd02801 139 ETLELAKALEDAGASALTV---HGRTREQRYSGPADWDYIAEIKE-AVSIPVIANGDIFSLEDALRCLEQTGVDGVMIGR 214 (231) T ss_pred HHHHHHHHHHHCCCCEEEE---ECCCHHHCCCCCCCHHHHHHHHH-CCCCEEEEECCCCCHHHHHHHHHHHCCCEEEECH T ss_conf 7999999999769989998---35614414677622699999986-5997799838909999999999850999999878 No 381 >PRK03512 thiamine-phosphate pyrophosphorylase; Provisional Probab=96.23 E-value=0.067 Score=33.07 Aligned_cols=100 Identities=17% Similarity=0.147 Sum_probs=54.3 Q ss_pred HHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHH-HHHCCCCEEEEEC Q ss_conf 999998579872451464699999999854875899610278766531000157431003788999-8622698199726 Q gi|254780889|r 265 VVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVE-VAERAGVAIVADG 343 (493) Q Consensus 265 i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~-~~~~~~~~iiadG 343 (493) ++..|+....-.+|+--+.+.+.+......|||.+ |+|| +-.|.-...-+.|+-. ....+ .....++||+|=| T Consensus 92 ~~~aR~~lg~~~iIG~S~h~~~e~~~A~~~gaDYi--g~Gp--if~T~TK~~~~~p~G~--~~l~~~~~~~~~iPvvAIG 165 (211) T PRK03512 92 TTDLNAIRAAGLRLGVSTHDDMEIDVALAARPSYI--ALGH--VFPTQTKQMPSAPQGL--AQLARHVERLADYPTVAIG 165 (211) T ss_pred HHHHHHHCCCCCEEEEECCCHHHHHHHHHCCCCEE--EECC--CCCCCCCCCCCCCCHH--HHHHHHHHHCCCCCEEEEC T ss_conf 89999866999789974299999999986499839--9856--3345887899887249--9999999971799989988 Q ss_pred CCCCHHHHEEEEEECCCEEEECCHHHCC Q ss_conf 5437776213367148789932301012 Q gi|254780889|r 344 GIRFSGDIAKAIAAGSACVMIGSLLAGT 371 (493) Q Consensus 344 Gi~~~gdi~kAla~GA~~VM~G~~~agt 371 (493) || +..++..-+.+||+.|=+=|.+.+. T Consensus 166 GI-~~~n~~~v~~~Ga~gvAViSaI~~a 192 (211) T PRK03512 166 GI-SLERAPAVLATGVGSIAVVSAITQA 192 (211) T ss_pred CC-CHHHHHHHHHCCCCEEEEEHHHHCC T ss_conf 96-8999999998399999995187469 No 382 >PRK12999 pyruvate carboxylase; Reviewed Probab=96.21 E-value=0.088 Score=32.22 Aligned_cols=138 Identities=17% Similarity=0.147 Sum_probs=89.3 Q ss_pred CCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCC-------------H Q ss_conf 730358999999986553306457479978688980210001017422326568765556651370-------------1 Q gi|254780889|r 167 KKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAK-------------D 233 (493) Q Consensus 167 ~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~-------------d 233 (493) .++.-++.-.+...++++...-|.|.-+.+-.+-+ --.+.+.. |.+ +-+||.-++ - T Consensus 625 Ypdnvv~~fv~~a~~~GiDvfRiFD~LN~~~nm~~---------~i~~v~~~-g~~-~e~aicyTGdi~dp~~~~~~l~y 693 (1147) T PRK12999 625 YPDNVVRAFVREAAAAGIDVFRIFDSLNWVENMRV---------AIDAVRET-GKV-AEAAICYTGDILDPARAKYDLDY 693 (1147) T ss_pred CCHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHH---------HHHHHHHC-CCE-EEEEEEECCCCCCCCCCCCCHHH T ss_conf 97379999999999839891167313446677899---------99999970-988-99998742577787888766999 Q ss_pred HHHHHHHHHHCCCCEEEE-ECCCCH-HHHHHHHHHHHHHHCCCCCEECCCCCCHH----HHHHHHHCCCCEEEEEECCCC Q ss_conf 567899987313666998-248411-38899999999985798724514646999----999998548758996102787 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVV-DTAHGH-SQKVLDAVVQIKKNFPSLLVMAGNIATAE----GALALIDAGADIIKVGIGPGS 307 (493) Q Consensus 234 ~~eRa~~LveaGvDvivi-D~ahGh-~~~~~~~i~~~k~~~~~~~ii~GNv~t~~----~~~~l~~~Gad~ikvGig~Gs 307 (493) +.+-|+.|.+.|+|.|+| |.|--- -...-+.++.||+.+ ++||-.--=.|.- ....-++||+|+|-+.+.|=| T Consensus 694 y~~~a~~l~~~G~~~l~IKDMAGLl~P~aa~~LV~aLk~~~-~lPIhlHtHdTsG~~~at~laA~eAGvDiVD~A~ssmS 772 (1147) T PRK12999 694 YVDLAKELEAAGAHILAIKDMAGLLKPAAAYKLVSALKEEV-DLPIHLHTHDTSGNGLATYLAAAEAGVDIVDVAMASMS 772 (1147) T ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHHHHHHHC-CCCEEEECCCCCCHHHHHHHHHHHCCCCEEECCCHHHC T ss_conf 99999999976898899714454688799999999998616-98459843678855899999999859998963541102 Q ss_pred CCCCCHHCC Q ss_conf 665310001 Q gi|254780889|r 308 ICTTRVVTG 316 (493) Q Consensus 308 ~CtTr~~~g 316 (493) ..|.+.-++ T Consensus 773 g~TSQP~l~ 781 (1147) T PRK12999 773 GLTSQPSLS 781 (1147) T ss_pred CCCCCCCHH T ss_conf 898898899 No 383 >PRK06015 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional Probab=96.20 E-value=0.11 Score=31.52 Aligned_cols=123 Identities=21% Similarity=0.267 Sum_probs=78.3 Q ss_pred CEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEE Q ss_conf 76555665137015678999873136669982484113889999999998579872451464699999999854875899 Q gi|254780889|r 221 RLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIK 300 (493) Q Consensus 221 rL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ik 300 (493) .+.+||.-=. +.+-++..+++|++++|- -|.+..+++..+.. +++.+ ..+.|+.-.....++|+|.+| T Consensus 66 ~~~vGaGTVl---~~e~~~~a~~aGA~FiVS---P~~~~~v~~~a~~~-----~i~~i-PGv~TpsEi~~A~~~G~~~vK 133 (212) T PRK06015 66 EAIVGAGTIL---NAKQFEDAAKAGSRFIVS---PGTTQELLAAANDS-----DVPLL-PGAITPSEVMALREEGYTVLK 133 (212) T ss_pred CCEEEEEECC---CHHHHHHHHHCCCCEEEC---CCCCHHHHHHHHHC-----CCCEE-CCCCCHHHHHHHHHCCCCEEE T ss_conf 9679542115---699999999849989985---89999999999983-----99773-786999999999987999899 Q ss_pred EEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHC Q ss_conf 6102787665310001574310037889998622698199726543777621336714878993230101 Q gi|254780889|r 301 VGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAG 370 (493) Q Consensus 301 vGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~ag 370 (493) += |.+.+- |..-+.++.. --.++++++-|||.. .++..-|.+++-.+..||+++- T Consensus 134 lF--PA~~~g-------G~~~lkal~~-----p~p~~~~~ptGGV~~-~N~~~yl~~~~v~~vgGs~l~~ 188 (212) T PRK06015 134 FF--PAEQAG-------GAAFLKALSS-----PLAGTFFCPTGGISL-KNARDYLSLPNVVCVGGSWVAP 188 (212) T ss_pred EC--CCCCCC-------CHHHHHHHHC-----CCCCCCEEECCCCCH-HHHHHHHCCCCEEEEECHHHCC T ss_conf 78--430016-------8999999857-----799998886289898-8899998089819998835389 No 384 >PRK13132 consensus Probab=96.18 E-value=0.081 Score=32.48 Aligned_cols=178 Identities=13% Similarity=0.174 Sum_probs=107.2 Q ss_pred CCCCCCCC---CCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEE Q ss_conf 46724400---267863257303589999999865533064574799786889802100010174223265687655566 Q gi|254780889|r 151 AQQAVGEL---MTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAA 227 (493) Q Consensus 151 ~~~~V~ei---Mt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAA 227 (493) ++.|+.|- +......+..+.++.+.++++.+-+.+ .|++ +.|.-....+ T Consensus 48 fSDP~aDGPvIq~A~~~AL~~G~~~~~~~~~~~~ir~~-~piv--------lM~Y~N~i~~------------------- 99 (246) T PRK13132 48 YSDPLADGKLIADASFIALQQGVNTDTVFELLARVKTK-KALV--------FLVYYNLIFA------------------- 99 (246) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCC-CCEE--------EEEECHHHHH------------------- T ss_conf 88876558999999999987799899999999975369-9979--------9960108877------------------- Q ss_pred EECCCHHHH-HHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECC Q ss_conf 513701567-89998731366-6998248411388999999999857987245146469999999985487589961027 Q gi|254780889|r 228 VSVAKDIAD-RVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGP 305 (493) Q Consensus 228 Ig~~~d~~e-Ra~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~ 305 (493) -..| =+..+.++|+| +|+.|...-.+....+.++ +..+.-+++++- .|.+-.+.+.+..-..|=+= T Consensus 100 -----~G~e~F~~~~~~~GvdGlIipDLP~ee~~~~~~~~~--~~~i~~I~lvaP--Ts~~R~~~i~~~s~gfiY~v--- 167 (246) T PRK13132 100 -----YGLEKFVKKAKELGISGLIVPDLPFEESEELIKECE--KYNIALIPLISV--TSPKRAKKILKHAKGFIYAL--- 167 (246) T ss_pred -----CCHHHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHH--HCCCCEEEEECC--CCHHHHHHHHHCCCCCEEEE--- T ss_conf -----299999999987699857757999789899999999--859970144257--97899999995489827997--- Q ss_pred CCCCCCCHHCCCCCCCCHHHHHHHHHH-HCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 876653100015743100378899986-22698199726543777621336714878993230101216 Q gi|254780889|r 306 GSICTTRVVTGVGCPQLSAIMSVVEVA-ERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 306 Gs~CtTr~~~g~g~pq~~av~~~~~~~-~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) +..-+||.-......+.++-+.. +..+.||..-=||+++-|+-+ +..+||.|-+||.|-..-| T Consensus 168 ----s~~GvTG~~~~~~~~~~~~i~~ik~~t~~Pv~vGFGI~~~e~v~~-~~~~aDGvIVGSa~v~~i~ 231 (246) T PRK13132 168 ----GSIGVTGTKSVEEARLKDKVKEIKSFTDLPVAVGFGIKNNQDVKR-MRKYADGVIVGTSIVKLFK 231 (246) T ss_pred ----ECCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHH-HHHCCCEEEECHHHHHHHH T ss_conf ----535677776663688999999999628998699779899999999-9822999997099999998 No 385 >PRK08883 ribulose-phosphate 3-epimerase; Provisional Probab=96.13 E-value=0.13 Score=30.94 Aligned_cols=124 Identities=19% Similarity=0.266 Sum_probs=75.5 Q ss_pred HHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHH-HHHHHHHCCCCEE-EEEECCCCCCCCCH Q ss_conf 789998731366699824841138899999999985798724514646999-9999985487589-96102787665310 Q gi|254780889|r 236 DRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAE-GALALIDAGADII-KVGIGPGSICTTRV 313 (493) Q Consensus 236 eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~-~~~~l~~~Gad~i-kvGig~Gs~CtTr~ 313 (493) .-++.++++|+|.|.+-.-. +....+.++++|+.--. .=+|=|-.|+- ..+.+++. .|.| -..+-||-. T Consensus 72 ~~i~~~~~aGad~I~~H~Ea--~~~~~~~i~~Ik~~g~k-~GlalnP~T~~~~l~~~l~~-~D~VLvMtV~PGf~----- 142 (220) T PRK08883 72 RIIPDFAKAGASMITFHVEA--SEHVDRTLQLIKEHGCQ-AGVVLNPATPLAHLEYIMDK-VDLILLMSVNPGFG----- 142 (220) T ss_pred HHHHHHHHCCCCEEEECCCC--CCCHHHHHHHHHHCCCC-EEEEECCCCCHHHHHHHHHH-CCEEEEEEECCCCC----- T ss_conf 88999997599889985776--54999999999985996-68884799987999999974-69799987458988----- Q ss_pred HCCCC--CCCCHHHHHHHHHHHC--CCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCC Q ss_conf 00157--4310037889998622--6981997265437776213367148789932301012 Q gi|254780889|r 314 VTGVG--CPQLSAIMSVVEVAER--AGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGT 371 (493) Q Consensus 314 ~~g~g--~pq~~av~~~~~~~~~--~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt 371 (493) |-. ...+.-+.+.++...+ ++..|..||||+. ..+.+...+|||.+-.||.+=+. T Consensus 143 --GQ~f~~~~l~Ki~~l~~~~~~~~~~~~I~VDGGI~~-~ti~~l~~aGad~~V~GS~iF~~ 201 (220) T PRK08883 143 --GQSFIPHTLDKLRAVRKMIDASGRDIRLEIDGGVKV-DNIREIAEAGADMFVAGSAIFGQ 201 (220) T ss_pred --CCCCCHHHHHHHHHHHHHHHHCCCCCEEEEECCCCH-HHHHHHHHCCCCEEEECHHHHCC T ss_conf --754557799999999998874499807999898789-99999998799999968267489 No 386 >CHL00200 trpA tryptophan synthase alpha subunit; Provisional Probab=96.12 E-value=0.088 Score=32.20 Aligned_cols=68 Identities=18% Similarity=0.211 Sum_probs=49.8 Q ss_pred HHHHHHHHHHCCCCEEEEECCCC-------HHHHHHHHHHHHHHHCCCCCEECC-CCCCHHHHHHHHHCCCCEEEEE Q ss_conf 56789998731366699824841-------138899999999985798724514-6469999999985487589961 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVDTAHG-------HSQKVLDAVVQIKKNFPSLLVMAG-NIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD~ahG-------h~~~~~~~i~~~k~~~~~~~ii~G-Nv~t~~~~~~l~~~Gad~ikvG 302 (493) ..+|.+.+.+..-.++-.=+..| ......+.++++|+.. ++|+.+| .|.|+|.++.+.++|||++-|| T Consensus 156 ~~~Ri~~i~~~a~GFiY~vs~~GvTG~~~~~~~~l~~~i~~ik~~t-~~Pv~vGFGIs~~e~v~~~~~~~aDGvIVG 231 (263) T CHL00200 156 SKSRIQKIARAAPGCIYLVSTTGVTGLKTELDKKLKKLIETIKKMT-NKPIILGFGISTSEQIKQIKGWNINGIVIG 231 (263) T ss_pred CHHHHHHHHHHCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHC-CCCEEEECCCCCHHHHHHHHHCCCCEEEEC T ss_conf 6999999997289808985336556875445187999999999736-998487358799999999974599999987 No 387 >PRK13124 consensus Probab=96.09 E-value=0.073 Score=32.78 Aligned_cols=179 Identities=14% Similarity=0.149 Sum_probs=103.0 Q ss_pred CCCCCCCC---CCCCCCCCCCCCCHHHHHHHHHHCCCC-EEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEE Q ss_conf 46724400---267863257303589999999865533-06457479978688980210001017422326568765556 Q gi|254780889|r 151 AQQAVGEL---MTRNLITVKKTVNLENAKALLHQHRIE-KLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAA 226 (493) Q Consensus 151 ~~~~V~ei---Mt~~litv~~~~~l~eA~~iM~~~ki~-~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgA 226 (493) ++.|+.|- +...-..+..+.++.++++++.+-+-+ ..|+| +.|....+-+ T Consensus 46 fSDP~ADGpvIq~A~~~AL~~G~~~~~~~~~~~~~r~~~~~piv--------lM~Y~N~i~~------------------ 99 (257) T PRK13124 46 YSDPLADGPVIQRASKRALNGGMNIVKAMELVGKMRKKVTIPIV--------YFTYYNPVLQ------------------ 99 (257) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCEE--------EEEHHHHHHH------------------ T ss_conf 88877657999999999997699689999999985244788889--------9750078987------------------ Q ss_pred EEECCCHHHHHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHH-HCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEEC Q ss_conf 651370156789998731366-69982484113889999999998-5798724514646999999998548758996102 Q gi|254780889|r 227 AVSVAKDIADRVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKK-NFPSLLVMAGNIATAEGALALIDAGADIIKVGIG 304 (493) Q Consensus 227 AIg~~~d~~eRa~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~-~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig 304 (493) ....+-++.+.++|+| +|+.|...-.+.... +..++ .+.-+++++. .|.+-.+.+.+..-..|=+ + T Consensus 100 -----~G~e~F~~~~~~~Gv~GvIipDLP~eE~~~~~---~~~~~~gl~~I~lvaP--Ts~~Ri~~i~~~s~gFiY~-v- 167 (257) T PRK13124 100 -----YGLEKFFALARENGIDGLLIPDLPLEESGELQ---EICDKYGIYLIPLVAP--TSKERIKKIAEQAEGFVYC-V- 167 (257) T ss_pred -----HHHHHHHHHHHHCCCCEEECCCCCHHHHHHHH---HHHHHCCCCEEEEECC--CCHHHHHHHHHCCCCCEEE-E- T ss_conf -----57999999999759984777899979999999---9998668735788479--9679999998548983899-6- Q ss_pred CCCCCCCCHHCCCCCCCCHHHHHHHHHHH-CCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 78766531000157431003788999862-2698199726543777621336714878993230101216 Q gi|254780889|r 305 PGSICTTRVVTGVGCPQLSAIMSVVEVAE-RAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 305 ~Gs~CtTr~~~g~g~pq~~av~~~~~~~~-~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) +..-+||...+.-..+.++-+..+ ..+.||..-=||+++-|+ +.+..+||.|-+||.|-..-| T Consensus 168 -----s~~GvTG~~~~~~~~~~~~i~~ik~~t~~Pv~vGFGI~~~e~v-~~~~~~ADGvIVGSaivk~i~ 231 (257) T PRK13124 168 -----SSLGVTGVREEIETDLEEFIRTVKQYSNVPVAVGFGISTPEQV-QKMKEIADGVVVGSALVEKIE 231 (257) T ss_pred -----ECCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCHHHH-HHHHHHCCEEEECHHHHHHHH T ss_conf -----2466678765560889999999986179983898446999999-999801999998289999998 No 388 >PRK13129 consensus Probab=96.08 E-value=0.12 Score=31.23 Aligned_cols=170 Identities=15% Similarity=0.125 Sum_probs=100.0 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCC-CEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHH Q ss_conf 86325730358999999986553-30645747997868898021000101742232656876555665137015678999 Q gi|254780889|r 162 NLITVKKTVNLENAKALLHQHRI-EKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGP 240 (493) Q Consensus 162 ~litv~~~~~l~eA~~iM~~~ki-~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~ 240 (493) .-..+..+.++.++++++.+-|- ...|+| +.|.-..+-+ .....-+.. T Consensus 70 ~~~AL~~G~~~~~~~~~~~~~r~~~~~Piv--------lM~Y~N~i~~-----------------------~G~e~F~~~ 118 (267) T PRK13129 70 ATRALQSGTTLEKVLEMLESLKGKLTIPII--------LFTYYNPLLN-----------------------RGMERFCEQ 118 (267) T ss_pred HHHHHHCCCCHHHHHHHHHHHHCCCCCCEE--------EEEEHHHHHH-----------------------HHHHHHHHH T ss_conf 999997698789999999985434788889--------9861078988-----------------------559999999 Q ss_pred HHHCCCC-EEEEECCCCHHHHHHHHHHHHHH-HCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCC Q ss_conf 8731366-69982484113889999999998-579872451464699999999854875899610278766531000157 Q gi|254780889|r 241 LFDVNVD-LVVVDTAHGHSQKVLDAVVQIKK-NFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVG 318 (493) Q Consensus 241 LveaGvD-viviD~ahGh~~~~~~~i~~~k~-~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g 318 (493) +.++|+| +|+.|...-.+... .+..++ .+.-+++++.+ .+.+-.+.+.+.+-..|=+= +..-+||.. T Consensus 119 ~~~~GvdGvIipDLP~eE~~~~---~~~~~~~gl~~I~lvaPt-t~~~Ri~~i~~~~~gFiY~v-------s~~GvTG~~ 187 (267) T PRK13129 119 AAAAGVAGLVVPDLPLEEAEKL---SPIAAERGIDLILLVAPT-TPAERMKRIAQQSRGFTYLV-------SVTGVTGER 187 (267) T ss_pred HHHCCCCEEECCCCCHHHHHHH---HHHHHHCCCEEEEEECCC-CCHHHHHHHHHCCCCEEEEE-------ECCCCCCCC T ss_conf 9866987576789998999999---999985398168994899-96899999981689808987-------346656765 Q ss_pred CCCCHHHHHHHHHH-HCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 43100378899986-22698199726543777621336714878993230101216 Q gi|254780889|r 319 CPQLSAIMSVVEVA-ERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 319 ~pq~~av~~~~~~~-~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) -+.-..+.+..+.. +..++|+..-=||+++-|+.+...+|||.|-+||.|-..-| T Consensus 188 ~~~~~~~~~~i~~ik~~t~~Pv~vGFGIs~~e~v~~~~~~~ADGvIVGSaiV~~i~ 243 (267) T PRK13129 188 SQMENRVESLLQQLRQVTSKPIAVGFGISGPEQARQVREWGADGAIVGSAFVKRLA 243 (267) T ss_pred CCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCEEEECHHHHHHHH T ss_conf 44508899999999983489817884479999999998549999998789999998 No 389 >PRK13118 consensus Probab=96.08 E-value=0.06 Score=33.40 Aligned_cols=181 Identities=15% Similarity=0.163 Sum_probs=104.7 Q ss_pred CCCCCCCC---CCCCCCCCCCCCCHHHHHHHHHHCCC--CEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEE Q ss_conf 46724400---26786325730358999999986553--30645747997868898021000101742232656876555 Q gi|254780889|r 151 AQQAVGEL---MTRNLITVKKTVNLENAKALLHQHRI--EKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVA 225 (493) Q Consensus 151 ~~~~V~ei---Mt~~litv~~~~~l~eA~~iM~~~ki--~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~Vg 225 (493) ++.|+.|- .......+..+.++.+.++++.+-+. ...|+| +.|....+.. T Consensus 54 fSDP~ADGPvIq~A~~rAL~~G~~~~~~~~~v~~~r~~~~~~Piv--------lM~Y~N~i~~----------------- 108 (269) T PRK13118 54 FSDPMADGPAIQLASERALAAGQGLADVLQMVREFRQGDQTTPVV--------LMGYLNPIEI----------------- 108 (269) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCEE--------EEEECCHHHH----------------- T ss_conf 888666579999999999967986889999999986438999989--------9740007878----------------- Q ss_pred EEEECCCHHHHHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEEC Q ss_conf 6651370156789998731366-699824841138899999999985798724514646999999998548758996102 Q gi|254780889|r 226 AAVSVAKDIADRVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIG 304 (493) Q Consensus 226 AAIg~~~d~~eRa~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig 304 (493) ....+-++.+.++|+| +|+.|...-.+....+..+ +..+.-+++++.+ .+.+-.+.+.+..-..|=+ + T Consensus 109 ------~G~e~F~~~~~~~GvdGvIipDLP~ee~~~~~~~~~--~~gl~~I~lvaPt-t~~~Ri~~i~~~a~gFiY~-v- 177 (269) T PRK13118 109 ------YGYERFVAQAKEAGVDGLILVDLPPEEADELRAPAQ--AHGLDFIRLTSPT-TSDERLPRVLEHASGYLYY-V- 177 (269) T ss_pred ------HHHHHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHH--HCCCCHHEEECCC-CCHHHHHHHHHCCCCCEEE-E- T ss_conf ------639999999998599746458999789999999999--7598464036989-8789999998437883899-8- Q ss_pred CCCCCCCCHHCCCCCCCCHHHHHHHH-HHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 78766531000157431003788999-8622698199726543777621336714878993230101216 Q gi|254780889|r 305 PGSICTTRVVTGVGCPQLSAIMSVVE-VAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 305 ~Gs~CtTr~~~g~g~pq~~av~~~~~-~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) +..-+||..-.....+.+.-+ ..+..++||..-=||+++-|+ |++...||.|-+||.|-..-| T Consensus 178 -----s~~GvTG~~~~~~~~~~~~i~~ik~~t~~Pv~vGFGIs~~e~~-~~v~~~aDGvIVGSa~Vk~i~ 241 (269) T PRK13118 178 -----SLAGVTGAAALDTEHVEEAVARLRRHTDLPVVVGFGIRDAESA-AAIARLADGVVVGSALVDAIA 241 (269) T ss_pred -----ECCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCHHHH-HHHHCCCCEEEECHHHHHHHH T ss_conf -----5456678776671989999999996258981787167999999-999800999998589999998 No 390 >PRK13136 consensus Probab=96.07 E-value=0.11 Score=31.53 Aligned_cols=169 Identities=19% Similarity=0.217 Sum_probs=100.3 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCC-CEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHH Q ss_conf 86325730358999999986553-30645747997868898021000101742232656876555665137015678999 Q gi|254780889|r 162 NLITVKKTVNLENAKALLHQHRI-EKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGP 240 (493) Q Consensus 162 ~litv~~~~~l~eA~~iM~~~ki-~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~ 240 (493) .-..+..+.++.+.++++.+-|. ...|+| +.|....+-+ . ..+-.+. T Consensus 63 ~~rAL~~G~~~~~~~~~v~~~r~~~~~piv--------lM~Y~N~i~~----------------------~--G~~f~~~ 110 (253) T PRK13136 63 SIRALAQGTTLHDVLTLITSFRQHSEIPII--------LFTYFNPLLA----------------------A--GDKIYQQ 110 (253) T ss_pred HHHHHHCCCCHHHHHHHHHHHCCCCCCCEE--------EECCCHHHHH----------------------H--HHHHHHH T ss_conf 999998699799999999982257898889--------9865179999----------------------7--9999999 Q ss_pred HHHCCCC-EEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCC Q ss_conf 8731366-699824841138899999999985798724514646999999998548758996102787665310001574 Q gi|254780889|r 241 LFDVNVD-LVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGC 319 (493) Q Consensus 241 LveaGvD-viviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~ 319 (493) +.++|+| +|+.|.....+....+..+ +....-+++++-+. +.+-.+.+.+..-..|=+---.| +||.-- T Consensus 111 ~~~~GvdGlIipDLP~eE~~~~~~~~~--~~~i~~I~liaPtt-~~eRi~~i~~~a~gFiY~vs~~G-------vTG~~~ 180 (253) T PRK13136 111 MKSAGVDGCLVVDLPVEEAAPHLTACK--TAKIAPILLISPST-TQERLKKINEHGEGMLYYVCRPG-------TTGVRA 180 (253) T ss_pred HHHCCCCCEECCCCCHHHHHHHHHHHH--HCCCCCEEEECCCC-CHHHHHHHHHCCCCEEEEEECCC-------CCCCCC T ss_conf 997498720067899777699999999--75887125526899-88999999960898199985552-------368764 Q ss_pred CCCHHHHHH-HHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 310037889-998622698199726543777621336714878993230101216 Q gi|254780889|r 320 PQLSAIMSV-VEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 320 pq~~av~~~-~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) .--..+.+. .+..+..++||..-=||++.-|+.+.... ||.|-+||.|-..-| T Consensus 181 ~~~~~~~~~i~~ik~~t~~Pv~vGFGIs~~e~v~~~~~~-ADGvIVGSaiV~~i~ 234 (253) T PRK13136 181 TLPENFPAKMNQIKSMTSLPIVTGFGIANRKMAAQALQY-ADGFVIGSLFVKAIA 234 (253) T ss_pred CCHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHH-CCEEEECHHHHHHHH T ss_conf 463889999999997269986997154999999999822-999998589999998 No 391 >PRK02145 consensus Probab=96.03 E-value=0.095 Score=31.97 Aligned_cols=70 Identities=17% Similarity=0.281 Sum_probs=50.9 Q ss_pred CHHHHHHHHHHHCCCC-EEEEEC--CCCHHHHHHHHHHHHHHHCCCCC-EECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 0156789998731366-699824--84113889999999998579872-45146469999999985487589961 Q gi|254780889|r 232 KDIADRVGPLFDVNVD-LVVVDT--AHGHSQKVLDAVVQIKKNFPSLL-VMAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 232 ~d~~eRa~~LveaGvD-viviD~--ahGh~~~~~~~i~~~k~~~~~~~-ii~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) +|-.+.|+...+.|+| +.++|. |........+.++.+.+.. .+| -++|.|-+.+.++.|.++|||-|-+| T Consensus 31 gdP~~~a~~~~~~GadelhivDld~a~~~~~~~~~~I~~i~~~~-~iPi~vGGGIrs~e~~~~ll~~GadkVii~ 104 (257) T PRK02145 31 GDPVEIARRYDEQGADELTFLDITATSDGRDLILPIIEAVASQV-FIPLTVGGGVRAVEDVRRLLNAGADKVSMN 104 (257) T ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHC-CCCEEEECCCCCHHHHHHHHHCCCCEEEEH T ss_conf 89999999999879998999978887667540899999999656-874896277304688999998199889841 No 392 >COG0159 TrpA Tryptophan synthase alpha chain [Amino acid transport and metabolism] Probab=96.03 E-value=0.15 Score=30.60 Aligned_cols=67 Identities=24% Similarity=0.348 Sum_probs=48.5 Q ss_pred HHHHHHHHHHCCCCEEE-EE------CCCCHHHHHHHHHHHHHHHCCCCCEECC-CCCCHHHHHHHHHCCCCEEEEE Q ss_conf 56789998731366699-82------4841138899999999985798724514-6469999999985487589961 Q gi|254780889|r 234 IADRVGPLFDVNVDLVV-VD------TAHGHSQKVLDAVVQIKKNFPSLLVMAG-NIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 234 ~~eRa~~LveaGvDviv-iD------~ahGh~~~~~~~i~~~k~~~~~~~ii~G-Nv~t~~~~~~l~~~Gad~ikvG 302 (493) ..+|.+.+.+..-.++- +- +....+..+.+.++++|+.. ++|+.+| .|.+++.++++.++ ||++-|| T Consensus 159 ~~~rl~~i~~~a~GFiY~vs~~GvTG~~~~~~~~~~~~v~~vr~~~-~~Pv~vGFGIs~~e~~~~v~~~-ADGVIVG 233 (265) T COG0159 159 PDERLKKIAEAASGFIYYVSRMGVTGARNPVSADVKELVKRVRKYT-DVPVLVGFGISSPEQAAQVAEA-ADGVIVG 233 (265) T ss_pred CHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHC-CCCEEEECCCCCHHHHHHHHHH-CCEEEEC T ss_conf 9899999997479858999666666777653046999999999744-8973874486999999999976-8857973 No 393 >PRK13135 consensus Probab=96.02 E-value=0.099 Score=31.84 Aligned_cols=181 Identities=14% Similarity=0.153 Sum_probs=103.8 Q ss_pred CCCCCCCCC---CCCCCCCCCCCCHHHHHHHHHHCCC-CEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEE Q ss_conf 467244002---6786325730358999999986553-306457479978688980210001017422326568765556 Q gi|254780889|r 151 AQQAVGELM---TRNLITVKKTVNLENAKALLHQHRI-EKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAA 226 (493) Q Consensus 151 ~~~~V~eiM---t~~litv~~~~~l~eA~~iM~~~ki-~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgA 226 (493) ++.|+.|-. ......+..+.++.+.++++.+-+- ...|+| +.|....+-+ T Consensus 54 fSDP~ADGPvIq~A~~rAL~~G~~~~~~~~~~~~~r~~~~~Piv--------lM~Y~N~i~~------------------ 107 (267) T PRK13135 54 FSDPMADGPTIQLSSERALAAGTTLPRILAMVRSVRRRCQVPIV--------LMGYYNPIFA------------------ 107 (267) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCEE--------EEECCHHHHH------------------ T ss_conf 89866658999999999997698499999999986335899889--------9842309988------------------ Q ss_pred EEECCCHHHHHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECC Q ss_conf 651370156789998731366-6998248411388999999999857987245146469999999985487589961027 Q gi|254780889|r 227 AVSVAKDIADRVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGP 305 (493) Q Consensus 227 AIg~~~d~~eRa~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~ 305 (493) .....-+..+.++|+| +|+.|...-.+....+..+ +..++-+++++ .-.+.+-.+.+.+..-..|=+ + T Consensus 108 -----yG~e~F~~~~~~~GvdGlIipDLP~ee~~~~~~~~~--~~~l~~I~lvs-Ptt~~~Ri~~i~~~s~GFiY~-V-- 176 (267) T PRK13135 108 -----YGLERFAADAAAAGVDGVLLVDLPPEEAEEFKACAD--RHGLDVIFLLT-PTSDESRIRTVARLGRGFVYY-V-- 176 (267) T ss_pred -----HHHHHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHH--HCCCCEEEEEC-CCCCHHHHHHHHHCCCCEEEE-E-- T ss_conf -----468999999997499747637899788899999998--72961899808-989579999999618981899-8-- Q ss_pred CCCCCCCHHCCCCCCCCHHHHHHH-HHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 876653100015743100378899-98622698199726543777621336714878993230101216 Q gi|254780889|r 306 GSICTTRVVTGVGCPQLSAIMSVV-EVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 306 Gs~CtTr~~~g~g~pq~~av~~~~-~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) +..-+||..-+--..+.+-- +..+..++||..-=||+++-|+ +.+..+||.|-+||.|-..-| T Consensus 177 ----s~~GvTG~~~~~~~~~~~~i~~ik~~t~~Pv~vGFGI~~~e~v-~~i~~~ADGvIVGSaiVk~ie 240 (267) T PRK13135 177 ----SVTGVTGARSGVEATVGGNVAKIREKITVPVVVGFGISTPQQA-ADVAAMADGVVVGSALVKLFE 240 (267) T ss_pred ----ECCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCHHHH-HHHHCCCCEEEECHHHHHHHH T ss_conf ----5456667764444889999999986068984898167999999-999805999998789999998 No 394 >cd04724 Tryptophan_synthase_alpha Ttryptophan synthase (TRPS) alpha subunit (TSA). TPRS is a bifunctional tetrameric enzyme (2 alpha and 2 beta subunits) that catalyzes the last two steps of L-tryptophan biosynthesis. Alpha and beta subunit catalyze two distinct reactions which are both strongly stimulated by the formation of the complex. The alpha subunit catalyzes the cleavage of indole 3-glycerol phosphate (IGP) to indole and d-glyceraldehyde 3-phosphate (G3P). Indole is then channeled to the active site of the beta subunit, a PLP-dependent enzyme that catalyzes a replacement reaction to convert L-serine into L-tryptophan. Probab=96.01 E-value=0.097 Score=31.90 Aligned_cols=170 Identities=15% Similarity=0.183 Sum_probs=99.6 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCCC-EEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHH Q ss_conf 863257303589999999865533-0645747997868898021000101742232656876555665137015678999 Q gi|254780889|r 162 NLITVKKTVNLENAKALLHQHRIE-KLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGP 240 (493) Q Consensus 162 ~litv~~~~~l~eA~~iM~~~ki~-~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~ 240 (493) .-..+..+.++.+.++++.+-+.. ..|++ +.|....+... ....-++. T Consensus 51 ~~~aL~~g~~~~~~~~~~~~~r~~~~~piv--------lM~Y~N~i~~~-----------------------G~e~F~~~ 99 (242) T cd04724 51 SERALANGVTLKDVLELVKEIRKKNTIPIV--------LMGYYNPILQY-----------------------GLERFLRD 99 (242) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHCCCCCEE--------EEEECHHHHHH-----------------------CHHHHHHH T ss_conf 999997699499999999998734798889--------99844576652-----------------------89999999 Q ss_pred HHHCCCC-EEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCC Q ss_conf 8731366-699824841138899999999985798724514646999999998548758996102787665310001574 Q gi|254780889|r 241 LFDVNVD-LVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGC 319 (493) Q Consensus 241 LveaGvD-viviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~ 319 (493) +.++|+| +|+.|....++....+..+ +....-+++++.| .+.+-.+.+.+..-..+= -+ +..-+||... T Consensus 100 ~~~~Gv~GviipDLP~ee~~~~~~~~~--~~~i~~I~lvsPt-t~~~ri~~i~~~s~gfiY-~v------s~~GvTG~~~ 169 (242) T cd04724 100 AKEAGVDGLIIPDLPPEEAEEFREAAK--EYGLDLIFLVAPT-TPDERIKKIAELASGFIY-YV------SRTGVTGART 169 (242) T ss_pred HHHCCCCEEEECCCCHHHHHHHHHHHH--HCCCEEEEEECCC-CCHHHHHHHHHHCCCEEE-EE------ECCCCCCCCC T ss_conf 997599758706999578468999998--6598388996898-878999999974798499-98------5777778775 Q ss_pred CCCHHHHHHHHHHH-CCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 31003788999862-2698199726543777621336714878993230101216 Q gi|254780889|r 320 PQLSAIMSVVEVAE-RAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 320 pq~~av~~~~~~~~-~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) +....+.+.-+..+ ..++||..-=||+++-|+.+.... ||.|-+||.|-..-| T Consensus 170 ~~~~~~~~~i~~ik~~t~~Pv~vGFGI~~~e~v~~~~~~-aDGvIVGSa~V~~i~ 223 (242) T cd04724 170 ELPDDLKELIKRIRKYTDLPIAVGFGISTPEQAAEVAKY-ADGVIVGSALVKIIE 223 (242) T ss_pred CCHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHH-CCEEEECHHHHHHHH T ss_conf 564999999999987168974874387999999999965-999998789999999 No 395 >pfam04131 NanE Putative N-acetylmannosamine-6-phosphate epimerase. This family represents a putative ManNAc-6-P-to-GlcNAc-6P epimerase in the N-acetylmannosamine (ManNAc) utilisation pathway found mainly in pathogenic bacteria. Probab=96.00 E-value=0.027 Score=35.92 Aligned_cols=70 Identities=27% Similarity=0.311 Sum_probs=51.0 Q ss_pred EEEECCCHHHHHHHHHHHCCCCEEEEECCCCHH------HHHHHHHHHHHHHCCCCCEEC-CCCCCHHHHHHHHHCCCCE Q ss_conf 665137015678999873136669982484113------889999999998579872451-4646999999998548758 Q gi|254780889|r 226 AAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHS------QKVLDAVVQIKKNFPSLLVMA-GNIATAEGALALIDAGADI 298 (493) Q Consensus 226 AAIg~~~d~~eRa~~LveaGvDviviD~ahGh~------~~~~~~i~~~k~~~~~~~ii~-GNv~t~~~~~~l~~~Gad~ 298 (493) |-+++.+ -+..-.++|+|++---. .||. .--++.++.+++. .+++|+ |++.|+|.+..-+++||++ T Consensus 97 AD~st~e----ea~~A~~~G~D~I~TTL-~GYT~~t~~~~pD~~ll~~l~~~--~~pvIaEGri~tPe~a~~a~~~GA~a 169 (192) T pfam04131 97 ADCSTFE----EGLNAHKLGVDIVGTTL-SGYTGGSNPAEPDFQLVKTLSEA--GCFVIAEGRYNTPELAKKAIEIGADA 169 (192) T ss_pred EECCCHH----HHHHHHHCCCCEEECCC-CCCCCCCCCCCCCHHHHHHHHHC--CCCEEEECCCCCHHHHHHHHHCCCCE T ss_conf 9749999----99999985999998232-55789999999978999999868--99399857989999999999839989 Q ss_pred EEEE Q ss_conf 9961 Q gi|254780889|r 299 IKVG 302 (493) Q Consensus 299 ikvG 302 (493) |-|| T Consensus 170 VVVG 173 (192) T pfam04131 170 VTVG 173 (192) T ss_pred EEEC T ss_conf 9989 No 396 >PRK13115 consensus Probab=95.99 E-value=0.13 Score=31.10 Aligned_cols=181 Identities=15% Similarity=0.130 Sum_probs=104.6 Q ss_pred CCCCCCC---CCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEE Q ss_conf 4672440---0267863257303589999999865533064574799786889802100010174223265687655566 Q gi|254780889|r 151 AQQAVGE---LMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAA 227 (493) Q Consensus 151 ~~~~V~e---iMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAA 227 (493) ++.|+.| |+.-....+..++++.+.++++...+-...|++ +.|.-..+-+ T Consensus 61 FSDP~ADGPvIQ~A~~rAL~~G~~~~~~f~~v~~~~~~~~Piv--------lM~Y~N~i~~------------------- 113 (269) T PRK13115 61 YSDPVMDGPVIQAATEAALRGGVRVRDVFRAVEAISAAGGRAL--------VMTYWNPVLR------------------- 113 (269) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCEE--------EHHHHHHHHH------------------- T ss_conf 8885666899999999999779959999999998415799888--------5475489987------------------- Q ss_pred EECCCHHHHHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCC Q ss_conf 51370156789998731366-69982484113889999999998579872451464699999999854875899610278 Q gi|254780889|r 228 VSVAKDIADRVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPG 306 (493) Q Consensus 228 Ig~~~d~~eRa~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~G 306 (493) ....+-++.+.++|+| +|+.|...-.+....+..+ +..+.-+++++.+- +.+-.+.+.+..-..|-+= T Consensus 114 ----yG~e~F~~~~~~~GvdGvIipDLP~eE~~~~~~~~~--~~gi~~I~LvaPtt-~~eRi~~i~~~a~GFIY~V---- 182 (269) T PRK13115 114 ----YGVDRFARDLAAAGGAGLITPDLIPDEAGEWLAASE--RHGLDRIFLVAPSS-TPERLAETVEASRGFVYAA---- 182 (269) T ss_pred ----HCHHHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHH--HCCEEEEEEECCCC-CHHHHHHHHHCCCCCEEEE---- T ss_conf ----369999999997399807647899789999999998--65812899858999-8899999984488808997---- Q ss_pred CCCCCCHHCCCCCCCCHHHHH-HHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 766531000157431003788-9998622698199726543777621336714878993230101216 Q gi|254780889|r 307 SICTTRVVTGVGCPQLSAIMS-VVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 307 s~CtTr~~~g~g~pq~~av~~-~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) +..-+||..-.--..+.+ +.+..+..++||..-=||+++-|+.. +...||.|-+||.|-..-| T Consensus 183 ---s~~GvTG~~~~~~~~~~~~i~~ik~~t~~Pv~vGFGIs~~e~~~~-~~~~aDGvIVGSa~V~~i~ 246 (269) T PRK13115 183 ---STMGVTGARDAVSSAAPELVARVRAASDIPVCVGLGVSSAAQAAE-IAGYADGVIVGSALVSALL 246 (269) T ss_pred ---ECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEECCCCCHHHHHH-HHHHCCEEEECHHHHHHHH T ss_conf ---545456776444177999999999717998179727899999999-9802999998689999999 No 397 >PRK11840 bifunctional sulfur carrier protein/thiazole synthase protein; Provisional Probab=95.98 E-value=0.054 Score=33.74 Aligned_cols=126 Identities=28% Similarity=0.312 Sum_probs=76.4 Q ss_pred HHHHHHHHH-HC-CCCEEEEECCCCHHH-------HHHHHHHHHHH-HCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEE Q ss_conf 567899987-31-366699824841138-------89999999998-579872451464699999999854875899610 Q gi|254780889|r 234 IADRVGPLF-DV-NVDLVVVDTAHGHSQ-------KVLDAVVQIKK-NFPSLLVMAGNIATAEGALALIDAGADIIKVGI 303 (493) Q Consensus 234 ~~eRa~~Lv-ea-GvDviviD~ahGh~~-------~~~~~i~~~k~-~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGi 303 (493) ..-|...|. |+ |-|.|=+.+ +|+.. ..++..+.|-+ .| .|+.=--.++--++.|.++|+.+|- T Consensus 152 eAvr~a~lARe~~~t~~iKLEV-i~D~~tL~Pd~~etl~Aae~Lv~eGF---~VlpY~~dDpv~akrLed~Gc~avM--- 224 (327) T PRK11840 152 DAVRTLRLAREAGGWDLVKLEV-LGDQKTLYPDMVETLKAAEVLVKEGF---QVMVYCSDDPIAAKRLEDAGAVAVM--- 224 (327) T ss_pred HHHHHHHHHHHHCCCCEEEEEE-ECCCCCCCCCHHHHHHHHHHHHHCCC---EEEEEECCCHHHHHHHHHCCCEEEE--- T ss_conf 9999999999855998589998-07976679985899999999997898---8988716986899999875983886--- Q ss_pred CCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 2787665310001574310037889998622698199726543777621336714878993230101216 Q gi|254780889|r 304 GPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 304 g~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) =-||.- =+|-|.-....+.-.. ...++|||-|.||-.+.|.++|+-+|+|+|++-+..|.... T Consensus 225 PlgsPI----GSg~Gi~n~~~i~~i~---e~~~vpvivDAGiG~pS~A~~aMElG~daVL~NTAiA~a~~ 287 (327) T PRK11840 225 PLGAPI----GSGLGIQNPYTIRLIV---EGAKVPVLVDAGVGTASDAAVAMELGCDGVLMNTAIAEAKN 287 (327) T ss_pred ECCCCC----CCCCCCCCHHHHHHHH---HHCCCCEEEECCCCCHHHHHHHHHCCCCEEEHHHHHHCCCC T ss_conf 224523----4788868999999999---73699789957989878999998636666663367672699 No 398 >cd02812 PcrB_like PcrB_like proteins. One member of this family, a protein from Archaeoglobus fulgidus, has been characterized as a (S)-3-O-geranylgeranylglyceryl phosphate synthase (AfGGGPS). AfGGGPS catalyzes the formation of an ether linkage between sn-glycerol-1-phosphate (G1P) and geranylgeranyl diphosphate (GGPP), the committed step in archaeal lipid biosynthesis. Therefore, it has been proposed that PcrB-like proteins are either prenyltransferases or are involved in lipoteichoic acid biosynthesis although the exact function is still unknown. Probab=95.98 E-value=0.12 Score=31.24 Aligned_cols=57 Identities=28% Similarity=0.416 Sum_probs=43.7 Q ss_pred CCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCC-EECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 136669982484113889999999998579872-45146469999999985487589961 Q gi|254780889|r 244 VNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLL-VMAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 244 aGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~-ii~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) -|--++-++- -|-. .-.++++..|+...+++ +++|++-++|.++.+.+||||.|-|| T Consensus 147 lg~~~iYLEg-SGa~-v~~e~V~~vk~~l~~~~LivGGGIrs~e~a~~~~~AgAD~IVvG 204 (219) T cd02812 147 LGMPIVYLEY-SGAY-GPPEVVRAVKKVLGDTPLIVGGGIRSGEQAKEMAEAGADTIVVG 204 (219) T ss_pred HCCCEEEEEC-CCCC-CCHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCEEEEC T ss_conf 2993899956-8997-99999999998467970999289799999999998699999988 No 399 >TIGR01186 proV glycine betaine/L-proline transport ATP binding subunit; InterPro: IPR005892 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family comprises the glycine betaine/L-proline ATP binding subunit in bacteria and its equivalents in archaea. This transport system belong to the larger ATP-Binding Cassette (ABC) transporter superfamily. The characteristic feature of these transporters is the obligatory coupling of ATP hydrolysis to substrate translocation. Functionally, this transport system is involved in osmoregulation. Under conditions of stress, the organism recruits this transport system to accumulate glycine betaine and other solutes which offer osmo-protection. It has been demonstrated that glycine betaine uptake is accompanied by symport with sodium ions. The locus has been named variously as proU or opuA. A gene library from L. lactis functionally complements an Escherichia coli proU mutant. The complementing locus is similar to a opuA locus in Bacillus subtlis. This clarifies the differences in nomenclature.; GO: 0005524 ATP binding, 0015171 amino acid transmembrane transporter activity, 0006865 amino acid transport, 0016020 membrane. Probab=95.94 E-value=0.014 Score=37.97 Aligned_cols=140 Identities=21% Similarity=0.328 Sum_probs=84.6 Q ss_pred EEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHC----------CCC------CCCCC---CCCCCCCCC Q ss_conf 89946543034565667605041678668860278872354500----------143------44672---440026786 Q gi|254780889|r 103 VTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVR----------FAS------NAQQA---VGELMTRNL 163 (493) Q Consensus 103 Vti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir----------~~~------~~~~~---V~eiMt~~l 163 (493) |.++-| +.||+++-.+ +.+..+ |.++=+=|--+|. |.. ..+-. |...|...+ T Consensus 185 vFitHD--lDEA~rigDR-----Ivilk~--GeiVQvGTPdeIL~NPaneyVe~F~~~~dl~qv~~P~~~~i~~~~~~~~ 255 (372) T TIGR01186 185 VFITHD--LDEALRIGDR-----IVILKA--GEIVQVGTPDEILRNPANEYVEEFIGKVDLSQVLSPDAERIAKRMNTVP 255 (372) T ss_pred EEEECC--HHHHHHHHHH-----HHHEEC--CCEEEECCCHHHHCCCCHHHHHHHHCCCEEEEECCCCHHHHHHHCCCCC T ss_conf 999417--6788767513-----201106--8678842846874288067999873751157523888899996527732 Q ss_pred CCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCC--EEEEEEEECCCHHHHHHHHH Q ss_conf 3257303589999999865533064574799786889802100010174223265687--65556651370156789998 Q gi|254780889|r 164 ITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGR--LRVAAAVSVAKDIADRVGPL 241 (493) Q Consensus 164 itv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~gr--L~VgAAIg~~~d~~eRa~~L 241 (493) ++.+++-.+..|+++|.+.+++.+.|||+.++|.|+|...++-++........|-... +-|- =+|. -.|-.... T Consensus 256 i~~t~~~gp~~Al~Lm~~~~~~s~yvv~~~~~l~G~v~~~~~~~a~~~~~~~~~~l~~~~~tV~--~~t~--L~e~~~~v 331 (372) T TIGR01186 256 ITKTADKGPRSALKLMRDERVDSLYVVDRQNKLVGVVDVESIKQARKKAQSLQDVLIDDILTVD--EGTL--LRELLRKV 331 (372) T ss_pred EEECCCCCHHHHHHHHHHCCCCEEEEEECCCEEEEEEEHHHHHHHHHHHHHHHHHHHHCCCCCC--CCCC--HHHHHHHH T ss_conf 6865898878999999865971799997285488778588999888787789998620011105--8760--48999998 Q ss_pred HHCCCC-EEEEECCC Q ss_conf 731366-69982484 Q gi|254780889|r 242 FDVNVD-LVVVDTAH 255 (493) Q Consensus 242 veaGvD-viviD~ah 255 (493) .++++. +=|||=-| T Consensus 332 ~~~~~~YvpVVDE~~ 346 (372) T TIGR01186 332 LKAGIKYVPVVDEDQ 346 (372) T ss_pred HHCCCCEEEEECCCC T ss_conf 507995666661456 No 400 >TIGR03572 WbuZ glycosyl amidation-associated protein WbuZ. This clade of sequences is highly similar to the HisF protein, but generally represents the second HisF homolog in the genome where the other is an authentic HisF observed in the context of a complete histidine biosynthesis operon. The similarity between these WbuZ sequences and true HisFs is such that often the closest match by BLAST of a WbuZ is a HisF. Only by making a multiple sequence alignment is the homology relationship among the WbuZ sequences made apparent. WbuZ genes are invariably observed in the presence of a homolog of the HisH protein (designated WbuY) and a proposed N-acetyl sugar amidotransferase designated in WbuX in E. coli, IfnA in P. aeriginosa and PseA in C. jejuni. Similarly, this trio of genes is invariably found in the context of saccharide biosynthesis loci. It has been shown that the WbuYZ homologs are not essential components of the activity expressed by WbuX, leading to the proposal that these to pr Probab=95.90 E-value=0.12 Score=31.22 Aligned_cols=70 Identities=19% Similarity=0.325 Sum_probs=50.7 Q ss_pred CHHHHHHHHHHHCCCC-EEEEEC--CCCHHHHHHHHHHHHHHHCCCCCE-ECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 0156789998731366-699824--841138899999999985798724-5146469999999985487589961 Q gi|254780889|r 232 KDIADRVGPLFDVNVD-LVVVDT--AHGHSQKVLDAVVQIKKNFPSLLV-MAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 232 ~d~~eRa~~LveaGvD-viviD~--ahGh~~~~~~~i~~~k~~~~~~~i-i~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) .|-.+-++...+.|+| +.++|. |-.....-.+.|+.+.+.. .+|+ ++|.|-|.|.++.|.++|||-|-+| T Consensus 30 gdP~~~ak~~~~~g~d~lhivDld~a~~~~~~n~~~I~~i~~~~-~ipi~vGGGIrs~e~~~~ll~~GadkViig 103 (232) T TIGR03572 30 GDPVNAARIYNAKGADELIVLDIDASKRGREPLFELISNLAEEC-FMPLTVGGGIRSLEDAKKLLSLGADKVSIN 103 (232) T ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHC-CCCEEEEECEEEHHHHHHHHHCCCCEEEEC T ss_conf 89999999999869999999968764348821799999999972-985899713303899999997699689934 No 401 >PRK04281 consensus Probab=95.89 E-value=0.01 Score=38.99 Aligned_cols=70 Identities=17% Similarity=0.318 Sum_probs=50.3 Q ss_pred CHHHHHHHHHHHCCCC-EEEEEC--CCCHHHHHHHHHHHHHHHCCCCCE-ECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 0156789998731366-699824--841138899999999985798724-5146469999999985487589961 Q gi|254780889|r 232 KDIADRVGPLFDVNVD-LVVVDT--AHGHSQKVLDAVVQIKKNFPSLLV-MAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 232 ~d~~eRa~~LveaGvD-viviD~--ahGh~~~~~~~i~~~k~~~~~~~i-i~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) +|-.+-|+...+.|+| +.++|. |........+.|+.+.+.. .+|+ ++|.|-+.|.++.|.++|||-|-+| T Consensus 30 gdP~~~ak~~~~~GadelhivDld~a~~~~~~~~~~I~~i~~~~-~vpi~vGGGIrs~e~~~~ll~~GadkViig 103 (254) T PRK04281 30 GDPVEAAKRYNGEGADELTFLDITASSDNRDTILHIIEEVAGQV-FIPLTVGGGVRTVADIRRLLNAGADKVSIN 103 (254) T ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHC-CCCEEEECCEEECHHHHHHHHCCCCEEEEC T ss_conf 89999999999869999999968898777530899999998507-962899777545188999997699889977 No 402 >cd04730 NPD_like 2-Nitropropane dioxygenase (NPD), one of the nitroalkane oxidizing enzyme families, catalyzes oxidative denitrification of nitroalkanes to their corresponding carbonyl compounds and nitrites. NDP is a member of the NAD(P)H-dependent flavin oxidoreductase family that reduce a range of alternative electron acceptors. Most use FAD/FMN as a cofactor and NAD(P)H as electron donor. Some contain 4Fe-4S cluster to transfer electron from FAD to FMN. Probab=95.89 E-value=0.043 Score=34.46 Aligned_cols=66 Identities=24% Similarity=0.389 Sum_probs=51.0 Q ss_pred HHHHHHHHCCCCEEEEE--CCCCHHH----HHHHHHHHHHHHCCCCC-EECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 78999873136669982--4841138----89999999998579872-45146469999999985487589961 Q gi|254780889|r 236 DRVGPLFDVNVDLVVVD--TAHGHSQ----KVLDAVVQIKKNFPSLL-VMAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 236 eRa~~LveaGvDviviD--~ahGh~~----~~~~~i~~~k~~~~~~~-ii~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) +-++.+.++|+|.|++. -|-||.. .....+..+++.. ++| +.+|.+.|.++.......|||+|-+| T Consensus 113 ~~A~~a~~~GaD~iv~qG~eAGGH~g~~~~~~~~lv~~v~~~~-~ipviaAGGI~~g~~i~aal~lGA~gV~~G 185 (236) T cd04730 113 EEARKAEAAGADALVAQGAEAGGHRGTFDIGTFALVPEVRDAV-DIPVIAAGGIADGRGIAAALALGADGVQMG 185 (236) T ss_pred HHHHHHHHCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHH-CCCEEEECCCCCHHHHHHHHHHCCCEEEEC T ss_conf 9999999818998999777777788987555677999999982-986896546277899999998089799955 No 403 >PRK13597 imidazole glycerol phosphate synthase subunit HisF; Provisional Probab=95.88 E-value=0.13 Score=30.90 Aligned_cols=70 Identities=24% Similarity=0.371 Sum_probs=53.5 Q ss_pred CHHHHHHHHHHHCCCC-EEEEEC--CCCHHHHHHHHHHHHHHHCCCCC-EECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 0156789998731366-699824--84113889999999998579872-45146469999999985487589961 Q gi|254780889|r 232 KDIADRVGPLFDVNVD-LVVVDT--AHGHSQKVLDAVVQIKKNFPSLL-VMAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 232 ~d~~eRa~~LveaGvD-viviD~--ahGh~~~~~~~i~~~k~~~~~~~-ii~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) +|-.+-|+...+.|+| +.++|. |........+.|+++.+... +| -++|.|-|.|.++.|.++|||-|-+| T Consensus 31 gdP~~~a~~~~~~Gad~lhlvDld~a~~~~~~n~~~I~~i~~~~~-vpiqvGGGIrs~e~~~~ll~~GadkViig 104 (252) T PRK13597 31 GDPVEAARAYDEAGADELVFLDISATHEERAILLDVVARVAERVF-IPLTVGGGVRSLEDARKLLLSGADKVSVN 104 (252) T ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHCC-CCEEEECCCCCHHHHHHHHHCCCCEEEEC T ss_conf 899999999998699999999564666686637999999986269-82898477130899999985698779832 No 404 >cd00564 TMP_TenI Thiamine monophosphate synthase (TMP synthase)/TenI. TMP synthase catalyzes an important step in the thiamine biosynthesis pathway, the substitution of the pyrophosphate of 2-methyl-4-amino-5- hydroxymethylpyrimidine pyrophosphate by 4-methyl-5- (beta-hydroxyethyl) thiazole phosphate to yield thiamine phosphate. TenI is a enzymatically inactive regulatory protein involved in the regulation of several extracellular enzymes. This superfamily also contains other enzymatically inactive proteins with unknown functions. Probab=95.86 E-value=0.17 Score=30.11 Aligned_cols=100 Identities=25% Similarity=0.292 Sum_probs=59.9 Q ss_pred HHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECC Q ss_conf 99999857987245146469999999985487589961027876653100015743100378899986226981997265 Q gi|254780889|r 265 VVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGG 344 (493) Q Consensus 265 i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGG 344 (493) +...|+.++.-.+|+--+.+.+.+....+.|+|.+-+|===-+.|-....+..|... ..+.++..++||+|=|| T Consensus 85 ~~~~r~~~~~~~iiG~S~h~~~e~~~a~~~g~DYi~~gpvf~T~tK~~~~~~~g~~~------l~~~~~~~~~Pv~AiGG 158 (196) T cd00564 85 VAEARALLGPDLIIGVSTHSLEEALRAEELGADYVGFGPVFPTPTKPGAGPPLGLEL------LREIAELVEIPVVAIGG 158 (196) T ss_pred HHHHHHHHCCCCEEEECCCCHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCHHH------HHHHHHHCCCCEEEECC T ss_conf 999998728897588247889999998870999388646557898888778778899------99999867999899858 Q ss_pred CCCHHHHEEEEEECCCEEEECCHHHCC Q ss_conf 437776213367148789932301012 Q gi|254780889|r 345 IRFSGDIAKAIAAGSACVMIGSLLAGT 371 (493) Q Consensus 345 i~~~gdi~kAla~GA~~VM~G~~~agt 371 (493) |. ..++..++.+||+.|=+-|.+-.. T Consensus 159 I~-~~ni~~~~~~G~~giAv~s~i~~~ 184 (196) T cd00564 159 IT-PENAAEVLAAGADGVAVISAITGA 184 (196) T ss_pred CC-HHHHHHHHHCCCCEEEEHHHHHCC T ss_conf 99-999999998099999972997779 No 405 >PRK13140 consensus Probab=95.83 E-value=0.18 Score=30.02 Aligned_cols=180 Identities=10% Similarity=0.061 Sum_probs=99.2 Q ss_pred CCCCCCCC---CCCCCCCCCCCCCHHHHHHHHHHCCC-CEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEE Q ss_conf 46724400---26786325730358999999986553-306457479978688980210001017422326568765556 Q gi|254780889|r 151 AQQAVGEL---MTRNLITVKKTVNLENAKALLHQHRI-EKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAA 226 (493) Q Consensus 151 ~~~~V~ei---Mt~~litv~~~~~l~eA~~iM~~~ki-~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgA 226 (493) ++.|+.|- .......+..+.++.+.++++.+-|. ...|+| +.|.-..+-+ T Consensus 51 fSDP~ADGpvIq~A~~rAL~~G~~~~~~~~~~~~~r~~~~~piv--------lM~Y~N~i~~------------------ 104 (257) T PRK13140 51 FSDPLADGPVIQKSATQALKNGMTLKLLFEQLKDIRKEVQIPLI--------LMGYLNPIMQ------------------ 104 (257) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCEE--------EEECHHHHHH------------------ T ss_conf 89877658999999999998699899999999997436898889--------9905599985------------------ Q ss_pred EEECCCHHHHHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHH-HCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEEC Q ss_conf 651370156789998731366-69982484113889999999998-5798724514646999999998548758996102 Q gi|254780889|r 227 AVSVAKDIADRVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKK-NFPSLLVMAGNIATAEGALALIDAGADIIKVGIG 304 (493) Q Consensus 227 AIg~~~d~~eRa~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~-~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig 304 (493) ....+-+..+.++|+| +|+.|...-.+ .-+..+..++ .+.-+++++.+ .+.+-.+.+.+..-..|=+--- T Consensus 105 -----~G~e~F~~~~~~~GvdGlIipDLP~ee~--~~~~~~~~~~~~i~~I~lvaPt-t~~~Ri~~i~~~a~gFiY~vs~ 176 (257) T PRK13140 105 -----YGFEKFCKKCAETGIDGVIIPDLPFDDY--QEHYKEIFEKYGLKNIMLITPQ-TSEERIREIDSHTDGFIYMVSS 176 (257) T ss_pred -----HCHHHHHHHHHHCCCCEEEECCCCHHHH--HHHHHHHHHHCCCCEEEEECCC-CCHHHHHHHHHHCCCCEEECCC T ss_conf -----1799999999984998698359985675--8999999998699779986899-9899999999739996687036 Q ss_pred CCCCCCCCHHCCCCCCCCHHHHHH-HHHHH-CCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCC Q ss_conf 787665310001574310037889-99862-269819972654377762133671487899323010121 Q gi|254780889|r 305 PGSICTTRVVTGVGCPQLSAIMSV-VEVAE-RAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTD 372 (493) Q Consensus 305 ~Gs~CtTr~~~g~g~pq~~av~~~-~~~~~-~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~ 372 (493) .| +||.-..--..+.+- .+..+ ..++||..-=||+++-|+ |.+...||.|-+||.|-..- T Consensus 177 ~G-------vTG~~~~~~~~~~~~i~~ik~~~~~~Pv~vGFGIs~~e~v-~~~~~~aDGvIVGSaivk~i 238 (257) T PRK13140 177 AS-------TTGAQQDFNNQKRAYFKRIKDMNLKNPRMVGFGISNKATF-NAACEYASGAIIGSAFVKLL 238 (257) T ss_pred CC-------CCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCHHHH-HHHHHCCCEEEECHHHHHHH T ss_conf 56-------6688766515689999999982789986998057989999-99983199999879999999 No 406 >PRK08005 ribulose-phosphate 3-epimerase; Validated Probab=95.76 E-value=0.19 Score=29.82 Aligned_cols=174 Identities=21% Similarity=0.181 Sum_probs=92.8 Q ss_pred CHHHHHHHHHHCCCCE--EEEECCCCCEEEEEE-----ECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHH Q ss_conf 5899999998655330--645747997868898-----021000101742232656876555665137015678999873 Q gi|254780889|r 171 NLENAKALLHQHRIEK--LLVVDDDGCCIGLIT-----VKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFD 243 (493) Q Consensus 171 ~l~eA~~iM~~~ki~~--LPVVDe~g~LvGIIT-----~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~Lve 243 (493) .+.+-.+.+.+.++.. +=|. +|+.+-=+| .+++.+....|. | =+|+|- + -..-+..+++ T Consensus 14 ~L~~ei~~l~~~g~d~lHiDIM--DG~FVPNitfg~~~v~~ir~~t~~p~---D--vHLMv~-----~--P~~~i~~~~~ 79 (210) T PRK08005 14 RYAEALTALHDAPLGSLHLDIE--DTSFINNITFGMKTIQAVAQYTRHPL---S--FHLMVS-----S--PQRWLPWLAA 79 (210) T ss_pred HHHHHHHHHHHCCCCEEEEECC--CCCCCCCCCCCHHHHHHHHHCCCCCE---E--EEEEEC-----C--HHHHHHHHHH T ss_conf 9999999999779998998288--89827745629899999986189980---7--999868-----8--8999999997 Q ss_pred CCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHH-HHHHHHHCCCCEEE-EEECCCCCCCCCHHCCCCCCC Q ss_conf 1366699824841138899999999985798724514646999-99999854875899-610278766531000157431 Q gi|254780889|r 244 VNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAE-GALALIDAGADIIK-VGIGPGSICTTRVVTGVGCPQ 321 (493) Q Consensus 244 aGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~-~~~~l~~~Gad~ik-vGig~Gs~CtTr~~~g~g~pq 321 (493) +|+|.+++-.-. ..+..+.++.+|+.--. .=+|=|-.|+- ...++++. .|.|- ..+-||-. |-+. T Consensus 80 ~g~d~it~H~Ea--~~~~~~~i~~Ik~~g~k-~GlAlnP~T~i~~~~~~l~~-vD~VLvMtV~PGf~---------GQ~F 146 (210) T PRK08005 80 IRPGWIFIHAES--VQNPSEILADIRAIGAK-AGLALNPATPLLPYRYLALQ-LDALMIMTSEPDGR---------GQQF 146 (210) T ss_pred CCCCEEEEECCC--CCCHHHHHHHHHHCCCE-EEEEECCCCCHHHHHHHHHC-CCEEEEEEECCCCC---------CCCC T ss_conf 299859993567--76999999999974980-78883799987998730400-79899987789998---------7211 Q ss_pred CHHHHHHHHHHHCC--CCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCC Q ss_conf 00378899986226--9819972654377762133671487899323010121 Q gi|254780889|r 322 LSAIMSVVEVAERA--GVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTD 372 (493) Q Consensus 322 ~~av~~~~~~~~~~--~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~ 372 (493) ...+++--+..+++ ...|+.||||... .+.+...+|||.+-.||.+-+.. T Consensus 147 i~~~~~KI~~~r~~~~~~~I~vDGGIn~~-t~~~~~~aGad~~V~GSaiF~~~ 198 (210) T PRK08005 147 IAAMCEKVSQSREHFPAAECWADGGITLR-AARLLAAAGAQHLVIGRALFTTA 198 (210) T ss_pred CHHHHHHHHHHHHHCCCCCEEEECCCCHH-HHHHHHHCCCCEEEECHHHHCCC T ss_conf 78899999999962877888997887889-99999986999999790653699 No 407 >COG1646 Predicted phosphate-binding enzymes, TIM-barrel fold [General function prediction only] Probab=95.73 E-value=0.19 Score=29.75 Aligned_cols=67 Identities=24% Similarity=0.364 Sum_probs=44.8 Q ss_pred HHHHHHHHHHC--CCCEEEEECCCCHHHHH-HHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 56789998731--36669982484113889-99999999857987245146469999999985487589961 Q gi|254780889|r 234 IADRVGPLFDV--NVDLVVVDTAHGHSQKV-LDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 234 ~~eRa~~Lvea--GvDviviD~ahGh~~~~-~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) +..-+.+|.+. |--++-+.-.-|+..-+ .++++..++.. ..+.+|.+-++|+|+.+.+||||.|-+| T Consensus 151 ~iaa~y~la~~~~g~~~~YlEagsga~~Pv~~e~v~~v~~~~--~LivGGGIrs~E~A~~~a~agAD~IVtG 220 (240) T COG1646 151 DIAAYYALAEKYLGMPVVYLEAGSGAGDPVPVEMVSRVLSDT--PLIVGGGIRSPEQAREMAEAGADTIVTG 220 (240) T ss_pred HHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHHHHCCC--EEEECCCCCCHHHHHHHHHCCCCEEEEC T ss_conf 899999999997198589998068889986889999861455--0898588498999999997179989977 No 408 >COG4175 ProV ABC-type proline/glycine betaine transport system, ATPase component [Amino acid transport and metabolism] Probab=95.72 E-value=0.013 Score=38.16 Aligned_cols=18 Identities=22% Similarity=0.396 Sum_probs=12.3 Q ss_pred HHHHHHCCCCEEEEECCCC Q ss_conf 9998731366699824841 Q gi|254780889|r 238 VGPLFDVNVDLVVVDTAHG 256 (493) Q Consensus 238 a~~LveaGvDviviD~ahG 256 (493) |++| ....|++..|-|-. T Consensus 176 ARAl-a~~~~IlLMDEaFS 193 (386) T COG4175 176 ARAL-ANDPDILLMDEAFS 193 (386) T ss_pred HHHH-CCCCCEEEECCCHH T ss_conf 9987-03998788537023 No 409 >pfam00290 Trp_syntA Tryptophan synthase alpha chain. Probab=95.72 E-value=0.2 Score=29.71 Aligned_cols=67 Identities=25% Similarity=0.350 Sum_probs=48.7 Q ss_pred HHHHHHHHHHCCCCEEEEECCCC-------HHHHHHHHHHHHHHHCCCCCEECC-CCCCHHHHHHHHHCCCCEEEEE Q ss_conf 56789998731366699824841-------138899999999985798724514-6469999999985487589961 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVDTAHG-------HSQKVLDAVVQIKKNFPSLLVMAG-NIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD~ahG-------h~~~~~~~i~~~k~~~~~~~ii~G-Nv~t~~~~~~l~~~Gad~ikvG 302 (493) ..+|.+.+++..-.+|-.=+..| .+....+.++++|+.. ++||++| .|-|+|.++.+. ++||++-|| T Consensus 151 ~~~Ri~~i~~~s~gFiY~vs~~GvTG~~~~~~~~~~~~i~~ik~~t-~~Pv~vGFGIs~~e~v~~~~-~~aDGvIVG 225 (258) T pfam00290 151 SDERLKTISEAASGFVYLVSRAGVTGARNAFNAQLDELVERLKKYT-NVPVAVGFGISTPEHVKKIA-AGADGVIVG 225 (258) T ss_pred CHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHCC-CCCEEEEECCCCHHHHHHHH-CCCCEEEEC T ss_conf 1999999996089808998534456765556388999999998606-99848994579999999998-159999984 No 410 >PRK13126 consensus Probab=95.68 E-value=0.2 Score=29.61 Aligned_cols=79 Identities=25% Similarity=0.306 Sum_probs=52.3 Q ss_pred HHHHHHHHHHCCCCEEEEE--CCCCH--HHHHHHHHHHHHHHCCCCCEECC-CCCCHHHHHHHHHCCCCEEEEEECCCCC Q ss_conf 5678999873136669982--48411--38899999999985798724514-6469999999985487589961027876 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVD--TAHGH--SQKVLDAVVQIKKNFPSLLVMAG-NIATAEGALALIDAGADIIKVGIGPGSI 308 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD--~ahGh--~~~~~~~i~~~k~~~~~~~ii~G-Nv~t~~~~~~l~~~Gad~ikvGig~Gs~ 308 (493) ..+|.+.+.+..-.++-+- -.-|- .....+.++.+|+..+++|+++| .|-|+|.+..+++++||++-|| |. T Consensus 135 ~~~ri~~i~~~s~gfiYvs~~gvTG~~~~~~~~~~i~~ir~~~~~~Pv~vGFGI~t~e~~~~~~~~~aDGvIVG----Sa 210 (237) T PRK13126 135 PHRLLRRLASLEPDFIYLGLYAATGIELPVYVERNVKTIRGLAGDVYLVAGFAIDSPEKAARLVKAGADGVVVG----TA 210 (237) T ss_pred HHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHCCCCEEEEC----HH T ss_conf 39999999985898799986526676415679999999998578997799945399999999986489999987----89 Q ss_pred CCCCHHCC Q ss_conf 65310001 Q gi|254780889|r 309 CTTRVVTG 316 (493) Q Consensus 309 CtTr~~~g 316 (493) +.-++..+ T Consensus 211 iV~~i~~~ 218 (237) T PRK13126 211 FMRRLSSS 218 (237) T ss_pred HHHHHHHH T ss_conf 99999975 No 411 >PRK06512 thiamine-phosphate pyrophosphorylase; Provisional Probab=95.61 E-value=0.21 Score=29.43 Aligned_cols=37 Identities=5% Similarity=0.102 Sum_probs=14.7 Q ss_pred HHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEE Q ss_conf 99999998579872451464699999999854875899 Q gi|254780889|r 263 DAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIK 300 (493) Q Consensus 263 ~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ik 300 (493) +.++|.++.+ .+|+++=.=-|.+-+..++++|||+|= T Consensus 154 ~~l~~~~~~~-~iP~VAIGGIt~~n~~~v~~aGad~vA 190 (221) T PRK06512 154 SLAEWWAEMI-EIPCIVQAGSDLASIVEVAETGAEFVA 190 (221) T ss_pred HHHHHHHHCC-CCCEEEECCCCHHHHHHHHHHCCCEEE T ss_conf 8999999747-999899827899999999981998998 No 412 >PRK13114 consensus Probab=95.59 E-value=0.22 Score=29.40 Aligned_cols=180 Identities=13% Similarity=0.152 Sum_probs=102.2 Q ss_pred CCCCCCCC---CCCCCCCCCCCCCHHHHHHHHHHCCCC--EEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEE Q ss_conf 46724400---267863257303589999999865533--0645747997868898021000101742232656876555 Q gi|254780889|r 151 AQQAVGEL---MTRNLITVKKTVNLENAKALLHQHRIE--KLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVA 225 (493) Q Consensus 151 ~~~~V~ei---Mt~~litv~~~~~l~eA~~iM~~~ki~--~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~Vg 225 (493) ++.|+.|- +......+..+.++.++++++.+-+.. ..|+| +.|....+-+ T Consensus 50 FSDP~ADGpvIq~A~~rAL~~G~~l~~~f~~v~~~r~~~~~~Piv--------lM~Y~N~i~~----------------- 104 (266) T PRK13114 50 FTDPMADGPAIQAANLRSLAAGTTTADIFRIAAEFRQRHPEVPLV--------LMGYANPMVR----------------- 104 (266) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCEE--------EEEEHHHHHH----------------- T ss_conf 888677689999999999986997999999999987418998879--------9863019998----------------- Q ss_pred EEEECCCHHHHHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHH-HCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEE Q ss_conf 6651370156789998731366-69982484113889999999998-579872451464699999999854875899610 Q gi|254780889|r 226 AAVSVAKDIADRVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKK-NFPSLLVMAGNIATAEGALALIDAGADIIKVGI 303 (493) Q Consensus 226 AAIg~~~d~~eRa~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~-~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGi 303 (493) .....-+..+.++|+| +|+.|...-.+ .+..+.+++ .+.-+++++.+ .+.+-.+.+.+..-..| --+ T Consensus 105 ------~G~~~F~~~~~~aGvdG~IipDLP~eE~---~~~~~~~~~~gi~~I~liaPt-t~~~Ri~~i~~~a~gFi-Y~v 173 (266) T PRK13114 105 ------RGPDWFAAECKKAGVDGVICVDIPPEED---AELGPALRAAGIDPIRLATPT-TDAARLPAVLEGASGFL-YYV 173 (266) T ss_pred ------HHHHHHHHHHHHCCCCEEEECCCCHHHH---HHHHHHHHHCCCCEEEEECCC-CCHHHHHHHHHHCCCCE-EEE T ss_conf ------6499999999974997798458997888---999999997499726775699-97999999997389958-998 Q ss_pred CCCCCCCCCHHCCCCCCCCHHHHHH-HHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 2787665310001574310037889-998622698199726543777621336714878993230101216 Q gi|254780889|r 304 GPGSICTTRVVTGVGCPQLSAIMSV-VEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 304 g~Gs~CtTr~~~g~g~pq~~av~~~-~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) +..-+||.-.+...-+.+- .+..+..++||..-=||+++-| ++++...||.|-+||-|-..-| T Consensus 174 ------s~~GvTG~~~~~~~~~~~~i~~ik~~t~~Pv~vGFGIs~~e~-~~~~~~~ADGvIVGSaiVk~I~ 237 (266) T PRK13114 174 ------SVAGITGMQQAAQASIEAAVARIKAATDLPVAVGFGVRTPEQ-AAAIARVADGVVVGSAFVDLIG 237 (266) T ss_pred ------ECCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEECCCCCHHH-HHHHHHCCCEEEECHHHHHHHH T ss_conf ------445566776566588999999999707998699836698999-9999800999998199999998 No 413 >PRK05718 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional Probab=95.59 E-value=0.18 Score=29.99 Aligned_cols=123 Identities=16% Similarity=0.275 Sum_probs=74.7 Q ss_pred CCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEE Q ss_conf 87655566513701567899987313666998248411388999999999857987245146469999999985487589 Q gi|254780889|r 220 GRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADII 299 (493) Q Consensus 220 grL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~i 299 (493) +.+.|||.-=.. .+-++..+++|++++|- -+.+..+++..+.. +++.+ ..+.|+.-+....++|+|.+ T Consensus 65 p~~~vGaGTV~~---~e~~~~a~~aGA~FiVS---P~~~~~v~~~a~~~-----~i~~i-PGv~TpsEi~~A~~~G~~~v 132 (212) T PRK05718 65 PEALIGAGTVLN---PEQLAQAIEAGAQFIVS---PGLTPPLLKACQDG-----PIPLI-PGVNTPSELMLAMELGLRTF 132 (212) T ss_pred CCCEEEEEEECC---HHHHHHHHHCCCCEEEC---CCCCHHHHHHHHHC-----CCCEE-CCCCCHHHHHHHHHCCCCEE T ss_conf 981796533134---88999999849989984---89989999999981-----99765-78699999999998799989 Q ss_pred EEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHH Q ss_conf 9610278766531000157431003788999862269819972654377762133671487899323010 Q gi|254780889|r 300 KVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLA 369 (493) Q Consensus 300 kvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~a 369 (493) |+= |.+.+ | |..-+.++.. . -.++++++-|||. ..++..-|+++.-...-||+|+ T Consensus 133 K~F--PA~~~------g-G~~~lkal~~---p--~p~i~~~ptGGV~-~~N~~~yl~~~~v~avgGS~l~ 187 (212) T PRK05718 133 KFF--PAEAS------G-GVKMLKALAG---P--FSDVRFCPTGGIS-PANYRDYLALPNVLCIGGSWMV 187 (212) T ss_pred EEC--CCCCC------C-CHHHHHHHHC---C--CCCCEEEECCCCC-HHHHHHHHCCCCEEEEECHHHC T ss_conf 978--76101------7-9999999856---5--8998288659989-8789999817886999873528 No 414 >PRK02621 consensus Probab=95.58 E-value=0.11 Score=31.54 Aligned_cols=70 Identities=24% Similarity=0.374 Sum_probs=52.4 Q ss_pred CHHHHHHHHHHHCCCC-EEEEEC--CCCHHHHHHHHHHHHHHHCCCCC-EECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 0156789998731366-699824--84113889999999998579872-45146469999999985487589961 Q gi|254780889|r 232 KDIADRVGPLFDVNVD-LVVVDT--AHGHSQKVLDAVVQIKKNFPSLL-VMAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 232 ~d~~eRa~~LveaGvD-viviD~--ahGh~~~~~~~i~~~k~~~~~~~-ii~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) .|-.+.|+...+.|+| ++++|. |........+.|+++.+... +| -++|.|-|.|.++.|.++|||-|-+| T Consensus 30 gdP~~~ak~~~~~gad~lhivDld~a~~~~~~~~~~I~~i~~~~~-ipi~vGGGIrs~e~~~~ll~~GadkVii~ 103 (254) T PRK02621 30 GDPVELACRYSQAGADELVFLDITATHEGRATLIDVVYRTAEQVF-IPLTVGGGISSLEGIKELLRAGADKVSLN 103 (254) T ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHCC-CCEEEECCEEEHHHHHHHHHCCCCEEEEC T ss_conf 899999999998599999998266765675428999999998679-85899633535799999997499989998 No 415 >pfam04476 DUF556 Protein of unknown function (DUF556). Family of uncharacterized, hypothetical prokaryotic proteins. Probab=95.57 E-value=0.11 Score=31.58 Aligned_cols=148 Identities=20% Similarity=0.227 Sum_probs=81.9 Q ss_pred ECCCHHHHHHHHHHHCCCCEEEE-ECCCCHHH-HHHHHHHHHHHHCCC---CCEECCCCCC-----HHHHHHHHHCCCCE Q ss_conf 13701567899987313666998-24841138-899999999985798---7245146469-----99999998548758 Q gi|254780889|r 229 SVAKDIADRVGPLFDVNVDLVVV-DTAHGHSQ-KVLDAVVQIKKNFPS---LLVMAGNIAT-----AEGALALIDAGADI 298 (493) Q Consensus 229 g~~~d~~eRa~~LveaGvDvivi-D~ahGh~~-~~~~~i~~~k~~~~~---~~ii~GNv~t-----~~~~~~l~~~Gad~ 298 (493) |++ ..+-|....++|+|+|=+ +-++|--- .....++.+++..+. +-...|++-. ...+......|+|. T Consensus 6 Svr--n~~EA~~a~~~GaDIIDvKnP~~GaLGa~~p~vi~~i~~~v~~~k~vSatiGD~p~~p~~is~aa~~~a~~GvDy 83 (235) T pfam04476 6 SPI--SVEEALEAIEGGADIIDVKNPAEGSLGANFPWVIREIRELTPKDKLVSATVGDVPYKPGTVSLAALGAAVSGADY 83 (235) T ss_pred CCC--CHHHHHHHHHCCCCEEEECCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHHHHCCCCCE T ss_conf 479--999999998679998982798888888999999999999856787168984078899516789998755038998 Q ss_pred EEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCC--CCEEEEECCCCC-------HHHHEE-EEEECCCEEEECCHH Q ss_conf 9961027876653100015743100378899986226--981997265437-------776213-367148789932301 Q gi|254780889|r 299 IKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERA--GVAIVADGGIRF-------SGDIAK-AIAAGSACVMIGSLL 368 (493) Q Consensus 299 ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~--~~~iiadGGi~~-------~gdi~k-Ala~GA~~VM~G~~~ 368 (493) ||||+-+. .+... .+.....+.++.+.+ +..+++.+---+ +=+|.+ |..+|.+.+|+ T Consensus 84 VKVGl~~~---~~~~~------~~e~~~~~~~av~~~~~~~~vVav~~AD~~~~~~~~p~~i~~~a~~~G~~gvMi---- 150 (235) T pfam04476 84 IKVGLYGV---KNYDE------AVEVMKNVVRAVKDFDSSKIVVAAGYADAYRIGAVEPLDVPEIAADAGADIAML---- 150 (235) T ss_pred EEEECCCC---CCHHH------HHHHHHHHHHHHHHCCCCCEEEEEEECHHHHHCCCCHHHHHHHHHHCCCCEEEE---- T ss_conf 99943788---86799------999999999998722788669999601033313888356799999759978998---- Q ss_pred HCCCCCCCCEEEECCEEEEEEECCCCHHHHH Q ss_conf 0121677644873680588873320068875 Q gi|254780889|r 369 AGTDESPGDIFLYQGRSFKSYRGMGSVAAME 399 (493) Q Consensus 369 agt~Espg~~~~~~g~~~K~yrGm~S~~A~~ 399 (493) ++..++|+..=.+..+..+...- T Consensus 151 --------DT~~K~g~sl~d~~~~~~L~~fv 173 (235) T pfam04476 151 --------DTAIKDGTTLFDHMKIEDLESFV 173 (235) T ss_pred --------ECCCCCCCCHHHHCCHHHHHHHH T ss_conf --------74667897666649999999999 No 416 >cd04726 KGPDC_HPS 3-Keto-L-gulonate 6-phosphate decarboxylase (KGPDC) and D-arabino-3-hexulose-6-phosphate synthase (HPS). KGPDC catalyzes the formation of L-xylulose 5-phosphate and carbon dioxide from 3-keto-L-gulonate 6-phosphate as part of the anaerobic pathway for L-ascorbate utilization in some eubacteria. HPS catalyzes the formation of D-arabino-3-hexulose-6-phosphate from D-ribulose 5-phosphate and formaldehyde in microorganisms that can use formaldehyde as a carbon source. Both catalyze reactions that involve the Mg2+-assisted formation and stabilization of 1,2-enediolate reaction intermediates. Probab=95.55 E-value=0.081 Score=32.47 Aligned_cols=122 Identities=20% Similarity=0.300 Sum_probs=71.8 Q ss_pred HHHHHHCCCCEEEEECCCCHH-HHHHHHHHHHHHHCCC-CCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHC Q ss_conf 999873136669982484113-8899999999985798-72451464699999999854875899610278766531000 Q gi|254780889|r 238 VGPLFDVNVDLVVVDTAHGHS-QKVLDAVVQIKKNFPS-LLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVT 315 (493) Q Consensus 238 a~~LveaGvDviviD~ahGh~-~~~~~~i~~~k~~~~~-~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~ 315 (493) ++...++|+|.+.+ ||.. ..+++......+.++. +.+-.-++.+.+....+.+.|.|.+-+- +.+... T Consensus 70 a~~~~~~Gad~~tv---h~~~g~~~~~~a~~~a~~~~~~~~v~l~~~~~~~~~~~~~~~~~~~~~~~-------~~~d~~ 139 (202) T cd04726 70 AEMAFKAGADIVTV---LGAAPLSTIKKAVKAAKKYGKEVQVDLIGVEDPEKRAKLLKLGVDIVILH-------RGIDAQ 139 (202) T ss_pred HHHHHHHCCCEEEE---ECCCCHHHHHHHHHHHHHCCCEEEEEEECCCCHHHHHHHHHCCHHHHHHH-------HHHHHH T ss_conf 99999707988999---66689899999999886369679999836899999998885680588999-------878777 Q ss_pred CCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCC Q ss_conf 157431003788999862269819972654377762133671487899323010121 Q gi|254780889|r 316 GVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTD 372 (493) Q Consensus 316 g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~ 372 (493) ..| ..+.+.+..+.....+..+..-||||- .++.+|+.+|||.+.+|+.-...+ T Consensus 140 ~~g--~~~~~~~~~~i~~~~~~~i~VtpGIr~-~t~~~a~~~gad~iVVGR~It~A~ 193 (202) T cd04726 140 AAG--GWWPEDDLKKVKKLLGVKVAVAGGITP-DTLPEFKKAGADIVIVGRAITGAA 193 (202) T ss_pred HCC--CCCHHHHHHHHHHHCCCCEEECCCCCH-HHHHHHHHCCCCEEEECCCCCCCC T ss_conf 528--997089999998623996788999885-409999975999999898345799 No 417 >COG2070 Dioxygenases related to 2-nitropropane dioxygenase [General function prediction only] Probab=95.55 E-value=0.047 Score=34.16 Aligned_cols=66 Identities=26% Similarity=0.357 Sum_probs=51.8 Q ss_pred HHHHHHHCCCCEEEEEC--CCCHHH------HHHHHHHHHHHHCCCCCEE-CCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 89998731366699824--841138------8999999999857987245-146469999999985487589961 Q gi|254780889|r 237 RVGPLFDVNVDLVVVDT--AHGHSQ------KVLDAVVQIKKNFPSLLVM-AGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 237 Ra~~LveaGvDviviD~--ahGh~~------~~~~~i~~~k~~~~~~~ii-~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) -+....++|+|.++..- |-||.. .....+..+++.+..+||| +|.+++.++.......|||+|-+| T Consensus 139 ~A~~~~~~G~d~vI~~g~eAGGH~g~~~~~~~t~~Lv~ev~~~~~~iPViAAGGI~dg~~i~AAlalGA~gVq~G 213 (336) T COG2070 139 EALKAERAGADAVIAQGAEAGGHRGGVDLEVSTFALVPEVVDAVDGIPVIAAGGIADGRGIAAALALGADGVQMG 213 (336) T ss_pred HHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHCCHHHHHH T ss_conf 999998179988994377677868998877318889999999854897898768688699999998441685541 No 418 >PRK13138 consensus Probab=95.55 E-value=0.18 Score=30.00 Aligned_cols=182 Identities=15% Similarity=0.128 Sum_probs=100.5 Q ss_pred CCCCCCCC---CCCCCCCCCCCCCHHHHHHHHHHCCC--CEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEE Q ss_conf 46724400---26786325730358999999986553--30645747997868898021000101742232656876555 Q gi|254780889|r 151 AQQAVGEL---MTRNLITVKKTVNLENAKALLHQHRI--EKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVA 225 (493) Q Consensus 151 ~~~~V~ei---Mt~~litv~~~~~l~eA~~iM~~~ki--~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~Vg 225 (493) ++.|+.|- +......+..+.++.+.++++.+-+. ...|+| +.|.-...-+ T Consensus 50 FSDP~ADGPvIq~A~~rAL~~G~~~~~~~~~~~~ir~~~~~~piv--------lM~Y~N~i~~----------------- 104 (264) T PRK13138 50 FSDPVADGPVIQKAFKRALAHPFSMDKILEITAEIHKLHPEIPLV--------YLTYFNPLFS----------------- 104 (264) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCEE--------EEEEHHHHHH----------------- T ss_conf 888666589999999999977990889744677603358988889--------7521238988----------------- Q ss_pred EEEECCCHHHHHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHH-HCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEE Q ss_conf 6651370156789998731366-69982484113889999999998-579872451464699999999854875899610 Q gi|254780889|r 226 AAVSVAKDIADRVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKK-NFPSLLVMAGNIATAEGALALIDAGADIIKVGI 303 (493) Q Consensus 226 AAIg~~~d~~eRa~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~-~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGi 303 (493) .....-+..+.++|+| +|+.|...-..+ ..+..+.+++ ...-+++++.+ .+.+-.+.+.+. ++++---+ T Consensus 105 ------~G~e~F~~~~~~~GvdGlIipDLP~e~~E-~~~~~~~~~~~~i~~I~liaPt-t~~~Ri~~i~~~-s~gFiY~V 175 (264) T PRK13138 105 ------MGLEAFTERAKNSGIQGLIIPDLPFDTPE-AEEFFSQLERKKIDFIHLVTPA-TTEDRIQSMKSF-ASGFIYYV 175 (264) T ss_pred ------HCHHHHHHHHHHCCCCEEECCCCCCCCHH-HHHHHHHHHHCCCCEEEECCCC-CCHHHHHHHHHH-CCCEEEEE T ss_conf ------48999999998769775853689865033-5999999998699867521799-989999999973-88808987 Q ss_pred CCCCCCCCCHHCCCCCCCCHHHHHHH-HHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 27876653100015743100378899-98622698199726543777621336714878993230101216 Q gi|254780889|r 304 GPGSICTTRVVTGVGCPQLSAIMSVV-EVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 304 g~Gs~CtTr~~~g~g~pq~~av~~~~-~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) +..-+||.--.-...+.+.- +..+..++||..-=||+++.+ +|.+..+||-|-+||.|-..-| T Consensus 176 ------s~~GvTG~~~~~~~~~~~~i~~ik~~t~~Pv~vGFGIs~~e~-~~~~~~~ADGvIVGSaiv~~i~ 239 (264) T PRK13138 176 ------TSYGVTGERGAIASGLEDRIQMVRKIVGLPVCAGFGISTADQ-AKEISTYADGVIIGSAVQKIIE 239 (264) T ss_pred ------ECCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHH-HHHHHHCCCEEEECHHHHHHHH T ss_conf ------545667876555376999999999743898388606798999-9999834999998199999999 No 419 >cd00959 DeoC 2-deoxyribose-5-phosphate aldolase (DERA) of the DeoC family. DERA belongs to the class I aldolases and catalyzes a reversible aldol reaction between acetaldehyde and glyceraldehyde 3-phosphate to generate 2-deoxyribose 5-phosphate. DERA is unique in catalyzing the aldol reaction between two aldehydes, and its broad substrate specificity confers considerable utility as a biocatalyst, offering an environmentally benign alternative to chiral transition metal catalysis of the asymmetric aldol reaction. Probab=95.55 E-value=0.22 Score=29.29 Aligned_cols=116 Identities=25% Similarity=0.313 Sum_probs=73.7 Q ss_pred HHHHHHHHCCCCEE--EEE---CCCCHHHHHHHHHHHHHHHCCC--CCEECCCCC-CHH----HHHHHHHCCCCEEEEEE Q ss_conf 78999873136669--982---4841138899999999985798--724514646-999----99999854875899610 Q gi|254780889|r 236 DRVGPLFDVNVDLV--VVD---TAHGHSQKVLDAVVQIKKNFPS--LLVMAGNIA-TAE----GALALIDAGADIIKVGI 303 (493) Q Consensus 236 eRa~~LveaGvDvi--viD---~ahGh~~~~~~~i~~~k~~~~~--~~ii~GNv~-t~~----~~~~l~~~Gad~ikvGi 303 (493) .-++..++.|+|=| |++ .-.|+...+.+-++.+++..+. +.+|.=.-. |.+ ..+-.+++|||.||-.- T Consensus 73 ~E~~~ai~~GAdEiD~Vin~~~~~~g~~~~v~~ei~~v~~~~~~~~lKVIlEt~~L~~~ei~~a~~~a~~aGadfvKTST 152 (203) T cd00959 73 AEAREAIADGADEIDMVINIGALKSGDYEAVYEEIAAVVEACGGAPLKVILETGLLTDEEIIKACEIAIEAGADFIKTST 152 (203) T ss_pred HHHHHHHHCCCCEEEEECCHHHHHCCHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHHHHCCCEEEECC T ss_conf 99999998299877751239999607099999999999876288826999744659999999999999982978897158 Q ss_pred CCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEE Q ss_conf 27876653100015743100378899986226981997265437776213367148789 Q gi|254780889|r 304 GPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACV 362 (493) Q Consensus 304 g~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~V 362 (493) |=+....| +-.+.-..+... ..+.|=|-||||+.-+...-|.+||+-. T Consensus 153 G~~~~gat----------~e~v~~m~~~~~-~~~giKasGGIrt~~~a~~~l~aGa~ri 200 (203) T cd00959 153 GFGPGGAT----------VEDVKLMKEAVG-GRVGVKAAGGIRTLEDALAMIEAGATRI 200 (203) T ss_pred CCCCCCCC----------HHHHHHHHHHHC-CCCEEECCCCCCCHHHHHHHHHHCCHHE T ss_conf 86889989----------999999999838-7860771589799999999998184122 No 420 >PRK13305 sgbH 3-keto-L-gulonate-6-phosphate decarboxylase; Provisional Probab=95.54 E-value=0.23 Score=29.27 Aligned_cols=132 Identities=16% Similarity=0.117 Sum_probs=83.2 Q ss_pred CCCHHHHHHHHHHHCCCCEEEE-ECCCCHHHHHHHHHHHHHHHCCC--CCEECCCCCCHHHHHHHHHCCCCEEEEEECCC Q ss_conf 3701567899987313666998-24841138899999999985798--72451464699999999854875899610278 Q gi|254780889|r 230 VAKDIADRVGPLFDVNVDLVVV-DTAHGHSQKVLDAVVQIKKNFPS--LLVMAGNIATAEGALALIDAGADIIKVGIGPG 306 (493) Q Consensus 230 ~~~d~~eRa~~LveaGvDvivi-D~ahGh~~~~~~~i~~~k~~~~~--~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~G 306 (493) +-+...--++...++|+|+++| -.|+ ..-..+.++.-++.--. ++++ |+.+.+.++.+.+.|+|.+-+-.| T Consensus 65 t~DaG~~Ea~~a~~aGAD~vTVlg~A~--~~TI~~~~~~a~~~g~~v~vDli--~~~~~~~ak~~~~lgv~~v~~H~g-- 138 (220) T PRK13305 65 VADAGETLAQQAFGAGANWMTIICAAP--LATVEKGHAVAQSCGGEIQIELF--GNWTLDDARDWHRIGVRQAIYHRG-- 138 (220) T ss_pred ECCCHHHHHHHHHHCCCCEEEEECCCC--HHHHHHHHHHHHHCCCEEEEEEC--CCCCHHHHHHHHHCCCCEEEEEEC-- T ss_conf 526259999999865998899956689--79999999999980998999845--899878999999869988999833-- Q ss_pred CCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 7665310001574310037889998622698199726543777621336714878993230101216 Q gi|254780889|r 307 SICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 307 s~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) +..-..|..-...-..-.+...+.+++|-.-|||+ +.++-++..+|++.+..|+...+.+. T Consensus 139 -----~D~q~~g~~~~~~~l~~~k~~~~~~~~vaVAGGI~-~~~i~~~~~~~~~ivIvGraIt~A~d 199 (220) T PRK13305 139 -----RDAQASGQQWGEADLARMKALSDIGLELSITGGIT-PADLPLFKDIRVKAFIAGRALAGAAN 199 (220) T ss_pred -----CCHHHCCCCCCHHHHHHHHHHHHCCCEEEEECCCC-HHHHHHHHCCCCCEEEECHHHCCCCC T ss_conf -----36765189863101999998760696499988878-88999997169989998936518999 No 421 >PRK08782 consensus Probab=95.52 E-value=0.23 Score=29.22 Aligned_cols=122 Identities=16% Similarity=0.273 Sum_probs=77.6 Q ss_pred CEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEE Q ss_conf 76555665137015678999873136669982484113889999999998579872451464699999999854875899 Q gi|254780889|r 221 RLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIK 300 (493) Q Consensus 221 rL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ik 300 (493) .+.+||.-=. +.|-++..+++|++++|- -|.+..+++..+.. +++.+ ..+.|+.-.....++|++.+| T Consensus 68 ~~~vGaGTV~---~~e~~~~a~~aGA~FiVS---P~~~~~v~~~a~~~-----~i~~i-PGv~TpSEi~~A~~~G~~~vK 135 (219) T PRK08782 68 NIVIGAGTVL---SERQLRQSVDAGADFLVT---PGTPAPLARLLADA-----PIPAV-PGAATPTELLTLMGLGFRVCK 135 (219) T ss_pred CCEEEEEEEC---CHHHHHHHHHCCCCEEEC---CCCCHHHHHHHHHC-----CCCEE-CCCCCHHHHHHHHHCCCCEEE T ss_conf 9479999705---899999999849989987---89979999999981-----99764-785999999999987999899 Q ss_pred EEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHH Q ss_conf 610278766531000157431003788999862269819972654377762133671487899323010 Q gi|254780889|r 301 VGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLA 369 (493) Q Consensus 301 vGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~a 369 (493) += |.+.+- |...+.++.. --.++++++-|||. .-++..-|++++-.+..||++. T Consensus 136 lF--PA~~~G-------g~~~lkal~~-----pfp~~~f~pTGGV~-~~N~~~yl~~~~v~~vgGS~l~ 189 (219) T PRK08782 136 LF--PATAVG-------GLQMLKGLAG-----PLSELKLCPTGGIS-ETNAAEFLSQPNVLCIGGSWMV 189 (219) T ss_pred EC--CCHHCC-------CHHHHHHHHC-----CCCCCEEEECCCCC-HHHHHHHHHCCCEEEEECHHHC T ss_conf 77--732208-------4999999847-----69998187679989-8789999807993999882538 No 422 >pfam00218 IGPS Indole-3-glycerol phosphate synthase. Probab=95.47 E-value=0.2 Score=29.70 Aligned_cols=120 Identities=18% Similarity=0.225 Sum_probs=70.3 Q ss_pred HHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCC Q ss_conf 73136669982484113889999999998579872451464699999999854875899610278766531000157431 Q gi|254780889|r 242 FDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQ 321 (493) Q Consensus 242 veaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq 321 (493) -..|+|+|.+=.+--......+.++.. +.+ .+++++ -|.+.+-.....++|++.| ||-+-..-|-..- T Consensus 128 r~~GADaiLLI~~~L~~~~l~~l~~~a-~~l-gl~~Lv-Evh~~~El~~al~~~a~iI--GINNRnL~tf~vd------- 195 (254) T pfam00218 128 RAYGADTVLLIVAVLSDELLEELYEYA-RSL-GMEPLV-EVHNEEELERALALGAKLI--GVNNRNLKTFEVD------- 195 (254) T ss_pred HHCCCCEEEHHHHCCCHHHHHHHHHHH-HHH-CCEEEE-EECCHHHHHHHHHCCCCEE--EECCCCHHHHHCC------- T ss_conf 980888631447119999999999999-984-886798-8689999999984899789--6327884651005------- Q ss_pred CHHHHHHHHHHH--CCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCC Q ss_conf 003788999862--26981997265437776213367148789932301012167764 Q gi|254780889|r 322 LSAIMSVVEVAE--RAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGD 377 (493) Q Consensus 322 ~~av~~~~~~~~--~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~ 377 (493) +..+.+.+. ..++.+||-.||++..|+.+...+|+|+|.+|..|-.. +.|++ T Consensus 196 ---~~~t~~L~~~ip~~~~~VsESGI~~~~di~~l~~~G~~~~LIGe~lm~~-~dp~~ 249 (254) T pfam00218 196 ---LNTTRRLAPMVPEDVLLVAESGISTPEDVEKLAKHGANAFLVGESLMRA-PDVRA 249 (254) T ss_pred ---HHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHCCCCEEEECHHHHCC-CCHHH T ss_conf ---7999999955898987998389999999999998799999989687579-99899 No 423 >pfam02581 TMP-TENI Thiamine monophosphate synthase/TENI. Thiamine monophosphate synthase (TMP) (EC:2.5.1.3) catalyses the substitution of the pyrophosphate of 2-methyl-4-amino-5- hydroxymethylpyrimidine pyrophosphate by 4-methyl-5- (beta-hydroxyethyl)thiazole phosphate to yield thiamine phosphate. This Pfam family also includes the regulatory protein TENI. Probab=95.43 E-value=0.15 Score=30.58 Aligned_cols=89 Identities=25% Similarity=0.362 Sum_probs=42.2 Q ss_pred HHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 99998579872451464699999999854875899610278766531000157431003788999862269819972654 Q gi|254780889|r 266 VQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGI 345 (493) Q Consensus 266 ~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi 345 (493) ...|+.++.-.++.-.+.+.+.+....+.|+|.+-. || +--|.-..+. -|. -.....+.++...+||+|=||| T Consensus 86 ~~~r~~~~~~~iiG~S~h~~~e~~~a~~~gaDYi~~--gp--vf~T~sK~~~-~~~--g~~~~~~~~~~~~~Pv~AiGGI 158 (180) T pfam02581 86 AEARELLGPDKIIGVSTHTLEEALEAEALGADYIGF--GP--VFPTPTKPDA-PPL--GLEGLRAIAEAVNIPVVAIGGI 158 (180) T ss_pred HHHHHHCCCCCEEEECCCCHHHHHHHHHCCCCCEEE--CC--CCCCCCCCCC-CCC--CHHHHHHHHHHCCCCEEEECCC T ss_conf 999986497768974478899999998719980887--47--6777999998-878--9899999998589999999098 Q ss_pred CCHHHHEEEEEECCCEE Q ss_conf 37776213367148789 Q gi|254780889|r 346 RFSGDIAKAIAAGSACV 362 (493) Q Consensus 346 ~~~gdi~kAla~GA~~V 362 (493) . ..++..++.+||+.| T Consensus 159 ~-~~n~~~~~~~Ga~gv 174 (180) T pfam02581 159 T-PENVPEVLEAGADGV 174 (180) T ss_pred C-HHHHHHHHHCCCCEE T ss_conf 9-999999998599889 No 424 >PRK13523 NADPH dehydrogenase NamA; Provisional Probab=95.43 E-value=0.037 Score=34.92 Aligned_cols=88 Identities=20% Similarity=0.175 Sum_probs=58.7 Q ss_pred CHHHHHHHHHHHCCCCEEEEECC-------CCHHHHHHHHHHHHHHHCCCCCEE-CCCCCCHHHHHHHHHCC-CCEEEEE Q ss_conf 01567899987313666998248-------411388999999999857987245-14646999999998548-7589961 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTA-------HGHSQKVLDAVVQIKKNFPSLLVM-AGNIATAEGALALIDAG-ADIIKVG 302 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~a-------hGh~~~~~~~i~~~k~~~~~~~ii-~GNv~t~~~~~~l~~~G-ad~ikvG 302 (493) +|..+-++.|.++|+|+|-+..- +.+..+.+...+.+|+.. +++++ +|.+.+++.++.+++.| ||.| T Consensus 227 ~d~~~~~~~l~~~GvD~i~vs~G~~~~~~~~~~~g~~~~~a~~ik~~~-~ipvi~vG~i~~~~~ae~~l~~G~aD~V--- 302 (337) T PRK13523 227 QDYVQYAKWMKEQGVDLIDVSSGAVVPARIDVYPGYQVPFAEHIKEHA-NIATGAVGLITTGAQAEEILNNNRADLI--- 302 (337) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHC-CCCEEEECCCCCHHHHHHHHHCCCCCHH--- T ss_conf 999999999997499989957885547767778753348999999876-9709998386999999999987994799--- Q ss_pred ECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCC Q ss_conf 0278766531000157431003788999862269 Q gi|254780889|r 303 IGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAG 336 (493) Q Consensus 303 ig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~ 336 (493) ++|+|.+.-=..+.+++.+.+ T Consensus 303 -------------~~gR~~iadPd~p~kaa~~~~ 323 (337) T PRK13523 303 -------------FIGRELLRNPYFPRIAANELG 323 (337) T ss_pred -------------HHHHHHHHCCCHHHHHHHHCC T ss_conf -------------989999989109999997669 No 425 >PRK08255 salicylyl-CoA 5-hydroxylase; Reviewed Probab=95.41 E-value=0.098 Score=31.88 Aligned_cols=89 Identities=19% Similarity=0.256 Sum_probs=61.7 Q ss_pred CHHHHHHHHHHHCCCCEEEEECCCC--------HHHHHHHHHHHHHHHCCCCC-EECCCCCCHHHHHHHHHCC-CCEEEE Q ss_conf 0156789998731366699824841--------13889999999998579872-4514646999999998548-758996 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTAHG--------HSQKVLDAVVQIKKNFPSLL-VMAGNIATAEGALALIDAG-ADIIKV 301 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~ahG--------h~~~~~~~i~~~k~~~~~~~-ii~GNv~t~~~~~~l~~~G-ad~ikv 301 (493) +|..+-++.|-++|+|+|-+-+-.- ...++....++||++.. ++ +-+|++.+++.++.++..| ||.|- T Consensus 643 edsv~la~~l~~~GvD~IdvSsGg~~~~~~p~~g~~yQvpfA~~Ir~e~~-i~t~AVG~I~~p~~Ae~Il~~GrADlVa- 720 (770) T PRK08255 643 DDAVEIARAFKAAGADMIDVSSGQVSKDEKPVYGRMYQTPFADRIRNEAG-IATIAVGAISEADHVNSIIAAGRADLCA- 720 (770) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHCC-CCEEEECCCCCHHHHHHHHHCCCCCEEE- T ss_conf 99999999999749989995788888667788887656699999998759-9789961889999999999769988752- Q ss_pred EECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCC Q ss_conf 102787665310001574310037889998622698 Q gi|254780889|r 302 GIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGV 337 (493) Q Consensus 302 Gig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~ 337 (493) +|+|+|.--+..-.++.+.|. T Consensus 721 ---------------lgR~~L~dP~W~l~aA~~lg~ 741 (770) T PRK08255 721 ---------------LARPHLADPAWTLHEAARIGY 741 (770) T ss_pred ---------------ECHHHHCCCCHHHHHHHHCCC T ss_conf ---------------477765199509999997599 No 426 >PRK03220 consensus Probab=95.38 E-value=0.13 Score=31.09 Aligned_cols=70 Identities=23% Similarity=0.313 Sum_probs=52.6 Q ss_pred CHHHHHHHHHHHCCCC-EEEEEC--CCCHHHHHHHHHHHHHHHCCCCCE-ECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 0156789998731366-699824--841138899999999985798724-5146469999999985487589961 Q gi|254780889|r 232 KDIADRVGPLFDVNVD-LVVVDT--AHGHSQKVLDAVVQIKKNFPSLLV-MAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 232 ~d~~eRa~~LveaGvD-viviD~--ahGh~~~~~~~i~~~k~~~~~~~i-i~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) .|-.+.|+...+.|+| +.++|. |-.......+.|+.+.+.. .+|+ ++|.+-|.|.++.|.++|||-|-+| T Consensus 31 gdP~~~a~~~~~~G~d~lhivDld~a~~g~~~n~~~I~~i~~~~-~~pi~vGGGIrs~e~~~~ll~~GadkVvig 104 (257) T PRK03220 31 GDPVELAAVYDAEGADELTFLDVTASSSGRATMLDVVRRTAEQV-FIPLTVGGGVRTVEDVDSLLRAGADKVSVN 104 (257) T ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHCC-CCCEEEECCCCCHHHHHHHHHCCCCEEECH T ss_conf 89999999999869998999908887567630799999998506-964898478587999999998197508720 No 427 >PRK11815 tRNA-dihydrouridine synthase A; Provisional Probab=95.37 E-value=0.26 Score=28.87 Aligned_cols=98 Identities=21% Similarity=0.286 Sum_probs=58.4 Q ss_pred HHHHHHHHHHHHHCCCCCE----ECC--CCCCHH----HHHHHHHCCCCEEEEEECCCCCCCCCHHCCCC------CCC- Q ss_conf 8899999999985798724----514--646999----99999854875899610278766531000157------431- Q gi|254780889|r 259 QKVLDAVVQIKKNFPSLLV----MAG--NIATAE----GALALIDAGADIIKVGIGPGSICTTRVVTGVG------CPQ- 321 (493) Q Consensus 259 ~~~~~~i~~~k~~~~~~~i----i~G--Nv~t~~----~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g------~pq- 321 (493) ..+.+.++.+++..+ +|| =.| +-.+++ -++.+.++|++.+-|-= -|....|.. +|. T Consensus 119 ~~v~~iv~a~~~a~~-~PVTvK~RlG~d~~d~~~~l~~f~~~~~~aG~~~i~vH~------R~a~l~Glspk~nR~ippl 191 (333) T PRK11815 119 ELVADCVKAMKDAVS-IPVTVKHRIGIDDQDSYEFLCDFVDTVAEAGCDRFIVHA------RKAWLKGLSPKENREIPPL 191 (333) T ss_pred HHHHHHHHHHHHHCC-CCEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEE------HHHHHCCCCHHHHCCCCCH T ss_conf 999999999987348-853578631677775289999999999975998899960------2787726787775058730 Q ss_pred -CHHHHHHHHHHHCC-CCEEEEECCCCCHHHHEEEEEECCCEEEECCH Q ss_conf -00378899986226-98199726543777621336714878993230 Q gi|254780889|r 322 -LSAIMSVVEVAERA-GVAIVADGGIRFSGDIAKAIAAGSACVMIGSL 367 (493) Q Consensus 322 -~~av~~~~~~~~~~-~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~ 367 (493) ...|++.. ++. .+|||+-|||.+.-|+.+.|-- .|.||+|.- T Consensus 192 ~~~~v~~lk---~~~p~ipvi~NGdI~s~~~~~~~l~~-~DGVMiGRg 235 (333) T PRK11815 192 DYDRVYRLK---RDFPHLTIEINGGIKTLEEAKEHLQH-VDGVMIGRA 235 (333) T ss_pred HHHHHHHHH---HHCCCCEEEECCCCCCHHHHHHHHHC-CCEEEEHHH T ss_conf 489999999---76678718845996999999999855-996211486 No 428 >PRK06256 biotin synthase; Validated Probab=95.33 E-value=0.26 Score=28.86 Aligned_cols=185 Identities=17% Similarity=0.201 Sum_probs=97.8 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEE Q ss_conf 35899999998655330645747997868898021000101742232656876555665137015678999873136669 Q gi|254780889|r 170 VNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLV 249 (493) Q Consensus 170 ~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDvi 249 (493) ..+.+..+...+.+..++-+|-.-..+ +..++.+...--.+.|. +-.+.+.+.+|.. ..|-+..|-+||+|-. T Consensus 93 eeI~~~a~~a~~~G~~~~~lvtsg~~~----~~~~~e~v~~~i~~Ik~-~~~l~i~~slG~l--~~e~~~~LkeAGvd~y 165 (325) T PRK06256 93 EEIVEAAKEAIENGAGRFCIVASGRGP----SGREVDQVIEAVKAIKE-ETDLEICACLGLL--TEEQAERLKEAGVDRY 165 (325) T ss_pred HHHHHHHHHHHHCCCCEEEEEEECCCC----CHHHHHHHHHHHHHHHH-CCCEEEEEECCCC--CHHHHHHHHHCCCCEE T ss_conf 999999999998699889998604589----76789999999999862-2893688734889--9999999998699888 Q ss_pred E--EEC----------CCCHHHHHHHHHHHHHHHCCC--CCEECCCCCCHHHH----HHHHHCCCCEEEEEEC---CCCC Q ss_conf 9--824----------841138899999999985798--72451464699999----9998548758996102---7876 Q gi|254780889|r 250 V--VDT----------AHGHSQKVLDAVVQIKKNFPS--LLVMAGNIATAEGA----LALIDAGADIIKVGIG---PGSI 308 (493) Q Consensus 250 v--iD~----------ahGh~~~~~~~i~~~k~~~~~--~~ii~GNv~t~~~~----~~l~~~Gad~ikvGig---~Gs~ 308 (493) . +++ .|-+.+ =+++++..++.--. .-.|.|=-.|.|.- ..|-+-++|.|-++.= +| T Consensus 166 ~~nlETs~~~f~~i~~tht~~~-Rl~ti~~a~~aGi~vcsG~i~GlGEt~edrve~l~~Lr~l~~~sipin~l~P~~g-- 242 (325) T PRK06256 166 NHNLETSRSYFPNVVTTHTYED-RVDTCEMVKAAGIEPCSGGIIGMGETLEDRAEHAFFLKELDADSIPINFLNPIKG-- 242 (325) T ss_pred CCCCCCCHHHCCCCCCCCCHHH-HHHHHHHHHHCCCCCCCCEEECCCCCHHHHHHHHHHHHCCCCCEEECCCCEECCC-- T ss_conf 6664406876388689988999-9999999998599646643766899989999999999719998895467010699-- Q ss_pred CCCCHHCCCCCCCCHHHHHHHHHHH-CC---CCEEEEECC-CCCHHHHEEEEEECCCEEEECCHHH Q ss_conf 6531000157431003788999862-26---981997265-4377762133671487899323010 Q gi|254780889|r 309 CTTRVVTGVGCPQLSAIMSVVEVAE-RA---GVAIVADGG-IRFSGDIAKAIAAGSACVMIGSLLA 369 (493) Q Consensus 309 CtTr~~~g~g~pq~~av~~~~~~~~-~~---~~~iiadGG-i~~~gdi~kAla~GA~~VM~G~~~a 369 (493) |.. -+.|+++......-.|- +. +.-|.--|| -...++.-..+-+||+++|.|.+|- T Consensus 243 --Tpl---~~~~~l~~~e~lr~iAi~Rl~~P~~~Ir~agGr~~~~~~~~~~~~~gan~~~~G~~lT 303 (325) T PRK06256 243 --TPL---EDLPELTPLECLKTIAIFRLINPDKEIRIAGGREIALRSLQPLALKGANSIFVGNYLT 303 (325) T ss_pred --CCC---CCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHCHHHEECCCCC T ss_conf --866---8899989999999999999978995489707855225567999986173514666537 No 429 >KOG4175 consensus Probab=95.32 E-value=0.26 Score=28.78 Aligned_cols=68 Identities=19% Similarity=0.254 Sum_probs=49.5 Q ss_pred HHHHHHHHHHCCCCEEEEECCCC-------HHHHHHHHHHHHHHHCCCCCEECC-CCCCHHHHHHHHHCCCCEEEEE Q ss_conf 56789998731366699824841-------138899999999985798724514-6469999999985487589961 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVDTAHG-------HSQKVLDAVVQIKKNFPSLLVMAG-NIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD~ahG-------h~~~~~~~i~~~k~~~~~~~ii~G-Nv~t~~~~~~l~~~Gad~ikvG 302 (493) ..||.+.|+++---+|-+=+.-| -+..+-+.+.++|+...+.|+.+| .|.|+|-+++...| ||++-|| T Consensus 160 tdeRmell~~~adsFiYvVSrmG~TG~~~svn~~l~~L~qrvrk~t~dtPlAVGFGvst~EHf~qVgsv-aDGVvvG 235 (268) T KOG4175 160 TDERMELLVEAADSFIYVVSRMGVTGTRESVNEKLQSLLQRVRKATGDTPLAVGFGVSTPEHFKQVGSV-ADGVVVG 235 (268) T ss_pred HHHHHHHHHHHHCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHH-CCCEEEC T ss_conf 089999999863255999871456661777888899999999986489862675066878998765203-2544760 No 430 >PRK13119 consensus Probab=95.31 E-value=0.27 Score=28.76 Aligned_cols=169 Identities=14% Similarity=0.169 Sum_probs=99.3 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCC--CEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHH Q ss_conf 86325730358999999986553--3064574799786889802100010174223265687655566513701567899 Q gi|254780889|r 162 NLITVKKTVNLENAKALLHQHRI--EKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVG 239 (493) Q Consensus 162 ~litv~~~~~l~eA~~iM~~~ki--~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~ 239 (493) ....+..+.++.+.++++.+-|. ...|++ +.|....+.+ ....+-++ T Consensus 66 ~~rAL~~G~~~~~~~~~~~~ir~~~~~~piv--------lMtY~N~i~~-----------------------yG~e~F~~ 114 (261) T PRK13119 66 AERALANGISLRDVLDVVRKFRETDTQTPVV--------LMGYLNPVHK-----------------------MGYREFAQ 114 (261) T ss_pred HHHHHHCCCCHHHHHHHHHHHHCCCCCCCEE--------EEECHHHHHH-----------------------HHHHHHHH T ss_conf 9999977997889999999865148998989--------9840378988-----------------------62999999 Q ss_pred HHHHCCCC-EEEEECCCCHHHHHHHHHHHHHH-HCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCC Q ss_conf 98731366-69982484113889999999998-57987245146469999999985487589961027876653100015 Q gi|254780889|r 240 PLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKK-NFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGV 317 (493) Q Consensus 240 ~LveaGvD-viviD~ahGh~~~~~~~i~~~k~-~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~ 317 (493) .+.++|+| +|+.|...-.+....+ .+++ ...-+++++. -.+.+-.+.+.+..-..|=+= +.+-+||. T Consensus 115 ~~~~~GvdGvIipDLP~ee~~~~~~---~~~~~gl~~I~lvaP-tt~~~Ri~~i~~~a~gFiY~v-------s~~GvTG~ 183 (261) T PRK13119 115 EAAKAGVDGVLTVDSPVETIDPLYR---ELKDNGVDCIFLIAP-TTTEDRIKTIAELAGGFVYYV-------SLKGVTGA 183 (261) T ss_pred HHHHCCCCEEEECCCCHHHHHHHHH---HHHHCCCCEEEEECC-CCCHHHHHHHHHHCCCEEEEE-------ECCCCCCC T ss_conf 9997598579836899788799999---999759976443079-998999999997289819997-------36666687 Q ss_pred CCCCCHHHHHH-HHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 74310037889-998622698199726543777621336714878993230101216 Q gi|254780889|r 318 GCPQLSAIMSV-VEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 318 g~pq~~av~~~-~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) .--....+.+- ....+..+.||..-=||+++-|+-+ +..+||.|-+||.|-..-| T Consensus 184 ~~~~~~~~~~~i~~ik~~t~~Pv~vGFGIs~~e~v~~-~~~~aDGvIVGSaiV~~i~ 239 (261) T PRK13119 184 ASLDTDEVSRKIEYLHQYIDIPIGVGFGISNAESARK-IGRVADAVIVGSRIVKEIE 239 (261) T ss_pred CCCCHHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHH-HHCCCCEEEECHHHHHHHH T ss_conf 7555488999999998636998799836599999999-8734999998289999998 No 431 >PRK08508 biotin synthase; Provisional Probab=95.31 E-value=0.27 Score=28.75 Aligned_cols=82 Identities=22% Similarity=0.327 Sum_probs=40.4 Q ss_pred CEEEEEEEECCCHHHHHHH---HHHHCCCCEEEE-----------ECCCCHHHHHHHHHHHHHHHCCCCC--EECCCC-C Q ss_conf 7655566513701567899---987313666998-----------2484113889999999998579872--451464-6 Q gi|254780889|r 221 RLRVAAAVSVAKDIADRVG---PLFDVNVDLVVV-----------DTAHGHSQKVLDAVVQIKKNFPSLL--VMAGNI-A 283 (493) Q Consensus 221 rL~VgAAIg~~~d~~eRa~---~LveaGvDvivi-----------D~ahGh~~~~~~~i~~~k~~~~~~~--ii~GNv-~ 283 (493) ++..|.-+|..+.+.+|++ .|-+-++|-+=| +.........+.+|.-.|-.+|+.. +-+|-- . T Consensus 152 ~vCsGgIiGlGEt~edrve~a~~L~eL~~dsVPIN~liPi~GTPLe~~~l~~~e~lr~iAl~RlilP~a~Ir~agGRe~~ 231 (279) T PRK08508 152 GLCSGGIFGLGESWEDRISMLKSLASLSPHSTPINFFIPNPALPLDTPTLSADEALEIVRLAKEALPNARLMVAGGREVV 231 (279) T ss_pred EEECCCEEECCCCHHHHHHHHHHHHHCCCCEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEECCCHHHH T ss_conf 48678544789998999999999983899875156765899998888999999999999999997898765624652445 Q ss_pred CHHHHHHHHHCCCCEEEEE Q ss_conf 9999999985487589961 Q gi|254780889|r 284 TAEGALALIDAGADIIKVG 302 (493) Q Consensus 284 t~~~~~~l~~~Gad~ikvG 302 (493) ..+.-..+..+||+++-+| T Consensus 232 l~~~q~~~~~aGaN~i~~G 250 (279) T PRK08508 232 FGERQYEIFEAGANAIVIG 250 (279) T ss_pred CHHHHHHHHHHCCCEEEEC T ss_conf 5636999998468468886 No 432 >cd04618 CBS_pair_5 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=95.27 E-value=0.015 Score=37.80 Aligned_cols=42 Identities=38% Similarity=0.530 Sum_probs=20.5 Q ss_pred EEEECCCCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHH Q ss_conf 899465430345656676050416786688602788723545 Q gi|254780889|r 103 VTISPYATLADALALMKKYSISGIPVVESDVGKLVGILTNRD 144 (493) Q Consensus 103 Vti~pd~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rD 144 (493) ++++|+.++.||.+.|.+++++++||+|.+.|..+-++|.+. T Consensus 53 v~i~P~~sL~da~~~l~~~~ihrlPvid~~~~~~~~ilt~~~ 94 (98) T cd04618 53 VSIHPERSLFDAALLLLKNKIHRLPVIDPSTGTGLYILTSRR 94 (98) T ss_pred EECCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEEH T ss_conf 444876129999999986787505216479996589996302 No 433 >COG0800 Eda 2-keto-3-deoxy-6-phosphogluconate aldolase [Carbohydrate transport and metabolism] Probab=95.24 E-value=0.094 Score=31.99 Aligned_cols=122 Identities=22% Similarity=0.371 Sum_probs=80.2 Q ss_pred CEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEE Q ss_conf 76555665137015678999873136669982484113889999999998579872451464699999999854875899 Q gi|254780889|r 221 RLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIK 300 (493) Q Consensus 221 rL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ik 300 (493) +.++||--=.+ .+-+.+++++|++++|- -|-+..+++... .+ +++++ ..+.|+.-+...+++|++++| T Consensus 64 ~~lIGAGTVL~---~~q~~~a~~aGa~fiVs---P~~~~ev~~~a~----~~-~ip~~-PG~~TptEi~~Ale~G~~~lK 131 (211) T COG0800 64 EALIGAGTVLN---PEQARQAIAAGAQFIVS---PGLNPEVAKAAN----RY-GIPYI-PGVATPTEIMAALELGASALK 131 (211) T ss_pred CCEECCCCCCC---HHHHHHHHHCCCCEEEC---CCCCHHHHHHHH----HC-CCCCC-CCCCCHHHHHHHHHCCHHHEE T ss_conf 65882455669---99999999859978989---999999999998----67-99636-887998999999980722456 Q ss_pred EEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHH Q ss_conf 610278766531000157431003788999862269819972654377762133671487899323010 Q gi|254780889|r 301 VGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLA 369 (493) Q Consensus 301 vGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~a 369 (493) .= |++.+ | |.+.+.++ ..--.+++++.-|||... ++..-|++|+.+|=+||+|. T Consensus 132 ~F--Pa~~~------G-g~~~~ka~-----~gP~~~v~~~pTGGVs~~-N~~~yla~gv~avG~Gs~l~ 185 (211) T COG0800 132 FF--PAEVV------G-GPAMLKAL-----AGPFPQVRFCPTGGVSLD-NAADYLAAGVVAVGLGSWLV 185 (211) T ss_pred EC--CCCCC------C-CHHHHHHH-----CCCCCCCEEEECCCCCHH-HHHHHHHCCCEEEECCCCCC T ss_conf 43--73113------7-69899987-----389999858546987877-79999717805995474426 No 434 >PRK00830 consensus Probab=95.24 E-value=0.16 Score=30.36 Aligned_cols=70 Identities=21% Similarity=0.376 Sum_probs=53.3 Q ss_pred CHHHHHHHHHHHCCCC-EEEEEC--CCCHHHHHHHHHHHHHHHCCCCC-EECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 0156789998731366-699824--84113889999999998579872-45146469999999985487589961 Q gi|254780889|r 232 KDIADRVGPLFDVNVD-LVVVDT--AHGHSQKVLDAVVQIKKNFPSLL-VMAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 232 ~d~~eRa~~LveaGvD-viviD~--ahGh~~~~~~~i~~~k~~~~~~~-ii~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) +|-.+-|+...+.|+| ++++|. |.-......+.|+++.+.. .+| -++|.+-+.|.++.|.++|||-|-+| T Consensus 34 gdP~~~ak~~~~~gadelhivDld~a~~g~~~~~~~I~~i~~~~-~~pi~vGGGIrs~e~~~~ll~~GadkVvIg 107 (273) T PRK00830 34 GDPVELAKRYYEDGADELVFLDITASHEGRATMIDVIERTAEEV-FIPLTVGGGIRSIEDIRQILRAGADKVSVN 107 (273) T ss_pred CCHHHHHHHHHHCCCCEEEEEEEECCCCCCCCHHHHHHHHHHHC-CCCEEEECCEEECCCHHHHHHCCCCEEECH T ss_conf 89999999999879998999953246468842799999999866-995896088437732899997698639837 No 435 >PRK07259 dihydroorotate dehydrogenase 1B; Reviewed Probab=95.22 E-value=0.071 Score=32.86 Aligned_cols=76 Identities=28% Similarity=0.389 Sum_probs=52.1 Q ss_pred EEEECC-CHHHHHHHHHHHCCCCEEEE---------EC-------CCCH---H-----HHHHHHHHHHHHHCCCCCEEC- Q ss_conf 665137-01567899987313666998---------24-------8411---3-----889999999998579872451- Q gi|254780889|r 226 AAVSVA-KDIADRVGPLFDVNVDLVVV---------DT-------AHGH---S-----QKVLDAVVQIKKNFPSLLVMA- 279 (493) Q Consensus 226 AAIg~~-~d~~eRa~~LveaGvDvivi---------D~-------ahGh---~-----~~~~~~i~~~k~~~~~~~ii~- 279 (493) .-+++. .|..+-+.++.++|+|.+++ |. .|++ | ...+++++.+++.+ +++||+ T Consensus 162 vKlsP~~~~i~~ia~~~~~~gadgvv~~Nt~~~~~id~~~~~p~~~~~~GGlSG~~l~~~al~~v~~~~~~~-~ipIig~ 240 (301) T PRK07259 162 VKLTPNVTDIVEIAKAAEEAGADGLSLINTLKGMAIDIKTRKPILANVTGGLSGPAIKPIALRMVYQVAKAV-DIPIIGM 240 (301) T ss_pred EEECCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHC-CCCEEEE T ss_conf 980787121999999999759988999567767653235677433578886347335189999999998516-9888976 Q ss_pred CCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 46469999999985487589961 Q gi|254780889|r 280 GNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 280 GNv~t~~~~~~l~~~Gad~ikvG 302 (493) |.|.|++-+.+.+.+||+.|-++ T Consensus 241 GGI~s~~da~e~i~aGAs~VQv~ 263 (301) T PRK07259 241 GGISTAEDAIEFMMAGASAVQVG 263 (301) T ss_pred CCCCCHHHHHHHHHCCCCHHHHH T ss_conf 79799999999998398798721 No 436 >PRK02261 methylaspartate mutase subunit S; Provisional Probab=95.22 E-value=0.18 Score=30.03 Aligned_cols=71 Identities=15% Similarity=0.286 Sum_probs=49.8 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCHHHHHHHHH-HHHHHH-CCCCCEECCCCC------CHHHHHHHHHCCCCEEEEEECC Q ss_conf 56789998731366699824841138899999-999985-798724514646------9999999985487589961027 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVDTAHGHSQKVLDAV-VQIKKN-FPSLLVMAGNIA------TAEGALALIDAGADIIKVGIGP 305 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD~ahGh~~~~~~~i-~~~k~~-~~~~~ii~GNv~------t~~~~~~l~~~Gad~ikvGig~ 305 (493) ..+=+.+.++..+|+|.+-+-+||.......+ +.+++. ..++++++|.+- -.+-...|.+.|+|++ .|| T Consensus 43 pe~~v~~A~~~~ad~igiSsl~G~~~~~~~~l~~~l~~~g~~di~vvvGG~i~i~~~dp~~~~~~L~~~Gv~~V---F~p 119 (137) T PRK02261 43 QEEFIDAAIETDADAILVSSLYGHGEIDCRGLREKCIEAGLGDILLYVGGNLVVGKHDFEEVEKKFKEMGFDRV---FAP 119 (137) T ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCHHHHHHHHHCCCCEE---ECC T ss_conf 99999999873999999711112661279999999996799999699836216788783999999997797988---797 Q ss_pred CC Q ss_conf 87 Q gi|254780889|r 306 GS 307 (493) Q Consensus 306 Gs 307 (493) |+ T Consensus 120 GT 121 (137) T PRK02261 120 GT 121 (137) T ss_pred CC T ss_conf 88 No 437 >cd02812 PcrB_like PcrB_like proteins. One member of this family, a protein from Archaeoglobus fulgidus, has been characterized as a (S)-3-O-geranylgeranylglyceryl phosphate synthase (AfGGGPS). AfGGGPS catalyzes the formation of an ether linkage between sn-glycerol-1-phosphate (G1P) and geranylgeranyl diphosphate (GGPP), the committed step in archaeal lipid biosynthesis. Therefore, it has been proposed that PcrB-like proteins are either prenyltransferases or are involved in lipoteichoic acid biosynthesis although the exact function is still unknown. Probab=95.17 E-value=0.051 Score=33.92 Aligned_cols=33 Identities=21% Similarity=0.544 Sum_probs=31.4 Q ss_pred CCEEEEECCCCCHHHHEEEEEECCCEEEECCHH Q ss_conf 981997265437776213367148789932301 Q gi|254780889|r 336 GVAIVADGGIRFSGDIAKAIAAGSACVMIGSLL 368 (493) Q Consensus 336 ~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~ 368 (493) +.|+|--||||+.-..-.++.+|||.|-.|..+ T Consensus 175 ~~~LivGGGIrs~e~a~~~~~AgAD~IVvGn~i 207 (219) T cd02812 175 DTPLIVGGGIRSGEQAKEMAEAGADTIVVGNIV 207 (219) T ss_pred CCEEEEECCCCCHHHHHHHHHCCCCEEEECCCE T ss_conf 970999289799999999998699999988722 No 438 >PRK00507 deoxyribose-phosphate aldolase; Provisional Probab=95.15 E-value=0.3 Score=28.43 Aligned_cols=68 Identities=32% Similarity=0.451 Sum_probs=47.3 Q ss_pred HHHHH-HHHHCCCCEEEEECCCCHHH--HHHHHHHHHHHHC-CCCCEEC-CCCCCHHHHHHHHHCCCCEEEEEECCC Q ss_conf 67899-98731366699824841138--8999999999857-9872451-464699999999854875899610278 Q gi|254780889|r 235 ADRVG-PLFDVNVDLVVVDTAHGHSQ--KVLDAVVQIKKNF-PSLLVMA-GNIATAEGALALIDAGADIIKVGIGPG 306 (493) Q Consensus 235 ~eRa~-~LveaGvDviviD~ahGh~~--~~~~~i~~~k~~~-~~~~ii~-GNv~t~~~~~~l~~~Gad~ikvGig~G 306 (493) +.++. ..+++|+|+|- ++.|+.. -..+.++.+++.. +.+.|-+ |.|-|++.+..++++||+- +|...| T Consensus 138 i~~a~~~~~~aGadfvK--TSTGf~~~gat~e~v~~m~~~~~~~~giKasGGIrt~~~a~~~l~aGa~r--iGtS~~ 210 (221) T PRK00507 138 KVKACEICKEAGADFVK--TSTGFSTGGATVEDVKLMRETVGPRVGVKASGGIRTLEDALAMIEAGATR--LGTSAG 210 (221) T ss_pred HHHHHHHHHHHCCCEEE--ECCCCCCCCCCHHHHHHHHHHHCCCCEEECCCCCCCHHHHHHHHHHCCHH--CCCCCH T ss_conf 99999999982978786--05887889989999999999728786386778989999999999827513--216758 No 439 >TIGR01235 pyruv_carbox pyruvate carboxylase; InterPro: IPR005930 In the postabsorptive state, blood glucose concentration is kept constant by a combination of glycogenolysis and gluconeogenesis. During prolonged fasting, when hepatic glycogen is exhausted, gluconeogenesis becomes the only source of plasma glucose. Gluconeogenesis (the production of new glucose molecules) occurs mainly in liver and, to a small extent, in kidneys. Pyruvate carboxylase (6.4.1.1 from EC), a member of the biotin-dependent enzyme family, catalyses the ATP-dependent carboxylation of pyruvate to oxaloacetate, thus playing a crucial role in gluconeogenesis.ATP + pyruvate + HCO3- = ADP + phosphate + oxaloacetate Most well characterised forms of active enzyme consist of four identical subunits arranged in a tetrahedron-like structure. Each subunit contains three functional domains: the biotin carboxylation domain, the transcarboxylation domain and the biotin carboxyl carrier domain.; GO: 0004736 pyruvate carboxylase activity, 0006094 gluconeogenesis, 0005737 cytoplasm. Probab=95.15 E-value=0.3 Score=28.41 Aligned_cols=216 Identities=20% Similarity=0.212 Sum_probs=131.5 Q ss_pred CHHHHHHHHHHHHHHCCCEECCCEEEECCCCHHHHHHHHHHHCCCCEEEE---CCCCEEEEEEEEHH-------HH--CC Q ss_conf 98999999998742238815287899465430345656676050416786---68860278872354-------50--01 Q gi|254780889|r 80 SPSEQVAQVHQVKKFESGMVVNPVTISPYATLADALALMKKYSISGIPVV---ESDVGKLVGILTNR-------DV--RF 147 (493) Q Consensus 80 sie~qa~~V~kVKr~e~gmI~dPVti~pd~TI~ea~~lm~~~~is~iPVV---D~~~gkLiGIVT~r-------Di--r~ 147 (493) =+++=++.||.-|+ +|--|+|.+||.+=+.-+++++.=+. ..-...+-+.++.- |+ || T Consensus 537 Gp~~fA~~vr~~k~----------~LL~DTT~RDAHQSLLATRvRthDL~~IA~~~ah~l~~lfSLE~WGGATFDVA~RF 606 (1169) T TIGR01235 537 GPEGFAKWVREQKS----------VLLTDTTFRDAHQSLLATRVRTHDLAKIAPTTAHALPNLFSLECWGGATFDVAMRF 606 (1169) T ss_pred CHHHHHHHHHHHCC----------CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEHHCCCHHHHHHHHH T ss_conf 85889999998578----------22010330256676787887788665301268887303300020586116787651 Q ss_pred CC--CCCC------CCCCCC-----CC-C--CCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCC Q ss_conf 43--4467------244002-----67-8--6325730358999999986553306457479978688980210001017 Q gi|254780889|r 148 AS--NAQQ------AVGELM-----TR-N--LITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLN 211 (493) Q Consensus 148 ~~--~~~~------~V~eiM-----t~-~--litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~ 211 (493) +. ++++ .|-.+. .. + =.|--||--+.+-.+.=.+++++-.-|-|.=|-|-+|.--=| T Consensus 607 L~EDPWeRL~~lR~~vPN~lfQMLLRgANaVGYtnYPDNvv~~Fv~qa~~~GiD~FRvFDsLN~l~~l~l~~d------- 679 (1169) T TIGR01235 607 LHEDPWERLEDLRKEVPNILFQMLLRGANAVGYTNYPDNVVKKFVKQAAQSGIDIFRVFDSLNWLENLKLGMD------- 679 (1169) T ss_pred CCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCHHHHH------- T ss_conf 5788379999998546852468887541000024775348999999888468307870011133210201636------- Q ss_pred CCCCCCCCCCEEEEEEEECCCH-------------HHHHHHHHHHCCCCEEEE-ECCCC-HHHHHHHHHHHHHHHCCCCC Q ss_conf 4223265687655566513701-------------567899987313666998-24841-13889999999998579872 Q gi|254780889|r 212 PNATKDSKGRLRVAAAVSVAKD-------------IADRVGPLFDVNVDLVVV-DTAHG-HSQKVLDAVVQIKKNFPSLL 276 (493) Q Consensus 212 p~A~~D~~grL~VgAAIg~~~d-------------~~eRa~~LveaGvDvivi-D~ahG-h~~~~~~~i~~~k~~~~~~~ 276 (493) |.++ .|.+ |=|||+-++| |.+=|..|++||+-+|+| |.|=- --.-....|+.||+.+|++| T Consensus 680 --Av~~-~gkv-~EAAicYtGDil~~~r~KYdL~YYt~lA~eL~~aGaHIL~iKDMAGlLKP~AAk~L~~ALre~~PD~P 755 (1169) T TIGR01235 680 --AVKE-AGKV-VEAAICYTGDILDPARKKYDLKYYTNLAEELVKAGAHILGIKDMAGLLKPAAAKLLIKALREKIPDLP 755 (1169) T ss_pred --HHHH-CCCE-EEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEHHHHCCCHHHHHHHHHHHHHHCCCCCC T ss_conf --7775-0997-99997520032688897437788999999999709803430101124428999999999973279985 Q ss_pred EECCCCCCHHHH----HHHHHCCCCEEEEEECCCCCCCCCHHCC Q ss_conf 451464699999----9998548758996102787665310001 Q gi|254780889|r 277 VMAGNIATAEGA----LALIDAGADIIKVGIGPGSICTTRVVTG 316 (493) Q Consensus 277 ii~GNv~t~~~~----~~l~~~Gad~ikvGig~Gs~CtTr~~~g 316 (493) |=.=-=-|+-.+ ..-.+||||+|=|-|-+=|--|.|.-.| T Consensus 756 iH~HTHDtSG~ava~~~aaveAGvDvvDvAv~smSG~TSQPS~~ 799 (1169) T TIGR01235 756 IHLHTHDTSGAAVASMLAAVEAGVDVVDVAVDSMSGLTSQPSLG 799 (1169) T ss_pred EEEECCCCHHHHHHHHHHHHHCCCCEEHHHHHHHCCCCHHHHHH T ss_conf 68626661268999999998758861145676410450033389 No 440 >cd04740 DHOD_1B_like Dihydroorotate dehydrogenase (DHOD) class 1B FMN-binding domain. DHOD catalyzes the oxidation of (S)-dihydroorotate to orotate. This is the fourth step and the only redox reaction in the de novo biosynthesis of UMP, the precursor of all pyrimidine nucleotides. DHOD requires FMN as co-factor. DHOD divides into class 1 and class 2 based on their amino acid sequences and cellular location. Members of class 1 are cytosolic enzymes and multimers while class 2 enzymes are membrane associated and monomeric. The class 1 enzymes can be further divided into subtypes 1A and 1B which are homodimers and heterotetrameric proteins, respectively. Probab=95.12 E-value=0.068 Score=33.00 Aligned_cols=68 Identities=29% Similarity=0.410 Sum_probs=48.9 Q ss_pred HHHHHHHHHHHCCCCEEEE---------ECCC----------CHHH-----HHHHHHHHHHHHCCCCCEEC-CCCCCHHH Q ss_conf 1567899987313666998---------2484----------1138-----89999999998579872451-46469999 Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVV---------DTAH----------GHSQ-----KVLDAVVQIKKNFPSLLVMA-GNIATAEG 287 (493) Q Consensus 233 d~~eRa~~LveaGvDvivi---------D~ah----------Gh~~-----~~~~~i~~~k~~~~~~~ii~-GNv~t~~~ 287 (493) |..+-+.++.++|+|.|++ |.-+ |.|- ..+++++.+++.. +++||+ |.|.|++- T Consensus 167 ~i~~ia~~~~~~g~dgiv~~NT~~~~~id~~~~~p~l~~~~GGlSG~~l~~~al~~v~~~~~~~-~ipIig~GGI~s~~d 245 (296) T cd04740 167 DIVEIARAAEEAGADGLTLINTLKGMAIDIETRKPILGNVTGGLSGPAIKPIALRMVYQVYKAV-EIPIIGVGGIASGED 245 (296) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHC-CCCEEEECCCCCHHH T ss_conf 0999999999769988999746787663644467552455787686778899999999998545-888797579799999 Q ss_pred HHHHHHCCCCEEEE Q ss_conf 99998548758996 Q gi|254780889|r 288 ALALIDAGADIIKV 301 (493) Q Consensus 288 ~~~l~~~Gad~ikv 301 (493) +...+.+||+.|-+ T Consensus 246 a~e~i~aGAs~VQi 259 (296) T cd04740 246 ALEFLMAGASAVQV 259 (296) T ss_pred HHHHHHCCCCHHHH T ss_conf 99999839988872 No 441 >PRK13123 consensus Probab=95.10 E-value=0.24 Score=29.08 Aligned_cols=181 Identities=17% Similarity=0.160 Sum_probs=102.5 Q ss_pred CCCCCCCC---CCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEE Q ss_conf 46724400---267863257303589999999865533064574799786889802100010174223265687655566 Q gi|254780889|r 151 AQQAVGEL---MTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAA 227 (493) Q Consensus 151 ~~~~V~ei---Mt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAA 227 (493) ++.|+.|- +......+..+.++.++++.+.+.+. .+|+| +.|.-..+-+ T Consensus 52 FSDPvADGPvIq~A~~rAL~~G~~~~~~~~~~~~~~~-~~Piv--------lMtY~N~i~~------------------- 103 (256) T PRK13123 52 FSDPVADGPVIQLAGLRALASGVSLKAILQALIEKET-QIPLV--------IMTYINPVFQ------------------- 103 (256) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHCCC-CCCEE--------EEEHHHHHHH------------------- T ss_conf 8886665799999899998679969999988763057-99889--------7404258987------------------- Q ss_pred EECCCHHHHHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCC Q ss_conf 51370156789998731366-69982484113889999999998579872451464699999999854875899610278 Q gi|254780889|r 228 VSVAKDIADRVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPG 306 (493) Q Consensus 228 Ig~~~d~~eRa~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~G 306 (493) .....-++.+.++|+| +|+.|..+-.+....+..+. ..+.-+++++.+ .+.+-.+.+.+..-..|=+ + T Consensus 104 ----yG~e~F~~~~~~~GvdGvIipDLP~eE~~~~~~~~~~--~gi~~I~liaPt-t~~~Ri~~i~~~a~GFiY~-V--- 172 (256) T PRK13123 104 ----YGIERFVEDLAETGVKGLIIPDLPYEHQDFIAPLLRD--TDIALIPLVSLT-SPIERQKEIIKEAEGFIYA-V--- 172 (256) T ss_pred ----CCHHHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHHH--CCCCEEEECCCC-CCHHHHHHHHHCCCCCEEE-E--- T ss_conf ----1899999999974997897379996789999999997--699778640899-9388999998607884899-7--- Q ss_pred CCCCCCHHCCCCCCCCHHHHHH-HHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCC Q ss_conf 7665310001574310037889-9986226981997265437776213367148789932301012167 Q gi|254780889|r 307 SICTTRVVTGVGCPQLSAIMSV-VEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDES 374 (493) Q Consensus 307 s~CtTr~~~g~g~pq~~av~~~-~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Es 374 (493) +..-+||..-..-+-+.+. .+..+-.++||..-=||+++-|+.+. ...||.|-+||.|-..-|. T Consensus 173 ---s~~GvTG~~~~~~~~~~~~i~~ik~~t~~Pv~vGFGIs~~e~v~~~-~~~aDGvIVGSaiv~~i~~ 237 (256) T PRK13123 173 ---AVNGVTGKRGNYRDDLDSHLEKLKSIAHIPVLAGFGISTLEDVERF-NAVCDGVIVGSKIVELLHE 237 (256) T ss_pred ---ECCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHH-HHHCCEEEEHHHHHHHHHH T ss_conf ---4455667653333889999999985689987997688999999999-7139999972999999997 No 442 >TIGR01769 GGGP geranylgeranylglyceryl phosphate synthase; InterPro: IPR010946 This entry represents geranylgeranylglyceryl phosphate synthase which catalyses the first committed step in the synthesis of ether-linked membrane lipids in archaea .. Probab=95.10 E-value=0.04 Score=34.69 Aligned_cols=96 Identities=24% Similarity=0.321 Sum_probs=59.0 Q ss_pred CCEEE---EECCCCC--EEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHH Q ss_conf 33064---5747997--868898021000101742232656876555665137015678999873136669982484113 Q gi|254780889|r 184 IEKLL---VVDDDGC--CIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHS 258 (493) Q Consensus 184 i~~LP---VVDe~g~--LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~ 258 (493) ++.+| |+=++|. -+|-|...-..-...+.. +||-...-++.-=--.=.|||- |-+ T Consensus 107 lE~Ip~AY~iv~~G~ktavG~vG~~~~Ip~~~~~~-----------A~~Y~LAA~Y~G~~~vYLEAGS---------GAs 166 (212) T TIGR01769 107 LEVIPTAYLIVGPGGKTAVGYVGKARLIPYDKPEI-----------AAAYALAAKYLGFKLVYLEAGS---------GAS 166 (212) T ss_pred CCCCCEEEEEECCCCCEEEEEEEECCCCCCCCHHH-----------HHHHHHHHHHHHHHHHHHHCCC---------CCC T ss_conf 56365226887588740446520012589986668-----------9999999987413512131057---------866 Q ss_pred HHHH-HHHHHHHHHCCCCC-EECCCCCCHHHHHHHHHCCCCEE Q ss_conf 8899-99999998579872-45146469999999985487589 Q gi|254780889|r 259 QKVL-DAVVQIKKNFPSLL-VMAGNIATAEGALALIDAGADII 299 (493) Q Consensus 259 ~~~~-~~i~~~k~~~~~~~-ii~GNv~t~~~~~~l~~~Gad~i 299 (493) +=+- |+|+.+|+.+|+++ |++|.+-.+|.|..+.++|||.| T Consensus 167 ~Pv~~e~i~~~k~~~~~I~LIVGGGIr~~EiA~~~v~aGAd~I 209 (212) T TIGR01769 167 EPVSPETISLVKKKISSIPLIVGGGIRSPEIALKIVLAGADVI 209 (212) T ss_pred CCCCHHHHHHHHHHCCCCCEEECCCCCCHHHHHHHHHHCCCEE T ss_conf 6786679999998548972775277588899999997089826 No 443 >PRK04169 geranylgeranylglyceryl phosphate synthase-like protein; Reviewed Probab=95.08 E-value=0.12 Score=31.21 Aligned_cols=34 Identities=21% Similarity=0.538 Sum_probs=31.7 Q ss_pred CCEEEEECCCCCHHHHEEEEEECCCEEEECCHHH Q ss_conf 9819972654377762133671487899323010 Q gi|254780889|r 336 GVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLA 369 (493) Q Consensus 336 ~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~a 369 (493) +.|+|--||||+.-..-.+..+|||.|-.|..+- T Consensus 185 ~~~LivGGGIrs~e~a~~~~~aGAD~IVvGn~ie 218 (229) T PRK04169 185 DTPLIVGGGIRSPEQAREMAKAGADTIVVGTIIE 218 (229) T ss_pred CCCEEEECCCCCHHHHHHHHHCCCCEEEECCCEE T ss_conf 9878992896999999999976999999886201 No 444 >PRK00278 trpC indole-3-glycerol-phosphate synthase; Reviewed Probab=95.07 E-value=0.2 Score=29.68 Aligned_cols=122 Identities=18% Similarity=0.287 Sum_probs=69.3 Q ss_pred HHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCC Q ss_conf 73136669982484113889999999998579872451464699999999854875899610278766531000157431 Q gi|254780889|r 242 FDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQ 321 (493) Q Consensus 242 veaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq 321 (493) ...|+|++.+=.+--......+.++..+ .+ .+++.+ -|.+.+-.....++|++. +||-+--.-|-..- T Consensus 130 ~~~GADaiLLI~~~L~~~~l~~l~~~a~-~l-gl~~Lv-Evh~~~El~~a~~~~a~i--IGINnRnL~t~~vd------- 197 (261) T PRK00278 130 RAAGADAILLIVAALDDEQLKELLDLAH-EL-GLDVLV-EVHDEEELERALKLGAPL--IGINNRNLKTFEVD------- 197 (261) T ss_pred HHCCCCCHHHHHHHCCHHHHHHHHHHHH-HH-CCEEEE-EECCHHHHHHHHHCCCCE--EEEECCCCHHCEEC------- T ss_conf 9818985789887558999999999999-82-990797-768999999998479988--98746771120037------- Q ss_pred CHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCC Q ss_conf 00378899986226981997265437776213367148789932301012167764 Q gi|254780889|r 322 LSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGD 377 (493) Q Consensus 322 ~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~ 377 (493) +....+.+... ..++.+||-.||+++.|+.+...+|+|+|.+|.-|-.+ +.|++ T Consensus 198 ~~~~~~L~~~i-p~~~~~VsESGI~~~~d~~~l~~~G~davLIGeslm~~-~dp~~ 251 (261) T PRK00278 198 LDTTERLAPLI-PKDRLLVSESGIFTPEDLKRLAKAGADAFLVGESLMRA-DDPGA 251 (261) T ss_pred HHHHHHHHHHC-CCCCEEEECCCCCCHHHHHHHHHCCCCEEEECHHHHCC-CCHHH T ss_conf 89999999648-99988997999999999999997799999989787679-99899 No 445 >PRK13127 consensus Probab=95.04 E-value=0.32 Score=28.21 Aligned_cols=78 Identities=22% Similarity=0.319 Sum_probs=53.1 Q ss_pred HHHHHHHHHHCCCCEEEE-E------CCCCHHHHHHHHHHHHHHHCCCCCEECC-CCCCHHHHHHHHHCCCCEEEEEECC Q ss_conf 567899987313666998-2------4841138899999999985798724514-6469999999985487589961027 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVV-D------TAHGHSQKVLDAVVQIKKNFPSLLVMAG-NIATAEGALALIDAGADIIKVGIGP 305 (493) Q Consensus 234 ~~eRa~~LveaGvDvivi-D------~ahGh~~~~~~~i~~~k~~~~~~~ii~G-Nv~t~~~~~~l~~~Gad~ikvGig~ 305 (493) ..+|.+.+.+..-.+|-. - ..+.......+.++++|+.. ++|+.+| .|-|+|.++.+.++|||++-|| T Consensus 152 ~~~Ri~~i~~~a~gFiY~vs~~GvTG~~~~~~~~~~~~i~~ik~~t-~~Pv~vGFGI~~~e~v~~~~~~~aDGvIVG--- 227 (262) T PRK13127 152 PEERLKRIDEASSGFVYLVSRLGVTGAREDVEEATFDLLKRARTTC-KNKIAVGFGISKGEHAEELLDAGADGVIVG--- 227 (262) T ss_pred CHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHCC-CCCEEEEEEECCHHHHHHHHHCCCCEEEEC--- T ss_conf 8999999984389818998435556876555288999999999617-998489933488999999986499999987--- Q ss_pred CCCCCCCHHCC Q ss_conf 87665310001 Q gi|254780889|r 306 GSICTTRVVTG 316 (493) Q Consensus 306 Gs~CtTr~~~g 316 (493) |.+.-++..+ T Consensus 228 -Saiv~~i~~~ 237 (262) T PRK13127 228 -SALVDIIAEG 237 (262) T ss_pred -HHHHHHHHHC T ss_conf -8999999966 No 446 >COG0167 PyrD Dihydroorotate dehydrogenase [Nucleotide transport and metabolism] Probab=95.04 E-value=0.23 Score=29.19 Aligned_cols=79 Identities=27% Similarity=0.397 Sum_probs=53.1 Q ss_pred EEEEEECCC-HHHHHHHHHHHCCCCEEE-E---------ECC-------C---CH-----HHHHHHHHHHHHHHC-CCCC Q ss_conf 556651370-156789998731366699-8---------248-------4---11-----388999999999857-9872 Q gi|254780889|r 224 VAAAVSVAK-DIADRVGPLFDVNVDLVV-V---------DTA-------H---GH-----SQKVLDAVVQIKKNF-PSLL 276 (493) Q Consensus 224 VgAAIg~~~-d~~eRa~~LveaGvDviv-i---------D~a-------h---Gh-----~~~~~~~i~~~k~~~-~~~~ 276 (493) |.+-+++.. |..+=|.++.++|+|.|+ + |.- + |. ....+++++.+.+.+ ++++ T Consensus 164 v~vKl~P~~~di~~iA~~~~~~g~Dgl~~~NT~~~~~~i~~~~~~~~~~~~~GGLSG~~ikp~al~~v~~l~~~~~~~ip 243 (310) T COG0167 164 VFVKLAPNITDIDEIAKAAEEAGADGLIAINTTKSGMKIDLETKKPVLANETGGLSGPPLKPIALRVVAELYKRLGGDIP 243 (310) T ss_pred EEEEECCCHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCC T ss_conf 59993888899999999999749858999700366553012345556676777757510027899999999984289974 Q ss_pred EEC-CCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 451-46469999999985487589961 Q gi|254780889|r 277 VMA-GNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 277 ii~-GNv~t~~~~~~l~~~Gad~ikvG 302 (493) ||+ |.|.|.+-|...+.+||+.+-|| T Consensus 244 IIGvGGI~s~~DA~E~i~aGA~~vQv~ 270 (310) T COG0167 244 IIGVGGIETGEDALEFILAGASAVQVG 270 (310) T ss_pred EEEECCCCCHHHHHHHHHCCCCHHEEE T ss_conf 898468696999999998297564041 No 447 >TIGR01163 rpe ribulose-phosphate 3-epimerase; InterPro: IPR000056 Ribulose-phosphate 3-epimerase (5.1.3.1 from EC) (also known as pentose-5-phosphate 3-epimerase or PPE) is the enzyme that converts D-ribulose 5-phosphate into D-xylulose 5-phosphate in Calvin's reductive pentose phosphate cycle. In Alcaligenes eutrophus two copies of the gene coding for PPE are known , one is chromosomally encoded P40117 from SWISSPROT, the other one is on a plasmid Q04539 from SWISSPROT. PPE has been found in a wide range of bacteria, archaebacteria, fungi and plants. All the proteins have from 209 to 241 amino acid residues. The enzyme has a TIM barrel structure.; GO: 0004750 ribulose-phosphate 3-epimerase activity, 0005975 carbohydrate metabolic process. Probab=95.03 E-value=0.15 Score=30.55 Aligned_cols=121 Identities=24% Similarity=0.368 Sum_probs=73.7 Q ss_pred HHHHHHHCCCCEEEEECCCCH-HHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHC Q ss_conf 899987313666998248411-3889999999998579872451464699999999854875899610278766531000 Q gi|254780889|r 237 RVGPLFDVNVDLVVVDTAHGH-SQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVT 315 (493) Q Consensus 237 Ra~~LveaGvDviviD~ahGh-~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~ 315 (493) =++.=+++|+|.|.+ |-= +....++++.||+.= -..-++=|=+|+--+...+=.-+|.|.+= .+.= T Consensus 73 ~~~~Fa~aGA~~I~v---H~Ea~~h~~R~l~~Ik~~G-~~AG~v~NP~TPl~~~~~~L~~~D~VLlM---------SVnP 139 (216) T TIGR01163 73 YIEDFAEAGADIITV---HAEATEHIHRLLQLIKELG-AKAGIVLNPATPLEALEYVLEDVDLVLLM---------SVNP 139 (216) T ss_pred HHHHHHHHCCCEEEE---ECCCCCCHHHHHHHHHHCC-CCEEEEECCCCCHHHHHHHHHHCCEEEEE---------EEEC T ss_conf 788999708998998---4377626799999999718-97068867999987899898762989988---------7607 Q ss_pred CCC----CC-CCHHHHHHHHHHH--CCC--CEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCC Q ss_conf 157----43-1003788999862--269--81997265437776213367148789932301012 Q gi|254780889|r 316 GVG----CP-QLSAIMSVVEVAE--RAG--VAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGT 371 (493) Q Consensus 316 g~g----~p-q~~av~~~~~~~~--~~~--~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt 371 (493) ||| .| .+.=|.+.++..+ +.+ +-|--||||+ .-.|...-.||||.+-.||..=+. T Consensus 140 GFgGQkFIP~~~~Kir~~R~~id~~~~~~~~~ieVDGGv~-~~ni~~~~~AGAD~~VaGSaiF~~ 203 (216) T TIGR01163 140 GFGGQKFIPETLEKIRELRKMIDKLELGLSILIEVDGGVN-EDNIAEVAEAGADILVAGSAIFGA 203 (216) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCC-HHHHHHHHHCCCCEEEEEEEEECC T ss_conf 9988411057899999999999860279955899717989-767999997589899983102088 No 448 >cd01573 modD_like ModD; Quinolinate phosphoribosyl transferase (QAPRTase or QPRTase) present in some modABC operons in bacteria, which are involved in molybdate transport. In general, QPRTases are part of the de novo synthesis pathway of NAD in both prokaryotes and eukaryotes. They catalyse the reaction of quinolinic acid (QA) with 5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence of Mg2+ to produce nicotinic acid mononucleotide (NAMN), pyrophosphate and carbon dioxide. Probab=95.02 E-value=0.15 Score=30.52 Aligned_cols=64 Identities=22% Similarity=0.292 Sum_probs=31.4 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEE Q ss_conf 567899987313666998248411388999999999857987245146469999999985487589 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADII 299 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~i 299 (493) ..+-+...+++|+|.|.+|- --...+.+.++.+++..|.+.+-+-.=.|.+..+.+++.|+|.| T Consensus 192 ~~~q~~~a~~~g~d~I~LDn--~s~~~ik~~v~~~~~~~~~v~ieaSGgI~~~ni~~yA~tGvD~I 255 (272) T cd01573 192 SLEEALAAAEAGADILQLDK--FSPEELAELVPKLRSLAPPVLLAAAGGINIENAAAYAAAGADIL 255 (272) T ss_pred CHHHHHHHHHCCCCEEEECC--CCHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHCCCCEE T ss_conf 99999999846999999779--99999999999974448876999989999999999997399999 No 449 >PRK05848 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=95.01 E-value=0.16 Score=30.40 Aligned_cols=66 Identities=26% Similarity=0.398 Sum_probs=33.1 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEEC-CCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 5678999873136669982484113889999999998579872451-46469999999985487589961 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMA-GNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~-GNv~t~~~~~~l~~~Gad~ikvG 302 (493) ..+-+...+++|+|+|.+|- --.+.+.+.++.+++.+|++.+-+ ||+ |.+....+.+.|+|.|=+| T Consensus 190 ~l~ea~~a~~~gaDiI~LDn--~~~e~~k~~v~~~~~~~~~v~ieaSGGI-~~~ni~~yA~~GvD~Is~g 256 (272) T PRK05848 190 SLEEAKEAMNAGADIVMCDN--MSVEETKEIVAYRNANYPHVLLEASGNI-SLETINAYAKSGVDAISSG 256 (272) T ss_pred CHHHHHHHHHCCCCEEEECC--CCHHHHHHHHHHHHCCCCCEEEEEECCC-CHHHHHHHHHCCCCEEECC T ss_conf 99999999845999899589--9999999999987464887799997999-8999999996599999879 No 450 >PRK09722 allulose-6-phosphate 3-epimerase; Provisional Probab=94.95 E-value=0.34 Score=28.03 Aligned_cols=134 Identities=25% Similarity=0.331 Sum_probs=76.1 Q ss_pred CCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEE--CCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCH-HHHHHHHHCCC Q ss_conf 876555665137015678999873136669982--484113889999999998579872451464699-99999985487 Q gi|254780889|r 220 GRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVD--TAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATA-EGALALIDAGA 296 (493) Q Consensus 220 grL~VgAAIg~~~d~~eRa~~LveaGvDviviD--~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~-~~~~~l~~~Ga 296 (493) =+|||- . -..-++.++++|+|.|.+- .++. ...+.++++|+.--. .=+|=|-.|+ +....+++- . T Consensus 62 vHLMv~-----~--P~~~i~~~~~~gad~It~H~Ea~~~---~~~~~i~~Ik~~g~k-~GlAlnP~Tpi~~i~~~l~~-v 129 (227) T PRK09722 62 CHLMVT-----R--PQDYIAQLADAGADFITLHPETING---QAFRLIDEIRRAGMK-VGLVLNPETPVEAIKYYIHL-A 129 (227) T ss_pred EEEEEC-----C--HHHHHHHHHHCCCCEEEECHHHCCC---CHHHHHHHHHHCCCC-EEEEECCCCCHHHHHHHHHH-C T ss_conf 899965-----8--8888999985499899956565056---599999999986997-22333899986688766743-7 Q ss_pred CEEE-EEECCCCCCCCCHHCCCC-CC-CCHHHHHHHHHHHCC--CCEEEEECCCCCHHHHEEEEEECCCEEEEC-CHHHC Q ss_conf 5899-610278766531000157-43-100378899986226--981997265437776213367148789932-30101 Q gi|254780889|r 297 DIIK-VGIGPGSICTTRVVTGVG-CP-QLSAIMSVVEVAERA--GVAIVADGGIRFSGDIAKAIAAGSACVMIG-SLLAG 370 (493) Q Consensus 297 d~ik-vGig~Gs~CtTr~~~g~g-~p-q~~av~~~~~~~~~~--~~~iiadGGi~~~gdi~kAla~GA~~VM~G-~~~ag 370 (493) |.|. ..+-||- .|-- .| .+.-|.+.++...+. +..|..||||. ...+.+...+|||.+-.| |.+=+ T Consensus 130 D~VLvMsV~PGf-------~GQ~Fi~~~l~KI~~lr~~~~~~~~~~~I~VDGGI~-~~~i~~~~~aGAd~~V~GssaiF~ 201 (227) T PRK09722 130 DKVTVMTVDPGF-------AGQPFIPEMLDKIAELKAWREREGLEYEIEVDGSCN-QKTYEKLMAAGADVFIVGTSGLFN 201 (227) T ss_pred CEEEEEEECCCC-------CCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEECCCC-HHHHHHHHHCCCCEEEECCHHHCC T ss_conf 989999888999-------876566889999999999998259982699989888-999999998699999977489748 Q ss_pred CCC Q ss_conf 216 Q gi|254780889|r 371 TDE 373 (493) Q Consensus 371 t~E 373 (493) -.| T Consensus 202 ~~~ 204 (227) T PRK09722 202 HAE 204 (227) T ss_pred CCC T ss_conf 999 No 451 >PRK13129 consensus Probab=94.94 E-value=0.34 Score=28.02 Aligned_cols=68 Identities=22% Similarity=0.248 Sum_probs=50.3 Q ss_pred HHHHHHHHHHCCCCEEEEECCCC-------HHHHHHHHHHHHHHHCCCCCEECC-CCCCHHHHHHHHHCCCCEEEEE Q ss_conf 56789998731366699824841-------138899999999985798724514-6469999999985487589961 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVDTAHG-------HSQKVLDAVVQIKKNFPSLLVMAG-NIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD~ahG-------h~~~~~~~i~~~k~~~~~~~ii~G-Nv~t~~~~~~l~~~Gad~ikvG 302 (493) ..+|.+.+.+..-.+|-.=+..| ....+.+.++++|+.. ++|+++| .+-|+|.++.+.++|||++-|| T Consensus 160 ~~~Ri~~i~~~~~gFiY~vs~~GvTG~~~~~~~~~~~~i~~ik~~t-~~Pv~vGFGIs~~e~v~~~~~~~ADGvIVG 235 (267) T PRK13129 160 PAERMKRIAQQSRGFTYLVSVTGVTGERSQMENRVESLLQQLRQVT-SKPIAVGFGISGPEQARQVREWGADGAIVG 235 (267) T ss_pred CHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHC-CCCEEEEECCCCHHHHHHHHHCCCCEEEEC T ss_conf 6899999981689808987346656765445088999999999834-898178844799999999985499999987 No 452 >PRK08645 bifunctional homocysteine S-methyltransferase/5,10-methylenetetrahydrofolate reductase protein; Reviewed Probab=94.91 E-value=0.34 Score=27.97 Aligned_cols=141 Identities=17% Similarity=0.275 Sum_probs=70.8 Q ss_pred CCEEEEEEEECCCH-----HHHHHHHHHHCCCCEEEE-ECCCCHHHHH--HHHHHHHHHHCCCCCEECCCCCCHH----H Q ss_conf 87655566513701-----567899987313666998-2484113889--9999999985798724514646999----9 Q gi|254780889|r 220 GRLRVAAAVSVAKD-----IADRVGPLFDVNVDLVVV-DTAHGHSQKV--LDAVVQIKKNFPSLLVMAGNIATAE----G 287 (493) Q Consensus 220 grL~VgAAIg~~~d-----~~eRa~~LveaGvDvivi-D~ahGh~~~~--~~~i~~~k~~~~~~~ii~GNv~t~~----~ 287 (493) ++..+.+.+.++.+ ..+-|+.|-++|||+|.| |..-.- .++ +-+-..|++++. ++.|. -+++.| + T Consensus 323 ~~~~i~vEldpP~~~d~~~~~~~a~~l~~~Gvdai~i~D~~~a~-~R~~~~a~a~~l~~~~g-ie~i~-h~tcRDRN~ia 399 (608) T PRK08645 323 KKFTVIVELDPPKGLDTDKFLEGARALKEAGVDAITLADNSLAT-VRISNIALASLIKQEYG-IEPLV-HITCRDRNLIG 399 (608) T ss_pred CCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEECCCCCCCH-HHHCHHHHHHHHHHHHC-CCEEE-EEEECCCHHHH T ss_conf 79679999549999987899999999987699789746887303-31169999999999739-97268-86204532999 Q ss_pred HH-HHHHCCCCEEEEEECCCCCCCCCHHCCCC-CCCCHHHHHHHHH-----HHCC-------CCE-----EEEECCCCCH Q ss_conf 99-99854875899610278766531000157-4310037889998-----6226-------981-----9972654377 Q gi|254780889|r 288 AL-ALIDAGADIIKVGIGPGSICTTRVVTGVG-CPQLSAIMSVVEV-----AERA-------GVA-----IVADGGIRFS 348 (493) Q Consensus 288 ~~-~l~~~Gad~ikvGig~Gs~CtTr~~~g~g-~pq~~av~~~~~~-----~~~~-------~~~-----iiadGGi~~~ 348 (493) .+ +|+-+-+-+|+ -=.|.|..-.-+| .|+.+.|++.-.. .+.. |.| =+.=|+.-++ T Consensus 400 lqs~LlGa~~lGi~-----nvL~lTGDp~~~Gd~p~ak~Vfd~dS~~Li~~i~~ln~G~~~~G~~l~~~~~f~~g~a~np 474 (608) T PRK08645 400 LQSHLLGLSALGIR-----NILAITGDPAKVGDFPGATSVYDLNSIGLIKLIKQLNDGISYSGKPLGKKTNFSIGGAFNP 474 (608) T ss_pred HHHHHHHHHHCCCC-----EEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCC T ss_conf 99998532661986-----4899648898879999982121477899999999983787888881789997578463189 Q ss_pred ---------HHHEEEEEECCCEEEECCHH Q ss_conf ---------76213367148789932301 Q gi|254780889|r 349 ---------GDIAKAIAAGSACVMIGSLL 368 (493) Q Consensus 349 ---------gdi~kAla~GA~~VM~G~~~ 368 (493) .-+.|-+.+||+++|-=-.| T Consensus 475 ~~~~~~~~~~rl~kKi~aGA~f~~TQpv~ 503 (608) T PRK08645 475 NVRNLDKAVKRLEKKIEAGADYFITQPIY 503 (608) T ss_pred CCCCHHHHHHHHHHHHHCCCCEEEECCCC T ss_conf 99985999999999997798689855658 No 453 >PRK02227 hypothetical protein; Provisional Probab=94.90 E-value=0.2 Score=29.57 Aligned_cols=147 Identities=22% Similarity=0.224 Sum_probs=81.4 Q ss_pred ECCCHHHHHHHHHHHCCCCEEEE-ECCCCHHH-HHHHHHHHHHHHCCC---CCEECCCCC-CH----HHHHHHHHCCCCE Q ss_conf 13701567899987313666998-24841138-899999999985798---724514646-99----9999998548758 Q gi|254780889|r 229 SVAKDIADRVGPLFDVNVDLVVV-DTAHGHSQ-KVLDAVVQIKKNFPS---LLVMAGNIA-TA----EGALALIDAGADI 298 (493) Q Consensus 229 g~~~d~~eRa~~LveaGvDvivi-D~ahGh~~-~~~~~i~~~k~~~~~---~~ii~GNv~-t~----~~~~~l~~~Gad~ 298 (493) |++ ..+-|....++|+|+|=+ +-++|--- ...+.|+.++...+. +-.-.|++- .+ ..+..+...|+|. T Consensus 6 Svr--~~~EA~~a~~~gaDIIDvKnP~~GaLGa~~p~~I~~I~~~v~~~~~vSatiGD~p~~~~~i~~a~~~~a~~Gvdy 83 (239) T PRK02227 6 SVR--NLEEALEALEGGADIIDVKNPKEGSLGANFPWVIREIVAAVPGRKPVSATIGDVPYKPGTAALAALGAAATGVDY 83 (239) T ss_pred CCC--CHHHHHHHHHCCCCEEEECCCCCCCCCCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCCE T ss_conf 379--989999998579998982798888788999999999999857997179883697899379999998766139998 Q ss_pred EEEEECCCCCCCCCHHCCCCCCCC-HHHHHHHHHHHCC--CCE----EEEECCCC---CHHH-HEEEEEECCCEEEECCH Q ss_conf 996102787665310001574310-0378899986226--981----99726543---7776-21336714878993230 Q gi|254780889|r 299 IKVGIGPGSICTTRVVTGVGCPQL-SAIMSVVEVAERA--GVA----IVADGGIR---FSGD-IAKAIAAGSACVMIGSL 367 (493) Q Consensus 299 ikvGig~Gs~CtTr~~~g~g~pq~-~av~~~~~~~~~~--~~~----iiadGGi~---~~gd-i~kAla~GA~~VM~G~~ 367 (493) ||||+-++. +. .+. .....+..+.+.+ +++ ..||.+.- .+-+ +-.|..+|.+.||+ T Consensus 84 VKvGl~~~~---~~-------~~~~~~l~~v~~a~k~~~~~~~~VaV~yAD~~~~~~~~p~~i~~~a~~~g~~gvMi--- 150 (239) T PRK02227 84 VKVGLYGTK---TA-------EEAVEVMAAVVRAVKDLDPNKRVVAVLYADAGRDGSISPLDLPAIAAEAGFDGAML--- 150 (239) T ss_pred EEEECCCCC---CH-------HHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHHCCCCEEEE--- T ss_conf 999457888---67-------99999999999987511679739999983264224788677899999859989998--- Q ss_pred HHCCCCCCCCEEEECCEEEEEEECCCCHHHHH Q ss_conf 10121677644873680588873320068875 Q gi|254780889|r 368 LAGTDESPGDIFLYQGRSFKSYRGMGSVAAME 399 (493) Q Consensus 368 ~agt~Espg~~~~~~g~~~K~yrGm~S~~A~~ 399 (493) ++..++|+..=.+..+..+...- T Consensus 151 ---------DT~~Kdg~sL~d~~~~~~L~~fv 173 (239) T PRK02227 151 ---------DTAGKDGRSLFDHMDEEELAEFV 173 (239) T ss_pred ---------ECCCCCCCCHHHCCCHHHHHHHH T ss_conf ---------63678887534238999999999 No 454 >COG0352 ThiE Thiamine monophosphate synthase [Coenzyme metabolism] Probab=94.89 E-value=0.23 Score=29.18 Aligned_cols=98 Identities=23% Similarity=0.317 Sum_probs=57.2 Q ss_pred HHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECC Q ss_conf 99999857987245146469999999985487589961027876653100015743100378899986226981997265 Q gi|254780889|r 265 VVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGG 344 (493) Q Consensus 265 i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGG 344 (493) +...++.++.--+|+--+.+.|-++...+.|+|.| |-|.+--|.-..+. .|+...... .+.....+|++|=|| T Consensus 94 ~~~ar~~~~~~~iIG~S~h~~eea~~A~~~g~DYv----~~GpifpT~tK~~~-~~~G~~~l~--~~~~~~~iP~vAIGG 166 (211) T COG0352 94 LAEARELLGPGLIIGLSTHDLEEALEAEELGADYV----GLGPIFPTSTKPDA-PPLGLEGLR--EIRELVNIPVVAIGG 166 (211) T ss_pred HHHHHHHCCCCCEEEEECCCHHHHHHHHHCCCCEE----EECCCCCCCCCCCC-CCCCHHHHH--HHHHHCCCCEEEECC T ss_conf 58899741999789830499999999876399999----98886788999887-746789999--999827999899848 Q ss_pred CCCHHHHEEEEEECCCEEEECCHHHC Q ss_conf 43777621336714878993230101 Q gi|254780889|r 345 IRFSGDIAKAIAAGSACVMIGSLLAG 370 (493) Q Consensus 345 i~~~gdi~kAla~GA~~VM~G~~~ag 370 (493) |. .-.+...+.+||+.|=+-|.+.. T Consensus 167 i~-~~nv~~v~~~Ga~gVAvvsai~~ 191 (211) T COG0352 167 IN-LENVPEVLEAGADGVAVVSAITS 191 (211) T ss_pred CC-HHHHHHHHHHCCCEEEEHHHHHC T ss_conf 89-99999999829876972668660 No 455 >COG2185 Sbm Methylmalonyl-CoA mutase, C-terminal domain/subunit (cobalamin-binding) [Lipid metabolism] Probab=94.89 E-value=0.22 Score=29.29 Aligned_cols=69 Identities=26% Similarity=0.420 Sum_probs=52.5 Q ss_pred HHHHHHHHHCCCCEEEEEC-CCCHHHHHHHHHHHHHHHC-CCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCC Q ss_conf 6789998731366699824-8411388999999999857-9872451464699999999854875899610278 Q gi|254780889|r 235 ADRVGPLFDVNVDLVVVDT-AHGHSQKVLDAVVQIKKNF-PSLLVMAGNIATAEGALALIDAGADIIKVGIGPG 306 (493) Q Consensus 235 ~eRa~~LveaGvDviviD~-ahGh~~~~~~~i~~~k~~~-~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~G 306 (493) .|=+.+.++..+|+|++-+ +-+|...+-+.++.+|+.- .++.+++|.+-.++..+.|.+.|+|.+ ++|| T Consensus 53 ~e~v~aA~~~dv~vIgvSsl~g~h~~l~~~lve~lre~G~~~i~v~~GGvip~~d~~~l~~~G~~~i---f~pg 123 (143) T COG2185 53 EEAVRAAVEEDVDVIGVSSLDGGHLTLVPGLVEALREAGVEDILVVVGGVIPPGDYQELKEMGVDRI---FGPG 123 (143) T ss_pred HHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHCCCEE---ECCC T ss_conf 9999999864798899973440478999999999998197554886568668136799998186654---6899 No 456 >COG2022 ThiG Uncharacterized enzyme of thiazole biosynthesis [Nucleotide transport and metabolism] Probab=94.87 E-value=0.064 Score=33.18 Aligned_cols=124 Identities=19% Similarity=0.334 Sum_probs=71.7 Q ss_pred HHHHHH-H-CCCCEEEEECCCCHHHHH----HHHHHHHHHHCCCCC-EECCCCCCHHHHHHHHHCCCCEEEE---EECCC Q ss_conf 899987-3-136669982484113889----999999998579872-4514646999999998548758996---10278 Q gi|254780889|r 237 RVGPLF-D-VNVDLVVVDTAHGHSQKV----LDAVVQIKKNFPSLL-VMAGNIATAEGALALIDAGADIIKV---GIGPG 306 (493) Q Consensus 237 Ra~~Lv-e-aGvDviviD~ahGh~~~~----~~~i~~~k~~~~~~~-ii~GNv~t~~~~~~l~~~Gad~ikv---Gig~G 306 (493) |...|. | .+-|.|-+.+ +|+++.. +++++.-+....+=+ |.-=--.++--++.|.++|+.+|-= -||+| T Consensus 87 ~tArlARE~~~t~wiKlEV-i~d~~tLlPD~~etl~Aae~Lv~eGF~VlPY~~dD~v~arrLee~GcaavMPl~aPIGSg 165 (262) T COG2022 87 RTARLAREALGTNWIKLEV-IGDEKTLLPDPIETLKAAEQLVKEGFVVLPYTTDDPVLARRLEEAGCAAVMPLGAPIGSG 165 (262) T ss_pred HHHHHHHHHCCCCEEEEEE-ECCCCCCCCCHHHHHHHHHHHHHCCCEEEECCCCCHHHHHHHHHCCCEEECCCCCCCCCC T ss_conf 9999999970698489999-368765488757899999999867988850368878999999864966863356656678 Q ss_pred CCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCC Q ss_conf 76653100015743100378899986226981997265437776213367148789932301012167 Q gi|254780889|r 307 SICTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDES 374 (493) Q Consensus 307 s~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Es 374 (493) -|.-.-..+ .-...+.+||||-|-||-.+.|.+-|+-+|+|+|++-+..|+...- T Consensus 166 ----------~G~~n~~~l---~iiie~a~VPviVDAGiG~pSdAa~aMElG~DaVL~NTAiA~A~DP 220 (262) T COG2022 166 ----------LGLQNPYNL---EIIIEEADVPVIVDAGIGTPSDAAQAMELGADAVLLNTAIARAKDP 220 (262) T ss_pred ----------CCCCCHHHH---HHHHHHCCCCEEEECCCCCHHHHHHHHHHCCCEEEHHHHHHCCCCH T ss_conf ----------675788999---9999738998898679897668899986055432325676603786 No 457 >cd00331 IGPS Indole-3-glycerol phosphate synthase (IGPS); an enzyme in the tryptophan biosynthetic pathway, catalyzing the ring closure reaction of 1-(o-carboxyphenylamino)-1-deoxyribulose-5-phosphate (CdRP) to indole-3-glycerol phosphate (IGP), accompanied by the release of carbon dioxide and water. IGPS is active as a separate monomer in most organisms, but is also found fused to other enzymes as part of a bifunctional or multifunctional enzyme involved in tryptophan biosynthesis. Probab=94.85 E-value=0.25 Score=28.89 Aligned_cols=123 Identities=22% Similarity=0.324 Sum_probs=69.1 Q ss_pred HHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCC Q ss_conf 87313666998248411388999999999857987245146469999999985487589961027876653100015743 Q gi|254780889|r 241 LFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCP 320 (493) Q Consensus 241 LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~p 320 (493) -...|+|++.+=.+---.....+.++..++ + .+++++ -|.+.+-.+...++|++.| ||-+-..-|-..- T Consensus 90 a~~~GAdaiLLI~~~L~~~~l~~l~~~a~~-l-gl~~Lv-Evh~~~El~~a~~~~a~iI--GINnRdL~t~~vd------ 158 (217) T cd00331 90 ARAAGADAVLLIVAALDDEQLKELYELARE-L-GMEVLV-EVHDEEELERALALGAKII--GINNRDLKTFEVD------ 158 (217) T ss_pred HHHCCCCCHHHHHHHCCHHHHHHHHHHHHH-H-CCEEEE-EECCHHHHHHHHHCCCCEE--EECCCCCHHCEEC------ T ss_conf 998199878798885499999999999999-4-982798-8589999999995799878--4216771230347------ Q ss_pred CCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCCC Q ss_conf 100378899986226981997265437776213367148789932301012167764 Q gi|254780889|r 321 QLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPGD 377 (493) Q Consensus 321 q~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg~ 377 (493) +....+..... ..++.+||-+||++..|+.+...+|+|+|.+|.-|-.+.. |++ T Consensus 159 -~~~~~~L~~~i-p~~~~~IsESGI~~~~di~~l~~~G~d~~LIG~sLm~~~~-p~~ 212 (217) T cd00331 159 -LNTTERLAPLI-PKDVILVSESGISTPEDVKRLAEAGADAVLIGESLMRAPD-PGA 212 (217) T ss_pred -HHHHHHHHHHC-CCCCEEEECCCCCCHHHHHHHHHCCCCEEEECHHHHCCCC-HHH T ss_conf -89999999648-9898899827999999999999879999998978867999-899 No 458 >TIGR03151 enACPred_II putative enoyl-(acyl-carrier-protein) reductase II. This oxidoreductase of the 2-nitropropane dioxygenase family (pfam03060) is commonly found in apparent operons with genes involved in fatty acid biosynthesis. Furthermore, this genomic context generally includes the fabG 3-oxoacyl-[ACP] reductase and lacks the fabI enoyl-[ACP] reductase. Probab=94.85 E-value=0.16 Score=30.34 Aligned_cols=66 Identities=26% Similarity=0.395 Sum_probs=49.9 Q ss_pred HHHHHHHHCCCCEEEEEC--CCCHH--HHHHHHHHHHHHHCCCCCEE-CCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 789998731366699824--84113--88999999999857987245-146469999999985487589961 Q gi|254780889|r 236 DRVGPLFDVNVDLVVVDT--AHGHS--QKVLDAVVQIKKNFPSLLVM-AGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 236 eRa~~LveaGvDviviD~--ahGh~--~~~~~~i~~~k~~~~~~~ii-~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) +-++.+.++|+|+|++.- |-||. ......+..+.+.+ ++||| +|.++|..+....+..|||++-+| T Consensus 120 ~~A~~a~~~G~D~iV~qG~EAGGH~G~~~~~~Lvp~v~d~~-~iPViAAGGI~dgr~iaaalalGA~gV~mG 190 (307) T TIGR03151 120 ALAKRMEKAGADAVIAEGMESGGHIGELTTMALVPQVVDAV-SIPVIAAGGIADGRGMAAAFALGAEAVQMG 190 (307) T ss_pred HHHHHHHHCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHC-CCCEEEECCCCCCHHHHHHHHCCCCEEEEC T ss_conf 99999996499999974554468778643787799998504-686576411336588999997188478744 No 459 >TIGR01496 DHPS dihydropteroate synthase; InterPro: IPR006390 This domain is present in sequences representing dihydropteroate synthase, the enzyme that catalyzes the second to last step in folic acid biosynthesis. Dihydropteroate synthase (2.5.1.15 from EC) (DHPS) catalyses the condensation of 6-hydroxymethyl-7,8-dihydropteridine pyrophosphate to para-aminobenzoic acid to form 7,8-dihydropteroate. This is the second step in the three-step pathway leading from 6-hydroxymethyl-7,8-dihydropterin to 7,8-dihydrofolate. DHPS is the target of sulphonamides, which are substrate analogues that compete with para-aminobenzoic acid. Bacterial DHPS (gene sul or folP) is a protein of about 275 to 315 amino acid residues that is either chromosomally encoded or found on various antibiotic resistance plasmids. In the lower eukaryote Pneumocystis carinii, DHPS is the C-terminal domain of a multifunctional folate synthesis enzyme (gene fas) .; GO: 0004156 dihydropteroate synthase activity, 0009396 folic acid and derivative biosynthetic process. Probab=94.81 E-value=0.15 Score=30.54 Aligned_cols=68 Identities=22% Similarity=0.414 Sum_probs=43.0 Q ss_pred HHHHHHHHHHCCCC-EEEEECCCC--------HHHHHHHHHHHHHHHCCCCCEE------------CC-CCC-------C Q ss_conf 56789998731366-699824841--------1388999999999857987245------------14-646-------9 Q gi|254780889|r 234 IADRVGPLFDVNVD-LVVVDTAHG--------HSQKVLDAVVQIKKNFPSLLVM------------AG-NIA-------T 284 (493) Q Consensus 234 ~~eRa~~LveaGvD-viviD~ahG--------h~~~~~~~i~~~k~~~~~~~ii------------~G-Nv~-------t 284 (493) ..+|++.|.++|++ =|++|--.| ||.-.+..++.++..+ ..||. .| |.- | T Consensus 157 l~~~~~~~~~~Gv~~~I~LDPG~GF~K~~t~~~Nl~ll~~l~~f~~~l-g~PlL~G~SRK~fiG~~~gg~~~~~~R~~Gt 235 (268) T TIGR01496 157 LEARAEELLAAGVAERIILDPGIGFGKSDTVEHNLELLKRLEEFKAVL-GYPLLVGVSRKSFIGALLGGETPPEERLEGT 235 (268) T ss_pred HHHHHHHHHHCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHH-CCCCEEECCCHHHHHHHHCCCCCHHHHHHHH T ss_conf 999999999758866578717757788888767899998689999985-7702120110468886516788975642688 Q ss_pred HHHHHHHHHCCCCEEEEE Q ss_conf 999999985487589961 Q gi|254780889|r 285 AEGALALIDAGADIIKVG 302 (493) Q Consensus 285 ~~~~~~l~~~Gad~ikvG 302 (493) .-...+++..||+++||= T Consensus 236 ~a~~~~a~~~Ga~ivRVH 253 (268) T TIGR01496 236 LAASAYAVQKGADIVRVH 253 (268) T ss_pred HHHHHHHHHCCCCEEEEC T ss_conf 999999986798679853 No 460 >PRK07565 dihydroorotate dehydrogenase 2; Reviewed Probab=94.77 E-value=0.13 Score=30.90 Aligned_cols=69 Identities=20% Similarity=0.241 Sum_probs=48.8 Q ss_pred HHHHHHHHHHHCCCCEEE---------EECCC------C-HH-----HHHHHHHHHHHHHCCCCCEEC-CCCCCHHHHHH Q ss_conf 156789998731366699---------82484------1-13-----889999999998579872451-46469999999 Q gi|254780889|r 233 DIADRVGPLFDVNVDLVV---------VDTAH------G-HS-----QKVLDAVVQIKKNFPSLLVMA-GNIATAEGALA 290 (493) Q Consensus 233 d~~eRa~~LveaGvDviv---------iD~ah------G-h~-----~~~~~~i~~~k~~~~~~~ii~-GNv~t~~~~~~ 290 (493) |-.+-|+++.++|+|.|+ ||+-. . .+ -..+..+..+++.. +++||+ |.|.|.+-+.. T Consensus 178 di~~iA~aa~~~Gadgv~~iNT~~~~~Id~e~~~~~~~~~lSgp~~~~~alr~v~~v~~~~-~ipIiG~GGI~sg~DaiE 256 (333) T PRK07565 178 NLANMAKRLDAAGADGLVLFNRFYQPDIDLETLEVVPGLVLSTPAELRLPLRWIAILSGRV-GADLAATTGVHDAEDVIK 256 (333) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHHHHHHCCC-CCCEEEECCCCCHHHHHH T ss_conf 0999999999749988998436665633155443736866677431207889999996046-989888889598999999 Q ss_pred HHHCCCCEEEEE Q ss_conf 985487589961 Q gi|254780889|r 291 LIDAGADIIKVG 302 (493) Q Consensus 291 l~~~Gad~ikvG 302 (493) .+-|||++|-|| T Consensus 257 ~ilAGAsaVQv~ 268 (333) T PRK07565 257 MLLAGADVVMIA 268 (333) T ss_pred HHHCCCCHHEEE T ss_conf 998098863362 No 461 >pfam01729 QRPTase_C Quinolinate phosphoribosyl transferase, C-terminal domain. Quinolinate phosphoribosyl transferase (QPRTase) or nicotinate-nucleotide pyrophosphorylase EC:2.4.2.19 is involved in the de novo synthesis of NAD in both prokaryotes and eukaryotes. It catalyses the reaction of quinolinic acid with 5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence of Mg2+ to give rise to nicotinic acid mononucleotide (NaMN), pyrophosphate and carbon dioxide. The QA substrate is bound between the C-terminal domain of one subunit, and the N-terminal domain of the other. The C-terminal domain has a 7 beta-stranded TIM barrel-like fold. Probab=94.72 E-value=0.29 Score=28.54 Aligned_cols=67 Identities=22% Similarity=0.452 Sum_probs=51.9 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEEC-CCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 15678999873136669982484113889999999998579872451-46469999999985487589961 Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMA-GNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 233 d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~-GNv~t~~~~~~l~~~Gad~ikvG 302 (493) +..+-+...+++|+|.|.+|--. .+.+.+.++.+++..|.+.+-+ ||+ |.+-...+...|+|.|=+| T Consensus 88 ~tl~e~~~a~~~~~d~I~LDn~s--pe~l~~~v~~l~~~~~~v~iEaSGgI-~~~ni~~yA~tGvD~IS~g 155 (169) T pfam01729 88 ENLEELEEALEAGADIIMLDNFS--PEEVREAVEELDERAGRVLLEVSGGI-TLDNVLEYAKTGVDVISVG 155 (169) T ss_pred EHHHHHHHHHHCCCCEEEECCCC--HHHHHHHHHHHHHHCCCEEEEECCCC-CHHHHHHHHHCCCCEEECC T ss_conf 01998999984699899977999--99999999999975896799961899-9999999997699999858 No 462 >KOG0474 consensus Probab=94.70 E-value=0.045 Score=34.32 Aligned_cols=57 Identities=21% Similarity=0.303 Sum_probs=45.1 Q ss_pred CCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCC-----CCEEEEEEECCCHHC Q ss_conf 672440026786325730358999999986553306457479-----978688980210001 Q gi|254780889|r 152 QQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDD-----GCCIGLITVKDIERS 208 (493) Q Consensus 152 ~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~-----g~LvGIIT~~DIlk~ 208 (493) +..++|+|++++++++.-+......+++...+.+-.||||+. ++|.|+|-|+-+++. T Consensus 581 ~L~a~ev~~~pvi~l~~~ekV~~Iv~vLk~t~HngFPVvd~~~~~~~~~l~GlILRshl~vl 642 (762) T KOG0474 581 NLTAGEVMSKPVICLNRVEKVAVIVDVLKSTNHNGFPVVDEPPSNEAGRLHGLILRSHLLVL 642 (762) T ss_pred HHHHHHHCCCCEEEEECHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 31176642688588732210999999987347688752357887610236688899999999 No 463 >pfam01884 PcrB PcrB family. This family contains proteins that are related to PcrB. The function of these proteins is unknown. Probab=94.64 E-value=0.4 Score=27.51 Aligned_cols=65 Identities=23% Similarity=0.381 Sum_probs=37.0 Q ss_pred HHHHHHH--CCCCEEEEECCCCHHHHH-HHHHHHHHHHCCCCC-EECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 8999873--136669982484113889-999999998579872-45146469999999985487589961 Q gi|254780889|r 237 RVGPLFD--VNVDLVVVDTAHGHSQKV-LDAVVQIKKNFPSLL-VMAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 237 Ra~~Lve--aGvDviviD~ahGh~~~~-~~~i~~~k~~~~~~~-ii~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) ...+|.+ -|--++-++.-.|-...+ -++++ .++...+++ +++|.+-++|.++.++++|||.|-|| T Consensus 143 a~~alA~~~~g~~~iYLEaGSGa~~~vp~~vi~-~~~~~~~~~LivGGGIrs~e~A~~~~~aGAD~IVvG 211 (231) T pfam01884 143 AYYALAAKLLGMRIFYLEAGSGAPGPVPEEVIA-VKKVLDDARLIVGGGIKSGEKAKEMARAGADVIVTG 211 (231) T ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCCCHHHHH-HHHCCCCCCEEEECCCCCHHHHHHHHHCCCCEEEEC T ss_conf 999999997098189997589999998999999-996468976899699799999999997799999979 No 464 >pfam02581 TMP-TENI Thiamine monophosphate synthase/TENI. Thiamine monophosphate synthase (TMP) (EC:2.5.1.3) catalyses the substitution of the pyrophosphate of 2-methyl-4-amino-5- hydroxymethylpyrimidine pyrophosphate by 4-methyl-5- (beta-hydroxyethyl)thiazole phosphate to yield thiamine phosphate. This Pfam family also includes the regulatory protein TENI. Probab=94.62 E-value=0.23 Score=29.15 Aligned_cols=113 Identities=15% Similarity=0.245 Sum_probs=70.1 Q ss_pred CHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHH---HHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCC Q ss_conf 01567899987313666998248411388999999999---857987245146469999999985487589961027876 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIK---KNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSI 308 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k---~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~ 308 (493) ++..++++.++++|++++.+..-+-......+..+.++ +.+....+|-+++ +-+ .+.|+|+|-+|-..- T Consensus 12 ~~~~~~~~~~l~~Gv~~vqlR~K~~~~~~~~~~a~~l~~i~~~~~~~liin~~~---~lA---~~~~~dGvHL~~~d~-- 83 (180) T pfam02581 12 EDLLEVVEEALKGGVTIVQLREKDLDDRELLELAERLRALCRKYGVPLIINDRV---DLA---LAVGADGVHLGQDDL-- 83 (180) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHCCEEEECCHH---HHH---HHHCCCEEECCCCCC-- T ss_conf 479999999998699999987898799999999999999999829979987859---999---972979894486766-- Q ss_pred CCCCHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHC Q ss_conf 65310001574310037889998622698199726543777621336714878993230101 Q gi|254780889|r 309 CTTRVVTGVGCPQLSAIMSVVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAG 370 (493) Q Consensus 309 CtTr~~~g~g~pq~~av~~~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~ag 370 (493) .+.+++. ..+--.|--..+++.-++.+|...|||++.+|..|.- T Consensus 84 ---------------~~~~~r~---~~~~~~iiG~S~h~~~e~~~a~~~gaDYi~~gpvf~T 127 (180) T pfam02581 84 ---------------PVAEARE---LLGPDKIIGVSTHTLEEALEAEALGADYIGFGPVFPT 127 (180) T ss_pred ---------------CHHHHHH---HCCCCCEEEECCCCHHHHHHHHHCCCCCEEECCCCCC T ss_conf ---------------7999998---6497768974478899999998719980887476777 No 465 >PRK00208 thiG thiazole synthase; Reviewed Probab=94.61 E-value=0.4 Score=27.46 Aligned_cols=92 Identities=16% Similarity=0.146 Sum_probs=66.6 Q ss_pred CCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCC Q ss_conf 73035899999998655330645747997868898021000101742232656876555665137015678999873136 Q gi|254780889|r 167 KKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNV 246 (493) Q Consensus 167 ~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGv 246 (493) |.....-+|.+.+.+.+...+|-.++ |.--++.|.++|+ T Consensus 108 PD~~etl~Aae~Lv~eGF~VlpY~~~-----------------------------------------D~v~akrLe~~Gc 146 (256) T PRK00208 108 PDPIETLKAAEILVKEGFVVLPYCTD-----------------------------------------DPVLAKRLEEAGC 146 (256) T ss_pred CCHHHHHHHHHHHHHCCCEEEEECCC-----------------------------------------CHHHHHHHHHCCC T ss_conf 88689999999999889989786788-----------------------------------------9899999997495 Q ss_pred CEEE-----EECCCCHHHHHHHHHHHHHHHCCCCCEEC-CCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 6699-----82484113889999999998579872451-46469999999985487589961 Q gi|254780889|r 247 DLVV-----VDTAHGHSQKVLDAVVQIKKNFPSLLVMA-GNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 247 Dviv-----iD~ahGh~~~~~~~i~~~k~~~~~~~ii~-GNv~t~~~~~~l~~~Gad~ikvG 302 (493) .++- |-+-.| ..-...++.|++.+ ++|||+ ..+.++..+..-++.|+|+|.+. T Consensus 147 ~avMPlgsPIGSg~G--l~n~~~l~~i~e~~-~vPvIVDAGiG~pS~Aa~AMElG~DaVL~N 205 (256) T PRK00208 147 AAVMPLGAPIGSGLG--LLNPYNLRIIIEQA-DVPVIVDAGIGTPSDAAQAMELGADAVLLN 205 (256) T ss_pred EEEEECCCCCCCCCC--CCCHHHHHHHHHHC-CCCEEEECCCCCHHHHHHHHHCCCCEEEHH T ss_conf 345204564347988--79999999999867-998898578897667899986255432355 No 466 >COG0502 BioB Biotin synthase and related enzymes [Coenzyme metabolism] Probab=94.61 E-value=0.4 Score=27.46 Aligned_cols=181 Identities=19% Similarity=0.208 Sum_probs=101.4 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEE Q ss_conf 35899999998655330645747997868898021000101742232656876555665137015678999873136669 Q gi|254780889|r 170 VNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLV 249 (493) Q Consensus 170 ~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDvi 249 (493) ..+-+..+.+.+.+..+..++-.-.. .| +|..+....-.+.+...| |.+.+..|.- +.|-++.|.+||+|.. T Consensus 87 eeIle~Ak~ak~~Ga~r~c~~aagr~-~~----~~~~~i~~~v~~Vk~~~~-le~c~slG~l--~~eq~~~L~~aGvd~y 158 (335) T COG0502 87 EEILEAAKKAKAAGATRFCMGAAGRG-PG----RDMEEVVEAIKAVKEELG-LEVCASLGML--TEEQAEKLADAGVDRY 158 (335) T ss_pred HHHHHHHHHHHHCCCCEEEEEEECCC-CC----CCHHHHHHHHHHHHHHCC-CHHHHCCCCC--CHHHHHHHHHCCHHHE T ss_conf 99999999999749950799873167-77----448999999999998469-2864025879--9999999997181133 Q ss_pred E--EEC----------CCCHHHHHHHHHHHHHHHCCCCCEEC----CCCCCHHH----HHHHHHCC-CCEEEEEECCCCC Q ss_conf 9--824----------84113889999999998579872451----46469999----99998548-7589961027876 Q gi|254780889|r 250 V--VDT----------AHGHSQKVLDAVVQIKKNFPSLLVMA----GNIATAEG----ALALIDAG-ADIIKVGIGPGSI 308 (493) Q Consensus 250 v--iD~----------ahGh~~~~~~~i~~~k~~~~~~~ii~----GNv~t~~~----~~~l~~~G-ad~ikvGig~Gs~ 308 (493) . +|+ .|.|.+ -+++++.+|+. .+.+.. |==.|.+- +..|.+-. .|-|-++.= T Consensus 159 nhNLeTs~~~y~~I~tt~t~ed-R~~tl~~vk~~--Gi~vcsGgI~GlGEs~eDri~~l~~L~~l~~pdsVPIn~l---- 231 (335) T COG0502 159 NHNLETSPEFYENIITTRTYED-RLNTLENVREA--GIEVCSGGIVGLGETVEDRAELLLELANLPTPDSVPINFL---- 231 (335) T ss_pred ECCCCCCHHHHCCCCCCCCHHH-HHHHHHHHHHC--CCCCCCCEEECCCCCHHHHHHHHHHHHHCCCCCEEEEEEE---- T ss_conf 0355569788756578988889-99999999980--9850451276189988899999999971899885423210---- Q ss_pred CCCCHHCCCC-CCCCHHHHHHHHHHH-CC---CCEEEEECCCCC-HHH-HEEEEEECCCEEEECC Q ss_conf 6531000157-431003788999862-26---981997265437-776-2133671487899323 Q gi|254780889|r 309 CTTRVVTGVG-CPQLSAIMSVVEVAE-RA---GVAIVADGGIRF-SGD-IAKAIAAGSACVMIGS 366 (493) Q Consensus 309 CtTr~~~g~g-~pq~~av~~~~~~~~-~~---~~~iiadGGi~~-~gd-i~kAla~GA~~VM~G~ 366 (493) .-..-|-.+ .|.++......-+|- +. ..-|.+-||.-. +.+ ...+|.+||+++|.|- T Consensus 232 -~P~~GTPle~~~~~~~~e~lk~IA~~Ri~~P~~~Ir~s~gr~~~~~~~q~~~~~aGansi~~g~ 295 (335) T COG0502 232 -NPIPGTPLENAKPLDPFEFLKTIAVARIIMPKSMIRLSAGRETMLPELQALAFMAGANSIFVGD 295 (335) T ss_pred -CCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEECCCCCCCCCHHHHHHHHHHCCCEEEECC T ss_conf -3799986665899998999999999999778645672588352258889999984566356524 No 467 >PRK13113 consensus Probab=94.60 E-value=0.41 Score=27.43 Aligned_cols=180 Identities=11% Similarity=0.176 Sum_probs=103.3 Q ss_pred CCCCCCCCC---CCCCCCCCCCCCHHHHHHHHHHCCC--CEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEE Q ss_conf 467244002---6786325730358999999986553--30645747997868898021000101742232656876555 Q gi|254780889|r 151 AQQAVGELM---TRNLITVKKTVNLENAKALLHQHRI--EKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVA 225 (493) Q Consensus 151 ~~~~V~eiM---t~~litv~~~~~l~eA~~iM~~~ki--~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~Vg 225 (493) ++.|+.|-. ...-..+..+.++.+.++++.+-+. ...|++ +.|....+-+ T Consensus 54 FSDP~ADGPvIq~A~~rAL~~G~~~~~~~~~v~~~r~~~~~~Piv--------lM~Y~N~i~~----------------- 108 (263) T PRK13113 54 FTDPMADGPTIQLAGQRALEGGMTLDRTLDMVRAFRKEDDTTPIV--------MMGYYNPIYS----------------- 108 (263) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCEE--------EEECHHHHHH----------------- T ss_conf 888776589999999999977983889999999751238998889--------9831368988----------------- Q ss_pred EEEECCCHHHHHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHHH-CCCCCEECCCCCCHHHHHHHHHCCCCEEEEEE Q ss_conf 6651370156789998731366-699824841138899999999985-79872451464699999999854875899610 Q gi|254780889|r 226 AAVSVAKDIADRVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKKN-FPSLLVMAGNIATAEGALALIDAGADIIKVGI 303 (493) Q Consensus 226 AAIg~~~d~~eRa~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~~-~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGi 303 (493) ....+-+..+.++|+| +|+.|...-.+. +..+..++. +.-+++++.+ .+.+-.+.+.+..-..|-+ + T Consensus 109 ------~G~e~F~~~~~~~GvdGvIipDLP~eE~~---~~~~~~~~~~l~~I~lvaPt-t~~~Ri~~i~~~a~gFiY~-V 177 (263) T PRK13113 109 ------RGVDRFLAEAKEAGIDGLIVVDLPPEEDS---ELCLPAQAAGLNFIRLATPT-TDDRRLPKVLQNTSGFVYY-V 177 (263) T ss_pred ------HHHHHHHHHHHHCCCCEEEECCCCHHHHH---HHHHHHHHCCCCEEEEECCC-CCHHHHHHHHHCCCCCEEE-E T ss_conf ------56999999987779436971799978889---99999997798679994799-9999999998338984899-8 Q ss_pred CCCCCCCCCHHCCCCCCCCHHHHH-HHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 278766531000157431003788-9998622698199726543777621336714878993230101216 Q gi|254780889|r 304 GPGSICTTRVVTGVGCPQLSAIMS-VVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 304 g~Gs~CtTr~~~g~g~pq~~av~~-~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) +.+-+||..-+....+.+ +.+..+..++|+..-=||+++-|+ +++..+||.|-+||.|-..-| T Consensus 178 ------s~~GvTG~~~~~~~~~~~~i~~ik~~t~~Pv~vGFGI~~~e~~-~~~~~~ADGvIVGSa~v~~i~ 241 (263) T PRK13113 178 ------SITGITGAAAAQAADVAPEVARIKAATDLPVIVGFGITTPEAA-QAIAGVADGCVVGSAIVKLIG 241 (263) T ss_pred ------ECCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHH-HHHHCCCCEEEECHHHHHHHH T ss_conf ------3455668775543779999999985479988998378998999-999733999998689999998 No 468 >TIGR03128 RuMP_HxlA 3-hexulose-6-phosphate synthase. at the cost of also yielding formaldehyde. These latter species tend usually have a formaldehyde-activating enzyme to attach formaldehyde to the C1 carrier tetrahydromethanopterin. In these species, the enzyme is viewed as a lyase rather than a synthase and is called D-arabino 3-hexulose 6-phosphate formaldehyde lyase. Note that there is some overlap in specificity with the Escherichia coli enzyme 3-keto-L-gulonate 6-phosphate decarboxylase. Probab=94.58 E-value=0.41 Score=27.40 Aligned_cols=120 Identities=23% Similarity=0.317 Sum_probs=61.0 Q ss_pred HHHHHHHCCCCEEEEECCCCH--HHHHHHHHHHHHHHCCCCCEEC--CCCCC-HHHHHHHHHCCCCEEEEEECCCCCCCC Q ss_conf 899987313666998248411--3889999999998579872451--46469-999999985487589961027876653 Q gi|254780889|r 237 RVGPLFDVNVDLVVVDTAHGH--SQKVLDAVVQIKKNFPSLLVMA--GNIAT-AEGALALIDAGADIIKVGIGPGSICTT 311 (493) Q Consensus 237 Ra~~LveaGvDviviD~ahGh--~~~~~~~i~~~k~~~~~~~ii~--GNv~t-~~~~~~l~~~Gad~ikvGig~Gs~CtT 311 (493) -++...++|+|.+.+ ||. ...+...++..++ + ...+++ =|+.. .+-++.+.+.|+|.+-+-.|.. T Consensus 68 ~~~~~~~~Gad~itV---h~~~~~~ti~~a~~~a~~-~-~~~v~vdl~~~~~~~~~a~~~~~~g~d~v~~h~g~d----- 137 (206) T TIGR03128 68 EAEQAFAAGADIVTV---LGVADDATIKGAVKAAKK-H-GKEVQVDLINVKDKVKRAKELKELGADYIGVHTGLD----- 137 (206) T ss_pred HHHHHHHCCCCEEEE---ECCCCHHHHHHHHHHHHH-C-CCEEEEEECCCCCHHHHHHHHHHCCCCEEEEECCCC----- T ss_conf 999999728989999---434897999999999997-3-997999974789889999999975898899502500----- Q ss_pred CHHCCCCCCCCHHHHHHHHHHHCCC-CEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCC Q ss_conf 1000157431003788999862269-819972654377762133671487899323010121 Q gi|254780889|r 312 RVVTGVGCPQLSAIMSVVEVAERAG-VAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTD 372 (493) Q Consensus 312 r~~~g~g~pq~~av~~~~~~~~~~~-~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~ 372 (493) ....+. +...+......... ..+..+||++- ....+|+.+|||.+.+|+...+.+ T Consensus 138 --~~~~~~---~~~~~~~~~~~~~~~~~i~v~gGi~~-~t~~~ai~~Gad~vVVGR~It~A~ 193 (206) T TIGR03128 138 --EQAKGQ---NPFEDLQTILKLVKEARVAVAGGINL-DTIPDVIKLGPDIVIVGGAITKAA 193 (206) T ss_pred --HHHCCC---CCHHHHHHHHHCCCCCCEEECCCCCC-CCHHHHHHCCCCEEEECCCCCCCC T ss_conf --443267---98899999986257873636798683-569999866999999896124799 No 469 >cd02810 DHOD_DHPD_FMN Dihydroorotate dehydrogenase (DHOD) and Dihydropyrimidine dehydrogenase (DHPD) FMN-binding domain. DHOD catalyzes the oxidation of (S)-dihydroorotate to orotate. This is the fourth step and the only redox reaction in the de novo biosynthesis of UMP, the precursor of all pyrimidine nucleotides. DHOD requires FMN as co-factor. DHOD divides into class 1 and class 2 based on their amino acid sequences and cellular location. Members of class 1 are cytosolic enzymes and multimers while class 2 enzymes are membrane associated and monomeric. The class 1 enzymes can be further divided into subtypes 1A and 1B which are homodimers and heterotetrameric proteins, respectively. DHPD catalyzes the first step in pyrimidine degradation: the NADPH-dependent reduction of uracil and thymine to the corresponding 5,6-dihydropyrimidines. DHPD contains two FAD, two FMN and eight [4Fe-4S] clusters, arranged in two electron transfer chains that pass its homodimeric interface twice. Two of Probab=94.57 E-value=0.082 Score=32.41 Aligned_cols=69 Identities=23% Similarity=0.357 Sum_probs=49.4 Q ss_pred HHHHHHHHHHHCCCCEEEE-ECC---------------C---CHH-----HHHHHHHHHHHHHCC-CCCEEC-CCCCCHH Q ss_conf 1567899987313666998-248---------------4---113-----889999999998579-872451-4646999 Q gi|254780889|r 233 DIADRVGPLFDVNVDLVVV-DTA---------------H---GHS-----QKVLDAVVQIKKNFP-SLLVMA-GNIATAE 286 (493) Q Consensus 233 d~~eRa~~LveaGvDvivi-D~a---------------h---Gh~-----~~~~~~i~~~k~~~~-~~~ii~-GNv~t~~ 286 (493) +..+-++++.++|+|.+++ .+- + |.| ...++.++++++..+ +++||+ |.|.|++ T Consensus 177 ~~~~ia~~~~~~ga~gv~~~Nt~~~~~~~~~~~~~~~~~~~gGlSG~~i~~~al~~v~~~~~~~~~~i~Iig~GGI~~~~ 256 (289) T cd02810 177 DIVELAKAAERAGADGLTAINTISGRVVDLKTVGPGPKRGTGGLSGAPIRPLALRWVARLAARLQLDIPIIGVGGIDSGE 256 (289) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCHH T ss_conf 89999999997599689996787765554444554456776523662778899999999999749996099989939999 Q ss_pred HHHHHHHCCCCEEEE Q ss_conf 999998548758996 Q gi|254780889|r 287 GALALIDAGADIIKV 301 (493) Q Consensus 287 ~~~~l~~~Gad~ikv 301 (493) .+.+.+.+|||++-+ T Consensus 257 da~e~i~aGA~~Vqv 271 (289) T cd02810 257 DVLEMLMAGASAVQV 271 (289) T ss_pred HHHHHHHCCCCHHHH T ss_conf 999999849979999 No 470 >pfam00977 His_biosynth Histidine biosynthesis protein. Proteins involved in steps 4 and 6 of the histidine biosynthesis pathway are contained in this family. Histidine is formed by several complex and distinct biochemical reactions catalysed by eight enzymes. The enzymes in this Pfam entry are called His6 and His7 in eukaryotes and HisA and HisF in prokaryotes. The structure of HisA is known to be a TIM barrel fold. In some archaeal HisA proteins the TIM barrel is composed of two tandem repeats of a half barrel. This family belong to the common phosphate binding site TIM barrel family. Probab=94.52 E-value=0.42 Score=27.32 Aligned_cols=69 Identities=26% Similarity=0.289 Sum_probs=44.3 Q ss_pred CHHHHHHHHHHHCCCC-EEEEEC--CCCHHHHHHHHHHHHHHHCCCCC-EECCCCCCHHHHHHHHHCCCCEEEE Q ss_conf 0156789998731366-699824--84113889999999998579872-4514646999999998548758996 Q gi|254780889|r 232 KDIADRVGPLFDVNVD-LVVVDT--AHGHSQKVLDAVVQIKKNFPSLL-VMAGNIATAEGALALIDAGADIIKV 301 (493) Q Consensus 232 ~d~~eRa~~LveaGvD-viviD~--ahGh~~~~~~~i~~~k~~~~~~~-ii~GNv~t~~~~~~l~~~Gad~ikv 301 (493) .|-.+.|+.+.+.|+| +.++|. |-.....-.+.++.+.+... +| -+.|.+-+.+.++.+.++|||-|-+ T Consensus 29 gdP~~~a~~~~~~g~d~i~ivDLda~~~~~~~n~~~i~~i~~~~~-~pi~vgGGIrs~e~~~~~l~~Ga~kvvi 101 (229) T pfam00977 29 GDPVELAKRYEEEGADELHFVDLDAAKEGRPVNLDLIEEIAEEVF-IPVQVGGGIRSLEDAERLLSAGADKVII 101 (229) T ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHCC-CCEEEECCEEEHHHHHHHHHCCCCEEEE T ss_conf 999999999998799989999686630268106999999998669-8789964561189999999769989995 No 471 >TIGR01163 rpe ribulose-phosphate 3-epimerase; InterPro: IPR000056 Ribulose-phosphate 3-epimerase (5.1.3.1 from EC) (also known as pentose-5-phosphate 3-epimerase or PPE) is the enzyme that converts D-ribulose 5-phosphate into D-xylulose 5-phosphate in Calvin's reductive pentose phosphate cycle. In Alcaligenes eutrophus two copies of the gene coding for PPE are known , one is chromosomally encoded P40117 from SWISSPROT, the other one is on a plasmid Q04539 from SWISSPROT. PPE has been found in a wide range of bacteria, archaebacteria, fungi and plants. All the proteins have from 209 to 241 amino acid residues. The enzyme has a TIM barrel structure.; GO: 0004750 ribulose-phosphate 3-epimerase activity, 0005975 carbohydrate metabolic process. Probab=94.52 E-value=0.26 Score=28.84 Aligned_cols=26 Identities=27% Similarity=0.443 Sum_probs=14.1 Q ss_pred CCCEECCCCCCHHHHHHHHHCCCCEE Q ss_conf 87245146469999999985487589 Q gi|254780889|r 274 SLLVMAGNIATAEGALALIDAGADII 299 (493) Q Consensus 274 ~~~ii~GNv~t~~~~~~l~~~Gad~i 299 (493) ++.|-+=.=.+.+.+..+.+||||.+ T Consensus 169 ~~~ieVDGGv~~~ni~~~~~AGAD~~ 194 (216) T TIGR01163 169 SILIEVDGGVNEDNIAEVAEAGADIL 194 (216) T ss_pred CEEEEECCCCCHHHHHHHHHCCCCEE T ss_conf 55899717989767999997589899 No 472 >PRK01659 consensus Probab=94.50 E-value=0.31 Score=28.26 Aligned_cols=70 Identities=19% Similarity=0.340 Sum_probs=49.2 Q ss_pred CHHHHHHHHHHHCCCCEE-EEEC--CCCHHHHHHHHHHHHHHHCCCCC-EECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 015678999873136669-9824--84113889999999998579872-45146469999999985487589961 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLV-VVDT--AHGHSQKVLDAVVQIKKNFPSLL-VMAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 232 ~d~~eRa~~LveaGvDvi-viD~--ahGh~~~~~~~i~~~k~~~~~~~-ii~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) +|-.+-|+...+.|+|-| ++|. |.-.....++.|+++.+.. .+| -++|.|-|.|.++.|.++|||-|-+| T Consensus 30 gDP~~~ak~~~~~Gad~ihivDld~a~~g~~~n~~~I~~i~~~~-~ipi~vGGGIrs~e~~~~~l~~GadkViig 103 (252) T PRK01659 30 GDPVEIAAAYNEAGADELVFLDITATHEGRKTMVDVVEKVAAKV-FIPLTVGGGISSVKDMKRLLRAGADKVSIN 103 (252) T ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHC-CCCEEEECCEECHHHHHHHHHCCCCEEEEC T ss_conf 89999999999879999999946766568864899999999756-974799633200688889874488559831 No 473 >PRK00311 panB 3-methyl-2-oxobutanoate hydroxymethyltransferase; Reviewed Probab=94.48 E-value=0.43 Score=27.25 Aligned_cols=33 Identities=9% Similarity=0.277 Sum_probs=12.9 Q ss_pred HHHHHHHHH-HCCCCEEEEECCCCHHHHHHHHHHHHHH Q ss_conf 567899987-3136669982484113889999999998 Q gi|254780889|r 234 IADRVGPLF-DVNVDLVVVDTAHGHSQKVLDAVVQIKK 270 (493) Q Consensus 234 ~~eRa~~Lv-eaGvDviviD~ahGh~~~~~~~i~~~k~ 270 (493) ..+-|..|+ |+|+|++=+.- . ....+.|+.|.+ T Consensus 97 a~~nA~rl~ke~gadaVKlEg---g-~~~~~~i~~l~~ 130 (266) T PRK00311 97 ALRNAGRLMKEAGAHAVKLEG---G-EELAETIRRLVE 130 (266) T ss_pred HHHHHHHHHHHCCCCEEEECC---C-CCHHHHHHHHHH T ss_conf 999999999856999899788---8-336799999998 No 474 >PRK13586 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Provisional Probab=94.48 E-value=0.08 Score=32.50 Aligned_cols=69 Identities=23% Similarity=0.261 Sum_probs=50.9 Q ss_pred CHHHHHHHHHHHCCCCEE-EE--ECCCCHHHHHHHHHHHHHHHCCCCCE-ECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 015678999873136669-98--24841138899999999985798724-5146469999999985487589961 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLV-VV--DTAHGHSQKVLDAVVQIKKNFPSLLV-MAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 232 ~d~~eRa~~LveaGvDvi-vi--D~ahGh~~~~~~~i~~~k~~~~~~~i-i~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) .|-.+-|+.+.++|+|-| ++ |-|.|...+ .+.|+.+.+.. .+|+ ++|.+-|.+.++.|+++|||-|-+| T Consensus 29 ~dP~~~a~~~~~~Ga~~lhvvDLdaa~g~~~N-~~~I~~i~~~~-~~piqvGGGIrs~e~~~~~l~~Ga~kViig 101 (231) T PRK13586 29 GDPLKIAEELYNEGYDSIHVVDLDAAEGKGDN-EEYIKRICKIG-FSWIQVGGGIRDVEKAERLLSYDCSAIVMS 101 (231) T ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCCH-HHHHHHHHHHC-CCCEEEECCCCCHHHHHHHHHCCCCEEEEC T ss_conf 89999999999879998999967156899843-99999999745-985798567176999999997799889976 No 475 >COG5016 Pyruvate/oxaloacetate carboxyltransferase [Energy production and conversion] Probab=94.46 E-value=0.14 Score=30.68 Aligned_cols=131 Identities=23% Similarity=0.242 Sum_probs=80.8 Q ss_pred CCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCC--CCCCCEEEEEEEEC-------CCHHHHHHH Q ss_conf 035899999998655330645747997868898021000101742232--65687655566513-------701567899 Q gi|254780889|r 169 TVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATK--DSKGRLRVAAAVSV-------AKDIADRVG 239 (493) Q Consensus 169 ~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~--D~~grL~VgAAIg~-------~~d~~eRa~ 239 (493) |+-++.-.+...+|++...-|.|.-+-. ++...|.+ -..|- -+-++|+- -+-+.+-++ T Consensus 97 DDvVe~Fv~ka~~nGidvfRiFDAlND~------------RNl~~ai~a~kk~G~-h~q~~i~YT~sPvHt~e~yv~~ak 163 (472) T COG5016 97 DDVVEKFVEKAAENGIDVFRIFDALNDV------------RNLKTAIKAAKKHGA-HVQGTISYTTSPVHTLEYYVELAK 163 (472) T ss_pred HHHHHHHHHHHHHCCCCEEEECHHCCCH------------HHHHHHHHHHHHCCC-EEEEEEEECCCCCCCHHHHHHHHH T ss_conf 5899999999986487579721110464------------679999999996186-368998742688652899999999 Q ss_pred HHHHCCCCEEEEECCCCH-H-HHHHHHHHHHHHHCCCCCEECCCCCCHHHH----HHHHHCCCCEEEEEECCCCCCCCCH Q ss_conf 987313666998248411-3-889999999998579872451464699999----9998548758996102787665310 Q gi|254780889|r 240 PLFDVNVDLVVVDTAHGH-S-QKVLDAVVQIKKNFPSLLVMAGNIATAEGA----LALIDAGADIIKVGIGPGSICTTRV 313 (493) Q Consensus 240 ~LveaGvDviviD~ahGh-~-~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~----~~l~~~Gad~ikvGig~Gs~CtTr~ 313 (493) .|.+.|+|-|||--.-|- + ...-+.++.+|+.++ ++|-.-+-+|..-+ ..-++||+|.|---|.|=|.-|.+. T Consensus 164 el~~~g~DSIciKDmaGlltP~~ayelVk~iK~~~~-~pv~lHtH~TsG~a~m~ylkAvEAGvD~iDTAisp~S~gtsqP 242 (472) T COG5016 164 ELLEMGVDSICIKDMAGLLTPYEAYELVKAIKKELP-VPVELHTHATSGMAEMTYLKAVEAGVDGIDTAISPLSGGTSQP 242 (472) T ss_pred HHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCC-CEEEEECCCCCCHHHHHHHHHHHHCCCHHHHHHCCCCCCCCCC T ss_conf 999727987884000026986889999999997459-7069850455561799999999817642221004555788899 No 476 >PRK06739 pyruvate kinase; Validated Probab=94.44 E-value=0.16 Score=30.38 Aligned_cols=124 Identities=17% Similarity=0.209 Sum_probs=76.2 Q ss_pred CCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHC--CCCCEECCCCCCHHHHHHHHH--CCCCEEEEEECCC Q ss_conf 701567899987313666998248411388999999999857--987245146469999999985--4875899610278 Q gi|254780889|r 231 AKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNF--PSLLVMAGNIATAEGALALID--AGADIIKVGIGPG 306 (493) Q Consensus 231 ~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~--~~~~ii~GNv~t~~~~~~l~~--~Gad~ikvGig~G 306 (493) ++.|++-.+-.++.++|.+.+--.+- ..-+..++.+-..+ ++..|| --+.|++|.++|-+ .-+|+|-|-=|-= T Consensus 164 TekD~~dI~~a~~~~vD~valSFVrs--a~DI~~lr~~l~~~~~~~~~II-aKIE~~~al~NldeIi~~sDgIMIARGDL 240 (352) T PRK06739 164 TEKDKKDIQFLLEEDVDFIACSFVRK--PSHIKEIRDFIQQYKETSPNLI-AKIETMEAIENFQDICKEADGIMIARGDL 240 (352) T ss_pred CCCCHHHHHHHHHCCCCEEEECCCCC--HHHHHHHHHHHHHHCCCCCCEE-EEECCHHHHHHHHHHHHHCCEEEEECCCC T ss_conf 80269999999875999999857799--9999999999997268886279-98649989976999998559469965864 Q ss_pred CCCCCCHHCCCCCC--CCHHHHHHHHHHHCCCCEEEEECCC------------CCHHHHEEEEEECCCEEEEC Q ss_conf 76653100015743--1003788999862269819972654------------37776213367148789932 Q gi|254780889|r 307 SICTTRVVTGVGCP--QLSAIMSVVEVAERAGVAIVADGGI------------RFSGDIAKAIAAGSACVMIG 365 (493) Q Consensus 307 s~CtTr~~~g~g~p--q~~av~~~~~~~~~~~~~iiadGGi------------~~~gdi~kAla~GA~~VM~G 365 (493) ++.+.-+ -+| |-.-+..|. +++.|+|--.-+ -..-|++-|+--|||+|||. T Consensus 241 gvei~~e----~vp~~Qk~Ii~~c~----~~gkpvivATqmLeSM~~~p~PTRAEv~DVanav~dG~D~vmLs 305 (352) T PRK06739 241 GVELPYQ----FIPLLQKMMIQECN----RTNTYVITATQMLQSMVDHSIPTRAEVTDVFQAVLDGTNAVMLS 305 (352) T ss_pred CCCCCHH----HHHHHHHHHHHHHH----HCCCCEEEECCCHHHHCCCCCCCHHHHHHHHHHHHHCCCEEEEC T ss_conf 4636988----98999999999999----82997899636288763499887699999999998479889972 No 477 >cd04739 DHOD_like Dihydroorotate dehydrogenase (DHOD) like proteins. DHOD catalyzes the oxidation of (S)-dihydroorotate to orotate. This is the fourth step and the only redox reaction in the de novo biosynthesis of UMP, the precursor of all pyrimidine nucleotides. DHOD requires FMN as co-factor. DHOD divides into class 1 and class 2 based on their amino acid sequences and cellular location. Members of class 1 are cytosolic enzymes and multimers while class 2 enzymes are membrane associated and monomeric. The class 1 enzymes can be further divided into subtypes 1A and 1B which are homodimers and heterotetrameric proteins, respectively. This subgroup has the conserved FMN binding site, but lacks some catalytic residues and may therefore be inactive. Probab=94.38 E-value=0.19 Score=29.74 Aligned_cols=76 Identities=20% Similarity=0.199 Sum_probs=49.1 Q ss_pred EEEECCC-HHHHHHHHHHHCCCCEEEE---------ECCC------------CHHHHHHHHHHHHHHHCCCCCEEC-CCC Q ss_conf 6651370-1567899987313666998---------2484------------113889999999998579872451-464 Q gi|254780889|r 226 AAVSVAK-DIADRVGPLFDVNVDLVVV---------DTAH------------GHSQKVLDAVVQIKKNFPSLLVMA-GNI 282 (493) Q Consensus 226 AAIg~~~-d~~eRa~~LveaGvDvivi---------D~ah------------Gh~~~~~~~i~~~k~~~~~~~ii~-GNv 282 (493) +-++++- |-.+-++++.++|+|.|++ |+-. +-.-..+..+..+++.. +++||+ |.| T Consensus 168 vKLsP~~~di~~ia~aa~~~GAdgi~liNT~~~~~id~~~~~~~~~~~lSg~~~~~~alr~v~~~~~~~-~ipIiG~GGI 246 (325) T cd04739 168 VKLSPFFSALAHMAKQLDAAGADGLVLFNRFYQPDIDLETLEVVPNLLLSSPAEIRLPLRWIAILSGRV-KASLAASGGV 246 (325) T ss_pred EECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHCCC-CCCEEEECCC T ss_conf 953998300999999999759988997357665642167641536877457530068899999996468-9898888895 Q ss_pred CCHHHHHHHHHCCCCEEEEE Q ss_conf 69999999985487589961 Q gi|254780889|r 283 ATAEGALALIDAGADIIKVG 302 (493) Q Consensus 283 ~t~~~~~~l~~~Gad~ikvG 302 (493) .|.+-+...+-+||++|-|| T Consensus 247 ~s~~Da~e~ilAGAsaVQv~ 266 (325) T cd04739 247 HDAEDVVKYLLAGADVVMTT 266 (325) T ss_pred CCHHHHHHHHHCCCCHHHEE T ss_conf 98999999998098876143 No 478 >PRK13135 consensus Probab=94.36 E-value=0.46 Score=27.07 Aligned_cols=67 Identities=25% Similarity=0.328 Sum_probs=48.1 Q ss_pred HHHHHHHHHHCCCCEE-EEEC------CCCHHHHHHHHHHHHHHHCCCCCEECC-CCCCHHHHHHHHHCCCCEEEEE Q ss_conf 5678999873136669-9824------841138899999999985798724514-6469999999985487589961 Q gi|254780889|r 234 IADRVGPLFDVNVDLV-VVDT------AHGHSQKVLDAVVQIKKNFPSLLVMAG-NIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 234 ~~eRa~~LveaGvDvi-viD~------ahGh~~~~~~~i~~~k~~~~~~~ii~G-Nv~t~~~~~~l~~~Gad~ikvG 302 (493) ..+|.+.+.+..-.+| ++-+ .......+.+.++++|+.. ++|+.+| .|-|+|.++.+ ..+||++-|| T Consensus 158 ~~~Ri~~i~~~s~GFiY~Vs~~GvTG~~~~~~~~~~~~i~~ik~~t-~~Pv~vGFGI~~~e~v~~i-~~~ADGvIVG 232 (267) T PRK13135 158 DESRIRTVARLGRGFVYYVSVTGVTGARSGVEATVGGNVAKIREKI-TVPVVVGFGISTPQQAADV-AAMADGVVVG 232 (267) T ss_pred CHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHC-CCCEEEEECCCCHHHHHHH-HCCCCEEEEC T ss_conf 5799999996189818998545666776444488999999998606-8984898167999999999-8059999987 No 479 >COG0269 SgbH 3-hexulose-6-phosphate synthase and related proteins [Carbohydrate transport and metabolism] Probab=94.34 E-value=0.34 Score=28.03 Aligned_cols=34 Identities=26% Similarity=0.517 Sum_probs=16.4 Q ss_pred CEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCC Q ss_conf 81997265437776213367148789932301012 Q gi|254780889|r 337 VAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGT 371 (493) Q Consensus 337 ~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt 371 (493) ..+=--|||+ +-++-.....|++.|-.|+.+.+. T Consensus 165 ~~vAVaGGI~-~~~i~~~~~~~~~ivIvGraIt~a 198 (217) T COG0269 165 AKVAVAGGIT-PEDIPLFKGIGADIVIVGRAITGA 198 (217) T ss_pred CEEEEECCCC-HHHHHHHHCCCCCEEEECCHHCCC T ss_conf 3599866878-878899864899799988221489 No 480 >cd04728 ThiG Thiazole synthase (ThiG) is the tetrameric enzyme that is involved in the formation of the thiazole moiety of thiamin pyrophosphate, an essential ubiquitous cofactor that plays an important role in carbohydrate and amino acid metabolism. ThiG catalyzes the formation of thiazole from 1-deoxy-D-xylulose 5-phosphate (DXP) and dehydroglycine, with the help of the sulfur carrier protein ThiS that carries the sulfur needed for thiazole assembly on its carboxy terminus (ThiS-COSH). Probab=94.31 E-value=0.47 Score=27.00 Aligned_cols=93 Identities=17% Similarity=0.174 Sum_probs=67.9 Q ss_pred CCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCC Q ss_conf 57303589999999865533064574799786889802100010174223265687655566513701567899987313 Q gi|254780889|r 166 VKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVN 245 (493) Q Consensus 166 v~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaG 245 (493) .|.....-+|.+.|.+.+...+|-.+ ||..-+++|.++| T Consensus 106 lPD~~eTl~Aae~Lv~~GF~VlpY~~-----------------------------------------~D~v~akrLe~~G 144 (248) T cd04728 106 LPDPIETLKAAEILVKEGFTVLPYCT-----------------------------------------DDPVLAKRLEDAG 144 (248) T ss_pred CCCHHHHHHHHHHHHHCCCEEEEECC-----------------------------------------CCHHHHHHHHHCC T ss_conf 98868999999999988998978678-----------------------------------------8999999999749 Q ss_pred CCEEE-----EECCCCHHHHHHHHHHHHHHHCCCCCEEC-CCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 66699-----82484113889999999998579872451-46469999999985487589961 Q gi|254780889|r 246 VDLVV-----VDTAHGHSQKVLDAVVQIKKNFPSLLVMA-GNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 246 vDviv-----iD~ahGh~~~~~~~i~~~k~~~~~~~ii~-GNv~t~~~~~~l~~~Gad~ikvG 302 (493) +.++- |-+-.| ..-...|+.|++.+ ++|||+ ..+.++..+..-++.|+|+|.+. T Consensus 145 c~avMPlgsPIGSg~G--l~n~~~l~~i~e~~-~vPvIVDAGiG~pS~Aa~aMElG~daVL~N 204 (248) T cd04728 145 CAAVMPLGSPIGSGQG--LLNPYNLRIIIERA-DVPVIVDAGIGTPSDAAQAMELGADAVLLN 204 (248) T ss_pred CEEEEECCCCCCCCCC--CCCHHHHHHHHHHC-CCCEEEECCCCCHHHHHHHHHCCCCEEEHH T ss_conf 5345204564347988--79999999999847-998898479997567899987265533454 No 481 >PRK13131 consensus Probab=94.30 E-value=0.47 Score=26.98 Aligned_cols=68 Identities=13% Similarity=0.109 Sum_probs=47.7 Q ss_pred HHHHHHHHHHCCCCEEEEECCCC-------HHHHHHHHHHHHHHHCCCCCEECC-CCCCHHHHHHHHHCCCCEEEEE Q ss_conf 56789998731366699824841-------138899999999985798724514-6469999999985487589961 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVDTAHG-------HSQKVLDAVVQIKKNFPSLLVMAG-NIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD~ahG-------h~~~~~~~i~~~k~~~~~~~ii~G-Nv~t~~~~~~l~~~Gad~ikvG 302 (493) ..+|.+.+.+..-.+|-.=+..| ......+.++++|+.. ++|+.+| .|.|+|.+..+...|||++-|| T Consensus 152 ~~~Ri~~i~~~s~GFiY~vs~~GvTG~~~~~~~~~~~~i~~ik~~t-~~Pv~vGFGIs~~e~v~~~~~~gaDGvIVG 227 (257) T PRK13131 152 SVKDLEQVATHSQGYIYTLARSGVTGASHTLENDASAIIKTLKTFS-PTPALLGFGISKKEHITNAKGMGADGVICG 227 (257) T ss_pred CHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHCC-CCCEEEECCCCCHHHHHHHHHCCCCEEEEC T ss_conf 8899999983589749998457677986434076999999999668-998799805798899999985599999987 No 482 >PRK05211 consensus Probab=94.26 E-value=0.48 Score=26.93 Aligned_cols=89 Identities=25% Similarity=0.327 Sum_probs=61.1 Q ss_pred CHHHHHHHHHHHCCCCE-EEEEC--C-CCHHHHHHHHHHHHHHHCCCCC-EECCCCCCHHHHHHHHHCCCCEEEEEECCC Q ss_conf 01567899987313666-99824--8-4113889999999998579872-451464699999999854875899610278 Q gi|254780889|r 232 KDIADRVGPLFDVNVDL-VVVDT--A-HGHSQKVLDAVVQIKKNFPSLL-VMAGNIATAEGALALIDAGADIIKVGIGPG 306 (493) Q Consensus 232 ~d~~eRa~~LveaGvDv-iviD~--a-hGh~~~~~~~i~~~k~~~~~~~-ii~GNv~t~~~~~~l~~~Gad~ikvGig~G 306 (493) +|-.+-|+...+.|+|- .++|. | .|.. .-.+.|+.+.+.. .+| -++|.|-|.+.++.|.++|||-|-+| T Consensus 21 gDP~~~ak~~~~~gadelhivDld~a~~g~~-~n~~~I~~i~~~~-~~Pl~vGGGIrs~~~i~~ll~~GadkViig---- 94 (248) T PRK05211 21 GDIVPLAKRYAEEGADELVFYDITASSDGRV-VDKSWVSRVAEVI-DIPFCVAGGIKSVEDAREILSFGADKISIN---- 94 (248) T ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCC-CHHHHHHHHHHHC-CCCEEEECCCCCHHHHHHHHHCCCCEEEEC---- T ss_conf 8999999999986999899997867767872-1499999999767-985896278013899999998799889989---- Q ss_pred CCCCCCHHCCCCCCCCHHHHHHHHHHHCCCCE Q ss_conf 76653100015743100378899986226981 Q gi|254780889|r 307 SICTTRVVTGVGCPQLSAIMSVVEVAERAGVA 338 (493) Q Consensus 307 s~CtTr~~~g~g~pq~~av~~~~~~~~~~~~~ 338 (493) ...+--|.+- .++++.+|.. T Consensus 95 -------s~a~~np~li-----~~~~~~fG~q 114 (248) T PRK05211 95 -------SPALADPTLI-----TRLADRFGVQ 114 (248) T ss_pred -------CHHHHCCHHH-----HHHHHHCCCC T ss_conf -------7676196189-----9999857993 No 483 >cd04724 Tryptophan_synthase_alpha Ttryptophan synthase (TRPS) alpha subunit (TSA). TPRS is a bifunctional tetrameric enzyme (2 alpha and 2 beta subunits) that catalyzes the last two steps of L-tryptophan biosynthesis. Alpha and beta subunit catalyze two distinct reactions which are both strongly stimulated by the formation of the complex. The alpha subunit catalyzes the cleavage of indole 3-glycerol phosphate (IGP) to indole and d-glyceraldehyde 3-phosphate (G3P). Indole is then channeled to the active site of the beta subunit, a PLP-dependent enzyme that catalyzes a replacement reaction to convert L-serine into L-tryptophan. Probab=94.23 E-value=0.48 Score=26.89 Aligned_cols=67 Identities=21% Similarity=0.315 Sum_probs=47.6 Q ss_pred HHHHHHHHHHCCCCEE-EEE------CCCCHHHHHHHHHHHHHHHCCCCCEECC-CCCCHHHHHHHHHCCCCEEEEE Q ss_conf 5678999873136669-982------4841138899999999985798724514-6469999999985487589961 Q gi|254780889|r 234 IADRVGPLFDVNVDLV-VVD------TAHGHSQKVLDAVVQIKKNFPSLLVMAG-NIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 234 ~~eRa~~LveaGvDvi-viD------~ahGh~~~~~~~i~~~k~~~~~~~ii~G-Nv~t~~~~~~l~~~Gad~ikvG 302 (493) ..+|.+.+.+..-.++ ++- ...-.+....+.++++|+.. ++|+++| .|.|+|.++.+.+. ||++-|| T Consensus 141 ~~~ri~~i~~~s~gfiY~vs~~GvTG~~~~~~~~~~~~i~~ik~~t-~~Pv~vGFGI~~~e~v~~~~~~-aDGvIVG 215 (242) T cd04724 141 PDERIKKIAELASGFIYYVSRTGVTGARTELPDDLKELIKRIRKYT-DLPIAVGFGISTPEQAAEVAKY-ADGVIVG 215 (242) T ss_pred CHHHHHHHHHHCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHC-CCCEEEECCCCCHHHHHHHHHH-CCEEEEC T ss_conf 7899999997479849998577777877556499999999998716-8974874387999999999965-9999987 No 484 >PRK07428 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=94.23 E-value=0.39 Score=27.60 Aligned_cols=38 Identities=29% Similarity=0.425 Sum_probs=17.4 Q ss_pred HHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEE Q ss_conf 99999999857987245146469999999985487589 Q gi|254780889|r 262 LDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADII 299 (493) Q Consensus 262 ~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~i 299 (493) -+.++.+|+..|....|.=-|.|.+.++..+++|+|.| T Consensus 180 ~~ai~~~r~~~~~~~~IeVEv~~l~q~~~a~~~g~DiI 217 (285) T PRK07428 180 GEAITRIRQQIPYPLTIEVETETLEQVQEALEYGADII 217 (285) T ss_pred HHHHHHHHHHCCCCCEEEEEECCHHHHHHHHHCCCCEE T ss_conf 99999999748998269999698999999996699999 No 485 >TIGR03415 ABC_choXWV_ATP choline ABC transporter, ATP-binding protein. Members of this protein family are the ATP-binding subunit of a three-protein transporter. This family belongs, more broadly, to the family of proline and glycine-betaine transporters, but members have been identified by direct characterization and by bioinformatic means as choline transporters. Many species have several closely-related members of this family, probably with variable abilities to act additionally on related quaternary amines. Probab=94.20 E-value=0.076 Score=32.67 Aligned_cols=16 Identities=25% Similarity=0.115 Sum_probs=6.1 Q ss_pred CCCEEEEEEECCCHHC Q ss_conf 9978688980210001 Q gi|254780889|r 193 DGCCIGLITVKDIERS 208 (493) Q Consensus 193 ~g~LvGIIT~~DIlk~ 208 (493) +|+++-+=|..+|+.+ T Consensus 240 ~G~IvQ~gtpeeI~~~ 255 (382) T TIGR03415 240 GGRIIQHGTPEEIVLN 255 (382) T ss_pred CCEEEEECCHHHHHHC T ss_conf 9989996588999867 No 486 >KOG0369 consensus Probab=94.19 E-value=0.49 Score=26.83 Aligned_cols=140 Identities=19% Similarity=0.169 Sum_probs=86.1 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCH-------------H Q ss_conf 303589999999865533064574799786889802100010174223265687655566513701-------------5 Q gi|254780889|r 168 KTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKD-------------I 234 (493) Q Consensus 168 ~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d-------------~ 234 (493) ++.-+..-.+.-.+++....-|.|.-+-|- .....-+|.+-.-| .|-|||+-.+| + T Consensus 652 PDNviykFce~Ak~nGmDiFRVFDsLN~lp---------nl~lGmeAagkAGG--VVEAai~YtGDv~dp~rtKY~L~YY 720 (1176) T KOG0369 652 PDNVIYKFCEQAKKNGMDIFRVFDSLNYLP---------NLLLGMEAAGKAGG--VVEAAICYTGDVLDPSRTKYNLDYY 720 (1176) T ss_pred CHHHHHHHHHHHHHCCCCEEEEHHHHHHHH---------HHHHHHHHHHCCCC--EEEEEEEECCCCCCCCCCCCCHHHH T ss_conf 716899999999854861566101331024---------54555665411587--5899885025557865441147999 Q ss_pred HHHHHHHHHCCCCEEEE-ECCCCH-HHHHHHHHHHHHHHCCCCCEECCCCCCHHHH----HHHHHCCCCEEEEEECCCCC Q ss_conf 67899987313666998-248411-3889999999998579872451464699999----99985487589961027876 Q gi|254780889|r 235 ADRVGPLFDVNVDLVVV-DTAHGH-SQKVLDAVVQIKKNFPSLLVMAGNIATAEGA----LALIDAGADIIKVGIGPGSI 308 (493) Q Consensus 235 ~eRa~~LveaGvDvivi-D~ahGh-~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~----~~l~~~Gad~ikvGig~Gs~ 308 (493) .+-+..|+++|..+++| |.|--- -....-.|..||..||++||-.-.-.|.-+. ..-+.||||.+-|-+-+=|. T Consensus 721 ~nlad~lV~agtHiL~IKDMAG~lKP~aa~lLi~alRdk~PdlPiHvHtHDtsGagVAsMlaca~AGADVVDvA~dSMSG 800 (1176) T KOG0369 721 LNLADKLVKAGTHILGIKDMAGVLKPEAAKLLIGALRDKFPDLPIHVHTHDTSGAGVASMLACALAGADVVDVAVDSMSG 800 (1176) T ss_pred HHHHHHHHHCCCEEEEEHHHHCCCCHHHHHHHHHHHHHCCCCCCEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCC T ss_conf 98999998626718865444233676888999998874089985377416776387999999987399604433001456 Q ss_pred CCCCHHCCCC Q ss_conf 6531000157 Q gi|254780889|r 309 CTTRVVTGVG 318 (493) Q Consensus 309 CtTr~~~g~g 318 (493) -|.+...|.- T Consensus 801 mTSQPSmgA~ 810 (1176) T KOG0369 801 MTSQPSMGAL 810 (1176) T ss_pred CCCCCCHHHH T ss_conf 6568851032 No 487 >pfam05690 ThiG Thiazole biosynthesis protein ThiG. This family consists of several bacterial thiazole biosynthesis protein G sequences. ThiG, together with ThiF and ThiH, is proposed to be involved in the synthesis of 4-methyl-5-(b-hydroxyethyl)thiazole (THZ) which is an intermediate in the thiazole production pathway. Probab=94.17 E-value=0.5 Score=26.81 Aligned_cols=92 Identities=16% Similarity=0.191 Sum_probs=67.6 Q ss_pred CCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCC Q ss_conf 73035899999998655330645747997868898021000101742232656876555665137015678999873136 Q gi|254780889|r 167 KKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNV 246 (493) Q Consensus 167 ~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGv 246 (493) |.....-+|.+.+.+.+...+|-.++ |..-++.|.++|+ T Consensus 106 PD~~etl~Aae~Lv~eGF~VlpY~~~-----------------------------------------D~v~akrLed~Gc 144 (246) T pfam05690 106 PDPIETLKAAEILVKEGFTVLPYTTD-----------------------------------------DPVLARRLEEAGC 144 (246) T ss_pred CCHHHHHHHHHHHHHCCCEEEEECCC-----------------------------------------CHHHHHHHHHCCC T ss_conf 88789999999999789989886179-----------------------------------------9899999987598 Q ss_pred CEEE-----EECCCCHHHHHHHHHHHHHHHCCCCCEEC-CCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 6699-----82484113889999999998579872451-46469999999985487589961 Q gi|254780889|r 247 DLVV-----VDTAHGHSQKVLDAVVQIKKNFPSLLVMA-GNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 247 Dviv-----iD~ahGh~~~~~~~i~~~k~~~~~~~ii~-GNv~t~~~~~~l~~~Gad~ikvG 302 (493) .++- |-+-.| ..-...++.|++.+ ++|||+ ..+.++..+..-++.|+|+|.|. T Consensus 145 ~avMPlgsPIGSg~G--l~n~~~l~~i~e~~-~vPvIVDAGiG~pS~Aa~aMElG~DaVLvN 203 (246) T pfam05690 145 AAVMPLGAPIGSGLG--LRNPENLRIIIEEA-DVPVIVDAGIGTPSDAAQAMELGADAVLLN 203 (246) T ss_pred EEEEECCCCCCCCCC--CCCHHHHHHHHHHC-CCCEEEECCCCCHHHHHHHHHCCCCEEEHH T ss_conf 498622440136888--68999999999967-998898489896788999997456777730 No 488 >TIGR01108 oadA oxaloacetate decarboxylase alpha subunit; InterPro: IPR005776 This family describes the bacterial oxaloacetate decarboxylase alpha subunit and its equivalents in archaea . The oxaloacetate decarboxylase Na+ pump is the paradigm of the family of Na+ transport decarboxylases that present in bacteria and archaea. It a multi subunit enzyme consisting of a peripheral alpha-subunit and integral membrane subunits beta and gamma. The energy released by the decarboxylation reaction of oxaloacetate is coupled to Na+ ion pumping across the membrane.; GO: 0008948 oxaloacetate decarboxylase activity, 0006814 sodium ion transport. Probab=94.11 E-value=0.21 Score=29.45 Aligned_cols=210 Identities=22% Similarity=0.358 Sum_probs=124.7 Q ss_pred CCCHHHHHHHHHHHCCC---CEEEECCCCEEEEEEEEHH-------H--HCCCCCCCCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 54303456566760504---1678668860278872354-------5--0014344672440026786325730358999 Q gi|254780889|r 108 YATLADALALMKKYSIS---GIPVVESDVGKLVGILTNR-------D--VRFASNAQQAVGELMTRNLITVKKTVNLENA 175 (493) Q Consensus 108 d~TI~ea~~lm~~~~is---~iPVVD~~~gkLiGIVT~r-------D--ir~~~~~~~~V~eiMt~~litv~~~~~l~eA 175 (493) |.+++||.+-+.-.+.+ -+||.+.= -=+|+.+-- | +||+.. + |=..|.+. T Consensus 3 Dv~LRDaHQSL~ATRmRteDMLPi~~~L--D~vGfwSLEvWGGATFDaC~RFL~E-----------D-----PW~RLR~l 64 (616) T TIGR01108 3 DVVLRDAHQSLIATRMRTEDMLPILEKL--DDVGFWSLEVWGGATFDACIRFLNE-----------D-----PWERLREL 64 (616) T ss_pred EEEECCHHHHHHHHHCCHHHHHHHHHHH--HHCCCEEEEECCCHHHHHHHHHCCC-----------C-----HHHHHHHH T ss_conf 0135206776675305731368999875--0249556520244105578442488-----------8-----55899999 Q ss_pred HHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHH-HHHHHHCCCCEE-EEEC Q ss_conf 99998655330645747997868898021000101742232656876555665137015678-999873136669-9824 Q gi|254780889|r 176 KALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADR-VGPLFDVNVDLV-VVDT 253 (493) Q Consensus 176 ~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eR-a~~LveaGvDvi-viD~ 253 (493) .+.|.+.++.-|. .-.+|+| -|.||+ |..|+ +++.++.|+|++ ++|- T Consensus 65 k~~~pnT~L~MLL---RGQNLlG---------YRHYAD-------------------DVVe~FV~~a~~NG~DVFRiFDA 113 (616) T TIGR01108 65 KKALPNTKLQMLL---RGQNLLG---------YRHYAD-------------------DVVEAFVKKAVENGLDVFRIFDA 113 (616) T ss_pred HHHCCCCCHHHHH---CCCCCCC---------CCCCCH-------------------HHHHHHHHHHHHCCCEEEEEECC T ss_conf 9735787512342---0454234---------415843-------------------68999999999759808995124 Q ss_pred CCCHHHHHHHHHHHHHHHCC-CCCE-E---CCCCCCHHHH----HHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHH Q ss_conf 84113889999999998579-8724-5---1464699999----999854875899610278766531000157431003 Q gi|254780889|r 254 AHGHSQKVLDAVVQIKKNFP-SLLV-M---AGNIATAEGA----LALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSA 324 (493) Q Consensus 254 ahGh~~~~~~~i~~~k~~~~-~~~i-i---~GNv~t~~~~----~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~a 324 (493) =| ..+|+..+|+.+|+.-. +++. | .--|.|.|-+ +.|++-|+|.| |+ ...+|+=-|... T Consensus 114 LN-D~RNl~~ai~a~Kk~g~dHvQg~iSYTtSPvHTl~~yl~la~~L~~~G~DSI---------~I-KDMaGlLTP~~A- 181 (616) T TIGR01108 114 LN-DPRNLQKAIEAAKKHGADHVQGAISYTTSPVHTLEKYLELAKELLEMGVDSI---------CI-KDMAGLLTPKVA- 181 (616) T ss_pred CC-CHHHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCEE---------EE-ECCCCCCCHHHH- T ss_conf 58-8778999999999738978999971246843678889999999998188605---------52-020046441589- Q ss_pred HHHHHHHH-HCCC-CEEE----EECCCCCHHHHEEEEEECCCEE--EECCHHHCCCCCCCCEEE Q ss_conf 78899986-2269-8199----7265437776213367148789--932301012167764487 Q gi|254780889|r 325 IMSVVEVA-ERAG-VAIV----ADGGIRFSGDIAKAIAAGSACV--MIGSLLAGTDESPGDIFL 380 (493) Q Consensus 325 v~~~~~~~-~~~~-~~ii----adGGi~~~gdi~kAla~GA~~V--M~G~~~agt~Espg~~~~ 380 (493) ||-..+. ++++ +||= +--|.-.- =..||+-||+|.+ =+-||=+||--+|+|.+. T Consensus 182 -YELV~alK~~~~n~pvhLH~H~TtGmA~~-AllkA~EAG~d~iDTAisS~S~gtSHPptE~lv 243 (616) T TIGR01108 182 -YELVSALKKEFGNLPVHLHSHATTGMAEM-ALLKAIEAGADMIDTAISSMSGGTSHPPTETLV 243 (616) T ss_pred -HHHHHHHHHHCCCEEEEEECCCCHHHHHH-HHHHHHHCCCCCCHHCCCCCCCCCCCCHHHHHH T ss_conf -99999997423974688632472337999-999888707880020055234788887479999 No 489 >cd06556 ICL_KPHMT Members of the ICL/PEPM_KPHMT enzyme superfamily catalyze the formation and cleavage of either P-C or C-C bonds. Typical members are phosphoenolpyruvate mutase (PEPM), phosphonopyruvate hydrolase (PPH), carboxyPEP mutase (CPEP mutase), oxaloacetate hydrolase (OAH), isocitrate lyase (ICL), 2-methylisocitrate lyase (MICL), and ketopantoate hydroxymethyltransferase (KPHMT). Probab=94.11 E-value=0.51 Score=26.72 Aligned_cols=13 Identities=23% Similarity=0.266 Sum_probs=4.5 Q ss_pred HHHHHCCCCEEEE Q ss_conf 9987313666998 Q gi|254780889|r 239 GPLFDVNVDLVVV 251 (493) Q Consensus 239 ~~LveaGvDvivi 251 (493) .+|.+||++.|++ T Consensus 163 ~ale~AGa~~ivl 175 (240) T cd06556 163 LAYAPAGADLIVM 175 (240) T ss_pred HHHHHCCCEEEEE T ss_conf 9998479849987 No 490 >TIGR03239 GarL 2-dehydro-3-deoxyglucarate aldolase. In E. coli this enzyme (GarL, ) 2-dehydro-3-deoxyglucarate aldolase acts in the catabolism of several sugars including D-galactarate, D-glucarate and L-idarate. In fact, 5-dehydro-4-deoxy-D-glucarate aldolase is a synonym for this enzyme as it is unclear in the literature whether the enzyme acts on only one of these or, as seems likely, has no preference. (Despite the apparent large difference in substrate stucture indicated by their names, 2-DH-3DO- and 5-DH-4DO-glucarate differ only by the chirality of most central hydroxyl-bearing carbon and is alternately named 2-DH-3DO-galactarate.) The reported product of D-galactarate dehydratase (4.2.1.42) is the 5DH-4DO-glucarate isomer and this enzyme is found proximal to the aldolase in many genomes (GenProp0714) where no epimerase is known. Similarly, the product of D-glucarate dehydratase (4.2.1.40) is again the 5-DH-4DO isomer, so the provenance of the 2-DH-3DO-glucarate isomer for which Probab=94.08 E-value=0.52 Score=26.69 Aligned_cols=94 Identities=12% Similarity=0.061 Sum_probs=49.3 Q ss_pred CCCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHH--HHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCC Q ss_conf 68765556651370156789998731366699824841138--8999999999857987245146469999999985487 Q gi|254780889|r 219 KGRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQ--KVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGA 296 (493) Q Consensus 219 ~grL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~--~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Ga 296 (493) +|+..+|.-+... .-.=++.+..+|.|.++||.-||... .+.+++..++.. ...+++==.-.++...+..+|+|| T Consensus 9 ~g~~~~G~~~~~~--sp~~~Ei~a~~G~D~v~iD~EHg~~~~~~~~~~~~a~~~~-g~~~iVRvp~~~~~~i~~~LD~Ga 85 (249) T TIGR03239 9 ARETLIGCWSALG--NPITTEVLGLAGFDWLLLDGEHAPNDVLTFIPQLMALKGS-ASAPVVRPPWNEPVIIKRLLDIGF 85 (249) T ss_pred CCCCEEEEEECCC--CHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHC-CCCCEEECCCCCHHHHHHHHCCCC T ss_conf 7998899874689--9899999971896999981044899999999999999846-998489799998789999970899 Q ss_pred CEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHHH Q ss_conf 5899610278766531000157431003788999862 Q gi|254780889|r 297 DIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVAE 333 (493) Q Consensus 297 d~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~~ 333 (493) ++|- +|++....++.++.+ T Consensus 86 ~Gii------------------vP~V~t~eea~~~v~ 104 (249) T TIGR03239 86 YNFL------------------IPFVESAEEAERAVA 104 (249) T ss_pred CEEE------------------ECCCCCHHHHHHHHH T ss_conf 8789------------------517455999999999 No 491 >PRK13120 consensus Probab=94.08 E-value=0.52 Score=26.68 Aligned_cols=185 Identities=18% Similarity=0.189 Sum_probs=106.0 Q ss_pred CCCCCCCCC---CCCCCCCCCCCCCHHHHHHHHHHCCC--CEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEE Q ss_conf 446724400---26786325730358999999986553--3064574799786889802100010174223265687655 Q gi|254780889|r 150 NAQQAVGEL---MTRNLITVKKTVNLENAKALLHQHRI--EKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRV 224 (493) Q Consensus 150 ~~~~~V~ei---Mt~~litv~~~~~l~eA~~iM~~~ki--~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~V 224 (493) .++.|+.|- ..-....+..+.++.++++++.+-|. ...|+| +.+ |-|.. T Consensus 57 PFSDPvADGPvIQ~A~~rAL~~G~~l~~vl~~v~~~r~~~~~~Piv--------lM~---------Y~Npi--------- 110 (285) T PRK13120 57 PFSDPMADGPVVQRAAERAIAQGVGLRRVLELVADFRRDDSVTPVV--------LMG---------YANPI--------- 110 (285) T ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCEE--------EEE---------HHHHH--------- T ss_conf 8787456689999999999976998446999999987348988889--------861---------05499--------- Q ss_pred EEEEECCCHHHHHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEE Q ss_conf 56651370156789998731366-69982484113889999999998579872451464699999999854875899610 Q gi|254780889|r 225 AAAVSVAKDIADRVGPLFDVNVD-LVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGI 303 (493) Q Consensus 225 gAAIg~~~d~~eRa~~LveaGvD-viviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGi 303 (493) -......-++.+.++|+| +|+.|...-.+....+.++ +..+.-+++++.+ .+.+-.+.+.+..-..|-+= T Consensus 111 -----~~yG~e~F~~~~~~aGvdGlIIpDLP~EE~~~~~~~~~--~~gi~~I~LiaPt-T~~eRi~~I~~~s~GFvY~V- 181 (285) T PRK13120 111 -----ERMGQRAFAQAAQAAGVDGVLVVDYPPEEVDEFAAMLA--EAGVAPIFLLAPT-STEARIEAIGRVARGYVYYV- 181 (285) T ss_pred -----HHHHHHHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHH--HCCCCEEEEECCC-CCHHHHHHHHHHCCCCEEEE- T ss_conf -----99879999999998398779647999799999999999--6699658995799-98999999995089818998- Q ss_pred CCCCCCCCCHHCCCCCCCCHHHHH-HHHHHHCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCCCCC Q ss_conf 278766531000157431003788-9998622698199726543777621336714878993230101216776 Q gi|254780889|r 304 GPGSICTTRVVTGVGCPQLSAIMS-VVEVAERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDESPG 376 (493) Q Consensus 304 g~Gs~CtTr~~~g~g~pq~~av~~-~~~~~~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~Espg 376 (493) +..-+||..---..-+.+ ..+..+..++||..-=||+++-|+.+ +...||.|-+||-|-..-|--| T Consensus 182 ------S~~GVTG~~~~~~~~l~~~i~~ik~~t~~Pv~vGFGIs~~e~v~~-~~~~ADGvIVGSAiVk~Iee~~ 248 (285) T PRK13120 182 ------SLKGVTGAGSLDTDDVARKLALIRRHVHIPVGVGFGIRDAASAQR-IAAHADAVVIGSKLIETMEQAG 248 (285) T ss_pred ------ECCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHH-HHCCCCEEEECHHHHHHHHHCC T ss_conf ------656546887556688999999999726997599962598999999-9702999998789999999707 No 492 >PRK13140 consensus Probab=94.07 E-value=0.52 Score=26.67 Aligned_cols=91 Identities=12% Similarity=0.130 Sum_probs=58.7 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEEE Q ss_conf 89999999865533064574799786889802100010174223265687655566513701567899987313666998 Q gi|254780889|r 172 LENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVVV 251 (493) Q Consensus 172 l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDvivi 251 (493) .++......++++..++.+-.. ...+|.+.+.+..-.+|-. T Consensus 133 ~~~~~~~~~~~~i~~I~lvaPt---------------------------------------t~~~Ri~~i~~~a~gFiY~ 173 (257) T PRK13140 133 QEHYKEIFEKYGLKNIMLITPQ---------------------------------------TSEERIREIDSHTDGFIYM 173 (257) T ss_pred HHHHHHHHHHCCCCEEEEECCC---------------------------------------CCHHHHHHHHHHCCCCEEE T ss_conf 8999999998699779986899---------------------------------------9899999999739996687 Q ss_pred ECCC-------CHHHHHHHHHHHHHHHCCCCCEECC-CCCCHHHHHHHHHCCCCEEEEE Q ss_conf 2484-------1138899999999985798724514-6469999999985487589961 Q gi|254780889|r 252 DTAH-------GHSQKVLDAVVQIKKNFPSLLVMAG-NIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 252 D~ah-------Gh~~~~~~~i~~~k~~~~~~~ii~G-Nv~t~~~~~~l~~~Gad~ikvG 302 (493) =+-. .......+.++++|+...++|+++| .|.|+|.++.+... ||++-|| T Consensus 174 vs~~GvTG~~~~~~~~~~~~i~~ik~~~~~~Pv~vGFGIs~~e~v~~~~~~-aDGvIVG 231 (257) T PRK13140 174 VSSASTTGAQQDFNNQKRAYFKRIKDMNLKNPRMVGFGISNKATFNAACEY-ASGAIIG 231 (257) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHC-CCEEEEC T ss_conf 036566688766515689999999982789986998057989999999831-9999987 No 493 >COG0134 TrpC Indole-3-glycerol phosphate synthase [Amino acid transport and metabolism] Probab=94.05 E-value=0.5 Score=26.80 Aligned_cols=115 Identities=21% Similarity=0.345 Sum_probs=63.9 Q ss_pred HCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCC Q ss_conf 31366699824841138899999999985798724514646999999998548758996102787665310001574310 Q gi|254780889|r 243 DVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQL 322 (493) Q Consensus 243 eaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~ 322 (493) ..|+|++.+=++.--.....+.....++ + .+.+.+ -|.+.+-.....++|++.| ||-+-..-| | T Consensus 127 ~~GADavLLI~~~L~~~~l~el~~~A~~-L-Gm~~LV-EVh~~eEl~rAl~~ga~iI--GINnRdL~t-----------f 190 (254) T COG0134 127 AAGADAVLLIVAALDDEQLEELVDRAHE-L-GMEVLV-EVHNEEELERALKLGAKII--GINNRDLTT-----------L 190 (254) T ss_pred HCCCCCHHHHHHHCCHHHHHHHHHHHHH-C-CCCEEE-EECCHHHHHHHHHCCCCEE--EEECCCCCH-----------H T ss_conf 8085619999996399999999999997-6-992389-9789999999996799889--983788402-----------1 Q ss_pred HHHHHH-HHHH--HCCCCEEEEECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 037889-9986--22698199726543777621336714878993230101216 Q gi|254780889|r 323 SAIMSV-VEVA--ERAGVAIVADGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 323 ~av~~~-~~~~--~~~~~~iiadGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) +.-.+. .+.+ -..++.+|+-.||.++.|+-+....||+++.+|+-|-.++. T Consensus 191 ~vdl~~t~~la~~~p~~~~~IsESGI~~~~dv~~l~~~ga~a~LVG~slM~~~~ 244 (254) T COG0134 191 EVDLETTEKLAPLIPKDVILISESGISTPEDVRRLAKAGADAFLVGEALMRADD 244 (254) T ss_pred EECHHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHCCCCEEEECHHHHCCCC T ss_conf 006889999884487775899617989999999999748998996388856999 No 494 >PRK13305 sgbH 3-keto-L-gulonate-6-phosphate decarboxylase; Provisional Probab=93.98 E-value=0.33 Score=28.10 Aligned_cols=67 Identities=13% Similarity=0.187 Sum_probs=39.3 Q ss_pred HHHHHHHHHHCCCCEEEEECCC-----CH--HHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 5678999873136669982484-----11--388999999999857987245146469999999985487589961 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVDTAH-----GH--SQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD~ah-----Gh--~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) ..+|++.+.+.|+|.+++-.+. |. ...-++.++.+.. ..-.--++|.+- ++....+...|+|.+-|| T Consensus 118 ~~~~ak~~~~lgv~~v~~H~g~D~q~~g~~~~~~~l~~~k~~~~-~~~~vaVAGGI~-~~~i~~~~~~~~~ivIvG 191 (220) T PRK13305 118 TLDDARDWHRIGVRQAIYHRGRDAQASGQQWGEADLARMKALSD-IGLELSITGGIT-PADLPLFKDIRVKAFIAG 191 (220) T ss_pred CHHHHHHHHHCCCCEEEEEECCCHHHCCCCCCHHHHHHHHHHHH-CCCEEEEECCCC-HHHHHHHHCCCCCEEEEC T ss_conf 87899999986998899983336765189863101999998760-696499988878-889999971699899989 No 495 >TIGR00078 nadC nicotinate-nucleotide pyrophosphorylase; InterPro: IPR004393 Nicotinate-nucleotide pyrophosphorylase (2.4.2.19 from EC), also known as quinolinate phosphoribosyltransferase (decarboxylating), catalyses the conversion of nicotinate D-ribonucleotide, pyrophosphate and carbon dioxide into pyridine-2,3-dicarboxylate and 5-phospho-alpha-D-ribose 1-diphosphate. This enzyme is a type II phosphoribosyltransferase which provides the de novo source of nicotinate mononucleotide (NAMN) for NAD biosynthesis in both prokaryotes and eukaryotes , . Structural studies have shown that the active form of this enzyme is a homodimer with an unsual fold , , . The N-terminal forms a mixed alpha/beta domain, while the C-terminal region forms a multi-stranded, open alpha/beta barrel. The active site is found at the C-terminal ends of the beta strands of the alpha/beta barrel, and is bordered by the N-terminal domain of the second subunit of the dimer. It contains several conserved charged residues thought to be important determinants of substrate binding and catalysis. ; GO: 0004514 nicotinate-nucleotide diphosphorylase (carboxylating) activity, 0019363 pyridine nucleotide biosynthetic process. Probab=93.97 E-value=0.31 Score=28.32 Aligned_cols=66 Identities=24% Similarity=0.419 Sum_probs=52.3 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEEC-CCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 5678999873136669982484113889999999998579872451-46469999999985487589961 Q gi|254780889|r 234 IADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMA-GNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 234 ~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~-GNv~t~~~~~~l~~~Gad~ikvG 302 (493) -.|=+.+-++||||+|-+| |--..-+.+.++.++..+|++-+=| ||| |.+....+...|+|.|=+| T Consensus 195 nlE~a~eA~~AGADiImLD--Nm~p~~~~~av~~~~~~~p~~~~EaSGGi-tl~n~~~ya~~gVD~IS~G 261 (276) T TIGR00078 195 NLEEAEEAAEAGADIIMLD--NMKPEEIKEAVELLKGRNPNVLVEASGGI-TLDNIEEYAETGVDVISSG 261 (276) T ss_pred CHHHHHHHHHCCCCEEECC--CCCHHHHHHHHHHHHHCCCEEEEEEECCC-CHHHHHHHHHCCCCEEECC T ss_conf 9899999997099599806--98947999999999702990899983699-8789999840897588327 No 496 >cd02930 DCR_FMN 2,4-dienoyl-CoA reductase (DCR) FMN-binding domain. DCR in E. coli is an iron-sulfur flavoenzyme which contains FMN, FAD, and a 4Fe-4S cluster. It is also a monomer, unlike that of its eukaryotic counterparts which form homotetramers and lack the flavin and iron-sulfur cofactors. Metabolism of unsaturated fatty acids requires auxiliary enzymes in addition to those used in b-oxidation. After a given number of cycles through the b-oxidation pathway, those unsaturated fatty acyl-CoAs with double bonds at even-numbered carbon positions contain 2-trans, 4-cis double bonds that can not be modified by enoyl-CoA hydratase. DCR utilizes NADPH to remove the C4-C5 double bond. DCR can catalyze the reduction of both natural fatty acids with cis double bonds, as well as substrates containing trans double bonds. The reaction is initiated by hybrid transfer from NADPH to FAD, which in turn transfers electrons, one at a time, to FMN via the 4Fe-4S cluster. The fully reduced FMN provi Probab=93.95 E-value=0.21 Score=29.57 Aligned_cols=84 Identities=21% Similarity=0.254 Sum_probs=56.0 Q ss_pred CHHHHHHHHHHHCCCCEEEEECCC----------C-HHHHHHHHHHHHHHHCCCCCEEC-CCCCCHHHHHHHHHCC-CCE Q ss_conf 015678999873136669982484----------1-13889999999998579872451-4646999999998548-758 Q gi|254780889|r 232 KDIADRVGPLFDVNVDLVVVDTAH----------G-HSQKVLDAVVQIKKNFPSLLVMA-GNIATAEGALALIDAG-ADI 298 (493) Q Consensus 232 ~d~~eRa~~LveaGvDviviD~ah----------G-h~~~~~~~i~~~k~~~~~~~ii~-GNv~t~~~~~~l~~~G-ad~ 298 (493) +|..+-+..|.++|+|+|-+.... . ......+..+.+|+.+. +|||+ |.+.|++.++.+++.| ||. T Consensus 224 ~e~~~~~~~l~~~GvD~i~vs~G~~~~~~~~~~~~~p~g~~~~~a~~ir~~~~-~Pvi~~G~i~~p~~ae~~l~~g~aD~ 302 (353) T cd02930 224 EEVVALAKALEAAGADILNTGIGWHEARVPTIATSVPRGAFAWATAKLKRAVD-IPVIASNRINTPEVAERLLADGDADM 302 (353) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCC-CCEEECCCCCCHHHHHHHHHCCCCCH T ss_conf 99999999999819999996377444668753345772366999999887548-34896599798999999998799624 Q ss_pred EEEEECCCCCCCCCHHCCCCCCCCHHHHHHHHHH Q ss_conf 9961027876653100015743100378899986 Q gi|254780889|r 299 IKVGIGPGSICTTRVVTGVGCPQLSAIMSVVEVA 332 (493) Q Consensus 299 ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~~~ 332 (493) | ++|+|.+.-=.-+.++. T Consensus 303 V----------------~~gR~liadPdl~~K~~ 320 (353) T cd02930 303 V----------------SMARPFLADPDFVAKAA 320 (353) T ss_pred H----------------HHHHHHHHCCHHHHHHH T ss_conf 7----------------84099876936999998 No 497 >PRK13122 consensus Probab=93.92 E-value=0.55 Score=26.48 Aligned_cols=203 Identities=20% Similarity=0.188 Sum_probs=111.7 Q ss_pred CCHHHHHHHHHHHCCCCEEEECCCCEEEEEEEEHHHHCCCCCCCCCCCCC---CCCCCCCCCCCCCHHHHHHHHHHCCCC Q ss_conf 43034565667605041678668860278872354500143446724400---267863257303589999999865533 Q gi|254780889|r 109 ATLADALALMKKYSISGIPVVESDVGKLVGILTNRDVRFASNAQQAVGEL---MTRNLITVKKTVNLENAKALLHQHRIE 185 (493) Q Consensus 109 ~TI~ea~~lm~~~~is~iPVVD~~~gkLiGIVT~rDir~~~~~~~~V~ei---Mt~~litv~~~~~l~eA~~iM~~~ki~ 185 (493) -...+.++.+.+.|.+-+ + +| -.++.|+.|- +......+..+.++.++++++.+.+-+ T Consensus 13 pd~~~~~~~l~~~GaDii---E------lG----------iPfSDP~ADGpvIQ~A~~rAL~~G~~~~~~~~~l~~~r~~ 73 (242) T PRK13122 13 KDLIENATLLSENGADII---E------IG----------VPFSDPVADGPVIMEAGQQAIKQGITIDYIFNQLEKHGDQ 73 (242) T ss_pred CCHHHHHHHHHHCCCCEE---E------EC----------CCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCC T ss_conf 999999999997599999---9------78----------9888866658999999999997699899999999973136 Q ss_pred -EEEEECCCCCEEEEEEECCCHHCCCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCC-EEEEECCCCHHHHHHH Q ss_conf -06457479978688980210001017422326568765556651370156789998731366-6998248411388999 Q gi|254780889|r 186 -KLLVVDDDGCCIGLITVKDIERSQLNPNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVD-LVVVDTAHGHSQKVLD 263 (493) Q Consensus 186 -~LPVVDe~g~LvGIIT~~DIlk~~~~p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvD-viviD~ahGh~~~~~~ 263 (493) ..|+| +.|.-..+.+. | ...-++.+.++|+| +|+.|...-.+.. T Consensus 74 ~~~piv--------lM~Y~N~i~~~-------------------G----~~~F~~~~~~~GvdGvIipDLP~ee~~~--- 119 (242) T PRK13122 74 IKCNYV--------LMTYYNIICHY-------------------G----EQAFFEKCRDTGVYGLIIPDLPYELSQR--- 119 (242) T ss_pred CCCCEE--------EEEECHHHHHH-------------------C----HHHHHHHHHHCCCCEEECCCCCHHHHHH--- T ss_conf 798779--------99851698872-------------------7----9999999987699867778998788999--- Q ss_pred HHHHHHH-HCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCHHCCCCCCCCHHHHHHHH-HHHCCCCEEEE Q ss_conf 9999998-579872451464699999999854875899610278766531000157431003788999-86226981997 Q gi|254780889|r 264 AVVQIKK-NFPSLLVMAGNIATAEGALALIDAGADIIKVGIGPGSICTTRVVTGVGCPQLSAIMSVVE-VAERAGVAIVA 341 (493) Q Consensus 264 ~i~~~k~-~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvGig~Gs~CtTr~~~g~g~pq~~av~~~~~-~~~~~~~~iia 341 (493) ..+.+++ ...-+++++.+ .+.+-.+.+.+..-..|=+= +..-+||.-.+....+.+.-+ ..+..+.||.. T Consensus 120 ~~~~~~~~gi~~I~lvaPt-t~~~Ri~~i~~~s~GFiY~v-------s~~GvTG~~~~~~~~~~~~i~~ik~~t~~Pv~v 191 (242) T PRK13122 120 LKQQFSHYGVKIISLVAMT-TDDKRIKDIVSHAEGFIYTV-------TMNATTGQNGAFHPELKRKIESIKAIANVPVVA 191 (242) T ss_pred HHHHHHHCCCCEEEEECCC-CCHHHHHHHHHHCCCCEEEE-------ECCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEE T ss_conf 9999986798689871899-98999999998299966987-------335435765556588999999999725998587 Q ss_pred ECCCCCHHHHEEEEEECCCEEEECCHHHCCCC Q ss_conf 26543777621336714878993230101216 Q gi|254780889|r 342 DGGIRFSGDIAKAIAAGSACVMIGSLLAGTDE 373 (493) Q Consensus 342 dGGi~~~gdi~kAla~GA~~VM~G~~~agt~E 373 (493) -=||+++-|+.+ +..+||.|-+||.|-..-| T Consensus 192 GFGI~~~e~v~~-i~~~ADGvIVGSaivk~i~ 222 (242) T PRK13122 192 GFGIRTPQHVAD-IKEVADGIVIGSEIVKRFK 222 (242) T ss_pred ECCCCCHHHHHH-HHHHCCEEEECHHHHHHHH T ss_conf 158799999999-9811999998489999999 No 498 >COG2513 PrpB PEP phosphonomutase and related enzymes [Carbohydrate transport and metabolism] Probab=93.84 E-value=0.57 Score=26.37 Aligned_cols=179 Identities=16% Similarity=0.220 Sum_probs=92.1 Q ss_pred CCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHCCCEECCCEEEECCCC------HHHHHHHHHHHCCCCEEEECC Q ss_conf 68889999997798799978989899999999874223881528789946543------034565667605041678668 Q gi|254780889|r 58 TDSRLAIAMAQAGGLGVIHRNFSPSEQVAQVHQVKKFESGMVVNPVTISPYAT------LADALALMKKYSISGIPVVES 131 (493) Q Consensus 58 Te~~MAiamA~~GGiGvIHrn~sie~qa~~V~kVKr~e~gmI~dPVti~pd~T------I~ea~~lm~~~~is~iPVVD~ 131 (493) ++.-.|-+|++-- +|+ .+.++-+..++++.+.- +=|+.+.-|+- +.+.++.+.+.|..++-+-|. T Consensus 45 sg~~vA~slG~pD-~~~----~t~~e~~~~vrrI~~a~----~lPv~vD~dtGfG~~~nvartV~~~~~aG~agi~iEDq 115 (289) T COG2513 45 SGAGVAASLGLPD-LGI----TTLDEVLADARRITDAV----DLPVLVDIDTGFGEALNVARTVRELEQAGAAGIHIEDQ 115 (289) T ss_pred CCHHHHHHCCCCC-CCC----CCHHHHHHHHHHHHHHC----CCCEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEEEC T ss_conf 5488898617986-341----65999999999998652----78768865678873889999999999758626645322 Q ss_pred CCEEEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEECCCHHCCCC Q ss_conf 86027887235450014344672440026786325730358999999986553306457479978688980210001017 Q gi|254780889|r 132 DVGKLVGILTNRDVRFASNAQQAVGELMTRNLITVKKTVNLENAKALLHQHRIEKLLVVDDDGCCIGLITVKDIERSQLN 211 (493) Q Consensus 132 ~~gkLiGIVT~rDir~~~~~~~~V~eiMt~~litv~~~~~l~eA~~iM~~~ki~~LPVVDe~g~LvGIIT~~DIlk~~~~ 211 (493) -..|-.|.+..+. + .++++..+....-+--+ . +.-.=|+-|+|..-. T Consensus 116 ~~pk~cgh~~gk~------------------l------~~~~e~v~rIkAa~~a~-----~-~~~fvi~ARTda~~~--- 162 (289) T COG2513 116 VGPKRCGHLPGKE------------------L------VSIDEMVDRIKAAVEAR-----R-DPDFVIIARTDALLV--- 162 (289) T ss_pred CCCHHCCCCCCCC------------------C------CCHHHHHHHHHHHHHHC-----C-CCCEEEEEEHHHHHH--- T ss_conf 5502127888987------------------6------78999999999999715-----5-888599961487775--- Q ss_pred CCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCC-----HH Q ss_conf 4223265687655566513701567899987313666998248411388999999999857987245146469-----99 Q gi|254780889|r 212 PNATKDSKGRLRVAAAVSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIAT-----AE 286 (493) Q Consensus 212 p~A~~D~~grL~VgAAIg~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t-----~~ 286 (493) +--++.++|+.+-+|||+|.|+. +|-+. .+.++.+.+..+ +|+.+ |+.. .- T Consensus 163 -----------------~~ld~AI~Ra~AY~eAGAD~if~---~al~~--~e~i~~f~~av~-~pl~~-N~t~~g~tp~~ 218 (289) T COG2513 163 -----------------EGLDDAIERAQAYVEAGADAIFP---EALTD--LEEIRAFAEAVP-VPLPA-NITEFGKTPLL 218 (289) T ss_pred -----------------CCHHHHHHHHHHHHHCCCCEECC---CCCCC--HHHHHHHHHHCC-CCCEE-EEECCCCCCCC T ss_conf -----------------24799999999999759768713---66799--999999998468-77026-86324898876 Q ss_pred HHHHHHHCCCCEEEEE Q ss_conf 9999985487589961 Q gi|254780889|r 287 GALALIDAGADIIKVG 302 (493) Q Consensus 287 ~~~~l~~~Gad~ikvG 302 (493) ....|.++|...|-.| T Consensus 219 ~~~~L~~~Gv~~V~~~ 234 (289) T COG2513 219 TVAELAELGVKRVSYG 234 (289) T ss_pred CHHHHHHCCCEEEEEC T ss_conf 8999996696499978 No 499 >PTZ00300 pyruvate kinase; Provisional Probab=93.78 E-value=0.58 Score=26.31 Aligned_cols=125 Identities=15% Similarity=0.196 Sum_probs=75.7 Q ss_pred CCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHH-HHHHHCCCCCEECCCCCCHHHHHHHHH--CCCCEEEEEECCCC Q ss_conf 701567899987313666998248411388999999-999857987245146469999999985--48758996102787 Q gi|254780889|r 231 AKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVV-QIKKNFPSLLVMAGNIATAEGALALID--AGADIIKVGIGPGS 307 (493) Q Consensus 231 ~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~-~~k~~~~~~~ii~GNv~t~~~~~~l~~--~Gad~ikvGig~Gs 307 (493) .+.|++-.+-.++.++|++.+--.+- ..-+..++ .+.+.-+++.|| --+.++++.++|-+ .-+|+|-|-=|-=+ T Consensus 146 TekD~~di~fa~~~~vD~IalSFVr~--a~DV~~~r~~l~~~g~~~~Ii-aKIE~~~av~NldeIi~~sDgIMVARGDLg 222 (454) T PTZ00300 146 SAKDCADLQFGVEQGVDMIFASFIRS--AEQVGEVRKALGAKGGDIMII-CKIENHQGVQNIDSIIEESDGIMVARGDLG 222 (454) T ss_pred CHHHHHHHHHHHHCCCCEEEEECCCC--HHHHHHHHHHHHHHCCCCEEE-EEECCHHHHHHHHHHHHHCCEEEEECCCCC T ss_conf 98789999999974999899906798--899999999999738976699-863246666646989985879999457430 Q ss_pred CCCCCHHCCCCCCCCH--HHHHHHHHHHCCCCEEEEECCCC------------CHHHHEEEEEECCCEEEECC Q ss_conf 6653100015743100--37889998622698199726543------------77762133671487899323 Q gi|254780889|r 308 ICTTRVVTGVGCPQLS--AIMSVVEVAERAGVAIVADGGIR------------FSGDIAKAIAAGSACVMIGS 366 (493) Q Consensus 308 ~CtTr~~~g~g~pq~~--av~~~~~~~~~~~~~iiadGGi~------------~~gdi~kAla~GA~~VM~G~ 366 (493) +...-+ -+|.++ -|..| ...|.|+|----+- ...|++-|.--|||+|||.+ T Consensus 223 vEi~~e----~vp~~Qk~Ii~~c----~~~gkpvI~ATqmLeSMi~~p~PTRAEvsDVaNAV~DG~DavMLS~ 287 (454) T PTZ00300 223 VEIPAE----KVVVAQKILISKC----NVAGKPVICATQMLESMTYNPRPTRAEVSDVANAVFNGADCVMLSG 287 (454) T ss_pred CCCCHH----HHHHHHHHHHHHH----HHCCCEEEEECCHHHHHCCCCCCCHHHHHHHHHHHHHCCCEEEECC T ss_conf 317988----9789999999999----9839929997323776432999863889899999984778788745 No 500 >PRK06096 molybdenum transport protein ModD; Provisional Probab=93.78 E-value=0.55 Score=26.47 Aligned_cols=73 Identities=11% Similarity=0.086 Sum_probs=0.0 Q ss_pred EECCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHCCCCEEEEE Q ss_conf 513701567899987313666998248411388999999999857987245146469999999985487589961 Q gi|254780889|r 228 VSVAKDIADRVGPLFDVNVDLVVVDTAHGHSQKVLDAVVQIKKNFPSLLVMAGNIATAEGALALIDAGADIIKVG 302 (493) Q Consensus 228 Ig~~~d~~eRa~~LveaGvDviviD~ahGh~~~~~~~i~~~k~~~~~~~ii~GNv~t~~~~~~l~~~Gad~ikvG 302 (493) |-+--+..+-+...+++|+|+|.+| |--.+.+.+.++.++...|++.+-+-.=.|.+-...+++.|+|.+-+| T Consensus 192 IeVEv~tl~~~~ea~~~gaD~IlLD--nmsp~~l~~av~~~~~~~~~~~lEaSGGI~~~ni~~yA~tGVD~i~ts 264 (284) T PRK06096 192 IVVEADTPKEAIAALRAQPDVLQLD--KFSPQQATEIAQIAPSLAPHCTLALTGGINLTTLKNYLDCGIRLFITS 264 (284) T ss_pred EEEEECCHHHHHHHHHCCCCEEEEC--CCCHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHCCCCEEEEC T ss_conf 8998099999999985799999968--989999999999987217977999989999999999998099999828 Done!