Query gi|254780426|ref|YP_003064839.1| hemolysin protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 320 No_of_seqs 222 out of 3225 Neff 7.2 Searched_HMMs 39220 Date Sun May 29 16:08:47 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780426.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR03520 GldE gliding motilit 100.0 0 0 511.4 25.2 300 2-314 101-407 (408) 2 PRK11573 hypothetical protein; 100.0 0 0 503.5 24.8 303 3-319 94-408 (413) 3 COG1253 TlyC Hemolysins and re 100.0 0 0 476.3 23.6 302 4-319 113-426 (429) 4 COG4536 CorB Putative Mg2+ and 100.0 0 0 448.5 23.4 306 2-319 104-420 (423) 5 COG4535 CorC Putative Mg2+ and 100.0 0 0 421.1 21.8 262 41-317 22-284 (293) 6 cd04590 CBS_pair_CorC_HlyC_ass 99.9 4.3E-26 1.1E-30 190.8 8.5 111 97-216 1-111 (111) 7 KOG2118 consensus 99.8 6.9E-19 1.8E-23 143.6 6.8 231 14-252 125-367 (498) 8 pfam03471 CorC_HlyC Transporte 99.7 1.4E-16 3.7E-21 128.4 10.5 78 234-315 2-80 (80) 9 cd04596 CBS_pair_DRTGG_assoc T 99.7 7.3E-17 1.9E-21 130.3 8.4 106 97-216 2-108 (108) 10 cd04639 CBS_pair_26 The CBS do 99.7 5.5E-16 1.4E-20 124.6 10.3 109 97-215 1-110 (111) 11 cd04631 CBS_pair_18 The CBS do 99.7 9.5E-16 2.4E-20 123.1 10.3 119 98-216 2-125 (125) 12 cd04610 CBS_pair_ParBc_assoc T 99.6 4.8E-15 1.2E-19 118.5 7.5 105 97-216 2-107 (107) 13 cd04600 CBS_pair_HPP_assoc Thi 99.6 2.1E-14 5.4E-19 114.2 10.2 119 97-216 2-124 (124) 14 COG3448 CBS-domain-containing 99.6 8E-14 2E-18 110.5 12.8 154 51-220 220-374 (382) 15 cd04611 CBS_pair_PAS_GGDEF_DUF 99.6 2.4E-14 6.2E-19 113.8 9.7 109 97-215 1-110 (111) 16 cd04635 CBS_pair_22 The CBS do 99.6 3.9E-14 9.8E-19 112.5 10.1 117 98-215 2-121 (122) 17 cd04632 CBS_pair_19 The CBS do 99.6 3.5E-14 8.9E-19 112.8 9.7 119 97-216 1-128 (128) 18 cd04588 CBS_pair_CAP-ED_DUF294 99.5 3.1E-14 7.8E-19 113.2 9.2 108 98-216 2-110 (110) 19 cd04612 CBS_pair_SpoIVFB_EriC_ 99.5 2.3E-14 5.9E-19 114.0 8.5 109 98-216 2-111 (111) 20 cd04585 CBS_pair_ACT_assoc2 Th 99.5 4.6E-14 1.2E-18 112.0 9.8 117 98-216 2-122 (122) 21 cd04643 CBS_pair_30 The CBS do 99.5 7.2E-14 1.8E-18 110.7 10.0 115 97-216 1-116 (116) 22 cd04803 CBS_pair_15 The CBS do 99.5 9.2E-14 2.3E-18 110.0 10.3 116 99-215 3-121 (122) 23 cd04613 CBS_pair_SpoIVFB_EriC_ 99.5 6.1E-14 1.5E-18 111.2 8.9 112 97-216 1-114 (114) 24 cd04594 CBS_pair_EriC_assoc_ar 99.5 6.3E-14 1.6E-18 111.1 8.8 101 99-216 3-104 (104) 25 COG4109 Predicted transcriptio 99.5 9.2E-14 2.3E-18 110.1 9.3 131 74-218 170-306 (432) 26 cd04642 CBS_pair_29 The CBS do 99.5 9.7E-14 2.5E-18 109.9 9.3 118 97-215 1-125 (126) 27 TIGR01302 IMP_dehydrog inosine 99.5 1.2E-13 3.1E-18 109.3 8.7 177 51-254 44-244 (476) 28 cd04624 CBS_pair_11 The CBS do 99.5 6.5E-14 1.6E-18 111.0 7.2 110 98-216 2-112 (112) 29 cd04622 CBS_pair_9 The CBS dom 99.5 1.9E-13 4.8E-18 108.0 9.2 110 98-216 2-113 (113) 30 cd04637 CBS_pair_24 The CBS do 99.5 3.5E-13 9E-18 106.2 10.1 117 97-215 1-121 (122) 31 cd04627 CBS_pair_14 The CBS do 99.5 3.2E-13 8.1E-18 106.5 9.7 115 99-213 3-120 (123) 32 cd04586 CBS_pair_BON_assoc Thi 99.5 2.5E-13 6.4E-18 107.2 9.0 118 97-216 2-135 (135) 33 cd04633 CBS_pair_20 The CBS do 99.5 2.7E-13 6.9E-18 107.0 9.0 115 98-215 2-120 (121) 34 cd04601 CBS_pair_IMPDH This cd 99.5 1.7E-13 4.4E-18 108.3 8.0 106 97-215 2-109 (110) 35 cd04621 CBS_pair_8 The CBS dom 99.5 3.1E-13 7.9E-18 106.6 9.3 118 97-216 1-135 (135) 36 cd04604 CBS_pair_KpsF_GutQ_ass 99.5 4.2E-13 1.1E-17 105.7 9.3 112 97-216 2-114 (114) 37 cd04609 CBS_pair_PALP_assoc2 T 99.5 3.4E-13 8.6E-18 106.3 8.6 107 98-215 2-109 (110) 38 cd04584 CBS_pair_ACT_assoc Thi 99.4 7.6E-13 1.9E-17 104.0 9.6 116 98-215 2-120 (121) 39 cd04623 CBS_pair_10 The CBS do 99.4 6.2E-13 1.6E-17 104.6 9.0 111 98-216 2-113 (113) 40 cd04636 CBS_pair_23 The CBS do 99.4 1E-12 2.6E-17 103.2 10.0 117 97-215 1-131 (132) 41 cd04619 CBS_pair_6 The CBS dom 99.4 1E-12 2.6E-17 103.2 10.0 111 98-215 2-113 (114) 42 cd04634 CBS_pair_21 The CBS do 99.4 5.7E-13 1.4E-17 104.9 8.7 116 97-215 1-142 (143) 43 cd04629 CBS_pair_16 The CBS do 99.4 8.8E-13 2.2E-17 103.6 9.4 112 98-216 2-114 (114) 44 cd04630 CBS_pair_17 The CBS do 99.4 1.1E-12 2.8E-17 102.9 9.8 111 98-215 2-113 (114) 45 cd04638 CBS_pair_25 The CBS do 99.4 5.4E-13 1.4E-17 105.0 8.1 105 97-216 1-106 (106) 46 cd04605 CBS_pair_MET2_assoc Th 99.4 3.5E-13 8.9E-18 106.2 7.1 107 97-215 2-109 (110) 47 cd04626 CBS_pair_13 The CBS do 99.4 1.2E-12 3.2E-17 102.6 9.7 109 97-215 1-110 (111) 48 cd04641 CBS_pair_28 The CBS do 99.4 9.3E-13 2.4E-17 103.4 9.0 112 97-215 1-119 (120) 49 cd04615 CBS_pair_2 The CBS dom 99.4 1.9E-12 4.9E-17 101.4 10.6 109 99-215 3-112 (113) 50 cd04625 CBS_pair_12 The CBS do 99.4 1.6E-12 4E-17 102.0 10.0 111 97-216 1-112 (112) 51 COG2524 Predicted transcriptio 99.4 1.3E-12 3.3E-17 102.5 7.9 130 78-220 163-293 (294) 52 cd04593 CBS_pair_EriC_assoc_ba 99.4 2.2E-12 5.7E-17 100.9 8.7 110 99-216 3-115 (115) 53 cd04595 CBS_pair_DHH_polyA_Pol 99.4 1.5E-12 3.8E-17 102.1 7.6 107 97-215 2-109 (110) 54 cd04587 CBS_pair_CAP-ED_DUF294 99.4 1.3E-12 3.2E-17 102.6 7.2 112 97-216 1-113 (113) 55 cd04583 CBS_pair_ABC_OpuCA_ass 99.4 1.1E-12 2.9E-17 102.9 6.8 107 97-216 2-109 (109) 56 cd04599 CBS_pair_GGDEF_assoc2 99.4 1.9E-12 4.8E-17 101.5 7.8 103 98-216 2-105 (105) 57 cd04801 CBS_pair_M50_like This 99.4 3.8E-12 9.8E-17 99.4 9.1 108 98-215 2-113 (114) 58 cd04602 CBS_pair_IMPDH_2 This 99.4 2.6E-12 6.5E-17 100.6 7.7 107 97-215 2-113 (114) 59 cd04582 CBS_pair_ABC_OpuCA_ass 99.4 5.5E-12 1.4E-16 98.4 9.4 104 97-215 1-105 (106) 60 cd04620 CBS_pair_7 The CBS dom 99.4 6.7E-12 1.7E-16 97.8 9.8 110 98-215 2-114 (115) 61 cd04589 CBS_pair_CAP-ED_DUF294 99.3 1.4E-11 3.5E-16 95.8 10.3 108 98-215 2-110 (111) 62 cd04802 CBS_pair_3 The CBS dom 99.3 9.6E-12 2.4E-16 96.8 9.1 108 98-214 2-110 (112) 63 cd04607 CBS_pair_NTP_transfera 99.3 1.1E-11 2.7E-16 96.5 9.0 107 100-215 5-112 (113) 64 cd04800 CBS_pair_CAP-ED_DUF294 99.3 1.1E-11 2.8E-16 96.4 8.3 108 98-215 2-110 (111) 65 cd04606 CBS_pair_Mg_transporte 99.3 1.3E-11 3.2E-16 96.0 7.9 102 102-217 2-109 (109) 66 PRK01862 putative voltage-gate 99.3 4.8E-11 1.2E-15 92.3 9.8 124 84-217 453-579 (583) 67 cd04617 CBS_pair_4 The CBS dom 99.3 4.2E-11 1.1E-15 92.6 9.0 108 99-214 3-116 (118) 68 cd04603 CBS_pair_KefB_assoc Th 99.2 5.2E-11 1.3E-15 92.0 9.0 108 98-215 2-110 (111) 69 cd04614 CBS_pair_1 The CBS dom 99.2 2.4E-11 6.2E-16 94.2 6.6 94 98-215 2-95 (96) 70 PRK07107 inositol-5-monophosph 99.2 6.5E-11 1.7E-15 91.4 8.7 166 55-248 63-240 (497) 71 cd04591 CBS_pair_EriC_assoc_eu 99.2 1.6E-10 4.1E-15 88.8 9.2 100 97-215 2-104 (105) 72 COG2239 MgtE Mg/Co/Ni transpor 99.2 3.6E-10 9.1E-15 86.5 10.6 135 68-218 113-253 (451) 73 cd04598 CBS_pair_GGDEF_assoc T 99.1 3.5E-10 8.8E-15 86.6 8.6 112 98-215 2-118 (119) 74 PRK11543 gutQ D-arabinose 5-ph 99.1 1.2E-10 2.9E-15 89.7 5.8 121 87-216 197-318 (321) 75 cd04640 CBS_pair_27 The CBS do 99.1 3.8E-10 9.7E-15 86.3 8.2 109 99-214 3-124 (126) 76 cd02205 CBS_pair The CBS domai 99.1 4.7E-10 1.2E-14 85.7 8.4 110 98-215 2-112 (113) 77 PRK07807 inositol-5-monophosph 99.1 4.8E-10 1.2E-14 85.7 7.3 160 55-247 56-223 (479) 78 PTZ00314 inosine-5'-monophosph 99.0 2.4E-09 6E-14 81.1 8.9 163 56-247 62-234 (499) 79 cd04608 CBS_pair_PALP_assoc Th 99.0 1.1E-09 2.9E-14 83.3 6.8 113 97-217 2-124 (124) 80 PRK05567 inositol-5'-monophosp 99.0 9.4E-10 2.4E-14 83.8 6.2 125 72-216 77-203 (486) 81 PRK10892 D-arabinose 5-phospha 99.0 1.6E-09 4E-14 82.3 7.3 122 86-216 201-323 (326) 82 COG3620 Predicted transcriptio 98.9 5.5E-09 1.4E-13 78.8 8.2 141 66-219 39-185 (187) 83 KOG1764 consensus 98.8 1.5E-08 3.8E-13 75.9 6.1 136 86-222 225-363 (381) 84 COG0517 FOG: CBS domain [Gener 98.7 9.8E-08 2.5E-12 70.6 8.9 113 90-214 2-117 (117) 85 COG2905 Predicted signal-trans 98.7 6.6E-08 1.7E-12 71.7 7.9 124 83-217 145-269 (610) 86 PRK10070 glycine betaine trans 98.7 1.9E-07 5E-12 68.6 9.9 133 75-220 262-395 (400) 87 pfam00478 IMPDH IMP dehydrogen 98.6 1.6E-07 4.1E-12 69.1 6.4 149 72-248 72-220 (467) 88 KOG0474 consensus 98.5 1.6E-07 4.1E-12 69.1 5.4 135 83-219 578-749 (762) 89 TIGR01302 IMP_dehydrog inosine 98.4 3.9E-07 1E-11 66.6 5.3 188 85-299 150-394 (476) 90 cd04592 CBS_pair_EriC_assoc_eu 98.4 2.2E-06 5.6E-11 61.7 7.9 106 97-203 1-120 (133) 91 cd04617 CBS_pair_4 The CBS dom 98.2 7.6E-07 1.9E-11 64.7 3.2 50 170-219 2-51 (118) 92 cd04629 CBS_pair_16 The CBS do 98.2 6.5E-06 1.7E-10 58.6 7.6 85 169-255 1-86 (114) 93 cd04625 CBS_pair_12 The CBS do 98.2 2.3E-06 5.9E-11 61.6 5.2 67 74-144 41-111 (112) 94 TIGR00393 kpsF sugar isomerase 98.2 1.1E-06 2.8E-11 63.7 3.3 135 52-208 135-271 (272) 95 TIGR01137 cysta_beta cystathio 98.2 9.5E-06 2.4E-10 57.5 8.1 130 83-217 362-526 (527) 96 cd04636 CBS_pair_23 The CBS do 98.2 5.2E-06 1.3E-10 59.3 6.7 113 169-284 1-114 (132) 97 cd04593 CBS_pair_EriC_assoc_ba 98.2 8.8E-06 2.2E-10 57.7 7.5 49 170-218 2-50 (115) 98 cd04622 CBS_pair_9 The CBS dom 98.2 1.7E-06 4.4E-11 62.4 3.7 60 82-144 53-112 (113) 99 cd04587 CBS_pair_CAP-ED_DUF294 98.2 3.3E-06 8.4E-11 60.5 5.0 58 83-143 54-111 (113) 100 cd04619 CBS_pair_6 The CBS dom 98.1 2.3E-06 5.8E-11 61.6 4.1 69 73-144 41-113 (114) 101 cd04640 CBS_pair_27 The CBS do 98.1 2.9E-06 7.3E-11 60.9 4.5 47 170-216 2-48 (126) 102 cd04630 CBS_pair_17 The CBS do 98.1 3.7E-06 9.4E-11 60.2 5.1 67 74-144 43-113 (114) 103 cd04633 CBS_pair_20 The CBS do 98.1 4.2E-06 1.1E-10 59.9 5.2 49 169-218 1-49 (121) 104 cd04624 CBS_pair_11 The CBS do 98.1 1E-05 2.6E-10 57.4 7.2 83 169-255 1-83 (112) 105 cd04589 CBS_pair_CAP-ED_DUF294 98.1 3.4E-06 8.6E-11 60.5 4.7 55 86-144 56-110 (111) 106 cd04613 CBS_pair_SpoIVFB_EriC_ 98.1 1.5E-05 3.7E-10 56.3 7.6 83 169-255 1-84 (114) 107 cd04623 CBS_pair_10 The CBS do 98.1 1.5E-05 3.9E-10 56.1 7.8 51 169-219 1-51 (113) 108 cd04586 CBS_pair_BON_assoc Thi 98.1 3.7E-06 9.4E-11 60.2 4.2 51 169-219 2-52 (135) 109 cd04626 CBS_pair_13 The CBS do 98.1 3.1E-06 8E-11 60.7 3.8 56 84-143 54-109 (111) 110 TIGR01186 proV glycine betaine 98.1 3E-05 7.6E-10 54.3 8.7 120 88-220 243-364 (372) 111 cd04602 CBS_pair_IMPDH_2 This 98.1 4.7E-06 1.2E-10 59.5 4.5 61 83-144 53-113 (114) 112 cd04637 CBS_pair_24 The CBS do 98.1 5.1E-06 1.3E-10 59.3 4.6 48 170-218 2-49 (122) 113 cd04621 CBS_pair_8 The CBS dom 98.1 3.1E-06 8E-11 60.7 3.5 47 171-217 3-49 (135) 114 cd04600 CBS_pair_HPP_assoc Thi 98.1 2.3E-05 5.9E-10 55.0 7.9 52 169-220 2-53 (124) 115 cd04803 CBS_pair_15 The CBS do 98.0 2.7E-05 7E-10 54.5 8.0 53 169-221 1-53 (122) 116 cd04615 CBS_pair_2 The CBS dom 98.0 2.2E-05 5.6E-10 55.1 7.4 49 170-218 2-50 (113) 117 cd04641 CBS_pair_28 The CBS do 98.0 2.5E-05 6.3E-10 54.8 7.6 57 169-225 1-57 (120) 118 cd04594 CBS_pair_EriC_assoc_ar 98.0 8.6E-06 2.2E-10 57.8 4.8 56 85-144 48-103 (104) 119 cd04598 CBS_pair_GGDEF_assoc T 98.0 1.8E-05 4.5E-10 55.8 6.2 85 169-255 1-87 (119) 120 cd04643 CBS_pair_30 The CBS do 98.0 2.9E-05 7.5E-10 54.3 7.2 50 170-219 2-51 (116) 121 cd04627 CBS_pair_14 The CBS do 98.0 4.4E-05 1.1E-09 53.2 8.1 51 169-219 1-52 (123) 122 cd04584 CBS_pair_ACT_assoc Thi 98.0 3.3E-05 8.4E-10 54.0 7.2 49 169-217 1-49 (121) 123 KOG1764 consensus 97.9 0.00011 2.9E-09 50.4 9.6 185 70-255 133-328 (381) 124 cd04595 CBS_pair_DHH_polyA_Pol 97.9 1.5E-05 3.7E-10 56.3 5.0 49 169-218 2-50 (110) 125 cd04604 CBS_pair_KpsF_GutQ_ass 97.9 1.2E-05 3E-10 56.9 4.5 83 170-255 3-85 (114) 126 PRK07107 inositol-5-monophosph 97.9 1.2E-05 3.2E-10 56.8 4.5 63 86-149 160-222 (497) 127 KOG0475 consensus 97.9 4.2E-05 1.1E-09 53.3 7.2 130 86-216 542-693 (696) 128 cd04632 CBS_pair_19 The CBS do 97.9 8.1E-06 2.1E-10 58.0 3.6 50 170-219 2-51 (128) 129 cd04608 CBS_pair_PALP_assoc Th 97.9 2.3E-05 5.9E-10 55.0 5.9 86 168-256 1-86 (124) 130 cd04800 CBS_pair_CAP-ED_DUF294 97.9 3.6E-05 9.2E-10 53.7 6.9 83 169-255 1-83 (111) 131 pfam00571 CBS CBS domain. CBS 97.9 1.3E-05 3.3E-10 56.6 4.5 49 170-218 8-56 (57) 132 cd04638 CBS_pair_25 The CBS do 97.9 1.1E-05 2.9E-10 57.1 4.1 46 170-215 2-47 (106) 133 cd04597 CBS_pair_DRTGG_assoc2 97.9 1.7E-05 4.5E-10 55.8 5.0 58 158-215 54-112 (113) 134 cd04801 CBS_pair_M50_like This 97.9 1.4E-05 3.5E-10 56.5 4.2 84 169-255 1-85 (114) 135 cd04588 CBS_pair_CAP-ED_DUF294 97.9 5.3E-05 1.3E-09 52.7 7.2 81 169-255 1-81 (110) 136 cd04802 CBS_pair_3 The CBS dom 97.9 1.2E-05 2.9E-10 57.0 3.6 57 83-143 54-110 (112) 137 cd04618 CBS_pair_5 The CBS dom 97.9 7.5E-05 1.9E-09 51.7 7.7 94 98-214 2-96 (98) 138 smart00116 CBS Domain in cysta 97.9 2.4E-05 6.2E-10 54.9 4.9 47 171-217 2-48 (49) 139 cd04585 CBS_pair_ACT_assoc2 Th 97.9 1.7E-05 4.2E-10 56.0 4.1 50 169-219 1-50 (122) 140 cd04620 CBS_pair_7 The CBS dom 97.9 7.4E-05 1.9E-09 51.7 7.3 50 169-218 1-50 (115) 141 cd04635 CBS_pair_22 The CBS do 97.9 7.1E-05 1.8E-09 51.8 7.2 49 169-217 1-49 (122) 142 cd04612 CBS_pair_SpoIVFB_EriC_ 97.8 2.2E-05 5.6E-10 55.1 4.4 81 170-255 2-82 (111) 143 cd04611 CBS_pair_PAS_GGDEF_DUF 97.8 2.6E-05 6.6E-10 54.7 4.7 47 171-218 3-49 (111) 144 cd04631 CBS_pair_18 The CBS do 97.8 9.7E-05 2.5E-09 50.9 7.7 52 169-220 1-53 (125) 145 cd04583 CBS_pair_ABC_OpuCA_ass 97.8 6E-05 1.5E-09 52.3 6.5 50 168-217 1-50 (109) 146 cd04610 CBS_pair_ParBc_assoc T 97.8 1.3E-05 3.4E-10 56.6 3.1 45 169-214 2-46 (107) 147 cd04634 CBS_pair_21 The CBS do 97.8 2.2E-05 5.5E-10 55.2 4.2 49 169-218 1-49 (143) 148 cd04605 CBS_pair_MET2_assoc Th 97.8 3.6E-05 9.3E-10 53.7 4.9 50 169-218 2-51 (110) 149 cd04592 CBS_pair_EriC_assoc_eu 97.8 0.00013 3.2E-09 50.1 7.7 48 171-218 3-50 (133) 150 cd04596 CBS_pair_DRTGG_assoc T 97.8 2.1E-05 5.4E-10 55.2 3.5 47 169-215 2-48 (108) 151 cd04639 CBS_pair_26 The CBS do 97.8 0.00014 3.7E-09 49.8 7.6 50 170-219 2-51 (111) 152 cd04607 CBS_pair_NTP_transfera 97.8 0.00011 2.9E-09 50.5 7.0 82 170-255 3-84 (113) 153 cd04601 CBS_pair_IMPDH This cd 97.7 3.9E-05 9.9E-10 53.5 4.3 47 168-214 1-47 (110) 154 cd04609 CBS_pair_PALP_assoc2 T 97.7 0.00015 3.8E-09 49.7 7.1 82 170-256 2-84 (110) 155 KOG2550 consensus 97.7 6.6E-05 1.7E-09 52.0 5.2 125 72-215 98-225 (503) 156 PRK05326 potassium/proton anti 97.7 0.00082 2.1E-08 44.8 10.7 100 201-314 462-561 (563) 157 cd04599 CBS_pair_GGDEF_assoc2 97.7 3.5E-05 9E-10 53.8 3.4 55 85-143 49-103 (105) 158 cd04597 CBS_pair_DRTGG_assoc2 97.7 3.3E-05 8.3E-10 54.0 3.1 76 64-143 28-111 (113) 159 PRK10892 D-arabinose 5-phospha 97.7 6.2E-05 1.6E-09 52.2 4.4 55 164-218 204-261 (326) 160 cd04606 CBS_pair_Mg_transporte 97.7 5.7E-05 1.4E-09 52.4 4.2 58 84-144 50-107 (109) 161 cd04603 CBS_pair_KefB_assoc Th 97.6 3.9E-05 9.9E-10 53.5 2.6 47 170-216 2-48 (111) 162 pfam00571 CBS CBS domain. CBS 97.6 0.0001 2.6E-09 50.7 4.5 55 89-146 1-55 (57) 163 cd04582 CBS_pair_ABC_OpuCA_ass 97.5 0.00013 3.4E-09 50.0 4.8 48 170-217 2-49 (106) 164 smart00116 CBS Domain in cysta 97.5 0.00012 3.1E-09 50.2 4.6 48 98-146 1-48 (49) 165 cd04642 CBS_pair_29 The CBS do 97.5 0.00052 1.3E-08 46.2 7.5 49 170-218 2-50 (126) 166 PRK01862 putative voltage-gate 97.5 0.00062 1.6E-08 45.6 7.7 93 161-256 455-548 (583) 167 cd02205 CBS_pair The CBS domai 97.4 0.00075 1.9E-08 45.1 7.2 83 170-255 2-84 (113) 168 PRK11543 gutQ D-arabinose 5-ph 97.4 0.00017 4.4E-09 49.3 3.7 56 164-219 199-257 (321) 169 COG2905 Predicted signal-trans 97.3 0.0005 1.3E-08 46.2 5.3 71 73-147 197-270 (610) 170 cd04614 CBS_pair_1 The CBS dom 97.3 0.00024 6.1E-09 48.3 3.3 46 169-214 1-46 (96) 171 TIGR00400 mgtE magnesium trans 97.1 0.0043 1.1E-07 40.1 8.3 142 68-224 116-263 (460) 172 PRK05567 inositol-5'-monophosp 97.1 0.00037 9.3E-09 47.1 2.6 61 85-147 145-205 (486) 173 PRK07807 inositol-5-monophosph 97.0 0.0011 2.8E-08 44.0 4.6 58 85-145 146-203 (479) 174 pfam01595 DUF21 Domain of unkn 96.9 0.0018 4.7E-08 42.5 5.0 71 4-74 104-182 (182) 175 COG3448 CBS-domain-containing 96.9 0.001 2.6E-08 44.2 3.5 82 126-217 219-301 (382) 176 PTZ00314 inosine-5'-monophosph 96.9 0.00077 2E-08 45.0 2.8 60 84-146 155-214 (499) 177 TIGR03415 ABC_choXWV_ATP choli 96.8 0.0015 3.7E-08 43.2 4.1 52 166-219 330-381 (382) 178 cd04590 CBS_pair_CorC_HlyC_ass 96.7 0.0043 1.1E-07 40.1 5.8 81 170-256 2-83 (111) 179 COG4175 ProV ABC-type proline/ 96.7 0.0015 3.8E-08 43.1 3.3 124 74-220 261-385 (386) 180 cd04591 CBS_pair_EriC_assoc_eu 96.6 0.0021 5.5E-08 42.1 3.7 50 91-144 55-104 (105) 181 COG3263 NhaP-type Na+/H+ and K 96.5 0.0098 2.5E-07 37.8 6.5 74 237-314 494-567 (574) 182 COG4109 Predicted transcriptio 96.3 0.0036 9.2E-08 40.6 3.2 118 162-289 188-318 (432) 183 COG2524 Predicted transcriptio 96.2 0.0068 1.7E-07 38.8 4.2 92 160-257 170-262 (294) 184 COG0517 FOG: CBS domain [Gener 96.1 0.0097 2.5E-07 37.8 4.7 52 169-221 7-58 (117) 185 pfam00478 IMPDH IMP dehydrogen 96.1 0.008 2E-07 38.4 4.0 61 86-147 140-200 (467) 186 PRK10070 glycine betaine trans 95.7 0.057 1.4E-06 32.8 7.2 125 122-256 234-364 (400) 187 TIGR01303 IMP_DH_rel_1 IMP deh 95.6 0.022 5.6E-07 35.5 4.6 108 97-218 96-204 (476) 188 COG2239 MgtE Mg/Co/Ni transpor 94.2 0.058 1.5E-06 32.7 3.6 46 169-214 140-190 (451) 189 PRK11573 hypothetical protein; 94.2 0.23 5.8E-06 28.8 6.7 124 160-287 185-319 (413) 190 COG3620 Predicted transcriptio 94.0 0.19 5E-06 29.3 6.1 55 164-219 67-122 (187) 191 TIGR03520 GldE gliding motilit 94.0 0.3 7.7E-06 28.0 7.0 59 160-218 189-251 (408) 192 TIGR01137 cysta_beta cystathio 93.1 0.13 3.4E-06 30.4 4.0 90 168-257 373-488 (527) 193 KOG2550 consensus 91.7 0.15 3.9E-06 29.9 2.8 69 70-145 158-226 (503) 194 TIGR00393 kpsF sugar isomerase 91.4 0.13 3.3E-06 30.5 2.1 85 164-250 157-246 (272) 195 TIGR01186 proV glycine betaine 90.5 0.5 1.3E-05 26.6 4.5 146 98-260 183-337 (372) 196 cd04618 CBS_pair_5 The CBS dom 88.7 0.2 5.1E-06 29.2 1.4 46 173-218 5-51 (98) 197 KOG0476 consensus 85.8 1.3 3.4E-05 23.8 4.2 62 85-148 586-648 (931) 198 COG1125 OpuBA ABC-type proline 85.1 0.93 2.4E-05 24.8 3.1 41 176-216 268-308 (309) 199 cd04381 RhoGap_RalBP1 RhoGap_R 82.3 1.8 4.7E-05 22.9 3.7 47 238-284 40-87 (182) 200 cd04389 RhoGAP_KIAA1688 RhoGAP 82.3 2.1 5.3E-05 22.6 3.9 14 246-259 146-159 (187) 201 cd04402 RhoGAP_ARHGAP20 RhoGAP 80.6 2.1 5.4E-05 22.5 3.5 11 246-256 143-153 (192) 202 cd04406 RhoGAP_myosin_IXA RhoG 77.9 3.5 8.8E-05 21.1 3.9 48 237-284 34-83 (186) 203 cd04392 RhoGAP_ARHGAP19 RhoGAP 75.6 4.1 0.0001 20.6 3.7 39 70-113 5-43 (208) 204 PRK13225 phosphoglycolate phos 75.4 2.8 7.2E-05 21.7 2.9 102 38-145 83-184 (273) 205 cd04407 RhoGAP_myosin_IXB RhoG 73.6 4.6 0.00012 20.3 3.6 48 237-284 34-83 (186) 206 cd04408 RhoGAP_GMIP RhoGAP_GMI 72.5 4.9 0.00013 20.1 3.5 16 244-259 155-170 (200) 207 cd04385 RhoGAP_ARAP RhoGAP_ARA 72.4 5 0.00013 20.0 3.6 15 269-283 70-84 (184) 208 cd04393 RhoGAP_FAM13A1a RhoGAP 69.1 7.7 0.0002 18.8 3.9 11 246-256 150-160 (189) 209 cd04400 RhoGAP_fBEM3 RhoGAP_fB 68.8 7.4 0.00019 18.9 3.8 49 237-285 42-95 (190) 210 cd04403 RhoGAP_ARHGAP27_15_12_ 67.1 7.9 0.0002 18.7 3.6 16 268-283 70-85 (187) 211 cd04396 RhoGAP_fSAC7_BAG7 RhoG 66.7 8.5 0.00022 18.5 3.7 49 237-285 51-104 (225) 212 TIGR03279 cyano_FeS_chp putati 66.7 9.8 0.00025 18.1 4.6 79 191-278 348-431 (433) 213 COG2216 KdpB High-affinity K+ 66.7 4.5 0.00011 20.3 2.3 37 106-144 413-449 (681) 214 cd04374 RhoGAP_Graf RhoGAP_Gra 66.4 7.5 0.00019 18.9 3.4 19 267-285 85-103 (203) 215 cd04373 RhoGAP_p190 RhoGAP_p19 66.3 6.4 0.00016 19.3 3.0 16 268-283 68-83 (185) 216 cd04398 RhoGAP_fRGD1 RhoGAP_fR 66.1 7.8 0.0002 18.8 3.4 13 271-283 76-88 (192) 217 cd04378 RhoGAP_GMIP_PARG1 RhoG 65.9 8.1 0.00021 18.7 3.5 15 245-259 158-172 (203) 218 cd04387 RhoGAP_Bcr RhoGAP_Bcr: 65.8 8.1 0.00021 18.7 3.5 15 245-259 146-160 (196) 219 TIGR01497 kdpB K+-transporting 65.6 3.4 8.6E-05 21.2 1.5 38 105-144 411-448 (675) 220 cd04386 RhoGAP_nadrin RhoGAP_n 65.5 8.3 0.00021 18.6 3.5 15 245-259 150-164 (203) 221 cd04395 RhoGAP_ARHGAP21 RhoGAP 65.4 8.7 0.00022 18.4 3.6 11 246-256 150-160 (196) 222 KOG4269 consensus 64.6 7.1 0.00018 19.1 3.0 50 236-285 938-992 (1112) 223 TIGR02475 CobW cobalamin biosy 64.4 2.7 6.9E-05 21.8 0.8 27 178-207 19-45 (349) 224 cd04404 RhoGAP-p50rhoGAP RhoGA 64.2 9.6 0.00025 18.2 3.6 10 247-256 152-161 (195) 225 cd00159 RhoGAP RhoGAP: GTPase- 62.6 10 0.00026 18.0 3.5 11 246-256 129-139 (169) 226 pfam04459 DUF512 Protein of un 62.5 12 0.0003 17.6 4.2 69 191-261 127-199 (205) 227 cd04383 RhoGAP_srGAP RhoGAP_sr 61.0 11 0.00027 17.9 3.4 11 246-256 149-159 (188) 228 cd04376 RhoGAP_ARHGAP6 RhoGAP_ 60.6 12 0.0003 17.6 3.5 15 245-259 147-161 (206) 229 PRK06769 hypothetical protein; 58.5 7.4 0.00019 18.9 2.2 77 180-260 33-111 (175) 230 PRK04149 sat sulfate adenylylt 57.1 14 0.00037 17.0 3.8 24 189-214 95-118 (390) 231 cd04384 RhoGAP_CdGAP RhoGAP_Cd 56.6 14 0.00037 17.0 3.4 16 268-283 72-87 (195) 232 TIGR01818 ntrC nitrogen regula 56.3 6.8 0.00017 19.2 1.7 62 210-276 198-266 (471) 233 smart00324 RhoGAP GTPase-activ 54.1 16 0.00041 16.7 3.8 11 246-256 133-143 (174) 234 COG2015 Alkyl sulfatase and re 53.1 4.8 0.00012 20.2 0.5 126 167-300 138-277 (655) 235 TIGR01969 minD_arch cell divis 48.6 6.2 0.00016 19.4 0.5 50 174-223 140-197 (258) 236 PRK11230 glycolate oxidase sub 47.8 15 0.00037 17.0 2.3 70 51-125 62-146 (499) 237 cd04372 RhoGAP_chimaerin RhoGA 47.4 21 0.00052 16.0 3.1 12 245-256 147-158 (194) 238 pfam00620 RhoGAP RhoGAP domain 47.1 21 0.00053 16.0 3.5 16 268-283 53-68 (152) 239 KOG4270 consensus 44.8 22 0.00057 15.8 4.0 51 237-287 186-237 (577) 240 pfam11305 DUF3107 Protein of u 44.4 23 0.00058 15.7 3.1 50 49-108 17-66 (74) 241 pfam01268 FTHFS Formate--tetra 43.5 12 0.00029 17.7 1.2 39 56-94 139-179 (555) 242 PRK05800 cobU adenosylcobinami 40.8 20 0.00052 16.0 2.2 64 179-248 104-167 (170) 243 TIGR01193 bacteriocin_ABC ABC- 40.1 26 0.00065 15.4 2.6 31 55-85 345-375 (710) 244 COG0277 GlcD FAD/FMN-containin 39.2 20 0.0005 16.2 1.9 72 50-126 37-122 (459) 245 COG0523 Putative GTPases (G3E 38.7 12 0.00031 17.5 0.7 18 129-146 116-133 (323) 246 pfam05198 IF3_N Translation in 38.1 28 0.00072 15.1 2.7 24 193-216 13-36 (76) 247 cd00477 FTHFS Formyltetrahydro 35.6 24 0.00061 15.6 1.8 38 56-94 123-163 (524) 248 cd07178 terB_like_YebE telluri 35.5 31 0.00079 14.8 3.6 28 57-84 3-30 (95) 249 COG4360 APA2 ATP adenylyltrans 35.4 27 0.0007 15.2 2.1 35 182-216 96-130 (298) 250 PRK08942 D,D-heptose 1,7-bisph 35.2 27 0.00068 15.3 2.0 37 178-214 32-68 (181) 251 PRK13506 formate--tetrahydrofo 35.1 17 0.00044 16.5 1.0 19 76-94 175-193 (577) 252 pfam02283 CobU Cobinamide kina 34.4 32 0.00083 14.7 2.7 64 179-248 101-164 (166) 253 TIGR02194 GlrX_NrdH Glutaredox 32.7 29 0.00074 15.0 1.9 37 99-136 26-63 (72) 254 pfam04278 Tic22 Tic22-like fam 31.0 37 0.00094 14.4 3.5 91 97-218 137-232 (259) 255 PRK00028 infC translation init 29.5 39 0.00099 14.2 2.6 33 106-138 33-65 (175) 256 cd04409 RhoGAP_PARG1 RhoGAP_PA 29.4 39 0.001 14.2 3.4 15 245-259 166-180 (211) 257 pfam04391 DUF533 Protein of un 29.3 39 0.001 14.2 3.7 30 55-84 80-109 (187) 258 cd00544 CobU Adenosylcobinamid 28.3 41 0.001 14.1 2.5 64 179-248 104-167 (169) 259 TIGR02472 sucr_P_syn_N sucrose 26.2 12 0.00031 17.5 -1.0 59 50-128 164-222 (445) 260 pfam01565 FAD_binding_4 FAD bi 25.6 44 0.0011 13.8 1.8 69 237-307 62-135 (138) 261 pfam02743 Cache_1 Cache domain 25.6 40 0.001 14.1 1.6 25 180-204 41-65 (81) 262 PRK08053 sulfur carrier protei 24.8 47 0.0012 13.6 2.3 31 170-200 8-38 (66) 263 pfam09269 DUF1967 Domain of un 24.5 48 0.0012 13.6 5.1 56 232-298 2-67 (68) 264 cd00517 ATPS ATP-sulfurylase ( 24.5 48 0.0012 13.6 2.3 10 110-119 146-155 (356) 265 PRK11537 putative GTP-binding 24.4 31 0.00079 14.8 0.8 12 131-142 124-135 (317) 266 cd07311 terB_like_1 tellurium 24.4 48 0.0012 13.6 3.3 68 42-113 8-79 (150) 267 TIGR02886 spore_II_AA anti-sig 24.0 49 0.0012 13.6 2.0 39 67-105 15-55 (106) 268 PRK10329 glutaredoxin-like pro 22.3 53 0.0013 13.3 1.7 37 99-136 28-64 (81) 269 PRK06193 hypothetical protein; 21.8 54 0.0014 13.3 1.7 55 198-253 156-210 (211) 270 COG2979 Uncharacterized protei 21.5 54 0.0014 13.2 3.6 30 56-85 113-142 (225) 271 cd03112 CobW_like The function 21.3 40 0.001 14.1 0.9 25 178-204 15-39 (158) 272 cd02930 DCR_FMN 2,4-dienoyl-Co 21.3 55 0.0014 13.2 2.7 17 54-70 76-92 (353) 273 TIGR00421 ubiX_pad polyprenyl 21.1 56 0.0014 13.2 1.6 62 169-233 113-179 (181) 274 PRK10975 TDP-fucosamine acetyl 20.9 44 0.0011 13.9 1.0 45 169-213 110-158 (231) 275 KOG2916 consensus 20.8 31 0.0008 14.8 0.3 102 101-209 157-265 (304) 276 TIGR03595 Obg_CgtA_exten Obg f 20.8 56 0.0014 13.1 5.4 56 232-298 3-68 (69) 277 PRK05446 imidazole glycerol-ph 20.5 57 0.0015 13.1 1.7 35 108-143 35-69 (354) 278 pfam02492 cobW CobW/HypB/UreG, 20.4 43 0.0011 13.9 0.9 15 69-83 8-22 (174) No 1 >TIGR03520 GldE gliding motility-associated protein GldE. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldC is a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. GldE was discovered because of its adjacency to GldD in F. johnsonii. Overexpression of GldE partially supresses the effects of a GldB point mutant suggesting that GldB and GldE interact. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Not all Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility and in fact some do not appear to express the gliding phenotype. Probab=100.00 E-value=0 Score=511.35 Aligned_cols=300 Identities=22% Similarity=0.390 Sum_probs=271.2 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCHH---HHHHHHHHHHHH--HHHHHHHHCCCCCC-CCHHHHHHHHHHHHHCCCCCHHHH Q ss_conf 766799887655555544432035---687530268998--99998763378888-899999999997997689898999 Q gi|254780426|r 2 MSDFKINSSEHPKEKNEGIDLSPS---ISKTSVPAFWTR--TIRALCKTWKLMKL-SHHKGIRLPRIDASLEGIFSETEK 75 (320) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~p~---l~~i~~P~vw~~--~~~~~~~~~~~~~~-~~~eel~~l~~~~~~~G~i~~~E~ 75 (320) .+|..|++.+.....+-+....|. +.++++|++|.+ .++.+++.++.... .++||++.+++.++++|.++ +|+ T Consensus 101 fGEiiPK~iA~~~~e~ia~~~a~~l~~~~~l~~Pl~~~l~~~~~~l~~~~g~~~~~~s~eel~~~i~~~~~~g~~~-eE~ 179 (408) T TIGR03520 101 FGEILPKVYANRNNLKFAKFMAYPINILDKVFSPISLPLRAITNFIHKKFGKQKSNISVDQLSQALELTDEEDTTK-EEQ 179 (408) T ss_pred HHHHCCHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCH-HHH T ss_conf 4110106788769899999999999999999999999999999988651156789899999999998540348778-999 Q ss_pred HHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCC Q ss_conf 99998730477576413120012999863010367789798659817998418866422410299999876622356432 Q gi|254780426|r 76 EIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNN 155 (320) Q Consensus 76 ~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~ 155 (320) +|++|+|+|++++|+|+||||+++++++.+++++++++.+.+++|||||||+++.|||+|++|+||++.+... T Consensus 180 ~~i~~vl~l~~~~v~eiMtPr~~v~~l~~~~~~~e~~~~~~~~~~SR~PV~~~~~D~iiGiv~~kDll~~~~~------- 252 (408) T TIGR03520 180 KILQGIVSFGNTDTKQVMRPRLDIFALDIETSFSEIIPKIIENGYSRIPVYKETIDNITGVLYIKDLLPHLNK------- 252 (408) T ss_pred HHHHHHHCCCCCEEHHCCCCHHHEEEEECCCHHHHHHHHHHHCCCCCCEEECCCCCCEEEEEEHHHHHHHHHC------- T ss_conf 9999982759989034235457958840644389999999857987222665898761799779899878736------- Q ss_pred CCCCCCCCHHHHCCCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCC Q ss_conf 23334576788515657971788512323432013565116540776620466758899998742221000000001246 Q gi|254780426|r 156 INLNIQLSESNLIKNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAV 235 (320) Q Consensus 156 ~~~~~~~~l~~~~r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~ 235 (320) .+..+.+++|||+|||+++++.++|++||+++.|||+|+||||++.|+||+|||+|+|||||.||+|.++..+.++ T Consensus 253 ----~~~~~~~~~r~~~~Vpe~~~~~~lL~~~~~~~~~~AiVvDEyG~~~GiVTlEDilEeivGei~DE~d~~~~~~~~~ 328 (408) T TIGR03520 253 ----KNFDWQSLLREPYFVPENKKLDDLLRDFQEKKNHLAIVVDEYGGTSGLVTLEDIIEEIVGDISDEFDDEDLIYSKI 328 (408) T ss_pred ----CCCCHHHHHHHCCCCCCCCCHHHHHHHHHHHCCEEEEEECCCCCEEEEEEHHHHHHHHHCCCCCCCCCCCCCEEEC T ss_conf ----9998899863214899977889999999863874899998999878998889989999588868677665550873 Q ss_pred CCCEEEEECCCCHHHHHHHHCCCCCCCC-CCCCCCCHHHHHHHHHCCCCCCCCEEEEECCEEEEEEEECCCEEEEEEEEE Q ss_conf 7733783045359999998288888777-746743179999998423473798899758959999995188789999996 Q gi|254780426|r 236 SDNTFIVDARTDLEELAKIIGTDCNCLK-GEQDVDSLGGLIFSVLDRIPARGEVVLEIPGFEIVILDADVRCVRRVRIRR 314 (320) Q Consensus 236 ~~g~~~v~G~~~l~~l~~~l~~~l~~~~-~~~~~~Ti~G~i~~~l~~iP~~Ge~v~~~~g~~f~V~~~~~~rI~~v~v~~ 314 (320) ++|+|+++|+++++|+++.++++++..+ .+++|+|+|||+++++|++|++||++ .|+||+|+|++++++||.+|+|+. T Consensus 329 ~~~~~~v~G~~~l~dl~~~l~~~~~~~~~~~~~~~Tl~G~il~~l~~iP~~ge~v-~~~~~~f~V~~~~~~rI~~V~vt~ 407 (408) T TIGR03520 329 DDNNYVFEGKTSLKDFYKILKLEEDMFDEVKGEAETLAGFLLEISGGFPKKGEKI-TFENFEFTIEAMDKKRIKQVKVTI 407 (408) T ss_pred CCCEEEEEECCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEE-EECCEEEEEEEEECCEEEEEEEEE T ss_conf 7986999802699999998589988765657897708999999858689999999-999999999984078677999997 No 2 >PRK11573 hypothetical protein; Provisional Probab=100.00 E-value=0 Score=503.47 Aligned_cols=303 Identities=20% Similarity=0.348 Sum_probs=265.5 Q ss_pred CCCCCCCCCCCCCCCCCCCCCH---HHHHHHHHHHHHH--HHHHHHHHCCCCCC------CCHHHHHHHHHHHHHCCCCC Q ss_conf 6679988765555554443203---5687530268998--99998763378888------89999999999799768989 Q gi|254780426|r 3 SDFKINSSEHPKEKNEGIDLSP---SISKTSVPAFWTR--TIRALCKTWKLMKL------SHHKGIRLPRIDASLEGIFS 71 (320) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~p---~l~~i~~P~vw~~--~~~~~~~~~~~~~~------~~~eel~~l~~~~~~~G~i~ 71 (320) +|..|++.+..++.+-+....+ .+.++++|++|.. .++.+++++|..+. .++||++.++.++ +|.++ T Consensus 94 gEiiPK~la~~~~~~~a~~~a~~l~~~~~~~~Plv~~~~~~s~~l~rl~g~~~~~~~~~~~t~eEl~~lv~~~--~~~l~ 171 (413) T PRK11573 94 AEVLPKTIAALYPEKVAYPSSFLLAPLQILMMPLVWLLNTITRLLMRLMGIKTDIVVSGSLSKEELRTIVHES--RSQIS 171 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHC--CCCCC T ss_conf 9784899999789999999999999999999889999999999777532667775445536899999766410--26678 Q ss_pred HHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCC Q ss_conf 89999999873047757641312001299986301036778979865981799841886642241029999987662235 Q gi|254780426|r 72 ETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYA 151 (320) Q Consensus 72 ~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~ 151 (320) ++|++|++|+|+|++++|+||||||+++++++.+++++++++.+.+++|||||||+++.||++|++|+||++..+..... T Consensus 172 ~~e~~mi~~il~l~~~~v~eiMtPR~~i~~l~~~~~~~e~~~~~~~~~~SR~PVy~~~~D~iiGiv~~kdl~~~~~~~~~ 251 (413) T PRK11573 172 RRNQDMLLSVLDLEKVTVDDIMVPRNEIIGIDINDDWKSILRQLTHSPHGRIVLYRDSLDDAISMLRVREAWRLMTEKKE 251 (413) T ss_pred HHHHHHHHHHHCCCCCEEHEEEEECCCEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEEHHHHHHHHCCCC T ss_conf 79999999872518989011862002268974789999999986633875425854788865999982577877642666 Q ss_pred CCCCCCCCCCCCHHHHCCCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCH-HH Q ss_conf 64322333457678851565797178851232343201356511654077662046675889999874222100000-00 Q gi|254780426|r 152 KTNNINLNIQLSESNLIKNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSK-KS 230 (320) Q Consensus 152 ~~~~~~~~~~~~l~~~~r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~-~~ 230 (320) .....+...+++++|||+++++.++|..||+++.|||+|+||||++.|+||+|||+|+|||||.||+|.+ .. T Consensus 252 -------~~~~~l~~~~~~~~fVpe~~~~~~lL~~~~~~~~h~AiVvDEyGg~~GiVTlEDilEeivGei~de~d~~~~~ 324 (413) T PRK11573 252 -------FTKETMLRAADEIYFVPEGTPLSTQLVKFQRNKKKVGLVVNEYGDIQGLVTVEDILEEIVGDFTTSMSPTLAE 324 (413) T ss_pred -------CCHHHHHHHCCCCEEECCCCHHHHHHHHHHHHCCEEEEEECCCCCEEEEEEHHHHHHHHHCCCCCCCCCCCCC T ss_conf -------8878887514787796898909999999884286379999489988899549999999838775556764330 Q ss_pred CCCCCCCCEEEEECCCCHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEECCEEEEEEEECCCEEEEE Q ss_conf 01246773378304535999999828888877774674317999999842347379889975895999999518878999 Q gi|254780426|r 231 MISAVSDNTFIVDARTDLEELAKIIGTDCNCLKGEQDVDSLGGLIFSVLDRIPARGEVVLEIPGFEIVILDADVRCVRRV 310 (320) Q Consensus 231 ~i~~~~~g~~~v~G~~~l~~l~~~l~~~l~~~~~~~~~~Ti~G~i~~~l~~iP~~Ge~v~~~~g~~f~V~~~~~~rI~~v 310 (320) .+.+.++|+|+++|.++++|+++.||+++| +++|+|+|||+++++|++|++|+++ .++||+|+|++++++||.+| T Consensus 325 ~~~~~~dg~~~v~G~~~l~dl~~~l~~~lp----~~~~~Tl~G~il~~lg~iP~~Ge~v-~~~~~~f~V~~~~~~rI~~v 399 (413) T PRK11573 325 EVTPQNDGSVIIDGTANVREINKAFNWHLP----EDDARTVNGVILEALEEIPVAGTRV-RIGEYDIDILDVQDNMIKQV 399 (413) T ss_pred CEEECCCCEEEEECCCCHHHHHHHHCCCCC----CCCCCCHHHHHHHHHCCCCCCCCEE-EECCEEEEEEEEECCEEEEE T ss_conf 218878973999767799999998598999----9998628999999858689899999-99999999998307847699 Q ss_pred EEEECCCCC Q ss_conf 999634347 Q gi|254780426|r 311 RIRRLVHFK 319 (320) Q Consensus 311 ~v~~l~~~~ 319 (320) +|+|+...+ T Consensus 400 ~i~~~~p~~ 408 (413) T PRK11573 400 KVTPVKPLR 408 (413) T ss_pred EEEECCCCC T ss_conf 999789997 No 3 >COG1253 TlyC Hemolysins and related proteins containing CBS domains [General function prediction only] Probab=100.00 E-value=0 Score=476.27 Aligned_cols=302 Identities=32% Similarity=0.538 Sum_probs=261.9 Q ss_pred CCCCCCCCCCCCCCCCCCCCH---HHHHHHHHHHHH--HHHHHHHHHCCCCCC------CCHHHHHHHHHHHHHCCCCCH Q ss_conf 679988765555554443203---568753026899--899998763378888------899999999997997689898 Q gi|254780426|r 4 DFKINSSEHPKEKNEGIDLSP---SISKTSVPAFWT--RTIRALCKTWKLMKL------SHHKGIRLPRIDASLEGIFSE 72 (320) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~p---~l~~i~~P~vw~--~~~~~~~~~~~~~~~------~~~eel~~l~~~~~~~G~i~~ 72 (320) +..|++.+.......+....| .+.++++|++|. +++++++++++.... .+.++++ ++.+|.++|.+++ T Consensus 113 El~PK~~a~~~~e~va~~~a~~~~~~~~l~~P~i~~~~~~a~~il~l~~~~~~~~~~~~~~~~~~~-~~~~~~~~g~~~~ 191 (429) T COG1253 113 ELVPKSIAIRNPEKVALLIAPPLRFFYRLLYPLIWLLNRIANAILRLFGVEPVEEEALTSTEEELE-LVSESAEEGVLEE 191 (429) T ss_pred HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHH-HHHCCCCCCCCCH T ss_conf 151799998677999998889999999998589999999999999861888666432331167888-8640012686288 Q ss_pred HHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCC Q ss_conf 99999998730477576413120012999863010367789798659817998418866422410299999876622356 Q gi|254780426|r 73 TEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAK 152 (320) Q Consensus 73 ~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~ 152 (320) +|+.|++|+|+|++++|+++||||+++++++.+.+.+++.+.+.+++|||+|||+++.||++|++|+||++.+...... T Consensus 192 ~E~~mi~~v~~l~~~~v~eiMtPR~~i~~l~~~~~~~~~~~~~~~~~~SR~PV~~~~~D~iiGiv~~Kdll~~~~~~~~- 270 (429) T COG1253 192 EEREMINNVLDLDDRTVREIMTPRTDIVALDLTDTVEELIELILESGHSRIPVYDGDLDNIIGIVHVKDLLRALLDGQS- 270 (429) T ss_pred HHHHHHHHHHCCCCCCCCEEEEECCCEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEHHHHHHHHHCCCC- T ss_conf 9999999650213756261760315579941899999999999758986587885799837888689999976431564- Q ss_pred CCCCCCCCCCCHHHHCCCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCC-HHHC Q ss_conf 432233345767885156579717885123234320135651165407766204667588999987422210000-0000 Q gi|254780426|r 153 TNNINLNIQLSESNLIKNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHS-KKSM 231 (320) Q Consensus 153 ~~~~~~~~~~~l~~~~r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~-~~~~ 231 (320) .......++||+||||++++.++|++||+++.|||+|+||||++.|+||+|||+|+|+|||.||+|. +++. T Consensus 271 --------~~~~~~~~~~~~~Vpet~~~~~lL~~~r~~~~hmAiVvDEyG~~~GlVTleDIiEeIvGei~de~d~~~~~~ 342 (429) T COG1253 271 --------DLDLRVLVRPPLFVPETLSLSDLLEEFREERTHMAIVVDEYGGVEGLVTLEDIIEEIVGEIPDEHDEDEEED 342 (429) T ss_pred --------CCCCCCCCCCCEEECCCCCHHHHHHHHHHHCCEEEEEEECCCCEEEECCHHHHHHHHHCCCCCCCCCCCCCC T ss_conf --------444200257871778989799999998863874999993789843326589999998487754345545642 Q ss_pred CCCCCCCEEEEECCCCHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEECCEEEEEEEECCCEEEEEE Q ss_conf 12467733783045359999998288888777746743179999998423473798899758959999995188789999 Q gi|254780426|r 232 ISAVSDNTFIVDARTDLEELAKIIGTDCNCLKGEQDVDSLGGLIFSVLDRIPARGEVVLEIPGFEIVILDADVRCVRRVR 311 (320) Q Consensus 232 i~~~~~g~~~v~G~~~l~~l~~~l~~~l~~~~~~~~~~Ti~G~i~~~l~~iP~~Ge~v~~~~g~~f~V~~~~~~rI~~v~ 311 (320) +.++.+++|+++|.++++++++.+|+++ +.+++|+|+|||+++++|++|++|+.+ .+++|+|+|++++++||++|+ T Consensus 343 ~~~~~~~~~~v~G~~~l~e~~~~l~~~~---~~~~~~~Ti~G~v~~~lg~iP~~g~~~-~~~~~~~~V~~~~~~rI~~v~ 418 (429) T COG1253 343 IIQRDDDGWLVDGRVPLEELEELLGIDL---DEEEDYDTIAGLVLSLLGRIPKVGESV-VIGGLRFEVLDMDGRRIDKVL 418 (429) T ss_pred CEEECCCCEEEECCCCHHHHHHHHCCCC---CCCCCCCCHHHHHHHHCCCCCCCCCEE-EECCCEEEEEECCCCEEEEEE T ss_conf 2686389679957315999998729887---546786449999999819989888879-974860899984476589999 Q ss_pred EEECCCCC Q ss_conf 99634347 Q gi|254780426|r 312 IRRLVHFK 319 (320) Q Consensus 312 v~~l~~~~ 319 (320) |+|..... T Consensus 419 i~~~~~~~ 426 (429) T COG1253 419 IVRLKDKP 426 (429) T ss_pred EEECCCCC T ss_conf 99768888 No 4 >COG4536 CorB Putative Mg2+ and Co2+ transporter CorB [Inorganic ion transport and metabolism] Probab=100.00 E-value=0 Score=448.53 Aligned_cols=306 Identities=23% Similarity=0.397 Sum_probs=279.1 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCH---HHHHHHHHHHHHHH--HHHHHHHCCCCCC------CCHHHHHHHHHHHHHCCCC Q ss_conf 76679988765555554443203---56875302689989--9998763378888------8999999999979976898 Q gi|254780426|r 2 MSDFKINSSEHPKEKNEGIDLSP---SISKTSVPAFWTRT--IRALCKTWKLMKL------SHHKGIRLPRIDASLEGIF 70 (320) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~p---~l~~i~~P~vw~~~--~~~~~~~~~~~~~------~~~eel~~l~~~~~~~G~i 70 (320) -++.-|++.++++...-+-..++ .+.+++.|++|+.+ ++.+++++|.+.. .+.||++.++++++.+|.. T Consensus 104 FaEVlPKt~Aa~~perva~~~s~~l~~l~~l~~Plv~lln~it~~llrl~gi~~~~~~~~~~s~EElR~~v~~~~~e~~~ 183 (423) T COG4536 104 FAEVLPKTIAALYPERVALPSSFILAILVRLFGPLVWLLNAITRRLLRLLGINLDQAVSQLSSKEELRTAVNESGSEGSV 183 (423) T ss_pred HHHHCCHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHCCCCC T ss_conf 99762017765283654413457899999999999999999999999995768554664336789998999886133562 Q ss_pred CHHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCC Q ss_conf 98999999987304775764131200129998630103677897986598179984188664224102999998766223 Q gi|254780426|r 71 SETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMY 150 (320) Q Consensus 71 ~~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~ 150 (320) .+.+++|+-++|+|.+.+|.|||+||++|.++|.+++.+++.+.+..+.|+|+|||+++.|||+|++|+||+++++.+.. T Consensus 184 ~~~~rdmL~gvLDLe~~tV~DIMvpR~~i~~id~d~~~e~iv~ql~~s~HtRiplyr~~~DnIiGvlh~r~llr~l~e~~ 263 (423) T COG4536 184 NKIDRDMLLGVLDLENLTVSDIMVPRNEIIGIDIDDPWEEIVRQLLHSPHTRIPLYRDDLDNIIGVLHVRDLLRLLNEKN 263 (423) T ss_pred CCCHHHHHHCCCCCCCCEEEEEECCCCCEEEECCCCCHHHHHHHHHHCCCCCCEEECCCHHHHHHHHHHHHHHHHHHCCC T ss_conf 31037877431153135544002132221553489989999999962887740014487467533431999999963458 Q ss_pred CCCCCCCCCCCCCHHHHCCCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHH Q ss_conf 56432233345767885156579717885123234320135651165407766204667588999987422210000000 Q gi|254780426|r 151 AKTNNINLNIQLSESNLIKNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKS 230 (320) Q Consensus 151 ~~~~~~~~~~~~~l~~~~r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~ 230 (320) . .....+...+++|+|||+++++.+.|..||+++.|+|+||||||.+.|+||+|||+|||||++.||+|...+ T Consensus 264 ~-------~~k~d~~~~a~epyFVPe~Tpl~~QL~~F~~~k~hialVVDEYG~i~GLVTLEDIlEEIVGdftde~d~~~~ 336 (423) T COG4536 264 E-------FTKEDILRAADEPYFVPEGTPLSDQLVAFQRNKKHIALVVDEYGDIQGLVTLEDILEEIVGDFTDEHDTLAK 336 (423) T ss_pred C-------CCHHHHHHHHCCCEECCCCCCHHHHHHHHHHHCCEEEEEEECCCCEEEEEEHHHHHHHHHCCCCCCCCCCCC T ss_conf 6-------407579987448745689992899999999725127999824676776646999999985433233664331 Q ss_pred CCCCCCCCEEEEECCCCHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEECCEEEEEEEECCCEEEEE Q ss_conf 01246773378304535999999828888877774674317999999842347379889975895999999518878999 Q gi|254780426|r 231 MISAVSDNTFIVDARTDLEELAKIIGTDCNCLKGEQDVDSLGGLIFSVLDRIPARGEVVLEIPGFEIVILDADVRCVRRV 310 (320) Q Consensus 231 ~i~~~~~g~~~v~G~~~l~~l~~~l~~~l~~~~~~~~~~Ti~G~i~~~l~~iP~~Ge~v~~~~g~~f~V~~~~~~rI~~v 310 (320) .+.+.+||++++||+++++|+|+.|+|++|+ ++..|++|+++++++.||+.|..+ .+.||+|+|++++.|||++| T Consensus 337 ev~~q~dgs~iidGs~~iRdlNr~l~W~Lp~----e~a~TinGLvle~le~IP~~~~~~-~i~~~~~~Il~~~~N~I~~v 411 (423) T COG4536 337 EVIPQSDGSFIIDGSANVRDLNRALDWNLPD----EDARTINGLVLEELEEIPEEGQAV-RIGGYRIDILKVQENRIKQV 411 (423) T ss_pred CCCCCCCCCEEEECCCCHHHHHHHCCCCCCC----CCCHHHHHHHHHHHHHCCCCCEEE-EECCEEEEEEEHHCCCEEEE T ss_conf 0023568748982887589988861988999----864347789999998589987189-99377999987210613168 Q ss_pred EEEECCCCC Q ss_conf 999634347 Q gi|254780426|r 311 RIRRLVHFK 319 (320) Q Consensus 311 ~v~~l~~~~ 319 (320) +|.++...+ T Consensus 412 ri~~~~~~~ 420 (423) T COG4536 412 KITPLKALE 420 (423) T ss_pred EEEECCCCC T ss_conf 877346557 No 5 >COG4535 CorC Putative Mg2+ and Co2+ transporter CorC [Inorganic ion transport and metabolism] Probab=100.00 E-value=0 Score=421.09 Aligned_cols=262 Identities=25% Similarity=0.455 Sum_probs=244.2 Q ss_pred HHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCC Q ss_conf 98763378888899999999997997689898999999987304775764131200129998630103677897986598 Q gi|254780426|r 41 ALCKTWKLMKLSHHKGIRLPRIDASLEGIFSETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGR 120 (320) Q Consensus 41 ~~~~~~~~~~~~~~eel~~l~~~~~~~G~i~~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~ 120 (320) ++.++|...+ .+++|+-.++..++..+.|+++...|+.+++++++++|+|||+||++++++..+.++++.+..+.++.| T Consensus 22 L~~~~f~gEp-knr~eLl~liRdse~n~LiD~dt~~mlEGvm~iadl~vrDiMIPRSQM~~l~~~~~l~~~l~~iiesaH 100 (293) T COG4535 22 LLSQLFHGEP-KNREELLELIRDSEQNELIDADTLDMLEGVMDIADLRVRDIMIPRSQMITLKRNQTLDECLDVIIESAH 100 (293) T ss_pred HHHHHHCCCC-CCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHEECCCCCCHHHHHHHHHHHCC T ss_conf 9999836898-678999999987645346275689999888999875476611227782221013799999999998511 Q ss_pred CEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCCCCEEECCCCCHHHHHHHHHCCCCCCEEEEEC Q ss_conf 17998418866422410299999876622356432233345767885156579717885123234320135651165407 Q gi|254780426|r 121 SWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLIKNILFVPSSMLVSDLLTNIQESRIRMALVIDE 200 (320) Q Consensus 121 sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE 200 (320) |||||+.+++|+|.|++++||||+++... ...+.+++++||+.+|||++.+..+|+.||.+|.|||+|||| T Consensus 101 SRfPVi~edkD~v~GIL~AKDLL~~~~~~---------~~~F~i~~lLRPav~VPESKrvd~lLkeFR~~RnHMAIViDE 171 (293) T COG4535 101 SRFPVISEDKDHVEGILLAKDLLPFMRSD---------AEPFDIKELLRPAVVVPESKRVDRLLKEFRSQRNHMAIVIDE 171 (293) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHCCC---------CCCCCHHHHCCCCEECCCCHHHHHHHHHHHHHCCCEEEEEEC T ss_conf 55871147700134454588788774078---------654638886155243466126999999998504726999822 Q ss_pred CCCCEEEEEHHHHHHHHHHHHHHHCCCH-HHCCCCCCCCEEEEECCCCHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 7662046675889999874222100000-000124677337830453599999982888887777467431799999984 Q gi|254780426|r 201 HGGTDGLVSYEDIVSVLMRDITSEHHSK-KSMISAVSDNTFIVDARTDLEELAKIIGTDCNCLKGEQDVDSLGGLIFSVL 279 (320) Q Consensus 201 ~G~~~GiVTleDIle~ivGei~de~d~~-~~~i~~~~~g~~~v~G~~~l~~l~~~l~~~l~~~~~~~~~~Ti~G~i~~~l 279 (320) ||++.|+||.|||||.|||+|.||||.+ ...|.++++++|.|.|.++|++||+.||.+++ +++|+|+||+++..+ T Consensus 172 fGgVsGLVTIEDiLEqIVGdIEDE~Deee~~dI~~ls~~~~~vrALT~IedFN~~F~t~Fs----DeevDTIGGLVm~af 247 (293) T COG4535 172 FGGVSGLVTIEDILEQIVGDIEDEYDEEEDADIRQLSRHTWRVRALTEIEDFNEAFGTHFS----DEEVDTIGGLVMQAF 247 (293) T ss_pred CCCEEEEEEHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHCCCCC----HHHHHHHHHHHHHHH T ss_conf 5772016779999999854331013505544467634772699844608998998668887----222121667999983 Q ss_pred CCCCCCCCEEEEECCEEEEEEEECCCEEEEEEEEECCC Q ss_conf 23473798899758959999995188789999996343 Q gi|254780426|r 280 DRIPARGEVVLEIPGFEIVILDADVRCVRRVRIRRLVH 317 (320) Q Consensus 280 ~~iP~~Ge~v~~~~g~~f~V~~~~~~rI~~v~v~~l~~ 317 (320) ||+|+.||++ .++||.|+|..+|.|||-+++|+.+.. T Consensus 248 GhLP~RGE~i-~i~~~~FkV~~ADsRrv~~l~vt~~~~ 284 (293) T COG4535 248 GHLPARGEKI-DIGGYQFKVARADSRRVIQLHVTVPDD 284 (293) T ss_pred CCCCCCCCEE-EECCEEEEEEECCCCEEEEEEEECCCC T ss_conf 6687778778-874637898604652279999765887 No 6 >cd04590 CBS_pair_CorC_HlyC_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the CorC_HlyC domain. CorC_HlyC is a transporter associated domain. This small domain is found in Na+/H+ antiporters, in proteins involved in magnesium and cobalt efflux, and in association with some proteins of unknown function. The function of the CorC_HlyC domain is uncertain but it might be involved in modulating transport of ion substrates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, Probab=99.93 E-value=4.3e-26 Score=190.83 Aligned_cols=111 Identities=37% Similarity=0.601 Sum_probs=102.5 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCCCCEEECC Q ss_conf 12999863010367789798659817998418866422410299999876622356432233345767885156579717 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLIKNILFVPS 176 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~Vpe 176 (320) +++++++.++|++++++.+.+++|||||||+++.|+++|+++.||++.+..... ....+..+++|++++|+ T Consensus 1 ~~v~~i~~~~s~~e~~~~~~~~~~sr~PV~~~~~~~ivGil~~kDll~~~~~~~---------~~~~~~~~~~~~~~v~~ 71 (111) T cd04590 1 TDIVALDADDTLEEILELIAESGHSRFPVYDGDLDNIIGVVHVKDLLRALAEGE---------EDLDLRDLLRPPLFVPE 71 (111) T ss_pred CCEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEEHHHHHHHHCCC---------CCCCHHHHCCCCEECCH T ss_conf 968999699999999999997799779999547887789999114456665278---------76407764044222255 Q ss_pred CCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 8851232343201356511654077662046675889999 Q gi|254780426|r 177 SMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 177 ~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) ++++.++|..||+++.|+|+|+||||+++||||++||||+ T Consensus 72 ~~~l~~~l~~~~~~~~~~a~VvDe~G~~~GivT~~DilE~ 111 (111) T cd04590 72 STPLDDLLEEMRKERSHMAIVVDEYGGTAGLVTLEDILEE 111 (111) T ss_pred HHHHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHCC T ss_conf 6658899984355496799999699999999980485377 No 7 >KOG2118 consensus Probab=99.77 E-value=6.9e-19 Score=143.62 Aligned_cols=231 Identities=18% Similarity=0.227 Sum_probs=162.0 Q ss_pred CCCCCCCCCCH---HHHHHHHHHHHHHHHH--HHHHHCC-CCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCE Q ss_conf 55554443203---5687530268998999--9876337-8888899999999997997689898999999987304775 Q gi|254780426|r 14 KEKNEGIDLSP---SISKTSVPAFWTRTIR--ALCKTWK-LMKLSHHKGIRLPRIDASLEGIFSETEKEIFNNILRFRKI 87 (320) Q Consensus 14 ~~~~~~~~~~p---~l~~i~~P~vw~~~~~--~~~~~~~-~~~~~~~eel~~l~~~~~~~G~i~~~E~~ii~~vl~l~~~ 87 (320) +.-..++...| .+.-+++|+-|..+.. .+....+ +......+.+-.+.......|.+..+|..|+.++++|.++ T Consensus 125 ~gl~vga~~~~~~~i~~~l~~PI~~p~~~~d~~lg~~~~~~~~~~~l~~lv~~~~~e~~~g~~~~~e~~ii~g~l~l~ek 204 (498) T KOG2118 125 YGLAVGANLVPLVRILSFLCLPIAYPFSKLDTALGLNLGEFLKRASLLALVQLVGNEAGKGDLTYDELTIITGALELTEK 204 (498) T ss_pred HCCCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 32301543100999999997168867424456540244211112779999998750013575524577887536899999 Q ss_pred EEHHEEECCCEEEEEEECCCHH-HHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHH Q ss_conf 7641312001299986301036-778979865981799841886642241029999987662235643223334576788 Q gi|254780426|r 88 RIDDIMISRVSINAVEDKATVY-EAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESN 166 (320) Q Consensus 88 ~v~diMtPr~~i~~i~~~~~~~-e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~ 166 (320) .+++||||-.++..++.+..+. +....+.++||||+|||+.+..+++|.+..+.++.+..... . ......-.. T Consensus 205 ~~~evmtpi~~~f~l~~n~~l~~~~~~~i~~~g~sripv~~~~~~~~i~~~L~~~~~~~~~~~~-~-----~~~~~v~~~ 278 (498) T KOG2118 205 LVGEVMTPIEDVFALDANTKLDRETVGEIVKHGYSRIPVYEQEPKNKIGGLLVMNLLRLLQVEV-P-----LEPLPVSES 278 (498) T ss_pred HHHHHCCCHHHHHHHHHCCCCCHHHHHHHHHCCCCEEEECCCCCCCEEHHHHHHHHHHHCCCCC-C-----CCCCHHHHH T ss_conf 9887306144430011002101788888876587600003666520000444331233204334-5-----663003455 Q ss_pred HCCCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHH-----HCCCCCCCCEEE Q ss_conf 515657971788512323432013565116540776620466758899998742221000000-----001246773378 Q gi|254780426|r 167 LIKNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKK-----SMISAVSDNTFI 241 (320) Q Consensus 167 ~~r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~-----~~i~~~~~g~~~ 241 (320) ..++...+|+++++.+.+..||+.+.|||+|.. +....|++|++|+ +++++|+.||.|... ..+....-+.|. T Consensus 279 ~~~~l~~vp~~~~~~~~l~~~~~~~~H~~~v~~-~~~~~~~~~l~~~-~~~~~ev~de~d~~~~~~~~~~~~~~~~~~~~ 356 (498) T KOG2118 279 ALLRLPLVPENMPLLDLLNEFQKGKSHMAVVRN-GHVDIFVLTLEDL-EEVVKEVEDEEDKEEIKFETGIFTLRGIGDLD 356 (498) T ss_pred HHCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEC-CCCCEEEEECCCH-HHHHCCCCCCCCCCCCCCCCCCEEECCCCCCC T ss_conf 412545688743088889998731213688824-8762444641414-54302133412454440113441340553124 Q ss_pred EECCCCHHHHH Q ss_conf 30453599999 Q gi|254780426|r 242 VDARTDLEELA 252 (320) Q Consensus 242 v~G~~~l~~l~ 252 (320) ....+.+.+.. T Consensus 357 ~~~v~~~~~~~ 367 (498) T KOG2118 357 AGAVTDIDQLG 367 (498) T ss_pred CCCCCHHHHHC T ss_conf 55332254523 No 8 >pfam03471 CorC_HlyC Transporter associated domain. This small domain is found in a family of proteins with the pfam01595 domain and two CBS domains with this domain found at the C-terminus of the proteins, the domain is also found at the C terminus of some Na+/H+ antiporters. This domain is also found in CorC that is involved in Magnesium and cobalt efflux. The function of this domain is uncertain but might be involved in modulating transport of ion substrates. Probab=99.70 E-value=1.4e-16 Score=128.41 Aligned_cols=78 Identities=33% Similarity=0.739 Sum_probs=71.4 Q ss_pred CCCCCEEEEECCCCHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEE-EEECCEEEEEEEECCCEEEEEEE Q ss_conf 46773378304535999999828888877774674317999999842347379889-97589599999951887899999 Q gi|254780426|r 234 AVSDNTFIVDARTDLEELAKIIGTDCNCLKGEQDVDSLGGLIFSVLDRIPARGEVV-LEIPGFEIVILDADVRCVRRVRI 312 (320) Q Consensus 234 ~~~~g~~~v~G~~~l~~l~~~l~~~l~~~~~~~~~~Ti~G~i~~~l~~iP~~Ge~v-~~~~g~~f~V~~~~~~rI~~v~v 312 (320) ++++|+|+++|.++|+++++.|+++++. ++|+|||||+++++|++|++||++ .++++|+|+|++++++||.+|+| T Consensus 2 ~~~~~~~~v~G~~~l~dl~~~~~~~l~~----~~~~TigG~i~~~l~~iP~~ge~~~i~~~~~~f~V~~~~~~rI~~v~i 77 (80) T pfam03471 2 KLDDGSYLVDGRAPLDDLNELLGLDLPE----EDYDTLAGLVLELLGRIPEVGDKVEVDLGGLRFTVLEVDGRRIKKVRV 77 (80) T ss_pred EECCCEEEEEECCCHHHHHHHHCCCCCC----CCCEEHHHHHHHHHCCCCCCCCEEEEEECCEEEEEEEEECCEEEEEEE T ss_conf 8889889998026999999987979998----652059999999868699999889999799899999911798989999 Q ss_pred EEC Q ss_conf 963 Q gi|254780426|r 313 RRL 315 (320) Q Consensus 313 ~~l 315 (320) +++ T Consensus 78 ~~~ 80 (80) T pfam03471 78 TKL 80 (80) T ss_pred EEC T ss_conf 989 No 9 >cd04596 CBS_pair_DRTGG_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a DRTGG domain upstream. The function of the DRTGG domain, named after its conserved residues, is unknown. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.70 E-value=7.3e-17 Score=130.34 Aligned_cols=106 Identities=20% Similarity=0.254 Sum_probs=95.2 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEEC Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851-5657971 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vp 175 (320) .+++++..++|+.++++++.+++|+++||++++ ++++|+++.+|++.. .....+.++| +++.+++ T Consensus 2 ~dv~~l~p~~tv~~a~~lm~~~~~~~~PVvd~~-~~lvGivt~~Di~~~-------------~~~~~v~~iM~~~~~~v~ 67 (108) T cd04596 2 EDTGYLTTTDTVKDWHELNKETGHSRFPVVDEK-NKVVGIVTSKDVAGK-------------DPDTTIEKVMTKNPITVN 67 (108) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CEEEEEEEHHHHHHC-------------CCCCCHHHHEECCCCCCC T ss_conf 988898999979999999998599889999689-909999997997517-------------988726785004762059 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 78851232343201356511654077662046675889999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) +++++.++++.|.+++.+.++|+||+|.++|+||.+||+++ T Consensus 68 ~~~~~~~~~~~m~~~~~~~lPVVde~~~lvGiIT~~Dil~A 108 (108) T cd04596 68 PKTSVASVAHMMIWEGIEMLPVVDDNKKLLGIISRQDVLKA 108 (108) T ss_pred CCCCHHHHHHHHHHHCCCEEEEECCCCEEEEEEEHHHHHCC T ss_conf 99889999999998498778999169959999997996319 No 10 >cd04639 CBS_pair_26 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.67 E-value=5.5e-16 Score=124.59 Aligned_cols=109 Identities=17% Similarity=0.189 Sum_probs=97.3 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEEC Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851-5657971 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vp 175 (320) +++.++..++|+.++.+.+.+++++++||.+++. +++|+++.+|++.++..+. ...++.++| +++..++ T Consensus 1 T~~~Tv~p~~tl~~a~~~~~~~~~~~~PVvd~~g-~lvGivt~~di~~~l~~~~---------~~~~V~~vM~~~v~tv~ 70 (111) T cd04639 1 THFETLSPADTLDDAADALLATTQHEFPVVDGDG-HLVGLLTRDDLIRALAEGG---------PDAPVRGVMRRDFPTVS 70 (111) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCC-CEEEEEEHHHHHHHHHCCC---------CCCCHHHHCCCCCEEEC T ss_conf 9887929989799999999971997899993899-8899998999999987269---------99705763546987998 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 7885123234320135651165407766204667588999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) +++++.++++.|.+++.+.++|||++|.++|+||.+||+| T Consensus 71 ~~~~l~~~~~~m~~~~~~~lpVVd~~g~lvGivT~~Di~E 110 (111) T cd04639 71 PSATLDAVLRLMQQGGAPAVPVVDGSGRLVGLVTLENVGE 110 (111) T ss_pred CCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHC T ss_conf 9994999999999589988989957992999999788544 No 11 >cd04631 CBS_pair_18 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.66 E-value=9.5e-16 Score=123.07 Aligned_cols=119 Identities=18% Similarity=0.286 Sum_probs=100.8 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCC----CCCCCCCCCHHHHC-CCCE Q ss_conf 299986301036778979865981799841886642241029999987662235643----22333457678851-5657 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTN----NINLNIQLSESNLI-KNIL 172 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~----~~~~~~~~~l~~~~-r~~~ 172 (320) ++++++.++|+.++++.+.+++++++||++++.++++|+++.+|++.++..+..... ........++.++| +++. T Consensus 2 ~vvtv~~~~tl~ea~~~m~~~~v~~~PVvd~~~~~lvGiit~~Di~~~~~~~~~~~~~~~~~~~~~~~~~v~~im~~~v~ 81 (125) T cd04631 2 DVVTVPPTTPIMEAAKIMVRNGFRRLPVVDEGTGKLVGIITATDILKYLGGGEKFNKIKTGNGLEAINEPVRSIMTRNVI 81 (125) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEHHHHHHHHHCCCCCCHHHHCCCHHHCCCCHHHHCCCCCE T ss_conf 97797990949999999998399889999799995999999999999873354310121011023217899995668988 Q ss_pred EECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 97178851232343201356511654077662046675889999 Q gi|254780426|r 173 FVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 173 ~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) ++++++++.++++.|.+++.|.++|+|++|.+.||||..||+.+ T Consensus 82 tv~~~~~l~~a~~~m~~~~i~~lpVvd~~g~~~GiIt~~DilrA 125 (125) T cd04631 82 TITPDDSIKDAAELMLEKRVGGLPVVDDDGKLVGIVTERDLLKA 125 (125) T ss_pred EECCCCCHHHHHHHHHHCCCCEEEEEECCCEEEEEEEHHHHHCC T ss_conf 99799998999999986395099999589939999988997369 No 12 >cd04610 CBS_pair_ParBc_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a ParBc (ParB-like nuclease) domain downstream. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.58 E-value=4.8e-15 Score=118.45 Aligned_cols=105 Identities=19% Similarity=0.330 Sum_probs=94.0 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEEC Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851-5657971 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vp 175 (320) .+++++..++|+.++++.+.+++|+++||+++ ++++|+++.+|++.. ....++.++| +++++++ T Consensus 2 kdvvtv~~d~tv~ea~~~m~~~~~~~~pVvd~--~~lvGivt~~Dll~~-------------~~~~~v~~iM~~~~~tv~ 66 (107) T cd04610 2 RDVITVSPDNTVKDVIKLIKETGHDGFPVVDN--GKVVGIVSARDLLGK-------------DPDETVEEIMSKDLVVAV 66 (107) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHCCC-------------CCCCCHHHHCCCCCCCCC T ss_conf 88099899295999999999759978999989--999998876453159-------------987726786057860169 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 78851232343201356511654077662046675889999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) +++++.++++.|.+++.+.++|+||.|.++||||..||+.+ T Consensus 67 ~~~~l~~a~~~m~~~~~~~lpVvde~g~lvGiiT~~Dilra 107 (107) T cd04610 67 PEMDIMDAARVMFRTGISKLPVVDENNNLVGIITNTDVIRS 107 (107) T ss_pred CCCCHHHHHHHHHHHCCCEEEEECCCCEEEEEEEHHHHHCC T ss_conf 99869999999998299689699239989999994784439 No 13 >cd04600 CBS_pair_HPP_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the HPP motif domain. These proteins are integral membrane proteins with four transmembrane spanning helices. The function of these proteins is uncertain, but they are thought to be transporters. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.57 E-value=2.1e-14 Score=114.22 Aligned_cols=119 Identities=14% Similarity=0.198 Sum_probs=99.6 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCC---CCCCCCCCCHHHHCC-CCE Q ss_conf 1299986301036778979865981799841886642241029999987662235643---223334576788515-657 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTN---NINLNIQLSESNLIK-NIL 172 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~---~~~~~~~~~l~~~~r-~~~ 172 (320) .+++++..++|+.++++.+.+++++++||.+++. +++|+++.+|++.+......... .........+.++|. ++. T Consensus 2 rdvvtv~~~~tl~~a~~~m~~~~~~~lpVvd~~g-~lvGiit~~Dll~~~~~~~~~~~~~~~~~~~~~~~v~~iM~~~~~ 80 (124) T cd04600 2 RDVVTVTPDTSLEEAWALLRRHRIKALPVVDGDR-RLVGIVTQRDLLRHARPDGRRPLRGRLRGRDKPETVGDIMSPPVV 80 (124) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECCC-CEEEEEEHHHHHHHHHCCCCCCCHHHHHCCCCCCCHHHHCCCCCE T ss_conf 8969989939799999999972997899995698-088885699999877504653101222105578898995779986 Q ss_pred EECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 97178851232343201356511654077662046675889999 Q gi|254780426|r 173 FVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 173 ~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) .+++++++.++++.|.+++.|..+|+|+.|.+.|+||..||+.+ T Consensus 81 ~v~~~~~l~~~~~~m~~~~~~~lpVvd~~~~lvGiIt~~Dil~A 124 (124) T cd04600 81 TVRPDTPIAELVPLLADGGHHHVPVVDEDRRLVGIVTQTDLIAA 124 (124) T ss_pred EECCCCCHHHHHHHHHHCCCEEEEEEECCCEEEEEEEHHHHHCC T ss_conf 99898979999999997793499999579989999978997469 No 14 >COG3448 CBS-domain-containing membrane protein [Signal transduction mechanisms] Probab=99.57 E-value=8e-14 Score=110.45 Aligned_cols=154 Identities=16% Similarity=0.253 Sum_probs=124.0 Q ss_pred CCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 88999999999979976898989999999873047757641312001299986301036778979865981799841886 Q gi|254780426|r 51 LSHHKGIRLPRIDASLEGIFSETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSL 130 (320) Q Consensus 51 ~~~~eel~~l~~~~~~~G~i~~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~ 130 (320) ..++||+..++.+ .|.+-+.+ ++.+.++.|||+ .+++++..+++++++++++.+|++.-+||.+++. T Consensus 220 DIdrddLe~llr~---------~elqa~~R--~~~~LtcadIMS--rdVvtv~~~ts~dhA~~ll~~H~ikaLPV~d~~~ 286 (382) T COG3448 220 DIDRDDLERLLRE---------TELQALRR--RMGELTCADIMS--RDVVTVSTDTSIDHARKLLQEHRIKALPVLDEHR 286 (382) T ss_pred CCCHHHHHHHHHH---------HHHHHHHH--HHCCCCHHHHCC--CCCEECCCCCCHHHHHHHHHHCCCCCCCCCCCCC T ss_conf 5788999999999---------99999999--856430888637--4535017767757999999976865045326666 Q ss_pred CCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCC-CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEE Q ss_conf 642241029999987662235643223334576788515-6579717885123234320135651165407766204667 Q gi|254780426|r 131 DNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLIK-NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVS 209 (320) Q Consensus 131 d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~r-~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVT 209 (320) .++|+|..+|++..... .............++.+|. |+..+-..+++.+++..|-.++.|...|+|+.|.++|||| T Consensus 287 -rl~GiVt~~dl~~~a~~--~p~qrlr~~~~~~vk~imt~~v~tv~pdtpa~~lvp~lad~g~H~lpvld~~g~lvGIvs 363 (382) T COG3448 287 -RLVGIVTQRDLLKHARP--SPFQRLRFLRPPTVKGIMTTPVVTVRPDTPAVELVPRLADEGLHALPVLDAAGKLVGIVS 363 (382) T ss_pred -CEEEEEEHHHHHHCCCC--CHHHHHHCCCCCCCCCCCCCCCEEECCCCCHHHHHHHHHCCCCCEEEEECCCCCEEEEEE T ss_conf -10032448877601685--667775315998634434576335358970888888761377313668868883888862 Q ss_pred HHHHHHHHHHH Q ss_conf 58899998742 Q gi|254780426|r 210 YEDIVSVLMRD 220 (320) Q Consensus 210 leDIle~ivGe 220 (320) ..|++-++... T Consensus 364 QtDliaal~r~ 374 (382) T COG3448 364 QTDLIAALYRN 374 (382) T ss_pred HHHHHHHHHHH T ss_conf 78899999999 No 15 >cd04611 CBS_pair_PAS_GGDEF_DUF1_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with a PAS domain, a GGDEF (DiGuanylate-Cyclase (DGC) domain, and a DUF1 domain downstream. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CB Probab=99.56 E-value=2.4e-14 Score=113.81 Aligned_cols=109 Identities=23% Similarity=0.328 Sum_probs=97.4 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEEC Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851-5657971 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vp 175 (320) ++++++.+++|+.++.+.+.+++++++||.+++ +++|+++.+|++.++..... ...++.++| +++.+++ T Consensus 1 t~vvTv~p~~tv~ea~~~m~~~~i~~~~V~~~~--~~vGiiT~~Di~~~~~~~~~--------~~~~v~~iM~~~~~tv~ 70 (111) T cd04611 1 TQILTCPPDTSLAEAASRMRERRISSIVVVDDG--RPLGIVTERDILRLLASGPD--------LQTPVGEVMSSPLLTVP 70 (111) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECC--EEEEEEEEHHHHHHHHHCCC--------CCCCHHHHHCCCCEEEE T ss_conf 971998896979999999997199889999799--99999985598998742488--------77888997106833997 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 7885123234320135651165407766204667588999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) +++++.++++.|.+++.+.++|+||+|.++|+||..||+. T Consensus 71 ~~~~~~~a~~~m~~~~i~~lpVvd~~~~~vGiiT~~Dilr 110 (111) T cd04611 71 ADTSLYDARQLMREHGIRHLVVVDDDGELLGLLSQTDLLQ 110 (111) T ss_pred CCCCHHHHHHHHHHHCCEEEEEECCCCEEEEEEEHHHHHC T ss_conf 8999999999999718739999946998999998568427 No 16 >cd04635 CBS_pair_22 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.55 E-value=3.9e-14 Score=112.52 Aligned_cols=117 Identities=15% Similarity=0.156 Sum_probs=96.7 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCC--CCCCCCCCCCCHHHHCC-CCEEE Q ss_conf 2999863010367789798659817998418866422410299999876622356--43223334576788515-65797 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAK--TNNINLNIQLSESNLIK-NILFV 174 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~--~~~~~~~~~~~l~~~~r-~~~~V 174 (320) +++++..++++.++++.+.+++++++||.+++. +++|+++.+|++.+....... ...........+.++|. +++++ T Consensus 2 ~vitv~~d~~l~~a~~~m~~~~i~~lPVVd~~g-~lvGiit~~Dll~~~~~~~~~~~~~~~~~~~~~~v~~iM~~~~itv 80 (122) T cd04635 2 EPVTCTPDDPVSKVWDLMLESGFTGLPVVQKAG-ELIGIITRRDIIRAGSVRTSVEDQQRTQTKASPTVEKIMSTPVYSV 80 (122) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEEECCC-CEEEEEEHHHHHHHHHHCCCCHHHHHCCCCCCCCHHHHCCCCCEEE T ss_conf 999968929799999999971994899991898-2999999899876554001200344314534689999686898799 Q ss_pred CCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 17885123234320135651165407766204667588999 Q gi|254780426|r 175 PSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 175 pe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) ++++++.+++..|.+++.+..+|+|+.|.++||||..||+. T Consensus 81 ~~~~~l~~a~~~m~~~~i~~lpVvd~~g~lvGiIt~~Dilr 121 (122) T cd04635 81 TPDDSIATAVELMLEHDIGRLPVVNEKDQLVGIVDRHDVLK 121 (122) T ss_pred CCCCCHHHHHHHHHHHCCCEEEEECCCCEEEEEEEHHHHHC T ss_conf 89693999999889809988989937998999999799636 No 17 >cd04632 CBS_pair_19 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.55 E-value=3.5e-14 Score=112.79 Aligned_cols=119 Identities=16% Similarity=0.285 Sum_probs=98.1 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCC------CCCCCCCCCHHHHC-C Q ss_conf 1299986301036778979865981799841886642241029999987662235643------22333457678851-5 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTN------NINLNIQLSESNLI-K 169 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~------~~~~~~~~~l~~~~-r 169 (320) .+++++..++++.++++++.+++++++||++++. +++|+++.+|++.++........ ........++.++| + T Consensus 1 kdvvTv~~~~~l~~a~~lm~~~~i~~lPVvd~~g-~lvGiiT~~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~V~~iM~~ 79 (128) T cd04632 1 EDVITVREDDSVGKAINVLREHGISRLPVVDDNG-KLTGIVTRHDIVDFVVRDRDKARTGDRSGEKERMLDLPVYDAMSS 79 (128) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCC-CEEEEEEHHHHHHHHHCCCHHCCHHHHCCHHHHHHCCCHHHHCCC T ss_conf 9986989959299999999983997799996899-789999889999988532101011221001223406799996679 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEE--ECCCCCEEEEEHHHHHHH Q ss_conf 65797178851232343201356511654--077662046675889999 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVI--DEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVv--DE~G~~~GiVTleDIle~ 216 (320) +++++++++++.+++..|.+++.+..+|+ |+.|.++||||..||+.+ T Consensus 80 ~vitv~~~~~i~~a~~~M~~~~i~~lpVv~~dd~~~lvGIiT~~DilrA 128 (128) T cd04632 80 PVITASPNDSVRDAVDRMLENDDSSVVVVTPDDDTKVVGILTKKDVLRA 128 (128) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEEEECCCCEEEEEEEHHHHHCC T ss_conf 9879999293999999999779978966847899989999996996469 No 18 >cd04588 CBS_pair_CAP-ED_DUF294_assoc_arch This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the archaeal CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. Probab=99.55 E-value=3.1e-14 Score=113.16 Aligned_cols=108 Identities=18% Similarity=0.302 Sum_probs=96.2 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEECC Q ss_conf 29998630103677897986598179984188664224102999998766223564322333457678851-56579717 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVPS 176 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vpe 176 (320) +++++..++++.|+.+.+.+++++++||.++ ++++|+++.+|++.++..... ..++.++| ++++++++ T Consensus 2 ~vvtv~p~~sl~ea~~~m~~~~i~~~pVvd~--~~lvGiit~~Di~~~~~~~~~---------~~~V~~~M~~~~~tv~~ 70 (110) T cd04588 2 PLITLNPNATLREAARLFNTHHIHGAPVVDD--GKLVGIVTLSDIAHAIARGLE---------LAKVKDVMTKDVITIDE 70 (110) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHHHHHHCCCC---------CCHHHHHCCCCCCEECC T ss_conf 9479689498999999999709988999989--999999966898999873797---------78034520489729858 Q ss_pred CCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 8851232343201356511654077662046675889999 Q gi|254780426|r 177 SMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 177 ~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) ++++.++++.|++++.+..+|+|+.|.++||||..||+.+ T Consensus 71 d~~l~~~~~~m~~~~v~~l~Vvd~~~~lvGIiT~~Dllka 110 (110) T cd04588 71 DEQLYDAIRLMNKHNVGRLIVTDDEGRPVGIITRTDILRS 110 (110) T ss_pred CCCHHHHHHHHHHCCCCEEEEEECCCEEEEEEEHHHHHCC T ss_conf 9969999999987799789999669999999982483468 No 19 >cd04612 CBS_pair_SpoIVFB_EriC_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with either the SpoIVFB domain (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) or the chloride channel protein EriC. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A ), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. EriC is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase an Probab=99.55 E-value=2.3e-14 Score=113.95 Aligned_cols=109 Identities=19% Similarity=0.245 Sum_probs=93.9 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEECC Q ss_conf 29998630103677897986598179984188664224102999998766223564322333457678851-56579717 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVPS 176 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vpe 176 (320) ++++++.++|+.++++.+.+++++++||.++ ++++|+++.+|++.+...+.... .....+| +++.++++ T Consensus 2 dvvtv~p~~tl~~a~~~m~~~~~~~lpVvd~--~~l~Givt~~di~~~~~~~~~~~--------~~v~~im~~~~~~v~~ 71 (111) T cd04612 2 DVVTVPVDLTVDEVLALMFGERHRGYPVVDD--GRLVGIVTLADIRRVPAEGREAT--------VLVGDVMTRDPVTASP 71 (111) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHHHHHCCCCCC--------CEEEEEECCCCCCCCC T ss_conf 8499999295999999999719948999999--99999998999998775147777--------5699997079605299 Q ss_pred CCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 8851232343201356511654077662046675889999 Q gi|254780426|r 177 SMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 177 ~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) ++++.+++..|++++.+..+|+|++|.+.|+||..||+.+ T Consensus 72 ~~~l~~a~~~m~~~~~~~lpVvd~~g~l~GiiT~~DilrA 111 (111) T cd04612 72 DETLRDALKRMAERDIGRLPVVDDSGRLVGIVSRSDLLRA 111 (111) T ss_pred CCCHHHHHHHHHHCCCCEEEEEECCCEEEEEEEHHHHHCC T ss_conf 9979999999997698199999689989999986886349 No 20 >cd04585 CBS_pair_ACT_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the acetoin utilization proteins in bacteria. Acetoin is a product of fermentative metabolism in many prokaryotic and eukaryotic microorganisms. They produce acetoin as an external carbon storage compound and then later reuse it as a carbon and energy source during their stationary phase and sporulation. In addition these CBS domains are associated with a downstream ACT domain, which is linked to a wide range of metabolic enzymes that are regulated by amino acid concentration. Pairs of ACT domains bind specifically to a particular amino acid leading to regulation of the linked enzyme. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The i Probab=99.55 E-value=4.6e-14 Score=112.02 Aligned_cols=117 Identities=17% Similarity=0.235 Sum_probs=98.6 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCC---CCCCCCCHHHHC-CCCEE Q ss_conf 29998630103677897986598179984188664224102999998766223564322---333457678851-56579 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNI---NLNIQLSESNLI-KNILF 173 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~---~~~~~~~l~~~~-r~~~~ 173 (320) ++++++.++|+.++++.+.+++++++||.++ ++++|+++.+|++.++.......... .......+.++| +++++ T Consensus 2 dpvti~~~~tl~ea~~~m~~~~~~~lpVvd~--~~lvGiit~~Di~~~~~~~~~~~~~~~~~~~~~~~~v~~im~~~~~t 79 (122) T cd04585 2 NPITVTPDTSLMEALKLMKENSIRRLPVVDR--GKLVGIVTDRDLKLASPSKATTLDIWELYYLLSKIKVSDIMTRDPIT 79 (122) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHHHHHCCCCCCCHHHHHCCCCCCCHHHHCCCCCEE T ss_conf 9899689296999999999739988999989--99999998899998775046553335542120388999957789889 Q ss_pred ECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 7178851232343201356511654077662046675889999 Q gi|254780426|r 174 VPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 174 Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) +++++++.+++..|.+++.+.++|+|+.|.+.|+||..||+.+ T Consensus 80 v~~~~~l~~a~~~m~~~~i~~lpVvd~~g~lvGiiT~~DilkA 122 (122) T cd04585 80 VSPDASVEEAAELMLERKISGLPVVDDQGRLVGIITESDLFRA 122 (122) T ss_pred EECCCCHHHHHHHHHHCCCCEEEEEECCCEEEEEEEHHHHHCC T ss_conf 9388669999999987598789999589989999987897469 No 21 >cd04643 CBS_pair_30 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.53 E-value=7.2e-14 Score=110.74 Aligned_cols=115 Identities=17% Similarity=0.304 Sum_probs=95.4 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEEC Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851-5657971 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vp 175 (320) .+++++..++|+.++++.+.+++||++||.+++. +++|+++.+|++.++...... .........+.++| +++.++. T Consensus 1 k~Vvtv~~~~tl~~A~~~m~~~~i~~lPVvd~~g-klvGiit~~Di~~~~~~~~~~--~~~~~~~~~V~~iM~~~~~tv~ 77 (116) T cd04643 1 EEVAYVQDTNTLRHALLVLTKHGYSAIPVLDKEG-KYVGTISLTDILWKLKGLENL--DLERLVDLKVIDVMNTDVPVII 77 (116) T ss_pred CCEEEECCCCCHHHHHHHHHHCCCCEEEEECCCC-EEEEEEEHHHHHHHHHCCCCC--CHHHHCCCCHHHHHCCCCEEEE T ss_conf 9769999949299999999974998798986999-499998899999987436532--2023116687795156988990 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 78851232343201356511654077662046675889999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) ++.++.+++..|.+++ +.+|+|+.|.++||||..|||.+ T Consensus 78 ~~~~i~~~~~~m~~~~--~lpVVD~~~~lvGIITr~DilkA 116 (116) T cd04643 78 DDADIEEILHLLIDQP--FLPVVDDDGIFIGIITRREILKA 116 (116) T ss_pred CCCCHHHHHHHHHHCC--EEEEEECCCEEEEEEEHHHHHCC T ss_conf 7999999999998789--89999459999999992795579 No 22 >cd04803 CBS_pair_15 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.53 E-value=9.2e-14 Score=110.04 Aligned_cols=116 Identities=13% Similarity=0.265 Sum_probs=98.4 Q ss_pred EEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCC--CCCCCCCCCHHHHC-CCCEEEC Q ss_conf 99986301036778979865981799841886642241029999987662235643--22333457678851-5657971 Q gi|254780426|r 99 INAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTN--NINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 99 i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~--~~~~~~~~~l~~~~-r~~~~Vp 175 (320) ++++..++++.++++.+.+++++++||++++. +++|+++.+|++.++........ ........++.++| +++.+++ T Consensus 3 vvtv~~~~tl~~a~~~m~~~~i~~lpVvd~~~-~lvGiit~~Dil~~~~~~~~~~~~~~~~~~~~~~v~~im~~~~~tv~ 81 (122) T cd04803 3 VVTLSEDDSLADAEELMREHRIRHLPVVNEDG-KLVGLLTQRDLLRAALSSLSDNGEESLTKERDVPVAEVMKTDVLTVT 81 (122) T ss_pred CEEECCCCCHHHHHHHHHHCCCCEEEEECCCC-CEEEEEEHHHHHHHHHHCCCCCCHHHCCCCCCCCHHHHCCCCCEEEC T ss_conf 19998969999999999974997899990898-68889659999999984376420121025358899996689986998 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 7885123234320135651165407766204667588999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) +++++.++++.|.+++.+..+|+|+.|.++|+||..||+. T Consensus 82 ~~~~l~~~~~~m~~~~~~~lpVvd~~~~lvGiiT~~Dilk 121 (122) T cd04803 82 PDTPLREAAEIMVENKIGCLPVVDDKGTLVGIITRSDFLR 121 (122) T ss_pred CCCCHHHHHHHHHHCCCCEEEEEECCCEEEEEEEHHHEEE T ss_conf 9998999999998568509999947998999999515564 No 23 >cd04613 CBS_pair_SpoIVFB_EriC_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with either the SpoIVFB domain (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) or the chloride channel protein EriC. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A ), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. EriC is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase a Probab=99.52 E-value=6.1e-14 Score=111.22 Aligned_cols=112 Identities=17% Similarity=0.230 Sum_probs=96.3 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEEC Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851-5657971 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vp 175 (320) .+++++..++++.++++.+.+++++++||++++. +++|+++.+|++.++..... .....+.++| +++.++. T Consensus 1 rdvvtv~~~~tl~ea~~~m~~~~~~~~pVvd~~~-~lvGiit~~Dl~~~l~~~~~-------~~~~~v~~im~~~~~~v~ 72 (114) T cd04613 1 RDVVTIPEDTPLNELLDVIAHSPENNFPVVDDDG-RLVGIVSLDDIREILFDPSL-------YDLVVASDIMTKPPVVVY 72 (114) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCC-EEEEEEEHHHHHHHHHHCCC-------CCCCEEEHHCCCCCEEEE T ss_conf 9976958909899999999861997899992899-49999999999999970566-------557293120362965991 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEE-CCCCCEEEEEHHHHHHH Q ss_conf 788512323432013565116540-77662046675889999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVID-EHGGTDGLVSYEDIVSV 216 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvD-E~G~~~GiVTleDIle~ 216 (320) +++++.++++.|++++.+..+||| |.|.++|+||..|||.+ T Consensus 73 ~~~~l~~a~~~m~~~~~~~lPVVd~~~g~lvGiit~~Dil~A 114 (114) T cd04613 73 PEDSLEDALKKFEDSDYEQLPVVDDDPGKLLGILSRSDLLSA 114 (114) T ss_pred CCCCHHHHHHHHHHCCCEEEEEEECCCCEEEEEEEHHHHHCC T ss_conf 367889999999975975999998999989999998995359 No 24 >cd04594 CBS_pair_EriC_assoc_archaea This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the EriC CIC-type chloride channels in archaea. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS do Probab=99.52 E-value=6.3e-14 Score=111.10 Aligned_cols=101 Identities=21% Similarity=0.283 Sum_probs=88.6 Q ss_pred EEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEECCC Q ss_conf 9998630103677897986598179984188664224102999998766223564322333457678851-565797178 Q gi|254780426|r 99 INAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVPSS 177 (320) Q Consensus 99 i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vpe~ 177 (320) -+++.+++|+.++.++|.+++++++||++++ +++|+++.+|+... ...++.++| +++.++.++ T Consensus 3 pVtV~pd~tV~eA~~lM~~~~i~~lPVvd~~--klvGIvt~rDi~~~--------------~~~~v~d~Mt~~v~tv~p~ 66 (104) T cd04594 3 DIKVKDYDKVYEAKRIMIENDLLSLPVVDYN--KFLGAVYLKDIENA--------------TYGDVVDYIVRGIPYVRLT 66 (104) T ss_pred CEEECCCCCHHHHHHHHHHHCCCEEEEEECC--EEEEEEEHHHHHHC--------------CCCCHHHEEECCCEEECCC T ss_conf 8798999939999999998398878899899--99999997996524--------------6998445476398889998 Q ss_pred CCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 851232343201356511654077662046675889999 Q gi|254780426|r 178 MLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 178 ~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) +++.++++.|.+++.+...|+|+ |.++||||+.||+++ T Consensus 67 ~~l~ea~~lM~~~~i~~lPVvd~-gklvGIIT~~Dil~A 104 (104) T cd04594 67 STAEEAWEVMMKNKTRWCPVVDD-GKFKGIVTLDSILDA 104 (104) T ss_pred CCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHCC T ss_conf 97999999999869787899989-999999996895378 No 25 >COG4109 Predicted transcriptional regulator containing CBS domains [Transcription] Probab=99.51 E-value=9.2e-14 Score=110.05 Aligned_cols=131 Identities=23% Similarity=0.314 Sum_probs=116.2 Q ss_pred HHHHHHHHHC-----CCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHH Q ss_conf 9999998730-----47757641312001299986301036778979865981799841886642241029999987662 Q gi|254780426|r 74 EKEIFNNILR-----FRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISH 148 (320) Q Consensus 74 E~~ii~~vl~-----l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~ 148 (320) -+.||++++. =+=.+|.|||+|..+..++..+++++++.....+++||||||.+.+. .++|+|..||++. T Consensus 170 VAtmIN~Al~n~lIKkdI~~Vedi~~P~~~~~yL~~~d~v~d~~~l~~kt~~sRfPVvn~~~-kvvGvVt~rDv~~---- 244 (432) T COG4109 170 VATMINKALSNQLIKKDIITVEDIMTPLEDTSYLRETDTVEDWLDLVEKTGHSRFPVVNRSM-KVVGVVTMRDVLD---- 244 (432) T ss_pred HHHHHHHHHHHHHHHHHEEEHHHHCCCCCCCEECCCCCCHHHHHHHHHHCCCCCCCEECCCC-EEEEEEEEHHHHC---- T ss_conf 99999899877666443255877446443432324645099999999971987341331442-6999998346444---- Q ss_pred CCCCCCCCCCCCCCCHHHHCCC-CEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHH Q ss_conf 2356432233345767885156-579717885123234320135651165407766204667588999987 Q gi|254780426|r 149 MYAKTNNINLNIQLSESNLIKN-ILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLM 218 (320) Q Consensus 149 ~~~~~~~~~~~~~~~l~~~~r~-~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~iv 218 (320) ..+..++...|.. |++|-..+++..+-+.|--+...|..|+|+.-.+.|+||..|+|+.+- T Consensus 245 ---------~~~~t~ieKVMtknp~tv~~~tsVAsvaq~MiwE~iem~PVv~~n~~llGiitR~dvlk~lq 306 (432) T COG4109 245 ---------KKPSTTIEKVMTKNPITVRAKTSVASVAQMMIWEGIEMLPVVDSNNTLLGIITRQDVLKSLQ 306 (432) T ss_pred ---------CCCCCCHHHHHCCCCEEECCCCHHHHHHHHHHHCCCEEEEEECCCCEEEEEEEHHHHHHHHH T ss_conf ---------79986177774269755425226889988887645526568837861898988999999998 No 26 >cd04642 CBS_pair_29 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.51 E-value=9.7e-14 Score=109.88 Aligned_cols=118 Identities=16% Similarity=0.237 Sum_probs=93.0 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCC-------CCCCCCCHHHHCC Q ss_conf 129998630103677897986598179984188664224102999998766223564322-------3334576788515 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNI-------NLNIQLSESNLIK 169 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~-------~~~~~~~l~~~~r 169 (320) ++++++..++++.++++.+.+++++.+||.+++. +++|+++.+|+.............. ............+ T Consensus 1 s~Vitv~~~~~l~eA~~lm~~~~i~~lPVvd~~g-~lvGiis~~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (126) T cd04642 1 SKVVSIDSDERVLDAFKLMRKNNISGLPVVDEKG-KLIGNISASDLKGLLLSPDDLLLYRTITFKELSEKFTDSDGVKSR 79 (126) T ss_pred CCEEEECCCCCHHHHHHHHHHCCCCEEEEEECCC-EEEEEEEHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCEEEECCCC T ss_conf 9579999959999999999980987899992899-099999989976555144303444455554203344234202447 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 6579717885123234320135651165407766204667588999 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) +++.+++++++.++++.|.+++.|..+||||.|.++||||+.||+. T Consensus 80 ~~itv~~~~~l~~~~~~m~~~~i~~lpVVD~~~~lvGIIs~~Dil~ 125 (126) T cd04642 80 PLITCTPSSTLKEVITKLVANKVHRVWVVDEEGKPIGVITLTDIIS 125 (126) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHC T ss_conf 9989999693999999999869749999989995999999289421 No 27 >TIGR01302 IMP_dehydrog inosine-5'-monophosphate dehydrogenase; InterPro: IPR005990 Synonyms: Inosine-5'-monophosphate dehydrogenase, Inosinic acid dehydrogenase IMP dehydrogenase (1.1.1.205 from EC,IMPDH) catalyzes the rate-limiting reaction of de novo GTP biosynthesis, the NAD-dependent reduction of IMP into XMP .Inosine 5-phosphate + NAD+ + H2O = xanthosine 5-phosphate + NADH IMP dehydrogenase is associated with cell proliferation and is a possible target for cancer chemotherapy. Mammalian and bacterial IMPDHs are tetramers of identical chains. There are two IMP dehydrogenase isozymes in humans . IMP dehydrogenase nearly always contains a long insertion that has two CBS domains within it and adopts a TIM barrel structure.; GO: 0003938 IMP dehydrogenase activity, 0006177 GMP biosynthetic process. Probab=99.50 E-value=1.2e-13 Score=109.27 Aligned_cols=177 Identities=22% Similarity=0.306 Sum_probs=137.9 Q ss_pred CCCHHHHHHHHHHHHHCCCCC-----HHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEE Q ss_conf 889999999999799768989-----899999998730477576413120012999863010367789798659817998 Q gi|254780426|r 51 LSHHKGIRLPRIDASLEGIFS-----ETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPV 125 (320) Q Consensus 51 ~~~~eel~~l~~~~~~~G~i~-----~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV 125 (320) .+||.++..-++...-=|+|+ +.+.+++++|=++. ..++-|.+.+++.++.|+.++.+++.++++|-+|| T Consensus 44 TVTE~~MAiAMA~~GGiGVIH~N~~~E~Qae~V~~VKr~e-----~g~i~redp~t~~P~~tv~~~~~l~~~~gisG~PV 118 (476) T TIGR01302 44 TVTESRMAIAMAREGGIGVIHRNMSIERQAEEVKRVKRAE-----NGIISREDPVTISPETTVEDVLELMERKGISGIPV 118 (476) T ss_pred HHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHHC-----CCEEEECCCEEECCCCHHHHHHHHCCCCCCCEEEE T ss_conf 2459999999985499479944799899999988752320-----66065148868479851899997322157654578 Q ss_pred ECCCCC------CCCCCEEH---HHHHHHHHHCCCCCCCCCCCCCCC--HHHHCC---CCEEECCCCCHHHHHHHHHCCC Q ss_conf 418866------42241029---999987662235643223334576--788515---6579717885123234320135 Q gi|254780426|r 126 YKNSLD------NPRGMVHM---RDVISYISHMYAKTNNINLNIQLS--ESNLIK---NILFVPSSMLVSDLLTNIQESR 191 (320) Q Consensus 126 ~~~~~d------~iiGiv~~---kDll~~~~~~~~~~~~~~~~~~~~--l~~~~r---~~~~Vpe~~~l~~lL~~m~~~~ 191 (320) .++..+ +++|||+. ||+..+. +...+ +.++|. .++.+|+...+.++++.|+++| T Consensus 119 v~~G~~~g~ktGKLvGIiT~sqWrD~~f~~------------~~~~~daV~~~MT~~~~~iT~~e~i~~e~A~~~L~~~r 186 (476) T TIGR01302 119 VEDGKDGGPKTGKLVGIITKSQWRDVRFVK------------DKGKKDAVSEVMTPREELITVPEGIDLEEALKVLHKHR 186 (476) T ss_pred EECCCCCCCCEEEEEEEEECCCEEECCCCC------------CCCCCCCEEECCCCCCCCEEECCCCCHHHHHHHHHHHC T ss_conf 836889897100699998377225411010------------46883301011203764348416777899999988608 Q ss_pred CCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCC---CCCEEEEECCCCH--HHHHHH Q ss_conf 65116540776620466758899998742221000000001246---7733783045359--999998 Q gi|254780426|r 192 IRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAV---SDNTFIVDARTDL--EELAKI 254 (320) Q Consensus 192 ~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~---~~g~~~v~G~~~l--~~l~~~ 254 (320) ..-++|||+.|.++||||+.||.... + .|...+- .+|.|+|-+.+.- +|++|. T Consensus 187 ~ekLpvVd~~~~lVgLiT~~Di~~~~------~----~P~A~kd~vG~~GrL~VgAAvg~r~~D~~R~ 244 (476) T TIGR01302 187 IEKLPVVDKDGELVGLITVKDIVKRR------E----FPHASKDTVGENGRLIVGAAVGTREDDLERA 244 (476) T ss_pred CCEEEEECCCCCEEEEEEHHHHHHHH------H----CCCCCCCCCCCCCEEEEEEEECCCCCCHHHH T ss_conf 65047882789889998644788986------3----8887788748886089998846898618999 No 28 >cd04624 CBS_pair_11 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.49 E-value=6.5e-14 Score=111.05 Aligned_cols=110 Identities=20% Similarity=0.334 Sum_probs=96.9 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEECC Q ss_conf 29998630103677897986598179984188664224102999998766223564322333457678851-56579717 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVPS 176 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vpe 176 (320) ++++++.++++.++.+.+.+++++.+||++++. +++|+++.+|++.++..+.. ...++.++| ++++++++ T Consensus 2 ~vvtv~~~~tl~ea~~~m~~~~~~~~~Vvd~~~-~lvGiiT~~Di~~~~~~~~~--------~~~~v~~~m~~~~~tv~~ 72 (112) T cd04624 2 PVVTVDPDTSIREAAKLMAEENVGSVVVVDPDE-RPIGIVTERDIVRAVAAGID--------LDTPVSEIMTRDLVTVDP 72 (112) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCC-CEEEEEECCHHHHHHHCCCC--------CCCCEEEEECCCCEEECC T ss_conf 978978939599999999862998899993998-29999982045889864874--------344204565167167879 Q ss_pred CCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 8851232343201356511654077662046675889999 Q gi|254780426|r 177 SMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 177 ~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) ++++.+++..|.+++.+..+|+|+.|.++|+||..||+.| T Consensus 73 ~~~l~~a~~~m~~~~~~~l~Vvd~~g~lvGiit~~Dilke 112 (112) T cd04624 73 DEPVAEAAKLMRKNNIRHHLVVDKGGELVGVISIRDLVRE 112 (112) T ss_pred CCCHHHHHHHHHHCCCEEEEEECCCCEEEEEEEHHHHCCC T ss_conf 9989999999997498399899899999999980772569 No 29 >cd04622 CBS_pair_9 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.49 E-value=1.9e-13 Score=108.01 Aligned_cols=110 Identities=21% Similarity=0.318 Sum_probs=95.3 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHH-HHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEEC Q ss_conf 2999863010367789798659817998418866422410299999-8766223564322333457678851-5657971 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVI-SYISHMYAKTNNINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll-~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vp 175 (320) ++++++.++|+.++++.+.+++++.+||.++ ++++|+++.+|++ ..+..+.. ....++.++| +++++++ T Consensus 2 ~~vti~~~~tl~~a~~~m~~~~~~~v~V~d~--~~~vGiiT~~Di~~~~~~~~~~-------~~~~~v~~iM~~~~i~v~ 72 (113) T cd04622 2 DVVTVSPDDTIREAARLMREHDVGALPVCEN--DRLVGIVTDRDIVVRAVAEGRD-------PDTTTVGDVMTRGVVTVT 72 (113) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEECHHHHHHHHHCCCC-------HHHCCHHHHHCCCCEEEE T ss_conf 9779689398999999999729988999989--9999999632788999973998-------200776786415767998 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 78851232343201356511654077662046675889999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) ++.++.++++.|++++.|..+|+|+.|.++|+||..||+.+ T Consensus 73 ~~~~l~~a~~~m~~~~i~~lpVvd~~~~~vGiIt~~Dilra 113 (113) T cd04622 73 EDDDVDEAARLMREHQVRRLPVVDDDGRLVGIVSLGDLARA 113 (113) T ss_pred CCCCHHHHHHHHHHCCCCEEEEEECCCEEEEEEEHHHHHCC T ss_conf 89987999999985796589999089989999997995477 No 30 >cd04637 CBS_pair_24 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.48 E-value=3.5e-13 Score=106.21 Aligned_cols=117 Identities=19% Similarity=0.278 Sum_probs=98.1 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCC---CCCCCCCCCHHHHC-CCCE Q ss_conf 1299986301036778979865981799841886642241029999987662235643---22333457678851-5657 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTN---NINLNIQLSESNLI-KNIL 172 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~---~~~~~~~~~l~~~~-r~~~ 172 (320) ++++++..++++.++++.+.+++++++||.+++ .++|+++.+|++.++........ ........++.++| +++. T Consensus 1 T~viTv~p~~tl~~a~~~m~~~~i~~lpVvd~~--~lvGivT~~Di~~~~~~~~~~~~~~~~~~~~~~~~v~~iM~~~~i 78 (122) T cd04637 1 TRVVTVEMDDRLEEVREIFEKHKFHHLLVVEDN--ELVGVISDRDYLKAISPFLGTAGETEKDLATLNRRAHQIMTRDPI 78 (122) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECC--EEEEEEEHHHHHHHHHHCCCCCHHHHCCCHHHCCCHHHHCCCCCE T ss_conf 974998896969999999987499889999899--999998889999875002453102211212306899994638983 Q ss_pred EECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 9717885123234320135651165407766204667588999 Q gi|254780426|r 173 FVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 173 ~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) ++++++++.+++..|.+++.+..+|+|+.|.++||||..||+. T Consensus 79 tv~~~~~l~~a~~~m~~~~i~~lPVvd~~~~lvGiIT~~Diik 121 (122) T cd04637 79 TVSPDTPVDEASKLLLENSISCLPVVDENGQLIGIITWKDLLK 121 (122) T ss_pred EECCCCCHHHHHHHHHHCCCEEEEEECCCCEEEEEEEHHHHHC T ss_conf 9999984999999999749308999968998999999688327 No 31 >cd04627 CBS_pair_14 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.48 E-value=3.2e-13 Score=106.50 Aligned_cols=115 Identities=14% Similarity=0.241 Sum_probs=94.6 Q ss_pred EEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHH--HH-CCCCEEEC Q ss_conf 9998630103677897986598179984188664224102999998766223564322333457678--85-15657971 Q gi|254780426|r 99 INAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSES--NL-IKNILFVP 175 (320) Q Consensus 99 i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~--~~-~r~~~~Vp 175 (320) .++++.++|+.++.+.+.+++++|+||.+++..+++|+++.+|++.++.................+. .+ .+++..++ T Consensus 3 fi~v~~~~sl~~a~~lm~~~~i~rlpVvd~~~g~lvGIiT~~dil~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~viti~ 82 (123) T cd04627 3 FIPVPSTASLFQAIEILGSGGIHRVAVTEEESGEVIGILSQRRLVEFLWENARSFPGLDPLYPIPLRDLTIGTSDVISIN 82 (123) T ss_pred CEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEHHHHHHHHHHCCCCCCCHHHHHCCCHHHHHCCCCCCEEEC T ss_conf 64848989299999999749977799995899949999997999999986376566144541120023323767717989 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHH Q ss_conf 78851232343201356511654077662046675889 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDI 213 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDI 213 (320) +++++.++++.|.+++.+-..|||+.|.++|++|..|| T Consensus 83 ~d~~l~~A~~lM~~~~i~~lpVVD~~g~lvGiiS~~Dl 120 (123) T cd04627 83 GDQPLIDALHLMHNEGISSVAVVDNQGNLIGNISVTDV 120 (123) T ss_pred CCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEEHHHE T ss_conf 99959999999998098878698599969999892542 No 32 >cd04586 CBS_pair_BON_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the BON (bacterial OsmY and nodulation domain) domain. BON is a putative phospholipid-binding domain found in a family of osmotic shock protection proteins. It is also found in some secretins and a group of potential haemolysins. Its likely function is attachment to phospholipid membranes. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.47 E-value=2.5e-13 Score=107.19 Aligned_cols=118 Identities=17% Similarity=0.216 Sum_probs=96.3 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCC---------------CCCCCCC Q ss_conf 1299986301036778979865981799841886642241029999987662235643---------------2233345 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTN---------------NINLNIQ 161 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~---------------~~~~~~~ 161 (320) ++++++..++|+.++.+.+.+++++++||.+++. +++|+++.+|++.....+..... ....... T Consensus 2 ~~vitv~~~~tl~~a~~~m~~~~i~~lpVvd~~~-~lvGiit~~Dil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (135) T cd04586 2 TDVVTVSPETSVAEAARLMLDNHISGLPVVDDDG-RLVGIVSEGDLLRRAELGTERRRARWLDLLAGAEELAAAFVRSHG 80 (135) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCC-EEEEEEEHHHHHHHHHCCCCCCHHHHHHHHCCCHHHHHHHCCCCC T ss_conf 9998999939399999999973996799990899-799999818976445415654114555430231034554304479 Q ss_pred CCHHHHC-CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 7678851-565797178851232343201356511654077662046675889999 Q gi|254780426|r 162 LSESNLI-KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 162 ~~l~~~~-r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) .++.++| ++++++++++++.+++..|.+++.+..+|+|+ |.++|+||..||+.+ T Consensus 81 ~~V~~iM~~~~~~v~~~~~l~~~~~~m~~~~i~~lpVvd~-~~lvGiIt~~DilkA 135 (135) T cd04586 81 RKVADVMTRPVVTVGEDTPLAEVAELMEEHRIKRVPVVRG-GRLVGIVSRADLLRA 135 (135) T ss_pred CCHHHHCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHCC T ss_conf 8989957899879959795999999999759609999999-999999997996459 No 33 >cd04633 CBS_pair_20 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.47 E-value=2.7e-13 Score=106.96 Aligned_cols=115 Identities=20% Similarity=0.274 Sum_probs=95.5 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCC---CCCCCCCHHHHC-CCCEE Q ss_conf 29998630103677897986598179984188664224102999998766223564322---333457678851-56579 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNI---NLNIQLSESNLI-KNILF 173 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~---~~~~~~~l~~~~-r~~~~ 173 (320) ++++++.++|+.++.+.+.+++++++||.++ ++++|+++.+|++..+.......... .......+.++| +++++ T Consensus 2 ~vvtv~~~~tl~ea~~~m~~~~i~~lpVvd~--~~~vGiiT~~Di~~~l~~~~~~~~~~~~~~~~~~~~V~~iM~~~~~t 79 (121) T cd04633 2 PVITVSPDDRVSHARRLMLDHDISRLPVIEG--GKLVGIVTEKDIADALRSFRPLVRDRHQERRIRNLPVSDIMTRPVIT 79 (121) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHHHHHHHCCCCCCCHHHCCCCCCCEEECCCCCCCEE T ss_conf 9479799398999999999709988999989--99887875999999997446533201102451366855334379889 Q ss_pred ECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 717885123234320135651165407766204667588999 Q gi|254780426|r 174 VPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 174 Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) +++++++.+++..|.+++.+..+|+|+ |.++||||..||+. T Consensus 80 v~~~~~l~~~~~~m~~~~~~~lpVvd~-~~lvGIIT~~Dilr 120 (121) T cd04633 80 IEPDTSVSDVASLMLENNIGGLPVVDD-GKLVGIVTRTDILR 120 (121) T ss_pred EECCCCHHHHHHHHHHHCCEEEEEEEC-CEEEEEEEHHHHHC T ss_conf 909898999999989719859999989-99999998346028 No 34 >cd04601 CBS_pair_IMPDH This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the inosine 5' monophosphate dehydrogenase (IMPDH) protein. IMPDH is an essential enzyme that catalyzes the first step unique to GTP synthesis, playing a key role in the regulation of cell proliferation and differentiation. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in IMPDH have been associated with retinitis pigmentosa. Probab=99.47 E-value=1.7e-13 Score=108.27 Aligned_cols=106 Identities=26% Similarity=0.367 Sum_probs=91.0 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC--CCCEEE Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851--565797 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI--KNILFV 174 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~--r~~~~V 174 (320) .+++++..++|+.++++.+.+++++++||++++. +++|+++.+|+.... ....++.++| .+.... T Consensus 2 rdvvtv~~~~sl~~a~~~m~~~~~~~~pVvd~~~-~l~Giit~~Dl~~~~------------~~~~~v~~iM~~~~~~~~ 68 (110) T cd04601 2 RDPITVSPDATVAEALELMAEYGISGLPVVDDDG-KLVGIVTNRDLRFET------------DLDKPVSEVMTPENLLTT 68 (110) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCC-EEEEEEEHHHHHHHC------------CCCCCCEEEEECCCEEEE T ss_conf 8898979939699999999984998899994899-698788776776505------------767860357705750870 Q ss_pred CCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 17885123234320135651165407766204667588999 Q gi|254780426|r 175 PSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 175 pe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) ++++++.++++.|++++.+.++|+|+.|.++|+||+.||+. T Consensus 69 ~~~~~~~~~~~~m~~~~i~~lpVvd~~~~lvGiIt~~Dilk 109 (110) T cd04601 69 VEGTSLEEALELLHEHKIEKLPVVDDEGKLKGLITVKDIEK 109 (110) T ss_pred CCCCCHHHHHHHHHHCCCEEEEEECCCCEEEEEEEHHHHCC T ss_conf 47687999999998668519999988999999999598437 No 35 >cd04621 CBS_pair_8 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.47 E-value=3.1e-13 Score=106.58 Aligned_cols=118 Identities=19% Similarity=0.204 Sum_probs=96.6 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCC----------------CCCCC Q ss_conf 12999863010367789798659817998418866422410299999876622356432----------------23334 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNN----------------INLNI 160 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~----------------~~~~~ 160 (320) .+++++..++++.++.+.+.+++++++||.+++. +++|+++.+|++.++......... ..... T Consensus 1 kdVitv~p~~tl~ea~~~m~~~~i~~lpVvd~~g-~lvGivT~~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (135) T cd04621 1 EDIATVHPEHSLLHVVDEMEKNGVGRVIVVDDNG-KPVGVITYRDLAFAEFEDNERGLPKKSIKMKRKAGQKRYRYVKEV 79 (135) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCC-EEEEEEEHHHHHHHHHCCCCCCCHHHHHHHHCCCHHHHHHHCCCC T ss_conf 9868989949799999999874997799995999-399999879988666314544310233333101002332101446 Q ss_pred CCCHHHHCC-CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 576788515-65797178851232343201356511654077662046675889999 Q gi|254780426|r 161 QLSESNLIK-NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 161 ~~~l~~~~r-~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) ...+.++|+ +++.+++++++.++++.|.+++.+-.+|+|+ |.++||||..||+.| T Consensus 80 ~~~v~diMt~~vitv~~d~~l~~a~~~M~~~~i~~lPVvd~-~~lvGiIt~~Dilre 135 (135) T cd04621 80 PLVAEDIMTEEIITVSPNDDVVDAAKLMLEANISGLPVVDN-DNIVGVITKTDICRE 135 (135) T ss_pred CCCHHHHCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHCCC T ss_conf 88998938899889989893999999999719988999989-999999985881168 No 36 >cd04604 CBS_pair_KpsF_GutQ_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with KpsF/GutQ domains in the API [A5P (D-arabinose 5-phosphate) isomerase] protein. These APIs catalyze the conversion of the pentose pathway intermediate D-ribulose 5-phosphate into A5P, a precursor of 3-deoxy-D-manno-octulosonate, which is an integral carbohydrate component of various glycolipids coating the surface of the outer membrane of Gram-negative bacteria, including lipopolysaccharide and many group 2 K-antigen capsules. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other funct Probab=99.46 E-value=4.2e-13 Score=105.72 Aligned_cols=112 Identities=16% Similarity=0.271 Sum_probs=98.1 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEEC Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851-5657971 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vp 175 (320) .++..+..++|+.++++++.+++++.+||++++. +++|+++-+|+.+++..+... ...++.++| +++.++. T Consensus 2 ~~vp~V~~~~si~ea~~~m~~~~~~~v~Vvd~~~-~lvGiiT~~Dl~r~~~~~~~~-------~~~~v~~iM~~~~~tv~ 73 (114) T cd04604 2 DALPLVSPDTSLKDALLEMSRKGLGMTAVVDEDG-RLVGIFTDGDLRRALEKGLDI-------LTLPVADVMTRNPKTID 73 (114) T ss_pred CCCCEECCCCCHHHHHHHHHHCCCCEEEEECCCC-EEEEEEEHHHHHHHHHCCCCC-------CCCCHHHHHCCCCEEEE T ss_conf 9477889949399999999976997899997999-399999899999998716664-------56797474356758993 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 78851232343201356511654077662046675889999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) ++.++.+++..|++++.+..+|+|+.|.+.|+||..||+.+ T Consensus 74 ~~~~i~~a~~~m~~~~i~~lpVvd~~~~~vGiit~~DllkA 114 (114) T cd04604 74 PDALAAEALELMEENKITALPVVDDNGRPVGVLHIHDLLRA 114 (114) T ss_pred CCCCHHHHHHHHHHCCEEEEEEECCCCEEEEEEEHHHHHCC T ss_conf 89886999999987795799999899989999988995469 No 37 >cd04609 CBS_pair_PALP_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the pyridoxal-phosphate (PALP) dependent enzyme domain upstream. The vitamin B6 complex comprises pyridoxine, pyridoxal, and pyridoxamine, as well as the 5'-phosphate esters of pyridoxal (PALP) and pyridoxamine, the last two being the biologically active coenzyme derivatives. The members of the PALP family are principally involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and other amine-containing compounds. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a pote Probab=99.46 E-value=3.4e-13 Score=106.34 Aligned_cols=107 Identities=21% Similarity=0.353 Sum_probs=90.1 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEECC Q ss_conf 29998630103677897986598179984188664224102999998766223564322333457678851-56579717 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVPS 176 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vpe 176 (320) +++++..++|+.++++++.+++++++||.++ ++++|+++.+|++.++..+... ...++.++| +++..+++ T Consensus 2 dvvtv~p~~tl~~a~~~m~~~~i~~lpVvd~--~~lvGiit~~Di~~~~~~~~~~-------~~~~v~~iM~~~~~~v~~ 72 (110) T cd04609 2 DVVSVAPDDTVSQAIERMREYGVSQLPVVDD--GRVVGSIDESDLLDALIEGKAK-------FSLPVREVMGEPLPTVDP 72 (110) T ss_pred CEEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHHHHHCCCCC-------CCCEEEEEEECCCEEECC T ss_conf 8069999497999999999819988999989--9999999999999998638764-------687698456759459789 Q ss_pred CCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 885123234320135651165407766204667588999 Q gi|254780426|r 177 SMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 177 ~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) ++++.+++..|.+. +.++|+||.|.+.||||..||+. T Consensus 73 ~~~l~~~~~~~~~~--~~~~vV~~~g~lvGIvT~~Dil~ 109 (110) T cd04609 73 DAPIEELSELLDRG--NVAVVVDEGGKFVGIITRADLLK 109 (110) T ss_pred CCCHHHHHHHHHHC--CCCEEEECCCEEEEEEEHHHHHC T ss_conf 89499999988767--98249961999999998368529 No 38 >cd04584 CBS_pair_ACT_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the acetoin utilization proteins in bacteria. Acetoin is a product of fermentative metabolism in many prokaryotic and eukaryotic microorganisms. They produce acetoin as an external carbon storage compound and then later reuse it as a carbon and energy source during their stationary phase and sporulation. In addition these CBS domains are associated with a downstream ACT domain, which is linked to a wide range of metabolic enzymes that are regulated by amino acid concentration. Pairs of ACT domains bind specifically to a particular amino acid leading to regulation of the linked enzyme. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The in Probab=99.44 E-value=7.6e-13 Score=104.03 Aligned_cols=116 Identities=17% Similarity=0.298 Sum_probs=95.5 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCC--CCCCCCCCCCCHHHHC-CCCEEE Q ss_conf 2999863010367789798659817998418866422410299999876622356--4322333457678851-565797 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAK--TNNINLNIQLSESNLI-KNILFV 174 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~--~~~~~~~~~~~l~~~~-r~~~~V 174 (320) +++++..++++.++++++.+++++++||.+++. +++|+++.+|++.+....... ..........++.++| ++++++ T Consensus 2 ~vvtv~~~~sl~~a~~lm~~~~~~~lpVvd~~~-~lvGiit~~Di~~~~~~~~~~~~~~~~~~~~~~~V~~iM~~~~~tv 80 (121) T cd04584 2 DVVTITPTTTIAEALELMREHKIRHLPVVDEEG-RLVGIVTDRDLRDASPSPFTTLSEHELYLLLKMPVKEIMTKDVITV 80 (121) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCC-CEEEEEEHHHHHHHHCCCCCHHHHHHCCCCCCCCHHHHCCCCCCEE T ss_conf 988968929899999999972998899990899-3999965899875220122002444202514689999667998589 Q ss_pred CCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 17885123234320135651165407766204667588999 Q gi|254780426|r 175 PSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 175 pe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) ++++++.++++.|.+++.+..+|+|+ |.++||||..||+. T Consensus 81 ~~~~~l~~a~~~m~~~~i~~lpVvd~-~~lvGivt~~Dilr 120 (121) T cd04584 81 HPLDTVEEAALLMREHRIGCLPVVED-GRLVGIITETDLLR 120 (121) T ss_pred CCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHC T ss_conf 89995999999988669619999999-99999998226006 No 39 >cd04623 CBS_pair_10 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.44 E-value=6.2e-13 Score=104.61 Aligned_cols=111 Identities=23% Similarity=0.321 Sum_probs=96.2 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEECC Q ss_conf 29998630103677897986598179984188664224102999998766223564322333457678851-56579717 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVPS 176 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vpe 176 (320) ++++++.++|+.++.+.+.+++++.+||.+++ +.++|+++.+|++.++...... ....++.++| ++++++.+ T Consensus 2 ~vvti~~~~tl~ea~~~m~~~~~~~l~Vvd~~-~~l~Giit~~Di~~~~~~~~~~------~~~~~V~~iM~~~~~~v~~ 74 (113) T cd04623 2 DVITVRPDATVAEAAKLMAEKNIGAVVVVDDG-GRLVGIFSERDIVRKVALRGAS------ALDTPVSEIMTRNVITVTP 74 (113) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHHHHHCCCC------HHHCHHHHHCCCEEEEEEC T ss_conf 97696893979999999997399889999089-8299999768999999864998------4664012312110289916 Q ss_pred CCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 8851232343201356511654077662046675889999 Q gi|254780426|r 177 SMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 177 ~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) ++++.++++.|.+++.+..+|+|+ |.++|+||..||+.+ T Consensus 75 ~~~l~~a~~~m~~~~i~~lpV~d~-~~lvGiit~~Dllra 113 (113) T cd04623 75 DDTVDEAMALMTERRFRHLPVVDG-GKLVGIVSIGDVVKA 113 (113) T ss_pred CCCHHHHHHHHHHCCCEEEEEEEC-CEEEEEEEHHHHHCC T ss_conf 991999999999769259999999-999999997996469 No 40 >cd04636 CBS_pair_23 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.44 E-value=1e-12 Score=103.24 Aligned_cols=117 Identities=20% Similarity=0.239 Sum_probs=96.9 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCC-------------CCCCCCCCC Q ss_conf 1299986301036778979865981799841886642241029999987662235643-------------223334576 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTN-------------NINLNIQLS 163 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~-------------~~~~~~~~~ 163 (320) .+++++..++|+.++++.+.+++++.+||.+++. .++|+++.+|++..+........ ........+ T Consensus 1 rdvvtv~~~~tl~ea~~~m~~~~~~~lpVvd~~~-~lvGiit~~Dll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (132) T cd04636 1 KDVITVKKDDTLRDVVEILLTGKISGVPVVDNEG-RVVGIVSEGDLIRKIYKGKGLFYVTLLYSVIFLDESKIKKLLGKK 79 (132) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCC-CEEEEEEHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHCCCCCCC T ss_conf 9987968919799999999972997899995999-399998889999998626762000000000000233310203899 Q ss_pred HHHHC-CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 78851-56579717885123234320135651165407766204667588999 Q gi|254780426|r 164 ESNLI-KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 164 l~~~~-r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) +.++| ++++.+.++.++.+++..|.+++.|-.+|+|+ |.++||||..||+. T Consensus 80 v~diMt~~~~tv~~~~~l~~a~~~m~~~~i~~lpVvd~-~~lvGiIt~~Dilr 131 (132) T cd04636 80 VEEIMTKKVITVDEDTTIEDVARIMSKKNIKRLPVVDD-GKLVGIISRGDIIR 131 (132) T ss_pred HHHHCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHC T ss_conf 99967899879999794999999999629878999989-99999999688234 No 41 >cd04619 CBS_pair_6 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.44 E-value=1e-12 Score=103.16 Aligned_cols=111 Identities=14% Similarity=0.226 Sum_probs=97.6 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEECC Q ss_conf 29998630103677897986598179984188664224102999998766223564322333457678851-56579717 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVPS 176 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vpe 176 (320) .++.++.++|+.|+.+.+.+++.+.+||++++. +++|+++-+|++.++...... ....+++++| ++++.+.+ T Consensus 2 rvitV~~~~ti~eA~~~M~~~~ig~l~Vvd~~g-~lvGIiTe~Di~r~~~~~~~~------~~~~~V~~vMt~~vitv~~ 74 (114) T cd04619 2 RLAKIDVNATLQRAAKILGEPGIDLVVVCDPHG-KLAGVLTKTDVVRQMGRCGGP------GCTAPVENVMTRAVVSCRP 74 (114) T ss_pred CEEEECCCCCHHHHHHHHHHCCCCEEEEECCCC-EEEEEEEHHHHHHHHHHCCCC------CCCCCHHHHCCCCCEEECC T ss_conf 312989869399999999874998899998999-599999949999998730798------6678999961589889999 Q ss_pred CCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 885123234320135651165407766204667588999 Q gi|254780426|r 177 SMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 177 ~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) .+++.+++..|.+++.+-.+|+|+.|.++|+||..||+. T Consensus 75 ~~~i~~a~~~M~~~~i~~lPVVD~~~~~vGiit~~Dilk 113 (114) T cd04619 75 GDLLHDVWQVMKQRGLKNIPVVDENARPLGVLNARDALK 113 (114) T ss_pred CCCHHHHHHHHHHCCCCEEEEEECCCEEEEEEEHHHEEC T ss_conf 890999999888769858899907995999999354542 No 42 >cd04634 CBS_pair_21 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.44 E-value=5.7e-13 Score=104.86 Aligned_cols=116 Identities=15% Similarity=0.245 Sum_probs=96.0 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCC----------------------- Q ss_conf 129998630103677897986598179984188664224102999998766223564----------------------- Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKT----------------------- 153 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~----------------------- 153 (320) .+++++..++++.++++.+.+++++++||.++ ++++|+++.+|++.++..+.... T Consensus 1 kdvvTv~~d~~l~ea~~~m~~~~i~~lPVvd~--~~lvGivT~~Dil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (143) T cd04634 1 KNPITCNADDTISDAARLLRENKISGAPVLDG--GKLVGIVSESDILKLLVTHDPSGNLWLPSPLELIELPLREFINWEE 78 (143) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 99889589398999999999749977999979--9999999869999887516755432356602332111232222667 Q ss_pred --CCCCCCCCCCHHHHC-CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf --322333457678851-56579717885123234320135651165407766204667588999 Q gi|254780426|r 154 --NNINLNIQLSESNLI-KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 154 --~~~~~~~~~~l~~~~-r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) .........++.++| ++++.+++++++.++.+.|.+++.+..+|+|+ |.++||||..|+++ T Consensus 79 ~~~~~~~~~~~~v~diM~~~~itv~~~~~l~~a~~~m~~~~i~~lPVvd~-~~lvGIItr~Dilk 142 (143) T cd04634 79 TKRALTDAGKMKVRDIMTKKVITISPDASIEDAAELMVRHKIKRLPVVED-GRLVGIVTRGDIIE 142 (143) T ss_pred HHHHHHHHCCCCHHHHCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHCCC T ss_conf 77665432278989958889879989896999999999739888999989-99999999577308 No 43 >cd04629 CBS_pair_16 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.43 E-value=8.8e-13 Score=103.62 Aligned_cols=112 Identities=18% Similarity=0.180 Sum_probs=95.4 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCC-CCEEECC Q ss_conf 299986301036778979865981799841886642241029999987662235643223334576788515-6579717 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLIK-NILFVPS 176 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~r-~~~~Vpe 176 (320) +++++..++|+.++++.+.+++++.+||.+++. +++|+++.+|++..+...... .....++.++|+ +++.+++ T Consensus 2 ~pvtv~~~~tl~~a~~~m~~~~~~~~pVvd~~~-~lvGiit~~Dll~~~~~~~~~-----~~~~~~v~~iM~~~~~tv~~ 75 (114) T cd04629 2 NPVTFTPDMSVTEAVEKLLKSKISGGPVVDDNG-NLVGFLSEQDCLKQLLESSYH-----CDGVATVRDIMTTEVLTVSP 75 (114) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEECCCC-EEEEEEEHHHHHHHHHHHHCC-----CCCCCEEHHHCCCCCEEEEC T ss_conf 899979929999999999971997899994899-299999678999999775405-----88765918851468879957 Q ss_pred CCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 8851232343201356511654077662046675889999 Q gi|254780426|r 177 SMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 177 ~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) ++++.+++..|.+++.+..+|+|+ |.++|+||..||+.+ T Consensus 76 ~~~~~~a~~~m~~~~~~~lpVvd~-g~lvGiIt~~DilrA 114 (114) T cd04629 76 DDSIVDLAQLMLKAKPKRYPVVDD-GKLVGQISRRDVLRA 114 (114) T ss_pred CCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHCC T ss_conf 999999999988669839999989-999999997996339 No 44 >cd04630 CBS_pair_17 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.43 E-value=1.1e-12 Score=102.94 Aligned_cols=111 Identities=23% Similarity=0.308 Sum_probs=96.8 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEECC Q ss_conf 29998630103677897986598179984188664224102999998766223564322333457678851-56579717 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVPS 176 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vpe 176 (320) ++++++.++|+.++.+.|.+++.+++||.+++.+.++|+++.+|++..+...... ....++.++| ++++.+.+ T Consensus 2 dvitv~~~~tv~eA~~~M~~~~i~~v~V~~~~~~~~vGiiT~~Di~~~~~~~~~~------~~~~~V~~vMt~~v~tv~~ 75 (114) T cd04630 2 NVVTIDGLATVAEALQLMKEHGVSSLVVEKRRESDAYGIVTMRDILKKVVAEGRD------PDRVNVYEIMTKPLISVSP 75 (114) T ss_pred CEEEECCCCCHHHHHHHHHHHCCCEEEEEECCCCCEEEEEEEHHHHHHHHHCCCC------CCCCCHHHHCCCCEEEECC T ss_conf 8499998293999999999829988999978996089999818989999846789------4444986827035499999 Q ss_pred CCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 885123234320135651165407766204667588999 Q gi|254780426|r 177 SMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 177 ~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) .+++.+++..|.+++.+.++|+| .|.++|+||..||+. T Consensus 76 d~~~~~a~~~m~~~~i~~lpVvd-~~~lvGiit~~Dil~ 113 (114) T cd04630 76 DMDIKYCARLMERTNIRRAPVVE-NNELIGIISLTDIFL 113 (114) T ss_pred CCCHHHHHHHHHHCCCCEEEEEE-CCEEEEEEEHHHHHC T ss_conf 49499999999977972999999-999999997689647 No 45 >cd04638 CBS_pair_25 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.43 E-value=5.4e-13 Score=105.01 Aligned_cols=105 Identities=23% Similarity=0.259 Sum_probs=91.3 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEEC Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851-5657971 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vp 175 (320) .+++++..++|++++++++.+++++.+||.+++. +++|+++.+|++... ....+..+| ++++++. T Consensus 1 kdvvtv~~~~tv~ea~~lm~~~~i~~lpVvd~~~-~lvGivT~~Dl~~~~-------------~~~~v~~iMt~~~~tv~ 66 (106) T cd04638 1 ENVVYVTLPGTRDDVLELLKEYKVSGVPVVKKSG-ELVGIITRKDLLRNP-------------EEEQLALLMTRDPPTVS 66 (106) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCCC-EEEEEEEHHHHHHCC-------------CCCHHHHHHCCCCEECC T ss_conf 9989999949699999999976999899993899-999998978964487-------------50244675358973679 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 78851232343201356511654077662046675889999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) +++++.++++.|.+++.+..+|+|+ |.++||||..||+.+ T Consensus 67 ~~~~~~~a~~~m~~~~i~~lpVvd~-~~lvGiIt~~DilkA 106 (106) T cd04638 67 PDDDVKEAAKLMVENNIRRVPVVDD-GKLVGIVTVADIVRA 106 (106) T ss_pred CCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHCC T ss_conf 9491999999998769759999999-999999997995359 No 46 >cd04605 CBS_pair_MET2_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the MET2 domain. Met2 is a key enzyme in the biosynthesis of methionine. It encodes a homoserine transacetylase involved in converting homoserine to O-acetyl homoserine. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.43 E-value=3.5e-13 Score=106.23 Aligned_cols=107 Identities=16% Similarity=0.261 Sum_probs=95.0 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEEC Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851-5657971 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vp 175 (320) .+++++..++++.++++.+.+++++++||.+++ ++++|+++.+|++..+.... ..+.++| +++++++ T Consensus 2 r~vvtv~~~~tv~~a~~~m~~~~~~~lpVvd~~-~~~vGiit~~Di~~~~~~~~-----------~~v~~im~~~~~~~~ 69 (110) T cd04605 2 RPVVTISEDASIKEAAKLMIEENINHLPVVDED-GRLVGIVTSWDISKAVARDK-----------KSVEDIMTRNVITAT 69 (110) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEECHHHHHHHHCCC-----------CCHHHCEEECHHHHH T ss_conf 899898993999999999997299889999799-91999996227888987079-----------763360430012014 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 7885123234320135651165407766204667588999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) +..++.++++.|++++.+..+|+|+.|.++|+||..||+. T Consensus 70 ~~~~~~~a~~~m~~~~i~~lPVvD~~~~lvGiiT~~Di~k 109 (110) T cd04605 70 PDEPIDVAARKMERHNISALPVVDAENRVIGIITSEDISK 109 (110) T ss_pred HHHHHHHHHHHHHHCCCEEEEEECCCCEEEEEEEHHHHCC T ss_conf 6558999999999839619999918998999999377211 No 47 >cd04626 CBS_pair_13 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.43 E-value=1.2e-12 Score=102.61 Aligned_cols=109 Identities=19% Similarity=0.323 Sum_probs=95.2 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEEC Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851-5657971 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vp 175 (320) +++.+++.++++.++++.+.+++++.+||++++. +++|+++.+|++.+..... ....++.++| ++++++. T Consensus 1 tdv~tv~~~~tl~eA~~~m~~~~~~~~~Vvd~~~-~l~GIiT~~Di~~~~~~~~--------~~~~~V~~iMt~~~itv~ 71 (111) T cd04626 1 TDFPTIDEDASIREALHEMLKYNTNEIIVKDNEE-KLKGVVTFTDILDLDLFES--------FLEKKVFNIVSQDVFYVN 71 (111) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCCC-EEEEEEEHHHHHHHHHCCC--------CCCCCHHHHCCCCCEEEC T ss_conf 9973999629399999999984998899996999-7999999499999985268--------778888996069987999 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 7885123234320135651165407766204667588999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) ++.++.+++..|++++.+..+|+|+ |.++|+||..||++ T Consensus 72 ~~~~~~~a~~~M~~~~i~~LPVvd~-~klvGiit~~Dil~ 110 (111) T cd04626 72 EEDTIDEALDIMREKQIGRLPVVDD-NKLIGVVRTKDILD 110 (111) T ss_pred CCCCHHHHHHHHHHHCCCEEEEEEC-CEEEEEEEHHHHCC T ss_conf 9793999999999809988999999-99999999698148 No 48 >cd04641 CBS_pair_28 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.43 E-value=9.3e-13 Score=103.44 Aligned_cols=112 Identities=15% Similarity=0.224 Sum_probs=92.2 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-------C Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851-------5 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-------K 169 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-------r 169 (320) .+++++.+++++.++++.+.++++|.+||.+++. +++|+++.+|++.....+..... .......+ . T Consensus 1 r~Vitv~pd~~l~da~~lm~~~~is~lPVVD~~g-~lvGiis~~Dl~~~~~~~~~~~~------~~~~~~~~~~~~~~~~ 73 (120) T cd04641 1 KNIATARPDTPLIDVLDMLVERRVSALPIVDENG-KVVDVYSRFDVINLAKEGAYNNL------DLTVGEALERRSQDFE 73 (120) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCC-CEEEEEEHHHHHHHHHCCCCCCC------CCCHHHHHHCCCCCCC T ss_conf 9978989989999999999980986699987899-69899759999988751554434------4208776421555547 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 6579717885123234320135651165407766204667588999 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) ++..+++++++.+++..|.+++.|..+||||.|.++|+||+.||+. T Consensus 74 ~v~t~~~~~~l~~~~~~m~~~~i~rlpVVD~~~rlvGiIs~~Dilr 119 (120) T cd04641 74 GVRTCSPDDCLRTIFDLIVKARVHRLVVVDENKRVEGIISLSDILQ 119 (120) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHC T ss_conf 8569899997999999999849857989989997999998688238 No 49 >cd04615 CBS_pair_2 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.42 E-value=1.9e-12 Score=101.37 Aligned_cols=109 Identities=17% Similarity=0.229 Sum_probs=95.9 Q ss_pred EEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEECCC Q ss_conf 9998630103677897986598179984188664224102999998766223564322333457678851-565797178 Q gi|254780426|r 99 INAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVPSS 177 (320) Q Consensus 99 i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vpe~ 177 (320) -.++..++++.++++.+.+++++.+||.+++. +++|+++.+|++.+...... ....++.++| +++.++.++ T Consensus 3 P~tv~~d~~l~~A~~~m~~~~~~~~pVvd~~~-~lvGivT~~Di~~~~~~~~~-------~~~~~v~~iMt~~~~ti~~~ 74 (113) T cd04615 3 PSCVVLNTDIARAVAEMYTSGSRALPVVDDKK-RLVGIITRYDVLSYALESEE-------LKDAKVREVMNSPVITIDAN 74 (113) T ss_pred CEEECCCCCHHHHHHHHHHCCCCEEEEECCCC-EEEEEEEHHHHHHHHHHCCC-------CCCCCHHHHCCCCCEEEECC T ss_conf 95989969999999999973997899994899-79999995999999980798-------24589999275697899089 Q ss_pred CCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 85123234320135651165407766204667588999 Q gi|254780426|r 178 MLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 178 ~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) +++.+++..|++++.+..+|+|+.|.++|+||..||+. T Consensus 75 ~~l~~a~~~m~~~~i~~lpVvd~~g~lvGiit~~Dilr 112 (113) T cd04615 75 DSIAKARWLMSNNNISRLPVLDDKGKVGGIVTEDDILR 112 (113) T ss_pred CCHHHHHHHHHHHCCEEEEEECCCCEEEEEEEHHHHHC T ss_conf 93999999999749948999979993999999688326 No 50 >cd04625 CBS_pair_12 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.42 E-value=1.6e-12 Score=101.99 Aligned_cols=111 Identities=15% Similarity=0.257 Sum_probs=96.7 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEEC Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851-5657971 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vp 175 (320) ..++++.+++++.++.+.+.+++++.+||.+++ +++|+++-+|++..+...... ....+++++| +++..++ T Consensus 1 ~~V~Tv~pd~~l~eA~~~M~~~~i~~lvV~~~g--~lvGIiT~rDi~~~~~~~~~~------~~~~~V~~iMt~~~~tv~ 72 (112) T cd04625 1 DTIYTVAPETLLSEAVATMAEQDLGSLVVMERG--ELVGLLTFREVLQAMAQHGAG------VLDTTVRAIMNPEPIVAS 72 (112) T ss_pred CCEEEECCCCCHHHHHHHHHHCCCCEEEEEECC--EEEEEEEHHHHHHHHHHCCCC------CCCCCHHHHEECCCEEEC T ss_conf 947998995979999999987499879995799--999999879999999970998------011708883348977998 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 78851232343201356511654077662046675889999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) +++++.++++.|.+++.+..+|+|+ |.++|+||..||+.+ T Consensus 73 ~~~~i~~a~~~M~~~~i~~lpVvd~-g~lvGiit~~Di~rA 112 (112) T cd04625 73 PDDSIDEVRRLMVERHLRYLPVLDG-GTLLGVISFHDVAKA 112 (112) T ss_pred CCCHHHHHHHHHHHCCCEEEEEEEC-CEEEEEEEHHHHHCC T ss_conf 9996999999768669769999999-999999997996359 No 51 >COG2524 Predicted transcriptional regulator, contains C-terminal CBS domains [Transcription] Probab=99.39 E-value=1.3e-12 Score=102.53 Aligned_cols=130 Identities=17% Similarity=0.305 Sum_probs=116.8 Q ss_pred HHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCC Q ss_conf 99873047757641312001299986301036778979865981799841886642241029999987662235643223 Q gi|254780426|r 78 FNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNIN 157 (320) Q Consensus 78 i~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~ 157 (320) +..+..+.+.+|+++|++ .+.+++.++|+.++.+.+.++++.-.||.+++ +++|+++..|+..++.++.. T Consensus 163 i~~m~siPk~~V~~~~s~--~~i~v~~d~tl~eaak~f~~~~i~GaPVvd~d--k~vGiit~~dI~~aia~g~~------ 232 (294) T COG2524 163 ISKMVSIPKEKVKNLMSK--KLITVRPDDTLREAAKLFYEKGIRGAPVVDDD--KIVGIITLSDIAKAIANGNL------ 232 (294) T ss_pred EEEEEECCCCHHHHHCCC--CCEEECCCCCHHHHHHHHHHCCCCCCCEECCC--CEEEEEEHHHHHHHHHCCCC------ T ss_conf 743264573026662667--84676688639999999997286678510289--45899988999999976896------ Q ss_pred CCCCCCHHHHCCC-CEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHH Q ss_conf 3345767885156-57971788512323432013565116540776620466758899998742 Q gi|254780426|r 158 LNIQLSESNLIKN-ILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRD 220 (320) Q Consensus 158 ~~~~~~l~~~~r~-~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGe 220 (320) ...++++||+ ++.+.+++.++++++.|.+++....+|+|..|..+|++|..|||..|.|. T Consensus 233 ---~~kV~~~M~k~vitI~eDe~i~dAir~M~~~nVGRLlV~ds~gkpvGiITrTDIL~~ia~~ 293 (294) T COG2524 233 ---DAKVSDYMRKNVITINEDEDIYDAIRLMNKNNVGRLLVTDSNGKPVGIITRTDILTRIAGL 293 (294) T ss_pred ---CCCHHHHHCCCCCEECCCHHHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHHHCC T ss_conf ---5438887505770475730499999998763864699984699678788557888886436 No 52 >cd04593 CBS_pair_EriC_assoc_bac_arch This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in bacteria and archaea. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS d Probab=99.38 E-value=2.2e-12 Score=100.95 Aligned_cols=110 Identities=15% Similarity=0.150 Sum_probs=94.0 Q ss_pred EEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEECCC Q ss_conf 9998630103677897986598179984188664224102999998766223564322333457678851-565797178 Q gi|254780426|r 99 INAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVPSS 177 (320) Q Consensus 99 i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vpe~ 177 (320) ..++..++++.|+.+.+.+++++.+||.+++. .++|+++.+|++.++...... ....+.++| +++++++++ T Consensus 3 P~tV~~~~tv~ea~~~m~~~~~~~~pVvD~~g-~lvGIvT~~Dl~r~~~~~~~~-------~~~~v~~vmt~~~~tv~pd 74 (115) T cd04593 3 PPVLSATTPLREAAEQLIESKHGSALVVDRDG-GVVGIITLPDLLRALEADEAG-------EPSAVDEVATPPLLTVHPD 74 (115) T ss_pred CEEECCCCCHHHHHHHHHHCCCCEEEEECCCC-CEEEEEEHHHHHHHHHCCCCC-------CCCEEEEEECCCCEEECCC T ss_conf 83889919999999999874998999994999-889998899999999738886-------6779454303896689999 Q ss_pred CCHHHHHHHHHCCCCCCEEEEECC--CCCEEEEEHHHHHHH Q ss_conf 851232343201356511654077--662046675889999 Q gi|254780426|r 178 MLVSDLLTNIQESRIRMALVIDEH--GGTDGLVSYEDIVSV 216 (320) Q Consensus 178 ~~l~~lL~~m~~~~~~~aiVvDE~--G~~~GiVTleDIle~ 216 (320) .++.++++.|.+++.+...|+|+. |.++|+||..||+.+ T Consensus 75 ~~l~~al~~M~~~~i~~LPVVd~~~~~~lvGiit~~Dil~A 115 (115) T cd04593 75 EPLAHALDRMASRGLRQLPVVDRGNPGQVLGLLTRENVLLA 115 (115) T ss_pred CHHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEEHHHHHCC T ss_conf 94999999999869988999978999989999997996569 No 53 >cd04595 CBS_pair_DHH_polyA_Pol_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with an upstream DHH domain which performs a phosphoesterase function and a downstream polyA polymerase domain. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.38 E-value=1.5e-12 Score=102.15 Aligned_cols=107 Identities=22% Similarity=0.381 Sum_probs=93.1 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEEC Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851-5657971 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vp 175 (320) +.++++..++++.++.+.+.+++++.+||.+++ +++|+++.+|+...+..+. .+.++.++| +++..+. T Consensus 2 spV~ti~p~~tv~ea~~~m~~~~i~~lpVvd~~--~lvGivT~~Di~~~~~~~~---------~~~~v~~iM~~~~~tv~ 70 (110) T cd04595 2 SPVKTVRPEATIEEARELLLRYGHTALPVVEGG--RVVGIISRRDVEKALRHGL---------GHAPVKDYMSTDVVTVP 70 (110) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECC--EEEEEEEHHHHHHHHHCCC---------CCCCCEEEEECCEEEEC T ss_conf 898997995999999999997599889999899--9999997689887765376---------78812103253668866 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 7885123234320135651165407766204667588999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) +++++.+++..|.+++.+..+|+|+ |.++|+||..||+. T Consensus 71 ~~~~i~~a~~~m~~~~~~~lpVvd~-g~lvGiit~~Dilk 109 (110) T cd04595 71 PDTPLSEVQELMVEHDIGRVPVVED-GRLVGIVTRTDLLR 109 (110) T ss_pred CCCCHHHHHHHHHHCCCEEEEEEEC-CEEEEEEEHHHEEC T ss_conf 9993999999889749139999989-99999998415615 No 54 >cd04587 CBS_pair_CAP-ED_DUF294_PBI_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with either the CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain or the PB1 (Phox and Bem1p) domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. The PB1 domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pai Probab=99.38 E-value=1.3e-12 Score=102.57 Aligned_cols=112 Identities=17% Similarity=0.175 Sum_probs=93.5 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEEC Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851-5657971 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vp 175 (320) ++.+++..++|+.++.+.+.+++++.+||.+++ +++|+++.+|++..+...... ....++.++| +++.+++ T Consensus 1 t~Pitv~p~~ti~ea~~~M~~~~i~~~~V~d~~--~l~GivT~~Dl~~~~~~~~~~------~~~~~v~~iMt~~~~tv~ 72 (113) T cd04587 1 TKPATVSPTTTVQEAAKLMREKRVSCVLVMDGN--KLVGIFTSKDIALRVVAQGLD------PESTLVERVMTPNPVCAT 72 (113) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECC--EEEEEEEEHHHHHHHHHCCCC------HHHCEEEEEECCCCEEEE T ss_conf 989898994999999999997299889999899--999999845987899873998------320898866616846990 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 78851232343201356511654077662046675889999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) ++.++.+++..|.+++.+-.+|+|+.|.++|+||..||+.+ T Consensus 73 ~~~~l~~a~~~m~~~~i~~lpVvd~~~~~vGiit~~Dil~A 113 (113) T cd04587 73 SDTPVLEALHLMVQGKFRHLPVVDKSGQVVGLLDVTKLTHA 113 (113) T ss_pred CCCCHHHHHHHHHHCCCEEEEEEECCCEEEEEEEHHHHHCC T ss_conf 79999999999987794199999269989999984885278 No 55 >cd04583 CBS_pair_ABC_OpuCA_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the ABC transporter OpuCA. OpuCA is the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment but the function of the CBS domains in OpuCA remains unknown. In the related ABC transporter, OpuA, the tandem CBS domains have been shown to function as sensors for ionic strength, whereby they control the transport activity through an electronic switching mechanism. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. They are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyz Probab=99.38 E-value=1.1e-12 Score=102.85 Aligned_cols=107 Identities=16% Similarity=0.212 Sum_probs=93.2 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEEC Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851-5657971 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vp 175 (320) .+++++..++++.++++.+.+++++.+||++++. .++|+++.+|+..... ...++.++| +++..+. T Consensus 2 r~pitv~~~~tv~eal~~m~~~~~~~l~Vvd~~~-~lvGivt~~di~~~~~------------~~~~v~~im~~~~~tv~ 68 (109) T cd04583 2 KNPVTITPDRTLAEAIKLMRDKKVDSLLVVDKDN-KLLGIVSLESLEQAYK------------EAKSLEDIMLEDVFTVQ 68 (109) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCC-EEEEEEEHHHHHHCCC------------CCCEEEECCEEEEEEEC T ss_conf 9899979939799999999973997899995899-8999998104343005------------89678751262028974 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 78851232343201356511654077662046675889999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) +++++.++++.|.+++.+..+|+|+.|.++|+||..||+++ T Consensus 69 ~~~~~~~a~~~m~~~~~~~lPVVd~~~~lvGiiT~~dll~t 109 (109) T cd04583 69 PDASLRDVLGLVLKRGPKYVPVVDEDGKLVGLITRSSLVDT 109 (109) T ss_pred CCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHCC T ss_conf 89999999999985598388899649999999996882376 No 56 >cd04599 CBS_pair_GGDEF_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the GGDEF (DiGuanylate-Cyclase (DGC)) domain. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.37 E-value=1.9e-12 Score=101.46 Aligned_cols=103 Identities=16% Similarity=0.190 Sum_probs=90.3 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEECC Q ss_conf 29998630103677897986598179984188664224102999998766223564322333457678851-56579717 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVPS 176 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vpe 176 (320) +++++..++|+.++.+.|.+++++.+||.+++ +++|+++.+|+..+ .....+.++| ++++++++ T Consensus 2 d~vtv~p~~tv~ea~~lM~~~~i~~lpVv~~~--~lvGivT~~Dl~~~-------------~~~~~v~~iMt~~~itv~~ 66 (105) T cd04599 2 DPITIDPLDSVGRAARLMEKHRIGGLPVVEDG--KLVGIITSRDVRRA-------------HPNRLVADAMTREVVTISP 66 (105) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEEECC--EEEEEEECHHHHCC-------------CCCCCHHHHCCCCCEEECC T ss_conf 98996894989999999986399889999999--99899981053127-------------8887968850279899979 Q ss_pred CCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 8851232343201356511654077662046675889999 Q gi|254780426|r 177 SMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 177 ~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) ++++.++++.|.+++.+..+|+|+ |.++|+||..||..| T Consensus 67 ~~~l~~a~~~M~~~~i~~lpVvd~-g~lvGiiT~~Divre 105 (105) T cd04599 67 EASLLEAKRLMEEKKIERLPVLRE-RKLVGIITKGTIALE 105 (105) T ss_pred CCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEECHHHCCC T ss_conf 491999999779859979999989-999999994586158 No 57 >cd04801 CBS_pair_M50_like This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the metalloprotease peptidase M50. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.37 E-value=3.8e-12 Score=99.42 Aligned_cols=108 Identities=16% Similarity=0.222 Sum_probs=91.5 Q ss_pred EEEEEEECCCHHHHHHH-HHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC---CCCEE Q ss_conf 29998630103677897-986598179984188664224102999998766223564322333457678851---56579 Q gi|254780426|r 98 SINAVEDKATVYEAMLM-FEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI---KNILF 173 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~-i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~---r~~~~ 173 (320) ++.+++.++|+.|+.+. +..+++++|||.+++. .++|+++..|+........ ....+.+++ +.... T Consensus 2 dv~tv~~~~tl~e~~~~~~~~~~~~~~pVvd~~g-~l~Givt~~dl~~~~~~~~---------~~~~v~~v~~~~~~~~~ 71 (114) T cd04801 2 DFPTVPAHLTLREFVREYVLGSNQRRFVVVDNEG-RYVGIISLADLRAIPTSQW---------AQTTVIQVMTPAAKLVT 71 (114) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCC-EEEEEEEHHHHHHHHHHCC---------CCCCHHHHCCCCCCCEE T ss_conf 9558499995999999998507966899987899-7999999999875454025---------68847782102778758 Q ss_pred ECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 717885123234320135651165407766204667588999 Q gi|254780426|r 174 VPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 174 Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) +.+++++.++++.|.+++.+..+|||+.|.++|+||..||+. T Consensus 72 v~~~~~l~~al~~m~~~~~~~LpVVd~~g~lvGiit~~Dilr 113 (114) T cd04801 72 VLSEESLAEVLKLLEEQGLDELAVVEDSGQVIGLITEADLLR 113 (114) T ss_pred ECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHC T ss_conf 999997999999999769867889947990999999788436 No 58 >cd04602 CBS_pair_IMPDH_2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the inosine 5' monophosphate dehydrogenase (IMPDH) protein. IMPDH is an essential enzyme that catalyzes the first step unique to GTP synthesis, playing a key role in the regulation of cell proliferation and differentiation. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in IMPDH have been associated with retinitis pigmentos Probab=99.36 E-value=2.6e-12 Score=100.58 Aligned_cols=107 Identities=19% Similarity=0.265 Sum_probs=89.1 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCC--CCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCCC---C Q ss_conf 129998630103677897986598179984188--66422410299999876622356432233345767885156---5 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNS--LDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLIKN---I 171 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~--~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~r~---~ 171 (320) ++.+++..++|+.++++++.+++|+++||.++. ...++|+++.+|+... .....++.++|.+ + T Consensus 2 ~dPvtv~p~~tv~ea~~lm~~~~~~~~pVvd~g~~~~klvGIvT~rDi~~~------------~~~~~~v~~iMt~~~~~ 69 (114) T cd04602 2 TDPSVLSPDHTVADVLEIKEKKGFSGIPVTEDGKSGGKLLGIVTSRDIDFL------------TDSETPLSEVMTPREVL 69 (114) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCCCCEEEEEEEHHHHHHC------------CCCCCCCCCEEECCCEE T ss_conf 889997998999999999997198779999657768969999997895235------------07676563516527668 Q ss_pred EEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 79717885123234320135651165407766204667588999 Q gi|254780426|r 172 LFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 172 ~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) +..+++.++.++++.|.+++.+..+||||.|.++||||..||+. T Consensus 70 i~~~~~~~l~ea~~~m~~~~i~~LPVVd~~g~LvGiIT~~Di~k 113 (114) T cd04602 70 VVAPTGITLEEANEILRESKKGKLPIVNDDGELVALVTRSDLKK 113 (114) T ss_pred EEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEECHHHCC T ss_conf 99479878999999999749667819978996999999244036 No 59 >cd04582 CBS_pair_ABC_OpuCA_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the ABC transporter OpuCA. OpuCA is the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment but the function of the CBS domains in OpuCA remains unknown. In the related ABC transporter, OpuA, the tandem CBS domains have been shown to function as sensors for ionic strength, whereby they control the transport activity through an electronic switching mechanism. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. They are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzi Probab=99.36 E-value=5.5e-12 Score=98.42 Aligned_cols=104 Identities=14% Similarity=0.102 Sum_probs=91.2 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEEC Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851-5657971 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vp 175 (320) ++.+++..++++.++++.+.+++++.+||++++. +++|+++.+|+.... .....++| +.+..++ T Consensus 1 t~pvtv~pd~~l~eA~~lm~~~~~~~l~VVd~~g-~l~Givt~~Dl~~~~--------------~~~v~d~m~~~~~tv~ 65 (106) T cd04582 1 TEPITVRPDDPLSDALGLMDDSDLRALTVVDADG-QPLGFVTRREAARAS--------------GGCCGDHAEPFKVTVS 65 (106) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCC-CEEEEEEHHHHHHHC--------------CCCHHHCCCCCCEEEC T ss_conf 9983899989399999999973998799997999-789998999987630--------------8953451025766997 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 7885123234320135651165407766204667588999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) .+.++.++++.|.+++.+...|||+.|.++|+||..||+. T Consensus 66 ~d~~l~~a~~~M~~~~i~~lPVVD~~grlvGivT~~Di~~ 105 (106) T cd04582 66 VDDDLRIVLSRMFAHDMSWLPCVDEDGRYVGEVTQRSIAD 105 (106) T ss_pred CCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHC T ss_conf 9999999999999729876258989990999998688425 No 60 >cd04620 CBS_pair_7 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.36 E-value=6.7e-12 Score=97.85 Aligned_cols=110 Identities=19% Similarity=0.213 Sum_probs=93.5 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEECC Q ss_conf 29998630103677897986598179984188664224102999998766223564322333457678851-56579717 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVPS 176 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vpe 176 (320) +.+++..++|+.++++.|.+++.+.+||.+++. +++|+++-+|+++.+..+.. ....++.++| +++.++.+ T Consensus 2 ~pitv~pd~tv~ea~~~M~~~~~~~~~vV~~~~-~lvGIvT~~Di~r~~~~~~~-------~~~~~v~~vMt~~~itv~~ 73 (115) T cd04620 2 HPLTVTPDTPVADAIALMSQQGDSSCVLVVEKG-RLLGIFTERDIVRLTAIGKD-------LSDLPIGEVMTQPVVTLQE 73 (115) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCC-EEEEEEEHHHHHHHHHCCCC-------CCCCCHHHHCCCCCEEECC T ss_conf 699969919999999999983994699993799-29999986999999985998-------1348988963689679838 Q ss_pred CC--CHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 88--5123234320135651165407766204667588999 Q gi|254780426|r 177 SM--LVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 177 ~~--~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) +. .+.+++..|.+++.+-.+|+|+.|.++|+||..||+. T Consensus 74 ~~~~~i~~a~~~M~~~~i~~lpVvd~~g~lvGiit~~Dilk 114 (115) T cd04620 74 SEIQDIFTALSLFRQHQIRHLPVLDDQGQLIGLVTAESIRQ 114 (115) T ss_pred CCCCCHHHHHHHHHHCCEEEEEEECCCCEEEEEEEHHHCCC T ss_conf 99726999999987559528999957997999999256232 No 61 >cd04589 CBS_pair_CAP-ED_DUF294_assoc_bac This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the bacterial CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or Probab=99.34 E-value=1.4e-11 Score=95.81 Aligned_cols=108 Identities=15% Similarity=0.181 Sum_probs=94.3 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEECC Q ss_conf 29998630103677897986598179984188664224102999998766223564322333457678851-56579717 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVPS 176 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vpe 176 (320) .+++++.++|+.++.+.+.+++.+.+||.+++ +.+|+++-+|+..++..+.. ....++.++| +++.++++ T Consensus 2 ~pvtv~~~~ti~ea~~~M~~~~i~~~vV~~~~--~~~GIvT~~Dl~~~~~~~~~-------~~~~~V~~vms~~~~ti~~ 72 (111) T cd04589 2 PPLIVDASTSIRDAARLMREHGADALLVRDGD--PRLGIVTRTDLLDAVLLDGL-------PSSTPVGEIATFPLITVDP 72 (111) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEEECC--EEEEEEEHHHHHHHHHCCCC-------CCCCCHHHHCCCCCEEECC T ss_conf 99899992989999999997099889997699--27999997998999982999-------6788999928799989999 Q ss_pred CCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 885123234320135651165407766204667588999 Q gi|254780426|r 177 SMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 177 ~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) ++++.+++..|.+++.+..+|+|+ |.++|+||..|||. T Consensus 73 ~~~~~~a~~~M~~~~i~~lpV~d~-~~~vGivt~~Dil~ 110 (111) T cd04589 73 DDFLFNALLLMTRHRIHRVVVREG-GEVVGVLEQTDLLS 110 (111) T ss_pred CCCHHHHHHHHHHCCCEEEEEEEC-CEEEEEEEHHHHHC T ss_conf 792999999999879808899989-99999999688228 No 62 >cd04802 CBS_pair_3 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.33 E-value=9.6e-12 Score=96.82 Aligned_cols=108 Identities=19% Similarity=0.314 Sum_probs=93.9 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEECC Q ss_conf 29998630103677897986598179984188664224102999998766223564322333457678851-56579717 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVPS 176 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vpe 176 (320) ++++++.++|+.++.+.+.+++++.+||.+++ +++|+++.+|++..+..... .....++.++| +++.++++ T Consensus 2 ~vvti~~~~tl~~a~~~m~~~~i~~l~V~d~~--~~vGivt~~Di~~~~~~~~~------~~~~~~v~~iM~~~~~tv~~ 73 (112) T cd04802 2 NVITVDPDTTVYEAANIMTENNIGRLIVVDNE--KPVGIITERDLVKKVVSRNL------KPREVPVGEVMSTPLITIDP 73 (112) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEEECC--EEEEEEEECHHHHHHHHCCC------CCCCCCHHHHHCCCCEEECC T ss_conf 97896891989999999997499879999899--99999982187888875479------81238999972348559947 Q ss_pred CCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHH Q ss_conf 88512323432013565116540776620466758899 Q gi|254780426|r 177 SMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIV 214 (320) Q Consensus 177 ~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIl 214 (320) ++++.++++.|.+++.+..+|+|+ |.+.|+||..||+ T Consensus 74 ~~~l~~a~~~m~~~~~~~lpVvd~-~~lvGiIt~~Dil 110 (112) T cd04802 74 NASLNEAAKLMAKHGIKRLPVVDD-DELVGIVTTTDIV 110 (112) T ss_pred CCHHHHHHHHHHHHCCCEEEEEEC-CEEEEEEECHHHC T ss_conf 986999999878719858999989-9999999806710 No 63 >cd04607 CBS_pair_NTP_transferase_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain associated with the NTP (Nucleotidyl transferase) domain downstream. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.32 E-value=1.1e-11 Score=96.52 Aligned_cols=107 Identities=20% Similarity=0.252 Sum_probs=95.2 Q ss_pred EEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEECCCC Q ss_conf 998630103677897986598179984188664224102999998766223564322333457678851-5657971788 Q gi|254780426|r 100 NAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVPSSM 178 (320) Q Consensus 100 ~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vpe~~ 178 (320) +++..++|+.++++.+.++++..+||.+++. .++|+++-+|+.+++..+.. .+.++.++| +++.++.++. T Consensus 5 i~v~~~~tl~ea~~~m~~~~~~~~~Vvd~~~-~l~GiiT~~Di~r~l~~~~~--------~~~~v~~im~~~~~~i~~~~ 75 (113) T cd04607 5 LLVSPDASILDALRKIDKNALRIVLVVDENG-RLLGTVTDGDIRRALLKGLS--------LDDPVSEVMNRNPITAKVGS 75 (113) T ss_pred EEECCCCCHHHHHHHHHHCCCCEEEEECCCC-EEEEEEECHHHHHHHHCCCC--------CCCCHHHHHCCEEEEEECCC T ss_conf 8969949899999999875997899997999-59999987277668762898--------45615675212129998799 Q ss_pred CHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 5123234320135651165407766204667588999 Q gi|254780426|r 179 LVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 179 ~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) ++.+++..|.+.+.+..+|+|+.|.++|++|..||+. T Consensus 76 ~~~~a~~~m~~~~i~~lPVvd~~~~lvGiit~~Dll~ 112 (113) T cd04607 76 SREEILALMRERSIRHLPILDEEGRVVGLATLDDLLS 112 (113) T ss_pred CHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHC T ss_conf 8999999998779749999978994999999799308 No 64 >cd04800 CBS_pair_CAP-ED_DUF294_PBI_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with either the CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain or the PB1 (Phox and Bem1p) domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. The PB1 domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pa Probab=99.30 E-value=1.1e-11 Score=96.38 Aligned_cols=108 Identities=19% Similarity=0.211 Sum_probs=93.4 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEECC Q ss_conf 29998630103677897986598179984188664224102999998766223564322333457678851-56579717 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVPS 176 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vpe 176 (320) ++++++.++|+.++.+.+.+++++.+||.++ ++++|+++.+|++..+..... ..+.++.++| +++.++++ T Consensus 2 ~~vtv~~~~tv~ea~~~m~~~~i~~v~V~~~--~~~vGiit~~Di~~~~~~~~~-------~~~~~v~~im~~~~~tv~~ 72 (111) T cd04800 2 PPVTCSPDTTIREAARLMTEHRVSSLLVVDD--GRLVGIVTDRDLRNRVVAEGL-------DPDTPVSEVMTAPPITIPP 72 (111) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEECHHHHHHHHCCC-------CCCCCHHHHCCCCCEEEEC T ss_conf 9989789197999999999729998999989--999999995578899871679-------8566588851268179988 Q ss_pred CCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 885123234320135651165407766204667588999 Q gi|254780426|r 177 SMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 177 ~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) ++++.+++..|.+++.+..+|+|+ |.+.|+||..||+. T Consensus 73 ~~~l~~a~~~m~~~~i~~lpVvd~-~~lvGiit~~Dilk 110 (111) T cd04800 73 DATVFEALLLMLERGIHHLPVVDD-GRLVGVISATDLLR 110 (111) T ss_pred CCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHEEC T ss_conf 991999999998539958999989-99999998545736 No 65 >cd04606 CBS_pair_Mg_transporter This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain in the magnesium transporter, MgtE. MgtE and its homologs are found in eubacteria, archaebacteria, and eukaryota. Members of this family transport Mg2+ or other divalent cations into the cell via two highly conserved aspartates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.29 E-value=1.3e-11 Score=96.04 Aligned_cols=102 Identities=19% Similarity=0.302 Sum_probs=88.8 Q ss_pred EEECCCHHHHHHHHHHCC-----CCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCC-CCEEEC Q ss_conf 863010367789798659-----81799841886642241029999987662235643223334576788515-657971 Q gi|254780426|r 102 VEDKATVYEAMLMFEKYG-----RSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLIK-NILFVP 175 (320) Q Consensus 102 i~~~~~~~e~~~~i~~~~-----~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~r-~~~~Vp 175 (320) +..+.|++++++.++++. .+.+||++++. +.+|+++.+|++.. .++.++.++|+ ++.+++ T Consensus 2 ~~~~~TV~eai~~lr~~~~~~~~~~~i~Vvd~~~-~l~G~vt~~dll~~-------------~~~~~v~~iM~~~~~~v~ 67 (109) T cd04606 2 VREDWTVGEALEYLRRNADDPETIYYIYVVDEEG-RLLGVVSLRDLLLA-------------DPDTPVSDIMDTDVISVS 67 (109) T ss_pred CCCCCCHHHHHHHHHHCCCCCCEEEEEEEECCCC-EEEEEEEHHHHHHC-------------CCCCCHHHHHCCCEEECC T ss_conf 6998899999999985188856177999997999-79999986772241-------------888716663002136337 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHH Q ss_conf 788512323432013565116540776620466758899998 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVL 217 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~i 217 (320) +++++.++++.|.+++.+.++|||+.|.+.|+||..|+++.+ T Consensus 68 ~~~~~~~a~~~m~~~~~~~lPVVd~~~~lvGiIt~~Di~~~~ 109 (109) T cd04606 68 ADDDQEEVARLFEKYDLLALPVVDEEGRLVGIITVDDVIDVI 109 (109) T ss_pred CCCCHHHHHHHHHHHCCCEEEEECCCCEEEEEEEHHHHHHHC T ss_conf 999899999999985276046898899799999968968449 No 66 >PRK01862 putative voltage-gated ClC-type chloride channel ClcB; Provisional Probab=99.26 E-value=4.8e-11 Score=92.26 Aligned_cols=124 Identities=15% Similarity=0.214 Sum_probs=105.2 Q ss_pred CCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 47757641312001299986301036778979865981799841886642241029999987662235643223334576 Q gi|254780426|r 84 FRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLS 163 (320) Q Consensus 84 l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~ 163 (320) +...+|+|+|.+ +...++.++++.|+.+.+.+++|+.+||.+++. ..+|+++.+|+...+.+.. ...... T Consensus 453 l~~~~v~dvm~~--~~~~v~~~~~l~e~~~~~~~~~~~~~~Vvd~~g-~l~Giv~l~dl~~~~~~~~-------~~~~~~ 522 (583) T PRK01862 453 LRTTQMRELIQP--AQTVVPLTASVADMTRVFLEYPVKYLYVTDDDG-RFRGAVALKDITSDLLDKR-------DTTDKT 522 (583) T ss_pred HHHCCHHHHCCC--CCCEECCCCCHHHHHHHHHHCCCEEEEEECCCC-EEEEEEEHHHHHHHHHCCC-------CCCCCC T ss_conf 850809996387--882349999899999999847970799988999-7999978999887751635-------556460 Q ss_pred HHHHCC-CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCC--CCEEEEEHHHHHHHH Q ss_conf 788515-657971788512323432013565116540776--620466758899998 Q gi|254780426|r 164 ESNLIK-NILFVPSSMLVSDLLTNIQESRIRMALVIDEHG--GTDGLVSYEDIVSVL 217 (320) Q Consensus 164 l~~~~r-~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G--~~~GiVTleDIle~i 217 (320) ..++++ ++.++.+++++.++++.|.+.......|||+.+ .++|+||..|++.+- T Consensus 523 ~~~~~~~~~~~l~~~~sL~~al~~f~~~~~~~LPVVd~~~~~~lvGiit~~dil~aY 579 (583) T PRK01862 523 AADYAHTPFPLLTPDMPLRDALEHFMAFQGERLPVVESEASPTLAGVVYKTSLLDAY 579 (583) T ss_pred HHHHHCCCCCEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEEHHHHHHHH T ss_conf 999817999578899989999999996699877899559998799997789999999 No 67 >cd04617 CBS_pair_4 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.25 E-value=4.2e-11 Score=92.60 Aligned_cols=108 Identities=14% Similarity=0.135 Sum_probs=93.8 Q ss_pred EEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCC---CCEEEC Q ss_conf 99986301036778979865981799841886642241029999987662235643223334576788515---657971 Q gi|254780426|r 99 INAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLIK---NILFVP 175 (320) Q Consensus 99 i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~r---~~~~Vp 175 (320) .+++..++++.+|.+.+.+++++.+||.+++. .++|+++-+|+++.+..+.. ....++.++|. .+.++. T Consensus 3 Pvtv~~~~si~eA~~~M~~~~ig~l~VVD~~~-~LvGIiT~rDl~r~~~~~~~-------~~~~pV~~iMT~~P~vvt~~ 74 (118) T cd04617 3 PVVVRENTSVYDAIVTLFLEDVGSLFVVDEDG-DLVGVVSRKDLLKASIGGAD-------LQKVPVGVIMTRMPNITTTT 74 (118) T ss_pred CEEECCCCCHHHHHHHHHHHCCCEEEEECCCC-CEEEEEEHHHHHHHHHCCCC-------CCCCCHHHHHCCCCCEEEEC T ss_conf 97989969999999999983998799984999-59999878999999974898-------46782999857799719978 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCC---CCEEEEEHHHHH Q ss_conf 788512323432013565116540776---620466758899 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHG---GTDGLVSYEDIV 214 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G---~~~GiVTleDIl 214 (320) ++.++.++++.|.+++.+-..|||+.| .++|+||..||+ T Consensus 75 pd~~v~~A~~~M~~~~i~~LPVVd~~~~~~~vvGiIT~~Di~ 116 (118) T cd04617 75 PEESVLEAAKKLIEHQVDSLPVVEKVDEGLEVIGRITKTNIT 116 (118) T ss_pred CCCCHHHHHHHHHHCCCCEECEEECCCCCEEEEEEEECCCEE T ss_conf 999399999999982999856794399740899999775150 No 68 >cd04603 CBS_pair_KefB_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the KefB (Kef-type K+ transport systems) domain which is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.24 E-value=5.2e-11 Score=92.01 Aligned_cols=108 Identities=19% Similarity=0.246 Sum_probs=92.6 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEECC Q ss_conf 29998630103677897986598179984188664224102999998766223564322333457678851-56579717 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVPS 176 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vpe 176 (320) +-++++.+.|+.++++.+.+.+++.+||++++. .++|+++..|++....... ......++| .++..+.+ T Consensus 2 ~pvtv~~~~~L~~a~~~~~e~~~~~i~VVD~~~-k~vG~it~~Dll~~~~~~~---------~~~~V~d~m~~~v~~i~~ 71 (111) T cd04603 2 QTVSVNCENPLREAIKMINELGARAVVVVDEEN-KVLGQVTLSDLLEIGPNDY---------ETLKVCEVYIVPVPIVYC 71 (111) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCC-CEEEEEEHHHHHHHCHHHH---------HCCCHHHHHCCCCCCCCC T ss_conf 446865999599999999876997689988999-6889988999864384455---------324387751567746389 Q ss_pred CCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 885123234320135651165407766204667588999 Q gi|254780426|r 177 SMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 177 ~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) +.++.++++.|++.+.+...|||..|.+.|+||-.|||. T Consensus 72 ~~~v~dalr~~~~~~~~~l~VVD~~grlvGiI~~rdlLr 110 (111) T cd04603 72 DSKVTDLLRIFRETEPPVVAVVDKEGKLVGTIYERELLR 110 (111) T ss_pred CCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHC T ss_conf 976899999987449988999938997999996255123 No 69 >cd04614 CBS_pair_1 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.22 E-value=2.4e-11 Score=94.18 Aligned_cols=94 Identities=18% Similarity=0.288 Sum_probs=86.9 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCCCCEEECCC Q ss_conf 29998630103677897986598179984188664224102999998766223564322333457678851565797178 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLIKNILFVPSS 177 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~Vpe~ 177 (320) +++++..++++.++.+.+.+++++.+||++++. .++|+++-+|++. ..++++++++ T Consensus 2 dv~tv~~~~~v~~a~~~m~~~~i~~lpVvd~~g-~lvGIiT~rDli~-----------------------~~~~itv~~~ 57 (96) T cd04614 2 NVPTVWEETPLPVAVRIMELANVKALPVLDDDG-KLSGIITERDLIA-----------------------KSEVVTATKR 57 (96) T ss_pred CCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCC-EEEEEEECHHHCC-----------------------CCCCEEECCC T ss_conf 963908969899999999972998799998999-4999998888825-----------------------7788797998 Q ss_pred CCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 85123234320135651165407766204667588999 Q gi|254780426|r 178 MLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 178 ~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) +++.++.+.|.+++.+..+|+|+.|.++||||..||+. T Consensus 58 ~~v~eaa~lM~~~~I~~LPVvd~~~~lvGiit~~Dllk 95 (96) T cd04614 58 TTVSECAQKMKRNRIEQIPIINGNDKLIGLLRDHDLLK 95 (96) T ss_pred CCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHCCC T ss_conf 90999999999859988628989991999988798048 No 70 >PRK07107 inositol-5-monophosphate dehydrogenase; Validated Probab=99.22 E-value=6.5e-11 Score=91.36 Aligned_cols=166 Identities=14% Similarity=0.156 Sum_probs=121.1 Q ss_pred HHHHHHHHHHHHCCC--CC-----HHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEEC Q ss_conf 999999997997689--89-----89999999873047757641312001299986301036778979865981799841 Q gi|254780426|r 55 KGIRLPRIDASLEGI--FS-----ETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYK 127 (320) Q Consensus 55 eel~~l~~~~~~~G~--i~-----~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~ 127 (320) -|-++-+..+.+.|. |. +++.++++.+=++..--+. +-+++.+++++.++++++.+++||-+||.+ T Consensus 63 TE~~MAIamA~~GGiGVIH~Nmsie~Qa~~V~~VKr~esg~I~-------dPvti~p~~Tv~da~~l~~k~~~sg~PVvd 135 (497) T PRK07107 63 SDDNMAIALAREGGLSFIFGSQSIESEAAMVRRVKNHKAGFVI-------SDSNLTPDNTLADVLDLKERTGHSTIAVTE 135 (497) T ss_pred CCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHCCCCCCCC-------CCCCCCCCCCHHHHHHHHHHCCCCEEEEEE T ss_conf 5099999999779979987999999999999999563567668-------981758986499999988871996234510 Q ss_pred CC--CCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCC---CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCC Q ss_conf 88--6642241029999987662235643223334576788515---657971788512323432013565116540776 Q gi|254780426|r 128 NS--LDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLIK---NILFVPSSMLVSDLLTNIQESRIRMALVIDEHG 202 (320) Q Consensus 128 ~~--~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~r---~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G 202 (320) +. ..+++|+|+-+|+-.. ......++.++|. .++++|+..++.++...|++++...++|||+.| T Consensus 136 ~g~~~gkLvGIvT~RD~rf~-----------~~~~~~~V~~vMT~~~~Lvt~~~g~sl~eA~~lL~~~kiekLpvVd~~g 204 (497) T PRK07107 136 DGTANGKLLGIVTSRDYRVS-----------RMSLDTKVKDFMTPFEKLVVAKVGITLKEANNIIWDHKLNTLPIIDKNQ 204 (497) T ss_pred CCCCCCEEEEEEECCEEECC-----------CCCCCCCHHHHCCCCCCEEECCCCCCHHHHHHHHHHCCCCCEEEECCCC T ss_conf 37859879999846401111-----------3677776656406765427846889999999999863444302783799 Q ss_pred CCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCCH Q ss_conf 6204667588999987422210000000012467733783045359 Q gi|254780426|r 203 GTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTDL 248 (320) Q Consensus 203 ~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~l 248 (320) .+.|++|..||.... ++ +...+-..|.++|-+.+.. T Consensus 205 ~L~gLiT~kDi~k~~------~~----P~a~~D~~grL~VgAAIg~ 240 (497) T PRK07107 205 HLVYMVFRKDYDSHK------EN----PLELLDSSKRYVVGAGINT 240 (497) T ss_pred CEEEEEEHHHHHHHH------HC----CCCCCCCCCCEEEEEECCC T ss_conf 589999837888864------39----6311376788889996377 No 71 >cd04591 CBS_pair_EriC_assoc_euk_bac This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes and bacteria. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS Probab=99.18 E-value=1.6e-10 Score=88.79 Aligned_cols=100 Identities=15% Similarity=0.232 Sum_probs=87.9 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCC--CCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEE Q ss_conf 1299986301036778979865981799841886--64224102999998766223564322333457678851-56579 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSL--DNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILF 173 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~--d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~ 173 (320) ..++.+..+.+++++.+.+.+++|+.|||.++.. ..++|++..+|+...+. .+| +.|+. T Consensus 2 ~~vv~~~~~~tv~~~~~ll~~~~~~g~PVv~~~~~~~~lvG~i~~~dl~~~l~------------------~~m~~~p~t 63 (105) T cd04591 2 PLVVLLPEGMTVEDLESLLSTTSHNGFPVVDSTEESPRLVGYILRSQLVVALK------------------NYIDPSPFT 63 (105) T ss_pred CCEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCEEEEEEEHHHHHHHHH------------------HHCCCCCEE T ss_conf 96899389989999999999779975858836887888999999999999999------------------758799839 Q ss_pred ECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 717885123234320135651165407766204667588999 Q gi|254780426|r 174 VPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 174 Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) +++++++.++++.|++.+.+..+|+| .|.++|+||..|++. T Consensus 64 v~~~~~l~~~~~lf~~~g~r~l~Vv~-~g~lvGiITr~Dl~~ 104 (105) T cd04591 64 VSPRTSLEKVHQLFRKLGLRHLLVVD-EGRLVGIITRKDLLK 104 (105) T ss_pred ECCCCCHHHHHHHHHHHCCCEEEEEE-CCEEEEEEEHHHHHC T ss_conf 89999599999999994995889957-999999999899534 No 72 >COG2239 MgtE Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism] Probab=99.18 E-value=3.6e-10 Score=86.51 Aligned_cols=135 Identities=16% Similarity=0.262 Sum_probs=116.7 Q ss_pred CCCCHHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCC---CEEEEE--CCCCCCCCCCEEHHHH Q ss_conf 89898999999987304775764131200129998630103677897986598---179984--1886642241029999 Q gi|254780426|r 68 GIFSETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGR---SWMPVY--KNSLDNPRGMVHMRDV 142 (320) Q Consensus 68 G~i~~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~---sR~PV~--~~~~d~iiGiv~~kDl 142 (320) ..++++++..++..+...+-+|..+|+ ++.+++..+.|+++++..+++.+. +++.+| +.+. .+.|++..+++ T Consensus 113 ~~l~~~~r~~v~~~l~y~e~taG~~Mt--~e~v~l~~~~Tv~~al~~ir~~~~~~e~~~~lyVvD~~~-~L~Gvvsl~~L 189 (451) T COG2239 113 SLLDPEERARVRQLLSYPEDTAGRIMT--TEFVTLPEDVTVDEALDRIRERAEDAETIYYLYVVDEKG-KLLGVVSLRDL 189 (451) T ss_pred HHCCHHHHHHHHHHCCCCHHHHHCCCE--EEEEEECCCCCHHHHHHHHHHHCCCCCCEEEEEEECCCC-CEEEEEEHHHH T ss_conf 818999999999865898344200013--025872067589999999997236756623799987766-46777569998 Q ss_pred HHHHHHCCCCCCCCCCCCCCCHHHHCCC-CEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHH Q ss_conf 9876622356432233345767885156-579717885123234320135651165407766204667588999987 Q gi|254780426|r 143 ISYISHMYAKTNNINLNIQLSESNLIKN-ILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLM 218 (320) Q Consensus 143 l~~~~~~~~~~~~~~~~~~~~l~~~~r~-~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~iv 218 (320) +.. .+...+.++|.+ +.+|+..+...++-+.|++...-.+.|||+.|..+|+||.+|+++.+- T Consensus 190 l~a-------------~~~~~i~~im~~~~~~V~~~~dqeevA~~~~~ydl~a~PVVd~~~~LiG~itiDDiidvi~ 253 (451) T COG2239 190 LTA-------------EPDELLKDLMEDDVVSVLADDDQEEVARLFEKYDLLAVPVVDEDNRLIGIITIDDIIDVIE 253 (451) T ss_pred HCC-------------CCHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHCCEECCEECCCCCEEEEEEHHHHHHHHH T ss_conf 648-------------9576899872435624365578799999999828701535778984632554999999999 No 73 >cd04598 CBS_pair_GGDEF_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the GGDEF (DiGuanylate-Cyclase (DGC)) domain. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.13 E-value=3.5e-10 Score=86.62 Aligned_cols=112 Identities=17% Similarity=0.265 Sum_probs=92.0 Q ss_pred EEEEEEECCCHHHHHHHHHHC-CCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCCC-CEEEC Q ss_conf 299986301036778979865-9817998418866422410299999876622356432233345767885156-57971 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKY-GRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLIKN-ILFVP 175 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~-~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~r~-~~~Vp 175 (320) .+.++..++++.++.+.+.++ ..+.+||++++ .++|+++-+|++..+...+.. +.....++.++|.+ ++.|. T Consensus 2 pv~tv~pd~t~~ev~~~f~~~~~i~~lpVvd~~--r~vGiisr~dl~~~~~~~~g~----~l~~~~pV~~~M~~~p~~v~ 75 (119) T cd04598 2 PAPTVSPDTTVNDVLERFERDPDLSALAVVDDG--RPVGLIMREALMELLSTPYGR----ALYGKKPVSEVMDPDPLIVE 75 (119) T ss_pred CCCCCCCCCCHHHHHHHHHHCCCCCEEEEEECC--EEEEEEEHHHHHHHHCCCCCH----HHHCCCCHHHHCCCCCEEEC T ss_conf 985749998499999999878996768998799--889999899999987074322----33038958997168987998 Q ss_pred CCCCHHHHHHHHHCCCCCCE---EEEECCCCCEEEEEHHHHHH Q ss_conf 78851232343201356511---65407766204667588999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMA---LVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~a---iVvDE~G~~~GiVTleDIle 215 (320) +++++.++.+.|.++..+.. +||+|.|...||+|..|++. T Consensus 76 ~~~~i~~~~~~~~~~~~~~l~~~ivV~~~g~~~Givt~~DLlr 118 (119) T cd04598 76 ADTPLEEVSRLATGRDSQNLYDGFIVTEEGRYLGIGTVKDLLR 118 (119) T ss_pred CCCCHHHHHHHHHHCCCCCCCCCCEEEECCEEEEEEEHHHHHC T ss_conf 9896999999998569332578858957998999989899407 No 74 >PRK11543 gutQ D-arabinose 5-phosphate isomerase; Provisional Probab=99.11 E-value=1.2e-10 Score=89.74 Aligned_cols=121 Identities=19% Similarity=0.228 Sum_probs=104.5 Q ss_pred EEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHH Q ss_conf 57641312001299986301036778979865981799841886642241029999987662235643223334576788 Q gi|254780426|r 87 IRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESN 166 (320) Q Consensus 87 ~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~ 166 (320) .+|+|+|.+-.++..++.++++.+++..+.+.++--.-|.+++. .++|+++--||-+.+.+... . +.++.+ T Consensus 197 ~~V~dlM~~~~~lP~v~~~~~l~dai~~ms~k~lG~v~Vvd~~~-kL~GIITDGDLRR~l~k~~~------L--~~~v~~ 267 (321) T PRK11543 197 NKVHHLMRRGDAIPQVALTASVMDAMLELSRTGLGLVAVCDAQQ-QVKGVFTDGDLRRWLVGGGA------L--TTPVNE 267 (321) T ss_pred HHHHHHHHCCCCCCCCCCCCCHHHHHHHHCCCCCCEEEEECCCC-CEEEEEECHHHHHHHHCCCC------H--HHHHHH T ss_conf 99999973355789899998799999996648861799833775-06888744389999863785------6--768999 Q ss_pred HC-CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 51-565797178851232343201356511654077662046675889999 Q gi|254780426|r 167 LI-KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 167 ~~-r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) +| +.|.++.+++.+.++++.|++++..-.+|+|+.|.++|+|.+.|++.+ T Consensus 268 vMT~nP~tI~~d~la~eAl~iM~~~kI~~LpVvd~~~k~vGiIhiHDLl~~ 318 (321) T PRK11543 268 AMTVGGTTLQAQSRAIDAKEILMKRKITAAPVVDENGKLTGAINLQDFYQA 318 (321) T ss_pred HHCCCCCEECCCCHHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHC T ss_conf 848999578998719999999998798689998589959999638978676 No 75 >cd04640 CBS_pair_27 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.11 E-value=3.8e-10 Score=86.33 Aligned_cols=109 Identities=19% Similarity=0.229 Sum_probs=86.4 Q ss_pred EEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHH-----HHHHCCCCCCCCCCCCCCCHHHHCCCCEE Q ss_conf 9998630103677897986598179984188664224102999998-----76622356432233345767885156579 Q gi|254780426|r 99 INAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVIS-----YISHMYAKTNNINLNIQLSESNLIKNILF 173 (320) Q Consensus 99 i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~-----~~~~~~~~~~~~~~~~~~~l~~~~r~~~~ 173 (320) -+++++++|+.++++.+.+++.+.+||.+++. +++|+++.+||+. ....... ...+..+.++|+|... T Consensus 3 Pvtv~~dtti~eA~~~M~~~~v~~l~VvD~~~-~lvGIiT~~Dl~~~~~~~~~~~~~~------~~~~l~V~dvMt~~~~ 75 (126) T cd04640 3 PIVIPADTSIDEALELMIKHGVRLLLVVDSDD-NFIGVITAVDLLGEEPIKRIQEGGI------SRSELTVADVMTPKED 75 (126) T ss_pred CEEECCCCCHHHHHHHHHHHCCCEEEEECCCC-CEEEEEECCCCCCCHHHHHHHHCCC------CHHHCCHHHCCCCCCC T ss_conf 96968999599999999983976798987999-7999997630012156888864489------8255053133554520 Q ss_pred EC-------CCCCHHHHHHHHHCCCCCCEEEEECCC-CCEEEEEHHHHH Q ss_conf 71-------788512323432013565116540776-620466758899 Q gi|254780426|r 174 VP-------SSMLVSDLLTNIQESRIRMALVIDEHG-GTDGLVSYEDIV 214 (320) Q Consensus 174 Vp-------e~~~l~~lL~~m~~~~~~~aiVvDE~G-~~~GiVTleDIl 214 (320) +. ++.++.+++..|++++.|-++|+|+.| .+.|++|..||. T Consensus 76 ~~a~~~~~~~~a~v~d~~~~m~~~~~~hllVvd~~~~~l~GiiS~~DI~ 124 (126) T cd04640 76 LKALDLEELENASVGDVVETLKASGRQHALVVDREHHQIRGIISTSDIA 124 (126) T ss_pred EEEECCCCCCCCCHHHHHHHHHHCCCEEEEEEECCCCEEEEEEEHHHHC T ss_conf 4630220003485999999999839808749989998898999879942 No 76 >cd02205 CBS_pair The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic generali Probab=99.10 E-value=4.7e-10 Score=85.71 Aligned_cols=110 Identities=22% Similarity=0.307 Sum_probs=92.9 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHH-CCCCEEECC Q ss_conf 2999863010367789798659817998418866422410299999876622356432233345767885-156579717 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNL-IKNILFVPS 176 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~-~r~~~~Vpe 176 (320) ++++++.++++.++++.+.+++++.+||++++ ++++|+++.+|++..+......... .+..+ .++++++.+ T Consensus 2 ~~~~v~~~~~l~~a~~~m~~~~~~~~~Vvd~~-~~~~G~it~~dl~~~~~~~~~~~~~-------~~~~~~~~~~~~v~~ 73 (113) T cd02205 2 DVVTVSPDDTVAEALRLMLEHGISGLPVVDDD-GRLVGIVTERDLLRALAEGGLDPLV-------TVGDVMTRDVVTVSP 73 (113) T ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECCC-CCEEEEEEHHHHHHHHHHCCCCCCC-------CEEEEEECCCEEEEE T ss_conf 97796892989999999998398099999399-9599999889989887631574103-------304567505468973 Q ss_pred CCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 885123234320135651165407766204667588999 Q gi|254780426|r 177 SMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 177 ~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) ++++.+++..|.+.+.+..+|+|+.|.+.|++|..|++. T Consensus 74 ~~~~~~~~~~~~~~~~~~~~Vvd~~~~~iG~vs~~di~~ 112 (113) T cd02205 74 DTSLEEAAELMLEHGIRRLPVVDDEGRLVGIVTRSDILR 112 (113) T ss_pred CCHHHHHHHHHHHCCCEEEEEEECCCEEEEEEEHHHHHC T ss_conf 211779999999849729999966997999998899746 No 77 >PRK07807 inositol-5-monophosphate dehydrogenase; Validated Probab=99.07 E-value=4.8e-10 Score=85.70 Aligned_cols=160 Identities=14% Similarity=0.204 Sum_probs=117.5 Q ss_pred HHHHHHHHHHHHCCC--CC-----HHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEEC Q ss_conf 999999997997689--89-----89999999873047757641312001299986301036778979865981799841 Q gi|254780426|r 55 KGIRLPRIDASLEGI--FS-----ETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYK 127 (320) Q Consensus 55 eel~~l~~~~~~~G~--i~-----~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~ 127 (320) -|-++-+..+...|. |+ +++.++++.+=+. +-|. .+-+++..+.++.++++++.+++|+-+||.+ T Consensus 56 TE~~MAIamA~~GGiGvIH~Nmsie~Qa~~V~kVK~~------~~~i--~~Pvti~pd~tv~d~~~l~~~~~~sg~pVv~ 127 (479) T PRK07807 56 AGRRMAETVARRGGLVVLPQDLPIDAVAETVAWVKSR------DLVF--DTPVTLAPDDTVSDALALIHKRAHGAVVVVD 127 (479) T ss_pred CCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHCC------CCEE--CCCEEECCCCCHHHHHHHHHHHCCCCCCEEC T ss_conf 7499999999779859987999999999999997113------7743--6998978987199999999983788874146 Q ss_pred CCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCC-CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEE Q ss_conf 886642241029999987662235643223334576788515-6579717885123234320135651165407766204 Q gi|254780426|r 128 NSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLIK-NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDG 206 (320) Q Consensus 128 ~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~r-~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~G 206 (320) ++. .++|+|+-+|+- +. ....++.++|+ +++++|+.+++.++...|++++....+|||+.|.+.| T Consensus 128 ~~g-kLvGIvT~RDir-~~------------d~~~~v~~vMT~~lvt~~~g~sl~eA~~lL~~~kiekLpvVd~~~~L~g 193 (479) T PRK07807 128 EEG-RPVGLVTEADCR-GV------------DRFTRVGDVMSTDLVTLPAGTDPRKAFDLLEAAPVKVAPVVDADGRLAG 193 (479) T ss_pred CCC-CEEEEEECHHHC-CC------------CCCCCHHHHHCCCCEEECCCCCHHHHHHHHHHCCCCCCEEECCCCEEEE T ss_conf 799-478898213411-47------------7777588862577366136678799999997535231137756992999 Q ss_pred EEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCC Q ss_conf 66758899998742221000000001246773378304535 Q gi|254780426|r 207 LVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTD 247 (320) Q Consensus 207 iVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~ 247 (320) +||..||... ..+ +... -.+|.++|-+.+. T Consensus 194 LiT~kDi~k~------~~~----~~a~-D~~grL~VgAAVG 223 (479) T PRK07807 194 VLTRTGALRA------TIY----TPAV-DAAGRLRVAAAVG 223 (479) T ss_pred EEEEEHHHHC------CCC----CCCC-CHHHCEEEEEEEC T ss_conf 9996116454------468----8777-8223456788725 No 78 >PTZ00314 inosine-5'-monophosphate dehydrogenase; Provisional Probab=99.01 E-value=2.4e-09 Score=81.14 Aligned_cols=163 Identities=18% Similarity=0.236 Sum_probs=118.6 Q ss_pred HHHHHHHHHHHCC--CCC-----HHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECC Q ss_conf 9999999799768--989-----899999998730477576413120012999863010367789798659817998418 Q gi|254780426|r 56 GIRLPRIDASLEG--IFS-----ETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKN 128 (320) Q Consensus 56 el~~l~~~~~~~G--~i~-----~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~ 128 (320) |-++-+..+...| +|+ +++.++++.+=++.+- |+ .+.+++.++.++.++++++.+++||-+||.++ T Consensus 62 e~~MAiamA~~GGiGVIHrn~sie~Q~~~V~~VKr~e~g-----~i--~dPit~~p~~ti~d~~~l~~~~~~sg~pVv~~ 134 (499) T PTZ00314 62 EHKMAIAMALMGGIGVIHNNCTVERQVEEVKKVKRFENG-----FI--MDPKSLSPEHTVSDVIEIKDKKGFSGIPITED 134 (499) T ss_pred CHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHCCC-----CC--CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC T ss_conf 899999999779869987999999999999998763066-----20--69827389862999997567518863679863 Q ss_pred C--CCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCCC-CEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCE Q ss_conf 8--66422410299999876622356432233345767885156-57971788512323432013565116540776620 Q gi|254780426|r 129 S--LDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLIKN-ILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTD 205 (320) Q Consensus 129 ~--~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~r~-~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~ 205 (320) + ...++|+++-+|+.. . .....++.++|.+ +++.++..++.++.+.|++++....+|||+.|.+. T Consensus 135 g~~~gkL~GIvT~rD~~f-~-----------~d~~~~v~~vMt~~lvt~~~~isl~eA~~ll~~~kieklpvVd~~g~L~ 202 (499) T PTZ00314 135 GRPGGKLLGIVTSKDIDF-V-----------KDKSTPVSEIMTTDLVVGRYPITLEDANDVLNRSRKGVLPIVNDNGELV 202 (499) T ss_pred CCCCCEEEEEEECCCEEC-C-----------CCCCCCHHHHHCCCCEEECCCCCHHHHHHHHHHHHHCCCEEECCCCCEE T ss_conf 886886899983620232-3-----------0677588886156726725999999999999860331230665789589 Q ss_pred EEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCC Q ss_conf 466758899998742221000000001246773378304535 Q gi|254780426|r 206 GLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTD 247 (320) Q Consensus 206 GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~ 247 (320) |++|..|+...- ..+...+-.+|.++|-+.+. T Consensus 203 glit~kDi~k~~----------~~P~a~~D~~grL~VgAAVg 234 (499) T PTZ00314 203 ALVSRSDAVKNR----------DYPHASKDENKQLLVGAAIS 234 (499) T ss_pred EEEECCHHHHHH----------HCCCHHHHCCCCEEEEEEEC T ss_conf 986303487753----------38712220138789999947 No 79 >cd04608 CBS_pair_PALP_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the pyridoxal-phosphate (PALP) dependent enzyme domain upstream. The vitamin B6 complex comprises pyridoxine, pyridoxal, and pyridoxamine, as well as the 5'-phosphate esters of pyridoxal (PALP) and pyridoxamine, the last two being the biologically active coenzyme derivatives. The members of the PALP family are principally involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and other amine-containing compounds. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a poten Probab=99.00 E-value=1.1e-09 Score=83.25 Aligned_cols=113 Identities=20% Similarity=0.185 Sum_probs=85.7 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEEC Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851-5657971 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vp 175 (320) ..++++..+.|++++.++|++++++++||.+++. +++|+++.+|++.++..+..... .++.+.| ++...+. T Consensus 2 ~~~vtv~p~~tv~~a~~~m~~~~i~~lPVvd~~g-~~~G~vt~~~ll~~~~~~~~~~~-------~~V~~vm~~~~~~v~ 73 (124) T cd04608 2 KAPVTVLPTVTCAEAIEILKEKGFDQLPVVDESG-KILGMVTLGNLLSSLSSGKVQPS-------DPVSKALYKQFKRVN 73 (124) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCCC-CEEEEEEHHHHHHHHHCCCCCCC-------CCHHHHHCCCCCCCC T ss_conf 9997989989099999999984998568987999-68888879999999973899999-------828997446565147 Q ss_pred CCCCHHH---------HHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHH Q ss_conf 7885123---------23432013565116540776620466758899998 Q gi|254780426|r 176 SSMLVSD---------LLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVL 217 (320) Q Consensus 176 e~~~l~~---------lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~i 217 (320) .+.++.. .+..|..++...++++|+.|.+.||||.-|||..| T Consensus 74 ~~~~l~~ls~~l~~~~~~lvv~~~~~~~~~v~~~~~k~vGIvTr~DLL~yl 124 (124) T cd04608 74 KNDTLGKLSRILETDAFLLVFFEQISSAAIGKEKQEKPIGIVTKIDLLSYI 124 (124) T ss_pred CCCCHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCCCEEEEEEEHHHHHHHC T ss_conf 998556731036732789999986025676277799899998679977039 No 80 >PRK05567 inositol-5'-monophosphate dehydrogenase; Reviewed Probab=98.99 E-value=9.4e-10 Score=83.78 Aligned_cols=125 Identities=25% Similarity=0.404 Sum_probs=101.4 Q ss_pred HHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCC Q ss_conf 89999999873047757641312001299986301036778979865981799841886642241029999987662235 Q gi|254780426|r 72 ETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYA 151 (320) Q Consensus 72 ~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~ 151 (320) +++.++++.+=+..+ -|+ .+.++++++.++.++.+++.+++|+-+||.+++. +++|+++.+|+- +. T Consensus 77 e~Q~~~V~~VKr~e~-----g~i--~~P~tl~P~~tv~d~~~l~~~~~~sg~PVv~~~~-kL~GiiT~rD~~-f~----- 142 (486) T PRK05567 77 EEQAEEVRKVKRSES-----GVV--TDPVTVTPDTTLAEALALMARYGISGVPVVDEEG-KLVGIITNRDVR-FE----- 142 (486) T ss_pred HHHHHHHHHHHHHHC-----CCC--CCCCCCCCCCCHHHHHHHHHHHCCCCCEEECCCC-CEEEEEECCCEE-CC----- T ss_conf 999999999975306-----713--7986768988899999999972878614876799-478886142011-00----- Q ss_pred CCCCCCCCCCCCHHHHCCC--CEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 6432233345767885156--5797178851232343201356511654077662046675889999 Q gi|254780426|r 152 KTNNINLNIQLSESNLIKN--ILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 152 ~~~~~~~~~~~~l~~~~r~--~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) .....++.++|.+ .+++++..++.++.+.|++++...++|||+.|.+.|++|..|+... T Consensus 143 ------~~~~~~V~~vMT~~~lvt~~~~is~~eA~~~l~~~kieklPvVd~~g~L~Glit~kDi~k~ 203 (486) T PRK05567 143 ------TDLSQPVSEVMTKERLVTVPEGTTLEEALELLHEHRIEKLLVVDDNGRLKGLITVKDIEKA 203 (486) T ss_pred ------CCCCCCHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHCCCCEECCCCCEEEEEEHHHHHHH T ss_conf ------2677654675345732892588999999999997313034277468968888776677765 No 81 >PRK10892 D-arabinose 5-phosphate isomerase; Provisional Probab=98.99 E-value=1.6e-09 Score=82.30 Aligned_cols=122 Identities=20% Similarity=0.254 Sum_probs=104.5 Q ss_pred CEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHH Q ss_conf 75764131200129998630103677897986598179984188664224102999998766223564322333457678 Q gi|254780426|r 86 KIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSES 165 (320) Q Consensus 86 ~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~ 165 (320) -.+|+|+|.+..++..++.++++.+++..+.+.++--.-|++++. .++|+++--||-+++..... ....... T Consensus 201 l~~V~dlM~~~~~lP~v~~~~~l~dai~~mt~k~lG~v~Vvd~~g-kL~GIiTDGDLRR~l~~~~~-------i~~~~~~ 272 (326) T PRK10892 201 LLRVSDIMHTGDEIPHVKKTASLRDALLEITRKNLGMTVICDDNM-MIEGIFTDGDLRRVFDMGVD-------LRQLSIA 272 (326) T ss_pred HHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCC-CEEEEEECHHHHHHHHHCCC-------CCCCCHH T ss_conf 878999851376588537760299999987458934999985899-68999862689999870688-------3128799 Q ss_pred HHC-CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 851-565797178851232343201356511654077662046675889999 Q gi|254780426|r 166 NLI-KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 166 ~~~-r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) ++| +.|.++.+++.+.++|..|++++..-.+|+|+ +.++|+|.+.|++.+ T Consensus 273 diMT~nP~tI~~d~la~eAL~iMe~~kIt~L~Vvd~-~k~vGIihihDll~~ 323 (326) T PRK10892 273 DVMTPGGIRVRPGILAVDALNLMQSRHITSVLVADG-DQLLGVLHMHDLLRA 323 (326) T ss_pred HHHCCCCCEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHHC T ss_conf 971899967899886999999998639818999889-999999767978676 No 82 >COG3620 Predicted transcriptional regulator with C-terminal CBS domains [Transcription] Probab=98.93 E-value=5.5e-09 Score=78.76 Aligned_cols=141 Identities=16% Similarity=0.247 Sum_probs=104.3 Q ss_pred HCCCCCHHHHH---HHHHHHCC--CCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHH Q ss_conf 76898989999---99987304--77576413120012999863010367789798659817998418866422410299 Q gi|254780426|r 66 LEGIFSETEKE---IFNNILRF--RKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMR 140 (320) Q Consensus 66 ~~G~i~~~E~~---ii~~vl~l--~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~k 140 (320) +.|.+++.-+. |+.-..+- ...+++.+|+ ..++.+.+++++.++.+++..+|||.+||.++ |.++|-++-. T Consensus 39 E~G~vdPrlSt~k~Il~aL~e~e~~~ita~~iM~--spvv~v~pdDsi~~vv~lM~~~g~SQlPVi~~--~k~VGsItE~ 114 (187) T COG3620 39 EAGKVDPRLSTVKRILEALEEAEKTRITAKTIMH--SPVVSVSPDDSISDVVNLMRDKGISQLPVIEE--DKVVGSITEN 114 (187) T ss_pred HCCCCCCCHHHHHHHHHHHHHHHCCEEEHHHHCC--CCEEEECCHHHHHHHHHHHHHCCCCCCCEEEC--CEEEEEECHH T ss_conf 6388793089999999999985466675766535--87058772466999999999759752754108--8645331498 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCHHHHCCC-CEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHH Q ss_conf 999876622356432233345767885156-5797178851232343201356511654077662046675889999874 Q gi|254780426|r 141 DVISYISHMYAKTNNINLNIQLSESNLIKN-ILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMR 219 (320) Q Consensus 141 Dll~~~~~~~~~~~~~~~~~~~~l~~~~r~-~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivG 219 (320) ++...+.++... -.....+.+|-+ -+.|.++.++.-.-..+..+. |+.|-|.|.++||||..||+..+-| T Consensus 115 ~iv~~~le~~e~------i~~~~vr~vM~e~fP~Vs~~~~l~vI~~LL~~~~---AVlV~e~G~~vGIITk~DI~k~~~~ 185 (187) T COG3620 115 DIVRALLEGMES------IRSLRVREVMGEPFPTVSPDESLNVISQLLEEHP---AVLVVENGKVVGIITKADIMKLLAG 185 (187) T ss_pred HHHHHHHCCCCC------HHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCC---EEEEEECCCEEEEEEHHHHHHHHHC T ss_conf 999997304221------2112189885588876799998799999984198---3899868936888759999998724 No 83 >KOG1764 consensus Probab=98.78 E-value=1.5e-08 Score=75.88 Aligned_cols=136 Identities=20% Similarity=0.269 Sum_probs=81.0 Q ss_pred CEEEHHEEECCC-EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCC-CCCCCC Q ss_conf 757641312001-299986301036778979865981799841886642241029999987662235643223-334576 Q gi|254780426|r 86 KIRIDDIMISRV-SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNIN-LNIQLS 163 (320) Q Consensus 86 ~~~v~diMtPr~-~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~-~~~~~~ 163 (320) .++..|.=++-- .+..+..+.++.++++.+...+.|-+||.+.+... +|+....|+......+........ ...... T Consensus 225 ~~s~~dl~ig~~~~i~~i~~~~~v~~al~~m~~~~is~lpvV~~~g~~-v~~~s~~Dv~~l~~~~~~~~~~~~~l~~~~~ 303 (381) T KOG1764 225 SKSLSDLGIGTWSNIASISEDTPVIEALKIMSERRISALPVVDENGKK-VGNYSRFDVIHLAREGTYNNLDLSCLSEALS 303 (381) T ss_pred HCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCEECCCCCE-ECCEEEEHHHHHHHCCCCCCCCHHHHHHHHH T ss_conf 188766404778777775178708999999987045744347577855-3215620012156427767777324213533 Q ss_pred HHHHCC-CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHH Q ss_conf 788515-65797178851232343201356511654077662046675889999874222 Q gi|254780426|r 164 ESNLIK-NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDIT 222 (320) Q Consensus 164 l~~~~r-~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~ 222 (320) .+.... +++..-.+.++..++.+|..+++|-+.|||+.|.++|+||+.||+..++-.-. T Consensus 304 ~~~~~~~~vvtc~~~ssL~~vi~~lv~~~vHRl~VVd~~~~l~GvvSLsDil~~l~~~p~ 363 (381) T KOG1764 304 HRPIRFEGVVTCRPTSTLAEVIDKLVAHRVHRLWVVDEDGVLVGVISLSDILSYLVLTPS 363 (381) T ss_pred CCCCCCCCCEEEEECCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHHHCCC T ss_conf 155456873798437749999999987386059998689867987558999999974737 No 84 >COG0517 FOG: CBS domain [General function prediction only] Probab=98.72 E-value=9.8e-08 Score=70.55 Aligned_cols=113 Identities=22% Similarity=0.383 Sum_probs=95.9 Q ss_pred HHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCC Q ss_conf 41312001299986301036778979865981799841886642241029999987662235643223334576788515 Q gi|254780426|r 90 DDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLIK 169 (320) Q Consensus 90 ~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~r 169 (320) .++|. .+.+.+..+.++.++...+.+++++++||.+.. .++|+++.+|++.+......... .+..++. T Consensus 2 ~~~~~--~~~~~v~~~~~~~~a~~~m~~~~~~~~~v~~~~--~l~Giit~~di~~~~~~~~~~~~--------~v~~i~~ 69 (117) T COG0517 2 KDIMT--KDVITVKPDTSVRDALLLMSENGVSAVPVVDDG--KLVGIITERDILRALAAGGKRLL--------PVKEVMT 69 (117) T ss_pred CCCCC--CCCEEECCCCCHHHHHHHHHHCCCEEEEEEECC--EEEEEEEHHHHHHHHHCCCCCCC--------HHHHHHC T ss_conf 65442--688089899819999999987493389974399--99999998999876641665533--------1788625 Q ss_pred -CCEEECCCCCHHHHHHHHHC-CCCCCEEEEECCC-CCEEEEEHHHHH Q ss_conf -65797178851232343201-3565116540776-620466758899 Q gi|254780426|r 170 -NILFVPSSMLVSDLLTNIQE-SRIRMALVIDEHG-GTDGLVSYEDIV 214 (320) Q Consensus 170 -~~~~Vpe~~~l~~lL~~m~~-~~~~~aiVvDE~G-~~~GiVTleDIl 214 (320) ++..+.++.++.+++..|.+ .+.+..+|+|+.+ .++|++|..|++ T Consensus 70 ~~~~~~~~~~~~~~~~~~m~~~~~~~~lpVv~~~~~~lvGivt~~di~ 117 (117) T COG0517 70 KPVVTVDPDTPLEEALELMVERHKIRRLPVVDDDGGKLVGIITLSDIL 117 (117) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCCEEEEEECCCCEEEEEEEHHHCC T ss_conf 884677799889999999987267762569988988799898878749 No 85 >COG2905 Predicted signal-transduction protein containing cAMP-binding and CBS domains [Signal transduction mechanisms] Probab=98.71 E-value=6.6e-08 Score=71.67 Aligned_cols=124 Identities=19% Similarity=0.242 Sum_probs=100.4 Q ss_pred CCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 04775764131200129998630103677897986598179984188664224102999998766223564322333457 Q gi|254780426|r 83 RFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQL 162 (320) Q Consensus 83 ~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~ 162 (320) +|--.++.++|+ ...+.++..+++.++...+.+.|.|-+.+.+++. ...|||+-||+...+..... +... T Consensus 145 e~~~trv~~~~~--~~~~~v~~~~~i~~aa~km~~~gv~s~v~l~~~~-~~~GIvT~~dl~~~v~~~g~-------~~~~ 214 (610) T COG2905 145 EFILTRVGEVKT--LPAVTVSPQASIQDAARKMKDEGVSSLVVLDDSG-PLLGIVTRKDLRSRVIADGR-------SKTQ 214 (610) T ss_pred HHHHHHHHHHHC--CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCC-CCCCEEEHHHHHHHHHHCCC-------CCCC T ss_conf 588998777741--7875268657679999999960887289983798-74332422777899986589-------7423 Q ss_pred CHHHHC-CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHH Q ss_conf 678851-5657971788512323432013565116540776620466758899998 Q gi|254780426|r 163 SESNLI-KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVL 217 (320) Q Consensus 163 ~l~~~~-r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~i 217 (320) ++..+| .|++.|..+..+.+++-.|-++++|-.+|. |.|.+.|+||+.||+.-. T Consensus 215 ~V~evmT~p~~svd~~~~~feAml~m~r~~I~hl~V~-e~gq~~Gilt~~dIl~l~ 269 (610) T COG2905 215 KVSEVMTSPVISVDRGDFLFEAMLMMLRNRIKHLPVT-EDGQPLGILTLTDILRLF 269 (610) T ss_pred CHHHHHCCCCEEECCCCHHHHHHHHHHHHCCCEEEEE-CCCEEEEEEEHHHHHHHH T ss_conf 3545414684454476569999999998077442361-189046776699999761 No 86 >PRK10070 glycine betaine transporter ATP-binding subunit; Provisional Probab=98.70 E-value=1.9e-07 Score=68.59 Aligned_cols=133 Identities=17% Similarity=0.179 Sum_probs=97.0 Q ss_pred HHHHHHHHCCCCEEEHHEEECC-CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCC Q ss_conf 9999987304775764131200-129998630103677897986598179984188664224102999998766223564 Q gi|254780426|r 75 KEIFNNILRFRKIRIDDIMISR-VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKT 153 (320) Q Consensus 75 ~~ii~~vl~l~~~~v~diMtPr-~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~ 153 (320) ++.+.++=...-.+|++||.+- ..++.........++++.+.+++.+.+.|.+.+. .+.|++...++........ T Consensus 262 ~~F~~~v~~~~~~~a~~i~~~~~~~~~~~~~~~~~~~al~~m~~~~~~~~~vvd~~~-~~~G~v~~~~~~~~~~~~~--- 337 (400) T PRK10070 262 RTFFRGVDISQVFSAKDIARRTPNGLIRKTPGFGPRSALKLLQDEDREYGYVIERGN-KFVGAVSIDSLKAALTQQQ--- 337 (400) T ss_pred HHHHCCCCHHHCEEHHHHHCCCCCCEEECCCCCCHHHHHHHHHHCCCCEEEEECCCC-EEEEEEEHHHHHHHHHCCC--- T ss_conf 987554778770239896246875213148886999999999855986799986998-0889988999997763377--- Q ss_pred CCCCCCCCCCHHHHCCCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHH Q ss_conf 3223334576788515657971788512323432013565116540776620466758899998742 Q gi|254780426|r 154 NNINLNIQLSESNLIKNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRD 220 (320) Q Consensus 154 ~~~~~~~~~~l~~~~r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGe 220 (320) .....+.+.+..|++++++.+++..|.+.... ..|||+.|.+.|+||...++..|-.+ T Consensus 338 --------~~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~-~~Vvd~~~~l~G~it~~~ll~~L~~~ 395 (400) T PRK10070 338 --------GLDAALIDAPLAVDAQTPLSELLSHVGQAPCA-VPVVDEDQQYVGIISKGMLLRALDRE 395 (400) T ss_pred --------CHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCC-EEEECCCCCEEEEEEHHHHHHHHHHC T ss_conf --------63667505884239999899999999728996-38987999199999879999998752 No 87 >pfam00478 IMPDH IMP dehydrogenase / GMP reductase domain. This family is involved in biosynthesis of guanosine nucleotide. Members of this family contain a TIM barrel structure. In the inosine monophosphate dehydrogenases 2 CBS domains pfam00571 are inserted in the TIM barrel. This family is a member of the common phosphate binding site TIM barrel family. Probab=98.57 E-value=1.6e-07 Score=69.12 Aligned_cols=149 Identities=21% Similarity=0.273 Sum_probs=105.4 Q ss_pred HHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCC Q ss_conf 89999999873047757641312001299986301036778979865981799841886642241029999987662235 Q gi|254780426|r 72 ETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYA 151 (320) Q Consensus 72 ~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~ 151 (320) +++.++++.+=++. ..|+ .+.+++.++.++.++.+++.+++|+-+||.+++. .+.|++..+|....-.. T Consensus 72 e~Q~~~V~~VK~~e-----~g~i--~~Piti~p~~ti~~~~~l~~~~~~sg~pVv~~~~-~~~~i~~~~~~~~~~~~--- 140 (467) T pfam00478 72 EEQAEEVRKVKRFE-----SGFI--TDPVTVSPDTTVAEALELKERYGISGVPVTEDGK-LLGGLVGIRTSRDIDFL--- 140 (467) T ss_pred HHHHHHHHHHHHCC-----CCCC--CCCCCCCCCCCHHHHHHHHHHHCCCCEEEECCCC-EEEEEEEEECCCCCCCC--- T ss_conf 99999999997433-----6821--7875369876599999998972878326820797-46899877624433445--- Q ss_pred CCCCCCCCCCCCHHHHCCCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHC Q ss_conf 64322333457678851565797178851232343201356511654077662046675889999874222100000000 Q gi|254780426|r 152 KTNNINLNIQLSESNLIKNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSM 231 (320) Q Consensus 152 ~~~~~~~~~~~~l~~~~r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~ 231 (320) ...........+++.+++..++.++.+.|++++...++|||+.|.+.|++|..|+... ..+ +. T Consensus 141 -------~~~~~~~~~~~~lvt~~~~~~~~ea~~~l~~~kieklpvvd~~g~L~glit~kDi~k~------~~~----P~ 203 (467) T pfam00478 141 -------SKVSVVMTMTEDLVTAPEGITLEEANEILHEHKIEKLPIVDDDGELVGLITRKDIEKA------RDY----PN 203 (467) T ss_pred -------CCCCEEECCCCCEEEECCCCCHHHHHHHHHHHHHHHCCEECCCCCEEEEEEHHHHHHH------HCC----CC T ss_conf -------4320110135542772488998999999997556415444678837888874346774------207----85 Q ss_pred CCCCCCCEEEEECCCCH Q ss_conf 12467733783045359 Q gi|254780426|r 232 ISAVSDNTFIVDARTDL 248 (320) Q Consensus 232 i~~~~~g~~~v~G~~~l 248 (320) ..+-..|.++|-+.+.. T Consensus 204 a~~D~~grL~VgAAVG~ 220 (467) T pfam00478 204 ASKDAQGRLLVGAAVGT 220 (467) T ss_pred CCCCCCCCEEEEEEECC T ss_conf 22265677799998067 No 88 >KOG0474 consensus Probab=98.52 E-value=1.6e-07 Score=69.11 Aligned_cols=135 Identities=10% Similarity=0.217 Sum_probs=104.0 Q ss_pred CCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCC----CCCCCEEHHHHHHHHHHCCCCCC---- Q ss_conf 0477576413120012999863010367789798659817998418866----42241029999987662235643---- Q gi|254780426|r 83 RFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLD----NPRGMVHMRDVISYISHMYAKTN---- 154 (320) Q Consensus 83 ~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d----~iiGiv~~kDll~~~~~~~~~~~---- 154 (320) .+.+.+|+|+|.+ .+++++.-..+..+.+.++.+.|.-|||.++... .+.|++..+.++..+.+..-... T Consensus 578 ~mr~L~a~ev~~~--pvi~l~~~ekV~~Iv~vLk~t~HngFPVvd~~~~~~~~~l~GlILRshl~vlL~~~~f~~~~~~~ 655 (762) T KOG0474 578 YMRNLTAGEVMSK--PVICLNRVEKVAVIVDVLKSTNHNGFPVVDEPPSNEAGRLHGLILRSHLLVLLKKRVFVEESRST 655 (762) T ss_pred HHHHHHHHHHCCC--CEEEEECHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 7653117664268--85887322109999999873476887523578876102366888999999998744542057665 Q ss_pred -C--------------------------CCCCCCCCHHHHCCC-CEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEE Q ss_conf -2--------------------------233345767885156-579717885123234320135651165407766204 Q gi|254780426|r 155 -N--------------------------INLNIQLSESNLIKN-ILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDG 206 (320) Q Consensus 155 -~--------------------------~~~~~~~~l~~~~r~-~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~G 206 (320) . .+..--.++..++.| |+.||+++++..++..||+-..+-.+||+.-.+..| T Consensus 656 ~~~~~~~~~~~~d~a~r~~~i~dv~lt~~e~~~yvDl~p~~n~sPytV~~~mSl~k~~~lFR~lGLRhLlVv~~~~~~~g 735 (762) T KOG0474 656 FDLPVRRKFTFRDFAKREPSIEDVHLTSEEMEMYVDLHPFMNPSPYTVPETMSLAKAFILFRQLGLRHLLVVPKTNRVVG 735 (762) T ss_pred CCCCHHHCCCHHHHHHCCCCHHHHHCCHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCEEEEEECCCCCEEE T ss_conf 67604303787776441785565541667686430553346999965474310578999999735015898517786366 Q ss_pred EEEHHHHH-HHHHH Q ss_conf 66758899-99874 Q gi|254780426|r 207 LVSYEDIV-SVLMR 219 (320) Q Consensus 207 iVTleDIl-e~ivG 219 (320) ++|..|+. +.+.| T Consensus 736 ilTR~D~~~~~~l~ 749 (762) T KOG0474 736 ILTRKDLARYRILG 749 (762) T ss_pred EEEHHHHHHHHHHC T ss_conf 77566434677732 No 89 >TIGR01302 IMP_dehydrog inosine-5'-monophosphate dehydrogenase; InterPro: IPR005990 Synonyms: Inosine-5'-monophosphate dehydrogenase, Inosinic acid dehydrogenase IMP dehydrogenase (1.1.1.205 from EC,IMPDH) catalyzes the rate-limiting reaction of de novo GTP biosynthesis, the NAD-dependent reduction of IMP into XMP .Inosine 5-phosphate + NAD+ + H2O = xanthosine 5-phosphate + NADH IMP dehydrogenase is associated with cell proliferation and is a possible target for cancer chemotherapy. Mammalian and bacterial IMPDHs are tetramers of identical chains. There are two IMP dehydrogenase isozymes in humans . IMP dehydrogenase nearly always contains a long insertion that has two CBS domains within it and adopts a TIM barrel structure.; GO: 0003938 IMP dehydrogenase activity, 0006177 GMP biosynthetic process. Probab=98.42 E-value=3.9e-07 Score=66.59 Aligned_cols=188 Identities=15% Similarity=0.245 Sum_probs=114.1 Q ss_pred CCEE--EHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 7757--64131200129998630103677897986598179984188664224102999998766223564322333457 Q gi|254780426|r 85 RKIR--IDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQL 162 (320) Q Consensus 85 ~~~~--v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~ 162 (320) ...+ |.++||||.++++.+.+-+++++.+.+.+++..++||.+++. +++|+|+.+|+.+....+.+..+..-.+.+. T Consensus 150 ~~~~daV~~~MT~~~~~iT~~e~i~~e~A~~~L~~~r~ekLpvVd~~~-~lVgLiT~~Di~~~~~~P~A~kd~vG~~GrL 228 (476) T TIGR01302 150 KGKKDAVSEVMTPREELITVPEGIDLEEALKVLHKHRIEKLPVVDKDG-ELVGLITVKDIVKRREFPHASKDTVGENGRL 228 (476) T ss_pred CCCCCCEEECCCCCCCCEEECCCCCHHHHHHHHHHHCCCEEEEECCCC-CEEEEEEHHHHHHHHHCCCCCCCCCCCCCEE T ss_conf 688330101120376434841677789999998860865047882789-8899986447889863888778874888608 Q ss_pred C-----------------HHHHCCCCEEECC----CCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHH Q ss_conf 6-----------------7885156579717----885123234320135651165407766204667588999987422 Q gi|254780426|r 163 S-----------------ESNLIKNILFVPS----SMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDI 221 (320) Q Consensus 163 ~-----------------l~~~~r~~~~Vpe----~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei 221 (320) . |.+-.=+++++-- +....+.++.+|+...++=+|. |++.=-=--++|+++-+ T Consensus 229 ~VgAAvg~r~~D~~R~~~L~~AGvDv~viDsshGhs~~vl~~ik~~k~~Yp~~~iia---GNVaT~~~a~~LI~AgA--- 302 (476) T TIGR01302 229 IVGAAVGTREDDLERAEALVEAGVDVIVIDSSHGHSIYVLDSIKKIKKTYPDLDIIA---GNVATAEQAKALIDAGA--- 302 (476) T ss_pred EEEEEECCCCCCHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHCCEEEEEE---CCCCCHHHHHHHHHCCC--- T ss_conf 999884689861899999996596589981665453789999999986388057994---34411788988985288--- Q ss_pred HHHCCCHHHCCCC--CCCCE-------------------------------EEEECCCCHH-HHHHHHCCCCCCCCCCCC Q ss_conf 2100000000124--67733-------------------------------7830453599-999982888887777467 Q gi|254780426|r 222 TSEHHSKKSMISA--VSDNT-------------------------------FIVDARTDLE-ELAKIIGTDCNCLKGEQD 267 (320) Q Consensus 222 ~de~d~~~~~i~~--~~~g~-------------------------------~~v~G~~~l~-~l~~~l~~~l~~~~~~~~ 267 (320) | ..+ +++|| .+.||.+.=. |+.+.|-+--+ T Consensus 303 ----D-----g~rVGiGpGSICTTr~V~gVGvPQ~TAv~~Va~~A~~~Gi~VIADGGIr~SGDivKAlAaGA~------- 366 (476) T TIGR01302 303 ----D-----GLRVGIGPGSICTTRIVAGVGVPQITAVYDVAEYAAQSGIPVIADGGIRYSGDIVKALAAGAD------- 366 (476) T ss_pred ----C-----EEEECCCCCCCCEEEEEEECCCHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHCCC------- T ss_conf ----8-----789836889811001565127626889999999997279909983775625589999981677------- Q ss_pred CCCHHHHHHHHHCCCCCCCCEEEEECCEEEEE Q ss_conf 43179999998423473798899758959999 Q gi|254780426|r 268 VDSLGGLIFSVLDRIPARGEVVLEIPGFEIVI 299 (320) Q Consensus 268 ~~Ti~G~i~~~l~~iP~~Ge~v~~~~g~~f~V 299 (320) +-=||+|+-.--+ .+|+.++ .+|-+|++ T Consensus 367 aVMlGslLAGT~E---sPGe~~~-~nGrryK~ 394 (476) T TIGR01302 367 AVMLGSLLAGTTE---SPGEYYI-INGRRYKS 394 (476) T ss_pred EEHHCCCCCCCCC---CCCCEEE-ECCEEEEE T ss_conf 2202342101633---8872699-66878987 No 90 >cd04592 CBS_pair_EriC_assoc_euk This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually Probab=98.37 E-value=2.2e-06 Score=61.70 Aligned_cols=106 Identities=9% Similarity=0.034 Sum_probs=72.4 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCC-C---CC----------C Q ss_conf 12999863010367789798659817998418866422410299999876622356432233-3---45----------7 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINL-N---IQ----------L 162 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~-~---~~----------~ 162 (320) ++.+.+..++|+.|++++|.+.+.+..||.+++. .++|++...||..++............ . .. . T Consensus 1 t~~vtV~p~ttv~EA~~lM~~~~~~~~~VVD~~~-~L~GIvt~gDi~R~~~~~~~~~~~~~~~~~~~~~~~vs~v~t~~~ 79 (133) T cd04592 1 TKYIKVSPTTTLKEALNLMLDEKQSCVLVVDSDD-FLEGILTLGDIQRFLFTNKTTRVQPEDETKQTNTCLVSSVCTKGI 79 (133) T ss_pred CCEEEECCCCCHHHHHHHHHHHCCCEEEEECCCC-CEEEEEEHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEEEEEEE T ss_conf 9459989998599999999981886579983899-789997879999998762356456320001244432334564203 Q ss_pred CHHHHCCCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCC Q ss_conf 67885156579717885123234320135651165407766 Q gi|254780426|r 163 SESNLIKNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGG 203 (320) Q Consensus 163 ~l~~~~r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~ 203 (320) ....-.+.++.+++++++.++.+.|.+++.+-..||++.++ T Consensus 80 ~~~~~~~~~~tv~pd~~l~~a~~lM~~~~i~~LPVV~~~~~ 120 (133) T cd04592 80 SYGGQECGLWTCTPDTDLTTAKKLMEAKGVKQLPVVKRGVD 120 (133) T ss_pred EECCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCEEECCCC T ss_conf 62551100389789999999999999869885876707766 No 91 >cd04617 CBS_pair_4 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=98.24 E-value=7.6e-07 Score=64.70 Aligned_cols=50 Identities=22% Similarity=0.311 Sum_probs=39.9 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHH Q ss_conf 65797178851232343201356511654077662046675889999874 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMR 219 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivG 219 (320) ||+++++++++.+++..|.+++....+|||+.|.++||+|-.|++..+.+ T Consensus 2 pPvtv~~~~si~eA~~~M~~~~ig~l~VVD~~~~LvGIiT~rDl~r~~~~ 51 (118) T cd04617 2 PPVVVRENTSVYDAIVTLFLEDVGSLFVVDEDGDLVGVVSRKDLLKASIG 51 (118) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEEHHHHHHHHHC T ss_conf 79798996999999999998399879998499959999878999999974 No 92 >cd04629 CBS_pair_16 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.22 E-value=6.5e-06 Score=58.61 Aligned_cols=85 Identities=19% Similarity=0.323 Sum_probs=54.3 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHH-HCCCHHHCCCCCCCCEEEEECCCC Q ss_conf 5657971788512323432013565116540776620466758899998742221-000000001246773378304535 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITS-EHHSKKSMISAVSDNTFIVDARTD 247 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~d-e~d~~~~~i~~~~~g~~~v~G~~~ 247 (320) |.|+++.+++++.++++.|.+++.+-++|+|+.|.++|++|..|++..+..+... .....-..+- ...-..++...+ T Consensus 1 r~pvtv~~~~tl~~a~~~m~~~~~~~~pVvd~~~~lvGiit~~Dll~~~~~~~~~~~~~~~v~~iM--~~~~~tv~~~~~ 78 (114) T cd04629 1 RNPVTFTPDMSVTEAVEKLLKSKISGGPVVDDNGNLVGFLSEQDCLKQLLESSYHCDGVATVRDIM--TTEVLTVSPDDS 78 (114) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCEEEEEEEHHHHHHHHHHHHCCCCCCCEEHHHC--CCCCEEEECCCC T ss_conf 989997992999999999997199789999489929999967899999977540588765918851--468879957999 Q ss_pred HHHHHHHH Q ss_conf 99999982 Q gi|254780426|r 248 LEELAKII 255 (320) Q Consensus 248 l~~l~~~l 255 (320) +.+..+.+ T Consensus 79 ~~~a~~~m 86 (114) T cd04629 79 IVDLAQLM 86 (114) T ss_pred HHHHHHHH T ss_conf 99999998 No 93 >cd04625 CBS_pair_12 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.21 E-value=2.3e-06 Score=61.56 Aligned_cols=67 Identities=13% Similarity=0.223 Sum_probs=51.1 Q ss_pred HHHHHHHHH----CCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHH Q ss_conf 999999873----04775764131200129998630103677897986598179984188664224102999998 Q gi|254780426|r 74 EKEIFNNIL----RFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVIS 144 (320) Q Consensus 74 E~~ii~~vl----~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~ 144 (320) |+++++.+. ++.+.+|+++|++ ++++++.++++.++.+.|.++++.++||.+++ +++|+++.+|+++ T Consensus 41 ~rDi~~~~~~~~~~~~~~~V~~iMt~--~~~tv~~~~~i~~a~~~M~~~~i~~lpVvd~g--~lvGiit~~Di~r 111 (112) T cd04625 41 FREVLQAMAQHGAGVLDTTVRAIMNP--EPIVASPDDSIDEVRRLMVERHLRYLPVLDGG--TLLGVISFHDVAK 111 (112) T ss_pred HHHHHHHHHHCCCCCCCCCHHHHEEC--CCEEECCCCHHHHHHHHHHHCCCEEEEEEECC--EEEEEEEHHHHHC T ss_conf 79999999970998011708883348--97799899969999997686697699999999--9999999799635 No 94 >TIGR00393 kpsF sugar isomerase, KpsF/GutQ family; InterPro: IPR004800 This is a family of closely related proteins with the phosphosugar-binding domain SIS (Sugar ISomerase) followed by two copies of the CBS (named after Cystathionine Beta Synthase) domain. The group includes GutQ, a protein of the glucitol operon and KpsF, a virulence factor involved in capsular polysialic acid biosynthesis in some pathogenic strains of Escherichia coli.; GO: 0016853 isomerase activity, 0005975 carbohydrate metabolic process. Probab=98.20 E-value=1.1e-06 Score=63.71 Aligned_cols=135 Identities=13% Similarity=0.206 Sum_probs=107.9 Q ss_pred CCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCC Q ss_conf 89999999999799768989899999998730477576413120012999863010367789798659817998418866 Q gi|254780426|r 52 SHHKGIRLPRIDASLEGIFSETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLD 131 (320) Q Consensus 52 ~~~eel~~l~~~~~~~G~i~~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d 131 (320) -++||... ++-.|.|.. +++ .+|+|+|.-+.++..+-..+++.+++-.+.+.+-----||+++. T Consensus 135 F~~eDFA~----~HPGG~LG~---kLL--------~kV~dlm~t~d~lP~v~~tas~~DAL~e~~~~~LG~~~v~~~~~- 198 (272) T TIGR00393 135 FSEEDFAK----FHPGGALGR---KLL--------VKVKDLMQTDDELPLVAPTASFKDALLEMSRKRLGLAVVCDENE- 198 (272) T ss_pred CCHHHHHC----CCCCHHHHH---HHH--------HHHHHHHCCCCCCCCEECCCCCEEEEEEEECCCCCEEEEEECCC- T ss_conf 88424411----487046667---888--------87677640665678222367720210233137861799971555- Q ss_pred CCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCCC-CEEEC-CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEE Q ss_conf 422410299999876622356432233345767885156-57971-788512323432013565116540776620466 Q gi|254780426|r 132 NPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLIKN-ILFVP-SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLV 208 (320) Q Consensus 132 ~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~r~-~~~Vp-e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiV 208 (320) .+.||++==|+-+.+....... -+.+++++|.+ +..+. ....+.++++.|++++++.++|||+.|.+.|+| T Consensus 199 ~~~Gv~tDGD~RR~l~~~g~~~------l~~~v~~~mT~~p~~~~n~~~~l~~A~~~l~~~kI~~~~vvdd~nkl~G~~ 271 (272) T TIGR00393 199 QLKGVFTDGDLRRVLALLGGGA------LKKEVKDFMTLGPKTLKNSDELLVEALEFLKKRKITSLVVVDDENKLLGVL 271 (272) T ss_pred CEEEEEECCHHHHHHHHHCCHH------CCCCHHHHCCCCCEEEECHHHHHHHHHHHHHHCCCEEEEEECCCCEEEEEE T ss_conf 2467871465999998816600------023122210688813401456899999998727942899983897178753 No 95 >TIGR01137 cysta_beta cystathionine beta-synthase; InterPro: IPR005857 This model discriminates cystathionine beta-synthase, a protein found primarily in eukaryotes and carrying a C-terminal CBS domain from cysteine synthases. Cysteine synthase (O-acetylserine (thiol)-lyase) is the enzyme responsible for the formation of cysteine from O-acetyl-serine and hydrogen sulphide with the concomitant release of acetic acid - the function of many of these enzymes is unproven.; GO: 0004122 cystathionine beta-synthase activity, 0019343 cysteine biosynthetic process via cystathionine, 0005737 cytoplasm. Probab=98.20 E-value=9.5e-06 Score=57.52 Aligned_cols=130 Identities=23% Similarity=0.314 Sum_probs=99.8 Q ss_pred CCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEEC--------CCCCCCCCCEEHHHHHHHHHHCCCCCC Q ss_conf 047757641312001299986301036778979865981799841--------886642241029999987662235643 Q gi|254780426|r 83 RFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYK--------NSLDNPRGMVHMRDVISYISHMYAKTN 154 (320) Q Consensus 83 ~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~--------~~~d~iiGiv~~kDll~~~~~~~~~~~ 154 (320) .|.+.++++.-.| .++.+..+.++.+++++++++||+.+||.. .+. .|.|.|..+.+|..+...++... T Consensus 362 ~~~~~~v~~l~L~--~~v~~~~t~tv~~ai~iLre~GfdQ~PVv~pga~~P~veag-~v~G~v~l~~lL~~l~~~~a~~~ 438 (527) T TIGR01137 362 VFGNARVKDLHLP--ALVTVHPTETVGDAIEILREYGFDQLPVVKPGAAEPDVEAG-KVLGSVTLRELLSALFAKKAKLE 438 (527) T ss_pred HHCCCEEECCCCC--CCEEECCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC-EEEEEEEHHHHHHHHHHCCCCCC T ss_conf 4256364156688--73454676408999999997487307620778888972353-47887426778999984246755 Q ss_pred CCCCCCCCCHHHHC---------------CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCC------------CEEE Q ss_conf 22333457678851---------------56579717885123234320135651165407766------------2046 Q gi|254780426|r 155 NINLNIQLSESNLI---------------KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGG------------TDGL 207 (320) Q Consensus 155 ~~~~~~~~~l~~~~---------------r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~------------~~Gi 207 (320) +.-...-...+.+- .+-.-+-.+.++.+|-+.|.. .+.|+|++|.-. +.|| T Consensus 439 D~v~Gkyldfk~~~~Fndvssynenk~~~~~~~~i~~~~~l~~L~~~le~--~~~a~V~~e~~pyhstGkssqrq~~~gv 516 (527) T TIGR01137 439 DAVSGKYLDFKVMSRFNDVSSYNENKSKKKKFIQIGEGEKLADLSKFLEK--NSSAIVVEEGKPYHSTGKSSQRQMVIGV 516 (527) T ss_pred CCCCCCCHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHCC--CCCEEEEECCCCCCCCCCCCCCEEEEEE T ss_conf 44343000344453210100010102467676544677518999998516--9618997078232125654243278986 Q ss_pred EEHHHHHHHH Q ss_conf 6758899998 Q gi|254780426|r 208 VSYEDIVSVL 217 (320) Q Consensus 208 VTleDIle~i 217 (320) ||.-|||.-| T Consensus 517 VT~~DLL~fL 526 (527) T TIGR01137 517 VTKIDLLSFL 526 (527) T ss_pred EEHHHHHHHC T ss_conf 4412244312 No 96 >cd04636 CBS_pair_23 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.19 E-value=5.2e-06 Score=59.25 Aligned_cols=113 Identities=13% Similarity=0.228 Sum_probs=64.6 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCCH Q ss_conf 56579717885123234320135651165407766204667588999987422210000000012467733783045359 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTDL 248 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~l 248 (320) |.++.+++++++.++++.|++++.+.++|+|+.|.++|+||..|++..+........................-....++ T Consensus 1 rdvvtv~~~~tl~ea~~~m~~~~~~~lpVvd~~~~lvGiit~~Dll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 80 (132) T cd04636 1 KDVITVKKDDTLRDVVEILLTGKISGVPVVDNEGRVVGIVSEGDLIRKIYKGKGLFYVTLLYSVIFLDESKIKKLLGKKV 80 (132) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHCCCCCCCH T ss_conf 99879689197999999999729978999959993999988899999986267620000000000002333102038999 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCHHHHHHH-HHCCCCC Q ss_conf 99999828888877774674317999999-8423473 Q gi|254780426|r 249 EELAKIIGTDCNCLKGEQDVDSLGGLIFS-VLDRIPA 284 (320) Q Consensus 249 ~~l~~~l~~~l~~~~~~~~~~Ti~G~i~~-~l~~iP~ 284 (320) .++ +..+..+..++.....+.-++.+ .+.++|- T Consensus 81 ~di---Mt~~~~tv~~~~~l~~a~~~m~~~~i~~lpV 114 (132) T cd04636 81 EEI---MTKKVITVDEDTTIEDVARIMSKKNIKRLPV 114 (132) T ss_pred HHH---CCCCCEEECCCCCHHHHHHHHHHCCCCEEEE T ss_conf 996---7899879999794999999999629878999 No 97 >cd04593 CBS_pair_EriC_assoc_bac_arch This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in bacteria and archaea. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS d Probab=98.17 E-value=8.8e-06 Score=57.74 Aligned_cols=49 Identities=22% Similarity=0.388 Sum_probs=39.6 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHH Q ss_conf 6579717885123234320135651165407766204667588999987 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLM 218 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~iv 218 (320) ||++|++++++.++++.|.+++.+.++|+|+.|.++||||..|++..+. T Consensus 2 pP~tV~~~~tv~ea~~~m~~~~~~~~pVvD~~g~lvGIvT~~Dl~r~~~ 50 (115) T cd04593 2 PPPVLSATTPLREAAEQLIESKHGSALVVDRDGGVVGIITLPDLLRALE 50 (115) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHHH T ss_conf 7838899199999999998749989999949998899988999999997 No 98 >cd04622 CBS_pair_9 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=98.16 E-value=1.7e-06 Score=62.39 Aligned_cols=60 Identities=18% Similarity=0.252 Sum_probs=43.0 Q ss_pred HCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHH Q ss_conf 304775764131200129998630103677897986598179984188664224102999998 Q gi|254780426|r 82 LRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVIS 144 (320) Q Consensus 82 l~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~ 144 (320) ..+...+|+++|++ ++++++.++++.++++.+.+++++++||.+++. +++|+++..|+++ T Consensus 53 ~~~~~~~v~~iM~~--~~i~v~~~~~l~~a~~~m~~~~i~~lpVvd~~~-~~vGiIt~~Dilr 112 (113) T cd04622 53 RDPDTTTVGDVMTR--GVVTVTEDDDVDEAARLMREHQVRRLPVVDDDG-RLVGIVSLGDLAR 112 (113) T ss_pred CCHHHCCHHHHHCC--CCEEEECCCCHHHHHHHHHHCCCCEEEEEECCC-EEEEEEEHHHHHC T ss_conf 98200776786415--767998899879999999857965899990899-8999999799547 No 99 >cd04587 CBS_pair_CAP-ED_DUF294_PBI_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with either the CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain or the PB1 (Phox and Bem1p) domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. The PB1 domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pai Probab=98.16 E-value=3.3e-06 Score=60.54 Aligned_cols=58 Identities=14% Similarity=0.192 Sum_probs=31.9 Q ss_pred CCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHH Q ss_conf 0477576413120012999863010367789798659817998418866422410299999 Q gi|254780426|r 83 RFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVI 143 (320) Q Consensus 83 ~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll 143 (320) +..+.+|+++|++ ++.+++.++++.++++.+.+++++++||.+++. +++|+++..||+ T Consensus 54 ~~~~~~v~~iMt~--~~~tv~~~~~l~~a~~~m~~~~i~~lpVvd~~~-~~vGiit~~Dil 111 (113) T cd04587 54 DPESTLVERVMTP--NPVCATSDTPVLEALHLMVQGKFRHLPVVDKSG-QVVGLLDVTKLT 111 (113) T ss_pred CHHHCEEEEEECC--CCEEEECCCCHHHHHHHHHHCCCEEEEEEECCC-EEEEEEEHHHHH T ss_conf 8320898866616--846990799999999999877941999992699-899999848852 No 100 >cd04619 CBS_pair_6 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=98.15 E-value=2.3e-06 Score=61.59 Aligned_cols=69 Identities=14% Similarity=0.312 Sum_probs=42.9 Q ss_pred HHHHHHHHHHC----CCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHH Q ss_conf 99999998730----4775764131200129998630103677897986598179984188664224102999998 Q gi|254780426|r 73 TEKEIFNNILR----FRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVIS 144 (320) Q Consensus 73 ~E~~ii~~vl~----l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~ 144 (320) .|++++..+.. ..+.+|+++|| .+++++..++++.++++.+.++++.++||++++. .++|+++..|+++ T Consensus 41 Te~Di~r~~~~~~~~~~~~~V~~vMt--~~vitv~~~~~i~~a~~~M~~~~i~~lPVVD~~~-~~vGiit~~Dilk 113 (114) T cd04619 41 TKTDVVRQMGRCGGPGCTAPVENVMT--RAVVSCRPGDLLHDVWQVMKQRGLKNIPVVDENA-RPLGVLNARDALK 113 (114) T ss_pred EHHHHHHHHHHCCCCCCCCCHHHHCC--CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECCC-EEEEEEEHHHEEC T ss_conf 94999999873079866789999615--8988999989099999988876985889990799-5999999354542 No 101 >cd04640 CBS_pair_27 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.14 E-value=2.9e-06 Score=60.93 Aligned_cols=47 Identities=17% Similarity=0.382 Sum_probs=39.1 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 65797178851232343201356511654077662046675889999 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) .|+.|.+++++.++++.|++++....+|+|+.|.++||||..|++.+ T Consensus 2 ~Pvtv~~dtti~eA~~~M~~~~v~~l~VvD~~~~lvGIiT~~Dl~~~ 48 (126) T cd04640 2 KPIVIPADTSIDEALELMIKHGVRLLLVVDSDDNFIGVITAVDLLGE 48 (126) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEECCCCCCC T ss_conf 89696899959999999998397679898799979999976300121 No 102 >cd04630 CBS_pair_17 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.14 E-value=3.7e-06 Score=60.23 Aligned_cols=67 Identities=15% Similarity=0.330 Sum_probs=49.2 Q ss_pred HHHHHHHHH----CCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHH Q ss_conf 999999873----04775764131200129998630103677897986598179984188664224102999998 Q gi|254780426|r 74 EKEIFNNIL----RFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVIS 144 (320) Q Consensus 74 E~~ii~~vl----~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~ 144 (320) +++++.+++ +..+.+|+++|++ .+.+++.++++.++++.+.+++..++||.+++ +++|+++.+|++. T Consensus 43 ~~Di~~~~~~~~~~~~~~~V~~vMt~--~v~tv~~d~~~~~a~~~m~~~~i~~lpVvd~~--~lvGiit~~Dil~ 113 (114) T cd04630 43 MRDILKKVVAEGRDPDRVNVYEIMTK--PLISVSPDMDIKYCARLMERTNIRRAPVVENN--ELIGIISLTDIFL 113 (114) T ss_pred EHHHHHHHHHCCCCCCCCCHHHHCCC--CEEEECCCCCHHHHHHHHHHCCCCEEEEEECC--EEEEEEEHHHHHC T ss_conf 18989999846789444498682703--54999994949999999997797299999999--9999997689647 No 103 >cd04633 CBS_pair_20 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.14 E-value=4.2e-06 Score=59.86 Aligned_cols=49 Identities=27% Similarity=0.285 Sum_probs=29.0 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHH Q ss_conf 56579717885123234320135651165407766204667588999987 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLM 218 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~iv 218 (320) ||++.+.+++++.++++.|++++.+-.+|+|+ |.++|++|..|++..+. T Consensus 1 r~vvtv~~~~tl~ea~~~m~~~~i~~lpVvd~-~~~vGiiT~~Di~~~l~ 49 (121) T cd04633 1 KPVITVSPDDRVSHARRLMLDHDISRLPVIEG-GKLVGIVTEKDIADALR 49 (121) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC-CEEEEEEEHHHHHHHHH T ss_conf 99479799398999999999709988999989-99887875999999997 No 104 >cd04624 CBS_pair_11 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.14 E-value=1e-05 Score=57.37 Aligned_cols=83 Identities=18% Similarity=0.293 Sum_probs=58.8 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCCH Q ss_conf 56579717885123234320135651165407766204667588999987422210000000012467733783045359 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTDL 248 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~l 248 (320) ||++.+++++++.++++.|++++.+.++|+|+.|.++|++|..|++..+...... ..+-..-....-..++...++ T Consensus 1 r~vvtv~~~~tl~ea~~~m~~~~~~~~~Vvd~~~~lvGiiT~~Di~~~~~~~~~~----~~~v~~~m~~~~~tv~~~~~l 76 (112) T cd04624 1 RPVVTVDPDTSIREAAKLMAEENVGSVVVVDPDERPIGIVTERDIVRAVAAGIDL----DTPVSEIMTRDLVTVDPDEPV 76 (112) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEECCHHHHHHHCCCCC----CCCEEEEECCCCEEECCCCCH T ss_conf 9978978939599999999862998899993998299999820458898648743----442045651671678799989 Q ss_pred HHHHHHH Q ss_conf 9999982 Q gi|254780426|r 249 EELAKII 255 (320) Q Consensus 249 ~~l~~~l 255 (320) .+..+.+ T Consensus 77 ~~a~~~m 83 (112) T cd04624 77 AEAAKLM 83 (112) T ss_pred HHHHHHH T ss_conf 9999999 No 105 >cd04589 CBS_pair_CAP-ED_DUF294_assoc_bac This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the bacterial CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or Probab=98.14 E-value=3.4e-06 Score=60.46 Aligned_cols=55 Identities=15% Similarity=0.302 Sum_probs=28.7 Q ss_pred CEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHH Q ss_conf 75764131200129998630103677897986598179984188664224102999998 Q gi|254780426|r 86 KIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVIS 144 (320) Q Consensus 86 ~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~ 144 (320) +.+|+++|+ .++.+++.++++.++++.|.+++.+++||.+++ +++|+++.+||+. T Consensus 56 ~~~V~~vms--~~~~ti~~~~~~~~a~~~M~~~~i~~lpV~d~~--~~vGivt~~Dil~ 110 (111) T cd04589 56 STPVGEIAT--FPLITVDPDDFLFNALLLMTRHRIHRVVVREGG--EVVGVLEQTDLLS 110 (111) T ss_pred CCCHHHHCC--CCCEEECCCCCHHHHHHHHHHCCCEEEEEEECC--EEEEEEEHHHHHC T ss_conf 889999287--999899997929999999998798088999899--9999999688228 No 106 >cd04613 CBS_pair_SpoIVFB_EriC_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with either the SpoIVFB domain (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) or the chloride channel protein EriC. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A ), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. EriC is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase a Probab=98.11 E-value=1.5e-05 Score=56.32 Aligned_cols=83 Identities=23% Similarity=0.347 Sum_probs=52.8 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCC-CCCEEEEECCCC Q ss_conf 5657971788512323432013565116540776620466758899998742221000000001246-773378304535 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAV-SDNTFIVDARTD 247 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~-~~g~~~v~G~~~ 247 (320) |.++.+++++++.++++.|.+++.+..+|+|+.|.+.|+||..|++..+.-.-. .....+... ...-..++-..+ T Consensus 1 rdvvtv~~~~tl~ea~~~m~~~~~~~~pVvd~~~~lvGiit~~Dl~~~l~~~~~----~~~~~v~~im~~~~~~v~~~~~ 76 (114) T cd04613 1 RDVVTIPEDTPLNELLDVIAHSPENNFPVVDDDGRLVGIVSLDDIREILFDPSL----YDLVVASDIMTKPPVVVYPEDS 76 (114) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHHHHCCC----CCCCEEEHHCCCCCEEEECCCC T ss_conf 997695890989999999986199789999289949999999999999970566----5572931203629659913678 Q ss_pred HHHHHHHH Q ss_conf 99999982 Q gi|254780426|r 248 LEELAKII 255 (320) Q Consensus 248 l~~l~~~l 255 (320) +.+..+.| T Consensus 77 l~~a~~~m 84 (114) T cd04613 77 LEDALKKF 84 (114) T ss_pred HHHHHHHH T ss_conf 89999999 No 107 >cd04623 CBS_pair_10 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.11 E-value=1.5e-05 Score=56.14 Aligned_cols=51 Identities=24% Similarity=0.360 Sum_probs=28.6 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHH Q ss_conf 565797178851232343201356511654077662046675889999874 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMR 219 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivG 219 (320) |+++++++++++.++++.|.+++.+-++|+|+.|.+.|++|..|++..+.. T Consensus 1 r~vvti~~~~tl~ea~~~m~~~~~~~l~Vvd~~~~l~Giit~~Di~~~~~~ 51 (113) T cd04623 1 RDVITVRPDATVAEAAKLMAEKNIGAVVVVDDGGRLVGIFSERDIVRKVAL 51 (113) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHHHH T ss_conf 997696893979999999997399889999089829999976899999986 No 108 >cd04586 CBS_pair_BON_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the BON (bacterial OsmY and nodulation domain) domain. BON is a putative phospholipid-binding domain found in a family of osmotic shock protection proteins. It is also found in some secretins and a group of potential haemolysins. Its likely function is attachment to phospholipid membranes. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=98.09 E-value=3.7e-06 Score=60.22 Aligned_cols=51 Identities=20% Similarity=0.257 Sum_probs=36.6 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHH Q ss_conf 565797178851232343201356511654077662046675889999874 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMR 219 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivG 219 (320) ++++.+++++++.++++.|++++.+..+|+|+.|.+.|+||..|++..+.. T Consensus 2 ~~vitv~~~~tl~~a~~~m~~~~i~~lpVvd~~~~lvGiit~~Dil~~~~~ 52 (135) T cd04586 2 TDVVTVSPETSVAEAARLMLDNHISGLPVVDDDGRLVGIVSEGDLLRRAEL 52 (135) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCEEEEEEEHHHHHHHHHC T ss_conf 999899993939999999997399679999089979999981897644541 No 109 >cd04626 CBS_pair_13 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.09 E-value=3.1e-06 Score=60.67 Aligned_cols=56 Identities=20% Similarity=0.357 Sum_probs=37.7 Q ss_pred CCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHH Q ss_conf 477576413120012999863010367789798659817998418866422410299999 Q gi|254780426|r 84 FRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVI 143 (320) Q Consensus 84 l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll 143 (320) ..+.+|+++|+ .++.++..++++.++++.|.+++.+|+||.+++ .++|+++.+|++ T Consensus 54 ~~~~~V~~iMt--~~~itv~~~~~~~~a~~~M~~~~i~~LPVvd~~--klvGiit~~Dil 109 (111) T cd04626 54 FLEKKVFNIVS--QDVFYVNEEDTIDEALDIMREKQIGRLPVVDDN--KLIGVVRTKDIL 109 (111) T ss_pred CCCCCHHHHCC--CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECC--EEEEEEEHHHHC T ss_conf 77888899606--998799997939999999998099889999999--999999969814 No 110 >TIGR01186 proV glycine betaine/L-proline transport ATP binding subunit; InterPro: IPR005892 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family comprises the glycine betaine/L-proline ATP binding subunit in bacteria and its equivalents in archaea. This transport system belong to the larger ATP-Binding Cassette (ABC) transporter superfamily. The characteristic feature of these transporters is the obligatory coupling of ATP hydrolysis to substrate translocation. Functionally, this transport system is involved in osmoregulation. Under conditions of stress, the organism recruits this transport system to accumulate glycine betaine and other solutes which offer osmo-protection. It has been demonstrated that glycine betaine uptake is accompanied by symport with sodium ions. The locus has been named variously as proU or opuA. A gene library from L. lactis functionally complements an Escherichia coli proU mutant. The complementing locus is similar to a opuA locus in Bacillus subtlis. This clarifies the differences in nomenclature.; GO: 0005524 ATP binding, 0015171 amino acid transmembrane transporter activity, 0006865 amino acid transport, 0016020 membrane. Probab=98.08 E-value=3e-05 Score=54.28 Aligned_cols=120 Identities=17% Similarity=0.192 Sum_probs=98.1 Q ss_pred EEHHEEECC-CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHH Q ss_conf 764131200-1299986301036778979865981799841886642241029999987662235643223334576788 Q gi|254780426|r 88 RIDDIMISR-VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESN 166 (320) Q Consensus 88 ~v~diMtPr-~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~ 166 (320) +|++||.-. +..+...++.++..|++++++.+.+.+-|.+... ...|+|.+.++-.+.... ..+.+ T Consensus 243 ~~~~i~~~~~~~~i~~t~~~gp~~Al~Lm~~~~~~s~yvv~~~~-~l~G~v~~~~~~~a~~~~------------~~~~~ 309 (372) T TIGR01186 243 DAERIAKRMNTVPITKTADKGPRSALKLMRDERVDSLYVVDRQN-KLVGVVDVESIKQARKKA------------QSLQD 309 (372) T ss_pred CHHHHHHHCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEEECCC-EEEEEEEHHHHHHHHHHH------------HHHHH T ss_conf 88999965277326865898878999999865971799997285-488778588999888787------------78999 Q ss_pred HC-CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHH Q ss_conf 51-5657971788512323432013565116540776620466758899998742 Q gi|254780426|r 167 LI-KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRD 220 (320) Q Consensus 167 ~~-r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGe 220 (320) ++ ..++.|++.+.+.+++..--+..+....||||.-...||||..-++..+-|. T Consensus 310 ~l~~~~~tV~~~t~L~e~~~~v~~~~~~YvpVVDE~~~~~GIv~r~~L~~~l~d~ 364 (372) T TIGR01186 310 VLIDDILTVDEGTLLRELLRKVLKAGIKYVPVVDEDQRLVGIVTRASLVDALYDS 364 (372) T ss_pred HHHHCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHHHCC T ss_conf 8620011105876048999998507995666661456458676477899987166 No 111 >cd04602 CBS_pair_IMPDH_2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the inosine 5' monophosphate dehydrogenase (IMPDH) protein. IMPDH is an essential enzyme that catalyzes the first step unique to GTP synthesis, playing a key role in the regulation of cell proliferation and differentiation. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in IMPDH have been associated with retinitis pigmentos Probab=98.08 E-value=4.7e-06 Score=59.54 Aligned_cols=61 Identities=13% Similarity=0.169 Sum_probs=42.9 Q ss_pred CCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHH Q ss_conf 04775764131200129998630103677897986598179984188664224102999998 Q gi|254780426|r 83 RFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVIS 144 (320) Q Consensus 83 ~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~ 144 (320) .-.+.++.++||++...++.+.+.+++++.+.|.++++.++||.+++. +++|+++.+|+++ T Consensus 53 ~~~~~~v~~iMt~~~~~i~~~~~~~l~ea~~~m~~~~i~~LPVVd~~g-~LvGiIT~~Di~k 113 (114) T cd04602 53 TDSETPLSEVMTPREVLVVAPTGITLEEANEILRESKKGKLPIVNDDG-ELVALVTRSDLKK 113 (114) T ss_pred CCCCCCCCCEEECCCEEEEECCCCCHHHHHHHHHHCCCCEEEEECCCC-EEEEEEECHHHCC T ss_conf 076765635165276689947987899999999974966781997899-6999999244036 No 112 >cd04637 CBS_pair_24 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.07 E-value=5.1e-06 Score=59.32 Aligned_cols=48 Identities=13% Similarity=0.257 Sum_probs=30.4 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHH Q ss_conf 6579717885123234320135651165407766204667588999987 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLM 218 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~iv 218 (320) +++.+.+++++.++++.|++++.+..+|+|+ |.++|+||..|++..+. T Consensus 2 ~viTv~p~~tl~~a~~~m~~~~i~~lpVvd~-~~lvGivT~~Di~~~~~ 49 (122) T cd04637 2 RVVTVEMDDRLEEVREIFEKHKFHHLLVVED-NELVGVISDRDYLKAIS 49 (122) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHHHHH T ss_conf 7499889696999999998749988999989-99999988899998750 No 113 >cd04621 CBS_pair_8 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=98.07 E-value=3.1e-06 Score=60.69 Aligned_cols=47 Identities=17% Similarity=0.387 Sum_probs=25.4 Q ss_pred CEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHH Q ss_conf 57971788512323432013565116540776620466758899998 Q gi|254780426|r 171 ILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVL 217 (320) Q Consensus 171 ~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~i 217 (320) ++.|++++++.++++.|.+++.+-++|+|+.|.++||||..||+..+ T Consensus 3 Vitv~p~~tl~ea~~~m~~~~i~~lpVvd~~g~lvGivT~~Dl~~~~ 49 (135) T cd04621 3 IATVHPEHSLLHVVDEMEKNGVGRVIVVDDNGKPVGVITYRDLAFAE 49 (135) T ss_pred CEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHH T ss_conf 68989949799999999874997799995999399999879988666 No 114 >cd04600 CBS_pair_HPP_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the HPP motif domain. These proteins are integral membrane proteins with four transmembrane spanning helices. The function of these proteins is uncertain, but they are thought to be transporters. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=98.07 E-value=2.3e-05 Score=54.99 Aligned_cols=52 Identities=17% Similarity=0.303 Sum_probs=47.8 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHH Q ss_conf 5657971788512323432013565116540776620466758899998742 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRD 220 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGe 220 (320) |+++.+++++++.++++.|++++.+..+|+|+.|.++|++|..|++..+... T Consensus 2 rdvvtv~~~~tl~~a~~~m~~~~~~~lpVvd~~g~lvGiit~~Dll~~~~~~ 53 (124) T cd04600 2 RDVVTVTPDTSLEEAWALLRRHRIKALPVVDGDRRLVGIVTQRDLLRHARPD 53 (124) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCEEEEEEHHHHHHHHHCC T ss_conf 8969989939799999999972997899995698088885699999877504 No 115 >cd04803 CBS_pair_15 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.05 E-value=2.7e-05 Score=54.52 Aligned_cols=53 Identities=21% Similarity=0.358 Sum_probs=48.0 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHH Q ss_conf 56579717885123234320135651165407766204667588999987422 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDI 221 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei 221 (320) .|++.+++++++.++++.|.+++.+..+|+|+.|.++|+||..|++..+.... T Consensus 1 sPvvtv~~~~tl~~a~~~m~~~~i~~lpVvd~~~~lvGiit~~Dil~~~~~~~ 53 (122) T cd04803 1 SPVVTLSEDDSLADAEELMREHRIRHLPVVNEDGKLVGLLTQRDLLRAALSSL 53 (122) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHHHHCC T ss_conf 99199989699999999999749978999908986888965999999998437 No 116 >cd04615 CBS_pair_2 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=98.04 E-value=2.2e-05 Score=55.13 Aligned_cols=49 Identities=16% Similarity=0.190 Sum_probs=39.6 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHH Q ss_conf 6579717885123234320135651165407766204667588999987 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLM 218 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~iv 218 (320) +|+++++++++.++++.|.+++.+.++|+|+.|.+.|++|..|++..+. T Consensus 2 ~P~tv~~d~~l~~A~~~m~~~~~~~~pVvd~~~~lvGivT~~Di~~~~~ 50 (113) T cd04615 2 KPSCVVLNTDIARAVAEMYTSGSRALPVVDDKKRLVGIITRYDVLSYAL 50 (113) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHHH T ss_conf 5959899699999999999739978999948997999999599999998 No 117 >cd04641 CBS_pair_28 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.04 E-value=2.5e-05 Score=54.82 Aligned_cols=57 Identities=21% Similarity=0.268 Sum_probs=50.0 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHC Q ss_conf 565797178851232343201356511654077662046675889999874222100 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEH 225 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~ 225 (320) |+++++.+++++.++++.|.+++..-+.|||+.|.++|++|..|++..+.+...+.. T Consensus 1 r~Vitv~pd~~l~da~~lm~~~~is~lPVVD~~g~lvGiis~~Dl~~~~~~~~~~~~ 57 (120) T cd04641 1 KNIATARPDTPLIDVLDMLVERRVSALPIVDENGKVVDVYSRFDVINLAKEGAYNNL 57 (120) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHHHCCCCCCC T ss_conf 997898998999999999998098669998789969899759999988751554434 No 118 >cd04594 CBS_pair_EriC_assoc_archaea This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the EriC CIC-type chloride channels in archaea. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS do Probab=98.01 E-value=8.6e-06 Score=57.81 Aligned_cols=56 Identities=21% Similarity=0.324 Sum_probs=46.5 Q ss_pred CCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHH Q ss_conf 775764131200129998630103677897986598179984188664224102999998 Q gi|254780426|r 85 RKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVIS 144 (320) Q Consensus 85 ~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~ 144 (320) ...+++++|+ .++.++..+++++++++.|.++++.|+||.+++ +++|+++.+|++. T Consensus 48 ~~~~v~d~Mt--~~v~tv~p~~~l~ea~~lM~~~~i~~lPVvd~g--klvGIIT~~Dil~ 103 (104) T cd04594 48 TYGDVVDYIV--RGIPYVRLTSTAEEAWEVMMKNKTRWCPVVDDG--KFKGIVTLDSILD 103 (104) T ss_pred CCCCHHHEEE--CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECC--EEEEEEEHHHHHC T ss_conf 6998445476--398889998979999999998697878999899--9999999689537 No 119 >cd04598 CBS_pair_GGDEF_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the GGDEF (DiGuanylate-Cyclase (DGC)) domain. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=98.00 E-value=1.8e-05 Score=55.76 Aligned_cols=85 Identities=22% Similarity=0.358 Sum_probs=62.8 Q ss_pred CCCEEECCCCCHHHHHHHHHCC-CCCCEEEEECCCCCEEEEEHHHHHHHHHHHHH-HHCCCHHHCCCCCCCCEEEEECCC Q ss_conf 5657971788512323432013-56511654077662046675889999874222-100000000124677337830453 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQES-RIRMALVIDEHGGTDGLVSYEDIVSVLMRDIT-SEHHSKKSMISAVSDNTFIVDART 246 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~-~~~~aiVvDE~G~~~GiVTleDIle~ivGei~-de~d~~~~~i~~~~~g~~~v~G~~ 246 (320) ||+..+++++++.++++.|+++ ..+..+|+|+ |..+|+||..|+++.+.+.+. +.+.. .+--.-....-..++..+ T Consensus 1 rpv~tv~pd~t~~ev~~~f~~~~~i~~lpVvd~-~r~vGiisr~dl~~~~~~~~g~~l~~~-~pV~~~M~~~p~~v~~~~ 78 (119) T cd04598 1 RPAPTVSPDTTVNDVLERFERDPDLSALAVVDD-GRPVGLIMREALMELLSTPYGRALYGK-KPVSEVMDPDPLIVEADT 78 (119) T ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCCEEEEEEC-CEEEEEEEHHHHHHHHCCCCCHHHHCC-CCHHHHCCCCCEEECCCC T ss_conf 998574999849999999987899676899879-988999989999998707432233038-958997168987998989 Q ss_pred CHHHHHHHH Q ss_conf 599999982 Q gi|254780426|r 247 DLEELAKII 255 (320) Q Consensus 247 ~l~~l~~~l 255 (320) +++++.+.+ T Consensus 79 ~i~~~~~~~ 87 (119) T cd04598 79 PLEEVSRLA 87 (119) T ss_pred CHHHHHHHH T ss_conf 699999999 No 120 >cd04643 CBS_pair_30 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.99 E-value=2.9e-05 Score=54.30 Aligned_cols=50 Identities=20% Similarity=0.267 Sum_probs=46.8 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHH Q ss_conf 65797178851232343201356511654077662046675889999874 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMR 219 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivG 219 (320) ++.++.++.++.++++.|++++.+-.+|+|+.|.++|+||..|++..+.+ T Consensus 2 ~Vvtv~~~~tl~~A~~~m~~~~i~~lPVvd~~gklvGiit~~Di~~~~~~ 51 (116) T cd04643 2 EVAYVQDTNTLRHALLVLTKHGYSAIPVLDKEGKYVGTISLTDILWKLKG 51 (116) T ss_pred CEEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHHHC T ss_conf 76999994929999999997499879898699949999889999998743 No 121 >cd04627 CBS_pair_14 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.99 E-value=4.4e-05 Score=53.17 Aligned_cols=51 Identities=22% Similarity=0.306 Sum_probs=46.4 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEEC-CCCCEEEEEHHHHHHHHHH Q ss_conf 56579717885123234320135651165407-7662046675889999874 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDE-HGGTDGLVSYEDIVSVLMR 219 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE-~G~~~GiVTleDIle~ivG 219 (320) .|.+.|+++.++.++++.|.+++.|..+|+|+ .|.++||||..||++.+.. T Consensus 1 ~Pfi~v~~~~sl~~a~~lm~~~~i~rlpVvd~~~g~lvGIiT~~dil~~l~~ 52 (123) T cd04627 1 EPFIPVPSTASLFQAIEILGSGGIHRVAVTEEESGEVIGILSQRRLVEFLWE 52 (123) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEHHHHHHHHHH T ss_conf 9964848989299999999749977799995899949999997999999986 No 122 >cd04584 CBS_pair_ACT_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the acetoin utilization proteins in bacteria. Acetoin is a product of fermentative metabolism in many prokaryotic and eukaryotic microorganisms. They produce acetoin as an external carbon storage compound and then later reuse it as a carbon and energy source during their stationary phase and sporulation. In addition these CBS domains are associated with a downstream ACT domain, which is linked to a wide range of metabolic enzymes that are regulated by amino acid concentration. Pairs of ACT domains bind specifically to a particular amino acid leading to regulation of the linked enzyme. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The in Probab=97.97 E-value=3.3e-05 Score=53.98 Aligned_cols=49 Identities=22% Similarity=0.436 Sum_probs=39.3 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHH Q ss_conf 5657971788512323432013565116540776620466758899998 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVL 217 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~i 217 (320) |+++.|++++++.++++.|++++.+.++|+|+.|.+.|++|..|++... T Consensus 1 r~vvtv~~~~sl~~a~~lm~~~~~~~lpVvd~~~~lvGiit~~Di~~~~ 49 (121) T cd04584 1 RDVVTITPTTTIAEALELMREHKIRHLPVVDEEGRLVGIVTDRDLRDAS 49 (121) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHH T ss_conf 9988968929899999999972998899990899399996589987522 No 123 >KOG1764 consensus Probab=97.95 E-value=0.00011 Score=50.45 Aligned_cols=185 Identities=15% Similarity=0.126 Sum_probs=120.9 Q ss_pred CCHHHHHHHHHHHCCCCEEE--HHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHH Q ss_conf 89899999998730477576--4131200129998630103677897986598179984188664224102999998766 Q gi|254780426|r 70 FSETEKEIFNNILRFRKIRI--DDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYIS 147 (320) Q Consensus 70 i~~~E~~ii~~vl~l~~~~v--~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~ 147 (320) ...+....+....-...+.+ ..-++++...+++....+.-++...+.++++.|+||.+...-++..++..+-++.++. T Consensus 133 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~p~~s~l~~~~~l~~~~~~rvpv~d~~~~~v~~ilt~~rIl~~l~ 212 (381) T KOG1764 133 SSLDNIEVLEDSQLSKRREVECLLKETLKPPFVSISPESSLLDAVLLLIKSRIHRVPVIDPETGEVLYILTQRRILKFLW 212 (381) T ss_pred CCHHHHHHHHHHHCCCCCEEEEEEECCCCCCCEEECCCHHHHHHHHHHCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHH T ss_conf 75677765302201455124432104446774130786889997764235772475233656554211120999999998 Q ss_pred HCCCCCCCCCCCCCCCHHHH----CCCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHH Q ss_conf 22356432233345767885----15657971788512323432013565116540776620466758899998742221 Q gi|254780426|r 148 HMYAKTNNINLNIQLSESNL----IKNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITS 223 (320) Q Consensus 148 ~~~~~~~~~~~~~~~~l~~~----~r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~d 223 (320) ....... ...-...++.++ ...+.++.+++++.++++.|..++..-..|||++|+.+|.++..|+.-.+...-.+ T Consensus 213 ~~~~~~~-~~~~l~~s~~dl~ig~~~~i~~i~~~~~v~~al~~m~~~~is~lpvV~~~g~~v~~~s~~Dv~~l~~~~~~~ 291 (381) T KOG1764 213 LNGRLLP-LPSLLSKSLSDLGIGTWSNIASISEDTPVIEALKIMSERRISALPVVDENGKKVGNYSRFDVIHLAREGTYN 291 (381) T ss_pred HCCCCCC-CHHHHHCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCEECCCCCEECCEEEEHHHHHHHCCCCC T ss_conf 5145644-224551887664047787777751787089999999870457443475778553215620012156427767 Q ss_pred HCCCH---H-HCC-CCCCCCEEEEECCCCHHHHHHHH Q ss_conf 00000---0-001-24677337830453599999982 Q gi|254780426|r 224 EHHSK---K-SMI-SAVSDNTFIVDARTDLEELAKII 255 (320) Q Consensus 224 e~d~~---~-~~i-~~~~~g~~~v~G~~~l~~l~~~l 255 (320) -.+.. + ... ....++.+.|.-.-++..+-+.+ T Consensus 292 ~~~~~~l~~~~~~~~~~~~~vvtc~~~ssL~~vi~~l 328 (381) T KOG1764 292 NLDLSCLSEALSHRPIRFEGVVTCRPTSTLAEVIDKL 328 (381) T ss_pred CCCHHHHHHHHHCCCCCCCCCEEEEECCCHHHHHHHH T ss_conf 7773242135331554568737984377499999999 No 124 >cd04595 CBS_pair_DHH_polyA_Pol_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with an upstream DHH domain which performs a phosphoesterase function and a downstream polyA polymerase domain. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.94 E-value=1.5e-05 Score=56.31 Aligned_cols=49 Identities=14% Similarity=0.162 Sum_probs=32.2 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHH Q ss_conf 56579717885123234320135651165407766204667588999987 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLM 218 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~iv 218 (320) .|++.+++++++.++++.|.+++.+-++|+|. |.+.|++|..|++..+. T Consensus 2 spV~ti~p~~tv~ea~~~m~~~~i~~lpVvd~-~~lvGivT~~Di~~~~~ 50 (110) T cd04595 2 SPVKTVRPEATIEEARELLLRYGHTALPVVEG-GRVVGIISRRDVEKALR 50 (110) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC-CEEEEEEEHHHHHHHHH T ss_conf 89899799599999999999759988999989-99999997689887765 No 125 >cd04604 CBS_pair_KpsF_GutQ_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with KpsF/GutQ domains in the API [A5P (D-arabinose 5-phosphate) isomerase] protein. These APIs catalyze the conversion of the pentose pathway intermediate D-ribulose 5-phosphate into A5P, a precursor of 3-deoxy-D-manno-octulosonate, which is an integral carbohydrate component of various glycolipids coating the surface of the outer membrane of Gram-negative bacteria, including lipopolysaccharide and many group 2 K-antigen capsules. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other funct Probab=97.94 E-value=1.2e-05 Score=56.88 Aligned_cols=83 Identities=17% Similarity=0.246 Sum_probs=51.4 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCCHH Q ss_conf 65797178851232343201356511654077662046675889999874222100000000124677337830453599 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTDLE 249 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~l~ 249 (320) .+++|.+++++.+++..|.+++.+.++|+|+.|.+.|++|..||...+......... .-..+- ...-+.+....++. T Consensus 3 ~vp~V~~~~si~ea~~~m~~~~~~~v~Vvd~~~~lvGiiT~~Dl~r~~~~~~~~~~~-~v~~iM--~~~~~tv~~~~~i~ 79 (114) T cd04604 3 ALPLVSPDTSLKDALLEMSRKGLGMTAVVDEDGRLVGIFTDGDLRRALEKGLDILTL-PVADVM--TRNPKTIDPDALAA 79 (114) T ss_pred CCCEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHHHCCCCCCCC-CHHHHH--CCCCEEEECCCCHH T ss_conf 477889949399999999976997899997999399999899999998716664567-974743--56758993898869 Q ss_pred HHHHHH Q ss_conf 999982 Q gi|254780426|r 250 ELAKII 255 (320) Q Consensus 250 ~l~~~l 255 (320) +..+.+ T Consensus 80 ~a~~~m 85 (114) T cd04604 80 EALELM 85 (114) T ss_pred HHHHHH T ss_conf 999999 No 126 >PRK07107 inositol-5-monophosphate dehydrogenase; Validated Probab=97.94 E-value=1.2e-05 Score=56.76 Aligned_cols=63 Identities=16% Similarity=0.078 Sum_probs=57.2 Q ss_pred CEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHC Q ss_conf 7576413120012999863010367789798659817998418866422410299999876622 Q gi|254780426|r 86 KIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHM 149 (320) Q Consensus 86 ~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~ 149 (320) +.+|+++|||+.++++.+...+++|+.+++.+++..++||.+++. +++|+++.||+....... T Consensus 160 ~~~V~~vMT~~~~Lvt~~~g~sl~eA~~lL~~~kiekLpvVd~~g-~L~gLiT~kDi~k~~~~P 222 (497) T PRK07107 160 DTKVKDFMTPFEKLVVAKVGITLKEANNIIWDHKLNTLPIIDKNQ-HLVYMVFRKDYDSHKENP 222 (497) T ss_pred CCCHHHHCCCCCCEEECCCCCCHHHHHHHHHHCCCCCEEEECCCC-CEEEEEEHHHHHHHHHCC T ss_conf 776656406765427846889999999999863444302783799-589999837888864396 No 127 >KOG0475 consensus Probab=97.94 E-value=4.2e-05 Score=53.31 Aligned_cols=130 Identities=12% Similarity=0.141 Sum_probs=94.2 Q ss_pred CEEEHHEEECCC-E--EEEEEE-CCCHHHHHHHHHHCCCCEEEEE-CCCCCCCCCCEEHHHHHHHHHHCCCCCC------ Q ss_conf 757641312001-2--999863-0103677897986598179984-1886642241029999987662235643------ Q gi|254780426|r 86 KIRIDDIMISRV-S--INAVED-KATVYEAMLMFEKYGRSWMPVY-KNSLDNPRGMVHMRDVISYISHMYAKTN------ 154 (320) Q Consensus 86 ~~~v~diMtPr~-~--i~~i~~-~~~~~e~~~~i~~~~~sR~PV~-~~~~d~iiGiv~~kDll~~~~~~~~~~~------ 154 (320) +.-+.++|-|+. + ...+.. +.+++++...+.++.|+-|||. +++....+|++..+|+...+........ T Consensus 542 ~t~~~~v~~p~~~~~~L~~i~~~s~tl~~le~~~~~t~~sgfpvvl~~~sq~lvGfv~rr~l~~~i~~ar~~q~~~~~~~ 621 (696) T KOG0475 542 STLAIPVMEPCRSESCLIVITQDSMTLEDLESLMEDTDFSGFPVVLSEDSQRLVGFVLRRNLFLAILNARKIQSFIVTTS 621 (696) T ss_pred CCHHHHHHCHHCCCHHHEECCCCCEEHHHHHHHHHHCCCCCCEEEECCCCCEEEEEECHHHHHHHHHHHCCCCCCCEECC T ss_conf 20334431310176222120556516899988886244278527971564226788715788898865300565413111 Q ss_pred ----------CCCCCCCCCHHHHCCC-CEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf ----------2233345767885156-5797178851232343201356511654077662046675889999 Q gi|254780426|r 155 ----------NINLNIQLSESNLIKN-ILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 155 ----------~~~~~~~~~l~~~~r~-~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) .........+++++.. |..|-..++...++..|++-... -+.|++.|...|+||-.|++.. T Consensus 622 ~~f~~~~~~~~~~~~~~~~lk~il~~tp~tv~d~tp~~~v~~~F~~lg~~-~~~v~~~G~l~Giitkkd~l~~ 693 (696) T KOG0475 622 IYFNDPSPSAVAGIPSRLDLKDILDMTPFTVTDLTPMETVVDLFRKLGLR-QILVTKNGILLGIITKKDCLRH 693 (696) T ss_pred CCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCHHHHHHHHHHHCCE-EEEECCCCEEEEEEEHHHHHHH T ss_conf 33678874345899987576320468760343458489999999861725-8998159826766534898886 No 128 >cd04632 CBS_pair_19 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.94 E-value=8.1e-06 Score=57.97 Aligned_cols=50 Identities=26% Similarity=0.444 Sum_probs=40.2 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHH Q ss_conf 65797178851232343201356511654077662046675889999874 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMR 219 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivG 219 (320) .+++|++++++.+++..|++++.+-.+|+|+.|.+.|+||..|++..++. T Consensus 2 dvvTv~~~~~l~~a~~lm~~~~i~~lPVvd~~g~lvGiiT~~Dl~~~~~~ 51 (128) T cd04632 2 DVITVREDDSVGKAINVLREHGISRLPVVDDNGKLTGIVTRHDIVDFVVR 51 (128) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHHHC T ss_conf 98698995929999999998399779999689978999988999998853 No 129 >cd04608 CBS_pair_PALP_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the pyridoxal-phosphate (PALP) dependent enzyme domain upstream. The vitamin B6 complex comprises pyridoxine, pyridoxal, and pyridoxamine, as well as the 5'-phosphate esters of pyridoxal (PALP) and pyridoxamine, the last two being the biologically active coenzyme derivatives. The members of the PALP family are principally involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and other amine-containing compounds. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a poten Probab=97.93 E-value=2.3e-05 Score=54.98 Aligned_cols=86 Identities=19% Similarity=0.262 Sum_probs=63.5 Q ss_pred CCCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCC Q ss_conf 15657971788512323432013565116540776620466758899998742221000000001246773378304535 Q gi|254780426|r 168 IKNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTD 247 (320) Q Consensus 168 ~r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~ 247 (320) +.||+.|.++.++.++++.|++++..-..|+|+.|.+.|+||..|++..++..-....+.-.. -.....-.+....+ T Consensus 1 l~~~vtv~p~~tv~~a~~~m~~~~i~~lPVvd~~g~~~G~vt~~~ll~~~~~~~~~~~~~V~~---vm~~~~~~v~~~~~ 77 (124) T cd04608 1 LKAPVTVLPTVTCAEAIEILKEKGFDQLPVVDESGKILGMVTLGNLLSSLSSGKVQPSDPVSK---ALYKQFKRVNKNDT 77 (124) T ss_pred CCCCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEEHHHHHHHHHCCCCCCCCCHHH---HHCCCCCCCCCCCC T ss_conf 999979899890999999999849985689879996888887999999997389999982899---74465651479985 Q ss_pred HHHHHHHHC Q ss_conf 999999828 Q gi|254780426|r 248 LEELAKIIG 256 (320) Q Consensus 248 l~~l~~~l~ 256 (320) +..+.+.+. T Consensus 78 l~~ls~~l~ 86 (124) T cd04608 78 LGKLSRILE 86 (124) T ss_pred HHHHHCCCC T ss_conf 567310367 No 130 >cd04800 CBS_pair_CAP-ED_DUF294_PBI_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with either the CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain or the PB1 (Phox and Bem1p) domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. The PB1 domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pa Probab=97.93 E-value=3.6e-05 Score=53.72 Aligned_cols=83 Identities=13% Similarity=0.215 Sum_probs=54.7 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCCH Q ss_conf 56579717885123234320135651165407766204667588999987422210000000012467733783045359 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTDL 248 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~l 248 (320) |||+.+++++++.++++.|.+++.+-++|+|+ |.+.|++|..|++..+...-.+... .-..+ .....+.++...++ T Consensus 1 r~~vtv~~~~tv~ea~~~m~~~~i~~v~V~~~-~~~vGiit~~Di~~~~~~~~~~~~~-~v~~i--m~~~~~tv~~~~~l 76 (111) T cd04800 1 RPPVTCSPDTTIREAARLMTEHRVSSLLVVDD-GRLVGIVTDRDLRNRVVAEGLDPDT-PVSEV--MTAPPITIPPDATV 76 (111) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEECHHHHHHHHCCCCCCC-CHHHH--CCCCCEEEECCCCH T ss_conf 99989789197999999999729998999989-9999999955788998716798566-58885--12681799889919 Q ss_pred HHHHHHH Q ss_conf 9999982 Q gi|254780426|r 249 EELAKII 255 (320) Q Consensus 249 ~~l~~~l 255 (320) .+..+.+ T Consensus 77 ~~a~~~m 83 (111) T cd04800 77 FEALLLM 83 (111) T ss_pred HHHHHHH T ss_conf 9999999 No 131 >pfam00571 CBS CBS domain. CBS domains are small intracellular modules that pair together to form a stable globular domain. This family represents a single CBS domain. Pairs of these domains have been termed a Bateman domain. CBS domains have been shown to bind ligands with an adenosyl group such as AMP, ATP and S-AdoMet. CBS domains are found attached to a wide range of other protein domains suggesting that CBS domains may play a regulatory role making proteins sensitive to adenosyl carrying ligands. The region containing the CBS domains in Cystathionine-beta synthase is involved in regulation by S-AdoMet. CBS domain pairs from AMPK bind AMP or ATP. The CBS domains from IMPDH and the chloride channel CLC2 bind ATP. Probab=97.93 E-value=1.3e-05 Score=56.63 Aligned_cols=49 Identities=27% Similarity=0.489 Sum_probs=43.5 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHH Q ss_conf 6579717885123234320135651165407766204667588999987 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLM 218 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~iv 218 (320) ++.++++++++.+++..|++.+.+.++|+|+.|...|+||..|++..+. T Consensus 8 ~~~~v~~~~~~~~~~~~~~~~~~~~~~Vvd~~~~~~Gvit~~dl~~~~~ 56 (57) T pfam00571 8 DVVTVPPDTSLEEALELMRENGISRLPVVDEDGKLVGIVTLRDLLRALL 56 (57) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHHC T ss_conf 9989979090999999998539957999927994999997899999866 No 132 >cd04638 CBS_pair_25 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.92 E-value=1.1e-05 Score=57.05 Aligned_cols=46 Identities=20% Similarity=0.398 Sum_probs=29.6 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 6579717885123234320135651165407766204667588999 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) .+.++.+++++.++++.|++++.+-.+|+|+.|.++|+||..|++. T Consensus 2 dvvtv~~~~tv~ea~~lm~~~~i~~lpVvd~~~~lvGivT~~Dl~~ 47 (106) T cd04638 2 NVVYVTLPGTRDDVLELLKEYKVSGVPVVKKSGELVGIITRKDLLR 47 (106) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCEEEEEEEHHHHHH T ss_conf 9899999496999999999769998999938999999989789644 No 133 >cd04597 CBS_pair_DRTGG_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a DRTGG domain upstream. The function of the DRTGG domain, named after its conserved residues, is unknown. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.92 E-value=1.7e-05 Score=55.79 Aligned_cols=58 Identities=17% Similarity=0.344 Sum_probs=42.1 Q ss_pred CCCCCCHHHHC-CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 33457678851-56579717885123234320135651165407766204667588999 Q gi|254780426|r 158 LNIQLSESNLI-KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 158 ~~~~~~l~~~~-r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) ......+++++ +++..+++++++.++++.|++++.+-..|||+.|.+.||||+.||-| T Consensus 54 ~~v~~qV~DIm~~~~vti~~~~sl~eA~~lM~e~~i~~LPVVD~~g~l~GiIT~~DIAe 112 (113) T cd04597 54 ADVHPRVRDVINRKPVTARPNDPLREALNLMHEHNIRTLPVVDDDGTPAGIITLLDLAE 112 (113) T ss_pred CCCCCHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHC T ss_conf 15650566631378834199982999999999849987869889993999987787435 No 134 >cd04801 CBS_pair_M50_like This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the metalloprotease peptidase M50. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.90 E-value=1.4e-05 Score=56.50 Aligned_cols=84 Identities=17% Similarity=0.298 Sum_probs=45.6 Q ss_pred CCCEEECCCCCHHHHHHHH-HCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCC Q ss_conf 5657971788512323432-013565116540776620466758899998742221000000001246773378304535 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNI-QESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTD 247 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m-~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~ 247 (320) |.+..||+++++.++++.| .+.+.+..+|+|+.|.+.|+||..|+......+-. +..-..+.........+.-..+ T Consensus 1 Rdv~tv~~~~tl~e~~~~~~~~~~~~~~pVvd~~g~l~Givt~~dl~~~~~~~~~---~~~v~~v~~~~~~~~~v~~~~~ 77 (114) T cd04801 1 RDFPTVPAHLTLREFVREYVLGSNQRRFVVVDNEGRYVGIISLADLRAIPTSQWA---QTTVIQVMTPAAKLVTVLSEES 77 (114) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCCEEEEEEEHHHHHHHHHHCCC---CCCHHHHCCCCCCCEEECCCCC T ss_conf 9955849999599999999850796689998789979999999998754540256---8847782102778758999997 Q ss_pred HHHHHHHH Q ss_conf 99999982 Q gi|254780426|r 248 LEELAKII 255 (320) Q Consensus 248 l~~l~~~l 255 (320) +.+.-+.| T Consensus 78 l~~al~~m 85 (114) T cd04801 78 LAEVLKLL 85 (114) T ss_pred HHHHHHHH T ss_conf 99999999 No 135 >cd04588 CBS_pair_CAP-ED_DUF294_assoc_arch This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the archaeal CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. Probab=97.90 E-value=5.3e-05 Score=52.65 Aligned_cols=81 Identities=17% Similarity=0.335 Sum_probs=53.4 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCCH Q ss_conf 56579717885123234320135651165407766204667588999987422210000000012467733783045359 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTDL 248 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~l 248 (320) |++..+++++++.++++.|++++.+.++|+|+ |.+.|++|..|++..+...... ..-..+ ....-+.++...++ T Consensus 1 r~vvtv~p~~sl~ea~~~m~~~~i~~~pVvd~-~~lvGiit~~Di~~~~~~~~~~---~~V~~~--M~~~~~tv~~d~~l 74 (110) T cd04588 1 KPLITLNPNATLREAARLFNTHHIHGAPVVDD-GKLVGIVTLSDIAHAIARGLEL---AKVKDV--MTKDVITIDEDEQL 74 (110) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC-CEEEEEEEHHHHHHHHHCCCCC---CHHHHH--CCCCCCEECCCCCH T ss_conf 99479689498999999999709988999989-9999999668989998737977---803452--04897298589969 Q ss_pred HHHHHHH Q ss_conf 9999982 Q gi|254780426|r 249 EELAKII 255 (320) Q Consensus 249 ~~l~~~l 255 (320) .+..+.| T Consensus 75 ~~~~~~m 81 (110) T cd04588 75 YDAIRLM 81 (110) T ss_pred HHHHHHH T ss_conf 9999999 No 136 >cd04802 CBS_pair_3 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=97.88 E-value=1.2e-05 Score=56.97 Aligned_cols=57 Identities=21% Similarity=0.453 Sum_probs=40.1 Q ss_pred CCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHH Q ss_conf 0477576413120012999863010367789798659817998418866422410299999 Q gi|254780426|r 83 RFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVI 143 (320) Q Consensus 83 ~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll 143 (320) +..+.+|+++|++ ++.+++.++++.++++.+.++++.++||.+++ +++|+++..|++ T Consensus 54 ~~~~~~v~~iM~~--~~~tv~~~~~l~~a~~~m~~~~~~~lpVvd~~--~lvGiIt~~Dil 110 (112) T cd04802 54 KPREVPVGEVMST--PLITIDPNASLNEAAKLMAKHGIKRLPVVDDD--ELVGIVTTTDIV 110 (112) T ss_pred CCCCCCHHHHHCC--CCEEECCCCHHHHHHHHHHHHCCCEEEEEECC--EEEEEEECHHHC T ss_conf 8123899997234--85599479869999998787198589999899--999999806710 No 137 >cd04618 CBS_pair_5 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=97.88 E-value=7.5e-05 Score=51.66 Aligned_cols=94 Identities=17% Similarity=0.218 Sum_probs=83.5 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCCCCEEECCC Q ss_conf 29998630103677897986598179984188664224102999998766223564322333457678851565797178 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLIKNILFVPSS 177 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~Vpe~ 177 (320) .++.+|.+.++..+...+.++|+.--|+++....+.+|++++-|.+..+. .+.+.+. T Consensus 2 kliV~D~~l~vk~Af~al~~ngi~~aplWds~~~~fvGmLt~~DfI~il~-----------------------lv~i~P~ 58 (98) T cd04618 2 KLVVFDTKLPVKKAFNALVENGIRSAPLWDSRKQQFVGMLTITDFILILR-----------------------LVSIHPE 58 (98) T ss_pred EEEEEECCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEEHHHHHHHHH-----------------------HEECCCC T ss_conf 09999489758999999998692699987288767888864799998876-----------------------2444876 Q ss_pred CCHHHHHHHHHCCCCCCEEEEECC-CCCEEEEEHHHHH Q ss_conf 851232343201356511654077-6620466758899 Q gi|254780426|r 178 MLVSDLLTNIQESRIRMALVIDEH-GGTDGLVSYEDIV 214 (320) Q Consensus 178 ~~l~~lL~~m~~~~~~~aiVvDE~-G~~~GiVTleDIl 214 (320) .++.++.+.|.+++.|..+|+|.. |...-++|...++ T Consensus 59 ~sL~da~~~l~~~~ihrlPvid~~~~~~~~ilt~~~il 96 (98) T cd04618 59 RSLFDAALLLLKNKIHRLPVIDPSTGTGLYILTSRRIL 96 (98) T ss_pred CHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEEHEE T ss_conf 12999999998678750521647999658999630201 No 138 >smart00116 CBS Domain in cystathionine beta-synthase and other proteins. Domain present in all 3 forms of cellular life. Present in two copies in inosine monophosphate dehydrogenase, of which one is disordered in the crystal structure [3]. A number of disease states are associated with CBS-containing proteins including homocystinuria, Becker's and Thomsen disease. Probab=97.86 E-value=2.4e-05 Score=54.86 Aligned_cols=47 Identities=30% Similarity=0.466 Sum_probs=35.7 Q ss_pred CEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHH Q ss_conf 57971788512323432013565116540776620466758899998 Q gi|254780426|r 171 ILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVL 217 (320) Q Consensus 171 ~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~i 217 (320) +..+++++++.++++.|++++.+-.+|||+.|.++||||..||+..+ T Consensus 2 ~itv~~~~~v~~a~~~m~~~~i~~lPVVd~~~~lvGiit~~Dil~~l 48 (49) T smart00116 2 VVTVSPDTTLEEALELLREHGIRRLPVVDEEGRLVGIVTRRDIIKAL 48 (49) T ss_pred CEEECCCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEEHHHHHHHH T ss_conf 86999879299999999980998576998999199998879999864 No 139 >cd04585 CBS_pair_ACT_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the acetoin utilization proteins in bacteria. Acetoin is a product of fermentative metabolism in many prokaryotic and eukaryotic microorganisms. They produce acetoin as an external carbon storage compound and then later reuse it as a carbon and energy source during their stationary phase and sporulation. In addition these CBS domains are associated with a downstream ACT domain, which is linked to a wide range of metabolic enzymes that are regulated by amino acid concentration. Pairs of ACT domains bind specifically to a particular amino acid leading to regulation of the linked enzyme. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The i Probab=97.86 E-value=1.7e-05 Score=55.95 Aligned_cols=50 Identities=26% Similarity=0.305 Sum_probs=34.7 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHH Q ss_conf 565797178851232343201356511654077662046675889999874 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMR 219 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivG 219 (320) |.|+++++++++.++++.|++++.+..+|+| .|.+.|+||..|++..+.. T Consensus 1 rdpvti~~~~tl~ea~~~m~~~~~~~lpVvd-~~~lvGiit~~Di~~~~~~ 50 (122) T cd04585 1 KNPITVTPDTSLMEALKLMKENSIRRLPVVD-RGKLVGIVTDRDLKLASPS 50 (122) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEE-CCEEEEEEEHHHHHHHHHC T ss_conf 9989968929699999999973998899998-9999999988999987750 No 140 >cd04620 CBS_pair_7 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=97.85 E-value=7.4e-05 Score=51.68 Aligned_cols=50 Identities=20% Similarity=0.271 Sum_probs=42.5 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHH Q ss_conf 56579717885123234320135651165407766204667588999987 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLM 218 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~iv 218 (320) |.|+.+++++++.+++..|.+++..-++|||+.|.+.||+|-.|++..+. T Consensus 1 r~pitv~pd~tv~ea~~~M~~~~~~~~~vV~~~~~lvGIvT~~Di~r~~~ 50 (115) T cd04620 1 RHPLTVTPDTPVADAIALMSQQGDSSCVLVVEKGRLLGIFTERDIVRLTA 50 (115) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHHH T ss_conf 96999699199999999999839946999937992999998699999998 No 141 >cd04635 CBS_pair_22 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.85 E-value=7.1e-05 Score=51.82 Aligned_cols=49 Identities=18% Similarity=0.195 Sum_probs=42.9 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHH Q ss_conf 5657971788512323432013565116540776620466758899998 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVL 217 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~i 217 (320) |.++++.+++++.++++.|.+++..-.+|+|+.|.++|+||..|++... T Consensus 1 r~vitv~~d~~l~~a~~~m~~~~i~~lPVVd~~g~lvGiit~~Dll~~~ 49 (122) T cd04635 1 REPVTCTPDDPVSKVWDLMLESGFTGLPVVQKAGELIGIITRRDIIRAG 49 (122) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCCEEEEEEHHHHHHHH T ss_conf 9999968929799999999971994899991898299999989987655 No 142 >cd04612 CBS_pair_SpoIVFB_EriC_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with either the SpoIVFB domain (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) or the chloride channel protein EriC. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A ), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. EriC is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase an Probab=97.84 E-value=2.2e-05 Score=55.14 Aligned_cols=81 Identities=20% Similarity=0.281 Sum_probs=48.1 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCCHH Q ss_conf 65797178851232343201356511654077662046675889999874222100000000124677337830453599 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTDLE 249 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~l~ 249 (320) .++++++++++.++++.|++++..-.+|+|+ |.+.|+||..|++..+..+-... .....-....-..++...++. T Consensus 2 dvvtv~p~~tl~~a~~~m~~~~~~~lpVvd~-~~l~Givt~~di~~~~~~~~~~~----~~v~~im~~~~~~v~~~~~l~ 76 (111) T cd04612 2 DVVTVPVDLTVDEVLALMFGERHRGYPVVDD-GRLVGIVTLADIRRVPAEGREAT----VLVGDVMTRDPVTASPDETLR 76 (111) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHHHHHCCCCCC----CEEEEEECCCCCCCCCCCCHH T ss_conf 8499999295999999999719948999999-99999998999998775147777----569999707960529999799 Q ss_pred HHHHHH Q ss_conf 999982 Q gi|254780426|r 250 ELAKII 255 (320) Q Consensus 250 ~l~~~l 255 (320) +..+.| T Consensus 77 ~a~~~m 82 (111) T cd04612 77 DALKRM 82 (111) T ss_pred HHHHHH T ss_conf 999999 No 143 >cd04611 CBS_pair_PAS_GGDEF_DUF1_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with a PAS domain, a GGDEF (DiGuanylate-Cyclase (DGC) domain, and a DUF1 domain downstream. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CB Probab=97.84 E-value=2.6e-05 Score=54.69 Aligned_cols=47 Identities=30% Similarity=0.533 Sum_probs=23.0 Q ss_pred CEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHH Q ss_conf 579717885123234320135651165407766204667588999987 Q gi|254780426|r 171 ILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLM 218 (320) Q Consensus 171 ~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~iv 218 (320) ++.+++++++.++++.|++++.+..+|+|+ |+..|++|..|++..+. T Consensus 3 vvTv~p~~tv~ea~~~m~~~~i~~~~V~~~-~~~vGiiT~~Di~~~~~ 49 (111) T cd04611 3 ILTCPPDTSLAEAASRMRERRISSIVVVDD-GRPLGIVTERDILRLLA 49 (111) T ss_pred CEEECCCCCHHHHHHHHHHHCCCEEEEEEC-CEEEEEEEEHHHHHHHH T ss_conf 199889697999999999719988999979-99999998559899874 No 144 >cd04631 CBS_pair_18 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.84 E-value=9.7e-05 Score=50.91 Aligned_cols=52 Identities=23% Similarity=0.359 Sum_probs=44.0 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEEC-CCCCEEEEEHHHHHHHHHHH Q ss_conf 56579717885123234320135651165407-76620466758899998742 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDE-HGGTDGLVSYEDIVSVLMRD 220 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE-~G~~~GiVTleDIle~ivGe 220 (320) |+++++++++++.++++.|++++.+-++|+|+ .|.+.|++|..|++..+.+. T Consensus 1 r~vvtv~~~~tl~ea~~~m~~~~v~~~PVvd~~~~~lvGiit~~Di~~~~~~~ 53 (125) T cd04631 1 KDVVTVPPTTPIMEAAKIMVRNGFRRLPVVDEGTGKLVGIITATDILKYLGGG 53 (125) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEHHHHHHHHHCC T ss_conf 99779799094999999999839988999979999599999999999987335 No 145 >cd04583 CBS_pair_ABC_OpuCA_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the ABC transporter OpuCA. OpuCA is the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment but the function of the CBS domains in OpuCA remains unknown. In the related ABC transporter, OpuA, the tandem CBS domains have been shown to function as sensors for ionic strength, whereby they control the transport activity through an electronic switching mechanism. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. They are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyz Probab=97.83 E-value=6e-05 Score=52.27 Aligned_cols=50 Identities=20% Similarity=0.363 Sum_probs=44.4 Q ss_pred CCCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHH Q ss_conf 15657971788512323432013565116540776620466758899998 Q gi|254780426|r 168 IKNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVL 217 (320) Q Consensus 168 ~r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~i 217 (320) +|.|+.+.+++++.++++.|++++.+.++|+|+.|.+.|+||..||.... T Consensus 1 ~r~pitv~~~~tv~eal~~m~~~~~~~l~Vvd~~~~lvGivt~~di~~~~ 50 (109) T cd04583 1 IKNPVTITPDRTLAEAIKLMRDKKVDSLLVVDKDNKLLGIVSLESLEQAY 50 (109) T ss_pred CCCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCEEEEEEEHHHHHHCC T ss_conf 99899979939799999999973997899995899899999810434300 No 146 >cd04610 CBS_pair_ParBc_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a ParBc (ParB-like nuclease) domain downstream. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.82 E-value=1.3e-05 Score=56.57 Aligned_cols=45 Identities=27% Similarity=0.417 Sum_probs=28.3 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHH Q ss_conf 5657971788512323432013565116540776620466758899 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIV 214 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIl 214 (320) |+++.+++++++.++++.|++++.+-.+|+| .|.++|+||..|++ T Consensus 2 kdvvtv~~d~tv~ea~~~m~~~~~~~~pVvd-~~~lvGivt~~Dll 46 (107) T cd04610 2 RDVITVSPDNTVKDVIKLIKETGHDGFPVVD-NGKVVGIVSARDLL 46 (107) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEE-CCEEEEEEEHHHHC T ss_conf 8809989929599999999975997899998-99999988764531 No 147 >cd04634 CBS_pair_21 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.82 E-value=2.2e-05 Score=55.19 Aligned_cols=49 Identities=33% Similarity=0.407 Sum_probs=35.9 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHH Q ss_conf 56579717885123234320135651165407766204667588999987 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLM 218 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~iv 218 (320) |.|+++.+++++.++++.|++++.+-++|+|+ |.++|+||..|++..+. T Consensus 1 kdvvTv~~d~~l~ea~~~m~~~~i~~lPVvd~-~~lvGivT~~Dil~~~~ 49 (143) T cd04634 1 KNPITCNADDTISDAARLLRENKISGAPVLDG-GKLVGIVSESDILKLLV 49 (143) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC-CEEEEEEEHHHHHHHHH T ss_conf 99889589398999999999749977999979-99999998699998875 No 148 >cd04605 CBS_pair_MET2_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the MET2 domain. Met2 is a key enzyme in the biosynthesis of methionine. It encodes a homoserine transacetylase involved in converting homoserine to O-acetyl homoserine. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.79 E-value=3.6e-05 Score=53.70 Aligned_cols=50 Identities=20% Similarity=0.298 Sum_probs=46.3 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHH Q ss_conf 56579717885123234320135651165407766204667588999987 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLM 218 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~iv 218 (320) |+++.+.+++++.++++.|++++.+.++|+|+.|.+.|++|..|++..+. T Consensus 2 r~vvtv~~~~tv~~a~~~m~~~~~~~lpVvd~~~~~vGiit~~Di~~~~~ 51 (110) T cd04605 2 RPVVTISEDASIKEAAKLMIEENINHLPVVDEDGRLVGIVTSWDISKAVA 51 (110) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEECHHHHHHHH T ss_conf 89989899399999999999729988999979991999996227888987 No 149 >cd04592 CBS_pair_EriC_assoc_euk This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually Probab=97.79 E-value=0.00013 Score=50.15 Aligned_cols=48 Identities=19% Similarity=0.333 Sum_probs=44.1 Q ss_pred CEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHH Q ss_conf 579717885123234320135651165407766204667588999987 Q gi|254780426|r 171 ILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLM 218 (320) Q Consensus 171 ~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~iv 218 (320) -+.|++++++.+++..|..++++.++|||+.|.++||||..||...+. T Consensus 3 ~vtV~p~ttv~EA~~lM~~~~~~~~~VVD~~~~L~GIvt~gDi~R~~~ 50 (133) T cd04592 3 YIKVSPTTTLKEALNLMLDEKQSCVLVVDSDDFLEGILTLGDIQRFLF 50 (133) T ss_pred EEEECCCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEEHHHHHHHHH T ss_conf 599899985999999999818865799838997899978799999987 No 150 >cd04596 CBS_pair_DRTGG_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a DRTGG domain upstream. The function of the DRTGG domain, named after its conserved residues, is unknown. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.77 E-value=2.1e-05 Score=55.23 Aligned_cols=47 Identities=19% Similarity=0.246 Sum_probs=40.5 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 56579717885123234320135651165407766204667588999 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) ..+.|+.++.++.++++.|++++.+-.+|+|+.|.++|+||..|++. T Consensus 2 ~dv~~l~p~~tv~~a~~lm~~~~~~~~PVvd~~~~lvGivt~~Di~~ 48 (108) T cd04596 2 EDTGYLTTTDTVKDWHELNKETGHSRFPVVDEKNKVVGIVTSKDVAG 48 (108) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCEEEEEEEHHHHHH T ss_conf 98889899997999999999859988999968990999999799751 No 151 >cd04639 CBS_pair_26 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.77 E-value=0.00014 Score=49.78 Aligned_cols=50 Identities=16% Similarity=0.244 Sum_probs=37.3 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHH Q ss_conf 65797178851232343201356511654077662046675889999874 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMR 219 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivG 219 (320) ....+++++++.++++.|.+.+.+..+|+|+.|.+.|+||..|++..+.+ T Consensus 2 ~~~Tv~p~~tl~~a~~~~~~~~~~~~PVvd~~g~lvGivt~~di~~~l~~ 51 (111) T cd04639 2 HFETLSPADTLDDAADALLATTQHEFPVVDGDGHLVGLLTRDDLIRALAE 51 (111) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCCEEEEEEHHHHHHHHHC T ss_conf 88792998979999999997199789999389988999989999999872 No 152 >cd04607 CBS_pair_NTP_transferase_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain associated with the NTP (Nucleotidyl transferase) domain downstream. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.76 E-value=0.00011 Score=50.45 Aligned_cols=82 Identities=24% Similarity=0.393 Sum_probs=52.9 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCCHH Q ss_conf 65797178851232343201356511654077662046675889999874222100000000124677337830453599 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTDLE 249 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~l~ 249 (320) .|++|++++++.+++..|.+++...++|+|+.|.+.|++|-.|+...+.....- +..-..+- ...-..+....++. T Consensus 3 kpi~v~~~~tl~ea~~~m~~~~~~~~~Vvd~~~~l~GiiT~~Di~r~l~~~~~~--~~~v~~im--~~~~~~i~~~~~~~ 78 (113) T cd04607 3 KQLLVSPDASILDALRKIDKNALRIVLVVDENGRLLGTVTDGDIRRALLKGLSL--DDPVSEVM--NRNPITAKVGSSRE 78 (113) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEECHHHHHHHHCCCCC--CCCHHHHH--CCEEEEEECCCCHH T ss_conf 688969949899999999875997899997999599999872776687628984--56156752--12129998799899 Q ss_pred HHHHHH Q ss_conf 999982 Q gi|254780426|r 250 ELAKII 255 (320) Q Consensus 250 ~l~~~l 255 (320) +..+.+ T Consensus 79 ~a~~~m 84 (113) T cd04607 79 EILALM 84 (113) T ss_pred HHHHHH T ss_conf 999999 No 153 >cd04601 CBS_pair_IMPDH This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the inosine 5' monophosphate dehydrogenase (IMPDH) protein. IMPDH is an essential enzyme that catalyzes the first step unique to GTP synthesis, playing a key role in the regulation of cell proliferation and differentiation. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in IMPDH have been associated with retinitis pigmentosa. Probab=97.74 E-value=3.9e-05 Score=53.51 Aligned_cols=47 Identities=26% Similarity=0.361 Sum_probs=37.9 Q ss_pred CCCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHH Q ss_conf 15657971788512323432013565116540776620466758899 Q gi|254780426|r 168 IKNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIV 214 (320) Q Consensus 168 ~r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIl 214 (320) +|.|+.+++++++.++++.|.+++.+..+|+|+.|.+.|++|..|+. T Consensus 1 irdvvtv~~~~sl~~a~~~m~~~~~~~~pVvd~~~~l~Giit~~Dl~ 47 (110) T cd04601 1 IRDPITVSPDATVAEALELMAEYGISGLPVVDDDGKLVGIVTNRDLR 47 (110) T ss_pred CCCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCEEEEEEEHHHHH T ss_conf 98898979939699999999984998899994899698788776776 No 154 >cd04609 CBS_pair_PALP_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the pyridoxal-phosphate (PALP) dependent enzyme domain upstream. The vitamin B6 complex comprises pyridoxine, pyridoxal, and pyridoxamine, as well as the 5'-phosphate esters of pyridoxal (PALP) and pyridoxamine, the last two being the biologically active coenzyme derivatives. The members of the PALP family are principally involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and other amine-containing compounds. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a pote Probab=97.72 E-value=0.00015 Score=49.72 Aligned_cols=82 Identities=18% Similarity=0.312 Sum_probs=59.7 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCC-CCCCEEEEECCCCH Q ss_conf 65797178851232343201356511654077662046675889999874222100000000124-67733783045359 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISA-VSDNTFIVDARTDL 248 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~-~~~g~~~v~G~~~l 248 (320) .++.+.+++++.++++.|++++.+..+|+|+ |.++|+||..|++..+.+.-... . ..+.. ....-..++-..++ T Consensus 2 dvvtv~p~~tl~~a~~~m~~~~i~~lpVvd~-~~lvGiit~~Di~~~~~~~~~~~---~-~~v~~iM~~~~~~v~~~~~l 76 (110) T cd04609 2 DVVSVAPDDTVSQAIERMREYGVSQLPVVDD-GRVVGSIDESDLLDALIEGKAKF---S-LPVREVMGEPLPTVDPDAPI 76 (110) T ss_pred CEEEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHHHHHCCCCCC---C-CEEEEEEECCCEEECCCCCH T ss_conf 8069999497999999999819988999989-99999999999999986387646---8-76984567594597898949 Q ss_pred HHHHHHHC Q ss_conf 99999828 Q gi|254780426|r 249 EELAKIIG 256 (320) Q Consensus 249 ~~l~~~l~ 256 (320) +++.+.|. T Consensus 77 ~~~~~~~~ 84 (110) T cd04609 77 EELSELLD 84 (110) T ss_pred HHHHHHHH T ss_conf 99999887 No 155 >KOG2550 consensus Probab=97.72 E-value=6.6e-05 Score=52.00 Aligned_cols=125 Identities=17% Similarity=0.208 Sum_probs=100.3 Q ss_pred HHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCC--CCCCCCCEEHHHHHHHHHHC Q ss_conf 8999999987304775764131200129998630103677897986598179984188--66422410299999876622 Q gi|254780426|r 72 ETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNS--LDNPRGMVHMRDVISYISHM 149 (320) Q Consensus 72 ~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~--~d~iiGiv~~kDll~~~~~~ 149 (320) +.+..++.++-.|..--+. +-+.+.++.++.++++....++|+-+||.++. ..+.+|+|+.+|+-.. . T Consensus 98 e~QA~~v~~vK~~~~g~~~-------dp~v~sp~~tvg~v~~~k~~~gF~g~pvTe~g~~~~KLvG~vtsrdi~f~-~-- 167 (503) T KOG2550 98 EDQADMVRRVKNYENGFIN-------NPIVISPTTTVGEVKEAKEKHGFSGIPVTEDGKRGSKLVGIITSRDIQFL-E-- 167 (503) T ss_pred HHHHHHHHHHHHHHCCCCC-------CCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHH-H-- T ss_conf 8877888888875225456-------86203775431055652065663354211577546515777742023455-3-- Q ss_pred CCCCCCCCCCCCCCHHHHCCC-CEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH Q ss_conf 356432233345767885156-579717885123234320135651165407766204667588999 Q gi|254780426|r 150 YAKTNNINLNIQLSESNLIKN-ILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS 215 (320) Q Consensus 150 ~~~~~~~~~~~~~~l~~~~r~-~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle 215 (320) .....+.++|.+ ++..|....+.++=+.+++.+..-.+|||+.|+++-+++..|+.. T Consensus 168 ---------~~~~~~~~vmt~~~~~~~~gi~l~~~neiL~~~kkGkl~iv~~~gelva~~~rtDl~k 225 (503) T KOG2550 168 ---------DNSLLVSDVMTKNPVTGAQGITLKEANEILKKIKKGKLPVVDDKGELVAMLSRTDLMK 225 (503) T ss_pred ---------CCCCHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEECCCCCEEEEEEHHHHHH T ss_conf ---------4551124330344301444466778899987631486523436776233433345665 No 156 >PRK05326 potassium/proton antiporter; Reviewed Probab=97.71 E-value=0.00082 Score=44.84 Aligned_cols=100 Identities=22% Similarity=0.311 Sum_probs=74.5 Q ss_pred CCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCCHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHC Q ss_conf 76620466758899998742221000000001246773378304535999999828888877774674317999999842 Q gi|254780426|r 201 HGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTDLEELAKIIGTDCNCLKGEQDVDSLGGLIFSVLD 280 (320) Q Consensus 201 ~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~l~~l~~~l~~~l~~~~~~~~~~Ti~G~i~~~l~ 280 (320) -|...=+++.++-.+++-.- +... +.... .+|.|.++|..++.|+...+|. ++.-.|++.|+..+++ T Consensus 462 ~gD~l~v~~~~~~~~~l~~~----f~~~-~~~~~-~~g~f~~~~~~~~~~l~~~yg~-------~~~~~tl~~~~~~~~~ 528 (563) T PRK05326 462 AGDVLLVLGRERDLPALDRL----FSQA-PPVAL-DRGDFILEGEAKLADLALIYGL-------RDKQQTLAELVQQRLG 528 (563) T ss_pred CCCEEEEEECCCCHHHHHHH----HCCC-CCCCC-CCCCEEECCCCCHHHHHHHHCC-------CCCCCCHHHHHHHHHC T ss_conf 89999999567625688887----0797-76766-6871677687658999987288-------8667869999999838 Q ss_pred CCCCCCCEEEEECCEEEEEEEECCCEEEEEEEEE Q ss_conf 3473798899758959999995188789999996 Q gi|254780426|r 281 RIPARGEVVLEIPGFEIVILDADVRCVRRVRIRR 314 (320) Q Consensus 281 ~iP~~Ge~v~~~~g~~f~V~~~~~~rI~~v~v~~ 314 (320) +-|..||++ .++++.+.|.+++++||.++-++- T Consensus 529 ~~~~~gd~~-~~~~~~l~v~~~~~~~i~~vgl~~ 561 (563) T PRK05326 529 GAPVVGDQV-ELGGIIWTVAELDDGRVTKIGLRL 561 (563) T ss_pred CCCCCCCEE-EECCEEEEEEECCCCEEEEEEEEC T ss_conf 999988865-657759999962299898998646 No 157 >cd04599 CBS_pair_GGDEF_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the GGDEF (DiGuanylate-Cyclase (DGC)) domain. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.68 E-value=3.5e-05 Score=53.79 Aligned_cols=55 Identities=16% Similarity=0.285 Sum_probs=37.1 Q ss_pred CCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHH Q ss_conf 77576413120012999863010367789798659817998418866422410299999 Q gi|254780426|r 85 RKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVI 143 (320) Q Consensus 85 ~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll 143 (320) .+..+.++|++ ++.+++.++++.++.+.|.++++.|+||.+++ +++|+++..|+. T Consensus 49 ~~~~v~~iMt~--~~itv~~~~~l~~a~~~M~~~~i~~lpVvd~g--~lvGiiT~~Div 103 (105) T cd04599 49 PNRLVADAMTR--EVVTISPEASLLEAKRLMEEKKIERLPVLRER--KLVGIITKGTIA 103 (105) T ss_pred CCCCHHHHCCC--CCEEECCCCCHHHHHHHHHHCCCCEEEEEECC--EEEEEEECHHHC T ss_conf 88796885027--98999794919999997798599799999899--999999945861 No 158 >cd04597 CBS_pair_DRTGG_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a DRTGG domain upstream. The function of the DRTGG domain, named after its conserved residues, is unknown. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.67 E-value=3.3e-05 Score=54.02 Aligned_cols=76 Identities=16% Similarity=0.241 Sum_probs=60.1 Q ss_pred HHHCCCCCHHHHHHHHHHHCC--------CCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCC Q ss_conf 997689898999999987304--------775764131200129998630103677897986598179984188664224 Q gi|254780426|r 64 ASLEGIFSETEKEIFNNILRF--------RKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRG 135 (320) Q Consensus 64 ~~~~G~i~~~E~~ii~~vl~l--------~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiG 135 (320) ..+.|.++++-+-+++ -|.+ -.-+|+|+|+ .+++++..++|+.++++.|.+++..++||.+++. .++| T Consensus 28 ~~r~G~~n~et~~vL~-~f~~~~P~li~~v~~qV~DIm~--~~~vti~~~~sl~eA~~lM~e~~i~~LPVVD~~g-~l~G 103 (113) T cd04597 28 AARLGEPNPQTRYVLE-YLGIEPPILLADVHPRVRDVIN--RKPVTARPNDPLREALNLMHEHNIRTLPVVDDDG-TPAG 103 (113) T ss_pred EECCCCCCHHHHHHHH-HCCCCCCEECCCCCCHHHHHCC--CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCC-EEEE T ss_conf 7306899989999999-8499998002156505666313--7883419998299999999984998786988999-3999 Q ss_pred CEEHHHHH Q ss_conf 10299999 Q gi|254780426|r 136 MVHMRDVI 143 (320) Q Consensus 136 iv~~kDll 143 (320) +++.+||- T Consensus 104 iIT~~DIA 111 (113) T cd04597 104 IITLLDLA 111 (113) T ss_pred EEEHHHHH T ss_conf 98778743 No 159 >PRK10892 D-arabinose 5-phosphate isomerase; Provisional Probab=97.66 E-value=6.2e-05 Score=52.20 Aligned_cols=55 Identities=20% Similarity=0.337 Sum_probs=48.8 Q ss_pred HHHHCC---CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHH Q ss_conf 788515---6579717885123234320135651165407766204667588999987 Q gi|254780426|r 164 ESNLIK---NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLM 218 (320) Q Consensus 164 l~~~~r---~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~iv 218 (320) ++++|+ +.++|++++++.+++..|-+.+..+++|+|+.|.+.||+|=-|+=..+- T Consensus 204 V~dlM~~~~~lP~v~~~~~l~dai~~mt~k~lG~v~Vvd~~gkL~GIiTDGDLRR~l~ 261 (326) T PRK10892 204 VSDIMHTGDEIPHVKKTASLRDALLEITRKNLGMTVICDDNMMIEGIFTDGDLRRVFD 261 (326) T ss_pred HHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCEEEEEECHHHHHHHH T ss_conf 9998513765885377602999999874589349999858996899986268999987 No 160 >cd04606 CBS_pair_Mg_transporter This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain in the magnesium transporter, MgtE. MgtE and its homologs are found in eubacteria, archaebacteria, and eukaryota. Members of this family transport Mg2+ or other divalent cations into the cell via two highly conserved aspartates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.66 E-value=5.7e-05 Score=52.45 Aligned_cols=58 Identities=26% Similarity=0.336 Sum_probs=47.8 Q ss_pred CCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHH Q ss_conf 4775764131200129998630103677897986598179984188664224102999998 Q gi|254780426|r 84 FRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVIS 144 (320) Q Consensus 84 l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~ 144 (320) -.+.+++++|++ +..+++.++++.++.+.+.+++++++||.+++. +++|+++..|++. T Consensus 50 ~~~~~v~~iM~~--~~~~v~~~~~~~~a~~~m~~~~~~~lPVVd~~~-~lvGiIt~~Di~~ 107 (109) T cd04606 50 DPDTPVSDIMDT--DVISVSADDDQEEVARLFEKYDLLALPVVDEEG-RLVGIITVDDVID 107 (109) T ss_pred CCCCCHHHHHCC--CEEECCCCCCHHHHHHHHHHHCCCEEEEECCCC-EEEEEEEHHHHHH T ss_conf 888716663002--136337999899999999985276046898899-7999999689684 No 161 >cd04603 CBS_pair_KefB_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the KefB (Kef-type K+ transport systems) domain which is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.59 E-value=3.9e-05 Score=53.52 Aligned_cols=47 Identities=21% Similarity=0.319 Sum_probs=41.6 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 65797178851232343201356511654077662046675889999 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) .|+++...+++.+++..|.+.+..-++||||.+..+|++|..|+++. T Consensus 2 ~pvtv~~~~~L~~a~~~~~e~~~~~i~VVD~~~k~vG~it~~Dll~~ 48 (111) T cd04603 2 QTVSVNCENPLREAIKMINELGARAVVVVDEENKVLGQVTLSDLLEI 48 (111) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHH T ss_conf 44686599959999999987699768998899968899889998643 No 162 >pfam00571 CBS CBS domain. CBS domains are small intracellular modules that pair together to form a stable globular domain. This family represents a single CBS domain. Pairs of these domains have been termed a Bateman domain. CBS domains have been shown to bind ligands with an adenosyl group such as AMP, ATP and S-AdoMet. CBS domains are found attached to a wide range of other protein domains suggesting that CBS domains may play a regulatory role making proteins sensitive to adenosyl carrying ligands. The region containing the CBS domains in Cystathionine-beta synthase is involved in regulation by S-AdoMet. CBS domain pairs from AMPK bind AMP or ATP. The CBS domains from IMPDH and the chloride channel CLC2 bind ATP. Probab=97.57 E-value=0.0001 Score=50.73 Aligned_cols=55 Identities=25% Similarity=0.476 Sum_probs=49.8 Q ss_pred EHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHH Q ss_conf 6413120012999863010367789798659817998418866422410299999876 Q gi|254780426|r 89 IDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYI 146 (320) Q Consensus 89 v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~ 146 (320) +.++|++ +..+++.++++.++++.+.+++++++||++++ +.++|+++.+|++.++ T Consensus 1 v~~im~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~Vvd~~-~~~~Gvit~~dl~~~~ 55 (57) T pfam00571 1 VKDIMTP--DVVTVPPDTSLEEALELMRENGISRLPVVDED-GKLVGIVTLRDLLRAL 55 (57) T ss_pred CCCCCCC--CCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHHH T ss_conf 9422779--99899790909999999985399579999279-9499999789999986 No 163 >cd04582 CBS_pair_ABC_OpuCA_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the ABC transporter OpuCA. OpuCA is the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment but the function of the CBS domains in OpuCA remains unknown. In the related ABC transporter, OpuA, the tandem CBS domains have been shown to function as sensors for ionic strength, whereby they control the transport activity through an electronic switching mechanism. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. They are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzi Probab=97.55 E-value=0.00013 Score=50.00 Aligned_cols=48 Identities=23% Similarity=0.293 Sum_probs=42.2 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHH Q ss_conf 657971788512323432013565116540776620466758899998 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVL 217 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~i 217 (320) +|+.+.+++++.++++.|++++.+-++|||+.|...|+||..|+.... T Consensus 2 ~pvtv~pd~~l~eA~~lm~~~~~~~l~VVd~~g~l~Givt~~Dl~~~~ 49 (106) T cd04582 2 EPITVRPDDPLSDALGLMDDSDLRALTVVDADGQPLGFVTRREAARAS 49 (106) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHC T ss_conf 983899989399999999973998799997999789998999987630 No 164 >smart00116 CBS Domain in cystathionine beta-synthase and other proteins. Domain present in all 3 forms of cellular life. Present in two copies in inosine monophosphate dehydrogenase, of which one is disordered in the crystal structure [3]. A number of disease states are associated with CBS-containing proteins including homocystinuria, Becker's and Thomsen disease. Probab=97.54 E-value=0.00012 Score=50.24 Aligned_cols=48 Identities=25% Similarity=0.406 Sum_probs=43.6 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHH Q ss_conf 2999863010367789798659817998418866422410299999876 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYI 146 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~ 146 (320) +++++..++++.++.+.+.+++++++||.+++ ++++|+++.+|++.++ T Consensus 1 ~~itv~~~~~v~~a~~~m~~~~i~~lPVVd~~-~~lvGiit~~Dil~~l 48 (49) T smart00116 1 DVVTVSPDTTLEEALELLREHGIRRLPVVDEE-GRLVGIVTRRDIIKAL 48 (49) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CCEEEEEEHHHHHHHH T ss_conf 98699987929999999998099857699899-9199998879999864 No 165 >cd04642 CBS_pair_29 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.52 E-value=0.00052 Score=46.15 Aligned_cols=49 Identities=24% Similarity=0.331 Sum_probs=43.1 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHH Q ss_conf 6579717885123234320135651165407766204667588999987 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLM 218 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~iv 218 (320) +++.+.+++++.++++.|++++.+-.+|+|+.|.++|++|..|+..... T Consensus 2 ~Vitv~~~~~l~eA~~lm~~~~i~~lPVvd~~g~lvGiis~~Dl~~~~~ 50 (126) T cd04642 2 KVVSIDSDERVLDAFKLMRKNNISGLPVVDEKGKLIGNISASDLKGLLL 50 (126) T ss_pred CEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCEEEEEEEHHHHHHHHC T ss_conf 5799999599999999999809878999928990999999899765551 No 166 >PRK01862 putative voltage-gated ClC-type chloride channel ClcB; Provisional Probab=97.49 E-value=0.00062 Score=45.65 Aligned_cols=93 Identities=18% Similarity=0.211 Sum_probs=63.2 Q ss_pred CCCHHHHCC-CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCE Q ss_conf 576788515-6579717885123234320135651165407766204667588999987422210000000012467733 Q gi|254780426|r 161 QLSESNLIK-NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNT 239 (320) Q Consensus 161 ~~~l~~~~r-~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~ 239 (320) ...+.++|+ ....+++++++.++.+.|.+.+.+-..|+|+.|...|+|+++|+...+..+-.. ....-.++... .- T Consensus 455 ~~~v~dvm~~~~~~v~~~~~l~e~~~~~~~~~~~~~~Vvd~~g~l~Giv~l~dl~~~~~~~~~~-~~~~~~~~~~~--~~ 531 (583) T PRK01862 455 TTQMRELIQPAQTVVPLTASVADMTRVFLEYPVKYLYVTDDDGRFRGAVALKDITSDLLDKRDT-TDKTAADYAHT--PF 531 (583) T ss_pred HCCHHHHCCCCCCEECCCCCHHHHHHHHHHCCCEEEEEECCCCEEEEEEEHHHHHHHHHCCCCC-CCCCHHHHHCC--CC T ss_conf 0809996387882349999899999999847970799988999799997899988775163555-64609998179--99 Q ss_pred EEEECCCCHHHHHHHHC Q ss_conf 78304535999999828 Q gi|254780426|r 240 FIVDARTDLEELAKIIG 256 (320) Q Consensus 240 ~~v~G~~~l~~l~~~l~ 256 (320) -.+.-..++.+.-+.|. T Consensus 532 ~~l~~~~sL~~al~~f~ 548 (583) T PRK01862 532 PLLTPDMPLRDALEHFM 548 (583) T ss_pred CEECCCCCHHHHHHHHH T ss_conf 57889998999999999 No 167 >cd02205 CBS_pair The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic generali Probab=97.41 E-value=0.00075 Score=45.09 Aligned_cols=83 Identities=22% Similarity=0.294 Sum_probs=58.9 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCCHH Q ss_conf 65797178851232343201356511654077662046675889999874222100000000124677337830453599 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTDLE 249 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~l~ 249 (320) ++.++++++++.+++..|++++.+.++|+|+.|.+.|++|..|++..+.....+.... .-.-.....+.+....++. T Consensus 2 ~~~~v~~~~~l~~a~~~m~~~~~~~~~Vvd~~~~~~G~it~~dl~~~~~~~~~~~~~~---~~~~~~~~~~~v~~~~~~~ 78 (113) T cd02205 2 DVVTVSPDDTVAEALRLMLEHGISGLPVVDDDGRLVGIVTERDLLRALAEGGLDPLVT---VGDVMTRDVVTVSPDTSLE 78 (113) T ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHHHCCCCCCCC---EEEEEECCCEEEEECCHHH T ss_conf 9779689298999999999839809999939995999998899898876315741033---0456750546897321177 Q ss_pred HHHHHH Q ss_conf 999982 Q gi|254780426|r 250 ELAKII 255 (320) Q Consensus 250 ~l~~~l 255 (320) +....+ T Consensus 79 ~~~~~~ 84 (113) T cd02205 79 EAAELM 84 (113) T ss_pred HHHHHH T ss_conf 999999 No 168 >PRK11543 gutQ D-arabinose 5-phosphate isomerase; Provisional Probab=97.39 E-value=0.00017 Score=49.26 Aligned_cols=56 Identities=18% Similarity=0.262 Sum_probs=49.4 Q ss_pred HHHHCCC---CEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHH Q ss_conf 7885156---5797178851232343201356511654077662046675889999874 Q gi|254780426|r 164 ESNLIKN---ILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMR 219 (320) Q Consensus 164 l~~~~r~---~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivG 219 (320) +.++|++ .++|++++++.+++..|-+.+..+++|+|+.|.+.|++|--||=..+.. T Consensus 199 V~dlM~~~~~lP~v~~~~~l~dai~~ms~k~lG~v~Vvd~~~kL~GIITDGDLRR~l~k 257 (321) T PRK11543 199 VHHLMRRGDAIPQVALTASVMDAMLELSRTGLGLVAVCDAQQQVKGVFTDGDLRRWLVG 257 (321) T ss_pred HHHHHHCCCCCCCCCCCCCHHHHHHHHCCCCCCEEEEECCCCCEEEEEECHHHHHHHHC T ss_conf 99997335578989999879999999664886179983377506888744389999863 No 169 >COG2905 Predicted signal-transduction protein containing cAMP-binding and CBS domains [Signal transduction mechanisms] Probab=97.30 E-value=0.0005 Score=46.23 Aligned_cols=71 Identities=21% Similarity=0.479 Sum_probs=63.4 Q ss_pred HHHHHHHHHHC---CCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHH Q ss_conf 99999998730---4775764131200129998630103677897986598179984188664224102999998766 Q gi|254780426|r 73 TEKEIFNNILR---FRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYIS 147 (320) Q Consensus 73 ~E~~ii~~vl~---l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~ 147 (320) ++++|..+++. =...+|.++|| ..+.+++..+-+-|++-.|.+.+..|+||++++ +++|+++..|++.... T Consensus 197 T~~dl~~~v~~~g~~~~~~V~evmT--~p~~svd~~~~~feAml~m~r~~I~hl~V~e~g--q~~Gilt~~dIl~l~s 270 (610) T COG2905 197 TRKDLRSRVIADGRSKTQKVSEVMT--SPVISVDRGDFLFEAMLMMLRNRIKHLPVTEDG--QPLGILTLTDILRLFS 270 (610) T ss_pred EHHHHHHHHHHCCCCCCCCHHHHHC--CCCEEECCCCHHHHHHHHHHHHCCCEEEEECCC--EEEEEEEHHHHHHHHC T ss_conf 2277789998658974233545414--684454476569999999998077442361189--0467766999997617 No 170 >cd04614 CBS_pair_1 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=97.26 E-value=0.00024 Score=48.34 Aligned_cols=46 Identities=15% Similarity=0.315 Sum_probs=31.7 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHH Q ss_conf 5657971788512323432013565116540776620466758899 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIV 214 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIl 214 (320) |.++.|.+.+++.++++.|.+++....+|+|+.|.++||+|-.|++ T Consensus 1 rdv~tv~~~~~v~~a~~~m~~~~i~~lpVvd~~g~lvGIiT~rDli 46 (96) T cd04614 1 RNVPTVWEETPLPVAVRIMELANVKALPVLDDDGKLSGIITERDLI 46 (96) T ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEECHHHC T ss_conf 9963908969899999999972998799998999499999888882 No 171 >TIGR00400 mgtE magnesium transporter; InterPro: IPR006669 This is the MgtE family of magnesium transporters. All the archaebacterial and eukaryotic examples have two copies of the integral membrane region. This suggests that the eubacterial examples may act as dimers. Members of this family probably transport Mg^2+ or other divalent cations into the cell. The alignment contains two highly conserved aspartates that may be involved in cation binding.; GO: 0008324 cation transmembrane transporter activity, 0006812 cation transport. Probab=97.10 E-value=0.0043 Score=40.09 Aligned_cols=142 Identities=20% Similarity=0.247 Sum_probs=109.3 Q ss_pred CCCCHHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCC---CCEEEEE-CCCCCCCCCCEEHHHHH Q ss_conf 8989899999998730477576413120012999863010367789798659---8179984-18866422410299999 Q gi|254780426|r 68 GIFSETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYG---RSWMPVY-KNSLDNPRGMVHMRDVI 143 (320) Q Consensus 68 G~i~~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~---~sR~PV~-~~~~d~iiGiv~~kDll 143 (320) .....+++..++..+.+..-++..+|+ .+.+++..+-+..+++..++... -+-+..| .++..++.|++..+|++ T Consensus 116 ~~~~~~~~~~~~~~~~~~~~~~g~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 193 (460) T TIGR00400 116 ASLSEEERKAVNLLLGYPPDSAGRILT--TEYVWLKEDYTVGDALDYLRRVAETAEDIYTLYVTNESKRLTGVLSLRDLI 193 (460) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCEEE--EEEEECCCCCHHHHHHHHHHHHHHHHHHHEEEEEECCCHHHHHHHHHHHHH T ss_conf 642123567888873478532342000--000111210015788998888765542102457633410220001123333 Q ss_pred HHHHHCCCCCCCCCCCCCCCHHHHCCC-CEEECCCCCHHH-HHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHH Q ss_conf 876622356432233345767885156-579717885123-234320135651165407766204667588999987422 Q gi|254780426|r 144 SYISHMYAKTNNINLNIQLSESNLIKN-ILFVPSSMLVSD-LLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDI 221 (320) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~l~~~~r~-~~~Vpe~~~l~~-lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei 221 (320) .. .++..+.+++++ ..++.....-.+ +...+++...-...|+|+.|...|++|.+|+++.+..+- T Consensus 194 ~~-------------~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~d~~~~p~~~~~~~~~g~~~~dd~~~~~~~~~ 260 (460) T TIGR00400 194 LA-------------KPEEYLGDLLFPDGVSVDGLNDEEEDVARLIEKYDFLAVPVVDNEGRLVGIVTVDDILDVLEEEA 260 (460) T ss_pred HC-------------CCHHHHHHHHHHCCEEEEECCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHHHHH T ss_conf 03-------------50567766542100135204540467898876402102001316771687542346889887520 Q ss_pred HHH Q ss_conf 210 Q gi|254780426|r 222 TSE 224 (320) Q Consensus 222 ~de 224 (320) .++ T Consensus 261 ~~d 263 (460) T TIGR00400 261 TED 263 (460) T ss_pred HHH T ss_conf 023 No 172 >PRK05567 inositol-5'-monophosphate dehydrogenase; Reviewed Probab=97.07 E-value=0.00037 Score=47.13 Aligned_cols=61 Identities=13% Similarity=0.206 Sum_probs=54.2 Q ss_pred CCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHH Q ss_conf 775764131200129998630103677897986598179984188664224102999998766 Q gi|254780426|r 85 RKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYIS 147 (320) Q Consensus 85 ~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~ 147 (320) .+.+++++||| .++++...+.+++++.+++.+++..++||.+++. +++|+++.||+.+.-. T Consensus 145 ~~~~V~~vMT~-~~lvt~~~~is~~eA~~~l~~~kieklPvVd~~g-~L~Glit~kDi~k~~~ 205 (486) T PRK05567 145 LSQPVSEVMTK-ERLVTVPEGTTLEEALELLHEHRIEKLLVVDDNG-RLKGLITVKDIEKAEE 205 (486) T ss_pred CCCCHHHHHCC-CCEEEECCCCCHHHHHHHHHHHHHCCCCEECCCC-CEEEEEEHHHHHHHHH T ss_conf 77654675345-7328925889999999999973130342774689-6888877667776520 No 173 >PRK07807 inositol-5-monophosphate dehydrogenase; Validated Probab=97.01 E-value=0.0011 Score=43.97 Aligned_cols=58 Identities=16% Similarity=0.182 Sum_probs=52.3 Q ss_pred CCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHH Q ss_conf 7757641312001299986301036778979865981799841886642241029999987 Q gi|254780426|r 85 RKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISY 145 (320) Q Consensus 85 ~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~ 145 (320) ...+++++||| ++++.+...+++++.+++.+++...+||.+++. ++.|+++.||+.+. T Consensus 146 ~~~~v~~vMT~--~lvt~~~g~sl~eA~~lL~~~kiekLpvVd~~~-~L~gLiT~kDi~k~ 203 (479) T PRK07807 146 RFTRVGDVMST--DLVTLPAGTDPRKAFDLLEAAPVKVAPVVDADG-RLAGVLTRTGALRA 203 (479) T ss_pred CCCCHHHHHCC--CCEEECCCCCHHHHHHHHHHCCCCCCEEECCCC-EEEEEEEEEHHHHC T ss_conf 77758886257--736613667879999999753523113775699-29999996116454 No 174 >pfam01595 DUF21 Domain of unknown function DUF21. This transmembrane region has no known function. Many of the sequences in this family are annotated as hemolysins, however this is due to a similarity to hemolysin C from Brachyspira hyodysenteriae, which does not contain this domain. This domain is found in the N-terminus of the proteins adjacent to two intracellular CBS domains pfam00571. Probab=96.90 E-value=0.0018 Score=42.53 Aligned_cols=71 Identities=14% Similarity=0.230 Sum_probs=48.5 Q ss_pred CCCCCCCCCCCCCCCCCCCCH---HHHHHHHHHHHHH--HHHHHHHHCCCCC---CCCHHHHHHHHHHHHHCCCCCHHH Q ss_conf 679988765555554443203---5687530268998--9999876337888---889999999999799768989899 Q gi|254780426|r 4 DFKINSSEHPKEKNEGIDLSP---SISKTSVPAFWTR--TIRALCKTWKLMK---LSHHKGIRLPRIDASLEGIFSETE 74 (320) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~p---~l~~i~~P~vw~~--~~~~~~~~~~~~~---~~~~eel~~l~~~~~~~G~i~~~E 74 (320) |..|++.+......-+....| .+.++++|++|.. ..+.+.+.+|..+ ..++||++.++++|+++|.++++| T Consensus 104 Ei~PK~ia~~~~~~~a~~~a~~~~~~~~l~~Pl~~~~~~i~~~i~~~~g~~~~~~~~s~eEl~~lv~~~~~~G~i~~~E 182 (182) T pfam01595 104 EILPKTLARRNPERIALRVAPPLRVLMKLLYPLVWLLSKLSNLLLRLFGVKPEEPAVTEEELRALVDEGEEEGVIEEEE 182 (182) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCC T ss_conf 5879999998899999999999999999999999999999999998629998889999999999999999879989899 No 175 >COG3448 CBS-domain-containing membrane protein [Signal transduction mechanisms] Probab=96.87 E-value=0.001 Score=44.19 Aligned_cols=82 Identities=18% Similarity=0.308 Sum_probs=61.5 Q ss_pred ECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCC Q ss_conf 4188664224102999998766223564322333457678851-565797178851232343201356511654077662 Q gi|254780426|r 126 YKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGT 204 (320) Q Consensus 126 ~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~ 204 (320) .+=++|++--+++--.+-.+.... .+..-.++| |++..|++.+++.++..+|++++..-..|+||.-.+ T Consensus 219 lDIdrddLe~llr~~elqa~~R~~----------~~LtcadIMSrdVvtv~~~ts~dhA~~ll~~H~ikaLPV~d~~~rl 288 (382) T COG3448 219 LDIDRDDLERLLRETELQALRRRM----------GELTCADIMSRDVVTVSTDTSIDHARKLLQEHRIKALPVLDEHRRL 288 (382) T ss_pred ECCCHHHHHHHHHHHHHHHHHHHH----------CCCCHHHHCCCCCEECCCCCCHHHHHHHHHHCCCCCCCCCCCCCCE T ss_conf 157889999999999999999985----------6430888637453501776775799999997686504532666610 Q ss_pred EEEEEHHHHHHHH Q ss_conf 0466758899998 Q gi|254780426|r 205 DGLVSYEDIVSVL 217 (320) Q Consensus 205 ~GiVTleDIle~i 217 (320) +|+||+.|+++.. T Consensus 289 ~GiVt~~dl~~~a 301 (382) T COG3448 289 VGIVTQRDLLKHA 301 (382) T ss_pred EEEEEHHHHHHCC T ss_conf 0324488776016 No 176 >PTZ00314 inosine-5'-monophosphate dehydrogenase; Provisional Probab=96.87 E-value=0.00077 Score=45.02 Aligned_cols=60 Identities=12% Similarity=0.163 Sum_probs=53.1 Q ss_pred CCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHH Q ss_conf 477576413120012999863010367789798659817998418866422410299999876 Q gi|254780426|r 84 FRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYI 146 (320) Q Consensus 84 l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~ 146 (320) -.+.+|+++||| ++++.+.+.+++++.+++.+++..++||.+++. +++|+++.||+.+.- T Consensus 155 d~~~~v~~vMt~--~lvt~~~~isl~eA~~ll~~~kieklpvVd~~g-~L~glit~kDi~k~~ 214 (499) T PTZ00314 155 DKSTPVSEIMTT--DLVVGRYPITLEDANDVLNRSRKGVLPIVNDNG-ELVALVSRSDAVKNR 214 (499) T ss_pred CCCCCHHHHHCC--CCEEECCCCCHHHHHHHHHHHHHCCCEEECCCC-CEEEEEECCHHHHHH T ss_conf 677588886156--726725999999999999860331230665789-589986303487753 No 177 >TIGR03415 ABC_choXWV_ATP choline ABC transporter, ATP-binding protein. Members of this protein family are the ATP-binding subunit of a three-protein transporter. This family belongs, more broadly, to the family of proline and glycine-betaine transporters, but members have been identified by direct characterization and by bioinformatic means as choline transporters. Many species have several closely-related members of this family, probably with variable abilities to act additionally on related quaternary amines. Probab=96.85 E-value=0.0015 Score=43.18 Aligned_cols=52 Identities=17% Similarity=0.290 Sum_probs=43.2 Q ss_pred HHCCCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHH Q ss_conf 851565797178851232343201356511654077662046675889999874 Q gi|254780426|r 166 NLIKNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMR 219 (320) Q Consensus 166 ~~~r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivG 219 (320) .+...+..+++.+++.+++..+.+.... +|||+.|.+.|+||..++++++.| T Consensus 330 ~~~~~~~~~~~~~~l~~~l~~~~~~~~~--~~v~~~g~l~G~i~~~~~l~~l~g 381 (382) T TIGR03415 330 SLEAAPTVINPDTLMRDVLAARHRTGGA--ILLVENGRIVGVIGDDNIYHALLG 381 (382) T ss_pred HHCCCCCEECCCCCHHHHHHHHHHCCCC--EEEECCCEEEEEEEHHHHHHHHHC T ss_conf 3125773338989399999999858998--099059959999868999998627 No 178 >cd04590 CBS_pair_CorC_HlyC_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the CorC_HlyC domain. CorC_HlyC is a transporter associated domain. This small domain is found in Na+/H+ antiporters, in proteins involved in magnesium and cobalt efflux, and in association with some proteins of unknown function. The function of the CorC_HlyC domain is uncertain but it might be involved in modulating transport of ion substrates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, Probab=96.73 E-value=0.0043 Score=40.14 Aligned_cols=81 Identities=19% Similarity=0.238 Sum_probs=60.8 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECC-CCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCCH Q ss_conf 65797178851232343201356511654077-66204667588999987422210000000012467733783045359 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEH-GGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTDL 248 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~-G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~l 248 (320) .+.+++.++++.++++.|++.+....+|+|+. +.++|+|+..|++.....+ .....+.......+.++..+++ T Consensus 2 ~v~~i~~~~s~~e~~~~~~~~~~sr~PV~~~~~~~ivGil~~kDll~~~~~~------~~~~~~~~~~~~~~~v~~~~~l 75 (111) T cd04590 2 DIVALDADDTLEEILELIAESGHSRFPVYDGDLDNIIGVVHVKDLLRALAEG------EEDLDLRDLLRPPLFVPESTPL 75 (111) T ss_pred CEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEEHHHHHHHHCC------CCCCCHHHHCCCCEECCHHHHH T ss_conf 6899969999999999999779977999954788778999911445666527------8764077640442222556658 Q ss_pred HHHHHHHC Q ss_conf 99999828 Q gi|254780426|r 249 EELAKIIG 256 (320) Q Consensus 249 ~~l~~~l~ 256 (320) .++-+.|. T Consensus 76 ~~~l~~~~ 83 (111) T cd04590 76 DDLLEEMR 83 (111) T ss_pred HHHHHHHH T ss_conf 89998435 No 179 >COG4175 ProV ABC-type proline/glycine betaine transport system, ATPase component [Amino acid transport and metabolism] Probab=96.70 E-value=0.0015 Score=43.12 Aligned_cols=124 Identities=22% Similarity=0.321 Sum_probs=85.4 Q ss_pred HHHHHHHHHCCCCEEEHHEEECCCEEEEEEE-CCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCC Q ss_conf 9999998730477576413120012999863-010367789798659817998418866422410299999876622356 Q gi|254780426|r 74 EKEIFNNILRFRKIRIDDIMISRVSINAVED-KATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAK 152 (320) Q Consensus 74 E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~-~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~ 152 (320) -++.++++=...=.++++||.+ .+...... ......+++.+...+-+-+-++. ...+.+|++..-++... T Consensus 261 V~~Fv~~v~~~~VltA~~IM~~-~~~~~~~~~~~~~~~al~~~~~~~~~~~~~~~-~~~~~~g~v~~~~~~~~------- 331 (386) T COG4175 261 VRDFVRNVDRSRVLTAKDIMRR-PDLLIRKTPGDGPRVALKLLRDEGREYGYAVD-RGNKFVGVVSIDSLVKA------- 331 (386) T ss_pred HHHHHHCCCHHHEEEHHHHHCC-CCCCCCCCCCCCCCHHHHHHHHCCCHHHHHHH-CCCCEEEEEECCCHHCC------- T ss_conf 9999843775332238776314-54344455555631121233322411367774-25752558964620101------- Q ss_pred CCCCCCCCCCCHHHHCCCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHH Q ss_conf 43223334576788515657971788512323432013565116540776620466758899998742 Q gi|254780426|r 153 TNNINLNIQLSESNLIKNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRD 220 (320) Q Consensus 153 ~~~~~~~~~~~l~~~~r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGe 220 (320) ....++..++.++++.+.+...++...++| |+||.|..+|+++...++++|.+. T Consensus 332 -------------~~~~~~~~v~~d~~~~~~~~~~~~~~~p~a-Vvde~~r~vG~i~~~~vl~aL~~~ 385 (386) T COG4175 332 -------------ALIDDVLTVDADTPLSEILARIRQAPCPVA-VVDEDGRYVGIISRGELLEALARI 385 (386) T ss_pred -------------CCCCCCCCCCCCCHHHHHHHHHHCCCCCEE-EECCCCCEEEEECHHHHHHHHHCC T ss_conf -------------014653356764248899998752898546-885788577786388999997458 No 180 >cd04591 CBS_pair_EriC_assoc_euk_bac This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes and bacteria. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS Probab=96.63 E-value=0.0021 Score=42.10 Aligned_cols=50 Identities=16% Similarity=0.286 Sum_probs=43.9 Q ss_pred HEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHH Q ss_conf 131200129998630103677897986598179984188664224102999998 Q gi|254780426|r 91 DIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVIS 144 (320) Q Consensus 91 diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~ 144 (320) .+|.+ ...+++.++++.++.+.+.+.|..++||.+++ .++|+|+.||+++ T Consensus 55 ~~m~~--~p~tv~~~~~l~~~~~lf~~~g~r~l~Vv~~g--~lvGiITr~Dl~~ 104 (105) T cd04591 55 NYIDP--SPFTVSPRTSLEKVHQLFRKLGLRHLLVVDEG--RLVGIITRKDLLK 104 (105) T ss_pred HHCCC--CCEEECCCCCHHHHHHHHHHHCCCEEEEEECC--EEEEEEEHHHHHC T ss_conf 75879--98398999959999999999499588995799--9999999899534 No 181 >COG3263 NhaP-type Na+/H+ and K+/H+ antiporters with a unique C-terminal domain [Inorganic ion transport and metabolism] Probab=96.53 E-value=0.0098 Score=37.78 Aligned_cols=74 Identities=20% Similarity=0.341 Sum_probs=63.7 Q ss_pred CCEEEEECCCCHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEECCEEEEEEEECCCEEEEEEEEE Q ss_conf 733783045359999998288888777746743179999998423473798899758959999995188789999996 Q gi|254780426|r 237 DNTFIVDARTDLEELAKIIGTDCNCLKGEQDVDSLGGLIFSVLDRIPARGEVVLEIPGFEIVILDADVRCVRRVRIRR 314 (320) Q Consensus 237 ~g~~~v~G~~~l~~l~~~l~~~l~~~~~~~~~~Ti~G~i~~~l~~iP~~Ge~v~~~~g~~f~V~~~~~~rI~~v~v~~ 314 (320) -|.++++...+..++.-..+.+.- +.+...|+++++..++|..|..||.+ +|+|+...|.+.+.+||.++-++- T Consensus 494 fg~f~~e~~~~~~~~~~~yg~~~~---~~d~~~~~~~~i~~~lg~~~v~~Drv-~~~~l~~~v~~~~d~~it~ig~~~ 567 (574) T COG3263 494 FGDFILEPSAKAADVALAYGLGDG---YADKAQTLGELVTQLLGGAPVLGDRV-EFGGLIWTVAEKEDNRVTKIGVRV 567 (574) T ss_pred HCCHHCCCCHHHHHHHHHHCCCCC---HHHHHHHHHHHHHHHHCCCCEEEEEE-EECCEEEEEEECCCCEEEEEEEEC T ss_conf 121320722346679998579987---03665357788999837873220157-875679998532247344223422 No 182 >COG4109 Predicted transcriptional regulator containing CBS domains [Transcription] Probab=96.30 E-value=0.0036 Score=40.62 Aligned_cols=118 Identities=18% Similarity=0.305 Sum_probs=85.5 Q ss_pred CCHHHHCCC---CEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCC-CCC Q ss_conf 767885156---5797178851232343201356511654077662046675889999874222100000000124-677 Q gi|254780426|r 162 LSESNLIKN---ILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISA-VSD 237 (320) Q Consensus 162 ~~l~~~~r~---~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~-~~~ 237 (320) ..+.++++| ..|..++.++.+.++.-.+.+.....|+|++-.++|+||+.|++..- ... .+.+ ... T Consensus 188 ~~Vedi~~P~~~~~yL~~~d~v~d~~~l~~kt~~sRfPVvn~~~kvvGvVt~rDv~~~~------~~t----~ieKVMtk 257 (432) T COG4109 188 ITVEDIMTPLEDTSYLRETDTVEDWLDLVEKTGHSRFPVVNRSMKVVGVVTMRDVLDKK------PST----TIEKVMTK 257 (432) T ss_pred EEHHHHCCCCCCCEECCCCCCHHHHHHHHHHCCCCCCCEECCCCEEEEEEEEHHHHCCC------CCC----CHHHHHCC T ss_conf 55877446443432324645099999999971987341331442699999834644479------986----17777426 Q ss_pred CEEEEECCCCHHHHHHHHCCCC-CCCCCCCCCCCHHHHH--------HHHHCCCCCCCCEE Q ss_conf 3378304535999999828888-8777746743179999--------99842347379889 Q gi|254780426|r 238 NTFIVDARTDLEELAKIIGTDC-NCLKGEQDVDSLGGLI--------FSVLDRIPARGEVV 289 (320) Q Consensus 238 g~~~v~G~~~l~~l~~~l~~~l-~~~~~~~~~~Ti~G~i--------~~~l~~iP~~Ge~v 289 (320) +-+.+.+.+++-.+...+.|+- +-.|--.+-.|+.|.+ |...++-|++|+++ T Consensus 258 np~tv~~~tsVAsvaq~MiwE~iem~PVv~~n~~llGiitR~dvlk~lq~~q~qpqvget~ 318 (432) T COG4109 258 NPITVRAKTSVASVAQMMIWEGIEMLPVVDSNNTLLGIITRQDVLKSLQMIQRQPQVGETI 318 (432) T ss_pred CCEEECCCCHHHHHHHHHHHCCCEEEEEECCCCEEEEEEEHHHHHHHHHHHCCCCCCCCCH T ss_conf 9755425226889988887645526568837861898988999999998751288655447 No 183 >COG2524 Predicted transcriptional regulator, contains C-terminal CBS domains [Transcription] Probab=96.20 E-value=0.0068 Score=38.82 Aligned_cols=92 Identities=18% Similarity=0.270 Sum_probs=69.8 Q ss_pred CCCCHHHHC-CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCC Q ss_conf 457678851-5657971788512323432013565116540776620466758899998742221000000001246773 Q gi|254780426|r 160 IQLSESNLI-KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDN 238 (320) Q Consensus 160 ~~~~l~~~~-r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g 238 (320) +..++..+| ++++.+-.++++.++-..|-+++.+-|.|+|.. .++|++|+.||..++.-- +. +..-..+-+ .+ T Consensus 170 Pk~~V~~~~s~~~i~v~~d~tl~eaak~f~~~~i~GaPVvd~d-k~vGiit~~dI~~aia~g--~~-~~kV~~~M~--k~ 243 (294) T COG2524 170 PKEKVKNLMSKKLITVRPDDTLREAAKLFYEKGIRGAPVVDDD-KIVGIITLSDIAKAIANG--NL-DAKVSDYMR--KN 243 (294) T ss_pred CCCHHHHHCCCCCEEECCCCCHHHHHHHHHHCCCCCCCEECCC-CEEEEEEHHHHHHHHHCC--CC-CCCHHHHHC--CC T ss_conf 7302666266784676688639999999997286678510289-458999889999999768--96-543888750--57 Q ss_pred EEEEECCCCHHHHHHHHCC Q ss_conf 3783045359999998288 Q gi|254780426|r 239 TFIVDARTDLEELAKIIGT 257 (320) Q Consensus 239 ~~~v~G~~~l~~l~~~l~~ 257 (320) .+.+++...|.|.-+.+.. T Consensus 244 vitI~eDe~i~dAir~M~~ 262 (294) T COG2524 244 VITINEDEDIYDAIRLMNK 262 (294) T ss_pred CCEECCCHHHHHHHHHHHH T ss_conf 7047573049999999876 No 184 >COG0517 FOG: CBS domain [General function prediction only] Probab=96.13 E-value=0.0097 Score=37.82 Aligned_cols=52 Identities=25% Similarity=0.375 Sum_probs=39.6 Q ss_pred CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHH Q ss_conf 56579717885123234320135651165407766204667588999987422 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDI 221 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei 221 (320) +.+.++.+..++.+++..|.+++...+.|++. +...|++|..|++..+.... T Consensus 7 ~~~~~v~~~~~~~~a~~~m~~~~~~~~~v~~~-~~l~Giit~~di~~~~~~~~ 58 (117) T COG0517 7 KDVITVKPDTSVRDALLLMSENGVSAVPVVDD-GKLVGIITERDILRALAAGG 58 (117) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEEEC-CEEEEEEEHHHHHHHHHCCC T ss_conf 68808989981999999998749338997439-99999999899987664166 No 185 >pfam00478 IMPDH IMP dehydrogenase / GMP reductase domain. This family is involved in biosynthesis of guanosine nucleotide. Members of this family contain a TIM barrel structure. In the inosine monophosphate dehydrogenases 2 CBS domains pfam00571 are inserted in the TIM barrel. This family is a member of the common phosphate binding site TIM barrel family. Probab=96.07 E-value=0.008 Score=38.37 Aligned_cols=61 Identities=11% Similarity=0.148 Sum_probs=53.7 Q ss_pred CEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHH Q ss_conf 75764131200129998630103677897986598179984188664224102999998766 Q gi|254780426|r 86 KIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYIS 147 (320) Q Consensus 86 ~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~ 147 (320) ..++...|+|+.++++...+.+.+++.+++.+++...+||.+++. +++|+++.||+...-. T Consensus 140 ~~~~~~~~~~~~~lvt~~~~~~~~ea~~~l~~~kieklpvvd~~g-~L~glit~kDi~k~~~ 200 (467) T pfam00478 140 LSKVSVVMTMTEDLVTAPEGITLEEANEILHEHKIEKLPIVDDDG-ELVGLITRKDIEKARD 200 (467) T ss_pred CCCCCEEECCCCCEEEECCCCCHHHHHHHHHHHHHHHCCEECCCC-CEEEEEEHHHHHHHHC T ss_conf 543201101355427724889989999999975564154446788-3788887434677420 No 186 >PRK10070 glycine betaine transporter ATP-binding subunit; Provisional Probab=95.74 E-value=0.057 Score=32.79 Aligned_cols=125 Identities=16% Similarity=0.230 Sum_probs=71.0 Q ss_pred EEEEECCCCCCCCCCEEHHHHHHHHHHCCCC--CCCCCCCCCCCHHHHCCC-CEEECC---CCCHHHHHHHHHCCCCCCE Q ss_conf 7998418866422410299999876622356--432233345767885156-579717---8851232343201356511 Q gi|254780426|r 122 WMPVYKNSLDNPRGMVHMRDVISYISHMYAK--TNNINLNIQLSESNLIKN-ILFVPS---SMLVSDLLTNIQESRIRMA 195 (320) Q Consensus 122 R~PV~~~~~d~iiGiv~~kDll~~~~~~~~~--~~~~~~~~~~~l~~~~r~-~~~Vpe---~~~l~~lL~~m~~~~~~~a 195 (320) |+-|.++. .|+-+=+..+++....+.+-. ....+...-....++|++ +..+.. ......+++.|+.++...+ T Consensus 234 RIaVM~~G--~Ivq~GtpeeI~~~Pa~~yV~~F~~~v~~~~~~~a~~i~~~~~~~~~~~~~~~~~~~al~~m~~~~~~~~ 311 (400) T PRK10070 234 RIAIMQNG--EVVQVGTPDEILNNPANDYVRTFFRGVDISQVFSAKDIARRTPNGLIRKTPGFGPRSALKLLQDEDREYG 311 (400) T ss_pred EEEEEECC--EEEEECCHHHHHHCCCCHHHHHHHCCCCHHHCEEHHHHHCCCCCCEEECCCCCCHHHHHHHHHHCCCCEE T ss_conf 99999898--8999728899986799868998755477877023989624687521314888699999999985598679 Q ss_pred EEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCCHHHHHHHHC Q ss_conf 6540776620466758899998742221000000001246773378304535999999828 Q gi|254780426|r 196 LVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTDLEELAKIIG 256 (320) Q Consensus 196 iVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~l~~l~~~l~ 256 (320) +|+|+.+.+.|+||.+++....-.. ..-.. + ..+....++..+++.++-..+. T Consensus 312 ~vvd~~~~~~G~v~~~~~~~~~~~~-----~~~~~-~--~~~~~~~v~~~~~l~~~~~~~~ 364 (400) T PRK10070 312 YVIERGNKFVGAVSIDSLKAALTQQ-----QGLDA-A--LIDAPLAVDAQTPLSELLSHVG 364 (400) T ss_pred EEECCCCEEEEEEEHHHHHHHHHCC-----CCHHH-H--HHCCCCCCCCCCCHHHHHHHHH T ss_conf 9986998088998899999776337-----76366-7--5058842399998999999997 No 187 >TIGR01303 IMP_DH_rel_1 IMP dehydrogenase family protein; InterPro: IPR005991 This family of proteins, often annotated as a putative IMP dehydrogenase, are related to IMP dehydrogenase and GMP reductase and restricted to the high GC Gram-positive bacteria.. Probab=95.58 E-value=0.022 Score=35.50 Aligned_cols=108 Identities=15% Similarity=0.242 Sum_probs=89.3 Q ss_pred CEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHC-CCCEEEC Q ss_conf 129998630103677897986598179984188664224102999998766223564322333457678851-5657971 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLI-KNILFVP 175 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~-r~~~~Vp 175 (320) ..-+.+.++.++.+++.++.+.-|.---|.+++. +.+|+|.-+|+...- .-..+.+++ +..+.+| T Consensus 96 dtP~~~~P~~~v~~a~~l~~krah~~~~v~~~~~-~P~G~v~~~~~~G~d-------------~ft~~~~~~~~~~~~~~ 161 (476) T TIGR01303 96 DTPITLAPHDTVADALALIHKRAHGAAVVVEEDG-KPVGVVTDKDLEGVD-------------RFTQVEEVMSTDLVTLP 161 (476) T ss_pred CCCEEECCCHHHHHHHHHHHHHCCCEEEEEECCC-CEEEEEECCCCCCHH-------------HHHHHHHHHHHHHEEEC T ss_conf 3741654623389999987531143178981698-126788515310004-------------56788877754320203 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHH Q ss_conf 7885123234320135651165407766204667588999987 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLM 218 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~iv 218 (320) +...-.+++..+.+....+|.++|..|.+.|++|..--+.+-+ T Consensus 162 ~~~~P~~~f~~l~~~~~~~a~~~~~dG~l~G~l~r~Ga~ra~~ 204 (476) T TIGR01303 162 ADLEPREAFDLLEEASRKLAPVVDADGSLAGILTRTGALRATL 204 (476) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCEEEC T ss_conf 6788889999987410120111105772555441143000000 No 188 >COG2239 MgtE Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism] Probab=94.16 E-value=0.058 Score=32.74 Aligned_cols=46 Identities=30% Similarity=0.515 Sum_probs=29.4 Q ss_pred CCCEEECCCCCHHHHHHHHHC-----CCCCCEEEEECCCCCEEEEEHHHHH Q ss_conf 565797178851232343201-----3565116540776620466758899 Q gi|254780426|r 169 KNILFVPSSMLVSDLLTNIQE-----SRIRMALVIDEHGGTDGLVSYEDIV 214 (320) Q Consensus 169 r~~~~Vpe~~~l~~lL~~m~~-----~~~~~aiVvDE~G~~~GiVTleDIl 214 (320) .+-+.++++.++.+++..+|+ +..+...|+|+.+...|+|++.+++ T Consensus 140 ~e~v~l~~~~Tv~~al~~ir~~~~~~e~~~~lyVvD~~~~L~Gvvsl~~Ll 190 (451) T COG2239 140 TEFVTLPEDVTVDEALDRIRERAEDAETIYYLYVVDEKGKLLGVVSLRDLL 190 (451) T ss_pred EEEEEECCCCCHHHHHHHHHHHCCCCCCEEEEEEECCCCCEEEEEEHHHHH T ss_conf 025872067589999999997236756623799987766467775699986 No 189 >PRK11573 hypothetical protein; Provisional Probab=94.16 E-value=0.23 Score=28.81 Aligned_cols=124 Identities=7% Similarity=0.098 Sum_probs=74.2 Q ss_pred CCCCHHHHCCC---CEEECCCCCHHHHHHHHHCCCCCCEEEEECC-CCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCC Q ss_conf 45767885156---5797178851232343201356511654077-6620466758899998742221000000001246 Q gi|254780426|r 160 IQLSESNLIKN---ILFVPSSMLVSDLLTNIQESRIRMALVIDEH-GGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAV 235 (320) Q Consensus 160 ~~~~l~~~~r~---~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~-G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~ 235 (320) .+..++++|.| +..++.+.++.++++.+.+.+....+|.++. .+++|++-..|++..+.. ..+.....+.+. T Consensus 185 ~~~~v~eiMtPR~~i~~l~~~~~~~e~~~~~~~~~~SR~PVy~~~~D~iiGiv~~kdl~~~~~~----~~~~~~~~l~~~ 260 (413) T PRK11573 185 EKVTVDDIMVPRNEIIGIDINDDWKSILRQLTHSPHGRIVLYRDSLDDAISMLRVREAWRLMTE----KKEFTKETMLRA 260 (413) T ss_pred CCCEEHEEEEECCCEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEEHHHHHHHHC----CCCCCHHHHHHH T ss_conf 8989011862002268974789999999986633875425854788865999982577877642----666887888751 Q ss_pred CCCEEEEECCCCHHHHHHHHCC---CCCC----CCCCCCCCCHHHHHHHHHCCCCCCCC Q ss_conf 7733783045359999998288---8887----77746743179999998423473798 Q gi|254780426|r 236 SDNTFIVDARTDLEELAKIIGT---DCNC----LKGEQDVDSLGGLIFSVLDRIPARGE 287 (320) Q Consensus 236 ~~g~~~v~G~~~l~~l~~~l~~---~l~~----~~~~~~~~Ti~G~i~~~l~~iP~~Ge 287 (320) ...-+.|+-.+++.++-+.|.- ++.. .+.-...-|+-..+=+..|.+....+ T Consensus 261 ~~~~~fVpe~~~~~~lL~~~~~~~~h~AiVvDEyGg~~GiVTlEDilEeivGei~de~d 319 (413) T PRK11573 261 ADEIYFVPEGTPLSTQLVKFQRNKKKVGLVVNEYGDIQGLVTVEDILEEIVGDFTTSMS 319 (413) T ss_pred CCCCEEECCCCHHHHHHHHHHHHCCEEEEEECCCCCEEEEEEHHHHHHHHHCCCCCCCC T ss_conf 47877968989099999998842863799994899888995499999998387755567 No 190 >COG3620 Predicted transcriptional regulator with C-terminal CBS domains [Transcription] Probab=94.04 E-value=0.19 Score=29.28 Aligned_cols=55 Identities=22% Similarity=0.361 Sum_probs=47.9 Q ss_pred HHHHC-CCCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHH Q ss_conf 78851-565797178851232343201356511654077662046675889999874 Q gi|254780426|r 164 ESNLI-KNILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMR 219 (320) Q Consensus 164 l~~~~-r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivG 219 (320) ...+| +|++++.+..++.+++..|+..+..-..|+++ |.++|-||-.||...+.- T Consensus 67 a~~iM~spvv~v~pdDsi~~vv~lM~~~g~SQlPVi~~-~k~VGsItE~~iv~~~le 122 (187) T COG3620 67 AKTIMHSPVVSVSPDDSISDVVNLMRDKGISQLPVIEE-DKVVGSITENDIVRALLE 122 (187) T ss_pred HHHHCCCCEEEECCHHHHHHHHHHHHHCCCCCCCEEEC-CEEEEEECHHHHHHHHHC T ss_conf 76653587058772466999999999759752754108-864533149899999730 No 191 >TIGR03520 GldE gliding motility-associated protein GldE. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldC is a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. GldE was discovered because of its adjacency to GldD in F. johnsonii. Overexpression of GldE partially supresses the effects of a GldB point mutant suggesting that GldB and GldE interact. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Not all Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility and in fact some do not appear to express the gliding phenotype. Probab=93.97 E-value=0.3 Score=28.04 Aligned_cols=59 Identities=15% Similarity=0.221 Sum_probs=49.5 Q ss_pred CCCCHHHHCCC---CEEECCCCCHHHHHHHHHCCCCCCEEEEECC-CCCEEEEEHHHHHHHHH Q ss_conf 45767885156---5797178851232343201356511654077-66204667588999987 Q gi|254780426|r 160 IQLSESNLIKN---ILFVPSSMLVSDLLTNIQESRIRMALVIDEH-GGTDGLVSYEDIVSVLM 218 (320) Q Consensus 160 ~~~~l~~~~r~---~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~-G~~~GiVTleDIle~iv 218 (320) .+...+++|.| +.+++.++++.++++.+++.+....+|.++. ..+.|++...|++..+- T Consensus 189 ~~~~v~eiMtPr~~v~~l~~~~~~~e~~~~~~~~~~SR~PV~~~~~D~iiGiv~~kDll~~~~ 251 (408) T TIGR03520 189 GNTDTKQVMRPRLDIFALDIETSFSEIIPKIIENGYSRIPVYKETIDNITGVLYIKDLLPHLN 251 (408) T ss_pred CCCEEHHCCCCHHHEEEEECCCHHHHHHHHHHHCCCCCCEEECCCCCCEEEEEEHHHHHHHHH T ss_conf 998903423545795884064438999999985798722266589876179977989987873 No 192 >TIGR01137 cysta_beta cystathionine beta-synthase; InterPro: IPR005857 This model discriminates cystathionine beta-synthase, a protein found primarily in eukaryotes and carrying a C-terminal CBS domain from cysteine synthases. Cysteine synthase (O-acetylserine (thiol)-lyase) is the enzyme responsible for the formation of cysteine from O-acetyl-serine and hydrogen sulphide with the concomitant release of acetic acid - the function of many of these enzymes is unproven.; GO: 0004122 cystathionine beta-synthase activity, 0019343 cysteine biosynthetic process via cystathionine, 0005737 cytoplasm. Probab=93.11 E-value=0.13 Score=30.36 Aligned_cols=90 Identities=17% Similarity=0.240 Sum_probs=65.4 Q ss_pred CCCCEEECCCCCHHHHHHHHHCCCCCCEEEEE--------CCCCCEEEEEHHHHHHHHHHHHHHHCCC-H--HHCCCCCC Q ss_conf 15657971788512323432013565116540--------7766204667588999987422210000-0--00012467 Q gi|254780426|r 168 IKNILFVPSSMLVSDLLTNIQESRIRMALVID--------EHGGTDGLVSYEDIVSVLMRDITSEHHS-K--KSMISAVS 236 (320) Q Consensus 168 ~r~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvD--------E~G~~~GiVTleDIle~ivGei~de~d~-~--~~~i~~~~ 236 (320) +.+++.+.++.++-++.+.+|+.+..-+.||+ |-|.+.|.||+..+|..++-.=....|. . .-++..+. T Consensus 373 L~~~v~~~~t~tv~~ai~iLre~GfdQ~PVv~pga~~P~veag~v~G~v~l~~lL~~l~~~~a~~~D~v~Gkyldfk~~~ 452 (527) T TIGR01137 373 LPALVTVHPTETVGDAIEILREYGFDQLPVVKPGAAEPDVEAGKVLGSVTLRELLSALFAKKAKLEDAVSGKYLDFKVMS 452 (527) T ss_pred CCCCEEECCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHCCCCCCCCCCCCCHHHHHHH T ss_conf 88734546764089999999974873076207788889723534788742677899998424675544343000344453 Q ss_pred ---------------CCEEEEECCCCHHHHHHHHCC Q ss_conf ---------------733783045359999998288 Q gi|254780426|r 237 ---------------DNTFIVDARTDLEELAKIIGT 257 (320) Q Consensus 237 ---------------~g~~~v~G~~~l~~l~~~l~~ 257 (320) .--.-+.-..+|.++++.|+- T Consensus 453 ~Fndvssynenk~~~~~~~~i~~~~~l~~L~~~le~ 488 (527) T TIGR01137 453 RFNDVSSYNENKSKKKKFIQIGEGEKLADLSKFLEK 488 (527) T ss_pred HHCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHCC T ss_conf 210100010102467676544677518999998516 No 193 >KOG2550 consensus Probab=91.69 E-value=0.15 Score=29.95 Aligned_cols=69 Identities=14% Similarity=0.206 Sum_probs=56.2 Q ss_pred CCHHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHH Q ss_conf 8989999999873047757641312001299986301036778979865981799841886642241029999987 Q gi|254780426|r 70 FSETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISY 145 (320) Q Consensus 70 i~~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~ 145 (320) +...+.+.+ .=..+.+.++|||+. ++.+...+++++-+++.++.--++||.+++. +.+-++...||.+. T Consensus 158 vtsrdi~f~----~~~~~~~~~vmt~~~--~~~~~gi~l~~~neiL~~~kkGkl~iv~~~g-elva~~~rtDl~k~ 226 (503) T KOG2550 158 ITSRDIQFL----EDNSLLVSDVMTKNP--VTGAQGITLKEANEILKKIKKGKLPVVDDKG-ELVAMLSRTDLMKN 226 (503) T ss_pred EEHHHHHHH----HCCCCHHHHHHCCCC--CCCCCCCCHHHHHHHHHHHHCCCCCEECCCC-CEEEEEEHHHHHHH T ss_conf 742023455----345511243303443--0144446677889998763148652343677-62334333456650 No 194 >TIGR00393 kpsF sugar isomerase, KpsF/GutQ family; InterPro: IPR004800 This is a family of closely related proteins with the phosphosugar-binding domain SIS (Sugar ISomerase) followed by two copies of the CBS (named after Cystathionine Beta Synthase) domain. The group includes GutQ, a protein of the glucitol operon and KpsF, a virulence factor involved in capsular polysialic acid biosynthesis in some pathogenic strains of Escherichia coli.; GO: 0016853 isomerase activity, 0005975 carbohydrate metabolic process. Probab=91.35 E-value=0.13 Score=30.46 Aligned_cols=85 Identities=16% Similarity=0.239 Sum_probs=58.8 Q ss_pred HHHHCC---CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHH--HHHHHHHHHCCCHHHCCCCCCCC Q ss_conf 788515---6579717885123234320135651165407766204667588999--98742221000000001246773 Q gi|254780426|r 164 ESNLIK---NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVS--VLMRDITSEHHSKKSMISAVSDN 238 (320) Q Consensus 164 l~~~~r---~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle--~ivGei~de~d~~~~~i~~~~~g 238 (320) +.++|| .++.|-.++++.|+|-.|.+.+.-+|+|+|+.-.+.||+|=-||=. ..+|--. -..+-.++-..+.- T Consensus 157 V~dlm~t~d~lP~v~~tas~~DAL~e~~~~~LG~~~v~~~~~~~~Gv~tDGD~RR~l~~~g~~~--l~~~v~~~mT~~p~ 234 (272) T TIGR00393 157 VKDLMQTDDELPLVAPTASFKDALLEMSRKRLGLAVVCDENEQLKGVFTDGDLRRVLALLGGGA--LKKEVKDFMTLGPK 234 (272) T ss_pred HHHHHCCCCCCCCEECCCCCEEEEEEEECCCCCEEEEEECCCCEEEEEECCHHHHHHHHHCCHH--CCCCHHHHCCCCCE T ss_conf 6776406656782223677202102331378617999715552467871465999998816600--02312221068881 Q ss_pred EEEEECCCCHHH Q ss_conf 378304535999 Q gi|254780426|r 239 TFIVDARTDLEE 250 (320) Q Consensus 239 ~~~v~G~~~l~~ 250 (320) ++.-.+..-++- T Consensus 235 ~~~n~~~~l~~A 246 (272) T TIGR00393 235 TLKNSDELLVEA 246 (272) T ss_pred EEECHHHHHHHH T ss_conf 340145689999 No 195 >TIGR01186 proV glycine betaine/L-proline transport ATP binding subunit; InterPro: IPR005892 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family comprises the glycine betaine/L-proline ATP binding subunit in bacteria and its equivalents in archaea. This transport system belong to the larger ATP-Binding Cassette (ABC) transporter superfamily. The characteristic feature of these transporters is the obligatory coupling of ATP hydrolysis to substrate translocation. Functionally, this transport system is involved in osmoregulation. Under conditions of stress, the organism recruits this transport system to accumulate glycine betaine and other solutes which offer osmo-protection. It has been demonstrated that glycine betaine uptake is accompanied by symport with sodium ions. The locus has been named variously as proU or opuA. A gene library from L. lactis functionally complements an Escherichia coli proU mutant. The complementing locus is similar to a opuA locus in Bacillus subtlis. This clarifies the differences in nomenclature.; GO: 0005524 ATP binding, 0015171 amino acid transmembrane transporter activity, 0006865 amino acid transport, 0016020 membrane. Probab=90.48 E-value=0.5 Score=26.58 Aligned_cols=146 Identities=15% Similarity=0.184 Sum_probs=86.3 Q ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCC--CCCCCCCCCC--CHHHHC----C Q ss_conf 2999863010367789798659817998418866422410299999876622356--4322333457--678851----5 Q gi|254780426|r 98 SINAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVISYISHMYAK--TNNINLNIQL--SESNLI----K 169 (320) Q Consensus 98 ~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~--~~~~~~~~~~--~l~~~~----r 169 (320) -|+++.+|. +||+++--+ +-+..+. +|+-+=+--++|.--.+.+-. ....+..... ...++. + T Consensus 183 TIvFitHDl--DEA~rigDR-----Ivilk~G--eiVQvGTPdeIL~NPaneyVe~F~~~~dl~qv~~P~~~~i~~~~~~ 253 (372) T TIGR01186 183 TIVFITHDL--DEALRIGDR-----IVILKAG--EIVQVGTPDEILRNPANEYVEEFIGKVDLSQVLSPDAERIAKRMNT 253 (372) T ss_pred EEEEEECCH--HHHHHHHHH-----HHHEECC--CEEEECCCHHHHCCCCHHHHHHHHCCCEEEEECCCCHHHHHHHCCC T ss_conf 089994176--788767513-----2011068--6788428468742880679998737511575238888999965277 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCCHH Q ss_conf 65797178851232343201356511654077662046675889999874222100000000124677337830453599 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTDLE 249 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~l~ 249 (320) .++.+........||+.||..+..-..|||....+.|.|..+++=.+.-- +..-.. -..+.-+.|++.++|. T Consensus 254 ~~i~~t~~~gp~~Al~Lm~~~~~~s~yvv~~~~~l~G~v~~~~~~~a~~~------~~~~~~--~l~~~~~tV~~~t~L~ 325 (372) T TIGR01186 254 VPITKTADKGPRSALKLMRDERVDSLYVVDRQNKLVGVVDVESIKQARKK------AQSLQD--VLIDDILTVDEGTLLR 325 (372) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEEECCCEEEEEEEHHHHHHHHHH------HHHHHH--HHHHCCCCCCCCCCHH T ss_conf 32686589887899999986597179999728548877858899988878------778999--8620011105876048 Q ss_pred HHHHHHC-CCCC Q ss_conf 9999828-8888 Q gi|254780426|r 250 ELAKIIG-TDCN 260 (320) Q Consensus 250 ~l~~~l~-~~l~ 260 (320) |+-+..+ ...+ T Consensus 326 e~~~~v~~~~~~ 337 (372) T TIGR01186 326 ELLRKVLKAGIK 337 (372) T ss_pred HHHHHHHHCCCC T ss_conf 999998507995 No 196 >cd04618 CBS_pair_5 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=88.73 E-value=0.2 Score=29.18 Aligned_cols=46 Identities=20% Similarity=0.302 Sum_probs=31.8 Q ss_pred EECCCCCHHHHHHHHHCCCCCCEEEEE-CCCCCEEEEEHHHHHHHHH Q ss_conf 971788512323432013565116540-7766204667588999987 Q gi|254780426|r 173 FVPSSMLVSDLLTNIQESRIRMALVID-EHGGTDGLVSYEDIVSVLM 218 (320) Q Consensus 173 ~Vpe~~~l~~lL~~m~~~~~~~aiVvD-E~G~~~GiVTleDIle~iv 218 (320) .+--..++..++..|..++...|++-| ..++++|++|..|.+..+- T Consensus 5 V~D~~l~vk~Af~al~~ngi~~aplWds~~~~fvGmLt~~DfI~il~ 51 (98) T cd04618 5 VFDTKLPVKKAFNALVENGIRSAPLWDSRKQQFVGMLTITDFILILR 51 (98) T ss_pred EEECCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEEHHHHHHHHH T ss_conf 99489758999999998692699987288767888864799998876 No 197 >KOG0476 consensus Probab=85.78 E-value=1.3 Score=23.78 Aligned_cols=62 Identities=13% Similarity=0.198 Sum_probs=48.2 Q ss_pred CCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECC-CCCCCCCCEEHHHHHHHHHH Q ss_conf 77576413120012999863010367789798659817998418-86642241029999987662 Q gi|254780426|r 85 RKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKN-SLDNPRGMVHMRDVISYISH 148 (320) Q Consensus 85 ~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~-~~d~iiGiv~~kDll~~~~~ 148 (320) ..+.|+++|+ ++|.++-.+.|+.|+.+.+..+...-||+.+. +.--.+|.|..+.|..++.. T Consensus 586 h~v~VE~iMV--~dv~yI~k~~Ty~elre~l~~~~lR~~PlV~s~esmiLlGSV~R~~L~~ll~~ 648 (931) T KOG0476 586 HTVKVEHIMV--TDVKYITKDTTYRELREALQTTTLRSFPLVESKESMILLGSVARRYLTALLQR 648 (931) T ss_pred EEEEEEEECC--CCCEEEECCCCHHHHHHHHHHCCCCCCCCCCCCCCCEEEEHHHHHHHHHHHHH T ss_conf 6887430012--45204533573999999997375320144358644233016379999999985 No 198 >COG1125 OpuBA ABC-type proline/glycine betaine transport systems, ATPase components [Amino acid transport and metabolism] Probab=85.09 E-value=0.93 Score=24.81 Aligned_cols=41 Identities=22% Similarity=0.319 Sum_probs=31.7 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 78851232343201356511654077662046675889999 Q gi|254780426|r 176 SSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 176 e~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) ......+++..|-......+.|+|+.|...|.||..|++.. T Consensus 268 ~~~~~~~~ls~~~~~~~~~~~Vvd~~g~~~G~vt~~~l~~~ 308 (309) T COG1125 268 EGFVDRDALSDFLARGRSVLPVVDEDGRPLGTVTRADLLDE 308 (309) T ss_pred CCHHHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHH T ss_conf 10445899999986687246788799967658879987523 No 199 >cd04381 RhoGap_RalBP1 RhoGap_RalBP1: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain present in RalBP1 proteins, also known as RLIP, RLIP76 or cytocentrin. RalBP1 plays an important role in endocytosis during interphase. During mitosis, RalBP1 transiently associates with the centromere and has been shown to play an essential role in the proper assembly of the mitotic apparatus. RalBP1 is an effector of the Ral GTPase which itself is an effector of Ras. RalBP1 contains a RhoGAP domain, which shows weak activity towards Rac1 and Cdc42, but not towards Ral, and a Ral effector domain binding motif. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low int Probab=82.33 E-value=1.8 Score=22.89 Aligned_cols=47 Identities=15% Similarity=0.196 Sum_probs=17.3 Q ss_pred CEEEEECCCC-HHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHCCCCC Q ss_conf 3378304535-9999998288888777746743179999998423473 Q gi|254780426|r 238 NTFIVDARTD-LEELAKIIGTDCNCLKGEQDVDSLGGLIFSVLDRIPA 284 (320) Q Consensus 238 g~~~v~G~~~-l~~l~~~l~~~l~~~~~~~~~~Ti~G~i~~~l~~iP~ 284 (320) |-|+++|... ++++.+.|+-.-...-.+.+.+++||++-..|..+|+ T Consensus 40 GIFRisG~~~~i~~lk~~~d~~~~~~l~~~d~h~vA~lLK~fLReLPe 87 (182) T cd04381 40 GIYKVSGIKSKVDELKAAYNRRESPNLEEYEPPTVASLLKQYLRELPE 87 (182) T ss_pred CEEECCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 704578979999999999806899980005789999999999984998 No 200 >cd04389 RhoGAP_KIAA1688 RhoGAP_KIAA1688: GTPase-activator protein (GAP) domain for Rho-like GTPases found in KIAA1688-like proteins; KIAA1688 is a protein of unknown function that contains a RhoGAP domain and a myosin tail homology 4 (MyTH4) domain. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=82.26 E-value=2.1 Score=22.55 Aligned_cols=14 Identities=21% Similarity=0.425 Sum_probs=5.3 Q ss_pred CCHHHHHHHHCCCC Q ss_conf 35999999828888 Q gi|254780426|r 246 TDLEELAKIIGTDC 259 (320) Q Consensus 246 ~~l~~l~~~l~~~l 259 (320) |..+.+.-.||=++ T Consensus 146 M~~~NLAivf~P~L 159 (187) T cd04389 146 MDVSNLAMVFAPNI 159 (187) T ss_pred CCHHHHHHHHCCHH T ss_conf 98777699862430 No 201 >cd04402 RhoGAP_ARHGAP20 RhoGAP_ARHGAP20: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of ArhGAP20-like proteins. ArhGAP20, also known as KIAA1391 and RA-RhoGAP, contains a RhoGAP, a RA, and a PH domain, and ANXL repeats. ArhGAP20 is activated by Rap1 and induces inactivation of Rho, which in turn leads to neurite outgrowth. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=80.64 E-value=2.1 Score=22.46 Aligned_cols=11 Identities=36% Similarity=0.244 Sum_probs=4.1 Q ss_pred CCHHHHHHHHC Q ss_conf 35999999828 Q gi|254780426|r 246 TDLEELAKIIG 256 (320) Q Consensus 246 ~~l~~l~~~l~ 256 (320) |....+.-.|+ T Consensus 143 M~~~NLAivf~ 153 (192) T cd04402 143 MDAFNLAVCIA 153 (192) T ss_pred CCHHHHHHHHH T ss_conf 97899999961 No 202 >cd04406 RhoGAP_myosin_IXA RhoGAP_myosin_IXA: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain present in myosins IXA. Class IX myosins contain a characteristic head domain, a neck domain and a tail domain which contains a C6H2-zinc binding motif and a Rho-GAP domain. Class IX myosins are single-headed, processive myosins that are partly cytoplasmic, and partly associated with membranes and the actin cytoskeleton. Class IX myosins are implicated in the regulation of neuronal morphogenesis and function of sensory systems, like the inner ear. There are two major isoforms, myosin IXA and IXB with several splice variants, which are both expressed in developing neurons. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell prolife Probab=77.91 E-value=3.5 Score=21.09 Aligned_cols=48 Identities=17% Similarity=0.324 Sum_probs=18.0 Q ss_pred CCEEEEECCCC-HHHHHHHHCCCCCC-CCCCCCCCCHHHHHHHHHCCCCC Q ss_conf 73378304535-99999982888887-77746743179999998423473 Q gi|254780426|r 237 DNTFIVDARTD-LEELAKIIGTDCNC-LKGEQDVDSLGGLIFSVLDRIPA 284 (320) Q Consensus 237 ~g~~~v~G~~~-l~~l~~~l~~~l~~-~~~~~~~~Ti~G~i~~~l~~iP~ 284 (320) +|-|+++|... ++++.+.++.+... ...+.+++|+||++-..|..+|+ T Consensus 34 eGIFR~sG~~~~i~~L~~~~d~~~~~~~~~~~~v~~vaslLK~fLReLPe 83 (186) T cd04406 34 EGIYRKSGSTNKIKELRQGLDTDANSVNLDDYNIHVIASVFKQWLRDLPN 83 (186) T ss_pred CCEEEECCCHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 88044068799999999997259987886646789999999999984899 No 203 >cd04392 RhoGAP_ARHGAP19 RhoGAP_ARHGAP19: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of ArhGAP19-like proteins. The function of ArhGAP19 is unknown. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=75.59 E-value=4.1 Score=20.63 Aligned_cols=39 Identities=3% Similarity=0.118 Sum_probs=15.7 Q ss_pred CCHHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHH Q ss_conf 89899999998730477576413120012999863010367789 Q gi|254780426|r 70 FSETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAML 113 (320) Q Consensus 70 i~~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~ 113 (320) +++.--..|.+++++=++ .+.+ .-+.-++.+.+--+.++ T Consensus 5 ~~~~~~~~i~~~I~~le~---~l~~--EGIFRvsGs~~~i~~Lk 43 (208) T cd04392 5 LTEEGIAQIYQLIEYLEK---NLRV--EGLFRKPGNSARQQELR 43 (208) T ss_pred CCHHHHHHHHHHHHHHHH---CCCC--CCCEECCCCHHHHHHHH T ss_conf 767689999999999983---8997--28412788799999999 No 204 >PRK13225 phosphoglycolate phosphatase; Provisional Probab=75.45 E-value=2.8 Score=21.68 Aligned_cols=102 Identities=11% Similarity=0.012 Sum_probs=64.7 Q ss_pred HHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHH Q ss_conf 99998763378888899999999997997689898999999987304775764131200129998630103677897986 Q gi|254780426|r 38 TIRALCKTWKLMKLSHHKGIRLPRIDASLEGIFSETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEK 117 (320) Q Consensus 38 ~~~~~~~~~~~~~~~~~eel~~l~~~~~~~G~i~~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~ 117 (320) .++.+-.-||..+ .+++|++.+...++++ .+...+-.+.+=.|-| ++|+.-|. .++..+..=.-+.+++..+.+ T Consensus 83 I~NrLA~eFGY~P-i~~~dl~~LrnlsaRe-IIk~SgIs~~KiPflL--krVk~el~--~~i~~l~pf~GI~e~L~~L~~ 156 (273) T PRK13225 83 IANAHAPDFGYDP-IDERDYAQLRQWSSRT-IVRRAGLSPWQQARLL--QRVQRQLG--DCLPALQLFPGVADLLAQLRS 156 (273) T ss_pred HHHHHHHHCCCCC-CCHHHHHHHHCCCHHH-HHHHHCCCHHHHHHHH--HHHHHHHH--HHCCCCCCCCCHHHHHHHHHH T ss_conf 9999899739899-9989999996478999-9998489877768999--99999998--404304567777999999997 Q ss_pred CCCCEEEEECCCCCCCCCCEEHHHHHHH Q ss_conf 5981799841886642241029999987 Q gi|254780426|r 118 YGRSWMPVYKNSLDNPRGMVHMRDVISY 145 (320) Q Consensus 118 ~~~sR~PV~~~~~d~iiGiv~~kDll~~ 145 (320) .||..--|..++.+||.-++..-.|... T Consensus 157 ~G~~LGIITSNs~~NV~~fL~~~~L~~l 184 (273) T PRK13225 157 RSLCLGILSSNSRQNIEAFLQRQGLRSL 184 (273) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHHHH T ss_conf 7967999706828899999986238988 No 205 >cd04407 RhoGAP_myosin_IXB RhoGAP_myosin_IXB: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain present in myosins IXB. Class IX myosins contain a characteristic head domain, a neck domain and a tail domain which contains a C6H2-zinc binding motif and a Rho-GAP domain. Class IX myosins are single-headed, processive myosins that are partly cytoplasmic, and partly associated with membranes and the actin cytoskeleton. Class IX myosins are implicated in the regulation of neuronal morphogenesis and function of sensory systems, like the inner ear. There are two major isoforms, myosin IXA and IXB with several splice variants, which are both expressed in developing neurons Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell prolifer Probab=73.63 E-value=4.6 Score=20.26 Aligned_cols=48 Identities=15% Similarity=0.342 Sum_probs=18.0 Q ss_pred CCEEEEECCCC-HHHHHHHHCCCCC-CCCCCCCCCCHHHHHHHHHCCCCC Q ss_conf 73378304535-9999998288888-777746743179999998423473 Q gi|254780426|r 237 DNTFIVDARTD-LEELAKIIGTDCN-CLKGEQDVDSLGGLIFSVLDRIPA 284 (320) Q Consensus 237 ~g~~~v~G~~~-l~~l~~~l~~~l~-~~~~~~~~~Ti~G~i~~~l~~iP~ 284 (320) +|-|+++|... ++.+.+.++.+-. ....+.+.+|+||++...|..+|+ T Consensus 34 eGIfR~~G~~~~i~~l~~~~~~~~~~~~~~~~~vh~va~lLK~fLReLpe 83 (186) T cd04407 34 EGIYRKSGSANRMKELHQLLQADPENVKLENYPIHAITGLLKQWLRELPE 83 (186) T ss_pred CCCEECCCCHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 99515898899999999998069865674558789999999999982898 No 206 >cd04408 RhoGAP_GMIP RhoGAP_GMIP: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of GMIP (Gem interacting protein). GMIP plays important roles in neurite growth and axonal guidance, and interacts with Gem, a member of the RGK subfamily of the Ras small GTPase superfamily, through the N-terminal half of the protein. GMIP contains a C-terminal RhoGAP domain. GMIP inhibits RhoA function, but is inactive towards Rac1 and Cdc41. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=72.50 E-value=4.9 Score=20.08 Aligned_cols=16 Identities=13% Similarity=0.283 Sum_probs=7.1 Q ss_pred CCCCHHHHHHHHCCCC Q ss_conf 4535999999828888 Q gi|254780426|r 244 ARTDLEELAKIIGTDC 259 (320) Q Consensus 244 G~~~l~~l~~~l~~~l 259 (320) -.|....+.-.|+=.+ T Consensus 155 NkMt~~NLAivf~P~L 170 (200) T cd04408 155 NKMSPNNLGIVFGPTL 170 (200) T ss_pred CCCCHHHHHHHHHHHH T ss_conf 2798799999951050 No 207 >cd04385 RhoGAP_ARAP RhoGAP_ARAP: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain present in ARAPs. ARAPs (also known as centaurin deltas) contain, besides the RhoGAP domain, an Arf GAP, ankyrin repeat ras-associating, and PH domains. Since their ArfGAP activity is PIP3-dependent, ARAPs are considered integration points for phosphoinositide, Arf and Rho signaling. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=72.44 E-value=5 Score=20.02 Aligned_cols=15 Identities=13% Similarity=0.408 Sum_probs=5.3 Q ss_pred CCHHHHHHHHHCCCC Q ss_conf 317999999842347 Q gi|254780426|r 269 DSLGGLIFSVLDRIP 283 (320) Q Consensus 269 ~Ti~G~i~~~l~~iP 283 (320) +++||++...|..+| T Consensus 70 h~va~lLK~flReLP 84 (184) T cd04385 70 HDVADVLKRFLRDLP 84 (184) T ss_pred HHHHHHHHHHHHHCC T ss_conf 899999999997099 No 208 >cd04393 RhoGAP_FAM13A1a RhoGAP_FAM13A1a: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of FAM13A1, isoform a-like proteins. The function of FAM13A1a is unknown. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by up several orders of magnitude. Probab=69.12 E-value=7.7 Score=18.82 Aligned_cols=11 Identities=36% Similarity=0.588 Sum_probs=3.8 Q ss_pred CCHHHHHHHHC Q ss_conf 35999999828 Q gi|254780426|r 246 TDLEELAKIIG 256 (320) Q Consensus 246 ~~l~~l~~~l~ 256 (320) |....+.-.|+ T Consensus 150 M~~~NLAivfa 160 (189) T cd04393 150 MTAENLAAVFG 160 (189) T ss_pred CCHHHHHHHHC T ss_conf 98787399864 No 209 >cd04400 RhoGAP_fBEM3 RhoGAP_fBEM3: RhoGAP (GTPase-activator [GAP] protein for Rho-like small GTPases) domain of fungal BEM3-like proteins. Bem3 is a GAP protein of Cdc42, and is specifically involved in the control of the initial assembly of the septin ring in yeast bud formation. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=68.84 E-value=7.4 Score=18.91 Aligned_cols=49 Identities=18% Similarity=0.342 Sum_probs=24.5 Q ss_pred CCEEEEECCCC-HHHHHHHHCCCCCC----CCCCCCCCCHHHHHHHHHCCCCCC Q ss_conf 73378304535-99999982888887----777467431799999984234737 Q gi|254780426|r 237 DNTFIVDARTD-LEELAKIIGTDCNC----LKGEQDVDSLGGLIFSVLDRIPAR 285 (320) Q Consensus 237 ~g~~~v~G~~~-l~~l~~~l~~~l~~----~~~~~~~~Ti~G~i~~~l~~iP~~ 285 (320) +|-|+++|... ++++.+.|+.+.+. .....+.+++||++-..|..+|+. T Consensus 42 EGIFR~sG~~~~i~~l~~~~~~~~~~~~~~~~~~~dvh~va~lLK~flReLPep 95 (190) T cd04400 42 EGIFRLSGSASVIKQLKERFNTEYDVDLFSSSLYPDVHTVAGLLKLYLRELPTL 95 (190) T ss_pred CCEEECCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCC T ss_conf 741213786999999999985689988554434477799999999998619988 No 210 >cd04403 RhoGAP_ARHGAP27_15_12_9 RhoGAP_ARHGAP27_15_12_9: GTPase-activator protein (GAP) domain for Rho-like GTPases found in ARHGAP27 (also called CAMGAP1), ARHGAP15, 12 and 9-like proteins; This subgroup of ARHGAPs are multidomain proteins that contain RhoGAP, PH, SH3 and WW domains. Most members that are studied show GAP activity towards Rac1, some additionally show activity towards Cdc42. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=67.14 E-value=7.9 Score=18.74 Aligned_cols=16 Identities=13% Similarity=0.409 Sum_probs=6.5 Q ss_pred CCCHHHHHHHHHCCCC Q ss_conf 4317999999842347 Q gi|254780426|r 268 VDSLGGLIFSVLDRIP 283 (320) Q Consensus 268 ~~Ti~G~i~~~l~~iP 283 (320) .+++||++...|..+| T Consensus 70 ~h~va~lLK~fLReLP 85 (187) T cd04403 70 IHVITGALKLFFRELP 85 (187) T ss_pred HHHHHHHHHHHHHHCC T ss_conf 8999999999998589 No 211 >cd04396 RhoGAP_fSAC7_BAG7 RhoGAP_fSAC7_BAG7: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of fungal SAC7 and BAG7-like proteins. Both proteins are GTPase activating proteins of Rho1, but differ functionally in vivo: SAC7, but not BAG7, is involved in the control of Rho1-mediated activation of the PKC-MPK1 pathway. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=66.74 E-value=8.5 Score=18.53 Aligned_cols=49 Identities=18% Similarity=0.251 Sum_probs=22.2 Q ss_pred CCEEEEECCCC-HHHHHHHHCCCCC----CCCCCCCCCCHHHHHHHHHCCCCCC Q ss_conf 73378304535-9999998288888----7777467431799999984234737 Q gi|254780426|r 237 DNTFIVDARTD-LEELAKIIGTDCN----CLKGEQDVDSLGGLIFSVLDRIPAR 285 (320) Q Consensus 237 ~g~~~v~G~~~-l~~l~~~l~~~l~----~~~~~~~~~Ti~G~i~~~l~~iP~~ 285 (320) +|-|+++|... ++++.+.|+.+-. ....+.+++++||++...|..+|++ T Consensus 51 EGIfRvsG~~~~I~~Lk~~fd~~~~~~~~~~~~~~~vh~vA~lLK~fLReLPeP 104 (225) T cd04396 51 EGIFRVAGSSKRIRELQLIFSTPPDYGKSFDWDGYTVHDAASVLRRYLNNLPEP 104 (225) T ss_pred CCEEECCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 880268987999999999976686546677854387999999999999838986 No 212 >TIGR03279 cyano_FeS_chp putative FeS-containing Cyanobacterial-specific oxidoreductase. Members of this protein family are predicted FeS-containing oxidoreductases of unknown function, apparently restricted to and universal across the Cyanobacteria. The high trusted cutoff score for this model, 700 bits, excludes homologs from other lineages. This exclusion seems justified because a significant number of sequence positions are simultaneously unique to and invariant across the Cyanobacteria, suggesting a specialized, conserved function, perhaps related to photosynthesis. A distantly related protein family, TIGR03278, in universal in and restricted to archaeal methanogens, and may be linked to methanogenesis. Probab=66.71 E-value=9.8 Score=18.12 Aligned_cols=79 Identities=18% Similarity=0.233 Sum_probs=46.2 Q ss_pred CCCCEEEEECC-CC---CEEEEEHHHHHHHHHHHH-HHHCCCHHHCCCCCCCCEEEEECCCCHHHHHHHHCCCCCCCCCC Q ss_conf 56511654077-66---204667588999987422-21000000001246773378304535999999828888877774 Q gi|254780426|r 191 RIRMALVIDEH-GG---TDGLVSYEDIVSVLMRDI-TSEHHSKKSMISAVSDNTFIVDARTDLEELAKIIGTDCNCLKGE 265 (320) Q Consensus 191 ~~~~aiVvDE~-G~---~~GiVTleDIle~ivGei-~de~d~~~~~i~~~~~g~~~v~G~~~l~~l~~~l~~~l~~~~~~ 265 (320) ..++..|.++| |+ +.|++|=.||++++-|.. .|.-= -....-+.+++.|+ -.++++++.+.+|..... T Consensus 348 ~i~v~~I~N~fFG~~vtVtGLLTG~Dii~~L~~~~lgd~ll-iP~vmL~~g~~vfL--DD~t~~~l~~~L~~~V~v---- 420 (433) T TIGR03279 348 ELDLHGLASDYWGQEITVTGLLTGQDLIAGLKGKDLGDGLL-LPSVMLKHGELVFL--DDLTVEEVAEALGTPIRV---- 420 (433) T ss_pred EEEEEEECCCCCCCCEEEEECCCHHHHHHHHCCCCCCCEEE-ECHHHHCCCCCEEC--CCCCHHHHHHHCCCCEEE---- T ss_conf 79999935688899833722533899999867998898699-62799468997378--989899999870997899---- Q ss_pred CCCCCHHHHHHHH Q ss_conf 6743179999998 Q gi|254780426|r 266 QDVDSLGGLIFSV 278 (320) Q Consensus 266 ~~~~Ti~G~i~~~ 278 (320) +++-+.|+-.. T Consensus 421 --v~~~~d~v~~~ 431 (433) T TIGR03279 421 --VAGGADLLEHA 431 (433) T ss_pred --ECCHHHHHHHH T ss_conf --67899999997 No 213 >COG2216 KdpB High-affinity K+ transport system, ATPase chain B [Inorganic ion transport and metabolism] Probab=66.70 E-value=4.5 Score=20.34 Aligned_cols=37 Identities=16% Similarity=0.289 Sum_probs=28.8 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHH Q ss_conf 103677897986598179984188664224102999998 Q gi|254780426|r 106 ATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVIS 144 (320) Q Consensus 106 ~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~ 144 (320) ..++.+.+.+.+.|-+-++||+++ .+.|++|.||+++ T Consensus 413 ~~l~~~~~~vs~~GGTPL~V~~~~--~~~GVI~LkDivK 449 (681) T COG2216 413 EDLDAAVDEVSRLGGTPLVVVENG--RILGVIYLKDIVK 449 (681) T ss_pred HHHHHHHHHHHHCCCCCEEEEECC--EEEEEEEEHHHCC T ss_conf 789999999985589744998899--9999998643135 No 214 >cd04374 RhoGAP_Graf RhoGAP_Graf: GTPase-activator protein (GAP) domain for Rho-like GTPases found in GRAF (GTPase regulator associated with focal adhesion kinase); Graf is a multi-domain protein, containing SH3 and PH domains, that binds focal adhesion kinase and influences cytoskeletal changes mediated by Rho proteins. Graf exhibits GAP activity toward RhoA and Cdc42, but only weakly activates Rac1. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=66.38 E-value=7.5 Score=18.89 Aligned_cols=19 Identities=11% Similarity=0.389 Sum_probs=8.7 Q ss_pred CCCCHHHHHHHHHCCCCCC Q ss_conf 7431799999984234737 Q gi|254780426|r 267 DVDSLGGLIFSVLDRIPAR 285 (320) Q Consensus 267 ~~~Ti~G~i~~~l~~iP~~ 285 (320) +.+++||.+-..|..+|++ T Consensus 85 dvh~VAs~LK~yLReLPeP 103 (203) T cd04374 85 EIKTITSALKTYLRNLPEP 103 (203) T ss_pred CHHHHHHHHHHHHHHCCCC T ss_conf 5999999999999838986 No 215 >cd04373 RhoGAP_p190 RhoGAP_p190: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of p190-like proteins. p190, also named RhoGAP5, plays a role in neuritogenesis and axon branch stability. p190 shows a preference for Rho, over Rac and Cdc42, and consists of an N-terminal GTPase domain and a C-terminal GAP domain. The central portion of p190 contains important regulatory phosphorylation sites. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=66.29 E-value=6.4 Score=19.34 Aligned_cols=16 Identities=25% Similarity=0.615 Sum_probs=6.4 Q ss_pred CCCHHHHHHHHHCCCC Q ss_conf 4317999999842347 Q gi|254780426|r 268 VDSLGGLIFSVLDRIP 283 (320) Q Consensus 268 ~~Ti~G~i~~~l~~iP 283 (320) ++|+||++-..|..+| T Consensus 68 vh~va~lLK~fLReLP 83 (185) T cd04373 68 VNAVAGALKSFFSELP 83 (185) T ss_pred HHHHHHHHHHHHHHCC T ss_conf 9999999999998189 No 216 >cd04398 RhoGAP_fRGD1 RhoGAP_fRGD1: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of fungal RGD1-like proteins. Yeast Rgd1 is a GAP protein for Rho3 and Rho4 and plays a role in low-pH response. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=66.07 E-value=7.8 Score=18.77 Aligned_cols=13 Identities=15% Similarity=0.429 Sum_probs=4.1 Q ss_pred HHHHHHHHHCCCC Q ss_conf 7999999842347 Q gi|254780426|r 271 LGGLIFSVLDRIP 283 (320) Q Consensus 271 i~G~i~~~l~~iP 283 (320) +||++...|..+| T Consensus 76 va~lLK~flReLP 88 (192) T cd04398 76 VASLLKLFFRELP 88 (192) T ss_pred HHHHHHHHHHHCC T ss_conf 9999999998689 No 217 >cd04378 RhoGAP_GMIP_PARG1 RhoGAP_GMIP_PARG1: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of GMIP (Gem interacting protein) and PARG1 (PTPL1-associated RhoGAP1). GMIP plays important roles in neurite growth and axonal guidance, and interacts with Gem, a member of the RGK subfamily of the Ras small GTPase superfamily, through the N-terminal half of the protein. GMIP contains a C-terminal RhoGAP domain. GMIP inhibits RhoA function, but is inactive towards Rac1 and Cdc41. PARG1 interacts with Rap2, also a member of the Ras small GTPase superfamily whose exact function is unknown, and shows strong preference for Rho. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases ge Probab=65.89 E-value=8.1 Score=18.68 Aligned_cols=15 Identities=13% Similarity=0.308 Sum_probs=6.4 Q ss_pred CCCHHHHHHHHCCCC Q ss_conf 535999999828888 Q gi|254780426|r 245 RTDLEELAKIIGTDC 259 (320) Q Consensus 245 ~~~l~~l~~~l~~~l 259 (320) .|....+.-.|+=.+ T Consensus 158 kM~~~NLAivf~P~L 172 (203) T cd04378 158 KMSPNNLGIVFGPTL 172 (203) T ss_pred CCCHHHHHHHHCCHH T ss_conf 698798699971152 No 218 >cd04387 RhoGAP_Bcr RhoGAP_Bcr: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of Bcr (breakpoint cluster region protein)-like proteins. Bcr is a multidomain protein with a variety of enzymatic functions. It contains a RhoGAP and a Rho GEF domain, a Ser/Thr kinase domain, an N-terminal oligomerization domain, and a C-terminal PDZ binding domain, in addition to PH and C2 domains. Bcr is a negative regulator of: i) RacGTPase, via the Rho GAP domain, ii) the Ras-Raf-MEK-ERK pathway, via phosphorylation of the Ras binding protein AF-6, and iii) the Wnt signaling pathway through binding beta-catenin. Bcr can form a complex with beta-catenin and Tcf1. The Wnt signaling pathway is involved in cell proliferation, differentiation, and cell renewal. Bcr was discovered as a fusion partner of Abl. The Bcr-Abl fusion is characteristic for a large majority of chronic myelogenous leukemias (CML). Small GTPases cluster into distinct families, and all act as molecular switch Probab=65.85 E-value=8.1 Score=18.66 Aligned_cols=15 Identities=27% Similarity=0.527 Sum_probs=5.9 Q ss_pred CCCHHHHHHHHCCCC Q ss_conf 535999999828888 Q gi|254780426|r 245 RTDLEELAKIIGTDC 259 (320) Q Consensus 245 ~~~l~~l~~~l~~~l 259 (320) .|+...+.-.||=.+ T Consensus 146 kMt~~NLAivfgP~L 160 (196) T cd04387 146 KMSLHNLATVFGPTL 160 (196) T ss_pred CCCHHHHHHHHHHHC T ss_conf 798999699983004 No 219 >TIGR01497 kdpB K+-transporting ATPase, B subunit; InterPro: IPR006391 These sequences describe the P-type ATPase subunit of the complex responsible for translocating potassium ions across biological membranes in microbes. In Escherichia coli and other species, this complex consists of the proteins KdpA, KdpB, KdpC and KdpF. KdpB is the ATPase subunit, while KdpA is the potassium-ion translocating subunit . The function of KdpC is unclear, although it has been suggested to couple the ATPase subunit to the ion-translocating subunit , while KdpF serves to stabilize the complex . The potassium P-type ATPases have been characterised as Type IA based on a phylogenetic analysis which places this clade closest to the heavy-metal translocating ATPases (Type IB) . Others place this clade closer to the Na+/K+ antiporter type (Type IIC) based on physical characteristics . ; GO: 0008556 potassium-transporting ATPase activity, 0006813 potassium ion transport, 0016021 integral to membrane. Probab=65.64 E-value=3.4 Score=21.15 Aligned_cols=38 Identities=18% Similarity=0.279 Sum_probs=31.8 Q ss_pred CCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHHH Q ss_conf 0103677897986598179984188664224102999998 Q gi|254780426|r 105 KATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVIS 144 (320) Q Consensus 105 ~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll~ 144 (320) ...++++.+++.+.|=+=+-||.++ +|.|+++.||+.+ T Consensus 411 p~dlD~av~qVa~~GgTPLvVc~dn--~iyGVIyLKDiVK 448 (675) T TIGR01497 411 PKDLDEAVDQVAKQGGTPLVVCVDN--KIYGVIYLKDIVK 448 (675) T ss_pred CHHHHHHHHHHHHCCCCCEEEEECC--EEEEEEEEECCCC T ss_conf 8376788999873289847999757--7898898730137 No 220 >cd04386 RhoGAP_nadrin RhoGAP_nadrin: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of Nadrin-like proteins. Nadrin, also named Rich-1, has been shown to be involved in the regulation of Ca2+-dependent exocytosis in neurons and recently has been implicated in tight junction maintenance in mammalian epithelium. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=65.46 E-value=8.3 Score=18.58 Aligned_cols=15 Identities=7% Similarity=0.262 Sum_probs=6.4 Q ss_pred CCCHHHHHHHHCCCC Q ss_conf 535999999828888 Q gi|254780426|r 245 RTDLEELAKIIGTDC 259 (320) Q Consensus 245 ~~~l~~l~~~l~~~l 259 (320) .|....+.-.|+=.+ T Consensus 150 kM~~~NLAivf~P~L 164 (203) T cd04386 150 KMSPSNIAIVLAPNL 164 (203) T ss_pred CCCHHHHHHHHCCCC T ss_conf 575535377714530 No 221 >cd04395 RhoGAP_ARHGAP21 RhoGAP_ARHGAP21: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of ArhGAP21-like proteins. ArhGAP21 is a multi-domain protein, containing RhoGAP, PH and PDZ domains, and is believed to play a role in the organization of the cell-cell junction complex. It has been shown to function as a GAP of Cdc42 and RhoA, and to interact with alpha-catenin and Arf6. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=65.37 E-value=8.7 Score=18.45 Aligned_cols=11 Identities=27% Similarity=0.489 Sum_probs=3.8 Q ss_pred CCHHHHHHHHC Q ss_conf 35999999828 Q gi|254780426|r 246 TDLEELAKIIG 256 (320) Q Consensus 246 ~~l~~l~~~l~ 256 (320) |....+.-.|+ T Consensus 150 M~~~NLaivf~ 160 (196) T cd04395 150 MEPRNLAIVFG 160 (196) T ss_pred CCHHHHHHHHC T ss_conf 97567788833 No 222 >KOG4269 consensus Probab=64.63 E-value=7.1 Score=19.05 Aligned_cols=50 Identities=18% Similarity=0.407 Sum_probs=38.7 Q ss_pred CCCEEEEECCC-CHHHHHHHHCCC----CCCCCCCCCCCCHHHHHHHHHCCCCCC Q ss_conf 77337830453-599999982888----887777467431799999984234737 Q gi|254780426|r 236 SDNTFIVDART-DLEELAKIIGTD----CNCLKGEQDVDSLGGLIFSVLDRIPAR 285 (320) Q Consensus 236 ~~g~~~v~G~~-~l~~l~~~l~~~----l~~~~~~~~~~Ti~G~i~~~l~~iP~~ 285 (320) ..|-|.++|.. .|..|.++|+-+ +.....+-+++||||++--+|..+|+. T Consensus 938 EeGIyRlSGsaT~Ik~Lke~Fd~~~n~di~~~d~E~dVn~IaGlLKLYlR~LP~~ 992 (1112) T KOG4269 938 EEGIYRLSGSATDIKALKEQFDENVNKDILSMDSEMDVNAIAGLLKLYLRELPEP 992 (1112) T ss_pred HHCEEEECCCHHHHHHHHHHHHCCCCCHHHHCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 2044773250778999999860124600332012003899888999999858840 No 223 >TIGR02475 CobW cobalamin biosynthesis protein CobW; InterPro: IPR012824 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CobW proteins, which are generally found proximal to the trimeric cobaltochelatase subunit CobN, an essential protein for vitamin B12 (cobalamin) biosynthesis . They contain a P-loop nucleotide-binding loop in the N-terminal domain and a histidine-rich region in the C-terminal portion suggesting a role in metal binding, possibly as an intermediary between the cobalt transport and chelation systems. CobW might be involved in cobalt reduction leading to cobalt(I) corrinoids.; GO: 0009236 cobalamin biosynthetic process. Probab=64.36 E-value=2.7 Score=21.79 Aligned_cols=27 Identities=30% Similarity=0.624 Sum_probs=16.9 Q ss_pred CCHHHHHHHHHCCCCCCEEEEECCCCCEEE Q ss_conf 851232343201356511654077662046 Q gi|254780426|r 178 MLVSDLLTNIQESRIRMALVIDEHGGTDGL 207 (320) Q Consensus 178 ~~l~~lL~~m~~~~~~~aiVvDE~G~~~Gi 207 (320) |-+-.+|+ -.++.++|++|||||.+ || T Consensus 19 TLiRhlL~--NA~GkRiAvIVNEFGd~-Gi 45 (349) T TIGR02475 19 TLIRHLLE--NAEGKRIAVIVNEFGDL-GI 45 (349) T ss_pred HHHHHHHH--CCCCCEEEEEEECCCCC-CC T ss_conf 99999971--74787699998525560-63 No 224 >cd04404 RhoGAP-p50rhoGAP RhoGAP-p50rhoGAP: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of p50RhoGAP-like proteins; p50RhoGAP, also known as RhoGAP-1, contains a C-terminal RhoGAP domain and an N-terminal Sec14 domain which binds phosphatidylinositol 3,4,5-trisphosphate (PtdIns(3,4,5)P3). It is ubiquitously expressed and preferentially active on Cdc42. This subgroup also contains closely related ARHGAP8. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=64.16 E-value=9.6 Score=18.18 Aligned_cols=10 Identities=30% Similarity=0.451 Sum_probs=3.1 Q ss_pred CHHHHHHHHC Q ss_conf 5999999828 Q gi|254780426|r 247 DLEELAKIIG 256 (320) Q Consensus 247 ~l~~l~~~l~ 256 (320) ....+.-.|+ T Consensus 152 ~~~NLAivf~ 161 (195) T cd04404 152 TNSNLAVVFG 161 (195) T ss_pred CHHHHHHHHC T ss_conf 8888399861 No 225 >cd00159 RhoGAP RhoGAP: GTPase-activator protein (GAP) for Rho-like GTPases; GAPs towards Rho/Rac/Cdc42-like small GTPases. Small GTPases (G proteins) cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when bound to GDP. The Rho family of small G proteins, which includes Cdc42Hs, activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. G proteins generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. The RhoGAPs are one of the major classes of regulators of Rho G proteins. Probab=62.59 E-value=10 Score=18.00 Aligned_cols=11 Identities=18% Similarity=0.250 Sum_probs=3.6 Q ss_pred CCHHHHHHHHC Q ss_conf 35999999828 Q gi|254780426|r 246 TDLEELAKIIG 256 (320) Q Consensus 246 ~~l~~l~~~l~ 256 (320) |....+.-.|+ T Consensus 129 M~~~nLa~vf~ 139 (169) T cd00159 129 MTASNLAIVFA 139 (169) T ss_pred CCHHHHHHHHH T ss_conf 98899999984 No 226 >pfam04459 DUF512 Protein of unknown function (DUF512). Family of uncharacterized prokaryotic proteins. Probab=62.54 E-value=12 Score=17.61 Aligned_cols=69 Identities=19% Similarity=0.318 Sum_probs=38.6 Q ss_pred CCCCEEEEECC-CC---CEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCCHHHHHHHHCCCCCC Q ss_conf 56511654077-66---2046675889999874222100000000124677337830453599999982888887 Q gi|254780426|r 191 RIRMALVIDEH-GG---TDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTDLEELAKIIGTDCNC 261 (320) Q Consensus 191 ~~~~aiVvDE~-G~---~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~l~~l~~~l~~~l~~ 261 (320) ..++..|.++| |+ +.|++|-.||+++|-+.-.-+.=--....-+.+++.+ +| .++++++.+.+|..... T Consensus 127 ~v~v~~v~N~fFG~~vtVaGLLTg~Dii~~L~~~~lgd~lliP~~ml~~~~~~f-LD-D~t~eel~~~lg~~v~v 199 (205) T pfam04459 127 TVNVVAVKNDFFGETITVAGLLTGQDIIEELDGKELGDTLLLPGVMLKHGEEVF-LD-DMTVEELEEALGVPVIV 199 (205) T ss_pred EEEEEEECCCCCCCCEEEEEEEEHHHHHHHHCCCCCCCEEEECHHHHCCCCCEE-CC-CCCHHHHHHHHCCCEEE T ss_conf 699999533665785358761015899999668888978995689957899756-89-99899999983997899 No 227 >cd04383 RhoGAP_srGAP RhoGAP_srGAP: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain present in srGAPs. srGAPs are components of the intracellular part of Slit-Robo signalling pathway that is important for axon guidance and cell migration. srGAPs contain an N-terminal FCH domain, a central RhoGAP domain and a C-terminal SH3 domain; this SH3 domain interacts with the intracellular proline-rich-tail of the Roundabout receptor (Robo). This interaction with Robo then activates the rhoGAP domain which in turn inhibits Cdc42 activity. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific group Probab=61.01 E-value=11 Score=17.86 Aligned_cols=11 Identities=36% Similarity=0.329 Sum_probs=3.8 Q ss_pred CCHHHHHHHHC Q ss_conf 35999999828 Q gi|254780426|r 246 TDLEELAKIIG 256 (320) Q Consensus 246 ~~l~~l~~~l~ 256 (320) |....+.-.|+ T Consensus 149 M~~~NLAivfa 159 (188) T cd04383 149 MDPYNLAICFG 159 (188) T ss_pred CCHHHHHHHHH T ss_conf 99788299882 No 228 >cd04376 RhoGAP_ARHGAP6 RhoGAP_ARHGAP6: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of ArhGAP6-like proteins. ArhGAP6 shows GAP activity towards RhoA, but not towards Cdc42 and Rac1. ArhGAP6 is often deleted in microphthalmia with linear skin defects syndrome (MLS); MLS is a severe X-linked developmental disorder. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=60.61 E-value=12 Score=17.62 Aligned_cols=15 Identities=27% Similarity=0.379 Sum_probs=6.1 Q ss_pred CCCHHHHHHHHCCCC Q ss_conf 535999999828888 Q gi|254780426|r 245 RTDLEELAKIIGTDC 259 (320) Q Consensus 245 ~~~l~~l~~~l~~~l 259 (320) .|....+.-.|+=.+ T Consensus 147 kM~~~NLAivf~P~L 161 (206) T cd04376 147 KMTSLNLATIFGPNL 161 (206) T ss_pred CCCHHHHHHHHHHHH T ss_conf 897288789645021 No 229 >PRK06769 hypothetical protein; Validated Probab=58.51 E-value=7.4 Score=18.91 Aligned_cols=77 Identities=16% Similarity=0.141 Sum_probs=48.8 Q ss_pred HHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHH-HHHHHCC-CHHHCCCCCCCCEEEEECCCCHHHHHHHHCC Q ss_conf 1232343201356511654077662046675889999874-2221000-0000012467733783045359999998288 Q gi|254780426|r 180 VSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMR-DITSEHH-SKKSMISAVSDNTFIVDARTDLEELAKIIGT 257 (320) Q Consensus 180 l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivG-ei~de~d-~~~~~i~~~~~g~~~v~G~~~l~~l~~~l~~ 257 (320) +.+++..+++.+..+.+|-+..|---|+.|.+|+.+++-| .+.+-+- .. ..-+.-.++=++.--+....+.+++ T Consensus 33 ~~~ai~~L~~~g~~i~vvTNQsgI~rG~~t~~d~~~~l~~~~id~iy~CPh----~~~~~c~cRKP~pGMi~~a~~~~~i 108 (175) T PRK06769 33 TKASLQKLKAKNIKIFSFTNQPGIADGIATVADFVQELKGFGFDDIYVCPH----KHGDGCECRKPSTGMLLQAAEKHGL 108 (175) T ss_pred HHHHHHHHHHCCCEEEEEECCCHHCCCCCCHHHHHHHHCCCCCCEEEECCC----CCCCCCCCCCCCCHHHHHHHHHCCC T ss_conf 899999999869959999688211368779999999974999898998899----9967898989987899999998098 Q ss_pred CCC Q ss_conf 888 Q gi|254780426|r 258 DCN 260 (320) Q Consensus 258 ~l~ 260 (320) ++. T Consensus 109 dl~ 111 (175) T PRK06769 109 DLT 111 (175) T ss_pred CHH T ss_conf 666 No 230 >PRK04149 sat sulfate adenylyltransferase; Reviewed Probab=57.09 E-value=14 Score=17.01 Aligned_cols=24 Identities=25% Similarity=0.392 Sum_probs=11.2 Q ss_pred CCCCCCEEEEECCCCCEEEEEHHHHH Q ss_conf 13565116540776620466758899 Q gi|254780426|r 189 ESRIRMALVIDEHGGTDGLVSYEDIV 214 (320) Q Consensus 189 ~~~~~~aiVvDE~G~~~GiVTleDIl 214 (320) +.+..+|+ .+| |..+|+++.+|+. T Consensus 95 ~~g~~vaL-~~e-g~~~ail~V~ei~ 118 (390) T PRK04149 95 KEGDEVAL-VYK-GEPYGVLEVEEIY 118 (390) T ss_pred CCCCEEEE-ECC-CCEEEEEEEEEEE T ss_conf 88988998-069-9689999964542 No 231 >cd04384 RhoGAP_CdGAP RhoGAP_CdGAP: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of CdGAP-like proteins; CdGAP contains an N-terminal RhoGAP domain and a C-terminal proline-rich region, and it is active on both Cdc42 and Rac1 but not RhoA. CdGAP is recruited to focal adhesions via the interaction with the scaffold protein actopaxin (alpha-parvin). Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=56.63 E-value=14 Score=17.01 Aligned_cols=16 Identities=19% Similarity=0.360 Sum_probs=6.2 Q ss_pred CCCHHHHHHHHHCCCC Q ss_conf 4317999999842347 Q gi|254780426|r 268 VDSLGGLIFSVLDRIP 283 (320) Q Consensus 268 ~~Ti~G~i~~~l~~iP 283 (320) .+++||++-..|..+| T Consensus 72 vh~vA~lLK~fLReLP 87 (195) T cd04384 72 IHSVSSLCKLYFRELP 87 (195) T ss_pred HHHHHHHHHHHHHHCC T ss_conf 7999999999998589 No 232 >TIGR01818 ntrC nitrogen regulation protein NR(I); InterPro: IPR010114 This entry represents Nitrogen regulatory protein C (NtrC), which is a bacterial enhancer-binding protein that activates the transcription of genes encoding enzymes required for nitrogen metabolism. It is phosphorylated by NtrB and interacts with sigma-54. One of the best studied examples is its activation of the gene glnA, which encodes the enzyme glutamine synthetase.. NtrC is composed of three domains , . The 124 residue N-terminal domain is homologous to receiver domains of other response regulator proteins in two-component signal transduction systems , . The 240 residue central domain of NtrC is homologous to a domain found in all activators of the sigma-54 RNA polymerase holoenzyme , . The C-terminal domain has been indicated to contain the determinants necessary for both DNA-binding and dimerization of full-length NtrC.; GO: 0000156 two-component response regulator activity, 0003677 DNA binding, 0005524 ATP binding, 0000160 two-component signal transduction system (phosphorelay), 0009399 nitrogen fixation, 0050906 detection of stimulus during sensory perception. Probab=56.27 E-value=6.8 Score=19.18 Aligned_cols=62 Identities=11% Similarity=0.226 Sum_probs=43.1 Q ss_pred HHHHHH-HHHHHHHHHCC----CHHHCCCCCCCCEEEEE--CCCCHHHHHHHHCCCCCCCCCCCCCCCHHHHHH Q ss_conf 588999-98742221000----00000124677337830--453599999982888887777467431799999 Q gi|254780426|r 210 YEDIVS-VLMRDITSEHH----SKKSMISAVSDNTFIVD--ARTDLEELAKIIGTDCNCLKGEQDVDSLGGLIF 276 (320) Q Consensus 210 leDIle-~ivGei~de~d----~~~~~i~~~~~g~~~v~--G~~~l~~l~~~l~~~l~~~~~~~~~~Ti~G~i~ 276 (320) ..|+|| ||||--.--|- .....+.+-+-|+.-.| |.||++-=-+.|.+- .+++|.||||... T Consensus 198 P~~L~ESELFGHEkGAFTGA~~~~~GRFEQA~GGTLFLDEIGDMPl~~QTRLLRVL-----~~G~f~rVGG~~p 266 (471) T TIGR01818 198 PKDLIESELFGHEKGAFTGANTRRQGRFEQADGGTLFLDEIGDMPLDLQTRLLRVL-----ADGEFYRVGGRTP 266 (471) T ss_pred CHHHHHHHHCCCCCCCCCCCCCCCCCCEEECCCCCEECCCCCCCCHHHHHHHHHHH-----CCCCEEEECCCEE T ss_conf 24665543214565300010255677725348882101021678776877798864-----0786577268202 No 233 >smart00324 RhoGAP GTPase-activator protein for Rho-like GTPases. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases. etter domain limits and outliers. Probab=54.11 E-value=16 Score=16.70 Aligned_cols=11 Identities=27% Similarity=0.440 Sum_probs=4.3 Q ss_pred CCHHHHHHHHC Q ss_conf 35999999828 Q gi|254780426|r 246 TDLEELAKIIG 256 (320) Q Consensus 246 ~~l~~l~~~l~ 256 (320) |....+.-.|+ T Consensus 133 M~~~nLa~vfa 143 (174) T smart00324 133 MTARNLAIVFG 143 (174) T ss_pred CCHHHHHHHHH T ss_conf 98899899984 No 234 >COG2015 Alkyl sulfatase and related hydrolases [Secondary metabolites biosynthesis, transport, and catabolism] Probab=53.12 E-value=4.8 Score=20.17 Aligned_cols=126 Identities=21% Similarity=0.248 Sum_probs=63.4 Q ss_pred HCCCCEEECCCCCH-HHHHHHHHCCCCCCEEE-----EECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEE Q ss_conf 51565797178851-23234320135651165-----4077662046675889999874222100000000124677337 Q gi|254780426|r 167 LIKNILFVPSSMLV-SDLLTNIQESRIRMALV-----IDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTF 240 (320) Q Consensus 167 ~~r~~~~Vpe~~~l-~~lL~~m~~~~~~~aiV-----vDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~ 240 (320) ++-+|++.|++..+ .++..+-+-.+--.|+| .|-||++.|||+..|+=.-=|--|.-..--+ .. +.++ T Consensus 138 IViDpL~t~~tA~aAldl~~~~~g~rPV~aVIYtHsH~DHfGGVkGiv~eadV~sGkV~iiAP~GFme-~a---vaEN-- 211 (655) T COG2015 138 IVIDPLVTPETAKAALDLYNQHRGQRPVVAVIYTHSHSDHFGGVKGIVSEADVKSGKVQIIAPAGFME-EA---VAEN-- 211 (655) T ss_pred EEECCCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCEEECCCHHHCCCCCEEEECCHHHHH-HH---HHHH-- T ss_conf 99815678288999999999765889748999612542013771001578884667413764605789-99---9875-- Q ss_pred EEECCCCHHHHHHHHCCCCCCCCCCCCCCCHHHH--------HHHHHCCCCCCCCEEEEECCEEEEEE Q ss_conf 8304535999999828888877774674317999--------99984234737988997589599999 Q gi|254780426|r 241 IVDARTDLEELAKIIGTDCNCLKGEQDVDSLGGL--------IFSVLDRIPARGEVVLEIPGFEIVIL 300 (320) Q Consensus 241 ~v~G~~~l~~l~~~l~~~l~~~~~~~~~~Ti~G~--------i~~~l~~iP~~Ge~v~~~~g~~f~V~ 300 (320) +..|..-.+.-.-++|.-||... .+.++-==|- ++.-..-|-+.||.. .++|..|+-. T Consensus 212 vlAGnaM~RRa~YqyG~~Lp~g~-~G~V~~giGk~la~G~vsLiaPT~~I~~~gE~~-~iDGV~~~Fq 277 (655) T COG2015 212 VLAGNAMSRRAQYQYGTLLPPGA-QGQVGCGIGKTLATGEVSLIAPTKIIEETGETL-TIDGVEFEFQ 277 (655) T ss_pred HHHHHHHHHHHHHHHCCCCCCCC-CCCCCCCCCCCCCCCCEEEECCEEEEECCCCEE-EEECEEEEEE T ss_conf 44105676666643123358886-675040103322468525645567862367258-8705078996 No 235 >TIGR01969 minD_arch cell division ATPase MinD; InterPro: IPR010224 Proper placement of the bacterial cell division site requires the site-specific inactivation of other potential division sites. In Escherichia coli, selection of the correct mid-cell site is mediated by the MinC, MinD and MinE proteins. Several members of this family are found in archaeal genomes but their function is uncharacterised. More distantly related proteins include flagellar biosynthesis proteins and ParA chromosome partitioning proteins. This entry represents the archaeal MinD family. The exact roles of the various archaeal MinD homologs are unknown. . Probab=48.64 E-value=6.2 Score=19.43 Aligned_cols=50 Identities=14% Similarity=0.193 Sum_probs=22.0 Q ss_pred ECCCCCHHHHHHHH---HCCCC-CCEEEEECCCCCEEEEEHH---HHHHH-HHHHHHH Q ss_conf 71788512323432---01356-5116540776620466758---89999-8742221 Q gi|254780426|r 174 VPSSMLVSDLLTNI---QESRI-RMALVIDEHGGTDGLVSYE---DIVSV-LMRDITS 223 (320) Q Consensus 174 Vpe~~~l~~lL~~m---~~~~~-~~aiVvDE~G~~~GiVTle---DIle~-ivGei~d 223 (320) =||-.++.|+|+.= ++-+. =+.+|+|-+|...+=++.+ +|||. +.|.|++ T Consensus 140 NPEi~SItDaLK~k~va~~lGt~ilG~vlNRv~~~~tel~~~eiE~iLevPVl~~vPE 197 (258) T TIGR01969 140 NPEISSITDALKVKIVAEKLGTAILGVVLNRVTRDKTELGREEIEAILEVPVLGVVPE 197 (258) T ss_pred CCCHHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCHHHHHHHHCCCEEEEECC T ss_conf 7654467778899999876088324689960236666378889998847973898569 No 236 >PRK11230 glycolate oxidase subunit GlcD; Provisional Probab=47.82 E-value=15 Score=16.97 Aligned_cols=70 Identities=13% Similarity=-0.015 Sum_probs=40.8 Q ss_pred CCCHHHHHHHHHHHHHCCC--------------CCHHHHHHHHHHHCCCCE-EEHHEEECCCEEEEEEECCCHHHHHHHH Q ss_conf 8899999999997997689--------------898999999987304775-7641312001299986301036778979 Q gi|254780426|r 51 LSHHKGIRLPRIDASLEGI--------------FSETEKEIFNNILRFRKI-RIDDIMISRVSINAVEDKATVYEAMLMF 115 (320) Q Consensus 51 ~~~~eel~~l~~~~~~~G~--------------i~~~E~~ii~~vl~l~~~-~v~diMtPr~~i~~i~~~~~~~e~~~~i 115 (320) ..+.||+..++....+.|. -.+....+ ++++..+ ++.+| -|-...+.+.+.....++.+.+ T Consensus 62 P~s~eeV~~iv~~a~~~~ipvvprGggTgL~Gga~p~~~gI---vl~l~rmn~Ilei-d~~~~~v~VeaGv~~~~l~~~l 137 (499) T PRK11230 62 PKQMEQVQALLAVCHRLRVPVVARGAGTGLSGGALPLEKGV---LLVMARFNRILDI-DPVGRRARVQPGVRNLAISQAA 137 (499) T ss_pred ECCHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCEE---EEEECCCCCEEEE-ECCCCEEEEECCCCHHHHHHHH T ss_conf 09999999999999987990999789832265812689829---9980012770688-2689979994581079999989 Q ss_pred HHCCCCEEEE Q ss_conf 8659817998 Q gi|254780426|r 116 EKYGRSWMPV 125 (320) Q Consensus 116 ~~~~~sR~PV 125 (320) .++|+. ||. T Consensus 138 ~~~Gl~-~p~ 146 (499) T PRK11230 138 APHGLY-YAP 146 (499) T ss_pred HHCCCC-CCC T ss_conf 866985-488 No 237 >cd04372 RhoGAP_chimaerin RhoGAP_chimaerin: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of chimaerins. Chimaerins are a family of phorbolester- and diacylglycerol-responsive GAPs specific for the Rho-like GTPase Rac. Chimaerins exist in two alternative splice forms that each contain a C-terminal GAP domain, and a central C1 domain which binds phorbol esters, inducing a conformational change that activates the protein; one splice form is lacking the N-terminal Src homology-2 (SH2) domain. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GT Probab=47.44 E-value=21 Score=16.02 Aligned_cols=12 Identities=25% Similarity=0.576 Sum_probs=4.6 Q ss_pred CCCHHHHHHHHC Q ss_conf 535999999828 Q gi|254780426|r 245 RTDLEELAKIIG 256 (320) Q Consensus 245 ~~~l~~l~~~l~ 256 (320) .|+...+.-.|+ T Consensus 147 kM~~~NLAivf~ 158 (194) T cd04372 147 KMNAENLGIVFG 158 (194) T ss_pred CCCHHHHHHHHH T ss_conf 694999999988 No 238 >pfam00620 RhoGAP RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases. Probab=47.11 E-value=21 Score=15.99 Aligned_cols=16 Identities=25% Similarity=0.449 Sum_probs=5.5 Q ss_pred CCCHHHHHHHHHCCCC Q ss_conf 4317999999842347 Q gi|254780426|r 268 VDSLGGLIFSVLDRIP 283 (320) Q Consensus 268 ~~Ti~G~i~~~l~~iP 283 (320) .+++||++...|..+| T Consensus 53 ~~~va~~LK~flreLp 68 (152) T pfam00620 53 VHVVASLLKLFLRELP 68 (152) T ss_pred HHHHHHHHHHHHHHCC T ss_conf 9899999999998599 No 239 >KOG4270 consensus Probab=44.84 E-value=22 Score=15.76 Aligned_cols=51 Identities=20% Similarity=0.261 Sum_probs=36.3 Q ss_pred CCEEEEECCCC-HHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCC Q ss_conf 73378304535-9999998288888777746743179999998423473798 Q gi|254780426|r 237 DNTFIVDARTD-LEELAKIIGTDCNCLKGEQDVDSLGGLIFSVLDRIPARGE 287 (320) Q Consensus 237 ~g~~~v~G~~~-l~~l~~~l~~~l~~~~~~~~~~Ti~G~i~~~l~~iP~~Ge 287 (320) +|-|.+.|.-+ ++.+.++|+-..-.....-+++++||++-.+|..+|+.+= T Consensus 186 EGlFRi~~~~sk~e~lr~~ld~g~v~~~~~iDvH~~agllKayLRELPepvl 237 (577) T KOG4270 186 EGLFRINGEASKVERLREALDCGVVPDQLYIDVHCLAGLLKAYLRELPEPVL 237 (577) T ss_pred CCEECCCCCCHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCC T ss_conf 2111137872078999999728864663437788888999999986787677 No 240 >pfam11305 DUF3107 Protein of unknown function (DUF3107). Some members in this family of proteins are annotated as ATP-binding proteins however this cannot be confirmed. Currently no function is known. Probab=44.38 E-value=23 Score=15.72 Aligned_cols=50 Identities=12% Similarity=0.128 Sum_probs=40.6 Q ss_pred CCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCH Q ss_conf 888899999999997997689898999999987304775764131200129998630103 Q gi|254780426|r 49 MKLSHHKGIRLPRIDASLEGIFSETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATV 108 (320) Q Consensus 49 ~~~~~~eel~~l~~~~~~~G~i~~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~ 108 (320) ....+.+|+...+..+-..|. .+|.|.|.+=+.+++|...+.++....+- T Consensus 17 es~~s~~ev~~~v~~Al~~~~----------~~l~LtD~kGr~vlVpa~~iaYVEiG~~~ 66 (74) T pfam11305 17 ESAQSPEEVEALVAEALADGS----------GVLSLTDDKGRRVLVPAAALAYVEIGAEE 66 (74) T ss_pred ECCCCHHHHHHHHHHHHHCCC----------CEEEEEECCCCEEEEECCCEEEEEECCCC T ss_conf 658999999999999984899----------76998938998999985528999976898 No 241 >pfam01268 FTHFS Formate--tetrahydrofolate ligase. Probab=43.51 E-value=12 Score=17.66 Aligned_cols=39 Identities=8% Similarity=0.159 Sum_probs=23.4 Q ss_pred HHHHHHHHHHHCCC-CCHH-HHHHHHHHHCCCCEEEHHEEE Q ss_conf 99999997997689-8989-999999873047757641312 Q gi|254780426|r 56 GIRLPRIDASLEGI-FSET-EKEIFNNILRFRKIRIDDIMI 94 (320) Q Consensus 56 el~~l~~~~~~~G~-i~~~-E~~ii~~vl~l~~~~v~diMt 94 (320) -+.++++..-.+|- ++-+ .+-..+++++++|+..|+|-+ T Consensus 139 LlaA~iDNhi~~gn~L~IDp~~I~w~Rv~DmNDR~LR~I~i 179 (555) T pfam01268 139 LLAAAIDNHIYHGNELDIDPRRITWKRVLDMNDRALRNIVV 179 (555) T ss_pred HHHHHHHHHHHCCCCCCCCCCCCEEEEEECCHHHHHHHEEE T ss_conf 99999988885167678484413366642010344213023 No 242 >PRK05800 cobU adenosylcobinamide kinase/adenosylcobinamide-phosphate guanylyltransferase; Validated Probab=40.80 E-value=20 Score=16.03 Aligned_cols=64 Identities=22% Similarity=0.278 Sum_probs=45.1 Q ss_pred CHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCCH Q ss_conf 5123234320135651165407766204667588999987422210000000012467733783045359 Q gi|254780426|r 179 LVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTDL 248 (320) Q Consensus 179 ~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~l 248 (320) .+..++..+++...++.+|-+|-|- |+|-...+-. .+.|.--.-++.+....|..|+|-..+|+ T Consensus 104 ~~~~ll~~l~~~~~~~ViVsnEVG~--Givp~~~~~R----~frd~lG~lNq~lA~~Ad~V~lvvaGip~ 167 (170) T PRK05800 104 EIEALLAALQRLPAKIILVSNEVGM--GIVPEYRLGR----RFRDIAGRLNQQLAAAADEVYLVVAGLPL 167 (170) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCC--CCCCCCHHHH----HHHHHHHHHHHHHHHHCCEEEEEECCCCE T ss_conf 9999999998279978999778677--7889987999----99999999999999877988999578656 No 243 >TIGR01193 bacteriocin_ABC ABC-type bacteriocin transporter; InterPro: IPR005897 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family contains ABC-type bacteriocin transporter. In general, bacteriocins are agents which are responsible for killing or inhibiting the closely related species or even different strains of the same species. Bacteriocins are encoded by bacterial plasmids. Bacteriocins are named after the species and hence in literature one encounters various names e.g., leucocin from Leuconostic geldium; pedicocin from Pedicoccus acidilactici; sakacin from Lactobacillus sake etc. Peptide bacteriocins are exported across the cytoplasmic membrane by a dedicated ATP-binding cassette (ABC) transporter. These ABC-transporters have an N-terminal peptidase domain that belong to MEROPS peptidase family C39 (clan CA); a central multi-pass transmembrane region and a C-terminal ABC transporter domain. These transporters have dual function: (i) they remove the N-terminal leader peptide from its bacteriocin precursor by cleavage at a Gly-Gly bond and (ii) transport the mature bacteriocin across the cytoplasmic membrane. This represents a novel strategy for secretion of bacterial proteins . Many bacteria are known to regulate diverse physiological processes through this system, such as bioluminescence, regulation of sporulation, virulence factor expression, antibiotics production, competence for genetic transformation, and activation of biofilm formation .; GO: 0008234 cysteine-type peptidase activity, 0019534 toxin transporter activity, 0006810 transport, 0016020 membrane. Probab=40.13 E-value=26 Score=15.39 Aligned_cols=31 Identities=10% Similarity=0.159 Sum_probs=19.1 Q ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHHCCC Q ss_conf 9999999979976898989999999873047 Q gi|254780426|r 55 KGIRLPRIDASLEGIFSETEKEIFNNILRFR 85 (320) Q Consensus 55 eel~~l~~~~~~~G~i~~~E~~ii~~vl~l~ 85 (320) |=+++|-.|.+.-.-|+.+-.+.+++.|.+. T Consensus 345 ETIKsL~sE~~~y~KId~~F~~yL~K~f~y~ 375 (710) T TIGR01193 345 ETIKSLTSEAERYKKIDSEFEDYLNKSFKYQ 375 (710) T ss_pred CEEECCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 0010052366777666788888888887778 No 244 >COG0277 GlcD FAD/FMN-containing dehydrogenases [Energy production and conversion] Probab=39.18 E-value=20 Score=16.16 Aligned_cols=72 Identities=13% Similarity=0.046 Sum_probs=44.1 Q ss_pred CCCCHHHHHHHHHHHHHCCC-C-------------CHHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHH Q ss_conf 88899999999997997689-8-------------989999999873047757641312001299986301036778979 Q gi|254780426|r 50 KLSHHKGIRLPRIDASLEGI-F-------------SETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMF 115 (320) Q Consensus 50 ~~~~~eel~~l~~~~~~~G~-i-------------~~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i 115 (320) ...+++|+..++..+.+.|. + .+.. .+ ++++..+.--.---|-...+.+.+..++.++.+.+ T Consensus 37 ~p~s~~ev~~iv~~a~~~~~~v~prG~gt~~~g~~~~~~-gv---vl~l~~mn~i~~iD~~~~~~~v~aGv~l~~l~~~l 112 (459) T COG0277 37 FPKSEEEVAAILRLANENGIPVVPRGGGTSLSGGAVPDG-GV---VLDLSRLNRILEIDPEDGTATVQAGVTLEDLEKAL 112 (459) T ss_pred CCCCHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCC-EE---EEECHHCCCCEEECCCCCEEEECCCCCHHHHHHHH T ss_conf 789999999999999975990899789977767735786-49---99863306865446889989985896889999999 Q ss_pred HHCCCCEEEEE Q ss_conf 86598179984 Q gi|254780426|r 116 EKYGRSWMPVY 126 (320) Q Consensus 116 ~~~~~sR~PV~ 126 (320) ..+|+. +|+. T Consensus 113 ~~~Gl~-~p~~ 122 (459) T COG0277 113 APHGLF-LPVD 122 (459) T ss_pred HHCCCC-CCCC T ss_conf 976982-4888 No 245 >COG0523 Putative GTPases (G3E family) [General function prediction only] Probab=38.69 E-value=12 Score=17.52 Aligned_cols=18 Identities=17% Similarity=0.194 Sum_probs=12.1 Q ss_pred CCCCCCCCEEHHHHHHHH Q ss_conf 866422410299999876 Q gi|254780426|r 129 SLDNPRGMVHMRDVISYI 146 (320) Q Consensus 129 ~~d~iiGiv~~kDll~~~ 146 (320) ..|-++.+|.++.+.... T Consensus 116 ~ld~vvtvVDa~~~~~~~ 133 (323) T COG0523 116 RLDGVVTVVDAAHFLEGL 133 (323) T ss_pred EECCEEEEEEHHHHHHHH T ss_conf 413369998478865456 No 246 >pfam05198 IF3_N Translation initiation factor IF-3, N-terminal domain. Probab=38.10 E-value=28 Score=15.09 Aligned_cols=24 Identities=33% Similarity=0.479 Sum_probs=19.1 Q ss_pred CCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 511654077662046675889999 Q gi|254780426|r 193 RMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 193 ~~aiVvDE~G~~~GiVTleDIle~ 216 (320) .-..|+|+.|.-.|+++.++-++. T Consensus 13 ~~VrlI~~~G~~lGv~~~~eAl~~ 36 (76) T pfam05198 13 REVRLIDEDGEQLGVVSRAEALRL 36 (76) T ss_pred CEEEEECCCCCEECCEEHHHHHHH T ss_conf 999998899929661769999999 No 247 >cd00477 FTHFS Formyltetrahydrofolate synthetase (FTHFS) catalyzes the ATP-dependent activation of formate ion via its addition to the N10 position of tetrahydrofolate. FTHFS is a highly expressed key enzyme in both the Wood-Ljungdahl pathway of autotrophic CO2 fixation (acetogenesis) and the glycine synthase/reductase pathways of purinolysis. The key physiological role of this enzyme in acetogens is to catalyze the formylation of tetrahydrofolate, an initial step in the reduction of carbon dioxide and other one-carbon precursors to acetate. In purinolytic organisms, the enzymatic reaction is reversed, liberating formate from 10-formyltetrahydrofolate with concurrent production of ATP. Probab=35.57 E-value=24 Score=15.57 Aligned_cols=38 Identities=11% Similarity=0.137 Sum_probs=21.6 Q ss_pred HHHHHHHHHHHCCC---CCHHHHHHHHHHHCCCCEEEHHEEE Q ss_conf 99999997997689---8989999999873047757641312 Q gi|254780426|r 56 GIRLPRIDASLEGI---FSETEKEIFNNILRFRKIRIDDIMI 94 (320) Q Consensus 56 el~~l~~~~~~~G~---i~~~E~~ii~~vl~l~~~~v~diMt 94 (320) -+.++++..-.+|. |++ .+-..+++++++|+..|+|.+ T Consensus 123 LlaA~iDn~i~~gn~l~IDp-~~I~w~Rv~DmNDR~LR~I~v 163 (524) T cd00477 123 LLAAAIDNHIHHGNRLDIDP-RRITWKRVLDVNDRALRKIVI 163 (524) T ss_pred HHHHHHHHHHHCCCCCCCCC-CEEEEEEECCCCHHHHHHEEE T ss_conf 99999999886355448887-757998753561333312365 No 248 >cd07178 terB_like_YebE tellurium resistance terB-like protein, subgroup 3. This family includes several uncharacterized bacterial proteins including an Escherichia coli protein called YebE. Protein sequence homology analysis shows they are similar to tellurium resistance protein terB, but the function of this family is unknown. Probab=35.51 E-value=31 Score=14.83 Aligned_cols=28 Identities=21% Similarity=0.159 Sum_probs=23.8 Q ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHCC Q ss_conf 9999997997689898999999987304 Q gi|254780426|r 57 IRLPRIDASLEGIFSETEKEIFNNILRF 84 (320) Q Consensus 57 l~~l~~~~~~~G~i~~~E~~ii~~vl~l 84 (320) |+.++..+...|.|+..|+..|.+-+.= T Consensus 3 lrAMIaAAkADG~ID~~Er~~I~~~l~~ 30 (95) T cd07178 3 LRAMIAAAKADGHIDEAERARILGELGE 30 (95) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHH T ss_conf 9999999871699699999999999987 No 249 >COG4360 APA2 ATP adenylyltransferase (5',5'''-P-1,P-4-tetraphosphate phosphorylase II) [Nucleotide transport and metabolism] Probab=35.39 E-value=27 Score=15.20 Aligned_cols=35 Identities=14% Similarity=0.221 Sum_probs=22.3 Q ss_pred HHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHH Q ss_conf 32343201356511654077662046675889999 Q gi|254780426|r 182 DLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSV 216 (320) Q Consensus 182 ~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ 216 (320) -+|..|.--..|..+|..||-.-.-.+|+.|..-+ T Consensus 96 ~llNKF~VVdeHlLiVTrefedQ~s~LTl~Df~ta 130 (298) T COG4360 96 LLLNKFPVVDEHLLIVTREFEDQESALTLADFTTA 130 (298) T ss_pred HHHHCCCCCCCEEEEEEHHHHHCCCCCCHHHHHHH T ss_conf 56516876664357864032201265788899999 No 250 >PRK08942 D,D-heptose 1,7-bisphosphate phosphatase; Validated Probab=35.25 E-value=27 Score=15.29 Aligned_cols=37 Identities=11% Similarity=0.234 Sum_probs=32.6 Q ss_pred CCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHH Q ss_conf 8512323432013565116540776620466758899 Q gi|254780426|r 178 MLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIV 214 (320) Q Consensus 178 ~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIl 214 (320) ..+.++|..+++.+..+++|-+.-|---|..|.+|+- T Consensus 32 ~g~~~al~~l~~~g~~~~ivTNQsGI~rG~~t~~~~~ 68 (181) T PRK08942 32 PGSIEAIARLKQAGYRVVVATNQSGIARGLFTEAQLN 68 (181) T ss_pred CCHHHHHHHHHHCCCEEEEEECCHHHCCCCCCHHHHH T ss_conf 7889999999987996999958713425867799999 No 251 >PRK13506 formate--tetrahydrofolate ligase; Provisional Probab=35.10 E-value=17 Score=16.54 Aligned_cols=19 Identities=5% Similarity=0.231 Sum_probs=14.9 Q ss_pred HHHHHHHCCCCEEEHHEEE Q ss_conf 9999873047757641312 Q gi|254780426|r 76 EIFNNILRFRKIRIDDIMI 94 (320) Q Consensus 76 ~ii~~vl~l~~~~v~diMt 94 (320) -..+++++++|+..|+|.+ T Consensus 175 I~w~Rv~DmNDR~LR~I~v 193 (577) T PRK13506 175 ILWKRVVDHNDRALRMITV 193 (577) T ss_pred EEEEEEEECCCHHCCEEEE T ss_conf 6777777653510030356 No 252 >pfam02283 CobU Cobinamide kinase / cobinamide phosphate guanyltransferase. This family is composed of a group of bifunctional cobalamin biosynthesis enzymes which display cobinamide kinase and cobinamide phosphate guanyltransferase activity. The crystal structure of the enzyme reveals the molecule to be a trimer with a propeller-like shape. Probab=34.39 E-value=32 Score=14.71 Aligned_cols=64 Identities=22% Similarity=0.283 Sum_probs=45.1 Q ss_pred CHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCCH Q ss_conf 5123234320135651165407766204667588999987422210000000012467733783045359 Q gi|254780426|r 179 LVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTDL 248 (320) Q Consensus 179 ~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~l 248 (320) .+.+++..+++...++.+|-+|-|- |+|-..-+-. .+.|.--.-++.+...+|..|+|-..+|+ T Consensus 101 ~~~~ll~~l~~~~~~~ViVsnEvG~--Givp~~~~~R----~f~d~lG~lNq~lA~~ad~V~lvvaGipl 164 (166) T pfam02283 101 EVDELLAALKARPAPVVLVSNEVGL--GIVPENALGR----RFRDLLGRLNQRLAAAADEVYLVVAGLPL 164 (166) T ss_pred HHHHHHHHHHCCCCCEEEEECCCCC--CCCCCCHHHH----HHHHHHHHHHHHHHHHCCEEEEEECCCCE T ss_conf 9999999997489979999767678--8758987899----99999999999999878988999568444 No 253 >TIGR02194 GlrX_NrdH Glutaredoxin-like protein NrdH; InterPro: IPR011909 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. NrdH-redoxin is a representative of a class of small redox proteins that contain a conserved CXXC motif and are characterised by a glutaredoxin-like amino acid sequence and thioredoxin-like activity profile. Unlike other the glutaredoxins to which it is most closely related, NrdH aparrently does not interact with glutathione/glutathione reductase, but rather with thioredoxin reductase to catalyze the reduction of ribonucleotide reductase .; GO: 0006118 electron transport, 0045454 cell redox homeostasis. Probab=32.65 E-value=29 Score=15.04 Aligned_cols=37 Identities=16% Similarity=0.157 Sum_probs=27.3 Q ss_pred EEEEEECCCHHHHHHHHHHCCCCEEEEECC-CCCCCCCC Q ss_conf 999863010367789798659817998418-86642241 Q gi|254780426|r 99 INAVEDKATVYEAMLMFEKYGRSWMPVYKN-SLDNPRGM 136 (320) Q Consensus 99 i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~-~~d~iiGi 136 (320) -..+|.+.. .|+++.+++-||.-.||... +.+.+.|+ T Consensus 26 Fe~inid~~-pea~d~vk~lGF~~~PVi~~~g~~~~SGF 63 (72) T TIGR02194 26 FEEINIDEQ-PEAVDYVKALGFRQVPVIVAEGDESWSGF 63 (72) T ss_pred EEEEECCCC-HHHHHHHHHCCCEECCEEEECCCEEEECC T ss_conf 157635888-43789998748630762676797002133 No 254 >pfam04278 Tic22 Tic22-like family. The preprotein translocation at the inner envelope membrane of chloroplasts so far involves five proteins: Tic110, Tic55, Tic40, Tic22 (this family) and Tic20. The molecular function of these proteins has not yet been established. Probab=31.03 E-value=37 Score=14.36 Aligned_cols=91 Identities=20% Similarity=0.282 Sum_probs=59.7 Q ss_pred CEEEEEEECCCHHHHHHHHHHCC-----CCEEEEECCCCCCCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCCCC Q ss_conf 12999863010367789798659-----8179984188664224102999998766223564322333457678851565 Q gi|254780426|r 97 VSINAVEDKATVYEAMLMFEKYG-----RSWMPVYKNSLDNPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLIKNI 171 (320) Q Consensus 97 ~~i~~i~~~~~~~e~~~~i~~~~-----~sR~PV~~~~~d~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~r~~ 171 (320) -...+++....++-|++.+.+.| |.-.||+-..... |++.++. .+. +-+ T Consensus 137 l~f~fvP~~~qv~~A~~ll~~~gq~~~~F~GVPlF~~~~g~--~~Lti~~----------------~n~--------~~i 190 (259) T pfam04278 137 VAFRFVPDPKQVKNALELLKKNGQNVDGFRGVPVFQSRSGE--GYLTIQK----------------DNK--------QYI 190 (259) T ss_pred EEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCCEEEECCCC--CEEEEEC----------------CCC--------EEE T ss_conf 06996389899999999999709877789885568873886--2488952----------------991--------797 Q ss_pred EEECCCCCHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHH Q ss_conf 79717885123234320135651165407766204667588999987 Q gi|254780426|r 172 LFVPSSMLVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLM 218 (320) Q Consensus 172 ~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~iv 218 (320) ++.-+...+...|+++++.+-.++.=+. +=+++||+|++.+- T Consensus 191 P~FF~Kedlq~~l~~~k~qqp~l~~~~~-----I~V~~Le~vi~~m~ 232 (259) T pfam04278 191 PVFFSKEDLQRMLDRAKRQQPDLAVSVK-----IEVGSLEDVLKKML 232 (259) T ss_pred EEEEEHHHHHHHHHHHHHHCCCCCCCCE-----EEEEEHHHHHHHHH T ss_conf 5686379999999997775757678854-----99974999999986 No 255 >PRK00028 infC translation initiation factor IF-3; Reviewed Probab=29.52 E-value=39 Score=14.19 Aligned_cols=33 Identities=15% Similarity=0.195 Sum_probs=18.5 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCCCCCCCCEE Q ss_conf 103677897986598179984188664224102 Q gi|254780426|r 106 ATVYEAMLMFEKYGRSWMPVYKNSLDNPRGMVH 138 (320) Q Consensus 106 ~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~ 138 (320) .+..+|++...+.+..-+-|-.+..=-|.-++. T Consensus 33 ~~~~eAl~~A~~~~LDLV~vs~~~~PPVcKi~d 65 (175) T PRK00028 33 VSIREALRLAEEAGLDLVEISPNAKPPVCKIMD 65 (175) T ss_pred EEHHHHHHHHHHCCCCEEEECCCCCCCEEEEEH T ss_conf 649999999998199489987899998689850 No 256 >cd04409 RhoGAP_PARG1 RhoGAP_PARG1: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of PARG1 (PTPL1-associated RhoGAP1). PARG1 was originally cloned as an interaction partner of PTPL1, an intracellular protein-tyrosine phosphatase. PARG1 interacts with Rap2, also a member of the Ras small GTPase superfamily whose exact function is unknown, and shows strong preference for Rho. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=29.37 E-value=39 Score=14.18 Aligned_cols=15 Identities=20% Similarity=0.337 Sum_probs=7.3 Q ss_pred CCCHHHHHHHHCCCC Q ss_conf 535999999828888 Q gi|254780426|r 245 RTDLEELAKIIGTDC 259 (320) Q Consensus 245 ~~~l~~l~~~l~~~l 259 (320) .|....+.-.||=.+ T Consensus 166 kM~~~NLAivfgP~L 180 (211) T cd04409 166 KMSASNLGIIFGPTL 180 (211) T ss_pred CCCHHHHHHHHHHHC T ss_conf 798899999972140 No 257 >pfam04391 DUF533 Protein of unknown function (DUF533). Some family members may be secreted or integral membrane proteins. Probab=29.34 E-value=39 Score=14.17 Aligned_cols=30 Identities=20% Similarity=0.115 Sum_probs=26.2 Q ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHHCC Q ss_conf 999999997997689898999999987304 Q gi|254780426|r 55 KGIRLPRIDASLEGIFSETEKEIFNNILRF 84 (320) Q Consensus 55 eel~~l~~~~~~~G~i~~~E~~ii~~vl~l 84 (320) -=|+.++..+...|.|+.+|+..|...+.= T Consensus 80 ~llrAMIaAAkADG~ID~~Er~~I~~~l~~ 109 (187) T pfam04391 80 LLLRAMIAAAKADGHIDEEERARIEQELGE 109 (187) T ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHHHH T ss_conf 999999999985599599999999999987 No 258 >cd00544 CobU Adenosylcobinamide kinase / adenosylcobinamide phosphate guanyltransferase (CobU). CobU is bifunctional cobalbumin biosynthesis enzymes which display adenosylcobinamide kinase and adenosylcobinamide phosphate guanyltransferase activity. This enzyme is a homotrimer with a propeller-like shape. Probab=28.26 E-value=41 Score=14.05 Aligned_cols=64 Identities=20% Similarity=0.285 Sum_probs=44.8 Q ss_pred CHHHHHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCCH Q ss_conf 5123234320135651165407766204667588999987422210000000012467733783045359 Q gi|254780426|r 179 LVSDLLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTDL 248 (320) Q Consensus 179 ~l~~lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~l 248 (320) .+..++..+++...++.+|-+|-|- |+|-..-+-.. +.|.--.-++.+.+..|..|++-..+|+ T Consensus 104 ~i~~l~~~l~~~~~~~ViVSnEVG~--givp~~~~~R~----f~d~lG~lNQ~iA~~ad~V~lvvaGipl 167 (169) T cd00544 104 EIDALLAAVRNKPGTLILVSNEVGL--GVVPENALGRR----FRDELGRLNQRLAALADEVYLVVSGIPL 167 (169) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCC--CCCCCCHHHHH----HHHHHHHHHHHHHHHCCEEEEEECCCCE T ss_conf 9999999997089979999658778--98899889999----9999999999999976988999568455 No 259 >TIGR02472 sucr_P_syn_N sucrose-phosphate synthase, putative, glycosyltransferase domain; InterPro: IPR012822 This family consists of the N-terminal regions, or in some cases the entirety, of bacterial proteins closely related to plant sucrose-phosphate synthases (SPS). The C-terminal domain (IPR012821 from INTERPRO), found with most members of this family, resembles both bona fide plant sucrose-phosphate phosphatases (SPP) and the SPP-like domain of plant sucrose phosphate synthase (SPS). At least two members of this family lack the SPP-like domain, which may have binding or regulatory rather than enzymatic activity by analogy to plant SPS. This enzyme produces sucrose 6-phosphate and UDP from UDP-glucose and D-fructose 6-phosphate, and may be encoded near the gene for fructokinase.. Probab=26.25 E-value=12 Score=17.52 Aligned_cols=59 Identities=12% Similarity=0.119 Sum_probs=28.3 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHHHHHHCCCCEEEEECC Q ss_conf 8889999999999799768989899999998730477576413120012999863010367789798659817998418 Q gi|254780426|r 50 KLSHHKGIRLPRIDASLEGIFSETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAMLMFEKYGRSWMPVYKN 128 (320) Q Consensus 50 ~~~~~eel~~l~~~~~~~G~i~~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~ 128 (320) ....+++|...-..|.+ |+++|..+-. ..++..+..+-+++=......+.=.|.=|.-. T Consensus 164 aPkP~~~IE~~f~is~R---I~AEE~tL~~-----------------AslvitST~QEi~~QY~~Y~~y~P~r~~VIPP 222 (445) T TIGR02472 164 APKPPQEIEKQFNISRR---IEAEEETLAH-----------------ASLVITSTHQEIEEQYALYDSYDPERMEVIPP 222 (445) T ss_pred CCCCHHHHHHHHCCCCC---HHHHHHHHHH-----------------CCEEEEECCCEEHHHHCCCCCCCCCCCEECCC T ss_conf 78877899986122641---4788999851-----------------47458614510321210147867021135178 No 260 >pfam01565 FAD_binding_4 FAD binding domain. This family consists of various enzymes that use FAD as a co-factor, most of the enzymes are similar to oxygen oxidoreductase. One of the enzymes Vanillyl-alcohol oxidase (VAO) has a solved structure, the alignment includes the FAD binding site, called the PP-loop, between residues 99-110. The FAD molecule is covalently bound in the known structure, however the residue that links to the FAD is not in the alignment. VAO catalyses the oxidation of a wide variety of substrates, ranging form aromatic amines to 4-alkylphenols. Other members of this family include D-lactate dehydrogenase, this enzyme catalyses the conversion of D-lactate to pyruvate using FAD as a co-factor; mitomycin radical oxidase, this enzyme oxidizes the reduced form of mitomycins and is involved in mitomycin resistance. This family includes MurB an UDP-N-acetylenolpyruvoylglucosamine reductase enzyme EC:1.1.1.158. This enzyme is involved in the biosynthesis of peptidoglycan. Probab=25.58 E-value=44 Score=13.82 Aligned_cols=69 Identities=20% Similarity=0.174 Sum_probs=40.8 Q ss_pred CCEEEEECCCCHHHHHHHH---CCCCCCCCCCCCCCCHHHHHHHHHCC-C-CCCCCEEEEECCEEEEEEEECCCEE Q ss_conf 7337830453599999982---88888777746743179999998423-4-7379889975895999999518878 Q gi|254780426|r 237 DNTFIVDARTDLEELAKII---GTDCNCLKGEQDVDSLGGLIFSVLDR-I-PARGEVVLEIPGFEIVILDADVRCV 307 (320) Q Consensus 237 ~g~~~v~G~~~l~~l~~~l---~~~l~~~~~~~~~~Ti~G~i~~~l~~-i-P~~Ge~v~~~~g~~f~V~~~~~~rI 307 (320) +....+.+.+++.++.+.+ |+-++..+.....-|+||++..--.- . ...|-.. + .=..++|+..||.-+ T Consensus 62 ~~~v~v~aG~~~~~l~~~l~~~g~~~~~~~~~~~~~tvGG~i~~~~~G~~~~~~G~~~-d-~v~~~~vV~~~G~iv 135 (138) T pfam01565 62 NGTATVEAGVTLGDLVRALAAKGLLLGLEPGSGIPGTVGGAIATNGGGYGSEKYGLTR-D-NVLSLEVVLADGEVV 135 (138) T ss_pred CCEEEEECCCCHHHHHHHHHHCCCEEEECCCCCCCCEEEEEECCCCCCCCCCCCCCHH-H-EEEEEEEECCCCCEE T ss_conf 8489995780999999999972998863167667620312124896556745646488-7-489999996897199 No 261 >pfam02743 Cache_1 Cache domain. Probab=25.58 E-value=40 Score=14.13 Aligned_cols=25 Identities=16% Similarity=0.396 Sum_probs=11.7 Q ss_pred HHHHHHHHHCCCCCCEEEEECCCCC Q ss_conf 1232343201356511654077662 Q gi|254780426|r 180 VSDLLTNIQESRIRMALVIDEHGGT 204 (320) Q Consensus 180 l~~lL~~m~~~~~~~aiVvDE~G~~ 204 (320) +.+.+..++-.+...++++|+.|.+ T Consensus 41 l~~~v~~~~~g~~Gy~flvd~~G~i 65 (81) T pfam02743 41 LLKITKSIKLGKTGYAFIVDNNGKV 65 (81) T ss_pred HHHHHHHCCCCCCEEEEEECCCCCE T ss_conf 9988874502896099999399999 No 262 >PRK08053 sulfur carrier protein ThiS; Provisional Probab=24.77 E-value=47 Score=13.65 Aligned_cols=31 Identities=19% Similarity=0.292 Sum_probs=25.1 Q ss_pred CCEEECCCCCHHHHHHHHHCCCCCCEEEEEC Q ss_conf 6579717885123234320135651165407 Q gi|254780426|r 170 NILFVPSSMLVSDLLTNIQESRIRMALVIDE 200 (320) Q Consensus 170 ~~~~Vpe~~~l~~lL~~m~~~~~~~aiVvDE 200 (320) .+.-+++..++.++|..+......+|+.++. T Consensus 8 e~~e~~~~~tl~~Ll~~l~~~~~giAvAiN~ 38 (66) T PRK08053 8 QPMQCAAGQTVHELLEQLNQLQPGAALAINQ 38 (66) T ss_pred EEEECCCCCCHHHHHHHHCCCCCCEEEEECC T ss_conf 8878689985999998727888858999889 No 263 >pfam09269 DUF1967 Domain of unknown function (DUF1967). Members of this family contain a four-stranded beta sheet and three alpha helices flanked by an additional beta strand. They are predominantly found in the bacterial GTP-binding protein Obg, and are still functionally uncharacterized. Probab=24.50 E-value=48 Score=13.61 Aligned_cols=56 Identities=29% Similarity=0.392 Sum_probs=33.5 Q ss_pred CCCCCCCEEEEECCCCHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHCC----------CCCCCCEEEEECCEEEE Q ss_conf 12467733783045359999998288888777746743179999998423----------47379889975895999 Q gi|254780426|r 232 ISAVSDNTFIVDARTDLEELAKIIGTDCNCLKGEQDVDSLGGLIFSVLDR----------IPARGEVVLEIPGFEIV 298 (320) Q Consensus 232 i~~~~~g~~~v~G~~~l~~l~~~l~~~l~~~~~~~~~~Ti~G~i~~~l~~----------iP~~Ge~v~~~~g~~f~ 298 (320) +.+.++|.|.|.|.-. +.+-...+++- .+.+. ++...|.+ =-+.||+| .+++++|+ T Consensus 2 I~~~~~g~f~V~G~~i-er~~~~t~~~~--------~ea~~-~f~~~L~~~Gv~~~L~~~G~k~GD~V-~Ig~~eFe 67 (68) T pfam09269 2 IEREGDGVFVVEGPKI-ERLVKMTNFDN--------DESLR-RFARILKKLGVEDALRKAGAKDGDTV-RIGDFEFE 67 (68) T ss_pred EEEECCCEEEEECHHH-HHHHHHCCCCC--------HHHHH-HHHHHHHHCCHHHHHHHCCCCCCCEE-EEEEEEEE T ss_conf 7990795599977389-99998778998--------99999-99999998897899997699997999-99127996 No 264 >cd00517 ATPS ATP-sulfurylase (ATPS), also known as sulfate adenylate transferase, catalyzes the transfer of an adenylyl group from ATP to sulfate, forming adenosine 5'-phosphosulfate (APS). This reaction is generally accompanied by a further reaction, catalyzed by APS kinase, in which APS is phosphorylated to yield 3'-phospho-APS (PAPS). In some organisms the APS kinase is a separate protein, while in others it is incorporated with ATP sulfurylase in a bifunctional enzyme that catalyzes both reactions. In bifunctional proteins, the domain that performs the kinase activity can be attached at the N-terminal end of the sulfurylase unit or at the C-terminal end, depending on the organism. While the reaction is ubiquitous among organisms, the physiological role of the reaction varies. In some organisms it is used to generate APS from sulfate and ATP, while in others it proceeds in the opposite direction to generate ATP from APS and pyrophosphate. ATP sulfurylase can be a monomer, a hom Probab=24.45 E-value=48 Score=13.61 Aligned_cols=10 Identities=30% Similarity=0.328 Sum_probs=3.2 Q ss_pred HHHHHHHHCC Q ss_conf 7789798659 Q gi|254780426|r 110 EAMLMFEKYG 119 (320) Q Consensus 110 e~~~~i~~~~ 119 (320) +..+.+.+.| T Consensus 146 e~r~~~~~~g 155 (356) T cd00517 146 ELREEFKEKG 155 (356) T ss_pred HHHHHHHHCC T ss_conf 9999998568 No 265 >PRK11537 putative GTP-binding protein YjiA; Provisional Probab=24.43 E-value=31 Score=14.83 Aligned_cols=12 Identities=17% Similarity=0.163 Sum_probs=5.0 Q ss_pred CCCCCCEEHHHH Q ss_conf 642241029999 Q gi|254780426|r 131 DNPRGMVHMRDV 142 (320) Q Consensus 131 d~iiGiv~~kDl 142 (320) |.++.+|.+... T Consensus 124 d~vVtvVDa~~~ 135 (317) T PRK11537 124 DGVIALVDAVHA 135 (317) T ss_pred CCEEEEEEHHHH T ss_conf 655999866555 No 266 >cd07311 terB_like_1 tellurium resistance terB-like protein, subgroup 1. This family includes several uncharacterized bacterial proteins. The prototype of this CD is tellurite resistance protein from Nostoc punctiforme that belongs to COG3793. Its precise biological function and its mechanism responsible for tellurium resistance still remains rather poorly understood. Probab=24.37 E-value=48 Score=13.60 Aligned_cols=68 Identities=13% Similarity=0.150 Sum_probs=47.6 Q ss_pred HHHHCCCCCCCCHHH----HHHHHHHHHHCCCCCHHHHHHHHHHHCCCCEEEHHEEECCCEEEEEEECCCHHHHHH Q ss_conf 876337888889999----999999799768989899999998730477576413120012999863010367789 Q gi|254780426|r 42 LCKTWKLMKLSHHKG----IRLPRIDASLEGIFSETEKEIFNNILRFRKIRIDDIMISRVSINAVEDKATVYEAML 113 (320) Q Consensus 42 ~~~~~~~~~~~~~ee----l~~l~~~~~~~G~i~~~E~~ii~~vl~l~~~~v~diMtPr~~i~~i~~~~~~~e~~~ 113 (320) .-..||+....+.++ ++.++.-+...|+++++|++.+.+...--..+..-+|.-... +.+.++.+... T Consensus 8 f~~~~gf~~~p~~e~~~~Y~kall~~A~~Dg~~~~~E~~~~~g~~~a~g~~~d~veel~~~----~~De~l~~~~~ 79 (150) T cd07311 8 FQQTWGFDQIPTNQDKLAYLKALLVCAKGDGVISPEERDWAIGYAAARGGDADMVEELKEY----TADEDLEEVDF 79 (150) T ss_pred HHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHC----CCCHHHHHHHH T ss_conf 8886076668854789999999998742566668899999999999949999999998734----54111999984 No 267 >TIGR02886 spore_II_AA anti-sigma F factor antagonist; InterPro: IPR014237 This represents the anti-sigma F factor antagonist, also known as stage II sporulation protein AA. This protein is universally conserved in the endospore-forming bacteria, all of which belong to the Firmcutes.; GO: 0045152 antisigma factor binding, 0006355 regulation of transcription DNA-dependent, 0030435 sporulation. Probab=24.00 E-value=49 Score=13.55 Aligned_cols=39 Identities=8% Similarity=0.225 Sum_probs=30.6 Q ss_pred CCCCCHHHHHHHHHHHC--CCCEEEHHEEECCCEEEEEEEC Q ss_conf 68989899999998730--4775764131200129998630 Q gi|254780426|r 67 EGIFSETEKEIFNNILR--FRKIRIDDIMISRVSINAVEDK 105 (320) Q Consensus 67 ~G~i~~~E~~ii~~vl~--l~~~~v~diMtPr~~i~~i~~~ 105 (320) .|.||+++++.+.+..+ +....++++-.=..++.++|++ T Consensus 15 ~GELDHH~aE~~R~~i~~~i~~~~~~~~ilnL~~vtFMDSS 55 (106) T TIGR02886 15 SGELDHHTAEEVRRKIDDAIERRPIKHLILNLKNVTFMDSS 55 (106) T ss_pred EECCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCEEEEC T ss_conf 61004467899999999898507983689862776478651 No 268 >PRK10329 glutaredoxin-like protein; Provisional Probab=22.27 E-value=53 Score=13.33 Aligned_cols=37 Identities=16% Similarity=0.067 Sum_probs=25.0 Q ss_pred EEEEEECCCHHHHHHHHHHCCCCEEEEECCCCCCCCCC Q ss_conf 99986301036778979865981799841886642241 Q gi|254780426|r 99 INAVEDKATVYEAMLMFEKYGRSWMPVYKNSLDNPRGM 136 (320) Q Consensus 99 i~~i~~~~~~~e~~~~i~~~~~sR~PV~~~~~d~iiGi 136 (320) ...+|.+.. .++++.+...||...||...+.+...|+ T Consensus 28 y~~vdi~~d-pea~~~vk~~G~~q~PVV~~~~~~wsGF 64 (81) T PRK10329 28 FEMINVDRV-PEAADTLRAQGFRQLPVVIAGDLSWSGF 64 (81) T ss_pred EEEEECCCC-HHHHHHHHHCCCCCCCEEEECCCEECCC T ss_conf 299858999-9999999976985599899699544360 No 269 >PRK06193 hypothetical protein; Provisional Probab=21.80 E-value=54 Score=13.27 Aligned_cols=55 Identities=20% Similarity=0.080 Sum_probs=40.7 Q ss_pred EECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCCCCCCCEEEEECCCCHHHHHH Q ss_conf 40776620466758899998742221000000001246773378304535999999 Q gi|254780426|r 198 IDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMISAVSDNTFIVDARTDLEELAK 253 (320) Q Consensus 198 vDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~~~~~g~~~v~G~~~l~~l~~ 253 (320) ...-|.-.=+||..+.++++.|-..+. ..+.--+.+.+.|.|.+=|+++.+++.+ T Consensus 156 ~p~~G~N~VlVtHqvni~alTGi~P~~-eG~~vV~~P~g~g~f~vlg~i~P~~W~~ 210 (211) T PRK06193 156 PPAPGTNTVLVGHDDNFEAATGIYPVP-EGIAYVFEPDGGQPFKPLARIPPEEWAK 210 (211) T ss_pred CCCCCCEEEEEECCCCHHHHHCCCCCC-CCEEEEEEECCCCCCEEEEECCHHHHHH T ss_conf 898997489996875187772877788-8729999547999924887548888532 No 270 >COG2979 Uncharacterized protein conserved in bacteria [Function unknown] Probab=21.55 E-value=54 Score=13.24 Aligned_cols=30 Identities=20% Similarity=0.139 Sum_probs=25.6 Q ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHCCC Q ss_conf 999999979976898989999999873047 Q gi|254780426|r 56 GIRLPRIDASLEGIFSETEKEIFNNILRFR 85 (320) Q Consensus 56 el~~l~~~~~~~G~i~~~E~~ii~~vl~l~ 85 (320) =|++|+..+...|.|++.|+.+|..-++.+ T Consensus 113 ll~AmIaAAkaDGhIDe~ERa~I~~~l~~s 142 (225) T COG2979 113 LLRAMIAAAKADGHIDEKERARIMQKLQES 142 (225) T ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHC T ss_conf 999999998624776799999999999871 No 271 >cd03112 CobW_like The function of this protein family is unkown. The amino acid sequence of YjiA protein in E. coli contains several conserved motifs that characterizes it as a P-loop GTPase. YijA gene is among the genes significantly induced in response to DNA-damage caused by mitomycin. YijA gene is a homologue of the CobW gene which encodes the cobalamin synthesis protein/P47K. Probab=21.33 E-value=40 Score=14.14 Aligned_cols=25 Identities=24% Similarity=0.482 Sum_probs=13.5 Q ss_pred CCHHHHHHHHHCCCCCCEEEEECCCCC Q ss_conf 851232343201356511654077662 Q gi|254780426|r 178 MLVSDLLTNIQESRIRMALVIDEHGGT 204 (320) Q Consensus 178 ~~l~~lL~~m~~~~~~~aiVvDE~G~~ 204 (320) +-+..+|+. ....+.|++++|+|.+ T Consensus 15 Tll~~lL~~--~~~~~~avIvNEfG~~ 39 (158) T cd03112 15 TLLNHILTE--QHGRKIAVIENEFGEV 39 (158) T ss_pred HHHHHHHHC--CCCCCEEEEEECCCCC T ss_conf 999999847--8899779997076554 No 272 >cd02930 DCR_FMN 2,4-dienoyl-CoA reductase (DCR) FMN-binding domain. DCR in E. coli is an iron-sulfur flavoenzyme which contains FMN, FAD, and a 4Fe-4S cluster. It is also a monomer, unlike that of its eukaryotic counterparts which form homotetramers and lack the flavin and iron-sulfur cofactors. Metabolism of unsaturated fatty acids requires auxiliary enzymes in addition to those used in b-oxidation. After a given number of cycles through the b-oxidation pathway, those unsaturated fatty acyl-CoAs with double bonds at even-numbered carbon positions contain 2-trans, 4-cis double bonds that can not be modified by enoyl-CoA hydratase. DCR utilizes NADPH to remove the C4-C5 double bond. DCR can catalyze the reduction of both natural fatty acids with cis double bonds, as well as substrates containing trans double bonds. The reaction is initiated by hybrid transfer from NADPH to FAD, which in turn transfers electrons, one at a time, to FMN via the 4Fe-4S cluster. The fully reduced FMN provi Probab=21.25 E-value=55 Score=13.20 Aligned_cols=17 Identities=29% Similarity=0.122 Sum_probs=11.9 Q ss_pred HHHHHHHHHHHHHCCCC Q ss_conf 99999999979976898 Q gi|254780426|r 54 HKGIRLPRIDASLEGIF 70 (320) Q Consensus 54 ~eel~~l~~~~~~~G~i 70 (320) -+.++.+++.-+++|.. T Consensus 76 i~~~k~l~~~vH~~G~~ 92 (353) T cd02930 76 AAGHRLITDAVHAEGGK 92 (353) T ss_pred HHHHHHHHHHHHHCCCE T ss_conf 99999999999976997 No 273 >TIGR00421 ubiX_pad polyprenyl P-hydroxybenzoate and phenylacrylic acid decarboxylases; InterPro: IPR004507 This family contains flavoproteins, which are aromatic acid decarboxylases. An example is the Saccharomyces cerevisiae gene, PAD1 that encodes phenylacrylic acid decarboxylase. Mutations of this gene are viable and confer resistance to cinnamic acid. ; GO: 0016831 carboxy-lyase activity. Probab=21.13 E-value=56 Score=13.19 Aligned_cols=62 Identities=13% Similarity=0.235 Sum_probs=31.5 Q ss_pred CCCEEECCCCCHHH-----HHHHHHCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCHHHCCC Q ss_conf 56579717885123-----234320135651165407766204667588999987422210000000012 Q gi|254780426|r 169 KNILFVPSSMLVSD-----LLTNIQESRIRMALVIDEHGGTDGLVSYEDIVSVLMRDITSEHHSKKSMIS 233 (320) Q Consensus 169 r~~~~Vpe~~~l~~-----lL~~m~~~~~~~aiVvDE~G~~~GiVTleDIle~ivGei~de~d~~~~~i~ 233 (320) |+.+.+|--+|+.. +|+.-+..-+=|=.|--=|-... +++|++|.+||-+-|-+-.+.+.++ T Consensus 113 RkLvL~~REtPL~SiHLENmL~L~~~G~IIlPP~PaFY~rPk---S~~Dl~~~~VgR~LD~lGI~~d~f~ 179 (181) T TIGR00421 113 RKLVLVPRETPLNSIHLENMLRLSRMGAIILPPVPAFYNRPK---SVEDLIDFIVGRVLDQLGIENDLFR 179 (181) T ss_pred CCEEEECCCCCCCHHHHHHHHHHHHCCCEEECCCCCCCCCCC---CHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 424640367887515489999998279253279554447898---8788986779877776244378888 No 274 >PRK10975 TDP-fucosamine acetyltransferase; Provisional Probab=20.93 E-value=44 Score=13.86 Aligned_cols=45 Identities=13% Similarity=0.191 Sum_probs=25.1 Q ss_pred CCCEEECCCCCHHH-HH-H-HHHCCCCCC-EEEEECCCCCEEEEEHHHH Q ss_conf 56579717885123-23-4-320135651-1654077662046675889 Q gi|254780426|r 169 KNILFVPSSMLVSD-LL-T-NIQESRIRM-ALVIDEHGGTDGLVSYEDI 213 (320) Q Consensus 169 r~~~~Vpe~~~l~~-lL-~-~m~~~~~~~-aiVvDE~G~~~GiVTleDI 213 (320) |.|.|-++...-.- .. + .-+..--|. .++.|+.|.++|.||+.++ T Consensus 110 r~P~~~~~~~~rfY~~Wie~aV~G~fD~~cLv~~~~~g~i~GFvtlr~l 158 (231) T PRK10975 110 RAPWYAPDASGRFYAQWIENAVRGTFDHQCLILRDATGQIRGFVSLREL 158 (231) T ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCEEEEEEEEEC T ss_conf 7987797898789999999984666585689998689977899999852 No 275 >KOG2916 consensus Probab=20.84 E-value=31 Score=14.82 Aligned_cols=102 Identities=17% Similarity=0.183 Sum_probs=53.9 Q ss_pred EEEECCCHHHHHHHHHHCCCCEEEE-ECCCCC----CCCCCEEHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCCCCEEEC Q ss_conf 9863010367789798659817998-418866----42241029999987662235643223334576788515657971 Q gi|254780426|r 101 AVEDKATVYEAMLMFEKYGRSWMPV-YKNSLD----NPRGMVHMRDVISYISHMYAKTNNINLNIQLSESNLIKNILFVP 175 (320) Q Consensus 101 ~i~~~~~~~e~~~~i~~~~~sR~PV-~~~~~d----~iiGiv~~kDll~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~Vp 175 (320) .++.+..+.+.+..-.+.+++-=|| .+.+.+ .--||.++|..+..-.+.....-++ .+ .++.||+||- T Consensus 157 ~l~~~~~vk~~Llk~Irrrltpq~vKIRADIEv~Cf~yeGI~aIK~alk~a~~~~te~~pi------ki-~LIApPlYVl 229 (304) T KOG2916 157 ELDLSEEVKEVLLKNIRRRLTPQTVKIRADIEVSCFGYEGIDAIKAALKAALNLSTEECPI------KI-KLIAPPLYVL 229 (304) T ss_pred CCCCCHHHHHHHHHHHHHHCCCCCEEEEEEEEEEECCCCCHHHHHHHHHHHHHCCCCCCCE------EE-EEECCCEEEE T ss_conf 6677778999999999875488655787325898606567999999999887279665745------89-9855864899 Q ss_pred CCCCHHH--HHHHHHCCCCCCEEEEECCCCCEEEEE Q ss_conf 7885123--234320135651165407766204667 Q gi|254780426|r 176 SSMLVSD--LLTNIQESRIRMALVIDEHGGTDGLVS 209 (320) Q Consensus 176 e~~~l~~--lL~~m~~~~~~~aiVvDE~G~~~GiVT 209 (320) -+.++.+ .+..+...=.++-..+-+|||+--+.. T Consensus 230 TT~tldK~~g~e~l~~Ai~~i~~~I~~~~G~~~V~~ 265 (304) T KOG2916 230 TTQTLDKTKGLEVLEEAIEVIISKIEEYGGTFTVIM 265 (304) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHHHCCEEEEEE T ss_conf 886302011199999999999998875097799985 No 276 >TIGR03595 Obg_CgtA_exten Obg family GTPase CgtA, C-terminal extension. CgtA (see model TIGR02729) is a broadly conserved member of the obg family of GTPases associated with ribosome maturation. This model represents a unique C-terminal domain found in some but not all sequences of CgtA. This region is preceded, and may be followed, by a region of low-complexity sequence. Probab=20.77 E-value=56 Score=13.14 Aligned_cols=56 Identities=21% Similarity=0.281 Sum_probs=33.0 Q ss_pred CCCCCCCEEEEECCCCHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHCC----------CCCCCCEEEEECCEEEE Q ss_conf 12467733783045359999998288888777746743179999998423----------47379889975895999 Q gi|254780426|r 232 ISAVSDNTFIVDARTDLEELAKIIGTDCNCLKGEQDVDSLGGLIFSVLDR----------IPARGEVVLEIPGFEIV 298 (320) Q Consensus 232 i~~~~~g~~~v~G~~~l~~l~~~l~~~l~~~~~~~~~~Ti~G~i~~~l~~----------iP~~Ge~v~~~~g~~f~ 298 (320) |.+.++|.|.|.|.- ++.+-...+++- .+.+. ++...|.+ =-+.||+| .+++++|+ T Consensus 3 I~r~~dg~~~V~G~~-ier~~~~t~~~~--------~ea~~-rf~~~Lk~~Gv~~~L~~~G~k~GD~V-~I~~~eFe 68 (69) T TIGR03595 3 IIRDEDGVFVVSGKK-IERWVAKTPFNN--------DENLR-RFARKLKKLGVEDALRKAGAKDGDTV-RIGDFEFE 68 (69) T ss_pred EEECCCCEEEEECCH-HHHHHHHCCCCC--------HHHHH-HHHHHHHHCCHHHHHHHCCCCCCCEE-EECCEEEE T ss_conf 999789769997728-999998669999--------99999-99999998897899998699999999-99348996 No 277 >PRK05446 imidazole glycerol-phosphate dehydratase/histidinol phosphatase; Provisional Probab=20.46 E-value=57 Score=13.09 Aligned_cols=35 Identities=20% Similarity=0.102 Sum_probs=15.7 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCCCCCCCEEHHHHH Q ss_conf 367789798659817998418866422410299999 Q gi|254780426|r 108 VYEAMLMFEKYGRSWMPVYKNSLDNPRGMVHMRDVI 143 (320) Q Consensus 108 ~~e~~~~i~~~~~sR~PV~~~~~d~iiGiv~~kDll 143 (320) +.++++.+.+.||.-+-|..-+. ==.|+...+|+- T Consensus 35 v~~aLk~L~~~GY~liVVTNQsG-IgrG~~t~ed~~ 69 (354) T PRK05446 35 VIPALLKLQKAGYKLVMVTNQDG-LGTDSFPQEDFD 69 (354) T ss_pred HHHHHHHHHHCCCEEEEEECCCC-CCCCCCCHHHHH T ss_conf 89999999987998999948851-147988999999 No 278 >pfam02492 cobW CobW/HypB/UreG, nucleotide-binding domain. This domain is found in HypB, a hydrogenase expression / formation protein, and UreG a urease accessory protein. Both these proteins contain a P-loop nucleotide binding motif. HypB has GTPase activity and is a guanine nucleotide binding protein. It is not known whether UreG binds GTP or some other nucleotide. Both enzymes are involved in nickel binding. HypB can store nickel and is required for nickel dependent hydrogenase expression. UreG is required for functional incorporation of the urease nickel metallocenter. GTP hydrolysis may required by these proteins for nickel incorporation into other nickel proteins. This family of domains also contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans. Probab=20.37 E-value=43 Score=13.90 Aligned_cols=15 Identities=13% Similarity=0.492 Sum_probs=5.7 Q ss_pred CCCHHHHHHHHHHHC Q ss_conf 989899999998730 Q gi|254780426|r 69 IFSETEKEIFNNILR 83 (320) Q Consensus 69 ~i~~~E~~ii~~vl~ 83 (320) -+.+--..+++++++ T Consensus 8 FLGsGKTTll~~ll~ 22 (174) T pfam02492 8 FLGSGKTTLLEHLLR 22 (174) T ss_pred CCCCCHHHHHHHHHH T ss_conf 887889999999998 Done!