Query gi|255764470|ref|YP_003064828.2| integral membrane protein TerC [Candidatus Liberibacter asiaticus str. psy62] Match_columns 523 No_of_seqs 354 out of 3532 Neff 6.1 Searched_HMMs 39220 Date Sun May 29 16:25:30 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 255764470.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG1253 TlyC Hemolysins and re 100.0 0 0 541.1 42.5 407 90-521 12-425 (429) 2 PRK11573 hypothetical protein; 100.0 0 0 507.9 42.7 303 209-522 93-408 (413) 3 TIGR03520 GldE gliding motilit 100.0 0 0 506.9 40.0 299 208-517 100-407 (408) 4 COG0861 TerC Membrane protein 100.0 0 0 467.6 28.7 241 1-243 6-250 (254) 5 COG4536 CorB Putative Mg2+ and 100.0 0 0 449.7 33.9 334 148-521 76-419 (423) 6 COG4535 CorC Putative Mg2+ and 100.0 0 0 407.2 18.8 273 240-521 10-285 (293) 7 pfam03741 TerC Integral membra 100.0 8.5E-41 2.8E-45 317.2 21.4 178 16-208 1-184 (184) 8 cd04590 CBS_pair_CorC_HlyC_ass 99.9 7.9E-24 2E-28 193.2 11.3 110 314-423 1-111 (111) 9 pfam04332 DUF475 Protein of un 99.7 2.4E-15 6E-20 131.2 12.2 205 27-240 1-293 (294) 10 KOG2118 consensus 99.6 5E-16 1.3E-20 136.2 7.4 149 285-435 185-338 (498) 11 COG2899 Uncharacterized protei 99.6 2.2E-15 5.7E-20 131.4 10.6 219 15-241 39-341 (346) 12 pfam03471 CorC_HlyC Transporte 99.6 4.4E-15 1.1E-19 129.3 11.2 77 440-518 2-80 (80) 13 cd04596 CBS_pair_DRTGG_assoc T 99.6 4.1E-14 1E-18 122.2 12.0 106 314-423 2-108 (108) 14 cd04639 CBS_pair_26 The CBS do 99.5 6.1E-13 1.6E-17 113.6 12.5 108 314-422 1-110 (111) 15 cd04631 CBS_pair_18 The CBS do 99.4 5E-12 1.3E-16 106.9 12.5 110 314-423 1-125 (125) 16 cd04610 CBS_pair_ParBc_assoc T 99.4 4.6E-12 1.2E-16 107.2 11.3 105 314-423 2-107 (107) 17 cd04612 CBS_pair_SpoIVFB_EriC_ 99.4 4.1E-12 1E-16 107.5 10.4 108 314-423 1-111 (111) 18 COG4109 Predicted transcriptio 99.4 1.1E-11 2.8E-16 104.4 11.5 129 293-425 172-306 (432) 19 cd04611 CBS_pair_PAS_GGDEF_DUF 99.3 3.2E-11 8.2E-16 101.0 12.6 107 314-422 1-110 (111) 20 cd04600 CBS_pair_HPP_assoc Thi 99.3 5.1E-11 1.3E-15 99.6 12.5 109 314-423 2-124 (124) 21 cd04613 CBS_pair_SpoIVFB_EriC_ 99.3 4.1E-11 1E-15 100.2 11.8 109 314-423 1-114 (114) 22 cd04588 CBS_pair_CAP-ED_DUF294 99.3 4.2E-11 1.1E-15 100.2 11.8 108 314-423 1-110 (110) 23 COG3448 CBS-domain-containing 99.3 1.1E-09 2.8E-14 89.8 18.4 270 134-427 82-374 (382) 24 cd04585 CBS_pair_ACT_assoc2 Th 99.3 7.7E-11 2E-15 98.2 12.1 108 314-423 1-122 (122) 25 cd04642 CBS_pair_29 The CBS do 99.3 1.2E-10 3.1E-15 96.8 13.0 108 314-422 1-125 (126) 26 cd04594 CBS_pair_EriC_assoc_ar 99.3 6E-11 1.5E-15 99.0 11.3 101 317-423 4-104 (104) 27 cd04635 CBS_pair_22 The CBS do 99.3 9.7E-11 2.5E-15 97.5 12.3 109 314-423 1-122 (122) 28 cd04643 CBS_pair_30 The CBS do 99.3 1.1E-10 2.8E-15 97.1 12.2 107 314-423 1-116 (116) 29 cd04632 CBS_pair_19 The CBS do 99.3 1.2E-10 3.1E-15 96.8 12.4 109 314-423 1-128 (128) 30 cd04624 CBS_pair_11 The CBS do 99.3 6.4E-11 1.6E-15 98.8 10.4 109 314-423 1-112 (112) 31 cd04605 CBS_pair_MET2_assoc Th 99.3 5.7E-11 1.4E-15 99.2 10.0 107 314-422 2-109 (110) 32 cd04601 CBS_pair_IMPDH This cd 99.3 9E-11 2.3E-15 97.7 10.8 106 314-422 2-109 (110) 33 TIGR01302 IMP_dehydrog inosine 99.3 3E-10 7.7E-15 93.9 13.4 176 271-460 45-244 (476) 34 cd04622 CBS_pair_9 The CBS dom 99.2 1.8E-10 4.7E-15 95.5 11.7 108 314-423 1-113 (113) 35 cd04803 CBS_pair_15 The CBS do 99.2 2.6E-10 6.6E-15 94.4 12.1 106 316-422 3-121 (122) 36 cd04604 CBS_pair_KpsF_GutQ_ass 99.2 3.7E-10 9.4E-15 93.3 12.4 109 314-423 2-114 (114) 37 cd04623 CBS_pair_10 The CBS do 99.2 3.9E-10 1E-14 93.1 12.0 108 314-423 1-113 (113) 38 cd04638 CBS_pair_25 The CBS do 99.2 3.6E-10 9.1E-15 93.4 11.8 105 314-423 1-106 (106) 39 cd04633 CBS_pair_20 The CBS do 99.2 4.9E-10 1.3E-14 92.3 12.0 106 314-422 1-120 (121) 40 cd04584 CBS_pair_ACT_assoc Thi 99.2 6E-10 1.5E-14 91.7 12.3 107 314-422 1-120 (121) 41 cd04586 CBS_pair_BON_assoc Thi 99.2 4.7E-10 1.2E-14 92.5 11.8 108 314-423 2-135 (135) 42 cd04609 CBS_pair_PALP_assoc2 T 99.2 2.1E-10 5.2E-15 95.1 9.9 104 315-422 2-109 (110) 43 cd04637 CBS_pair_24 The CBS do 99.2 8.6E-10 2.2E-14 90.6 12.7 107 314-422 1-121 (122) 44 cd04621 CBS_pair_8 The CBS dom 99.2 5.9E-10 1.5E-14 91.8 11.7 108 314-423 1-135 (135) 45 cd04583 CBS_pair_ABC_OpuCA_ass 99.2 4.4E-10 1.1E-14 92.7 10.4 107 314-423 2-109 (109) 46 cd04625 CBS_pair_12 The CBS do 99.2 1.1E-09 2.7E-14 89.9 12.3 107 314-423 1-112 (112) 47 cd04630 CBS_pair_17 The CBS do 99.2 1E-09 2.7E-14 90.0 12.1 109 314-423 1-114 (114) 48 cd04615 CBS_pair_2 The CBS dom 99.2 1.9E-09 4.8E-14 88.1 13.3 105 317-422 4-112 (113) 49 cd04629 CBS_pair_16 The CBS do 99.1 1.1E-09 2.9E-14 89.7 11.9 107 315-423 2-114 (114) 50 cd04626 CBS_pair_13 The CBS do 99.1 1.1E-09 2.9E-14 89.7 11.7 107 314-422 1-110 (111) 51 cd04634 CBS_pair_21 The CBS do 99.1 1.2E-09 3E-14 89.5 11.8 106 314-422 1-142 (143) 52 cd04801 CBS_pair_M50_like This 99.1 1.9E-09 4.8E-14 88.1 12.8 108 314-422 1-113 (114) 53 cd04636 CBS_pair_23 The CBS do 99.1 1.5E-09 3.9E-14 88.8 12.1 108 314-423 1-132 (132) 54 cd04641 CBS_pair_28 The CBS do 99.1 1.8E-09 4.6E-14 88.2 12.0 108 314-422 1-119 (120) 55 PRK01862 putative voltage-gate 99.1 1.5E-09 3.9E-14 88.8 11.5 120 301-423 453-578 (583) 56 cd04599 CBS_pair_GGDEF_assoc2 99.1 1.7E-09 4.4E-14 88.3 10.9 104 314-423 1-105 (105) 57 cd04593 CBS_pair_EriC_assoc_ba 99.1 2.5E-09 6.5E-14 87.1 11.6 107 316-423 3-115 (115) 58 cd04595 CBS_pair_DHH_polyA_Pol 99.1 1.6E-09 4E-14 88.6 10.3 106 314-422 2-109 (110) 59 cd04602 CBS_pair_IMPDH_2 This 99.1 1.9E-09 4.9E-14 88.0 10.7 107 314-422 2-113 (114) 60 cd04619 CBS_pair_6 The CBS dom 99.1 3.5E-09 8.9E-14 86.1 12.0 107 315-422 2-113 (114) 61 cd04582 CBS_pair_ABC_OpuCA_ass 99.1 4.5E-09 1.1E-13 85.3 12.3 105 314-422 1-105 (106) 62 cd04587 CBS_pair_CAP-ED_DUF294 99.1 2.5E-09 6.5E-14 87.1 11.0 108 314-423 1-113 (113) 63 cd04606 CBS_pair_Mg_transporte 99.1 3.2E-09 8.2E-14 86.4 11.2 102 319-424 2-109 (109) 64 cd04627 CBS_pair_14 The CBS do 99.1 4.4E-09 1.1E-13 85.4 12.0 105 316-420 3-120 (123) 65 cd04591 CBS_pair_EriC_assoc_eu 99.0 5.1E-09 1.3E-13 85.0 11.9 101 314-422 2-104 (105) 66 PRK07107 inositol-5-monophosph 99.0 5.1E-09 1.3E-13 84.9 11.6 170 271-457 62-243 (497) 67 COG2524 Predicted transcriptio 99.0 3.2E-09 8.2E-14 86.4 9.9 245 175-427 27-293 (294) 68 cd04800 CBS_pair_CAP-ED_DUF294 99.0 4.9E-09 1.3E-13 85.0 10.7 105 315-422 2-110 (111) 69 cd04589 CBS_pair_CAP-ED_DUF294 99.0 1.3E-08 3.3E-13 81.9 12.7 105 315-422 2-110 (111) 70 cd04607 CBS_pair_NTP_transfera 99.0 9.2E-09 2.4E-13 83.0 11.9 106 317-423 5-113 (113) 71 cd04603 CBS_pair_KefB_assoc Th 99.0 1.3E-08 3.3E-13 81.9 12.0 107 315-422 2-110 (111) 72 cd04620 CBS_pair_7 The CBS dom 99.0 1.5E-08 3.7E-13 81.6 11.9 107 315-422 2-114 (115) 73 cd04802 CBS_pair_3 The CBS dom 99.0 1E-08 2.6E-13 82.7 11.1 105 314-421 1-110 (112) 74 cd02205 CBS_pair The CBS domai 99.0 1E-08 2.7E-13 82.7 10.8 107 315-422 2-112 (113) 75 cd04614 CBS_pair_1 The CBS dom 98.9 1.3E-08 3.4E-13 81.9 10.4 95 314-422 1-95 (96) 76 COG2239 MgtE Mg/Co/Ni transpor 98.9 7.7E-08 2E-12 76.3 13.7 136 285-425 113-253 (451) 77 cd04617 CBS_pair_4 The CBS dom 98.8 6.8E-08 1.7E-12 76.7 11.2 104 317-421 4-116 (118) 78 PRK11543 gutQ D-arabinose 5-ph 98.8 2.6E-08 6.7E-13 79.7 8.4 209 213-423 87-318 (321) 79 cd04598 CBS_pair_GGDEF_assoc T 98.8 8.2E-08 2.1E-12 76.1 10.4 107 315-423 2-119 (119) 80 COG3620 Predicted transcriptio 98.8 5.3E-07 1.3E-11 70.2 13.4 119 302-427 63-186 (187) 81 cd04640 CBS_pair_27 The CBS do 98.8 1.7E-07 4.3E-12 73.8 10.9 104 317-421 4-124 (126) 82 PTZ00314 inosine-5'-monophosph 98.7 2.3E-07 5.8E-12 72.9 11.4 165 271-453 60-234 (499) 83 PRK07807 inositol-5-monophosph 98.7 1.4E-07 3.6E-12 74.4 9.6 162 270-453 54-223 (479) 84 PRK05567 inositol-5'-monophosp 98.6 4.6E-07 1.2E-11 70.6 10.5 163 272-453 53-224 (486) 85 cd04608 CBS_pair_PALP_assoc Th 98.6 7.7E-07 2E-11 69.0 10.4 110 314-424 2-124 (124) 86 PRK10892 D-arabinose 5-phospha 98.6 3.8E-07 9.7E-12 71.2 8.5 207 214-423 93-323 (326) 87 PRK10070 glycine betaine trans 98.5 4E-06 1E-10 63.8 12.4 136 291-429 261-397 (400) 88 pfam00478 IMPDH IMP dehydrogen 98.5 1.3E-06 3.4E-11 67.3 9.7 165 273-454 49-220 (467) 89 COG0517 FOG: CBS domain [Gener 98.4 5.2E-06 1.3E-10 62.9 11.4 110 308-421 3-117 (117) 90 TIGR01302 IMP_dehydrog inosine 98.3 2.1E-06 5.3E-11 65.8 6.9 185 303-502 151-394 (476) 91 KOG1764 consensus 98.1 2.2E-05 5.7E-10 58.3 8.4 117 315-432 238-366 (381) 92 PRK10995 multiple drug resista 98.1 0.0017 4.3E-08 44.6 21.1 190 2-210 1-214 (222) 93 COG2905 Predicted signal-trans 98.1 3.3E-05 8.4E-10 57.1 8.4 108 314-423 157-268 (610) 94 cd04629 CBS_pair_16 The CBS do 98.0 4.6E-05 1.2E-09 56.0 8.4 52 376-427 1-52 (114) 95 cd04587 CBS_pair_CAP-ED_DUF294 98.0 4.8E-05 1.2E-09 55.9 7.8 65 295-362 49-113 (113) 96 KOG0474 consensus 98.0 1.4E-05 3.7E-10 59.7 5.1 129 300-430 578-753 (762) 97 cd04593 CBS_pair_EriC_assoc_ba 97.9 0.00014 3.5E-09 52.6 9.7 82 377-461 2-84 (115) 98 cd04640 CBS_pair_27 The CBS do 97.9 3.4E-05 8.7E-10 57.0 6.0 45 377-421 2-46 (126) 99 cd04622 CBS_pair_9 The CBS dom 97.9 2.6E-05 6.6E-10 57.9 5.4 61 299-362 53-113 (113) 100 cd04602 CBS_pair_IMPDH_2 This 97.9 4.8E-05 1.2E-09 55.9 6.6 62 299-361 52-113 (114) 101 PRK05326 potassium/proton anti 97.9 0.00028 7.2E-09 50.2 10.5 71 442-517 491-561 (563) 102 cd04625 CBS_pair_12 The CBS do 97.9 4E-05 1E-09 56.4 6.1 60 299-362 53-112 (112) 103 cd04630 CBS_pair_17 The CBS do 97.8 5.1E-05 1.3E-09 55.7 6.3 60 299-362 55-114 (114) 104 cd04592 CBS_pair_EriC_assoc_eu 97.8 0.00023 5.9E-09 50.9 9.6 47 378-424 3-49 (133) 105 PRK11111 hypothetical protein; 97.8 0.0041 1.1E-07 41.7 15.9 175 21-214 18-210 (214) 106 TIGR01186 proV glycine betaine 97.8 0.00063 1.6E-08 47.7 11.7 112 314-427 253-364 (372) 107 cd04613 CBS_pair_SpoIVFB_EriC_ 97.8 0.00021 5.4E-09 51.1 8.7 84 376-461 1-84 (114) 108 cd04624 CBS_pair_11 The CBS do 97.8 0.00027 6.8E-09 50.4 9.1 83 376-461 1-83 (112) 109 cd04586 CBS_pair_BON_assoc Thi 97.7 4.7E-05 1.2E-09 55.9 4.9 61 297-361 74-134 (135) 110 cd04595 CBS_pair_DHH_polyA_Pol 97.7 0.00014 3.7E-09 52.4 7.3 57 301-361 53-109 (110) 111 cd04600 CBS_pair_HPP_assoc Thi 97.7 0.00028 7.1E-09 50.3 8.6 53 376-428 2-54 (124) 112 PRK01862 putative voltage-gate 97.7 0.0059 1.5E-07 40.6 15.3 90 370-461 457-547 (583) 113 cd04619 CBS_pair_6 The CBS dom 97.7 6.6E-05 1.7E-09 54.9 5.3 56 303-361 58-113 (114) 114 cd04617 CBS_pair_4 The CBS dom 97.7 0.0003 7.6E-09 50.1 8.4 50 377-426 2-51 (118) 115 cd04623 CBS_pair_10 The CBS do 97.7 0.00033 8.3E-09 49.8 8.5 56 302-361 57-112 (113) 116 PRK10739 putative dITP- and XT 97.7 0.0065 1.7E-07 40.3 16.6 168 23-212 14-194 (197) 117 cd04594 CBS_pair_EriC_assoc_ar 97.7 0.00015 3.8E-09 52.3 6.7 57 302-362 48-104 (104) 118 cd04615 CBS_pair_2 The CBS dom 97.7 0.00032 8.1E-09 49.9 8.3 50 377-426 2-51 (113) 119 KOG0475 consensus 97.7 0.0069 1.8E-07 40.1 19.9 240 172-423 368-693 (696) 120 cd04621 CBS_pair_8 The CBS dom 97.7 6.6E-05 1.7E-09 54.9 4.6 60 298-361 75-134 (135) 121 cd04626 CBS_pair_13 The CBS do 97.7 9E-05 2.3E-09 53.9 5.1 57 301-361 54-110 (111) 122 cd04620 CBS_pair_7 The CBS dom 97.6 0.00032 8.1E-09 49.9 7.9 52 376-427 1-52 (115) 123 cd04633 CBS_pair_20 The CBS do 97.6 0.00016 4.1E-09 52.0 6.3 57 301-361 64-120 (121) 124 cd04598 CBS_pair_GGDEF_assoc T 97.6 0.00026 6.7E-09 50.5 7.3 85 376-461 1-87 (119) 125 cd04800 CBS_pair_CAP-ED_DUF294 97.6 0.00038 9.7E-09 49.3 8.0 83 376-461 1-83 (111) 126 cd04604 CBS_pair_KpsF_GutQ_ass 97.6 0.00057 1.5E-08 48.0 8.9 83 377-461 3-85 (114) 127 cd04638 CBS_pair_25 The CBS do 97.6 0.00013 3.3E-09 52.7 5.6 57 302-362 50-106 (106) 128 PRK07107 inositol-5-monophosph 97.6 0.00013 3.4E-09 52.7 5.4 116 303-422 160-278 (497) 129 cd04801 CBS_pair_M50_like This 97.6 0.00057 1.5E-08 48.0 8.6 84 376-461 1-85 (114) 130 pfam00571 CBS CBS domain. CBS 97.6 0.00018 4.6E-09 51.7 6.0 50 376-425 7-56 (57) 131 cd04803 CBS_pair_15 The CBS do 97.6 0.00065 1.7E-08 47.6 8.7 55 376-430 1-55 (122) 132 cd04637 CBS_pair_24 The CBS do 97.6 0.00019 4.7E-09 51.6 5.8 56 303-361 66-121 (122) 133 cd04589 CBS_pair_CAP-ED_DUF294 97.6 0.00061 1.5E-08 47.8 8.3 51 376-427 1-51 (111) 134 cd04636 CBS_pair_23 The CBS do 97.6 0.00021 5.3E-09 51.2 5.9 52 376-427 1-52 (132) 135 cd04583 CBS_pair_ABC_OpuCA_ass 97.5 0.00062 1.6E-08 47.8 8.2 80 375-461 1-80 (109) 136 cd04627 CBS_pair_14 The CBS do 97.5 0.00084 2.1E-08 46.8 8.7 86 376-461 1-94 (123) 137 cd04632 CBS_pair_19 The CBS do 97.5 0.00015 3.7E-09 52.3 4.8 59 301-361 68-127 (128) 138 cd04607 CBS_pair_NTP_transfera 97.5 0.0008 2E-08 46.9 8.3 82 377-461 3-84 (113) 139 cd04585 CBS_pair_ACT_assoc2 Th 97.5 0.00018 4.6E-09 51.7 4.9 57 302-361 65-121 (122) 140 cd04592 CBS_pair_EriC_assoc_eu 97.5 0.0011 2.7E-08 46.0 8.8 97 314-411 1-121 (133) 141 cd04610 CBS_pair_ParBc_assoc T 97.5 0.00016 4.2E-09 52.0 4.6 56 303-361 51-106 (107) 142 cd04584 CBS_pair_ACT_assoc Thi 97.5 0.00076 1.9E-08 47.1 8.0 49 376-424 1-49 (121) 143 cd04802 CBS_pair_3 The CBS dom 97.5 0.00016 4E-09 52.1 4.4 58 300-361 54-111 (112) 144 cd04601 CBS_pair_IMPDH This cd 97.5 0.0012 3E-08 45.7 8.9 46 375-420 1-46 (110) 145 cd04588 CBS_pair_CAP-ED_DUF294 97.5 0.00093 2.4E-08 46.5 8.2 81 376-461 1-81 (110) 146 cd04631 CBS_pair_18 The CBS do 97.4 0.0012 3E-08 45.7 8.6 52 376-427 1-53 (125) 147 cd04634 CBS_pair_21 The CBS do 97.4 0.00029 7.3E-09 50.2 5.3 57 300-360 85-141 (143) 148 cd04639 CBS_pair_26 The CBS do 97.4 0.00035 8.9E-09 49.6 5.7 49 378-426 3-51 (111) 149 pfam01595 DUF21 Domain of unkn 97.4 0.014 3.6E-07 37.9 14.9 76 209-291 102-182 (182) 150 cd04611 CBS_pair_PAS_GGDEF_DUF 97.4 0.00023 6E-09 50.8 4.8 56 303-361 55-110 (111) 151 cd04596 CBS_pair_DRTGG_assoc T 97.4 0.00027 6.8E-09 50.4 5.0 47 376-422 2-48 (108) 152 cd04608 CBS_pair_PALP_assoc Th 97.4 0.0007 1.8E-08 47.4 7.1 84 375-460 1-84 (124) 153 cd04635 CBS_pair_22 The CBS do 97.4 0.00027 6.9E-09 50.4 4.9 48 376-423 1-48 (122) 154 cd04609 CBS_pair_PALP_assoc2 T 97.4 0.0024 6E-08 43.5 9.5 83 377-462 2-84 (110) 155 cd04612 CBS_pair_SpoIVFB_EriC_ 97.4 0.0011 2.8E-08 45.9 7.8 46 378-424 3-48 (111) 156 cd04643 CBS_pair_30 The CBS do 97.4 0.0014 3.5E-08 45.3 8.2 51 377-427 2-52 (116) 157 pfam00571 CBS CBS domain. CBS 97.4 0.00053 1.4E-08 48.2 6.1 55 306-363 1-55 (57) 158 cd04641 CBS_pair_28 The CBS do 97.4 0.0014 3.6E-08 45.1 8.3 56 376-431 1-56 (120) 159 cd04605 CBS_pair_MET2_assoc Th 97.3 0.00062 1.6E-08 47.7 6.1 51 376-426 2-52 (110) 160 smart00116 CBS Domain in cysta 97.3 0.00075 1.9E-08 47.2 6.3 47 378-424 2-48 (49) 161 cd04582 CBS_pair_ABC_OpuCA_ass 97.3 0.0012 3.1E-08 45.7 7.2 53 376-428 1-53 (106) 162 cd04618 CBS_pair_5 The CBS dom 97.3 0.0036 9.2E-08 42.2 9.6 95 314-421 1-96 (98) 163 cd04599 CBS_pair_GGDEF_assoc2 97.3 0.0004 1E-08 49.2 4.6 57 301-361 48-104 (105) 164 TIGR01137 cysta_beta cystathio 97.3 0.0055 1.4E-07 40.9 10.4 121 300-424 362-526 (527) 165 TIGR00393 kpsF sugar isomerase 97.2 0.00037 9.4E-09 49.4 4.1 166 249-415 88-271 (272) 166 cd04606 CBS_pair_Mg_transporte 97.2 0.00076 1.9E-08 47.1 5.5 59 301-362 50-108 (109) 167 cd04597 CBS_pair_DRTGG_assoc2 97.2 0.00064 1.6E-08 47.7 5.0 49 374-422 64-112 (113) 168 cd02205 CBS_pair The CBS domai 97.2 0.0054 1.4E-07 40.9 9.7 84 376-461 1-84 (113) 169 PRK10892 D-arabinose 5-phospha 97.2 0.00069 1.8E-08 47.4 4.9 74 285-362 249-323 (326) 170 PRK07807 inositol-5-monophosph 97.2 0.0014 3.6E-08 45.1 6.3 62 302-366 146-207 (479) 171 KOG2550 consensus 97.1 0.0022 5.5E-08 43.8 6.7 150 289-456 98-250 (503) 172 PRK11543 gutQ D-arabinose 5-ph 97.1 0.00089 2.3E-08 46.6 4.6 75 284-362 243-318 (321) 173 smart00116 CBS Domain in cysta 97.1 0.002 5.1E-08 44.0 6.4 48 315-363 1-48 (49) 174 cd04597 CBS_pair_DRTGG_assoc2 97.0 0.001 2.6E-08 46.2 4.6 57 302-361 56-112 (113) 175 COG2905 Predicted signal-trans 97.0 0.0061 1.6E-07 40.5 8.2 113 248-366 138-272 (610) 176 cd04642 CBS_pair_29 The CBS do 96.9 0.0061 1.6E-07 40.5 8.0 48 378-425 3-50 (126) 177 cd04603 CBS_pair_KefB_assoc Th 96.9 0.0027 6.9E-08 43.1 6.1 47 377-423 2-48 (111) 178 COG4175 ProV ABC-type proline/ 96.9 0.002 5.1E-08 44.1 5.2 127 289-427 259-385 (386) 179 COG2095 MarC Multiple antibiot 96.9 0.047 1.2E-06 34.0 18.1 174 19-212 13-199 (203) 180 COG3263 NhaP-type Na+/H+ and K 96.9 0.0043 1.1E-07 41.6 6.7 75 444-519 495-569 (574) 181 pfam01914 MarC MarC family int 96.8 0.058 1.5E-06 33.4 21.6 177 12-210 5-200 (203) 182 PRK05567 inositol-5'-monophosp 96.7 0.0088 2.2E-07 39.3 7.4 62 302-365 145-206 (486) 183 PTZ00314 inosine-5'-monophosph 96.7 0.0076 1.9E-07 39.8 7.0 62 301-365 155-216 (499) 184 cd04614 CBS_pair_1 The CBS dom 96.7 0.003 7.6E-08 42.8 4.7 45 376-420 1-45 (96) 185 KOG1764 consensus 96.7 0.011 2.8E-07 38.6 7.6 136 292-427 138-288 (381) 186 TIGR03415 ABC_choXWV_ATP choli 96.7 0.0034 8.6E-08 42.4 4.9 121 292-427 262-382 (382) 187 TIGR00400 mgtE magnesium trans 96.5 0.016 4E-07 37.5 7.2 155 285-444 116-276 (460) 188 PRK11469 hypothetical protein; 96.2 0.12 2.9E-06 31.2 21.2 179 12-210 18-205 (206) 189 TIGR00427 TIGR00427 conserved 96.2 0.13 3.2E-06 30.9 15.1 169 24-209 21-206 (206) 190 PRK10229 threonine efflux syst 96.0 0.15 3.7E-06 30.4 19.1 92 12-115 5-97 (206) 191 pfam03741 TerC Integral membra 95.9 0.16 4.1E-06 30.1 10.8 76 135-210 2-83 (184) 192 PRK10070 glycine betaine trans 95.7 0.053 1.4E-06 33.6 7.0 126 322-461 222-363 (400) 193 COG0517 FOG: CBS domain [Gener 95.7 0.035 9E-07 34.9 6.0 53 305-359 63-116 (117) 194 cd04591 CBS_pair_EriC_assoc_eu 95.6 0.039 9.9E-07 34.6 6.0 50 309-362 56-105 (105) 195 pfam00478 IMPDH IMP dehydrogen 95.6 0.094 2.4E-06 31.8 7.9 61 304-365 141-201 (467) 196 cd04590 CBS_pair_CorC_HlyC_ass 95.4 0.067 1.7E-06 32.9 6.6 81 377-462 2-83 (111) 197 COG2524 Predicted transcriptio 95.4 0.087 2.2E-06 32.1 7.0 124 331-463 134-262 (294) 198 TIGR01303 IMP_DH_rel_1 IMP deh 95.3 0.083 2.1E-06 32.2 6.9 105 316-424 98-203 (476) 199 COG1971 Predicted membrane pro 95.2 0.27 6.9E-06 28.5 22.2 179 14-211 2-190 (190) 200 COG3448 CBS-domain-containing 95.1 0.016 4.2E-07 37.4 2.5 62 300-364 311-372 (382) 201 COG0861 TerC Membrane protein 94.9 0.34 8.6E-06 27.8 11.8 124 92-233 82-215 (254) 202 COG4109 Predicted transcriptio 94.3 0.082 2.1E-06 32.2 4.7 110 376-494 198-319 (432) 203 COG2239 MgtE Mg/Co/Ni transpor 93.7 0.12 2.9E-06 31.2 4.4 60 303-365 195-254 (451) 204 PRK10520 rhtB homoserine/homos 92.5 0.85 2.2E-05 24.8 20.6 94 8-113 2-96 (205) 205 KOG2550 consensus 92.3 0.17 4.4E-06 29.9 3.7 62 300-364 167-228 (503) 206 PRK10062 hypothetical protein; 91.5 1.1 2.8E-05 24.0 15.2 119 91-209 89-245 (303) 207 pfam05661 DUF808 Protein of un 90.9 1.2 3.2E-05 23.6 15.1 120 91-210 89-245 (293) 208 pfam01810 LysE LysE type trans 90.7 1.3 3.3E-05 23.5 17.1 79 24-116 6-86 (191) 209 PRK10958 leucine export protei 88.6 1.8 4.7E-05 22.4 20.4 88 10-111 6-95 (213) 210 TIGR00949 2A76 Homoserine/Thre 86.5 2.4 6.1E-05 21.6 11.0 20 90-109 55-74 (194) 211 TIGR00393 kpsF sugar isomerase 86.4 0.3 7.6E-06 28.2 1.1 51 377-427 166-218 (272) 212 pfam03596 Cad Cadmium resistan 84.4 0.62 1.6E-05 25.8 1.9 32 25-56 4-35 (192) 213 COG3090 DctM TRAP-type C4-dica 82.3 3.6 9.1E-05 20.3 10.2 22 211-232 125-146 (177) 214 COG1125 OpuBA ABC-type proline 81.0 3.2 8.3E-05 20.6 4.6 43 381-423 266-308 (309) 215 COG1280 RhtB Putative threonin 80.7 4 0.0001 19.9 21.0 92 10-116 4-99 (208) 216 TIGR00935 2a45 arsenical pump 79.4 4.4 0.00011 19.6 12.2 102 83-195 315-425 (507) 217 pfam04459 DUF512 Protein of un 78.2 4.8 0.00012 19.4 6.6 111 347-465 72-197 (205) 218 KOG0476 consensus 78.0 4.6 0.00012 19.5 4.5 60 301-362 585-645 (931) 219 PRK10323 neutral amino-acid ef 76.8 5.2 0.00013 19.1 22.2 113 8-136 2-115 (195) 220 PRK09877 2,3-diketo-L-gulonate 75.7 5.5 0.00014 18.9 8.7 18 210-227 113-130 (157) 221 KOG1077 consensus 75.0 5.3 0.00013 19.0 4.2 31 300-330 649-684 (938) 222 COG2354 Uncharacterized protei 75.0 5.7 0.00015 18.8 5.9 62 92-157 90-160 (303) 223 TIGR03279 cyano_FeS_chp putati 74.6 4.5 0.00011 19.5 3.7 66 399-466 349-419 (433) 224 PRK06769 hypothetical protein; 73.2 5.9 0.00015 18.7 4.1 44 384-427 30-73 (175) 225 cd04374 RhoGAP_Graf RhoGAP_Gra 73.0 4.7 0.00012 19.4 3.5 13 474-486 88-100 (203) 226 COG2216 KdpB High-affinity K+ 70.3 7.3 0.00019 18.0 4.5 82 324-407 414-518 (681) 227 cd04396 RhoGAP_fSAC7_BAG7 RhoG 66.8 8.1 0.00021 17.7 3.6 17 473-489 88-104 (225) 228 cd04385 RhoGAP_ARAP RhoGAP_ARA 66.3 8.7 0.00022 17.4 3.8 11 225-235 128-138 (184) 229 COG1968 BacA Undecaprenyl pyro 66.3 8.7 0.00022 17.4 18.9 39 152-190 213-251 (270) 230 TIGR01497 kdpB K+-transporting 61.2 8.7 0.00022 17.5 2.9 43 322-366 411-453 (675) 231 PRK09304 arginine exporter pro 60.1 11 0.00028 16.7 19.2 120 90-210 56-190 (197) 232 PRK10054 putative MFS-type tra 59.5 11 0.00029 16.6 26.7 43 9-51 94-136 (395) 233 KOG4269 consensus 59.3 11 0.00029 16.6 3.3 52 441-492 937-997 (1112) 234 COG2015 Alkyl sulfatase and re 58.7 6.5 0.00016 18.4 1.9 166 326-504 97-278 (655) 235 cd04377 RhoGAP_myosin_IX RhoGA 58.4 12 0.0003 16.5 3.9 10 384-393 163-172 (186) 236 pfam09269 DUF1967 Domain of un 58.2 12 0.0003 16.4 4.9 56 438-501 2-67 (68) 237 KOG2927 consensus 57.8 12 0.00029 16.6 3.1 14 280-293 313-326 (372) 238 pfam11298 DUF3099 Protein of u 57.7 12 0.00031 16.4 4.3 53 41-108 10-62 (73) 239 pfam03904 DUF334 Domain of unk 57.7 12 0.00031 16.4 6.2 16 121-136 188-203 (229) 240 cd04381 RhoGap_RalBP1 RhoGap_R 56.8 13 0.00032 16.3 3.6 13 474-486 73-85 (182) 241 cd04402 RhoGAP_ARHGAP20 RhoGAP 54.7 14 0.00034 16.0 3.7 13 225-237 125-137 (192) 242 PRK04149 sat sulfate adenylylt 53.3 6.7 0.00017 18.3 1.3 23 462-485 360-382 (390) 243 TIGR00849 gutA PTS system, glu 53.0 14 0.00036 15.9 3.3 63 440-506 10-77 (129) 244 COG3290 CitA Signal transducti 52.0 15 0.00038 15.8 4.9 44 13-56 14-57 (537) 245 COG0809 QueA S-adenosylmethion 50.1 16 0.0004 15.5 3.2 175 296-505 105-294 (348) 246 PRK08053 sulfur carrier protei 49.8 16 0.00041 15.5 3.2 32 376-407 7-38 (66) 247 pfam11683 DUF3278 Protein of u 49.6 16 0.00041 15.5 10.5 78 36-113 17-95 (129) 248 COG2119 Predicted membrane pro 49.2 16 0.00042 15.4 16.6 174 13-209 6-186 (190) 249 TIGR03595 Obg_CgtA_exten Obg f 48.9 16 0.00042 15.4 3.9 61 437-501 2-68 (69) 250 TIGR01476 chlor_syn_BchG bacte 47.2 17 0.00044 15.2 2.8 19 216-234 242-261 (295) 251 cd04393 RhoGAP_FAM13A1a RhoGAP 47.2 17 0.00044 15.2 4.0 10 226-235 133-142 (189) 252 COG5128 Transport protein part 46.8 12 0.0003 16.5 1.7 86 176-267 30-120 (208) 253 PRK08942 D,D-heptose 1,7-bisph 46.7 18 0.00045 15.2 3.8 38 384-421 31-68 (181) 254 TIGR02840 spore_YtaF putative 46.6 18 0.00045 15.2 15.6 174 17-202 2-212 (215) 255 cd00159 RhoGAP RhoGAP: GTPase- 46.5 18 0.00045 15.2 3.5 22 212-236 101-122 (169) 256 pfam11700 ATG22 Vacuole efflux 46.4 18 0.00046 15.2 23.0 100 132-231 290-397 (477) 257 TIGR02475 CobW cobalamin biosy 46.3 9.9 0.00025 17.0 1.3 49 395-446 27-75 (349) 258 cd04376 RhoGAP_ARHGAP6 RhoGAP_ 46.1 18 0.00046 15.1 3.4 13 225-237 119-131 (206) 259 TIGR01084 mutY A/G-specific ad 43.4 12 0.0003 16.5 1.3 141 178-347 60-236 (297) 260 COG4708 Predicted membrane pro 43.4 18 0.00045 15.2 2.2 17 211-227 104-120 (169) 261 cd04400 RhoGAP_fBEM3 RhoGAP_fB 42.6 20 0.00052 14.7 3.7 17 472-488 78-94 (190) 262 pfam01927 DUF82 Protein of unk 41.8 21 0.00053 14.7 3.8 78 255-347 14-100 (146) 263 COG1269 NtpI Archaeal/vacuolar 41.7 21 0.00053 14.7 11.6 35 313-347 191-225 (660) 264 PRK05659 sulfur carrier protei 40.7 22 0.00055 14.6 3.0 31 377-407 8-38 (66) 265 cd04395 RhoGAP_ARHGAP21 RhoGAP 40.5 22 0.00055 14.5 4.0 11 225-235 132-142 (196) 266 cd04387 RhoGAP_Bcr RhoGAP_Bcr: 40.0 22 0.00056 14.5 3.6 11 225-235 129-139 (196) 267 TIGR02829 spore_III_AE stage I 39.8 22 0.00057 14.5 5.8 20 185-204 328-347 (392) 268 PRK13762 tRNA-modifying enzyme 38.1 13 0.00032 16.3 0.7 121 272-399 92-221 (321) 269 pfam04224 DUF417 Protein of un 37.4 24 0.00062 14.2 5.8 24 209-232 127-150 (153) 270 smart00324 RhoGAP GTPase-activ 37.4 24 0.00062 14.2 3.6 21 213-236 106-126 (174) 271 pfam10083 DUF2321 Uncharacteri 37.2 21 0.00054 14.6 1.8 44 271-319 82-125 (158) 272 PRK12297 obgE GTPase ObgE; Rev 36.7 25 0.00063 14.1 4.7 59 437-503 360-428 (429) 273 TIGR01946 rnfD electron transp 36.6 25 0.00063 14.1 2.2 15 47-61 70-84 (346) 274 PRK10429 melibiose:sodium symp 36.5 25 0.00064 14.1 22.8 37 15-51 110-147 (473) 275 PRK00468 hypothetical protein; 36.1 25 0.00064 14.1 2.0 48 417-464 1-48 (75) 276 PRK06083 sulfur carrier protei 35.8 26 0.00065 14.0 3.2 31 377-407 11-41 (69) 277 PRK05863 sulfur carrier protei 35.3 26 0.00066 14.0 3.0 31 377-407 8-38 (65) 278 pfam10797 YhfT Protein of unkn 35.1 26 0.00067 13.9 8.1 54 157-210 310-373 (421) 279 TIGR02398 gluc_glyc_Psyn gluco 34.2 18 0.00045 15.2 1.0 58 299-362 252-311 (495) 280 COG1279 Lysine efflux permease 33.9 27 0.0007 13.8 19.7 164 13-199 7-199 (202) 281 TIGR00892 2A0113 monocarboxyla 33.7 27 0.0007 13.8 3.1 28 74-101 165-192 (475) 282 TIGR02965 xanthine_xdhB xanthi 33.6 12 0.00031 16.4 0.0 51 375-429 314-364 (761) 283 COG1656 Uncharacterized conser 32.3 29 0.00074 13.6 3.8 82 251-347 17-107 (165) 284 PRK06771 hypothetical protein; 32.1 29 0.00074 13.6 6.1 60 219-283 9-68 (93) 285 TIGR01687 moaD_arch MoaD famil 31.6 23 0.00058 14.4 1.2 52 414-466 25-79 (93) 286 TIGR01969 minD_arch cell divis 31.2 19 0.00049 14.9 0.8 139 309-461 72-231 (258) 287 pfam04123 DUF373 Domain of unk 30.9 30 0.00077 13.5 13.8 34 171-204 275-310 (344) 288 COG4023 SBH1 Preprotein transl 29.3 32 0.00082 13.3 2.5 21 214-234 34-54 (57) 289 pfam01747 ATP-sulfurylase ATP- 29.3 23 0.00059 14.3 0.9 24 320-343 99-122 (310) 290 TIGR02862 spore_BofA pro-sigma 29.3 32 0.00082 13.3 7.8 60 159-221 4-63 (83) 291 COG2211 MelB Na+/melibiose sym 29.2 32 0.00082 13.3 20.0 171 10-192 111-308 (467) 292 cd03709 lepA_C lepA_C: This fa 28.7 33 0.00084 13.2 2.2 50 413-462 7-57 (80) 293 pfam00620 RhoGAP RhoGAP domain 28.4 33 0.00085 13.2 3.5 12 224-235 111-122 (152) 294 TIGR00846 caca2 calcium/proton 28.1 34 0.00086 13.1 6.8 76 92-167 204-295 (393) 295 cd00517 ATPS ATP-sulfurylase ( 28.1 23 0.00059 14.3 0.7 20 322-341 141-160 (356) 296 TIGR00988 hip integration host 27.7 33 0.00084 13.2 1.5 15 483-497 60-74 (94) 297 KOG3880 consensus 27.1 35 0.00089 13.0 4.4 31 218-249 160-190 (409) 298 pfam09139 Mmp37 Mitochondrial 26.9 35 0.0009 13.0 1.6 70 352-426 105-176 (324) 299 cd00565 ThiS ThiS (ThiaminS) i 26.4 36 0.00092 12.9 3.0 31 377-407 7-37 (65) 300 PHA00728 hypothetical protein 26.4 17 0.00044 15.3 -0.2 55 288-346 35-89 (153) 301 cd07178 terB_like_YebE telluri 26.3 36 0.00092 12.9 3.4 27 274-300 3-29 (95) 302 COG0523 Putative GTPases (G3E 26.3 36 0.00092 12.9 4.6 108 289-411 69-187 (323) 303 PRK05446 imidazole glycerol-ph 26.2 36 0.00092 12.9 3.5 36 324-360 34-69 (354) 304 COG2215 ABC-type uncharacteriz 25.3 38 0.00096 12.8 17.4 187 6-207 48-299 (303) 305 CHL00124 acpP acyl carrier pro 25.2 38 0.00096 12.8 1.8 70 416-487 2-80 (84) 306 PRK11195 major facilitator fam 25.2 38 0.00096 12.8 4.9 25 207-231 369-393 (415) 307 cd04394 RhoGAP-ARHGAP11A RhoGA 24.5 39 0.00099 12.7 3.7 11 225-235 128-138 (202) 308 cd01709 RT_like_1 RT_like_1: A 24.3 23 0.00059 14.3 0.2 208 3-226 45-303 (346) 309 TIGR01246 dapE_proteo succinyl 23.5 40 0.001 12.6 2.2 66 414-485 208-281 (383) 310 COG2979 Uncharacterized protei 23.5 41 0.001 12.6 3.3 31 273-303 113-143 (225) 311 pfam02492 cobW CobW/HypB/UreG, 23.4 40 0.001 12.6 1.2 10 400-409 29-38 (174) 312 PRK01658 holin-like protein; V 23.2 41 0.001 12.5 5.1 45 144-188 67-118 (122) 313 PRK12449 acyl carrier protein; 23.0 41 0.0011 12.5 1.9 31 455-485 47-78 (80) 314 PRK00982 acpP acyl carrier pro 22.8 42 0.0011 12.5 1.4 31 453-483 43-74 (78) 315 cd03112 CobW_like The function 22.7 38 0.00097 12.7 1.0 47 396-455 24-70 (158) 316 PRK06193 hypothetical protein; 22.7 42 0.0011 12.4 3.1 55 405-459 156-210 (211) 317 PRK01821 hypothetical protein; 22.5 42 0.0011 12.4 5.3 46 144-189 72-124 (133) 318 pfam01575 MaoC_dehydratas MaoC 22.1 43 0.0011 12.4 3.8 98 413-516 16-116 (123) 319 PRK05350 acyl carrier protein; 21.8 37 0.00093 12.9 0.8 66 415-482 2-76 (82) 320 PRK05883 acyl carrier protein; 21.2 45 0.0011 12.2 2.3 36 453-488 48-84 (85) 321 COG1194 MutY A/G-specific DNA 21.1 45 0.0011 12.2 2.1 152 178-355 55-242 (342) 322 PRK11360 sensory histidine kin 21.1 45 0.0011 12.2 8.1 81 142-238 174-256 (607) 323 TIGR01173 glmU UDP-N-acetylglu 21.0 45 0.0012 12.2 3.6 86 389-493 118-218 (461) 324 PRK05122 major facilitator sup 20.9 45 0.0012 12.2 25.2 63 174-238 266-332 (395) 325 TIGR03046 PS_II_psbV2 photosys 20.6 46 0.0012 12.2 2.9 101 385-490 21-151 (155) 326 TIGR00711 efflux_EmrB drug res 20.4 46 0.0012 12.1 5.8 268 5-303 148-449 (501) 327 PRK01064 hypothetical protein; 20.1 47 0.0012 12.1 1.4 47 417-463 1-47 (78) 328 COG3877 Uncharacterized protei 20.0 47 0.0012 12.1 6.6 65 227-294 49-118 (122) No 1 >COG1253 TlyC Hemolysins and related proteins containing CBS domains [General function prediction only] Probab=100.00 E-value=0 Score=541.12 Aligned_cols=407 Identities=30% Similarity=0.447 Sum_probs=334.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHH Q ss_conf 48999999999999865555302576421111222332099999999999999857789999985414699999999999 Q gi|255764470|r 90 DIVLILGGFFLLFKGTIELHERLEGDGFDKKHKFFSPVSWQVIVLQIVILDLVFSLDSVVTAIGMVQHFSVMAIAVAVSA 169 (523) Q Consensus 90 ~lill~GG~fLl~k~~~el~~~~e~~~~~~~~~~~~~~~~~~~v~~I~v~D~vFSlDsVitavg~~~~~~i~~~a~~isi 169 (523) .+|.+.|.....-.|...++ +...+...+. ..+..+.+..++.-.|..+|.+.+...++..-.-++...++.. T Consensus 12 ~li~l~~ff~a~E~A~~s~~-~~rl~~~~~~----g~~~a~~~~~~~~~~~~~ls~~qigitl~~i~~g~~~~~~~~~-- 84 (429) T COG1253 12 LLIALSAFFSAAEFALVSLR-RSRLEQLAEE----GNKRAKAALKLIERLNRYLSTVQLGITLVSLLLGAVGEPALAA-- 84 (429) T ss_pred HHHHHHHHHHHHHHHHHHCC-HHHHHHHHHC----CCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-- T ss_conf 99999999999999998604-9999998763----3752798999984817788899999999999999999999999-- Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHH Q ss_conf 9999998999999984852899999999999999999774573370278899899999999989999987554301-037 Q gi|255764470|r 170 LMMMAVSQPMIRYISRHTTVVILCLGFLLMIGFLLIIEGLHFDIPKGYLYASIGFSGIIEFFNQVARRNREQLMSP-SRL 248 (523) Q Consensus 170 ~~m~~~s~~i~~~i~~~p~~~~l~l~fl~~iG~~L~~eg~~~~ipk~y~y~~~~fs~~ve~~n~~~~~~~~~~~~~-~~~ 248 (523) ++..+.....+. + .+| +.++...+..+-+..|.|.|+.++..+..++....++..+...+ ..+ T Consensus 85 --~l~~~~~~~~~~---~---~~~--------~~~~t~l~~i~gEl~PK~~a~~~~e~va~~~a~~~~~~~~l~~P~i~~ 148 (429) T COG1253 85 --LLEPLLEALGLS---A---ALS--------FAIITFLHVVFGELVPKSIAIRNPEKVALLIAPPLRFFYRLLYPLIWL 148 (429) T ss_pred --HHHHHHHHCCHH---H---HHH--------HHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf --988765101226---7---889--------999998888620151799998677999998889999999998589999 Q ss_pred HHHHHHHHHHHHCCCCCCCC---CCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCH Q ss_conf 77667656744156888766---798889999996774667898999999975113786674653400028996089998 Q gi|255764470|r 249 RARTADAVLRLLGGKPIQPQ---GLNVKADVLLPTQHEKHIISDQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVD 325 (523) Q Consensus 249 ~eRtA~aVLRLlG~~~~~~~---~~~ee~~~li~~~~~~g~l~~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~ 325 (523) +.++|++++|++|+++.+++ .+.++.+ +++++.++|.++++|++|++|+|+|++++|+++||||++++|+|.+++. T Consensus 149 ~~~~a~~il~l~~~~~~~~~~~~~~~~~~~-~~~~~~~~g~~~~~E~~mi~~v~~l~~~~v~eiMtPR~~i~~l~~~~~~ 227 (429) T COG1253 149 LNRIANAILRLFGVEPVEEEALTSTEEELE-LVSESAEEGVLEEEEREMINNVLDLDDRTVREIMTPRTDIVALDLTDTV 227 (429) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCHHHHH-HHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCEEEEECCCEEEECCCCCH T ss_conf 999999999861888666432331167888-8640012686288999999965021375626176031557994189999 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCC-CHHHHHHCCCEEECCCCCHHHHHHHHHHCCCEEEEE Q ss_conf 9999999857997888876785527888888887620202466-016887327435238999899999885038627775 Q gi|255764470|r 326 EDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGS-MNFKRSIRKPLVVHENISVLKLMERLRKSSQTFVMV 404 (523) Q Consensus 326 ~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~-~~~~~lir~~~~Vpe~~~l~~lL~~fr~~~~~lAiV 404 (523) +++.+.+.+++|||+|||+++.||++|++|+||++.....++. ......++||++|||++++.++|++||+++.|||+| T Consensus 228 ~~~~~~~~~~~~SR~PV~~~~~D~iiGiv~~Kdll~~~~~~~~~~~~~~~~~~~~~Vpet~~~~~lL~~~r~~~~hmAiV 307 (429) T COG1253 228 EELIELILESGHSRIPVYDGDLDNIIGIVHVKDLLRALLDGQSDLDLRVLVRPPLFVPETLSLSDLLEEFREERTHMAIV 307 (429) T ss_pred HHHHHHHHHCCCCEEEEEECCCCCEEEEEEHHHHHHHHHCCCCCCCCCCCCCCCEEECCCCCHHHHHHHHHHHCCEEEEE T ss_conf 99999997589865878857998378886899999764315644442002578717789897999999988638749999 Q ss_pred ECCCCCEEEEEEHHHHHHHHCCCCCCCCCC--CCCEEEECCCEEEEEECCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHH Q ss_conf 228674256525999999870664457888--631288349859987001199999982889999877824078999988 Q gi|255764470|r 405 LDEYGVLEGMITPANILEAIAGDFPDEDDQ--KLDITVGDDGSLTVDGWIDVRYASKLFGVNLVDEDDRYSTLAGFILWR 482 (523) Q Consensus 405 vDE~G~~~GIVTleDIlE~IvGei~DE~d~--~~~i~~~~d~~~~v~G~~~l~el~~~l~~~l~~e~~~~~TlaGlil~~ 482 (523) +||||+++|+||+||++|+|+||++||+|+ ++.+.+.++++|+++|.++++|+++.+|.++ +++++|+|+|||++++ T Consensus 308 vDEyG~~~GlVTleDIiEeIvGei~de~d~~~~~~~~~~~~~~~~v~G~~~l~e~~~~l~~~~-~~~~~~~Ti~G~v~~~ 386 (429) T COG1253 308 VDEYGGVEGLVTLEDIIEEIVGEIPDEHDEDEEEDIIQRDDDGWLVDGRVPLEELEELLGIDL-DEEEDYDTIAGLVLSL 386 (429) T ss_pred EECCCCEEEECCHHHHHHHHHCCCCCCCCCCCCCCCEEECCCCEEEECCCCHHHHHHHHCCCC-CCCCCCCCHHHHHHHH T ss_conf 937898433265899999984877543455456422686389679957315999998729887-5467864499999998 Q ss_pred HCCCCCCCCEEEECCEEEEEEEEECCEEEEEEEEECCCC Q ss_conf 312488998899999999999961887789999945888 Q gi|255764470|r 483 LGHLPQEKEIFTEMNLKFEIIRLEGHNIDRVKVSGLQNL 521 (523) Q Consensus 483 l~~iP~~Ge~i~~~g~~f~V~~~~~~rI~~V~V~~l~~~ 521 (523) +||+|++||++.+++|+|+|++++++||++|+|+|.++. T Consensus 387 lg~iP~~g~~~~~~~~~~~V~~~~~~rI~~v~i~~~~~~ 425 (429) T COG1253 387 LGRIPKVGESVVIGGLRFEVLDMDGRRIDKVLIVRLKDK 425 (429) T ss_pred CCCCCCCCCEEEECCCEEEEEECCCCEEEEEEEEECCCC T ss_conf 199898888799748608999844765899999976888 No 2 >PRK11573 hypothetical protein; Provisional Probab=100.00 E-value=0 Score=507.87 Aligned_cols=303 Identities=24% Similarity=0.350 Sum_probs=260.4 Q ss_pred HCCCCCCHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC---CCCCHHHHHHHHHH Q ss_conf 4573370278-----899899999999989999987554301037776676567441568887---66798889999996 Q gi|255764470|r 209 LHFDIPKGYL-----YASIGFSGIIEFFNQVARRNREQLMSPSRLRARTADAVLRLLGGKPIQ---PQGLNVKADVLLPT 280 (523) Q Consensus 209 ~~~~ipk~y~-----y~~~~fs~~ve~~n~~~~~~~~~~~~~~~~~eRtA~aVLRLlG~~~~~---~~~~~ee~~~li~~ 280 (523) ||-=+||.+- -+|..++..+-.+..+ ..+...+....++.++|++|.++.. ...+++|+..++++ T Consensus 93 fgEiiPK~la~~~~~~~a~~~a~~l~~~~~~-------~~Plv~~~~~~s~~l~rl~g~~~~~~~~~~~t~eEl~~lv~~ 165 (413) T PRK11573 93 FAEVLPKTIAALYPEKVAYPSSFLLAPLQIL-------MMPLVWLLNTITRLLMRLMGIKTDIVVSGSLSKEELRTIVHE 165 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 9978489999978999999999999999999-------988999999999977753266777544553689999976641 Q ss_pred HHHCCCCCHHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHH Q ss_conf 77466789899999997511378667465340002899608999899999998579978888767855278888888876 Q gi|255764470|r 281 QHEKHIISDQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLL 360 (523) Q Consensus 281 ~~~~g~l~~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl 360 (523) + +|.++++|++|++|+|+|++++|+|+||||++++++|.+++++++.+.+.+++|||+|||+++.||++|++|+||++ T Consensus 166 ~--~~~l~~~e~~mi~~il~l~~~~v~eiMtPR~~i~~l~~~~~~~e~~~~~~~~~~SR~PVy~~~~D~iiGiv~~kdl~ 243 (413) T PRK11573 166 S--RSQISRRNQDMLLSVLDLEKVTVDDIMVPRNEIIGIDINDDWKSILRQLTHSPHGRIVLYRDSLDDAISMLRVREAW 243 (413) T ss_pred C--CCCCCHHHHHHHHHHHCCCCCEEHEEEEECCCEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEEHHHH T ss_conf 0--26678799999998725189890118620022689747899999999866338754258547888659999825778 Q ss_pred HHHHCCCCCHHH---HHHCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCC--C Q ss_conf 202024660168---87327435238999899999885038627775228674256525999999870664457888--6 Q gi|255764470|r 361 RDLLEEGSMNFK---RSIRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQ--K 435 (523) Q Consensus 361 ~~~~~~~~~~~~---~lir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~--~ 435 (523) ....++++.+.. +..|+|+||||++++.++|++||+++.|||+|+||||++.|+||+||++|+|+||+.||+|. . T Consensus 244 ~~~~~~~~~~~~~l~~~~~~~~fVpe~~~~~~lL~~~~~~~~h~AiVvDEyGg~~GiVTlEDilEeivGei~de~d~~~~ 323 (413) T PRK11573 244 RLMTEKKEFTKETMLRAADEIYFVPEGTPLSTQLVKFQRNKKKVGLVVNEYGDIQGLVTVEDILEEIVGDFTTSMSPTLA 323 (413) T ss_pred HHHHCCCCCCHHHHHHHCCCCEEECCCCHHHHHHHHHHHHCCEEEEEECCCCCEEEEEEHHHHHHHHHCCCCCCCCCCCC T ss_conf 77642666887888751478779689890999999988428637999948998889954999999983877555676433 Q ss_pred CCEEEECCCEEEEEECCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEECCEEEEEEEEECCEEEEEEE Q ss_conf 31288349859987001199999982889999877824078999988312488998899999999999961887789999 Q gi|255764470|r 436 LDITVGDDGSLTVDGWIDVRYASKLFGVNLVDEDDRYSTLAGFILWRLGHLPQEKEIFTEMNLKFEIIRLEGHNIDRVKV 515 (523) Q Consensus 436 ~~i~~~~d~~~~v~G~~~l~el~~~l~~~l~~e~~~~~TlaGlil~~l~~iP~~Ge~i~~~g~~f~V~~~~~~rI~~V~V 515 (523) ..+.+.+||+|+++|+++++|+|+.+|+++|+ ++|+|+|||+++++|++|++||++.++||+|+|++++++||++|+| T Consensus 324 ~~~~~~~dg~~~v~G~~~l~dl~~~l~~~lp~--~~~~Tl~G~il~~lg~iP~~Ge~v~~~~~~f~V~~~~~~rI~~v~i 401 (413) T PRK11573 324 EEVTPQNDGSVIIDGTANVREINKAFNWHLPE--DDARTVNGVILEALEEIPVAGTRVRIGEYDIDILDVQDNMIKQVKV 401 (413) T ss_pred CCEEECCCCEEEEECCCCHHHHHHHHCCCCCC--CCCCCHHHHHHHHHCCCCCCCCEEEECCEEEEEEEEECCEEEEEEE T ss_conf 02188789739997677999999985989999--9986289999998586898999999999999999830784769999 Q ss_pred EECCCCC Q ss_conf 9458888 Q gi|255764470|r 516 SGLQNLS 522 (523) Q Consensus 516 ~~l~~~~ 522 (523) +|++|.+ T Consensus 402 ~~~~p~~ 408 (413) T PRK11573 402 TPVKPLR 408 (413) T ss_pred EECCCCC T ss_conf 9789997 No 3 >TIGR03520 GldE gliding motility-associated protein GldE. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldC is a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. GldE was discovered because of its adjacency to GldD in F. johnsonii. Overexpression of GldE partially supresses the effects of a GldB point mutant suggesting that GldB and GldE interact. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Not all Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility and in fact some do not appear to express the gliding phenotype. Probab=100.00 E-value=0 Score=506.88 Aligned_cols=299 Identities=25% Similarity=0.359 Sum_probs=261.9 Q ss_pred HHCCCCCCHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 74573370278-----8998999999999899999875543010377766765674415688876679888999999677 Q gi|255764470|r 208 GLHFDIPKGYL-----YASIGFSGIIEFFNQVARRNREQLMSPSRLRARTADAVLRLLGGKPIQPQGLNVKADVLLPTQH 282 (523) Q Consensus 208 g~~~~ipk~y~-----y~~~~fs~~ve~~n~~~~~~~~~~~~~~~~~eRtA~aVLRLlG~~~~~~~~~~ee~~~li~~~~ 282 (523) =||-=+||.+- ..|..++..+-++..+. .+...+-.+.++.++|++|.+. ...+.+|+..+++.+. T Consensus 100 ifGEiiPK~iA~~~~e~ia~~~a~~l~~~~~l~-------~Pl~~~l~~~~~~l~~~~g~~~--~~~s~eel~~~i~~~~ 170 (408) T TIGR03520 100 LFGEILPKVYANRNNLKFAKFMAYPINILDKVF-------SPISLPLRAITNFIHKKFGKQK--SNISVDQLSQALELTD 170 (408) T ss_pred HHHHHCCHHHHHHCHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHCCCCC--CCCCHHHHHHHHHHHH T ss_conf 741101067887698999999999999999999-------9999999999998865115678--9899999999998540 Q ss_pred HCCCCCHHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHH Q ss_conf 46678989999999751137866746534000289960899989999999857997888876785527888888887620 Q gi|255764470|r 283 EKHIISDQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRD 362 (523) Q Consensus 283 ~~g~l~~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~ 362 (523) ++|.. ++|++|++|+|+|++++|+|+||||++++++|.+++.+++.+.+.+++|||+|||+++.||++|++|+||++.. T Consensus 171 ~~g~~-~eE~~~i~~vl~l~~~~v~eiMtPr~~v~~l~~~~~~~e~~~~~~~~~~SR~PV~~~~~D~iiGiv~~kDll~~ 249 (408) T TIGR03520 171 EEDTT-KEEQKILQGIVSFGNTDTKQVMRPRLDIFALDIETSFSEIIPKIIENGYSRIPVYKETIDNITGVLYIKDLLPH 249 (408) T ss_pred CCCCC-HHHHHHHHHHHCCCCCEEHHCCCCHHHEEEEECCCHHHHHHHHHHHCCCCCCEEECCCCCCEEEEEEHHHHHHH T ss_conf 34877-89999999982759989034235457958840644389999999857987222665898761799779899878 Q ss_pred HHCCCCCHHHHHHCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCC-CCEEEE Q ss_conf 2024660168873274352389998999998850386277752286742565259999998706644578886-312883 Q gi|255764470|r 363 LLEEGSMNFKRSIRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQK-LDITVG 441 (523) Q Consensus 363 ~~~~~~~~~~~lir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~-~~i~~~ 441 (523) .. .++.++.+++|||+||||++++.++|++||+++.|||+|+||||+++|+||+||++|+|+||+.||+|++ ..+++. T Consensus 250 ~~-~~~~~~~~~~r~~~~Vpe~~~~~~lL~~~~~~~~~~AiVvDEyG~~~GiVTlEDilEeivGei~DE~d~~~~~~~~~ 328 (408) T TIGR03520 250 LN-KKNFDWQSLLREPYFVPENKKLDDLLRDFQEKKNHLAIVVDEYGGTSGLVTLEDIIEEIVGDISDEFDDEDLIYSKI 328 (408) T ss_pred HH-CCCCCHHHHHHHCCCCCCCCCHHHHHHHHHHHCCEEEEEECCCCCEEEEEEHHHHHHHHHCCCCCCCCCCCCCEEEC T ss_conf 73-69998899863214899977889999999863874899998999878998889989999588868677665550873 Q ss_pred CCCEEEEEECCCHHHHHHHHCCCCCC---CCCCCCCHHHHHHHHHCCCCCCCCEEEECCEEEEEEEEECCEEEEEEEEE Q ss_conf 49859987001199999982889999---87782407899998831248899889999999999996188778999994 Q gi|255764470|r 442 DDGSLTVDGWIDVRYASKLFGVNLVD---EDDRYSTLAGFILWRLGHLPQEKEIFTEMNLKFEIIRLEGHNIDRVKVSG 517 (523) Q Consensus 442 ~d~~~~v~G~~~l~el~~~l~~~l~~---e~~~~~TlaGlil~~l~~iP~~Ge~i~~~g~~f~V~~~~~~rI~~V~V~~ 517 (523) +||+|+++|+++++|+++.+++++|. ++++|+|+|||+++++||+|++||++.++||+|+|+++++|||++|+|+. T Consensus 329 ~~~~~~v~G~~~l~dl~~~l~~~~~~~~~~~~~~~Tl~G~il~~l~~iP~~ge~v~~~~~~f~V~~~~~~rI~~V~vt~ 407 (408) T TIGR03520 329 DDNNYVFEGKTSLKDFYKILKLEEDMFDEVKGEAETLAGFLLEISGGFPKKGEKITFENFEFTIEAMDKKRIKQVKVTI 407 (408) T ss_pred CCCEEEEEECCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEECCEEEEEEEEECCEEEEEEEEE T ss_conf 7986999802699999998589988765657897708999999858689999999999999999984078677999997 No 4 >COG0861 TerC Membrane protein TerC, possibly involved in tellurium resistance [Inorganic ion transport and metabolism] Probab=100.00 E-value=0 Score=467.63 Aligned_cols=241 Identities=49% Similarity=0.776 Sum_probs=228.7 Q ss_pred CCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 91465358789999999999999974439999999851799899999999999999999999999999999974247999 Q gi|255764470|r 1 MFFSWIYDYYVWIGLATLIALELVLGIDNLIFITLLVEKLPLAQRGKALVFGLTFAMVTRIALLASLSYWIVMLQQPLFF 80 (523) Q Consensus 1 m~~~~~~~p~~w~~l~tl~~lEivLs~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l~~pl~~ 80 (523) ++|+|+.||+.|++++|++++|+|||+||++|||+++++|||+||+|||+||+.+|+++|++++++++ |++++++|++. T Consensus 6 ~~~~~~~~~~~~~~l~tl~~lE~vL~iDN~iviai~~~~Lp~~qr~kal~~Gl~~A~v~R~~ll~~~s-~Ll~l~~~l~~ 84 (254) T COG0861 6 LYMEWLADPAAWVALLTLILLEIVLGIDNAIVIAILASKLPPKQRKKALFIGLAGALVLRIILLASIS-WLLTLTQPLLY 84 (254) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH T ss_conf 99998527168999999999999987249999999982389999879999999999999999999999-99980389999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHH Q ss_conf 86345322248999999999999865555302576421111222-33209999999999999985778999998541469 Q gi|255764470|r 81 LKGLSFSGRDIVLILGGFFLLFKGTIELHERLEGDGFDKKHKFF-SPVSWQVIVLQIVILDLVFSLDSVVTAIGMVQHFS 159 (523) Q Consensus 81 i~~~~~s~~~lill~GG~fLl~k~~~el~~~~e~~~~~~~~~~~-~~~~~~~~v~~I~v~D~vFSlDsVitavg~~~~~~ 159 (523) +++..++|++++++.||.++.+|+++|.|++.|+++... ++.. +.++||.++.||+++|++||+|||+||+|||+|++ T Consensus 85 ~fg~~L~~~~~~ll~~~~~~~~k~~~~~~~~~~~~~~~~-~~~~~~~~~f~~ai~~I~i~D~vFSlDSV~Aa~g~~~~~~ 163 (254) T COG0861 85 IFGLYLLWRDIKLLLGGLFLLFKATKELHERLEGEEFFV-NGKLKKATPFWGAIIQIELADLVFSLDSVIAAVGMAGHPF 163 (254) T ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCC-CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCH T ss_conf 999999999999972430377788988765305630111-3432126859999999999999986538999999946855 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH---HCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999899999998485289999999999999999977---4573370278899899999999989999 Q gi|255764470|r 160 VMAIAVAVSALMMMAVSQPMIRYISRHTTVVILCLGFLLMIGFLLIIEG---LHFDIPKGYLYASIGFSGIIEFFNQVAR 236 (523) Q Consensus 160 i~~~a~~isi~~m~~~s~~i~~~i~~~p~~~~l~l~fl~~iG~~L~~eg---~~~~ipk~y~y~~~~fs~~ve~~n~~~~ 236 (523) +|++|+++|+++|+++|+++.+|++|||+++++|++||++||++|++|+ +++|+||+|+|++++||+++|.+|+..+ T Consensus 164 im~~a~i~aI~~m~~aa~~l~~ll~r~p~l~~~~~~iL~~IG~kli~~~~~~~~~~ip~~~~~~~v~f~vl~~~~~~~~~ 243 (254) T COG0861 164 VMVTAVIFAILVMRFAAFLLARLLERHPTLKYLALVILLFIGVKLILEGLAHFGFHIPKGYLYLAVGFSVLIELLNISAR 243 (254) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999998638999999999999999999601011431780689999999999999999988 Q ss_pred HHHHHHH Q ss_conf 9875543 Q gi|255764470|r 237 RNREQLM 243 (523) Q Consensus 237 ~~~~~~~ 243 (523) +|+++.. T Consensus 244 ~~~~~~~ 250 (254) T COG0861 244 KKKKKKK 250 (254) T ss_pred HHHHHCC T ss_conf 6353011 No 5 >COG4536 CorB Putative Mg2+ and Co2+ transporter CorB [Inorganic ion transport and metabolism] Probab=100.00 E-value=0 Score=449.75 Aligned_cols=334 Identities=27% Similarity=0.452 Sum_probs=281.2 Q ss_pred HHHHHHHH--HCHHHHHHHHHHHHHHHHHHHHHHHHHH-HHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH Q ss_conf 99999854--1469999999999999999989999999-84852899999999999999999774573370278899899 Q gi|255764470|r 148 VVTAIGMV--QHFSVMAIAVAVSALMMMAVSQPMIRYI-SRHTTVVILCLGFLLMIGFLLIIEGLHFDIPKGYLYASIGF 224 (523) Q Consensus 148 Vitavg~~--~~~~i~~~a~~isi~~m~~~s~~i~~~i-~~~p~~~~l~l~fl~~iG~~L~~eg~~~~ipk~y~y~~~~f 224 (523) +.|.+|+. .+.++-++-.+.. .+|+++|..+-|-+ -.||.=+.+..||++.+=.. .| T Consensus 76 laT~~~irl~Gd~GvaIAt~~mT-~vilvFaEVlPKt~Aa~~perva~~~s~~l~~l~~-------------------l~ 135 (423) T COG4536 76 LATILGIRLYGDAGVAIATGVLT-FVILVFAEVLPKTIAALYPERVALPSSFILAILVR-------------------LF 135 (423) T ss_pred HHHHHHHHHHCCCHHHHHHHHHH-HHHHHHHHHCCHHHHHHCHHHHHHHHHHHHHHHHH-------------------HH T ss_conf 99999999715323999988889-99999997620177652836544134578999999-------------------99 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC---CCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCC Q ss_conf 999999989999987554301037776676567441568887---66798889999996774667898999999975113 Q gi|255764470|r 225 SGIIEFFNQVARRNREQLMSPSRLRARTADAVLRLLGGKPIQ---PQGLNVKADVLLPTQHEKHIISDQEKDMVQSVLTL 301 (523) Q Consensus 225 s~~ve~~n~~~~~~~~~~~~~~~~~eRtA~aVLRLlG~~~~~---~~~~~ee~~~li~~~~~~g~l~~~E~~mi~~vL~L 301 (523) .-++=++|.+ .+.++|++|.+... +..+.||.+.++++++++|...+++++|+-++|+| T Consensus 136 ~Plv~lln~i------------------t~~llrl~gi~~~~~~~~~~s~EElR~~v~~~~~e~~~~~~~rdmL~gvLDL 197 (423) T COG4536 136 GPLVWLLNAI------------------TRRLLRLLGINLDQAVSQLSSKEELRTAVNESGSEGSVNKIDRDMLLGVLDL 197 (423) T ss_pred HHHHHHHHHH------------------HHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHCCCCC T ss_conf 9999999999------------------9999999576855466433678999899988613356231037877431153 Q ss_pred CCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCC---CHHHHHHCCC Q ss_conf 7866746534000289960899989999999857997888876785527888888887620202466---0168873274 Q gi|255764470|r 302 ADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGS---MNFKRSIRKP 378 (523) Q Consensus 302 ~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~---~~~~~lir~~ 378 (523) .+.+|.||||||++|..+|.|++.+++.+++.+++|+|+|+|+++.||++|++|+||+++...+.++ .++....++| T Consensus 198 e~~tV~DIMvpR~~i~~id~d~~~e~iv~ql~~s~HtRiplyr~~~DnIiGvlh~r~llr~l~e~~~~~k~d~~~~a~ep 277 (423) T COG4536 198 ENLTVSDIMVPRNEIIGIDIDDPWEEIVRQLLHSPHTRIPLYRDDLDNIIGVLHVRDLLRLLNEKNEFTKEDILRAADEP 277 (423) T ss_pred CCCEEEEEECCCCCEEEECCCCCHHHHHHHHHHCCCCCCEEECCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCC T ss_conf 13554400213222155348998999999996288774001448746753343199999996345864075799874487 Q ss_pred EEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCC-CCCEEEECCCEEEEEECCCHHHH Q ss_conf 35238999899999885038627775228674256525999999870664457888-63128834985998700119999 Q gi|255764470|r 379 LVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQ-KLDITVGDDGSLTVDGWIDVRYA 457 (523) Q Consensus 379 ~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~-~~~i~~~~d~~~~v~G~~~l~el 457 (523) +||||++++.+.|..||+++.|+|+||||||.++|+||+|||+|||||||.||+|. ..++.+++||++++||+.+++|+ T Consensus 278 yFVPe~Tpl~~QL~~F~~~k~hialVVDEYG~i~GLVTLEDIlEEIVGdftde~d~~~~ev~~q~dgs~iidGs~~iRdl 357 (423) T COG4536 278 YFVPEGTPLSDQLVAFQRNKKHIALVVDEYGDIQGLVTLEDILEEIVGDFTDEHDTLAKEVIPQSDGSFIIDGSANVRDL 357 (423) T ss_pred EECCCCCCHHHHHHHHHHHCCEEEEEEECCCCEEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEECCCCHHHH T ss_conf 45689992899999999725127999824676776646999999985433233664331002356874898288758998 Q ss_pred HHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEECCEEEEEEEEECCEEEEEEEEECCCC Q ss_conf 9982889999877824078999988312488998899999999999961887789999945888 Q gi|255764470|r 458 SKLFGVNLVDEDDRYSTLAGFILWRLGHLPQEKEIFTEMNLKFEIIRLEGHNIDRVKVSGLQNL 521 (523) Q Consensus 458 ~~~l~~~l~~e~~~~~TlaGlil~~l~~iP~~Ge~i~~~g~~f~V~~~~~~rI~~V~V~~l~~~ 521 (523) |+.|+|+||++++ .|++|++++++++||+.|.++.++|++|+|++++.|||++||++++... T Consensus 358 Nr~l~W~Lp~e~a--~TinGLvle~le~IP~~~~~~~i~~~~~~Il~~~~N~I~~vri~~~~~~ 419 (423) T COG4536 358 NRALDWNLPDEDA--RTINGLVLEELEEIPEEGQAVRIGGYRIDILKVQENRIKQVKITPLKAL 419 (423) T ss_pred HHHCCCCCCCCCC--HHHHHHHHHHHHHCCCCCEEEEECCEEEEEEEHHCCCEEEEEEEECCCC T ss_conf 8861988999864--3477899999985899871899937799998721061316887734655 No 6 >COG4535 CorC Putative Mg2+ and Co2+ transporter CorC [Inorganic ion transport and metabolism] Probab=100.00 E-value=0 Score=407.18 Aligned_cols=273 Identities=26% Similarity=0.425 Sum_probs=249.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCEEEEEEEEECCEEEEE Q ss_conf 55430103777667656744156888766798889999996774667898999999975113786674653400028996 Q gi|255764470|r 240 EQLMSPSRLRARTADAVLRLLGGKPIQPQGLNVKADVLLPTQHEKHIISDQEKDMVQSVLTLADRPAKSIMTPRTEIVWL 319 (523) Q Consensus 240 ~~~~~~~~~~eRtA~aVLRLlG~~~~~~~~~~ee~~~li~~~~~~g~l~~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~l 319 (523) .....++++++|-.+ +++.+. ....+|+-.++..+++.+.++++-+.|+++|++.++.+|||||+||++++++ T Consensus 10 ~~~~~~ksffe~L~~---~~f~gE----pknr~eLl~liRdse~n~LiD~dt~~mlEGvm~iadl~vrDiMIPRSQM~~l 82 (293) T COG4535 10 DSANKKKSFFERLLS---QLFHGE----PKNREELLELIRDSEQNELIDADTLDMLEGVMDIADLRVRDIMIPRSQMITL 82 (293) T ss_pred CCCCCCCCHHHHHHH---HHHCCC----CCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHEEC T ss_conf 766656649999999---983689----8678999999987645346275689999888999875476611227782221 Q ss_pred ECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCC-CCCHHHHHHCCCEEECCCCCHHHHHHHHHHCC Q ss_conf 08999899999998579978888767855278888888876202024-66016887327435238999899999885038 Q gi|255764470|r 320 DVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEE-GSMNFKRSIRKPLVVHENISVLKLMERLRKSS 398 (523) Q Consensus 320 d~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~-~~~~~~~lir~~~~Vpe~~~l~~lL~~fr~~~ 398 (523) +.+++++++...+.++.||||||+.++.|++.|++++||||+....+ ...++++++||+.+|||++.+..+|+.||.++ T Consensus 83 ~~~~~l~~~l~~iiesaHSRfPVi~edkD~v~GIL~AKDLL~~~~~~~~~F~i~~lLRPav~VPESKrvd~lLkeFR~~R 162 (293) T COG4535 83 KRNQTLDECLDVIIESAHSRFPVISEDKDHVEGILLAKDLLPFMRSDAEPFDIKELLRPAVVVPESKRVDRLLKEFRSQR 162 (293) T ss_pred CCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHCCCCEECCCCHHHHHHHHHHHHHC T ss_conf 01379999999999851155871147700134454588788774078654638886155243466126999999998504 Q ss_pred CEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCC--CCEEEECCCEEEEEECCCHHHHHHHHCCCCCCCCCCCCCHH Q ss_conf 6277752286742565259999998706644578886--31288349859987001199999982889999877824078 Q gi|255764470|r 399 QTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQK--LDITVGDDGSLTVDGWIDVRYASKLFGVNLVDEDDRYSTLA 476 (523) Q Consensus 399 ~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~--~~i~~~~d~~~~v~G~~~l~el~~~l~~~l~~e~~~~~Tla 476 (523) .|||+|+||||++.|+||.||++|.|||+|.||+|++ .++.++++++|.+.+.++++++|+.||.++++ ++++|+| T Consensus 163 nHMAIViDEfGgVsGLVTIEDiLEqIVGdIEDE~Deee~~dI~~ls~~~~~vrALT~IedFN~~F~t~FsD--eevDTIG 240 (293) T COG4535 163 NHMAIVIDEFGGVSGLVTIEDILEQIVGDIEDEYDEEEDADIRQLSRHTWRVRALTEIEDFNEAFGTHFSD--EEVDTIG 240 (293) T ss_pred CCEEEEEECCCCEEEEEEHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHCCCCCH--HHHHHHH T ss_conf 72699982257720167799999998543310135055444676347726998446089989986688872--2212166 Q ss_pred HHHHHHHCCCCCCCCEEEECCEEEEEEEEECCEEEEEEEEECCCC Q ss_conf 999988312488998899999999999961887789999945888 Q gi|255764470|r 477 GFILWRLGHLPQEKEIFTEMNLKFEIIRLEGHNIDRVKVSGLQNL 521 (523) Q Consensus 477 Glil~~l~~iP~~Ge~i~~~g~~f~V~~~~~~rI~~V~V~~l~~~ 521 (523) |+++..+|++|+.||++.++||.|+|...|.||+-.++|+.++.- T Consensus 241 GLVm~afGhLP~RGE~i~i~~~~FkV~~ADsRrv~~l~vt~~~~~ 285 (293) T COG4535 241 GLVMQAFGHLPARGEKIDIGGYQFKVARADSRRVIQLHVTVPDDS 285 (293) T ss_pred HHHHHHHCCCCCCCCEEEECCEEEEEEECCCCEEEEEEEECCCCC T ss_conf 799998366877787788746378986046522799997658876 No 7 >pfam03741 TerC Integral membrane protein TerC family. This family contains a number of integral membrane proteins that also contains the TerC protein. TerC has been implicated in resistance to tellurium. This protein may be involved in efflux of tellurium ions. The tellurite-resistant Escherichia coli strain KL53 was found during testing of the group of clinical isolates for antibiotics and heavy metal ion resistance. Determinant of the tellurite resistance of the strain was located on a large conjugative plasmid. Analyses showed, the genes terB, terC, terD and terE are essential for conservation of the resistance. The members of the family contain a number of conserved aspartates that could be involved in binding to metal ions. Probab=100.00 E-value=8.5e-41 Score=317.22 Aligned_cols=178 Identities=34% Similarity=0.449 Sum_probs=159.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999974439999999851799899999999999999999999999999999974247999863453222489999 Q gi|255764470|r 16 ATLIALELVLGIDNLIFITLLVEKLPLAQRGKALVFGLTFAMVTRIALLASLSYWIVMLQQPLFFLKGLSFSGRDIVLIL 95 (523) Q Consensus 16 ~tl~~lEivLs~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l~~pl~~i~~~~~s~~~lill~ 95 (523) +|++++|++||+||++||+++++++|++||+||++||+.+|+++|+++++.++ |+++++. .++.. T Consensus 1 ltl~~lE~vLs~DN~ivial~~~~lP~~~r~~al~~Gi~~A~v~R~i~i~~~~-~ll~~~~--------------~l~~i 65 (184) T pfam03741 1 LTLYLLEISLSVDNAFVIALIFRKLPPEQQKKALFWGIIGALVLRIILILLGS-ALLELFD--------------WILLI 65 (184) T ss_pred CEEHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH--------------HHHHH T ss_conf 92026559988549999999980489999999999999999999999999999-9999707--------------99999 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCC------CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHH Q ss_conf 999999998655553025764211------11222332099999999999999857789999985414699999999999 Q gi|255764470|r 96 GGFFLLFKGTIELHERLEGDGFDK------KHKFFSPVSWQVIVLQIVILDLVFSLDSVVTAIGMVQHFSVMAIAVAVSA 169 (523) Q Consensus 96 GG~fLl~k~~~el~~~~e~~~~~~------~~~~~~~~~~~~~v~~I~v~D~vFSlDsVitavg~~~~~~i~~~a~~isi 169 (523) ||+||+|++++++++..+.++++. ....+..++||.++.||+++|++||+|||+||+|+|+|+++|++|+++|+ T Consensus 66 gg~~Li~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~av~~I~~~DlvFSlDSV~aa~g~t~~~~ii~~~~i~si 145 (184) T pfam03741 66 GGAFLLYTAIKLLRENEEDDEENKIVRELKKVLPVTSSSLWLAVIQIELADLVFSLDSVPAAVGITDDPFIVITGNIFAI 145 (184) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 99999999999987323443200234544301565403799999999999999876289999998135599999999999 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999899999998485289999999999999999977 Q gi|255764470|r 170 LMMMAVSQPMIRYISRHTTVVILCLGFLLMIGFLLIIEG 208 (523) Q Consensus 170 ~~m~~~s~~i~~~i~~~p~~~~l~l~fl~~iG~~L~~eg 208 (523) +.|+++|+.++++++|||.++++|++||++||+.|++|| T Consensus 146 ~~m~~~~~~~~~~i~~~~~l~~~~~~~L~~ig~kLi~eg 184 (184) T pfam03741 146 LGLRFLAFLLAKLIERFPYLKYLAAAILGFIGVKLLLEG 184 (184) T ss_pred HHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHCC T ss_conf 999999999999999876999999999999999996089 No 8 >cd04590 CBS_pair_CorC_HlyC_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the CorC_HlyC domain. CorC_HlyC is a transporter associated domain. This small domain is found in Na+/H+ antiporters, in proteins involved in magnesium and cobalt efflux, and in association with some proteins of unknown function. The function of the CorC_HlyC domain is uncertain but it might be involved in modulating transport of ion substrates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, Probab=99.91 E-value=7.9e-24 Score=193.19 Aligned_cols=110 Identities=42% Similarity=0.667 Sum_probs=103.2 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCC-CHHHHHHCCCEEECCCCCHHHHHH Q ss_conf 0289960899989999999857997888876785527888888887620202466-016887327435238999899999 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGS-MNFKRSIRKPLVVHENISVLKLME 392 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~-~~~~~lir~~~~Vpe~~~l~~lL~ 392 (523) +++++++.++|++++++.+.+++|||+|||+++.|+++|++|.||++.....++. .....++|||+++|+++++.++|+ T Consensus 1 ~~v~~i~~~~s~~e~~~~~~~~~~sr~PV~~~~~~~ivGil~~kDll~~~~~~~~~~~~~~~~~~~~~v~~~~~l~~~l~ 80 (111) T cd04590 1 TDIVALDADDTLEEILELIAESGHSRFPVYDGDLDNIIGVVHVKDLLRALAEGEEDLDLRDLLRPPLFVPESTPLDDLLE 80 (111) T ss_pred CCEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEEHHHHHHHHCCCCCCCHHHHCCCCEECCHHHHHHHHHH T ss_conf 96899969999999999999779977999954788778999911445666527876407764044222255665889998 Q ss_pred HHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 8850386277752286742565259999998 Q gi|255764470|r 393 RLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 393 ~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) .||+++.|+|+|+||||+++|+||++|++|+ T Consensus 81 ~~~~~~~~~a~VvDe~G~~~GivT~~DilE~ 111 (111) T cd04590 81 EMRKERSHMAIVVDEYGGTAGLVTLEDILEE 111 (111) T ss_pred HHHHCCCEEEEEECCCCCEEEEEEHHHHHCC T ss_conf 4355496799999699999999980485377 No 9 >pfam04332 DUF475 Protein of unknown function (DUF475). Predicted to be an integral membrane protein with multiple membrane spans. Probab=99.66 E-value=2.4e-15 Score=131.22 Aligned_cols=205 Identities=18% Similarity=0.242 Sum_probs=139.1 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------HHHHHHHH Q ss_conf 439999999851799899999999999999-999999999999999974247999863453-----------22248999 Q gi|255764470|r 27 IDNLIFITLLVEKLPLAQRGKALVFGLTFA-MVTRIALLASLSYWIVMLQQPLFFLKGLSF-----------SGRDIVLI 94 (523) Q Consensus 27 ~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~A-~v~R~~ll~~~~~wi~~l~~pl~~i~~~~~-----------s~~~lill 94 (523) .||||+.|-.-+++.|.+||+.+.||+..| +.||+++-..+. |+.+=-.| .......+ ..+..|.. T Consensus 1 FDNAVVNA~vLk~Ms~~Wqk~FLt~GIlIAVFGMRlvFPllIV-~~~a~lgp-~ea~~lAl~~p~~Y~~~l~~ahp~Iaa 78 (294) T pfam04332 1 FDNAVVNASILKRMSPFWQKMFLTIGILIAVFGMRLIFPLVIV-AVSAGLDP-IEAMKLALENPDRYEELLTAAHPSIAA 78 (294) T ss_pred CCCCEEEHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHCCCCH-HHHHHHHHCCHHHHHHHHHHCCHHHHH T ss_conf 9720111887864369999999998199998988876657887-65278898-999999836978899999843688998 Q ss_pred HHHHHHHHHHHHHHHHH--------------------------------------------------------------- Q ss_conf 99999999986555530--------------------------------------------------------------- Q gi|255764470|r 95 LGGFFLLFKGTIELHER--------------------------------------------------------------- 111 (523) Q Consensus 95 ~GG~fLl~k~~~el~~~--------------------------------------------------------------- 111 (523) +||.||+....+-+.+. T Consensus 79 FGG~FLlmvfL~fffdek~~~Wl~~iE~~l~k~g~~~~~~v~i~l~~ll~~~~~~~~~~~~~~~vl~ag~~Gl~~y~~v~ 158 (294) T pfam04332 79 FGGMFLLMVFLDFFFDDRDIKWLKWIERPLARIGKLDAISAIIVFVLLLIFAFFFTAPAEQFQTVLFAGLLGLLTYLLVD 158 (294) T ss_pred HHHHHHHHHHHHHHCCCCCCCEEHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 85999999999870206636543698799987277121299999999999999833640156899999999999999999 Q ss_pred -----CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHH------HHHHHHHHHHHHH Q ss_conf -----25764211112223320999999999999998577899999854146999999999------9999999989999 Q gi|255764470|r 112 -----LEGDGFDKKHKFFSPVSWQVIVLQIVILDLVFSLDSVVTAIGMVQHFSVMAIAVAV------SALMMMAVSQPMI 180 (523) Q Consensus 112 -----~e~~~~~~~~~~~~~~~~~~~v~~I~v~D~vFSlDsVitavg~~~~~~i~~~a~~i------si~~m~~~s~~i~ 180 (523) .+.++.+........++-.....+.+++|+.||+|.|+.|.++|+|++++++|+.| |+.+|++-.+.+. T Consensus 159 ~l~~~~~~~~~~~~~~~~~~k~g~~~FLYLEVLDASFSFDGVIGAFAiT~~i~iI~iGLGIGAmfVRSlTi~lV~kgtL~ 238 (294) T pfam04332 159 GFGRLFEPDGSPTGAVRAAGKAGLALFLYLEVLDASFSFDGVIGAFAITNDIVIIALGLGIGAMFVRSLTVYLVRKGTLD 238 (294) T ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEHHHHHCCHHHHHHHCCHHHEEEEEEEEEEEECCHHH T ss_conf 99988255677633443446889999999999861102154310135512579999602134210040256755447087 Q ss_pred HHHH--HCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9998--48528999999999999999997745733702788998999999999899999875 Q gi|255764470|r 181 RYIS--RHTTVVILCLGFLLMIGFLLIIEGLHFDIPKGYLYASIGFSGIIEFFNQVARRNRE 240 (523) Q Consensus 181 ~~i~--~~p~~~~l~l~fl~~iG~~L~~eg~~~~ipk~y~y~~~~fs~~ve~~n~~~~~~~~ 240 (523) +++. .---.-+.|| ++.+..+.+++|||-- +-.-+++.++.=.+-.-.|+||+ T Consensus 239 ~Y~YLEHGA~yAIg~L------avIMl~~~~~~~iPE~-iTgliG~~~Ig~a~~sSi~~nrr 293 (294) T pfam04332 239 DYVYLEHGAHYAIGAL------AVIMLLSIRRIEIPEV-VTGLVGVVFIGASLWSSVIRNRR 293 (294) T ss_pred HHHHHHCCHHHHHHHH------HHHHHHHCCCCCCCHH-HHHHHHHHHHHHHHHHHHHHHHC T ss_conf 7677731389999999------9999876146357468-88677999999999999998734 No 10 >KOG2118 consensus Probab=99.64 E-value=5e-16 Score=136.16 Aligned_cols=149 Identities=23% Similarity=0.303 Sum_probs=130.8 Q ss_pred CCCCHHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHH-HHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHH Q ss_conf 678989999999751137866746534000289960899989-9999998579978888767855278888888876202 Q gi|255764470|r 285 HIISDQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDE-DLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDL 363 (523) Q Consensus 285 g~l~~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~-e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~ 363 (523) |.+..+|..++.+++++.++.++++|||-.+++.++.+.... +....+.+++|||+|||+++..+.+|....+.++... T Consensus 185 g~~~~~e~~ii~g~l~l~ek~~~evmtpi~~~f~l~~n~~l~~~~~~~i~~~g~sripv~~~~~~~~i~~~L~~~~~~~~ 264 (498) T KOG2118 185 GDLTYDELTIITGALELTEKLVGEVMTPIEDVFALDANTKLDRETVGEIVKHGYSRIPVYEQEPKNKIGGLLVMNLLRLL 264 (498) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCHHHHHHHHHCCCCEEEECCCCCCCEEHHHHHHHHHHHC T ss_conf 75524577887536899999988730614443001100210178888887658760000366652000044433123320 Q ss_pred H----CCCCCHHHHHHCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCC Q ss_conf 0----24660168873274352389998999998850386277752286742565259999998706644578886 Q gi|255764470|r 364 L----EEGSMNFKRSIRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQK 435 (523) Q Consensus 364 ~----~~~~~~~~~lir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~ 435 (523) . .....--....++...+|++++..+.++.||+.+.|||+|.. +....|++|++|+ |++++|+.||.|.+ T Consensus 265 ~~~~~~~~~~v~~~~~~~l~~vp~~~~~~~~l~~~~~~~~H~~~v~~-~~~~~~~~~l~~~-~~~~~ev~de~d~~ 338 (498) T KOG2118 265 QVEVPLEPLPVSESALLRLPLVPENMPLLDLLNEFQKGKSHMAVVRN-GHVDIFVLTLEDL-EEVVKEVEDEEDKE 338 (498) T ss_pred CCCCCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEC-CCCCEEEEECCCH-HHHHCCCCCCCCCC T ss_conf 43345663003455412545688743088889998731213688824-8762444641414-54302133412454 No 11 >COG2899 Uncharacterized protein conserved in bacteria [Function unknown] Probab=99.64 E-value=2.2e-15 Score=131.40 Aligned_cols=219 Identities=18% Similarity=0.234 Sum_probs=146.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH---H-------H Q ss_conf 999999999974439999999851799899999999999999-9999999999999999742479998---6-------3 Q gi|255764470|r 15 LATLIALELVLGIDNLIFITLLVEKLPLAQRGKALVFGLTFA-MVTRIALLASLSYWIVMLQQPLFFL---K-------G 83 (523) Q Consensus 15 l~tl~~lEivLs~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~A-~v~R~~ll~~~~~wi~~l~~pl~~i---~-------~ 83 (523) ...|-+||++||.|||++.|-.-+++.|.+|||.+.||+..| +.||+++-..+. |+..=..|.=.. . . T Consensus 39 ~~vLavLEiSLSFDNAIvNA~iLk~MS~~Wqk~FLT~GIlIAVFGMRlvFPl~IV-~vaa~~~pi~a~~lAl~~P~~Y~~ 117 (346) T COG2899 39 CAVLAVLEISLSFDNAIVNAAILKDMSPFWQKRFLTWGILIAVFGMRLVFPLVIV-AVAAGLDPIRAMKLALEPPESYAK 117 (346) T ss_pred HHHHHHHHHHEECHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHCCCCHHHHHHHHCCCCHHHHH T ss_conf 9999996301021187861898875369999999999999999764887778898-886177908999987069377899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH--HC------------------------------------CCCCC-------- Q ss_conf 453222489999999999998655553--02------------------------------------57642-------- Q gi|255764470|r 84 LSFSGRDIVLILGGFFLLFKGTIELHE--RL------------------------------------EGDGF-------- 117 (523) Q Consensus 84 ~~~s~~~lill~GG~fLl~k~~~el~~--~~------------------------------------e~~~~-------- 117 (523) .--..+.-|-.+||.||+....+...+ |. ..+.+ T Consensus 118 ii~~aH~~IAAFGG~FLlMv~L~fffd~erd~hWl~~iE~~~arig~~~~v~vi~~~~lll~~s~~l~~~~~~~~~l~Ag 197 (346) T COG2899 118 IITDAHPQIAAFGGTFLLMVFLDFFFDHERDVHWLKWIERPLARIGRLGGVEVIVAIALLLLFSRLLTASADRGTVLIAG 197 (346) T ss_pred HHHHCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCEEHHHH T ss_conf 99855830556400899999988731840013366668789987447787020899999999998834755443042977 Q ss_pred -------------------CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHH------HHHHH Q ss_conf -------------------11112223320999999999999998577899999854146999999999------99999 Q gi|255764470|r 118 -------------------DKKHKFFSPVSWQVIVLQIVILDLVFSLDSVVTAIGMVQHFSVMAIAVAV------SALMM 172 (523) Q Consensus 118 -------------------~~~~~~~~~~~~~~~v~~I~v~D~vFSlDsVitavg~~~~~~i~~~a~~i------si~~m 172 (523) .++......+.-.+.-.+.+++|+.||+|.|+.|.++|.|+.++.+|..| |+.+| T Consensus 198 l~GlltyLlV~~vg~l~~~~~~~~~~a~kaGla~FLYLEVLDAsFSFDGViGAFAiT~d~vIIalGLgIGAmfVRSiTi~ 277 (346) T COG2899 198 LLGLLTYLLVDGVGGLLDATQQAMQAAGKAGLAAFLYLEVLDASFSFDGVIGAFAITTDPVIIALGLGIGAMFVRSITIY 277 (346) T ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCHHHEECCCHHHEEEEEEEEE T ss_conf 99999999999842285428888766521325678889887310012652333200157203100032130342515889 Q ss_pred HHHHHHHHHHH--HHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99989999999--8485289999999999999999977457337027889989999999998999998755 Q gi|255764470|r 173 MAVSQPMIRYI--SRHTTVVILCLGFLLMIGFLLIIEGLHFDIPKGYLYASIGFSGIIEFFNQVARRNREQ 241 (523) Q Consensus 173 ~~~s~~i~~~i--~~~p~~~~l~l~fl~~iG~~L~~eg~~~~ipk~y~y~~~~fs~~ve~~n~~~~~~~~~ 241 (523) ++--+.+.+++ +--.-.-+++|+++++.+.. -+|||.- +-.-+|-.++.-.+---.|+||+. T Consensus 278 LV~kgTL~~y~yLEHGAhyAI~~Laviml~s~~------~~hIpEv-vTgL~Ga~fIgls~~sSv~~Nr~~ 341 (346) T COG2899 278 LVEKGTLDEYVYLEHGAHYAIGALAVIMLLSTD------RFHIPEV-VTGLVGAVFIGLSLWSSVRYNRRE 341 (346) T ss_pred EEECCCHHHHHHHHCCHHHHHHHHHHHHHHHHH------HEEHHHH-HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 886372877787750358999999999998510------0423789-987667999999999999986750 No 12 >pfam03471 CorC_HlyC Transporter associated domain. This small domain is found in a family of proteins with the pfam01595 domain and two CBS domains with this domain found at the C-terminus of the proteins, the domain is also found at the C terminus of some Na+/H+ antiporters. This domain is also found in CorC that is involved in Magnesium and cobalt efflux. The function of this domain is uncertain but might be involved in modulating transport of ion substrates. Probab=99.63 E-value=4.4e-15 Score=129.28 Aligned_cols=77 Identities=34% Similarity=0.608 Sum_probs=71.6 Q ss_pred EECCCEEEEEECCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCE--EEECCEEEEEEEEECCEEEEEEEEE Q ss_conf 83498599870011999999828899998778240789999883124889988--9999999999996188778999994 Q gi|255764470|r 440 VGDDGSLTVDGWIDVRYASKLFGVNLVDEDDRYSTLAGFILWRLGHLPQEKEI--FTEMNLKFEIIRLEGHNIDRVKVSG 517 (523) Q Consensus 440 ~~~d~~~~v~G~~~l~el~~~l~~~l~~e~~~~~TlaGlil~~l~~iP~~Ge~--i~~~g~~f~V~~~~~~rI~~V~V~~ 517 (523) +.++|+|+++|+++++++|+.+++++|+ ++|+|+|||+++++|++|++||+ +.+++|+|+|++++++||.+|+|++ T Consensus 2 ~~~~~~~~v~G~~~l~dl~~~~~~~l~~--~~~~TigG~i~~~l~~iP~~ge~~~i~~~~~~f~V~~~~~~rI~~v~i~~ 79 (80) T pfam03471 2 KLDDGSYLVDGRAPLDDLNELLGLDLPE--EDYDTLAGLVLELLGRIPEVGDKVEVDLGGLRFTVLEVDGRRIKKVRVTK 79 (80) T ss_pred EECCCEEEEEECCCHHHHHHHHCCCCCC--CCCEEHHHHHHHHHCCCCCCCCEEEEEECCEEEEEEEEECCEEEEEEEEE T ss_conf 8889889998026999999987979998--65205999999986869999988999979989999991179898999998 Q ss_pred C Q ss_conf 5 Q gi|255764470|r 518 L 518 (523) Q Consensus 518 l 518 (523) . T Consensus 80 ~ 80 (80) T pfam03471 80 L 80 (80) T ss_pred C T ss_conf 9 No 13 >cd04596 CBS_pair_DRTGG_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a DRTGG domain upstream. The function of the DRTGG domain, named after its conserved residues, is unknown. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.58 E-value=4.1e-14 Score=122.18 Aligned_cols=106 Identities=24% Similarity=0.386 Sum_probs=95.2 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHHHHHH-CCCEEECCCCCHHHHHH Q ss_conf 02899608999899999998579978888767855278888888876202024660168873-27435238999899999 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNFKRSI-RKPLVVHENISVLKLME 392 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~~li-r~~~~Vpe~~~l~~lL~ 392 (523) .|++++..++|++++.+.+.+++||++||++++ ++++|+++.||++... .+....+.+ ++|.++++++++.++++ T Consensus 2 ~dv~~l~p~~tv~~a~~lm~~~~~~~~PVvd~~-~~lvGivt~~Di~~~~---~~~~v~~iM~~~~~~v~~~~~~~~~~~ 77 (108) T cd04596 2 EDTGYLTTTDTVKDWHELNKETGHSRFPVVDEK-NKVVGIVTSKDVAGKD---PDTTIEKVMTKNPITVNPKTSVASVAH 77 (108) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CEEEEEEEHHHHHHCC---CCCCHHHHEECCCCCCCCCCCHHHHHH T ss_conf 988898999979999999998599889999689-9099999979975179---887267850047620599988999999 Q ss_pred HHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 8850386277752286742565259999998 Q gi|255764470|r 393 RLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 393 ~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) .|.+++.+.+.|+||+|...|+||.+|++++ T Consensus 78 ~m~~~~~~~lPVVde~~~lvGiIT~~Dil~A 108 (108) T cd04596 78 MMIWEGIEMLPVVDDNKKLLGIISRQDVLKA 108 (108) T ss_pred HHHHHCCCEEEEECCCCEEEEEEEHHHHHCC T ss_conf 9998498778999169959999997996319 No 14 >cd04639 CBS_pair_26 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.50 E-value=6.1e-13 Score=113.58 Aligned_cols=108 Identities=24% Similarity=0.355 Sum_probs=96.0 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC-CCHHHHHH-CCCEEECCCCCHHHHH Q ss_conf 028996089998999999985799788887678552788888888762020246-60168873-2743523899989999 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG-SMNFKRSI-RKPLVVHENISVLKLM 391 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~-~~~~~~li-r~~~~Vpe~~~l~~lL 391 (523) +++.++..++|++++.+.+.+++++++||.+++ ++++|+++.+|+++....+. .....+.+ +++..++++.++.+++ T Consensus 1 T~~~Tv~p~~tl~~a~~~~~~~~~~~~PVvd~~-g~lvGivt~~di~~~l~~~~~~~~V~~vM~~~v~tv~~~~~l~~~~ 79 (111) T cd04639 1 THFETLSPADTLDDAADALLATTQHEFPVVDGD-GHLVGLLTRDDLIRALAEGGPDAPVRGVMRRDFPTVSPSATLDAVL 79 (111) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECC-CCEEEEEEHHHHHHHHHCCCCCCCHHHHCCCCCEEECCCCCHHHHH T ss_conf 988792998979999999997199789999389-9889999899999998726999705763546987998999499999 Q ss_pred HHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 9885038627775228674256525999999 Q gi|255764470|r 392 ERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 392 ~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) +.|.+++.+.+.|+||+|...|+||.+|++| T Consensus 80 ~~m~~~~~~~lpVVd~~g~lvGivT~~Di~E 110 (111) T cd04639 80 RLMQQGGAPAVPVVDGSGRLVGLVTLENVGE 110 (111) T ss_pred HHHHHCCCCEEEEECCCCEEEEEEEHHHHHC T ss_conf 9999589988989957992999999788544 No 15 >cd04631 CBS_pair_18 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.42 E-value=5e-12 Score=106.93 Aligned_cols=110 Identities=20% Similarity=0.288 Sum_probs=94.8 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC--------------CCHHHHH-HCCC Q ss_conf 028996089998999999985799788887678552788888888762020246--------------6016887-3274 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG--------------SMNFKRS-IRKP 378 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~--------------~~~~~~l-ir~~ 378 (523) .++++++.++|+.++.+.+.+++++++||++++.++++|++..+|++.....+. ..+..++ .+++ T Consensus 1 r~vvtv~~~~tl~ea~~~m~~~~v~~~PVvd~~~~~lvGiit~~Di~~~~~~~~~~~~~~~~~~~~~~~~~v~~im~~~v 80 (125) T cd04631 1 KDVVTVPPTTPIMEAAKIMVRNGFRRLPVVDEGTGKLVGIITATDILKYLGGGEKFNKIKTGNGLEAINEPVRSIMTRNV 80 (125) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEHHHHHHHHHCCCCCCHHHHCCCHHHCCCCHHHHCCCCC T ss_conf 99779799094999999999839988999979999599999999999987335431012101102321789999566898 Q ss_pred EEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 352389998999998850386277752286742565259999998 Q gi|255764470|r 379 LVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 379 ~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) .+++++.++.++++.|.+++.|-..|+||.|...|+||..|++.+ T Consensus 81 ~tv~~~~~l~~a~~~m~~~~i~~lpVvd~~g~~~GiIt~~DilrA 125 (125) T cd04631 81 ITITPDDSIKDAAELMLEKRVGGLPVVDDDGKLVGIVTERDLLKA 125 (125) T ss_pred EEECCCCCHHHHHHHHHHCCCCEEEEEECCCEEEEEEEHHHHHCC T ss_conf 899799998999999986395099999589939999988997369 No 16 >cd04610 CBS_pair_ParBc_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a ParBc (ParB-like nuclease) domain downstream. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.40 E-value=4.6e-12 Score=107.20 Aligned_cols=105 Identities=24% Similarity=0.408 Sum_probs=93.5 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHHHHHH-CCCEEECCCCCHHHHHH Q ss_conf 02899608999899999998579978888767855278888888876202024660168873-27435238999899999 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNFKRSI-RKPLVVHENISVLKLME 392 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~~li-r~~~~Vpe~~~l~~lL~ 392 (523) .++++++.+++++++.+.+.+++++++||+++ ++++|+++.+|++.. ..+.+.++++ +++.++++++++.++++ T Consensus 2 kdvvtv~~d~tv~ea~~~m~~~~~~~~pVvd~--~~lvGivt~~Dll~~---~~~~~v~~iM~~~~~tv~~~~~l~~a~~ 76 (107) T cd04610 2 RDVITVSPDNTVKDVIKLIKETGHDGFPVVDN--GKVVGIVSARDLLGK---DPDETVEEIMSKDLVVAVPEMDIMDAAR 76 (107) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHCCC---CCCCCHHHHCCCCCCCCCCCCCHHHHHH T ss_conf 88099899295999999999759978999989--999998876453159---9877267860578601699986999999 Q ss_pred HHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 8850386277752286742565259999998 Q gi|255764470|r 393 RLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 393 ~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) .|.+++.+...|+||.|...|+||..|++.+ T Consensus 77 ~m~~~~~~~lpVvde~g~lvGiiT~~Dilra 107 (107) T cd04610 77 VMFRTGISKLPVVDENNNLVGIITNTDVIRS 107 (107) T ss_pred HHHHHCCCEEEEECCCCEEEEEEEHHHHHCC T ss_conf 9998299689699239989999994784439 No 17 >cd04612 CBS_pair_SpoIVFB_EriC_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with either the SpoIVFB domain (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) or the chloride channel protein EriC. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A ), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. EriC is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase an Probab=99.39 E-value=4.1e-12 Score=107.54 Aligned_cols=108 Identities=19% Similarity=0.346 Sum_probs=94.6 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCH--HH-HHHCCCEEECCCCCHHHH Q ss_conf 028996089998999999985799788887678552788888888762020246601--68-873274352389998999 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMN--FK-RSIRKPLVVHENISVLKL 390 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~--~~-~lir~~~~Vpe~~~l~~l 390 (523) .++++++.++|+.++.+.+.+++++++||.++ ++++|++..+|++.....+.... .. -..+++.++++++++.++ T Consensus 1 Pdvvtv~p~~tl~~a~~~m~~~~~~~lpVvd~--~~l~Givt~~di~~~~~~~~~~~~~v~~im~~~~~~v~~~~~l~~a 78 (111) T cd04612 1 PDVVTVPVDLTVDEVLALMFGERHRGYPVVDD--GRLVGIVTLADIRRVPAEGREATVLVGDVMTRDPVTASPDETLRDA 78 (111) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCCCCHHHH T ss_conf 98499999295999999999719948999999--9999999899999877514777756999970796052999979999 Q ss_pred HHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 998850386277752286742565259999998 Q gi|255764470|r 391 MERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 391 L~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) ++.|++++.+...|+|+.|.+.|++|..|++.+ T Consensus 79 ~~~m~~~~~~~lpVvd~~g~l~GiiT~~DilrA 111 (111) T cd04612 79 LKRMAERDIGRLPVVDDSGRLVGIVSRSDLLRA 111 (111) T ss_pred HHHHHHCCCCEEEEEECCCEEEEEEEHHHHHCC T ss_conf 999997698199999689989999986886349 No 18 >COG4109 Predicted transcriptional regulator containing CBS domains [Transcription] Probab=99.37 E-value=1.1e-11 Score=104.37 Aligned_cols=129 Identities=26% Similarity=0.402 Sum_probs=112.8 Q ss_pred HHHHHHH-----CCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC Q ss_conf 9999751-----13786674653400028996089998999999985799788887678552788888888762020246 Q gi|255764470|r 293 DMVQSVL-----TLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG 367 (523) Q Consensus 293 ~mi~~vL-----~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~ 367 (523) .||..++ .=+=.+|.|+|+|..+-.++..++++++..+...+++||||||.+.+ -.++|+|..||+.... + T Consensus 172 tmIN~Al~n~lIKkdI~~Vedi~~P~~~~~yL~~~d~v~d~~~l~~kt~~sRfPVvn~~-~kvvGvVt~rDv~~~~---~ 247 (432) T COG4109 172 TMINKALSNQLIKKDIITVEDIMTPLEDTSYLRETDTVEDWLDLVEKTGHSRFPVVNRS-MKVVGVVTMRDVLDKK---P 247 (432) T ss_pred HHHHHHHHHHHHHHHEEEHHHHCCCCCCCEECCCCCCHHHHHHHHHHCCCCCCCEECCC-CEEEEEEEEHHHHCCC---C T ss_conf 99989987766644325587744644343232464509999999997198734133144-2699999834644479---9 Q ss_pred CCHHHHHH-CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHC Q ss_conf 60168873-27435238999899999885038627775228674256525999999870 Q gi|255764470|r 368 SMNFKRSI-RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIA 425 (523) Q Consensus 368 ~~~~~~li-r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~Iv 425 (523) +.+++..+ |.|.+|..++++...-+.|-.+...|..|+|+.....|+||..|+++.+- T Consensus 248 ~t~ieKVMtknp~tv~~~tsVAsvaq~MiwE~iem~PVv~~n~~llGiitR~dvlk~lq 306 (432) T COG4109 248 STTIEKVMTKNPITVRAKTSVASVAQMMIWEGIEMLPVVDSNNTLLGIITRQDVLKSLQ 306 (432) T ss_pred CCCHHHHHCCCCEEECCCCHHHHHHHHHHHCCCEEEEEECCCCEEEEEEEHHHHHHHHH T ss_conf 86177774269755425226889988887645526568837861898988999999998 No 19 >cd04611 CBS_pair_PAS_GGDEF_DUF1_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with a PAS domain, a GGDEF (DiGuanylate-Cyclase (DGC) domain, and a DUF1 domain downstream. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CB Probab=99.34 E-value=3.2e-11 Score=100.99 Aligned_cols=107 Identities=21% Similarity=0.289 Sum_probs=95.2 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCC--CHHHHHH-CCCEEECCCCCHHHH Q ss_conf 0289960899989999999857997888876785527888888887620202466--0168873-274352389998999 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGS--MNFKRSI-RKPLVVHENISVLKL 390 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~--~~~~~li-r~~~~Vpe~~~l~~l 390 (523) ++++++..++|+.++.+.+.+++++++||.++ ++++|++..+|++.....+.+ ....+++ +|+.+++++.++.++ T Consensus 1 t~vvTv~p~~tv~ea~~~m~~~~i~~~~V~~~--~~~vGiiT~~Di~~~~~~~~~~~~~v~~iM~~~~~tv~~~~~~~~a 78 (111) T cd04611 1 TQILTCPPDTSLAEAASRMRERRISSIVVVDD--GRPLGIVTERDILRLLASGPDLQTPVGEVMSSPLLTVPADTSLYDA 78 (111) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEEHHHHHHHHHCCCCCCCHHHHHCCCCEEEECCCCHHHH T ss_conf 97199889697999999999719988999979--9999999855989987424887788899710683399789999999 Q ss_pred HHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 99885038627775228674256525999999 Q gi|255764470|r 391 MERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 391 L~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) ++.|++++.+-..|+||.|...|++|..|++. T Consensus 79 ~~~m~~~~i~~lpVvd~~~~~vGiiT~~Dilr 110 (111) T cd04611 79 RQLMREHGIRHLVVVDDDGELLGLLSQTDLLQ 110 (111) T ss_pred HHHHHHHCCEEEEEECCCCEEEEEEEHHHHHC T ss_conf 99999718739999946998999998568427 No 20 >cd04600 CBS_pair_HPP_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the HPP motif domain. These proteins are integral membrane proteins with four transmembrane spanning helices. The function of these proteins is uncertain, but they are thought to be transporters. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.32 E-value=5.1e-11 Score=99.56 Aligned_cols=109 Identities=22% Similarity=0.324 Sum_probs=93.9 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCC-------------CCCHHHHHH-CCCE Q ss_conf 02899608999899999998579978888767855278888888876202024-------------660168873-2743 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEE-------------GSMNFKRSI-RKPL 379 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~-------------~~~~~~~li-r~~~ 379 (523) .++++++.++++.++.+.+.+++++++||.+++ ++++|++..+|++...... ......+.+ +++. T Consensus 2 rdvvtv~~~~tl~~a~~~m~~~~~~~lpVvd~~-g~lvGiit~~Dll~~~~~~~~~~~~~~~~~~~~~~~v~~iM~~~~~ 80 (124) T cd04600 2 RDVVTVTPDTSLEEAWALLRRHRIKALPVVDGD-RRLVGIVTQRDLLRHARPDGRRPLRGRLRGRDKPETVGDIMSPPVV 80 (124) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECC-CCEEEEEEHHHHHHHHHCCCCCCCHHHHHCCCCCCCHHHHCCCCCE T ss_conf 896998993979999999997299789999569-8088885699999877504653101222105578898995779986 Q ss_pred EECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 52389998999998850386277752286742565259999998 Q gi|255764470|r 380 VVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 380 ~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) .+++++++.++++.|.+++.|...|+||.|...|+||..|++.+ T Consensus 81 ~v~~~~~l~~~~~~m~~~~~~~lpVvd~~~~lvGiIt~~Dil~A 124 (124) T cd04600 81 TVRPDTPIAELVPLLADGGHHHVPVVDEDRRLVGIVTQTDLIAA 124 (124) T ss_pred EECCCCCHHHHHHHHHHCCCEEEEEEECCCEEEEEEEHHHHHCC T ss_conf 99898979999999997793499999579989999978997469 No 21 >cd04613 CBS_pair_SpoIVFB_EriC_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with either the SpoIVFB domain (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) or the chloride channel protein EriC. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A ), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. EriC is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase a Probab=99.32 E-value=4.1e-11 Score=100.24 Aligned_cols=109 Identities=22% Similarity=0.333 Sum_probs=93.3 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC---CCHHHHHH-CCCEEECCCCCHHH Q ss_conf 028996089998999999985799788887678552788888888762020246---60168873-27435238999899 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG---SMNFKRSI-RKPLVVHENISVLK 389 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~---~~~~~~li-r~~~~Vpe~~~l~~ 389 (523) .+++++..++++.++.+.+.+++++++||++++ ++++|+++.+|++....++. .....+.+ ++|.++++++++.+ T Consensus 1 rdvvtv~~~~tl~ea~~~m~~~~~~~~pVvd~~-~~lvGiit~~Dl~~~l~~~~~~~~~~v~~im~~~~~~v~~~~~l~~ 79 (114) T cd04613 1 RDVVTIPEDTPLNELLDVIAHSPENNFPVVDDD-GRLVGIVSLDDIREILFDPSLYDLVVASDIMTKPPVVVYPEDSLED 79 (114) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHHHHHCCCCCCCEEEHHCCCCCEEEECCCCHHH T ss_conf 997695890989999999986199789999289-9499999999999999705665572931203629659913678899 Q ss_pred HHHHHHHCCCEEEEEEC-CCCCEEEEEEHHHHHHH Q ss_conf 99988503862777522-86742565259999998 Q gi|255764470|r 390 LMERLRKSSQTFVMVLD-EYGVLEGMITPANILEA 423 (523) Q Consensus 390 lL~~fr~~~~~lAiVvD-E~G~~~GIVTleDIlE~ 423 (523) +++.|++++.+...|+| |.|...|++|.+|++.+ T Consensus 80 a~~~m~~~~~~~lPVVd~~~g~lvGiit~~Dil~A 114 (114) T cd04613 80 ALKKFEDSDYEQLPVVDDDPGKLLGILSRSDLLSA 114 (114) T ss_pred HHHHHHHCCCEEEEEEECCCCEEEEEEEHHHHHCC T ss_conf 99999975975999998999989999998995359 No 22 >cd04588 CBS_pair_CAP-ED_DUF294_assoc_arch This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the archaeal CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. Probab=99.31 E-value=4.2e-11 Score=100.17 Aligned_cols=108 Identities=18% Similarity=0.267 Sum_probs=94.7 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCC-CHHHHHH-CCCEEECCCCCHHHHH Q ss_conf 0289960899989999999857997888876785527888888887620202466-0168873-2743523899989999 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGS-MNFKRSI-RKPLVVHENISVLKLM 391 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~-~~~~~li-r~~~~Vpe~~~l~~lL 391 (523) .++++++.++++.|+.+.+.+++++++||.++ ++++|++..+|++.....+.. ....+++ +++.++++++++.+++ T Consensus 1 r~vvtv~p~~sl~ea~~~m~~~~i~~~pVvd~--~~lvGiit~~Di~~~~~~~~~~~~V~~~M~~~~~tv~~d~~l~~~~ 78 (110) T cd04588 1 KPLITLNPNATLREAARLFNTHHIHGAPVVDD--GKLVGIVTLSDIAHAIARGLELAKVKDVMTKDVITIDEDEQLYDAI 78 (110) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHHHHHHCCCCCCHHHHHCCCCCCEECCCCCHHHHH T ss_conf 99479689498999999999709988999989--9999999668989998737977803452048972985899699999 Q ss_pred HHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 98850386277752286742565259999998 Q gi|255764470|r 392 ERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 392 ~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) +.|++++.+-..|+|+.|...||+|..|++.+ T Consensus 79 ~~m~~~~v~~l~Vvd~~~~lvGIiT~~Dllka 110 (110) T cd04588 79 RLMNKHNVGRLIVTDDEGRPVGIITRTDILRS 110 (110) T ss_pred HHHHHCCCCEEEEEECCCEEEEEEEHHHHHCC T ss_conf 99987799789999669999999982483468 No 23 >COG3448 CBS-domain-containing membrane protein [Signal transduction mechanisms] Probab=99.30 E-value=1.1e-09 Score=89.81 Aligned_cols=270 Identities=20% Similarity=0.236 Sum_probs=166.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH-HHHHHHHHCCC Q ss_conf 99999999985778999998541469999999999999999989999999848528999999999999-99999774573 Q gi|255764470|r 134 LQIVILDLVFSLDSVVTAIGMVQHFSVMAIAVAVSALMMMAVSQPMIRYISRHTTVVILCLGFLLMIG-FLLIIEGLHFD 212 (523) Q Consensus 134 ~~I~v~D~vFSlDsVitavg~~~~~~i~~~a~~isi~~m~~~s~~i~~~i~~~p~~~~l~l~fl~~iG-~~L~~eg~~~~ 212 (523) +.|+-=+++=++=+|..+-=..|.+.-+-.|+.++|..|.+.- --||-=.-.||. .++| -....-|+.|- T Consensus 82 wsiiGGNlvaAlvgvtva~~vgd~~la~~lavsLaI~~M~~~r-------cLHPPsgAvALt--avlgGpav~~~g~~F~ 152 (382) T COG3448 82 WSIIGGNLVAALVGVTVAYFVGDPVLASGLAVSLAIGAMFALR-------CLHPPSGAVALT--AVLGGPAVHRLGYNFV 152 (382) T ss_pred HHHHCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHC-------CCCCCCHHHHHH--HHHCCCCCCCCCCCCE T ss_conf 6664223999998665565527807888999999999999861-------778984078899--8846840104787705 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CCCCC---CCCHHHHHHHHHHHHHCCCCC Q ss_conf 37027889989999999998999998755430103777667656744156-88876---679888999999677466789 Q gi|255764470|r 213 IPKGYLYASIGFSGIIEFFNQVARRNREQLMSPSRLRARTADAVLRLLGG-KPIQP---QGLNVKADVLLPTQHEKHIIS 288 (523) Q Consensus 213 ipk~y~y~~~~fs~~ve~~n~~~~~~~~~~~~~~~~~eRtA~aVLRLlG~-~~~~~---~~~~ee~~~li~~~~~~g~l~ 288 (523) .----+-..+--.+. -++|-.++|+--++. ++ +-|| .-|- +++.. .-+.++++.-+..-.|-=-++ T Consensus 153 l~Pval~SliLv~~a-~lynnl~~r~YPh~~----~~-p~an----~HgT~Dppp~~rvgfs~~Dld~aL~~~~E~lDId 222 (382) T COG3448 153 LWPVALNSLILVGLA-LLYNNLTRRTYPHHW----LS-PPAN----LHGTADPPPSQRVGFSSEDLDAALQRLGETLDID 222 (382) T ss_pred EHHHHHHHHHHHHHH-HHHHHHHCCCCCCCC----CC-CCCC----CCCCCCCCCHHCCCCCHHHHHHHHHHCCCEECCC T ss_conf 336644209999999-998787458789865----78-8622----3577899931015888788999999628501578 Q ss_pred ---------HHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHH Q ss_conf ---------89999999751137866746534000289960899989999999857997888876785527888888887 Q gi|255764470|r 289 ---------DQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDL 359 (523) Q Consensus 289 ---------~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDL 359 (523) +.|.+-+.+ +..+.+..|||+ .|++++..+++++++.+.+++++..-+||.+++ .+++|+|..+|+ T Consensus 223 rddLe~llr~~elqa~~R--~~~~LtcadIMS--rdVvtv~~~ts~dhA~~ll~~H~ikaLPV~d~~-~rl~GiVt~~dl 297 (382) T COG3448 223 RDDLERLLRETELQALRR--RMGELTCADIMS--RDVVTVSTDTSIDHARKLLQEHRIKALPVLDEH-RRLVGIVTQRDL 297 (382) T ss_pred HHHHHHHHHHHHHHHHHH--HHCCCCHHHHCC--CCCEECCCCCCHHHHHHHHHHCCCCCCCCCCCC-CCEEEEEEHHHH T ss_conf 899999999999999999--856430888637--453501776775799999997686504532666-610032448877 Q ss_pred HHHHHCC--------CCCHHHHHHC-CCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCC Q ss_conf 6202024--------6601688732-743523899989999988503862777522867425652599999987066 Q gi|255764470|r 360 LRDLLEE--------GSMNFKRSIR-KPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGD 427 (523) Q Consensus 360 l~~~~~~--------~~~~~~~lir-~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGe 427 (523) +...... ++...+..+. +...+..+++..+++-.|-.++.|...|+|+.|...||||..|++-++.-+ T Consensus 298 ~~~a~~~p~qrlr~~~~~~vk~imt~~v~tv~pdtpa~~lvp~lad~g~H~lpvld~~g~lvGIvsQtDliaal~r~ 374 (382) T COG3448 298 LKHARPSPFQRLRFLRPPTVKGIMTTPVVTVRPDTPAVELVPRLADEGLHALPVLDAAGKLVGIVSQTDLIAALYRN 374 (382) T ss_pred HHCCCCCHHHHHHCCCCCCCCCCCCCCCEEECCCCCHHHHHHHHHCCCCCEEEEECCCCCEEEEEEHHHHHHHHHHH T ss_conf 60168566777531599863443457633535897088888876137731366886888388886278899999999 No 24 >cd04585 CBS_pair_ACT_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the acetoin utilization proteins in bacteria. Acetoin is a product of fermentative metabolism in many prokaryotic and eukaryotic microorganisms. They produce acetoin as an external carbon storage compound and then later reuse it as a carbon and energy source during their stationary phase and sporulation. In addition these CBS domains are associated with a downstream ACT domain, which is linked to a wide range of metabolic enzymes that are regulated by amino acid concentration. Pairs of ACT domains bind specifically to a particular amino acid leading to regulation of the linked enzyme. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The i Probab=99.29 E-value=7.7e-11 Score=98.23 Aligned_cols=108 Identities=22% Similarity=0.300 Sum_probs=93.1 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC-------------CCHHHHHH-CCCE Q ss_conf 028996089998999999985799788887678552788888888762020246-------------60168873-2743 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG-------------SMNFKRSI-RKPL 379 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~-------------~~~~~~li-r~~~ 379 (523) .+.++++.++|+.++.+.+.++++|++||.++ ++++|+++.+|++.....+. +....+.+ +++. T Consensus 1 rdpvti~~~~tl~ea~~~m~~~~~~~lpVvd~--~~lvGiit~~Di~~~~~~~~~~~~~~~~~~~~~~~~v~~im~~~~~ 78 (122) T cd04585 1 KNPITVTPDTSLMEALKLMKENSIRRLPVVDR--GKLVGIVTDRDLKLASPSKATTLDIWELYYLLSKIKVSDIMTRDPI 78 (122) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHHHHHCCCCCCCHHHHHCCCCCCCHHHHCCCCCE T ss_conf 99899689296999999999739988999989--9999999889999877504655333554212038899995778988 Q ss_pred EECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 52389998999998850386277752286742565259999998 Q gi|255764470|r 380 VVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 380 ~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) ++++++++.++++.|.+++.+...|+|+.|.+.|+||..||+.+ T Consensus 79 tv~~~~~l~~a~~~m~~~~i~~lpVvd~~g~lvGiiT~~DilkA 122 (122) T cd04585 79 TVSPDASVEEAAELMLERKISGLPVVDDQGRLVGIITESDLFRA 122 (122) T ss_pred EEECCCCHHHHHHHHHHCCCCEEEEEECCCEEEEEEEHHHHHCC T ss_conf 99388669999999987598789999589989999987897469 No 25 >cd04642 CBS_pair_29 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.29 E-value=1.2e-10 Score=96.75 Aligned_cols=108 Identities=23% Similarity=0.222 Sum_probs=91.0 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCC-----------------CHHHHHHC Q ss_conf 0289960899989999999857997888876785527888888887620202466-----------------01688732 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGS-----------------MNFKRSIR 376 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~-----------------~~~~~lir 376 (523) .++++++.++++.|+.+.+.++++|.+||.+++ ++++|++..+|+......... .......+ T Consensus 1 s~Vitv~~~~~l~eA~~lm~~~~i~~lPVvd~~-g~lvGiis~~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (126) T cd04642 1 SKVVSIDSDERVLDAFKLMRKNNISGLPVVDEK-GKLIGNISASDLKGLLLSPDDLLLYRTITFKELSEKFTDSDGVKSR 79 (126) T ss_pred CCEEEECCCCCHHHHHHHHHHCCCCEEEEEECC-CEEEEEEEHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCEEEECCCC T ss_conf 957999995999999999998098789999289-9099999989976555144303444455554203344234202447 Q ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 7435238999899999885038627775228674256525999999 Q gi|255764470|r 377 KPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 377 ~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) ++.++++++++.++++.|.+++.|-..|+||.|...|+||..|++. T Consensus 80 ~~itv~~~~~l~~~~~~m~~~~i~~lpVVD~~~~lvGIIs~~Dil~ 125 (126) T cd04642 80 PLITCTPSSTLKEVITKLVANKVHRVWVVDEEGKPIGVITLTDIIS 125 (126) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHC T ss_conf 9989999693999999999869749999989995999999289421 No 26 >cd04594 CBS_pair_EriC_assoc_archaea This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the EriC CIC-type chloride channels in archaea. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS do Probab=99.29 E-value=6e-11 Score=99.04 Aligned_cols=101 Identities=20% Similarity=0.257 Sum_probs=88.5 Q ss_pred EEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHHHHHHCCCEEECCCCCHHHHHHHHHH Q ss_conf 99608999899999998579978888767855278888888876202024660168873274352389998999998850 Q gi|255764470|r 317 VWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNFKRSIRKPLVVHENISVLKLMERLRK 396 (523) Q Consensus 317 ~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~~lir~~~~Vpe~~~l~~lL~~fr~ 396 (523) +++..++|++++.+.+.+++++++||.++ ++++|++..+|+..... .+ ..+.+.+++.++.+++++.++++.|.+ T Consensus 4 VtV~pd~tV~eA~~lM~~~~i~~lPVvd~--~klvGIvt~rDi~~~~~--~~-v~d~Mt~~v~tv~p~~~l~ea~~lM~~ 78 (104) T cd04594 4 IKVKDYDKVYEAKRIMIENDLLSLPVVDY--NKFLGAVYLKDIENATY--GD-VVDYIVRGIPYVRLTSTAEEAWEVMMK 78 (104) T ss_pred EEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHHHCCC--CC-HHHEEECCCEEECCCCCHHHHHHHHHH T ss_conf 79899993999999999839887889989--99999999799652469--98-445476398889998979999999998 Q ss_pred CCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 386277752286742565259999998 Q gi|255764470|r 397 SSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 397 ~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) ++.+...|+|+ |...|+||..||+|+ T Consensus 79 ~~i~~lPVvd~-gklvGIIT~~Dil~A 104 (104) T cd04594 79 NKTRWCPVVDD-GKFKGIVTLDSILDA 104 (104) T ss_pred CCCCEEEEEEC-CEEEEEEEHHHHHCC T ss_conf 69787899989-999999996895378 No 27 >cd04635 CBS_pair_22 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.29 E-value=9.7e-11 Score=97.50 Aligned_cols=109 Identities=22% Similarity=0.263 Sum_probs=92.2 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHC------------CCCCHHHHHH-CCCEE Q ss_conf 0289960899989999999857997888876785527888888887620202------------4660168873-27435 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLE------------EGSMNFKRSI-RKPLV 380 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~------------~~~~~~~~li-r~~~~ 380 (523) .+++++..++++.++.+.+.++++|++||.+++ ++++|+++.+|+++.... ......++.+ +++++ T Consensus 1 r~vitv~~d~~l~~a~~~m~~~~i~~lPVVd~~-g~lvGiit~~Dll~~~~~~~~~~~~~~~~~~~~~~v~~iM~~~~it 79 (122) T cd04635 1 REPVTCTPDDPVSKVWDLMLESGFTGLPVVQKA-GELIGIITRRDIIRAGSVRTSVEDQQRTQTKASPTVEKIMSTPVYS 79 (122) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECC-CCEEEEEEHHHHHHHHHHCCCCHHHHHCCCCCCCCHHHHCCCCCEE T ss_conf 999996892979999999997199489999189-8299999989987655400120034431453468999968689879 Q ss_pred ECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 2389998999998850386277752286742565259999998 Q gi|255764470|r 381 VHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 381 Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) +++++++.++.+.|.+++.+-..|+|+.|...|+||..|++.+ T Consensus 80 v~~~~~l~~a~~~m~~~~i~~lpVvd~~g~lvGiIt~~DilrA 122 (122) T cd04635 80 VTPDDSIATAVELMLEHDIGRLPVVNEKDQLVGIVDRHDVLKA 122 (122) T ss_pred ECCCCCHHHHHHHHHHHCCCEEEEECCCCEEEEEEEHHHHHCC T ss_conf 9896939999998898099889899379989999997996369 No 28 >cd04643 CBS_pair_30 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.28 E-value=1.1e-10 Score=97.15 Aligned_cols=107 Identities=21% Similarity=0.338 Sum_probs=90.1 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCC--------CHHHHH-HCCCEEECCC Q ss_conf 0289960899989999999857997888876785527888888887620202466--------016887-3274352389 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGS--------MNFKRS-IRKPLVVHEN 384 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~--------~~~~~l-ir~~~~Vpe~ 384 (523) .++++++.++|+.++.+.+.++++|++||.+++ .+++|++..+|++......+. ....+. .+++.++.++ T Consensus 1 k~Vvtv~~~~tl~~A~~~m~~~~i~~lPVvd~~-gklvGiit~~Di~~~~~~~~~~~~~~~~~~~V~~iM~~~~~tv~~~ 79 (116) T cd04643 1 EEVAYVQDTNTLRHALLVLTKHGYSAIPVLDKE-GKYVGTISLTDILWKLKGLENLDLERLVDLKVIDVMNTDVPVIIDD 79 (116) T ss_pred CCEEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHHHHCCCCCCHHHHCCCCHHHHHCCCCEEEECC T ss_conf 976999994929999999997499879898699-9499998899999987436532202311668779515698899079 Q ss_pred CCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 998999998850386277752286742565259999998 Q gi|255764470|r 385 ISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 385 ~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) .++.++++.|.+++ +..|+|+.|...||||..|++.+ T Consensus 80 ~~i~~~~~~m~~~~--~lpVVD~~~~lvGIITr~DilkA 116 (116) T cd04643 80 ADIEEILHLLIDQP--FLPVVDDDGIFIGIITRREILKA 116 (116) T ss_pred CCHHHHHHHHHHCC--EEEEEECCCEEEEEEEHHHHHCC T ss_conf 99999999998789--89999459999999992795579 No 29 >cd04632 CBS_pair_19 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.28 E-value=1.2e-10 Score=96.76 Aligned_cols=109 Identities=19% Similarity=0.302 Sum_probs=92.2 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC----------------CCHHHHH-HC Q ss_conf 028996089998999999985799788887678552788888888762020246----------------6016887-32 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG----------------SMNFKRS-IR 376 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~----------------~~~~~~l-ir 376 (523) .+++++..++++.++.+.+.+++++++||++++ ++++|++..+|++.....+. +...++. .+ T Consensus 1 kdvvTv~~~~~l~~a~~lm~~~~i~~lPVvd~~-g~lvGiiT~~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~V~~iM~~ 79 (128) T cd04632 1 EDVITVREDDSVGKAINVLREHGISRLPVVDDN-GKLTGIVTRHDIVDFVVRDRDKARTGDRSGEKERMLDLPVYDAMSS 79 (128) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHHHHCCCHHCCHHHHCCHHHHHHCCCHHHHCCC T ss_conf 998698995929999999998399779999689-9789999889999988532101011221001223406799996679 Q ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEE--CCCCCEEEEEEHHHHHHH Q ss_conf 74352389998999998850386277752--286742565259999998 Q gi|255764470|r 377 KPLVVHENISVLKLMERLRKSSQTFVMVL--DEYGVLEGMITPANILEA 423 (523) Q Consensus 377 ~~~~Vpe~~~l~~lL~~fr~~~~~lAiVv--DE~G~~~GIVTleDIlE~ 423 (523) +++++++++++.++++.|.+++.+-..|+ |+.|...||||..|++.+ T Consensus 80 ~vitv~~~~~i~~a~~~M~~~~i~~lpVv~~dd~~~lvGIiT~~DilrA 128 (128) T cd04632 80 PVITASPNDSVRDAVDRMLENDDSSVVVVTPDDDTKVVGILTKKDVLRA 128 (128) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEEEECCCCEEEEEEEHHHHHCC T ss_conf 9879999293999999999779978966847899989999996996469 No 30 >cd04624 CBS_pair_11 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.26 E-value=6.4e-11 Score=98.83 Aligned_cols=109 Identities=20% Similarity=0.254 Sum_probs=95.4 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCC--CHHHHH-HCCCEEECCCCCHHHH Q ss_conf 0289960899989999999857997888876785527888888887620202466--016887-3274352389998999 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGS--MNFKRS-IRKPLVVHENISVLKL 390 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~--~~~~~l-ir~~~~Vpe~~~l~~l 390 (523) .++++++.++++.++.+.+.+++++.+||++++ ++++|++..+|+++....+.+ ....++ .+++.+++++.++.++ T Consensus 1 r~vvtv~~~~tl~ea~~~m~~~~~~~~~Vvd~~-~~lvGiiT~~Di~~~~~~~~~~~~~v~~~m~~~~~tv~~~~~l~~a 79 (112) T cd04624 1 RPVVTVDPDTSIREAAKLMAEENVGSVVVVDPD-ERPIGIVTERDIVRAVAAGIDLDTPVSEIMTRDLVTVDPDEPVAEA 79 (112) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEECCHHHHHHHCCCCCCCCEEEEECCCCEEECCCCCHHHH T ss_conf 997897893959999999986299889999399-8299999820458898648743442045651671678799989999 Q ss_pred HHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 998850386277752286742565259999998 Q gi|255764470|r 391 MERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 391 L~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) ++.|++++.+-..|+|+.|...|++|..|++.| T Consensus 80 ~~~m~~~~~~~l~Vvd~~g~lvGiit~~Dilke 112 (112) T cd04624 80 AKLMRKNNIRHHLVVDKGGELVGVISIRDLVRE 112 (112) T ss_pred HHHHHHCCCEEEEEECCCCEEEEEEEHHHHCCC T ss_conf 999997498399899899999999980772569 No 31 >cd04605 CBS_pair_MET2_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the MET2 domain. Met2 is a key enzyme in the biosynthesis of methionine. It encodes a homoserine transacetylase involved in converting homoserine to O-acetyl homoserine. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.26 E-value=5.7e-11 Score=99.22 Aligned_cols=107 Identities=14% Similarity=0.266 Sum_probs=95.7 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHHHHHH-CCCEEECCCCCHHHHHH Q ss_conf 02899608999899999998579978888767855278888888876202024660168873-27435238999899999 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNFKRSI-RKPLVVHENISVLKLME 392 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~~li-r~~~~Vpe~~~l~~lL~ 392 (523) .++++++.++++.++.+.+.+++++.+||.+++ ++++|++..+|+.+....+.. ..++++ +++.+++++.++.++++ T Consensus 2 r~vvtv~~~~tv~~a~~~m~~~~~~~lpVvd~~-~~~vGiit~~Di~~~~~~~~~-~v~~im~~~~~~~~~~~~~~~a~~ 79 (110) T cd04605 2 RPVVTISEDASIKEAAKLMIEENINHLPVVDED-GRLVGIVTSWDISKAVARDKK-SVEDIMTRNVITATPDEPIDVAAR 79 (110) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEECHHHHHHHHCCCC-CHHHCEEECHHHHHHHHHHHHHHH T ss_conf 899898993999999999997299889999799-919999962278889870797-633604300120146558999999 Q ss_pred HHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 885038627775228674256525999999 Q gi|255764470|r 393 RLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 393 ~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) .|++++.+...|+|+.|...|++|..|++. T Consensus 80 ~m~~~~i~~lPVvD~~~~lvGiiT~~Di~k 109 (110) T cd04605 80 KMERHNISALPVVDAENRVIGIITSEDISK 109 (110) T ss_pred HHHHCCCEEEEEECCCCEEEEEEEHHHHCC T ss_conf 999839619999918998999999377211 No 32 >cd04601 CBS_pair_IMPDH This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the inosine 5' monophosphate dehydrogenase (IMPDH) protein. IMPDH is an essential enzyme that catalyzes the first step unique to GTP synthesis, playing a key role in the regulation of cell proliferation and differentiation. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in IMPDH have been associated with retinitis pigmentosa. Probab=99.25 E-value=9e-11 Score=97.73 Aligned_cols=106 Identities=23% Similarity=0.274 Sum_probs=91.3 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHHHHHH-C-CCEEECCCCCHHHHH Q ss_conf 02899608999899999998579978888767855278888888876202024660168873-2-743523899989999 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNFKRSI-R-KPLVVHENISVLKLM 391 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~~li-r-~~~~Vpe~~~l~~lL 391 (523) .+++++..++|+.++.+.+.+++++++||++++ ++++|++..+|+..... .+.+..+++ + +....|+++++.+++ T Consensus 2 rdvvtv~~~~sl~~a~~~m~~~~~~~~pVvd~~-~~l~Giit~~Dl~~~~~--~~~~v~~iM~~~~~~~~~~~~~~~~~~ 78 (110) T cd04601 2 RDPITVSPDATVAEALELMAEYGISGLPVVDDD-GKLVGIVTNRDLRFETD--LDKPVSEVMTPENLLTTVEGTSLEEAL 78 (110) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECC-CEEEEEEEHHHHHHHCC--CCCCCEEEEECCCEEEECCCCCHHHHH T ss_conf 889897993969999999998499889999489-96987887767765057--678603577057508704768799999 Q ss_pred HHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 9885038627775228674256525999999 Q gi|255764470|r 392 ERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 392 ~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) +.|++++.+...|+||.|...|+||..|++. T Consensus 79 ~~m~~~~i~~lpVvd~~~~lvGiIt~~Dilk 109 (110) T cd04601 79 ELLHEHKIEKLPVVDDEGKLKGLITVKDIEK 109 (110) T ss_pred HHHHHCCCEEEEEECCCCEEEEEEEHHHHCC T ss_conf 9998668519999988999999999598437 No 33 >TIGR01302 IMP_dehydrog inosine-5'-monophosphate dehydrogenase; InterPro: IPR005990 Synonyms: Inosine-5'-monophosphate dehydrogenase, Inosinic acid dehydrogenase IMP dehydrogenase (1.1.1.205 from EC,IMPDH) catalyzes the rate-limiting reaction of de novo GTP biosynthesis, the NAD-dependent reduction of IMP into XMP .Inosine 5-phosphate + NAD+ + H2O = xanthosine 5-phosphate + NADH IMP dehydrogenase is associated with cell proliferation and is a possible target for cancer chemotherapy. Mammalian and bacterial IMPDHs are tetramers of identical chains. There are two IMP dehydrogenase isozymes in humans . IMP dehydrogenase nearly always contains a long insertion that has two CBS domains within it and adopts a TIM barrel structure.; GO: 0003938 IMP dehydrogenase activity, 0006177 GMP biosynthetic process. Probab=99.25 E-value=3e-10 Score=93.90 Aligned_cols=176 Identities=19% Similarity=0.251 Sum_probs=140.3 Q ss_pred HHHHHHHHHHHHHCCC------C-CHHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEE Q ss_conf 8889999996774667------8-98999999975113786674653400028996089998999999985799788887 Q gi|255764470|r 271 NVKADVLLPTQHEKHI------I-SDQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVA 343 (523) Q Consensus 271 ~ee~~~li~~~~~~g~------l-~~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~ 343 (523) -.|.++-+++..+.|. . -+++-+|+++| ++.+..++-|.|..++..+.+++++.+...++++|=+||. T Consensus 45 VTE~~MAiAMA~~GGiGVIH~N~~~E~Qae~V~~V-----Kr~e~g~i~redp~t~~P~~tv~~~~~l~~~~gisG~PVv 119 (476) T TIGR01302 45 VTESRMAIAMAREGGIGVIHRNMSIERQAEEVKRV-----KRAENGIISREDPVTISPETTVEDVLELMERKGISGIPVV 119 (476) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHH-----HHHCCCEEEECCCEEECCCCHHHHHHHHCCCCCCCEEEEE T ss_conf 45999999998549947994479989999998875-----2320660651488684798518999973221576545788 Q ss_pred CC-CCC-----CEEEEEEH---HHHHHHHHCCCCCHHHHH-H--CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCE Q ss_conf 67-855-----27888888---887620202466016887-3--274352389998999998850386277752286742 Q gi|255764470|r 344 QG-SLD-----SFIGIVSA---RDLLRDLLEEGSMNFKRS-I--RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVL 411 (523) Q Consensus 344 ~~-~~D-----~iiGiv~~---kDLl~~~~~~~~~~~~~l-i--r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~ 411 (523) ++ +.+ +++|||.. ||+-...+.+.+....++ . .+.+++|+...+.++++.|+++|..-..|||+.|.+ T Consensus 120 ~~G~~~g~ktGKLvGIiT~sqWrD~~f~~~~~~~daV~~~MT~~~~~iT~~e~i~~e~A~~~L~~~r~ekLpvVd~~~~l 199 (476) T TIGR01302 120 EDGKDGGPKTGKLVGIITKSQWRDVRFVKDKGKKDAVSEVMTPREELITVPEGIDLEEALKVLHKHRIEKLPVVDKDGEL 199 (476) T ss_pred ECCCCCCCCEEEEEEEEECCCEEECCCCCCCCCCCCEEECCCCCCCCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCCE T ss_conf 36889897100699998377225411010468833010112037643484167778999999886086504788278988 Q ss_pred EEEEEHHHHHHHHCCCCCCCCCCCCCEEEE---CCCEEEEEECCCH--HHHHHH Q ss_conf 565259999998706644578886312883---4985998700119--999998 Q gi|255764470|r 412 EGMITPANILEAIAGDFPDEDDQKLDITVG---DDGSLTVDGWIDV--RYASKL 460 (523) Q Consensus 412 ~GIVTleDIlE~IvGei~DE~d~~~~i~~~---~d~~~~v~G~~~l--~el~~~ 460 (523) .||||..|+.+.. ++ |...+- .+|+|+|-+.+.- +|.+|. T Consensus 200 VgLiT~~Di~~~~--~~-------P~A~kd~vG~~GrL~VgAAvg~r~~D~~R~ 244 (476) T TIGR01302 200 VGLITVKDIVKRR--EF-------PHASKDTVGENGRLIVGAAVGTREDDLERA 244 (476) T ss_pred EEEEEHHHHHHHH--HC-------CCCCCCCCCCCCEEEEEEEECCCCCCHHHH T ss_conf 9998644788986--38-------887788748886089998846898618999 No 34 >cd04622 CBS_pair_9 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.24 E-value=1.8e-10 Score=95.50 Aligned_cols=108 Identities=21% Similarity=0.329 Sum_probs=93.3 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHH-HHHHCCC---CCHHHHHH-CCCEEECCCCCHH Q ss_conf 02899608999899999998579978888767855278888888876-2020246---60168873-2743523899989 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLL-RDLLEEG---SMNFKRSI-RKPLVVHENISVL 388 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl-~~~~~~~---~~~~~~li-r~~~~Vpe~~~l~ 388 (523) .++++++.++|+.++.+.+.+++++.+||.++ ++++|++.-+|+. .....+. .....+.+ +++.+++++.++. T Consensus 1 r~~vti~~~~tl~~a~~~m~~~~~~~v~V~d~--~~~vGiiT~~Di~~~~~~~~~~~~~~~v~~iM~~~~i~v~~~~~l~ 78 (113) T cd04622 1 RDVVTVSPDDTIREAARLMREHDVGALPVCEN--DRLVGIVTDRDIVVRAVAEGRDPDTTTVGDVMTRGVVTVTEDDDVD 78 (113) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEECHHHHHHHHHCCCCHHHCCHHHHHCCCCEEEECCCCHH T ss_conf 99779689398999999999729988999989--9999999632788999973998200776786415767998899879 Q ss_pred HHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 99998850386277752286742565259999998 Q gi|255764470|r 389 KLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 389 ~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) ++++.|++++.|...|+|+.|.+.|+||..|++.+ T Consensus 79 ~a~~~m~~~~i~~lpVvd~~~~~vGiIt~~Dilra 113 (113) T cd04622 79 EAARLMREHQVRRLPVVDDDGRLVGIVSLGDLARA 113 (113) T ss_pred HHHHHHHHCCCCEEEEEECCCEEEEEEEHHHHHCC T ss_conf 99999985796589999089989999997995477 No 35 >cd04803 CBS_pair_15 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.23 E-value=2.6e-10 Score=94.37 Aligned_cols=106 Identities=23% Similarity=0.329 Sum_probs=91.3 Q ss_pred EEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCC------------CCCHHHHHH-CCCEEEC Q ss_conf 899608999899999998579978888767855278888888876202024------------660168873-2743523 Q gi|255764470|r 316 IVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEE------------GSMNFKRSI-RKPLVVH 382 (523) Q Consensus 316 i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~------------~~~~~~~li-r~~~~Vp 382 (523) ++++..++++.++.+.+.+++++++||++++ ++++|++..+|++...... ......+++ +++.+++ T Consensus 3 vvtv~~~~tl~~a~~~m~~~~i~~lpVvd~~-~~lvGiit~~Dil~~~~~~~~~~~~~~~~~~~~~~v~~im~~~~~tv~ 81 (122) T cd04803 3 VVTLSEDDSLADAEELMREHRIRHLPVVNED-GKLVGLLTQRDLLRAALSSLSDNGEESLTKERDVPVAEVMKTDVLTVT 81 (122) T ss_pred CEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHHHHHCCCCCCHHHCCCCCCCCHHHHCCCCCEEEC T ss_conf 1999896999999999997499789999089-868889659999999984376420121025358899996689986998 Q ss_pred CCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 8999899999885038627775228674256525999999 Q gi|255764470|r 383 ENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 383 e~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) +++++.++++.|.+++.+...|+|+.|...|+||..|++. T Consensus 82 ~~~~l~~~~~~m~~~~~~~lpVvd~~~~lvGiiT~~Dilk 121 (122) T cd04803 82 PDTPLREAAEIMVENKIGCLPVVDDKGTLVGIITRSDFLR 121 (122) T ss_pred CCCCHHHHHHHHHHCCCCEEEEEECCCEEEEEEEHHHEEE T ss_conf 9998999999998568509999947998999999515564 No 36 >cd04604 CBS_pair_KpsF_GutQ_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with KpsF/GutQ domains in the API [A5P (D-arabinose 5-phosphate) isomerase] protein. These APIs catalyze the conversion of the pentose pathway intermediate D-ribulose 5-phosphate into A5P, a precursor of 3-deoxy-D-manno-octulosonate, which is an integral carbohydrate component of various glycolipids coating the surface of the outer membrane of Gram-negative bacteria, including lipopolysaccharide and many group 2 K-antigen capsules. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other funct Probab=99.22 E-value=3.7e-10 Score=93.25 Aligned_cols=109 Identities=17% Similarity=0.258 Sum_probs=95.7 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC---CCHHHHHH-CCCEEECCCCCHHH Q ss_conf 028996089998999999985799788887678552788888888762020246---60168873-27435238999899 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG---SMNFKRSI-RKPLVVHENISVLK 389 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~---~~~~~~li-r~~~~Vpe~~~l~~ 389 (523) +++..++.++|+.++.+.+.+++++..||++++ ++++|++.-+|+.+....+. +.+..+++ ++|.+++++.++.+ T Consensus 2 ~~vp~V~~~~si~ea~~~m~~~~~~~v~Vvd~~-~~lvGiiT~~Dl~r~~~~~~~~~~~~v~~iM~~~~~tv~~~~~i~~ 80 (114) T cd04604 2 DALPLVSPDTSLKDALLEMSRKGLGMTAVVDED-GRLVGIFTDGDLRRALEKGLDILTLPVADVMTRNPKTIDPDALAAE 80 (114) T ss_pred CCCCEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHHHHCCCCCCCCCHHHHHCCCCEEEECCCCHHH T ss_conf 947788994939999999997699789999799-9399999899999998716664567974743567589938988699 Q ss_pred HHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 9998850386277752286742565259999998 Q gi|255764470|r 390 LMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 390 lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) +++.|++++.+-..|+|+.|...|++|..|++.+ T Consensus 81 a~~~m~~~~i~~lpVvd~~~~~vGiit~~DllkA 114 (114) T cd04604 81 ALELMEENKITALPVVDDNGRPVGVLHIHDLLRA 114 (114) T ss_pred HHHHHHHCCEEEEEEECCCCEEEEEEEHHHHHCC T ss_conf 9999987795799999899989999988995469 No 37 >cd04623 CBS_pair_10 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.21 E-value=3.9e-10 Score=93.06 Aligned_cols=108 Identities=18% Similarity=0.296 Sum_probs=93.7 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC----CCHHHHHH-CCCEEECCCCCHH Q ss_conf 028996089998999999985799788887678552788888888762020246----60168873-2743523899989 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG----SMNFKRSI-RKPLVVHENISVL 388 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~----~~~~~~li-r~~~~Vpe~~~l~ 388 (523) .++++++.++|+.++.+.+.+++.+.+||.+++ ++++|++.-+|+++...... +.+..+.+ +++.++.+++++. T Consensus 1 r~vvti~~~~tl~ea~~~m~~~~~~~l~Vvd~~-~~l~Giit~~Di~~~~~~~~~~~~~~~V~~iM~~~~~~v~~~~~l~ 79 (113) T cd04623 1 RDVITVRPDATVAEAAKLMAEKNIGAVVVVDDG-GRLVGIFSERDIVRKVALRGASALDTPVSEIMTRNVITVTPDDTVD 79 (113) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHHHHHCCCCHHHCHHHHHCCCEEEEEECCCCHH T ss_conf 997696893979999999997399889999089-8299999768999999864998466401231211028991699199 Q ss_pred HHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 99998850386277752286742565259999998 Q gi|255764470|r 389 KLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 389 ~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) ++++.|.+++.+...|+|+ |.+.|+||..|++.+ T Consensus 80 ~a~~~m~~~~i~~lpV~d~-~~lvGiit~~Dllra 113 (113) T cd04623 80 EAMALMTERRFRHLPVVDG-GKLVGIVSIGDVVKA 113 (113) T ss_pred HHHHHHHHCCCEEEEEEEC-CEEEEEEEHHHHHCC T ss_conf 9999999769259999999-999999997996469 No 38 >cd04638 CBS_pair_25 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.20 E-value=3.6e-10 Score=93.35 Aligned_cols=105 Identities=27% Similarity=0.392 Sum_probs=92.0 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHHHH-HHCCCEEECCCCCHHHHHH Q ss_conf 028996089998999999985799788887678552788888888762020246601688-7327435238999899999 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNFKR-SIRKPLVVHENISVLKLME 392 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~~-lir~~~~Vpe~~~l~~lL~ 392 (523) .++++++.++|++|+.+.+.+++++.+||.+++ ++++|++..+|++....+ ..... +.++|.++++++++.++++ T Consensus 1 kdvvtv~~~~tv~ea~~lm~~~~i~~lpVvd~~-~~lvGivT~~Dl~~~~~~---~~v~~iMt~~~~tv~~~~~~~~a~~ 76 (106) T cd04638 1 ENVVYVTLPGTRDDVLELLKEYKVSGVPVVKKS-GELVGIITRKDLLRNPEE---EQLALLMTRDPPTVSPDDDVKEAAK 76 (106) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CEEEEEEEHHHHHHCCCC---CHHHHHHCCCCEECCCCCCHHHHHH T ss_conf 998999994969999999997699989999389-999999897896448750---2446753589736799491999999 Q ss_pred HHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 8850386277752286742565259999998 Q gi|255764470|r 393 RLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 393 ~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) .|.+++.+...|+|+ |...|+||..|++.+ T Consensus 77 ~m~~~~i~~lpVvd~-~~lvGiIt~~DilkA 106 (106) T cd04638 77 LMVENNIRRVPVVDD-GKLVGIVTVADIVRA 106 (106) T ss_pred HHHHCCCCEEEEEEC-CEEEEEEEHHHHHCC T ss_conf 998769759999999-999999997995359 No 39 >cd04633 CBS_pair_20 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.19 E-value=4.9e-10 Score=92.34 Aligned_cols=106 Identities=21% Similarity=0.332 Sum_probs=90.3 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCC-------------CHHHHH-HCCCE Q ss_conf 0289960899989999999857997888876785527888888887620202466-------------016887-32743 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGS-------------MNFKRS-IRKPL 379 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~-------------~~~~~l-ir~~~ 379 (523) .++++++.++|+.++.+.+.+++++++||.++ ++++|++..+|++........ ....+. .+++. T Consensus 1 r~vvtv~~~~tl~ea~~~m~~~~i~~lpVvd~--~~~vGiiT~~Di~~~l~~~~~~~~~~~~~~~~~~~~V~~iM~~~~~ 78 (121) T cd04633 1 KPVITVSPDDRVSHARRLMLDHDISRLPVIEG--GKLVGIVTEKDIADALRSFRPLVRDRHQERRIRNLPVSDIMTRPVI 78 (121) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHHHHHHHCCCCCCCHHHCCCCCCCEEECCCCCCCE T ss_conf 99479799398999999999709988999989--9988787599999999744653320110245136685533437988 Q ss_pred EECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 5238999899999885038627775228674256525999999 Q gi|255764470|r 380 VVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 380 ~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) +++++.++.++++.|.+++.+...|+|+ |...|+||..|++. T Consensus 79 tv~~~~~l~~~~~~m~~~~~~~lpVvd~-~~lvGIIT~~Dilr 120 (121) T cd04633 79 TIEPDTSVSDVASLMLENNIGGLPVVDD-GKLVGIVTRTDILR 120 (121) T ss_pred EEECCCCHHHHHHHHHHHCCEEEEEEEC-CEEEEEEEHHHHHC T ss_conf 9909898999999989719859999989-99999998346028 No 40 >cd04584 CBS_pair_ACT_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the acetoin utilization proteins in bacteria. Acetoin is a product of fermentative metabolism in many prokaryotic and eukaryotic microorganisms. They produce acetoin as an external carbon storage compound and then later reuse it as a carbon and energy source during their stationary phase and sporulation. In addition these CBS domains are associated with a downstream ACT domain, which is linked to a wide range of metabolic enzymes that are regulated by amino acid concentration. Pairs of ACT domains bind specifically to a particular amino acid leading to regulation of the linked enzyme. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The in Probab=99.19 E-value=6e-10 Score=91.74 Aligned_cols=107 Identities=21% Similarity=0.257 Sum_probs=91.0 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHH------------CCCCCHHHHHH-CCCEE Q ss_conf 028996089998999999985799788887678552788888888762020------------24660168873-27435 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLL------------EEGSMNFKRSI-RKPLV 380 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~------------~~~~~~~~~li-r~~~~ 380 (523) .++++++.++|+.++.+.+.+++++++||.+++ ++++|++..+|++.... ........+.+ +++.+ T Consensus 1 r~vvtv~~~~sl~~a~~lm~~~~~~~lpVvd~~-~~lvGiit~~Di~~~~~~~~~~~~~~~~~~~~~~~V~~iM~~~~~t 79 (121) T cd04584 1 RDVVTITPTTTIAEALELMREHKIRHLPVVDEE-GRLVGIVTDRDLRDASPSPFTTLSEHELYLLLKMPVKEIMTKDVIT 79 (121) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHHHCCCCCHHHHHHCCCCCCCCHHHHCCCCCCE T ss_conf 998896892989999999997299889999089-9399996589987522012200244420251468999966799858 Q ss_pred ECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 238999899999885038627775228674256525999999 Q gi|255764470|r 381 VHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 381 Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) +++++++.++++.|.+++.+-..|+|+ |...|+||..|++. T Consensus 80 v~~~~~l~~a~~~m~~~~i~~lpVvd~-~~lvGivt~~Dilr 120 (121) T cd04584 80 VHPLDTVEEAALLMREHRIGCLPVVED-GRLVGIITETDLLR 120 (121) T ss_pred ECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHC T ss_conf 989995999999988669619999999-99999998226006 No 41 >cd04586 CBS_pair_BON_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the BON (bacterial OsmY and nodulation domain) domain. BON is a putative phospholipid-binding domain found in a family of osmotic shock protection proteins. It is also found in some secretins and a group of potential haemolysins. Its likely function is attachment to phospholipid membranes. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.19 E-value=4.7e-10 Score=92.48 Aligned_cols=108 Identities=24% Similarity=0.313 Sum_probs=91.5 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC-------------------------C Q ss_conf 028996089998999999985799788887678552788888888762020246-------------------------6 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG-------------------------S 368 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~-------------------------~ 368 (523) .++++++.++|+.++.+.+.+++++++||.+++ ++++|++..+|++.....+. . T Consensus 2 ~~vitv~~~~tl~~a~~~m~~~~i~~lpVvd~~-~~lvGiit~~Dil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (135) T cd04586 2 TDVVTVSPETSVAEAARLMLDNHISGLPVVDDD-GRLVGIVSEGDLLRRAELGTERRRARWLDLLAGAEELAAAFVRSHG 80 (135) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECC-CEEEEEEEHHHHHHHHHCCCCCCHHHHHHHHCCCHHHHHHHCCCCC T ss_conf 999899993939999999997399679999089-9799999818976445415654114555430231034554304479 Q ss_pred CHHHHH-HCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 016887-3274352389998999998850386277752286742565259999998 Q gi|255764470|r 369 MNFKRS-IRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 369 ~~~~~l-ir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) ...++. .+++.++++++++.++++.|.+++.+...|+|+ |...|+||..|++.+ T Consensus 81 ~~V~~iM~~~~~~v~~~~~l~~~~~~m~~~~i~~lpVvd~-~~lvGiIt~~DilkA 135 (135) T cd04586 81 RKVADVMTRPVVTVGEDTPLAEVAELMEEHRIKRVPVVRG-GRLVGIVSRADLLRA 135 (135) T ss_pred CCHHHHCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHCC T ss_conf 8989957899879959795999999999759609999999-999999997996459 No 42 >cd04609 CBS_pair_PALP_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the pyridoxal-phosphate (PALP) dependent enzyme domain upstream. The vitamin B6 complex comprises pyridoxine, pyridoxal, and pyridoxamine, as well as the 5'-phosphate esters of pyridoxal (PALP) and pyridoxamine, the last two being the biologically active coenzyme derivatives. The members of the PALP family are principally involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and other amine-containing compounds. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a pote Probab=99.19 E-value=2.1e-10 Score=95.11 Aligned_cols=104 Identities=24% Similarity=0.388 Sum_probs=87.7 Q ss_pred EEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCC---CHHHHHH-CCCEEECCCCCHHHH Q ss_conf 289960899989999999857997888876785527888888887620202466---0168873-274352389998999 Q gi|255764470|r 315 EIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGS---MNFKRSI-RKPLVVHENISVLKL 390 (523) Q Consensus 315 ~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~---~~~~~li-r~~~~Vpe~~~l~~l 390 (523) |+++++.++|+.++.+.+.+.+++++||.++ ++++|++..+|++.....+.. ....+.+ +++.++++++++.++ T Consensus 2 dvvtv~p~~tl~~a~~~m~~~~i~~lpVvd~--~~lvGiit~~Di~~~~~~~~~~~~~~v~~iM~~~~~~v~~~~~l~~~ 79 (110) T cd04609 2 DVVSVAPDDTVSQAIERMREYGVSQLPVVDD--GRVVGSIDESDLLDALIEGKAKFSLPVREVMGEPLPTVDPDAPIEEL 79 (110) T ss_pred CEEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHHHHHCCCCCCCCEEEEEEECCCEEECCCCCHHHH T ss_conf 8069999497999999999819988999989--99999999999999986387646876984567594597898949999 Q ss_pred HHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 99885038627775228674256525999999 Q gi|255764470|r 391 MERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 391 L~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) .+.|++. +.++|+||.|...|+||..|+++ T Consensus 80 ~~~~~~~--~~~~vV~~~g~lvGIvT~~Dil~ 109 (110) T cd04609 80 SELLDRG--NVAVVVDEGGKFVGIITRADLLK 109 (110) T ss_pred HHHHHHC--CCCEEEECCCEEEEEEEHHHHHC T ss_conf 9988767--98249961999999998368529 No 43 >cd04637 CBS_pair_24 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.18 E-value=8.6e-10 Score=90.58 Aligned_cols=107 Identities=22% Similarity=0.314 Sum_probs=92.6 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC-------------CCHHHHHH-CCCE Q ss_conf 028996089998999999985799788887678552788888888762020246-------------60168873-2743 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG-------------SMNFKRSI-RKPL 379 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~-------------~~~~~~li-r~~~ 379 (523) ++++++..++++.++.+.+.+.+.+++||.++ +.++|++..+|++....... .....+.+ ++|. T Consensus 1 T~viTv~p~~tl~~a~~~m~~~~i~~lpVvd~--~~lvGivT~~Di~~~~~~~~~~~~~~~~~~~~~~~~v~~iM~~~~i 78 (122) T cd04637 1 TRVVTVEMDDRLEEVREIFEKHKFHHLLVVED--NELVGVISDRDYLKAISPFLGTAGETEKDLATLNRRAHQIMTRDPI 78 (122) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHHHHHHCCCCCHHHHCCCHHHCCCHHHHCCCCCE T ss_conf 97499889696999999998749988999989--9999998889999875002453102211212306899994638983 Q ss_pred EECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 5238999899999885038627775228674256525999999 Q gi|255764470|r 380 VVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 380 ~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) ++++++++.++++.|.+++.+...|+|+.|...|+||..|++. T Consensus 79 tv~~~~~l~~a~~~m~~~~i~~lPVvd~~~~lvGiIT~~Diik 121 (122) T cd04637 79 TVSPDTPVDEASKLLLENSISCLPVVDENGQLIGIITWKDLLK 121 (122) T ss_pred EECCCCCHHHHHHHHHHCCCEEEEEECCCCEEEEEEEHHHHHC T ss_conf 9999984999999999749308999968998999999688327 No 44 >cd04621 CBS_pair_8 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.17 E-value=5.9e-10 Score=91.78 Aligned_cols=108 Identities=17% Similarity=0.199 Sum_probs=91.8 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC-------------------------- Q ss_conf 028996089998999999985799788887678552788888888762020246-------------------------- Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG-------------------------- 367 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~-------------------------- 367 (523) .|++++..++++.++.+.+.+++++++||.+++ ++++|++..+|++....... T Consensus 1 kdVitv~p~~tl~ea~~~m~~~~i~~lpVvd~~-g~lvGivT~~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (135) T cd04621 1 EDIATVHPEHSLLHVVDEMEKNGVGRVIVVDDN-GKPVGVITYRDLAFAEFEDNERGLPKKSIKMKRKAGQKRYRYVKEV 79 (135) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHHHHCCCCCCCHHHHHHHHCCCHHHHHHHCCCC T ss_conf 986898994979999999987499779999599-9399999879988666314544310233333101002332101446 Q ss_pred CCHHHHHH-CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 60168873-274352389998999998850386277752286742565259999998 Q gi|255764470|r 368 SMNFKRSI-RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 368 ~~~~~~li-r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) .....+.+ +++.++++++++.++.+.|.+++.+-..|+|+ |...|+||..||+.| T Consensus 80 ~~~v~diMt~~vitv~~d~~l~~a~~~M~~~~i~~lPVvd~-~~lvGiIt~~Dilre 135 (135) T cd04621 80 PLVAEDIMTEEIITVSPNDDVVDAAKLMLEANISGLPVVDN-DNIVGVITKTDICRE 135 (135) T ss_pred CCCHHHHCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHCCC T ss_conf 88998938899889989893999999999719988999989-999999985881168 No 45 >cd04583 CBS_pair_ABC_OpuCA_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the ABC transporter OpuCA. OpuCA is the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment but the function of the CBS domains in OpuCA remains unknown. In the related ABC transporter, OpuA, the tandem CBS domains have been shown to function as sensors for ionic strength, whereby they control the transport activity through an electronic switching mechanism. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. They are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyz Probab=99.16 E-value=4.4e-10 Score=92.72 Aligned_cols=107 Identities=19% Similarity=0.261 Sum_probs=93.9 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHHHHHH-CCCEEECCCCCHHHHHH Q ss_conf 02899608999899999998579978888767855278888888876202024660168873-27435238999899999 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNFKRSI-RKPLVVHENISVLKLME 392 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~~li-r~~~~Vpe~~~l~~lL~ 392 (523) .+.+++..++|+.++.+.+.+++++.+||++++ +.++|++..+|+......+. ...+.+ +++..+.++.++.++++ T Consensus 2 r~pitv~~~~tv~eal~~m~~~~~~~l~Vvd~~-~~lvGivt~~di~~~~~~~~--~v~~im~~~~~tv~~~~~~~~a~~ 78 (109) T cd04583 2 KNPVTITPDRTLAEAIKLMRDKKVDSLLVVDKD-NKLLGIVSLESLEQAYKEAK--SLEDIMLEDVFTVQPDASLRDVLG 78 (109) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECC-CEEEEEEEHHHHHHCCCCCC--EEEECCEEEEEEECCCCCHHHHHH T ss_conf 989997993979999999997399789999589-98999998104343005896--787512620289748999999999 Q ss_pred HHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 8850386277752286742565259999998 Q gi|255764470|r 393 RLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 393 ~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) .|.+++.+...|+||.|...|+||..|++++ T Consensus 79 ~m~~~~~~~lPVVd~~~~lvGiiT~~dll~t 109 (109) T cd04583 79 LVLKRGPKYVPVVDEDGKLVGLITRSSLVDT 109 (109) T ss_pred HHHHCCCCEEEEECCCCEEEEEEEHHHHHCC T ss_conf 9985598388899649999999996882376 No 46 >cd04625 CBS_pair_12 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.16 E-value=1.1e-09 Score=89.87 Aligned_cols=107 Identities=17% Similarity=0.300 Sum_probs=93.3 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC----CCHHHHHH-CCCEEECCCCCHH Q ss_conf 028996089998999999985799788887678552788888888762020246----60168873-2743523899989 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG----SMNFKRSI-RKPLVVHENISVL 388 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~----~~~~~~li-r~~~~Vpe~~~l~ 388 (523) +++++++.++++.++.+.+.+++.+.+||.++ ++++|++.-+|+++...... +.+.++.+ ++|..+++++++. T Consensus 1 ~~V~Tv~pd~~l~eA~~~M~~~~i~~lvV~~~--g~lvGIiT~rDi~~~~~~~~~~~~~~~V~~iMt~~~~tv~~~~~i~ 78 (112) T cd04625 1 DTIYTVAPETLLSEAVATMAEQDLGSLVVMER--GELVGLLTFREVLQAMAQHGAGVLDTTVRAIMNPEPIVASPDDSID 78 (112) T ss_pred CCEEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHHHHHHCCCCCCCCCHHHHEECCCEEECCCCHHH T ss_conf 94799899597999999998749987999579--9999999879999999970998011708883348977998999699 Q ss_pred HHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 99998850386277752286742565259999998 Q gi|255764470|r 389 KLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 389 ~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) ++++.|.+++.+-..|+|+ |...|++|..|++.+ T Consensus 79 ~a~~~M~~~~i~~lpVvd~-g~lvGiit~~Di~rA 112 (112) T cd04625 79 EVRRLMVERHLRYLPVLDG-GTLLGVISFHDVAKA 112 (112) T ss_pred HHHHHHHHCCCEEEEEEEC-CEEEEEEEHHHHHCC T ss_conf 9999768669769999999-999999997996359 No 47 >cd04630 CBS_pair_17 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.15 E-value=1e-09 Score=89.95 Aligned_cols=109 Identities=18% Similarity=0.248 Sum_probs=93.3 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHC-CC---CCHHHHHH-CCCEEECCCCCHH Q ss_conf 0289960899989999999857997888876785527888888887620202-46---60168873-2743523899989 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLE-EG---SMNFKRSI-RKPLVVHENISVL 388 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~-~~---~~~~~~li-r~~~~Vpe~~~l~ 388 (523) .++++++.++|+.++.+.+.+++.+++||.+++.++++|++.-+|++..... +. .....+++ +++..+++++++. T Consensus 1 pdvitv~~~~tv~eA~~~M~~~~i~~v~V~~~~~~~~vGiiT~~Di~~~~~~~~~~~~~~~V~~vMt~~v~tv~~d~~~~ 80 (114) T cd04630 1 PNVVTIDGLATVAEALQLMKEHGVSSLVVEKRRESDAYGIVTMRDILKKVVAEGRDPDRVNVYEIMTKPLISVSPDMDIK 80 (114) T ss_pred CCEEEECCCCCHHHHHHHHHHHCCCEEEEEECCCCCEEEEEEEHHHHHHHHHCCCCCCCCCHHHHCCCCEEEECCCCCHH T ss_conf 98499998293999999999829988999978996089999818989999846789444498682703549999949499 Q ss_pred HHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 99998850386277752286742565259999998 Q gi|255764470|r 389 KLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 389 ~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) ++++.|++.+.+.+.|+|. |.+.|+||..|++.+ T Consensus 81 ~a~~~m~~~~i~~lpVvd~-~~lvGiit~~Dil~A 114 (114) T cd04630 81 YCARLMERTNIRRAPVVEN-NELIGIISLTDIFLA 114 (114) T ss_pred HHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHCC T ss_conf 9999999779729999999-999999976896479 No 48 >cd04615 CBS_pair_2 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.15 E-value=1.9e-09 Score=88.07 Aligned_cols=105 Identities=20% Similarity=0.278 Sum_probs=92.5 Q ss_pred EEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC---CCHHHHH-HCCCEEECCCCCHHHHHH Q ss_conf 996089998999999985799788887678552788888888762020246---6016887-327435238999899999 Q gi|255764470|r 317 VWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG---SMNFKRS-IRKPLVVHENISVLKLME 392 (523) Q Consensus 317 ~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~---~~~~~~l-ir~~~~Vpe~~~l~~lL~ 392 (523) .++..++++.++.+++.+++++.+||.+++ ++++|++..+|++......+ +....+. .++|.+++++.++.++++ T Consensus 4 ~tv~~d~~l~~A~~~m~~~~~~~~pVvd~~-~~lvGivT~~Di~~~~~~~~~~~~~~v~~iMt~~~~ti~~~~~l~~a~~ 82 (113) T cd04615 4 SCVVLNTDIARAVAEMYTSGSRALPVVDDK-KRLVGIITRYDVLSYALESEELKDAKVREVMNSPVITIDANDSIAKARW 82 (113) T ss_pred EEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHHHHHCCCCCCCCHHHHCCCCCEEEECCCCHHHHHH T ss_conf 598996999999999997399789999489-9799999959999999807982458999927569789908993999999 Q ss_pred HHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 885038627775228674256525999999 Q gi|255764470|r 393 RLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 393 ~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) .|++++.+-..|+|+.|...|++|..|++. T Consensus 83 ~m~~~~i~~lpVvd~~g~lvGiit~~Dilr 112 (113) T cd04615 83 LMSNNNISRLPVLDDKGKVGGIVTEDDILR 112 (113) T ss_pred HHHHHCCEEEEEECCCCEEEEEEEHHHHHC T ss_conf 999749948999979993999999688326 No 49 >cd04629 CBS_pair_16 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.14 E-value=1.1e-09 Score=89.74 Aligned_cols=107 Identities=25% Similarity=0.347 Sum_probs=92.0 Q ss_pred EEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCC-----CCCHHHHHH-CCCEEECCCCCHH Q ss_conf 2899608999899999998579978888767855278888888876202024-----660168873-2743523899989 Q gi|255764470|r 315 EIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEE-----GSMNFKRSI-RKPLVVHENISVL 388 (523) Q Consensus 315 ~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~-----~~~~~~~li-r~~~~Vpe~~~l~ 388 (523) +.+++..++|+.++.+.+.+++++.+||.+++ ++++|++..+|++...... ......+.+ +++.++++++++. T Consensus 2 ~pvtv~~~~tl~~a~~~m~~~~~~~~pVvd~~-~~lvGiit~~Dll~~~~~~~~~~~~~~~v~~iM~~~~~tv~~~~~~~ 80 (114) T cd04629 2 NPVTFTPDMSVTEAVEKLLKSKISGGPVVDDN-GNLVGFLSEQDCLKQLLESSYHCDGVATVRDIMTTEVLTVSPDDSIV 80 (114) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CEEEEEEEHHHHHHHHHHHHCCCCCCCEEHHHCCCCCEEEECCCCHH T ss_conf 89997992999999999997199789999489-92999996789999997754058876591885146887995799999 Q ss_pred HHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 99998850386277752286742565259999998 Q gi|255764470|r 389 KLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 389 ~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) ++++.|.+++.+...|+|+ |...|++|..|++.+ T Consensus 81 ~a~~~m~~~~~~~lpVvd~-g~lvGiIt~~DilrA 114 (114) T cd04629 81 DLAQLMLKAKPKRYPVVDD-GKLVGQISRRDVLRA 114 (114) T ss_pred HHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHCC T ss_conf 9999988669839999989-999999997996339 No 50 >cd04626 CBS_pair_13 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.14 E-value=1.1e-09 Score=89.70 Aligned_cols=107 Identities=18% Similarity=0.254 Sum_probs=93.0 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC--CCHHHHH-HCCCEEECCCCCHHHH Q ss_conf 028996089998999999985799788887678552788888888762020246--6016887-3274352389998999 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG--SMNFKRS-IRKPLVVHENISVLKL 390 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~--~~~~~~l-ir~~~~Vpe~~~l~~l 390 (523) +++.+++.++|+.|+.+.+.+++.+.+||.+++ ++++|++..+|+++....+. +.+..++ .++|.++.++.++.++ T Consensus 1 tdv~tv~~~~tl~eA~~~m~~~~~~~~~Vvd~~-~~l~GIiT~~Di~~~~~~~~~~~~~V~~iMt~~~itv~~~~~~~~a 79 (111) T cd04626 1 TDFPTIDEDASIREALHEMLKYNTNEIIVKDNE-EKLKGVVTFTDILDLDLFESFLEKKVFNIVSQDVFYVNEEDTIDEA 79 (111) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CEEEEEEEHHHHHHHHHCCCCCCCCHHHHCCCCCEEECCCCCHHHH T ss_conf 997399962939999999998499889999699-9799999949999998526877888899606998799997939999 Q ss_pred HHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 99885038627775228674256525999999 Q gi|255764470|r 391 MERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 391 L~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) ++.|++++.+-..|+|+ |...|++|..||++ T Consensus 80 ~~~M~~~~i~~LPVvd~-~klvGiit~~Dil~ 110 (111) T cd04626 80 LDIMREKQIGRLPVVDD-NKLIGVVRTKDILD 110 (111) T ss_pred HHHHHHHCCCEEEEEEC-CEEEEEEEHHHHCC T ss_conf 99999809988999999-99999999698148 No 51 >cd04634 CBS_pair_21 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.14 E-value=1.2e-09 Score=89.54 Aligned_cols=106 Identities=23% Similarity=0.345 Sum_probs=90.3 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC-------------------------- Q ss_conf 028996089998999999985799788887678552788888888762020246-------------------------- Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG-------------------------- 367 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~-------------------------- 367 (523) .+++++..++++.++.+.+.++++|++||.++ |+++|++..+|+++....+. T Consensus 1 kdvvTv~~d~~l~ea~~~m~~~~i~~lPVvd~--~~lvGivT~~Dil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (143) T cd04634 1 KNPITCNADDTISDAARLLRENKISGAPVLDG--GKLVGIVSESDILKLLVTHDPSGNLWLPSPLELIELPLREFINWEE 78 (143) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 99889589398999999999749977999979--9999999869999887516755432356602332111232222667 Q ss_pred ---------CCHHHHHH-CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf ---------60168873-27435238999899999885038627775228674256525999999 Q gi|255764470|r 368 ---------SMNFKRSI-RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 368 ---------~~~~~~li-r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) ..+.++.+ +++..+++++++.++.+.|.+++.+-..|+|+ |...|+||..|+++ T Consensus 79 ~~~~~~~~~~~~v~diM~~~~itv~~~~~l~~a~~~m~~~~i~~lPVvd~-~~lvGIItr~Dilk 142 (143) T cd04634 79 TKRALTDAGKMKVRDIMTKKVITISPDASIEDAAELMVRHKIKRLPVVED-GRLVGIVTRGDIIE 142 (143) T ss_pred HHHHHHHHCCCCHHHHCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHCCC T ss_conf 77665432278989958889879989896999999999739888999989-99999999577308 No 52 >cd04801 CBS_pair_M50_like This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the metalloprotease peptidase M50. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.14 E-value=1.9e-09 Score=88.08 Aligned_cols=108 Identities=19% Similarity=0.279 Sum_probs=91.2 Q ss_pred CEEEEEECCCCHHHHHHH-HHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC-CCHHHHHH---CCCEEECCCCCHH Q ss_conf 028996089998999999-985799788887678552788888888762020246-60168873---2743523899989 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWK-ILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG-SMNFKRSI---RKPLVVHENISVL 388 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~-i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~-~~~~~~li---r~~~~Vpe~~~l~ 388 (523) .|+.+++.+.|++|+.+. +.+++|++|||.+++ +.++|++..+|+.....+.. .....+.+ +.+..+.+++++. T Consensus 1 Rdv~tv~~~~tl~e~~~~~~~~~~~~~~pVvd~~-g~l~Givt~~dl~~~~~~~~~~~~v~~v~~~~~~~~~v~~~~~l~ 79 (114) T cd04801 1 RDFPTVPAHLTLREFVREYVLGSNQRRFVVVDNE-GRYVGIISLADLRAIPTSQWAQTTVIQVMTPAAKLVTVLSEESLA 79 (114) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCC-CEEEEEEEHHHHHHHHHHCCCCCCHHHHCCCCCCCEEECCCCCHH T ss_conf 9955849999599999999850796689998789-979999999998754540256884778210277875899999799 Q ss_pred HHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 9999885038627775228674256525999999 Q gi|255764470|r 389 KLMERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 389 ~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) ++++.|.+++.+-..|+|+.|...|++|..|++. T Consensus 80 ~al~~m~~~~~~~LpVVd~~g~lvGiit~~Dilr 113 (114) T cd04801 80 EVLKLLEEQGLDELAVVEDSGQVIGLITEADLLR 113 (114) T ss_pred HHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHC T ss_conf 9999999769867889947990999999788436 No 53 >cd04636 CBS_pair_23 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.13 E-value=1.5e-09 Score=88.79 Aligned_cols=108 Identities=21% Similarity=0.348 Sum_probs=91.5 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC-----------------------CCH Q ss_conf 028996089998999999985799788887678552788888888762020246-----------------------601 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG-----------------------SMN 370 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~-----------------------~~~ 370 (523) .+++++..++|+.++.+.+.+++++.+||.+++ +.++|++..+|++....... ..+ T Consensus 1 rdvvtv~~~~tl~ea~~~m~~~~~~~lpVvd~~-~~lvGiit~~Dll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (132) T cd04636 1 KDVITVKKDDTLRDVVEILLTGKISGVPVVDNE-GRVVGIVSEGDLIRKIYKGKGLFYVTLLYSVIFLDESKIKKLLGKK 79 (132) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHCCCCCCC T ss_conf 998796891979999999997299789999599-9399998889999998626762000000000000233310203899 Q ss_pred HHH-HHCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 688-73274352389998999998850386277752286742565259999998 Q gi|255764470|r 371 FKR-SIRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 371 ~~~-lir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) ..+ +.++|.++.++.++.++++.|.+++.|-..|+|+ |...|+||..|++.+ T Consensus 80 v~diMt~~~~tv~~~~~l~~a~~~m~~~~i~~lpVvd~-~~lvGiIt~~Dilra 132 (132) T cd04636 80 VEEIMTKKVITVDEDTTIEDVARIMSKKNIKRLPVVDD-GKLVGIISRGDIIRS 132 (132) T ss_pred HHHHCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHCC T ss_conf 99967899879999794999999999629878999989-999999996882349 No 54 >cd04641 CBS_pair_28 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.12 E-value=1.8e-09 Score=88.23 Aligned_cols=108 Identities=16% Similarity=0.204 Sum_probs=90.6 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC----CCHHHHHH-------CCCEEEC Q ss_conf 028996089998999999985799788887678552788888888762020246----60168873-------2743523 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG----SMNFKRSI-------RKPLVVH 382 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~----~~~~~~li-------r~~~~Vp 382 (523) .+++++..++++.++.+.+.+++.|.+||.+++ ++++|++..+|++.....+. .......+ .++..++ T Consensus 1 r~Vitv~pd~~l~da~~lm~~~~is~lPVVD~~-g~lvGiis~~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~t~~ 79 (120) T cd04641 1 KNIATARPDTPLIDVLDMLVERRVSALPIVDEN-GKVVDVYSRFDVINLAKEGAYNNLDLTVGEALERRSQDFEGVRTCS 79 (120) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHHHHCCCCCCCCCCHHHHHHCCCCCCCCCEEEC T ss_conf 997898998999999999998098669998789-9698997599999887515544344208776421555547856989 Q ss_pred CCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 8999899999885038627775228674256525999999 Q gi|255764470|r 383 ENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 383 e~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) ++.++.++++.|.+++.|-..|+||.|...|+||..||+. T Consensus 80 ~~~~l~~~~~~m~~~~i~rlpVVD~~~rlvGiIs~~Dilr 119 (120) T cd04641 80 PDDCLRTIFDLIVKARVHRLVVVDENKRVEGIISLSDILQ 119 (120) T ss_pred CCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHC T ss_conf 9997999999999849857989989997999998688238 No 55 >PRK01862 putative voltage-gated ClC-type chloride channel ClcB; Provisional Probab=99.11 E-value=1.5e-09 Score=88.77 Aligned_cols=120 Identities=15% Similarity=0.229 Sum_probs=101.1 Q ss_pred CCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCC---CCCHHHHHHC- Q ss_conf 378667465340002899608999899999998579978888767855278888888876202024---6601688732- Q gi|255764470|r 301 LADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEE---GSMNFKRSIR- 376 (523) Q Consensus 301 L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~---~~~~~~~lir- 376 (523) +++.+|+|+|. ++...++.+++++|+.+.+.+++|+.+||.+++ +..+|+++.+|+.....++ ++....++++ T Consensus 453 l~~~~v~dvm~--~~~~~v~~~~~l~e~~~~~~~~~~~~~~Vvd~~-g~l~Giv~l~dl~~~~~~~~~~~~~~~~~~~~~ 529 (583) T PRK01862 453 LRTTQMRELIQ--PAQTVVPLTASVADMTRVFLEYPVKYLYVTDDD-GRFRGAVALKDITSDLLDKRDTTDKTAADYAHT 529 (583) T ss_pred HHHCCHHHHCC--CCCCEECCCCCHHHHHHHHHHCCCEEEEEECCC-CEEEEEEEHHHHHHHHHCCCCCCCCCHHHHHCC T ss_conf 85080999638--788234999989999999984797079998899-979999789998877516355564609998179 Q ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECCCC--CEEEEEEHHHHHHH Q ss_conf 743523899989999988503862777522867--42565259999998 Q gi|255764470|r 377 KPLVVHENISVLKLMERLRKSSQTFVMVLDEYG--VLEGMITPANILEA 423 (523) Q Consensus 377 ~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G--~~~GIVTleDIlE~ 423 (523) ++..+.+++++.++++.|.+....-..|||+.+ ...|++|-.|++++ T Consensus 530 ~~~~l~~~~sL~~al~~f~~~~~~~LPVVd~~~~~~lvGiit~~dil~a 578 (583) T PRK01862 530 PFPLLTPDMPLRDALEHFMAFQGERLPVVESEASPTLAGVVYKTSLLDA 578 (583) T ss_pred CCCEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEEHHHHHHH T ss_conf 9957889998999999999669987789955999879999778999999 No 56 >cd04599 CBS_pair_GGDEF_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the GGDEF (DiGuanylate-Cyclase (DGC)) domain. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.09 E-value=1.7e-09 Score=88.33 Aligned_cols=104 Identities=18% Similarity=0.199 Sum_probs=90.1 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHHHH-HHCCCEEECCCCCHHHHHH Q ss_conf 028996089998999999985799788887678552788888888762020246601688-7327435238999899999 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNFKR-SIRKPLVVHENISVLKLME 392 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~~-lir~~~~Vpe~~~l~~lL~ 392 (523) .+.++++.++|+.++.+.+.+++++.+||.++ ++++|++..+|+.....+ ....+ +.+++.++++++++.++++ T Consensus 1 rd~vtv~p~~tv~ea~~lM~~~~i~~lpVv~~--~~lvGivT~~Dl~~~~~~---~~v~~iMt~~~itv~~~~~l~~a~~ 75 (105) T cd04599 1 RDPITIDPLDSVGRAARLMEKHRIGGLPVVED--GKLVGIITSRDVRRAHPN---RLVADAMTREVVTISPEASLLEAKR 75 (105) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEECHHHHCCCCC---CCHHHHCCCCCEEECCCCCHHHHHH T ss_conf 99899689498999999998639988999999--999899981053127888---7968850279899979491999999 Q ss_pred HHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 8850386277752286742565259999998 Q gi|255764470|r 393 RLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 393 ~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) .|.+++.+-..|+|+ |...|++|..||..| T Consensus 76 ~M~~~~i~~lpVvd~-g~lvGiiT~~Divre 105 (105) T cd04599 76 LMEEKKIERLPVLRE-RKLVGIITKGTIALE 105 (105) T ss_pred HHHHCCCCEEEEEEC-CEEEEEEECHHHCCC T ss_conf 779859979999989-999999994586158 No 57 >cd04593 CBS_pair_EriC_assoc_bac_arch This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in bacteria and archaea. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS d Probab=99.08 E-value=2.5e-09 Score=87.13 Aligned_cols=107 Identities=21% Similarity=0.280 Sum_probs=91.8 Q ss_pred EEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCC---CHHHHHH-CCCEEECCCCCHHHHH Q ss_conf 89960899989999999857997888876785527888888887620202466---0168873-2743523899989999 Q gi|255764470|r 316 IVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGS---MNFKRSI-RKPLVVHENISVLKLM 391 (523) Q Consensus 316 i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~---~~~~~li-r~~~~Vpe~~~l~~lL 391 (523) -.+++.++++.|+.+.+.+++++.+||.+++ +.++|++..+|+++....+.. ....+.+ +++.+++++.++.+++ T Consensus 3 P~tV~~~~tv~ea~~~m~~~~~~~~pVvD~~-g~lvGIvT~~Dl~r~~~~~~~~~~~~v~~vmt~~~~tv~pd~~l~~al 81 (115) T cd04593 3 PPVLSATTPLREAAEQLIESKHGSALVVDRD-GGVVGIITLPDLLRALEADEAGEPSAVDEVATPPLLTVHPDEPLAHAL 81 (115) T ss_pred CEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHHHHCCCCCCCCEEEEEECCCCEEECCCCHHHHHH T ss_conf 8388991999999999987499899999499-988999889999999973888667794543038966899999499999 Q ss_pred HHHHHCCCEEEEEECCC--CCEEEEEEHHHHHHH Q ss_conf 98850386277752286--742565259999998 Q gi|255764470|r 392 ERLRKSSQTFVMVLDEY--GVLEGMITPANILEA 423 (523) Q Consensus 392 ~~fr~~~~~lAiVvDE~--G~~~GIVTleDIlE~ 423 (523) +.|.+++.+-..|+|+- |...|++|.+||+.+ T Consensus 82 ~~M~~~~i~~LPVVd~~~~~~lvGiit~~Dil~A 115 (115) T cd04593 82 DRMASRGLRQLPVVDRGNPGQVLGLLTRENVLLA 115 (115) T ss_pred HHHHHCCCCEEEEEECCCCCEEEEEEEHHHHHCC T ss_conf 9999869988999978999989999997996569 No 58 >cd04595 CBS_pair_DHH_polyA_Pol_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with an upstream DHH domain which performs a phosphoesterase function and a downstream polyA polymerase domain. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.08 E-value=1.6e-09 Score=88.64 Aligned_cols=106 Identities=22% Similarity=0.340 Sum_probs=91.6 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC-CCHHHHH-HCCCEEECCCCCHHHHH Q ss_conf 028996089998999999985799788887678552788888888762020246-6016887-32743523899989999 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG-SMNFKRS-IRKPLVVHENISVLKLM 391 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~-~~~~~~l-ir~~~~Vpe~~~l~~lL 391 (523) +.+.++..++|+.|+.+.+.+++++.+||.++ ++++|++.-+|+.+....+. +.+..++ .+++..+.+++++.+++ T Consensus 2 spV~ti~p~~tv~ea~~~m~~~~i~~lpVvd~--~~lvGivT~~Di~~~~~~~~~~~~v~~iM~~~~~tv~~~~~i~~a~ 79 (110) T cd04595 2 SPVKTVRPEATIEEARELLLRYGHTALPVVEG--GRVVGIISRRDVEKALRHGLGHAPVKDYMSTDVVTVPPDTPLSEVQ 79 (110) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHHHHHHCCCCCCCCEEEEECCEEEECCCCCHHHHH T ss_conf 89899799599999999999759988999989--9999999768988776537678812103253668866999399999 Q ss_pred HHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 9885038627775228674256525999999 Q gi|255764470|r 392 ERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 392 ~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) +.|.+++.+...|+|+ |...|++|..|++. T Consensus 80 ~~m~~~~~~~lpVvd~-g~lvGiit~~Dilk 109 (110) T cd04595 80 ELMVEHDIGRVPVVED-GRLVGIVTRTDLLR 109 (110) T ss_pred HHHHHCCCEEEEEEEC-CEEEEEEEHHHEEC T ss_conf 9889749139999989-99999998415615 No 59 >cd04602 CBS_pair_IMPDH_2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the inosine 5' monophosphate dehydrogenase (IMPDH) protein. IMPDH is an essential enzyme that catalyzes the first step unique to GTP synthesis, playing a key role in the regulation of cell proliferation and differentiation. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in IMPDH have been associated with retinitis pigmentos Probab=99.08 E-value=1.9e-09 Score=88.02 Aligned_cols=107 Identities=20% Similarity=0.242 Sum_probs=89.0 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCC--CCCEEEEEEHHHHHHHHHCCCCCHHHHHHC---CCEEECCCCCHH Q ss_conf 028996089998999999985799788887678--552788888888762020246601688732---743523899989 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGS--LDSFIGIVSARDLLRDLLEEGSMNFKRSIR---KPLVVHENISVL 388 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~--~D~iiGiv~~kDLl~~~~~~~~~~~~~lir---~~~~Vpe~~~l~ 388 (523) ++.+++..++|++++.+.+.+++|+.+||.++. .+.++|++..+|+.... .......+.+. .+...+++.++. T Consensus 2 ~dPvtv~p~~tv~ea~~lm~~~~~~~~pVvd~g~~~~klvGIvT~rDi~~~~--~~~~~v~~iMt~~~~~i~~~~~~~l~ 79 (114) T cd04602 2 TDPSVLSPDHTVADVLEIKEKKGFSGIPVTEDGKSGGKLLGIVTSRDIDFLT--DSETPLSEVMTPREVLVVAPTGITLE 79 (114) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCCCCEEEEEEEHHHHHHCC--CCCCCCCCEEECCCEEEEECCCCCHH T ss_conf 8899979989999999999971987799996577689699999978952350--76765635165276689947987899 Q ss_pred HHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 9999885038627775228674256525999999 Q gi|255764470|r 389 KLMERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 389 ~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) ++.+.|.+++.+...|+||.|...||+|..|++. T Consensus 80 ea~~~m~~~~i~~LPVVd~~g~LvGiIT~~Di~k 113 (114) T cd04602 80 EANEILRESKKGKLPIVNDDGELVALVTRSDLKK 113 (114) T ss_pred HHHHHHHHCCCCEEEEECCCCEEEEEEECHHHCC T ss_conf 9999999749667819978996999999244036 No 60 >cd04619 CBS_pair_6 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=99.08 E-value=3.5e-09 Score=86.13 Aligned_cols=107 Identities=15% Similarity=0.203 Sum_probs=93.6 Q ss_pred EEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC----CCHHHHH-HCCCEEECCCCCHHH Q ss_conf 28996089998999999985799788887678552788888888762020246----6016887-327435238999899 Q gi|255764470|r 315 EIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG----SMNFKRS-IRKPLVVHENISVLK 389 (523) Q Consensus 315 ~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~----~~~~~~l-ir~~~~Vpe~~~l~~ 389 (523) ++..++.++|+.|+.+.+.+.+.+.+||++++ ++++|++.-+|+.+.....+ +...++. .+++.++++++++.+ T Consensus 2 rvitV~~~~ti~eA~~~M~~~~ig~l~Vvd~~-g~lvGIiTe~Di~r~~~~~~~~~~~~~V~~vMt~~vitv~~~~~i~~ 80 (114) T cd04619 2 RLAKIDVNATLQRAAKILGEPGIDLVVVCDPH-GKLAGVLTKTDVVRQMGRCGGPGCTAPVENVMTRAVVSCRPGDLLHD 80 (114) T ss_pred CEEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHHHHHCCCCCCCCCHHHHCCCCCEEECCCCCHHH T ss_conf 31298986939999999987499889999899-95999999499999987307986678999961589889999890999 Q ss_pred HHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 999885038627775228674256525999999 Q gi|255764470|r 390 LMERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 390 lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) +++.|.+++.+-..|+||.|...|++|..|++. T Consensus 81 a~~~M~~~~i~~lPVVD~~~~~vGiit~~Dilk 113 (114) T cd04619 81 VWQVMKQRGLKNIPVVDENARPLGVLNARDALK 113 (114) T ss_pred HHHHHHHCCCCEEEEEECCCEEEEEEEHHHEEC T ss_conf 999888769858899907995999999354542 No 61 >cd04582 CBS_pair_ABC_OpuCA_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the ABC transporter OpuCA. OpuCA is the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment but the function of the CBS domains in OpuCA remains unknown. In the related ABC transporter, OpuA, the tandem CBS domains have been shown to function as sensors for ionic strength, whereby they control the transport activity through an electronic switching mechanism. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. They are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzi Probab=99.07 E-value=4.5e-09 Score=85.32 Aligned_cols=105 Identities=15% Similarity=0.110 Sum_probs=91.3 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHHHHHHCCCEEECCCCCHHHHHHH Q ss_conf 02899608999899999998579978888767855278888888876202024660168873274352389998999998 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNFKRSIRKPLVVHENISVLKLMER 393 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~~lir~~~~Vpe~~~l~~lL~~ 393 (523) ++-.++..++++.++.+.+.+++++.+||++++ .+++|++..+|+...... ..-+...+.+..++.+.++.+++.. T Consensus 1 t~pvtv~pd~~l~eA~~lm~~~~~~~l~VVd~~-g~l~Givt~~Dl~~~~~~---~v~d~m~~~~~tv~~d~~l~~a~~~ 76 (106) T cd04582 1 TEPITVRPDDPLSDALGLMDDSDLRALTVVDAD-GQPLGFVTRREAARASGG---CCGDHAEPFKVTVSVDDDLRIVLSR 76 (106) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHHCCC---CHHHCCCCCCEEECCCCCHHHHHHH T ss_conf 998389998939999999997399879999799-978999899998763089---5345102576699799999999999 Q ss_pred HHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 85038627775228674256525999999 Q gi|255764470|r 394 LRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 394 fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) |.+++.+...|+|+.|...|+||..||++ T Consensus 77 M~~~~i~~lPVVD~~grlvGivT~~Di~~ 105 (106) T cd04582 77 MFAHDMSWLPCVDEDGRYVGEVTQRSIAD 105 (106) T ss_pred HHHCCCCEEEEECCCCEEEEEEEHHHHHC T ss_conf 99729876258989990999998688425 No 62 >cd04587 CBS_pair_CAP-ED_DUF294_PBI_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with either the CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain or the PB1 (Phox and Bem1p) domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. The PB1 domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pai Probab=99.07 E-value=2.5e-09 Score=87.12 Aligned_cols=108 Identities=16% Similarity=0.230 Sum_probs=91.3 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHH-HCCCC---CHHHH-HHCCCEEECCCCCHH Q ss_conf 02899608999899999998579978888767855278888888876202-02466---01688-732743523899989 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDL-LEEGS---MNFKR-SIRKPLVVHENISVL 388 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~-~~~~~---~~~~~-lir~~~~Vpe~~~l~ 388 (523) ++-++++.++|+.++.+.+.+.+.+.+||.++ ++++|++..+|++... .++.+ ....+ +.++|.+++++.++. T Consensus 1 t~Pitv~p~~ti~ea~~~M~~~~i~~~~V~d~--~~l~GivT~~Dl~~~~~~~~~~~~~~~v~~iMt~~~~tv~~~~~l~ 78 (113) T cd04587 1 TKPATVSPTTTVQEAAKLMREKRVSCVLVMDG--NKLVGIFTSKDIALRVVAQGLDPESTLVERVMTPNPVCATSDTPVL 78 (113) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEEHHHHHHHHHCCCCHHHCEEEEEECCCCEEEECCCCHH T ss_conf 98989899499999999999729988999989--9999999845987899873998320898866616846990799999 Q ss_pred HHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 99998850386277752286742565259999998 Q gi|255764470|r 389 KLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 389 ~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) ++++.|.+++.+-..|+|+.|...|++|..||+.+ T Consensus 79 ~a~~~m~~~~i~~lpVvd~~~~~vGiit~~Dil~A 113 (113) T cd04587 79 EALHLMVQGKFRHLPVVDKSGQVVGLLDVTKLTHA 113 (113) T ss_pred HHHHHHHHCCCEEEEEEECCCEEEEEEEHHHHHCC T ss_conf 99999987794199999269989999984885278 No 63 >cd04606 CBS_pair_Mg_transporter This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain in the magnesium transporter, MgtE. MgtE and its homologs are found in eubacteria, archaebacteria, and eukaryota. Members of this family transport Mg2+ or other divalent cations into the cell via two highly conserved aspartates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.06 E-value=3.2e-09 Score=86.40 Aligned_cols=102 Identities=19% Similarity=0.205 Sum_probs=88.7 Q ss_pred EECCCCHHHHHHHHHHCC-----CCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHHHHHHC-CCEEECCCCCHHHHHH Q ss_conf 608999899999998579-----9788887678552788888888762020246601688732-7435238999899999 Q gi|255764470|r 319 LDVNCVDEDLQWKILELG-----HSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNFKRSIR-KPLVVHENISVLKLME 392 (523) Q Consensus 319 ld~~~s~~e~~~~i~~~~-----~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~~lir-~~~~Vpe~~~l~~lL~ 392 (523) +..+.|++++++.+++.. .+.+||.+++ +..+|++..+|++.. .++..+.++++ ++.+++++.++.++.+ T Consensus 2 ~~~~~TV~eai~~lr~~~~~~~~~~~i~Vvd~~-~~l~G~vt~~dll~~---~~~~~v~~iM~~~~~~v~~~~~~~~a~~ 77 (109) T cd04606 2 VREDWTVGEALEYLRRNADDPETIYYIYVVDEE-GRLLGVVSLRDLLLA---DPDTPVSDIMDTDVISVSADDDQEEVAR 77 (109) T ss_pred CCCCCCHHHHHHHHHHCCCCCCEEEEEEEECCC-CEEEEEEEHHHHHHC---CCCCCHHHHHCCCEEECCCCCCHHHHHH T ss_conf 699889999999998518885617799999799-979999986772241---8887166630021363379998999999 Q ss_pred HHHHCCCEEEEEECCCCCEEEEEEHHHHHHHH Q ss_conf 88503862777522867425652599999987 Q gi|255764470|r 393 RLRKSSQTFVMVLDEYGVLEGMITPANILEAI 424 (523) Q Consensus 393 ~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~I 424 (523) .|++++.+...|+|+.|...|++|..|+++.+ T Consensus 78 ~m~~~~~~~lPVVd~~~~lvGiIt~~Di~~~~ 109 (109) T cd04606 78 LFEKYDLLALPVVDEEGRLVGIITVDDVIDVI 109 (109) T ss_pred HHHHHCCCEEEEECCCCEEEEEEEHHHHHHHC T ss_conf 99985276046898899799999968968449 No 64 >cd04627 CBS_pair_14 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=99.06 E-value=4.4e-09 Score=85.38 Aligned_cols=105 Identities=17% Similarity=0.207 Sum_probs=89.4 Q ss_pred EEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCC----------HHH---HHHCCCEEEC Q ss_conf 899608999899999998579978888767855278888888876202024660----------168---8732743523 Q gi|255764470|r 316 IVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSM----------NFK---RSIRKPLVVH 382 (523) Q Consensus 316 i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~----------~~~---~lir~~~~Vp 382 (523) .++++.++|+.++.+.+.+++++|+||.+++.++++|++.-+|++....++... ... -..+++..++ T Consensus 3 fi~v~~~~sl~~a~~lm~~~~i~rlpVvd~~~g~lvGIiT~~dil~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~viti~ 82 (123) T cd04627 3 FIPVPSTASLFQAIEILGSGGIHRVAVTEEESGEVIGILSQRRLVEFLWENARSFPGLDPLYPIPLRDLTIGTSDVISIN 82 (123) T ss_pred CEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEHHHHHHHHHHCCCCCCCHHHHHCCCHHHHHCCCCCCEEEC T ss_conf 64848989299999999749977799995899949999997999999986376566144541120023323767717989 Q ss_pred CCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHH Q ss_conf 89998999998850386277752286742565259999 Q gi|255764470|r 383 ENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANI 420 (523) Q Consensus 383 e~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDI 420 (523) +++++.++++.|.+++.+-..|||+.|...|++|..|+ T Consensus 83 ~d~~l~~A~~lM~~~~i~~lpVVD~~g~lvGiiS~~Dl 120 (123) T cd04627 83 GDQPLIDALHLMHNEGISSVAVVDNQGNLIGNISVTDV 120 (123) T ss_pred CCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEEHHHE T ss_conf 99959999999998098878698599969999892542 No 65 >cd04591 CBS_pair_EriC_assoc_euk_bac This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes and bacteria. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS Probab=99.05 E-value=5.1e-09 Score=84.95 Aligned_cols=101 Identities=27% Similarity=0.380 Sum_probs=88.8 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCC--CCEEEEEEHHHHHHHHHCCCCCHHHHHHCCCEEECCCCCHHHHH Q ss_conf 0289960899989999999857997888876785--52788888888762020246601688732743523899989999 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSL--DSFIGIVSARDLLRDLLEEGSMNFKRSIRKPLVVHENISVLKLM 391 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~--D~iiGiv~~kDLl~~~~~~~~~~~~~lir~~~~Vpe~~~l~~lL 391 (523) .+++.+..+.+.+++.+.+.+++|+.+||.+++. ..++|++..+|+...... .+-+.|+.+++++++.++. T Consensus 2 ~~vv~~~~~~tv~~~~~ll~~~~~~g~PVv~~~~~~~~lvG~i~~~dl~~~l~~-------~m~~~p~tv~~~~~l~~~~ 74 (105) T cd04591 2 PLVVLLPEGMTVEDLESLLSTTSHNGFPVVDSTEESPRLVGYILRSQLVVALKN-------YIDPSPFTVSPRTSLEKVH 74 (105) T ss_pred CCEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCEEEEEEEHHHHHHHHHH-------HCCCCCEEECCCCCHHHHH T ss_conf 968993899899999999997799758588368878889999999999999997-------5879983989999599999 Q ss_pred HHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 9885038627775228674256525999999 Q gi|255764470|r 392 ERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 392 ~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) +.|++.+.+...|+|+ |...|+||..|++. T Consensus 75 ~lf~~~g~r~l~Vv~~-g~lvGiITr~Dl~~ 104 (105) T cd04591 75 QLFRKLGLRHLLVVDE-GRLVGIITRKDLLK 104 (105) T ss_pred HHHHHHCCCEEEEEEC-CEEEEEEEHHHHHC T ss_conf 9999949958899579-99999999899534 No 66 >PRK07107 inositol-5-monophosphate dehydrogenase; Validated Probab=99.04 E-value=5.1e-09 Score=84.93 Aligned_cols=170 Identities=16% Similarity=0.089 Sum_probs=124.6 Q ss_pred HHHHHHHHHHHHHCCC------CC-HHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEE Q ss_conf 8889999996774667------89-8999999975113786674653400028996089998999999985799788887 Q gi|255764470|r 271 NVKADVLLPTQHEKHI------IS-DQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVA 343 (523) Q Consensus 271 ~ee~~~li~~~~~~g~------l~-~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~ 343 (523) -.|.++-++.+.+.|. .. +++.++++.|=+...--+. +-+++..+++++++.+...+.+||-+||. T Consensus 62 VTE~~MAIamA~~GGiGVIH~Nmsie~Qa~~V~~VKr~esg~I~-------dPvti~p~~Tv~da~~l~~k~~~sg~PVv 134 (497) T PRK07107 62 VSDDNMAIALAREGGLSFIFGSQSIESEAAMVRRVKNHKAGFVI-------SDSNLTPDNTLADVLDLKERTGHSTIAVT 134 (497) T ss_pred CCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHCCCCCCCC-------CCCCCCCCCCHHHHHHHHHHCCCCEEEEE T ss_conf 05099999999779979987999999999999999563567668-------98175898649999998887199623451 Q ss_pred CC-C-CCCEEEEEEHHHHHHHHHCCCCCHHHHHHC---CCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHH Q ss_conf 67-8-552788888888762020246601688732---743523899989999988503862777522867425652599 Q gi|255764470|r 344 QG-S-LDSFIGIVSARDLLRDLLEEGSMNFKRSIR---KPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPA 418 (523) Q Consensus 344 ~~-~-~D~iiGiv~~kDLl~~~~~~~~~~~~~lir---~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTle 418 (523) ++ + .++++|++..+|+-.. ..+.+.+..+++. ++.++|+.+++.++.+.|++++.....|||+.|...|++|.+ T Consensus 135 d~g~~~gkLvGIvT~RD~rf~-~~~~~~~V~~vMT~~~~Lvt~~~g~sl~eA~~lL~~~kiekLpvVd~~g~L~gLiT~k 213 (497) T PRK07107 135 EDGTANGKLLGIVTSRDYRVS-RMSLDTKVKDFMTPFEKLVVAKVGITLKEANNIIWDHKLNTLPIIDKNQHLVYMVFRK 213 (497) T ss_pred ECCCCCCEEEEEEECCEEECC-CCCCCCCHHHHCCCCCCEEECCCCCCHHHHHHHHHHCCCCCEEEECCCCCEEEEEEHH T ss_conf 037859879999846401111-3677776656406765427846889999999999863444302783799589999837 Q ss_pred HHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHHHH Q ss_conf 999987066445788863128834985998700119999 Q gi|255764470|r 419 NILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVRYA 457 (523) Q Consensus 419 DIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~el 457 (523) |+....-- +...+.++|+++|-+.....|. T Consensus 214 Di~k~~~~---------P~a~~D~~grL~VgAAIg~~d~ 243 (497) T PRK07107 214 DYDSHKEN---------PLELLDSSKRYVVGAGINTRDY 243 (497) T ss_pred HHHHHHHC---------CCCCCCCCCCEEEEEECCCCCH T ss_conf 88886439---------6311376788889996377789 No 67 >COG2524 Predicted transcriptional regulator, contains C-terminal CBS domains [Transcription] Probab=99.02 E-value=3.2e-09 Score=86.37 Aligned_cols=245 Identities=17% Similarity=0.267 Sum_probs=154.3 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH----HCCCCCCHHHHHHHHHHHHHHHH--------HHHHHHHHHHH Q ss_conf 9899999998485289999999999999999977----45733702788998999999999--------89999987554 Q gi|255764470|r 175 VSQPMIRYISRHTTVVILCLGFLLMIGFLLIIEG----LHFDIPKGYLYASIGFSGIIEFF--------NQVARRNREQL 242 (523) Q Consensus 175 ~s~~i~~~i~~~p~~~~l~l~fl~~iG~~L~~eg----~~~~ipk~y~y~~~~fs~~ve~~--------n~~~~~~~~~~ 242 (523) -++-|+..++|||..+==-..-|=..|. .|| -|-..|-.--|-++.-.-+...+ |+..--+..+. T Consensus 27 KgeeIA~~l~rnpGTVRNqmq~LkaLgL---VegvpGPkGGY~PT~kAYe~L~iqt~~~~~~Vpi~~~G~~v~~~~a~~I 103 (294) T COG2524 27 KGEEIAEVLNRNPGTVRNQMQSLKALGL---VEGVPGPKGGYKPTSKAYEALSIQTLETEFVVPIYKDGNVVEDVKALKI 103 (294) T ss_pred CHHHHHHHHCCCCCHHHHHHHHHHHCCC---CCCCCCCCCCCCCCHHHHHHHCCCCCCCCEEEEEEECCEEEEEEEEEEE T ss_conf 5599999874596169999999885275---2256689889653389998744577775148889876828501378888 Q ss_pred HHHHHHHHHHHHHHHHHHCC-CCCCCCCC---HHHH-HHHHHHHHHCCCCCHHHHHH---HHHHHCCCCEEEEEEEEECC Q ss_conf 30103777667656744156-88876679---8889-99999677466789899999---99751137866746534000 Q gi|255764470|r 243 MSPSRLRARTADAVLRLLGG-KPIQPQGL---NVKA-DVLLPTQHEKHIISDQEKDM---VQSVLTLADRPAKSIMTPRT 314 (523) Q Consensus 243 ~~~~~~~eRtA~aVLRLlG~-~~~~~~~~---~ee~-~~li~~~~~~g~l~~~E~~m---i~~vL~L~e~~V~~IMtPR~ 314 (523) .-+.-.-.++-+|.+|..|- ++-..... .... ..++-.+.=.| -++.++++ +..+.+....+|.++|+ . T Consensus 104 e~~~i~~p~~c~a~i~v~Gdi~~~~~gD~VrVGPtP~~klvv~G~V~g-~Dd~~~~ilidi~~m~siPk~~V~~~~s--~ 180 (294) T COG2524 104 ELTSISNPDACRAVIRVVGDIRKINIGDSVRVGPTPVNKLVVEGKVIG-RDDTANEILIDISKMVSIPKEKVKNLMS--K 180 (294) T ss_pred EEECCCCCCCCCEEEEEEECCCCCCCCCEEEECCCCCCEEEEEEEEEC-CCCCCCEEEEEEEEEEECCCCHHHHHCC--C T ss_conf 972137976541689998013347889867877754424899869962-4566776999874326457302666266--7 Q ss_pred EEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC-CCHHHHHH-CCCEEECCCCCHHHHHH Q ss_conf 28996089998999999985799788887678552788888888762020246-60168873-27435238999899999 Q gi|255764470|r 315 EIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG-SMNFKRSI-RKPLVVHENISVLKLME 392 (523) Q Consensus 315 ~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~-~~~~~~li-r~~~~Vpe~~~l~~lL~ 392 (523) +...++.++|+.|+.+.+.+.+.+-.||.+++ +++|++...|+..+..++. +.+.++++ +.++.+.+++.+.++++ T Consensus 181 ~~i~v~~d~tl~eaak~f~~~~i~GaPVvd~d--k~vGiit~~dI~~aia~g~~~~kV~~~M~k~vitI~eDe~i~dAir 258 (294) T COG2524 181 KLITVRPDDTLREAAKLFYEKGIRGAPVVDDD--KIVGIITLSDIAKAIANGNLDAKVSDYMRKNVITINEDEDIYDAIR 258 (294) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCCCCEECCC--CEEEEEEHHHHHHHHHCCCCCCCHHHHHCCCCCEECCCHHHHHHHH T ss_conf 84676688639999999997286678510289--4589998899999997689654388875057704757304999999 Q ss_pred HHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCC Q ss_conf 88503862777522867425652599999987066 Q gi|255764470|r 393 RLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGD 427 (523) Q Consensus 393 ~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGe 427 (523) .|.+++..-.+|+|..|...|++|-.|||..|+|. T Consensus 259 ~M~~~nVGRLlV~ds~gkpvGiITrTDIL~~ia~~ 293 (294) T COG2524 259 LMNKNNVGRLLVTDSNGKPVGIITRTDILTRIAGL 293 (294) T ss_pred HHHHCCCCEEEEECCCCCEEEEEEHHHHHHHHHCC T ss_conf 98763864699984699678788557888886436 No 68 >cd04800 CBS_pair_CAP-ED_DUF294_PBI_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with either the CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain or the PB1 (Phox and Bem1p) domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. The PB1 domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pa Probab=99.02 E-value=4.9e-09 Score=85.04 Aligned_cols=105 Identities=19% Similarity=0.298 Sum_probs=89.6 Q ss_pred EEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCC---CCCHHHHHH-CCCEEECCCCCHHHH Q ss_conf 2899608999899999998579978888767855278888888876202024---660168873-274352389998999 Q gi|255764470|r 315 EIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEE---GSMNFKRSI-RKPLVVHENISVLKL 390 (523) Q Consensus 315 ~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~---~~~~~~~li-r~~~~Vpe~~~l~~l 390 (523) +.++++.++|+.++.+.+.+++.+.+||.++ ++++|++..+|++...... .+....+++ +++.+++++.++.++ T Consensus 2 ~~vtv~~~~tv~ea~~~m~~~~i~~v~V~~~--~~~vGiit~~Di~~~~~~~~~~~~~~v~~im~~~~~tv~~~~~l~~a 79 (111) T cd04800 2 PPVTCSPDTTIREAARLMTEHRVSSLLVVDD--GRLVGIVTDRDLRNRVVAEGLDPDTPVSEVMTAPPITIPPDATVFEA 79 (111) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEECHHHHHHHHCCCCCCCCHHHHCCCCCEEEECCCCHHHH T ss_conf 9989789197999999999729998999989--99999999557889987167985665888512681799889919999 Q ss_pred HHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 99885038627775228674256525999999 Q gi|255764470|r 391 MERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 391 L~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) ++.|.+++.+-..|+|+ |...|+||..|++. T Consensus 80 ~~~m~~~~i~~lpVvd~-~~lvGiit~~Dilk 110 (111) T cd04800 80 LLLMLERGIHHLPVVDD-GRLVGVISATDLLR 110 (111) T ss_pred HHHHHHCCCCEEEEEEC-CEEEEEEEHHHEEC T ss_conf 99998539958999989-99999998545736 No 69 >cd04589 CBS_pair_CAP-ED_DUF294_assoc_bac This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the bacterial CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or Probab=99.01 E-value=1.3e-08 Score=81.94 Aligned_cols=105 Identities=21% Similarity=0.280 Sum_probs=90.6 Q ss_pred EEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC---CCHHHHHH-CCCEEECCCCCHHHH Q ss_conf 28996089998999999985799788887678552788888888762020246---60168873-274352389998999 Q gi|255764470|r 315 EIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG---SMNFKRSI-RKPLVVHENISVLKL 390 (523) Q Consensus 315 ~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~---~~~~~~li-r~~~~Vpe~~~l~~l 390 (523) +.++++.++|+.++.+.+.+.+.+.+||.++ ++.+|++.-+|+.+....+. +....+++ +++.++++++++.++ T Consensus 2 ~pvtv~~~~ti~ea~~~M~~~~i~~~vV~~~--~~~~GIvT~~Dl~~~~~~~~~~~~~~V~~vms~~~~ti~~~~~~~~a 79 (111) T cd04589 2 PPLIVDASTSIRDAARLMREHGADALLVRDG--DPRLGIVTRTDLLDAVLLDGLPSSTPVGEIATFPLITVDPDDFLFNA 79 (111) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHHHHHCCCCCCCCCHHHHCCCCCEEECCCCCHHHH T ss_conf 9989999298999999999709988999769--92799999799899998299967889999287999899997929999 Q ss_pred HHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 99885038627775228674256525999999 Q gi|255764470|r 391 MERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 391 L~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) ++.|++++.+...|+|+ |...|+||..||+. T Consensus 80 ~~~M~~~~i~~lpV~d~-~~~vGivt~~Dil~ 110 (111) T cd04589 80 LLLMTRHRIHRVVVREG-GEVVGVLEQTDLLS 110 (111) T ss_pred HHHHHHCCCEEEEEEEC-CEEEEEEEHHHHHC T ss_conf 99999879808899989-99999999688228 No 70 >cd04607 CBS_pair_NTP_transferase_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain associated with the NTP (Nucleotidyl transferase) domain downstream. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=99.01 E-value=9.2e-09 Score=83.04 Aligned_cols=106 Identities=22% Similarity=0.262 Sum_probs=92.8 Q ss_pred EEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCC--CHHHHHH-CCCEEECCCCCHHHHHHH Q ss_conf 9960899989999999857997888876785527888888887620202466--0168873-274352389998999998 Q gi|255764470|r 317 VWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGS--MNFKRSI-RKPLVVHENISVLKLMER 393 (523) Q Consensus 317 ~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~--~~~~~li-r~~~~Vpe~~~l~~lL~~ 393 (523) ++++.++|+.++.+++.+++++.+||.+++ +.++|++.-+|+.+....+.+ .+..+++ ++|.++.++.++.++++. T Consensus 5 i~v~~~~tl~ea~~~m~~~~~~~~~Vvd~~-~~l~GiiT~~Di~r~l~~~~~~~~~v~~im~~~~~~i~~~~~~~~a~~~ 83 (113) T cd04607 5 LLVSPDASILDALRKIDKNALRIVLVVDEN-GRLLGTVTDGDIRRALLKGLSLDDPVSEVMNRNPITAKVGSSREEILAL 83 (113) T ss_pred EEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEECHHHHHHHHCCCCCCCCHHHHHCCEEEEEECCCCHHHHHHH T ss_conf 896994989999999987599789999799-9599999872776687628984561567521212999879989999999 Q ss_pred HHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 850386277752286742565259999998 Q gi|255764470|r 394 LRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 394 fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) |.+.+.+-..|+|+.|.+.|++|..|++.+ T Consensus 84 m~~~~i~~lPVvd~~~~lvGiit~~Dll~~ 113 (113) T cd04607 84 MRERSIRHLPILDEEGRVVGLATLDDLLSK 113 (113) T ss_pred HHHCCCCEEEEECCCCEEEEEEEHHHHHCC T ss_conf 987797499999789949999997993086 No 71 >cd04603 CBS_pair_KefB_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the KefB (Kef-type K+ transport systems) domain which is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=98.99 E-value=1.3e-08 Score=81.93 Aligned_cols=107 Identities=22% Similarity=0.224 Sum_probs=92.2 Q ss_pred EEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCC-CHHH-HHHCCCEEECCCCCHHHHHH Q ss_conf 289960899989999999857997888876785527888888887620202466-0168-87327435238999899999 Q gi|255764470|r 315 EIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGS-MNFK-RSIRKPLVVHENISVLKLME 392 (523) Q Consensus 315 ~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~-~~~~-~lir~~~~Vpe~~~l~~lL~ 392 (523) +-++++.+.|+.++.+++.+.+++.+||.+++ ..++|++..+|++....+..+ .... -+..++..++++.++.+++. T Consensus 2 ~pvtv~~~~~L~~a~~~~~e~~~~~i~VVD~~-~k~vG~it~~Dll~~~~~~~~~~~V~d~m~~~v~~i~~~~~v~dalr 80 (111) T cd04603 2 QTVSVNCENPLREAIKMINELGARAVVVVDEE-NKVLGQVTLSDLLEIGPNDYETLKVCEVYIVPVPIVYCDSKVTDLLR 80 (111) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CCEEEEEEHHHHHHHCHHHHHCCCHHHHHCCCCCCCCCCCCHHHHHH T ss_conf 44686599959999999987699768998899-96889988999864384455324387751567746389976899999 Q ss_pred HHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 885038627775228674256525999999 Q gi|255764470|r 393 RLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 393 ~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) .|++.+.+...|||.-|...|+||-.|++. T Consensus 81 ~~~~~~~~~l~VVD~~grlvGiI~~rdlLr 110 (111) T cd04603 81 IFRETEPPVVAVVDKEGKLVGTIYERELLR 110 (111) T ss_pred HHHHCCCCEEEEECCCCCEEEEEEHHHHHC T ss_conf 987449988999938997999996255123 No 72 >cd04620 CBS_pair_7 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=98.98 E-value=1.5e-08 Score=81.59 Aligned_cols=107 Identities=17% Similarity=0.253 Sum_probs=90.6 Q ss_pred EEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCC---CHHHHHH-CCCEEECCC--CCHH Q ss_conf 289960899989999999857997888876785527888888887620202466---0168873-274352389--9989 Q gi|255764470|r 315 EIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGS---MNFKRSI-RKPLVVHEN--ISVL 388 (523) Q Consensus 315 ~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~---~~~~~li-r~~~~Vpe~--~~l~ 388 (523) +.++++.++++.|+.+.+.+.+.+.+||.+++ ++++|++.-+|+.+....+.+ ....+.+ +++.++.++ .++. T Consensus 2 ~pitv~pd~tv~ea~~~M~~~~~~~~~vV~~~-~~lvGIvT~~Di~r~~~~~~~~~~~~v~~vMt~~~itv~~~~~~~i~ 80 (115) T cd04620 2 HPLTVTPDTPVADAIALMSQQGDSSCVLVVEK-GRLLGIFTERDIVRLTAIGKDLSDLPIGEVMTQPVVTLQESEIQDIF 80 (115) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHHHHCCCCCCCCCHHHHCCCCCEEECCCCCCCHH T ss_conf 69996991999999999998399469999379-92999998699999998599813489889636896798389972699 Q ss_pred HHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 9999885038627775228674256525999999 Q gi|255764470|r 389 KLMERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 389 ~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) ++++.|++++.+-..|+|+.|...|++|..||+. T Consensus 81 ~a~~~M~~~~i~~lpVvd~~g~lvGiit~~Dilk 114 (115) T cd04620 81 TALSLFRQHQIRHLPVLDDQGQLIGLVTAESIRQ 114 (115) T ss_pred HHHHHHHHCCEEEEEEECCCCEEEEEEEHHHCCC T ss_conf 9999987559528999957997999999256232 No 73 >cd04802 CBS_pair_3 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=98.98 E-value=1e-08 Score=82.68 Aligned_cols=105 Identities=17% Similarity=0.292 Sum_probs=90.4 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCC----CCCHHHHHH-CCCEEECCCCCHH Q ss_conf 02899608999899999998579978888767855278888888876202024----660168873-2743523899989 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEE----GSMNFKRSI-RKPLVVHENISVL 388 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~----~~~~~~~li-r~~~~Vpe~~~l~ 388 (523) .++++++.++|+.++.+.+.+.+++.+||.++ ++++|++.-+|++...... .+.+..+.+ +++.+++++.++. T Consensus 1 r~vvti~~~~tl~~a~~~m~~~~i~~l~V~d~--~~~vGivt~~Di~~~~~~~~~~~~~~~v~~iM~~~~~tv~~~~~l~ 78 (112) T cd04802 1 KNVITVDPDTTVYEAANIMTENNIGRLIVVDN--EKPVGIITERDLVKKVVSRNLKPREVPVGEVMSTPLITIDPNASLN 78 (112) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEECHHHHHHHHCCCCCCCCCHHHHHCCCCEEECCCCHHH T ss_conf 99789689198999999999749987999989--9999999821878888754798123899997234855994798699 Q ss_pred HHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHH Q ss_conf 999988503862777522867425652599999 Q gi|255764470|r 389 KLMERLRKSSQTFVMVLDEYGVLEGMITPANIL 421 (523) Q Consensus 389 ~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIl 421 (523) ++++.|.+++.+-..|+|+ |...|++|..|++ T Consensus 79 ~a~~~m~~~~~~~lpVvd~-~~lvGiIt~~Dil 110 (112) T cd04802 79 EAAKLMAKHGIKRLPVVDD-DELVGIVTTTDIV 110 (112) T ss_pred HHHHHHHHHCCCEEEEEEC-CEEEEEEECHHHC T ss_conf 9999878719858999989-9999999806710 No 74 >cd02205 CBS_pair The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic generali Probab=98.97 E-value=1e-08 Score=82.66 Aligned_cols=107 Identities=28% Similarity=0.386 Sum_probs=94.1 Q ss_pred EEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHH----HHHHCCCEEECCCCCHHHH Q ss_conf 289960899989999999857997888876785527888888887620202466016----8873274352389998999 Q gi|255764470|r 315 EIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNF----KRSIRKPLVVHENISVLKL 390 (523) Q Consensus 315 ~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~----~~lir~~~~Vpe~~~l~~l 390 (523) +...++.++++.++.+.+.+++++.+||++++ ++++|+++.+|++........... ....+++.+++++.++.++ T Consensus 2 ~~~~v~~~~~l~~a~~~m~~~~~~~~~Vvd~~-~~~~G~it~~dl~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~ 80 (113) T cd02205 2 DVVTVSPDDTVAEALRLMLEHGISGLPVVDDD-GRLVGIVTERDLLRALAEGGLDPLVTVGDVMTRDVVTVSPDTSLEEA 80 (113) T ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECCC-CCEEEEEEHHHHHHHHHHCCCCCCCCEEEEEECCCEEEEECCHHHHH T ss_conf 97796892989999999998398099999399-95999998899898876315741033045675054689732117799 Q ss_pred HHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 99885038627775228674256525999999 Q gi|255764470|r 391 MERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 391 L~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) +..|.+++.+...|+|+.|...|++|..|++. T Consensus 81 ~~~~~~~~~~~~~Vvd~~~~~iG~vs~~di~~ 112 (113) T cd02205 81 AELMLEHGIRRLPVVDDEGRLVGIVTRSDILR 112 (113) T ss_pred HHHHHHCCCEEEEEEECCCEEEEEEEHHHHHC T ss_conf 99999849729999966997999998899746 No 75 >cd04614 CBS_pair_1 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=98.94 E-value=1.3e-08 Score=81.92 Aligned_cols=95 Identities=12% Similarity=0.198 Sum_probs=87.7 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHHHHHHCCCEEECCCCCHHHHHHH Q ss_conf 02899608999899999998579978888767855278888888876202024660168873274352389998999998 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNFKRSIRKPLVVHENISVLKLMER 393 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~~lir~~~~Vpe~~~l~~lL~~ 393 (523) .++.++..+++++++.+.+.+.+++.+||++++ +.++|++.-+|+... .++.++++++++.++.+. T Consensus 1 rdv~tv~~~~~v~~a~~~m~~~~i~~lpVvd~~-g~lvGIiT~rDli~~-------------~~~itv~~~~~v~eaa~l 66 (96) T cd04614 1 RNVPTVWEETPLPVAVRIMELANVKALPVLDDD-GKLSGIITERDLIAK-------------SEVVTATKRTTVSECAQK 66 (96) T ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEECHHHCCC-------------CCCEEECCCCCHHHHHHH T ss_conf 996390896989999999997299879999899-949999988888257-------------788797998909999999 Q ss_pred HHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 85038627775228674256525999999 Q gi|255764470|r 394 LRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 394 fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) |.+++.+-..|+|+.|...|++|..|++. T Consensus 67 M~~~~I~~LPVvd~~~~lvGiit~~Dllk 95 (96) T cd04614 67 MKRNRIEQIPIINGNDKLIGLLRDHDLLK 95 (96) T ss_pred HHHCCCCEEEEECCCCCEEEEEEHHHCCC T ss_conf 99859988628989991999988798048 No 76 >COG2239 MgtE Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism] Probab=98.92 E-value=7.7e-08 Score=76.31 Aligned_cols=136 Identities=24% Similarity=0.284 Sum_probs=115.4 Q ss_pred CCCCHHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCC---CEEEEEC-CCCCCEEEEEEHHHHH Q ss_conf 67898999999975113786674653400028996089998999999985799---7888876-7855278888888876 Q gi|255764470|r 285 HIISDQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGH---SRFPVAQ-GSLDSFIGIVSARDLL 360 (523) Q Consensus 285 g~l~~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~---SR~PV~~-~~~D~iiGiv~~kDLl 360 (523) ..++++|++.++..++..+-++..+|| ++.+++..+.|.++++..+++.+. +++.+|= +......|++..|+++ T Consensus 113 ~~l~~~~r~~v~~~l~y~e~taG~~Mt--~e~v~l~~~~Tv~~al~~ir~~~~~~e~~~~lyVvD~~~~L~Gvvsl~~Ll 190 (451) T COG2239 113 SLLDPEERARVRQLLSYPEDTAGRIMT--TEFVTLPEDVTVDEALDRIRERAEDAETIYYLYVVDEKGKLLGVVSLRDLL 190 (451) T ss_pred HHCCHHHHHHHHHHCCCCHHHHHCCCE--EEEEEECCCCCHHHHHHHHHHHCCCCCCEEEEEEECCCCCEEEEEEHHHHH T ss_conf 818999999999865898344200013--025872067589999999997236756623799987766467775699986 Q ss_pred HHHHCCCCCHHHHHHC-CCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHC Q ss_conf 2020246601688732-7435238999899999885038627775228674256525999999870 Q gi|255764470|r 361 RDLLEEGSMNFKRSIR-KPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIA 425 (523) Q Consensus 361 ~~~~~~~~~~~~~lir-~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~Iv 425 (523) ... ++..+.+.+. .+.+|+..+...++-+.|++..---+-|||+.+...|++|.+|+++.+- T Consensus 191 ~a~---~~~~i~~im~~~~~~V~~~~dqeevA~~~~~ydl~a~PVVd~~~~LiG~itiDDiidvi~ 253 (451) T COG2239 191 TAE---PDELLKDLMEDDVVSVLADDDQEEVARLFEKYDLLAVPVVDEDNRLIGIITIDDIIDVIE 253 (451) T ss_pred CCC---CHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHCCEECCEECCCCCEEEEEEHHHHHHHHH T ss_conf 489---576899872435624365578799999999828701535778984632554999999999 No 77 >cd04617 CBS_pair_4 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=98.84 E-value=6.8e-08 Score=76.71 Aligned_cols=104 Identities=19% Similarity=0.195 Sum_probs=89.7 Q ss_pred EEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC---CCHHHHHHC---CCEEECCCCCHHHH Q ss_conf 996089998999999985799788887678552788888888762020246---601688732---74352389998999 Q gi|255764470|r 317 VWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG---SMNFKRSIR---KPLVVHENISVLKL 390 (523) Q Consensus 317 ~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~---~~~~~~lir---~~~~Vpe~~~l~~l 390 (523) +++..++++.++.+.+.+.+.+-+||.+++ +.++|++.-||+++....+. +....+++. +..++.++.++.++ T Consensus 4 vtv~~~~si~eA~~~M~~~~ig~l~VVD~~-~~LvGIiT~rDl~r~~~~~~~~~~~pV~~iMT~~P~vvt~~pd~~v~~A 82 (118) T cd04617 4 VVVRENTSVYDAIVTLFLEDVGSLFVVDED-GDLVGVVSRKDLLKASIGGADLQKVPVGVIMTRMPNITTTTPEESVLEA 82 (118) T ss_pred EEECCCCCHHHHHHHHHHHCCCEEEEECCC-CCEEEEEEHHHHHHHHHCCCCCCCCCHHHHHCCCCCEEEECCCCCHHHH T ss_conf 798996999999999998399879998499-9599998789999999748984678299985779971997899939999 Q ss_pred HHHHHHCCCEEEEEECCCC---CEEEEEEHHHHH Q ss_conf 9988503862777522867---425652599999 Q gi|255764470|r 391 MERLRKSSQTFVMVLDEYG---VLEGMITPANIL 421 (523) Q Consensus 391 L~~fr~~~~~lAiVvDE~G---~~~GIVTleDIl 421 (523) ++.|.+++.+-..|||+.| ...|+||..||+ T Consensus 83 ~~~M~~~~i~~LPVVd~~~~~~~vvGiIT~~Di~ 116 (118) T cd04617 83 AKKLIEHQVDSLPVVEKVDEGLEVIGRITKTNIT 116 (118) T ss_pred HHHHHHCCCCEECEEECCCCCEEEEEEEECCCEE T ss_conf 9999982999856794399740899999775150 No 78 >PRK11543 gutQ D-arabinose 5-phosphate isomerase; Provisional Probab=98.82 E-value=2.6e-08 Score=79.73 Aligned_cols=209 Identities=17% Similarity=0.167 Sum_probs=141.5 Q ss_pred CCCHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHCCCCCCCCC---CHHHH------H Q ss_conf 37027889989999----999998999998755430103----7776676567441568887667---98889------9 Q gi|255764470|r 213 IPKGYLYASIGFSG----IIEFFNQVARRNREQLMSPSR----LRARTADAVLRLLGGKPIQPQG---LNVKA------D 275 (523) Q Consensus 213 ipk~y~y~~~~fs~----~ve~~n~~~~~~~~~~~~~~~----~~eRtA~aVLRLlG~~~~~~~~---~~ee~------~ 275 (523) |-++-+-.++..|- ++.+++ ..+++..+...... ---+.++.++.+--.+.+.+.. +.... + T Consensus 87 i~~~D~~i~~S~SG~t~El~~~~~-~~k~~~~~ii~it~~~~S~Lak~sd~~l~~~~~~Eacp~~LaPTtStt~~lalGD 165 (321) T PRK11543 87 IESRDVMLFISYSGGAKELDLIIP-RLEDKSIALLAMTGKPTSPLGLAAKAVLDISVEREACPMHLAPTSSTVNTLMMGD 165 (321) T ss_pred CCCCCEEEEEECCCCCHHHHHHHH-HHHHCCCCEEEEECCCCCHHHHHCCEEEECCCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 878998999958988177887727-8876698689997899997688269489726656666567775089999999988 Q ss_pred HHHHHHHHCCCCCHHHHHHHHHHHCCCC---EEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEE Q ss_conf 9999677466789899999997511378---6674653400028996089998999999985799788887678552788 Q gi|255764470|r 276 VLLPTQHEKHIISDQEKDMVQSVLTLAD---RPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIG 352 (523) Q Consensus 276 ~li~~~~~~g~l~~~E~~mi~~vL~L~e---~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiG 352 (523) ++.-.--+.-.|.+++-...+--=.|+. .+|+|+|.+-.++..++.++++.+++..+.+.++--.-|.+++ ++++| T Consensus 166 ALAv~lm~~r~F~~~dFa~~HPgG~LGk~Ll~~V~dlM~~~~~lP~v~~~~~l~dai~~ms~k~lG~v~Vvd~~-~kL~G 244 (321) T PRK11543 166 ALAMAVMQARGFNEEDFARSHPAGALGARLLNKVHHLMRRGDAIPQVALTASVMDAMLELSRTGLGLVAVCDAQ-QQVKG 244 (321) T ss_pred HHHHHHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHCCCCCCEEEEECCC-CCEEE T ss_conf 99999999959998789760989888899999999997335578989999879999999664886179983377-50688 Q ss_pred EEEHHHHHHHHHCCCCCH--HH-HHHCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 888888762020246601--68-873274352389998999998850386277752286742565259999998 Q gi|255764470|r 353 IVSARDLLRDLLEEGSMN--FK-RSIRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 353 iv~~kDLl~~~~~~~~~~--~~-~lir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) ++.--|+-+....+.+.+ .. -+.++|.++.+++.+.++++.|++++..-.+|+|+.|.+.|++.+.|++.+ T Consensus 245 IITDGDLRR~l~k~~~L~~~v~~vMT~nP~tI~~d~la~eAl~iM~~~kI~~LpVvd~~~k~vGiIhiHDLl~~ 318 (321) T PRK11543 245 VFTDGDLRRWLVGGGALTTPVNEAMTVGGTTLQAQSRAIDAKEILMKRKITAAPVVDENGKLTGAINLQDFYQA 318 (321) T ss_pred EEECHHHHHHHHCCCCHHHHHHHHHCCCCCEECCCCHHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHC T ss_conf 87443899998637856768999848999578998719999999998798689998589959999638978676 No 79 >cd04598 CBS_pair_GGDEF_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the GGDEF (DiGuanylate-Cyclase (DGC)) domain. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=98.79 E-value=8.2e-08 Score=76.10 Aligned_cols=107 Identities=14% Similarity=0.182 Sum_probs=85.7 Q ss_pred EEEEEECCCCHHHHHHHHHHC-CCCEEEEECCCCCCEEEEEEHHHHHHHHHCC------CCCHHHHHH-CCCEEECCCCC Q ss_conf 289960899989999999857-9978888767855278888888876202024------660168873-27435238999 Q gi|255764470|r 315 EIVWLDVNCVDEDLQWKILEL-GHSRFPVAQGSLDSFIGIVSARDLLRDLLEE------GSMNFKRSI-RKPLVVHENIS 386 (523) Q Consensus 315 ~i~~ld~~~s~~e~~~~i~~~-~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~------~~~~~~~li-r~~~~Vpe~~~ 386 (523) .+.++..+++.+++.+.+.++ ..+-+||+++ +.++|+++-+|++...... .+.+..+++ ++|..+.++++ T Consensus 2 pv~tv~pd~t~~ev~~~f~~~~~i~~lpVvd~--~r~vGiisr~dl~~~~~~~~g~~l~~~~pV~~~M~~~p~~v~~~~~ 79 (119) T cd04598 2 PAPTVSPDTTVNDVLERFERDPDLSALAVVDD--GRPVGLIMREALMELLSTPYGRALYGKKPVSEVMDPDPLIVEADTP 79 (119) T ss_pred CCCCCCCCCCHHHHHHHHHHCCCCCEEEEEEC--CEEEEEEEHHHHHHHHCCCCCHHHHCCCCHHHHCCCCCEEECCCCC T ss_conf 98574999849999999987899676899879--9889999899999987074322330389589971689879989896 Q ss_pred HHHHHHHHHHCCCEEE---EEECCCCCEEEEEEHHHHHHH Q ss_conf 8999998850386277---752286742565259999998 Q gi|255764470|r 387 VLKLMERLRKSSQTFV---MVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 387 l~~lL~~fr~~~~~lA---iVvDE~G~~~GIVTleDIlE~ 423 (523) +.++.+.|.+++.+-. +||+|.|...|++|..|++.+ T Consensus 80 i~~~~~~~~~~~~~~l~~~ivV~~~g~~~Givt~~DLlr~ 119 (119) T cd04598 80 LEEVSRLATGRDSQNLYDGFIVTEEGRYLGIGTVKDLLRQ 119 (119) T ss_pred HHHHHHHHHHCCCCCCCCCCEEEECCEEEEEEEHHHHHCC T ss_conf 9999999985693325788589579989999898994079 No 80 >COG3620 Predicted transcriptional regulator with C-terminal CBS domains [Transcription] Probab=98.75 E-value=5.3e-07 Score=70.20 Aligned_cols=119 Identities=26% Similarity=0.403 Sum_probs=95.8 Q ss_pred CCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC----CCHHHHHHCC Q ss_conf 786674653400028996089998999999985799788887678552788888888762020246----6016887327 Q gi|255764470|r 302 ADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG----SMNFKRSIRK 377 (523) Q Consensus 302 ~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~----~~~~~~lir~ 377 (523) ...+++.+|+ ..++.+..+++++++.+.+.+.+||.+||.++ |.++|-+.-.++.+...++. ....++.+-+ T Consensus 63 ~~ita~~iM~--spvv~v~pdDsi~~vv~lM~~~g~SQlPVi~~--~k~VGsItE~~iv~~~le~~e~i~~~~vr~vM~e 138 (187) T COG3620 63 TRITAKTIMH--SPVVSVSPDDSISDVVNLMRDKGISQLPVIEE--DKVVGSITENDIVRALLEGMESIRSLRVREVMGE 138 (187) T ss_pred CEEEHHHHCC--CCEEEECCHHHHHHHHHHHHHCCCCCCCEEEC--CEEEEEECHHHHHHHHHCCCCCHHHHHHHHHHCC T ss_conf 6675766535--87058772466999999999759752754108--8645331498999997304221211218988558 Q ss_pred C-EEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCC Q ss_conf 4-3523899989999988503862777522867425652599999987066 Q gi|255764470|r 378 P-LVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGD 427 (523) Q Consensus 378 ~-~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGe 427 (523) | +.|.++.++...-..+..++ |+.|-|.|.+.||+|-.|++..+.|- T Consensus 139 ~fP~Vs~~~~l~vI~~LL~~~~---AVlV~e~G~~vGIITk~DI~k~~~~~ 186 (187) T COG3620 139 PFPTVSPDESLNVISQLLEEHP---AVLVVENGKVVGIITKADIMKLLAGT 186 (187) T ss_pred CCCCCCCCCCHHHHHHHHHHCC---EEEEEECCCEEEEEEHHHHHHHHHCC T ss_conf 8876799998799999984198---38998689368887599999987246 No 81 >cd04640 CBS_pair_27 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.75 E-value=1.7e-07 Score=73.81 Aligned_cols=104 Identities=22% Similarity=0.368 Sum_probs=83.3 Q ss_pred EEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHH-----HHCC----CCCHHHHHHCCCEEEC----- Q ss_conf 9960899989999999857997888876785527888888887620-----2024----6601688732743523----- Q gi|255764470|r 317 VWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRD-----LLEE----GSMNFKRSIRKPLVVH----- 382 (523) Q Consensus 317 ~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~-----~~~~----~~~~~~~lir~~~~Vp----- 382 (523) ++++.+.|+.|+.+.+.+++.+.+||.+++ ++++|++..+|+... ..++ .+....++++|-..+. T Consensus 4 vtv~~dtti~eA~~~M~~~~v~~l~VvD~~-~~lvGIiT~~Dl~~~~~~~~~~~~~~~~~~l~V~dvMt~~~~~~a~~~~ 82 (126) T cd04640 4 IVIPADTSIDEALELMIKHGVRLLLVVDSD-DNFIGVITAVDLLGEEPIKRIQEGGISRSELTVADVMTPKEDLKALDLE 82 (126) T ss_pred EEECCCCCHHHHHHHHHHHCCCEEEEECCC-CCEEEEEECCCCCCCHHHHHHHHCCCCHHHCCHHHCCCCCCCEEEECCC T ss_conf 696899959999999998397679898799-9799999763001215688886448982550531335545204630220 Q ss_pred --CCCCHHHHHHHHHHCCCEEEEEECCCC-CEEEEEEHHHHH Q ss_conf --899989999988503862777522867-425652599999 Q gi|255764470|r 383 --ENISVLKLMERLRKSSQTFVMVLDEYG-VLEGMITPANIL 421 (523) Q Consensus 383 --e~~~l~~lL~~fr~~~~~lAiVvDE~G-~~~GIVTleDIl 421 (523) ++.++.+++..|++++.|-++|+|+-| .+.|++|..||. T Consensus 83 ~~~~a~v~d~~~~m~~~~~~hllVvd~~~~~l~GiiS~~DI~ 124 (126) T cd04640 83 ELENASVGDVVETLKASGRQHALVVDREHHQIRGIISTSDIA 124 (126) T ss_pred CCCCCCHHHHHHHHHHCCCEEEEEEECCCCEEEEEEEHHHHC T ss_conf 003485999999999839808749989998898999879942 No 82 >PTZ00314 inosine-5'-monophosphate dehydrogenase; Provisional Probab=98.75 E-value=2.3e-07 Score=72.87 Aligned_cols=165 Identities=14% Similarity=0.173 Sum_probs=119.9 Q ss_pred HHHHHHHHHHHHHCCC--CC-----HHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEE Q ss_conf 8889999996774667--89-----8999999975113786674653400028996089998999999985799788887 Q gi|255764470|r 271 NVKADVLLPTQHEKHI--IS-----DQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVA 343 (523) Q Consensus 271 ~ee~~~li~~~~~~g~--l~-----~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~ 343 (523) -.|.++-++.+...|. +. +++.++++.|=+... -|+ .+-+++..+++++++.+...+.+||-+||. T Consensus 60 VTe~~MAiamA~~GGiGVIHrn~sie~Q~~~V~~VKr~e~-----g~i--~dPit~~p~~ti~d~~~l~~~~~~sg~pVv 132 (499) T PTZ00314 60 VTEHKMAIAMALMGGIGVIHNNCTVERQVEEVKKVKRFEN-----GFI--MDPKSLSPEHTVSDVIEIKDKKGFSGIPIT 132 (499) T ss_pred CCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHCC-----CCC--CCCEEECCCCCHHHHHHHHHHCCCCEEEEE T ss_conf 5689999999977986998799999999999999876306-----620--698273898629999975675188636798 Q ss_pred CCC--CCCEEEEEEHHHHHHHHHCCCCCHHHHHH-CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHH Q ss_conf 678--55278888888876202024660168873-274352389998999998850386277752286742565259999 Q gi|255764470|r 344 QGS--LDSFIGIVSARDLLRDLLEEGSMNFKRSI-RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANI 420 (523) Q Consensus 344 ~~~--~D~iiGiv~~kDLl~~~~~~~~~~~~~li-r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDI 420 (523) ++. .+.++|++..+|+-.. ++.+.+..+.+ +.+.+.++..++.++.+.|++++..-..|||+.|...|++|..|+ T Consensus 133 ~~g~~~gkL~GIvT~rD~~f~--~d~~~~v~~vMt~~lvt~~~~isl~eA~~ll~~~kieklpvVd~~g~L~glit~kDi 210 (499) T PTZ00314 133 EDGRPGGKLLGIVTSKDIDFV--KDKSTPVSEIMTTDLVVGRYPITLEDANDVLNRSRKGVLPIVNDNGELVALVSRSDA 210 (499) T ss_pred ECCCCCCEEEEEEECCCEECC--CCCCCCHHHHHCCCCEEECCCCCHHHHHHHHHHHHHCCCEEECCCCCEEEEEECCHH T ss_conf 638868868999836202323--067758888615672672599999999999986033123066578958998630348 Q ss_pred HHHHCCCCCCCCCCCCCEEEECCCEEEEEECCC Q ss_conf 998706644578886312883498599870011 Q gi|255764470|r 421 LEAIAGDFPDEDDQKLDITVGDDGSLTVDGWID 453 (523) Q Consensus 421 lE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~ 453 (523) ...- .-+...+.++|+++|-+.+. T Consensus 211 ~k~~---------~~P~a~~D~~grL~VgAAVg 234 (499) T PTZ00314 211 VKNR---------DYPHASKDENKQLLVGAAIS 234 (499) T ss_pred HHHH---------HCCCHHHHCCCCEEEEEEEC T ss_conf 7753---------38712220138789999947 No 83 >PRK07807 inositol-5-monophosphate dehydrogenase; Validated Probab=98.72 E-value=1.4e-07 Score=74.38 Aligned_cols=162 Identities=15% Similarity=0.127 Sum_probs=120.3 Q ss_pred CHHHHHHHHHHHHHCCC------C-CHHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEE Q ss_conf 98889999996774667------8-9899999997511378667465340002899608999899999998579978888 Q gi|255764470|r 270 LNVKADVLLPTQHEKHI------I-SDQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPV 342 (523) Q Consensus 270 ~~ee~~~li~~~~~~g~------l-~~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV 342 (523) +-.|.++-++.+.+.|. . -+++.+++..|=+ .+-|. .+-+++..+++++++.+...+.+||-+|| T Consensus 54 TVTE~~MAIamA~~GGiGvIH~Nmsie~Qa~~V~kVK~------~~~~i--~~Pvti~pd~tv~d~~~l~~~~~~sg~pV 125 (479) T PRK07807 54 AVAGRRMAETVARRGGLVVLPQDLPIDAVAETVAWVKS------RDLVF--DTPVTLAPDDTVSDALALIHKRAHGAVVV 125 (479) T ss_pred CCCCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHC------CCCEE--CCCEEECCCCCHHHHHHHHHHHCCCCCCE T ss_conf 44749999999977985998799999999999999711------37743--69989789871999999999837888741 Q ss_pred ECCCCCCEEEEEEHHHHHHHHHCCCCCHHHHHH-CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHH Q ss_conf 767855278888888876202024660168873-2743523899989999988503862777522867425652599999 Q gi|255764470|r 343 AQGSLDSFIGIVSARDLLRDLLEEGSMNFKRSI-RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANIL 421 (523) Q Consensus 343 ~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~~li-r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIl 421 (523) .+++ +.++|++..||+-.. +...+..+.+ .++.++|+.+++.++.+.|++++..-..|+|+.|...|++|.+|+. T Consensus 126 v~~~-gkLvGIvT~RDir~~---d~~~~v~~vMT~~lvt~~~g~sl~eA~~lL~~~kiekLpvVd~~~~L~gLiT~kDi~ 201 (479) T PRK07807 126 VDEE-GRPVGLVTEADCRGV---DRFTRVGDVMSTDLVTLPAGTDPRKAFDLLEAAPVKVAPVVDADGRLAGVLTRTGAL 201 (479) T ss_pred ECCC-CCEEEEEECHHHCCC---CCCCCHHHHHCCCCEEECCCCCHHHHHHHHHHCCCCCCEEECCCCEEEEEEEEEHHH T ss_conf 4679-947889821341147---777758886257736613667879999999753523113775699299999961164 Q ss_pred HHHCCCCCCCCCCCCCEEEECCCEEEEEECCC Q ss_conf 98706644578886312883498599870011 Q gi|255764470|r 422 EAIAGDFPDEDDQKLDITVGDDGSLTVDGWID 453 (523) Q Consensus 422 E~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~ 453 (523) .. . +.+ ... .++|+++|-+... T Consensus 202 k~------~--~~~-~a~-D~~grL~VgAAVG 223 (479) T PRK07807 202 RA------T--IYT-PAV-DAAGRLRVAAAVG 223 (479) T ss_pred HC------C--CCC-CCC-CHHHCEEEEEEEC T ss_conf 54------4--688-777-8223456788725 No 84 >PRK05567 inositol-5'-monophosphate dehydrogenase; Reviewed Probab=98.65 E-value=4.6e-07 Score=70.64 Aligned_cols=163 Identities=22% Similarity=0.327 Sum_probs=117.4 Q ss_pred HHHHHHHHHHHHCCC--C----C-HHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEEC Q ss_conf 889999996774667--8----9-89999999751137866746534000289960899989999999857997888876 Q gi|255764470|r 272 VKADVLLPTQHEKHI--I----S-DQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQ 344 (523) Q Consensus 272 ee~~~li~~~~~~g~--l----~-~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~ 344 (523) .|.++-++.+...|. + . +++.+++..|=+... -|+ .+-.+++.+++++++.+...+.++|-+||.+ T Consensus 53 Te~~MAiamA~~GGlGVIHrn~sie~Q~~~V~~VKr~e~-----g~i--~~P~tl~P~~tv~d~~~l~~~~~~sg~PVv~ 125 (486) T PRK05567 53 TEARLAIAMAQEGGIGVIHKNMSIEEQAEEVRKVKRSES-----GVV--TDPVTVTPDTTLAEALALMARYGISGVPVVD 125 (486) T ss_pred CCHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHHC-----CCC--CCCCCCCCCCCHHHHHHHHHHHCCCCCEEEC T ss_conf 789999999988987999899999999999999975306-----713--7986768988899999999972878614876 Q ss_pred CCCCCEEEEEEHHHHHHHHHCCCCCHHHHHHC--CCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 78552788888888762020246601688732--7435238999899999885038627775228674256525999999 Q gi|255764470|r 345 GSLDSFIGIVSARDLLRDLLEEGSMNFKRSIR--KPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 345 ~~~D~iiGiv~~kDLl~~~~~~~~~~~~~lir--~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) ++ .+++|++..+|+-.. .+.+....+.+. +..++++..++.++.+.|++++..-..|+|+.|...|++|..|+.. T Consensus 126 ~~-~kL~GiiT~rD~~f~--~~~~~~V~~vMT~~~lvt~~~~is~~eA~~~l~~~kieklPvVd~~g~L~Glit~kDi~k 202 (486) T PRK05567 126 EE-GKLVGIITNRDVRFE--TDLSQPVSEVMTKERLVTVPEGTTLEEALELLHEHRIEKLLVVDDNGRLKGLITVKDIEK 202 (486) T ss_pred CC-CCEEEEEECCCEECC--CCCCCCHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHCCCCEECCCCCEEEEEEHHHHHH T ss_conf 79-947888614201100--267765467534573289258899999999999731303427746896888877667776 Q ss_pred HHCCCCCCCCCCCCCEEEECCCEEEEEECCC Q ss_conf 8706644578886312883498599870011 Q gi|255764470|r 423 AIAGDFPDEDDQKLDITVGDDGSLTVDGWID 453 (523) Q Consensus 423 ~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~ 453 (523) .- +.| ...+.++|+++|-+... T Consensus 203 ~~--~~P-------~A~~D~~grL~VgAAVg 224 (486) T PRK05567 203 AE--EFP-------NACKDEQGRLRVGAAVG 224 (486) T ss_pred HH--HCC-------CCCCCCCCCEEEEEEEC T ss_conf 52--088-------51236668889999962 No 85 >cd04608 CBS_pair_PALP_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the pyridoxal-phosphate (PALP) dependent enzyme domain upstream. The vitamin B6 complex comprises pyridoxine, pyridoxal, and pyridoxamine, as well as the 5'-phosphate esters of pyridoxal (PALP) and pyridoxamine, the last two being the biologically active coenzyme derivatives. The members of the PALP family are principally involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and other amine-containing compounds. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a poten Probab=98.59 E-value=7.7e-07 Score=68.98 Aligned_cols=110 Identities=16% Similarity=0.153 Sum_probs=84.2 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCC---CHHHH-HHCCCEEECCCCCHHH Q ss_conf 0289960899989999999857997888876785527888888887620202466---01688-7327435238999899 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGS---MNFKR-SIRKPLVVHENISVLK 389 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~---~~~~~-lir~~~~Vpe~~~l~~ 389 (523) ...+++..++|++++.+.++++++|++||.+++ ++++|++..+|++.....++. ....+ +.++...+..+.++.. T Consensus 2 ~~~vtv~p~~tv~~a~~~m~~~~i~~lPVvd~~-g~~~G~vt~~~ll~~~~~~~~~~~~~V~~vm~~~~~~v~~~~~l~~ 80 (124) T cd04608 2 KAPVTVLPTVTCAEAIEILKEKGFDQLPVVDES-GKILGMVTLGNLLSSLSSGKVQPSDPVSKALYKQFKRVNKNDTLGK 80 (124) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CCEEEEEEHHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCCCCCHHH T ss_conf 999798998909999999998499856898799-9688888799999999738999998289974465651479985567 Q ss_pred ---------HHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHH Q ss_conf ---------99988503862777522867425652599999987 Q gi|255764470|r 390 ---------LMERLRKSSQTFVMVLDEYGVLEGMITPANILEAI 424 (523) Q Consensus 390 ---------lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~I 424 (523) .+..+..++..-++++|+.|...||||-.|++.-| T Consensus 81 ls~~l~~~~~~lvv~~~~~~~~~v~~~~~k~vGIvTr~DLL~yl 124 (124) T cd04608 81 LSRILETDAFLLVFFEQISSAAIGKEKQEKPIGIVTKIDLLSYI 124 (124) T ss_pred HHCCCCCCHHHHHHHHHHCCCCCCCCCCCEEEEEEEHHHHHHHC T ss_conf 31036732789999986025676277799899998679977039 No 86 >PRK10892 D-arabinose 5-phosphate isomerase; Provisional Probab=98.58 E-value=3.8e-07 Score=71.25 Aligned_cols=207 Identities=19% Similarity=0.164 Sum_probs=140.7 Q ss_pred CCHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHCCCCCCCCC---CHHHH------HH Q ss_conf 7027889989999----9999989999987554301----037776676567441568887667---98889------99 Q gi|255764470|r 214 PKGYLYASIGFSG----IIEFFNQVARRNREQLMSP----SRLRARTADAVLRLLGGKPIQPQG---LNVKA------DV 276 (523) Q Consensus 214 pk~y~y~~~~fs~----~ve~~n~~~~~~~~~~~~~----~~~~eRtA~aVLRLlG~~~~~~~~---~~ee~------~~ 276 (523) -++-+-.++..|- ++++++ ..+++..+.... .+---+.++.++.+=-.+.+.+.. +.... ++ T Consensus 93 ~~~D~~i~~S~SG~t~El~~ll~-~~~~~~~~iI~it~~~~S~l~~~sd~~l~~~~~~Eacp~~laPTtStt~~lalgDa 171 (326) T PRK10892 93 TPQDVVIAISNSGESSEILALIP-VLKRLHVPLICITGRPESSMARAADVHLCVKVPKEACPLGLAPTSSTTATLVMGDA 171 (326) T ss_pred CCCCEEEEEECCCCCHHHHHHHH-HHHHCCCCEEEEECCCCCHHHHHCCEEEECCCCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 78998999958987577876417-87766985899956999966880887896578744665677751899999999989 Q ss_pred HHHHHHHCCCCCHHHHHHHHHHHCCCC---EEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEE Q ss_conf 999677466789899999997511378---66746534000289960899989999999857997888876785527888 Q gi|255764470|r 277 LLPTQHEKHIISDQEKDMVQSVLTLAD---RPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGI 353 (523) Q Consensus 277 li~~~~~~g~l~~~E~~mi~~vL~L~e---~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGi 353 (523) +.-.--+.-.|.+++-...+--=+|+. .+|+|+|.+..++..++.++++.+++..+.+.++--.-|++++ ..++|+ T Consensus 172 la~~lm~~rgF~~~dFa~~HPgG~LGk~Ll~~V~dlM~~~~~lP~v~~~~~l~dai~~mt~k~lG~v~Vvd~~-gkL~GI 250 (326) T PRK10892 172 LAVALLKARGFTAEDFALSHPGGALGRKLLLRVSDIMHTGDEIPHVKKTASLRDALLEITRKNLGMTVICDDN-MMIEGI 250 (326) T ss_pred HHHHHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCC-CCEEEE T ss_conf 9999999849998689760989777788887899985137658853776029999998745893499998589-968999 Q ss_pred EEHHHHHHHHHCCCC---CHHHH-HHCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 888887620202466---01688-73274352389998999998850386277752286742565259999998 Q gi|255764470|r 354 VSARDLLRDLLEEGS---MNFKR-SIRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 354 v~~kDLl~~~~~~~~---~~~~~-lir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) +.--||-+....+.+ ....+ +.+.|.++.+++.+.++++.|++++..-.+|+|+ +...|++.+.|++.+ T Consensus 251 iTDGDLRR~l~~~~~i~~~~~~diMT~nP~tI~~d~la~eAL~iMe~~kIt~L~Vvd~-~k~vGIihihDll~~ 323 (326) T PRK10892 251 FTDGDLRRVFDMGVDLRQLSIADVMTPGGIRVRPGILAVDALNLMQSRHITSVLVADG-DQLLGVLHMHDLLRA 323 (326) T ss_pred EECHHHHHHHHHCCCCCCCCHHHHHCCCCCEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHHC T ss_conf 8626899998706883128799971899967899886999999998639818999889-999999767978676 No 87 >PRK10070 glycine betaine transporter ATP-binding subunit; Provisional Probab=98.52 E-value=4e-06 Score=63.78 Aligned_cols=136 Identities=17% Similarity=0.159 Sum_probs=105.2 Q ss_pred HHHHHHHHHCCCCEEEEEEEEE-CCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCC Q ss_conf 9999997511378667465340-002899608999899999998579978888767855278888888876202024660 Q gi|255764470|r 291 EKDMVQSVLTLADRPAKSIMTP-RTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSM 369 (523) Q Consensus 291 E~~mi~~vL~L~e~~V~~IMtP-R~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~ 369 (523) =++.++++=...-.+++++|.+ ...+..-+......++++.+.+.+.+...|.+.+ ..+.|++..+++.......... T Consensus 261 V~~F~~~v~~~~~~~a~~i~~~~~~~~~~~~~~~~~~~al~~m~~~~~~~~~vvd~~-~~~~G~v~~~~~~~~~~~~~~~ 339 (400) T PRK10070 261 VRTFFRGVDISQVFSAKDIARRTPNGLIRKTPGFGPRSALKLLQDEDREYGYVIERG-NKFVGAVSIDSLKAALTQQQGL 339 (400) T ss_pred HHHHHCCCCHHHCEEHHHHHCCCCCCEEECCCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHHHHCCCCH T ss_conf 998755477877023989624687521314888699999999985598679998699-8088998899999776337763 Q ss_pred HHHHHHCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCC Q ss_conf 168873274352389998999998850386277752286742565259999998706644 Q gi|255764470|r 370 NFKRSIRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFP 429 (523) Q Consensus 370 ~~~~lir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~ 429 (523) ...+.+.+..+++++++.+++..+.+++..+ .||||.|...|+||...+++++..+-. T Consensus 340 -~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~-~Vvd~~~~l~G~it~~~ll~~L~~~~~ 397 (400) T PRK10070 340 -DAALIDAPLAVDAQTPLSELLSHVGQAPCAV-PVVDEDQQYVGIISKGMLLRALDREGV 397 (400) T ss_pred -HHHHHCCCCCCCCCCCHHHHHHHHHHCCCCE-EEECCCCCEEEEEEHHHHHHHHHHCCC T ss_conf -6675058842399998999999997289963-898799919999987999999875268 No 88 >pfam00478 IMPDH IMP dehydrogenase / GMP reductase domain. This family is involved in biosynthesis of guanosine nucleotide. Members of this family contain a TIM barrel structure. In the inosine monophosphate dehydrogenases 2 CBS domains pfam00571 are inserted in the TIM barrel. This family is a member of the common phosphate binding site TIM barrel family. Probab=98.51 E-value=1.3e-06 Score=67.28 Aligned_cols=165 Identities=19% Similarity=0.166 Sum_probs=115.6 Q ss_pred HHHHHHHHHHHCCC--C----C-HHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECC Q ss_conf 89999996774667--8----9-899999997511378667465340002899608999899999998579978888767 Q gi|255764470|r 273 KADVLLPTQHEKHI--I----S-DQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQG 345 (523) Q Consensus 273 e~~~li~~~~~~g~--l----~-~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~ 345 (523) |.++-++.+.+.|. + + +++.++++.|=+.. ..|+ .+-.++..+.+++++.+...+.++|-+||.++ T Consensus 49 e~~MAiamA~~GGiGVIHrn~sie~Q~~~V~~VK~~e-----~g~i--~~Piti~p~~ti~~~~~l~~~~~~sg~pVv~~ 121 (467) T pfam00478 49 ESRMAIAMAREGGIGVIHKNMSIEEQAEEVRKVKRFE-----SGFI--TDPVTVSPDTTVAEALELKERYGISGVPVTED 121 (467) T ss_pred CHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHCC-----CCCC--CCCCCCCCCCCHHHHHHHHHHHCCCCEEEECC T ss_conf 8999999998898899968999999999999997433-----6821--78753698765999999989728783268207 Q ss_pred CCCCEEEEEEHHHHHHHHHCCCCCHHHHHHCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHC Q ss_conf 85527888888887620202466016887327435238999899999885038627775228674256525999999870 Q gi|255764470|r 346 SLDSFIGIVSARDLLRDLLEEGSMNFKRSIRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIA 425 (523) Q Consensus 346 ~~D~iiGiv~~kDLl~~~~~~~~~~~~~lir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~Iv 425 (523) + ....|++..+|.-...............+++.+.++.++..++.+.|++++..-+.|+|+.|...|++|..|+...-- T Consensus 122 ~-~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~lvt~~~~~~~~ea~~~l~~~kieklpvvd~~g~L~glit~kDi~k~~~ 200 (467) T pfam00478 122 G-KLLGGLVGIRTSRDIDFLSKVSVVMTMTEDLVTAPEGITLEEANEILHEHKIEKLPIVDDDGELVGLITRKDIEKARD 200 (467) T ss_pred C-CEEEEEEEEECCCCCCCCCCCCEEECCCCCEEEECCCCCHHHHHHHHHHHHHHHCCEECCCCCEEEEEEHHHHHHHHC T ss_conf 9-746899877624433445432011013554277248899899999999755641544467883788887434677420 Q ss_pred CCCCCCCCCCCCEEEECCCEEEEEECCCH Q ss_conf 66445788863128834985998700119 Q gi|255764470|r 426 GDFPDEDDQKLDITVGDDGSLTVDGWIDV 454 (523) Q Consensus 426 Gei~DE~d~~~~i~~~~d~~~~v~G~~~l 454 (523) - +...+.++|+++|-+.... T Consensus 201 ~---------P~a~~D~~grL~VgAAVG~ 220 (467) T pfam00478 201 Y---------PNASKDAQGRLLVGAAVGT 220 (467) T ss_pred C---------CCCCCCCCCCEEEEEEECC T ss_conf 7---------8522265677799998067 No 89 >COG0517 FOG: CBS domain [General function prediction only] Probab=98.44 E-value=5.2e-06 Score=62.92 Aligned_cols=110 Identities=26% Similarity=0.394 Sum_probs=95.2 Q ss_pred EEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCH--HHHHHC-CCEEECCC Q ss_conf 653400028996089998999999985799788887678552788888888762020246601--688732-74352389 Q gi|255764470|r 308 SIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMN--FKRSIR-KPLVVHEN 384 (523) Q Consensus 308 ~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~--~~~lir-~~~~Vpe~ 384 (523) ++|. .+...+..+.+..++...+.+.+++++||.++. ..+|++..+|++.....+.... ....+. ++..+.++ T Consensus 3 ~~~~--~~~~~v~~~~~~~~a~~~m~~~~~~~~~v~~~~--~l~Giit~~di~~~~~~~~~~~~~v~~i~~~~~~~~~~~ 78 (117) T COG0517 3 DIMT--KDVITVKPDTSVRDALLLMSENGVSAVPVVDDG--KLVGIITERDILRALAAGGKRLLPVKEVMTKPVVTVDPD 78 (117) T ss_pred CCCC--CCCEEECCCCCHHHHHHHHHHCCCEEEEEEECC--EEEEEEEHHHHHHHHHCCCCCCCHHHHHHCCCCEEECCC T ss_conf 5442--688089899819999999987493389974399--999999989998766416655331788625884677799 Q ss_pred CCHHHHHHHHHH-CCCEEEEEECCCC-CEEEEEEHHHHH Q ss_conf 998999998850-3862777522867-425652599999 Q gi|255764470|r 385 ISVLKLMERLRK-SSQTFVMVLDEYG-VLEGMITPANIL 421 (523) Q Consensus 385 ~~l~~lL~~fr~-~~~~lAiVvDE~G-~~~GIVTleDIl 421 (523) .++.++++.|.+ .+.+...|+|+.+ ...|++|..|++ T Consensus 79 ~~~~~~~~~m~~~~~~~~lpVv~~~~~~lvGivt~~di~ 117 (117) T COG0517 79 TPLEEALELMVERHKIRRLPVVDDDGGKLVGIITLSDIL 117 (117) T ss_pred CCHHHHHHHHHHHCCCCEEEEEECCCCEEEEEEEHHHCC T ss_conf 889999999987267762569988988799898878749 No 90 >TIGR01302 IMP_dehydrog inosine-5'-monophosphate dehydrogenase; InterPro: IPR005990 Synonyms: Inosine-5'-monophosphate dehydrogenase, Inosinic acid dehydrogenase IMP dehydrogenase (1.1.1.205 from EC,IMPDH) catalyzes the rate-limiting reaction of de novo GTP biosynthesis, the NAD-dependent reduction of IMP into XMP .Inosine 5-phosphate + NAD+ + H2O = xanthosine 5-phosphate + NADH IMP dehydrogenase is associated with cell proliferation and is a possible target for cancer chemotherapy. Mammalian and bacterial IMPDHs are tetramers of identical chains. There are two IMP dehydrogenase isozymes in humans . IMP dehydrogenase nearly always contains a long insertion that has two CBS domains within it and adopts a TIM barrel structure.; GO: 0003938 IMP dehydrogenase activity, 0006177 GMP biosynthetic process. Probab=98.32 E-value=2.1e-06 Score=65.81 Aligned_cols=185 Identities=14% Similarity=0.205 Sum_probs=113.7 Q ss_pred CEE--EEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCC------------ Q ss_conf 866--746534000289960899989999999857997888876785527888888887620202466------------ Q gi|255764470|r 303 DRP--AKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGS------------ 368 (523) Q Consensus 303 e~~--V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~------------ 368 (523) ..+ |.++||||.++++.+.+-+++++.+.+.+++..|+||.+++ ++.+|.|..+|+.+....+.. T Consensus 151 ~~~daV~~~MT~~~~~iT~~e~i~~e~A~~~L~~~r~ekLpvVd~~-~~lVgLiT~~Di~~~~~~P~A~kd~vG~~GrL~ 229 (476) T TIGR01302 151 GKKDAVSEVMTPREELITVPEGIDLEEALKVLHKHRIEKLPVVDKD-GELVGLITVKDIVKRREFPHASKDTVGENGRLI 229 (476) T ss_pred CCCCCEEECCCCCCCCEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CCEEEEEEHHHHHHHHHCCCCCCCCCCCCCEEE T ss_conf 8833010112037643484167778999999886086504788278-988999864478898638887788748886089 Q ss_pred ---------CHHHH---H---HCCCEEEC----CCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEH---HHHHHHHCC Q ss_conf ---------01688---7---32743523----89998999998850386277752286742565259---999998706 Q gi|255764470|r 369 ---------MNFKR---S---IRKPLVVH----ENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITP---ANILEAIAG 426 (523) Q Consensus 369 ---------~~~~~---l---ir~~~~Vp----e~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTl---eDIlE~IvG 426 (523) .+.++ + -=..+++- .+....+.++++|+...++-+|.-. ++|. +||+++=+- T Consensus 230 VgAAvg~r~~D~~R~~~L~~AGvDv~viDsshGhs~~vl~~ik~~k~~Yp~~~iiaGN------VaT~~~a~~LI~AgAD 303 (476) T TIGR01302 230 VGAAVGTREDDLERAEALVEAGVDVIVIDSSHGHSIYVLDSIKKIKKTYPDLDIIAGN------VATAEQAKALIDAGAD 303 (476) T ss_pred EEEEECCCCCCHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHCCEEEEEECC------CCCHHHHHHHHHCCCC T ss_conf 9988468986189999999659658998166545378999999998638805799434------4117889889852888 Q ss_pred CCCCC-CC----CCC-----------------CEEEECCCEEEEEECCC-HHHHHHHHCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 64457-88----863-----------------12883498599870011-999999828899998778240789999883 Q gi|255764470|r 427 DFPDE-DD----QKL-----------------DITVGDDGSLTVDGWID-VRYASKLFGVNLVDEDDRYSTLAGFILWRL 483 (523) Q Consensus 427 ei~DE-~d----~~~-----------------~i~~~~d~~~~v~G~~~-l~el~~~l~~~l~~e~~~~~TlaGlil~~l 483 (523) =+.=- .. ... +.-+...=..++||.+. -=|+.++|-.- ++.-=||+|+---- T Consensus 304 g~rVGiGpGSICTTr~V~gVGvPQ~TAv~~Va~~A~~~Gi~VIADGGIr~SGDivKAlAaG-----A~aVMlGslLAGT~ 378 (476) T TIGR01302 304 GLRVGIGPGSICTTRIVAGVGVPQITAVYDVAEYAAQSGIPVIADGGIRYSGDIVKALAAG-----ADAVMLGSLLAGTT 378 (476) T ss_pred EEEECCCCCCCCEEEEEEECCCHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHC-----CCEEHHCCCCCCCC T ss_conf 7898368898110015651276268899999999972799099837756255899999816-----77220234210163 Q ss_pred CCCCCCCCEEEECCEEEEE Q ss_conf 1248899889999999999 Q gi|255764470|r 484 GHLPQEKEIFTEMNLKFEI 502 (523) Q Consensus 484 ~~iP~~Ge~i~~~g~~f~V 502 (523) + .+|+....+|-+|+. T Consensus 379 E---sPGe~~~~nGrryK~ 394 (476) T TIGR01302 379 E---SPGEYYIINGRRYKS 394 (476) T ss_pred C---CCCCEEEECCEEEEE T ss_conf 3---887269966878987 No 91 >KOG1764 consensus Probab=98.10 E-value=2.2e-05 Score=58.29 Aligned_cols=117 Identities=18% Similarity=0.163 Sum_probs=93.8 Q ss_pred EEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC----CCH-H--HHHH-----CCCEEEC Q ss_conf 28996089998999999985799788887678552788888888762020246----601-6--8873-----2743523 Q gi|255764470|r 315 EIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG----SMN-F--KRSI-----RKPLVVH 382 (523) Q Consensus 315 ~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~----~~~-~--~~li-----r~~~~Vp 382 (523) ++..++.+.++.++++++.+.+.|-+||.+.+. +.+|.....|+.....++. +.. + .... -++.... T Consensus 238 ~i~~i~~~~~v~~al~~m~~~~is~lpvV~~~g-~~v~~~s~~Dv~~l~~~~~~~~~~~~~l~~~~~~~~~~~~~vvtc~ 316 (381) T KOG1764 238 NIASISEDTPVIEALKIMSERRISALPVVDENG-KKVGNYSRFDVIHLAREGTYNNLDLSCLSEALSHRPIRFEGVVTCR 316 (381) T ss_pred HHHHHCCCCCHHHHHHHHHHCCCCCCCEECCCC-CEECCEEEEHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCCCCEEEE T ss_conf 777751787089999999870457443475778-5532156200121564277677773242135331554568737984 Q ss_pred CCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCC Q ss_conf 89998999998850386277752286742565259999998706644578 Q gi|255764470|r 383 ENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDED 432 (523) Q Consensus 383 e~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~ 432 (523) .+-++.+.++++.+++.|-+.||||.|..+|+||+.|++..++-.-.... T Consensus 317 ~~ssL~~vi~~lv~~~vHRl~VVd~~~~l~GvvSLsDil~~l~~~p~~~~ 366 (381) T KOG1764 317 PTSTLAEVIDKLVAHRVHRLWVVDEDGVLVGVISLSDILSYLVLTPSGGT 366 (381) T ss_pred ECCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHHHCCCCCC T ss_conf 37749999999987386059998689867987558999999974737789 No 92 >PRK10995 multiple drug resistance protein MarC; Provisional Probab=98.06 E-value=0.0017 Score=44.61 Aligned_cols=190 Identities=14% Similarity=0.176 Sum_probs=98.7 Q ss_pred CHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 14653587899999999999999744399999998517998999999999999999999999999999999742479998 Q gi|255764470|r 2 FFSWIYDYYVWIGLATLIALELVLGIDNLIFITLLVEKLPLAQRGKALVFGLTFAMVTRIALLASLSYWIVMLQQPLFFL 81 (523) Q Consensus 2 ~~~~~~~p~~w~~l~tl~~lEivLs~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l~~pl~~i 81 (523) |||.+. ++-.++.+| .=++=-+-|+-+..-++++.++++|+|........|.+.=+++++... .+++. T Consensus 1 m~~~~~--~~~~~~v~L--f~IinPig~~PiFlslt~~~~~~~r~~ia~~a~~~a~~iLl~f~~~G~--------~iL~~ 68 (222) T PRK10995 1 MLDLFK--AIGLGLVVL--LPLANPLTTVALFLGLSGNMTSEERNRQSLMASVYVFAIMMVAFYAGQ--------LVMDT 68 (222) T ss_pred CHHHHH--HHHHHHHHH--HHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHH T ss_conf 978999--999999999--999671306999999848999999999999999999999999999899--------99999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC------CCCCCCCC-----HH-------HHHHHHHHHHHHHH Q ss_conf 63453222489999999999998655553025764211------11222332-----09-------99999999999998 Q gi|255764470|r 82 KGLSFSGRDIVLILGGFFLLFKGTIELHERLEGDGFDK------KHKFFSPV-----SW-------QVIVLQIVILDLVF 143 (523) Q Consensus 82 ~~~~~s~~~lill~GG~fLl~k~~~el~~~~e~~~~~~------~~~~~~~~-----~~-------~~~v~~I~v~D~vF 143 (523) ++.+.. -+.+.||+.|+..+++.+..+.......+ ........ ++ -+++.++++.-... T Consensus 69 fGIsl~---afrIaGGiiL~~ia~~Ml~~~~~~~~~~~~~~~~~~~~~~~~~~iaivPLAiPllaGPGaIatvi~l~~~~ 145 (222) T PRK10995 69 FGISIP---GLRIAGGLIVAFIGFRMLFPQQKAEDSPEAKSKSEELADEPSANIAFVPLAMPSTAGPGTIAMIISSASTI 145 (222) T ss_pred HCCCHH---HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHC T ss_conf 699899---99999999999999998535554433432234532211122343002101142135846999999998623 Q ss_pred H----HHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHH--HHHHHHHHHHHHHHHHHC Q ss_conf 5----778999998541469999999999999999989999999848528999--999999999999997745 Q gi|255764470|r 144 S----LDSVVTAIGMVQHFSVMAIAVAVSALMMMAVSQPMIRYISRHTTVVIL--CLGFLLMIGFLLIIEGLH 210 (523) Q Consensus 144 S----lDsVitavg~~~~~~i~~~a~~isi~~m~~~s~~i~~~i~~~p~~~~l--~l~fl~~iG~~L~~eg~~ 210 (523) . ..+....++. .+.+..+.+.+-+++..|.++.|++.+...-.+- .-.+|..||+-++.+|.. T Consensus 146 ~~~~~~~~~~~~~~~----~~~i~~~~~~~~~~l~~s~~i~r~LG~~g~~vl~RimGllL~aiaVq~i~~Gi~ 214 (222) T PRK10995 146 REGSTFADWVLMVAP----PLIFLLVALILWGSLRSSGAIMRLLGKGGIEAISRLMGFLLVCMGVQFIINGIL 214 (222) T ss_pred CCCCCHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 543204779999999----999999999999999976999999888899999999999999999999999999 No 93 >COG2905 Predicted signal-transduction protein containing cAMP-binding and CBS domains [Signal transduction mechanisms] Probab=98.05 E-value=3.3e-05 Score=57.08 Aligned_cols=108 Identities=19% Similarity=0.241 Sum_probs=72.8 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHC-CCC--CHHHH-HHCCCEEECCCCCHHH Q ss_conf 0289960899989999999857997888876785527888888887620202-466--01688-7327435238999899 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLE-EGS--MNFKR-SIRKPLVVHENISVLK 389 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~-~~~--~~~~~-lir~~~~Vpe~~~l~~ 389 (523) ..-+.++..+++.++..++.+.|-|-+.+.+++ +...||+.-||+...... +.+ .+... ...||+.|..+..+.+ T Consensus 157 ~~~~~v~~~~~i~~aa~km~~~gv~s~v~l~~~-~~~~GIvT~~dl~~~v~~~g~~~~~~V~evmT~p~~svd~~~~~fe 235 (610) T COG2905 157 LPAVTVSPQASIQDAARKMKDEGVSSLVVLDDS-GPLLGIVTRKDLRSRVIADGRSKTQKVSEVMTSPVISVDRGDFLFE 235 (610) T ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCC-CCCCCEEEHHHHHHHHHHCCCCCCCCHHHHHCCCCEEECCCCHHHH T ss_conf 787526865767999999996088728998379-8743324227778999865897423354541468445447656999 Q ss_pred HHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 9998850386277752286742565259999998 Q gi|255764470|r 390 LMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 390 lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) ++-.|.+++.|-..|.++ |.+.|++|+.||+.- T Consensus 236 Aml~m~r~~I~hl~V~e~-gq~~Gilt~~dIl~l 268 (610) T COG2905 236 AMLMMLRNRIKHLPVTED-GQPLGILTLTDILRL 268 (610) T ss_pred HHHHHHHHCCCEEEEECC-CEEEEEEEHHHHHHH T ss_conf 999999807744236118-904677669999976 No 94 >cd04629 CBS_pair_16 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=98.00 E-value=4.6e-05 Score=56.01 Aligned_cols=52 Identities=27% Similarity=0.411 Sum_probs=35.8 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCC Q ss_conf 2743523899989999988503862777522867425652599999987066 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGD 427 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGe 427 (523) |.|.++.+++++.++++.|++++.+-..|+|+.|...|++|..|++..+..+ T Consensus 1 r~pvtv~~~~tl~~a~~~m~~~~~~~~pVvd~~~~lvGiit~~Dll~~~~~~ 52 (114) T cd04629 1 RNPVTFTPDMSVTEAVEKLLKSKISGGPVVDDNGNLVGFLSEQDCLKQLLES 52 (114) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCEEEEEEEHHHHHHHHHHH T ss_conf 9899979929999999999971997899994899299999678999999775 No 95 >cd04587 CBS_pair_CAP-ED_DUF294_PBI_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with either the CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain or the PB1 (Phox and Bem1p) domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. The PB1 domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pai Probab=97.96 E-value=4.8e-05 Score=55.89 Aligned_cols=65 Identities=15% Similarity=0.182 Sum_probs=48.1 Q ss_pred HHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHH Q ss_conf 99751137866746534000289960899989999999857997888876785527888888887620 Q gi|255764470|r 295 VQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRD 362 (523) Q Consensus 295 i~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~ 362 (523) +.+-++..+.+|+++|+ .+..+++.++++.++.+.+.+++++++||.+++ ++++|+++..|++++ T Consensus 49 ~~~~~~~~~~~v~~iMt--~~~~tv~~~~~l~~a~~~m~~~~i~~lpVvd~~-~~~vGiit~~Dil~A 113 (113) T cd04587 49 VAQGLDPESTLVERVMT--PNPVCATSDTPVLEALHLMVQGKFRHLPVVDKS-GQVVGLLDVTKLTHA 113 (113) T ss_pred HHCCCCHHHCEEEEEEC--CCCEEEECCCCHHHHHHHHHHCCCEEEEEEECC-CEEEEEEEHHHHHCC T ss_conf 87399832089886661--684699079999999999987794199999269-989999984885278 No 96 >KOG0474 consensus Probab=97.96 E-value=1.4e-05 Score=59.70 Aligned_cols=129 Identities=18% Similarity=0.230 Sum_probs=96.8 Q ss_pred CCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCC----CEEEEEEHHHHHHHHHCC--------- Q ss_conf 1378667465340002899608999899999998579978888767855----278888888876202024--------- Q gi|255764470|r 300 TLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLD----SFIGIVSARDLLRDLLEE--------- 366 (523) Q Consensus 300 ~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D----~iiGiv~~kDLl~~~~~~--------- 366 (523) ..+..+++|+|.+ .+++++.-..++.+.+.++++.|.-|||.++..+ .+.|++..+.++...... T Consensus 578 ~mr~L~a~ev~~~--pvi~l~~~ekV~~Iv~vLk~t~HngFPVvd~~~~~~~~~l~GlILRshl~vlL~~~~f~~~~~~~ 655 (762) T KOG0474 578 YMRNLTAGEVMSK--PVICLNRVEKVAVIVDVLKSTNHNGFPVVDEPPSNEAGRLHGLILRSHLLVLLKKRVFVEESRST 655 (762) T ss_pred HHHHHHHHHHCCC--CEEEEECHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 7653117664268--85887322109999999873476887523578876102366888999999998744542057665 Q ss_pred ----------------CCCHH----------------HHHH-CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEE Q ss_conf ----------------66016----------------8873-27435238999899999885038627775228674256 Q gi|255764470|r 367 ----------------GSMNF----------------KRSI-RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEG 413 (523) Q Consensus 367 ----------------~~~~~----------------~~li-r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~G 413 (523) +..++ ..++ -.|+.||+++++..++..||+=.-+--+|++.-.+..| T Consensus 656 ~~~~~~~~~~~~d~a~r~~~i~dv~lt~~e~~~yvDl~p~~n~sPytV~~~mSl~k~~~lFR~lGLRhLlVv~~~~~~~g 735 (762) T KOG0474 656 FDLPVRRKFTFRDFAKREPSIEDVHLTSEEMEMYVDLHPFMNPSPYTVPETMSLAKAFILFRQLGLRHLLVVPKTNRVVG 735 (762) T ss_pred CCCCHHHCCCHHHHHHCCCCHHHHHCCHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCEEEEEECCCCCEEE T ss_conf 67604303787776441785565541667686430553346999965474310578999999735015898517786366 Q ss_pred EEEHHHHH-HHHCCCCCC Q ss_conf 52599999-987066445 Q gi|255764470|r 414 MITPANIL-EAIAGDFPD 430 (523) Q Consensus 414 IVTleDIl-E~IvGei~D 430 (523) ++|..|+. +.+.|-.++ T Consensus 736 ilTR~D~~~~~~l~~~~~ 753 (762) T KOG0474 736 ILTRKDLARYRILGLEPH 753 (762) T ss_pred EEEHHHHHHHHHHCCCCC T ss_conf 775664346777325666 No 97 >cd04593 CBS_pair_EriC_assoc_bac_arch This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in bacteria and archaea. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS d Probab=97.93 E-value=0.00014 Score=52.58 Aligned_cols=82 Identities=22% Similarity=0.226 Sum_probs=48.3 Q ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEE-ECCCEEEEEECCCHH Q ss_conf 7435238999899999885038627775228674256525999999870664457888631288-349859987001199 Q gi|255764470|r 377 KPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITV-GDDGSLTVDGWIDVR 455 (523) Q Consensus 377 ~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~-~~d~~~~v~G~~~l~ 455 (523) ||.+|++++++.++++.|.+++.+-..|+|+.|...||+|..|+.+.+..+-. .....+.. -...-..+.-..++. T Consensus 2 pP~tV~~~~tv~ea~~~m~~~~~~~~pVvD~~g~lvGIvT~~Dl~r~~~~~~~---~~~~~v~~vmt~~~~tv~pd~~l~ 78 (115) T cd04593 2 PPPVLSATTPLREAAEQLIESKHGSALVVDRDGGVVGIITLPDLLRALEADEA---GEPSAVDEVATPPLLTVHPDEPLA 78 (115) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHHHCCCC---CCCCEEEEEECCCCEEECCCCHHH T ss_conf 78388991999999999987499899999499988999889999999973888---667794543038966899999499 Q ss_pred HHHHHH Q ss_conf 999982 Q gi|255764470|r 456 YASKLF 461 (523) Q Consensus 456 el~~~l 461 (523) +..+.+ T Consensus 79 ~al~~M 84 (115) T cd04593 79 HALDRM 84 (115) T ss_pred HHHHHH T ss_conf 999999 No 98 >cd04640 CBS_pair_27 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.89 E-value=3.4e-05 Score=56.98 Aligned_cols=45 Identities=27% Similarity=0.471 Sum_probs=23.0 Q ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHH Q ss_conf 743523899989999988503862777522867425652599999 Q gi|255764470|r 377 KPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANIL 421 (523) Q Consensus 377 ~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIl 421 (523) +|..+.+++++.++++.|++++....+|+|+.|...||+|..|+. T Consensus 2 ~Pvtv~~dtti~eA~~~M~~~~v~~l~VvD~~~~lvGIiT~~Dl~ 46 (126) T cd04640 2 KPIVIPADTSIDEALELMIKHGVRLLLVVDSDDNFIGVITAVDLL 46 (126) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEECCCCC T ss_conf 896968999599999999983976798987999799999763001 No 99 >cd04622 CBS_pair_9 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=97.89 E-value=2.6e-05 Score=57.85 Aligned_cols=61 Identities=23% Similarity=0.269 Sum_probs=50.4 Q ss_pred HCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHH Q ss_conf 1137866746534000289960899989999999857997888876785527888888887620 Q gi|255764470|r 299 LTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRD 362 (523) Q Consensus 299 L~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~ 362 (523) .+..+.+|+++|+ .++.+++.++++.++.+.+.+++++++||.+++ ++++|++...|++++ T Consensus 53 ~~~~~~~v~~iM~--~~~i~v~~~~~l~~a~~~m~~~~i~~lpVvd~~-~~~vGiIt~~Dilra 113 (113) T cd04622 53 RDPDTTTVGDVMT--RGVVTVTEDDDVDEAARLMREHQVRRLPVVDDD-GRLVGIVSLGDLARA 113 (113) T ss_pred CCHHHCCHHHHHC--CCCEEEECCCCHHHHHHHHHHCCCCEEEEEECC-CEEEEEEEHHHHHCC T ss_conf 9820077678641--576799889987999999985796589999089-989999997995477 No 100 >cd04602 CBS_pair_IMPDH_2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the inosine 5' monophosphate dehydrogenase (IMPDH) protein. IMPDH is an essential enzyme that catalyzes the first step unique to GTP synthesis, playing a key role in the regulation of cell proliferation and differentiation. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in IMPDH have been associated with retinitis pigmentos Probab=97.87 E-value=4.8e-05 Score=55.89 Aligned_cols=62 Identities=23% Similarity=0.379 Sum_probs=54.4 Q ss_pred HCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHH Q ss_conf 113786674653400028996089998999999985799788887678552788888888762 Q gi|255764470|r 299 LTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLR 361 (523) Q Consensus 299 L~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~ 361 (523) .+-.+.+++++|||+...++.+.+.+++++.+.+.++++.|+||.+++ ++++|++..+|+++ T Consensus 52 ~~~~~~~v~~iMt~~~~~i~~~~~~~l~ea~~~m~~~~i~~LPVVd~~-g~LvGiIT~~Di~k 113 (114) T cd04602 52 LTDSETPLSEVMTPREVLVVAPTGITLEEANEILRESKKGKLPIVNDD-GELVALVTRSDLKK 113 (114) T ss_pred CCCCCCCCCCEEECCCEEEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEECHHHCC T ss_conf 507676563516527668994798789999999997496678199789-96999999244036 No 101 >PRK05326 potassium/proton antiporter; Reviewed Probab=97.87 E-value=0.00028 Score=50.25 Aligned_cols=71 Identities=18% Similarity=0.344 Sum_probs=63.5 Q ss_pred CCCEEEEEECCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEECCEEEEEEEEECCEEEEEEEEE Q ss_conf 4985998700119999998288999987782407899998831248899889999999999996188778999994 Q gi|255764470|r 442 DDGSLTVDGWIDVRYASKLFGVNLVDEDDRYSTLAGFILWRLGHLPQEKEIFTEMNLKFEIIRLEGHNIDRVKVSG 517 (523) Q Consensus 442 ~d~~~~v~G~~~l~el~~~l~~~l~~e~~~~~TlaGlil~~l~~iP~~Ge~i~~~g~~f~V~~~~~~rI~~V~V~~ 517 (523) .+|.|.++|..++.|+....|. + +...|++.|+.+++++.|..||++.+++++..|.+++++||.+|-+.. T Consensus 491 ~~g~f~~~~~~~~~~l~~~yg~--~---~~~~tl~~~~~~~~~~~~~~gd~~~~~~~~l~v~~~~~~~i~~vgl~~ 561 (563) T PRK05326 491 DRGDFILEGEAKLADLALIYGL--R---DKQQTLAELVQQRLGGAPVVGDQVELGGIIWTVAELDDGRVTKIGLRL 561 (563) T ss_pred CCCCEEECCCCCHHHHHHHHCC--C---CCCCCHHHHHHHHHCCCCCCCCEEEECCEEEEEEECCCCEEEEEEEEC T ss_conf 6871677687658999987288--8---667869999999838999988865657759999962299898998646 No 102 >cd04625 CBS_pair_12 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.86 E-value=4e-05 Score=56.45 Aligned_cols=60 Identities=18% Similarity=0.340 Sum_probs=51.1 Q ss_pred HCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHH Q ss_conf 1137866746534000289960899989999999857997888876785527888888887620 Q gi|255764470|r 299 LTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRD 362 (523) Q Consensus 299 L~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~ 362 (523) .+..+.+|+++|+ .+..+++.+++++++.+.+.+++.+++||.++ ++++|+++.+|++++ T Consensus 53 ~~~~~~~V~~iMt--~~~~tv~~~~~i~~a~~~M~~~~i~~lpVvd~--g~lvGiit~~Di~rA 112 (112) T cd04625 53 AGVLDTTVRAIMN--PEPIVASPDDSIDEVRRLMVERHLRYLPVLDG--GTLLGVISFHDVAKA 112 (112) T ss_pred CCCCCCCHHHHEE--CCCEEECCCCHHHHHHHHHHHCCCEEEEEEEC--CEEEEEEEHHHHHCC T ss_conf 9801170888334--89779989996999999768669769999999--999999997996359 No 103 >cd04630 CBS_pair_17 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.84 E-value=5.1e-05 Score=55.68 Aligned_cols=60 Identities=18% Similarity=0.189 Sum_probs=51.7 Q ss_pred HCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHH Q ss_conf 1137866746534000289960899989999999857997888876785527888888887620 Q gi|255764470|r 299 LTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRD 362 (523) Q Consensus 299 L~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~ 362 (523) -+..+.+|+++|+ .++.+++.+++++++.+.+.+.+..++||.++ ++++|++..+|++++ T Consensus 55 ~~~~~~~V~~vMt--~~v~tv~~d~~~~~a~~~m~~~~i~~lpVvd~--~~lvGiit~~Dil~A 114 (114) T cd04630 55 RDPDRVNVYEIMT--KPLISVSPDMDIKYCARLMERTNIRRAPVVEN--NELIGIISLTDIFLA 114 (114) T ss_pred CCCCCCCHHHHCC--CCEEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHCC T ss_conf 8944449868270--35499999494999999999779729999999--999999976896479 No 104 >cd04592 CBS_pair_EriC_assoc_euk This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually Probab=97.84 E-value=0.00023 Score=50.90 Aligned_cols=47 Identities=21% Similarity=0.338 Sum_probs=36.1 Q ss_pred CEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHH Q ss_conf 43523899989999988503862777522867425652599999987 Q gi|255764470|r 378 PLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAI 424 (523) Q Consensus 378 ~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~I 424 (523) -+.|+.++++.++++.|..++++.++|||+.|...||||..||.... T Consensus 3 ~vtV~p~ttv~EA~~lM~~~~~~~~~VVD~~~~L~GIvt~gDi~R~~ 49 (133) T cd04592 3 YIKVSPTTTLKEALNLMLDEKQSCVLVVDSDDFLEGILTLGDIQRFL 49 (133) T ss_pred EEEECCCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEEHHHHHHHH T ss_conf 59989998599999999981886579983899789997879999998 No 105 >PRK11111 hypothetical protein; Provisional Probab=97.83 E-value=0.0041 Score=41.74 Aligned_cols=175 Identities=15% Similarity=0.148 Sum_probs=91.3 Q ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99997443999999985179989999999999999999999999999999997424799986345322248999999999 Q gi|255764470|r 21 LELVLGIDNLIFITLLVEKLPLAQRGKALVFGLTFAMVTRIALLASLSYWIVMLQQPLFFLKGLSFSGRDIVLILGGFFL 100 (523) Q Consensus 21 lEivLs~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l~~pl~~i~~~~~s~~~lill~GG~fL 100 (523) .=++=-+-|+-+..-++++.+++||+|........|.+.=..+++.. +++++.++.... -+.+.||+.| T Consensus 18 f~iinPig~~pifl~lt~~~~~~~r~~ia~~a~~~a~~il~~f~~~G--------~~iL~~fGIsl~---afrIaGGiiL 86 (214) T PRK11111 18 FALVNPVGILPVFISMTSHQTAAERNKTNLTANFSVAIILLISLFLG--------DFILNLFGISID---SFRIAGGILV 86 (214) T ss_pred HHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHCCCHH---HHHHHHHHHH T ss_conf 99975200699999996799999999999999999999999999988--------999999699899---9999949999 Q ss_pred HHHHHHHHHHHCCCCCCC--CCCCCCCCCHH------------HHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHH--HHH Q ss_conf 999865555302576421--11122233209------------99999999999998577899999854146999--999 Q gi|255764470|r 101 LFKGTIELHERLEGDGFD--KKHKFFSPVSW------------QVIVLQIVILDLVFSLDSVVTAIGMVQHFSVM--AIA 164 (523) Q Consensus 101 l~k~~~el~~~~e~~~~~--~~~~~~~~~~~------------~~~v~~I~v~D~vFSlDsVitavg~~~~~~i~--~~a 164 (523) +..+.+.++.+....... +........+. .+++.++. ++|=+ +-+..+.+.+. +.. T Consensus 87 ~~ial~Ml~g~~~~~~~~~~e~~~~~~~~~iaivPLAiPllaGPGaIttvi----~~~~~----~~~~~~~~~~~~ai~~ 158 (214) T PRK11111 87 VSIAMSMISGKLGEDKQNKQEKSETAVRESIGVVPLALPLMAGPGAISSTI----VWGTR----YHSWSNLLGFSVAIAL 158 (214) T ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH----HHHHH----CCCHHHHHHHHHHHHH T ss_conf 999999864865665566221432112233210222454146858999999----99864----5648899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCHHHH--HHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 999999999998999999984852899--99999999999999977457337 Q gi|255764470|r 165 VAVSALMMMAVSQPMIRYISRHTTVVI--LCLGFLLMIGFLLIIEGLHFDIP 214 (523) Q Consensus 165 ~~isi~~m~~~s~~i~~~i~~~p~~~~--l~l~fl~~iG~~L~~eg~~~~ip 214 (523) +.+...+.+..|.++.|++.+.-.-.+ +.-.+|..||+.++.+|..--.| T Consensus 159 ~~~~~~~~l~~a~~i~r~LG~~g~~vi~RlmGliL~aiaVq~i~~Gi~~~fp 210 (214) T PRK11111 159 FALCCWGLFRMAPWLVRLLGQTGINVITRIMGLLLMALGIEFIVTGIKGLFP 210 (214) T ss_pred HHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999899999998888999999999999999999999999999978 No 106 >TIGR01186 proV glycine betaine/L-proline transport ATP binding subunit; InterPro: IPR005892 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family comprises the glycine betaine/L-proline ATP binding subunit in bacteria and its equivalents in archaea. This transport system belong to the larger ATP-Binding Cassette (ABC) transporter superfamily. The characteristic feature of these transporters is the obligatory coupling of ATP hydrolysis to substrate translocation. Functionally, this transport system is involved in osmoregulation. Under conditions of stress, the organism recruits this transport system to accumulate glycine betaine and other solutes which offer osmo-protection. It has been demonstrated that glycine betaine uptake is accompanied by symport with sodium ions. The locus has been named variously as proU or opuA. A gene library from L. lactis functionally complements an Escherichia coli proU mutant. The complementing locus is similar to a opuA locus in Bacillus subtlis. This clarifies the differences in nomenclature.; GO: 0005524 ATP binding, 0015171 amino acid transmembrane transporter activity, 0006865 amino acid transport, 0016020 membrane. Probab=97.82 E-value=0.00063 Score=47.69 Aligned_cols=112 Identities=18% Similarity=0.193 Sum_probs=95.0 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHHHHHHCCCEEECCCCCHHHHHHH Q ss_conf 02899608999899999998579978888767855278888888876202024660168873274352389998999998 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNFKRSIRKPLVVHENISVLKLMER 393 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~~lir~~~~Vpe~~~l~~lL~~ 393 (523) +.-+....+.++.++++.+++.+-+-+-|.+.. ....|+|++.++-.+...... --+-+...++.|++.|.+.+.+.. T Consensus 253 ~~~i~~t~~~gp~~Al~Lm~~~~~~s~yvv~~~-~~l~G~v~~~~~~~a~~~~~~-~~~~l~~~~~tV~~~t~L~e~~~~ 330 (372) T TIGR01186 253 TVPITKTADKGPRSALKLMRDERVDSLYVVDRQ-NKLVGVVDVESIKQARKKAQS-LQDVLIDDILTVDEGTLLRELLRK 330 (372) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECC-CEEEEEEEHHHHHHHHHHHHH-HHHHHHHCCCCCCCCCCHHHHHHH T ss_conf 732686589887899999986597179999728-548877858899988878778-999862001110587604899999 Q ss_pred HHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCC Q ss_conf 8503862777522867425652599999987066 Q gi|255764470|r 394 LRKSSQTFVMVLDEYGVLEGMITPANILEAIAGD 427 (523) Q Consensus 394 fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGe 427 (523) -.+.++...-||||.-...||||-.-++..+.|. T Consensus 331 v~~~~~~YvpVVDE~~~~~GIv~r~~L~~~l~d~ 364 (372) T TIGR01186 331 VLKAGIKYVPVVDEDQRLVGIVTRASLVDALYDS 364 (372) T ss_pred HHHCCCCEEEEECCCCCEEEEEEHHHHHHHHHCC T ss_conf 8507995666661456458676477899987166 No 107 >cd04613 CBS_pair_SpoIVFB_EriC_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with either the SpoIVFB domain (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) or the chloride channel protein EriC. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A ), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. EriC is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase a Probab=97.78 E-value=0.00021 Score=51.15 Aligned_cols=84 Identities=20% Similarity=0.185 Sum_probs=53.3 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHH Q ss_conf 27435238999899999885038627775228674256525999999870664457888631288349859987001199 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVR 455 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~ 455 (523) |.|..+++++++.++++.|++++.+...|+||.|...|++|..|+.+.+.-+-..+.-.-.++ ...+-..++-..++. T Consensus 1 rdvvtv~~~~tl~ea~~~m~~~~~~~~pVvd~~~~lvGiit~~Dl~~~l~~~~~~~~~~v~~i--m~~~~~~v~~~~~l~ 78 (114) T cd04613 1 RDVVTIPEDTPLNELLDVIAHSPENNFPVVDDDGRLVGIVSLDDIREILFDPSLYDLVVASDI--MTKPPVVVYPEDSLE 78 (114) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHHHHCCCCCCCEEEHH--CCCCCEEEECCCCHH T ss_conf 997695890989999999986199789999289949999999999999970566557293120--362965991367889 Q ss_pred HHHHHH Q ss_conf 999982 Q gi|255764470|r 456 YASKLF 461 (523) Q Consensus 456 el~~~l 461 (523) +..+.+ T Consensus 79 ~a~~~m 84 (114) T cd04613 79 DALKKF 84 (114) T ss_pred HHHHHH T ss_conf 999999 No 108 >cd04624 CBS_pair_11 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.77 E-value=0.00027 Score=50.43 Aligned_cols=83 Identities=23% Similarity=0.208 Sum_probs=55.5 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHH Q ss_conf 27435238999899999885038627775228674256525999999870664457888631288349859987001199 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVR 455 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~ 455 (523) ||+..+++++++.++++.|++++.+.++|+|+.|...|++|-.|++..+...... +.+ --.-...+-..++...++. T Consensus 1 r~vvtv~~~~tl~ea~~~m~~~~~~~~~Vvd~~~~lvGiiT~~Di~~~~~~~~~~--~~~-v~~~m~~~~~tv~~~~~l~ 77 (112) T cd04624 1 RPVVTVDPDTSIREAAKLMAEENVGSVVVVDPDERPIGIVTERDIVRAVAAGIDL--DTP-VSEIMTRDLVTVDPDEPVA 77 (112) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEECCHHHHHHHCCCCC--CCC-EEEEECCCCEEECCCCCHH T ss_conf 9978978939599999999862998899993998299999820458898648743--442-0456516716787999899 Q ss_pred HHHHHH Q ss_conf 999982 Q gi|255764470|r 456 YASKLF 461 (523) Q Consensus 456 el~~~l 461 (523) +..+.+ T Consensus 78 ~a~~~m 83 (112) T cd04624 78 EAAKLM 83 (112) T ss_pred HHHHHH T ss_conf 999999 No 109 >cd04586 CBS_pair_BON_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the BON (bacterial OsmY and nodulation domain) domain. BON is a putative phospholipid-binding domain found in a family of osmotic shock protection proteins. It is also found in some secretins and a group of potential haemolysins. Its likely function is attachment to phospholipid membranes. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.75 E-value=4.7e-05 Score=55.93 Aligned_cols=61 Identities=28% Similarity=0.384 Sum_probs=37.8 Q ss_pred HHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHH Q ss_conf 75113786674653400028996089998999999985799788887678552788888888762 Q gi|255764470|r 297 SVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLR 361 (523) Q Consensus 297 ~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~ 361 (523) +.......+|+++|+ .++.+++.+++++++.+.+.++++.++||.++ ++++|++..+|+++ T Consensus 74 ~~~~~~~~~V~~iM~--~~~~~v~~~~~l~~~~~~m~~~~i~~lpVvd~--~~lvGiIt~~Dilk 134 (135) T cd04586 74 AFVRSHGRKVADVMT--RPVVTVGEDTPLAEVAELMEEHRIKRVPVVRG--GRLVGIVSRADLLR 134 (135) T ss_pred HHCCCCCCCHHHHCC--CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHC T ss_conf 430447989899578--99879959795999999999759609999999--99999999799645 No 110 >cd04595 CBS_pair_DHH_polyA_Pol_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with an upstream DHH domain which performs a phosphoesterase function and a downstream polyA polymerase domain. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.75 E-value=0.00014 Score=52.40 Aligned_cols=57 Identities=32% Similarity=0.494 Sum_probs=39.4 Q ss_pred CCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHH Q ss_conf 3786674653400028996089998999999985799788887678552788888888762 Q gi|255764470|r 301 LADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLR 361 (523) Q Consensus 301 L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~ 361 (523) +.+.+|.++|+ .++.+++.+++++++.+.+.++++.|+||.++ ++++|++..+|+++ T Consensus 53 ~~~~~v~~iM~--~~~~tv~~~~~i~~a~~~m~~~~~~~lpVvd~--g~lvGiit~~Dilk 109 (110) T cd04595 53 LGHAPVKDYMS--TDVVTVPPDTPLSEVQELMVEHDIGRVPVVED--GRLVGIVTRTDLLR 109 (110) T ss_pred CCCCCCEEEEE--CCEEEECCCCCHHHHHHHHHHCCCEEEEEEEC--CEEEEEEEHHHEEC T ss_conf 67881210325--36688669993999999889749139999989--99999998415615 No 111 >cd04600 CBS_pair_HPP_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the HPP motif domain. These proteins are integral membrane proteins with four transmembrane spanning helices. The function of these proteins is uncertain, but they are thought to be transporters. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.73 E-value=0.00028 Score=50.30 Aligned_cols=53 Identities=23% Similarity=0.241 Sum_probs=45.1 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCC Q ss_conf 27435238999899999885038627775228674256525999999870664 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDF 428 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei 428 (523) |++.++++++++.++++.|++++.+-..|+|+.|...|++|..|+++.+..+. T Consensus 2 rdvvtv~~~~tl~~a~~~m~~~~~~~lpVvd~~g~lvGiit~~Dll~~~~~~~ 54 (124) T cd04600 2 RDVVTVTPDTSLEEAWALLRRHRIKALPVVDGDRRLVGIVTQRDLLRHARPDG 54 (124) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCEEEEEEHHHHHHHHHCCC T ss_conf 89699899397999999999729978999956980888856999998775046 No 112 >PRK01862 putative voltage-gated ClC-type chloride channel ClcB; Provisional Probab=97.72 E-value=0.0059 Score=40.60 Aligned_cols=90 Identities=17% Similarity=0.173 Sum_probs=62.9 Q ss_pred HHHHHH-CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEE Q ss_conf 168873-2743523899989999988503862777522867425652599999987066445788863128834985998 Q gi|255764470|r 370 NFKRSI-RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTV 448 (523) Q Consensus 370 ~~~~li-r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v 448 (523) +.++.+ +....+++++++.++.+.|++++.+-..|+|+.|...|+|+++|+...+..+-..+...-.++.+.+ --.+ T Consensus 457 ~v~dvm~~~~~~v~~~~~l~e~~~~~~~~~~~~~~Vvd~~g~l~Giv~l~dl~~~~~~~~~~~~~~~~~~~~~~--~~~l 534 (583) T PRK01862 457 QMRELIQPAQTVVPLTASVADMTRVFLEYPVKYLYVTDDDGRFRGAVALKDITSDLLDKRDTTDKTAADYAHTP--FPLL 534 (583) T ss_pred CHHHHCCCCCCEECCCCCHHHHHHHHHHCCCEEEEEECCCCEEEEEEEHHHHHHHHHCCCCCCCCCHHHHHCCC--CCEE T ss_conf 09996387882349999899999999847970799988999799997899988775163555646099981799--9578 Q ss_pred EECCCHHHHHHHH Q ss_conf 7001199999982 Q gi|255764470|r 449 DGWIDVRYASKLF 461 (523) Q Consensus 449 ~G~~~l~el~~~l 461 (523) .-+.++.+.-+.+ T Consensus 535 ~~~~sL~~al~~f 547 (583) T PRK01862 535 TPDMPLRDALEHF 547 (583) T ss_pred CCCCCHHHHHHHH T ss_conf 8999899999999 No 113 >cd04619 CBS_pair_6 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=97.72 E-value=6.6e-05 Score=54.88 Aligned_cols=56 Identities=23% Similarity=0.356 Sum_probs=40.5 Q ss_pred CEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHH Q ss_conf 86674653400028996089998999999985799788887678552788888888762 Q gi|255764470|r 303 DRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLR 361 (523) Q Consensus 303 e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~ 361 (523) +.+|+++|| .++++++.+++..++.+.+.+.+.+++||.+++ ..++|+++.+|+++ T Consensus 58 ~~~V~~vMt--~~vitv~~~~~i~~a~~~M~~~~i~~lPVVD~~-~~~vGiit~~Dilk 113 (114) T cd04619 58 TAPVENVMT--RAVVSCRPGDLLHDVWQVMKQRGLKNIPVVDEN-ARPLGVLNARDALK 113 (114) T ss_pred CCCHHHHCC--CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECC-CEEEEEEEHHHEEC T ss_conf 789999615--898899998909999998887698588999079-95999999354542 No 114 >cd04617 CBS_pair_4 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=97.70 E-value=0.0003 Score=50.06 Aligned_cols=50 Identities=34% Similarity=0.470 Sum_probs=30.7 Q ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCC Q ss_conf 74352389998999998850386277752286742565259999998706 Q gi|255764470|r 377 KPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAG 426 (523) Q Consensus 377 ~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvG 426 (523) ||.++++++++.++.+.|.+++..-..|+||.|...||+|-.|++....+ T Consensus 2 pPvtv~~~~si~eA~~~M~~~~ig~l~VVD~~~~LvGIiT~rDl~r~~~~ 51 (118) T cd04617 2 PPVVVRENTSVYDAIVTLFLEDVGSLFVVDEDGDLVGVVSRKDLLKASIG 51 (118) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEEHHHHHHHHHC T ss_conf 79798996999999999998399879998499959999878999999974 No 115 >cd04623 CBS_pair_10 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.70 E-value=0.00033 Score=49.79 Aligned_cols=56 Identities=25% Similarity=0.427 Sum_probs=35.8 Q ss_pred CCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHH Q ss_conf 786674653400028996089998999999985799788887678552788888888762 Q gi|255764470|r 302 ADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLR 361 (523) Q Consensus 302 ~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~ 361 (523) .+.+|+++|++ ++.+++.++++.++.+.+.+++.+++||.++ ++++|++...|+++ T Consensus 57 ~~~~V~~iM~~--~~~~v~~~~~l~~a~~~m~~~~i~~lpV~d~--~~lvGiit~~Dllr 112 (113) T cd04623 57 LDTPVSEIMTR--NVITVTPDDTVDEAMALMTERRFRHLPVVDG--GKLVGIVSIGDVVK 112 (113) T ss_pred HHCHHHHHCCC--EEEEEECCCCHHHHHHHHHHCCCEEEEEEEC--CEEEEEEEHHHHHC T ss_conf 66401231211--0289916991999999999769259999999--99999999799646 No 116 >PRK10739 putative dITP- and XTP- hydrolase; Provisional Probab=97.69 E-value=0.0065 Score=40.31 Aligned_cols=168 Identities=15% Similarity=0.246 Sum_probs=95.3 Q ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99744399999998517998999999999999999999999999999999742479998634532224899999999999 Q gi|255764470|r 23 LVLGIDNLIFITLLVEKLPLAQRGKALVFGLTFAMVTRIALLASLSYWIVMLQQPLFFLKGLSFSGRDIVLILGGFFLLF 102 (523) Q Consensus 23 ivLs~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l~~pl~~i~~~~~s~~~lill~GG~fLl~ 102 (523) ++=-+-|+-+..-++++.++++|+|........|.+.=+++.++..+ +++.++.+... ..+.||+.|+. T Consensus 14 iinPig~~PiFlslt~~~~~~~r~~ia~~a~~~a~~il~~F~~~G~~--------il~~fgIsi~a---frIaGGilL~~ 82 (197) T PRK10739 14 IMDPLGNLPIFMSVLKHLEPKRRRAIMIRELLIALLVMLVFLFAGEK--------ILAFLNLRAET---VSISGGIILFL 82 (197) T ss_pred HHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHCCCHHH---HHHHHHHHHHH T ss_conf 97621069999998479999999999999999999999999998999--------99994998899---99987899999 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCC-----CCHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHH---HHHH Q ss_conf 98655553025764211112223-----3209---99999999999998577899999854146999999999---9999 Q gi|255764470|r 103 KGTIELHERLEGDGFDKKHKFFS-----PVSW---QVIVLQIVILDLVFSLDSVVTAIGMVQHFSVMAIAVAV---SALM 171 (523) Q Consensus 103 k~~~el~~~~e~~~~~~~~~~~~-----~~~~---~~~v~~I~v~D~vFSlDsVitavg~~~~~~i~~~a~~i---si~~ 171 (523) .+++.++.+.+.+..+..+.... +.++ -+++.++++. |=+ ..++...++.++.+ ...+ T Consensus 83 ia~~Ml~~~~~~~~~~~~~~~~~~ivPLAiPllaGPGaIttvi~l----~~~-------~~~~~~~~~~ai~~~~~~~~~ 151 (197) T PRK10739 83 IAIKMIFPSAEGNSSGLPAGEEPFLVPLAIPLVAGPSILATLMLL----SHQ-------YPNQMGHLVIALLIAWGGTFV 151 (197) T ss_pred HHHHHHCCCCCCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHH----HHH-------CCHHHHHHHHHHHHHHHHHHH T ss_conf 999986266656655556555420230015322586899999999----830-------422479999999999999999 Q ss_pred HHHHHHHHHHHHHHCCHHHH--HHHHHHHHHHHHHHHHHHCCC Q ss_conf 99998999999984852899--999999999999999774573 Q gi|255764470|r 172 MMAVSQPMIRYISRHTTVVI--LCLGFLLMIGFLLIIEGLHFD 212 (523) Q Consensus 172 m~~~s~~i~~~i~~~p~~~~--l~l~fl~~iG~~L~~eg~~~~ 212 (523) .+..|..+.|++.+.-.-++ +.-.+|..||+-.+.+|...- T Consensus 152 ~l~~s~~i~r~lG~~G~~ii~RlmGliLaaIavq~i~~Gik~~ 194 (197) T PRK10739 152 ILLQSSLFLRLLGEKGVNALERLMGLILVMMSTQMFLDGIRMW 194 (197) T ss_pred HHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999899999979899999999999999999999999999987 No 117 >cd04594 CBS_pair_EriC_assoc_archaea This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the EriC CIC-type chloride channels in archaea. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS do Probab=97.69 E-value=0.00015 Score=52.32 Aligned_cols=57 Identities=16% Similarity=0.233 Sum_probs=49.7 Q ss_pred CCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHH Q ss_conf 7866746534000289960899989999999857997888876785527888888887620 Q gi|255764470|r 302 ADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRD 362 (523) Q Consensus 302 ~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~ 362 (523) ...+|+++|| .++.++..+++++++.+.+.+++.+|+||.++ ++++|++..+|++.+ T Consensus 48 ~~~~v~d~Mt--~~v~tv~p~~~l~ea~~lM~~~~i~~lPVvd~--gklvGIIT~~Dil~A 104 (104) T cd04594 48 TYGDVVDYIV--RGIPYVRLTSTAEEAWEVMMKNKTRWCPVVDD--GKFKGIVTLDSILDA 104 (104) T ss_pred CCCCHHHEEE--CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHCC T ss_conf 6998445476--39888999897999999999869787899989--999999996895378 No 118 >cd04615 CBS_pair_2 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=97.68 E-value=0.00032 Score=49.86 Aligned_cols=50 Identities=24% Similarity=0.293 Sum_probs=39.0 Q ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCC Q ss_conf 74352389998999998850386277752286742565259999998706 Q gi|255764470|r 377 KPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAG 426 (523) Q Consensus 377 ~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvG 426 (523) +|.++++++++.++++.|++++.+-+.|+|+.|...|++|..|++..... T Consensus 2 ~P~tv~~d~~l~~A~~~m~~~~~~~~pVvd~~~~lvGivT~~Di~~~~~~ 51 (113) T cd04615 2 KPSCVVLNTDIARAVAEMYTSGSRALPVVDDKKRLVGIITRYDVLSYALE 51 (113) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHHHH T ss_conf 59598996999999999997399789999489979999995999999980 No 119 >KOG0475 consensus Probab=97.67 E-value=0.0069 Score=40.10 Aligned_cols=240 Identities=18% Similarity=0.229 Sum_probs=136.5 Q ss_pred HHHHHHHHHHHHHHCCHH------------HHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHH--------HHHHHHHHHH Q ss_conf 999989999999848528------------99999999999999999774573370278899--------8999999999 Q gi|255764470|r 172 MMAVSQPMIRYISRHTTV------------VILCLGFLLMIGFLLIIEGLHFDIPKGYLYAS--------IGFSGIIEFF 231 (523) Q Consensus 172 m~~~s~~i~~~i~~~p~~------------~~l~l~fl~~iG~~L~~eg~~~~ipk~y~y~~--------~~fs~~ve~~ 231 (523) -...+..+..+.|+-|.. -+--|.+...++..+.+=-||..+|-||.--+ ++.++++|.+ T Consensus 368 r~~~~e~i~~Lf~~C~~~~s~~l~~~~~~~~~~~L~~a~i~k~~~t~ftfGakvP~GifvPSmaiGA~aGr~vG~~~e~l 447 (696) T KOG0475 368 RFNMSELITILFNKCSPSSSTSLPETTVYSAAILLLLALILKIALTIFTFGAKVPAGIFVPSLAIGAIAGRAVGIIVEQL 447 (696) T ss_pred HHCHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEECCHHHHHHHHHHHHHHHHHHH T ss_conf 62587789999861677666667654049999999999999999999655136763155313888888888999999987 Q ss_pred HHHHHHHHHHHHHH-------------------------HHHHHHHH--HHHHHHHCCCCCCCCCCHHHH-HHHHHHHHH Q ss_conf 89999987554301-------------------------03777667--656744156888766798889-999996774 Q gi|255764470|r 232 NQVARRNREQLMSP-------------------------SRLRARTA--DAVLRLLGGKPIQPQGLNVKA-DVLLPTQHE 283 (523) Q Consensus 232 n~~~~~~~~~~~~~-------------------------~~~~eRtA--~aVLRLlG~~~~~~~~~~ee~-~~li~~~~~ 283 (523) -|.-.-|---..-. .=|+|=|- |-|+-+|+ ..-. +-+.+...+ T Consensus 448 ~~~~~~~~fg~~ci~Pg~Ya~vGaAA~LsGvtrltvtlVVImFELTG~l~~IlPLm~---------av~~SkwI~d~~gk 518 (696) T KOG0475 448 AQNPDFNLFGLSCATPGAYALVGAAATLSGVTRLTVTLVVIMFELTGALNYILPLML---------AVMISKWIGDGLGK 518 (696) T ss_pred HCCCCCCCCCHHHCCCHHHHHHHHHHHHCCCCEEEEEEEEEEEECCCCCEEHHHHHH---------HHHHHHHHHHCCCC T ss_conf 217775544410147138899888998617514567888886540476000798999---------99999988732120 Q ss_pred CCCCCHHHHHHHHHHHCCCC------EEEEEEEEECCE---EEEEECC-CCHHHHHHHHHHCCCCEEEEE-CCCCCCEEE Q ss_conf 66789899999997511378------667465340002---8996089-998999999985799788887-678552788 Q gi|255764470|r 284 KHIISDQEKDMVQSVLTLAD------RPAKSIMTPRTE---IVWLDVN-CVDEDLQWKILELGHSRFPVA-QGSLDSFIG 352 (523) Q Consensus 284 ~g~l~~~E~~mi~~vL~L~e------~~V~~IMtPR~~---i~~ld~~-~s~~e~~~~i~~~~~SR~PV~-~~~~D~iiG 352 (523) +| -.+.+.+++|.-.|++ +-..++|-|+.. ...+..+ .+++++...+.++.||-|||. +++....+| T Consensus 519 ~g--Iyda~I~~ng~P~l~~k~e~~~t~~~~v~~p~~~~~~L~~i~~~s~tl~~le~~~~~t~~sgfpvvl~~~sq~lvG 596 (696) T KOG0475 519 TG--IYDAHIELNGYPFLDSKSEFSSTLAIPVMEPCRSESCLIVITQDSMTLEDLESLMEDTDFSGFPVVLSEDSQRLVG 596 (696) T ss_pred CH--HHHHHHHHCCCCCCCCCCCCCCCHHHHHHCHHCCCHHHEECCCCCEEHHHHHHHHHHCCCCCCEEEECCCCCEEEE T ss_conf 02--7888888619777653221122033443131017622212055651689998888624427852797156422678 Q ss_pred EEEHHHHHHHHHCCC--------------------------CCHHHHHH-CCCEEECCCCCHHHHHHHHHHCCCEEEEEE Q ss_conf 888888762020246--------------------------60168873-274352389998999998850386277752 Q gi|255764470|r 353 IVSARDLLRDLLEEG--------------------------SMNFKRSI-RKPLVVHENISVLKLMERLRKSSQTFVMVL 405 (523) Q Consensus 353 iv~~kDLl~~~~~~~--------------------------~~~~~~li-r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVv 405 (523) ++..+|+.....+-+ ..++++.+ -.|..|.+.++...+++.|++-...-.+ + T Consensus 597 fv~rr~l~~~i~~ar~~q~~~~~~~~~f~~~~~~~~~~~~~~~~lk~il~~tp~tv~d~tp~~~v~~~F~~lg~~~~~-v 675 (696) T KOG0475 597 FVLRRNLFLAILNARKIQSFIVTTSIYFNDPSPSAVAGIPSRLDLKDILDMTPFTVTDLTPMETVVDLFRKLGLRQIL-V 675 (696) T ss_pred EECHHHHHHHHHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCHHHHHHHHHHHCCEEEE-E T ss_conf 871578889886530056541311133678874345899987576320468760343458489999999861725899-8 Q ss_pred CCCCCEEEEEEHHHHHHH Q ss_conf 286742565259999998 Q gi|255764470|r 406 DEYGVLEGMITPANILEA 423 (523) Q Consensus 406 DE~G~~~GIVTleDIlE~ 423 (523) ++.|...|+||-+|++.. T Consensus 676 ~~~G~l~Giitkkd~l~~ 693 (696) T KOG0475 676 TKNGILLGIITKKDCLRH 693 (696) T ss_pred CCCCEEEEEEEHHHHHHH T ss_conf 159826766534898886 No 120 >cd04621 CBS_pair_8 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=97.67 E-value=6.6e-05 Score=54.86 Aligned_cols=60 Identities=28% Similarity=0.353 Sum_probs=44.3 Q ss_pred HHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHH Q ss_conf 5113786674653400028996089998999999985799788887678552788888888762 Q gi|255764470|r 298 VLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLR 361 (523) Q Consensus 298 vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~ 361 (523) .......+|+|+|+ .++.+++.++++.++.+.+.+++.+++||.++ ++++|++..+|+++ T Consensus 75 ~~~~~~~~v~diMt--~~vitv~~d~~l~~a~~~M~~~~i~~lPVvd~--~~lvGiIt~~Dilr 134 (135) T cd04621 75 YVKEVPLVAEDIMT--EEIITVSPNDDVVDAAKLMLEANISGLPVVDN--DNIVGVITKTDICR 134 (135) T ss_pred HCCCCCCCHHHHCC--CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHCC T ss_conf 01446889989388--99889989893999999999719988999989--99999998588116 No 121 >cd04626 CBS_pair_13 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.65 E-value=9e-05 Score=53.90 Aligned_cols=57 Identities=18% Similarity=0.383 Sum_probs=39.2 Q ss_pred CCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHH Q ss_conf 3786674653400028996089998999999985799788887678552788888888762 Q gi|255764470|r 301 LADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLR 361 (523) Q Consensus 301 L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~ 361 (523) ..+.+|+++|+ .++.+++.+++++++.+.+.+++.+|+||.++ ++++|+++.+|++. T Consensus 54 ~~~~~V~~iMt--~~~itv~~~~~~~~a~~~M~~~~i~~LPVvd~--~klvGiit~~Dil~ 110 (111) T cd04626 54 FLEKKVFNIVS--QDVFYVNEEDTIDEALDIMREKQIGRLPVVDD--NKLIGVVRTKDILD 110 (111) T ss_pred CCCCCHHHHCC--CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHCC T ss_conf 77888899606--99879999793999999999809988999999--99999999698148 No 122 >cd04620 CBS_pair_7 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=97.65 E-value=0.00032 Score=49.90 Aligned_cols=52 Identities=23% Similarity=0.343 Sum_probs=43.4 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCC Q ss_conf 2743523899989999988503862777522867425652599999987066 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGD 427 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGe 427 (523) |.|+++++++++.++++.|.+++..-++|+||.|...|++|-.|++..+... T Consensus 1 r~pitv~pd~tv~ea~~~M~~~~~~~~~vV~~~~~lvGIvT~~Di~r~~~~~ 52 (115) T cd04620 1 RHPLTVTPDTPVADAIALMSQQGDSSCVLVVEKGRLLGIFTERDIVRLTAIG 52 (115) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHHHCC T ss_conf 9699969919999999999983994699993799299999869999999859 No 123 >cd04633 CBS_pair_20 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.64 E-value=0.00016 Score=52.01 Aligned_cols=57 Identities=26% Similarity=0.474 Sum_probs=33.8 Q ss_pred CCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHH Q ss_conf 3786674653400028996089998999999985799788887678552788888888762 Q gi|255764470|r 301 LADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLR 361 (523) Q Consensus 301 L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~ 361 (523) +.+.+|+++|+ .++.+++.++++.++.+.+.+++++++||.++ ++++|++..+|+++ T Consensus 64 ~~~~~V~~iM~--~~~~tv~~~~~l~~~~~~m~~~~~~~lpVvd~--~~lvGIIT~~Dilr 120 (121) T cd04633 64 IRNLPVSDIMT--RPVITIEPDTSVSDVASLMLENNIGGLPVVDD--GKLVGIVTRTDILR 120 (121) T ss_pred CCCCEEECCCC--CCCEEEECCCCHHHHHHHHHHHCCEEEEEEEC--CEEEEEEEHHHHHC T ss_conf 13668553343--79889909898999999989719859999989--99999998346028 No 124 >cd04598 CBS_pair_GGDEF_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the GGDEF (DiGuanylate-Cyclase (DGC)) domain. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.63 E-value=0.00026 Score=50.50 Aligned_cols=85 Identities=21% Similarity=0.289 Sum_probs=66.0 Q ss_pred CCCEEECCCCCHHHHHHHHHHC-CCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCC-CCCCCCEEEECCCEEEEEECCC Q ss_conf 2743523899989999988503-8627775228674256525999999870664457-8886312883498599870011 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKS-SQTFVMVLDEYGVLEGMITPANILEAIAGDFPDE-DDQKLDITVGDDGSLTVDGWID 453 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~-~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE-~d~~~~i~~~~d~~~~v~G~~~ 453 (523) ||+..+++++++.++++.|+++ +.+...|+|+ |...|+||-.|+++.+.+.+.-+ +...+--.--+.+-..++..++ T Consensus 1 rpv~tv~pd~t~~ev~~~f~~~~~i~~lpVvd~-~r~vGiisr~dl~~~~~~~~g~~l~~~~pV~~~M~~~p~~v~~~~~ 79 (119) T cd04598 1 RPAPTVSPDTTVNDVLERFERDPDLSALAVVDD-GRPVGLIMREALMELLSTPYGRALYGKKPVSEVMDPDPLIVEADTP 79 (119) T ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCCEEEEEEC-CEEEEEEEHHHHHHHHCCCCCHHHHCCCCHHHHCCCCCEEECCCCC T ss_conf 998574999849999999987899676899879-9889999899999987074322330389589971689879989896 Q ss_pred HHHHHHHH Q ss_conf 99999982 Q gi|255764470|r 454 VRYASKLF 461 (523) Q Consensus 454 l~el~~~l 461 (523) ++++.+.+ T Consensus 80 i~~~~~~~ 87 (119) T cd04598 80 LEEVSRLA 87 (119) T ss_pred HHHHHHHH T ss_conf 99999999 No 125 >cd04800 CBS_pair_CAP-ED_DUF294_PBI_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with either the CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain or the PB1 (Phox and Bem1p) domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. The PB1 domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pa Probab=97.62 E-value=0.00038 Score=49.32 Aligned_cols=83 Identities=17% Similarity=0.248 Sum_probs=51.0 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHH Q ss_conf 27435238999899999885038627775228674256525999999870664457888631288349859987001199 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVR 455 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~ 455 (523) |||.++++++++.++.+.|++++.+-.+|+|+ |...|++|..|++..+..+-.+....-.++ -..+-..++...++. T Consensus 1 r~~vtv~~~~tv~ea~~~m~~~~i~~v~V~~~-~~~vGiit~~Di~~~~~~~~~~~~~~v~~i--m~~~~~tv~~~~~l~ 77 (111) T cd04800 1 RPPVTCSPDTTIREAARLMTEHRVSSLLVVDD-GRLVGIVTDRDLRNRVVAEGLDPDTPVSEV--MTAPPITIPPDATVF 77 (111) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEECHHHHHHHHCCCCCCCCHHHH--CCCCCEEEECCCCHH T ss_conf 99989789197999999999729998999989-999999995578899871679856658885--126817998899199 Q ss_pred HHHHHH Q ss_conf 999982 Q gi|255764470|r 456 YASKLF 461 (523) Q Consensus 456 el~~~l 461 (523) +..+.+ T Consensus 78 ~a~~~m 83 (111) T cd04800 78 EALLLM 83 (111) T ss_pred HHHHHH T ss_conf 999999 No 126 >cd04604 CBS_pair_KpsF_GutQ_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with KpsF/GutQ domains in the API [A5P (D-arabinose 5-phosphate) isomerase] protein. These APIs catalyze the conversion of the pentose pathway intermediate D-ribulose 5-phosphate into A5P, a precursor of 3-deoxy-D-manno-octulosonate, which is an integral carbohydrate component of various glycolipids coating the surface of the outer membrane of Gram-negative bacteria, including lipopolysaccharide and many group 2 K-antigen capsules. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other funct Probab=97.62 E-value=0.00057 Score=48.02 Aligned_cols=83 Identities=18% Similarity=0.162 Sum_probs=49.5 Q ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHHH Q ss_conf 74352389998999998850386277752286742565259999998706644578886312883498599870011999 Q gi|255764470|r 377 KPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVRY 456 (523) Q Consensus 377 ~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~e 456 (523) ..++|.+++++.++++.|++++.+.++|+|+.|...|++|-.|+...+......+...-.++- ...-+.++...++.+ T Consensus 3 ~vp~V~~~~si~ea~~~m~~~~~~~v~Vvd~~~~lvGiiT~~Dl~r~~~~~~~~~~~~v~~iM--~~~~~tv~~~~~i~~ 80 (114) T cd04604 3 ALPLVSPDTSLKDALLEMSRKGLGMTAVVDEDGRLVGIFTDGDLRRALEKGLDILTLPVADVM--TRNPKTIDPDALAAE 80 (114) T ss_pred CCCEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHHHCCCCCCCCCHHHHH--CCCCEEEECCCCHHH T ss_conf 477889949399999999976997899997999399999899999998716664567974743--567589938988699 Q ss_pred HHHHH Q ss_conf 99982 Q gi|255764470|r 457 ASKLF 461 (523) Q Consensus 457 l~~~l 461 (523) ..+.+ T Consensus 81 a~~~m 85 (114) T cd04604 81 ALELM 85 (114) T ss_pred HHHHH T ss_conf 99999 No 127 >cd04638 CBS_pair_25 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.62 E-value=0.00013 Score=52.71 Aligned_cols=57 Identities=19% Similarity=0.281 Sum_probs=38.1 Q ss_pred CCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHH Q ss_conf 7866746534000289960899989999999857997888876785527888888887620 Q gi|255764470|r 302 ADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRD 362 (523) Q Consensus 302 ~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~ 362 (523) .+.++.++|+ .++.+++.++++.++.+.+.+++..|+||.++ ++++|+++.+|++++ T Consensus 50 ~~~~v~~iMt--~~~~tv~~~~~~~~a~~~m~~~~i~~lpVvd~--~~lvGiIt~~DilkA 106 (106) T cd04638 50 EEEQLALLMT--RDPPTVSPDDDVKEAAKLMVENNIRRVPVVDD--GKLVGIVTVADIVRA 106 (106) T ss_pred CCCHHHHHHC--CCCEECCCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHCC T ss_conf 5024467535--89736799491999999998769759999999--999999997995359 No 128 >PRK07107 inositol-5-monophosphate dehydrogenase; Validated Probab=97.60 E-value=0.00013 Score=52.66 Aligned_cols=116 Identities=16% Similarity=0.138 Sum_probs=75.3 Q ss_pred CEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHHHHHHCCCEEEC Q ss_conf 86674653400028996089998999999985799788887678552788888888762020246601688732743523 Q gi|255764470|r 303 DRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNFKRSIRKPLVVH 382 (523) Q Consensus 303 e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~~lir~~~~Vp 382 (523) +++|+++|||++++++.+...+++|+.+.+.+++-.++||.+++ .++.|++..||+......+.. ..+..-| + .|- T Consensus 160 ~~~V~~vMT~~~~Lvt~~~g~sl~eA~~lL~~~kiekLpvVd~~-g~L~gLiT~kDi~k~~~~P~a-~~D~~gr-L-~Vg 235 (497) T PRK07107 160 DTKVKDFMTPFEKLVVAKVGITLKEANNIIWDHKLNTLPIIDKN-QHLVYMVFRKDYDSHKENPLE-LLDSSKR-Y-VVG 235 (497) T ss_pred CCCHHHHCCCCCCEEECCCCCCHHHHHHHHHHCCCCCEEEECCC-CCEEEEEEHHHHHHHHHCCCC-CCCCCCC-E-EEE T ss_conf 77665640676542784688999999999986344430278379-958999983788886439631-1376788-8-899 Q ss_pred CCC---CHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 899---9899999885038627775228674256525999999 Q gi|255764470|r 383 ENI---SVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 383 e~~---~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) ... ...+-.+.+-+......+|=-.+|.-.+.+..-.-+. T Consensus 236 AAIg~~d~~eRa~~Lv~aGvD~lviD~AhGhs~~v~~~ik~ik 278 (497) T PRK07107 236 AGINTRDYEERVPALVEAGADVLCIDSSDGYSEWQKRTLDYIK 278 (497) T ss_pred EECCCCCHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHH T ss_conf 9637778999999999859999980343535299999999999 No 129 >cd04801 CBS_pair_M50_like This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the metalloprotease peptidase M50. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.60 E-value=0.00057 Score=48.02 Aligned_cols=84 Identities=17% Similarity=0.188 Sum_probs=48.1 Q ss_pred CCCEEECCCCCHHHHHHHH-HHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCH Q ss_conf 2743523899989999988-503862777522867425652599999987066445788863128834985998700119 Q gi|255764470|r 376 RKPLVVHENISVLKLMERL-RKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDV 454 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~f-r~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l 454 (523) |.+.+||+++++.++.+.| .+++.+...|+|+.|...|+||..|+......+-++ ....++.........+.-.+++ T Consensus 1 Rdv~tv~~~~tl~e~~~~~~~~~~~~~~pVvd~~g~l~Givt~~dl~~~~~~~~~~--~~v~~v~~~~~~~~~v~~~~~l 78 (114) T cd04801 1 RDFPTVPAHLTLREFVREYVLGSNQRRFVVVDNEGRYVGIISLADLRAIPTSQWAQ--TTVIQVMTPAAKLVTVLSEESL 78 (114) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCCEEEEEEEHHHHHHHHHHCCCC--CCHHHHCCCCCCCEEECCCCCH T ss_conf 99558499995999999998507966899987899799999999987545402568--8477821027787589999979 Q ss_pred HHHHHHH Q ss_conf 9999982 Q gi|255764470|r 455 RYASKLF 461 (523) Q Consensus 455 ~el~~~l 461 (523) .+.-+.+ T Consensus 79 ~~al~~m 85 (114) T cd04801 79 AEVLKLL 85 (114) T ss_pred HHHHHHH T ss_conf 9999999 No 130 >pfam00571 CBS CBS domain. CBS domains are small intracellular modules that pair together to form a stable globular domain. This family represents a single CBS domain. Pairs of these domains have been termed a Bateman domain. CBS domains have been shown to bind ligands with an adenosyl group such as AMP, ATP and S-AdoMet. CBS domains are found attached to a wide range of other protein domains suggesting that CBS domains may play a regulatory role making proteins sensitive to adenosyl carrying ligands. The region containing the CBS domains in Cystathionine-beta synthase is involved in regulation by S-AdoMet. CBS domain pairs from AMPK bind AMP or ATP. The CBS domains from IMPDH and the chloride channel CLC2 bind ATP. Probab=97.60 E-value=0.00018 Score=51.69 Aligned_cols=50 Identities=26% Similarity=0.382 Sum_probs=42.3 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHC Q ss_conf 27435238999899999885038627775228674256525999999870 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIA 425 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~Iv 425 (523) +++.++++++++.++++.|++.+.+.++|+|+.|...|++|.+|+++.+. T Consensus 7 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~Vvd~~~~~~Gvit~~dl~~~~~ 56 (57) T pfam00571 7 PDVVTVPPDTSLEEALELMRENGISRLPVVDEDGKLVGIVTLRDLLRALL 56 (57) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHHC T ss_conf 99989979090999999998539957999927994999997899999866 No 131 >cd04803 CBS_pair_15 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.58 E-value=0.00065 Score=47.62 Aligned_cols=55 Identities=24% Similarity=0.345 Sum_probs=45.1 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCC Q ss_conf 2743523899989999988503862777522867425652599999987066445 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPD 430 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~D 430 (523) .|+..+++++++.++++.|++++.+-..|+|+.|...|++|..|++..+.....+ T Consensus 1 sPvvtv~~~~tl~~a~~~m~~~~i~~lpVvd~~~~lvGiit~~Dil~~~~~~~~~ 55 (122) T cd04803 1 SPVVTLSEDDSLADAEELMREHRIRHLPVVNEDGKLVGLLTQRDLLRAALSSLSD 55 (122) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHHHHCCCC T ss_conf 9919998969999999999974997899990898688896599999999843764 No 132 >cd04637 CBS_pair_24 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.57 E-value=0.00019 Score=51.58 Aligned_cols=56 Identities=29% Similarity=0.427 Sum_probs=30.4 Q ss_pred CEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHH Q ss_conf 86674653400028996089998999999985799788887678552788888888762 Q gi|255764470|r 303 DRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLR 361 (523) Q Consensus 303 e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~ 361 (523) +++|+++|+ .++.+++.+++++++.+.+.+++++++||.+++ ++++|++..+|+++ T Consensus 66 ~~~v~~iM~--~~~itv~~~~~l~~a~~~m~~~~i~~lPVvd~~-~~lvGiIT~~Diik 121 (122) T cd04637 66 NRRAHQIMT--RDPITVSPDTPVDEASKLLLENSISCLPVVDEN-GQLIGIITWKDLLK 121 (122) T ss_pred CCCHHHHCC--CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCC-CEEEEEEEHHHHHC T ss_conf 689999463--898399999849999999997493089999689-98999999688327 No 133 >cd04589 CBS_pair_CAP-ED_DUF294_assoc_bac This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the bacterial CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or Probab=97.56 E-value=0.00061 Score=47.82 Aligned_cols=51 Identities=25% Similarity=0.395 Sum_probs=30.9 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCC Q ss_conf 2743523899989999988503862777522867425652599999987066 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGD 427 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGe 427 (523) |||.++++++++.++.+.|++++.+.++|.|+.+ ..||+|-.|+...+..+ T Consensus 1 R~pvtv~~~~ti~ea~~~M~~~~i~~~vV~~~~~-~~GIvT~~Dl~~~~~~~ 51 (111) T cd04589 1 RPPLIVDASTSIRDAARLMREHGADALLVRDGDP-RLGIVTRTDLLDAVLLD 51 (111) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECCE-EEEEEEHHHHHHHHHCC T ss_conf 9998999929899999999970998899976992-79999979989999829 No 134 >cd04636 CBS_pair_23 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.55 E-value=0.00021 Score=51.23 Aligned_cols=52 Identities=17% Similarity=0.301 Sum_probs=34.9 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCC Q ss_conf 2743523899989999988503862777522867425652599999987066 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGD 427 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGe 427 (523) |.+.++++++++.++++.|++++.+-..|+|+.|...|++|..|++..+... T Consensus 1 rdvvtv~~~~tl~ea~~~m~~~~~~~lpVvd~~~~lvGiit~~Dll~~~~~~ 52 (132) T cd04636 1 KDVITVKKDDTLRDVVEILLTGKISGVPVVDNEGRVVGIVSEGDLIRKIYKG 52 (132) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHHHCC T ss_conf 9987968919799999999972997899995999399998889999998626 No 135 >cd04583 CBS_pair_ABC_OpuCA_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the ABC transporter OpuCA. OpuCA is the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment but the function of the CBS domains in OpuCA remains unknown. In the related ABC transporter, OpuA, the tandem CBS domains have been shown to function as sensors for ionic strength, whereby they control the transport activity through an electronic switching mechanism. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. They are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyz Probab=97.54 E-value=0.00062 Score=47.76 Aligned_cols=80 Identities=18% Similarity=0.246 Sum_probs=49.3 Q ss_pred HCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCH Q ss_conf 32743523899989999988503862777522867425652599999987066445788863128834985998700119 Q gi|255764470|r 375 IRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDV 454 (523) Q Consensus 375 ir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l 454 (523) +|.|..+.++.++.++++.|++++.+...|+|+.|...|++|..|+....... ..-.++ ...+-..+.-..++ T Consensus 1 ~r~pitv~~~~tv~eal~~m~~~~~~~l~Vvd~~~~lvGivt~~di~~~~~~~-----~~v~~i--m~~~~~tv~~~~~~ 73 (109) T cd04583 1 IKNPVTITPDRTLAEAIKLMRDKKVDSLLVVDKDNKLLGIVSLESLEQAYKEA-----KSLEDI--MLEDVFTVQPDASL 73 (109) T ss_pred CCCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCEEEEEEEHHHHHHCCCCC-----CEEEEC--CEEEEEEECCCCCH T ss_conf 99899979939799999999973997899995899899999810434300589-----678751--26202897489999 Q ss_pred HHHHHHH Q ss_conf 9999982 Q gi|255764470|r 455 RYASKLF 461 (523) Q Consensus 455 ~el~~~l 461 (523) .+..+.+ T Consensus 74 ~~a~~~m 80 (109) T cd04583 74 RDVLGLV 80 (109) T ss_pred HHHHHHH T ss_conf 9999999 No 136 >cd04627 CBS_pair_14 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.53 E-value=0.00084 Score=46.78 Aligned_cols=86 Identities=16% Similarity=0.196 Sum_probs=58.0 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECC-CCCEEEEEEHHHHHHHHCCCCCCCCCC----CCCEEE---ECCCEEE Q ss_conf 27435238999899999885038627775228-674256525999999870664457888----631288---3498599 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDE-YGVLEGMITPANILEAIAGDFPDEDDQ----KLDITV---GDDGSLT 447 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE-~G~~~GIVTleDIlE~IvGei~DE~d~----~~~i~~---~~d~~~~ 447 (523) +|.+.++++.++.++++.|++.+.|-..|+|+ .|.+.||+|..|+++.+.......... ...... -..+-.. T Consensus 1 ~Pfi~v~~~~sl~~a~~lm~~~~i~rlpVvd~~~g~lvGIiT~~dil~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~vit 80 (123) T cd04627 1 EPFIPVPSTASLFQAIEILGSGGIHRVAVTEEESGEVIGILSQRRLVEFLWENARSFPGLDPLYPIPLRDLTIGTSDVIS 80 (123) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEHHHHHHHHHHCCCCCCCHHHHHCCCHHHHHCCCCCCEE T ss_conf 99648489892999999997499777999958999499999979999999863765661445411200233237677179 Q ss_pred EEECCCHHHHHHHH Q ss_conf 87001199999982 Q gi|255764470|r 448 VDGWIDVRYASKLF 461 (523) Q Consensus 448 v~G~~~l~el~~~l 461 (523) +....++.+..+.+ T Consensus 81 i~~d~~l~~A~~lM 94 (123) T cd04627 81 INGDQPLIDALHLM 94 (123) T ss_pred ECCCCCHHHHHHHH T ss_conf 89999599999999 No 137 >cd04632 CBS_pair_19 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.53 E-value=0.00015 Score=52.32 Aligned_cols=59 Identities=24% Similarity=0.316 Sum_probs=33.1 Q ss_pred CCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEEC-CCCCCEEEEEEHHHHHH Q ss_conf 37866746534000289960899989999999857997888876-78552788888888762 Q gi|255764470|r 301 LADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQ-GSLDSFIGIVSARDLLR 361 (523) Q Consensus 301 L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~-~~~D~iiGiv~~kDLl~ 361 (523) +.+.+|+++|+ .++.+++.++++.++.+.+.+++.+++||.+ ++-+.++|++..+|+++ T Consensus 68 ~~~~~V~~iM~--~~vitv~~~~~i~~a~~~M~~~~i~~lpVv~~dd~~~lvGIiT~~Dilr 127 (128) T cd04632 68 MLDLPVYDAMS--SPVITASPNDSVRDAVDRMLENDDSSVVVVTPDDDTKVVGILTKKDVLR 127 (128) T ss_pred HHCCCHHHHCC--CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEECCCCEEEEEEEHHHHHC T ss_conf 40679999667--9987999929399999999977997896684789998999999699646 No 138 >cd04607 CBS_pair_NTP_transferase_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain associated with the NTP (Nucleotidyl transferase) domain downstream. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.50 E-value=0.0008 Score=46.93 Aligned_cols=82 Identities=23% Similarity=0.336 Sum_probs=51.1 Q ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHHH Q ss_conf 74352389998999998850386277752286742565259999998706644578886312883498599870011999 Q gi|255764470|r 377 KPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVRY 456 (523) Q Consensus 377 ~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~e 456 (523) .|+++++++++.++++.|.+++.+.++|+|+.|...|++|-.|+...+......+. .-.++- ..+-..++-..++.+ T Consensus 3 kpi~v~~~~tl~ea~~~m~~~~~~~~~Vvd~~~~l~GiiT~~Di~r~l~~~~~~~~-~v~~im--~~~~~~i~~~~~~~~ 79 (113) T cd04607 3 KQLLVSPDASILDALRKIDKNALRIVLVVDENGRLLGTVTDGDIRRALLKGLSLDD-PVSEVM--NRNPITAKVGSSREE 79 (113) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEECHHHHHHHHCCCCCCC-CHHHHH--CCEEEEEECCCCHHH T ss_conf 68896994989999999987599789999799959999987277668762898456-156752--121299987998999 Q ss_pred HHHHH Q ss_conf 99982 Q gi|255764470|r 457 ASKLF 461 (523) Q Consensus 457 l~~~l 461 (523) ..+.+ T Consensus 80 a~~~m 84 (113) T cd04607 80 ILALM 84 (113) T ss_pred HHHHH T ss_conf 99999 No 139 >cd04585 CBS_pair_ACT_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the acetoin utilization proteins in bacteria. Acetoin is a product of fermentative metabolism in many prokaryotic and eukaryotic microorganisms. They produce acetoin as an external carbon storage compound and then later reuse it as a carbon and energy source during their stationary phase and sporulation. In addition these CBS domains are associated with a downstream ACT domain, which is linked to a wide range of metabolic enzymes that are regulated by amino acid concentration. Pairs of ACT domains bind specifically to a particular amino acid leading to regulation of the linked enzyme. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The i Probab=97.49 E-value=0.00018 Score=51.68 Aligned_cols=57 Identities=25% Similarity=0.369 Sum_probs=26.3 Q ss_pred CCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHH Q ss_conf 786674653400028996089998999999985799788887678552788888888762 Q gi|255764470|r 302 ADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLR 361 (523) Q Consensus 302 ~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~ 361 (523) .+.+|.++|+ .++.+++.++++.++.+.+.+++.+++||.+++ ++++|++..+|+++ T Consensus 65 ~~~~v~~im~--~~~~tv~~~~~l~~a~~~m~~~~i~~lpVvd~~-g~lvGiiT~~Dilk 121 (122) T cd04585 65 SKIKVSDIMT--RDPITVSPDASVEEAAELMLERKISGLPVVDDQ-GRLVGIITESDLFR 121 (122) T ss_pred CCCCHHHHCC--CCCEEEECCCCHHHHHHHHHHCCCCEEEEEECC-CEEEEEEEHHHHHC T ss_conf 3889999577--898899388669999999987598789999589-98999998789746 No 140 >cd04592 CBS_pair_EriC_assoc_euk This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually Probab=97.49 E-value=0.0011 Score=46.02 Aligned_cols=97 Identities=11% Similarity=0.057 Sum_probs=78.1 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCC--CCH--------------------- Q ss_conf 028996089998999999985799788887678552788888888762020246--601--------------------- Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEG--SMN--------------------- 370 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~--~~~--------------------- 370 (523) ++.+.+..+.|+.|+.+.+.+.+-|..||.+++ +.++|++...|+.+.....+ ... T Consensus 1 t~~vtV~p~ttv~EA~~lM~~~~~~~~~VVD~~-~~L~GIvt~gDi~R~~~~~~~~~~~~~~~~~~~~~~~vs~v~t~~~ 79 (133) T cd04592 1 TKYIKVSPTTTLKEALNLMLDEKQSCVLVVDSD-DFLEGILTLGDIQRFLFTNKTTRVQPEDETKQTNTCLVSSVCTKGI 79 (133) T ss_pred CCEEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CCEEEEEEHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEEEEEEE T ss_conf 945998999859999999998188657998389-9789997879999998762356456320001244432334564203 Q ss_pred -HHHHHCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCE Q ss_conf -68873274352389998999998850386277752286742 Q gi|255764470|r 371 -FKRSIRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVL 411 (523) Q Consensus 371 -~~~lir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~ 411 (523) ...-.+.++.+++++++.++.+.|.+.+.+-.-||++.++. T Consensus 80 ~~~~~~~~~~tv~pd~~l~~a~~lM~~~~i~~LPVV~~~~~~ 121 (133) T cd04592 80 SYGGQECGLWTCTPDTDLTTAKKLMEAKGVKQLPVVKRGVDR 121 (133) T ss_pred EECCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCEEECCCCC T ss_conf 625511003897899999999999998698858767077666 No 141 >cd04610 CBS_pair_ParBc_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a ParBc (ParB-like nuclease) domain downstream. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.48 E-value=0.00016 Score=51.99 Aligned_cols=56 Identities=23% Similarity=0.431 Sum_probs=36.2 Q ss_pred CEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHH Q ss_conf 86674653400028996089998999999985799788887678552788888888762 Q gi|255764470|r 303 DRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLR 361 (523) Q Consensus 303 e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~ 361 (523) +.+|+++|+ .++.+++.+.++.++.+.+.++++.|+||.+++ ++++|++...|+++ T Consensus 51 ~~~v~~iM~--~~~~tv~~~~~l~~a~~~m~~~~~~~lpVvde~-g~lvGiiT~~Dilr 106 (107) T cd04610 51 DETVEEIMS--KDLVVAVPEMDIMDAARVMFRTGISKLPVVDEN-NNLVGIITNTDVIR 106 (107) T ss_pred CCCHHHHCC--CCCCCCCCCCCHHHHHHHHHHHCCCEEEEECCC-CEEEEEEEHHHHHC T ss_conf 772678605--786016999869999999998299689699239-98999999478443 No 142 >cd04584 CBS_pair_ACT_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the acetoin utilization proteins in bacteria. Acetoin is a product of fermentative metabolism in many prokaryotic and eukaryotic microorganisms. They produce acetoin as an external carbon storage compound and then later reuse it as a carbon and energy source during their stationary phase and sporulation. In addition these CBS domains are associated with a downstream ACT domain, which is linked to a wide range of metabolic enzymes that are regulated by amino acid concentration. Pairs of ACT domains bind specifically to a particular amino acid leading to regulation of the linked enzyme. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The in Probab=97.48 E-value=0.00076 Score=47.10 Aligned_cols=49 Identities=22% Similarity=0.394 Sum_probs=30.9 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHH Q ss_conf 2743523899989999988503862777522867425652599999987 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAI 424 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~I 424 (523) |++..+++++++.++++.|++++.+-..|+|+.|...|++|..|++... T Consensus 1 r~vvtv~~~~sl~~a~~lm~~~~~~~lpVvd~~~~lvGiit~~Di~~~~ 49 (121) T cd04584 1 RDVVTITPTTTIAEALELMREHKIRHLPVVDEEGRLVGIVTDRDLRDAS 49 (121) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHH T ss_conf 9988968929899999999972998899990899399996589987522 No 143 >cd04802 CBS_pair_3 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=97.48 E-value=0.00016 Score=52.15 Aligned_cols=58 Identities=24% Similarity=0.444 Sum_probs=47.3 Q ss_pred CCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHH Q ss_conf 13786674653400028996089998999999985799788887678552788888888762 Q gi|255764470|r 300 TLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLR 361 (523) Q Consensus 300 ~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~ 361 (523) +..+.+|+++|+ .++.+++.++++.++.+.+.++++.++||.++ ++++|++...|++. T Consensus 54 ~~~~~~v~~iM~--~~~~tv~~~~~l~~a~~~m~~~~~~~lpVvd~--~~lvGiIt~~Dil~ 111 (112) T cd04802 54 KPREVPVGEVMS--TPLITIDPNASLNEAAKLMAKHGIKRLPVVDD--DELVGIVTTTDIVM 111 (112) T ss_pred CCCCCCHHHHHC--CCCEEECCCCHHHHHHHHHHHHCCCEEEEEEC--CEEEEEEECHHHCC T ss_conf 812389999723--48559947986999999878719858999989--99999998067105 No 144 >cd04601 CBS_pair_IMPDH This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the inosine 5' monophosphate dehydrogenase (IMPDH) protein. IMPDH is an essential enzyme that catalyzes the first step unique to GTP synthesis, playing a key role in the regulation of cell proliferation and differentiation. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in IMPDH have been associated with retinitis pigmentosa. Probab=97.47 E-value=0.0012 Score=45.68 Aligned_cols=46 Identities=26% Similarity=0.452 Sum_probs=28.5 Q ss_pred HCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHH Q ss_conf 3274352389998999998850386277752286742565259999 Q gi|255764470|r 375 IRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANI 420 (523) Q Consensus 375 ir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDI 420 (523) +|.|..+++++++.++++.|.+.+.+-..|+|+.|...|++|-.|+ T Consensus 1 irdvvtv~~~~sl~~a~~~m~~~~~~~~pVvd~~~~l~Giit~~Dl 46 (110) T cd04601 1 IRDPITVSPDATVAEALELMAEYGISGLPVVDDDGKLVGIVTNRDL 46 (110) T ss_pred CCCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCEEEEEEEHHHH T ss_conf 9889897993969999999998499889999489969878877677 No 145 >cd04588 CBS_pair_CAP-ED_DUF294_assoc_arch This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the archaeal CAP_ED (cAMP receptor protein effector domain) family of transcription factors and the DUF294 domain. Members of CAP_ED, include CAP which binds cAMP, FNR (fumarate and nitrate reductase) which uses an iron-sulfur cluster to sense oxygen, and CooA a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. DUF294 is a putative nucleotidyltransferase with a conserved DxD motif. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. Probab=97.46 E-value=0.00093 Score=46.47 Aligned_cols=81 Identities=20% Similarity=0.232 Sum_probs=48.0 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHH Q ss_conf 27435238999899999885038627775228674256525999999870664457888631288349859987001199 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVR 455 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~ 455 (523) |++..+++++++.++++.|++++.+-+.|+|+ |...|++|..|++..+.....+. .-.++ -..+-+.++...++. T Consensus 1 r~vvtv~p~~sl~ea~~~m~~~~i~~~pVvd~-~~lvGiit~~Di~~~~~~~~~~~--~V~~~--M~~~~~tv~~d~~l~ 75 (110) T cd04588 1 KPLITLNPNATLREAARLFNTHHIHGAPVVDD-GKLVGIVTLSDIAHAIARGLELA--KVKDV--MTKDVITIDEDEQLY 75 (110) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEEC-CEEEEEEEHHHHHHHHHCCCCCC--HHHHH--CCCCCCEECCCCCHH T ss_conf 99479689498999999999709988999989-99999996689899987379778--03452--048972985899699 Q ss_pred HHHHHH Q ss_conf 999982 Q gi|255764470|r 456 YASKLF 461 (523) Q Consensus 456 el~~~l 461 (523) +..+.+ T Consensus 76 ~~~~~m 81 (110) T cd04588 76 DAIRLM 81 (110) T ss_pred HHHHHH T ss_conf 999999 No 146 >cd04631 CBS_pair_18 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.44 E-value=0.0012 Score=45.69 Aligned_cols=52 Identities=23% Similarity=0.397 Sum_probs=32.2 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECC-CCCEEEEEEHHHHHHHHCCC Q ss_conf 27435238999899999885038627775228-67425652599999987066 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDE-YGVLEGMITPANILEAIAGD 427 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE-~G~~~GIVTleDIlE~IvGe 427 (523) |++.++++++++.++++.|++++.+-..|+|+ .|...|++|..|++..+.+. T Consensus 1 r~vvtv~~~~tl~ea~~~m~~~~v~~~PVvd~~~~~lvGiit~~Di~~~~~~~ 53 (125) T cd04631 1 KDVVTVPPTTPIMEAAKIMVRNGFRRLPVVDEGTGKLVGIITATDILKYLGGG 53 (125) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEHHHHHHHHHCC T ss_conf 99779799094999999999839988999979999599999999999987335 No 147 >cd04634 CBS_pair_21 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.44 E-value=0.00029 Score=50.20 Aligned_cols=57 Identities=21% Similarity=0.375 Sum_probs=34.7 Q ss_pred CCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHH Q ss_conf 1378667465340002899608999899999998579978888767855278888888876 Q gi|255764470|r 300 TLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLL 360 (523) Q Consensus 300 ~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl 360 (523) +...++|+++|+ .++.+++.+++++++.+.+.+.+..++||.++ ++++|++..+|++ T Consensus 85 ~~~~~~v~diM~--~~~itv~~~~~l~~a~~~m~~~~i~~lPVvd~--~~lvGIItr~Dil 141 (143) T cd04634 85 DAGKMKVRDIMT--KKVITISPDASIEDAAELMVRHKIKRLPVVED--GRLVGIVTRGDII 141 (143) T ss_pred HHCCCCHHHHCC--CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHCC T ss_conf 322789899588--89879989896999999999739888999989--9999999957730 No 148 >cd04639 CBS_pair_26 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.43 E-value=0.00035 Score=49.58 Aligned_cols=49 Identities=20% Similarity=0.223 Sum_probs=28.2 Q ss_pred CEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCC Q ss_conf 4352389998999998850386277752286742565259999998706 Q gi|255764470|r 378 PLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAG 426 (523) Q Consensus 378 ~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvG 426 (523) ..+++++.++.++++.|.+++.+-..|+|+.|...|++|..|++..+.+ T Consensus 3 ~~Tv~p~~tl~~a~~~~~~~~~~~~PVvd~~g~lvGivt~~di~~~l~~ 51 (111) T cd04639 3 FETLSPADTLDDAADALLATTQHEFPVVDGDGHLVGLLTRDDLIRALAE 51 (111) T ss_pred CEEECCCCCHHHHHHHHHHHCCCEEEEEECCCCEEEEEEHHHHHHHHHC T ss_conf 8792998979999999997199789999389988999989999999872 No 149 >pfam01595 DUF21 Domain of unknown function DUF21. This transmembrane region has no known function. Many of the sequences in this family are annotated as hemolysins, however this is due to a similarity to hemolysin C from Brachyspira hyodysenteriae, which does not contain this domain. This domain is found in the N-terminus of the proteins adjacent to two intracellular CBS domains pfam00571. Probab=97.43 E-value=0.014 Score=37.86 Aligned_cols=76 Identities=20% Similarity=0.292 Sum_probs=53.5 Q ss_pred HCCCCCCHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 457337027-----889989999999998999998755430103777667656744156888766798889999996774 Q gi|255764470|r 209 LHFDIPKGY-----LYASIGFSGIIEFFNQVARRNREQLMSPSRLRARTADAVLRLLGGKPIQPQGLNVKADVLLPTQHE 283 (523) Q Consensus 209 ~~~~ipk~y-----~y~~~~fs~~ve~~n~~~~~~~~~~~~~~~~~eRtA~aVLRLlG~~~~~~~~~~ee~~~li~~~~~ 283 (523) ||--+||.+ ..++..++-.+..+..+ ..+...+-...++.++|++|.++.++..++||++.+++.+++ T Consensus 102 ~gEi~PK~ia~~~~~~~a~~~a~~~~~~~~l-------~~Pl~~~~~~i~~~i~~~~g~~~~~~~~s~eEl~~lv~~~~~ 174 (182) T pfam01595 102 FGEILPKTLARRNPERIALRVAPPLRVLMKL-------LYPLVWLLSKLSNLLLRLFGVKPEEPAVTEEELRALVDEGEE 174 (182) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 9958799999988999999999999999999-------999999999999999986299988899999999999999998 Q ss_pred CCCCCHHH Q ss_conf 66789899 Q gi|255764470|r 284 KHIISDQE 291 (523) Q Consensus 284 ~g~l~~~E 291 (523) +|.++++| T Consensus 175 ~G~i~~~E 182 (182) T pfam01595 175 EGVIEEEE 182 (182) T ss_pred CCCCCCCC T ss_conf 79989899 No 150 >cd04611 CBS_pair_PAS_GGDEF_DUF1_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with a PAS domain, a GGDEF (DiGuanylate-Cyclase (DGC) domain, and a DUF1 domain downstream. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CB Probab=97.43 E-value=0.00023 Score=50.84 Aligned_cols=56 Identities=20% Similarity=0.348 Sum_probs=38.3 Q ss_pred CEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHH Q ss_conf 86674653400028996089998999999985799788887678552788888888762 Q gi|255764470|r 303 DRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLR 361 (523) Q Consensus 303 e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~ 361 (523) +.+|+++|+ .++.+++.++++.++.+.+.+++.+|+||.+++ ++++|++..+|+++ T Consensus 55 ~~~v~~iM~--~~~~tv~~~~~~~~a~~~m~~~~i~~lpVvd~~-~~~vGiiT~~Dilr 110 (111) T cd04611 55 QTPVGEVMS--SPLLTVPADTSLYDARQLMREHGIRHLVVVDDD-GELLGLLSQTDLLQ 110 (111) T ss_pred CCCHHHHHC--CCCEEEECCCCHHHHHHHHHHHCCEEEEEECCC-CEEEEEEEHHHHHC T ss_conf 788899710--683399789999999999997187399999469-98999998568427 No 151 >cd04596 CBS_pair_DRTGG_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a DRTGG domain upstream. The function of the DRTGG domain, named after its conserved residues, is unknown. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.42 E-value=0.00027 Score=50.44 Aligned_cols=47 Identities=15% Similarity=0.174 Sum_probs=28.6 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 27435238999899999885038627775228674256525999999 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) ..+.|+.++.++.++++.|++++.+-..|+|+.|...|++|..|+++ T Consensus 2 ~dv~~l~p~~tv~~a~~lm~~~~~~~~PVvd~~~~lvGivt~~Di~~ 48 (108) T cd04596 2 EDTGYLTTTDTVKDWHELNKETGHSRFPVVDEKNKVVGIVTSKDVAG 48 (108) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCEEEEEEEHHHHHH T ss_conf 98889899997999999999859988999968990999999799751 No 152 >cd04608 CBS_pair_PALP_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the pyridoxal-phosphate (PALP) dependent enzyme domain upstream. The vitamin B6 complex comprises pyridoxine, pyridoxal, and pyridoxamine, as well as the 5'-phosphate esters of pyridoxal (PALP) and pyridoxamine, the last two being the biologically active coenzyme derivatives. The members of the PALP family are principally involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and other amine-containing compounds. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a poten Probab=97.41 E-value=0.0007 Score=47.38 Aligned_cols=84 Identities=19% Similarity=0.236 Sum_probs=60.8 Q ss_pred HCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCH Q ss_conf 32743523899989999988503862777522867425652599999987066445788863128834985998700119 Q gi|255764470|r 375 IRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDV 454 (523) Q Consensus 375 ir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l 454 (523) +.||..+.++.++.++++.|++++..-..|+||.|...|+||..|++..++..-....+.-... ....--.++...++ T Consensus 1 l~~~vtv~p~~tv~~a~~~m~~~~i~~lPVvd~~g~~~G~vt~~~ll~~~~~~~~~~~~~V~~v--m~~~~~~v~~~~~l 78 (124) T cd04608 1 LKAPVTVLPTVTCAEAIEILKEKGFDQLPVVDESGKILGMVTLGNLLSSLSSGKVQPSDPVSKA--LYKQFKRVNKNDTL 78 (124) T ss_pred CCCCEEECCCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEEHHHHHHHHHCCCCCCCCCHHHH--HCCCCCCCCCCCCH T ss_conf 9999798998909999999998499856898799968888879999999973899999828997--44656514799855 Q ss_pred HHHHHH Q ss_conf 999998 Q gi|255764470|r 455 RYASKL 460 (523) Q Consensus 455 ~el~~~ 460 (523) ..+.+. T Consensus 79 ~~ls~~ 84 (124) T cd04608 79 GKLSRI 84 (124) T ss_pred HHHHCC T ss_conf 673103 No 153 >cd04635 CBS_pair_22 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.40 E-value=0.00027 Score=50.37 Aligned_cols=48 Identities=29% Similarity=0.380 Sum_probs=36.1 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 274352389998999998850386277752286742565259999998 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) |.|+++.++.++.++++.|.+++..-..|+|+.|...|+||..|++.. T Consensus 1 r~vitv~~d~~l~~a~~~m~~~~i~~lPVVd~~g~lvGiit~~Dll~~ 48 (122) T cd04635 1 REPVTCTPDDPVSKVWDLMLESGFTGLPVVQKAGELIGIITRRDIIRA 48 (122) T ss_pred CCCEEECCCCCHHHHHHHHHHHCCCEEEEEECCCCEEEEEEHHHHHHH T ss_conf 999996892979999999997199489999189829999998998765 No 154 >cd04609 CBS_pair_PALP_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the pyridoxal-phosphate (PALP) dependent enzyme domain upstream. The vitamin B6 complex comprises pyridoxine, pyridoxal, and pyridoxamine, as well as the 5'-phosphate esters of pyridoxal (PALP) and pyridoxamine, the last two being the biologically active coenzyme derivatives. The members of the PALP family are principally involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and other amine-containing compounds. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a pote Probab=97.39 E-value=0.0024 Score=43.50 Aligned_cols=83 Identities=20% Similarity=0.230 Sum_probs=60.8 Q ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHHH Q ss_conf 74352389998999998850386277752286742565259999998706644578886312883498599870011999 Q gi|255764470|r 377 KPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVRY 456 (523) Q Consensus 377 ~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~e 456 (523) .+.++.++.++.++++.|++++.+-..|+|+ |...|++|..|++..+...-......-.++ ....-..++-..++++ T Consensus 2 dvvtv~p~~tl~~a~~~m~~~~i~~lpVvd~-~~lvGiit~~Di~~~~~~~~~~~~~~v~~i--M~~~~~~v~~~~~l~~ 78 (110) T cd04609 2 DVVSVAPDDTVSQAIERMREYGVSQLPVVDD-GRVVGSIDESDLLDALIEGKAKFSLPVREV--MGEPLPTVDPDAPIEE 78 (110) T ss_pred CEEEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHHHHHCCCCCCCCEEEEE--EECCCEEECCCCCHHH T ss_conf 8069999497999999999819988999989-999999999999999863876468769845--6759459789894999 Q ss_pred HHHHHC Q ss_conf 999828 Q gi|255764470|r 457 ASKLFG 462 (523) Q Consensus 457 l~~~l~ 462 (523) +.+.+. T Consensus 79 ~~~~~~ 84 (110) T cd04609 79 LSELLD 84 (110) T ss_pred HHHHHH T ss_conf 999887 No 155 >cd04612 CBS_pair_SpoIVFB_EriC_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with either the SpoIVFB domain (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) or the chloride channel protein EriC. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A ), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. EriC is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase an Probab=97.38 E-value=0.0011 Score=45.91 Aligned_cols=46 Identities=22% Similarity=0.268 Sum_probs=24.6 Q ss_pred CEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHH Q ss_conf 43523899989999988503862777522867425652599999987 Q gi|255764470|r 378 PLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAI 424 (523) Q Consensus 378 ~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~I 424 (523) |.++++++++.++++.|++++..-..|+|+ |...|+||..|++... T Consensus 3 vvtv~p~~tl~~a~~~m~~~~~~~lpVvd~-~~l~Givt~~di~~~~ 48 (111) T cd04612 3 VVTVPVDLTVDEVLALMFGERHRGYPVVDD-GRLVGIVTLADIRRVP 48 (111) T ss_pred CEEECCCCCHHHHHHHHHHCCCCEEEEEEC-CEEEEEEEHHHHHHHH T ss_conf 499999295999999999719948999999-9999999899999877 No 156 >cd04643 CBS_pair_30 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.38 E-value=0.0014 Score=45.25 Aligned_cols=51 Identities=24% Similarity=0.262 Sum_probs=46.4 Q ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCC Q ss_conf 743523899989999988503862777522867425652599999987066 Q gi|255764470|r 377 KPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGD 427 (523) Q Consensus 377 ~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGe 427 (523) +..++.++.++.++++.|++++.+-..|+|+.|...|++|..|++..+.+. T Consensus 2 ~Vvtv~~~~tl~~A~~~m~~~~i~~lPVvd~~gklvGiit~~Di~~~~~~~ 52 (116) T cd04643 2 EVAYVQDTNTLRHALLVLTKHGYSAIPVLDKEGKYVGTISLTDILWKLKGL 52 (116) T ss_pred CEEEECCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHHHCC T ss_conf 769999949299999999974998798986999499998899999987436 No 157 >pfam00571 CBS CBS domain. CBS domains are small intracellular modules that pair together to form a stable globular domain. This family represents a single CBS domain. Pairs of these domains have been termed a Bateman domain. CBS domains have been shown to bind ligands with an adenosyl group such as AMP, ATP and S-AdoMet. CBS domains are found attached to a wide range of other protein domains suggesting that CBS domains may play a regulatory role making proteins sensitive to adenosyl carrying ligands. The region containing the CBS domains in Cystathionine-beta synthase is involved in regulation by S-AdoMet. CBS domain pairs from AMPK bind AMP or ATP. The CBS domains from IMPDH and the chloride channel CLC2 bind ATP. Probab=97.37 E-value=0.00053 Score=48.23 Aligned_cols=55 Identities=40% Similarity=0.560 Sum_probs=50.2 Q ss_pred EEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHH Q ss_conf 7465340002899608999899999998579978888767855278888888876202 Q gi|255764470|r 306 AKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDL 363 (523) Q Consensus 306 V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~ 363 (523) +.++|++ +..+++.++++.++.+.+.+.+++++||++++ ++++|+++.+|++... T Consensus 1 v~~im~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~Vvd~~-~~~~Gvit~~dl~~~~ 55 (57) T pfam00571 1 VKDIMTP--DVVTVPPDTSLEEALELMRENGISRLPVVDED-GKLVGIVTLRDLLRAL 55 (57) T ss_pred CCCCCCC--CCEEECCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHHH T ss_conf 9422779--99899790909999999985399579999279-9499999789999986 No 158 >cd04641 CBS_pair_28 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=97.37 E-value=0.0014 Score=45.12 Aligned_cols=56 Identities=7% Similarity=0.192 Sum_probs=49.5 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCC Q ss_conf 27435238999899999885038627775228674256525999999870664457 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDE 431 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE 431 (523) |++.++.+++++.++++.|.+++..-..|+|+.|...|++|-.|++..+.++..+. T Consensus 1 r~Vitv~pd~~l~da~~lm~~~~is~lPVVD~~g~lvGiis~~Dl~~~~~~~~~~~ 56 (120) T cd04641 1 KNIATARPDTPLIDVLDMLVERRVSALPIVDENGKVVDVYSRFDVINLAKEGAYNN 56 (120) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHHHCCCCCC T ss_conf 99789899899999999999809866999878996989975999998875155443 No 159 >cd04605 CBS_pair_MET2_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the MET2 domain. Met2 is a key enzyme in the biosynthesis of methionine. It encodes a homoserine transacetylase involved in converting homoserine to O-acetyl homoserine. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.34 E-value=0.00062 Score=47.75 Aligned_cols=51 Identities=25% Similarity=0.362 Sum_probs=44.3 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCC Q ss_conf 274352389998999998850386277752286742565259999998706 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAG 426 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvG 426 (523) |++.++.+++++.++++.|++++.+...|+|+.|...|++|..|+...+.- T Consensus 2 r~vvtv~~~~tv~~a~~~m~~~~~~~lpVvd~~~~~vGiit~~Di~~~~~~ 52 (110) T cd04605 2 RPVVTISEDASIKEAAKLMIEENINHLPVVDEDGRLVGIVTSWDISKAVAR 52 (110) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEECHHHHHHHHC T ss_conf 899898993999999999997299889999799919999962278889870 No 160 >smart00116 CBS Domain in cystathionine beta-synthase and other proteins. Domain present in all 3 forms of cellular life. Present in two copies in inosine monophosphate dehydrogenase, of which one is disordered in the crystal structure [3]. A number of disease states are associated with CBS-containing proteins including homocystinuria, Becker's and Thomsen disease. Probab=97.32 E-value=0.00075 Score=47.16 Aligned_cols=47 Identities=28% Similarity=0.422 Sum_probs=30.7 Q ss_pred CEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHH Q ss_conf 43523899989999988503862777522867425652599999987 Q gi|255764470|r 378 PLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAI 424 (523) Q Consensus 378 ~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~I 424 (523) +.++++++++.++++.|++++.+-..|+||.|...|++|..|++..+ T Consensus 2 ~itv~~~~~v~~a~~~m~~~~i~~lPVVd~~~~lvGiit~~Dil~~l 48 (49) T smart00116 2 VVTVSPDTTLEEALELLREHGIRRLPVVDEEGRLVGIVTRRDIIKAL 48 (49) T ss_pred CEEECCCCCHHHHHHHHHHHCCCEEEEECCCCCEEEEEEHHHHHHHH T ss_conf 86999879299999999980998576998999199998879999864 No 161 >cd04582 CBS_pair_ABC_OpuCA_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the ABC transporter OpuCA. OpuCA is the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment but the function of the CBS domains in OpuCA remains unknown. In the related ABC transporter, OpuA, the tandem CBS domains have been shown to function as sensors for ionic strength, whereby they control the transport activity through an electronic switching mechanism. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. They are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzi Probab=97.30 E-value=0.0012 Score=45.67 Aligned_cols=53 Identities=19% Similarity=0.239 Sum_probs=45.5 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCC Q ss_conf 27435238999899999885038627775228674256525999999870664 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDF 428 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei 428 (523) .+|+.+.+++++.++++.|++++.+-.+|+|+.|...|+||..|+.+..-+.. T Consensus 1 t~pvtv~pd~~l~eA~~lm~~~~~~~l~VVd~~g~l~Givt~~Dl~~~~~~~v 53 (106) T cd04582 1 TEPITVRPDDPLSDALGLMDDSDLRALTVVDADGQPLGFVTRREAARASGGCC 53 (106) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHHCCCCH T ss_conf 99838999893999999999739987999979997899989999876308953 No 162 >cd04618 CBS_pair_5 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=97.29 E-value=0.0036 Score=42.18 Aligned_cols=95 Identities=20% Similarity=0.234 Sum_probs=83.9 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHHHHHHCCCEEECCCCCHHHHHHH Q ss_conf 02899608999899999998579978888767855278888888876202024660168873274352389998999998 Q gi|255764470|r 314 TEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNFKRSIRKPLVVHENISVLKLMER 393 (523) Q Consensus 314 ~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~~lir~~~~Vpe~~~l~~lL~~ 393 (523) .+++.+|.+.++.++...+.+.+..--|+|+.+..+.+|++.+-|...... ...+.+..++.++.+. T Consensus 1 skliV~D~~l~vk~Af~al~~ngi~~aplWds~~~~fvGmLt~~DfI~il~-------------lv~i~P~~sL~da~~~ 67 (98) T cd04618 1 SKLVVFDTKLPVKKAFNALVENGIRSAPLWDSRKQQFVGMLTITDFILILR-------------LVSIHPERSLFDAALL 67 (98) T ss_pred CEEEEEECCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEEHHHHHHHHH-------------HEECCCCCHHHHHHHH T ss_conf 909999489758999999998692699987288767888864799998876-------------2444876129999999 Q ss_pred HHHCCCEEEEEECCC-CCEEEEEEHHHHH Q ss_conf 850386277752286-7425652599999 Q gi|255764470|r 394 LRKSSQTFVMVLDEY-GVLEGMITPANIL 421 (523) Q Consensus 394 fr~~~~~lAiVvDE~-G~~~GIVTleDIl 421 (523) |.+++.|-..|+|.. |...-++|...++ T Consensus 68 l~~~~ihrlPvid~~~~~~~~ilt~~~il 96 (98) T cd04618 68 LLKNKIHRLPVIDPSTGTGLYILTSRRIL 96 (98) T ss_pred HHHCCCCCCCCCCCCCCCEEEEEEEEHEE T ss_conf 98678750521647999658999630201 No 163 >cd04599 CBS_pair_GGDEF_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the GGDEF (DiGuanylate-Cyclase (DGC)) domain. The GGDEF domain has been suggested to be homologous to the adenylyl cyclase catalytic domain and is thought to be involved in regulating cell surface adhesiveness in bacteria. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.29 E-value=0.0004 Score=49.17 Aligned_cols=57 Identities=19% Similarity=0.256 Sum_probs=45.5 Q ss_pred CCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHH Q ss_conf 3786674653400028996089998999999985799788887678552788888888762 Q gi|255764470|r 301 LADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLR 361 (523) Q Consensus 301 L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~ 361 (523) ..+++|.++|+ .++.+++.++++.++.+.+.+++..|+||.++ ++++|++...|+.+ T Consensus 48 ~~~~~v~~iMt--~~~itv~~~~~l~~a~~~M~~~~i~~lpVvd~--g~lvGiiT~~Divr 104 (105) T cd04599 48 HPNRLVADAMT--REVVTISPEASLLEAKRLMEEKKIERLPVLRE--RKLVGIITKGTIAL 104 (105) T ss_pred CCCCCHHHHCC--CCCEEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEECHHHCC T ss_conf 88879688502--79899979491999999779859979999989--99999999458615 No 164 >TIGR01137 cysta_beta cystathionine beta-synthase; InterPro: IPR005857 This model discriminates cystathionine beta-synthase, a protein found primarily in eukaryotes and carrying a C-terminal CBS domain from cysteine synthases. Cysteine synthase (O-acetylserine (thiol)-lyase) is the enzyme responsible for the formation of cysteine from O-acetyl-serine and hydrogen sulphide with the concomitant release of acetic acid - the function of many of these enzymes is unproven.; GO: 0004122 cystathionine beta-synthase activity, 0019343 cysteine biosynthetic process via cystathionine, 0005737 cytoplasm. Probab=97.28 E-value=0.0055 Score=40.85 Aligned_cols=121 Identities=21% Similarity=0.312 Sum_probs=94.8 Q ss_pred CCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEEC---CCC----CCEEEEEEHHHHHHHHHCCCC---- Q ss_conf 137866746534000289960899989999999857997888876---785----527888888887620202466---- Q gi|255764470|r 300 TLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQ---GSL----DSFIGIVSARDLLRDLLEEGS---- 368 (523) Q Consensus 300 ~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~---~~~----D~iiGiv~~kDLl~~~~~~~~---- 368 (523) .+...+|++.-. ..++.+..+.++.++++.+++.|++.+||.+ +.+ .++.|.|..+.+|..+..++. T Consensus 362 ~~~~~~v~~l~L--~~~v~~~~t~tv~~ai~iLre~GfdQ~PVv~pga~~P~veag~v~G~v~l~~lL~~l~~~~a~~~D 439 (527) T TIGR01137 362 VFGNARVKDLHL--PALVTVHPTETVGDAIEILREYGFDQLPVVKPGAAEPDVEAGKVLGSVTLRELLSALFAKKAKLED 439 (527) T ss_pred HHCCCEEECCCC--CCCEEECCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHHHHHCCCCCCC T ss_conf 425636415668--873454676408999999997487307620778888972353478874267789999842467554 Q ss_pred ------CHHHHHH---------------CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCC------------EEEEE Q ss_conf ------0168873---------------27435238999899999885038627775228674------------25652 Q gi|255764470|r 369 ------MNFKRSI---------------RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGV------------LEGMI 415 (523) Q Consensus 369 ------~~~~~li---------------r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~------------~~GIV 415 (523) .+.+.+- .+-.-+.++.++.++-+-|++. +.|+|++|.-. +.|+| T Consensus 440 ~v~Gkyldfk~~~~Fndvssynenk~~~~~~~~i~~~~~l~~L~~~le~~--~~a~V~~e~~pyhstGkssqrq~~~gvV 517 (527) T TIGR01137 440 AVSGKYLDFKVMSRFNDVSSYNENKSKKKKFIQIGEGEKLADLSKFLEKN--SSAIVVEEGKPYHSTGKSSQRQMVIGVV 517 (527) T ss_pred CCCCCCHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHCCC--CCEEEEECCCCCCCCCCCCCCEEEEEEE T ss_conf 43430003444532101000101024676765446775189999985169--6189970782321256542432789864 Q ss_pred EHHHHHHHH Q ss_conf 599999987 Q gi|255764470|r 416 TPANILEAI 424 (523) Q Consensus 416 TleDIlE~I 424 (523) |--|||.-+ T Consensus 518 T~~DLL~fL 526 (527) T TIGR01137 518 TKIDLLSFL 526 (527) T ss_pred EHHHHHHHC T ss_conf 412244312 No 165 >TIGR00393 kpsF sugar isomerase, KpsF/GutQ family; InterPro: IPR004800 This is a family of closely related proteins with the phosphosugar-binding domain SIS (Sugar ISomerase) followed by two copies of the CBS (named after Cystathionine Beta Synthase) domain. The group includes GutQ, a protein of the glucitol operon and KpsF, a virulence factor involved in capsular polysialic acid biosynthesis in some pathogenic strains of Escherichia coli.; GO: 0016853 isomerase activity, 0005975 carbohydrate metabolic process. Probab=97.24 E-value=0.00037 Score=49.39 Aligned_cols=166 Identities=20% Similarity=0.194 Sum_probs=113.2 Q ss_pred HHHHHHHHHHHHCCCCCCCCCC---HHHHH------HHHHHHHHCCCCCHHHHHHHHHHHCCCC---EEEEEEEEECCEE Q ss_conf 7766765674415688876679---88899------9999677466789899999997511378---6674653400028 Q gi|255764470|r 249 RARTADAVLRLLGGKPIQPQGL---NVKAD------VLLPTQHEKHIISDQEKDMVQSVLTLAD---RPAKSIMTPRTEI 316 (523) Q Consensus 249 ~eRtA~aVLRLlG~~~~~~~~~---~ee~~------~li~~~~~~g~l~~~E~~mi~~vL~L~e---~~V~~IMtPR~~i 316 (523) =-|.|+.+|-+-=.|..=+.+. ..... ++.----+.-.|++|.-..-+=.=.|+. .+|+|+|--++++ T Consensus 88 La~~Ad~~L~i~v~kEaCP~~lAPTsSt~~TL~lGDaLa~al~~ArnF~~eDFA~~HPGG~LG~kLL~kV~dlm~t~d~l 167 (272) T TIGR00393 88 LARAADYVLDIKVEKEACPLNLAPTSSTTVTLALGDALAVALMKARNFSEEDFAKFHPGGALGRKLLVKVKDLMQTDDEL 167 (272) T ss_pred HHHCCCEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCC T ss_conf 45306545766775302789897356899999988999999985258884244114870466678888767764066567 Q ss_pred EEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHH-HHCC-C--CCHHHHH-HCCCEEEC-CCCCHHHH Q ss_conf 9960899989999999857997888876785527888888887620-2024-6--6016887-32743523-89998999 Q gi|255764470|r 317 VWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRD-LLEE-G--SMNFKRS-IRKPLVVH-ENISVLKL 390 (523) Q Consensus 317 ~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~-~~~~-~--~~~~~~l-ir~~~~Vp-e~~~l~~l 390 (523) -.+-..+++.+++-.+.+.+-----||+++ .++.|++.=-|+=+. ...+ . +...++. .-.|..+. .+.-+.++ T Consensus 168 P~v~~tas~~DAL~e~~~~~LG~~~v~~~~-~~~~Gv~tDGD~RR~l~~~g~~~l~~~v~~~mT~~p~~~~n~~~~l~~A 246 (272) T TIGR00393 168 PLVAPTASFKDALLEMSRKRLGLAVVCDEN-EQLKGVFTDGDLRRVLALLGGGALKKEVKDFMTLGPKTLKNSDELLVEA 246 (272) T ss_pred CCEECCCCCEEEEEEEECCCCCEEEEEECC-CCEEEEEECCHHHHHHHHHCCHHCCCCHHHHCCCCCEEEECHHHHHHHH T ss_conf 822236772021023313786179997155-5246787146599999881660002312221068881340145689999 Q ss_pred HHHHHHCCCEEEEEECCCCCEEEEE Q ss_conf 9988503862777522867425652 Q gi|255764470|r 391 MERLRKSSQTFVMVLDEYGVLEGMI 415 (523) Q Consensus 391 L~~fr~~~~~lAiVvDE~G~~~GIV 415 (523) ++-|++++.|.++||||.|.+.|++ T Consensus 247 ~~~l~~~kI~~~~vvdd~nkl~G~~ 271 (272) T TIGR00393 247 LEFLKKRKITSLVVVDDENKLLGVL 271 (272) T ss_pred HHHHHHCCCEEEEEECCCCEEEEEE T ss_conf 9998727942899983897178753 No 166 >cd04606 CBS_pair_Mg_transporter This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain in the magnesium transporter, MgtE. MgtE and its homologs are found in eubacteria, archaebacteria, and eukaryota. Members of this family transport Mg2+ or other divalent cations into the cell via two highly conserved aspartates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.22 E-value=0.00076 Score=47.09 Aligned_cols=59 Identities=19% Similarity=0.360 Sum_probs=49.3 Q ss_pred CCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHH Q ss_conf 37866746534000289960899989999999857997888876785527888888887620 Q gi|255764470|r 301 LADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRD 362 (523) Q Consensus 301 L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~ 362 (523) -.+.+++++|+ .+..+++.+++.+++.+.+.+++.+++||.+++ ++++|++...|++.. T Consensus 50 ~~~~~v~~iM~--~~~~~v~~~~~~~~a~~~m~~~~~~~lPVVd~~-~~lvGiIt~~Di~~~ 108 (109) T cd04606 50 DPDTPVSDIMD--TDVISVSADDDQEEVARLFEKYDLLALPVVDEE-GRLVGIITVDDVIDV 108 (109) T ss_pred CCCCCHHHHHC--CCEEECCCCCCHHHHHHHHHHHCCCEEEEECCC-CEEEEEEEHHHHHHH T ss_conf 88871666300--213633799989999999998527604689889-979999996896844 No 167 >cd04597 CBS_pair_DRTGG_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a DRTGG domain upstream. The function of the DRTGG domain, named after its conserved residues, is unknown. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.21 E-value=0.00064 Score=47.67 Aligned_cols=49 Identities=22% Similarity=0.331 Sum_probs=30.9 Q ss_pred HHCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH Q ss_conf 7327435238999899999885038627775228674256525999999 Q gi|255764470|r 374 SIRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILE 422 (523) Q Consensus 374 lir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE 422 (523) ..+++..+++++++.++++.|++.+.+-..|+|+.|...|+||..||-| T Consensus 64 m~~~~vti~~~~sl~eA~~lM~e~~i~~LPVVD~~g~l~GiIT~~DIAe 112 (113) T cd04597 64 INRKPVTARPNDPLREALNLMHEHNIRTLPVVDDDGTPAGIITLLDLAE 112 (113) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHC T ss_conf 1378834199982999999999849987869889993999987787435 No 168 >cd02205 CBS_pair The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic generali Probab=97.21 E-value=0.0054 Score=40.87 Aligned_cols=84 Identities=21% Similarity=0.205 Sum_probs=63.1 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHH Q ss_conf 27435238999899999885038627775228674256525999999870664457888631288349859987001199 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVR 455 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~ 455 (523) +++.++++++++.++++.|++++.+-+.|+|+.|...|++|..|++..+.....+.......+ ...+.+.+....++. T Consensus 1 ~~~~~v~~~~~l~~a~~~m~~~~~~~~~Vvd~~~~~~G~it~~dl~~~~~~~~~~~~~~~~~~--~~~~~~~v~~~~~~~ 78 (113) T cd02205 1 RDVVTVSPDDTVAEALRLMLEHGISGLPVVDDDGRLVGIVTERDLLRALAEGGLDPLVTVGDV--MTRDVVTVSPDTSLE 78 (113) T ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHHHCCCCCCCCEEEE--EECCCEEEEECCHHH T ss_conf 997796892989999999998398099999399959999988998988763157410330456--750546897321177 Q ss_pred HHHHHH Q ss_conf 999982 Q gi|255764470|r 456 YASKLF 461 (523) Q Consensus 456 el~~~l 461 (523) +..+.+ T Consensus 79 ~~~~~~ 84 (113) T cd02205 79 EAAELM 84 (113) T ss_pred HHHHHH T ss_conf 999999 No 169 >PRK10892 D-arabinose 5-phosphate isomerase; Provisional Probab=97.17 E-value=0.00069 Score=47.40 Aligned_cols=74 Identities=22% Similarity=0.313 Sum_probs=41.8 Q ss_pred CCCCH-HHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHH Q ss_conf 67898-9999999751137866746534000289960899989999999857997888876785527888888887620 Q gi|255764470|r 285 HIISD-QEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRD 362 (523) Q Consensus 285 g~l~~-~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~ 362 (523) |.|.+ +=|+.++.-.++.+.+++++||+.. .+++.+....++++.+.+..-+-+||.++ ++++|++|..|+++. T Consensus 249 GIiTDGDLRR~l~~~~~i~~~~~~diMT~nP--~tI~~d~la~eAL~iMe~~kIt~L~Vvd~--~k~vGIihihDll~~ 323 (326) T PRK10892 249 GIFTDGDLRRVFDMGVDLRQLSIADVMTPGG--IRVRPGILAVDALNLMQSRHITSVLVADG--DQLLGVLHMHDLLRA 323 (326) T ss_pred EEEECHHHHHHHHHCCCCCCCCHHHHHCCCC--CEECCCCCHHHHHHHHHHCCCCEEEEEEC--CEEEEEEEHHHHHHC T ss_conf 9986268999987068831287999718999--67899886999999998639818999889--999999767978676 No 170 >PRK07807 inositol-5-monophosphate dehydrogenase; Validated Probab=97.15 E-value=0.0014 Score=45.10 Aligned_cols=62 Identities=13% Similarity=0.112 Sum_probs=54.1 Q ss_pred CCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCC Q ss_conf 78667465340002899608999899999998579978888767855278888888876202024 Q gi|255764470|r 302 ADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEE 366 (523) Q Consensus 302 ~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~ 366 (523) ...+|+++||| ++++.+.+.+.+|+.+.+.+++-.++||.+++ +++.|.+..||+.+....+ T Consensus 146 ~~~~v~~vMT~--~lvt~~~g~sl~eA~~lL~~~kiekLpvVd~~-~~L~gLiT~kDi~k~~~~~ 207 (479) T PRK07807 146 RFTRVGDVMST--DLVTLPAGTDPRKAFDLLEAAPVKVAPVVDAD-GRLAGVLTRTGALRATIYT 207 (479) T ss_pred CCCCHHHHHCC--CCEEECCCCCHHHHHHHHHHCCCCCCEEECCC-CEEEEEEEEEHHHHCCCCC T ss_conf 77758886257--73661366787999999975352311377569-9299999961164544688 No 171 >KOG2550 consensus Probab=97.08 E-value=0.0022 Score=43.79 Aligned_cols=150 Identities=19% Similarity=0.257 Sum_probs=105.3 Q ss_pred HHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCC--CCCEEEEEEHHHHHHHHHCC Q ss_conf 8999999975113786674653400028996089998999999985799788887678--55278888888876202024 Q gi|255764470|r 289 DQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGS--LDSFIGIVSARDLLRDLLEE 366 (523) Q Consensus 289 ~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~--~D~iiGiv~~kDLl~~~~~~ 366 (523) +.+..|+..+=.|..--+ ++-+.+..+.++.+.++.-..++|+-+||.++. ..+.+|+|..+|+-.. +. T Consensus 98 e~QA~~v~~vK~~~~g~~-------~dp~v~sp~~tvg~v~~~k~~~gF~g~pvTe~g~~~~KLvG~vtsrdi~f~--~~ 168 (503) T KOG2550 98 EDQADMVRRVKNYENGFI-------NNPIVISPTTTVGEVKEAKEKHGFSGIPVTEDGKRGSKLVGIITSRDIQFL--ED 168 (503) T ss_pred HHHHHHHHHHHHHHCCCC-------CCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHH--HC T ss_conf 887788888887522545-------686203775431055652065663354211577546515777742023455--34 Q ss_pred CCCHHHH-HHCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCE Q ss_conf 6601688-732743523899989999988503862777522867425652599999987066445788863128834985 Q gi|255764470|r 367 GSMNFKR-SIRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGS 445 (523) Q Consensus 367 ~~~~~~~-lir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~ 445 (523) ....+.+ ....+.+.|...++.++=+.+++++..-..|||+.|...-+++..|+...- +-| -..+.+++. T Consensus 169 ~~~~~~~vmt~~~~~~~~gi~l~~~neiL~~~kkGkl~iv~~~gelva~~~rtDl~k~~--------~yP-lask~~~kq 239 (503) T KOG2550 169 NSLLVSDVMTKNPVTGAQGITLKEANEILKKIKKGKLPVVDDKGELVAMLSRTDLMKNR--------DYP-LASKDSTKQ 239 (503) T ss_pred CCCHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEECCCCCEEEEEEHHHHHHHC--------CCC-CCCCCCCCC T ss_conf 55112433034430144446677889998763148652343677623343334566502--------787-555675413 Q ss_pred EEEEECCCHHH Q ss_conf 99870011999 Q gi|255764470|r 446 LTVDGWIDVRY 456 (523) Q Consensus 446 ~~v~G~~~l~e 456 (523) |++.+...-+| T Consensus 240 ll~gAaiGTre 250 (503) T KOG2550 240 LLCGAAIGTRD 250 (503) T ss_pred EEEEECCCCCC T ss_conf 56510136663 No 172 >PRK11543 gutQ D-arabinose 5-phosphate isomerase; Provisional Probab=97.07 E-value=0.00089 Score=46.60 Aligned_cols=75 Identities=16% Similarity=0.215 Sum_probs=51.3 Q ss_pred CCCCCH-HHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHH Q ss_conf 667898-9999999751137866746534000289960899989999999857997888876785527888888887620 Q gi|255764470|r 284 KHIISD-QEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRD 362 (523) Q Consensus 284 ~g~l~~-~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~ 362 (523) .|.|.+ +=|+.+.+-..|+ .+|.++||+-. .+++.+....++++.+.+..-+-+||.+++ +.++|++|..|+++. T Consensus 243 ~GIITDGDLRR~l~k~~~L~-~~v~~vMT~nP--~tI~~d~la~eAl~iM~~~kI~~LpVvd~~-~k~vGiIhiHDLl~~ 318 (321) T PRK11543 243 KGVFTDGDLRRWLVGGGALT-TPVNEAMTVGG--TTLQAQSRAIDAKEILMKRKITAAPVVDEN-GKLTGAINLQDFYQA 318 (321) T ss_pred EEEEECHHHHHHHHCCCCHH-HHHHHHHCCCC--CEECCCCHHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHHC T ss_conf 88874438999986378567-68999848999--578998719999999998798689998589-959999638978676 No 173 >smart00116 CBS Domain in cystathionine beta-synthase and other proteins. Domain present in all 3 forms of cellular life. Present in two copies in inosine monophosphate dehydrogenase, of which one is disordered in the crystal structure [3]. A number of disease states are associated with CBS-containing proteins including homocystinuria, Becker's and Thomsen disease. Probab=97.06 E-value=0.002 Score=44.01 Aligned_cols=48 Identities=27% Similarity=0.466 Sum_probs=43.9 Q ss_pred EEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHH Q ss_conf 2899608999899999998579978888767855278888888876202 Q gi|255764470|r 315 EIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDL 363 (523) Q Consensus 315 ~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~ 363 (523) +++++..+++++++.+.+.+++++++||.+++ ++++|++..+|+++.. T Consensus 1 ~~itv~~~~~v~~a~~~m~~~~i~~lPVVd~~-~~lvGiit~~Dil~~l 48 (49) T smart00116 1 DVVTVSPDTTLEEALELLREHGIRRLPVVDEE-GRLVGIVTRRDIIKAL 48 (49) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCEEEEECCC-CCEEEEEEHHHHHHHH T ss_conf 98699987929999999998099857699899-9199998879999864 No 174 >cd04597 CBS_pair_DRTGG_assoc2 This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with a DRTGG domain upstream. The function of the DRTGG domain, named after its conserved residues, is unknown. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=97.03 E-value=0.001 Score=46.22 Aligned_cols=57 Identities=16% Similarity=0.142 Sum_probs=51.4 Q ss_pred CCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHH Q ss_conf 786674653400028996089998999999985799788887678552788888888762 Q gi|255764470|r 302 ADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLR 361 (523) Q Consensus 302 ~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~ 361 (523) -.-+|+|+|+ .++++++.++|+.++.+.+.+++..++||.+++ +.++|++..+|+.. T Consensus 56 v~~qV~DIm~--~~~vti~~~~sl~eA~~lM~e~~i~~LPVVD~~-g~l~GiIT~~DIAe 112 (113) T cd04597 56 VHPRVRDVIN--RKPVTARPNDPLREALNLMHEHNIRTLPVVDDD-GTPAGIITLLDLAE 112 (113) T ss_pred CCCHHHHHCC--CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCC-CEEEEEEEHHHHHC T ss_conf 6505666313--788341999829999999998499878698899-93999987787435 No 175 >COG2905 Predicted signal-transduction protein containing cAMP-binding and CBS domains [Signal transduction mechanisms] Probab=96.98 E-value=0.0061 Score=40.49 Aligned_cols=113 Identities=19% Similarity=0.230 Sum_probs=79.1 Q ss_pred HHH-HHHHHHHHHHCCCCC------CCCCCHHHH-HHHHHHHHH-----------CCCCCHHHHHHHHHHHC---CCCEE Q ss_conf 777-667656744156888------766798889-999996774-----------66789899999997511---37866 Q gi|255764470|r 248 LRA-RTADAVLRLLGGKPI------QPQGLNVKA-DVLLPTQHE-----------KHIISDQEKDMVQSVLT---LADRP 305 (523) Q Consensus 248 ~~e-RtA~aVLRLlG~~~~------~~~~~~ee~-~~li~~~~~-----------~g~l~~~E~~mi~~vL~---L~e~~ 305 (523) ++| -+++.+++..|--.. .+..+-++. ..+.+++.. .|. -.++.|..+|+. -.+.+ T Consensus 138 ~~eq~~~e~~~trv~~~~~~~~~~v~~~~~i~~aa~km~~~gv~s~v~l~~~~~~~GI--vT~~dl~~~v~~~g~~~~~~ 215 (610) T COG2905 138 LAEQGESEFILTRVGEVKTLPAVTVSPQASIQDAARKMKDEGVSSLVVLDDSGPLLGI--VTRKDLRSRVIADGRSKTQK 215 (610) T ss_pred HHHHCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCCE--EEHHHHHHHHHHCCCCCCCC T ss_conf 9861410588998777741787526865767999999996088728998379874332--42277789998658974233 Q ss_pred EEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCC Q ss_conf 7465340002899608999899999998579978888767855278888888876202024 Q gi|255764470|r 306 AKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEE 366 (523) Q Consensus 306 V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~ 366 (523) |.++|| ..+.+++.++-+=|++-.+.+.+-.|+||+++ .+++|+++..|++.....+ T Consensus 216 V~evmT--~p~~svd~~~~~feAml~m~r~~I~hl~V~e~--gq~~Gilt~~dIl~l~s~~ 272 (610) T COG2905 216 VSEVMT--SPVISVDRGDFLFEAMLMMLRNRIKHLPVTED--GQPLGILTLTDILRLFSQN 272 (610) T ss_pred HHHHHC--CCCEEECCCCHHHHHHHHHHHHCCCEEEEECC--CEEEEEEEHHHHHHHHCCC T ss_conf 545414--68445447656999999999807744236118--9046776699999761778 No 176 >cd04642 CBS_pair_29 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=96.95 E-value=0.0061 Score=40.51 Aligned_cols=48 Identities=23% Similarity=0.328 Sum_probs=29.2 Q ss_pred CEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHC Q ss_conf 435238999899999885038627775228674256525999999870 Q gi|255764470|r 378 PLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIA 425 (523) Q Consensus 378 ~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~Iv 425 (523) ...+.+++++.++++.|++++.+-..|+||.|...|++|..|+..... T Consensus 3 Vitv~~~~~l~eA~~lm~~~~i~~lPVvd~~g~lvGiis~~Dl~~~~~ 50 (126) T cd04642 3 VVSIDSDERVLDAFKLMRKNNISGLPVVDEKGKLIGNISASDLKGLLL 50 (126) T ss_pred EEEECCCCCHHHHHHHHHHCCCCEEEEEECCCEEEEEEEHHHHHHHHC T ss_conf 799999599999999999809878999928990999999899765551 No 177 >cd04603 CBS_pair_KefB_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the KefB (Kef-type K+ transport systems) domain which is involved in inorganic ion transport and metabolism. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=96.93 E-value=0.0027 Score=43.06 Aligned_cols=47 Identities=19% Similarity=0.280 Sum_probs=39.0 Q ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 74352389998999998850386277752286742565259999998 Q gi|255764470|r 377 KPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 377 ~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) +|.++..++++.++++.|.+.+..-++||||.+...|++|..|+++. T Consensus 2 ~pvtv~~~~~L~~a~~~~~e~~~~~i~VVD~~~k~vG~it~~Dll~~ 48 (111) T cd04603 2 QTVSVNCENPLREAIKMINELGARAVVVVDEENKVLGQVTLSDLLEI 48 (111) T ss_pred CCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHH T ss_conf 44686599959999999987699768998899968899889998643 No 178 >COG4175 ProV ABC-type proline/glycine betaine transport system, ATPase component [Amino acid transport and metabolism] Probab=96.90 E-value=0.002 Score=44.06 Aligned_cols=127 Identities=22% Similarity=0.260 Sum_probs=92.0 Q ss_pred HHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCC Q ss_conf 89999999751137866746534000289960899989999999857997888876785527888888887620202466 Q gi|255764470|r 289 DQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGS 368 (523) Q Consensus 289 ~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~ 368 (523) +-=++.++++=...=.++++||.|......-...+......+...+..-+-+-++. ...+.+|++..-+.... T Consensus 259 dYV~~Fv~~v~~~~VltA~~IM~~~~~~~~~~~~~~~~~al~~~~~~~~~~~~~~~-~~~~~~g~v~~~~~~~~------ 331 (386) T COG4175 259 DYVRDFVRNVDRSRVLTAKDIMRRPDLLIRKTPGDGPRVALKLLRDEGREYGYAVD-RGNKFVGVVSIDSLVKA------ 331 (386) T ss_pred HHHHHHHHCCCHHHEEEHHHHHCCCCCCCCCCCCCCCCHHHHHHHHCCCHHHHHHH-CCCCEEEEEECCCHHCC------ T ss_conf 78999984377533223877631454344455555631121233322411367774-25752558964620101------ Q ss_pred CHHHHHHCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCC Q ss_conf 01688732743523899989999988503862777522867425652599999987066 Q gi|255764470|r 369 MNFKRSIRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGD 427 (523) Q Consensus 369 ~~~~~lir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGe 427 (523) ....++..++.++++.+.+...+++.+++|+ +||.|...|+++...+++++.+. T Consensus 332 ----~~~~~~~~v~~d~~~~~~~~~~~~~~~p~aV-vde~~r~vG~i~~~~vl~aL~~~ 385 (386) T COG4175 332 ----ALIDDVLTVDADTPLSEILARIRQAPCPVAV-VDEDGRYVGIISRGELLEALARI 385 (386) T ss_pred ----CCCCCCCCCCCCCHHHHHHHHHHCCCCCEEE-ECCCCCEEEEECHHHHHHHHHCC T ss_conf ----0146533567642488999987528985468-85788577786388999997458 No 179 >COG2095 MarC Multiple antibiotic transporter [Intracellular trafficking and secretion] Probab=96.87 E-value=0.047 Score=34.00 Aligned_cols=174 Identities=15% Similarity=0.200 Sum_probs=100.5 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999974439999999851799899999999999999999999999999999974247999863453222489999999 Q gi|255764470|r 19 IALELVLGIDNLIFITLLVEKLPLAQRGKALVFGLTFAMVTRIALLASLSYWIVMLQQPLFFLKGLSFSGRDIVLILGGF 98 (523) Q Consensus 19 ~~lEivLs~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l~~pl~~i~~~~~s~~~lill~GG~ 98 (523) .++=++=...|+-+..-+++++|+++|+|...--...|+..=.+++++.. + +++.|+.+. +-....||+ T Consensus 13 ~Lf~i~dP~G~ipvf~slt~~~~~~~r~~v~~ra~i~a~~ill~f~~~G~-~-------il~~fgIsi---~a~rIAGGi 81 (203) T COG2095 13 LLFAIIDPIGNLPVFISLTKGLSPEERNRVALRASIIALLILLVFLLLGE-G-------ILRFFGISI---DAFRIAGGI 81 (203) T ss_pred HHHHHHCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-H-------HHHHHCCCH---HHHHHHHHH T ss_conf 99998577850387999976899999999999999999999999999999-9-------999968866---777874049 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCC---C----CCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HCHHHHHHHHHHHHH Q ss_conf 99999865555302576421111222---3----320999999999999998577899999854-146999999999999 Q gi|255764470|r 99 FLLFKGTIELHERLEGDGFDKKHKFF---S----PVSWQVIVLQIVILDLVFSLDSVVTAIGMV-QHFSVMAIAVAVSAL 170 (523) Q Consensus 99 fLl~k~~~el~~~~e~~~~~~~~~~~---~----~~~~~~~v~~I~v~D~vFSlDsVitavg~~-~~~~i~~~a~~isi~ 170 (523) .|.+.+.+.+..+.+....+.++... . +.++.+--. ++.+++.-.+-. ++.+..+.++.+... T Consensus 82 lLf~ia~~ml~~~~~~~~~~~e~~~~~~iaivPLA~PliaGPg---------~Ist~i~~~~~~~~~~~~~~~~i~~~~~ 152 (203) T COG2095 82 LLFLIALRMLFGPTSRPKKKREEGQEDSIAIVPLAIPLIAGPG---------TIATVIVLSSQYGNSKLAVVLAILLASL 152 (203) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCH---------HHHHHHHHHHCCCCCCHHHHHHHHHHHH T ss_conf 8999999995687677777866576578025446575534759---------9999999981357440999999999999 Q ss_pred ---HHHHHHHHHHHHHHHCCHHHH--HHHHHHHHHHHHHHHHHHCCC Q ss_conf ---999998999999984852899--999999999999999774573 Q gi|255764470|r 171 ---MMMAVSQPMIRYISRHTTVVI--LCLGFLLMIGFLLIIEGLHFD 212 (523) Q Consensus 171 ---~m~~~s~~i~~~i~~~p~~~~--l~l~fl~~iG~~L~~eg~~~~ 212 (523) +.+..|..+.|++.+.....+ +--.+|..+|+.++.+|...- T Consensus 153 ~~~~~l~~s~~i~r~lG~~G~~vl~RimGllL~al~vq~i~~Gl~~~ 199 (203) T COG2095 153 LTYLILLSSSRIMRLLGKTGLNALTRIMGLLLAALGVQMILDGLRGI 199 (203) T ss_pred HHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999998899999977888999999999999999999999999998 No 180 >COG3263 NhaP-type Na+/H+ and K+/H+ antiporters with a unique C-terminal domain [Inorganic ion transport and metabolism] Probab=96.87 E-value=0.0043 Score=41.58 Aligned_cols=75 Identities=15% Similarity=0.170 Sum_probs=64.4 Q ss_pred CEEEEEECCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEECCEEEEEEEEECCEEEEEEEEECC Q ss_conf 8599870011999999828899998778240789999883124889988999999999999618877899999458 Q gi|255764470|r 444 GSLTVDGWIDVRYASKLFGVNLVDEDDRYSTLAGFILWRLGHLPQEKEIFTEMNLKFEIIRLEGHNIDRVKVSGLQ 519 (523) Q Consensus 444 ~~~~v~G~~~l~el~~~l~~~l~~e~~~~~TlaGlil~~l~~iP~~Ge~i~~~g~~f~V~~~~~~rI~~V~V~~l~ 519 (523) |.+..+.+-...+++-..+.+.. +.+.-.|+++++..+++..|..||++++++...+|.+.+++||.++-++--+ T Consensus 495 g~f~~e~~~~~~~~~~~yg~~~~-~~d~~~~~~~~i~~~lg~~~v~~Drv~~~~l~~~v~~~~d~~it~ig~~~~~ 569 (574) T COG3263 495 GDFILEPSAKAADVALAYGLGDG-YADKAQTLGELVTQLLGGAPVLGDRVEFGGLIWTVAEKEDNRVTKIGVRVEE 569 (574) T ss_pred CCHHCCCCHHHHHHHHHHCCCCC-HHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEECCCCEEEEEEEECCC T ss_conf 21320722346679998579987-0366535778899983787322015787567999853224734422342230 No 181 >pfam01914 MarC MarC family integral membrane protein. Integral membrane protein family that includes the antibiotic resistance protein MarC. These proteins may be transporters. Probab=96.75 E-value=0.058 Score=33.37 Aligned_cols=177 Identities=16% Similarity=0.207 Sum_probs=97.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999997443999999985179989999999999999999999999999999997424799986345322248 Q gi|255764470|r 12 WIGLATLIALELVLGIDNLIFITLLVEKLPLAQRGKALVFGLTFAMVTRIALLASLSYWIVMLQQPLFFLKGLSFSGRDI 91 (523) Q Consensus 12 w~~l~tl~~lEivLs~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l~~pl~~i~~~~~s~~~l 91 (523) +-++.++ .=++=-+-|+-+..-++++.++++|+|........|.+.=..+++... +++..++.+... T Consensus 5 l~~f~~L--f~iinPig~~pifl~lt~~~~~~~r~~ia~~a~~~a~~il~~f~~~G~--------~iL~~fGIsl~a--- 71 (203) T pfam01914 5 FSAFLSL--FAIINPIGNVPVFISLTENYPAEERKRIILRASIIAFLILLLFLVFGK--------LILKLFGISIDA--- 71 (203) T ss_pred HHHHHHH--HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHCCCHHH--- T ss_conf 9999999--999846347999999858999999999999999999999999999999--------999996887999--- Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC--CCCCCH---------H---HHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 9999999999998655553025764211112--223320---------9---9999999999999857789999985414 Q gi|255764470|r 92 VLILGGFFLLFKGTIELHERLEGDGFDKKHK--FFSPVS---------W---QVIVLQIVILDLVFSLDSVVTAIGMVQH 157 (523) Q Consensus 92 ill~GG~fLl~k~~~el~~~~e~~~~~~~~~--~~~~~~---------~---~~~v~~I~v~D~vFSlDsVitavg~~~~ 157 (523) +.+.||+.|+..+++.+..+.+....++++. ...... + .+++.++++ ++=+. -+.. + T Consensus 72 frIaGGiiL~~ial~ml~~~~~~~~~~~~~~~~~~~~~~~aivPLaiPllaGPG~Ittvi~----l~~~~----~~~~-~ 142 (203) T pfam01914 72 FRIAGGILLFLIAIDMLFGKQSKEKTSKSEKEESEDIDEIAVVPLAIPLIAGPGAITTTMV----LMAEH----GTIG-L 142 (203) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCHHHHHHHH----HHHHC----CCHH-H T ss_conf 9998289999999999658655333464433332222344100001310368389999999----98655----8788-9 Q ss_pred HHHHHHH---HHHHHHHHHHHHHHHHHHHHHCCHHHH--HHHHHHHHHHHHHHHHHHC Q ss_conf 6999999---999999999998999999984852899--9999999999999997745 Q gi|255764470|r 158 FSVMAIA---VAVSALMMMAVSQPMIRYISRHTTVVI--LCLGFLLMIGFLLIIEGLH 210 (523) Q Consensus 158 ~~i~~~a---~~isi~~m~~~s~~i~~~i~~~p~~~~--l~l~fl~~iG~~L~~eg~~ 210 (523) ......+ +.....+.+..|+++.|++.+.-.-.+ +.-.+|..||+.++.+|.. T Consensus 143 ~~~~~~ai~~~~~~~~l~l~~s~~i~k~lG~~g~~ii~Rl~GliL~aiavq~i~~Gi~ 200 (203) T pfam01914 143 KIAVILAILLAWLITFLILLSSSFIIRLLGRTGINALTRIMGLLLAAIAVQMIVTGIK 200 (203) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999988999999888999999999999999999999999889 No 182 >PRK05567 inositol-5'-monophosphate dehydrogenase; Reviewed Probab=96.74 E-value=0.0088 Score=39.35 Aligned_cols=62 Identities=15% Similarity=0.190 Sum_probs=54.8 Q ss_pred CCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHC Q ss_conf 7866746534000289960899989999999857997888876785527888888887620202 Q gi|255764470|r 302 ADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLE 365 (523) Q Consensus 302 ~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~ 365 (523) .+.+|+++||| .++++.+.+.+.+++.+.+.+++..++||.+++ .+++|++..||+.+.... T Consensus 145 ~~~~V~~vMT~-~~lvt~~~~is~~eA~~~l~~~kieklPvVd~~-g~L~Glit~kDi~k~~~~ 206 (486) T PRK05567 145 LSQPVSEVMTK-ERLVTVPEGTTLEEALELLHEHRIEKLLVVDDN-GRLKGLITVKDIEKAEEF 206 (486) T ss_pred CCCCHHHHHCC-CCEEEECCCCCHHHHHHHHHHHHHCCCCEECCC-CCEEEEEEHHHHHHHHHC T ss_conf 77654675345-732892588999999999997313034277468-968888776677765208 No 183 >PTZ00314 inosine-5'-monophosphate dehydrogenase; Provisional Probab=96.73 E-value=0.0076 Score=39.79 Aligned_cols=62 Identities=18% Similarity=0.193 Sum_probs=54.6 Q ss_pred CCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHC Q ss_conf 37866746534000289960899989999999857997888876785527888888887620202 Q gi|255764470|r 301 LADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLE 365 (523) Q Consensus 301 L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~ 365 (523) -..++|+++||| +.+..+.+.+.+++.+.+.+++..++||.+++ ++++|.+..||+.+.... T Consensus 155 d~~~~v~~vMt~--~lvt~~~~isl~eA~~ll~~~kieklpvVd~~-g~L~glit~kDi~k~~~~ 216 (499) T PTZ00314 155 DKSTPVSEIMTT--DLVVGRYPITLEDANDVLNRSRKGVLPIVNDN-GELVALVSRSDAVKNRDY 216 (499) T ss_pred CCCCCHHHHHCC--CCEEECCCCCHHHHHHHHHHHHHCCCEEECCC-CCEEEEEECCHHHHHHHC T ss_conf 677588886156--72672599999999999986033123066578-958998630348775338 No 184 >cd04614 CBS_pair_1 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=96.68 E-value=0.003 Score=42.75 Aligned_cols=45 Identities=24% Similarity=0.327 Sum_probs=24.2 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHH Q ss_conf 274352389998999998850386277752286742565259999 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANI 420 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDI 420 (523) |++..+.+.+++.++++.|++++.....|+|+.|...||+|-.|+ T Consensus 1 rdv~tv~~~~~v~~a~~~m~~~~i~~lpVvd~~g~lvGIiT~rDl 45 (96) T cd04614 1 RNVPTVWEETPLPVAVRIMELANVKALPVLDDDGKLSGIITERDL 45 (96) T ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCEEEEEEECHHH T ss_conf 996390896989999999997299879999899949999988888 No 185 >KOG1764 consensus Probab=96.68 E-value=0.011 Score=38.59 Aligned_cols=136 Identities=14% Similarity=0.161 Sum_probs=89.6 Q ss_pred HHHHHHHHCCCCEEE--EEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCC Q ss_conf 999997511378667--465340002899608999899999998579978888767855278888888876202024660 Q gi|255764470|r 292 KDMVQSVLTLADRPA--KSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSM 369 (523) Q Consensus 292 ~~mi~~vL~L~e~~V--~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~ 369 (523) .+-+....-...+++ ..-++++...+++....+.-+....+.++++.|+||.+.+..++..++..+-++......... T Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~p~~s~l~~~~~l~~~~~~rvpv~d~~~~~v~~ilt~~rIl~~l~~~~~~ 217 (381) T KOG1764 138 IEVLEDSQLSKRREVECLLKETLKPPFVSISPESSLLDAVLLLIKSRIHRVPVIDPETGEVLYILTQRRILKFLWLNGRL 217 (381) T ss_pred HHHHHHHHCCCCCEEEEEEECCCCCCCEEECCCHHHHHHHHHHCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 76530220145512443210444677413078688999776423577247523365655421112099999999851456 Q ss_pred H---------HHH----HHCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCC Q ss_conf 1---------688----732743523899989999988503862777522867425652599999987066 Q gi|255764470|r 370 N---------FKR----SIRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGD 427 (523) Q Consensus 370 ~---------~~~----lir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGe 427 (523) . +.+ ......++.+++++.++++.|+.++..=..|||+.|..+|..+.-|+.-..... T Consensus 218 ~~~~~~l~~s~~dl~ig~~~~i~~i~~~~~v~~al~~m~~~~is~lpvV~~~g~~v~~~s~~Dv~~l~~~~ 288 (381) T KOG1764 218 LPLPSLLSKSLSDLGIGTWSNIASISEDTPVIEALKIMSERRISALPVVDENGKKVGNYSRFDVIHLAREG 288 (381) T ss_pred CCCHHHHHCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCEECCCCCEECCEEEEHHHHHHHCC T ss_conf 44224551887664047787777751787089999999870457443475778553215620012156427 No 186 >TIGR03415 ABC_choXWV_ATP choline ABC transporter, ATP-binding protein. Members of this protein family are the ATP-binding subunit of a three-protein transporter. This family belongs, more broadly, to the family of proline and glycine-betaine transporters, but members have been identified by direct characterization and by bioinformatic means as choline transporters. Many species have several closely-related members of this family, probably with variable abilities to act additionally on related quaternary amines. Probab=96.68 E-value=0.0034 Score=42.38 Aligned_cols=121 Identities=17% Similarity=0.184 Sum_probs=69.4 Q ss_pred HHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHH Q ss_conf 99999751137866746534000289960899989999999857997888876785527888888887620202466016 Q gi|255764470|r 292 KDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNF 371 (523) Q Consensus 292 ~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~ 371 (523) ++.+.++=.+.-.+++++|.|...+...+... ....+ ..+..-+..... ......+........... T Consensus 262 ~~Fvg~~~~~~vl~a~~vm~~~~~~~~~~~~~--~~~~~-----~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~ 328 (382) T TIGR03415 262 ADFVAHTNPLNVLTARSLMRPLTDLEHVDGGW--CVSDR-----RDTWLFTIDKQV------RRRDAKLPVQAWAAEQEV 328 (382) T ss_pred HHHCCCCCCCCEEEHHHHCCCCCCCCCCCCCH--HHHHH-----HCCCCCCCCHHH------HCCCCCCHHHHCCCCCCH T ss_conf 86229787211256878436741001357403--34433-----200001320221------014431022221344302 Q ss_pred HHHHCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCC Q ss_conf 88732743523899989999988503862777522867425652599999987066 Q gi|255764470|r 372 KRSIRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGD 427 (523) Q Consensus 372 ~~lir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGe 427 (523) ......+..+++++++.+++..+.++.... ||||.|...|+||..++++++.|. T Consensus 329 ~~~~~~~~~~~~~~~l~~~l~~~~~~~~~~--~v~~~g~l~G~i~~~~~l~~l~g~ 382 (382) T TIGR03415 329 ESLEAAPTVINPDTLMRDVLAARHRTGGAI--LLVENGRIVGVIGDDNIYHALLGH 382 (382) T ss_pred HHHCCCCCEECCCCCHHHHHHHHHHCCCCE--EEECCCEEEEEEEHHHHHHHHHCC T ss_conf 231257733389893999999998589980--990599599998689999986279 No 187 >TIGR00400 mgtE magnesium transporter; InterPro: IPR006669 This is the MgtE family of magnesium transporters. All the archaebacterial and eukaryotic examples have two copies of the integral membrane region. This suggests that the eubacterial examples may act as dimers. Members of this family probably transport Mg^2+ or other divalent cations into the cell. The alignment contains two highly conserved aspartates that may be involved in cation binding.; GO: 0008324 cation transmembrane transporter activity, 0006812 cation transport. Probab=96.46 E-value=0.016 Score=37.48 Aligned_cols=155 Identities=19% Similarity=0.220 Sum_probs=114.3 Q ss_pred CCCCHHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHH---CCCCEEEEE-CCCCCCEEEEEEHHHHH Q ss_conf 67898999999975113786674653400028996089998999999985---799788887-67855278888888876 Q gi|255764470|r 285 HIISDQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILE---LGHSRFPVA-QGSLDSFIGIVSARDLL 360 (523) Q Consensus 285 g~l~~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~---~~~SR~PV~-~~~~D~iiGiv~~kDLl 360 (523) .....+|+..++..+...+-+...+|| .+.+|+..+-+..+....++. ...+-+..| .++..+..|.+..+|+. T Consensus 116 ~~~~~~~~~~~~~~~~~~~~~~g~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 193 (460) T TIGR00400 116 ASLSEEERKAVNLLLGYPPDSAGRILT--TEYVWLKEDYTVGDALDYLRRVAETAEDIYTLYVTNESKRLTGVLSLRDLI 193 (460) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCEEE--EEEEECCCCCHHHHHHHHHHHHHHHHHHHEEEEEECCCHHHHHHHHHHHHH T ss_conf 642123567888873478532342000--000111210015788998888765542102457633410220001123333 Q ss_pred HHHHCCCCCHHHHHH-CCCEEECCCCCHHH-HHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCE Q ss_conf 202024660168873-27435238999899-9998850386277752286742565259999998706644578886312 Q gi|255764470|r 361 RDLLEEGSMNFKRSI-RKPLVVHENISVLK-LMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDI 438 (523) Q Consensus 361 ~~~~~~~~~~~~~li-r~~~~Vpe~~~l~~-lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i 438 (523) .... +......+ ....++.....-.+ .-..+++...--..++|+.|...|++|.+|+++.+..+-.++....... T Consensus 194 ~~~~---~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~d~~~~p~~~~~~~~~g~~~~dd~~~~~~~~~~~d~~~~~~~ 270 (460) T TIGR00400 194 LAKP---EEYLGDLLFPDGVSVDGLNDEEEDVARLIEKYDFLAVPVVDNEGRLVGIVTVDDILDVLEEEATEDFYEAGAV 270 (460) T ss_pred HCCC---HHHHHHHHHHCCEEEEECCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHCC T ss_conf 0350---5677665421001352045404678988764021020013167716875423468898875200235654203 Q ss_pred EEECCC Q ss_conf 883498 Q gi|255764470|r 439 TVGDDG 444 (523) Q Consensus 439 ~~~~d~ 444 (523) ...++. T Consensus 271 ~~~~~~ 276 (460) T TIGR00400 271 KGLGDD 276 (460) T ss_pred CCCCCC T ss_conf 555530 No 188 >PRK11469 hypothetical protein; Provisional Probab=96.24 E-value=0.12 Score=31.17 Aligned_cols=179 Identities=17% Similarity=0.235 Sum_probs=96.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH Q ss_conf 999999999999974439999999851-799899999999999999999999999999999974-247999863453222 Q gi|255764470|r 12 WIGLATLIALELVLGIDNLIFITLLVE-KLPLAQRGKALVFGLTFAMVTRIALLASLSYWIVML-QQPLFFLKGLSFSGR 89 (523) Q Consensus 12 w~~l~tl~~lEivLs~DN~i~iai~~~-~Lp~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l-~~pl~~i~~~~~s~~ 89 (523) -.+++++.++-+.||.|-- ..|+... ..+....+++..+++....+- .++.+.. |..+. ...++.-+.+ T Consensus 18 ~M~~~~i~llaiaLsmDAF-aVsi~~G~~~~~~~~~~~l~~al~FG~fQ-~~MPllG--~~~G~~~~~~i~~~dh----- 88 (206) T PRK11469 18 VMNITATVLLAFGMSMDAF-AASIGKGATLHKPKFSEALRTGLIFGAVE-TLTPLIG--WGMGMLASRFVLEWNH----- 88 (206) T ss_pred EECHHHHHHHHHHHHHHHH-HHHHHCCCCCCCCCHHHHHHHHHHHHHHH-HHHHHHH--HHHHHHHHHHHHHHHH----- T ss_conf 3529999999999999999-99996510447875999999999999999-9999999--9999999999987689----- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HCHHHHHHHHHHH Q ss_conf 489999999999998655553025764211112223320999999999999998577899999854-1469999999999 Q gi|255764470|r 90 DIVLILGGFFLLFKGTIELHERLEGDGFDKKHKFFSPVSWQVIVLQIVILDLVFSLDSVVTAIGMV-QHFSVMAIAVAVS 168 (523) Q Consensus 90 ~lill~GG~fLl~k~~~el~~~~e~~~~~~~~~~~~~~~~~~~v~~I~v~D~vFSlDsVitavg~~-~~~~i~~~a~~is 168 (523) ..+...|.+.+.+.++|...+.++++.. .....++|. +..+-++=|+|.--.=++.+ -+..+...+.+|+ T Consensus 89 ----WiAf~LL~~IG~~MI~e~~~~~~~~~~~-~~~~~~~~~----Ll~lAiATSIDAlAVGvsla~l~~~i~~~~lvIG 159 (206) T PRK11469 89 ----WIAFVLLIFLGGRMIIEGFRGADDEDEE-PRRRHGFWL----LVTTAIATSLDAMAVGVGLAFLQVNIIATALAIG 159 (206) T ss_pred ----HHHHHHHHHHHHHHHHHHHCCCCCCCCC-CCCCCCHHH----HHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf ----9999999999999999986566632113-444676999----9999999869999999999984630999999999 Q ss_pred HH--HHHHHH----HHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 99--999998----9999999848528999999999999999997745 Q gi|255764470|r 169 AL--MMMAVS----QPMIRYISRHTTVVILCLGFLLMIGFLLIIEGLH 210 (523) Q Consensus 169 i~--~m~~~s----~~i~~~i~~~p~~~~l~l~fl~~iG~~L~~eg~~ 210 (523) +. +|-++. +.+.++..++. .+++=.-|+.||.....|-++ T Consensus 160 ~vT~~~s~~Gv~lG~~~G~~~g~~A--ei~GG~VLI~IGlkiL~eHl~ 205 (206) T PRK11469 160 CATLIMSTLGMMVGRFIGSIIGKKA--EILGGLVLIGIGVQILWTHFH 205 (206) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHC T ss_conf 9999999999999999889870199--999999999999999998837 No 189 >TIGR00427 TIGR00427 conserved hypothetical protein TIGR00427; InterPro: IPR002771 Members of this family are integral membrane proteins that includes the antibiotic resistance protein MarC. These proteins may be transporters. . Probab=96.15 E-value=0.13 Score=30.89 Aligned_cols=169 Identities=21% Similarity=0.251 Sum_probs=97.3 Q ss_pred HHHHHHH-HHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9744399-999998517998999999999999999999999999999999742479998634532224899999999999 Q gi|255764470|r 24 VLGIDNL-IFITLLVEKLPLAQRGKALVFGLTFAMVTRIALLASLSYWIVMLQQPLFFLKGLSFSGRDIVLILGGFFLLF 102 (523) Q Consensus 24 vLs~DN~-i~iai~~~~Lp~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l~~pl~~i~~~~~s~~~lill~GG~fLl~ 102 (523) +==+=|. ||++ +|++-|.++|+|.-.=-...|.+.=.+++++.- | .|+.|+.+.+. ...+||++++- T Consensus 21 ~nP~G~~PiFi~-LTe~y~~~~R~~i~kkA~I~~~~IL~~f~~fG~-~-------Il~~FGIsIda---friAGGILlf~ 88 (206) T TIGR00427 21 INPIGNVPIFIS-LTESYTAEEREKIAKKAVISAFVILLIFLLFGD-L-------ILKYFGISIDA---FRIAGGILLFL 88 (206) T ss_pred HCCCCCCCHHHC-CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-H-------HHHHHCCCHHH---HHHHHHHHHHH T ss_conf 547550232211-488887789999989999999999999998745-7-------88660872789---89988999999 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHH-----CHHHHHHHHHHHHH-- Q ss_conf 986555530257642111122233209999999999999985778999-----998541-----46999999999999-- Q gi|255764470|r 103 KGTIELHERLEGDGFDKKHKFFSPVSWQVIVLQIVILDLVFSLDSVVT-----AIGMVQ-----HFSVMAIAVAVSAL-- 170 (523) Q Consensus 103 k~~~el~~~~e~~~~~~~~~~~~~~~~~~~v~~I~v~D~vFSlDsVit-----avg~~~-----~~~i~~~a~~isi~-- 170 (523) .|++.++.+...+..++++.... ...+-.|.+.=++-=+=+=+. -|-+|+ +..+++.|+..++. T Consensus 89 Ia~dml~gK~~~~~~~k~~E~~e----~~~~dsI~v~PLA~PL~aGPGaIt~~mvl~~~~~~I~~k~~V~l~Ial~~~~~ 164 (206) T TIGR00427 89 IALDMLSGKESEKKKHKHDEKEE----SKEIDSIAVVPLALPLIAGPGAITATMVLIAKASDIGEKFLVVLAIALVLLIT 164 (206) T ss_pred HHHHHHHCCCCHHHHCCHHHHHH----HHHHHHHCCCCHHHHHCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHH T ss_conf 99999840255045204134786----44654102221245420365169999999970330899999999999999999 Q ss_pred -HHHHHHHHHHHHHHHCCHH---HHHHHHHHHHHHHHHHHHHH Q ss_conf -9999989999999848528---99999999999999999774 Q gi|255764470|r 171 -MMMAVSQPMIRYISRHTTV---VILCLGFLLMIGFLLIIEGL 209 (523) Q Consensus 171 -~m~~~s~~i~~~i~~~p~~---~~l~l~fl~~iG~~L~~eg~ 209 (523) +....|.++.+-++|.-.= -++|| +|..|++-++.+|. T Consensus 165 ~l~~~~~~~i~~~lg~~Gin~~tRimGL-~L~AiaV~~I~~Gi 206 (206) T TIGR00427 165 FLLFLYAAFIIRRLGRTGINVITRIMGL-LLAAIAVEFIVTGI 206 (206) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHCCC T ss_conf 9999876899988623018899999999-99998888751589 No 190 >PRK10229 threonine efflux system; Provisional Probab=96.01 E-value=0.15 Score=30.42 Aligned_cols=92 Identities=17% Similarity=0.224 Sum_probs=47.8 Q ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999-999744399999998517998999999999999999999999999999999742479998634532224 Q gi|255764470|r 12 WIGLATLIAL-ELVLGIDNLIFITLLVEKLPLAQRGKALVFGLTFAMVTRIALLASLSYWIVMLQQPLFFLKGLSFSGRD 90 (523) Q Consensus 12 w~~l~tl~~l-EivLs~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l~~pl~~i~~~~~s~~~ 90 (523) |..++...++ =+.=|-|.+.++....++= .++--+--.|+..+...-..+.......+..-...+| . T Consensus 5 ~l~~~l~~l~~~isPGPd~~lv~~~s~~~G--~r~g~~~~~Gi~~G~~v~~~la~~Gl~~l~~~~~~~f----------~ 72 (206) T PRK10229 5 FLTVAMVHIVALMSPGPDFFFVSQTAVSRS--RKEAMMGVLGITCGVMVWAGVALLGLHLIIEKMAWLH----------T 72 (206) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------H T ss_conf 999999999996198667999999998857--9989999999999999999999999999999759999----------9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 8999999999999865555302576 Q gi|255764470|r 91 IVLILGGFFLLFKGTIELHERLEGD 115 (523) Q Consensus 91 lill~GG~fLl~k~~~el~~~~e~~ 115 (523) .+.++|+.||+|.+++-+++..... T Consensus 73 ~ik~~Ga~YL~yLG~~~lr~~~~~~ 97 (206) T PRK10229 73 IIMVGGGLYLCWMGYQMLRGALKKE 97 (206) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 9999999999999999998642356 No 191 >pfam03741 TerC Integral membrane protein TerC family. This family contains a number of integral membrane proteins that also contains the TerC protein. TerC has been implicated in resistance to tellurium. This protein may be involved in efflux of tellurium ions. The tellurite-resistant Escherichia coli strain KL53 was found during testing of the group of clinical isolates for antibiotics and heavy metal ion resistance. Determinant of the tellurite resistance of the strain was located on a large conjugative plasmid. Analyses showed, the genes terB, terC, terD and terE are essential for conservation of the resistance. The members of the family contain a number of conserved aspartates that could be involved in binding to metal ions. Probab=95.92 E-value=0.16 Score=30.13 Aligned_cols=76 Identities=11% Similarity=0.154 Sum_probs=59.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHC------HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999857789999985414------699999999999999999899999998485289999999999999999977 Q gi|255764470|r 135 QIVILDLVFSLDSVVTAIGMVQH------FSVMAIAVAVSALMMMAVSQPMIRYISRHTTVVILCLGFLLMIGFLLIIEG 208 (523) Q Consensus 135 ~I~v~D~vFSlDsVitavg~~~~------~~i~~~a~~isi~~m~~~s~~i~~~i~~~p~~~~l~l~fl~~iG~~L~~eg 208 (523) |+.++|.+-|.||++.-.-++++ -..+..|+..++..=+.+.-..+..++.+|.+..++-.||+++|..+..+. T Consensus 2 tl~~lE~vLs~DN~ivial~~~~lP~~~r~~al~~Gi~~A~v~R~i~i~~~~~ll~~~~~l~~igg~~Li~~~~k~~~~~ 81 (184) T pfam03741 2 TLYLLEISLSVDNAFVIALIFRKLPPEQQKKALFWGIIGALVLRIILILLGSALLELFDWILLIGGAFLLYTAIKLLREN 81 (184) T ss_pred EEHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 20265599885499999999804899999999999999999999999999999999707999999999999999998732 Q ss_pred HC Q ss_conf 45 Q gi|255764470|r 209 LH 210 (523) Q Consensus 209 ~~ 210 (523) .+ T Consensus 82 ~~ 83 (184) T pfam03741 82 EE 83 (184) T ss_pred CC T ss_conf 34 No 192 >PRK10070 glycine betaine transporter ATP-binding subunit; Provisional Probab=95.73 E-value=0.053 Score=33.61 Aligned_cols=126 Identities=11% Similarity=0.130 Sum_probs=74.3 Q ss_pred CCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCC------------CCCHHHHHHCC-CEEEC---CCC Q ss_conf 999899999998579978888767855278888888876202024------------66016887327-43523---899 Q gi|255764470|r 322 NCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEE------------GSMNFKRSIRK-PLVVH---ENI 385 (523) Q Consensus 322 ~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~------------~~~~~~~lir~-~~~Vp---e~~ 385 (523) .-+.+|++.. -.|+-|.++ ..++-+=...+++....+. +-.+..+.+++ |..+. +.. T Consensus 222 THDl~eA~~l-----aDRIaVM~~--G~Ivq~GtpeeI~~~Pa~~yV~~F~~~v~~~~~~~a~~i~~~~~~~~~~~~~~~ 294 (400) T PRK10070 222 SHDLDEAMRI-----GDRIAIMQN--GEVVQVGTPDEILNNPANDYVRTFFRGVDISQVFSAKDIARRTPNGLIRKTPGF 294 (400) T ss_pred CCCHHHHHHH-----CCEEEEEEC--CEEEEECCHHHHHHCCCCHHHHHHHCCCCHHHCEEHHHHHCCCCCCEEECCCCC T ss_conf 9999999986-----999999989--889997288999867998689987554778770239896246875213148886 Q ss_pred CHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHHHHHHHH Q ss_conf 9899999885038627775228674256525999999870664457888631288349859987001199999982 Q gi|255764470|r 386 SVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVRYASKLF 461 (523) Q Consensus 386 ~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~el~~~l 461 (523) ....+++.|++++...++|+|+.+...|+||.+|+-...-.. ..-.... .+....++..+++.++-..+ T Consensus 295 ~~~~al~~m~~~~~~~~~vvd~~~~~~G~v~~~~~~~~~~~~-----~~~~~~~--~~~~~~v~~~~~l~~~~~~~ 363 (400) T PRK10070 295 GPRSALKLLQDEDREYGYVIERGNKFVGAVSIDSLKAALTQQ-----QGLDAAL--IDAPLAVDAQTPLSELLSHV 363 (400) T ss_pred CHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHHHCC-----CCHHHHH--HCCCCCCCCCCCHHHHHHHH T ss_conf 999999999855986799986998088998899999776337-----7636675--05884239999899999999 No 193 >COG0517 FOG: CBS domain [General function prediction only] Probab=95.69 E-value=0.035 Score=34.92 Aligned_cols=53 Identities=26% Similarity=0.408 Sum_probs=27.9 Q ss_pred EEEEEEEECCEEEEEECCCCHHHHHHHHHH-CCCCEEEEECCCCCCEEEEEEHHHH Q ss_conf 674653400028996089998999999985-7997888876785527888888887 Q gi|255764470|r 305 PAKSIMTPRTEIVWLDVNCVDEDLQWKILE-LGHSRFPVAQGSLDSFIGIVSARDL 359 (523) Q Consensus 305 ~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~-~~~SR~PV~~~~~D~iiGiv~~kDL 359 (523) ++.++|++ ...+++.+.+.+++.+.+.+ .+++++||.+++..+++|+++.+|+ T Consensus 63 ~v~~i~~~--~~~~~~~~~~~~~~~~~m~~~~~~~~lpVv~~~~~~lvGivt~~di 116 (117) T COG0517 63 PVKEVMTK--PVVTVDPDTPLEEALELMVERHKIRRLPVVDDDGGKLVGIITLSDI 116 (117) T ss_pred HHHHHHCC--CCEEECCCCCHHHHHHHHHHHCCCCEEEEEECCCCEEEEEEEHHHC T ss_conf 17886258--8467779988999999998726776256998898879989887874 No 194 >cd04591 CBS_pair_EriC_assoc_euk_bac This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes and bacteria. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS Probab=95.64 E-value=0.039 Score=34.62 Aligned_cols=50 Identities=20% Similarity=0.341 Sum_probs=43.9 Q ss_pred EEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHH Q ss_conf 534000289960899989999999857997888876785527888888887620 Q gi|255764470|r 309 IMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRD 362 (523) Q Consensus 309 IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~ 362 (523) +|.+ .-.+++.+++++++.+.+.+.+.+++||.++ +.++|++..||++++ T Consensus 56 ~m~~--~p~tv~~~~~l~~~~~lf~~~g~r~l~Vv~~--g~lvGiITr~Dl~~a 105 (105) T cd04591 56 YIDP--SPFTVSPRTSLEKVHQLFRKLGLRHLLVVDE--GRLVGIITRKDLLKA 105 (105) T ss_pred HCCC--CCEEECCCCCHHHHHHHHHHHCCCEEEEEEC--CEEEEEEEHHHHHCC T ss_conf 5879--9839899995999999999949958899579--999999998995349 No 195 >pfam00478 IMPDH IMP dehydrogenase / GMP reductase domain. This family is involved in biosynthesis of guanosine nucleotide. Members of this family contain a TIM barrel structure. In the inosine monophosphate dehydrogenases 2 CBS domains pfam00571 are inserted in the TIM barrel. This family is a member of the common phosphate binding site TIM barrel family. Probab=95.60 E-value=0.094 Score=31.83 Aligned_cols=61 Identities=13% Similarity=0.210 Sum_probs=54.4 Q ss_pred EEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHC Q ss_conf 66746534000289960899989999999857997888876785527888888887620202 Q gi|255764470|r 304 RPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLE 365 (523) Q Consensus 304 ~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~ 365 (523) .++.+.|+|+.++++...+.+.+++.+.+.+++..++||.+++ ++++|++..||+...... T Consensus 141 ~~~~~~~~~~~~lvt~~~~~~~~ea~~~l~~~kieklpvvd~~-g~L~glit~kDi~k~~~~ 201 (467) T pfam00478 141 SKVSVVMTMTEDLVTAPEGITLEEANEILHEHKIEKLPIVDDD-GELVGLITRKDIEKARDY 201 (467) T ss_pred CCCCEEECCCCCEEEECCCCCHHHHHHHHHHHHHHHCCEECCC-CCEEEEEEHHHHHHHHCC T ss_conf 4320110135542772488998999999997556415444678-837888874346774207 No 196 >cd04590 CBS_pair_CorC_HlyC_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the CorC_HlyC domain. CorC_HlyC is a transporter associated domain. This small domain is found in Na+/H+ antiporters, in proteins involved in magnesium and cobalt efflux, and in association with some proteins of unknown function. The function of the CorC_HlyC domain is uncertain but it might be involved in modulating transport of ion substrates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, Probab=95.41 E-value=0.067 Score=32.87 Aligned_cols=81 Identities=16% Similarity=0.135 Sum_probs=54.1 Q ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECCC-CCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHH Q ss_conf 74352389998999998850386277752286-74256525999999870664457888631288349859987001199 Q gi|255764470|r 377 KPLVVHENISVLKLMERLRKSSQTFVMVLDEY-GVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVR 455 (523) Q Consensus 377 ~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~-G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~ 455 (523) .+.+++.+.++.++++.+++++..--.|.|+. +.+.|+++..|++.....+-. + ..+...-..-+.++..+++. T Consensus 2 ~v~~i~~~~s~~e~~~~~~~~~~sr~PV~~~~~~~ivGil~~kDll~~~~~~~~---~--~~~~~~~~~~~~v~~~~~l~ 76 (111) T cd04590 2 DIVALDADDTLEEILELIAESGHSRFPVYDGDLDNIIGVVHVKDLLRALAEGEE---D--LDLRDLLRPPLFVPESTPLD 76 (111) T ss_pred CEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCEEEEEEEHHHHHHHHCCCC---C--CCHHHHCCCCEECCHHHHHH T ss_conf 689996999999999999977997799995478877899991144566652787---6--40776404422225566588 Q ss_pred HHHHHHC Q ss_conf 9999828 Q gi|255764470|r 456 YASKLFG 462 (523) Q Consensus 456 el~~~l~ 462 (523) ++-+.+. T Consensus 77 ~~l~~~~ 83 (111) T cd04590 77 DLLEEMR 83 (111) T ss_pred HHHHHHH T ss_conf 9998435 No 197 >COG2524 Predicted transcriptional regulator, contains C-terminal CBS domains [Transcription] Probab=95.36 E-value=0.087 Score=32.05 Aligned_cols=124 Identities=16% Similarity=0.157 Sum_probs=81.5 Q ss_pred HHHHCCCCEEEEECC--CCCCEEEE--EEHHHHHHHHHCCCCCHHHHH-HCCCEEECCCCCHHHHHHHHHHCCCEEEEEE Q ss_conf 998579978888767--85527888--888887620202466016887-3274352389998999998850386277752 Q gi|255764470|r 331 KILELGHSRFPVAQG--SLDSFIGI--VSARDLLRDLLEEGSMNFKRS-IRKPLVVHENISVLKLMERLRKSSQTFVMVL 405 (523) Q Consensus 331 ~i~~~~~SR~PV~~~--~~D~iiGi--v~~kDLl~~~~~~~~~~~~~l-ir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVv 405 (523) ++--+|++|+-|... ..|+.-+. +.+.. ...-++.+..++ .++++.+..++++-++-..|-+++.+-|.|+ T Consensus 134 rVGPtP~~klvv~G~V~g~Dd~~~~ilidi~~----m~siPk~~V~~~~s~~~i~v~~d~tl~eaak~f~~~~i~GaPVv 209 (294) T COG2524 134 RVGPTPVNKLVVEGKVIGRDDTANEILIDISK----MVSIPKEKVKNLMSKKLITVRPDDTLREAAKLFYEKGIRGAPVV 209 (294) T ss_pred EECCCCCCEEEEEEEEECCCCCCCEEEEEEEE----EEECCCCHHHHHCCCCCEEECCCCCHHHHHHHHHHCCCCCCCEE T ss_conf 87775442489986996245667769998743----26457302666266784676688639999999997286678510 Q ss_pred CCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHHHHHHHHCC Q ss_conf 2867425652599999987066445788863128834985998700119999998288 Q gi|255764470|r 406 DEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVRYASKLFGV 463 (523) Q Consensus 406 DE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~el~~~l~~ 463 (523) |.. .+.|++|+.|+.++++-- +.+-...++- ..+-..+++...+.|.-+.++. T Consensus 210 d~d-k~vGiit~~dI~~aia~g--~~~~kV~~~M--~k~vitI~eDe~i~dAir~M~~ 262 (294) T COG2524 210 DDD-KIVGIITLSDIAKAIANG--NLDAKVSDYM--RKNVITINEDEDIYDAIRLMNK 262 (294) T ss_pred CCC-CEEEEEEHHHHHHHHHCC--CCCCCHHHHH--CCCCCEECCCHHHHHHHHHHHH T ss_conf 289-458999889999999768--9654388875--0577047573049999999876 No 198 >TIGR01303 IMP_DH_rel_1 IMP dehydrogenase family protein; InterPro: IPR005991 This family of proteins, often annotated as a putative IMP dehydrogenase, are related to IMP dehydrogenase and GMP reductase and restricted to the high GC Gram-positive bacteria.. Probab=95.34 E-value=0.083 Score=32.21 Aligned_cols=105 Identities=18% Similarity=0.174 Sum_probs=87.8 Q ss_pred EEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHHH-HHHCCCEEECCCCCHHHHHHHH Q ss_conf 899608999899999998579978888767855278888888876202024660168-8732743523899989999988 Q gi|255764470|r 316 IVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNFK-RSIRKPLVVHENISVLKLMERL 394 (523) Q Consensus 316 i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~-~lir~~~~Vpe~~~l~~lL~~f 394 (523) =+.+..+.++.++...+.+.-|.---|.+++ .+++|++.-+|+...-.-. .+. -..+..+.+|+++.-.++++.+ T Consensus 98 P~~~~P~~~v~~a~~l~~krah~~~~v~~~~-~~P~G~v~~~~~~G~d~ft---~~~~~~~~~~~~~~~~~~P~~~f~~l 173 (476) T TIGR01303 98 PITLAPHDTVADALALIHKRAHGAAVVVEED-GKPVGVVTDKDLEGVDRFT---QVEEVMSTDLVTLPADLEPREAFDLL 173 (476) T ss_pred CEEECCCHHHHHHHHHHHHHCCCEEEEEECC-CCEEEEEECCCCCCHHHHH---HHHHHHHHHHEEECCCCCHHHHHHHH T ss_conf 4165462338999998753114317898169-8126788515310004567---88877754320203678888999998 Q ss_pred HHCCCEEEEEECCCCCEEEEEEHHHHHHHH Q ss_conf 503862777522867425652599999987 Q gi|255764470|r 395 RKSSQTFVMVLDEYGVLEGMITPANILEAI 424 (523) Q Consensus 395 r~~~~~lAiVvDE~G~~~GIVTleDIlE~I 424 (523) .+....+|.++|..|...|++|..--+.+- T Consensus 174 ~~~~~~~a~~~~~dG~l~G~l~r~Ga~ra~ 203 (476) T TIGR01303 174 EEASRKLAPVVDADGSLAGILTRTGALRAT 203 (476) T ss_pred HHHHHHHHHHHHCCCCEEEEEECCCCCEEE T ss_conf 741012011110577255544114300000 No 199 >COG1971 Predicted membrane protein [Function unknown] Probab=95.25 E-value=0.27 Score=28.47 Aligned_cols=179 Identities=19% Similarity=0.296 Sum_probs=94.6 Q ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH Q ss_conf 99999999999744399-99999851799899999999999999999999999999999974247-99986345322248 Q gi|255764470|r 14 GLATLIALELVLGIDNL-IFITLLVEKLPLAQRGKALVFGLTFAMVTRIALLASLSYWIVMLQQP-LFFLKGLSFSGRDI 91 (523) Q Consensus 14 ~l~tl~~lEivLs~DN~-i~iai~~~~Lp~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l~~p-l~~i~~~~~s~~~l 91 (523) .+.++.++-+.+|.||- +-++.=.++.+++. +.++..|+....+.-+-.+. -|..+...+ +...+ T Consensus 2 ~~~sllllA~alsmDAFav~l~~G~~~~k~~~-~~~L~ia~~fG~f~~i~pli---G~~~g~~~s~~i~~~--------- 68 (190) T COG1971 2 NIISLLLLAIALSMDAFAVSLGKGLAKHKIRF-KEALVIALIFGVFQAIMPLI---GWFIGKFLSTFIAEW--------- 68 (190) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHH--------- T ss_conf 58999999999721699999983156425108-99999999999999999999---999999999999999--------- Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH-CHHHHHHHHHHHH Q ss_conf 99999999999986555530257642111122233209-999999999999985778999998541-4699999999999 Q gi|255764470|r 92 VLILGGFFLLFKGTIELHERLEGDGFDKKHKFFSPVSW-QVIVLQIVILDLVFSLDSVVTAIGMVQ-HFSVMAIAVAVSA 169 (523) Q Consensus 92 ill~GG~fLl~k~~~el~~~~e~~~~~~~~~~~~~~~~-~~~v~~I~v~D~vFSlDsVitavg~~~-~~~i~~~a~~isi 169 (523) =...|+..|.+.+.+.+++...+++++..+...+ ..+ |. ...++-++-|+|+-..-++++. +..+...++.+++ T Consensus 69 ~~wigf~lL~~lG~~mI~e~f~~~~~~~~~~~~~-~~~~~~---~~~~laiatSidal~vG~~~a~lgv~i~~~av~iG~ 144 (190) T COG1971 69 AHWIGFVLLIILGLKMIIEGFKNEEDEFVDPAEK-HDLNFK---ELILLAIATSIDALAVGVGLAFLGVNILLAAVAIGL 144 (190) T ss_pred HHHHHHHHHHHHHHHHHHHHHCHHHHHHHCCCCC-CHHHHH---HHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 9999999999999999999803543102135200-103499---999999999998888753399742369999999999 Q ss_pred --HHHHHHH----HHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf --9999998----99999998485289999999999999999977457 Q gi|255764470|r 170 --LMMMAVS----QPMIRYISRHTTVVILCLGFLLMIGFLLIIEGLHF 211 (523) Q Consensus 170 --~~m~~~s----~~i~~~i~~~p~~~~l~l~fl~~iG~~L~~eg~~~ 211 (523) .+|.+.. +.+.+|..+|- ..++---|..+|...+.|.+++ T Consensus 145 ~T~il~~~G~~IG~~~g~~~g~~a--e~lgGiiLI~~G~~iL~~~~~~ 190 (190) T COG1971 145 ITLILSALGAIIGRKLGKFLGKYA--EILGGIILIGIGVKILLEHLSF 190 (190) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHCCC T ss_conf 999999999999999998872399--9998899999999999976039 No 200 >COG3448 CBS-domain-containing membrane protein [Signal transduction mechanisms] Probab=95.05 E-value=0.016 Score=37.35 Aligned_cols=62 Identities=27% Similarity=0.391 Sum_probs=33.7 Q ss_pred CCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHH Q ss_conf 13786674653400028996089998999999985799788887678552788888888762020 Q gi|255764470|r 300 TLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLL 364 (523) Q Consensus 300 ~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~ 364 (523) .+..-+|+.+|+| .+.++..+++..++.-.+.+.++.-+||.++. ..++|++.-.|++.... T Consensus 311 ~~~~~~vk~imt~--~v~tv~pdtpa~~lvp~lad~g~H~lpvld~~-g~lvGIvsQtDliaal~ 372 (382) T COG3448 311 FLRPPTVKGIMTT--PVVTVRPDTPAVELVPRLADEGLHALPVLDAA-GKLVGIVSQTDLIAALY 372 (382) T ss_pred CCCCCCCCCCCCC--CCEEECCCCCHHHHHHHHHCCCCCEEEEECCC-CCEEEEEEHHHHHHHHH T ss_conf 1599863443457--63353589708888887613773136688688-83888862788999999 No 201 >COG0861 TerC Membrane protein TerC, possibly involved in tellurium resistance [Inorganic ion transport and metabolism] Probab=94.88 E-value=0.34 Score=27.76 Aligned_cols=124 Identities=17% Similarity=0.123 Sum_probs=53.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCC--C-CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHH Q ss_conf 9999999999998655553025764211--1-122233209999999999999985778999998541469999999999 Q gi|255764470|r 92 VLILGGFFLLFKGTIELHERLEGDGFDK--K-HKFFSPVSWQVIVLQIVILDLVFSLDSVVTAIGMVQHFSVMAIAVAVS 168 (523) Q Consensus 92 ill~GG~fLl~k~~~el~~~~e~~~~~~--~-~~~~~~~~~~~~v~~I~v~D~vFSlDsVitavg~~~~~~i~~~a~~is 168 (523) ++...|.+|+|++.+.+.+..+...+.. . +.......+. .+..+ ....+|..+.-+.++ T Consensus 82 l~~~fg~~L~~~~~~ll~~~~~~~~k~~~~~~~~~~~~~~~~---------------~~~~~---~~~~f~~ai~~I~i~ 143 (254) T COG0861 82 LLYIFGLYLLWRDIKLLLGGLFLLFKATKELHERLEGEEFFV---------------NGKLK---KATPFWGAIIQIELA 143 (254) T ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCC---------------CCCCC---CCCCHHHHHHHHHHH T ss_conf 999999999999999972430377788988765305630111---------------34321---268599999999999 Q ss_pred HHHHHHHHHHHHHHHHHCCHHHHHHHHHH---HHHHHHHHHHHHCCCCCCHHHHHH----HHHHHHHHHHHH Q ss_conf 99999998999999984852899999999---999999999774573370278899----899999999989 Q gi|255764470|r 169 ALMMMAVSQPMIRYISRHTTVVILCLGFL---LMIGFLLIIEGLHFDIPKGYLYAS----IGFSGIIEFFNQ 233 (523) Q Consensus 169 i~~m~~~s~~i~~~i~~~p~~~~l~l~fl---~~iG~~L~~eg~~~~ipk~y~y~~----~~fs~~ve~~n~ 233 (523) =.+|-+=|=+-.-=+..||.+.+.|..|= +.-+..++++=.+-|-.=.|+-++ +|+++++|..+. T Consensus 144 D~vFSlDSV~Aa~g~~~~~~im~~a~i~aI~~m~~aa~~l~~ll~r~p~l~~~~~~iL~~IG~kli~~~~~~ 215 (254) T COG0861 144 DLVFSLDSVIAAVGMAGHPFVMVTAVIFAILVMRFAAFLLARLLERHPTLKYLALVILLFIGVKLILEGLAH 215 (254) T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 999865389999999468559999999999999999999999999863899999999999999999960101 No 202 >COG4109 Predicted transcriptional regulator containing CBS domains [Transcription] Probab=94.34 E-value=0.082 Score=32.24 Aligned_cols=110 Identities=19% Similarity=0.200 Sum_probs=64.6 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHH Q ss_conf 27435238999899999885038627775228674256525999999870664457888631288349859987001199 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVR 455 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~ 455 (523) ..+.|..++.++.+-++.-.+.+..--.|+|++-.+.|+||..|+... .+..+--.-...+-..+.+.+++. T Consensus 198 ~~~~yL~~~d~v~d~~~l~~kt~~sRfPVvn~~~kvvGvVt~rDv~~~--------~~~t~ieKVMtknp~tv~~~tsVA 269 (432) T COG4109 198 EDTSYLRETDTVEDWLDLVEKTGHSRFPVVNRSMKVVGVVTMRDVLDK--------KPSTTIEKVMTKNPITVRAKTSVA 269 (432) T ss_pred CCCEECCCCCCHHHHHHHHHHCCCCCCCEECCCCEEEEEEEEHHHHCC--------CCCCCHHHHHCCCCEEECCCCHHH T ss_conf 343232464509999999997198734133144269999983464447--------998617777426975542522688 Q ss_pred HHHHHHCCC----CCCCCCCCCCHHHHH--------HHHHCCCCCCCCEEE Q ss_conf 999982889----999877824078999--------988312488998899 Q gi|255764470|r 456 YASKLFGVN----LVDEDDRYSTLAGFI--------LWRLGHLPQEKEIFT 494 (523) Q Consensus 456 el~~~l~~~----l~~e~~~~~TlaGli--------l~~l~~iP~~Ge~i~ 494 (523) .......|+ +|-- ++-.|+.|.+ +...++-|++|+++. T Consensus 270 svaq~MiwE~iem~PVv-~~n~~llGiitR~dvlk~lq~~q~qpqvget~~ 319 (432) T COG4109 270 SVAQMMIWEGIEMLPVV-DSNNTLLGIITRQDVLKSLQMIQRQPQVGETIS 319 (432) T ss_pred HHHHHHHHCCCEEEEEE-CCCCEEEEEEEHHHHHHHHHHHCCCCCCCCCHH T ss_conf 99888876455265688-378618989889999999987512886554477 No 203 >COG2239 MgtE Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism] Probab=93.70 E-value=0.12 Score=31.16 Aligned_cols=60 Identities=20% Similarity=0.297 Sum_probs=30.0 Q ss_pred CEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHC Q ss_conf 866746534000289960899989999999857997888876785527888888887620202 Q gi|255764470|r 303 DRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLE 365 (523) Q Consensus 303 e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~ 365 (523) +.+++++|.++ ++.++.+++.+++.+.+.++...-.||.+++ ...+|++..-|++....+ T Consensus 195 ~~~i~~im~~~--~~~V~~~~dqeevA~~~~~ydl~a~PVVd~~-~~LiG~itiDDiidvi~e 254 (451) T COG2239 195 DELLKDLMEDD--VVSVLADDDQEEVARLFEKYDLLAVPVVDED-NRLIGIITIDDIIDVIEE 254 (451) T ss_pred HHHHHHHHCCC--CEEECCCCCHHHHHHHHHHHCCEECCEECCC-CCEEEEEEHHHHHHHHHH T ss_conf 76899872435--6243655787999999998287015357789-846325549999999999 No 204 >PRK10520 rhtB homoserine/homoserine lactone efflux protein; Provisional Probab=92.53 E-value=0.85 Score=24.83 Aligned_cols=94 Identities=14% Similarity=0.146 Sum_probs=52.0 Q ss_pred CHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8789999999999-999974439999999851799899999999999999999999999999999974247999863453 Q gi|255764470|r 8 DYYVWIGLATLIA-LELVLGIDNLIFITLLVEKLPLAQRGKALVFGLTFAMVTRIALLASLSYWIVMLQQPLFFLKGLSF 86 (523) Q Consensus 8 ~p~~w~~l~tl~~-lEivLs~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l~~pl~~i~~~~~ 86 (523) |-+.|.+|+...+ +-+.=|-||+.+++.-.++= .++--.-.+|...+...=..+.......+..-...++ T Consensus 2 ~l~~~l~f~~~~~~~~itPGP~~~~vi~~~~~~G--~r~~~~~~~G~~~G~~~~~~la~~Gl~~l~~~~p~~f------- 72 (205) T PRK10520 2 TLEWWLAYLLTSIILSLSPGSGAINTMSTSLSHG--YRGAVASIAGLQTGLAIHIVLVGVGLGALFSQSLLAF------- 72 (205) T ss_pred CHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------- T ss_conf 8999999999999996298688999999998858--8989999999999999999999999999999789999------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 222489999999999998655553025 Q gi|255764470|r 87 SGRDIVLILGGFFLLFKGTIELHERLE 113 (523) Q Consensus 87 s~~~lill~GG~fLl~k~~~el~~~~e 113 (523) ..+.+.|+.||+|.+++-.+.+.. T Consensus 73 ---~~lk~~Ga~YLi~Lg~~~~r~~~~ 96 (205) T PRK10520 73 ---EVLKWAGAAYLIWLGIQQWRAAGA 96 (205) T ss_pred ---HHHHHHHHHHHHHHHHHHHHCCCC T ss_conf ---999999999999999999852566 No 205 >KOG2550 consensus Probab=92.33 E-value=0.17 Score=29.90 Aligned_cols=62 Identities=13% Similarity=0.170 Sum_probs=54.2 Q ss_pred CCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHH Q ss_conf 13786674653400028996089998999999985799788887678552788888888762020 Q gi|255764470|r 300 TLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLL 364 (523) Q Consensus 300 ~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~ 364 (523) +=....+.++|||+. ++.+.+.++++.-+++.++.--++||.+++ ||.+.++...||.+... T Consensus 167 ~~~~~~~~~vmt~~~--~~~~~gi~l~~~neiL~~~kkGkl~iv~~~-gelva~~~rtDl~k~~~ 228 (503) T KOG2550 167 EDNSLLVSDVMTKNP--VTGAQGITLKEANEILKKIKKGKLPVVDDK-GELVAMLSRTDLMKNRD 228 (503) T ss_pred HCCCCHHHHHHCCCC--CCCCCCCCHHHHHHHHHHHHCCCCCEECCC-CCEEEEEEHHHHHHHCC T ss_conf 345511243303443--014444667788999876314865234367-76233433345665027 No 206 >PRK10062 hypothetical protein; Provisional Probab=91.48 E-value=1.1 Score=24.01 Aligned_cols=119 Identities=19% Similarity=0.238 Sum_probs=60.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC-------CC----HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-CH Q ss_conf 89999999999998655553025764211112223-------32----09999999999999985778999998541-46 Q gi|255764470|r 91 IVLILGGFFLLFKGTIELHERLEGDGFDKKHKFFS-------PV----SWQVIVLQIVILDLVFSLDSVVTAIGMVQ-HF 158 (523) Q Consensus 91 lill~GG~fLl~k~~~el~~~~e~~~~~~~~~~~~-------~~----~~~~~v~~I~v~D~vFSlDsVitavg~~~-~~ 158 (523) .++.+||.||.|-+..-++|++.++.+++...... .. .=-..+..=+-+|.+.|-.=+.-+.|-.. .. T Consensus 89 plLmlGG~yLcfEGaEKv~h~~~~~~~~~~~~~~~~~~~~~~~~d~~~~E~~kikgAIrTDfILSaEImvIaL~~V~~~~ 168 (303) T PRK10062 89 PLLMIGGAFLCFEGVEKVLHMLEARKHKEDPAQRQQRLEKLAAQDPLKFEKDKVKGAIRTDFILSAEIVAITLGIVAEAP 168 (303) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH T ss_conf 99996458967702999999973665534435565543124545567889998763778747886999999850135641 Q ss_pred ------HHHHHHHHHHHHHHHHH--------------------HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf ------99999999999999999--------------------8999999984852899999999999999999774 Q gi|255764470|r 159 ------SVMAIAVAVSALMMMAV--------------------SQPMIRYISRHTTVVILCLGFLLMIGFLLIIEGL 209 (523) Q Consensus 159 ------~i~~~a~~isi~~m~~~--------------------s~~i~~~i~~~p~~~~l~l~fl~~iG~~L~~eg~ 209 (523) .+.++|+++.+.+.-.. ++.+.+.+-+.--..+=.|++...+-+.++.-|. T Consensus 169 ~~~q~~vL~~Vai~iTv~VYG~ValIVKmDD~Gl~L~~~~~~~~~~~Gr~Lv~~mP~lmk~LsvvGT~AMl~VGG~I 245 (303) T PRK10062 169 LLNQVLVLSGIALVVTVGVYGLVGVIVKIDDLGYWLAEKSSALAQALGKGLLIIAPWLMKALSIVGTLAMFLVGGGI 245 (303) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHEEEEEHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEE T ss_conf 99999999999999999998474137874314589885744289999999999999999999999999999946535 No 207 >pfam05661 DUF808 Protein of unknown function (DUF808). This family consists of several bacterial proteins of unknown function. Probab=90.92 E-value=1.2 Score=23.63 Aligned_cols=120 Identities=20% Similarity=0.252 Sum_probs=60.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC---CC-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHC--H Q ss_conf 89999999999998655553025764211112223---32-------099999999999999857789999985414--6 Q gi|255764470|r 91 IVLILGGFFLLFKGTIELHERLEGDGFDKKHKFFS---PV-------SWQVIVLQIVILDLVFSLDSVVTAIGMVQH--F 158 (523) Q Consensus 91 lill~GG~fLl~k~~~el~~~~e~~~~~~~~~~~~---~~-------~~~~~v~~I~v~D~vFSlDsVitavg~~~~--~ 158 (523) .++.+||.||.|-+...++|+..++++++++.... .. .=-..+..=+-+|.+.|-.=+.-+.|-..+ + T Consensus 89 plLmlGG~yLcfEGaEKv~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~E~~kVkgAIrTDfILSaEImvIaL~~v~~~~~ 168 (293) T pfam05661 89 PLLMLGGLFLCFEGFEKLAHKFLHHEDAEEHEERLEALADPAVDLVAFEKDKIKGAIRTDFILSAEIIVIALGTVAGASF 168 (293) T ss_pred HHHHHHHHHHHHCCHHHHHHHHCCCCCCCCHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH T ss_conf 99997579988721999999980676432204454202554445778899987637787468869999998401246409 Q ss_pred -----HHHHHHHHHHHHHHHHH--------------------HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf -----99999999999999999--------------------89999999848528999999999999999997745 Q gi|255764470|r 159 -----SVMAIAVAVSALMMMAV--------------------SQPMIRYISRHTTVVILCLGFLLMIGFLLIIEGLH 210 (523) Q Consensus 159 -----~i~~~a~~isi~~m~~~--------------------s~~i~~~i~~~p~~~~l~l~fl~~iG~~L~~eg~~ 210 (523) .+.+.|+.+.+.+.-.. ++.+.+.+=|.--..+=.|++...+-+.++.-|.= T Consensus 169 ~~q~~vL~~Vai~iT~~VYG~ValIVKmDD~Gl~L~~~~~~~~~~~GrgLV~~mP~lmk~LsvvGT~AMlwVGG~Ii 245 (293) T pfam05661 169 LTQVLVLSGIALLMTVGVYGLVAGIVKLDDLGLYLSQKSGALARKLGRGLLAAAPWLMKSLSVVGTAAMFLVGGGIL 245 (293) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHEEEEEHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEE T ss_conf 99999999999999999985752278741257899857553999999899999999999999999999999454367 No 208 >pfam01810 LysE LysE type translocator. This family consists of various hypothetical proteins and an l-lysine exporter LysE from Corynebacterium glutamicum which is proposed to be the first of a novel family of translocators. LysE exports l-lysine from the cell into the surrounding medium and is predicted to span the membrane six times. The physiological function of the exporter is to excrete excess l-Lysine as a result of natural flux imbalances or peptide hydrolysis; and also after artificial deregulation of l-Lysine biosynthesis as used by the biotechnology. industry for the production of l-lysine. Probab=90.68 E-value=1.3 Score=23.48 Aligned_cols=79 Identities=19% Similarity=0.162 Sum_probs=41.3 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 97443999999985179989999999999999--9999999999999999974247999863453222489999999999 Q gi|255764470|r 24 VLGIDNLIFITLLVEKLPLAQRGKALVFGLTF--AMVTRIALLASLSYWIVMLQQPLFFLKGLSFSGRDIVLILGGFFLL 101 (523) Q Consensus 24 vLs~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~--A~v~R~~ll~~~~~wi~~l~~pl~~i~~~~~s~~~lill~GG~fLl 101 (523) .-|-||+.+++-..++ -++++...+++. +...=+.+.......+..-. | .....+.++|++||+ T Consensus 6 sPGP~~~~vi~~~~~~----G~~~~~~~~~G~~~g~~~~~~l~~~Gl~~l~~~~-~---------~~~~~l~~~G~~yL~ 71 (191) T pfam01810 6 PPGPVNAFVLTQSIRR----GFKHGLLVGLGAALGDAVLILLGLFGLAVLLGAS-P---------ILFTVLKLGGGAYLL 71 (191) T ss_pred CCCHHHHHHHHHHHHH----CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-H---------HHHHHHHHHHHHHHH T ss_conf 5767899999999985----8799999999999999999999999999999865-9---------999999999999999 Q ss_pred HHHHHHHHHHCCCCC Q ss_conf 998655553025764 Q gi|255764470|r 102 FKGTIELHERLEGDG 116 (523) Q Consensus 102 ~k~~~el~~~~e~~~ 116 (523) |.+++-++.+..... T Consensus 72 ylg~~~~~~~~~~~~ 86 (191) T pfam01810 72 YLGILTLRAAWSPKS 86 (191) T ss_pred HHHHHHHHHCCCCCC T ss_conf 999999982456543 No 209 >PRK10958 leucine export protein LeuE; Provisional Probab=88.59 E-value=1.8 Score=22.38 Aligned_cols=88 Identities=20% Similarity=0.271 Sum_probs=52.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8999999999999997443999999985179989999999--99999999999999999999999742479998634532 Q gi|255764470|r 10 YVWIGLATLIALELVLGIDNLIFITLLVEKLPLAQRGKAL--VFGLTFAMVTRIALLASLSYWIVMLQQPLFFLKGLSFS 87 (523) Q Consensus 10 ~~w~~l~tl~~lEivLs~DN~i~iai~~~~Lp~~~r~~a~--~~Gl~~A~v~R~~ll~~~~~wi~~l~~pl~~i~~~~~s 87 (523) +.|.=+++.+++-+.=|-||+.+++-..++= ++++. ..|+..+...-..+.......++.-...+| T Consensus 6 ~l~~F~~~~~~~~itPGP~~~~v~~~sl~~G----~~~~l~~~~G~~~G~~i~~~~a~~Gl~aLl~~~p~~f-------- 73 (213) T PRK10958 6 DLWTYLVGAIFIVLLPGPNSLYVLSTAAQRG----VKAGYRAACGVFIGDAVLMFLAAAGVASLLKATPLLF-------- 73 (213) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------- T ss_conf 9999999999997088576999999998858----8989999999999999999999999999999999999-------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 224899999999999986555530 Q gi|255764470|r 88 GRDIVLILGGFFLLFKGTIELHER 111 (523) Q Consensus 88 ~~~lill~GG~fLl~k~~~el~~~ 111 (523) ..+.+.|+.||+|.+.+-++.. T Consensus 74 --~vlk~~Ga~YLi~LGik~lrs~ 95 (213) T PRK10958 74 --NVVKYLGAAYLLYLGVKMLRAA 95 (213) T ss_pred --HHHHHHHHHHHHHHHHHHHHHH T ss_conf --9999999999999999999988 No 210 >TIGR00949 2A76 Homoserine/Threonine efflux protein; InterPro: IPR004778 Proteins in this group include, A chemotactic transduction protein from Pseudomonas aeruginosa. The homoserine/homoserine lactone efflux protein from Escherichia coli. and a number of hypothetical proteins.; GO: 0016021 integral to membrane. Probab=86.53 E-value=2.4 Score=21.55 Aligned_cols=20 Identities=30% Similarity=0.672 Sum_probs=18.4 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 48999999999999865555 Q gi|255764470|r 90 DIVLILGGFFLLFKGTIELH 109 (523) Q Consensus 90 ~lill~GG~fLl~k~~~el~ 109 (523) ..|+..||.||+|.+++.++ T Consensus 55 ~~ik~~G~aYL~ylG~~~~R 74 (194) T TIGR00949 55 TVIKYLGGAYLIYLGVKMLR 74 (194) T ss_pred HHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999998 No 211 >TIGR00393 kpsF sugar isomerase, KpsF/GutQ family; InterPro: IPR004800 This is a family of closely related proteins with the phosphosugar-binding domain SIS (Sugar ISomerase) followed by two copies of the CBS (named after Cystathionine Beta Synthase) domain. The group includes GutQ, a protein of the glucitol operon and KpsF, a virulence factor involved in capsular polysialic acid biosynthesis in some pathogenic strains of Escherichia coli.; GO: 0016853 isomerase activity, 0005975 carbohydrate metabolic process. Probab=86.42 E-value=0.3 Score=28.17 Aligned_cols=51 Identities=20% Similarity=0.285 Sum_probs=47.1 Q ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHH--HHCCC Q ss_conf 7435238999899999885038627775228674256525999999--87066 Q gi|255764470|r 377 KPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILE--AIAGD 427 (523) Q Consensus 377 ~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE--~IvGe 427 (523) +.+.|-.++++.|+|-+|.+.+-.+|+|+||.-.++|++|-=|+=. ..+|- T Consensus 166 ~lP~v~~tas~~DAL~e~~~~~LG~~~v~~~~~~~~Gv~tDGD~RR~l~~~g~ 218 (272) T TIGR00393 166 ELPLVAPTASFKDALLEMSRKRLGLAVVCDENEQLKGVFTDGDLRRVLALLGG 218 (272) T ss_pred CCCCEECCCCCEEEEEEEECCCCCEEEEEECCCCEEEEEECCHHHHHHHHHCC T ss_conf 67822236772021023313786179997155524678714659999988166 No 212 >pfam03596 Cad Cadmium resistance transporter. Probab=84.36 E-value=0.62 Score=25.81 Aligned_cols=32 Identities=19% Similarity=0.346 Sum_probs=25.0 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 74439999999851799899999999999999 Q gi|255764470|r 25 LGIDNLIFITLLVEKLPLAQRGKALVFGLTFA 56 (523) Q Consensus 25 Ls~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~A 56 (523) =.+|+.+++.+...+.+...|+|-+.+|=.+- T Consensus 4 TniDdi~vL~~fFa~~~~~~~~~~IviGqylG 35 (192) T pfam03596 4 TALDLLIILIILFAQFKQRKSHKHIYIGQYLG 35 (192) T ss_pred ECHHHHHHHHHHHHHCCCCCCCEEEEEEHHHH T ss_conf 13779999999997657889733888447877 No 213 >COG3090 DctM TRAP-type C4-dicarboxylate transport system, small permease component [Carbohydrate transport and metabolism] Probab=82.29 E-value=3.6 Score=20.28 Aligned_cols=22 Identities=23% Similarity=0.691 Sum_probs=14.5 Q ss_pred CCCCCHHHHHHHHHHHHHHHHH Q ss_conf 7337027889989999999998 Q gi|255764470|r 211 FDIPKGYLYASIGFSGIIEFFN 232 (523) Q Consensus 211 ~~ipk~y~y~~~~fs~~ve~~n 232 (523) ..+|-+|.|.++-++-..-.+- T Consensus 125 l~ip~~~~y~alpi~~~l~~l~ 146 (177) T COG3090 125 LGIPMGWVYLALPIGGVLMALR 146 (177) T ss_pred CCCCHHHHHHHHHHHHHHHHHH T ss_conf 3416889999999999999999 No 214 >COG1125 OpuBA ABC-type proline/glycine betaine transport systems, ATPase components [Amino acid transport and metabolism] Probab=81.00 E-value=3.2 Score=20.57 Aligned_cols=43 Identities=21% Similarity=0.295 Sum_probs=35.5 Q ss_pred ECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHH Q ss_conf 2389998999998850386277752286742565259999998 Q gi|255764470|r 381 VHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEA 423 (523) Q Consensus 381 Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~ 423 (523) ++....-.+++..+-..+...+-|+||.|...|.+|.+|++.. T Consensus 266 ~~~~~~~~~~ls~~~~~~~~~~~Vvd~~g~~~G~vt~~~l~~~ 308 (309) T COG1125 266 LLEGFVDRDALSDFLARGRSVLPVVDEDGRPLGTVTRADLLDE 308 (309) T ss_pred CCCCHHHHHHHHHHHHCCCCEEEEECCCCCEEEEEEHHHHHHH T ss_conf 4410445899999986687246788799967658879987523 No 215 >COG1280 RhtB Putative threonine efflux protein [Amino acid transport and metabolism] Probab=80.75 E-value=4 Score=19.90 Aligned_cols=92 Identities=17% Similarity=0.203 Sum_probs=50.0 Q ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHH--HHHH-HHHHHHHHHHHHHHHHHHHHH Q ss_conf 899999-999999999744399999998517998999999999999999999--9999-999999997424799986345 Q gi|255764470|r 10 YVWIGL-ATLIALELVLGIDNLIFITLLVEKLPLAQRGKALVFGLTFAMVTR--IALL-ASLSYWIVMLQQPLFFLKGLS 85 (523) Q Consensus 10 ~~w~~l-~tl~~lEivLs~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~A~v~R--~~ll-~~~~~wi~~l~~pl~~i~~~~ 85 (523) +.|..+ ......-+.=|=||+.+++-..++= |+.+...+++.+...= +.+. +.++. +..-.. T Consensus 4 ~~~l~~~~~~~~~~~~PGP~~~~v~~~~~~~G----~~~g~~~~~G~~~G~~~~~~l~~~Gl~a-ll~~~~--------- 69 (208) T COG1280 4 TNLLAFLLAALVLAATPGPDNLLVLARSLSRG----RRAGLATALGIALGDLVHMLLAALGLAA-LLATSP--------- 69 (208) T ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHC----HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH--------- T ss_conf 99999999999996598916999999999857----8999999999999999999999999999-999729--------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 3222489999999999998655553025764 Q gi|255764470|r 86 FSGRDIVLILGGFFLLFKGTIELHERLEGDG 116 (523) Q Consensus 86 ~s~~~lill~GG~fLl~k~~~el~~~~e~~~ 116 (523) .....+.++|+.||+|.+++-++.+.+..+ T Consensus 70 -~~f~~lk~~GaaYL~ylg~~~lra~~~~~~ 99 (208) T COG1280 70 -ALFTVLKLAGAAYLLYLGWKALRAGGAALA 99 (208) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf -999999999999999999999857664555 No 216 >TIGR00935 2a45 arsenical pump membrane protein; InterPro: IPR000802 Arsenic is a toxic metalloid whose trivalent and pentavalent ions inhibit a variety of biochemical processes. Operons that encode arsenic resistance have been found in multicopy plasmids from both Gram-positive and Gram-negative bacteria . The resistance mechanism is encoded from a single operon, which houses an anion pump. The pump has two polypeptide components: a catalytic subunit (the ArsA protein), which functions as an oxyanion-stimulated ATPase; and an arsenite export component (the ArsB protein), which is associated with the inner membrane . The ArsA and ArsB proteins are thought to form a membrane complex that functions as an anion-translocating ATPase. The ArsB protein is distinguished by its overall hydrophobic character, in keeping with its role as a membrane-associated channel. Sequence analysis reveals the presence of 13 putative transmembrane (TM) regions.; GO: 0015105 arsenite transmembrane transporter activity, 0016021 integral to membrane. Probab=79.41 E-value=4.4 Score=19.60 Aligned_cols=102 Identities=16% Similarity=0.235 Sum_probs=56.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH--HHHHHHH-HHCHH Q ss_conf 34532224899999999999986555530257642111122233209999999999999985778--9999985-41469 Q gi|255764470|r 83 GLSFSGRDIVLILGGFFLLFKGTIELHERLEGDGFDKKHKFFSPVSWQVIVLQIVILDLVFSLDS--VVTAIGM-VQHFS 159 (523) Q Consensus 83 ~~~~s~~~lill~GG~fLl~k~~~el~~~~e~~~~~~~~~~~~~~~~~~~v~~I~v~D~vFSlDs--Vitavg~-~~~~~ 159 (523) +.+.|.-..+-..+-+++.-|+. -.+.++.-+..+|..++..+..==++++|.| .+.-+|= |+-+. T Consensus 315 ~~~~s~~a~~~~l~~~~l~~~~~-----------~~~~~~~l~~apW~iv~Fs~glylvv~gL~~~GL~e~lg~~T~~~i 383 (507) T TIGR00935 315 GIPVSALAAIAALILIALAILAD-----------IIDTEKVLRRAPWQIVVFSLGLYLVVYGLKNLGLTEYLGEQTELLI 383 (507) T ss_pred CCHHHHHHHHHHHHHHHHHHHCC-----------HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHH T ss_conf 52368799999999999997501-----------6757789600772899999999999999851207887875457887 Q ss_pred HHHH---HHHHHHHHHHHHHHHHHHHHHHCC---HHHHHHHH Q ss_conf 9999---999999999999899999998485---28999999 Q gi|255764470|r 160 VMAI---AVAVSALMMMAVSQPMIRYISRHT---TVVILCLG 195 (523) Q Consensus 160 i~~~---a~~isi~~m~~~s~~i~~~i~~~p---~~~~l~l~ 195 (523) ++.. |+..|+..|.|.|...|.+||+=| ++..-+|+ T Consensus 384 l~~~~~~gL~aAil~~~~~sA~lSS~~nN~P~T~t~l~~~ls 425 (507) T TIGR00935 384 LNVLEDRGLIAAILGTGFVSAFLSSIMNNMPLTATMLVGALS 425 (507) T ss_pred HCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 403324148999999999999999987043437899999871 No 217 >pfam04459 DUF512 Protein of unknown function (DUF512). Family of uncharacterized prokaryotic proteins. Probab=78.21 E-value=4.8 Score=19.36 Aligned_cols=111 Identities=15% Similarity=0.257 Sum_probs=57.0 Q ss_pred CCCEEEEE--EHHHHHHHHHCCCCCHHHHHHCCCEEECCCCCHHHHHHHHHH----C--CCEEEEEECC-CCC---EEEE Q ss_conf 55278888--888876202024660168873274352389998999998850----3--8627775228-674---2565 Q gi|255764470|r 347 LDSFIGIV--SARDLLRDLLEEGSMNFKRSIRKPLVVHENISVLKLMERLRK----S--SQTFVMVLDE-YGV---LEGM 414 (523) Q Consensus 347 ~D~iiGiv--~~kDLl~~~~~~~~~~~~~lir~~~~Vpe~~~l~~lL~~fr~----~--~~~lAiVvDE-~G~---~~GI 414 (523) .+|-+|++ ...++......-+.. ..-++. .+--.+.+...++++-+ . ..++..|.++ ||+ +.|+ T Consensus 72 ~enGVGm~r~f~~e~~~~l~~lp~~---~~~~~~-tivTG~~a~~~l~~l~~~l~~~gl~v~v~~v~N~fFG~~vtVaGL 147 (205) T pfam04459 72 IENGVGMIRLFLDEFLEALPRLPAL---KIPKKA-TIVTGNLAAPALEPIVERLKIEGLTVNVVAVKNDFFGETITVAGL 147 (205) T ss_pred HHCCCCHHHHHHHHHHHHHHHCCCC---CCCEEE-EEEEEHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCEEEEEE T ss_conf 4245027999999999998745235---888069-998304179999999998456995699999533665785358761 Q ss_pred EEHHHHHHHHCCCCCCCCCC---CCCEEEECCCEEEEEECCCHHHHHHHHCCCC Q ss_conf 25999999870664457888---6312883498599870011999999828899 Q gi|255764470|r 415 ITPANILEAIAGDFPDEDDQ---KLDITVGDDGSLTVDGWIDVRYASKLFGVNL 465 (523) Q Consensus 415 VTleDIlE~IvGei~DE~d~---~~~i~~~~d~~~~v~G~~~l~el~~~l~~~l 465 (523) +|-+||++++-+.- -.|. +....+.+++.++ -.++++++.+.+|.+. T Consensus 148 LTg~Dii~~L~~~~--lgd~lliP~~ml~~~~~~fL--DD~t~eel~~~lg~~v 197 (205) T pfam04459 148 LTGQDIIEELDGKE--LGDTLLLPGVMLKHGEEVFL--DDMTVEELEEALGVPV 197 (205) T ss_pred EEHHHHHHHHCCCC--CCCEEEECHHHHCCCCCEEC--CCCCHHHHHHHHCCCE T ss_conf 01589999966888--89789956899578997568--9998999999839978 No 218 >KOG0476 consensus Probab=77.96 E-value=4.6 Score=19.48 Aligned_cols=60 Identities=17% Similarity=0.247 Sum_probs=39.3 Q ss_pred CCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCC-CCCEEEEEEHHHHHHH Q ss_conf 3786674653400028996089998999999985799788887678-5527888888887620 Q gi|255764470|r 301 LADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGS-LDSFIGIVSARDLLRD 362 (523) Q Consensus 301 L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~-~D~iiGiv~~kDLl~~ 362 (523) .....|+++|+ +|+.++..+.|+.|+++.+..++-.-||+.+.. .--.+|-|..+.|... T Consensus 585 ~h~v~VE~iMV--~dv~yI~k~~Ty~elre~l~~~~lR~~PlV~s~esmiLlGSV~R~~L~~l 645 (931) T KOG0476 585 VHTVKVEHIMV--TDVKYITKDTTYRELREALQTTTLRSFPLVESKESMILLGSVARRYLTAL 645 (931) T ss_pred EEEEEEEEECC--CCCEEEECCCCHHHHHHHHHHCCCCCCCCCCCCCCCEEEEHHHHHHHHHH T ss_conf 16887430012--45204533573999999997375320144358644233016379999999 No 219 >PRK10323 neutral amino-acid efflux protein; Provisional Probab=76.81 E-value=5.2 Score=19.08 Aligned_cols=113 Identities=11% Similarity=0.128 Sum_probs=57.2 Q ss_pred CHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8789999999999-999974439999999851799899999999999999999999999999999974247999863453 Q gi|255764470|r 8 DYYVWIGLATLIA-LELVLGIDNLIFITLLVEKLPLAQRGKALVFGLTFAMVTRIALLASLSYWIVMLQQPLFFLKGLSF 86 (523) Q Consensus 8 ~p~~w~~l~tl~~-lEivLs~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l~~pl~~i~~~~~ 86 (523) +|+.+.++.+-.+ .-+.=|=+|+...+.-++.= -.|.-.-..|+...+.+-..+..... .. ++.. .- T Consensus 2 ~~~ll~a~~~f~~v~siTPGPnnil~~~~g~~~G--~r~tlp~~~Gi~~G~~~~~~l~~~gl---~~----l~~~---~P 69 (195) T PRK10323 2 TPTLLSAFWTYTLITAMTPGPNNILALSSATSHG--FRQSTRVLAGMSLGFLIVMLLCAGIS---FS----LAVI---DP 69 (195) T ss_pred CHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC--HHHHHHHHHHHHHHHHHHHHHHHHHH---HH----HHHH---HH T ss_conf 8899999999999996588769999999998568--99999999889899999999999999---99----9998---88 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 22248999999999999865555302576421111222332099999999 Q gi|255764470|r 87 SGRDIVLILGGFFLLFKGTIELHERLEGDGFDKKHKFFSPVSWQVIVLQI 136 (523) Q Consensus 87 s~~~lill~GG~fLl~k~~~el~~~~e~~~~~~~~~~~~~~~~~~~v~~I 136 (523) ...+.+...|..||+|.+++-...+. .+++...+...++.+...|. T Consensus 70 ~~~~~lk~~Ga~YLlyLA~ki~~s~~----~~~~~~~~~~~F~~g~~~Q~ 115 (195) T PRK10323 70 AAVHLLSWAGAAYIVWLAWKIATSPT----KEDGLQPKPISFWASFALQF 115 (195) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCC----CCCCCCCCCHHHHHHHHHHH T ss_conf 99999999999999999999971987----65567768746999999997 No 220 >PRK09877 2,3-diketo-L-gulonate TRAP transporter small permease protein YiaM; Provisional Probab=75.73 E-value=5.5 Score=18.89 Aligned_cols=18 Identities=22% Similarity=0.451 Sum_probs=12.2 Q ss_pred CCCCCCHHHHHHHHHHHH Q ss_conf 573370278899899999 Q gi|255764470|r 210 HFDIPKGYLYASIGFSGI 227 (523) Q Consensus 210 ~~~ipk~y~y~~~~fs~~ 227 (523) ...+|-|++|.++-.|-. T Consensus 113 ~lgiPm~~~Y~a~~vsgv 130 (157) T PRK09877 113 ILGLPIGLMYAACLPTSL 130 (157) T ss_pred CCCCCHHHHHHHHHHHHH T ss_conf 118779999999999999 No 221 >KOG1077 consensus Probab=75.00 E-value=5.3 Score=19.03 Aligned_cols=31 Identities=10% Similarity=0.130 Sum_probs=17.5 Q ss_pred CCCCEEEEEEEE-----ECCEEEEEECCCCHHHHHH Q ss_conf 137866746534-----0002899608999899999 Q gi|255764470|r 300 TLADRPAKSIMT-----PRTEIVWLDVNCVDEDLQW 330 (523) Q Consensus 300 ~L~e~~V~~IMt-----PR~~i~~ld~~~s~~e~~~ 330 (523) ...+-+-+..|+ |+.-...++.+.+.-++.. T Consensus 649 ~~~s~st~~~~v~~~p~~n~t~~~~~~~~~~~di~s 684 (938) T KOG1077 649 SKRSNSTDPLSVPSPPPPNNTISSVNSQIPSVDIFS 684 (938) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHC T ss_conf 677898885547799998888667888998366543 No 222 >COG2354 Uncharacterized protein conserved in bacteria [Function unknown] Probab=75.00 E-value=5.7 Score=18.76 Aligned_cols=62 Identities=18% Similarity=0.320 Sum_probs=32.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCC----CCCCCCCCCC-----CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 99999999999986555530257----6421111222-----332099999999999999857789999985414 Q gi|255764470|r 92 VLILGGFFLLFKGTIELHERLEG----DGFDKKHKFF-----SPVSWQVIVLQIVILDLVFSLDSVVTAIGMVQH 157 (523) Q Consensus 92 ill~GG~fLl~k~~~el~~~~e~----~~~~~~~~~~-----~~~~~~~~v~~I~v~D~vFSlDsVitavg~~~~ 157 (523) ++.+||.||.|-+.-.+.|+... +++++..+.. .+..+..++. +|...|=.=+-.+-|-..+ T Consensus 90 LLmlGG~yLcFEG~EKvl~~~~~~~~~~~~~~~~~~~~~~~~Ek~~v~gAIR----TDfILSaEIm~iaL~~va~ 160 (303) T COG2354 90 LLMLGGAYLCFEGAEKVLHKFLHRKHGEDHEARPQSAEPPEHEKDKVGGAIR----TDFILSAEIMAIALAEVAE 160 (303) T ss_pred HHHHCCHHHHHCCHHHHHHHHCCCCCCCCHHHHHCCCCCHHHHHHHHCCHHE----ECEEHHHHHHHHHCCCCCC T ss_conf 9984405300310999999864775556955530347874677867333000----0110233677653046654 No 223 >TIGR03279 cyano_FeS_chp putative FeS-containing Cyanobacterial-specific oxidoreductase. Members of this protein family are predicted FeS-containing oxidoreductases of unknown function, apparently restricted to and universal across the Cyanobacteria. The high trusted cutoff score for this model, 700 bits, excludes homologs from other lineages. This exclusion seems justified because a significant number of sequence positions are simultaneously unique to and invariant across the Cyanobacteria, suggesting a specialized, conserved function, perhaps related to photosynthesis. A distantly related protein family, TIGR03278, in universal in and restricted to archaeal methanogens, and may be linked to methanogenesis. Probab=74.55 E-value=4.5 Score=19.53 Aligned_cols=66 Identities=11% Similarity=0.174 Sum_probs=42.8 Q ss_pred CEEEEEECC-CCC---EEEEEEHHHHHHHHCCC-CCCCCCCCCCEEEECCCEEEEEECCCHHHHHHHHCCCCC Q ss_conf 627775228-674---25652599999987066-445788863128834985998700119999998288999 Q gi|255764470|r 399 QTFVMVLDE-YGV---LEGMITPANILEAIAGD-FPDEDDQKLDITVGDDGSLTVDGWIDVRYASKLFGVNLV 466 (523) Q Consensus 399 ~~lAiVvDE-~G~---~~GIVTleDIlE~IvGe-i~DE~d~~~~i~~~~d~~~~v~G~~~l~el~~~l~~~l~ 466 (523) .++..|.++ ||+ +.|++|=.||++++-|. ..|.-=-+....+.+++.++ -.++++++++.++.+.. T Consensus 349 i~v~~I~N~fFG~~vtVtGLLTG~Dii~~L~~~~lgd~lliP~vmL~~g~~vfL--DD~t~~~l~~~L~~~V~ 419 (433) T TIGR03279 349 LDLHGLASDYWGQEITVTGLLTGQDLIAGLKGKDLGDGLLLPSVMLKHGELVFL--DDLTVEEVAEALGTPIR 419 (433) T ss_pred EEEEEECCCCCCCCEEEEECCCHHHHHHHHCCCCCCCEEEECHHHHCCCCCEEC--CCCCHHHHHHHCCCCEE T ss_conf 999993568889983372253389999986799889869962799468997378--98989999987099789 No 224 >PRK06769 hypothetical protein; Validated Probab=73.23 E-value=5.9 Score=18.68 Aligned_cols=44 Identities=14% Similarity=0.249 Sum_probs=38.1 Q ss_pred CCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCC Q ss_conf 99989999988503862777522867425652599999987066 Q gi|255764470|r 384 NISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGD 427 (523) Q Consensus 384 ~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGe 427 (523) --.+.+++..+++.+..+.+|.+..|--.|..|.+|+.+++-|- T Consensus 30 ~~~~~~ai~~L~~~g~~i~vvTNQsgI~rG~~t~~d~~~~l~~~ 73 (175) T PRK06769 30 FPFTKASLQKLKAKNIKIFSFTNQPGIADGIATVADFVQELKGF 73 (175) T ss_pred CCCHHHHHHHHHHCCCEEEEEECCCHHCCCCCCHHHHHHHHCCC T ss_conf 78889999999986995999968821136877999999997499 No 225 >cd04374 RhoGAP_Graf RhoGAP_Graf: GTPase-activator protein (GAP) domain for Rho-like GTPases found in GRAF (GTPase regulator associated with focal adhesion kinase); Graf is a multi-domain protein, containing SH3 and PH domains, that binds focal adhesion kinase and influences cytoskeletal changes mediated by Rho proteins. Graf exhibits GAP activity toward RhoA and Cdc42, but only weakly activates Rac1. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=73.02 E-value=4.7 Score=19.41 Aligned_cols=13 Identities=23% Similarity=0.197 Sum_probs=4.0 Q ss_pred CHHHHHHHHHCCC Q ss_conf 0789999883124 Q gi|255764470|r 474 TLAGFILWRLGHL 486 (523) Q Consensus 474 TlaGlil~~l~~i 486 (523) ++||.+-..|..+ T Consensus 88 ~VAs~LK~yLReL 100 (203) T cd04374 88 TITSALKTYLRNL 100 (203) T ss_pred HHHHHHHHHHHHC T ss_conf 9999999999838 No 226 >COG2216 KdpB High-affinity K+ transport system, ATPase chain B [Inorganic ion transport and metabolism] Probab=70.34 E-value=7.3 Score=18.01 Aligned_cols=82 Identities=13% Similarity=0.183 Sum_probs=49.6 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHHH------------------HHHCC---CEEEC Q ss_conf 9899999998579978888767855278888888876202024660168------------------87327---43523 Q gi|255764470|r 324 VDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNFK------------------RSIRK---PLVVH 382 (523) Q Consensus 324 s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~------------------~lir~---~~~Vp 382 (523) .+++..+.+.+.|-+-++||++ +.+.|++|.||+.+.....+=..++ ....+ --|+. T Consensus 414 ~l~~~~~~vs~~GGTPL~V~~~--~~~~GVI~LkDivK~Gi~ERf~elR~MgIkTvM~TGDN~~TAa~IA~EAGVDdfiA 491 (681) T COG2216 414 DLDAAVDEVSRLGGTPLVVVEN--GRILGVIYLKDIVKPGIKERFAELRKMGIKTVMITGDNPLTAAAIAAEAGVDDFIA 491 (681) T ss_pred HHHHHHHHHHHCCCCCEEEEEC--CEEEEEEEEHHHCCHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHCCHHHHH T ss_conf 8999999998558974499889--99999998643135348999999996597599980897777999998718014655 Q ss_pred CCCCHHH--HHHHHHHCCCEEEEEECC Q ss_conf 8999899--999885038627775228 Q gi|255764470|r 383 ENISVLK--LMERLRKSSQTFVMVLDE 407 (523) Q Consensus 383 e~~~l~~--lL~~fr~~~~~lAiVvDE 407 (523) |.+|-++ +.++-|.++.-.|..=|- T Consensus 492 eatPEdK~~~I~~eQ~~grlVAMtGDG 518 (681) T COG2216 492 EATPEDKLALIRQEQAEGRLVAMTGDG 518 (681) T ss_pred CCCHHHHHHHHHHHHHCCCEEEECCCC T ss_conf 289588999999887459589980799 No 227 >cd04396 RhoGAP_fSAC7_BAG7 RhoGAP_fSAC7_BAG7: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of fungal SAC7 and BAG7-like proteins. Both proteins are GTPase activating proteins of Rho1, but differ functionally in vivo: SAC7, but not BAG7, is involved in the control of Rho1-mediated activation of the PKC-MPK1 pathway. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=66.76 E-value=8.1 Score=17.66 Aligned_cols=17 Identities=24% Similarity=0.337 Sum_probs=7.2 Q ss_pred CCHHHHHHHHHCCCCCC Q ss_conf 40789999883124889 Q gi|255764470|r 473 STLAGFILWRLGHLPQE 489 (523) Q Consensus 473 ~TlaGlil~~l~~iP~~ 489 (523) +++||++-..+..+|++ T Consensus 88 h~vA~lLK~fLReLPeP 104 (225) T cd04396 88 HDAASVLRRYLNNLPEP 104 (225) T ss_pred HHHHHHHHHHHHHCCCC T ss_conf 99999999999838986 No 228 >cd04385 RhoGAP_ARAP RhoGAP_ARAP: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain present in ARAPs. ARAPs (also known as centaurin deltas) contain, besides the RhoGAP domain, an Arf GAP, ankyrin repeat ras-associating, and PH domains. Since their ArfGAP activity is PIP3-dependent, ARAPs are considered integration points for phosphoinositide, Arf and Rho signaling. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=66.31 E-value=8.7 Score=17.44 Aligned_cols=11 Identities=18% Similarity=0.244 Sum_probs=5.1 Q ss_pred HHHHHHHHHHH Q ss_conf 99999998999 Q gi|255764470|r 225 SGIIEFFNQVA 235 (523) Q Consensus 225 s~~ve~~n~~~ 235 (523) ..+++.++.++ T Consensus 128 ~~L~~~L~~V~ 138 (184) T cd04385 128 KVLIGHLYRVQ 138 (184) T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 229 >COG1968 BacA Undecaprenyl pyrophosphate phosphatase [Lipid transport and metabolism] Probab=66.27 E-value=8.7 Score=17.44 Aligned_cols=39 Identities=13% Similarity=0.315 Sum_probs=31.7 Q ss_pred HHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHH Q ss_conf 985414699999999999999999899999998485289 Q gi|255764470|r 152 IGMVQHFSVMAIAVAVSALMMMAVSQPMIRYISRHTTVV 190 (523) Q Consensus 152 vg~~~~~~i~~~a~~isi~~m~~~s~~i~~~i~~~p~~~ 190 (523) .+.+.+...+.+|.+.|..+=+++=+.+.+|++||.... T Consensus 213 ~~~~~~~~~l~vg~i~AFvv~~~~I~~ll~~i~~~~~~~ 251 (270) T COG1968 213 ALSAADLPILLVGFIVAFVVSLIAIKFLLRFIKRHSFIP 251 (270) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEE T ss_conf 000210699999999999999999999999998689702 No 230 >TIGR01497 kdpB K+-transporting ATPase, B subunit; InterPro: IPR006391 These sequences describe the P-type ATPase subunit of the complex responsible for translocating potassium ions across biological membranes in microbes. In Escherichia coli and other species, this complex consists of the proteins KdpA, KdpB, KdpC and KdpF. KdpB is the ATPase subunit, while KdpA is the potassium-ion translocating subunit . The function of KdpC is unclear, although it has been suggested to couple the ATPase subunit to the ion-translocating subunit , while KdpF serves to stabilize the complex . The potassium P-type ATPases have been characterised as Type IA based on a phylogenetic analysis which places this clade closest to the heavy-metal translocating ATPases (Type IB) . Others place this clade closer to the Na+/K+ antiporter type (Type IIC) based on physical characteristics . ; GO: 0008556 potassium-transporting ATPase activity, 0006813 potassium ion transport, 0016021 integral to membrane. Probab=61.15 E-value=8.7 Score=17.46 Aligned_cols=43 Identities=12% Similarity=0.217 Sum_probs=34.5 Q ss_pred CCCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCC Q ss_conf 999899999998579978888767855278888888876202024 Q gi|255764470|r 322 NCVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEE 366 (523) Q Consensus 322 ~~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~ 366 (523) ...+++..+++.+.|=+=+-||.++ +|.|+++.||..+..... T Consensus 411 p~dlD~av~qVa~~GgTPLvVc~dn--~iyGVIyLKDiVK~Gi~E 453 (675) T TIGR01497 411 PKDLDEAVDQVAKQGGTPLVVCVDN--KIYGVIYLKDIVKSGIKE 453 (675) T ss_pred CHHHHHHHHHHHHCCCCCEEEEECC--EEEEEEEEECCCCCCHHH T ss_conf 8376788999873289847999757--789889873013786045 No 231 >PRK09304 arginine exporter protein; Provisional Probab=60.07 E-value=11 Score=16.66 Aligned_cols=120 Identities=15% Similarity=0.096 Sum_probs=57.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHH-H--------HHHHHHHHHHHHHHHHHHHCHHH Q ss_conf 48999999999999865555302576421111222332099999999-9--------99999857789999985414699 Q gi|255764470|r 90 DIVLILGGFFLLFKGTIELHERLEGDGFDKKHKFFSPVSWQVIVLQI-V--------ILDLVFSLDSVVTAIGMVQHFSV 160 (523) Q Consensus 90 ~lill~GG~fLl~k~~~el~~~~e~~~~~~~~~~~~~~~~~~~v~~I-~--------v~D~vFSlDsVitavg~~~~~~i 160 (523) ..+...|.+||+|-++.-++....++....... ....+.+.++.+- . .+|-++=+-|+-+-.+-.+..+- T Consensus 56 ~~~~~~G~~fL~~yG~~~~rsa~~~~~~~~~~~-~~~~~~~~~i~~~lavT~LNPHvyLDTv~llGs~s~~~~~~~k~~F 134 (197) T PRK09304 56 ALVTWGGVAFLLWYGFGALKTAMSSNIELASAE-VLKQGRWKIIATMLAVTWLNPHVYLDTFVVLGSLGGQLDVEPKRWF 134 (197) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC-CCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHH T ss_conf 999999999999999999999836663333223-3322289999999999971815888899997430034780378999 Q ss_pred HHHHHHHHHH---HHHHHHHHHHHHHHHCCHHHHH---HHHHHHHHHHHHHHHHHC Q ss_conf 9999999999---9999989999999848528999---999999999999997745 Q gi|255764470|r 161 MAIAVAVSAL---MMMAVSQPMIRYISRHTTVVIL---CLGFLLMIGFLLIIEGLH 210 (523) Q Consensus 161 ~~~a~~isi~---~m~~~s~~i~~~i~~~p~~~~l---~l~fl~~iG~~L~~eg~~ 210 (523) ..-++.-|.. ..-++++.++++.++-..-+++ +-.+...+++.|.-+|+. T Consensus 135 ~~Ga~~AS~iWF~~Lg~ga~~ls~~~~~p~~wr~id~~i~~iM~~~A~~L~~~g~~ 190 (197) T PRK09304 135 ALGTISASFLWFFGLALLAAWLAPRLRTAKAQRIINLVVGCVMWFIALQLARQGIA 190 (197) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999989886884799999999999999999999999999 No 232 >PRK10054 putative MFS-type transporter YdeE; Provisional Probab=59.46 E-value=11 Score=16.59 Aligned_cols=43 Identities=7% Similarity=0.115 Sum_probs=30.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 7899999999999999744399999998517998999999999 Q gi|255764470|r 9 YYVWIGLATLIALELVLGIDNLIFITLLVEKLPLAQRGKALVF 51 (523) Q Consensus 9 p~~w~~l~tl~~lEivLs~DN~i~iai~~~~Lp~~~r~~a~~~ 51 (523) ++.|.-++...+..+--|.=+...-|++++-.|+++|+|+-.+ T Consensus 94 ~~~~~~~l~~~l~g~~~~~~~pa~~A~iaD~~~~~~R~~~f~l 136 (395) T PRK10054 94 NNVTLVVLFFALINCAYSVFSTVLKAWFADNLSSTSKTKIFSL 136 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 8899999999999999998999999999843887789899999 No 233 >KOG4269 consensus Probab=59.27 E-value=11 Score=16.61 Aligned_cols=52 Identities=21% Similarity=0.401 Sum_probs=39.6 Q ss_pred ECCCEEEEEECC-CHHHHHHHHCCC------CCCCCCCCCCHHHHHHHHHCCCCCC--CCE Q ss_conf 349859987001-199999982889------9998778240789999883124889--988 Q gi|255764470|r 441 GDDGSLTVDGWI-DVRYASKLFGVN------LVDEDDRYSTLAGFILWRLGHLPQE--KEI 492 (523) Q Consensus 441 ~~d~~~~v~G~~-~l~el~~~l~~~------l~~e~~~~~TlaGlil~~l~~iP~~--Ge~ 492 (523) .+.|-|.++|.. .+..+.+.|+-+ ..+++.+.+|+||++--.|..+|+. ||. T Consensus 937 eEeGIyRlSGsaT~Ik~Lke~Fd~~~n~di~~~d~E~dVn~IaGlLKLYlR~LP~~Ll~de 997 (1112) T KOG4269 937 EEEGIYRLSGSATDIKALKEQFDENVNKDILSMDSEMDVNAIAGLLKLYLRELPEPLLTDE 997 (1112) T ss_pred CHHCEEEECCCHHHHHHHHHHHHCCCCCHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCHH T ss_conf 0204477325077899999986012460033201200389988899999985884002504 No 234 >COG2015 Alkyl sulfatase and related hydrolases [Secondary metabolites biosynthesis, transport, and catabolism] Probab=58.71 E-value=6.5 Score=18.38 Aligned_cols=166 Identities=18% Similarity=0.225 Sum_probs=85.2 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHHHHHHCCCEEECCCCCH-HHHHHHHHHCCCEEEEE Q ss_conf 99999998579978888767855278888888876202024660168873274352389998-99999885038627775 Q gi|255764470|r 326 EDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNFKRSIRKPLVVHENISV-LKLMERLRKSSQTFVMV 404 (523) Q Consensus 326 ~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~~lir~~~~Vpe~~~l-~~lL~~fr~~~~~lAiV 404 (523) +-.++--.+..|--|-|.++ .=++.|+ |+-.-.--.++. .-++-.|++.||+... .++..+-+-++.-.|+| T Consensus 97 SLwRQaqLn~~~GLfkVtd~-iYQVRG~----DisNITfveGdt--g~IViDpL~t~~tA~aAldl~~~~~g~rPV~aVI 169 (655) T COG2015 97 SLWRQAQLNAKHGLFKVTDG-IYQVRGF----DISNITFVEGDT--GWIVIDPLVTPETAKAALDLYNQHRGQRPVVAVI 169 (655) T ss_pred HHHHHHHHHHHCCEEEECCC-EEEEECC----CCCCEEEECCCC--CEEEECCCCCCHHHHHHHHHHHHHCCCCCEEEEE T ss_conf 89999887642570562366-3576020----133227980785--2699815678288999999999765889748999 Q ss_pred -----ECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHHHHHHHHCCCCCCCCCCCCCHHHH- Q ss_conf -----22867425652599999987066445788863128834985998700119999998288999987782407899- Q gi|255764470|r 405 -----LDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVRYASKLFGVNLVDEDDRYSTLAGF- 478 (523) Q Consensus 405 -----vDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~el~~~l~~~l~~e~~~~~TlaGl- 478 (523) .|.||++.|||+..|+=.-=|--|.-+.=-+.. +.++ +..|..-.+.-.=.+|.-||..-.. +-=+|. T Consensus 170 YtHsH~DHfGGVkGiv~eadV~sGkV~iiAP~GFme~a---vaEN--vlAGnaM~RRa~YqyG~~Lp~g~~G-~V~~giG 243 (655) T COG2015 170 YTHSHSDHFGGVKGIVSEADVKSGKVQIIAPAGFMEEA---VAEN--VLAGNAMSRRAQYQYGTLLPPGAQG-QVGCGIG 243 (655) T ss_pred EECCCCCCCCCEEECCCHHHCCCCCEEEECCHHHHHHH---HHHH--HHHHHHHHHHHHHHHCCCCCCCCCC-CCCCCCC T ss_conf 61254201377100157888466741376460578999---9875--4410567666664312335888667-5040103 Q ss_pred ---------HHHHHCCCCCCCCEEEECCEEEEEEE Q ss_conf ---------99883124889988999999999999 Q gi|255764470|r 479 ---------ILWRLGHLPQEKEIFTEMNLKFEIIR 504 (523) Q Consensus 479 ---------il~~l~~iP~~Ge~i~~~g~~f~V~~ 504 (523) ++.-..-|-++||+..++|.+|+-.- T Consensus 244 k~la~G~vsLiaPT~~I~~~gE~~~iDGV~~~Fq~ 278 (655) T COG2015 244 KTLATGEVSLIAPTKIIEETGETLTIDGVEFEFQM 278 (655) T ss_pred CCCCCCCEEEECCEEEEECCCCEEEEECEEEEEEE T ss_conf 32246852564556786236725887050789962 No 235 >cd04377 RhoGAP_myosin_IX RhoGAP_myosin_IX: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain present in class IX myosins. Class IX myosins contain a characteristic head domain, a neck domain, a tail domain which contains a C6H2-zinc binding motif and a RhoGAP domain. Class IX myosins are single-headed, processive myosins that are partly cytoplasmic, and partly associated with membranes and the actin cytoskeleton. Class IX myosins are implicated in the regulation of neuronal morphogenesis and function of sensory systems, like the inner ear. There are two major isoforms, myosin IXA and IXB with several splice variants, which are both expressed in developing neurons. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell prolifer Probab=58.45 E-value=12 Score=16.47 Aligned_cols=10 Identities=10% Similarity=0.202 Sum_probs=3.4 Q ss_pred CCCHHHHHHH Q ss_conf 9998999998 Q gi|255764470|r 384 NISVLKLMER 393 (523) Q Consensus 384 ~~~l~~lL~~ 393 (523) +.+..+++.. T Consensus 163 ~~~~~~~l~~ 172 (186) T cd04377 163 TADPLQSLQD 172 (186) T ss_pred CCCHHHHHHH T ss_conf 9998999996 No 236 >pfam09269 DUF1967 Domain of unknown function (DUF1967). Members of this family contain a four-stranded beta sheet and three alpha helices flanked by an additional beta strand. They are predominantly found in the bacterial GTP-binding protein Obg, and are still functionally uncharacterized. Probab=58.23 E-value=12 Score=16.45 Aligned_cols=56 Identities=18% Similarity=0.221 Sum_probs=34.6 Q ss_pred EEEECCCEEEEEECCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCC---------C-CCCCCEEEECCEEEE Q ss_conf 288349859987001199999982889999877824078999988312---------4-889988999999999 Q gi|255764470|r 438 ITVGDDGSLTVDGWIDVRYASKLFGVNLVDEDDRYSTLAGFILWRLGH---------L-PQEKEIFTEMNLKFE 501 (523) Q Consensus 438 i~~~~d~~~~v~G~~~l~el~~~l~~~l~~e~~~~~TlaGlil~~l~~---------i-P~~Ge~i~~~g~~f~ 501 (523) +.+.++|.|.|.|.- ++.+-...+++-++ .+. ++...|++ . -+.||++.+++++|+ T Consensus 2 I~~~~~g~f~V~G~~-ier~~~~t~~~~~e------a~~-~f~~~L~~~Gv~~~L~~~G~k~GD~V~Ig~~eFe 67 (68) T pfam09269 2 IEREGDGVFVVEGPK-IERLVKMTNFDNDE------SLR-RFARILKKLGVEDALRKAGAKDGDTVRIGDFEFE 67 (68) T ss_pred EEEECCCEEEEECHH-HHHHHHHCCCCCHH------HHH-HHHHHHHHCCHHHHHHHCCCCCCCEEEEEEEEEE T ss_conf 799079559997738-99999877899899------999-9999999889789999769999799999127996 No 237 >KOG2927 consensus Probab=57.81 E-value=12 Score=16.55 Aligned_cols=14 Identities=29% Similarity=0.259 Sum_probs=6.0 Q ss_pred HHHHCCCCCHHHHH Q ss_conf 67746678989999 Q gi|255764470|r 280 TQHEKHIISDQEKD 293 (523) Q Consensus 280 ~~~~~g~l~~~E~~ 293 (523) +...+|.+++.|.+ T Consensus 313 ~~~~~~~i~~t~~e 326 (372) T KOG2927 313 EQDMSGHIDPTEKE 326 (372) T ss_pred HHHHCCCCCCCCHH T ss_conf 44420687886023 No 238 >pfam11298 DUF3099 Protein of unknown function (DUF3099). Some members in this family of proteins are annotated as membrane proteins however this cannot be confirmed. Currently no function is known. Probab=57.69 E-value=12 Score=16.38 Aligned_cols=53 Identities=19% Similarity=0.206 Sum_probs=31.1 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 98999999999999999999999999999999742479998634532224899999999999986555 Q gi|255764470|r 41 PLAQRGKALVFGLTFAMVTRIALLASLSYWIVMLQQPLFFLKGLSFSGRDIVLILGGFFLLFKGTIEL 108 (523) Q Consensus 41 p~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l~~pl~~i~~~~~s~~~lill~GG~fLl~k~~~el 108 (523) ..+++.|-+.| ...+.+|+.++..... . ...+ |-.++.+.|+++|-|.+.--- T Consensus 10 ~~d~~~R~rrY--~i~M~~R~~c~vla~~-~-------~~~~-----w~~~~~~~~av~LPwiAVviA 62 (73) T pfam11298 10 EEDRRHRMRKY--VVLMAVRIVCLILAAV-V-------YGLG-----WLALVIIAGAVPLPWIAVVIA 62 (73) T ss_pred HHHHHHHHHHH--HHHHHHHHHHHHHHHH-H-------HCCH-----HHHHHHHHHHHCCCCEEEEEC T ss_conf 88998789999--9999999999999999-8-------0207-----999999999603251467861 No 239 >pfam03904 DUF334 Domain of unknown function (DUF334). Staphylococcus aureus plasmid proteins with no characterized function. Probab=57.68 E-value=12 Score=16.38 Aligned_cols=16 Identities=13% Similarity=0.098 Sum_probs=8.9 Q ss_pred CCCCCCCHHHHHHHHH Q ss_conf 1222332099999999 Q gi|255764470|r 121 HKFFSPVSWQVIVLQI 136 (523) Q Consensus 121 ~~~~~~~~~~~~v~~I 136 (523) +..+++.+||.++|+| T Consensus 188 ~kIKaSEgf~~~lWYi 203 (229) T pfam03904 188 GKIKASEGFISFLWYI 203 (229) T ss_pred HHHHHCHHHHHHHHHH T ss_conf 8873110399999999 No 240 >cd04381 RhoGap_RalBP1 RhoGap_RalBP1: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain present in RalBP1 proteins, also known as RLIP, RLIP76 or cytocentrin. RalBP1 plays an important role in endocytosis during interphase. During mitosis, RalBP1 transiently associates with the centromere and has been shown to play an essential role in the proper assembly of the mitotic apparatus. RalBP1 is an effector of the Ral GTPase which itself is an effector of Ras. RalBP1 contains a RhoGAP domain, which shows weak activity towards Rac1 and Cdc42, but not towards Ral, and a Ral effector domain binding motif. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low int Probab=56.75 E-value=13 Score=16.28 Aligned_cols=13 Identities=31% Similarity=0.327 Sum_probs=4.0 Q ss_pred CHHHHHHHHHCCC Q ss_conf 0789999883124 Q gi|255764470|r 474 TLAGFILWRLGHL 486 (523) Q Consensus 474 TlaGlil~~l~~i 486 (523) ++||++-..+..+ T Consensus 73 ~vA~lLK~fLReL 85 (182) T cd04381 73 TVASLLKQYLREL 85 (182) T ss_pred HHHHHHHHHHHHC T ss_conf 9999999999849 No 241 >cd04402 RhoGAP_ARHGAP20 RhoGAP_ARHGAP20: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of ArhGAP20-like proteins. ArhGAP20, also known as KIAA1391 and RA-RhoGAP, contains a RhoGAP, a RA, and a PH domain, and ANXL repeats. ArhGAP20 is activated by Rap1 and induces inactivation of Rho, which in turn leads to neurite outgrowth. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=54.67 E-value=14 Score=16.04 Aligned_cols=13 Identities=8% Similarity=0.322 Sum_probs=6.3 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9999999899999 Q gi|255764470|r 225 SGIIEFFNQVARR 237 (523) Q Consensus 225 s~~ve~~n~~~~~ 237 (523) ..++..++.+++. T Consensus 125 ~~L~~~L~~V~~~ 137 (192) T cd04402 125 KHLICVLHNISQN 137 (192) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999965 No 242 >PRK04149 sat sulfate adenylyltransferase; Reviewed Probab=53.32 E-value=6.7 Score=18.29 Aligned_cols=23 Identities=17% Similarity=0.085 Sum_probs=9.7 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHCC Q ss_conf 889999877824078999988312 Q gi|255764470|r 462 GVNLVDEDDRYSTLAGFILWRLGH 485 (523) Q Consensus 462 ~~~l~~e~~~~~TlaGlil~~l~~ 485 (523) |...|+.- -.--++--+.+..+. T Consensus 360 G~~~P~~f-~rpEV~~iL~~~y~~ 382 (390) T PRK04149 360 GERPPPEF-SRPEVAEVLIKGLKK 382 (390) T ss_pred CCCCCCCC-CCHHHHHHHHHHHHH T ss_conf 89899544-728999999999997 No 243 >TIGR00849 gutA PTS system, glucitol/sorbitol-specific IIA component; InterPro: IPR004716 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) , is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC . These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose , , , . The Man family is unique in several respects among PTS permease families. It is the only PTS family in which members possess a IID protein. It is the only PTS family in which the IIB constituent is phosphorylated on a histidyl rather than a cysteyl residue. Its permease members exhibit broad specificity for a range of sugars, rather than being specific for just one or a few sugars. This family consists only of glucitol-specific transporters, and occur both in Gram-negative and Gram-positive bacteria. The system in Escherichia coli consists of a IIA protein, and a IIBC protein. This family is specific for the IIA component.; GO: 0008982 protein-N(PI)-phosphohistidine-sugar phosphotransferase activity, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system, 0005737 cytoplasm. Probab=53.01 E-value=14 Score=15.86 Aligned_cols=63 Identities=10% Similarity=0.138 Sum_probs=48.0 Q ss_pred EECCCEEEEEECCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCC-----CCCCEEEECCEEEEEEEEE Q ss_conf 834985998700119999998288999987782407899998831248-----8998899999999999961 Q gi|255764470|r 440 VGDDGSLTVDGWIDVRYASKLFGVNLVDEDDRYSTLAGFILWRLGHLP-----QEKEIFTEMNLKFEIIRLE 506 (523) Q Consensus 440 ~~~d~~~~v~G~~~l~el~~~l~~~l~~e~~~~~TlaGlil~~l~~iP-----~~Ge~i~~~g~~f~V~~~~ 506 (523) +..+|+|+ |..-+..+.+.+-+-+-+... +||+.|...+..+.- ++|..+.+++.+|+|..+- T Consensus 10 ~~~~Gryi--G~~a~~~l~~~m~ItFgegAP--~dlaeYCfi~~~~e~kgG~l~pG~~f~ig~~~ypvTAVG 77 (129) T TIGR00849 10 TVKDGRYI--GDLAITALEEGMIITFGEGAP--DDLAEYCFIHNIKEVKGGTLKPGQVFSIGEIAYPVTAVG 77 (129) T ss_pred EEECCEEH--HHHHHHHHHCCEEEEECCCCH--HHHHHHHHHHCCCCCCCCEECCCCEEEECCEECCEEEEC T ss_conf 13021101--267875532471898558981--247765554303223676006864343577531045310 No 244 >COG3290 CitA Signal transduction histidine kinase regulating citrate/malate metabolism [Signal transduction mechanisms] Probab=52.03 E-value=15 Score=15.75 Aligned_cols=44 Identities=25% Similarity=0.138 Sum_probs=29.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 99999999999974439999999851799899999999999999 Q gi|255764470|r 13 IGLATLIALELVLGIDNLIFITLLVEKLPLAQRGKALVFGLTFA 56 (523) Q Consensus 13 ~~l~tl~~lEivLs~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~A 56 (523) +.++...++=++|++=-.++.+..+..+-.+..+||+..+-..| T Consensus 14 i~~li~~~v~~~l~~~~~~~~~~~~~~~~~q~~~~Al~~Ar~vA 57 (537) T COG3290 14 ITLLIVAVVVFVLLLLTGILFVQTSDTLRDQLEHKALSTARTVA 57 (537) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999999999999989999999985 No 245 >COG0809 QueA S-adenosylmethionine:tRNA-ribosyltransferase-isomerase (queuine synthetase) [Translation, ribosomal structure and biogenesis] Probab=50.10 E-value=16 Score=15.54 Aligned_cols=175 Identities=16% Similarity=0.130 Sum_probs=86.1 Q ss_pred HHHHCCCCEEEEEEEEECCEE--EEEECC-CCHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHHHHHHCCCCCHHH Q ss_conf 975113786674653400028--996089-99899999998579978888767855278888888876202024660168 Q gi|255764470|r 296 QSVLTLADRPAKSIMTPRTEI--VWLDVN-CVDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLLRDLLEEGSMNFK 372 (523) Q Consensus 296 ~~vL~L~e~~V~~IMtPR~~i--~~ld~~-~s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl~~~~~~~~~~~~ 372 (523) -..+.|++- .+..|+.|.+- ..+.-+ .....+.+.+.+.||--+|-|=.+.++ .+|--. -+. T Consensus 105 G~~i~f~~~-l~a~v~e~~~~g~~~l~F~~~~~~~l~e~L~~~G~~PLPPYI~~~~~------~~d~~~----YQT---- 169 (348) T COG0809 105 GDEIYFGDG-LKATVLERLEHGLRLLEFDYEGIFSLLELLEKYGEMPLPPYIKRKLD------ELDRDR----YQT---- 169 (348) T ss_pred CCEEEECCC-CEEEEEEECCCCEEEEEEECCCCHHHHHHHHHCCCCCCCCCCCCCCC------CCCHHH----CEE---- T ss_conf 998994798-26999996288529999945785059999997599899813387622------124443----246---- Q ss_pred HHHCCCEE-ECCCCC---HHHHHHHHHHCCCEEEEEECCC--CCEEEEEEHHHHHHH----HCCCCCCCCCCCCCEEEEC Q ss_conf 87327435-238999---8999998850386277752286--742565259999998----7066445788863128834 Q gi|255764470|r 373 RSIRKPLV-VHENIS---VLKLMERLRKSSQTFVMVLDEY--GVLEGMITPANILEA----IAGDFPDEDDQKLDITVGD 442 (523) Q Consensus 373 ~lir~~~~-Vpe~~~---l~~lL~~fr~~~~~lAiVvDE~--G~~~GIVTleDIlE~----IvGei~DE~d~~~~i~~~~ 442 (523) -+.+.+=- -.++.. -.++|+++++.+.+++.|.=.- |.+.-+-+ +|+-|- =..+++.|.-+.-.-.+.. T Consensus 170 VYak~~GsvAAPTAGLHFt~~LL~kLk~kGv~~afvTLHVGaGTF~pV~~-~~i~eH~MH~E~~~v~~eta~~i~~~k~~ 248 (348) T COG0809 170 VYAKEPGSVAAPTAGLHFTEELLEKLKAKGVEIAFVTLHVGAGTFRPVKV-ENIEEHKMHSEYYEVPQETADAINAAKAR 248 (348) T ss_pred EEECCCCCCCCCCCCCCCCHHHHHHHHHCCCEEEEEEEEECCCCCCCCEE-CCCCCCCCCHHHEECCHHHHHHHHHHHHC T ss_conf 64258872004768777899999999977953999999964655434120-44334443055232599999999999973 Q ss_pred CCEEEEEECCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCC--CEEEECCEEEEEEEE Q ss_conf 985998700119999998288999987782407899998831248899--889999999999996 Q gi|255764470|r 443 DGSLTVDGWIDVRYASKLFGVNLVDEDDRYSTLAGFILWRLGHLPQEK--EIFTEMNLKFEIIRL 505 (523) Q Consensus 443 d~~~~v~G~~~l~el~~~l~~~l~~e~~~~~TlaGlil~~l~~iP~~G--e~i~~~g~~f~V~~~ 505 (523) .|+.++-|+++++.++..-.-+-. .|..| +-+.+.||+|++.++ T Consensus 249 GgRIiaVGTTs~R~LEsa~~~~~~-------------------~~~sg~T~IFI~PGy~~~~vD~ 294 (348) T COG0809 249 GGRIIAVGTTSVRTLESAAREAGL-------------------KAFSGWTDIFIYPGYRFKVVDA 294 (348) T ss_pred CCEEEEECCHHHHHHHHHHCCCCC-------------------CCCCCCCCEEECCCCCCEECCE T ss_conf 984999816157778877513776-------------------7676762178627985267011 No 246 >PRK08053 sulfur carrier protein ThiS; Provisional Probab=49.78 E-value=16 Score=15.51 Aligned_cols=32 Identities=16% Similarity=0.354 Sum_probs=27.7 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCEEEEEECC Q ss_conf 27435238999899999885038627775228 Q gi|255764470|r 376 RKPLVVHENISVLKLMERLRKSSQTFVMVLDE 407 (523) Q Consensus 376 r~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE 407 (523) -+|.-+++.+++.++++++.....++|+.++. T Consensus 7 ~e~~e~~~~~tl~~Ll~~l~~~~~giAvAiN~ 38 (66) T PRK08053 7 DQPMQCAAGQTVHELLEQLNQLQPGAALAINQ 38 (66) T ss_pred CEEEECCCCCCHHHHHHHHCCCCCCEEEEECC T ss_conf 98878689985999998727888858999889 No 247 >pfam11683 DUF3278 Protein of unknown function (DUF3278). This bacterial family of proteins has no known function. Probab=49.62 E-value=16 Score=15.49 Aligned_cols=78 Identities=13% Similarity=0.057 Sum_probs=45.3 Q ss_pred HHCCCCHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 8517998999999999999999999999-99999999974247999863453222489999999999998655553025 Q gi|255764470|r 36 LVEKLPLAQRGKALVFGLTFAMVTRIAL-LASLSYWIVMLQQPLFFLKGLSFSGRDIVLILGGFFLLFKGTIELHERLE 113 (523) Q Consensus 36 ~~~~Lp~~~r~~a~~~Gl~~A~v~R~~l-l~~~~~wi~~l~~pl~~i~~~~~s~~~lill~GG~fLl~k~~~el~~~~e 113 (523) ++..|.+.+|+.+=.+|=-+.+.+=..+ +.....++...+.|--..+...+..--++.+.-+.+..|+.-+..-...| T Consensus 17 I~GpLDEykrqe~~rIGN~afi~lf~~~~~~~~Ia~~l~~~~p~~va~~~~~~~il~~~l~~~~Yi~~~~~r~~l~~~E 95 (129) T pfam11683 17 ISGPLDEYKRQEANRIGNNAFIILFLILIAGSLIALFLASKYPEEVAYIYPSTILLLFTLIASGYIMYQSRKKGLTDLE 95 (129) T ss_pred CCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 9998018999999985416999999999999999999998559999999999999999999999999998772475114 No 248 >COG2119 Predicted membrane protein [Function unknown] Probab=49.20 E-value=16 Score=15.45 Aligned_cols=174 Identities=16% Similarity=0.198 Sum_probs=90.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999974439999999851799899999999999999999999999999999974247999863453222489 Q gi|255764470|r 13 IGLATLIALELVLGIDNLIFITLLVEKLPLAQRGKALVFGLTFAMVTRIALLASLSYWIVMLQQPLFFLKGLSFSGRDIV 92 (523) Q Consensus 13 ~~l~tl~~lEivLs~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l~~pl~~i~~~~~s~~~li 92 (523) +++..+..-|+ +|.--+|+++-. -+.|++-.+-|+.+|...--.+-.....|.. ..++.+.+. T Consensus 6 ~s~~~v~laEi---GDKT~lia~llA---~r~~~~~v~~g~~~a~~~m~~la~~vG~~~~-----------~~~~~~~~~ 68 (190) T COG2119 6 VSLLMVALAEI---GDKTQLIAMLLA---MRYRRWPVFAGIAIALFAMHALAVLVGHAAA-----------SLLPERPLA 68 (190) T ss_pred HHHHHHHHHHH---CCHHHHHHHHHH---HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHH-----------CCCCHHHHH T ss_conf 88899999983---528999999999---7668800379999999999999999988875-----------148445799 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HCH-HHHHHHHHHHH Q ss_conf 999999999998655553025764211112223320999999999999998577899999854--146-99999999999 Q gi|255764470|r 93 LILGGFFLLFKGTIELHERLEGDGFDKKHKFFSPVSWQVIVLQIVILDLVFSLDSVVTAIGMV--QHF-SVMAIAVAVSA 169 (523) Q Consensus 93 ll~GG~fLl~k~~~el~~~~e~~~~~~~~~~~~~~~~~~~v~~I~v~D~vFSlDsVitavg~~--~~~-~i~~~a~~isi 169 (523) ...|-.|+.+ +.+.+.+..+.+++.+ ..+...-|..+...+-+ .=+.=-+=|+.++++ +|. |.+++|-.+++ T Consensus 69 ~~~~~~Flaf-av~~l~edk~~~~e~~--~~~~~~~f~~tfi~~Fl--aE~GDKTQiATIaLaA~~~~~~~V~~Gt~lg~ 143 (190) T COG2119 69 WASGVLFLAF-AVWMLIEDKEDDEEAQ--AASPRGVFVTTFITFFL--AELGDKTQIATIALAADYHSPWAVFAGTTLGM 143 (190) T ss_pred HHHHHHHHHH-HHHHHCCCCCCCCCCC--CCCCCCHHHHHHHHHHH--HHHCCHHHHHHHHHHHCCCCCEEEEHHHHHHH T ss_conf 9999999999-9998066221311334--45540489999999999--99553889999999603799656616348999 Q ss_pred HHHHHHHHHHHHHHHH-CCHHH---HHHHHHHHHHHHHHHHHHH Q ss_conf 9999998999999984-85289---9999999999999999774 Q gi|255764470|r 170 LMMMAVSQPMIRYISR-HTTVV---ILCLGFLLMIGFLLIIEGL 209 (523) Q Consensus 170 ~~m~~~s~~i~~~i~~-~p~~~---~l~l~fl~~iG~~L~~eg~ 209 (523) .+--.-+-...+++.+ .|.-+ .=|+.|+++ |..+.-+++ T Consensus 144 ~l~s~laVl~G~~ia~ki~~r~l~~~aallFl~f-al~~~~~~~ 186 (190) T COG2119 144 ILASVLAVLLGKLIAGKLPERLLRFIAALLFLIF-ALVLLWQVF 186 (190) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH-HHHHHHHHH T ss_conf 9998899997799871178899999999999999-999999877 No 249 >TIGR03595 Obg_CgtA_exten Obg family GTPase CgtA, C-terminal extension. CgtA (see model TIGR02729) is a broadly conserved member of the obg family of GTPases associated with ribosome maturation. This model represents a unique C-terminal domain found in some but not all sequences of CgtA. This region is preceded, and may be followed, by a region of low-complexity sequence. Probab=48.87 E-value=16 Score=15.41 Aligned_cols=61 Identities=16% Similarity=0.164 Sum_probs=34.6 Q ss_pred CEEEECCCEEEEEECCCHHHHHHHHCCCCCCCCCCCCCHHHH-----HHHHHCCC-CCCCCEEEECCEEEE Q ss_conf 128834985998700119999998288999987782407899-----99883124-889988999999999 Q gi|255764470|r 437 DITVGDDGSLTVDGWIDVRYASKLFGVNLVDEDDRYSTLAGF-----ILWRLGHL-PQEKEIFTEMNLKFE 501 (523) Q Consensus 437 ~i~~~~d~~~~v~G~~~l~el~~~l~~~l~~e~~~~~TlaGl-----il~~l~~i-P~~Ge~i~~~g~~f~ 501 (523) .+.+.++|.|.|.|.- ++.+-...+++-++ + ..-++-. +-+.|.+. -+.||++.+++++|+ T Consensus 2 ~I~r~~dg~~~V~G~~-ier~~~~t~~~~~e--a-~~rf~~~Lk~~Gv~~~L~~~G~k~GD~V~I~~~eFe 68 (69) T TIGR03595 2 EIIRDEDGVFVVSGKK-IERWVAKTPFNNDE--N-LRRFARKLKKLGVEDALRKAGAKDGDTVRIGDFEFE 68 (69) T ss_pred EEEECCCCEEEEECCH-HHHHHHHCCCCCHH--H-HHHHHHHHHHCCHHHHHHHCCCCCCCEEEECCEEEE T ss_conf 8999789769997728-99999866999999--9-999999999889789999869999999999348996 No 250 >TIGR01476 chlor_syn_BchG bacteriochlorophyll/chlorophyll synthetase; InterPro: IPR006372 These are a subfamily of a large family of polyprenyltransferases that also includes 4-hydroxybenzoate octaprenyltransferase and protoheme IX farnesyltransferase (heme O synthase). Members of this family are found exclusively in photosynthetic organisms, including a single copy in Arabidopsis thaliana.; GO: 0030494 bacteriochlorophyll biosynthetic process. Probab=47.25 E-value=17 Score=15.24 Aligned_cols=19 Identities=21% Similarity=0.195 Sum_probs=10.6 Q ss_pred HHHHHHHHHHHHH-HHHHHH Q ss_conf 2788998999999-999899 Q gi|255764470|r 216 GYLYASIGFSGII-EFFNQV 234 (523) Q Consensus 216 ~y~y~~~~fs~~v-e~~n~~ 234 (523) -|+|+++.|-++| ..+||. T Consensus 242 ~~~~A~~v~~lLvAq~~~~~ 261 (295) T TIGR01476 242 QPLVATIVLLLLVAQIIAQA 261 (295) T ss_pred CHHHHHHHHHHHHHHHHHHH T ss_conf 37899999999843366424 No 251 >cd04393 RhoGAP_FAM13A1a RhoGAP_FAM13A1a: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of FAM13A1, isoform a-like proteins. The function of FAM13A1a is unknown. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by up several orders of magnitude. Probab=47.16 E-value=17 Score=15.23 Aligned_cols=10 Identities=30% Similarity=0.444 Sum_probs=4.1 Q ss_pred HHHHHHHHHH Q ss_conf 9999998999 Q gi|255764470|r 226 GIIEFFNQVA 235 (523) Q Consensus 226 ~~ve~~n~~~ 235 (523) .+++.++.++ T Consensus 133 ~L~~~L~~v~ 142 (189) T cd04393 133 FLCHFLSNVA 142 (189) T ss_pred HHHHHHHHHH T ss_conf 9999999998 No 252 >COG5128 Transport protein particle (TRAPP) complex subunit [Intracellular trafficking and secretion] Probab=46.83 E-value=12 Score=16.52 Aligned_cols=86 Identities=20% Similarity=0.318 Sum_probs=51.6 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH----HCCCC-CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 899999998485289999999999999999977----45733-7027889989999999998999998755430103777 Q gi|255764470|r 176 SQPMIRYISRHTTVVILCLGFLLMIGFLLIIEG----LHFDI-PKGYLYASIGFSGIIEFFNQVARRNREQLMSPSRLRA 250 (523) Q Consensus 176 s~~i~~~i~~~p~~~~l~l~fl~~iG~~L~~eg----~~~~i-pk~y~y~~~~fs~~ve~~n~~~~~~~~~~~~~~~~~e 250 (523) +.-+ +|+++.-.+-..|..|--||--..---. |.-.. ..|| -+| --++|+|| +.+||-++-.....+-+ T Consensus 30 ~qnl-~~~~~ev~lStmaflf~emI~~l~~q~s~~~dfE~kL~~~Gy---~vG-ikLlEL~n-fr~rnpkre~rIl~iLq 103 (208) T COG5128 30 EQNL-KIMKREVPLSTMAFLFCEMIEYLMEQRSGIQDFEAKLKSIGY---EVG-IKLLELCN-FRRRNPKREVRILTILQ 103 (208) T ss_pred HHHH-HHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHH-HHHHHHHH-HHHHCCCCHHHHHHHHH T ss_conf 7767-877514865889999999999999984126769999998618---988-99999988-86308642124999999 Q ss_pred HHHHHHHHHHCCCCCCC Q ss_conf 66765674415688876 Q gi|255764470|r 251 RTADAVLRLLGGKPIQP 267 (523) Q Consensus 251 RtA~aVLRLlG~~~~~~ 267 (523) |..-.+..++-++..+. T Consensus 104 ~ih~~lwsylf~~~~d~ 120 (208) T COG5128 104 RIHFDLWSYLFGDSDDR 120 (208) T ss_pred HHHHHHHHHHHCCCHHH T ss_conf 99989999983461677 No 253 >PRK08942 D,D-heptose 1,7-bisphosphate phosphatase; Validated Probab=46.68 E-value=18 Score=15.18 Aligned_cols=38 Identities=18% Similarity=0.369 Sum_probs=33.8 Q ss_pred CCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHH Q ss_conf 99989999988503862777522867425652599999 Q gi|255764470|r 384 NISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANIL 421 (523) Q Consensus 384 ~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIl 421 (523) --.+.+++..+++.+-.+.+|.+.-|--.|..|.+|+- T Consensus 31 ~~g~~~al~~l~~~g~~~~ivTNQsGI~rG~~t~~~~~ 68 (181) T PRK08942 31 IPGSIEAIARLKQAGYRVVVATNQSGIARGLFTEAQLN 68 (181) T ss_pred CCCHHHHHHHHHHCCCEEEEEECCHHHCCCCCCHHHHH T ss_conf 77889999999987996999958713425867799999 No 254 >TIGR02840 spore_YtaF putative sporulation protein YtaF; InterPro: IPR014205 This entry represents YtaF, which represents a protein family identified, at the time of the publication of the Carboxydothermus hydrogenoformans genome. The phylogenetic profile of the members matches the subset of the Firmicutes capable of forming endospores. These species include Bacillus anthracis, Clostridium tetani, Thermoanaerobacter tengcongensis, and Geobacillus kaustophilus.. Probab=46.55 E-value=18 Score=15.17 Aligned_cols=174 Identities=20% Similarity=0.273 Sum_probs=86.9 Q ss_pred HHHHHHHHHHHHHH-HHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999744399-99999851799899999999999999999999999999999974247999863453222489999 Q gi|255764470|r 17 TLIALELVLGIDNL-IFITLLVEKLPLAQRGKALVFGLTFAMVTRIALLASLSYWIVMLQQPLFFLKGLSFSGRDIVLIL 95 (523) Q Consensus 17 tl~~lEivLs~DN~-i~iai~~~~Lp~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l~~pl~~i~~~~~s~~~lill~ 95 (523) .++++=..+|.||. |=++==.||+-=.--- +..+=..-|+++=+.|.+.. |+...=-|-++ -.-...||+. T Consensus 2 ~~llLA~A~SlD~f~vG~~YGlrkIkip~~S-~liIa~~S~l~~~~sM~~G~--~L~~~lP~~~~-----~~lG~~iL~~ 73 (215) T TIGR02840 2 SLLLLAFAVSLDSFGVGIAYGLRKIKIPFLS-LLIIAVISGLFIFISMLLGK--LLAKFLPPKVT-----EILGAIILIA 73 (215) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCHHH-HHHHHHHHHHHHHHHHHHHH--HHHHHCCHHHH-----HHHHHHHHHH T ss_conf 7999999999999999999752465316699-99999999999999999999--99974676788-----8999999999 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCC-C----CCCCHHHHHHHHHHH--------------------HHHHHHHHHHHH Q ss_conf 999999998655553025764211112-2----233209999999999--------------------999985778999 Q gi|255764470|r 96 GGFFLLFKGTIELHERLEGDGFDKKHK-F----FSPVSWQVIVLQIVI--------------------LDLVFSLDSVVT 150 (523) Q Consensus 96 GG~fLl~k~~~el~~~~e~~~~~~~~~-~----~~~~~~~~~v~~I~v--------------------~D~vFSlDsVit 150 (523) -|+|.+|.+.++-.++. +.+.++. . -..+.|. .|.||.- +=++-|+||.-+ T Consensus 74 ~G~~~~~~~~r~k~~~~---~~~~~~~~~~vWr~~~~~lg-~~~~IL~~P~~AD~D~SG~I~~kEAllLG~ALsLDafGa 149 (215) T TIGR02840 74 IGIWIIYNAFRPKKERK---KRSSEKDLSKVWRFELKSLG-IVIQILRNPEVADLDKSGNITGKEALLLGIALSLDAFGA 149 (215) T ss_pred HHHHHHHHHHCCCCCCC---CCCCCCCCCCEEEEEEEHHH-HHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 99999999860578887---63001357610244650013-556653377626777886200899999999999999999 Q ss_pred HHHHH-HC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH----HHHH----HHHHHHHHHH Q ss_conf 99854-14--6999999999999999998999999984852----8999----9999999999 Q gi|255764470|r 151 AIGMV-QH--FSVMAIAVAVSALMMMAVSQPMIRYISRHTT----VVIL----CLGFLLMIGF 202 (523) Q Consensus 151 avg~~-~~--~~i~~~a~~isi~~m~~~s~~i~~~i~~~p~----~~~l----~l~fl~~iG~ 202 (523) =.|.+ -+ ...+.+...+.=.+-+++=-.+.++..++.. ++=. .=..|++||+ T Consensus 150 GiGAsl~G~~p~~~~~~~~~~S~~fv~~G~~lG~~~~~~~~G~FT~~~~s~~l~G~~Li~lG~ 212 (215) T TIGR02840 150 GIGASLLGLNPLATSILVAVMSFIFVSLGLKLGKKVSKVSIGKFTIKKFSTFLPGILLILLGV 212 (215) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 899988611789999999999999999999886788740453135877886648999999999 No 255 >cd00159 RhoGAP RhoGAP: GTPase-activator protein (GAP) for Rho-like GTPases; GAPs towards Rho/Rac/Cdc42-like small GTPases. Small GTPases (G proteins) cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when bound to GDP. The Rho family of small G proteins, which includes Cdc42Hs, activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. G proteins generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. The RhoGAPs are one of the major classes of regulators of Rho G proteins. Probab=46.49 E-value=18 Score=15.16 Aligned_cols=22 Identities=5% Similarity=0.229 Sum_probs=9.1 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 3370278899899999999989999 Q gi|255764470|r 212 DIPKGYLYASIGFSGIIEFFNQVAR 236 (523) Q Consensus 212 ~ipk~y~y~~~~fs~~ve~~n~~~~ 236 (523) ..|+.+. -.+..+++.++.+++ T Consensus 101 ~Lp~~~~---~~L~~l~~~l~~v~~ 122 (169) T cd00159 101 SLPPENR---DLLKYLLKLLHKISQ 122 (169) T ss_pred HCCHHHH---HHHHHHHHHHHHHHH T ss_conf 7998999---999999999999985 No 256 >pfam11700 ATG22 Vacuole effluxer Atg22 like. Autophagy is a major survival survival mechanism in which eukaryotes recycle cellular nutrients during stress conditions. Atg22, Avt3 and Avt4 are partially redundant vacuolar effluxes, which mediate the efflux of leucine and other amino acids resulting from autophagy. This family also includes other transporter proteins. Probab=46.39 E-value=18 Score=15.15 Aligned_cols=100 Identities=17% Similarity=0.178 Sum_probs=43.6 Q ss_pred HHHHHHHHHHHHHHHHHHHH--HHHHH-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-HHHHHHHHHHHHHH---HH- Q ss_conf 99999999999857789999--98541-4699999999999999999899999998485-28999999999999---99- Q gi|255764470|r 132 IVLQIVILDLVFSLDSVVTA--IGMVQ-HFSVMAIAVAVSALMMMAVSQPMIRYISRHT-TVVILCLGFLLMIG---FL- 203 (523) Q Consensus 132 ~v~~I~v~D~vFSlDsVita--vg~~~-~~~i~~~a~~isi~~m~~~s~~i~~~i~~~p-~~~~l~l~fl~~iG---~~- 203 (523) -....--+..++++=++.++ .|++. ++.+..+..-+.-.+--+....+.+-....| ...++++.+..++. .. T Consensus 290 ~~~y~Dgl~ti~~~~~iy~~~vlg~s~~~l~~~~l~~~i~a~~Ga~~~g~l~~r~g~k~~~~~~~~l~~~~~i~l~g~~g 369 (477) T pfam11700 290 WFLLSDGIATVNSLAILFAKTVLGMSTAALIVLSITAQLAAVLGAFLWPFLSRRFGLKTKQTLIAIIVLWELIPLYGLLG 369 (477) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99998279899999999999870988889999999999999999999999998949813899999999999999999999 Q ss_pred HHHHHHCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 9997745733702788998999999999 Q gi|255764470|r 204 LIIEGLHFDIPKGYLYASIGFSGIIEFF 231 (523) Q Consensus 204 L~~eg~~~~ipk~y~y~~~~fs~~ve~~ 231 (523) +.....|+|-|-.+..++..+.+..-.. T Consensus 370 ~~~~~~~~~~~~~f~~~~~~~G~~~G~~ 397 (477) T pfam11700 370 IIPNVIGFKNPWEFYVLAALYGLVLGGL 397 (477) T ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 8862114345399999999999986146 No 257 >TIGR02475 CobW cobalamin biosynthesis protein CobW; InterPro: IPR012824 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CobW proteins, which are generally found proximal to the trimeric cobaltochelatase subunit CobN, an essential protein for vitamin B12 (cobalamin) biosynthesis . They contain a P-loop nucleotide-binding loop in the N-terminal domain and a histidine-rich region in the C-terminal portion suggesting a role in metal binding, possibly as an intermediary between the cobalt transport and chelation systems. CobW might be involved in cobalt reduction leading to cobalt(I) corrinoids.; GO: 0009236 cobalamin biosynthetic process. Probab=46.35 E-value=9.9 Score=17.02 Aligned_cols=49 Identities=20% Similarity=0.447 Sum_probs=29.6 Q ss_pred HHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEE Q ss_conf 5038627775228674256525999999870664457888631288349859 Q gi|255764470|r 395 RKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSL 446 (523) Q Consensus 395 r~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~ 446 (523) -.++.++|++|||||.. |+ ==+||...-=+-=+|.+....+.++.||.. T Consensus 27 NA~GkRiAvIVNEFGd~-Gi--DG~iLk~Cgi~~C~Eeeaq~~i~ELaNGCi 75 (349) T TIGR02475 27 NAEGKRIAVIVNEFGDL-GI--DGEILKACGIEACSEEEAQVNIVELANGCI 75 (349) T ss_pred CCCCCEEEEEEECCCCC-CC--CHHHHHHCCCCCCCCHHHCCCEEECCCCCE T ss_conf 74787699998525560-63--679996435678880120001488088973 No 258 >cd04376 RhoGAP_ARHGAP6 RhoGAP_ARHGAP6: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of ArhGAP6-like proteins. ArhGAP6 shows GAP activity towards RhoA, but not towards Cdc42 and Rac1. ArhGAP6 is often deleted in microphthalmia with linear skin defects syndrome (MLS); MLS is a severe X-linked developmental disorder. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=46.06 E-value=18 Score=15.12 Aligned_cols=13 Identities=23% Similarity=0.580 Sum_probs=7.2 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9999999899999 Q gi|255764470|r 225 SGIIEFFNQVARR 237 (523) Q Consensus 225 s~~ve~~n~~~~~ 237 (523) ..+++.++.+++. T Consensus 119 ~~L~~~L~~V~~~ 131 (206) T cd04376 119 HRLLKFLHTVAEH 131 (206) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999984 No 259 >TIGR01084 mutY A/G-specific adenine glycosylase; InterPro: IPR005760 The DNA repair enzyme MutY plays an important role in the prevention of DNA mutations resulting from the presence of the oxidatively damaged lesion 7,8-dihydro-8-oxo-2'-deoxyguanosine (8-OxoG). 8-OxoG can mispair with 2'-deoxycytidine 5'-triphosphate or with 2'-deoxyadenosine triphosphate during DNA replication, forming C*8-oxoG and A*8-oxoG mispairs. Human MutY is responsible for recognition and removal of the inappropriately inserted adenine in an A=8-oxoG mispair. If unrepaired, the A=8-oxoG mispairs can result in deleterious C:G to A:T transversions. Multiple forms of mammalian MutY are formed by alternative splicing and locate to the nucleus or mitochondrion, where they have been shown to interact with several other proteins involved in the repair of DNA damage . The HhH-GPD domain within the protein binds the phosphate backbone of the substrate. This family represents bacterial MutY and a limited number of murine homologs. In rat, the Escherichia coli MutY DNA glycosylase homolog (MYH) is induced in response to mitochondrial DNA damage . ; GO: 0019104 DNA N-glycosylase activity, 0006284 base-excision repair, 0005622 intracellular. Probab=43.45 E-value=12 Score=16.47 Aligned_cols=141 Identities=18% Similarity=0.126 Sum_probs=73.9 Q ss_pred HHHHHHHHCCHHHHHHHHHH------------------HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999984852899999999------------------999999999774573370278899899999999989999987 Q gi|255764470|r 178 PMIRYISRHTTVVILCLGFL------------------LMIGFLLIIEGLHFDIPKGYLYASIGFSGIIEFFNQVARRNR 239 (523) Q Consensus 178 ~i~~~i~~~p~~~~l~l~fl------------------~~iG~~L~~eg~~~~ipk~y~y~~~~fs~~ve~~n~~~~~~~ 239 (523) +.-+|+++||||.-||=+=. |-=+..-|.|-||-.+|+.+ +.-+. T Consensus 60 Yf~rFle~FPTv~~LA~A~~deVL~lW~GLGYYaRARNL~kAA~~v~~~fGG~fP~d~-----------------~~~~~ 122 (297) T TIGR01084 60 YFERFLERFPTVQALANAPQDEVLKLWEGLGYYARARNLHKAAQEVVEEFGGEFPQDL-----------------EDLKA 122 (297) T ss_pred CCHHHHHHCCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCH-----------------HHHHH T ss_conf 0047664278857874779657999862578678889999999999987188177237-----------------97851 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCC-----------------CCHHHHHHHHHHHHHCCCC-CHHHHHHHHHHHCC Q ss_conf 55430103777667656744156888766-----------------7988899999967746678-98999999975113 Q gi|255764470|r 240 EQLMSPSRLRARTADAVLRLLGGKPIQPQ-----------------GLNVKADVLLPTQHEKHII-SDQEKDMVQSVLTL 301 (523) Q Consensus 240 ~~~~~~~~~~eRtA~aVLRLlG~~~~~~~-----------------~~~ee~~~li~~~~~~g~l-~~~E~~mi~~vL~L 301 (523) .+- .=-.||.||+.+-=+++..-. ....+.+.-+....++-.= ..+=...=++++|| T Consensus 123 L~G-----VG~yTAgAils~a~~~~~p~~DGNV~RVLsR~fA~~~~~~~k~~e~~l~~~~~~llpe~~~~~~~nqalmDl 197 (297) T TIGR01084 123 LPG-----VGRYTAGAILSFAYNKPVPILDGNVKRVLSRLFAVEGWPGKKKVENRLWELAESLLPEKADPEAFNQALMDL 197 (297) T ss_pred CCC-----CCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHC T ss_conf 789-----762179999998726876201540788999998624898873488999999998588656865888899862 Q ss_pred CCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 7866746534000289960899989999999857997888876785 Q gi|255764470|r 302 ADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSL 347 (523) Q Consensus 302 ~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~ 347 (523) +-+ |-+|..=- =.-.|+.+...-..+..-..||+-+... T Consensus 198 GA~----iC~rk~P~---C~~CPl~~~C~A~~~~~~~~yP~kk~~~ 236 (297) T TIGR01084 198 GAL----ICTRKKPK---CDLCPLQDFCLAYKQGTPEEYPVKKKKK 236 (297) T ss_pred CCC----CCCCCCCC---CCCCCHHHHHHHHHCCCHHHCCCCCCCC T ss_conf 361----03784785---4548706655565427711167877310 No 260 >COG4708 Predicted membrane protein [Function unknown] Probab=43.41 E-value=18 Score=15.21 Aligned_cols=17 Identities=35% Similarity=0.260 Sum_probs=10.9 Q ss_pred CCCCCHHHHHHHHHHHH Q ss_conf 73370278899899999 Q gi|255764470|r 211 FDIPKGYLYASIGFSGI 227 (523) Q Consensus 211 ~~ipk~y~y~~~~fs~~ 227 (523) --+||.|+|+++-||+- T Consensus 104 gi~~k~~i~~~i~fsis 120 (169) T COG4708 104 GIINKAFIFFSILFSIS 120 (169) T ss_pred CCCHHHHHHCCHHHHHH T ss_conf 52304453240899999 No 261 >cd04400 RhoGAP_fBEM3 RhoGAP_fBEM3: RhoGAP (GTPase-activator [GAP] protein for Rho-like small GTPases) domain of fungal BEM3-like proteins. Bem3 is a GAP protein of Cdc42, and is specifically involved in the control of the initial assembly of the septin ring in yeast bud formation. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=42.56 E-value=20 Score=14.75 Aligned_cols=17 Identities=35% Similarity=0.534 Sum_probs=6.9 Q ss_pred CCCHHHHHHHHHCCCCC Q ss_conf 24078999988312488 Q gi|255764470|r 472 YSTLAGFILWRLGHLPQ 488 (523) Q Consensus 472 ~~TlaGlil~~l~~iP~ 488 (523) .+++||++-..+..+|+ T Consensus 78 vh~va~lLK~flReLPe 94 (190) T cd04400 78 VHTVAGLLKLYLRELPT 94 (190) T ss_pred HHHHHHHHHHHHHCCCC T ss_conf 79999999999861998 No 262 >pfam01927 DUF82 Protein of unknown function DUF82. This prokaryotic protein family has no known function. The protein contains four conserved cysteines that may be involved in metal binding or disulphide bridges. Probab=41.85 E-value=21 Score=14.67 Aligned_cols=78 Identities=26% Similarity=0.313 Sum_probs=44.4 Q ss_pred HHHHHHCCCCCCCCCCHHHHHHHHHHHHH-CCCCCHHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHH Q ss_conf 56744156888766798889999996774-66789899999997511378667465340002899608999899999998 Q gi|255764470|r 255 AVLRLLGGKPIQPQGLNVKADVLLPTQHE-KHIISDQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKIL 333 (523) Q Consensus 255 aVLRLlG~~~~~~~~~~ee~~~li~~~~~-~g~l~~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~ 333 (523) .-||++|-........ +.+.++..+.+ +-++=...+.+.++... .....+..+++.+++.+.+. T Consensus 14 r~LR~lG~Dt~~~~~~--~D~~l~~~A~~E~RilLTrD~~L~~r~~~-------------~~~~ll~~~~~~~Ql~ev~~ 78 (146) T pfam01927 14 RWLRLLGYDTLYDNDF--EDDELLAIAAKEGRILLTRDRGLLKRRLL-------------TGGVLVRSLDPEEQLREVIE 78 (146) T ss_pred HHHHHCCCCEEECCCC--CHHHHHHHHHHCCCEEEECCHHHHHHCCC-------------CCEEEECCCCHHHHHHHHHH T ss_conf 9999869887015999--85999999866498999678899973323-------------78499789999999999999 Q ss_pred HCC--------CCEEEEECCCC Q ss_conf 579--------97888876785 Q gi|255764470|r 334 ELG--------HSRFPVAQGSL 347 (523) Q Consensus 334 ~~~--------~SR~PV~~~~~ 347 (523) .-+ +||=|.|++.+ T Consensus 79 ~~~l~~~~~~~~sRC~~CN~~L 100 (146) T pfam01927 79 RFGLKLSLKPEFSRCLKCNGPL 100 (146) T ss_pred HCCCCCCCCCCCCCCCCCCCCC T ss_conf 6699743377778565679899 No 263 >COG1269 NtpI Archaeal/vacuolar-type H+-ATPase subunit I [Energy production and conversion] Probab=41.75 E-value=21 Score=14.66 Aligned_cols=35 Identities=17% Similarity=0.010 Sum_probs=24.8 Q ss_pred CCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 00289960899989999999857997888876785 Q gi|255764470|r 313 RTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSL 347 (523) Q Consensus 313 R~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~ 347 (523) +.-++....+...+++.+...+.+..++.|.+.+. T Consensus 191 ~~~~v~~~~~~~~~~v~~~~~~~~~~~~~v~~~~~ 225 (660) T COG1269 191 SVVIVVAHGAEDLDKVSKILNELGFELYEVPEFDG 225 (660) T ss_pred CEEEEEEECCCCCHHHHHHHHHCCCEECCCCCCCC T ss_conf 36999985122206799998755774345655568 No 264 >PRK05659 sulfur carrier protein ThiS; Validated Probab=40.73 E-value=22 Score=14.55 Aligned_cols=31 Identities=16% Similarity=0.231 Sum_probs=26.2 Q ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECC Q ss_conf 7435238999899999885038627775228 Q gi|255764470|r 377 KPLVVHENISVLKLMERLRKSSQTFVMVLDE 407 (523) Q Consensus 377 ~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE 407 (523) ++..+|+.+++.++|+++.-...++|+-++. T Consensus 8 e~~~~~~~~tl~~Ll~~l~~~~~~vAVevN~ 38 (66) T PRK05659 8 EPRELPDGESVAALLAREGLAGRRVAVEVNG 38 (66) T ss_pred EEEECCCCCCHHHHHHHCCCCCCCEEEEECC T ss_conf 6878699996999998769999809999899 No 265 >cd04395 RhoGAP_ARHGAP21 RhoGAP_ARHGAP21: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of ArhGAP21-like proteins. ArhGAP21 is a multi-domain protein, containing RhoGAP, PH and PDZ domains, and is believed to play a role in the organization of the cell-cell junction complex. It has been shown to function as a GAP of Cdc42 and RhoA, and to interact with alpha-catenin and Arf6. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=40.52 E-value=22 Score=14.53 Aligned_cols=11 Identities=27% Similarity=0.437 Sum_probs=4.2 Q ss_pred HHHHHHHHHHH Q ss_conf 99999998999 Q gi|255764470|r 225 SGIIEFFNQVA 235 (523) Q Consensus 225 s~~ve~~n~~~ 235 (523) .-++..++.++ T Consensus 132 ~~L~~~L~~V~ 142 (196) T cd04395 132 KHLIRHLKTVA 142 (196) T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 266 >cd04387 RhoGAP_Bcr RhoGAP_Bcr: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of Bcr (breakpoint cluster region protein)-like proteins. Bcr is a multidomain protein with a variety of enzymatic functions. It contains a RhoGAP and a Rho GEF domain, a Ser/Thr kinase domain, an N-terminal oligomerization domain, and a C-terminal PDZ binding domain, in addition to PH and C2 domains. Bcr is a negative regulator of: i) RacGTPase, via the Rho GAP domain, ii) the Ras-Raf-MEK-ERK pathway, via phosphorylation of the Ras binding protein AF-6, and iii) the Wnt signaling pathway through binding beta-catenin. Bcr can form a complex with beta-catenin and Tcf1. The Wnt signaling pathway is involved in cell proliferation, differentiation, and cell renewal. Bcr was discovered as a fusion partner of Abl. The Bcr-Abl fusion is characteristic for a large majority of chronic myelogenous leukemias (CML). Small GTPases cluster into distinct families, and all act as molecular switch Probab=39.99 E-value=22 Score=14.47 Aligned_cols=11 Identities=18% Similarity=0.265 Sum_probs=4.4 Q ss_pred HHHHHHHHHHH Q ss_conf 99999998999 Q gi|255764470|r 225 SGIIEFFNQVA 235 (523) Q Consensus 225 s~~ve~~n~~~ 235 (523) ..++..++.++ T Consensus 129 ~~L~~~L~~V~ 139 (196) T cd04387 129 LFLLHHLKRVA 139 (196) T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 267 >TIGR02829 spore_III_AE stage III sporulation protein AE; InterPro: IPR014194 This entry represents the stage III sporulation protein AE, which is encoded in a spore formation operon spoIIIAABCDEFGH under the control of sigma G . A comparative genome analysis of all sequenced genomes of Firmicutes shows that the proteins are strictly conserved among the sub-set of endospore-forming species. . Probab=39.82 E-value=22 Score=14.45 Aligned_cols=20 Identities=20% Similarity=0.401 Sum_probs=13.9 Q ss_pred HCCHHHHHHHHHHHHHHHHH Q ss_conf 48528999999999999999 Q gi|255764470|r 185 RHTTVVILCLGFLLMIGFLL 204 (523) Q Consensus 185 ~~p~~~~l~l~fl~~iG~~L 204 (523) .||.+|+++|+|+-=+-..+ T Consensus 328 aFP~IKil~lA~iyK~~AA~ 347 (392) T TIGR02829 328 AFPAIKILSLAFIYKLAAAV 347 (392) T ss_pred HHHHHHHHHHHHHHHHHHHH T ss_conf 97799999999999999875 No 268 >PRK13762 tRNA-modifying enzyme; Provisional Probab=38.10 E-value=13 Score=16.28 Aligned_cols=121 Identities=12% Similarity=0.168 Sum_probs=70.0 Q ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHH---CCCCEEEEEEEEECCEEEEEECCCCH----HHHHHHHHHCCCCEEEEEC Q ss_conf 8899999967746678989999999751---13786674653400028996089998----9999999857997888876 Q gi|255764470|r 272 VKADVLLPTQHEKHIISDQEKDMVQSVL---TLADRPAKSIMTPRTEIVWLDVNCVD----EDLQWKILELGHSRFPVAQ 344 (523) Q Consensus 272 ee~~~li~~~~~~g~l~~~E~~mi~~vL---~L~e~~V~~IMtPR~~i~~ld~~~s~----~e~~~~i~~~~~SR~PV~~ 344 (523) ++.+.+++.+- +..+.+|.+-- ........+.|.||.-..++..+-+. .|+++.+.+.+.|-|.|.+ T Consensus 92 DdPe~Ive~~i------~~h~~li~g~kG~p~v~~er~~EA~~p~H~AiSL~GEPtlYP~l~eLi~~~h~r~~stFLVTN 165 (321) T PRK13762 92 DDPEEIVEESI------KEQRKLLSGYKGNPKVDKEKFEEALEPKHVAISLSGEPTLYPRLPELIEEFHKRGFTTFLVTN 165 (321) T ss_pred CCHHHHHHHHH------HHHHHHHHCCCCCCCCCHHHHHHHCCCHHEEHHHCCCCCCCHHHHHHHHHHHHCCCCEEEEEC T ss_conf 89899999999------999999730589999898999872791001022048863202189999999857983799828 Q ss_pred CCC-CCEEEEEEH-HHHHHHHHCCCCCHHHHHHCCCEEECCCCCHHHHHHHHHHCCC Q ss_conf 785-527888888-8876202024660168873274352389998999998850386 Q gi|255764470|r 345 GSL-DSFIGIVSA-RDLLRDLLEEGSMNFKRSIRKPLVVHENISVLKLMERLRKSSQ 399 (523) Q Consensus 345 ~~~-D~iiGiv~~-kDLl~~~~~~~~~~~~~lir~~~~Vpe~~~l~~lL~~fr~~~~ 399 (523) |.. |.+-..-.. ..|.-......+...+.+.||.. -.-=-.+.+-|+.++..++ T Consensus 166 g~~P~~l~~l~~~PTQLYvSldAp~~e~~k~i~rPl~-~d~Wer~~~sL~~L~~~~~ 221 (321) T PRK13762 166 GTRPDVLEKLEAEPTQLYVSLDAPDKETYNRINRPVI-PDAWERILETLELLPSKKT 221 (321) T ss_pred CCCHHHHHHCCCCCCEEEEEEECCCHHHHHHHHCCCC-HHHHHHHHHHHHHHHCCCC T ss_conf 9898999856676542799800699999998616553-3189999999997543798 No 269 >pfam04224 DUF417 Protein of unknown function, DUF417. This family of uncharacterized proteins appears to be restricted to proteobacteria. Probab=37.43 E-value=24 Score=14.20 Aligned_cols=24 Identities=13% Similarity=-0.204 Sum_probs=9.4 Q ss_pred HCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 457337027889989999999998 Q gi|255764470|r 209 LHFDIPKGYLYASIGFSGIIEFFN 232 (523) Q Consensus 209 ~~~~ipk~y~y~~~~fs~~ve~~n 232 (523) .|+-+=|--+.-|.+..++.+..| T Consensus 127 ~g~FllKDivlLa~~l~l~~~~~~ 150 (153) T pfam04224 127 VGRFVLKDILLLAGALVVAKYDAN 150 (153) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 880349789999989999997888 No 270 >smart00324 RhoGAP GTPase-activator protein for Rho-like GTPases. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases. etter domain limits and outliers. Probab=37.36 E-value=24 Score=14.19 Aligned_cols=21 Identities=19% Similarity=0.322 Sum_probs=9.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 370278899899999999989999 Q gi|255764470|r 213 IPKGYLYASIGFSGIIEFFNQVAR 236 (523) Q Consensus 213 ipk~y~y~~~~fs~~ve~~n~~~~ 236 (523) .|+.+.. .+..++..|+.+++ T Consensus 106 LP~~n~~---~L~~l~~~L~~v~~ 126 (174) T smart00324 106 LPPANRA---TLRYLLAHLNRVAE 126 (174) T ss_pred CCHHHHH---HHHHHHHHHHHHHH T ss_conf 9889999---99999999999997 No 271 >pfam10083 DUF2321 Uncharacterized protein conserved in bacteria (DUF2321). Members of this family of hypothetical bacterial proteins have no known function. Probab=37.17 E-value=21 Score=14.59 Aligned_cols=44 Identities=30% Similarity=0.348 Sum_probs=31.8 Q ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCEEEEEEEEECCEEEEE Q ss_conf 8889999996774667898999999975113786674653400028996 Q gi|255764470|r 271 NVKADVLLPTQHEKHIISDQEKDMVQSVLTLADRPAKSIMTPRTEIVWL 319 (523) Q Consensus 271 ~ee~~~li~~~~~~g~l~~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~l 319 (523) +..+++..+...+...++++|++-++. ++.|.++. +||+++..+ T Consensus 82 ~~~leaa~elv~e~~~Ls~dE~e~f~~--sl~dl~~d---~PkT~va~~ 125 (158) T pfam10083 82 KRKLEAANELIEEDDNLSPDEKEQFQA--DLPDLTVE---TPRTKVAAL 125 (158) T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHH--HHHHHHHC---CCCHHHHHH T ss_conf 999999999998640079889999985--34888643---851256999 No 272 >PRK12297 obgE GTPase ObgE; Reviewed Probab=36.68 E-value=25 Score=14.12 Aligned_cols=59 Identities=24% Similarity=0.330 Sum_probs=35.6 Q ss_pred CEEEECCCEEEEEECCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHC---------CC-CCCCCEEEECCEEEEEE Q ss_conf 128834985998700119999998288999987782407899998831---------24-88998899999999999 Q gi|255764470|r 437 DITVGDDGSLTVDGWIDVRYASKLFGVNLVDEDDRYSTLAGFILWRLG---------HL-PQEKEIFTEMNLKFEII 503 (523) Q Consensus 437 ~i~~~~d~~~~v~G~~~l~el~~~l~~~l~~e~~~~~TlaGlil~~l~---------~i-P~~Ge~i~~~g~~f~V~ 503 (523) .+.+.++|.|.|.|.- ++.+...++++-+ +++. ++...|+ +. -+.||++.+.+++|+-. T Consensus 360 ~I~~~~dg~f~V~G~~-ier~~~~td~~~~------ea~~-~f~~~Lk~~Gv~~~L~~~Gak~GD~V~Ig~~eFe~~ 428 (429) T PRK12297 360 EITRDDDGTFVVSGEK-IERLFKMTNFNRD------ESLR-RFARQLKKMGVDEALRERGAKDGDIVRIGDFEFEFV 428 (429) T ss_pred EEEECCCCEEEEECHH-HHHHHHHCCCCCH------HHHH-HHHHHHHHCCHHHHHHHCCCCCCCEEEECCEEEEEE T ss_conf 9998789779997718-9999977799989------9999-999999988978999976999999999999999997 No 273 >TIGR01946 rnfD electron transport complex, RnfABCDGE type, D subunit; InterPro: IPR011303 The six subunit complex RnfABCDGE in Rhodobacter capsulatus encodes an apparent NADH oxidoreductase responsible for electron transport to nitrogenase, necessary for nitrogen fixation . A closely related complex in Escherichia coli, RsxABCDGE (Reducer of SoxR), reduces the 2Fe-2S-containing superoxide sensor SoxR, active as a transcription factor when oxidized . This family of putative NADH oxidoreductase complexes exists in many of the same species as the related NQR, a Na(+)-translocating NADH-quinone reductase, but is distinct. This entry describes the D subunit.; GO: 0006118 electron transport, 0016021 integral to membrane. Probab=36.63 E-value=25 Score=14.11 Aligned_cols=15 Identities=27% Similarity=0.361 Sum_probs=7.0 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999999999999999 Q gi|255764470|r 47 KALVFGLTFAMVTRI 61 (523) Q Consensus 47 ~a~~~Gl~~A~v~R~ 61 (523) -|..-|+.+|+..=- T Consensus 70 Sa~lTg~LLAv~iPp 84 (346) T TIGR01946 70 SAILTGLLLAVAIPP 84 (346) T ss_pred HHHHHHHHHHHHHHH T ss_conf 799999999997533 No 274 >PRK10429 melibiose:sodium symporter; Provisional Probab=36.52 E-value=25 Score=14.10 Aligned_cols=37 Identities=19% Similarity=0.270 Sum_probs=22.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHC-CCCHHHHHHHHHH Q ss_conf 999999999974439999999851-7998999999999 Q gi|255764470|r 15 LATLIALELVLGIDNLIFITLLVE-KLPLAQRGKALVF 51 (523) Q Consensus 15 l~tl~~lEivLs~DN~i~iai~~~-~Lp~~~r~~a~~~ 51 (523) +++.++..+.-+.=|.=+-|+.+. .-.++||.+.--| T Consensus 110 ~~~yil~~~~yT~~~vPy~aL~~elT~D~~ERt~l~~~ 147 (473) T PRK10429 110 CVTYILWGMTYTIMDIPFWSLVPTITLDKREREQLVPY 147 (473) T ss_pred HHHHHHHHHHHHHEECCCCCCHHHHCCCHHHHHHHHHH T ss_conf 99999999999841563000003430688889999999 No 275 >PRK00468 hypothetical protein; Provisional Probab=36.11 E-value=25 Score=14.06 Aligned_cols=48 Identities=17% Similarity=0.192 Sum_probs=34.2 Q ss_pred HHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHHHHHHHHCCC Q ss_conf 999999870664457888631288349859987001199999982889 Q gi|255764470|r 417 PANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVRYASKLFGVN 464 (523) Q Consensus 417 leDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~el~~~l~~~ 464 (523) ++|++|.|+..+.|..|+-.--...++++.+..=+.+=+|+-+..|-+ T Consensus 1 Mk~Lve~iv~~LVd~Pd~V~V~~~~~~~~~~~~l~V~~~D~GkVIGk~ 48 (75) T PRK00468 1 MKELVETIAKALVDNPDAVQVNEIEGEQSIILELKVAPEDMGKVIGKQ 48 (75) T ss_pred CHHHHHHHHHHHCCCCCEEEEEEEECCCEEEEEEEECHHHCCCEECCC T ss_conf 978999999984899883899999679879999999935586466778 No 276 >PRK06083 sulfur carrier protein ThiS; Provisional Probab=35.80 E-value=26 Score=14.02 Aligned_cols=31 Identities=16% Similarity=0.283 Sum_probs=23.8 Q ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECC Q ss_conf 7435238999899999885038627775228 Q gi|255764470|r 377 KPLVVHENISVLKLMERLRKSSQTFVMVLDE 407 (523) Q Consensus 377 ~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE 407 (523) ++.-+++..++.++++++.-...++|+.++. T Consensus 11 e~~~v~~~~tl~~Ll~~l~l~~~giAvaiN~ 41 (69) T PRK06083 11 QSYQVPTSSSLAQIIEQLSLPELGCVFAINN 41 (69) T ss_pred EEEECCCCCCHHHHHHHHCCCCCCEEEEECC T ss_conf 8877389983999999738998868999778 No 277 >PRK05863 sulfur carrier protein ThiS; Provisional Probab=35.25 E-value=26 Score=13.96 Aligned_cols=31 Identities=19% Similarity=0.289 Sum_probs=22.6 Q ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECC Q ss_conf 7435238999899999885038627775228 Q gi|255764470|r 377 KPLVVHENISVLKLMERLRKSSQTFVMVLDE 407 (523) Q Consensus 377 ~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE 407 (523) ++.-+|+.+++.++++++.-...++|+-+|+ T Consensus 8 e~~e~~~~~Tva~Ll~~l~~~~~giAVavN~ 38 (65) T PRK05863 8 EQVEVDEQTTVAALLDSLGFPEKGIAVAVDW 38 (65) T ss_pred EEEECCCCCCHHHHHHHCCCCCCCEEEEECC T ss_conf 3578699870999998729998858999789 No 278 >pfam10797 YhfT Protein of unknown function. This family is conserved in Firmicutes and Proteobacteria. The function is not known but several members are annotated as being homologues of E coli YhfT, a protein thought to be involved in fatty acid oxidation. Probab=35.09 E-value=26 Score=13.94 Aligned_cols=54 Identities=20% Similarity=0.178 Sum_probs=31.5 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHH----------HHHHHHHHHHHHHHHHHHC Q ss_conf 46999999999999999998999999984852899----------9999999999999997745 Q gi|255764470|r 157 HFSVMAIAVAVSALMMMAVSQPMIRYISRHTTVVI----------LCLGFLLMIGFLLIIEGLH 210 (523) Q Consensus 157 ~~~i~~~a~~isi~~m~~~s~~i~~~i~~~p~~~~----------l~l~fl~~iG~~L~~eg~~ 210 (523) |.++-.+...+-+.+=+..=..+.||.+|+|.++- -+|-+=+++|..+-++..+ T Consensus 310 np~vA~i~Ga~v~~~Ev~lL~~iak~LdkfP~vr~~gd~IRtAM~~~levALLiGg~~Aan~m~ 373 (421) T pfam10797 310 NPIVAAILGAVIIFLEVLLLRRIAKFLDRFPGVRNSGDNIRTAMTRLLEVALLIGGANAANKMA 373 (421) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 8189999999999999999999999860297643436779999999999999981699998756 No 279 >TIGR02398 gluc_glyc_Psyn glucosylglycerol-phosphate synthase; InterPro: IPR012764 Glucosylglycerol-phosphate synthase catalyzes the key step in the biosynthesis of the osmolyte glucosylglycerol. It is known in several cyanobacteria and in Pseudomonas anguilliseptica. The enzyme is closely related to the alpha,alpha-trehalose-phosphate synthase, likewise involved in osmolyte biosynthesis, of Escherichia coli and many other bacteria. A close homolog from Xanthomonas campestris is excluded from this family.; GO: 0016758 transferase activity transferring hexosyl groups, 0051473 glucosylglycerol biosynthetic process. Probab=34.23 E-value=18 Score=15.20 Aligned_cols=58 Identities=21% Similarity=0.088 Sum_probs=29.4 Q ss_pred HCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCC-CEEEE-ECCCCCCEEEEEEHHHHHHH Q ss_conf 113786674653400028996089998999999985799-78888-76785527888888887620 Q gi|255764470|r 299 LTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGH-SRFPV-AQGSLDSFIGIVSARDLLRD 362 (523) Q Consensus 299 L~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~-SR~PV-~~~~~D~iiGiv~~kDLl~~ 362 (523) .+|...+|. |--++|...-...++++-.+++.+.-- ++=.+ --+.+|=.-| .+++|.+ T Consensus 252 v~L~a~PVG---~~p~~ir~~~~~a~~~e~~e~ir~el~kg~KLILSa~RvDYtKG---~~E~L~a 311 (495) T TIGR02398 252 VKLDAFPVG---TDPERIREAVAAASVRERIEKIREELAKGRKLILSAERVDYTKG---IKELLVA 311 (495) T ss_pred EEECCCCCC---CCHHHHHHHHHCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCC---HHHHHHH T ss_conf 873153557---88899999873454789999999986048526885010011044---1577888 No 280 >COG1279 Lysine efflux permease [General function prediction only] Probab=33.85 E-value=27 Score=13.81 Aligned_cols=164 Identities=21% Similarity=0.216 Sum_probs=78.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999974439999999851799899999999999999999999999----99999997424799986345322 Q gi|255764470|r 13 IGLATLIALELVLGIDNLIFITLLVEKLPLAQRGKALVFGLTFAMVTRIALLA----SLSYWIVMLQQPLFFLKGLSFSG 88 (523) Q Consensus 13 ~~l~tl~~lEivLs~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~A~v~R~~ll~----~~~~wi~~l~~pl~~i~~~~~s~ 88 (523) -||+--.-+=+-+|.-|+.++-=-.+| ++++..=+. +.+.-++++. ..+..+ ...-.. T Consensus 7 ~Gf~l~~~LI~pIGaQNaFVl~QGi~r------~~~l~~~~~-c~i~D~~Li~~gv~G~~~li-----------~~~p~l 68 (202) T COG1279 7 QGFLLGASLILPIGAQNAFVLNQGIRR------EYVLPIALL-CAISDIVLISAGVFGVGALI-----------AKSPWL 68 (202) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHH------CCHHHHHHH-HHHHHHHHHHHHHHHHHHHH-----------HHCHHH T ss_conf 899999999985340359999988866------219999999-99999999999998899999-----------867899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHH-HH--------HHHHHHHHHHHHHHHHHH--HC Q ss_conf 24899999999999986555530257642111122233209999999-99--------999998577899999854--14 Q gi|255764470|r 89 RDIVLILGGFFLLFKGTIELHERLEGDGFDKKHKFFSPVSWQVIVLQ-IV--------ILDLVFSLDSVVTAIGMV--QH 157 (523) Q Consensus 89 ~~lill~GG~fLl~k~~~el~~~~e~~~~~~~~~~~~~~~~~~~v~~-I~--------v~D~vFSlDsVitavg~~--~~ 157 (523) .+.+.+.|.+||+|-++.-++....+....+ .....+.+++.++.+ +. .+|-++ +|.+.|.. ++ T Consensus 69 ~~i~~~~G~~FLl~yg~~a~~~a~~~~~~~~-~~~~~~~~~~~~l~~alavT~LNPhvyLDtvv----liGs~~~~~~~~ 143 (202) T COG1279 69 LLIVRWGGAAFLLYYGLLALKSAPRGPSQLQ-VAEFTKLKLKKVLLFALAVTLLNPHVYLDTVV----LIGSLAAQLSDE 143 (202) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCHHHCC-CCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHH----HHHHHHHHCCCC T ss_conf 9999999899999999999985066600045-46766222799999999999807213344677----651216540652 Q ss_pred HHHHHHHHHHHHHHHHHH--------------HHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 699999999999999999--------------899999998485289999999999 Q gi|255764470|r 158 FSVMAIAVAVSALMMMAV--------------SQPMIRYISRHTTVVILCLGFLLM 199 (523) Q Consensus 158 ~~i~~~a~~isi~~m~~~--------------s~~i~~~i~~~p~~~~l~l~fl~~ 199 (523) --..+++..++.-.+.|+ +...-+.+|+--.++|.+++|-+. T Consensus 144 ~k~~F~~Ga~~aS~~WF~~L~~~a~~l~~~~~~pk~~riin~vva~vM~~ia~~L~ 199 (202) T COG1279 144 AKWFFALGAISASFLWFFLLALGARWLSPLLANPKAWRIINLVVAVVMWALAVKLA 199 (202) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCHHCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 12789998999999999999999999761105819999999999999999999995 No 281 >TIGR00892 2A0113 monocarboxylate transporter; InterPro: IPR004743 Proton-linked monocarboxylate transporters catalyse the rapid transport across the plasma membrane of many monocarboxylates such as lactate, pyruvate, branched-chain oxo acids derived from leucine, valine and isoleucine, and the ketone bodies acetoacetate, beta-hydroxybutyrate and acetate.; GO: 0015355 secondary active monocarboxylate transmembrane transporter activity, 0015711 organic anion transport, 0016021 integral to membrane. Probab=33.69 E-value=27 Score=13.79 Aligned_cols=28 Identities=39% Similarity=0.390 Sum_probs=21.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4247999863453222489999999999 Q gi|255764470|r 74 LQQPLFFLKGLSFSGRDIVLILGGFFLL 101 (523) Q Consensus 74 l~~pl~~i~~~~~s~~~lill~GG~fLl 101 (523) ...||=+.+.-.|-||.=.|++||+||- T Consensus 165 ~L~Pl~q~L~~~fGWrG~FLilGGllLh 192 (475) T TIGR00892 165 TLAPLNQYLFESFGWRGSFLILGGLLLH 192 (475) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHH T ss_conf 6567889987615850478999999998 No 282 >TIGR02965 xanthine_xdhB xanthine dehydrogenase, molybdopterin binding subunit; InterPro: IPR014309 Members of this entry are the molybdopterin-containing large subunit of xanthine dehydrogenase (or, in, eukaryotes, the molybdopterin-binding domain) and those enzymes that reduces the purine pool by catabolizing xanthine to urate. This entry contains primarily bacterial sequences; it does not manage to include all eukaryotic xanthine dehydrogenases and thereby discriminate them from the closely related enzyme aldehyde dehydrogenase.; GO: 0004854 xanthine dehydrogenase activity, 0004855 xanthine oxidase activity, 0030151 molybdenum ion binding, 0050660 FAD binding. Probab=33.63 E-value=12 Score=16.42 Aligned_cols=51 Identities=10% Similarity=0.183 Sum_probs=33.6 Q ss_pred HCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCC Q ss_conf 3274352389998999998850386277752286742565259999998706644 Q gi|255764470|r 375 IRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAGDFP 429 (523) Q Consensus 375 ir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~ 429 (523) .-..+|.|+-.--..-++. ++.=.----=|||.+|+++.|.|+++++-.+- T Consensus 314 ~DNAYflp~v~~~s~~~kT----NT~SnTAfRGFGGPQGm~~iE~~~d~vA~~lG 364 (761) T TIGR02965 314 ADNAYFLPDVRITSHRLKT----NTVSNTAFRGFGGPQGMVAIERIIDEVARALG 364 (761) T ss_pred CCCCCCCCCCEEEEEEEEC----CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHC T ss_conf 2334416562573211221----46644412278854579999999999999838 No 283 >COG1656 Uncharacterized conserved protein [Function unknown] Probab=32.30 E-value=29 Score=13.63 Aligned_cols=82 Identities=20% Similarity=0.307 Sum_probs=48.9 Q ss_pred HHHHHHHHHHCCCCCCCC-CCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHH Q ss_conf 667656744156888766-7988899999967746678989999999751137866746534000289960899989999 Q gi|255764470|r 251 RTADAVLRLLGGKPIQPQ-GLNVKADVLLPTQHEKHIISDQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQ 329 (523) Q Consensus 251 RtA~aVLRLlG~~~~~~~-~~~ee~~~li~~~~~~g~l~~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~ 329 (523) +.|. .||++|....... ...+++..+ +. .+...+-...+.++.+. ...- ..+.+..++..+++. T Consensus 17 ~LAR-wLRllGydt~~~~~~~d~~i~~i-~~-~e~rIllTRDr~L~~r~-k~g~-----------~~i~i~~~s~~~Ql~ 81 (165) T COG1656 17 KLAR-WLRLLGYDTVYSSNESDDEIILI-AK-KEGRILLTRDRELYKRA-KLGI-----------KAILIRSDSIEEQLA 81 (165) T ss_pred HHHH-HHHHCCCCEEEECCCCCHHHHHH-HH-CCCEEEEECCHHHHHHH-HCCC-----------CEEEEECCCHHHHHH T ss_conf 9999-99980884254236783889998-74-47808996229999986-4267-----------328971798799999 Q ss_pred HHHH---H-----CCCCEEEEECCCC Q ss_conf 9998---5-----7997888876785 Q gi|255764470|r 330 WKIL---E-----LGHSRFPVAQGSL 347 (523) Q Consensus 330 ~~i~---~-----~~~SR~PV~~~~~ 347 (523) +.+. . -.+||=|.|++.+ T Consensus 82 e~~~~~~l~~~~~~e~~RCp~CN~~L 107 (165) T COG1656 82 EFLARLGLKPRLFPEFSRCPECNGEL 107 (165) T ss_pred HHHHHHCCCHHCCCCCCCCCCCCCEE T ss_conf 99998352200160103274349780 No 284 >PRK06771 hypothetical protein; Provisional Probab=32.10 E-value=29 Score=13.61 Aligned_cols=60 Identities=18% Similarity=0.297 Sum_probs=41.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 89989999999998999998755430103777667656744156888766798889999996774 Q gi|255764470|r 219 YASIGFSGIIEFFNQVARRNREQLMSPSRLRARTADAVLRLLGGKPIQPQGLNVKADVLLPTQHE 283 (523) Q Consensus 219 y~~~~fs~~ve~~n~~~~~~~~~~~~~~~~~eRtA~aVLRLlG~~~~~~~~~~ee~~~li~~~~~ 283 (523) .++.+|--++|-+|++-.++..|.... |...+.+.+=+|....+++ .+.|+..+++.+.. T Consensus 9 i~~Fg~iYi~ekL~~~ekkTd~rLK~i----E~rlq~itke~gi~~~e~e-in~elr~Lv~~gk~ 68 (93) T PRK06771 9 IAIFGFIYIVEKLNKIEKKTDARLKRM----EDRLQLITKEMGIVDREPE-INKELRQLMEEGKT 68 (93) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHCCCCCCCC-CCHHHHHHHHCCCE T ss_conf 999899999999999988788899989----9999999997283116876-13999999985770 No 285 >TIGR01687 moaD_arch MoaD family protein; InterPro: IPR010038 Members of this family appear to be archaeal and bacterial (proteobacteria and Thermus) versions of MoaD, subunit 1 of molybdopterin converting factor.. Probab=31.59 E-value=23 Score=14.38 Aligned_cols=52 Identities=19% Similarity=0.302 Sum_probs=35.1 Q ss_pred EEEHHHHHHHHCCCCCCCCCC--CC-CEEEECCCEEEEEECCCHHHHHHHHCCCCC Q ss_conf 525999999870664457888--63-128834985998700119999998288999 Q gi|255764470|r 414 MITPANILEAIAGDFPDEDDQ--KL-DITVGDDGSLTVDGWIDVRYASKLFGVNLV 466 (523) Q Consensus 414 IVTleDIlE~IvGei~DE~d~--~~-~i~~~~d~~~~v~G~~~l~el~~~l~~~l~ 466 (523) .=|.+|++++|.-..+++..+ .. .....++=..+|+|+.+++.++. |+.+|. T Consensus 25 g~tv~dll~~l~~~Yp~~~~e~~~et~~~~~~~v~ilvNGran~~~l~G-L~~~Lk 79 (93) T TIGR01687 25 GKTVGDLLEELSSRYPKEFSELFKETGLGLVPNVIILVNGRANVDWLEG-LETELK 79 (93) T ss_pred CCCHHHHHHHHHHHCCHHHHHHHCCCCCCCCCEEEEEECCCCCCCCCCC-CCCCCC T ss_conf 9807998999886156556665147788764657898516414322036-575232 No 286 >TIGR01969 minD_arch cell division ATPase MinD; InterPro: IPR010224 Proper placement of the bacterial cell division site requires the site-specific inactivation of other potential division sites. In Escherichia coli, selection of the correct mid-cell site is mediated by the MinC, MinD and MinE proteins. Several members of this family are found in archaeal genomes but their function is uncharacterised. More distantly related proteins include flagellar biosynthesis proteins and ParA chromosome partitioning proteins. This entry represents the archaeal MinD family. The exact roles of the various archaeal MinD homologs are unknown. . Probab=31.21 E-value=19 Score=14.91 Aligned_cols=139 Identities=16% Similarity=0.126 Sum_probs=70.7 Q ss_pred EEEECCEEEEEECCCCHHHHHHHHHHCCCCEEE-EECCCCCCEEEEEEHHHHH-HHHHCCCC---CHHHHHHCCCE--EE Q ss_conf 534000289960899989999999857997888-8767855278888888876-20202466---01688732743--52 Q gi|255764470|r 309 IMTPRTEIVWLDVNCVDEDLQWKILELGHSRFP-VAQGSLDSFIGIVSARDLL-RDLLEEGS---MNFKRSIRKPL--VV 381 (523) Q Consensus 309 IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~P-V~~~~~D~iiGiv~~kDLl-~~~~~~~~---~~~~~lir~~~--~V 381 (523) -|+|..+|.-++..-|+|.++ +..-.|+| |.+. +...-|++ -+...+=+ ..--..-.+.+ .= T Consensus 72 Y~gp~GnV~V~PagvSLEg~r----KA~~~~L~dV~~~-------i~~~~D~lLIDAPAGL~~~a~~Al~~a~elLLVvN 140 (258) T TIGR01969 72 YEGPEGNVKVIPAGVSLEGLR----KADPDKLEDVLKE-------IIDDTDFLLIDAPAGLERDAVTALAAADELLLVVN 140 (258) T ss_pred CCCCCCCEEEECCCCCHHHCC----CCCHHHHHHHHHH-------HHCCCCEEEEECCCCCCHHHHHHHHHCCCCEEEEC T ss_conf 028898447850612210001----2683332899998-------72043778874789833789999986186648667 Q ss_pred CCCCCHHHHHHHH---HHCC-CEEEEEECCCCCEEEEEEHH---HHHHH-HCCCCCCCCCC------CCCEEEECCCEEE Q ss_conf 3899989999988---5038-62777522867425652599---99998-70664457888------6312883498599 Q gi|255764470|r 382 HENISVLKLMERL---RKSS-QTFVMVLDEYGVLEGMITPA---NILEA-IAGDFPDEDDQ------KLDITVGDDGSLT 447 (523) Q Consensus 382 pe~~~l~~lL~~f---r~~~-~~lAiVvDE~G~~~GIVTle---DIlE~-IvGei~DE~d~------~~~i~~~~d~~~~ 447 (523) ||-.++.|+|+.= ++-. .=+.+|+|.+|...+=++.+ +|||. +.|++|+..+- -.+++...++| T Consensus 141 PEi~SItDaLK~k~va~~lGt~ilG~vlNRv~~~~tel~~~eiE~iLevPVl~~vPEDP~VR~AAa~G~P~V~~~P~S-- 218 (258) T TIGR01969 141 PEISSITDALKVKIVAEKLGTAILGVVLNRVTRDKTELGREEIEAILEVPVLGVVPEDPEVRRAAAFGEPVVVYSPNS-- 218 (258) T ss_pred CCHHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCHHHHHHHHCCCEEEEECCCHHHHHHHHCCCCEEEECCCC-- T ss_conf 654467778899999876088324689960236666378889998847973898569843445642485368847998-- Q ss_pred EEECCCHHHHHHHH Q ss_conf 87001199999982 Q gi|255764470|r 448 VDGWIDVRYASKLF 461 (523) Q Consensus 448 v~G~~~l~el~~~l 461 (523) +|+-.+-++...+ T Consensus 219 -pAA~A~~eLA~~l 231 (258) T TIGR01969 219 -PAAQAFMELAAEL 231 (258) T ss_pred -HHHHHHHHHHHHH T ss_conf -7899999999997 No 287 >pfam04123 DUF373 Domain of unknown function (DUF373). Archaeal domain of unknown function. Predicted to be an integral membrane protein with six transmembrane regions. Probab=30.91 E-value=30 Score=13.47 Aligned_cols=34 Identities=18% Similarity=0.331 Sum_probs=21.4 Q ss_pred HHHHHHHHHHHHHHHCCHH-H-HHHHHHHHHHHHHH Q ss_conf 9999989999999848528-9-99999999999999 Q gi|255764470|r 171 MMMAVSQPMIRYISRHTTV-V-ILCLGFLLMIGFLL 204 (523) Q Consensus 171 ~m~~~s~~i~~~i~~~p~~-~-~l~l~fl~~iG~~L 204 (523) ++.++.+.+.+++++.+.. . +.+..|+..+|..+ T Consensus 275 l~~~~G~~id~~l~~~~~~~~~i~~~f~~~a~~~vv 310 (344) T pfam04123 275 LIAFLGKLIDEYLNGRFRLWRYINAPFFVLAIGLVV 310 (344) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 999999999999748954599999999999999999 No 288 >COG4023 SBH1 Preprotein translocase subunit Sec61beta [Intracellular trafficking and secretion] Probab=29.33 E-value=32 Score=13.29 Aligned_cols=21 Identities=19% Similarity=0.262 Sum_probs=16.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHH Q ss_conf 702788998999999999899 Q gi|255764470|r 214 PKGYLYASIGFSGIIEFFNQV 234 (523) Q Consensus 214 pk~y~y~~~~fs~~ve~~n~~ 234 (523) |+--+|.+++++++|.+-|.+ T Consensus 34 P~~vV~~~~av~~lvi~a~~f 54 (57) T COG4023 34 PRLVVYAGIAVAILVIAAHIF 54 (57) T ss_pred CHHHHHHHHHHHHHHHHHHHH T ss_conf 413678999999999999862 No 289 >pfam01747 ATP-sulfurylase ATP-sulfurylase. This family consists of ATP-sulfurylase or sulfate adenylyltransferase EC:2.7.7.4 some of which are part of a bifunctional polypeptide chain associated with adenosyl phosphosulphate (APS) kinase pfam01583. Both enzymes are required for PAPS (phosphoadenosine-phosphosulfate) synthesis from inorganic sulphate. ATP sulfurylase catalyses the synthesis of adenosine-phosphosulfate APS from ATP and inorganic sulphate. Probab=29.30 E-value=23 Score=14.31 Aligned_cols=24 Identities=13% Similarity=0.073 Sum_probs=13.2 Q ss_pred ECCCCHHHHHHHHHHCCCCEEEEE Q ss_conf 089998999999985799788887 Q gi|255764470|r 320 DVNCVDEDLQWKILELGHSRFPVA 343 (523) Q Consensus 320 d~~~s~~e~~~~i~~~~~SR~PV~ 343 (523) +...+++++++.+.+.+-+++.-+ T Consensus 99 ~~r~tP~e~r~~~~~~gw~~Vvaf 122 (310) T pfam01747 99 KLRLTPAELRALFKEKGWKTVVAF 122 (310) T ss_pred HHCCCHHHHHHHHHHCCCCEEEEE T ss_conf 541799999999986598616899 No 290 >TIGR02862 spore_BofA pro-sigmaK processing inhibitor BofA; InterPro: IPR010001 This family contains the sigmaK-factor processing regulatory protein BofA (Bypass-of-forespore protein A) (approximately 80 residues long). During sporulation in Bacillus subtilis, transcription is controlled in the developing sporangium by a cascade of sporulation-specific transcription factors (sigma factors). Following engulfment, processing of sigmaK is inhibited by BofA. It has been suggested that this effect is exerted by alteration of the level of the SpoIVFA protein .. Probab=29.26 E-value=32 Score=13.28 Aligned_cols=60 Identities=18% Similarity=0.217 Sum_probs=41.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHH Q ss_conf 999999999999999998999999984852899999999999999999774573370278899 Q gi|255764470|r 159 SVMAIAVAVSALMMMAVSQPMIRYISRHTTVVILCLGFLLMIGFLLIIEGLHFDIPKGYLYAS 221 (523) Q Consensus 159 ~i~~~a~~isi~~m~~~s~~i~~~i~~~p~~~~l~l~fl~~iG~~L~~eg~~~~ipk~y~y~~ 221 (523) -..+.|+++-..+.-..+||+-...+ ...-+-.+++++-=+..+...||+|||=+-.=+. T Consensus 4 i~~Ilal~~llli~g~~~KpLr~i~~---~~~~~~~G~~LLf~~N~fG~~f~~hiPIN~~Ta~ 63 (83) T TIGR02862 4 ISIILALVILLLIVGAPVKPLRLIGK---AAINLVIGALLLFIVNVFGGAFGIHIPINLVTAL 63 (83) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH T ss_conf 67689999999999998789999999---9999999999999999886765680655799999 No 291 >COG2211 MelB Na+/melibiose symporter and related transporters [Carbohydrate transport and metabolism] Probab=29.18 E-value=32 Score=13.27 Aligned_cols=171 Identities=18% Similarity=0.223 Sum_probs=90.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Q ss_conf 89999999999999974439999999851-7998999999999999999999999999999999742479998634---- Q gi|255764470|r 10 YVWIGLATLIALELVLGIDNLIFITLLVE-KLPLAQRGKALVFGLTFAMVTRIALLASLSYWIVMLQQPLFFLKGL---- 84 (523) Q Consensus 10 ~~w~~l~tl~~lEivLs~DN~i~iai~~~-~Lp~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l~~pl~~i~~~---- 84 (523) ..+-+++|-+++++.-+.=|+=+-|+... .-.|++|.+.--|=-.+|-+--+ ....+..|+...++- T Consensus 111 k~~ya~vtY~l~~l~YT~vniPy~al~~~iT~d~~ER~~l~s~R~~~~~~g~~--------l~~~~~~plv~~~g~~~~~ 182 (467) T COG2211 111 KLIYALVTYMLLGLGYTLVNIPYGALGPEITQDPQERASLTSWRMVFASLGGL--------LVAVLFPPLVKLFGGGDKA 182 (467) T ss_pred CHHHHHHHHHHHHHHHHHEECCHHHCCHHHCCCHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHCCCCCC T ss_conf 24899999999999998311725314644339988878899899999989999--------9999999999880898510 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC-CCCC---------CCCHHHHHHHHHHHHHHH-HHHHHHHHHH Q ss_conf -532224899999999999986555530257642111-1222---------332099999999999999-8577899999 Q gi|255764470|r 85 -SFSGRDIVLILGGFFLLFKGTIELHERLEGDGFDKK-HKFF---------SPVSWQVIVLQIVILDLV-FSLDSVVTAI 152 (523) Q Consensus 85 -~~s~~~lill~GG~fLl~k~~~el~~~~e~~~~~~~-~~~~---------~~~~~~~~v~~I~v~D~v-FSlDsVitav 152 (523) ++-.-..++..-|..+++-+++-.|||.......+. .+.+ +.++|. ...++++. +.-.|+..++ T Consensus 183 ~g~~~~~~~~~vi~~i~~l~~~~~v~ER~~~~~~~~~~~~~~~~~~~~~~~~Nrp~~----~~l~~~l~~~~~~~i~~s~ 258 (467) T COG2211 183 LGYQGTALVLGVIGVILLLFCFFNVKERVVETQPTKTGVKLKLKDSFLLIFKNRPLL----LLLLMNLLLFIAFNIRGSI 258 (467) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHH----HHHHHHHHHHHHHHHHHHH T ss_conf 409999999999999999999987505667666676533443888999997264699----9999999999999998565 Q ss_pred HH---------HH-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHH Q ss_conf 85---------41-469999999999999999989999999848528999 Q gi|255764470|r 153 GM---------VQ-HFSVMAIAVAVSALMMMAVSQPMIRYISRHTTVVIL 192 (523) Q Consensus 153 g~---------~~-~~~i~~~a~~isi~~m~~~s~~i~~~i~~~p~~~~l 192 (523) .+ .+ ..+....+...+...|..+...+++...|+.....- T Consensus 259 ~~yy~~y~lg~~~l~~~~~~~~~~~~~l~~~~~~p~L~~~~gkk~~~~~~ 308 (467) T COG2211 259 MVYYVTYVLGDPELFAYLLLLASGAGLLIGLILWPRLVKKFGKKKLFLIG 308 (467) T ss_pred HHEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHH T ss_conf 22158887587188999999999999999998689999995629999999 No 292 >cd03709 lepA_C lepA_C: This family represents the C-terminal region of LepA, a GTP-binding protein localized in the cytoplasmic membrane. LepA is ubiquitous in Bacteria and Eukaryota (e.g. Saccharomyces cerevisiae GUF1p), but is missing from Archaea. LepA exhibits significant homology to elongation factors (EFs) Tu and G. The function(s) of the proteins in this family are unknown. The N-terminal domain of LepA is homologous to a domain of similar size found in initiation factor 2 (IF2), and in EF-Tu and EF-G (factors required for translation in Escherichia coli). Two types of phylogenetic tree, rooted by other GTP-binding proteins, suggest that eukaryotic homologs (including S. cerevisiae GUF1) originated within the bacterial LepA family. LepA has never been observed in archaea, and eukaryl LepA is organellar. LepA is therefore a true bacterial GTPase, found only in the bacterial lineage. Probab=28.70 E-value=33 Score=13.21 Aligned_cols=50 Identities=18% Similarity=0.196 Sum_probs=36.5 Q ss_pred EEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHHHH-HHHHC Q ss_conf 652599999987066445788863128834985998700119999-99828 Q gi|255764470|r 413 GMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVRYA-SKLFG 462 (523) Q Consensus 413 GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~el-~~~l~ 462 (523) =|+++++-+..+.|.+...--.-......+++..++...+|+.|+ .+..+ T Consensus 7 ~I~~P~ey~G~V~~ll~~rRG~~~~~~~~~~~~~~i~~~vPlaE~~~~f~~ 57 (80) T cd03709 7 TIITPSEYLGAIMELCQERRGVQKDMEYLDANRVMLTYELPLAEIVYDFFD 57 (80) T ss_pred EEECCHHHHHHHHHHHHHHHCEEECCEECCCCEEEEEEECCHHHHHHHHHH T ss_conf 996858989999999998425772546548987999999447997654778 No 293 >pfam00620 RhoGAP RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases. Probab=28.41 E-value=33 Score=13.18 Aligned_cols=12 Identities=25% Similarity=0.456 Sum_probs=5.1 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999998999 Q gi|255764470|r 224 FSGIIEFFNQVA 235 (523) Q Consensus 224 fs~~ve~~n~~~ 235 (523) +..+++.+++++ T Consensus 111 L~~l~~~L~~v~ 122 (152) T pfam00620 111 LRYLLAHLNRVA 122 (152) T ss_pred HHHHHHHHHHHH T ss_conf 999999999999 No 294 >TIGR00846 caca2 calcium/proton exchanger; InterPro: IPR004713 Proteins of the Ca2+:Cation Antiporter (CaCA) family are found ubiquitously, having been identified in animals, plants, yeast, archaea and widely divergent bacteria. All of the characterised animal proteins catalyze Ca2+:Na+ exchange although some also transport K+. The NCX1 plasma membrane protein exchanges 3 Na+ for 1 Ca2+. The Escherichia coli ChaA protein catalyses Ca2+:H+ antiport but may also catalyze Na+:H+ antiport. All remaining well-characterised members of the family catalyze Ca2+:H+ exchange. This family is specific for the calcium ion/proton exchangers of the CaCA family. ; GO: 0008324 cation transmembrane transporter activity, 0006812 cation transport, 0016021 integral to membrane. Probab=28.08 E-value=34 Score=13.14 Aligned_cols=76 Identities=29% Similarity=0.312 Sum_probs=39.4 Q ss_pred HHHHHHHHHHHHHH--HHHHHHCCCCCC---CCCCCCC--CCCHHHHHH----HHHHH----HHHHHHHHHHHHHHHHHH Q ss_conf 99999999999986--555530257642---1111222--332099999----99999----999985778999998541 Q gi|255764470|r 92 VLILGGFFLLFKGT--IELHERLEGDGF---DKKHKFF--SPVSWQVIV----LQIVI----LDLVFSLDSVVTAIGMVQ 156 (523) Q Consensus 92 ill~GG~fLl~k~~--~el~~~~e~~~~---~~~~~~~--~~~~~~~~v----~~I~v----~D~vFSlDsVitavg~~~ 156 (523) .++.=+.||+++-. +++++..+.++. ++.+... ..++||..+ .+|++ --.|-|+||+....|+++ T Consensus 204 ~~~~Y~~fL~fql~tHr~lF~~~~~~~~~~~~~~~~~P~~~~S~~~~~~~L~~~ti~v~~~a~~~v~t~~~~~~~~gls~ 283 (393) T TIGR00846 204 MLILYVAFLVFQLVTHRQLFEPEEEADSDVDDEVHEEPTAVLSPLSAAIWLVGATIVVALLAEYLVDTIDSVVESLGLSV 283 (393) T ss_pred HHHHHHHHHHHHHHHHHHHCCHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCH T ss_conf 99999999787623317610112420577788877887412227999999999999999999762165789999737872 Q ss_pred C-HHHHHHHHHH Q ss_conf 4-6999999999 Q gi|255764470|r 157 H-FSVMAIAVAV 167 (523) Q Consensus 157 ~-~~i~~~a~~i 167 (523) - +.+++++++. T Consensus 284 ~FiG~i~~~iVg 295 (393) T TIGR00846 284 AFIGVIIIPIVG 295 (393) T ss_pred HHHHHHHHHHHH T ss_conf 788999998884 No 295 >cd00517 ATPS ATP-sulfurylase (ATPS), also known as sulfate adenylate transferase, catalyzes the transfer of an adenylyl group from ATP to sulfate, forming adenosine 5'-phosphosulfate (APS). This reaction is generally accompanied by a further reaction, catalyzed by APS kinase, in which APS is phosphorylated to yield 3'-phospho-APS (PAPS). In some organisms the APS kinase is a separate protein, while in others it is incorporated with ATP sulfurylase in a bifunctional enzyme that catalyzes both reactions. In bifunctional proteins, the domain that performs the kinase activity can be attached at the N-terminal end of the sulfurylase unit or at the C-terminal end, depending on the organism. While the reaction is ubiquitous among organisms, the physiological role of the reaction varies. In some organisms it is used to generate APS from sulfate and ATP, while in others it proceeds in the opposite direction to generate ATP from APS and pyrophosphate. ATP sulfurylase can be a monomer, a hom Probab=28.07 E-value=23 Score=14.34 Aligned_cols=20 Identities=15% Similarity=0.136 Sum_probs=9.2 Q ss_pred CCCHHHHHHHHHHCCCCEEE Q ss_conf 99989999999857997888 Q gi|255764470|r 322 NCVDEDLQWKILELGHSRFP 341 (523) Q Consensus 322 ~~s~~e~~~~i~~~~~SR~P 341 (523) ..++++.++.+.+.+..++. T Consensus 141 ~~tP~e~r~~~~~~gw~~Vv 160 (356) T cd00517 141 RLTPAELREEFKEKGWDTVV 160 (356) T ss_pred CCCHHHHHHHHHHCCCCEEE T ss_conf 37999999999856886588 No 296 >TIGR00988 hip integration host factor, beta subunit; InterPro: IPR005685 Integration host factor (IHF) is a heterodimeric protein consisting of alpha (IPR005684 from INTERPRO) and beta subunits. IHF binds the minor groove of DNA in a sequence-specific manner to induce a large bend. This bending stabilises distinct DNA conformations that are required during several bacterial processes, such as recombination, transposition, replication and transcription .; GO: 0003677 DNA binding, 0005694 chromosome. Probab=27.70 E-value=33 Score=13.21 Aligned_cols=15 Identities=20% Similarity=0.178 Sum_probs=10.5 Q ss_pred HCCCCCCCCEEEECC Q ss_conf 312488998899999 Q gi|255764470|r 483 LGHLPQEKEIFTEMN 497 (523) Q Consensus 483 l~~iP~~Ge~i~~~g 497 (523) .||-|++|+++..++ T Consensus 60 ~GRNPkTG~~V~ld~ 74 (94) T TIGR00988 60 TGRNPKTGESVALDG 74 (94) T ss_pred CCCCCCCCCEEEECC T ss_conf 898899887665478 No 297 >KOG3880 consensus Probab=27.05 E-value=35 Score=13.01 Aligned_cols=31 Identities=10% Similarity=0.086 Sum_probs=12.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 88998999999999899999875543010377 Q gi|255764470|r 218 LYASIGFSGIIEFFNQVARRNREQLMSPSRLR 249 (523) Q Consensus 218 ~y~~~~fs~~ve~~n~~~~~~~~~~~~~~~~~ 249 (523) +-.|..+..+-+.+| +.-|+..-.+...|.. T Consensus 160 liGa~SYa~lT~~~~-~spk~Tlli~l~lP~l 190 (409) T KOG3880 160 LIGASSYAFLTSWAN-LSPKSTLLIMLFLPAL 190 (409) T ss_pred HHHHHHHHHHHHHCC-CCHHHHHHHHHHHHHH T ss_conf 553567888750117-9835677999988999 No 298 >pfam09139 Mmp37 Mitochondrial matrix Mmp37. MMp37 is a mitochondrial matrix protein that functions in the translocation of proteins across the mitochondrial inner membrane. Probab=26.87 E-value=35 Score=12.99 Aligned_cols=70 Identities=26% Similarity=0.303 Sum_probs=41.7 Q ss_pred EEEEHHHHHHHHHCCCCCHH-HHHHCCCEEEC-CCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHHHHHCC Q ss_conf 88888887620202466016-88732743523-89998999998850386277752286742565259999998706 Q gi|255764470|r 352 GIVSARDLLRDLLEEGSMNF-KRSIRKPLVVH-ENISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANILEAIAG 426 (523) Q Consensus 352 Giv~~kDLl~~~~~~~~~~~-~~lir~~~~Vp-e~~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIlE~IvG 426 (523) |+|..+|++.++.+=...-+ .++-+|...+. ++..+..+.+.=.++-.+.|+..=. .. -|-+|+.++|+| T Consensus 105 GVVs~~~L~~DL~~W~tLYlAGRLqKPV~il~~~~~~l~~a~~~NL~sA~~~ALLlLP-e~----FtE~~Ly~~Iag 176 (324) T pfam09139 105 GVVSMETLLKDLLDWDTLYLAGRLQKPVKVLKPDDPRIRLANQYNLKSALTAALLLLP-ES----FTEFQLYHTIAG 176 (324) T ss_pred EEEEHHHHHHHHHHCCHHHCCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCC-CC----CCHHHHHHHHHC T ss_conf 7778999999985241340532357875432479989999999999999999998588-86----889999999855 No 299 >cd00565 ThiS ThiS (ThiaminS) is a sulfur carrier protein involved in thiamin biosynthesis in bacteria. The ThiS fold, like those of two closely related proteins MoaD and Urm1, is similar to that of ubiquitin although there is little or no sequence similarity. Probab=26.42 E-value=36 Score=12.94 Aligned_cols=31 Identities=19% Similarity=0.339 Sum_probs=24.3 Q ss_pred CCEEECCCCCHHHHHHHHHHCCCEEEEEECC Q ss_conf 7435238999899999885038627775228 Q gi|255764470|r 377 KPLVVHENISVLKLMERLRKSSQTFVMVLDE 407 (523) Q Consensus 377 ~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE 407 (523) ++..+++.+++.++|+++.-...++|+.++. T Consensus 7 ~~~~~~~~~tl~~ll~~~~~~~~~vaV~~N~ 37 (65) T cd00565 7 EPREVEEGTTLAELLEELGLPPEGVAVEVNG 37 (65) T ss_pred EEEECCCCCCHHHHHHHCCCCCCCEEEEECC T ss_conf 9888699997999998769998859999999 No 300 >PHA00728 hypothetical protein Probab=26.40 E-value=17 Score=15.25 Aligned_cols=55 Identities=16% Similarity=0.393 Sum_probs=43.5 Q ss_pred CHHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCC Q ss_conf 98999999975113786674653400028996089998999999985799788887678 Q gi|255764470|r 288 SDQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGS 346 (523) Q Consensus 288 ~~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~ 346 (523) ++++.+-|+|-.....|.+.+..-|+.-+++++.++ +--++....|.|+|.|=|+ T Consensus 35 ~~e~~~eiEnPYTVTNRaIsElV~PkDTMfYLsgnq----isLILtAfEfarLP~YFgE 89 (153) T PHA00728 35 EEEELQEIENPYTVTNRAITELVEPKDTMFYLSGNQ----ISLILTAFEFARLPAYFGE 89 (153) T ss_pred HHHHHHHHCCCCEEHHHHHHHHCCCCCCEEEECCCC----HHHHHHHHHHHHCHHHCCC T ss_conf 356687723871220578887559764158863772----2357789888524554077 No 301 >cd07178 terB_like_YebE tellurium resistance terB-like protein, subgroup 3. This family includes several uncharacterized bacterial proteins including an Escherichia coli protein called YebE. Protein sequence homology analysis shows they are similar to tellurium resistance protein terB, but the function of this family is unknown. Probab=26.34 E-value=36 Score=12.93 Aligned_cols=27 Identities=11% Similarity=0.207 Sum_probs=18.3 Q ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHC Q ss_conf 999999677466789899999997511 Q gi|255764470|r 274 ADVLLPTQHEKHIISDQEKDMVQSVLT 300 (523) Q Consensus 274 ~~~li~~~~~~g~l~~~E~~mi~~vL~ 300 (523) +++++.-....|.++++|++.|.+-++ T Consensus 3 lrAMIaAAkADG~ID~~Er~~I~~~l~ 29 (95) T cd07178 3 LRAMIAAAKADGHIDEAERARILGELG 29 (95) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHH T ss_conf 999999987169969999999999998 No 302 >COG0523 Putative GTPases (G3E family) [General function prediction only] Probab=26.28 E-value=36 Score=12.92 Aligned_cols=108 Identities=19% Similarity=0.240 Sum_probs=47.0 Q ss_pred HHHHHHHHHHHCCCCEEEEEEEEECCEEEEEECC--CCHHHHHHHHHHCCCCEEEEECC--CCCCEEEEEEHHHHHHHHH Q ss_conf 8999999975113786674653400028996089--99899999998579978888767--8552788888888762020 Q gi|255764470|r 289 DQEKDMVQSVLTLADRPAKSIMTPRTEIVWLDVN--CVDEDLQWKILELGHSRFPVAQG--SLDSFIGIVSARDLLRDLL 364 (523) Q Consensus 289 ~~E~~mi~~vL~L~e~~V~~IMtPR~~i~~ld~~--~s~~e~~~~i~~~~~SR~PV~~~--~~D~iiGiv~~kDLl~~~~ 364 (523) ..=..++++..+..++ .|...+... +.+..+.+.+... |.+.. ..|-++.+|..+-+..... T Consensus 69 ~dl~~~~~~L~~~~~~---------~D~ivIEtTGlA~P~pv~~t~~~~-----~~l~~~~~ld~vvtvVDa~~~~~~~~ 134 (323) T COG0523 69 DDLLPALERLLRRRDR---------PDRLVIETTGLADPAPVIQTFLTD-----PELADGVRLDGVVTVVDAAHFLEGLD 134 (323) T ss_pred CHHHHHHHHHHHCCCC---------CCEEEEECCCCCCCHHHHHHHHCC-----CCCCCCEEECCEEEEEEHHHHHHHHH T ss_conf 2158999999852568---------998999688777869999986065-----12245404133699984788654567 Q ss_pred CCCCCHHHH-------HHCCCEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCE Q ss_conf 246601688-------73274352389998999998850386277752286742 Q gi|255764470|r 365 EEGSMNFKR-------SIRKPLVVHENISVLKLMERLRKSSQTFVMVLDEYGVL 411 (523) Q Consensus 365 ~~~~~~~~~-------lir~~~~Vpe~~~l~~lL~~fr~~~~~lAiVvDE~G~~ 411 (523) ...+.-.+. .+.+.=-+.+.. +..+-...|+-+..--++.-.||.+ T Consensus 135 ~~~~~~~~Qia~AD~ivlNK~Dlv~~~~-l~~l~~~l~~lnp~A~i~~~~~~~~ 187 (323) T COG0523 135 AIAELAEDQLAFADVIVLNKTDLVDAEE-LEALEARLRKLNPRARIIETSYGDV 187 (323) T ss_pred HHHHHHHHHHHHCCEEEEECCCCCCHHH-HHHHHHHHHHHCCCCEEEECCCCCC T ss_conf 7999999999867999983645689889-9999999997599986998123668 No 303 >PRK05446 imidazole glycerol-phosphate dehydratase/histidinol phosphatase; Provisional Probab=26.23 E-value=36 Score=12.91 Aligned_cols=36 Identities=11% Similarity=-0.095 Sum_probs=22.2 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCCCCEEEEEEHHHHH Q ss_conf 9899999998579978888767855278888888876 Q gi|255764470|r 324 VDEDLQWKILELGHSRFPVAQGSLDSFIGIVSARDLL 360 (523) Q Consensus 324 s~~e~~~~i~~~~~SR~PV~~~~~D~iiGiv~~kDLl 360 (523) ..-++++.+.+.+|--+-|...+ +==.|+...+|+- T Consensus 34 gv~~aLk~L~~~GY~liVVTNQs-GIgrG~~t~ed~~ 69 (354) T PRK05446 34 GVIPALLKLQKAGYKLVMVTNQD-GLGTDSFPQEDFD 69 (354) T ss_pred CHHHHHHHHHHCCCEEEEEECCC-CCCCCCCCHHHHH T ss_conf 78999999998799899994885-1147988999999 No 304 >COG2215 ABC-type uncharacterized transport system, permease component [General function prediction only] Probab=25.30 E-value=38 Score=12.79 Aligned_cols=187 Identities=23% Similarity=0.329 Sum_probs=83.0 Q ss_pred HCCHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHCCCCHHHHHHHHHHHHHHHHHHH---HHHH------HHHH Q ss_conf 358789999999999999974439--------9999998517998999999999999999999---9999------9999 Q gi|255764470|r 6 IYDYYVWIGLATLIALELVLGIDN--------LIFITLLVEKLPLAQRGKALVFGLTFAMVTR---IALL------ASLS 68 (523) Q Consensus 6 ~~~p~~w~~l~tl~~lEivLs~DN--------~i~iai~~~~Lp~~~r~~a~~~Gl~~A~v~R---~~ll------~~~~ 68 (523) .-||.. .-+++.+-..+|+=- ++.-+-+... +.+=||+...+...++.-- ++++ ...+ T Consensus 48 ~~~~~~---~w~li~~SflyGvlHAlgPGHgKaviasylia~--~~~lk~~ilLsf~~sllqG~~Av~l~~~~~~v~~~~ 122 (303) T COG2215 48 LEDGNA---GWTLIPLSFLYGVLHALGPGHGKAVIATYLIAH--KATLKRAILLSFLASLLQGLTAVVLLLAFLGVLRLS 122 (303) T ss_pred HHCCCC---HHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHC--CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 847730---789999999999886048985468999999835--421148799999999998869999999999999754 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC---------------------CCCCCCC---CCC Q ss_conf 9999742479998634532224899999999999986555530257---------------------6421111---222 Q gi|255764470|r 69 YWIVMLQQPLFFLKGLSFSGRDIVLILGGFFLLFKGTIELHERLEG---------------------DGFDKKH---KFF 124 (523) Q Consensus 69 ~wi~~l~~pl~~i~~~~~s~~~lill~GG~fLl~k~~~el~~~~e~---------------------~~~~~~~---~~~ 124 (523) .+-.+.++|++++..+. ++++-|++++|...+.++.+..+ .++++.. ... T Consensus 123 s~~~~~s~~~lE~~S~~------Lli~~G~w~~~r~lr~l~~~~~~~~~~~~~~~~~~~h~H~~~~~Cgh~H~~d~~~~~ 196 (303) T COG2215 123 SITFALSEPWLELISFL------LLILLGLWLLWRTLRRLRHRHPKHPHFAAHPHPDHDHDHHYQCACGHAHAPDPKRLG 196 (303) T ss_pred HHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHC T ss_conf 66776515899999999------999999999999998860257876300257786667765211355545689867723 Q ss_pred CCCHHHHHHH------------HHHHHHHHHHHH----HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHH-HH----- Q ss_conf 3320999999------------999999998577----8999998541469999999999999999989999-99----- Q gi|255764470|r 125 SPVSWQVIVL------------QIVILDLVFSLD----SVVTAIGMVQHFSVMAIAVAVSALMMMAVSQPMI-RY----- 182 (523) Q Consensus 125 ~~~~~~~~v~------------~I~v~D~vFSlD----sVitavg~~~~~~i~~~a~~isi~~m~~~s~~i~-~~----- 182 (523) ....++.... .|.|+=++-|+| .++.+.+|+--+.+.+.++. .+.+.+|... ++ T Consensus 197 ~~~~~~~~~~~~l~~GLrPCpgAi~VLlfal~lgl~~~Gil~VlamS~GtalTvs~lA----~~av~ak~~a~~~~g~~~ 272 (303) T COG2215 197 QAVDWKQQWLFGLTGGLRPCPGAIFVLLFALSLGLYTLGILSVLAMSIGTALTVSALA----LLAVTAKNTAVRLSGFRT 272 (303) T ss_pred CCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHCCHH T ss_conf 4322899999999806766837999999999945487789999999971799999999----999999999999815417 Q ss_pred -HHHCC-HHHHHHHHHHHHHHHHHHHH Q ss_conf -98485-28999999999999999997 Q gi|255764470|r 183 -ISRHT-TVVILCLGFLLMIGFLLIIE 207 (523) Q Consensus 183 -i~~~p-~~~~l~l~fl~~iG~~L~~e 207 (523) -.|-| .+++.+-.++++.|..++.. T Consensus 273 ~~~~~~~~~~l~~gli~l~~g~~~l~~ 299 (303) T COG2215 273 LAKRISYIVSLLGGLIGLYFGLHLLLG 299 (303) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 778788999999999999999999987 No 305 >CHL00124 acpP acyl carrier protein; Validated Probab=25.22 E-value=38 Score=12.78 Aligned_cols=70 Identities=16% Similarity=0.174 Sum_probs=37.8 Q ss_pred EHHHHHHHHCCCCCCCCCCC-CCEEEECCCEEEE-EECC------CHHHHHHHHCCCCCCCC-CCCCCHHHHHHHHHCCC Q ss_conf 59999998706644578886-3128834985998-7001------19999998288999987-78240789999883124 Q gi|255764470|r 416 TPANILEAIAGDFPDEDDQK-LDITVGDDGSLTV-DGWI------DVRYASKLFGVNLVDED-DRYSTLAGFILWRLGHL 486 (523) Q Consensus 416 TleDIlE~IvGei~DE~d~~-~~i~~~~d~~~~v-~G~~------~l~el~~~l~~~l~~e~-~~~~TlaGlil~~l~~i 486 (523) |-+++++.+..=+.+...-+ ..+. ++.++.= =|.= -+-.+++.||+++|+++ ....|++..+-..-+++ T Consensus 2 t~~ei~ekV~~ii~e~l~v~~~~It--~ds~f~~DLG~DSLd~vElvm~lE~~F~IeIpded~~~l~TV~d~v~yI~~~l 79 (84) T CHL00124 2 TEQDIFEKVQSIVAEQLGIEKSEVT--LDSNFQRDLGADSLDTVELVMAIEEEFDIEIPDEAAEKIATVQEAVDFISQKI 79 (84) T ss_pred CHHHHHHHHHHHHHHHHCCCHHHCC--CCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHCCCHHHHHHHHHHHH T ss_conf 8899999999999988499999869--83035661398769999999999999899778899987798999999999988 Q ss_pred C Q ss_conf 8 Q gi|255764470|r 487 P 487 (523) Q Consensus 487 P 487 (523) - T Consensus 80 ~ 80 (84) T CHL00124 80 N 80 (84) T ss_pred H T ss_conf 6 No 306 >PRK11195 major facilitator family transporter; Provisional Probab=25.19 E-value=38 Score=12.78 Aligned_cols=25 Identities=4% Similarity=-0.165 Sum_probs=9.3 Q ss_pred HHHCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 7745733702788998999999999 Q gi|255764470|r 207 EGLHFDIPKGYLYASIGFSGIIEFF 231 (523) Q Consensus 207 eg~~~~ipk~y~y~~~~fs~~ve~~ 231 (523) -.+|.+++---++++..+++..=.+ T Consensus 369 ~~~g~~~~~~~~~~g~~~~~~~~~~ 393 (415) T PRK11195 369 VKLGVPVVPVIVGFGLFVALAMWLL 393 (415) T ss_pred HHCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 9879979999999999999999999 No 307 >cd04394 RhoGAP-ARHGAP11A RhoGAP-ARHGAP11A: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of ArhGAP11A-like proteins. The mouse homolog of human ArhGAP11A has been detected as a gene exclusively expressed in immature ganglion cells, potentially playing a role in retinal development. The exact function of ArhGAP11A is unknown. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=24.50 E-value=39 Score=12.69 Aligned_cols=11 Identities=27% Similarity=0.374 Sum_probs=4.6 Q ss_pred HHHHHHHHHHH Q ss_conf 99999998999 Q gi|255764470|r 225 SGIIEFFNQVA 235 (523) Q Consensus 225 s~~ve~~n~~~ 235 (523) .-++..++.++ T Consensus 128 ~~L~~~L~~V~ 138 (202) T cd04394 128 RYFFSFLYDVA 138 (202) T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 308 >cd01709 RT_like_1 RT_like_1: A subfamily of reverse transcriptases (RTs). An RT gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. RTs occur in a variety of mobile elements, including retrotransposons, retroviruses, group II introns, bacterial msDNAs, hepadnaviruses, and caulimoviruses. These elements can be divided into two major groups. One group contains retroviruses and DNA viruses whose propagation involves an RNA intermediate. They are grouped together with transposable elements containing long terminal repeats (LTRs). The other group, also called poly(A)-type retrotransposons, contain fungal mitochondrial introns and transposable elements that lack LTRs. Probab=24.33 E-value=23 Score=14.32 Aligned_cols=208 Identities=14% Similarity=0.069 Sum_probs=89.8 Q ss_pred HHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 46535878999999999999997443999999985179989999999999999999999999999999997424799986 Q gi|255764470|r 3 FSWIYDYYVWIGLATLIALELVLGIDNLIFITLLVEKLPLAQRGKALVFGLTFAMVTRIALLASLSYWIVMLQQPLFFLK 82 (523) Q Consensus 3 ~~~~~~p~~w~~l~tl~~lEivLs~DN~i~iai~~~~Lp~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l~~pl~~i~ 82 (523) |+++==|+-|+.|+. -+||.-|-.++- ...-|++.|+|..=..-.+.-++-=..||++-+-+-.-|.-.+ T Consensus 45 L~FfGv~~~Wl~FFk-kfLeaPL~f~~d------~~~a~~RiRkRG~P~sH~ls~~~GE~vLFcmD~aVNq~t~G~~--- 114 (346) T cd01709 45 LKFFGVPEKWLDFFK-KFLEAPLRFVAD------GPDAPPRIRKRGTPMSHALSDVFGELVLFCLDFAVNQATDGGL--- 114 (346) T ss_pred HHHCCCCHHHHHHHH-HHHHCCCCCCCC------CCCCCCCHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCE--- T ss_conf 998389879999999-986387215678------9999852111789846769999776999988887623567713--- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--HCCCCCCCC-CCCCC------------------------------CCCHH Q ss_conf 3453222489999999999998655553--025764211-11222------------------------------33209 Q gi|255764470|r 83 GLSFSGRDIVLILGGFFLLFKGTIELHE--RLEGDGFDK-KHKFF------------------------------SPVSW 129 (523) Q Consensus 83 ~~~~s~~~lill~GG~fLl~k~~~el~~--~~e~~~~~~-~~~~~------------------------------~~~~~ 129 (523) ..-.+|=+.++|----.-||++.+.+ ++-|-+-++ +.++. ..++= T Consensus 115 --LYRlHDD~Wfwg~~~~cv~AW~~m~~Fa~vmGLe~N~eKTGSv~i~~~~~~~~~~~~~~LP~G~irwGFL~Ldp~tG~ 192 (346) T cd01709 115 --LYRLHDDLWFWGQPETCAKAWKAIQEFAKVMGLELNKEKTGSVYLSDDTKTRDTTIDATLPEGPVRWGFLKLDPKTGR 192 (346) T ss_pred --EEEEECCEEEECCHHHHHHHHHHHHHHHHHHCCEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCC T ss_conf --688617468717848899999999999998472235566525896279988786644558999833555776477785 Q ss_pred HHHHHHHHH----------HHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHH---- Q ss_conf 999999999----------999985778999998541469999999999999999989999999848528999999---- Q gi|255764470|r 130 QVIVLQIVI----------LDLVFSLDSVVTAIGMVQHFSVMAIAVAVSALMMMAVSQPMIRYISRHTTVVILCLG---- 195 (523) Q Consensus 130 ~~~v~~I~v----------~D~vFSlDsVitavg~~~~~~i~~~a~~isi~~m~~~s~~i~~~i~~~p~~~~l~l~---- 195 (523) | .|-|-.| +|. -.||+.=|..=+...=-++.--++-|.+-++-..+-..+..|..+.-.-.. T Consensus 193 f-~IDQ~~Vd~Hi~EL~~QL~~---ckSv~~WIqaWNsy~grFF~~nFGkPA~CFGr~Hvd~~lsT~~rIq~~lF~~~~g 268 (346) T cd01709 193 W-EIDQSQVDAHIDELRKQLDA---CKSVLSWIQAWNSYIGRFFSNNFGKPANCFGREHVDAILATHERIQRRLFPDAGG 268 (346) T ss_pred E-EECHHHHHHHHHHHHHHHHH---CCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 4-43189999999999999860---7409999999999999999871798310147878999999999999985778899 Q ss_pred ---HHHHHHHHHHHHHHCCC-CCCHHHHHHHHHHH Q ss_conf ---99999999999774573-37027889989999 Q gi|255764470|r 196 ---FLLMIGFLLIIEGLHFD-IPKGYLYASIGFSG 226 (523) Q Consensus 196 ---fl~~iG~~L~~eg~~~~-ipk~y~y~~~~fs~ 226 (523) =..--=-..+++.||.| ||-||+||++...- T Consensus 269 ~~~sv~~~Lr~~i~~RFgv~~iPDgf~yfP~elGG 303 (346) T cd01709 269 GGGSVTEYLRQMIESRFGVSDIPDGFFYFPEELGG 303 (346) T ss_pred CCCCHHHHHHHHHHHHCCCCCCCCCEEECHHHCCC T ss_conf 86408999999999856988798300568163088 No 309 >TIGR01246 dapE_proteo succinyl-diaminopimelate desuccinylase; InterPro: IPR005941 The lysine/diaminopimelic acid branch of the aspartate pathway produces the essential amino acid lysine via the intermediate meso-diaminopimelic acid (meso-DAP), which is also a vital cell wall component in Gram-negative bacteria . The production of dihydropicolinate from aspartate-semialdehyde controls flux into the lysine/diaminopimelic acid pathway. Three variants of this pathway exist, differing in how tetrahydropicolinate (formed by reduction of dihydropicolinate) is metabolised to meso-DAP. One variant, the most commonly found one in archaea and bacteria, uses primarily succinyl intermediates, while a second variant, found only in Bacillus, utilises primarily acetyl intermediates. In the third variant, found in some Gram-positive bacteria, a dehydrogenase converts tetrahydropicolinate directly to meso-DAP. In all variants meso-DAP is subsequently converted to lysine by a decarboxylase, or, in Gram-negative bacteria, assimilated into the cell wall. Evidence exists that a fourth, currently unknown, variant of this pathway may function in plants . Succinyl-diaminopimelate desuccinylase (3.5.1.18 from EC) hydrolyses N-succinyl-L,L-diaminopimelic acid which is required for the bacterial synthesis of lysine and meso-diaminopimelic acid. This group of bacterial sequences belong to the MEROPS peptidase family M20 (clan MH), subfamily M20A (non-peptidase homologs). ; GO: 0009014 succinyl-diaminopimelate desuccinylase activity, 0009089 lysine biosynthetic process via diaminopimelate. Probab=23.54 E-value=40 Score=12.57 Aligned_cols=66 Identities=18% Similarity=0.189 Sum_probs=32.2 Q ss_pred EEEHHHHHHHHCCCCCCCCCC--CC-CEE--EEC--CC-EEEEEECCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCC Q ss_conf 525999999870664457888--63-128--834--98-59987001199999982889999877824078999988312 Q gi|255764470|r 414 MITPANILEAIAGDFPDEDDQ--KL-DIT--VGD--DG-SLTVDGWIDVRYASKLFGVNLVDEDDRYSTLAGFILWRLGH 485 (523) Q Consensus 414 IVTleDIlE~IvGei~DE~d~--~~-~i~--~~~--d~-~~~v~G~~~l~el~~~l~~~l~~e~~~~~TlaGlil~~l~~ 485 (523) |=+.-=+|.+|++.-+||.++ ++ .++ -+. .| +=+++|++.+ .||+...++.. ..+|--.|.+.|.. T Consensus 208 ~H~~~p~L~~L~~~~lD~G~e~F~ptsl~I~NI~aGtG~~NVIPg~L~v-----~FN~Rfs~e~~-~e~~k~~v~~il~~ 281 (383) T TIGR01246 208 VHKAAPVLAELTAIKLDEGNEFFPPTSLQITNIHAGTGANNVIPGELKV-----QFNIRFSTEVS-EETLKSRVEAILDQ 281 (383) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCHHHHC-----CCCCCCCCCCC-HHHHHHHHHHHHHH T ss_conf 0369999999861128896343589753224557888988766111200-----13410286677-17899999999974 No 310 >COG2979 Uncharacterized protein conserved in bacteria [Function unknown] Probab=23.47 E-value=41 Score=12.56 Aligned_cols=31 Identities=10% Similarity=0.177 Sum_probs=24.5 Q ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHCCCC Q ss_conf 8999999677466789899999997511378 Q gi|255764470|r 273 KADVLLPTQHEKHIISDQEKDMVQSVLTLAD 303 (523) Q Consensus 273 e~~~li~~~~~~g~l~~~E~~mi~~vL~L~e 303 (523) -+.+++.-...+|.++++||.+|.+-++-+. T Consensus 113 ll~AmIaAAkaDGhIDe~ERa~I~~~l~~sG 143 (225) T COG2979 113 LLRAMIAAAKADGHIDEKERARIMQKLQESG 143 (225) T ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHCC T ss_conf 9999999986247767999999999998717 No 311 >pfam02492 cobW CobW/HypB/UreG, nucleotide-binding domain. This domain is found in HypB, a hydrogenase expression / formation protein, and UreG a urease accessory protein. Both these proteins contain a P-loop nucleotide binding motif. HypB has GTPase activity and is a guanine nucleotide binding protein. It is not known whether UreG binds GTP or some other nucleotide. Both enzymes are involved in nickel binding. HypB can store nickel and is required for nickel dependent hydrogenase expression. UreG is required for functional incorporation of the urease nickel metallocenter. GTP hydrolysis may required by these proteins for nickel incorporation into other nickel proteins. This family of domains also contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans. Probab=23.39 E-value=40 Score=12.58 Aligned_cols=10 Identities=10% Similarity=0.810 Sum_probs=3.8 Q ss_pred EEEEEECCCC Q ss_conf 2777522867 Q gi|255764470|r 400 TFVMVLDEYG 409 (523) Q Consensus 400 ~lAiVvDE~G 409 (523) +.|++++|+| T Consensus 29 ~~avI~Ne~g 38 (174) T pfam02492 29 KIAVIVNDFG 38 (174) T ss_pred CEEEEEECCC T ss_conf 4799993365 No 312 >PRK01658 holin-like protein; Validated Probab=23.21 E-value=41 Score=12.52 Aligned_cols=45 Identities=16% Similarity=0.290 Sum_probs=31.6 Q ss_pred HHHHHHHHHHHHHC-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH Q ss_conf 57789999985414-------6999999999999999998999999984852 Q gi|255764470|r 144 SLDSVVTAIGMVQH-------FSVMAIAVAVSALMMMAVSQPMIRYISRHTT 188 (523) Q Consensus 144 SlDsVitavg~~~~-------~~i~~~a~~isi~~m~~~s~~i~~~i~~~p~ 188 (523) ++==|.++||+.++ .+-++.++++|..+-+..+...++++.|+.. T Consensus 67 ~lfFVPa~Vgim~y~~ll~~~g~~i~~~i~vsT~~vm~vtg~~~q~l~r~k~ 118 (122) T PRK01658 67 PLFFIPSAVGVMNYGDFLSSKGISLFLVVVISTFVVMIVTGYLTQLLAKRKE 118 (122) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9988604869998899999988999999999999999999999999999887 No 313 >PRK12449 acyl carrier protein; Provisional Probab=23.05 E-value=41 Score=12.50 Aligned_cols=31 Identities=13% Similarity=0.236 Sum_probs=22.3 Q ss_pred HHHHHHHCCCCCCCC-CCCCCHHHHHHHHHCC Q ss_conf 999998288999987-7824078999988312 Q gi|255764470|r 455 RYASKLFGVNLVDED-DRYSTLAGFILWRLGH 485 (523) Q Consensus 455 ~el~~~l~~~l~~e~-~~~~TlaGlil~~l~~ 485 (523) -.+++.||+++|+++ ++..|++..+-+...| T Consensus 47 m~lEdeFgi~IpDe~ae~i~Tv~d~vdYI~~k 78 (80) T PRK12449 47 INVEDEFHLAIPDEDVEDMVFMGDLLDYLVQR 78 (80) T ss_pred HHHHHHHCCCCCHHHHHHCCCHHHHHHHHHHH T ss_conf 99999979978889998778799999999986 No 314 >PRK00982 acpP acyl carrier protein; Provisional Probab=22.77 E-value=42 Score=12.46 Aligned_cols=31 Identities=23% Similarity=0.305 Sum_probs=22.3 Q ss_pred CHHHHHHHHCCCCCCCC-CCCCCHHHHHHHHH Q ss_conf 19999998288999987-78240789999883 Q gi|255764470|r 453 DVRYASKLFGVNLVDED-DRYSTLAGFILWRL 483 (523) Q Consensus 453 ~l~el~~~l~~~l~~e~-~~~~TlaGlil~~l 483 (523) -+-.+++.||+++|+++ .++.|++.++-..- T Consensus 43 lv~~lEe~F~i~Ipde~~~~~~TV~dlv~yI~ 74 (78) T PRK00982 43 LVMALEEEFGVEIPDEEAEKIKTVGDAVDYIE 74 (78) T ss_pred HHHHHHHHHCCCCCHHHHHHCCCHHHHHHHHH T ss_conf 99999999699268999974898999999998 No 315 >cd03112 CobW_like The function of this protein family is unkown. The amino acid sequence of YjiA protein in E. coli contains several conserved motifs that characterizes it as a P-loop GTPase. YijA gene is among the genes significantly induced in response to DNA-damage caused by mitomycin. YijA gene is a homologue of the CobW gene which encodes the cobalamin synthesis protein/P47K. Probab=22.70 E-value=38 Score=12.74 Aligned_cols=47 Identities=15% Similarity=0.289 Sum_probs=27.3 Q ss_pred HCCCEEEEEECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHH Q ss_conf 038627775228674256525999999870664457888631288349859987001199 Q gi|255764470|r 396 KSSQTFVMVLDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVR 455 (523) Q Consensus 396 ~~~~~lAiVvDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~ 455 (523) +...+.|++++|||.+ |+= .+++. +...++.++++|..=++..-++. T Consensus 24 ~~~~~~avIvNEfG~~-~ID------~~ll~------~~~~~v~el~~GCiCCs~~~dl~ 70 (158) T cd03112 24 QHGRKIAVIENEFGEV-GID------NQLVV------DTDEEIIEMNNGCICCTVRGDLI 70 (158) T ss_pred CCCCCEEEEEECCCCC-CCC------HHHHH------CCCCEEEEECCCEEEEECCCHHH T ss_conf 8899779997076554-631------66763------78824999338714652251589 No 316 >PRK06193 hypothetical protein; Provisional Probab=22.68 E-value=42 Score=12.45 Aligned_cols=55 Identities=13% Similarity=0.053 Sum_probs=39.4 Q ss_pred ECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHHHHHH Q ss_conf 2286742565259999998706644578886312883498599870011999999 Q gi|255764470|r 405 LDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVRYASK 459 (523) Q Consensus 405 vDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~el~~ 459 (523) -.+-|.-.=+||..+.++++.|-.++...+-.-+...+++.+.+-|+++-+++.+ T Consensus 156 ~p~~G~N~VlVtHqvni~alTGi~P~~eG~~vV~~P~g~g~f~vlg~i~P~~W~~ 210 (211) T PRK06193 156 PPAPGTNTVLVGHDDNFEAATGIYPVPEGIAYVFEPDGGQPFKPLARIPPEEWAK 210 (211) T ss_pred CCCCCCEEEEEECCCCHHHHHCCCCCCCCEEEEEEECCCCCCEEEEECCHHHHHH T ss_conf 8989974899968751877728777888729999547999924887548888532 No 317 >PRK01821 hypothetical protein; Provisional Probab=22.49 E-value=42 Score=12.42 Aligned_cols=46 Identities=26% Similarity=0.359 Sum_probs=32.1 Q ss_pred HHHHHHHHHHHHHC-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHH Q ss_conf 57789999985414-------69999999999999999989999999848528 Q gi|255764470|r 144 SLDSVVTAIGMVQH-------FSVMAIAVAVSALMMMAVSQPMIRYISRHTTV 189 (523) Q Consensus 144 SlDsVitavg~~~~-------~~i~~~a~~isi~~m~~~s~~i~~~i~~~p~~ 189 (523) ++==|.+.||+-++ ..-++.++++|-.+.+.....+++++.|..-+ T Consensus 72 ~lfFVPa~VGim~~~~ll~~~~~~i~~~~viSTiivm~vTG~~~q~l~~rkk~ 124 (133) T PRK01821 72 ALLFVPIGVGVMQYFDLLRAQFGPVVVSCAVSTLVVLLVVGWSSHYVHGERKV 124 (133) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 99998878799999999999899999999999999999999999999975420 No 318 >pfam01575 MaoC_dehydratas MaoC like domain. The maoC gene is part of a operon with maoA which is involved in the synthesis of monoamine oxidase. The MaoC protein is found to share similarity with a wide variety of enzymes; estradiol 17 beta-dehydrogenase 4, peroxisomal hydratase-dehydrogenase-epimerase, fatty acid synthase beta subunit. Several bacterial proteins that are composed solely of this domain have (R)-specific enoyl-CoA hydratase activity. This domain is also present in the NodN nodulation protein N. Probab=22.11 E-value=43 Score=12.37 Aligned_cols=98 Identities=16% Similarity=0.085 Sum_probs=57.2 Q ss_pred EEEEHHHHHH--HHCCCCCCCCCCCCCEEEECCCEEEEEECCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCC Q ss_conf 6525999999--87066445788863128834985998700119999998288999987782407899998831248899 Q gi|255764470|r 413 GMITPANILE--AIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVRYASKLFGVNLVDEDDRYSTLAGFILWRLGHLPQEK 490 (523) Q Consensus 413 GIVTleDIlE--~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~el~~~l~~~l~~e~~~~~TlaGlil~~l~~iP~~G 490 (523) --||.+|+.. .+.||...-+-++...++.+=+.-++.|.+.+.-....+...++.... +.-|..--++.+.-..| T Consensus 16 rtvT~~di~~fa~~sgD~nPiH~D~~~A~~~~f~~~iahG~~t~s~~~~~~~~~~~~~~~---~~~~~~~~rF~~PV~~G 92 (123) T pfam01575 16 VTVTPADIALFALVSGDHNPIHVDPEFAKLAGFGGPIAHGMLTLAIARGLVEEQGGDNVV---ARYGGWSVRFTGPVFPG 92 (123) T ss_pred EECCHHHHHHHHHHHCCCCCCCCCHHHHHHCCCCCEECCCHHHHHHHHHHHHHHCCCCCE---EEEEEEEEEECCCCCCC T ss_conf 851999999999977899901468778835789980215177999999998887277741---55552177877887889 Q ss_pred CEEEECCEEEEEEEEEC-CEEEEEEEE Q ss_conf 88999999999999618-877899999 Q gi|255764470|r 491 EIFTEMNLKFEIIRLEG-HNIDRVKVS 516 (523) Q Consensus 491 e~i~~~g~~f~V~~~~~-~rI~~V~V~ 516 (523) |++.... +|.+++. +++.++..+ T Consensus 93 Dtl~~~~---ev~~~~~~k~~~~~~~~ 116 (123) T pfam01575 93 DTLRTEV---EVVGKRDGREIKVVETT 116 (123) T ss_pred CEEEEEE---EEEEEECCCCEEEEEEE T ss_conf 9999999---99999868778999999 No 319 >PRK05350 acyl carrier protein; Provisional Probab=21.79 E-value=37 Score=12.87 Aligned_cols=66 Identities=21% Similarity=0.173 Sum_probs=39.2 Q ss_pred EEHHHHHHHHCCCCCCCCCCCCC-EEEECCCEEEEE-EC--C----CHHHHHHHHCCCCCCCC-CCCCCHHHHHHHH Q ss_conf 25999999870664457888631-288349859987-00--1----19999998288999987-7824078999988 Q gi|255764470|r 415 ITPANILEAIAGDFPDEDDQKLD-ITVGDDGSLTVD-GW--I----DVRYASKLFGVNLVDED-DRYSTLAGFILWR 482 (523) Q Consensus 415 VTleDIlE~IvGei~DE~d~~~~-i~~~~d~~~~v~-G~--~----~l~el~~~l~~~l~~e~-~~~~TlaGlil~~ 482 (523) .|-+.|+|.+.+=+.++.+-+++ +. ++.++.-+ |. + -+-.+++.+|..+|+++ ....|++..+-.. T Consensus 2 ~t~eeI~e~v~~il~e~feid~~~It--pea~l~~DL~lDSlD~VdLvv~le~~fG~kI~~E~~~~irTV~D~v~~I 76 (82) T PRK05350 2 QTREQILEMLRAILVELFEIEPEDIT--PEANLYEDLDLDSIDAVDLVVKLQELTGKKIKPEEFKSVRTVNDVVNAV 76 (82) T ss_pred CCHHHHHHHHHHHHHHHHCCCHHHCC--CCCCHHHCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHCCCCHHHHHHHH T ss_conf 85999999999999999689999859--8640020269634879999999999978988989985776099999999 No 320 >PRK05883 acyl carrier protein; Validated Probab=21.22 E-value=45 Score=12.25 Aligned_cols=36 Identities=17% Similarity=0.151 Sum_probs=26.6 Q ss_pred CHHHHHHHHCCCCCCCC-CCCCCHHHHHHHHHCCCCC Q ss_conf 19999998288999987-7824078999988312488 Q gi|255764470|r 453 DVRYASKLFGVNLVDED-DRYSTLAGFILWRLGHLPQ 488 (523) Q Consensus 453 ~l~el~~~l~~~l~~e~-~~~~TlaGlil~~l~~iP~ 488 (523) -+-.+++.||+.+++++ ..+.|+|.+--...-++|. T Consensus 48 ~vi~ie~~fGi~i~eEdll~~~TVGDl~~~I~~kv~~ 84 (85) T PRK05883 48 GMVAIEERLGVALSEEDLLSCDTVGDLEAAIAAKVRD 84 (85) T ss_pred HHHHHHHHHCCCCCHHHHHCCCHHHHHHHHHHHHCCC T ss_conf 9999999866635899985236099999999743047 No 321 >COG1194 MutY A/G-specific DNA glycosylase [DNA replication, recombination, and repair] Probab=21.08 E-value=45 Score=12.23 Aligned_cols=152 Identities=18% Similarity=0.137 Sum_probs=73.8 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------- Q ss_conf 9999998485289999999999999999977457337027889989999999998999998755430103---------- Q gi|255764470|r 178 PMIRYISRHTTVVILCLGFLLMIGFLLIIEGLHFDIPKGYLYASIGFSGIIEFFNQVARRNREQLMSPSR---------- 247 (523) Q Consensus 178 ~i~~~i~~~p~~~~l~l~fl~~iG~~L~~eg~~~~ipk~y~y~~~~fs~~ve~~n~~~~~~~~~~~~~~~---------- 247 (523) +-.+|++|+|+++.||-+=.- -+.-.=+|+| |-.=| --++..|+.+.+++....| T Consensus 55 yy~~fl~rfPti~~LA~A~~~--evl~~W~gLG------YysRA-------rnL~~~A~~v~~~~~G~~P~~~~~l~~Lp 119 (342) T COG1194 55 YYERFLERFPTIKALAAAPED--EVLKAWEGLG------YYSRA-------RNLHKAAQEVVERHGGEFPDDEEELAALP 119 (342) T ss_pred HHHHHHHHCCCHHHHHCCCHH--HHHHHHHHCC------HHHHH-------HHHHHHHHHHHHHCCCCCCCCHHHHHHCC T ss_conf 699999868998998668888--9999987167------37899-------99999999999981997999999998678 Q ss_pred -HHHHHHHHHHHHH--------------------CCCCCCC-CCCHHHHHHHHHHHHHCCCCCHH-HHHHHHHHHCCCCE Q ss_conf -7776676567441--------------------5688876-67988899999967746678989-99999975113786 Q gi|255764470|r 248 -LRARTADAVLRLL--------------------GGKPIQP-QGLNVKADVLLPTQHEKHIISDQ-EKDMVQSVLTLADR 304 (523) Q Consensus 248 -~~eRtA~aVLRLl--------------------G~~~~~~-~~~~ee~~~li~~~~~~g~l~~~-E~~mi~~vL~L~e~ 304 (523) .-.-||.||+-.- +++.... ..++.+...+. ++=+.... -.+.-+.+++|+.. T Consensus 120 GiG~yTa~Ail~~a~~~~~~~lDgNV~RVl~R~f~i~~~~~~~~~~~~~~~~~----~~ll~p~~~~~~fnqammdlGA~ 195 (342) T COG1194 120 GVGPYTAGAILSFAFNQPEPVLDGNVKRVLSRLFAISGDIGKPKTKKELWELA----EQLLTPDRRPGDFNQAMMDLGAT 195 (342) T ss_pred CCCHHHHHHHHHHHHCCCCCEEECCHHEEEHHHHCCCCCCCCCCHHHHHHHHH----HHHCCCCCCHHHHHHHHHHHHHH T ss_conf 97388999999987189875330542211266531436556541058999999----97448988767999999986367 Q ss_pred EEEEEEEECCEEEEEECCCCHHHHHHHHHHCCCCEEEEECCCCCC---EEEEEE Q ss_conf 674653400028996089998999999985799788887678552---788888 Q gi|255764470|r 305 PAKSIMTPRTEIVWLDVNCVDEDLQWKILELGHSRFPVAQGSLDS---FIGIVS 355 (523) Q Consensus 305 ~V~~IMtPR~~i~~ld~~~s~~e~~~~i~~~~~SR~PV~~~~~D~---iiGiv~ 355 (523) |-||++--. ...++.+......+.....+||-...... ..+++- T Consensus 196 ----ICt~~~P~C---~~CPl~~~c~a~~~g~~~~~P~k~~k~~~~~~~~~~~~ 242 (342) T COG1194 196 ----ICTAKKPKC---SLCPLRDNCAAYRNGTPEKYPVKKPKKKLPRRFAAFLI 242 (342) T ss_pred ----HHCCCCCCC---CCCCCHHHHHHHHCCCCCCCCCCCCCCCCCHHEEEEEE T ss_conf ----615899987---73946688999981996447876655455100335778 No 322 >PRK11360 sensory histidine kinase AtoS; Provisional Probab=21.05 E-value=45 Score=12.22 Aligned_cols=81 Identities=12% Similarity=0.130 Sum_probs=51.4 Q ss_pred HHHHHHHHHHHHHHH--CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH Q ss_conf 985778999998541--469999999999999999989999999848528999999999999999997745733702788 Q gi|255764470|r 142 VFSLDSVVTAIGMVQ--HFSVMAIAVAVSALMMMAVSQPMIRYISRHTTVVILCLGFLLMIGFLLIIEGLHFDIPKGYLY 219 (523) Q Consensus 142 vFSlDsVitavg~~~--~~~i~~~a~~isi~~m~~~s~~i~~~i~~~p~~~~l~l~fl~~iG~~L~~eg~~~~ipk~y~y 219 (523) ..++++|-..+.-.+ -++...+|+++++..-.+.|+.+++-+++ +- -|...++.++...+|+.+ T Consensus 174 ~~~~~~i~~~~~~~~~~~~~~~~i~l~~~~~~~~~l~~~i~~~i~~-----~~-------~~~~~~~~~~~~~~~~~~-- 239 (607) T PRK11360 174 NELTEDIRRQAWKMDVRIYIVLTAGLVIGLLLIVLLSRRLSANVDI-----IT-------DGLSTLEQDLPTRLPPLP-- 239 (607) T ss_pred ECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HH-------HHHHHHHCCCCCCCCCCC-- T ss_conf 0718999999999999999999999999999999999886446999-----99-------999997407774588889-- Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 9989999999998999998 Q gi|255764470|r 220 ASIGFSGIIEFFNQVARRN 238 (523) Q Consensus 220 ~~~~fs~~ve~~n~~~~~~ 238 (523) =-+.-+...||+.+++= T Consensus 240 --dEiG~la~~~N~ma~~L 256 (607) T PRK11360 240 --GELGQISQAINNLAQAL 256 (607) T ss_pred --CCHHHHHHHHHHHHHHH T ss_conf --95899999999999999 No 323 >TIGR01173 glmU UDP-N-acetylglucosamine pyrophosphorylase; InterPro: IPR005882 N-Acetylglucosamine-1-PO(4) uridyltransferase (GlmU, 2.7.7.23 from EC) is a trimeric bifunctional enzyme that catalyzes the last two sequential reactions in the de novo biosynthetic pathway for UDP-GlcNAc. The X-ray crystal structure of Escherichia coli GlmU in complex with UDP-GlcNAc and CoA has been determined to 2.1 A resolution and reveals a two-domain architecture that is responsible for these two reactions . The C-terminal domain is responsible for the CoA-dependent acetylation of Glc-1-PO(4) to GlcNAc-1-PO(4) and displays the longest left-handed parallel beta-helix observed to date. The acetyltransferase active site defined by the binding site for CoA makes use of residues from all three subunits and is positioned beneath an open cavity large enough to accommodate the Glc-1-PO(4) acetyl acceptor. The N-terminal domain catalyzes uridyl transfer from UTP to GlcNAc-1-PO(4) to form the final products UDP-GlcNAc and pyrophosphate. This domain is composed of a central seven-stranded beta-sheet surrounded by six alpha-helices in a Rossmann fold-like topology. ; GO: 0003977 UDP-N-acetylglucosamine diphosphorylase activity, 0009103 lipopolysaccharide biosynthetic process. Probab=20.97 E-value=45 Score=12.21 Aligned_cols=86 Identities=17% Similarity=0.296 Sum_probs=46.2 Q ss_pred HHHHHHHH-CCCEEEEE--------------ECCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCC Q ss_conf 99998850-38627775--------------2286742565259999998706644578886312883498599870011 Q gi|255764470|r 389 KLMERLRK-SSQTFVMV--------------LDEYGVLEGMITPANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWID 453 (523) Q Consensus 389 ~lL~~fr~-~~~~lAiV--------------vDE~G~~~GIVTleDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~ 453 (523) ++++.-++ ++..+++- .++.|.+.-|| |-.|.++++..+.++.-|.|.+++..- T Consensus 118 ~Ll~~~~~~~~~~~tlLt~~l~DP~GYGRI~r~~~g~V~~IV-----------E~KDA~~eqk~I~eiNtG~y~~~~~~L 186 (461) T TIGR01173 118 RLLESHRQLNGAKVTLLTAKLEDPTGYGRIIRENDGAVQAIV-----------EEKDANEEQKAIKEINTGVYVFDGAAL 186 (461) T ss_pred HHHHHHHCCCCCCEEEEEEEECCCCCCCEEEECCCCCEEEEE-----------CCCCCCHHHHCCCEEEEEEEEECHHHH T ss_conf 988863000104517999871888954358984899589997-----------335988698035278887999832899 Q ss_pred HHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEE Q ss_conf 9999998288999987782407899998831248899889 Q gi|255764470|r 454 VRYASKLFGVNLVDEDDRYSTLAGFILWRLGHLPQEKEIF 493 (523) Q Consensus 454 l~el~~~l~~~l~~e~~~~~TlaGlil~~l~~iP~~Ge~i 493 (523) -+-|.+.=+-+- ..|| |+.+...-.-..|..+ T Consensus 187 ~~~L~~l~n~Na---qgEY-----YLTD~ia~a~~~g~~v 218 (461) T TIGR01173 187 KRWLPKLSNNNA---QGEY-----YLTDVIALAVADGEEV 218 (461) T ss_pred HHHHHHHCCCCC---CCCC-----CHHHHHHHHHCCCCEE T ss_conf 988876287704---4431-----4789999985089478 No 324 >PRK05122 major facilitator superfamily transporter; Provisional Probab=20.86 E-value=45 Score=12.19 Aligned_cols=63 Identities=17% Similarity=0.182 Sum_probs=35.6 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHH-HHHHHHHHHHCCCCCC---HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99899999998485289999999999-9999999774573370---27889989999999998999998 Q gi|255764470|r 174 AVSQPMIRYISRHTTVVILCLGFLLM-IGFLLIIEGLHFDIPK---GYLYASIGFSGIIEFFNQVARRN 238 (523) Q Consensus 174 ~~s~~i~~~i~~~p~~~~l~l~fl~~-iG~~L~~eg~~~~ipk---~y~y~~~~fs~~ve~~n~~~~~~ 238 (523) ...-...+..+|+...+++..+++++ +|..+++ +--+.+- +.+-..++|++..-.+|..+.++ T Consensus 266 ~~r~~~g~l~dr~g~~~~i~~~~~l~~~~l~ll~--~~~~~~~ll~a~~l~G~G~g~~~P~l~a~~~~~ 332 (395) T PRK05122 266 GARLLFGNLINRLGGLRVAIVSLAVEILGLLLLW--LAPSPWVALIGAALTGFGFSLVFPALGVEAVKR 332 (395) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999832787860644899999999999999999--872099999999999999988889999999864 No 325 >TIGR03046 PS_II_psbV2 photosystem II cytochrome PsbV2. Members of this protein family are PsbV2, a protein closely related cytochrome c-550 (PsbV), a protein important to the water-splitting and oxygen-evolving activity of photosystem II. Mutant studies in Thermosynechococcus elongatus showed PsbV2 can partially replace PsbV, from which it appears to have arisen first by duplication, then by intergenic recombination with a different gene. Probab=20.56 E-value=46 Score=12.15 Aligned_cols=101 Identities=21% Similarity=0.195 Sum_probs=54.2 Q ss_pred CCHHHHHHHHHHCCCEEEEEECCCCCEEEEEEHHHHH-----------------------------HHHCCCCCCCCCCC Q ss_conf 9989999988503862777522867425652599999-----------------------------98706644578886 Q gi|255764470|r 385 ISVLKLMERLRKSSQTFVMVLDEYGVLEGMITPANIL-----------------------------EAIAGDFPDEDDQK 435 (523) Q Consensus 385 ~~l~~lL~~fr~~~~~lAiVvDE~G~~~GIVTleDIl-----------------------------E~IvGei~DE~d~~ 435 (523) ..+++...++.+....+.+-.|+-|.+. ..|.+++- |.+.|--|..+.-. T Consensus 21 ~~vD~yv~~yl~~~~pvtvPln~~G~t~-t~s~eq~~~GkrlF~~~C~~CH~GG~TktNpnV~L~~e~L~~AtP~RDni~ 99 (155) T TIGR03046 21 ARVDPYVKQYLRVTGPVELPLDAQGELK-TFTPEQLTDGKNLFESNCLNCHVGGATLPNPNVSLSLKDLKGATPPRDTIQ 99 (155) T ss_pred HHHHHHHHHHHCCCCCEEECCCCCCCEE-EECHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCCCCCCCCCHH T ss_conf 6650899986145774461137899868-857999998999999887640168603579877849899656999843399 Q ss_pred CC-EEEECCCEEEEEECCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCC Q ss_conf 31-28834985998700119999998288999987782407899998831248899 Q gi|255764470|r 436 LD-ITVGDDGSLTVDGWIDVRYASKLFGVNLVDEDDRYSTLAGFILWRLGHLPQEK 490 (523) Q Consensus 436 ~~-i~~~~d~~~~v~G~~~l~el~~~l~~~l~~e~~~~~TlaGlil~~l~~iP~~G 490 (523) .- -...++-+| ||..++.++..-..--+. +++...+||||+..-+.-|.=| T Consensus 100 ~LVdY~k~PtsY--DG~~~i~e~hps~~s~Lt--dedL~~iAg~IL~aa~~ap~WG 151 (155) T TIGR03046 100 SLVAYQRDPMSY--DGSEESYGCRPVPEDWMD--DEEVENLAAFILRAAQKAPGWG 151 (155) T ss_pred HHHHHHHCCCCC--CCCHHHHHHCCCCCCCCC--HHHHHHHHHHHHHCCCCCCCCC T ss_conf 999996299766--771566673568400268--8999999999874211377768 No 326 >TIGR00711 efflux_EmrB drug resistance MFS transporter, drug:H+ antiporter-1 (14 Spanner) (DHA2) family; InterPro: IPR004638 This subfamily of drug efflux proteins, a part of the major facilitator family, is predicted to have 14 potential membrane-spanning regions. Members with known activities include EmrB (multiple drug resistance efflux pump) in Escherichia coli, FarB (antibacterial fatty acid resistance) in Neisseria gonorrhoeae, TcmA (tetracenomycin C resistance) in Streptomyces glaucescens, etc. In most cases, the efflux pump is described as having a second component encoded in the same operon, such as EmrA of E. coli.; GO: 0006810 transport, 0016021 integral to membrane. Probab=20.43 E-value=46 Score=12.13 Aligned_cols=268 Identities=16% Similarity=0.201 Sum_probs=106.0 Q ss_pred HHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC--CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 53587899999999999999744399999998517--9989999999999999999999999999999997424799986 Q gi|255764470|r 5 WIYDYYVWIGLATLIALELVLGIDNLIFITLLVEK--LPLAQRGKALVFGLTFAMVTRIALLASLSYWIVMLQQPLFFLK 82 (523) Q Consensus 5 ~~~~p~~w~~l~tl~~lEivLs~DN~i~iai~~~~--Lp~~~r~~a~~~Gl~~A~v~R~~ll~~~~~wi~~l~~pl~~i~ 82 (523) |+.|--.|- -..++++--|+=|.++-....++ -.++++++-=..|+.+-.+--..|++..+. +-..-+ T Consensus 148 wI~e~~~Wr---w~F~~NvPiG~~v~~~a~~~l~~d~~~~~~~~~~D~~G~~L~~~g~~~Ll~~ls~---g~~~GW---- 217 (501) T TIGR00711 148 WIIENYEWR---WIFYINVPIGIIVVVVALFILPEDREKEKASKNLDFLGLLLLSVGLGSLLYVLSE---GQEEGW---- 217 (501) T ss_pred HHHHCCCCH---HEEEHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC---CCCCCC---- T ss_conf 887326222---2021257899999999999844666661666311278899999999999987521---566889---- Q ss_pred HHHHHH-HHHHHHHHHH--HHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHH--HH----HHHHHHHHHHHHHHHHHHH Q ss_conf 345322-2489999999--9999986555530257642111122233209999--99----9999999985778999998 Q gi|255764470|r 83 GLSFSG-RDIVLILGGF--FLLFKGTIELHERLEGDGFDKKHKFFSPVSWQVI--VL----QIVILDLVFSLDSVVTAIG 153 (523) Q Consensus 83 ~~~~s~-~~lill~GG~--fLl~k~~~el~~~~e~~~~~~~~~~~~~~~~~~~--v~----~I~v~D~vFSlDsVitavg 153 (523) ++- .=+++-+-|+ |..+...+|+.++ ++--+-+--+.. .|... +. .+..--.++=|=.=+--+ T Consensus 218 ---~~~~~il~~~i~~~~~~~~fv~~sel~~~---nP~~D~~~F~~~-~Ft~~f~~~~~~~~~~lyg~~~LLP~~Lq~~- 289 (501) T TIGR00711 218 ---FDSNLILILSIVGLVAIIAFVVRSELTDD---NPLVDLRLFKYR-NFTIGFCVYSLLLGLGLYGAVYLLPLYLQQV- 289 (501) T ss_pred ---CCCCHHHHHHHHHHHHHHHHHHHHHHCCC---CCCCCHHHHCHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- T ss_conf ---98631569999999999999997874146---880044340202-5788999999999999999999999999986- Q ss_pred HHHCHHHHHHHHHHHH--HHHHHHHHHHHH-HHHHCCHHHHHHHHHHHHHHHHHHHHHHCCC-------------CCCHH Q ss_conf 5414699999999999--999999899999-9984852899999999999999999774573-------------37027 Q gi|255764470|r 154 MVQHFSVMAIAVAVSA--LMMMAVSQPMIR-YISRHTTVVILCLGFLLMIGFLLIIEGLHFD-------------IPKGY 217 (523) Q Consensus 154 ~~~~~~i~~~a~~isi--~~m~~~s~~i~~-~i~~~p~~~~l~l~fl~~iG~~L~~eg~~~~-------------ipk~y 217 (523) -|..-+-.|+.+.- +.++..|=...+ +.+|.-.=+ ++.||+.+.+=|++.. +++|| T Consensus 290 --~~~~~~~~Gl~~~P~gl~~~l~sPIaG~~~~~k~~~R~------l~~ig~~~~a~~~y~~~~~~~~~~~f~~~~~~G~ 361 (501) T TIGR00711 290 --LGYTALQAGLHILPVGLAPLLSSPIAGRMLGDKIDPRL------LVTIGLILYAVGFYLRAFTLTPDTDFLAIALPGQ 361 (501) T ss_pred --CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHH------HHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHH T ss_conf --27997899999988999999970799999841468068------9999999999999888522155786188886779 Q ss_pred HHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHCCCCCCCC----CCHHHHHHHHHHHHHCC-CCCHH Q ss_conf 889989999999998999998--755430103777667656744156888766----79888999999677466-78989 Q gi|255764470|r 218 LYASIGFSGIIEFFNQVARRN--REQLMSPSRLRARTADAVLRLLGGKPIQPQ----GLNVKADVLLPTQHEKH-IISDQ 290 (523) Q Consensus 218 ~y~~~~fs~~ve~~n~~~~~~--~~~~~~~~~~~eRtA~aVLRLlG~~~~~~~----~~~ee~~~li~~~~~~g-~l~~~ 290 (523) .=-+++-|...-=+|-++-.+ ++|.-....|.. .+|-+||-=...- -+..+..-.-+..+... ..+.+ T Consensus 362 ~i~G~g~~~~fmP~~~~al~~Lp~~~~~~gs~L~n-----~~R~~gGSiGTal~t~~~~n~~~~~~~~~~e~~~e~~~~~ 436 (501) T TIGR00711 362 FIRGFGMGCFFMPLTTIALSKLPAEKVARGSSLSN-----FLRQLGGSIGTALITTILTNRTKFHHSQLTEKINEKVNPN 436 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 99999999999875577761252357622689999-----9999888999999999998668999999999623456710 Q ss_pred HHHHHHHHHCCCC Q ss_conf 9999997511378 Q gi|255764470|r 291 EKDMVQSVLTLAD 303 (523) Q Consensus 291 E~~mi~~vL~L~e 303 (523) -.+..++..++.. T Consensus 437 ~~~~~~~~~~~~~ 449 (501) T TIGR00711 437 FLSKLSQLAELGS 449 (501) T ss_pred HHHHHHHHHHHCC T ss_conf 7988868886051 No 327 >PRK01064 hypothetical protein; Provisional Probab=20.12 E-value=47 Score=12.09 Aligned_cols=47 Identities=13% Similarity=0.094 Sum_probs=30.7 Q ss_pred HHHHHHHHCCCCCCCCCCCCCEEEECCCEEEEEECCCHHHHHHHHCC Q ss_conf 99999987066445788863128834985998700119999998288 Q gi|255764470|r 417 PANILEAIAGDFPDEDDQKLDITVGDDGSLTVDGWIDVRYASKLFGV 463 (523) Q Consensus 417 leDIlE~IvGei~DE~d~~~~i~~~~d~~~~v~G~~~l~el~~~l~~ 463 (523) ++|++|.|+-.+.|..|+-.--...++++.+..=++.=+|+-+..|- T Consensus 1 mkeLve~ivk~LVd~Pd~V~V~~~~~~~~~~~~l~V~~~D~GkVIGk 47 (78) T PRK01064 1 MKEFLAYIVKNLVDRPEEVHIKEVQGTHTIIYELTVAKPDIGKIIGK 47 (78) T ss_pred CHHHHHHHHHHHCCCCCEEEEEEEECCCEEEEEEEECCCCCCCEECC T ss_conf 97899999998579987389999947987999999990018558788 No 328 >COG3877 Uncharacterized protein conserved in bacteria [Function unknown] Probab=20.04 E-value=47 Score=12.08 Aligned_cols=65 Identities=28% Similarity=0.398 Sum_probs=39.9 Q ss_pred HHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 9999989999987554-----3010377766765674415688876679888999999677466789899999 Q gi|255764470|r 227 IIEFFNQVARRNREQL-----MSPSRLRARTADAVLRLLGGKPIQPQGLNVKADVLLPTQHEKHIISDQEKDM 294 (523) Q Consensus 227 ~ve~~n~~~~~~~~~~-----~~~~~~~eRtA~aVLRLlG~~~~~~~~~~ee~~~li~~~~~~g~l~~~E~~m 294 (523) +|++| ..+|-|-+.. ++--.+|-| -+.+||-||-.|..+...+...+.++++ -+.|.++++|..- T Consensus 49 Fv~lf-~r~RGnlKEvEr~lg~sYptvR~k-ld~vlramgy~p~~e~~~~i~~~~i~~q-le~Gei~peeA~~ 118 (122) T COG3877 49 FVELF-LRCRGNLKEVERELGISYPTVRTK-LDEVLRAMGYNPDSENSVNIGKKKIIDQ-LEKGEISPEEAIK 118 (122) T ss_pred HHHHH-HHHCCCHHHHHHHHCCCCHHHHHH-HHHHHHHCCCCCCCCCHHHHHHHHHHHH-HHCCCCCHHHHHH T ss_conf 99999-997257999999977761789989-9999998089989987045538999999-9817879999999 Done!