Query gi|254780936|ref|YP_003065349.1| hypothetical protein CLIBASIA_04175 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 182 No_of_seqs 108 out of 875 Neff 8.2 Searched_HMMs 39220 Date Mon May 30 01:29:04 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780936.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 cd06167 LabA_like LabA_like pr 100.0 3.9E-30 1E-34 208.9 18.6 146 6-166 1-149 (149) 2 COG1432 Uncharacterized conser 99.9 7.2E-27 1.8E-31 188.5 14.7 171 1-180 1-174 (181) 3 pfam01936 DUF88 Protein of unk 99.9 3.3E-26 8.3E-31 184.4 13.9 138 7-167 1-140 (140) 4 TIGR00288 TIGR00288 conserved 98.9 5.7E-09 1.4E-13 76.5 8.5 133 4-166 22-156 (161) 5 pfam04396 consensus 98.6 1.7E-06 4.4E-11 61.0 12.1 130 6-161 1-141 (149) 6 smart00475 53EXOc 5'-3' exonuc 95.5 0.22 5.6E-06 29.1 14.9 53 65-130 88-140 (259) 7 cd00008 53EXOc 5'-3' exonuclea 94.7 0.38 9.8E-06 27.6 12.9 53 65-130 89-141 (240) 8 pfam02739 5_3_exonuc_N 5'-3' e 94.4 0.44 1.1E-05 27.2 11.2 61 65-142 90-150 (169) 9 TIGR00593 pola DNA polymerase 93.9 0.57 1.4E-05 26.5 10.0 142 10-178 2-178 (1005) 10 COG0603 Predicted PP-loop supe 93.1 0.58 1.5E-05 26.4 7.2 14 124-137 137-150 (222) 11 PRK09482 xni exonuclease IX; P 91.2 1.3 3.3E-05 24.3 14.6 118 7-143 3-147 (256) 12 PRK05929 consensus 91.1 1.3 3.4E-05 24.2 9.0 141 5-178 1-171 (870) 13 PRK08928 consensus 90.6 1.5 3.7E-05 23.9 13.0 121 4-143 2-151 (861) 14 PRK07456 consensus 90.3 1.5 4E-05 23.8 12.2 118 1-131 1-154 (975) 15 PRK08076 consensus 90.0 1.7 4.2E-05 23.6 10.5 114 5-131 1-143 (877) 16 PRK07997 consensus 89.8 1.7 4.4E-05 23.5 13.3 141 4-178 5-174 (928) 17 PRK07625 consensus 89.2 1.9 4.8E-05 23.2 13.0 141 5-178 4-173 (922) 18 PRK06887 consensus 89.0 2 5E-05 23.1 13.2 145 1-178 1-174 (954) 19 PRK07898 consensus 88.6 2.1 5.3E-05 23.0 12.4 125 1-144 9-164 (902) 20 PRK05755 DNA polymerase I; Pro 88.1 2.2 5.7E-05 22.8 13.7 140 6-178 4-172 (889) 21 TIGR00364 TIGR00364 exsB prote 87.5 1.1 2.8E-05 24.7 4.5 55 116-171 2-68 (227) 22 TIGR02637 RhaS rhamnose ABC tr 87.3 2.3 5.8E-05 22.7 6.0 64 66-142 20-87 (307) 23 PRK13059 putative lipid kinase 87.1 2.6 6.5E-05 22.4 10.9 83 60-156 18-100 (294) 24 PRK11557 putative DNA-binding 87.0 2.6 6.6E-05 22.4 11.1 62 98-165 165-229 (282) 25 pfam11977 RNase_Zc3h12a Zc3h12 86.9 2.6 6.7E-05 22.3 6.6 107 9-127 4-116 (155) 26 cd06318 PBP1_ABC_sugar_binding 86.8 2.7 6.8E-05 22.3 10.3 84 57-146 136-225 (282) 27 PRK07556 consensus 86.8 2.7 6.8E-05 22.3 7.4 81 65-178 100-180 (977) 28 PRK07300 consensus 86.6 2.7 6.9E-05 22.2 8.2 113 4-130 2-150 (880) 29 cd06313 PBP1_ABC_sugar_binding 86.6 2.7 7E-05 22.2 10.9 109 33-164 116-230 (272) 30 pfam06508 ExsB ExsB. This fami 86.5 2.8 7E-05 22.2 6.2 42 115-158 2-43 (137) 31 PRK02947 hypothetical protein; 86.3 1.4 3.6E-05 24.0 4.5 67 99-165 92-168 (247) 32 TIGR03127 RuMP_HxlB 6-phospho 85.9 2.6 6.6E-05 22.4 5.7 105 27-167 14-124 (179) 33 cd06305 PBP1_methylthioribose_ 85.8 3 7.7E-05 22.0 10.4 36 111-146 180-218 (273) 34 PRK08835 consensus 85.7 3 7.7E-05 21.9 12.3 124 1-143 1-153 (931) 35 PRK05797 consensus 84.6 3.4 8.7E-05 21.6 10.1 114 4-131 1-143 (869) 36 cd06316 PBP1_ABC_sugar_binding 84.4 3.5 8.9E-05 21.6 11.0 116 9-147 101-221 (294) 37 cd05005 SIS_PHI Hexulose-6-pho 84.2 3.6 9.1E-05 21.5 5.8 102 30-167 23-127 (179) 38 cd06409 PB1_MUG70 The MUG70 pr 83.7 1.9 4.8E-05 23.2 4.2 30 115-144 54-83 (86) 39 cd06312 PBP1_ABC_sugar_binding 83.6 3.8 9.6E-05 21.4 11.0 89 56-164 137-231 (271) 40 pfam10740 DUF2529 Protein of u 83.1 1.7 4.3E-05 23.5 3.7 59 111-173 80-141 (172) 41 cd06309 PBP1_YtfQ_like Peripla 82.8 4 0.0001 21.2 10.0 79 55-146 135-222 (273) 42 cd01996 Alpha_ANH_like_III Thi 82.0 3.8 9.6E-05 21.3 5.2 31 117-147 83-113 (154) 43 PRK09701 D-allose transporter 81.9 4.4 0.00011 20.9 11.1 89 56-162 168-262 (311) 44 PRK10355 xylF D-xylose transpo 81.8 4.4 0.00011 20.9 10.3 61 97-162 191-257 (330) 45 cd06317 PBP1_ABC_sugar_binding 81.6 4.5 0.00011 20.9 10.1 81 54-146 135-222 (275) 46 PRK05354 arginine decarboxylas 81.4 4.5 0.00012 20.8 7.1 116 26-145 46-179 (634) 47 cd01995 ExsB ExsB is a transcr 81.1 4.7 0.00012 20.8 6.3 35 124-161 98-134 (169) 48 PRK02155 ppnK inorganic polyph 80.7 4.8 0.00012 20.7 6.2 79 56-143 16-94 (291) 49 cd01538 PBP1_ABC_xylose_bindin 80.4 4.9 0.00013 20.6 8.9 93 58-163 139-237 (288) 50 cd05014 SIS_Kpsf KpsF-like pro 80.3 4.9 0.00013 20.6 5.3 84 64-170 16-102 (128) 51 COG1737 RpiR Transcriptional r 80.0 5.1 0.00013 20.6 10.3 112 27-166 114-228 (281) 52 PRK11106 queuosine biosynthesi 79.2 5.4 0.00014 20.4 6.3 30 123-155 136-167 (231) 53 PRK03378 ppnK inorganic polyph 78.3 5.7 0.00015 20.2 5.9 79 56-143 16-94 (292) 54 PRK08786 consensus 78.1 5.8 0.00015 20.2 12.4 61 65-144 88-148 (927) 55 TIGR02095 glgA glycogen/starch 78.0 5.8 0.00015 20.2 8.7 89 71-164 315-409 (517) 56 cd06322 PBP1_ABC_sugar_binding 78.0 5.8 0.00015 20.2 8.5 79 57-146 133-213 (267) 57 cd06302 PBP1_LsrB_Quorum_Sensi 77.9 5.9 0.00015 20.1 10.9 112 33-164 116-233 (298) 58 cd06311 PBP1_ABC_sugar_binding 77.8 5.9 0.00015 20.1 6.7 112 9-145 102-219 (274) 59 cd06292 PBP1_LacI_like_10 Liga 76.7 6.4 0.00016 19.9 11.7 102 33-146 113-218 (273) 60 cd06308 PBP1_sensor_kinase_lik 76.6 6.4 0.00016 19.9 10.3 91 56-162 134-231 (270) 61 cd06300 PBP1_ABC_sugar_binding 76.1 6.6 0.00017 19.8 11.1 94 34-146 119-217 (272) 62 cd06275 PBP1_PurR Ligand-bindi 75.5 6.8 0.00017 19.7 7.2 101 33-146 108-215 (269) 63 cd06274 PBP1_FruR Ligand bindi 74.8 7.1 0.00018 19.6 7.4 102 33-147 107-216 (264) 64 PRK08434 consensus 74.8 7.1 0.00018 19.6 10.2 113 5-131 1-142 (887) 65 PRK11337 DNA-binding transcrip 74.3 7.3 0.00019 19.6 12.2 63 98-166 174-239 (293) 66 cd05013 SIS_RpiR RpiR-like pro 74.2 7.4 0.00019 19.5 11.7 105 31-166 4-111 (139) 67 cd01574 PBP1_LacI Ligand-bindi 74.0 7.4 0.00019 19.5 11.5 99 32-146 106-211 (264) 68 cd06277 PBP1_LacI_like_1 Ligan 73.6 7.6 0.00019 19.4 10.4 103 32-146 108-214 (268) 69 cd06270 PBP1_GalS_like Ligand 72.5 8.1 0.00021 19.3 10.9 101 33-146 107-214 (268) 70 PRK13545 tagH teichoic acids e 72.4 8.1 0.00021 19.3 4.7 63 98-168 148-218 (549) 71 PRK02231 ppnK inorganic polyph 72.2 8.2 0.00021 19.2 5.0 71 64-143 3-73 (272) 72 cd05710 SIS_1 A subgroup of th 72.0 8.3 0.00021 19.2 5.9 47 115-167 50-99 (120) 73 cd06319 PBP1_ABC_sugar_binding 70.7 8.9 0.00023 19.0 11.7 85 53-146 134-220 (277) 74 cd05008 SIS_GlmS_GlmD_1 SIS (S 70.6 8.9 0.00023 19.0 5.2 51 113-169 47-100 (126) 75 cd01543 PBP1_XylR Ligand-bindi 68.8 9.7 0.00025 18.8 9.7 78 59-146 123-208 (265) 76 COG4821 Uncharacterized protei 67.6 10 0.00026 18.6 4.5 56 110-165 102-165 (243) 77 cd06306 PBP1_TorT-like TorT-li 67.2 10 0.00027 18.6 14.0 109 35-162 117-229 (268) 78 cd05006 SIS_GmhA Phosphoheptos 65.7 11 0.00029 18.4 5.6 52 109-166 98-152 (177) 79 cd01537 PBP1_Repressors_Sugar_ 65.7 11 0.00029 18.4 9.4 100 33-146 109-215 (264) 80 pfam09338 Gly_reductase Glycin 65.0 7.4 0.00019 19.5 3.2 89 66-160 260-356 (428) 81 TIGR01703 hybrid_clust hydroxy 64.3 6.9 0.00018 19.7 2.9 41 119-166 357-406 (567) 82 cd01539 PBP1_GGBP Periplasmic 64.1 12 0.00031 18.2 10.0 93 56-164 151-254 (303) 83 cd01471 vWA_micronemal_protein 64.0 12 0.00031 18.2 5.9 63 96-161 83-159 (186) 84 cd06295 PBP1_CelR Ligand bindi 63.5 12 0.00031 18.1 11.7 100 33-146 116-223 (275) 85 COG0794 GutQ Predicted sugar p 62.5 13 0.00033 18.0 5.6 85 61-168 52-139 (202) 86 TIGR03573 WbuX N-acetyl sugar 61.8 13 0.00034 17.9 5.2 20 125-144 149-168 (343) 87 PRK13337 putative lipid kinase 60.8 14 0.00035 17.8 10.7 79 48-141 7-87 (305) 88 PRK04160 diphthine synthase; P 60.6 14 0.00035 17.8 4.9 45 99-143 61-110 (259) 89 COG2117 Predicted subunit of t 60.5 14 0.00036 17.8 4.8 15 154-168 139-153 (198) 90 cd01482 vWA_collagen_alphaI-XI 60.1 14 0.00036 17.8 5.7 62 96-161 77-151 (164) 91 PRK04759 consensus 59.8 14 0.00037 17.7 7.0 80 56-143 16-95 (294) 92 cd06272 PBP1_hexuronate_repres 59.6 14 0.00037 17.7 9.9 102 33-147 102-210 (261) 93 cd01475 vWA_Matrilin VWA_Matri 59.2 15 0.00038 17.7 6.3 62 96-161 79-156 (224) 94 TIGR00642 mmCoA_mut_beta methy 59.1 15 0.00038 17.6 6.0 50 94-143 548-601 (642) 95 cd06282 PBP1_GntR_like_2 Ligan 59.1 15 0.00038 17.6 8.5 104 33-147 107-214 (266) 96 cd06310 PBP1_ABC_sugar_binding 58.9 15 0.00038 17.6 10.9 79 54-146 133-218 (273) 97 PRK10703 DNA-binding transcrip 58.8 15 0.00038 17.6 7.9 100 34-146 170-276 (335) 98 PRK01911 ppnK inorganic polyph 57.9 15 0.00039 17.5 5.5 82 56-144 11-94 (290) 99 TIGR01350 lipoamide_DH dihydro 57.9 15 0.00038 17.6 3.7 29 113-142 2-30 (481) 100 cd06271 PBP1_AglR_RafR_like Li 57.9 15 0.0004 17.5 8.2 102 33-147 111-219 (268) 101 cd01977 Nitrogenase_VFe_alpha 57.4 16 0.0004 17.5 4.1 35 114-148 289-324 (415) 102 cd01391 Periplasmic_Binding_Pr 57.2 16 0.00041 17.4 12.0 102 33-147 115-219 (269) 103 PRK11041 DNA-binding transcrip 56.2 17 0.00042 17.3 8.3 101 33-146 175-282 (341) 104 PRK11543 gutQ D-arabinose 5-ph 55.9 17 0.00043 17.3 4.2 51 113-169 90-143 (321) 105 pfam02110 HK Hydroxyethylthiaz 55.9 11 0.00027 18.6 2.6 72 65-143 98-174 (246) 106 cd06298 PBP1_CcpA_like Ligand- 55.8 17 0.00043 17.3 8.3 103 33-147 107-215 (268) 107 PRK03372 ppnK inorganic polyph 55.6 17 0.00043 17.3 6.8 86 56-143 15-100 (303) 108 PRK08533 flagellar accessory p 55.3 8.7 0.00022 19.1 2.1 68 100-167 103-184 (230) 109 pfam05991 DUF901 Protein of un 55.3 17 0.00044 17.2 7.3 117 10-160 1-127 (166) 110 PRK04539 ppnK inorganic polyph 54.7 17 0.00044 17.2 7.0 83 57-143 17-99 (296) 111 cd06320 PBP1_allose_binding Pe 54.2 18 0.00045 17.1 9.9 80 54-146 132-217 (275) 112 COG1134 TagH ABC-type polysacc 53.8 18 0.00046 17.1 4.0 74 78-168 141-222 (249) 113 cd06321 PBP1_ABC_sugar_binding 51.1 20 0.00051 16.8 8.8 74 59-145 135-214 (271) 114 pfam09001 DUF1890 Domain of un 51.0 20 0.00051 16.8 4.2 90 64-160 17-113 (138) 115 PRK11303 DNA-binding transcrip 50.7 20 0.00051 16.8 10.3 99 33-146 170-275 (330) 116 cd06281 PBP1_LacI_like_5 Ligan 50.2 21 0.00052 16.7 8.9 103 33-146 107-213 (269) 117 TIGR00034 aroFGH phospho-2-deh 50.2 17 0.00043 17.3 2.9 33 91-123 181-215 (348) 118 pfam06258 DUF1022 Protein of u 50.1 21 0.00053 16.7 4.3 29 115-145 227-255 (308) 119 pfam06057 VirJ Bacterial virul 49.6 21 0.00054 16.7 3.4 10 6-15 2-11 (192) 120 PRK02649 ppnK inorganic polyph 49.6 21 0.00054 16.7 6.7 85 56-143 12-99 (305) 121 PRK05249 soluble pyridine nucl 49.4 21 0.00054 16.7 3.4 39 105-143 168-206 (465) 122 PRK09355 hydroxyethylthiazole 48.6 9.2 0.00024 18.9 1.4 71 66-143 104-179 (262) 123 cd03238 ABC_UvrA The excision 48.5 22 0.00056 16.6 3.9 39 122-168 125-163 (176) 124 pfam01380 SIS SIS domain. SIS 48.4 22 0.00056 16.6 5.6 51 111-167 52-105 (131) 125 cd06355 PBP1_FmdD_like Peripla 48.4 21 0.00054 16.6 3.2 66 62-138 149-214 (348) 126 PRK10423 transcriptional repre 48.3 22 0.00056 16.6 8.0 100 34-146 166-272 (327) 127 KOG1116 consensus 48.1 19 0.00047 17.0 2.9 35 103-137 225-261 (579) 128 cd03270 ABC_UvrA_I The excisio 48.1 22 0.00057 16.5 3.7 40 121-168 174-213 (226) 129 PRK11302 DNA-binding transcrip 48.0 22 0.00057 16.5 12.2 61 98-165 161-224 (284) 130 PRK05928 hemD uroporphyrinogen 47.6 23 0.00058 16.5 6.8 51 65-123 140-190 (252) 131 cd05213 NAD_bind_Glutamyl_tRNA 47.5 23 0.00058 16.5 3.6 65 108-174 234-301 (311) 132 PRK00349 uvrA excinuclease ABC 47.4 23 0.00058 16.5 4.0 40 121-168 525-565 (944) 133 PRK10401 DNA-binding transcrip 47.2 23 0.00058 16.5 7.2 101 34-147 168-275 (346) 134 TIGR01212 TIGR01212 radical SA 46.9 23 0.00059 16.4 3.5 38 123-161 165-202 (307) 135 cd04104 p47_IIGP_like p47 (47- 46.9 23 0.00059 16.4 4.3 33 111-143 79-114 (197) 136 PRK05976 dihydrolipoamide dehy 46.7 23 0.00059 16.4 5.0 31 114-144 176-206 (464) 137 PRK13937 phosphoheptose isomer 46.7 23 0.0006 16.4 5.7 52 109-166 107-161 (192) 138 cd06294 PBP1_ycjW_transcriptio 46.1 24 0.00061 16.4 11.1 101 33-146 113-220 (270) 139 PRK01231 ppnK inorganic polyph 45.6 24 0.00062 16.3 5.8 79 56-143 15-93 (296) 140 cd06315 PBP1_ABC_sugar_binding 44.9 25 0.00063 16.2 9.6 11 114-124 218-228 (280) 141 TIGR02853 spore_dpaA dipicolin 44.8 25 0.00064 16.2 4.3 70 94-173 135-208 (288) 142 cd06290 PBP1_LacI_like_9 Ligan 44.7 25 0.00064 16.2 8.0 104 33-146 106-213 (265) 143 PRK11914 diacylglycerol kinase 44.7 25 0.00064 16.2 9.5 85 43-142 6-92 (304) 144 cd01170 THZ_kinase 4-methyl-5- 43.9 15 0.00037 17.7 1.8 70 67-143 100-175 (242) 145 COG1105 FruK Fructose-1-phosph 43.5 26 0.00067 16.1 6.9 45 98-143 114-166 (310) 146 cd01481 vWA_collagen_alpha3-VI 43.4 26 0.00067 16.1 5.8 60 98-161 80-154 (165) 147 PRK06370 mercuric reductase; V 43.2 26 0.00067 16.1 4.7 31 115-145 172-202 (459) 148 PRK13222 phosphoglycolate phos 43.1 27 0.00068 16.1 5.8 71 99-175 151-223 (228) 149 KOG1434 consensus 43.0 27 0.00068 16.1 5.7 104 8-122 151-264 (335) 150 pfam01861 DUF43 Protein of unk 42.1 26 0.00067 16.1 2.9 26 117-142 48-73 (243) 151 PRK00676 hemA glutamyl-tRNA re 42.1 27 0.0007 16.0 3.5 66 108-176 222-290 (338) 152 pfam05761 5_nucleotid 5' nucle 41.8 25 0.00063 16.3 2.7 30 112-143 177-206 (448) 153 PRK09526 lacI lac repressor; R 41.8 28 0.00071 15.9 10.6 99 33-146 172-277 (342) 154 cd06276 PBP1_FucR_like Ligand- 41.8 28 0.00071 15.9 8.0 33 115-147 164-200 (247) 155 cd06280 PBP1_LacI_like_4 Ligan 41.2 28 0.00072 15.9 8.4 101 32-146 105-209 (263) 156 cd02072 Glm_B12_BD B12 binding 41.1 28 0.00073 15.9 3.6 18 64-81 17-34 (128) 157 cd06273 PBP1_GntR_like_1 This 41.0 29 0.00073 15.9 11.9 102 32-146 106-215 (268) 158 COG3660 Predicted nucleoside-d 40.7 29 0.00074 15.8 5.8 35 103-143 238-272 (329) 159 cd06284 PBP1_LacI_like_6 Ligan 40.5 29 0.00074 15.8 12.2 100 33-146 106-213 (267) 160 PRK10892 D-arabinose 5-phospha 40.4 29 0.00074 15.8 4.8 51 113-169 95-148 (326) 161 cd06323 PBP1_ribose_binding Pe 40.3 29 0.00075 15.8 8.2 76 56-145 133-214 (268) 162 cd06299 PBP1_LacI_like_13 Liga 40.2 29 0.00075 15.8 7.4 100 33-146 107-212 (265) 163 cd01712 ThiI ThiI is required 40.1 29 0.00075 15.8 5.3 16 38-53 21-36 (177) 164 TIGR00750 lao LAO/AO transport 40.1 29 0.00075 15.8 3.2 25 113-137 158-185 (333) 165 pfam03852 Vsr DNA mismatch end 40.1 15 0.00038 17.6 1.4 15 65-79 26-40 (75) 166 PRK00654 glgA glycogen synthas 39.9 30 0.00076 15.8 6.7 66 94-164 304-372 (476) 167 cd06285 PBP1_LacI_like_7 Ligan 39.9 30 0.00076 15.7 7.9 102 33-147 105-213 (265) 168 cd06293 PBP1_LacI_like_11 Liga 39.7 30 0.00076 15.7 8.1 100 33-146 107-214 (269) 169 PTZ00175 diphthine synthase; P 39.7 30 0.00076 15.7 4.9 39 105-143 68-110 (272) 170 COG0258 Exo 5'-3' exonuclease 39.5 30 0.00077 15.7 8.1 112 7-131 11-153 (310) 171 PRK10444 UMP phosphatase; Prov 39.4 29 0.00074 15.8 2.8 26 116-141 111-140 (248) 172 cd00221 Vsr Very Short Patch R 39.3 15 0.00039 17.6 1.3 13 4-16 51-63 (115) 173 COG1879 RbsB ABC-type sugar tr 39.3 30 0.00077 15.7 10.9 74 59-145 173-253 (322) 174 cd06324 PBP1_ABC_sugar_binding 39.0 31 0.00078 15.7 10.4 78 56-147 154-240 (305) 175 PRK10727 DNA-binding transcrip 38.9 31 0.00078 15.7 7.5 105 33-147 167-275 (342) 176 TIGR01530 nadN NAD nucleotidas 38.9 8 0.0002 19.3 -0.2 65 60-142 168-234 (550) 177 cd06288 PBP1_sucrose_transcrip 38.7 31 0.00079 15.6 8.8 101 32-146 106-214 (269) 178 cd03791 GT1_Glycogen_synthase_ 38.6 31 0.00079 15.6 7.1 67 93-164 306-375 (476) 179 TIGR03407 urea_ABC_UrtA urea A 38.4 31 0.0008 15.6 3.3 66 62-138 150-215 (359) 180 PRK01710 murD UDP-N-acetylmura 38.3 31 0.0008 15.6 4.1 48 96-144 334-384 (458) 181 PRK10535 macrolide transporter 37.4 32 0.00083 15.5 3.4 38 123-168 181-218 (648) 182 cd06296 PBP1_CatR_like Ligand- 37.3 33 0.00083 15.5 10.6 100 33-146 108-215 (270) 183 COG4634 Uncharacterized protei 37.2 33 0.00083 15.5 7.0 64 65-142 12-75 (113) 184 COG1435 Tdk Thymidine kinase [ 36.9 33 0.00084 15.5 4.4 58 122-179 95-156 (201) 185 TIGR01317 GOGAT_sm_gam glutama 36.6 13 0.00034 17.9 0.7 48 122-169 292-341 (517) 186 KOG1461 consensus 36.2 34 0.00087 15.4 2.8 41 104-144 113-163 (673) 187 cd02008 TPP_IOR_alpha Thiamine 36.1 34 0.00087 15.4 5.3 39 110-148 66-108 (178) 188 PRK12480 D-lactate dehydrogena 35.7 34 0.00088 15.3 5.8 17 126-142 159-175 (330) 189 pfam03447 NAD_binding_3 Homose 35.4 35 0.00089 15.3 5.3 42 101-142 45-88 (116) 190 cd06407 PB1_NLP A PB1 domain i 35.3 35 0.00089 15.3 3.6 29 115-143 51-79 (82) 191 COG1569 Predicted nucleic acid 35.2 13 0.00034 17.9 0.5 25 115-143 105-129 (142) 192 TIGR00632 vsr DNA mismatch end 35.0 18 0.00046 17.1 1.1 54 4-78 79-137 (143) 193 PRK06292 dihydrolipoamide dehy 34.5 36 0.00092 15.2 5.0 32 114-145 170-201 (460) 194 cd06357 PBP1_AmiC Periplasmic 34.5 36 0.00092 15.2 3.4 66 63-138 149-215 (360) 195 PRK06467 dihydrolipoamide dehy 34.1 37 0.00093 15.2 4.5 42 104-145 165-206 (472) 196 PRK00635 excinuclease ABC subu 34.0 37 0.00094 15.2 3.9 35 125-167 517-551 (1809) 197 cd01542 PBP1_TreR_like Ligand- 33.9 37 0.00094 15.2 9.3 102 33-146 105-210 (259) 198 pfam08497 Radical_SAM_N Radica 33.4 38 0.00096 15.1 5.4 13 5-17 68-80 (298) 199 KOG3123 consensus 33.1 38 0.00097 15.1 3.5 39 105-143 68-110 (272) 200 cd03213 ABCG_EPDR ABCG transpo 32.8 38 0.00098 15.0 3.8 42 121-168 146-187 (194) 201 PRK08659 2-oxoglutarate ferred 32.7 39 0.00098 15.0 6.2 49 113-162 274-324 (377) 202 pfam00532 Peripla_BP_1 family. 32.6 39 0.00099 15.0 8.7 93 33-137 111-206 (281) 203 PRK13210 putative L-xylulose 5 32.6 39 0.00099 15.0 3.5 46 116-161 226-272 (284) 204 pfam03709 OKR_DC_1_N Orn/Lys/A 32.4 39 0.001 15.0 7.9 83 63-166 6-92 (111) 205 PRK10886 DnaA initiator-associ 32.2 39 0.001 15.0 5.2 66 98-166 95-163 (196) 206 PRK11658 UDP-4-amino-4-deoxy-L 32.1 40 0.001 15.0 5.3 101 25-137 106-211 (379) 207 cd01477 vWA_F09G8-8_type VWA F 32.0 40 0.001 15.0 5.0 51 111-161 129-186 (193) 208 cd06278 PBP1_LacI_like_2 Ligan 31.8 40 0.001 14.9 8.0 100 32-146 105-212 (266) 209 PRK02645 ppnK inorganic polyph 31.7 40 0.001 14.9 8.2 77 55-144 13-89 (304) 210 cd06578 HemD Uroporphyrinogen- 31.6 40 0.001 14.9 6.7 20 66-85 137-156 (239) 211 PRK08366 vorA 2-ketoisovalerat 31.6 40 0.001 14.9 6.2 48 114-162 265-314 (394) 212 PRK06116 glutathione reductase 31.3 41 0.001 14.9 3.7 31 114-144 168-198 (450) 213 TIGR02467 CbiE precorrin-6y C5 31.3 41 0.001 14.9 5.5 57 98-162 51-111 (210) 214 PRK07818 dihydrolipoamide dehy 31.1 41 0.001 14.9 3.7 31 114-144 173-203 (467) 215 PRK06115 dihydrolipoamide dehy 30.5 42 0.0011 14.8 3.6 39 106-144 167-205 (466) 216 TIGR03378 glycerol3P_GlpB glyc 30.4 42 0.0011 14.8 3.8 37 109-145 217-256 (419) 217 PRK10563 6-phosphogluconate ph 30.4 42 0.0011 14.8 3.7 69 99-174 144-213 (221) 218 TIGR00559 pdxJ pyridoxal phosp 30.4 42 0.0011 14.8 6.2 70 66-161 79-149 (265) 219 TIGR02153 gatD_arch glutamyl-t 30.3 42 0.0011 14.8 2.7 66 104-173 298-377 (413) 220 PRK08306 dipicolinate synthase 30.2 43 0.0011 14.8 4.9 54 115-175 153-210 (296) 221 cd01469 vWA_integrins_alpha_su 29.9 43 0.0011 14.7 5.9 48 114-161 105-158 (177) 222 cd02071 MM_CoA_mut_B12_BD meth 29.8 43 0.0011 14.7 3.9 62 64-142 17-85 (122) 223 COG2210 Peroxiredoxin family p 29.8 43 0.0011 14.7 4.5 20 124-143 77-97 (137) 224 COG0399 WecE Predicted pyridox 29.6 43 0.0011 14.7 5.3 109 9-139 98-215 (374) 225 PRK13938 phosphoheptose isomer 29.6 43 0.0011 14.7 5.6 52 109-166 110-164 (196) 226 cd02553 PseudoU_synth_RsuA Pse 29.5 44 0.0011 14.7 3.3 46 114-162 47-92 (167) 227 cd01567 NAPRTase_PncB Nicotina 29.5 44 0.0011 14.7 5.5 46 99-144 274-325 (343) 228 PRK08367 porA pyruvate ferredo 29.2 44 0.0011 14.7 6.2 49 113-162 262-312 (395) 229 COG0771 MurD UDP-N-acetylmuram 29.1 44 0.0011 14.7 4.6 48 111-161 340-392 (448) 230 cd03195 GST_C_4 GST_C family, 28.9 30 0.00078 15.7 1.5 58 95-167 42-105 (114) 231 cd02554 PseudoU_synth_RluF Pse 28.3 41 0.0011 14.8 2.1 47 113-162 43-89 (164) 232 PRK09271 flavodoxin; Provision 28.3 46 0.0012 14.6 8.4 84 46-148 4-97 (160) 233 PRK05329 anaerobic glycerol-3- 28.1 46 0.0012 14.5 3.6 40 106-145 213-255 (425) 234 COG1242 Predicted Fe-S oxidore 28.1 46 0.0012 14.5 3.6 38 123-161 168-205 (312) 235 cd02556 PseudoU_synth_RluB Pse 28.0 46 0.0012 14.5 2.4 48 113-163 46-93 (167) 236 PRK07251 pyridine nucleotide-d 28.0 46 0.0012 14.5 6.3 32 113-144 157-188 (438) 237 cd01480 vWA_collagen_alpha_1-V 27.7 47 0.0012 14.5 5.9 62 96-161 86-161 (186) 238 cd01540 PBP1_arabinose_binding 27.6 47 0.0012 14.5 12.4 78 59-146 145-230 (289) 239 cd02555 PSSA_1 PSSA_1: Pseudou 27.2 48 0.0012 14.4 2.4 47 113-162 57-103 (177) 240 TIGR00250 TIGR00250 conserved 27.2 48 0.0012 14.4 2.2 42 109-150 22-65 (133) 241 cd06342 PBP1_ABC_LIVBP_like Ty 27.0 48 0.0012 14.4 12.1 112 31-160 123-234 (334) 242 PRK01254 hypothetical protein; 27.0 48 0.0012 14.4 5.5 19 121-139 571-589 (742) 243 TIGR00338 serB phosphoserine p 26.8 49 0.0012 14.4 3.0 46 115-173 173-220 (223) 244 TIGR02177 PorB_KorB 2-oxoacid: 26.7 29 0.00074 15.8 1.1 11 115-125 73-83 (302) 245 TIGR00064 ftsY signal recognit 26.5 49 0.0013 14.4 2.3 76 25-120 94-173 (284) 246 PRK11394 23S rRNA pseudouridin 26.4 50 0.0013 14.4 3.2 47 113-162 72-118 (207) 247 PRK09288 purT phosphoribosylgl 26.4 50 0.0013 14.4 3.2 89 66-160 141-248 (395) 248 PRK13228 consensus 26.4 50 0.0013 14.4 5.6 71 98-174 153-225 (232) 249 TIGR01501 MthylAspMutase methy 26.3 50 0.0013 14.4 4.0 60 69-144 24-88 (134) 250 pfam02590 SPOUT_MTase Predicte 26.3 50 0.0013 14.3 3.5 68 97-168 48-126 (155) 251 cd06267 PBP1_LacI_sugar_bindin 26.2 50 0.0013 14.3 11.4 100 33-146 107-214 (264) 252 PRK04761 ppnK inorganic polyph 26.2 50 0.0013 14.3 4.2 28 113-140 26-53 (246) 253 COG1004 Ugd Predicted UDP-gluc 26.1 50 0.0013 14.3 4.8 21 112-132 163-183 (414) 254 PRK13224 consensus 26.1 50 0.0013 14.3 6.2 69 98-175 143-213 (216) 255 PHA00439 exonuclease 26.1 28 0.00071 15.9 0.9 20 112-131 135-154 (288) 256 PRK13538 cytochrome c biogenes 26.0 50 0.0013 14.3 4.3 38 121-167 164-201 (204) 257 PTZ00153 lipoamide dehydrogena 25.9 51 0.0013 14.3 5.0 59 109-168 120-186 (673) 258 cd03271 ABC_UvrA_II The excisi 25.8 51 0.0013 14.3 4.5 37 123-167 209-245 (261) 259 PRK10653 D-ribose transporter 25.7 51 0.0013 14.3 13.2 78 55-146 159-241 (295) 260 PRK10475 23S rRNA pseudouridin 25.6 51 0.0013 14.3 3.1 47 113-162 110-156 (290) 261 cd01541 PBP1_AraR Ligand-bindi 25.5 51 0.0013 14.3 8.7 102 32-146 111-220 (273) 262 PRK00561 ppnK inorganic polyph 25.2 52 0.0013 14.2 3.9 30 113-142 34-63 (259) 263 TIGR00393 kpsF sugar isomerase 25.1 52 0.0013 14.2 4.1 51 111-167 46-99 (272) 264 PRK06327 dihydrolipoamide dehy 25.0 53 0.0013 14.2 3.7 18 126-143 196-213 (475) 265 pfam01408 GFO_IDH_MocA Oxidore 25.0 53 0.0013 14.2 5.1 32 110-141 60-91 (120) 266 cd01545 PBP1_SalR Ligand-bindi 24.8 53 0.0013 14.2 9.5 105 33-147 109-217 (270) 267 cd06331 PBP1_AmiC_like Type I 24.8 53 0.0014 14.2 2.7 67 61-138 147-213 (333) 268 cd03375 TPP_OGFOR Thiamine pyr 24.6 53 0.0014 14.1 3.9 29 114-143 70-103 (193) 269 cd06286 PBP1_CcpB_like Ligand- 24.6 54 0.0014 14.1 11.7 105 33-147 105-213 (260) 270 PRK13651 cobalt transporter AT 24.4 53 0.0014 14.2 2.0 31 96-126 190-224 (304) 271 COG1121 ZnuC ABC-type Mn/Zn tr 24.4 54 0.0014 14.1 2.8 87 64-167 118-213 (254) 272 PRK10936 periplasmic sensory p 24.3 54 0.0014 14.1 12.4 109 33-162 159-273 (340) 273 TIGR02456 treS_nterm trehalose 23.9 55 0.0014 14.1 3.4 75 59-154 26-103 (560) 274 PRK00955 hypothetical protein; 23.9 55 0.0014 14.1 5.4 41 113-153 478-523 (599) 275 TIGR02244 HAD-IG-Ncltidse HAD 23.9 55 0.0014 14.1 3.1 60 98-161 184-258 (402) 276 pfam05762 VWA_CoxE VWA domain 23.8 55 0.0014 14.1 7.8 140 7-166 59-217 (223) 277 cd01472 vWA_collagen von Wille 23.8 55 0.0014 14.1 5.6 62 96-161 77-151 (164) 278 cd03246 ABCC_Protease_Secretio 23.6 56 0.0014 14.0 3.5 40 121-168 131-170 (173) 279 cd01575 PBP1_GntR Ligand-bindi 23.6 56 0.0014 14.0 11.5 100 33-146 107-214 (268) 280 cd06330 PBP1_Arsenic_SBP_like 23.5 56 0.0014 14.0 3.5 67 62-137 154-220 (346) 281 PRK12421 ATP phosphoribosyltra 23.4 56 0.0014 14.0 4.8 46 98-143 318-364 (391) 282 cd06341 PBP1_ABC_ligand_bindin 23.2 57 0.0014 14.0 11.7 102 32-147 122-223 (341) 283 cd06334 PBP1_ABC_ligand_bindin 23.2 57 0.0015 14.0 10.2 87 62-164 156-247 (351) 284 cd06303 PBP1_LuxPQ_Quorum_Sens 23.2 57 0.0015 14.0 10.2 87 60-164 146-238 (280) 285 cd03232 ABC_PDR_domain2 The pl 23.2 57 0.0015 14.0 3.9 42 121-168 143-184 (192) 286 COG0673 MviM Predicted dehydro 23.1 57 0.0015 14.0 5.3 31 111-141 66-96 (342) 287 pfam06506 PrpR_N Propionate ca 23.0 57 0.0015 14.0 4.8 22 120-141 107-128 (169) 288 TIGR03669 urea_ABC_arch urea A 23.0 57 0.0015 14.0 3.7 26 118-143 170-195 (374) 289 COG2454 Uncharacterized conser 22.9 52 0.0013 14.2 1.7 26 119-144 139-165 (211) 290 cd06307 PBP1_uncharacterized_s 22.8 58 0.0015 13.9 9.8 89 57-164 138-233 (275) 291 TIGR00694 thiM hydroxyethylthi 22.4 21 0.00054 16.7 -0.3 80 69-157 102-190 (282) 292 PRK00414 gmhA phosphoheptose i 22.4 59 0.0015 13.9 5.5 52 109-166 108-162 (192) 293 TIGR02280 PaaB1 phenylacetate 22.4 59 0.0015 13.9 2.7 46 97-161 73-118 (259) 294 pfam01301 Glyco_hydro_35 Glyco 22.3 59 0.0015 13.9 8.6 56 31-86 27-88 (317) 295 pfam02630 SCO1-SenC SCO1/SenC. 22.2 59 0.0015 13.9 4.3 87 40-140 37-124 (159) 296 PRK13055 putative lipid kinase 22.1 60 0.0015 13.8 10.5 82 46-139 5-86 (334) 297 PRK13644 cbiO cobalt transport 22.0 58 0.0015 13.9 1.9 38 122-167 172-209 (274) 298 PRK06154 hypothetical protein; 21.9 60 0.0015 13.8 4.1 43 109-151 436-481 (556) 299 PRK09124 pyruvate dehydrogenas 21.7 61 0.0016 13.8 3.6 49 109-157 422-473 (574) 300 pfam02775 TPP_enzyme_C Thiamin 21.7 61 0.0016 13.8 2.0 31 115-145 48-81 (150) 301 pfam01640 Peptidase_C10 Peptid 21.7 26 0.00065 16.1 -0.0 23 122-144 292-314 (378) 302 cd03224 ABC_TM1139_LivF_branch 21.5 61 0.0016 13.8 2.1 39 123-168 169-207 (222) 303 PRK11269 glyoxylate carboligas 21.5 62 0.0016 13.8 4.4 45 109-153 433-480 (591) 304 PRK12474 hypothetical protein; 21.4 62 0.0016 13.8 3.9 44 114-157 408-454 (518) 305 pfam01513 NAD_kinase ATP-NAD k 21.3 62 0.0016 13.7 3.9 33 111-143 34-66 (243) 306 cd01536 PBP1_ABC_sugar_binding 21.3 62 0.0016 13.7 11.4 82 52-146 129-216 (267) 307 cd06283 PBP1_RegR_EndR_KdgR_li 21.2 62 0.0016 13.7 11.0 104 33-146 107-215 (267) 308 TIGR02252 DREG-2 REG-2-like, H 21.2 62 0.0016 13.7 3.0 44 98-143 97-142 (224) 309 pfam09818 ABC_ATPase Predicted 21.1 55 0.0014 14.1 1.6 45 118-169 364-413 (447) 310 COG2145 ThiM Hydroxyethylthiaz 21.1 31 0.0008 15.6 0.3 67 70-143 109-180 (265) 311 PRK10785 maltodextrin glucosid 20.8 63 0.0016 13.7 3.8 21 124-144 351-372 (608) 312 COG2241 CobL Precorrin-6B meth 20.7 64 0.0016 13.7 5.7 27 112-138 68-94 (210) 313 TIGR01194 cyc_pep_trnsptr cycl 20.7 44 0.0011 14.7 1.0 124 28-168 412-545 (555) 314 pfam08423 Rad51 Rad51. Rad51 i 20.3 65 0.0017 13.6 4.4 16 106-121 175-190 (261) 315 cd03823 GT1_ExpE7_like This fa 20.3 65 0.0017 13.6 5.4 68 92-164 200-267 (359) 316 PRK12778 putative bifunctional 20.3 65 0.0017 13.6 5.6 15 65-79 494-508 (760) 317 PRK13886 conjugal transfer pro 20.3 65 0.0017 13.6 7.0 78 96-173 63-159 (241) 318 cd02514 GT13_GLCNAC-TI GT13_GL 20.3 65 0.0017 13.6 2.7 10 113-122 98-107 (334) 319 PTZ00052 thioredoxin reductase 20.2 65 0.0017 13.6 3.1 39 105-143 214-252 (541) 320 PRK08010 pyridine nucleotide-d 20.1 66 0.0017 13.6 3.8 38 106-143 151-188 (441) 321 PRK05752 uroporphyrinogen-III 20.1 66 0.0017 13.6 6.9 32 113-144 130-161 (255) 322 PRK11614 livF leucine/isoleuci 20.1 66 0.0017 13.6 2.2 39 123-168 174-212 (237) 323 COG0855 Ppk Polyphosphate kina 20.0 66 0.0017 13.6 8.2 44 103-146 357-407 (696) No 1 >cd06167 LabA_like LabA_like proteins. A well conserved group of bacterial proteins with no defined function. LabA, a member from Synechococcus elongatus PCC 7942, has been shown to play a role in cyanobacterial circadian timing. It is required for negative feedback regulation of the autokinase/autophosphatase KaiC, a central component of the circadian clock system. In particular, LabA seems necessary for KaiC-dependent repression of gene expression. Probab=99.97 E-value=3.9e-30 Score=208.91 Aligned_cols=146 Identities=36% Similarity=0.539 Sum_probs=128.8 Q ss_pred CEEEEEEEHHHHHHHHHHCC-CCCCHHHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEE Q ss_conf 71899980789999998508-76798999999851792999999730688731467775799988975595697534589 Q gi|254780936|r 6 EKIALFIDGANLYASSKALG-FDIDYRKLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKE 84 (182) Q Consensus 6 ~rvaIfID~~Nl~~~~~~~~-~~~d~~~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~ 84 (182) +|++|||||+|++++.++.+ +.+||.+|++++...+.+...++|.+...+ ...+.|.++|+..|++++.++... T Consensus 1 ~r~~ifiDg~Nl~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~r~Y~~~~~~-----~~~~~~~~~L~~~g~~~~~~~~~~ 75 (149) T cd06167 1 KRVAVFIDGENLYYSLRDLGGKRFDYRKLLEFLRDGGEIVLARAYGNWTSP-----ERQRGFLDALRRLGFEPIQKPLRT 75 (149) T ss_pred CEEEEEECHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCEEEEEEEECCCCC-----HHHHHHHHHHHHCCCEEEEEEEEE T ss_conf 929999815138899997347776899999999707977999999648884-----567899999997598479961130 Q ss_pred ECCCCCCCCCCCCCHHHHHHHHHHHH--CCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCC Q ss_conf 53788865435653299999999942--1158899983866589999999984988999983157765470899985211 Q gi|254780936|r 85 FTENCGRKRVKSSMDVELAVDAFEQS--EGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADY 162 (182) Q Consensus 85 ~~~~~~~~~~kk~~Dv~laiD~~~~a--~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~ 162 (182) . ...||++|+.|++||++++ +++|++||||||+||+|+++++|+.||+|+++++ +..+|++|+++||. T Consensus 76 ~------~~~kk~~Dv~la~D~~~~a~~~~~D~~vLvSgD~Df~p~v~~lr~~Gk~V~v~~~----~~~~s~~L~~~~d~ 145 (149) T cd06167 76 R------GSGKKGVDVALAIDALELAYKRRIDTIVLVSGDSDFVPLVERLRELGKRVIVVGF----EAKTSRELRKAADR 145 (149) T ss_pred C------CCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEC----CCCCCHHHHHHHHH T ss_conf 4------7766642099999999996138999899995777279999999987999999974----88786999997111 Q ss_pred CEEH Q ss_conf 1518 Q gi|254780936|r 163 FMDL 166 (182) Q Consensus 163 fi~l 166 (182) |+++ T Consensus 146 fi~l 149 (149) T cd06167 146 FIDL 149 (149) T ss_pred CEEC T ss_conf 1729 No 2 >COG1432 Uncharacterized conserved protein [Function unknown] Probab=99.95 E-value=7.2e-27 Score=188.50 Aligned_cols=171 Identities=41% Similarity=0.550 Sum_probs=146.6 Q ss_pred CCCCCCEEEEEEEHHHHHHHH-HHCCCCCCHHHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 987887189998078999999-8508767989999998517929999997306887314677757999889755956975 Q gi|254780936|r 1 MFDPREKIALFIDGANLYASS-KALGFDIDYRKLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVA 79 (182) Q Consensus 1 m~~~~~rvaIfID~~Nl~~~~-~~~~~~~d~~~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~ 79 (182) |..+++|+++|||+.|++++. +.+++.+|+.+++........+.++++|....+.. ..+......+++...|+++.. T Consensus 1 ~~~~~~~i~~~id~~n~~~~~~~~~~~~~d~~~ll~~~~~~~~l~~~~~~~~~~~~~--~~~~~~~~~~~~~~~g~~v~~ 78 (181) T COG1432 1 TPIMVDRIALFIDGKNLYASQRKNLGILIDYRKLLRKELKRIYLYEAAPYTAEYGDI--EIQEVFLNEYELRSDGFTVVT 78 (181) T ss_pred CCCCCCCEEEEEEHHHHHHHHHHHCCCCCCHHHHHHHHHCCCEEEEHHHHHHHCCHH--HHHHHHHHHHHHHCCCEEEEE T ss_conf 976566300345332204567651561026999999864577485126565231615--541344128887337718983 Q ss_pred EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHC--CCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHH Q ss_conf 34589537888654356532999999999421--1588999838665899999999849889999831577654708999 Q gi|254780936|r 80 KVAKEFTENCGRKRVKSSMDVELAVDAFEQSE--GLEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLR 157 (182) Q Consensus 80 ~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~--~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~ 157 (182) ++...+.+ ....++++||.|++||++++. ++|+++|+|||+||+|+++.+++.|++|+++++. +++|.+|+ T Consensus 79 ~~~~~~~~---~~~~k~~vDv~la~D~~~l~~~~~~D~ivl~SgD~DF~p~v~~~~~~G~rv~v~~~~----~~~s~~L~ 151 (181) T COG1432 79 SNLDGFTD---LRITKGDVDVELAVDAMELADKKNVDTIVLFSGDGDFIPLVEAARDKGKRVEVAGIE----PMTSSDLR 151 (181) T ss_pred ECCCCCCC---CCCCCCCCCHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHHHCCCEEEEEECC----CCCHHHHH T ss_conf 04333342---110103545999999999741146786999747765288999999669879999457----76879998 Q ss_pred HHCCCCEEHHHHHHHHCCCCCCC Q ss_conf 85211151899787622583115 Q gi|254780936|r 158 RQADYFMDLAYLKNEIARDPDED 180 (182) Q Consensus 158 ~~ad~fi~l~~l~~~i~r~~~~~ 180 (182) ++||+|++|+++.+.+...+.+. T Consensus 152 ~~aD~~i~L~~~~~~~~~~~~~~ 174 (181) T COG1432 152 NAADYYIDLKSLEEAILLPEIER 174 (181) T ss_pred HHCCCEEECCHHHHHHCCCCCCC T ss_conf 63261677707665442332345 No 3 >pfam01936 DUF88 Protein of unknown function DUF88. This highly conserved bacterial protein has no known function. The alignment contains many conserved aspartates, suggesting an enzymatic function such as an endonuclease or glycosyl hydrolase (Bateman A pers. obs). Probab=99.94 E-value=3.3e-26 Score=184.38 Aligned_cols=138 Identities=36% Similarity=0.531 Sum_probs=119.7 Q ss_pred EEEEEEEHHHHHHHHHHCCCCCCHHHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEEC Q ss_conf 18999807899999985087679899999985179299999973068873146777579998897559569753458953 Q gi|254780936|r 7 KIALFIDGANLYASSKALGFDIDYRKLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFT 86 (182) Q Consensus 7 rvaIfID~~Nl~~~~~~~~~~~d~~~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~ 86 (182) |+|||||++|++++. .+|+.++++.+.+.+.+..+++|+++..+. ++.|.+.|.+.|++++.+|... T Consensus 1 rvaVfiD~~N~~~~~-----~~d~~~l~~~i~~~g~v~~~~aY~~~~~~~------~~~~~~~L~~~Gi~v~~~~~~~-- 67 (140) T pfam01936 1 RVAVFIDGENCPVPA-----GIDYRKVLEEIKSGGEVVRARAYGDWSDPK------LRKFPDALSSTGIPVQHKPLTK-- 67 (140) T ss_pred CEEEEEECCCCCCCC-----CCCHHHHHHHHHCCCCEEEEEEEECCCCCC------HHHHHHHHHHCCCEEEEECCCC-- T ss_conf 979999626587532-----479999999998279789999994788610------4779999998697189945515-- Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHH--CCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCE Q ss_conf 788865435653299999999942--115889998386658999999998498899998315776547089998521115 Q gi|254780936|r 87 ENCGRKRVKSSMDVELAVDAFEQS--EGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFM 164 (182) Q Consensus 87 ~~~~~~~~kk~~Dv~laiD~~~~a--~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi 164 (182) ..|+++|+.|++||++.+ +++|++||+|||+||+|+++.||++|++|++++. ++.+|.+|+++||+|+ T Consensus 68 ------~~Kn~~D~~l~vD~~~~~~~~~~d~~ilvsgD~Df~~~~~~lr~~g~~v~~~~~----~~~~s~~L~~~~d~f~ 137 (140) T pfam01936 68 ------SGKNAVDVGLAVDALELAYDNNPDTFVLVSGDGDFAPLLERLRERGKRVEVLGA----EPSTSDALINAADRFI 137 (140) T ss_pred ------CCCCCHHHHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEC----CCCCCHHHHHHCCEEE T ss_conf ------876503499999999997338999899992674079999999988999999965----8878799998669249 Q ss_pred EHH Q ss_conf 189 Q gi|254780936|r 165 DLA 167 (182) Q Consensus 165 ~l~ 167 (182) +++ T Consensus 138 ~l~ 140 (140) T pfam01936 138 DLE 140 (140) T ss_pred ECC T ss_conf 487 No 4 >TIGR00288 TIGR00288 conserved hypothetical protein TIGR00288; InterPro: IPR002790 This entry describes archaebacterial proteins of unknown function. . Probab=98.94 E-value=5.7e-09 Score=76.54 Aligned_cols=133 Identities=21% Similarity=0.312 Sum_probs=107.8 Q ss_pred CCCEEEEEEEHHHHHHHHHHCCCCCCHHHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEE Q ss_conf 88718999807899999985087679899999985179299999973068873146777579998897559569753458 Q gi|254780936|r 4 PREKIALFIDGANLYASSKALGFDIDYRKLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAK 83 (182) Q Consensus 4 ~~~rvaIfID~~Nl~~~~~~~~~~~d~~~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~ 83 (182) ..+++++++||.|+... .+.++...+.+.+.+.+.+.....+.+.. ......+++.+.||++.. T Consensus 22 ~~~~~g~l~dgp~~l~~----~~~~~l~~~~~~~~~~g~~~~g~~~l~~~--------~~~~l~~~~~~~g~~p~~---- 85 (161) T TIGR00288 22 GEKKLGLLVDGPNMLRK----EFNLDLDLVRELLSEYGDLKVGKVLLNQY--------ASDKLIEAVVNQGFEPIV---- 85 (161) T ss_pred CCCCEEEEECCCHHHHH----HHCCCHHHHHHHHHHHCCHHHHHHHHHHH--------HHHHHHHHHHHCCCCCEE---- T ss_conf 32111356526023445----53025789998875304501234444222--------014677787606876224---- Q ss_pred EECCCCCCCCCCCCCHHHHHHHHHHHH--CCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCC Q ss_conf 953788865435653299999999942--115889998386658999999998498899998315776547089998521 Q gi|254780936|r 84 EFTENCGRKRVKSSMDVELAVDAFEQS--EGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQAD 161 (182) Q Consensus 84 ~~~~~~~~~~~kk~~Dv~laiD~~~~a--~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad 161 (182) ..+.+|+.++++++++. +++|.+.+++.|+||.|++...++.|+.+.+++...|+ |..|++.+| T Consensus 86 ----------~~gd~d~~~~~~~~~l~~~p~~d~~~~~~~~~d~~p~~~~~~~~g~~~~~~g~~pg~----~~~l~~~~d 151 (161) T TIGR00288 86 ----------VAGDVDVELAVEALELIYNPNIDVVALVTRDADFLPLLNKAKENGKETIVIGAEPGF----STALQNSAD 151 (161) T ss_pred ----------EECCCCHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHHHHCCCEEEEEECCCCC----HHHHHCCCC T ss_conf ----------304431123444444430687433443303530478888775337605886257660----123211233 Q ss_pred CCEEH Q ss_conf 11518 Q gi|254780936|r 162 YFMDL 166 (182) Q Consensus 162 ~fi~l 166 (182) ..+-+ T Consensus 152 ~~~~~ 156 (161) T TIGR00288 152 DAIIL 156 (161) T ss_pred EEEEE T ss_conf 03651 No 5 >pfam04396 consensus Probab=98.59 E-value=1.7e-06 Score=61.03 Aligned_cols=130 Identities=18% Similarity=0.221 Sum_probs=83.7 Q ss_pred CEEEEEEEHHHHHHHHHHCCCCCCHHHH----HHHHHHCC--EEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 7189998078999999850876798999----99985179--29999997306887314677757999889755956975 Q gi|254780936|r 6 EKIALFIDGANLYASSKALGFDIDYRKL----LKAFRSRA--IVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVA 79 (182) Q Consensus 6 ~rvaIfID~~Nl~~~~~~~~~~~d~~~L----~~~l~~~~--~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~ 79 (182) -+|+||.|.+|...-. .++..++ ...+...| ..+-...|....... ....++|...|+.+++ T Consensus 1 akv~VfWDiEnCpvP~-----g~~~~~V~~~I~~al~~~Gy~g~vsi~aygd~~~~~-------~~~~~~L~stGI~l~h 68 (149) T pfam04396 1 AKTSVFWDIEDCPVPD-----GLDARDVAPNIKSALRKLGYRGPVSITAYGDLNKIP-------REILRALSSTGISLAH 68 (149) T ss_pred CCEEEEECCCCCCCCC-----CCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCC-------HHHHHHHHHCCCEEEE T ss_conf 9548996243688999-----989999999999999974999866999987468698-------8999999976986885 Q ss_pred EEEEEECCCCCCCCCCCCCHHHHHHHHHHHH---CCCCEEEEECCC-HHHHHHHHHH-HHCCCEEEEEEEECCCCCCCHH Q ss_conf 3458953788865435653299999999942---115889998386-6589999999-9849889999831577654708 Q gi|254780936|r 80 KVAKEFTENCGRKRVKSSMDVELAVDAFEQS---EGLEHLVIFSGD-GCFTTLVAAL-QRKVKKVTIVSTVLSDPSMASD 154 (182) Q Consensus 80 ~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a---~~~d~~iLvSGD-~Df~pli~~l-r~~Gk~V~v~~~~~~~~~~~S~ 154 (182) .| ...|+.+|-.|.+||+..+ +.--+++|+||| .||...+..| +.++-.|.++.. ....+. T Consensus 69 vp----------~g~k~aad~~m~~d~~~~a~~np~Pati~LISgd~~dfa~~l~~L~~~r~Y~vlLa~p----~~a~~~ 134 (149) T pfam04396 69 VP----------AGAKDARDKKMLVDILLWALDNPPPANLMLISGDDRDFASALHDLLRKRGYNILLAQP----EKASSS 134 (149) T ss_pred CC----------CCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHHHHCCCCEEEEECC----CCCCHH T ss_conf 78----------8775347889999999999709997079999577076799999998712855999748----978789 Q ss_pred HHHHHCC Q ss_conf 9998521 Q gi|254780936|r 155 QLRRQAD 161 (182) Q Consensus 155 ~L~~~ad 161 (182) .|..++. T Consensus 135 ~l~~a~~ 141 (149) T pfam04396 135 PALLAAA 141 (149) T ss_pred HHHHHHH T ss_conf 9998763 No 6 >smart00475 53EXOc 5'-3' exonuclease. Probab=95.54 E-value=0.22 Score=29.10 Aligned_cols=53 Identities=23% Similarity=0.136 Sum_probs=35.1 Q ss_pred HHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHH Q ss_conf 999889755956975345895378886543565329999999994211588999838665899999 Q gi|254780936|r 65 PLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVA 130 (182) Q Consensus 65 ~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~ 130 (182) ...+.|...|+.+...| .-.+|=-|+.=+......-..++|+|||.||..++. T Consensus 88 ~i~~~l~~~gi~~~~~~-------------g~EADDvIatla~~~~~~~~~v~IvS~DkDl~QLv~ 140 (259) T smart00475 88 LIKELLDALGIPVLEVE-------------GYEADDVIATLAKKAEAEGYEVRIVSGDKDLLQLVS 140 (259) T ss_pred HHHHHHHHCCCCEEECC-------------CCCHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHCC T ss_conf 99999997699876238-------------854799999999999986992699508837888488 No 7 >cd00008 53EXOc 5'-3' exonuclease; T5 type 5'-3' exonuclease domains may co-occur with DNA polymerase I (Pol I) domains, or be part of Pol I containing complexes. They digest dsDNA and ssDNA, releasing mono-,di- and tri-nucleotides, as well as oligonucleotides, and have also been reported to possess RNase H activity. Also called 5' nuclease family, involved in structure-specific cleavage of flaps formed by Pol I activity (similar to mammalian flap endonuclease I, FEN-1). A single nucleic acid strand may be threaded through the 5' nuclease enzyme before cleavage occurs. The domain binds two divalent metal ions which are necessary for activity. Probab=94.69 E-value=0.38 Score=27.56 Aligned_cols=53 Identities=23% Similarity=0.157 Sum_probs=34.0 Q ss_pred HHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHH Q ss_conf 999889755956975345895378886543565329999999994211588999838665899999 Q gi|254780936|r 65 PLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVA 130 (182) Q Consensus 65 ~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~ 130 (182) ...+.|...|+.+...| .-.+|=-|+.=+-.....-..++|+|+|.||..++. T Consensus 89 ~~~~~l~~~gi~~~~~~-------------~~EADD~ia~la~~~~~~~~~v~IvS~DkD~~QLv~ 141 (240) T cd00008 89 LIKELLEALGIPVLEIE-------------GYEADDVIGTLAKKAEAEGYKVVIVSGDKDLLQLVS 141 (240) T ss_pred HHHHHHHHCCCCEEEEC-------------CCCHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHH T ss_conf 99999997799789746-------------843899999999999866970699628806888724 No 8 >pfam02739 5_3_exonuc_N 5'-3' exonuclease, N-terminal resolvase-like domain. Probab=94.45 E-value=0.44 Score=27.20 Aligned_cols=61 Identities=20% Similarity=0.157 Sum_probs=35.5 Q ss_pred HHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEE Q ss_conf 999889755956975345895378886543565329999999994211588999838665899999999849889999 Q gi|254780936|r 65 PLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIV 142 (182) Q Consensus 65 ~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~ 142 (182) ...+.+...|+.+...+- -.+|=-|+.=+.....+=..++|+|||.||..||.. ..+|.+. T Consensus 90 ~i~~~l~~~gi~~~~~~g-------------~EADDvIatla~~~~~~~~~v~I~S~DkDl~QLv~~----~~~V~~~ 150 (169) T pfam02739 90 LIKELLDALGIPVLEVPG-------------YEADDVIGTLAKKAEKEGYDVRIVSGDKDLLQLVSD----KVNVAVL 150 (169) T ss_pred HHHHHHHHCEEEEEEECC-------------CCHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHCCC----CCEEEEE T ss_conf 999998750144785248-------------531567999999998589919998899985885329----9039994 No 9 >TIGR00593 pola DNA polymerase I; InterPro: IPR002298 DNA carries the biological information that instructs cells how to exist in an ordered fashion. Accurate replication is thus one of the most important events in the cell life cycle. This function is mediated by DNA-directed DNA polymerases, which add nucleotide triphosphate (dNTP) residues to the 5'-end of the growing DNA chain, using a complementary DNA as template. Small RNA molecules are generally used as primers for chain elongation, although terminal proteins may also be used. DNA-dependent DNA polymerases have been grouped into families, denoted A, B and X, on the basis of sequence similarities , . Members of family A, which includes bacterial and bacteriophage polymerases, share significant similarity to Escherichia coli polymerase I; hence family A is also known as the pol I family. The bacterial polymerases also contain an exonuclease activity, which is coded for in the N-terminal portion. Three motifs, A, B and C , are seen to be conserved across all DNA polymerases, with motifs A and C also seen in RNA polymerases. They are centred on invariant residues, and their structural significance was implied from the Klenow (E. coli) structure. Motif A contains a strictly-conserved aspartate at the junction of a beta-strand and an alpha-helix; motif B contains an alpha-helix with positive charges; and motif C has a doublet of negative charges, located in a beta-turn-beta secondary structure .; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=93.90 E-value=0.57 Score=26.50 Aligned_cols=142 Identities=21% Similarity=0.281 Sum_probs=73.8 Q ss_pred EEEEHHHH----HHHHHH----C----CCCCC----H-HHHHHHHHHCCE-EEEEEEEECCCC---------------CC Q ss_conf 99807899----999985----0----87679----8-999999851792-999999730688---------------73 Q gi|254780936|r 10 LFIDGANL----YASSKA----L----GFDID----Y-RKLLKAFRSRAI-VIRAYYYTTVVG---------------DP 56 (182) Q Consensus 10 IfID~~Nl----~~~~~~----~----~~~~d----~-~~L~~~l~~~~~-l~~~~~y~~~~~---------------~~ 56 (182) ++|||.++ ||+.+. + |...+ | .-|+..+++..+ -..+..|.+..+ .- T Consensus 2 ~LiDG~~lafRayfAl~~~~~~L~~~~G~~t~A~yGF~~~L~~~l~~~~Pm~~~~~aFD~~~~TFR~e~Y~~YKa~R~~t 81 (1005) T TIGR00593 2 LLIDGHSLAFRAYFALPNNKKPLTNSKGEPTNAVYGFTKMLLKLLKEEKPMTYVAVAFDSGTKTFRHEAYEEYKANRAKT 81 (1005) T ss_pred EEEEHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCHHHHHCHHHCCCCCCC T ss_conf 23403479999988622114666667887116778889999999974189516998876888851666253315788888 Q ss_pred HHHHHHHHH-HHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH-CCCCEEEEECCCHHHHHHHHHHHH Q ss_conf 146777579-998897559569753458953788865435653299999999942-115889998386658999999998 Q gi|254780936|r 57 EQQFSPLHP-LLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS-EGLEHLVIFSGDGCFTTLVAALQR 134 (182) Q Consensus 57 ~~~~~~~~~-~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a-~~~d~~iLvSGD~Df~pli~~lr~ 134 (182) ++....+-. ..+.|...||.+...+-. | +|=-||.=+-... .+|+ ++|+|||.|+.-|+..- T Consensus 82 P~el~~Qi~~i~e~l~algi~~~e~~GY-----------E--ADD~IaTLA~~a~~~G~~-V~I~sgDrD~lQLvs~e-- 145 (1005) T TIGR00593 82 PEELIEQIPLIKELLEALGIPILEVEGY-----------E--ADDVIATLAKQAEKEGYE-VRIISGDRDLLQLVSDE-- 145 (1005) T ss_pred CHHHHHHHHHHHHHHHHCCCEEEECCCC-----------C--HHHHHHHHHHHHHHCCCE-EEEEECCCCCCCCCCCC-- T ss_conf 8746876899999999719818822887-----------1--012899988877546854-89983784621121786-- Q ss_pred CCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHHHHHHHCCCCC Q ss_conf 49889999831577654708999852111518997876225831 Q gi|254780936|r 135 KVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAYLKNEIARDPD 178 (182) Q Consensus 135 ~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~l~~~i~r~~~ 178 (182) +|.|.... ...+.++.. .++.+.+.+..|-+|+ T Consensus 146 ---~V~V~~~~-~~~g~~~~~-------~~t~e~V~eKyGv~P~ 178 (1005) T TIGR00593 146 ---NVKVLIPE-RKKGKTEFT-------EITPEYVVEKYGVTPS 178 (1005) T ss_pred ---CEEEEECC-CCCCCCCCE-------ECCHHHHHHHCCCCHH T ss_conf ---14886435-456754310-------1177898754186745 No 10 >COG0603 Predicted PP-loop superfamily ATPase [General function prediction only] Probab=93.07 E-value=0.58 Score=26.42 Aligned_cols=14 Identities=7% Similarity=-0.097 Sum_probs=7.0 Q ss_pred HHHHHHHHHHHCCC Q ss_conf 58999999998498 Q gi|254780936|r 124 CFTTLVAALQRKVK 137 (182) Q Consensus 124 Df~pli~~lr~~Gk 137 (182) +|+.+++.+-++|- T Consensus 137 efi~a~~~~~~l~~ 150 (222) T COG0603 137 EFIEALNEALNLGT 150 (222) T ss_pred HHHHHHHHHHHHHC T ss_conf 99999999998625 No 11 >PRK09482 xni exonuclease IX; Provisional Probab=91.24 E-value=1.3 Score=24.29 Aligned_cols=118 Identities=16% Similarity=0.083 Sum_probs=57.2 Q ss_pred EEEEEEEHHHHHHHHHHC----CCCC----C-HHHHHHHHHHCCEEEEEEEEEC-CC----------------CCCHHHH Q ss_conf 189998078999999850----8767----9-8999999851792999999730-68----------------8731467 Q gi|254780936|r 7 KIALFIDGANLYASSKAL----GFDI----D-YRKLLKAFRSRAIVIRAYYYTT-VV----------------GDPEQQF 60 (182) Q Consensus 7 rvaIfID~~Nl~~~~~~~----~~~~----d-~~~L~~~l~~~~~l~~~~~y~~-~~----------------~~~~~~~ 60 (182) +--++|||.|+.+..-.. +-.- . ...|.+.+.....-..+..+.+ .+ ++.++.. T Consensus 3 ~~llLIDG~~l~~Ray~a~p~~~~~~~~~~~~~~~l~kli~~~~P~~i~v~fD~~~~~~~fR~e~~~~YKanR~~~pd~L 82 (256) T PRK09482 3 NHLLIIDALNLIRRIHAVQPSPNDIAACVETCQHALDKLIRHSQPTHAVAVFDGDERSSGWRHQLLPDYKAGRKPMPEAL 82 (256) T ss_pred CCEEEECCCHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHCCCCCCHHH T ss_conf 72899846289999984377899988678999999999999719987999984799876537776488760999999899 Q ss_pred -HHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEE Q ss_conf -7757999889755956975345895378886543565329999999994211588999838665899999999849889 Q gi|254780936|r 61 -SPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKV 139 (182) Q Consensus 61 -~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V 139 (182) .......+.|...|+.+...+- -.+|=-|+.=+-....+=..++|+|||.||..||. .+| T Consensus 83 ~~Q~~~i~~~l~~~gi~~~~~~g-------------~EADDiIatla~~~~~~~~~v~IvS~DkDl~QLv~------~~v 143 (256) T PRK09482 83 AQGLPAIRAAFEELGIDSWLADG-------------NEADDLIATLAVKVAQAGHQATIVSTDKGYCQLLS------PTI 143 (256) T ss_pred HHHHHHHHHHHHHCCCCEEEECC-------------CCHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHCC------CCE T ss_conf 99999999999977998983578-------------14889999999999977996999966888999675------984 Q ss_pred EEEE Q ss_conf 9998 Q gi|254780936|r 140 TIVS 143 (182) Q Consensus 140 ~v~~ 143 (182) .+.. T Consensus 144 ~v~~ 147 (256) T PRK09482 144 QIRD 147 (256) T ss_pred EEEE T ss_conf 9997 No 12 >PRK05929 consensus Probab=91.09 E-value=1.3 Score=24.20 Aligned_cols=141 Identities=11% Similarity=0.076 Sum_probs=69.8 Q ss_pred CCEEEEEEEHHHHHH----HHHH----CCCCCC----HHH-HHHHHHHCCEEEEEEEEECC-----CC-----------C Q ss_conf 871899980789999----9985----087679----899-99998517929999997306-----88-----------7 Q gi|254780936|r 5 REKIALFIDGANLYA----SSKA----LGFDID----YRK-LLKAFRSRAIVIRAYYYTTV-----VG-----------D 55 (182) Q Consensus 5 ~~rvaIfID~~Nl~~----~~~~----~~~~~d----~~~-L~~~l~~~~~l~~~~~y~~~-----~~-----------~ 55 (182) |+|. ++|||..+-+ +... .|...+ |.. |+..+.+...-.-+..|.+. +. + T Consensus 1 Mkkl-~LiDg~sl~fRAy~Alp~l~~s~G~pTnAv~GF~~mL~kll~~~~p~~~avaFD~~~~~tfRhe~~~~YKanR~~ 79 (870) T PRK05929 1 MKKL-FVLDASGFVFRAYFALPEMKNPNGESTQAVFGFIRSLNKLIKEFSPEYMVAVFDGPNNKQSRQEIYADYKSNRDK 79 (870) T ss_pred CCEE-EEEECCHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHHCCCCCC T ss_conf 9719-999175799999807998879999870509999999999998659988999987979997164412565389999 Q ss_pred CHHHHH-HHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHH Q ss_conf 314677-7579998897559569753458953788865435653299999999942115889998386658999999998 Q gi|254780936|r 56 PEQQFS-PLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQR 134 (182) Q Consensus 56 ~~~~~~-~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~ 134 (182) .++... ......+.+...|+.+...+-. .+|=-|+.=+-....+=-.++|||||.|+..||.. T Consensus 80 ~p~~l~~Qip~i~e~l~a~gip~~~~~Gy-------------EADDiIgTla~~~~~~g~~v~IvTgDKD~~QLv~~--- 143 (870) T PRK05929 80 KFEDLPEQIALVKEYCSLLGLSYLEKEGV-------------EADDVIASITKKAVAEGYEVCICTADKDLLQLVNS--- 143 (870) T ss_pred CCHHHHHHHHHHHHHHHHCCCCEECCCCC-------------CHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHCCC--- T ss_conf 98899999999999999879988350893-------------58999999999999779929998489976241779--- Q ss_pred CCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHHHHHHHCCCCC Q ss_conf 49889999831577654708999852111518997876225831 Q gi|254780936|r 135 KVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAYLKNEIARDPD 178 (182) Q Consensus 135 ~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~l~~~i~r~~~ 178 (182) +|.++.. .+.. ..++.+.+.+..+-+|. T Consensus 144 ---~v~v~~~---~k~~----------~~~d~~~V~ek~Gv~P~ 171 (870) T PRK05929 144 ---HVVAWNP---WKDQ----------EVIGYNEVIEQYGVPPG 171 (870) T ss_pred ---CEEEEEC---CCCC----------CCCCHHHHHHHHCCCHH T ss_conf ---8799976---8888----------12679999999796989 No 13 >PRK08928 consensus Probab=90.63 E-value=1.5 Score=23.93 Aligned_cols=121 Identities=17% Similarity=0.155 Sum_probs=62.5 Q ss_pred CCCEEEEEEEHHHHHHHH----HH----CCCCCC----HHHHH-HHHHHCCEEEEEEEEECC---------------CCC Q ss_conf 887189998078999999----85----087679----89999-998517929999997306---------------887 Q gi|254780936|r 4 PREKIALFIDGANLYASS----KA----LGFDID----YRKLL-KAFRSRAIVIRAYYYTTV---------------VGD 55 (182) Q Consensus 4 ~~~rvaIfID~~Nl~~~~----~~----~~~~~d----~~~L~-~~l~~~~~l~~~~~y~~~---------------~~~ 55 (182) +.++.-++|||..+.+-. .. .|...+ |...+ ..+.....-..+..|... +++ T Consensus 2 ~~k~~l~liDg~sl~fRAf~Al~~l~t~~G~pTnAi~GF~~mL~~~~~~~~P~~~~v~fD~~~~TFR~e~~~~YKa~R~~ 81 (861) T PRK08928 2 TKKNTFLLIDGYGFLFRAYYALQPLTTSKGEPIGGVYGFINMLLKLLSDFKPDYLAVVFDSGGKNFRHDIYPEYKANRPP 81 (861) T ss_pred CCCCCEEEEECCHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHCCCCCC T ss_conf 99994899967489999980799887999988244999999999999863998899999795996224332565389999 Q ss_pred CHHHHHHH-HHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHH Q ss_conf 31467775-79998897559569753458953788865435653299999999942115889998386658999999998 Q gi|254780936|r 56 PEQQFSPL-HPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQR 134 (182) Q Consensus 56 ~~~~~~~~-~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~ 134 (182) .++....+ ....+.+...|+.+...+-. .+|=-|+.=+-..+..=..++|||||.|+..||.. T Consensus 82 ~pedl~~Qip~i~~~~~~~gi~~~~~~gy-------------EADDvIgTla~~~~~~g~~v~IvSgDKDl~QLV~~--- 145 (861) T PRK08928 82 PPEDLISQFPLLREAVSALNIPIEEKNGY-------------EADDIIATLATKYASLGDEVVIISSDKDLLQLMNE--- 145 (861) T ss_pred CCHHHHHHHHHHHHHHHHCCCCEEEECCC-------------CHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHCCC--- T ss_conf 98889999999999999879998544893-------------58999999999999779909998489975031889--- Q ss_pred CCCEEEEEE Q ss_conf 498899998 Q gi|254780936|r 135 KVKKVTIVS 143 (182) Q Consensus 135 ~Gk~V~v~~ 143 (182) +|.++. T Consensus 146 ---~v~v~~ 151 (861) T PRK08928 146 ---NIKIYD 151 (861) T ss_pred ---CEEEEE T ss_conf ---859998 No 14 >PRK07456 consensus Probab=90.33 E-value=1.5 Score=23.77 Aligned_cols=118 Identities=19% Similarity=0.178 Sum_probs=59.6 Q ss_pred CCC-CCCEEEEEEEHHHHH----HHHH---------HCCCCCC----HHH-HHHHHHHCCEEEEEEEEECC--------- Q ss_conf 987-887189998078999----9998---------5087679----899-99998517929999997306--------- Q gi|254780936|r 1 MFD-PREKIALFIDGANLY----ASSK---------ALGFDID----YRK-LLKAFRSRAIVIRAYYYTTV--------- 52 (182) Q Consensus 1 m~~-~~~rvaIfID~~Nl~----~~~~---------~~~~~~d----~~~-L~~~l~~~~~l~~~~~y~~~--------- 52 (182) |.+ ..+++-++|||..+. |+.- +.|...+ |.. |+..+.+...-.-+..|... T Consensus 1 ~~~~~~k~~l~liDg~~~~fRay~A~~~~~~~pl~~~~G~~tnAv~GF~~~l~kli~~~~P~~iavaFD~~~~TFR~e~y 80 (975) T PRK07456 1 MPEASKKPLLLLVDGHSLAFRSFYAFSKGGEGGLRTKTGIPTSVTYGFLKSLLDNCKTQKPQGVAIAFDTAEPTFRHEAD 80 (975) T ss_pred CCCCCCCCEEEEEECCHHHHHHHHCCCCCCCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHH T ss_conf 99877899089995557899998077867888864899987214999999999999863998899998288995254231 Q ss_pred ------CCCCHHHHHHHHHHH-HHHH-HCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHH Q ss_conf ------887314677757999-8897-55956975345895378886543565329999999994211588999838665 Q gi|254780936|r 53 ------VGDPEQQFSPLHPLL-DWLH-YNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGC 124 (182) Q Consensus 53 ------~~~~~~~~~~~~~~~-~~l~-~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~D 124 (182) +++.++....+..+. +.|+ ..|+.+...+-. .+|=-|+.=+-..+..=..++|||||.| T Consensus 81 ~~YKA~R~~~PedL~~Qip~i~e~l~~algIp~~e~~Gy-------------EADDiIaTla~~a~~~g~~v~IvSgDKD 147 (975) T PRK07456 81 PNYKANRDVAPDDFFQDLEQLQEILEEALNLPICTAPGY-------------EADDVLGTLANRAADQGWRVRILSGDRD 147 (975) T ss_pred HHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCC-------------CHHHHHHHHHHHHHHCCCEEEEECCCCC T ss_conf 364079999988899999999999998369987417880-------------1999999999999977990899788824 Q ss_pred HHHHHHH Q ss_conf 8999999 Q gi|254780936|r 125 FTTLVAA 131 (182) Q Consensus 125 f~pli~~ 131 (182) +..||.. T Consensus 148 l~QLV~d 154 (975) T PRK07456 148 LFQLVDD 154 (975) T ss_pred HHHHCCC T ss_conf 8860878 No 15 >PRK08076 consensus Probab=89.96 E-value=1.7 Score=23.58 Aligned_cols=114 Identities=15% Similarity=0.125 Sum_probs=58.6 Q ss_pred CCEEEEEEEHHHHHHHH----H----HCCCCCC----HHH-HHHHHHHCCEEEEEEEEECCC---------------CCC Q ss_conf 87189998078999999----8----5087679----899-999985179299999973068---------------873 Q gi|254780936|r 5 REKIALFIDGANLYASS----K----ALGFDID----YRK-LLKAFRSRAIVIRAYYYTTVV---------------GDP 56 (182) Q Consensus 5 ~~rvaIfID~~Nl~~~~----~----~~~~~~d----~~~-L~~~l~~~~~l~~~~~y~~~~---------------~~~ 56 (182) |++--++|||..+.|.. - +.|...+ |.. |+..+.+...-..+..+.... ++. T Consensus 1 m~k~l~liDg~~~~~Ra~~a~p~l~~~~G~~t~ai~Gf~~~l~k~~~~~~p~~~~v~fD~~~~tfR~~~~~~YKa~R~~~ 80 (877) T PRK08076 1 LEKKLVLVDGNSIAYRAFFALPLLNNDKGIHTNAVYGFTMMLMKILEEEKPTHMLVAFDAGKTTFRHKTFKEYKGGRQKT 80 (877) T ss_pred CCCEEEEEECCHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHCCCCCCC T ss_conf 99719999574799999818987879999871349999999999998649988999986948963223416761899999 Q ss_pred HHHHHH-HHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH Q ss_conf 146777-579998897559569753458953788865435653299999999942115889998386658999999 Q gi|254780936|r 57 EQQFSP-LHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAA 131 (182) Q Consensus 57 ~~~~~~-~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~ 131 (182) ++.... .....+.+...|+.+...+-. .+|=-|+.=+-.....=-.++|+|||.|+..||.. T Consensus 81 p~~L~~Q~~~i~~~~~~~gi~~~~~~g~-------------EADDvi~tla~~~~~~~~~v~i~s~DkD~~QLv~~ 143 (877) T PRK08076 81 PPELSEQFPFIRELLDAFNIPRYELENY-------------EADDIIGTLAKEAEKDGFEVKVISGDKDLLQLVSD 143 (877) T ss_pred CHHHHHHHHHHHHHHHHCCCCEECCCCC-------------CHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHCCC T ss_conf 8889999999999999879988443890-------------09999999999999779909998289876252749 No 16 >PRK07997 consensus Probab=89.76 E-value=1.7 Score=23.48 Aligned_cols=141 Identities=15% Similarity=0.190 Sum_probs=70.9 Q ss_pred CCCEEEEEEEHHHHHHHH----H----HCCCCCC----HHHH-HHHHHHCCEEEEEEEEECC---------------CCC Q ss_conf 887189998078999999----8----5087679----8999-9998517929999997306---------------887 Q gi|254780936|r 4 PREKIALFIDGANLYASS----K----ALGFDID----YRKL-LKAFRSRAIVIRAYYYTTV---------------VGD 55 (182) Q Consensus 4 ~~~rvaIfID~~Nl~~~~----~----~~~~~~d----~~~L-~~~l~~~~~l~~~~~y~~~---------------~~~ 55 (182) |.++ -++|||..+-|.. . +.|...+ |... +..+.+...-..+..|... +++ T Consensus 5 ~~~~-l~LIDg~slifRAfyAl~~l~~s~G~pTnAi~GF~~mL~kli~~~~P~~iavaFD~~~~TFR~e~y~~YKA~R~~ 83 (928) T PRK07997 5 PENP-LILVDGSSYLYRAYHAFPPLTNSAGEPTGAMYGVLNMLRSLIMQYKPTHAAVVFDAKGKTFRDELFEHYKSHRPP 83 (928) T ss_pred CCCC-EEEEECCHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHCCCCCC T ss_conf 9997-799967579999980788677998966207999999999999864998799998088996034330565289999 Q ss_pred CHHHHHHHH-HHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHH Q ss_conf 314677757-9998897559569753458953788865435653299999999942115889998386658999999998 Q gi|254780936|r 56 PEQQFSPLH-PLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQR 134 (182) Q Consensus 56 ~~~~~~~~~-~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~ 134 (182) -++....+- ...+.+...|+.+...+-. .+|=-|+.=+-.....-..++|||||.|+..||. T Consensus 84 ~PedL~~Qip~i~e~l~a~gIp~~~~~Gy-------------EADDiIgTla~~~~~~g~~v~IvSgDKDl~QLV~---- 146 (928) T PRK07997 84 MPDDLRAQIEPLHAMVKAMGLPLLAVSGV-------------EADDVIGTLAREAEKAGRPVLISTGDKDMAQLVT---- 146 (928) T ss_pred CCHHHHHHHHHHHHHHHHCCCCEECCCCC-------------CHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHCC---- T ss_conf 98789999999999999879988555892-------------4999999999999967995999838987655378---- Q ss_pred CCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHHHHHHHCCCCC Q ss_conf 49889999831577654708999852111518997876225831 Q gi|254780936|r 135 KVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAYLKNEIARDPD 178 (182) Q Consensus 135 ~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~l~~~i~r~~~ 178 (182) ..|.++.... +.+++-+.+.+..+-+|+ T Consensus 147 --~~v~v~~~~~--------------~~~~~~~~V~ek~GV~P~ 174 (928) T PRK07997 147 --PNITLINTMT--------------NTILGPEEVVNKYGVPPE 174 (928) T ss_pred --CCEEEEECCC--------------CEEECHHHHHHHHCCCHH T ss_conf --9859998889--------------848779999998797989 No 17 >PRK07625 consensus Probab=89.19 E-value=1.9 Score=23.22 Aligned_cols=141 Identities=14% Similarity=0.165 Sum_probs=70.5 Q ss_pred CCEEEEEEEHHHHH----HHHHH----CCCCCC----HHHH-HHHHHHCCEEEEEEEEECC---------------CCCC Q ss_conf 87189998078999----99985----087679----8999-9998517929999997306---------------8873 Q gi|254780936|r 5 REKIALFIDGANLY----ASSKA----LGFDID----YRKL-LKAFRSRAIVIRAYYYTTV---------------VGDP 56 (182) Q Consensus 5 ~~rvaIfID~~Nl~----~~~~~----~~~~~d----~~~L-~~~l~~~~~l~~~~~y~~~---------------~~~~ 56 (182) .+|.-++|||..+- |+... .|...+ |... +..+.+...-.-+..|... +++- T Consensus 4 ~~k~l~liDg~~lifRAfyAlp~l~ts~G~pTnAi~GF~~mL~kli~e~~P~~iavaFD~~~~TFR~e~y~eYKAnR~~~ 83 (922) T PRK07625 4 EGKTLLLVDGSSYLYRAYHAMPDLRGPGGEPTGALYGIVNMLRRMRKDVSAEYSACVFDAKGKTFRDDLYADYKAHRPPM 83 (922) T ss_pred CCCEEEEECCCHHHHHHHHCCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHCHHHCCCCCCC T ss_conf 99979998473899999827888879989871179999999999998659988999971989963400135651899999 Q ss_pred HHHHHHH-HHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHC Q ss_conf 1467775-799988975595697534589537888654356532999999999421158899983866589999999984 Q gi|254780936|r 57 EQQFSPL-HPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRK 135 (182) Q Consensus 57 ~~~~~~~-~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~ 135 (182) ++....+ ....+.+...|+.+...+-. .+|=-|+.=+-....+=-.++|||||.|+..||. T Consensus 84 PedL~~Qip~i~e~l~a~gIp~~e~~Gy-------------EADDiIgTla~~a~~~g~~v~IvSgDKDl~QLV~----- 145 (922) T PRK07625 84 PPDLALQIEPIHEAVRALGWPLLMIEGV-------------EADDVIGTLARQAEQHGMNVIVSTGDKDLAQLVT----- 145 (922) T ss_pred CHHHHHHHHHHHHHHHHCCCCEECCCCC-------------CHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHCC----- T ss_conf 8889998999999999879988452891-------------5899999999999978893999818986866189----- Q ss_pred CCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHHHHHHHCCCCC Q ss_conf 9889999831577654708999852111518997876225831 Q gi|254780936|r 136 VKKVTIVSTVLSDPSMASDQLRRQADYFMDLAYLKNEIARDPD 178 (182) Q Consensus 136 Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~l~~~i~r~~~ 178 (182) .+|.++.. .++ ..++.+.+.+..+-+|. T Consensus 146 -~~v~i~~~---~k~-----------~~~d~~~V~ek~GV~P~ 173 (922) T PRK07625 146 -DRVTLVNT---MTN-----------EVLDRDGVLAKFGVPPE 173 (922) T ss_pred -CCEEEEEC---CCC-----------CCCCHHHHHHHHCCCHH T ss_conf -98699978---899-----------27089999998797989 No 18 >PRK06887 consensus Probab=89.01 E-value=2 Score=23.14 Aligned_cols=145 Identities=14% Similarity=0.158 Sum_probs=72.1 Q ss_pred CCCCCCEEEEEEEHHHHHH----HHH----HCCCCCC----HHHH-HHHHHHCCEEEEEEEEECC--------------- Q ss_conf 9878871899980789999----998----5087679----8999-9998517929999997306--------------- Q gi|254780936|r 1 MFDPREKIALFIDGANLYA----SSK----ALGFDID----YRKL-LKAFRSRAIVIRAYYYTTV--------------- 52 (182) Q Consensus 1 m~~~~~rvaIfID~~Nl~~----~~~----~~~~~~d----~~~L-~~~l~~~~~l~~~~~y~~~--------------- 52 (182) |.+-.++.-++|||..+-+ +.. +.|...+ |... +..+.+...-..+..|... T Consensus 1 ~~~~~~~~L~LIDg~sl~fRAfyAl~~l~ts~G~pTnAi~GF~~ml~kli~~~kP~~iavaFD~~~~TFR~e~y~~YKa~ 80 (954) T PRK06887 1 MAQIATNPLVLVDGSSYLYRAFHAFPPLTNSAGEPTGAMYGVLNMLKSLISQVQPSHIAVVFDAKGKTFRDELFEQYKSH 80 (954) T ss_pred CCCCCCCCEEEEECCHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHCCC T ss_conf 98778998899967589999980788675998987335999999999999871998899998088996033230565379 Q ss_pred CCCCHHHHHHH-HHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH Q ss_conf 88731467775-79998897559569753458953788865435653299999999942115889998386658999999 Q gi|254780936|r 53 VGDPEQQFSPL-HPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAA 131 (182) Q Consensus 53 ~~~~~~~~~~~-~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~ 131 (182) +++.++....+ ....+.+...|+.+...+-. .+|=-|+.=+-.....-..++|||||.|+..||.. T Consensus 81 R~~~PeeL~~Qip~i~e~l~a~gip~~~~~Gy-------------EADDiIgTla~~~~~~g~~v~IvS~DKDl~QLV~~ 147 (954) T PRK06887 81 RPPMPDDLRKQIQPLHDIIRALGIPLLSVEGV-------------EADDVIGTLAVQASSAGKKVLISTGDKDMAQLVDD 147 (954) T ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCEEEECCC-------------CHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHCCC T ss_conf 99998789999999999999879988556893-------------49999999999999789929997588770121679 Q ss_pred HHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHHHHHHHCCCCC Q ss_conf 99849889999831577654708999852111518997876225831 Q gi|254780936|r 132 LQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAYLKNEIARDPD 178 (182) Q Consensus 132 lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~l~~~i~r~~~ 178 (182) +|.++.... +.+++-+.+.+..+-+|. T Consensus 148 ------~v~i~~~~~--------------~~~~d~~~V~ek~GV~P~ 174 (954) T PRK06887 148 ------NIMLINTMN--------------NSLLDREGVIEKYGIPPE 174 (954) T ss_pred ------CEEEEECCC--------------CEEECHHHHHHHHCCCHH T ss_conf ------879998989--------------827579999898797989 No 19 >PRK07898 consensus Probab=88.63 E-value=2.1 Score=22.98 Aligned_cols=125 Identities=19% Similarity=0.215 Sum_probs=62.9 Q ss_pred CCCCCCEEEEEEEHHHHH----HHHH------HCCCCCC----HHH-HHHHHHHCCEEEEEEEEECC------------- Q ss_conf 987887189998078999----9998------5087679----899-99998517929999997306------------- Q gi|254780936|r 1 MFDPREKIALFIDGANLY----ASSK------ALGFDID----YRK-LLKAFRSRAIVIRAYYYTTV------------- 52 (182) Q Consensus 1 m~~~~~rvaIfID~~Nl~----~~~~------~~~~~~d----~~~-L~~~l~~~~~l~~~~~y~~~------------- 52 (182) |-+..++.-++|||..+. |+.- +.|...+ |.. |+..+.+...-..+..|... T Consensus 9 ~~~~~k~~l~liDg~sl~fRAy~Al~~~~l~~~~G~pTnAi~GF~~ml~kli~~~~P~~iavaFD~~~~TFRhe~y~~YK 88 (902) T PRK07898 9 MEDTEKPTLMLLDGHSLAFRAFYALPAENFSTSGGQTTNAVYGFTAMLINLLRDEQPTHVAVAFDVSRQTFRTEEYPEYK 88 (902) T ss_pred CCCCCCCEEEEEECCHHHHHHHCCCCCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHC T ss_conf 98788980899958489999971699766839999870259999999999998739988999997959960310136753 Q ss_pred --CCCCHHHHHHH-HHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHH Q ss_conf --88731467775-799988975595697534589537888654356532999999999421158899983866589999 Q gi|254780936|r 53 --VGDPEQQFSPL-HPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLV 129 (182) Q Consensus 53 --~~~~~~~~~~~-~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli 129 (182) +++-++.-..+ ....+.+...|+.+...+- . .+|=-|+.=+-.....=-.++|||||.|+..|| T Consensus 89 A~R~~~PedL~~Qip~i~e~l~a~gIp~le~~G-----------y--EADDiIgTla~~a~~~g~~v~IvSgDKDl~QLV 155 (902) T PRK07898 89 ATRSATPDEFKGQVDLIKEVLGALGITVLAKPG-----------F--EADDIIATLATQAEAEGYRVLVVTGDRDALQLV 155 (902) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCCEECCCC-----------C--CHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHC T ss_conf 899999889999899999999987998856389-----------6--589999999999997799699976899775517 Q ss_pred HHHHHCCCEEEEEEE Q ss_conf 999984988999983 Q gi|254780936|r 130 AALQRKVKKVTIVST 144 (182) Q Consensus 130 ~~lr~~Gk~V~v~~~ 144 (182) .. .|.++.. T Consensus 156 ~d------~v~vl~~ 164 (902) T PRK07898 156 SD------DVTVLYP 164 (902) T ss_pred CC------CEEEEEC T ss_conf 89------8799977 No 20 >PRK05755 DNA polymerase I; Provisional Probab=88.14 E-value=2.2 Score=22.78 Aligned_cols=140 Identities=18% Similarity=0.184 Sum_probs=69.8 Q ss_pred CEEEEEEEHHHHHHHH----H----HCCCCCC----HH-HHHHHHHHCCEEEEEEEEECC---------------CCCCH Q ss_conf 7189998078999999----8----5087679----89-999998517929999997306---------------88731 Q gi|254780936|r 6 EKIALFIDGANLYASS----K----ALGFDID----YR-KLLKAFRSRAIVIRAYYYTTV---------------VGDPE 57 (182) Q Consensus 6 ~rvaIfID~~Nl~~~~----~----~~~~~~d----~~-~L~~~l~~~~~l~~~~~y~~~---------------~~~~~ 57 (182) ++.-++|||..+.+.. . +.|...+ |. -|+..+.....-..+..|... +++.+ T Consensus 4 k~~l~liDg~~~~~Ra~~a~~~l~~~~G~~tnai~Gf~~~l~~~~~~~~p~~~~v~fD~~~~tfR~~~y~~YKa~R~~~P 83 (889) T PRK05755 4 KKTLLLVDGSSLAFRAFYALPPLTNSDGLPTGAVYGFLNMLLKLLKDEKPTHIAVAFDAKGKTFRHELYPEYKANRPPMP 83 (889) T ss_pred CCCEEEEECCHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHCCCCCCCC T ss_conf 89489995858999998078988799998725299999999999986299889999639899635210556427999999 Q ss_pred HHHHHH-HHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCC Q ss_conf 467775-7999889755956975345895378886543565329999999994211588999838665899999999849 Q gi|254780936|r 58 QQFSPL-HPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKV 136 (182) Q Consensus 58 ~~~~~~-~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~G 136 (182) +....+ ....+.+...|+.+...+-. .+|=-|+.=+-.....=..++|||||.|+..||. T Consensus 84 ~~l~~Q~~~i~~~l~~~gi~~~~~~g~-------------EADDii~tla~~~~~~~~~v~i~s~DkD~~QLv~------ 144 (889) T PRK05755 84 EDLREQIPLLREALEALGIPLLEVEGY-------------EADDVIGTLAKQAEAAGFEVLIVTGDKDLLQLVD------ 144 (889) T ss_pred HHHHHHHHHHHHHHHHCCCCEEEECCC-------------CHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHCC------ T ss_conf 899999999999999879988646892-------------3999999999999977995999808987856188------ Q ss_pred CEEEEEEEECCCCCCCHHHHHHHCCCCEEHHHHHHHHCCCCC Q ss_conf 889999831577654708999852111518997876225831 Q gi|254780936|r 137 KKVTIVSTVLSDPSMASDQLRRQADYFMDLAYLKNEIARDPD 178 (182) Q Consensus 137 k~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~l~~~i~r~~~ 178 (182) .+|.++-...+ ..++.+.+.+..+-+|. T Consensus 145 ~~v~~~~~~k~--------------~~~~~~~v~ek~Gv~P~ 172 (889) T PRK05755 145 DNVTVLNTMKN--------------ERIDPEGVIEKYGVTPE 172 (889) T ss_pred CCEEEEECCCC--------------CEECHHHHHHHHCCCHH T ss_conf 98699989999--------------28589999999796989 No 21 >TIGR00364 TIGR00364 exsB protein; InterPro: IPR004479 This protein family is represented by a single member in nearly every completed large (> 1000 genes) prokaryotic genome. In Sinorhizobium meliloti, a species in which the exo genes make succinoglycan, a symbiotically important exopolysaccharide, exsB is located nearby and affects succinoglycan levels, probably through polar effects on exsA expression or the same polycistronic mRNA , . In Arthrobacter viscosus, the homologous gene is designated alu1 and is associated with an aluminum tolerance phenotype. When expressed in Escherichia coli, it conferred aliminium tolerance . The entry also contains the gene queC, which is responsible for the conversion of GTP to 7-cyano-7-deazaguanine (preQ0). The biosynthesis of hypermodified tRNA nucleoside queuosine only occurs in eubacteria. It occupies the wobble position for all known tRNAs that are specific for Asp, Asn, His or Tyr .. Probab=87.54 E-value=1.1 Score=24.74 Aligned_cols=55 Identities=22% Similarity=0.249 Sum_probs=44.4 Q ss_pred EEEECCCHHHHHHHHHHHHCC--CEEEEEEEECCCCCCCHHHHHHHCC---------C-CEEHHHHHH Q ss_conf 999838665899999999849--8899998315776547089998521---------1-151899787 Q gi|254780936|r 116 LVIFSGDGCFTTLVAALQRKV--KKVTIVSTVLSDPSMASDQLRRQAD---------Y-FMDLAYLKN 171 (182) Q Consensus 116 ~iLvSGD~Df~pli~~lr~~G--k~V~v~~~~~~~~~~~S~~L~~~ad---------~-fi~l~~l~~ 171 (182) ++|+||=-|=+-+.-.++..| .+|+.++|.+|++. .+.||..|.+ . .|||+-+.. T Consensus 2 v~~lSGG~DStT~~~~a~~~~GkyeV~a~TF~YGQR~-H~~Ele~A~~ia~~Lgi~~~~~~Dl~~l~~ 68 (227) T TIGR00364 2 VVVLSGGQDSTTVLLIALDEGGKYEVHAITFDYGQRA-HSRELESARKIAEALGIRHHFVIDLSLLKQ 68 (227) T ss_pred EEEEECCHHHHHHHHHHHHCCCCEEEEEECCCCHHHH-HHHHHHHHHHHHHHHCCCCEEEECHHHHHH T ss_conf 6874373468999999996179507998546501378-999999999999980897078617799985 No 22 >TIGR02637 RhaS rhamnose ABC transporter, rhamnose-binding protein; InterPro: IPR013459 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This sugar-binding component of ABC transporter complexes is found in rhamnose catabolism operon contexts. Mutation of the gene in Rhizobium leguminosarum abolishes rhamnose transport and prevents growth on rhamnose as a carbon source.. Probab=87.34 E-value=2.3 Score=22.72 Aligned_cols=64 Identities=9% Similarity=0.104 Sum_probs=49.4 Q ss_pred HHHHHHHCC-CEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH--HCCCCEEEEECCCHH-HHHHHHHHHHCCCEEEE Q ss_conf 998897559-56975345895378886543565329999999994--211588999838665-89999999984988999 Q gi|254780936|r 66 LLDWLHYNG-FQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ--SEGLEHLVIFSGDGC-FTTLVAALQRKVKKVTI 141 (182) Q Consensus 66 ~~~~l~~~g-~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~--a~~~d~~iLvSGD~D-f~pli~~lr~~Gk~V~v 141 (182) =.++++++| .++++. .-.+++++=-++++-. ++.+|-|+|-..|.| ++|+++++.++|.+|+- T Consensus 20 a~eAaKeLG~~~~i~~-------------~p~~~~~~gQi~~vNsLiaQ~V~AIaiSAnDpdAlvpalkkA~~rgI~Vv~ 86 (307) T TIGR02637 20 AEEAAKELGSVEIIYT-------------GPTATTAEGQIEVVNSLIAQKVDAIAISANDPDALVPALKKAMKRGIKVVT 86 (307) T ss_pred HHHHHHHHCCEEEEEC-------------CCCCCCCHHHHHHHHHHHHCCEEEEEECCCCCCHHHHHHHHHHHCCCEEEE T ss_conf 5788876097479980-------------787664001322100031057006997027852278999999856982998 Q ss_pred E Q ss_conf 9 Q gi|254780936|r 142 V 142 (182) Q Consensus 142 ~ 142 (182) + T Consensus 87 ~ 87 (307) T TIGR02637 87 W 87 (307) T ss_pred E T ss_conf 5 No 23 >PRK13059 putative lipid kinase; Reviewed Probab=87.13 E-value=2.6 Score=22.41 Aligned_cols=83 Identities=18% Similarity=0.274 Sum_probs=52.8 Q ss_pred HHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEE Q ss_conf 77757999889755956975345895378886543565329999999994211588999838665899999999849889 Q gi|254780936|r 60 FSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKV 139 (182) Q Consensus 60 ~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V 139 (182) ........+.|+..|+++.... ++.+-|...+.+..+ +.+|.+|.+-|||=.-.++.-+...+..+ T Consensus 18 ~~~~~~i~~~l~~~~~~~~~~~------------t~~~~~~~~a~~~~~--~~~d~vv~~GGDGTinevvngl~~~~~~~ 83 (294) T PRK13059 18 ISELDKVIEIHQKYGYLVVPYR------------ISLGCDLKEAFKDID--ESYKYILIAGGDGTVDNVVNAMKKLNIDI 83 (294) T ss_pred HHHHHHHHHHHHHCCCEEEEEE------------EECCCHHHHHHHHHH--CCCCEEEEEECCCHHHHHHHHHHHCCCCC T ss_conf 8899999999997798899998------------535608999999887--48988999956788999999998569998 Q ss_pred EEEEEECCCCCCCHHHH Q ss_conf 99983157765470899 Q gi|254780936|r 140 TIVSTVLSDPSMASDQL 156 (182) Q Consensus 140 ~v~~~~~~~~~~~S~~L 156 (182) .+-...-|-.+--++.| T Consensus 84 ~lgiiP~GTgNdfAr~L 100 (294) T PRK13059 84 PIGILPVGTANDFAKFI 100 (294) T ss_pred CEEEEECCCCCHHHHHC T ss_conf 57998267750789982 No 24 >PRK11557 putative DNA-binding transcriptional regulator; Provisional Probab=87.03 E-value=2.6 Score=22.37 Aligned_cols=62 Identities=15% Similarity=0.126 Sum_probs=46.0 Q ss_pred CHHHHHHHHHHHHCCCCEEEEECCCH---HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEE Q ss_conf 32999999999421158899983866---589999999984988999983157765470899985211151 Q gi|254780936|r 98 MDVELAVDAFEQSEGLEHLVIFSGDG---CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMD 165 (182) Q Consensus 98 ~Dv~laiD~~~~a~~~d~~iLvSGD~---Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~ 165 (182) .|.++..........=|++|.+|--| +.+.+++.+|+.|.+|+.++-. ....|.+.||..+. T Consensus 165 ~d~~~~~~~~~~l~~~Dv~I~iS~sG~t~~~~~~~~~Ak~~Ga~iIaIT~~------~~spLa~~aD~~L~ 229 (282) T PRK11557 165 RDMHALLATVQALSPDDLLLAISYSGERRELNLAADEALRVGGKVLAITGF------TPNALQQRASHCLY 229 (282) T ss_pred CCHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHHCCCEEEEECCC------CCCCHHHHCCEEEE T ss_conf 886899999981899998999859999789999999999879939997298------99815886999998 No 25 >pfam11977 RNase_Zc3h12a Zc3h12a-like Ribonuclease domain. This domain is found in the Zc3h12a protein which has shown to be a ribonuclease that controls the stability of a set of inflammatory genes. It has been suggested that this domain belongs to the PIN domain superfamily. Probab=86.87 E-value=2.6 Score=22.32 Aligned_cols=107 Identities=22% Similarity=0.193 Sum_probs=48.3 Q ss_pred EEEEEHHHHHHHHHHCCCCCCHHHH---HHHHHHCCE-EEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEE Q ss_conf 9998078999999850876798999---999851792-999999730688731467775799988975595697534589 Q gi|254780936|r 9 ALFIDGANLYASSKALGFDIDYRKL---LKAFRSRAI-VIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKE 84 (182) Q Consensus 9 aIfID~~Nl~~~~~~~~~~~d~~~L---~~~l~~~~~-l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~ 84 (182) -|.|||.|+-++-. .+..++-..| ++++...+. ....+. ..+...........+.++..|...|.-+.+ |-+ T Consensus 4 ~IvIDGsNVA~~hg-~~~~fs~~gI~~av~yf~~rGh~~i~vf~-p~~r~~~~~~~~~~~~~L~~L~~~~~l~~t-Ps~- 79 (155) T pfam11977 4 PIVIDGSNVAMSHG-NKEFFSCRGLLLAVDYFRKRGHEVITVFV-PNWRYKARDGKVTDQHELERLIRLGLIVFT-PSR- 79 (155) T ss_pred EEEECCHHHHHHCC-CCCCCCHHHHHHHHHHHHHCCCCEEEEEC-CHHHHHCCCCCCCCHHHHHHHHHCCCEEEC-CCC- T ss_conf 18987648888739-99845089999999999985994599989-378731777888779999999978978974-565- Q ss_pred ECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCH--HHHH Q ss_conf 537888654356532999999999421158899983866--5899 Q gi|254780936|r 85 FTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDG--CFTT 127 (182) Q Consensus 85 ~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~--Df~p 127 (182) ...+. ....-|=. =|+++|.+.+.+| ||+|- |++. T Consensus 80 --~~~g~--~~~~yDDr---~iL~~A~~~~g~I-VSND~yRD~~~ 116 (155) T pfam11977 80 --TLDGR--RIVSYDDR---FILELAEETDGVI-VSNDNFRDLAD 116 (155) T ss_pred --CCCCC--CCCCCCHH---HHHHHHHHCCCEE-EECCHHHHHHH T ss_conf --67885--43331189---9999999839999-97826889987 No 26 >cd06318 PBP1_ABC_sugar_binding_like_9 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=86.79 E-value=2.7 Score=22.29 Aligned_cols=84 Identities=12% Similarity=0.001 Sum_probs=48.1 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH----HCCCCEEEEECCCHHHHHHHHHH Q ss_conf 14677757999889755956975345895378886543565329999999994----21158899983866589999999 Q gi|254780936|r 57 EQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ----SEGLEHLVIFSGDGCFTTLVAAL 132 (182) Q Consensus 57 ~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~----a~~~d~~iLvSGD~Df~pli~~l 132 (182) .....+...|.+++...+......+-....+. ...+++..-+..+++. -+.++ .|+.++|.--..+++.+ T Consensus 136 ~~~~~R~~Gf~~~l~~~~~~~~~~~~~~iv~~-----~~~~~~~~~a~~~~~~~L~~~p~i~-~i~~~nD~~a~Ga~~Al 209 (282) T cd06318 136 LVGQARRDGFLLGVSEAQLRKYGKTNFTIVAQ-----GYGDWTREGGLKAMEDLLVAHPDIN-VVYSENDDMALGAMRVL 209 (282) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEE-----CCCCCCHHHHHHHHHHHHHHCCCCE-EEEECCCHHHHHHHHHH T ss_conf 46999999999999845554367787289840-----6789779999999999997589970-99988967999999999 Q ss_pred HHCCC--EEEEEEEEC Q ss_conf 98498--899998315 Q gi|254780936|r 133 QRKVK--KVTIVSTVL 146 (182) Q Consensus 133 r~~Gk--~V~v~~~~~ 146 (182) ++.|+ .|.++++.. T Consensus 210 ~~~G~~~~v~vvG~D~ 225 (282) T cd06318 210 AEAGKTDDVKVAAADG 225 (282) T ss_pred HHCCCCCCEEEEEECC T ss_conf 9669999829999898 No 27 >PRK07556 consensus Probab=86.78 E-value=2.7 Score=22.29 Aligned_cols=81 Identities=15% Similarity=0.091 Sum_probs=46.7 Q ss_pred HHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 99988975595697534589537888654356532999999999421158899983866589999999984988999983 Q gi|254780936|r 65 PLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVST 144 (182) Q Consensus 65 ~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~ 144 (182) ...+.+...|+.+...+-. .+|=-|+.=+-.....=..++|||||.|+..||.. +|.++.. T Consensus 100 ~i~e~l~algIp~l~~~Gy-------------EADDiIgTlA~~~~~~g~~v~IvSgDKDl~QLV~d------~V~i~~~ 160 (977) T PRK07556 100 LIREATRAFNLPCIEKEGF-------------EADDLIATYARLAREAGGDVTIISSDKDLMQLVGD------GVSMYDP 160 (977) T ss_pred HHHHHHHHCCCCEEEECCC-------------CHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHCCC------CEEEEEC T ss_conf 9999999879988546895-------------48999999999999779929998189867562779------8699978 Q ss_pred ECCCCCCCHHHHHHHCCCCEEHHHHHHHHCCCCC Q ss_conf 1577654708999852111518997876225831 Q gi|254780936|r 145 VLSDPSMASDQLRRQADYFMDLAYLKNEIARDPD 178 (182) Q Consensus 145 ~~~~~~~~S~~L~~~ad~fi~l~~l~~~i~r~~~ 178 (182) .++ ..++.+++.+..+-+|. T Consensus 161 ---~k~-----------~~~d~~~V~eK~GV~P~ 180 (977) T PRK07556 161 ---MKD-----------KRIGIPEVIEKFGVPPE 180 (977) T ss_pred ---CCC-----------CEECHHHHHHHHCCCHH T ss_conf ---898-----------48179999998696989 No 28 >PRK07300 consensus Probab=86.63 E-value=2.7 Score=22.24 Aligned_cols=113 Identities=13% Similarity=0.147 Sum_probs=56.0 Q ss_pred CCCEEEEEEEHHHHHHHH----H--------HCCCCCC----HHHH-HHHHHHCCEEEEEEEEECCC------------- Q ss_conf 887189998078999999----8--------5087679----8999-99985179299999973068------------- Q gi|254780936|r 4 PREKIALFIDGANLYASS----K--------ALGFDID----YRKL-LKAFRSRAIVIRAYYYTTVV------------- 53 (182) Q Consensus 4 ~~~rvaIfID~~Nl~~~~----~--------~~~~~~d----~~~L-~~~l~~~~~l~~~~~y~~~~------------- 53 (182) +.++.-++|||..+-|.. . +.|...+ |... +..+.....-..+..|.... T Consensus 2 ~~kk~l~LIDG~slifRAfyAl~~~~~~l~ts~G~pTnAi~GF~~mL~kli~~~~P~~iavaFD~~~~TFR~~~~~~YKa 81 (880) T PRK07300 2 ENKNKLLLIDGSSVAFRAFFALYNQIDRFKNHSGLHTNAIYGFHLMLDHMMKRVQPTHVLVAFDAGKTTFRTEMYADYKA 81 (880) T ss_pred CCCCCEEEEECCHHHHHHHCCCCCCCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHCC T ss_conf 98981899967488999871788777887299998823599999999999986499889999969599601423157527 Q ss_pred --CCCHHHHHH-HHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHC--CCC-EEEEECCCHHHHH Q ss_conf --873146777-5799988975595697534589537888654356532999999999421--158-8999838665899 Q gi|254780936|r 54 --GDPEQQFSP-LHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSE--GLE-HLVIFSGDGCFTT 127 (182) Q Consensus 54 --~~~~~~~~~-~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~--~~d-~~iLvSGD~Df~p 127 (182) ++.++.... .....+.+...|+.+...+-. .+|=-|+.=+ ..+. ... .++|||||.|+.. T Consensus 82 ~R~~~P~~l~~Q~~~i~~~~~~~~i~~~~~~g~-------------EADDvi~tla-k~a~~~~~~~~v~Ivs~DkD~~Q 147 (880) T PRK07300 82 GRAKTPDEFREQFPYIREMLTALGIAYYELEHY-------------EADDIIGTLD-KMAERTEVPFDVTIVSGDKDLIQ 147 (880) T ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCCEECCCCC-------------CHHHHHHHHH-HHHHHCCCCCEEEEECCCCCCCC T ss_conf 999998899999999999999879988252891-------------4999999999-99996499971899748986161 Q ss_pred HHH Q ss_conf 999 Q gi|254780936|r 128 LVA 130 (182) Q Consensus 128 li~ 130 (182) ||. T Consensus 148 LV~ 150 (880) T PRK07300 148 LTD 150 (880) T ss_pred CCC T ss_conf 076 No 29 >cd06313 PBP1_ABC_sugar_binding_like_5 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=86.60 E-value=2.7 Score=22.23 Aligned_cols=109 Identities=15% Similarity=0.115 Sum_probs=63.9 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHC-CCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH- Q ss_conf 99998517929999997306887314677757999889755-9569753458953788865435653299999999942- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYN-GFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS- 110 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~-g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a- 110 (182) |.+.+...+++.. ..+.+. ......+...|.+.+... ++++.... ..++|...+..+++.. T Consensus 116 l~~~~~~~g~v~i---i~g~~~-~~~~~~R~~Gf~~~l~~~~~i~vv~~~-------------~~~~~~~~a~~~~~~~L 178 (272) T cd06313 116 LCNAMGGKGKIAM---LQGALG-HTGAQGRAQGFNDVIKKYPDIEVVDEQ-------------PANWDVSKAARIWETWL 178 (272) T ss_pred HHHHCCCCCEEEE---EECCCC-CHHHHHHHHHHHHHHHHCCCCEEEEEE-------------ECCCCHHHHHHHHHHHH T ss_conf 9985699972999---847988-689999999999999877991999986-------------26876899999999999 Q ss_pred ---CCCCEEEEECCCHHHHHHHHHHHHCCC-EEEEEEEECCCCCCCHHHHHHHCCCCE Q ss_conf ---115889998386658999999998498-899998315776547089998521115 Q gi|254780936|r 111 ---EGLEHLVIFSGDGCFTTLVAALQRKVK-KVTIVSTVLSDPSMASDQLRRQADYFM 164 (182) Q Consensus 111 ---~~~d~~iLvSGD~Df~pli~~lr~~Gk-~V~v~~~~~~~~~~~S~~L~~~ad~fi 164 (182) +++| .|+..+|.--.-++..+++.|+ +|.++++.. +...|+...+..+ T Consensus 179 ~~~pdid-~I~~~nD~~a~Ga~~Al~~aG~~~v~v~g~Dg-----~~~~l~~I~~G~~ 230 (272) T cd06313 179 TKYPQLD-GAFCHNDSMALAAYQIMKAAGRTKIVIGGVDG-----DPPAIQAVSDGRM 230 (272) T ss_pred HHCCCCC-EEEECCCHHHHHHHHHHHHCCCCCCEEEEECC-----CHHHHHHHHCCCC T ss_conf 6499987-89988847789999999976999868999789-----9999999985991 No 30 >pfam06508 ExsB ExsB. This family includes putative transcriptional regulators from Bacteria and Archaea. Probab=86.54 E-value=2.8 Score=22.21 Aligned_cols=42 Identities=21% Similarity=0.141 Sum_probs=26.0 Q ss_pred EEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHH Q ss_conf 89998386658999999998498899998315776547089998 Q gi|254780936|r 115 HLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRR 158 (182) Q Consensus 115 ~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~ 158 (182) .+||+||=-|=+-++-.+++.|..|+.+++.+|++. +.+|.. T Consensus 2 avvl~SGG~DSt~~l~~a~~~~~~v~ait~dYGQ~~--~~Ei~~ 43 (137) T pfam06508 2 AVVLLSGGLDSTTCLAWAKKEGYEVYALTFDYGQRH--SKELEC 43 (137) T ss_pred EEEEECCCHHHHHHHHHHHHCCCEEEEEEECCCCCH--HHHHHH T ss_conf 899917878999999999986996899981478873--999999 No 31 >PRK02947 hypothetical protein; Provisional Probab=86.29 E-value=1.4 Score=24.03 Aligned_cols=67 Identities=22% Similarity=0.209 Sum_probs=46.3 Q ss_pred HHHHHHHHHHHH--CCCCEEEEECCCH---HHHHHHHHHHHCCCEEEEEEEECCCCCCCH-----HHHHHHCCCCEE Q ss_conf 299999999942--1158899983866---589999999984988999983157765470-----899985211151 Q gi|254780936|r 99 DVELAVDAFEQS--EGLEHLVIFSGDG---CFTTLVAALQRKVKKVTIVSTVLSDPSMAS-----DQLRRQADYFMD 165 (182) Q Consensus 99 Dv~laiD~~~~a--~~~d~~iLvSGD~---Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S-----~~L~~~ad~fi~ 165 (182) .-.++-.+++.. +.=|++|++|.-| ==++++..+|++|.+|+.++-..-....+| ..|-+.||-.|| T Consensus 92 ~~g~a~~il~~~~i~~~Dvlii~SnSG~N~~pVE~A~~ak~~G~~VIaiTS~~~s~~~~srH~SGkkL~d~aDiviD 168 (247) T PRK02947 92 IEGYAKLILDRYPIRPGDVLIIVSNSGRNAVPIEMALEAKERGAKVIAVTSLAYSKSVASRHSSGKRLYEVADVVID 168 (247) T ss_pred CCCHHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHHHHHHHCCCEEEEEECHHHHCCCCCCCCCCCEEHHHCCEEEE T ss_conf 55099999986799999889999678777689999999998699699996678816789989766711563667865 No 32 >TIGR03127 RuMP_HxlB 6-phospho 3-hexuloisomerase. Members of this protein family are 6-phospho 3-hexuloisomerase (PHI), or the PHI domain of a fusion protein. This enzyme is part of the ribulose monophosphate (RuMP) pathway, which in one direction removes the toxic metabolite formaldehyde by assimilation into fructose-6-phosphate. In the other direction, in species lacking a complete pentose phosphate pathway, the RuMP pathway yields ribulose-5-phosphate, necessary for nucleotide biosynthesis, at the cost of also yielding formaldehyde. These latter species tend usually have a formaldehyde-activating enzyme to attach formaldehyde to the C1 carrier tetrahydromethanopterin. Probab=85.86 E-value=2.6 Score=22.38 Aligned_cols=105 Identities=19% Similarity=0.168 Sum_probs=63.7 Q ss_pred CCCHH---HHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHH Q ss_conf 67989---999998517929999997306887314677757999889755956975345895378886543565329999 Q gi|254780936|r 27 DIDYR---KLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELA 103 (182) Q Consensus 27 ~~d~~---~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~la 103 (182) .+|.. ++.+.+...+ +.+.|+.-.. ....+.|...|.+.|+.+... .+.... T Consensus 14 ~id~~~i~~~v~~I~~a~---~I~v~G~G~S-----g~ia~~~a~rL~~~G~~~~~~-----~d~~~~------------ 68 (179) T TIGR03127 14 RIDEEELDKLADKIIKAK---RIFVAGAGRS-----GLVGKAFAMRLMHLGFNVYVV-----GETTTP------------ 68 (179) T ss_pred HCCHHHHHHHHHHHHCCC---CEEEEEECCH-----HHHHHHHHHHHHHCCCEEEEE-----CCCCCC------------ T ss_conf 669999999999997399---0999995543-----999999999998519757997-----644446------------ Q ss_pred HHHHHHHCCCCEEEEECCCH---HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHH Q ss_conf 99999421158899983866---58999999998498899998315776547089998521115189 Q gi|254780936|r 104 VDAFEQSEGLEHLVIFSGDG---CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLA 167 (182) Q Consensus 104 iD~~~~a~~~d~~iLvSGD~---Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~ 167 (182) ....=|.+|++|+-| +.+.+++.+|+.|.+++.++-. ....|.+.||..+.+. T Consensus 69 -----~i~~~Dv~I~iS~SGeT~e~~~~~~~aK~~ga~ii~IT~~------~~S~Lak~aD~~l~ip 124 (179) T TIGR03127 69 -----SIKKGDLLIAISGSGETESLVTVAKKAKEIGATVAAITTN------PESTLGKLADVVVEIP 124 (179) T ss_pred -----CCCCCCEEEEECCCCCCHHHHHHHHHHHHCCCEEEEEECC------CCCHHHHHCCEEEEEC T ss_conf -----6999999999819999689999999999879929999798------9897799499999906 No 33 >cd06305 PBP1_methylthioribose_binding_like Methylthioribose-binding protein-like of ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily. Methylthioribose-binding protein-like of ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily, which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. The sugar-binding domain of the periplasmic proteins in this group is also homologous to the ligand-binding domain of eukaryotic receptors such as metabotropic glutamate receptor (mGluR), DNA-binding transcriptional repressors such as LacI and GalR. Probab=85.81 E-value=3 Score=21.97 Aligned_cols=36 Identities=11% Similarity=-0.002 Sum_probs=27.3 Q ss_pred CCCC-EEEEECCCHHHHHHHHHHHHCCC--EEEEEEEEC Q ss_conf 1158-89998386658999999998498--899998315 Q gi|254780936|r 111 EGLE-HLVIFSGDGCFTTLVAALQRKVK--KVTIVSTVL 146 (182) Q Consensus 111 ~~~d-~~iLvSGD~Df~pli~~lr~~Gk--~V~v~~~~~ 146 (182) +..+ .+|+.++|.--..+++.+++.|+ +|.++++.. T Consensus 180 pd~~~~ai~~~nd~~a~Ga~~Al~~~G~~~~v~v~G~D~ 218 (273) T cd06305 180 PKGGIDAIWAAWDEFAKGAKQALDEAGRTDEIKIYGVDI 218 (273) T ss_pred CCCCEEEEEECCCHHHHHHHHHHHHCCCCCCCEEEEECC T ss_conf 887627999358578999999999769999948999789 No 34 >PRK08835 consensus Probab=85.72 E-value=3 Score=21.95 Aligned_cols=124 Identities=18% Similarity=0.171 Sum_probs=61.6 Q ss_pred CCCCCCEEEEEEEHHHHHHH----HHH----CCCCCC----HHH-HHHHHHHCCEEEEEEEEECC--------------- Q ss_conf 98788718999807899999----985----087679----899-99998517929999997306--------------- Q gi|254780936|r 1 MFDPREKIALFIDGANLYAS----SKA----LGFDID----YRK-LLKAFRSRAIVIRAYYYTTV--------------- 52 (182) Q Consensus 1 m~~~~~rvaIfID~~Nl~~~----~~~----~~~~~d----~~~-L~~~l~~~~~l~~~~~y~~~--------------- 52 (182) |.+-.++.-++|||..+-+- .-. .|...+ |.. |+..+.+...-..+..|... T Consensus 1 m~~~~~~~L~LIDg~slifRAfyA~p~l~ts~G~pTnAv~GF~~mL~kli~~~~P~~iavaFD~~~~TfR~~~y~~YKa~ 80 (931) T PRK08835 1 MATIPENPLILIDGSSYLYRAFHAYPGTMSNGEIPTNAVYGVVNMLRSMMRQFASDRIAVIFDAKGKTFRDDMYPEYKAN 80 (931) T ss_pred CCCCCCCCEEEEECCHHHHHHHHCCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHCCC T ss_conf 99899997799965579999982799878999988306999999999999864998799997498996234340575179 Q ss_pred CCCCHHHHHHH-HHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH Q ss_conf 88731467775-79998897559569753458953788865435653299999999942115889998386658999999 Q gi|254780936|r 53 VGDPEQQFSPL-HPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAA 131 (182) Q Consensus 53 ~~~~~~~~~~~-~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~ 131 (182) +++.++....+ ....+.+...|+.+...+-. .+|=-|+.=+-.....=-.++|||||.|+..||.. T Consensus 81 R~~~pedL~~Q~p~i~e~l~a~gIp~~~~~Gy-------------EADDiIgTla~~~~~~g~~v~IvS~DKDl~QLV~~ 147 (931) T PRK08835 81 RPPMPDDLRCQIEPLHQVIKAMGLPLIAIEGV-------------EADDVIGTLAYQASQAGMPVLISTGDKDMAQLVDD 147 (931) T ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCEEEECCC-------------CHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHCCC T ss_conf 99998789999999999999879988445891-------------49999999999999779939998289976341789 Q ss_pred HHHCCCEEEEEE Q ss_conf 998498899998 Q gi|254780936|r 132 LQRKVKKVTIVS 143 (182) Q Consensus 132 lr~~Gk~V~v~~ 143 (182) .|.++. T Consensus 148 ------~v~v~~ 153 (931) T PRK08835 148 ------NITLIN 153 (931) T ss_pred ------CEEEEE T ss_conf ------859998 No 35 >PRK05797 consensus Probab=84.60 E-value=3.4 Score=21.63 Aligned_cols=114 Identities=14% Similarity=0.105 Sum_probs=56.7 Q ss_pred CCCEEEEEEEHHHHHHHH----HH----CCCCCC----HHH-HHHHHHHCCEEEEEEEEECC---------------CCC Q ss_conf 887189998078999999----85----087679----899-99998517929999997306---------------887 Q gi|254780936|r 4 PREKIALFIDGANLYASS----KA----LGFDID----YRK-LLKAFRSRAIVIRAYYYTTV---------------VGD 55 (182) Q Consensus 4 ~~~rvaIfID~~Nl~~~~----~~----~~~~~d----~~~-L~~~l~~~~~l~~~~~y~~~---------------~~~ 55 (182) .|+|. ++|||..+-|-. .. .|...+ |.. |+..+.....-..+..+... +++ T Consensus 1 ~mk~l-~liDg~~~~~Ra~~a~~~l~~~~G~pt~ai~Gf~~~l~~~~~~~~p~~~~v~fD~~~~tfR~~~y~~YKa~R~~ 79 (869) T PRK05797 1 NMKRL-LILDGNSLMNRAFYALPPLTNSEGIHTNAIYGFTNMLLKMKEEIKPDYIVVAFDKKAPTFRHKEYKDYKAGRKK 79 (869) T ss_pred CCCEE-EEEECCHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHCHHHCCCCCC T ss_conf 99759-99937589999980799888998987054999999999999873998899998593996042123364289999 Q ss_pred CHHHHHHH-HHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH Q ss_conf 31467775-79998897559569753458953788865435653299999999942115889998386658999999 Q gi|254780936|r 56 PEQQFSPL-HPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAA 131 (182) Q Consensus 56 ~~~~~~~~-~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~ 131 (182) .++....+ ....+.+...|+.+...+-. .+|=-|+.=+-....+=-.++|||||.|+..||.. T Consensus 80 ~P~~l~~Q~~~i~~~~~~~gi~~~~~~g~-------------EADDvi~tla~~~~~~g~~v~i~s~DkD~~QLv~~ 143 (869) T PRK05797 80 MPEELAEQFPILKELLDALNINIFEIDGF-------------EADDLIGTLSKFAEEKGIEVYIVTGDKDALQLASD 143 (869) T ss_pred CCHHHHHHHHHHHHHHHHCCCCEEEECCC-------------CHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCC T ss_conf 99799999999999999889987445891-------------58989999999999789958998079860010879 No 36 >cd06316 PBP1_ABC_sugar_binding_like_7 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=84.39 E-value=3.5 Score=21.57 Aligned_cols=116 Identities=17% Similarity=0.045 Sum_probs=59.2 Q ss_pred EEEEEHHHHHHHHHHCCCCCCHHHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCC Q ss_conf 99980789999998508767989999998517929999997306887314677757999889755956975345895378 Q gi|254780936|r 9 ALFIDGANLYASSKALGFDIDYRKLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTEN 88 (182) Q Consensus 9 aIfID~~Nl~~~~~~~~~~~d~~~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~ 88 (182) .-||-.+|.-.+... -..|.+.+.+.+.+.. .+.. ........+...|.+.+...+..+..... T Consensus 101 ~~~v~~Dn~~~G~~a------a~~l~~~~~~~g~v~i--i~~~--~~~~~~~~R~~Gf~~~l~~~~p~i~iv~~------ 164 (294) T cd06316 101 AGIVTDDNYGNGQIA------ADALAKALPGKGKVGL--IYHG--ADYFVTNQRDQGFKETIKKNYPDITIVAE------ 164 (294) T ss_pred EEEEECCHHHHHHHH------HHHHHHHCCCCCCEEE--EECC--CCCHHHHHHHHHHHHHHHHCCCCEEEEEE------ T ss_conf 999948889999999------9999962666880899--9758--99768999999999999720897299984------ Q ss_pred CCCCCCCCCCHHHHHHHHHH----HHCCCCEEEEECCCHHHHHHHHHHHHCCC-EEEEEEEECC Q ss_conf 88654356532999999999----42115889998386658999999998498-8999983157 Q gi|254780936|r 89 CGRKRVKSSMDVELAVDAFE----QSEGLEHLVIFSGDGCFTTLVAALQRKVK-KVTIVSTVLS 147 (182) Q Consensus 89 ~~~~~~kk~~Dv~laiD~~~----~a~~~d~~iLvSGD~Df~pli~~lr~~Gk-~V~v~~~~~~ 147 (182) ....+..-+..+++ --+.++ .|+..+|.--.-+++.+++.|+ .+.++++..+ T Consensus 165 ------~~~~~~~~a~~~~~~~L~~~Pdi~-~I~~~nd~~a~Ga~~Al~~aG~~~v~ivg~d~g 221 (294) T cd06316 165 ------KGIDGPSKAEDIANAMLTQNPDLK-GIYAVWDVPAEGVIAALRAAGRDDIKVTTVDLG 221 (294) T ss_pred ------CCCCCHHHHHHHHHHHHHHCCCCC-EEEECCCHHHHHHHHHHHHCCCCCCEEEEEECC T ss_conf ------388888999999999997399988-899868636599999999819998379999589 No 37 >cd05005 SIS_PHI Hexulose-6-phosphate isomerase (PHI). PHI is a member of the SIS (Sugar ISomerase domain) superfamily. In the ribulose monophosphate pathway of formaldehyde fixation, hexulose-6-phosphate synthase catalyzes the condensation of ribulose-5-phosphate with formadelhyde to become hexulose-6-phosphate, which is then isomerized to fructose-6-phosphate by PHI. Probab=84.18 E-value=3.6 Score=21.51 Aligned_cols=102 Identities=18% Similarity=0.185 Sum_probs=61.4 Q ss_pred HHHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 89999998517929999997306887314677757999889755956975345895378886543565329999999994 Q gi|254780936|r 30 YRKLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ 109 (182) Q Consensus 30 ~~~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~ 109 (182) +.++.+.+.+.+ +.+.|+.-.. ....+.|...|.+.|+.+... .+...+ . T Consensus 23 i~~~~~~I~~a~---~I~~~G~G~S-----g~va~~~a~rl~~lG~~~~~~-----~d~~~~-----------------~ 72 (179) T cd05005 23 LDKLISAILNAK---RIFVYGAGRS-----GLVAKAFAMRLMHLGLNVYVV-----GETTTP-----------------A 72 (179) T ss_pred HHHHHHHHHCCC---CEEEEEECCH-----HHHHHHHHHHHHHCCCCEEEE-----CCCCCC-----------------C T ss_conf 999999997499---4899985625-----877999999997359801243-----565557-----------------7 Q ss_pred HCCCCEEEEECCCH---HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHH Q ss_conf 21158899983866---58999999998498899998315776547089998521115189 Q gi|254780936|r 110 SEGLEHLVIFSGDG---CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLA 167 (182) Q Consensus 110 a~~~d~~iLvSGD~---Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~ 167 (182) ...=|.+|++|+-| +.+.+++.+|+.|.+|+.++-. ....|.+.||..+.+. T Consensus 73 i~~~Dv~I~iS~SG~T~~~~~~~~~aK~~ga~iI~IT~~------~~S~la~~aD~~l~ip 127 (179) T cd05005 73 IGPGDLLIAISGSGETSSVVNAAEKAKKAGAKVVLITSN------PDSPLAKLADVVVVIP 127 (179) T ss_pred CCCCCEEEEECCCCCCHHHHHHHHHHHHCCCEEEEEECC------CCCHHHHHCCEEEECC T ss_conf 999999999819999568999999999879919999798------9997899589999818 No 38 >cd06409 PB1_MUG70 The MUG70 protein is a product of the meiotically up-regulated gene 70 which has a role in meiosis and harbors a PB1 domain. The PB1 domain is a modular domain mediating specific protein-protein interactions which play a role in many critical cell processes such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domains depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic amino acid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module is Probab=83.71 E-value=1.9 Score=23.25 Aligned_cols=30 Identities=13% Similarity=0.101 Sum_probs=26.2 Q ss_pred EEEEECCCHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 899983866589999999984988999983 Q gi|254780936|r 115 HLVIFSGDGCFTTLVAALQRKVKKVTIVST 144 (182) Q Consensus 115 ~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~ 144 (182) ..|++|.|+|++.+|...|..|.++.-... T Consensus 54 D~VliTsD~DL~eaV~~aR~~g~~~v~L~v 83 (86) T cd06409 54 DIVLITSDSDLVAAVLVARSAGLKKLDLHL 83 (86) T ss_pred CEEEEEECHHHHHHHHHHHHCCCCEEEEEE T ss_conf 989994036699999999984997689983 No 39 >cd06312 PBP1_ABC_sugar_binding_like_4 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=83.63 E-value=3.8 Score=21.37 Aligned_cols=89 Identities=15% Similarity=0.044 Sum_probs=55.1 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH----CCCCEEEEECCCHHHHHHHHH Q ss_conf 3146777579998897559569753458953788865435653299999999942----115889998386658999999 Q gi|254780936|r 56 PEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS----EGLEHLVIFSGDGCFTTLVAA 131 (182) Q Consensus 56 ~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a----~~~d~~iLvSGD~Df~pli~~ 131 (182) ......+...|.+.|...+..+.... .+.|..-+.++++.. ++++ .|+.++|.--..+++. T Consensus 137 ~~~~~~R~~Gf~~~l~~~~~~~~~~~--------------~~~d~~~a~~~~~~~L~~~pdi~-~I~~~nd~~a~ga~~A 201 (271) T cd06312 137 NVTLEDRCAGFADGLGGAGITEEVIE--------------TGADPTEVASRIAAYLRANPDVD-AVLTLGAPSAAPAAKA 201 (271) T ss_pred CHHHHHHHHHHHHHHHHCCCEEEEEE--------------CCCCHHHHHHHHHHHHHHCCCCC-EEEECCCHHHHHHHHH T ss_conf 86999999999999875598079985--------------79999999999999998498988-8998896789999999 Q ss_pred HHHCCC--EEEEEEEECCCCCCCHHHHHHHCCCCE Q ss_conf 998498--899998315776547089998521115 Q gi|254780936|r 132 LQRKVK--KVTIVSTVLSDPSMASDQLRRQADYFM 164 (182) Q Consensus 132 lr~~Gk--~V~v~~~~~~~~~~~S~~L~~~ad~fi 164 (182) +++.|+ +|.++++... ...|...-+..+ T Consensus 202 l~~ag~~~~i~vvg~D~~-----~~~~~~i~~G~i 231 (271) T cd06312 202 LKQAGLKGKVKLGGFDLS-----PATLQAIKAGYI 231 (271) T ss_pred HHHCCCCCCEEEEEECCC-----HHHHHHHHCCCC T ss_conf 996699998599998899-----999999986995 No 40 >pfam10740 DUF2529 Protein of unknown function (DUF2529). This family is conserved in the Bacillales. The function is not known. Several members are annotated as being YWJG, a protein expressed downstream of pyrG, a gene encoding for cytidine triphosphate synthetase. Probab=83.06 E-value=1.7 Score=23.54 Aligned_cols=59 Identities=24% Similarity=0.261 Sum_probs=43.9 Q ss_pred CCCCEEEEEC---CCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHHHHHHH Q ss_conf 1158899983---86658999999998498899998315776547089998521115189978762 Q gi|254780936|r 111 EGLEHLVIFS---GDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAYLKNEI 173 (182) Q Consensus 111 ~~~d~~iLvS---GD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~l~~~i 173 (182) ..-|++.|+| +|.+-+.++++|+++|..+.+++.. .+.. ..|.+.||.||++.-.++.+ T Consensus 80 t~tDRVLlfs~~~~d~e~~~~a~~L~~~gi~~v~Vs~~--~~~~--~~l~~~~~vhId~~l~~~Lv 141 (172) T pfam10740 80 TETDRVLLFSRFSNDEEAVALAKKLYDQGIDVVAVSPN--AKEG--DSLADHADVHIDLKLKRPLV 141 (172) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHHHHCCCCEEEEECC--CCCC--CCCHHHEEEEEECCCCCCCC T ss_conf 75576999769999789999999999869988998168--8888--76144402667647787866 No 41 >cd06309 PBP1_YtfQ_like Periplasmic binding domain of ABC-type YtfQ-like transport systems. Periplasmic binding domain of ABC-type YtfQ-like transport systems. The YtfQ protein from Escherichia coli is up-regulated under glucose-limited conditions and shares homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. Members of this group are predicted to be involved in the transport of sugar-containing molecules across cellular and organellar membranes; however their ligand specificity is not determined experimentally. Probab=82.80 E-value=4 Score=21.16 Aligned_cols=79 Identities=13% Similarity=0.064 Sum_probs=48.1 Q ss_pred CCHHHHHHHHHHHHHHHHC-CCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH----CCCCEEEEECCCHHHHHHH Q ss_conf 7314677757999889755-9569753458953788865435653299999999942----1158899983866589999 Q gi|254780936|r 55 DPEQQFSPLHPLLDWLHYN-GFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS----EGLEHLVIFSGDGCFTTLV 129 (182) Q Consensus 55 ~~~~~~~~~~~~~~~l~~~-g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a----~~~d~~iLvSGD~Df~pli 129 (182) .......+...|.+.|.+. ++++.... .++++..-+..+++.. +.-=++|+.++|.--+-++ T Consensus 135 ~~~~~~~R~~Gf~~~l~~~~~~~iv~~~-------------~~~~~~~~~~~~~~~~L~~~p~~i~ai~~~nd~~a~Ga~ 201 (273) T cd06309 135 GSSVAIDRKKGFAEVIKKYPNMKIVASQ-------------TGDFTRAKGKEVMEALLKAHGDDIDAVYAHNDEMALGAI 201 (273) T ss_pred CCHHHHHHHHHHHHHHHHCCCCEEEEEE-------------CCCCCHHHHHHHHHHHHHHCCCCCCEEEECCCHHHHHHH T ss_conf 9859999999999999878998676541-------------234688999999999997278888889867857999999 Q ss_pred HHHHHCCC----EEEEEEEEC Q ss_conf 99998498----899998315 Q gi|254780936|r 130 AALQRKVK----KVTIVSTVL 146 (182) Q Consensus 130 ~~lr~~Gk----~V~v~~~~~ 146 (182) +.|++.|+ .|.++++.. T Consensus 202 ~al~~~G~~~~~di~ivg~D~ 222 (273) T cd06309 202 QAIKAAGKKPGKDIKIVSIDG 222 (273) T ss_pred HHHHHCCCCCCCCEEEEEECC T ss_conf 999983999999869999899 No 42 >cd01996 Alpha_ANH_like_III This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins is predicted to bind ATP. This domain has a strongly conserved motif SGGKD at the N terminus. Probab=81.96 E-value=3.8 Score=21.35 Aligned_cols=31 Identities=13% Similarity=0.006 Sum_probs=15.6 Q ss_pred EEECCCHHHHHHHHHHHHCCCEEEEEEEECC Q ss_conf 9983866589999999984988999983157 Q gi|254780936|r 117 VIFSGDGCFTTLVAALQRKVKKVTIVSTVLS 147 (182) Q Consensus 117 iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~ 147 (182) .....-+=|.-+++.+.+.+...++.++..+ T Consensus 83 ~~~~~~~i~~~~~~~A~k~~IplIi~Gen~~ 113 (154) T cd01996 83 CWPCDTAIFTSLYKVALKFGIPLIITGENPA 113 (154) T ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEECCCCC T ss_conf 4889987998999999993999899727835 No 43 >PRK09701 D-allose transporter subunit; Provisional Probab=81.88 E-value=4.4 Score=20.95 Aligned_cols=89 Identities=13% Similarity=0.028 Sum_probs=50.8 Q ss_pred CHHHHHHHHHHHHHHHHCC-CEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH--CCCCE-EEEECCCHHHHHHHHH Q ss_conf 3146777579998897559-569753458953788865435653299999999942--11588-9998386658999999 Q gi|254780936|r 56 PEQQFSPLHPLLDWLHYNG-FQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS--EGLEH-LVIFSGDGCFTTLVAA 131 (182) Q Consensus 56 ~~~~~~~~~~~~~~l~~~g-~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a--~~~d~-~iLvSGD~Df~pli~~ 131 (182) ......+...|.+.+...+ +++.... -.++|..-+.++++.. .+-|. .|+.++|.--.-+++. T Consensus 168 ~~~~~~R~~G~~~~~~~~~~~~iv~~~-------------~~~~d~~~a~~~~~~~L~a~Pdi~~I~~~nd~~a~Ga~~A 234 (311) T PRK09701 168 NASGEARRNGATEAFKKASQIKLVASQ-------------PADWDRIKALDVATNVLQRNPNIKAIYCANDTMAMGVAQA 234 (311) T ss_pred CHHHHHHHHHHHHHHHHCCCCEEEEEE-------------CCCCCHHHHHHHHHHHHHCCCCCCEEEECCCHHHHHHHHH T ss_conf 617999998899998757984677320-------------3776587799999999954999768997797268999999 Q ss_pred HHHCCC--EEEEEEEECCCCCCCHHHHHHHCCC Q ss_conf 998498--8999983157765470899985211 Q gi|254780936|r 132 LQRKVK--KVTIVSTVLSDPSMASDQLRRQADY 162 (182) Q Consensus 132 lr~~Gk--~V~v~~~~~~~~~~~S~~L~~~ad~ 162 (182) +++.|+ +|.++++... ...|...-+. T Consensus 235 l~~aG~~~~i~vvG~D~~-----~~~~~~I~~G 262 (311) T PRK09701 235 VANAGKTGKVLVVGTDGI-----PEARKMVEAG 262 (311) T ss_pred HHHCCCCCCEEEEEECCC-----HHHHHHHHCC T ss_conf 996699998399997899-----9999999859 No 44 >PRK10355 xylF D-xylose transporter subunit XylF; Provisional Probab=81.81 E-value=4.4 Score=20.93 Aligned_cols=61 Identities=20% Similarity=0.118 Sum_probs=36.2 Q ss_pred CCHHHHHHHHHHHH--CCCC--EEEEECCCHHHHHHHHHHHHCCC--EEEEEEEECCCCCCCHHHHHHHCCC Q ss_conf 53299999999942--1158--89998386658999999998498--8999983157765470899985211 Q gi|254780936|r 97 SMDVELAVDAFEQS--EGLE--HLVIFSGDGCFTTLVAALQRKVK--KVTIVSTVLSDPSMASDQLRRQADY 162 (182) Q Consensus 97 ~~Dv~laiD~~~~a--~~~d--~~iLvSGD~Df~pli~~lr~~Gk--~V~v~~~~~~~~~~~S~~L~~~ad~ 162 (182) +++-.-+...|+.. .+-+ ..|+..+|.==.-+++.|++.|+ +|.++++.. +...|+...++ T Consensus 191 ~w~~~~A~~~me~~L~a~~~~id~V~a~ND~mAlGA~~Al~~aG~~~~v~V~G~Dg-----~~~al~~I~~G 257 (330) T PRK10355 191 GWLPENALKIMENALTANNNKIDAVVASNDATAGGAIQALSAQGLSGKVAISGQDA-----DLAAIKRIVAG 257 (330) T ss_pred CCCHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEECCCC-----CHHHHHHHHCC T ss_conf 42288999999999960998516999899714899999999669999976990279-----98999999749 No 45 >cd06317 PBP1_ABC_sugar_binding_like_8 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Pperiplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=81.60 E-value=4.5 Score=20.89 Aligned_cols=81 Identities=16% Similarity=0.017 Sum_probs=48.4 Q ss_pred CCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH--H---CCCCEEEEECCCHHHHHH Q ss_conf 87314677757999889755956975345895378886543565329999999994--2---115889998386658999 Q gi|254780936|r 54 GDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ--S---EGLEHLVIFSGDGCFTTL 128 (182) Q Consensus 54 ~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~--a---~~~d~~iLvSGD~Df~pl 128 (182) ........+...|.+.|...+..+...... ..+++..-+...++. . +.+| +|+..+|.--.-+ T Consensus 135 ~~~~~~~~R~~Gf~~~l~~~~~~i~iv~~~-----------~~~~~~~~a~~~~~~lL~~~~~~i~-~I~~~nd~~a~Ga 202 (275) T cd06317 135 PGNGTAIERQKGFEDELAEVCPGVEVLDTQ-----------PADWDREKAQVAMEALITKFGDDID-GVYAGDDNMARGA 202 (275) T ss_pred CCCHHHHHHHHHHHHHHHHHCCCCEEEEEE-----------ECCCCHHHHHHHHHHHHHHCCCCCC-EEECCCCHHHHHH T ss_conf 998599999999999999858875698887-----------4147789999999999873457533-4405895789999 Q ss_pred HHHHHHCCC--EEEEEEEEC Q ss_conf 999998498--899998315 Q gi|254780936|r 129 VAALQRKVK--KVTIVSTVL 146 (182) Q Consensus 129 i~~lr~~Gk--~V~v~~~~~ 146 (182) ++.+++.|+ +|.++++.. T Consensus 203 ~~Al~~~G~~~~v~vvg~d~ 222 (275) T cd06317 203 LNAAKEAGLAGGIVIVGANN 222 (275) T ss_pred HHHHHHCCCCCCCEEEEECC T ss_conf 99999749989987998798 No 46 >PRK05354 arginine decarboxylase; Provisional Probab=81.42 E-value=4.5 Score=20.85 Aligned_cols=116 Identities=17% Similarity=0.184 Sum_probs=70.9 Q ss_pred CCCCHHHHHHHHHHCCE-E-EEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCC------------- Q ss_conf 76798999999851792-9-99999730688731467775799988975595697534589537888------------- Q gi|254780936|r 26 FDIDYRKLLKAFRSRAI-V-IRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCG------------- 90 (182) Q Consensus 26 ~~~d~~~L~~~l~~~~~-l-~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~------------- 90 (182) ..+|...+.+.+...+- + +-.++ .... ......-...|..+.+..|++-...++.+.+.+.. T Consensus 46 ~~idL~~lv~~l~~~G~~~P~llRF-~dIL--~~ri~~l~~aF~~Ai~e~~Y~g~y~gVyPIKVNQ~r~VVeeI~~~G~~ 122 (634) T PRK05354 46 GAIDLAELVKELRARGLRLPLLLRF-PDIL--QDRVRSLNAAFKKAIEEYGYQGDYQGVYPIKVNQQRHVVEEIVASGKP 122 (634) T ss_pred CCCCHHHHHHHHHHCCCCCCEEEEC-HHHH--HHHHHHHHHHHHHHHHHHCCCCCCCEECCEEECCHHHHHHHHHHHCCC T ss_conf 8747999999999769999779964-6999--999999999999999982988866304223436528999999972765 Q ss_pred -CCCCCCCCHHHHHHHHHHHHCCCCEEEEECC--CHHHHHHHHHHHHCCCEEEEEEEE Q ss_conf -6543565329999999994211588999838--665899999999849889999831 Q gi|254780936|r 91 -RKRVKSSMDVELAVDAFEQSEGLEHLVIFSG--DGCFTTLVAALQRKVKKVTIVSTV 145 (182) Q Consensus 91 -~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSG--D~Df~pli~~lr~~Gk~V~v~~~~ 145 (182) ..+-|.|.--+|.+ +|.+..+-+..|++-| |.+|+.++-..++.|++|+++-+. T Consensus 123 ~~~GLEAGSKpEL~a-~la~~~~~~aliiCNGyKD~eyI~LAl~a~klG~~v~iViEk 179 (634) T PRK05354 123 YNLGLEAGSKPELMA-VLALANDPGSLIVCNGYKDREYIRLALIGRKLGHKVFIVIEK 179 (634) T ss_pred CCCCCCCCCHHHHHH-HHHHCCCCCCEEEECCCCCHHHHHHHHHHHHCCCCEEEEECC T ss_conf 655553586899999-986237999559956827699999999999769966999746 No 47 >cd01995 ExsB ExsB is a transcription regulator related protein. It is a subfamily of a Adenosine nucleotide binding superfamily of proteins. This protein family is represented by a single member in nearly every completed large ( 1000 genes) prokaryotic genome. In Rhizobium meliloti, a species in which the exo genes make succinoglycan, a symbiotically important exopolysaccharide, exsB is located nearby and affects succinoglycan levels, probably through polar effects on exsA expression or the same polycistronic mRNA. In Arthrobacter viscosus, the homologous gene is designated ALU1 and is associated with an aluminum tolerance phenotype. The function is unknown Probab=81.11 E-value=4.7 Score=20.78 Aligned_cols=35 Identities=9% Similarity=0.066 Sum_probs=20.7 Q ss_pred HHHHHHHHHHHC--CCEEEEEEEECCCCCCCHHHHHHHCC Q ss_conf 589999999984--98899998315776547089998521 Q gi|254780936|r 124 CFTTLVAALQRK--VKKVTIVSTVLSDPSMASDQLRRQAD 161 (182) Q Consensus 124 Df~pli~~lr~~--Gk~V~v~~~~~~~~~~~S~~L~~~ad 161 (182) +|+..++.+-+. +..|.+.+ |.-.++-.+..+.+- T Consensus 98 ~F~~a~~~~~~~~~~~~v~i~a---P~~~~~K~eiv~~g~ 134 (169) T cd01995 98 EFIEAMNKALNLGTENGIKIHA---PLIDLSKAEIVRLGG 134 (169) T ss_pred HHHHHHHHHHHHCCCCCCEEEE---CCCCCCHHHHHHHHH T ss_conf 9999999999830879858984---333598999999998 No 48 >PRK02155 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=80.74 E-value=4.8 Score=20.70 Aligned_cols=79 Identities=18% Similarity=0.204 Sum_probs=51.5 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHC Q ss_conf 31467775799988975595697534589537888654356532999999999421158899983866589999999984 Q gi|254780936|r 56 PEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRK 135 (182) Q Consensus 56 ~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~ 135 (182) ........+...++|.+.|+++..-+-.....+ .. + .-+.+.-+.+.++|.+|.+-|||-+..+++.+... T Consensus 16 ~~~~~~~~~~l~~~L~~~g~~v~~e~~~a~~~~-----~~-~---~~~~~~~~~~~~~Dlvi~lGGDGTlL~~a~~~~~~ 86 (291) T PRK02155 16 TPGIAEPLEALAACIAKRGFEVVFEADTARNTG-----LT-G---YPALTPAEIGARADVAVVLGGDGTMLGIGRQLAPY 86 (291) T ss_pred CHHHHHHHHHHHHHHHHCCCEEEEEHHHHHHCC-----CC-C---CCCCCHHHHCCCCCEEEEECCCHHHHHHHHHHHCC T ss_conf 868999999999999978899999555777639-----98-8---77479799463767899976788999999987115 Q ss_pred CCEEEEEE Q ss_conf 98899998 Q gi|254780936|r 136 VKKVTIVS 143 (182) Q Consensus 136 Gk~V~v~~ 143 (182) ++.|..+- T Consensus 87 ~~PilGiN 94 (291) T PRK02155 87 GTPLIGIN 94 (291) T ss_pred CCCEEEEE T ss_conf 99689985 No 49 >cd01538 PBP1_ABC_xylose_binding Periplasmic xylose-binding component of the ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily. Periplasmic xylose-binding component of the ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily, which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes a transition from an open to a closed conformational state upon ligand binding. Moreover, the periplasmic xylose-binding protein is homologous to the ligand-binding domain of eukaryotic receptors such as glutamate receptor (GluR) and DNA-binding transcriptional repressors such as LacI and GalR. Probab=80.40 E-value=4.9 Score=20.63 Aligned_cols=93 Identities=15% Similarity=0.043 Sum_probs=51.7 Q ss_pred HHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH--CCC-C-EEEEECCCHHHHHHHHHHH Q ss_conf 46777579998897559569753458953788865435653299999999942--115-8-8999838665899999999 Q gi|254780936|r 58 QQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS--EGL-E-HLVIFSGDGCFTTLVAALQ 133 (182) Q Consensus 58 ~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a--~~~-d-~~iLvSGD~Df~pli~~lr 133 (182) ....+...|.+.|...+-......+... .-.+++..-+...++.. .+. + .+|+...|.--.-+++.|+ T Consensus 139 ~~~~r~~G~~~~l~~~~~~~~~~~v~~~--------~~~~~~~~~a~~~~~~~L~~~~~~i~aI~a~nD~~A~Ga~~Al~ 210 (288) T cd01538 139 NAKLFFNGAMSVLKPLIDSGKITIVGEV--------ATPDWDPETAQKRMENALTANYNKVDGVLAANDGTAGGAIAALK 210 (288) T ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEEEE--------CCCCCCHHHHHHHHHHHHHHCCCCCCEEEECCCHHHHHHHHHHH T ss_conf 5999998799999762324780899863--------36785778899999999984655575898558567799999999 Q ss_pred HCCC--EEEEEEEECCCCCCCHHHHHHHCCCC Q ss_conf 8498--89999831577654708999852111 Q gi|254780936|r 134 RKVK--KVTIVSTVLSDPSMASDQLRRQADYF 163 (182) Q Consensus 134 ~~Gk--~V~v~~~~~~~~~~~S~~L~~~ad~f 163 (182) +.|+ +|.++++.. +...|+...+.- T Consensus 211 ~aG~~~~v~vvG~Dg-----~~~~l~~I~~G~ 237 (288) T cd01538 211 AAGLAGKPPVTGQDA-----ELAAVQRIVEGT 237 (288) T ss_pred HCCCCCCCEEEECCC-----CHHHHHHHHCCC T ss_conf 749999873983599-----899999997398 No 50 >cd05014 SIS_Kpsf KpsF-like protein. KpsF is an arabinose-5-phosphate isomerase which contains SIS (Sugar ISomerase) domains. SIS domains are found in many phosphosugar isomerases and phosphosugar binding proteins. KpsF catalyzes the reversible reaction of ribulose 5-phosphate to arabinose 5-phosphate. This is the second step in the CMP-Kdo biosynthesis pathway. Probab=80.26 E-value=4.9 Score=20.62 Aligned_cols=84 Identities=14% Similarity=0.162 Sum_probs=52.5 Q ss_pred HHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCH---HHHHHHHHHHHCCCEEE Q ss_conf 799988975595697534589537888654356532999999999421158899983866---58999999998498899 Q gi|254780936|r 64 HPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDG---CFTTLVAALQRKVKKVT 140 (182) Q Consensus 64 ~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~---Df~pli~~lr~~Gk~V~ 140 (182) +.+...|.+.|+....-. |.++..-.+.....=|.+|++|.-| +.+.+++.+|++|.+++ T Consensus 16 ~~~~~kl~r~G~~s~~~~-----------------~~~~~~~~~~~i~~~Dv~i~iS~SG~T~e~~~~~~~ak~~g~~vI 78 (128) T cd05014 16 RKIAATLSSTGTPAFFLH-----------------PTEALHGDLGMVTPGDVVIAISNSGETDELLNLLPHLKRRGAPII 78 (128) T ss_pred HHHHHHHHHCCCCEEECC-----------------CHHHHHCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHCCCCEE T ss_conf 999999995799457547-----------------315554113477899999999799998679999999986378589 Q ss_pred EEEEECCCCCCCHHHHHHHCCCCEEHHHHH Q ss_conf 998315776547089998521115189978 Q gi|254780936|r 141 IVSTVLSDPSMASDQLRRQADYFMDLAYLK 170 (182) Q Consensus 141 v~~~~~~~~~~~S~~L~~~ad~fi~l~~l~ 170 (182) .++-. ....|.+.||..+.+.... T Consensus 79 ~iT~~------~~S~La~~ad~~l~~~~~e 102 (128) T cd05014 79 AITGN------PNSTLAKLSDVVLDLPVEE 102 (128) T ss_pred EEECC------CCCHHHHHCCEEEECCCCC T ss_conf 99879------9996799689899779986 No 51 >COG1737 RpiR Transcriptional regulators [Transcription] Probab=80.04 E-value=5.1 Score=20.56 Aligned_cols=112 Identities=18% Similarity=0.097 Sum_probs=64.5 Q ss_pred CCCHHHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHH Q ss_conf 67989999998517929999997306887314677757999889755956975345895378886543565329999999 Q gi|254780936|r 27 DIDYRKLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDA 106 (182) Q Consensus 27 ~~d~~~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~ 106 (182) .+|...|-+.+..-..-.+.++|+.... ...-.-+...|.+.|..+... .|..+.... T Consensus 114 ~l~~~~l~~av~~L~~A~rI~~~G~g~S-----~~vA~~~~~~l~~ig~~~~~~-----------------~d~~~~~~~ 171 (281) T COG1737 114 LLDEEALERAVELLAKARRIYFFGLGSS-----GLVASDLAYKLMRIGLNVVAL-----------------SDTHGQLMQ 171 (281) T ss_pred HCCHHHHHHHHHHHHCCCCEEEEECCCH-----HHHHHHHHHHHHHCCCCEEEE-----------------CCCHHHHHH T ss_conf 3689999999999970695799964835-----999999999999819966996-----------------684179999 Q ss_pred HHHHCCCCEEEEECCCH---HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEH Q ss_conf 99421158899983866---5899999999849889999831577654708999852111518 Q gi|254780936|r 107 FEQSEGLEHLVIFSGDG---CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDL 166 (182) Q Consensus 107 ~~~a~~~d~~iLvSGD~---Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l 166 (182) ......=|.+|.+|--+ +.+.+++.++++|.+|+.++-.. ...|.+.||..+.. T Consensus 172 ~~~~~~~Dv~i~iS~sG~t~e~i~~a~~ak~~ga~vIaiT~~~------~spla~~Ad~~L~~ 228 (281) T COG1737 172 LALLTPGDVVIAISFSGYTREIVEAAELAKERGAKVIAITDSA------DSPLAKLADIVLLV 228 (281) T ss_pred HHHCCCCCEEEEEECCCCCHHHHHHHHHHHHCCCEEEEEECCC------CCCHHHHCCEEEEE T ss_conf 9838999889998489996899999999997799299983999------99366654778871 No 52 >PRK11106 queuosine biosynthesis protein QueC; Provisional Probab=79.18 E-value=5.4 Score=20.39 Aligned_cols=30 Identities=3% Similarity=-0.098 Sum_probs=16.6 Q ss_pred HHHHHHHHHHHHCCC--EEEEEEEECCCCCCCHHH Q ss_conf 658999999998498--899998315776547089 Q gi|254780936|r 123 GCFTTLVAALQRKVK--KVTIVSTVLSDPSMASDQ 155 (182) Q Consensus 123 ~Df~pli~~lr~~Gk--~V~v~~~~~~~~~~~S~~ 155 (182) .+|+.+++.+-..|. .|.+.+ +.-..+-.+ T Consensus 136 ~eFi~a~~~al~~g~~~~i~i~a---Pl~~l~K~e 167 (231) T PRK11106 136 DEFVKALNHAVVLGMAKDIRFET---PLMWLDKAE 167 (231) T ss_pred HHHHHHHHHHHHHCCCCCEEEEE---CCCCCCHHH T ss_conf 99999999999851789819983---777788889 No 53 >PRK03378 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=78.26 E-value=5.7 Score=20.21 Aligned_cols=79 Identities=18% Similarity=0.243 Sum_probs=50.6 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHC Q ss_conf 31467775799988975595697534589537888654356532999999999421158899983866589999999984 Q gi|254780936|r 56 PEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRK 135 (182) Q Consensus 56 ~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~ 135 (182) .++.........++|.+.|+++..-+.....-+. .+.. ..++.+..+++|.+|.+-|||=|..+++.+... T Consensus 16 ~~~a~~~~~~l~~~L~~~g~~v~ld~~~a~~l~~------~~~~---~~~~~~~~~~~Dlvi~lGGDGT~L~aar~~~~~ 86 (292) T PRK03378 16 HPTALTTHEMLYRWLCTKGYEVIVEQQIAHELQL------KNVK---TGTLAEIGQQADLAVVVGGDGNMLGAARTLARY 86 (292) T ss_pred CHHHHHHHHHHHHHHHHCCCEEEECHHHHHHCCC------CCCC---CCCHHHHCCCCCEEEEECCCHHHHHHHHHHCCC T ss_conf 8589999999999999789989983478765299------8655---478668530568999977788999999985436 Q ss_pred CCEEEEEE Q ss_conf 98899998 Q gi|254780936|r 136 VKKVTIVS 143 (182) Q Consensus 136 Gk~V~v~~ 143 (182) ++.+..+- T Consensus 87 ~~PilGIN 94 (292) T PRK03378 87 DIKVIGIN 94 (292) T ss_pred CCCEEEEE T ss_conf 99689883 No 54 >PRK08786 consensus Probab=78.08 E-value=5.8 Score=20.18 Aligned_cols=61 Identities=25% Similarity=0.307 Sum_probs=37.1 Q ss_pred HHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 99988975595697534589537888654356532999999999421158899983866589999999984988999983 Q gi|254780936|r 65 PLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVST 144 (182) Q Consensus 65 ~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~ 144 (182) ...+.+...|+.+...+-. .+|=-|+.=+...+..=-.++|||||.|+..||. ..|.++.. T Consensus 88 ~i~~~~~~~gi~~~~~~g~-------------EADDii~tla~~~~~~g~~v~IvSgDKD~~QLV~------~~v~i~~~ 148 (927) T PRK08786 88 PMCDIVHALGIDILRIDGV-------------EADDVIGTLALQAAADGLAVTISTGDKDFAQLVR------PGVELVNT 148 (927) T ss_pred HHHHHHHHCCCCEECCCCC-------------CHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHCC------CCEEEEEC T ss_conf 9999999879988543892-------------6899999999999978991999828987524178------98599988 No 55 >TIGR02095 glgA glycogen/starch synthases, ADP-glucose type; InterPro: IPR011835 This family consists of glycogen (or starch) synthases that use ADP-glucose (2.4.1.21 from EC), rather than UDP-glucose (2.4.1.11 from EC) as in animals, as the glucose donor. This enzyme is found in bacteria and plants. Whether the name given is glycogen synthase or starch synthase depends on context, and therefore on substrate.; GO: 0009011 starch synthase activity, 0009250 glucan biosynthetic process. Probab=77.99 E-value=5.8 Score=20.16 Aligned_cols=89 Identities=17% Similarity=0.105 Sum_probs=61.9 Q ss_pred HHCCCEEE--EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCC--CCEEEEECCCHHHHHHHHHHH--HCCCEEEEEEE Q ss_conf 75595697--5345895378886543565329999999994211--588999838665899999999--84988999983 Q gi|254780936|r 71 HYNGFQVV--AKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEG--LEHLVIFSGDGCFTTLVAALQ--RKVKKVTIVST 144 (182) Q Consensus 71 ~~~g~~v~--~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~--~d~~iLvSGD~Df~pli~~lr--~~Gk~V~v~~~ 144 (182) ++.|..+. .+|+..... +-..+||+|+-+.+=-.-+... ...+||=|||-.+-.....+. ++...-..+.. T Consensus 315 ~~lGL~~~Y~~~Pl~~~is---RL~~QKG~Dl~~~a~~~ll~~~~~~Qlv~lG~Gdp~le~~l~~la~~~~~p~~~~~~~ 391 (517) T TIGR02095 315 EELGLPVDYDDVPLFGVIS---RLVEQKGVDLLLAALPELLELGDFGQLVVLGTGDPELEEALRELADHERYPGKVRVII 391 (517) T ss_pred HHHCCCCCCCCCCEEEEEE---CCCCCCCHHHHHHHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 9819887888853799982---2562442789999999997117966899970488799999999999963789489996 Q ss_pred ECCCCCCCHHHHHHHCCCCE Q ss_conf 15776547089998521115 Q gi|254780936|r 145 VLSDPSMASDQLRRQADYFM 164 (182) Q Consensus 145 ~~~~~~~~S~~L~~~ad~fi 164 (182) ++-..-|+.+-..||-|+ T Consensus 392 --~yde~LAh~iyAgaD~~l 409 (517) T TIGR02095 392 --GYDEALAHRIYAGADFFL 409 (517) T ss_pred --ECCHHHHHHHHHCCCEEE T ss_conf --258799998972377688 No 56 >cd06322 PBP1_ABC_sugar_binding_like_12 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. This group includes the periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consist of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=77.98 E-value=5.8 Score=20.16 Aligned_cols=79 Identities=13% Similarity=0.104 Sum_probs=46.5 Q ss_pred HHHHHHHHHHHHHHHHC-CCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHC Q ss_conf 14677757999889755-95697534589537888654356532999999999421158899983866589999999984 Q gi|254780936|r 57 EQQFSPLHPLLDWLHYN-GFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRK 135 (182) Q Consensus 57 ~~~~~~~~~~~~~l~~~-g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~ 135 (182) .....+...|.++|... ++++.... ... ..++ -...+.+++..-+.+ ++|+.++|.--.-++..+++. T Consensus 133 ~~~~~R~~Gf~~al~~~~~~~~~~~~----~~~----~~~~--~~~~~~~~L~~~p~~-~aI~~~nD~~A~Ga~~al~~~ 201 (267) T cd06322 133 QSVVDRVRGFKEALADYPNIKIVAVQ----PGI----TRAE--ALTAAQNILQANPDL-DGIFAFGDDAALGAVSAIKAA 201 (267) T ss_pred HHHHHHHHHHHHHHHHCCCCEEEEEE----CCC----CHHH--HHHHHHHHHHHCCCC-CEEEECCCHHHHHHHHHHHHC T ss_conf 36999999999999977994699983----578----8799--999999998628777-489981879999999999975 Q ss_pred CC-EEEEEEEEC Q ss_conf 98-899998315 Q gi|254780936|r 136 VK-KVTIVSTVL 146 (182) Q Consensus 136 Gk-~V~v~~~~~ 146 (182) |+ .+.++++.. T Consensus 202 g~~di~vvGfD~ 213 (267) T cd06322 202 GRDNVKVIGFDG 213 (267) T ss_pred CCCCCEEEEECC T ss_conf 999988999949 No 57 >cd06302 PBP1_LsrB_Quorum_Sensing Periplasmic binding domain of autoinducer-2 (AI-2) receptor LsrB from Salmonella typhimurium and its close homologs. Periplasmic binding domain of autoinducer-2 (AI-2) receptor LsrB from Salmonella typhimurium and its close homologs from other bacteria. The members of this group are homologous to a family of periplasmic pentose/hexose sugar-binding proteins that function as the primary receptors for chemotaxis and transporters of many sugar based solutes in bacteria and archaea and that are a member of the type I periplasmic binding protein superfamily. LsrB binds a chemically distinct form of the AI-2 signal that lacks boron, in contrast to the Vibrio harveyi AI-2 signaling molecule that has an unusual furanosyl borate diester. Hence, many bacteria coordinate their gene expression according to the local density of their population by producing species specific AI-2. This process of quorum sensing allows LsrB to function as a periplasmic AI-2 binding p Probab=77.91 E-value=5.9 Score=20.15 Aligned_cols=112 Identities=9% Similarity=0.006 Sum_probs=61.0 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH-- Q ss_conf 999985179299999973068873146777579998897559569753458953788865435653299999999942-- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS-- 110 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a-- 110 (182) |.+.+...+++... .+.+ .......+...|.+.+...++.-+...-. ...++|..-+...++.. T Consensus 116 l~~~~g~~g~v~i~---~g~~-~~~~~~~R~~g~~~~l~~~~~~~~~vv~~----------~~~~~~~~~a~~~~~~~L~ 181 (298) T cd06302 116 LAEQMGGKGEYAIF---VGSL-TATNQNAWIDAAKAYQKEKYYPMLELVDR----------QYGDDDADKSYQTAQELLK 181 (298) T ss_pred HHHHHCCCCCEEEE---ECCC-CCHHHHHHHHHHHHHHHHCCCCCEEEEEE----------ECCCCCHHHHHHHHHHHHH T ss_conf 99971379958999---8588-98679999999999986348998799986----------1378748999999999997 Q ss_pred --CCCCEEEEECCCHHHHHHHHHHHHCCC--EEEEEEEECCCCCCCHHHHHHHCCCCE Q ss_conf --115889998386658999999998498--899998315776547089998521115 Q gi|254780936|r 111 --EGLEHLVIFSGDGCFTTLVAALQRKVK--KVTIVSTVLSDPSMASDQLRRQADYFM 164 (182) Q Consensus 111 --~~~d~~iLvSGD~Df~pli~~lr~~Gk--~V~v~~~~~~~~~~~S~~L~~~ad~fi 164 (182) +.++ .|+..+|..-.-+++.+++.|+ +|.++++.. +...|....+..+ T Consensus 182 ~~Pdi~-~I~~~nd~~a~Ga~~Al~~aG~~~~V~vvG~D~-----~~~~~~~I~~G~i 233 (298) T cd06302 182 AYPDLK-GIIGPTSVGIPGAARAVEEAGLKGKVAVTGLGL-----PNQMAPYVKSGAV 233 (298) T ss_pred HCCCCC-EEEECCCHHHHHHHHHHHHCCCCCCEEEEEECC-----CHHHHHHHHCCCC T ss_conf 599912-999789816999999999679999859999688-----7999999985982 No 58 >cd06311 PBP1_ABC_sugar_binding_like_3 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=77.85 E-value=5.9 Score=20.14 Aligned_cols=112 Identities=9% Similarity=0.096 Sum_probs=65.7 Q ss_pred EEEEEHHHHHHHHHHCCCCCCHHHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCC Q ss_conf 99980789999998508767989999998517929999997306887314677757999889755956975345895378 Q gi|254780936|r 9 ALFIDGANLYASSKALGFDIDYRKLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTEN 88 (182) Q Consensus 9 aIfID~~Nl~~~~~~~~~~~d~~~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~ 88 (182) ..||=.+|...+.... .-|.+.+...+++... .+.+. .....+...|.+.++..++++... T Consensus 102 ~~~v~~Dn~~~G~~~a------~~l~~~~~~~g~v~il---~G~~~--~~~~~R~~Gf~~~l~~~~i~vv~~-------- 162 (274) T cd06311 102 DLYVAGDNYGMGRVAG------EYIATKLGGNGNIVVL---RGIPT--PIDNERVDAFDAAIAKYPIKILDR-------- 162 (274) T ss_pred EEEEECCHHHHHHHHH------HHHHHHHCCCCCEEEE---ECCCC--CHHHHHHHHHHHHHHHCCCEEEEE-------- T ss_conf 9999457899999999------9999970887857999---68998--457999999999998779889985-------- Q ss_pred CCCCCCCCCCHHHHHHHHHHHH----CCCCEEEEECCCHHHHHHHHHHHHCCCE--EEEEEEE Q ss_conf 8865435653299999999942----1158899983866589999999984988--9999831 Q gi|254780936|r 89 CGRKRVKSSMDVELAVDAFEQS----EGLEHLVIFSGDGCFTTLVAALQRKVKK--VTIVSTV 145 (182) Q Consensus 89 ~~~~~~kk~~Dv~laiD~~~~a----~~~d~~iLvSGD~Df~pli~~lr~~Gk~--V~v~~~~ 145 (182) ...++|..-+..+++.. ++++ .|+..+|.-=.-+++.|++.|+. +.++++. T Consensus 163 -----~~~~~~~~~a~~~~~~~L~~~pdi~-~Ifa~nD~ma~Ga~~Al~~aG~~~~~~vvG~D 219 (274) T cd06311 163 -----QYANWNRDDAFSVMQDLLTKFPKID-AVWAHDDDMAVGVLAAIKQAGRTDIKFVVGGA 219 (274) T ss_pred -----ECCCCCHHHHHHHHHHHHHCCCCCC-EEEECCCCHHHHHHHHHHHCCCCCCCEEEEEC T ss_conf -----0387568999999999984388888-89988980589999999976999997899965 No 59 >cd06292 PBP1_LacI_like_10 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=76.68 E-value=6.4 Score=19.93 Aligned_cols=102 Identities=12% Similarity=0.055 Sum_probs=60.1 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999999994211 Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEG 112 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~ 112 (182) +.+.+.+.+.-..+ |...+........+...|.++++..|.............. .+ ..-.++..++. .. T Consensus 113 a~~~L~~~G~~~i~--~i~~~~~~~~~~~R~~G~~~a~~~~~~~~~~~~i~~~~~~-----~~--~~~~~~~~~l~--~~ 181 (273) T cd06292 113 AVRHLVALGHRRIG--FASGPGRTVPRRRKIAGFRAALEEAGLEPPEALVARGMFS-----VE--GGQAAAVELLG--SG 181 (273) T ss_pred HHHHHHHHCCCEEE--EECCCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCC-----HH--HHHHHHHHHHH--CC T ss_conf 99999985998089--9517877715999999999999984999771489716777-----78--89999999974--79 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf 58899983866589999999984988----99998315 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) . ++|+++.|.--..+++.+++.|.+ +-|+++.. T Consensus 182 ~-~ai~~~nD~~A~g~~~~l~~~g~~ip~di~VigfD~ 218 (273) T cd06292 182 P-TAIVAASDLMALGAIRAARRRGLRVPEDVSVVGYDD 218 (273) T ss_pred C-CEEEECCCHHHHHHHHHHHHCCCCCCCCEEEEEECC T ss_conf 9-657745859999999999983998999659999998 No 60 >cd06308 PBP1_sensor_kinase_like Periplasmic binding domain of two-component sensor kinase signaling systems. Periplasmic binding domain of two-component sensor kinase signaling systems, some of which are fused with a C-terminal histidine kinase A domain (HisK) and/or a signal receiver domain (REC). Members of this group share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily and are predicted to be involved in sensing of environmental stimuli; their substrate specificities, however, are not known in detail. Probab=76.55 E-value=6.4 Score=19.91 Aligned_cols=91 Identities=13% Similarity=0.068 Sum_probs=55.8 Q ss_pred CHHHHHHHHHHHHHHHHCC-CEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH----CCCCEEEEECCCHHHHHHHH Q ss_conf 3146777579998897559-569753458953788865435653299999999942----11588999838665899999 Q gi|254780936|r 56 PEQQFSPLHPLLDWLHYNG-FQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS----EGLEHLVIFSGDGCFTTLVA 130 (182) Q Consensus 56 ~~~~~~~~~~~~~~l~~~g-~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a----~~~d~~iLvSGD~Df~pli~ 130 (182) ......+...|.+.|...+ .++.... ..+++..-+..+++.. +.++ +|+..+|.-=.-+++ T Consensus 134 ~~~~~~R~~Gf~~~l~~~~~~~vv~~~-------------~~~~~~~~a~~~~~~~L~~~pdi~-aI~a~nD~~A~Ga~~ 199 (270) T cd06308 134 SSPAIERHDGFKEALSKYPKIKIVAQQ-------------DGDWLKEKAEEKMEELLQANPDID-LVYAHNDPMALGAYL 199 (270) T ss_pred CCHHHHHHHHHHHHHHHCCCCEEEEEE-------------CCCCCHHHHHHHHHHHHHHCCCCC-EEEECCCHHHHHHHH T ss_conf 854999999999999866895246686-------------176679999999999985088888-899778658999999 Q ss_pred HHHHCCC--EEEEEEEECCCCCCCHHHHHHHCCC Q ss_conf 9998498--8999983157765470899985211 Q gi|254780936|r 131 ALQRKVK--KVTIVSTVLSDPSMASDQLRRQADY 162 (182) Q Consensus 131 ~lr~~Gk--~V~v~~~~~~~~~~~S~~L~~~ad~ 162 (182) .+++.|+ +|.++++..... ....|....+. T Consensus 200 Al~~aG~~~~i~vvg~D~~~~--~~~~l~~i~~g 231 (270) T cd06308 200 AAKRAGREKEIKFIGIDGLPG--PGGGIEAVRDG 231 (270) T ss_pred HHHHCCCCCCCEEEEECCCCC--HHHHHHHHHCC T ss_conf 999769999958998889933--69999998738 No 61 >cd06300 PBP1_ABC_sugar_binding_like_1 Periplasmic sugar-binding component of uncharacterized ABC-type transport systems that are members of the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily. Periplasmic sugar-binding component of uncharacterized ABC-type transport systems that are members of the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily, which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. Members of this group are predicted to be involved in the transport of sugar-containing molecules across cellular and organellar membranes; however their substrate specificity is not known in detail. Probab=76.12 E-value=6.6 Score=19.84 Aligned_cols=94 Identities=14% Similarity=0.155 Sum_probs=54.8 Q ss_pred HHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHC-CCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH-- Q ss_conf 9998517929999997306887314677757999889755-9569753458953788865435653299999999942-- Q gi|254780936|r 34 LKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYN-GFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS-- 110 (182) Q Consensus 34 ~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~-g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a-- 110 (182) .+.+.+.+++... .+.+ .......+...|.+.|... ++++... ...++|..-+...++.. T Consensus 119 ~~~l~~~g~i~~i---~G~~-~~~~~~~R~~Gf~~~l~~~p~i~iv~~-------------~~~~~~~~~a~~~~~~~L~ 181 (272) T cd06300 119 VKELGGKGNVLVV---RGLA-GHPVDEDRYAGAKEVLKEYPGIKIVGE-------------VYGDWDQAVAQKAVADFLA 181 (272) T ss_pred HHHCCCCCCEEEE---ECCC-CCHHHHHHHHHHHHHHHHCCCEEEEEE-------------ECCCCCHHHHHHHHHHHHH T ss_conf 8503577763899---4488-984999999999999864142488655-------------4488888999999999998 Q ss_pred --CCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEEC Q ss_conf --115889998386658999999998498899998315 Q gi|254780936|r 111 --EGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVL 146 (182) Q Consensus 111 --~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~ 146 (182) +.+|.+ ++.|.+=..+++.+++.|+++.++.... T Consensus 182 ~~pdid~I--~~~d~~a~Ga~~Al~~aG~~~~~~~g~~ 217 (272) T cd06300 182 SNPDVDGI--WTQGGDAVGAVQAFEQAGRDIPPVTGED 217 (272) T ss_pred HCCCCEEE--EECCCCHHHHHHHHHHCCCCCCEEEEEC T ss_conf 68996099--9778218999999996699988699975 No 62 >cd06275 PBP1_PurR Ligand-binding domain of purine repressor, PurR, which functions as the master regulatory protein of de novo purine nucleotide biosynthesis in Escherichia coli. Ligand-binding domain of purine repressor, PurR, which functions as the master regulatory protein of de novo purine nucleotide biosynthesis in Escherichia coli. This dimeric PurR belongs to the LacI-GalR family of transcription regulators and is activated to bind to DNA operator sites by initially binding either of high affinity corepressors, hypoxanthine or guanine. PurR is composed of two functional domains: aan N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the purine transcription repressor undergoes a Probab=75.54 E-value=6.8 Score=19.74 Aligned_cols=101 Identities=12% Similarity=-0.004 Sum_probs=59.0 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH--- Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999999994--- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ--- 109 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~--- 109 (182) +.+++.+.+.-..+ |.+.+........+...|.++++..|..+....+ ..+..+..-+.+++.. T Consensus 108 a~~~Li~~G~~~i~--~i~~~~~~~~~~~R~~Gf~~al~~~~i~~~~~~i-----------~~~~~~~~~~~~~~~~~l~ 174 (269) T cd06275 108 ATRHLIELGHRRIG--CITGPLEKAPAQQRLAGFRRAMAEAGLPVNPGWI-----------VEGDFECEGGYEAMQRLLA 174 (269) T ss_pred HHHHHHHHCCCEEE--ECCCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHH-----------CCCCCCHHHHHHHHHHHHH T ss_conf 99999982997177--5569977716999999999999985999765550-----------3677776889999999997 Q ss_pred HCCCCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf 21158899983866589999999984988----99998315 Q gi|254780936|r 110 SEGLEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 110 a~~~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) ...--++|++++|.--..+++.|++.|.+ +-|++|.. T Consensus 175 ~~~~~~aii~~~d~~A~g~~~~l~~~gi~vP~di~vvgfd~ 215 (269) T cd06275 175 QPKRPTAVFCGNDLMAMGALCAAQEAGLRVPQDLSIIGYDD 215 (269) T ss_pred HCCCCCEEECCCHHHHHHHHHHHHHCCCCCCCCCEEEEECC T ss_conf 08996612318749999999999980887899867998778 No 63 >cd06274 PBP1_FruR Ligand binding domain of DNA transcription repressor specific for fructose (FruR) and its close homologs. Ligand binding domain of DNA transcription repressor specific for fructose (FruR) and its close homologs, all of which are a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to members of the type I periplasmic binding protein superfamily. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor Probab=74.78 E-value=7.1 Score=19.62 Aligned_cols=102 Identities=9% Similarity=0.007 Sum_probs=59.6 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHH----HHHH Q ss_conf 999985179299999973068873146777579998897559569753458953788865435653299999----9999 Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAV----DAFE 108 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~lai----D~~~ 108 (182) +.+.+.+.+.-..++. +..........+...|.+++++.|..+....+. .+..+..-.. +.+. T Consensus 107 a~~~L~~~Ghr~I~~i--~~~~~~~~~~~R~~Gf~~al~~~g~~~~~~~i~-----------~~~~~~~~g~~~~~~~l~ 173 (264) T cd06274 107 LTRELLAAPPEEVLFL--GGLPELSPSRERLAGFRQALADAGLPVQPDWIY-----------AEGYSPESGYQLMAELLA 173 (264) T ss_pred HHHHHHHCCCCEEEEE--CCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEEE-----------ECCCCHHHHHHHHHHHHH T ss_conf 9999998599669997--689877309999999999999779987740798-----------268887899999999998 Q ss_pred HHCCCCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEECC Q ss_conf 421158899983866589999999984988----999983157 Q gi|254780936|r 109 QSEGLEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVLS 147 (182) Q Consensus 109 ~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~~ 147 (182) ..+..-++|++++|.--..+++.+++.|.+ +-|++|... T Consensus 174 ~~~~~p~ai~~~nd~~a~g~l~~l~~~g~~vP~disvvgfDd~ 216 (264) T cd06274 174 RLGRLPRALFTTSYTLLEGVLRFLRERPGLAPSDLRIATFDDH 216 (264) T ss_pred HCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCH T ss_conf 6568874220176999999999999859999998799998986 No 64 >PRK08434 consensus Probab=74.78 E-value=7.1 Score=19.62 Aligned_cols=113 Identities=13% Similarity=0.138 Sum_probs=55.2 Q ss_pred CCEEEEEEEHHHHHH----HHHH----CCCCCC----HHH-HHHHHHHCCEEEEEEEEECC---------------CCCC Q ss_conf 871899980789999----9985----087679----899-99998517929999997306---------------8873 Q gi|254780936|r 5 REKIALFIDGANLYA----SSKA----LGFDID----YRK-LLKAFRSRAIVIRAYYYTTV---------------VGDP 56 (182) Q Consensus 5 ~~rvaIfID~~Nl~~----~~~~----~~~~~d----~~~-L~~~l~~~~~l~~~~~y~~~---------------~~~~ 56 (182) |++ -++|||..+-+ +... .|...+ |.. |+..+.....-..+..|... +++- T Consensus 1 mk~-l~liDg~sl~fRAy~A~~~l~t~~G~pTnAv~GF~~mL~kli~~~~P~~iavaFD~~~~TFRhe~y~~YKAnR~~~ 79 (887) T PRK08434 1 MKT-LTIIDTFGFFFRSYYALPPLKNSQGFPTGLLSGFANFIYSLKNEFKSDYIVFALDSKGKTFRSEIDPNYKANRPPP 79 (887) T ss_pred CCC-EEEEECCHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHCCCCCCC T ss_conf 980-8998064899999707999879999880359999999999998659988999984959962210205752899999 Q ss_pred HHHHHHH-HHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH Q ss_conf 1467775-79998897559569753458953788865435653299999999942115889998386658999999 Q gi|254780936|r 57 EQQFSPL-HPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAA 131 (182) Q Consensus 57 ~~~~~~~-~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~ 131 (182) ++....+ ....+.+...|+.+...+- . .+|=-|+.=+-.....=-.++|||||.|+..||.. T Consensus 80 PedL~~Qip~i~e~l~algIp~~e~~G-----------y--EADDiIgTla~~a~~~g~~v~IvSgDKDl~QLV~d 142 (887) T PRK08434 80 PEELLEQLPIAIEWIEKMGFKSLSQEG-----------Y--EADDIIASFVKFAKEQDIFVRIVSHDKDLYQLIDD 142 (887) T ss_pred CHHHHHHHHHHHHHHHHCCCCEECCCC-----------C--CHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHCCC T ss_conf 989999999999999987998845589-----------2--49999999999999789939998089877761839 No 65 >PRK11337 DNA-binding transcriptional repressor RpiR; Provisional Probab=74.35 E-value=7.3 Score=19.56 Aligned_cols=63 Identities=13% Similarity=0.163 Sum_probs=45.4 Q ss_pred CHHHHHHHHHHHHCCCCEEEEECCCH---HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEH Q ss_conf 32999999999421158899983866---5899999999849889999831577654708999852111518 Q gi|254780936|r 98 MDVELAVDAFEQSEGLEHLVIFSGDG---CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDL 166 (182) Q Consensus 98 ~Dv~laiD~~~~a~~~d~~iLvSGD~---Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l 166 (182) .|.++..-.......=|++|++|--+ +.+.+++.+|++|.+|+.++-. ....|.+.||..+.. T Consensus 174 ~d~~~~~~~a~~l~~~Dv~i~iS~sG~t~e~i~~~~~Ak~~Ga~vI~IT~~------~~SpLa~~aD~vL~~ 239 (293) T PRK11337 174 PDAHIMMMSASLLQEGDVVLVVSHSGRTSDVKAAVELAKQNGAKIICITHS------YHSPIAKLADYIICS 239 (293) T ss_pred CCHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHHCCCEEEEECCC------CCCHHHHHCCEEEEC T ss_conf 777899999971899988999818999889999999999879949997699------998468958998864 No 66 >cd05013 SIS_RpiR RpiR-like protein. RpiR contains a SIS (Sugar ISomerase) domain, which is found in many phosphosugar isomerases and phosphosugar binding proteins. In E. coli, rpiR negatively regulates the expression of rpiB gene. Both rpiB and rpiA are ribose phosphate isomerases that catalyze the reversible reactions of ribose 5-phosphate into ribulose 5-phosphate. Probab=74.16 E-value=7.4 Score=19.53 Aligned_cols=105 Identities=16% Similarity=0.064 Sum_probs=64.1 Q ss_pred HHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 99999985179299999973068873146777579998897559569753458953788865435653299999999942 Q gi|254780936|r 31 RKLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS 110 (182) Q Consensus 31 ~~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a 110 (182) .+..+.+.+. .+.+.|+... .......+...|...|..+...+ |........... T Consensus 4 ~~~~~~i~~a---~~I~i~G~G~-----S~~~A~~~~~~l~~~g~~~~~~~-----------------~~~~~~~~~~~~ 58 (139) T cd05013 4 EKAVDLLAKA---RRIYIFGVGS-----SGLVAEYLAYKLLRLGKPVVLLS-----------------DPHLQLMSAANL 58 (139) T ss_pred HHHHHHHHCC---CEEEEEECCH-----HHHHHHHHHHHHCCCCCEEEECC-----------------CHHHHHHHHHCC T ss_conf 9999999759---9289998081-----59999999999725898279879-----------------627788887459 Q ss_pred CCCCEEEEECCCHH---HHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEH Q ss_conf 11588999838665---899999999849889999831577654708999852111518 Q gi|254780936|r 111 EGLEHLVIFSGDGC---FTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDL 166 (182) Q Consensus 111 ~~~d~~iLvSGD~D---f~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l 166 (182) ..-|.+|++|--+- .+.+++.+|++|.+|+.++-.. ...|.+.||..+.. T Consensus 59 ~~~d~~i~iS~sg~~~~~~~~~~~ak~~g~~ii~IT~~~------~s~l~~~ad~~l~~ 111 (139) T cd05013 59 TPGDVVIAISFSGETKETVEAAEIAKERGAKVIAITDSA------NSPLAKLADIVLLV 111 (139) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHHHHCCCEEEEEECCC------CCHHHHHCCEEEEC T ss_conf 999999997686363789999999998699799997999------99779969999982 No 67 >cd01574 PBP1_LacI Ligand-binding domain of DNA transcription repressor LacI specific for lactose, a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor LacI specific for lactose, a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of LacI is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA b Probab=74.03 E-value=7.4 Score=19.51 Aligned_cols=99 Identities=12% Similarity=-0.064 Sum_probs=60.9 Q ss_pred HHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH-H Q ss_conf 999998517929999997306887314677757999889755956975345895378886543565329999999994-2 Q gi|254780936|r 32 KLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ-S 110 (182) Q Consensus 32 ~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~-a 110 (182) .+.+.+.+.|.-..+ |.+.+........+...|.+++.+.|..+.... .+..+..-+..++.. . T Consensus 106 ~a~~~L~~~G~~~i~--~i~~~~~~~~~~~R~~G~~~al~~~~i~~~~~~-------------~~~~~~~~~~~~~~~l~ 170 (264) T cd01574 106 LATEHLLELGHRTIA--HVAGPEEWLSARARLAGWRAALEAAGIAPPPVL-------------EGDWSAESGYRAGRELL 170 (264) T ss_pred HHHHHHHHCCCCCEE--ECCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEE-------------ECCCCHHHHHHHHHHHH T ss_conf 999999984998273--258997775799999999999998698974488-------------66899799999999998 Q ss_pred --CCCCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf --1158899983866589999999984988----99998315 Q gi|254780936|r 111 --EGLEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 111 --~~~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) +.+ ++|++++|.--.-++..+++.|++ +.+++|.. T Consensus 171 ~~~~~-~aii~~~D~~A~g~i~~l~~~g~~iP~di~iigfd~ 211 (264) T cd01574 171 REGDP-TAVFAANDQMALGVLRALHELGLRVPDDVSVVGFDD 211 (264) T ss_pred HCCCC-CEEEECCHHHHHHHHHHHHHHCCCCCCCCEEEEECC T ss_conf 37999-689636429999999999982998898847998688 No 68 >cd06277 PBP1_LacI_like_1 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=73.55 E-value=7.6 Score=19.43 Aligned_cols=103 Identities=11% Similarity=-0.038 Sum_probs=61.1 Q ss_pred HHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 99999851792999999730688731467775799988975595697534589537888654356532999999999421 Q gi|254780936|r 32 KLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSE 111 (182) Q Consensus 32 ~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~ 111 (182) .+.+.+.+.|.-..++ .+.+........+...|.++++..|.++......... . ..-.-+...+.... T Consensus 108 ~a~~~L~~~Ghr~I~~--i~~~~~~~~~~~R~~Gf~~al~~~gl~~~~~~~~~~~---------~-~~~~~~~~~~~~~~ 175 (268) T cd06277 108 AATEYLIEKGHRKIGF--VGDPLYSPSFEERYEGYKKALLDHGIPFNEDYDITEK---------E-EDEEDIGKFIDELK 175 (268) T ss_pred HHHHHHHHCCCCCEEE--EECCCCCCHHHHHHHHHHHHHHHCCCCCCCCEEEECC---------C-CHHHHHHHHHHHCC T ss_conf 9999999709983699--7278888129999999999999859999822466258---------7-77999999998478 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf 158899983866589999999984988----99998315 Q gi|254780936|r 112 GLEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 112 ~~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) ..-++|++++|.=-..+++.+++.|.+ +-++++.. T Consensus 176 ~~ptAi~~~nD~~A~g~~~~l~~~g~~vP~Disivgfdd 214 (268) T cd06277 176 PLPTAFFCSNDGVAFLLIKVLKEMGIRVPEDVSVIGFDD 214 (268) T ss_pred CCCCEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEEECC T ss_conf 899879968859999999999985999998628999788 No 69 >cd06270 PBP1_GalS_like Ligand binding domain of DNA transcription iso-repressor GalS, which is one of two regulatory proteins involved in galactose transport and metabolism. Ligand binding domain of DNA transcription iso-repressor GalS, which is one of two regulatory proteins involved in galactose transport and metabolism. Transcription of the galactose regulon genes is regulated by Gal iso-repressor (GalS) and Gal repressor (GalR) in different ways, but both repressors recognize the same DNA binding site in the absence of D-galactose. GalS is a dimeric protein like GalR,and its major role is in regulating expression of the high-affinity galactose transporter encoded by the mgl operon, whereas GalR is the exclusive regulator of galactose permease, the low-affinity galactose transporter. GalS and GalR are members of the LacI-GalR family of transcription regulators and both contain the type I periplasmic binding protein-like fold. Hence, they are homologous to the periplasmic sugar bindi Probab=72.47 E-value=8.1 Score=19.27 Aligned_cols=101 Identities=16% Similarity=0.040 Sum_probs=60.1 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH-H- Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999999994-2- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ-S- 110 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~-a- 110 (182) +.+++.+.+.-..+ |.+.+........+...|.+++...|..+....+. ....+..-+.++++. . T Consensus 107 a~~~L~~~G~~~i~--~i~~~~~~~~~~~R~~Gf~~al~~~gl~~~~~~i~-----------~~~~~~~~g~~~~~~ll~ 173 (268) T cd06270 107 ATEHLIELGHRKIA--CITGPLTKEDARLRLQGYRDALAEAGIALDESLII-----------EGDFTEEGGYAAMQELLA 173 (268) T ss_pred HHHHHHHCCCCCEE--EECCCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEE-----------ECCCCHHHHHHHHHHHHH T ss_conf 99999983998389--84598888319999999999999859997800476-----------167767889998899995 Q ss_pred CCCC-EEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf 1158-899983866589999999984988----99998315 Q gi|254780936|r 111 EGLE-HLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 111 ~~~d-~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) .+-+ ++|++++|.=-..+++.+++.|.+ +-|+++.. T Consensus 174 ~~~~~~ai~~~nD~~A~g~~~~l~~~g~~vP~disvigfdd 214 (268) T cd06270 174 RGAPFTAVFCANDEMAAGAISALREHGISVPQDVSIIGFDD 214 (268) T ss_pred CCCCCCEEEECCHHHHHHHHHHHHHCCCCCCCCEEEEEECC T ss_conf 69998745533779999999999982999999769999898 No 70 >PRK13545 tagH teichoic acids export protein ATP-binding subunit; Provisional Probab=72.39 E-value=8.1 Score=19.26 Aligned_cols=63 Identities=19% Similarity=0.232 Sum_probs=41.4 Q ss_pred CHHHHHHHHHHHHCCCCEE----EEECCCHHHHH----HHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHH Q ss_conf 3299999999942115889----99838665899----99999984988999983157765470899985211151899 Q gi|254780936|r 98 MDVELAVDAFEQSEGLEHL----VIFSGDGCFTT----LVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAY 168 (182) Q Consensus 98 ~Dv~laiD~~~~a~~~d~~----iLvSGD~Df~p----li~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~ 168 (182) .-.+|+.-+..+. +.|+. +|-.||..|.. -+..+++.|+.+.+++- .-..+++.||+.+||+. T Consensus 148 MkaRLgFAIA~~~-dPDILIIDEaLSVGD~~F~~Kc~~rm~ef~e~gkTIvfVSH-------sl~~Vk~~C~R~iWLe~ 218 (549) T PRK13545 148 MKSRLGFAISVHI-NPDILVIDEALSVGDQTFTKKCLDKMNEFKEQGKTIFFISH-------SLSQVKSFCTKALWLHY 218 (549) T ss_pred HHHHHHHHHHHHC-CCCEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEC-------CHHHHHHHCCEEEEEEC T ss_conf 8999999999824-99999994620057899999999999999978988999958-------88999985731063438 No 71 >PRK02231 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=72.17 E-value=8.2 Score=19.23 Aligned_cols=71 Identities=17% Similarity=0.236 Sum_probs=47.1 Q ss_pred HHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEE Q ss_conf 79998897559569753458953788865435653299999999942115889998386658999999998498899998 Q gi|254780936|r 64 HPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVS 143 (182) Q Consensus 64 ~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~ 143 (182) +...+||+..|++|..-+...... +.+-.-..++-+...+.|.+|.+-|||=+..+++.+...++.+..+- T Consensus 3 ~~l~~~l~~~g~~v~l~~~~a~~l---------~~~~~~~~~~~e~~~~~Dlii~iGGDGT~L~a~r~~~~~~iPilGiN 73 (272) T PRK02231 3 KNLFHWLMERGYQVLVEKEVGETL---------ELSFNHLASLEEIGQRAQLAIVIGGDGNMLGRARVLAKYDIPLIGIN 73 (272) T ss_pred HHHHHHHHHCCCEEEEEHHHHHHC---------CCCCCCCCCHHHHCCCCCEEEEECCCHHHHHHHHHHHCCCCCEEEEC T ss_conf 899999997899999977998764---------99866647988970177899997875899999998600599789653 No 72 >cd05710 SIS_1 A subgroup of the SIS domain. SIS (Sugar ISomerase) domains are found in many phosphosugar isomerases and phosphosugar binding proteins. SIS domains are also found in proteins that regulate the expression of genes involved in synthesis of phosphosugars. Probab=72.03 E-value=8.3 Score=19.21 Aligned_cols=47 Identities=17% Similarity=0.139 Sum_probs=35.0 Q ss_pred EEEEEC--C-CHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHH Q ss_conf 899983--8-6658999999998498899998315776547089998521115189 Q gi|254780936|r 115 HLVIFS--G-DGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLA 167 (182) Q Consensus 115 ~~iLvS--G-D~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~ 167 (182) .+|++| | -.|-+.+++.+|++|.+++.++-.. ...|.+.||..+... T Consensus 50 lvi~iS~SG~T~e~i~a~~~ak~~g~~tiaiT~~~------~S~la~~aD~~i~~g 99 (120) T cd05710 50 VVILASHSGNTKETVAAAKFAKEKGATVIGLTDDE------DSPLAKLADYVIVYG 99 (120) T ss_pred EEEEECCCCCCHHHHHHHHHHHHCCCEEEEEECCC------CCHHHHHCCEEEECC T ss_conf 99998379897899999999998699599998989------898799679888889 No 73 >cd06319 PBP1_ABC_sugar_binding_like_10 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=70.69 E-value=8.9 Score=19.02 Aligned_cols=85 Identities=8% Similarity=0.131 Sum_probs=50.6 Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHH Q ss_conf 88731467775799988975595697534589537888654356532999999999421158899983866589999999 Q gi|254780936|r 53 VGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAAL 132 (182) Q Consensus 53 ~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~l 132 (182) +........+...|.+++...|..+... ....+. ..+++ ...+-+++..-+.++ .|++++|.--.-+++.+ T Consensus 134 ~~~~~~~~~R~~Gf~~~l~~~~~~~~~~--~~~~~~----~~~~~--~~~~~~~L~~~p~i~-ai~~~~d~~a~Ga~~a~ 204 (277) T cd06319 134 PQKRKNGQKRTKGFKEAMKEAGCDLAGI--RQQKDF----SYQET--FDYTNDLLTANPDIR-AIWLQGSDRYQGALDAI 204 (277) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCCEEEE--EEECCC----CHHHH--HHHHHHHHHHCCCCC-EEEECCCHHHHHHHHHH T ss_conf 8776239999999999998759976799--960899----99999--999999997389976-89966988999999999 Q ss_pred HHCCC--EEEEEEEEC Q ss_conf 98498--899998315 Q gi|254780936|r 133 QRKVK--KVTIVSTVL 146 (182) Q Consensus 133 r~~Gk--~V~v~~~~~ 146 (182) ++.|+ +|.++++.. T Consensus 205 ~~~G~~~~i~vvg~D~ 220 (277) T cd06319 205 ATAGKTGKVLLICFDA 220 (277) T ss_pred HHCCCCCCEEEEECCC T ss_conf 9759999858996379 No 74 >cd05008 SIS_GlmS_GlmD_1 SIS (Sugar ISomerase) domain repeat 1 found in Glucosamine 6-phosphate synthase (GlmS) and Glucosamine-6-phosphate deaminase (GlmD). The SIS domain is found in many phosphosugar isomerases and phosphosugar binding proteins. GlmS contains a N-terminal glutaminase domain and two C-terminal SIS domains and catalyzes the first step in hexosamine metabolism, converting fructose 6-phosphate into glucosamine 6-phosphate using glutamine as nitrogen source. The glutaminase domain hydrolyzes glutamine to glutamate and ammonia. Ammonia is transferred through a channel to the isomerase domain for glucosamine 6-phosphate synthesis. The end product of the pathway is N-acetylglucosamine, which plays multiple roles in eukaryotic cells including being a building block of bacterial and fungal cell walls. In the absence of glutamine, GlmS catalyzes the isomerization of fructose 6-phosphate into glucose 6- phosphate (PGI-like activity). Glucosamine-6-phosphate deaminase (GlmD) cont Probab=70.64 E-value=8.9 Score=19.02 Aligned_cols=51 Identities=22% Similarity=0.223 Sum_probs=38.2 Q ss_pred CCEEEEECCCH---HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHHH Q ss_conf 58899983866---5899999999849889999831577654708999852111518997 Q gi|254780936|r 113 LEHLVIFSGDG---CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAYL 169 (182) Q Consensus 113 ~d~~iLvSGD~---Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~l 169 (182) -|.+|.+|.-| |-+.+++.+|+.|.+++.++-. ....|.+.||..+++.-- T Consensus 47 ~~lvI~iS~SG~t~e~i~a~~~a~~~g~~~i~iT~~------~~S~la~~ad~~i~~~~g 100 (126) T cd05008 47 DTLVIAISQSGETADTLAALRLAKEKGAKTVAITNV------VGSTLAREADYVLYLRAG 100 (126) T ss_pred CCEEEEECCCCCCCCHHHHHHHHHHCCCCEEEEECC------CCCHHHHHCCEEEECCCC T ss_conf 859999868979800788999999829948987069------999789968987986987 No 75 >cd01543 PBP1_XylR Ligand-binding domain of DNA transcription repressor specific for xylose (XylR). Ligand-binding domain of DNA transcription repressor specific for xylose (XylR), a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of XylR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=68.82 E-value=9.7 Score=18.78 Aligned_cols=78 Identities=9% Similarity=-0.024 Sum_probs=45.3 Q ss_pred HHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHH----HHCCCCEEEEECCCHHHHHHHHHHHH Q ss_conf 67775799988975595697534589537888654356532999999999----42115889998386658999999998 Q gi|254780936|r 59 QFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFE----QSEGLEHLVIFSGDGCFTTLVAALQR 134 (182) Q Consensus 59 ~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~----~a~~~d~~iLvSGD~Df~pli~~lr~ 134 (182) ...+.+.|.++++..|+++...+... .....+-..+...+. ..+ --++|++++|.=-+-+++.+++ T Consensus 123 ~~~R~~Gf~~al~~~g~~~~~~~~~~---------~~~~~~~~~~~~~l~~~l~~~~-~ptAi~~~nD~~A~g~l~~l~~ 192 (265) T cd01543 123 SDEREEAFRQLVAEAGYECSFFYRGL---------STDAQSWEEEQEELAQWLQSLP-KPVGIFACTDARARQLLEACRR 192 (265) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCC---------CCCCCCHHHHHHHHHHHHHCCC-CCEEEEECCHHHHHHHHHHHHH T ss_conf 99999999999997799976341245---------4467539999999999983599-9839998836999999999998 Q ss_pred CCCEE----EEEEEEC Q ss_conf 49889----9998315 Q gi|254780936|r 135 KVKKV----TIVSTVL 146 (182) Q Consensus 135 ~Gk~V----~v~~~~~ 146 (182) +|.+| -|++|.. T Consensus 193 ~Gi~VP~disViGfD~ 208 (265) T cd01543 193 AGIAVPEEVAVLGVDN 208 (265) T ss_pred HHCCCCCCEEEEEECC T ss_conf 3225777559999828 No 76 >COG4821 Uncharacterized protein containing SIS (Sugar ISomerase) phosphosugar binding domain [General function prediction only] Probab=67.63 E-value=10 Score=18.62 Aligned_cols=56 Identities=25% Similarity=0.210 Sum_probs=40.3 Q ss_pred HCCCCEEEEECCCHH---HHHHHHHHHHCCCEEEEEEEECCCCCCCH-----HHHHHHCCCCEE Q ss_conf 211588999838665---89999999984988999983157765470-----899985211151 Q gi|254780936|r 110 SEGLEHLVIFSGDGC---FTTLVAALQRKVKKVTIVSTVLSDPSMAS-----DQLRRQADYFMD 165 (182) Q Consensus 110 a~~~d~~iLvSGD~D---f~pli~~lr~~Gk~V~v~~~~~~~~~~~S-----~~L~~~ad~fi~ 165 (182) .+.=|+++++|.-+- =+.+.+..|++|..|++++...-....+| ..|-+.+|-.++ T Consensus 102 i~~~DVliviSnSGrNpvpie~A~~~rekGa~vI~vTSl~ySq~~~SRh~SGK~Ly~~aDvVlD 165 (243) T COG4821 102 IRPNDVLIVISNSGRNPVPIEVAEYAREKGAKVIAVTSLDYSQSQASRHKSGKLLYEFADVVLD 165 (243) T ss_pred CCCCCEEEEEECCCCCCCCHHHHHHHHHCCCEEEEEEHHHHHHHCHHCCCCHHHHHHHCCEEEE T ss_conf 8878889998079989844389999986597699985454323111015415677652105651 No 77 >cd06306 PBP1_TorT-like TorT-like proteins, a periplasmic binding protein family that activates induction of the Tor respiratory system upon trimethylamine N-oxide (TMAO) electron-acceptor binding in bacteria. TorT-like proteins, a periplasmic binding protein family that activates induction of the Tor respiratory system upon trimethylamine N-oxide (TMAO) electron-acceptor binding in bacteria. The Tor respiratory system is consists of three proteins (TorC, TorA, and TorD) and is induced in the presence of TMAO. The TMAO control is tightly regulated by three proteins: TorS, TorT, and TorR. Thus, the disruption of any of these proteins can abolish the Tor respiratory induction. TorT shares homology with the sugar-binding domain of the type I periplasmic binding proteins. The members of TorT-like family bind TMAO or related compounds and are predicted to be involved in signal transduction and/or substrate transport. Probab=67.22 E-value=10 Score=18.57 Aligned_cols=109 Identities=14% Similarity=0.059 Sum_probs=63.1 Q ss_pred HHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH--HCC Q ss_conf 998517929999997306887314677757999889755956975345895378886543565329999999994--211 Q gi|254780936|r 35 KAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ--SEG 112 (182) Q Consensus 35 ~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~--a~~ 112 (182) +.....+.........+... ......+...|.+.|...++++.... ..++|...+.+.++. ..+ T Consensus 117 ~~~~~~g~~~~v~i~~g~~~-~~~~~~R~~Gf~~~l~~~~~~iv~~~-------------~~~~~~~~a~~~~~~~L~~~ 182 (268) T cd06306 117 QRHPKGSKPAKVAWFPGPKG-AGWVKAVEKGFRDALAGSAIEISAIK-------------YGDTGKEVQRKLVEEALEAH 182 (268) T ss_pred HHHHCCCCCEEEEEEECCCC-CHHHHHHHHHHHHHHHCCCCEEEEEE-------------CCCCCHHHHHHHHHHHHHHC T ss_conf 85150399539999967977-62999999889998832895488873-------------17874999999999998629 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCC--EEEEEEEECCCCCCCHHHHHHHCCC Q ss_conf 5889998386658999999998498--8999983157765470899985211 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVK--KVTIVSTVLSDPSMASDQLRRQADY 162 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk--~V~v~~~~~~~~~~~S~~L~~~ad~ 162 (182) -|.-+++.+|.....++..+++.|+ +|.++++.. +...|....+. T Consensus 183 pdi~~i~~~~~~a~~a~~a~~~~g~~~~v~vvg~d~-----~~~~~~~i~~G 229 (268) T cd06306 183 PDIDYIVGSAVAAEAAVGILRQRGLTDQIKIVSTYL-----SHAVYRGLKRG 229 (268) T ss_pred CCCCEEEECCHHHHHHHHHHHHCCCCCCEEEEEECC-----CHHHHHHHHCC T ss_conf 998899957467999999998607799759999369-----98999999859 No 78 >cd05006 SIS_GmhA Phosphoheptose isomerase is a member of the SIS (Sugar ISomerase) superfamily. Phosphoheptose isomerase catalyzes the isomerization of sedoheptulose 7-phosphate into D-glycero-D-mannoheptose 7-phosphate. This is the first step of the biosynthesis of gram-negative bacteria inner core lipopolysaccharide precursor, L-glycero-D-mannoheptose (Gmh). Probab=65.72 E-value=11 Score=18.39 Aligned_cols=52 Identities=15% Similarity=0.089 Sum_probs=39.4 Q ss_pred HHCCCCEEEEECCCH---HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEH Q ss_conf 421158899983866---5899999999849889999831577654708999852111518 Q gi|254780936|r 109 QSEGLEHLVIFSGDG---CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDL 166 (182) Q Consensus 109 ~a~~~d~~iLvSGD~---Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l 166 (182) +++.=|.+|++|+-+ ..+.+++.+|++|.+|+.++-.. ...|.+.||..+.+ T Consensus 98 ~~~~gDili~iS~sG~s~nii~a~~~ak~~g~~~i~ltg~~------gg~l~~~~D~~i~v 152 (177) T cd05006 98 LGQPGDVLIGISTSGNSPNVLKALEAAKERGMKTIALTGRD------GGKLLELADIEIHV 152 (177) T ss_pred HCCCCCEEEEECCCCCCHHHHHHHHHHHHCCCEEEEEECCC------CCHHHHHCCEEEEE T ss_conf 44689889998189999899999999998799899998789------86167619989996 No 79 >cd01537 PBP1_Repressors_Sugar_Binding_like Ligand-binding domain of the LacI-GalR family of transcription regulators and the sugar-binding domain of ABC-type transport systems. Ligand-binding domain of the LacI-GalR family of transcription regulators and the sugar-binding domain of ABC-type transport systems, all of which contain the type I periplasmic binding protein-like fold. Their specific ligands include lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars. The LacI family of proteins consists of transcriptional regulators related to the lac repressor; in general the sugar binding domain in this family binds a sugar, which in turn changes the DNA binding activity of the repressor domain. The core structure of the periplasmic binding proteins is classified into two types and they differ in number and order of beta strands in each domain: type I, which has six beta strands, and type II, which has five beta strands. These two distinct structural arrangem Probab=65.70 E-value=11 Score=18.39 Aligned_cols=100 Identities=10% Similarity=-0.024 Sum_probs=56.4 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH-- Q ss_conf 999985179299999973068873146777579998897559569753458953788865435653299999999942-- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS-- 110 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a-- 110 (182) +.+++.+.+.- ...+...+........+...|.++++..|..... ... ..+.+..-+.++++.. T Consensus 109 ~~~~l~~~g~~--~i~~i~~~~~~~~~~~R~~g~~~~~~~~~~~~~~--~~~----------~~~~~~~~~~~~~~~~l~ 174 (264) T cd01537 109 AGEHLAEKGHR--RIALLAGPLGSSTARERVAGFKDALKEAGPIEIV--LVQ----------EGDWDAEKGYQAAEELLT 174 (264) T ss_pred HHHHHHHCCCC--CEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCE--EEE----------ECCCCHHHHHHHHHHHHH T ss_conf 99999862986--2453048877706999999999999977999853--799----------579998999999999985 Q ss_pred CC-CCEEEEECCCHHHHHHHHHHHHCCC----EEEEEEEEC Q ss_conf 11-5889998386658999999998498----899998315 Q gi|254780936|r 111 EG-LEHLVIFSGDGCFTTLVAALQRKVK----KVTIVSTVL 146 (182) Q Consensus 111 ~~-~d~~iLvSGD~Df~pli~~lr~~Gk----~V~v~~~~~ 146 (182) .+ -=++|++++|.--..++..++++|. .|.|+++.. T Consensus 175 ~~~~~dai~~~~d~~A~gv~~al~~~g~~ip~di~ivg~d~ 215 (264) T cd01537 175 AHPDPTAIFAANDDMALGALRALREAGLRVPDDISVIGFDG 215 (264) T ss_pred HCCCCEEEEECCHHHHHHHHHHHHHCCCCCCCCEEEEEECC T ss_conf 08997699989869999999999974999999839999997 No 80 >pfam09338 Gly_reductase Glycine/sarcosine/betaine reductase component B subunits. This is a family of glycine reductase, sarcosine reductase and betaine reductases. These enzymes catalyse the following reactions. sarcosine reductase: Acetyl phosphate + methylamine + thioredoxin disulphide = N-methylglycine + phosphate + thioredoxin Acetyl phosphate + NH(3) + thioredoxin disulphide = glycine + phosphate + thioredoxin. betaine reductase: Acetyl phosphate + trimethylamine + thioredoxin disulphide = N,N,N-trimethylglycine + phosphate + thioredoxin. Probab=65.04 E-value=7.4 Score=19.53 Aligned_cols=89 Identities=16% Similarity=0.097 Sum_probs=56.4 Q ss_pred HHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHH-HHCC--CCEEEEE-----CCCHHHHHHHHHHHHCCC Q ss_conf 9988975595697534589537888654356532999999999-4211--5889998-----386658999999998498 Q gi|254780936|r 66 LLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFE-QSEG--LEHLVIF-----SGDGCFTTLVAALQRKVK 137 (182) Q Consensus 66 ~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~-~a~~--~d~~iLv-----SGD~Df~pli~~lr~~Gk 137 (182) ..+-.+++|.++...-+.-...+. .-.|-.-+.++.. ++.. .|.+|+- .-|.|++-.+.++-+.|. T Consensus 260 I~~Ly~~HGKdinF~GVI~tnenv------~l~~KeR~s~~~akLa~~LgaDgvIvseEG~GNPD~Dl~m~~~k~E~~GI 333 (428) T pfam09338 260 IEDLYRRHGKDINFLGVIITNENV------TLADKERSSSYVAKLAEMLGADGVIVSEEGFGNPDADLMMNCRKLEEKGI 333 (428) T ss_pred HHHHHHHHCCCEEEEEEEECCCCC------CHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHHHCCC T ss_conf 999999869902578999348844------05666778999999999859978999435588942789999999998699 Q ss_pred EEEEEEEECCCCCCCHHHHHHHC Q ss_conf 89999831577654708999852 Q gi|254780936|r 138 KVTIVSTVLSDPSMASDQLRRQA 160 (182) Q Consensus 138 ~V~v~~~~~~~~~~~S~~L~~~a 160 (182) ++.+++...+.+..+|+.|..++ T Consensus 334 ktV~itdE~~GrDG~sq~Lad~~ 356 (428) T pfam09338 334 KTVLITDEYAGRDGASQSLADAD 356 (428) T ss_pred CEEEEEEEECCCCCCCCCCCCCC T ss_conf 49997105006789877230268 No 81 >TIGR01703 hybrid_clust hydroxylamine reductase; InterPro: IPR010048 Hybrid cluster proteins (HCP, or Prismane) have been identified in bacteria, archaea and eukaryotic protozoa. No specific function has yet been assigned to these proteins, but it may involve oxidoreductase enzymatic activity. These proteins contain one 4Fe-4S cluster, and one hybrid 4Fe-2O-2S cluster, the latter being similar to the Ni-Fe-S cluster found in carbon monoxide dehydrogenase enzymes (IPR010047 from INTERPRO) , . This subfamily is heterogeneous with respect to the presence or absence of a region of about 100 amino acids not far from the N terminus of the protein. Members have been described as monomeric. ; GO: 0016661 oxidoreductase activity acting on other nitrogenous compounds as donors, 0051536 iron-sulfur cluster binding, 0006118 electron transport, 0005737 cytoplasm. Probab=64.29 E-value=6.9 Score=19.72 Aligned_cols=41 Identities=12% Similarity=0.146 Sum_probs=29.6 Q ss_pred ECCCHHHHHHHHHHHHCC-------CEEEEEEEECCCCCCCHHHHHHH--CCCCEEH Q ss_conf 838665899999999849-------88999983157765470899985--2111518 Q gi|254780936|r 119 FSGDGCFTTLVAALQRKV-------KKVTIVSTVLSDPSMASDQLRRQ--ADYFMDL 166 (182) Q Consensus 119 vSGD~Df~pli~~lr~~G-------k~V~v~~~~~~~~~~~S~~L~~~--ad~fi~l 166 (182) |.|..||.|+|+++++.- +.-+++||. -..+-+. ||+.+++ T Consensus 357 vnGqKDF~pvI~~Al~~pg~~~~~~~~~it~GF~-------h~~~L~~ktADK~vel 406 (567) T TIGR01703 357 VNGQKDFSPVIEKALELPGFPKELEEGTITTGFG-------HHTILALKTADKIVEL 406 (567) T ss_pred CCCCCCHHHHHHHHHCCCCCCHHHHCCEEEEECH-------HHHHHHHHHCCCCHHH T ss_conf 7777782899999853884870331775687030-------8889864521400105 No 82 >cd01539 PBP1_GGBP Periplasmic glucose/galactose-binding protein (GGBP) involved in chemotaxis towards, and active transport of, glucose and galactose in various bacterial species. Periplasmic glucose/galactose-binding protein (GGBP) involved in chemotaxis towards, and active transport of, glucose and galactose in various bacterial species. GGBP is a member of the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. Moreover, the periplasmic GGBP is homologous to the ligand-binding domain of eukaryotic receptors such as glutamate receptor (GluR) and DNA-binding transcriptional repressors such as LacI and GalR. Probab=64.08 E-value=12 Score=18.20 Aligned_cols=93 Identities=15% Similarity=0.087 Sum_probs=60.2 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH--CCCC--EEEEECCCHHHHHHHHH Q ss_conf 3146777579998897559569753458953788865435653299999999942--1158--89998386658999999 Q gi|254780936|r 56 PEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS--EGLE--HLVIFSGDGCFTTLVAA 131 (182) Q Consensus 56 ~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a--~~~d--~~iLvSGD~Df~pli~~ 131 (182) ......+...|.+.|...|+.+..... .-.++|..-+...++.. .+-| ..|+..+|.--.-+++. T Consensus 151 ~~~~~~R~~Gf~~~l~~~~~~~~iv~~-----------~~~~w~~~~a~~~~e~~L~~~~d~id~I~a~ND~mAlGA~~A 219 (303) T cd01539 151 HPDAIARTKYSIETLNDAGIKTEELAS-----------DTANWDRAQAKDKMDALLLKYGDKIEAVIANNDAMALGAIEA 219 (303) T ss_pred CHHHHHHHHHHHHHHHHCCCCEEEEEE-----------EECCCCHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHH T ss_conf 978999998699999854980157778-----------716878899999999999858997359997894779999999 Q ss_pred HHHCCC-------EEEEEEEECCCCCCCHHHHHHHCCCCE Q ss_conf 998498-------899998315776547089998521115 Q gi|254780936|r 132 LQRKVK-------KVTIVSTVLSDPSMASDQLRRQADYFM 164 (182) Q Consensus 132 lr~~Gk-------~V~v~~~~~~~~~~~S~~L~~~ad~fi 164 (182) +++.|. .|.|+++... ...|....+.-+ T Consensus 220 l~~aG~~~~~~~~~I~VvG~Dg~-----~~~l~aI~~G~~ 254 (303) T cd01539 220 LQKYGYNKGDKSKNIPVVGVDAL-----PEALELIKKGSM 254 (303) T ss_pred HHHCCCCCCCCCCCCEEEEECCC-----HHHHHHHHCCCC T ss_conf 99759876667888679986799-----999999983998 No 83 >cd01471 vWA_micronemal_protein Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners. Probab=64.01 E-value=12 Score=18.19 Aligned_cols=63 Identities=11% Similarity=0.078 Sum_probs=40.8 Q ss_pred CCCHHHHHHHHHHHH--------CC-CCEEEEEC-----CCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCC Q ss_conf 653299999999942--------11-58899983-----86658999999998498899998315776547089998521 Q gi|254780936|r 96 SSMDVELAVDAFEQS--------EG-LEHLVIFS-----GDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQAD 161 (182) Q Consensus 96 k~~Dv~laiD~~~~a--------~~-~d~~iLvS-----GD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad 161 (182) +++.+..|++.+... ++ -..+||+| .|.+-.+.+..||+.|..+.++++..+. -..+|+..|+ T Consensus 83 g~T~Tg~AL~~a~~~~f~~~g~R~~vpkv~illTDG~s~d~~~~~~~a~~Lr~~GV~ifavGVG~~v---~~~eL~~Iag 159 (186) T cd01471 83 GSTNTTSALLVVEKHLFDTRGNRENAPQLVIIMTDGIPDSKFRTLKEARKLRERGVIIAVLGVGQGV---NHEENRSLVG 159 (186) T ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHCCCEEEEEECCCCC---CHHHHHHHCC T ss_conf 9677999999999972114688999985999990698778525899999999889999999834324---9999999709 No 84 >cd06295 PBP1_CelR Ligand binding domain of a transcription regulator of cellulose genes, CelR, which is highly homologous to the LacI-GalR family of bacterial transcription regulators. This group includes the ligand binding domain of a transcription regulator of cellulose genes, CelR, which is highly homologous to the LacI-GalR family of bacterial transcription regulators. The binding of CelR to the celE promoter is inhibited specifically by cellobiose. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn chang Probab=63.52 E-value=12 Score=18.13 Aligned_cols=100 Identities=14% Similarity=0.070 Sum_probs=60.4 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHH---- Q ss_conf 9999851792999999730688731467775799988975595697534589537888654356532999999999---- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFE---- 108 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~---- 108 (182) ..+.+.+.+.-..+ |.+.+........+...|.+++...|..+...... ..+.+..-+..++. T Consensus 116 ~~~~L~~~G~~~I~--~i~~~~~~~~~~~R~~Gf~~a~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~l~ 182 (275) T cd06295 116 ATEHLLARGRRRIA--FLGGPQDMPEGEERLEGYREALAEAGLPLDPRLVA-----------PGDFTEESGRAAMRALLE 182 (275) T ss_pred HHHHHHHHCCCEEE--EECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCEEE-----------ECCCCHHHHHHHHHHHHH T ss_conf 99999980998798--70588667269999999999999869999941799-----------657766879999888985 Q ss_pred HHCCCCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf 421158899983866589999999984988----99998315 Q gi|254780936|r 109 QSEGLEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 109 ~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) ..+.+ ++|+++.|.--..++..++++|.+ +-|+++.. T Consensus 183 ~~~~~-~ai~~~nD~~A~g~~~~~~~~g~~iP~disIigfd~ 223 (275) T cd06295 183 RGPDF-DAVFAASDLMALGALRALREAGRRVPEDVAVVGFDD 223 (275) T ss_pred CCCCC-CEEEECCCHHHHHHHHHHHHCCCCCCCCEEEEEECC T ss_conf 49998-703414758789999999974999898569999678 No 85 >COG0794 GutQ Predicted sugar phosphate isomerase involved in capsule formation [Cell envelope biogenesis, outer membrane] Probab=62.47 E-value=13 Score=18.01 Aligned_cols=85 Identities=16% Similarity=0.125 Sum_probs=53.3 Q ss_pred HHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCH---HHHHHHHHHHHCCC Q ss_conf 775799988975595697534589537888654356532999999999421158899983866---58999999998498 Q gi|254780936|r 61 SPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDG---CFTTLVAALQRKVK 137 (182) Q Consensus 61 ~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~---Df~pli~~lr~~Gk 137 (182) .--+.|.-.|...|+......--+-..+ |.. ....=|.+|++||-| ...-++..+|+.|. T Consensus 52 ~Igkk~Aa~L~s~G~~a~fv~p~ea~hg----------dlg-------~i~~~DvviaiS~SGeT~el~~~~~~aK~~g~ 114 (202) T COG0794 52 LIGKKFAARLASTGTPAFFVGPAEALHG----------DLG-------MITPGDVVIAISGSGETKELLNLAPKAKRLGA 114 (202) T ss_pred HHHHHHHHHHHCCCCCEEEECCCHHCCC----------CCC-------CCCCCCEEEEEECCCCHHHHHHHHHHHHHCCC T ss_conf 9999999999735995599667301037----------756-------78988899998089717799999999997599 Q ss_pred EEEEEEEECCCCCCCHHHHHHHCCCCEEHHH Q ss_conf 8999983157765470899985211151899 Q gi|254780936|r 138 KVTIVSTVLSDPSMASDQLRRQADYFMDLAY 168 (182) Q Consensus 138 ~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~ 168 (182) +++.++.... | .|-+.+|-++.+.. T Consensus 115 ~liaiT~~~~-----S-sLak~aDvvl~ip~ 139 (202) T COG0794 115 KLIAITSNPD-----S-SLAKAADVVLVIPV 139 (202) T ss_pred CEEEEECCCC-----C-HHHHHCCEEEECCC T ss_conf 4899958999-----8-67874686997367 No 86 >TIGR03573 WbuX N-acetyl sugar amidotransferase. This enzyme has been implicated in the formation of the acetamido moiety (sugar-NC(=NH)CH3) which is found on some exopolysaccharides and is positively charged at neutral pH. The reaction involves ligation of ammonia with a sugar N-acetyl group, displacing water. In E. coli (O145 strain) and Pseudomonas aeruginosa (O12 strain) this gene is known as wbuX and ifnA respectively and likely acts on sialic acid. In Campylobacter jejuni, the gene is known as pseA and acts on pseudaminic acid in the process of flagellin glycosylation. In other Pseudomonas strains and various organisms it is unclear what the identity of the sugar substrate is, and in fact, the phylogenetic tree of this family sports a considerably deep branching suggestive of possible major differences in substrate structure. Nevertheless, the family is characterized by a conserved tetracysteine motif (CxxC.....[GN]xCxxC) possibly indicative of a metal binding site, as well as an Probab=61.79 E-value=13 Score=17.94 Aligned_cols=20 Identities=10% Similarity=0.074 Sum_probs=11.1 Q ss_pred HHHHHHHHHHCCCEEEEEEE Q ss_conf 89999999984988999983 Q gi|254780936|r 125 FTTLVAALQRKVKKVTIVST 144 (182) Q Consensus 125 f~pli~~lr~~Gk~V~v~~~ 144 (182) |.-+++.+.+.|...++.|. T Consensus 149 ~~~~~~iA~k~~IplIi~Ge 168 (343) T TIGR03573 149 FASVYQVALKFNIPLIIWGE 168 (343) T ss_pred HHHHHHHHHHCCCCEEEECC T ss_conf 99999999981999899756 No 87 >PRK13337 putative lipid kinase; Reviewed Probab=60.82 E-value=14 Score=17.83 Aligned_cols=79 Identities=20% Similarity=0.255 Sum_probs=46.3 Q ss_pred EEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH--CCCCEEEEECCCHHH Q ss_conf 973068873146777579998897559569753458953788865435653299999999942--115889998386658 Q gi|254780936|r 48 YYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS--EGLEHLVIFSGDGCF 125 (182) Q Consensus 48 ~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a--~~~d~~iLvSGD~Df 125 (182) ...+..................|++.|+++...+. +..-| +.+..+.+ ..+|.+|.+-|||=. T Consensus 7 ~I~NP~sG~g~~~~~~~~i~~~l~~~g~~~~~~~T------------~~~g~---a~~~a~~~~~~~~d~vv~~GGDGTv 71 (305) T PRK13337 7 IIYNPTSGRELFKKNLPDVLQKLEQAGYETSAHAT------------TGPGD---ATLAARQAAERNFDLVIAAGGDGTL 71 (305) T ss_pred EEECCCCCCCCHHHHHHHHHHHHHHCCCEEEEEEE------------CCHHH---HHHHHHHHHHCCCCEEEEEECCHHH T ss_conf 99997568864677899999999987996999982------------78028---9999999877799989999576289 Q ss_pred HHHHHHHHHCCCEEEE Q ss_conf 9999999984988999 Q gi|254780936|r 126 TTLVAALQRKVKKVTI 141 (182) Q Consensus 126 ~pli~~lr~~Gk~V~v 141 (182) -.++.-+.....+..+ T Consensus 72 ~evvngl~~~~~~~~l 87 (305) T PRK13337 72 NEVVNGLAEKENRPKL 87 (305) T ss_pred HHHHHHHHCCCCCCEE T ss_conf 9999998558998628 No 88 >PRK04160 diphthine synthase; Provisional Probab=60.63 E-value=14 Score=17.81 Aligned_cols=45 Identities=22% Similarity=0.273 Sum_probs=32.2 Q ss_pred HHHHH-HHHHHHHCCCCEEEEECCCHH----HHHHHHHHHHCCCEEEEEE Q ss_conf 29999-999994211588999838665----8999999998498899998 Q gi|254780936|r 99 DVELA-VDAFEQSEGLEHLVIFSGDGC----FTTLVAALQRKVKKVTIVS 143 (182) Q Consensus 99 Dv~la-iD~~~~a~~~d~~iLvSGD~D----f~pli~~lr~~Gk~V~v~~ 143 (182) +++-. -.+++.|..-+++.|+.||-= ...++.+++++|..|.|+. T Consensus 61 ~vE~~~~~il~~A~~k~Va~Lv~GDP~~atTH~~l~~~a~~~gI~v~VIh 110 (259) T PRK04160 61 DVEEESGRILEEAKDKDVALLTAGDPMVATTHVDLRLEARKKGIEVRIIH 110 (259) T ss_pred HHHHHHHHHHHHHHHCCEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEEC T ss_conf 87633999999873188899953885550049999999998799789977 No 89 >COG2117 Predicted subunit of tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain [Translation, ribosomal structure and biogenesis] Probab=60.46 E-value=14 Score=17.79 Aligned_cols=15 Identities=20% Similarity=0.215 Sum_probs=6.5 Q ss_pred HHHHHHCCCCEEHHH Q ss_conf 899985211151899 Q gi|254780936|r 154 DQLRRQADYFMDLAY 168 (182) Q Consensus 154 ~~L~~~ad~fi~l~~ 168 (182) ..++.-+..++.++. T Consensus 139 kti~~Lv~~~f~~e~ 153 (198) T COG2117 139 KTIRRLVSAIFILEE 153 (198) T ss_pred HHHHHHHHHHEEEEC T ss_conf 899999887734101 No 90 >cd01482 vWA_collagen_alphaI-XII-like Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions. Probab=60.11 E-value=14 Score=17.75 Aligned_cols=62 Identities=11% Similarity=0.092 Sum_probs=40.0 Q ss_pred CCCHHHHHHHHHH-HH--------CC-CCEEEEEC---CCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCC Q ss_conf 6532999999999-42--------11-58899983---86658999999998498899998315776547089998521 Q gi|254780936|r 96 SSMDVELAVDAFE-QS--------EG-LEHLVIFS---GDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQAD 161 (182) Q Consensus 96 k~~Dv~laiD~~~-~a--------~~-~d~~iLvS---GD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad 161 (182) +++.+.-|++.+. .. ++ -..+||+| ...|...+.+.||..|..|..+++.. .-..+|+..|. T Consensus 77 g~t~~~~AL~~~~~~~f~~~~g~R~~~~kvlvliTDG~s~d~~~~~a~~lr~~gv~i~~VGVg~----~~~~eL~~IAs 151 (164) T cd01482 77 GNTRTGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKD----ADESELKMIAS 151 (164) T ss_pred CCCCHHHHHHHHHHHHHCHHCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCEEEEEECCC----CCHHHHHHHHC T ss_conf 9972899999999986150028988886079996079884338999999998893899997883----78999999968 No 91 >PRK04759 consensus Probab=59.79 E-value=14 Score=17.72 Aligned_cols=80 Identities=18% Similarity=0.177 Sum_probs=50.1 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHC Q ss_conf 31467775799988975595697534589537888654356532999999999421158899983866589999999984 Q gi|254780936|r 56 PEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRK 135 (182) Q Consensus 56 ~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~ 135 (182) ..+.........++|...|+++..-+.. .........+.-. +..+...++|.+|.+-|||=+..+++.+... T Consensus 16 ~~~~~~~~~~l~~~L~~~g~~v~vd~~~--~~~l~~~~~~~~~------~~~~l~~~~Dlvi~lGGDGTlL~aar~~~~~ 87 (294) T PRK04759 16 DQQAIQTHKELYHWLTSLGYTVFIDDRL--AAILTDVPQEHFA------SLVELGKKADLAIVVGGDGNMLGAARVLSRF 87 (294) T ss_pred CHHHHHHHHHHHHHHHHCCCEEEECHHH--HHHCCCCCCCCCC------CHHHCCCCCCEEEEECCCHHHHHHHHHHCCC T ss_conf 9799999999999998689999991688--6553347722237------7556376656899984785899999986016 Q ss_pred CCEEEEEE Q ss_conf 98899998 Q gi|254780936|r 136 VKKVTIVS 143 (182) Q Consensus 136 Gk~V~v~~ 143 (182) +..|..+- T Consensus 88 ~~PilgiN 95 (294) T PRK04759 88 DISVIGVN 95 (294) T ss_pred CCCEEEEE T ss_conf 99689884 No 92 >cd06272 PBP1_hexuronate_repressor_like Ligand-binding domain of DNA transcription repressor for the hexuronate utilization operon from Bacillus species and its close homologs from other bacteria, all of which are a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor for the hexuronate utilization operon from Bacillus species and its close homologs from other bacteria, all of which are a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor Probab=59.63 E-value=14 Score=17.70 Aligned_cols=102 Identities=14% Similarity=0.088 Sum_probs=62.0 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH-H- Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999999994-2- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ-S- 110 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~-a- 110 (182) ..+.+.+.|.=..++. +.+........+...|.+++...|..+....+.. .+.+..-+.+++.. . T Consensus 102 a~~~L~~~G~~~i~~i--~~~~~~~~~~~R~~gf~~al~~~g~~~~~~~i~~-----------~~~~~~~~~~~~~~ll~ 168 (261) T cd06272 102 AVLYLAEKGHKKIAYI--GDLSLDRRQRKRFKGFLETCDENGISISDSHIDV-----------DGLSAEGGDNAAKKLLK 168 (261) T ss_pred HHHHHHHCCCCCEEEE--CCCCCCHHHHHHHHHHHHHHHHCCCCCCCEEEEE-----------CCCHHHHHHHHHHHHHH T ss_conf 9999998699828996--6988873599999999999998599855138983-----------03103679999999985 Q ss_pred -CCCCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEECC Q ss_conf -1158899983866589999999984988----999983157 Q gi|254780936|r 111 -EGLEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVLS 147 (182) Q Consensus 111 -~~~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~~ 147 (182) ..-=++|+++.|.=-..+++.++++|.+ +-|+++... T Consensus 169 ~~~~p~Ai~~~nD~~A~g~~~~l~~~g~~vP~dvsviGfDd~ 210 (261) T cd06272 169 ESDLPTAIICGSYDIALGVLSALNKQGISIPEDIEIISYDNI 210 (261) T ss_pred CCCCCCCEEECCHHHHHHHHHHHHHHCCCCCCCEEEEEECCC T ss_conf 278986101276899999999999809999995599998881 No 93 >cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands. Probab=59.18 E-value=15 Score=17.65 Aligned_cols=62 Identities=16% Similarity=0.148 Sum_probs=39.7 Q ss_pred CCCHHHHHHHHH-HHH-------C----CCCEEEEECCCH----HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHH Q ss_conf 653299999999-942-------1----158899983866----589999999984988999983157765470899985 Q gi|254780936|r 96 SSMDVELAVDAF-EQS-------E----GLEHLVIFSGDG----CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQ 159 (182) Q Consensus 96 k~~Dv~laiD~~-~~a-------~----~~d~~iLvSGD~----Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ 159 (182) +++-...|+.-+ +.. + ++-.++++=-|| |..++++.||+.|..|..+++... --.+|... T Consensus 79 ggT~Tg~AL~~~~~~~f~~~~G~Rp~~~~vpkvlIviTDG~s~D~v~~~A~~lr~~GV~ifaVGVg~~----~~~eL~~I 154 (224) T cd01475 79 TGTMTGLAIQYAMNNAFSEAEGARPGSERVPRVGIVVTDGRPQDDVSEVAAKARALGIEMFAVGVGRA----DEEELREI 154 (224) T ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHCCCEEEEEECCCC----CHHHHHHH T ss_conf 84469999999999727702399875568985999971798766389999999987988999963747----98999998 Q ss_pred CC Q ss_conf 21 Q gi|254780936|r 160 AD 161 (182) Q Consensus 160 ad 161 (182) |. T Consensus 155 As 156 (224) T cd01475 155 AS 156 (224) T ss_pred HC T ss_conf 55 No 94 >TIGR00642 mmCoA_mut_beta methylmalonyl-CoA mutase, small subunit; InterPro: IPR004608 Methylmalonyl-CoA mutase (5.4.99.2 from EC) catalyses the isomerization of succinyl-CoA to methylmalonyl-CoA during the synthesis of propionate from tricarboxylic acid-cycle intermediates in propionic acid fermentation. The adenosylcobalamin-binding, catalytic chain of methylmalonyl-CoA mutase may form homodimers, as in the mitochondrion and in Escherichia coli, or heterodimers with a shorter, homologous chain that does not bind adenosylcobalamin. This family is this non-catalytic beta chain, as found in the enzyme from Propionibacterium freudenreichii, for which the 3-dimensional structure has been solved .; GO: 0004494 methylmalonyl-CoA mutase activity, 0031419 cobalamin binding, 0019652 lactate fermentation to propionate and acetate. Probab=59.09 E-value=15 Score=17.65 Aligned_cols=50 Identities=16% Similarity=0.181 Sum_probs=41.7 Q ss_pred CCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHH---H-HHHHHHHCCCEEEEEE Q ss_conf 356532999999999421158899983866589---9-9999998498899998 Q gi|254780936|r 94 VKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFT---T-LVAALQRKVKKVTIVS 143 (182) Q Consensus 94 ~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~---p-li~~lr~~Gk~V~v~~ 143 (182) .++.++++..+|+...+..-++++|+|.|.-|. + ++..++..|..-.+++ T Consensus 548 ~~G~~~ae~v~~A~~~agsa~vavlCssd~~Ya~~~~~~a~a~~aAG~~~~yla 601 (642) T TIGR00642 548 DEGTKSAEEVVDAVKKAGSADVAVLCSSDDEYAEEAADVAKAAKAAGAAELYLA 601 (642) T ss_pred CCCCCCHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHHHHHHHCCCCEEEEC T ss_conf 688600325899998458984898758758888756899999975684056422 No 95 >cd06282 PBP1_GntR_like_2 Ligand-binding domain of putative DNA transcription repressors highly similar to that of the repressor specific for gluconate (GntR) which is a member of the LacI-GalR family of bacterial transcription regulators. This group includes the ligand-binding domain of putative DNA transcription repressors highly similar to that of the repressor specific for gluconate (GntR) which is a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding Probab=59.06 E-value=15 Score=17.64 Aligned_cols=104 Identities=11% Similarity=0.008 Sum_probs=59.5 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999999994211 Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEG 112 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~ 112 (182) +.+.+.+.+.-..++... ..........+...|.++++..|....... . .+.. ...+.....+++...+. T Consensus 107 ~~~~L~~~G~~~i~~i~~-~~~~~~~~~~R~~Gf~~a~~~~~~~~~~~~--~-~~~~------~~~~~~~~~~~l~~~~~ 176 (266) T cd06282 107 VAQALAALGHRRIAMLAG-RLAASDRARQRYAGYRAAMRAAGLAPLPPV--E-IPFN------TAALPSALLALLTAHPA 176 (266) T ss_pred HHHHHHHCCCCEEEEEEC-CCCCCHHHHHHHHHHHHHHHHCCCCCCCEE--E-ECCC------HHHHHHHHHHHHHCCCC T ss_conf 999999739956999977-788886899999999999998699976179--9-3487------26899999999837999 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEECC Q ss_conf 58899983866589999999984988----999983157 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVLS 147 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~~ 147 (182) . ++|+++.|.--.-++..+++.|.+ |-|+++... T Consensus 177 ~-~ai~~~nD~~A~g~l~al~~~g~~vP~disIigfd~~ 214 (266) T cd06282 177 P-TAIFCSNDLLALAVIRALRRLGLRVPDDLSVVGFDGI 214 (266) T ss_pred C-CEEEECCCHHHHHHHHHHHHCCCCCCCCEEEEEECCH T ss_conf 8-4798538889999999999849999998499998970 No 96 >cd06310 PBP1_ABC_sugar_binding_like_2 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=58.89 E-value=15 Score=17.62 Aligned_cols=79 Identities=20% Similarity=0.168 Sum_probs=48.2 Q ss_pred CCCHHHHHHHHHHHHHHHHC-CCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH----HCCCCEEEEECCCHHHHHH Q ss_conf 87314677757999889755-956975345895378886543565329999999994----2115889998386658999 Q gi|254780936|r 54 GDPEQQFSPLHPLLDWLHYN-GFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ----SEGLEHLVIFSGDGCFTTL 128 (182) Q Consensus 54 ~~~~~~~~~~~~~~~~l~~~-g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~----a~~~d~~iLvSGD~Df~pl 128 (182) ........+...|.++|... ++++... ...+.|..-+.+.++. -+.++ +|+.++|.--..+ T Consensus 133 ~~~~~~~~R~~Gf~~~l~~~~~~~i~~~-------------~~~~~~~~~~~~~~~~~L~~~pdi~-ai~~~nd~~a~ga 198 (273) T cd06310 133 PGSSTTDQREEGFLEGLKEYPGIEIVAT-------------QYSDSDYAKALDITEDLLTANPDLK-GIFGANEGSAVGA 198 (273) T ss_pred CCCHHHHHHHHHHHHHHHHCCCCCCCCE-------------ECCCCHHHHHHHHHHHHHHHCCCCC-EEEECCCHHHHHH T ss_conf 9981699999999999987689753320-------------0265248999999999997589875-7984384899999 Q ss_pred HHHHHHCCCE--EEEEEEEC Q ss_conf 9999984988--99998315 Q gi|254780936|r 129 VAALQRKVKK--VTIVSTVL 146 (182) Q Consensus 129 i~~lr~~Gk~--V~v~~~~~ 146 (182) ++.+++.|++ +.++++.. T Consensus 199 ~~al~~~g~~~~i~vvg~D~ 218 (273) T cd06310 199 ARAVRQAGKAGKVKVVGFDA 218 (273) T ss_pred HHHHHHCCCCCCEEEEEECC T ss_conf 99999779999939999898 No 97 >PRK10703 DNA-binding transcriptional repressor PurR; Provisional Probab=58.82 E-value=15 Score=17.62 Aligned_cols=100 Identities=11% Similarity=0.012 Sum_probs=55.9 Q ss_pred HHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH-H-- Q ss_conf 9998517929999997306887314677757999889755956975345895378886543565329999999994-2-- Q gi|254780936|r 34 LKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ-S-- 110 (182) Q Consensus 34 ~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~-a-- 110 (182) .+.|.+.|.=..++.. .+........+...|.++++..|..+....+. +.+.+..-..+++.. . T Consensus 170 ~~~L~~~Ghr~Ia~i~--~~~~~~~~~~R~~gf~~al~~~gl~~~~~~i~-----------~~~~~~~~g~~~~~~ll~~ 236 (335) T PRK10703 170 GRYLIERGHRDIGVIP--GPLERNTGAGRLAGFMKAMEEANIKVPEEWIV-----------QGDFEPESGYEAMQQILSQ 236 (335) T ss_pred HHHHHHCCCCEEEEEE--CCCCCCHHHHHHHHHHHHHHHCCCCCCHHEEE-----------ECCCCHHHHHHHHHHHHHC T ss_conf 9999975998699995--88765217899999999999859998813289-----------6578858899999999956 Q ss_pred CCCCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf 1158899983866589999999984988----99998315 Q gi|254780936|r 111 EGLEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 111 ~~~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) ..--++|++++|.=-..+++.++++|.+ +-|++|.. T Consensus 237 ~~~ptAi~~~nD~~A~g~~~~l~~~g~~VP~DisvigfDd 276 (335) T PRK10703 237 EHRPTAVFCGGDIMAMGAICAADEMGLRVPQDISVIGYDN 276 (335) T ss_pred CCCCCEEEECCHHHHHHHHHHHHHCCCCCCCCEEEEEECC T ss_conf 9998668968759999999999971999999749999888 No 98 >PRK01911 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=57.93 E-value=15 Score=17.52 Aligned_cols=82 Identities=13% Similarity=0.223 Sum_probs=48.4 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEEEEE-EECCCCCCCCCCCCCHHHHHHHHHH-HHCCCCEEEEECCCHHHHHHHHHHH Q ss_conf 3146777579998897559569753458-9537888654356532999999999-4211588999838665899999999 Q gi|254780936|r 56 PEQQFSPLHPLLDWLHYNGFQVVAKVAK-EFTENCGRKRVKSSMDVELAVDAFE-QSEGLEHLVIFSGDGCFTTLVAALQ 133 (182) Q Consensus 56 ~~~~~~~~~~~~~~l~~~g~~v~~~~~~-~~~~~~~~~~~kk~~Dv~laiD~~~-~a~~~d~~iLvSGD~Df~pli~~lr 133 (182) ..+.....+...++|.+.|+++..-.-. ...... ...+. +. .+..+ ....+|.+|.+-|||=|..+++.+. T Consensus 11 ~~~~~~~~~~Li~~L~~~g~~v~le~~~a~~l~~~--~~~~~--~~---~~~~~~~~~~~Dlvi~iGGDGT~L~a~~~~~ 83 (290) T PRK01911 11 QASKSPHIKRLFELLEEHGAEIYIEEEFLNFLTQD--LKFEP--KY---KGFFDGNNFDFDMVISIGGDGTFLRAAARVG 83 (290) T ss_pred CHHHHHHHHHHHHHHHHCCCEEEEEHHHHHHHHHC--CCCCC--CC---CCCCCCCCCCCCEEEEECCCHHHHHHHHHHH T ss_conf 84799999999999998899899976897877651--25664--31---2321135777779999787689999999861 Q ss_pred HCCCEEEEEEE Q ss_conf 84988999983 Q gi|254780936|r 134 RKVKKVTIVST 144 (182) Q Consensus 134 ~~Gk~V~v~~~ 144 (182) ..++.|..+-. T Consensus 84 ~~~iPilGiN~ 94 (290) T PRK01911 84 NSGIPILGINT 94 (290) T ss_pred HCCCCEEEEEC T ss_conf 25996899944 No 99 >TIGR01350 lipoamide_DH dihydrolipoamide dehydrogenase; InterPro: IPR006258 These sequences represent dihydrolipoamide dehydrogenase, a flavoprotein that acts in a number of ways. It is the E3 component of dehydrogenase complexes for pyruvate, 2-oxoglutarate, 2-oxoisovalerate, and acetoin. It can also serve as the L protein of the glycine cleavage system. This family includes a few members known to have distinct functions (ferric leghemoglobin reductase and NADH:ferredoxin oxidoreductase) but that may be predicted by homology to act as dihydrolipoamide dehydrogenase as well. The motif GGXCXXXGCXP near the N-terminus contains a redox-active disulphide. ; GO: 0004148 dihydrolipoyl dehydrogenase activity, 0050660 FAD binding, 0006118 electron transport. Probab=57.90 E-value=15 Score=17.61 Aligned_cols=29 Identities=31% Similarity=0.373 Sum_probs=13.7 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCCEEEEE Q ss_conf 588999838665899999999849889999 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVKKVTIV 142 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~ 142 (182) ||++||=+|=+-|+.+|+.+|. |++|.++ T Consensus 2 yD~vViGgGPGGYVAAIrAAQl-G~KValv 30 (481) T TIGR01350 2 YDVVVIGGGPGGYVAAIRAAQL-GLKVALV 30 (481) T ss_pred CCEEEECCCCCHHHHHHHHHHC-CCEEEEE T ss_conf 5189987787727999999864-9808999 No 100 >cd06271 PBP1_AglR_RafR_like Ligand-binding domain of DNA transcription repressors specific for raffinose (RafR) and alpha-glucosides (AglR) which are members of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressors specific for raffinose (RafR) and alpha-glucosides (AglR) which are members of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the represso Probab=57.86 E-value=15 Score=17.52 Aligned_cols=102 Identities=13% Similarity=-0.035 Sum_probs=59.2 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH-- Q ss_conf 999985179299999973068873146777579998897559569753458953788865435653299999999942-- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS-- 110 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a-- 110 (182) +.+.+.+.|.-..++ .+.+........+...|.++++..|..+...... +...+..-+.+++... T Consensus 111 a~~~L~~~G~~~I~~--i~~~~~~~~~~~R~~Gf~~al~~~gl~~~~~~~~-----------~~~~~~~~~~~~~~~ll~ 177 (268) T cd06271 111 AVRRLIALGHRRIAL--LNPPEDLTFAQHRRAGYRRALAEAGLPLDPALIV-----------SGDMTEEGGYAAAAELLA 177 (268) T ss_pred HHHHHHHHCCCEEEE--CCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEE-----------ECCCCHHHHHHHHHHHHH T ss_conf 999999859874875--4887546079999999999999839997711365-----------068757779999999996 Q ss_pred -CCCCEEEEECCCHHHHHHHHHHHHCCCEE----EEEEEECC Q ss_conf -11588999838665899999999849889----99983157 Q gi|254780936|r 111 -EGLEHLVIFSGDGCFTTLVAALQRKVKKV----TIVSTVLS 147 (182) Q Consensus 111 -~~~d~~iLvSGD~Df~pli~~lr~~Gk~V----~v~~~~~~ 147 (182) ..--++|++++|.=-..+++.+++.|.+| -|++|... T Consensus 178 ~~~~p~Ai~~~nD~~A~g~l~~l~~~g~~vP~DisvigfD~~ 219 (268) T cd06271 178 LPDRPTAIVCSSELMALGVLAALAEAGLRPGRDVSVVGFDDS 219 (268) T ss_pred CCCCCCEEEECCHHHHHHHHHHHHHCCCCCCCCEEEEEECCC T ss_conf 599986899777799999999999829998999799997882 No 101 >cd01977 Nitrogenase_VFe_alpha Nitrogenase_VFe_alpha -like: Nitrogenase VFe protein, alpha subunit like. This group contains proteins similar to the alpha subunits of, the VFe protein of the vanadium-dependent (V-) nitrogenase and the FeFe protein of the iron only (Fe-) nitrogenase Nitrogenase catalyzes the ATP-dependent reduction of dinitrogen (N2) to ammonia. In addition to V- and Fe- nitrogenases there is a molybdenum (Mo)-dependent nitrogenase which is the most widespread and best characterized of these systems. These systems consist of component 1 (VFe protein, FeFe protein or, MoFe protein respectively) and, component 2 (Fe protein). MoFe is an alpha2beta2 tetramer, V-and Fe- nitrogenases are alpha2beta2delta2 hexamers. The alpha and beta subunits of VFe and FeFe are similar to the alpha and beta subunits of MoFe. For MoFe each alphabeta pair contains one P-cluster (at the alphabeta interface) and, one molecule of iron molybdenum cofactor (FeMoco) contained within the alpha sub Probab=57.41 E-value=16 Score=17.47 Aligned_cols=35 Identities=11% Similarity=0.200 Sum_probs=28.7 Q ss_pred CEEEEECCCHHHHHHHHHHH-HCCCEEEEEEEECCC Q ss_conf 88999838665899999999-849889999831577 Q gi|254780936|r 114 EHLVIFSGDGCFTTLVAALQ-RKVKKVTIVSTVLSD 148 (182) Q Consensus 114 d~~iLvSGD~Df~pli~~lr-~~Gk~V~v~~~~~~~ 148 (182) -.++|++|+.|.+.++..+. +.|.+|..++...+. T Consensus 289 Kkv~i~~g~~~~~~~~~~~~~ElGmevv~~~~~~~~ 324 (415) T cd01977 289 KKVCIWTGGPKLWHWTKVIEDELGMQVVAMSSKFGH 324 (415) T ss_pred CEEEEECCCHHHHHHHHHHHHHCCCEEEEECCCCCC T ss_conf 989997585589999999998668589996054588 No 102 >cd01391 Periplasmic_Binding_Protein_Type_1 Type 1 periplasmic binding fold superfamily. Type 1 periplasmic binding fold superfamily. This model and hierarchy represent the ligand binding domains of the LacI family of transcriptional regulators, periplasmic binding proteins of the ABC-type transport systems, the family C G-protein couples receptors (GPCRs), membrane bound guanylyl cyclases including the family of natriuretic peptide receptors (NPRs), and the N-terminal leucine/isoleucine/valine- binding protein (LIVBP)-like domains of the ionotropic glutamate receptors (iGluRs). In LacI-like transcriptional regulator and the bacterial periplasmic binding proteins the ligands are monosaccharides including lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars, with a few exceptions. Periplasmic sugar binding proteins are one of the components of ABC transporters and are involved in the active transport of water-soluble ligands. The LacI family of proteins con Probab=57.20 E-value=16 Score=17.45 Aligned_cols=102 Identities=12% Similarity=0.037 Sum_probs=59.5 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999999994211 Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEG 112 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~ 112 (182) +.+++.+.+.-..++.+ .+.. .....+...|.+.++..|.++..... ... ....+...+...+...+. T Consensus 115 ~~~~l~~~g~~~i~~i~--~~~~-~~~~~r~~g~~~~~~~~g~~~~~~~~---~~~------~~~~~~~~~~~~l~~~~~ 182 (269) T cd01391 115 AAEYLAEKGWKRVALIY--GDDG-AYGRERLEGFKAALKKAGIEVVAIEY---GDL------DTEKGFQALLQLLKAAPK 182 (269) T ss_pred HHHHHHHHCCCEEEEEC--CCCC-HHHHHHHHHHHHHHHHCCCCEEEEEE---ECC------CCCHHHHHHHHHHHHCCC T ss_conf 98689871896169973--7985-77999999999999986998679997---134------321489999999861899 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCC---EEEEEEEECC Q ss_conf 5889998386658999999998498---8999983157 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVK---KVTIVSTVLS 147 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk---~V~v~~~~~~ 147 (182) -++|++++|..-..+++.+++.|. .+.++++... T Consensus 183 -~~ai~~~~d~~a~g~~~a~~~~G~~p~d~~iig~d~~ 219 (269) T cd01391 183 -PDAIFACNDEMAAGALKAAREAGLTPGDISIIGFDGS 219 (269) T ss_pred -CEEEEECCHHHHHHHHHHHHHCCCCCCCEEEEEECCC T ss_conf -7399988889999999999986989476599817898 No 103 >PRK11041 DNA-binding transcriptional regulator CytR; Provisional Probab=56.17 E-value=17 Score=17.34 Aligned_cols=101 Identities=14% Similarity=0.030 Sum_probs=58.4 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH--- Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999999994--- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ--- 109 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~--- 109 (182) ..+.|.+.|.-..++. +.+........+...|.+++++.|..+....+. ....+..-...++.. T Consensus 175 a~~~L~~~Ghr~I~~i--~~~~~~~~~~~R~~G~~~al~~~gl~~~~~~i~-----------~~~~~~~~g~~~~~~ll~ 241 (341) T PRK11041 175 AVNYLYELGHKRIACI--AGPEEMPLCHYRLQGYVQALRRCGITVDPQYIA-----------RGDFTFEAGAKAMQQLLE 241 (341) T ss_pred HHHHHHHCCCCEEEEE--ECCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEE-----------ECCCCHHHHHHHHHHHHH T ss_conf 9999997499769999--689877689999999999999869999910489-----------546767999999999995 Q ss_pred HCCCCEEEEECCCHHHHHHHHHHHHCCCEE----EEEEEEC Q ss_conf 211588999838665899999999849889----9998315 Q gi|254780936|r 110 SEGLEHLVIFSGDGCFTTLVAALQRKVKKV----TIVSTVL 146 (182) Q Consensus 110 a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V----~v~~~~~ 146 (182) ...--++|++++|.==+.++..++++|.+| -|++|.. T Consensus 242 ~~~~ptAi~~~nD~~A~g~l~al~~~Gl~VP~DisViGfdd 282 (341) T PRK11041 242 LPQPPTAVFCHSDVMALGALSQAKRQGLKVPQDLSIIGFDN 282 (341) T ss_pred CCCCCCEEEECCHHHHHHHHHHHHHHCCCCCCCEEEEEECC T ss_conf 59998579876779999999999971898999659999888 No 104 >PRK11543 gutQ D-arabinose 5-phosphate isomerase; Provisional Probab=55.87 E-value=17 Score=17.31 Aligned_cols=51 Identities=14% Similarity=0.199 Sum_probs=36.8 Q ss_pred CCEEEEECCCH---HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHHH Q ss_conf 58899983866---5899999999849889999831577654708999852111518997 Q gi|254780936|r 113 LEHLVIFSGDG---CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAYL 169 (182) Q Consensus 113 ~d~~iLvSGD~---Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~l 169 (182) =|+++++|.-| ..+.++..+|+.|.+++.++- +....|.+.||-.+++... T Consensus 90 ~D~~i~~S~SG~t~El~~~~~~~k~~~~~ii~it~------~~~S~Lak~sd~~l~~~~~ 143 (321) T PRK11543 90 RDVMLFISYSGGAKELDLIIPRLEDKSIALLAMTG------KPTSPLGLAAKAVLDISVE 143 (321) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHCCCCEEEEEC------CCCCHHHHHCCEEEECCCC T ss_conf 99899995898817788772788766986899978------9999768826948972665 No 105 >pfam02110 HK Hydroxyethylthiazole kinase family. Probab=55.87 E-value=11 Score=18.57 Aligned_cols=72 Identities=15% Similarity=0.283 Sum_probs=39.9 Q ss_pred HHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHH----HHHHHHH-HHCCCCEEEEECCCHHHHHHHHHHHHCCCEE Q ss_conf 9998897559569753458953788865435653299----9999999-4211588999838665899999999849889 Q gi|254780936|r 65 PLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVE----LAVDAFE-QSEGLEHLVIFSGDGCFTTLVAALQRKVKKV 139 (182) Q Consensus 65 ~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~----laiD~~~-~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V 139 (182) .-.+.|....+.++..-.-+...-.+....-+|+|.. -++++.. ++++++++|++||-.||+ ..|.++ T Consensus 98 ~~~~lL~~~~~tvIrGN~sEI~aL~g~~~~~kGVDs~~~~~~~~~~a~~lA~~~~~vvvvTG~~D~V-------tdg~~~ 170 (246) T pfam02110 98 TILELLNSGGVAAIRGNAGEILALAGEEGLMRGVDSGEGATAAIRAAQRVARKYGTVVVVTGEVDYV-------SDGRRT 170 (246) T ss_pred HHHHHHHHCCCCEEECCHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEECCCEEE-------ECCCEE T ss_conf 9999997289989954699999985776888792689846789999999999819899986898299-------769879 Q ss_pred EEEE Q ss_conf 9998 Q gi|254780936|r 140 TIVS 143 (182) Q Consensus 140 ~v~~ 143 (182) ..+. T Consensus 171 ~~v~ 174 (246) T pfam02110 171 YVIH 174 (246) T ss_pred EEEE T ss_conf 9995 No 106 >cd06298 PBP1_CcpA_like Ligand-binding domain of the catabolite control protein A (CcpA), which functions as the major transcriptional regulator of carbon catabolite repression/regulation. Ligand-binding domain of the catabolite control protein A (CcpA), which functions as the major transcriptional regulator of carbon catabolite repression/regulation (CCR), a process in which enzymes necessary for the metabolism of alternative sugars are inhibited in the presence of glucose. In gram-positive bacteria, CCR is controlled by HPr, a phosphoenolpyruvate:sugar phsophotrasnferase system (PTS) and a transcriptional regulator CcpA. Moreover, CcpA can regulate sporulation and antibiotic resistance as well as play a role in virulence development of certain pathogens such as the group A streptococcus. The ligand binding domain of CcpA is a member of the LacI-GalR family of bacterial transcription regulators. Probab=55.77 E-value=17 Score=17.30 Aligned_cols=103 Identities=13% Similarity=0.027 Sum_probs=59.7 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH-- Q ss_conf 999985179299999973068873146777579998897559569753458953788865435653299999999942-- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS-- 110 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a-- 110 (182) ..+.+.+.+.=..++... ..........+...|.+++...|..+....+. ..+.+..-..+++... T Consensus 107 a~~~L~~~G~~~I~~i~~-~~~~~~~~~~R~~gf~~a~~~~g~~~~~~~i~-----------~~~~~~~~~~~~~~~l~~ 174 (268) T cd06298 107 ATELLIKNGHKKIAFISG-PLEDSINGDERLAGYKEALSEANIEFDESLIF-----------EGDYTYESGYELAEELLE 174 (268) T ss_pred HHHHHHHCCCCEEEEEEC-CCCCCHHHHHHHHHHHHHHHHCCCCCCHHHEE-----------CCCHHHHHHHHHHHHHHH T ss_conf 999999729966999967-87776259999999999999849997601200-----------354037789999999983 Q ss_pred CCCCEEEEECCCHHHHHHHHHHHHCCCEE----EEEEEECC Q ss_conf 11588999838665899999999849889----99983157 Q gi|254780936|r 111 EGLEHLVIFSGDGCFTTLVAALQRKVKKV----TIVSTVLS 147 (182) Q Consensus 111 ~~~d~~iLvSGD~Df~pli~~lr~~Gk~V----~v~~~~~~ 147 (182) ..--++|++++|.-=..++..+++.|.+| -|+++... T Consensus 175 ~~~~~ai~~~nD~~A~g~l~~l~~~g~~vP~disvigfDd~ 215 (268) T cd06298 175 DGKPTAAFVTDDELAIGILNAAQDAGLKVPEDFEIIGFNNT 215 (268) T ss_pred CCCCCEEEECCHHHHHHHHHHHHHCCCCCCCCEEEEEECCH T ss_conf 69997899668799999999999749999987499997881 No 107 >PRK03372 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=55.57 E-value=17 Score=17.28 Aligned_cols=86 Identities=15% Similarity=0.162 Sum_probs=49.3 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHC Q ss_conf 31467775799988975595697534589537888654356532999999999421158899983866589999999984 Q gi|254780936|r 56 PEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRK 135 (182) Q Consensus 56 ~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~ 135 (182) .++.....+....+|...|+++..-........... ..-+......-+.-..+..+|.+|.+-|||=+..+++.+... T Consensus 15 ~~~a~~~a~~l~~~L~~~g~~v~v~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~Dlvi~lGGDGT~L~aar~~~~~ 92 (303) T PRK03372 15 RDEATESARRVVKQLGDAGIGVRVLAAEAADLPDDM--RALGVEIEVVDADPDAADGCELVLVLGGDGTFLRAAELARNA 92 (303) T ss_pred CHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCEEEEECCCHHHHHHHHHHCCC T ss_conf 989999999999999978898999704010024445--555654223453223357855899977878999999984447 Q ss_pred CCEEEEEE Q ss_conf 98899998 Q gi|254780936|r 136 VKKVTIVS 143 (182) Q Consensus 136 Gk~V~v~~ 143 (182) +.-|..+- T Consensus 93 ~iPilGiN 100 (303) T PRK03372 93 DVPVLGVN 100 (303) T ss_pred CCCEEEEE T ss_conf 99889872 No 108 >PRK08533 flagellar accessory protein FlaH; Reviewed Probab=55.34 E-value=8.7 Score=19.07 Aligned_cols=68 Identities=10% Similarity=0.136 Sum_probs=41.7 Q ss_pred HHHHHHHHHHH--CCCCEEE------EECCCHH------HHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEE Q ss_conf 99999999942--1158899------9838665------89999999984988999983157765470899985211151 Q gi|254780936|r 100 VELAVDAFEQS--EGLEHLV------IFSGDGC------FTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMD 165 (182) Q Consensus 100 v~laiD~~~~a--~~~d~~i------LvSGD~D------f~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~ 165 (182) ..+..++++.. .+.|.+| |++.|++ |.--++++.+.||.+.+..-.......+-..++.+||.++. T Consensus 103 ~~~L~~ll~~~~~~~~dvIIIDSlS~l~~~~~~~~~~~~~~~~lk~l~s~gktIilTv~p~~~~e~~l~~lrs~aDv~i~ 182 (230) T PRK08533 103 RDFLDKLMNTRRFYEKDVVIIDSLSSLVSRDASEVQIRDLMAFFKRISSLNKVIILTANPKELPESVLLILRTASTILIR 182 (230) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCHHHHHCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHEEEEEEEE T ss_conf 89999997326643798999905318851677789999999999999858988999956331362454420410489999 Q ss_pred HH Q ss_conf 89 Q gi|254780936|r 166 LA 167 (182) Q Consensus 166 l~ 167 (182) |. T Consensus 183 L~ 184 (230) T PRK08533 183 LE 184 (230) T ss_pred EE T ss_conf 87 No 109 >pfam05991 DUF901 Protein of unknown function (DUF901). This family consists of several hypothetical bacterial proteins as well as some uncharacterized sequences from Arabidopsis thaliana. The function of this family is unknown. Probab=55.27 E-value=17 Score=17.25 Aligned_cols=117 Identities=15% Similarity=0.120 Sum_probs=57.7 Q ss_pred EEEEHHHHHHH---HHHCCCCCCH----HHHHHHHHHC---CEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 99807899999---9850876798----9999998517---929999997306887314677757999889755956975 Q gi|254780936|r 10 LFIDGANLYAS---SKALGFDIDY----RKLLKAFRSR---AIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVA 79 (182) Q Consensus 10 IfID~~Nl~~~---~~~~~~~~d~----~~L~~~l~~~---~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~ 79 (182) ++|||-|+-++ ++.+--.-++ .+|++.+.+. .......+|.+...+..... .....|++++. T Consensus 1 LiVDGYNvI~aw~~l~~l~~~~~le~AR~~Li~~L~~y~~~~~~~v~vVFDa~~~~~~~~~--------~~~~~~i~vvf 72 (166) T pfam05991 1 LLVDGYNVIGAWPELKKLKDRGSLEAARDKLIEILANYQAFKGYRVIVVFDAHYVPGSEEE--------YEKYGGIEVVF 72 (166) T ss_pred CEECCHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCC--------EEECCCEEEEE T ss_conf 9275335617339999653479999999999999998753459679999856648999874--------34509889998 Q ss_pred EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHH Q ss_conf 34589537888654356532999999999421158899983866589999999984988999983157765470899985 Q gi|254780936|r 80 KVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQ 159 (182) Q Consensus 80 ~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ 159 (182) .+ ....+|-.|---+.++.+....++++|+|..---. +...|... .+|++|... T Consensus 73 T~------------~~etAD~~IEr~v~~~~~~~~~v~VvTSD~~~q~~---v~g~GA~~-----------isa~el~~~ 126 (166) T pfam05991 73 TK------------EGETADSYIEKLVAELNRRKKQVTVVTSDRAEQWT---IFGRGALR-----------ISSRELAEE 126 (166) T ss_pred CC------------CCCCHHHHHHHHHHHHHCCCCEEEEECCCHHHHHH---HHCCCCEE-----------ECHHHHHHH T ss_conf 88------------99987899999999961688449999288899989---51089659-----------869999999 Q ss_pred C Q ss_conf 2 Q gi|254780936|r 160 A 160 (182) Q Consensus 160 a 160 (182) - T Consensus 127 v 127 (166) T pfam05991 127 V 127 (166) T ss_pred H T ss_conf 9 No 110 >PRK04539 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=54.73 E-value=17 Score=17.19 Aligned_cols=83 Identities=17% Similarity=0.292 Sum_probs=48.2 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCC Q ss_conf 14677757999889755956975345895378886543565329999999994211588999838665899999999849 Q gi|254780936|r 57 EQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKV 136 (182) Q Consensus 57 ~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~G 136 (182) .+..+..+....+|...|+++..-+... .+.........+.. .++..+....+|.+|.+-|||=+..+++.+.... T Consensus 17 ~~~~~~~~~l~~~L~~~g~~v~ld~~~~-~~~~~~~~~~~~~~---~~~~~~~~~~~Dlii~lGGDGTlL~a~r~~~~~~ 92 (296) T PRK04539 17 PDIQDTAHTLITFLKQHGFTVYLDEVGV-KEGCIYTQDTVGCH---IVNKTELGQYCDLVAVLGGDGTFLSVAREIAPRA 92 (296) T ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCHH-HHCCCCCCCCCCCC---CCCHHHHCCCCCEEEEECCCHHHHHHHHHHHHCC T ss_conf 7999999999999997879999965411-20354643345655---4686781646779999787089999999860059 Q ss_pred CEEEEEE Q ss_conf 8899998 Q gi|254780936|r 137 KKVTIVS 143 (182) Q Consensus 137 k~V~v~~ 143 (182) ..|..+- T Consensus 93 ~PilGiN 99 (296) T PRK04539 93 VPIIGIN 99 (296) T ss_pred CCEEEEE T ss_conf 9789984 No 111 >cd06320 PBP1_allose_binding Periplasmic allose-binding domain of bacterial transport systems that function as a primary receptor of active transport and chemotaxis. Periplasmic allose-binding domain of bacterial transport systems that function as a primary receptor of active transport and chemotaxis. The members of this group are belonging to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. Like other periplasmic receptors of the ABC-type transport systems, the allose-binding protein consists of two alpha/beta domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. Probab=54.24 E-value=18 Score=17.14 Aligned_cols=80 Identities=19% Similarity=0.114 Sum_probs=50.5 Q ss_pred CCCHHHHHHHHHHHHHHHHCC-CEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH--CCCC-EEEEECCCHHHHHHH Q ss_conf 873146777579998897559-569753458953788865435653299999999942--1158-899983866589999 Q gi|254780936|r 54 GDPEQQFSPLHPLLDWLHYNG-FQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS--EGLE-HLVIFSGDGCFTTLV 129 (182) Q Consensus 54 ~~~~~~~~~~~~~~~~l~~~g-~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a--~~~d-~~iLvSGD~Df~pli 129 (182) +.......+...|.++|...+ +++... ...++|..-+.++++.. .+-| ++|++++|.-=.-++ T Consensus 132 ~~~~~~~~R~~Gf~~~l~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~L~~~pd~~ai~~~nD~~A~Ga~ 198 (275) T cd06320 132 AGAFAAEQRTEGFTEAIKKASGIEVVAS-------------QPADWDREKAYDVATTILQRNPDLKAIYCNNDTMALGVV 198 (275) T ss_pred CCCHHHHHHHHHHHHHHHHCCCCCEEEE-------------CCCCCHHHHHHHHHHHHHHHCCCCCEEEECCCHHHHHHH T ss_conf 9971799999999999997799835643-------------155303899999999988628766779966988999999 Q ss_pred HHHHHCCC--EEEEEEEEC Q ss_conf 99998498--899998315 Q gi|254780936|r 130 AALQRKVK--KVTIVSTVL 146 (182) Q Consensus 130 ~~lr~~Gk--~V~v~~~~~ 146 (182) +.+++.|+ +|.++++.. T Consensus 199 ~al~~~g~~~di~vvG~D~ 217 (275) T cd06320 199 EAVKNAGKQGKVLVVGTDG 217 (275) T ss_pred HHHHHCCCCCCEEEEEECC T ss_conf 9999769999829999897 No 112 >COG1134 TagH ABC-type polysaccharide/polyol phosphate transport system, ATPase component [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane] Probab=53.76 E-value=18 Score=17.10 Aligned_cols=74 Identities=22% Similarity=0.286 Sum_probs=43.1 Q ss_pred EEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEE----EEECCCHHHHHH----HHHHHHCCCEEEEEEEECCCC Q ss_conf 753458953788865435653299999999942115889----998386658999----999998498899998315776 Q gi|254780936|r 78 VAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHL----VIFSGDGCFTTL----VAALQRKVKKVTIVSTVLSDP 149 (182) Q Consensus 78 ~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~----iLvSGD~Df~pl----i~~lr~~Gk~V~v~~~~~~~~ 149 (182) +..|++.++. |.=++|+--+..+. +.|+. +|-.||..|..= +..+.+.++.+++++ T Consensus 141 i~~PvktYSS---------GM~aRLaFsia~~~-~pdILllDEvlavGD~~F~~K~~~rl~e~~~~~~tiv~VS------ 204 (249) T COG1134 141 IDQPVKTYSS---------GMYARLAFSVATHV-EPDILLLDEVLAVGDAAFQEKCLERLNELVEKNKTIVLVS------ 204 (249) T ss_pred HHCCHHHCCH---------HHHHHHHHHHHHHC-CCCEEEEEHHHHCCCHHHHHHHHHHHHHHHHCCCEEEEEE------ T ss_conf 6174411258---------89999888753313-7878998613440779999999999999997598799997------ Q ss_pred CCCHHHHHHHCCCCEEHHH Q ss_conf 5470899985211151899 Q gi|254780936|r 150 SMASDQLRRQADYFMDLAY 168 (182) Q Consensus 150 ~~~S~~L~~~ad~fi~l~~ 168 (182) +--...++.||..++|+. T Consensus 205 -Hd~~~I~~~Cd~~i~l~~ 222 (249) T COG1134 205 -HDLGAIKQYCDRAIWLEH 222 (249) T ss_pred -CCHHHHHHHCCEEEEEEC T ss_conf -887999985670599858 No 113 >cd06321 PBP1_ABC_sugar_binding_like_11 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. This group includes the periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consist of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=51.06 E-value=20 Score=16.83 Aligned_cols=74 Identities=11% Similarity=0.027 Sum_probs=43.4 Q ss_pred HHHHHHHHHHHHHHC-CCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHH----HHCCCCEEEEECCCHHHHHHHHHHH Q ss_conf 677757999889755-95697534589537888654356532999999999----4211588999838665899999999 Q gi|254780936|r 59 QFSPLHPLLDWLHYN-GFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFE----QSEGLEHLVIFSGDGCFTTLVAALQ 133 (182) Q Consensus 59 ~~~~~~~~~~~l~~~-g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~----~a~~~d~~iLvSGD~Df~pli~~lr 133 (182) ...+...|.++|... +....... .....+..-+..+++ ..+.+| +|+.++|.-=.-+++.++ T Consensus 135 ~~~R~~G~~~~l~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~ll~~~p~~~-aifa~nD~~A~Ga~~a~~ 201 (271) T cd06321 135 VLDRVAGCKAALAKYPGIKLLSDD------------QNGKGSRDGGLRVMQGLLTRFPKLD-GVFAINDPTAIGADLAAK 201 (271) T ss_pred HHHHHHHHHHHHHHCCCCEEEEEE------------ECCCCCHHHHHHHHHHHHHCCCCCC-EEEECCCHHHHHHHHHHH T ss_conf 999999999999877997688767------------2288789999999999985088677-899778689999999999 Q ss_pred HCCCE-EEEEEEE Q ss_conf 84988-9999831 Q gi|254780936|r 134 RKVKK-VTIVSTV 145 (182) Q Consensus 134 ~~Gk~-V~v~~~~ 145 (182) +.|+. +.++++. T Consensus 202 ~~G~~~i~ivg~d 214 (271) T cd06321 202 QAGRNDIKITSVD 214 (271) T ss_pred HCCCCCCEEEEEE T ss_conf 7597999899996 No 114 >pfam09001 DUF1890 Domain of unknown function (DUF1890). This domain is found in a set of hypothetical archaeal proteins. Probab=50.95 E-value=20 Score=16.82 Aligned_cols=90 Identities=14% Similarity=0.125 Sum_probs=49.3 Q ss_pred HHHHHHHHHCCCEEEEE------EEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCC-EEEEECCCHHHHHHHHHHHHCC Q ss_conf 79998897559569753------4589537888654356532999999999421158-8999838665899999999849 Q gi|254780936|r 64 HPLLDWLHYNGFQVVAK------VAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLE-HLVIFSGDGCFTTLVAALQRKV 136 (182) Q Consensus 64 ~~~~~~l~~~g~~v~~~------~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d-~~iLvSGD~Df~pli~~lr~~G 136 (182) ......|+..|+++... -+.+.+|+.+. ..++-+|+.=++.-..- ..+| ++..+..|+-...++..-.-.. T Consensus 17 lYl~~~Lk~~G~~v~Va~npAA~kLl~vaDpe~~-Ylk~~~dld~~l~~i~e-~~~d~~~~FvHNDagvsYa~T~~~i~~ 94 (138) T pfam09001 17 LYLSNKLKDKGFRVVVAANPAALKLLEVADPEKY-YLKEVVDLDKGIADIAE-GDFDLIFGFVHNDAGVSYAVTMKAVSN 94 (138) T ss_pred HHHHHHHHHCCCCEEEECCHHHHHHHHHCCCCCC-HHCCEEEHHHHHHHHHC-CCCCEEEEEEECCCHHHHHHHHHHHCC T ss_conf 9999998766960699669888736121187640-01043559998745520-688879999735622667898887547 Q ss_pred CEEEEEEEECCCCCCCHHHHHHHC Q ss_conf 889999831577654708999852 Q gi|254780936|r 137 KKVTIVSTVLSDPSMASDQLRRQA 160 (182) Q Consensus 137 k~V~v~~~~~~~~~~~S~~L~~~a 160 (182) ++...+.|.. . +.+|.+.- T Consensus 95 ~~~~aiVFg~----~-~e~la~~i 113 (138) T pfam09001 95 AKTYAIVFGE----H-AEELAETI 113 (138) T ss_pred CCEEEEEECC----C-HHHHHHHH T ss_conf 8658999358----8-79999986 No 115 >PRK11303 DNA-binding transcriptional regulator FruR; Provisional Probab=50.72 E-value=20 Score=16.80 Aligned_cols=99 Identities=12% Similarity=0.081 Sum_probs=55.1 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH-H- Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999999994-2- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ-S- 110 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~-a- 110 (182) +.+.+.+.+.=.. .|.+..++......+...|.++++..|..+... +. ...+..-+..+++. . T Consensus 170 a~~~L~~~GhrrI--~~i~~~~~~~~~~~R~~Gf~~al~~~g~~~~~~----~~---------~~~~~~~g~~~~~~ll~ 234 (330) T PRK11303 170 LAESLLKFPAESI--LLLGALPELSVSFLREQGFRQALKDDPREVHFL----YA---------ESFSREAGAQLFEKWLE 234 (330) T ss_pred HHHHHHHCCCCEE--EEEECCCCCHHHHHHHHHHHHHHHHCCCCEEEE----EC---------CCCCHHHHHHHHHHHHH T ss_conf 9999998699979--999689756379999999999999679980599----64---------88998999999999983 Q ss_pred -CCCCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf -1158899983866589999999984988----99998315 Q gi|254780936|r 111 -EGLEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 111 -~~~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) ...-++|++++|.=-..+++.+++.|.+ |-|++|.. T Consensus 235 ~~~~p~Ai~~~nD~~A~g~l~al~~~g~~VP~DvsvigfDd 275 (330) T PRK11303 235 THGMPDALFTTSYTLLQGVLDVLLERPGKLPSDLAIATFGD 275 (330) T ss_pred CCCCCCEEEECCHHHHHHHHHHHHHCCCCCCCCEEEEEECC T ss_conf 79999779974899999999999985999999759999996 No 116 >cd06281 PBP1_LacI_like_5 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=50.25 E-value=21 Score=16.75 Aligned_cols=103 Identities=10% Similarity=0.003 Sum_probs=58.6 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999999994211 Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEG 112 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~ 112 (182) ..+.+.+.|.-..+ |.+.+........+...|.++++..|..+..... .... ..+.+ -..+..++.. .. T Consensus 107 a~~~L~~~G~~~I~--~i~~~~~~~~~~~R~~G~~~al~~~g~~~~~~~~--~~~~----~~~~~--~~~~~~ll~~-~~ 175 (269) T cd06281 107 AVEYLISLGHRRIA--LVGGGSNTRPGRERLEGYKAAFAAAGLPPDPALV--RLST----PAASG--FDATRALLAL-PD 175 (269) T ss_pred HHHHHHHCCCCCEE--EEECCCCCHHHHHHHHHHHHHHHHCCCCCCCEEE--ECCC----CHHHH--HHHHHHHHHC-CC T ss_conf 99999975998668--9807887767999999999999977999994399--7799----74899--9999999807-99 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf 58899983866589999999984988----99998315 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) --+++++++|.--..++..+++.|.+ |-|++|.. T Consensus 176 ~p~ai~~~~d~~A~g~l~~l~~~g~~VP~disviGfD~ 213 (269) T cd06281 176 RPTAIIAGGTQVLVGVLRALREAGLRIPRDLSVISIGD 213 (269) T ss_pred CCCEEEECCHHHHHHHHHHHHHCCCCCCCCEEEEEECC T ss_conf 99656512338899999999981999999869999888 No 117 >TIGR00034 aroFGH phospho-2-dehydro-3-deoxyheptonate aldolase; InterPro: IPR006219 Members of this group catalyze the first enzymatic reaction of the shikimate pathway. The common (shikimate) pathway links metabolism of carbohydrates to biosynthesis of aromatic amino acids phenylalanine, tyrosine, tryptophan, and derivatives in microorganisms and in plants. In a sequence of seven enzymatic reactions, D-erythrose 4-phosphate (E4P), an intermediate of the pentose phosphate pathway, and phosphoenol pyruvate (PEP), a glycolytic intermediate, are converted to chorismate. The pathway begins with the stereospecific condensation of E4P and PEP to yield 7-phospho 2-dehydro 3-deoxy-D-arabino-heptulosonate (DAHP), catalyzed by 3-deoxy-7-phosphoheptulonate synthase (DAHPS) (EC 2.5.1.54, formerly EC 4.1.2.15). The divalent metal cation requirement of this enzyme can be satisfied by a broad range of metals . A Cys residue in a Cys-X-X-His motif has been identified as part of a metal binding site . In Escherichia coli, the enzyme exists in three isoforms, each specifically inhibited by one of the three aromatic amino acids. DAHP synthetases fall into two classes, class I (represented by this entry) and class II (represented by PIRSF015573 from PIRSF). Class I was believed to be limited to microorganisms and class II to plants. However, a more recent study showed that class II also contains enzymes from a microbial eukaryote and several bacteria . Brick and Woodard proposed that the difference between the two classes lies in their metal ion requirement for activity. Whereas class I requires no metal cation, class II is dependent on a metal cation for activity. However, recently a class I DAHP synthase from Thermotoga maritima has been purified, characterised, and shown to be a metalloenzyme . The three-dimentional structures of DAHP synthases have been determined , , , , . The DAHPS(Phe) monomer is a (beta/alpha)8 barrel with an additional N-terminal beta strand and helices and an extra beta sheet near the C terminus . The active site is located in a cleft at the carboxyl end of the barrel . The allosteric feedback inhibition binding site of DAHPS(Phe) is composed of residues from two adjacent subunits of a tight dimer and is at least 20 angstroms away from the closest active site . The absence of the shikimate pathway in animals makes it an attractive target for nontoxic herbicidal, antimicrobial, and antifungal agents. The nontoxic herbicide glyphosphate competitively inhibits 3-phosphoshikimate 1-carboxyvinyltransferase, the sixth enzymatic reaction of the pathway.; GO: 0003849 3-deoxy-7-phosphoheptulonate synthase activity, 0009073 aromatic amino acid family biosynthetic process. Probab=50.18 E-value=17 Score=17.27 Aligned_cols=33 Identities=21% Similarity=0.355 Sum_probs=21.0 Q ss_pred CCCCCCCCH--HHHHHHHHHHHCCCCEEEEECCCH Q ss_conf 654356532--999999999421158899983866 Q gi|254780936|r 91 RKRVKSSMD--VELAVDAFEQSEGLEHLVIFSGDG 123 (182) Q Consensus 91 ~~~~kk~~D--v~laiD~~~~a~~~d~~iLvSGD~ 123 (182) +...||||| +.+|+||+.-|.+-..|+=++-+| T Consensus 181 PVGFKNGTDG~~~vAiDA~~AA~~~H~Fls~~k~G 215 (348) T TIGR00034 181 PVGFKNGTDGNLKVAIDAIRAAAAPHYFLSVTKDG 215 (348) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHCCCEEECCCCCC T ss_conf 33431357874889999999960698373126877 No 118 >pfam06258 DUF1022 Protein of unknown function (DUF1022). This family consists of several hypothetical eukaryotic and prokaryotic proteins. The function of this family is unknown. Probab=50.13 E-value=21 Score=16.74 Aligned_cols=29 Identities=14% Similarity=0.231 Sum_probs=23.6 Q ss_pred EEEEECCCHHHHHHHHHHHHCCCEEEEEEEE Q ss_conf 8999838665899999999849889999831 Q gi|254780936|r 115 HLVIFSGDGCFTTLVAALQRKVKKVTIVSTV 145 (182) Q Consensus 115 ~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~ 145 (182) .+|+||+|| +-.+-++-.-||.|.|+-.. T Consensus 227 d~iiVT~DS--vSMisEA~~tGkPV~i~~l~ 255 (308) T pfam06258 227 DAVVVTADS--VSMVSEAAATGAPVGVLPLE 255 (308) T ss_pred CEEEEECCH--HHHHHHHHHCCCCEEEEECC T ss_conf 868990671--88999998649977999677 No 119 >pfam06057 VirJ Bacterial virulence protein (VirJ). This family consists of several bacterial VirJ virulence proteins. VirJ is thought to be involved in the type IV secretion system. It is thought that the substrate proteins localized to the periplasm may associate with the pilus in a manner that is mediated by VirJ, and suggest a two-step process for type IV secretion in Agrobacterium. Probab=49.61 E-value=21 Score=16.69 Aligned_cols=10 Identities=30% Similarity=0.773 Sum_probs=6.1 Q ss_pred CEEEEEEEHH Q ss_conf 7189998078 Q gi|254780936|r 6 EKIALFIDGA 15 (182) Q Consensus 6 ~rvaIfID~~ 15 (182) .++|||+=|+ T Consensus 2 dtlav~~SGD 11 (192) T pfam06057 2 DTVAVFYSGD 11 (192) T ss_pred CEEEEEEECC T ss_conf 7799999468 No 120 >PRK02649 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=49.61 E-value=21 Score=16.69 Aligned_cols=85 Identities=13% Similarity=0.119 Sum_probs=49.1 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHH---HHCCCCEEEEECCCHHHHHHHHHH Q ss_conf 31467775799988975595697534589537888654356532999999999---421158899983866589999999 Q gi|254780936|r 56 PEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFE---QSEGLEHLVIFSGDGCFTTLVAAL 132 (182) Q Consensus 56 ~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~---~a~~~d~~iLvSGD~Df~pli~~l 132 (182) .++.........++|++.|++|..-......-+. ......-....+|.+. ....+|.+|.+-|||=+..+++.+ T Consensus 12 ~~~a~~~a~~l~~~L~~~gi~v~l~~~~a~~l~~---~~~~~~~~~~~~~~~~~~~~~~~~Dl~IvlGGDGT~L~aar~~ 88 (305) T PRK02649 12 KPLAVRTAEELQDKLEAAGWEVVRASSSGGMLGY---ANPDQPVCHTGIDALVPEGFDSSMKFAIVLGGDGTVLSAARQT 88 (305) T ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECCHHHHCCC---CCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCHHHHHHHHHH T ss_conf 9899999999999999889999997441232287---8755321124411126333577733999983769999999985 Q ss_pred HHCCCEEEEEE Q ss_conf 98498899998 Q gi|254780936|r 133 QRKVKKVTIVS 143 (182) Q Consensus 133 r~~Gk~V~v~~ 143 (182) ...+..+..+- T Consensus 89 ~~~~iPilGIN 99 (305) T PRK02649 89 APCGIPLLTIN 99 (305) T ss_pred CCCCCCEEEEE T ss_conf 33699789894 No 121 >PRK05249 soluble pyridine nucleotide transhydrogenase; Provisional Probab=49.42 E-value=21 Score=16.67 Aligned_cols=39 Identities=10% Similarity=0.095 Sum_probs=19.5 Q ss_pred HHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEE Q ss_conf 999942115889998386658999999998498899998 Q gi|254780936|r 105 DAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVS 143 (182) Q Consensus 105 D~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~ 143 (182) +++.+..--..++++-|.-==++....++..|.+|+++. T Consensus 168 ~~l~l~~~Pk~vvIIGgG~ig~E~A~~~~~lG~~Vtiv~ 206 (465) T PRK05249 168 SILSLDHLPRSLIIYGAGVIGCEYASIFRGLGVKVDLIN 206 (465) T ss_pred HHHHHCCCCCEEEEECCCCCHHHHHHHHHHHCCEEEEEE T ss_conf 854330079759999998217999999996098778972 No 122 >PRK09355 hydroxyethylthiazole kinase; Validated Probab=48.62 E-value=9.2 Score=18.92 Aligned_cols=71 Identities=20% Similarity=0.200 Sum_probs=38.4 Q ss_pred HHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHH----HHHHHHH-HHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEE Q ss_conf 99889755956975345895378886543565329----9999999-942115889998386658999999998498899 Q gi|254780936|r 66 LLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDV----ELAVDAF-EQSEGLEHLVIFSGDGCFTTLVAALQRKVKKVT 140 (182) Q Consensus 66 ~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv----~laiD~~-~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~ 140 (182) -.+.|....+.++..-.-+...-.+....-+|+|. .-++++. +++++++++|++||-.||+- .|.++. T Consensus 104 ~~~LL~~~~~tvIrGN~sEI~aL~g~~~~~kGVDs~~~~~~~~~~a~~lA~~~~~vvvvTG~~D~Vt-------dg~~~~ 176 (262) T PRK09355 104 ALELLEEVKPAVIRGNASEIAALAGEAGETKGVDSTDGSADAVEIAKALAKKYGTVVVVTGEVDYIT-------DGERVV 176 (262) T ss_pred HHHHHHHCCCCEEECCHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEECCCCEEE-------CCCEEE T ss_conf 9999973898799546999999858878883937898525699999999998398899966872896-------596799 Q ss_pred EEE Q ss_conf 998 Q gi|254780936|r 141 IVS 143 (182) Q Consensus 141 v~~ 143 (182) .+. T Consensus 177 ~v~ 179 (262) T PRK09355 177 SVH 179 (262) T ss_pred EEE T ss_conf 996 No 123 >cd03238 ABC_UvrA The excision repair protein UvrA; Nucleotide excision repair in eubacteria is a process that repairs DNA damage by the removal of a 12-13-mer oligonucleotide containing the lesion. Recognition and cleavage of the damaged DNA is a multistep ATP-dependent reaction that requires the UvrA, UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases, with UvrA having two ATP binding sites, which have the characteristic signature of the family of ABC proteins, and UvrB having one ATP binding site that is structurally related to that of helicases. Probab=48.53 E-value=22 Score=16.58 Aligned_cols=39 Identities=13% Similarity=0.180 Sum_probs=27.8 Q ss_pred CHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHH Q ss_conf 66589999999984988999983157765470899985211151899 Q gi|254780936|r 122 DGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAY 168 (182) Q Consensus 122 D~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~ 168 (182) -..+..+++.+++.|+.|++++ +. .++.+.||+.+.+++ T Consensus 125 ~~~l~~~l~~l~~~g~TvI~vt----Hd----~~~~~~aDrii~l~~ 163 (176) T cd03238 125 INQLLEVIKGLIDLGNTVILIE----HN----LDVLSSADWIIDFGP 163 (176) T ss_pred HHHHHHHHHHHHHCCCEEEEEE----CC----HHHHHHCCEEEEECC T ss_conf 9999999999998799899994----78----799983999999459 No 124 >pfam01380 SIS SIS domain. SIS (Sugar ISomerase) domains are found in many phosphosugar isomerases and phosphosugar binding proteins. SIS domains are also found in proteins that regulate the expression of genes involved in synthesis of phosphosugars. Presumably the SIS domains bind to the end-product of the pathway. Probab=48.40 E-value=22 Score=16.57 Aligned_cols=51 Identities=16% Similarity=0.223 Sum_probs=36.9 Q ss_pred CCCCEEEEECCCH---HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHH Q ss_conf 1158899983866---58999999998498899998315776547089998521115189 Q gi|254780936|r 111 EGLEHLVIFSGDG---CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLA 167 (182) Q Consensus 111 ~~~d~~iLvSGD~---Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~ 167 (182) ..=|.+|++|--+ |.+.+++.+|++|.+++.++-.. ...|-+.||..+.+. T Consensus 52 ~~~d~vi~iS~sG~t~~~~~~~~~a~~~g~~~i~iT~~~------~S~la~~ad~~l~~~ 105 (131) T pfam01380 52 DPDDLVIAISQSGETRDLLEAAKLLKARGAKIIAITDSK------GSPLAREADHVLYII 105 (131) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHHHHCCCEEEEEECCC------CCHHHHHCCEEEECC T ss_conf 999999995489865768987899998499699998999------997899689989879 No 125 >cd06355 PBP1_FmdD_like Periplasmic component (FmdD) of an active transport system for short-chain amides and urea (FmdDEF). This group includes the periplasmic component (FmdD) of an active transport system for short-chain amides and urea (FmdDEF), found in Methylophilus methylotrophus, and its homologs from other bacteria. FmdD, a type I periplasmic binding protein, is induced by short-chain amides and urea and repressed by excess ammonia, while FmdE and FmdF are hydrophobic transmembrane proteins. FmdDEF is predicted to be an ATP-dependent transporter and closely resembles the periplasmic binding protein and the two transmembrane proteins present in various hydrophobic amino acid-binding transport systems. Probab=48.36 E-value=21 Score=16.65 Aligned_cols=66 Identities=12% Similarity=0.011 Sum_probs=37.6 Q ss_pred HHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCE Q ss_conf 75799988975595697534589537888654356532999999999421158899983866589999999984988 Q gi|254780936|r 62 PLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKK 138 (182) Q Consensus 62 ~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~ 138 (182) ....+...++..|.++...... ..+..|..=.+.-+.. .+.|.+++...-.|.+.+++.+++.|.. T Consensus 149 ~~~~~~~~~~~~G~~vv~~~~~----------~~g~~Dfs~~l~ki~~-a~pD~v~~~~~g~~~~~~~~q~~~~G~~ 214 (348) T cd06355 149 ANKILKAQLESLGGEVVGEEYL----------PLGHTDFQSIINKIKA-AKPDVVVSTVNGDSNVAFFKQLKAAGIT 214 (348) T ss_pred HHHHHHHHHHHCCCEEEEEEEC----------CCCCCCHHHHHHHHHH-CCCCEEEEECCCCCHHHHHHHHHHHCCC T ss_conf 9999999999859989999813----------7997567999999997-6999999947651248999999981787 No 126 >PRK10423 transcriptional repressor RbsR; Provisional Probab=48.33 E-value=22 Score=16.56 Aligned_cols=100 Identities=12% Similarity=0.014 Sum_probs=54.5 Q ss_pred HHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH---H Q ss_conf 9998517929999997306887314677757999889755956975345895378886543565329999999994---2 Q gi|254780936|r 34 LKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ---S 110 (182) Q Consensus 34 ~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~---a 110 (182) .++|.+.|.-..++. +.+.+......+...|.++++..|..+...... . ...+..-..+++.. . T Consensus 166 ~~~L~~~Ghr~I~~i--~~~~~~~~~~~R~~gf~~al~~~gl~~~~~~~~-~----------~~~~~~~g~~~~~~ll~~ 232 (327) T PRK10423 166 TQYLIDKGHTRIACI--TGPLDKTPARLRLEGYRAAMKRAGLNIPDGYEV-T----------GDFEFNGGFDAMQQLLAH 232 (327) T ss_pred HHHHHHCCCCEEEEE--CCCCCCCHHHHHHHHHHHHHHHCCCCCCCCEEE-E----------CCCCHHHHHHHHHHHHHC T ss_conf 999997599749996--688766179999999999999859998843699-6----------046737899999999847 Q ss_pred CCCCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf 1158899983866589999999984988----99998315 Q gi|254780936|r 111 EGLEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 111 ~~~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) +.--++|+++.|.=-..+++.+++.|.+ +-|++|.. T Consensus 233 ~~~ptAi~~~nD~~A~g~~~al~~~g~~iP~DisvigfDd 272 (327) T PRK10423 233 PLRPQAVFTGNDAMAVGVYQALYQAGLQVPQDIAVIGYDD 272 (327) T ss_pred CCCCCEEEECCCHHHHHHHHHHHHCCCCCCCCEEEEEECC T ss_conf 9999845517809999999999986999999759999758 No 127 >KOG1116 consensus Probab=48.07 E-value=19 Score=17.02 Aligned_cols=35 Identities=20% Similarity=0.329 Sum_probs=27.1 Q ss_pred HHHHHH--HHCCCCEEEEECCCHHHHHHHHHHHHCCC Q ss_conf 999999--42115889998386658999999998498 Q gi|254780936|r 103 AVDAFE--QSEGLEHLVIFSGDGCFTTLVAALQRKVK 137 (182) Q Consensus 103 aiD~~~--~a~~~d~~iLvSGD~Df~pli~~lr~~Gk 137 (182) |.+++. ...++|.+|.|||||=+.+++.=|-++.. T Consensus 225 Arei~rt~dl~kyDgIv~vsGDGl~hEVlNGLl~R~D 261 (579) T KOG1116 225 AREIVRTLDLGKYDGIVCVSGDGLLHEVLNGLLERPD 261 (579) T ss_pred HHHHHHHHHCCCCCEEEEECCCCCHHHHHHCCCCCCC T ss_conf 9999876420455538996278677885503000655 No 128 >cd03270 ABC_UvrA_I The excision repair protein UvrA domain I; Nucleotide excision repair in eubacteria is a process that repairs DNA damage by the removal of a 12-13-mer oligonucleotide containing the lesion. Recognition and cleavage of the damaged DNA is a multistep ATP-dependent reaction that requires the UvrA, UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases, with UvrA having two ATP binding sites, which have the characteristic signature of the family of ABC proteins, and UvrB having one ATP binding site that is structurally related to that of helicases. Probab=48.06 E-value=22 Score=16.54 Aligned_cols=40 Identities=18% Similarity=0.227 Sum_probs=29.0 Q ss_pred CCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHH Q ss_conf 866589999999984988999983157765470899985211151899 Q gi|254780936|r 121 GDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAY 168 (182) Q Consensus 121 GD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~ 168 (182) .-.-...+++++++.|+.|++++ +. .++.+.||+.+.+++ T Consensus 174 ~~~~i~~~i~~l~~~G~Tvi~Vs----Hd----~~~~~~aDriivm~~ 213 (226) T cd03270 174 DNDRLIETLKRLRDLGNTVLVVE----HD----EDTIRAADHVIDIGP 213 (226) T ss_pred HHHHHHHHHHHHHHCCCEEEEEE----EC----HHHHHHCCEEEEECC T ss_conf 99999999999997699899997----25----789984899999449 No 129 >PRK11302 DNA-binding transcriptional regulator HexR; Provisional Probab=48.01 E-value=22 Score=16.53 Aligned_cols=61 Identities=18% Similarity=0.194 Sum_probs=42.9 Q ss_pred CHHHHHHHHHHHHCCCCEEEEECCCH---HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEE Q ss_conf 32999999999421158899983866---589999999984988999983157765470899985211151 Q gi|254780936|r 98 MDVELAVDAFEQSEGLEHLVIFSGDG---CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMD 165 (182) Q Consensus 98 ~Dv~laiD~~~~a~~~d~~iLvSGD~---Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~ 165 (182) .|..+..-....+..=|.+|++|--| +.+.+++.+|++|.+|+.++ . . + | .|.+.||..+. T Consensus 161 ~d~~~~~~~a~~~~~~Dvvi~iS~sG~t~e~i~~~~~Ak~~ga~vIaIT-~--~-~--S-pLa~~AD~~L~ 224 (284) T PRK11302 161 DDIVMQRMSCMNSSDGDVVVLISHTGRTKSLVELAQLARENGATVIGIT-A--A-G--S-PLAREATLALT 224 (284) T ss_pred CCHHHHHHHHHHCCCCCEEEEECCCCCCHHHHHHHHHHHHCCCCEEEEC-C--C-C--C-HHHHHCCEEEE T ss_conf 8789999999719998889997379998799999999998799589977-9--9-9--8-14886898898 No 130 >PRK05928 hemD uroporphyrinogen-III synthase; Reviewed Probab=47.57 E-value=23 Score=16.49 Aligned_cols=51 Identities=16% Similarity=0.113 Sum_probs=24.8 Q ss_pred HHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCH Q ss_conf 99988975595697534589537888654356532999999999421158899983866 Q gi|254780936|r 65 PLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDG 123 (182) Q Consensus 65 ~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~ 123 (182) .+.+.|...|+.+...++....... .. ......+.+. ...+|++++.|+-+ T Consensus 140 ~L~~~L~~~g~~v~~~~~Y~~~~~~------~~-~~~~~~~~~~-~~~~d~i~ftS~~~ 190 (252) T PRK05928 140 LLGDFLQERGADVDECEVYERKPPK------LE-GAEELIEELQ-TGEVDAIIFTSPSM 190 (252) T ss_pred HHHHHHHHCCCEEEEEEEEEEECCC------CC-CHHHHHHHHH-CCCCCEEEEECHHH T ss_conf 7899999779847999865763788------88-2799999986-28987999909999 No 131 >cd05213 NAD_bind_Glutamyl_tRNA_reduct NADP-binding domain of glutamyl-tRNA reductase. Glutamyl-tRNA reductase catalyzes the conversion of glutamyl-tRNA to glutamate-1-semialdehyde, initiating the synthesis of tetrapyrrole. Whereas tRNAs are generally associated with peptide bond formation in protein translation, here the tRNA activates glutamate in the initiation of tetrapyrrole biosynthesis in archaea, plants and many bacteria. In the first step, activated glutamate is reduced to glutamate-1-semi-aldehyde via the NADPH dependent glutamyl-tRNA reductase. Glutamyl-tRNA reductase forms a V-shaped dimer. Each monomer has 3 domains: an N-terminal catalytic domain, a classic nucleotide binding domain, and a C-terminal dimerization domain. Although the representative structure 1GPJ lacks a bound NADPH, a theoretical binding pocket has been described. (PMID 11172694). Amino acid dehydrogenase (DH)-like NAD(P)-binding domains are members of the Rossmann fold superfamily and include glutamate, Probab=47.51 E-value=23 Score=16.48 Aligned_cols=65 Identities=15% Similarity=0.173 Sum_probs=27.1 Q ss_pred HHHCCCCEEEEECCCHHHHHHHHHHH--HCCCEEEEEEEECCCCCCCHHHHHHHCC-CCEEHHHHHHHHC Q ss_conf 94211588999838665899999999--8498899998315776547089998521-1151899787622 Q gi|254780936|r 108 EQSEGLEHLVIFSGDGCFTTLVAALQ--RKVKKVTIVSTVLSDPSMASDQLRRQAD-YFMDLAYLKNEIA 174 (182) Q Consensus 108 ~~a~~~d~~iLvSGD~Df~pli~~lr--~~Gk~V~v~~~~~~~~~~~S~~L~~~ad-~fi~l~~l~~~i~ 174 (182) +.....|.+|-.||=.+++-..+.+. ..++...++ .-+.+..+.....+-.+ ...++|+|..... T Consensus 234 ~~l~~~DvvisaT~s~~~~~~~~~~~~~~~~~~~~ii--DLavPRdid~~v~~l~~v~L~~iDdL~~i~~ 301 (311) T cd05213 234 ELLNEADVVISATGAPHYAKIVERAMKKRSGKPRLIV--DLAVPRDIEPEVGELEGVRLYTIDDLEEVVE 301 (311) T ss_pred HHHHHCCEEEEECCCCCCHHHHHHHHHHCCCCCEEEE--EECCCCCCCHHHCCCCCEEEEEHHHHHHHHH T ss_conf 9997689999927999620359999753479976999--9179998773347769979987899999999 No 132 >PRK00349 uvrA excinuclease ABC subunit A; Reviewed Probab=47.37 E-value=23 Score=16.47 Aligned_cols=40 Identities=20% Similarity=0.318 Sum_probs=26.6 Q ss_pred CCHH-HHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHH Q ss_conf 8665-89999999984988999983157765470899985211151899 Q gi|254780936|r 121 GDGC-FTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAY 168 (182) Q Consensus 121 GD~D-f~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~ 168 (182) -|-+ .+.++++||..|-.|+||= + -.++.++||+.||+.+ T Consensus 525 rD~~rLi~~L~~Lrd~GNTVlVVE----H----D~~~i~~AD~iIDlGP 565 (944) T PRK00349 525 RDNDRLIETLKRLRDLGNTLIVVE----H----DEDTIRAADYIVDIGP 565 (944) T ss_pred HHHHHHHHHHHHHHHCCCEEEEEE----C----CHHHHHHCCEEEEECC T ss_conf 899999999999985798599983----5----6877752877897178 No 133 >PRK10401 DNA-binding transcriptional regulator GalS; Provisional Probab=47.18 E-value=23 Score=16.45 Aligned_cols=101 Identities=12% Similarity=-0.014 Sum_probs=55.9 Q ss_pred HHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHH-HH-C Q ss_conf 999851792999999730688731467775799988975595697534589537888654356532999999999-42-1 Q gi|254780936|r 34 LKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFE-QS-E 111 (182) Q Consensus 34 ~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~-~a-~ 111 (182) .+.|.+.|.=..++. +.+........+...|.+++...|..+....+. ....|..-.-.++. +. . T Consensus 168 ~~~L~~~GhrrI~~i--~~~~~~~~~~~R~~G~~~al~~~gl~~~~~~v~-----------~~~~~~~~~~~~~~~ll~~ 234 (346) T PRK10401 168 TRMLLNNGHQRIGYL--SSSHGIEDDAMRKAGWLSALKEQGIIPPESWIG-----------TGTPDMQGGEAAMVELLGR 234 (346) T ss_pred HHHHHHCCCCCEEEE--ECCCCCCHHHHHHHHHHHHHHHCCCCCCHHEEE-----------ECCCCHHHHHHHHHHHHHC T ss_conf 999996299828999--479887168999999999999849998701487-----------2687578899999999815 Q ss_pred C-CCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEECC Q ss_conf 1-58899983866589999999984988----999983157 Q gi|254780936|r 112 G-LEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVLS 147 (182) Q Consensus 112 ~-~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~~ 147 (182) + --++|++++|.==.-+++.++++|.+ |-|++|... T Consensus 235 ~~~~tAi~~~nD~~A~g~l~al~~~Gl~VP~DisViGfDd~ 275 (346) T PRK10401 235 NLQLTAVFAYNDNMAAGALTALKDNGIAIPLHLSIIGFDDI 275 (346) T ss_pred CCCCCCCEECCHHHHHHHHHHHHHCCCCCCCCEEEEEECCH T ss_conf 99988614448899999999999859989987289998980 No 134 >TIGR01212 TIGR01212 radical SAM protein, TIGR01212 family; InterPro: IPR005911 This uncharacterised protein family shows significant similarity to IPR005910 from INTERPRO, a longer protein that is a histone acetyltransferase at its C-terminus and is a subunit of RNA polymerase II (in yeast). This family lacks the GNAT acetyltransferase domain.. Probab=46.95 E-value=23 Score=16.43 Aligned_cols=38 Identities=18% Similarity=0.359 Sum_probs=26.4 Q ss_pred HHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCC Q ss_conf 658999999998498899998315776547089998521 Q gi|254780936|r 123 GCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQAD 161 (182) Q Consensus 123 ~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad 161 (182) +||+.++.++|.+|.+|. .....|.++-.-.+..+.|- T Consensus 165 ~~y~~a~~~~~krGikVC-~H~I~GLPgE~~~~~~eTak 202 (307) T TIGR01212 165 ACYVDAVKRARKRGIKVC-SHVILGLPGEDREEMLETAK 202 (307) T ss_pred HHHHHHHHHHHHCCCEEE-EEEEECCCCCCHHHHHHHHH T ss_conf 899999999976598899-99874289888889999999 No 135 >cd04104 p47_IIGP_like p47 (47-kDa) family. The p47 GTPase family consists of several highly homologous proteins, including IGTP, TGTP/Mg21, IRG-47, GTPI, LRG-47, and IIGP1. They are found in higher eukaryotes where they play a role in immune resistance against intracellular pathogens. p47 proteins exist at low resting levels in mouse cells, but are strongly induced by Type II interferon (IFN-gamma). ITGP is critical for resistance to Toxoplasma gondii infection and in involved in inhibition of Coxsackievirus-B3-induced apoptosis. TGTP was shown to limit vesicular stomatitis virus (VSV) infection of fibroblasts in vitro. IRG-47 is involved in resistance to T. gondii infection. LRG-47 has been implicated in resistance to T. gondii, Listeria monocytogenes, Leishmania, and mycobacterial infections. IIGP1 has been shown to localize to the ER and to the Golgi membranes in IFN-induced cells and inflamed tissues. In macrophages, IIGP1 interacts with hook3, a microtubule binding protei Probab=46.86 E-value=23 Score=16.42 Aligned_cols=33 Identities=24% Similarity=0.214 Sum_probs=26.9 Q ss_pred CCCCEEEEECCCH---HHHHHHHHHHHCCCEEEEEE Q ss_conf 1158899983866---58999999998498899998 Q gi|254780936|r 111 EGLEHLVIFSGDG---CFTTLVAALQRKVKKVTIVS 143 (182) Q Consensus 111 ~~~d~~iLvSGD~---Df~pli~~lr~~Gk~V~v~~ 143 (182) ..||.||++|.+. --+.+++.+++.||++.+|- T Consensus 79 ~~yD~fiiiss~rf~~nd~~la~~i~~~gK~fyfVR 114 (197) T cd04104 79 SEYDFFIIISSTRFSSNDVKLAKAIQCMGKKFYFVR 114 (197) T ss_pred CCCCEEEEEECCCCCHHHHHHHHHHHHCCCCEEEEE T ss_conf 545789998388641426999999998099289998 No 136 >PRK05976 dihydrolipoamide dehydrogenase; Validated Probab=46.73 E-value=23 Score=16.41 Aligned_cols=31 Identities=23% Similarity=0.192 Sum_probs=17.8 Q ss_pred CEEEEECCCHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 8899983866589999999984988999983 Q gi|254780936|r 114 EHLVIFSGDGCFTTLVAALQRKVKKVTIVST 144 (182) Q Consensus 114 d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~ 144 (182) ..++++-|--==+++...+.+.|.+|.++.. T Consensus 176 k~v~ViGgG~ig~E~A~~~~~lG~~Vtii~~ 206 (464) T PRK05976 176 KSLVVVGGGYIGLEWGSMLRKFGVEVTVVEA 206 (464) T ss_pred CEEEEECCCHHHHHHHHHHHHCCCEEEEEEE T ss_conf 5599989968999999999953986999985 No 137 >PRK13937 phosphoheptose isomerase; Provisional Probab=46.66 E-value=23 Score=16.40 Aligned_cols=52 Identities=8% Similarity=0.021 Sum_probs=38.1 Q ss_pred HHCCCCEEEEECCCH---HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEH Q ss_conf 421158899983866---5899999999849889999831577654708999852111518 Q gi|254780936|r 109 QSEGLEHLVIFSGDG---CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDL 166 (182) Q Consensus 109 ~a~~~d~~iLvSGD~---Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l 166 (182) +++.=|.++.+|+.+ ..+.+++.++++|.+|+.++-.. ...|.+.+|.-+.+ T Consensus 107 ~~~~gDili~iS~SGnS~Nii~A~~~A~~~g~~~i~ltG~~------gg~l~~~~D~~i~v 161 (192) T PRK13937 107 LGRPGDVLIGISTSGNSPNVLAALEKARELGMTTIGLTGRD------GGKMKELCDLLLIV 161 (192) T ss_pred CCCCCCEEEEECCCCCCHHHHHHHHHHHHCCCEEEEEECCC------CHHHHHHCCEEEEE T ss_conf 57889889997799999799999999998799799998889------60578839989995 No 138 >cd06294 PBP1_ycjW_transcription_regulator_like Ligand-binding domain of uncharacterized transcription regulator ycjW which is a member of the LacI-GalR family repressors. This group includes the ligand-binding domain of uncharacterized transcription regulator ycjW which is a member of the LacI-GalR family repressors that are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=46.14 E-value=24 Score=16.35 Aligned_cols=101 Identities=13% Similarity=-0.023 Sum_probs=58.7 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH-H- Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999999994-2- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ-S- 110 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~-a- 110 (182) +.+.+.+.|.-..++. +.+........+...|.++++..|..+....+. ..+.+..-+..++.. . T Consensus 113 a~~~L~~~Ghr~I~~i--~~~~~~~~~~~R~~Gf~~al~~~g~~~~~~~~~-----------~~~~~~~~~~~~~~~ll~ 179 (270) T cd06294 113 ATEYLIKLGHKKIAFV--GGDLDLEVTQDRLQGYKQALEDHGIPDRNEVII-----------SLDFSEEGGYKALKKLLE 179 (270) T ss_pred HHHHHHHCCCCEEEEE--CCCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEE-----------CCCCCHHHHHHHHHHHHH T ss_conf 9999997699569998--189986589999999999999869997733331-----------377667889999999984 Q ss_pred -CCCCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf -1158899983866589999999984988----99998315 Q gi|254780936|r 111 -EGLEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 111 -~~~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) ..-=++|++++|.=-..+++.++++|.+ +-|++|.. T Consensus 180 ~~~~~tai~~~nD~~A~g~~~al~~~g~~iP~disvvgfDd 220 (270) T cd06294 180 QHPRPTAIVATDDLLALGVLKVLNELGLKVPEDLSIIGFNN 220 (270) T ss_pred CCCCCCEEEECCHHHHHHHHHHHHHCCCCCCCCEEEEEECC T ss_conf 69998489987749999999999983999999849999897 No 139 >PRK01231 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=45.58 E-value=24 Score=16.30 Aligned_cols=79 Identities=18% Similarity=0.167 Sum_probs=48.9 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHC Q ss_conf 31467775799988975595697534589537888654356532999999999421158899983866589999999984 Q gi|254780936|r 56 PEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRK 135 (182) Q Consensus 56 ~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~ 135 (182) .++.....+....+|.+.|+++..-+....... ..+. -..+.-..+..+|.+|.+-|||=+..+++.+... T Consensus 15 ~~~a~~~~~~l~~~L~~~gi~v~ld~~~a~~~~------~~~~---~~~~~~~~~~~~Dlii~lGGDGT~L~~~~~~~~~ 85 (296) T PRK01231 15 SSQVVETLRRLKRFLLDRHLHVILEEETAEVLP------GHGL---QTVSRKLLGEVCDLVIVVGGDGSLLGAARALARH 85 (296) T ss_pred CHHHHHHHHHHHHHHHHCCCEEEEEHHHHHHCC------CCCC---CCCCHHHCCCCEEEEEEECCCCHHHHHHHHHCCC T ss_conf 879999999999999878899999324776658------7886---6245334165304999957872899999996036 Q ss_pred CCEEEEEE Q ss_conf 98899998 Q gi|254780936|r 136 VKKVTIVS 143 (182) Q Consensus 136 Gk~V~v~~ 143 (182) .+.|..+- T Consensus 86 ~~PilGiN 93 (296) T PRK01231 86 NVPVLGIN 93 (296) T ss_pred CCCEEEEE T ss_conf 99789885 No 140 >cd06315 PBP1_ABC_sugar_binding_like_6 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=44.93 E-value=25 Score=16.23 Aligned_cols=11 Identities=36% Similarity=0.531 Sum_probs=4.4 Q ss_pred CEEEEECCCHH Q ss_conf 88999838665 Q gi|254780936|r 114 EHLVIFSGDGC 124 (182) Q Consensus 114 d~~iLvSGD~D 124 (182) +-+-++.||++ T Consensus 218 ~~~~v~aGDGs 228 (280) T cd06315 218 DPRNISAGDGS 228 (280) T ss_pred CCEEEEECCCC T ss_conf 96489806799 No 141 >TIGR02853 spore_dpaA dipicolinic acid synthetase, A subunit; InterPro: IPR014215 This predicted Rossman fold-containing protein is the A subunit of dipicolinic acid synthetase as found in most, though not all, endospore-forming low-GC Gram-positive bacteria; it is absent in Clostridium. The B subunit is represented by IPR014214 from INTERPRO. This protein is also known as SpoVFA.. Probab=44.81 E-value=25 Score=16.22 Aligned_cols=70 Identities=14% Similarity=0.196 Sum_probs=39.1 Q ss_pred CCCCCHHHHHHHHHHHH-CCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHH---HHCCCCEEHHHH Q ss_conf 35653299999999942-11588999838665899999999849889999831577654708999---852111518997 Q gi|254780936|r 94 VKSSMDVELAVDAFEQS-EGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLR---RQADYFMDLAYL 169 (182) Q Consensus 94 ~kk~~Dv~laiD~~~~a-~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~---~~ad~fi~l~~l 169 (182) .|+ =+.||++...+. ..-..+||==|=--.+ +.+.++..|-+|.|.+= .+..|. +.|-.=+.+.+| T Consensus 135 aEG--AimMA~e~td~TIHgS~v~VlGfGRtG~t-iAr~f~aLGA~V~V~AR-------~~~dlARI~E~g~~P~~~~~L 204 (288) T TIGR02853 135 AEG--AIMMAIEHTDFTIHGSNVMVLGFGRTGMT-IARTFSALGARVSVGAR-------SSADLARITEMGLEPVPLNKL 204 (288) T ss_pred HHH--HHHHHHHCCCCCEECCEEEEECCCCHHHH-HHHHHHHCCCEEEEECC-------CHHHHHHHHHHCCCCCCHHHH T ss_conf 678--99999724896250134578844705689-99999726980575317-------836789999960688271678 Q ss_pred HHHH Q ss_conf 8762 Q gi|254780936|r 170 KNEI 173 (182) Q Consensus 170 ~~~i 173 (182) .+.+ T Consensus 205 ~~~v 208 (288) T TIGR02853 205 EEKV 208 (288) T ss_pred HHHH T ss_conf 8765 No 142 >cd06290 PBP1_LacI_like_9 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=44.74 E-value=25 Score=16.22 Aligned_cols=104 Identities=13% Similarity=0.011 Sum_probs=58.7 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999999994211 Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEG 112 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~ 112 (182) +.+++.+.+.-..+ |.+.+........+...|.++++..|..+....+. ..+ ...+. -...+.+++..-+. T Consensus 106 ~~~~L~~~G~~~i~--~i~~~~~~~~~~~R~~Gf~~a~~~~g~~~~~~~~~-~~~----~~~~~--~~~~~~~~l~~~~~ 176 (265) T cd06290 106 ATQHLIDLGHRRIA--HITGPRGHIDARDRLAGYRKALEEAGLEVQPDLIV-QGD----FEEES--GLEAVEELLQRGPD 176 (265) T ss_pred HHHHHHHHCCCCEE--EECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHEE-CCC----CCCHH--HHHHHHHHHHHCCC T ss_conf 99999982998655--60488787019999999999999869999888834-166----77178--99999999971999 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf 58899983866589999999984988----99998315 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) . ++|+++.|.--..++..+++.|.+ +-|+++.. T Consensus 177 ~-~ai~~~nD~~A~g~~~~~~~~g~~vP~di~vigfD~ 213 (265) T cd06290 177 F-TAIFAANDQTAYGARLALYRRGLRVPEDVSLIGFDD 213 (265) T ss_pred C-CCCEECCHHHHHHHHHHHHHHCCCCCCCEEEEEECC T ss_conf 8-842105789999999999980999999879999998 No 143 >PRK11914 diacylglycerol kinase; Reviewed Probab=44.71 E-value=25 Score=16.21 Aligned_cols=85 Identities=19% Similarity=0.259 Sum_probs=48.6 Q ss_pred EEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH--CCCCEEEEEC Q ss_conf 99999973068873146777579998897559569753458953788865435653299999999942--1158899983 Q gi|254780936|r 43 VIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS--EGLEHLVIFS 120 (182) Q Consensus 43 l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a--~~~d~~iLvS 120 (182) ..++....+..................|++.|+.+..... +..-| +.++.+.+ +.+|.+|.+- T Consensus 6 m~kv~vIvNP~aG~g~~~~~~~~i~~~l~~~g~~~~~~~t------------~~~~~---a~~la~~a~~~g~d~vv~~G 70 (304) T PRK11914 6 IGKVTVLTNPLSGHGAAPHAAERAIARLHHRGVDVVEIVG------------TDAHH---ARHLVAAALAKGTDALVVVG 70 (304) T ss_pred CCEEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEC------------CCHHH---HHHHHHHHHHCCCCEEEEEE T ss_conf 8669999997779985688999999999987990999932------------78789---99999988864996999995 Q ss_pred CCHHHHHHHHHHHHCCCEEEEE Q ss_conf 8665899999999849889999 Q gi|254780936|r 121 GDGCFTTLVAALQRKVKKVTIV 142 (182) Q Consensus 121 GD~Df~pli~~lr~~Gk~V~v~ 142 (182) ||+=.-.++..+...+....++ T Consensus 71 GDGTv~ev~~~l~~~~~plgii 92 (304) T PRK11914 71 GDGVISNALQVLAGTDIPLGII 92 (304) T ss_pred CCHHHHHHHHHHCCCCCEEEEE T ss_conf 6259889876413578608996 No 144 >cd01170 THZ_kinase 4-methyl-5-beta-hydroxyethylthiazole (Thz) kinase catalyzes the phosphorylation of the hydroxylgroup of Thz. A reaction that allows cells to recycle Thz into the thiamine biosynthesis pathway, as an alternative to its synthesis from cysteine, tyrosine and 1-deoxy-D-xylulose-5-phosphate. Probab=43.89 E-value=15 Score=17.67 Aligned_cols=70 Identities=17% Similarity=0.208 Sum_probs=38.0 Q ss_pred HHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHH-----HHHHHHHH-HHCCCCEEEEECCCHHHHHHHHHHHHCCCEEE Q ss_conf 9889755956975345895378886543565329-----99999999-42115889998386658999999998498899 Q gi|254780936|r 67 LDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDV-----ELAVDAFE-QSEGLEHLVIFSGDGCFTTLVAALQRKVKKVT 140 (182) Q Consensus 67 ~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv-----~laiD~~~-~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~ 140 (182) .+.|....+.++..-.-+...-.+....-+|+|. .-.+++.. ++.++.++|++||-.||+ ..|.++. T Consensus 100 ~~lL~~~~~tvIrGN~sEI~aL~g~~~~~kGVDs~~~~~~~~~~~a~~lA~~~~~vVvvTG~~D~V-------~dg~~~~ 172 (242) T cd01170 100 KELLAEGQPTVIRGNASEIAALAGLTGLGKGVDSSSSDEEDALELAKALARKYGAVVVVTGEVDYI-------TDGERVV 172 (242) T ss_pred HHHHHHCCCCEEECCHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEECCCCEE-------ECCCEEE T ss_conf 999973898799537999999957778986944898555789999999999829799998786189-------7597799 Q ss_pred EEE Q ss_conf 998 Q gi|254780936|r 141 IVS 143 (182) Q Consensus 141 v~~ 143 (182) .+. T Consensus 173 ~v~ 175 (242) T cd01170 173 VVK 175 (242) T ss_pred EEE T ss_conf 983 No 145 >COG1105 FruK Fructose-1-phosphate kinase and related fructose-6-phosphate kinase (PfkB) [Carbohydrate transport and metabolism] Probab=43.48 E-value=26 Score=16.09 Aligned_cols=45 Identities=20% Similarity=0.371 Sum_probs=33.6 Q ss_pred CHHHHHHHHHHH-HCCCCEEEEECC-------CHHHHHHHHHHHHCCCEEEEEE Q ss_conf 329999999994-211588999838-------6658999999998498899998 Q gi|254780936|r 98 MDVELAVDAFEQ-SEGLEHLVIFSG-------DGCFTTLVAALQRKVKKVTIVS 143 (182) Q Consensus 98 ~Dv~laiD~~~~-a~~~d~~iLvSG-------D~Df~pli~~lr~~Gk~V~v~~ 143 (182) .+.+.-++-+.. ....|. |++|| -.+|..+++.+|+.|++|.+=+ T Consensus 114 ~~~~~~l~~~~~~l~~~d~-VvlsGSlP~g~~~d~y~~li~~~~~~g~~vilD~ 166 (310) T COG1105 114 AELEQFLEQLKALLESDDI-VVLSGSLPPGVPPDAYAELIRILRQQGAKVILDT 166 (310) T ss_pred HHHHHHHHHHHHHCCCCCE-EEEECCCCCCCCHHHHHHHHHHHHHCCCEEEEEC T ss_conf 9999999999975664889-9990889999997999999999986598399979 No 146 >cd01481 vWA_collagen_alpha3-VI-like VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions. Probab=43.40 E-value=26 Score=16.09 Aligned_cols=60 Identities=18% Similarity=0.182 Sum_probs=38.0 Q ss_pred CHHHHHHHHHH-HH----------CCC-CEEEEEC---CCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCC Q ss_conf 32999999999-42----------115-8899983---86658999999998498899998315776547089998521 Q gi|254780936|r 98 MDVELAVDAFE-QS----------EGL-EHLVIFS---GDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQAD 161 (182) Q Consensus 98 ~Dv~laiD~~~-~a----------~~~-d~~iLvS---GD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad 161 (182) +....|++.+. .. +++ ..+||+| ...|.....+.||..|..+..+++. +.-..+|...|. T Consensus 80 t~tg~AL~~a~~~~f~~~~g~R~r~~v~kvlvviTdG~s~d~~~~~a~~lr~~gV~i~aVGvg----~~~~~eL~~IAs 154 (165) T cd01481 80 LNTGSALDYVVKNLFTKSAGSRIEEGVPQFLVLITGGKSQDDVERPAVALKRAGIVPFAIGAR----NADLAELQQIAF 154 (165) T ss_pred EEHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHCCCEEEEEECC----CCCHHHHHHHHC T ss_conf 369999999999716756788755799869999848988537899999999889789999689----799999999858 No 147 >PRK06370 mercuric reductase; Validated Probab=43.23 E-value=26 Score=16.07 Aligned_cols=31 Identities=26% Similarity=0.280 Sum_probs=18.1 Q ss_pred EEEEECCCHHHHHHHHHHHHCCCEEEEEEEE Q ss_conf 8999838665899999999849889999831 Q gi|254780936|r 115 HLVIFSGDGCFTTLVAALQRKVKKVTIVSTV 145 (182) Q Consensus 115 ~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~ 145 (182) .++++-|--==+++...+.+.|++|+++.-. T Consensus 172 ~v~ViGgG~ig~E~A~~~~~~G~~Vtlv~~~ 202 (459) T PRK06370 172 HLAVIGGGYIGLEFAQAFRRFGSEVTVVERG 202 (459) T ss_pred EEEEECCCHHHHHHHHHHHHHCCEEEEEEEC T ss_conf 4999898477999999999639889999965 No 148 >PRK13222 phosphoglycolate phosphatase; Provisional Probab=43.07 E-value=27 Score=16.06 Aligned_cols=71 Identities=15% Similarity=0.053 Sum_probs=44.6 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHH-CCCCE-EHHHHHHHHCC Q ss_conf 2999999999421158899983866589999999984988999983157765470899985-21115-18997876225 Q gi|254780936|r 99 DVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQ-ADYFM-DLAYLKNEIAR 175 (182) Q Consensus 99 Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~-ad~fi-~l~~l~~~i~r 175 (182) |-.+-..+++...---.=+|+=||+- .=++.+++.|.+++.+.. |+ ++..+|..+ ||..+ ++.+|.+.++= T Consensus 151 ~P~~~~~a~~~lg~~p~e~l~VGDs~--~Di~aA~~aG~~~i~v~~--G~--~~~~~l~~~~ad~vi~~~~eL~~~Lg~ 223 (228) T PRK13222 151 DPAPLLLACEKLGIDPEQMLFVGDSR--NDIQAAKAAGCPSVGVTY--GY--NYGEDIALSEPDVVIDHFAELLPALGL 223 (228) T ss_pred CHHHHHHHHHHCCCCCCCEEEEECCH--HHHHHHHHCCCEEEEECC--CC--CCHHHHHHCCCCEEECCHHHHHHHHCH T ss_conf 86999999998197934348980688--899999996994999878--99--997789768999998999999998553 No 149 >KOG1434 consensus Probab=43.03 E-value=27 Score=16.05 Aligned_cols=104 Identities=18% Similarity=0.227 Sum_probs=56.9 Q ss_pred EEEEEEHHHHHHHHH----HCCCCCCHHHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCC-CEEE---- Q ss_conf 899980789999998----5087679899999985179299999973068873146777579998897559-5697---- Q gi|254780936|r 8 IALFIDGANLYASSK----ALGFDIDYRKLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNG-FQVV---- 78 (182) Q Consensus 8 vaIfID~~Nl~~~~~----~~~~~~d~~~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g-~~v~---- 78 (182) -++|||.++-|.--+ .-++++|.+..++-+. ..+.|. ..+.-......-+.+.+.| +.+. T Consensus 151 k~ifIDTEgtFrpdRi~~IAe~~~~d~d~~LdNI~------y~Ra~~-----se~qmelv~~L~~~~se~g~~rlvIVDs 219 (335) T KOG1434 151 KAIFIDTEGTFRPDRIKDIAERFKVDPDFTLDNIL------YFRAYN-----SEEQMELVYLLGDFLSEHGKYRLVIVDS 219 (335) T ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCHHHHHHHHH------HHHHCC-----HHHHHHHHHHHHHHHHHCCCEEEEEEEC T ss_conf 28999258861458999999874889889887778------877728-----6999999999888874448579999800 Q ss_pred -EEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf -53458953788865435653299999999942115889998386 Q gi|254780936|r 79 -AKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGD 122 (182) Q Consensus 79 -~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD 122 (182) +.+++.--++.+........=..|+.-++.+|.++..+|++++- T Consensus 220 Ima~FRvDy~grgeLseRqqkLn~ml~kl~~laeefnvAVfltNQ 264 (335) T KOG1434 220 IMALFRVDYDGRGELSERQQKLNQMLQKLNKLAEEFNVAVFLTNQ 264 (335) T ss_pred EEHHEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHCCEEEEEECC T ss_conf 000302145664308899999999999999998752279998411 No 150 >pfam01861 DUF43 Protein of unknown function DUF43. This family includes archaebacterial proteins of unknown function. All the members are 350-400 amino acids long. Probab=42.15 E-value=26 Score=16.09 Aligned_cols=26 Identities=35% Similarity=0.542 Sum_probs=9.7 Q ss_pred EEECCCHHHHHHHHHHHHCCCEEEEE Q ss_conf 99838665899999999849889999 Q gi|254780936|r 117 VIFSGDGCFTTLVAALQRKVKKVTIV 142 (182) Q Consensus 117 iLvSGD~Df~pli~~lr~~Gk~V~v~ 142 (182) |||=||-|++-+.-.+....++|.|+ T Consensus 48 iL~vGDDDLtSlA~al~~~p~~I~Vv 73 (243) T pfam01861 48 ILVLGDDDLTSLAAALTGLPKRIAVV 73 (243) T ss_pred EEEECCCCHHHHHHHHCCCCCEEEEE T ss_conf 99972861878999964898548999 No 151 >PRK00676 hemA glutamyl-tRNA reductase; Validated Probab=42.14 E-value=27 Score=15.97 Aligned_cols=66 Identities=6% Similarity=0.135 Sum_probs=34.1 Q ss_pred HHHCCCCEEEEECCCHHHHH---HHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHHHHHHHCCC Q ss_conf 94211588999838665899---9999998498899998315776547089998521115189978762258 Q gi|254780936|r 108 EQSEGLEHLVIFSGDGCFTT---LVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAYLKNEIARD 176 (182) Q Consensus 108 ~~a~~~d~~iLvSGD~Df~p---li~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~l~~~i~r~ 176 (182) .+...+|.++..|+.+.++- .++.+....++..+ .-+.+......-...-+.|.++|||...+++. T Consensus 222 ~~~~~~diii~~ts~~~~~~~~~~~e~~~~~~~rlii---DLAVPRDI~~~~~~~~~~~l~IDDL~~iv~~N 290 (338) T PRK00676 222 SFRDPYDVIFFGSSESAYAFPHLSWESLASIPDRIVF---DFNVPRTFPWSETPFGHRYLDMDFISEWVQKH 290 (338) T ss_pred CCCCCCCEEEEECCCCCCCCCHHHHHHHHHCCCCEEE---EECCCCCCCHHHCCCCCEEEEHHHHHHHHHHH T ss_conf 0246578899967886545656789998715698799---95079998710257798998588689999999 No 152 >pfam05761 5_nucleotid 5' nucleotidase family. This family of eukaryotic proteins includes 5' nucleotidase enzymes, such as purine 5'-nucleotidase EC:3.1.3.5. Probab=41.84 E-value=25 Score=16.26 Aligned_cols=30 Identities=27% Similarity=0.341 Sum_probs=15.9 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHCCCEEEEEE Q ss_conf 15889998386658999999998498899998 Q gi|254780936|r 112 GLEHLVIFSGDGCFTTLVAALQRKVKKVTIVS 143 (182) Q Consensus 112 ~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~ 143 (182) +.+.+|.= |.+..+++.++|+.||++.+++ T Consensus 177 dp~kYv~k--dp~l~~~L~~lr~~GKklFLiT 206 (448) T pfam05761 177 DLEKYVIK--DPELPPALSRLRESGKKLFLLT 206 (448) T ss_pred CHHHHHHC--CCCHHHHHHHHHHCCCEEEEEC T ss_conf 98988535--9308899999997197499981 No 153 >PRK09526 lacI lac repressor; Reviewed Probab=41.80 E-value=28 Score=15.93 Aligned_cols=99 Identities=11% Similarity=-0.032 Sum_probs=55.6 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHH-HH- Q ss_conf 9999851792999999730688731467775799988975595697534589537888654356532999999999-42- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFE-QS- 110 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~-~a- 110 (182) +.+++.+.|.=..++.- .+........+...|.++++..|..... .. ....+..-+.+.+. +. T Consensus 172 a~~~L~~~Ghr~I~~i~--~~~~~~~~~~R~~Gf~~~~~~~~l~~~~----~~---------~~~~~~~~~~~~~~~~l~ 236 (342) T PRK09526 172 GVEHLVELGHQQIALLA--GPLSSVSARLRLAGWLEYLTRNQLSPIA----VR---------EGDWSAMSGYQQTLQMLN 236 (342) T ss_pred HHHHHHHCCCCEEEEEE--CCCCCCHHHHHHHHHHHHHHHCCCCCCE----EE---------ECCCCHHHHHHHHHHHHH T ss_conf 99999975997599995--8976600999999999999977989616----99---------568987899999999983 Q ss_pred -CCCCEEEEECCCHHHHHHHHHHHHCCCEE----EEEEEEC Q ss_conf -11588999838665899999999849889----9998315 Q gi|254780936|r 111 -EGLEHLVIFSGDGCFTTLVAALQRKVKKV----TIVSTVL 146 (182) Q Consensus 111 -~~~d~~iLvSGD~Df~pli~~lr~~Gk~V----~v~~~~~ 146 (182) ..--++|++++|.=-+.++..++++|.+| -|++|.. T Consensus 237 ~~~~ptai~~~~D~~A~g~i~~l~~~g~~vP~DisIigfDd 277 (342) T PRK09526 237 EGPVFTAVLVANDQMALGVLRALHESGLRVPGDISVIGYDD 277 (342) T ss_pred CCCCCCEEEECCHHHHHHHHHHHHHHCCCCCCCEEEEEECC T ss_conf 69999758756779999999999980999999749999898 No 154 >cd06276 PBP1_FucR_like Ligand-binding domain of a transcription repressor, FucR, which functions as a molecular sensor of L-fucose availability. Ligand-binding domain of a transcription repressor, FucR, which functions as a molecular sensor of L-fucose availability. FcuR acts as an inducer of fucRRIAK and as a corepressor of another locus that regulates production of fucosylated glycans. FcuR and its close homologs in this group are a member of the LacI-GalR family repressors that are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes t Probab=41.78 E-value=28 Score=15.93 Aligned_cols=33 Identities=18% Similarity=0.140 Sum_probs=27.2 Q ss_pred EEEEECCCHHHHHHHHHHHHCCC----EEEEEEEECC Q ss_conf 89998386658999999998498----8999983157 Q gi|254780936|r 115 HLVIFSGDGCFTTLVAALQRKVK----KVTIVSTVLS 147 (182) Q Consensus 115 ~~iLvSGD~Df~pli~~lr~~Gk----~V~v~~~~~~ 147 (182) ++.++.+|.|.+-+++.+++.|. .+-++|+... T Consensus 164 ~~yi~~~d~dlv~vik~~~~~gl~ip~DisiIgfdd~ 200 (247) T cd06276 164 DLYIILSDTDLVFLIKKARESGLLLGKDIGIISYNDT 200 (247) T ss_pred EEEEEECHHHHHHHHHHHHHCCCCCCCCEEEEEECCH T ss_conf 2999977189999999999849999873699975774 No 155 >cd06280 PBP1_LacI_like_4 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=41.21 E-value=28 Score=15.88 Aligned_cols=101 Identities=6% Similarity=-0.028 Sum_probs=59.0 Q ss_pred HHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 99999851792999999730688731467775799988975595697534589537888654356532999999999421 Q gi|254780936|r 32 KLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSE 111 (182) Q Consensus 32 ~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~ 111 (182) .+.+.+.+.+.-..++... . . ......+...|.+++++.|..+.... .... .+. -...+.+++.. + T Consensus 105 ~a~~~L~~~Ghr~I~~l~g-~-~-~~~~~~R~~Gf~~al~~~gi~~~~~~----~~~~----~~~--~~~~~~~~l~~-~ 170 (263) T cd06280 105 TLVEHLVAQGYRRIGGLFG-N-A-STTGAERRAGYEDAMRRHGLAPDARF----VAPT----AEA--AEAALAAWLAA-P 170 (263) T ss_pred HHHHHHHHHCCCEEEEEEC-C-C-CCCHHHHHHHHHHHHHHCCCCCCEEE----ECCC----HHH--HHHHHHHHHHC-C T ss_conf 9999998608966999938-9-9-84399999999999997699976378----2487----678--99999999827-9 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf 158899983866589999999984988----99998315 Q gi|254780936|r 112 GLEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 112 ~~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) .--++|+++.|.=-..+++.+++.|.+ +-|++|.. T Consensus 171 ~~~~ai~~~nD~~A~g~l~~l~~~g~~vP~disviGfDd 209 (263) T cd06280 171 ERPEALVASNGLLLLGALRAVRAAGLRIPQDLALAGFDN 209 (263) T ss_pred CCCCEEEECCHHHHHHHHHHHHHHCCCCCCCCEEEEECC T ss_conf 999789967889999999999984899999878999778 No 156 >cd02072 Glm_B12_BD B12 binding domain of glutamate mutase (Glm). Glutamate mutase catalysis the conversion of (S)-glutamate with (2S,3S)-3-methylaspartate. The rearrangement reaction is initiated by the extraction of a hydrogen from the protein-bound substrate by a 5'-desoxyadenosyl radical, which is generated by the homolytic cleavage of the organometallic bond of the cofactor B12. Glm is a heterotetrameric molecule consisting of two alpha and two epsilon polypeptide chains. Probab=41.07 E-value=28 Score=15.86 Aligned_cols=18 Identities=28% Similarity=0.167 Sum_probs=11.2 Q ss_pred HHHHHHHHHCCCEEEEEE Q ss_conf 799988975595697534 Q gi|254780936|r 64 HPLLDWLHYNGFQVVAKV 81 (182) Q Consensus 64 ~~~~~~l~~~g~~v~~~~ 81 (182) .-....|+..||+|+.-. T Consensus 17 ~iva~~l~~~GfeVi~lG 34 (128) T cd02072 17 KILDHAFTEAGFNVVNLG 34 (128) T ss_pred HHHHHHHHHCCCEEEECC T ss_conf 999999997897298479 No 157 >cd06273 PBP1_GntR_like_1 This group includes the ligand-binding domain of putative DNA transcription repressors which are highly similar to that of the repressor specific for gluconate (GntR), a member of the LacI-GalR family of bacterial transcription regulators. This group includes the ligand-binding domain of putative DNA transcription repressors which are highly similar to that of the repressor specific for gluconate (GntR), a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational Probab=40.97 E-value=29 Score=15.85 Aligned_cols=102 Identities=12% Similarity=-0.037 Sum_probs=58.6 Q ss_pred HHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHH----HHHHHH Q ss_conf 9999985179299999973068873146777579998897559569753458953788865435653299----999999 Q gi|254780936|r 32 KLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVE----LAVDAF 107 (182) Q Consensus 32 ~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~----laiD~~ 107 (182) .+.+++.+.|.-..++. .+..........+...|.++++..|..+....... .+.+.. .+.+++ T Consensus 106 ~~~~~L~~~G~~~i~~i-~~~~~~~~~~~~R~~gf~~al~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~l 173 (268) T cd06273 106 LAARHLIALGHRRIAMI-FGPTQGNDRARARRAGVRAALAEAGLELPELWQVE-----------APYSIADGRAALRQLL 173 (268) T ss_pred HHHHHHHHCCCCCEEEE-ECCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHEEE-----------CCCCHHHHHHHHHHHH T ss_conf 99999987298747999-46877777899999999999998099987142031-----------5789899999999998 Q ss_pred HHHCCCCEEEEECCCHHHHHHHHHHHHCCC----EEEEEEEEC Q ss_conf 942115889998386658999999998498----899998315 Q gi|254780936|r 108 EQSEGLEHLVIFSGDGCFTTLVAALQRKVK----KVTIVSTVL 146 (182) Q Consensus 108 ~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk----~V~v~~~~~ 146 (182) ..... -++|+++.|.=-..++..++++|. .|-++++.. T Consensus 174 ~~~~~-~~ai~~~nd~~A~g~i~~l~~~g~~iP~disvigfd~ 215 (268) T cd06273 174 EQPPR-PTAVICGNDVLALGALYEARRLGLSVPEDLSIVGFDD 215 (268) T ss_pred HCCCC-CCEEEECCHHHHHHHHHHHHHCCCCCCCCEEEEEECC T ss_conf 47999-8489977879999999999963999999679999998 No 158 >COG3660 Predicted nucleoside-diphosphate-sugar epimerase [Cell envelope biogenesis, outer membrane] Probab=40.70 E-value=29 Score=15.83 Aligned_cols=35 Identities=17% Similarity=0.294 Sum_probs=24.6 Q ss_pred HHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEE Q ss_conf 99999942115889998386658999999998498899998 Q gi|254780936|r 103 AVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVS 143 (182) Q Consensus 103 aiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~ 143 (182) =+|||.-|+ +||+|.|| +-+..++-+-||-|.++. T Consensus 238 Y~~~La~Ad----yii~TaDS--inM~sEAasTgkPv~~~~ 272 (329) T COG3660 238 YIDMLAAAD----YIISTADS--INMCSEAASTGKPVFILE 272 (329) T ss_pred HHHHHHHCC----EEEEECCH--HHHHHHHHCCCCCEEEEE T ss_conf 688885213----37870430--124578760499759980 No 159 >cd06284 PBP1_LacI_like_6 Ligand-binding domain of an uncharacterized transcription regulator from Actinobacillus succinogenes and its close homologs from other bacteria. This group includes the ligand-binding domain of an uncharacterized transcription regulator from Actinobacillus succinogenes and its close homologs from other bacteria. This group belongs to the the LacI-GalR family repressors and are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding. Probab=40.53 E-value=29 Score=15.81 Aligned_cols=100 Identities=12% Similarity=0.024 Sum_probs=60.1 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHH---- Q ss_conf 9999851792999999730688731467775799988975595697534589537888654356532999999999---- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFE---- 108 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~---- 108 (182) +.+++.+.+.- ...|...+........+...|.++++..|..+...... .++.+..-+.+++. T Consensus 106 ~~~~L~~~G~~--~i~~i~~~~~~~~~~~R~~Gf~~a~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~l~ 172 (267) T cd06284 106 AVDHLISLGHR--RIALITGPRDNPLARDRLEGYRQALAEAGLPADEELIQ-----------EGDFSLESGYAAARRLLA 172 (267) T ss_pred HHHHHHHHCCC--EEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCCEEEE-----------CCCCCHHHHHHHHHHHHH T ss_conf 99999970997--69996288887259999999999999839985512773-----------278767778899999996 Q ss_pred HHCCCCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf 421158899983866589999999984988----99998315 Q gi|254780936|r 109 QSEGLEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 109 ~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) ..+.+ ++|++++|.=-.-+++.+++.|.+ |.++++.. T Consensus 173 ~~~~~-~ai~~~nD~~A~g~~~~l~~~g~~vP~di~vigfd~ 213 (267) T cd06284 173 LPDRP-TAIFCFSDEMAIGAISALKELGLRVPEDISVVGFDD 213 (267) T ss_pred CCCCC-CEEEECCCHHHHHHHHHHHHCCCCCCCEEEEEEECC T ss_conf 59999-802657868899999999983999996279998778 No 160 >PRK10892 D-arabinose 5-phosphate isomerase; Provisional Probab=40.40 E-value=29 Score=15.80 Aligned_cols=51 Identities=16% Similarity=0.236 Sum_probs=35.9 Q ss_pred CCEEEEECCCH---HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHHH Q ss_conf 58899983866---5899999999849889999831577654708999852111518997 Q gi|254780936|r 113 LEHLVIFSGDG---CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAYL 169 (182) Q Consensus 113 ~d~~iLvSGD~---Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~l 169 (182) =|+++++|.-| ..+.++..+|+.|.+++.++- +....|.+.||-.+.+... T Consensus 95 ~D~~i~~S~SG~t~El~~ll~~~~~~~~~iI~it~------~~~S~l~~~sd~~l~~~~~ 148 (326) T PRK10892 95 QDVVIAISNSGESSEILALIPVLKRLHVPLICITG------RPESSMARAADVHLCVKVP 148 (326) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHCCCCEEEEEC------CCCCHHHHHCCEEEECCCC T ss_conf 99899995898757787641787766985899956------9999668808878965787 No 161 >cd06323 PBP1_ribose_binding Periplasmic sugar-binding domain of the thermophilic Thermoanaerobacter tengcongensis ribose binding protein (ttRBP) and its mesophilic homologs. Periplasmic sugar-binding domain of the thermophilic Thermoanaerobacter tengcongensis ribose binding protein (ttRBP) and its mesophilic homologs. Members of this group are belonging to the type I periplasmic binding protein superfamily, whose members are involved in chemotaxis, ATP-binding cassette transport, and intercellular communication in central nervous system. The thermophilic and mesophilic ribose-binding proteins are structurally very similar, but differ substantially in thermal stability. Probab=40.26 E-value=29 Score=15.78 Aligned_cols=76 Identities=16% Similarity=0.117 Sum_probs=45.0 Q ss_pred CHHHHHHHHHHHHHHHHCC-CEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHH----HHCCCCEEEEECCCHHHHHHHH Q ss_conf 3146777579998897559-5697534589537888654356532999999999----4211588999838665899999 Q gi|254780936|r 56 PEQQFSPLHPLLDWLHYNG-FQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFE----QSEGLEHLVIFSGDGCFTTLVA 130 (182) Q Consensus 56 ~~~~~~~~~~~~~~l~~~g-~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~----~a~~~d~~iLvSGD~Df~pli~ 130 (182) ......+...|.+++...+ .++.... ..+.+..-+.++++ ..+.+ ++|++++|.--.-+++ T Consensus 133 ~~~~~~R~~gf~~a~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~lL~~~~~~-~aI~~~nD~~A~ga~~ 198 (268) T cd06323 133 ASAARERGKGFHEVVDKYPGLKVVASQ-------------PADFDRAKGLNVMENILQAHPDI-KGVFAQNDEMALGAIE 198 (268) T ss_pred CHHHHHHHHHHHHHHHHCCCCEEEEEE-------------CCCHHHHHHHHHHHHHHHCCCCC-CEEEECCCHHHHHHHH T ss_conf 858999999999999877995588772-------------24202799999999998629888-5899778399999999 Q ss_pred HHHHCCC-EEEEEEEE Q ss_conf 9998498-89999831 Q gi|254780936|r 131 ALQRKVK-KVTIVSTV 145 (182) Q Consensus 131 ~lr~~Gk-~V~v~~~~ 145 (182) .+++.|. .+.++++. T Consensus 199 al~~~g~~di~viGfD 214 (268) T cd06323 199 ALKAAGKDDVKVVGFD 214 (268) T ss_pred HHHHHCCCCEEEEEEC T ss_conf 9997099982999888 No 162 >cd06299 PBP1_LacI_like_13 Ligand-binding domain of DNA-binding regulatory protein from Corynebacterium glutamicum which has a unique ability to produce significant amounts of L-glutamate directly from cheap sugar and ammonia. This group includes the ligand-binding domain of DNA-binding regulatory protein from Corynebacterium glutamicum which has a unique ability to produce significant amounts of L-glutamate directly from cheap sugar and ammonia. This regulatory protein is a member of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial trans Probab=40.18 E-value=29 Score=15.78 Aligned_cols=100 Identities=9% Similarity=0.027 Sum_probs=58.3 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH--H Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999999994--2 Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ--S 110 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~--a 110 (182) ..+.+.+.+.-..++. +.+........+...|.++++..|..+...... . ++.+..-+..++.. . T Consensus 107 a~~~L~~~G~~~i~~i--~~~~~~~~~~~R~~gf~~a~~~~~~~~~~~~i~-~----------~~~~~~~~~~~~~~~l~ 173 (265) T cd06299 107 AVSLLVALGHKKIGYI--SGPQDTSTGRERLEAFRQACASLGLEVNEDLVV-L----------GGYSQESGYAGATKLLD 173 (265) T ss_pred HHHHHHHCCCCEEEEE--CCCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEE-E----------ECCCHHHHHHHHHHHHH T ss_conf 9999997499838860--489887569999999999999769999930599-8----------40355789999999996 Q ss_pred CCCCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf 1158899983866589999999984988----99998315 Q gi|254780936|r 111 EGLEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 111 ~~~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) ++. ++|+++.|.--..++..++++|.+ |-|+++.. T Consensus 174 ~~p-~ai~~~nD~~A~g~~~~l~~~g~~vP~dvsvvgfd~ 212 (265) T cd06299 174 QGA-TAIIAGDSMMTIGAIRAIHDAGLVIGEDISLIGFDD 212 (265) T ss_pred CCC-CEEECCCCHHHHHHHHHHHHHCCCCCCCEEEEEECC T ss_conf 399-854506858899999999983899999349999997 No 163 >cd01712 ThiI ThiI is required for thiazole synthesis in the thiamine biosynthesis pathway. It belongs to the Adenosine Nucleotide Hydrolysis suoerfamily and predicted to bind to Adenosine nucleotide. Probab=40.12 E-value=29 Score=15.77 Aligned_cols=16 Identities=6% Similarity=-0.081 Sum_probs=7.7 Q ss_pred HHCCEEEEEEEEECCC Q ss_conf 5179299999973068 Q gi|254780936|r 38 RSRAIVIRAYYYTTVV 53 (182) Q Consensus 38 ~~~~~l~~~~~y~~~~ 53 (182) .++.++.-.++..... T Consensus 21 krG~~V~~l~f~~~~~ 36 (177) T cd01712 21 KRGIEVDALHFNSGPF 36 (177) T ss_pred HCCCEEEEEEEECCCC T ss_conf 8799899999989998 No 164 >TIGR00750 lao LAO/AO transport system ATPase; InterPro: IPR005129 Bacterial periplasmic transport systems require the function of a specific substrate-binding protein, located in the periplasm, and several cytoplasmic membrane transport components. In Escherichia coli K-12, the arginine-ornithine transport system requires an arginine-ornithine-binding protein and the lysine-arginine-ornithine (LAO) transport system includes a LAO-binding protein. Both periplasmic proteins can be phosphorylated by a single kinase, ArgK resulting in reduced levels of transport activity of the periplasmic transport systems that include each of the binding proteins. The ArgK protein acts as an ATPase enzyme and as a kinase.. Probab=40.10 E-value=29 Score=15.77 Aligned_cols=25 Identities=16% Similarity=0.231 Sum_probs=11.5 Q ss_pred CCEEEEEC--CCHHHHHHHHH-HHHCCC Q ss_conf 58899983--86658999999-998498 Q gi|254780936|r 113 LEHLVIFS--GDGCFTTLVAA-LQRKVK 137 (182) Q Consensus 113 ~d~~iLvS--GD~Df~pli~~-lr~~Gk 137 (182) .||+|+|+ |=||=+..+++ +-|-|= T Consensus 158 aDT~v~v~~pg~GDd~Q~iKaG~mEiaD 185 (333) T TIGR00750 158 ADTFVVVTIPGTGDDVQGIKAGVMEIAD 185 (333) T ss_pred HCEEEEEECCCCCCHHHHHHHHHHEEEE T ss_conf 1505898548878346666544302324 No 165 >pfam03852 Vsr DNA mismatch endonuclease Vsr. Probab=40.07 E-value=15 Score=17.63 Aligned_cols=15 Identities=20% Similarity=0.122 Sum_probs=8.5 Q ss_pred HHHHHHHHCCCEEEE Q ss_conf 999889755956975 Q gi|254780936|r 65 PLLDWLHYNGFQVVA 79 (182) Q Consensus 65 ~~~~~l~~~g~~v~~ 79 (182) .....|.+.|+..+. T Consensus 26 ~vr~~L~~~G~ryr~ 40 (75) T pfam03852 26 ALRSLLFGLGLRYRK 40 (75) T ss_pred HHHHHHHHCCCEEEE T ss_conf 999999988988997 No 166 >PRK00654 glgA glycogen synthase; Provisional Probab=39.94 E-value=30 Score=15.75 Aligned_cols=66 Identities=18% Similarity=0.336 Sum_probs=39.7 Q ss_pred CCCCCHHHHHHHHHHHH--CCCCEEEEECCCHHHHHHHHHHHHC-CCEEEEEEEECCCCCCCHHHHHHHCCCCE Q ss_conf 35653299999999942--1158899983866589999999984-98899998315776547089998521115 Q gi|254780936|r 94 VKSSMDVELAVDAFEQS--EGLEHLVIFSGDGCFTTLVAALQRK-VKKVTIVSTVLSDPSMASDQLRRQADYFM 164 (182) Q Consensus 94 ~kk~~Dv~laiD~~~~a--~~~d~~iLvSGD~Df~pli~~lr~~-Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi 164 (182) .+||+| +.++++... .+...+|+-|||..+...++.+... .-+|.+.. ++...-|..+-.+||-|+ T Consensus 304 ~qKG~~--ll~~a~~~~~~~~~~~vi~G~G~~~~~~~l~~l~~~~~~~~~~~~---gf~e~l~~~iya~aD~~l 372 (476) T PRK00654 304 EQKGLD--LVLEALDEILEQGGQLVLLGTGDPELEEAFRALAARYPGRVGVQI---GYDEALAHRIYAGADFFL 372 (476) T ss_pred CCCCHH--HHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHCCCCEEEEE---CCCHHHHHHHHHHCCEEE T ss_conf 567889--999999999970998999945978999999999987798889995---788689899987288786 No 167 >cd06285 PBP1_LacI_like_7 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=39.87 E-value=30 Score=15.75 Aligned_cols=102 Identities=16% Similarity=0.025 Sum_probs=59.6 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH--- Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999999994--- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ--- 109 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~--- 109 (182) +.+++.+.+.-..+ +.+.+........+...|.+++...|..+....+ ...+.+..-...++.. T Consensus 105 ~~~~Li~~G~~~i~--~l~~~~~~~~~~~R~~Gf~~a~~~~~~~~~~~~i-----------~~~~~~~~~~~~~~~~~l~ 171 (265) T cd06285 105 ATRHLLDLGHRRIA--VLAGPDYASTARDRLAGFRAALAEAGIEVPPERI-----------VYSGFDIEGGEAAAEKLLR 171 (265) T ss_pred HHHHHHHHCCCCEE--ECCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHE-----------ECCCCCHHHHHHHHHHHHH T ss_conf 99999983886587--5478877705999999999999986999875660-----------1366215569999999985 Q ss_pred HCCCCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEECC Q ss_conf 21158899983866589999999984988----999983157 Q gi|254780936|r 110 SEGLEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVLS 147 (182) Q Consensus 110 a~~~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~~ 147 (182) .+.--++|+++.|.=-.-++..+++.|.+ +.+++|... T Consensus 172 ~~~~~~ai~~~nD~~A~g~~~~l~~~g~~iP~di~iigfd~~ 213 (265) T cd06285 172 SDSPPTAIFAVNDFAAIGVMGAARDRGLRVPDDVALVGYNDI 213 (265) T ss_pred CCCCCCCEECCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCH T ss_conf 699988100378899999999999859999998799998782 No 168 >cd06293 PBP1_LacI_like_11 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=39.74 E-value=30 Score=15.73 Aligned_cols=100 Identities=9% Similarity=0.009 Sum_probs=58.4 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHH----HHHHH Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999----99999 Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELA----VDAFE 108 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~la----iD~~~ 108 (182) +.+++.+.+.=..+ |.+.+........+...|.++++..|......... .. +.+...+ -+++. T Consensus 107 ~~~~L~~~G~~~I~--~i~~~~~~~~~~~R~~G~~~a~~~~~~~~~~~~~~-~~----------~~~~~~~~~~~~~~l~ 173 (269) T cd06293 107 ATRHLARAGHRRIA--FVGGPDALISARERYAGYREALAEAHIPEVPEYVC-FG----------DYTREFGRAAAAQLLA 173 (269) T ss_pred HHHHHHHHCCCEEE--EECCCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEE-EC----------CCCHHHHHHHHHHHHH T ss_conf 99999982998389--95488776339999999999999769999934899-55----------6311359999999984 Q ss_pred HHCCCCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf 421158899983866589999999984988----99998315 Q gi|254780936|r 109 QSEGLEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 109 ~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) ..+. -++|+++.|.=-..+++.+++.|.+ +-|+++.. T Consensus 174 ~~~~-~~av~~~nD~~A~g~l~~l~~~gi~vP~disiigfd~ 214 (269) T cd06293 174 RGDP-PTAIFAASDEIAIGLLEVLRERGLSIPGDMSLVGFDD 214 (269) T ss_pred CCCC-CCEEEECCHHHHHHHHHHHHHCCCCCCCCEEEEEECC T ss_conf 7999-8589968859999999999984999999659998888 No 169 >PTZ00175 diphthine synthase; Provisional Probab=39.72 E-value=30 Score=15.73 Aligned_cols=39 Identities=18% Similarity=0.279 Sum_probs=32.8 Q ss_pred HHHHHHCCCCEEEEECCCH----HHHHHHHHHHHCCCEEEEEE Q ss_conf 9999421158899983866----58999999998498899998 Q gi|254780936|r 105 DAFEQSEGLEHLVIFSGDG----CFTTLVAALQRKVKKVTIVS 143 (182) Q Consensus 105 D~~~~a~~~d~~iLvSGD~----Df~pli~~lr~~Gk~V~v~~ 143 (182) .+++.|.+-+++.|+.||- =.+.++.+++++|..|.|+. T Consensus 68 ~il~~A~~~~ValLvvGDP~~ATTH~dL~lrA~e~gI~~~VIh 110 (272) T PTZ00175 68 KILEEAKDKNVALLVVGDPFCATTHTDLVLRAKEQGIDVEVIH 110 (272) T ss_pred HHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHCCCCEEEEC T ss_conf 9998632298899975886663537999999998799889978 No 170 >COG0258 Exo 5'-3' exonuclease (including N-terminal domain of PolI) [DNA replication, recombination, and repair] Probab=39.46 E-value=30 Score=15.71 Aligned_cols=112 Identities=18% Similarity=0.191 Sum_probs=55.2 Q ss_pred EEEEEEEHHHHHHHHH----H-----CCCCCC----HHH-HHHHHHHCCEEEEEEEEECCCCCCHHH------------- Q ss_conf 1899980789999998----5-----087679----899-999985179299999973068873146------------- Q gi|254780936|r 7 KIALFIDGANLYASSK----A-----LGFDID----YRK-LLKAFRSRAIVIRAYYYTTVVGDPEQQ------------- 59 (182) Q Consensus 7 rvaIfID~~Nl~~~~~----~-----~~~~~d----~~~-L~~~l~~~~~l~~~~~y~~~~~~~~~~------------- 59 (182) .--++|||.++-|..- . .|.... +.. |.+.+.......-++.|.+.++..... T Consensus 11 ~~l~~IDg~~~lyr~~~a~~~~~~~~~g~~~~~~~~~~~~l~~~~~~~~~~~~~~vFD~~~~tfR~~~~~~yK~~R~~~~ 90 (310) T COG0258 11 GKLLLIDGSSLLYRALHALPQPLGNPLGDPTGAVSGFLGMLYRLIRLLEPTHPVVVFDGKPPTFRHELLEEYKANREKEM 90 (310) T ss_pred CCEEEEECCHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHHHHHHHHHCCCCCC T ss_conf 85699968578999997033012687778444999999999999985678617999769996227888999984689558 Q ss_pred ---H-HHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH Q ss_conf ---7-77579998897559569753458953788865435653299999999942115889998386658999999 Q gi|254780936|r 60 ---F-SPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAA 131 (182) Q Consensus 60 ---~-~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~ 131 (182) . .......+.+...|+.+...+- .+ +|--++..+-.....-....++|||.|+..++.. T Consensus 91 p~~l~~q~~~i~~~~~~~~~~~l~~~G-----------~e--add~i~t~A~~a~~~g~~~~I~S~DkD~lql~~~ 153 (310) T COG0258 91 PDELAPQIPILTELLVALGIPLLELMG-----------IE--ADDPIETLAQKAYKKGDVVLIISGDKDLLQLVSP 153 (310) T ss_pred HHHHHHHHHHHHHHHHHHCCHHHHCCC-----------CC--CCCCCCHHHHHHHHCCCCEEEEECCCCEEEECCC T ss_conf 899999889999999983825532789-----------68--3144408999988649957999789670011489 No 171 >PRK10444 UMP phosphatase; Provisional Probab=39.44 E-value=29 Score=15.79 Aligned_cols=26 Identities=12% Similarity=0.117 Sum_probs=11.5 Q ss_pred EEEECCCHHH----HHHHHHHHHCCCEEEE Q ss_conf 9998386658----9999999984988999 Q gi|254780936|r 116 LVIFSGDGCF----TTLVAALQRKVKKVTI 141 (182) Q Consensus 116 ~iLvSGD~Df----~pli~~lr~~Gk~V~v 141 (182) +|++..|..| ...+..+..+|.+.++ T Consensus 111 ~Vvvg~~~~~~~~~l~~a~~~~~~g~~~i~ 140 (248) T PRK10444 111 FVIVGETRSYNWDMMHKAAYFVANGARFIA 140 (248) T ss_pred EEEEECCCCCCHHHHHHHHHHHHCCCEEEE T ss_conf 899955576799999999999988990999 No 172 >cd00221 Vsr Very Short Patch Repair (Vsr) Endonuclease. Endonucleases in DNA repair that recognize damaged DNA and cleave the phosphodiester backbone. Vsr endonucleases have a common endonuclease topology that has been tailored for recognition of TG mismatches. Probab=39.33 E-value=15 Score=17.58 Aligned_cols=13 Identities=38% Similarity=0.856 Sum_probs=6.7 Q ss_pred CCCEEEEEEEHHH Q ss_conf 8871899980789 Q gi|254780936|r 4 PREKIALFIDGAN 16 (182) Q Consensus 4 ~~~rvaIfID~~N 16 (182) |..|+||||||-. T Consensus 51 ~~~k~aIFvdGCF 63 (115) T cd00221 51 PGYRLAIFVDGCF 63 (115) T ss_pred CCCCEEEEECCEE T ss_conf 5762899932404 No 173 >COG1879 RbsB ABC-type sugar transport system, periplasmic component [Carbohydrate transport and metabolism] Probab=39.28 E-value=30 Score=15.69 Aligned_cols=74 Identities=19% Similarity=0.152 Sum_probs=39.8 Q ss_pred HHHHHHHHHHHHHHCCCE--EEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH--CCCC-EEEEECCCHHHHHHHHHHH Q ss_conf 677757999889755956--9753458953788865435653299999999942--1158-8999838665899999999 Q gi|254780936|r 59 QFSPLHPLLDWLHYNGFQ--VVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS--EGLE-HLVIFSGDGCFTTLVAALQ 133 (182) Q Consensus 59 ~~~~~~~~~~~l~~~g~~--v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a--~~~d-~~iLvSGD~Df~pli~~lr 133 (182) ...+...|.+.+...+.. +.. ....++|..-+..+++.. .+-| ..|+..||..-.-++..++ T Consensus 173 ~~~R~~G~~~~l~~~~~~~~v~~-------------~~~~~~~~~~a~~~~~~~L~~~pdi~~i~~~~d~~a~ga~~a~~ 239 (322) T COG1879 173 AEERVKGFRDALKEHPPDIEVVD-------------VQTGDWDRDKALEVMEDLLAANPDIDGIYAANDGMALGAIQALK 239 (322) T ss_pred HHHHHHHHHHHHHHCCCCEEEEE-------------ECCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHH T ss_conf 99999999999975599559975-------------43688768899999999997589944999899706999999999 Q ss_pred HCCCE--EEEEEEE Q ss_conf 84988--9999831 Q gi|254780936|r 134 RKVKK--VTIVSTV 145 (182) Q Consensus 134 ~~Gk~--V~v~~~~ 145 (182) +.|+. |.++++. T Consensus 240 ~ag~~~~v~v~g~D 253 (322) T COG1879 240 AAGRKGDVVVVGFD 253 (322) T ss_pred HCCCCCCEEEEEEC T ss_conf 76998806999947 No 174 >cd06324 PBP1_ABC_sugar_binding_like_13 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. This group includes the periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=38.97 E-value=31 Score=15.66 Aligned_cols=78 Identities=13% Similarity=-0.001 Sum_probs=50.2 Q ss_pred CHHHHHHHHHHHHHHHHCC-CEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH----CCCCEEEEECCCHHHHHHHH Q ss_conf 3146777579998897559-569753458953788865435653299999999942----11588999838665899999 Q gi|254780936|r 56 PEQQFSPLHPLLDWLHYNG-FQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS----EGLEHLVIFSGDGCFTTLVA 130 (182) Q Consensus 56 ~~~~~~~~~~~~~~l~~~g-~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a----~~~d~~iLvSGD~Df~pli~ 130 (182) ......+...|.+.|...+ +++... .-.++|..-+.++++.. +.+| .|+..+|.==.-+++ T Consensus 154 ~~~~~~R~~Gf~~~l~~~p~~~vv~~-------------~~~~~~~~~a~~~~~~lL~~~Pdi~-aI~~~ND~mAlGAv~ 219 (305) T cd06324 154 TPAAILREAGLRRALAEHPDVRLRQV-------------VYAGWSEDEAYEQAENLLKRYPDVR-LIWAANDQMAFGALR 219 (305) T ss_pred CHHHHHHHHHHHHHHHHCCCCEEEEE-------------ECCCCCHHHHHHHHHHHHHHCCCCC-EEEECCCHHHHHHHH T ss_conf 86899999899999986899589888-------------6189878999999999997399998-899788578999999 Q ss_pred HHHHCCC----EEEEEEEECC Q ss_conf 9998498----8999983157 Q gi|254780936|r 131 ALQRKVK----KVTIVSTVLS 147 (182) Q Consensus 131 ~lr~~Gk----~V~v~~~~~~ 147 (182) .+++.|+ .|.|+++... T Consensus 220 Al~~aG~~~~~dv~IvG~Dg~ 240 (305) T cd06324 220 AAKEAGRKPGRDVLFGGVNWS 240 (305) T ss_pred HHHHCCCCCCCCEEEEEECCC T ss_conf 999749987998899970698 No 175 >PRK10727 DNA-binding transcriptional regulator GalR; Provisional Probab=38.94 E-value=31 Score=15.66 Aligned_cols=105 Identities=10% Similarity=0.066 Sum_probs=57.3 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999999994211 Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEG 112 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~ 112 (182) ..+.|.+.|.=..++.. .+........+...|.+++++.|..+....+. ..+.. .+.+ ...+.+++...+. T Consensus 167 a~~~L~~~Ghr~Ia~i~--~~~~~~~~~~R~~g~~~al~~~gl~~~~~~i~-~~~~~----~~~~--~~~~~~ll~~~~~ 237 (342) T PRK10727 167 ATRHLIQQGHTRIGYLC--SNHSISDAEDRLQGYYDALAESGIPANDRLVT-FGEPD----ESGG--EQAMTELLGRGRN 237 (342) T ss_pred HHHHHHHCCCCEEEEEE--CCCCCCHHHHHHHHHHHHHHHCCCCCCCCEEE-ECCCC----HHHH--HHHHHHHHHCCCC T ss_conf 99999974997599996--89888359999999999999769999820588-15787----7889--9999999855999 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCCEE----EEEEEECC Q ss_conf 588999838665899999999849889----99983157 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVKKV----TIVSTVLS 147 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk~V----~v~~~~~~ 147 (182) . ++|++++|.==..+++.++++|.+| -|++|... T Consensus 238 ~-tAi~~~nD~~A~g~l~al~~~Gl~VP~DisViGfDd~ 275 (342) T PRK10727 238 F-TAVACYNDSMAAGAMGVLNDNGIDVPGEISLIGFDDV 275 (342) T ss_pred C-CCCEECCHHHHHHHHHHHHHCCCCCCCCEEEEEECCH T ss_conf 7-7415546999999999999829999998699988971 No 176 >TIGR01530 nadN NAD nucleotidase; InterPro: IPR006420 This family describes NadN of Haemophilus influenzae and a small number of close homologs in pathogenic, Gram-negative bacteria. NadN is a periplasmic enzyme that cleaves NAD (nicotinamide adenine dinucleotide) to NMN (nicotinamide mononucleotide) and AMP. The NMN must be converted by a 5-nucleotidase to nicotinamide riboside for import. NadN belongs a large family of 5-nucleotidases and has NMN 5-nucleotidase activity for NMN, AMP, etc.; GO: 0008253 5'-nucleotidase activity. Probab=38.87 E-value=8 Score=19.31 Aligned_cols=65 Identities=17% Similarity=0.147 Sum_probs=42.8 Q ss_pred HHHHHHHHHHHHHCCCE-EEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCC-CEEEEECCCHHHHHHHHHHHHCCC Q ss_conf 77757999889755956-9753458953788865435653299999999942115-889998386658999999998498 Q gi|254780936|r 60 FSPLHPLLDWLHYNGFQ-VVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGL-EHLVIFSGDGCFTTLVAALQRKVK 137 (182) Q Consensus 60 ~~~~~~~~~~l~~~g~~-v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~-d~~iLvSGD~Df~pli~~lr~~Gk 137 (182) -...+....+|+.+|+. +++-. +...||+ .+.|+++ |+=||||||+.|.-==+.|++++. T Consensus 168 I~~~~~~~~~l~QQG~NKIILLS---------HAG~EKN---------~EIAQ~~~~IDVIV~GD~HYL~G~~~L~~L~L 229 (550) T TIGR01530 168 IEAVRIAAVELKQQGVNKIILLS---------HAGFEKN---------IEIAQKVSDIDVIVSGDSHYLLGDDELRRLKL 229 (550) T ss_pred HHHHHHHHHHHHHCCCCEEEEEE---------CCCCCCC---------CHHHCCCCCEEEEEECCEEEECCCCCCCCCCC T ss_conf 89999999987637843466531---------1786224---------02001426600798577024525533121678 Q ss_pred EEEEE Q ss_conf 89999 Q gi|254780936|r 138 KVTIV 142 (182) Q Consensus 138 ~V~v~ 142 (182) -|+.- T Consensus 230 PV~~E 234 (550) T TIGR01530 230 PVVYE 234 (550) T ss_pred CEEEE T ss_conf 62432 No 177 >cd06288 PBP1_sucrose_transcription_regulator Ligand-binding domain of DNA-binding regulatory proteins specific to sucrose that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of DNA-binding regulatory proteins specific to sucrose that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=38.70 E-value=31 Score=15.63 Aligned_cols=101 Identities=9% Similarity=-0.030 Sum_probs=59.5 Q ss_pred HHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHH----HHH Q ss_conf 9999985179299999973068873146777579998897559569753458953788865435653299999----999 Q gi|254780936|r 32 KLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAV----DAF 107 (182) Q Consensus 32 ~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~lai----D~~ 107 (182) .+.+.+.+.|.-..++ .+.+........+...|.+++...|..+...... ....+..-+. +++ T Consensus 106 ~a~~~L~~~G~~~i~~--i~~~~~~~~~~~R~~gf~~a~~~~~i~~~~~~i~-----------~~~~~~~~~~~~~~~ll 172 (269) T cd06288 106 DATRHLLAAGHRRIAF--INGEPWMLAAKDRLKGYRQALAEAGIPFDPDLVV-----------HGDWSADDGYEAAAALL 172 (269) T ss_pred HHHHHHHHHCCCCEEE--EECCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEE-----------CCCCCHHHHHHHHHHHH T ss_conf 9999999848875578--7489887579999999999999769986642241-----------26776677777779998 Q ss_pred HHHCCCCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf 9421158899983866589999999984988----99998315 Q gi|254780936|r 108 EQSEGLEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 108 ~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) ...+.. ++|++++|.--..++..++++|.+ +-+++|.. T Consensus 173 ~~~~~~-dai~~~~d~~A~g~~~~l~~~gi~vP~di~iigfD~ 214 (269) T cd06288 173 DLDDRP-TAIFCGNDRMAMGAYQALLERGLRIPQDVSVVGFDN 214 (269) T ss_pred HCCCCC-CEECCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECC T ss_conf 558998-610147789999999999985999999889999888 No 178 >cd03791 GT1_Glycogen_synthase_DULL1_like This family is most closely related to the GT1 family of glycosyltransferases. Glycogen synthase catalyzes the formation and elongation of the alpha-1,4-glucose backbone using ADP-glucose, the second and key step of glycogen biosynthesis. This family includes starch synthases of plants, such as DULL1 in Zea mays and glycogen synthases of various organisms. Probab=38.64 E-value=31 Score=15.63 Aligned_cols=67 Identities=18% Similarity=0.313 Sum_probs=40.9 Q ss_pred CCCCCCHHHHHHHHHHHH--CCCCEEEEECCCHHHHHHHHHHHHC-CCEEEEEEEECCCCCCCHHHHHHHCCCCE Q ss_conf 435653299999999942--1158899983866589999999984-98899998315776547089998521115 Q gi|254780936|r 93 RVKSSMDVELAVDAFEQS--EGLEHLVIFSGDGCFTTLVAALQRK-VKKVTIVSTVLSDPSMASDQLRRQADYFM 164 (182) Q Consensus 93 ~~kk~~Dv~laiD~~~~a--~~~d~~iLvSGD~Df~pli~~lr~~-Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi 164 (182) ..+||+| +.++++... .....+|+-|||..+...+..+... +-++.+.. ++....+..|-.+||-|+ T Consensus 306 ~~qKG~d--ll~~a~~~~~~~~~~~vi~G~G~~~~e~~~~~l~~~~~~~~~~~~---~~~e~l~~~lya~aD~~l 375 (476) T cd03791 306 TEQKGID--LLLEALPELLELGGQLVILGSGDPEYEEALRELAARYPGRVAVLI---GYDEALAHLIYAGADFFL 375 (476) T ss_pred CCCCCHH--HHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHCCCCEEEEE---ECCHHHHHHHHHHCCEEE T ss_conf 2024899--999999999963988999946977899999999976899599998---068788999998499974 No 179 >TIGR03407 urea_ABC_UrtA urea ABC transporter, urea binding protein. Members of this protein family are ABC transporter substrate-binding proteins associated with urea transport and metabolism. This protein is found in a conserved five-gene transport operon typically found adjacent to urease genes. It was shown in Cyanobacteria that disruption leads to the loss of high-affinity urea transport activity. Members of this protein family tend to have the twin-arginine signal for Sec-independent transport across the plasma membrane. Probab=38.36 E-value=31 Score=15.60 Aligned_cols=66 Identities=9% Similarity=0.006 Sum_probs=38.7 Q ss_pred HHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCE Q ss_conf 75799988975595697534589537888654356532999999999421158899983866589999999984988 Q gi|254780936|r 62 PLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKK 138 (182) Q Consensus 62 ~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~ 138 (182) ....+...+++.|.++...... ..+..|..-.+.-+. +.+.|.+++...-.|.+.+++.+++.|.. T Consensus 150 ~~~~~~~~~~~~G~~vv~~~~~----------~~g~~Dfs~~l~kik-~a~pD~v~~~~~~~~~~~~~kq~~~~G~~ 215 (359) T TIGR03407 150 ANKIIKAYLKSLGGTVVGEDYT----------PLGHTDFQTIINKIK-AFKPDVVFNTLNGDSNVAFFKQLKNAGIT 215 (359) T ss_pred HHHHHHHHHHHCCCCEEEEEEC----------CCCCCCHHHHHHHHH-HCCCCEEEEECCCCCHHHHHHHHHHHCCC T ss_conf 9999999999759973778825----------798755899999999-76999999937462057999999982899 No 180 >PRK01710 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=38.35 E-value=31 Score=15.60 Aligned_cols=48 Identities=15% Similarity=0.162 Sum_probs=30.8 Q ss_pred CCCHHHHHHHHHHHHCCCCEEEEECCCH---HHHHHHHHHHHCCCEEEEEEE Q ss_conf 6532999999999421158899983866---589999999984988999983 Q gi|254780936|r 96 SSMDVELAVDAFEQSEGLEHLVIFSGDG---CFTTLVAALQRKVKKVTIVST 144 (182) Q Consensus 96 k~~Dv~laiD~~~~a~~~d~~iLvSGD~---Df~pli~~lr~~Gk~V~v~~~ 144 (182) |++-+.-++-+++.-++ ..++|+-|.. ||.|+...+.++-+.|++++. T Consensus 334 kaTn~~s~~~AL~~~~~-~iilI~GG~~K~~d~~~l~~~~~~~~k~vi~~G~ 384 (458) T PRK01710 334 IASSPTRTLAGLNAFEK-PVILIAGGYDKKIPFEPLAEEGYEKIKILILMGD 384 (458) T ss_pred CCCCHHHHHHHHHCCCC-CEEEEECCCCCCCCHHHHHHHHHHHCCEEEEECC T ss_conf 66898999999971677-7299964656678869999987630529999788 No 181 >PRK10535 macrolide transporter ATP-binding /permease protein; Provisional Probab=37.41 E-value=32 Score=15.51 Aligned_cols=38 Identities=18% Similarity=0.274 Sum_probs=26.3 Q ss_pred HHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHH Q ss_conf 6589999999984988999983157765470899985211151899 Q gi|254780936|r 123 GCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAY 168 (182) Q Consensus 123 ~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~ 168 (182) .....+++.|++.|+.|++++ + ..++.+.||+.+.+.| T Consensus 181 ~~v~~ll~~l~~~G~tii~vt----H----d~~~a~~a~Rii~~~d 218 (648) T PRK10535 181 EEVMAILHQLRDRGHTVIIVT----H----DPQVAAQAERVIEIRD 218 (648) T ss_pred HHHHHHHHHHHHCCCEEEEEC----C----CHHHHHHCCEEEEEEC T ss_conf 999999999997799999976----4----8699964997999958 No 182 >cd06296 PBP1_CatR_like Ligand-binding domain of a LacI-like transcriptional regulator, CatR which is involved in catechol degradation. This group includes the ligand-binding domain of a LacI-like transcriptional regulator, CatR which is involved in catechol degradation. This group belongs to the the LacI-GalR family repressors that are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=37.27 E-value=33 Score=15.49 Aligned_cols=100 Identities=12% Similarity=-0.032 Sum_probs=59.7 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHH---- Q ss_conf 9999851792999999730688731467775799988975595697534589537888654356532999999999---- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFE---- 108 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~---- 108 (182) ..+++.+.+.-.. .|.+.+........+...|.+++...|..+..... ..+..+..-+..++. T Consensus 108 a~~~L~~~G~~~I--~~i~~~~~~~~~~~R~~Gf~~a~~~~~~~~~~~~i-----------~~~~~~~~~~~~~~~~ll~ 174 (270) T cd06296 108 ATEHLLELGHRRI--GFITGPPDLLCSRARLDGYRAALAEAGIPVDPALV-----------REGDFSTESGFRAAAELLA 174 (270) T ss_pred HHHHHHHHCCCCE--EEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHE-----------EECCCHHHHHHHHHHHHHH T ss_conf 9999998589728--88459988655999999999999986999885553-----------3065017899999999995 Q ss_pred HHCCCCEEEEECCCHHHHHHHHHHHHCCCEE----EEEEEEC Q ss_conf 4211588999838665899999999849889----9998315 Q gi|254780936|r 109 QSEGLEHLVIFSGDGCFTTLVAALQRKVKKV----TIVSTVL 146 (182) Q Consensus 109 ~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V----~v~~~~~ 146 (182) ..+.. ++|+++.|.--..++..++++|.+| -++++.. T Consensus 175 ~~~~~-~aii~~~D~~A~g~~~~l~~~gi~vP~di~iigfd~ 215 (270) T cd06296 175 LPERP-TAIFAGNDLMALGVYEAARERGLRIPEDLSVVGFDD 215 (270) T ss_pred CCCCC-CCEEECCCHHHHHHHHHHHHCCCCCCCCCEEEEECC T ss_conf 49988-620347868999999999986999999887999788 No 183 >COG4634 Uncharacterized protein conserved in bacteria [Function unknown] Probab=37.16 E-value=33 Score=15.48 Aligned_cols=64 Identities=23% Similarity=0.386 Sum_probs=34.9 Q ss_pred HHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEE Q ss_conf 999889755956975345895378886543565329999999994211588999838665899999999849889999 Q gi|254780936|r 65 PLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIV 142 (182) Q Consensus 65 ~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~ 142 (182) .+.++|...|++-.+- .+. +-++..|++| +..|. -+..||+|-|+||..+...+.+--+-+++- T Consensus 12 rlad~l~~~g~e~~h~-----r~l----g~~da~D~EI----~a~A~-~~~~iivTkDsDF~~la~~~G~Ppki~wLr 75 (113) T COG4634 12 RLADWLPMAGIEAVHW-----RDL----GLRDATDIEI----WAYAR-RNNRIIVTKDSDFADLALTLGSPPKIVWLR 75 (113) T ss_pred HHHHHHHHCCCCEEEE-----CCC----CCCCCCCHHH----HHHHH-HCCCEEEECCCCHHHHHHHCCCCCEEEEEE T ss_conf 8999865435510345-----145----7678855999----99987-569189971732899999708998589998 No 184 >COG1435 Tdk Thymidine kinase [Nucleotide transport and metabolism] Probab=36.86 E-value=33 Score=15.45 Aligned_cols=58 Identities=17% Similarity=0.048 Sum_probs=43.5 Q ss_pred CHHHHHHHHHHHHC-CCEEEEEEEECCCCC---CCHHHHHHHCCCCEEHHHHHHHHCCCCCC Q ss_conf 66589999999984-988999983157765---47089998521115189978762258311 Q gi|254780936|r 122 DGCFTTLVAALQRK-VKKVTIVSTVLSDPS---MASDQLRRQADYFMDLAYLKNEIARDPDE 179 (182) Q Consensus 122 D~Df~pli~~lr~~-Gk~V~v~~~~~~~~~---~~S~~L~~~ad~fi~l~~l~~~i~r~~~~ 179 (182) |.+.+..+.+|-.. |..|++.+-...+++ .+|..|-..||+...|.-+=...++.+.. T Consensus 95 ~~~~v~~l~~lad~lgi~Vi~~GL~~DFrgepFe~s~~Lla~ADkv~kL~aIC~~Cg~~At~ 156 (201) T COG1435 95 DEELVYVLNELADRLGIPVICYGLDTDFRGEPFEGSKYLLAIADKVTKLKAICNVCGRKATR 156 (201) T ss_pred CHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEE T ss_conf 97899999999861498899954465313287844899999987899988889875881069 No 185 >TIGR01317 GOGAT_sm_gam glutamate synthases, NADH/NADPH, small subunit; InterPro: IPR006005 One pathway for the assimilation of ammonia and glutamate biosynthesis involves glutamate synthase (1.4.1.13 from EC) which transfers the amide group of glutamine to 2-oxoglutarate to yield two molecules of glutamate.2 L-glutamate + NADP^+ = L-glutamine + 2-oxoglutarate + NADPH + H^+. This describes a family in several archaeal and deeply branched bacterial lineages of a homotetrameric form of the NADPH-dependent or NADH-dependent glutamate synthase (1.4.1.13 from EC and 1.4.1.14 from EC respectively) small subunit. There is no corresponding large subunit. ; GO: 0016639 oxidoreductase activity acting on the CH-NH2 group of donors NAD or NADP as acceptor, 0006537 glutamate biosynthetic process. Probab=36.57 E-value=13 Score=17.90 Aligned_cols=48 Identities=19% Similarity=0.195 Sum_probs=37.0 Q ss_pred CHHHHHHH--HHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHHH Q ss_conf 66589999--9999849889999831577654708999852111518997 Q gi|254780936|r 122 DGCFTTLV--AALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAYL 169 (182) Q Consensus 122 D~Df~pli--~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~l 169 (182) |+||-.-. -.++.+||+|+|++=..+...+...++|+-|-..+.++-+ T Consensus 292 ~~~~k~~~GqP~I~akGK~VvvIGGGDTG~DCvGTs~RhGA~sV~qFE~m 341 (517) T TIGR01317 292 DDDFKDKDGQPFIKAKGKKVVVIGGGDTGADCVGTSLRHGAASVHQFEIM 341 (517) T ss_pred CCCCCCCCCCCCEEECCCEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCC T ss_conf 88536667887354228678997578756224563235543552302568 No 186 >KOG1461 consensus Probab=36.16 E-value=34 Score=15.38 Aligned_cols=41 Identities=20% Similarity=0.342 Sum_probs=27.9 Q ss_pred HHHHHHHCC----CCEEEEECCCH----HHHHHHHHHHHCCC--EEEEEEE Q ss_conf 999994211----58899983866----58999999998498--8999983 Q gi|254780936|r 104 VDAFEQSEG----LEHLVIFSGDG----CFTTLVAALQRKVK--KVTIVST 144 (182) Q Consensus 104 iD~~~~a~~----~d~~iLvSGD~----Df~pli~~lr~~Gk--~V~v~~~ 144 (182) =|+|++..+ -..|||||||- +|.+++++-|.+.| +..+.+. T Consensus 113 GDamR~id~k~litgDFiLVsgd~vsN~pl~~~l~eHr~r~k~Dk~~iMTm 163 (673) T KOG1461 113 GDAMRDIDEKQLITGDFILVSGDTVSNMPLRNVLEEHRKRRKEDKDAIMTM 163 (673) T ss_pred HHHHHHHHHCCEEECCEEEEECCEEECCCHHHHHHHHHHHHHHCCCCEEEE T ss_conf 889888774260106469983773424756999999998765172435789 No 187 >cd02008 TPP_IOR_alpha Thiamine pyrophosphate (TPP) family, IOR-alpha subfamily, TPP-binding module; composed of proteins similar to indolepyruvate ferredoxin oxidoreductase (IOR) alpha subunit. IOR catalyzes the oxidative decarboxylation of arylpyruvates, such as indolepyruvate or phenylpyruvate, which are generated by the transamination of aromatic amino acids, to the corresponding aryl acetyl-CoA. Probab=36.08 E-value=34 Score=15.37 Aligned_cols=39 Identities=15% Similarity=0.087 Sum_probs=22.0 Q ss_pred HCCCCEEEEECCCHHHH----HHHHHHHHCCCEEEEEEEECCC Q ss_conf 21158899983866589----9999999849889999831577 Q gi|254780936|r 110 SEGLEHLVIFSGDGCFT----TLVAALQRKVKKVTIVSTVLSD 148 (182) Q Consensus 110 a~~~d~~iLvSGD~Df~----pli~~lr~~Gk~V~v~~~~~~~ 148 (182) +..=..+|-++||+||. .-+..+-+++..+.++....+. T Consensus 66 a~p~~~Vv~~~GDG~~~~~G~~~l~~av~~~~~it~iv~nN~~ 108 (178) T cd02008 66 ASEDKKVVAVIGDSTFFHSGILGLINAVYNKANITVVILDNRT 108 (178) T ss_pred HCCCCCEEEEECCHHHHHHHHHHHHHHHHHCCCEEEEEEECCC T ss_conf 7899837999576277663199999999829795999994853 No 188 >PRK12480 D-lactate dehydrogenase; Provisional Probab=35.74 E-value=34 Score=15.34 Aligned_cols=17 Identities=6% Similarity=-0.107 Sum_probs=10.3 Q ss_pred HHHHHHHHHCCCEEEEE Q ss_conf 99999999849889999 Q gi|254780936|r 126 TTLVAALQRKVKKVTIV 142 (182) Q Consensus 126 ~pli~~lr~~Gk~V~v~ 142 (182) -.++++++..|.+|..+ T Consensus 159 ~~va~~~~~fg~~Vi~y 175 (330) T PRK12480 159 AATAKIYAGFGATITAY 175 (330) T ss_pred HHHHHHHHHCCCEEEEE T ss_conf 99999987569989998 No 189 >pfam03447 NAD_binding_3 Homoserine dehydrogenase, NAD binding domain. This domain adopts a Rossmann NAD binding fold. The C-terminal domain of homoserine dehydrogenase contributes a single helix to this structural domain, which is not included in the Pfam model. Probab=35.39 E-value=35 Score=15.31 Aligned_cols=42 Identities=12% Similarity=0.069 Sum_probs=32.2 Q ss_pred HHHHHHHHHH--CCCCEEEEECCCHHHHHHHHHHHHCCCEEEEE Q ss_conf 9999999942--11588999838665899999999849889999 Q gi|254780936|r 101 ELAVDAFEQS--EGLEHLVIFSGDGCFTTLVAALQRKVKKVTIV 142 (182) Q Consensus 101 ~laiD~~~~a--~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~ 142 (182) .+..|+-++. +.+|.+|-++|..-....+..+-+.||.|+.. T Consensus 45 ~~~~d~~~ll~~~~iDvVVE~~g~~~~~~~~~~aL~~GkhVVTa 88 (116) T pfam03447 45 PVTLDLDDLVADPRPDVVVECASSEAVAEYVLKALKAGKHVVTA 88 (116) T ss_pred EEECCHHHHHCCCCCCEEEECCCCHHHHHHHHHHHHCCCEEEEE T ss_conf 15779999961889989998899489999999999879989990 No 190 >cd06407 PB1_NLP A PB1 domain is present in NIN like proteins (NLP), a key enzyme in a process of establishment of symbiosis betweeen legumes and nitrogen fixing bacteria (Rhizobium). The PB1 domain is a modular domain mediating specific protein-protein interaction which play a role in many critical cell processes like osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-inte Probab=35.31 E-value=35 Score=15.30 Aligned_cols=29 Identities=7% Similarity=0.095 Sum_probs=24.7 Q ss_pred EEEEECCCHHHHHHHHHHHHCCCEEEEEE Q ss_conf 89998386658999999998498899998 Q gi|254780936|r 115 HLVIFSGDGCFTTLVAALQRKVKKVTIVS 143 (182) Q Consensus 115 ~~iLvSGD~Df~pli~~lr~~Gk~V~v~~ 143 (182) .-|++++|+|+-+.+...+.-|.+++=+. T Consensus 51 EWvlLtcDaDL~EC~dv~~~s~~~tikl~ 79 (82) T cd06407 51 EWVLLTCDADLEECIDVYRSSGSHTIRLL 79 (82) T ss_pred CEEEEECCCCHHHHHHHHHHCCCCEEEEE T ss_conf 88999713469999999864788779999 No 191 >COG1569 Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only] Probab=35.17 E-value=13 Score=17.91 Aligned_cols=25 Identities=24% Similarity=0.361 Sum_probs=18.8 Q ss_pred EEEEECCCHHHHHHHHHHHHCCCEEEEEE Q ss_conf 89998386658999999998498899998 Q gi|254780936|r 115 HLVIFSGDGCFTTLVAALQRKVKKVTIVS 143 (182) Q Consensus 115 ~~iLvSGD~Df~pli~~lr~~Gk~V~v~~ 143 (182) --+|+|||.|..- +|...++|.+++ T Consensus 105 A~~lvTgD~dLL~----lr~~n~~~~Ilt 129 (142) T COG1569 105 ADYLVTGDQDLLV----LRDENKRVEILT 129 (142) T ss_pred CCEEEECCHHHHE----ECCCCCCEEECC T ss_conf 8889972615501----036678579737 No 192 >TIGR00632 vsr DNA mismatch endonuclease Vsr; InterPro: IPR004603 This entry represents VSR (very short patch repair) endonucleases, which occur in a variety of bacteria. VSR recognises a TG mismatched base pair, generated after spontaneous deamination of methylated cytosines, and cleaves the phosphate backbone on the 5' side of the thymine . GT mismatches can lead to C-to-T transition mutations if not repaired. VSR repairs the mismatches in favour of the G-containing strand. In Escherichia coli, this endonuclease nicks double-stranded DNA within the sequence CT(AT)GN or NT(AT)GG next to the thymidine residue, which is mismatched to 2'-deoxyguanosine . The incision is mismatch-dependent and strand specific. The structure of VSR is similar to the core structure of restriction endonucleases, which have a 3-layer alpha/beta/alpha topology . ; GO: 0004519 endonuclease activity, 0006298 mismatch repair. Probab=35.02 E-value=18 Score=17.13 Aligned_cols=54 Identities=22% Similarity=0.271 Sum_probs=27.9 Q ss_pred CCCEEEEEEEHHHHHHHHH-HCCCCC----CHHHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 8871899980789999998-508767----98999999851792999999730688731467775799988975595697 Q gi|254780936|r 4 PREKIALFIDGANLYASSK-ALGFDI----DYRKLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVV 78 (182) Q Consensus 4 ~~~rvaIfID~~Nl~~~~~-~~~~~~----d~~~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~ 78 (182) +.-|++|||||.-..+-.. ..+..+ || .+..+.. +..+-+.-.+.|.+.|..|. T Consensus 79 ~~~k~~iFidGCFWHg~~C~~~~~~p~s~~~f--W~~Ki~~-------------------N~~RD~~~~~~L~~~GWrv~ 137 (143) T TIGR00632 79 KEYKVVIFIDGCFWHGHDCDILGKVPKSRKDF--WSPKIEK-------------------NVERDRRVNDALAEEGWRVL 137 (143) T ss_pred CCCEEEEEECCCEECCCCCCCCCCCCCCCHHC--CHHHHHH-------------------HHHCCCCHHHHHHHCCCEEE T ss_conf 47358886027300587674014788664230--3688874-------------------25303117899984599898 No 193 >PRK06292 dihydrolipoamide dehydrogenase; Validated Probab=34.47 E-value=36 Score=15.21 Aligned_cols=32 Identities=28% Similarity=0.200 Sum_probs=19.6 Q ss_pred CEEEEECCCHHHHHHHHHHHHCCCEEEEEEEE Q ss_conf 88999838665899999999849889999831 Q gi|254780936|r 114 EHLVIFSGDGCFTTLVAALQRKVKKVTIVSTV 145 (182) Q Consensus 114 d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~ 145 (182) ..++++-|--==+++...+.+.|++|+++.-. T Consensus 170 k~v~VIGgG~ig~E~A~~l~~~G~~Vtvv~~~ 201 (460) T PRK06292 170 KSLAVIGGGVIGLELGQALSRLGVKVTVFERG 201 (460) T ss_pred CEEEEECCCHHHHHHHHHHHHCCCEEEEEEEC T ss_conf 56999998678879999998469869999624 No 194 >cd06357 PBP1_AmiC Periplasmic binding domain of amidase (AmiC) that belongs to the type I periplasmic binding fold protein family. This group includes the periplasmic binding domain of amidase (AmiC) that belongs to the type I periplasmic binding fold protein family. AmiC controls expression of the amidase operon by the ligand-triggered conformational switch. In the absence of ligand or presence of butyramide (repressor), AmiC (the ligand sensor and negative regulator) adopts an open conformation and inhibits the transcription antitermination function of AmiR by direct protein-protein interaction. In the presence of inducing ligands such as acetamide, AmiC adopts a closed conformation which disrupts a silencing AmiC-AmiR complex and the expression of amidase and other genes of the operon are induced. Probab=34.45 E-value=36 Score=15.21 Aligned_cols=66 Identities=12% Similarity=-0.072 Sum_probs=31.3 Q ss_pred HHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH-HCCCCEEEEECCCHHHHHHHHHHHHCCCE Q ss_conf 57999889755956975345895378886543565329999999994-21158899983866589999999984988 Q gi|254780936|r 63 LHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ-SEGLEHLVIFSGDGCFTTLVAALQRKVKK 138 (182) Q Consensus 63 ~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~-a~~~d~~iLvSGD~Df~pli~~lr~~Gk~ 138 (182) ...+.+.+.+.|.++.......+ ..+.+|.. --+.+. +.+-|.+++..--.|...+++-+++.|.. T Consensus 149 ~~~~~~~~~~~G~~vv~e~~~p~--------~~~~~Dfs--~~l~ki~~a~pD~V~~~~~g~~~~~~~~q~~~~G~~ 215 (360) T cd06357 149 NRIMRDLLEQRGGEVLGERYLPL--------GASDEDFA--RIVEEIREAQPDFIFSTLVGQSSYAFYRAYAAAGFD 215 (360) T ss_pred HHHHHHHHHHCCCEEEEEEEECC--------CCCHHHHH--HHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCC T ss_conf 99999999975996988998569--------99706699--999999867999999945476279999999983778 No 195 >PRK06467 dihydrolipoamide dehydrogenase; Reviewed Probab=34.09 E-value=37 Score=15.18 Aligned_cols=42 Identities=21% Similarity=0.224 Sum_probs=23.6 Q ss_pred HHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEE Q ss_conf 999994211588999838665899999999849889999831 Q gi|254780936|r 104 VDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTV 145 (182) Q Consensus 104 iD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~ 145 (182) .|++.+..--..++++-|--==+++...+.+.|++|+++... T Consensus 165 ~~~l~l~~~P~~v~IiGgG~ig~E~A~~~~~lG~~Vtli~~~ 206 (472) T PRK06467 165 TDALELKEVPKRLLVMGGGIIGLEMGTVYHRLGSEVDVVEMF 206 (472) T ss_pred HHHCCCCCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEC T ss_conf 665110139955999956472899999998529868999604 No 196 >PRK00635 excinuclease ABC subunit A; Provisional Probab=33.99 E-value=37 Score=15.17 Aligned_cols=35 Identities=20% Similarity=0.382 Sum_probs=17.1 Q ss_pred HHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHH Q ss_conf 8999999998498899998315776547089998521115189 Q gi|254780936|r 125 FTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLA 167 (182) Q Consensus 125 f~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~ 167 (182) ++.++++||..|-.|+||= + -.+...+||+.||+. T Consensus 517 Li~~L~~Lrd~GNTViVVE----H----D~~~i~~AD~iIDiG 551 (1809) T PRK00635 517 LIQVIKKLRDQGNTVLLVE----H----DEQMISLADRIIDIG 551 (1809) T ss_pred HHHHHHHHHHCCCEEEEEE----C----CHHHHHHCCEEEECC T ss_conf 9999999985699799996----6----899996277789617 No 197 >cd01542 PBP1_TreR_like Ligand-binding domain of DNA transcription repressor specific for trehalose (TreR) which is a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor specific for trehalose (TreR) which is a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of TreR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding wh Probab=33.87 E-value=37 Score=15.15 Aligned_cols=102 Identities=16% Similarity=0.038 Sum_probs=56.3 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999999994211 Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEG 112 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~ 112 (182) +.+.+.+.+.-..++. +....+......+...|.+++++.|...... ...+. ..+. -...+.++++ ++ T Consensus 105 a~~~L~~~Ghr~I~~i-~~~~~~~~~~~~R~~G~~~al~~~g~~~~~~---~~~~~----~~~~--~~~~~~~ll~--~~ 172 (259) T cd01542 105 LGEYLAQQGHKNIAYL-GVSESDIAVGILRKQGYLDALKEHGICPPNI---VETDF----SYES--AYEAAQELLE--PQ 172 (259) T ss_pred HHHHHHHHCCCEEEEE-ECCCCCCHHHHHHHHHHHHHHHHCCCCCCEE---EECCC----CHHH--HHHHHHHHHH--CC T ss_conf 9999998289859999-5586560679999999999999769996359---96578----7778--9999999974--57 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf 58899983866589999999984988----99998315 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) --++|++++|.=-..+++.+++.|.+ +-|+++.. T Consensus 173 ~~~Ai~~~nD~~A~g~~~~l~~~g~~vP~Disvigfdd 210 (259) T cd01542 173 PPDAIVCATDTIALGAMKYLQELGRRIPEDISVAGFGG 210 (259) T ss_pred CCCCEEECCHHHHHHHHHHHHHCCCCCCCCEEEEEECC T ss_conf 99832434589999999999981999999869999898 No 198 >pfam08497 Radical_SAM_N Radical SAM N-terminal. This domain tends to occur to the N-terminus of the pfam04055 domain in hypothetical bacterial proteins. Probab=33.38 E-value=38 Score=15.10 Aligned_cols=13 Identities=15% Similarity=0.383 Sum_probs=8.6 Q ss_pred CCEEEEEEEHHHH Q ss_conf 8718999807899 Q gi|254780936|r 5 REKIALFIDGANL 17 (182) Q Consensus 5 ~~rvaIfID~~Nl 17 (182) +||....|-++|+ T Consensus 68 ~PrLffgVsaGn~ 80 (298) T pfam08497 68 RPRLFFGVTAGNM 80 (298) T ss_pred CCCEEEEECCCCH T ss_conf 9847988636648 No 199 >KOG3123 consensus Probab=33.09 E-value=38 Score=15.07 Aligned_cols=39 Identities=13% Similarity=0.221 Sum_probs=29.2 Q ss_pred HHHHHHCCCCEEEEECCCH----HHHHHHHHHHHCCCEEEEEE Q ss_conf 9999421158899983866----58999999998498899998 Q gi|254780936|r 105 DAFEQSEGLEHLVIFSGDG----CFTTLVAALQRKVKKVTIVS 143 (182) Q Consensus 105 D~~~~a~~~d~~iLvSGD~----Df~pli~~lr~~Gk~V~v~~ 143 (182) +++..|.+-|++.||-||- -...++.++++.|..|.++. T Consensus 68 ~il~~ad~~dVa~LVVGdPfgATTHsDlvlRAk~~~ipv~vIH 110 (272) T KOG3123 68 KILDEADKEDVAFLVVGDPFGATTHSDLVLRAKELGIPVEVIH 110 (272) T ss_pred HHHHHHHHCCEEEEEECCCCCCCCHHHHHEEHHHCCCCEEEEE T ss_conf 7764120126479996586666420354102565499769981 No 200 >cd03213 ABCG_EPDR ABCG transporters are involved in eye pigment (EP) precursor transport, regulation of lipid-trafficking mechanisms, and pleiotropic drug resistance (DR). DR is a well-described phenomenon occurring in fungi and shares several similarities with processes in bacteria and higher eukaryotes. Compared to other members of the ABC transporter subfamilies, the ABCG transporter family is composed of proteins that have an ATP-binding cassette domain at the N-terminus and a TM (transmembrane) domain at the C-terminus. Probab=32.85 E-value=38 Score=15.05 Aligned_cols=42 Identities=12% Similarity=0.058 Sum_probs=29.3 Q ss_pred CCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHH Q ss_conf 866589999999984988999983157765470899985211151899 Q gi|254780936|r 121 GDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAY 168 (182) Q Consensus 121 GD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~ 168 (182) .-.++..+++.+++.|+.|++++ ...+.++.+.||+.+-|++ T Consensus 146 ~~~~i~~~l~~l~~~g~tvi~~t------H~~~~~~~~~~Drv~vl~~ 187 (194) T cd03213 146 SALQVMSLLRRLADTGRTIICSI------HQPSSEIFELFDKLLLLSQ 187 (194) T ss_pred HHHHHHHHHHHHHHCCCEEEEEE------CCCHHHHHHHCCEEEEEEC T ss_conf 99999999999996898999995------8885999997999999989 No 201 >PRK08659 2-oxoglutarate ferredoxin oxidoreductase subunit alpha; Validated Probab=32.70 E-value=39 Score=15.03 Aligned_cols=49 Identities=16% Similarity=0.117 Sum_probs=35.7 Q ss_pred CCEEEEECCCH--HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCC Q ss_conf 58899983866--589999999984988999983157765470899985211 Q gi|254780936|r 113 LEHLVIFSGDG--CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADY 162 (182) Q Consensus 113 ~d~~iLvSGD~--Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~ 162 (182) -|++|+..|-+ -.-.+++.||+.|.+|-++.... .+.....+|++.+.+ T Consensus 274 Adi~IV~~GS~~~~~~eAvd~Lr~~G~kvg~l~ir~-l~PfP~~~i~~~l~~ 324 (377) T PRK08659 274 AEVVVVAYGSVARSARRAVKEAREEGIKVGLFRLIT-VWPFPEEAIRELAKK 324 (377) T ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEEE-ECCCCHHHHHHHHHC T ss_conf 989999968557899999999996498688999722-069998999999856 No 202 >pfam00532 Peripla_BP_1 family. This family includes the periplasmic binding proteins, and the LacI family transcriptional regulators. The periplasmic binding proteins are the primary receptors for chemotaxis and transport of many sugar based solutes. The LacI family of proteins consist of transcriptional regulators related to the lac repressor. In this case, generally the sugar binding domain binds a sugar which changes the DNA binding activity of the repressor domain (pfam00356). Probab=32.57 E-value=39 Score=15.02 Aligned_cols=93 Identities=11% Similarity=-0.045 Sum_probs=51.9 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH--H Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999999994--2 Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ--S 110 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~--a 110 (182) ..++|.+.+.-... ++.+.+........+...|.++|++.|..+....+.. ...+..-+.+++.. . T Consensus 111 a~~~Li~~Ghrr~i-~~~~~~~~~~~~~~R~~Gf~~al~~~g~~~~~~~~~~-----------~~~~~~~g~~~~~~ll~ 178 (281) T pfam00532 111 STQYLIAEGHKRPI-AVMAGPASALTARERVQGFMAALAAAGREVKIYHVAT-----------GDNDIPDAALAANAMLV 178 (281) T ss_pred HHHHHHHHCCCCCE-EEECCCCCCHHHHHHHHHHHHHHHHCCCCCCCEEEEE-----------CCCCHHHHHHHHHHHHH T ss_conf 99999983897704-7730787650789999999999997599998206883-----------68888999999999985 Q ss_pred CCC-CEEEEECCCHHHHHHHHHHHHCCC Q ss_conf 115-889998386658999999998498 Q gi|254780936|r 111 EGL-EHLVIFSGDGCFTTLVAALQRKVK 137 (182) Q Consensus 111 ~~~-d~~iLvSGD~Df~pli~~lr~~Gk 137 (182) .+- =++|++++|.==.-+++.+++.|+ T Consensus 179 ~~~~~~Ai~~~nD~~A~G~~~al~~~G~ 206 (281) T pfam00532 179 SHPTIDAIVAMNDEAAMGAVRALLKQGR 206 (281) T ss_pred CCCCCEEEEECCHHHHHHHHHHHHHCCC T ss_conf 4999609997587999999999998299 No 203 >PRK13210 putative L-xylulose 5-phosphate 3-epimerase; Reviewed Probab=32.55 E-value=39 Score=15.02 Aligned_cols=46 Identities=7% Similarity=0.052 Sum_probs=28.0 Q ss_pred EEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCC-CCHHHHHHHCC Q ss_conf 99983866589999999984988999983157765-47089998521 Q gi|254780936|r 116 LVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPS-MASDQLRRQAD 161 (182) Q Consensus 116 ~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~-~~S~~L~~~ad 161 (182) +-+=+||-||..++..|++.|-+-.+..+...... ..-.+++++.+ T Consensus 226 vp~G~G~vDf~~~~~~L~~~gY~G~~~iE~w~~~~~~~~~~i~~a~~ 272 (284) T PRK13210 226 VPFGEGCVDFVGIFKTLKELNYRGPFLIEMWTEKAEEPVAEIIQARR 272 (284) T ss_pred CCCCCCCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHH T ss_conf 58898765889999999981996327999924899787999999999 No 204 >pfam03709 OKR_DC_1_N Orn/Lys/Arg decarboxylase, N-terminal domain. This domain has a flavodoxin-like fold, and is termed the "wing" domain because of its position in the overall 3D structure. Probab=32.42 E-value=39 Score=15.00 Aligned_cols=83 Identities=20% Similarity=0.243 Sum_probs=51.7 Q ss_pred HHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH---CCCCEEEEECCCHHHHHHHHHHHHCCCEE Q ss_conf 579998897559569753458953788865435653299999999942---11588999838665899999999849889 Q gi|254780936|r 63 LHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS---EGLEHLVIFSGDGCFTTLVAALQRKVKKV 139 (182) Q Consensus 63 ~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a---~~~d~~iLvSGD~Df~pli~~lr~~Gk~V 139 (182) .+....+|...|++|+.. .+.| |+...+ ..+. +|++|=|.....++..++++|..+ T Consensus 6 ~~~l~~~l~~~~~~vv~~---------------~~~~-----d~~~~~~~~~~i~-avil~~~~~~~~ll~~ir~~n~~l 64 (111) T pfam03709 6 SRELAEALEATGREVVDA---------------TSTD-----DLLSLIETFTDIA-AVVLSWDDEARGLLDEIRRRNFDL 64 (111) T ss_pred HHHHHHHHHHCCCEEEEE---------------CCHH-----HHHHHHHHCCCCC-EEEEEECHHHHHHHHHHHHHCCCC T ss_conf 899999999789889974---------------8789-----9999998387876-899984606899999999747899 Q ss_pred EEEEEECC-CCCCCHHHHHHHCCCCEEH Q ss_conf 99983157-7654708999852111518 Q gi|254780936|r 140 TIVSTVLS-DPSMASDQLRRQADYFMDL 166 (182) Q Consensus 140 ~v~~~~~~-~~~~~S~~L~~~ad~fi~l 166 (182) =|+..... .....+.+.-...++|+++ T Consensus 65 PvFl~~~~~~~~~vp~~~l~~i~gfi~l 92 (111) T pfam03709 65 PVFLLAETRTSEDVPLDVLREIDGFIEL 92 (111) T ss_pred CEEEEECHHHHHCCCHHHHHHHHHHHHH T ss_conf 9898844125750999999988786736 No 205 >PRK10886 DnaA initiator-associating protein DiaA; Provisional Probab=32.21 E-value=39 Score=14.98 Aligned_cols=66 Identities=14% Similarity=0.039 Sum_probs=39.8 Q ss_pred CHHHHHHHHHHHHCCCCEEEEECCCH---HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEH Q ss_conf 32999999999421158899983866---5899999999849889999831577654708999852111518 Q gi|254780936|r 98 MDVELAVDAFEQSEGLEHLVIFSGDG---CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDL 166 (182) Q Consensus 98 ~Dv~laiD~~~~a~~~d~~iLvSGD~---Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l 166 (182) -|--.+-.+-.+.+.=|.+|.+|.-+ ..+.+++.++++|.+|+.++-..|. --..|...+|--|.+ T Consensus 95 ~~~vF~rQl~alg~~gDiLi~iStSGnS~Nii~Ai~~A~~~g~~~i~ltG~~gg---~l~~l~~~~Di~I~v 163 (196) T PRK10886 95 HDEVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVALTGYDGG---ELAGLLGPQDVEIRI 163 (196) T ss_pred HHHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHHCCCEEEEEECCCCH---HHHHHCCCCCEEEEC T ss_conf 999999999985679998999948999989999999999889989999768862---366515767989986 No 206 >PRK11658 UDP-4-amino-4-deoxy-L-arabinose--oxoglutarate aminotransferase; Provisional Probab=32.08 E-value=40 Score=14.97 Aligned_cols=101 Identities=15% Similarity=0.065 Sum_probs=50.6 Q ss_pred CCCCCHHHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHH Q ss_conf 87679899999985179299999973068873146777579998897559569753458953788865435653299999 Q gi|254780936|r 25 GFDIDYRKLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAV 104 (182) Q Consensus 25 ~~~~d~~~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~lai 104 (182) .+.+|+..+-+.+...-..+...-+.+.+.+... +.+..++.|+.++.=--.-. +......+-|+ ..++ T Consensus 106 t~~id~~~~e~~it~~tkaIi~Vh~~G~~~d~~~-------i~~i~~~~~i~vIEDaA~a~--Ga~~~g~~~Gs-~g~a- 174 (379) T PRK11658 106 TLMVTPEAIEAAITPRTKAIIPVHYAGAPADLDA-------IRAIGERYGIPVIEDAAHAV--GTYYKGRHIGA-RGTA- 174 (379) T ss_pred CCCCCHHHHHHHCCCCCEEEEEECCCCCCCCHHH-------HHHHHHHCCCEEEECCHHHH--CCCCCCCCCCC-CCCE- T ss_conf 5354889998654826549998568898663779-------99999975981897083553--66547986676-2445- Q ss_pred HHHHHH-----CCCCEEEEECCCHHHHHHHHHHHHCCC Q ss_conf 999942-----115889998386658999999998498 Q gi|254780936|r 105 DAFEQS-----EGLEHLVIFSGDGCFTTLVAALQRKVK 137 (182) Q Consensus 105 D~~~~a-----~~~d~~iLvSGD~Df~pli~~lr~~Gk 137 (182) +.++- ..-+-=.|+|.|.++...++.+|.+|. T Consensus 175 -~fSF~~~K~ittGeGG~ivtnd~~l~~~~r~lr~hG~ 211 (379) T PRK11658 175 -IFSFHAIKNITCAEGGLIVTDDEELADRLRMLKFHGL 211 (379) T ss_pred -EEECCCCCCCCCCCCEEEEECCHHHHHHHHHHHHCCC T ss_conf -7856666678777874999699999999999997798 No 207 >cd01477 vWA_F09G8-8_type VWA F09G8.8 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of mo Probab=31.97 E-value=40 Score=14.96 Aligned_cols=51 Identities=20% Similarity=0.249 Sum_probs=31.3 Q ss_pred CCCC-EEEEECCC------HHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCC Q ss_conf 1158-89998386------658999999998498899998315776547089998521 Q gi|254780936|r 111 EGLE-HLVIFSGD------GCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQAD 161 (182) Q Consensus 111 ~~~d-~~iLvSGD------~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad 161 (182) .+|- .+|++++| -|=.|+..+||..|..++-+++....-...-..|.+.|. T Consensus 129 ~nykKVVIVyAs~y~~~g~~dp~pvA~rLK~~Gv~IiTVa~~q~~~~~~~~~L~~IAS 186 (193) T cd01477 129 ENYKKVVIVFASDYNDEGSNDPRPIAARLKSTGIAIITVAFTQDESSNLLDKLGKIAS 186 (193) T ss_pred CCCCEEEEEEECCCCCCCCCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHCC T ss_conf 4862799999502467898886999999987697899998268875889998887579 No 208 >cd06278 PBP1_LacI_like_2 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=31.84 E-value=40 Score=14.94 Aligned_cols=100 Identities=12% Similarity=0.023 Sum_probs=57.5 Q ss_pred HHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH-H Q ss_conf 999998517929999997306887314677757999889755956975345895378886543565329999999994-2 Q gi|254780936|r 32 KLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ-S 110 (182) Q Consensus 32 ~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~-a 110 (182) .+.+++.+.|.-..++ .+.+........+...|.++|+..|..+.... . .+.+..-+.+++.. . T Consensus 105 ~a~~~L~~~Gh~~I~~--i~~~~~~~~~~~R~~G~~~al~~~gl~~~~~~----~---------~~~~~~~g~~~~~~ll 169 (266) T cd06278 105 LAAELLLAKGCRRIAF--IGGPADTSTSRERERGFRDALAAAGVPVVVEE----A---------GDYSYEGGYEAARRLL 169 (266) T ss_pred HHHHHHHHCCCCEEEE--EECCCCCCHHHHHHHHHHHHHHHCCCCCEEEE----C---------CCCCHHHHHHHHHHHH T ss_conf 9999999809988999--96888881499999999999998699943898----1---------8999899999999998 Q ss_pred --CCCCEEEEECCCHHHHHHHHHHH-HCCCE----EEEEEEEC Q ss_conf --11588999838665899999999-84988----99998315 Q gi|254780936|r 111 --EGLEHLVIFSGDGCFTTLVAALQ-RKVKK----VTIVSTVL 146 (182) Q Consensus 111 --~~~d~~iLvSGD~Df~pli~~lr-~~Gk~----V~v~~~~~ 146 (182) ..--++|++++|.-=..+++.++ +.|.+ +-|+++.. T Consensus 170 ~~~~~ptAi~~~nD~~A~g~l~~l~~~~g~~vP~disvvgfD~ 212 (266) T cd06278 170 ASRPRPDAIFCANDLLAIGVMDAARQEGGLRVPEDVSVIGFDD 212 (266) T ss_pred HCCCCCCCCEECCHHHHHHHHHHHHHHCCCCCCCCEEEEEECC T ss_conf 4599988625077899999999999735888888669998788 No 209 >PRK02645 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=31.69 E-value=40 Score=14.93 Aligned_cols=77 Identities=12% Similarity=0.047 Sum_probs=49.5 Q ss_pred CCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHH Q ss_conf 73146777579998897559569753458953788865435653299999999942115889998386658999999998 Q gi|254780936|r 55 DPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQR 134 (182) Q Consensus 55 ~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~ 134 (182) ..++.....+...++|...|+++...+-...... ... ........+|.+|.+-|||=+..+++.+.. T Consensus 13 ~~~~A~~~a~~l~~~L~~rGi~v~~~~~~~~~~~-----~~~--------~~~~~~~~~Dlvi~lGGDGT~L~aar~~~~ 79 (304) T PRK02645 13 GSPQAKEAAERCAKQLEARGIKVLMGPSGPKDNP-----YPV--------FLASAEELPDLAIVLGGDGTVLAAARHLAP 79 (304) T ss_pred CCHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCC-----CCC--------CCCCCCCCCCEEEEECCCHHHHHHHHHHCC T ss_conf 9999999999999999988899998444344477-----762--------001446688899997868899999998542 Q ss_pred CCCEEEEEEE Q ss_conf 4988999983 Q gi|254780936|r 135 KVKKVTIVST 144 (182) Q Consensus 135 ~Gk~V~v~~~ 144 (182) .+.-|..+-. T Consensus 80 ~~iPilGiN~ 89 (304) T PRK02645 80 HDIPILSFNV 89 (304) T ss_pred CCCCEEEEEC T ss_conf 6998899824 No 210 >cd06578 HemD Uroporphyrinogen-III synthase (HemD) catalyzes the asymmetrical cyclization of tetrapyrrole (linear) to uroporphyrinogen-III, the fourth step in the biosynthesis of heme. This ubiquitous enzyme is present in eukaryotes, bacteria and archaea. Mutations in the human uroporphyrinogen-III synthase gene cause congenital erythropoietic porphyria, a recessive inborn error of metabolism also known as Gunther disease. Probab=31.64 E-value=40 Score=14.92 Aligned_cols=20 Identities=20% Similarity=0.056 Sum_probs=11.8 Q ss_pred HHHHHHHCCCEEEEEEEEEE Q ss_conf 99889755956975345895 Q gi|254780936|r 66 LLDWLHYNGFQVVAKVAKEF 85 (182) Q Consensus 66 ~~~~l~~~g~~v~~~~~~~~ 85 (182) +.+.|...|+.+....+..+ T Consensus 137 L~~~L~~~g~~v~~~~~Y~~ 156 (239) T cd06578 137 LAEALRERGAEVDEVEVYRT 156 (239) T ss_pred HHHHHHHCCCEEEEEEEEEE T ss_conf 99999978986789988665 No 211 >PRK08366 vorA 2-ketoisovalerate ferredoxin oxidoreductase subunit alpha; Reviewed Probab=31.61 E-value=40 Score=14.92 Aligned_cols=48 Identities=17% Similarity=0.183 Sum_probs=36.6 Q ss_pred CEEEEECCC--HHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCC Q ss_conf 889998386--6589999999984988999983157765470899985211 Q gi|254780936|r 114 EHLVIFSGD--GCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADY 162 (182) Q Consensus 114 d~~iLvSGD--~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~ 162 (182) |.+|+.-|- +---.++++||+.|.+|-++.... .+.....+|++...+ T Consensus 265 d~viV~~GS~~~~~~eaVd~lr~~G~kVG~l~vr~-~rPFP~~~l~~~l~~ 314 (394) T PRK08366 265 DFVFMGMGSLMGTVKEAVDLLRKEGYKVGYAKVRW-FRPFPKEELYEIAES 314 (394) T ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEEE-ECCCCHHHHHHHHCC T ss_conf 79999937518999999999997598665899988-489999999998757 No 212 >PRK06116 glutathione reductase; Validated Probab=31.30 E-value=41 Score=14.89 Aligned_cols=31 Identities=3% Similarity=0.052 Sum_probs=15.7 Q ss_pred CEEEEECCCHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 8899983866589999999984988999983 Q gi|254780936|r 114 EHLVIFSGDGCFTTLVAALQRKVKKVTIVST 144 (182) Q Consensus 114 d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~ 144 (182) ..++++-|--==+++...+.+.|.+|+++.- T Consensus 168 ~~v~IiGgG~ig~E~A~~~~~lG~~Vtlv~~ 198 (450) T PRK06116 168 KRVAVVGAGYIAVEFAGVLHGLGSETHLFVR 198 (450) T ss_pred CEEEEECCCHHHHHHHHHHHHHCCEEEEEEE T ss_conf 7799999966699999999960984899994 No 213 >TIGR02467 CbiE precorrin-6y C5,15-methyltransferase (decarboxylating), CbiE subunit; InterPro: IPR012818 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CbiE subunit of precorrin-6Y C5,15-methyltransferase (2.1.1.132 from EC) from the anaerobic pathway, a bifunctional enzyme that catalyses two methylations (at C-5 and C-15) in precorrin-6Y, as well as the decarboxylation of the acetate side chain located in ring C, in order to generate precorrin-8X. In the anaerobic pathway, two enzymes are required to produce precorrin-8X: CbiE and CbiT, which can be fused as CbiET (sometimes called CobL) . In the aerobic pathway, the bifunctional enzyme is CobL .; GO: 0008276 protein methyltransferase activity, 0009236 cobalamin biosynthetic process. Probab=31.28 E-value=41 Score=14.89 Aligned_cols=57 Identities=16% Similarity=0.212 Sum_probs=37.1 Q ss_pred CHHHHHHHHHHHH--CCCCEEEEECCCHHHHHHHHHHHHCCC--EEEEEEEECCCCCCCHHHHHHHCCC Q ss_conf 3299999999942--115889998386658999999998498--8999983157765470899985211 Q gi|254780936|r 98 MDVELAVDAFEQS--EGLEHLVIFSGDGCFTTLVAALQRKVK--KVTIVSTVLSDPSMASDQLRRQADY 162 (182) Q Consensus 98 ~Dv~laiD~~~~a--~~~d~~iLvSGD~Df~pli~~lr~~Gk--~V~v~~~~~~~~~~~S~~L~~~ad~ 162 (182) .|..-.++.+.-. .+-+++||.|||-=|--+...|.++.. ++.++ +.+| +++-+|-+ T Consensus 51 ~~l~~~~~~i~~~~~~~~~vvvLaSGDP~F~Gig~~l~~~~~~~~~~~i-------Pg~S-S~Q~A~AR 111 (210) T TIGR02467 51 KDLKELLEFIAATREKEKRVVVLASGDPLFYGIGRTLAERLGKEQLEII-------PGIS-SVQLAFAR 111 (210) T ss_pred CCHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHCCCCEEEEC-------CCHH-HHHHHHHH T ss_conf 7867899999887534897699963797532289999860543326871-------6667-89999997 No 214 >PRK07818 dihydrolipoamide dehydrogenase; Reviewed Probab=31.12 E-value=41 Score=14.87 Aligned_cols=31 Identities=23% Similarity=0.151 Sum_probs=19.9 Q ss_pred CEEEEECCCHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 8899983866589999999984988999983 Q gi|254780936|r 114 EHLVIFSGDGCFTTLVAALQRKVKKVTIVST 144 (182) Q Consensus 114 d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~ 144 (182) ..++++-|--==++....+...|++|.++.. T Consensus 173 k~v~IiGgG~ig~E~A~~~~~lG~~Vtiv~~ 203 (467) T PRK07818 173 KSIVIAGAGAIGMEFAYVLKNYGVDVTIVEF 203 (467) T ss_pred CEEEEECCCHHHHHHHHHHHHCCCEEEEEEE T ss_conf 6699999879999999999832987777621 No 215 >PRK06115 dihydrolipoamide dehydrogenase; Reviewed Probab=30.49 E-value=42 Score=14.80 Aligned_cols=39 Identities=26% Similarity=0.206 Sum_probs=25.3 Q ss_pred HHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 999421158899983866589999999984988999983 Q gi|254780936|r 106 AFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVST 144 (182) Q Consensus 106 ~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~ 144 (182) ++++..--..++++-|--==+++...+++.|++|+++.- T Consensus 167 ~~~l~~lPk~l~iiGgG~ig~E~A~~~~~lG~~Vtlv~~ 205 (466) T PRK06115 167 ALSLPEVPKHLVVIGAGVIGLELGSVWRRLGAQVTVVEY 205 (466) T ss_pred CCCHHHCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEE T ss_conf 003544896799995858999999999861984789983 No 216 >TIGR03378 glycerol3P_GlpB glycerol-3-phosphate dehydrogenase, anaerobic, B subunit. Members of this protein family are the B subunit, product of the glpB gene, of a three-subunit, membrane-anchored, FAD-dependent anaerobic glycerol-3-phosphate dehydrogenase. Probab=30.43 E-value=42 Score=14.80 Aligned_cols=37 Identities=19% Similarity=0.186 Sum_probs=24.6 Q ss_pred HHCCCCEEEE--ECCCHHHHHHHHHHH-HCCCEEEEEEEE Q ss_conf 4211588999--838665899999999-849889999831 Q gi|254780936|r 109 QSEGLEHLVI--FSGDGCFTTLVAALQ-RKVKKVTIVSTV 145 (182) Q Consensus 109 ~a~~~d~~iL--vSGD~Df~pli~~lr-~~Gk~V~v~~~~ 145 (182) ++...|.+++ |-|+.|-..++++|+ ..|+.|.-+-+. T Consensus 217 ~~~~~d~V~lPAvlGl~~~~~~~~~L~~~lg~~v~EvPtl 256 (419) T TIGR03378 217 LANGADLVVLPACFGLGDGLELLRELEQATGLTLCELPTM 256 (419) T ss_pred HCCCCCEEEECCEECCCCHHHHHHHHHHHHCCCEEECCCC T ss_conf 5689889998964178973899999999889888988999 No 217 >PRK10563 6-phosphogluconate phosphatase; Provisional Probab=30.39 E-value=42 Score=14.79 Aligned_cols=69 Identities=17% Similarity=0.110 Sum_probs=41.1 Q ss_pred HHHHHHHHHHHHC-CCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHHHHHHHC Q ss_conf 2999999999421-158899983866589999999984988999983157765470899985211151899787622 Q gi|254780936|r 99 DVELAVDAFEQSE-GLEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAYLKNEIA 174 (182) Q Consensus 99 Dv~laiD~~~~a~-~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~l~~~i~ 174 (182) |-.+-+.+.+... .-+.++.| |||. .=|+..++.|.+|+.++.... -+..+..+...|.++.+++..++ T Consensus 144 ~Pdiyl~A~~~lgv~p~~clvi-EDS~--~Gi~AA~aAGm~vi~~~~~~~----~~~~~~~~~~~~~~~~~l~~l~~ 213 (221) T PRK10563 144 DPALMFHAAEAMNVNVENCILV-DDSS--AGAQSGIAAGMEVFYFCADPH----NKPIVHPKVTTFTDLAQLPELWK 213 (221) T ss_pred CCHHHHHHHHHCCCCHHHEEEE-ECCH--HHHHHHHHCCCEEEEECCCCC----CCCCCCCCCCCHHCHHHHHHHHH T ss_conf 8099999999829986784878-4899--999999985996999899987----65443443234107998199999 No 218 >TIGR00559 pdxJ pyridoxal phosphate biosynthetic protein PdxJ; InterPro: IPR004569 Pyridoxal phosphate is the active form of vitamin B6 (pyridoxine or pyridoxal). PLP is a versatile catalyst, acting as a coenzyme in a multitude of reactions, including decarboxylation, deamination and transamination , , . PLP-dependent enzymes are primarily involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and in the synthesis or catabolism of neurotransmitters; pyridoxal phosphate can also inhibit DNA polymerases and several steroid receptors . Inadequate levels of pyridoxal phosphate in the brain can cause neurological dysfunction, particularly epilepsy . PLP enzymes exist in their resting state as a Schiff base, the aldehyde group of PLP forming a linkage with the epsilon-amino group of an active site lysine residue on the enzyme. The alpha-amino group of the substrate displaces the lysine epsilon-amino group, in the process forming a new aldimine with the substrate. This aldimine is the common central intermediate for all PLP-catalyzed reactions, enzymatic and non-enzymatic . In Escherichia coli, the pdx genes involved in vitamin B6 have been characterised , , . This entry represents PdxJ, which catalyses the condensation of 1-amino-3-oxo-4-(phosphohydroxy)propan-2-one and 1-deoxy-D-xylulose-5-phosphate to form pyridoxine-5'-phosphate. The product of the PdxJ reaction is then oxidized by PdxH to pyridoxal 5'-phosphate.; GO: 0008615 pyridoxine biosynthetic process, 0005737 cytoplasm. Probab=30.36 E-value=42 Score=14.79 Aligned_cols=70 Identities=16% Similarity=0.071 Sum_probs=37.9 Q ss_pred HHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHH-HHHHHHHHHHCCCEEEEEEE Q ss_conf 99889755956975345895378886543565329999999994211588999838665-89999999984988999983 Q gi|254780936|r 66 LLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGC-FTTLVAALQRKVKKVTIVST 144 (182) Q Consensus 66 ~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~D-f~pli~~lr~~Gk~V~v~~~ 144 (182) +.-.+++..-.|...| +......+++|+| ||+-.| +-.+++++++.|.+|=++- T Consensus 79 ~~~~~knkP~~vTlVP-----e~r~evTtegGLD-------------------va~~~dkL~~~~~~~~~~GI~vSlFI- 133 (265) T TIGR00559 79 IAVEIKNKPEKVTLVP-----EKRDEVTTEGGLD-------------------VARLKDKLKELVKRLHEAGIEVSLFI- 133 (265) T ss_pred HHHHHCCCCCEEEECC-----CCCCEEECCCCCC-------------------HHHHHHHHHHHHHHHHHCCCEEEEEE- T ss_conf 9998558997387416-----9886030264400-------------------01104679999999986798587742- Q ss_pred ECCCCCCCHHHHHHHCC Q ss_conf 15776547089998521 Q gi|254780936|r 145 VLSDPSMASDQLRRQAD 161 (182) Q Consensus 145 ~~~~~~~~S~~L~~~ad 161 (182) .+.+...+++-..-+| T Consensus 134 -d~~~d~i~~A~e~g~~ 149 (265) T TIGR00559 134 -DADKDQISAAAEVGAD 149 (265) T ss_pred -CCCHHHHHHHHHCCCC T ss_conf -5446888999970898 No 219 >TIGR02153 gatD_arch glutamyl-tRNA(Gln) amidotransferase, subunit D; InterPro: IPR011878 This peptide is found only in the Archaea. It is part of a heterodimer, with GatD (IPR004414 from INTERPRO), that acts as an amidotransferase on misacylated Glu-tRNA(Gln) to produce Gln-tRNA(Gln). The analogous amidotransferase found in bacteria is the GatABC system, although GatABC homologs in the Archaea appear to act instead on Asp-tRNA(Asn) .; GO: 0006450 regulation of translational fidelity. Probab=30.30 E-value=42 Score=14.78 Aligned_cols=66 Identities=11% Similarity=0.041 Sum_probs=41.7 Q ss_pred HHHHHHH--CCCC-EEEEECCCH----HHHHHHHHHHHCCCEEEEEEEECCCCCCCH-------HHHHHHCCCCEEHHHH Q ss_conf 9999942--1158-899983866----589999999984988999983157765470-------8999852111518997 Q gi|254780936|r 104 VDAFEQS--EGLE-HLVIFSGDG----CFTTLVAALQRKVKKVTIVSTVLSDPSMAS-------DQLRRQADYFMDLAYL 169 (182) Q Consensus 104 iD~~~~a--~~~d-~~iLvSGD~----Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S-------~~L~~~ad~fi~l~~l 169 (182) -++++.. ..|- .+|.=||=| |++|.|+++.+.|.-|..- +|...+.+. |+|+++ +.|..+|+ T Consensus 298 p~il~~~~d~GykGiViEGTGLGHvs~~~ip~i~ra~d~Gv~V~MT--SQClyGRVNlnVY~TGR~Ll~a--GVIp~EDM 373 (413) T TIGR02153 298 PEILEFLVDKGYKGIVIEGTGLGHVSEDWIPSIKRATDDGVPVVMT--SQCLYGRVNLNVYSTGRELLKA--GVIPGEDM 373 (413) T ss_pred HHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEE--EEEECCEECCCCCCCHHHHHHC--CCCCCCCC T ss_conf 8899998518715999833787555235899999987589689996--1143350507534334899975--87278888 Q ss_pred HHHH Q ss_conf 8762 Q gi|254780936|r 170 KNEI 173 (182) Q Consensus 170 ~~~i 173 (182) .++. T Consensus 374 lPEv 377 (413) T TIGR02153 374 LPEV 377 (413) T ss_pred CHHH T ss_conf 7699 No 220 >PRK08306 dipicolinate synthase subunit A; Reviewed Probab=30.17 E-value=43 Score=14.77 Aligned_cols=54 Identities=26% Similarity=0.198 Sum_probs=33.2 Q ss_pred EEEEECCCHH-HHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHH---HCCCCEEHHHHHHHHCC Q ss_conf 8999838665-8999999998498899998315776547089998---52111518997876225 Q gi|254780936|r 115 HLVIFSGDGC-FTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRR---QADYFMDLAYLKNEIAR 175 (182) Q Consensus 115 ~~iLvSGD~D-f~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~---~ad~fi~l~~l~~~i~r 175 (182) --++|.|=|= =..+++.++..|.+|.|++-. +..|.+ ..-+.+.++++.+.+.. T Consensus 153 s~~lVlG~Gr~G~~lA~~l~~lGA~V~V~aR~-------~~~~a~a~~~G~~~v~~~~L~~~~~~ 210 (296) T PRK08306 153 SNVLVLGFGRTGMTLARTLKALGAKVKVGARK-------SAHLARITEMGYSPVHLSELAEEVGE 210 (296) T ss_pred CCEEEECCCHHHHHHHHHHHHCCCEEEEEECC-------HHHHHHHHHCCCEEEEHHHHHHHHCC T ss_conf 65899898689999999999779969999799-------99999999769858719999977356 No 221 >cd01469 vWA_integrins_alpha_subunit Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions. Probab=29.92 E-value=43 Score=14.74 Aligned_cols=48 Identities=10% Similarity=0.103 Sum_probs=31.2 Q ss_pred CEEEEECC-C----HHHHHHHHHHHHCCCEEEEEEEECCCCC-CCHHHHHHHCC Q ss_conf 88999838-6----6589999999984988999983157765-47089998521 Q gi|254780936|r 114 EHLVIFSG-D----GCFTTLVAALQRKVKKVTIVSTVLSDPS-MASDQLRRQAD 161 (182) Q Consensus 114 d~~iLvSG-D----~Df~pli~~lr~~Gk~V~v~~~~~~~~~-~~S~~L~~~ad 161 (182) .++||+|. . .+-..+++.+|..|.+|..+++...... ....+|...|. T Consensus 105 kv~ivlTDG~s~d~~~~~~~~~~lk~~gv~vf~VGvG~~~~~~~~~~eL~~iAs 158 (177) T cd01469 105 KVLVVITDGESHDDPLLKDVIPQAEREGIIRYAIGVGGHFQRENSREELKTIAS 158 (177) T ss_pred EEEEEEECCCCCCCCCHHHHHHHHHHCCEEEEEEEECCCCCCCCCHHHHHHHHC T ss_conf 699999789867750149999999979908999995551467451999999967 No 222 >cd02071 MM_CoA_mut_B12_BD methylmalonyl CoA mutase B12 binding domain. This domain binds to B12 (adenosylcobamide), which initiates the conversion of succinyl CoA and methylmalonyl CoA by forming an adenosyl radical, which then undergoes a rearrangement exchanging a hydrogen atom with a group attached to a neighboring carbon atom. This family is present in both mammals and bacteria. Bacterial members are heterodimers and involved in the fermentation of pyruvate to propionate. Mammalian members are homodimers and responsible for the conversion of odd-chain fatty acids and branched-chain amino acids via propionyl CoA to succinyl CoA for further degradation. Probab=29.85 E-value=43 Score=14.74 Aligned_cols=62 Identities=18% Similarity=0.282 Sum_probs=35.1 Q ss_pred HHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH--CCCCEEEEEC--CC--HHHHHHHHHHHHCCC Q ss_conf 79998897559569753458953788865435653299999999942--1158899983--86--658999999998498 Q gi|254780936|r 64 HPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS--EGLEHLVIFS--GD--GCFTTLVAALQRKVK 137 (182) Q Consensus 64 ~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a--~~~d~~iLvS--GD--~Df~pli~~lr~~Gk 137 (182) +-....|+..||+|+.-.+... .+ ++++.| ...|.+.|-| |- ..+-.+++.|++.|. T Consensus 17 ~iva~~l~d~GfeVi~lG~~~s------------~e-----eiv~~A~~e~ad~IglSsL~g~h~~~~~~l~~~L~e~G~ 79 (122) T cd02071 17 KVIARALRDAGFEVIYTGLRQT------------PE-----EIVEAAIQEDVDVIGLSSLSGGHMTLFPEVIELLRELGA 79 (122) T ss_pred HHHHHHHHHCCCEEEECCCCCC------------HH-----HHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHCCC T ss_conf 9999999978976996799889------------99-----999999973998999964655447899999999997699 Q ss_pred E-EEEE Q ss_conf 8-9999 Q gi|254780936|r 138 K-VTIV 142 (182) Q Consensus 138 ~-V~v~ 142 (182) + +.++ T Consensus 80 ~di~v~ 85 (122) T cd02071 80 GDILVV 85 (122) T ss_pred CCCEEE T ss_conf 984699 No 223 >COG2210 Peroxiredoxin family protein [General function prediction only] Probab=29.82 E-value=43 Score=14.73 Aligned_cols=20 Identities=20% Similarity=0.184 Sum_probs=9.5 Q ss_pred HHHHHHHHHHH-CCCEEEEEE Q ss_conf 58999999998-498899998 Q gi|254780936|r 124 CFTTLVAALQR-KVKKVTIVS 143 (182) Q Consensus 124 Df~pli~~lr~-~Gk~V~v~~ 143 (182) +..+.+..++. .|.+|+.++ T Consensus 77 ~~le~l~~ak~~~~vkv~AC~ 97 (137) T COG2210 77 AWLELLREAKKDGGVKVYACS 97 (137) T ss_pred CHHHHHHHHHHHCCCEEEEEH T ss_conf 079999987561692899705 No 224 >COG0399 WecE Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis [Cell envelope biogenesis, outer membrane] Probab=29.65 E-value=43 Score=14.71 Aligned_cols=109 Identities=16% Similarity=0.131 Sum_probs=57.6 Q ss_pred EEEEEHHHHHHHHHHCCCCCCHHHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCC Q ss_conf 99980789999998508767989999998517929999997306887314677757999889755956975345895378 Q gi|254780936|r 9 ALFIDGANLYASSKALGFDIDYRKLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTEN 88 (182) Q Consensus 9 aIfID~~Nl~~~~~~~~~~~d~~~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~ 88 (182) -||||-+.- .+++|...+-+.+...-......-+.+.+.+.. .+.+-.++.|.-|+.=--.-. T Consensus 98 PVFvDid~~-------T~nid~~~ie~aIt~~tKAIipVhl~G~~~dm~-------~i~~la~~~~l~vIEDaAqa~--- 160 (374) T COG0399 98 PVFVDIDPD-------TLNIDPDLIEAAITPRTKAIIPVHLAGQPCDMD-------AIMALAKRHGLPVIEDAAQAH--- 160 (374) T ss_pred EEEEECCCC-------CCCCCHHHHHHHCCCCCEEEEEEHHCCCCCCHH-------HHHHHHHHCCCEEEEECCHHC--- T ss_conf 799964764-------147799999987155774999851216877989-------999999873984998760320--- Q ss_pred CCCCCCC--CCCHHHHHHHHHHH---H-CC---CCEEEEECCCHHHHHHHHHHHHCCCEE Q ss_conf 8865435--65329999999994---2-11---588999838665899999999849889 Q gi|254780936|r 89 CGRKRVK--SSMDVELAVDAFEQ---S-EG---LEHLVIFSGDGCFTTLVAALQRKVKKV 139 (182) Q Consensus 89 ~~~~~~k--k~~Dv~laiD~~~~---a-~~---~d~~iLvSGD~Df~pli~~lr~~Gk~V 139 (182) +.....| ++.. |+-.. + .+ .+-=.++|.|.++...++.+|.+|+.+ T Consensus 161 Ga~y~gk~vGt~G-----d~~~fSF~~~K~ittgEGGav~tnd~ela~k~~~lr~hG~~~ 215 (374) T COG0399 161 GATYKGKKVGSFG-----DIGAFSFHATKNLTTGEGGAVVTNDEELAEKARSLRNHGLSR 215 (374) T ss_pred CCEECCCCCCCCC-----CEEEEEECCCCCCCCCCCEEEEECCHHHHHHHHHHHHHCCCC T ss_conf 6723584255666-----468998658776556675079838899999999999837677 No 225 >PRK13938 phosphoheptose isomerase; Provisional Probab=29.62 E-value=43 Score=14.71 Aligned_cols=52 Identities=15% Similarity=0.180 Sum_probs=37.7 Q ss_pred HHCCCCEEEEECCCH---HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEH Q ss_conf 421158899983866---5899999999849889999831577654708999852111518 Q gi|254780936|r 109 QSEGLEHLVIFSGDG---CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDL 166 (182) Q Consensus 109 ~a~~~d~~iLvSGD~---Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l 166 (182) +++.=|.+|.+|+-| -.+.+++.++++|.+|+.++-.. ...|.+.||..+.+ T Consensus 110 l~~~gDvLi~iStSGnS~NIi~A~~~Ak~~g~~~i~ltG~~------gg~l~~l~D~~i~v 164 (196) T PRK13938 110 SARPGDTLFAISTSGNSMSVLRAAKTARELGVTVVAMTGES------GGQLAEFADFLINV 164 (196) T ss_pred CCCCCCEEEEECCCCCCHHHHHHHHHHHHCCCEEEEEECCC------CCHHHHCCCEEEEE T ss_conf 16899989997799999999999999998799899998899------63177548989993 No 226 >cd02553 PseudoU_synth_RsuA PseudoU_synth_RsuA: Pseudouridine synthase, Escherichia coli RsuA like. This group is comprised of eukaryotic and bacterial proteins similar to Escherichia coli RsuA. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). No cofactors are required. E.coli RsuA makes psi516 in 16S RNA. Psi at this position is not generally conserved in other organisms. Probab=29.54 E-value=44 Score=14.70 Aligned_cols=46 Identities=17% Similarity=0.224 Sum_probs=31.9 Q ss_pred CEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCC Q ss_conf 8899983866589999999984988999983157765470899985211 Q gi|254780936|r 114 EHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADY 162 (182) Q Consensus 114 d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~ 162 (182) .-.+|+|.|++|+.-+.+-+..=.+...+.+.... +...|.+.... T Consensus 47 ~GLLLlTNDG~l~~~l~hP~~~i~K~Y~V~v~~~i---~~~~l~~l~~G 92 (167) T cd02553 47 TGLLLLTNDGQLAHRLTSPKKHVPKTYEVTLAGPL---TEDDIEAFAEG 92 (167) T ss_pred CCEEEECCCHHHHHHHHCCCCCCCEEEEEEECCCC---CHHHHHHHHCC T ss_conf 62299918888997641444788889999876789---99999999769 No 227 >cd01567 NAPRTase_PncB Nicotinate phosphoribosyltransferase (NAPRTase) family. Nicotinate phosphoribosyltransferase catalyses the formation of NAMN and PPi from 5-phosphoribosy -1-pyrophosphate (PRPP) and nicotinic acid, this is the first, and also rate limiting, reaction in the NAD salvage synthesis. This salvage pathway serves to recycle NAD degradation products. Probab=29.46 E-value=44 Score=14.69 Aligned_cols=46 Identities=17% Similarity=0.051 Sum_probs=31.1 Q ss_pred HHHHHHHHHHHH---CC--CCEEEEECCCHHHHHHHHHHHHCCC-EEEEEEE Q ss_conf 299999999942---11--5889998386658999999998498-8999983 Q gi|254780936|r 99 DVELAVDAFEQS---EG--LEHLVIFSGDGCFTTLVAALQRKVK-KVTIVST 144 (182) Q Consensus 99 Dv~laiD~~~~a---~~--~d~~iLvSGD~Df~pli~~lr~~Gk-~V~v~~~ 144 (182) =..++..+.++. .- -+..|++|||.|+...++.++..+. .+.++++ T Consensus 274 p~~~~~~~~~~ld~~G~~~~~~~iv~S~~ld~e~~i~~~~~~~~~~~~~FGV 325 (343) T cd01567 274 PVELIKKVRKHLDELGIDLNKKKIIISGDLDTEEAIELLLEQGASPNDAFGV 325 (343) T ss_pred HHHHHHHHHHHHHHCCCCCCCEEEEEECCCCHHHHHHHHHHCCCCCCEEEEE T ss_conf 5999999999999759997873899989998899999999738987539970 No 228 >PRK08367 porA pyruvate ferredoxin oxidoreductase subunit alpha; Reviewed Probab=29.20 E-value=44 Score=14.67 Aligned_cols=49 Identities=16% Similarity=0.191 Sum_probs=38.0 Q ss_pred CCEEEEECCC--HHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCC Q ss_conf 5889998386--6589999999984988999983157765470899985211 Q gi|254780936|r 113 LEHLVIFSGD--GCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADY 162 (182) Q Consensus 113 ~d~~iLvSGD--~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~ 162 (182) -|.+|+.-|- +---.+++.||+.|.+|.++-.. ..|.....+|++.+.. T Consensus 262 Ae~viV~~GS~~~~~k~AVd~lR~~G~KVGllr~r-~~rPFP~e~l~~~~k~ 312 (395) T PRK08367 262 AEIIFVTMGSLAGTLKEWVDKKREEGYKVGAAKIT-VYRPFPVEEIRELAKK 312 (395) T ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEE-EECCCCHHHHHHHHCC T ss_conf 88999981742899999999999749857689988-8589999999998657 No 229 >COG0771 MurD UDP-N-acetylmuramoylalanine-D-glutamate ligase [Cell envelope biogenesis, outer membrane] Probab=29.09 E-value=44 Score=14.65 Aligned_cols=48 Identities=21% Similarity=0.427 Sum_probs=33.2 Q ss_pred CCCC--EEEEECCCH---HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCC Q ss_conf 1158--899983866---58999999998498899998315776547089998521 Q gi|254780936|r 111 EGLE--HLVIFSGDG---CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQAD 161 (182) Q Consensus 111 ~~~d--~~iLvSGD~---Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad 161 (182) ..++ .+.|+.|+. ||.|+...++..-+.+++++.. ....++.|.+... T Consensus 340 ~~~~~~v~lI~GG~~Kg~df~~L~~~~~~~~~~~~~~G~~---~~~i~~~l~~~~~ 392 (448) T COG0771 340 SGFDGPVILIAGGDDKGADFSPLAEILAKVIKKLVLIGED---AEKIAAALKEAGP 392 (448) T ss_pred HCCCCCEEEEECCCCCCCCHHHHHHHHHHCCEEEEEECCC---HHHHHHHHHHCCC T ss_conf 7189977999777788888168899774155599994788---8999999875278 No 230 >cd03195 GST_C_4 GST_C family, unknown subfamily 4; composed of uncharacterized proteins with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Probab=28.92 E-value=30 Score=15.68 Aligned_cols=58 Identities=16% Similarity=0.126 Sum_probs=35.3 Q ss_pred CCCCHHHHHHHHHHHH-----CC-CCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHH Q ss_conf 5653299999999942-----11-5889998386658999999998498899998315776547089998521115189 Q gi|254780936|r 95 KSSMDVELAVDAFEQS-----EG-LEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLA 167 (182) Q Consensus 95 kk~~Dv~laiD~~~~a-----~~-~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~ 167 (182) ....|+.-.+.+.+.. ++ +..+.| .|.||.|++.++..+|..| +..|..++....... T Consensus 42 a~~ad~~rl~~~a~~ll~~g~p~LFG~wsI--ADa~~Apv~~Rl~~yg~~v-------------p~~~~~Ya~~~~~~p 105 (114) T cd03195 42 AAQAAAEKLIAVAEALLPPGAANLFGEWCI--ADTDLALMLNRLVLNGDPV-------------PERLRDYARRQWQRP 105 (114) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCHH--HHHHHHHHHHHHHHCCCCC-------------CHHHHHHHHHHHCCH T ss_conf 999999999999999874689864253439--9899999999998838999-------------989999999998788 No 231 >cd02554 PseudoU_synth_RluF PseudoU_synth_RluF_like: Pseudouridine synthase, Escherichia coli RluF like. This group is comprised of bacterial proteins similar to Escherichia coli RluF. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). No cofactors are required. E.coli RluF makes psi2604 in 23S RNA. psi2604 has only been detected in E. coli. It is absent from other eubacteria despite a precursor U at that site and from eukarya and archea which lack a precursor U at that site. Probab=28.31 E-value=41 Score=14.84 Aligned_cols=47 Identities=13% Similarity=0.176 Sum_probs=31.3 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCC Q ss_conf 58899983866589999999984988999983157765470899985211 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADY 162 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~ 162 (182) -.-.+|+|.|+||+.-+.+-+..=.++..+...... +...|.....+ T Consensus 43 s~GLLLlTNDg~l~~~L~hP~~~i~k~Y~V~~~~~i---~~~~l~~l~~G 89 (164) T cd02554 43 SEGLILLTNDGDLVNKILHADNNHEKEYLVTVNKPI---TDEFIEGMSNG 89 (164) T ss_pred CCCEEEEECCHHHHHHHCCCCCCCCCEEEEEECCCC---CHHHHHHHHCC T ss_conf 856599957889997621998889629999995669---99999987478 No 232 >PRK09271 flavodoxin; Provisional Probab=28.29 E-value=46 Score=14.57 Aligned_cols=84 Identities=14% Similarity=0.141 Sum_probs=48.0 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEC---CC Q ss_conf 999730688731467775799988975595697534589537888654356532999999999421158899983---86 Q gi|254780936|r 46 AYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFS---GD 122 (182) Q Consensus 46 ~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvS---GD 122 (182) ...|.+..++..... ....+.|+..|++|.... +|.....+.-.....+|.++|.| || T Consensus 4 lIvYaS~TGNTE~vA---~~I~~~l~~~G~eV~~~e----------------~d~~~~~~~~~d~~~yDl~llG~yTwgd 64 (160) T PRK09271 4 LLAYASLSGNTREVA---RMIAARCEEAGHAVTWVE----------------TDLQTLAQAPLDPEEYDLFLLGTWTDNA 64 (160) T ss_pred EEEEECCCCCHHHHH---HHHHHHHHHCCCEEEEEE----------------ECCHHHHHCCCCCCCCCEEEEECCCCCC T ss_conf 999984887689999---999999997698237987----------------0102322033561118889996570489 Q ss_pred H-------HHHHHHHHHHHCCCEEEEEEEECCC Q ss_conf 6-------5899999999849889999831577 Q gi|254780936|r 123 G-------CFTTLVAALQRKVKKVTIVSTVLSD 148 (182) Q Consensus 123 ~-------Df~pli~~lr~~Gk~V~v~~~~~~~ 148 (182) | ||.+-+..+--.||+|.|++..... T Consensus 65 G~lPdE~~DF~e~L~~~dl~gk~~avFGsGDt~ 97 (160) T PRK09271 65 GRTPPEMKRFIAELVETIGKPPNVAVFGTGETQ 97 (160) T ss_pred CCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCC T ss_conf 989667899999998604568769998777776 No 233 >PRK05329 anaerobic glycerol-3-phosphate dehydrogenase subunit B; Validated Probab=28.08 E-value=46 Score=14.54 Aligned_cols=40 Identities=15% Similarity=0.022 Sum_probs=27.8 Q ss_pred HHHHHCCCCEEEE--ECCCHHHHHHHHHHH-HCCCEEEEEEEE Q ss_conf 9994211588999--838665899999999-849889999831 Q gi|254780936|r 106 AFEQSEGLEHLVI--FSGDGCFTTLVAALQ-RKVKKVTIVSTV 145 (182) Q Consensus 106 ~~~~a~~~d~~iL--vSGD~Df~pli~~lr-~~Gk~V~v~~~~ 145 (182) +..++...|.+++ |-|..|-..+.++|+ ..|+.|.-+-+. T Consensus 213 l~~~~~~~d~V~lPAvlGl~~~~~v~~~L~~~lg~~v~EvpTl 255 (425) T PRK05329 213 LKPLAGDAEAVLLPACLGLDDDAAVLAELEEALGCPVFELPTL 255 (425) T ss_pred HHHHCCCCCEEEECCCCCCCCHHHHHHHHHHHHCCCEEECCCC T ss_conf 9975689888998973068976899999999889988988899 No 234 >COG1242 Predicted Fe-S oxidoreductase [General function prediction only] Probab=28.07 E-value=46 Score=14.54 Aligned_cols=38 Identities=21% Similarity=0.366 Sum_probs=25.4 Q ss_pred HHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCC Q ss_conf 658999999998498899998315776547089998521 Q gi|254780936|r 123 GCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQAD 161 (182) Q Consensus 123 ~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad 161 (182) ++|+.++.++|++|.+|. .....|.++-+..+..+.+. T Consensus 168 ~~y~dav~r~rkrgIkvc-~HiI~GLPgE~~~~mleTak 205 (312) T COG1242 168 ACYVDAVKRLRKRGIKVC-THLINGLPGETRDEMLETAK 205 (312) T ss_pred HHHHHHHHHHHHCCCEEE-EEEEECCCCCCHHHHHHHHH T ss_conf 999999999997497498-88840798888899999999 No 235 >cd02556 PseudoU_synth_RluB PseudoU_synth_RluB: Pseudouridine synthase, Escherichia coli RluB like. This group is comprised of bacterial and eukaryotic proteins similar to E. coli RluB. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). No cofactors are required. E.coli RluB makes psi2605 in 23S RNA. psi2605 has been detected in eubacteria but, not in eukarya and archea despite the presence of a precursor U at that site. Probab=28.02 E-value=46 Score=14.54 Aligned_cols=48 Identities=17% Similarity=0.234 Sum_probs=31.5 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCC Q ss_conf 588999838665899999999849889999831577654708999852111 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYF 163 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~f 163 (182) -.-.+|+|.|++|+.-+.+=+..-.++..+...... ....|.....++ T Consensus 46 s~GLLLlTnDG~l~~~l~hP~~~i~k~Y~V~v~~~~---~~~~l~~l~~Gv 93 (167) T cd02556 46 TEGLLLFTNDGELANRLMHPSNEIEREYAVRVFGQV---TDEQLKSLKKGV 93 (167) T ss_pred CCCEEEEECCHHHHHHHCCCCCCCCEEEEEEECCCC---CHHHHHHHHCCC T ss_conf 678599927836757640987788579999975899---999999987797 No 236 >PRK07251 pyridine nucleotide-disulfide oxidoreductase; Provisional Probab=27.99 E-value=46 Score=14.53 Aligned_cols=32 Identities=19% Similarity=0.216 Sum_probs=21.5 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 58899983866589999999984988999983 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVKKVTIVST 144 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~ 144 (182) -..++++-|.-==+++...+.+.|.+|+++.- T Consensus 157 p~~v~ViGgG~ig~E~A~~~~~lG~~Vtli~~ 188 (438) T PRK07251 157 PKRLGILGGGNIGLEFAGLYNKLGSKVTVLDA 188 (438) T ss_pred CCEEEEECCCHHHHHHHHHHHHCCCEEEEEEE T ss_conf 97699988864588999999834876899984 No 237 >cd01480 vWA_collagen_alpha_1-VI-type VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions. Probab=27.74 E-value=47 Score=14.51 Aligned_cols=62 Identities=16% Similarity=0.152 Sum_probs=39.1 Q ss_pred CCCHHHHHHHHHHH----H--CC-CCEEEEEC-CC------HHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCC Q ss_conf 65329999999994----2--11-58899983-86------658999999998498899998315776547089998521 Q gi|254780936|r 96 SSMDVELAVDAFEQ----S--EG-LEHLVIFS-GD------GCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQAD 161 (182) Q Consensus 96 k~~Dv~laiD~~~~----a--~~-~d~~iLvS-GD------~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad 161 (182) +++.+.-|+..+.. . +. -..+||+| |- ....++++.+|..|..|..+++.. ....+|...|. T Consensus 86 G~T~tg~AL~~a~~~~~~~~r~~~~kvlvliTDG~S~~~~~~~~~~aa~~lr~~GV~ifaVGVG~----~~~~eL~~IAs 161 (186) T cd01480 86 GGTFTDCALKYATEQLLEGSHQKENKFLLVITDGHSDGSPDGGIEKAVNEADHLGIKIFFVAVGS----QNEEPLSRIAC 161 (186) T ss_pred CCCHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC----CCHHHHHHHHC T ss_conf 98629999999999986136789853899984587666740669999999998798999999474----88799999858 No 238 >cd01540 PBP1_arabinose_binding Periplasmic L-arabinose-binding protein (ABP), a member of a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. Periplasmic L-arabinose-binding protein (ABP), a member of a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. ABP is only involved in transport contrary to other related sugar-binding proteins such as the glucose/galactose-binding protein (GGBP) and the ribose-binding protein (RBP), both of which are involved in chemotaxis as well as transport. The periplasmic ABP consists of two alpha/beta globular domains connected by a three-stranded hinge, a Venus flytrap-like domain, which undergoes a transition from an open to a closed conformational state upon ligand binding. Moreover, ABP is homologous to the ligand-binding domain of eukaryotic receptors such as metabotropic glutamate receptor (mGluR) and DNA-binding transcriptional repressors such a Probab=27.63 E-value=47 Score=14.49 Aligned_cols=78 Identities=18% Similarity=0.089 Sum_probs=51.3 Q ss_pred HHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH----CCCCE-EEEECCCHHHHHHHHHHH Q ss_conf 6777579998897559569753458953788865435653299999999942----11588-999838665899999999 Q gi|254780936|r 59 QFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS----EGLEH-LVIFSGDGCFTTLVAALQ 133 (182) Q Consensus 59 ~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a----~~~d~-~iLvSGD~Df~pli~~lr 133 (182) ...+...|.+.|...|+.-..... ....++|..-+.++++.. +.++. +|...+|.-=+-+++.++ T Consensus 145 ~~~R~~Gf~~~l~~~g~p~~~vv~----------~~~~~~~~~~a~~~~~~~L~a~pdi~~~~i~a~ND~mAlGA~~Al~ 214 (289) T cd01540 145 AKPRTDGALEALKAPGFPEANIFQ----------APQKTTDTEGAFDAAASTLTKNPNVKNWIIYGLNDETVLGAVRATE 214 (289) T ss_pred HHHHHHHHHHHHHHCCCCCCEEEE----------ECCCCCCHHHHHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHHH T ss_conf 889987699999635999607998----------4568866999999999999758998843999779668899999999 Q ss_pred HCC---CEEEEEEEEC Q ss_conf 849---8899998315 Q gi|254780936|r 134 RKV---KKVTIVSTVL 146 (182) Q Consensus 134 ~~G---k~V~v~~~~~ 146 (182) +.| ++|.++++.. T Consensus 215 ~aGl~~~~v~vvgidg 230 (289) T cd01540 215 QSGIAAADVIGVGING 230 (289) T ss_pred HCCCCCCCEEEEEEEC T ss_conf 7499889769999728 No 239 >cd02555 PSSA_1 PSSA_1: Pseudouridine synthase, a subgroup of the RsuA family. This group is comprised of bacterial proteins assigned to the RsuA family of pseudouridine synthases. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). No cofactors are required. The TruA family is comprised of proteins related to Escherichia coli RsuA. Probab=27.23 E-value=48 Score=14.45 Aligned_cols=47 Identities=17% Similarity=0.154 Sum_probs=32.4 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCC Q ss_conf 58899983866589999999984988999983157765470899985211 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADY 162 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~ 162 (182) ..-.+|+|.|+||+--+.+=+..-.++..+.+.... +...+...... T Consensus 57 s~GLLllTNDG~l~~~l~~P~~~i~k~Y~V~v~~~~---~~~~l~~l~~G 103 (177) T cd02555 57 ASGLLVFSQDGRVLRKLIGDASRLEQEYLVEVRGEL---TAGGLERLNHG 103 (177) T ss_pred CCEEEEEECCHHHHHHHCCCCCCCCEEEEEEECCCC---CHHHHHHHHCC T ss_conf 811899918889998754986689789999989879---99999875287 No 240 >TIGR00250 TIGR00250 conserved hypothetical protein TIGR00250; InterPro: IPR005227 Holliday junction resolvases (HJRs) are key enzymes of DNA recombination. The principal HJRs are now known or confidently predicted for all bacteria and archaea whose genomes have been completely sequenced, with many species encoding multiple potential HJRs. Structural and evolutionary relationships of HJRs and related nucleases suggests that the HJR function has evolved independently from at least four distinct structural folds, namely RNase H, endonuclease, endonuclease VIIcolicin E and RusA (IPR008822 from INTERPRO): The endonuclease fold, whose structural prototypes are the phage exonuclease, the very short patch repair nuclease (Vsr) and type II restriction enzymes, is shown to encompass by far a greater diversity of nucleases than previously suspected. This fold unifies archaeal HJRs (IPR002732 from INTERPRO), repair nucleases such as RecB (IPR004586 from INTERPRO) and Vsr (IPR004603 from INTERPRO), restriction enzymes and a variety of predicted nucleases whose specific activities remain to be determined. The RNase H fold characterises the RuvC family (IPR002176 from INTERPRO), which is nearly ubiquitous in bacteria, and in addition the YqgF family (IPR005227 from INTERPRO). The proteins of this family, typified by Escherichia coli YqgF, are likely to function as an alternative to RuvC in most bacteria, but could be the principal HJRs in low-GC Gram-positive bacteria and Aquifex. Endonuclease VII of phage T4 (IPR004211 from INTERPRO) is shown to serve as a structural template for many nucleases, including McrA and other type II restriction enzymes. Together with colicin E7, endonuclease VII defines a distinct metal-dependent nuclease fold. Horizontal gene transfer, lineage-specific gene loss and gene family expansion, and non-orthologous gene displacement seem to have been major forces in the evolution of HJRs and related nucleases. A remarkable case of displacement is seen in the Lyme disease spirochete Borrelia burgdorferi, which does not possess any of the typical HJRs, but instead encodes, in its chromosome and each of the linear plasmids, members of the exonuclease family predicted to function as HJRs. The diversity of HJRs and related nucleases in bacteria and archaea contrasts with their near absence in eukaryotes. The few detected eukaryotic representatives of the endonuclease fold and the RNase H fold have probably been acquired from bacteria via horizontal gene transfer. The identity of the principal HJR(s) involved in recombination in eukaryotes remains uncertain; this function could be performed by topoisomerase IB or by a novel, so far undetected, class of enzymes. Likely HJRs and related nucleases were identified in the genomes of numerous bacterial and eukaryotic DNA viruses. Gene flow between viral and cellular genomes has probably played a major role in the evolution of this class of enzymes. This family represents the YqgF family of putative Holliday junction resolvases. With the exception of the spirochetes, the YqgF family is represented in all bacterial lineages, including the mycoplasmas with their highly degenerate genomes. The RuvC resolvases are conspicuously absent in the low-GC Gram-positive bacterial lineage, with the exception of Ureaplasma urealyticum (Q9PQY7 from SWISSPROT, ). Furthermore, loss of function ruvC mutants of E. coli show a residual HJR activity that cannot be ascribed to the prophage-encoded RusA resolvase . This suggests that the YqgF family proteins could be alternative HJRs whose function partially overlaps with that of RuvC . ; GO: 0000150 recombinase activity, 0003677 DNA binding, 0004518 nuclease activity, 0006281 DNA repair, 0006310 DNA recombination, 0006974 response to DNA damage stimulus. Probab=27.23 E-value=48 Score=14.45 Aligned_cols=42 Identities=12% Similarity=0.171 Sum_probs=28.5 Q ss_pred HHCCCCEEEEECCCHHH--HHHHHHHHHCCCEEEEEEEECCCCC Q ss_conf 42115889998386658--9999999984988999983157765 Q gi|254780936|r 109 QSEGLEHLVIFSGDGCF--TTLVAALQRKVKKVTIVSTVLSDPS 150 (182) Q Consensus 109 ~a~~~d~~iLvSGD~Df--~pli~~lr~~Gk~V~v~~~~~~~~~ 150 (182) .|+.+.++..-+++.|+ ..+-+-++++....+|+++...-.+ T Consensus 22 tA~gi~tik~~~~~~d~Gl~~i~~l~ke~~~d~~vvGlP~nM~G 65 (133) T TIGR00250 22 TAQGIPTIKAQDGEPDWGLSRIEELLKEWKVDKIVVGLPLNMDG 65 (133) T ss_pred EECCCHHHHHCCCCCCCCHHHHHHHHHHCCCCEEEECCCCCCCC T ss_conf 10440311210488862179999987315889788617878788 No 241 >cd06342 PBP1_ABC_LIVBP_like Type I periplasmic ligand-binding domain of ABC (Atpase Binding Cassette)-type active transport systems that are involved in the transport of all three branched chain aliphatic amino acids (leucine, isoleucine and valine). This subgroup includes the type I periplasmic ligand-binding domain of ABC (Atpase Binding Cassette)-type active transport systems that are involved in the transport of all three branched chain aliphatic amino acids (leucine, isoleucine and valine). This subgroup also includes a leucine-specific binding protein (or LivK), which is very similar in sequence and structure to leucine-isoleucine-valine binding protein (LIVBP). ABC-type active transport systems are transmembrane proteins that function in the transport of diverse sets of substrates across extra- and intracellular membranes, including carbohydrates, amino acids, inorganic ions, dipeptides and oligopeptides, metabolic products, lipids and sterols, and heme, to name a few. Probab=27.02 E-value=48 Score=14.43 Aligned_cols=112 Identities=10% Similarity=-0.022 Sum_probs=62.5 Q ss_pred HHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 99999985179299999973068873146777579998897559569753458953788865435653299999999942 Q gi|254780936|r 31 RKLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS 110 (182) Q Consensus 31 ~~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a 110 (182) ..+.+++.+.....+...... .... .......+.+.++..|.++..... + ..+..|..=.+..+. + T Consensus 123 ~~~~~~l~~~~~~~~vai~~~-~~~~--g~~~~~~~~~~l~~~g~~vv~~~~--~--------~~~~~d~~~~i~~l~-~ 188 (334) T cd06342 123 PAAAKYAVETLKAKKVAIIDD-KTAY--GQGLADEFKKALKAAGGKVVAREG--T--------TDGATDFSAILTKIK-A 188 (334) T ss_pred HHHHHHHHHHCCCCEEEEEEC-CCCC--CHHHHHHHHHHHHHCCCEEEEEEE--C--------CCCCCCHHHHHHHHH-H T ss_conf 999999998759968999915-6554--455799999999974983999996--5--------887666578999998-6 Q ss_pred CCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHC Q ss_conf 11588999838665899999999849889999831577654708999852 Q gi|254780936|r 111 EGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQA 160 (182) Q Consensus 111 ~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~a 160 (182) .+.|.+++.....|.+.+++.+++.|..+.+++.. +..+..+.+.+ T Consensus 189 ~~~d~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~----~~~~~~~~~~~ 234 (334) T cd06342 189 ANPDAVFFGGYYPEAGPLVRQMRQLGLKAPFMGGD----GLCDPEFIKIA 234 (334) T ss_pred CCCCEEEECCCCHHHHHHHHHHHHCCCCCEEEEEC----CCCCHHHHHHH T ss_conf 59999999267558999999999769997599967----77878999864 No 242 >PRK01254 hypothetical protein; Provisional Probab=27.00 E-value=48 Score=14.42 Aligned_cols=19 Identities=16% Similarity=0.163 Sum_probs=10.4 Q ss_pred CCHHHHHHHHHHHHCCCEE Q ss_conf 8665899999999849889 Q gi|254780936|r 121 GDGCFTTLVAALQRKVKKV 139 (182) Q Consensus 121 GD~Df~pli~~lr~~Gk~V 139 (182) .|.|++.|...||..|.+. T Consensus 571 t~~Dm~~LA~~lk~~~~~p 589 (742) T PRK01254 571 TDEDMVNLALWLKKNRFRL 589 (742) T ss_pred CHHHHHHHHHHHHHCCCCH T ss_conf 9999999999999739997 No 243 >TIGR00338 serB phosphoserine phosphatase SerB; InterPro: IPR004469 Phosphoserine phosphatase (SerB), (3.1.3.3 from EC), also known as O-phosphoserine phosphohydrolase, is involved in both serine and glycine biosynthesis. It catalyzes the reaction 3-phospho-serine + H2O = L-serine + phosphate, which is the last step in the biosynthesis of serine from carbohydrates. The reaction proceeds via the formation of a phosphoryl-enzyme intermediate. It acts as a homodimer, and requires magnesium as a cofactor.; GO: 0004647 phosphoserine phosphatase activity, 0006564 L-serine biosynthetic process. Probab=26.81 E-value=49 Score=14.40 Aligned_cols=46 Identities=17% Similarity=0.292 Sum_probs=24.8 Q ss_pred EEEEECCCH--HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHHHHHHH Q ss_conf 899983866--58999999998498899998315776547089998521115189978762 Q gi|254780936|r 115 HLVIFSGDG--CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAYLKNEI 173 (182) Q Consensus 115 ~~iLvSGD~--Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~l~~~i 173 (182) .-+++=||| |. +. ++..|..+ +| ++-.-|++.||..|+-.+|.+.. T Consensus 173 ~~~vavGDGANDl-~m---~~~AglgI---AF------~AKp~l~~~A~~~i~~~dL~~i~ 220 (223) T TIGR00338 173 ENTVAVGDGANDL-SM---IKAAGLGI---AF------NAKPKLKQKADIVINKKDLTDIL 220 (223) T ss_pred HHEEEECCCCCHH-HH---HHHCCCEE---EE------CCCCHHCCCCCEEECHHHHHHHH T ss_conf 2187762785608-88---86278734---12------37600002157034547799998 No 244 >TIGR02177 PorB_KorB 2-oxoacid:acceptor oxidoreductase, beta subunit, pyruvate/2-ketoisovalerate family; InterPro: IPR011896 A number of anaerobic and microaerophilic species lack pyruvate dehydrogenase and have instead a four subunit, oxygen-sensitive pyruvate oxidoreductase, with either ferredoxins or flavodoxins used as the acceptor. Several related four-subunit enzymes may exist in the same species. This entry describes the beta subunits, representing mostly pyruvate and 2-ketoisovalerate specific enzymes.. Probab=26.73 E-value=29 Score=15.79 Aligned_cols=11 Identities=27% Similarity=0.782 Sum_probs=6.8 Q ss_pred EEEEECCCHHH Q ss_conf 89998386658 Q gi|254780936|r 115 HLVIFSGDGCF 125 (182) Q Consensus 115 ~~iLvSGD~Df 125 (182) .+|+.+||||. T Consensus 73 ~Viv~GGDGD~ 83 (302) T TIGR02177 73 KVIVVGGDGDL 83 (302) T ss_pred EEEEEECCCCC T ss_conf 68986068641 No 245 >TIGR00064 ftsY signal recognition particle-docking protein FtsY; InterPro: IPR004390 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family includes the cell division ABC transporter and the periplasmic substrate-binding protein FtsY. There is a weak division between FtsY and SRP54; both are GTPases. In Escherichia coli, ftsY is an essential gene located in an operon with cell division genes ftsE and ftsX, but its apparent function is as the signal recognition particle docking protein.; GO: 0005525 GTP binding. Probab=26.54 E-value=49 Score=14.37 Aligned_cols=76 Identities=17% Similarity=0.268 Sum_probs=39.5 Q ss_pred CCCCCHHHHHHHHHHCCE-EEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCH-HHH Q ss_conf 876798999999851792-999999730688731467775799988975595697534589537888654356532-999 Q gi|254780936|r 25 GFDIDYRKLLKAFRSRAI-VIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMD-VEL 102 (182) Q Consensus 25 ~~~~d~~~L~~~l~~~~~-l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~D-v~l 102 (182) |..--..||.+++..+|. ++-+-. +.=+.-...+...|-++.|..++..+ ++++| ... T Consensus 94 GKTTTIaKLA~~l~~~Gk~V~laAg-------DTFRAAA~EQL~~Wa~R~gv~vi~~~-------------~gn~DPAaV 153 (284) T TIGR00064 94 GKTTTIAKLANKLKKQGKSVLLAAG-------DTFRAAAIEQLEVWAKRLGVDVIKQK-------------EGNADPAAV 153 (284) T ss_pred CHHHHHHHHHHHHHHCCCEEEEECC-------CHHHHHHHHHHHHHHHHHCCEEEECC-------------CCCCCCHHH T ss_conf 6010288999999874990899827-------52479999999998988387554078-------------898871789 Q ss_pred HHHHHHHH--CCCCEEEEEC Q ss_conf 99999942--1158899983 Q gi|254780936|r 103 AVDAFEQS--EGLEHLVIFS 120 (182) Q Consensus 103 aiD~~~~a--~~~d~~iLvS 120 (182) +=|+++.| ++||.+++=| T Consensus 154 ~fDAi~~Ak~~niDvvliDT 173 (284) T TIGR00064 154 IFDAIQAAKARNIDVVLIDT 173 (284) T ss_pred HHHHHHHHHHCCCCEEEEEC T ss_conf 99989999874997899734 No 246 >PRK11394 23S rRNA pseudouridine synthase E; Provisional Probab=26.40 E-value=50 Score=14.36 Aligned_cols=47 Identities=15% Similarity=0.198 Sum_probs=34.1 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCC Q ss_conf 58899983866589999999984988999983157765470899985211 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADY 162 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~ 162 (182) -.-.+|+|.|+||+--+.+=+..-.++..+.+... .+...|.....+ T Consensus 72 SeGLLLlTNDG~la~~L~hP~~~i~K~Y~V~v~g~---~~~~~l~~l~~G 118 (207) T PRK11394 72 SEGLLVLTNNGALQARLTQPGKRTGKIYYVQVEGI---PTQDALEALRNG 118 (207) T ss_pred CCCEEEECCCHHHHHHHHCCCCCCCEEEEEEECCC---CCHHHHHHHHCC T ss_conf 87729980878999875256579778999999168---999999999779 No 247 >PRK09288 purT phosphoribosylglycinamide formyltransferase 2; Validated Probab=26.39 E-value=50 Score=14.36 Aligned_cols=89 Identities=20% Similarity=0.153 Sum_probs=48.6 Q ss_pred HHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHH-HH----------------CCCCEEEEECCCHH--HH Q ss_conf 9988975595697534589537888654356532999999999-42----------------11588999838665--89 Q gi|254780936|r 66 LLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFE-QS----------------EGLEHLVIFSGDGC--FT 126 (182) Q Consensus 66 ~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~-~a----------------~~~d~~iLvSGD~D--f~ 126 (182) +.+.+...|+-++.|+...+ |+.|....++..|..-+.+... .+ ..+..++.-++|+. |. T Consensus 141 l~~~~~~lG~P~vlK~~~G~-DGKGq~~v~~~~dl~~a~~~a~~~~~~~~~~~ivE~~v~f~~E~Svi~aR~~~g~~~~~ 219 (395) T PRK09288 141 LRAAVEEIGYPCVVKPVMSS-SGKGQSVVRSPEDIEAAWEYAQEGGRGGAGRVIVEEFIDFDYEITLLTVRAVDGGTHFC 219 (395) T ss_pred HHHHHHHHCCCEEEECCCCC-CCCCEEEECCHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCEEEEEEEEECCCCCEEEE T ss_conf 99999982987587224157-88752896687888999999985075789708998885361799999999699988996 Q ss_pred HHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHC Q ss_conf 9999999849889999831577654708999852 Q gi|254780936|r 127 TLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQA 160 (182) Q Consensus 127 pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~a 160 (182) |.++..++.|.-+..+. +...|.++.+.| T Consensus 220 ~pve~~~~~gil~~s~~-----Pa~i~~~v~~~a 248 (395) T PRK09288 220 APIGHRQEDGDYRESWQ-----PQPMSPAALEEA 248 (395) T ss_pred CCCCCEEECCEEEEEEC-----CCCCCHHHHHHH T ss_conf 68542553342799715-----778999999999 No 248 >PRK13228 consensus Probab=26.35 E-value=50 Score=14.35 Aligned_cols=71 Identities=17% Similarity=0.002 Sum_probs=44.0 Q ss_pred CHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHH-CCCCE-EHHHHHHHHC Q ss_conf 32999999999421158899983866589999999984988999983157765470899985-21115-1899787622 Q gi|254780936|r 98 MDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQ-ADYFM-DLAYLKNEIA 174 (182) Q Consensus 98 ~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~-ad~fi-~l~~l~~~i~ 174 (182) -|-.+-+.+++...---.=+++=||+- .=++..++.|.+++.+.... ....+|.++ ||..+ ++.+|.+.+. T Consensus 153 P~P~~~l~a~~~lg~~p~~~v~VgDs~--~di~aA~~AGi~~i~V~~G~----~~~~~l~~~~ad~vv~~l~el~~lL~ 225 (232) T PRK13228 153 PDPLPLLHACEVLGIDPANMVLVGDSA--NDVIAARAAGCPVFCVPYGY----ADGHGLAALDCDAIVDSLPEAYALLT 225 (232) T ss_pred CCHHHHHHHHHHHCCCCCCEEEECCCH--HHHHHHHHCCCEEEEECCCC----CCHHHHHHCCCCEEECCHHHHHHHHH T ss_conf 879999999999297934089884888--79999999599399987999----99778987899999999999999975 No 249 >TIGR01501 MthylAspMutase methylaspartate mutase, S subunit; InterPro: IPR006394 These sequences represent the S (sigma) subunit of methylaspartate mutase (glutamate mutase), a cobalamin-dependent enzyme that catalyzes the first step in a pathway of glutamate fermentation. ; GO: 0016866 intramolecular transferase activity, 0019670 anaerobic glutamate catabolic process. Probab=26.35 E-value=50 Score=14.35 Aligned_cols=60 Identities=15% Similarity=0.130 Sum_probs=37.9 Q ss_pred HHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEC-----CCHHHHHHHHHHHHCCCEEEEEE Q ss_conf 8975595697534589537888654356532999999999421158899983-----86658999999998498899998 Q gi|254780936|r 69 WLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFS-----GDGCFTTLVAALQRKVKKVTIVS 143 (182) Q Consensus 69 ~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvS-----GD~Df~pli~~lr~~Gk~V~v~~ 143 (182) +|.+.||+|+.-=+..-.+.....-.+.++| +|||| ||-|..-+=+++-+.|..=++++ T Consensus 24 af~nagF~V~NlGv~~~~~efi~AAiet~AD----------------AilvSSlYGhg~~DC~Glr~~c~eaGl~~illY 87 (134) T TIGR01501 24 AFRNAGFTVVNLGVLTAQEEFIKAAIETDAD----------------AILVSSLYGHGEIDCAGLRDKCAEAGLDAILLY 87 (134) T ss_pred HHHCCCCEEEEECCCCCHHHHHHHHHCCCCC----------------EEEEECCCCCCCCCHHHHHHHHHHCCCCCEEEE T ss_conf 8732898898721316157888773228998----------------899830207800012457899986588827998 Q ss_pred E Q ss_conf 3 Q gi|254780936|r 144 T 144 (182) Q Consensus 144 ~ 144 (182) + T Consensus 88 V 88 (134) T TIGR01501 88 V 88 (134) T ss_pred E T ss_conf 7 No 250 >pfam02590 SPOUT_MTase Predicted SPOUT methyltransferase. This family of proteins are predicted to be SPOUT methyltransferases. Probab=26.27 E-value=50 Score=14.34 Aligned_cols=68 Identities=13% Similarity=0.126 Sum_probs=42.6 Q ss_pred CCHHHHHHHHHHHH---CCCCEEEEECCCH------HHHHHHHHHHHCCCE--EEEEEEECCCCCCCHHHHHHHCCCCEE Q ss_conf 53299999999942---1158899983866------589999999984988--999983157765470899985211151 Q gi|254780936|r 97 SMDVELAVDAFEQS---EGLEHLVIFSGDG------CFTTLVAALQRKVKK--VTIVSTVLSDPSMASDQLRRQADYFMD 165 (182) Q Consensus 97 ~~Dv~laiD~~~~a---~~~d~~iLvSGD~------Df~pli~~lr~~Gk~--V~v~~~~~~~~~~~S~~L~~~ad~fi~ 165 (182) +++..+..+..... +.-+.+|++.-.| +|...+...+..|.. +.++|-..|. +..|++-||..+. T Consensus 48 ~~~~~~~~E~~~i~~~i~~~~~~I~LDe~Gk~~sS~~fa~~i~~~~~~g~~~i~FiIGGa~G~----~~~~~~~a~~~~S 123 (155) T pfam02590 48 DIEAIKEKEGERILAAIPPGSYVIALDERGKELSSEEFAKLLEDLRLKGKSDITFVIGGSDGL----SEEVLKRANEKLS 123 (155) T ss_pred CHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCC----CHHHHHHHHHEEE T ss_conf 899999999999997379999899993799837879999999999833897559998379886----9899975666344 Q ss_pred HHH Q ss_conf 899 Q gi|254780936|r 166 LAY 168 (182) Q Consensus 166 l~~ 168 (182) |.+ T Consensus 124 ls~ 126 (155) T pfam02590 124 LSK 126 (155) T ss_pred CCC T ss_conf 136 No 251 >cd06267 PBP1_LacI_sugar_binding_like Ligand binding domain of the LacI tanscriptional regulator family belonging to the type I periplasmic-binding fold protein superfamily. Ligand binding domain of the LacI tanscriptional regulator family belonging to the type I periplasmic-binding fold protein superfamily. In most cases, ligands are monosaccharide including lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars. The LacI family of proteins consists of transcriptional regulators related to the lac repressor. In this case, the domain sugar binding changes the DNA binding activity of the repressor domain. Probab=26.19 E-value=50 Score=14.33 Aligned_cols=100 Identities=12% Similarity=0.012 Sum_probs=54.3 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHH---- Q ss_conf 9999851792999999730688731467775799988975595697534589537888654356532999999999---- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFE---- 108 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~---- 108 (182) +.+++.+.+.- ...|............+...|.+++...|......... ....+..-+..++. T Consensus 107 a~~~L~~~G~~--~i~~i~~~~~~~~~~~R~~gf~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~l~ 173 (264) T cd06267 107 AVEHLIELGHR--RIAFIGGPPDLSTARERLEGYREALEEAGIPLDEELIV-----------EGDFSEESGYEAARELLA 173 (264) T ss_pred HHHHHHHHCCC--CEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCHHEEE-----------CCCCCHHHHHHHHHHHHH T ss_conf 99999871888--56896389877069999999999999869998853075-----------278977999999999996 Q ss_pred HHCCCCEEEEECCCHHHHHHHHHHHHCCC----EEEEEEEEC Q ss_conf 42115889998386658999999998498----899998315 Q gi|254780936|r 109 QSEGLEHLVIFSGDGCFTTLVAALQRKVK----KVTIVSTVL 146 (182) Q Consensus 109 ~a~~~d~~iLvSGD~Df~pli~~lr~~Gk----~V~v~~~~~ 146 (182) ..+.+ ++|++++|.--..+++.+++.|. .+-|+++.. T Consensus 174 ~~~~~-~ai~~~~D~~A~g~~~~l~~~g~~iP~di~Iig~d~ 214 (264) T cd06267 174 SGERP-TAIFAANDLMAIGALRALRELGLRVPEDVSVVGFDD 214 (264) T ss_pred CCCCC-CEEEECCHHHHHHHHHHHHHCCCCCCCCEEEEEECC T ss_conf 59999-899878879999999999982999999779999998 No 252 >PRK04761 ppnK inorganic polyphosphate/ATP-NAD kinase; Reviewed Probab=26.15 E-value=50 Score=14.33 Aligned_cols=28 Identities=21% Similarity=0.287 Sum_probs=12.0 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCCEEE Q ss_conf 5889998386658999999998498899 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVKKVT 140 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk~V~ 140 (182) .|.+|.+-|||-|..+++.+...++-+. T Consensus 26 aDliIvlGGDGT~L~a~r~~~~~~~Pil 53 (246) T PRK04761 26 ADVIVALGGDGFMLQTLHRYMNSGKPVY 53 (246) T ss_pred CCEEEEECCCHHHHHHHHHHHCCCCCEE T ss_conf 9999998987999999998741499379 No 253 >COG1004 Ugd Predicted UDP-glucose 6-dehydrogenase [Cell envelope biogenesis, outer membrane] Probab=26.13 E-value=50 Score=14.33 Aligned_cols=21 Identities=14% Similarity=0.056 Sum_probs=9.4 Q ss_pred CCCEEEEECCCHHHHHHHHHH Q ss_conf 158899983866589999999 Q gi|254780936|r 112 GLEHLVIFSGDGCFTTLVAAL 132 (182) Q Consensus 112 ~~d~~iLvSGD~Df~pli~~l 132 (182) +-|.+|+=+-+.+=..+++.| T Consensus 163 ~PdRIViG~~~~~a~~~~~el 183 (414) T COG1004 163 YPDRIVIGVRSERAAAVLREL 183 (414) T ss_pred CCCEEEECCCCHHHHHHHHHH T ss_conf 898389925771689999999 No 254 >PRK13224 consensus Probab=26.08 E-value=50 Score=14.32 Aligned_cols=69 Identities=17% Similarity=0.106 Sum_probs=44.4 Q ss_pred CHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCE--EHHHHHHHHCC Q ss_conf 3299999999942115889998386658999999998498899998315776547089998521115--18997876225 Q gi|254780936|r 98 MDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFM--DLAYLKNEIAR 175 (182) Q Consensus 98 ~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi--~l~~l~~~i~r 175 (182) -|-.+.+-+++..+ .+- +|+=||+ ..=++..++.|.+++.++..++ +.++.+....++ ++++|++.+++ T Consensus 143 P~Pe~~l~a~~~l~-~~~-~l~VGDs--~~D~~aA~~AGi~~i~v~~Gy~-----~~~~~~~~~~~~i~~~~el~~~l~~ 213 (216) T PRK13224 143 PDPAPLRAAFAALG-DEP-VLYVGDS--EVDAETAENAGVPFLLFTEGYR-----KAPVHELPHHGAFDDHAELPDLVAR 213 (216) T ss_pred CCHHHHHHHHHHCC-CCC-EEEECCC--HHHHHHHHHHCCCEEEECCCCC-----CCCHHHCCCCCHHCCHHHHHHHHHH T ss_conf 99999999997379-999-8997598--8789999994998899777979-----9896773846261799989999999 No 255 >PHA00439 exonuclease Probab=26.05 E-value=28 Score=15.91 Aligned_cols=20 Identities=30% Similarity=0.421 Sum_probs=15.4 Q ss_pred CCCEEEEECCCHHHHHHHHH Q ss_conf 15889998386658999999 Q gi|254780936|r 112 GLEHLVIFSGDGCFTTLVAA 131 (182) Q Consensus 112 ~~d~~iLvSGD~Df~pli~~ 131 (182) ..+..||||+|.||-.+-.. T Consensus 135 g~~~~iivS~DKDm~qiPg~ 154 (288) T PHA00439 135 GADKAVLVSCDKDFKTIPNC 154 (288) T ss_pred CCCCEEEEECCCCCCCCCCE T ss_conf 89862898279882308860 No 256 >PRK13538 cytochrome c biogenesis protein CcmA; Provisional Probab=26.03 E-value=50 Score=14.31 Aligned_cols=38 Identities=18% Similarity=0.199 Sum_probs=29.2 Q ss_pred CCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHH Q ss_conf 86658999999998498899998315776547089998521115189 Q gi|254780936|r 121 GDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLA 167 (182) Q Consensus 121 GD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~ 167 (182) +=..+..+++.+++.|+.|++++ .++|..+||+.+.|+ T Consensus 164 ~~~~l~~~l~~~~~~g~~vi~~t---------H~dl~~~ad~v~vl~ 201 (204) T PRK13538 164 GVARLEQLLAQHAEQGGMVILTT---------HQDLPGASDKVRKLR 201 (204) T ss_pred HHHHHHHHHHHHHHCCCEEEEEE---------CHHHHHHCCEEEEEC T ss_conf 99999999999985899899998---------669898769999973 No 257 >PTZ00153 lipoamide dehydrogenase; Provisional Probab=25.89 E-value=51 Score=14.30 Aligned_cols=59 Identities=19% Similarity=0.215 Sum_probs=39.8 Q ss_pred HHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEEC----C---CCC-CCHHHHHHHCCCCEEHHH Q ss_conf 42115889998386658999999998498899998315----7---765-470899985211151899 Q gi|254780936|r 109 QSEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVL----S---DPS-MASDQLRRQADYFMDLAY 168 (182) Q Consensus 109 ~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~----~---~~~-~~S~~L~~~ad~fi~l~~ 168 (182) .+..||.+||=+|=+-|+.+++.. ++|.+|.++.-.. | .++ -.|.+|-.++..|-++.+ T Consensus 120 ~~~eYDlvIIG~G~GGy~AAi~Aa-q~GLKvaiiegekd~lGGTClNrGCIPSKALL~as~~~re~~~ 186 (673) T PTZ00153 120 NAEEYDLAIIGCGVGGHAAAINAM-ERGLKVIIFAGDEDCIGGTCVNVGCIPSKALLYATGKYRELKN 186 (673) T ss_pred CHHHCCEEEECCCCCHHHHHHHHH-HCCCEEEEEECCCCCCCCEEECCCCHHHHHHHHHHHHHHHHHH T ss_conf 843579899998762899999999-8498699993687874772427773211999987357998861 No 258 >cd03271 ABC_UvrA_II The excision repair protein UvrA domain II; Nucleotide excision repair in eubacteria is a process that repairs DNA damage by the removal of a 12-13-mer oligonucleotide containing the lesion. Recognition and cleavage of the damaged DNA is a multistep ATP-dependent reaction that requires the UvrA, UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases, with UvrA having two ATP binding sites, which have the characteristic signature of the family of ABC proteins and UvrB having one ATP binding site that is structurally related to that of helicases. Probab=25.82 E-value=51 Score=14.29 Aligned_cols=37 Identities=19% Similarity=0.293 Sum_probs=27.3 Q ss_pred HHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHH Q ss_conf 658999999998498899998315776547089998521115189 Q gi|254780936|r 123 GCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLA 167 (182) Q Consensus 123 ~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~ 167 (182) ..+..+++.+++.|+.|++++ +. -++.+.||+.+.|. T Consensus 209 ~~l~~~l~~l~~~G~Tvi~it----H~----l~~~~~aD~IivLg 245 (261) T cd03271 209 KKLLEVLQRLVDKGNTVVVIE----HN----LDVIKCADWIIDLG 245 (261) T ss_pred HHHHHHHHHHHHCCCEEEEEE----EC----HHHHHHCCEEEECC T ss_conf 999999999997899999984----77----88997389999934 No 259 >PRK10653 D-ribose transporter subunit RbsB; Provisional Probab=25.69 E-value=51 Score=14.28 Aligned_cols=78 Identities=15% Similarity=0.107 Sum_probs=51.3 Q ss_pred CCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH----HCCCCEEEEECCCHHHHHHHH Q ss_conf 7314677757999889755956975345895378886543565329999999994----211588999838665899999 Q gi|254780936|r 55 DPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ----SEGLEHLVIFSGDGCFTTLVA 130 (182) Q Consensus 55 ~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~----a~~~d~~iLvSGD~Df~pli~ 130 (182) .......+...|.++|...++++.... ..+.|..-+..+++. -+.+| +|+.++|.==+-+++ T Consensus 159 ~~~~~~~R~~Gf~~al~~~~~~~~~~~-------------~~~~~~~~g~~~~~~lL~~~p~~~-aIf~~nD~~A~Ga~~ 224 (295) T PRK10653 159 GTSAARERGEGFQQAVAAHKFNVLASQ-------------PADFDRTKGLNVMQNLLTAHPDVQ-AVFAQNDEMALGALR 224 (295) T ss_pred CCHHHHHHHHHHHHHHHHCCCEEEEEE-------------CCCCCHHHHHHHHHHHHHHCCCCC-EEEECCCHHHHHHHH T ss_conf 986899999999999986698687530-------------477789999999999996089977-899779779999999 Q ss_pred HHHHCCC-EEEEEEEEC Q ss_conf 9998498-899998315 Q gi|254780936|r 131 ALQRKVK-KVTIVSTVL 146 (182) Q Consensus 131 ~lr~~Gk-~V~v~~~~~ 146 (182) .+++.|. .|.+++|.. T Consensus 225 al~~~G~~di~vvGfD~ 241 (295) T PRK10653 225 ALQTAGKSDVMVVGFDG 241 (295) T ss_pred HHHHCCCCCCEEEEECC T ss_conf 99980999839992899 No 260 >PRK10475 23S rRNA pseudouridine synthase F; Provisional Probab=25.55 E-value=51 Score=14.26 Aligned_cols=47 Identities=11% Similarity=0.109 Sum_probs=33.9 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCC Q ss_conf 58899983866589999999984988999983157765470899985211 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADY 162 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~ 162 (182) -.-.+|+|.|+||+--+.+-+..-.+.+.+.+.... +...|...... T Consensus 110 SeGLLlLTNDG~l~~~l~hP~~~i~K~Y~V~v~g~i---~~~~l~~l~~G 156 (290) T PRK10475 110 SQGLIFLTNHGDLVNKILRAGNDHEKEYLVTVDKPI---TDEFIRGMGAG 156 (290) T ss_pred CCCEEEECCCHHHHHHHHCCCCCCCEEEEEEECCCC---CHHHHHHHHCC T ss_conf 767599859879999873576789559999989858---99999998589 No 261 >cd01541 PBP1_AraR Ligand-binding domain of DNA transcription repressor specific for arabinose (AraR) which is a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor specific for arabinose (AraR) which is a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of AraR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which i Probab=25.52 E-value=51 Score=14.26 Aligned_cols=102 Identities=11% Similarity=0.042 Sum_probs=56.5 Q ss_pred HHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHH-H- Q ss_conf 99999851792999999730688731467775799988975595697534589537888654356532999999999-4- Q gi|254780936|r 32 KLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFE-Q- 109 (182) Q Consensus 32 ~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~-~- 109 (182) .+.+++.+.+.-..++.+. .. ......+...|.+++...|..+.......+. ...+..-+.+.+. + T Consensus 111 ~a~~~L~~~G~~~i~~i~~--~~-~~~~~~R~~gf~~a~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~ll 178 (273) T cd01541 111 KATEYLIELGHRKIAGIFK--AD-DLQGVKRMKGFIKAYREHGIPFNPSNVITYT---------TEEKEEKLFEKIKEIL 178 (273) T ss_pred HHHHHHHHCCCCCEEEEEC--CC-CCCHHHHHHHHHHHHHHCCCCCCHHHCCCCC---------CCCHHHHHHHHHHHHH T ss_conf 9999998569960899816--76-3107999999999999879997723322446---------7516899999999998 Q ss_pred --HCCCCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEEC Q ss_conf --21158899983866589999999984988----99998315 Q gi|254780936|r 110 --SEGLEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVL 146 (182) Q Consensus 110 --a~~~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~ 146 (182) .+.+ ++|++++|.=-..++..+++.|.+ |.++++.. T Consensus 179 ~~~~~~-~av~~~~D~~A~g~~~~l~~~g~~vP~dvsIigfd~ 220 (273) T cd01541 179 KRPERP-TAIVCYNDEIALRVIDLLKELGLKIPEDISVVGFDD 220 (273) T ss_pred HCCCCC-CEEEECCHHHHHHHHHHHHHCCCCCCCCEEEEECCC T ss_conf 369999-569848889998899865314888887724761287 No 262 >PRK00561 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=25.23 E-value=52 Score=14.22 Aligned_cols=30 Identities=27% Similarity=0.363 Sum_probs=17.3 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCCEEEEE Q ss_conf 588999838665899999999849889999 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVKKVTIV 142 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~ 142 (182) .|.+|.+-|||=|..+++.+...++.+..+ T Consensus 34 ~DlviviGGDGT~L~a~~~~~~~~iPilGI 63 (259) T PRK00561 34 ADYLFVLGGDGFFVSTAANYNCAGCKVVGI 63 (259) T ss_pred CCEEEEECCHHHHHHHHHHHCCCCCCEEEE T ss_conf 999999897199999999855479968999 No 263 >TIGR00393 kpsF sugar isomerase, KpsF/GutQ family; InterPro: IPR004800 This is a family of closely related proteins with the phosphosugar-binding domain SIS (Sugar ISomerase) followed by two copies of the CBS (named after Cystathionine Beta Synthase) domain. The group includes GutQ, a protein of the glucitol operon and KpsF, a virulence factor involved in capsular polysialic acid biosynthesis in some pathogenic strains of Escherichia coli.; GO: 0016853 isomerase activity, 0005975 carbohydrate metabolic process. Probab=25.07 E-value=52 Score=14.20 Aligned_cols=51 Identities=29% Similarity=0.483 Sum_probs=38.1 Q ss_pred CCCCEEEEEC--CCH-HHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHH Q ss_conf 1158899983--866-58999999998498899998315776547089998521115189 Q gi|254780936|r 111 EGLEHLVIFS--GDG-CFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLA 167 (182) Q Consensus 111 ~~~d~~iLvS--GD~-Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~ 167 (182) ...|+++++| |.+ |..-+|-+|++.+.+.+.++-. + .| .|-.+||.+.++. T Consensus 46 ~~~Dvvl~ISYSGEs~e~~~Lip~Lk~~~~~lia~T~~----p-~S-~La~~Ad~~L~i~ 99 (272) T TIGR00393 46 EPRDVVLLISYSGESKELDKLIPSLKRLSVKLIAVTGK----P-NS-SLARAADYVLDIK 99 (272) T ss_pred CCCCEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECCC----C-CC-CHHHCCCEEEEEE T ss_conf 86747999876887577877535777559708997288----8-97-2453065457667 No 264 >PRK06327 dihydrolipoamide dehydrogenase; Validated Probab=24.97 E-value=53 Score=14.19 Aligned_cols=18 Identities=28% Similarity=0.248 Sum_probs=10.1 Q ss_pred HHHHHHHHHCCCEEEEEE Q ss_conf 999999998498899998 Q gi|254780936|r 126 TTLVAALQRKVKKVTIVS 143 (182) Q Consensus 126 ~pli~~lr~~Gk~V~v~~ 143 (182) ++....+++.|.+|+++. T Consensus 196 ~E~A~~~~~lG~~Vtli~ 213 (475) T PRK06327 196 LELGSVWRRLGAEVTILE 213 (475) T ss_pred HHHHHHHHHCCCEEEEEE T ss_conf 999999995498599998 No 265 >pfam01408 GFO_IDH_MocA Oxidoreductase family, NAD-binding Rossmann fold. This family of enzymes utilize NADP or NAD. This family is called the GFO/IDH/MOCA family in swiss-prot. Probab=24.97 E-value=53 Score=14.19 Aligned_cols=32 Identities=16% Similarity=0.163 Sum_probs=28.5 Q ss_pred HCCCCEEEEECCCHHHHHHHHHHHHCCCEEEE Q ss_conf 21158899983866589999999984988999 Q gi|254780936|r 110 SEGLEHLVIFSGDGCFTTLVAALQRKVKKVTI 141 (182) Q Consensus 110 a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v 141 (182) .+++|.++++|..+-..+.+..+-+.|+.|.+ T Consensus 60 ~~~iD~v~I~tp~~~H~~~~~~~l~~g~~v~~ 91 (120) T pfam01408 60 DPDVDAVSVATPPGLHFELALAALEAGKHVLV 91 (120) T ss_pred CCCCCEEEECCCHHHHHHHHHHHHHHCCEEEE T ss_conf 77889899908746189999999981998999 No 266 >cd01545 PBP1_SalR Ligand-binding domain of DNA transcription repressor SalR, a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor SalR, a member of the LacI-GalR family of bacterial transcription regulators. The SalR binds to glucose based compound Salicin which is chemically related to aspirin. The ligand-binding of SalR is structurally homologous to the periplasmic sugar-binding domain of ABC-transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand bind Probab=24.84 E-value=53 Score=14.18 Aligned_cols=105 Identities=13% Similarity=0.095 Sum_probs=60.6 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999999994211 Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEG 112 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~ 112 (182) +.+++.+.+.-..++. ..+........+.+.|.+++...|..+....... + ....+.+ ...+-.++..... T Consensus 109 a~~~L~~~G~~~I~~i--~g~~~~~~~~~R~~Gf~~a~~~~gi~~~~~~i~~---~--~~s~~~~--~~~~~~ll~~~~~ 179 (270) T cd01545 109 MTRHLIDLGHRRIAFI--AGPPDHRASAERLEGYRDALAEAGLPLDPELVAQ---G--DFTFESG--LEAAEALLALPDR 179 (270) T ss_pred HHHHHHHCCCCEEEEE--CCCCCCCHHHHHHHHHHHHHHHCCCCCCCCEEEC---C--CCCCHHH--HHHHHHHHHCCCC T ss_conf 9999997499659993--6997773699999999999998399977212202---6--5310269--9999999855999 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCC----EEEEEEEECC Q ss_conf 5889998386658999999998498----8999983157 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVK----KVTIVSTVLS 147 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk----~V~v~~~~~~ 147 (182) -++|++++|.=-.-++..+++.|. .|-|+++... T Consensus 180 -~tai~~~nD~~A~g~~~~l~~~g~~iP~dvsiigfD~~ 217 (270) T cd01545 180 -PTAIFASNDDMAAGVLAVAHRRGLRVPDDLSVVGFDDT 217 (270) T ss_pred -CCEEEECCHHHHHHHHHHHHHHCCCCCCCEEEEEECCH T ss_conf -85899788899999999999818999998699998885 No 267 >cd06331 PBP1_AmiC_like Type I periplasmic components of amide-binding protein (AmiC) and the active transport system for short-chain and urea (FmdDEF). This group includes the type I periplasmic components of amide-binding protein (AmiC) and the active transport system for short-chain and urea (FmdDEF), found in bacteria and Archaea. AmiC controls expression of the amidase operon by a ligand-triggered conformational switch. In the absence of ligand or presence of butyramide (repressor), AmiC (the ligand sensor and negative regulator) adopts an open conformation and inhibits the transcription antitermination function of AmiR by direct protein-protein interaction. In the presence of inducing ligands such as acetamide, AmiC adopts a closed conformation which disrupts a silencing AmiC-AmiR complex and the expression of amidase and other genes of the operon is induced. FmdDEF is predicted to be an ATP-dependent transporter and closely resembles the periplasmic binding protein and the two t Probab=24.79 E-value=53 Score=14.17 Aligned_cols=67 Identities=9% Similarity=-0.019 Sum_probs=39.5 Q ss_pred HHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCE Q ss_conf 775799988975595697534589537888654356532999999999421158899983866589999999984988 Q gi|254780936|r 61 SPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKK 138 (182) Q Consensus 61 ~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~ 138 (182) .....|.+.++..|.++...... .-+..|..=.+.-+. +.+.|.+++...-.|.+.+++-+++.|.. T Consensus 147 ~~~~~~~~~~~~~G~~vv~~~~~----------~~~~~Dfs~~l~~i~-~~~pD~v~~~~~~~~~~~~~~q~~~~G~~ 213 (333) T cd06331 147 ESNRIARALLEELGGEVVGEEYL----------PLGTSDFGSVIEKIK-AAGPDVVLSTLVGDSNVAFYRQFAAAGLD 213 (333) T ss_pred HHHHHHHHHHHHCCCEEEEEEEE----------CCCCCCHHHHHHHHH-HCCCCEEEEECCCCHHHHHHHHHHHCCCC T ss_conf 99999999999859949999970----------699724799999987-65999999935660399999999975999 No 268 >cd03375 TPP_OGFOR Thiamine pyrophosphate (TPP family), 2-oxoglutarate ferredoxin oxidoreductase (OGFOR) subfamily, TPP-binding module; OGFOR catalyzes the oxidative decarboxylation of 2-oxo-acids, with ferredoxin acting as an electron acceptor. In the TCA cycle, OGFOR catalyzes the oxidative decarboxylation of 2-oxoglutarate to succinyl-CoA. In the reductive tricarboxylic acid cycle found in the anaerobic autotroph Hydrogenobacter thermophilus, OGFOR catalyzes the reductive carboxylation of succinyl-CoA to produce 2-oxoglutarate. Thauera aromatica OGFOR has been shown to provide reduced ferredoxin to benzoyl-CoA reductase, a key enzyme in the anaerobic metabolism of aromatic compounds. OGFOR is dependent on TPP and a divalent metal cation for activity. Probab=24.61 E-value=53 Score=14.15 Aligned_cols=29 Identities=24% Similarity=0.518 Sum_probs=16.8 Q ss_pred CEEEEECCCHHHHH-----HHHHHHHCCCEEEEEE Q ss_conf 88999838665899-----9999998498899998 Q gi|254780936|r 114 EHLVIFSGDGCFTT-----LVAALQRKVKKVTIVS 143 (182) Q Consensus 114 d~~iLvSGD~Df~p-----li~~lr~~Gk~V~v~~ 143 (182) -.+|.++|||||.- ++..+ +++..+.++. T Consensus 70 ~~Vva~~GDG~~~~iG~~~l~~A~-~rn~~i~~iv 103 (193) T cd03375 70 LTVIVVSGDGDLAAIGGNHFIHAA-RRNIDITVIV 103 (193) T ss_pred CEEEEEEECCHHHHCCHHHHHHHH-HHCCCEEEEE T ss_conf 739999706557645618899999-8099749999 No 269 >cd06286 PBP1_CcpB_like Ligand-binding domain of a novel transcription factor implicated in catabolite repression in Bacillus and Clostridium species. This group includes the ligand-binding domain of a novel transcription factor implicated in catabolite repression in Bacillus and Clostridium species. CcpB is 30% identical in sequence to CcpA which functions as the major transcriptional regulator of carbon catabolite repression/regulation (CCR), a process in which enzymes necessary for the metabolism of alternative sugars are inhibited in the presence of glucose. Like CcpA, the DNA-binding protein CcpB exerts its catabolite-repressing effect by a mechanism dependent on the presence of HPr(Ser-P), the small phosphocarrier proteins of the phosphoenolpyruvate-sugar phosphotransferase system, but with a less significant degree. Probab=24.58 E-value=54 Score=14.15 Aligned_cols=105 Identities=9% Similarity=0.020 Sum_probs=59.1 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 99998517929999997306887314677757999889755956975345895378886543565329999999994211 Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEG 112 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~ 112 (182) +.+.+.+.+.-.. .|...+........+...|.+++...|..+...... ... ...+.+ -.++..++...+. T Consensus 105 ~~~~L~~~G~~~i--~~i~~~~~~~~~~~R~~Gf~~al~~~g~~~~~~~~~--~~~---~~~~~g--~~~~~~~~~~~~~ 175 (260) T cd06286 105 ALKYLIQKGYRKI--AYCIGRKKSLNSQSRKKAYKDALEEYGLTPDEEWIF--EGC---FTIEDG--ERIGHQLLKMKDR 175 (260) T ss_pred HHHHHHHCCCCEE--EEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCEEEE--CCC---CCHHHH--HHHHHHHHHCCCC T ss_conf 9999997599739--999389887049999999999999779997730364--465---206679--9999999844999 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCCE----EEEEEEECC Q ss_conf 58899983866589999999984988----999983157 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVKK----VTIVSTVLS 147 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk~----V~v~~~~~~ 147 (182) . +++++++|.=-.-++..+++.|.+ +.|++|... T Consensus 176 p-~ai~~~nd~~A~g~~~al~~~g~~vP~di~vigfdd~ 213 (260) T cd06286 176 P-DAIFTGSDEVAAGIITEAKKQGIRVPEDLAIIGFDNQ 213 (260) T ss_pred C-CEEEECCHHHHHHHHHHHHHCCCCCCCCEEEEEECCH T ss_conf 8-7367368999999999999729999997699998984 No 270 >PRK13651 cobalt transporter ATP-binding subunit; Provisional Probab=24.44 E-value=53 Score=14.17 Aligned_cols=31 Identities=10% Similarity=0.058 Sum_probs=15.1 Q ss_pred CCCHHHHHHHHHHHH----CCCCEEEEECCCHHHH Q ss_conf 653299999999942----1158899983866589 Q gi|254780936|r 96 SSMDVELAVDAFEQS----EGLEHLVIFSGDGCFT 126 (182) Q Consensus 96 k~~Dv~laiD~~~~a----~~~d~~iLvSGD~Df~ 126 (182) .+.|..-.-++++.. .+=-++|++|=|=|++ T Consensus 190 agLDp~~~~~i~~~l~~L~~~G~TVI~vTHdm~~v 224 (304) T PRK13651 190 AGLDPQGVKEILEIFDTLNKKGKTIILVTHDLDNV 224 (304) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH T ss_conf 65898999999999999997799999986789999 No 271 >COG1121 ZnuC ABC-type Mn/Zn transport systems, ATPase component [Inorganic ion transport and metabolism] Probab=24.43 E-value=54 Score=14.13 Aligned_cols=87 Identities=15% Similarity=0.150 Sum_probs=51.9 Q ss_pred HHHHHHHHHCCCE-EEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEE--------ECCCHHHHHHHHHHHH Q ss_conf 7999889755956-975345895378886543565329999999994211588999--------8386658999999998 Q gi|254780936|r 64 HPLLDWLHYNGFQ-VVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVI--------FSGDGCFTTLVAALQR 134 (182) Q Consensus 64 ~~~~~~l~~~g~~-v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iL--------vSGD~Df~pli~~lr~ 134 (182) ....++|+..|.. +..+++...+.+ -..-+.=|-.++++-|.++| +.+-.+|..++.+++. T Consensus 118 ~~v~~aL~~Vgm~~~~~r~i~~LSGG----------Q~QRV~lARAL~~~p~lllLDEP~~gvD~~~~~~i~~lL~~l~~ 187 (254) T COG1121 118 EKVDEALERVGMEDLRDRQIGELSGG----------QKQRVLLARALAQNPDLLLLDEPFTGVDVAGQKEIYDLLKELRQ 187 (254) T ss_pred HHHHHHHHHCCCHHHHCCCCCCCCCH----------HHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 99999999839266647955546727----------99999999985369998996687545798789999999999987 Q ss_pred CCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHH Q ss_conf 498899998315776547089998521115189 Q gi|254780936|r 135 KVKKVTIVSTVLSDPSMASDQLRRQADYFMDLA 167 (182) Q Consensus 135 ~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~ 167 (182) .|+.|.+++=. -....+.||+.+.|. T Consensus 188 eg~tIl~vtHD-------L~~v~~~~D~vi~Ln 213 (254) T COG1121 188 EGKTVLMVTHD-------LGLVMAYFDRVICLN 213 (254) T ss_pred CCCEEEEEECC-------CHHHHHHCCEEEEEC T ss_conf 89889999588-------177685388799985 No 272 >PRK10936 periplasmic sensory protein associated with the TorRS two-component regulatory system; Provisional Probab=24.35 E-value=54 Score=14.12 Aligned_cols=109 Identities=12% Similarity=0.052 Sum_probs=59.4 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH-- Q ss_conf 999985179299999973068873146777579998897559569753458953788865435653299999999942-- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS-- 110 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a-- 110 (182) |.+.+..++......+..+.... .....+.+.|.+++...++++.... -.++|-..+.++++.. T Consensus 159 la~~~~~~~~~~~Va~l~G~~~~-~~~~~r~~GF~~al~~~~ikIv~~q-------------~ad~~r~~a~~~~e~iL~ 224 (340) T PRK10936 159 LAQWHPKGSGKLNVALLPGPEGA-GGSKAVEQGFRAAIKGSDVRIVDIA-------------HGDNDKELQRNLLQELLE 224 (340) T ss_pred HHHHHCCCCCCEEEEEECCCCCC-CHHHHHHHHHHHHHCCCCCEEEEEE-------------CCCCCHHHHHHHHHHHHH T ss_conf 99973157984399995287665-1476898879999707996799884-------------377749999999999998 Q ss_pred --CCCCEEEEECCCHHHHHHHHHHHHCCC--EEEEEEEECCCCCCCHHHHHHHCCC Q ss_conf --115889998386658999999998498--8999983157765470899985211 Q gi|254780936|r 111 --EGLEHLVIFSGDGCFTTLVAALQRKVK--KVTIVSTVLSDPSMASDQLRRQADY 162 (182) Q Consensus 111 --~~~d~~iLvSGD~Df~pli~~lr~~Gk--~V~v~~~~~~~~~~~S~~L~~~ad~ 162 (182) +.+|- |+..+---.-++..+++.|+ +|.++++.. +...++..-++ T Consensus 225 ~~pdid~--I~g~~~~a~~Ai~al~~~g~~~~v~ivg~D~-----s~~~~~~Ik~G 273 (340) T PRK10936 225 RHPEIDY--IAGSAVAAEAAIGELRGRNLTDPVKLVSFYL-----SHQVYRGLKRG 273 (340) T ss_pred HCCCCCE--EEECCHHHHHHHHHHHHCCCCCCEEEEEECC-----CHHHHHHHHCC T ss_conf 4999679--9807566999999999608888749999269-----99999999849 No 273 >TIGR02456 treS_nterm trehalose synthase; InterPro: IPR012810 Trehalose synthase interconverts maltose and alpha,alpha-trehalose by transglucosylation. This is one of at least three mechanisms for biosynthesis of trehalose, an important and widespread compatible solute. However, it is not driven by phosphate activation of sugars and its physiological role may tend toward trehalose degradation. This view is accentuated by numerous examples of fusion to a probable maltokinase domain. The sequence region described by this model is found both as the whole of a trehalose synthase and as the N-terminal region of a larger fusion protein that includes trehalose synthase activity. Several of these fused trehalose synthases have a domain homologous to proteins with maltokinase activity from Actinoplanes missouriensis and Streptomyces coelicolor . More information about this protein can be found at Protein of the Month: alpha-Amylase .; GO: 0003824 catalytic activity, 0005975 carbohydrate metabolic process. Probab=23.95 E-value=55 Score=14.07 Aligned_cols=75 Identities=20% Similarity=0.156 Sum_probs=43.5 Q ss_pred HHHHHHHHHHHHHHCCCEEE-EEEEEE--ECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHC Q ss_conf 67775799988975595697-534589--537888654356532999999999421158899983866589999999984 Q gi|254780936|r 59 QFSPLHPLLDWLHYNGFQVV-AKVAKE--FTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRK 135 (182) Q Consensus 59 ~~~~~~~~~~~l~~~g~~v~-~~~~~~--~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~ 135 (182) .+.=....++.|+.+|+.+. +-|+-+ ..|++. | +-|=+...+.+-|+ .||..+|.++.++ T Consensus 26 Df~GLt~KLDYL~~LGVD~LWLlPFf~SPLRDdGY--------D---vsDY~~ilPe~Gtl------dDF~~fv~~AH~R 88 (560) T TIGR02456 26 DFPGLTSKLDYLKELGVDALWLLPFFKSPLRDDGY--------D---VSDYRAILPEYGTL------DDFKEFVDEAHAR 88 (560) T ss_pred CHHHHHHHHHHHHHHCCCHHHCCCCCCCCCCCCCC--------C---HHHHHCCCCCCCCH------HHHHHHHHHHHHC T ss_conf 72345211047786232211316777887767882--------0---34230006778877------8999999999757 Q ss_pred CCEEEEEEEECCCCCCCHH Q ss_conf 9889999831577654708 Q gi|254780936|r 136 VKKVTIVSTVLSDPSMASD 154 (182) Q Consensus 136 Gk~V~v~~~~~~~~~~~S~ 154 (182) |.||++== +=+|||. T Consensus 89 g~RvI~dL----V~NHTSd 103 (560) T TIGR02456 89 GMRVIIDL----VLNHTSD 103 (560) T ss_pred CCEEEEEC----CCCCCCC T ss_conf 98899963----7574358 No 274 >PRK00955 hypothetical protein; Provisional Probab=23.86 E-value=55 Score=14.06 Aligned_cols=41 Identities=15% Similarity=-0.001 Sum_probs=23.5 Q ss_pred CCEEEEECC-----CHHHHHHHHHHHHCCCEEEEEEEECCCCCCCH Q ss_conf 588999838-----66589999999984988999983157765470 Q gi|254780936|r 113 LEHLVIFSG-----DGCFTTLVAALQRKVKKVTIVSTVLSDPSMAS 153 (182) Q Consensus 113 ~d~~iLvSG-----D~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S 153 (182) +=+-.|+|+ +.|++.|.+.||+.|.+..=+-.-.+-+...| T Consensus 478 ylvPY~issHPGct~~dm~~La~~lk~~~~~peQVQdF~PTP~T~s 523 (599) T PRK00955 478 FLVPYLMSSHPGSTLEDAIELAEYLKDLGYQPEQVQDFYPTPGTLS 523 (599) T ss_pred EEEEEEECCCCCCCHHHHHHHHHHHHHCCCCHHHCEEEECCCCHHH T ss_conf 0477788169998999999999999973999777311007851898 No 275 >TIGR02244 HAD-IG-Ncltidse HAD superfamily (subfamily IG) hydrolase, 5'-nucleotidase; InterPro: IPR008380 This family includes a 5'-nucleotidase, 3.1.3.5 from EC, specific for purines (IMP and GMP) . These enzymes are members of the Haloacid Dehalogenase (HAD) superfamily. HAD members are recognized by three short motifs {hhhhDxDx(T/V)}, {hhhh(T/S)}, and either {hhhh(D/E)(D/E)x(3-4)(G/N)} or {hhhh(G/N)(D/E)x(3-4)(D/E)} (where "h" stands for a hydrophobic residue). Crystal structures of many HAD enzymes has verified PSI-PRED predictions of secondary structural elements which show each of the "hhhh" sequences of the motifs as part of beta sheets. This subfamily of enzymes is part of "Subfamily I" of the HAD superfamily by virtue of a "cap" domain in between motifs 1 and 2. This subfamily's cap domain has a different predicted secondary structure than all other known HAD enzymes and thus has been designated "subfamily IG", the domain appears to consist of a mixed alpha/beta fold.. Probab=23.85 E-value=55 Score=14.06 Aligned_cols=60 Identities=20% Similarity=0.229 Sum_probs=38.9 Q ss_pred CHHHHHHHHHHH---------------HCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCC Q ss_conf 329999999994---------------2115889998386658999999998498899998315776547089998521 Q gi|254780936|r 98 MDVELAVDAFEQ---------------SEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQAD 161 (182) Q Consensus 98 ~Dv~laiD~~~~---------------a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad 161 (182) -||.=|++-+.. +.+.+.+|+ -|.-..+.+++||+.||++.+++= +...-|-.-++-.+. T Consensus 184 ~DV~~A~~~vH~~G~lknDqsLYf~~v~~~~e~yv~--~dp~l~~~L~~l~~~GKklFLlTN--S~y~ytd~~M~~~~~ 258 (402) T TIGR02244 184 QDVRDALDWVHVEGSLKNDQSLYFKKVMSNPEKYVL--RDPELVRFLEKLREHGKKLFLLTN--SDYDYTDKGMKYLLG 258 (402) T ss_pred HHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCCCC--CCCHHHHHHHHHHHHCCEEEEEEC--CCHHHHHHHHHHHHH T ss_conf 999999875232454310146888999863752114--785689999999972780798864--880666668887520 No 276 >pfam05762 VWA_CoxE VWA domain containing CoxE-like protein. This family is annotated by SMART as containing a VWA type domain. The exact function of this family is unknown. It is found as part of a CO oxidising (Cox) system operon is several bacteria. Probab=23.84 E-value=55 Score=14.06 Aligned_cols=140 Identities=19% Similarity=0.239 Sum_probs=67.9 Q ss_pred EEEEEEEHHHHHHHHHHCCCCCCHHHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEEC Q ss_conf 18999807899999985087679899999985179299999973068873146777579998897559569753458953 Q gi|254780936|r 7 KIALFIDGANLYASSKALGFDIDYRKLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFT 86 (182) Q Consensus 7 rvaIfID~~Nl~~~~~~~~~~~d~~~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~ 86 (182) ++.+++|.+.=. ..+. .-+..|+..+.....-++++.|.....+........ .-.++|....-.+ . T Consensus 59 ~lVvl~DVSGSM---~~ys--~~~L~~~~al~~~~~rv~~F~F~t~l~~vT~~l~~~-d~~~al~~~~~~~--------~ 124 (223) T pfam05762 59 RLVLLLDVSGSM---ADYS--RIFLALLHALLAGRPRTRLFAFGTRLTDLTRALRER-DPAEALLRVSARV--------E 124 (223) T ss_pred CEEEEECCCCCC---HHHH--HHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHHHCC-CHHHHHHHHHHHH--------C T ss_conf 589997378874---9999--999999999985468615999836489888887128-9999999999860--------3 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHH--CC-C--CEEEEECCCH-------HHHHHHHHHHHCCCEEEEEEEE--CCCCCCC Q ss_conf 788865435653299999999942--11-5--8899983866-------5899999999849889999831--5776547 Q gi|254780936|r 87 ENCGRKRVKSSMDVELAVDAFEQS--EG-L--EHLVIFSGDG-------CFTTLVAALQRKVKKVTIVSTV--LSDPSMA 152 (182) Q Consensus 87 ~~~~~~~~kk~~Dv~laiD~~~~a--~~-~--d~~iLvSGD~-------Df~pli~~lr~~Gk~V~v~~~~--~~~~~~~ 152 (182) .--+++|+.=++.-.... .. + .++||+-||+ +....+++|++.+.+|+-+.-. .++.... T Consensus 125 ------~~~GgT~ig~al~~f~~~~~~~~l~~~t~ViilsDg~~~~~~~~l~~~l~~L~~~~~rviWLNPl~~~~w~~~d 198 (223) T pfam05762 125 ------DWGGGTRIGAALAYFNELWTRPALSRGAVVVLVSDGLERGDSEELLAEVARLVRSARRLVWLNPLPDLGWPGYD 198 (223) T ss_pred ------CCCCCCCHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCC T ss_conf ------66799749999999998503034678867999723010388318999999999837879998998712288888 Q ss_pred --HHHHHH---HCCCCEEH Q ss_conf --089998---52111518 Q gi|254780936|r 153 --SDQLRR---QADYFMDL 166 (182) Q Consensus 153 --S~~L~~---~ad~fi~l 166 (182) +..++. .||.|... T Consensus 199 ~~a~~~~aa~p~vd~~~p~ 217 (223) T pfam05762 199 PRARGLRAAGPHVDAFRPL 217 (223) T ss_pred HHHHHHHHHCCCCCCCCCC T ss_conf 7899999827540112575 No 277 >cd01472 vWA_collagen von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif. Probab=23.84 E-value=55 Score=14.06 Aligned_cols=62 Identities=18% Similarity=0.207 Sum_probs=39.6 Q ss_pred CCCHHHHHHHHHH-HH--------CC-CCEEEEEC-C--CHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCC Q ss_conf 6532999999999-42--------11-58899983-8--6658999999998498899998315776547089998521 Q gi|254780936|r 96 SSMDVELAVDAFE-QS--------EG-LEHLVIFS-G--DGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQAD 161 (182) Q Consensus 96 k~~Dv~laiD~~~-~a--------~~-~d~~iLvS-G--D~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad 161 (182) +++.+.-|++.+. .. ++ -..+||+| | +.|....+..+|+.|.+|.++++.. .-..+|+..|. T Consensus 77 g~t~~~~AL~~~~~~~~~~~~~~r~~~~kvvvllTDG~s~~~~~~~a~~lr~~Gi~v~~VGig~----~~~~~L~~iAs 151 (164) T cd01472 77 GGTNTGKALKYVRENLFTEASGSREGVPKVLVVITDGKSQDDVEEPAVELKQAGIEVFAVGVKN----ADEEELKQIAS 151 (164) T ss_pred CCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHCCCEEEEEECCC----CCHHHHHHHHC T ss_conf 9752999999999986353578767851599998379986408899999998898899997884----79999999967 No 278 >cd03246 ABCC_Protease_Secretion This family represents the ABC component of the protease secretion system PrtD, a 60-kDa integral membrane protein sharing 37% identity with HlyB, the ABC component of the alpha-hemolysin secretion pathway, in the C-terminal domain. They export degradative enzymes by using a type I protein secretion system and lack an N-terminal signal peptide, but contain a C-terminal secretion signal. The Type I secretion apparatus is made up of three components, an ABC transporter, a membrane fusion protein (MFP), and an outer membrane protein (OMP). For the HlyA transporter complex, HlyB (ABC transporter) and HlyD (MFP) reside in the inner membrane of E. coli. The OMP component is TolC, which is thought to interact with the MFP to form a continuous channel across the periplasm from the cytoplasm to the exterior. HlyB belongs to the family of ABC transporters, which are ubiquitous, ATP-dependent transmembrane pumps or channels. The spectrum of transport substra Probab=23.62 E-value=56 Score=14.03 Aligned_cols=40 Identities=18% Similarity=0.232 Sum_probs=28.0 Q ss_pred CCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHH Q ss_conf 866589999999984988999983157765470899985211151899 Q gi|254780936|r 121 GDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAY 168 (182) Q Consensus 121 GD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~ 168 (182) +...+..+++.++..|+.|++++ ++ .++...||+.+-+++ T Consensus 131 ~e~~i~~~l~~l~~~~~Tvi~vt----H~----~~~~~~aD~Iivl~~ 170 (173) T cd03246 131 GERALNQAIAALKAAGATRIVIA----HR----PETLASADRILVLED 170 (173) T ss_pred HHHHHHHHHHHHHHCCCEEEEEC----CC----HHHHHHCCEEEEEEC T ss_conf 99999999997864898999984----79----999984999999939 No 279 >cd01575 PBP1_GntR Ligand-binding domain of DNA transcription repressor GntR specific for gluconate, a member of the LacI-GalR family of bacterial transcription regulators. This group represents the ligand-binding domain of DNA transcription repressor GntR specific for gluconate, a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of GntR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding, Probab=23.60 E-value=56 Score=14.03 Aligned_cols=100 Identities=14% Similarity=0.064 Sum_probs=58.1 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHH---- Q ss_conf 9999851792999999730688731467775799988975595697534589537888654356532999999999---- Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFE---- 108 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~---- 108 (182) ..+++.+.+.-. ..|.+.+.+......+...|.++++..|..+..... . ..+....-..++++ T Consensus 107 a~~~L~~~G~~~--i~~i~~~~~~~~~~~R~~G~~~al~~~g~~~~~~~~--~---------~~~~~~~~~~~~~~~~l~ 173 (268) T cd01575 107 MARHLLARGYRR--IGFLGARMDDTRAQQRLEGFRAALRAAGLDPPLVVT--T---------PEPSSFALGRELLAELLA 173 (268) T ss_pred HHHHHHHCCCCE--EEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEE--E---------CCCCCHHHHHHHHHHHHH T ss_conf 999999749976--999848988736999999999999976989880899--5---------489988999999999995 Q ss_pred HHCCCCEEEEECCCHHHHHHHHHHHHCCC----EEEEEEEEC Q ss_conf 42115889998386658999999998498----899998315 Q gi|254780936|r 109 QSEGLEHLVIFSGDGCFTTLVAALQRKVK----KVTIVSTVL 146 (182) Q Consensus 109 ~a~~~d~~iLvSGD~Df~pli~~lr~~Gk----~V~v~~~~~ 146 (182) ..+.+| +|+++.|.==..++..+++.|. .+-|++|.. T Consensus 174 ~~~~~d-ai~~~nD~~A~g~~~~l~~~g~~iP~disvigfdd 214 (268) T cd01575 174 RWPDLD-AVFCSNDDLALGALFECQRRGISVPEDIAIAGFGD 214 (268) T ss_pred CCCCCC-EEEECCHHHHHHHHHHHHHHCCCCCCCEEEEEECC T ss_conf 699983-79987889999999999983899999779999897 No 280 >cd06330 PBP1_Arsenic_SBP_like Periplasmic solute-binding domain of active transport proteins. Periplasmic solute-binding domain of active transport proteins found in bacteria and Archaea that is predicted to be involved in the efflux of toxic compounds. Members of this subgroup include proteins from Herminiimonas arsenicoxydans, which is resistant to arsenic and various heavy metals such as cadmium and zinc. Moreover, they show significant sequence similarity to the cluster of AmiC and active transport systems for short-chain amides and urea (FmdDEF), and thus are likely to exhibit a ligand-binding mode similar to that of the amide sensor protein AmiC from Pseudomonas aeruginosa. Probab=23.55 E-value=56 Score=14.02 Aligned_cols=67 Identities=10% Similarity=-0.079 Sum_probs=32.3 Q ss_pred HHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCC Q ss_conf 7579998897559569753458953788865435653299999999942115889998386658999999998498 Q gi|254780936|r 62 PLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVK 137 (182) Q Consensus 62 ~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk 137 (182) ....|.+.+++.|..+........ .-+..|..=.+..+. ..+.|.+++...-.|.+.+++.+++.|. T Consensus 154 ~~~~~~~~~~~~g~~~~vv~~~~~--------~~~~~D~s~~l~~i~-~~~pd~v~~~~~~~~~~~~~~q~~~~G~ 220 (346) T cd06330 154 AWADFKAALKRLRPDVEVVSEQWP--------KLGAPDYGSEITALL-AAKPDAIFSSLWGGDLVTFVRQANARGL 220 (346) T ss_pred HHHHHHHHHHHHCCCCEEEEEEEC--------CCCCCCHHHHHHHHH-HCCCCEEEEECCCHHHHHHHHHHHHCCC T ss_conf 999999999861789658888825--------899864699999999-6799999990664579999999997498 No 281 >PRK12421 ATP phosphoribosyltransferase regulatory subunit; Provisional Probab=23.37 E-value=56 Score=14.00 Aligned_cols=46 Identities=17% Similarity=0.195 Sum_probs=29.2 Q ss_pred CHHHHHHHHHHHHCC-CCEEEEECCCHHHHHHHHHHHHCCCEEEEEE Q ss_conf 329999999994211-5889998386658999999998498899998 Q gi|254780936|r 98 MDVELAVDAFEQSEG-LEHLVIFSGDGCFTTLVAALQRKVKKVTIVS 143 (182) Q Consensus 98 ~Dv~laiD~~~~a~~-~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~ 143 (182) .|+.-.+.++..... -.+++-.+.|.++..++..||+.|.+|+..- T Consensus 318 l~l~rL~~~~~~~~~~~~i~~p~~~~~~~~~~i~~LR~~G~~Vv~~l 364 (391) T PRK12421 318 MDLRALLALLTQFAEPEAVLAPWGDDPDLLAAIAELRQQGERVVQLL 364 (391) T ss_pred ECHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHHHCCCEEEEEC T ss_conf 66999997265668888799767997999999999996797999838 No 282 >cd06341 PBP1_ABC_ligand_binding_like_7 Type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type active transport systems that are predicted to be involved in transport of amino acids, peptides, or inorganic ions. This subgroup includes the type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type active transport systems that are predicted to be involved in transport of amino acids, peptides, or inorganic ions. Members of this group are sequence-similar to members of the family of ABC-type hydrophobic amino acid transporters such as leucine-isoleucine-valine-binding protein (LIVBP); however their ligand specificity has not been determined experimentally. Probab=23.24 E-value=57 Score=13.99 Aligned_cols=102 Identities=10% Similarity=0.023 Sum_probs=59.8 Q ss_pred HHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 99999851792999999730688731467775799988975595697534589537888654356532999999999421 Q gi|254780936|r 32 KLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSE 111 (182) Q Consensus 32 ~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~ 111 (182) .+..++.+.+....+..+.... .........+...+++.|.++...... ..+..|..=.+.-+. +. T Consensus 122 ~~~~~~~~~~~~k~~~~~~~~~---~~g~~~~~~~~~~~~~~G~~vv~~~~~----------~~~~~D~~~~v~~l~-~~ 187 (341) T cd06341 122 TWGDFAKDQGGTRAVALVTALS---AAVSAAAALLARSLAAAGVSVAGIVVI----------TATAPDPTPQAQQAA-AA 187 (341) T ss_pred HHHHHHHHCCCCCEEEEEECCC---HHHHHHHHHHHHHHHHCCCCEEEEEEC----------CCCCCCHHHHHHHHH-HC T ss_conf 9999999648984899993583---588999999999999759954578744----------899987789999998-56 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECC Q ss_conf 158899983866589999999984988999983157 Q gi|254780936|r 112 GLEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLS 147 (182) Q Consensus 112 ~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~ 147 (182) +.|.+++.+.-.+.+.+++.+++.|.+..++..... T Consensus 188 ~pd~V~~~~~~~~~~~~~k~~~~~G~~~~~~~~~~~ 223 (341) T cd06341 188 GADAIITVLDAAVCASVLKAVRAAGLTPKVVLSGTC 223 (341) T ss_pred CCCEEEEECCCHHHHHHHHHHHHCCCCCCEEEECCC T ss_conf 969999906847899999999976999718995576 No 283 >cd06334 PBP1_ABC_ligand_binding_like_1 Type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type active transport systems that are predicted to be involved in transport of amino acids, peptides, or inorganic ions. This subgroup includes the type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type active transport systems that are predicted to be involved in transport of amino acids, peptides, or inorganic ions. Members of this group are sequence-similar to members of the family of ABC-type hydrophobic amino acid transporters, such as leucine-isoleucine-valine-binding protein (LIVBP); however their ligand specificity has not been determined experimentally. Probab=23.20 E-value=57 Score=13.98 Aligned_cols=87 Identities=13% Similarity=0.126 Sum_probs=56.8 Q ss_pred HHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH-HCCCCEEEEECCCHHHHHHHHHHHHCCCEEE Q ss_conf 757999889755956975345895378886543565329999999994-2115889998386658999999998498899 Q gi|254780936|r 62 PLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ-SEGLEHLVIFSGDGCFTTLVAALQRKVKKVT 140 (182) Q Consensus 62 ~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~-a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~ 140 (182) ....+...++..|+++...... ..+.+|..= -+... +.+.|.+++.+-..+-..+++-+++.|.++. T Consensus 156 ~~~~~~~~~~~~G~~vv~~~~~----------~~~~~Dft~--~l~~i~~a~pD~V~~~~~~~~~~~~~kqa~~~G~~~~ 223 (351) T cd06334 156 PIEALKALAEKLGFEVVLEPVP----------PPGPNDQKA--QWLQIRRSGPDYVILWGWGVMNPVAIKEAKRVGLDDK 223 (351) T ss_pred HHHHHHHHHHHCCCEEEEEEEC----------CCCCCCHHH--HHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCCCC T ss_conf 9999999999769979888806----------999835899--9999997698999993773789999999997599985 Q ss_pred EEEEECCCCCCCHHHHH----HHCCCCE Q ss_conf 99831577654708999----8521115 Q gi|254780936|r 141 IVSTVLSDPSMASDQLR----RQADYFM 164 (182) Q Consensus 141 v~~~~~~~~~~~S~~L~----~~ad~fi 164 (182) +++.... .+..+. .+|+.++ T Consensus 224 ~ig~~~~----~~~~~~~~aG~aa~G~~ 247 (351) T cd06334 224 FIGNWWS----GDEEDVKPAGDAAKGYK 247 (351) T ss_pred EEEEECC----CCHHHHHHHHHHCCCEE T ss_conf 7975167----88999997211117738 No 284 >cd06303 PBP1_LuxPQ_Quorum_Sensing Periplasmic binding protein (LuxP) of autoinducer-2 (AI-2) receptor LuxPQ from Vibrio harveyi and its close homologs. Periplasmic binding protein (LuxP) of autoinducer-2 (AI-2) receptor LuxPQ from Vibrio harveyi and its close homologs from other bacteria. The members of this group are highly homologous to a family of periplasmic pentose/hexose sugar-binding proteins that function as the primary receptors for chemotaxis and transport of many sugar based solutes in bacteria and archaea, and that are members of the type I periplasmic binding protein superfamily. The Vibrio harveyi AI-2 receptor consists of two polypeptides, LuxP and LuxQ: LuxP is a periplasmic binding protein that binds AI-2 by clamping it between two domains, LuxQ is an integral membrane protein belonging to the two-component sensor kinase family. Unlike AI-2 bound to the LsrB receptor in Salmonella typhimurium, the Vibrio harveyi AI-2 signaling molecule has an unusual furanosyl borate Probab=23.18 E-value=57 Score=13.98 Aligned_cols=87 Identities=10% Similarity=0.046 Sum_probs=59.5 Q ss_pred HHHHHHHHHHHHHC-CCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHH--CCCC-EEEEECCCHHHHHHHHHHHHC Q ss_conf 77757999889755-9569753458953788865435653299999999942--1158-899983866589999999984 Q gi|254780936|r 60 FSPLHPLLDWLHYN-GFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQS--EGLE-HLVIFSGDGCFTTLVAALQRK 135 (182) Q Consensus 60 ~~~~~~~~~~l~~~-g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a--~~~d-~~iLvSGD~Df~pli~~lr~~ 135 (182) ..+.+.|.+.+... ++++.... -.++|-.-+.++++.. .+-| .+|..++|.--.-+++.|++. T Consensus 146 ~~R~~GF~~~~~~~~~i~iv~~~-------------~~~~~r~~a~~~a~~~L~~~Pdi~~I~a~nd~mAlGa~~Al~~a 212 (280) T cd06303 146 TARGDTFIDCVHARNNWTLTSEF-------------YTDATRQKAYQATSDILSNNPDVDFIYACSTDIALGASDALKEL 212 (280) T ss_pred HHHHHHHHHHHHHCCCCEEEEEE-------------CCCCCHHHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHHC T ss_conf 78998899999868996799885-------------27766899999999999869893099978945789999999973 Q ss_pred CC--EEEEEEEECCCCCCCHHHHHHHCCCCE Q ss_conf 98--899998315776547089998521115 Q gi|254780936|r 136 VK--KVTIVSTVLSDPSMASDQLRRQADYFM 164 (182) Q Consensus 136 Gk--~V~v~~~~~~~~~~~S~~L~~~ad~fi 164 (182) |+ +|.++++.. ++.+|....++-+ T Consensus 213 g~~~~i~vvG~dg-----~~~~l~ai~~G~l 238 (280) T cd06303 213 GREDDILINGWGG-----GSAELDAIQQGEL 238 (280) T ss_pred CCCCCEEEEEECC-----CHHHHHHHHCCCC T ss_conf 9878879997279-----9999999984998 No 285 >cd03232 ABC_PDR_domain2 The pleiotropic drug resistance-like (PDR) family of ATP-binding cassette (ABC) transporters. PDR is a well-described phenomenon occurring in fungi and shares several similarities with processes in bacteria and higher eukaryotes. This PDR subfamily represents domain I of its (ABC-IM)2 organization. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds including sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=23.18 E-value=57 Score=13.98 Aligned_cols=42 Identities=10% Similarity=0.037 Sum_probs=30.3 Q ss_pred CCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHH Q ss_conf 866589999999984988999983157765470899985211151899 Q gi|254780936|r 121 GDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAY 168 (182) Q Consensus 121 GD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~ 168 (182) +-..+..+++.+++.|+.|.+++ ...+.++.+.||+.+-|++ T Consensus 143 ~~~~i~~~l~~l~~~g~tiii~t------h~~~~~i~~~~Driivl~~ 184 (192) T cd03232 143 AAYNIVRFLKKLADSGQAILCTI------HQPSASIFEKFDRLLLLKR 184 (192) T ss_pred HHHHHHHHHHHHHHCCCEEEEEE------CCCHHHHHHHCCEEEEECC T ss_conf 99999999999996999999998------3637999987999999979 No 286 >COG0673 MviM Predicted dehydrogenases and related proteins [General function prediction only] Probab=23.13 E-value=57 Score=13.97 Aligned_cols=31 Identities=16% Similarity=0.133 Sum_probs=21.3 Q ss_pred CCCCEEEEECCCHHHHHHHHHHHHCCCEEEE Q ss_conf 1158899983866589999999984988999 Q gi|254780936|r 111 EGLEHLVIFSGDGCFTTLVAALQRKVKKVTI 141 (182) Q Consensus 111 ~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v 141 (182) +.+|.++|.|-+.-..+.+..+-+.||.|.+ T Consensus 66 ~~iD~V~Iatp~~~H~~~a~~AL~aGkhVl~ 96 (342) T COG0673 66 PDIDAVYIATPNALHAELALAALEAGKHVLC 96 (342) T ss_pred CCCCEEEEECCCHHHHHHHHHHHHCCCEEEE T ss_conf 9998899969806779999999977996999 No 287 >pfam06506 PrpR_N Propionate catabolism activator. This domain is found at the N terminus of several sigma54- dependent transcriptional activators including PrpR, which activates catabolism of propionate. Probab=23.04 E-value=57 Score=13.96 Aligned_cols=22 Identities=18% Similarity=0.146 Sum_probs=10.7 Q ss_pred CCCHHHHHHHHHHHHCCCEEEE Q ss_conf 3866589999999984988999 Q gi|254780936|r 120 SGDGCFTTLVAALQRKVKKVTI 141 (182) Q Consensus 120 SGD~Df~pli~~lr~~Gk~V~v 141 (182) ..-.|..+.++.+++.|.+|++ T Consensus 107 ~~~~e~~~~i~~l~~~G~~vvV 128 (169) T pfam06506 107 QSEEEARAAVKELKAQGIKVIV 128 (169) T ss_pred CCHHHHHHHHHHHHHCCCEEEE T ss_conf 6889999999999986995998 No 288 >TIGR03669 urea_ABC_arch urea ABC transporter, substrate-binding protein, archaeal type. Members of this protein family are identified as the substrate-binding protein of a urea ABC transport system by similarity to a known urea transporter from Corynebacterium glutamicum, operon structure, proximity of its operons to urease (urea-utilization protein) operons, and by Partial Phylogenetic Profiling vs. urea utilization. Probab=23.04 E-value=57 Score=13.96 Aligned_cols=26 Identities=12% Similarity=0.129 Sum_probs=16.5 Q ss_pred EECCCHHHHHHHHHHHHCCCEEEEEE Q ss_conf 98386658999999998498899998 Q gi|254780936|r 118 IFSGDGCFTTLVAALQRKVKKVTIVS 143 (182) Q Consensus 118 LvSGD~Df~pli~~lr~~Gk~V~v~~ 143 (182) .=.|+.||.+.+.++++.+..+++.. T Consensus 170 ~p~g~~Dfs~~l~kI~~a~pD~V~~~ 195 (374) T TIGR03669 170 IPLSVSQFSSTIQNIQKADPDFVMSM 195 (374) T ss_pred CCCCCCCHHHHHHHHHHCCCCEEEEE T ss_conf 68999668999999986799999990 No 289 >COG2454 Uncharacterized conserved protein [Function unknown] Probab=22.86 E-value=52 Score=14.22 Aligned_cols=26 Identities=8% Similarity=-0.043 Sum_probs=14.5 Q ss_pred ECCCHHHHHHHHH-HHHCCCEEEEEEE Q ss_conf 8386658999999-9984988999983 Q gi|254780936|r 119 FSGDGCFTTLVAA-LQRKVKKVTIVST 144 (182) Q Consensus 119 vSGD~Df~pli~~-lr~~Gk~V~v~~~ 144 (182) +|+-|-....+++ +++.|..+.+..+ T Consensus 139 vs~SGel~~~i~~~mK~~~I~g~~~lv 165 (211) T COG2454 139 VSKSGELAGRIEEKMKSLGIPGEASLV 165 (211) T ss_pred CCCCHHHHHHHHHHHHHCCCCCEEEEC T ss_conf 886177899999987755887506762 No 290 >cd06307 PBP1_uncharacterized_sugar_binding Periplasmic sugar-binding domain of uncharacterized transport systems. Periplasmic sugar-binding domain of uncharacterized transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily. The members of this group are predicted to be involved in the transport of sugar-containing molecules across cellular and organellar membranes. Probab=22.84 E-value=58 Score=13.94 Aligned_cols=89 Identities=17% Similarity=0.155 Sum_probs=49.1 Q ss_pred HHHHHHHHHHHHHHHHC--CCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH--HCCCCE-EEEECCCHHHHHHHHH Q ss_conf 14677757999889755--956975345895378886543565329999999994--211588-9998386658999999 Q gi|254780936|r 57 EQQFSPLHPLLDWLHYN--GFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ--SEGLEH-LVIFSGDGCFTTLVAA 131 (182) Q Consensus 57 ~~~~~~~~~~~~~l~~~--g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~--a~~~d~-~iLvSGD~Df~pli~~ 131 (182) .....+...|.+.|+.. ++++... .....|-.-+.++++. ..+-|. .|.++||+- ..+++. T Consensus 138 ~~~~~R~~Gf~~~l~~~~p~~~iv~~-------------~~~~~~~~~a~~~~~~lL~~~Pdi~ai~~~~~~~-~ga~~A 203 (275) T cd06307 138 RGHEEREMGFRSVLREEFPGLRVLET-------------LEGLDDPARAYEATRKLLARHPDLVGIYNAGGGN-RGVIRA 203 (275) T ss_pred HHHHHHHHHHHHHHHHHCCCCCEEEE-------------ECCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCH-HHHHHH T ss_conf 32999999999999865888740246-------------6178865999999999997499962999848716-999999 Q ss_pred HHHCCC--EEEEEEEECCCCCCCHHHHHHHCCCCE Q ss_conf 998498--899998315776547089998521115 Q gi|254780936|r 132 LQRKVK--KVTIVSTVLSDPSMASDQLRRQADYFM 164 (182) Q Consensus 132 lr~~Gk--~V~v~~~~~~~~~~~S~~L~~~ad~fi 164 (182) +++.|+ ++.++++... ...+....+..+ T Consensus 204 l~~~g~~~~v~vvg~D~~-----~~~~~~l~~G~i 233 (275) T cd06307 204 LREAGRAGKVVFVGHELT-----PETRAALRDGTI 233 (275) T ss_pred HHHCCCCCCEEEEECCCC-----HHHHHHHHCCCC T ss_conf 997399998699962799-----899999981983 No 291 >TIGR00694 thiM hydroxyethylthiazole kinase; InterPro: IPR000417 Thiamine pyrophosphate (TPP), a required cofactor for many enzymes in the cell, is synthesised de novo in Salmonella typhimurium . Five kinase activities have been implicated in TPP synthesis, which involves joining a 4-methyl-5-(beta-hydroxyethyl)thiazole (THZ) moiety and a 4-amino-5- hydroxymethyl-2-methylpyrimidine (HMP) moiety , . THZ kinase (2.7.1.50 from EC) activity is involved in the salvage synthesis of TH-P from the thiazole:2-methyl-4-amino-5-hydroxymethylpyrimidine diphosphate + 4-4-methyl-5-(2-phosphonooxyethyl)-thiazole = pyrophosphate + thiamin monophosphate Hydroxyethylthiazole kinase expression is regulated at the mRNA level by intracellular thiamin pyrophosphate .; GO: 0004417 hydroxyethylthiazole kinase activity, 0009228 thiamin biosynthetic process. Probab=22.40 E-value=21 Score=16.67 Aligned_cols=80 Identities=15% Similarity=0.209 Sum_probs=46.1 Q ss_pred HHHHCCCEEEEEEEEEECCCCCCC-CCCCCCHHHH--HHHHHH----HHCCCCEEEEECCCHHHHHHHHHHHHCCCE--E Q ss_conf 897559569753458953788865-4356532999--999999----421158899983866589999999984988--9 Q gi|254780936|r 69 WLHYNGFQVVAKVAKEFTENCGRK-RVKSSMDVEL--AVDAFE----QSEGLEHLVIFSGDGCFTTLVAALQRKVKK--V 139 (182) Q Consensus 69 ~l~~~g~~v~~~~~~~~~~~~~~~-~~kk~~Dv~l--aiD~~~----~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~--V 139 (182) -|.+-++.++..-.-+...-.+.. +.-||+|..= +.|+++ .|+++.++|++||--|++ .-|.+ | T Consensus 102 lL~~g~~a~IkGN~gEI~~L~G~~~~~~rGVDs~~~~~~~~~~~~~~~A~~~~~vvv~TG~~D~v-------sDG~~GG~ 174 (282) T TIGR00694 102 LLSEGKVAAIKGNAGEILSLAGEEDGKMRGVDSGSQGAADAIAAAQEVARKYGTVVVVTGEVDYV-------SDGRSGGV 174 (282) T ss_pred HHHHCCCCEEECCHHHHHHHHHHHCCCCCCEECCCCCCHHHHHHHHHHHHHCCCEEEEECCEEEE-------ECCCCCCE T ss_conf 98517820674577999988466528401102664221358999999888709478999877567-------51886546 Q ss_pred EEEEEECCCCCCCHHHHH Q ss_conf 999831577654708999 Q gi|254780936|r 140 TIVSTVLSDPSMASDQLR 157 (182) Q Consensus 140 ~v~~~~~~~~~~~S~~L~ 157 (182) ..+. .|.++.|+..|- T Consensus 175 ~~~~--nG~~GitaedlP 190 (282) T TIGR00694 175 YVIH--NGTEGITAEDLP 190 (282) T ss_pred EEEE--CCCCCCCCCCCC T ss_conf 8875--689876313367 No 292 >PRK00414 gmhA phosphoheptose isomerase; Reviewed Probab=22.38 E-value=59 Score=13.88 Aligned_cols=52 Identities=17% Similarity=0.114 Sum_probs=37.6 Q ss_pred HHCCCCEEEEECCCHH---HHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEH Q ss_conf 4211588999838665---899999999849889999831577654708999852111518 Q gi|254780936|r 109 QSEGLEHLVIFSGDGC---FTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDL 166 (182) Q Consensus 109 ~a~~~d~~iLvSGD~D---f~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l 166 (182) +++.=|.++.+|+-|. .+.+++.++++|.+|+.++-.. ...|.+.+|--+.+ T Consensus 108 ~~~~gDvLi~iS~SGnS~Nii~A~~~Ak~~g~~~i~ltG~~------GG~l~~l~D~~i~V 162 (192) T PRK00414 108 VGREGDVLLGISTSGNSGNIIKAIEAAREKGMKVITLTGKD------GGKMAGLADIEIRV 162 (192) T ss_pred HCCCCCEEEEECCCCCCHHHHHHHHHHHHCCCEEEEEECCC------CHHHHHHCCEEEEE T ss_conf 17899999997689999999999999998899899997799------61477608979991 No 293 >TIGR02280 PaaB1 phenylacetate degradation probable enoyl-CoA hydratase paaB; InterPro: IPR011968 This family of proteins are found within apparent operons for the degradation of phenylacetic acid . These proteins contain the enoyl-CoA hydratase domain. This activity is consistent with current hypotheses for the degradation pathway which involve the ligation of phenylacetate with coenzyme A (paaF), hydroxylation (paaGHIJK), ring-opening (paaN) and degradation of the resulting fatty acid-like compound to a Krebs cycle intermediate (paaABCDE).. Probab=22.36 E-value=59 Score=13.88 Aligned_cols=46 Identities=24% Similarity=0.221 Sum_probs=34.5 Q ss_pred CCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCC Q ss_conf 53299999999942115889998386658999999998498899998315776547089998521 Q gi|254780936|r 97 SMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQAD 161 (182) Q Consensus 97 ~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad 161 (182) ..|....||- =|-|||++|+..=|-|++. +.|+..-+-..|-=+|| T Consensus 73 ~pDLG~~ve~-----------------fYnPLvRrL~~Lp~PVv~A--VNGVAAGAGAnLALAcD 118 (259) T TIGR02280 73 APDLGETVEK-----------------FYNPLVRRLRALPKPVVCA--VNGVAAGAGANLALACD 118 (259) T ss_pred CCCCCCHHHH-----------------HHCHHHHHHHHCCCCEEEE--ECCCHHHHHHHHHHHHH T ss_conf 5853221777-----------------5074899997269977985--35103447788887511 No 294 >pfam01301 Glyco_hydro_35 Glycosyl hydrolases family 35. Probab=22.26 E-value=59 Score=13.86 Aligned_cols=56 Identities=11% Similarity=0.129 Sum_probs=35.2 Q ss_pred HHHHHHHHH-CCEEEEEEEEECCCCCCH-----HHHHHHHHHHHHHHHCCCEEEEEEEEEEC Q ss_conf 999999851-792999999730688731-----46777579998897559569753458953 Q gi|254780936|r 31 RKLLKAFRS-RAIVIRAYYYTTVVGDPE-----QQFSPLHPLLDWLHYNGFQVVAKVAKEFT 86 (182) Q Consensus 31 ~~L~~~l~~-~~~l~~~~~y~~~~~~~~-----~~~~~~~~~~~~l~~~g~~v~~~~~~~~~ 86 (182) ...++.++. +.+.+..|..-+.-++.+ +.......|++..++.|.-|+.+|--+.+ T Consensus 27 ~drL~K~KA~GlN~V~tYv~Wn~HE~~~G~~dF~G~~Dl~~Fl~la~~~GL~VilRpGPYic 88 (317) T pfam01301 27 PDRLQKAKALGLNTIETYVFWNLHEPEPGQYDFSGILDLVKFIKLAQEAGLYVILRPGPYIC 88 (317) T ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCCCCEECCCCHHHHHHHHHHHHHCCCEEEECCCCEEE T ss_conf 99999999739978998557634589788401235678999999999869879952688563 No 295 >pfam02630 SCO1-SenC SCO1/SenC. This family is involved in biogenesis of respiratory and photosynthetic systems. SCO1 is required for a post-translational step in the accumulation of subunits COXI and COXII of cytochrome c oxidase. SenC is required for optimal cytochrome c oxidase activity and maximal induction of genes encoding the light-harvesting and reaction centre complexes of R. capsulatus. Probab=22.24 E-value=59 Score=13.86 Aligned_cols=87 Identities=8% Similarity=0.029 Sum_probs=42.1 Q ss_pred CCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCC-CEEEE Q ss_conf 79299999973068873146777579998897559569753458953788865435653299999999942115-88999 Q gi|254780936|r 40 RAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGL-EHLVI 118 (182) Q Consensus 40 ~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~-d~~iL 118 (182) +...+..+.|+.=++-.+..-.......+.|...+..+...-+ -.|+. . |..=.+.. ++..+ ..++- T Consensus 37 Gk~~lv~FgfT~CpdiCP~~~~~l~~~~~~l~~~~~~v~~v~i--TvDPe------r--DTpe~L~~--Y~~~f~~~~~~ 104 (159) T pfam02630 37 GKLSLIYFGFTNCPDICPPALDRLTDIIKKLKAENIDVQPVFI--SVDPE------R--DTPKVLKE--YLKNFHPSFIG 104 (159) T ss_pred CCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEE--EECCC------C--CCHHHHHH--HHHHCCCCCCE T ss_conf 9889999988899964589999999999985304775599999--96248------8--99999999--99972998553 Q ss_pred ECCCHHHHHHHHHHHHCCCEEE Q ss_conf 8386658999999998498899 Q gi|254780936|r 119 FSGDGCFTTLVAALQRKVKKVT 140 (182) Q Consensus 119 vSGD~Df~pli~~lr~~Gk~V~ 140 (182) +||+.+-+.-+ ++..|.... T Consensus 105 ltg~~~~i~~l--~~~~~v~~~ 124 (159) T pfam02630 105 LTGTTDEIKSV--AKKFKVFYS 124 (159) T ss_pred EECCHHHHHHH--HHHHCCEEE T ss_conf 48999999999--997186089 No 296 >PRK13055 putative lipid kinase; Reviewed Probab=22.13 E-value=60 Score=13.85 Aligned_cols=82 Identities=18% Similarity=0.214 Sum_probs=43.6 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHH Q ss_conf 99973068873146777579998897559569753458953788865435653299999999942115889998386658 Q gi|254780936|r 46 AYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCF 125 (182) Q Consensus 46 ~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df 125 (182) +..-.+................+.|+..|+++....... +.+.-..++-.+. ...+|.+|.+-|||=. T Consensus 5 ~~lIvNP~SG~g~~~~~~~~i~~~L~~~g~e~~~~~tt~----------~~~~a~~~a~~a~--~~g~d~Iva~GGDGTi 72 (334) T PRK13055 5 ARLIYNPTSGQEIMKKNVADILDILEQAGYETSAFQTTP----------EPNSAKNEARRAA--KAGFDLIIAAGGDGTI 72 (334) T ss_pred EEEEECCCCCCCCHHHHHHHHHHHHHHCCCEEEEEEEEC----------CCCHHHHHHHHHH--HCCCCEEEEECCCCHH T ss_conf 999999877897578899999999998698599999441----------7857999999876--5799899998776089 Q ss_pred HHHHHHHHHCCCEE Q ss_conf 99999999849889 Q gi|254780936|r 126 TTLVAALQRKVKKV 139 (182) Q Consensus 126 ~pli~~lr~~Gk~V 139 (182) -.++.-+.....+. T Consensus 73 nevvngl~~~~~~~ 86 (334) T PRK13055 73 NEVVNGIAPLEKRP 86 (334) T ss_pred HHHHHHHHCCCCCC T ss_conf 99999873469997 No 297 >PRK13644 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=22.03 E-value=58 Score=13.91 Aligned_cols=38 Identities=16% Similarity=0.208 Sum_probs=18.6 Q ss_pred CHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHH Q ss_conf 6658999999998498899998315776547089998521115189 Q gi|254780936|r 122 DGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLA 167 (182) Q Consensus 122 D~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~ 167 (182) -..+..+++++++.|+.|++++ + ..+ ++ ..||+.+-++ T Consensus 172 ~~~i~~~l~~L~~~g~TvI~it----H--dl~-~~-~~aDrvivl~ 209 (274) T PRK13644 172 GIAVLERIKKLHEKGKTIVYIT----H--NLE-EL-HDADRIIVMD 209 (274) T ss_pred HHHHHHHHHHHHHCCCEEEEEE----E--CHH-HH-HCCCEEEEEE T ss_conf 9999999999986899999983----3--789-99-7199899998 No 298 >PRK06154 hypothetical protein; Provisional Probab=21.92 E-value=60 Score=13.82 Aligned_cols=43 Identities=9% Similarity=-0.070 Sum_probs=29.7 Q ss_pred HHCCCCEEEEECCCHHHHHHHHHH---HHCCCEEEEEEEECCCCCC Q ss_conf 421158899983866589999999---9849889999831577654 Q gi|254780936|r 109 QSEGLEHLVIFSGDGCFTTLVAAL---QRKVKKVTIVSTVLSDPSM 151 (182) Q Consensus 109 ~a~~~d~~iLvSGD~Df~pli~~l---r~~Gk~V~v~~~~~~~~~~ 151 (182) ++..=..+|.++|||.|--.+.+| .++|..|.++.+-.+..+. T Consensus 436 lA~p~~~Vv~i~GDGsf~m~~qEL~Ta~r~~lpv~viV~NN~~~g~ 481 (556) T PRK06154 436 LARPDALVANLLGDAAFGECGMDFETAVRNRIPTVTILLNNGCMGG 481 (556) T ss_pred HHCCCCCEEEEECCHHHHHHHHHHHHHHHHCCCCEEEEEECCHHHH T ss_conf 7389993899987379887799999999949997899996806389 No 299 >PRK09124 pyruvate dehydrogenase; Provisional Probab=21.73 E-value=61 Score=13.80 Aligned_cols=49 Identities=24% Similarity=0.264 Sum_probs=33.8 Q ss_pred HHCCCCEEEEECCCHHHHHHHHH---HHHCCCEEEEEEEECCCCCCCHHHHH Q ss_conf 42115889998386658999999---99849889999831577654708999 Q gi|254780936|r 109 QSEGLEHLVIFSGDGCFTTLVAA---LQRKVKKVTIVSTVLSDPSMASDQLR 157 (182) Q Consensus 109 ~a~~~d~~iLvSGD~Df~pli~~---lr~~Gk~V~v~~~~~~~~~~~S~~L~ 157 (182) ++..=..+|.++|||.|.-.+.+ +.+++..|.++.+-.+..+....+.+ T Consensus 422 ~A~p~r~Vv~i~GDG~f~m~~~EL~Ta~r~~lpv~ivV~NN~~~g~i~~~~~ 473 (574) T PRK09124 422 AAHPGRQVVSLSGDGGFSMLMGDFLSLVQLKLPVKIVVFNNSVLGFVAMEMK 473 (574) T ss_pred HHCCCCCEEEEECCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHH T ss_conf 8689994899988628875499999999978895899997816789999999 No 300 >pfam02775 TPP_enzyme_C Thiamine pyrophosphate enzyme, C-terminal TPP binding domain. Probab=21.70 E-value=61 Score=13.79 Aligned_cols=31 Identities=26% Similarity=0.258 Sum_probs=11.7 Q ss_pred EEEEECCCHHHHHHHHH---HHHCCCEEEEEEEE Q ss_conf 89998386658999999---99849889999831 Q gi|254780936|r 115 HLVIFSGDGCFTTLVAA---LQRKVKKVTIVSTV 145 (182) Q Consensus 115 ~~iLvSGD~Df~pli~~---lr~~Gk~V~v~~~~ 145 (182) .+|.++||+.|.--+.+ +.+++..+.++.+. T Consensus 48 ~vi~i~GDG~f~~~~~el~Ta~~~~~~i~~iv~n 81 (150) T pfam02775 48 PVVAIAGDGGFQMNGQELATAVRYNLPITVVVLN 81 (150) T ss_pred EEEEEECCCCHHCCCCHHHHHHHHCCCCCEEEEE T ss_conf 4999988972111343799999848885689997 No 301 >pfam01640 Peptidase_C10 Peptidase C10 family. Probab=21.66 E-value=26 Score=16.14 Aligned_cols=23 Identities=4% Similarity=0.030 Sum_probs=15.7 Q ss_pred CHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 66589999999984988999983 Q gi|254780936|r 122 DGCFTTLVAALQRKVKKVTIVST 144 (182) Q Consensus 122 D~Df~pli~~lr~~Gk~V~v~~~ 144 (182) +++|..+|..=-..|.-|...+. T Consensus 292 ~~~w~~~i~~EL~~grPV~y~G~ 314 (378) T pfam01640 292 GKDWEAMIDKELAENQPVYYAGA 314 (378) T ss_pred HHHHHHHHHHHHHCCCCEEEECC T ss_conf 89999999999875998899814 No 302 >cd03224 ABC_TM1139_LivF_branched LivF (TM1139) is part of the LIV-I bacterial ABC-type two-component transport system that imports neutral, branched-chain amino acids. The E. coli branched-chain amino acid transporter comprises a heterodimer of ABC transporters (LivF and LivG), a heterodimer of six-helix TM domains (LivM and LivH), and one of two alternative soluble periplasmic substrate binding proteins (LivK or LivJ). ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. Probab=21.54 E-value=61 Score=13.77 Aligned_cols=39 Identities=13% Similarity=0.019 Sum_probs=26.9 Q ss_pred HHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHH Q ss_conf 6589999999984988999983157765470899985211151899 Q gi|254780936|r 123 GCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAY 168 (182) Q Consensus 123 ~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~ 168 (182) ..+..+++.+++.|+.|.+++ + --.+..+.||+++-+.+ T Consensus 169 ~~i~~~l~~l~~~g~tii~vt----H---dl~~~~~~~Drv~vl~~ 207 (222) T cd03224 169 EEIFEAIRELRDEGVTILLVE----Q---NARFALEIADRAYVLER 207 (222) T ss_pred HHHHHHHHHHHHCCCEEEEEC----C---CHHHHHHHCCEEEEEEC T ss_conf 999999999995699999990----8---58999996999999979 No 303 >PRK11269 glyoxylate carboligase; Provisional Probab=21.51 E-value=62 Score=13.77 Aligned_cols=45 Identities=18% Similarity=0.081 Sum_probs=32.7 Q ss_pred HHCCCCEEEEECCCHHHHHHHHHHH---HCCCEEEEEEEECCCCCCCH Q ss_conf 4211588999838665899999999---84988999983157765470 Q gi|254780936|r 109 QSEGLEHLVIFSGDGCFTTLVAALQ---RKVKKVTIVSTVLSDPSMAS 153 (182) Q Consensus 109 ~a~~~d~~iLvSGD~Df~pli~~lr---~~Gk~V~v~~~~~~~~~~~S 153 (182) ++.-=..+|.++||++|--.+.+|- ++|..|+++.+-.+..+++- T Consensus 433 lA~Pdr~VV~i~GDG~f~m~~qEL~Tavr~~lpvv~vV~NN~~~G~Ir 480 (591) T PRK11269 433 AADPERPVVALSGDYDFQFLIEELAVGAQFNLPYIHVLVNNAYLGLIR 480 (591) T ss_pred HHCCCCCEEEEECCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHH T ss_conf 978999689998736875419999999995889089999688230999 No 304 >PRK12474 hypothetical protein; Provisional Probab=21.41 E-value=62 Score=13.76 Aligned_cols=44 Identities=25% Similarity=0.271 Sum_probs=30.0 Q ss_pred CEEEEECCCHHHHHHHHHH---HHCCCEEEEEEEECCCCCCCHHHHH Q ss_conf 8899983866589999999---9849889999831577654708999 Q gi|254780936|r 114 EHLVIFSGDGCFTTLVAAL---QRKVKKVTIVSTVLSDPSMASDQLR 157 (182) Q Consensus 114 d~~iLvSGD~Df~pli~~l---r~~Gk~V~v~~~~~~~~~~~S~~L~ 157 (182) ..+|.++||+.|.-.+.+| .++|..|+++.+..+..+....+++ T Consensus 408 r~Vv~i~GDG~f~m~~~EL~Tavr~~lpi~~vV~NN~~yg~i~~~~~ 454 (518) T PRK12474 408 RKVVCPQGDGGAAYTMQALWTMARENLDVTVVIFANRSYAILNGELQ 454 (518) T ss_pred CCEEEEECCCCHHCCHHHHHHHHHHCCCEEEEEEECCCCHHHHHHHH T ss_conf 81899947810410799999999978592999997863016999999 No 305 >pfam01513 NAD_kinase ATP-NAD kinase. Members of this family include ATP-NAD kinases EC:2.7.1.23, which catalyses the phosphorylation of NAD to NADP utilising ATP and other nucleoside triphosphates as well as inorganic polyphosphate as a source of phosphorus. Also includes NADH kinases EC:2.7.1.86. Probab=21.27 E-value=62 Score=13.74 Aligned_cols=33 Identities=21% Similarity=0.349 Sum_probs=26.1 Q ss_pred CCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEE Q ss_conf 115889998386658999999998498899998 Q gi|254780936|r 111 EGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVS 143 (182) Q Consensus 111 ~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~ 143 (182) .++|.+|.+-|||=|..+++.+...++.|..+. T Consensus 34 ~~~Dlii~lGGDGT~L~~~~~~~~~~~PilGin 66 (243) T pfam01513 34 EGVDLIVVLGGDGTALDAARLLGDHDIPILGIN 66 (243) T ss_pred CCCCEEEEECCCHHHHHHHHHHCCCCCCEEEEE T ss_conf 598899998987899999998456799589985 No 306 >cd01536 PBP1_ABC_sugar_binding_like Periplasmic sugar-binding domain of active transport systems that are members of the type I periplasmic binding protein (PBP1) superfamily. Periplasmic sugar-binding domain of active transport systems that are members of the type I periplasmic binding protein (PBP1) superfamily. The members of this family function as the primary receptors for chemotaxis and transport of many sugar based solutes in bacteria and archaea. The sugar binding domain is also homologous to the ligand-binding domain of eukaryotic receptors such as glutamate receptor (GluR) and DNA-binding transcriptional repressors such as LacI and GalR. Moreover, this periplasmic binding domain, also known as Venus flytrap domain, undergoes transition from an open to a closed conformational state upon the binding of ligands such as lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars. This family also includes the periplasmic binding domain of autoinducer-2 (AI-2 Probab=21.27 E-value=62 Score=13.74 Aligned_cols=82 Identities=20% Similarity=0.114 Sum_probs=49.4 Q ss_pred CCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH----HCCCCEEEEECCCHHHHH Q ss_conf 6887314677757999889755956975345895378886543565329999999994----211588999838665899 Q gi|254780936|r 52 VVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAVDAFEQ----SEGLEHLVIFSGDGCFTT 127 (182) Q Consensus 52 ~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~laiD~~~~----a~~~d~~iLvSGD~Df~p 127 (182) .+........+...|.++|.+.+-...... ...+.+..-+..+++. .+.+ ++|++++|.--.. T Consensus 129 g~~~~~~~~~R~~Gf~~al~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~l~~~~~~-~ai~~~nd~~a~g 195 (267) T cd01536 129 GPPGSSNAQERVKGFRDALKEYPDIEIVAV------------QDGNWDREKALQAMEDLLQANPDI-DAIFAANDSMALG 195 (267) T ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCCCCHH------------HCCCCCHHHHHHHHHHHHHCCCCC-EEEEECCCHHHHH T ss_conf 899886899999999999984558662022------------200110257889999776417986-2999716499999 Q ss_pred HHHHHHHCCC--EEEEEEEEC Q ss_conf 9999998498--899998315 Q gi|254780936|r 128 LVAALQRKVK--KVTIVSTVL 146 (182) Q Consensus 128 li~~lr~~Gk--~V~v~~~~~ 146 (182) +++.+++.|+ .+.++++.. T Consensus 196 ~~~al~~~g~~~di~viG~D~ 216 (267) T cd01536 196 AVAALKAAGRKGDVKIVGVDG 216 (267) T ss_pred HHHHHHHCCCCCCEEEEEECC T ss_conf 999999769999939999898 No 307 >cd06283 PBP1_RegR_EndR_KdgR_like Ligand-binding domain of DNA transcription repressor RegR and other putative regulators such as KdgR and EndR. Ligand-binding domain of DNA transcription repressor RegR and other putative regulators such as KdgR and EndR, all of which are members of the LacI-GalR family of bacterial transcription regulators. RegR regulates bacterial competence and the expression of virulence factors, including hyaluronidase. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA b Probab=21.21 E-value=62 Score=13.73 Aligned_cols=104 Identities=5% Similarity=0.069 Sum_probs=57.3 Q ss_pred HHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHH-HHHHHHC Q ss_conf 999985179299999973068873146777579998897559569753458953788865435653299999-9999421 Q gi|254780936|r 33 LLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKVAKEFTENCGRKRVKSSMDVELAV-DAFEQSE 111 (182) Q Consensus 33 L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv~lai-D~~~~a~ 111 (182) +.+++.+.+.-..++.. +..........+...|.+++...|..+....... .. ....+..-++ .++.-.+ T Consensus 107 ~~~~L~~~G~~~i~~i~-~~~~~~~~~~~R~~g~~~a~~~~~~~~~~~~~~~-~~-------~~~~~~~~~~~~~l~~~~ 177 (267) T cd06283 107 AVDHLIEKGYERILFVT-EPLDEISPRMERYEGFKEALAEHGIGVNEELIEI-DD-------EDADELDERLRQLLNKPK 177 (267) T ss_pred HHHHHHHCCCCEEEEEE-CCCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEE-CC-------CCHHHHHHHHHHHHHCCC T ss_conf 99999975997299995-6877883799999999999997599765148983-35-------426779999999984799 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHCCCEE----EEEEEEC Q ss_conf 1588999838665899999999849889----9998315 Q gi|254780936|r 112 GLEHLVIFSGDGCFTTLVAALQRKVKKV----TIVSTVL 146 (182) Q Consensus 112 ~~d~~iLvSGD~Df~pli~~lr~~Gk~V----~v~~~~~ 146 (182) .+ ++|+++.|.--..++..+++.|.+| -|+++.. T Consensus 178 ~~-~aii~~~D~~A~g~~~~l~~~g~~vP~disVvg~dd 215 (267) T cd06283 178 KK-TAIFAANGLILLEVLKALKELGIRIPEDVGLIGFDD 215 (267) T ss_pred CC-CEECCCCCHHHHHHHHHHHHCCCCCCCCCEEEEECC T ss_conf 98-642037709999999999982998898766885388 No 308 >TIGR02252 DREG-2 REG-2-like, HAD superfamily (subfamily IA) hydrolase; InterPro: IPR011949 This family of proteins includes uncharacterised sequences from eukaryotes, cyanobacteria and Leptospira as well as the DREG-2 protein from Drosophila melanogaster which has been identified as a rhythmically (diurnally) regulated gene . This family is a member of the Haloacid Dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases. The superfamily is defined by the presence of three short catalytic motifs . The subfamilies are defined based on the location and the observed or predicted fold of a so-called, capping domain, , or the absence of such a domain. This family is a member of subfamily 1A in which the cap domain consists of a predicted alpha helical bundle found in between the first and second catalytic motifs. A distinctive feature of this family is a conserved tandem pair of tryptophan residues in the cap domain. The most divergent sequences included within the scope of this entry are from plants and have "FW" at this position instead. Most likely, these sequences, like the vast majority of HAD sequences, represent phosphatase enzymes.. Probab=21.21 E-value=62 Score=13.73 Aligned_cols=44 Identities=11% Similarity=0.084 Sum_probs=28.1 Q ss_pred CHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCC--EEEEEE Q ss_conf 3299999999942115889998386658999999998498--899998 Q gi|254780936|r 98 MDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVK--KVTIVS 143 (182) Q Consensus 98 ~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk--~V~v~~ 143 (182) ..-.++-.++++=..-..-.++ ++|..++++.+|+.|+ .+.|++ T Consensus 97 ~~~~~~~~Ly~~F~~~~~W~~~--~~d~~~~L~~~r~~g~k~~lgviS 142 (224) T TIGR02252 97 SFEKIFEELYSYFATPEPWQVY--PPDAQKLLKDLREEGLKVKLGVIS 142 (224) T ss_pred CHHHHHHHHHHHHCCCCCCEEC--CCCHHHHHHHHHHCCCCEEEEEEE T ss_conf 4789999999872787786203--755789999999728965899874 No 309 >pfam09818 ABC_ATPase Predicted ATPase of the ABC class. Members of this family include various bacterial predicted ABC class ATPases. Probab=21.12 E-value=55 Score=14.06 Aligned_cols=45 Identities=16% Similarity=0.146 Sum_probs=30.9 Q ss_pred EECCCH-HHHHHHHHHHHC----CCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHHH Q ss_conf 983866-589999999984----9889999831577654708999852111518997 Q gi|254780936|r 118 IFSGDG-CFTTLVAALQRK----VKKVTIVSTVLSDPSMASDQLRRQADYFMDLAYL 169 (182) Q Consensus 118 LvSGD~-Df~pli~~lr~~----Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~l 169 (182) ||+.|. =-+|+++++|+. |..++++. |. |..+-..||..|-+++. T Consensus 364 Lv~~~kEPITPfvdrvr~L~~~~GVStilV~---Gg----sgdy~~vAD~VI~Md~Y 413 (447) T pfam09818 364 LVSKEKEPITPFVDRVRSLYDDLGVSTILVV---GG----SGDYLDVADTVIQMDDY 413 (447) T ss_pred HHHCCCCCCCHHHHHHHHHHHHCCCEEEEEE---CC----CCCCHHHCCEEEEECCC T ss_conf 7515887776089999999986295499995---66----51135348889996386 No 310 >COG2145 ThiM Hydroxyethylthiazole kinase, sugar kinase family [Coenzyme metabolism] Probab=21.06 E-value=31 Score=15.60 Aligned_cols=67 Identities=18% Similarity=0.226 Sum_probs=38.0 Q ss_pred HHHCCCEEEEEEEEEECCCCCCCCCCCCCHH-HHHHHHHHHH----CCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEE Q ss_conf 9755956975345895378886543565329-9999999942----115889998386658999999998498899998 Q gi|254780936|r 70 LHYNGFQVVAKVAKEFTENCGRKRVKSSMDV-ELAVDAFEQS----EGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVS 143 (182) Q Consensus 70 l~~~g~~v~~~~~~~~~~~~~~~~~kk~~Dv-~laiD~~~~a----~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~ 143 (182) |.+..+.++..-.-+...-.+..+.-||+|. .-..|.++.+ +.+.++|++||--|++ ..|.+|.++. T Consensus 109 L~~~~~~~IrGN~sEI~~Lag~~~~~kGVDa~~~~~~~~~~a~~~A~~~~~vvvvTG~vD~I-------sdg~~~~~i~ 180 (265) T COG2145 109 LAEVKPAAIRGNASEIAALAGEAGGGKGVDAGDGAADAIEAAKKAAQKYGTVVVVTGEVDYI-------SDGTRVVVIH 180 (265) T ss_pred HHHCCCCEEECCHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEECCEEEE-------EECCEEEEEE T ss_conf 97068747825699999973302667454666431469999999988759189997782489-------7087699987 No 311 >PRK10785 maltodextrin glucosidase; Provisional Probab=20.84 E-value=63 Score=13.68 Aligned_cols=21 Identities=10% Similarity=0.059 Sum_probs=9.1 Q ss_pred HHHHHH-HHHHHCCCEEEEEEE Q ss_conf 589999-999984988999983 Q gi|254780936|r 124 CFTTLV-AALQRKVKKVTIVST 144 (182) Q Consensus 124 Df~pli-~~lr~~Gk~V~v~~~ 144 (182) .|...+ +.+++.+..+.++++ T Consensus 351 ~f~~~~r~~vk~~~Pd~~ilGE 372 (608) T PRK10785 351 QHVAGITQAAKEENPEAYILGE 372 (608) T ss_pred HHHHHHHHHHHHHCCCCEEEEE T ss_conf 9999999999976898489976 No 312 >COG2241 CobL Precorrin-6B methylase 1 [Coenzyme metabolism] Probab=20.74 E-value=64 Score=13.67 Aligned_cols=27 Identities=33% Similarity=0.447 Sum_probs=19.4 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHCCCE Q ss_conf 158899983866589999999984988 Q gi|254780936|r 112 GLEHLVIFSGDGCFTTLVAALQRKVKK 138 (182) Q Consensus 112 ~~d~~iLvSGD~Df~pli~~lr~~Gk~ 138 (182) ..+++||.|||--|--+-..++..... T Consensus 68 g~~v~VLasGDP~f~G~g~~l~~~~~~ 94 (210) T COG2241 68 GRDVVVLASGDPLFSGVGRLLRRKFSC 94 (210) T ss_pred CCCEEEEECCCCCHHHHHHHHHHHCCC T ss_conf 997699963896035368999986476 No 313 >TIGR01194 cyc_pep_trnsptr cyclic peptide transporter; InterPro: IPR005898 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Bacteria have elaborate pathways for the production of toxins and secondary metabolites. Many such compounds, including syringomycin and pyoverdine are synthesized on non-ribosomal templates consisting of a multienzyme complex. On several occasions the proteins of the complex and transporter protein are present on the same operon. Often times these compounds cross the biological membrane by specific transporters. This model describes a family of cyclic peptide transporters in bacteria. Syringomycin is an amphipathic, cyclic lipodepsipeptide when inserted into host causes formation of channels, permeable to a variety of cations. On the other hand, pyoverdine is a cyclic octa-peptidyl dihydroxyquinoline, which is efficient in sequestering iron for uptake.; GO: 0005524 ATP binding, 0015197 peptide transporter activity, 0015833 peptide transport, 0016021 integral to membrane. Probab=20.72 E-value=44 Score=14.71 Aligned_cols=124 Identities=20% Similarity=0.230 Sum_probs=58.1 Q ss_pred CCHHHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEE-EEEEC-CCCCCCCCCCCCHHHHHHH Q ss_conf 798999999851792999999730688731467775799988975595697534-58953-7888654356532999999 Q gi|254780936|r 28 IDYRKLLKAFRSRAIVIRAYYYTTVVGDPEQQFSPLHPLLDWLHYNGFQVVAKV-AKEFT-ENCGRKRVKSSMDVELAVD 105 (182) Q Consensus 28 ~d~~~L~~~l~~~~~l~~~~~y~~~~~~~~~~~~~~~~~~~~l~~~g~~v~~~~-~~~~~-~~~~~~~~kk~~Dv~laiD 105 (182) =||..|...+..++.|- .....+++.+..+.++-.+.|.+++.+-..+- --.++ ....-.+++| +||+= T Consensus 412 ~DYR~LFS~~F~Dy~LF-----~~L~~Pde~~~~S~~~A~~YL~~L~~~~KV~~~~~~yStTT~LS~GQ~K----RLAL~ 482 (555) T TIGR01194 412 DDYRDLFSAVFADYYLF-----DDLVQPDEKKQASLDNATTYLSRLELEDKVKVEDLNYSTTTDLSYGQRK----RLALV 482 (555) T ss_pred HHHHHHHHHHHHHHHHH-----HHHCCCCCCCCCCHHHHHHHHHHCCCCCCEEEECCCCCHHHHCCCCHHH----HHHHH T ss_conf 23899999998889988-----8642876666654588999997526211146604763302101121368----89999 Q ss_pred HHHHHCC-CCEEEEECCCHH-------HHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHH Q ss_conf 9994211-588999838665-------89999999984988999983157765470899985211151899 Q gi|254780936|r 106 AFEQSEG-LEHLVIFSGDGC-------FTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAY 168 (182) Q Consensus 106 ~~~~a~~-~d~~iLvSGD~D-------f~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~ 168 (182) ..=+.++ +=.+===..|.| |.+++=-||++||.+.|++- -...-+.||+.|-|.+ T Consensus 483 ~AyLE~RP~~~lDEWAADQDP~F~R~FY~ELLPDLKR~GKTI~~I~H--------DD~YF~lAD~~I~L~~ 545 (555) T TIGR01194 483 NAYLEDRPVLLLDEWAADQDPAFKRVFYEELLPDLKRRGKTILVISH--------DDRYFELADRLIKLAD 545 (555) T ss_pred HHHHHCCCEEEEECCCCCCCCCCCCCCHHHCCHHHHHCCCEEEEEEC--------CCHHHHHHHHHHHHHC T ss_conf 87763487246532102687863321053237113007877999802--------7247899999998737 No 314 >pfam08423 Rad51 Rad51. Rad51 is a DNA repair and recombination protein and is a homologue of the bacterial ATPase RecA protein. Probab=20.33 E-value=65 Score=13.61 Aligned_cols=16 Identities=6% Similarity=0.115 Sum_probs=10.9 Q ss_pred HHHHHCCCCEEEEECC Q ss_conf 9994211588999838 Q gi|254780936|r 106 AFEQSEGLEHLVIFSG 121 (182) Q Consensus 106 ~~~~a~~~d~~iLvSG 121 (182) ...+|..++.+|++|+ T Consensus 175 L~~lA~~~~~aVvvTN 190 (261) T pfam08423 175 LQRLADEFGVAVVITN 190 (261) T ss_pred HHHHHHHCCCEEEEEE T ss_conf 9999998095899960 No 315 >cd03823 GT1_ExpE7_like This family is most closely related to the GT1 family of glycosyltransferases. ExpE7 in Sinorhizobium meliloti has been shown to be involved in the biosynthesis of galactoglucans (exopolysaccharide II). Probab=20.32 E-value=65 Score=13.61 Aligned_cols=68 Identities=13% Similarity=0.137 Sum_probs=40.7 Q ss_pred CCCCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCE Q ss_conf 5435653299999999942115889998386658999999998498899998315776547089998521115 Q gi|254780936|r 92 KRVKSSMDVELAVDAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFM 164 (182) Q Consensus 92 ~~~kk~~Dv~laiD~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi 164 (182) ...+||.| +.++++...++-+.-.++-||+...+........+.+|...+... ..--.++-..||-|+ T Consensus 200 l~~~Kg~~--~li~a~~~l~~~~~~l~i~G~g~~~~~~~~~~~~~~~v~f~G~~~---~~~~~~~~~~adi~v 267 (359) T cd03823 200 LTPHKGVD--LLLEAFKRLPRGDIELVIVGNGLELEEESYELEGDPRVEFLGAYP---QEEIDDFYAEIDVLV 267 (359) T ss_pred CCCCCCHH--HHHHHHHHCCCCCCEEEEECCCCHHHHHHHHHHCCCCCEECCCCC---HHHHHHHHHHCCCCC T ss_conf 97625999--999999855557828999778605689999972577648806567---899999998657365 No 316 >PRK12778 putative bifunctional 2-polyprenylphenol hydroxylase/glutamate synthase subunit beta; Provisional Probab=20.31 E-value=65 Score=13.61 Aligned_cols=15 Identities=13% Similarity=-0.008 Sum_probs=6.6 Q ss_pred HHHHHHHHCCCEEEE Q ss_conf 999889755956975 Q gi|254780936|r 65 PLLDWLHYNGFQVVA 79 (182) Q Consensus 65 ~~~~~l~~~g~~v~~ 79 (182) .-.+.+...|+++.. T Consensus 494 ~eI~~l~~lGV~~~~ 508 (760) T PRK12778 494 VEIENLVKMGVTFEK 508 (760) T ss_pred HHHHHHHHCCCEEEC T ss_conf 999999864979989 No 317 >PRK13886 conjugal transfer protein TraL; Provisional Probab=20.28 E-value=65 Score=13.61 Aligned_cols=78 Identities=15% Similarity=0.102 Sum_probs=49.2 Q ss_pred CCCHHHHHHHHHHHH-CCCCEEEEECCCHHHHHHHHH---------HHHCCCEEEEEEEECCCC--CCCHHHHHHHCCCC Q ss_conf 653299999999942-115889998386658999999---------998498899998315776--54708999852111 Q gi|254780936|r 96 SSMDVELAVDAFEQS-EGLEHLVIFSGDGCFTTLVAA---------LQRKVKKVTIVSTVLSDP--SMASDQLRRQADYF 163 (182) Q Consensus 96 k~~Dv~laiD~~~~a-~~~d~~iLvSGD~Df~pli~~---------lr~~Gk~V~v~~~~~~~~--~~~S~~L~~~ad~f 163 (182) ..+|.+.--.+++.. ..=+.+|+=+|=|=|+|+... +++.|.+|.+-++..|.. .-|-.-|..-++.| T Consensus 63 ~~i~~r~FD~l~E~i~~~~~~~VID~GAstfvPL~~Yl~eN~v~~ll~e~G~~v~iH~vi~GGqa~~DT~~gl~~l~~~~ 142 (241) T PRK13886 63 DEINTRNFDALVEMIASTEGDVIIDNGASSFVPLSHYLISNQVPALLQDMGHELVVHTVVTGGQALLDTVSGFAQLASQF 142 (241) T ss_pred CCCCHHHHHHHHHHHHCCCCCEEEECCHHHHHHHHHHHHHCCHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHHHHC T ss_conf 86566666899999856899889988517677899999878869999977973999998459842778999999999865 Q ss_pred -------EEHHHHHHHH Q ss_conf -------5189978762 Q gi|254780936|r 164 -------MDLAYLKNEI 173 (182) Q Consensus 164 -------i~l~~l~~~i 173 (182) +++.+..+.+ T Consensus 143 ~~~~~~VVWlN~~~G~i 159 (241) T PRK13886 143 PAECLFVVWLNPYWGPI 159 (241) T ss_pred CCCCCEEEEECCCCCCC T ss_conf 87787899974887864 No 318 >cd02514 GT13_GLCNAC-TI GT13_GLCNAC-TI is involved in an essential step in the synthesis of complex or hybrid-type N-linked oligosaccharides. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GLCNAC-T I , GNT-I) transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide, an essential step in the synthesis of complex or hybrid-type N-linked oligosaccharides. The enzyme is an integral membrane protein localized to the Golgi apparatus. The catalytic domain is located at the C-terminus. These proteins are members of the glycosy transferase family 13. Probab=20.28 E-value=65 Score=13.61 Aligned_cols=10 Identities=20% Similarity=0.534 Sum_probs=3.4 Q ss_pred CCEEEEECCC Q ss_conf 5889998386 Q gi|254780936|r 113 LEHLVIFSGD 122 (182) Q Consensus 113 ~d~~iLvSGD 122 (182) ++.+||+--| T Consensus 98 ~~~vIIlEDD 107 (334) T cd02514 98 YSFVIILEDD 107 (334) T ss_pred CCEEEEEECC T ss_conf 8769999456 No 319 >PTZ00052 thioredoxin reductase; Provisional Probab=20.24 E-value=65 Score=13.60 Aligned_cols=39 Identities=13% Similarity=0.065 Sum_probs=28.8 Q ss_pred HHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEE Q ss_conf 999942115889998386658999999998498899998 Q gi|254780936|r 105 DAFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVS 143 (182) Q Consensus 105 D~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~ 143 (182) +++++..--..++++-|--==+++...+.+.|.+|+++. T Consensus 214 ~~l~l~~lP~~lvIIGgG~IG~E~A~if~~lGs~VTi~~ 252 (541) T PTZ00052 214 DIFSLKKSPGKTLVVGASYVALECAGFLNSLGFDVTVAV 252 (541) T ss_pred HHCCCCCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEE T ss_conf 220653489628998986999999999997598589995 No 320 >PRK08010 pyridine nucleotide-disulfide oxidoreductase; Provisional Probab=20.15 E-value=66 Score=13.59 Aligned_cols=38 Identities=24% Similarity=0.168 Sum_probs=21.9 Q ss_pred HHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEE Q ss_conf 99942115889998386658999999998498899998 Q gi|254780936|r 106 AFEQSEGLEHLVIFSGDGCFTTLVAALQRKVKKVTIVS 143 (182) Q Consensus 106 ~~~~a~~~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~ 143 (182) ++++..--..++++-|.-==+++...+++.|.+|+++- T Consensus 151 ~~~l~~lP~~l~IiGgG~ig~E~A~~~~~lG~~Vtiie 188 (441) T PRK08010 151 LLNLKELPGHLGILGGGYIGVEFASMFANFGSKVTILE 188 (441) T ss_pred HHCCCCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEE T ss_conf 52521169679998985899999999997598788970 No 321 >PRK05752 uroporphyrinogen-III synthase; Validated Probab=20.07 E-value=66 Score=13.58 Aligned_cols=32 Identities=19% Similarity=0.263 Sum_probs=22.9 Q ss_pred CCEEEEECCCHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 58899983866589999999984988999983 Q gi|254780936|r 113 LEHLVIFSGDGCFTTLVAALQRKVKKVTIVST 144 (182) Q Consensus 113 ~d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~ 144 (182) =..+.|+.|++-=.-+.+.|+++|.+|..+.. T Consensus 130 g~rVLi~rG~gGR~~L~~~L~~rGa~V~~~~~ 161 (255) T PRK05752 130 DPRVLIMRGEGGRELLAERLREQGASVDYLEL 161 (255) T ss_pred CCEEEEEECCCCHHHHHHHHHHCCCEEEEEEE T ss_conf 88799981676178999999988998989987 No 322 >PRK11614 livF leucine/isoleucine/valine transporter ATP-binding subunit; Provisional Probab=20.05 E-value=66 Score=13.58 Aligned_cols=39 Identities=15% Similarity=0.096 Sum_probs=26.8 Q ss_pred HHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHCCCCEEHHH Q ss_conf 6589999999984988999983157765470899985211151899 Q gi|254780936|r 123 GCFTTLVAALQRKVKKVTIVSTVLSDPSMASDQLRRQADYFMDLAY 168 (182) Q Consensus 123 ~Df~pli~~lr~~Gk~V~v~~~~~~~~~~~S~~L~~~ad~fi~l~~ 168 (182) ..+..+++.+++.|+.|.+++ + --.++.+.||+.+-|.+ T Consensus 174 ~~i~~~l~~l~~~g~tii~vs----H---~l~~~~~~aDrv~vl~~ 212 (237) T PRK11614 174 QQIFDTIEQLREQGMTIFLVE----Q---NANQALKLADRGYVLEN 212 (237) T ss_pred HHHHHHHHHHHHCCCEEEEEE----C---CHHHHHHHCCEEEEEEC T ss_conf 999999999996799999994----7---58999996999999979 No 323 >COG0855 Ppk Polyphosphate kinase [Inorganic ion transport and metabolism] Probab=20.05 E-value=66 Score=13.58 Aligned_cols=44 Identities=18% Similarity=0.193 Sum_probs=34.7 Q ss_pred HHHHHHHH---CCC----CEEEEECCCHHHHHHHHHHHHCCCEEEEEEEEC Q ss_conf 99999942---115----889998386658999999998498899998315 Q gi|254780936|r 103 AVDAFEQS---EGL----EHLVIFSGDGCFTTLVAALQRKVKKVTIVSTVL 146 (182) Q Consensus 103 aiD~~~~a---~~~----d~~iLvSGD~Df~pli~~lr~~Gk~V~v~~~~~ 146 (182) ++|.++.| +++ -|+.=+|+||--+.++-.+-+.||+|+++.+-+ T Consensus 357 Vv~fl~qAA~DP~VLAIKqTLYRt~~dSpIV~ALi~AA~nGKqVtvlVELk 407 (696) T COG0855 357 VVEFLRQAAADPDVLAIKQTLYRTSKDSPIVRALIDAAENGKQVTVLVELK 407 (696) T ss_pred HHHHHHHHHCCCCEEEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEEEEH T ss_conf 999999861499858988778752899879999999998098689999972 Done!