Query gi|254780898|ref|YP_003065311.1| hypothetical protein CLIBASIA_03975 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 207 No_of_seqs 149 out of 737 Neff 7.0 Searched_HMMs 39220 Date Tue May 31 14:40:04 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780898.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam03162 Y_phosphatase2 Tyros 100.0 1.1E-28 2.8E-33 200.9 13.9 134 41-184 4-142 (150) 2 KOG1720 consensus 99.9 1E-21 2.7E-26 157.0 11.7 168 13-186 22-206 (225) 3 pfam04273 DUF442 Putative phos 99.8 3.6E-20 9.3E-25 147.3 13.3 104 43-150 1-107 (110) 4 KOG1572 consensus 99.8 3E-20 7.6E-25 147.8 12.4 134 41-184 57-199 (249) 5 COG2365 Protein tyrosine/serin 99.8 5.6E-19 1.4E-23 139.8 7.3 132 41-172 44-181 (249) 6 COG3453 Uncharacterized protei 99.7 4.5E-16 1.2E-20 121.5 13.8 116 44-164 3-121 (130) 7 cd00127 DSPc Dual specificity 99.7 1.2E-15 3.1E-20 118.8 14.1 130 44-183 2-135 (139) 8 smart00195 DSPc Dual specifici 99.6 1.1E-14 2.7E-19 112.9 13.4 126 46-183 3-132 (138) 9 TIGR01244 TIGR01244 conserved 99.6 1E-14 2.7E-19 112.9 12.4 117 43-164 1-121 (136) 10 PTZ00242 protein tyrosine phos 99.6 1.1E-14 2.8E-19 112.8 12.3 114 54-173 24-146 (172) 11 pfam00782 DSPc Dual specificit 99.6 2.3E-14 5.8E-19 110.8 12.9 121 52-183 1-125 (131) 12 KOG1719 consensus 99.6 9.8E-14 2.5E-18 106.8 12.7 121 43-167 24-148 (183) 13 PRK12361 hypothetical protein; 99.5 1.1E-12 2.9E-17 100.2 12.0 140 31-184 79-225 (546) 14 KOG1716 consensus 99.4 1.7E-11 4.3E-16 92.8 13.4 147 42-197 73-223 (285) 15 pfam05706 CDKN3 Cyclin-depende 99.3 1E-10 2.6E-15 87.9 13.9 129 60-190 61-199 (207) 16 KOG1718 consensus 99.2 5.2E-10 1.3E-14 83.4 12.8 142 41-194 14-159 (198) 17 COG2453 CDC14 Predicted protei 99.2 3.4E-10 8.7E-15 84.5 10.2 77 91-167 65-145 (180) 18 KOG1717 consensus 99.1 1.8E-09 4.5E-14 80.0 12.2 130 46-185 174-307 (343) 19 KOG2836 consensus 98.8 6.6E-08 1.7E-12 70.1 10.4 111 56-173 23-142 (173) 20 KOG2283 consensus 98.7 1.6E-08 4.2E-13 73.9 4.3 136 23-169 6-149 (434) 21 pfam00102 Y_phosphatase Protei 98.7 2.2E-07 5.5E-12 66.9 9.3 122 52-173 44-220 (234) 22 smart00194 PTPc Protein tyrosi 98.6 2.1E-07 5.3E-12 67.0 8.8 114 59-172 81-243 (258) 23 cd00047 PTPc Protein tyrosine 98.5 7.5E-07 1.9E-11 63.5 9.2 122 52-173 43-217 (231) 24 smart00404 PTPc_motif Protein 98.2 8.3E-06 2.1E-10 56.9 8.6 76 100-175 3-93 (105) 25 smart00012 PTPc_DSPc Protein t 98.2 8.3E-06 2.1E-10 56.9 8.6 76 100-175 3-93 (105) 26 KOG2386 consensus 98.2 3.3E-06 8.5E-11 59.4 6.0 132 57-193 47-188 (393) 27 COG5599 PTP2 Protein tyrosine 98.0 5.4E-06 1.4E-10 58.1 3.8 50 105-154 192-244 (302) 28 COG5350 Predicted protein tyro 97.7 0.00029 7.4E-09 47.2 8.9 106 61-166 22-132 (172) 29 KOG0791 consensus 97.7 7E-05 1.8E-09 51.1 5.4 124 49-172 163-336 (374) 30 KOG0792 consensus 97.6 0.00028 7.2E-09 47.3 7.6 132 59-193 950-1132(1144) 31 KOG4228 consensus 97.6 4.2E-05 1.1E-09 52.5 2.6 47 106-152 703-754 (1087) 32 KOG0789 consensus 97.6 0.0002 5.2E-09 48.2 6.0 108 61-168 181-345 (415) 33 KOG0790 consensus 97.4 6.9E-05 1.8E-09 51.1 1.3 47 104-150 420-473 (600) 34 KOG4228 consensus 97.2 0.0021 5.5E-08 41.7 7.3 108 59-167 907-1062(1087) 35 KOG0793 consensus 97.0 0.00084 2.1E-08 44.3 3.7 113 39-151 787-950 (1004) 36 cd01522 RHOD_1 Member of the R 96.5 0.015 3.9E-07 36.4 7.1 86 58-157 1-90 (117) 37 cd01518 RHOD_YceA Member of th 96.3 0.011 2.9E-07 37.2 5.5 83 56-156 2-86 (101) 38 cd01533 4RHOD_Repeat_2 Member 96.2 0.014 3.6E-07 36.6 5.9 87 52-157 6-92 (109) 39 KOG4471 consensus 96.2 0.014 3.7E-07 36.5 5.9 51 119-171 362-422 (717) 40 cd01447 Polysulfide_ST Polysul 95.8 0.017 4.2E-07 36.1 4.5 82 58-157 1-87 (103) 41 cd01527 RHOD_YgaP Member of th 95.4 0.026 6.8E-07 34.9 4.5 69 58-148 4-73 (99) 42 cd01444 GlpE_ST GlpE sulfurtra 95.3 0.069 1.8E-06 32.2 6.4 82 57-157 1-82 (96) 43 PRK10287 thiosulfate:cyanide s 95.3 0.021 5.4E-07 35.4 3.8 64 75-157 23-86 (104) 44 cd01528 RHOD_2 Member of the R 95.2 0.16 4.1E-06 29.9 8.0 81 57-157 1-84 (101) 45 PRK08762 molybdopterin biosynt 95.0 0.043 1.1E-06 33.6 4.5 79 57-158 5-85 (379) 46 cd01526 RHOD_ThiF Member of th 95.0 0.034 8.7E-07 34.2 4.0 87 57-157 9-98 (122) 47 PRK00162 glpE thiosulfate sulf 94.8 0.12 3.2E-06 30.6 6.6 78 58-157 7-84 (104) 48 PRK08057 cobalt-precorrin-6x r 94.6 0.36 9.1E-06 27.8 8.4 85 53-139 43-134 (241) 49 cd01523 RHOD_Lact_B Member of 94.5 0.17 4.4E-06 29.7 6.7 84 58-157 1-87 (100) 50 pfam02571 CbiJ Precorrin-6x re 93.8 0.42 1.1E-05 27.3 7.4 82 53-134 45-134 (246) 51 KOG1089 consensus 93.7 0.17 4.4E-06 29.7 5.3 66 124-189 340-428 (573) 52 cd01449 TST_Repeat_2 Thiosulfa 93.4 0.25 6.3E-06 28.8 5.7 95 59-157 2-104 (118) 53 cd01519 RHOD_HSP67B2 Member of 93.3 0.071 1.8E-06 32.2 2.8 86 59-157 2-92 (106) 54 cd01534 4RHOD_Repeat_3 Member 93.3 0.22 5.5E-06 29.1 5.3 79 59-157 2-82 (95) 55 TIGR02090 LEU1_arch isopropylm 92.9 0.4 1E-05 27.4 6.2 121 65-191 121-269 (371) 56 COG5016 Pyruvate/oxaloacetate 92.6 0.53 1.3E-05 26.7 6.4 79 91-173 162-243 (472) 57 PRK07878 molybdopterin biosynt 92.0 1 2.7E-05 24.8 7.9 81 57-157 288-369 (392) 58 cd01524 RHOD_Pyr_redox Member 91.9 0.98 2.5E-05 25.0 7.1 74 61-157 4-77 (90) 59 pfam00581 Rhodanese Rhodanese- 91.6 0.77 2E-05 25.6 6.3 78 59-148 1-79 (106) 60 PRK05597 molybdopterin biosynt 90.5 0.29 7.5E-06 28.3 3.3 29 126-157 312-340 (355) 61 PRK07411 hypothetical protein; 90.2 1.5 3.8E-05 23.8 6.7 84 59-158 285-369 (390) 62 KOG1530 consensus 90.1 0.26 6.8E-06 28.6 2.8 77 60-148 27-108 (136) 63 cd01521 RHOD_PspE2 Member of t 89.1 1.8 4.5E-05 23.4 6.4 81 58-157 10-92 (110) 64 TIGR01108 oadA oxaloacetate de 89.0 2 5E-05 23.1 6.6 80 91-174 156-239 (616) 65 cd01532 4RHOD_Repeat_1 Member 88.9 0.9 2.3E-05 25.2 4.8 70 71-157 9-78 (92) 66 cd01529 4RHOD_Repeats Member o 88.8 1.3 3.2E-05 24.3 5.5 29 126-157 54-82 (96) 67 PRK05600 thiamine biosynthesis 88.3 2.2 5.5E-05 22.8 6.4 85 57-157 272-358 (370) 68 pfam04343 DUF488 Protein of un 87.3 1.7 4.5E-05 23.4 5.5 47 60-106 2-52 (123) 69 cd00158 RHOD Rhodanese Homolog 86.6 1.3 3.2E-05 24.3 4.4 72 65-156 4-75 (89) 70 COG2099 CobK Precorrin-6x redu 86.1 2.9 7.4E-05 22.0 7.1 80 52-133 47-133 (257) 71 pfam02126 PTE Phosphotriestera 85.0 3.3 8.3E-05 21.7 10.4 95 62-159 43-184 (308) 72 pfam04179 Init_tRNA_PT Initiat 83.7 3.7 9.4E-05 21.4 8.4 123 71-206 312-442 (445) 73 COG0607 PspE Rhodanese-related 83.5 2.1 5.3E-05 22.9 4.3 75 65-159 13-89 (110) 74 TIGR02493 PFLA pyruvate format 82.9 3.5 8.9E-05 21.5 5.3 91 42-133 60-168 (243) 75 TIGR01975 isoAsp_dipep beta-as 82.7 3 7.7E-05 21.9 4.9 21 54-74 164-184 (391) 76 TIGR00010 TIGR00010 hydrolase, 82.3 0.9 2.3E-05 25.2 2.1 44 98-141 101-144 (269) 77 TIGR03217 4OH_2_O_val_ald 4-hy 81.6 4.5 0.00011 20.8 9.0 89 62-154 118-211 (333) 78 pfam05925 IpgD Enterobacterial 79.1 0.59 1.5E-05 26.4 0.3 17 132-148 467-483 (569) 79 cd00530 PTE Phosphotriesterase 78.4 5.6 0.00014 20.2 9.7 96 61-159 36-178 (293) 80 PRK10076 pyruvate formate lyas 78.2 5.7 0.00015 20.2 6.6 91 42-135 32-134 (213) 81 PRK08195 4-hydroxy-2-ketovaler 77.8 5.9 0.00015 20.1 9.7 88 62-153 119-211 (337) 82 PRK12315 1-deoxy-D-xylulose-5- 76.5 6.4 0.00016 19.9 5.1 44 94-141 203-246 (581) 83 KOG0566 consensus 75.3 6.9 0.00018 19.7 6.3 34 51-84 286-323 (1080) 84 COG2089 SpsE Sialic acid synth 72.0 8.3 0.00021 19.2 5.3 31 109-139 154-186 (347) 85 COG0787 Alr Alanine racemase [ 71.9 8.3 0.00021 19.2 6.2 109 67-188 69-183 (360) 86 PRK05444 1-deoxy-D-xylulose-5- 69.4 9.1 0.00023 18.9 4.3 45 93-141 193-237 (576) 87 PRK12330 oxaloacetate decarbox 68.8 9.7 0.00025 18.7 7.6 77 92-172 162-243 (499) 88 pfam00682 HMGL-like HMGL-like. 66.9 11 0.00027 18.5 9.0 84 64-151 114-201 (237) 89 cd01310 TatD_DNAse TatD like p 66.3 11 0.00028 18.4 9.3 89 114-203 104-203 (251) 90 smart00450 RHOD Rhodanese Homo 65.2 3.2 8.2E-05 21.7 1.3 29 126-157 54-82 (100) 91 PRK09629 bifunctional thiosulf 63.0 13 0.00032 18.0 5.0 86 71-162 161-254 (610) 92 PRK12581 oxaloacetate decarbox 62.5 13 0.00032 18.0 8.8 78 90-171 168-248 (468) 93 PRK09875 putative hydrolase; P 61.5 13 0.00034 17.9 9.5 95 61-159 38-180 (292) 94 cd01531 Acr2p Eukaryotic arsen 61.1 13 0.00034 17.8 7.8 77 57-149 3-83 (113) 95 cd02003 TPP_IolD Thiamine pyro 60.8 14 0.00035 17.8 6.2 26 112-137 158-183 (205) 96 TIGR01245 trpD anthranilate ph 59.0 15 0.00037 17.6 5.6 25 58-82 153-177 (331) 97 KOG1468 consensus 59.0 2.3 5.9E-05 22.6 -0.3 96 53-154 85-184 (354) 98 KOG0348 consensus 57.7 14 0.00035 17.7 3.5 44 32-78 226-269 (708) 99 COG0161 BioA Adenosylmethionin 55.5 14 0.00035 17.8 3.1 76 111-191 232-310 (449) 100 COG3616 Predicted amino acid a 55.4 12 0.00032 18.1 2.9 35 112-146 119-155 (368) 101 COG0084 TatD Mg-dependent DNas 55.0 7.8 0.0002 19.3 1.8 53 115-167 112-170 (256) 102 PRK09506 mrcB penicillin-bindi 54.8 17 0.00044 17.2 6.3 63 20-83 68-142 (839) 103 KOG4428 consensus 54.5 8.1 0.00021 19.2 1.8 33 116-148 713-745 (813) 104 COG1154 Dxs Deoxyxylulose-5-ph 54.1 18 0.00045 17.1 4.6 46 92-141 237-282 (627) 105 PRK12331 oxaloacetate decarbox 53.6 18 0.00046 17.0 9.2 103 65-171 130-239 (463) 106 PRK07259 dihydroorotate dehydr 52.1 19 0.00048 16.9 8.0 97 42-139 56-168 (301) 107 PRK05692 hydroxymethylglutaryl 51.5 19 0.0005 16.8 8.8 73 64-137 126-208 (287) 108 TIGR03121 one_C_dehyd_A formyl 50.3 16 0.00042 17.3 2.9 37 101-137 199-237 (556) 109 PRK11493 sseA 3-mercaptopyruva 49.9 21 0.00052 16.7 4.7 80 71-157 167-257 (281) 110 cd01448 TST_Repeat_1 Thiosulfa 48.7 22 0.00055 16.5 8.4 93 59-157 3-106 (122) 111 cd01017 AdcA Metal binding pro 47.9 22 0.00056 16.5 5.7 54 87-141 178-232 (282) 112 PRK01060 endonuclease IV; Prov 47.5 22 0.00057 16.4 7.4 91 52-142 38-153 (281) 113 cd01525 RHOD_Kc Member of the 47.2 23 0.00058 16.4 5.7 87 59-157 2-91 (105) 114 TIGR02418 acolac_catab acetola 47.1 23 0.00058 16.4 6.4 22 112-133 516-539 (553) 115 PRK05802 hypothetical protein; 45.9 24 0.0006 16.3 7.8 77 58-141 187-265 (328) 116 smart00822 PKS_KR This enzymat 45.3 24 0.00062 16.2 7.5 79 62-144 15-96 (180) 117 PRK10425 DNase TatD; Provision 45.0 24 0.00062 16.2 9.9 54 114-167 104-167 (258) 118 cd01520 RHOD_YbbB Member of th 44.8 25 0.00063 16.2 5.5 88 59-159 2-115 (128) 119 cd02004 TPP_BZL_OCoD_HPCL Thia 44.6 25 0.00063 16.1 7.1 36 72-107 66-101 (172) 120 PRK11858 aksA trans-homoaconit 43.5 26 0.00066 16.0 8.8 85 65-153 123-210 (378) 121 COG0182 Predicted translation 42.4 21 0.00055 16.6 2.5 35 121-156 144-182 (346) 122 TIGR02249 integrase_gron integ 42.4 15 0.00039 17.5 1.7 71 127-207 150-225 (320) 123 PRK09522 bifunctional anthrani 42.2 27 0.00069 15.9 7.1 25 57-81 354-378 (531) 124 cd04740 DHOD_1B_like Dihydroor 42.1 27 0.00069 15.9 7.8 75 66-141 84-167 (296) 125 COG3830 ACT domain-containing 41.5 13 0.00032 18.0 1.2 19 137-155 9-27 (90) 126 cd01304 FMDH_A Formylmethanofu 41.2 19 0.00048 16.9 2.0 42 101-143 195-238 (541) 127 cd02010 TPP_ALS Thiamine pyrop 40.4 29 0.00073 15.7 7.0 34 73-106 67-100 (177) 128 TIGR02277 PaaX_trns_reg phenyl 38.7 31 0.00078 15.6 3.5 44 34-78 107-150 (287) 129 cd01019 ZnuA Zinc binding prot 37.6 32 0.00081 15.5 8.1 49 90-139 189-238 (286) 130 cd07021 Clp_protease_NfeD_like 37.5 32 0.00082 15.5 7.8 73 75-148 3-78 (178) 131 TIGR03167 tRNA_sel_U_synt tRNA 37.5 32 0.00082 15.5 6.0 118 53-173 82-248 (311) 132 TIGR00676 fadh2 5,10-methylene 37.4 32 0.00082 15.5 3.2 24 60-83 87-113 (302) 133 COG1001 AdeC Adenine deaminase 36.5 33 0.00085 15.4 7.3 135 54-189 87-267 (584) 134 PRK06036 translation initiatio 35.9 34 0.00086 15.3 4.6 30 122-152 143-175 (339) 135 smart00518 AP2Ec AP endonuclea 35.8 34 0.00087 15.3 8.4 91 52-142 36-147 (273) 136 TIGR00707 argD acetylornithine 35.4 35 0.00088 15.2 3.5 47 110-160 201-251 (402) 137 PRK08334 translation initiatio 34.6 33 0.00084 15.4 2.4 28 129-156 161-192 (356) 138 cd01443 Cdc25_Acr2p Cdc25 enzy 34.3 36 0.00092 15.1 7.0 78 57-150 3-88 (113) 139 cd05567 PTS_IIB_mannitol PTS_I 34.1 36 0.00092 15.1 3.4 27 129-155 1-27 (87) 140 TIGR02624 rhamnu_1P_ald rhamnu 33.8 37 0.00093 15.1 3.4 92 57-153 36-156 (273) 141 pfam08659 KR KR domain. This e 33.8 37 0.00093 15.1 8.3 80 61-144 14-96 (181) 142 PRK00142 hypothetical protein; 33.8 37 0.00093 15.1 6.3 41 127-183 170-210 (315) 143 PRK05875 short chain dehydroge 33.1 38 0.00096 15.0 5.7 80 61-143 21-102 (277) 144 pfam01026 TatD_DNase TatD rela 32.7 38 0.00097 15.0 5.7 25 114-138 107-134 (255) 145 smart00400 ZnF_CHCC zinc finge 32.6 36 0.00093 15.1 2.4 34 130-165 21-54 (55) 146 pfam02263 GBP Guanylate-bindin 32.1 28 0.00071 15.8 1.7 10 110-119 115-124 (264) 147 TIGR01829 AcAcCoA_reduct aceto 32.0 39 0.001 14.9 7.4 86 64-158 18-105 (244) 148 PRK08292 AMP nucleosidase; Pro 32.0 39 0.001 14.9 3.1 40 96-139 273-312 (487) 149 cd01535 4RHOD_Repeat_4 Member 31.8 39 0.001 14.9 5.1 66 64-148 3-68 (145) 150 pfam12091 DUF3567 Protein of u 31.6 38 0.00097 15.0 2.3 60 67-134 24-85 (85) 151 PRK05451 dihydroorotase; Provi 31.3 40 0.001 14.8 2.9 31 107-137 110-140 (345) 152 PRK09282 pyruvate carboxylase 30.7 41 0.001 14.8 6.8 78 90-171 160-240 (580) 153 pfam04909 Amidohydro_2 Amidohy 30.7 41 0.0011 14.8 5.4 76 63-138 89-173 (270) 154 PRK11449 putative deoxyribonuc 30.5 41 0.0011 14.8 8.8 25 118-142 117-141 (258) 155 COG1038 PycA Pyruvate carboxyl 30.5 42 0.0011 14.7 6.8 101 87-191 696-817 (1149) 156 cd06290 PBP1_LacI_like_9 Ligan 30.4 42 0.0011 14.7 6.1 95 61-158 104-202 (265) 157 PRK05720 mtnA methylthioribose 29.3 43 0.0011 14.6 3.3 18 52-69 88-105 (351) 158 PRK10853 hypothetical protein; 28.6 28 0.00072 15.8 1.3 29 53-81 32-60 (118) 159 PRK09389 (R)-citramalate synth 28.4 45 0.0011 14.5 6.4 83 65-151 120-205 (487) 160 pfam00591 Glycos_transf_3 Glyc 28.2 45 0.0012 14.5 5.5 72 57-144 85-158 (254) 161 PRK09432 metF 5,10-methylenete 28.0 46 0.0012 14.5 3.4 18 68-85 107-124 (296) 162 KOG1200 consensus 27.9 46 0.0012 14.5 6.6 92 59-163 26-121 (256) 163 cd06294 PBP1_ycjW_transcriptio 27.8 46 0.0012 14.5 8.6 92 65-159 115-210 (270) 164 pfam04396 consensus 27.6 47 0.0012 14.4 6.4 71 60-130 24-95 (149) 165 PRK07049 methionine gamma-lyas 27.5 47 0.0012 14.4 5.4 34 40-76 119-152 (427) 166 PRK10499 N,N'-diacetylchitobio 27.4 47 0.0012 14.4 2.4 19 128-147 3-21 (106) 167 pfam00282 Pyridoxal_deC Pyrido 27.4 47 0.0012 14.4 5.5 58 88-145 151-209 (370) 168 PRK11784 tRNA 2-selenouridine 27.3 47 0.0012 14.4 7.2 29 129-159 90-118 (333) 169 PRK08742 adenosylmethionine--8 26.8 24 0.00061 16.2 0.6 72 111-187 254-328 (472) 170 cd06288 PBP1_sucrose_transcrip 26.5 49 0.0012 14.3 7.3 88 61-148 105-196 (269) 171 PRK01415 hypothetical protein; 26.3 49 0.0013 14.3 7.9 96 54-183 110-210 (247) 172 cd03035 ArsC_Yffb Arsenate Red 25.7 48 0.0012 14.4 2.0 30 52-81 30-59 (105) 173 TIGR03586 PseI pseudaminic aci 25.5 51 0.0013 14.2 7.6 70 63-139 82-173 (327) 174 PRK12999 pyruvate carboxylase; 25.4 51 0.0013 14.2 5.6 80 88-171 695-777 (1147) 175 cd00568 TPP_enzymes Thiamine p 25.2 51 0.0013 14.2 6.6 23 114-136 145-167 (168) 176 TIGR01511 ATPase-IB1_Cu copper 25.1 52 0.0013 14.2 5.9 60 53-123 366-428 (545) 177 KOG1430 consensus 24.8 52 0.0013 14.1 2.2 29 128-157 242-270 (361) 178 PRK01269 thiamine biosynthesis 24.8 52 0.0013 14.1 4.8 11 146-156 192-202 (483) 179 TIGR01717 AMP-nucleosdse AMP n 24.6 53 0.0013 14.1 2.2 55 92-152 265-320 (486) 180 PRK08605 D-lactate dehydrogena 24.5 53 0.0014 14.1 6.3 56 49-108 47-102 (332) 181 cd02014 TPP_POX Thiamine pyrop 24.0 54 0.0014 14.0 6.3 37 72-108 69-105 (178) 182 PRK07454 short chain dehydroge 23.9 54 0.0014 14.0 7.0 78 60-144 19-98 (241) 183 PRK11194 hypothetical protein; 23.8 55 0.0014 14.0 7.3 57 109-166 307-363 (372) 184 cd02001 TPP_ComE_PpyrDC Thiami 23.4 56 0.0014 13.9 6.0 24 113-136 130-153 (157) 185 COG3088 CcmH Uncharacterized p 23.2 56 0.0014 13.9 2.6 29 144-172 66-94 (153) 186 cd01294 DHOase Dihydroorotase 23.1 56 0.0014 13.9 3.7 30 108-137 106-135 (335) 187 cd06818 PLPDE_III_cryptic_DSD 22.8 57 0.0015 13.9 6.7 26 113-138 142-169 (382) 188 COG1099 Predicted metal-depend 22.7 57 0.0015 13.9 9.3 101 59-161 13-160 (254) 189 pfam11385 DUF3189 Protein of u 22.6 58 0.0015 13.8 2.7 13 114-126 77-89 (148) 190 COG2897 SseA Rhodanese-related 22.4 58 0.0015 13.8 5.1 84 60-146 160-251 (285) 191 COG1229 FwdA Formylmethanofura 22.2 59 0.0015 13.8 3.0 44 99-143 200-245 (575) 192 PRK13132 consensus 22.0 59 0.0015 13.8 5.8 76 59-145 101-176 (246) 193 KOG1540 consensus 21.9 59 0.0015 13.8 1.9 16 48-64 129-144 (296) 194 pfam01487 DHquinase_I Type I 3 21.6 60 0.0015 13.7 10.4 23 61-83 43-65 (222) 195 cd07037 TPP_PYR_MenD Pyrimidin 21.6 60 0.0015 13.7 8.9 85 64-158 4-89 (162) 196 cd01545 PBP1_SalR Ligand-bindi 21.6 60 0.0015 13.7 8.3 87 63-149 109-199 (270) 197 pfam04582 Reo_sigmaC Reovirus 21.5 39 0.001 14.9 0.9 47 114-166 194-247 (326) 198 COG0160 GabT 4-aminobutyrate a 21.4 61 0.0016 13.7 3.1 48 109-159 237-285 (447) 199 cd06819 PLPDE_III_LS_D-TA Type 21.4 61 0.0016 13.7 6.0 14 131-144 128-141 (358) 200 cd02012 TPP_TK Thiamine pyroph 21.3 61 0.0016 13.7 4.8 28 114-141 202-230 (255) 201 PRK13479 2-aminoethylphosphona 21.2 62 0.0016 13.7 5.4 48 90-137 95-142 (368) 202 COG1054 Predicted sulfurtransf 21.2 62 0.0016 13.7 6.7 58 110-183 149-211 (308) 203 cd03371 TPP_PpyrDC Thiamine py 20.6 63 0.0016 13.6 5.7 28 112-139 137-166 (188) 204 pfam00106 adh_short short chai 20.5 64 0.0016 13.6 9.2 76 61-139 14-91 (167) 205 cd06821 PLPDE_III_D-TA Type II 20.5 64 0.0016 13.6 7.1 34 113-146 111-146 (361) 206 COG5495 Uncharacterized conser 20.4 64 0.0016 13.6 3.0 25 125-150 88-112 (289) 207 pfam09265 Cytokin-bind Cytokin 20.4 64 0.0016 13.6 2.4 24 122-145 162-190 (280) 208 cd06271 PBP1_AglR_RafR_like Li 20.3 64 0.0016 13.6 8.3 93 64-159 112-208 (268) 209 TIGR00418 thrS threonyl-tRNA s 20.1 50 0.0013 14.2 1.2 31 125-157 452-482 (595) 210 pfam03918 CcmH Cytochrome C bi 20.1 65 0.0017 13.5 2.8 57 113-172 24-87 (145) No 1 >pfam03162 Y_phosphatase2 Tyrosine phosphatase family. This family is closely related to the pfam00102 and pfam00782 families. Probab=99.96 E-value=1.1e-28 Score=200.93 Aligned_cols=134 Identities=20% Similarity=0.370 Sum_probs=112.1 Q ss_pred HCCCCCCCCCCEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCC----CCHHH Q ss_conf 07872502894771389998999999997499389973787876544457999996698499875268889----89899 Q gi|254780898|r 41 TQNFHAVVPHEIYRSAQPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRE----LNDEQ 116 (207) Q Consensus 41 ~~Nf~~V~~g~lyRS~qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~----~~~~~ 116 (207) -.||+.|.+| +||||||++.++ ++++++||||||+|+.+.+ .+...+.+++.||+++++++.+... .+++. T Consensus 4 P~NF~~Ve~g-lYRSa~P~~~nf-~fL~~L~LKTII~L~~e~~---~~~~~~f~~~~~I~~~~i~~~~~~~~~~~i~~~~ 78 (150) T pfam03162 4 PLNFSPVESG-LYRSSYPRANNF-SFLRSLRLKTIISLSPEPY---PQDNLQFLESEHIKLYHIHMEGNKDPFVNIPSHL 78 (150) T ss_pred CCCCCCCCCC-CCCCCCCCHHHH-HHHHHCCCCEEEEECCCCC---CHHHHHHHHHCCCEEEEEEECCCCCCCCCCCHHH T ss_conf 7135666799-420679883324-8999779978999489899---8678989987794699986358888777779999 Q ss_pred HHHHHHHH-HCCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHH Q ss_conf 99999999-427994999828998889999999999983999899999610002642674245999999 Q gi|254780898|r 117 IKQLISIL-KTAPKPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQLSMLYGHFPVLKTITMDITF 184 (207) Q Consensus 117 i~~~~~il-~~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el~~~ygh~~~~~~~~ld~~~ 184 (207) +.+.++++ +..++||||||.+|+||||+++||||. +|||+...+..|+ ..|.+++...+|..| T Consensus 79 v~~aL~iild~~n~PvLIHC~~G~~RTG~vvgclRk-lQ~Ws~~~i~~Ey----~~fa~~k~r~~~~~f 142 (150) T pfam03162 79 LRRALKLLLNKDNYPVLIHCNRGKHRTGLVIGCLRK-LQKWSLASILNEY----RRFSGSKARIVDEEF 142 (150) T ss_pred HHHHHHHHHCCCCCCEEEEECCCCCEEHHHHHHHHH-HCCCCHHHHHHHH----HHHCCCCCCHHHHHH T ss_conf 999999985857898899979799133018999999-8377579999999----997389873889999 No 2 >KOG1720 consensus Probab=99.87 E-value=1e-21 Score=157.01 Aligned_cols=168 Identities=20% Similarity=0.221 Sum_probs=139.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEECCCC--------------CHHHHHHHHHHCCCCEEEEC Q ss_conf 9999999999853210245578788876078725028947713899--------------98999999997499389973 Q gi|254780898|r 13 IFYIKILLGVLVLCAVSLGLYFLTITTFTQNFHAVVPHEIYRSAQP--------------NGTFIEYLKKEYGIKSILNL 78 (207) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Nf~~V~~g~lyRS~qP--------------~~~~~~~l~k~~GIktIInL 78 (207) .-++.+-.|.+.+--+.+..|..+.++.+.+|.||.||++.+-+-| .+..+..+++.+.+++||.| T Consensus 22 ~~~r~~~~g~l~~~~~~~~~ye~ye~ve~gdfnwI~p~~~i~f~~p~~~s~gi~~~f~~~~~~~~~~~~~~~~v~s~vrl 101 (225) T KOG1720 22 GVWKALQSGWLDFSSFNVDEYEHYEAVENGDFNWIIPDRFIAFAGPHLKSRGIESGFPLHLPQPYIQYFKNNNVTSIVRL 101 (225) T ss_pred HHHHHHHCCCCCHHEECCHHHEEEECCCCCCCCEECCCHHHHHCCCCCCCCCHHHCCCCCCCHHHHHHHHHCCCCEEEEC T ss_conf 77888760641012240030301213478975224263466532766555451222535687038888641453157874 Q ss_pred CCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCC--CCEEEECCCCCCHHHHHHHHHHHHHCCC Q ss_conf 78787654445799999669849987526888989899999999994279--9499982899888999999999998399 Q gi|254780898|r 79 RGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAP--KPLLIHCKSGADRTGLASAVYLYIVAHY 156 (207) Q Consensus 79 r~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~--~PVlVHC~aG~~RTG~~~alyli~~~g~ 156 (207) .+.. .+.+...+.||.|+++++.|..+|++..+..|+++.++++ +.|.|||++|.||||+++|||+|...|+ T Consensus 102 n~~~------yd~~~f~~~Gi~h~~l~f~Dg~tP~~~~v~~fv~i~e~~~~~g~iaVHCkaGlGRTG~liAc~lmy~~g~ 175 (225) T KOG1720 102 NKRL------YDAKRFTDAGIDHHDLFFADGSTPTDAIVKEFVKIVENAEKGGKIAVHCKAGLGRTGTLIACYLMYEYGM 175 (225) T ss_pred CCCC------CCHHHHCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCEEEEEECCCCCCHHHHHHHHHHHHHCC T ss_conf 7778------8867713367346100047899998899999999999887368189982368884108999999999678 Q ss_pred CHHHHHHHHHH-CCCCCCCCCHHHHHHHHHH Q ss_conf 98999996100-0264267424599999999 Q gi|254780898|r 157 PKEEAHRQLSM-LYGHFPVLKTITMDITFEK 186 (207) Q Consensus 157 ~~~eA~~el~~-~ygh~~~~~~~~ld~~~~~ 186 (207) +++||+.+|+. |-|-+-.+++..+-+-|.. T Consensus 176 ta~eaI~~lR~~RpG~V~gpqQ~~l~~~q~~ 206 (225) T KOG1720 176 TAGEAIAWLRICRPGAVIGPQQHKLLHKQRD 206 (225) T ss_pred CHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH T ss_conf 9999999999618866447789999999999 No 3 >pfam04273 DUF442 Putative phosphatase (DUF442). Although this domain is uncharacterized it seems likely that it performs a phosphatase function. Probab=99.85 E-value=3.6e-20 Score=147.30 Aligned_cols=104 Identities=22% Similarity=0.323 Sum_probs=92.5 Q ss_pred CCCCCCCCCEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCC---CCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHH Q ss_conf 872502894771389998999999997499389973787876---54445799999669849987526888989899999 Q gi|254780898|r 43 NFHAVVPHEIYRSAQPNGTFIEYLKKEYGIKSILNLRGKLPE---SWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQ 119 (207) Q Consensus 43 Nf~~V~~g~lyRS~qP~~~~~~~l~k~~GIktIInLr~~~~~---~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~ 119 (207) ||++|.+ .+|.|+||+++++..+ ++.|+|||||+|+++|. +....+.+.++++||.++++|+.. ..++++.+.. T Consensus 1 d~~~i~~-~~~vs~Qi~~~di~~l-a~~GfktIInnRPd~E~~~qp~~~~~~~~a~~~Gl~y~~iPv~~-~~~t~~~v~~ 77 (110) T pfam04273 1 DYREINE-DLSVSPQIQPDDIAAA-ARAGFRSVINNRPDGEEPGQPSNAAEQAAARAAGLAYRFIPVIS-GQITEADVEA 77 (110) T ss_pred CCEECCC-CEEECCCCCHHHHHHH-HHCCCCEEEECCCCCCCCCCCCHHHHHHHHHHCCCEEEEEECCC-CCCCHHHHHH T ss_conf 9536478-8757599899999999-98598388533888777899888999999998399799964477-8989999999 Q ss_pred HHHHHHCCCCCEEEECCCCCCHHHHHHHHHH Q ss_conf 9999942799499982899888999999999 Q gi|254780898|r 120 LISILKTAPKPLLIHCKSGADRTGLASAVYL 150 (207) Q Consensus 120 ~~~il~~~~~PVlVHC~aG~~RTG~~~alyl 150 (207) |.+++++.++|||+||++|. |+..++++.. T Consensus 78 f~~~l~~~~~Pvl~~CrSG~-Rs~~lwala~ 107 (110) T pfam04273 78 FQRALAAAEGPVLAHCRSGT-RALNLYALSQ 107 (110) T ss_pred HHHHHHHCCCCEEEECCCCH-HHHHHHHHHH T ss_conf 99999858998999889987-7999999985 No 4 >KOG1572 consensus Probab=99.84 E-value=3e-20 Score=147.84 Aligned_cols=134 Identities=26% Similarity=0.401 Sum_probs=111.6 Q ss_pred HCCCCCCCCCCEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCC----CC---- Q ss_conf 0787250289477138999899999999749938997378787654445799999669849987526888----98---- Q gi|254780898|r 41 TQNFHAVVPHEIYRSAQPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATR----EL---- 112 (207) Q Consensus 41 ~~Nf~~V~~g~lyRS~qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~----~~---- 112 (207) --||+.|.++ |||||+|.+..+ .+++.+++|+||.|..+. +.+....+++..+|+++++.+.+.. .| T Consensus 57 PlnFs~V~~~-lyRSg~P~~~Nf-sFL~~L~LksIisL~pE~---yp~~nl~f~~~~~Ik~~~i~ie~~k~~~k~P~~~~ 131 (249) T KOG1572 57 PLNFSMVDNG-LYRSGFPRPENF-SFLKTLHLKSIISLCPEP---YPEENLNFLESNGIKLYQIGIEGEKDNKKEPFVNI 131 (249) T ss_pred CCCCCCCCCC-EEECCCCCCCCH-HHHHHHHHHEEEEECCCC---CCHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCC T ss_conf 7654523432-000579872034-799984441377846888---87388889986694599986365434555899887 Q ss_pred CHHHHHHHHH-HHHCCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHH Q ss_conf 9899999999-99427994999828998889999999999983999899999610002642674245999999 Q gi|254780898|r 113 NDEQIKQLIS-ILKTAPKPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQLSMLYGHFPVLKTITMDITF 184 (207) Q Consensus 113 ~~~~i~~~~~-il~~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el~~~ygh~~~~~~~~ld~~~ 184 (207) ..+.+...++ +++..++|+|+||.-|++|||++++|.| .+++|+..-..+| |.-|..++...+|.-| T Consensus 132 ~~~~i~~~l~~lld~~N~P~Lihc~rGkhRtg~lVgclR-klq~W~lssil~E----y~~fa~sk~r~~d~~F 199 (249) T KOG1572 132 PDHSIRKALKVLLDKRNYPILIHCKRGKHRTGCLVGCLR-KLQNWSLSSILDE----YLRFAGSKGRRVDLRF 199 (249) T ss_pred HHHHHHHHHHHHHCCCCCCEEEECCCCCCCHHHHHHHHH-HHHCCCHHHHHHH----HHHHCCCHHHHHHHHH T ss_conf 479999999988312678657764888710131699999-9855615677889----9875052257789999 No 5 >COG2365 Protein tyrosine/serine phosphatase [Signal transduction mechanisms] Probab=99.78 E-value=5.6e-19 Score=139.80 Aligned_cols=132 Identities=30% Similarity=0.398 Sum_probs=104.4 Q ss_pred HCCCCCCCCCCEEECCCCCHHHHH-HHHHHCCCCEEEECCCCC--CCCCCHHHHHH-HHHCCCEEEEEECCCCCCCCHHH Q ss_conf 078725028947713899989999-999974993899737878--76544457999-99669849987526888989899 Q gi|254780898|r 41 TQNFHAVVPHEIYRSAQPNGTFIE-YLKKEYGIKSILNLRGKL--PESWHKEEEKA-ANDLGIQLINFPLSATRELNDEQ 116 (207) Q Consensus 41 ~~Nf~~V~~g~lyRS~qP~~~~~~-~l~k~~GIktIInLr~~~--~~~~~~~e~~~-~~~~gi~~i~ip~~~~~~~~~~~ 116 (207) ..||+.|.+...|||+||.+.+.. .+.+.++++|+|+|+.+. ...++..+... +...++....++.........+. T Consensus 44 ~~~~~~i~~~~~~Rs~~p~~~~~~~~~~~~~~l~~~i~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 123 (249) T COG2365 44 PLNFLGIIPIIDYRSGQPVPVQPDPELLDALYLKTIINLRDESNTNVELYTDHLINWDKAAIIMFESYRSFPTREDAAER 123 (249) T ss_pred CCCCCCCCCEEEECCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHCCCCCCCHHHHH T ss_conf 23334304302304699544676811235743122003466655001224555654233100214540456653137999 Q ss_pred HHHHHHHHHCCC-CCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHH-CCCCC Q ss_conf 999999994279-949998289988899999999999839998999996100-02642 Q gi|254780898|r 117 IKQLISILKTAP-KPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQLSM-LYGHF 172 (207) Q Consensus 117 i~~~~~il~~~~-~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el~~-~ygh~ 172 (207) +..++.++.+.+ +|||+||.+|+||||+++||||...++++...+..++.. +|++. T Consensus 124 ~~~~~~l~~~~e~~PvL~HC~~GkdRTGl~~al~r~~~~~~~~~v~~dyl~~~~~~~~ 181 (249) T COG2365 124 LVELLQLLADAENGPVLIHCTAGKDRTGLVAALYRKLVGGSDETVAADYLLTNRYGEP 181 (249) T ss_pred HHHHHHHHHCCCCCCEEEECCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCH T ss_conf 9999999862246887773378861376899999999647564799998652667541 No 6 >COG3453 Uncharacterized protein conserved in bacteria [Function unknown] Probab=99.71 E-value=4.5e-16 Score=121.52 Aligned_cols=116 Identities=19% Similarity=0.251 Sum_probs=100.3 Q ss_pred CCCCCCCCEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCC---CCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHH Q ss_conf 72502894771389998999999997499389973787876---544457999996698499875268889898999999 Q gi|254780898|r 44 FHAVVPHEIYRSAQPNGTFIEYLKKEYGIKSILNLRGKLPE---SWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQL 120 (207) Q Consensus 44 f~~V~~g~lyRS~qP~~~~~~~l~k~~GIktIInLr~~~~~---~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~ 120 (207) +..| .+.++.|+||+++++..+ +.+|+|||||.|++.|+ |.+..+.+.++++|+.|.++|+.. ..++++.|+.| T Consensus 3 i~~I-~d~lsVsgQi~~~D~~~i-aa~GFksiI~nRPDgEe~~QP~~~~i~~aa~~aGl~y~~iPV~~-~~iT~~dV~~f 79 (130) T COG3453 3 IRRI-NDRLSVSGQISPADIASI-AALGFKSIICNRPDGEEPGQPGFAAIAAAAEAAGLTYTHIPVTG-GGITEADVEAF 79 (130) T ss_pred CEEC-CCCEEECCCCCHHHHHHH-HHHCCCEECCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCC-CCCCHHHHHHH T ss_conf 1661-543423698988899999-97042210016998778899974999999996698258763479-98799999999 Q ss_pred HHHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 99994279949998289988899999999999839998999996 Q gi|254780898|r 121 ISILKTAPKPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQ 164 (207) Q Consensus 121 ~~il~~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~e 164 (207) .+.+++.++|||.||++|. |+-.+.++-. ...|++.+|+... T Consensus 80 ~~Al~eaegPVlayCrsGt-Rs~~ly~~~~-~~~gm~~de~~a~ 121 (130) T COG3453 80 QRALDEAEGPVLAYCRSGT-RSLNLYGLGE-LDGGMSRDEIEAL 121 (130) T ss_pred HHHHHHCCCCEEEEECCCC-HHHHHHHHHH-HHCCCCHHHHHHH T ss_conf 9999970898786545770-6889999999-8669988999998 No 7 >cd00127 DSPc Dual specificity phosphatases (DSP); Ser/Thr and Tyr protein phosphatases. Structurally similar to tyrosine-specific phosphatases but with a shallower active site cleft and a distinctive active site signature motif, HCxxGxxR. Characterized as VHR- or Cdc25-like. Probab=99.69 E-value=1.2e-15 Score=118.85 Aligned_cols=130 Identities=17% Similarity=0.250 Sum_probs=97.9 Q ss_pred CCCCCCCCEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCC-HHHHHHHHH Q ss_conf 7250289477138999899999999749938997378787654445799999669849987526888989-899999999 Q gi|254780898|r 44 FHAVVPHEIYRSAQPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELN-DEQIKQLIS 122 (207) Q Consensus 44 f~~V~~g~lyRS~qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~-~~~i~~~~~ 122 (207) ..+|.|| ||-|+.+.+.+.. +++++||+.|||+..+.+... ....+++|..+|+.|....+ ...+....+ T Consensus 2 ~s~I~~~-LylG~~~~a~~~~-~L~~~gI~~Iin~~~~~~~~~-------~~~~~~~y~~i~~~D~~~~~i~~~~~~~~~ 72 (139) T cd00127 2 LSEITPG-LYLGSYPAASDKE-LLKKLGITHVLNVAKEVPNEN-------LFLSDFNYLYVPILDLPSQDISKYFDEAVD 72 (139) T ss_pred CCEEECC-EEECCHHHHCCHH-HHHHCCCEEEEEECCCCCCCC-------CCCCCCEEEEEECCCCCCCCHHHHHHHHHH T ss_conf 8867799-7998932758999-998789809998046777543-------368898799998976999857999999999 Q ss_pred HHH---CCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCHHHHHHH Q ss_conf 994---2799499982899888999999999998399989999961000264267424599999 Q gi|254780898|r 123 ILK---TAPKPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQLSMLYGHFPVLKTITMDIT 183 (207) Q Consensus 123 il~---~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el~~~ygh~~~~~~~~ld~~ 183 (207) .++ +.+++|||||.+|.+|+++++++|+|...||+.++|.+.+.-.+..+. +..+++... T Consensus 73 fI~~~~~~~~~VLVHC~~G~sRS~~iv~aYLm~~~~~~~~~A~~~v~~~Rp~i~-pn~~f~~qL 135 (139) T cd00127 73 FIDDAREKGGKVLVHCLAGVSRSATLVIAYLMKTLGLSLREAYEFVKSRRPIIS-PNAGFMRQL 135 (139) T ss_pred HHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCC-CCHHHHHHH T ss_conf 999999728968998998774209999999999819999999999999799758-998999999 No 8 >smart00195 DSPc Dual specificity phosphatase, catalytic domain. Probab=99.63 E-value=1.1e-14 Score=112.87 Aligned_cols=126 Identities=17% Similarity=0.306 Sum_probs=95.5 Q ss_pred CCCCCCEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCH-HHHHHHHHHH Q ss_conf 502894771389998999999997499389973787876544457999996698499875268889898-9999999999 Q gi|254780898|r 46 AVVPHEIYRSAQPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELND-EQIKQLISIL 124 (207) Q Consensus 46 ~V~~g~lyRS~qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~-~~i~~~~~il 124 (207) .|.|| ||-|+++.+.+.. +++++||+.|||+..+.+. + ...++.+.++|+.|....+. +.+....+.+ T Consensus 3 eI~~~-lylG~~~~a~d~~-~L~~~~I~~Iln~~~~~~~--~-------~~~~~~~~~i~i~D~~~~~i~~~~~~~~~fI 71 (138) T smart00195 3 EILPH-LYLGSYSSALNLA-LLKKLGITHVINVTNEVPN--L-------NKKGFTYLGVPILDNTETKISPYFPEAVEFI 71 (138) T ss_pred EEECC-EEECCHHHHCCHH-HHHHCCCEEEEEECCCCCC--C-------CCCCCEEEEEECCCCCCCCHHHHHHHHHHHH T ss_conf 66898-6999976857999-9998798599993257677--8-------9898679999784189866899999999999 Q ss_pred H---CCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCHHHHHHH Q ss_conf 4---2799499982899888999999999998399989999961000264267424599999 Q gi|254780898|r 125 K---TAPKPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQLSMLYGHFPVLKTITMDIT 183 (207) Q Consensus 125 ~---~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el~~~ygh~~~~~~~~ld~~ 183 (207) + ..++.|||||.+|.+|+++++++|+|...||+.++|...+.-.+..+ .++.+++... T Consensus 72 ~~~~~~~~~VLVHC~~G~sRS~~vv~aYLm~~~~~~~~~A~~~v~~~Rp~i-~pn~~f~~QL 132 (138) T smart00195 72 EDAEKKGGKVLVHCQAGVSRSATLIIAYLMKYRNLSLNDAYDFVKDRRPII-SPNFGFLRQL 132 (138) T ss_pred HHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHCCCC-CCCHHHHHHH T ss_conf 999976997999999877554999999999983999999999999979977-9899999999 No 9 >TIGR01244 TIGR01244 conserved hypothetical protein TIGR01244; InterPro: IPR005939 This is a family of largely hypothetical proteins of unknown function. . Probab=99.62 E-value=1e-14 Score=112.94 Aligned_cols=117 Identities=24% Similarity=0.362 Sum_probs=101.5 Q ss_pred CCCCCCCCCEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCC---CCCHHHHHHHHHCCCE-EEEEECCCCCCCCHHHHH Q ss_conf 872502894771389998999999997499389973787876---5444579999966984-998752688898989999 Q gi|254780898|r 43 NFHAVVPHEIYRSAQPNGTFIEYLKKEYGIKSILNLRGKLPE---SWHKEEEKAANDLGIQ-LINFPLSATRELNDEQIK 118 (207) Q Consensus 43 Nf~~V~~g~lyRS~qP~~~~~~~l~k~~GIktIInLr~~~~~---~~~~~e~~~~~~~gi~-~i~ip~~~~~~~~~~~i~ 118 (207) .|..|.|. ||-|.|.++.|+++. .++|||||||.|++.|+ +....-...++.+|+. |+|+|+.. +.++++.|+ T Consensus 1 d~r~lte~-l~vsPQ~~~aD~a~a-A~lGf~tvInNRPD~Ee~~qp~~A~~~aaa~aAG~~Gy~H~Pv~~-g~~~pd~ve 77 (136) T TIGR01244 1 DLRKLTED-LAVSPQVTKADIARA-ARLGFKTVINNRPDREEEEQPDSAAIKAAAEAAGLTGYLHLPVTA-GDLTPDDVE 77 (136) T ss_pred CCCCCCCC-CEECCCCCHHHHHHH-HHCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCCEECCCCC-CCCCHHHHH T ss_conf 96422743-200478898889999-854636651068888888888728899999856888610054217-888762689 Q ss_pred HHHHHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 9999994279949998289988899999999999839998999996 Q gi|254780898|r 119 QLISILKTAPKPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQ 164 (207) Q Consensus 119 ~~~~il~~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~e 164 (207) .|...+..++.|||-.|++|- |+-..+++-+.. +|.+.||.+++ T Consensus 78 ~fraa~~aA~~PVLA~CRsGT-Rss~lW~~~qA~-~G~p~eeiv~~ 121 (136) T TIGR01244 78 TFRAALEAAESPVLAYCRSGT-RSSLLWALRQAA-EGVPVEEIVRR 121 (136) T ss_pred HHHHHHHHCCCCEEEECCCCH-HHHHHHHHHHHC-CCCCHHHHHHH T ss_conf 999999822898675314665-688999888631-79988999998 No 10 >PTZ00242 protein tyrosine phosphatase; Provisional Probab=99.62 E-value=1.1e-14 Score=112.81 Aligned_cols=114 Identities=13% Similarity=0.250 Sum_probs=93.8 Q ss_pred ECCCCCHHHHHH---HHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCC--- Q ss_conf 138999899999---99974993899737878765444579999966984998752688898989999999999427--- Q gi|254780898|r 54 RSAQPNGTFIEY---LKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTA--- 127 (207) Q Consensus 54 RS~qP~~~~~~~---l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~--- 127 (207) -...|+...+.. .++++|+++||.+..+ .++.+..++.||.+.+.+++|+.+|+.+.+..++++++.. T Consensus 24 I~d~Pt~~~l~~yi~elkk~~Vt~vVRvCe~------tYd~~~~e~~GI~v~D~~FdDGs~Pp~~IV~~wl~Ll~~~f~e 97 (172) T PTZ00242 24 ILDAPSPSNLPLYIKELQRRNVRHLVRVCGP------TYDAELLEKNGIEVHEWPFDDGAPPPRDVVDNWLKLLDQEKAK 97 (172) T ss_pred EECCCCCCHHHHHHHHHHHCCCEEEEEECCC------CCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHCC T ss_conf 8559980329999999997699099981776------6585688767956987677899989699999999999998511 Q ss_pred --CCCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHH-CCCCCC Q ss_conf --9949998289988899999999999839998999996100-026426 Q gi|254780898|r 128 --PKPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQLSM-LYGHFP 173 (207) Q Consensus 128 --~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el~~-~ygh~~ 173 (207) +..|.|||.+|.||||+++|+|+|...|++.++|+...+. |.|-+- T Consensus 98 ~p~~~~~~hc~ag~grap~lva~~Lie~~~m~~~dAi~~iR~~R~Gain 146 (172) T PTZ00242 98 LPPETIAVHCVAGLGRAPILVALALVEYGGMSPLDAIGFVRERRKGAIN 146 (172) T ss_pred CCCCEEEEECCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCC T ss_conf 8998699981247784168999999996499989999999973886502 No 11 >pfam00782 DSPc Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region. Probab=99.61 E-value=2.3e-14 Score=110.80 Aligned_cols=121 Identities=17% Similarity=0.286 Sum_probs=87.5 Q ss_pred EEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCC-HHHHHHHHHHHH---CC Q ss_conf 77138999899999999749938997378787654445799999669849987526888989-899999999994---27 Q gi|254780898|r 52 IYRSAQPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELN-DEQIKQLISILK---TA 127 (207) Q Consensus 52 lyRS~qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~-~~~i~~~~~il~---~~ 127 (207) ||-++.+...+ . .++++||+.|||+..+.+..+. ..++.|.++|+.|....+ ...+....+.++ +. T Consensus 1 lylG~~~~a~~-~-~L~~~~It~Iln~~~e~~~~~~--------~~~~~yl~i~i~D~~~~~i~~~~~~~~~fI~~~~~~ 70 (131) T pfam00782 1 LYLGSYPTASK-A-FLEKLGITHVINVTREVPNSNL--------NSGILYLGIPVEDNHETNISKYLPEAVEFIDDAIQK 70 (131) T ss_pred CEECCHHHCCH-H-HHHHCCCCEEEEEECCCCCCCC--------CCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHC T ss_conf 98499789299-9-9998899499997279887668--------899879998752099987799999999999999864 Q ss_pred CCCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCHHHHHHH Q ss_conf 99499982899888999999999998399989999961000264267424599999 Q gi|254780898|r 128 PKPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQLSMLYGHFPVLKTITMDIT 183 (207) Q Consensus 128 ~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el~~~ygh~~~~~~~~ld~~ 183 (207) +++|||||.+|.+|+++++++|+|..++|+.++|.+.+.-.... ..++.+++... T Consensus 71 g~~VLVHC~~G~sRS~~i~~aYLm~~~~~~~~~A~~~v~~~R~~-~~pn~~F~~qL 125 (131) T pfam00782 71 GGKVLVHCQAGISRSATLIIAYLMKTRNLSLNEAYSFVKERRPI-ISPNFGFKRQL 125 (131) T ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCC-CCCCHHHHHHH T ss_conf 99699999998740199999999998299999999999998998-89899999999 No 12 >KOG1719 consensus Probab=99.56 E-value=9.8e-14 Score=106.82 Aligned_cols=121 Identities=17% Similarity=0.253 Sum_probs=93.6 Q ss_pred CCCCCCCCCEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCC-CCCCHHHHHHHH Q ss_conf 8725028947713899989999999974993899737878765444579999966984998752688-898989999999 Q gi|254780898|r 43 NFHAVVPHEIYRSAQPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSAT-RELNDEQIKQLI 121 (207) Q Consensus 43 Nf~~V~~g~lyRS~qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~-~~~~~~~i~~~~ 121 (207) |...+. ..+.-|..|-...-.++++++|++.||++..+.|... .....+..||.+..+|+.|. +.|+.+.+.+.. T Consensus 24 ~wy~~~-~~v~~~~~~FrS~~~~~i~ke~v~gvv~~ne~yE~~a---~s~~wk~~giE~L~i~T~D~~~~Ps~~~i~~aV 99 (183) T KOG1719 24 RWYRID-EFVILGAMPFRSMDVPLIKKENVGGVVTLNEPYELLA---PSNLWKNYGIEFLVIPTRDYTGAPSLENIQKAV 99 (183) T ss_pred CEEEEC-CEEEEEECCCCCCCCHHHHHCCCCEEEEECCCHHHHH---HHHHHHHCCCEEEEECCCCCCCCCCHHHHHHHH T ss_conf 216303-4689961365444326888618872899578334332---367888606405874221015798899999999 Q ss_pred HHHHC---CCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 99942---79949998289988899999999999839998999996100 Q gi|254780898|r 122 SILKT---APKPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQLSM 167 (207) Q Consensus 122 ~il~~---~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el~~ 167 (207) +.+.. ..+-|+|||+||.+|+.++++||||.+.+|++++|..-+.- T Consensus 100 eFi~k~asLGktvYVHCKAGRtRSaTvV~cYLmq~~~wtpe~A~~~vr~ 148 (183) T KOG1719 100 EFIHKNASLGKTVYVHCKAGRTRSATVVACYLMQHKNWTPEAAVEHVRK 148 (183) T ss_pred HHHHHCCCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 9999613348769999647886412423435556448898999999983 No 13 >PRK12361 hypothetical protein; Provisional Probab=99.47 E-value=1.1e-12 Score=100.16 Aligned_cols=140 Identities=23% Similarity=0.323 Sum_probs=107.3 Q ss_pred HHHHHHHHHH--HCCCCCCCCCCEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCC-CCCHHHHHHHHHCCCEEEEEECC Q ss_conf 5578788876--07872502894771389998999999997499389973787876-54445799999669849987526 Q gi|254780898|r 31 GLYFLTITTF--TQNFHAVVPHEIYRSAQPNGTFIEYLKKEYGIKSILNLRGKLPE-SWHKEEEKAANDLGIQLINFPLS 107 (207) Q Consensus 31 ~~~~~~~~~~--~~Nf~~V~~g~lyRS~qP~~~~~~~l~k~~GIktIInLr~~~~~-~~~~~e~~~~~~~gi~~i~ip~~ 107 (207) ++|....|.- -+.+..|.+| ||-+.++.|.+++.+ +.+||..|+++..|-+. +|. +.+..+.|..+|+- T Consensus 79 ~l~~~~~r~~~~~p~~~~i~~~-l~lg~r~~~~~~~~l-~~~~~~avlD~T~Ef~~l~~~------~~~~~~~Yl~vP~l 150 (546) T PRK12361 79 QLYNAWARKRDSVPSIQKIDEN-LYLGCRLFPADLEKL-KSNKITAILDVTAEFDGLDWS------LYEEDIDYLNIPVL 150 (546) T ss_pred HHHHHHHHHCCCCCCHHHHCCC-EEEECCCCHHHHHHC-CCCCCCEEEEECCCCCCCCCC------CCCCCCCCCCCCHH T ss_conf 9999999740479834550566-477134685556540-204898699951436787530------12445430045421 Q ss_pred CCCCCCHHHHHHHHHHHHC---CCCCEEEECCCCCCHHHHHHHHHHHHHCC-CCHHHHHHHHHHCCCCCCCCCHHHHHHH Q ss_conf 8889898999999999942---79949998289988899999999999839-9989999961000264267424599999 Q gi|254780898|r 108 ATRELNDEQIKQLISILKT---APKPLLIHCKSGADRTGLASAVYLYIVAH-YPKEEAHRQLSMLYGHFPVLKTITMDIT 183 (207) Q Consensus 108 ~~~~~~~~~i~~~~~il~~---~~~PVlVHC~aG~~RTG~~~alyli~~~g-~~~~eA~~el~~~ygh~~~~~~~~ld~~ 183 (207) |...|+.+++.+.+.-+++ .+++|+|||.-|.+|+.+|+|.||..... .+.++++++.+- ..+|..+++. T Consensus 151 D~~~P~~~ql~~Av~w~~~~~~~~~~V~vhCAlG~gRSv~v~aAyll~~~~~~~~~~~~~~i~~------~r~ta~ln~~ 224 (546) T PRK12361 151 DHSVPTSAQLNQAINWLHRQVRAGKSVVVHCALGRGRSVLVLAAYLLCKDKQLSVEEVLQQIKQ------IRKTARLNKW 224 (546) T ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHH------HHHHCCCCHH T ss_conf 1679899999999999999996699689987888961599999999733888999999999999------9875247899 Q ss_pred H Q ss_conf 9 Q gi|254780898|r 184 F 184 (207) Q Consensus 184 ~ 184 (207) + T Consensus 225 q 225 (546) T PRK12361 225 Q 225 (546) T ss_pred H T ss_conf 9 No 14 >KOG1716 consensus Probab=99.39 E-value=1.7e-11 Score=92.78 Aligned_cols=147 Identities=17% Similarity=0.250 Sum_probs=109.9 Q ss_pred CCCCCCCCCCEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCC----HHHH Q ss_conf 787250289477138999899999999749938997378787654445799999669849987526888989----8999 Q gi|254780898|r 42 QNFHAVVPHEIYRSAQPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELN----DEQI 117 (207) Q Consensus 42 ~Nf~~V~~g~lyRS~qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~----~~~i 117 (207) .+...|.|+ +|-++++.+.+... ++++||..|+|+....+... .....++.|..+++.|....+ .+.. T Consensus 73 ~~~~~i~p~-l~lg~~~~~~~~~~-l~~~~it~vln~~~~~~~~~------~~~~~~~~y~~i~~~D~~~~~i~~~~~~~ 144 (285) T KOG1716 73 NPIVEILPN-LYLGSQGVASDPDL-LKKLGITHVLNVSSSCPNPR------FLKEQGIKYLRIPVEDNPSTDILQHFPEA 144 (285) T ss_pred CCCEEECCC-CEECCCCCCCCHHH-HHHCCCCEEEEEEECCCCCC------CCCCCCCCCCCCCCCCCCCCCHHHHHHHH T ss_conf 774245686-30167663305678-87549986998552478643------23445541133301058753779999999 Q ss_pred HHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCCCC Q ss_conf 99999994279949998289988899999999999839998999996100026426742459999999999852003672 Q gi|254780898|r 118 KQLISILKTAPKPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQLSMLYGHFPVLKTITMDITFEKITQLYPNNVSK 197 (207) Q Consensus 118 ~~~~~il~~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el~~~ygh~~~~~~~~ld~~~~~~~~~~~~~~~~ 197 (207) ..|++......+.|||||.+|..|+.+++.+|+|...||+.++|.+.+.-+..-+. ++.+++-...+.-..+..+.... T Consensus 145 ~~fI~~a~~~~~~vlVHC~~GvSRSat~viAYlM~~~~~~l~~A~~~vk~~R~~i~-PN~gf~~QL~~~e~~l~~~~~~~ 223 (285) T KOG1716 145 ISFIEKAREKGGKVLVHCQAGVSRSATLVIAYLMKYEGLSLEDAYELVKSRRPIIS-PNFGFLRQLLEFEKRLSKKSPSQ 223 (285) T ss_pred HHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCC-CCHHHHHHHHHHHHHHCCCCCCC T ss_conf 99999998679859998388850768999999999809989999999983499768-89899999999998744556543 No 15 >pfam05706 CDKN3 Cyclin-dependent kinase inhibitor 3 (CDKN3). This family consists of cyclin-dependent kinase inhibitor 3 or kinase associated phosphatase proteins from several mammalian species. The cyclin-dependent kinase (Cdk)-associated protein phosphatase (KAP) is a human dual specificity protein phosphatase that dephosphorylates Cdk2 on threonine 160 in a cyclin-dependent manner. Probab=99.32 E-value=1e-10 Score=87.87 Aligned_cols=129 Identities=19% Similarity=0.131 Sum_probs=96.2 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCCC--HHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHH---CCCCCEEEE Q ss_conf 8999999997499389973787876544--45799999669849987526888989899999999994---279949998 Q gi|254780898|r 60 GTFIEYLKKEYGIKSILNLRGKLPESWH--KEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILK---TAPKPLLIH 134 (207) Q Consensus 60 ~~~~~~l~k~~GIktIInLr~~~~~~~~--~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~---~~~~PVlVH 134 (207) ..++..+ +..||..|+.|....|...| ..-.+..++.|+..+|+|++|...|+.++..++++-+. +.++.++|| T Consensus 61 q~Dleel-k~~giqdVfvlctRgEL~~yrVP~LLd~yq~~G~~vhH~PipDg~~P~~~~c~~ileEl~~~L~~grk~lih 139 (207) T pfam05706 61 QKDTEEL-KSCGIQDIFVFCTRGELSKYRVPNLLDLYQQCGIITHHHPIADGGTPDIASCCEIMEELATCLKNNRKTLIH 139 (207) T ss_pred HHHHHHH-HHCCCCEEEEEEECCHHHHHCCCCHHHHHHHCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 9989999-977996699996234366627530999999769168854778999987899999999999999839848999 Q ss_pred CCCCCCHHHHHHHHHHHHHCC--CCHHHHHHHHHHCCCCCCC---CCHHHHHHHHHHHHHH Q ss_conf 289988899999999999839--9989999961000264267---4245999999999985 Q gi|254780898|r 135 CKSGADRTGLASAVYLYIVAH--YPKEEAHRQLSMLYGHFPV---LKTITMDITFEKITQL 190 (207) Q Consensus 135 C~aG~~RTG~~~alyli~~~g--~~~~eA~~el~~~ygh~~~---~~~~~ld~~~~~~~~~ 190 (207) |..|.||+|+++||.+.. .+ +++++|+....-..|.-.. .+.-++..|.|+++.. T Consensus 140 c~gglgrsglvAAclLl~-l~~~~sp~~AI~~vRe~Rg~~AIet~~Q~nFl~~Fr~~~~~~ 199 (207) T pfam05706 140 SVGGLGRSCLVAACLLLH-LSDPISAEQAIDMHEDLRGGGAIQKEALMNFLHEFFDGLAAH 199 (207) T ss_pred EECCCCCHHHHHHHHHHH-CCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 844654034788888775-079889999999998526877501799999999998878999 No 16 >KOG1718 consensus Probab=99.21 E-value=5.2e-10 Score=83.38 Aligned_cols=142 Identities=18% Similarity=0.198 Sum_probs=99.1 Q ss_pred HCCCCCCCCCCEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCC----CHHH Q ss_conf 078725028947713899989999999974993899737878765444579999966984998752688898----9899 Q gi|254780898|r 41 TQNFHAVVPHEIYRSAQPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATREL----NDEQ 116 (207) Q Consensus 41 ~~Nf~~V~~g~lyRS~qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~----~~~~ 116 (207) .+-+++|.++ ||-|+--.+.+- .++++.||..|||...+.+...+ -++++..+|+.|...- ..+. T Consensus 14 ~~~~SqIt~s-Lfl~~GvaA~~k-~~l~~~~It~IiNat~E~pn~~l---------~~~qy~kv~~~D~p~~~l~~hfD~ 82 (198) T KOG1718 14 IGGMSQITPS-LFLSNGVAANDK-LLLKKRKITCIINATTEVPNTSL---------PDIQYMKVPLEDTPQARLYDHFDP 82 (198) T ss_pred CCCHHHCCCC-EEEECCCCCCCH-HHHHHCCCEEEEECCCCCCCCCC---------CCCEEEEEECCCCCCCHHHHHHHH T ss_conf 4326441742-167545134688-88975584299975667788647---------774068877266886626665027 Q ss_pred HHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 999999994279949998289988899999999999839998999996100026426742459999999999852003 Q gi|254780898|r 117 IKQLISILKTAPKPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQLSMLYGHFPVLKTITMDITFEKITQLYPNN 194 (207) Q Consensus 117 i~~~~~il~~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el~~~ygh~~~~~~~~ld~~~~~~~~~~~~~ 194 (207) +...++-+....|.+||||.||..|+...+.+|||.+++++.-||..+..-+..-+ .++.|+-...-++=.+++-++ T Consensus 83 vAD~I~~v~~~gG~TLvHC~AGVSRSAsLClAYLmK~~~msLreAy~~vKa~RpiI-RPN~GFw~QLi~YE~qL~g~~ 159 (198) T KOG1718 83 VADKIHSVIMRGGKTLVHCVAGVSRSASLCLAYLMKYHCMSLREAYHWVKARRPII-RPNVGFWRQLIDYEQQLFGNA 159 (198) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCEE-CCCCCHHHHHHHHHHHHCCCC T ss_conf 88898898860993799870034313999999999972144999998987417616-788328999999999842787 No 17 >COG2453 CDC14 Predicted protein-tyrosine phosphatase [Signal transduction mechanisms] Probab=99.17 E-value=3.4e-10 Score=84.53 Aligned_cols=77 Identities=21% Similarity=0.395 Sum_probs=65.8 Q ss_pred HHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHC---CCCCEEEECCCCCCHHHHHHHHHHHHH-CCCCHHHHHHHHH Q ss_conf 999996698499875268889898999999999942---799499982899888999999999998-3999899999610 Q gi|254780898|r 91 EKAANDLGIQLINFPLSATRELNDEQIKQLISILKT---APKPLLIHCKSGADRTGLASAVYLYIV-AHYPKEEAHRQLS 166 (207) Q Consensus 91 ~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~---~~~PVlVHC~aG~~RTG~~~alyli~~-~g~~~~eA~~el~ 166 (207) ....+..|+.+.++|+.|...|+.+.+.++.+.+++ ..++|+|||..|.+|||+++|.|+|.. .++..++|+..+. T Consensus 65 ~~~~~~~~~~~~~~~~~D~~~p~~~~l~~~v~~i~~~~~~g~kVvVHC~~GigRSgtviaA~lm~~~~~~~~~~~i~~~~ 144 (180) T COG2453 65 VAIEENDGIQVLHLPILDGTVPDLEDLDKIVDFIEEALSKGKKVVVHCQGGIGRSGTVIAAYLMLYGGLSLADEAIAVKR 144 (180) T ss_pred HHHCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 10002368755741036777889999999999999998449869998789986689999999999649998999999998 Q ss_pred H Q ss_conf 0 Q gi|254780898|r 167 M 167 (207) Q Consensus 167 ~ 167 (207) . T Consensus 145 ~ 145 (180) T COG2453 145 R 145 (180) T ss_pred H T ss_conf 5 No 18 >KOG1717 consensus Probab=99.13 E-value=1.8e-09 Score=80.03 Aligned_cols=130 Identities=20% Similarity=0.338 Sum_probs=90.5 Q ss_pred CCCCCCEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCH-HHHHHHHHHH Q ss_conf 502894771389998999999997499389973787876544457999996698499875268889898-9999999999 Q gi|254780898|r 46 AVVPHEIYRSAQPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELND-EQIKQLISIL 124 (207) Q Consensus 46 ~V~~g~lyRS~qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~-~~i~~~~~il 124 (207) +|.| .||-++--+..+.+ .++++||++|||+.+.-+-. +. +.-.+.|..||++|.-.... .-..+.+.++ T Consensus 174 ~ilp-~LYLg~a~ds~Nld-vLkk~gI~yviNVTpnlpn~-fe------~~g~f~YkqipisDh~Sqnls~ffpEAIsfI 244 (343) T KOG1717 174 EILP-NLYLGCAKDSTNLD-VLKKYGIKYVINVTPNLPNN-FE------NNGEFIYKQIPISDHASQNLSQFFPEAISFI 244 (343) T ss_pred HHCC-CCEECCCCCCCCHH-HHHHCCCEEEEECCCCCCCH-HH------CCCCEEEEEEECCCHHHHHHHHHHHHHHHHH T ss_conf 3143-01023643242188-89865954899779887620-10------4874379984246457665744428889998 Q ss_pred HC---CCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHH Q ss_conf 42---79949998289988899999999999839998999996100026426742459999999 Q gi|254780898|r 125 KT---APKPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQLSMLYGHFPVLKTITMDITFE 185 (207) Q Consensus 125 ~~---~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el~~~ygh~~~~~~~~ld~~~~ 185 (207) ++ .+--|||||-+|..|+-++++.|||.....+..+|+....-+...++ +.-.+|....+ T Consensus 245 deArsk~cgvLVHClaGISRSvTvtvaYLMqkl~lslndAyd~Vk~kksnis-PNFnFMgQLld 307 (343) T KOG1717 245 DEARSKNCGVLVHCLAGISRSVTVTVAYLMQKLNLSLNDAYDFVKHKKSNIS-PNFNFMGQLLD 307 (343) T ss_pred HHHHCCCCCEEEEEECCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCC-CCCCHHHHHHH T ss_conf 8764157767876501454046889999999843556768999987114789-88324688888 No 19 >KOG2836 consensus Probab=98.81 E-value=6.6e-08 Score=70.13 Aligned_cols=111 Identities=18% Similarity=0.328 Sum_probs=82.9 Q ss_pred CCCCHHHH---HHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHH----CCC Q ss_conf 89998999---99999749938997378787654445799999669849987526888989899999999994----279 Q gi|254780898|r 56 AQPNGTFI---EYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILK----TAP 128 (207) Q Consensus 56 ~qP~~~~~---~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~----~~~ 128 (207) -.|+...+ -.-++++|+.|||.+.... | +....++.||...+.+++++.+|+.+.++....+.. +.+ T Consensus 23 hnPtnaTln~fieELkKygvttvVRVCe~T----Y--dt~~lek~GI~Vldw~f~dg~ppp~qvv~~w~~l~~~~f~e~p 96 (173) T KOG2836 23 HNPTNATLNKFIEELKKYGVTTVVRVCEPT----Y--DTTPLEKEGITVLDWPFDDGAPPPNQVVDDWLSLVKTKFREEP 96 (173) T ss_pred CCCCCHHHHHHHHHHHHCCCEEEEEECCCC----C--CCCHHHHCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCC T ss_conf 598711499999999963972899951464----6--7652444582576336656999907888999999999986189 Q ss_pred -CCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHH-CCCCCC Q ss_conf -949998289988899999999999839998999996100-026426 Q gi|254780898|r 129 -KPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQLSM-LYGHFP 173 (207) Q Consensus 129 -~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el~~-~ygh~~ 173 (207) .-|.|||-+|.||+.+.+|+-||. .|+--|+|.+...- +.|-+- T Consensus 97 ~~cvavhcvaglgrapvlvalalie-~gmkyedave~ir~krrga~n 142 (173) T KOG2836 97 GCCVAVHCVAGLGRAPVLVALALIE-AGMKYEDAVEMIRQKRRGAIN 142 (173) T ss_pred CCEEEEEEECCCCCCHHHHHHHHHH-CCCCHHHHHHHHHHHHHCCCC T ss_conf 9769988632367423999999998-265379999999998624311 No 20 >KOG2283 consensus Probab=98.70 E-value=1.6e-08 Score=73.95 Aligned_cols=136 Identities=17% Similarity=0.194 Sum_probs=95.7 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEECCCCCHHHHHHHHHHC--CCCEEEECCCCCCCCCCHHHHHHHHHCCCE Q ss_conf 853210245578788876078725028947713899989999999974--993899737878765444579999966984 Q gi|254780898|r 23 LVLCAVSLGLYFLTITTFTQNFHAVVPHEIYRSAQPNGTFIEYLKKEY--GIKSILNLRGKLPESWHKEEEKAANDLGIQ 100 (207) Q Consensus 23 ~~~~~~~~~~~~~~~~~~~~Nf~~V~~g~lyRS~qP~~~~~~~l~k~~--GIktIInLr~~~~~~~~~~e~~~~~~~gi~ 100 (207) ....+++|...|++.|.+..-|-.=.-+.+||.. -+++..++... |==.|-||+++.- |+. ....=+ T Consensus 6 ~~~~~~DLDltYIT~rIIamsfPa~~~es~yRN~---l~dV~~fL~s~H~~~y~vyNL~~er~---yd~-----~~f~g~ 74 (434) T KOG2283 6 YNEGGFDLDLTYITSRIIAMSFPAEGIESLYRNN---LEDVVLFLDSKHKDHYKVYNLSSERL---YDP-----SRFHGR 74 (434) T ss_pred HHHCCCCCCCEEEEEEEEEEECCCCCCHHHHCCC---HHHHHHHHHHCCCCCEEEEECCCCCC---CCC-----CCCCCC T ss_conf 1002666542123633799837888616652287---99999998621477447986576554---772-----103664 Q ss_pred EEEEECCCCCCCCHHHHHHHHHHHHC-----CCCCEEEECCCCCCHHHHHHHHHHHHHCCCC-HHHHHHHHHHCC Q ss_conf 99875268889898999999999942-----7994999828998889999999999983999-899999610002 Q gi|254780898|r 101 LINFPLSATRELNDEQIKQLISILKT-----APKPLLIHCKSGADRTGLASAVYLYIVAHYP-KEEAHRQLSMLY 169 (207) Q Consensus 101 ~i~ip~~~~~~~~~~~i~~~~~il~~-----~~~PVlVHC~aG~~RTG~~~alyli~~~g~~-~~eA~~el~~~y 169 (207) ...+++++.++|+.+.+..+.+-|++ .+.=+.|||++|++|||+++++|++...-.+ .++|+..+..++ T Consensus 75 V~~~~~~Dh~~P~L~~l~~~c~~~~~WL~~d~~nVvvvHCk~Gkgrtg~~icA~L~~~~~~~ta~eald~~~~kR 149 (434) T KOG2283 75 VARFGFDDHNPPPLELLCPFCKSMDNWLSEDPKNVVVVHCKAGKGRTGVMICAYLIYSGISATAEEALDYFNEKR 149 (434) T ss_pred EEECCCCCCCCCCHHHHHHHHHCHHHHHHCCCCCEEEEECCCCCCCEEEEEEHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 466678778999688899998789989745865369998168986248987477776100588789999986340 No 21 >pfam00102 Y_phosphatase Protein-tyrosine phosphatase. Probab=98.66 E-value=2.2e-07 Score=66.88 Aligned_cols=122 Identities=15% Similarity=0.226 Sum_probs=69.7 Q ss_pred EEECCC-CC---HHHHHHHHHHCCCCEEEECCCCCCC------CCCHHHHHHHH---------------HC--------- Q ss_conf 771389-99---8999999997499389973787876------54445799999---------------66--------- Q gi|254780898|r 52 IYRSAQ-PN---GTFIEYLKKEYGIKSILNLRGKLPE------SWHKEEEKAAN---------------DL--------- 97 (207) Q Consensus 52 lyRS~q-P~---~~~~~~l~k~~GIktIInLr~~~~~------~~~~~e~~~~~---------------~~--------- 97 (207) -|-..| |. -++|..++-+.++++||.|....|. .++..+..... .. T Consensus 44 ~~I~tQ~Pl~~T~~dFW~mv~~~~v~~IVmL~~~~e~~~~kc~~YwP~~~~~~~~~g~~~V~~~~~~~~~~~~~~r~l~v 123 (234) T pfam00102 44 AYIATQGPLPNTIEDFWRMVWEQKVTTIVMLTELVEKGREKCAQYWPDEGGESLTYGDFTVTLVSEETKEADYTVRTLEL 123 (234) T ss_pred EEEEECCCCHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCEECCCCCCCCEEECCEEEEEEEEEECCCCEEEEEEEE T ss_conf 49997899768999999999863586589834543278324435446899974788789999999997789989999899 Q ss_pred ---------CCEEEEE-ECCCCCCC-CHHHHHHHHHHHHC----CCCCEEEECCCCCCHHHHHHHHHHHHH-----CCCC Q ss_conf ---------9849987-52688898-98999999999942----799499982899888999999999998-----3999 Q gi|254780898|r 98 ---------GIQLINF-PLSATREL-NDEQIKQLISILKT----APKPLLIHCKSGADRTGLASAVYLYIV-----AHYP 157 (207) Q Consensus 98 ---------gi~~i~i-p~~~~~~~-~~~~i~~~~~il~~----~~~PVlVHC~aG~~RTG~~~alyli~~-----~g~~ 157 (207) -+.|+++ .+++.+.| +...+..++..+.+ .++|++|||.+|.||||+++|+..+.. ...+ T Consensus 124 ~~~~~~~~~~v~h~~~~~Wpd~~~P~~~~~~l~l~~~v~~~~~~~~~PivVHC~~G~gRtG~f~a~~~~~~~l~~~~~vd 203 (234) T pfam00102 124 SNSGDDESRTVTHFHYTGWPDHGVPESPKSLLDLLRKVRKSNPPRSGPIVVHCSAGVGRTGTFIAIDILLQQLEAEGEVD 203 (234) T ss_pred EECCCCCEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 98899925999999978999889679889999999999975467899989994899963228999999999998669989 Q ss_pred HHHHHHHHHH-CCCCCC Q ss_conf 8999996100-026426 Q gi|254780898|r 158 KEEAHRQLSM-LYGHFP 173 (207) Q Consensus 158 ~~eA~~el~~-~ygh~~ 173 (207) ..++.+.|.- |.+-+. T Consensus 204 v~~~v~~lR~qR~~~V~ 220 (234) T pfam00102 204 VFQIVKELRSQRPGMVQ 220 (234) T ss_pred HHHHHHHHHHHCCCCCC T ss_conf 99999999874544269 No 22 >smart00194 PTPc Protein tyrosine phosphatase, catalytic domain. Probab=98.65 E-value=2.1e-07 Score=66.98 Aligned_cols=114 Identities=17% Similarity=0.218 Sum_probs=68.3 Q ss_pred CHHHHHHHHHHCCCCEEEECCCCCCC------CCCHHHHHH--HH-----------------------------HCCCEE Q ss_conf 98999999997499389973787876------544457999--99-----------------------------669849 Q gi|254780898|r 59 NGTFIEYLKKEYGIKSILNLRGKLPE------SWHKEEEKA--AN-----------------------------DLGIQL 101 (207) Q Consensus 59 ~~~~~~~l~k~~GIktIInLr~~~~~------~~~~~e~~~--~~-----------------------------~~gi~~ 101 (207) +-++|..++-++++++||.|....|. .+|..+.+. +. ..-+.| T Consensus 81 T~~dFW~MV~~~~v~~IVmL~~~~E~~~~kc~~YwP~~~~~~~~g~~~V~~~~~~~~~~~~~r~l~v~~~~~~~~~~v~h 160 (258) T smart00194 81 TVEDFWRMVWEEKVTTIVMLTELEEKGREKCAQYWPEEEEPLTYGDITVTLKSVEKVDDYTIRTLEVTNTGCSETRTVTH 160 (258) T ss_pred HHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCEEECCEEEEEEEEEECCCEEEEEEEEEECCCCCEEEEEE T ss_conf 89999999862147668982352217704564117999974074578999999997798899999999899885289999 Q ss_pred EEEE-CCCCCCC-CHHHHHHHHHHHHC----CCCCEEEECCCCCCHHHHHHHHHHHHH-----CCCCHHHHHHHHHH-CC Q ss_conf 9875-2688898-98999999999942----799499982899888999999999998-----39998999996100-02 Q gi|254780898|r 102 INFP-LSATREL-NDEQIKQLISILKT----APKPLLIHCKSGADRTGLASAVYLYIV-----AHYPKEEAHRQLSM-LY 169 (207) Q Consensus 102 i~ip-~~~~~~~-~~~~i~~~~~il~~----~~~PVlVHC~aG~~RTG~~~alyli~~-----~g~~~~eA~~el~~-~y 169 (207) +++. +++.+.| +...+..++..+.+ .++|++|||.+|.||||+++|+..+.. ...+..++.+.|+- |. T Consensus 161 ~~~~~Wpd~~~P~~~~~~l~li~~v~~~~~~~~~PivVHCs~G~gRtG~f~a~~~~~~~l~~~~~vdv~~~v~~lR~qR~ 240 (258) T smart00194 161 YHYTNWPDHGVPESPKSILDLVRAVRKSQSTSTGPIVVHCSAGVGRTGTFIAIDILLQQLEAGKEVDIFEIVKELRSQRP 240 (258) T ss_pred EEECCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCC T ss_conf 98689998897798799999999999761468999899959999712289999999999985799899999999986120 Q ss_pred CCC Q ss_conf 642 Q gi|254780898|r 170 GHF 172 (207) Q Consensus 170 gh~ 172 (207) +-+ T Consensus 241 ~~V 243 (258) T smart00194 241 GMV 243 (258) T ss_pred HHC T ss_conf 324 No 23 >cd00047 PTPc Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active. Probab=98.54 E-value=7.5e-07 Score=63.48 Aligned_cols=122 Identities=16% Similarity=0.223 Sum_probs=70.8 Q ss_pred EEECCCCCH---HHHHHHHHHCCCCEEEECCCCCCC------CCCHHHHHH--HHH------------------------ Q ss_conf 771389998---999999997499389973787876------544457999--996------------------------ Q gi|254780898|r 52 IYRSAQPNG---TFIEYLKKEYGIKSILNLRGKLPE------SWHKEEEKA--AND------------------------ 96 (207) Q Consensus 52 lyRS~qP~~---~~~~~l~k~~GIktIInLr~~~~~------~~~~~e~~~--~~~------------------------ 96 (207) ...+--|.+ ++|.+++-+.++++||.|....|. .+|..+... +.. T Consensus 43 fI~tQ~Pl~~T~~dFW~Mv~~~~v~~IVmL~~~~e~~~~kc~~YwP~~~~~~~~g~~~I~~~~~~~~~~~~~r~l~v~~~ 122 (231) T cd00047 43 YIATQGPLPNTVEDFWRMVWEQKVPVIVMLTELVEKGREKCAQYWPEEEGSLTYGDITVTLVSEEKLDDYTVRTLKLSNT 122 (231) T ss_pred EEEECCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCEECCCCCCCEEECCEEEEEEEEEECCCEEEEEEEEEEC T ss_conf 99978997788999999998315856999313342671545302788998758656999999999669889999999989 Q ss_pred -----CCCEEEEEE-CCCCCCC-CHHHHHHHHHHHHC-----CCCCEEEECCCCCCHHHHHHHHHHHHH-----CCCCHH Q ss_conf -----698499875-2688898-98999999999942-----799499982899888999999999998-----399989 Q gi|254780898|r 97 -----LGIQLINFP-LSATREL-NDEQIKQLISILKT-----APKPLLIHCKSGADRTGLASAVYLYIV-----AHYPKE 159 (207) Q Consensus 97 -----~gi~~i~ip-~~~~~~~-~~~~i~~~~~il~~-----~~~PVlVHC~aG~~RTG~~~alyli~~-----~g~~~~ 159 (207) .-+.|+++. +++.+.| +...+..++..+++ .++|++|||.+|.||||+++|+..+.. ...+.. T Consensus 123 ~~~~~r~v~h~~y~~Wpd~~~P~~~~~~l~l~~~v~~~~~~~~~~PivVHC~~G~gRsg~f~a~~~~~~~l~~~~~vdv~ 202 (231) T cd00047 123 GTGETRTVTHFQYTGWPDHGVPESPDSLLDLLRKVRKSQQQPGSGPIVVHCSAGVGRTGTFIAIDILLQRLEAEGVVDIF 202 (231) T ss_pred CCCCEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCHH T ss_conf 99967999999889999899579879999999999998516789999999699986545999999999999867998999 Q ss_pred HHHHHHHH-CCCCCC Q ss_conf 99996100-026426 Q gi|254780898|r 160 EAHRQLSM-LYGHFP 173 (207) Q Consensus 160 eA~~el~~-~ygh~~ 173 (207) ++.+.|+- |.+-+- T Consensus 203 ~~v~~lR~qR~~~Vq 217 (231) T cd00047 203 QTVKELRSQRPGMVQ 217 (231) T ss_pred HHHHHHHHHCCCCCC T ss_conf 999999873744039 No 24 >smart00404 PTPc_motif Protein tyrosine phosphatase, catalytic domain motif. Probab=98.24 E-value=8.3e-06 Score=56.90 Aligned_cols=76 Identities=16% Similarity=0.303 Sum_probs=48.0 Q ss_pred EEEEE-ECCCCCCCCH-HHHHHHHHHHHC------CCCCEEEECCCCCCHHHHHHHHHHHHHC-----C-CCHHHHHHHH Q ss_conf 49987-5268889898-999999999942------7994999828998889999999999983-----9-9989999961 Q gi|254780898|r 100 QLINF-PLSATRELND-EQIKQLISILKT------APKPLLIHCKSGADRTGLASAVYLYIVA-----H-YPKEEAHRQL 165 (207) Q Consensus 100 ~~i~i-p~~~~~~~~~-~~i~~~~~il~~------~~~PVlVHC~aG~~RTG~~~alyli~~~-----g-~~~~eA~~el 165 (207) .++++ .+++.+.|.. ..+..++..+++ .++|++|||.+|.||||+++|++.+..+ + .+..++...+ T Consensus 3 ~~~~~~~Wp~~~~P~~~~~~~~~~~~v~~~~~~~~~~~pivVHC~~G~gRtg~f~a~~~~~~~~~~~~~~~~v~~~v~~i 82 (105) T smart00404 3 KHYHYTGWPDHGVPESPDSILEFLRAVKKNLNQSSSSGPVVVHCSAGVGRTGTFVALDILLQQLESETGEVDIFQTVKEL 82 (105) T ss_pred EEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH T ss_conf 99865898989988986999999999999986157998999876999733689999999999998558976799999999 Q ss_pred HH-CCCCCCCC Q ss_conf 00-02642674 Q gi|254780898|r 166 SM-LYGHFPVL 175 (207) Q Consensus 166 ~~-~ygh~~~~ 175 (207) +. |.+-+... T Consensus 83 R~~R~~~v~~~ 93 (105) T smart00404 83 RKQRPGMVQTF 93 (105) T ss_pred HHHCCCCCCCH T ss_conf 96685721949 No 25 >smart00012 PTPc_DSPc Protein tyrosine phosphatase, catalytic domain, undefined specificity. Protein tyrosine phosphatases. Homologues detected by this profile and not by those of "PTPc" or "DSPc" are predicted to be protein phosphatases with a similar fold to DSPs and PTPs, yet with unpredicted specificities. Probab=98.24 E-value=8.3e-06 Score=56.90 Aligned_cols=76 Identities=16% Similarity=0.303 Sum_probs=48.0 Q ss_pred EEEEE-ECCCCCCCCH-HHHHHHHHHHHC------CCCCEEEECCCCCCHHHHHHHHHHHHHC-----C-CCHHHHHHHH Q ss_conf 49987-5268889898-999999999942------7994999828998889999999999983-----9-9989999961 Q gi|254780898|r 100 QLINF-PLSATRELND-EQIKQLISILKT------APKPLLIHCKSGADRTGLASAVYLYIVA-----H-YPKEEAHRQL 165 (207) Q Consensus 100 ~~i~i-p~~~~~~~~~-~~i~~~~~il~~------~~~PVlVHC~aG~~RTG~~~alyli~~~-----g-~~~~eA~~el 165 (207) .++++ .+++.+.|.. ..+..++..+++ .++|++|||.+|.||||+++|++.+..+ + .+..++...+ T Consensus 3 ~~~~~~~Wp~~~~P~~~~~~~~~~~~v~~~~~~~~~~~pivVHC~~G~gRtg~f~a~~~~~~~~~~~~~~~~v~~~v~~i 82 (105) T smart00012 3 KHYHYTGWPDHGVPESPDSILEFLRAVKKNLNQSSSSGPVVVHCSAGVGRTGTFVALDILLQQLESETGEVDIFQTVKEL 82 (105) T ss_pred EEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH T ss_conf 99865898989988986999999999999986157998999876999733689999999999998558976799999999 Q ss_pred HH-CCCCCCCC Q ss_conf 00-02642674 Q gi|254780898|r 166 SM-LYGHFPVL 175 (207) Q Consensus 166 ~~-~ygh~~~~ 175 (207) +. |.+-+... T Consensus 83 R~~R~~~v~~~ 93 (105) T smart00012 83 RKQRPGMVQTF 93 (105) T ss_pred HHHCCCCCCCH T ss_conf 96685721949 No 26 >KOG2386 consensus Probab=98.21 E-value=3.3e-06 Score=59.41 Aligned_cols=132 Identities=17% Similarity=0.262 Sum_probs=87.3 Q ss_pred CCCHHHHHHHHHHCC--CCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCC-CCCHHHHHHHHHHHHC------- Q ss_conf 999899999999749--938997378787654445799999669849987526888-9898999999999942------- Q gi|254780898|r 57 QPNGTFIEYLKKEYG--IKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATR-ELNDEQIKQLISILKT------- 126 (207) Q Consensus 57 qP~~~~~~~l~k~~G--IktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~-~~~~~~i~~~~~il~~------- 126 (207) +=++.++...++++| +.-+|+|-... -++ +....++.|+.|+.+...... .|+.+.+..|..+.+. T Consensus 47 ~f~~~dl~~~l~~~~~~vgl~iDltnt~--ryy--~~~~~~~~g~~Y~K~~c~g~~~vp~~~~v~~fv~~v~~f~~~~~~ 122 (393) T KOG2386 47 RFQPKDLFELLKEHNYKVGLKIDLTNTL--RYY--DKPELEERGVKYLKRNCPGRGVVPRTELVDKFVKLVKGFVDDTKL 122 (393) T ss_pred CCCHHHHHHHHHHCCCEEEEEEECCCEE--EEE--CCCCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCCC T ss_conf 3579999999986381678997245302--432--043335666058870258755678842257899999988750248 Q ss_pred CCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCC Q ss_conf 7994999828998889999999999983999899999610002642674245999999999985200 Q gi|254780898|r 127 APKPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQLSMLYGHFPVLKTITMDITFEKITQLYPN 193 (207) Q Consensus 127 ~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el~~~ygh~~~~~~~~ld~~~~~~~~~~~~ 193 (207) .+.=|+|||.+|..|||-+++.||+...+|+..+|.+.+.--.++.- -+..-++..+..+....|. T Consensus 123 ~~~LI~vhcthG~NrtgyLI~~yL~~~~~~s~~~aik~f~~~r~~gi-~k~dyi~~L~~~~~~~~p~ 188 (393) T KOG2386 123 DDELIGVHCTHGLNRTGYLICAYLADVGGYSSSEAIKRFADARPPGI-EKQDYIDALYSRYHDIFPF 188 (393) T ss_pred CCCEEEEECCCCCCCCCEEEEEEEEECCCCCHHHHHHHHHHHCCCCC-CCHHHHHHHHHCCCCCCCC T ss_conf 98789984788634414156445420257667799999997377766-7558899875252235665 No 27 >COG5599 PTP2 Protein tyrosine phosphatase [Signal transduction mechanisms] Probab=98.01 E-value=5.4e-06 Score=58.11 Aligned_cols=50 Identities=20% Similarity=0.375 Sum_probs=37.4 Q ss_pred ECCCCCCCCHHHHHHHHHHHHC---CCCCEEEECCCCCCHHHHHHHHHHHHHC Q ss_conf 5268889898999999999942---7994999828998889999999999983 Q gi|254780898|r 105 PLSATRELNDEQIKQLISILKT---APKPLLIHCKSGADRTGLASAVYLYIVA 154 (207) Q Consensus 105 p~~~~~~~~~~~i~~~~~il~~---~~~PVlVHC~aG~~RTG~~~alyli~~~ 154 (207) .+.|...|+-..+.+++.-+.. ..+|++|||.||.||||+++|+--++.. T Consensus 192 nW~D~~~p~i~sl~~~~~sl~~sp~~t~piiVHCSAGvGRTGTFIalD~ll~~ 244 (302) T COG5599 192 NWVDFNVPDIRSLTEVIHSLNDSPVRTGPIIVHCSAGVGRTGTFIALDILLRM 244 (302) T ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCEEEEHHHHHHC T ss_conf 85224786789999999875048678887799942677775035558988736 No 28 >COG5350 Predicted protein tyrosine phosphatase [General function prediction only] Probab=97.75 E-value=0.00029 Score=47.19 Aligned_cols=106 Identities=19% Similarity=0.234 Sum_probs=68.5 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEE--CCCCCCCCHHHHHHHHHHHHCC--CCCEEEECC Q ss_conf 999999997499389973787876544457999996698499875--2688898989999999999427--994999828 Q gi|254780898|r 61 TFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFP--LSATRELNDEQIKQLISILKTA--PKPLLIHCK 136 (207) Q Consensus 61 ~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip--~~~~~~~~~~~i~~~~~il~~~--~~PVlVHC~ 136 (207) ....+...++|-...|+|+.....-.-..+...-.-+.+..-|+. -..+..|.++.+...+++.++. ..|++|||. T Consensus 22 ~~~ae~~~rh~~t~mlsl~a~~t~~~~pa~~~~erhL~l~fnDI~~~~~g~~ap~e~Hv~~i~DF~~~wp~~apllIHC~ 101 (172) T COG5350 22 SVIAETAARHGPTHMLSLLAKGTYFHRPAVIAAERHLTLHFNDIAEPDDGWIAPGEAHVRAIIDFADEWPRFAPLLIHCY 101 (172) T ss_pred HHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCHHHCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEEC T ss_conf 99988876148725787503665455723354543146764463578863447888999999999860765464520010 Q ss_pred CCCCHHHHHHHH-HHHHHCCCCHHHHHHHHH Q ss_conf 998889999999-999983999899999610 Q gi|254780898|r 137 SGADRTGLASAV-YLYIVAHYPKEEAHRQLS 166 (207) Q Consensus 137 aG~~RTG~~~al-yli~~~g~~~~eA~~el~ 166 (207) +|.+|+..+... -+.+...++.+|..+.|. T Consensus 102 aGISRStA~A~i~a~ala~~~de~ela~~Lr 132 (172) T COG5350 102 AGISRSTAAALIAALALAPDMDETELAERLR 132 (172) T ss_pred CCCCCCHHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 0456036999999876355468699999998 No 29 >KOG0791 consensus Probab=97.71 E-value=7e-05 Score=51.07 Aligned_cols=124 Identities=17% Similarity=0.165 Sum_probs=71.1 Q ss_pred CCCEEECCC---C-CHHHHHHHHHHCCCCEEEECCCCCCCC------CCHHHHH-------------------------- Q ss_conf 894771389---9-989999999974993899737878765------4445799-------------------------- Q gi|254780898|r 49 PHEIYRSAQ---P-NGTFIEYLKKEYGIKSILNLRGKLPES------WHKEEEK-------------------------- 92 (207) Q Consensus 49 ~g~lyRS~q---P-~~~~~~~l~k~~GIktIInLr~~~~~~------~~~~e~~-------------------------- 92 (207) +++.|-..| | +.++|.+++=+..+..||-|..-.|.+ ++..+.. T Consensus 163 s~~~fIAtQGPlp~t~~~fW~mvwq~~~~~IVmlt~~~e~~R~kc~~ywP~~~~~~~~gdi~V~~v~e~~~~~w~ir~~~ 242 (374) T KOG0791 163 SPREFIATQGPLPETRDDFWKMVWQQKSHIIVMLTKCNEKGRVKCDEYWPDEEVPVAYGDITVTMVSEESLDEWTIREFR 242 (374) T ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHCCEEEEEEECCCCCCHHHHHHCCCCCCCCEECCEEEEEECHHHCCCCEEEEEE T ss_conf 71038986078888726789998875164499986323033146565465565562123379999222224770677776 Q ss_pred ---HHHHCCCEEEEE-ECCCCCCCC-HHHHHHHHH----HHHCCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCH----- Q ss_conf ---999669849987-526888989-899999999----994279949998289988899999999999839998----- Q gi|254780898|r 93 ---AANDLGIQLINF-PLSATRELN-DEQIKQLIS----ILKTAPKPLLIHCKSGADRTGLASAVYLYIVAHYPK----- 158 (207) Q Consensus 93 ---~~~~~gi~~i~i-p~~~~~~~~-~~~i~~~~~----il~~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~----- 158 (207) .++...+.+++. .+++.++|+ ...+.+|.. .+...++|++|||.+|.+|||+++|+-+++.+--+. T Consensus 243 l~~~~~sr~ir~f~y~~wPd~gvp~~~~sl~~f~~~~r~~~~~~~~p~iVhCSAGVgRTGTFiald~LLqq~~~~~~vdi 322 (374) T KOG0791 243 LNYAGESRKIRHFHYTAWPDFGVPSSTESLLQFVRMVRQSLDTSKGPTIVHCSAGVGRTGTFIALDRLLQQIDSEETVDI 322 (374) T ss_pred EECCCCCCEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHCCCCCCCH T ss_conf 30135652057898840335678887134899999998650557886778960256764525769999997356562318 Q ss_pred HHHHHHHHHCCCCC Q ss_conf 99999610002642 Q gi|254780898|r 159 EEAHRQLSMLYGHF 172 (207) Q Consensus 159 ~eA~~el~~~ygh~ 172 (207) -....+|+...+|. T Consensus 323 ~~iv~~lR~~R~~m 336 (374) T KOG0791 323 FGVVLELRSARMLM 336 (374) T ss_pred HHHHHHHHHHCCCC T ss_conf 89999867515510 No 30 >KOG0792 consensus Probab=97.65 E-value=0.00028 Score=47.29 Aligned_cols=132 Identities=15% Similarity=0.211 Sum_probs=72.1 Q ss_pred CHHHHHHHHHHCCCCEEEECCCCCCCC------CCHHH-HHHH--HHC----------------CCEEEE---------- Q ss_conf 989999999974993899737878765------44457-9999--966----------------984998---------- Q gi|254780898|r 59 NGTFIEYLKKEYGIKSILNLRGKLPES------WHKEE-EKAA--NDL----------------GIQLIN---------- 103 (207) Q Consensus 59 ~~~~~~~l~k~~GIktIInLr~~~~~~------~~~~e-~~~~--~~~----------------gi~~i~---------- 103 (207) |-.+|.+.+=+.|+..|+=|..+.|.+ +|..- .+.. .++ +++..| T Consensus 950 T~~DFWQMVWEQ~~~lIvMlT~e~EggR~KchqYWPr~~~~t~~ygrf~v~~~~~~~t~~y~tr~m~l~~~~t~eeR~V~ 1029 (1144) T KOG0792 950 TCTDFWQMVWEQGSTLIVMLTTEVEGGRVKCHQYWPRLGHETMEYGRFQVTCVFEQQTTCYVTREMTLKDLQTREERTVW 1029 (1144) T ss_pred HHHHHHHHHHHCCCEEEEEEEEHHHCCEECCCCCCCCCCCCCEECCCEEEEEEEECCCCCEEEEEEEEEECCCCCEEEEE T ss_conf 27889999986375399997551216702020106877735111064689999723656479876788632677446655 Q ss_pred ----EECCCCCCCCH-HHHHHHHHHHHC---C-CCCEEEECCCCCCHHHHHHH----HHHHHHCC--CCHHHHHHHHHHC Q ss_conf ----75268889898-999999999942---7-99499982899888999999----99999839--9989999961000 Q gi|254780898|r 104 ----FPLSATRELND-EQIKQLISILKT---A-PKPLLIHCKSGADRTGLASA----VYLYIVAH--YPKEEAHRQLSML 168 (207) Q Consensus 104 ----ip~~~~~~~~~-~~i~~~~~il~~---~-~~PVlVHC~aG~~RTG~~~a----lyli~~~g--~~~~eA~~el~~~ 168 (207) ..+++.+.|++ .....|++-+.. . +-||+|||.||.||||+++. +|++. ++ .++-...+.|+.. T Consensus 1030 hLQYtaWPDHg~P~D~~~FL~FleevrsvR~~t~pPilvHCSAGiGRTGVlIl~e~~l~lle-~Ne~vdi~divr~mR~Q 1108 (1144) T KOG0792 1030 HLQYTAWPDHGVPDDPNDFLDFLEEVRSVRRGTNPPILVHCSAGIGRTGVLILMETALCLLE-HNEPVDILDIVRTMRDQ 1108 (1144) T ss_pred EEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCEEHHHHHHHHHHHH-CCCCCCHHHHHHHHHHH T ss_conf 55542466678987768899999999998616899868972477875401069999999986-28988789999999987 Q ss_pred C-CCCCCCCHHHHHHHHHHHHHHCCC Q ss_conf 2-642674245999999999985200 Q gi|254780898|r 169 Y-GHFPVLKTITMDITFEKITQLYPN 193 (207) Q Consensus 169 y-gh~~~~~~~~ld~~~~~~~~~~~~ 193 (207) . .-+....+| -...+.|-+.|.+ T Consensus 1109 R~~mVQT~~QY--kFVyevil~~l~~ 1132 (1144) T KOG0792 1109 RAMMVQTLSQY--KFVYEVILRVLKR 1132 (1144) T ss_pred HHHHCCCHHHH--HHHHHHHHHHHHH T ss_conf 75520665786--5799999999985 No 31 >KOG4228 consensus Probab=97.58 E-value=4.2e-05 Score=52.49 Aligned_cols=47 Identities=19% Similarity=0.338 Sum_probs=28.8 Q ss_pred CCCCCCCC-HHHHHHHHHHHHCC----CCCEEEECCCCCCHHHHHHHHHHHH Q ss_conf 26888989-89999999999427----9949998289988899999999999 Q gi|254780898|r 106 LSATRELN-DEQIKQLISILKTA----PKPLLIHCKSGADRTGLASAVYLYI 152 (207) Q Consensus 106 ~~~~~~~~-~~~i~~~~~il~~~----~~PVlVHC~aG~~RTG~~~alyli~ 152 (207) +++.+.|. ..-+..|++-.+.. .||++|||.||.||||+++++--|+ T Consensus 703 Wpd~gvPe~~t~lL~f~rrvk~~~p~~aGPiVVHCSAGvGRTG~fi~iDaml 754 (1087) T KOG4228 703 WPDHGVPETPTGLLKFRRRVKTFNPPDAGPIVVHCSAGVGRTGCFIVIDAML 754 (1087) T ss_pred CCCCCCCCCCCHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCCEEEEEHHHH T ss_conf 7888985462078999887404798678987998989988852478858989 No 32 >KOG0789 consensus Probab=97.57 E-value=0.0002 Score=48.16 Aligned_cols=108 Identities=16% Similarity=0.247 Sum_probs=59.7 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCC------CCHHHH---HHHHHCC--------------------------------- Q ss_conf 9999999974993899737878765------444579---9999669--------------------------------- Q gi|254780898|r 61 TFIEYLKKEYGIKSILNLRGKLPES------WHKEEE---KAANDLG--------------------------------- 98 (207) Q Consensus 61 ~~~~~l~k~~GIktIInLr~~~~~~------~~~~e~---~~~~~~g--------------------------------- 98 (207) ++|.+.+-+.++.+||-|....|.+ ++..+. ..+...+ T Consensus 181 ~dFW~mv~~~k~~~Ivml~~~~E~~~~kc~~Y~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 260 (415) T KOG0789 181 EDFWRMVFEEKVESVVLLCSDEELGAAECASYFPSKRGVKLTFLEFGKKFVENRKVEMKQQDQTSTVGLLLLVLPEGLEI 260 (415) T ss_pred HHHHHHHHHCCCCEEEEEECHHHCCCCCCCEEEECCCCCCCCCCCCCEEEEEEEEEEECCCCCCCEEEEEEEECCCCCCC T ss_conf 13688987427865999807556675214322016888644301115078987776402455421256889964788766 Q ss_pred ---CEEEEE-ECCCCCCCC-HHHHHHHHH--HHHC--CCCCEEEECCCCCCHHHHHHHHHHHHHC--C----CCHHHHHH Q ss_conf ---849987-526888989-899999999--9942--7994999828998889999999999983--9----99899999 Q gi|254780898|r 99 ---IQLINF-PLSATRELN-DEQIKQLIS--ILKT--APKPLLIHCKSGADRTGLASAVYLYIVA--H----YPKEEAHR 163 (207) Q Consensus 99 ---i~~i~i-p~~~~~~~~-~~~i~~~~~--il~~--~~~PVlVHC~aG~~RTG~~~alyli~~~--g----~~~~eA~~ 163 (207) +.|.+. .+++.+.|. ...+..++. .... ..+|+.|||.+|.||||+++++-.+..+ + -+..+... T Consensus 261 ~~~v~~~~~~~WPd~~~p~~~~~~l~~~~~~~~~~~~~~~P~vVhcsaG~gRtgt~v~~~~~~~~~~~~~~~~~~~~~~~ 340 (415) T KOG0789 261 SSSVVHYHYINWPDHGAPDSVKSILPLLRQSVLELRPKQEPIEVHCSAGAGRAGTLVLIEHALIELQGPEGEPPIDEILR 340 (415) T ss_pred CCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHH T ss_conf 73689986689876678723788999988888750689999588868999614238889999999866788886799999 Q ss_pred HHHHC Q ss_conf 61000 Q gi|254780898|r 164 QLSML 168 (207) Q Consensus 164 el~~~ 168 (207) +++.. T Consensus 341 ~iR~q 345 (415) T KOG0789 341 EIRYQ 345 (415) T ss_pred HHHHH T ss_conf 99851 No 33 >KOG0790 consensus Probab=97.35 E-value=6.9e-05 Score=51.14 Aligned_cols=47 Identities=15% Similarity=0.364 Sum_probs=32.7 Q ss_pred EECCCCCCCC-HHHHHHHHHHHHC------CCCCEEEECCCCCCHHHHHHHHHH Q ss_conf 7526888989-8999999999942------799499982899888999999999 Q gi|254780898|r 104 FPLSATRELN-DEQIKQLISILKT------APKPLLIHCKSGADRTGLASAVYL 150 (207) Q Consensus 104 ip~~~~~~~~-~~~i~~~~~il~~------~~~PVlVHC~aG~~RTG~~~alyl 150 (207) ..+++.+.|. +-.+..|++-.+. ..+||.|||.||.||||+++.+-. T Consensus 420 ~tWPDHGvP~dPg~vLnFLe~V~~rq~~l~~AgpIvVHCSAGIGrTGTfiViD~ 473 (600) T KOG0790 420 LTWPDHGVPSDPGGVLNFLEEVNHRQESLMDAGPIVVHCSAGIGRTGTFIVIDM 473 (600) T ss_pred ECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCEEEEHHH T ss_conf 104667986995488878988634012424458679976677677623998299 No 34 >KOG4228 consensus Probab=97.17 E-value=0.0021 Score=41.73 Aligned_cols=108 Identities=19% Similarity=0.309 Sum_probs=58.7 Q ss_pred CHHHHHHHHHHCCCCEEEECCCCCCC---C--CC------------------HHHHHHHHHCCCEEE---------EEE- Q ss_conf 98999999997499389973787876---5--44------------------457999996698499---------875- Q gi|254780898|r 59 NGTFIEYLKKEYGIKSILNLRGKLPE---S--WH------------------KEEEKAANDLGIQLI---------NFP- 105 (207) Q Consensus 59 ~~~~~~~l~k~~GIktIInLr~~~~~---~--~~------------------~~e~~~~~~~gi~~i---------~ip- 105 (207) +-++|.+++-+++..|||.|..-++. + |. ..+....+..+|+.. ++. T Consensus 907 T~~DFWrmi~d~~~tsiVmL~~l~~~~~C~qyw~~~g~~~yg~i~Ve~~~~~~~~~~t~r~f~i~n~~~~~~r~v~qfq~ 986 (1087) T KOG4228 907 TVEDFWRMIWDQNVTSIVMLTELKHPEKCPQYWPPEGSQRYGPIEVEDMNEHINPQYTAREFGVTNEREKQSRTVRQFQF 986 (1087) T ss_pred CHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCCCEECCCEEEEECCCCCCHHHHHHHHEEEECCCCCCEEEEEEEE T ss_conf 06788887650550699983146761005654598676203768997420200246654631555312467269999885 Q ss_pred --CCCCCCC--CHHHHH---HHHHHHHC--CCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHH------HHHHHHHH Q ss_conf --2688898--989999---99999942--799499982899888999999999998399989------99996100 Q gi|254780898|r 106 --LSATREL--NDEQIK---QLISILKT--APKPLLIHCKSGADRTGLASAVYLYIVAHYPKE------EAHRQLSM 167 (207) Q Consensus 106 --~~~~~~~--~~~~i~---~~~~il~~--~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~------eA~~el~~ 167 (207) ++..+.+ ....+. +...-.+. .++|+.|||..|.+|||+++|+-.++ +....+ .+.+-|.. T Consensus 987 ~~WP~~~~~p~~~~~~~~i~~~~~~~q~~~~~~P~~Vhc~nG~~rsg~f~ai~~l~-e~~~~e~~vDVfq~vk~Lr~ 1062 (1087) T KOG4228 987 TGWPEYGKPPQSKGPISKIPSVASKWQQLGADGPIIVHCLNGVGRTGTFCAISILL-ERMRKEGVVDVFQTVKTLRF 1062 (1087) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCEEEHHHHHHHH-HHHHHCCCEEEEHHHHHHHH T ss_conf 27854576877741356678888788852678987999717986243166999999-98763283366525445543 No 35 >KOG0793 consensus Probab=96.96 E-value=0.00084 Score=44.29 Aligned_cols=113 Identities=17% Similarity=0.225 Sum_probs=63.9 Q ss_pred HHHCCCCCCCCC----CEEECCC-CC---HHHHHHHHHHCCCCEEEECCCCCCCC------CCHHHHHH----------- Q ss_conf 760787250289----4771389-99---89999999974993899737878765------44457999----------- Q gi|254780898|r 39 TFTQNFHAVVPH----EIYRSAQ-PN---GTFIEYLKKEYGIKSILNLRGKLPES------WHKEEEKA----------- 93 (207) Q Consensus 39 ~~~~Nf~~V~~g----~lyRS~q-P~---~~~~~~l~k~~GIktIInLr~~~~~~------~~~~e~~~----------- 93 (207) .+|-|-+.|.+. -.|-..| |- -.+|.+.+=+.|.--||+|..-.|.. +|.+|... T Consensus 787 ~dYiNAS~I~DhDPR~paYIAtQgPl~stiA~FWQmvWe~G~~vIV~Lt~l~Engv~qc~rYWPdeGselyhiyEV~LVS 866 (1004) T KOG0793 787 SDYINASPIMDHDPRNPAYIATQGPLPSTIADFWQMVWESGCVVIVMLTPLAENGVRQCYRYWPDEGSELYHIYEVNLVS 866 (1004) T ss_pred CCCCCCCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHCCCEEEEEECCHHHCCHHHHHHCCCCCCCCEEEEEEEEEEH T ss_conf 65203554435898763322236887158999999999738379998267565564566523887776033467864302 Q ss_pred --------------HHHC------CCEEEE-EECCCCCCCC-HHHHHHHHHHHHCC----CCCEEEECCCCCCHHHHHHH Q ss_conf --------------9966------984998-7526888989-89999999999427----99499982899888999999 Q gi|254780898|r 94 --------------ANDL------GIQLIN-FPLSATRELN-DEQIKQLISILKTA----PKPLLIHCKSGADRTGLASA 147 (207) Q Consensus 94 --------------~~~~------gi~~i~-ip~~~~~~~~-~~~i~~~~~il~~~----~~PVlVHC~aG~~RTG~~~a 147 (207) .+++ -++..| +.+.+.+.|. ...+..|.+-.+++ .-||+|||..|+||||+.++ T Consensus 867 EHIWceDfLVRSFYLKNlqtseTRTvTQFHfLSWp~egvPasarslLdFRRKVNK~YRGRScpIiVH~sdGaGRTG~Yil 946 (1004) T KOG0793 867 EHIWCEDFLVRSFYLKNLQTSETRTVTQFHFLSWPDEGVPASARSLLDFRRKVNKCYRGRSCPIIVHCSDGAGRTGTYIL 946 (1004) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCCCCEEEE T ss_conf 44303467899999853133552135544541254568863137899999885242157778669970478886312562 Q ss_pred HHHH Q ss_conf 9999 Q gi|254780898|r 148 VYLY 151 (207) Q Consensus 148 lyli 151 (207) +-++ T Consensus 947 iDmv 950 (1004) T KOG0793 947 IDMV 950 (1004) T ss_pred HHHH T ss_conf 9999 No 36 >cd01522 RHOD_1 Member of the Rhodanese Homology Domain superfamily, subgroup 1. This CD includes the putative rhodanese-related sulfurtransferases of several uncharacterized proteins. Probab=96.46 E-value=0.015 Score=36.37 Aligned_cols=86 Identities=16% Similarity=0.215 Sum_probs=48.5 Q ss_pred CCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCC-C-EEEEEECCC--CCCCCHHHHHHHHHHHHCCCCCEEE Q ss_conf 99899999999749938997378787654445799999669-8-499875268--8898989999999999427994999 Q gi|254780898|r 58 PNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLG-I-QLINFPLSA--TRELNDEQIKQLISILKTAPKPLLI 133 (207) Q Consensus 58 P~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~g-i-~~i~ip~~~--~~~~~~~~i~~~~~il~~~~~PVlV 133 (207) .||.+...++++.+=-.+|++|.+.|.. ..| | .-+++|... ...++++-+..+.+.+ ..+.||+| T Consensus 1 Itp~eA~e~L~~~~~~~lIDVRt~~E~~----------~~G~ipga~~I~~~~~~~~~~~~~f~~~l~~~~-~~d~~ivv 69 (117) T cd01522 1 LTPAEAWALLQADPQAVLVDVRTEAEWK----------FVGGVPDAVHVAWQVYPDMEINPNFLAELEEKV-GKDRPVLL 69 (117) T ss_pred CCHHHHHHHHHHCCCEEEEECCCHHHHH----------HHCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHC-CCCCEEEE T ss_conf 9989999999849983999899889998----------629888851120343112245812789999745-99986999 Q ss_pred ECCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 828998889999999999983999 Q gi|254780898|r 134 HCKSGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 134 HC~aG~~RTG~~~alyli~~~g~~ 157 (207) +|++|. |++.++.. +...|+. T Consensus 70 ~C~sG~-RS~~Aa~~--L~~~Gf~ 90 (117) T cd01522 70 LCRSGN-RSIAAAEA--AAQAGFT 90 (117) T ss_pred ECCCCC-HHHHHHHH--HHHCCCC T ss_conf 889981-59999999--9985987 No 37 >cd01518 RHOD_YceA Member of the Rhodanese Homology Domain superfamily. This CD includes Escherichia coli YceA, Bacillus subtilis YbfQ, and similar uncharacterized proteins. Probab=96.25 E-value=0.011 Score=37.16 Aligned_cols=83 Identities=20% Similarity=0.129 Sum_probs=46.3 Q ss_pred CCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCC-EEEEEECCCCCCCCHHHHHHHHHHHH-CCCCCEEE Q ss_conf 89998999999997499389973787876544457999996698-49987526888989899999999994-27994999 Q gi|254780898|r 56 AQPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGI-QLINFPLSATRELNDEQIKQLISILK-TAPKPLLI 133 (207) Q Consensus 56 ~qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi-~~i~ip~~~~~~~~~~~i~~~~~il~-~~~~PVlV 133 (207) ...+|+++..++++..+ .||++|.+.|.. .-.| .-+++|+.... +....+.+..+ ..++||+| T Consensus 2 ~~isp~e~~~~i~~~~~-~liDVR~~~E~~----------~Ghi~gA~nip~~~f~----~~~~~~~~~~~~~k~k~ivv 66 (101) T cd01518 2 TYLSPAEWNELLEDPEV-VLLDVRNDYEYD----------IGHFKGAVNPDVDTFR----EFPFWLDENLDLLKGKKVLM 66 (101) T ss_pred CCCCHHHHHHHHHCCCE-EEEECCCHHHHH----------CCEEEEEECCCCCCHH----HHHHHHHHHHCCCCCCEEEE T ss_conf 63399999999818991-999896979980----------7853031858830244----42799987212143876999 Q ss_pred ECCCCCCHHHHHHHHHHHHHCCC Q ss_conf 82899888999999999998399 Q gi|254780898|r 134 HCKSGADRTGLASAVYLYIVAHY 156 (207) Q Consensus 134 HC~aG~~RTG~~~alyli~~~g~ 156 (207) +|.+|. |+..++.. | ...|+ T Consensus 67 yC~~G~-RS~~Aa~~-L-~~~Gf 86 (101) T cd01518 67 YCTGGI-RCEKASAY-L-KERGF 86 (101) T ss_pred ECCCCH-HHHHHHHH-H-HHCCC T ss_conf 859982-79999999-9-98498 No 38 >cd01533 4RHOD_Repeat_2 Member of the Rhodanese Homology Domain superfamily, repeat 2. This CD includes putative rhodanese-related sulfurtransferases which contain 4 copies of the Rhodanese Homology Domain. This CD aligns the 2nd repeat which does contain the putative catalytic Cys residue. Probab=96.22 E-value=0.014 Score=36.58 Aligned_cols=87 Identities=14% Similarity=0.184 Sum_probs=52.8 Q ss_pred EEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCE Q ss_conf 77138999899999999749938997378787654445799999669849987526888989899999999994279949 Q gi|254780898|r 52 IYRSAQPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPL 131 (207) Q Consensus 52 lyRS~qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PV 131 (207) ..+-...+++++..++.+-.=-.+|+.|.+.|.. .. |||- +...|-.+....+.++..+.+.|| T Consensus 6 ~~~~P~Is~~elk~~l~~g~~~~llDVRe~~E~~----------~~-----hIPG-A~~iP~~~l~~r~~~l~pd~~~~i 69 (109) T cd01533 6 VRHTPSVSADELAALQARGAPLVVLDGRRFDEYR----------KM-----TIPG-SVSCPGAELVLRVGELAPDPRTPI 69 (109) T ss_pred HHCCCCCCHHHHHHHHHCCCCEEEEECCCHHHHH----------HC-----CCCC-CEECCHHHHHHHHHHHCCCCCCEE T ss_conf 0059957899999999779985999698988886----------07-----5899-888772378776987388999859 Q ss_pred EEECCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 99828998889999999999983999 Q gi|254780898|r 132 LIHCKSGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 132 lVHC~aG~~RTG~~~alyli~~~g~~ 157 (207) ++||.+|. |+..++.. | .-.|++ T Consensus 70 vv~C~~G~-RS~~aa~~-L-~~~G~~ 92 (109) T cd01533 70 VVNCAGRT-RSIIGAQS-L-INAGLP 92 (109) T ss_pred EEECCCCH-HHHHHHHH-H-HHCCCC T ss_conf 99889984-99999999-9-986998 No 39 >KOG4471 consensus Probab=96.21 E-value=0.014 Score=36.53 Aligned_cols=51 Identities=22% Similarity=0.463 Sum_probs=31.9 Q ss_pred HHHHHHH---CCCCCEEEECCCCCCHHHHHHHHHHHHH-------CCCCHHHHHHHHHHCCCC Q ss_conf 9999994---2799499982899888999999999998-------399989999961000264 Q gi|254780898|r 119 QLISILK---TAPKPLLIHCKSGADRTGLASAVYLYIV-------AHYPKEEAHRQLSMLYGH 171 (207) Q Consensus 119 ~~~~il~---~~~~PVlVHC~aG~~RTG~~~alyli~~-------~g~~~~eA~~el~~~ygh 171 (207) ....+.+ ....||||||.-|-|||.-.+++-.++. +|+-.---..|| .+|| T Consensus 362 ga~~Ia~kVe~~~~sVlVHCSDGWDRT~QlvsLA~LlLDpYYRTieGFqvLVEkeWL--sFGH 422 (717) T KOG4471 362 GAVRIADKVESESRSVLVHCSDGWDRTAQLVSLAMLLLDPYYRTIEGFQVLVEKEWL--SFGH 422 (717) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHH--HCCC T ss_conf 889999998517963899857875323899999998725155323556888777777--5177 No 40 >cd01447 Polysulfide_ST Polysulfide-sulfurtransferase - Rhodanese Homology Domain. This domain is believed to serve as a polysulfide binding and transferase domain in anaerobic gram-negative bacteria, functioning in oxidative phosphorylation with polysulfide-sulfur as a terminal electron acceptor. The active site contains the same conserved cysteine that is the catalytic residue in other Rhodanese Homology Domain proteins. Probab=95.77 E-value=0.017 Score=36.15 Aligned_cols=82 Identities=18% Similarity=0.177 Sum_probs=43.4 Q ss_pred CCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCC-C-EEEEEECCCC---CCCCHHHHHHHHHHHHCCCCCEE Q ss_conf 99899999999749938997378787654445799999669-8-4998752688---89898999999999942799499 Q gi|254780898|r 58 PNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLG-I-QLINFPLSAT---RELNDEQIKQLISILKTAPKPLL 132 (207) Q Consensus 58 P~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~g-i-~~i~ip~~~~---~~~~~~~i~~~~~il~~~~~PVl 132 (207) .+|+++..++++-++ .+|++|.+.|. ...| | .-+++|.... ..+.. .+.......++||+ T Consensus 1 Isp~ea~~ll~~~~~-~lIDVR~~~E~----------~~~G~I~gA~~ip~~~l~~~~~~~~----~~~~~~~~~~~~iv 65 (103) T cd01447 1 LSPEDARALLGSPGV-LLVDVRDPREL----------ERTGMIPGAFHAPRGMLEFWADPDS----PYHKPAFAEDKPFV 65 (103) T ss_pred CCHHHHHHHHHCCCE-EEEECCCHHHH----------HHCCCCCCCEECCHHHHHHHHCCCC----HHHHHHCCCCCEEE T ss_conf 988999999838895-99989888999----------8248678888077767777628230----44565218888799 Q ss_pred EECCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 9828998889999999999983999 Q gi|254780898|r 133 IHCKSGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 133 VHC~aG~~RTG~~~alyli~~~g~~ 157 (207) ++|++|. |+..++ -++ ...|+. T Consensus 66 ~~C~~G~-RS~~aa-~~L-~~~Gy~ 87 (103) T cd01447 66 FYCASGW-RSALAG-KTL-QDMGLK 87 (103) T ss_pred EECCCCC-HHHHHH-HHH-HHCCCC T ss_conf 9869980-599999-999-985984 No 41 >cd01527 RHOD_YgaP Member of the Rhodanese Homology Domain superfamily. This CD includes Escherichia coli YgaP, and similar uncharacterized putative rhodanese-related sulfurtransferases. Probab=95.40 E-value=0.026 Score=34.86 Aligned_cols=69 Identities=19% Similarity=0.192 Sum_probs=40.4 Q ss_pred CCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCC-EEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECC Q ss_conf 998999999997499389973787876544457999996698-4998752688898989999999999427994999828 Q gi|254780898|r 58 PNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGI-QLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCK 136 (207) Q Consensus 58 P~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi-~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~ 136 (207) .+|++..+++++ |. .+|++|.+.|. ....| .-+++|+ .++.... .-...++||++||+ T Consensus 4 Isp~ea~~l~~~-ga-vliDVR~~~E~----------~~ghIpgA~~iPl--------~~l~~~~-~~~~~~~~vv~~C~ 62 (99) T cd01527 4 ISPNDACELLAQ-GA-VLVDIREPDEY----------LRERIPGARLVPL--------SQLESEG-LPLVGANAIIFHCR 62 (99) T ss_pred ECHHHHHHHHHC-CC-EEEECCCHHHH----------HCCCCCCCEECCH--------HHHHHHH-CCCCCCCEEEEECC T ss_conf 799999999978-89-89988988999----------6278987850788--------8876740-23578986999879 Q ss_pred CCCCHHHHHHHH Q ss_conf 998889999999 Q gi|254780898|r 137 SGADRTGLASAV 148 (207) Q Consensus 137 aG~~RTG~~~al 148 (207) +|. |+..++.. T Consensus 63 ~G~-RS~~aa~~ 73 (99) T cd01527 63 SGM-RTQQNAER 73 (99) T ss_pred CCH-HHHHHHHH T ss_conf 973-69999999 No 42 >cd01444 GlpE_ST GlpE sulfurtransferase (ST) and homologs are members of the Rhodanese Homology Domain superfamily. Unlike other rhodanese sulfurtransferases, GlpE is a single domain protein but indications are that it functions as a dimer. The active site contains a catalytically active cysteine. Probab=95.31 E-value=0.069 Score=32.24 Aligned_cols=82 Identities=13% Similarity=0.195 Sum_probs=45.8 Q ss_pred CCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECC Q ss_conf 99989999999974993899737878765444579999966984998752688898989999999999427994999828 Q gi|254780898|r 57 QPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCK 136 (207) Q Consensus 57 qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~ 136 (207) |.|++++..++++..=-.+|++|.+.+. + ...| |+|- ....+...+..+..-+ ..++||+|+|. T Consensus 1 rIs~~el~~~l~~~~~~~liDVR~~~e~-------~--~~~~----hi~g--a~~i~~~~~~~~~~~~-~~~~~ivv~C~ 64 (96) T cd01444 1 RISVDELAELLAAGEAPVLLDVRDPASY-------A--ALPD----HIPG--AIHLDEDSLDDWLGDL-DRDRPVVVYCY 64 (96) T ss_pred CCCHHHHHHHHHCCCCCEEEECCCHHHH-------H--HHCC----CCCC--CCCCCHHHHHHHHHHH-CCCCCEEEEEC T ss_conf 9798999999848999689989888999-------8--6208----7778--6312366799988985-68981899808 Q ss_pred CCCCHHHHHHHHHHHHHCCCC Q ss_conf 998889999999999983999 Q gi|254780898|r 137 SGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 137 aG~~RTG~~~alyli~~~g~~ 157 (207) +|. |+..++. ++. ..|++ T Consensus 65 ~G~-rs~~aa~-~L~-~~G~~ 82 (96) T cd01444 65 HGN-SSAQLAQ-ALR-EAGFT 82 (96) T ss_pred CCH-HHHHHHH-HHH-HHCCC T ss_conf 966-8999999-999-82898 No 43 >PRK10287 thiosulfate:cyanide sulfurtransferase; Provisional Probab=95.31 E-value=0.021 Score=35.45 Aligned_cols=64 Identities=14% Similarity=0.188 Sum_probs=37.3 Q ss_pred EEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHC Q ss_conf 99737878765444579999966984998752688898989999999999427994999828998889999999999983 Q gi|254780898|r 75 ILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKSGADRTGLASAVYLYIVA 154 (207) Q Consensus 75 IInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~aG~~RTG~~~alyli~~~ 154 (207) +|++|.+.|. +.- |++- +...|-.+......++..+.+.|++|||++|. |++.++.. | ... T Consensus 23 ~IDVRt~~Ef----------~~G-----HI~g-AiNIPl~ei~~~~~el~~dK~~~IvvyC~sG~-RS~~A~~~-L-~~~ 83 (104) T PRK10287 23 WIDVRVPEQY----------QQE-----HVQG-AINIPLKEVKERIATAVPDKNDTVKVYCNAGR-QSGQAKEI-L-SEM 83 (104) T ss_pred EEECCCHHHH----------HCC-----CCCC-CEECCHHHHHHHHHHHCCCCCCCEEEECCCCH-HHHHHHHH-H-HHC T ss_conf 7888988999----------708-----9765-78575889999998704477993899869983-89999999-9-976 Q ss_pred CCC Q ss_conf 999 Q gi|254780898|r 155 HYP 157 (207) Q Consensus 155 g~~ 157 (207) |.+ T Consensus 84 Gy~ 86 (104) T PRK10287 84 GYT 86 (104) T ss_pred CCC T ss_conf 997 No 44 >cd01528 RHOD_2 Member of the Rhodanese Homology Domain superfamily, subgroup 2. Subgroup 2 includes uncharacterized putative rhodanese-related domains. Probab=95.22 E-value=0.16 Score=29.92 Aligned_cols=81 Identities=12% Similarity=0.085 Sum_probs=43.5 Q ss_pred CCCHHHHHHHHHHCCC-CEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHH-HHHH-CCCCCEEE Q ss_conf 9998999999997499-3899737878765444579999966984998752688898989999999-9994-27994999 Q gi|254780898|r 57 QPNGTFIEYLKKEYGI-KSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLI-SILK-TAPKPLLI 133 (207) Q Consensus 57 qP~~~~~~~l~k~~GI-ktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~-~il~-~~~~PVlV 133 (207) |.|+.++..++.+..= -.||++|.+.+. +.. |+|- +...| ...+.... ++.. ..++||++ T Consensus 1 eIs~~el~~~l~~~~~~~~liDVR~~~E~----------~~~-----hI~g-ai~iP-~~~~~~~~~~~~~~~~d~~iv~ 63 (101) T cd01528 1 QISVAELAEWLADEREEPVLIDVREPEEL----------EIA-----FLPG-FLHLP-MSEIPERSKELDSDNPDKDIVV 63 (101) T ss_pred CCCHHHHHHHHHCCCCCEEEEECCCHHHH----------HHC-----CCCC-CCCCC-HHHHHHHHHHHHHHCCCCCEEE T ss_conf 95989999999739998399978874897----------317-----6798-86688-7789999997341168980899 Q ss_pred ECCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 828998889999999999983999 Q gi|254780898|r 134 HCKSGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 134 HC~aG~~RTG~~~alyli~~~g~~ 157 (207) +|.+|. |+..+ |-+|.. .|++ T Consensus 64 ~C~~G~-rS~~a-a~~L~~-~G~~ 84 (101) T cd01528 64 LCHHGG-RSMQV-AQWLLR-QGFE 84 (101) T ss_pred ECCCCH-HHHHH-HHHHHH-CCCC T ss_conf 838981-59999-999998-5997 No 45 >PRK08762 molybdopterin biosynthesis protein MoeB; Validated Probab=94.98 E-value=0.043 Score=33.56 Aligned_cols=79 Identities=15% Similarity=0.180 Sum_probs=45.3 Q ss_pred CCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCC-EEEEEECCCCCCCCHHHHH-HHHHHHHCCCCCEEEE Q ss_conf 9998999999997499389973787876544457999996698-4998752688898989999-9999994279949998 Q gi|254780898|r 57 QPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGI-QLINFPLSATRELNDEQIK-QLISILKTAPKPLLIH 134 (207) Q Consensus 57 qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi-~~i~ip~~~~~~~~~~~i~-~~~~il~~~~~PVlVH 134 (207) ..+|++...+++ .| -.+|++|.++|. +.-.| .-+++|+ ..++ .......+.++||+|+ T Consensus 5 eItp~ea~~~~~-~g-avliDVRe~~E~----------~~GhIpgA~~ip~--------~~le~~~~~~~pdkd~~ivvy 64 (379) T PRK08762 5 EISPAQARARAL-QG-AVLIDVRQAHER----------ASGQAEGALAIAQ--------GFLELQPATHLPDHDREIVLI 64 (379) T ss_pred CCCHHHHHHHHH-CC-CEEEECCCHHHH----------HCCCCCCCEECCH--------HHHHHHHHHCCCCCCCEEEEE T ss_conf 658999999987-89-889989887899----------5788878887882--------445541330188999859999 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCCH Q ss_conf 289988899999999999839998 Q gi|254780898|r 135 CKSGADRTGLASAVYLYIVAHYPK 158 (207) Q Consensus 135 C~aG~~RTG~~~alyli~~~g~~~ 158 (207) |++|. |++.++.. | ...|++. T Consensus 65 C~sG~-RS~~Aa~~-L-~~~Gy~~ 85 (379) T PRK08762 65 CQSGK-RSAHAAAD-L-RELGYRR 85 (379) T ss_pred CCCCH-HHHHHHHH-H-HHCCCCC T ss_conf 99987-99999999-9-9769966 No 46 >cd01526 RHOD_ThiF Member of the Rhodanese Homology Domain superfamily. This CD includes several putative molybdopterin synthase sulfurylases including the molybdenum cofactor biosynthetic protein (CnxF) of Aspergillus nidulans and the molybdenum cofactor synthesis protein 3 (MOCS3) of Homo sapiens. These rhodanese-like domains are found C-terminal of the ThiF and MoeZ_MoeB domains. Probab=94.97 E-value=0.034 Score=34.17 Aligned_cols=87 Identities=18% Similarity=0.150 Sum_probs=45.4 Q ss_pred CCCHHHHHHHHHHCCCC-EEEECCCCCCCCCCHHHHHHHHHCCC-EEEEEECCCCCCCCHHHHHHHHHH-HHCCCCCEEE Q ss_conf 99989999999974993-89973787876544457999996698-499875268889898999999999-9427994999 Q gi|254780898|r 57 QPNGTFIEYLKKEYGIK-SILNLRGKLPESWHKEEEKAANDLGI-QLINFPLSATRELNDEQIKQLISI-LKTAPKPLLI 133 (207) Q Consensus 57 qP~~~~~~~l~k~~GIk-tIInLr~~~~~~~~~~e~~~~~~~gi-~~i~ip~~~~~~~~~~~i~~~~~i-l~~~~~PVlV 133 (207) +.|++++..+++. |-. .+|++|.+.+. +.-.| .-+++|+........+........ -...+.||+| T Consensus 9 ~Is~~el~~~l~~-~~~~~llDVR~~~E~----------~~~hI~gainiP~~~l~~~~~~~~~~~~~~~~~~~d~~Ivv 77 (122) T cd01526 9 RVSVKDYKNILQA-GKKHVLLDVRPKVHF----------EICRLPEAINIPLSELLSKAAELKSLQELPLDNDKDSPIYV 77 (122) T ss_pred CCCHHHHHHHHHC-CCCCEEEECCCHHHH----------CCCCCCCCEECCHHHHHHHHHHHHHHHCCCCCCCCCCCEEE T ss_conf 6099999999966-999789979888998----------11617898965889988888764554301224689987899 Q ss_pred ECCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 828998889999999999983999 Q gi|254780898|r 134 HCKSGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 134 HC~aG~~RTG~~~alyli~~~g~~ 157 (207) +|.+|. |+..++ .+|.. .|.+ T Consensus 78 ~C~~G~-rS~~Aa-~~L~~-~G~~ 98 (122) T cd01526 78 VCRRGN-DSQTAV-RKLKE-LGLE 98 (122) T ss_pred ECCCCC-HHHHHH-HHHHH-CCCC T ss_conf 849996-599999-99998-1998 No 47 >PRK00162 glpE thiosulfate sulfurtransferase; Validated Probab=94.83 E-value=0.12 Score=30.63 Aligned_cols=78 Identities=15% Similarity=0.170 Sum_probs=43.7 Q ss_pred CCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCC Q ss_conf 99899999999749938997378787654445799999669849987526888989899999999994279949998289 Q gi|254780898|r 58 PNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKS 137 (207) Q Consensus 58 P~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~a 137 (207) .+++++.+++++ |=-.||++|.+.+. +.- |+|- +...| ...+..++.-+ +.++|++++|.+ T Consensus 7 Is~~e~~~~~~~-~~~~liDVR~~~e~----------~~g-----HI~g-A~~ip-~~~l~~~~~~~-~~d~~ivv~C~~ 67 (104) T PRK00162 7 INVEQAHQKLQE-GEAVLVDIRDPQSF----------AMG-----HAPG-AFHLT-NDSLGAFMRDA-DFDTPVMVMCYH 67 (104) T ss_pred ECHHHHHHHHHC-CCEEEEECCCHHHH----------HCC-----CCCC-CCCCC-HHHHHHHHHHC-CCCCCEEEEECC T ss_conf 489999999978-99399989988999----------638-----7578-86389-89999999736-888988999598 Q ss_pred CCCHHHHHHHHHHHHHCCCC Q ss_conf 98889999999999983999 Q gi|254780898|r 138 GADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 138 G~~RTG~~~alyli~~~g~~ 157 (207) |. |+..++ -+|.. .|+. T Consensus 68 G~-rS~~aa-~~L~~-~Gf~ 84 (104) T PRK00162 68 GN-SSQGAA-QYLLQ-QGFD 84 (104) T ss_pred CH-HHHHHH-HHHHH-CCCC T ss_conf 81-899999-99997-3764 No 48 >PRK08057 cobalt-precorrin-6x reductase; Reviewed Probab=94.58 E-value=0.36 Score=27.76 Aligned_cols=85 Identities=14% Similarity=0.165 Sum_probs=57.4 Q ss_pred EECCCC-CHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCC------HHHHHHHHHHHH Q ss_conf 713899-9899999999749938997378787654445799999669849987526888989------899999999994 Q gi|254780898|r 53 YRSAQP-NGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELN------DEQIKQLISILK 125 (207) Q Consensus 53 yRS~qP-~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~------~~~i~~~~~il~ 125 (207) .|.|.. +.+.+..++++++|+.|||-.-+--..-..+-.+.|+++||.|+.+--+.+.... .+.++...+.+. T Consensus 43 i~~G~~~~~~~l~~~l~~~~i~~VIDATHPfA~~is~~a~~a~~~~~ipylR~eRp~~~~~~~d~~~~v~s~~ea~~~l~ 122 (241) T PRK08057 43 VRVGGFGGAEGLAAYLREEGIDLVVDATHPYAAQISANAAAACRALGIPYLRLERPPWQPQPGDDWIEVDDIAAAAEALA 122 (241) T ss_pred EEECCCCCHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCCCEEEECCHHHHHHHHH T ss_conf 79888899999999999679989998999708999999999999869707999678656788877899799999999876 Q ss_pred CCCCCEEEECCCCC Q ss_conf 27994999828998 Q gi|254780898|r 126 TAPKPLLIHCKSGA 139 (207) Q Consensus 126 ~~~~PVlVHC~aG~ 139 (207) ..++.||+- .|. T Consensus 123 ~~~~~vllt--tG~ 134 (241) T PRK08057 123 PFGRRVLLT--TGR 134 (241) T ss_pred HCCCCEEEE--CCC T ss_conf 438857983--161 No 49 >cd01523 RHOD_Lact_B Member of the Rhodanese Homology Domain superfamily. This CD includes predicted proteins with rhodanese-like domains found N-terminal of the metallo-beta-lactamase domain. Probab=94.55 E-value=0.17 Score=29.71 Aligned_cols=84 Identities=13% Similarity=0.047 Sum_probs=46.4 Q ss_pred CCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCE-EEEEECCCCCCCCHHHHHHHHHHHHC--CCCCEEEE Q ss_conf 9989999999974993899737878765444579999966984-99875268889898999999999942--79949998 Q gi|254780898|r 58 PNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQ-LINFPLSATRELNDEQIKQLISILKT--APKPLLIH 134 (207) Q Consensus 58 P~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~-~i~ip~~~~~~~~~~~i~~~~~il~~--~~~PVlVH 134 (207) .+|+++...+..-.--.||++|.+.|.. .-.|. -+++|+ ..|..+......+.++. .+++|+|+ T Consensus 1 I~p~el~~~l~~~e~~~llDVR~~~E~~----------~ghI~Ga~~ip~---~~~~~~~~~~~~~~~~~lp~d~~ivv~ 67 (100) T cd01523 1 LDPEDLYARLLAGQPLFILDVRNESDYE----------RWKIDGENNTPY---FDPYFDFLEIEEDILDQLPDDQEVTVI 67 (100) T ss_pred CCHHHHHHHHHCCCCCEEEECCCHHHHH----------CCCCCCCCCCCC---CCHHHHHHHHHHHHHHHCCCCCEEEEE T ss_conf 9889999999779996899899778884----------284558736774---202444445688899647899839999 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 28998889999999999983999 Q gi|254780898|r 135 CKSGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 135 C~aG~~RTG~~~alyli~~~g~~ 157 (207) |.+|. |+..++. +| ...|++ T Consensus 68 C~~G~-rS~~aa~-~L-~~~G~~ 87 (100) T cd01523 68 CAKEG-SSQFVAE-LL-AERGYD 87 (100) T ss_pred ECCCH-HHHHHHH-HH-HHCCCE T ss_conf 28965-6999999-99-987983 No 50 >pfam02571 CbiJ Precorrin-6x reductase CbiJ/CobK. This family consists of Precorrin-6x reductase EC:1.3.1.54. This enzyme catalyses the reaction: precorrin-6Y + NADP(+) <= precorrin-6X + NADPH. CbiJ and CobK both catalyse the reduction of macocycle in the colbalmin biosynthesis pathway. Probab=93.77 E-value=0.42 Score=27.30 Aligned_cols=82 Identities=15% Similarity=0.241 Sum_probs=54.4 Q ss_pred EECCCC-CHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCC------HHHHHHHHHHHH Q ss_conf 713899-9899999999749938997378787654445799999669849987526888989------899999999994 Q gi|254780898|r 53 YRSAQP-NGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELN------DEQIKQLISILK 125 (207) Q Consensus 53 yRS~qP-~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~------~~~i~~~~~il~ 125 (207) .++|.. +++++..++++++|+.|||-.-+--..-..+-.+.|+++||.|+.+--+.+...+ .+.+++..+.+. T Consensus 45 ~~~G~l~~~~~m~~~i~~~~i~~vIDATHPfA~~is~na~~a~~~~~ipyiR~eRp~~~~~~~d~~~~v~s~~ea~~~l~ 124 (246) T pfam02571 45 VRVGGFGGADGLAAYLREEGIDAVIDATHPFAAQISRNAAAACKELGVPLLRLERPAWQPGPGDRWIYVDSLAEAAAALA 124 (246) T ss_pred EEECCCCCHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCEEEECCHHHHHHHHH T ss_conf 89799899999999999779979998999968999999999999859968996062126788987799799999999876 Q ss_pred CCC-CCEEEE Q ss_conf 279-949998 Q gi|254780898|r 126 TAP-KPLLIH 134 (207) Q Consensus 126 ~~~-~PVlVH 134 (207) +.. +.||+- T Consensus 125 ~~~~~~Vllt 134 (246) T pfam02571 125 ELPGKRVFLT 134 (246) T ss_pred HCCCCEEEEE T ss_conf 4678658984 No 51 >KOG1089 consensus Probab=93.65 E-value=0.17 Score=29.74 Aligned_cols=66 Identities=21% Similarity=0.304 Sum_probs=39.4 Q ss_pred HHCCCCCEEEECCCCCCHHHHHHHHHHHHHC-------CCCHHHHHHHHH------HCCCCCC----CC------CHHHH Q ss_conf 9427994999828998889999999999983-------999899999610------0026426----74------24599 Q gi|254780898|r 124 LKTAPKPLLIHCKSGADRTGLASAVYLYIVA-------HYPKEEAHRQLS------MLYGHFP----VL------KTITM 180 (207) Q Consensus 124 l~~~~~PVlVHC~aG~~RTG~~~alyli~~~-------g~~~~eA~~el~------~~ygh~~----~~------~~~~l 180 (207) ++...-+|||||.-|.|||-.|+.+--+... |+-.-.-+.|+. -|.||.. .+ -+-+| T Consensus 340 l~~~~~sVlvhcsdGwDrT~qV~SLaQllLDP~yRTi~GFqsLIeKeWi~~GH~F~~Rc~hl~~~~~~~ke~SPvF~qFL 419 (573) T KOG1089 340 LSSEGASVLVHCSDGWDRTCQVSSLAQLLLDPYYRTIKGFQSLIEKEWISFGHKFLDRCGHLAYNDGDSKEESPVFLQFL 419 (573) T ss_pred HHHCCCEEEEECCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCCCCHHHHHH T ss_conf 97179759997468742367999999997283035688799999999987378178760774456678743471899999 Q ss_pred HHHHHHHHH Q ss_conf 999999998 Q gi|254780898|r 181 DITFEKITQ 189 (207) Q Consensus 181 d~~~~~~~~ 189 (207) |-+|..+++ T Consensus 420 DcvwQl~~Q 428 (573) T KOG1089 420 DCVWQLLEQ 428 (573) T ss_pred HHHHHHHHH T ss_conf 999999853 No 52 >cd01449 TST_Repeat_2 Thiosulfate sulfurtransferase (TST), C-terminal, catalytic domain. TST contains 2 copies of the Rhodanese Homology Domain; this is the second repeat. Only the second repeat contains the catalytically active Cys residue. Probab=93.38 E-value=0.25 Score=28.77 Aligned_cols=95 Identities=18% Similarity=0.234 Sum_probs=46.9 Q ss_pred CHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCC-C-EEEEEECCC-----CCCCCHHHHHHHHHHH-HCCCCC Q ss_conf 9899999999749938997378787654445799999669-8-499875268-----8898989999999999-427994 Q gi|254780898|r 59 NGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLG-I-QLINFPLSA-----TRELNDEQIKQLISIL-KTAPKP 130 (207) Q Consensus 59 ~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~g-i-~~i~ip~~~-----~~~~~~~~i~~~~~il-~~~~~P 130 (207) +.+++...+.+-++ .||+.|.+.+............+.| | .-+++|+.. ....+.+++.+...-+ -+.+++ T Consensus 2 ~~dev~~~l~~~~~-~liD~R~~~ey~G~~~~~~~~~r~GhIpgAinip~~~~~~~~~~~~~~~~l~~~~~~~gi~~~~~ 80 (118) T cd01449 2 TAEEVLANLDSGDV-QLVDARSPERFRGEVPEPRPGLRSGHIPGAVNIPWTSLLDEDGTFKSPEELRALFAALGITPDKP 80 (118) T ss_pred CHHHHHHHHCCCCC-EEEECCCHHHHCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCC T ss_conf 88999997679993-89979877773476467677771889899535474253135776589999999987608998886 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 999828998889999999999983999 Q gi|254780898|r 131 LLIHCKSGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 131 VlVHC~aG~~RTG~~~alyli~~~g~~ 157 (207) |.++|.+|. |+..+. ..+ ...|+. T Consensus 81 iV~yC~sG~-rA~~~~-~~l-~~lG~~ 104 (118) T cd01449 81 VIVYCGSGV-TACVLL-LAL-ELLGYK 104 (118) T ss_pred EEEECCCCH-HHHHHH-HHH-HHCCCC T ss_conf 777479988-999999-999-980899 No 53 >cd01519 RHOD_HSP67B2 Member of the Rhodanese Homology Domain superfamily. This CD includes the heat shock protein 67B2 of Drosophila melanogaster and other similar proteins, many of which are uncharacterized. Probab=93.31 E-value=0.071 Score=32.18 Aligned_cols=86 Identities=23% Similarity=0.281 Sum_probs=46.3 Q ss_pred CHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCC-EEEEEECCCCC---CCCHHHHHHHHHHHH-CCCCCEEE Q ss_conf 98999999997499389973787876544457999996698-49987526888---989899999999994-27994999 Q gi|254780898|r 59 NGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGI-QLINFPLSATR---ELNDEQIKQLISILK-TAPKPLLI 133 (207) Q Consensus 59 ~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi-~~i~ip~~~~~---~~~~~~i~~~~~il~-~~~~PVlV 133 (207) |-+++..+++..-=..||+.|.+.|. ..-.| .-+++|+.... ..+++.+.+...... ..++||++ T Consensus 2 s~~e~k~ll~~~p~~vLIDvR~~~E~----------~~ghIpgainip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iv~ 71 (106) T cd01519 2 SFEEVKNLPNPHPNKVLIDVREPEEL----------KTGKIPGAINIPLSSLPDALALSEEEFEKKYGFPKPSKDKELIF 71 (106) T ss_pred CHHHHHHHHHCCCCEEEEECCCHHHH----------HCCCCCCCEECCHHHHHHHHCCCHHHHHHHHHHCCCCCCCCEEE T ss_conf 88999999860989599989788999----------75978888745776655652399999999970215888996899 Q ss_pred ECCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 828998889999999999983999 Q gi|254780898|r 134 HCKSGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 134 HC~aG~~RTG~~~alyli~~~g~~ 157 (207) +|.+|. |+..++ .++ ...|++ T Consensus 72 yC~~G~-rS~~a~-~~l-~~~Gy~ 92 (106) T cd01519 72 YCKAGV-RSKAAA-ELA-RSLGYE 92 (106) T ss_pred ECCCCC-HHHHHH-HHH-HHCCCC T ss_conf 839997-399999-999-986998 No 54 >cd01534 4RHOD_Repeat_3 Member of the Rhodanese Homology Domain superfamily, repeat 3. This CD includes putative rhodanese-related sulfurtransferases which contain 4 copies of the Rhodanese Homology Domain. This CD aligns the 3rd repeat which does not contain the putative catalytic Cys residue. Probab=93.29 E-value=0.22 Score=29.12 Aligned_cols=79 Identities=9% Similarity=0.082 Sum_probs=45.3 Q ss_pred CHHHHHHHHHHCCC--CEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECC Q ss_conf 98999999997499--3899737878765444579999966984998752688898989999999999427994999828 Q gi|254780898|r 59 NGTFIEYLKKEYGI--KSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCK 136 (207) Q Consensus 59 ~~~~~~~l~k~~GI--ktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~ 136 (207) ++.++..++.+ |= -.||++|.+.|. +.- |+|- +...|-.+..+..-++....+.||+++|. T Consensus 2 d~a~l~~w~~~-~~r~~~llDVR~~~E~----------~~G-----HIpG-A~~iP~~~L~~~~~~~~~~k~~~ivl~C~ 64 (95) T cd01534 2 GAAELARWAAE-GDRTVYRFDVRTPEEY----------EAG-----HLPG-FRHTPGGQLVQETDHFAPVRGARIVLADD 64 (95) T ss_pred CHHHHHHHHHC-CCCCEEEEECCCHHHH----------HCC-----CCCC-CEECCHHHHHHHHHHHCCCCCCEEEEECC T ss_conf 88999999868-9973799979988999----------738-----8877-78888999877765424668983999859 Q ss_pred CCCCHHHHHHHHHHHHHCCCC Q ss_conf 998889999999999983999 Q gi|254780898|r 137 SGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 137 aG~~RTG~~~alyli~~~g~~ 157 (207) +|. |+..++.. | ...|++ T Consensus 65 ~G~-RS~~AA~~-L-~~~G~~ 82 (95) T cd01534 65 DGV-RADMTASW-L-AQMGWE 82 (95) T ss_pred CCC-CHHHHHHH-H-HHCCCC T ss_conf 998-79999999-9-986997 No 55 >TIGR02090 LEU1_arch isopropylmalate/citramalate/homocitrate synthases; InterPro: IPR011830 Methanogenic archaea contain three closely related homologs of the 2-isopropylmalate synthases (LeuA) represented by IPR005671 from INTERPRO. Two of these in Methanococcus janaschii (MJ1392 - CimA ; MJ0503 - AksA ) have been characterised as catalyzing alternative reactions leaving the third (MJ1195) as the presumptive LeuA enzyme. CimA is citramalate (2-methylmalate) synthase, which condenses acetyl-CoA with pyruvate. This enzyme is believed to be involved in the biosynthesis of isoleucine in methanogens and possibly other species lacking threonine dehydratase. AksA is a homocitrate synthase which also produces (homo)2-citrate and (homo)3-citrate in the biosynthesis of Coenzyme B which is restricted solely to methanogenic archaea. Methanogens, then should and apparently do contain all three of these enzymes. Unfortunately, phylogenetic trees do not resolve into three unambiguous clades, making assignment of function to particular genes problematic. Other archaea, which lack a threonine dehydratase (mainly Euryarchaeota), should contain both CimA and LeuA genes. This is true for archaeoglobus fulgidis, but not for the Pyrococci which have none in this clade, but one in IPR005671 from INTERPRO and one in IPR005675 from INTERPRO which may fulfil these roles. Proteins from other species, which have only one hit to this entry and lack threonine dehydratase, are very likely to be LeuA enzymes.; GO: 0046912 transferase activity transferring acyl groups acyl groups converted into alkyl on transfer, 0019752 carboxylic acid metabolic process. Probab=92.94 E-value=0.4 Score=27.40 Aligned_cols=121 Identities=24% Similarity=0.306 Sum_probs=70.7 Q ss_pred HHHHHCCCCEEEECCCCCC----CCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHH-CCC--CCEEEECCC Q ss_conf 9999749938997378787----654445799999669849987526888989899999999994-279--949998289 Q gi|254780898|r 65 YLKKEYGIKSILNLRGKLP----ESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILK-TAP--KPLLIHCKS 137 (207) Q Consensus 65 ~l~k~~GIktIInLr~~~~----~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~-~~~--~PVlVHC~a 137 (207) +|.|++|+ ||-+-.++- -++.=.-...+.++|=..+++| +.-+..+|+.+..++.-+. +.+ .||=|||+- T Consensus 121 EYAKEHGL--iVEfSAEDATRtd~dfLIk~~k~A~eAGADRi~~~-DTVGV~~P~km~~l~k~~k~~~kKd~~~sVHCHN 197 (371) T TIGR02090 121 EYAKEHGL--IVEFSAEDATRTDIDFLIKVFKKAEEAGADRINVA-DTVGVLTPQKMEELIKKIKENVKKDLPVSVHCHN 197 (371) T ss_pred HHHHHCCC--EEEECCCCCCCCCHHHHHHHHHHHHHCCCCEEEEC-CCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEEC T ss_conf 87752573--55317788765867899999871332167775507-8556368267999999999863588705886208 Q ss_pred CCCHHHHHHHHHHHHH-C-----------------CCCHHHHHHHHHHCCCC---CCCCCHHHHHHHHHHHHHHC Q ss_conf 9888999999999998-3-----------------99989999961000264---26742459999999999852 Q gi|254780898|r 138 GADRTGLASAVYLYIV-A-----------------HYPKEEAHRQLSMLYGH---FPVLKTITMDITFEKITQLY 191 (207) Q Consensus 138 G~~RTG~~~alyli~~-~-----------------g~~~~eA~~el~~~ygh---~~~~~~~~ld~~~~~~~~~~ 191 (207) =-| ++.|=-..-+ . +-+.||-..-|..-||. +...+-|=+=++-++|.... T Consensus 198 DFG---lAtANsi~gv~aGA~~vH~TvNGiGERAGNAaLEEVV~AL~~LYG~~~~IK~ekLYE~SrlVs~l~~~~ 269 (371) T TIGR02090 198 DFG---LATANSIAGVLAGAEQVHVTVNGIGERAGNAALEEVVMALKVLYGVDTKIKTEKLYETSRLVSKLSGVK 269 (371) T ss_pred CCC---HHHHHHHHHHHCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCC T ss_conf 601---889999999722835773555576101345889999999898714102445300201777899871788 No 56 >COG5016 Pyruvate/oxaloacetate carboxyltransferase [Energy production and conversion] Probab=92.57 E-value=0.53 Score=26.68 Aligned_cols=79 Identities=22% Similarity=0.272 Sum_probs=53.4 Q ss_pred HHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHC-CCCCEEEECCCCCCHHHHHHHHHHHHHC-CCC-HHHHHHHHHH Q ss_conf 999996698499875268889898999999999942-7994999828998889999999999983-999-8999996100 Q gi|254780898|r 91 EKAANDLGIQLINFPLSATRELNDEQIKQLISILKT-APKPLLIHCKSGADRTGLASAVYLYIVA-HYP-KEEAHRQLSM 167 (207) Q Consensus 91 ~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~-~~~PVlVHC~aG~~RTG~~~alyli~~~-g~~-~~eA~~el~~ 167 (207) .....+.|+..|-|. +..+..++...-+++..+++ .+-||-+||++ -||+..++|+..++ |.+ .+.|++-||+ T Consensus 162 akel~~~g~DSIciK-DmaGlltP~~ayelVk~iK~~~~~pv~lHtH~---TsG~a~m~ylkAvEAGvD~iDTAisp~S~ 237 (472) T COG5016 162 AKELLEMGVDSICIK-DMAGLLTPYEAYELVKAIKKELPVPVELHTHA---TSGMAEMTYLKAVEAGVDGIDTAISPLSG 237 (472) T ss_pred HHHHHHCCCCEEEEE-CCCCCCCHHHHHHHHHHHHHHCCCEEEEECCC---CCCHHHHHHHHHHHHCCCHHHHHHCCCCC T ss_conf 999997279878840-00026986889999999997459706985045---55617999999998176422210045557 Q ss_pred CCCCCC Q ss_conf 026426 Q gi|254780898|r 168 LYGHFP 173 (207) Q Consensus 168 ~ygh~~ 173 (207) +++|=+ T Consensus 238 gtsqP~ 243 (472) T COG5016 238 GTSQPA 243 (472) T ss_pred CCCCCC T ss_conf 888995 No 57 >PRK07878 molybdopterin biosynthesis-like protein MoeZ; Validated Probab=92.04 E-value=1 Score=24.80 Aligned_cols=81 Identities=15% Similarity=0.192 Sum_probs=41.2 Q ss_pred CCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCC-EEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEEC Q ss_conf 9998999999997499389973787876544457999996698-499875268889898999999999942799499982 Q gi|254780898|r 57 QPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGI-QLINFPLSATRELNDEQIKQLISILKTAPKPLLIHC 135 (207) Q Consensus 57 qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi-~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC 135 (207) ..++.++.+++.+..=-++|++|.+.|... ..| .-+++|+.... .. ..+.++ ..+++++|+| T Consensus 288 ~Is~~ELk~~l~~~~~~~LIDVRe~~E~~~----------~~Ipga~~IPl~~l~--~~---~~l~~l--~kd~~IVvyC 350 (392) T PRK07878 288 TITPRELREWLDSGKKLALIDVREPVEWDI----------VHIPGAQLIPKSEIN--SG---EALAKL--PQDRTIVLYC 350 (392) T ss_pred CCCHHHHHHHHHCCCCCEEEECCCHHHCCE----------EECCCCEECCHHHHC--CC---HHHHHC--CCCCCEEEEC T ss_conf 679999999984699857997898343331----------167898555777801--40---247645--8998199987 Q ss_pred CCCCCHHHHHHHHHHHHHCCCC Q ss_conf 8998889999999999983999 Q gi|254780898|r 136 KSGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 136 ~aG~~RTG~~~alyli~~~g~~ 157 (207) ++|. |+-.++.. |. ..|++ T Consensus 351 ~sG~-RS~~A~~~-L~-~~G~~ 369 (392) T PRK07878 351 KTGV-RSAEALAA-VK-KAGFS 369 (392) T ss_pred CCCH-HHHHHHHH-HH-HCCCC T ss_conf 9986-89999999-99-74998 No 58 >cd01524 RHOD_Pyr_redox Member of the Rhodanese Homology Domain superfamily. Included in this CD are the Lactococcus lactis NADH oxidase, Bacillus cereus NADH dehydrogenase, and Bacteroides thetaiotaomicron pyridine nucleotide-disulphide oxidoreductase, and similar rhodanese-like domains found C-terminal of the pyridine nucleotide-disulphide oxidoreductase (Pyr-redox) domain and the Pyr-redox dimerization domain. Probab=91.89 E-value=0.98 Score=24.99 Aligned_cols=74 Identities=14% Similarity=0.229 Sum_probs=39.1 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCC Q ss_conf 99999999749938997378787654445799999669849987526888989899999999994279949998289988 Q gi|254780898|r 61 TFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKSGAD 140 (207) Q Consensus 61 ~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~aG~~ 140 (207) .++..+. +.++ .||++|.+.|.. .- |+|- +...|- ..+...+.-+ +.++|++++|.+|. T Consensus 4 ~el~~~~-~~~~-~liDVR~~~E~~----------~~-----hI~g-ai~iP~-~~l~~~~~~l-~k~~~ivv~C~~G~- 62 (90) T cd01524 4 HELDNYR-ADGV-TLIDVRTPQEFE----------KG-----HIKG-AINIPL-DELRDRLNEL-PKDKEIIVYCAVGL- 62 (90) T ss_pred HHHHHHH-CCCE-EEEECCCHHHHH----------CC-----CCCC-CCCCCH-HHHHHHHHHH-CCCCEEEEECCCCC- T ss_conf 8999875-6998-999798789995----------59-----5688-746850-7799999986-15870999889980- Q ss_pred HHHHHHHHHHHHHCCCC Q ss_conf 89999999999983999 Q gi|254780898|r 141 RTGLASAVYLYIVAHYP 157 (207) Q Consensus 141 RTG~~~alyli~~~g~~ 157 (207) |+..++ -+| ...|+. T Consensus 63 rS~~aa-~~L-~~~G~~ 77 (90) T cd01524 63 RGYIAA-RIL-TQNGFK 77 (90) T ss_pred HHHHHH-HHH-HHCCCC T ss_conf 599999-999-986998 No 59 >pfam00581 Rhodanese Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases. Probab=91.62 E-value=0.77 Score=25.63 Aligned_cols=78 Identities=19% Similarity=0.256 Sum_probs=37.3 Q ss_pred CHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCC-EEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCC Q ss_conf 98999999997499389973787876544457999996698-49987526888989899999999994279949998289 Q gi|254780898|r 59 NGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGI-QLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKS 137 (207) Q Consensus 59 ~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi-~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~a 137 (207) +++++..++++.++ -||++|.+.+. .+..| .-+++|..............-.......+.+|+++|.. T Consensus 1 s~~el~~~~~~~~~-~iiDvR~~~e~----------~~ghI~gAi~i~~~~~~~~~~~~~~~~~~~~~~~~~~ii~yc~~ 69 (106) T pfam00581 1 TAEELKALLEDEDV-VLIDVRSPEEY----------AKGHIPGAVNIPLPLPSLDKLESELEELGKKLDKDKTIVVYCES 69 (106) T ss_pred CHHHHHHHHCCCCE-EEEECCCHHHH----------HCCCCCCCCCCCCCHHHCCCCHHHHHHHHHCCCCCCEEEEECCC T ss_conf 98999999809997-99979888999----------63813796564200000000279999986347999859998899 Q ss_pred CCCHHHHHHHH Q ss_conf 98889999999 Q gi|254780898|r 138 GADRTGLASAV 148 (207) Q Consensus 138 G~~RTG~~~al 148 (207) |. |++..++. T Consensus 70 g~-~~~~~~~~ 79 (106) T pfam00581 70 GN-RSGQAAAL 79 (106) T ss_pred CC-CHHHHHHH T ss_conf 98-33999999 No 60 >PRK05597 molybdopterin biosynthesis protein MoeB; Validated Probab=90.53 E-value=0.29 Score=28.27 Aligned_cols=29 Identities=38% Similarity=0.575 Sum_probs=20.1 Q ss_pred CCCCCEEEECCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 27994999828998889999999999983999 Q gi|254780898|r 126 TAPKPLLIHCKSGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 126 ~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~ 157 (207) ..+++++|||++|. |+..++.. |. ..|++ T Consensus 312 ~~~k~Ivv~C~sG~-RS~~Aa~~-L~-~~Gf~ 340 (355) T PRK05597 312 SAGKEVLVYCAAGV-RSAQAIAI-LE-RAGYT 340 (355) T ss_pred CCCCCEEEECCCCH-HHHHHHHH-HH-HCCCC T ss_conf 79991999989987-99999999-99-75998 No 61 >PRK07411 hypothetical protein; Validated Probab=90.18 E-value=1.5 Score=23.82 Aligned_cols=84 Identities=18% Similarity=0.166 Sum_probs=41.2 Q ss_pred CHHHHHHHHHHCCCC-EEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCC Q ss_conf 989999999974993-8997378787654445799999669849987526888989899999999994279949998289 Q gi|254780898|r 59 NGTFIEYLKKEYGIK-SILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKS 137 (207) Q Consensus 59 ~~~~~~~l~k~~GIk-tIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~a 137 (207) ++.++.++++.-.=+ .+|+.|.+.|. +.+.-. .-+++|+......+ .+..+.++ ..+++++|+|++ T Consensus 285 s~~el~~ll~s~~~~~~LIDVRe~~E~-------~~~~Ip--gsi~IPL~~l~~~~--~~~~i~el--~~~k~ivV~Crs 351 (390) T PRK07411 285 TVTELKALLDSGADDFVLIDVRNPNEY-------EIARIP--GSVLVPLPDIENGP--GVEKVKEL--LNGHRLIAHCKM 351 (390) T ss_pred CHHHHHHHHHCCCCCEEEEECCCHHHH-------HHCCCC--CCEECCHHHHCCCC--HHHHHHHC--CCCCCEEEECCC T ss_conf 799999998579997089977887777-------541288--87124788841762--06679751--799938999898 Q ss_pred CCCHHHHHHHHHHHHHCCCCH Q ss_conf 988899999999999839998 Q gi|254780898|r 138 GADRTGLASAVYLYIVAHYPK 158 (207) Q Consensus 138 G~~RTG~~~alyli~~~g~~~ 158 (207) |. |+-.++. +|. ..|+.. T Consensus 352 G~-RS~~A~~-~L~-~~G~~~ 369 (390) T PRK07411 352 GG-RSAKALG-ILK-EAGIEG 369 (390) T ss_pred CH-HHHHHHH-HHH-HCCCCC T ss_conf 77-8999999-999-869983 No 62 >KOG1530 consensus Probab=90.07 E-value=0.26 Score=28.56 Aligned_cols=77 Identities=23% Similarity=0.199 Sum_probs=43.3 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCC-EEEEEECCCC----CCCCHHHHHHHHHHHHCCCCCEEEE Q ss_conf 8999999997499389973787876544457999996698-4998752688----8989899999999994279949998 Q gi|254780898|r 60 GTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGI-QLINFPLSAT----RELNDEQIKQLISILKTAPKPLLIH 134 (207) Q Consensus 60 ~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi-~~i~ip~~~~----~~~~~~~i~~~~~il~~~~~PVlVH 134 (207) -++...++. .|=+.-|++|.++|- ++-++ .-|++|.... ...+++-..++-......++.+.++ T Consensus 27 ~~qvk~L~~-~~~~~llDVRepeEf----------k~gh~~~siNiPy~~~~~~~~l~~~eF~kqvg~~kp~~d~eiIf~ 95 (136) T KOG1530 27 VEQVKNLLQ-HPDVVLLDVREPEEF----------KQGHIPASINIPYMSRPGAGALKNPEFLKQVGSSKPPHDKEIIFG 95 (136) T ss_pred HHHHHHHHC-CCCEEEEEECCHHHH----------HCCCCCCEEECCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCEEEE T ss_conf 999987742-898799960687886----------414776258534424462125588889988434589877747999 Q ss_pred CCCCCCHHHHHHHH Q ss_conf 28998889999999 Q gi|254780898|r 135 CKSGADRTGLASAV 148 (207) Q Consensus 135 C~aG~~RTG~~~al 148 (207) |++|. |+..+.-. T Consensus 96 C~SG~-Rs~~A~~~ 108 (136) T KOG1530 96 CASGV-RSLKATKI 108 (136) T ss_pred ECCCC-CHHHHHHH T ss_conf 54686-13678999 No 63 >cd01521 RHOD_PspE2 Member of the Rhodanese Homology Domain superfamily. This CD includes the putative rhodanese-like protein, Psp2, of Yersinia pestis biovar Medievalis and other similar uncharacterized proteins. Probab=89.08 E-value=1.8 Score=23.40 Aligned_cols=81 Identities=15% Similarity=0.138 Sum_probs=43.2 Q ss_pred CCHHHHHHHHHHCCCC--EEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEEC Q ss_conf 9989999999974993--89973787876544457999996698499875268889898999999999942799499982 Q gi|254780898|r 58 PNGTFIEYLKKEYGIK--SILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHC 135 (207) Q Consensus 58 P~~~~~~~l~k~~GIk--tIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC 135 (207) .++.++.+.+++ |=. .||+.|.+.+. ++- |||- +...|..+........+ ..++|++|+| T Consensus 10 ~d~~dv~~~l~~-g~~~~vlIDvR~~~ey----------~~g-----HIpG-AiniP~~~l~~~~~~~l-~~d~~iVvyC 71 (110) T cd01521 10 TDCWDVAIALKN-GKPDFVLVDVRSAEAY----------ARG-----HVPG-AINLPHREICENATAKL-DKEKLFVVYC 71 (110) T ss_pred CCHHHHHHHHHC-CCCCEEEEECCCHHHH----------HCC-----CCCC-CCCCCHHHHHHHHHHHC-CCCCEEEEEE T ss_conf 889999999986-9979899979998999----------758-----9887-00088788877789767-9997189997 Q ss_pred CCCCCHHHHHHHHHHHHHCCCC Q ss_conf 8998889999999999983999 Q gi|254780898|r 136 KSGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 136 ~aG~~RTG~~~alyli~~~g~~ 157 (207) .+|....+..+|..| ...|++ T Consensus 72 ~~~~~~~s~~aA~~L-~~~Gf~ 92 (110) T cd01521 72 DGPGCNGATKAALKL-AELGFP 92 (110) T ss_pred CCCCCCCHHHHHHHH-HHCCCC T ss_conf 899864179999999-977997 No 64 >TIGR01108 oadA oxaloacetate decarboxylase alpha subunit; InterPro: IPR005776 This family describes the bacterial oxaloacetate decarboxylase alpha subunit and its equivalents in archaea . The oxaloacetate decarboxylase Na+ pump is the paradigm of the family of Na+ transport decarboxylases that present in bacteria and archaea. It a multi subunit enzyme consisting of a peripheral alpha-subunit and integral membrane subunits beta and gamma. The energy released by the decarboxylation reaction of oxaloacetate is coupled to Na+ ion pumping across the membrane.; GO: 0008948 oxaloacetate decarboxylase activity, 0006814 sodium ion transport. Probab=88.99 E-value=2 Score=23.10 Aligned_cols=80 Identities=23% Similarity=0.253 Sum_probs=56.7 Q ss_pred HHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHC-CC-CCEEEECCCCCCHHHHHHHHHHHHHC-CC-CHHHHHHHHH Q ss_conf 999996698499875268889898999999999942-79-94999828998889999999999983-99-9899999610 Q gi|254780898|r 91 EKAANDLGIQLINFPLSATRELNDEQIKQLISILKT-AP-KPLLIHCKSGADRTGLASAVYLYIVA-HY-PKEEAHRQLS 166 (207) Q Consensus 91 ~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~-~~-~PVlVHC~aG~~RTG~~~alyli~~~-g~-~~~eA~~el~ 166 (207) .+..-+.|+.-|-|- +-.+-+++...-.+...+++ -+ -||-+||++ =||+..++++-.++ |. -.+.|++=|| T Consensus 156 a~~L~~~G~DSI~IK-DMaGlLTP~~AYELV~alK~~~~n~pvhLH~H~---TtGmA~~AllkA~EAG~d~iDTAisS~S 231 (616) T TIGR01108 156 AKELLEMGVDSICIK-DMAGLLTPKVAYELVSALKKEFGNLPVHLHSHA---TTGMAEMALLKAIEAGADMIDTAISSMS 231 (616) T ss_pred HHHHHHHCCCEEEEE-CCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCC---CHHHHHHHHHHHHHCCCCCCHHCCCCCC T ss_conf 999998188605520-200464415899999999742397468863247---2337999999888707880020055234 Q ss_pred HCCCCCCC Q ss_conf 00264267 Q gi|254780898|r 167 MLYGHFPV 174 (207) Q Consensus 167 ~~ygh~~~ 174 (207) .+|+|=|. T Consensus 232 ~gtSHPpt 239 (616) T TIGR01108 232 GGTSHPPT 239 (616) T ss_pred CCCCCCHH T ss_conf 78888747 No 65 >cd01532 4RHOD_Repeat_1 Member of the Rhodanese Homology Domain superfamily, repeat 1. This CD includes putative rhodanese-related sulfurtransferases which contain 4 copies of the Rhodanese Homology Domain. This CD aligns the 1st repeat which does not contain the putative catalytic Cys residue. Probab=88.95 E-value=0.9 Score=25.22 Aligned_cols=70 Identities=10% Similarity=0.037 Sum_probs=34.9 Q ss_pred CCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHH Q ss_conf 99389973787876544457999996698499875268889898999999999942799499982899888999999999 Q gi|254780898|r 71 GIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKSGADRTGLASAVYL 150 (207) Q Consensus 71 GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~aG~~RTG~~~alyl 150 (207) .=-.||++|.++|. .+- |+|- +...|-...-.+....+.+.+.||.|.|.+|.......+|- . T Consensus 9 ~e~~liDVRe~~ef----------~~g-----HI~g-A~niP~~~le~~~~~~~p~kd~~Ivvyc~~g~~~~s~~Aa~-~ 71 (92) T cd01532 9 EEIALIDVREEDPF----------AQS-----HPLW-AANLPLSRLELDAWVRIPRRDTPIVVYGEGGGEDLAPRAAR-R 71 (92) T ss_pred CCEEEEECCCHHHH----------HCC-----CCCC-CCCCCHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHH-H T ss_conf 98899978987888----------528-----8856-77298889877677638799982999969999629999999-9 Q ss_pred HHHCCCC Q ss_conf 9983999 Q gi|254780898|r 151 YIVAHYP 157 (207) Q Consensus 151 i~~~g~~ 157 (207) +...|++ T Consensus 72 L~~~Gy~ 78 (92) T cd01532 72 LSELGYT 78 (92) T ss_pred HHHCCCC T ss_conf 9986996 No 66 >cd01529 4RHOD_Repeats Member of the Rhodanese Homology Domain superfamily. This CD includes putative rhodanese-related sulfurtransferases which contain 4 copies of the Rhodanese Homology Domain. Only the second and most of the fourth repeats contain the putative catalytic Cys residue. This CD aligns the 1st , 2nd, 3rd, and 4th repeats. Probab=88.82 E-value=1.3 Score=24.31 Aligned_cols=29 Identities=10% Similarity=0.133 Sum_probs=19.4 Q ss_pred CCCCCEEEECCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 27994999828998889999999999983999 Q gi|254780898|r 126 TAPKPLLIHCKSGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 126 ~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~ 157 (207) +.+.||+|+|.+|. |+..++.. |. ..|++ T Consensus 54 ~~~~~ivv~C~~G~-Rs~~Aa~~-L~-~~G~~ 82 (96) T cd01529 54 GRATRYVLTCDGSL-LARFAAQE-LL-ALGGK 82 (96) T ss_pred CCCCEEEEECCCCC-HHHHHHHH-HH-HCCCC T ss_conf 99997999869984-69999999-99-85898 No 67 >PRK05600 thiamine biosynthesis protein ThiF; Validated Probab=88.28 E-value=2.2 Score=22.83 Aligned_cols=85 Identities=18% Similarity=0.199 Sum_probs=41.9 Q ss_pred CCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHC-CCEEEEEECCCCCCCCHHHHHHHHHHHHCC-CCCEEEE Q ss_conf 99989999999974993899737878765444579999966-984998752688898989999999999427-9949998 Q gi|254780898|r 57 QPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDL-GIQLINFPLSATRELNDEQIKQLISILKTA-PKPLLIH 134 (207) Q Consensus 57 qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~-gi~~i~ip~~~~~~~~~~~i~~~~~il~~~-~~PVlVH 134 (207) .+.+..+..+.+. +=-.+|++|.+.|. +.+.-. +-..+++|+...... . ..++..+.. ..+|+|+ T Consensus 272 ~~~~~~~~~~~~~-~~~~LiDVRe~~E~-------~~~~~p~~~~~i~iPl~~l~~~--~---~~~~~~~~~~~~~vvv~ 338 (370) T PRK05600 272 RTDTTSLIDATLN-GSATLLDVREPHEV-------LLKDLPEGGASLKLPLSAITDD--A---DILHALSPIDGDNVVVY 338 (370) T ss_pred CCCHHHHHHHHHC-CCEEEEECCCHHHH-------HHCCCCCCCCCEECCHHHHCCC--C---HHHHHCCCCCCCEEEEE T ss_conf 6877889987506-98089989898998-------6334666677267765673485--0---23342034789859999 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 28998889999999999983999 Q gi|254780898|r 135 CKSGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 135 C~aG~~RTG~~~alyli~~~g~~ 157 (207) |++|. |+-.++...+- .|.+ T Consensus 339 CrsG~-RS~~Av~~L~~--~G~~ 358 (370) T PRK05600 339 CASGI-RSADFIEKYSH--LGHE 358 (370) T ss_pred CCCCH-HHHHHHHHHHH--HCCC T ss_conf 79987-99999999997--2998 No 68 >pfam04343 DUF488 Protein of unknown function, DUF488. This family includes several proteins of uncharacterized function. Probab=87.34 E-value=1.7 Score=23.41 Aligned_cols=47 Identities=17% Similarity=0.090 Sum_probs=31.7 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCC--CCCCHHH--HHHHHHCCCEEEEEEC Q ss_conf 899999999749938997378787--6544457--9999966984998752 Q gi|254780898|r 60 GTFIEYLKKEYGIKSILNLRGKLP--ESWHKEE--EKAANDLGIQLINFPL 106 (207) Q Consensus 60 ~~~~~~l~k~~GIktIInLr~~~~--~~~~~~e--~~~~~~~gi~~i~ip~ 106 (207) .++|-.+++..||+.+|+.|.-.- .+.+..+ ....++.||.|++++- T Consensus 2 ie~f~~~L~~~gi~~lVDVR~~P~Sr~~gf~K~~L~~~L~~~gI~Y~~~~~ 52 (123) T pfam04343 2 LKRFYDLLKADGIRVLVDVRLWPRSRNPGFSKEALALDLPEAGIAYVWLKE 52 (123) T ss_pred HHHHHHHHHHCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCEEEHHH T ss_conf 899999999879869998378877578896899999999985981776164 No 69 >cd00158 RHOD Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein. The cysteine containing enzymatically active version of the domain is also found in the Cdc25 class of protein phosphatases and a variety of proteins such as sulfide dehydrogenases and certain stress proteins such as senesence specific protein 1 in plants, PspE and GlpE in bacteria and cyanide and arsenate resistance proteins. Inactive versions (no active site cysteine) are also seen in dual specificity phosphatases, ubiquitin hydrolases from yeast and in sulfuryltransferases, where they are believed to play a regulatory role in multidomain proteins. Probab=86.60 E-value=1.3 Score=24.28 Aligned_cols=72 Identities=21% Similarity=0.182 Sum_probs=33.6 Q ss_pred HHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCCHHHH Q ss_conf 99997499389973787876544457999996698499875268889898999999999942799499982899888999 Q gi|254780898|r 65 YLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKSGADRTGL 144 (207) Q Consensus 65 ~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~aG~~RTG~ 144 (207) .++++.+ -.||++|.+.+. +.. |+|- +...|..+............+.||+++|.+|. |+.. T Consensus 4 ~~l~~~~-~~iiDvR~~~e~----------~~g-----hi~g-a~~ip~~~~~~~~~~~~~~~~~~iv~~C~~g~-rs~~ 65 (89) T cd00158 4 ELLDDED-AVLLDVREPEEY----------AAG-----HIPG-AINIPLSELEERAALLELDKDKPIVVYCRSGN-RSAR 65 (89) T ss_pred HHHHCCC-EEEEECCCHHHH----------HCC-----CCCC-CCCCCHHHHHHHHHHHHCCCCCCEEEECCCCC-CCHH T ss_conf 9963969-799989588999----------609-----2788-65675567777665441268987787759898-1278 Q ss_pred HHHHHHHHHCCC Q ss_conf 999999998399 Q gi|254780898|r 145 ASAVYLYIVAHY 156 (207) Q Consensus 145 ~~alyli~~~g~ 156 (207) ++ .++. ..|. T Consensus 66 aa-~~L~-~~G~ 75 (89) T cd00158 66 AA-KLLR-KAGG 75 (89) T ss_pred HH-HHHH-HCCC T ss_conf 99-9999-8089 No 70 >COG2099 CobK Precorrin-6x reductase [Coenzyme metabolism] Probab=86.09 E-value=2.9 Score=22.04 Aligned_cols=80 Identities=13% Similarity=0.170 Sum_probs=55.6 Q ss_pred EEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCC--CCHHHHHHHHHCCCEEEEEECCCCCC--C---CHHHHHHHHHHH Q ss_conf 7713899989999999974993899737878765--44457999996698499875268889--8---989999999999 Q gi|254780898|r 52 IYRSAQPNGTFIEYLKKEYGIKSILNLRGKLPES--WHKEEEKAANDLGIQLINFPLSATRE--L---NDEQIKQLISIL 124 (207) Q Consensus 52 lyRS~qP~~~~~~~l~k~~GIktIInLr~~~~~~--~~~~e~~~~~~~gi~~i~ip~~~~~~--~---~~~~i~~~~~il 124 (207) .-.++....+.+..++++++|+-+|+=. +|.. --.+-.+.|++.||.|+.+.-+.+.. + ....++...+++ T Consensus 47 ~~~~G~l~~e~l~~~l~e~~i~llIDAT--HPyAa~iS~Na~~aake~gipy~r~eRP~~~~~gd~~~~V~d~~ea~~~~ 124 (257) T COG2099 47 VRVGGFLGAEGLAAFLREEGIDLLIDAT--HPYAARISQNAARAAKETGIPYLRLERPPWAPNGDNWIEVADIEEAAEAA 124 (257) T ss_pred EEECCCCCHHHHHHHHHHCCCCEEEECC--CHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCEEEECCHHHHHHHH T ss_conf 6652768878999999974988899788--75799998999999998599679987775545799669845899999987 Q ss_pred HCCCCCEEE Q ss_conf 427994999 Q gi|254780898|r 125 KTAPKPLLI 133 (207) Q Consensus 125 ~~~~~PVlV 133 (207) .+..+.|+. T Consensus 125 ~~~~~rVfl 133 (257) T COG2099 125 KQLGRRVFL 133 (257) T ss_pred HCCCCCEEE T ss_conf 504773799 No 71 >pfam02126 PTE Phosphotriesterase family. Probab=84.98 E-value=3.3 Score=21.70 Aligned_cols=95 Identities=18% Similarity=0.204 Sum_probs=48.3 Q ss_pred HHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEE---------ECC------------------------- Q ss_conf 9999999749938997378787654445799999669849987---------526------------------------- Q gi|254780898|r 62 FIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINF---------PLS------------------------- 107 (207) Q Consensus 62 ~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~i---------p~~------------------------- 107 (207) .+..+ |+.|.+|||+.....-......-.+..++.|++.+-- |-. T Consensus 43 el~~f-k~~Gg~tiVd~T~~g~GRd~~~l~~is~~tGv~IV~~TG~y~~~~~p~~~~~~s~e~la~~~i~ei~~Gi~~T~ 121 (308) T pfam02126 43 ELLYL-KARGVRTIVENTTTGLGRDVHTLAWVARATGVHIVAATGLYVDADHPLATRLRSVEELTDVFVREIEHGIDGTG 121 (308) T ss_pred HHHHH-HHHCCCEEEECCCCCCCCCHHHHHHHHHHCCCEEEECCCCCCCCCCCHHHHHCCHHHHHHHHHHHHHHCCCCCC T ss_conf 99999-98289879956897768699999999996398399747666787698766626999999999999984366887 Q ss_pred ----------CCCCCCH--HHHHHHH-HHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHH Q ss_conf ----------8889898--9999999-99942799499982899888999999999998399989 Q gi|254780898|r 108 ----------ATRELND--EQIKQLI-SILKTAPKPLLIHCKSGADRTGLASAVYLYIVAHYPKE 159 (207) Q Consensus 108 ----------~~~~~~~--~~i~~~~-~il~~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~ 159 (207) +....++ +.+.+.. ..-.+..-||.+||..|. |.|.-..=++ ..+|.+.+ T Consensus 122 ikaG~Ik~~~~~~~it~~E~k~lrAaA~A~~~TG~pI~~H~~~~~-~~~~e~l~il-~e~Gvd~~ 184 (308) T pfam02126 122 IKAGIIGEIGTSWKLTPSEEKVLEAAAHAHAQTGCPVSTHTGASP-RAGLQQIRIL-QEEGVDLS 184 (308) T ss_pred CCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEECCCCCC-CCHHHHHHHH-HHCCCCHH T ss_conf 452078883058999878999999999999976993797079875-5519999999-98598944 No 72 >pfam04179 Init_tRNA_PT Initiator tRNA phosphoribosyl transferase. This enzyme (EC:2.4.2.-) modifies exclusively the initiator tRNA in position 64 using 5'-phosphoribosyl-1'-pyrophosphate as the modification donor. As the initiator tRNA participates both in the initiation and elongation of translation, the 2'-O-ribosyl phosphate modification discriminates the initiator tRNAs from the elongator tRNAs. Probab=83.72 E-value=3.7 Score=21.36 Aligned_cols=123 Identities=15% Similarity=0.201 Sum_probs=60.3 Q ss_pred CCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHH---HHHHHHHH----C-CCCCEEEECCCCCCHH Q ss_conf 99389973787876544457999996698499875268889898999---99999994----2-7994999828998889 Q gi|254780898|r 71 GIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQI---KQLISILK----T-APKPLLIHCKSGADRT 142 (207) Q Consensus 71 GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i---~~~~~il~----~-~~~PVlVHC~aG~~RT 142 (207) -+-.||++-+...... .....-.++++|+.....-....- ......+. . ..++++|+|..|+|=. T Consensus 312 ~~d~vi~~~~~~~~~~-------~~~~~~~~l~~~~~~~K~~s~~Lr~~Lp~~~~fi~~~l~~~~~~~ilV~C~tGkDlS 384 (445) T pfam04179 312 KFDCVINCSPSPTESE-------TWEKQKKRLHLGIGSSKKGSRDLRKALPSICDFIKSQLSSDPSKKILVACDSGKDLS 384 (445) T ss_pred CCCEEEEECCCCCCCC-------CCCCCCCEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCHHH T ss_conf 6467999589766566-------667776668468765763489999988999999998616788984899908983379 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHCCCC Q ss_conf 9999999999839998999996100026426742459999999999852003672233200257 Q gi|254780898|r 143 GLASAVYLYIVAHYPKEEAHRQLSMLYGHFPVLKTITMDITFEKITQLYPNNVSKGDTEQPMNA 206 (207) Q Consensus 143 G~~~alyli~~~g~~~~eA~~el~~~ygh~~~~~~~~ld~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (207) -.++-+.+-++.. .+.+... ...--...|+.+-.+ .-.|....|+.++.-.+=|.+|+ T Consensus 385 vgvaLailc~~fd--~~g~~~~---~~~~~~i~K~~Ir~r-L~~I~~~~~~~nPSR~tLqsVN~ 442 (445) T pfam04179 385 VGVALAILCLYFD--DDGKLDP---SFEERSINKDFIRQR-LAKIISSLPNVNPSRATLQSVNS 442 (445) T ss_pred HHHHHHHHHHHCC--CCCCCCC---CCCCCCCCHHHHHHH-HHHHHHHCCCCCCCHHHHHHHHH T ss_conf 9999999998608--5468568---854456699999999-99999728888898889999998 No 73 >COG0607 PspE Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism] Probab=83.47 E-value=2.1 Score=22.95 Aligned_cols=75 Identities=25% Similarity=0.309 Sum_probs=38.4 Q ss_pred HHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCE-EEEEECCCCCCCCHHHHHHHHHHH-HCCCCCEEEECCCCCCHH Q ss_conf 999974993899737878765444579999966984-998752688898989999999999-427994999828998889 Q gi|254780898|r 65 YLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQ-LINFPLSATRELNDEQIKQLISIL-KTAPKPLLIHCKSGADRT 142 (207) Q Consensus 65 ~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~-~i~ip~~~~~~~~~~~i~~~~~il-~~~~~PVlVHC~aG~~RT 142 (207) .++....=-.||++|.+.|. +. .+-.. .+++|+.. +....... ...++|++|+|++|. |+ T Consensus 13 ~~~~~~~~~~liDvR~~~e~-------~~--~~i~~~~~~ip~~~--------~~~~~~~~~~~~~~~ivv~C~~G~-rS 74 (110) T COG0607 13 ALLLAGEDAVLLDVREPEEY-------ER--GHIPGAAINIPLSE--------LKAAENLLELPDDDPIVVYCASGV-RS 74 (110) T ss_pred HHHHCCCCCEEEECCCHHHH-------HC--CCCCCCCCCCCHHH--------HHHHCCCCCCCCCCEEEEECCCCC-CH T ss_conf 98743699879968898998-------60--56877302477789--------887422433467986999999975-58 Q ss_pred HHHHHHHHHHHCCCCHH Q ss_conf 99999999998399989 Q gi|254780898|r 143 GLASAVYLYIVAHYPKE 159 (207) Q Consensus 143 G~~~alyli~~~g~~~~ 159 (207) +.++.. +.. .|.+.- T Consensus 75 ~~aa~~-L~~-~G~~~~ 89 (110) T COG0607 75 AAAAAA-LKL-AGFTNV 89 (110) T ss_pred HHHHHH-HHH-CCCCCC T ss_conf 999999-998-699751 No 74 >TIGR02493 PFLA pyruvate formate-lyase 1-activating enzyme; InterPro: IPR012838 Members of this family are iron-sulphur proteins with a radical-SAM domain. A single glycine residue in 2.3.1.54 from EC, formate C-acetyltransferase (formate-pyruvate lyase), is oxidized to the corresponding radical by transfer of a proton from its CH2 to AdoMet with concomitant cleavage of the latter. The reaction requires Fe2+. The first stage is reduction of the AdoMet to give methionine and the 5'-deoxyadenosin-5-yl radical, which then abstracts a hydrogen radical from the glycine residue .; GO: 0043365 [formate-C-acetyltransferase]-activating enzyme activity. Probab=82.91 E-value=3.5 Score=21.50 Aligned_cols=91 Identities=16% Similarity=0.197 Sum_probs=49.1 Q ss_pred CCCCCCCCCCE-EECCCCC--H---HHHHHHHHH-CCCCEEEECCC----CCCCCCCHHHHHHHHHCCCEEEEEECCC-- Q ss_conf 78725028947-7138999--8---999999997-49938997378----7876544457999996698499875268-- Q gi|254780898|r 42 QNFHAVVPHEI-YRSAQPN--G---TFIEYLKKE-YGIKSILNLRG----KLPESWHKEEEKAANDLGIQLINFPLSA-- 108 (207) Q Consensus 42 ~Nf~~V~~g~l-yRS~qP~--~---~~~~~l~k~-~GIktIInLr~----~~~~~~~~~e~~~~~~~gi~~i~ip~~~-- 108 (207) .+|..-.=|-| ..+|-|. + ..+..+.|+ .||-|=|+=.| -.. .+...-.+..+...+-+.||--.+ T Consensus 60 k~f~~~sGGGvT~SGGEPllQ~~F~~~~f~~cK~~~GiHTclDT~GGCftf~~-~~~~~~~~lLe~TDLvLLDiK~~~~~ 138 (243) T TIGR02493 60 KDFFKASGGGVTVSGGEPLLQPEFVLELFKACKKELGIHTCLDTSGGCFTFLG-GFTEAADELLEVTDLVLLDIKHIDPE 138 (243) T ss_pred HHHHHCCCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHC-CHHHHHHHHHHCCCEEEEECCCCCHH T ss_conf 98872079958986895020169999999999985698388744883433412-12489997510058788623436812 Q ss_pred -----CCCCCHHHHHHHHHHHHCCCCCEEE Q ss_conf -----8898989999999999427994999 Q gi|254780898|r 109 -----TRELNDEQIKQLISILKTAPKPLLI 133 (207) Q Consensus 109 -----~~~~~~~~i~~~~~il~~~~~PVlV 133 (207) .+.-+-+....|.+.+++.++|+.| T Consensus 139 ~y~~LTg~~~~~ptl~Fa~~L~~~~kP~Wi 168 (243) T TIGR02493 139 KYKKLTGVKSLEPTLEFAKYLAKRNKPIWI 168 (243) T ss_pred HCCCCCCCCCCHHHHHHHHHHHHCCCCEEE T ss_conf 400014567752458999999965898899 No 75 >TIGR01975 isoAsp_dipep beta-aspartyl peptidase; InterPro: IPR010229 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of proteins include metallopeptidases belonging to the MEROPS peptidase family M38 (clan MJ, beta-aspartyl dipeptidase family). This entry includes the beta-aspartyl dipeptidase from Escherichia coli, (3.4.19.5 from EC, IadA), which degrades isoaspartyl dipeptides and may unblock degradation of proteins that cannot be repaired. This entry also describes closely related proteins from other species (e.g. Clostridium perfringens, Thermoanaerobacter tengcongensis) that may have an equivalent in function. This family shows homology to dihydroorotases. The L-isoaspartyl derivative of Asp arises non-enzymatically over time as a form of protein damage. In this isomerisation, the connectivity of the polypeptide changes to pass through the beta-carboxyl of the side chain. Much but not all of this damage can be repaired by protein-L-isoaspartate (D-aspartate) O-methyltransferase.. Probab=82.73 E-value=3 Score=21.92 Aligned_cols=21 Identities=33% Similarity=0.321 Sum_probs=16.3 Q ss_pred ECCCCCHHHHHHHHHHCCCCE Q ss_conf 138999899999999749938 Q gi|254780898|r 54 RSAQPNGTFIEYLKKEYGIKS 74 (207) Q Consensus 54 RS~qP~~~~~~~l~k~~GIkt 74 (207) ||+||+-+++.++.++-.+.- T Consensus 164 RSaqPt~~~L~~~aAeARVGG 184 (391) T TIGR01975 164 RSAQPTVEELTNLAAEARVGG 184 (391) T ss_pred CCCCCCHHHHHHHHHHHCCCC T ss_conf 788744899999977511241 No 76 >TIGR00010 TIGR00010 hydrolase, TatD family; InterPro: IPR015991 Escherichia coli TatD (synonym MttC) has been shown experimentally to be a DNase . On the basis of operon structure, it was originally thought to be involved in the Sec-independent export of protein precursors bearing twin arginine signal peptides by the Tat system . This alternative membrane targeting and translocation system (Mtt), also called twin arginine translocase (Tat) (MttA1A2BC or TatABCD), has been shown to transport fully folded proteins to, or across, the membrane , . However, it was later determined that TatD is not required for this translocation process , . Furthermore, it was noted that no exact correlation exists between the ability of an organism to synthesise a TatD-like protein and whether that organism has a Tat protein transport pathway . Also, with the exception of Escherichia coli , the genetic linkage between TatD and other Tat genes (operon structure) is not maintained. Members of this group may be distantly related to a large 3D fold-based domain superfamily of metalloenzymes . The description of this fold superfamily was based on an analysis of conservation patterns in three dimensions, and the discovery that the same active-site architecture occurs in a large set of enzymes involved primarily in nucleotide metabolism. The group is thought to include urease, dihydroorotase, allantoinase, hydantoinase, AMP-, adenine- and cytosine- deaminases, imidazolonepropionase, aryldialkylphosphatase, chlorohydrolase, formylmethanofuran dehydrogenase, and other enzymes .. Probab=82.32 E-value=0.9 Score=25.21 Aligned_cols=44 Identities=14% Similarity=0.324 Sum_probs=23.5 Q ss_pred CCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCCH Q ss_conf 98499875268889898999999999942799499982899888 Q gi|254780898|r 98 GIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKSGADR 141 (207) Q Consensus 98 gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~aG~~R 141 (207) |+.|.+-+-.....-..+....++++.++.++||.|||+-=..+ T Consensus 101 GLDY~~~~~~~e~~~Q~e~F~~q~~lA~~~~~P~iiH~RdA~~d 144 (269) T TIGR00010 101 GLDYYKADEKKERRRQKEVFRAQLQLAEELNLPVIIHARDAEED 144 (269) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHH T ss_conf 44043068752178999999999999997199689978845799 No 77 >TIGR03217 4OH_2_O_val_ald 4-hydroxy-2-oxovalerate aldolase. Members of this protein family are 4-hydroxy-2-oxovalerate aldolase, also called 4-hydroxy-2-ketovalerate aldolase and 2-oxo-4-hydroxypentanoate aldolase. This enzyme, part of the pathway for the meta-cleavage of catechol, produces pyruvate and acetaldehyde. Acetaldehyde is then converted by acetaldehyde dehydrogenase (acylating) (DmpF; EC 1.2.1.10) to acetyl-CoA. The two enzymes are tightly associated. Probab=81.55 E-value=4.5 Score=20.84 Aligned_cols=89 Identities=13% Similarity=0.137 Sum_probs=58.8 Q ss_pred HHHHHHHHCCCCEEEECCCCCCC--CCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCC---CCCEEEECC Q ss_conf 99999997499389973787876--5444579999966984998752688898989999999999427---994999828 Q gi|254780898|r 62 FIEYLKKEYGIKSILNLRGKLPE--SWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTA---PKPLLIHCK 136 (207) Q Consensus 62 ~~~~l~k~~GIktIInLr~~~~~--~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~---~~PVlVHC~ 136 (207) +...+.+++|++++.++--.... ..........+..|...+.+. ++.+...++.+..++..+.+. +-|+=+||+ T Consensus 118 ~~i~~ak~~G~~v~~~~~~s~~~~~e~l~~~a~~~~~~Gad~I~i~-DT~G~~~P~~v~~~v~~l~~~~~~~i~ig~H~H 196 (333) T TIGR03217 118 QHIGMARELGMDTVGFLMMSHMTPPEKLAEQAKLMESYGADCVYIV-DSAGAMLPDDVRDRVRALKAVLKPETQVGFHAH 196 (333) T ss_pred HHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEC-CCCCCCCHHHHHHHHHHHHHHCCCCCEEEEEEE T ss_conf 9999999769809999750568999999999999985699999975-964468999999999999986299754889861 Q ss_pred CCCCHHHHHHHHHHHHHC Q ss_conf 998889999999999983 Q gi|254780898|r 137 SGADRTGLASAVYLYIVA 154 (207) Q Consensus 137 aG~~RTG~~~alyli~~~ 154 (207) .- .|+.+|-.+..++ T Consensus 197 Nn---lGlAvANslaAi~ 211 (333) T TIGR03217 197 HN---LSLAVANSIAAIE 211 (333) T ss_pred CC---CCHHHHHHHHHHH T ss_conf 78---7729999999998 No 78 >pfam05925 IpgD Enterobacterial virulence protein IpgD. This family consists of several enterobacterial IpgD like virulence factor proteins. In the Gram-negative pathogen Shigella flexneri, the virulence factor IpgD is translocated directly into eukaryotic cells and acts as a potent inositol 4-phosphatase that specifically dephosphorylates phosphatidylinositol 4,5-bisphosphate [PtdIns(4,5)P(2)] into phosphatidylinositol 5-monophosphate [PtdIns(5)P] that then accumulates. Transformation of PtdIns(4,5)P(2) into PtdIns(5)P by IpgD is responsible for dramatic morphological changes of the host cell, leading to a decrease in membrane tether force associated with membrane blebbing and actin filament remodelling. Probab=79.08 E-value=0.59 Score=26.36 Aligned_cols=17 Identities=53% Similarity=0.841 Sum_probs=13.6 Q ss_pred EEECCCCCCHHHHHHHH Q ss_conf 99828998889999999 Q gi|254780898|r 132 LIHCKSGADRTGLASAV 148 (207) Q Consensus 132 lVHC~aG~~RTG~~~al 148 (207) .--|++|+||||++-+- T Consensus 467 ~WNCKSGKDRTGM~DaE 483 (569) T pfam05925 467 AWNCKSGKDRTGMMDAE 483 (569) T ss_pred CCCCCCCCCCCCCHHHH T ss_conf 22467887754431799 No 79 >cd00530 PTE Phosphotriesterase (PTE) catalyzes the hydrolysis of organophosphate nerve agents, including the chemical warfare agents VX, soman, and sarin as well as the insecticide paraoxon. PTE exists as a homodimer with one active site per monomer. The active site is located next to a binuclear metal center, at the C-terminal end of a TIM alpha- beta barrel motif. The native enzyme contains two zinc ions at the active site however these can be replaced with other metals such as cobalt, cadmium, nickel or manganese and the enzyme remains active. Probab=78.42 E-value=5.6 Score=20.21 Aligned_cols=96 Identities=13% Similarity=0.144 Sum_probs=47.9 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEE---------EC------------------------- Q ss_conf 99999999749938997378787654445799999669849987---------52------------------------- Q gi|254780898|r 61 TFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINF---------PL------------------------- 106 (207) Q Consensus 61 ~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~i---------p~------------------------- 106 (207) +.+..+ |+.|.+|||+.....-......-.+..++.|++.+-- |- T Consensus 36 ~el~~~-k~~Gg~tiVD~T~~g~GRd~~~L~~is~~tgv~IV~~TG~y~~~~~p~~~~~~s~e~la~~~v~ei~~Gi~gT 114 (293) T cd00530 36 EELKRF-RAHGGRTIVDATPPGIGRDVEKLAEVARATGVNIVAATGFYKDAFYPEWVRLRSVEELTDMLIREIEEGIEGT 114 (293) T ss_pred HHHHHH-HHHCCCEEEECCCCCCCCCHHHHHHHHHHCCCEEEECCCCCCCCCCCHHHHHCCHHHHHHHHHHHHHHCCCCC T ss_conf 999999-9848987995478666769999999999639828967878877668978870689999999999998404688 Q ss_pred ----------CCCCCCCHH--HHHHH-HHHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHH Q ss_conf ----------688898989--99999-999942799499982899888999999999998399989 Q gi|254780898|r 107 ----------SATRELNDE--QIKQL-ISILKTAPKPLLIHCKSGADRTGLASAVYLYIVAHYPKE 159 (207) Q Consensus 107 ----------~~~~~~~~~--~i~~~-~~il~~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~ 159 (207) .+....++. .+.+. ...-.+..-||.+||..|. +.|.-..=++ ...|.+.+ T Consensus 115 ~ikaG~Ik~~~~~~~it~~E~k~lrAaA~A~~~TG~pI~~Ht~~g~-~~~~e~~~il-~~~G~d~~ 178 (293) T cd00530 115 GIKAGIIKEAGGSPAITPLEEKVLRAAARAQKETGVPISTHTQAGL-TMGLEQLRIL-EEEGVDPS 178 (293) T ss_pred CCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCEEEECCCCCC-CCCHHHHHHH-HHCCCCHH T ss_conf 8342068761468999889999999999999987982898279886-5309999999-98598922 No 80 >PRK10076 pyruvate formate lyase II activase; Provisional Probab=78.20 E-value=5.7 Score=20.17 Aligned_cols=91 Identities=9% Similarity=0.096 Sum_probs=47.4 Q ss_pred CCCCCCCCCCE-EECCCCC--HH---HHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCC----- Q ss_conf 78725028947-7138999--89---9999999749938997378787654445799999669849987526888----- Q gi|254780898|r 42 QNFHAVVPHEI-YRSAQPN--GT---FIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATR----- 110 (207) Q Consensus 42 ~Nf~~V~~g~l-yRS~qP~--~~---~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~----- 110 (207) ..|..-.-|-| ..+|.|. ++ .+.+..++.||.|-|.-.+..+ +..-.+......+-++|+-..+.. T Consensus 32 ~~fy~~SgGGVT~SGGEpl~Q~~F~~ellk~~k~~gihtaieTsG~~~---~~~~~~~~~~~Dl~L~DiK~~d~~~h~~~ 108 (213) T PRK10076 32 DIFFRTSGGGVTLSGGEVLMQAEFATRFLQRLRLWGVSCAIETAGDAP---ASKLLPLAKLCDEVLFDLKIMDATQARDV 108 (213) T ss_pred HHHHHCCCCEEEEECCHHHCCHHHHHHHHHHHHHCCCCEEEECCCCCC---HHHHHHHHHHCCEEEEEECCCCHHHHHHH T ss_conf 999824798078607526359999999999998669988997688888---99999999845989986177984899999 Q ss_pred -CCCHHHHHHHHHHHHCCCCCEEEEC Q ss_conf -9898999999999942799499982 Q gi|254780898|r 111 -ELNDEQIKQLISILKTAPKPLLIHC 135 (207) Q Consensus 111 -~~~~~~i~~~~~il~~~~~PVlVHC 135 (207) --+.+.+...++.+.+.+.++.|.+ T Consensus 109 TG~~n~~il~Nl~~l~~~~~~v~iR~ 134 (213) T PRK10076 109 VKMNLPRVLENLRLLVSEGVNVIPRL 134 (213) T ss_pred HCCCHHHHHHHHHHHHHCCCCEEEEE T ss_conf 79993999999999996799689988 No 81 >PRK08195 4-hydroxy-2-ketovalerate aldolase; Validated Probab=77.84 E-value=5.9 Score=20.10 Aligned_cols=88 Identities=19% Similarity=0.166 Sum_probs=58.0 Q ss_pred HHHHHHHHCCCCEEEECCCCCCC--CCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCC---CCCEEEECC Q ss_conf 99999997499389973787876--5444579999966984998752688898989999999999427---994999828 Q gi|254780898|r 62 FIEYLKKEYGIKSILNLRGKLPE--SWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTA---PKPLLIHCK 136 (207) Q Consensus 62 ~~~~l~k~~GIktIInLr~~~~~--~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~---~~PVlVHC~ 136 (207) +.-.+.+++|.+++.++-..... .........+++.|...+.+. ++.+...++++......+.+. +-|+=+||+ T Consensus 119 ~~i~~ar~~G~~v~~~lm~s~~~~~e~l~~~a~~~~~~Gad~I~l~-DT~G~~~P~~v~~~v~~l~~~l~~~i~igfH~H 197 (337) T PRK08195 119 QHIGLARELGMDTVGFLMMSHMASPEKLAEQAKLMESYGAQCVYVV-DSAGALLPDDVRARVRALRAALKPDTQVGFHGH 197 (337) T ss_pred HHHHHHHHCCCEEEEEEHHCCCCCHHHHHHHHHHHHHCCCCEEEEC-CCCCCCCHHHHHHHHHHHHHHCCCCCEEEEEEC T ss_conf 9999999779939997511024899999999999986599999978-987667999999999999986499854999853 Q ss_pred CCCCHHHHHHHHHHHHH Q ss_conf 99888999999999998 Q gi|254780898|r 137 SGADRTGLASAVYLYIV 153 (207) Q Consensus 137 aG~~RTG~~~alyli~~ 153 (207) .-. |+.+|-.+..+ T Consensus 198 Nnl---GlAvANslaAv 211 (337) T PRK08195 198 HNL---GLGVANSLAAV 211 (337) T ss_pred CCC---CHHHHHHHHHH T ss_conf 886---75999999999 No 82 >PRK12315 1-deoxy-D-xylulose-5-phosphate synthase; Provisional Probab=76.52 E-value=6.4 Score=19.87 Aligned_cols=44 Identities=14% Similarity=0.256 Sum_probs=31.9 Q ss_pred HHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCCH Q ss_conf 996698499875268889898999999999942799499982899888 Q gi|254780898|r 94 ANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKSGADR 141 (207) Q Consensus 94 ~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~aG~~R 141 (207) ++..|..++.+. +. -+.+.+.+.++..++.++|++|||+.=+|. T Consensus 203 fe~~G~~~~~v~--DG--hd~~~i~~al~~ak~~~~P~~I~v~T~kG~ 246 (581) T PRK12315 203 FKAMGLDYRYVE--DG--NDLESLIELFKEVKDIDHPIVLHIHTLKGK 246 (581) T ss_pred HHHCCCEEEECC--CC--CCHHHHHHHHHHHHHCCCCEEEEEEECCCC T ss_conf 998498688257--99--999999999999863699979999967863 No 83 >KOG0566 consensus Probab=75.29 E-value=6.9 Score=19.67 Aligned_cols=34 Identities=29% Similarity=0.399 Sum_probs=24.4 Q ss_pred CEEECCCCCH----HHHHHHHHHCCCCEEEECCCCCCC Q ss_conf 4771389998----999999997499389973787876 Q gi|254780898|r 51 EIYRSAQPNG----TFIEYLKKEYGIKSILNLRGKLPE 84 (207) Q Consensus 51 ~lyRS~qP~~----~~~~~l~k~~GIktIInLr~~~~~ 84 (207) ++.||.+-+. ..|..+.++||--.||||-+..+. T Consensus 286 ~itR~~easq~aFdrHf~~L~~~YG~v~IVNLLgtk~g 323 (1080) T KOG0566 286 KITRSFEASQPAFDRHFSKLREKYGPVYIVNLLGTKEG 323 (1080) T ss_pred EEECCCHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCH T ss_conf 88613023057899999999986196899985668615 No 84 >COG2089 SpsE Sialic acid synthase [Cell envelope biogenesis, outer membrane] Probab=71.95 E-value=8.3 Score=19.16 Aligned_cols=31 Identities=32% Similarity=0.549 Sum_probs=24.7 Q ss_pred CCCCCHHHHHHHHHHHHCCCCC--EEEECCCCC Q ss_conf 8898989999999999427994--999828998 Q gi|254780898|r 109 TRELNDEQIKQLISILKTAPKP--LLIHCKSGA 139 (207) Q Consensus 109 ~~~~~~~~i~~~~~il~~~~~P--VlVHC~aG~ 139 (207) .+.-+.+.++..+.++.+.+.| +|.||.++- T Consensus 154 TGma~~~ei~~av~~~r~~g~~~i~LLhC~s~Y 186 (347) T COG2089 154 TGMATIEEIEEAVAILRENGNPDIALLHCTSAY 186 (347) T ss_pred CCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCC T ss_conf 466407779999999996699876999835789 No 85 >COG0787 Alr Alanine racemase [Cell envelope biogenesis, outer membrane] Probab=71.88 E-value=8.3 Score=19.15 Aligned_cols=109 Identities=21% Similarity=0.258 Sum_probs=61.6 Q ss_pred HHHCCCC--EEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHC-CCCCEEEECCCCCCHHH Q ss_conf 9974993--89973787876544457999996698499875268889898999999999942-79949998289988899 Q gi|254780898|r 67 KKEYGIK--SILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKT-APKPLLIHCKSGADRTG 143 (207) Q Consensus 67 ~k~~GIk--tIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~-~~~PVlVHC~aG~~RTG 143 (207) +++.||+ .|+-|.+..+. .+.+.+.+.++.. ...+.++++.+.....+ ..-+|.++=.+|.+|-| T Consensus 69 LR~~gi~~~~IlvL~g~~~~----~~~~~~~~~~l~~--------~v~s~~ql~~l~~~~~~~~~l~vhLkiDTGM~RlG 136 (360) T COG0787 69 LREAGITGAPILVLEGFFPA----EELELAAAYNLTP--------VVNSLEQLEALKNAALKNKPLKVHLKIDTGMNRLG 136 (360) T ss_pred HHHCCCCCCCEEEECCCCCH----HHHHHHHHCCCEE--------EECCHHHHHHHHHHHHCCCCEEEEEEECCCCCCCC T ss_conf 99718889978997375883----5678999869738--------88999999999975510696279999778987579 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCH---HHHHHHHHHHH Q ss_conf 9999999998399989999961000264267424---59999999999 Q gi|254780898|r 144 LASAVYLYIVAHYPKEEAHRQLSMLYGHFPVLKT---ITMDITFEKIT 188 (207) Q Consensus 144 ~~~alyli~~~g~~~~eA~~el~~~ygh~~~~~~---~~ld~~~~~~~ 188 (207) +-..-+...........-.. +..-+-||+.+.. ..-...++.|. T Consensus 137 ~~~~~~~~~~~~~~~~~~~~-~~gi~SHfa~ADe~~~~~~~~Q~~~F~ 183 (360) T COG0787 137 LRPEEAVALAIDLIALKNLD-LEGIFSHFACADEPEDPYTLKQLERFN 183 (360) T ss_pred CCHHHHHHHHHHHHHCCCCC-EEEEECCCCCCCCCCCHHHHHHHHHHH T ss_conf 88489999999876324875-599971567788889827899999999 No 86 >PRK05444 1-deoxy-D-xylulose-5-phosphate synthase; Provisional Probab=69.43 E-value=9.1 Score=18.89 Aligned_cols=45 Identities=18% Similarity=0.367 Sum_probs=30.3 Q ss_pred HHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCCH Q ss_conf 9996698499875268889898999999999942799499982899888 Q gi|254780898|r 93 AANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKSGADR 141 (207) Q Consensus 93 ~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~aG~~R 141 (207) .++..|..|+. ++ |. -+.+.+.+.++..++.++|++|||+.=+++ T Consensus 193 ~Fea~Gw~~i~-~i-DG--hd~~~i~~al~~ak~~~~P~lI~~kT~~G~ 237 (576) T PRK05444 193 LFEELGFKYIG-PI-DG--HDLDALVETLKNAKDLKGPVLLHVVTKKGK 237 (576) T ss_pred HHHHCCCEEEC-CC-CC--CCHHHHHHHHHHHHHCCCCEEEEEEEECCC T ss_conf 78980994868-65-89--899999999998886699989999970566 No 87 >PRK12330 oxaloacetate decarboxylase; Provisional Probab=68.77 E-value=9.7 Score=18.73 Aligned_cols=77 Identities=19% Similarity=0.228 Sum_probs=48.1 Q ss_pred HHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCC---CCCEEEECCCCCCHHHHHHHHHHHHHC-CCC-HHHHHHHHH Q ss_conf 999966984998752688898989999999999427---994999828998889999999999983-999-899999610 Q gi|254780898|r 92 KAANDLGIQLINFPLSATRELNDEQIKQLISILKTA---PKPLLIHCKSGADRTGLASAVYLYIVA-HYP-KEEAHRQLS 166 (207) Q Consensus 92 ~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~---~~PVlVHC~aG~~RTG~~~alyli~~~-g~~-~~eA~~el~ 166 (207) ....+.|...+-|. +..+..++..+..++..+++. +-||-+||+.. +|+..++|+..++ |.+ .+.|..-|+ T Consensus 162 k~l~~~G~d~i~IK-DmAGll~P~~a~~LV~~lk~~~g~d~pI~~HtH~T---~G~~~~~~l~AieAGvDivD~A~~~~s 237 (499) T PRK12330 162 KRLLDMGCDSICIK-DMAALLKPQPAYDLVKGIKEACGPDTRVHVHAHAT---TGVTLVSLMKAIEAGVDVVDTAISSMS 237 (499) T ss_pred HHHHHCCCCEEEEE-CCCCCCCHHHHHHHHHHHHHHCCCCCCEEEECCCC---CCHHHHHHHHHHHCCCCEECCCCCCCC T ss_conf 99997599989984-75346788999999999998638998379851788---746999999999849988724454323 Q ss_pred HCCCCC Q ss_conf 002642 Q gi|254780898|r 167 MLYGHF 172 (207) Q Consensus 167 ~~ygh~ 172 (207) ++++|= T Consensus 238 ~gtsqp 243 (499) T PRK12330 238 LGPGHN 243 (499) T ss_pred CCCCCC T ss_conf 798899 No 88 >pfam00682 HMGL-like HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase. Probab=66.86 E-value=11 Score=18.49 Aligned_cols=84 Identities=25% Similarity=0.288 Sum_probs=50.0 Q ss_pred HHHHHHCCCCEEEECCCCCC--CCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHC-C-CCCEEEECCCCC Q ss_conf 99999749938997378787--6544457999996698499875268889898999999999942-7-994999828998 Q gi|254780898|r 64 EYLKKEYGIKSILNLRGKLP--ESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKT-A-PKPLLIHCKSGA 139 (207) Q Consensus 64 ~~l~k~~GIktIInLr~~~~--~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~-~-~~PVlVHC~aG~ 139 (207) ..+.++.|++..+..-.... ..+...-.+.+.+.|...+.++ +..+..++.++..++..+.+ . +-|+-+||+.-. T Consensus 114 i~~a~~~g~~v~f~~~~~~~~~~~~~~~~~~~~~~~G~~~i~l~-DT~G~~~P~~v~~lv~~l~~~~~~~~i~~H~Hn~~ 192 (237) T pfam00682 114 VEAARSAGIDVELGCEDAGRTDLAFLIEVVEVAQEAGATRINIA-DTVGVLTPNEAADLISALKDRVPPVIIEVHCHNDL 192 (237) T ss_pred HHHHHHCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEC-CCCCCCCHHHHHHHHHHHHHCCCCCCEEEEECCCC T ss_conf 99999869905884051232478899999999986198579736-86455798999999999997089871588744886 Q ss_pred CHHHHHHHHHHH Q ss_conf 889999999999 Q gi|254780898|r 140 DRTGLASAVYLY 151 (207) Q Consensus 140 ~RTG~~~alyli 151 (207) |+..|-++. T Consensus 193 ---Gla~aN~l~ 201 (237) T pfam00682 193 ---GMAVANSLA 201 (237) T ss_pred ---CHHHHHHHH T ss_conf ---729999999 No 89 >cd01310 TatD_DNAse TatD like proteins; E.coli TatD is a cytoplasmic protein, shown to have magnesium dependent DNase activity. Probab=66.30 E-value=11 Score=18.42 Aligned_cols=89 Identities=16% Similarity=0.220 Sum_probs=37.4 Q ss_pred HHHHHHH---HHHHHCCCCCEEEECCCCCCHHHHHHHHH----HHHHCCC--CHHHHHHHHHHCCCCCCCCC--HHHHHH Q ss_conf 8999999---99994279949998289988899999999----9998399--98999996100026426742--459999 Q gi|254780898|r 114 DEQIKQL---ISILKTAPKPLLIHCKSGADRTGLASAVY----LYIVAHY--PKEEAHRQLSMLYGHFPVLK--TITMDI 182 (207) Q Consensus 114 ~~~i~~~---~~il~~~~~PVlVHC~aG~~RTG~~~aly----li~~~g~--~~~eA~~el~~~ygh~~~~~--~~~ld~ 182 (207) +.|...| +++..+.+.||.|||...-+.+=-+.--| ...++.+ +.++|.+-+..+| +++.+. ++--.. T Consensus 104 ~~Q~~~F~~ql~lA~~~~lPv~iH~R~A~~~~l~iL~~~~~~~~~v~H~fsG~~~~a~~~l~~G~-yisi~g~it~~~~~ 182 (251) T cd01310 104 EVQKEVFRAQLELAKELNLPVVIHSRDAHEDVLEILKEYGPPKRGVFHCFSGSAEEAKELLDLGF-YISISGIVTFKNAN 182 (251) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHCCCCCCEEEEECCCCHHHHHHHHHCCE-EEECCCCCCCCHHH T ss_conf 99999999999999875788489864547999999997088877388856899999998611890-89537411142489 Q ss_pred HHHHHHHHCCCCCCCHHHHHC Q ss_conf 999999852003672233200 Q gi|254780898|r 183 TFEKITQLYPNNVSKGDTEQP 203 (207) Q Consensus 183 ~~~~~~~~~~~~~~~~~~~~~ 203 (207) -...+.+..|.+--.-.++.| T Consensus 183 ~~~~~~~~iPldrlllETDsP 203 (251) T cd01310 183 ELREVVKEIPLERLLLETDSP 203 (251) T ss_pred HHHHHHHHCCCCEEEEECCCC T ss_conf 999999859952289825897 No 90 >smart00450 RHOD Rhodanese Homology Domain. An alpha beta fold found duplicated in the Rhodanese protein. The the Cysteine containing enzymatically active version of the domain is also found in the CDC25 class of protein phosphatases and a variety of proteins such as sulfide dehydrogenases and stress proteins such as Senesence specific protein 1 in plants, PspE and GlpE in bacteria and cyanide and arsenate resistance proteins. Inactive versions with a loss of the cysteine are also seen in Dual specificity phosphatases, ubiquitin hydrolases from yeast and in sulfuryltransferases. These are likely to play a role in protein interactions. Probab=65.20 E-value=3.2 Score=21.73 Aligned_cols=29 Identities=28% Similarity=0.506 Sum_probs=17.0 Q ss_pred CCCCCEEEECCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 27994999828998889999999999983999 Q gi|254780898|r 126 TAPKPLLIHCKSGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 126 ~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~ 157 (207) ..+.+|+++|..|. |+.. ++.++ ...|.. T Consensus 54 ~~~~~vi~yc~~g~-~s~~-~~~~l-~~~G~~ 82 (100) T smart00450 54 DKDKPVVVYCRSGN-RSAK-AAWLL-RELGFK 82 (100) T ss_pred CCCCEEEEECCCCC-CHHH-HHHHH-HHCCCC T ss_conf 79984999879998-0699-99999-980898 No 91 >PRK09629 bifunctional thiosulfate sulfurtransferase/phosphatidylserine decarboxylase; Provisional Probab=62.96 E-value=13 Score=18.03 Aligned_cols=86 Identities=24% Similarity=0.300 Sum_probs=46.0 Q ss_pred CCCEEEECCCCCCCCCCHHHHHHHHHCC-C-EEEEEECCCCC-----CCCHHHHHHHHHHH-HCCCCCEEEECCCCCCHH Q ss_conf 9938997378787654445799999669-8-49987526888-----98989999999999-427994999828998889 Q gi|254780898|r 71 GIKSILNLRGKLPESWHKEEEKAANDLG-I-QLINFPLSATR-----ELNDEQIKQLISIL-KTAPKPLLIHCKSGADRT 142 (207) Q Consensus 71 GIktIInLr~~~~~~~~~~e~~~~~~~g-i-~~i~ip~~~~~-----~~~~~~i~~~~~il-~~~~~PVlVHC~aG~~RT 142 (207) .--.|++.|++.|... +...+.+.| | .-+++++...- --..+.+.+.++-+ -+.+++|.+||..|- |+ T Consensus 161 ~~~~I~DaRSp~EY~G---e~~~A~RGGHIPGAvnieW~~~ld~~~~fK~~eeL~ell~~~GItpDkeIItYCqt~~-RS 236 (610) T PRK09629 161 ADLAIWDARAPTEYSG---EKVVAAKGGHIPGAVNFEWTAGMDKARNLRIRQDMPEILRDLGITPDKEVITHCQTHH-RS 236 (610) T ss_pred CCCEEEECCCHHHCCC---CEECCCCCCCCCCCCCCHHHHHCCCCCCCCCHHHHHHHHHHCCCCCCCCEEEECCCCC-EE T ss_conf 9807986898675277---5411022788998666717763397789688799999999759998998999899762-50 Q ss_pred HHHHHHHHHHHCCCCHHHHH Q ss_conf 99999999998399989999 Q gi|254780898|r 143 GLASAVYLYIVAHYPKEEAH 162 (207) Q Consensus 143 G~~~alyli~~~g~~~~eA~ 162 (207) |.....- ...|++.=.++ T Consensus 237 s~TylVl--klLGyp~VR~Y 254 (610) T PRK09629 237 GFTYLVA--KALGYPRVKAY 254 (610) T ss_pred HHHHHHH--HHCCCCCCCCC T ss_conf 9999999--97596502133 No 92 >PRK12581 oxaloacetate decarboxylase; Provisional Probab=62.54 E-value=13 Score=17.98 Aligned_cols=78 Identities=22% Similarity=0.277 Sum_probs=49.6 Q ss_pred HHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHC-CCCCEEEECCCCCCHHHHHHHHHHHHHC-CCC-HHHHHHHHH Q ss_conf 7999996698499875268889898999999999942-7994999828998889999999999983-999-899999610 Q gi|254780898|r 90 EEKAANDLGIQLINFPLSATRELNDEQIKQLISILKT-APKPLLIHCKSGADRTGLASAVYLYIVA-HYP-KEEAHRQLS 166 (207) Q Consensus 90 e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~-~~~PVlVHC~aG~~RTG~~~alyli~~~-g~~-~~eA~~el~ 166 (207) -.....+.|...+.+. +..+..++..+..++..+++ .+-||-+||+. =+|+..|.|+..++ |.+ .+.|..-|+ T Consensus 168 ~ak~l~~~Gad~I~iK-DmaGlL~P~~a~~LV~~lK~~~~iPI~~HtH~---t~Gla~a~~laAieAGaDiVD~Ai~~~s 243 (468) T PRK12581 168 LVKELVEMGADSICIK-DMAGILTPKAAKELVSGIKAMTNLPLIVHTHA---TSGISQMTYLAAVEAGADRIDTALSPFS 243 (468) T ss_pred HHHHHHHCCCCEEEEE-CCCCCCCHHHHHHHHHHHHHCCCCCEEEEECC---CCCHHHHHHHHHHHCCCCEEEECCCCCC T ss_conf 9999997399989984-78777688999999999983679865998258---8754999999999819999974464535 Q ss_pred HCCCC Q ss_conf 00264 Q gi|254780898|r 167 MLYGH 171 (207) Q Consensus 167 ~~ygh 171 (207) +.++| T Consensus 244 ~gtSq 248 (468) T PRK12581 244 EGTSQ 248 (468) T ss_pred CCCCC T ss_conf 79888 No 93 >PRK09875 putative hydrolase; Provisional Probab=61.53 E-value=13 Score=17.86 Aligned_cols=95 Identities=11% Similarity=0.171 Sum_probs=46.5 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEE---------E-------------------------- Q ss_conf 99999999749938997378787654445799999669849987---------5-------------------------- Q gi|254780898|r 61 TFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINF---------P-------------------------- 105 (207) Q Consensus 61 ~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~i---------p-------------------------- 105 (207) +.+..+ ++.|++|||+.....=......-.+..++.|++.+-- | T Consensus 38 ~El~~~-k~~Gg~tiVd~T~~g~GRd~~~l~~is~~tGv~IV~~TG~y~~~~~p~~~~~~s~e~la~~~i~ei~~Gi~gT 116 (292) T PRK09875 38 QEMNDL-MTRGVRNVIEMTNRYMGRNAQFMLDVMRETGINVVACTGYYQDAFFPEHVATRSVQELAQEMVDEIEQGIDGT 116 (292) T ss_pred HHHHHH-HHCCCCEEEECCCCCCCCCHHHHHHHHHHCCCCEEECCCCCCCCCCCHHHHHCCHHHHHHHHHHHHHHCCCCC T ss_conf 999999-9809997997788776859999999999729909981777877668877761799999999999998045677 Q ss_pred ---------CCCC-CCCCHHHHH---HHHHHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHH Q ss_conf ---------2688-898989999---99999942799499982899888999999999998399989 Q gi|254780898|r 106 ---------LSAT-RELNDEQIK---QLISILKTAPKPLLIHCKSGADRTGLASAVYLYIVAHYPKE 159 (207) Q Consensus 106 ---------~~~~-~~~~~~~i~---~~~~il~~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~ 159 (207) +.+. ...++.... +....-.+..-||.+||.. ++.|.-..=+ ...+|.+++ T Consensus 117 ~ikaG~Igeig~~~~~it~~E~k~lrAaA~A~~~TG~pI~~Ht~~--g~~g~e~l~i-l~e~Gvd~~ 180 (292) T PRK09875 117 ELKAGIIAEIGSSEGKITPLEEKVFIAAALAHNQTGRPISTHTSF--STMGLEQLAL-LQAHGVDLS 180 (292) T ss_pred CCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEECCCC--CCCHHHHHHH-HHHCCCCHH T ss_conf 835215555404789889899999999999999879976866888--8447999999-998498921 No 94 >cd01531 Acr2p Eukaryotic arsenate resistance proteins are members of the Rhodanese Homology Domain superfamily. Included in this CD is the Saccharomyces cerevisiae arsenate reductase protein, Acr2p, and other yeast and plant homologs. Probab=61.14 E-value=13 Score=17.82 Aligned_cols=77 Identities=16% Similarity=0.202 Sum_probs=42.2 Q ss_pred CCCHHHHHHHHHHC-CCCEEEECCCCCCCCCCHHHHHHHHHCC-C-EEEEEECCCCCCCCHHHHHHHHHHHHC-CCCCEE Q ss_conf 99989999999974-9938997378787654445799999669-8-499875268889898999999999942-799499 Q gi|254780898|r 57 QPNGTFIEYLKKEY-GIKSILNLRGKLPESWHKEEEKAANDLG-I-QLINFPLSATRELNDEQIKQLISILKT-APKPLL 132 (207) Q Consensus 57 qP~~~~~~~l~k~~-GIktIInLr~~~~~~~~~~e~~~~~~~g-i-~~i~ip~~~~~~~~~~~i~~~~~il~~-~~~PVl 132 (207) ..+++++..++++- .=-.||+.| +.|. . -| | .-+++|... ....+..+.+.+.. ....|. T Consensus 3 yi~~~~L~~~~~~~~~~~~IIDvR-~~dy---~--------GGHI~gAiNiP~~~----~~~~~~~l~~~~~~~~~~~vV 66 (113) T cd01531 3 YISPAQLKGWIRNGRPPFQVVDVR-DEDY---A--------GGHIKGSWHYPSTR----FKAQLNQLVQLLSGSKKDTVV 66 (113) T ss_pred CCCHHHHHHHHHCCCCCEEEEECC-HHHC---C--------CCEECCCEECCCHH----HHHHHHHHHHHHHHCCCEEEE T ss_conf 249999999997599987999767-6752---8--------88757857617077----887679999888635960899 Q ss_pred EECCCCCCHHHHHHHHH Q ss_conf 98289988899999999 Q gi|254780898|r 133 IHCKSGADRTGLASAVY 149 (207) Q Consensus 133 VHC~aG~~RTG~~~aly 149 (207) +||.....|...++..+ T Consensus 67 fHC~~Sq~RGP~~A~~~ 83 (113) T cd01531 67 FHCALSQVRGPSAARKF 83 (113) T ss_pred EECCCCCCCCHHHHHHH T ss_conf 98676776019999999 No 95 >cd02003 TPP_IolD Thiamine pyrophosphate (TPP) family, IolD subfamily, TPP-binding module; composed of proteins similar to Rhizobium leguminosarum bv. viciae IolD. IolD plays an important role in myo-inositol catabolism. Probab=60.83 E-value=14 Score=17.79 Aligned_cols=26 Identities=19% Similarity=0.235 Sum_probs=17.1 Q ss_pred CCHHHHHHHHHHHHCCCCCEEEECCC Q ss_conf 89899999999994279949998289 Q gi|254780898|r 112 LNDEQIKQLISILKTAPKPLLIHCKS 137 (207) Q Consensus 112 ~~~~~i~~~~~il~~~~~PVlVHC~a 137 (207) -+.+.+...++-+.+.++|++|+|.- T Consensus 158 ~~~~eL~~Al~~Al~~~gP~vIev~v 183 (205) T cd02003 158 KTIEELKAALAKAKASDRTTVIVIKT 183 (205) T ss_pred CCHHHHHHHHHHHHHCCCCEEEEEEE T ss_conf 88899999999999689969999993 No 96 >TIGR01245 trpD anthranilate phosphoribosyltransferase; InterPro: IPR005940 In many widely different species, including Escherichia coli, Thermotoga maritima, and Archaeoglobus fulgidus, this enzyme is a C-terminal part of a multifunctional protein together with glutamine amidotransferase; the fusion protein is designated anthranilate synthase component II (4.1.3.27 from EC).; GO: 0004048 anthranilate phosphoribosyltransferase activity, 0000162 tryptophan biosynthetic process. Probab=59.03 E-value=15 Score=17.59 Aligned_cols=25 Identities=28% Similarity=0.656 Sum_probs=18.3 Q ss_pred CCHHHHHHHHHHCCCCEEEECCCCC Q ss_conf 9989999999974993899737878 Q gi|254780898|r 58 PNGTFIEYLKKEYGIKSILNLRGKL 82 (207) Q Consensus 58 P~~~~~~~l~k~~GIktIInLr~~~ 82 (207) |.-+.+.+..+++|+|||-|+=|+- T Consensus 153 pa~k~v~pvR~~Lg~rTvFN~LGPL 177 (331) T TIGR01245 153 PAMKHVAPVRRELGVRTVFNLLGPL 177 (331) T ss_pred HHHHHHCCCHHHCCCCEEECCHHHC T ss_conf 4656420411116864020121000 No 97 >KOG1468 consensus Probab=58.97 E-value=2.3 Score=22.62 Aligned_cols=96 Identities=21% Similarity=0.290 Sum_probs=50.4 Q ss_pred EECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCC-CE Q ss_conf 71389998999999997499389973787876544457999996698499875268889898999999999942799-49 Q gi|254780898|r 53 YRSAQPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPK-PL 131 (207) Q Consensus 53 yRS~qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~-PV 131 (207) ..|+.|++-++..-+ .++|.|+.=..+.+... .+.+-..--..+.=.+.+.+..-..-.+.+++...+.++ .| T Consensus 85 LvssRPTAVnl~~aa--~~lk~i~~~~~~~~~~~----~~~~~~~~e~ml~~dl~~N~~ig~~g~~~Llq~~~~~~kltV 158 (354) T KOG1468 85 LVSSRPTAVNLANAA--NELKPIAASEDKSEKAK----REKCISYTEDMLEKDLADNRAIGDNGAKELLQAVKDKGKLTV 158 (354) T ss_pred HHHCCCHHHHHHHHH--HHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEE T ss_conf 874693165299899--88899998632017899----999999999999988764055667789999986489883699 Q ss_pred EEECCCCCCHH---HHHHHHHHHHHC Q ss_conf 99828998889---999999999983 Q gi|254780898|r 132 LIHCKSGADRT---GLASAVYLYIVA 154 (207) Q Consensus 132 lVHC~aG~~RT---G~~~alyli~~~ 154 (207) |-||..|+=-| |+..+..|.+.. T Consensus 159 lThCNTGSLATagyGTALGVIRsLh~ 184 (354) T KOG1468 159 LTHCNTGSLATAGYGTALGVIRSLHS 184 (354) T ss_pred EEEECCCCHHHCCCCHHHHHHHHHHH T ss_conf 98506874021144368899999876 No 98 >KOG0348 consensus Probab=57.66 E-value=14 Score=17.75 Aligned_cols=44 Identities=18% Similarity=0.342 Sum_probs=31.4 Q ss_pred HHHHHHHHHHCCCCCCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEC Q ss_conf 57878887607872502894771389998999999997499389973 Q gi|254780898|r 32 LYFLTITTFTQNFHAVVPHEIYRSAQPNGTFIEYLKKEYGIKSILNL 78 (207) Q Consensus 32 ~~~~~~~~~~~Nf~~V~~g~lyRS~qP~~~~~~~l~k~~GIktIInL 78 (207) .-+.+.+.+..+||||+||.|..+.--..+. +++ + .||.-+|.- T Consensus 226 Q~y~~~qKLl~~~hWIVPg~lmGGEkkKSEK-ARL-R-KGiNILIgT 269 (708) T KOG0348 226 QIYETVQKLLKPFHWIVPGVLMGGEKKKSEK-ARL-R-KGINILIGT 269 (708) T ss_pred HHHHHHHHHHCCCEEEEECEEECCCCCCCHH-HHH-H-CCCEEEECC T ss_conf 9999999872574377302122363310178-887-5-485489758 No 99 >COG0161 BioA Adenosylmethionine-8-amino-7-oxononanoate aminotransferase [Coenzyme metabolism] Probab=55.46 E-value=14 Score=17.79 Aligned_cols=76 Identities=18% Similarity=0.132 Sum_probs=46.2 Q ss_pred CCCHHHHHHHHHHHHCCCCCEEEEC---CCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHH Q ss_conf 9898999999999942799499982---8998889999999999983999899999610002642674245999999999 Q gi|254780898|r 111 ELNDEQIKQLISILKTAPKPLLIHC---KSGADRTGLASAVYLYIVAHYPKEEAHRQLSMLYGHFPVLKTITMDITFEKI 187 (207) Q Consensus 111 ~~~~~~i~~~~~il~~~~~PVlVHC---~aG~~RTG~~~alyli~~~g~~~~eA~~el~~~ygh~~~~~~~~ld~~~~~~ 187 (207) .|++..+..+.++.++-+ ||+=| ..|-||||.+-|| ...|..++-..--.-+.=|++|.+-+..=|++++.+ T Consensus 232 ~pp~~Yl~~vr~iC~ky~--ILlI~DEV~tGFGRTG~~FA~---e~~gi~PDi~~~aKGLT~GY~Pl~a~l~~~~I~~~~ 306 (449) T COG0161 232 VPPPGYLKRVREICDKYG--ILLIADEVATGFGRTGKMFAC---EHAGIVPDILCLAKGLTGGYLPLSAVLTSDRIYEAF 306 (449) T ss_pred CCCHHHHHHHHHHHHHCC--CEEEECCCEECCCCCCHHHHH---HHCCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 187689999999999839--479930110078857515566---525999784655131012650107675059999998 Q ss_pred HHHC Q ss_conf 9852 Q gi|254780898|r 188 TQLY 191 (207) Q Consensus 188 ~~~~ 191 (207) ..-+ T Consensus 307 ~~~~ 310 (449) T COG0161 307 SDGD 310 (449) T ss_pred HCCC T ss_conf 6466 No 100 >COG3616 Predicted amino acid aldolase or racemase [Amino acid transport and metabolism] Probab=55.45 E-value=12 Score=18.06 Aligned_cols=35 Identities=31% Similarity=0.460 Sum_probs=28.1 Q ss_pred CCHHHHHHHHHHHHCCCC--CEEEECCCCCCHHHHHH Q ss_conf 898999999999942799--49998289988899999 Q gi|254780898|r 112 LNDEQIKQLISILKTAPK--PLLIHCKSGADRTGLAS 146 (207) Q Consensus 112 ~~~~~i~~~~~il~~~~~--PVlVHC~aG~~RTG~~~ 146 (207) -+.++++.+.++..+..+ +|+|-|-.|.||+|+.. T Consensus 119 Ds~~~~~~l~~~~~~~~~pl~v~iE~D~G~~R~Gv~t 155 (368) T COG3616 119 DSVEQLDALAALARDAGKPLRVLIEIDSGLHRSGVRT 155 (368) T ss_pred CCHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCC T ss_conf 7889999999998754997469999578888667578 No 101 >COG0084 TatD Mg-dependent DNase [DNA replication, recombination, and repair] Probab=55.02 E-value=7.8 Score=19.33 Aligned_cols=53 Identities=19% Similarity=0.198 Sum_probs=27.2 Q ss_pred HHHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHHH----HHCC--CCHHHHHHHHHH Q ss_conf 9999999999427994999828998889999999999----9839--998999996100 Q gi|254780898|r 115 EQIKQLISILKTAPKPLLIHCKSGADRTGLASAVYLY----IVAH--YPKEEAHRQLSM 167 (207) Q Consensus 115 ~~i~~~~~il~~~~~PVlVHC~aG~~RTG~~~alyli----~~~g--~~~~eA~~el~~ 167 (207) +....++++..+.+.||.|||..--+.+--+.--+.. .++. =+.++|.+-+.. T Consensus 112 ~~F~~ql~lA~~~~lPviIH~R~A~~d~~~iL~~~~~~~~gi~HcFsGs~e~a~~~~d~ 170 (256) T COG0084 112 EVFEAQLELAKELNLPVIIHTRDAHEDTLEILKEEGAPVGGVLHCFSGSAEEARKLLDL 170 (256) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHCCCCCCEEEECCCCCHHHHHHHHHC T ss_conf 99999999999739977998674279999999864878888998378979999999976 No 102 >PRK09506 mrcB penicillin-binding protein 1b; Reviewed Probab=54.82 E-value=17 Score=17.16 Aligned_cols=63 Identities=11% Similarity=0.194 Sum_probs=38.6 Q ss_pred HHHHHHHHHHHHHHHHHHH-----HHHCCCCCCCCCCEE-------ECCCCCHHHHHHHHHHCCCCEEEECCCCCC Q ss_conf 9998532102455787888-----760787250289477-------138999899999999749938997378787 Q gi|254780898|r 20 LGVLVLCAVSLGLYFLTIT-----TFTQNFHAVVPHEIY-------RSAQPNGTFIEYLKKEYGIKSILNLRGKLP 83 (207) Q Consensus 20 ~~~~~~~~~~~~~~~~~~~-----~~~~Nf~~V~~g~ly-------RS~qP~~~~~~~l~k~~GIktIInLr~~~~ 83 (207) +.++....++++.|.+++. .|.++-..+ |-+|| -+...+..++...++.+|.+.+-+...+.+ T Consensus 68 ~~~~~~~~~~~~~y~~yLd~~i~~~f~g~~w~l-Pa~Vyar~l~l~~g~~~~~~~~~~eL~~l~Yr~~~~~~~~G~ 142 (839) T PRK09506 68 LKLFIVFAVLIAIYGVYLDQKIRSRIDGKVWQL-PAAVYGRMVNLEPDMSYSKNEMVKLLEATQYRQVSKMTRPGE 142 (839) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECC-CHHHEECCEECCCCCCCCHHHHHHHHHHCCCEECCCCCCCEE T ss_conf 999999999999998410577886637862056-754321201106899589999999999749703589999805 No 103 >KOG4428 consensus Probab=54.49 E-value=8.1 Score=19.23 Aligned_cols=33 Identities=21% Similarity=0.368 Sum_probs=23.6 Q ss_pred HHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHH Q ss_conf 999999999427994999828998889999999 Q gi|254780898|r 116 QIKQLISILKTAPKPLLIHCKSGADRTGLASAV 148 (207) Q Consensus 116 ~i~~~~~il~~~~~PVlVHC~aG~~RTG~~~al 148 (207) .+.....+....+|--.+-|++|+|||++.+.+ T Consensus 713 i~~~amTic~~l~G~a~tscks~KDrT~MsvtL 745 (813) T KOG4428 713 IVWEAMTICRVLNGHAFTSCKSAKDRTRMSVTL 745 (813) T ss_pred HHHHHHHHHHHCCCEEEEEECCCCCCCCCEEEC T ss_conf 999999998540772688612345655303440 No 104 >COG1154 Dxs Deoxyxylulose-5-phosphate synthase [Coenzyme metabolism / Lipid metabolism] Probab=54.14 E-value=18 Score=17.09 Aligned_cols=46 Identities=22% Similarity=0.387 Sum_probs=32.1 Q ss_pred HHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCCH Q ss_conf 99996698499875268889898999999999942799499982899888 Q gi|254780898|r 92 KAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKSGADR 141 (207) Q Consensus 92 ~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~aG~~R 141 (207) ..++++|++|+- |+++ -+.+.+...++-+++.++|+++|+..=||+ T Consensus 237 ~lFeelGf~YiG-PiDG---Hni~~Li~~Lk~~kd~~gPvllHv~T~KGK 282 (627) T COG1154 237 TLFEELGFNYIG-PIDG---HNLEELIPTLKNAKDLKGPVLLHVVTKKGK 282 (627) T ss_pred HHHHHHCCEEEC-CCCC---CCHHHHHHHHHHHHCCCCCEEEEEEECCCC T ss_conf 258981874678-8587---789999999999853799889999965888 No 105 >PRK12331 oxaloacetate decarboxylase; Provisional Probab=53.63 E-value=18 Score=17.04 Aligned_cols=103 Identities=16% Similarity=0.179 Sum_probs=60.9 Q ss_pred HHHHHCCCCEEEECCCCCC----CCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHC-CCCCEEEECCCCC Q ss_conf 9999749938997378787----6544457999996698499875268889898999999999942-7994999828998 Q gi|254780898|r 65 YLKKEYGIKSILNLRGKLP----ESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKT-APKPLLIHCKSGA 139 (207) Q Consensus 65 ~l~k~~GIktIInLr~~~~----~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~-~~~PVlVHC~aG~ 139 (207) ...++.|-...+.+.-... ..++-.-.....+.|...+.+. +..+..++..+..++..+++ .+-||-+||+. T Consensus 130 ~~~k~~G~~~~~~i~yt~sp~~t~~yyv~~a~~l~~~Gad~I~ik-D~aGll~P~~~~eLV~aLk~~~~lpI~~HtH~-- 206 (463) T PRK12331 130 KATKKAGGHAQVAISYTTSPVHTIDYFVKLAKEMQEIGADSICIK-DMAGILTPYVAYELVKCIKENVTVPLEVHTHA-- 206 (463) T ss_pred HHHHHCCCEEEEEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEEE-CCCCCCCHHHHHHHHHHHHHHCCCCEEEEECC-- T ss_conf 999970786999999725886769999999999996499889986-78677688999999999997449856998368-- Q ss_pred CHHHHHHHHHHHHHC-CCC-HHHHHHHHHHCCCC Q ss_conf 889999999999983-999-89999961000264 Q gi|254780898|r 140 DRTGLASAVYLYIVA-HYP-KEEAHRQLSMLYGH 171 (207) Q Consensus 140 ~RTG~~~alyli~~~-g~~-~~eA~~el~~~ygh 171 (207) =+|+..|.|+..++ |.+ .+.|..-|+..++| T Consensus 207 -t~Gla~an~laAieAGaDivD~a~~~~s~gtsq 239 (463) T PRK12331 207 -TSGIAEMTYLKAIEAGADIIDTAISPFAGGTSQ 239 (463) T ss_pred -CCCHHHHHHHHHHHCCCCEEEECCCCCCCCCCC T ss_conf -875799999999984999996235354679789 No 106 >PRK07259 dihydroorotate dehydrogenase 1B; Reviewed Probab=52.12 E-value=19 Score=16.88 Aligned_cols=97 Identities=12% Similarity=0.180 Sum_probs=55.8 Q ss_pred CCCCCCCCCCEEECCCCCH--H----HHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHC-CCEEEEEECCCCCC--- Q ss_conf 7872502894771389998--9----999999974993899737878765444579999966-98499875268889--- Q gi|254780898|r 42 QNFHAVVPHEIYRSAQPNG--T----FIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDL-GIQLINFPLSATRE--- 111 (207) Q Consensus 42 ~Nf~~V~~g~lyRS~qP~~--~----~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~-gi~~i~ip~~~~~~--- 111 (207) +.++.+..+-+.+-+.+++ + ++.+..++.++.-|+|+.+.+...|... .+..++. +..++.+.++.-+. T Consensus 56 Pr~~~~~~~~iN~~G~~n~G~~~~~~~~~~~~~~~~~pvi~si~~~~~~d~~~~-~~~l~~~~~ad~ielNiScPn~~~~ 134 (301) T PRK07259 56 PRIAETPGGMLNAVGLQNPGVDAFIEEELPWLEEFDTPIIANIAGSTPEEYVEV-AEKLSKAPNVDAIELNISCPNVKHG 134 (301) T ss_pred CCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHH-HHHHHCCCCCCEEEEECCCCCCCCH T ss_conf 858966621003466788359999999997642069987997376776899999-9986455688889996547888852 Q ss_pred -----CCHHHHHHHHHHHHC-CCCCEEEECCCCC Q ss_conf -----898999999999942-7994999828998 Q gi|254780898|r 112 -----LNDEQIKQLISILKT-APKPLLIHCKSGA 139 (207) Q Consensus 112 -----~~~~~i~~~~~il~~-~~~PVlVHC~aG~ 139 (207) .+++.+..+++.+.+ ...||+|-=.-.. T Consensus 135 g~~~~~~~~~l~~i~~~v~~~~~~Pv~vKlsP~~ 168 (301) T PRK07259 135 GMAFGTDPELAYEVVKAVKSVSKVPVIVKLTPNV 168 (301) T ss_pred HHHCCCCHHHHHHHHHHHHHHCCCCEEEEECCCH T ss_conf 6660879999999999998734897799807871 No 107 >PRK05692 hydroxymethylglutaryl-CoA lyase; Provisional Probab=51.49 E-value=19 Score=16.82 Aligned_cols=73 Identities=14% Similarity=0.116 Sum_probs=44.0 Q ss_pred HHHHHHCCCCEEEECCCC----C----CCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHH-CCCC-CEEE Q ss_conf 999997499389973787----8----7654445799999669849987526888989899999999994-2799-4999 Q gi|254780898|r 64 EYLKKEYGIKSILNLRGK----L----PESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILK-TAPK-PLLI 133 (207) Q Consensus 64 ~~l~k~~GIktIInLr~~----~----~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~-~~~~-PVlV 133 (207) ..+.++.|++....+--. . +........+.+.+.|...+.++ +..+..++.++..++..+. ..+. ++-+ T Consensus 126 i~~a~~~g~~v~~~i~~afg~p~~~~~~~~~l~~~~~~~~~~Ga~~I~la-DT~G~a~P~~v~~~i~~v~~~~~~~~i~~ 204 (287) T PRK05692 126 AEAAKQAGVRVRGYVSCVLGCPYEGEVPPEAVADVAERLFALGCDEISLG-DTIGVGTPGQVRRVLEAVLAEFPAERLAG 204 (287) T ss_pred HHHHHHCCCEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEECC-CCCCCCCHHHHHHHHHHHHHHCCCCCEEE T ss_conf 99999769879998740136764686489999999999985799785447-65566699999999999998668872356 Q ss_pred ECCC Q ss_conf 8289 Q gi|254780898|r 134 HCKS 137 (207) Q Consensus 134 HC~a 137 (207) ||+. T Consensus 205 H~Hn 208 (287) T PRK05692 205 HFHD 208 (287) T ss_pred EECC T ss_conf 7448 No 108 >TIGR03121 one_C_dehyd_A formylmethanofuran dehydrogenase subunit A. Members of this largely archaeal protein family are subunit A of the formylmethanofuran dehydrogenase. Nomenclature in some bacteria may reflect inclusion of the formyltransferase described by TIGR03119 as part of the complex, and therefore call this protein formyltransferase/hydrolase complex Fhc subunit A. Note that this model does not distinguish tungsten (FwdA) from molybdenum-containing (FmdA) forms of this enzyme; a single gene from this family is expressed constitutively in Methanobacterium thermoautotrophicum, which has both tungsten and molybdenum forms and may work interchangeably. Probab=50.32 E-value=16 Score=17.29 Aligned_cols=37 Identities=22% Similarity=0.329 Sum_probs=25.0 Q ss_pred EEEEECCCCCCCCHHHHHHHHHHHHC--CCCCEEEECCC Q ss_conf 99875268889898999999999942--79949998289 Q gi|254780898|r 101 LINFPLSATRELNDEQIKQLISILKT--APKPLLIHCKS 137 (207) Q Consensus 101 ~i~ip~~~~~~~~~~~i~~~~~il~~--~~~PVlVHC~a 137 (207) -++-+++..+.-+.+.+..+.+..+. .+.|+-|||.. T Consensus 199 ~ldd~~p~~~vTpreIi~~L~~a~e~LglPH~lHvH~nn 237 (556) T TIGR03121 199 SLDDPVPYFGITPREIIKGLARANEELGLPHSIHVHCNN 237 (556) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEECCC T ss_conf 455788886888999999999999854899635771266 No 109 >PRK11493 sseA 3-mercaptopyruvate sulfurtransferase; Provisional Probab=49.90 E-value=21 Score=16.67 Aligned_cols=80 Identities=14% Similarity=0.210 Sum_probs=37.7 Q ss_pred CCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEE------C----CCCCCCCHHHHHHHHHHH-HCCCCCEEEECCCCC Q ss_conf 99389973787876544457999996698499875------2----688898989999999999-427994999828998 Q gi|254780898|r 71 GIKSILNLRGKLPESWHKEEEKAANDLGIQLINFP------L----SATRELNDEQIKQLISIL-KTAPKPLLIHCKSGA 139 (207) Q Consensus 71 GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip------~----~~~~~~~~~~i~~~~~il-~~~~~PVlVHC~aG~ 139 (207) +-..||+-|...+......+. ..+++.-||| + .+..-.+.+.++++..-. -..++||.++|.+|. T Consensus 167 ~~~~ivDaR~~~~y~G~~~~~----~~~~r~GhIPgA~n~p~~~~~~~g~lk~~~el~~~~~~~gi~~~~~vi~yC~sG~ 242 (281) T PRK11493 167 KTAQIVDARPAARFNAEVDEP----RPGLRRGHIPGALNVPWTELVREGELKTTDELDAIFFGRGVSFDRPIIASCGSGV 242 (281) T ss_pred CCCEEEECCCHHHHCCCCCCC----CCCCCCCCCCCCCCCCHHHHHCCCCCCCHHHHHHHHHHCCCCCCCCEEEECCCHH T ss_conf 985388168787716735787----5345787788977678999963998189999999999759899999999897199 Q ss_pred CHHHHHHHHHHHHHCCCC Q ss_conf 889999999999983999 Q gi|254780898|r 140 DRTGLASAVYLYIVAHYP 157 (207) Q Consensus 140 ~RTG~~~alyli~~~g~~ 157 (207) |++... +.+.. .|+. T Consensus 243 -~As~~~-~~l~~-lG~~ 257 (281) T PRK11493 243 -TAAVVV-LALAT-LDVP 257 (281) T ss_pred -HHHHHH-HHHHH-CCCC T ss_conf -999999-99998-6999 No 110 >cd01448 TST_Repeat_1 Thiosulfate sulfurtransferase (TST), N-terminal, inactive domain. TST contains 2 copies of the Rhodanese Homology Domain; this is the 1st repeat, which does not contain the catalytically active Cys residue. The role of the 1st repeat is uncertain, but it is believed to be involved in protein interaction. Probab=48.65 E-value=22 Score=16.54 Aligned_cols=93 Identities=14% Similarity=0.150 Sum_probs=47.0 Q ss_pred CHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCE-EEEEECCC---------CCCCCHHHHHHHHHHHH-CC Q ss_conf 989999999974993899737878765444579999966984-99875268---------88989899999999994-27 Q gi|254780898|r 59 NGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQ-LINFPLSA---------TRELNDEQIKQLISILK-TA 127 (207) Q Consensus 59 ~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~-~i~ip~~~---------~~~~~~~~i~~~~~il~-~~ 127 (207) +++.+...+.+..++ ||+.|...+.. ...+...+-.|. -+++++.. ...|+.+++.+++.-+- +. T Consensus 3 s~~wL~~~l~~~~v~-IiD~r~~~~~~---~~~~~y~~gHIPGAi~~~~~~~~~~~~~~~~~l~~~~~l~~~l~~~GI~~ 78 (122) T cd01448 3 SPDWLAEHLDDPDVR-ILDARWYLPDR---DGRKEYLEGHIPGAVFFDLDEDLDDKSPGPHMLPSPEEFAELLGSLGISN 78 (122) T ss_pred CHHHHHHHHCCCCEE-EEEECCCCCCC---CHHHHHHHCCCCCCEECCCHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCC T ss_conf 889999862899969-99820898998---77899986768896464506441689998877889899999999809888 Q ss_pred CCCEEEECCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 994999828998889999999999983999 Q gi|254780898|r 128 PKPLLIHCKSGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 128 ~~PVlVHC~aG~~RTG~~~alyli~~~g~~ 157 (207) +.+|.+.|..|. +.++-+ .+.....|.+ T Consensus 79 ~~~VV~Y~~~~~-~~a~r~-~w~L~~~G~~ 106 (122) T cd01448 79 DDTVVVYDDGGG-FFAARA-WWTLRYFGHE 106 (122) T ss_pred CCEEEEEECCCC-HHHHHH-HHHHHHCCCC T ss_conf 975999808987-389999-9999983998 No 111 >cd01017 AdcA Metal binding protein AcdA. These proteins have been shown to function in the ABC uptake of Zn2+ and Mn2+ and in competence for genetic transformation and adhesion. The AcdA proteins belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a long alpha helix and they bind their ligand in the cleft between these domains. In addition, many of these proteins have a low complexity region containing metal binding histidine-rich motif (repetitive HDH sequence). Probab=47.92 E-value=22 Score=16.47 Aligned_cols=54 Identities=19% Similarity=0.266 Sum_probs=31.7 Q ss_pred CHHHHHHHHHCCCEEEEEE-CCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCCH Q ss_conf 4457999996698499875-268889898999999999942799499982899888 Q gi|254780898|r 87 HKEEEKAANDLGIQLINFP-LSATRELNDEQIKQLISILKTAPKPLLIHCKSGADR 141 (207) Q Consensus 87 ~~~e~~~~~~~gi~~i~ip-~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~aG~~R 141 (207) ++...-++++.|++.+.+. +.....|+...+.++.+.+++.+=++++ |...... T Consensus 178 H~af~Yf~~~ygl~~~~~~~~~~~~eps~~~l~~l~~~ik~~~v~~If-~e~~~~~ 232 (282) T cd01017 178 HAAFGYLARRYGLKQIAIVGVSPEVEPSPKQLAELVEFVKKSDVKYIF-FEENASS 232 (282) T ss_pred CCHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHCCCCEEE-EECCCCC T ss_conf 533789998669848654334888899999999999999985998999-8189990 No 112 >PRK01060 endonuclease IV; Provisional Probab=47.50 E-value=22 Score=16.43 Aligned_cols=91 Identities=18% Similarity=0.331 Sum_probs=57.7 Q ss_pred EEECCCCCHHHH---HHHHHHCCCCE---------EEECCCCCCCC------CCHHHHHHHHHCCCEEEEEECCCC-CCC Q ss_conf 771389998999---99999749938---------99737878765------444579999966984998752688-898 Q gi|254780898|r 52 IYRSAQPNGTFI---EYLKKEYGIKS---------ILNLRGKLPES------WHKEEEKAANDLGIQLINFPLSAT-REL 112 (207) Q Consensus 52 lyRS~qP~~~~~---~~l~k~~GIkt---------IInLr~~~~~~------~~~~e~~~~~~~gi~~i~ip~~~~-~~~ 112 (207) -|+...+++++. ....+++|+.. +|||.++++.- ....|.+.|+.+|+.++.+..-+. +.. T Consensus 38 ~w~~~~~~~~~~~~f~~~~~~~~~~~~~iv~Ha~YlINLasp~~~~~~kS~~~l~~el~r~~~lG~~~vV~HpGs~~g~~ 117 (281) T PRK01060 38 QWKRKPLEELNIEAFKAACEKYGISPEDILVHAPYLINLGNPNPEKLEKSRDALIDEIERCEALGAKLLNFHPGSHLGAI 117 (281) T ss_pred CCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEECCCEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCC T ss_conf 89999999899999999999819997545752453232689987899999999999999999809985984576336888 Q ss_pred CHH-HHHHHHH----HHHCCC-CCEEEECCCCCCHH Q ss_conf 989-9999999----994279-94999828998889 Q gi|254780898|r 113 NDE-QIKQLIS----ILKTAP-KPLLIHCKSGADRT 142 (207) Q Consensus 113 ~~~-~i~~~~~----il~~~~-~PVlVHC~aG~~RT 142 (207) .++ .+..+.+ +++..+ -++++--.||.|-+ T Consensus 118 ~~~~gi~~i~~~l~~~l~~~~~~~ilLEn~AGqG~~ 153 (281) T PRK01060 118 DEADCLDRIAESLNEALDKTQGVTIVLENTAGQGSE 153 (281) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCC T ss_conf 799999999999999981589867999724899982 No 113 >cd01525 RHOD_Kc Member of the Rhodanese Homology Domain superfamily. Included in this CD are the rhodanese-like domains found C-terminal of the serine/threonine protein kinases catalytic (S_TKc) domain and the Tre-2, BUB2p, Cdc16p (TBC) domain. The putative active site Cys residue is not present in this CD. Probab=47.15 E-value=23 Score=16.40 Aligned_cols=87 Identities=18% Similarity=0.088 Sum_probs=48.1 Q ss_pred CHHHHHHHHHHCCCCE-EEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCC--CHHHHHHHHHHHHCCCCCEEEEC Q ss_conf 9899999999749938-99737878765444579999966984998752688898--98999999999942799499982 Q gi|254780898|r 59 NGTFIEYLKKEYGIKS-ILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATREL--NDEQIKQLISILKTAPKPLLIHC 135 (207) Q Consensus 59 ~~~~~~~l~k~~GIkt-IInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~--~~~~i~~~~~il~~~~~PVlVHC 135 (207) +..++..+.+....|. ||++|...+.. +-+=..-+++|+...... ..++....-.+-+..++||.|=| T Consensus 2 s~~d~~~L~~~~~~~~~vvD~R~~~ef~---------~ghi~~siniPf~~~~~~~~~L~~~~~~~~Lek~Kgk~VViV~ 72 (105) T cd01525 2 SVYDVIRLLDNSPAKLAAVDIRSSPDFR---------RGHIEGSINIPFSSVFLKEGELEQLPTVPRLENYKGKIIVIVS 72 (105) T ss_pred CHHHHHHHHHCCCCCEEEEECCCHHHHC---------CCEECCCCCCCCHHHCCCCCCCCCCCCHHHHHHCCCCEEEEEC T ss_conf 6899999985697757999777878854---------6611275505730201554422136425899975898299988 Q ss_pred CCCCCHHHHHHHHHHHHHCCCC Q ss_conf 8998889999999999983999 Q gi|254780898|r 136 KSGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 136 ~aG~~RTG~~~alyli~~~g~~ 157 (207) .+|. .+.. .|-.|+. .|++ T Consensus 73 ~~g~-~a~~-fa~~Lvk-~Gf~ 91 (105) T cd01525 73 HSHK-HAAL-FAAFLVK-CGVP 91 (105) T ss_pred CCCC-CHHH-HHHHHHH-CCCC T ss_conf 9986-7999-9999998-4998 No 114 >TIGR02418 acolac_catab acetolactate synthase, catabolic; InterPro: IPR012782 Acetolactate synthase is a thiamin pyrophosphate-dependent enzyme that combines two molecules of pyruvate to yield 2-acetolactate with the release of CO2. It exists in two distinct forms that have different properties, though they are clearly related , . The biosynthetic form, found in plants, fungi and bacteria, is involved in the biosynthesis of branched-chain amino acids and contains non-catalytic FAD. The catabolic form is found only in some bacteria, participates in butanediol fermentation, and does not contain FAD. This entry represents the catabolic form of acetolactate synthase, generally found in species that posses acetolactate decarboxylase, which is often encoded in the same operon. The enzyme is thought to be a homotetramer, though this has not conclusively been shown. Each monomer consists of three distinct domains, designated, starting from the N-terminus, as alpha, beta and gamma . The core of the alpha and gamma domains is a six-stranded parallel beta sheet surrounded by alpha helices, while the the beta domain forms a double Rossman fold. The thiamin pyrophosphate cofactor has an unusual conformation that has not been seen in other enzymes using this cofactor, including the biosynthetic form of acetolactate synthase. The active site is located in a funnel at the dimer interface, bordered by residues from all three domains.; GO: 0000287 magnesium ion binding, 0003984 acetolactate synthase activity, 0030976 thiamin pyrophosphate binding, 0050660 FAD binding. Probab=47.12 E-value=23 Score=16.40 Aligned_cols=22 Identities=5% Similarity=0.231 Sum_probs=10.6 Q ss_pred CCHHHHHHHHHHH--HCCCCCEEE Q ss_conf 8989999999999--427994999 Q gi|254780898|r 112 LNDEQIKQLISIL--KTAPKPLLI 133 (207) Q Consensus 112 ~~~~~i~~~~~il--~~~~~PVlV 133 (207) -++++++..++-. .+.+|||+| T Consensus 516 ~~~d~L~~tLr~Ali~~~eGPvvv 539 (553) T TIGR02418 516 ESPDELEPTLRQALIMEVEGPVVV 539 (553) T ss_pred CCHHHHHHHHHHHCEECCCCCEEE T ss_conf 897899999874120227888899 No 115 >PRK05802 hypothetical protein; Provisional Probab=45.95 E-value=24 Score=16.28 Aligned_cols=77 Identities=29% Similarity=0.339 Sum_probs=48.1 Q ss_pred CCHHHHHHHHHHCCCC--EEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEEC Q ss_conf 9989999999974993--89973787876544457999996698499875268889898999999999942799499982 Q gi|254780898|r 58 PNGTFIEYLKKEYGIK--SILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHC 135 (207) Q Consensus 58 P~~~~~~~l~k~~GIk--tIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC 135 (207) |..--+.+|.+ .|=+ .|++=..- +. ..-.+..++.+++.++.++-+.+..+++.-..+.+++++ +.+-+||| T Consensus 187 P~vpVikkL~~-n~NkV~vIid~~~~-~~---~fI~eyl~~~~~~~i~~~~l~~g~lsee~K~~i~~~ik~-~~~~lI~~ 260 (328) T PRK05802 187 PAVPVIKKLYS-NGNKVIVILDKGPF-EN---NFIKEYLEEYNIEVIECNLLDKGELSEEGKDILKEIIKK-EDINLIHC 260 (328) T ss_pred CCHHHHHHHHH-CCCEEEEEECCCCC-CC---HHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHHHHHH-CCCCEEEE T ss_conf 55599999986-89979999858855-31---659999985783699831422344578899999999973-79668996 Q ss_pred CCCCCH Q ss_conf 899888 Q gi|254780898|r 136 KSGADR 141 (207) Q Consensus 136 ~aG~~R 141 (207) +|.|- T Consensus 261 -~G~D~ 265 (328) T PRK05802 261 -GGSDI 265 (328) T ss_pred -CCHHH T ss_conf -67189 No 116 >smart00822 PKS_KR This enzymatic domain is part of bacterial polyketide synthases and catalyses the first step in the reductive modification of the beta-carbonyl centres in the growing polyketide chain. It uses NADPH to reduce the keto group to a hydroxy group. Probab=45.32 E-value=24 Score=16.22 Aligned_cols=79 Identities=15% Similarity=0.151 Sum_probs=37.7 Q ss_pred HHHHHHHHCCCCEEE-ECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCC--EEEECCCC Q ss_conf 999999974993899-737878765444579999966984998752688898989999999999427994--99982899 Q gi|254780898|r 62 FIEYLKKEYGIKSIL-NLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKP--LLIHCKSG 138 (207) Q Consensus 62 ~~~~l~k~~GIktII-nLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~P--VlVHC~aG 138 (207) .+...+.+.|-+.|+ .=|.+.+........+..+..|.+...++. +.-++++++.+.+...+.-+| ++|||. | T Consensus 15 ~~a~~la~~Ga~~vvl~~R~~~~~~~~~~~~~~~~~~g~~v~~~~~---Dv~~~~~~~~~v~~~~~~~g~id~lvn~A-G 90 (180) T smart00822 15 ELARWLAERGARHLVLLSRSGPDAPGAAELLAELEALGAEVTVVAC---DVADRAALAAALAAIPARLGPLRGVIHAA-G 90 (180) T ss_pred HHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEEE---CCCCHHHHHHHHHHHHHHCCCEEEEEEEC-C T ss_conf 9999999879988999868987818899999999956996999980---26886776667776799739837999424-6 Q ss_pred CCHHHH Q ss_conf 888999 Q gi|254780898|r 139 ADRTGL 144 (207) Q Consensus 139 ~~RTG~ 144 (207) ..+.+. T Consensus 91 ~~~~~~ 96 (180) T smart00822 91 VLDDGL 96 (180) T ss_pred CCCCCC T ss_conf 669977 No 117 >PRK10425 DNase TatD; Provisional Probab=44.98 E-value=24 Score=16.19 Aligned_cols=54 Identities=19% Similarity=0.335 Sum_probs=28.8 Q ss_pred HHHHHH---HHHHHHCCCCCEEEECCCCCCHHHHHHHHHHHHH-----CCC--CHHHHHHHHHH Q ss_conf 899999---9999942799499982899888999999999998-----399--98999996100 Q gi|254780898|r 114 DEQIKQ---LISILKTAPKPLLIHCKSGADRTGLASAVYLYIV-----AHY--PKEEAHRQLSM 167 (207) Q Consensus 114 ~~~i~~---~~~il~~~~~PVlVHC~aG~~RTG~~~alyli~~-----~g~--~~~eA~~el~~ 167 (207) ..|... .+++..+.+.||.|||...-+.+=-+.--|.-.. +.+ +.++|.+-+.. T Consensus 104 ~~Q~~~F~~ql~lA~~~~lPviiH~R~A~~~~l~iL~~~~~~~~~~v~HcFsGs~e~a~~~l~~ 167 (258) T PRK10425 104 EEQERAFVAQLRIAAELNMPVFMHCRDAHERFLTLLEPWLDKLPGAVLHCFTGTREEMQACVAR 167 (258) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHCCCCCCEEEECCCCHHHHHHHHHC T ss_conf 9999999999999999799879996441589999999971668673343068999999999974 No 118 >cd01520 RHOD_YbbB Member of the Rhodanese Homology Domain superfamily. This CD includes several putative ATP /GTP binding proteins including E. coli YbbB. Probab=44.82 E-value=25 Score=16.17 Aligned_cols=88 Identities=18% Similarity=0.200 Sum_probs=44.7 Q ss_pred CHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCC-----------CCHH-----------H Q ss_conf 98999999997499389973787876544457999996698499875268889-----------8989-----------9 Q gi|254780898|r 59 NGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRE-----------LNDE-----------Q 116 (207) Q Consensus 59 ~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~-----------~~~~-----------~ 116 (207) +.+++..+.+ .+ .-+|++|++.|.. +.+=-.-+++|+-+... ...+ . T Consensus 2 ~~~~~~~l~~-~~-~plIDVRSP~Ef~---------~ghiPgAiNiPll~d~ER~~VGt~YK~~g~~~A~~lG~~lv~~~ 70 (128) T cd01520 2 TAEDLLALRK-AD-GPLIDVRSPKEFF---------EGHLPGAINLPLLDDEERALVGTLYKQQGREAAIELGLELVSGK 70 (128) T ss_pred CHHHHHHHHH-CC-CCEEECCCHHHHH---------CCCCCCCCCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf 7899999996-69-9768758878873---------59888876360218188999858888549899999999999889 Q ss_pred HHHHH-HHH---HCCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHH Q ss_conf 99999-999---42799499982899888999999999998399989 Q gi|254780898|r 117 IKQLI-SIL---KTAPKPLLIHCKSGADRTGLASAVYLYIVAHYPKE 159 (207) Q Consensus 117 i~~~~-~il---~~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~ 159 (207) +..++ +.. .+.+.+++|+|.-|--|++++. ..+...|+... T Consensus 71 l~~~~~~~~~~~~~~~~~~~vyC~RGGmRS~s~~--~~L~~~G~~v~ 115 (128) T cd01520 71 LKRILNEAWEARLERDPKLLIYCARGGMRSQSLA--WLLESLGIDVP 115 (128) T ss_pred HHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHH--HHHHHCCCCCC T ss_conf 9999999999861469988999825770679999--99998599731 No 119 >cd02004 TPP_BZL_OCoD_HPCL Thiamine pyrophosphate (TPP) family, BZL_OCoD_HPCL subfamily, TPP-binding module; composed of proteins similar to benzaldehyde lyase (BZL), oxalyl-CoA decarboxylase (OCoD) and 2-hydroxyphytanoyl-CoA lyase (2-HPCL). Pseudomonas fluorescens biovar I BZL cleaves the acyloin linkage of benzoin producing 2 molecules of benzaldehyde and enabling the Pseudomonas to grow on benzoin as the sole carbon and energy source. OCoD has a role in the detoxification of oxalate, catalyzing the decarboxylation of oxalyl-CoA to formate. 2-HPCL is a peroxisomal enzyme which plays a role in the alpha-oxidation of 3-methyl-branched fatty acids, catalyzing the cleavage of 2-hydroxy-3-methylacyl-CoA into formyl-CoA and a 2-methyl-branched fatty aldehyde. All these enzymes depend on Mg2+ and TPP for activity. Probab=44.56 E-value=25 Score=16.15 Aligned_cols=36 Identities=11% Similarity=0.068 Sum_probs=18.7 Q ss_pred CCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECC Q ss_conf 938997378787654445799999669849987526 Q gi|254780898|r 72 IKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLS 107 (207) Q Consensus 72 IktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~ 107 (207) =+.||++-++..-...-.|...+.+.++..+-+-+. T Consensus 66 ~~~Vv~i~GDG~f~m~~~EL~Ta~~~~lpi~~iV~n 101 (172) T cd02004 66 DKRVVLVEGDGAFGFSGMELETAVRYNLPIVVVVGN 101 (172) T ss_pred CCEEEEEECCCCHHCCCHHHHHHHHHCCCEEEEEEE T ss_conf 984999977973402407888999849885999982 No 120 >PRK11858 aksA trans-homoaconitate synthase; Reviewed Probab=43.50 E-value=26 Score=16.05 Aligned_cols=85 Identities=20% Similarity=0.276 Sum_probs=53.3 Q ss_pred HHHHHCCCCEEEECCCCC--CCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHC-CCCCEEEECCCCCCH Q ss_conf 999974993899737878--76544457999996698499875268889898999999999942-799499982899888 Q gi|254780898|r 65 YLKKEYGIKSILNLRGKL--PESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKT-APKPLLIHCKSGADR 141 (207) Q Consensus 65 ~l~k~~GIktIInLr~~~--~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~-~~~PVlVHC~aG~~R 141 (207) .+.+++|++.-++.-... +..+.....+.+.+.|...+.++ ++.+..++.++..++..+.+ .+-|+=+||+. - T Consensus 123 ~~Ak~~Gl~v~f~~eD~~r~~~~~l~~~~~~a~~~Gad~I~l~-DT~G~~~P~~v~~~v~~l~~~~~~~i~~H~HN---d 198 (378) T PRK11858 123 EYAKDHGLYVSFSAEDASRTDLDFLIEAAKAAEEAGADRVRFC-DTVGILDPFTMYELVKELVEAVDIPIEVHCHN---D 198 (378) T ss_pred HHHHHCCCEEEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEEC-CCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEC---C T ss_conf 9999769869994401256899999999999997499899963-65566699999999999997269855999707---7 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999998 Q gi|254780898|r 142 TGLASAVYLYIV 153 (207) Q Consensus 142 TG~~~alyli~~ 153 (207) .|+.+|--+..+ T Consensus 199 ~GlAvANalaAv 210 (378) T PRK11858 199 FGMATANAIAGI 210 (378) T ss_pred CCHHHHHHHHHH T ss_conf 555999999999 No 121 >COG0182 Predicted translation initiation factor 2B subunit, eIF-2B alpha/beta/delta family [Translation, ribosomal structure and biogenesis] Probab=42.43 E-value=21 Score=16.55 Aligned_cols=35 Identities=29% Similarity=0.339 Sum_probs=20.8 Q ss_pred HHHHHCCCCCEEEECCCCCCHH---HHHHHHHHHH-HCCC Q ss_conf 9999427994999828998889---9999999999-8399 Q gi|254780898|r 121 ISILKTAPKPLLIHCKSGADRT---GLASAVYLYI-VAHY 156 (207) Q Consensus 121 ~~il~~~~~PVlVHC~aG~~RT---G~~~alyli~-~~g~ 156 (207) .+++.+.. .||-||.+|+==| |++.+..|.. .+|. T Consensus 144 ~~ll~~~~-~VLThCNaGaLAt~~~GTAlgviR~a~~~gk 182 (346) T COG0182 144 AELLPDGD-TVLTHCNAGALATVGYGTALGVIRSAHEEGK 182 (346) T ss_pred HHHHCCCC-EEEEEECCCCEEECCCCCHHHHHHHHHHCCC T ss_conf 98703688-2873416874432576504799999997798 No 122 >TIGR02249 integrase_gron integron integrase; InterPro: IPR011946 Members of this family are integrases associated with integrons (and super-integrons), which are systems for incorporating and expressing cassettes of laterally transferred DNA. Incorporation occurs at an attI site. A super-integron, as in Vibrio sp., may include over 100 cassettes. This family belongs to the phage integrase family that also includes recombinases XerC (IPR011931 from INTERPRO) and XerD (IPR011932 from INTERPRO), which are bacterial housekeeping proteins. Within this family of integron integrases, some are designated by class, e.g. IntI4, a class 4 integron integrase from Vibrio cholerae N16961.; GO: 0003677 DNA binding, 0006310 DNA recombination, 0015074 DNA integration. Probab=42.41 E-value=15 Score=17.47 Aligned_cols=71 Identities=17% Similarity=0.139 Sum_probs=43.1 Q ss_pred CCCCEEEEC-CCCCCHHHHHHHH-HHHHHCCCCHHHH--HHH-HHHCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHH Q ss_conf 799499982-8998889999999-9999839998999--996-1000264267424599999999998520036722332 Q gi|254780898|r 127 APKPLLIHC-KSGADRTGLASAV-YLYIVAHYPKEEA--HRQ-LSMLYGHFPVLKTITMDITFEKITQLYPNNVSKGDTE 201 (207) Q Consensus 127 ~~~PVlVHC-~aG~~RTG~~~al-yli~~~g~~~~eA--~~e-l~~~ygh~~~~~~~~ld~~~~~~~~~~~~~~~~~~~~ 201 (207) ..+-|.|+| |.||||+=++.-= +--+.+-.....+ .++ +.-+||-++. ...+++.|||+.. .-.| T Consensus 150 ~~~~I~vr~GKG~KdR~v~Lp~~L~~~L~~q~~~~~~~h~~Dl~~~G~g~V~L---------P~AL~rKYPnA~~-~~~W 219 (320) T TIGR02249 150 DYGEIRVRDGKGGKDRVVTLPKSLAPPLREQLERARALHEKDLLAEGYGGVYL---------PHALARKYPNASK-EWGW 219 (320) T ss_pred CCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEH---------HHHHHCCCCCCCC-CCCC T ss_conf 37668997366898861268175579999999999999998640588530100---------4554215787356-6787 Q ss_pred HCCCCC Q ss_conf 002579 Q gi|254780898|r 202 QPMNAT 207 (207) Q Consensus 202 ~~~~~~ 207 (207) |=++|| T Consensus 220 qylFPs 225 (320) T TIGR02249 220 QYLFPS 225 (320) T ss_pred EEECCC T ss_conf 665377 No 123 >PRK09522 bifunctional anthranilate synthase/anthranilate phosphoribosyltransferase; Provisional Probab=42.15 E-value=27 Score=15.91 Aligned_cols=25 Identities=8% Similarity=0.198 Sum_probs=17.7 Q ss_pred CCCHHHHHHHHHHCCCCEEEECCCC Q ss_conf 9998999999997499389973787 Q gi|254780898|r 57 QPNGTFIEYLKKEYGIKSILNLRGK 81 (207) Q Consensus 57 qP~~~~~~~l~k~~GIktIInLr~~ 81 (207) +|.-..+.+..+++|++||.|+-++ T Consensus 354 Hpamk~v~pvRk~Lg~rTiFNiLGP 378 (531) T PRK09522 354 HTGFRHAMPVRQQLKTRTLFNVLGP 378 (531) T ss_pred CHHHHHHHHHHHHCCCCCCEECCCC T ss_conf 6669999999997198851213455 No 124 >cd04740 DHOD_1B_like Dihydroorotate dehydrogenase (DHOD) class 1B FMN-binding domain. DHOD catalyzes the oxidation of (S)-dihydroorotate to orotate. This is the fourth step and the only redox reaction in the de novo biosynthesis of UMP, the precursor of all pyrimidine nucleotides. DHOD requires FMN as co-factor. DHOD divides into class 1 and class 2 based on their amino acid sequences and cellular location. Members of class 1 are cytosolic enzymes and multimers while class 2 enzymes are membrane associated and monomeric. The class 1 enzymes can be further divided into subtypes 1A and 1B which are homodimers and heterotetrameric proteins, respectively. Probab=42.13 E-value=27 Score=15.91 Aligned_cols=75 Identities=17% Similarity=0.352 Sum_probs=48.0 Q ss_pred HHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCC--------CHHHHHHHHHHHHC-CCCCEEEECC Q ss_conf 99974993899737878765444579999966984998752688898--------98999999999942-7994999828 Q gi|254780898|r 66 LKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATREL--------NDEQIKQLISILKT-APKPLLIHCK 136 (207) Q Consensus 66 l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~--------~~~~i~~~~~il~~-~~~PVlVHC~ 136 (207) ..+..++.-|+|+.+..++.| ..-.+..++.|..++.+.+++-+.+ +.+.+.+++..+++ ...||+|-=. T Consensus 84 ~~~~~~~pvi~si~~~~~~d~-~~~~~~~~~~gad~ielNiScPNt~~~g~~~~~~~~~~~~i~~~vk~~~~~Pi~vKls 162 (296) T cd04740 84 WLREFGTPVIASIAGSTVEEF-VEVAEKLADAGADAIELNISCPNVKGGGMAFGTDPEAVAEIVKAVKKATDVPVIVKLT 162 (296) T ss_pred HHCCCCCEEEEEECCCCHHHH-HHHHHHHHHCCCCEEEEECCCCCCCCCHHHHCCCHHHHHHHHHHHHHCCCCCEEEECC T ss_conf 635689718998168987899-9999998864898899978899867636775749999999999998604896699718 Q ss_pred CCCCH Q ss_conf 99888 Q gi|254780898|r 137 SGADR 141 (207) Q Consensus 137 aG~~R 141 (207) -..+. T Consensus 163 P~~~~ 167 (296) T cd04740 163 PNVTD 167 (296) T ss_pred CCCCH T ss_conf 98000 No 125 >COG3830 ACT domain-containing protein [Signal transduction mechanisms] Probab=41.46 E-value=13 Score=18.02 Aligned_cols=19 Identities=37% Similarity=0.595 Sum_probs=13.7 Q ss_pred CCCCHHHHHHHHHHHHHCC Q ss_conf 9988899999999999839 Q gi|254780898|r 137 SGADRTGLASAVYLYIVAH 155 (207) Q Consensus 137 aG~~RTG~~~alyli~~~g 155 (207) .|+||+|++++...++-++ T Consensus 9 ~GkDr~GIva~is~vLAe~ 27 (90) T COG3830 9 IGKDRVGIVAAVSRVLAEH 27 (90) T ss_pred ECCCCCCHHHHHHHHHHHC T ss_conf 7588773349999999976 No 126 >cd01304 FMDH_A Formylmethanofuran dehydrogenase (FMDH) subunit A; Methanogenic bacteria and archea derive the energy for autotrophic growth from methanogenesis, the reduction of CO2 with molecular hydrogen as the electron donor. FMDH catalyzes the first step in methanogenesis, the formyl-methanofuran synthesis. In this step, CO2 is bound to methanofuran and subsequently reduced to the formyl state with electrons derived from hydrogen. Probab=41.19 E-value=19 Score=16.92 Aligned_cols=42 Identities=19% Similarity=0.318 Sum_probs=26.2 Q ss_pred EEEEECCCCCCCCHHHHHHHHHHHHC--CCCCEEEECCCCCCHHH Q ss_conf 99875268889898999999999942--79949998289988899 Q gi|254780898|r 101 LINFPLSATRELNDEQIKQLISILKT--APKPLLIHCKSGADRTG 143 (207) Q Consensus 101 ~i~ip~~~~~~~~~~~i~~~~~il~~--~~~PVlVHC~aG~~RTG 143 (207) .++=+++.++.-+.+.+..+.++.+. .+.|+-|||.. .|-.| T Consensus 195 ~lDd~~p~~~vtpr~Ii~~L~~a~e~LglPH~lHvH~nN-LG~PG 238 (541) T cd01304 195 SLDDPVPYFDITPREILKGLAEANEELGLPHSIHVHCNN-LGVPG 238 (541) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEHHHHH-CCCCC T ss_conf 566888887868999999999999865999603201231-79998 No 127 >cd02010 TPP_ALS Thiamine pyrophosphate (TPP) family, Acetolactate synthase (ALS) subfamily, TPP-binding module; composed of proteins similar to Klebsiella pneumoniae ALS, a catabolic enzyme required for butanediol fermentation. ALS catalyzes the conversion of 2 molecules of pyruvate to acetolactate and carbon dioxide. ALS does not contain FAD, and requires TPP and a divalent metal cation for activity. Probab=40.39 E-value=29 Score=15.74 Aligned_cols=34 Identities=18% Similarity=0.194 Sum_probs=15.9 Q ss_pred CEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEEC Q ss_conf 3899737878765444579999966984998752 Q gi|254780898|r 73 KSILNLRGKLPESWHKEEEKAANDLGIQLINFPL 106 (207) Q Consensus 73 ktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~ 106 (207) |.||++-++..-...-.|.+.+.+.++..+-+-+ T Consensus 67 ~~Vv~i~GDG~f~m~~~EL~Ta~~~~lpi~iiV~ 100 (177) T cd02010 67 RKVVAVSGDGGFMMNSQELETAVRLKIPLVVLIW 100 (177) T ss_pred CEEEEEECCCCCEECCHHHHHHHHHCCCEEEEEE T ss_conf 8599992898611066688899972884039999 No 128 >TIGR02277 PaaX_trns_reg phenylacetic acid degradation operon negative regulatory protein PaaX; InterPro: IPR011965 This transcriptional regulator is always found in association with operons believed to be involved in the degradation of phenylacetic acid . The gene product has been shown to bind to the promoter sites and repress their transcription .. Probab=38.69 E-value=31 Score=15.58 Aligned_cols=44 Identities=18% Similarity=0.243 Sum_probs=35.5 Q ss_pred HHHHHHHHCCCCCCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEC Q ss_conf 878887607872502894771389998999999997499389973 Q gi|254780898|r 34 FLTITTFTQNFHAVVPHEIYRSAQPNGTFIEYLKKEYGIKSILNL 78 (207) Q Consensus 34 ~~~~~~~~~Nf~~V~~g~lyRS~qP~~~~~~~l~k~~GIktIInL 78 (207) .......+.-|+.+.|+ ++.+.+|.++++..++++.||-.+.-. T Consensus 107 e~r~~L~w~GFG~laP~-l~~~P~~~~~~v~tllek~G~~~~~~~ 150 (287) T TIGR02277 107 ELRKELVWSGFGLLAPS-LWVAPSPLPEDVETLLEKLGVADIAPA 150 (287) T ss_pred HHHHHHHHCCCCCCCCC-EEEECCCHHHHHHHHHHHCCCCCHHHH T ss_conf 99999987487511571-046058528899999986388600255 No 129 >cd01019 ZnuA Zinc binding protein ZnuA. These proteins have been shown to function as initial receptors in the ABC uptake of Zn2+. They belong to the TroA superfamily of periplasmic metal binding proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a single helix and bind their specific ligands in the cleft between these domains. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence). Probab=37.62 E-value=32 Score=15.47 Aligned_cols=49 Identities=4% Similarity=0.071 Sum_probs=24.6 Q ss_pred HHHHHHHCCCEEEEE-ECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCC Q ss_conf 799999669849987-52688898989999999999427994999828998 Q gi|254780898|r 90 EEKAANDLGIQLINF-PLSATRELNDEQIKQLISILKTAPKPLLIHCKSGA 139 (207) Q Consensus 90 e~~~~~~~gi~~i~i-p~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~aG~ 139 (207) ..-++++.|++.+.+ .......|+...+.++.+.+++.+-++ |-|..+. T Consensus 189 f~Yfa~~ygl~~~~~~~~~~~~eps~~~l~~l~~~ik~~~v~~-If~e~~~ 238 (286) T cd01019 189 YGYFEKRYGLTQAGVFTIDPEIDPGAKRLAKIRKEIKEKGATC-VFAEPQF 238 (286) T ss_pred HHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHCCCCE-EEEECCC T ss_conf 6899986698386787224677899999999999999839988-9982898 No 130 >cd07021 Clp_protease_NfeD_like Nodulation formation efficiency D (NfeD) is a membrane-bound ClpP-class protease. Nodulation formation efficiency D (NfeD; stomatin operon partner protein, STOPP; DUF107) is a member of membrane-anchored ClpP-class proteases. Currently, more than 300 NfeD homologs have been identified - all of which are bacterial or archaeal in origin. Majority of these genomes have been shown to possess operons containing a homologous NfeD/stomatin gene pair, causing NfeD to be previously named STOPP (stomatin operon partner protein). NfeD homologs can be divided into two groups: long and short forms. Long-form homologs have a putative ClpP-class serine protease domain while the short form homologs do not. Downstream from the ClpP-class domain is the so-called NfeD or DUF107 domain. N-terminal region of the NfeD homolog PH1510 (1510-N or PH1510-N) from Pyrococcus horikoshii has been shown to possess serine protease activity and has a Ser-Lys catalytic dyad, preferentiall Probab=37.46 E-value=32 Score=15.46 Aligned_cols=73 Identities=12% Similarity=0.304 Sum_probs=42.6 Q ss_pred EEECCCCCCCC---CCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHH Q ss_conf 99737878765---444579999966984998752688898989999999999427994999828998889999999 Q gi|254780898|r 75 ILNLRGKLPES---WHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKSGADRTGLASAV 148 (207) Q Consensus 75 IInLr~~~~~~---~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~aG~~RTG~~~al 148 (207) ||.+.+.-+.. +.....+.+++.|-..+-+.+.+-+- .......+.+.+.+.+.||.+.+..-+-..|.++++ T Consensus 3 vi~i~g~I~~~~~~~l~r~l~~A~~~~a~~ivl~idTpGG-~v~~~~~I~~~I~~~~~pvv~~V~~~AaSAG~~Ia~ 78 (178) T cd07021 3 VIPIEGEIDPGLAAFVERALKEAKEEGADAVVLDIDTPGG-RVDSALEIVDLILNSPIPTIAYVNDRAASAGALIAL 78 (178) T ss_pred EEEECCEECHHHHHHHHHHHHHHHHCCCCEEEEEEECCCC-CHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHH T ss_conf 9997356868899999999999996899789999979998-689999999999848999999999920779999998 No 131 >TIGR03167 tRNA_sel_U_synt tRNA 2-selenouridine synthase. The Escherichia coli YbbB protein was shown to encode a selenophosphate-dependent tRNA 2-selenouridine synthase, essential for modification of some tRNAs to replace a sulfur atom with selenium. This enzyme works with SelD, the selenium donor protein, which also acts in selenocysteine incorporation. Although the members of this protein family show a fairly deep split, sequences from both sides of the split are supported by co-occurence with, and often proximity to, the selD gene. Probab=37.46 E-value=32 Score=15.46 Aligned_cols=118 Identities=19% Similarity=0.321 Sum_probs=55.0 Q ss_pred EECCCCCHHHHHHHHHHCCCCEEEECCC---------------------------CCCCCCCHHHHHHHHHCCCEEEEEE Q ss_conf 7138999899999999749938997378---------------------------7876544457999996698499875 Q gi|254780898|r 53 YRSAQPNGTFIEYLKKEYGIKSILNLRG---------------------------KLPESWHKEEEKAANDLGIQLINFP 105 (207) Q Consensus 53 yRS~qP~~~~~~~l~k~~GIktIInLr~---------------------------~~~~~~~~~e~~~~~~~gi~~i~ip 105 (207) +|+|+-|..-. .++++.|++ |.-|.| ....+- ..-....++.|.+.+|+- T Consensus 82 wRGG~RS~s~~-~~L~~~G~~-v~~l~GGYKayR~~v~~~l~~~~~~~~~~vl~G~TG~GK-T~iL~~L~~~G~qviDLE 158 (311) T TIGR03167 82 WRGGMRSGSLA-WLLAQIGFR-VPRLEGGYKAYRRFVIDQLEELPQPFPLIVLGGMTGSGK-TELLHALANAGAQVLDLE 158 (311) T ss_pred CCCCHHHHHHH-HHHHHCCCC-CCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCH-HHHHHHHHHCCCCCCCHH T ss_conf 47618789999-999985998-431578389999999999971454687699878888778-999999997699742589 Q ss_pred -----------C-CCCCCCCHHHHHHHHHH-HH--CCCCCEEEECCCC-CCHHHHHHHHHHHHHCC------CCHHHHHH Q ss_conf -----------2-68889898999999999-94--2799499982899-88899999999999839------99899999 Q gi|254780898|r 106 -----------L-SATRELNDEQIKQLISI-LK--TAPKPLLIHCKSG-ADRTGLASAVYLYIVAH------YPKEEAHR 163 (207) Q Consensus 106 -----------~-~~~~~~~~~~i~~~~~i-l~--~~~~PVlVHC~aG-~~RTG~~~alyli~~~g------~~~~eA~~ 163 (207) - .....|+....+..+.. +. +..+||+|-|-+= .|+.-+=.++|-....+ .|.++-.+ T Consensus 159 glAnHRGS~FG~~~~~~QPsQk~FEn~l~~~l~~~~~~~~i~vE~ES~~IG~~~iP~~l~~~M~~a~~i~i~~~le~Rv~ 238 (311) T TIGR03167 159 GLANHRGSSFGALGLGPQPSQKRFENALAEALRRLDPGRPIFVEDESRRIGRVALPDALFEAMRAAPLVELEASLEERVE 238 (311) T ss_pred HHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCHHHHHHHHCCCEEEEECCHHHHHH T ss_conf 98631465346888899978999999999999817888956996033130571389999999851998999899999999 Q ss_pred HHHHCCCCCC Q ss_conf 6100026426 Q gi|254780898|r 164 QLSMLYGHFP 173 (207) Q Consensus 164 el~~~ygh~~ 173 (207) .+.-.|.++. T Consensus 239 ~l~~~Y~~~~ 248 (311) T TIGR03167 239 RLVEEYAGFE 248 (311) T ss_pred HHHHHHHHHH T ss_conf 9999863043 No 132 >TIGR00676 fadh2 5,10-methylenetetrahydrofolate reductase; InterPro: IPR004620 The enzyme activities methylenetetrahydrofolate reductase (1.5.1.20 from EC) and 5,10-methylenetetrahydrofolate reductase (FADH) (1.7.99.5 from EC) differ in that the former (assigned in many eukaryotes) is defined to use NADP+ as an acceptor, while the latter (assigned in many bacteria) is flexible with respect to the acceptor. Both convert 5-methyltetrahydrofolate to 5,10-methylenetetrahydrofolate. From a larger set of proteins assigned as one or the other, this family describes the subset of proteins found in bacteria, and currently designated 5,10-methylenetetrahydrofolate reductase. This protein is an FAD-containing flavoprotein.; GO: 0009086 methionine biosynthetic process. Probab=37.43 E-value=32 Score=15.45 Aligned_cols=24 Identities=46% Similarity=0.693 Sum_probs=16.4 Q ss_pred HHHHHHH---HHHCCCCEEEECCCCCC Q ss_conf 8999999---99749938997378787 Q gi|254780898|r 60 GTFIEYL---KKEYGIKSILNLRGKLP 83 (207) Q Consensus 60 ~~~~~~l---~k~~GIktIInLr~~~~ 83 (207) .+++... .++.||+.||=||||.+ T Consensus 87 ~~e~~~~L~~y~~~Gi~~ilALRGD~p 113 (302) T TIGR00676 87 REEIEEILREYRELGIRNILALRGDPP 113 (302) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 899999999998748867987437688 No 133 >COG1001 AdeC Adenine deaminase [Nucleotide transport and metabolism] Probab=36.46 E-value=33 Score=15.36 Aligned_cols=135 Identities=16% Similarity=0.085 Sum_probs=70.6 Q ss_pred ECCCCCHHHHHHHHHHCCCCEEEECCCC----CCCCCCHHHHHHHHHCCCEEEEEECCCCC----------CCCHH---- Q ss_conf 1389998999999997499389973787----87654445799999669849987526888----------98989---- Q gi|254780898|r 54 RSAQPNGTFIEYLKKEYGIKSILNLRGK----LPESWHKEEEKAANDLGIQLINFPLSATR----------ELNDE---- 115 (207) Q Consensus 54 RS~qP~~~~~~~l~k~~GIktIInLr~~----~~~~~~~~e~~~~~~~gi~~i~ip~~~~~----------~~~~~---- 115 (207) -|+..+|..|++..-.+|..|||+=--+ ........-.+.+++..++.+-. .++.- ..+.+ T Consensus 87 ESSm~tP~~FA~~Vlp~GtTtvV~DPHEIaNV~G~~Gi~~ml~~a~~~pl~~~~~-~pScVPat~~Et~Ga~l~a~~i~e 165 (584) T COG1001 87 ESSMLTPSEFARAVLPHGTTTVVSDPHEIANVLGEDGIRFMLDEAKETPLKVYVM-LPSCVPATPFETSGAELTAEDIKE 165 (584) T ss_pred CCCCCCHHHHHHHHHCCCCEEEEECCHHHHHHCCHHHHHHHHHHHHHCCEEEEEE-CCCCCCCCCCCCCCCEECHHHHHH T ss_conf 0036689999988651785589617289876136889999998775297689996-666766775346784603999999 Q ss_pred ------------------------HHHHHHHHHHCCCCCEEEECCC--CCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCC Q ss_conf ------------------------9999999994279949998289--98889999999999983999899999610002 Q gi|254780898|r 116 ------------------------QIKQLISILKTAPKPLLIHCKS--GADRTGLASAVYLYIVAHYPKEEAHRQLSMLY 169 (207) Q Consensus 116 ------------------------~i~~~~~il~~~~~PVlVHC~a--G~~RTG~~~alyli~~~g~~~~eA~~el~~~y 169 (207) .+..-++.....+++|-=||-. |+.=++-+.|=..--++..+.|||+..|..++ T Consensus 166 ~~~~p~Vigl~E~Mn~pgVi~~D~~~l~kl~a~~~~~k~VdGHapgl~g~~Ln~Y~aaGi~tDHE~~t~EEa~~klr~Gm 245 (584) T COG1001 166 LLEHPEVIGLGEMMNFPGVIEGDPDMLAKLEAARKAGKPVDGHAPGLSGKELNAYIAAGISTDHESTTAEEALEKLRLGM 245 (584) T ss_pred HHHCCCCCCHHHHCCCCHHCCCCHHHHHHHHHHHHCCCEECCCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCC T ss_conf 86287714235532771021598889999999987498534648999807888898538886765499999999995786 Q ss_pred C-CCC-CCCHHHHHHHHHHHHH Q ss_conf 6-426-7424599999999998 Q gi|254780898|r 170 G-HFP-VLKTITMDITFEKITQ 189 (207) Q Consensus 170 g-h~~-~~~~~~ld~~~~~~~~ 189 (207) - -+. .+-...+..+...+.+ T Consensus 246 ~i~iReGS~a~dl~~l~~~i~e 267 (584) T COG1001 246 KIMIREGSAAKDLAALLPAITE 267 (584) T ss_pred EEEEECCCHHHHHHHHHHHHHH T ss_conf 8999857415509999999761 No 134 >PRK06036 translation initiation factor IF-2B subunit alpha; Provisional Probab=35.87 E-value=34 Score=15.30 Aligned_cols=30 Identities=30% Similarity=0.424 Sum_probs=18.6 Q ss_pred HHHHCCCCCEEEECCCCCCHH---HHHHHHHHHH Q ss_conf 999427994999828998889---9999999999 Q gi|254780898|r 122 SILKTAPKPLLIHCKSGADRT---GLASAVYLYI 152 (207) Q Consensus 122 ~il~~~~~PVlVHC~aG~~RT---G~~~alyli~ 152 (207) +++.+ +..||-||.+|.==| |++.+..+.. T Consensus 143 ~li~d-g~~ILTHCNaG~LAT~g~GTAlgvIr~a 175 (339) T PRK06036 143 KLLKD-GDTVLTHCNAGRLACVDWGTALGVVRSA 175 (339) T ss_pred HHHCC-CCEEEEECCCCCHHHCCCCCHHHHHHHH T ss_conf 98289-9829964488722316753399999999 No 135 >smart00518 AP2Ec AP endonuclease family 2. These endonucleases play a role in DNA repair. Cleave phosphodiester bonds at apurinic or apyrimidinic sites Probab=35.77 E-value=34 Score=15.29 Aligned_cols=91 Identities=18% Similarity=0.285 Sum_probs=57.6 Q ss_pred EEECCCCCHHHH---HHHHHHCCCCE------EEECCCCCCCC------CCHHHHHHHHHCCCEEEEEECCCCC-CCCHH Q ss_conf 771389998999---99999749938------99737878765------4445799999669849987526888-98989 Q gi|254780898|r 52 IYRSAQPNGTFI---EYLKKEYGIKS------ILNLRGKLPES------WHKEEEKAANDLGIQLINFPLSATR-ELNDE 115 (207) Q Consensus 52 lyRS~qP~~~~~---~~l~k~~GIkt------IInLr~~~~~~------~~~~e~~~~~~~gi~~i~ip~~~~~-~~~~~ 115 (207) -|++...++++. ....+++|+.. +|||.++++.- ....|.+.|+.+|+.++.+..-+.. ...++ T Consensus 36 ~w~~~~~~~~~~~~f~~~~~~~~i~~~~Ha~YlINLas~~~e~~~kS~~~l~~el~~~~~lG~~~vV~HpGs~~~~~~e~ 115 (273) T smart00518 36 SWKGVRLSEETAEKFKEALKENNIDVSVHAPYLINLASPDKEKVEKSIERLIDEIKRCEELGIKALVFHPGSYLKQSKEE 115 (273) T ss_pred CCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCHHH T ss_conf 89999999999999999999839966864421004579988999999999999999999809985873453135888899 Q ss_pred HHHHHHH----HHHCCC-CCEEEECCCCCCHH Q ss_conf 9999999----994279-94999828998889 Q gi|254780898|r 116 QIKQLIS----ILKTAP-KPLLIHCKSGADRT 142 (207) Q Consensus 116 ~i~~~~~----il~~~~-~PVlVHC~aG~~RT 142 (207) .+..+.+ +++..+ -++++=-.||.+-+ T Consensus 116 ~i~~i~~~l~~vl~~~~~~~lllEntAGqG~~ 147 (273) T smart00518 116 ALNRIIESLNEVIDETKGVVILLETTAGKGSQ 147 (273) T ss_pred HHHHHHHHHHHHHHCCCCCEEEEECCCCCCCC T ss_conf 99999999999982369867997457899884 No 136 >TIGR00707 argD acetylornithine and succinylornithine aminotransferases; InterPro: IPR004636 This family of proteins, for which ornithine aminotransferases form an outgroup, consists mostly of proteins designated acetylornithine aminotransferase. However, the two very closely related members from Escherichia coli are assigned different enzymatic activities. One is acetylornithine aminotransferase (2.6.1.11 from EC), ArgD, an enzyme of arginine biosynthesis, while another is succinylornithine aminotransferase, an enzyme of the arginine succinyltransferase pathway, an ammonia-generating pathway of arginine catabolism . Members of this family may also act on ornithine, like ornithine aminotransferase (2.6.1.13 from EC) and on succinyldiaminopimelate, like N-succinyldiaminopmelate-aminotransferase (2.6.1.17 from EC, DapC, an enzyme of lysine biosynthesis) . ; GO: 0008483 transaminase activity, 0006525 arginine metabolic process. Probab=35.37 E-value=35 Score=15.25 Aligned_cols=47 Identities=23% Similarity=0.202 Sum_probs=35.3 Q ss_pred CCCCHHHHHHHHHHHHCCCCCEEEE--CCCCCCHHHHHHHHHHHHHCC--CCHHH Q ss_conf 8989899999999994279949998--289988899999999999839--99899 Q gi|254780898|r 110 RELNDEQIKQLISILKTAPKPLLIH--CKSGADRTGLASAVYLYIVAH--YPKEE 160 (207) Q Consensus 110 ~~~~~~~i~~~~~il~~~~~PVlVH--C~aG~~RTG~~~alyli~~~g--~~~~e 160 (207) .+++.+-+..+.++.++.+ -+||+ =+.|.||||-+.|- .+.| ..++- T Consensus 201 ~pa~~~fL~a~r~lc~~~~-~LLI~DEVQtG~GRTG~~fA~---~Hygse~~PDi 251 (402) T TIGR00707 201 VPADKEFLKALRELCEDKD-ALLIFDEVQTGLGRTGKLFAY---EHYGSEVEPDI 251 (402) T ss_pred CCCCHHHHHHHHHHHHHCC-EEEEEEEEEECCCCCHHHHHH---HCCCCCCCCHH T ss_conf 3487789999999986489-189994475287743322331---20688455238 No 137 >PRK08334 translation initiation factor IF-2B subunit beta; Validated Probab=34.62 E-value=33 Score=15.37 Aligned_cols=28 Identities=29% Similarity=0.389 Sum_probs=17.2 Q ss_pred CCEEEECCCCCCHH---HHHHHHHHHH-HCCC Q ss_conf 94999828998889---9999999999-8399 Q gi|254780898|r 129 KPLLIHCKSGADRT---GLASAVYLYI-VAHY 156 (207) Q Consensus 129 ~PVlVHC~aG~~RT---G~~~alyli~-~~g~ 156 (207) +.||=||.+|+==| |++.+..+.. .+|. T Consensus 161 g~ILTHCNaGaLAT~g~GTAlgvir~a~~~gk 192 (356) T PRK08334 161 GNVLTHCNAGSLATVHLGTVGAVVRVMHKDGT 192 (356) T ss_pred CCEEEECCCCHHHHCCCCCHHHHHHHHHHCCC T ss_conf 98762158744654355336999999998699 No 138 >cd01443 Cdc25_Acr2p Cdc25 enzymes are members of the Rhodanese Homology Domain (RHOD) superfamily. Also included in this CD are eukaryotic arsenate resistance proteins such as Saccharomyces cerevisiae Acr2p and similar proteins. Cdc25 phosphatases activate the cell division kinases throughout the cell cycle progression. Cdc25 phosphatases dephosphorylate phosphotyrosine and phosphothreonine residues, in order to activate their Cdk/cyclin substrates. The Cdc25 and Acr2p RHOD domains have the signature motif (H/YCxxxxxR). Probab=34.27 E-value=36 Score=15.14 Aligned_cols=78 Identities=17% Similarity=0.152 Sum_probs=42.8 Q ss_pred CCCHHHHHHHHHHCC-----CCEEEECCCCCCCCCCHHHHHHHHHCC--CEEEEEECCCCCCCCHHHHHHHHHHHHC-CC Q ss_conf 999899999999749-----938997378787654445799999669--8499875268889898999999999942-79 Q gi|254780898|r 57 QPNGTFIEYLKKEYG-----IKSILNLRGKLPESWHKEEEKAANDLG--IQLINFPLSATRELNDEQIKQLISILKT-AP 128 (207) Q Consensus 57 qP~~~~~~~l~k~~G-----IktIInLr~~~~~~~~~~e~~~~~~~g--i~~i~ip~~~~~~~~~~~i~~~~~il~~-~~ 128 (207) ..+++.+..+++... =-.||+.|.+ |.. -| ..-+++|..+ ..+.+..+...... .. T Consensus 3 ~Is~e~L~~ll~~~~~~~~~~~~IIDcR~~-ey~-----------GGHI~gA~Nip~~~----~~~~~~~~~~~~~~~~~ 66 (113) T cd01443 3 YISPEELVALLENSDSNAGKDFVVVDLRRD-DYE-----------GGHIKGSINLPAQS----CYQTLPQVYALFSLAGV 66 (113) T ss_pred CCCHHHHHHHHHCCCCCCCCCEEEEEECCC-CCC-----------CCEECCCEECCCHH----HHHHHHHHHHHHHHCCC T ss_conf 689999999982787444685799980787-668-----------87745858657475----88878999887664386 Q ss_pred CCEEEECCCCCCHHHHHHHHHH Q ss_conf 9499982899888999999999 Q gi|254780898|r 129 KPLLIHCKSGADRTGLASAVYL 150 (207) Q Consensus 129 ~PVlVHC~aG~~RTG~~~alyl 150 (207) ..+.+||..-..|...++-..+ T Consensus 67 ~~iIfhCe~Sq~RGP~~A~~lr 88 (113) T cd01443 67 KLAIFYCGSSQGRGPRAARWFA 88 (113) T ss_pred EEEEEEECCCCCCHHHHHHHHH T ss_conf 4999980233420199999999 No 139 >cd05567 PTS_IIB_mannitol PTS_IIB_mannitol: subunit IIB of enzyme II (EII) of the mannitol-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS). In this system, EII is a mannitol-specific permease with two cytoplasmic domains (IIA and IIB) and a transmembrane channel IIC domain. The IIA, IIB, and IIC domains are expressed from the mtlA gene as a single protein, also known as the mannitol PTS permease, the mtl transporter, or MtlA. MtlA is only functional as a dimer with the dimer contacts occuring between the IIC domains. MtlA takes up exogenous mannitol releasing the phosphate ester into the cytoplasm in preparation for oxidation to fructose-6-phosphate by the NAD-dependent mannitol-P dehydrogenase (MtlD). The IIB domain fold includes a central four-stranded parallel open twisted beta-sheet flanked by alpha-helices on both sides. The seven major PTS systems with this IIB fold include mannitol, chitobiose/lichenan, ascorbate, lactose, galactitol, fructose, and a s Probab=34.07 E-value=36 Score=15.12 Aligned_cols=27 Identities=15% Similarity=0.406 Sum_probs=21.0 Q ss_pred CCEEEECCCCCCHHHHHHHHHHHHHCC Q ss_conf 949998289988899999999999839 Q gi|254780898|r 129 KPLLIHCKSGADRTGLASAVYLYIVAH 155 (207) Q Consensus 129 ~PVlVHC~aG~~RTG~~~alyli~~~g 155 (207) +.|++.|-+|.|.+.+.+...+-.... T Consensus 1 kkIvfACDAGmGSSaMgas~lrk~lk~ 27 (87) T cd05567 1 KKIVFACDAGMGSSAMGASVLRKKLKK 27 (87) T ss_pred CEEEEECCCCCCHHHHHHHHHHHHHHH T ss_conf 959999289985899999999999998 No 140 >TIGR02624 rhamnu_1P_ald rhamnulose-1-phosphate aldolase; InterPro: IPR013447 Proteins in this entry match the enzyme RhaD, rhamnulose-1-phosphate aldolase (4.1.2.19 from EC).; GO: 0008994 rhamnulose-1-phosphate aldolase activity. Probab=33.78 E-value=37 Score=15.09 Aligned_cols=92 Identities=14% Similarity=0.209 Sum_probs=53.0 Q ss_pred CCCHHHHHHHHH-HCCCCEEEECCCCCCC-----------C-CCHHHHHHHHHCCCEEEE---------E--ECCCCCCC Q ss_conf 999899999999-7499389973787876-----------5-444579999966984998---------7--52688898 Q gi|254780898|r 57 QPNGTFIEYLKK-EYGIKSILNLRGKLPE-----------S-WHKEEEKAANDLGIQLIN---------F--PLSATREL 112 (207) Q Consensus 57 qP~~~~~~~l~k-~~GIktIInLr~~~~~-----------~-~~~~e~~~~~~~gi~~i~---------i--p~~~~~~~ 112 (207) ..+.++++++.. -..+.--|.|..+-.. + ++.+ .....+.++-.+. + .+.+.+.| T Consensus 36 ~L~~eEv~~Y~~~~~q~~R~Ipl~fd~~~LAg~yFlVTGSGKyFkn-v~~~P~~nLg~~rVs~dG~~~~llWGl~dgg~P 114 (273) T TIGR02624 36 RLKEEEVEPYLDDFKQVLRKIPLKFDAPELAGKYFLVTGSGKYFKN-VEENPAENLGVLRVSEDGESVHLLWGLTDGGLP 114 (273) T ss_pred CCCHHHHHHHHCCCCCCEEEEECCCCHHHHCCCCEEEECCCHHHHH-HHCCCCCCEEEEEECCCCCEEEEEECCCCCCCC T ss_conf 1576789886146241103673146868856870699356532221-111701464678981588745653011588966 Q ss_pred CHHHHHHHHHHH---H-CC-CCCEEEECCCCCCHHHHHHHHHHHHH Q ss_conf 989999999999---4-27-99499982899888999999999998 Q gi|254780898|r 113 NDEQIKQLISIL---K-TA-PKPLLIHCKSGADRTGLASAVYLYIV 153 (207) Q Consensus 113 ~~~~i~~~~~il---~-~~-~~PVlVHC~aG~~RTG~~~alyli~~ 153 (207) +-|--..|..-+ + +. +..|.+||+| |=+++.-|-+.. T Consensus 115 TSELpsHfmSHI~Rl~~dpG~~RViMHcHA----TnLiAmtft~eL 156 (273) T TIGR02624 115 TSELPSHFMSHIARLKVDPGENRVIMHCHA----TNLIAMTFTHEL 156 (273) T ss_pred CHHHHHHHHHHHHHHHCCCCCCCEEEECCH----HHHHHHHHCCCC T ss_conf 414577899899976068678707861575----576555410004 No 141 >pfam08659 KR KR domain. This enzymatic domain is part of bacterial polyketide synthases and catalyses the first step in the reductive modification of the beta-carbonyl centres in the growing polyketide chain. It uses NADPH to reduce the keto group to a hydroxy group. Probab=33.78 E-value=37 Score=15.09 Aligned_cols=80 Identities=15% Similarity=0.197 Sum_probs=42.0 Q ss_pred HHHHHHHHHCCCCEEEEC-CCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCC--CEEEECCC Q ss_conf 999999997499389973-787876544457999996698499875268889898999999999942799--49998289 Q gi|254780898|r 61 TFIEYLKKEYGIKSILNL-RGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPK--PLLIHCKS 137 (207) Q Consensus 61 ~~~~~l~k~~GIktIInL-r~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~--PVlVHC~a 137 (207) ..+.+.+.+.|-+.|+-. |.+..........+..+..|.+..-+..+ .-+++.++.+.+.+.+.-+ -++||| + T Consensus 14 ~aia~~la~~Ga~~vvl~~r~~~~~~~~~~~~~~l~~~g~~~~~~~~D---v~d~~~v~~~~~~~~~~~g~id~lvnn-A 89 (181) T pfam08659 14 LELARWLAERGARHLVLLSRSGAPDPEAEALLAELEARGAEVTVVACD---VSDRDAVAALLAEIRADGPPLRGVIHA-A 89 (181) T ss_pred HHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHCCCEEEEEECC---CCCHHHHHHHHHHHHHHCCCEEEEEEE-C T ss_conf 999999998799789998689766299999999999659969999756---899999998886579873984899954-4 Q ss_pred CCCHHHH Q ss_conf 9888999 Q gi|254780898|r 138 GADRTGL 144 (207) Q Consensus 138 G~~RTG~ 144 (207) |..+.+. T Consensus 90 G~~~~~~ 96 (181) T pfam08659 90 GVLRDAL 96 (181) T ss_pred CCCCCCC T ss_conf 6678856 No 142 >PRK00142 hypothetical protein; Provisional Probab=33.75 E-value=37 Score=15.09 Aligned_cols=41 Identities=20% Similarity=0.212 Sum_probs=27.7 Q ss_pred CCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCHHHHHHH Q ss_conf 799499982899888999999999998399989999961000264267424599999 Q gi|254780898|r 127 APKPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQLSMLYGHFPVLKTITMDIT 183 (207) Q Consensus 127 ~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el~~~ygh~~~~~~~~ld~~ 183 (207) .++||++.|..|. |.--+++. +. ..|+ -.+-+.+.|++.++ T Consensus 170 k~kkI~myCTGGI-RCEKasa~-l~-~~Gf-------------~~VyqL~GGIl~Yl 210 (315) T PRK00142 170 KDKKVVMYCTGGI-RCEKASAW-MK-EEGF-------------KEVYQLEGGIITYG 210 (315) T ss_pred CCCCEEEECCCCE-EEEECHHH-HH-HCCC-------------CEEEEECCCHHHHH T ss_conf 4996899889964-78650598-98-6199-------------85799553399998 No 143 >PRK05875 short chain dehydrogenase; Provisional Probab=33.06 E-value=38 Score=15.01 Aligned_cols=80 Identities=14% Similarity=0.134 Sum_probs=33.8 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHC-CCC-CEEEECCCC Q ss_conf 999999997499389973787876544457999996698499875268889898999999999942-799-499982899 Q gi|254780898|r 61 TFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKT-APK-PLLIHCKSG 138 (207) Q Consensus 61 ~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~-~~~-PVlVHC~aG 138 (207) ..+...+.+.|-+-|++=|.++.......+.......+-....++ .+.-+++++.++.+...+ ..+ -+||||.+| T Consensus 21 ~aiA~~la~~Ga~Vii~~r~~~~l~~~~~~l~~~~~~~~~v~~~~---~Dvs~~~~v~~~v~~~~~~~g~iD~LVnnAg~ 97 (277) T PRK05875 21 KGVAAALVAAGAAVMIVGRNPDKLAAAAEEIEALAGGAGAVRYEP---ADVTNEDEVARAVDAATAWHGRLHGVVHCAGG 97 (277) T ss_pred HHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHCCCCCEEEEEE---CCCCCHHHHHHHHHHHHHHHCCCEEEEECCCC T ss_conf 999999998799899997988999999999996127886289995---78999999999999999984995399987813 Q ss_pred CCHHH Q ss_conf 88899 Q gi|254780898|r 139 ADRTG 143 (207) Q Consensus 139 ~~RTG 143 (207) ....| T Consensus 98 ~~~~~ 102 (277) T PRK05875 98 SETIG 102 (277) T ss_pred CCCCC T ss_conf 67879 No 144 >pfam01026 TatD_DNase TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme. Probab=32.67 E-value=38 Score=14.98 Aligned_cols=25 Identities=20% Similarity=0.434 Sum_probs=15.5 Q ss_pred HHHHHHH---HHHHHCCCCCEEEECCCC Q ss_conf 8999999---999942799499982899 Q gi|254780898|r 114 DEQIKQL---ISILKTAPKPLLIHCKSG 138 (207) Q Consensus 114 ~~~i~~~---~~il~~~~~PVlVHC~aG 138 (207) +.|...| +++..+.+.||.+||... T Consensus 107 ~~Q~~~F~~ql~lA~~~~lPvilH~r~a 134 (255) T pfam01026 107 EAQEEVFRRQLELAKELDLPVVIHTRDA 134 (255) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEEECCH T ss_conf 9999999999999987089748984553 No 145 >smart00400 ZnF_CHCC zinc finger. Probab=32.63 E-value=36 Score=15.10 Aligned_cols=34 Identities=18% Similarity=0.214 Sum_probs=28.0 Q ss_pred CEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 499982899888999999999998399989999961 Q gi|254780898|r 130 PLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQL 165 (207) Q Consensus 130 PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el 165 (207) .=+.||.+ -++.|-++-.+ +..+|.+-.||+++| T Consensus 21 k~~~~CF~-Cg~~Gd~i~fv-~~~~~~~f~eAv~~L 54 (55) T smart00400 21 KQFFHCFG-CGAGGNVISFL-MKYDKLSFVEAVKKL 54 (55) T ss_pred CCEEEECC-CCCCCCHHHHH-HHHHCCCHHHHHHHH T ss_conf 99799678-99998655468-988498999999986 No 146 >pfam02263 GBP Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP. Probab=32.13 E-value=28 Score=15.83 Aligned_cols=10 Identities=20% Similarity=0.581 Sum_probs=4.0 Q ss_pred CCCCHHHHHH Q ss_conf 8989899999 Q gi|254780898|r 110 RELNDEQIKQ 119 (207) Q Consensus 110 ~~~~~~~i~~ 119 (207) +.+++..++. T Consensus 115 g~I~e~~l~~ 124 (264) T pfam02263 115 QTINQQALQQ 124 (264) T ss_pred CCCCHHHHHH T ss_conf 8868899999 No 147 >TIGR01829 AcAcCoA_reduct acetoacetyl-CoA reductase; InterPro: IPR011283 This entry represent acetoacetyl-CoA reductase, a member of the family short-chain-alcohol dehydrogenases. Note that, despite the precision implied by the enzyme name, the reaction of 1.1.1.36 from EC is defined more generally as (R)-3-hydroxyacyl-CoA + NADP+ = 3-oxoacyl-CoA + NADPH. Members of this family may act in the biosynthesis of poly-beta-hydroxybutyrate (e.g. Rhizobium meliloti) and related poly-beta-hydroxyalkanoates. Note that the member of this family from Azospirillum brasilense, designated NodG, appears to lack acetoacetyl-CoA reductase activity and to act instead in the production of nodulation factor. ; GO: 0018454 acetoacetyl-CoA reductase activity, 0042619 poly-hydroxybutyrate biosynthetic process, 0005737 cytoplasm. Probab=32.02 E-value=39 Score=14.91 Aligned_cols=86 Identities=16% Similarity=0.180 Sum_probs=58.7 Q ss_pred HHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCC--EEEECCCCCCH Q ss_conf 9999974993899737878765444579999966984998752688898989999999999427994--99982899888 Q gi|254780898|r 64 EYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKP--LLIHCKSGADR 141 (207) Q Consensus 64 ~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~P--VlVHC~aG~~R 141 (207) .+|.+ .|.|-|=|..+.++ +...+..++.++.=.++.+.-.+.-+-|.-...++-++...+| |||=| ||.-| T Consensus 18 ~rL~~-dG~~V~An~~p~N~----~~a~~W~~~~~~~g~~~~~~~~DV~~~e~c~~~v~~v~a~lGpvDvLVNN-AGITR 91 (244) T TIGR01829 18 TRLAK-DGYRVAANYVPSNE----ERAEAWLQEQGAQGFDFAVVEGDVSSFEDCKAAVAKVEAELGPVDVLVNN-AGITR 91 (244) T ss_pred HHHHH-CCCEEEECCCCCCH----HHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEEEC-CCCCC T ss_conf 99987-59889881789825----89999998626985147898727677789999999999711953689868-86440 Q ss_pred HHHHHHHHHHHHCCCCH Q ss_conf 99999999999839998 Q gi|254780898|r 142 TGLASAVYLYIVAHYPK 158 (207) Q Consensus 142 TG~~~alyli~~~g~~~ 158 (207) =.++- .|..+.|+. T Consensus 92 D~~F~---KM~~~qW~~ 105 (244) T TIGR01829 92 DKTFK---KMTYEQWSE 105 (244) T ss_pred CHHHC---CCCHHHHHH T ss_conf 30312---499846888 No 148 >PRK08292 AMP nucleosidase; Provisional Probab=32.01 E-value=39 Score=14.91 Aligned_cols=40 Identities=25% Similarity=0.478 Sum_probs=24.5 Q ss_pred HCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCC Q ss_conf 66984998752688898989999999999427994999828998 Q gi|254780898|r 96 DLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKSGA 139 (207) Q Consensus 96 ~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~aG~ 139 (207) ..||+.++|.+ ++++...+...+.++...--=++=|| +|. T Consensus 273 ~~GiTlvNiGv---GpsNAktitDhlavlrP~awlmlGhC-gGL 312 (487) T PRK08292 273 GQGITLVNIGV---GPSNAKTITDHLAVLRPHAWLMIGHC-GGL 312 (487) T ss_pred CCCEEEEEECC---CCCHHHHHHHHHHHCCCCEEEEEECC-CCC T ss_conf 99538998366---88407789888875087659998413-576 No 149 >cd01535 4RHOD_Repeat_4 Member of the Rhodanese Homology Domain superfamily, repeat 4. This CD includes putative rhodanese-related sulfurtransferases which contain 4 copies of the Rhodanese Homology Domain. This CD aligns the 4th repeat which, in general, contains the putative catalytic Cys residue. Probab=31.79 E-value=39 Score=14.88 Aligned_cols=66 Identities=17% Similarity=0.159 Sum_probs=37.8 Q ss_pred HHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCCHHH Q ss_conf 99999749938997378787654445799999669849987526888989899999999994279949998289988899 Q gi|254780898|r 64 EYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKSGADRTG 143 (207) Q Consensus 64 ~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~aG~~RTG 143 (207) +.++.+-|=-.||++|...+. .+- |||- ..-.....+.+.+.-+. ...|+++.|.+|. |+. T Consensus 3 ~~wL~~~~~~~viDVr~s~ey----------~~g-----HIpG--A~~~~r~~L~~~l~~lp-~~~~vVl~c~~g~-~A~ 63 (145) T cd01535 3 AAWLGEGGQTAVVDVTASANY----------VKR-----HIPG--AWWVLRAQLAQALEKLP-AAERYVLTCGSSL-LAR 63 (145) T ss_pred HHHHHCCCCEEEEECCCHHHH----------HHC-----CCCC--CEECCHHHHHHHHHHCC-CCCCEEEEECCCH-HHH T ss_conf 899857998799975882888----------611-----3235--76476989999998568-8887799948956-999 Q ss_pred HHHHH Q ss_conf 99999 Q gi|254780898|r 144 LASAV 148 (207) Q Consensus 144 ~~~al 148 (207) .+++- T Consensus 64 ~AA~~ 68 (145) T cd01535 64 FAAAD 68 (145) T ss_pred HHHHH T ss_conf 99999 No 150 >pfam12091 DUF3567 Protein of unknown function (DUF3567). This family of proteins is functionally uncharacterized. This protein is found in bacteria. Proteins in this family are about 90 amino acids in length. This protein has a conserved EIVDK sequence motif. Probab=31.62 E-value=38 Score=14.99 Aligned_cols=60 Identities=12% Similarity=0.223 Sum_probs=35.8 Q ss_pred HHHCCCCEEEECCCCCCCCCCHHH-HHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHC-CCCCEEEE Q ss_conf 997499389973787876544457-999996698499875268889898999999999942-79949998 Q gi|254780898|r 67 KKEYGIKSILNLRGKLPESWHKEE-EKAANDLGIQLINFPLSATRELNDEQIKQLISILKT-APKPLLIH 134 (207) Q Consensus 67 ~k~~GIktIInLr~~~~~~~~~~e-~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~-~~~PVlVH 134 (207) +.+.|+. ||+=|...|- +.+.. .+.+.+ + +.-.....|+.+.++.++.-.+. ...||.+| T Consensus 24 l~r~G~E-IVDK~a~rEi-fl~Gs~Ae~Fr~-~-----V~~l~~~ePs~EEvdd~L~~y~~l~~~Pvv~H 85 (85) T pfam12091 24 LARGGYE-IVDKRAGKEI-YLDGSLAEMFRE-D-----VKALIEDEPTQEEVDDFLGQYDGLMQQPVVLH 85 (85) T ss_pred HHCCCCE-EEECCCCCEE-EECCHHHHHHHH-H-----HHHHHHCCCCHHHHHHHHHHHHHHHHCCEECC T ss_conf 3149828-7535777358-877489999999-9-----99998719988999999998888860874249 No 151 >PRK05451 dihydroorotase; Provisional Probab=31.29 E-value=40 Score=14.83 Aligned_cols=31 Identities=23% Similarity=0.335 Sum_probs=23.1 Q ss_pred CCCCCCCHHHHHHHHHHHHCCCCCEEEECCC Q ss_conf 6888989899999999994279949998289 Q gi|254780898|r 107 SATRELNDEQIKQLISILKTAPKPLLIHCKS 137 (207) Q Consensus 107 ~~~~~~~~~~i~~~~~il~~~~~PVlVHC~a 137 (207) ++.+..+.+.+...++.|++.+-|+++||-. T Consensus 110 S~~GV~~~~~~~~~le~m~~~g~pLliHGEv 140 (345) T PRK05451 110 SDAGVTDIEKIYPVLEAMQKIGMPLLVHGEV 140 (345) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCCEEECCCC T ss_conf 7421423999999999999759904521454 No 152 >PRK09282 pyruvate carboxylase subunit B; Validated Probab=30.69 E-value=41 Score=14.77 Aligned_cols=78 Identities=28% Similarity=0.371 Sum_probs=45.9 Q ss_pred HHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHC-CCCCEEEECCCCCCHHHHHHHHHHHHHC-CCCH-HHHHHHHH Q ss_conf 7999996698499875268889898999999999942-7994999828998889999999999983-9998-99999610 Q gi|254780898|r 90 EEKAANDLGIQLINFPLSATRELNDEQIKQLISILKT-APKPLLIHCKSGADRTGLASAVYLYIVA-HYPK-EEAHRQLS 166 (207) Q Consensus 90 e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~-~~~PVlVHC~aG~~RTG~~~alyli~~~-g~~~-~eA~~el~ 166 (207) -.....+.|...+-|. +..+..++..+..+...+++ .+-||-+||+. -+|+..++|+...+ |.+. +-|..-|+ T Consensus 160 ~a~~l~~~G~d~i~iK-DmaGll~P~~a~~LV~alk~~~~lpI~~HtH~---t~G~~~a~~l~A~eAGvdivD~a~~~~s 235 (580) T PRK09282 160 LAKQLEEMGCDSICIK-DMAGLLTPYAAYELVSALKKEVDLPVHLHCHA---TTGLATMTYLKAVEAGVDILDTAISSMS 235 (580) T ss_pred HHHHHHHCCCCEEEEE-CCCCCCCHHHHHHHHHHHHHHCCCCEEEEECC---CCCHHHHHHHHHHHHCCCEEEECCCCCC T ss_conf 9999997499889972-66557898999999999998619816998247---7647999999999848886700135534 Q ss_pred HCCCC Q ss_conf 00264 Q gi|254780898|r 167 MLYGH 171 (207) Q Consensus 167 ~~ygh 171 (207) ..++| T Consensus 236 ~gtsq 240 (580) T PRK09282 236 MGTSH 240 (580) T ss_pred CCCCC T ss_conf 88789 No 153 >pfam04909 Amidohydro_2 Amidohydrolase. These proteins are amidohydrolases that are related to pfam01979. Probab=30.67 E-value=41 Score=14.77 Aligned_cols=76 Identities=14% Similarity=0.104 Sum_probs=31.7 Q ss_pred HHHHHHHCCCCEEEECCCCCCCC-----CCHHHHHHHHHCCCEEE-EEECCCCC-CCCHHHHHHHHHHHHCCCC-C-EEE Q ss_conf 99999974993899737878765-----44457999996698499-87526888-9898999999999942799-4-999 Q gi|254780898|r 63 IEYLKKEYGIKSILNLRGKLPES-----WHKEEEKAANDLGIQLI-NFPLSATR-ELNDEQIKQLISILKTAPK-P-LLI 133 (207) Q Consensus 63 ~~~l~k~~GIktIInLr~~~~~~-----~~~~e~~~~~~~gi~~i-~ip~~~~~-~~~~~~i~~~~~il~~~~~-P-VlV 133 (207) +.+.+.+.|++.|--.-...... .+..-.+.|+++|+... |....... .........+..++++-++ + |+. T Consensus 89 l~r~~~~~g~~Gv~l~~~~~~~~~~~~~~~~~~~~~~~e~~~pv~iH~~~~~~~~~~~~~~~~~~~~~~~~~P~l~iv~~ 168 (270) T pfam04909 89 LERRLAELGFRGVRLNPHPGGGPLLDPRLDRPIYEALAELGLPVDLHTGTGFAGEDTDASQPLLLEGLARRFPDLKIVLD 168 (270) T ss_pred HHHHCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHCCCCCEEEE T ss_conf 99974838806999768578877766688999999999869989994588877656461369999999997899859997 Q ss_pred ECCCC Q ss_conf 82899 Q gi|254780898|r 134 HCKSG 138 (207) Q Consensus 134 HC~aG 138 (207) ||..+ T Consensus 169 H~G~~ 173 (270) T pfam04909 169 HAGGP 173 (270) T ss_pred CCCCC T ss_conf 89888 No 154 >PRK11449 putative deoxyribonuclease YjjV; Provisional Probab=30.53 E-value=41 Score=14.75 Aligned_cols=25 Identities=16% Similarity=0.413 Sum_probs=15.6 Q ss_pred HHHHHHHHCCCCCEEEECCCCCCHH Q ss_conf 9999999427994999828998889 Q gi|254780898|r 118 KQLISILKTAPKPLLIHCKSGADRT 142 (207) Q Consensus 118 ~~~~~il~~~~~PVlVHC~aG~~RT 142 (207) .+.+++..+.+.||.|||....+.+ T Consensus 117 ~~ql~lA~~~~lPvviH~R~A~~~~ 141 (258) T PRK11449 117 DEQLKLAKRYDLPVILHSRRTHDKL 141 (258) T ss_pred HHHHHHHHHHCCCEEEECHHHHHHH T ss_conf 9999999984997598622137999 No 155 >COG1038 PycA Pyruvate carboxylase [Energy production and conversion] Probab=30.48 E-value=42 Score=14.75 Aligned_cols=101 Identities=21% Similarity=0.260 Sum_probs=48.2 Q ss_pred CHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHC-CCCCEEEECCCCCCHHHHHHHHHHHH-HCCCCHHH-HHH Q ss_conf 4457999996698499875268889898999999999942-79949998289988899999999999-83999899-999 Q gi|254780898|r 87 HKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKT-APKPLLIHCKSGADRTGLASAVYLYI-VAHYPKEE-AHR 163 (207) Q Consensus 87 ~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~-~~~PVlVHC~aG~~RTG~~~alyli~-~~g~~~~e-A~~ 163 (207) |..-.+..++.|...+-+.= ..+...+.....++..+++ .+-||-+|-+ |-+|..+|-|+.. ..|.+.-+ |.. T Consensus 696 Y~~lA~el~~~GaHIlaIKD-MAGLLKP~AA~~Li~aLr~~~dlPIHlHTH---DTsG~~~at~~aA~~AGvDivD~A~~ 771 (1149) T COG1038 696 YVKLAKELEKAGAHILAIKD-MAGLLKPAAAYRLISALRETVDLPIHLHTH---DTSGNGVATYLAAVEAGVDIVDVAMA 771 (1149) T ss_pred HHHHHHHHHHCCCCEEEEHH-HHHCCCHHHHHHHHHHHHHHCCCCEEEECC---CCCCCHHHHHHHHHHCCCCHHHHHHH T ss_conf 99999999966772787633-342147699999999998744884487516---78871899999999838636666544 Q ss_pred HHHHCCC------------------CCCCCCHHHHHHHHHHHHHHC Q ss_conf 6100026------------------426742459999999999852 Q gi|254780898|r 164 QLSMLYG------------------HFPVLKTITMDITFEKITQLY 191 (207) Q Consensus 164 el~~~yg------------------h~~~~~~~~ld~~~~~~~~~~ 191 (207) -|+..-. .+....-..+++.|+.....| T Consensus 772 smsG~TSQPsl~si~~al~~~~r~~~ld~~~~~~i~~YWe~vR~~Y 817 (1149) T COG1038 772 SMSGLTSQPSLNSIVAALAGTERDTGLDIEAVRELSFYWEAVRKLY 817 (1149) T ss_pred HCCCCCCCCCHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 2148877864888999960898787889799998899999999865 No 156 >cd06290 PBP1_LacI_like_9 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=30.39 E-value=42 Score=14.74 Aligned_cols=95 Identities=13% Similarity=0.100 Sum_probs=52.5 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCCC----CHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECC Q ss_conf 99999999749938997378787654----44579999966984998752688898989999999999427994999828 Q gi|254780898|r 61 TFIEYLKKEYGIKSILNLRGKLPESW----HKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCK 136 (207) Q Consensus 61 ~~~~~l~k~~GIktIInLr~~~~~~~----~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~ 136 (207) .....++.+.|-|.|..+.+...... .....+++++.|+....--+.......+...+.+.++++..+.|-.|-|. T Consensus 104 ~~~~~~L~~~G~~~i~~i~~~~~~~~~~~R~~Gf~~a~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ai~~~ 183 (265) T cd06290 104 YLATQHLIDLGHRRIAHITGPRGHIDARDRLAGYRKALEEAGLEVQPDLIVQGDFEEESGLEAVEELLQRGPDFTAIFAA 183 (265) T ss_pred HHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHEECCCCCCHHHHHHHHHHHHHCCCCCCCEEC T ss_conf 99999999829986556048878701999999999999986999988883416677178999999999719998842105 Q ss_pred CCCCHHHHHHHHHHHHHCCCCH Q ss_conf 9988899999999999839998 Q gi|254780898|r 137 SGADRTGLASAVYLYIVAHYPK 158 (207) Q Consensus 137 aG~~RTG~~~alyli~~~g~~~ 158 (207) .-.-=-|++.+|.. .|... T Consensus 184 nD~~A~g~~~~~~~---~g~~v 202 (265) T cd06290 184 NDQTAYGARLALYR---RGLRV 202 (265) T ss_pred CHHHHHHHHHHHHH---HCCCC T ss_conf 78999999999998---09999 No 157 >PRK05720 mtnA methylthioribose-1-phosphate isomerase; Reviewed Probab=29.29 E-value=43 Score=14.62 Aligned_cols=18 Identities=6% Similarity=0.150 Sum_probs=13.7 Q ss_pred EEECCCCCHHHHHHHHHH Q ss_conf 771389998999999997 Q gi|254780898|r 52 IYRSAQPNGTFIEYLKKE 69 (207) Q Consensus 52 lyRS~qP~~~~~~~l~k~ 69 (207) ..+++.||.-++.+.+.+ T Consensus 88 ~L~~tRPTAVNL~wAl~r 105 (351) T PRK05720 88 YLAATRPTAVNLFWALDR 105 (351) T ss_pred HHHHCCCCHHHHHHHHHH T ss_conf 998659737789999999 No 158 >PRK10853 hypothetical protein; Provisional Probab=28.65 E-value=28 Score=15.79 Aligned_cols=29 Identities=28% Similarity=0.495 Sum_probs=23.9 Q ss_pred EECCCCCHHHHHHHHHHCCCCEEEECCCC Q ss_conf 71389998999999997499389973787 Q gi|254780898|r 53 YRSAQPNGTFIEYLKKEYGIKSILNLRGK 81 (207) Q Consensus 53 yRS~qP~~~~~~~l~k~~GIktIInLr~~ 81 (207) ||-.-|+.+.+..++++.|+..+||=|+. T Consensus 32 ~k~~~~~~~~l~~wl~~~G~~~liN~rgt 60 (118) T PRK10853 32 YRVDGLDSELLNGFIAELGWEALLNTRGT 60 (118) T ss_pred CCCCCCCHHHHHHHHHHCCHHHHHCCCCC T ss_conf 11469899999999996298987516663 No 159 >PRK09389 (R)-citramalate synthase; Provisional Probab=28.39 E-value=45 Score=14.52 Aligned_cols=83 Identities=27% Similarity=0.264 Sum_probs=43.3 Q ss_pred HHHHHCCCCEEEECCCCC--CCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHH-CCCCCEEEECCCCCCH Q ss_conf 999974993899737878--7654445799999669849987526888989899999999994-2799499982899888 Q gi|254780898|r 65 YLKKEYGIKSILNLRGKL--PESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILK-TAPKPLLIHCKSGADR 141 (207) Q Consensus 65 ~l~k~~GIktIInLr~~~--~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~-~~~~PVlVHC~aG~~R 141 (207) .+++++|.+-.++.-... +..+...-.+.+.+.|...+.+|= ..+..++..+..+.+-+. ..+.|+=+||+--. T Consensus 120 ~~ak~~g~~v~~~~ED~sr~~~~fl~e~~~~a~~aga~~i~l~D-TvG~~~P~~~~~~i~~l~~~~~~~i~vH~HND~-- 196 (487) T PRK09389 120 EYAKDHGLIVELSGEDASRADLDFLKELYRAGIEAGADRICFCD-TVGILTPERSYELFKRLSESLKIPISIHCHNDF-- 196 (487) T ss_pred HHHHHCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCCCCCC-CCCCCCHHHHHHHHHHHCCCCCCEEEEEECCCC-- T ss_conf 99997497799921066555779999999999973899622488-888879999999986300467854899705997-- Q ss_pred HHHHHHHHHH Q ss_conf 9999999999 Q gi|254780898|r 142 TGLASAVYLY 151 (207) Q Consensus 142 TG~~~alyli 151 (207) |+.+|.-+. T Consensus 197 -GlAvANsla 205 (487) T PRK09389 197 -GLATANTLA 205 (487) T ss_pred -CHHHHHHHH T ss_conf -779999999 No 160 >pfam00591 Glycos_transf_3 Glycosyl transferase family, a/b domain. This family includes anthranilate phosphoribosyltransferase (TrpD), thymidine phosphorylase. All these proteins can transfer a phosphorylated ribose substrate. Probab=28.22 E-value=45 Score=14.50 Aligned_cols=72 Identities=22% Similarity=0.382 Sum_probs=38.3 Q ss_pred CCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCC--CCCEEEE Q ss_conf 99989999999974993899737878765444579999966984998752688898989999999999427--9949998 Q gi|254780898|r 57 QPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTA--PKPLLIH 134 (207) Q Consensus 57 qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~--~~PVlVH 134 (207) .|.-..+.+..+++|++||.|+-++--.| .+..+.-+.+ -+.+....+.++++.. +..++|| T Consensus 85 hPamk~v~~~R~~lg~rTifNllgPL~NP-----------a~~~~qliGv-----~~~~~~~~~a~~l~~lg~~~a~vV~ 148 (254) T pfam00591 85 HPAMKHVAPVRKELGTRTIFNLLGPLTNP-----------ARPGLQLLGV-----YSKELVGPIAEVLERLGEERALVVH 148 (254) T ss_pred CHHHHHHHHHHHHCCCCCCHHHHHHCCCC-----------CCCCCEEEEE-----ECHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 98999999999870889803321010687-----------7788469999-----5758869999999972985589997 Q ss_pred CCCCCCHHHH Q ss_conf 2899888999 Q gi|254780898|r 135 CKSGADRTGL 144 (207) Q Consensus 135 C~aG~~RTG~ 144 (207) -.-|.|.-.+ T Consensus 149 g~dGlDEis~ 158 (254) T pfam00591 149 GGDGLDEVTT 158 (254) T ss_pred CCCCCCEEEC T ss_conf 1799870212 No 161 >PRK09432 metF 5,10-methylenetetrahydrofolate reductase; Provisional Probab=27.96 E-value=46 Score=14.47 Aligned_cols=18 Identities=44% Similarity=0.776 Sum_probs=14.3 Q ss_pred HHCCCCEEEECCCCCCCC Q ss_conf 974993899737878765 Q gi|254780898|r 68 KEYGIKSILNLRGKLPES 85 (207) Q Consensus 68 k~~GIktIInLr~~~~~~ 85 (207) +..||+.|+-|||+.+.. T Consensus 107 ~~~GI~nILALRGD~P~g 124 (296) T PRK09432 107 WNNGIRHIVALRGDLPPG 124 (296) T ss_pred HHCCCCEEEECCCCCCCC T ss_conf 975975486548989999 No 162 >KOG1200 consensus Probab=27.93 E-value=46 Score=14.47 Aligned_cols=92 Identities=20% Similarity=0.247 Sum_probs=56.0 Q ss_pred CHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCC--EEEEEECCCCCCCCHHHHHHHHHHH-HCCCC-CEEEE Q ss_conf 98999999997499389973787876544457999996698--4998752688898989999999999-42799-49998 Q gi|254780898|r 59 NGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGI--QLINFPLSATRELNDEQIKQLISIL-KTAPK-PLLIH 134 (207) Q Consensus 59 ~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi--~~i~ip~~~~~~~~~~~i~~~~~il-~~~~~-PVlVH 134 (207) -...+...+.+.|-|-+++=+... .-.+.+..++= .|..|.- +.-..+.++..++-+ +.... .|+|. T Consensus 26 IGrAia~~la~~Garv~v~dl~~~------~A~ata~~L~g~~~h~aF~~---DVS~a~~v~~~l~e~~k~~g~psvlVn 96 (256) T KOG1200 26 IGRAIAQLLAKKGARVAVADLDSA------AAEATAGDLGGYGDHSAFSC---DVSKAHDVQNTLEEMEKSLGTPSVLVN 96 (256) T ss_pred HHHHHHHHHHHCCCEEEEEECCHH------HHHHHHHHCCCCCCCCEEEE---CCCCHHHHHHHHHHHHHHCCCCCEEEE T ss_conf 779999999746967997503224------47999862688776523530---467578899999999984299728997 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 28998889999999999983999899999 Q gi|254780898|r 135 CKSGADRTGLASAVYLYIVAHYPKEEAHR 163 (207) Q Consensus 135 C~aG~~RTG~~~alyli~~~g~~~~eA~~ 163 (207) | +|..|-++.. +|..+.|+.-.+.+ T Consensus 97 c-AGItrD~~Ll---rmkq~qwd~vi~vN 121 (256) T KOG1200 97 C-AGITRDGLLL---RMKQEQWDSVIAVN 121 (256) T ss_pred C-CCCCCCCCEE---ECCHHHHHHHHHHH T ss_conf 5-7646530201---32488888899751 No 163 >cd06294 PBP1_ycjW_transcription_regulator_like Ligand-binding domain of uncharacterized transcription regulator ycjW which is a member of the LacI-GalR family repressors. This group includes the ligand-binding domain of uncharacterized transcription regulator ycjW which is a member of the LacI-GalR family repressors that are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=27.79 E-value=46 Score=14.45 Aligned_cols=92 Identities=18% Similarity=0.123 Sum_probs=48.7 Q ss_pred HHHHHCCCCEEEECCCCCCCC----CCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCC Q ss_conf 999974993899737878765----4445799999669849987526888989899999999994279949998289988 Q gi|254780898|r 65 YLKKEYGIKSILNLRGKLPES----WHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKSGAD 140 (207) Q Consensus 65 ~l~k~~GIktIInLr~~~~~~----~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~aG~~ 140 (207) +.+.+.|-|.|.-+.++.+.. ......++++++|+.....-+...........+.+.++++..+.|-.|-|....- T Consensus 115 ~~L~~~Ghr~I~~i~~~~~~~~~~~R~~Gf~~al~~~g~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~tai~~~nD~~ 194 (270) T cd06294 115 EYLIKLGHKKIAFVGGDLDLEVTQDRLQGYKQALEDHGIPDRNEVIISLDFSEEGGYKALKKLLEQHPRPTAIVATDDLL 194 (270) T ss_pred HHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCHHHHHHHHHHHHHCCCCCCEEEECCHHH T ss_conf 99997699569998189986589999999999999869997733331377667889999999984699984899877499 Q ss_pred HHHHHHHHHHHHHCCCCHH Q ss_conf 8999999999998399989 Q gi|254780898|r 141 RTGLASAVYLYIVAHYPKE 159 (207) Q Consensus 141 RTG~~~alyli~~~g~~~~ 159 (207) =.|++-++.. .|.... T Consensus 195 A~g~~~al~~---~g~~iP 210 (270) T cd06294 195 ALGVLKVLNE---LGLKVP 210 (270) T ss_pred HHHHHHHHHH---CCCCCC T ss_conf 9999999998---399999 No 164 >pfam04396 consensus Probab=27.60 E-value=47 Score=14.43 Aligned_cols=71 Identities=15% Similarity=0.218 Sum_probs=40.4 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCC-CCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCC Q ss_conf 89999999974993899737878765-444579999966984998752688898989999999999427994 Q gi|254780898|r 60 GTFIEYLKKEYGIKSILNLRGKLPES-WHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKP 130 (207) Q Consensus 60 ~~~~~~l~k~~GIktIInLr~~~~~~-~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~P 130 (207) ...+...+++.|++..|..+.-.+.. ....-.+.....||..+|+|-...+..+...+..++....+.+.| T Consensus 24 ~~~I~~al~~~Gy~g~vsi~aygd~~~~~~~~~~~L~stGI~l~hvp~g~k~aad~~m~~d~~~~a~~np~P 95 (149) T pfam04396 24 APNIKSALRKLGYRGPVSITAYGDLNKIPREILRALSSTGISLAHVPAGAKDARDKKMLVDILLWALDNPPP 95 (149) T ss_pred HHHHHHHHHHCCCCCCEEEEEEEECCCCCHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHHHCCCC T ss_conf 999999999749998669999874686988999999976986885788775347889999999999709997 No 165 >PRK07049 methionine gamma-lyase; Validated Probab=27.49 E-value=47 Score=14.42 Aligned_cols=34 Identities=12% Similarity=0.225 Sum_probs=20.6 Q ss_pred HHCCCCCCCCCCEEECCCCCHHHHHHHHHHCCCCEEE Q ss_conf 6078725028947713899989999999974993899 Q gi|254780898|r 40 FTQNFHAVVPHEIYRSAQPNGTFIEYLKKEYGIKSIL 76 (207) Q Consensus 40 ~~~Nf~~V~~g~lyRS~qP~~~~~~~l~k~~GIktII 76 (207) ..+.=|.|.++.+|.+. ..-+...+.+.||+.+- T Consensus 119 l~~Gd~iv~~~~~YggT---~~l~~~~l~~~GI~~~~ 152 (427) T PRK07049 119 VRPGDVIVHSQPLYGGT---ETLLAKTFRNFGVGAVG 152 (427) T ss_pred CCCCCEEEECCCCCHHH---HHHHHHHHHHCCCEEEE T ss_conf 18999999899982789---99999999975988999 No 166 >PRK10499 N,N'-diacetylchitobiose-specific PTS system transporter subunit IIB; Provisional Probab=27.42 E-value=47 Score=14.41 Aligned_cols=19 Identities=26% Similarity=0.548 Sum_probs=10.5 Q ss_pred CCCEEEECCCCCCHHHHHHH Q ss_conf 99499982899888999999 Q gi|254780898|r 128 PKPLLIHCKSGADRTGLASA 147 (207) Q Consensus 128 ~~PVlVHC~aG~~RTG~~~a 147 (207) .+.|++-|.+|. .|++++- T Consensus 3 ~kkIlL~C~aGM-STSlLv~ 21 (106) T PRK10499 3 KKHIYLFCSAGM-STSLLVS 21 (106) T ss_pred CCEEEEECCCCC-CHHHHHH T ss_conf 877999889998-7899999 No 167 >pfam00282 Pyridoxal_deC Pyridoxal-dependent decarboxylase conserved domain. Probab=27.42 E-value=47 Score=14.41 Aligned_cols=58 Identities=21% Similarity=0.281 Sum_probs=36.6 Q ss_pred HHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCC-CCCEEEECCCCCCHHHHH Q ss_conf 4579999966984998752688898989999999999427-994999828998889999 Q gi|254780898|r 88 KEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTA-PKPLLIHCKSGADRTGLA 145 (207) Q Consensus 88 ~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~-~~PVlVHC~aG~~RTG~~ 145 (207) ..-...+.-+|+..+.+|+++....+.+.+++.++...+. ..|++|-+.+|.--||.+ T Consensus 151 ~Sv~Kaa~~lGlg~~~V~~d~~~~m~~~~L~~~i~~~~~~g~~p~~VvataGTT~tGai 209 (370) T pfam00282 151 SSIEKAALIAGVELREIPTDENGKMRGMDLEKAIEEDKENGLIPFFVCATLGTTGSGAF 209 (370) T ss_pred HHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCC T ss_conf 99999999749983695368988829999999999998679974499986377177134 No 168 >PRK11784 tRNA 2-selenouridine synthase; Provisional Probab=27.31 E-value=47 Score=14.40 Aligned_cols=29 Identities=21% Similarity=0.164 Sum_probs=17.4 Q ss_pred CCEEEECCCCCCHHHHHHHHHHHHHCCCCHH Q ss_conf 9499982899888999999999998399989 Q gi|254780898|r 129 KPLLIHCKSGADRTGLASAVYLYIVAHYPKE 159 (207) Q Consensus 129 ~PVlVHC~aG~~RTG~~~alyli~~~g~~~~ 159 (207) ..++|+|.-|--|++.++-. + ...||+.. T Consensus 90 ~~~~iyCwRGG~RS~~~~~~-L-~~~G~~v~ 118 (333) T PRK11784 90 PRPLLYCWRGGLRSGSVQQW-L-KEIGIDVP 118 (333) T ss_pred CCEEEEECCCCHHHHHHHHH-H-HHCCCCCC T ss_conf 97699973675668999999-9-98699756 No 169 >PRK08742 adenosylmethionine--8-amino-7-oxononanoate transaminase; Provisional Probab=26.84 E-value=24 Score=16.23 Aligned_cols=72 Identities=11% Similarity=0.118 Sum_probs=39.9 Q ss_pred CCCHHHHHHHHHHHHCCCCCEEEEC---CCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHH Q ss_conf 9898999999999942799499982---8998889999999999983999899999610002642674245999999999 Q gi|254780898|r 111 ELNDEQIKQLISILKTAPKPLLIHC---KSGADRTGLASAVYLYIVAHYPKEEAHRQLSMLYGHFPVLKTITMDITFEKI 187 (207) Q Consensus 111 ~~~~~~i~~~~~il~~~~~PVlVHC---~aG~~RTG~~~alyli~~~g~~~~eA~~el~~~ygh~~~~~~~~ld~~~~~~ 187 (207) .|++..+..+.++.++.+ ||+-| ..|-||||.+.|+- ..|..++-..--..+.=|++|.+-...=+.+++.+ T Consensus 254 ~p~~~yl~~lr~lc~~~g--iLLI~DEV~tGfGRtG~~fa~e---~~gv~PDiv~~aKgltgGy~Plsav~~~~eI~~~~ 328 (472) T PRK08742 254 MHHPAYLRRARELCDAHG--AFLIADEIATGFGRTGTLFACE---QAGVMPDLLCLSKGLTGGFLPLSAVLATQQLYDAF 328 (472) T ss_pred CCCHHHHHHHHHHHHHHC--CEEEECCCCCCCCCCCCCEEEC---CCCCCCCEEEECHHHCCCCCCEEEEEECHHHHHHH T ss_conf 689999999999988729--6688403301687134452312---46779972012121038877707898769999765 No 170 >cd06288 PBP1_sucrose_transcription_regulator Ligand-binding domain of DNA-binding regulatory proteins specific to sucrose that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of DNA-binding regulatory proteins specific to sucrose that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=26.48 E-value=49 Score=14.31 Aligned_cols=88 Identities=16% Similarity=0.066 Sum_probs=50.2 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCCC----CHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECC Q ss_conf 99999999749938997378787654----44579999966984998752688898989999999999427994999828 Q gi|254780898|r 61 TFIEYLKKEYGIKSILNLRGKLPESW----HKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCK 136 (207) Q Consensus 61 ~~~~~l~k~~GIktIInLr~~~~~~~----~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~ 136 (207) ....+.+.+.|.+.|.-+.++..... .....+++++.|+.....-+...........+.+.++++..+.|-.|-|. T Consensus 105 ~~a~~~L~~~G~~~i~~i~~~~~~~~~~~R~~gf~~a~~~~~i~~~~~~i~~~~~~~~~~~~~~~~ll~~~~~~dai~~~ 184 (269) T cd06288 105 YDATRHLLAAGHRRIAFINGEPWMLAAKDRLKGYRQALAEAGIPFDPDLVVHGDWSADDGYEAAAALLDLDDRPTAIFCG 184 (269) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCHHHHHHHHHHHHHCCCCCCEECCC T ss_conf 99999999848875578748988757999999999999976998664224126776677777779998558998610147 Q ss_pred CCCCHHHHHHHH Q ss_conf 998889999999 Q gi|254780898|r 137 SGADRTGLASAV 148 (207) Q Consensus 137 aG~~RTG~~~al 148 (207) .-.-=-|++-++ T Consensus 185 ~d~~A~g~~~~l 196 (269) T cd06288 185 NDRMAMGAYQAL 196 (269) T ss_pred CHHHHHHHHHHH T ss_conf 789999999999 No 171 >PRK01415 hypothetical protein; Validated Probab=26.30 E-value=49 Score=14.29 Aligned_cols=96 Identities=14% Similarity=0.174 Sum_probs=50.5 Q ss_pred ECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHH-----CCC Q ss_conf 138999899999999749938997378787654445799999669849987526888989899999999994-----279 Q gi|254780898|r 54 RSAQPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILK-----TAP 128 (207) Q Consensus 54 RS~qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~-----~~~ 128 (207) ++...+|+++..++.+.. -.||+.|.+-|.. |-+ + ...-.|......+|-..++ ..+ T Consensus 110 ~G~~l~P~ewn~li~~~d-vvviD~RN~YE~~-------------IG~--F--knAi~P~~~~FrEFp~~v~~~~~~~k~ 171 (247) T PRK01415 110 KGEYIEPKDWDEFITKQD-VIVIDTRNDYEVE-------------VGT--F--KSAINPNTKTFKQFPAWVQQNQELLKG 171 (247) T ss_pred CCCCCCHHHHHHHHHCCC-EEEEECCCCEEEE-------------EEE--E--CCCCCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 685359899999974688-4997437671775-------------107--5--487179965645569999974766259 Q ss_pred CCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCHHHHHHH Q ss_conf 9499982899888999999999998399989999961000264267424599999 Q gi|254780898|r 129 KPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQLSMLYGHFPVLKTITMDIT 183 (207) Q Consensus 129 ~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el~~~ygh~~~~~~~~ld~~ 183 (207) +||++.|..|. |.--+++. + ...|+. .+-+.+.|++.++ T Consensus 172 kkI~myCTGGI-RCEKas~y-l-~~~Gf~-------------~VyqL~GGIl~Y~ 210 (247) T PRK01415 172 KKIAMFCTGGI-RCEKSTSL-L-KSIGYN-------------EVYHLKGGILQYL 210 (247) T ss_pred CCEEEECCCCE-EEHHHHHH-H-HHCCCC-------------CEEEECHHHHHHH T ss_conf 86899827972-05486899-9-871987-------------0056451899999 No 172 >cd03035 ArsC_Yffb Arsenate Reductase (ArsC) family, Yffb subfamily; Yffb is an uncharacterized bacterial protein encoded by the yffb gene, related to the thioredoxin-fold arsenic reductases, ArsC. The structure of Yffb and the conservation of the catalytic cysteine suggest that it is likely to function as a glutathione (GSH)-dependent thiol reductase. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from GSH via glutaredoxin, through a single catalytic cysteine. Probab=25.72 E-value=48 Score=14.37 Aligned_cols=30 Identities=27% Similarity=0.392 Sum_probs=23.0 Q ss_pred EEECCCCCHHHHHHHHHHCCCCEEEECCCC Q ss_conf 771389998999999997499389973787 Q gi|254780898|r 52 IYRSAQPNGTFIEYLKKEYGIKSILNLRGK 81 (207) Q Consensus 52 lyRS~qP~~~~~~~l~k~~GIktIInLr~~ 81 (207) =|+..-|+.+.+..++++.|+..++|=|+. T Consensus 30 d~~~~p~~~~el~~~~~~~G~~~l~n~r~~ 59 (105) T cd03035 30 DYRKDGLDAATLERWLAKVGWETLLNKRGT 59 (105) T ss_pred EEECCCCCHHHHHHHHHHCCHHHHHCCCCH T ss_conf 756789899999999986790676433358 No 173 >TIGR03586 PseI pseudaminic acid synthase. Probab=25.46 E-value=51 Score=14.19 Aligned_cols=70 Identities=16% Similarity=0.171 Sum_probs=39.7 Q ss_pred HHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCC--------------------CCCCCHHHHHHHHH Q ss_conf 9999997499389973787876544457999996698499875268--------------------88989899999999 Q gi|254780898|r 63 IEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSA--------------------TRELNDEQIKQLIS 122 (207) Q Consensus 63 ~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~--------------------~~~~~~~~i~~~~~ 122 (207) +..+.++.||.-..+. +.....+.+.+.|+..+.++-.+ .+.-+.+.+...++ T Consensus 82 L~~~~k~~gI~f~~t~-------fd~~s~~~l~~l~~~~~KIaS~d~~n~~Li~~iak~~kpviiStG~s~~~EI~~av~ 154 (327) T TIGR03586 82 LFERAKELGLTIFSSP-------FDETAVDFLESLDVPAYKIASFEITDLPLIRYVAKTGKPIIMSTGIATLEEIEEAVE 154 (327) T ss_pred HHHHHHHCCCEEEECC-------CCHHHHHHHHHCCCCEEEECCHHHCCHHHHHHHHCCCCCEEEECCCCCHHHHHHHHH T ss_conf 9999998599799788-------997999999870799599894153665699998664886799688875456788888 Q ss_pred HHHCCCC--CEEEECCCCC Q ss_conf 9942799--4999828998 Q gi|254780898|r 123 ILKTAPK--PLLIHCKSGA 139 (207) Q Consensus 123 il~~~~~--PVlVHC~aG~ 139 (207) ++..... =+++||-+.- T Consensus 155 ~~~~~~~~~i~llhC~s~Y 173 (327) T TIGR03586 155 AAREAGCKDLVLLKCTSSY 173 (327) T ss_pred HHHHHCCCCEEECCCCCCC T ss_conf 7765025571565258589 No 174 >PRK12999 pyruvate carboxylase; Reviewed Probab=25.44 E-value=51 Score=14.19 Aligned_cols=80 Identities=21% Similarity=0.254 Sum_probs=48.7 Q ss_pred HHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHC-CCCCEEEECCCCCCHHHHHHHHHHHHH-CCCCH-HHHHHH Q ss_conf 457999996698499875268889898999999999942-799499982899888999999999998-39998-999996 Q gi|254780898|r 88 KEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKT-APKPLLIHCKSGADRTGLASAVYLYIV-AHYPK-EEAHRQ 164 (207) Q Consensus 88 ~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~-~~~PVlVHC~aG~~RTG~~~alyli~~-~g~~~-~eA~~e 164 (207) -.-....++.|...+-|. +-.+...+.....+...+++ .+-||-+||+. -+|..+|.|+... .|.+. +.|..- T Consensus 695 ~~~a~~l~~~G~~~l~IK-DMAGLl~P~aa~~LV~aLk~~~~lPIhlHtHd---TsG~~~at~laA~eAGvDiVD~A~ss 770 (1147) T PRK12999 695 VDLAKELEAAGAHILAIK-DMAGLLKPAAAYKLVSALKEEVDLPIHLHTHD---TSGNGLATYLAAAEAGVDIVDVAMAS 770 (1147) T ss_pred HHHHHHHHHCCCCEEEEC-CCCCCCCHHHHHHHHHHHHHHCCCCEEEECCC---CCCHHHHHHHHHHHCCCCEEECCCHH T ss_conf 999999997689889971-44546887999999999986169845984367---88558999999998599989635411 Q ss_pred HHHCCCC Q ss_conf 1000264 Q gi|254780898|r 165 LSMLYGH 171 (207) Q Consensus 165 l~~~ygh 171 (207) |+..++| T Consensus 771 mSg~TSQ 777 (1147) T PRK12999 771 MSGLTSQ 777 (1147) T ss_pred HCCCCCC T ss_conf 0289889 No 175 >cd00568 TPP_enzymes Thiamine pyrophosphate (TPP) enzyme family, TPP-binding module; found in many key metabolic enzymes which use TPP (also known as thiamine diphosphate) as a cofactor. These enzymes include, among others, the E1 components of the pyruvate, the acetoin and the branched chain alpha-keto acid dehydrogenase complexes. Probab=25.23 E-value=51 Score=14.16 Aligned_cols=23 Identities=26% Similarity=0.329 Sum_probs=11.4 Q ss_pred HHHHHHHHHHHHCCCCCEEEECC Q ss_conf 89999999999427994999828 Q gi|254780898|r 114 DEQIKQLISILKTAPKPLLIHCK 136 (207) Q Consensus 114 ~~~i~~~~~il~~~~~PVlVHC~ 136 (207) .+++...++-..+.++|++|+.+ T Consensus 145 ~~el~~al~~a~~~~~p~li~V~ 167 (168) T cd00568 145 PEDLEAALAEALAAGGPALIEVK 167 (168) T ss_pred HHHHHHHHHHHHHCCCCEEEEEE T ss_conf 99999999999838997899998 No 176 >TIGR01511 ATPase-IB1_Cu copper-translocating P-type ATPase; InterPro: IPR006403 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. P-ATPases (sometime known as E1-E2 ATPases) (3.6.3.- from EC) are found in bacteria and in a number of eukaryotic plasma membranes and organelles . P-ATPases function to transport a variety of different compounds, including ions and phospholipids, across a membrane using ATP hydrolysis for energy. There are many different classes of P-ATPases, each of which transports a specific type of ion: H^+, Na^+, K^+, Mg^2+, Ca^2+, Ag^+ and Ag^2+, Zn^2+, Co^2+, Pb^2+, Ni^2+, Cd^2+, Cu^+ and Cu^2+. P-ATPases can be composed of one or two polypeptides, and can usually assume two main conformations called E1 and E2. This entry represents P-type ATPases that specialise in cation and copper transport; those transporting copper (3.6.3.4 from EC) have been classified as belonging to the IB subfamily , , . These proteins are involved in a variety of processes in both prokaryotes and eukaryotes. Some of these P-ATPases import copper into cells, such as in Arabidopsis sp., which require copper to create functional hormone receptors for ethylene signalling. Other P-ATPases are involved in copper export out of cells, such as occurs with the efflux of hepatic copper into the bile in eukaryotes. In bacteria, some of these P-ATPases play a role in osmotic adaptation. Certain copper ATPases have been implicated in both Wilson and Menkes diseases . More information about this protein can be found at Protein of the Month: ATP Synthases .; GO: 0004008 copper-exporting ATPase activity, 0005524 ATP binding, 0046872 metal ion binding, 0046873 metal ion transmembrane transporter activity, 0030001 metal ion transport, 0016020 membrane. Probab=25.14 E-value=52 Score=14.15 Aligned_cols=60 Identities=22% Similarity=0.264 Sum_probs=35.4 Q ss_pred EECCC-CCHH--HHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHH Q ss_conf 71389-9989--99999997499389973787876544457999996698499875268889898999999999 Q gi|254780898|r 53 YRSAQ-PNGT--FIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISI 123 (207) Q Consensus 53 yRS~q-P~~~--~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~i 123 (207) +.-++ |.|+ .+-+.+|+.||++++ |-||++ ..-...++++||+++-= ..+++..+...++ T Consensus 366 ~~~~D~lkp~a~~~i~~Lk~~Gi~~~m-LtGDN~----~~A~a~A~~lGI~v~Ae------v~P~~K~a~ik~l 428 (545) T TIGR01511 366 LALADQLKPEAKEVIQALKRRGIEPVM-LTGDNR----KTAKAVAKELGINVRAE------VLPDDKAALIKEL 428 (545) T ss_pred EEECCCCCHHHHHHHHHHHHCCCEEEE-EECCCH----HHHHHHHHHHCCEEEEC------CCHHHHHHHHHHH T ss_conf 866474688689999999875987999-866987----99999999728333317------8807689999998 No 177 >KOG1430 consensus Probab=24.85 E-value=52 Score=14.12 Aligned_cols=29 Identities=10% Similarity=-0.166 Sum_probs=11.3 Q ss_pred CCCEEEECCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 994999828998889999999999983999 Q gi|254780898|r 128 PKPLLIHCKSGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 128 ~~PVlVHC~aG~~RTG~~~alyli~~~g~~ 157 (207) .+.++.+|.---.++..... -++..-|.+ T Consensus 242 ~Gq~yfI~d~~p~~~~~~~~-~l~~~lg~~ 270 (361) T KOG1430 242 NGQFYFITDDTPVRFFDFLS-PLVKALGYC 270 (361) T ss_pred CCEEEEEECCCCCHHHHHHH-HHHHHCCCC T ss_conf 85089986898120368889-998843999 No 178 >PRK01269 thiamine biosynthesis protein ThiI; Provisional Probab=24.83 E-value=52 Score=14.12 Aligned_cols=11 Identities=18% Similarity=0.232 Sum_probs=4.4 Q ss_pred HHHHHHHHCCC Q ss_conf 99999998399 Q gi|254780898|r 146 SAVYLYIVAHY 156 (207) Q Consensus 146 ~alyli~~~g~ 156 (207) +|.|++...|. T Consensus 192 VAa~~mmKRG~ 202 (483) T PRK01269 192 VASYMLMRRGS 202 (483) T ss_pred HHHHHHHHCCC T ss_conf 88999984598 No 179 >TIGR01717 AMP-nucleosdse AMP nucleosidase; InterPro: IPR011271 This entry represents the AMP nucleosidase from proteobacteria but also including sequences from Corynebacterium, a Gram-positive organism. The Escherichia coli protein is the most well studied .. Probab=24.60 E-value=53 Score=14.09 Aligned_cols=55 Identities=27% Similarity=0.483 Sum_probs=30.5 Q ss_pred HHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEE-ECCCCCCHHHHHHHHHHHH Q ss_conf 999966984998752688898989999999999427994999-8289988899999999999 Q gi|254780898|r 92 KAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLI-HCKSGADRTGLASAVYLYI 152 (207) Q Consensus 92 ~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlV-HC~aG~~RTG~~~alyli~ 152 (207) -...-.||+.|+|.+- +-+..++...+.++...-= ++| || +|. |....++=|-.. T Consensus 265 i~~~G~GITlvNIGVG---PSNAKtITDHLAVLRP~~W-lMiGHC-aGL-r~sq~iGDyvLA 320 (486) T TIGR01717 265 ITADGDGITLVNIGVG---PSNAKTITDHLAVLRPHAW-LMIGHC-AGL-RESQRIGDYVLA 320 (486) T ss_pred HHCCCCCEEEEECCCC---CCCHHHHHHHHHHCCCCCE-EEECCC-CCH-HHHHHHHHHHHH T ss_conf 8506993289960748---8621235410221188742-872223-041-232343125552 No 180 >PRK08605 D-lactate dehydrogenase; Validated Probab=24.49 E-value=53 Score=14.08 Aligned_cols=56 Identities=23% Similarity=0.182 Sum_probs=33.9 Q ss_pred CCCEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCC Q ss_conf 894771389998999999997499389973787876544457999996698499875268 Q gi|254780898|r 49 PHEIYRSAQPNGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSA 108 (207) Q Consensus 49 ~g~lyRS~qP~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~ 108 (207) ++-+.||.-+-.+++-..++++|||-|..- +.. ...-+.+.|++.||...+.|-.. T Consensus 47 D~i~v~~~~~i~~~vl~~~~~l~iK~I~r~-gvG---~DnIDl~~a~~~gI~V~n~P~~~ 102 (332) T PRK08605 47 DGLSLSQQIPLSEAIYKLLNELGIKQIAQR-SAG---FDTYDLELANKYNIIISNVPSYS 102 (332) T ss_pred CEEEEECCCCCCHHHHHHHHHCCCEEEEEC-CCC---CCHHHHHHHHHCCCEEEECCCCC T ss_conf 889997079779999996663697799977-721---74546999997999899599868 No 181 >cd02014 TPP_POX Thiamine pyrophosphate (TPP) family, Pyruvate oxidase (POX) subfamily, TPP-binding module; composed of proteins similar to Lactobacillus plantarum POX, which plays a key role in controlling acetate production under aerobic conditions. POX decarboxylates pyruvate, producing hydrogen peroxide and the energy-storage metabolite acetylphosphate. It requires FAD in addition to TPP and a divalent cation as cofactors. Probab=24.03 E-value=54 Score=14.02 Aligned_cols=37 Identities=11% Similarity=0.077 Sum_probs=19.7 Q ss_pred CCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCC Q ss_conf 9389973787876544457999996698499875268 Q gi|254780898|r 72 IKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSA 108 (207) Q Consensus 72 IktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~ 108 (207) =+.||.+-++..-...-.|.+.+.+.++..+-+-+.. T Consensus 69 ~~~Vv~i~GDG~f~m~~~El~Ta~~~~lpi~~vV~nN 105 (178) T cd02014 69 DRQVIALSGDGGFAMLMGDLITAVKYNLPVIVVVFNN 105 (178) T ss_pred CCEEEEEECCCHHHCCHHHHHHHHHHCCCCEEEEEEC T ss_conf 9808999777067406189998998589813999989 No 182 >PRK07454 short chain dehydrogenase; Provisional Probab=23.91 E-value=54 Score=14.01 Aligned_cols=78 Identities=21% Similarity=0.182 Sum_probs=43.6 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHC-CCC-CEEEECCC Q ss_conf 8999999997499389973787876544457999996698499875268889898999999999942-799-49998289 Q gi|254780898|r 60 GTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKT-APK-PLLIHCKS 137 (207) Q Consensus 60 ~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~-~~~-PVlVHC~a 137 (207) ...++..+.+.|.+-++.=|.++. .+.-.+.++..|.+...++. +.-+++.++.+.+.+.+ ... -+||+| + T Consensus 19 G~a~A~~la~~G~~V~l~~R~~~~---l~~~~~e~~~~g~~~~~~~~---Dvt~~~~v~~~~~~~~~~~G~iDiLVnN-A 91 (241) T PRK07454 19 GKATALAFAKAGWDLALVARSQDA---LEALAEELRSTGVKVAAYSI---DLSNPEAIAPGIAELLEQFGCPSVLINN-A 91 (241) T ss_pred HHHHHHHHHHCCCEEEEEECCHHH---HHHHHHHHHHCCCCEEEEEE---ECCCHHHHHHHHHHHHHHCCCCCEEEEC-C T ss_conf 999999999879989999899999---99999999965992899995---1899999999999999975998899988-9 Q ss_pred CCCHHHH Q ss_conf 9888999 Q gi|254780898|r 138 GADRTGL 144 (207) Q Consensus 138 G~~RTG~ 144 (207) |....|. T Consensus 92 G~~~~~~ 98 (241) T PRK07454 92 GAAYTGP 98 (241) T ss_pred CCCCCCC T ss_conf 8899999 No 183 >PRK11194 hypothetical protein; Provisional Probab=23.82 E-value=55 Score=14.00 Aligned_cols=57 Identities=11% Similarity=0.121 Sum_probs=42.8 Q ss_pred CCCCCHHHHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 8898989999999999427994999828998889999999999983999899999610 Q gi|254780898|r 109 TRELNDEQIKQLISILKTAPKPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQLS 166 (207) Q Consensus 109 ~~~~~~~~i~~~~~il~~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el~ 166 (207) ...|+++.+..|.+++.+..-++.|-=..|.|-.+.. +=-.-.+.+.+..++.++|. T Consensus 307 ~~~p~~~~i~~F~~~L~~~gi~vtvR~s~G~DI~AAC-GQLa~~~~~~~~~~~~~~~~ 363 (372) T PRK11194 307 YGRSSNSRIDRFSKVLMEYGFTVIVRKTRGDDIDAAC-GQLAGDVIDRTKRTLKKRMQ 363 (372) T ss_pred CCCCCHHHHHHHHHHHHHCCCCEEEECCCCCCHHHHH-HCCHHHHHCCCHHHHHHHHH T ss_conf 8799999999999999978991797589985333540-42625452600588998865 No 184 >cd02001 TPP_ComE_PpyrDC Thiamine pyrophosphate (TPP) family, ComE and PpyrDC subfamily, TPP-binding module; composed of proteins similar to sulfopyruvate decarboxylase beta subunit (ComE) and phosphonopyruvate decarboxylase (Ppyr decarboxylase). Methanococcus jannaschii sulfopyruvate decarboxylase (ComDE) is a dodecamer of six alpha (D) subunits and six (E) beta subunits which, catalyzes the decarboxylation of sulfopyruvic acid to sulfoacetaldehyde in the coenzyme M pathway. Ppyr decarboxylase is a homotrimeric enzyme which functions in the biosynthesis of C-P compounds such as bialaphos tripeptide in Streptomyces hygroscopicus. Ppyr decarboxylase and ComDE require TPP and divalent metal cation cofactors. Probab=23.44 E-value=56 Score=13.95 Aligned_cols=24 Identities=17% Similarity=0.127 Sum_probs=16.1 Q ss_pred CHHHHHHHHHHHHCCCCCEEEECC Q ss_conf 989999999999427994999828 Q gi|254780898|r 113 NDEQIKQLISILKTAPKPLLIHCK 136 (207) Q Consensus 113 ~~~~i~~~~~il~~~~~PVlVHC~ 136 (207) +.+.+...++-+.+.++|+++|-+ T Consensus 130 ~~~el~~al~~~l~~~GP~fIeV~ 153 (157) T cd02001 130 LLGGLGSEFAGLLATTGPTLLHAP 153 (157) T ss_pred CHHHHHHHHHHHHCCCCCEEEEEE T ss_conf 999999999999849999899999 No 185 >COG3088 CcmH Uncharacterized protein involved in biosynthesis of c-type cytochromes [Posttranslational modification, protein turnover, chaperones] Probab=23.17 E-value=56 Score=13.92 Aligned_cols=29 Identities=21% Similarity=0.337 Sum_probs=23.0 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHCCCCC Q ss_conf 99999999983999899999610002642 Q gi|254780898|r 144 LASAVYLYIVAHYPKEEAHRQLSMLYGHF 172 (207) Q Consensus 144 ~~~alyli~~~g~~~~eA~~el~~~ygh~ 172 (207) +..-.|-++.+|.|..+.+..|.-+||+| T Consensus 66 lR~~V~e~l~eGkS~~qIid~mVaRYG~F 94 (153) T COG3088 66 LRHQVYELLQEGKSDQQIIDYMVARYGEF 94 (153) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHCCE T ss_conf 99999999985786899999999853463 No 186 >cd01294 DHOase Dihydroorotase (DHOase) catalyzes the reversible interconversion of carbamoyl aspartate to dihydroorotate, a key reaction in the pyrimidine biosynthesis. In contrast to the large polyfunctional CAD proteins of higher organisms, this group of DHOases is monofunctional and mainly dimeric. Probab=23.06 E-value=56 Score=13.90 Aligned_cols=30 Identities=20% Similarity=0.297 Sum_probs=22.2 Q ss_pred CCCCCCHHHHHHHHHHHHCCCCCEEEECCC Q ss_conf 888989899999999994279949998289 Q gi|254780898|r 108 ATRELNDEQIKQLISILKTAPKPLLIHCKS 137 (207) Q Consensus 108 ~~~~~~~~~i~~~~~il~~~~~PVlVHC~a 137 (207) +.+..+.+.+....+.++..+.|+.+||-. T Consensus 106 ~~Gv~~~~~~~~~~e~~~~~~~~l~~H~E~ 135 (335) T cd01294 106 QGGVTDLEKIYPVLEAMQKLGMPLLVHGEV 135 (335) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCEEEECCC T ss_conf 557655899999999998669967774765 No 187 >cd06818 PLPDE_III_cryptic_DSD Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Bacterial Cryptic D-Serine Dehydratase. This subfamily is composed of Burkholderia cepacia cryptic D-serine dehydratase (cryptic DSD), which is also called D-serine deaminase, and similar bacterial proteins. Members of this subfamily are fold type III PLP-dependent enzymes with similarity to bacterial alanine racemase (AR), which contains an N-terminal PLP-binding TIM-barrel domain and a C-terminal beta-sandwich domain. AR exists as dimers with active sites that lie at the interface between the TIM barrel domain of one subunit and the beta-sandwich domain of the other subunit. Based on similarity, it is possible cryptic DSDs may also form dimers. Cryptic DSDs are distinct from the ubiquitous bacterial DSDs coded by the dsdA gene, mammalian L-serine dehydratases (LSD) and mammalian serine racemase (SerRac), which are fold type II PLP-dependent enzymes. At present, the enzymatic and biochemical properties Probab=22.85 E-value=57 Score=13.88 Aligned_cols=26 Identities=15% Similarity=0.311 Sum_probs=11.7 Q ss_pred CHHHHHHHHHHHHCCCC-CEE-EECCCC Q ss_conf 98999999999942799-499-982899 Q gi|254780898|r 113 NDEQIKQLISILKTAPK-PLL-IHCKSG 138 (207) Q Consensus 113 ~~~~i~~~~~il~~~~~-PVl-VHC~aG 138 (207) +.+.+..+.+.+...++ .+. +||+.| T Consensus 142 ~~~~a~~l~~~i~~~~~L~~~Gl~~y~G 169 (382) T cd06818 142 TEAEALALADAIAASPALRLAGVEGYEG 169 (382) T ss_pred CHHHHHHHHHHHHHCCCCEEEEEEEECC T ss_conf 8699999999997299965946885467 No 188 >COG1099 Predicted metal-dependent hydrolases with the TIM-barrel fold [General function prediction only] Probab=22.69 E-value=57 Score=13.86 Aligned_cols=101 Identities=13% Similarity=0.172 Sum_probs=56.0 Q ss_pred CHHHHHHHHHHCCCCEEEECCCCCCCCC-------------CHHHHHHHHHCCCEEE-EEECCCCCCCC----------- Q ss_conf 9899999999749938997378787654-------------4457999996698499-87526888989----------- Q gi|254780898|r 59 NGTFIEYLKKEYGIKSILNLRGKLPESW-------------HKEEEKAANDLGIQLI-NFPLSATRELN----------- 113 (207) Q Consensus 59 ~~~~~~~l~k~~GIktIInLr~~~~~~~-------------~~~e~~~~~~~gi~~i-~ip~~~~~~~~----------- 113 (207) +-+++..+ +..||++||.+.-+. .+. ...|...+++.|+++. -+.+.++..|+ T Consensus 13 ~~eDlekM-a~sGI~~Vit~AhdP-~~~~~~~v~~~h~~rl~~~E~~Ra~~~Gl~~~vavGvHPr~iP~e~~~~l~~L~~ 90 (254) T COG1099 13 GFEDLEKM-ALSGIREVITLAHDP-YPMKTAEVYLDHFRRLLGVEPERAEKAGLKLKVAVGVHPRAIPPELEEVLEELEE 90 (254) T ss_pred CHHHHHHH-HHHCHHHHHHCCCCC-CCCCCHHHHHHHHHHHHCCCHHHHHHHCCEEEEEECCCCCCCCCHHHHHHHHHHH T ss_conf 47799999-983816664203688-7766688999999999722531688608304677333777789059999999886 Q ss_pred -------------------HHHHHHH---HHHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHH Q ss_conf -------------------8999999---99994279949998289988899999999999839998999 Q gi|254780898|r 114 -------------------DEQIKQL---ISILKTAPKPLLIHCKSGADRTGLASAVYLYIVAHYPKEEA 161 (207) Q Consensus 114 -------------------~~~i~~~---~~il~~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA 161 (207) ++..+-| +++.++.+=|+.||--.+..--++--.+-.....|.....+ T Consensus 91 ~l~~e~VvAiGEiGLe~~t~~E~evf~~QL~LA~e~dvPviVHTPr~nK~e~t~~ildi~~~~~l~~~lv 160 (254) T COG1099 91 LLSNEDVVAIGEIGLEEATDEEKEVFREQLELARELDVPVIVHTPRRNKKEATSKILDILIESGLKPSLV 160 (254) T ss_pred HCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHE T ss_conf 5164783676403322178889999999999998749967996787760567899999998749980121 No 189 >pfam11385 DUF3189 Protein of unknown function (DUF3189). This family of proteins with unknown function appears to be restricted to Firmicutes Probab=22.56 E-value=58 Score=13.84 Aligned_cols=13 Identities=31% Similarity=0.570 Sum_probs=5.3 Q ss_pred HHHHHHHHHHHHC Q ss_conf 8999999999942 Q gi|254780898|r 114 DEQIKQLISILKT 126 (207) Q Consensus 114 ~~~i~~~~~il~~ 126 (207) ...+..+.+++.. T Consensus 77 ~~~l~~l~~i~~~ 89 (148) T pfam11385 77 EPALKDLLEILGI 89 (148) T ss_pred HHHHHHHHHHHCC T ss_conf 9999999999678 No 190 >COG2897 SseA Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism] Probab=22.41 E-value=58 Score=13.82 Aligned_cols=84 Identities=20% Similarity=0.241 Sum_probs=38.2 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCC-C-EEEEEECCC-----CCCCCHHHHHHHHHHH-HCCCCCE Q ss_conf 899999999749938997378787654445799999669-8-499875268-----8898989999999999-4279949 Q gi|254780898|r 60 GTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLG-I-QLINFPLSA-----TRELNDEQIKQLISIL-KTAPKPL 131 (207) Q Consensus 60 ~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~g-i-~~i~ip~~~-----~~~~~~~~i~~~~~il-~~~~~PV 131 (207) ...+.... +.+=+.||+.|+..+......|..... .| | --+++|+.+ ..-.+++.+..+.+-. -+.+++| T Consensus 160 ~~~~~~~~-~~~~~~liDaR~~~rf~G~~~ep~~~~-~GHIPGAiNipw~~~~~~~~~~~~~~~~~~l~~~~gi~~~~~v 237 (285) T COG2897 160 ATLVADAL-EVPAVLLIDARSPERFRGKEPEPRDGK-AGHIPGAINIPWTDLVDDGGLFKSPEEIARLYADAGIDPDKEV 237 (285) T ss_pred HHHHHHHH-CCCCEEEEECCCHHHHCCCCCCCCCCC-CCCCCCCCCCCHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCCE T ss_conf 99999885-489727985688888078887778887-8788887576789972678764969999999986189988888 Q ss_pred EEECCCCCCHHHHHH Q ss_conf 998289988899999 Q gi|254780898|r 132 LIHCKSGADRTGLAS 146 (207) Q Consensus 132 lVHC~aG~~RTG~~~ 146 (207) .++|.+|. |+.... T Consensus 238 I~yCgsG~-~As~~~ 251 (285) T COG2897 238 IVYCGSGV-RASVTW 251 (285) T ss_pred EEECCCCH-HHHHHH T ss_conf 99838963-699999 No 191 >COG1229 FwdA Formylmethanofuran dehydrogenase subunit A [Energy production and conversion] Probab=22.19 E-value=59 Score=13.79 Aligned_cols=44 Identities=20% Similarity=0.391 Sum_probs=26.4 Q ss_pred CEEEEEECCCCCCCCHHHHHHHHHHHHC--CCCCEEEECCCCCCHHH Q ss_conf 8499875268889898999999999942--79949998289988899 Q gi|254780898|r 99 IQLINFPLSATRELNDEQIKQLISILKT--APKPLLIHCKSGADRTG 143 (207) Q Consensus 99 i~~i~ip~~~~~~~~~~~i~~~~~il~~--~~~PVlVHC~aG~~RTG 143 (207) +.-++=|++....-+.+.+..+.++-+. .+.+|-+||.. .|--| T Consensus 200 V~~Lddpvp~FdiTP~EIi~gl~e~nE~L~lphsiHlH~N~-LG~PG 245 (575) T COG1229 200 VHSLDDPVPYFDITPREIIKGLAEVNERLGLPHSIHLHCNN-LGHPG 245 (575) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEECCCC-CCCCC T ss_conf 56566888886568999999899888864899627865334-79998 No 192 >PRK13132 consensus Probab=21.96 E-value=59 Score=13.77 Aligned_cols=76 Identities=24% Similarity=0.173 Sum_probs=48.3 Q ss_pred CHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCC Q ss_conf 98999999997499389973787876544457999996698499875268889898999999999942799499982899 Q gi|254780898|r 59 NGTFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKSG 138 (207) Q Consensus 59 ~~~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~aG 138 (207) ..++|..-.++.|+..+|----+.|+ ...-.+.|++.|+..+.+--+ +.+ .....+.+.+.+ +|.|-+- T Consensus 101 G~e~F~~~~~~~GvdGlIipDLP~ee--~~~~~~~~~~~~i~~I~lvaP-----Ts~--~R~~~i~~~s~g--fiY~vs~ 169 (246) T PRK13132 101 GLEKFVKKAKELGISGLIVPDLPFEE--SEELIKECEKYNIALIPLISV-----TSP--KRAKKILKHAKG--FIYALGS 169 (246) T ss_pred CHHHHHHHHHHCCCCEEECCCCCHHH--HHHHHHHHHHCCCCEEEEECC-----CCH--HHHHHHHHCCCC--CEEEEEC T ss_conf 99999999987699857757999789--899999999859970144257-----978--999999954898--2799753 Q ss_pred CCHHHHH Q ss_conf 8889999 Q gi|254780898|r 139 ADRTGLA 145 (207) Q Consensus 139 ~~RTG~~ 145 (207) .+.||.- T Consensus 170 ~GvTG~~ 176 (246) T PRK13132 170 IGVTGTK 176 (246) T ss_pred CCCCCCC T ss_conf 5677776 No 193 >KOG1540 consensus Probab=21.93 E-value=59 Score=13.76 Aligned_cols=16 Identities=6% Similarity=-0.004 Sum_probs=6.7 Q ss_pred CCCCEEECCCCCHHHHH Q ss_conf 28947713899989999 Q gi|254780898|r 48 VPHEIYRSAQPNGTFIE 64 (207) Q Consensus 48 ~~g~lyRS~qP~~~~~~ 64 (207) .+.++..+.+ +|.++. T Consensus 129 ~~~~V~v~Di-np~mL~ 144 (296) T KOG1540 129 RESKVTVLDI-NPHMLA 144 (296) T ss_pred CCCEEEEEEC-CHHHHH T ss_conf 7761799938-989999 No 194 >pfam01487 DHquinase_I Type I 3-dehydroquinase. Type I 3-dehydroquinase, (3-dehydroquinate dehydratase or DHQase.) Catalyses the cis-dehydration of 3-dehydroquinate via a covalent imine intermediate giving dehydroshikimate. Dehydroquinase functions in the shikimate pathway which is involved in the biosynthesis of aromatic amino acids. Type II 3-dehydroquinase catalyses the trans-dehydration of 3-dehydroshikimate see pfam01220. Probab=21.62 E-value=60 Score=13.72 Aligned_cols=23 Identities=22% Similarity=0.346 Sum_probs=10.8 Q ss_pred HHHHHHHHHCCCCEEEECCCCCC Q ss_conf 99999999749938997378787 Q gi|254780898|r 61 TFIEYLKKEYGIKSILNLRGKLP 83 (207) Q Consensus 61 ~~~~~l~k~~GIktIInLr~~~~ 83 (207) +.+..+.+..++..|+.+|...+ T Consensus 43 ~~l~~l~~~~~~PiI~T~R~~~e 65 (222) T pfam01487 43 EQLSALRESTGLPIIFTVRTKSE 65 (222) T ss_pred HHHHHHHHHCCCCEEEEECCHHH T ss_conf 99999998389977999687788 No 195 >cd07037 TPP_PYR_MenD Pyrimidine (PYR) binding domain of 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexadiene-1-carboxylate synthase (MenD) and related proteins. Thiamine pyrophosphate (TPP family), pyrimidine (PYR) binding domain of 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexadiene-1-carboxylate (SEPHCHC) synthase (MenD) subfamily. The PYR domain is found in many key metabolic enzymes which use TPP (also known as thiamine diphosphate) as a cofactor. TPP binds in the cleft formed by a PYR domain and a PP domain. The PYR domain, binds the aminopyrimidine ring of TPP, the PP domain binds the diphosphate residue. The PYR and PP domains have a common fold, but do not share strong sequence conservation. The PP domain is not included in this sub-family. Most TPP-dependent enzymes have the PYR and PP domains on the same subunit although these domains can be alternatively arranged in the primary structure. TPP-dependent enzymes are multisubunit proteins, the smallest catalytic unit being a dim Probab=21.60 E-value=60 Score=13.72 Aligned_cols=85 Identities=18% Similarity=0.198 Sum_probs=51.6 Q ss_pred HHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHH-HHHCCCCCEEEECCCCCCHH Q ss_conf 99999749938997378787654445799999669849987526888989899999999-99427994999828998889 Q gi|254780898|r 64 EYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLIS-ILKTAPKPLLIHCKSGADRT 142 (207) Q Consensus 64 ~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~-il~~~~~PVlVHC~aG~~RT 142 (207) .+.++++||+.|.-+-+....+.++ ...+..+++++... ++......-+ ......+|..+.|.+|-+=| T Consensus 4 v~~L~~~GV~~vFg~pG~~~~~l~~---a~~~~~~i~~i~~~-------hE~~A~~mA~gyar~tgkp~v~~~t~GPG~~ 73 (162) T cd07037 4 VEELKRLGVRDVVISPGSRSAPLAL---AAAEHPEFRLHVRV-------DERSAAFFALGLAKASGRPVAVVCTSGTAVA 73 (162) T ss_pred HHHHHHCCCCEEEECCCCCHHHHHH---HHHHCCCCEEEEEC-------CHHHHHHHHHHHHHHHCCCEEEEECCCCHHH T ss_conf 9999987999999907702899999---99858997699536-------6778999999999987998799947883588 Q ss_pred HHHHHHHHHHHCCCCH Q ss_conf 9999999999839998 Q gi|254780898|r 143 GLASAVYLYIVAHYPK 158 (207) Q Consensus 143 G~~~alyli~~~g~~~ 158 (207) -++.++.-....+.+. T Consensus 74 N~~~gl~~A~~d~~Pv 89 (162) T cd07037 74 NLLPAVVEAYYSGVPL 89 (162) T ss_pred HHHHHHHHHHHCCCCE T ss_conf 8889999875329988 No 196 >cd01545 PBP1_SalR Ligand-binding domain of DNA transcription repressor SalR, a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor SalR, a member of the LacI-GalR family of bacterial transcription regulators. The SalR binds to glucose based compound Salicin which is chemically related to aspirin. The ligand-binding of SalR is structurally homologous to the periplasmic sugar-binding domain of ABC-transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand bind Probab=21.57 E-value=60 Score=13.72 Aligned_cols=87 Identities=11% Similarity=0.101 Sum_probs=45.8 Q ss_pred HHHHHHHCCCCEEEECCCCCCCC----CCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCC Q ss_conf 99999974993899737878765----44457999996698499875268889898999999999942799499982899 Q gi|254780898|r 63 IEYLKKEYGIKSILNLRGKLPES----WHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKSG 138 (207) Q Consensus 63 ~~~l~k~~GIktIInLr~~~~~~----~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~aG 138 (207) ..+++.+.|-+.|.-+.++.... ......+++++.|+.....-+....--.+........+++..++|-.|.|... T Consensus 109 a~~~L~~~G~~~I~~i~g~~~~~~~~~R~~Gf~~a~~~~gi~~~~~~i~~~~~s~~~~~~~~~~ll~~~~~~tai~~~nD 188 (270) T cd01545 109 MTRHLIDLGHRRIAFIAGPPDHRASAERLEGYRDALAEAGLPLDPELVAQGDFTFESGLEAAEALLALPDRPTAIFASND 188 (270) T ss_pred HHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCCHHHHHHHHHHHHCCCCCCEEEECCH T ss_conf 99999974996599936997773699999999999998399977212202653102699999999855999858997888 Q ss_pred CCHHHHHHHHH Q ss_conf 88899999999 Q gi|254780898|r 139 ADRTGLASAVY 149 (207) Q Consensus 139 ~~RTG~~~aly 149 (207) .-=.|++.+|. T Consensus 189 ~~A~g~~~~l~ 199 (270) T cd01545 189 DMAAGVLAVAH 199 (270) T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 197 >pfam04582 Reo_sigmaC Reovirus sigma C capsid protein. Probab=21.47 E-value=39 Score=14.91 Aligned_cols=47 Identities=21% Similarity=0.254 Sum_probs=25.5 Q ss_pred HHHHHHHHHHHHCCCC-------CEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 8999999999942799-------4999828998889999999999983999899999610 Q gi|254780898|r 114 DEQIKQLISILKTAPK-------PLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQLS 166 (207) Q Consensus 114 ~~~i~~~~~il~~~~~-------PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el~ 166 (207) ..++.+|.....-..+ -|.+|| ||. ||-.+.- -.+..++....-.|. T Consensus 194 ~AqLmqFqW~vrge~gss~sidm~VnAH~-HGr-RTDy~mS----s~~slTvT~~~~sLv 247 (326) T pfam04582 194 DAQLMQFQWMAKGEDGSSDSIDMTVNAHC-HGR-RTDYMMS----TTGSLTVTSNPVSLV 247 (326) T ss_pred HHHHHHEEEEEECCCCCCCEEEEEEEEEE-CCC-CCEEEEE----CCCCCEECCCCEEEE T ss_conf 77765302222126788772677898771-586-5436663----367606516736999 No 198 >COG0160 GabT 4-aminobutyrate aminotransferase and related aminotransferases [Amino acid transport and metabolism] Probab=21.39 E-value=61 Score=13.69 Aligned_cols=48 Identities=17% Similarity=0.073 Sum_probs=30.6 Q ss_pred CCCCCHHHHHHHHHHHHCCCCCEEE-ECCCCCCHHHHHHHHHHHHHCCCCHH Q ss_conf 8898989999999999427994999-82899888999999999998399989 Q gi|254780898|r 109 TRELNDEQIKQLISILKTAPKPLLI-HCKSGADRTGLASAVYLYIVAHYPKE 159 (207) Q Consensus 109 ~~~~~~~~i~~~~~il~~~~~PVlV-HC~aG~~RTG~~~alyli~~~g~~~~ 159 (207) ...|++.-++.+.++.++.+-++.+ --++|.+|||.+.|. ...|..++ T Consensus 237 ~~v~p~~fl~~l~~~~~~~gillI~DEVQtG~GRTG~~fa~---E~~gv~PD 285 (447) T COG0160 237 IIVPPKGFLKALRKLCREHGILLIADEVQTGFGRTGKMFAF---EHFGVEPD 285 (447) T ss_pred CCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCHHH---HHCCCCCC T ss_conf 74798999999999999819889995665588866260122---42099987 No 199 >cd06819 PLPDE_III_LS_D-TA Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Low Specificity D-Threonine Aldolase. Low specificity D-threonine aldolase (Low specificity D-TA, EC 4.3.1.18), encoded by dtaAS gene from Arthrobacter sp. strain DK-38, is the prototype of this subfamily. Low specificity D-TAs are fold type III PLP-dependent enzymes that catalyze the interconversion between D-threonine/D-allo-threonine and glycine plus acetaldehyde. Both PLP and divalent cations (eg. Mn2+) are required for catalytic activity. Members of this subfamily show similarity to bacterial alanine racemase (AR), which contains an N-terminal PLP-binding TIM-barrel domain and a C-terminal beta-sandwich domain. AR exists as homodimers with active sites that lie at the interface between the TIM barrel domain of one subunit and the beta-sandwich domain of the other subunit. Based on its similarity to AR, it is possible that low specificity D-TAs also form dimers in solution. Experimental data show that t Probab=21.38 E-value=61 Score=13.69 Aligned_cols=14 Identities=29% Similarity=0.639 Sum_probs=6.1 Q ss_pred EEEECCCCCCHHHH Q ss_conf 99982899888999 Q gi|254780898|r 131 LLIHCKSGADRTGL 144 (207) Q Consensus 131 VlVHC~aG~~RTG~ 144 (207) |+|-...|.+|||+ T Consensus 128 V~Ievd~G~~R~Gv 141 (358) T cd06819 128 VLVEIDVGQGRCGV 141 (358) T ss_pred EEEEECCCCCCCCC T ss_conf 99997889987788 No 200 >cd02012 TPP_TK Thiamine pyrophosphate (TPP) family, Transketolase (TK) subfamily, TPP-binding module; TK catalyzes the transfer of a two-carbon unit from ketose phosphates to aldose phosphates. In heterotrophic organisms, TK provides a link between glycolysis and the pentose phosphate pathway and provides precursors for nucleotide, aromatic amino acid and vitamin biosynthesis. In addition, the enzyme plays a central role in the Calvin cycle in plants. Typically, TKs are homodimers. They require TPP and divalent cations, such as magnesium ions, for activity. Probab=21.33 E-value=61 Score=13.68 Aligned_cols=28 Identities=29% Similarity=0.277 Sum_probs=15.1 Q ss_pred HHHHHHHHHHHH-CCCCCEEEECCCCCCH Q ss_conf 899999999994-2799499982899888 Q gi|254780898|r 114 DEQIKQLISILK-TAPKPLLIHCKSGADR 141 (207) Q Consensus 114 ~~~i~~~~~il~-~~~~PVlVHC~aG~~R 141 (207) .+.+...++-.. ..++|++|||+.=+++ T Consensus 202 ~~~i~~a~~~a~~~~~kP~~I~~~T~KGk 230 (255) T cd02012 202 VEEILAALEEAKKSKGKPTLIIAKTIKGK 230 (255) T ss_pred HHHHHHHHHHHHHCCCCCEEEEEEEEECC T ss_conf 99999999999867999589999965114 No 201 >PRK13479 2-aminoethylphosphonate--pyruvate transaminase; Provisional Probab=21.19 E-value=62 Score=13.67 Aligned_cols=48 Identities=17% Similarity=0.270 Sum_probs=23.6 Q ss_pred HHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCC Q ss_conf 799999669849987526888989899999999994279949998289 Q gi|254780898|r 90 EEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKS 137 (207) Q Consensus 90 e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~a 137 (207) -.+.|+..|+....+..+-...++.+.+++.++--.+...=.++||-. T Consensus 95 ~~~ia~~~g~~v~~l~~~~g~~~~~~~ve~~L~~~~~~~~v~~vH~ET 142 (368) T PRK13479 95 IAKIAEYLGIAHVVLDTGEDEPPDAAAVEAALAADPRITHVALVHCET 142 (368) T ss_pred HHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHCCCCCEEEEEECCC T ss_conf 999999819975998789999889999999997499967899985067 No 202 >COG1054 Predicted sulfurtransferase [General function prediction only] Probab=21.16 E-value=62 Score=13.66 Aligned_cols=58 Identities=17% Similarity=0.169 Sum_probs=33.7 Q ss_pred CCCCHHHHHHHHHHH----HC-CCCCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCHHHHHHH Q ss_conf 898989999999999----42-799499982899888999999999998399989999961000264267424599999 Q gi|254780898|r 110 RELNDEQIKQLISIL----KT-APKPLLIHCKSGADRTGLASAVYLYIVAHYPKEEAHRQLSMLYGHFPVLKTITMDIT 183 (207) Q Consensus 110 ~~~~~~~i~~~~~il----~~-~~~PVlVHC~aG~~RTG~~~alyli~~~g~~~~eA~~el~~~ygh~~~~~~~~ld~~ 183 (207) -.|+-+....|..-+ +. .++||++.|..|. |.--.++. |+.+| |.++.+.+.|++.+. T Consensus 149 v~p~~~tFrefP~~v~~~~~~~~~KkVvmyCTGGI-RCEKas~~--m~~~G-------------F~eVyhL~GGIl~Y~ 211 (308) T COG1054 149 VEPDIETFREFPAWVEENLDLLKDKKVVMYCTGGI-RCEKASAW--MKENG-------------FKEVYHLEGGILKYL 211 (308) T ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCCEEEECCCCE-EEHHHHHH--HHHHC-------------CHHHHCCCCHHHHHH T ss_conf 58873656655799999887625995899848864-00462899--98705-------------135312020587876 No 203 >cd03371 TPP_PpyrDC Thiamine pyrophosphate (TPP) family, PpyrDC subfamily, TPP-binding module; composed of proteins similar to phosphonopyruvate decarboxylase (PpyrDC) proteins. PpyrDC is a homotrimeric enzyme which functions in the biosynthesis of C-P compounds such as bialaphos tripeptide in Streptomyces hygroscopicus. These proteins require TPP and divalent metal cation cofactors. Probab=20.65 E-value=63 Score=13.60 Aligned_cols=28 Identities=18% Similarity=0.274 Sum_probs=16.9 Q ss_pred CCHHHHHHHHHHHHCCCCCEEEE--CCCCC Q ss_conf 89899999999994279949998--28998 Q gi|254780898|r 112 LNDEQIKQLISILKTAPKPLLIH--CKSGA 139 (207) Q Consensus 112 ~~~~~i~~~~~il~~~~~PVlVH--C~aG~ 139 (207) -+.+++...++-..+.++|++|+ |..|. T Consensus 137 ~~~~el~~al~~al~~~gP~~ieV~idpg~ 166 (188) T cd03371 137 PSLEELVAALAKALAADGPAFIEVKVRPGS 166 (188) T ss_pred CCHHHHHHHHHHHHHCCCCEEEEEEECCCC T ss_conf 899999999999984899999999978999 No 204 >pfam00106 adh_short short chain dehydrogenase. This family contains a wide variety of dehydrogenases. Probab=20.46 E-value=64 Score=13.57 Aligned_cols=76 Identities=11% Similarity=0.057 Sum_probs=40.4 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCC--CEEEECCCC Q ss_conf 999999997499389973787876544457999996698499875268889898999999999942799--499982899 Q gi|254780898|r 61 TFIEYLKKEYGIKSILNLRGKLPESWHKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPK--PLLIHCKSG 138 (207) Q Consensus 61 ~~~~~l~k~~GIktIInLr~~~~~~~~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~--PVlVHC~aG 138 (207) ..+...+.+.|-+.|+......+........+..+..|.+...+.. +.-+++.++.+.+.+.+..+ -++|||... T Consensus 14 ~aia~~la~~Ga~~vv~~~r~~~~~~~~~~~~~~~~~g~~~~~~~~---Dv~~~~~v~~~~~~~~~~~g~iD~linnAG~ 90 (167) T pfam00106 14 LALARWLAAEGARHLVLVSRRGDAPGAAELVAELEALGAEVTVAAC---DVADRDALAALLAALPAALGPLDGVVHNAGV 90 (167) T ss_pred HHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEEC---CCCCHHHHHHHHHHHHHHCCCCCEEEECCCC T ss_conf 9999999987994899965996768999999999955985999984---6999999999999999975997399988712 Q ss_pred C Q ss_conf 8 Q gi|254780898|r 139 A 139 (207) Q Consensus 139 ~ 139 (207) . T Consensus 91 ~ 91 (167) T pfam00106 91 L 91 (167) T ss_pred C T ss_conf 6 No 205 >cd06821 PLPDE_III_D-TA Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme D-Threonine Aldolase. D-threonine aldolase (D-TA, EC 4.3.1.18) reversibly catalyzes the aldol cleavage of D-threonine into glycine and acetaldehyde, and the synthesis of D-threonine from glycine and acetaldehyde. Its activity is present in several genera of bacteria but not in fungi. It requires PLP and a divalent cation such as Co2+, Ni2+, Mn2+, or Mg2+ as cofactors for catalytic activity and thermal stability. Members of this subfamily show similarity to bacterial alanine racemase (AR), a fold type III PLP-dependent enzyme which contains an N-terminal PLP-binding TIM-barrel domain and a C-terminal beta-sandwich domain. AR exists as homodimers with active sites that lie at the interface between the TIM barrel domain of one subunit and the beta-sandwich domain of the other subunit. Based on its similarity to AR, it is possible that low specificity D-TAs also form dimers in solution. Experimental data show that Probab=20.46 E-value=64 Score=13.57 Aligned_cols=34 Identities=26% Similarity=0.407 Sum_probs=22.0 Q ss_pred CHHHHHHHHHHHHCCC--CCEEEECCCCCCHHHHHH Q ss_conf 9899999999994279--949998289988899999 Q gi|254780898|r 113 NDEQIKQLISILKTAP--KPLLIHCKSGADRTGLAS 146 (207) Q Consensus 113 ~~~~i~~~~~il~~~~--~PVlVHC~aG~~RTG~~~ 146 (207) +.+.+..+.+.....+ -+|+|-+..|.+|||+.. T Consensus 111 s~~~~~~l~~~~~~~~~~~~V~ievD~G~~R~Gv~~ 146 (361) T cd06821 111 DLEAAEALSAAAGSAGLTLSVLLDVNTGMNRTGIAP 146 (361) T ss_pred CHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCC T ss_conf 899999999999866996069999767888688888 No 206 >COG5495 Uncharacterized conserved protein [Function unknown] Probab=20.38 E-value=64 Score=13.56 Aligned_cols=25 Identities=28% Similarity=0.598 Sum_probs=12.9 Q ss_pred HCCCCCEEEECCCCCCHHHHHHHHHH Q ss_conf 42799499982899888999999999 Q gi|254780898|r 125 KTAPKPLLIHCKSGADRTGLASAVYL 150 (207) Q Consensus 125 ~~~~~PVlVHC~aG~~RTG~~~alyl 150 (207) .+.++-+++|| +|.+-.|++.-+-+ T Consensus 88 ~~rpg~iv~Hc-Sga~~~~il~~~gr 112 (289) T COG5495 88 LNRPGTIVAHC-SGANGSGILAPLGR 112 (289) T ss_pred CCCCCEEEEEC-CCCCCHHHHHHHHH T ss_conf 46898389974-68874045425421 No 207 >pfam09265 Cytokin-bind Cytokinin dehydrogenase 1, FAD and cytokinin binding. Members of this family adopt an alpha+beta sandwich structure with an antiparallel beta-sheet, in a ferredoxin-like fold. They are predominantly found in plant cytokinin dehydrogenase 1, where they are capable of binding both FAD and cytokinin substrates. The substrate displays a 'plug-into-socket' binding mode that seals the catalytic site and precisely positions the carbon atom undergoing oxidation in close contact with the reactive locus of the flavin. Probab=20.35 E-value=64 Score=13.56 Aligned_cols=24 Identities=29% Similarity=0.423 Sum_probs=12.9 Q ss_pred HHHHCC--CCCEEEECCCC-C--CHHHHH Q ss_conf 999427--99499982899-8--889999 Q gi|254780898|r 122 SILKTA--PKPLLIHCKSG-A--DRTGLA 145 (207) Q Consensus 122 ~il~~~--~~PVlVHC~aG-~--~RTG~~ 145 (207) .++.+. .||+||+=..- + +|+-++ T Consensus 162 ~il~~~~~~GpiLvYP~~r~kwd~r~s~v 190 (280) T pfam09265 162 GILKNTNNSGPILVYPMNRSKWDDRMSAV 190 (280) T ss_pred HHHHCCCCCCEEEEEECCCCCCCCCCCCC T ss_conf 99745799861899864476778876544 No 208 >cd06271 PBP1_AglR_RafR_like Ligand-binding domain of DNA transcription repressors specific for raffinose (RafR) and alpha-glucosides (AglR) which are members of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressors specific for raffinose (RafR) and alpha-glucosides (AglR) which are members of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the represso Probab=20.33 E-value=64 Score=13.55 Aligned_cols=93 Identities=15% Similarity=0.103 Sum_probs=49.9 Q ss_pred HHHHHHCCCCEEEECCCCCCCCC----CHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCC Q ss_conf 99999749938997378787654----44579999966984998752688898989999999999427994999828998 Q gi|254780898|r 64 EYLKKEYGIKSILNLRGKLPESW----HKEEEKAANDLGIQLINFPLSATRELNDEQIKQLISILKTAPKPLLIHCKSGA 139 (207) Q Consensus 64 ~~l~k~~GIktIInLr~~~~~~~----~~~e~~~~~~~gi~~i~ip~~~~~~~~~~~i~~~~~il~~~~~PVlVHC~aG~ 139 (207) ...+.+.|-|.|.-+.+...... .....++++++|+.....-+...........+.+.++++..+.|-.|-|.... T Consensus 112 ~~~L~~~G~~~I~~i~~~~~~~~~~~R~~Gf~~al~~~gl~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~p~Ai~~~nD~ 191 (268) T cd06271 112 VRRLIALGHRRIALLNPPEDLTFAQHRRAGYRRALAEAGLPLDPALIVSGDMTEEGGYAAAAELLALPDRPTAIVCSSEL 191 (268) T ss_pred HHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCHHHHHHHHHHHHHCCCCCCEEEECCHH T ss_conf 99999859874875488754607999999999999983999771136506875777999999999659998689977779 Q ss_pred CHHHHHHHHHHHHHCCCCHH Q ss_conf 88999999999998399989 Q gi|254780898|r 140 DRTGLASAVYLYIVAHYPKE 159 (207) Q Consensus 140 ~RTG~~~alyli~~~g~~~~ 159 (207) -=.|++-+|.. .|.... T Consensus 192 ~A~g~l~~l~~---~g~~vP 208 (268) T cd06271 192 MALGVLAALAE---AGLRPG 208 (268) T ss_pred HHHHHHHHHHH---CCCCCC T ss_conf 99999999998---299989 No 209 >TIGR00418 thrS threonyl-tRNA synthetase; InterPro: IPR002320 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Threonyl-tRNA synthetase (6.1.1.3 from EC) exists as a monomer and belongs to class IIa. The enzyme from Escherichia coli represses the translation of its own mRNA. The crystal structure of the complex between tRNA(Thr) and ThrRS show structural features that reveal novel strategies for providing specificity in tRNA selection. These include an amino-terminal domain containing a novel protein fold that makes minor groove contacts with the tRNA acceptor stem. The enzyme induces a large deformation of the anticodon loop, resulting in an interaction between two adjacent anticodon bases, which accounts for their prominent role in tRNA identity and translational regulation. A zinc ion found in the active site is implicated in amino acid recognition/discrimination . The zinc ion may act to ensure that only amino acids that possess a hydroxyl group attached to the beta-position are activated .; GO: 0004829 threonine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006435 threonyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=20.06 E-value=50 Score=14.22 Aligned_cols=31 Identities=16% Similarity=0.112 Sum_probs=15.9 Q ss_pred HCCCCCEEEECCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 427994999828998889999999999983999 Q gi|254780898|r 125 KTAPKPLLIHCKSGADRTGLASAVYLYIVAHYP 157 (207) Q Consensus 125 ~~~~~PVlVHC~aG~~RTG~~~alyli~~~g~~ 157 (207) .+...||.|||.= .|--=-++|. |+...+-. T Consensus 452 ~~~~~Pv~IHra~-~GS~ERfi~i-L~E~~~G~ 482 (595) T TIGR00418 452 NKEKRPVMIHRAI-LGSIERFIAI-LLEKYAGA 482 (595) T ss_pred CCCCCCEEEECCC-CHHHHHHHHH-HHHHHHHH T ss_conf 8700677898044-0058999999-99875222 No 210 >pfam03918 CcmH Cytochrome C biogenesis protein. Members of this family include NrfF, CcmH, CycL, Ccl2. Probab=20.05 E-value=65 Score=13.52 Aligned_cols=57 Identities=21% Similarity=0.281 Sum_probs=35.0 Q ss_pred CHHHHHHHHHHHHCCCCCEEEECCCCC---CHHH----HHHHHHHHHHCCCCHHHHHHHHHHCCCCC Q ss_conf 989999999999427994999828998---8899----99999999983999899999610002642 Q gi|254780898|r 113 NDEQIKQLISILKTAPKPLLIHCKSGA---DRTG----LASAVYLYIVAHYPKEEAHRQLSMLYGHF 172 (207) Q Consensus 113 ~~~~i~~~~~il~~~~~PVlVHC~aG~---~RTG----~~~alyli~~~g~~~~eA~~el~~~ygh~ 172 (207) ++++-+.+.++.++.-=|+ |..-. ..++ +-.-.+.++.+|.+.+|.++.|.-+||.+ T Consensus 24 d~~~e~r~~~l~~~lRCpv---Cq~qsi~dS~a~iA~dmR~~i~~~i~~G~sd~eI~~~~~~rYG~~ 87 (145) T pfam03918 24 DPAQEARARELSKELRCPV---CQNQSIADSNAPLARDLRLEVREMLVEGKSDEEIIDYMVARYGDF 87 (145) T ss_pred CHHHHHHHHHHHHHCCCCC---CCCCCHHCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCC T ss_conf 9899999999998278888---799765007779999999999999985999999999999863861 Done!