Query gi|254780180|ref|YP_003064593.1| S-adenosylmethionine:tRNA ribosyltransferase-isomerase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 360 No_of_seqs 165 out of 1731 Neff 6.3 Searched_HMMs 39220 Date Mon May 23 07:41:06 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780180.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK00147 queA S-adenosylmethio 100.0 0 0 1065.7 33.9 345 1-360 1-345 (345) 2 COG0809 QueA S-adenosylmethion 100.0 0 0 1038.0 30.1 344 1-359 1-345 (348) 3 TIGR00113 queA S-adenosylmethi 100.0 0 0 1046.4 21.6 350 1-357 2-364 (364) 4 PRK01424 S-adenosylmethionine: 100.0 0 0 1025.2 30.8 337 1-359 1-365 (366) 5 pfam02547 Queuosine_synth Queu 100.0 0 0 589.7 20.7 217 1-232 1-218 (218) 6 TIGR00630 uvra excinuclease AB 95.5 0.0049 1.3E-07 42.8 1.0 62 174-270 875-951 (956) 7 TIGR03470 HpnH hopanoid biosyn 91.1 0.41 1E-05 29.1 4.5 127 185-343 96-228 (318) 8 PRK00635 excinuclease ABC subu 88.8 0.33 8.5E-06 29.7 2.5 10 261-270 557-566 (1809) 9 PRK00349 uvrA excinuclease ABC 88.7 0.33 8.4E-06 29.7 2.4 12 261-272 570-581 (944) 10 COG0178 UvrA Excinuclease ATPa 86.9 0.64 1.6E-05 27.6 3.0 11 261-271 562-572 (935) 11 TIGR02409 carnitine_bodg gamma 81.5 0.93 2.4E-05 26.5 1.9 20 52-71 334-353 (384) 12 PRK05301 pyrroloquinoline quin 76.8 2.8 7.3E-05 23.0 3.2 80 184-275 86-167 (375) 13 pfam10987 DUF2806 Protein of u 74.0 2 5E-05 24.2 1.8 94 263-358 69-186 (223) 14 COG0647 NagD Predicted sugar p 68.1 11 0.00027 18.9 5.3 25 257-281 202-226 (269) 15 COG4820 EutJ Ethanolamine util 65.3 2.5 6.4E-05 23.4 0.8 82 243-327 181-268 (277) 16 PRK11587 putative phosphatase; 64.8 12 0.00031 18.5 4.9 70 201-270 88-183 (218) 17 pfam02668 TauD Taurine catabol 64.2 3.2 8.1E-05 22.6 1.2 22 44-65 22-43 (253) 18 TIGR02329 propionate_PrpR prop 62.5 2.4 6.1E-05 23.6 0.3 11 153-163 427-437 (658) 19 PRK13224 consensus 60.4 14 0.00036 18.0 3.9 119 179-317 72-208 (216) 20 PRK13679 hypothetical protein; 60.0 8.4 0.00022 19.6 2.7 30 202-231 49-79 (168) 21 PRK10658 alpha-xylosidase YicI 58.8 9 0.00023 19.4 2.7 20 147-166 244-267 (772) 22 cd01425 RPS2 Ribosomal protein 55.7 17 0.00044 17.4 5.3 59 251-322 45-105 (193) 23 PRK07102 short chain dehydroge 55.5 17 0.00044 17.4 4.1 76 196-279 154-229 (243) 24 pfam11253 DUF3052 Protein of u 55.5 3.7 9.3E-05 22.2 0.2 15 190-204 96-110 (127) 25 PRK13227 consensus 55.4 16 0.00042 17.6 3.5 74 201-281 101-187 (228) 26 KOG3888 consensus 53.7 7.5 0.00019 20.0 1.6 11 21-31 20-30 (407) 27 cd03130 GATase1_CobB Type 1 gl 53.3 19 0.00048 17.1 3.6 13 5-17 8-20 (198) 28 cd05126 Mth938 Mth938 domain. 53.1 19 0.00048 17.1 5.2 43 198-268 72-114 (117) 29 cd01300 YtcJ_like YtcJ_like me 52.4 10 0.00027 18.9 2.1 10 189-198 267-276 (479) 30 COG2175 TauD Probable taurine 51.1 12 0.0003 18.5 2.3 23 286-308 194-216 (286) 31 cd05017 SIS_PGI_PMI_1 The memb 49.5 21 0.00054 16.7 3.4 100 202-329 14-115 (119) 32 PRK01077 cobyrinic acid a,c-di 49.4 21 0.00054 16.7 5.8 85 190-284 249-345 (451) 33 TIGR00075 hypD hydrogenase exp 48.5 12 0.00031 18.5 2.0 24 27-56 43-67 (384) 34 PRK13226 phosphoglycolate phos 47.5 23 0.00058 16.5 4.2 74 201-281 92-178 (221) 35 pfam09861 DUF2088 Uncharacteri 46.5 17 0.00044 17.3 2.5 66 199-272 72-140 (203) 36 PRK06139 short chain dehydroge 45.3 25 0.00063 16.3 4.2 136 180-328 151-310 (324) 37 COG2100 Predicted Fe-S oxidore 44.0 13 0.00033 18.3 1.5 134 147-304 158-303 (414) 38 PRK12311 rpsB 30S ribosomal pr 42.8 27 0.00068 16.0 5.0 15 254-268 174-188 (332) 39 cd01994 Alpha_ANH_like_IV This 42.7 27 0.00068 16.0 3.1 31 197-234 95-125 (194) 40 pfam08004 DUF1699 Protein of u 42.0 20 0.0005 16.9 2.2 79 140-263 23-105 (131) 41 cd06555 ASCH_PF0470_like ASC-1 41.7 28 0.0007 15.9 3.6 71 107-192 30-104 (109) 42 TIGR02410 carnitine_TMLD trime 41.2 16 0.0004 17.7 1.6 17 54-70 353-369 (397) 43 PRK07024 short chain dehydroge 40.4 29 0.00074 15.8 3.8 72 196-276 156-228 (256) 44 smart00684 DM15 Tandem repeat 40.4 5.2 0.00013 21.1 -0.9 21 331-351 16-36 (39) 45 COG0300 DltE Short-chain dehyd 40.4 29 0.00074 15.8 4.4 68 196-270 162-235 (265) 46 pfam01113 DapB_N Dihydrodipico 40.4 21 0.00053 16.8 2.1 27 245-272 72-98 (122) 47 PRK06101 short chain dehydroge 38.0 31 0.0008 15.5 4.0 66 198-270 149-214 (241) 48 PRK13288 pyrophosphatase PpaX; 37.5 32 0.00081 15.5 4.0 80 192-281 81-173 (214) 49 cd03215 ABC_Carb_Monos_II This 37.0 21 0.00054 16.7 1.7 31 193-223 130-165 (182) 50 TIGR03351 PhnX-like phosphonat 36.7 33 0.00084 15.4 3.8 43 202-244 93-150 (220) 51 pfam04687 Microvir_H Microviru 36.2 17 0.00044 17.4 1.2 20 192-211 218-237 (310) 52 COG0752 GlyQ Glycyl-tRNA synth 36.2 12 0.0003 18.5 0.4 27 87-113 40-67 (298) 53 smart00095 TR_THY Transthyreti 35.7 34 0.00086 15.3 3.5 67 84-167 20-86 (121) 54 cd03224 ABC_TM1139_LivF_branch 35.5 17 0.00043 17.4 1.1 55 193-272 158-217 (222) 55 cd02759 MopB_Acetylene-hydrata 35.4 34 0.00087 15.2 3.8 23 243-265 264-286 (477) 56 KOG4132 consensus 35.3 34 0.00088 15.2 4.1 19 261-279 213-231 (260) 57 pfam00318 Ribosomal_S2 Ribosom 35.1 35 0.00088 15.2 5.0 60 251-323 45-106 (205) 58 COG5623 CLP1 Predicted GTPase 34.9 35 0.00089 15.2 3.4 16 53-68 38-53 (424) 59 TIGR01277 thiQ thiamine ABC tr 34.7 20 0.00051 16.9 1.3 16 155-173 58-73 (213) 60 COG0436 Aspartate/tyrosine/aro 34.7 35 0.0009 15.1 3.0 24 42-65 99-123 (393) 61 PRK09189 uroporphyrinogen-III 34.4 36 0.00091 15.1 6.9 79 197-279 123-214 (237) 62 PRK05855 short chain dehydroge 34.2 17 0.00044 17.3 0.9 80 198-278 473-563 (582) 63 cd03271 ABC_UvrA_II The excisi 33.2 31 0.00078 15.6 2.1 32 192-223 197-233 (261) 64 COG1797 CobB Cobyrinic acid a, 32.7 38 0.00096 14.9 3.9 198 44-268 101-329 (451) 65 TIGR01120 rpiB ribose 5-phosph 32.6 38 0.00097 14.9 3.0 60 195-271 8-67 (143) 66 PHA00369 H minor spike protein 32.4 23 0.00059 16.4 1.4 20 192-211 230-249 (337) 67 PRK05299 rpsB 30S ribosomal pr 32.4 38 0.00097 14.9 5.0 13 259-271 61-73 (255) 68 PRK13355 bifunctional HTH-doma 32.1 33 0.00083 15.4 2.1 16 5-20 158-177 (518) 69 pfam07755 DUF1611 Protein of u 31.3 40 0.001 14.8 6.2 47 261-309 112-167 (302) 70 PRK08622 galactose-6-phosphate 31.3 27 0.00069 16.0 1.5 26 191-216 5-30 (171) 71 cd05821 TLP_Transthyretin Tran 31.3 40 0.001 14.8 3.7 65 86-167 25-89 (121) 72 PRK10148 hypothetical protein; 30.9 40 0.001 14.7 2.8 25 160-185 1-25 (147) 73 PRK07904 short chain dehydroge 30.8 40 0.001 14.7 3.5 65 198-271 167-232 (253) 74 KOG0442 consensus 30.7 24 0.00062 16.3 1.2 90 149-248 599-701 (892) 75 TIGR02219 phage_NlpC_fam putat 30.3 26 0.00065 16.1 1.3 11 51-61 76-86 (135) 76 KOG1017 consensus 30.3 36 0.00092 15.1 2.0 36 264-300 197-232 (267) 77 cd06578 HemD Uroporphyrinogen- 29.8 42 0.0011 14.6 5.2 91 187-279 121-219 (239) 78 KOG0210 consensus 29.8 42 0.0011 14.6 5.2 24 52-75 180-203 (1051) 79 cd05710 SIS_1 A subgroup of th 29.8 42 0.0011 14.6 3.2 21 248-268 60-80 (120) 80 PRK07309 aromatic amino acid a 29.7 42 0.0011 14.6 2.5 29 293-321 346-376 (390) 81 PRK00556 minC septum formation 29.7 42 0.0011 14.6 3.8 15 47-61 46-60 (195) 82 TIGR03356 BGL beta-galactosida 29.2 20 0.00052 16.9 0.6 12 6-17 115-126 (427) 83 PRK00002 aroB 3-dehydroquinate 29.1 43 0.0011 14.5 6.2 56 262-318 237-303 (360) 84 PRK09700 D-allose transporter 28.8 44 0.0011 14.5 2.2 31 193-223 171-206 (510) 85 pfam01058 Oxidored_q6 NADH ubi 28.7 44 0.0011 14.5 2.7 30 249-286 55-84 (124) 86 TIGR01116 ATPase-IIA1_Ca calci 28.5 44 0.0011 14.4 3.5 109 149-292 402-512 (800) 87 smart00052 EAL Putative diguan 28.0 45 0.0011 14.4 3.2 82 201-282 135-223 (241) 88 PRK07306 ribonucleotide-diphos 27.9 42 0.0011 14.6 2.1 14 333-346 682-695 (720) 89 COG2723 BglB Beta-glucosidase/ 27.5 19 0.00048 17.1 0.2 58 203-265 345-402 (460) 90 TIGR01959 nuoF_fam NADH-quinon 27.4 46 0.0012 14.3 3.0 36 108-157 237-272 (420) 91 cd05006 SIS_GmhA Phosphoheptos 27.2 46 0.0012 14.3 3.2 21 248-268 114-134 (177) 92 TIGR02128 G6PI_arch bifunction 27.1 46 0.0012 14.3 2.9 29 42-71 41-70 (338) 93 TIGR02324 CP_lyasePhnL phospho 27.1 45 0.0011 14.4 2.0 23 194-216 176-203 (224) 94 PRK10444 UMP phosphatase; Prov 27.0 47 0.0012 14.3 5.0 19 261-279 190-208 (248) 95 cd00288 Pyruvate_Kinase Pyruva 27.0 47 0.0012 14.3 4.1 21 46-66 111-131 (480) 96 cd05762 Ig8_MLCK Eighth immuno 26.9 47 0.0012 14.3 2.7 46 97-145 32-77 (98) 97 PRK05928 hemD uroporphyrinogen 26.8 47 0.0012 14.2 4.9 85 190-279 128-226 (252) 98 pfam08981 consensus 26.7 47 0.0012 14.2 3.0 59 202-279 40-98 (181) 99 pfam03088 Str_synth Strictosid 26.6 47 0.0012 14.2 2.6 44 21-64 32-76 (89) 100 PRK00048 dihydrodipicolinate r 26.6 42 0.0011 14.6 1.8 108 136-261 69-176 (265) 101 PRK10826 2-deoxyglucose-6-phos 26.4 48 0.0012 14.2 4.7 77 198-281 94-183 (222) 102 PRK09919 hypothetical protein; 26.4 48 0.0012 14.2 3.3 40 52-105 38-79 (114) 103 PRK13478 phosphonoacetaldehyde 26.1 48 0.0012 14.2 2.1 77 198-280 103-194 (267) 104 PRK13549 xylose transporter AT 26.0 49 0.0012 14.1 2.3 31 193-223 169-204 (513) 105 PRK07683 aminotransferase A; V 25.9 49 0.0012 14.1 3.7 30 27-63 91-120 (387) 106 cd03870 M14_CPA Peptidase M14 25.7 49 0.0013 14.1 4.3 116 202-329 171-290 (301) 107 PRK07449 2-succinyl-6-hydroxy- 25.7 49 0.0013 14.1 3.0 29 254-282 415-448 (548) 108 COG0637 Predicted phosphatase/ 25.6 49 0.0013 14.1 4.8 84 185-273 80-190 (221) 109 PRK07212 consensus 25.6 49 0.0013 14.1 2.9 27 293-319 339-365 (378) 110 PRK08115 ribonucleotide-diphos 25.5 24 0.0006 16.4 0.4 145 196-346 569-761 (857) 111 PRK13645 cbiO cobalt transport 25.3 42 0.0011 14.6 1.6 31 193-223 176-212 (289) 112 PRK09348 glyQ glycyl-tRNA synt 25.3 24 0.00061 16.3 0.4 25 89-113 39-64 (291) 113 pfam06434 Aconitase_2_N Aconit 25.1 50 0.0013 14.0 4.9 50 105-162 120-169 (204) 114 PRK06203 aroB 3-dehydroquinate 25.1 50 0.0013 14.0 7.6 59 260-320 255-324 (390) 115 COG1129 MglA ABC-type sugar tr 25.1 43 0.0011 14.5 1.7 29 193-221 171-204 (500) 116 PRK13640 cbiO cobalt transport 25.0 49 0.0013 14.1 2.0 54 193-272 170-229 (283) 117 CHL00067 rps2 ribosomal protei 24.9 51 0.0013 14.0 5.1 60 251-323 53-114 (227) 118 PRK07632 ribonucleotide-diphos 24.9 51 0.0013 14.0 2.3 37 310-346 623-676 (696) 119 pfam01964 ThiC ThiC family. Th 24.8 43 0.0011 14.5 1.6 39 180-224 123-162 (421) 120 TIGR01135 glmS glucosamine--fr 24.8 51 0.0013 14.0 2.8 26 26-58 95-120 (628) 121 PRK06853 indolepyruvate oxidor 24.8 51 0.0013 14.0 2.8 15 52-66 86-100 (197) 122 PRK13223 phosphoglycolate phos 24.6 51 0.0013 14.0 4.8 87 179-269 88-201 (272) 123 KOG1876 consensus 24.5 19 0.00047 17.1 -0.3 10 306-315 114-123 (168) 124 PRK12615 galactose-6-phosphate 24.4 47 0.0012 14.3 1.7 69 190-275 4-72 (171) 125 PRK13228 consensus 24.3 52 0.0013 13.9 4.9 73 201-280 101-186 (232) 126 PRK10762 D-ribose transporter 24.2 45 0.0012 14.3 1.7 31 193-223 167-202 (501) 127 cd06246 M14_CPB2 Peptidase M14 24.2 29 0.00075 15.7 0.7 103 215-329 185-291 (300) 128 PRK08636 aspartate aminotransf 24.1 52 0.0013 13.9 2.3 43 293-338 358-402 (403) 129 TIGR00718 sda_alpha L-serine d 24.1 27 0.00068 16.0 0.5 15 183-197 103-117 (301) 130 PRK06225 aspartate aminotransf 24.0 53 0.0013 13.9 2.6 17 46-62 92-108 (375) 131 TIGR02238 recomb_DMC1 meiotic 23.6 36 0.00092 15.1 1.0 85 203-315 121-207 (314) 132 pfam00877 NLPC_P60 NlpC/P60 fa 23.5 54 0.0014 13.8 2.1 14 51-64 49-62 (105) 133 cd00733 GlyRS_alpha_core Class 23.5 25 0.00063 16.2 0.2 25 89-113 37-62 (279) 134 pfam06962 rRNA_methylase Putat 23.5 54 0.0014 13.8 3.4 59 41-117 30-90 (140) 135 cd03218 ABC_YhbG The ABC trans 23.4 48 0.0012 14.2 1.6 55 193-272 159-218 (232) 136 PRK12898 secA preprotein trans 23.3 54 0.0014 13.8 3.0 56 222-279 449-504 (673) 137 COG0149 TpiA Triosephosphate i 23.2 43 0.0011 14.5 1.4 63 181-272 65-132 (251) 138 TIGR00963 secA preprotein tran 22.9 55 0.0014 13.7 2.8 57 222-280 396-452 (904) 139 cd03270 ABC_UvrA_I The excisio 22.9 55 0.0014 13.7 2.1 32 192-223 164-200 (226) 140 TIGR03019 pepcterm_femAB FemAB 22.8 55 0.0014 13.7 2.0 19 335-353 243-261 (330) 141 PRK12903 secA preprotein trans 22.7 56 0.0014 13.7 2.7 169 149-338 329-557 (885) 142 PRK11907 bifunctional 2',3'-cy 22.6 56 0.0014 13.7 2.9 43 268-310 540-585 (810) 143 PRK12899 secA preprotein trans 22.6 56 0.0014 13.7 4.8 79 194-279 462-584 (969) 144 TIGR00034 aroFGH phospho-2-deh 22.6 56 0.0014 13.7 4.0 63 3-77 5-97 (348) 145 PRK13263 ureE urease accessory 22.6 56 0.0014 13.7 3.5 56 10-70 14-74 (207) 146 PRK06857 consensus 22.4 56 0.0014 13.7 2.3 14 153-166 5-18 (209) 147 PRK07109 short chain dehydroge 22.2 57 0.0015 13.6 5.1 127 180-317 153-302 (338) 148 PRK11590 hypothetical protein; 22.1 57 0.0015 13.6 2.1 26 328-354 160-186 (211) 149 PRK11614 livF leucine/isoleuci 22.1 45 0.0012 14.4 1.3 55 193-272 163-222 (237) 150 pfam02455 Hex_IIIa Hexon-assoc 22.0 57 0.0015 13.6 1.9 13 59-71 229-241 (491) 151 cd03217 ABC_FeS_Assembly ABC-t 21.8 54 0.0014 13.8 1.6 61 192-277 129-194 (200) 152 pfam02091 tRNA-synt_2e Glycyl- 21.5 26 0.00066 16.1 -0.0 24 90-113 37-61 (284) 153 cd02767 MopB_ydeP The MopB_yde 21.4 59 0.0015 13.5 3.1 16 300-315 470-485 (574) 154 TIGR01166 cbiO cobalt ABC tran 21.4 49 0.0012 14.1 1.3 37 188-224 139-189 (190) 155 KOG0318 consensus 21.4 30 0.00077 15.6 0.3 23 12-34 7-29 (603) 156 PRK13631 cbiO cobalt transport 21.3 48 0.0012 14.2 1.3 32 193-224 202-238 (320) 157 TIGR03314 Se_ssnA putative sel 21.3 43 0.0011 14.5 1.0 20 198-217 262-281 (441) 158 pfam00224 PK Pyruvate kinase, 21.1 60 0.0015 13.5 3.2 32 245-280 257-288 (348) 159 CHL00131 ycf16 sulfate ABC tra 21.1 60 0.0015 13.5 2.1 56 193-272 176-236 (252) 160 PRK11124 artP arginine transpo 21.0 47 0.0012 14.2 1.2 55 193-272 167-226 (242) 161 pfam01924 HypD Hydrogenase for 20.8 61 0.0015 13.4 1.8 110 207-342 117-246 (355) 162 PRK09200 preprotein translocas 20.8 61 0.0015 13.4 2.7 56 222-279 393-448 (799) 163 cd07018 S49_SppA_67K_type Sign 20.7 61 0.0016 13.4 2.6 36 183-223 102-139 (222) 164 PRK04302 triosephosphate isome 20.5 61 0.0016 13.4 2.1 65 180-274 61-127 (223) 165 COG3480 SdrC Predicted secrete 20.4 39 0.00098 14.9 0.7 60 189-250 230-298 (342) 166 PRK10895 putative ABC transpor 20.4 57 0.0015 13.6 1.5 55 193-272 163-222 (241) 167 KOG3987 consensus 20.3 62 0.0016 13.4 4.1 64 199-268 92-160 (288) 168 TIGR01829 AcAcCoA_reduct aceto 20.3 36 0.00093 15.0 0.5 31 181-218 148-179 (244) 169 TIGR00348 hsdR type I site-spe 20.3 36 0.00092 15.1 0.5 32 290-321 717-751 (813) 170 PRK13548 hmuV hemin importer A 20.2 61 0.0016 13.4 1.6 80 193-303 166-253 (257) 171 PRK05439 pantothenate kinase; 20.1 63 0.0016 13.3 2.2 34 189-225 93-126 (312) No 1 >PRK00147 queA S-adenosylmethionine:tRNA ribosyltransferase-isomerase; Provisional Probab=100.00 E-value=0 Score=1065.66 Aligned_cols=345 Identities=48% Similarity=0.788 Sum_probs=333.8 Q ss_pred CCCCCCCCCCCHHHHCCCCCCCCCCCEEEEECCCCCCCCEEEEEEHHHHHHHCCCCCEEEEECCCEEEEEEEEEEECCCC Q ss_conf 98633588889778616998781203135663788875315310143478726988799994795688899876404677 Q gi|254780180|r 1 MMVKEFDFDLPPSRIALRPVFPRDSARLMVVHPNLSCPLVISDHLVSDLPAFLNSNDAIVFNNTKVITAQLNGVRFCHIN 80 (360) Q Consensus 1 M~lsdfDy~LP~elIAq~P~~~Rd~SRLLV~~r~~~~~~~i~h~~F~dl~~~L~~gDLLV~NnTKVipARL~g~k~~~~~ 80 (360) |+||||||+||+|||||+|+++||+|||||++++++ .|+|+.|+||++||+||||||||||||+||||+|+| T Consensus 1 M~lsdfdy~LP~elIAq~P~~~Rd~sRLLV~~~~~~---~i~h~~F~dl~~~L~~gDlLV~NnTkVipARL~g~k----- 72 (345) T PRK00147 1 MKLSDFDFDLPEELIAQYPLEPRDASRLLVLDRRTG---EIEHRHFRDLLDYLRPGDLLVFNDTRVIPARLFGRK----- 72 (345) T ss_pred CCHHHCCCCCCHHHHCCCCCCCCCCCEEEEEECCCC---CEEEEEHHHHHHHCCCCCEEEEECCEEEEEEEEEEE----- T ss_conf 963342989995997579999866464889988999---477645644585479998999978989998999992----- Q ss_pred CCCCEEEEEEECCCCCCCHHHHHHCCCCCCCCCEEEECCCCCCCCEEEEEEECCCCCCEEEEECCCCCCHHHHHHHHHCC Q ss_conf 77650589981124866101122115643698579970578874069999603557724999726786300025442035 Q gi|254780180|r 81 RREKEISCTLHMRVSPNSWSVYARPSKVIKKGDILHFFSQDGQSRLEATVIDKWNTGEILLVFSISGIELERQISLVGTI 160 (360) Q Consensus 81 ~~g~~ieill~~~~~~~~w~~l~kp~kklk~G~~l~~~~~~~~~~l~~~vi~~~~~g~~~i~f~~~~~~~~~~L~~~G~i 160 (360) .+|++||++++++.+++.|.|++||+||+|+|++|.|.+ .+.++|+++.++|.+.++|+.++.++.++|+++|+| T Consensus 73 ~tgg~vEilll~~~~~~~w~~lvk~~kk~k~G~~l~~~~-----~~~~~v~~~~~~g~~~l~f~~~~~~~~~~l~~~G~~ 147 (345) T PRK00147 73 ETGGKIEVLLLRPLGEDRWLALVRPAKRPKPGTKLLFGD-----GLKAEVLERLEDGGRILRFSYDGEIFLELLEELGHM 147 (345) T ss_pred CCCCEEEEEEEEECCCCEEEEEEECCCCCCCCCEEEECC-----CEEEEEEEECCCCEEEEEECCCCHHHHHHHHHCCCC T ss_conf 689879999987748987999995488899999998179-----848999996389769999758842799999975998 Q ss_pred CCCHHHCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 78522212345652114331013456757400555688789999999997215530378731656657887410135667 Q gi|254780180|r 161 PLPPYIARKRPIDARDYVDYQTTYAKIQGSVAAPTAGLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIM 240 (360) Q Consensus 161 PLPPYI~r~r~~~~~D~~~YQTVyA~~~GSVAAPTAGLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~m 240 (360) |||||| +|++++.|.+|||||||+++|||||||||||||++||++|++|||++++||||||+|||+||++|+|++|+| T Consensus 148 PLPPYI--kr~~~~~D~~rYQTVyA~~~GSVAAPTAGLHFt~~ll~~L~~kGi~~~~vTLHVG~GTF~Pv~~e~i~~H~m 225 (345) T PRK00147 148 PLPPYI--KRPDEDADRERYQTVYAKEPGAVAAPTAGLHFTEELLEKLKAKGVEIAFVTLHVGAGTFRPVRVEDIEEHKM 225 (345) T ss_pred CCCCCC--CCCCCHHHHHHHHHHHCCCCCCEECCCCCCCCCHHHHHHHHHCCCEEEEEEEEECCCCCCCCCCCCCCCCCC T ss_conf 998302--577653468876465427998343475778789999999997797289999985566575555455455777 Q ss_pred CCHHEEECHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHHHCCCCCCCCCCCCCEEECCCCCCEEEEEEEECCCCCHHHH Q ss_conf 70007785999999998753798099962878998999985289846888722366574096114312552775545789 Q gi|254780180|r 241 HSEIGFIDAPTAQALNSVKSRGGRIVSVGTTSLRLLETATTEDGIINPWSGFTNIFITPGYRFRAVDILMSNFHLPKSTL 320 (360) Q Consensus 241 H~E~~~i~~~~~~~I~~ak~~g~rIiAVGTT~~R~LEs~~~~~~~~~~~~g~T~lfI~Pg~~f~~vd~LiTNFH~P~Stl 320 (360) |+|||+||++|+++||+||++|||||||||||||||||+++.+|.+++++|||||||+|||+|++||+||||||+||||| T Consensus 226 H~E~~~I~~~t~~~I~~ak~~g~rIiAVGTT~~RaLEsa~~~~g~~~~~~g~TdlfI~Pgy~f~~vd~LiTNFHlPkStL 305 (345) T PRK00147 226 HSEWYEVSQETADAINATKARGGRVIAVGTTSVRTLESAARADGELKPFSGETDIFIYPGYRFKVVDALITNFHLPKSTL 305 (345) T ss_pred CCEEEEECHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHCCCCCCCCCCCCEEECCCCCCCEECEEEECCCCCCCHH T ss_conf 66589989999999998886498699992528999999997379704677844879789997618187422573783489 Q ss_pred HHHHHHHHCHHHHHHHHHHHHHCCCEEECCCCHHHCCCCC Q ss_conf 9999987198999999999998698261101022233679 Q gi|254780180|r 321 LMLVSAFCGIEETKKMYQHAISHAYRFYSYGDTSLLFQKR 360 (360) Q Consensus 321 l~Lv~Af~G~~~~~~~Y~~Ai~~~yrF~syGDamLi~~~~ 360 (360) |||||||+|+++|+++|++||+++||||||||||||+|.. T Consensus 306 lmLVsAf~G~~~~~~aY~~Ai~~~YrF~SyGDamLi~~~~ 345 (345) T PRK00147 306 LMLVSAFAGRERILAAYKHAIEEKYRFFSYGDAMLILRNA 345 (345) T ss_pred HHHHHHHCCHHHHHHHHHHHHHCCCCEECCCCHHEECCCC T ss_conf 9999996797999999999998598345324230043689 No 2 >COG0809 QueA S-adenosylmethionine:tRNA-ribosyltransferase-isomerase (queuine synthetase) [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=1038.03 Aligned_cols=344 Identities=51% Similarity=0.815 Sum_probs=330.9 Q ss_pred CCCCCCCCCCCHHHHCCCCCCCCCCCEEEEECCCCCCCCEEEEEEHHHHHHHCCCCCEEEEECCCEEEEEEEEEEECCCC Q ss_conf 98633588889778616998781203135663788875315310143478726988799994795688899876404677 Q gi|254780180|r 1 MMVKEFDFDLPPSRIALRPVFPRDSARLMVVHPNLSCPLVISDHLVSDLPAFLNSNDAIVFNNTKVITAQLNGVRFCHIN 80 (360) Q Consensus 1 M~lsdfDy~LP~elIAq~P~~~Rd~SRLLV~~r~~~~~~~i~h~~F~dl~~~L~~gDLLV~NnTKVipARL~g~k~~~~~ 80 (360) |+++||||+||++||||+|+++||+|||||++++++ .+.|+.|.||+|||+|||+|||||||||||||+|.|. T Consensus 1 m~~~~fdf~LPeelIAq~P~~~R~~sRLLvl~~~~~---~~~h~~F~dl~dyl~~GD~LVfNdTrVIpARl~G~k~---- 73 (348) T COG0809 1 MRLSDFDFDLPEELIAQYPAEPRDSSRLLVLDRKTG---EIEHRTFYDLIDYLNPGDLLVFNDTRVIPARLFGRKH---- 73 (348) T ss_pred CCCCCCCCCCCHHHHHCCCCCCCCCCEEEEEECCCC---CEECCCHHHHHHHCCCCCEEEEECCEEECHHEEECCC---- T ss_conf 956557888997897169998676533899847888---3332545456753487878998368076032050027---- Q ss_pred CCCCEEEEEEECCCCCCCHHHHHHCCCCCCCCCEEEECCCCCCCCEEEEEEECCCCCCEEEEECCCCC-CHHHHHHHHHC Q ss_conf 77650589981124866101122115643698579970578874069999603557724999726786-30002544203 Q gi|254780180|r 81 RREKEISCTLHMRVSPNSWSVYARPSKVIKKGDILHFFSQDGQSRLEATVIDKWNTGEILLVFSISGI-ELERQISLVGT 159 (360) Q Consensus 81 ~~g~~ieill~~~~~~~~w~~l~kp~kklk~G~~l~~~~~~~~~~l~~~vi~~~~~g~~~i~f~~~~~-~~~~~L~~~G~ 159 (360) .+|++||+++++..+++.|.|++||+|++|+|+++.|.+. ++++|+++.+.|.+.++|+..+. .+.++|+++|+ T Consensus 74 ~~g~~vEvll~~~~~~~~w~al~~~~kr~k~G~~i~f~~~-----l~a~v~e~~~~g~~~l~F~~~~~~~l~e~L~~~G~ 148 (348) T COG0809 74 ESGGKVEVLLERRLDDNRWLALIKPSKRLKAGDEIYFGDG-----LKATVLERLEHGLRLLEFDYEGIFSLLELLEKYGE 148 (348) T ss_pred CCCCEEEEEEEEECCCCCEEEEECCCCCCCCCCEEEECCC-----CEEEEEEECCCCEEEEEEECCCCHHHHHHHHHCCC T ss_conf 8896599998726178827999432567888998994798-----26999996288529999945785059999997599 Q ss_pred CCCCHHHCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCC Q ss_conf 57852221234565211433101345675740055568878999999999721553037873165665788741013566 Q gi|254780180|r 160 IPLPPYIARKRPIDARDYVDYQTTYAKIQGSVAAPTAGLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHI 239 (360) Q Consensus 160 iPLPPYI~r~r~~~~~D~~~YQTVyA~~~GSVAAPTAGLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~ 239 (360) |||||||+ |+.++.|++||||||||.+|||||||||||||++||++|++|||+++|||||||+|||+||++++|++|+ T Consensus 149 ~PLPPYI~--~~~~~~d~~~YQTVYak~~GsvAAPTAGLHFt~~LL~kLk~kGv~~afvTLHVGaGTF~pV~~~~i~eH~ 226 (348) T COG0809 149 MPLPPYIK--RKLDELDRDRYQTVYAKEPGSVAAPTAGLHFTEELLEKLKAKGVEIAFVTLHVGAGTFRPVKVENIEEHK 226 (348) T ss_pred CCCCCCCC--CCCCCCCHHHCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCEEEEEEEEECCCCCCCCEECCCCCCC T ss_conf 89981338--7622124443246642588720047687778999999999779539999999646554341204433444 Q ss_pred CCCHHEEECHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHHHCCCCCCCCCCCCCEEECCCCCCEEEEEEEECCCCCHHH Q ss_conf 77000778599999999875379809996287899899998528984688872236657409611431255277554578 Q gi|254780180|r 240 MHSEIGFIDAPTAQALNSVKSRGGRIVSVGTTSLRLLETATTEDGIINPWSGFTNIFITPGYRFRAVDILMSNFHLPKST 319 (360) Q Consensus 240 mH~E~~~i~~~~~~~I~~ak~~g~rIiAVGTT~~R~LEs~~~~~~~~~~~~g~T~lfI~Pg~~f~~vd~LiTNFH~P~St 319 (360) ||+|||+||++|+++||++|++|||||||||||+|+|||+++.++ +++++|||||||||||+|++||+||||||+|||| T Consensus 227 MH~E~~~v~~eta~~i~~~k~~GgRIiaVGTTs~R~LEsa~~~~~-~~~~sg~T~IFI~PGy~~~~vD~LiTNFHlPkST 305 (348) T COG0809 227 MHSEYYEVPQETADAINAAKARGGRIIAVGTTSVRTLESAAREAG-LKAFSGWTDIFIYPGYRFKVVDALITNFHLPKST 305 (348) T ss_pred CCHHHEECCHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHCCCC-CCCCCCCCCEEECCCCCCEECCEEHCCCCCCCCH T ss_conf 305523259999999999997398499981615777887751377-6767676217862798526701100047687238 Q ss_pred HHHHHHHHHCHHHHHHHHHHHHHCCCEEECCCCHHHCCCC Q ss_conf 9999998719899999999999869826110102223367 Q gi|254780180|r 320 LLMLVSAFCGIEETKKMYQHAISHAYRFYSYGDTSLLFQK 359 (360) Q Consensus 320 ll~Lv~Af~G~~~~~~~Y~~Ai~~~yrF~syGDamLi~~~ 359 (360) ||||||||+|++++++||++||+++||||||||||||+|+ T Consensus 306 LlMLVsAFaG~~~~~~aY~~Ai~~~YRFfSyGDamlI~~~ 345 (348) T COG0809 306 LLMLVSAFAGREELLAAYKHAIEQKYRFFSYGDAMLILRN 345 (348) T ss_pred HHHHHHHHCCHHHHHHHHHHHHHHCCEEEECCCEEEEEEC T ss_conf 9999998628799999999999846545312677998636 No 3 >TIGR00113 queA S-adenosylmethionine:tRNA ribosyltransferase-isomerase; InterPro: IPR003699 Queuosine is a hypermodified nucleoside that usually occurs in the first position of the anticodon of tRNAs specifying the amino acids asparagine, aspartate, histidine, and tyrosine. The hypermodified nucleoside is found in bacteria and eukaryotes . Queuosine is synthesized de novo exclusively in bacteria; for eukaryotes the compound is a nutrient factor. Queuosine biosynthesis protein, or S-adenosylmethionine:tRNA -ribosyltransferase-isomerase, is required for the synthesis of the queuosine precursor (oQ). ; GO: 0003824 catalytic activity, 0008616 queuosine biosynthetic process. Probab=100.00 E-value=0 Score=1046.43 Aligned_cols=350 Identities=45% Similarity=0.724 Sum_probs=320.9 Q ss_pred CCCCCCCCCCCHHHHCCCCCCCCCCCEEEEECCCCCCCCEEEEEEHHHHHHHCCCCCEEEEECCCEEEEEEEEEEECCCC Q ss_conf 98633588889778616998781203135663788875315310143478726988799994795688899876404677 Q gi|254780180|r 1 MMVKEFDFDLPPSRIALRPVFPRDSARLMVVHPNLSCPLVISDHLVSDLPAFLNSNDAIVFNNTKVITAQLNGVRFCHIN 80 (360) Q Consensus 1 M~lsdfDy~LP~elIAq~P~~~Rd~SRLLV~~r~~~~~~~i~h~~F~dl~~~L~~gDLLV~NnTKVipARL~g~k~~~~~ 80 (360) |++++|||+||++||||+|..+|++|||||+|+.++ +|.|+.|.|+.+++++|||||||||||+||||||.|.... T Consensus 2 m~~~~fdf~LP~~LIA~yP~~~R~~sRLlvld~~~~---~i~H~~F~~~~~~~~~gdLlvlNnTkV~pARlfG~K~~~l- 77 (364) T TIGR00113 2 MKVSSFDFELPEELIAQYPVEPRDSSRLLVLDKQTG---KITHKTFEDVLDFFNEGDLLVLNNTKVIPARLFGRKESFL- 77 (364) T ss_pred CCCCCCCCCCCHHHHHCCCCCCCCCCCEEEECCCCC---EECCCCHHHHHHHCCCCCEEEECCCCCCCCCCCCCCCCCC- T ss_conf 733444733666887328798542401135518873---1604504776741388866884376511100012511018- Q ss_pred CCCCEEEEEEECCCCCCCHHHHHHCCCCCCCCCEEEECCC-CCCCC-EEEEEEE-CCCCCC-EEEEECCCCC-----CHH Q ss_conf 7765058998112486610112211564369857997057-88740-6999960-355772-4999726786-----300 Q gi|254780180|r 81 RREKEISCTLHMRVSPNSWSVYARPSKVIKKGDILHFFSQ-DGQSR-LEATVID-KWNTGE-ILLVFSISGI-----ELE 151 (360) Q Consensus 81 ~~g~~ieill~~~~~~~~w~~l~kp~kklk~G~~l~~~~~-~~~~~-l~~~vi~-~~~~g~-~~i~f~~~~~-----~~~ 151 (360) -|++||+++++..+++.|.+++|++||+|.|+.+.|++. ++..+ +-+.|.. ....++ +.+.|..+.. .++ T Consensus 78 -Pg~~vE~L~~~~~~~~~~~a~~k~sK~~K~G~~~~f~~~~~ge~~dl~~~i~~~~~~~~Gq~~~e~~~~~~~~nqe~~~ 156 (364) T TIGR00113 78 -PGGKVEVLLLRRLDENRFLALIKPSKKLKVGAKVKFGEDYDGEKPDLLKAIVAELLAHNGQRLLEFEFNDGPLNQEVLL 156 (364) T ss_pred -CCCEEEEEEEECCCCCEEEEEECCCCCCCCCCEEECCCCCCCCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHH T ss_conf -8740799852003685578776146760366246668765667768999999999713991799998789851257899 Q ss_pred HHHHHHHCCCCCHHHCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCC Q ss_conf 02544203578522212345652114331013456757400555688789999999997215530378731656657887 Q gi|254780180|r 152 RQISLVGTIPLPPYIARKRPIDARDYVDYQTTYAKIQGSVAAPTAGLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVK 231 (360) Q Consensus 152 ~~L~~~G~iPLPPYI~r~r~~~~~D~~~YQTVyA~~~GSVAAPTAGLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~ 231 (360) ++|+.+|+||||||| +|+++..|.++|||||||+.|||||||||||||++||++|++|||+++|||||||+|||+||+ T Consensus 157 d~L~~~G~mPLPPYI--kR~d~~~D~~~YQTvYsk~~GavAAPTAGLHF~e~ll~kL~~kgv~~~F~TLHVGaGTF~pV~ 234 (364) T TIGR00113 157 DVLESYGHMPLPPYI--KRPDEKADEERYQTVYSKKVGAVAAPTAGLHFSEELLEKLKAKGVEIAFVTLHVGAGTFRPVE 234 (364) T ss_pred HHHHHCCCCCCCCCC--CCCHHHHHHHCCCEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCCEEEEEEEEECCCCCCEE T ss_conf 887615823864888--887344000037734543542000342444579889999995798676876453054475024 Q ss_pred CCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHHHCC-C---CCCCCCCCCCEEECCCCCCEEEE Q ss_conf 4101356677000778599999999875379809996287899899998528-9---84688872236657409611431 Q gi|254780180|r 232 VEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVSVGTTSLRLLETATTED-G---IINPWSGFTNIFITPGYRFRAVD 307 (360) Q Consensus 232 ~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiAVGTT~~R~LEs~~~~~-~---~~~~~~g~T~lfI~Pg~~f~~vd 307 (360) +|+|+||.||+||+.||++++++|+++|++|+|||||||||||+||+++... + .++++.+||||||||||+|++|| T Consensus 235 ~~~i~dH~mH~E~~~vp~~~~~~~~k~~~~G~Ri~aVGTTsvRslE~~a~~~~~~~~~~~~~~~~T~iFiYPG~~~~~Vd 314 (364) T TIGR00113 235 VENIEDHVMHAEYLEVPQETVEALEKTRESGGRIIAVGTTSVRSLESAAQEADEPRDLLEPFFGDTDIFIYPGYQFKVVD 314 (364) T ss_pred ECCCCCCCCCCHHEECCHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHCCCCCHHHHHCCCCCEEECCCCEEEEEE T ss_conf 40210364200001268789999999997098399987614777988876304771002011167436760674045340 Q ss_pred EEEECCCCCHHHHHHHHHHHHCHHHHHHHHHHHHHCCCEEECCCCHHHCC Q ss_conf 25527755457899999987198999999999998698261101022233 Q gi|254780180|r 308 ILMSNFHLPKSTLLMLVSAFCGIEETKKMYQHAISHAYRFYSYGDTSLLF 357 (360) Q Consensus 308 ~LiTNFH~P~Stll~Lv~Af~G~~~~~~~Y~~Ai~~~yrF~syGDamLi~ 357 (360) ||||||||||||||||||||+|++++|++|+.||+++||||||||||||+ T Consensus 315 ~LiTNFHLPkSsLiMLVSAl~G~~~~~~~Y~~Av~~~YRFfSyGDAMlI~ 364 (364) T TIGR00113 315 GLITNFHLPKSSLIMLVSALAGRENTMNAYKTAVEEKYRFFSYGDAMLIL 364 (364) T ss_pred EEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCEEECC T ss_conf 00206627026799999998737999999999984488512215301029 No 4 >PRK01424 S-adenosylmethionine:tRNA ribosyltransferase-isomerase; Provisional Probab=100.00 E-value=0 Score=1025.22 Aligned_cols=337 Identities=44% Similarity=0.750 Sum_probs=318.1 Q ss_pred CCCCCCCCCCCHHHHCCCCCCCCCCCEEEEECCCCCCCCEEEEEEHHHHHHHCCCCCEEEEECCCEEEEEEEEEEECCCC Q ss_conf 98633588889778616998781203135663788875315310143478726988799994795688899876404677 Q gi|254780180|r 1 MMVKEFDFDLPPSRIALRPVFPRDSARLMVVHPNLSCPLVISDHLVSDLPAFLNSNDAIVFNNTKVITAQLNGVRFCHIN 80 (360) Q Consensus 1 M~lsdfDy~LP~elIAq~P~~~Rd~SRLLV~~r~~~~~~~i~h~~F~dl~~~L~~gDLLV~NnTKVipARL~g~k~~~~~ 80 (360) |++|||||+||+|||||+|+++||+|||||+++.. + |..|+||++||+||||||||||||+||||++ T Consensus 1 M~lsdfdy~LP~elIAq~P~~~Rd~SRLLV~~~~~-----~-~~~F~dl~~~L~~gDlLV~NnTkVipARL~~------- 67 (366) T PRK01424 1 MKLSDFDFDLPSELIAQYPSSERDNSDLLIAVTPP-----I-KTKFYNIIDYLKEGDLLVFNNSKVIKAKLNL------- 67 (366) T ss_pred CCHHHCCCCCCHHHHCCCCCCCCCCCEEEECCCCC-----C-EEEEECHHHHCCCCCEEEEECCEEEEEEEEC------- T ss_conf 95213388989599767998997514221517986-----4-0566030642899999999899987568854------- Q ss_pred CCCCEEEEEEECCCCCCCHHHHHHCCCCCCCCCEEEECCCCCCCCEEEEEEECCCCCCEEEEECCCCCCHHHHHHHHHCC Q ss_conf 77650589981124866101122115643698579970578874069999603557724999726786300025442035 Q gi|254780180|r 81 RREKEISCTLHMRVSPNSWSVYARPSKVIKKGDILHFFSQDGQSRLEATVIDKWNTGEILLVFSISGIELERQISLVGTI 160 (360) Q Consensus 81 ~~g~~ieill~~~~~~~~w~~l~kp~kklk~G~~l~~~~~~~~~~l~~~vi~~~~~g~~~i~f~~~~~~~~~~L~~~G~i 160 (360) |+++|+++++.++++.|.|++||+||+++|+.+.|.+ ..+.+.++.+.|.+.++|+.++.++.++|+++|+| T Consensus 68 --gk~vei~l~~~l~~~~w~~l~kp~kk~~~G~~l~~~~------~~~~v~~~~~~g~~~~~f~~~~~~~~~~l~~~G~~ 139 (366) T PRK01424 68 --GKNITINLNQKLSDNSWSAFAKPARKLNVGDEFYFDN------HKVIITEKLAMGEIKVKFELDNISVFEFLDKYGEM 139 (366) T ss_pred --CCCEEEEEEEECCCCEEEEEEECCCCCCCCCEEEECC------CEEEEEEECCCCEEEEEEECCCCCHHHHHHHCCCC T ss_conf --8823899876416874899995267874699898679------74899995069629999957998699999865999 Q ss_pred CCCHHHCCCCC----------------------------CCCCCHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHC Q ss_conf 78522212345----------------------------65211433101345675740055568878999999999721 Q gi|254780180|r 161 PLPPYIARKRP----------------------------IDARDYVDYQTTYAKIQGSVAAPTAGLHFTSNLLSRLISIG 212 (360) Q Consensus 161 PLPPYI~r~r~----------------------------~~~~D~~~YQTVyA~~~GSVAAPTAGLHFt~~ll~~L~~kG 212 (360) ||||||+|.+. +++.|.+|||||||+++|||||||||||||++||++|++|| T Consensus 140 PLPPYI~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~D~~rYQTVyA~~~GSVAAPTAGLHFt~~ll~~L~~kG 219 (366) T PRK01424 140 PLPVYIRRSHSLCHPVATLTGSKTNNNNDWIPWSNDGMTDQKNDNDRYQTIYSQIEGSVAAPTAGLHFTKDILDKLKAKG 219 (366) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCEECCCCCCCCCHHHHHHHHHCC T ss_conf 99875205554553323333332333223444333456752014666412432888840167676888999999999869 Q ss_pred CCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 55303787316566578874101356677000778599999999875379809996287899899998528984688872 Q gi|254780180|r 213 IKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVSVGTTSLRLLETATTEDGIINPWSGF 292 (360) Q Consensus 213 i~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiAVGTT~~R~LEs~~~~~~~~~~~~g~ 292 (360) |++++||||||+|||+||++|+|.+|+||+|||+||++++++||+||++|||||||||||||||||+++ +|.+++++|| T Consensus 220 v~ia~vTLHVG~GTF~PV~~e~i~~H~mH~E~~~I~~~t~~~In~ak~~g~RViAVGTT~vRaLEs~~~-~g~~~~~~g~ 298 (366) T PRK01424 220 IQTAFVTLHVGAGTFLPVKTENIHEHKMHTEYCSITPETAEIINKAKQEGRRIIAVGTTTLRTLESSCN-NGIVKAGSFE 298 (366) T ss_pred CCEEEEEEEECCCCCCCCCCCCCCCCCCCCEEEEECHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHH-CCCCCCCCCC T ss_conf 927999998657764788877633588746689979999999999987189289994516887887751-6985778883 Q ss_pred CCEEECCCCCCEEEEEEEECCCCCHHHHHHHHHHHHCHHHHHHHHHHHHHCCCEEECCCCHHHCCCC Q ss_conf 2366574096114312552775545789999998719899999999999869826110102223367 Q gi|254780180|r 293 TNIFITPGYRFRAVDILMSNFHLPKSTLLMLVSAFCGIEETKKMYQHAISHAYRFYSYGDTSLLFQK 359 (360) Q Consensus 293 T~lfI~Pg~~f~~vd~LiTNFH~P~Stll~Lv~Af~G~~~~~~~Y~~Ai~~~yrF~syGDamLi~~~ 359 (360) |||||+|||+|++||+||||||+||||||||||||+|+++|+++|++||+++||||||||||||+|| T Consensus 299 TdlfI~Pgy~fkvvD~LiTNFHlPkSTLlmLVsAfaG~~~~~~aY~~Ai~~~YRF~SyGDamLI~rk 365 (366) T PRK01424 299 TDIFITPGFKFQTADMLLTNFHFPKSTLFMLICAFAGFKEMHELYKYAIKEKMRFFSYGDATLLYRK 365 (366) T ss_pred CCEEECCCCCCCEECEEEECCCCCCCHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCHHHHEECC T ss_conf 2869789898858186454674782599999999579699999999999879861052305525336 No 5 >pfam02547 Queuosine_synth Queuosine biosynthesis protein. Queuosine (Q) biosynthesis protein, or S-adenosylmethionine:tRNA -ribosyltransferase-isomerase, is required for the synthesis of the queuosine precursor (oQ). Q is a hypermodified nucleoside usually found at the first position of the anticodon of asparagine, aspartate, histidine, and tyrosine tRNAs. Probab=100.00 E-value=0 Score=589.70 Aligned_cols=217 Identities=46% Similarity=0.754 Sum_probs=204.1 Q ss_pred CCCCCCCCCCCHHHHCCCCCCCCCCCEEEEECCCCCCCCEEEEEEHHHHHHHCCCCCEEEEECCCEEEEEEEEEEECCCC Q ss_conf 98633588889778616998781203135663788875315310143478726988799994795688899876404677 Q gi|254780180|r 1 MMVKEFDFDLPPSRIALRPVFPRDSARLMVVHPNLSCPLVISDHLVSDLPAFLNSNDAIVFNNTKVITAQLNGVRFCHIN 80 (360) Q Consensus 1 M~lsdfDy~LP~elIAq~P~~~Rd~SRLLV~~r~~~~~~~i~h~~F~dl~~~L~~gDLLV~NnTKVipARL~g~k~~~~~ 80 (360) |+||||||+||++||||+|+++||+|||||++++++ .|.|+.|+||++||+||||||||||||+||||+|+| T Consensus 1 m~ls~fdy~LP~elIAq~P~~~Rd~sRLLv~~~~~~---~i~h~~F~dl~~~L~~gDlLV~NnTkVipARL~g~k----- 72 (218) T pfam02547 1 MKLSDFDYELPEELIAQYPAEPRDSSRLLVLDRETG---EIEHRRFSDLPDYLRPGDLLVFNDTRVIPARLFGRK----- 72 (218) T ss_pred CCHHHCCCCCCHHHHCCCCCCCCCCCEEEEEECCCC---CEEEEEHHHHHHHHCCCCEEEECCCEEEEEEEEEEE----- T ss_conf 953452889993897589999976454888988999---577767867698766598899738879998999994----- Q ss_pred CCCCEEEEEEECCCCCCCHHHHHHCCCCCCCCCEEEECCCCCCCCEEEEEEECCCCCCEEEEECCC-CCCHHHHHHHHHC Q ss_conf 776505899811248661011221156436985799705788740699996035577249997267-8630002544203 Q gi|254780180|r 81 RREKEISCTLHMRVSPNSWSVYARPSKVIKKGDILHFFSQDGQSRLEATVIDKWNTGEILLVFSIS-GIELERQISLVGT 159 (360) Q Consensus 81 ~~g~~ieill~~~~~~~~w~~l~kp~kklk~G~~l~~~~~~~~~~l~~~vi~~~~~g~~~i~f~~~-~~~~~~~L~~~G~ 159 (360) .+|++||+++++.++++.|.|++||+||+|.|+++.|.+ .++++|+++.+.|.++++|.++ ..++.++|+++|+ T Consensus 73 ~tgg~vEill~~~l~~~~w~~lik~~kk~k~G~~l~~~~-----~~~~~v~~~~~~g~~~~~~~~~~~~~~~~~l~~~G~ 147 (218) T pfam02547 73 ETGGKVEVLLLRPLDDGRWLALVKPSKKLKPGDTLEFGD-----GLKAEVLERLEEGGRLVRFQFEGGEDLLELLERYGH 147 (218) T ss_pred CCCCEEEEEEEEECCCCEEEEEEEECCCCCCCCEEEECC-----CCEEEEEEEECCCCEEEEEEECCCCHHHHHHHHCCC T ss_conf 689989999985548986999996079999899999699-----818999998579866999993785319999997599 Q ss_pred CCCCHHHCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCC Q ss_conf 5785222123456521143310134567574005556887899999999972155303787316566578874 Q gi|254780180|r 160 IPLPPYIARKRPIDARDYVDYQTTYAKIQGSVAAPTAGLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKV 232 (360) Q Consensus 160 iPLPPYI~r~r~~~~~D~~~YQTVyA~~~GSVAAPTAGLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~ 232 (360) ||||||| +|+++..|.+|||||||+++|||||||||||||++||++|++|||++++||||||+|||+||++ T Consensus 148 ~PLPPYI--~R~~~~~D~~~YQTVfA~~~GSVAAPTAGLHFt~~ll~~L~~kGi~i~~iTLHVG~GTF~Pvrv 218 (218) T pfam02547 148 IPLPPYI--KRPDEEEDRERYQTVYAKKPGSVAAPTAGLHFTPELLERLKAKGVEIAFVTLHVGAGTFLPVRV 218 (218) T ss_pred CCCCCCC--CCCCCHHHHHHHCCCCCCCCCCEECCCCCCCCCHHHHHHHHHCCCEEEEEEEEECCCCCCCCCC T ss_conf 8998201--5776401266635201599984666868677999999999988984999998366987567529 No 6 >TIGR00630 uvra excinuclease ABC, A subunit; InterPro: IPR004602 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). During the process of Escherichia coli nucleotide excision repair, DNA damage recognition and processing are achieved by the action of the uvrA, uvrB, and uvrC gene products . The UvrC protein contain 4 conserved regions: a central region which interact with UvrB (Uvr domain), a Helix hairpin Helix (HhH) domain important for 5 prime incision of damage DNA and the homology regions 1 and 2 of unknown function. UvrC homology region 2 is specific for UvrC proteins, whereas UvrC homology region 1 is also shared by few other nucleases.; GO: 0009381 excinuclease ABC activity, 0006289 nucleotide-excision repair, 0009380 excinuclease repair complex. Probab=95.48 E-value=0.0049 Score=42.85 Aligned_cols=62 Identities=34% Similarity=0.548 Sum_probs=44.5 Q ss_pred CCCHHHHCCCCCCCCCCCCCCCCCCCCCH-----HHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEEC Q ss_conf 21143310134567574005556887899-----9999999721553037873165665788741013566770007785 Q gi|254780180|r 174 ARDYVDYQTTYAKIQGSVAAPTAGLHFTS-----NLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFID 248 (360) Q Consensus 174 ~~D~~~YQTVyA~~~GSVAAPTAGLHFt~-----~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~ 248 (360) -+-+..=+|.|==.+ ||-||||.+ ++|++|.++|-.+. +-+|.|| T Consensus 875 Lskr~TG~TLYiLDE-----PTTGLHf~Di~kLL~VlqrLv~~GNTvv------------------vIEHNLD------- 924 (956) T TIGR00630 875 LSKRSTGRTLYILDE-----PTTGLHFDDIKKLLEVLQRLVDKGNTVV------------------VIEHNLD------- 924 (956) T ss_pred HCCCCCCCEEEEECC-----CCCCCHHHHHHHHHHHHHHHHHCCCEEE------------------EEEECCC------- T ss_conf 078777875899628-----7977418999999999999985499699------------------9965485------- Q ss_pred HHHHHHHHHHH----------HCCCEEEEECH Q ss_conf 99999999875----------37980999628 Q gi|254780180|r 249 APTAQALNSVK----------SRGGRIVSVGT 270 (360) Q Consensus 249 ~~~~~~I~~ak----------~~g~rIiAVGT 270 (360) -|+.|- .+||.|||.|| T Consensus 925 -----VIK~ADyIIDLGPEGG~~GG~ivA~Gt 951 (956) T TIGR00630 925 -----VIKTADYIIDLGPEGGDRGGTIVASGT 951 (956) T ss_pred -----EEEECCEEEECCCCCCCCCCEEEEEEC T ss_conf -----677425789758767778887999825 No 7 >TIGR03470 HpnH hopanoid biosynthesis associated radical SAM protein HpnH. The sequences represented by this model are members of the radical SAM superfamily of enzymes (pfam04055). These enzymes utilize an iron-sulfur redox cluster and S-adenosylmethionine to carry out diverse radical mediated reactions. The members of this clade are frequently found in the same locus as squalene-hopene cyclase (SHC, TIGR01507) and other genes associated with the biosynthesis of hopanoid natural products. The linkage between SHC and this radical SAM enzyme is strong; one is nearly always observed in the same genome where the other is found. A hopanoid biosynthesis locus was described in Zymomonas mobilis consisting of the genes HpnA-E and SHC (HpnF). Continuing past SHC are found a phosphorylase enzyme (ZMO0873, i.e. HpnG, TIGR03468) and this radical SAM enzyme (ZMO0874) which we name here HpnH. Granted, in Z. mobilis, HpnH is in a convergent orientation with respect to HpnA-G, but one gene beyond HpnH Probab=91.14 E-value=0.41 Score=29.05 Aligned_cols=127 Identities=21% Similarity=0.265 Sum_probs=77.7 Q ss_pred CCCCC-CCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEE---ECHHHHHHHHHHHH Q ss_conf 56757-400555688789999999997215530378731656657887410135667700077---85999999998753 Q gi|254780180|r 185 AKIQG-SVAAPTAGLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGF---IDAPTAQALNSVKS 260 (360) Q Consensus 185 A~~~G-SVAAPTAGLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~---i~~~~~~~I~~ak~ 260 (360) |+..| -|.-.|-|+ +.++-+++++..|.....|.|. |+. ..|.+..- +=+.+++.|+.||+ T Consensus 96 a~~~g~~v~l~TNG~-Ll~k~i~~~~~~~~~~~~VsLD-G~~-------------e~HD~~r~~~G~Fd~av~aIr~ak~ 160 (318) T TIGR03470 96 LVARKKFVYLCTNAL-LLEKKLDKFEPSPYLTFSVHLD-GLR-------------EHHDASVCREGVFDRAVEAIREAKA 160 (318) T ss_pred HHHCCCEEEEECCHH-HHHHHHHHHHHCCCCEEEEECC-CCH-------------HHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 997599799977552-0099999985188836999801-787-------------8866887179779999999999998 Q ss_pred CCCEEEEECHHHHH--HHHHHHHCCCCCCCCCCCCCEEECCCCCCEEEEEEEECCCCCHHHHHHHHHHHHCHHHHHHHHH Q ss_conf 79809996287899--8999985289846888722366574096114312552775545789999998719899999999 Q gi|254780180|r 261 RGGRIVSVGTTSLR--LLETATTEDGIINPWSGFTNIFITPGYRFRAVDILMSNFHLPKSTLLMLVSAFCGIEETKKMYQ 338 (360) Q Consensus 261 ~g~rIiAVGTT~~R--~LEs~~~~~~~~~~~~g~T~lfI~Pg~~f~~vd~LiTNFH~P~Stll~Lv~Af~G~~~~~~~Y~ 338 (360) .|=|| .|-||+.. ..+....--..+. --|--.+-|.|||.+.-+. .- ..|.+++.++++.+ T Consensus 161 ~G~~V-~iN~Tvf~~~n~~~i~~~~d~~~-~lgVdgi~isp~y~Ye~ap---------~q------~~fl~r~~~~~lfr 223 (318) T TIGR03470 161 RGFRV-TTNTTLFNDTDPEEVAEFFDYLT-DLGVDGMTISPGYAYEKAP---------DQ------DHFLGRRQTKKLFR 223 (318) T ss_pred CCCCE-EEEEEEECCCCHHHHHHHHHHHH-HCCCCEEEECCCCCCCCCC---------CC------CCCCCHHHHHHHHH T ss_conf 69946-79989706899999999999998-7699738976653102376---------31------11158999999999 Q ss_pred HHHHC Q ss_conf 99986 Q gi|254780180|r 339 HAISH 343 (360) Q Consensus 339 ~Ai~~ 343 (360) ..++. T Consensus 224 ~il~~ 228 (318) T TIGR03470 224 EVLSN 228 (318) T ss_pred HHHHH T ss_conf 99986 No 8 >PRK00635 excinuclease ABC subunit A; Provisional Probab=88.76 E-value=0.33 Score=29.70 Aligned_cols=10 Identities=30% Similarity=0.614 Sum_probs=4.3 Q ss_pred CCCEEEEECH Q ss_conf 7980999628 Q gi|254780180|r 261 RGGRIVSVGT 270 (360) Q Consensus 261 ~g~rIiAVGT 270 (360) .||+|++-|| T Consensus 557 ~GGevv~~Gt 566 (1809) T PRK00635 557 FGGEVLFNGS 566 (1809) T ss_pred CCCEEEEECC T ss_conf 7978999508 No 9 >PRK00349 uvrA excinuclease ABC subunit A; Reviewed Probab=88.66 E-value=0.33 Score=29.73 Aligned_cols=12 Identities=42% Similarity=0.839 Sum_probs=9.3 Q ss_pred CCCEEEEECHHH Q ss_conf 798099962878 Q gi|254780180|r 261 RGGRIVSVGTTS 272 (360) Q Consensus 261 ~g~rIiAVGTT~ 272 (360) .||+|++-||.. T Consensus 570 ~GGeiv~~Gt~~ 581 (944) T PRK00349 570 HGGEVVASGTPE 581 (944) T ss_pred CCCEEEEEECHH T ss_conf 788799974899 No 10 >COG0178 UvrA Excinuclease ATPase subunit [DNA replication, recombination, and repair] Probab=86.87 E-value=0.64 Score=27.64 Aligned_cols=11 Identities=55% Similarity=0.909 Sum_probs=8.1 Q ss_pred CCCEEEEECHH Q ss_conf 79809996287 Q gi|254780180|r 261 RGGRIVSVGTT 271 (360) Q Consensus 261 ~g~rIiAVGTT 271 (360) .||.|+|-||- T Consensus 562 ~GGeIv~~Gtp 572 (935) T COG0178 562 HGGEIVAEGTP 572 (935) T ss_pred CCCEEEECCCH T ss_conf 78789970589 No 11 >TIGR02409 carnitine_bodg gamma-butyrobetaine hydroxylase; InterPro: IPR012775 Members of this protein family are gamma-butyrobetaine hydroxylase, both bacterial and eukaryotic. This enzyme catalyzes the last step in the conversion of lysine to carnitine. Carnitine can serve as a compatible solvent in bacteria and also participates in fatty acid metabolism.; GO: 0005506 iron ion binding, 0016706 oxidoreductase activity acting on paired donors with incorporation or reduction of molecular oxygen 2-oxoglutarate as one donor and incorporation of one atom each of oxygen into both donors, 0045329 carnitine biosynthetic process. Probab=81.47 E-value=0.93 Score=26.50 Aligned_cols=20 Identities=35% Similarity=0.581 Sum_probs=16.8 Q ss_pred HCCCCCEEEEECCCEEEEEE Q ss_conf 26988799994795688899 Q gi|254780180|r 52 FLNSNDAIVFNNTKVITAQL 71 (360) Q Consensus 52 ~L~~gDLLV~NnTKVipARL 71 (360) -|+|||+++|-|+||+.||= T Consensus 334 kL~~Gd~~~~DN~R~lHaR~ 353 (384) T TIGR02409 334 KLEPGDLVLFDNTRVLHARD 353 (384) T ss_pred ECCCCCEEEEECCEEECCCC T ss_conf 81778889981730020031 No 12 >PRK05301 pyrroloquinoline quinone biosynthesis protein PqqE; Provisional Probab=76.84 E-value=2.8 Score=22.99 Aligned_cols=80 Identities=20% Similarity=0.236 Sum_probs=50.3 Q ss_pred CCCCCC-CCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCC-CHHEEECHHHHHHHHHHHHC Q ss_conf 456757-4005556887899999999972155303787316566578874101356677-00077859999999987537 Q gi|254780180|r 184 YAKIQG-SVAAPTAGLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMH-SEIGFIDAPTAQALNSVKSR 261 (360) Q Consensus 184 yA~~~G-SVAAPTAGLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH-~E~~~i~~~~~~~I~~ak~~ 261 (360) ||+..| .+..-|=|.-+|++.+++|++.|+....|.|+ |.. | .+.|.-.. .- -=+.+.+.|...++. T Consensus 86 ~a~~~G~~~~l~TNG~lit~~~a~~L~~~gl~~v~vSlD-g~~---~----e~hD~~rG~~G---~f~~~~~~i~~l~~~ 154 (375) T PRK05301 86 HARRLGLYTNLITSGVGLTEARLAALKAAGLDHIQLSFQ-DSD---P----ELADRIAGTRG---AFAQKLEVARLVKAH 154 (375) T ss_pred HHHHCCCEEEEEECCCCCCHHHHHHHHHHCCCEEEEECC-CCC---H----HHHHHHCCCCC---HHHHHHHHHHHHHHC T ss_conf 999769758996067455799999998509988999567-798---7----78777637886---299999999999974 Q ss_pred CCEEEEECHHHHHH Q ss_conf 98099962878998 Q gi|254780180|r 262 GGRIVSVGTTSLRL 275 (360) Q Consensus 262 g~rIiAVGTT~~R~ 275 (360) |-+ +.|-||++|. T Consensus 155 Gi~-v~i~~ti~r~ 167 (375) T PRK05301 155 GYP-LTLNAVIHRH 167 (375) T ss_pred CCC-EEEEEEECCC T ss_conf 981-6999872305 No 13 >pfam10987 DUF2806 Protein of unknown function (DUF2806). This bacterial family of proteins has no known function. Probab=74.05 E-value=2 Score=24.17 Aligned_cols=94 Identities=23% Similarity=0.241 Sum_probs=55.5 Q ss_pred CEEEEECHHHHHHHHHHHHCC-----------CCCCCCCCCCCEEECCCC-----------CCEEEEEEEECCCCCHHHH Q ss_conf 809996287899899998528-----------984688872236657409-----------6114312552775545789 Q gi|254780180|r 263 GRIVSVGTTSLRLLETATTED-----------GIINPWSGFTNIFITPGY-----------RFRAVDILMSNFHLPKSTL 320 (360) Q Consensus 263 ~rIiAVGTT~~R~LEs~~~~~-----------~~~~~~~g~T~lfI~Pg~-----------~f~~vd~LiTNFH~P~Stl 320 (360) +-|.-=||+|+|||....+-. +..-.+.+.++..|.=|| ++....-.+.||.+|=|++ T Consensus 69 ~Ei~~PGsfSlrtL~~Lk~mt~~eA~lf~ka~~l~~~~~~d~~~kll~g~~~~~~~~~l~~~~~~~~~~l~~~gl~ys~l 148 (223) T pfam10987 69 GEITNPGTFSLRTLQTLKSLTKREAQLFQKACSLAVKVGGDSSLKLLLGYYKQGGLFSLFRKSSTENLNLGQFGLPYSSL 148 (223) T ss_pred HHCCCCCCCCHHHHHHHHHCCHHHHHHHHHHHHHEEEECCCCCHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCCCHHHH T ss_conf 35079987478999999835999999999997570667884505565313204762001125663200301259984479 Q ss_pred HHHHHHHH--CHHHHHHHHHHHHHCCCEEECCCCHHHCCC Q ss_conf 99999871--989999999999986982611010222336 Q gi|254780180|r 321 LMLVSAFC--GIEETKKMYQHAISHAYRFYSYGDTSLLFQ 358 (360) Q Consensus 321 l~Lv~Af~--G~~~~~~~Y~~Ai~~~yrF~syGDamLi~~ 358 (360) +.|+.+=. +.+ .+.=....+..+.|+-+|-.|.+-+ T Consensus 149 l~L~elGLl~~~e--~es~~~~~~~~l~l~~~~~~~~l~~ 186 (223) T pfam10987 149 LTLQELGLLHSSE--LESGELPKDQYLPLSYSGKSLSLTP 186 (223) T ss_pred HHHHHCCCCCCCC--EEEEEECCCCCEEEEECCEEEEEEE T ss_conf 9999838863463--2455415898458998896799998 No 14 >COG0647 NagD Predicted sugar phosphatases of the HAD superfamily [Carbohydrate transport and metabolism] Probab=68.10 E-value=11 Score=18.90 Aligned_cols=25 Identities=12% Similarity=0.045 Sum_probs=14.0 Q ss_pred HHHHCCCEEEEECHHHHHHHHHHHH Q ss_conf 8753798099962878998999985 Q gi|254780180|r 257 SVKSRGGRIVSVGTTSLRLLETATT 281 (360) Q Consensus 257 ~ak~~g~rIiAVGTT~~R~LEs~~~ 281 (360) ......++++.||-...-=+..+.+ T Consensus 202 ~~~~~~~~~~mVGD~~~TDI~~a~~ 226 (269) T COG0647 202 KLGLDRSEVLMVGDRLDTDILGAKA 226 (269) T ss_pred HHCCCCCCEEEECCCCHHHHHHHHH T ss_conf 8179832089985882456999998 No 15 >COG4820 EutJ Ethanolamine utilization protein, possible chaperonin [Amino acid transport and metabolism] Probab=65.26 E-value=2.5 Score=23.39 Aligned_cols=82 Identities=18% Similarity=0.186 Sum_probs=58.1 Q ss_pred HHEEECHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHHHCCCCCCCCCCCCCEEEC------CCCCCEEEEEEEECCCCC Q ss_conf 00778599999999875379809996287899899998528984688872236657------409611431255277554 Q gi|254780180|r 243 EIGFIDAPTAQALNSVKSRGGRIVSVGTTSLRLLETATTEDGIINPWSGFTNIFIT------PGYRFRAVDILMSNFHLP 316 (360) Q Consensus 243 E~~~i~~~~~~~I~~ak~~g~rIiAVGTT~~R~LEs~~~~~~~~~~~~g~T~lfI~------Pg~~f~~vd~LiTNFH~P 316 (360) -+|-|+-|.++++++.++.+.-|+.|=+-|+--.-..... --..+|.+|+||- ||++=-.-..|=-|.|+| T Consensus 181 G~ygi~~EeAE~~Kr~~k~~~Eif~~v~PV~eKMAeIv~~---hie~~~i~dl~lvGGac~~~g~e~~Fe~~l~l~v~~P 257 (277) T COG4820 181 GNYGISLEEAEQYKRGHKKGEEIFPVVKPVYEKMAEIVAR---HIEGQGITDLWLVGGACMQPGVEELFEKQLALQVHLP 257 (277) T ss_pred CCCCCCHHHHHHHHHCCCCCHHCCCCHHHHHHHHHHHHHH---HHCCCCCCCEEEECCCCCCCCHHHHHHHHHCCCCCCC T ss_conf 3568677678876421445201120020799999999999---8445887605886363267027999998746411367 Q ss_pred HHHHHHHHHHH Q ss_conf 57899999987 Q gi|254780180|r 317 KSTLLMLVSAF 327 (360) Q Consensus 317 ~Stll~Lv~Af 327 (360) .-++.|---+. T Consensus 258 ~~p~y~TPLgI 268 (277) T COG4820 258 QHPLYMTPLGI 268 (277) T ss_pred CCCCEECHHHH T ss_conf 88402131452 No 16 >PRK11587 putative phosphatase; Provisional Probab=64.80 E-value=12 Score=18.46 Aligned_cols=70 Identities=19% Similarity=0.146 Sum_probs=49.0 Q ss_pred CHHHHHHHHHHCCCCEEEEE-----------ECCCCCCCC-CCCCCCCCCCCCCHH--------------EEECHHHHHH Q ss_conf 99999999972155303787-----------316566578-874101356677000--------------7785999999 Q gi|254780180|r 201 TSNLLSRLISIGIKVYFVTL-----------HVGAGTFMP-VKVEDTDDHIMHSEI--------------GFIDAPTAQA 254 (360) Q Consensus 201 t~~ll~~L~~kGi~~~~iTL-----------HVG~GTF~P-i~~~~i~~h~mH~E~--------------~~i~~~~~~~ 254 (360) -.++|+.|+++|+.++-+|= ..|+.-|.- |..+++..+|-|.|- +-+=+++..- T Consensus 88 ~~elL~~L~~~gi~~avvTs~~~~~~~~~~~~~~l~~~~~~v~~ddv~~~KP~Pd~yl~A~~~Lg~~p~~~lvieDS~~g 167 (218) T PRK11587 88 AIALLNHLNKAGIPWAIVTSGSMPVASARHKAAGLPAPEVFVTAERVKRGKPEPDAYLLGAQLLGLAPQECVVVEDAPAG 167 (218) T ss_pred HHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHCCCCHHHEEEEECCHHH T ss_conf 99999999877997788726827899999972389978699971105578998689999999829897778999477888 Q ss_pred HHHHHHCCCEEEEECH Q ss_conf 9987537980999628 Q gi|254780180|r 255 LNSVKSRGGRIVSVGT 270 (360) Q Consensus 255 I~~ak~~g~rIiAVGT 270 (360) |..|++.|-++|||.. T Consensus 168 i~AA~~AG~~~v~V~~ 183 (218) T PRK11587 168 VLSGLAAGCHVIAVNA 183 (218) T ss_pred HHHHHHCCCEEEEECC T ss_conf 9999986998999879 No 17 >pfam02668 TauD Taurine catabolism dioxygenase TauD, TfdA family. This family consists of taurine catabolism dioxygenases of the TauD, TfdA family. TauD from E. coli is a alpha-ketoglutarate-dependent taurine dioxygenase. This enzyme catalyses the oxygenolytic release of sulfite from taurine. TfdA from Burkholderia sp. is a 2,4-dichlorophenoxyacetic acid/alpha-ketoglutarate dioxygenase. TfdA from Alcaligenes eutrophus JMP134 is a 2,4-dichlorophenoxyacetate monooxygenase. Also included are gamma-Butyrobetaine hydroxylase enzymes EC:1.14.11.1. Probab=64.21 E-value=3.2 Score=22.63 Aligned_cols=22 Identities=14% Similarity=0.324 Sum_probs=13.4 Q ss_pred EEHHHHHHHCCCCCEEEEECCC Q ss_conf 0143478726988799994795 Q gi|254780180|r 44 HLVSDLPAFLNSNDAIVFNNTK 65 (360) Q Consensus 44 ~~F~dl~~~L~~gDLLV~NnTK 65 (360) ..+..|.+.|..--++||.|-. T Consensus 22 ~~~~~i~~~l~~~Gvvv~r~~~ 43 (253) T pfam02668 22 ATLAELREALAEHGVLLFRGQP 43 (253) T ss_pred HHHHHHHHHHHHHCEEEECCCC T ss_conf 9999999999970799989999 No 18 >TIGR02329 propionate_PrpR propionate catabolism operon regulatory protein PrpR; InterPro: IPR012704 At least five distinct pathways exist for the catabolism of propionate by way of propionyl-CoA. Members of this family represent the transcriptional regulatory protein PrpR which is, in most cases, divergently transcribed from the operon that encodes the genes involved in the methylcitric acid cycle of propionate catabolism. This protein is required for the expression of the proteins involved in this pathway . 2-methylcitric acid, an intermediate in this pathway, has been proposed to be a co-activator of PrpR .; GO: 0000156 two-component response regulator activity, 0003677 DNA binding, 0005524 ATP binding, 0000160 two-component signal transduction system (phosphorelay), 0019629 propionate catabolic process 2-methylcitrate cycle, 0005737 cytoplasm. Probab=62.50 E-value=2.4 Score=23.56 Aligned_cols=11 Identities=36% Similarity=0.884 Sum_probs=8.6 Q ss_pred HHHHHHCCCCC Q ss_conf 25442035785 Q gi|254780180|r 153 QISLVGTIPLP 163 (360) Q Consensus 153 ~L~~~G~iPLP 163 (360) +|+++|||||| T Consensus 427 FLDEIGEmPLP 437 (658) T TIGR02329 427 FLDEIGEMPLP 437 (658) T ss_pred EEHHHCCCCCH T ss_conf 00100357861 No 19 >PRK13224 consensus Probab=60.41 E-value=14 Score=18.00 Aligned_cols=119 Identities=17% Similarity=0.189 Sum_probs=71.0 Q ss_pred HHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEE-----------EECCCCCCCC--CCCCCCCCCCCCCHHE Q ss_conf 31013456757400555688789999999997215530378-----------7316566578--8741013566770007 Q gi|254780180|r 179 DYQTTYAKIQGSVAAPTAGLHFTSNLLSRLISIGIKVYFVT-----------LHVGAGTFMP--VKVEDTDDHIMHSEIG 245 (360) Q Consensus 179 ~YQTVyA~~~GSVAAPTAGLHFt~~ll~~L~~kGi~~~~iT-----------LHVG~GTF~P--i~~~~i~~h~mH~E~~ 245 (360) +|+.-|.+....-..|--|. .++|+.|+++|+.++=+| =|.|+.-|-. |..+++..+|-|.|-| T Consensus 72 ~~~~~y~~~~~~~~~~~pG~---~e~L~~L~~~g~~laivTn~~~~~~~~~L~~~gl~~~Fd~iv~~d~v~~~KP~Pe~~ 148 (216) T PRK13224 72 RFMAHYEADATALTRVYPGV---EAALRHLKAEGWRLGLCTNKPVGPTRAILAHFGLAELFDVVIGGDSLPQRKPDPAPL 148 (216) T ss_pred HHHHHHHHHCCCCCCCCCCH---HHHHHHHHHCCCCEEEEECCCHHHHHHHHHHCCCHHHCCEEECCCCCCCCCCCHHHH T ss_conf 99999997402348728379---999999997799668760893477888998739443247465446689999999999 Q ss_pred EECHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHHHCCCCCCCCCCCCCEEECCCCCCE-----EEEEEEECCCCCH Q ss_conf 78599999999875379809996287899899998528984688872236657409611-----4312552775545 Q gi|254780180|r 246 FIDAPTAQALNSVKSRGGRIVSVGTTSLRLLETATTEDGIINPWSGFTNIFITPGYRFR-----AVDILMSNFHLPK 317 (360) Q Consensus 246 ~i~~~~~~~I~~ak~~g~rIiAVGTT~~R~LEs~~~~~~~~~~~~g~T~lfI~Pg~~f~-----~vd~LiTNFH~P~ 317 (360) ..+++.++ -...+.||=| ..=+++|.+ .|-.-+...-||.-. -.++.|..|+... T Consensus 149 ---l~a~~~l~-----~~~~l~VGDs-~~D~~aA~~--------AGi~~i~v~~Gy~~~~~~~~~~~~~i~~~~el~ 208 (216) T PRK13224 149 ---RAAFAALG-----DEPVLYVGDS-EVDAETAEN--------AGVPFLLFTEGYRKAPVHELPHHGAFDDHAELP 208 (216) T ss_pred ---HHHHHHCC-----CCCEEEECCC-HHHHHHHHH--------HCCCEEEECCCCCCCCHHHCCCCCHHCCHHHHH T ss_conf ---99997379-----9998997598-878999999--------499889977797998967738462617999899 No 20 >PRK13679 hypothetical protein; Provisional Probab=59.99 E-value=8.4 Score=19.60 Aligned_cols=30 Identities=37% Similarity=0.497 Sum_probs=24.2 Q ss_pred HHHHHHHHHHCCCCEEEEEEC-CCCCCCCCC Q ss_conf 999999997215530378731-656657887 Q gi|254780180|r 202 SNLLSRLISIGIKVYFVTLHV-GAGTFMPVK 231 (360) Q Consensus 202 ~~ll~~L~~kGi~~~~iTLHV-G~GTF~Pi~ 231 (360) +++.+.|++---++..+++|+ |.|||.|+. T Consensus 49 ~~l~~~L~~~a~~~~pf~i~l~g~~sF~P~~ 79 (168) T PRK13679 49 DSIVEELRTIAKGTPPVTLHVTKASSFAPVN 79 (168) T ss_pred HHHHHHHHHHHHCCCCEEEEEEEEECCCCCC T ss_conf 9999999999850798899983472357888 No 21 >PRK10658 alpha-xylosidase YicI; Provisional Probab=58.82 E-value=9 Score=19.40 Aligned_cols=20 Identities=25% Similarity=0.346 Sum_probs=14.0 Q ss_pred CCCHHHHHHHH----HCCCCCHHH Q ss_conf 86300025442----035785222 Q gi|254780180|r 147 GIELERQISLV----GTIPLPPYI 166 (360) Q Consensus 147 ~~~~~~~L~~~----G~iPLPPYI 166 (360) +.+..+++++| |++||||+- T Consensus 244 GptpkdVl~~YT~LTGrp~lPP~W 267 (772) T PRK10658 244 GPTPKEVLERYTALTGRPALPPAW 267 (772) T ss_pred CCCHHHHHHHHHHHHCCCCCCCCH T ss_conf 999999999998876889889725 No 22 >cd01425 RPS2 Ribosomal protein S2 (RPS2), involved in formation of the translation initiation complex, where it might contact the messenger RNA and several components of the ribosome. It has been shown that in Escherichia coli RPS2 is essential for the binding of ribosomal protein S1 to the 30s ribosomal subunit. In humans, most likely in all vertebrates, and perhaps in all metazoans, the protein also functions as the 67 kDa laminin receptor (LAMR1 or 67LR), which is formed from a 37 kDa precursor, and is overexpressed in many tumors. 67LR is a cell surface receptor which interacts with a variety of ligands, laminin-1 and others. It is assumed that the ligand interactions are mediated via the conserved C-terminus, which becomes extracellular as the protein undergoes conformational changes which are not well understood. Specifically, a conserved palindromic motif, LMWWML, may participate in the interactions. 67LR plays essential roles in the adhesion of cells to the basement membrane an Probab=55.68 E-value=17 Score=17.38 Aligned_cols=59 Identities=17% Similarity=0.249 Sum_probs=29.9 Q ss_pred HHHHHHHHHHCCCEEEEECHH--HHHHHHHHHHCCCCCCCCCCCCCEEECCCCCCEEEEEEEECCCCCHHHHHH Q ss_conf 999999875379809996287--899899998528984688872236657409611431255277554578999 Q gi|254780180|r 251 TAQALNSVKSRGGRIVSVGTT--SLRLLETATTEDGIINPWSGFTNIFITPGYRFRAVDILMSNFHLPKSTLLM 322 (360) Q Consensus 251 ~~~~I~~ak~~g~rIiAVGTT--~~R~LEs~~~~~~~~~~~~g~T~lfI~Pg~~f~~vd~LiTNFH~P~Stll~ 322 (360) ++.-|.+....||.|+-|||- .-+..|.++...|. |-...+.+-|++|||-..+.++-- T Consensus 45 A~~~i~~~~~~~g~ILfVgtk~~~~~~v~~~a~~~~~-------------~yv~~rWigG~LTN~~~~~~~i~~ 105 (193) T cd01425 45 ALNFIANIAAKGGKILFVGTKPQAQRAVKKFAERTGS-------------FYVNGRWLGGTLTNWKTIRKSIKR 105 (193) T ss_pred HHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCC-------------EEECCEECCCCCCCHHHHHHHHHH T ss_conf 9999999997599599997578999999999998399-------------454485758772488888668998 No 23 >PRK07102 short chain dehydrogenase; Provisional Probab=55.54 E-value=17 Score=17.37 Aligned_cols=76 Identities=21% Similarity=0.173 Sum_probs=52.1 Q ss_pred CCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEEECHHHHHH Q ss_conf 68878999999999721553037873165665788741013566770007785999999998753798099962878998 Q gi|254780180|r 196 AGLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVSVGTTSLRL 275 (360) Q Consensus 196 AGLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiAVGTT~~R~ 275 (360) |=.+|++.|-.++..+||.+..|. =| +|+++-....++..-...-|++++++|.++-.+|++.+-++-- .|. T Consensus 154 al~~~~~sL~~El~~~gI~V~~v~--PG-----~v~T~m~~~~~~~~~~~~~pe~vA~~i~~ai~~~k~~i~~p~~-~~~ 225 (243) T PRK07102 154 ALTAFLSGLRNRLFKSGVHVLTVK--PG-----FVDTPMTAGLKLPGPLTAQPEEVAKDIFRAIEKGKDVLYTPWF-WRL 225 (243) T ss_pred HHHHHHHHHHHHHHCCCEEEEEEE--EC-----CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEECCHH-HHH T ss_conf 999999999998502091999997--18-----8967566689998877699999999999999769989985868-999 Q ss_pred HHHH Q ss_conf 9999 Q gi|254780180|r 276 LETA 279 (360) Q Consensus 276 LEs~ 279 (360) +-.. T Consensus 226 ~~~l 229 (243) T PRK07102 226 IMLI 229 (243) T ss_pred HHHH T ss_conf 9999 No 24 >pfam11253 DUF3052 Protein of unknown function (DUF3052). This family of proteins with unknown function appears to be restricted to Actinobacteria. Probab=55.54 E-value=3.7 Score=22.22 Aligned_cols=15 Identities=67% Similarity=0.842 Sum_probs=11.8 Q ss_pred CCCCCCCCCCCCHHH Q ss_conf 400555688789999 Q gi|254780180|r 190 SVAAPTAGLHFTSNL 204 (360) Q Consensus 190 SVAAPTAGLHFt~~l 204 (360) +=||||||||-|..+ T Consensus 96 ~eAA~tAGL~~Tss~ 110 (127) T pfam11253 96 AEAAPTAGLHQTSSV 110 (127) T ss_pred HHHHHHCCCCCCCCC T ss_conf 766541484012355 No 25 >PRK13227 consensus Probab=55.37 E-value=16 Score=17.55 Aligned_cols=74 Identities=16% Similarity=0.165 Sum_probs=43.8 Q ss_pred CHHHHHHHHHHCCCCEEEEE-----------ECCCCC-CCC-CCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEE Q ss_conf 99999999972155303787-----------316566-578-87410135667700077859999999987537980999 Q gi|254780180|r 201 TSNLLSRLISIGIKVYFVTL-----------HVGAGT-FMP-VKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVS 267 (360) Q Consensus 201 t~~ll~~L~~kGi~~~~iTL-----------HVG~GT-F~P-i~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiA 267 (360) ..++|+.|+++|+.++=+|= +.|+.. |.- |..+++...|-|+|-|. .+++..+ -....++. T Consensus 101 v~~~L~~Lk~~g~~~~ivTn~~~~~~~~~l~~~gl~~~Fd~iv~~ddv~~~KP~P~~~~---~~~~~l~---~~p~~~l~ 174 (228) T PRK13227 101 VKETLEALKAQGYPLALVTNKPTPFVAPLLEALGIADYFSVVIGGDSLPNKKPHPAPLL---LVCEKLG---IAPEQMLF 174 (228) T ss_pred HHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCHHHHH---HHHHHHC---CCCCEEEE T ss_conf 99999999987996578845706766678886093554344300333577799759999---9999819---69754999 Q ss_pred ECHHHHHHHHHHHH Q ss_conf 62878998999985 Q gi|254780180|r 268 VGTTSLRLLETATT 281 (360) Q Consensus 268 VGTT~~R~LEs~~~ 281 (360) ||=| ..=+++|.+ T Consensus 175 VGDs-~~Di~aA~~ 187 (228) T PRK13227 175 VGDS-RNDILAAKA 187 (228) T ss_pred ECCC-HHHHHHHHH T ss_conf 7778-889999999 No 26 >KOG3888 consensus Probab=53.70 E-value=7.5 Score=19.97 Aligned_cols=11 Identities=9% Similarity=0.226 Sum_probs=5.2 Q ss_pred CCCCCCEEEEE Q ss_conf 78120313566 Q gi|254780180|r 21 FPRDSARLMVV 31 (360) Q Consensus 21 ~~Rd~SRLLV~ 31 (360) +-++.++++.+ T Consensus 20 ~~~n~~~iv~v 30 (407) T KOG3888 20 AAANRSHIVQV 30 (407) T ss_pred HHCCCCCEEEE T ss_conf 41477644760 No 27 >cd03130 GATase1_CobB Type 1 glutamine amidotransferase (GATase1) domain found in Cobyrinic Acid a,c-Diamide Synthase. Type 1 glutamine amidotransferase (GATase1) domain found in Cobyrinic Acid a,c-Diamide Synthase. CobB plays a role in cobalamin biosythesis catalyzing the conversion of cobyrinic acid to cobyrinic acid a,c-diamide. CobB belongs to the triad family of amidotransferases. Two of the three residues of the catalytic triad that are involved in glutamine binding, hydrolysis and transfer of the resulting ammonia to the acceptor substrate in other triad aminodotransferases are conserved in CobB. Probab=53.27 E-value=19 Score=17.12 Aligned_cols=13 Identities=31% Similarity=0.496 Sum_probs=5.8 Q ss_pred CCCCCCCHHHHCC Q ss_conf 3588889778616 Q gi|254780180|r 5 EFDFDLPPSRIAL 17 (360) Q Consensus 5 dfDy~LP~elIAq 17 (360) .|.|..|+.+-+. T Consensus 8 AF~FyY~enl~~L 20 (198) T cd03130 8 AFNFYYPENLELL 20 (198) T ss_pred CCCEECHHHHHHH T ss_conf 5255579999999 No 28 >cd05126 Mth938 Mth938 domain. Mth938 is a hypothetical protein encoded by the Methanobacterium thermoautotrophicum (Mth) genome. This protein crystallizes as a dimer, although it is monomeric in solution, with one disulfide bond in each monomer. The function of the protein has not been determined. Probab=53.05 E-value=19 Score=17.09 Aligned_cols=43 Identities=19% Similarity=0.375 Sum_probs=29.8 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEEE Q ss_conf 87899999999972155303787316566578874101356677000778599999999875379809996 Q gi|254780180|r 198 LHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVSV 268 (360) Q Consensus 198 LHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiAV 268 (360) |.+++++.+.|+++||++. ..-++++++.-|.-..+|+||.|+ T Consensus 72 l~v~p~~~~~l~~~gi~v~----------------------------~~~T~~A~rtyN~L~~eGrrV~A~ 114 (117) T cd05126 72 LKVPPETVEKLEKRGVEVL----------------------------VLPTEEAVKRYNELAGKGRRVLAV 114 (117) T ss_pred CCCCHHHHHHHHHCCCCEE----------------------------EECHHHHHHHHHHHHHCCCCEEEE T ss_conf 6489999999997598189----------------------------837599999999998489713899 No 29 >cd01300 YtcJ_like YtcJ_like metal dependent amidohydrolases. YtcJ is a Bacillus subtilis ORF of unknown function. The Arabidopsis homolog LAF3 has been identified as a factor required for photochrome A signalling. Probab=52.43 E-value=10 Score=18.94 Aligned_cols=10 Identities=50% Similarity=0.690 Sum_probs=4.3 Q ss_pred CCCCCCCCCC Q ss_conf 7400555688 Q gi|254780180|r 189 GSVAAPTAGL 198 (360) Q Consensus 189 GSVAAPTAGL 198 (360) |++.+-||.+ T Consensus 267 G~~~~~ta~~ 276 (479) T cd01300 267 GSLGSRTAAL 276 (479) T ss_pred CCHHHHHHHH T ss_conf 8715546877 No 30 >COG2175 TauD Probable taurine catabolism dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism] Probab=51.06 E-value=12 Score=18.52 Aligned_cols=23 Identities=26% Similarity=0.585 Sum_probs=15.8 Q ss_pred CCCCCCCCCEEECCCCCCEEEEE Q ss_conf 46888722366574096114312 Q gi|254780180|r 286 INPWSGFTNIFITPGYRFRAVDI 308 (360) Q Consensus 286 ~~~~~g~T~lfI~Pg~~f~~vd~ 308 (360) ..|..|.-.|||.+||..++++- T Consensus 194 ~hP~~g~~~l~~n~~~~~~i~g~ 216 (286) T COG2175 194 THPETGEKALFVNEGFTTRIVGL 216 (286) T ss_pred ECCCCCCEEEEECCCCCEECCCC T ss_conf 67876742236447761313677 No 31 >cd05017 SIS_PGI_PMI_1 The members of this protein family contain the SIS (Sugar ISomerase) domain and have both the phosphoglucose isomerase (PGI) and the phosphomannose isomerase (PMI) functions. These functions catalyze the reversible reactions of glucose 6-phosphate to fructose 6-phosphate, and mannose 6-phosphate to fructose 6-phosphate, respectively at an equal rate. This protein contains two SIS domains. This alignment is based on the first SIS domain. Probab=49.51 E-value=21 Score=16.72 Aligned_cols=100 Identities=21% Similarity=0.252 Sum_probs=55.6 Q ss_pred HHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCC--CCCHHEEECHHHHHHHHHHHHCCCEEEEECHHHHHHHHHH Q ss_conf 99999999721553037873165665788741013566--7700077859999999987537980999628789989999 Q gi|254780180|r 202 SNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHI--MHSEIGFIDAPTAQALNSVKSRGGRIVSVGTTSLRLLETA 279 (360) Q Consensus 202 ~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~--mH~E~~~i~~~~~~~I~~ak~~g~rIiAVGTT~~R~LEs~ 279 (360) -++++.+-.... -..+..|-+ .+.|= .+.++. +-.-|--=++||.....+|+++|.+||||. +- =.|+.. T Consensus 14 gd~l~~~~~~~~-~vPv~v~rd--y~lP~---~v~~~~lvi~~S~SGnTeEtl~~~~~a~~~ga~vi~it-sG-G~L~~~ 85 (119) T cd05017 14 GDLLESLLLDEA-KIPVYVVKD--YTLPA---FVDRKTLVIAVSYSGNTEETLSAVEQAKERGAKIVAIT-SG-GKLLEM 85 (119) T ss_pred HHHHHHHHHCCC-CCCEEEECC--CCCCC---CCCCCCEEEEECCCCCCHHHHHHHHHHHHCCCEEEEEC-CC-CHHHHH T ss_conf 999999970579-988798489--98987---55878789999289796899999999998599099984-99-559999 Q ss_pred HHCCCCCCCCCCCCCEEECCCCCCEEEEEEEECCCCCHHHHHHHHHHHHC Q ss_conf 85289846888722366574096114312552775545789999998719 Q gi|254780180|r 280 TTEDGIINPWSGFTNIFITPGYRFRAVDILMSNFHLPKSTLLMLVSAFCG 329 (360) Q Consensus 280 ~~~~~~~~~~~g~T~lfI~Pg~~f~~vd~LiTNFH~P~Stll~Lv~Af~G 329 (360) +.+++. .....|+ -++||+.+=-+..+..+ T Consensus 86 a~~~~~--------p~v~iP~------------g~~PR~a~g~~~~~ll~ 115 (119) T cd05017 86 AREHGV--------PVIIIPK------------GLQPRAAFPYLFTALLN 115 (119) T ss_pred HHHCCC--------CEEECCC------------CCCCHHHHHHHHHHHHH T ss_conf 988799--------9898799------------99957999999999999 No 32 >PRK01077 cobyrinic acid a,c-diamide synthase; Validated Probab=49.40 E-value=21 Score=16.70 Aligned_cols=85 Identities=20% Similarity=0.248 Sum_probs=53.9 Q ss_pred CCCCCCCCCCCC-HHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCC-----------CCCCCHHEEECHHHHHHHHH Q ss_conf 400555688789-999999997215530378731656657887410135-----------66770007785999999998 Q gi|254780180|r 190 SVAAPTAGLHFT-SNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDD-----------HIMHSEIGFIDAPTAQALNS 257 (360) Q Consensus 190 SVAAPTAGLHFt-~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~-----------h~mH~E~~~i~~~~~~~I~~ 257 (360) +||--. ..+|. ++.++.|++.|+++.+ |.|++.+.+-+ ..+|.+.-.=.....+.|.+ T Consensus 249 ava~D~-AF~FyY~enl~~L~~~Gael~~---------fSPl~d~~lP~~~D~l~lpGGyPEl~a~~L~~n~~l~~~I~~ 318 (451) T PRK01077 249 AVARDA-AFSFYYPDNLELLRAAGAELVF---------FSPLADEALPDDCDALYLGGGYPELFAAELAANTSMRASIRA 318 (451) T ss_pred EEECCC-CCCCCCHHHHHHHHHCCCEEEE---------ECCCCCCCCCCCCCEEEECCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 994342-2475576679999978988999---------577678889866788995798502439999870899999999 Q ss_pred HHHCCCEEEEECHHHHHHHHHHHHCCC Q ss_conf 753798099962878998999985289 Q gi|254780180|r 258 VKSRGGRIVSVGTTSLRLLETATTEDG 284 (360) Q Consensus 258 ak~~g~rIiAVGTT~~R~LEs~~~~~~ 284 (360) +-++|++|+|+.==-+=--|+..+.+| T Consensus 319 ~~~~G~pi~aeCGG~~~Lg~~i~d~~G 345 (451) T PRK01077 319 AAAAGKPIYAECGGLMYLGESLEDADG 345 (451) T ss_pred HHHCCCCEEEECHHHHHHHHHCCCCCC T ss_conf 998699789987117887543277899 No 33 >TIGR00075 hypD hydrogenase expression/formation protein HypD; InterPro: IPR002780 HypD is involved in the hyp operon which is needed for the activity of the three hydrogenase isoenzymes in Escherichia coli. HypD is one of the genes needed for formation of these enzymes . This protein has been found in Gram-negative and Gram-positive bacteria and Archaea. HypD contains many possible metal binding residues, which may bind to nickel. Transposon Tn5 insertions into HypD resulted in Rhizobium leguminosarum mutants that lacked any hydrogenase activity in symbiosis with peas .; GO: 0046872 metal ion binding. Probab=48.47 E-value=12 Score=18.45 Aligned_cols=24 Identities=21% Similarity=0.299 Sum_probs=11.5 Q ss_pred EEEEECCCCCCCCEEEEEEH-HHHHHHCCCC Q ss_conf 13566378887531531014-3478726988 Q gi|254780180|r 27 RLMVVHPNLSCPLVISDHLV-SDLPAFLNSN 56 (360) Q Consensus 27 RLLV~~r~~~~~~~i~h~~F-~dl~~~L~~g 56 (360) |+|-+..+ -+|+.+ +.|-++||+| T Consensus 43 ~IMevCGg------HeHtI~KYGl~~LLPeN 67 (384) T TIGR00075 43 KIMEVCGG------HEHTIMKYGLRDLLPEN 67 (384) T ss_pred EEEEECCC------CCCCEEEECHHHHCCCC T ss_conf 78851379------73310352635628764 No 34 >PRK13226 phosphoglycolate phosphatase; Provisional Probab=47.47 E-value=23 Score=16.50 Aligned_cols=74 Identities=19% Similarity=0.130 Sum_probs=45.4 Q ss_pred CHHHHHHHHHHCCCCEEEEE-----------ECCCCCC-CC-CCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEE Q ss_conf 99999999972155303787-----------3165665-78-87410135667700077859999999987537980999 Q gi|254780180|r 201 TSNLLSRLISIGIKVYFVTL-----------HVGAGTF-MP-VKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVS 267 (360) Q Consensus 201 t~~ll~~L~~kGi~~~~iTL-----------HVG~GTF-~P-i~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiA 267 (360) ..++|++|+++|+.++-+|= +.|+.-+ .- |..+++..+|-|.|-|. .+++.++ -.....+. T Consensus 92 v~e~L~~L~~~g~~~~VvTnk~~~~~~~~l~~~gl~~~F~~iv~~ddv~~~KP~Pd~~l---~a~~~lg---v~p~e~v~ 165 (221) T PRK13226 92 VEMMLARLECAGCVWGIVTNKPEYLARLILPQLGWEQRCAVLIGGDTLAERKPHPLPLL---TAAERIG---VAPTQCVY 165 (221) T ss_pred HHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCCCCEEECCCCCCCCCCCHHHHH---HHHHHHC---CCHHHEEE T ss_conf 99999999977990898559818889999998398522330001444567788869999---9999949---89536888 Q ss_pred ECHHHHHHHHHHHH Q ss_conf 62878998999985 Q gi|254780180|r 268 VGTTSLRLLETATT 281 (360) Q Consensus 268 VGTT~~R~LEs~~~ 281 (360) ||=+. .=++++.. T Consensus 166 VGDs~-~Di~aA~a 178 (221) T PRK13226 166 VGDDE-RDILAARA 178 (221) T ss_pred ECCCH-HHHHHHHH T ss_conf 89998-79999999 No 35 >pfam09861 DUF2088 Uncharacterized conserved protein (DUF2088). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=46.46 E-value=17 Score=17.35 Aligned_cols=66 Identities=26% Similarity=0.319 Sum_probs=43.5 Q ss_pred CCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHH---HHHHHHCCCEEEEECHHH Q ss_conf 78999999999721553037873165665788741013566770007785999999---998753798099962878 Q gi|254780180|r 199 HFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQA---LNSVKSRGGRIVSVGTTS 272 (360) Q Consensus 199 HFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~---I~~ak~~g~rIiAVGTT~ 272 (360) -+=+.++++|.+.||.-..||+=++.||=+|-+.|.+.+- +.++.++. +|.--..-...+-+|||+ T Consensus 72 ~il~~ll~~L~~~Gv~~~~I~iv~A~G~Hr~~t~eE~~~~--------lG~~i~~~~~v~~Hd~~d~~~lv~lG~T~ 140 (203) T pfam09861 72 IILPLLLEELNAVGIPDEDITILVATGTHRPMTEEELKKI--------LGEEVVKRIRIVNHDADDPDELVYLGTTS 140 (203) T ss_pred HHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCHHHHHHH--------HHHHHHCCEEEEECCCCCHHHCEEECCCC T ss_conf 5599999999975998125899982687999998999998--------60886638689957999857837812079 No 36 >PRK06139 short chain dehydrogenase; Provisional Probab=45.28 E-value=25 Score=16.27 Aligned_cols=136 Identities=17% Similarity=0.084 Sum_probs=74.7 Q ss_pred HCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHH-CCCCEEEEEECCCCCCCCCCCCCCC------CCCCC-CHHEEECHHH Q ss_conf 10134567574005556887899999999972-1553037873165665788741013------56677-0007785999 Q gi|254780180|r 180 YQTTYAKIQGSVAAPTAGLHFTSNLLSRLISI-GIKVYFVTLHVGAGTFMPVKVEDTD------DHIMH-SEIGFIDAPT 251 (360) Q Consensus 180 YQTVyA~~~GSVAAPTAGLHFt~~ll~~L~~k-Gi~~~~iTLHVG~GTF~Pi~~~~i~------~h~mH-~E~~~i~~~~ 251 (360) +|+.|+--.. |=..||+.|-.+|.+. ||.+.-|.. | +|++.-.+ .+++. ...++=|+.+ T Consensus 151 ~~saY~ASK~------Av~gftesLr~EL~~~~gI~Vt~V~P--g-----~v~TP~~~~~~~~~~~~~~~~~p~~~pe~v 217 (324) T PRK06139 151 YAAAYSASKF------GLRGFSEALRAELTDFPDIHVCDVYP--A-----FVDTPGFRHGANYTGRRLTPPPPMYDPRRV 217 (324) T ss_pred CCCHHHHHHH------HHHHHHHHHHHHHCCCCCCEEEEEEC--C-----CCCCCCCHHHHHCCCCCCCCCCCCCCHHHH T ss_conf 9841989999------99999999999837998918999857--9-----958852014353378788999998799999 Q ss_pred HHHHHHHHHCCCEEEEECHHHHHHHHH--------------HHHC--CCCCCCCCCCCCEEECCCCCCEEEEEEEECCCC Q ss_conf 999998753798099962878998999--------------9852--898468887223665740961143125527755 Q gi|254780180|r 252 AQALNSVKSRGGRIVSVGTTSLRLLET--------------ATTE--DGIINPWSGFTNIFITPGYRFRAVDILMSNFHL 315 (360) Q Consensus 252 ~~~I~~ak~~g~rIiAVGTT~~R~LEs--------------~~~~--~~~~~~~~g~T~lfI~Pg~~f~~vd~LiTNFH~ 315 (360) +++|-.+-.+.+|-+-||.++.-+.=. .+.. ...-.+-...-|||=-+.-...+--+.-..=+. T Consensus 218 A~ai~~~~~~~~r~~~vG~~~~~~~~~~~~~P~l~~r~~~~~~~~~~~~~~~~~~~~~nl~~p~~~~~~~~g~~~~~~~~ 297 (324) T PRK06139 218 AKAMVRLADRPRNTTTVGTAARLMRLAHFLAPGLFARLMGRLTRRYLARAPRAARSSGNLFEPPSGAGGIDGGWRSRPRR 297 (324) T ss_pred HHHHHHHHHCCCCEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 99999998389972541868999999988682789999999999873179999999997357999998877986533445 Q ss_pred CHHHHHHHHHHHH Q ss_conf 4578999999871 Q gi|254780180|r 316 PKSTLLMLVSAFC 328 (360) Q Consensus 316 P~Stll~Lv~Af~ 328 (360) ....++.+..|++ T Consensus 298 ~~~~~~~~~~~~~ 310 (324) T PRK06139 298 STRPLLVIGAAVA 310 (324) T ss_pred CCCHHHHHHHHHH T ss_conf 6401899999999 No 37 >COG2100 Predicted Fe-S oxidoreductase [General function prediction only] Probab=44.01 E-value=13 Score=18.30 Aligned_cols=134 Identities=16% Similarity=0.145 Sum_probs=80.2 Q ss_pred CCCHHHHHHHHHCCCCCHHHCCCCCCCCCCHHHHCCCCC-CCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 863000254420357852221234565211433101345-6757400555688789999999997215530378731656 Q gi|254780180|r 147 GIELERQISLVGTIPLPPYIARKRPIDARDYVDYQTTYA-KIQGSVAAPTAGLHFTSNLLSRLISIGIKVYFVTLHVGAG 225 (360) Q Consensus 147 ~~~~~~~L~~~G~iPLPPYI~r~r~~~~~D~~~YQTVyA-~~~GSVAAPTAGLHFt~~ll~~L~~kGi~~~~iTLHVG~G 225 (360) +..++.-|+..|+.-|.|+|.+ -. |-.=. +.-+-|++-|-|--.++++.++|.+.|.....+.+|- = T Consensus 158 gkglEaHlDGqGEP~lYP~l~~--lV--------qalk~~~~v~vVSmQTng~~L~~~lv~eLeeAGLdRiNlSv~a--L 225 (414) T COG2100 158 GKGLEAHLDGQGEPLLYPHLVD--LV--------QALKEHKGVEVVSMQTNGVLLSKKLVDELEEAGLDRINLSVDA--L 225 (414) T ss_pred CCCEEEEECCCCCCCCCHHHHH--HH--------HHHHCCCCCEEEEEEECCEECCHHHHHHHHHHCCCEEEEECCC--C T ss_conf 7872787537888754533999--99--------9974389842899850764445999999997087558862023--7 Q ss_pred CCCCCCCCCCCCCCCCCHHEEECH--HHHHHHHHHHHCCCEEEE----EC---HHHHHHHHHHHHCCCCCCCCCCCCCEE Q ss_conf 657887410135667700077859--999999987537980999----62---878998999985289846888722366 Q gi|254780180|r 226 TFMPVKVEDTDDHIMHSEIGFIDA--PTAQALNSVKSRGGRIVS----VG---TTSLRLLETATTEDGIINPWSGFTNIF 296 (360) Q Consensus 226 TF~Pi~~~~i~~h~mH~E~~~i~~--~~~~~I~~ak~~g~rIiA----VG---TT~~R~LEs~~~~~~~~~~~~g~T~lf 296 (360) .+...+--|-.++|.|.. ++++.|.+++-. =+|| -| --..+.+|.|.. .| ++.+|+-+- T Consensus 226 ------Dpk~Ak~L~G~~dYdv~kvle~aE~i~~a~id--vlIaPv~lPG~ND~E~~~iIe~A~~-iG---aGkk~p~lg 293 (414) T COG2100 226 ------DPKLAKMLAGRKDYDVKKVLEVAEYIANAGID--VLIAPVWLPGVNDDEMPKIIEWARE-IG---AGKKWPPLG 293 (414) T ss_pred ------CHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCC--EEEEEEECCCCCHHHHHHHHHHHHH-HC---CCCCCCCCC T ss_conf ------98898774284011789999999999867988--8983144278681778999999998-48---887799853 Q ss_pred --ECCCCCCE Q ss_conf --57409611 Q gi|254780180|r 297 --ITPGYRFR 304 (360) Q Consensus 297 --I~Pg~~f~ 304 (360) -|-+|+|. T Consensus 294 iQkyipyk~G 303 (414) T COG2100 294 IQKYIPYKFG 303 (414) T ss_pred EEEEEEECCC T ss_conf 0775540206 No 38 >PRK12311 rpsB 30S ribosomal protein S2; Provisional Probab=42.80 E-value=27 Score=16.01 Aligned_cols=15 Identities=13% Similarity=0.310 Sum_probs=5.3 Q ss_pred HHHHHHHCCCEEEEE Q ss_conf 999875379809996 Q gi|254780180|r 254 ALNSVKSRGGRIVSV 268 (360) Q Consensus 254 ~I~~ak~~g~rIiAV 268 (360) +|++|++-|=+|||| T Consensus 174 AV~EA~kLgIPvIaI 188 (332) T PRK12311 174 AIQEAQRLGIPVAAI 188 (332) T ss_pred HHHHHHHCCCCEEEE T ss_conf 999999809798999 No 39 >cd01994 Alpha_ANH_like_IV This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins is predicted to bind ATP. This domainhas a strongly conserved motif SGGKD at the N terminus. Probab=42.73 E-value=27 Score=16.00 Aligned_cols=31 Identities=16% Similarity=0.246 Sum_probs=13.5 Q ss_pred CCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCC Q ss_conf 88789999999997215530378731656657887410 Q gi|254780180|r 197 GLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVED 234 (360) Q Consensus 197 GLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~ 234 (360) |==|++...+.++.. .-..|+--+.|+...+ T Consensus 95 GdI~s~~qr~~~e~~-------c~~lgl~~~~PLW~~~ 125 (194) T cd01994 95 GAILSEYQRTRVERV-------CERLGLEPLAPLWGRD 125 (194) T ss_pred CCCCCHHHHHHHHHH-------HHHCCCEEECHHCCCC T ss_conf 963328899999999-------9973988870010799 No 40 >pfam08004 DUF1699 Protein of unknown function (DUF1699). This family contains many archaeal proteins which have very conserved sequences. Probab=41.97 E-value=20 Score=16.94 Aligned_cols=79 Identities=22% Similarity=0.225 Sum_probs=45.8 Q ss_pred EEEECCCCCCHHHHHHHHHC---CCCCHHHCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCE Q ss_conf 99972678630002544203---578522212345652114331013456757400555688789999999997215530 Q gi|254780180|r 140 LLVFSISGIELERQISLVGT---IPLPPYIARKRPIDARDYVDYQTTYAKIQGSVAAPTAGLHFTSNLLSRLISIGIKVY 216 (360) Q Consensus 140 ~i~f~~~~~~~~~~L~~~G~---iPLPPYI~r~r~~~~~D~~~YQTVyA~~~GSVAAPTAGLHFt~~ll~~L~~kGi~~~ 216 (360) -+.|-.+..|+..+.+..-. +-||+ ..|+||= +.+=-=|..+||+.. T Consensus 23 HLAFRpSNkDif~Lv~tCP~ie~iQlP~-------------Sy~~tvS-----------------ksi~mfLemq~I~Li 72 (131) T pfam08004 23 HLAFRPSNKDIFNLVETCPKLEAIQLPK-------------SYYRTVS-----------------KSIEMFLEMQGIQLI 72 (131) T ss_pred EEEECCCCHHHHHHHHHCCCCEEEECCH-------------HHHHHHH-----------------HHHHHHHHHHCEEEE T ss_conf 7654687503999986397654886788-------------9999999-----------------999999987170466 Q ss_pred EEEEECCCCCCCCCCCCCCCCC-CCCCHHEEECHHHHHHHHHHHHCCC Q ss_conf 3787316566578874101356-6770007785999999998753798 Q gi|254780180|r 217 FVTLHVGAGTFMPVKVEDTDDH-IMHSEIGFIDAPTAQALNSVKSRGG 263 (360) Q Consensus 217 ~iTLHVG~GTF~Pi~~~~i~~h-~mH~E~~~i~~~~~~~I~~ak~~g~ 263 (360) -| |+--| +=-.|||.||+.+++.|+.-+.+|. T Consensus 73 -----eG----------DVwGHrkDinEYy~i~~~viekI~el~~eG~ 105 (131) T pfam08004 73 -----EG----------DVWGHRKDINEYYTVSQSVIERIRELKAEGI 105 (131) T ss_pred -----EC----------CCCCCCCCCHHHCCCCHHHHHHHHHHHHCCC T ss_conf -----32----------4422212333040147999999999997699 No 41 >cd06555 ASCH_PF0470_like ASC-1 homology domain, subfamily similar to Pyrococcus furiosus Pf0470. The ASCH domain, a small beta-barrel domain found in all three kingdoms of life, resembles the RNA-binding PUA domain and may also interact with RNA. ASCH has been proposed to function as an RNA-binding domain during coactivation, RNA-processing and the regulation of prokaryotic translation. Probab=41.71 E-value=28 Score=15.90 Aligned_cols=71 Identities=23% Similarity=0.232 Sum_probs=42.4 Q ss_pred CCCCCCCEEEECCCCCCCCEEEEEEECCCCCCEEEEECCCCCCHHHHHHHHHCCCCCHHHCCCCCCCCCCHHHHCCCCCC Q ss_conf 64369857997057887406999960355772499972678630002544203578522212345652114331013456 Q gi|254780180|r 107 KVIKKGDILHFFSQDGQSRLEATVIDKWNTGEILLVFSISGIELERQISLVGTIPLPPYIARKRPIDARDYVDYQTTYAK 186 (360) Q Consensus 107 kklk~G~~l~~~~~~~~~~l~~~vi~~~~~g~~~i~f~~~~~~~~~~L~~~G~iPLPPYI~r~r~~~~~D~~~YQTVyA~ 186 (360) +++++||+|.|...++...+.++|++. -+| .++.++|+..|--=+=|=+. .-+...+.|...|.+ T Consensus 30 ~~ik~GD~I~F~~~~~~~~l~v~V~~v-------~~Y----~sF~ell~~e~l~~~~p~~~----s~eeg~~~~~~~Y~~ 94 (109) T cd06555 30 QQIKVGDKILFNDLDTGQQLLVKVVDI-------RKY----DSFRELLEEEGLEKVGPGVD----SIEEGVKDTYKIYSK 94 (109) T ss_pred HHCCCCCEEEEEECCCCCEEEEEEEEE-------EEC----CCHHHHHHHCCHHHCCCCCC----CHHHHHHHHHHHCCH T ss_conf 718999999999879987899999899-------973----99999998729876389999----799999999987899 Q ss_pred C----CCCCC Q ss_conf 7----57400 Q gi|254780180|r 187 I----QGSVA 192 (360) Q Consensus 187 ~----~GSVA 192 (360) . .|-|| T Consensus 95 e~e~~~GVvA 104 (109) T cd06555 95 EQEKKYGVLA 104 (109) T ss_pred HHHHHCCEEE T ss_conf 7987558899 No 42 >TIGR02410 carnitine_TMLD trimethyllysine dioxygenase; InterPro: IPR012776 Members of this family with known function act as trimethyllysine dioxygenase, an enzyme in the pathway for carnitine biosynthesis from lysine. This enzyme is homologous to gamma-butyrobetaine,2-oxoglutarate dioxygenase, which catalyzes the last step in carnitine biosynthesis. Members of this family appear to be eukaryotic only.; GO: 0005506 iron ion binding, 0031418 L-ascorbic acid binding, 0050353 trimethyllysine dioxygenase activity, 0045329 carnitine biosynthetic process. Probab=41.16 E-value=16 Score=17.70 Aligned_cols=17 Identities=18% Similarity=0.546 Sum_probs=5.7 Q ss_pred CCCCEEEEECCCEEEEE Q ss_conf 98879999479568889 Q gi|254780180|r 54 NSNDAIVFNNTKVITAQ 70 (360) Q Consensus 54 ~~gDLLV~NnTKVipAR 70 (360) +||.+|||-|=||+-|| T Consensus 353 ~PG~~lifDNWRvLH~R 369 (397) T TIGR02410 353 RPGTVLIFDNWRVLHSR 369 (397) T ss_pred CCCEEEEEECCEEECCC T ss_conf 37728997076002164 No 43 >PRK07024 short chain dehydrogenase; Provisional Probab=40.42 E-value=29 Score=15.76 Aligned_cols=72 Identities=15% Similarity=0.164 Sum_probs=46.0 Q ss_pred CCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEE-CHHHHHHHHHHHHCCCEEEEECHHHHH Q ss_conf 6887899999999972155303787316566578874101356677000778-599999999875379809996287899 Q gi|254780180|r 196 AGLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFI-DAPTAQALNSVKSRGGRIVSVGTTSLR 274 (360) Q Consensus 196 AGLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i-~~~~~~~I~~ak~~g~rIiAVGTT~~R 274 (360) |=.+|++.|-.+|+..||.+.. ++-| +|+++-. ++.-....+.+ ++++++.|.++-.+|++.+.++. ..| T Consensus 156 al~~~~esL~~el~~~gI~V~~--i~PG-----~v~T~m~-~~~~~~~p~~~~pe~vA~~i~~ai~~~~~~~~~p~-~~~ 226 (256) T PRK07024 156 AAIKYLESLRVELRPAGVRVVT--IAPG-----YIRTPMT-AHNPYPMPFLMDADRFAARAARAIARGRSFRVIPW-QMG 226 (256) T ss_pred HHHHHHHHHHHHHCCCCEEEEE--EEEC-----CCCCCCC-CCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEECCH-HHH T ss_conf 9999999999986577948999--9718-----9958877-77999998768999999999999975998898786-899 Q ss_pred HH Q ss_conf 89 Q gi|254780180|r 275 LL 276 (360) Q Consensus 275 ~L 276 (360) .+ T Consensus 227 ~~ 228 (256) T PRK07024 227 VV 228 (256) T ss_pred HH T ss_conf 99 No 44 >smart00684 DM15 Tandem repeat in fly CG14066 (La related protein), human KIAA0731 and worm R144.7. Unknown function. Probab=40.38 E-value=5.2 Score=21.09 Aligned_cols=21 Identities=38% Similarity=0.776 Sum_probs=17.0 Q ss_pred HHHHHHHHHHHHCCCEEECCC Q ss_conf 999999999998698261101 Q gi|254780180|r 331 EETKKMYQHAISHAYRFYSYG 351 (360) Q Consensus 331 ~~~~~~Y~~Ai~~~yrF~syG 351 (360) +.-.+.|.+-++.=|||+||| T Consensus 16 ed~~~~~ryglecLfRf~Syg 36 (39) T smart00684 16 EDRKSLGRYELNCLYRFWSYG 36 (39) T ss_pred HHHHHCCCHHHHHHHHHHHCC T ss_conf 989865750267869887314 No 45 >COG0300 DltE Short-chain dehydrogenases of various substrate specificities [General function prediction only] Probab=40.37 E-value=29 Score=15.76 Aligned_cols=68 Identities=26% Similarity=0.292 Sum_probs=48.1 Q ss_pred CCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCC-C-----CCCCCHHEEECHHHHHHHHHHHHCCCEEEEEC Q ss_conf 68878999999999721553037873165665788741013-5-----66770007785999999998753798099962 Q gi|254780180|r 196 AGLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTD-D-----HIMHSEIGFIDAPTAQALNSVKSRGGRIVSVG 269 (360) Q Consensus 196 AGLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~-~-----h~mH~E~~~i~~~~~~~I~~ak~~g~rIiAVG 269 (360) +=++||+.|-.+|+..||.+..|. .| |++++=.+ + ...-.+++.=++++++.+..+-..++|.+=+| T Consensus 162 ~v~~fSeaL~~EL~~~gV~V~~v~--PG-----~~~T~f~~~~~~~~~~~~~~~~~~~~~~va~~~~~~l~~~k~~ii~~ 234 (265) T COG0300 162 FVLSFSEALREELKGTGVKVTAVC--PG-----PTRTEFFDAKGSDVYLLSPGELVLSPEDVAEAALKALEKGKREIIPG 234 (265) T ss_pred HHHHHHHHHHHHHCCCCEEEEEEE--CC-----CCCCCCCCCCCCCCCCCCCHHHCCCHHHHHHHHHHHHHCCCCEEECC T ss_conf 999999999998358984999996--57-----33355333444432112321230699999999999985098368237 Q ss_pred H Q ss_conf 8 Q gi|254780180|r 270 T 270 (360) Q Consensus 270 T 270 (360) . T Consensus 235 ~ 235 (265) T COG0300 235 L 235 (265) T ss_pred H T ss_conf 0 No 46 >pfam01113 DapB_N Dihydrodipicolinate reductase, N-terminus. Dihydrodipicolinate reductase (DapB) reduces the alpha,beta-unsaturated cyclic imine, dihydro-dipicolinate. This reaction is the second committed step in the biosynthesis of L-lysine and its precursor meso-diaminopimelate, which are critical for both protein and cell wall biosynthesis. The N-terminal domain of DapB binds the dinucleotide NADPH. Probab=40.36 E-value=21 Score=16.81 Aligned_cols=27 Identities=26% Similarity=0.270 Sum_probs=13.9 Q ss_pred EEECHHHHHHHHHHHHCCCEEEEECHHH Q ss_conf 7785999999998753798099962878 Q gi|254780180|r 245 GFIDAPTAQALNSVKSRGGRIVSVGTTS 272 (360) Q Consensus 245 ~~i~~~~~~~I~~ak~~g~rIiAVGTT~ 272 (360) |..|+.+.+.+..+.+.|-+++ ||||= T Consensus 72 FS~p~~~~~~~~~~~~~~~~~V-iGTTG 98 (122) T pfam01113 72 FTTPEATLENLELALKHGKPLV-IGTTG 98 (122) T ss_pred ECCHHHHHHHHHHHHHCCCCEE-EECCC T ss_conf 0687899999999996799889-98999 No 47 >PRK06101 short chain dehydrogenase; Provisional Probab=38.05 E-value=31 Score=15.51 Aligned_cols=66 Identities=23% Similarity=0.160 Sum_probs=46.7 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEEECH Q ss_conf 8789999999997215530378731656657887410135667700077859999999987537980999628 Q gi|254780180|r 198 LHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVSVGT 270 (360) Q Consensus 198 LHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiAVGT 270 (360) ..||+.|-.+|...||.+..|. =| +|+++=.++.....-...=++++++.|.++-.+|++.|.++- T Consensus 149 ~~~~~sLa~el~~~gI~V~~V~--PG-----~v~T~m~~~~~~~~p~~~~~e~~A~~i~~~i~~~k~~i~~P~ 214 (241) T PRK06101 149 SYFARTLALDLKKKGIKVVTVF--PG-----FVATPLTDKNTFAMPMIITVEQASQAIRKQLAAGKSHIYFPA 214 (241) T ss_pred HHHHHHHHHHHCCCCCEEEEEE--EC-----CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEECCH T ss_conf 9999999998525495899997--18-----993888778998898757999999999999974997999698 No 48 >PRK13288 pyrophosphatase PpaX; Provisional Probab=37.54 E-value=32 Score=15.46 Aligned_cols=80 Identities=18% Similarity=0.136 Sum_probs=50.9 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCCCEEEEE-----------ECCCCC-CCC-CCCCCCCCCCCCCHHEEECHHHHHHHHHH Q ss_conf 05556887899999999972155303787-----------316566-578-87410135667700077859999999987 Q gi|254780180|r 192 AAPTAGLHFTSNLLSRLISIGIKVYFVTL-----------HVGAGT-FMP-VKVEDTDDHIMHSEIGFIDAPTAQALNSV 258 (360) Q Consensus 192 AAPTAGLHFt~~ll~~L~~kGi~~~~iTL-----------HVG~GT-F~P-i~~~~i~~h~mH~E~~~i~~~~~~~I~~a 258 (360) ..|-.| ..++|+.|+++|+.++-+|= +.|+.- |.. |..+++...|.|+|.|.- +++.+ T Consensus 81 ~~~~~g---v~e~L~~L~~~g~~l~ivTn~~~~~~~~~l~~~gl~~~Fd~iv~~ddv~~~KP~P~~~~~---a~~~l--- 151 (214) T PRK13288 81 VEEYET---VYETLQTLKKQGYKLGIVTTKARDTVEMGLKLTGLDKFFDVVVTLDDVEHAKPDPEPVQK---ALELL--- 151 (214) T ss_pred CCCCCC---HHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHCCCCCCCEEEECCCCCCCCCCHHHHHH---HHHHH--- T ss_conf 863747---999999999779926775357569999999972221443157861334456987999999---99993--- Q ss_pred HHCCCEEEEECHHHHHHHHHHHH Q ss_conf 53798099962878998999985 Q gi|254780180|r 259 KSRGGRIVSVGTTSLRLLETATT 281 (360) Q Consensus 259 k~~g~rIiAVGTT~~R~LEs~~~ 281 (360) ...-..++.||=+. .-+++|.+ T Consensus 152 ~~~p~e~l~VGDs~-~Di~aA~~ 173 (214) T PRK13288 152 GAKPEEALMVGDNY-HDILAGKN 173 (214) T ss_pred CCCCCCEEEEECCH-HHHHHHHH T ss_conf 95978689996898-99999999 No 49 >cd03215 ABC_Carb_Monos_II This family represents domain II of the carbohydrate uptake proteins that transport only monosaccharides (Monos). The Carb_Monos family is involved in the uptake of monosaccharides, such as pentoses (such as xylose, arabinose, and ribose) and hexoses (such as xylose, arabinose, and ribose), that cannot be broken down to simple sugars by hydrolysis. In members of Carb_Monos family the single hydrophobic gene product forms a homodimer, while the ABC protein represents a fusion of two nucleotide-binding domains. However, it is assumed that two copies of the ABC domains are present in the assembled transporter. Probab=36.97 E-value=21 Score=16.75 Aligned_cols=31 Identities=23% Similarity=0.430 Sum_probs=22.4 Q ss_pred CCCCCCCCC-----HHHHHHHHHHCCCCEEEEEECC Q ss_conf 555688789-----9999999972155303787316 Q gi|254780180|r 193 APTAGLHFT-----SNLLSRLISIGIKVYFVTLHVG 223 (360) Q Consensus 193 APTAGLHFt-----~~ll~~L~~kGi~~~~iTLHVG 223 (360) =|||||-.. ++++.+++++|..+..+|.+.. T Consensus 130 EPT~gLD~~~~~~i~~~i~~l~~~g~tvi~isHdl~ 165 (182) T cd03215 130 EPTRGVDVGAKAEIYRLIRELADAGKAVLLISSELD 165 (182) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCHH T ss_conf 875458999999999999999978999999968799 No 50 >TIGR03351 PhnX-like phosphonatase-like hydrolase. This clade of sequences are the closest homologs to the PhnX enzyme, phosphonoacetaldehyde (Pald) hydrolase (phosphonatase, TIGR01422). This phosphonatase-like enzyme and PhnX itself are members of the haloacid dehalogenase (HAD) superfamily (pfam00702) having a a number of distinctive features that set them apart from typical HAD enzymes. The typical HAD N-terminal motif DxDx(T/V) here is DxAGT and the usual conserved lysine prior to the C-terminal motif is instead an arginine. Also distinctive of phosphonatase, and particular to its bi-catalytic mechanism is a conserved lysine in the variable "cap" domain. This lysine forms a Schiff base with the aldehyde of phosphonoacetaldehyde, providing, through the resulting positive charge, a polarization of the C-P bond necesary for cleavage as well as a route to the initial product of cleavage, an ene-amine. The conservation of these elements in this phosphonatase-like enzyme suggests that the Probab=36.71 E-value=33 Score=15.37 Aligned_cols=43 Identities=26% Similarity=0.281 Sum_probs=31.0 Q ss_pred HHHHHHHHHHCCCCEEEE----------E-ECC---CCCCCC-CCCCCCCCCCCCCHH Q ss_conf 999999997215530378----------7-316---566578-874101356677000 Q gi|254780180|r 202 SNLLSRLISIGIKVYFVT----------L-HVG---AGTFMP-VKVEDTDDHIMHSEI 244 (360) Q Consensus 202 ~~ll~~L~~kGi~~~~iT----------L-HVG---~GTF~P-i~~~~i~~h~mH~E~ 244 (360) .++|+.|+++|++++-+| | +.| ...|.. |..+++..+|-+.|- T Consensus 93 ~elL~~L~~~gi~~av~T~~~~~~~~~~l~~~g~~~~~~fd~vv~~~dv~~~KP~Pd~ 150 (220) T TIGR03351 93 EEAFRSLRSSGIKVALTTGFDRDTAERLLEKLGWTVGDDVDAVVCPSDVAAGRPAPDL 150 (220) T ss_pred HHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHCCCHHHHCCCCCCCCCCCCCCCCHHH T ss_conf 9999999987997766359977999999998276404322404563435667987899 No 51 >pfam04687 Microvir_H Microvirus H protein (pilot protein). A single molecule of H protein is found on each of the 12 spikes on the microvirus shell. H is involved in the ejection of the phage DNA, and at least one copy is injected into the host's periplasmic space along with the ssDNA viral genome. Part of H is thought to lie outside the shell, where it recognizes lipopolysaccharide from virus-sensitive strains. Part of H may lie within the capsid, since mutations in H can influence the DNA ejection mechanism by affecting the DNA-protein interactions. H may span the capsid through the hydrophilic channels formed by G proteins. Probab=36.23 E-value=17 Score=17.39 Aligned_cols=20 Identities=35% Similarity=0.419 Sum_probs=17.9 Q ss_pred CCCCCCCCCCHHHHHHHHHH Q ss_conf 05556887899999999972 Q gi|254780180|r 192 AAPTAGLHFTSNLLSRLISI 211 (360) Q Consensus 192 AAPTAGLHFt~~ll~~L~~k 211 (360) -|-|||-|||.+-|..|..| T Consensus 218 qAQtaGQ~fTNdQI~eltrK 237 (310) T pfam04687 218 QAQTAGQYFTNDQIKELTRK 237 (310) T ss_pred HHHHHHCCCCHHHHHHHHHH T ss_conf 88875201536899999999 No 52 >COG0752 GlyQ Glycyl-tRNA synthetase, alpha subunit [Translation, ribosomal structure and biogenesis] Probab=36.20 E-value=12 Score=18.53 Aligned_cols=27 Identities=26% Similarity=0.639 Sum_probs=19.1 Q ss_pred EEEEECCCCCCCHH-HHHHCCCCCCCCC Q ss_conf 89981124866101-1221156436985 Q gi|254780180|r 87 SCTLHMRVSPNSWS-VYARPSKVIKKGD 113 (360) Q Consensus 87 eill~~~~~~~~w~-~l~kp~kklk~G~ 113 (360) -..+++.+.+.-|. +.+.|++|.+.|. T Consensus 40 PaTfLralGpePw~aAYVqPSRRP~DGR 67 (298) T COG0752 40 PATFLRALGPEPWNAAYVQPSRRPTDGR 67 (298) T ss_pred HHHHHHHCCCCCCCEEEECCCCCCCCCC T ss_conf 6889976099764100105677998887 No 53 >smart00095 TR_THY Transthyretin. Probab=35.75 E-value=34 Score=15.26 Aligned_cols=67 Identities=16% Similarity=0.354 Sum_probs=39.3 Q ss_pred CEEEEEEECCCCCCCHHHHHHCCCCCCCCCEEEECCCCCCCCEEEEEEECCCCCCEEEEECCCCCCHHHHHHHHHCCCCC Q ss_conf 50589981124866101122115643698579970578874069999603557724999726786300025442035785 Q gi|254780180|r 84 KEISCTLHMRVSPNSWSVYARPSKVIKKGDILHFFSQDGQSRLEATVIDKWNTGEILLVFSISGIELERQISLVGTIPLP 163 (360) Q Consensus 84 ~~ieill~~~~~~~~w~~l~kp~kklk~G~~l~~~~~~~~~~l~~~vi~~~~~g~~~i~f~~~~~~~~~~L~~~G~iPLP 163 (360) +.|.+.+.+...++.|..+.. +..=..|..=.+.. -+....|.+.+.|. ..+++...|.-|.. T Consensus 20 agv~V~L~~~~~~~~w~~ia~-~~Tn~DGR~~~l~~-----------~~~~~~G~YrL~F~-----t~~Yf~~~g~~~Fy 82 (121) T smart00095 20 VNVAVKVFKKTEEGTWEPFAS-GKTNESGEIHELTT-----------DEKFVEGLYKVEFD-----TKSYWKALGISPFH 82 (121) T ss_pred CCCEEEEEEECCCCCEEEEEE-EEECCCCCCCCCCC-----------CCCCCCEEEEEEEE-----HHHHHHHCCCCCCC T ss_conf 898999999879997799788-76389988567677-----------14467654999998-----17968764999899 Q ss_pred HHHC Q ss_conf 2221 Q gi|254780180|r 164 PYIA 167 (360) Q Consensus 164 PYI~ 167 (360) |||. T Consensus 83 P~V~ 86 (121) T smart00095 83 EYAD 86 (121) T ss_pred CCEE T ss_conf 8578 No 54 >cd03224 ABC_TM1139_LivF_branched LivF (TM1139) is part of the LIV-I bacterial ABC-type two-component transport system that imports neutral, branched-chain amino acids. The E. coli branched-chain amino acid transporter comprises a heterodimer of ABC transporters (LivF and LivG), a heterodimer of six-helix TM domains (LivM and LivH), and one of two alternative soluble periplasmic substrate binding proteins (LivK or LivJ). ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. Probab=35.52 E-value=17 Score=17.41 Aligned_cols=55 Identities=22% Similarity=0.387 Sum_probs=34.3 Q ss_pred CCCCCCC-----CCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEE Q ss_conf 5556887-----89999999997215530378731656657887410135667700077859999999987537980999 Q gi|254780180|r 193 APTAGLH-----FTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVS 267 (360) Q Consensus 193 APTAGLH-----Ft~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiA 267 (360) =|||||- .-.+++.+|++.|+.+..+|.+...- ..+.|+ +.| -.+|||+| T Consensus 158 EPt~gLD~~~~~~i~~~l~~l~~~g~tii~vtHdl~~~-------~~~~Dr------v~v------------l~~G~iv~ 212 (222) T cd03224 158 EPSEGLAPKIVEEIFEAIRELRDEGVTILLVEQNARFA-------LEIADR------AYV------------LERGRVVL 212 (222) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-------HHHCCE------EEE------------EECCEEEE T ss_conf 86547999999999999999995699999990858999-------996999------999------------97999999 Q ss_pred ECHHH Q ss_conf 62878 Q gi|254780180|r 268 VGTTS 272 (360) Q Consensus 268 VGTT~ 272 (360) -||.. T Consensus 213 ~G~~~ 217 (222) T cd03224 213 EGTAA 217 (222) T ss_pred ECCHH T ss_conf 85899 No 55 >cd02759 MopB_Acetylene-hydratase The MopB_Acetylene-hydratase CD contains acetylene hydratase (Ahy) and other related proteins. The acetylene hydratase of Pelobacter acetylenicus is a tungsten iron-sulfur protein involved in the fermentation of acetylene to ethanol and acetate. Members of this CD belong to the molybdopterin_binding (MopB) superfamily of proteins. Probab=35.44 E-value=34 Score=15.23 Aligned_cols=23 Identities=13% Similarity=0.040 Sum_probs=11.1 Q ss_pred HHEEECHHHHHHHHHHHHCCCEE Q ss_conf 00778599999999875379809 Q gi|254780180|r 243 EIGFIDAPTAQALNSVKSRGGRI 265 (360) Q Consensus 243 E~~~i~~~~~~~I~~ak~~g~rI 265 (360) |-.-||++++..+-+.-++.++. T Consensus 264 ~itGv~~~~I~~~A~~~a~~~~~ 286 (477) T cd02759 264 EITGVPAEKIRKAARLYATAKPA 286 (477) T ss_pred HHHCCCHHHHHHHHHHHHHCCCE T ss_conf 66399899999999999717978 No 56 >KOG4132 consensus Probab=35.34 E-value=34 Score=15.22 Aligned_cols=19 Identities=21% Similarity=0.354 Sum_probs=16.2 Q ss_pred CCCEEEEECHHHHHHHHHH Q ss_conf 7980999628789989999 Q gi|254780180|r 261 RGGRIVSVGTTSLRLLETA 279 (360) Q Consensus 261 ~g~rIiAVGTT~~R~LEs~ 279 (360) ...|++|||-|+-|+||.. T Consensus 213 ~~~k~aaIGPtT~kaL~~~ 231 (260) T KOG4132 213 DHLKLAAIGPTTRKALEDL 231 (260) T ss_pred HHEEEEEECCCHHHHHHHC T ss_conf 0205777576268899975 No 57 >pfam00318 Ribosomal_S2 Ribosomal protein S2. Probab=35.14 E-value=35 Score=15.20 Aligned_cols=60 Identities=20% Similarity=0.347 Sum_probs=33.4 Q ss_pred HHHHHHHHHHCCCEEEEECHH--HHHHHHHHHHCCCCCCCCCCCCCEEECCCCCCEEEEEEEECCCCCHHHHHHH Q ss_conf 999999875379809996287--8998999985289846888722366574096114312552775545789999 Q gi|254780180|r 251 TAQALNSVKSRGGRIVSVGTT--SLRLLETATTEDGIINPWSGFTNIFITPGYRFRAVDILMSNFHLPKSTLLML 323 (360) Q Consensus 251 ~~~~I~~ak~~g~rIiAVGTT--~~R~LEs~~~~~~~~~~~~g~T~lfI~Pg~~f~~vd~LiTNFH~P~Stll~L 323 (360) ++.-|...-.+|++|+-|||- .-+.++.++...|.. | -..+.+.|++|||-.=+.++--| T Consensus 45 A~~~i~~i~~~~~~iLfVgTk~~~~~~v~~~a~~~~~~---------y----v~~rWigG~LTN~~~i~~~~~~l 106 (205) T pfam00318 45 AANFIKEIAAKGGKILFVGTKKQAQEAVKKFAKRTGQF---------Y----VNGRWLGGTLTNWKTIKKSIKKL 106 (205) T ss_pred HHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHCCC---------C----CCCCCCCCCCCCHHHHHHHHHHH T ss_conf 99999999966982899977789999999999983997---------4----31643387236699998888777 No 58 >COG5623 CLP1 Predicted GTPase subunit of the pre-mRNA cleavage complex [Translation, ribosomal structure and biogenesis] Probab=34.89 E-value=35 Score=15.17 Aligned_cols=16 Identities=31% Similarity=0.380 Sum_probs=8.0 Q ss_pred CCCCCEEEEECCCEEE Q ss_conf 6988799994795688 Q gi|254780180|r 53 LNSNDAIVFNNTKVIT 68 (360) Q Consensus 53 L~~gDLLV~NnTKVip 68 (360) |--++---|-|+|..- T Consensus 38 La~~~~Y~F~n~K~~I 53 (424) T COG5623 38 LANERWYAFRNTKTFI 53 (424) T ss_pred HCCCCCEEEEEEEEEE T ss_conf 3058852346400338 No 59 >TIGR01277 thiQ thiamine ABC transporter, ATP-binding protein; InterPro: IPR005968 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Thiamine pyrophosphate (TPP) is a required cofactor synthesized de novo in Salmonella typhimurium. The primary role for TPP is in central metabolism as an electron carrier and nucleophile for such enzymes as pyruvate dehydrogenase (1.2.4.1 from EC), acetolactate synthase (4.1.3.18 from EC), and alpha-ketoglutarate dehydrogenase (1.2.4.2 from EC). Despite its importance in cellular physiology, neither the de novo biosynthetic pathway nor the salvage systems for thiamine are fully understood in any organism. The model describes thiamine ABC transporter, ATP-binding protein, believed to be involved in the specific translocation of thiamine and its phosphoesters across the inner membrane The protein belongs to the larger ABC transport system which consists of at least three components: the inner membrane permease; thiamine binding protein and an ATP-binding subunit. This protein is found so far only in Proteobacteria, and is found in complete genomes only if the ThiB and ThiP subunits are also found. It has been experimentally demonstrated that mutants in the various steps in the de novo synthesis of thiamine and its biologically active form, namely thiamine pyrophosphate can be exogenously supplemented with thiamine, thiamine monophosphate or thiamine pyrophosphate. ; GO: 0005524 ATP binding, 0042626 ATPase activity coupled to transmembrane movement of substances, 0006810 transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=34.71 E-value=20 Score=16.91 Aligned_cols=16 Identities=31% Similarity=0.557 Sum_probs=9.0 Q ss_pred HHHHCCCCCHHHCCCCCCC Q ss_conf 4420357852221234565 Q gi|254780180|r 155 SLVGTIPLPPYIARKRPID 173 (360) Q Consensus 155 ~~~G~iPLPPYI~r~r~~~ 173 (360) +..-++-.|||= ||.+ T Consensus 58 nd~~~t~~aPy~---RPvS 73 (213) T TIGR01277 58 NDKDHTRLAPYR---RPVS 73 (213) T ss_pred CCCCCCCCCCCC---CCCC T ss_conf 780122688877---7750 No 60 >COG0436 Aspartate/tyrosine/aromatic aminotransferase [Amino acid transport and metabolism] Probab=34.69 E-value=35 Score=15.15 Aligned_cols=24 Identities=21% Similarity=0.299 Sum_probs=14.9 Q ss_pred EEEEHHHHHHHCCCCC-EEEEECCC Q ss_conf 3101434787269887-99994795 Q gi|254780180|r 42 SDHLVSDLPAFLNSND-AIVFNNTK 65 (360) Q Consensus 42 ~h~~F~dl~~~L~~gD-LLV~NnTK 65 (360) ++..+.-+.-+++||| +||.+=.- T Consensus 99 ~~al~~a~~al~~pGDeVlip~P~Y 123 (393) T COG0436 99 KEALFLAFLALLNPGDEVLIPDPGY 123 (393) T ss_pred HHHHHHHHHHHCCCCCEEEECCCCC T ss_conf 9999999999718998899838998 No 61 >PRK09189 uroporphyrinogen-III synthase; Validated Probab=34.36 E-value=36 Score=15.11 Aligned_cols=79 Identities=11% Similarity=0.098 Sum_probs=33.3 Q ss_pred CCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCC-CC----CCCCCCCCHHEEECHHHHHHHHHHH--------HCCC Q ss_conf 887899999999972155303787316566578874-10----1356677000778599999999875--------3798 Q gi|254780180|r 197 GLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKV-ED----TDDHIMHSEIGFIDAPTAQALNSVK--------SRGG 263 (360) Q Consensus 197 GLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~-~~----i~~h~mH~E~~~i~~~~~~~I~~ak--------~~g~ 263 (360) |=+=.+.|-+.|++.|+.+..+.+ |-|-.+-.. +. +.+..+.. -..-|..+++...+.- ..+- T Consensus 123 G~~~~~~l~~~L~~~G~~v~~~~~---Y~~~~~~~~~~~~~~~l~~~~~d~-vl~~S~~ta~~f~~l~~~~~l~~~~~~~ 198 (237) T PRK09189 123 GRPRQPVFEDRLAAAGISFRVAEC---YDTLPVMYSPATLSAILGGAPFDA-VLLYSRVAAASFFEAMRLSIAPPADEKT 198 (237) T ss_pred CCCCCHHHHHHHHHCCCEEEEEEE---EEEECCCCCCHHHHHHHHCCCCCE-EEEECHHHHHHHHHHHHHCCCHHHHHCC T ss_conf 875404679999878986999988---754248889799999996499888-9998989999999998761453688589 Q ss_pred EEEEECHHHHHHHHHH Q ss_conf 0999628789989999 Q gi|254780180|r 264 RIVSVGTTSLRLLETA 279 (360) Q Consensus 264 rIiAVGTT~~R~LEs~ 279 (360) +++|+|-.+..+|.+. T Consensus 199 ~~~ciS~~vA~~l~~~ 214 (237) T PRK09189 199 RFLCISARVAQALPSG 214 (237) T ss_pred CEEEECHHHHHHHHHC T ss_conf 7999899999999866 No 62 >PRK05855 short chain dehydrogenase; Validated Probab=34.16 E-value=17 Score=17.33 Aligned_cols=80 Identities=21% Similarity=0.232 Sum_probs=45.4 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEEE---CCC---CCCCCCCCCCCCCCCCC-CHHE----EECHHHHHHHHHHHHCCCEEE Q ss_conf 878999999999721553037873---165---66578874101356677-0007----785999999998753798099 Q gi|254780180|r 198 LHFTSNLLSRLISIGIKVYFVTLH---VGA---GTFMPVKVEDTDDHIMH-SEIG----FIDAPTAQALNSVKSRGGRIV 266 (360) Q Consensus 198 LHFt~~ll~~L~~kGi~~~~iTLH---VG~---GTF~Pi~~~~i~~h~mH-~E~~----~i~~~~~~~I~~ak~~g~rIi 266 (360) +.||+.|-.+|...||.+..|.=- .++ +.|.-...++....+.- ...| .=|++++++|-++-.+++++| T Consensus 473 ~gftesLr~ELa~~GI~V~aVcPG~I~T~I~~~a~~~g~~~~~~~~~~~~~~~~~~~~~~~Pe~vA~~Il~aV~rnr~~v 552 (582) T PRK05855 473 LMLSECLRAELAEAGIGVTAICPGFVDTNIVATTRFAGADAEDEARRRKRADKLYARRGYGPEKVAKAIVSAVKRNKAVV 552 (582) T ss_pred HHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCEE T ss_conf 99999999985302977999931846467555566478760256778887766654059999999999999985599889 Q ss_pred EECHHHHHHHHH Q ss_conf 962878998999 Q gi|254780180|r 267 SVGTTSLRLLET 278 (360) Q Consensus 267 AVGTT~~R~LEs 278 (360) -||... |.+.. T Consensus 553 ~Vg~~A-r~~~~ 563 (582) T PRK05855 553 PVGPEA-HALYG 563 (582) T ss_pred EECHHH-HHHHH T ss_conf 868999-99999 No 63 >cd03271 ABC_UvrA_II The excision repair protein UvrA domain II; Nucleotide excision repair in eubacteria is a process that repairs DNA damage by the removal of a 12-13-mer oligonucleotide containing the lesion. Recognition and cleavage of the damaged DNA is a multistep ATP-dependent reaction that requires the UvrA, UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases, with UvrA having two ATP binding sites, which have the characteristic signature of the family of ABC proteins and UvrB having one ATP binding site that is structurally related to that of helicases. Probab=33.20 E-value=31 Score=15.59 Aligned_cols=32 Identities=34% Similarity=0.644 Sum_probs=23.3 Q ss_pred CCCCCCCCCCH-----HHHHHHHHHCCCCEEEEEECC Q ss_conf 05556887899-----999999972155303787316 Q gi|254780180|r 192 AAPTAGLHFTS-----NLLSRLISIGIKVYFVTLHVG 223 (360) Q Consensus 192 AAPTAGLHFt~-----~ll~~L~~kGi~~~~iTLHVG 223 (360) ==|||||.... ++|++|+++|..+..||.... T Consensus 197 DEPTs~LD~~~~~~l~~~l~~l~~~G~Tvi~itH~l~ 233 (261) T cd03271 197 DEPTTGLHFHDVKKLLEVLQRLVDKGNTVVVIEHNLD 233 (261) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEECHH T ss_conf 4863459989999999999999978999999847788 No 64 >COG1797 CobB Cobyrinic acid a,c-diamide synthase [Coenzyme metabolism] Probab=32.71 E-value=38 Score=14.93 Aligned_cols=198 Identities=17% Similarity=0.246 Sum_probs=102.7 Q ss_pred EEHHHHHHHCCCCCEEEEECCCE---EEEEEEEEEECCCCCCCCEEEEEEECCCCCCCHHHHHHCCCCCCCC-CEEEECC Q ss_conf 01434787269887999947956---8889987640467777650589981124866101122115643698-5799705 Q gi|254780180|r 44 HLVSDLPAFLNSNDAIVFNNTKV---ITAQLNGVRFCHINRREKEISCTLHMRVSPNSWSVYARPSKVIKKG-DILHFFS 119 (360) Q Consensus 44 ~~F~dl~~~L~~gDLLV~NnTKV---ipARL~g~k~~~~~~~g~~ieill~~~~~~~~w~~l~kp~kklk~G-~~l~~~~ 119 (360) ..-.++...|.--=+||+|-+.. +-|-+.|.+.. ....+|.=.++.+....+-.-++|.+=.-..| .++-+.. T Consensus 101 gSTA~lAk~l~~PVvLVid~~~~s~S~AAiv~G~~~f---dp~v~iaGVIlNrVgserH~~llr~Ale~~~gv~vlG~lp 177 (451) T COG1797 101 GSTADLAKLLGAPVVLVVDASGLSRSVAAIVKGFKHF---DPDVNIAGVILNRVGSERHYELLRDALEEYTGVPVLGYLP 177 (451) T ss_pred CCHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHHC---CCCCCEEEEEEECCCCHHHHHHHHHHHHHCCCCCEEEEEC T ss_conf 7799999985999899995752257899999889861---9988257899724777889999998755327985798742 Q ss_pred CCCCCCEEEEEEECCCCCCEEEEE--------------CCCCCCHHHHHHHHHCC-CCCHHHCCCCCCCCCCHHHHCCCC Q ss_conf 788740699996035577249997--------------26786300025442035-785222123456521143310134 Q gi|254780180|r 120 QDGQSRLEATVIDKWNTGEILLVF--------------SISGIELERQISLVGTI-PLPPYIARKRPIDARDYVDYQTTY 184 (360) Q Consensus 120 ~~~~~~l~~~vi~~~~~g~~~i~f--------------~~~~~~~~~~L~~~G~i-PLPPYI~r~r~~~~~D~~~YQTVy 184 (360) .+.. ++ +-++ --|.....- -....|++.+++--.+- |++|=.. ..+|+.- T Consensus 178 r~~~--l~--lp~R-HLGLV~a~E~~~~~~~~~~~a~~v~~~vDld~l~~ia~~~~~~~~~~~----------~~~~~~~ 242 (451) T COG1797 178 RDDD--LE--LPSR-HLGLVPASERLELEAKLEALAEVVEKHVDLDALLEIASSAGPLEPDLS----------PEPERGN 242 (451) T ss_pred CCCC--CC--CCCC-CCCCCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCCCC----------CCCCCCC T ss_conf 7855--67--8541-326534303444899999999999860789889998743688898855----------3200168 Q ss_pred CC-CCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCC-----------CCCCCHHEEECHHHH Q ss_conf 56-757400555688789999999997215530378731656657887410135-----------667700077859999 Q gi|254780180|r 185 AK-IQGSVAAPTAGLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDD-----------HIMHSEIGFIDAPTA 252 (360) Q Consensus 185 A~-~~GSVAAPTAGLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~-----------h~mH~E~~~i~~~~~ 252 (360) -. .-=+||.=-|==-.=++.++-|++.|.++.| |.|++.+.+-+ -.+|.|.-+=.+... T Consensus 243 ~~~~rIAVA~D~AF~FyY~~nl~~Lr~~GAelv~---------FSPL~D~~lP~~~D~vYlgGGYPElfA~~L~~n~~~~ 313 (451) T COG1797 243 PLGVRIAVARDAAFNFYYPENLELLREAGAELVF---------FSPLADEELPPDVDAVYLGGGYPELFAEELSANESMR 313 (451) T ss_pred CCCCEEEEEECCHHCCCCHHHHHHHHHCCCEEEE---------ECCCCCCCCCCCCCEEEECCCCHHHHHHHHHHCHHHH T ss_conf 7673699985451000329999999977977998---------5786777799778889968977488999984479999 Q ss_pred HHHHHHHHCCCEEEEE Q ss_conf 9999875379809996 Q gi|254780180|r 253 QALNSVKSRGGRIVSV 268 (360) Q Consensus 253 ~~I~~ak~~g~rIiAV 268 (360) ++|++.-+.|.+|.|= T Consensus 314 ~~i~~~~~~G~piyaE 329 (451) T COG1797 314 RAIKAFAAAGKPIYAE 329 (451) T ss_pred HHHHHHHHCCCCEEEE T ss_conf 9999998769945885 No 65 >TIGR01120 rpiB ribose 5-phosphate isomerase B; InterPro: IPR004785 Ribose 5-phosphate isomerase (5.3.1.6 from EC) forms a homodimer and catalyses the interconversion of D-ribose 5-phosphate and D-ribulose 5-phosphate in the non-oxidative branch of the pentose phosphate pathway. This reaction permits the synthesis of ribose from other sugars, as well as the recycling of sugars from nucleotide breakdown. Two unrelated enzymes can catalyse this reaction: RpiA (found in most organisms) and RpiB (found in some bacteria and eukaryotes). RpiB is also involved in metabolism of the rare sugar, allose, in addition to ribose sugars. The structures of RpiA and RpiB are distinct, RpiB having a Rossmann-type alpha/beta/alpha sandwich topology . This entry represents the RpiB enzyme.; GO: 0004751 ribose-5-phosphate isomerase activity, 0006098 pentose-phosphate shunt. Probab=32.58 E-value=38 Score=14.92 Aligned_cols=60 Identities=27% Similarity=0.323 Sum_probs=44.6 Q ss_pred CCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEEECHH Q ss_conf 56887899999999972155303787316566578874101356677000778599999999875379809996287 Q gi|254780180|r 195 TAGLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVSVGTT 271 (360) Q Consensus 195 TAGLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiAVGTT 271 (360) =||||.-+++-+-|+++|+++..+ ||+.--+++.+. --++.+++|..-+.++| |+-.||= T Consensus 8 HaGf~Lke~ik~fL~~~g~kvid~------GT~~~ertdYP~----------YAk~V~~av~~g~~~~G-ILiCgTG 67 (143) T TIGR01120 8 HAGFILKEKIKEFLEDKGFKVIDV------GTLASERTDYPD----------YAKEVVRAVLEGKAERG-ILICGTG 67 (143) T ss_pred CCCCHHHHHHHHHHCCCCEEEEEE------CCCCCCCCCCCH----------HHHHHHHHHHCCCCCCC-EEEEECC T ss_conf 656234788999861398579875------378867636606----------89999999744876676-6885053 No 66 >PHA00369 H minor spike protein Probab=32.44 E-value=23 Score=16.44 Aligned_cols=20 Identities=40% Similarity=0.556 Sum_probs=18.0 Q ss_pred CCCCCCCCCCHHHHHHHHHH Q ss_conf 05556887899999999972 Q gi|254780180|r 192 AAPTAGLHFTSNLLSRLISI 211 (360) Q Consensus 192 AAPTAGLHFt~~ll~~L~~k 211 (360) -|-|+|-|||++-+..|..| T Consensus 230 qAQ~~GQh~TN~Qi~~l~rK 249 (337) T PHA00369 230 QAQTAGQHFTNDQIKELTRK 249 (337) T ss_pred HHHHHHCCCCHHHHHHHHHH T ss_conf 88763022536899999998 No 67 >PRK05299 rpsB 30S ribosomal protein S2; Provisional Probab=32.40 E-value=38 Score=14.89 Aligned_cols=13 Identities=46% Similarity=0.833 Sum_probs=6.8 Q ss_pred HHCCCEEEEECHH Q ss_conf 5379809996287 Q gi|254780180|r 259 KSRGGRIVSVGTT 271 (360) Q Consensus 259 k~~g~rIiAVGTT 271 (360) -.+||.|+-|||- T Consensus 61 ~~~gg~iLFVgTk 73 (255) T PRK05299 61 AANGGKILFVGTK 73 (255) T ss_pred HHCCCCEEEEECC T ss_conf 9769936999665 No 68 >PRK13355 bifunctional HTH-domain containing protein/aminotransferase; Provisional Probab=32.14 E-value=33 Score=15.37 Aligned_cols=16 Identities=25% Similarity=0.343 Sum_probs=9.8 Q ss_pred CCCCCCCHHH----HCCCCC Q ss_conf 3588889778----616998 Q gi|254780180|r 5 EFDFDLPPSR----IALRPV 20 (360) Q Consensus 5 dfDy~LP~el----IAq~P~ 20 (360) -|.|+-|+|. |.+-|. T Consensus 158 ~fgf~~p~~~~~~~~~~~~~ 177 (518) T PRK13355 158 PFGFRTPDEVVYDMAHQLPD 177 (518) T ss_pred CCCCCCCHHHHHHHHHHCCC T ss_conf 66888948999999973723 No 69 >pfam07755 DUF1611 Protein of unknown function (DUF1611). This region is found in a number of hypothetical bacterial and archaeal proteins. The region is approximately 350 residues long. A member of this family is thought to associate with another subunit to form an H+-transporting ATPase, but no evidence has been found to support this. Probab=31.32 E-value=40 Score=14.77 Aligned_cols=47 Identities=36% Similarity=0.540 Sum_probs=21.1 Q ss_pred CCCEEEEECH-------HHHHHHHHHHHCCCCCC--CCCCCCCEEECCCCCCEEEEEE Q ss_conf 7980999628-------78998999985289846--8887223665740961143125 Q gi|254780180|r 261 RGGRIVSVGT-------TSLRLLETATTEDGIIN--PWSGFTNIFITPGYRFRAVDIL 309 (360) Q Consensus 261 ~g~rIiAVGT-------T~~R~LEs~~~~~~~~~--~~~g~T~lfI~Pg~~f~~vd~L 309 (360) ++.||..||| |+.+.|+-+....|.-. -..|.|-|.| -|+-| .+|++ T Consensus 112 ~~~rvl~vGTDcavGK~tTal~l~~~l~~~Gi~a~fiaTGQTGili-~g~Gv-~iDav 167 (302) T pfam07755 112 KAKRVLTVGTDCAVGKMTTALELERALRERGLNAAFVATGQTGILI-AGYGV-PLDAV 167 (302) T ss_pred CCCEEEEEECCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCEEEE-ECCEE-EECCC T ss_conf 9878999605733407899999999999779984799727613688-15247-84322 No 70 >PRK08622 galactose-6-phosphate isomerase subunit LacB; Reviewed Probab=31.31 E-value=27 Score=15.95 Aligned_cols=26 Identities=15% Similarity=0.001 Sum_probs=15.3 Q ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCE Q ss_conf 00555688789999999997215530 Q gi|254780180|r 191 VAAPTAGLHFTSNLLSRLISIGIKVY 216 (360) Q Consensus 191 VAAPTAGLHFt~~ll~~L~~kGi~~~ 216 (360) +++=-||..+-+.|.+.|+++|+++. T Consensus 5 IgsDHaG~eLKe~l~~~L~~~G~eV~ 30 (171) T PRK08622 5 IGCDHIVTDEKMAVSDFLKSKGHEVI 30 (171) T ss_pred EEECCHHHHHHHHHHHHHHHCCCEEE T ss_conf 98486069999999999997899899 No 71 >cd05821 TLP_Transthyretin Transthyretin (TTR) is a 55 kDa protein responsible for the transport of thyroid hormones and retinol in vertebrates. TTR distributes the two thyroid hormones T3 (3,5,3'-triiodo-L-thyronine) and T4 (Thyroxin, or 3,5,3',5'-tetraiodo-L-thyronine), as well as retinol (vitamin A) through the formation of a macromolecular complex that includes each of these as well as retinol-binding protein. Misfolded forms of TTR are implicated in the amyloid diseases familial amyloidotic polyneuropathy and senile systemic amyloidosis. TTR forms a homotetramer with each subunit consisting of eight beta-strands arranged in two sheets and a short alpha-helix. The central channel of the tetramer contains two independent binding sites, each located between a pair of subunits, which differ in their ligand binding affinity. A negative cooperativity has been observed for the binding of T4 and other TTR ligands. A fraction of plasma TTR is carried in high density lipoproteins by bindi Probab=31.25 E-value=40 Score=14.76 Aligned_cols=65 Identities=17% Similarity=0.385 Sum_probs=36.8 Q ss_pred EEEEEECCCCCCCHHHHHHCCCCCCCCCEEEECCCCCCCCEEEEEEECCCCCCEEEEECCCCCCHHHHHHHHHCCCCCHH Q ss_conf 58998112486610112211564369857997057887406999960355772499972678630002544203578522 Q gi|254780180|r 86 ISCTLHMRVSPNSWSVYARPSKVIKKGDILHFFSQDGQSRLEATVIDKWNTGEILLVFSISGIELERQISLVGTIPLPPY 165 (360) Q Consensus 86 ieill~~~~~~~~w~~l~kp~kklk~G~~l~~~~~~~~~~l~~~vi~~~~~g~~~i~f~~~~~~~~~~L~~~G~iPLPPY 165 (360) |.+.|.+...++.|..+. .+..=..|.+=.+.. -.....|.+.++|. ..+++.+.|.-|..|+ T Consensus 25 v~V~L~~~~~~~~w~~i~-~~~Tn~DGR~~~l~~-----------~~~~~~G~Y~L~F~-----~~~Yf~~~g~~~Fyp~ 87 (121) T cd05821 25 VAVKVFKKTADGSWEPFA-SGKTTETGEIHGLTT-----------DEQFTEGVYKVEFD-----TKAYWKKLGISPFHEY 87 (121) T ss_pred CEEEEEEECCCCCEEEEE-EEECCCCCCCCCCCC-----------CCCCCCEEEEEEEE-----HHHHHHHCCCCCCCCC T ss_conf 999999988999878976-885599767266667-----------34567777999998-----4897875599978886 Q ss_pred HC Q ss_conf 21 Q gi|254780180|r 166 IA 167 (360) Q Consensus 166 I~ 167 (360) |. T Consensus 88 V~ 89 (121) T cd05821 88 AE 89 (121) T ss_pred CC T ss_conf 30 No 72 >PRK10148 hypothetical protein; Provisional Probab=30.94 E-value=40 Score=14.73 Aligned_cols=25 Identities=32% Similarity=0.469 Sum_probs=10.6 Q ss_pred CCCCHHHCCCCCCCCCCHHHHCCCCC Q ss_conf 57852221234565211433101345 Q gi|254780180|r 160 IPLPPYIARKRPIDARDYVDYQTTYA 185 (360) Q Consensus 160 iPLPPYI~r~r~~~~~D~~~YQTVyA 185 (360) |+|-|||.=.. .-..-.+.||.||. T Consensus 1 M~l~PYL~F~G-n~~EA~~FY~~vfg 25 (147) T PRK10148 1 MPLSPYLSFAG-NCADAIAYYQQTLG 25 (147) T ss_pred CCCEEEEECCC-CHHHHHHHHHHHHC T ss_conf 94147675298-99999999999859 No 73 >PRK07904 short chain dehydrogenase; Provisional Probab=30.78 E-value=40 Score=14.71 Aligned_cols=65 Identities=17% Similarity=0.161 Sum_probs=45.3 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEE-CHHHHHHHHHHHHCCCEEEEECHH Q ss_conf 87899999999972155303787316566578874101356677000778-599999999875379809996287 Q gi|254780180|r 198 LHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFI-DAPTAQALNSVKSRGGRIVSVGTT 271 (360) Q Consensus 198 LHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i-~~~~~~~I~~ak~~g~rIiAVGTT 271 (360) .+|++.|-.+|...||.+..|. = | +|+++=..+.+ ...+.+ +++++++|.++-.+|+++|-++.- T Consensus 167 ~~f~~~L~~el~~~gIrV~~V~--P--G---~V~T~mt~~~~--~~p~~~~~e~vA~~i~~ai~~~k~~i~~p~~ 232 (253) T PRK07904 167 DGFYLGLGEALREYGVRVLVIR--P--G---QVRTRMSADVK--EAPLTVDKEDVANLAVTAVAKGKELVWAPPA 232 (253) T ss_pred HHHHHHHHHHHCCCCCEEEEEE--C--C---CCCCCCCCCCC--CCCCCCCHHHHHHHHHHHHHCCCCEEECCHH T ss_conf 9999999998477288899997--2--7---88676568999--8997689999999999999859969994877 No 74 >KOG0442 consensus Probab=30.68 E-value=24 Score=16.28 Aligned_cols=90 Identities=21% Similarity=0.159 Sum_probs=52.1 Q ss_pred CHHHHHHHHHCCCCCHHHCCCCCCCCCCHHHH----CCCCCCCCCCCCCCCC---------CCCCCHHHHHHHHHHCCCC Q ss_conf 30002544203578522212345652114331----0134567574005556---------8878999999999721553 Q gi|254780180|r 149 ELERQISLVGTIPLPPYIARKRPIDARDYVDY----QTTYAKIQGSVAAPTA---------GLHFTSNLLSRLISIGIKV 215 (360) Q Consensus 149 ~~~~~L~~~G~iPLPPYI~r~r~~~~~D~~~Y----QTVyA~~~GSVAAPTA---------GLHFt~~ll~~L~~kGi~~ 215 (360) +++.++...+.|-+|-+-.-.+........++ |..=++++|--+ |-| ++-|--+|=.-|-.+|+++ T Consensus 599 sFe~LIre~ssm~ip~e~dg~~~~~~~~~~~~~~~~~~~~tr~aggq~-~~~t~e~pVIVDmREf~SsLP~~Lh~~G~~V 677 (892) T KOG0442 599 SFEKLIREKSSLRIPLEQDGDCDEANLLEITPASGDRNSNTRKAGGQQ-PEATKETPVIVDMREFRSSLPSLLHRKGLRV 677 (892) T ss_pred HHHHHHHHCCCEEEEEECCCCHHCCCCCEECCCCCCCCCCCCCCCCCC-CCCCCCCCEEEEHHHHHHHCHHHHHHCCCEE T ss_conf 799997523440422204663110464200256544454433467666-5434356468666987755548888579658 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCCHHEEEC Q ss_conf 037873165665788741013566770007785 Q gi|254780180|r 216 YFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFID 248 (360) Q Consensus 216 ~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~ 248 (360) ..+||-|| +.|---.|-=|.-.|+ T Consensus 678 ip~tL~vG---------DYIlSP~icVERKSIs 701 (892) T KOG0442 678 IPCTLEVG---------DYILSPDICVERKSIS 701 (892) T ss_pred EEEEECCC---------CEECCCCCEEEECCHH T ss_conf 88860246---------7772660413403499 No 75 >TIGR02219 phage_NlpC_fam putative phage cell wall peptidase, NlpC/P60 family; InterPro: IPR011929 Members of this family show sequence similarity to members of the NlpC/P60 family, described by Anantharaman and Aravind . The NlpC/P60 family includes a number of characterised bacterial cell wall hydrolases. Members of this related family are all found in prophage regions of bacterial genomes.. Probab=30.27 E-value=26 Score=16.15 Aligned_cols=11 Identities=18% Similarity=0.398 Sum_probs=6.6 Q ss_pred HHCCCCCEEEE Q ss_conf 72698879999 Q gi|254780180|r 51 AFLNSNDAIVF 61 (360) Q Consensus 51 ~~L~~gDLLV~ 61 (360) +-.+||||||| T Consensus 76 ~~~qpGDlLlF 86 (135) T TIGR02219 76 EAAQPGDLLLF 86 (135) T ss_pred CCCCCCCEEEE T ss_conf 87888766777 No 76 >KOG1017 consensus Probab=30.27 E-value=36 Score=15.05 Aligned_cols=36 Identities=31% Similarity=0.471 Sum_probs=18.6 Q ss_pred EEEEECHHHHHHHHHHHHCCCCCCCCCCCCCEEECCC Q ss_conf 0999628789989999852898468887223665740 Q gi|254780180|r 264 RIVSVGTTSLRLLETATTEDGIINPWSGFTNIFITPG 300 (360) Q Consensus 264 rIiAVGTT~~R~LEs~~~~~~~~~~~~g~T~lfI~Pg 300 (360) .|+.-|+||..|.|-. .+.|+-..----.+|||.|- T Consensus 197 Pi~stGnTV~~Av~VL-~EhgVp~s~IiL~sLF~tP~ 232 (267) T KOG1017 197 PIISTGNTVCKAVEVL-KEHGVPDSNIILVSLFITPT 232 (267) T ss_pred EEECCCCCHHHHHHHH-HHCCCCCCEEEEEEEECCCH T ss_conf 1304781389999999-98199960089877431650 No 77 >cd06578 HemD Uroporphyrinogen-III synthase (HemD) catalyzes the asymmetrical cyclization of tetrapyrrole (linear) to uroporphyrinogen-III, the fourth step in the biosynthesis of heme. This ubiquitous enzyme is present in eukaryotes, bacteria and archaea. Mutations in the human uroporphyrinogen-III synthase gene cause congenital erythropoietic porphyria, a recessive inborn error of metabolism also known as Gunther disease. Probab=29.84 E-value=42 Score=14.60 Aligned_cols=91 Identities=15% Similarity=0.166 Sum_probs=54.8 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCC-CCCCCCCCCCCCCCCCHHEEECHHHHHHHHHH------- Q ss_conf 7574005556887899999999972155303787316566-57887410135667700077859999999987------- Q gi|254780180|r 187 IQGSVAAPTAGLHFTSNLLSRLISIGIKVYFVTLHVGAGT-FMPVKVEDTDDHIMHSEIGFIDAPTAQALNSV------- 258 (360) Q Consensus 187 ~~GSVAAPTAGLHFt~~ll~~L~~kGi~~~~iTLHVG~GT-F~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~a------- 258 (360) ....+--|+.+.- .+.|.+.|+++|+++..+.+---.-. ..+...+.+.+.... --..-|..+++..-+. T Consensus 121 ~~~~il~~~g~~~-~~~L~~~L~~~g~~v~~~~~Y~~~~~~~~~~~~~~i~~~~~~-~i~ftS~~~~~~~~~~~~~~~~~ 198 (239) T cd06578 121 KGKRILRPRGGRA-REDLAEALRERGAEVDEVEVYRTVPPDLDAELLELLEEGAID-AVLFTSPSTVRNLLELLGKEGRA 198 (239) T ss_pred CCCEEEEECCCCC-CHHHHHHHHHCCCEEEEEEEEEECCCCCCHHHHHHHHCCCCC-EEEECCHHHHHHHHHHHHHHHHH T ss_conf 8986999847877-568999999789867899886651788888999999748988-99990999999999986664156 Q ss_pred HHCCCEEEEECHHHHHHHHHH Q ss_conf 537980999628789989999 Q gi|254780180|r 259 KSRGGRIVSVGTTSLRLLETA 279 (360) Q Consensus 259 k~~g~rIiAVGTT~~R~LEs~ 279 (360) ...+.+++|+|-++..+|+.. T Consensus 199 ~~~~~~~v~ig~~ta~~l~~~ 219 (239) T cd06578 199 LLKNVKIAAIGPRTAEALREL 219 (239) T ss_pred HHCCCEEEEECHHHHHHHHHC T ss_conf 750988999899999999976 No 78 >KOG0210 consensus Probab=29.78 E-value=42 Score=14.60 Aligned_cols=24 Identities=17% Similarity=0.284 Sum_probs=12.8 Q ss_pred HCCCCCEEEEECCCEEEEEEEEEE Q ss_conf 269887999947956888998764 Q gi|254780180|r 52 FLNSNDAIVFNNTKVITAQLNGVR 75 (360) Q Consensus 52 ~L~~gDLLV~NnTKVipARL~g~k 75 (360) -++-|||+++-.-.-+||-+.-.| T Consensus 180 ~i~vGDvi~v~K~~RVPADmilLr 203 (1051) T KOG0210 180 DIKVGDVIIVHKDERVPADMILLR 203 (1051) T ss_pred CCCCCCEEEEECCCCCCCCEEEEE T ss_conf 553342799954875775169998 No 79 >cd05710 SIS_1 A subgroup of the SIS domain. SIS (Sugar ISomerase) domains are found in many phosphosugar isomerases and phosphosugar binding proteins. SIS domains are also found in proteins that regulate the expression of genes involved in synthesis of phosphosugars. Probab=29.75 E-value=42 Score=14.59 Aligned_cols=21 Identities=19% Similarity=0.354 Sum_probs=17.9 Q ss_pred CHHHHHHHHHHHHCCCEEEEE Q ss_conf 599999999875379809996 Q gi|254780180|r 248 DAPTAQALNSVKSRGGRIVSV 268 (360) Q Consensus 248 ~~~~~~~I~~ak~~g~rIiAV 268 (360) +++++++++.+|+.|.++||+ T Consensus 60 T~e~i~a~~~ak~~g~~tiai 80 (120) T cd05710 60 TKETVAAAKFAKEKGATVIGL 80 (120) T ss_pred CHHHHHHHHHHHHCCCEEEEE T ss_conf 789999999999869959999 No 80 >PRK07309 aromatic amino acid aminotransferase; Validated Probab=29.70 E-value=42 Score=14.59 Aligned_cols=29 Identities=17% Similarity=0.285 Sum_probs=18.1 Q ss_pred CCEEECCCCCCEE--EEEEEECCCCCHHHHH Q ss_conf 2366574096114--3125527755457899 Q gi|254780180|r 293 TNIFITPGYRFRA--VDILMSNFHLPKSTLL 321 (360) Q Consensus 293 T~lfI~Pg~~f~~--vd~LiTNFH~P~Stll 321 (360) ..+.+.||..|.. -+.+==||=.|.+.|- T Consensus 346 ~gV~v~PG~~Fg~~g~g~iRl~~a~~~e~l~ 376 (390) T PRK07309 346 KAVAIIPGVAFGPYGEGYVRLSYAASMETIK 376 (390) T ss_pred CCEEEECCHHHCCCCCCEEEEEEECCHHHHH T ss_conf 9999977732288999879999828999999 No 81 >PRK00556 minC septum formation inhibitor; Reviewed Probab=29.66 E-value=42 Score=14.58 Aligned_cols=15 Identities=20% Similarity=0.541 Sum_probs=8.4 Q ss_pred HHHHHHCCCCCEEEE Q ss_conf 347872698879999 Q gi|254780180|r 47 SDLPAFLNSNDAIVF 61 (360) Q Consensus 47 ~dl~~~L~~gDLLV~ 61 (360) .+|-+||+...|.-+ T Consensus 46 ~ei~~fL~~~~L~f~ 60 (195) T PRK00556 46 PEVKAFLQKHQLLFL 60 (195) T ss_pred HHHHHHHHHCCCEEE T ss_conf 999999987492699 No 82 >TIGR03356 BGL beta-galactosidase. Probab=29.24 E-value=20 Score=16.87 Aligned_cols=12 Identities=25% Similarity=0.526 Sum_probs=6.8 Q ss_pred CCCCCCHHHHCC Q ss_conf 588889778616 Q gi|254780180|r 6 FDFDLPPSRIAL 17 (360) Q Consensus 6 fDy~LP~elIAq 17 (360) |.|+||.-|-.. T Consensus 115 ~Hfd~P~~l~~~ 126 (427) T TIGR03356 115 YHWDLPQALEDR 126 (427) T ss_pred ECCCCHHHHHHC T ss_conf 556782999980 No 83 >PRK00002 aroB 3-dehydroquinate synthase; Reviewed Probab=29.12 E-value=43 Score=14.52 Aligned_cols=56 Identities=16% Similarity=0.076 Sum_probs=33.7 Q ss_pred CCEEEEECHHHHHHHHHHHHCCCCCCCCCC-----------CCCEEECCCCCCEEEEEEEECCCCCHH Q ss_conf 980999628789989999852898468887-----------223665740961143125527755457 Q gi|254780180|r 262 GGRIVSVGTTSLRLLETATTEDGIINPWSG-----------FTNIFITPGYRFRAVDILMSNFHLPKS 318 (360) Q Consensus 262 g~rIiAVGTT~~R~LEs~~~~~~~~~~~~g-----------~T~lfI~Pg~~f~~vd~LiTNFH~P~S 318 (360) -+++.--|-|.-.+||++.+- +.+.-++. .-.+-+.+.-.+.-+..++..+.+|.+ T Consensus 237 ~R~~LNfGHT~gHAiE~~~~~-~~l~HGeAVaiGM~~a~~ls~~~g~~~~~~~~~i~~ll~~~glp~~ 303 (360) T PRK00002 237 LRALLNLGHTFGHAIEALTGY-GRWLHGEAVAIGMVMAARLSERLGLLSEADAERIRALLERAGLPTS 303 (360) T ss_pred CHHHEECCCHHHHHHHHHCCC-CCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCC T ss_conf 513115640668999987399-8737299999979999999998189999999999999998698975 No 84 >PRK09700 D-allose transporter ATP-binding protein; Provisional Probab=28.76 E-value=44 Score=14.48 Aligned_cols=31 Identities=16% Similarity=0.375 Sum_probs=23.5 Q ss_pred CCCCCCCCCH-----HHHHHHHHHCCCCEEEEEECC Q ss_conf 5556887899-----999999972155303787316 Q gi|254780180|r 193 APTAGLHFTS-----NLLSRLISIGIKVYFVTLHVG 223 (360) Q Consensus 193 APTAGLHFt~-----~ll~~L~~kGi~~~~iTLHVG 223 (360) =|||+|-... +++.+|+++|+.+.+||.|.. T Consensus 171 EPTa~Ld~~~~~~l~~~l~~l~~~g~tii~isH~l~ 206 (510) T PRK09700 171 EPTSSLTNKEVDYLFLIMNQLRKEGTAIVYISHKLA 206 (510) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHH T ss_conf 885666867899999998888872871799952367 No 85 >pfam01058 Oxidored_q6 NADH ubiquinone oxidoreductase, 20 Kd subunit. Probab=28.69 E-value=44 Score=14.47 Aligned_cols=30 Identities=20% Similarity=0.337 Sum_probs=22.8 Q ss_pred HHHHHHHHHHHHCCCEEEEECHHHHHHHHHHHHCCCCC Q ss_conf 99999999875379809996287899899998528984 Q gi|254780180|r 249 APTAQALNSVKSRGGRIVSVGTTSLRLLETATTEDGII 286 (360) Q Consensus 249 ~~~~~~I~~ak~~g~rIiAVGTT~~R~LEs~~~~~~~~ 286 (360) ++..+.+.+++.+-+-|||||| |+...|.. T Consensus 55 ~~~~~~~~e~~~~ak~vIAvG~--------CA~~GGi~ 84 (124) T pfam01058 55 KEALERLYELAPKAKYVIAVGT--------CAAFGGIP 84 (124) T ss_pred HHHHHHHHHHCCCCCEEEEECC--------CCCCCCCC T ss_conf 8999999986421463899613--------42568644 No 86 >TIGR01116 ATPase-IIA1_Ca calcium-translocating P-type ATPase, SERCA-type; InterPro: IPR005782 This model describes Ca2+ ATPases (3.6.3.8 from EC) from eukaryotes and includes significant representatives from plants. Plasma membrane Ca2+ ATPases are primarily responsible for extrusion of calcium from cytoplasmic milieu to outside, coupled to the hydrolyis of ATP. Besides Ca2+ ATPases, Na+/Ca2+ exchangers are also involved in the maintenance of Ca2+ homestasis in the cell. ; GO: 0005388 calcium-transporting ATPase activity, 0006816 calcium ion transport, 0016021 integral to membrane. Probab=28.47 E-value=44 Score=14.44 Aligned_cols=109 Identities=20% Similarity=0.189 Sum_probs=60.5 Q ss_pred CHHHHHHHHHCCCCCHHHCCCCCCCCCCHHHHCCCCCC--CCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCC Q ss_conf 30002544203578522212345652114331013456--7574005556887899999999972155303787316566 Q gi|254780180|r 149 ELERQISLVGTIPLPPYIARKRPIDARDYVDYQTTYAK--IQGSVAAPTAGLHFTSNLLSRLISIGIKVYFVTLHVGAGT 226 (360) Q Consensus 149 ~~~~~L~~~G~iPLPPYI~r~r~~~~~D~~~YQTVyA~--~~GSVAAPTAGLHFt~~ll~~L~~kGi~~~~iTLHVG~GT 226 (360) ..+.+|++--++ .-+...-+.+...=|+|-|. .-|+=|==+=+|-|-+| -+..|+=+ || T Consensus 402 APE~~L~rct~I------~~~g~~VPL~~~~k~~il~~~~~~~~~aLRCLA~A~~~E---------s~Ltf~G~-vG--- 462 (800) T TIGR01116 402 APEEVLERCTHI------LLKGSAVPLTEKMKNTILASIKEMSSEALRCLALAFKEE---------SKLTFIGV-VG--- 462 (800) T ss_pred CHHHHHHHCCEE------EECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHEECCC---------CCCEEEEE-EE--- T ss_conf 724677233553------205701347846789999999987641324110012146---------78678888-50--- Q ss_pred CCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 578874101356677000778599999999875379809996287899899998528984688872 Q gi|254780180|r 227 FMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVSVGTTSLRLLETATTEDGIINPWSGF 292 (360) Q Consensus 227 F~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiAVGTT~~R~LEs~~~~~~~~~~~~g~ 292 (360) +.|-. -+|.+++|.+||+.|=|||=|===-==|=|+.+++=|.++.|+|. T Consensus 463 --------l~DPP--------R~EV~~ai~~CR~AGIrVImITGD~KeTA~AicR~IG~lsSfTG~ 512 (800) T TIGR01116 463 --------LLDPP--------RPEVADAIEKCREAGIRVIMITGDNKETAEAICRRIGILSSFTGR 512 (800) T ss_pred --------CCCCC--------CHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHHCCCCCCC T ss_conf --------68948--------168999999887379789998479824689997675010242000 No 87 >smart00052 EAL Putative diguanylate phosphodiesterase. Putative diguanylate phosphodiesterase, present in a variety of bacteria. Probab=28.04 E-value=45 Score=14.39 Aligned_cols=82 Identities=15% Similarity=0.177 Sum_probs=54.5 Q ss_pred CHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEE---ECHH----HHHHHHHHHHCCCEEEEECHHHH Q ss_conf 9999999997215530378731656657887410135667700077---8599----99999987537980999628789 Q gi|254780180|r 201 TSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGF---IDAP----TAQALNSVKSRGGRIVSVGTTSL 273 (360) Q Consensus 201 t~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~---i~~~----~~~~I~~ak~~g~rIiAVGTT~~ 273 (360) ..+.++.|++.|++++-=-.-.|..+|..+..=.++--|++..+.. -+.. .-..++-++..|-+|||-|=-+- T Consensus 135 ~~~~i~~l~~~G~~iaiDdfG~~~~~~~~l~~l~~d~iKld~~li~~~~~~~~~~~~~~~l~~~a~~~g~~viaegVE~~ 214 (241) T smart00052 135 AVATLQRLRELGVRIALDDFGTGYSSLSYLKRLPVDLLKIDKSFVRDLQTDPEDEAIVQSIIELAQKLGLQVVAEGVETP 214 (241) T ss_pred HHHHHHHHHHCCCCEEECCCCCCCCHHHHHHHCCCHHHCCCHHHHHHCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCH T ss_conf 99999999970997875378998101899986572042247999961326845589999999999985998999718819 Q ss_pred HHHHHHHHC Q ss_conf 989999852 Q gi|254780180|r 274 RLLETATTE 282 (360) Q Consensus 274 R~LEs~~~~ 282 (360) ..++.+..- T Consensus 215 ~~~~~l~~~ 223 (241) T smart00052 215 EQLDLLRSL 223 (241) T ss_pred HHHHHHHHC T ss_conf 999999974 No 88 >PRK07306 ribonucleotide-diphosphate reductase subunit alpha; Validated Probab=27.89 E-value=42 Score=14.55 Aligned_cols=14 Identities=7% Similarity=0.088 Sum_probs=6.4 Q ss_pred HHHHHHHHHHCCCE Q ss_conf 99999999986982 Q gi|254780180|r 333 TKKMYQHAISHAYR 346 (360) Q Consensus 333 ~~~~Y~~Ai~~~yr 346 (360) +.++|-+|=++|.+ T Consensus 682 L~~l~~yAWkkGLK 695 (720) T PRK07306 682 LSILRNYAFNKGIK 695 (720) T ss_pred HHHHHHHHHHCCCC T ss_conf 99999999972845 No 89 >COG2723 BglB Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase [Carbohydrate transport and metabolism] Probab=27.51 E-value=19 Score=17.09 Aligned_cols=58 Identities=17% Similarity=0.114 Sum_probs=28.7 Q ss_pred HHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEE Q ss_conf 999999972155303787316566578874101356677000778599999999875379809 Q gi|254780180|r 203 NLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRI 265 (360) Q Consensus 203 ~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rI 265 (360) .++.++.++=-.-.+|| -=|+|-+..++...|.|- -+..--.+-..++.+|...|-.| T Consensus 345 ~~l~~~~~rY~~p~fIt-ENG~G~~d~~~~~~i~Dd----yRI~Yl~~Hl~~v~~AI~dGv~v 402 (460) T COG2723 345 DILEKLYERYGIPLFIT-ENGLGVKDEVDFDGINDD----YRIDYLKEHLKAVKKAIEDGVDV 402 (460) T ss_pred HHHHHHHHHHCCCEEEE-CCCCCCCCCCCCCCCCCH----HHHHHHHHHHHHHHHHHHCCCCC T ss_conf 99999999749975896-688875465456776745----88999999999999999758872 No 90 >TIGR01959 nuoF_fam NADH-quinone oxidoreductase, F subunit; InterPro: IPR011537 This entry describes the F chain of complexes that resemble NADH-quinone oxidoreductases. The electron acceptor is a quinone, ubiquinone, in mitochondria and most bacteria, including Escherichia coli, where the recommended gene symbol is nuoF. This family does not have any members in chloroplast or cyanobacteria, where the quinone may be plastoquinone and NADH may be replaced by NADPH, nor in Methanosarcina, where NADH is replaced by F420H2. ; GO: 0010181 FMN binding, 0016651 oxidoreductase activity acting on NADH or NADPH, 0051287 NAD binding, 0051539 4 iron 4 sulfur cluster binding, 0006118 electron transport. Probab=27.42 E-value=46 Score=14.32 Aligned_cols=36 Identities=19% Similarity=0.166 Sum_probs=18.9 Q ss_pred CCCCCCEEEECCCCCCCCEEEEEEECCCCCCEEEEECCCCCCHHHHHHHH Q ss_conf 43698579970578874069999603557724999726786300025442 Q gi|254780180|r 108 VIKKGDILHFFSQDGQSRLEATVIDKWNTGEILLVFSISGIELERQISLV 157 (360) Q Consensus 108 klk~G~~l~~~~~~~~~~l~~~vi~~~~~g~~~i~f~~~~~~~~~~L~~~ 157 (360) .-..|.+|.= ++-.|-. .|.+.+.+- ..+.++|+.| T Consensus 237 ~~~~GtKLfs--------~SGhV~k---PG~yE~plG---~pl~ELledy 272 (420) T TIGR01959 237 EKSPGTKLFS--------VSGHVNK---PGNYELPLG---IPLRELLEDY 272 (420) T ss_pred CCCCCCEEEE--------EEEEEEC---CCEEEEECC---CCHHHHHHHH T ss_conf 8888854875--------3106657---944770067---7689999986 No 91 >cd05006 SIS_GmhA Phosphoheptose isomerase is a member of the SIS (Sugar ISomerase) superfamily. Phosphoheptose isomerase catalyzes the isomerization of sedoheptulose 7-phosphate into D-glycero-D-mannoheptose 7-phosphate. This is the first step of the biosynthesis of gram-negative bacteria inner core lipopolysaccharide precursor, L-glycero-D-mannoheptose (Gmh). Probab=27.19 E-value=46 Score=14.29 Aligned_cols=21 Identities=24% Similarity=0.445 Sum_probs=15.0 Q ss_pred CHHHHHHHHHHHHCCCEEEEE Q ss_conf 599999999875379809996 Q gi|254780180|r 248 DAPTAQALNSVKSRGGRIVSV 268 (360) Q Consensus 248 ~~~~~~~I~~ak~~g~rIiAV 268 (360) +++..++++.||+.|-++|++ T Consensus 114 s~nii~a~~~ak~~g~~~i~l 134 (177) T cd05006 114 SPNVLKALEAAKERGMKTIAL 134 (177) T ss_pred CHHHHHHHHHHHHCCCEEEEE T ss_conf 989999999999879989999 No 92 >TIGR02128 G6PI_arch bifunctional phosphoglucose/phosphomannose isomerase; InterPro: IPR011857 Mannose-6-phosphate isomerase or phosphomannose isomerase (5.3.1.8 from EC) (PMI) is the enzyme that catalyzes the interconversion of mannose-6-phosphate and fructose-6-phosphate. In eukaryotes PMI is involved in the synthesis of GDP-mannose, a constituent of N- and O-linked glycans and GPI anchors and in prokaryotes it participates in a variety of pathways, including capsular polysaccharide biosynthesis and D-mannose metabolism. PMI's belong to the cupin superfamily whose functions range from isomerase and epimerase activities involved in the modification of cell wall carbohydrates in bacteria and plants, to non-enzymatic storage proteins in plant seeds, and transcription factors linked to congenital baldness in mammals . Three classes of PMI have been defined . These bifunctional isomerases form a distinct phylogenetic cluster within the larger phosphoglucose isomerase (PGI) superfamily . They show relatively low sequence identity to other PGIs, but contain similar structural elements and show almost complete conservation of the catalytic residues in the active site, indicating they use a similar catalytic mechanism . The family appears to have originated in the archaea, with the bacterial proteins being acquired through horizontal transfer.; GO: 0004347 glucose-6-phosphate isomerase activity, 0004476 mannose-6-phosphate isomerase activity. Probab=27.11 E-value=46 Score=14.28 Aligned_cols=29 Identities=10% Similarity=0.051 Sum_probs=13.9 Q ss_pred EEEEHHHHHHHCC-CCCEEEEECCCEEEEEE Q ss_conf 3101434787269-88799994795688899 Q gi|254780180|r 42 SDHLVSDLPAFLN-SNDAIVFNNTKVITAQL 71 (360) Q Consensus 42 ~h~~F~dl~~~L~-~gDLLV~NnTKVipARL 71 (360) -=+.++++..-.+ ..-++ +++=..+|++. T Consensus 41 ~G~~~~~~~~~~~~~~Pvf-vv~DY~~p~~v 70 (338) T TIGR02128 41 AGRILSELLLEKSFEVPVF-VVKDYRLPAFV 70 (338) T ss_pred HHHHHHHHHHHHCCCCCEE-EECCCCCCCCC T ss_conf 9999999865324788888-87077896988 No 93 >TIGR02324 CP_lyasePhnL phosphonate C-P lyase system protein PhnL; InterPro: IPR012701 Members of this family are the PhnL protein of C-P lyase systems for utilization of phosphonates. These systems resemble phosphonatase-based systems in having a three-component ABC transporter, where IPR005769 from INTERPRO is the permease, IPR005770 from INTERPRO is the phosphonates binding protein, and IPR012693 from INTERPRO is the ATP-binding cassette (ABC) protein. They differ, however, in having, typically, ten or more additional genes, many of which are believed to form a membrane-associated C-P lyase complex. This protein (PhnL) and the adjacent-encoded PhnK (IPR012700 from INTERPRO) resemble transporter ATP-binding proteins but are suggested, based on mutagenesis studies, to be part of this C-P lyase complex rather than part of a transporter per se .. Probab=27.08 E-value=45 Score=14.41 Aligned_cols=23 Identities=30% Similarity=0.429 Sum_probs=15.7 Q ss_pred CCCCCCCCH-----HHHHHHHHHCCCCE Q ss_conf 556887899-----99999997215530 Q gi|254780180|r 194 PTAGLHFTS-----NLLSRLISIGIKVY 216 (360) Q Consensus 194 PTAGLHFt~-----~ll~~L~~kGi~~~ 216 (360) |||+|-=|+ |||++.|.+|+-+. T Consensus 176 PTASLd~~nr~vVvELi~e~K~~G~Ali 203 (224) T TIGR02324 176 PTASLDATNRKVVVELIAEAKARGAALI 203 (224) T ss_pred CCCCCCCCCHHHHHHHHHHHHHCCCEEE T ss_conf 5402411424889999999976597689 No 94 >PRK10444 UMP phosphatase; Provisional Probab=27.04 E-value=47 Score=14.27 Aligned_cols=19 Identities=16% Similarity=0.085 Sum_probs=10.0 Q ss_pred CCCEEEEECHHHHHHHHHH Q ss_conf 7980999628789989999 Q gi|254780180|r 261 RGGRIVSVGTTSLRLLETA 279 (360) Q Consensus 261 ~g~rIiAVGTT~~R~LEs~ 279 (360) +-.+++.||=+..==+..+ T Consensus 190 ~~~~~vmIGD~l~TDI~~a 208 (248) T PRK10444 190 HSEETVIVGDNLRTDILAG 208 (248) T ss_pred CHHHEEEECCCHHHHHHHH T ss_conf 8100799878805679999 No 95 >cd00288 Pyruvate_Kinase Pyruvate kinase (PK): Large allosteric enzyme that regulates glycolysis through binding of the substrate, phosphoenolpyruvate, and one or more allosteric effectors. Like other allosteric enzymes, PK has a high substrate affinity R state and a low affinity T state. PK exists as several different isozymes, depending on organism and tissue type. In mammals, there are four PK isozymes: R, found in red blood cells, L, found in liver, M1, found in skeletal muscle, and M2, found in kidney, adipose tissue, and lung. PK forms a homotetramer, with each subunit containing three domains. The T state to R state transition of PK is more complex than in most allosteric enzymes, involving a concerted rotation of all 3 domains of each monomer in the homotetramer. Probab=26.97 E-value=47 Score=14.26 Aligned_cols=21 Identities=10% Similarity=0.249 Sum_probs=10.9 Q ss_pred HHHHHHHCCCCCEEEEECCCE Q ss_conf 434787269887999947956 Q gi|254780180|r 46 VSDLPAFLNSNDAIVFNNTKV 66 (360) Q Consensus 46 F~dl~~~L~~gDLLV~NnTKV 66 (360) +.++.+.+++||.+.++|-++ T Consensus 111 ~~~l~~~v~~Gd~I~idDG~i 131 (480) T cd00288 111 YKNLTKDVSPGNTILVDDGLL 131 (480) T ss_pred HHHHHHHCCCCCEEEEECCEE T ss_conf 288897658998899947838 No 96 >cd05762 Ig8_MLCK Eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). Ig8_MLCK: the eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). MLCK is a key regulator of different forms of cell motility involving actin and myosin II. Agonist stimulation of smooth muscle cells increases cytosolic Ca2+, which binds calmodulin. This Ca2+-calmodulin complex in turn binds to and activates MLCK. Activated MLCK leads to the phosphorylation of the 20 kDa myosin regulatory light chain (RLC) of myosin II and the stimulation of actin-activated myosin MgATPase activity. MLCK is widely present in vertebrate tissues; it phosphorylates the 20 kDa RLC of both smooth and nonmuscle myosin II. Phosphorylation leads to the activation of the myosin motor domain and altered structural properties of myosin II. In smooth muscle MLCK it is involved in initiating contraction. In nonmuscle cells, MLCK may participate in cell division and cell motility; it has Probab=26.94 E-value=47 Score=14.26 Aligned_cols=46 Identities=20% Similarity=0.247 Sum_probs=30.8 Q ss_pred CCHHHHHHCCCCCCCCCEEEECCCCCCCCEEEEEEECCCCCCEEEEECC Q ss_conf 6101122115643698579970578874069999603557724999726 Q gi|254780180|r 97 NSWSVYARPSKVIKKGDILHFFSQDGQSRLEATVIDKWNTGEILLVFSI 145 (360) Q Consensus 97 ~~w~~l~kp~kklk~G~~l~~~~~~~~~~l~~~vi~~~~~g~~~i~f~~ 145 (360) -.| .|.++.++.++.+.+...++...|...-..+.+.|.+.+.... T Consensus 32 v~W---~k~~~~i~~~~~i~i~~~~~~~~L~I~~~~~~D~G~Yt~~~~N 77 (98) T cd05762 32 CTW---MKFRKQIQEGEGIKIENTENSSKLTITEGQQEHCGCYTLEVEN 77 (98) T ss_pred EEE---EECCEECCCCCCEEEEECCCEEEEEECCCCCCCCEEEEEEEEE T ss_conf 999---9899998789979999878848999999882279999999995 No 97 >PRK05928 hemD uroporphyrinogen-III synthase; Reviewed Probab=26.83 E-value=47 Score=14.24 Aligned_cols=85 Identities=18% Similarity=0.073 Sum_probs=49.4 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCC-C-----CCCCCCCCCCCHHEEECHHHHHHHHHHH---- Q ss_conf 40055568878999999999721553037873165665788-7-----4101356677000778599999999875---- Q gi|254780180|r 190 SVAAPTAGLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPV-K-----VEDTDDHIMHSEIGFIDAPTAQALNSVK---- 259 (360) Q Consensus 190 SVAAPTAGLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi-~-----~~~i~~h~mH~E~~~i~~~~~~~I~~ak---- 259 (360) .|--|.. -+=.+.|.+.|++.|+++..+..-- |-.|- . .+.+.+...+ -..+-|..+++...+.- T Consensus 128 ~vL~~~g-~~~~~~L~~~L~~~g~~v~~~~~Y~---~~~~~~~~~~~~~~~~~~~~~d-~i~ftS~~~v~~~~~~~~~~~ 202 (252) T PRK05928 128 RVLYLRG-NGGRPLLGDFLQERGADVDECEVYE---RKPPKLEGAEELIEELQTGEVD-AIIFTSPSMVRAFLSLAPELY 202 (252) T ss_pred EEEEEEC-CCCCHHHHHHHHHCCCEEEEEEEEE---EECCCCCCHHHHHHHHHCCCCC-EEEEECHHHHHHHHHHHHHHC T ss_conf 7999816-7666578999997798479998657---6378888279999998628987-999909999999999866403 Q ss_pred ----HCCCEEEEECHHHHHHHHHH Q ss_conf ----37980999628789989999 Q gi|254780180|r 260 ----SRGGRIVSVGTTSLRLLETA 279 (360) Q Consensus 260 ----~~g~rIiAVGTT~~R~LEs~ 279 (360) ..+-+|+|+|.++..+++.+ T Consensus 203 ~~~~~~~~~iv~ig~~ta~~~~~~ 226 (252) T PRK05928 203 RRHWLLRCRAVAIGKRTAEALKEL 226 (252) T ss_pred CHHHHHCCEEEEECHHHHHHHHHC T ss_conf 025651887999899999999986 No 98 >pfam08981 consensus Probab=26.67 E-value=47 Score=14.22 Aligned_cols=59 Identities=27% Similarity=0.309 Sum_probs=31.7 Q ss_pred HHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEEECHHHHHHHHHH Q ss_conf 999999997215530378731656657887410135667700077859999999987537980999628789989999 Q gi|254780180|r 202 SNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVSVGTTSLRLLETA 279 (360) Q Consensus 202 ~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiAVGTT~~R~LEs~ 279 (360) ..+++.++.+|+++.-||.|+|.-. .. ..++++++.+.+. +.|.+| -.||-++.-+|.+ T Consensus 40 ~k~~e~~~~~~~~lvvVth~~GF~~--------pg-------~~e~~~e~~~~L~---~~G~~V-~t~tH~lsg~eR~ 98 (181) T pfam08981 40 LKAAEALEGTNLNVVVVTHHAGFSE--------PG-------EQEMDPEVRKELE---ERGVKV-LTGTHALSGLERA 98 (181) T ss_pred HHHHHHHCCCCCEEEEEECCCCCCC--------CC-------CCCCCHHHHHHHH---HCCCEE-EEECCCCCCCHHH T ss_conf 9999871358955999946457789--------98-------5638999999999---779889-9950313442354 No 99 >pfam03088 Str_synth Strictosidine synthase. Strictosidine synthase (E.C. 4.3.3.2) is a key enzyme in alkaloid biosynthesis. It catalyses the condensation of tryptamine with secologanin to form strictosidine. Probab=26.64 E-value=47 Score=14.22 Aligned_cols=44 Identities=20% Similarity=0.223 Sum_probs=17.6 Q ss_pred CCCCCCEEEEECCCCCCCCEEE-EEEHHHHHHHCCCCCEEEEECC Q ss_conf 7812031356637888753153-1014347872698879999479 Q gi|254780180|r 21 FPRDSARLMVVHPNLSCPLVIS-DHLVSDLPAFLNSNDAIVFNNT 64 (360) Q Consensus 21 ~~Rd~SRLLV~~r~~~~~~~i~-h~~F~dl~~~L~~gDLLV~NnT 64 (360) +.+...|||.+|+.++....+. .-.|.|=...=+.++-++++.| T Consensus 32 e~~~tGRLl~ydp~t~~~~vL~~~L~f~NGVals~D~~~vlv~Et 76 (89) T pfam03088 32 EGDKTGRLMKYDPSTKVTKVLLKDLYFPNGIALSPDGSFVLFCET 76 (89) T ss_pred CCCCCCEEEEEECCCCEEEEEECCCCCCCEEEECCCCCEEEEECC T ss_conf 579975399995999948994218854761798789999999905 No 100 >PRK00048 dihydrodipicolinate reductase; Provisional Probab=26.59 E-value=42 Score=14.61 Aligned_cols=108 Identities=20% Similarity=0.205 Sum_probs=59.6 Q ss_pred CCCEEEEECCCCCCHHHHHHHHHCCCCCHHHCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCC Q ss_conf 77249997267863000254420357852221234565211433101345675740055568878999999999721553 Q gi|254780180|r 136 TGEILLVFSISGIELERQISLVGTIPLPPYIARKRPIDARDYVDYQTTYAKIQGSVAAPTAGLHFTSNLLSRLISIGIKV 215 (360) Q Consensus 136 ~g~~~i~f~~~~~~~~~~L~~~G~iPLPPYI~r~r~~~~~D~~~YQTVyA~~~GSVAAPTAGLHFt~~ll~~L~~kGi~~ 215 (360) ...+++.|+.+. .+.+.++..-+...|.-|--.. -++.+.+.-+. +|++-+-+-||-=+|= -.+|.++-++=.+. T Consensus 69 ~~DVvIDFS~p~-~~~~~~~~~~~~~~~~ViGTTG-~~~~~~~~i~~-~s~~ipil~apNfSlG--vnll~~l~~~aa~~ 143 (265) T PRK00048 69 DFDVLIDFTTPE-ATLENLEFALEHGKPLVIGTTG-FTEEQLAALRE-AAKKIPVVIAPNFSVG--VNLLMKLAEKAAKY 143 (265) T ss_pred CCCEEEECCCHH-HHHHHHHHHHHCCCCEEEEECC-CCHHHHHHHHH-HCCCCCEEEECCHHHH--HHHHHHHHHHHHHH T ss_conf 599899899889-9999999999749977996089-99899999997-4658878997855899--99999999999975 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHC Q ss_conf 0378731656657887410135667700077859999999987537 Q gi|254780180|r 216 YFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSR 261 (360) Q Consensus 216 ~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~ 261 (360) | +.| ..|=++.| |....--|.-||-.+-+.-++ T Consensus 144 ----l----~~~---dieIiE~H--H~~K~DaPSGTAl~la~~i~~ 176 (265) T PRK00048 144 ----L----GDY---DIEIIEAH--HRHKVDAPSGTALKLAEAIAE 176 (265) T ss_pred ----C----CCC---CEEEEEEC--CCCCCCCCCHHHHHHHHHHHH T ss_conf ----6----657---77999906--888899998889999999998 No 101 >PRK10826 2-deoxyglucose-6-phosphatase; Provisional Probab=26.39 E-value=48 Score=14.19 Aligned_cols=77 Identities=17% Similarity=0.090 Sum_probs=45.2 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEE-----------ECCC-CCCCC-CCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCE Q ss_conf 87899999999972155303787-----------3165-66578-87410135667700077859999999987537980 Q gi|254780180|r 198 LHFTSNLLSRLISIGIKVYFVTL-----------HVGA-GTFMP-VKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGR 264 (360) Q Consensus 198 LHFt~~ll~~L~~kGi~~~~iTL-----------HVG~-GTF~P-i~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~r 264 (360) +.--.++|+.|+++|+.++-+|= +.|+ .-|.- +..+++..+|-|.|-|.. +++++ .....+ T Consensus 94 ~pGv~e~L~~l~~~g~~l~i~Ts~~~~~~~~~L~~~gl~~~Fd~iv~~ddv~~~KP~Pe~yl~---A~~~l---g~~p~e 167 (222) T PRK10826 94 LPGVREALALCKAQGLKIGLASASPLHMLEAVLTMLDLRDYFDALASAEKLPYSKPHPEVYLN---CAAKL---GVDPLT 167 (222) T ss_pred CCCHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCHHHHHH---HHHHH---CCCHHH T ss_conf 866999999999779975886288489999999874996411057635325677998499999---99995---989889 Q ss_pred EEEECHHHHHHHHHHHH Q ss_conf 99962878998999985 Q gi|254780180|r 265 IVSVGTTSLRLLETATT 281 (360) Q Consensus 265 IiAVGTT~~R~LEs~~~ 281 (360) .+.||=+ ..-++++.+ T Consensus 168 ~lvveDS-~~gi~aA~~ 183 (222) T PRK10826 168 CVALEDS-FNGLIAAKA 183 (222) T ss_pred EEEECCC-HHHHHHHHH T ss_conf 6878389-999999998 No 102 >PRK09919 hypothetical protein; Provisional Probab=26.37 E-value=48 Score=14.19 Aligned_cols=40 Identities=20% Similarity=0.323 Sum_probs=28.5 Q ss_pred HCCCCCEEEEECCCEEEEEEEEEEECCCCCCCCEEEEE--EECCCCCCCHHHHHHC Q ss_conf 26988799994795688899876404677776505899--8112486610112211 Q gi|254780180|r 52 FLNSNDAIVFNNTKVITAQLNGVRFCHINRREKEISCT--LHMRVSPNSWSVYARP 105 (360) Q Consensus 52 ~L~~gDLLV~NnTKVipARL~g~k~~~~~~~g~~ieil--l~~~~~~~~w~~l~kp 105 (360) ||+|||++-.+++.|+- . +....+. -.-+.++..|..+ |. T Consensus 38 ~L~pG~~i~~~~~gvli---N----------~k~~~itIynvt~yn~~lW~~i-k~ 79 (114) T PRK09919 38 FLPPGSIFTPVKSGILL---N----------DKEYPITIYNVTPFNKALWSLL-KS 79 (114) T ss_pred EECCCCEEEECCCEEEE---C----------CCEEEEEEEEECCCCHHHHHHH-HH T ss_conf 80899888974883898---8----------9386678998445799999987-55 No 103 >PRK13478 phosphonoacetaldehyde hydrolase; Provisional Probab=26.10 E-value=48 Score=14.19 Aligned_cols=77 Identities=17% Similarity=0.108 Sum_probs=46.0 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEE------E-----CCCCCCCC---CCCCCCCCCCCCCHHEEECHHHHHHHHHHHH-CC Q ss_conf 87899999999972155303787------3-----16566578---8741013566770007785999999998753-79 Q gi|254780180|r 198 LHFTSNLLSRLISIGIKVYFVTL------H-----VGAGTFMP---VKVEDTDDHIMHSEIGFIDAPTAQALNSVKS-RG 262 (360) Q Consensus 198 LHFt~~ll~~L~~kGi~~~~iTL------H-----VG~GTF~P---i~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~-~g 262 (360) +.=..++++.|+++||+++-.|= . .+.--|.| |..+++...+-|.+-+ ..+..+..- .. T Consensus 103 iPGv~e~l~~Lr~~Gikig~~TG~~~~~~~~vl~~a~~~g~~pd~~vs~ddv~~grP~P~mi------~~a~~~Lgv~~~ 176 (267) T PRK13478 103 IPGVLEVIAALRARGIKIGSTTGYTREMMDVVVPAAAAQGYRPDHVVATDDVPAGRPAPWMA------LKNVIELGVYDV 176 (267) T ss_pred CCCHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCCCHHHH------HHHHHHCCCCCH T ss_conf 74499999999874983999789978999999999985597777585200079999897999------999998089986 Q ss_pred CEEEEECHHHHHHHHHHH Q ss_conf 809996287899899998 Q gi|254780180|r 263 GRIVSVGTTSLRLLETAT 280 (360) Q Consensus 263 ~rIiAVGTT~~R~LEs~~ 280 (360) ..+|.||-|+.=..|... T Consensus 177 ~~vvkVgDT~~dI~~G~n 194 (267) T PRK13478 177 AACVKVDDTVPGIEEGRN 194 (267) T ss_pred HHEEEECCCHHHHHHHHH T ss_conf 893777898889999997 No 104 >PRK13549 xylose transporter ATP-binding subunit; Provisional Probab=25.99 E-value=49 Score=14.14 Aligned_cols=31 Identities=23% Similarity=0.449 Sum_probs=23.1 Q ss_pred CCCCCCCCCH-----HHHHHHHHHCCCCEEEEEECC Q ss_conf 5556887899-----999999972155303787316 Q gi|254780180|r 193 APTAGLHFTS-----NLLSRLISIGIKVYFVTLHVG 223 (360) Q Consensus 193 APTAGLHFt~-----~ll~~L~~kGi~~~~iTLHVG 223 (360) =|||+|--.. +++.+|+++|+.+.+||.+.. T Consensus 169 EPt~~LD~~~~~~l~~~l~~l~~~g~til~isH~l~ 204 (513) T PRK13549 169 EPTASLTESETAILLDIIRDLKQHGIACIYISHKLN 204 (513) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCHH T ss_conf 675788857779999999998755967999945714 No 105 >PRK07683 aminotransferase A; Validated Probab=25.89 E-value=49 Score=14.13 Aligned_cols=30 Identities=10% Similarity=0.249 Sum_probs=18.6 Q ss_pred EEEEECCCCCCCCEEEEEEHHHHHHHCCCCCEEEEEC Q ss_conf 1356637888753153101434787269887999947 Q gi|254780180|r 27 RLMVVHPNLSCPLVISDHLVSDLPAFLNSNDAIVFNN 63 (360) Q Consensus 27 RLLV~~r~~~~~~~i~h~~F~dl~~~L~~gDLLV~Nn 63 (360) ..+|.... ....+.-+--+++|||-+++.+ T Consensus 91 ~I~it~G~-------~~al~~~~~~~~~pGD~Vlv~~ 120 (387) T PRK07683 91 EIIVTIGA-------SEAIDIAFRTILEPGTEVILPA 120 (387) T ss_pred EEEEECCH-------HHHHHHHHHHHCCCCCEEEECC T ss_conf 59992782-------9999999999769999899837 No 106 >cd03870 M14_CPA Peptidase M14 Carboxypeptidase (CP) A (CPA) belongs to the A/B subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPA enzymes generally favor hydrophobic residues. A/B subfamily enzymes are normally synthesized as inactive precursors containing preceding signal peptide, followed by a globular N-terminal pro-region linked to the enzyme; these proenzymes are called procarboxypeptidases. The procarboxypeptidase A (PCPA) is produced by the exocrine pancreas and stored as a stable zymogen in the pancreatic granules until secretion into the digestive tract occurs. This subfamily includes CPA1, CPA2 and CPA4 forms. Within these A forms, there are slightly different specificities, with CPA1 preferring aliphatic and small aromatic residues, and CPA2 p Probab=25.71 E-value=49 Score=14.11 Aligned_cols=116 Identities=20% Similarity=0.264 Sum_probs=66.6 Q ss_pred HHHHHHHHHHCCCCEEEEEECCCCCCC-CC--CCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEEECHHHHHHHHH Q ss_conf 999999997215530378731656657-88--741013566770007785999999998753798099962878998999 Q gi|254780180|r 202 SNLLSRLISIGIKVYFVTLHVGAGTFM-PV--KVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVSVGTTSLRLLET 278 (360) Q Consensus 202 ~~ll~~L~~kGi~~~~iTLHVG~GTF~-Pi--~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiAVGTT~~R~LEs 278 (360) ..+.+-+++.+--.++|++|-.-.-+. |- .++...+ +.|...|....+++|.+. .|....||+++ ..| T Consensus 171 ~Av~~~~~~~~~i~~~i~~HSygq~il~P~gyt~~~~pd---~~~l~~~a~~~a~ai~~~---~g~~Y~~G~~~-~~l-- 241 (301) T cd03870 171 KSIVDFVKSHGNFKAFISIHSYSQLLLYPYGYTTQSIPD---KTELNQVAKSAVAALKSL---YGTSYKYGSII-TTI-- 241 (301) T ss_pred HHHHHHHHHCCCCEEEEEECCCCCEEEECCCCCCCCCCC---HHHHHHHHHHHHHHHHHH---HCCCCCCCCHH-HEE-- T ss_conf 999999984878269999715665588567678879989---999999999999999997---69986257555-332-- Q ss_pred HHHCCCCCCCCCCCCCEEECCCCCCEEEEEEEECCCCCHHHHHHHH-HHHHC Q ss_conf 9852898468887223665740961143125527755457899999-98719 Q gi|254780180|r 279 ATTEDGIINPWSGFTNIFITPGYRFRAVDILMSNFHLPKSTLLMLV-SAFCG 329 (360) Q Consensus 279 ~~~~~~~~~~~~g~T~lfI~Pg~~f~~vd~LiTNFH~P~Stll~Lv-~Af~G 329 (360) +...|.-..|.-. .=|..-|.+.+-|.---.|++|.+-..=.. .+|+| T Consensus 242 -Y~asG~s~Dw~y~--~gi~~syt~ELrd~g~~GF~lP~~~I~p~~~E~~~g 290 (301) T cd03870 242 -YQASGGSIDWSYN--QGIKYSFTFELRDTGRYGFLLPASQIIPTAQETWLG 290 (301) T ss_pred -ECCCCCHHHHHHC--CCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf -2467778887600--699489999957898899719989936789999999 No 107 >PRK07449 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylic acid synthase/2-oxoglutarate decarboxylase; Validated Probab=25.69 E-value=49 Score=14.10 Aligned_cols=29 Identities=24% Similarity=0.252 Sum_probs=16.2 Q ss_pred HHHHHHHCCCEEEEE-CH--H--HHHHHHHHHHC Q ss_conf 999875379809996-28--7--89989999852 Q gi|254780180|r 254 ALNSVKSRGGRIVSV-GT--T--SLRLLETATTE 282 (360) Q Consensus 254 ~I~~ak~~g~rIiAV-GT--T--~~R~LEs~~~~ 282 (360) +|=.+.+.++|++|| |= = .+=.|+++... T Consensus 415 AiGa~~a~~~~vv~i~GDGsf~~~~~eL~Ta~~~ 448 (548) T PRK07449 415 AAGVARASGKPTVALIGDLSFLHDLNGLLLLKQE 448 (548) T ss_pred HHHHHHCCCCCEEEEECCHHHHHCHHHHHHHHHC T ss_conf 9999757899889995545876045799999866 No 108 >COG0637 Predicted phosphatase/phosphohexomutase [General function prediction only] Probab=25.65 E-value=49 Score=14.10 Aligned_cols=84 Identities=20% Similarity=0.260 Sum_probs=57.0 Q ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEE-----------CC-CCCCCC-CCCCCCCCCCCCC--------- Q ss_conf 5675740055568878999999999721553037873-----------16-566578-8741013566770--------- Q gi|254780180|r 185 AKIQGSVAAPTAGLHFTSNLLSRLISIGIKVYFVTLH-----------VG-AGTFMP-VKVEDTDDHIMHS--------- 242 (360) Q Consensus 185 A~~~GSVAAPTAGLHFt~~ll~~L~~kGi~~~~iTLH-----------VG-~GTF~P-i~~~~i~~h~mH~--------- 242 (360) ....+..+-|-+ .++|+.|+++|+.++-.+=. +| ..-|.. |..+++.++|-|. T Consensus 80 ~~~~~~~~~pGv-----~~~l~~L~~~~i~~avaS~s~~~~~~~~L~~~gl~~~f~~~v~~~dv~~~KP~Pd~yL~Aa~~ 154 (221) T COG0637 80 LELEGLKPIPGV-----VELLEQLKARGIPLAVASSSPRRAAERVLARLGLLDYFDVIVTADDVARGKPAPDIYLLAAER 154 (221) T ss_pred HCCCCCCCCCCH-----HHHHHHHHHCCCCEEEECCCHHHHHHHHHHHCCCHHHCCEEEECCCCCCCCCCCHHHHHHHHH T ss_conf 413567755119-----999999875599889974983888999999838833156377321046789897999999998 Q ss_pred -----HHEEECHHHHHHHHHHHHCCCEEEEECHHHH Q ss_conf -----0077859999999987537980999628789 Q gi|254780180|r 243 -----EIGFIDAPTAQALNSVKSRGGRIVSVGTTSL 273 (360) Q Consensus 243 -----E~~~i~~~~~~~I~~ak~~g~rIiAVGTT~~ 273 (360) +.|-+=+++..-|..+|+.|-++|+|....- T Consensus 155 Lgv~P~~CvviEDs~~Gi~Aa~aAGm~vv~v~~~~~ 190 (221) T COG0637 155 LGVDPEECVVVEDSPAGIQAAKAAGMRVVGVPAGHD 190 (221) T ss_pred CCCCHHHEEEEECCHHHHHHHHHCCCEEEEECCCCC T ss_conf 499912579895778889999987998999738887 No 109 >PRK07212 consensus Probab=25.64 E-value=49 Score=14.10 Aligned_cols=27 Identities=19% Similarity=0.335 Sum_probs=15.8 Q ss_pred CCEEECCCCCCEEEEEEEECCCCCHHH Q ss_conf 236657409611431255277554578 Q gi|254780180|r 293 TNIFITPGYRFRAVDILMSNFHLPKST 319 (360) Q Consensus 293 T~lfI~Pg~~f~~vd~LiTNFH~P~St 319 (360) ..+.+.||..|..-..+==||=.|... T Consensus 339 ~gV~v~pG~~Fg~~g~iRl~~a~~~e~ 365 (378) T PRK07212 339 AHVAVTPGSAFGAPGWIRISYATSMER 365 (378) T ss_pred CCEEEEECCCCCCCCEEEEEEECCHHH T ss_conf 989998174459999399998089999 No 110 >PRK08115 ribonucleotide-diphosphate reductase subunit alpha; Validated Probab=25.50 E-value=24 Score=16.38 Aligned_cols=145 Identities=14% Similarity=0.189 Sum_probs=76.8 Q ss_pred CCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHE-------EECHHHHHHHHHHHHCCCEEE-- Q ss_conf 68878999999999721553037873165665788741013566770007-------785999999998753798099-- Q gi|254780180|r 196 AGLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIG-------FIDAPTAQALNSVKSRGGRIV-- 266 (360) Q Consensus 196 AGLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~-------~i~~~~~~~I~~ak~~g~rIi-- 266 (360) .|+-|++++++.+... --.+.+.|----|.|---......+..+..|.| .+|++..+.|.+..-++.-+. T Consensus 569 EA~~lae~Vm~~I~~~-Ay~aSveLAkErG~FP~fdas~~~e~~~~re~fl~sg~~k~lpedlre~I~k~GiRNa~ltTI 647 (857) T PRK08115 569 EGNKLVDEIFETIATT-AYRTSIELAKEKGSFPFLVGVTDEETARLRERFIQTGYMKKMPEDIRQAILKTGIRNSHLLTV 647 (857) T ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCCCHHHHCCHHHHHHHHHCCCCCCCCCCC T ss_conf 9999999999999999-999999999700888676531112320235554113345548799999997549650023450 Q ss_pred -EECHHHHHHHHHHHHCCCCCCCCCCC---------------CCE-----EECCCC-CCEEEEEEEECCCCCHHHHHHHH Q ss_conf -96287899899998528984688872---------------236-----657409-61143125527755457899999 Q gi|254780180|r 267 -SVGTTSLRLLETATTEDGIINPWSGF---------------TNI-----FITPGY-RFRAVDILMSNFHLPKSTLLMLV 324 (360) Q Consensus 267 -AVGTT~~R~LEs~~~~~~~~~~~~g~---------------T~l-----fI~Pg~-~f~~vd~LiTNFH~P~Stll~Lv 324 (360) .-||||+=+==| .|. .|.-.+ .++ ..+|.. +=.+.+..+|---+.-.-|+-|- T Consensus 648 APTGTISliAgvS----SGI-EPvFA~~Y~R~v~~g~f~ev~~~~v~e~~~~~~~~~~~~LPe~FVTA~DIsPedHVrMQ 722 (857) T PRK08115 648 APTGSTGTMVGVS----TGL-EPYFSFSYFRSGRLGKFIEVKADIVQEYLDIHPEADENHLPEWFVSAMDLKPEAHADVQ 722 (857) T ss_pred CCCCHHHHHHCCC----CCC-CCCCCCEEEEEECCCCCEEECHHHHHHHHHHCCCCCHHHCCCEEEECCCCCHHHHHHHH T ss_conf 7865899885506----773-76763158983214776020179999999975878934575716671759999999999 Q ss_pred HHHH-----------------CHHHHHHHHHHHHHCCCE Q ss_conf 9871-----------------989999999999986982 Q gi|254780180|r 325 SAFC-----------------GIEETKKMYQHAISHAYR 346 (360) Q Consensus 325 ~Af~-----------------G~~~~~~~Y~~Ai~~~yr 346 (360) +||- ..+.+.++|..|-+.|.+ T Consensus 723 AA~QkwVDsSISKTINlPndATvEDV~~vY~~AY~~GCK 761 (857) T PRK08115 723 CIIQRWIDSSISKTVNAPKGYTVEQVEKVYERLYRGGAK 761 (857) T ss_pred HHHHHHHCCCCCCCEECCCCCCHHHHHHHHHHHHHCCCC T ss_conf 988644034010433288988999999999999976997 No 111 >PRK13645 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=25.35 E-value=42 Score=14.60 Aligned_cols=31 Identities=32% Similarity=0.452 Sum_probs=24.1 Q ss_pred CCCCCCCCC-----HHHHHHHH-HHCCCCEEEEEECC Q ss_conf 555688789-----99999999-72155303787316 Q gi|254780180|r 193 APTAGLHFT-----SNLLSRLI-SIGIKVYFVTLHVG 223 (360) Q Consensus 193 APTAGLHFt-----~~ll~~L~-~kGi~~~~iTLHVG 223 (360) =|||||--. .+++.+|. +.|..+..||.++. T Consensus 176 EPTagLDp~~~~~i~~ll~~L~~~~g~Tvi~vtHdm~ 212 (289) T PRK13645 176 EPTGGLDPKGEEDFINLFERLNKEYKKRIIMVTHNMD 212 (289) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCHH T ss_conf 8876489899999999999999956999999915999 No 112 >PRK09348 glyQ glycyl-tRNA synthetase subunit alpha; Validated Probab=25.34 E-value=24 Score=16.35 Aligned_cols=25 Identities=28% Similarity=0.678 Sum_probs=14.0 Q ss_pred EEECCCCCCCHH-HHHHCCCCCCCCC Q ss_conf 981124866101-1221156436985 Q gi|254780180|r 89 TLHMRVSPNSWS-VYARPSKVIKKGD 113 (360) Q Consensus 89 ll~~~~~~~~w~-~l~kp~kklk~G~ 113 (360) ++++-+.+.-|. |.+.|++|...|. T Consensus 39 Tflr~lgp~pw~~aYvqPsrRP~DgR 64 (291) T PRK09348 39 TFLRALGPEPWNAAYVQPSRRPTDGR 64 (291) T ss_pred HHHHCCCCCCCCEEEECCCCCCCCCC T ss_conf 88850288764054213457888775 No 113 >pfam06434 Aconitase_2_N Aconitate hydratase 2 N-terminus. This family represents the N-terminal region of several bacterial Aconitate hydratase 2 proteins and is found in conjunction with pfam00330. Probab=25.11 E-value=50 Score=14.03 Aligned_cols=50 Identities=22% Similarity=0.525 Sum_probs=30.3 Q ss_pred CCCCCCCCCEEEECCCCCCCCEEEEEEECCCCCCEEEEECCCCCCHHHHHHHHHCCCC Q ss_conf 1564369857997057887406999960355772499972678630002544203578 Q gi|254780180|r 105 PSKVIKKGDILHFFSQDGQSRLEATVIDKWNTGEILLVFSISGIELERQISLVGTIPL 162 (360) Q Consensus 105 p~kklk~G~~l~~~~~~~~~~l~~~vi~~~~~g~~~i~f~~~~~~~~~~L~~~G~iPL 162 (360) .--++..|++|.+....+ ++.+ +.|+....|+.....+.+....=|++|| T Consensus 120 dv~~l~tGdvi~I~p~~g------~i~~--~~g~v~~~f~l~p~~l~DevRAGGri~L 169 (204) T pfam06434 120 DVSSLNTGDVITIYPYEG------KITK--EDGEVISTFELKPNTLLDEVRAGGRIPL 169 (204) T ss_pred ECCCCCCCCEEEEECCCC------EEEC--CCCCEEEEEECCCCCHHHHHHCCCCEEE T ss_conf 403166798899965788------6976--9997898876277427888754883799 No 114 >PRK06203 aroB 3-dehydroquinate synthase; Reviewed Probab=25.08 E-value=50 Score=14.03 Aligned_cols=59 Identities=12% Similarity=0.002 Sum_probs=33.4 Q ss_pred HCCCEEEEECHHHHHHHHHHHHCCCCCCCCCC-----------CCCEEECCCCCCEEEEEEEECCCCCHHHH Q ss_conf 37980999628789989999852898468887-----------22366574096114312552775545789 Q gi|254780180|r 260 SRGGRIVSVGTTSLRLLETATTEDGIINPWSG-----------FTNIFITPGYRFRAVDILMSNFHLPKSTL 320 (360) Q Consensus 260 ~~g~rIiAVGTT~~R~LEs~~~~~~~~~~~~g-----------~T~lfI~Pg~~f~~vd~LiTNFH~P~Stl 320 (360) ...+|++.-|-|.--++|+...- .+.-++. .-.+-..|.-.+.-+-.++.++-+|-+-- T Consensus 255 ~g~rr~LNfGHT~gHaiE~~~~~--~l~HGeAVaiGm~~aa~lS~~~g~l~~~~~~ri~~ll~~~gLP~~~p 324 (390) T PRK06203 255 MGSSRPLDFGHWAAHKLESLTNY--RLRHGEAVAIGIALDTLYSYLAGLLPEAEWQRVLNTLKALGFTLYHP 324 (390) T ss_pred HCCHHHHHHHHHHHHHHHHCCCC--CCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCH T ss_conf 25635661010267888616799--98719999998999999999808999999999999999869998770 No 115 >COG1129 MglA ABC-type sugar transport system, ATPase component [Carbohydrate transport and metabolism] Probab=25.07 E-value=43 Score=14.49 Aligned_cols=29 Identities=28% Similarity=0.596 Sum_probs=23.1 Q ss_pred CCCCCCCCCH-----HHHHHHHHHCCCCEEEEEE Q ss_conf 5556887899-----9999999721553037873 Q gi|254780180|r 193 APTAGLHFTS-----NLLSRLISIGIKVYFVTLH 221 (360) Q Consensus 193 APTAGLHFt~-----~ll~~L~~kGi~~~~iTLH 221 (360) =|||+|-=.+ +++.+|+++|+.+.||+.. T Consensus 171 EPTaaLt~~E~~~Lf~~ir~Lk~~Gv~iI~ISHr 204 (500) T COG1129 171 EPTAALTVKETERLFDLIRRLKAQGVAIIYISHR 204 (500) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCC T ss_conf 9876799899999999999999689889998676 No 116 >PRK13640 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=24.96 E-value=49 Score=14.09 Aligned_cols=54 Identities=13% Similarity=0.249 Sum_probs=35.4 Q ss_pred CCCCCCCCC-----HHHHHHH-HHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEE Q ss_conf 555688789-----9999999-9721553037873165665788741013566770007785999999998753798099 Q gi|254780180|r 193 APTAGLHFT-----SNLLSRL-ISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIV 266 (360) Q Consensus 193 APTAGLHFt-----~~ll~~L-~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIi 266 (360) =|||||-.. .+++.+| +++|+.+..||.++-.- .+..+..| -+.|||+ T Consensus 170 EPTs~LD~~~~~~i~~~l~~l~~e~g~TvI~itHd~~~a--------------~~aDrv~v------------m~~G~iv 223 (283) T PRK13640 170 ESTSMLDPAGKEQILKLIRKLMKDNNLTIISITHDIDEA--------------AGADQVLV------------LDDGKLL 223 (283) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECHHHH--------------HHCCEEEE------------EECCEEE T ss_conf 874548989999999999999970698999997887899--------------70998999------------9999999 Q ss_pred EECHHH Q ss_conf 962878 Q gi|254780180|r 267 SVGTTS 272 (360) Q Consensus 267 AVGTT~ 272 (360) +-||.- T Consensus 224 ~~G~p~ 229 (283) T PRK13640 224 AQGSPV 229 (283) T ss_pred EECCHH T ss_conf 977789 No 117 >CHL00067 rps2 ribosomal protein S2 Probab=24.93 E-value=51 Score=14.01 Aligned_cols=60 Identities=20% Similarity=0.285 Sum_probs=34.3 Q ss_pred HHHHHHHHHHCCCEEEEECHHH--HHHHHHHHHCCCCCCCCCCCCCEEECCCCCCEEEEEEEECCCCCHHHHHHH Q ss_conf 9999998753798099962878--998999985289846888722366574096114312552775545789999 Q gi|254780180|r 251 TAQALNSVKSRGGRIVSVGTTS--LRLLETATTEDGIINPWSGFTNIFITPGYRFRAVDILMSNFHLPKSTLLML 323 (360) Q Consensus 251 ~~~~I~~ak~~g~rIiAVGTT~--~R~LEs~~~~~~~~~~~~g~T~lfI~Pg~~f~~vd~LiTNFH~P~Stll~L 323 (360) ++.-|.+.-..||.|+=|||-- -..++.++...|.. | -..+..-|++|||-.-+.++--| T Consensus 53 A~~~i~~~~~~~g~ILfVgTk~~~~~~i~~~A~~~~~~---------y----V~~rWlgG~LTN~~ti~k~i~~l 114 (227) T CHL00067 53 ACDLVFDAASRGKKFLFVGTKKQAADLVASAAIRARCH---------Y----VNKKWLGGMLTNWSTTKTRLKKL 114 (227) T ss_pred HHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHCCC---------E----EECCCCCCCCCCHHHHHHHHHHH T ss_conf 99999999854994799976587899999999981997---------1----53764287444789997799999 No 118 >PRK07632 ribonucleotide-diphosphate reductase subunit alpha; Validated Probab=24.86 E-value=51 Score=14.00 Aligned_cols=37 Identities=16% Similarity=0.304 Sum_probs=25.7 Q ss_pred EECCCCCHHHHHHHHHHHH-----------------CHHHHHHHHHHHHHCCCE Q ss_conf 5277554578999999871-----------------989999999999986982 Q gi|254780180|r 310 MSNFHLPKSTLLMLVSAFC-----------------GIEETKKMYQHAISHAYR 346 (360) Q Consensus 310 iTNFH~P~Stll~Lv~Af~-----------------G~~~~~~~Y~~Ai~~~yr 346 (360) -|-+..+.-.++-+.+++. ..+.+.++|-+|=++|.+ T Consensus 623 kTA~eI~q~~iId~aA~rQk~IDQsqS~nLf~~~~~t~~dl~~~~~yAwk~GlK 676 (696) T PRK07632 623 KSAYDMDQFKLIDLIATIQTHIDQGISTILYVNSEISTRELARLYVYAHHKGLK 676 (696) T ss_pred CHHHHCCHHHHHHHHHHHHCCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHCCH T ss_conf 308647969999999998177566420510769988999999999999981832 No 119 >pfam01964 ThiC ThiC family. ThiC is found within the thiamine biosynthesis operon. ThiC is involved in pyrimidine biosynthesis. The precise catalytic function of ThiC is still not known. ThiC participates in the formation of 4-Amino-5-hydroxymethyl-2-methylpyrimidine from AIR, an intermediate in the de novo pyrimidine biosynthesis. Probab=24.85 E-value=43 Score=14.49 Aligned_cols=39 Identities=31% Similarity=0.431 Sum_probs=21.5 Q ss_pred HCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHH-CCCCEEEEEECCC Q ss_conf 10134567574005556887899999999972-1553037873165 Q gi|254780180|r 180 YQTTYAKIQGSVAAPTAGLHFTSNLLSRLISI-GIKVYFVTLHVGA 224 (360) Q Consensus 180 YQTVyA~~~GSVAAPTAGLHFt~~ll~~L~~k-Gi~~~~iTLHVG~ 224 (360) ||.+ .+..|++..=|+ +++++.++++ --.+.|+|+|.|+ T Consensus 123 Yqa~-~~~~~~~~~mt~-----d~~~~~ie~qa~~GVDfmTiH~Gi 162 (421) T pfam01964 123 YQAA-ERKGGSVVDMTE-----DDLFDVIEEQAKDGVDFMTIHAGV 162 (421) T ss_pred HHHH-HHHCCCHHHCCH-----HHHHHHHHHHHHHCCCEEEECCCH T ss_conf 9999-996599454999-----999999999997288778740001 No 120 >TIGR01135 glmS glucosamine--fructose-6-phosphate aminotransferase, isomerizing; InterPro: IPR005855 Glucosamine:fructose-6-phosphate aminotransferase (2.6.1.16 from EC) catalyses the formation of glucosamine 6-phosphate and is the first and rate-limiting enzyme of the hexosamine biosynthetic pathway. The final product of the hexosamine pathway, UDP-N-acetyl glucosamine, is an active precursor of numerous macromolecules containing amino sugars. This family of sequences belong to the MEROPS peptidase family C44 (clan PB(C)), and are classified as non-peptidase homologs.; GO: 0004360 glutamine-fructose-6-phosphate transaminase (isomerizing) activity, 0016051 carbohydrate biosynthetic process, 0005737 cytoplasm. Probab=24.84 E-value=51 Score=14.00 Aligned_cols=26 Identities=27% Similarity=0.470 Sum_probs=16.2 Q ss_pred CEEEEECCCCCCCCEEEEEEHHHHHHHCCCCCE Q ss_conf 313566378887531531014347872698879 Q gi|254780180|r 26 ARLMVVHPNLSCPLVISDHLVSDLPAFLNSNDA 58 (360) Q Consensus 26 SRLLV~~r~~~~~~~i~h~~F~dl~~~L~~gDL 58 (360) +|--|+|+. +|+. |..|=+.|..-+. T Consensus 95 g~iaVVHNG-----IIEN--y~~Lr~eL~~~G~ 120 (628) T TIGR01135 95 GRIAVVHNG-----IIEN--YAELREELEARGH 120 (628) T ss_pred CCEEEEECC-----EECC--CHHHHHHHHHCCC T ss_conf 527999778-----2705--4889989984892 No 121 >PRK06853 indolepyruvate oxidoreductase subunit B; Reviewed Probab=24.81 E-value=51 Score=13.99 Aligned_cols=15 Identities=20% Similarity=0.530 Sum_probs=5.6 Q ss_pred HCCCCCEEEEECCCE Q ss_conf 269887999947956 Q gi|254780180|r 52 FLNSNDAIVFNNTKV 66 (360) Q Consensus 52 ~L~~gDLLV~NnTKV 66 (360) +++||..+|+|++.+ T Consensus 86 ~l~~~~~~ivn~~~~ 100 (197) T PRK06853 86 YLKPGGKVVVNTQPI 100 (197) T ss_pred CCCCCEEEEEECCCC T ss_conf 158885999978876 No 122 >PRK13223 phosphoglycolate phosphatase; Provisional Probab=24.58 E-value=51 Score=13.96 Aligned_cols=87 Identities=13% Similarity=0.168 Sum_probs=51.1 Q ss_pred HHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEE-----------ECCCCCCCC--CCCCCCCCCCCCCHHE Q ss_conf 310134567574005556887899999999972155303787-----------316566578--8741013566770007 Q gi|254780180|r 179 DYQTTYAKIQGSVAAPTAGLHFTSNLLSRLISIGIKVYFVTL-----------HVGAGTFMP--VKVEDTDDHIMHSEIG 245 (360) Q Consensus 179 ~YQTVyA~~~GSVAAPTAGLHFt~~ll~~L~~kGi~~~~iTL-----------HVG~GTF~P--i~~~~i~~h~mH~E~~ 245 (360) .|...|++... . |.=..--.++|+.|+++|+..+=+|= +.|++.|-. |..+++...|-|.|-+ T Consensus 88 ~f~~~y~~~~~-~---~~lyPGV~e~L~~L~~~Gi~laIvTnk~~~~~~~iL~~lgL~~~Fd~VvggDdv~~~KPdP~~l 163 (272) T PRK13223 88 LFMEAYAESHE-L---TVVYPGVRDTLKWLRKQGVEMALITNKPERFVAPLLDQMKIGRYFRWIIGGDTLPQKKPDPAAL 163 (272) T ss_pred HHHHHHHHHCC-C---CCCCCCHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHCCCHHHCCCCCCCCCCCCCCCCHHHH T ss_conf 99999997365-5---6768479999999997899789963981789999998749724146453457699999998999 Q ss_pred --------------EECHHHHHHHHHHHHCCCEEEEEC Q ss_conf --------------785999999998753798099962 Q gi|254780180|r 246 --------------FIDAPTAQALNSVKSRGGRIVSVG 269 (360) Q Consensus 246 --------------~i~~~~~~~I~~ak~~g~rIiAVG 269 (360) -+=.++..-|..||+.|=++|+|- T Consensus 164 l~ale~lGv~P~e~l~VGDS~~DI~AAkaAGi~~VgVt 201 (272) T PRK13223 164 LFVMKMAGVPPSQSLFVGDSRSDVLAAKAAGVQCVALS 201 (272) T ss_pred HHHHHHHCCCHHHEEEECCCHHHHHHHHHCCCCEEEEC T ss_conf 99999959897688888898889999998699589974 No 123 >KOG1876 consensus Probab=24.54 E-value=19 Score=17.14 Aligned_cols=10 Identities=40% Similarity=0.936 Sum_probs=4.6 Q ss_pred EEEEEECCCC Q ss_conf 3125527755 Q gi|254780180|r 306 VDILMSNFHL 315 (360) Q Consensus 306 vd~LiTNFH~ 315 (360) ++-||||||. T Consensus 114 IsfLItn~ht 123 (168) T KOG1876 114 ISFLITNFHT 123 (168) T ss_pred HHHHHHCCCH T ss_conf 1365432646 No 124 >PRK12615 galactose-6-phosphate isomerase subunit LacB; Reviewed Probab=24.40 E-value=47 Score=14.26 Aligned_cols=69 Identities=13% Similarity=0.040 Sum_probs=43.3 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEEEC Q ss_conf 40055568878999999999721553037873165665788741013566770007785999999998753798099962 Q gi|254780180|r 190 SVAAPTAGLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVSVG 269 (360) Q Consensus 190 SVAAPTAGLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiAVG 269 (360) ++++=-||...-+.+.+.|+++|+++.. +||+-.-.++ |-.....++++|.+-+.. .=|+-.| T Consensus 4 ~IgsDHaG~~LK~~i~~~L~~~G~eV~D------~Gt~~~~svD----------YPd~a~~Va~~V~~~~~~-~GIliCG 66 (171) T PRK12615 4 AIGCDHIVTNEKMAVSDFLKSKGYDVID------CGTYDHTRTH----------YPIFGKKVGEAVVNGQAD-LGVCICG 66 (171) T ss_pred EEEECCHHHHHHHHHHHHHHHCCCEEEE------CCCCCCCCCC----------CHHHHHHHHHHHHCCCCC-EEEEEEC T ss_conf 9996861599999999999988997897------7999988788----------269999999999769976-4699817 Q ss_pred HHHHHH Q ss_conf 878998 Q gi|254780180|r 270 TTSLRL 275 (360) Q Consensus 270 TT~~R~ 275 (360) |-.==+ T Consensus 67 TGiGms 72 (171) T PRK12615 67 TGVGIN 72 (171) T ss_pred CCHHHH T ss_conf 857699 No 125 >PRK13228 consensus Probab=24.26 E-value=52 Score=13.92 Aligned_cols=73 Identities=15% Similarity=0.143 Sum_probs=38.7 Q ss_pred CHHHHHHHHHHCCCCEEEEE-----------ECCCCC-CCC-CCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEE Q ss_conf 99999999972155303787-----------316566-578-87410135667700077859999999987537980999 Q gi|254780180|r 201 TSNLLSRLISIGIKVYFVTL-----------HVGAGT-FMP-VKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVS 267 (360) Q Consensus 201 t~~ll~~L~~kGi~~~~iTL-----------HVG~GT-F~P-i~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiA 267 (360) ..++|+.|+++|+.++=+|= +.|+.- |.- |..+++...|-+.|-|. .+++.+ ...-..++. T Consensus 101 ~~e~L~~L~~~g~~l~ivT~~~~~~~~~~l~~~gl~~~Fd~iv~~d~v~~~KP~P~~~l---~a~~~l---g~~p~~~v~ 174 (232) T PRK13228 101 VEAGLALLKALGYRLACITNKPEILAVPLLKQTGLADYFELILGGDSLPEKKPDPLPLL---HACEVL---GIDPANMVL 174 (232) T ss_pred HHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHH---HHHHHH---CCCCCCEEE T ss_conf 99999999977996899808968999998753023333105530433678898799999---999992---979340898 Q ss_pred ECHHHHHHHHHHH Q ss_conf 6287899899998 Q gi|254780180|r 268 VGTTSLRLLETAT 280 (360) Q Consensus 268 VGTT~~R~LEs~~ 280 (360) ||=|. .=+++|. T Consensus 175 VgDs~-~di~aA~ 186 (232) T PRK13228 175 VGDSA-NDVIAAR 186 (232) T ss_pred ECCCH-HHHHHHH T ss_conf 84888-7999999 No 126 >PRK10762 D-ribose transporter ATP binding protein; Provisional Probab=24.25 E-value=45 Score=14.34 Aligned_cols=31 Identities=23% Similarity=0.361 Sum_probs=20.6 Q ss_pred CCCCCCCCCH-----HHHHHHHHHCCCCEEEEEECC Q ss_conf 5556887899-----999999972155303787316 Q gi|254780180|r 193 APTAGLHFTS-----NLLSRLISIGIKVYFVTLHVG 223 (360) Q Consensus 193 APTAGLHFt~-----~ll~~L~~kGi~~~~iTLHVG 223 (360) =|||||--.. +++.+|+++|+.+..||.+.. T Consensus 167 EPt~~Ld~~~~~~l~~~l~~l~~~g~til~isH~l~ 202 (501) T PRK10762 167 EPTDALTDTETESLFRVIRELKSQGRGIVYISHRMK 202 (501) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCHH T ss_conf 875578878889999988787744841477514257 No 127 >cd06246 M14_CPB2 Peptidase M14 Carboxypeptidase (CP) B2 (CPB2, also known as plasma carboxypeptidase B, carboxypeptidase U, and CPU), belongs to the carboxpeptidase A/B subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPB2 enzyme displays B-like activity; it only cleaves the basic residues lysine or arginine. It is produced and secreted by the liver as the inactive precursor, procarboxypeptidase U or PCPB2, commonly referred to as thrombin-activatable fibrinolysis inhibitor (TAFI). It circulates in plasma as a zymogen bound to plasminogen, and the active enzyme, TAFIa, inhibits fibrinolysis. It is highly regulated, increased TAFI concentrations are thought to increase the risk of thrombosis and coronary artery disease by reducing fibrinolytic activity whil Probab=24.15 E-value=29 Score=15.71 Aligned_cols=103 Identities=18% Similarity=0.317 Sum_probs=62.1 Q ss_pred CEEEEEECCCCCCC-CCC--CCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 30378731656657-887--410135667700077859999999987537980999628789989999852898468887 Q gi|254780180|r 215 VYFVTLHVGAGTFM-PVK--VEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVSVGTTSLRLLETATTEDGIINPWSG 291 (360) Q Consensus 215 ~~~iTLHVG~GTF~-Pi~--~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiAVGTT~~R~LEs~~~~~~~~~~~~g 291 (360) .++|++|-...-+. |-. .+...+ |.|...|..+.+++|..+ .+....||+.+ |..+...|.-..|.- T Consensus 185 ~~~l~~Hsygq~il~P~gyt~~~~~~---~~~l~~~a~~~~~ai~~~---~g~~Y~~G~~~----~~lY~asG~s~Dw~y 254 (300) T cd06246 185 KAYISMHSYSQMILFPYSYTRSKSKD---HEELSLLAKEAVRAIRRT---SNNRYTHGSGA----ETIYLAPGGSDDWAY 254 (300) T ss_pred EEEEEECCCCCEEEECCCCCCCCCCC---HHHHHHHHHHHHHHHHHH---HCCCEEECCCC----CEEECCCCCHHHHHH T ss_conf 89999625554799566688889989---999999999999999997---68923877752----543435785888887 Q ss_pred CCCEEECCCCCCEEEEEEEECCCCCHHHHHHHH-HHHHC Q ss_conf 223665740961143125527755457899999-98719 Q gi|254780180|r 292 FTNIFITPGYRFRAVDILMSNFHLPKSTLLMLV-SAFCG 329 (360) Q Consensus 292 ~T~lfI~Pg~~f~~vd~LiTNFH~P~Stll~Lv-~Af~G 329 (360) ++-|.-.|.+.+-|.=-.+|.+|.+-..=.. .+|+| T Consensus 255 --~~gi~~syt~ELrd~g~~GF~lP~~~I~p~~~E~~~g 291 (300) T cd06246 255 --DLGIKYSFTFELRDTGTYGFLLPESYIKPTCSEALAA 291 (300) T ss_pred --HCCCEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf --3798299999958898899849989906789999999 No 128 >PRK08636 aspartate aminotransferase; Provisional Probab=24.12 E-value=52 Score=13.90 Aligned_cols=43 Identities=14% Similarity=0.233 Sum_probs=25.7 Q ss_pred CCEEECCCCCCEE--EEEEEECCCCCHHHHHHHHHHHHCHHHHHHHHH Q ss_conf 2366574096114--312552775545789999998719899999999 Q gi|254780180|r 293 TNIFITPGYRFRA--VDILMSNFHLPKSTLLMLVSAFCGIEETKKMYQ 338 (360) Q Consensus 293 T~lfI~Pg~~f~~--vd~LiTNFH~P~Stll~Lv~Af~G~~~~~~~Y~ 338 (360) +.+.+.||..|.. -+.+=-+|=.|... |..|+--.+.+++-|+ T Consensus 358 ~gV~v~PG~~Fg~~g~~~vRisf~~~~e~---l~ea~~Rl~~~l~e~~ 402 (403) T PRK08636 358 AKVAVSPGIGFGEYGDEYVRIALIENENR---IRQAARNIKKFLKEFE 402 (403) T ss_pred CCEEEECCHHHCCCCCCEEEEEECCCHHH---HHHHHHHHHHHHHHHC T ss_conf 97999887532889998799997189999---9999999999999844 No 129 >TIGR00718 sda_alpha L-serine dehydratase, iron-sulfur-dependent, alpha subunit; InterPro: IPR004642 L-serine dehydratase converts serine into pyruvate in the gluconeogenesis pathway from serine. This model describes the alpha chain of an iron-sulphur-dependent L-serine dehydratase, found in Bacillus subtilis. A fairly deep split in a UPGMA tree separates members of this family of alpha chains from the homologous region of single chain forms such as found in Escherichia coli. This family of enzymes is not homologous to the pyridoxal phosphate-dependent threonine deaminases and eukaryotic serine deaminases. Synonym(s): Serine deaminase, L-hydroxyaminoacid dehydratase ; GO: 0003941 L-serine ammonia-lyase activity, 0006094 gluconeogenesis. Probab=24.11 E-value=27 Score=16.00 Aligned_cols=15 Identities=53% Similarity=0.589 Sum_probs=9.5 Q ss_pred CCCCCCCCCCCCCCC Q ss_conf 345675740055568 Q gi|254780180|r 183 TYAKIQGSVAAPTAG 197 (360) Q Consensus 183 VyA~~~GSVAAPTAG 197 (360) |=|.-.-=||+|||| T Consensus 103 VNA~MG~I~A~PTAG 117 (301) T TIGR00718 103 VNAAMGKIVAAPTAG 117 (301) T ss_pred HHHHCCCEECCCCCC T ss_conf 655307555268887 No 130 >PRK06225 aspartate aminotransferase; Provisional Probab=23.99 E-value=53 Score=13.89 Aligned_cols=17 Identities=24% Similarity=0.239 Sum_probs=11.3 Q ss_pred HHHHHHHCCCCCEEEEE Q ss_conf 43478726988799994 Q gi|254780180|r 46 VSDLPAFLNSNDAIVFN 62 (360) Q Consensus 46 F~dl~~~L~~gDLLV~N 62 (360) +.-+.-+++|||-+++- T Consensus 92 ~~~~~a~~~pGD~vl~~ 108 (375) T PRK06225 92 YLVIRAFLSPGTNTVTS 108 (375) T ss_pred HHHHHHHCCCCCEEEEC T ss_conf 99999977998989984 No 131 >TIGR02238 recomb_DMC1 meiotic recombinase Dmc1; InterPro: IPR011940 This entry describes DMC1, a subfamily of a larger family of DNA repair and recombination proteins. It is eukaryotic only and most closely related to eukaryotic RAD51. It also resembles archaeal RadA (IPR011938 from INTERPRO) and RadB (IPR011939 from INTERPRO) and bacterial RecA. It has been characterised for human as a recombinase active only in meiosis .; GO: 0003677 DNA binding, 0005524 ATP binding, 0008094 DNA-dependent ATPase activity, 0007131 meiotic recombination, 0005634 nucleus. Probab=23.64 E-value=36 Score=15.08 Aligned_cols=85 Identities=18% Similarity=0.252 Sum_probs=47.9 Q ss_pred HHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHH-HHHHHHHHHHCCCEEEEECHHHHHHHHHHHH Q ss_conf 999999972155303787316566578874101356677000778599-9999998753798099962878998999985 Q gi|254780180|r 203 NLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAP-TAQALNSVKSRGGRIVSVGTTSLRLLETATT 281 (360) Q Consensus 203 ~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~-~~~~I~~ak~~g~rIiAVGTT~~R~LEs~~~ 281 (360) +|=..+.--+=+++||- -=|||+|-|...|. |.|.++.+ +.+.|.-||+ ..+-.. T Consensus 121 QLP~~~~g~~GKv~yID---TEgTFRPdri~~IA------erf~~D~~~~L~Ni~yaRA---------------~~~Eh~ 176 (314) T TIGR02238 121 QLPREMGGGEGKVAYID---TEGTFRPDRIKAIA------ERFGVDGEAVLDNILYARA---------------YTSEHQ 176 (314) T ss_pred HCCCCCCCCCCEEEEEE---CCCCCCHHHHHHHH------HHCCCCHHHHHHHHHHHHH---------------HCHHHH T ss_conf 06312579984289981---78885817899887------5407898899988898875---------------042789 Q ss_pred CCCCCCCCCCCCCEEECC-CCCCEEEEEEEECCCC Q ss_conf 289846888722366574-0961143125527755 Q gi|254780180|r 282 EDGIINPWSGFTNIFITP-GYRFRAVDILMSNFHL 315 (360) Q Consensus 282 ~~~~~~~~~g~T~lfI~P-g~~f~~vd~LiTNFH~ 315 (360) ..-.......|--- .|+.-+||-.+.+|-. T Consensus 177 ----~~l~~~~~~~~~e~~~f~llivDSI~alFRV 207 (314) T TIGR02238 177 ----MELLEKLAAKFAEEGDFRLLIVDSIMALFRV 207 (314) T ss_pred ----HHHHHHHHHHHHCCCCEEEEEEHHHHHHHCC T ss_conf ----9999999887522687168864202554131 No 132 >pfam00877 NLPC_P60 NlpC/P60 family. The function of this domain is unknown. It is found in several lipoproteins. Probab=23.54 E-value=54 Score=13.83 Aligned_cols=14 Identities=14% Similarity=0.223 Sum_probs=8.8 Q ss_pred HHCCCCCEEEEECC Q ss_conf 72698879999479 Q gi|254780180|r 51 AFLNSNDAIVFNNT 64 (360) Q Consensus 51 ~~L~~gDLLV~NnT 64 (360) +-++||||+.|+++ T Consensus 49 ~~~~~GDLvFf~~~ 62 (105) T pfam00877 49 SEPQRGDLVFFGTG 62 (105) T ss_pred HHCCCCCEEEECCC T ss_conf 99898788997789 No 133 >cd00733 GlyRS_alpha_core Class II Glycyl-tRNA synthetase (GlyRS) alpha subunit core catalytic domain. GlyRS functions as a homodimer in eukaryotes, archaea and some bacteria and as a heterotetramer in the remainder of prokaryotes and in arabidopsis. It is responsible for the attachment of glycine to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. This alignment contains only sequences from the GlyRS form which heterotetramerizes. The homodimer form of GlyRS is in a different family of class II aaRS. Class II assignment is based upon structure and the presence of three characteristic sequence motifs. Probab=23.51 E-value=25 Score=16.24 Aligned_cols=25 Identities=28% Similarity=0.663 Sum_probs=15.9 Q ss_pred EEECCCCCCCHH-HHHHCCCCCCCCC Q ss_conf 981124866101-1221156436985 Q gi|254780180|r 89 TLHMRVSPNSWS-VYARPSKVIKKGD 113 (360) Q Consensus 89 ll~~~~~~~~w~-~l~kp~kklk~G~ 113 (360) ++++-+.+..|. |.+.|++|...|. T Consensus 37 T~lr~lg~~pw~~aYvqPsrRP~DgR 62 (279) T cd00733 37 TFLRALGPEPWNVAYVEPSRRPTDGR 62 (279) T ss_pred HHHHHCCCCCCEEEEECCCCCCCCCC T ss_conf 98863199863155424677999987 No 134 >pfam06962 rRNA_methylase Putative rRNA methylase. This family contains a number of putative rRNA methylases. Note that many family members are hypothetical proteins. Probab=23.50 E-value=54 Score=13.82 Aligned_cols=59 Identities=14% Similarity=0.257 Sum_probs=30.9 Q ss_pred EEEEEHHHHHHHCCCC--CEEEEECCCEEEEEEEEEEECCCCCCCCEEEEEEECCCCCCCHHHHHHCCCCCCCCCEEEE Q ss_conf 5310143478726988--7999947956888998764046777765058998112486610112211564369857997 Q gi|254780180|r 41 ISDHLVSDLPAFLNSN--DAIVFNNTKVITAQLNGVRFCHINRREKEISCTLHMRVSPNSWSVYARPSKVIKKGDILHF 117 (360) Q Consensus 41 i~h~~F~dl~~~L~~g--DLLV~NnTKVipARL~g~k~~~~~~~g~~ieill~~~~~~~~w~~l~kp~kklk~G~~l~~ 117 (360) +.+..-.++.+|++++ |+.|||= |- . . ||.=.+. -..+.+-.++-+....+++|-.+.+ T Consensus 30 li~~~He~l~~~v~~~~i~~~~FNL---------GY----L-P-ggDk~i~---T~~~tTi~Al~~al~lL~~gG~i~i 90 (140) T pfam06962 30 LILDSHENIDEYIPEGPVKAAIFNL---------GY----L-P-GGDKSIT---TKPDTTLEAIKKLLELLKPGGLIIL 90 (140) T ss_pred EEECCHHHHHHHCCCCCCCEEEEEC---------CC----C-C-CCCCCCE---ECCHHHHHHHHHHHHHHCCCCEEEE T ss_conf 9977988898637646766899966---------76----8-8-9998758---7772099999999996366989999 No 135 >cd03218 ABC_YhbG The ABC transporters belonging to the YhbG family are similar to members of the Mj1267_LivG family, which is involved in the transport of branched-chain amino acids. The genes yhbG and yhbN are located in a single operon and may function together in cell envelope during biogenesis. YhbG is the putative ATP-binding cassette component and YhbN is the putative periplasmic-binding protein. Depletion of each gene product leads to growth arrest, irreversible cell damage and loss of viability in E. coli. The YhbG homolog (NtrA) is essential in Rhizobium meliloti, a symbiotic nitrogen-fixing bacterium. Probab=23.40 E-value=48 Score=14.17 Aligned_cols=55 Identities=24% Similarity=0.373 Sum_probs=34.1 Q ss_pred CCCCCCCCC-----HHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEE Q ss_conf 555688789-----999999997215530378731656657887410135667700077859999999987537980999 Q gi|254780180|r 193 APTAGLHFT-----SNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVS 267 (360) Q Consensus 193 APTAGLHFt-----~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiA 267 (360) =|||||--. .+++.+|+++|+.+..+|..+..- +.+.|+ +.| -..|||+| T Consensus 159 EPt~GLDp~~~~~i~~~i~~l~~~g~tili~tH~l~~~-------~~~~dr------v~v------------l~~G~ii~ 213 (232) T cd03218 159 EPFAGVDPIAVQDIQKIIKILKDRGIGVLITDHNVRET-------LSITDR------AYI------------IYEGKVLA 213 (232) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCHHHH-------HHHCCE------EEE------------EECCEEEE T ss_conf 98568899999999999999995899999992839999-------986999------999------------97998999 Q ss_pred ECHHH Q ss_conf 62878 Q gi|254780180|r 268 VGTTS 272 (360) Q Consensus 268 VGTT~ 272 (360) -||.- T Consensus 214 ~G~~~ 218 (232) T cd03218 214 EGTPE 218 (232) T ss_pred ECCHH T ss_conf 85899 No 136 >PRK12898 secA preprotein translocase subunit SecA; Reviewed Probab=23.33 E-value=54 Score=13.80 Aligned_cols=56 Identities=20% Similarity=0.118 Sum_probs=37.9 Q ss_pred CCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEEECHHHHHHHHHH Q ss_conf 1656657887410135667700077859999999987537980999628789989999 Q gi|254780180|r 222 VGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVSVGTTSLRLLETA 279 (360) Q Consensus 222 VG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiAVGTT~~R~LEs~ 279 (360) |-+-|-+|+.-.|..|..--++.-- -...++.|.+..+.|.+| =|||+|+..=|-. T Consensus 449 v~IPTnrP~~R~D~~D~vy~t~~~K-~~Aii~ei~~~~~~GqPV-LVGT~SVe~SE~l 504 (673) T PRK12898 449 VRIPTNRPSQRIVLPTEVFLTLAAK-WAAVAARVRELHASGRPV-LVGTRSVAASERL 504 (673) T ss_pred EECCCCCCCCCEECCCEEECCHHHH-HHHHHHHHHHHHHCCCCE-EEEECCHHHHHHH T ss_conf 9868999730123798585699999-999999999998679988-9983758999999 No 137 >COG0149 TpiA Triosephosphate isomerase [Carbohydrate transport and metabolism] Probab=23.16 E-value=43 Score=14.51 Aligned_cols=63 Identities=29% Similarity=0.395 Sum_probs=38.2 Q ss_pred CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHE----EECHHHHHHHH Q ss_conf 01345675740055568878999999999721553037873165665788741013566770007----78599999999 Q gi|254780180|r 181 QTTYAKIQGSVAAPTAGLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIG----FIDAPTAQALN 256 (360) Q Consensus 181 QTVyA~~~GSVAAPTAGLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~----~i~~~~~~~I~ 256 (360) |.++....||--- |.+.++|.++ |++..-| | |||+- +.++...+++. T Consensus 65 Qn~~~~~~GA~TG-----eiS~~mL~d~---G~~~vii----G-----------------HSERR~~~~E~d~~i~~K~~ 115 (251) T COG0149 65 QNVDPEDSGAFTG-----EISAEMLKDL---GAKYVLI----G-----------------HSERRLYFGETDELIAKKVK 115 (251) T ss_pred CCCCCCCCCCCCC-----CCCHHHHHHC---CCCEEEE----C-----------------CCCCCCCCCCCHHHHHHHHH T ss_conf 6688655778668-----3889999986---9988997----8-----------------50112435634699999999 Q ss_pred HHHHCCC-EEEEECHHH Q ss_conf 8753798-099962878 Q gi|254780180|r 257 SVKSRGG-RIVSVGTTS 272 (360) Q Consensus 257 ~ak~~g~-rIiAVGTT~ 272 (360) .+++.|= .|+|||-|- T Consensus 116 aa~~~Gl~pIlCvGEtl 132 (251) T COG0149 116 AAKEAGLTPILCVGETL 132 (251) T ss_pred HHHHCCCEEEEECCCCH T ss_conf 99988996899858977 No 138 >TIGR00963 secA preprotein translocase, SecA subunit; InterPro: IPR000185 Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component. . From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins. The translocase protein subunits are encoded on the bacterial chromosome. The translocase itself comprises 7 proteins, including a chaperone protein (SecB), an ATPase (SecA), an integral membrane complex (SecCY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) . The chaperone protein SecB is a highly acidic homotetrameric protein that exists as a "dimer of dimers" in the bacterial cytoplasm. SecB maintains preproteins in an unfolded state after translation, and targets these to the peripheral membrane protein ATPase SecA for secretion . SecA is a cytoplasmic protein of 800 to 960 amino acid residues. Homologs of secA are also encoded in the chloroplast genome of some algae as well as in the nuclear genome of plants . It could be involved in the intraorganellar protein transport into thylakoids.; GO: 0005524 ATP binding, 0006605 protein targeting, 0006886 intracellular protein transport. Probab=22.88 E-value=55 Score=13.74 Aligned_cols=57 Identities=25% Similarity=0.268 Sum_probs=43.4 Q ss_pred CCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHH Q ss_conf 16566578874101356677000778599999999875379809996287899899998 Q gi|254780180|r 222 VGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVSVGTTSLRLLETAT 280 (360) Q Consensus 222 VG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiAVGTT~~R~LEs~~ 280 (360) |-.=|++|+.=.|-.|-.=-+|+.=- .+.++.|.+..+.|++| =||||||=.=|... T Consensus 396 v~vPTNrp~~R~D~~DlvY~te~~Kw-~Av~~e~~~~h~~GqPv-LvGT~svE~SE~LS 452 (904) T TIGR00963 396 VVVPTNRPVIRKDLPDLVYKTEEEKW-KAVVDEIKEIHAKGQPV-LVGTTSVEKSELLS 452 (904) T ss_pred EECCCCCCCCCCCCCCCEECCHHHHH-HHHHHHHHHHHHCCCCE-EEEECCHHHHHHHH T ss_conf 55166772344677773464768899-99999999987468987-77522177789999 No 139 >cd03270 ABC_UvrA_I The excision repair protein UvrA domain I; Nucleotide excision repair in eubacteria is a process that repairs DNA damage by the removal of a 12-13-mer oligonucleotide containing the lesion. Recognition and cleavage of the damaged DNA is a multistep ATP-dependent reaction that requires the UvrA, UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases, with UvrA having two ATP binding sites, which have the characteristic signature of the family of ABC proteins, and UvrB having one ATP binding site that is structurally related to that of helicases. Probab=22.87 E-value=55 Score=13.74 Aligned_cols=32 Identities=31% Similarity=0.436 Sum_probs=23.8 Q ss_pred CCCCCCCCCCH-----HHHHHHHHHCCCCEEEEEECC Q ss_conf 05556887899-----999999972155303787316 Q gi|254780180|r 192 AAPTAGLHFTS-----NLLSRLISIGIKVYFVTLHVG 223 (360) Q Consensus 192 AAPTAGLHFt~-----~ll~~L~~kGi~~~~iTLHVG 223 (360) ==|||||.-.. ++|.+|+++|..+..||.+.. T Consensus 164 DEPTsgLD~~~~~~i~~~i~~l~~~G~Tvi~VsHd~~ 200 (226) T cd03270 164 DEPSIGLHPRDNDRLIETLKRLRDLGNTVLVVEHDED 200 (226) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEECHH T ss_conf 2873337989999999999999976998999972578 No 140 >TIGR03019 pepcterm_femAB FemAB-related protein, PEP-CTERM system-associated. Members of this protein family are found always as part of extended exopolysaccharide biosynthesis loci in bacteria. In nearly every case, these loci contain determinants for the processing of the PEP-CTERM proposed C-terminal protein sorting signal. This family shows remote, local sequence similarity to the FemAB protein family (see pfam02388), whose members Probab=22.76 E-value=55 Score=13.72 Aligned_cols=19 Identities=16% Similarity=0.559 Sum_probs=9.8 Q ss_pred HHHHHHHHCCCEEECCCCH Q ss_conf 9999999869826110102 Q gi|254780180|r 335 KMYQHAISHAYRFYSYGDT 353 (360) Q Consensus 335 ~~Y~~Ai~~~yrF~syGDa 353 (360) ++-++|.++||+.|.||-| T Consensus 243 ~~i~~A~e~G~~~fDFGrS 261 (330) T TIGR03019 243 ELMRRACERGLRVFDFGRS 261 (330) T ss_pred HHHHHHHHCCCCEEECCCC T ss_conf 9999999879968837999 No 141 >PRK12903 secA preprotein translocase subunit SecA; Reviewed Probab=22.66 E-value=56 Score=13.71 Aligned_cols=169 Identities=19% Similarity=0.266 Sum_probs=85.4 Q ss_pred CHHHHHHHHHCCCCCHHHCCCCCCCCCCHHHHCCCCCC---CCCCCCCCCCCCCCCH-HHHHHHHHHCCCCEEEEEECCC Q ss_conf 30002544203578522212345652114331013456---7574005556887899-9999999721553037873165 Q gi|254780180|r 149 ELERQISLVGTIPLPPYIARKRPIDARDYVDYQTTYAK---IQGSVAAPTAGLHFTS-NLLSRLISIGIKVYFVTLHVGA 224 (360) Q Consensus 149 ~~~~~L~~~G~iPLPPYI~r~r~~~~~D~~~YQTVyA~---~~GSVAAPTAGLHFt~-~ll~~L~~kGi~~~~iTLHVG~ 224 (360) .+-+.++.--.+|+-+= ......--||.-|-. -.|- --|| -|+ +-+.++= |..+ |-. T Consensus 329 GLhQaiEaKE~v~i~~e------~~t~a~iT~Qn~Fr~y~kl~Gm--TGTa---~te~~Ef~~iY--~l~v------~~i 389 (885) T PRK12903 329 GLQQAIQAKEMVEIEPE------TKTLATITYQNFFRLFKKLSGM--TGTA---KTEEQEFIDIY--NMRV------NVV 389 (885) T ss_pred HHHHHHHHHCCCCCCCC------CCEEEEEEHHHHHHHCCHHCCC--CCCC---HHHHHHHHHHH--CCEE------EEC T ss_conf 68999998728877988------7313663099898646042267--6777---88999999986--9736------981 Q ss_pred CCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHHH---CCC----CCCCC-------- Q ss_conf 665788741013566770007785999999998753798099962878998999985---289----84688-------- Q gi|254780180|r 225 GTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVSVGTTSLRLLETATT---EDG----IINPW-------- 289 (360) Q Consensus 225 GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiAVGTT~~R~LEs~~~---~~~----~~~~~-------- 289 (360) -|.+|+.-.+..+..--++.- =-...++.|.+..+.|++| =|||+|+..=|-... ..| ++.+- T Consensus 390 Ptn~p~~R~d~~d~vy~t~~~-k~~av~~ei~~~~~~gqPv-Lvgt~sve~Se~ls~~L~~~~i~h~vLNAk~~~~EA~I 467 (885) T PRK12903 390 PTNKPVIRKDEPDSIFGTKHA-KWKAVVKEVKRVHEKGQPI-LIGTAQVEDSEQLHEYLLEANIPHTVLNAKQHAREAEI 467 (885) T ss_pred CCCCCCEEECCCCHHHCCHHH-HHHHHHHHHHHHHHCCCCE-EEECCCHHHHHHHHHHHHHCCCCCEEEECCCHHHHHHH T ss_conf 899985214283231078999-9999999999998579988-99568479999999999856877035546865889999 Q ss_pred ---C-------------C-CCCEEECCCC-CCEEEEEEEECCCC-----------------CHHHHHH------HHHHHH Q ss_conf ---8-------------7-2236657409-61143125527755-----------------4578999------999871 Q gi|254780180|r 290 ---S-------------G-FTNIFITPGY-RFRAVDILMSNFHL-----------------PKSTLLM------LVSAFC 328 (360) Q Consensus 290 ---~-------------g-~T~lfI~Pg~-~f~~vd~LiTNFH~-----------------P~Stll~------Lv~Af~ 328 (360) . | =|||-+-++- +-.--.-+-|.-|. |.||-+. |.--|+ T Consensus 468 ia~AG~~g~VTiATNMAGRGTDI~Lg~~v~~~GGL~VigterhesrRiD~QLrGR~gRQGDpG~s~f~~SleD~lmr~F~ 547 (885) T PRK12903 468 IAKAGQKGAITIATNMAGRGTDIKLSKGVLELGGLYVLGTEKAESRRIDNQLRGRSGRQGDVGESRFFISLDDQLFRRFA 547 (885) T ss_pred HHHCCCCCCEEEECCCCCCCCCCCCCHHHHHCCCEEEEECCCCCCHHHHHHHCCCCCCCCCCCCEEEEEECCHHHHHHHC T ss_conf 99548999189837667899685741658972985898426562045888860743468999735999972648999745 Q ss_pred CHHHHHHHHH Q ss_conf 9899999999 Q gi|254780180|r 329 GIEETKKMYQ 338 (360) Q Consensus 329 G~~~~~~~Y~ 338 (360) |.+.|++.+. T Consensus 548 ~~~~~~~~~~ 557 (885) T PRK12903 548 NFDKIKEAFK 557 (885) T ss_pred CHHHHHHHHH T ss_conf 7999998861 No 142 >PRK11907 bifunctional 2',3'-cyclic nucleotide 2'-phosphodiesterase/3'-nucleotidase precursor protein; Reviewed Probab=22.63 E-value=56 Score=13.71 Aligned_cols=43 Identities=23% Similarity=0.264 Sum_probs=29.0 Q ss_pred ECHHHHHHHHHHHHCCCCCCCCCCCCCEEE---CCCCCCEEEEEEE Q ss_conf 628789989999852898468887223665---7409611431255 Q gi|254780180|r 268 VGTTSLRLLETATTEDGIINPWSGFTNIFI---TPGYRFRAVDILM 310 (360) Q Consensus 268 VGTT~~R~LEs~~~~~~~~~~~~g~T~lfI---~Pg~~f~~vd~Li 310 (360) -|--+-.-||-++..-+.+.|.....+-.| +|+|.|-++||+- T Consensus 540 TGaqlkeWLE~SAg~FnQIdP~~~~~Q~Lin~~fp~YNFDVIdGvt 585 (810) T PRK11907 540 TGAQLKEWLEMSAGQFNQIDPNNKEPQNLINTDYRTYNFDVIDGLT 585 (810) T ss_pred CHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE T ss_conf 4799999999745400312889988500038899876654135814 No 143 >PRK12899 secA preprotein translocase subunit SecA; Reviewed Probab=22.63 E-value=56 Score=13.71 Aligned_cols=79 Identities=24% Similarity=0.386 Sum_probs=50.6 Q ss_pred CCCCCCCCHHHHHHHHHH-CCCCE-------EEEEE-------------------------------CCCCCCCCCCCCC Q ss_conf 556887899999999972-15530-------37873-------------------------------1656657887410 Q gi|254780180|r 194 PTAGLHFTSNLLSRLISI-GIKVY-------FVTLH-------------------------------VGAGTFMPVKVED 234 (360) Q Consensus 194 PTAGLHFt~~ll~~L~~k-Gi~~~-------~iTLH-------------------------------VG~GTF~Pi~~~~ 234 (360) +..|..|++.|=+.+++| ||++. .||+. |-.-|-+|+.-.| T Consensus 462 ~m~gRr~sdGLHQAIEAKEgV~I~~e~qTlAsIT~QnfFR~Y~kLsGMTGTA~tea~Ef~~IY~L~Vv~IPTNrP~~R~D 541 (969) T PRK12899 462 PQPGRRFSEGLHQAIEAKEHVTIRKESQTFATVTLQNFFRLYEKLAGMTGTAITESREFKEIYNLYVLQVPTFKPCLRID 541 (969) T ss_pred CCCCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCHHHCCCCCCCHHHHHHHHHHHCCCEEEECCCCCCEEEC T ss_conf 58898455218899998738986767613655369999830504306865747689999998688359967999821232 Q ss_pred CCCCCCCCHHEEECHH-----HHHHHHHHHHCCCEEEEECHHHHHHHHHH Q ss_conf 1356677000778599-----99999987537980999628789989999 Q gi|254780180|r 235 TDDHIMHSEIGFIDAP-----TAQALNSVKSRGGRIVSVGTTSLRLLETA 279 (360) Q Consensus 235 i~~h~mH~E~~~i~~~-----~~~~I~~ak~~g~rIiAVGTT~~R~LEs~ 279 (360) ..|. ++.+.+ .++.|.+..+.|++| =|||||+..=|-. T Consensus 542 ~~D~------vy~T~~~K~~Avi~eI~~~h~~GqPV-LVGT~SVe~SE~l 584 (969) T PRK12899 542 HNDE------FYMTEREKYHAIVNEIASIHRSGNPI-LIGTESVEVSEKL 584 (969) T ss_pred CCCE------EECCHHHHHHHHHHHHHHHHHCCCCE-EEEECCHHHHHHH T ss_conf 8883------85289999999999999999579988-9983857999999 No 144 >TIGR00034 aroFGH phospho-2-dehydro-3-deoxyheptonate aldolase; InterPro: IPR006219 Members of this group catalyze the first enzymatic reaction of the shikimate pathway. The common (shikimate) pathway links metabolism of carbohydrates to biosynthesis of aromatic amino acids phenylalanine, tyrosine, tryptophan, and derivatives in microorganisms and in plants. In a sequence of seven enzymatic reactions, D-erythrose 4-phosphate (E4P), an intermediate of the pentose phosphate pathway, and phosphoenol pyruvate (PEP), a glycolytic intermediate, are converted to chorismate. The pathway begins with the stereospecific condensation of E4P and PEP to yield 7-phospho 2-dehydro 3-deoxy-D-arabino-heptulosonate (DAHP), catalyzed by 3-deoxy-7-phosphoheptulonate synthase (DAHPS) (EC 2.5.1.54, formerly EC 4.1.2.15). The divalent metal cation requirement of this enzyme can be satisfied by a broad range of metals . A Cys residue in a Cys-X-X-His motif has been identified as part of a metal binding site . In Escherichia coli, the enzyme exists in three isoforms, each specifically inhibited by one of the three aromatic amino acids. DAHP synthetases fall into two classes, class I (represented by this entry) and class II (represented by PIRSF015573 from PIRSF). Class I was believed to be limited to microorganisms and class II to plants. However, a more recent study showed that class II also contains enzymes from a microbial eukaryote and several bacteria . Brick and Woodard proposed that the difference between the two classes lies in their metal ion requirement for activity. Whereas class I requires no metal cation, class II is dependent on a metal cation for activity. However, recently a class I DAHP synthase from Thermotoga maritima has been purified, characterised, and shown to be a metalloenzyme . The three-dimentional structures of DAHP synthases have been determined , , , , . The DAHPS(Phe) monomer is a (beta/alpha)8 barrel with an additional N-terminal beta strand and helices and an extra beta sheet near the C terminus . The active site is located in a cleft at the carboxyl end of the barrel . The allosteric feedback inhibition binding site of DAHPS(Phe) is composed of residues from two adjacent subunits of a tight dimer and is at least 20 angstroms away from the closest active site . The absence of the shikimate pathway in animals makes it an attractive target for nontoxic herbicidal, antimicrobial, and antifungal agents. The nontoxic herbicide glyphosphate competitively inhibits 3-phosphoshikimate 1-carboxyvinyltransferase, the sixth enzymatic reaction of the pathway.; GO: 0003849 3-deoxy-7-phosphoheptulonate synthase activity, 0009073 aromatic amino acid family biosynthetic process. Probab=22.62 E-value=56 Score=13.70 Aligned_cols=63 Identities=29% Similarity=0.306 Sum_probs=42.8 Q ss_pred CCCC--CCCC--CHHHHCCCCCCC---------C---------CCCEEEEECCCCCCCCEEEE--------EEHHHHHHH Q ss_conf 6335--8888--977861699878---------1---------20313566378887531531--------014347872 Q gi|254780180|r 3 VKEF--DFDL--PPSRIALRPVFP---------R---------DSARLMVVHPNLSCPLVISD--------HLVSDLPAF 52 (360) Q Consensus 3 lsdf--Dy~L--P~elIAq~P~~~---------R---------d~SRLLV~~r~~~~~~~i~h--------~~F~dl~~~ 52 (360) +..| |=-+ |.+|.+++|+.+ | ++-||||+= .++.|+| .++..|.+- T Consensus 5 ~~~~td~~L~~~P~~L~~~~P~t~~~~~~v~~~R~~i~~I~~G~D~RLLVVi----GPCSiHDP~AA~eYA~RL~~L~~~ 80 (348) T TIGR00034 5 IVRITDDPLLMTPAELAAKFPLTPKQAENVAQSRQEIADILAGKDDRLLVVI----GPCSIHDPEAAIEYATRLKKLREK 80 (348) T ss_pred CEEECCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEE----CCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 0012067444784999974787887999999889999998579997689997----588857468999999999999987 Q ss_pred CCCCCEEEEECCCEEEEEEEEEEEC Q ss_conf 6988799994795688899876404 Q gi|254780180|r 53 LNSNDAIVFNNTKVITAQLNGVRFC 77 (360) Q Consensus 53 L~~gDLLV~NnTKVipARL~g~k~~ 77 (360) |...=.+|. |.|..|.+ T Consensus 81 l~~~L~IvM--------RvYfEKPR 97 (348) T TIGR00034 81 LKDDLEIVM--------RVYFEKPR 97 (348) T ss_pred HCCCCEEEE--------EEEECCCC T ss_conf 503760787--------64316987 No 145 >PRK13263 ureE urease accessory protein UreE; Provisional Probab=22.56 E-value=56 Score=13.70 Aligned_cols=56 Identities=20% Similarity=0.189 Sum_probs=39.2 Q ss_pred CCHHHHCCCCC-----CCCCCCEEEEECCCCCCCCEEEEEEHHHHHHHCCCCCEEEEECCCEEEEE Q ss_conf 89778616998-----78120313566378887531531014347872698879999479568889 Q gi|254780180|r 10 LPPSRIALRPV-----FPRDSARLMVVHPNLSCPLVISDHLVSDLPAFLNSNDAIVFNNTKVITAQ 70 (360) Q Consensus 10 LP~elIAq~P~-----~~Rd~SRLLV~~r~~~~~~~i~h~~F~dl~~~L~~gDLLV~NnTKVipAR 70 (360) |-.-||++-|. +.|-.||+-+...+.. +--.|-.=...|+.||+|+..|-+++-.+ T Consensus 14 la~~l~~ra~TltL~~d~R~ksR~r~~~d~G~-----e~~l~LpRg~~L~~GD~L~~edG~~V~V~ 74 (207) T PRK13263 14 LAAVLVARAPTLTLAFDARCKSRLAATLDTGE-----EVALVLPRGTVLRDGDVLVADDGALVRVA 74 (207) T ss_pred HHHHHHHHCCEEEECHHHHHHHHEEEECCCCC-----EEEEECCCCCCCCCCCEEECCCCCEEEEE T ss_conf 00888832884786688851210137769997-----89998888726479998870899999999 No 146 >PRK06857 consensus Probab=22.43 E-value=56 Score=13.68 Aligned_cols=14 Identities=21% Similarity=0.342 Sum_probs=4.9 Q ss_pred HHHHHHCCCCCHHH Q ss_conf 25442035785222 Q gi|254780180|r 153 QISLVGTIPLPPYI 166 (360) Q Consensus 153 ~L~~~G~iPLPPYI 166 (360) +++...+-++=|-| T Consensus 5 ii~~l~~~~iipVi 18 (209) T PRK06857 5 IIEKLRALKVVPVI 18 (209) T ss_pred HHHHHHHCCEEEEE T ss_conf 99999979979999 No 147 >PRK07109 short chain dehydrogenase; Provisional Probab=22.16 E-value=57 Score=13.64 Aligned_cols=127 Identities=13% Similarity=0.048 Sum_probs=69.6 Q ss_pred HCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCC------CC-CCCCHHEEECHHHH Q ss_conf 101345675740055568878999999999721553037873165665788741013------56-67700077859999 Q gi|254780180|r 180 YQTTYAKIQGSVAAPTAGLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTD------DH-IMHSEIGFIDAPTA 252 (360) Q Consensus 180 YQTVyA~~~GSVAAPTAGLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~------~h-~mH~E~~~i~~~~~ 252 (360) +|+.|+--..+| .-||+.|-.+|...|-.|.-.+.+=| .|.+.-.+ .+ .-..--++=|+.++ T Consensus 153 ~~saY~ASK~Av------~GftesLr~EL~~~~s~I~Vt~V~Pg-----~VdTP~f~~~~n~~~~~~~~~pp~~~pe~vA 221 (338) T PRK07109 153 LQSAYCAAKHAI------RGFTDSLRCELLHDGSPVSVTMVQPP-----AVNTPQFDWARSYLPVEPQPVPPIYQPEVVA 221 (338) T ss_pred CCCHHHHHHHHH------HHHHHHHHHHHHHCCCCEEEEEEECC-----CCCCCHHHHHHHCCCCCCCCCCCCCCHHHHH T ss_conf 981799999999------99999999999867998189997579-----8779742444523798888999998989999 Q ss_pred HHHHHHHHCCCEEEEECHHHHHHHHH---------------HHHCCCCCCCC-CCCCCEEECCCCCCEEEEEEEECCCCC Q ss_conf 99998753798099962878998999---------------98528984688-872236657409611431255277554 Q gi|254780180|r 253 QALNSVKSRGGRIVSVGTTSLRLLET---------------ATTEDGIINPW-SGFTNIFITPGYRFRAVDILMSNFHLP 316 (360) Q Consensus 253 ~~I~~ak~~g~rIiAVGTT~~R~LEs---------------~~~~~~~~~~~-~g~T~lfI~Pg~~f~~vd~LiTNFH~P 316 (360) ++|-.+-.+.+|-+-||..+.-+.=. .+.....-.+- ....|-+-.|...-.-+++-.+.--.+ T Consensus 222 ~ai~~~a~~p~r~~~vg~~~~~~~~~~~~~P~l~dr~~~~~~~~~q~~~~~~~~~~~~~l~~p~~~~~~~~g~~~~~~~~ 301 (338) T PRK07109 222 DAILFAAEHPRRELWVGGPAKAAILGNRLAPGLLDRYLARTGYRGQQTDRPADPDRPDNLYEPVPGDHGAHGRFGDRARR 301 (338) T ss_pred HHHHHHHHCCCCEEEECHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHH T ss_conf 99999974898524307789999999755847899999997677540379999999986568898988777886626550 Q ss_pred H Q ss_conf 5 Q gi|254780180|r 317 K 317 (360) Q Consensus 317 ~ 317 (360) . T Consensus 302 ~ 302 (338) T PRK07109 302 R 302 (338) T ss_pred C T ss_conf 6 No 148 >PRK11590 hypothetical protein; Provisional Probab=22.11 E-value=57 Score=13.63 Aligned_cols=26 Identities=31% Similarity=0.630 Sum_probs=20.8 Q ss_pred HCHHHHHHHHHHHHHCCCEEEC-CCCHH Q ss_conf 1989999999999986982611-01022 Q gi|254780180|r 328 CGIEETKKMYQHAISHAYRFYS-YGDTS 354 (360) Q Consensus 328 ~G~~~~~~~Y~~Ai~~~yrF~s-yGDam 354 (360) -|.+.+..+ ++.+....++|| ||||- T Consensus 160 ~G~EKV~rL-~~~lg~~~~l~~aYsDS~ 186 (211) T PRK11590 160 LGHEKVAQL-ERKIGTPLRLYSGYSDSK 186 (211) T ss_pred CCHHHHHHH-HHHCCCCCEEEEECCCCC T ss_conf 767999999-985189950565057874 No 149 >PRK11614 livF leucine/isoleucine/valine transporter ATP-binding subunit; Provisional Probab=22.07 E-value=45 Score=14.35 Aligned_cols=55 Identities=20% Similarity=0.404 Sum_probs=34.1 Q ss_pred CCCCCCCCC-----HHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEE Q ss_conf 555688789-----999999997215530378731656657887410135667700077859999999987537980999 Q gi|254780180|r 193 APTAGLHFT-----SNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVS 267 (360) Q Consensus 193 APTAGLHFt-----~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiA 267 (360) =|||||--. .+++.+|+++|+.+..+|.+...- ..+.|+ ..| -++|||++ T Consensus 163 EPt~gLD~~~~~~i~~~l~~l~~~g~tii~vsH~l~~~-------~~~aDr------v~v------------l~~G~iv~ 217 (237) T PRK11614 163 EPSLGLAPIIIQQIFDTIEQLREQGMTIFLVEQNANQA-------LKLADR------GYV------------LENGHVVL 217 (237) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCHHHH-------HHHCCE------EEE------------EECCEEEE T ss_conf 97557999999999999999996799999994758999-------996999------999------------97999999 Q ss_pred ECHHH Q ss_conf 62878 Q gi|254780180|r 268 VGTTS 272 (360) Q Consensus 268 VGTT~ 272 (360) -||+- T Consensus 218 ~Gt~~ 222 (237) T PRK11614 218 SDTGD 222 (237) T ss_pred ECCHH T ss_conf 99999 No 150 >pfam02455 Hex_IIIa Hexon-associated protein (IIIa). The major capsid protein of the adenovirus strain is also known as a hexon. This is a family of hexon-associated proteins (protein IIIa). Probab=22.04 E-value=57 Score=13.63 Aligned_cols=13 Identities=15% Similarity=0.266 Sum_probs=7.6 Q ss_pred EEEECCCEEEEEE Q ss_conf 9994795688899 Q gi|254780180|r 59 IVFNNTKVITAQL 71 (360) Q Consensus 59 LV~NnTKVipARL 71 (360) |+-|.++++|.-. T Consensus 229 LltNr~~~~p~~~ 241 (491) T pfam02455 229 LLTNREKRLPEPY 241 (491) T ss_pred HHHCCCCCCCCCC T ss_conf 8747650479987 No 151 >cd03217 ABC_FeS_Assembly ABC-type transport system involved in Fe-S cluster assembly, ATPase component. Biosynthesis of iron-sulfur clusters (Fe-S) depends on multiprotein systems. The SUF system of E. coli and Erwinia chrysanthemi is important for Fe-S biogenesis under stressful conditions. The SUF system is made of six proteins: SufC is an atypical cytoplasmic ABC-ATPase, which forms a complex with SufB and SufD; SufA plays the role of a scaffold protein for assembly of iron-sulfur clusters and delivery to target proteins; SufS is a cysteine desulfurase which mobilizes the sulfur atom from cysteine and provides it to the cluster; SufE has no associated function yet. Probab=21.84 E-value=54 Score=13.82 Aligned_cols=61 Identities=23% Similarity=0.344 Sum_probs=36.3 Q ss_pred CCCCCCCCC-----CHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEE Q ss_conf 055568878-----999999999721553037873165665788741013566770007785999999998753798099 Q gi|254780180|r 192 AAPTAGLHF-----TSNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIV 266 (360) Q Consensus 192 AAPTAGLHF-----t~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIi 266 (360) -=|||||-- -.+++.+++++|..+..+|.+.-.-.+ + .+.+..| -.+|||+ T Consensus 129 DEPtsgLD~~~~~~i~~~i~~l~~~g~tiiiitH~~~~~~~-------~-----~~Drv~v------------l~~Gkiv 184 (200) T cd03217 129 DEPDSGLDIDALRLVAEVINKLREEGKSVLIITHYQRLLDY-------I-----KPDRVHV------------LYDGRIV 184 (200) T ss_pred ECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEECHHHHHH-------H-----CCCEEEE------------EECCEEE T ss_conf 69622699999999999999998579999999963687766-------4-----6999999------------8799999 Q ss_pred EECHHHHHHHH Q ss_conf 96287899899 Q gi|254780180|r 267 SVGTTSLRLLE 277 (360) Q Consensus 267 AVGTT~~R~LE 277 (360) +.||+- -++| T Consensus 185 ~~G~~e-e~~~ 194 (200) T cd03217 185 KSGDKE-LALE 194 (200) T ss_pred EEECHH-HHHH T ss_conf 997999-9845 No 152 >pfam02091 tRNA-synt_2e Glycyl-tRNA synthetase alpha subunit. Probab=21.49 E-value=26 Score=16.09 Aligned_cols=24 Identities=25% Similarity=0.656 Sum_probs=14.6 Q ss_pred EECCCCCCCHH-HHHHCCCCCCCCC Q ss_conf 81124866101-1221156436985 Q gi|254780180|r 90 LHMRVSPNSWS-VYARPSKVIKKGD 113 (360) Q Consensus 90 l~~~~~~~~w~-~l~kp~kklk~G~ 113 (360) +++-+.+.-|. |.+.|++|...|. T Consensus 37 ~lr~lgp~pw~~aYvqPsrRP~DgR 61 (284) T pfam02091 37 FLRALGPEPWNVAYVEPSRRPTDGR 61 (284) T ss_pred HHHHCCCCCCEEEEECCCCCCCCCC T ss_conf 8863199864155424677999987 No 153 >cd02767 MopB_ydeP The MopB_ydeP CD includes a group of related uncharacterized bacterial molybdopterin-binding oxidoreductase-like domains with a putative molybdopterin cofactor binding site. These members belong to the molybdopterin_binding (MopB) superfamily of proteins. Probab=21.44 E-value=59 Score=13.54 Aligned_cols=16 Identities=31% Similarity=0.194 Sum_probs=6.9 Q ss_pred CCCCEEEEEEEECCCC Q ss_conf 0961143125527755 Q gi|254780180|r 300 GYRFRAVDILMSNFHL 315 (360) Q Consensus 300 g~~f~~vd~LiTNFH~ 315 (360) |+++-.|.--+-..|. T Consensus 470 g~q~vtve~s~~~v~~ 485 (574) T cd02767 470 GAQAVTVEDSMSMTHT 485 (574) T ss_pred CCCEEEEECCCCCEEE T ss_conf 7651799768773562 No 154 >TIGR01166 cbiO cobalt ABC transporter, ATP-binding protein; InterPro: IPR005876 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents the ATP binding subunit of the multisubunit cobalt transporter in bacteria and its equivalents in archaea. This superfamily includes two groups, one which catalyses the uptake of small molecules, including ions from the external milieu and the other group which is engaged in the efflux of small molecular weight compounds and ions from within the cell. Energy derived from the hydrolysis of ATP drives both the process of uptake and efflux.; GO: 0006824 cobalt ion transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=21.35 E-value=49 Score=14.13 Aligned_cols=37 Identities=41% Similarity=0.477 Sum_probs=29.6 Q ss_pred CCCCCC---------CCCCCC-----CCHHHHHHHHHHCCCCEEEEEECCC Q ss_conf 574005---------556887-----8999999999721553037873165 Q gi|254780180|r 188 QGSVAA---------PTAGLH-----FTSNLLSRLISIGIKVYFVTLHVGA 224 (360) Q Consensus 188 ~GSVAA---------PTAGLH-----Ft~~ll~~L~~kGi~~~~iTLHVG~ 224 (360) .|+||+ |||||= -=-++|.+|.+.|-.+..=|..|-+ T Consensus 139 AGAvAM~Pd~l~LDEPTAGLDp~G~~q~~~~l~~L~~~G~tvv~STHDvdL 189 (190) T TIGR01166 139 AGAVAMRPDVLLLDEPTAGLDPAGAEQLLAILRRLRAEGTTVVISTHDVDL 189 (190) T ss_pred HHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCC T ss_conf 758861663466427888978747999999988787239989997253201 No 155 >KOG0318 consensus Probab=21.35 E-value=30 Score=15.63 Aligned_cols=23 Identities=17% Similarity=0.170 Sum_probs=14.6 Q ss_pred HHHHCCCCCCCCCCCEEEEECCC Q ss_conf 77861699878120313566378 Q gi|254780180|r 12 PSRIALRPVFPRDSARLMVVHPN 34 (360) Q Consensus 12 ~elIAq~P~~~Rd~SRLLV~~r~ 34 (360) ++.+|.-|..+|...-+|=.|++ T Consensus 7 k~~~~~lPst~Rg~~~~ig~dpk 29 (603) T KOG0318 7 KETFASLPSTERGVPIIIGGDPK 29 (603) T ss_pred HHHCCCCCCCCCCCCEEECCCCC T ss_conf 10046887546895159636888 No 156 >PRK13631 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=21.32 E-value=48 Score=14.15 Aligned_cols=32 Identities=25% Similarity=0.303 Sum_probs=24.3 Q ss_pred CCCCCCCCC-----HHHHHHHHHHCCCCEEEEEECCC Q ss_conf 555688789-----99999999721553037873165 Q gi|254780180|r 193 APTAGLHFT-----SNLLSRLISIGIKVYFVTLHVGA 224 (360) Q Consensus 193 APTAGLHFt-----~~ll~~L~~kGi~~~~iTLHVG~ 224 (360) =|||||--. .+++.++++.|..+..||.+.+. T Consensus 202 EPTagLDp~~~~~i~~li~~l~~~g~TiilvTHdm~~ 238 (320) T PRK13631 202 EPTAGLDPKGEHEMMQLILDAKANNKTVFVITHTMEH 238 (320) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCHHH T ss_conf 8755599899999999999999629999999478999 No 157 >TIGR03314 Se_ssnA putative selenium metabolism protein SsnA. Members of this protein family are found exclusively in genomes that contain putative set of labile selenium-dependent enzyme accessory proteins as well as homologs of a labile selenium-dependent purine hydroxylase. A mutant in this gene in Escherichia coli had improved stationary phase viability. The function is unknown. Probab=21.31 E-value=43 Score=14.54 Aligned_cols=20 Identities=10% Similarity=0.037 Sum_probs=9.0 Q ss_pred CCCCHHHHHHHHHHCCCCEE Q ss_conf 87899999999972155303 Q gi|254780180|r 198 LHFTSNLLSRLISIGIKVYF 217 (360) Q Consensus 198 LHFt~~ll~~L~~kGi~~~~ 217 (360) -|.|++=++.|+++|+.++. T Consensus 262 v~l~~~ei~lla~~G~~vah 281 (441) T TIGR03314 262 IYLSDREIELLNETDTFVVH 281 (441) T ss_pred CCCCHHHHHHHHHCCCEEEE T ss_conf 66999999999982982887 No 158 >pfam00224 PK Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains. Probab=21.09 E-value=60 Score=13.49 Aligned_cols=32 Identities=13% Similarity=0.172 Sum_probs=21.7 Q ss_pred EEECHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHH Q ss_conf 778599999999875379809996287899899998 Q gi|254780180|r 245 GFIDAPTAQALNSVKSRGGRIVSVGTTSLRLLETAT 280 (360) Q Consensus 245 ~~i~~~~~~~I~~ak~~g~rIiAVGTT~~R~LEs~~ 280 (360) -.||.---+.|..|++.|++|| +++.-|||-. T Consensus 257 e~vp~~Qk~ii~~~~~~~kpvi----vATqmLeSM~ 288 (348) T pfam00224 257 EEVFLAQKMLIAKCNLAGKPVI----TATQMLESMI 288 (348) T ss_pred HHHHHHHHHHHHHHHHCCCCEE----EECCCHHHHC T ss_conf 8989999999999998599299----9827577651 No 159 >CHL00131 ycf16 sulfate ABC transporter protein; Validated Probab=21.05 E-value=60 Score=13.49 Aligned_cols=56 Identities=14% Similarity=0.290 Sum_probs=34.3 Q ss_pred CCCCCCCCC-----HHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEE Q ss_conf 555688789-----999999997215530378731656657887410135667700077859999999987537980999 Q gi|254780180|r 193 APTAGLHFT-----SNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVS 267 (360) Q Consensus 193 APTAGLHFt-----~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiA 267 (360) =|||||--. .++|.+|+++|..+..||.|...-.+. ..|+ ..| -+.||||+ T Consensus 176 EPTsgLD~~~~~~i~~~l~~l~~~g~tii~itH~~~~~~~~------~aDr------i~v------------m~~G~Iv~ 231 (252) T CHL00131 176 ETDSGLDIDALKIIAEGINKLATSDNAIILITHYQRLLDYI------KPDY------VHV------------MQNGKIIK 231 (252) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEECHHHHHHH------CCCE------EEE------------EECCEEEE T ss_conf 98766999999999999999985899999999866989877------3999------999------------87989999 Q ss_pred ECHHH Q ss_conf 62878 Q gi|254780180|r 268 VGTTS 272 (360) Q Consensus 268 VGTT~ 272 (360) -||.- T Consensus 232 ~Gt~e 236 (252) T CHL00131 232 TGDAE 236 (252) T ss_pred EECHH T ss_conf 97999 No 160 >PRK11124 artP arginine transporter ATP-binding subunit; Provisional Probab=20.99 E-value=47 Score=14.21 Aligned_cols=55 Identities=29% Similarity=0.352 Sum_probs=33.7 Q ss_pred CCCCCCCCC-----HHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEE Q ss_conf 555688789-----999999997215530378731656657887410135667700077859999999987537980999 Q gi|254780180|r 193 APTAGLHFT-----SNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVS 267 (360) Q Consensus 193 APTAGLHFt-----~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiA 267 (360) =|||||--. .++|.+|++.|+.+..||.....- ..+. ....| -++|||++ T Consensus 167 EPTs~LD~~~~~~i~~ll~~l~~~g~tii~vtHdl~~~-------~~~a------dri~v------------l~~G~iv~ 221 (242) T PRK11124 167 EPTAALDPEITAQIVSIIRELAETGITQVIVTHEVEVA-------RKTA------SRVVY------------MENGHIVE 221 (242) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-------HHHC------CEEEE------------EECCEEEE T ss_conf 88654899999999999999984299899988899999-------9969------99999------------97989999 Q ss_pred ECHHH Q ss_conf 62878 Q gi|254780180|r 268 VGTTS 272 (360) Q Consensus 268 VGTT~ 272 (360) -||.. T Consensus 222 ~Gt~~ 226 (242) T PRK11124 222 QGDAS 226 (242) T ss_pred ECCHH T ss_conf 93999 No 161 >pfam01924 HypD Hydrogenase formation hypA family. HypD is involved in hydrogenase formation. It contains many possible metal binding residues, which may bind to nickel. Transposon Tn5 insertions into hypD resulted in R. leguminosarum mutants that lacked any hydrogenase activity in symbiosis with peas. Probab=20.77 E-value=61 Score=13.45 Aligned_cols=110 Identities=21% Similarity=0.232 Sum_probs=57.9 Q ss_pred HHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHH---CCCEEEEECHHHHHHHHHHHHCC Q ss_conf 999721553037873165665788741013566770007785999999998753---79809996287899899998528 Q gi|254780180|r 207 RLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKS---RGGRIVSVGTTSLRLLETATTED 283 (360) Q Consensus 207 ~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~---~g~rIiAVGTT~~R~LEs~~~~~ 283 (360) +|.+++=+..-|=+-||.-|= ...+|..|.++++ ++=.|+...-++-=++|..... T Consensus 117 ~iA~~nP~k~vVF~avGFETT--------------------aP~~A~~i~~a~~~~~~Nf~vl~~hkl~pPa~~~ll~~- 175 (355) T pfam01924 117 KIARENPDKEVVFFAIGFETT--------------------APATAAAILQAKAEGLKNFSVLSNHVLVPPAMEALLED- 175 (355) T ss_pred HHHHHCCCCEEEEEECCCCCC--------------------CHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHC- T ss_conf 999978998599994365556--------------------57999999999980999589977024148999999819- Q ss_pred CCCCCCCCCCCEEECCCCCCEE--------------EEEEEECCCCC--HHHHHHHHHHH-HCHHHHHHHHHHHHH Q ss_conf 9846888722366574096114--------------31255277554--57899999987-198999999999998 Q gi|254780180|r 284 GIINPWSGFTNIFITPGYRFRA--------------VDILMSNFHLP--KSTLLMLVSAF-CGIEETKKMYQHAIS 342 (360) Q Consensus 284 ~~~~~~~g~T~lfI~Pg~~f~~--------------vd~LiTNFH~P--~Stll~Lv~Af-~G~~~~~~~Y~~Ai~ 342 (360) + +-.-|-||.||.-=-+ +-+.+|+|.-- -.+++|||... .|+-.+-+.|..+++ T Consensus 176 ~-----~~~idgfi~PGHVstI~G~~~y~~l~~~y~~P~VVaGFEp~DiL~~i~~lv~q~~~g~~~v~N~Y~r~V~ 246 (355) T pfam01924 176 P-----EVRIDGFIGPGHVSTIIGTEPYEFLAEEYGVPVVVAGFEPLDILQAILMLVRQLNEGRAEVENQYTRAVK 246 (355) T ss_pred C-----CCCCCEEEECCEEEEEECHHHHHHHHHHCCCCEEECCCCHHHHHHHHHHHHHHHHCCCCEEEECCCEEEC T ss_conf 9-----9875458736866788451366999998299866437896999999999999997798539981224678 No 162 >PRK09200 preprotein translocase subunit SecA; Reviewed Probab=20.75 E-value=61 Score=13.45 Aligned_cols=56 Identities=18% Similarity=0.140 Sum_probs=37.7 Q ss_pred CCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEEECHHHHHHHHHH Q ss_conf 1656657887410135667700077859999999987537980999628789989999 Q gi|254780180|r 222 VGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVSVGTTSLRLLETA 279 (360) Q Consensus 222 VG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiAVGTT~~R~LEs~ 279 (360) |-.-|.+|+.-++..+..--++.- =-...++.|.+..+.|++| =|||+|+..=|-. T Consensus 393 ~~iPtn~p~~r~d~~d~v~~t~~~-k~~av~~~v~~~~~~g~pv-Lvgt~sv~~Se~~ 448 (799) T PRK09200 393 VQIPTNRPIIRIDYPDKVFATVDE-KYKAVIEEVKERHETGRPV-LIGTGSIEQSEYF 448 (799) T ss_pred EECCCCCCCEEEECCCCCCCCHHH-HHHHHHHHHHHHHHCCCCE-EEEECCHHHHHHH T ss_conf 988999985445474301089999-9999999999998579988-9983857999999 No 163 >cd07018 S49_SppA_67K_type Signal peptide peptidase A (SppA) 67K type, a serine protease, has catalytic Ser-Lys dyad. Signal peptide peptidase A (SppA; Peptidase S49; Protease IV) 67K type: SppA is found in all three domains of life and is involved in the cleavage of signal peptides after their removal from the precursor proteins by signal peptidases. Members in this subfamily contain an amino-terminal domain in addition to the carboxyl-terminal protease domain that is conserved in all the S49 family members (sometimes referred to as 67K type), similar to E. coli and Arabidopsis thaliana SppA peptidases. Unlike the eukaryotic functional homologs that are proposed to be aspartic proteases, site-directed mutagenesis and sequence analysis have shown that members in this subfamily, mostly bacterial, are serine proteases. The predicted active site serine for members in this family occurs in a transmembrane domain. Mutagenesis studies also suggest that the catalytic center comprises a Ser-Lys Probab=20.67 E-value=61 Score=13.43 Aligned_cols=36 Identities=36% Similarity=0.528 Sum_probs=14.6 Q ss_pred CCCCCCCCCCC--CCCCCCCCHHHHHHHHHHCCCCEEEEEECC Q ss_conf 34567574005--556887899999999972155303787316 Q gi|254780180|r 183 TYAKIQGSVAA--PTAGLHFTSNLLSRLISIGIKVYFVTLHVG 223 (360) Q Consensus 183 VyA~~~GSVAA--PTAGLHFt~~ll~~L~~kGi~~~~iTLHVG 223 (360) |||...|+|-. -.+...|=.++++++ ||+...+ ++| T Consensus 102 I~a~p~~~vg~iGv~~~~~~~k~ll~kl---Gi~~~~~--~~G 139 (222) T cd07018 102 IYLNPSGSVELTGLSAETLFFKGLLDKL---GVEVQVF--RVG 139 (222) T ss_pred EEECCCCEEEEEEEEEECCCHHHHHHHC---CCEEEEE--ECC T ss_conf 5776887687764375421278899986---9806998--424 No 164 >PRK04302 triosephosphate isomerase; Provisional Probab=20.53 E-value=61 Score=13.41 Aligned_cols=65 Identities=20% Similarity=0.141 Sum_probs=37.4 Q ss_pred HCCCCCCCCCCCCCCCCCCCCCHHH-HHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHH Q ss_conf 1013456757400555688789999-999997215530378731656657887410135667700077859999999987 Q gi|254780180|r 180 YQTTYAKIQGSVAAPTAGLHFTSNL-LSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSV 258 (360) Q Consensus 180 YQTVyA~~~GSVAAPTAGLHFt~~l-l~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~a 258 (360) =|.++....|+ ||-++ ..-|++-|++.+-| | |||+=..-.|+-+.|+++ T Consensus 61 AQn~~~~~~GA---------~TGevS~~mL~d~G~~~vIl----G-----------------HSERR~~~~E~~~~v~~a 110 (223) T PRK04302 61 AQHVDAVGPGA---------HTGHILPEAVKDAGAVGTLL----N-----------------HSERRLRLADIEAAVERA 110 (223) T ss_pred ECCCCCCCCCC---------CCCCCCHHHHHHCCCCEEEE----C-----------------CCHHHHHCCCHHHHHHHH T ss_conf 54388987868---------62633099999859999995----6-----------------533100032279999999 Q ss_pred HHCCC-EEEEECHHHHH Q ss_conf 53798-09996287899 Q gi|254780180|r 259 KSRGG-RIVSVGTTSLR 274 (360) Q Consensus 259 k~~g~-rIiAVGTT~~R 274 (360) .+.|= .|+|||++-.. T Consensus 111 ~~~gl~~I~Cv~~~~~~ 127 (223) T PRK04302 111 KELGLESVVCTNNPETS 127 (223) T ss_pred HHCCCEEEEECCCHHHH T ss_conf 98699489972739988 No 165 >COG3480 SdrC Predicted secreted protein containing a PDZ domain [Signal transduction mechanisms] Probab=20.40 E-value=39 Score=14.85 Aligned_cols=60 Identities=18% Similarity=0.287 Sum_probs=36.5 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCC---------CCCCCCCCCCCCCHHEEECHH Q ss_conf 74005556887899999999972155303787316566578---------874101356677000778599 Q gi|254780180|r 189 GSVAAPTAGLHFTSNLLSRLISIGIKVYFVTLHVGAGTFMP---------VKVEDTDDHIMHSEIGFIDAP 250 (360) Q Consensus 189 GSVAAPTAGLHFt~~ll~~L~~kGi~~~~iTLHVG~GTF~P---------i~~~~i~~h~mH~E~~~i~~~ 250 (360) ..+--|+|||-||=++.++|....-..- +-=-|-||.-+ |..+-..-++--.+.|++|.+ T Consensus 230 ~~IGGPSAGLMFSL~Iy~qlt~~DL~~g--~~IAGTGTI~~DG~VG~IGGI~qKvvAA~~AGA~vFf~P~~ 298 (342) T COG3480 230 ENIGGPSAGLMFSLAIYDQLTKGDLTGG--RFIAGTGTIEVDGKVGPIGGIDQKVVAAAKAGADVFFVPAD 298 (342) T ss_pred CCCCCCCHHHEEEHHHHHHCCCCCCCCC--EEEECCEEECCCCCCCCCCCHHHHHHHHHHCCCCEEEECCC T ss_conf 4479975433352988864053113586--69841113346883357454767767787659859984187 No 166 >PRK10895 putative ABC transporter ATP-binding protein YhbG; Provisional Probab=20.40 E-value=57 Score=13.62 Aligned_cols=55 Identities=20% Similarity=0.364 Sum_probs=34.8 Q ss_pred CCCCCCCCC-----HHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEE Q ss_conf 555688789-----999999997215530378731656657887410135667700077859999999987537980999 Q gi|254780180|r 193 APTAGLHFT-----SNLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVS 267 (360) Q Consensus 193 APTAGLHFt-----~~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiA 267 (360) =|||||--. .+++++++++|+.+..+|.+...- ..+.|+ ..| -.+|+|++ T Consensus 163 EPt~gLD~~~~~~i~~~l~~l~~~g~tvl~~tHdl~~~-------~~~~dr------v~v------------l~~G~iv~ 217 (241) T PRK10895 163 EPFAGVDPISVIDIKRIIEHLRDSGLGVLITDHNVRET-------LAVCER------AYI------------VSQGHLIA 217 (241) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH-------HHHCCE------EEE------------EECCEEEE T ss_conf 87547999999999999999996499999990729999-------997999------999------------98999999 Q ss_pred ECHHH Q ss_conf 62878 Q gi|254780180|r 268 VGTTS 272 (360) Q Consensus 268 VGTT~ 272 (360) .||.. T Consensus 218 ~G~p~ 222 (241) T PRK10895 218 HGTPT 222 (241) T ss_pred ECCHH T ss_conf 95899 No 167 >KOG3987 consensus Probab=20.32 E-value=62 Score=13.38 Aligned_cols=64 Identities=16% Similarity=0.169 Sum_probs=28.5 Q ss_pred CCCHHHHHHHHHHC-CCCE---EEEEECCCCCCCCCCCCCCCCC-CCCCHHEEECHHHHHHHHHHHHCCCEEEEE Q ss_conf 78999999999721-5530---3787316566578874101356-677000778599999999875379809996 Q gi|254780180|r 199 HFTSNLLSRLISIG-IKVY---FVTLHVGAGTFMPVKVEDTDDH-IMHSEIGFIDAPTAQALNSVKSRGGRIVSV 268 (360) Q Consensus 199 HFt~~ll~~L~~kG-i~~~---~iTLHVG~GTF~Pi~~~~i~~h-~mH~E~~~i~~~~~~~I~~ak~~g~rIiAV 268 (360) -|+++.+++|-.-| -.+. ---|+.|+| ..+|..| .-|-|..+-.+-+..-+++-++.|-+|+.. T Consensus 92 ifSe~QF~klL~i~~p~w~~~~~~lLDlGAG------dGeit~~m~p~feevyATElS~tMr~rL~kk~ynVl~~ 160 (288) T KOG3987 92 IFSEEQFRKLLVIGGPAWGQEPVTLLDLGAG------DGEITLRMAPTFEEVYATELSWTMRDRLKKKNYNVLTE 160 (288) T ss_pred EECHHHHHHHHHCCCCCCCCCCEEEEECCCC------CCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCEEEE T ss_conf 8569999999831787558987068861678------86100121421899999876699999986527736650 No 168 >TIGR01829 AcAcCoA_reduct acetoacetyl-CoA reductase; InterPro: IPR011283 This entry represent acetoacetyl-CoA reductase, a member of the family short-chain-alcohol dehydrogenases. Note that, despite the precision implied by the enzyme name, the reaction of 1.1.1.36 from EC is defined more generally as (R)-3-hydroxyacyl-CoA + NADP+ = 3-oxoacyl-CoA + NADPH. Members of this family may act in the biosynthesis of poly-beta-hydroxybutyrate (e.g. Rhizobium meliloti) and related poly-beta-hydroxyalkanoates. Note that the member of this family from Azospirillum brasilense, designated NodG, appears to lack acetoacetyl-CoA reductase activity and to act instead in the production of nodulation factor. ; GO: 0018454 acetoacetyl-CoA reductase activity, 0042619 poly-hydroxybutyrate biosynthetic process, 0005737 cytoplasm. Probab=20.31 E-value=36 Score=15.03 Aligned_cols=31 Identities=35% Similarity=0.508 Sum_probs=24.1 Q ss_pred CCCCCCCCCCCCCCCCCCC-CCHHHHHHHHHHCCCCEEE Q ss_conf 0134567574005556887-8999999999721553037 Q gi|254780180|r 181 QTTYAKIQGSVAAPTAGLH-FTSNLLSRLISIGIKVYFV 218 (360) Q Consensus 181 QTVyA~~~GSVAAPTAGLH-Ft~~ll~~L~~kGi~~~~i 218 (360) ||=|| |-=||+| ||..|=++...|||.+-.| T Consensus 148 QtNYS-------AAKAG~iGFTkALA~E~A~kGvTVN~i 179 (244) T TIGR01829 148 QTNYS-------AAKAGMIGFTKALAQEGARKGVTVNVI 179 (244) T ss_pred HHHHH-------HHHHHHHHHHHHHHHHCCCCCCEEEEE T ss_conf 04589-------886215677799997211038567545 No 169 >TIGR00348 hsdR type I site-specific deoxyribonuclease, HsdR family; InterPro: IPR004473 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements , , as summarised below: Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA. Type I restriction endonucleases are components of prokaryotic DNA restriction-modification mechanisms that protects the organism against invading foreign DNA. Type I enzymes have three different subunits subunits - M (modification), S (specificity) and R (restriction) - that form multifunctional enzymes with restriction (3.1.21.3 from EC), methylase (2.1.1.72 from EC) and ATPase activities , . The S subunit is required for both restriction and modification and is responsible for recognition of the DNA sequence specific for the system. The M subunit is necessary for modification, and the R subunit is required for restriction. These enzymes use S-Adenosyl-L-methionine (AdoMet) as the methyl group donor in the methylation reaction, and have a requirement for ATP. They recognize asymmetric DNA sequences split into two domains of specific sequence, one 3-4 bp long and another 4-5 bp long, separated by a nonspecific spacer 6-8 bp in length. Cleavage occurs a considerable distance from the recognition sites, rarely less than 400 bp away and up to 7000 bp away. Adenosyl residues are methylated, one on each strand of the recognition sequence. These enzymes are widespread in eubacteria and archaea. In enteric bacteria they have been subdivide into four families: types IA, IB, IC and ID. This entry represents the R subunit (HsdR) of type I restriction endonucleases such as EcoRI (3.1.21.3 from EC), which recognises the DNA sequence 5'-GAATTC; the R protein (HsdR) is required for both nuclease and ATPase activity , , , .; GO: 0003677 DNA binding, 0005524 ATP binding, 0009035 Type I site-specific deoxyribonuclease activity, 0009307 DNA restriction-modification system. Probab=20.25 E-value=36 Score=15.05 Aligned_cols=32 Identities=25% Similarity=0.464 Sum_probs=24.6 Q ss_pred CCCCCEEECCCCCCE---EEEEEEECCCCCHHHHH Q ss_conf 872236657409611---43125527755457899 Q gi|254780180|r 290 SGFTNIFITPGYRFR---AVDILMSNFHLPKSTLL 321 (360) Q Consensus 290 ~g~T~lfI~Pg~~f~---~vd~LiTNFH~P~Stll 321 (360) .-++.=|+.|.-+.+ |||.|||+|=.|-=.=| T Consensus 717 ~~~~~~F~~p~~~~k~LIV~dMlLTGFDAP~L~Tl 751 (813) T TIGR00348 717 EKLLERFKDPEENLKLLIVVDMLLTGFDAPILNTL 751 (813) T ss_pred HHHHHHHCCCCCCCCEEEEECCCCCCCCHHHHHHH T ss_conf 68898707888888479870770225661123323 No 170 >PRK13548 hmuV hemin importer ATP-binding subunit; Provisional Probab=20.23 E-value=61 Score=13.44 Aligned_cols=80 Identities=23% Similarity=0.279 Sum_probs=46.0 Q ss_pred CCCCCCCCCH-----HHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHEEECHHHHHHHHHHHHCCCEEEE Q ss_conf 5556887899-----99999997215530378731656657887410135667700077859999999987537980999 Q gi|254780180|r 193 APTAGLHFTS-----NLLSRLISIGIKVYFVTLHVGAGTFMPVKVEDTDDHIMHSEIGFIDAPTAQALNSVKSRGGRIVS 267 (360) Q Consensus 193 APTAGLHFt~-----~ll~~L~~kGi~~~~iTLHVG~GTF~Pi~~~~i~~h~mH~E~~~i~~~~~~~I~~ak~~g~rIiA 267 (360) =|||||-... +++.+|+++|..+..+|.+...- ..+.|+ ..| -++|||+| T Consensus 166 EPtsgLD~~~~~~i~~ll~~l~~~g~tvl~vtHdl~~~-------~~~aDr------v~v------------l~~G~iv~ 220 (257) T PRK13548 166 EPTSALDLAHQHHVLRLARQLAERGAAVIVVLHDLNLA-------ARYADR------IVL------------LHQGRLVA 220 (257) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCHHHH-------HHHCCE------EEE------------EECCEEEE T ss_conf 98766899999999999999985599999992788999-------997999------999------------98999999 Q ss_pred ECHHH-HHHHHHHHHCCCCCCCCCCCCCEEECC--CCCC Q ss_conf 62878-998999985289846888722366574--0961 Q gi|254780180|r 268 VGTTS-LRLLETATTEDGIINPWSGFTNIFITP--GYRF 303 (360) Q Consensus 268 VGTT~-~R~LEs~~~~~~~~~~~~g~T~lfI~P--g~~f 303 (360) -||.. +-+=|.. -..+.....++.+| |++. T Consensus 221 ~G~p~evl~~~~l------~~~yg~~~~v~~~~~~~~p~ 253 (257) T PRK13548 221 DGTPAEVLTPETL------RQVYGADLRVQRHPELGHPL 253 (257) T ss_pred ECCHHHHCCHHHH------HHHHCCCEEEEECCCCCCEE T ss_conf 9387997798899------99839942999769999768 No 171 >PRK05439 pantothenate kinase; Provisional Probab=20.07 E-value=63 Score=13.35 Aligned_cols=34 Identities=38% Similarity=0.312 Sum_probs=14.7 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 7400555688789999999997215530378731656 Q gi|254780180|r 189 GSVAAPTAGLHFTSNLLSRLISIGIKVYFVTLHVGAG 225 (360) Q Consensus 189 GSVAAPTAGLHFt~~ll~~L~~kGi~~~~iTLHVG~G 225 (360) ||||. |=-=|..+|+.|-++--..-.|.|=.-=| T Consensus 93 GSVAV---GKSTtARlLq~LL~r~~~~~~V~LvTTDG 126 (312) T PRK05439 93 GSVAV---GKSTTARLLQALLSRWPEHPKVDLVTTDG 126 (312) T ss_pred EEEEC---CHHHHHHHHHHHHHCCCCCCCEEEEECCC T ss_conf 20102---62889999999995078999458993466 Done!