Query gi|254780717|ref|YP_003065130.1| zinc uptake ABC transporter [Candidatus Liberibacter asiaticus str. psy62] Match_columns 294 No_of_seqs 110 out of 2555 Neff 8.0 Searched_HMMs 39220 Date Sun May 29 22:05:18 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780717.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK09545 znuA high-affinity zi 100.0 0 0 519.6 29.9 291 2-293 1-307 (308) 2 cd01019 ZnuA Zinc binding prot 100.0 0 0 494.4 28.2 272 21-293 1-286 (286) 3 cd01017 AdcA Metal binding pro 100.0 0 0 479.9 27.8 273 21-293 1-281 (282) 4 cd01137 PsaA Metal binding pro 100.0 0 0 478.9 27.1 267 20-293 14-286 (287) 5 cd01016 TroA Metal binding pro 100.0 0 0 466.7 27.4 264 23-293 1-275 (276) 6 cd01018 ZntC Metal binding pro 100.0 0 0 455.3 26.9 260 22-290 1-265 (266) 7 pfam01297 SBP_bac_9 Periplasmi 100.0 0 0 450.5 26.5 266 26-294 1-272 (272) 8 cd01020 TroA_b Metal binding p 100.0 0 0 443.4 26.8 254 22-293 1-263 (264) 9 COG0803 LraI ABC-type metal io 100.0 0 0 432.2 28.2 266 22-294 31-302 (303) 10 COG4531 ZnuA ABC-type Zn2+ tra 100.0 0 0 428.8 24.2 293 1-294 3-317 (318) 11 cd01145 TroA_c Periplasmic bin 100.0 0 0 369.0 17.5 201 22-223 1-203 (203) 12 cd00636 TroA-like Helical back 98.2 8.9E-05 2.3E-09 50.7 13.2 124 25-185 2-138 (148) 13 COG4607 CeuA ABC-type enteroch 96.1 0.13 3.3E-06 30.7 15.4 219 23-289 58-319 (320) 14 cd01977 Nitrogenase_VFe_alpha 94.8 0.37 9.4E-06 27.7 14.2 141 133-289 242-407 (415) 15 PRK06027 purU formyltetrahydro 89.3 1.9 4.8E-05 23.2 8.7 111 169-282 100-210 (285) 16 PRK10957 iron-enterobactin tra 89.2 1.9 4.8E-05 23.2 15.4 41 3-43 6-66 (319) 17 cd01968 Nitrogenase_NifE_I Nit 86.1 2.9 7.4E-05 22.0 16.1 145 133-289 238-405 (410) 18 PRK13011 formyltetrahydrofolat 86.0 2.9 7.5E-05 22.0 7.7 110 171-283 104-213 (287) 19 cd03795 GT1_like_4 This family 84.9 3.3 8.4E-05 21.7 6.0 20 244-265 277-296 (357) 20 PRK13010 purU formyltetrahydro 82.8 4 0.0001 21.2 9.5 112 169-283 105-216 (289) 21 PRK10270 hypothetical protein; 81.2 1.4 3.5E-05 24.1 2.7 16 1-16 1-16 (340) 22 PRK09590 celB cellobiose phosp 80.3 2.2 5.7E-05 22.8 3.5 64 221-284 36-102 (104) 23 cd03819 GT1_WavL_like This fam 80.1 3.3 8.3E-05 21.7 4.3 21 244-266 276-296 (355) 24 cd03808 GT1_cap1E_like This fa 78.6 2.9 7.4E-05 22.0 3.7 20 244-265 275-294 (359) 25 PRK00591 prfA peptide chain re 78.2 5.7 0.00015 20.2 7.1 97 126-222 23-163 (360) 26 PRK10355 xylF D-xylose transpo 78.1 5.8 0.00015 20.1 10.4 100 1-105 3-114 (330) 27 cd01967 Nitrogenase_MoFe_alpha 76.2 6.5 0.00017 19.8 18.6 144 133-289 240-404 (406) 28 cd00316 Oxidoreductase_nitroge 75.8 6.7 0.00017 19.7 17.7 173 45-265 180-373 (399) 29 TIGR03659 IsdE heme ABC transp 75.4 6.8 0.00017 19.7 21.9 247 2-289 1-288 (289) 30 TIGR00289 TIGR00289 conserved 75.2 6.2 0.00016 20.0 4.6 43 218-260 72-117 (227) 31 PRK03095 prsA peptidylprolyl i 75.0 4.1 0.0001 21.1 3.6 29 1-29 1-30 (287) 32 pfam00148 Oxidored_nitro Nitro 73.8 7.5 0.00019 19.4 18.6 172 45-264 173-365 (398) 33 cd01965 Nitrogenase_MoFe_beta_ 73.2 7.7 0.0002 19.3 14.2 117 133-264 255-395 (428) 34 COG1464 NlpA ABC-type metal io 73.0 6 0.00015 20.0 4.1 143 1-155 1-158 (268) 35 pfam11897 DUF3417 Protein of u 72.1 2.7 6.9E-05 22.2 2.1 74 126-207 39-116 (119) 36 cd03800 GT1_Sucrose_synthase T 71.9 8.3 0.00021 19.1 6.5 18 33-50 27-44 (398) 37 cd03820 GT1_amsD_like This fam 71.7 5.4 0.00014 20.3 3.6 19 245-265 265-283 (348) 38 cd03817 GT1_UGDG_like This fam 71.2 7.8 0.0002 19.3 4.3 10 71-80 83-92 (374) 39 PRK10954 periplasmic protein d 70.7 8.8 0.00022 19.0 5.2 54 1-64 1-57 (207) 40 PRK11063 metQ DL-methionine tr 70.3 7.2 0.00018 19.6 4.0 144 2-156 6-161 (271) 41 cd03814 GT1_like_2 This family 70.2 9 0.00023 18.9 4.5 23 33-55 20-42 (364) 42 cd03801 GT1_YqgM_like This fam 69.1 9.5 0.00024 18.8 4.7 20 33-52 20-39 (374) 43 cd03823 GT1_ExpE7_like This fa 67.4 8.3 0.00021 19.2 3.8 20 244-265 275-294 (359) 44 cd01976 Nitrogenase_MoFe_alpha 66.8 11 0.00027 18.5 17.4 142 133-292 252-418 (421) 45 PRK10866 outer membrane protei 66.8 5.5 0.00014 20.3 2.8 208 1-233 1-224 (243) 46 COG0075 Serine-pyruvate aminot 66.6 11 0.00027 18.5 9.2 56 20-80 78-139 (383) 47 PRK12450 foldase protein PrsA; 66.6 6 0.00015 20.0 3.0 29 1-29 4-35 (309) 48 cd03805 GT1_ALG2_like This fam 66.4 11 0.00028 18.4 5.8 16 249-266 316-331 (392) 49 PRK03002 prsA peptidylprolyl i 66.1 7.3 0.00018 19.5 3.3 31 1-31 1-34 (285) 50 cd01981 Pchlide_reductase_B Pc 65.2 11 0.00029 18.3 17.3 150 133-289 246-427 (430) 51 TIGR01511 ATPase-IB1_Cu copper 64.7 12 0.0003 18.2 5.3 44 223-267 375-419 (545) 52 TIGR03156 GTP_HflX GTP-binding 64.5 12 0.0003 18.2 7.7 74 28-104 17-96 (351) 53 PRK11627 hypothetical protein; 64.1 4.7 0.00012 20.7 2.0 33 1-37 2-34 (192) 54 TIGR01465 cobM_cbiF precorrin- 64.1 10 0.00026 18.6 3.7 38 63-104 15-53 (252) 55 PRK13883 conjugal transfer pro 61.7 9.3 0.00024 18.8 3.2 20 1-21 1-20 (156) 56 cd03799 GT1_amsK_like This is 61.4 13 0.00034 17.8 6.6 43 66-110 124-166 (355) 57 cd01974 Nitrogenase_MoFe_beta 61.4 13 0.00034 17.8 19.3 142 133-289 259-433 (435) 58 PRK09495 glnH glutamine ABC tr 61.0 9.6 0.00024 18.7 3.1 34 1-34 1-39 (247) 59 cd03825 GT1_wcfI_like This fam 60.0 14 0.00036 17.7 5.1 39 33-78 19-57 (365) 60 PRK09701 D-allose transporter 59.8 14 0.00036 17.7 6.1 100 1-107 1-117 (311) 61 PRK01326 prsA foldase protein 58.8 13 0.00032 18.0 3.4 28 1-28 1-32 (310) 62 PRK10449 heat-inducible protei 58.7 7.1 0.00018 19.6 2.2 24 1-25 1-24 (140) 63 pfam07901 DUF1672 Protein of u 57.8 7.6 0.00019 19.4 2.2 21 1-21 1-21 (304) 64 pfam10933 DUF2827 Protein of u 57.6 12 0.00032 18.0 3.2 73 64-141 67-143 (364) 65 pfam02669 KdpC K+-transporting 57.2 0.76 1.9E-05 25.7 -3.0 92 48-148 53-147 (188) 66 pfam07273 DUF1439 Protein of u 57.2 11 0.00028 18.4 2.9 21 1-21 1-21 (177) 67 cd00614 CGS_like CGS_like: Cys 56.6 16 0.00041 17.3 5.4 78 151-232 220-307 (369) 68 PRK11058 putative GTPase HflX; 56.5 16 0.00041 17.3 7.9 75 28-104 25-104 (426) 69 PRK08043 bifunctional acyl-[ac 56.2 16 0.00042 17.3 5.2 11 45-55 97-107 (719) 70 pfam08139 LPAM_1 Prokaryotic m 55.9 8.5 0.00022 19.1 2.1 21 1-22 1-21 (26) 71 pfam12262 Lipase_bact_N Bacter 55.8 13 0.00033 17.9 3.1 32 1-32 1-33 (269) 72 PRK10780 periplasmic chaperone 55.3 10 0.00026 18.6 2.5 33 1-33 1-33 (161) 73 PRK10598 hypothetical protein; 54.4 12 0.00031 18.1 2.7 22 1-22 1-23 (186) 74 PRK00031 lolA outer-membrane l 53.9 9.5 0.00024 18.8 2.1 20 1-20 1-20 (201) 75 PRK13280 N-glycosylase/DNA lya 53.6 18 0.00046 17.0 7.8 106 161-270 121-231 (265) 76 COG1535 EntB Isochorismate hyd 53.5 16 0.0004 17.4 3.2 50 186-241 28-77 (218) 77 TIGR00670 asp_carb_tr aspartat 53.1 18 0.00047 17.0 4.7 73 33-110 60-147 (336) 78 PRK03629 tolB translocation pr 52.9 15 0.00038 17.5 3.0 20 1-20 1-20 (430) 79 PRK09967 putative outer membra 52.7 17 0.00044 17.1 3.3 21 1-21 2-22 (160) 80 cd03794 GT1_wbuB_like This fam 52.3 19 0.00048 16.9 6.4 13 70-82 97-109 (394) 81 PRK09861 cytoplasmic membrane 52.1 19 0.00048 16.9 4.4 141 5-156 11-162 (272) 82 KOG4175 consensus 51.6 19 0.00049 16.8 8.3 47 223-269 136-183 (268) 83 pfam00121 TIM Triosephosphate 50.8 20 0.00051 16.7 6.7 63 187-264 157-229 (243) 84 pfam01053 Cys_Met_Meta_PP Cys/ 50.8 20 0.00051 16.7 5.6 77 151-232 233-319 (381) 85 TIGR03525 GldK gliding motilit 50.6 18 0.00045 17.1 3.1 19 1-20 1-19 (449) 86 PRK11411 fecB iron-dicitrate t 49.6 21 0.00053 16.6 18.0 136 142-289 142-294 (297) 87 COG3275 LytS Putative regulato 49.2 21 0.00054 16.6 3.5 37 124-167 368-412 (557) 88 PRK13479 2-aminoethylphosphona 49.0 21 0.00054 16.6 12.4 51 223-293 311-361 (368) 89 COG1927 Mtd Coenzyme F420-depe 48.5 22 0.00055 16.5 8.4 196 30-244 15-220 (277) 90 TIGR03088 stp2 sugar transfera 48.2 22 0.00055 16.5 5.2 53 24-80 34-89 (374) 91 PRK02627 acetylornithine amino 48.2 22 0.00056 16.5 7.2 117 143-261 84-221 (398) 92 PHA01735 hypothetical protein 47.8 22 0.00056 16.4 3.3 37 216-252 27-63 (76) 93 COG1022 FAA1 Long-chain acyl-C 47.7 22 0.00056 16.4 5.6 81 126-208 277-380 (613) 94 PRK01742 tolB translocation pr 47.5 20 0.0005 16.8 2.9 26 1-26 4-29 (430) 95 PRK13601 putative ribosomal pr 47.5 22 0.00057 16.4 5.6 45 224-268 14-61 (84) 96 PRK04792 tolB translocation pr 47.2 23 0.00058 16.4 3.2 21 1-21 2-22 (450) 97 cd00617 Tnase_like Tryptophana 47.1 23 0.00058 16.4 4.0 50 219-268 295-363 (431) 98 KOG0053 consensus 46.8 23 0.00058 16.3 6.8 18 276-293 391-408 (409) 99 PRK00994 F420-dependent methyl 46.8 23 0.00058 16.3 7.3 190 30-245 15-220 (276) 100 pfam07383 DUF1496 Protein of u 46.6 20 0.00051 16.7 2.9 33 1-33 1-33 (88) 101 COG2102 Predicted ATPases of P 46.5 22 0.00055 16.5 3.0 25 132-156 45-69 (223) 102 cd05844 GT1_like_7 Glycosyltra 46.4 23 0.00059 16.3 7.0 41 68-111 138-178 (367) 103 PRK00578 prfB peptide chain re 46.3 23 0.00059 16.3 7.3 92 125-218 38-171 (367) 104 COG0626 MetC Cystathionine bet 46.2 23 0.0006 16.3 6.2 129 152-292 246-395 (396) 105 pfam01026 TatD_DNase TatD rela 45.9 24 0.0006 16.3 6.7 20 226-245 187-206 (255) 106 PRK07789 acetolactate synthase 44.6 25 0.00063 16.1 5.6 41 62-102 287-327 (612) 107 pfam07437 YfaZ YfaZ precursor. 44.1 15 0.00039 17.5 1.9 26 1-26 1-26 (180) 108 PRK04405 prsA peptidylprolyl i 44.0 23 0.00057 16.4 2.8 27 1-28 5-33 (298) 109 pfam03652 UPF0081 Uncharacteri 43.5 26 0.00065 16.0 6.7 65 197-261 43-112 (134) 110 cd06310 PBP1_ABC_sugar_binding 43.4 26 0.00066 16.0 7.9 34 71-105 56-90 (273) 111 PRK13962 bifunctional phosphog 43.3 26 0.00066 16.0 6.2 21 61-81 295-319 (653) 112 PRK12789 flgI flagellar basal 43.2 11 0.00028 18.3 1.1 21 1-21 2-22 (367) 113 TIGR03431 PhnD phosphonate ABC 43.2 26 0.00066 16.0 5.8 75 1-83 2-91 (288) 114 pfam01993 MTD methylene-5,6,7, 42.8 26 0.00067 16.0 7.4 190 30-245 14-220 (276) 115 PRK07245 consensus 42.6 27 0.00068 15.9 6.5 22 126-147 12-33 (337) 116 PRK00109 Holliday junction res 42.2 27 0.00069 15.9 6.8 64 197-260 46-114 (141) 117 TIGR01826 CofD_related conserv 42.2 20 0.00051 16.7 2.3 16 62-77 179-195 (331) 118 PRK00315 potassium-transportin 41.6 1.2 2.9E-05 24.6 -4.1 23 125-149 127-149 (195) 119 COG3933 Transcriptional antite 41.3 28 0.00071 15.8 4.5 31 29-59 120-150 (470) 120 pfam01261 AP_endonuc_2 Xylose 41.2 28 0.00071 15.8 10.2 21 220-240 133-153 (201) 121 cd03796 GT1_PIG-A_like This fa 40.5 29 0.00073 15.7 5.5 19 33-51 20-38 (398) 122 COG0240 GpsA Glycerol-3-phosph 40.4 29 0.00073 15.7 6.1 75 34-109 34-110 (329) 123 TIGR02196 GlrX_YruB Glutaredox 40.3 29 0.00073 15.7 4.6 43 224-266 13-56 (79) 124 COG2262 HflX GTPases [General 40.2 29 0.00074 15.7 7.4 75 28-104 20-99 (411) 125 PRK11557 putative DNA-binding 40.1 29 0.00074 15.7 8.7 79 186-264 132-212 (282) 126 PRK07811 cystathionine gamma-s 39.7 29 0.00075 15.7 5.7 77 152-232 239-325 (386) 127 pfam06291 Lambda_Bor Bor prote 39.1 21 0.00054 16.6 2.0 20 1-21 1-20 (97) 128 COG2247 LytB Putative cell wal 39.1 30 0.00077 15.6 5.8 88 1-94 1-99 (337) 129 PRK10812 putative metallodepen 39.0 30 0.00077 15.6 6.3 11 76-86 37-47 (265) 130 COG5633 Predicted periplasmic 38.6 19 0.00048 16.9 1.7 26 1-27 1-26 (123) 131 cd00003 PNPsynthase Pyridoxine 38.2 31 0.00079 15.5 9.8 44 220-266 108-152 (234) 132 PRK04781 histidinol-phosphate 38.2 31 0.00079 15.5 9.3 47 31-80 109-158 (362) 133 COG4992 ArgD Ornithine/acetylo 38.0 31 0.0008 15.5 4.5 65 131-198 74-140 (404) 134 cd06312 PBP1_ABC_sugar_binding 38.0 31 0.0008 15.5 7.5 56 45-104 31-89 (271) 135 PRK11449 putative deoxyribonuc 37.9 31 0.0008 15.5 6.4 18 45-62 31-48 (258) 136 PRK00116 ruvA Holliday junctio 37.9 23 0.00058 16.3 2.0 11 78-88 21-31 (198) 137 PRK07810 O-succinylhomoserine 37.5 32 0.00081 15.4 7.2 83 150-232 252-344 (406) 138 PRK09423 gldA glycerol dehydro 37.4 32 0.00081 15.4 9.8 40 220-259 289-331 (366) 139 pfam03740 PdxJ Pyridoxal phosp 36.9 33 0.00083 15.4 9.3 44 220-266 109-153 (239) 140 pfam04056 Ssl1 Ssl1-like. Ssl1 36.4 33 0.00084 15.3 4.5 45 221-265 173-218 (250) 141 TIGR01163 rpe ribulose-phospha 36.4 33 0.00084 15.3 6.8 139 133-284 61-214 (216) 142 PRK06176 cystathionine gamma-s 36.2 33 0.00085 15.3 5.5 128 151-290 229-377 (379) 143 PRK06683 hypothetical protein; 36.1 33 0.00085 15.3 5.4 45 224-268 16-63 (82) 144 pfam09171 DUF1886 Domain of un 35.9 34 0.00086 15.3 8.2 123 166-292 117-244 (246) 145 PRK03202 6-phosphofructokinase 35.3 34 0.00088 15.2 8.3 67 198-264 176-249 (323) 146 cd06316 PBP1_ABC_sugar_binding 35.1 35 0.00089 15.2 8.6 34 71-105 55-89 (294) 147 PRK10528 multifunctional acyl- 35.0 35 0.00089 15.2 7.0 72 220-291 93-185 (191) 148 pfam09403 FadA Adhesion protei 34.9 28 0.0007 15.8 2.1 16 190-205 84-99 (126) 149 COG4314 NosL Predicted lipopro 34.8 28 0.00072 15.8 2.1 22 1-22 1-22 (176) 150 PRK10641 btuB vitamin B12/coba 34.7 35 0.0009 15.2 2.8 22 1-22 2-23 (615) 151 PRK06746 peptide chain release 34.6 35 0.0009 15.2 5.8 19 128-146 3-21 (326) 152 PRK08134 O-acetylhomoserine am 34.6 35 0.0009 15.1 5.1 131 151-292 276-428 (433) 153 PRK09859 multidrug efflux syst 34.3 35 0.00089 15.2 2.5 68 3-85 6-74 (385) 154 PRK13602 putative ribosomal pr 34.2 36 0.00091 15.1 5.4 45 224-268 16-63 (82) 155 TIGR02622 CDP_4_6_dhtase CDP-g 34.1 36 0.00092 15.1 3.5 47 187-233 5-56 (361) 156 cd03821 GT1_Bme6_like This fam 34.0 36 0.00092 15.1 6.6 10 71-80 86-95 (375) 157 cd01971 Nitrogenase_VnfN_like 33.9 36 0.00093 15.1 18.4 149 133-289 242-426 (427) 158 cd03769 SR_IS607_transposase_l 33.2 37 0.00095 15.0 9.5 72 213-289 40-116 (134) 159 COG0161 BioA Adenosylmethionin 32.7 38 0.00097 15.0 7.0 120 142-261 89-252 (449) 160 pfam06506 PrpR_N Propionate ca 32.7 38 0.00097 15.0 8.0 62 195-266 88-149 (169) 161 KOG2807 consensus 32.6 38 0.00097 14.9 4.7 48 219-266 177-225 (378) 162 PRK00022 lolB outer membrane l 32.4 34 0.00086 15.3 2.2 21 1-21 1-21 (203) 163 PRK08247 cystathionine gamma-s 32.4 38 0.00098 14.9 4.4 46 151-196 231-278 (366) 164 PRK10877 thiol:disulfide inter 32.1 35 0.00089 15.2 2.2 42 1-52 1-42 (232) 165 pfam10280 Med11 Mediator compl 32.0 39 0.00099 14.9 4.6 66 134-199 14-81 (113) 166 PRK07504 O-succinylhomoserine 31.8 39 0.001 14.9 5.4 130 151-291 245-395 (397) 167 PRK10290 superoxide dismutase; 31.8 38 0.00096 15.0 2.3 20 1-20 1-20 (173) 168 TIGR03449 mycothiol_MshA UDP-N 31.8 39 0.001 14.9 5.3 19 33-51 26-44 (405) 169 TIGR02329 propionate_PrpR prop 31.7 39 0.001 14.8 5.4 71 187-267 99-172 (658) 170 PRK07671 cystathionine beta-ly 31.5 40 0.001 14.8 5.1 76 152-232 230-315 (377) 171 PRK10190 hypothetical protein; 31.5 40 0.001 14.8 3.8 42 1-42 1-59 (310) 172 PRK02910 light-independent pro 31.3 40 0.001 14.8 14.5 76 185-265 291-387 (524) 173 TIGR01279 DPOR_bchN light-inde 31.3 40 0.001 14.8 3.7 54 32-85 72-132 (458) 174 PRK13238 tnaA tryptophanase; P 31.2 40 0.001 14.8 4.0 135 133-268 195-389 (461) 175 PRK09221 beta alanine--pyruvat 31.1 40 0.001 14.8 7.0 49 144-192 96-146 (445) 176 PRK10760 murein hydrolase B; P 31.0 25 0.00063 16.2 1.3 25 1-26 1-25 (357) 177 cd03809 GT1_mtfB_like This fam 30.8 41 0.001 14.8 5.4 11 70-80 138-148 (365) 178 PRK09810 entericidin A; Provis 30.8 36 0.00093 15.1 2.1 21 1-21 2-22 (41) 179 PRK09465 tolC outer membrane c 30.4 37 0.00094 15.0 2.1 16 1-16 1-16 (446) 180 pfam06649 DUF1161 Protein of u 30.4 38 0.00097 14.9 2.2 20 1-20 1-20 (75) 181 TIGR02747 TraV type IV conjuga 30.4 33 0.00084 15.3 1.8 42 1-42 1-59 (174) 182 cd03811 GT1_WabH_like This fam 30.3 42 0.0011 14.7 4.4 20 245-266 276-295 (353) 183 PRK00042 tpiA triosephosphate 30.2 42 0.0011 14.7 7.6 78 188-284 161-248 (251) 184 TIGR01457 HAD-SF-IIA-hyp2 HAD- 30.2 42 0.0011 14.7 4.4 16 227-242 25-40 (251) 185 PRK12698 flgH flagellar basal 30.2 42 0.0011 14.7 2.3 19 1-21 1-19 (224) 186 pfam07269 consensus 30.0 34 0.00087 15.2 1.9 19 1-22 1-19 (55) 187 PRK08133 O-succinylhomoserine 29.8 42 0.0011 14.6 5.4 79 150-232 240-328 (391) 188 PRK00094 gpsA NAD(P)H-dependen 29.8 42 0.0011 14.6 3.3 83 23-109 26-110 (325) 189 PRK03757 hypothetical protein; 29.7 43 0.0011 14.6 3.0 22 1-22 1-23 (191) 190 TIGR02127 pyrF_sub2 orotidine 29.6 43 0.0011 14.6 3.7 122 127-253 20-177 (306) 191 TIGR02475 CobW cobalamin biosy 29.4 30 0.00076 15.6 1.5 45 61-107 176-220 (349) 192 PRK05968 hypothetical protein; 29.3 43 0.0011 14.6 5.7 17 276-292 371-387 (389) 193 pfam05643 DUF799 Putative bact 29.3 43 0.0011 14.6 2.3 29 56-86 88-117 (215) 194 TIGR02887 spore_ger_x_C germin 29.2 37 0.00096 15.0 2.0 21 1-21 1-21 (400) 195 PRK09934 putative fimbrial pro 29.0 44 0.0011 14.6 3.2 17 1-17 1-17 (171) 196 cd05013 SIS_RpiR RpiR-like pro 29.0 44 0.0011 14.6 5.3 26 222-247 74-99 (139) 197 PRK07798 acyl-CoA synthetase; 28.8 44 0.0011 14.5 9.1 20 24-43 101-120 (532) 198 COG0788 PurU Formyltetrahydrof 28.6 45 0.0011 14.5 8.4 92 184-282 117-212 (287) 199 cd06318 PBP1_ABC_sugar_binding 28.5 45 0.0011 14.5 6.8 34 71-105 54-88 (282) 200 PRK12435 ferrochelatase; Provi 28.5 45 0.0011 14.5 9.6 213 53-288 51-304 (316) 201 cd02812 PcrB_like PcrB_like pr 28.4 45 0.0011 14.5 7.0 76 195-278 138-214 (219) 202 pfam00534 Glycos_transf_1 Glyc 28.2 35 0.0009 15.2 1.7 14 135-148 142-155 (172) 203 PRK13697 cytochrome c6; Provis 27.9 43 0.0011 14.6 2.1 16 1-16 1-16 (111) 204 cd06305 PBP1_methylthioribose_ 27.8 46 0.0012 14.4 7.7 35 71-106 54-89 (273) 205 PRK12696 flgH flagellar basal 27.7 46 0.0012 14.4 2.6 20 1-21 2-21 (238) 206 pfam09711 Cas_Csn2 CRISPR-asso 27.6 46 0.0012 14.4 3.3 32 213-244 137-168 (188) 207 pfam06316 Ail_Lom Enterobacter 26.8 46 0.0012 14.4 2.1 19 1-19 1-19 (199) 208 PRK04922 tolB translocation pr 26.8 48 0.0012 14.3 3.4 29 133-161 139-167 (439) 209 PRK07050 cystathionine beta-ly 26.8 48 0.0012 14.3 5.3 134 152-294 246-394 (394) 210 pfam07424 TrbM TrbM. This fami 26.4 48 0.0012 14.3 2.1 54 1-65 1-54 (189) 211 PRK11548 hypothetical protein; 26.4 49 0.0012 14.3 2.4 52 1-52 3-60 (113) 212 PRK05967 cystathionine beta-ly 26.2 49 0.0012 14.2 5.5 129 152-294 244-391 (392) 213 smart00187 INB Integrin beta s 26.1 49 0.0013 14.2 6.7 94 186-291 229-340 (423) 214 PRK13861 type IV secretion sys 25.9 39 0.001 14.8 1.6 11 55-65 71-81 (293) 215 pfam03932 CutC CutC family. Co 25.6 50 0.0013 14.2 6.3 71 191-264 117-198 (202) 216 PRK00178 tolB translocation pr 25.6 50 0.0013 14.2 3.0 29 134-162 135-163 (433) 217 PRK09578 periplasmic multidrug 25.5 50 0.0013 14.2 2.6 69 2-84 7-75 (385) 218 COG0540 PyrB Aspartate carbamo 25.4 51 0.0013 14.2 5.1 72 33-107 64-135 (316) 219 cd06302 PBP1_LsrB_Quorum_Sensi 25.0 52 0.0013 14.1 10.5 69 28-103 8-87 (298) 220 PRK02998 prsA peptidylprolyl i 24.8 52 0.0013 14.1 3.3 27 1-29 1-31 (283) 221 PRK04043 tolB translocation pr 24.8 52 0.0013 14.1 6.1 76 1-78 1-92 (419) 222 PRK00059 prsA peptidylprolyl i 24.7 52 0.0013 14.1 3.0 80 153-238 176-271 (336) 223 PTZ00151 translationally contr 24.7 52 0.0013 14.1 3.5 36 137-172 92-127 (173) 224 pfam02603 Hpr_kinase_N HPr Ser 24.6 52 0.0013 14.1 5.9 48 217-264 63-111 (127) 225 PRK11251 DNA-binding transcrip 24.5 53 0.0013 14.0 4.8 63 2-66 2-74 (109) 226 PRK10653 D-ribose transporter 24.4 53 0.0013 14.0 9.6 100 1-105 3-115 (295) 227 pfam05728 UPF0227 Uncharacteri 24.4 53 0.0013 14.0 6.7 20 128-147 35-54 (187) 228 PRK07812 O-acetylhomoserine am 24.3 53 0.0014 14.0 5.9 130 152-292 281-432 (436) 229 cd04299 GT1_Glycogen_Phosphory 24.3 51 0.0013 14.2 1.9 43 215-258 524-568 (778) 230 KOG2045 consensus 24.3 45 0.0012 14.5 1.7 17 27-43 859-875 (1493) 231 PRK11023 hypothetical protein; 24.3 53 0.0014 14.0 2.1 20 1-20 1-21 (191) 232 cd01453 vWA_transcription_fact 24.2 53 0.0014 14.0 5.7 55 221-284 122-177 (183) 233 PRK10796 LPS-assembly lipoprot 24.2 53 0.0014 14.0 3.8 42 1-42 1-53 (196) 234 cd07986 LPLAT_ACT14924-like Ly 24.2 53 0.0014 14.0 7.2 53 218-270 82-148 (210) 235 pfam08085 Entericidin Enterici 24.2 49 0.0013 14.2 1.8 18 1-18 1-21 (42) 236 TIGR02668 moaA_archaeal probab 24.2 53 0.0014 14.0 6.3 47 221-269 212-260 (324) 237 COG5645 Predicted periplasmic 24.1 38 0.00097 14.9 1.2 19 1-20 1-19 (80) 238 cd03818 GT1_ExpC_like This fam 24.1 54 0.0014 14.0 4.4 15 249-265 317-331 (396) 239 COG5567 Predicted small peripl 23.9 54 0.0014 14.0 2.6 25 1-25 1-27 (58) 240 cd06317 PBP1_ABC_sugar_binding 23.9 54 0.0014 14.0 6.8 33 71-104 55-88 (275) 241 COG2875 CobM Precorrin-4 methy 23.8 54 0.0014 14.0 3.5 44 56-104 14-57 (254) 242 PRK08249 cystathionine gamma-s 23.6 55 0.0014 13.9 4.9 77 152-232 244-330 (398) 243 TIGR00514 accC acetyl-CoA carb 23.5 55 0.0014 13.9 4.0 79 186-264 71-159 (451) 244 PRK04870 histidinol-phosphate 23.4 55 0.0014 13.9 7.9 14 280-293 343-356 (356) 245 KOG4549 consensus 23.3 55 0.0014 13.9 4.4 47 215-261 39-86 (144) 246 pfam11777 DUF3316 Protein of u 23.3 45 0.0012 14.5 1.5 52 1-52 1-57 (114) 247 PRK09028 cystathionine beta-ly 23.2 56 0.0014 13.9 5.3 129 152-292 241-388 (394) 248 pfam05628 Borrelia_P13 Borreli 23.2 25 0.00063 16.1 0.1 23 1-23 1-23 (165) 249 TIGR00469 pheS_mito phenylalan 22.9 56 0.0014 13.9 2.4 13 67-79 164-176 (460) 250 PRK07503 methionine gamma-lyas 22.9 57 0.0014 13.9 7.2 77 152-232 247-333 (403) 251 PRK09837 copper/silver efflux 22.8 57 0.0014 13.8 2.2 20 1-20 1-20 (460) 252 COG0725 ModA ABC-type molybdat 22.4 58 0.0015 13.8 6.0 64 22-93 30-96 (258) 253 pfam01591 6PF2K 6-phosphofruct 22.3 58 0.0015 13.8 6.8 33 216-248 104-136 (223) 254 PRK10002 outer membrane protei 22.2 58 0.0015 13.8 1.9 21 1-21 2-22 (362) 255 PRK08248 O-acetylhomoserine am 22.2 58 0.0015 13.8 5.0 131 152-293 277-429 (431) 256 CHL00076 chlB photochlorophyll 22.1 59 0.0015 13.7 16.2 60 199-263 320-394 (510) 257 pfam04076 BOF Bacterial OB fol 22.0 50 0.0013 14.2 1.5 19 1-19 1-19 (126) 258 TIGR02155 PA_CoA_ligase phenyl 22.0 59 0.0015 13.7 3.9 99 185-289 114-218 (431) 259 PTZ00333 triosephosphate isome 21.9 59 0.0015 13.7 8.3 63 187-264 161-234 (252) 260 pfam10566 Glyco_hydro_97 Glyco 21.8 59 0.0015 13.7 7.2 74 197-271 315-404 (643) 261 PRK09304 arginine exporter pro 21.8 43 0.0011 14.6 1.1 13 127-139 107-119 (197) 262 TIGR02478 6PF1K_euk 6-phosphof 21.8 60 0.0015 13.7 5.3 159 133-292 342-569 (777) 263 cd06451 AGAT_like Alanine-glyo 21.7 60 0.0015 13.7 7.8 27 260-293 329-355 (356) 264 TIGR01485 SPP_plant-cyano sucr 21.7 60 0.0015 13.7 2.5 24 132-155 92-115 (257) 265 PRK05994 O-acetylhomoserine am 21.6 60 0.0015 13.7 5.3 129 152-291 276-426 (426) 266 TIGR01781 Trep_dent_lipo Trepo 21.4 61 0.0015 13.7 1.9 17 5-21 8-24 (463) 267 cd06319 PBP1_ABC_sugar_binding 21.3 61 0.0016 13.6 7.2 33 71-104 54-87 (277) 268 cd07983 LPLAT_DUF374-like Lyso 21.2 61 0.0016 13.6 7.3 67 195-264 61-132 (189) 269 PRK05586 biotin carboxylase; V 21.1 57 0.0015 13.8 1.7 17 184-200 270-286 (447) 270 cd01451 vWA_Magnesium_chelatas 21.1 61 0.0016 13.6 6.7 53 216-268 113-168 (178) 271 pfam07312 DUF1459 Protein of u 21.1 53 0.0014 14.0 1.5 28 2-29 3-30 (84) 272 PRK08861 cystathionine gamma-s 20.9 62 0.0016 13.6 5.1 131 152-293 234-385 (388) 273 COG4047 Uncharacterized protei 20.9 62 0.0016 13.6 10.3 102 159-271 113-215 (243) 274 TIGR01990 bPGM beta-phosphoglu 20.9 62 0.0016 13.6 4.8 81 193-279 65-153 (190) 275 PRK10554 outer membrane porin 20.8 62 0.0016 13.6 1.8 21 1-21 1-21 (373) 276 PRK11337 DNA-binding transcrip 20.6 63 0.0016 13.6 8.2 25 221-245 201-225 (293) 277 COG5510 Predicted small secret 20.5 63 0.0016 13.5 2.0 19 1-19 2-23 (44) 278 PRK10159 outer membrane phosph 20.4 64 0.0016 13.5 1.8 21 1-21 1-21 (351) 279 TIGR02294 nickel_nikA nickel A 20.3 64 0.0016 13.5 4.6 18 130-147 102-119 (513) 280 pfam00838 TCTP Translationally 20.3 64 0.0016 13.5 3.7 36 137-172 88-123 (166) 281 COG4851 CamS Protein involved 20.2 64 0.0016 13.5 2.0 87 1-94 1-94 (382) 282 pfam11672 DUF3268 Protein of u 20.2 56 0.0014 13.9 1.5 32 176-207 64-95 (102) 283 TIGR00885 fucP L-fucose:H+ sym 20.2 44 0.0011 14.5 0.9 63 22-88 130-196 (427) 284 COG0287 TyrA Prephenate dehydr 20.2 64 0.0016 13.5 4.9 120 170-291 96-241 (279) 285 PRK11372 lysozyme inhibitor; P 20.0 65 0.0016 13.5 1.8 18 1-20 3-20 (109) No 1 >PRK09545 znuA high-affinity zinc transporter periplasmic component; Reviewed Probab=100.00 E-value=0 Score=519.64 Aligned_cols=291 Identities=33% Similarity=0.585 Sum_probs=265.3 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCEEEEEECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHCCCCEEEEECC Q ss_conf 15899999999996001323880599972289999999707944178870589983533568024566502578999676 Q gi|254780717|r 2 KNFLIILIFLFFILSSVARAGSLQVVVSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRISEAMMLENADIIFWLGA 81 (294) Q Consensus 2 kk~l~~~~~~~~~l~~~~~~~~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~G~ 81 (294) ||.|+++.++.+++++.+.+++++||||++|+++|+++|+||+++|++|+|+|+|||+|+|||+|+++|++||++||||. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~ak~~Vvtsi~Pl~~l~~~I~gd~~~V~~Lvp~g~dPH~yep~Psd~~~l~~ADli~~~G~ 80 (308) T PRK09545 1 KKTLLFAALSAALWGGATQAANAAVVASIKPLGFIASAIADGVTETEVLLPDGASPHDYSLRPSDVKRLQSADLVVWVGP 80 (308) T ss_pred CCCHHHHHHHHHHHCCCHHCCCCEEEEECHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHCCCEEEEECC T ss_conf 90449999999996550013898399987689999999848972799812899897467689999999860999999586 Q ss_pred CCCHHHHHHHHHCCCCCCEEEEEECCCCCCCCC---------------CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 610356899973145576799751354462100---------------12111233212247567765555678999999 Q gi|254780717|r 82 EMESFLVKPLHSLNKQSNVVTLSHSPDLHRILL---------------RDNHSHFHDSEADDLHLWLNPLNVQYIAHVIA 146 (294) Q Consensus 82 ~~E~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~---------------~~~~~~~~~~~~~dpH~Wl~p~n~~~~a~~I~ 146 (294) +||+|+++.+.+.... +++.++...++.++.. .++++++|+++..|||+||||.|++.++++|+ T Consensus 81 ~lE~~l~k~l~~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~~~~h~~~~~~~~~~h~hg~~DPH~WldP~na~~~a~~I~ 159 (308) T PRK09545 81 EMEAFLEKPVSKLPEN-KQVTIAQLPDVKPLLMKSAHHDEDEDDHDHAAEKHDEDHHHGDYNMHLWLSPEIARATAVAIH 159 (308) T ss_pred CHHHHHHHHHHHCCCC-CEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 5677899999736657-806998227854233345456555555554445567655569999714479999999999999 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCHHHHHHHHCCCCEEC-CCCCCCCCCCHHHHH Q ss_conf 9974218656588998999986312223589998503002781462062157789874981200-346775444978899 Q gi|254780717|r 147 MELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKKIIVFHEAYRYFASHYNLSIVT-FPMSHSVFMGAASLR 225 (294) Q Consensus 147 ~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~~i~~H~a~~Yf~~~~gl~~~~-~~~~~~~~~s~~~l~ 225 (294) ++|+++||+|+++|++|+++|.++|++|++++++.+++.++|.|+|+||+|+||+++|||++.+ ++.+++.+||+++|. T Consensus 160 ~~L~~~dP~~~~~y~~N~~~~~~~L~~ld~~~~~~la~~~~r~~v~~HdAf~Yfa~~ygL~~~~~~~~~~e~~Ps~~~l~ 239 (308) T PRK09545 160 DKLVELMPQSKAKLDANLKDFEAQLAATDKQIGNQLAPVKGKGYFVFHDAYGYFEKHYGLTPLGHFTVNPEIQPGAQRLH 239 (308) T ss_pred HHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHCCCEEEEEECCCCCCCCCHHHHH T ss_conf 99998684517999999999999999999999997257778869997720799999869938788712877678999999 Q ss_pred HHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 99999975499389981888978999999971990899805336787781678999999999998741 Q gi|254780717|r 226 NIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMSAILDPEGMLIAEGPELYFQLMRSMSNSIAKNC 293 (294) Q Consensus 226 ~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~~~ld~~~~~~~~~~~~Y~~~m~~n~~~l~k~l 293 (294) ++++.+|+++|+|||.||+++++.++.|++++|+++++|||+|+..+.|+++|.+||++++++|+..| T Consensus 240 ~l~~~ik~~~v~~If~Ep~~~~k~~~~ia~~~g~kv~~lDpLg~~~~~~~~~Y~~~m~~~~~~~~~CL 307 (308) T PRK09545 240 EIRTQLVEHKATCVFAEPQFRPAVIESVAKGTSVRMGTLDPLGTNIKLGKTSYSNFLSQLANQYASCL 307 (308) T ss_pred HHHHHHHHHCCCEEEECCCCCHHHHHHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 99999998399799965999949999999865985999678755688881179999999999999973 No 2 >cd01019 ZnuA Zinc binding protein ZnuA. These proteins have been shown to function as initial receptors in the ABC uptake of Zn2+. They belong to the TroA superfamily of periplasmic metal binding proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a single helix and bind their specific ligands in the cleft between these domains. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence). Probab=100.00 E-value=0 Score=494.35 Aligned_cols=272 Identities=35% Similarity=0.604 Sum_probs=248.0 Q ss_pred CCCEEEEEECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCE Q ss_conf 38805999722899999997079441788705899835335680245665025789996766103568999731455767 Q gi|254780717|r 21 AGSLQVVVSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRISEAMMLENADIIFWLGAEMESFLVKPLHSLNKQSNV 100 (294) Q Consensus 21 ~~~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~ 100 (294) +.+++||||++|+++|+++|+||+++|++|+|+|+|||+|||||+|+++|++||+|||||.+||+|+.+++++.. +.++ T Consensus 1 aak~~Vvts~~pl~~i~~~I~Gd~v~V~~L~p~g~dpH~yep~p~d~~~l~~Adliv~~G~~lE~wl~k~~~~~~-~~~~ 79 (286) T cd01019 1 AAEASVLTSIKPLGFIAAAIMGGVGEVEVLVPPGASPHDYELRPSDARKLQEADLVVWIGPDLEAFLDKVLQGRK-KGKV 79 (286) T ss_pred CCCCEEEEECHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHCCCEEEEECCCCHHHHHHHHHHCC-CCCE T ss_conf 999879998779999999982895379980599989736768999999996699999969972468999997477-8883 Q ss_pred EEEEECCCCCCCCC-------------CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 99751354462100-------------12111233212247567765555678999999997421865658899899998 Q gi|254780717|r 101 VTLSHSPDLHRILL-------------RDNHSHFHDSEADDLHLWLNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEF 167 (294) Q Consensus 101 i~~~~~~~~~~~~~-------------~~~~~~~~~~~~~dpH~Wl~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~ 167 (294) +.+..+.++..... .++++++|++++.|||+||||.|++.++++|+++|+++||+|+++|++|+++| T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~~~~DPH~Wldp~~~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~ 159 (286) T cd01019 80 LTLAKLIDLKTLEDGASHGDHEHDHEHAHGEHDGHEEGGLDPHLWLSPENAAEVAQAVAEKLSALDPDNAATYAANLEAF 159 (286) T ss_pred EEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHH T ss_conf 78533556554334443455455555555654445568999816348477999999999999886822099999999999 Q ss_pred HHHHHHHHHHHHHHCCCCCCCEEEECCCHHHHHHHHCCCCEEC-CCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 6312223589998503002781462062157789874981200-346775444978899999999754993899818889 Q gi|254780717|r 168 KNQLSQLDKELHSILQPVEKKKIIVFHEAYRYFASHYNLSIVT-FPMSHSVFMGAASLRNIRSKIISDKISCLFYGPEFD 246 (294) Q Consensus 168 ~~~L~~l~~~~~~~~~~~~~~~~i~~H~a~~Yf~~~~gl~~~~-~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~ 246 (294) .++|++|++++++.++++++|.|+++||+|+||+++|||++.+ ++.+++.+||+++|.++++.+|+++|+|||+|++++ T Consensus 160 ~~~L~~l~~~~~~~l~~~~~~~~v~~H~af~Yfa~~ygl~~~~~~~~~~~~eps~~~l~~l~~~ik~~~v~~If~e~~~~ 239 (286) T cd01019 160 NARLAELDATIKERLAPVKTKPFFVFHDAYGYFEKRYGLTQAGVFTIDPEIDPGAKRLAKIRKEIKEKGATCVFAEPQFH 239 (286) T ss_pred HHHHHHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCC T ss_conf 99999999999997445565228985663689998669838678722467789999999999999983998899828989 Q ss_pred HHHHHHHHHHHCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 78999999971990899805336787781678999999999998741 Q gi|254780717|r 247 SKIIRSITNDTGVMSAILDPEGMLIAEGPELYFQLMRSMSNSIAKNC 293 (294) Q Consensus 247 ~~~~~~ia~e~~~~~~~ld~~~~~~~~~~~~Y~~~m~~n~~~l~k~l 293 (294) ++.++.|++++|+++++|||+|...+.++++|++||+.|+++|+++| T Consensus 240 ~k~a~~ia~e~g~~v~~ld~l~~~~~~~~~~Y~~~m~~n~~~i~~aL 286 (286) T cd01019 240 PKIAETLAEGTGAKVGELDPLGGLIELGKNSYVNFLRNLADSLASCL 286 (286) T ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHC T ss_conf 39999999971993899637767788874069999999999999769 No 3 >cd01017 AdcA Metal binding protein AcdA. These proteins have been shown to function in the ABC uptake of Zn2+ and Mn2+ and in competence for genetic transformation and adhesion. The AcdA proteins belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a long alpha helix and they bind their ligand in the cleft between these domains. In addition, many of these proteins have a low complexity region containing metal binding histidine-rich motif (repetitive HDH sequence). Probab=100.00 E-value=0 Score=479.90 Aligned_cols=273 Identities=26% Similarity=0.429 Sum_probs=247.2 Q ss_pred CCCEEEEEECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCC-CCC Q ss_conf 3880599972289999999707944178870589983533568024566502578999676610356899973145-576 Q gi|254780717|r 21 AGSLQVVVSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRISEAMMLENADIIFWLGAEMESFLVKPLHSLNK-QSN 99 (294) Q Consensus 21 ~~~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~G~~~E~~~~~~~~~~~~-~~~ 99 (294) +++++||||++|+++||++|+||.++|++|+|+|+|||+|+|+|+|+++|++||+|||||.+||+|+.+++++... +.. T Consensus 1 sgki~Vv~S~~p~~~lv~~I~Gd~v~V~~lv~~g~dPH~yep~p~d~~~l~~Adli~~nG~~lE~~l~~~~~~~~~~~~~ 80 (282) T cd01017 1 SGKLKVVTTFYPLYEFTKAIGGDKADVKLIIPAGTEPHDFEPSPKDIARIADADVFVYNGLGMETWAEKVLKSLQNKKLK 80 (282) T ss_pred CCCCEEEEECHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHCCCEEEEECCCCHHHHHHHHHHCCCCCCE T ss_conf 99828999888999999998199448998159897963675899999999669999991878368999999853689977 Q ss_pred EEEEEECCCCCCC---CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 7997513544621---0012111233212247567765555678999999997421865658899899998631222358 Q gi|254780717|r 100 VVTLSHSPDLHRI---LLRDNHSHFHDSEADDLHLWLNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDK 176 (294) Q Consensus 100 ~i~~~~~~~~~~~---~~~~~~~~~~~~~~~dpH~Wl~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~ 176 (294) ++..+.+..+... ...+.++++|+++..|||+||||.|+..++++|+++|+++||+|++.|++|+++|.++|..|++ T Consensus 81 vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~DPH~Wldp~~~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~ 160 (282) T cd01017 81 VVEASKGIKLLKAGGAEHDHDHSHSHHHGDYDPHVWLSPVLAIQQVENIKDALIKLDPDNKEYYEKNAAAYAKKLEALDQ 160 (282) T ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99877895235567765555555445579999825578155999999999999986844179999999999999999999 Q ss_pred HHHHHCCCCCCCEEEECCCHHHHHHHHCCCCEECC-CCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHH Q ss_conf 99985030027814620621577898749812003-46775444978899999999754993899818889789999999 Q gi|254780717|r 177 ELHSILQPVEKKKIIVFHEAYRYFASHYNLSIVTF-PMSHSVFMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITN 255 (294) Q Consensus 177 ~~~~~~~~~~~~~~i~~H~a~~Yf~~~~gl~~~~~-~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~ 255 (294) ++++.++++++|.|+++||+|+||+++|||++.++ +.+++.+||++++.++++.+|+++|+|||+|++++++.++.|++ T Consensus 161 ~~~~~l~~~~~~~~v~~H~af~Yf~~~ygl~~~~~~~~~~~~eps~~~l~~l~~~ik~~~v~~If~e~~~~~~~~~~ia~ 240 (282) T cd01017 161 EYRAKLAKAKGKTFVTQHAAFGYLARRYGLKQIAIVGVSPEVEPSPKQLAELVEFVKKSDVKYIFFEENASSKIAETLAK 240 (282) T ss_pred HHHHHHHCCCCCCEEEECCHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHH T ss_conf 99998744367856873533789998669848654334888899999999999999985998999818999099999999 Q ss_pred HHCCCEEEECCCCCCCCC---CCCCHHHHHHHHHHHHHHHC Q ss_conf 719908998053367877---81678999999999998741 Q gi|254780717|r 256 DTGVMSAILDPEGMLIAE---GPELYFQLMRSMSNSIAKNC 293 (294) Q Consensus 256 e~~~~~~~ld~~~~~~~~---~~~~Y~~~m~~n~~~l~k~l 293 (294) ++|++++.|||++....+ ++++|++||++|+++|.++| T Consensus 241 e~g~~v~~l~pl~~~~~~~~~~~~~Y~~~m~~N~~~i~~aL 281 (282) T cd01017 241 ETGAKLLVLNPLETLTKEEIDDGKDYFSLMKENLETLKRAL 281 (282) T ss_pred HHCCCEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 81996785447666652348873669999999999999970 No 4 >cd01137 PsaA Metal binding protein PsaA. These proteins have been shown to function as initial receptors in ABC transport of Mn2+ and as surface adhesins in some eubacterial species. They belong to the TroA superfamily of periplasmic metal binding proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence). Probab=100.00 E-value=0 Score=478.88 Aligned_cols=267 Identities=23% Similarity=0.376 Sum_probs=242.6 Q ss_pred CCCCEEEEEECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCC Q ss_conf 23880599972289999999707944178870589983533568024566502578999676610356899973145576 Q gi|254780717|r 20 RAGSLQVVVSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRISEAMMLENADIIFWLGAEMESFLVKPLHSLNKQSN 99 (294) Q Consensus 20 ~~~~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~ 99 (294) ++++++||||++|+++||++|+||+++|++|+|+|+|||+|||+|+|+++|++||+|||||++||+|+.+++++..++.. T Consensus 14 ~~~k~~Vvtt~~pl~~~v~~I~Gd~v~V~~Lvp~g~dPH~yep~p~d~~~l~~Adliv~~G~~lE~~~~k~~~~~~~~~~ 93 (287) T cd01137 14 AASKLKVVATFSILADIARNIAGDRVNVTSIVPPGADPHEYEPTPSDIKKLSKADLILYNGLNLEPWLERLVKNAGKDVP 93 (287) T ss_pred CCCCCEEEEECHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHCCCEEEECCCCCHHHHHHHHHHCCCCCC T ss_conf 58996699987699999999818845999821879897568899999999957999998588741789999996066898 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 79975135446210012111233212247567765555678999999997421865658899899998631222358999 Q gi|254780717|r 100 VVTLSHSPDLHRILLRDNHSHFHDSEADDLHLWLNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDKELH 179 (294) Q Consensus 100 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~dpH~Wl~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~ 179 (294) ++.++.+.+.... .++|+++..|||+|+||.|++.++++|+++|+++||+|+++|++|+++|.++|++|+++++ T Consensus 94 ~v~~~~~i~~~~~------~~~~~~~~~DPHiWldp~~~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~Ld~~~~ 167 (287) T cd01137 94 VVAVSEGIDPIPL------EEGHYKGKPDPHAWMSPKNAIIYVKNIAKALSEADPANAETYQKNAAAYKAKLKALDEWAK 167 (287) T ss_pred EEECCCCCCCCCC------CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCHHCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8953588431045------5667889999863579999999999999999986821228999999999999999999999 Q ss_pred HHCC--CCCCCEEEECCCHHHHHHHHCCCCEEC-CCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHH Q ss_conf 8503--002781462062157789874981200-3467754449788999999997549938998188897899999997 Q gi|254780717|r 180 SILQ--PVEKKKIIVFHEAYRYFASHYNLSIVT-FPMSHSVFMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITND 256 (294) Q Consensus 180 ~~~~--~~~~~~~i~~H~a~~Yf~~~~gl~~~~-~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e 256 (294) +.++ |..+|.++++||+|+||+++|||++.+ ++.+++.+||+++|.++++.+|+++|+|||+|+|++++.++.|+++ T Consensus 168 ~~l~~vp~~~~~~vt~H~af~Yfa~~ygl~~~~~~~~~~~~eps~~~l~~l~~~ik~~~i~~if~E~~~~~k~a~~ia~e 247 (287) T cd01137 168 AKFATIPAEKRKLVTSEGAFSYFAKAYGLKEAYLWPINTEEEGTPKQVATLIEQVKKEKVPAVFVESTVNDRLMKQVAKE 247 (287) T ss_pred HHHHCCCCCCCEEEEECCHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHH T ss_conf 99724874355578517416899986799177644127866669999999999998529978998589990999999997 Q ss_pred HCCCEEE---ECCCCCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 1990899---805336787781678999999999998741 Q gi|254780717|r 257 TGVMSAI---LDPEGMLIAEGPELYFQLMRSMSNSIAKNC 293 (294) Q Consensus 257 ~~~~~~~---ld~~~~~~~~~~~~Y~~~m~~n~~~l~k~l 293 (294) +|++++. +|+++..... +++|++||++|+++|.++| T Consensus 248 ~g~~v~~~l~~d~l~~~~~~-~~tY~~~m~~N~~~l~~aL 286 (287) T cd01137 248 TGAKIGGQLYTDSLSEKGGP-ADTYLDMMEHNLDTIVEGL 286 (287) T ss_pred CCCCCCCCCCCCCCCCCCCC-CCCHHHHHHHHHHHHHHHC T ss_conf 29971462565567888998-3759999999999999864 No 5 >cd01016 TroA Metal binding protein TroA. These proteins have been shown to function as initial receptors in ABC transport of Zn2+ and possibly Fe3+ in many eubacterial species. The TroA proteins belong to the TroA superfamily of periplasmic metal binding proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence). Probab=100.00 E-value=0 Score=466.68 Aligned_cols=264 Identities=20% Similarity=0.313 Sum_probs=235.8 Q ss_pred CEEEEEECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCEEE Q ss_conf 80599972289999999707944178870589983533568024566502578999676610356899973145576799 Q gi|254780717|r 23 SLQVVVSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRISEAMMLENADIIFWLGAEMESFLVKPLHSLNKQSNVVT 102 (294) Q Consensus 23 ~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~i~ 102 (294) +|+||||+.||++||++|+||+++|++|+|+|+|||+|||+|+|+++|++||++|+||++||+|+.+++.++..+..++. T Consensus 1 kP~Vvtt~~pl~~iv~~I~Gd~v~V~~lv~~g~dPH~yep~p~d~~~l~~ADlii~~G~~lE~~~~~~~~~~~~~~~v~~ 80 (276) T cd01016 1 KPNVVTTTGMIADAVENIGGDHVEVTGLMGPGVDPHLYKATAGDVEKLQNADVVFYNGLHLEGKMSDVLSKLGSSKSVIA 80 (276) T ss_pred CCEEEEECCHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHCCCEEEECCCCCHHHHHHHHHHCCCCCCEEE T ss_conf 99899968099999999808816999802889896468799999999967999998488631789999985458984797 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 75135446210012111233212247567765555678999999997421865658899899998631222358999850 Q gi|254780717|r 103 LSHSPDLHRILLRDNHSHFHDSEADDLHLWLNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHSIL 182 (294) Q Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~dpH~Wl~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~~ 182 (294) .+..... ....++++++..|||+||||.|++.++++|+++|+++||+|++.|++|+++|.++|++|+++++..+ T Consensus 81 ~~~~~~~------~~~~~~~~~~~~DPHiWldp~~~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~Ld~~~~~~l 154 (276) T cd01016 81 LEDTLDR------SQLILDEEEGTYDPHIWFDVKLWKYAVKAVAEVLSEKLPEHKDEFQANSEAYVEELDSLDAYAKKKI 154 (276) T ss_pred EECCCCC------CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 0246674------4456555689999864479999999999999999986831289999999999999999999999986 Q ss_pred C--CCCCCEEEECCCHHHHHHHHCCCCEECC-CCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCC Q ss_conf 3--0027814620621577898749812003-467754449788999999997549938998188897899999997199 Q gi|254780717|r 183 Q--PVEKKKIIVFHEAYRYFASHYNLSIVTF-PMSHSVFMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGV 259 (294) Q Consensus 183 ~--~~~~~~~i~~H~a~~Yf~~~~gl~~~~~-~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~ 259 (294) + |.++|.++|+|+||+||+++|||++.++ +.+++.+|||++|.++++.+|+++|+|||+|++++++.++.|+++++. T Consensus 155 ~~~p~~~r~~vt~H~Af~Yfa~~ygl~~~~~~~~~~~~epsp~~l~~li~~ik~~~v~~If~E~~~~~k~~~~i~~~~~a 234 (276) T cd01016 155 AEIPEQQRVLVTAHDAFGYFGRAYGFEVKGLQGISTDSEAGLRDINELVDLIVERKIKAIFVESSVNQKSIEALQDAVKA 234 (276) T ss_pred HCCCCCCEEEEEECCCHHHHHHHCCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHC T ss_conf 22555560899846406779986698273142468888889799999999999839999998489995999999999833 Q ss_pred CEEEEC--------CCCCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 089980--------5336787781678999999999998741 Q gi|254780717|r 260 MSAILD--------PEGMLIAEGPELYFQLMRSMSNSIAKNC 293 (294) Q Consensus 260 ~~~~ld--------~~~~~~~~~~~~Y~~~m~~n~~~l~k~l 293 (294) +.+.+. +++... .++++|++||++|+++|+++| T Consensus 235 ~~~~~~~~~~l~~d~~~~~~-~~~~tY~~~M~~N~~~l~eaL 275 (276) T cd01016 235 RGHDVQIGGELYSDAMGEEG-TSEGTYIGMFKHNVDTIVEAL 275 (276) T ss_pred CCCEEEECCCCCCCCCCCCC-CCCCCHHHHHHHHHHHHHHHH T ss_conf 57424316712323678889-987778999999999999972 No 6 >cd01018 ZntC Metal binding protein ZntC. These proteins are predicted to function as initial receptors in ABC transport of metal ions. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a long alpha helix and bind their specific ligands in the cleft between these domains. In addition, many of these proteins possess a metal-binding histidine-rich motif (repetitive HDH sequence). Probab=100.00 E-value=0 Score=455.33 Aligned_cols=260 Identities=25% Similarity=0.415 Sum_probs=235.9 Q ss_pred CCEEEEEECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHCCCCEEEEECCCCC-HHHHHHHHHCCCCCCE Q ss_conf 880599972289999999707944178870589983533568024566502578999676610-3568999731455767 Q gi|254780717|r 22 GSLQVVVSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRISEAMMLENADIIFWLGAEME-SFLVKPLHSLNKQSNV 100 (294) Q Consensus 22 ~~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~G~~~E-~~~~~~~~~~~~~~~~ 100 (294) ++++||||++|+++||++|+||+++|++|+|+|+|||+|+|||+|+++|++||+|||||++|| +|+.++... .++..+ T Consensus 1 dkp~Vvtsi~pl~~~v~~I~gd~v~V~~Lvp~g~dPH~ye~~psd~~~l~~ADlvv~~G~~lE~~~~~~l~~~-~~~~~v 79 (266) T cd01018 1 DKPTVAVSIEPQKYFVEKIAGDTVDVVVLVPPGSNPHTYEPKPQQMKKLSEADLYFRIGLGFEEVWLERFRSN-NPKMQV 79 (266) T ss_pred CCCEEEEECHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHCCCEEEEECCCHHHHHHHHHHHC-CCCCEE T ss_conf 9988999777999999998399348998069997964686999999999669899995873458899999960-899828 Q ss_pred EEEEECCCCCCCC----CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 9975135446210----012111233212247567765555678999999997421865658899899998631222358 Q gi|254780717|r 101 VTLSHSPDLHRIL----LRDNHSHFHDSEADDLHLWLNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDK 176 (294) Q Consensus 101 i~~~~~~~~~~~~----~~~~~~~~~~~~~~dpH~Wl~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~ 176 (294) +.++.+..+.... ..++++++|+++..|||+||||.|++.++++|+++|+++||+|++.|++|+++|.++|++|++ T Consensus 80 v~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~DPH~Wldp~~~~~~a~~Ia~~L~~ldP~~~~~y~~N~~~~~~~L~~L~~ 159 (266) T cd01018 80 VNMSKGITLIPMADHHHHHHGEHEHHHHGNYDPHIWLSPANAKIMAENIYEALAELDPQNATYYQANLDALLAELDALDS 159 (266) T ss_pred EECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98678951044566445556654555568999846578676999999999999886935089999999999999999999 Q ss_pred HHHHHCCCCCCCEEEECCCHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHH Q ss_conf 99985030027814620621577898749812003467754449788999999997549938998188897899999997 Q gi|254780717|r 177 ELHSILQPVEKKKIIVFHEAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITND 256 (294) Q Consensus 177 ~~~~~~~~~~~~~~i~~H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e 256 (294) ++++.++++++|.++++||+|+||+++|||++.++. .++.+||+++|.++++.||+++|+|||.|++++++.++.|++| T Consensus 160 ~~~~~l~~~~~~~~v~~Hdaf~Yf~~~ygl~~~~i~-~~~~eps~~~l~~l~~~ik~~~v~~If~E~~~~~k~~~~ia~e 238 (266) T cd01018 160 EIRTILSKLKQRAFMVYHPAWGYFARDYGLTQIPIE-EEGKEPSPADLKRLIDLAKEKGVRVVFVQPQFSTKSAEAIARE 238 (266) T ss_pred HHHHHHCCCCCCEEEEECCCHHHHHHHCCCEEEECC-CCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHH T ss_conf 999973776787389965447999986798798605-8888989999999999999839989997089990999999997 Q ss_pred HCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 1990899805336787781678999999999998 Q gi|254780717|r 257 TGVMSAILDPEGMLIAEGPELYFQLMRSMSNSIA 290 (294) Q Consensus 257 ~~~~~~~ld~~~~~~~~~~~~Y~~~m~~n~~~l~ 290 (294) +|+++++|||+|+. |.+.|+++++.|+ T Consensus 239 tg~kv~~ldpL~~~-------~~~~~~~~~~~~~ 265 (266) T cd01018 239 IGAKVVTIDPLAAD-------WEENLLKVADAFA 265 (266) T ss_pred HCCCEEEECCCCCC-------HHHHHHHHHHHHC T ss_conf 09937996786402-------7999999998742 No 7 >pfam01297 SBP_bac_9 Periplasmic solute binding protein family. This family includes periplasmic solute binding proteins such as TroA that interacts with an ATP-binding cassette transport system in Treponema pallidum. Probab=100.00 E-value=0 Score=450.54 Aligned_cols=266 Identities=32% Similarity=0.516 Sum_probs=244.2 Q ss_pred EEEECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCEEEEEE Q ss_conf 99972289999999707944178870589983533568024566502578999676610356899973145576799751 Q gi|254780717|r 26 VVVSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRISEAMMLENADIIFWLGAEMESFLVKPLHSLNKQSNVVTLSH 105 (294) Q Consensus 26 Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~i~~~~ 105 (294) ||||++|++++|++|+||+++|++|+|+|.|||+|+|+|+|+++|++||+|||||++||+|+.++++...++.+++.+++ T Consensus 1 Vv~t~~pl~~iv~~i~gd~~~V~~l~~~g~dpH~y~~~p~d~~~l~~ADlii~~G~~~E~~l~~~~~~~~~~~~~v~~~~ 80 (272) T pfam01297 1 VVASIPPLADLVKAIGGDKVEVTSLVPPGADPHTYEPTPSDIKKLAKADLVVYNGAGLEPWLDKLLASLANKVKVVDLSE 80 (272) T ss_pred CEEECHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHCCCEEEECCCCCHHHHHHHHHHCCCCCEEEECCC T ss_conf 98965799999999808934899806999796577689999999956999999298716789999985689984898357 Q ss_pred CCCCCCCC---CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 35446210---012111233212247567765555678999999997421865658899899998631222358999850 Q gi|254780717|r 106 SPDLHRIL---LRDNHSHFHDSEADDLHLWLNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHSIL 182 (294) Q Consensus 106 ~~~~~~~~---~~~~~~~~~~~~~~dpH~Wl~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~~ 182 (294) ........ ..++++++|+++..|||+|+||.|++.++++|+++|+++||++++.|++|+++|.++|+++++++++.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~dPH~Wldp~~~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~l~~l~~~~~~~l 160 (272) T pfam01297 81 GIELLDAPGHEHDHDEHDHDDHGHGDPHIWLDPKNAKAMAEAIADALSELDPENAATYEKNAAAFLKKLDELDAEIKAKL 160 (272) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 86555576545655555555678999806268277999999999999986854089999999999999999999999875 Q ss_pred CCCCCCEEEECCCHHHHHHHHCCCCEEC-CCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCE Q ss_conf 3002781462062157789874981200-346775444978899999999754993899818889789999999719908 Q gi|254780717|r 183 QPVEKKKIIVFHEAYRYFASHYNLSIVT-FPMSHSVFMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMS 261 (294) Q Consensus 183 ~~~~~~~~i~~H~a~~Yf~~~~gl~~~~-~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~ 261 (294) ++++++.|+++||+|+||+++|||++.+ ++.+++.+||++++.++++.+|+++|+|||+|++++++.++.|++++|+++ T Consensus 161 ~~~~~~~~v~~H~af~Y~~~~~gl~~~~~~~~~~~~eps~~~l~~l~~~ik~~~v~~if~e~~~~~~~~~~ia~~~g~~~ 240 (272) T pfam01297 161 APIPGKRVITFHDAFGYFAKAYGLEQIAILGESPESEPSPADLAELIKLIKEHNVKVIFVEPQFSPKLAETLAEETGAKV 240 (272) T ss_pred HCCCCCEEEEECCCHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCE T ss_conf 13476438985550699999779927540300356788999999999999984998999838999099999999969976 Q ss_pred E--EECCCCCCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 9--98053367877816789999999999987419 Q gi|254780717|r 262 A--ILDPEGMLIAEGPELYFQLMRSMSNSIAKNCS 294 (294) Q Consensus 262 ~--~ld~~~~~~~~~~~~Y~~~m~~n~~~l~k~ls 294 (294) + .+|+++. .++++|++||+.|+++|+++|. T Consensus 241 ~~~~~d~l~~---~~~~~Y~~~m~~n~~~l~~al~ 272 (272) T pfam01297 241 VPLYLDPLGS---EGGDTYLELMRHNLDTLAEALG 272 (272) T ss_pred EEEECCCCCC---CCCCCHHHHHHHHHHHHHHHCC T ss_conf 8873578888---8847799999999999998649 No 8 >cd01020 TroA_b Metal binding protein TroA_b. These proteins are predicted to function as initial receptors in ABC transport of metal ions. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence). Probab=100.00 E-value=0 Score=443.40 Aligned_cols=254 Identities=19% Similarity=0.319 Sum_probs=223.2 Q ss_pred CCEEEEEECHHHHHHHHHHHCCCCEEEEEE-CCCCCCCCCCCCHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCE Q ss_conf 880599972289999999707944178870-5899835335680245665025789996766103568999731455767 Q gi|254780717|r 22 GSLQVVVSIKPIHSIVSCIMQGIGTPALLV-KGASSPHEYSLRISEAMMLENADIIFWLGAEMESFLVKPLHSLNKQSNV 100 (294) Q Consensus 22 ~~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~-~~g~dpH~y~~tp~d~~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~ 100 (294) |+++||||+.|++++|++|+||+++|++|+ +||+|||+|||||+|++++++||++||||++||+|+.++++.... .+ T Consensus 1 gkl~Vvat~~~l~dl~~~I~Gd~v~V~~li~~p~~dPH~yep~p~d~~~l~~Adlvv~nG~~lE~wl~k~~~~~~~--~~ 78 (264) T cd01020 1 GKINVVASTNFWGSVAEAVGGDHVEVTSIITNPDVDPHDFEPTPTDAAKVSTADIVVYNGGGYDPWMTKLLADTKD--VI 78 (264) T ss_pred CCCEEEEECHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHCCCEEEECCCCCHHHHHHHHHCCCC--CE T ss_conf 9737999771999999998288169999447999896568799999999962989999588816899999852899--66 Q ss_pred EEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99751354462100121112332122475677655556789999999974218656588998999986312223589998 Q gi|254780717|r 101 VTLSHSPDLHRILLRDNHSHFHDSEADDLHLWLNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHS 180 (294) Q Consensus 101 i~~~~~~~~~~~~~~~~~~~~~~~~~~dpH~Wl~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~ 180 (294) +.... ...+|++++.|||+||||.|++.++++|+++|+++||+|+++|++|+++|.++|++|+.++++ T Consensus 79 ~~~~~------------~~~~~~~~~~dPHiWldp~~~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~L~~~~~~ 146 (264) T cd01020 79 VIAAD------------LDGHDDKEGDNPHLWYDPETMSKVANALADALVKADPDNKKYYQANAKKFVASLKPLAAKIAE 146 (264) T ss_pred EEECC------------CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 88336------------666667799998937799999999999999999869021799999999999999999999998 Q ss_pred HCCCCCCCEEEECCCHHHHHHHHCCCCEEC-----CCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHH-- Q ss_conf 503002781462062157789874981200-----3467754449788999999997549938998188897899999-- Q gi|254780717|r 181 ILQPVEKKKIIVFHEAYRYFASHYNLSIVT-----FPMSHSVFMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSI-- 253 (294) Q Consensus 181 ~~~~~~~~~~i~~H~a~~Yf~~~~gl~~~~-----~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~i-- 253 (294) .+.+.+++.|+++|++|+||+++|||++.+ ....++.+||+++|.++++.+|+++|+|||+|++++++.++.| T Consensus 147 ~~~~~~~~~~v~~H~af~Yf~~~yGl~~~~~~~~~~~~~~~~eps~~~l~~l~~~ik~~~v~~If~e~q~~~~~a~~i~~ 226 (264) T cd01020 147 LSAKYKGAPVAATEPVFDYLLDALGMKERTPKGYTATTESETEPSPADIAAFQNAIKNRQIDALIVNPQQASSATTNITG 226 (264) T ss_pred HHCCCCCCEEEEECCCHHHHHHHCCCEEECCCCEEECCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHH T ss_conf 51457898699965417999997799450442034123788999999999999999986999999828988089999999 Q ss_pred -HHHHCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf -9971990899805336787781678999999999998741 Q gi|254780717|r 254 -TNDTGVMSAILDPEGMLIAEGPELYFQLMRSMSNSIAKNC 293 (294) Q Consensus 254 -a~e~~~~~~~ld~~~~~~~~~~~~Y~~~m~~n~~~l~k~l 293 (294) +++++++++.+. ... +.+.+|++||++|+++|.+|| T Consensus 227 ~a~~~~v~v~~l~---e~~-~~~~tY~~~M~~Nl~~i~~aL 263 (264) T cd01020 227 LAKRSGVPVVEVT---ETM-PNGTTYLTWMLKQVDQLEKAL 263 (264) T ss_pred HHHHCCCEEEECC---CCC-CCCCCHHHHHHHHHHHHHHHH T ss_conf 9985595698425---678-999989999999999999973 No 9 >COG0803 LraI ABC-type metal ion transport system, periplasmic component/surface adhesin [Inorganic ion transport and metabolism] Probab=100.00 E-value=0 Score=432.24 Aligned_cols=266 Identities=28% Similarity=0.428 Sum_probs=236.9 Q ss_pred CCEEEEEECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCC-CE Q ss_conf 88059997228999999970794417887058998353356802456650257899967661035689997314557-67 Q gi|254780717|r 22 GSLQVVVSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRISEAMMLENADIIFWLGAEMESFLVKPLHSLNKQS-NV 100 (294) Q Consensus 22 ~~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~-~~ 100 (294) ++++||||++|+++++++|+||.++|.+|+|+|.|||+|||+|+|++++++||++||||.+||+|+.+++.+..+.. .+ T Consensus 31 ~~~~Vvtt~~~~~d~~~~I~gd~~~V~~iv~~g~dpH~yepsp~di~~i~~ADliv~nG~~le~w~~k~~~~~~~~~~~~ 110 (303) T COG0803 31 AKLKVVTTFPPIADVVKNIAGDKVDVVSLVPPGADPHSYEPTPSDIAKLRKADLIVYNGLGLEPWLEKLLESADKKKVLV 110 (303) T ss_pred CCCEEEEECHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHCCEEEECCCCHHHHHHHHHHHCCCCCCEE T ss_conf 78769997667999999970886028984289999706889989999987499999848865778999998467877516 Q ss_pred EEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99751354462100121112332122475677655556789999999974218656588998999986312223589998 Q gi|254780717|r 101 VTLSHSPDLHRILLRDNHSHFHDSEADDLHLWLNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHS 180 (294) Q Consensus 101 i~~~~~~~~~~~~~~~~~~~~~~~~~~dpH~Wl~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~ 180 (294) +..+++..+..... . +.+|..|||+|+||.|++.++++|+++|+++||+|++.|++|+++|.++|++|+.+++. T Consensus 111 i~~s~~i~~~~~~~--~----~~~g~~dpH~Wldp~na~~~v~~I~~~L~~~dP~~~~~y~~N~~~y~~kL~~l~~~~~~ 184 (303) T COG0803 111 IEVSDGIELLPLPG--E----EEEGVNDPHVWLDPKNAKIYAENIADALVELDPENKETYEKNAEAYLKKLNKLDEEAKA 184 (303) T ss_pred EECCCCCCCCCCCC--C----CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 87147703456787--4----44689999820489999999999999998869413899999999999999999999999 Q ss_pred HCCCCC-CCEEEECCCHHHHHHHHCCCCEECC-CCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHC Q ss_conf 503002-7814620621577898749812003-46775444978899999999754993899818889789999999719 Q gi|254780717|r 181 ILQPVE-KKKIIVFHEAYRYFASHYNLSIVTF-PMSHSVFMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTG 258 (294) Q Consensus 181 ~~~~~~-~~~~i~~H~a~~Yf~~~~gl~~~~~-~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~ 258 (294) ++...+ +|.++|+|++|+||+++||+.+..+ +.+++.+||+++++++++++|++++++||.|++++++.++.|++|+| T Consensus 185 ~~~~~~~~r~~vt~h~af~Y~~~~~g~~~~~i~~~~~~~e~s~~~l~~l~~~ik~~~i~~If~e~~~~~~~~~~la~e~g 264 (303) T COG0803 185 KLSKIPAQRDVVTSHGAFGYLARDYGLKQVAIAGISPEAEPSPKDLAKLVDLIKKKNIKAIFVESNVSSKSAETLAKETG 264 (303) T ss_pred HHHCCCCCCEEEEECCCHHHHHHHCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHC T ss_conf 98447646649986372065451059851220365855699999999999999856997899818888279999999859 Q ss_pred CCEEEEC---CCCCCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 9089980---53367877816789999999999987419 Q gi|254780717|r 259 VMSAILD---PEGMLIAEGPELYFQLMRSMSNSIAKNCS 294 (294) Q Consensus 259 ~~~~~ld---~~~~~~~~~~~~Y~~~m~~n~~~l~k~ls 294 (294) +++..+. +++.....+ ++|++||+.|++.+.++|. T Consensus 265 ~~~~~~~~~d~~~~~~~~~-~~y~~~~~~n~~~i~~~l~ 302 (303) T COG0803 265 VKILGLLYLDSLGDKDSKG-DTYISMMKANLDTIVEGLK 302 (303) T ss_pred CCCCCCCCCCCCCCCCCCC-CCHHHHHHHHHHHHHHHHC T ss_conf 8602100223445678764-5289999999999998634 No 10 >COG4531 ZnuA ABC-type Zn2+ transport system, periplasmic component/surface adhesin [Inorganic ion transport and metabolism] Probab=100.00 E-value=0 Score=428.83 Aligned_cols=293 Identities=39% Similarity=0.677 Sum_probs=260.8 Q ss_pred CHHHHHHHHHHHHH-HHCCCCCCCEEEEEECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHCCCCEEEEE Q ss_conf 91589999999999-60013238805999722899999997079441788705899835335680245665025789996 Q gi|254780717|r 1 MKNFLIILIFLFFI-LSSVARAGSLQVVVSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRISEAMMLENADIIFWL 79 (294) Q Consensus 1 Mkk~l~~~~~~~~~-l~~~~~~~~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~ 79 (294) |++.++.+..++.. +.+..+.....|||||.|+++|+..|+.+..+..+|+|+|.+||+|.++|||+++|++||+|||+ T Consensus 3 ~~~~~~l~~A~~A~~~~~~~a~A~~~Vv~SIKPl~~iasaI~dGVg~p~vlvp~gASpHdYsLrPSDv~rlq~Adlv~Wv 82 (318) T COG4531 3 LHKKTLLLSALFALLLGSAPAAAAAAVVTSIKPLGFIASAIADGVGEPEVLLPGGASPHDYSLRPSDVKRLQSADLVVWV 82 (318) T ss_pred CHHHHHHHHHHHHHHHCCCHHCCCCCCEEEECCHHHHHHHHHCCCCCCCEECCCCCCCCCCCCCHHHHHHHHCCCEEEEE T ss_conf 05779999999999844530004632146504179999999826899736427999955335783578886318789997 Q ss_pred CCCCCHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCC------------C-------CCCCCC-CCCCHHHHHHHHHHH Q ss_conf 76610356899973145576799751354462100121------------1-------123321-224756776555567 Q gi|254780717|r 80 GAEMESFLVKPLHSLNKQSNVVTLSHSPDLHRILLRDN------------H-------SHFHDS-EADDLHLWLNPLNVQ 139 (294) Q Consensus 80 G~~~E~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~------------~-------~~~~~~-~~~dpH~Wl~p~n~~ 139 (294) |.++|+|++|.+..+ ...++|.+++.+++.++..++. | .|+|+| |.+|+|+||||.|++ T Consensus 83 Gp~lEaFL~Kpl~~l-~~~kvv~l~d~~~v~~L~~r~~hd~~~e~~h~hdH~~~dh~~d~dH~H~G~~d~H~Wl~P~~ak 161 (318) T COG4531 83 GPDLEAFLDKPLSGL-PGAKVVTLADLPGVKPLLFREAHDHEEEHDHGHDHAGADHKGDHDHHHEGEYDMHLWLSPAIAK 161 (318) T ss_pred CCCHHHHHHHHHHCC-CCCCEEEHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECHHHHH T ss_conf 866787763122136-6772786231478743222222564211355434567655675445687875730132606789 Q ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCHHHHHHHHCCCCEEC-CCCCCCCC Q ss_conf 89999999974218656588998999986312223589998503002781462062157789874981200-34677544 Q gi|254780717|r 140 YIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKKIIVFHEAYRYFASHYNLSIVT-FPMSHSVF 218 (294) Q Consensus 140 ~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~~i~~H~a~~Yf~~~~gl~~~~-~~~~~~~~ 218 (294) .+|..|+++|.++||+|+..|++|++.|..+|..+++.+.++++|+++++|+++||||+||.++|||+..+ +.++|+.. T Consensus 162 ~~A~~i~~~L~e~dp~~~a~y~aNlk~f~~~La~~d~~i~~~L~pvk~Kpf~VFHDAY~YFE~~ygl~~~G~fTVsPev~ 241 (318) T COG4531 162 AVAAAIAKKLAELDPQNAAKYDANLKDFEAQLAALDKKVGEELAPVKGKPFFVFHDAYGYFENAYGLKPLGHFTVSPEVQ 241 (318) T ss_pred HHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECHHHHHHHHHCCCCCCEEEECCCCC T ss_conf 99999999998649455899998899999999988899998712457897589842488888740766465277673447 Q ss_pred CCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 4978899999999754993899818889789999999719908998053367877816789999999999987419 Q gi|254780717|r 219 MGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMSAILDPEGMLIAEGPELYFQLMRSMSNSIAKNCS 294 (294) Q Consensus 219 ~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~~~ld~~~~~~~~~~~~Y~~~m~~n~~~l~k~ls 294 (294) |++++|.++.++|++.++.|||.||||.+++++.++++|++++.+|||+|.....|+++|.++|++.+++++.-|| T Consensus 242 PGA~rl~~Ir~~l~e~~a~CvFaEPQF~Pkvve~v~~GT~vr~g~LDPlg~~i~lg~~sY~~fl~~lA~sy~~CLs 317 (318) T COG4531 242 PGAKRLAEIRTQLKEQKATCVFAEPQFRPKVVETVAEGTSVRSGTLDPLGTNIKLGKDSYFNFLSNLANSYASCLS 317 (318) T ss_pred CCHHHHHHHHHHHHHHCCCEEECCCCCCHHHHHHHHCCCCCCEEEECCCCCCCEECCCHHHHHHHHHHHHHHHHHC T ss_conf 4378999999999972796772189876189999854775431420667662003840699999999999999853 No 11 >cd01145 TroA_c Periplasmic binding protein TroA_c. These proteins are predicted to function as initial receptors in the ABC metal ion uptake in eubacteria and archaea. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind their ligands in the cleft between these domains. Probab=100.00 E-value=0 Score=369.05 Aligned_cols=201 Identities=20% Similarity=0.308 Sum_probs=174.2 Q ss_pred CCEEEEEECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCEE Q ss_conf 88059997228999999970794417887058998353356802456650257899967661035689997314557679 Q gi|254780717|r 22 GSLQVVVSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRISEAMMLENADIIFWLGAEMESFLVKPLHSLNKQSNVV 101 (294) Q Consensus 22 ~~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~i 101 (294) .+++||||+.|+++||++|+||+++|++|+|+|+|||+|+|||+|+++|++||+|||||.+||+|+.++++...... +. T Consensus 1 a~l~Vvts~~pl~~iv~~I~Gd~v~V~~Lvp~g~dPH~yep~p~d~~~l~~Adlv~~~G~~lE~~~~k~~~~~~~~~-~~ 79 (203) T cd01145 1 AALNVVVTFPDLKDLVREVAGDAVIVSALTPPGVDPHQYQLKPSDIAKMRKADLVVTSGHELEGFEPKLAELSSNSK-VQ 79 (203) T ss_pred CCCEEEEECHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCHHHHHHHCCCCEEEEECCCHHHHHHHHHHHCCCCC-CC T ss_conf 99679998779999999983995389980698989756658999999983897999948870779999998656875-21 Q ss_pred EEEEC-CCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 97513-54462100121112332122475677655556789999999974218656588998999986312223589998 Q gi|254780717|r 102 TLSHS-PDLHRILLRDNHSHFHDSEADDLHLWLNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHS 180 (294) Q Consensus 102 ~~~~~-~~~~~~~~~~~~~~~~~~~~~dpH~Wl~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~ 180 (294) ..... ..........++.++++++..|||+||||.|++.++++|+++|+++||+|++.|++|+++|.++|++|+.++++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~dPH~Wldp~~~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~ 159 (203) T cd01145 80 PGIKILIEDSDTVGMVDRAMGDYHGKGNPHVWLDPNNAPALAKALADALIELDPSEQEEYKENLRVFLAKLNKLLREWER 159 (203) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 13101222345555456555556799998014799999999999999999868460899999999999999999999999 Q ss_pred HCCCCCCCEEEECCCHHHHHHHHCCCCEEC-CCCCCCCCCCHHH Q ss_conf 503002781462062157789874981200-3467754449788 Q gi|254780717|r 181 ILQPVEKKKIIVFHEAYRYFASHYNLSIVT-FPMSHSVFMGAAS 223 (294) Q Consensus 181 ~~~~~~~~~~i~~H~a~~Yf~~~~gl~~~~-~~~~~~~~~s~~~ 223 (294) .+.+.++|.++++||+|+||+++|||++++ ++.++|.+|||++ T Consensus 160 ~~~~~~~~~~v~~H~af~Yfa~~ygl~~~~~~~~~p~~~Ps~~h 203 (203) T cd01145 160 QFEGLKGIQVVAYHPSYQYLADWLGIEVVASLEPLPELPPTSSH 203 (203) T ss_pred HHCCCCCCEEEEECCCHHHHHHHCCCEEEEEECCCCCCCCCCCC T ss_conf 73776799799978737999997799798987569998989998 No 12 >cd00636 TroA-like Helical backbone metal receptor (TroA-like domain). These proteins have been shown to function in the ABC transport of ferric siderophores and metal ions such as Mn2+, Fe3+, Cu2+ and/or Zn2+. Their ligand binding site is formed in the interface between two globular domains linked by a single helix. Many of these proteins also possess a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence). The TroA-like proteins differ in their fold and ligand-binding mechanism from the PBPI and PBPII proteins, but are structurally similar, however, to the beta-subunit of the nitrogenase molybdenum-iron protein MoFe. Most TroA-like proteins are encoded by ABC-type operons and appear to function as periplasmic components of ABC transporters in metal ion uptake. Probab=98.21 E-value=8.9e-05 Score=50.72 Aligned_cols=124 Identities=20% Similarity=0.190 Sum_probs=94.6 Q ss_pred EEEEECHHHHHHHHHHHCCCCEEEEEECCCCC-------------CCCCCCCHHHHHHHCCCCEEEEECCCCCHHHHHHH Q ss_conf 59997228999999970794417887058998-------------35335680245665025789996766103568999 Q gi|254780717|r 25 QVVVSIKPIHSIVSCIMQGIGTPALLVKGASS-------------PHEYSLRISEAMMLENADIIFWLGAEMESFLVKPL 91 (294) Q Consensus 25 ~Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~d-------------pH~y~~tp~d~~~l~~Adlii~~G~~~E~~~~~~~ 91 (294) +||++.....+++..++++...|.+....+.+ +|.++|.+.++.++ +.|++|+.+...+.+.+++. T Consensus 2 riv~~~~~~~e~l~~lg~~~~~vg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~i~~l-~PDlvi~~~~~~~~~~~~l~ 80 (148) T cd00636 2 RVVALDPGATELLLALGGDDKPVGVADPSGYPPEAKALLEKVPDVGHGYEPNLEKIAAL-KPDLIIANGSGLEAWLDKLS 80 (148) T ss_pred EEEEECCHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCCCCCCHHHHHCC-CCCEEEEECCCCHHHHHHHH T ss_conf 19995750899999869998599995677887033200012562688889999999707-99899994666788999998 Q ss_pred HHCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH Q ss_conf 73145576799751354462100121112332122475677655556789999999974218656588998999986312 Q gi|254780717|r 92 HSLNKQSNVVTLSHSPDLHRILLRDNHSHFHDSEADDLHLWLNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQL 171 (294) Q Consensus 92 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~dpH~Wl~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L 171 (294) .. . .+++.+.... + .++.++...+..+++.+. ++++++++.+.+ T Consensus 81 ~~-~--~~~~~~~~~~---------------------~---~~~~~~~~~i~~lg~~~~---------~~~~a~~~~~~~ 124 (148) T cd00636 81 KI-A--IPVVVVDEAS---------------------E---LSLENIKESIRLIGKALG---------KEENAEELIAEL 124 (148) T ss_pred HC-C--CCEEEECCCC---------------------C---CCHHHHHHHHHHHHHHHC---------CCHHHHHHHHHH T ss_conf 63-9--9789957888---------------------8---999999999999999969---------818999999999 Q ss_pred HHHHHHHHHHCCCC Q ss_conf 22358999850300 Q gi|254780717|r 172 SQLDKELHSILQPV 185 (294) Q Consensus 172 ~~l~~~~~~~~~~~ 185 (294) ++..+.+++..... T Consensus 125 ~~~~~~i~~~~~~~ 138 (148) T cd00636 125 DARLAELRAKLAKI 138 (148) T ss_pred HHHHHHHHHHHCCC T ss_conf 99999999996389 No 13 >COG4607 CeuA ABC-type enterochelin transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=96.12 E-value=0.13 Score=30.66 Aligned_cols=219 Identities=14% Similarity=0.112 Sum_probs=104.3 Q ss_pred CEEEEEECHHHHHHHHHHHCCCCEEE--EEECC----CCC---C---CCCCCCHHHHHH--HCCCCEEEEECCCCCHHHH Q ss_conf 80599972289999999707944178--87058----998---3---533568024566--5025789996766103568 Q gi|254780717|r 23 SLQVVVSIKPIHSIVSCIMQGIGTPA--LLVKG----ASS---P---HEYSLRISEAMM--LENADIIFWLGAEMESFLV 88 (294) Q Consensus 23 ~~~Vv~s~~pl~~lv~~I~gd~~~V~--~L~~~----g~d---p---H~y~~tp~d~~~--l~~Adlii~~G~~~E~~~~ 88 (294) +-+||+.-...-|-...+|=+.+-|- ..+|. =.+ + -.||| |.++ -.+.||||..|..-+ ..+ T Consensus 58 PekVvv~D~gaLD~ld~lGve~~~v~~~~~~P~yL~~y~~dky~nvGtlfEP---D~Eai~a~kPdLIIiggR~ak-~yd 133 (320) T COG4607 58 PEKVVVLDLGALDTLDALGVEVVAVGPGKNLPAYLQKYKDDKYANVGTLFEP---DYEAIAAAKPDLIIIGGRAAK-AYD 133 (320) T ss_pred CCEEEEECCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCC---CHHHHHHCCCCEEEECCHHHH-HHH T ss_conf 7558995411566688729720223687786378997146786666753587---889997449988997747777-799 Q ss_pred HHHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 99973145576799751354462100121112332122475677655556789999999974218656588998999986 Q gi|254780717|r 89 KPLHSLNKQSNVVTLSHSPDLHRILLRDNHSHFHDSEADDLHLWLNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFK 168 (294) Q Consensus 89 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~dpH~Wl~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~ 168 (294) ++. +-.+.|.+. .++-++|-|.. ..++.+++.+.+- ..+++.. T Consensus 134 ~l~----kiAPti~l~---------------------~d~~n~~~S~~---~n~e~Lg~IFgkE---------~eAk~~~ 176 (320) T COG4607 134 KLS----KIAPTIDLG---------------------ADTANLIESTK---QNIETLGKIFGKE---------EEAKELL 176 (320) T ss_pred HHH----HHCCEEEEC---------------------CCHHHHHHHHH---HHHHHHHHHHCCH---------HHHHHHH T ss_conf 998----538747852---------------------65678899999---9999999985840---------8899999 Q ss_pred HHHHHHHHHHHHHCCCCCCCEE--EECCC---------HHHHHHHHCCCCEEC--CC-CCCCCCCCHHHHHHHHHHHHHC Q ss_conf 3122235899985030027814--62062---------157789874981200--34-6775444978899999999754 Q gi|254780717|r 169 NQLSQLDKELHSILQPVEKKKI--IVFHE---------AYRYFASHYNLSIVT--FP-MSHSVFMGAASLRNIRSKIISD 234 (294) Q Consensus 169 ~~L~~l~~~~~~~~~~~~~~~~--i~~H~---------a~~Yf~~~~gl~~~~--~~-~~~~~~~s~~~l~~~~~~ik~~ 234 (294) .+++..-+.+++.-....++.+ ++.-. .|+..-..+|++... +. .+||...|- +.|+++ T Consensus 177 ~~id~~i~~~k~~a~~~~~t~m~il~ngGkisafGp~SRfg~ihd~~G~~pvd~~~~~s~HGq~VSf-------EyI~e~ 249 (320) T COG4607 177 ADIDASIAAAKEKAAGKGKTALVILVNGGKISAFGPSSRFGWIHDDLGFTPVDENIKNSNHGQPVSF-------EYIKEK 249 (320) T ss_pred HHHHHHHHHHHHHHHCCCCEEEEEEECCCEEEEECCCCCCEEEECCCCCCCCCCCCCCCCCCCEECH-------HHHHHH T ss_conf 8799999999998641698059999449704554487753133201288832333444678872029-------999751 Q ss_pred CCCEEEE-EC--------CCCHHHH-HHHHHHHC----CCEEEECCCCCCCCCC-CCCHHHHHHHHHHHH Q ss_conf 9938998-18--------8897899-99999719----9089980533678778-167899999999999 Q gi|254780717|r 235 KISCLFY-GP--------EFDSKII-RSITNDTG----VMSAILDPEGMLIAEG-PELYFQLMRSMSNSI 289 (294) Q Consensus 235 ~v~~if~-e~--------~~~~~~~-~~ia~e~~----~~~~~ld~~~~~~~~~-~~~Y~~~m~~n~~~l 289 (294) |..+||+ +. +...... +.+.++++ -+++.|||..+....| -.+|.+|+.+..+.+ T Consensus 250 NPD~lfViDR~~aig~~~~~a~qvldN~lv~~t~A~k~g~I~yLdp~~wy~aggG~~s~~~~i~~~~~a~ 319 (320) T COG4607 250 NPDWLFVIDRGAAIGEEGSAAKQVLDNELVANTKAWKNGQIIYLDPDTWYIAGGGLESLTQMIEQVKAAF 319 (320) T ss_pred CCCEEEEEECCCHHCCCCHHHHHHHCCHHHHHCCHHHCCEEEEECHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 9987999717403147761677764019887331410683899773896343563999999999999863 No 14 >cd01977 Nitrogenase_VFe_alpha Nitrogenase_VFe_alpha -like: Nitrogenase VFe protein, alpha subunit like. This group contains proteins similar to the alpha subunits of, the VFe protein of the vanadium-dependent (V-) nitrogenase and the FeFe protein of the iron only (Fe-) nitrogenase Nitrogenase catalyzes the ATP-dependent reduction of dinitrogen (N2) to ammonia. In addition to V- and Fe- nitrogenases there is a molybdenum (Mo)-dependent nitrogenase which is the most widespread and best characterized of these systems. These systems consist of component 1 (VFe protein, FeFe protein or, MoFe protein respectively) and, component 2 (Fe protein). MoFe is an alpha2beta2 tetramer, V-and Fe- nitrogenases are alpha2beta2delta2 hexamers. The alpha and beta subunits of VFe and FeFe are similar to the alpha and beta subunits of MoFe. For MoFe each alphabeta pair contains one P-cluster (at the alphabeta interface) and, one molecule of iron molybdenum cofactor (FeMoco) contained within the alpha sub Probab=94.78 E-value=0.37 Score=27.73 Aligned_cols=141 Identities=13% Similarity=0.116 Sum_probs=70.0 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCHHH------HHHHHCCC Q ss_conf 655556789999999974218656588998999986312223589998503002781462062157------78987498 Q gi|254780717|r 133 LNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKKIIVFHEAYR------YFASHYNL 206 (294) Q Consensus 133 l~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~~i~~H~a~~------Yf~~~~gl 206 (294) +-+.+...+.+.|++.+...++ -.....+...++..+++ ++++ .+.+|.++++-+... -|.++.|+ T Consensus 242 ~Gi~~T~~~Lr~ia~~~g~~~~-~e~~i~~e~~~~~~~l~----~~~~---~l~GKkv~i~~g~~~~~~~~~~~~~ElGm 313 (415) T cd01977 242 FGFEYCAESLRKIGAFFGIEDR-AEAVIAEEMAKWKPELD----WYKE---RLKGKKVCIWTGGPKLWHWTKVIEDELGM 313 (415) T ss_pred CCHHHHHHHHHHHHHHHCCCHH-HHHHHHHHHHHHHHHHH----HHHH---HCCCCEEEEECCCHHHHHHHHHHHHHCCC T ss_conf 7746799999999998299411-77889999999999999----9987---57399899975855899999999986685 Q ss_pred CEECCCCCCCCCCCHHHHH-------------------HHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCEEEECCC Q ss_conf 1200346775444978899-------------------999999754993899818889789999999719908998053 Q gi|254780717|r 207 SIVTFPMSHSVFMGAASLR-------------------NIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMSAILDPE 267 (294) Q Consensus 207 ~~~~~~~~~~~~~s~~~l~-------------------~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~~~ld~~ 267 (294) +++.+....+ ...++. ++.+.++..+...++..+ . -+.+++++|++.+.+... T Consensus 314 evv~~~~~~~---~~~~~e~~~~~~~~~~~~v~~~~~~e~~~~i~~~~pDlliG~~-~----~~~ia~klgiP~~~~~~~ 385 (415) T cd01977 314 QVVAMSSKFG---HQEDFEKVIARGGEGTIYIDDPNELEFFEILEMLKPDIILTGP-R----VGELVKKLHVPYVNIHAY 385 (415) T ss_pred EEEEECCCCC---CHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHCCCCEEEECC-C----CCEEEEECCCCEEEECCC T ss_conf 8999605458---8789999997478880998689899999999736999999658-7----441102248987980587 Q ss_pred CCCCCCCCCCHHHHHHHHHHHH Q ss_conf 3678778167899999999999 Q gi|254780717|r 268 GMLIAEGPELYFQLMRSMSNSI 289 (294) Q Consensus 268 ~~~~~~~~~~Y~~~m~~n~~~l 289 (294) ....--|.++-+.+++...++| T Consensus 386 ~~~p~~GY~G~~~l~~~I~n~i 407 (415) T cd01977 386 HNGPYMGFEGFVNLARDMYNAI 407 (415) T ss_pred CCCCCCHHHHHHHHHHHHHHHH T ss_conf 6789510874999999999984 No 15 >PRK06027 purU formyltetrahydrofolate deformylase; Reviewed Probab=89.25 E-value=1.9 Score=23.24 Aligned_cols=111 Identities=14% Similarity=0.078 Sum_probs=69.7 Q ss_pred HHHHHHHHHHHHHCCCCCCCEEEECCCHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHH Q ss_conf 31222358999850300278146206215778987498120034677544497889999999975499389981888978 Q gi|254780717|r 169 NQLSQLDKELHSILQPVEKKKIIVFHEAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISDKISCLFYGPEFDSK 248 (294) Q Consensus 169 ~~L~~l~~~~~~~~~~~~~~~~i~~H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~ 248 (294) -.|.+|--+++.---+.+=..+++-|+.+..++++||+....+..+++.. +.+=.++.+.++++++..|+.-... .= T Consensus 100 HCL~DLL~R~~~GeL~~eI~~VISNH~dL~~lae~~~IPF~~iPv~~~~K--~e~E~~i~~~~~~~~~d~ivla~ym-~i 176 (285) T PRK06027 100 HCLGDLLWRWRSGELPVEIAAVISNHPDLRSLVESFGIPFHHVPVTKETK--AEAEAQLLELIDEYQPDLVVLARYM-QI 176 (285) T ss_pred CCHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHHCCCCEEEECCCCCCH--HHHHHHHHHHHHHCCCCEEEEHHHH-HH T ss_conf 37899999875698562577994285789999998699828826876653--7799999999873497199763368-76 Q ss_pred HHHHHHHHHCCCEEEECCCCCCCCCCCCCHHHHH Q ss_conf 9999999719908998053367877816789999 Q gi|254780717|r 249 IIRSITNDTGVMSAILDPEGMLIAEGPELYFQLM 282 (294) Q Consensus 249 ~~~~ia~e~~~~~~~ld~~~~~~~~~~~~Y~~~m 282 (294) +...+.++..-+++.+.|-=.-.-.|.+.|-+-. T Consensus 177 l~~~~~~~~~~~iiNiH~s~lp~f~G~~~~~~a~ 210 (285) T PRK06027 177 LSPEFVARFPGRIINIHHSFLPAFKGAKPYHQAY 210 (285) T ss_pred CCHHHHHHHCCCEEEECHHHCCCCCCCCHHHHHH T ss_conf 6888998721764784511225799987799999 No 16 >PRK10957 iron-enterobactin transporter periplasmic binding protein; Provisional Probab=89.24 E-value=1.9 Score=23.24 Aligned_cols=41 Identities=20% Similarity=0.271 Sum_probs=24.4 Q ss_pred HHHHHHHHHHHHHHCCCCC--------------------CCEEEEEECHHHHHHHHHHHCC Q ss_conf 5899999999996001323--------------------8805999722899999997079 Q gi|254780717|r 3 NFLIILIFLFFILSSVARA--------------------GSLQVVVSIKPIHSIVSCIMQG 43 (294) Q Consensus 3 k~l~~~~~~~~~l~~~~~~--------------------~~~~Vv~s~~pl~~lv~~I~gd 43 (294) ++.+++.++.+++++|+.+ .+-+||+.-....+.+-.++.. T Consensus 6 ~~~~~~~~~~~~~~g~~~~~~~~~prtV~h~~G~t~i~~~PqRVV~l~~~~~d~llaLg~p 66 (319) T PRK10957 6 LYRLALLLLGLLLSGIAAAQAADWPRTITDSRGTHTLEPKPARIVSTSVTLTGTLLAIDAP 66 (319) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCCEECCCCCCEEEEECCCCHHHHHHCCCC T ss_conf 8999999999997478755458988789719842766899967999548627889867996 No 17 >cd01968 Nitrogenase_NifE_I Nitrogenase_NifE_I: a subgroup of the NifE subunit of the NifEN complex: NifE forms an alpha2beta2 tetramer with NifN. NifE and NifN are structurally homologous to nitrogenase MoFe protein alpha and beta subunits respectively. NifEN participates in the synthesis of the iron-molybdenum cofactor (FeMoco) of the MoFe protein. NifB-co (an iron and sulfur containing precursor of the FeMoco) from NifB is transferred to the NifEN complex where it is further processed to FeMoco. The NifEN bound precursor of FeMoco has been identified as a molybdenum-free, iron- and sulfur- containing analog of FeMoco. It has been suggested that this NifEN bound precursor also acts as a cofactor precursor in nitrogenase systems which require a cofactor other than FeMoco: i.e. iron-vanadium cofactor (FeVco) or iron only cofactor (FeFeco). Probab=86.06 E-value=2.9 Score=22.04 Aligned_cols=145 Identities=13% Similarity=0.139 Sum_probs=84.1 Q ss_pred HHHHHHHHHHHHHHHHHHHCCC-C-CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECC-C----HHHHHHHHCC Q ss_conf 6555567899999999742186-5-658899899998631222358999850300278146206-2----1577898749 Q gi|254780717|r 133 LNPLNVQYIAHVIAMELIKKDP-R-NKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKKIIVFH-E----AYRYFASHYN 205 (294) Q Consensus 133 l~p~n~~~~a~~I~~~L~~~dP-~-~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~~i~~H-~----a~~Yf~~~~g 205 (294) +-+.+...+...|++.+..-.+ + -.+.++....++...|...+.. +.++.++++. + ++.-|.++.| T Consensus 238 ~Gl~~t~~~l~~la~~~g~~~~~~~~e~~I~~er~~l~d~l~d~~~~-------l~Gkrvai~~~~~~~~~l~~~l~ElG 310 (410) T cd01968 238 YGIRDTSKSLRNIAELLGDEELIERTEELIAREEARLRPELAPYRAR-------LEGKKAALYTGGVKSWSLVSALQDLG 310 (410) T ss_pred CCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH-------HCCCEEEEECCCHHHHHHHHHHHHCC T ss_conf 78699999999999983898402456899999999999999999998-------53988999648468899999999779 Q ss_pred CCEECCCCCCCC-------------C---CCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCEEEECCCCC Q ss_conf 812003467754-------------4---497889999999975499389981888978999999971990899805336 Q gi|254780717|r 206 LSIVTFPMSHSV-------------F---MGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMSAILDPEGM 269 (294) Q Consensus 206 l~~~~~~~~~~~-------------~---~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~~~ld~~~~ 269 (294) ++++.....+.. + ..-.+..++.+.+++.++..++..+. .+.++++.|+|.+.....-. T Consensus 311 m~vv~~~~~~~~~~~~~~~~~~~~~~~~vi~d~d~~el~~~i~~~~~Dliig~s~-----~~~~a~rlGiP~~~~~~~~~ 385 (410) T cd01968 311 MEVVATGTQKGTKEDYERIKELLGEGTVIVDDANPRELKKLLKEKKADLLVAGGK-----ERYLALKLGIPFCDINHERK 385 (410) T ss_pred CEEEEEECCCCCHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHCCCCEEEECCC-----HHHHHHHHCCCEEEECCCCC T ss_conf 9899981578988899999985598888984899999999986569999995773-----24889980898898037767 Q ss_pred CCCCCCCCHHHHHHHHHHHH Q ss_conf 78778167899999999999 Q gi|254780717|r 270 LIAEGPELYFQLMRSMSNSI 289 (294) Q Consensus 270 ~~~~~~~~Y~~~m~~n~~~l 289 (294) ...-|.++-+.+++..++++ T Consensus 386 ~~~~GY~G~~~l~~~i~nai 405 (410) T cd01968 386 HPYAGYEGMLNFAKEVDLAV 405 (410) T ss_pred CCEEEHHHHHHHHHHHHHHH T ss_conf 77331751999999999986 No 18 >PRK13011 formyltetrahydrofolate deformylase; Reviewed Probab=86.01 E-value=2.9 Score=22.02 Aligned_cols=110 Identities=10% Similarity=0.068 Sum_probs=66.7 Q ss_pred HHHHHHHHHHHCCCCCCCEEEECCCHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHH Q ss_conf 22235899985030027814620621577898749812003467754449788999999997549938998188897899 Q gi|254780717|r 171 LSQLDKELHSILQPVEKKKIIVFHEAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISDKISCLFYGPEFDSKII 250 (294) Q Consensus 171 L~~l~~~~~~~~~~~~~~~~i~~H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~ 250 (294) |.+|--+++.---+..=..+++-|+.+..+++.||+....+...++.. +.+=.++.+.++++++..|+.-..+ .-+. T Consensus 104 L~DLL~R~~~GeL~~eI~~VISNH~dL~~lae~~~IPF~~iPv~~~~K--~e~E~~~~~~~~~~~~d~ivla~ym-~il~ 180 (287) T PRK13011 104 LNDLLYRWRIGELPMDIVGVVSNHPDLEPLAAWHGIPFHHFPITPDTK--PQQEAQVLDVIEESGAELVVLARYM-QVLS 180 (287) T ss_pred HHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHHCCCCEEEECCCCCCC--HHHHHHHHHHHHHCCCCEEEHHHHH-HHCC T ss_conf 999999976498684465873498678999997699828837887763--7899999999873397399434488-8749 Q ss_pred HHHHHHHCCCEEEECCCCCCCCCCCCCHHHHHH Q ss_conf 999997199089980533678778167899999 Q gi|254780717|r 251 RSITNDTGVMSAILDPEGMLIAEGPELYFQLMR 283 (294) Q Consensus 251 ~~ia~e~~~~~~~ld~~~~~~~~~~~~Y~~~m~ 283 (294) ..+.++..-++..+.|-=.-.-.|.+.|.+-.. T Consensus 181 ~~~~~~~~~~iinih~s~lP~f~G~~~~~~a~~ 213 (287) T PRK13011 181 PELCRKLAGRAINIHHSFLPGFKGAKPYHQAYE 213 (287) T ss_pred HHHHHHHCCCCCCCCHHHCCCCCCCCHHHHHHH T ss_conf 899984038010216001157899867999997 No 19 >cd03795 GT1_like_4 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP-linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=84.92 E-value=3.3 Score=21.70 Aligned_cols=20 Identities=10% Similarity=0.200 Sum_probs=11.8 Q ss_pred CCCHHHHHHHHHHHCCCEEEEC Q ss_conf 8897899999997199089980 Q gi|254780717|r 244 EFDSKIIRSITNDTGVMSAILD 265 (294) Q Consensus 244 ~~~~~~~~~ia~e~~~~~~~ld 265 (294) .++-..+|.++ +|++++..| T Consensus 277 gfg~~~lEAma--~G~PVVat~ 296 (357) T cd03795 277 AFGIVLLEAMA--FGKPVISTE 296 (357) T ss_pred CCCHHHHHHHH--CCCCEEEEC T ss_conf 56667999998--799899935 No 20 >PRK13010 purU formyltetrahydrofolate deformylase; Reviewed Probab=82.84 E-value=4 Score=21.15 Aligned_cols=112 Identities=12% Similarity=0.047 Sum_probs=69.7 Q ss_pred HHHHHHHHHHHHHCCCCCCCEEEECCCHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHH Q ss_conf 31222358999850300278146206215778987498120034677544497889999999975499389981888978 Q gi|254780717|r 169 NQLSQLDKELHSILQPVEKKKIIVFHEAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISDKISCLFYGPEFDSK 248 (294) Q Consensus 169 ~~L~~l~~~~~~~~~~~~~~~~i~~H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~ 248 (294) -.|.+|--+++.---+.+=..+++-|+.+..++++||+-...+...++.. +.+=+++.+.++++++..|+.-..+ .- T Consensus 105 HCL~DLL~R~~~GeL~~eI~~VISNH~dL~~lae~~gIPF~hipv~~~~k--~~~e~~~~~~~~~~~~d~vvla~ym-~i 181 (289) T PRK13010 105 HCLNDLLYRWRMGELDMDIVGIISNHPDLQPLAVQHDIPFHHLPVTPDTK--AQQEAQILDLIETSGAELVVLARYM-QV 181 (289) T ss_pred HHHHHHHHHHHCCCCCCEEEEEEECCHHHHHHHHHCCCCEEEECCCCCCH--HHHHHHHHHHHHHCCCCEEEEECHH-HH T ss_conf 50999999987498586014999787778999997799839826877752--6789999999871398899870546-67 Q ss_pred HHHHHHHHHCCCEEEECCCCCCCCCCCCCHHHHHH Q ss_conf 99999997199089980533678778167899999 Q gi|254780717|r 249 IIRSITNDTGVMSAILDPEGMLIAEGPELYFQLMR 283 (294) Q Consensus 249 ~~~~ia~e~~~~~~~ld~~~~~~~~~~~~Y~~~m~ 283 (294) +...+.+...-++..+.|-=.-.-.|.+.|-+-.. T Consensus 182 l~~~~~~~~~~~iinih~s~lP~f~G~~~~~~a~~ 216 (289) T PRK13010 182 LSDDLSRKLSGRAINIHHSFLPGFKGARPYHQAHA 216 (289) T ss_pred CCHHHHHHCCCCEEEECHHHCCCCCCCCHHHHHHH T ss_conf 69899975785126635212257899877999997 No 21 >PRK10270 hypothetical protein; Provisional Probab=81.20 E-value=1.4 Score=24.13 Aligned_cols=16 Identities=38% Similarity=0.841 Sum_probs=11.3 Q ss_pred CHHHHHHHHHHHHHHH Q ss_conf 9158999999999960 Q gi|254780717|r 1 MKNFLIILIFLFFILS 16 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~ 16 (294) |||++++++++++++. T Consensus 1 Mkk~~~~~~~l~~~l~ 16 (340) T PRK10270 1 MKKVLLIILLLLVVLG 16 (340) T ss_pred CCHHHHHHHHHHHHHH T ss_conf 9028999999999999 No 22 >PRK09590 celB cellobiose phosphotransferase system IIB component; Reviewed Probab=80.27 E-value=2.2 Score=22.78 Aligned_cols=64 Identities=14% Similarity=0.129 Sum_probs=44.1 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCHH--HHHHHHHHHCCCEEEECCCCCCC-CCCCCCHHHHHHH Q ss_conf 7889999999975499389981888978--99999997199089980533678-7781678999999 Q gi|254780717|r 221 AASLRNIRSKIISDKISCLFYGPEFDSK--IIRSITNDTGVMSAILDPEGMLI-AEGPELYFQLMRS 284 (294) Q Consensus 221 ~~~l~~~~~~ik~~~v~~if~e~~~~~~--~~~~ia~e~~~~~~~ld~~~~~~-~~~~~~Y~~~m~~ 284 (294) +--..+..+.+++.+..+++..||..-+ ..+..+.+.|+++..+++..-.. +.|.+.-.+++.. T Consensus 36 Av~~~e~~~~i~~~~yDv~LlgPQVr~~~~~~k~~a~~~giPv~vI~~~~Y~Plp~g~~k~~~~i~e 102 (104) T PRK09590 36 AISATEGGKAIAAAAFDLYLVSPQTKMYFKQFEEAGSKAGKPVVQIPPQAYIPIPMGIEKMANLILE 102 (104) T ss_pred EEEHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCHHHHHHHHHH T ss_conf 8488898876322688789988768878999999998729977887874645777788999999873 No 23 >cd03819 GT1_WavL_like This family is most closely related to the GT1 family of glycosyltransferases. WavL in Vibrio cholerae has been shown to be involved in the biosynthesis of the lipopolysaccharide core. Probab=80.12 E-value=3.3 Score=21.73 Aligned_cols=21 Identities=14% Similarity=0.161 Sum_probs=11.9 Q ss_pred CCCHHHHHHHHHHHCCCEEEECC Q ss_conf 88978999999971990899805 Q gi|254780717|r 244 EFDSKIIRSITNDTGVMSAILDP 266 (294) Q Consensus 244 ~~~~~~~~~ia~e~~~~~~~ld~ 266 (294) .++--.+|..+ +|.+++.-|. T Consensus 276 gf~~vllEAma--~G~PvV~s~~ 296 (355) T cd03819 276 AFGRTAVEAQA--MGRPVIASDH 296 (355) T ss_pred CCHHHHHHHHH--CCCCEEEECC T ss_conf 00078999998--6998999089 No 24 >cd03808 GT1_cap1E_like This family is most closely related to the GT1 family of glycosyltransferases. cap1E in Streptococcus pneumoniae is required for the synthesis of type 1 capsular polysaccharides. Probab=78.58 E-value=2.9 Score=22.04 Aligned_cols=20 Identities=10% Similarity=0.189 Sum_probs=9.5 Q ss_pred CCCHHHHHHHHHHHCCCEEEEC Q ss_conf 8897899999997199089980 Q gi|254780717|r 244 EFDSKIIRSITNDTGVMSAILD 265 (294) Q Consensus 244 ~~~~~~~~~ia~e~~~~~~~ld 265 (294) .++-..+|..+ +|.+++.-| T Consensus 275 gf~~~~lEAma--~G~PvI~s~ 294 (359) T cd03808 275 GLPRVLLEAMA--MGRPVIATD 294 (359) T ss_pred CCCCCHHHHHH--CCCCEEECC T ss_conf 57842899986--699899948 No 25 >PRK00591 prfA peptide chain release factor 1; Validated Probab=78.17 E-value=5.7 Score=20.17 Aligned_cols=97 Identities=19% Similarity=0.309 Sum_probs=60.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHH----------------------HH--HCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 24756776555567899999999----------------------74--2186565889989999863122235899985 Q gi|254780717|r 126 ADDLHLWLNPLNVQYIAHVIAME----------------------LI--KKDPRNKIIYEKNEEEFKNQLSQLDKELHSI 181 (294) Q Consensus 126 ~~dpH~Wl~p~n~~~~a~~I~~~----------------------L~--~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~ 181 (294) .-||-+|-||..+..+.+..+.- |. +-||+-.+..+.-...+.+++..+..+++.. T Consensus 23 ls~p~fW~D~~~a~kl~kE~~~L~~~v~~~~~l~~~~~dl~el~eLl~ee~D~el~e~a~~El~~l~~~l~~lE~eL~~l 102 (360) T PRK00591 23 LSDPEVISDQKRFRKLSKEYAELEPIVEAYREYKQAEEDLEDAKELLKEESDPEMREMAKEELKELEERLEELEEELKIL 102 (360) T ss_pred HHCCCHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 72985212999999999999999999999999999999899999998627997899999999999999999999999998 Q ss_pred CC---CCCCC-EEEECCCH----------------HHHHHHHCCCCEECCCCCCCCCCCHH Q ss_conf 03---00278-14620621----------------57789874981200346775444978 Q gi|254780717|r 182 LQ---PVEKK-KIIVFHEA----------------YRYFASHYNLSIVTFPMSHSVFMGAA 222 (294) Q Consensus 182 ~~---~~~~~-~~i~~H~a----------------~~Yf~~~~gl~~~~~~~~~~~~~s~~ 222 (294) +. |...+ .++.-|+. |.-|++.-|+++..+..+++.+.+.+ T Consensus 103 Llp~d~~D~~nailEIrAGAGGtEA~dwA~mL~RMY~RwAE~~g~k~evid~s~ge~aGiK 163 (360) T PRK00591 103 LLPKDPNDDKNVILEIRAGTGGDEAALFAGDLFRMYSRYAERQGWKVEILSASEGELGGYK 163 (360) T ss_pred HCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHH T ss_conf 3799977567858999658887489999999999999999867997156025776556502 No 26 >PRK10355 xylF D-xylose transporter subunit XylF; Provisional Probab=78.07 E-value=5.8 Score=20.15 Aligned_cols=100 Identities=12% Similarity=0.171 Sum_probs=52.8 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCEEEEEECHHH----H-HHHH---HHHCCCCEEEEEE-CCCCCCCCCCCCHHHHHHH- Q ss_conf 915899999999996001323880599972289----9-9999---9707944178870-5899835335680245665- Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARAGSLQVVVSIKPI----H-SIVS---CIMQGIGTPALLV-KGASSPHEYSLRISEAMML- 70 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~~~~~Vv~s~~pl----~-~lv~---~I~gd~~~V~~L~-~~g~dpH~y~~tp~d~~~l- 70 (294) ||++++.+++++.++++.+.+.+++|-++++-+ + .+.+ .-+. ...++..+ .++.|+- ---+|++.+ T Consensus 3 ~k~~~~~~~~~~~~~~~~a~a~~~~IGv~~~~~~~~~~~~~~~~~~~~Ak-e~G~~v~v~dA~~D~~---~Qi~qIe~~I 78 (330) T PRK10355 3 IKNILLTLCAALLLTSVAAHAKEVKIGMAIDDLRLERWQKDRDIFVKKAE-SLGAKVFVQSANGNEE---TQMSQIENMI 78 (330) T ss_pred HHHHHHHHHHHHHHHCCHHHHCCCEEEEEECCCCCHHHHHHHHHHHHHHH-HCCCEEEEECCCCCHH---HHHHHHHHHH T ss_conf 68999999999998446112138679999068888689999999999999-7599899971899989---9999999999 Q ss_pred -CCCCEEEEECCCCCHHHHHHHHHCC-CCCCEEEEEE Q ss_conf -0257899967661035689997314-5576799751 Q gi|254780717|r 71 -ENADIIFWLGAEMESFLVKPLHSLN-KQSNVVTLSH 105 (294) Q Consensus 71 -~~Adlii~~G~~~E~~~~~~~~~~~-~~~~~i~~~~ 105 (294) ++.|+||.+-.+-+... ..++... ...++|.+.. T Consensus 79 ~qgvdaIiv~p~D~~al~-~~v~~A~~aGIPVI~~D~ 114 (330) T PRK10355 79 NRGVDVLVIIPYNGQVLS-NVIKEAKQEGIKVLAYDR 114 (330) T ss_pred HCCCCEEEEECCCHHHHH-HHHHHHHHCCCEEEEECC T ss_conf 859999999699878889-999999987994999578 No 27 >cd01967 Nitrogenase_MoFe_alpha_like Nitrogenase_MoFe_alpha_like: Nitrogenase MoFe protein, alpha subunit_like. The nitrogenase enzyme catalyzes the ATP-dependent reduction of dinitrogen to ammonia. Three genetically distinct types of nitrogenase systems are known to exist: a molybdenum-dependent nitrogenase (Mo-nitrogenase), a vanadium dependent nitrogenase (V-nitrogenase), and an iron-only nitrogenase (Fe-nitrogenase). These nitrogenase systems consist of component 1 (MoFe protein, VFe protein or, FeFe protein respectively) and, component 2 (Fe protein). This group contains the alpha subunit of component 1 of all three different forms. The most widespread and best characterized of these systems is the Mo-nitrogenase. MoFe is an alpha2beta2 tetramer, the alternative nitrogenases are alpha2beta2delta2 hexamers having alpha and beta subunits similar to the alpha and beta subunits of MoFe. The role of the delta subunit is unknown. For MoFe, each alphabeta pair of subunits contains one Probab=76.20 E-value=6.5 Score=19.82 Aligned_cols=144 Identities=13% Similarity=0.123 Sum_probs=78.3 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEC-CCHH----HHHHHHCCCC Q ss_conf 6555567899999999742186565889989999863122235899985030027814620-6215----7789874981 Q gi|254780717|r 133 LNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKKIIVF-HEAY----RYFASHYNLS 207 (294) Q Consensus 133 l~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~~i~~-H~a~----~Yf~~~~gl~ 207 (294) +-+.+...+.+.|++.+.... .-.+..++...++...|+..... +.++.++++ .+.+ .-|.++.|++ T Consensus 240 iG~~~T~~fl~~la~~~g~~~-~~e~~i~~e~~~~~~~l~~~~~~-------l~Gkrv~i~~~~~~~~~~~~~l~ElG~e 311 (406) T cd01967 240 YGFEDTSESLRKIAKFFGDEE-KAEEVIAEEEARIKPELEKYRER-------LKGKKVIIYTGGARSWHVIAALRELGME 311 (406) T ss_pred CCHHHHHHHHHHHHHHHCCCC-CHHHHHHHHHHHHHHHHHHHHHH-------HCCCEEEEECCCHHHHHHHHHHHHCCCE T ss_conf 778999999999999859960-06899999999999999999998-------3697599976854689999999988996 Q ss_pred EECCCCCCCC----------------CCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCEEEECCCCCCC Q ss_conf 2003467754----------------449788999999997549938998188897899999997199089980533678 Q gi|254780717|r 208 IVTFPMSHSV----------------FMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMSAILDPEGMLI 271 (294) Q Consensus 208 ~~~~~~~~~~----------------~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~~~ld~~~~~~ 271 (294) +......... -....+..++.+.+++.+...++..+ . -+.++++.|++.+.+....... T Consensus 312 vv~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~e~~~~~~~~~pDllig~s-~----~~~~a~~lgiP~~d~~~~~~~~ 386 (406) T cd01967 312 VVAAGYEFGHDDDYERIRKILDEGTLLVDDYNDLELEELVEKLKPDLILSGI-K----EKYVAQKLGIPFLDLHSERNGP 386 (406) T ss_pred EEEEEECCCCCHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHCCCCEEEECC-C----CHHHHHHCCCCEEEEECCCCCC T ss_conf 9998744788028899986368877576069989999998307999999787-4----3488998299889962676785 Q ss_pred CCCCCCHHHHHHHHHHHH Q ss_conf 778167899999999999 Q gi|254780717|r 272 AEGPELYFQLMRSMSNSI 289 (294) Q Consensus 272 ~~~~~~Y~~~m~~n~~~l 289 (294) .-|.++-+.+++...+++ T Consensus 387 ~~GY~G~~~l~~~i~nal 404 (406) T cd01967 387 YAGYEGFLNFARDIDTAL 404 (406) T ss_pred EEEHHHHHHHHHHHHHHH T ss_conf 365678999999999986 No 28 >cd00316 Oxidoreductase_nitrogenase The nitrogenase enzyme system catalyzes the ATP-dependent reduction of dinitrogen to ammonia. This group contains both alpha and beta subunits of component 1 of the three known genetically distinct types of nitrogenase systems: a molybdenum-dependent nitrogenase (Mo-nitrogenase), a vanadium-dependent nitrogenase (V-nitrogenase), and an iron-only nitrogenase (Fe-nitrogenase) and, both subunits of Protochlorophyllide (Pchlide) reductase and chlorophyllide (chlide) reductase. The nitrogenase systems consist of component 1 (MoFe protein, VFe protein or, FeFe protein respectively) and, component 2 (Fe protein). The most widespread and best characterized nitrogenase is the Mo-nitrogenase. MoFe is an alpha2beta2 tetramer, the alternative nitrogenases are alpha2beta2delta2 hexamers whose alpha and beta subunits are similar to the alpha and beta subunits of MoFe. For MoFe, each alphabeta pair contains one P-cluster (at the alphabeta interface) and, one molec Probab=75.77 E-value=6.7 Score=19.75 Aligned_cols=173 Identities=14% Similarity=0.119 Sum_probs=90.8 Q ss_pred CEEEEEECCCCCCCCCCCCHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 41788705899835335680245665025789996766103568999731455767997513544621001211123321 Q gi|254780717|r 45 GTPALLVKGASSPHEYSLRISEAMMLENADIIFWLGAEMESFLVKPLHSLNKQSNVVTLSHSPDLHRILLRDNHSHFHDS 124 (294) Q Consensus 45 ~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 124 (294) .+|..+++.|. +..+++++.+|++-+..+...-..+.+.++.... ...+. T Consensus 180 i~v~~~~~~~~-------~~~~i~~~~~A~~nl~~~~~~~~~~a~~l~~~~g-~p~~~---------------------- 229 (399) T cd00316 180 IRVNALFDGGT-------TVEELRELGNAKLNLVLCRESGLYLARYLEEKYG-IPYIL---------------------- 229 (399) T ss_pred CCEEEECCCCC-------CHHHHHHCCCCCEEEEECHHHHHHHHHHHHHHHC-CCEEE---------------------- T ss_conf 90589769999-------9799975263538999676579999999998609-85560---------------------- Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEE-EECCCH----HHH Q ss_conf 224756776555567899999999742186565889989999863122235899985030027814-620621----577 Q gi|254780717|r 125 EADDLHLWLNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKKI-IVFHEA----YRY 199 (294) Q Consensus 125 ~~~dpH~Wl~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~~-i~~H~a----~~Y 199 (294) .+| +-+.+...+...|++.+..-. +-.+.++....++.+.+...... +.++.+ +...+. +.. T Consensus 230 --~~p---~G~~~T~~~l~~i~~~~g~~~-~~~~~i~~e~~~~~~~~~~~~~~-------l~gkrv~i~~~~~~~~~l~~ 296 (399) T cd00316 230 --INP---IGLEATDAFLRKLAELFGIEK-EVPEVIARERARLLDALADYHEY-------LGGKKVAIFGDGDLLLALAR 296 (399) T ss_pred --CCC---CCHHHHHHHHHHHHHHHCCCC-CHHHHHHHHHHHHHHHHHHHHHH-------HCCCEEEEECCCHHHHHHHH T ss_conf --588---786999999999999848986-44899999999999999999887-------45988999888278999999 Q ss_pred HHHHCCCCEECCCC--CCC--------------CCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCEEE Q ss_conf 89874981200346--775--------------44497889999999975499389981888978999999971990899 Q gi|254780717|r 200 FASHYNLSIVTFPM--SHS--------------VFMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMSAI 263 (294) Q Consensus 200 f~~~~gl~~~~~~~--~~~--------------~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~~~ 263 (294) |.++.|+++..+.. .+. ...-..++.++.+.+++.+...++..+ ..+.++++.+++.+. T Consensus 297 ~l~elG~~v~~~~~~~~~~~~~~~~~~~~~~~~~vv~~~d~~~~~~~i~~~~pDliig~~-----~~~~~a~~~~ip~~~ 371 (399) T cd00316 297 FLLELGMEVVAAGTTFGHKADYERREELLGEGTEVVDDGDLEELEELIRELKPDLIIGGS-----KGRYIAKKLGIPLVR 371 (399) T ss_pred HHHHCCCEEEEEEEECCCHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHCCCCEEEECC-----HHHHHHHHCCCCEEE T ss_conf 999879967799984698477889997417998899889999999998716999999794-----478999984998897 Q ss_pred EC Q ss_conf 80 Q gi|254780717|r 264 LD 265 (294) Q Consensus 264 ld 265 (294) +. T Consensus 372 ~~ 373 (399) T cd00316 372 IG 373 (399) T ss_pred CC T ss_conf 67 No 29 >TIGR03659 IsdE heme ABC transporter, heme-binding protein isdE. This family of ABC substrate-binding proteins is observed primarily in close proximity with proteins localized to the cell wall and bearing the NEAT (NEAr Transporter, pfam05031) heme-binding domain. IsdE has been shown to bind heme and is involved in the process of scavenging heme for the purpose of obtaining iron. Probab=75.42 E-value=6.8 Score=19.69 Aligned_cols=247 Identities=11% Similarity=0.058 Sum_probs=100.1 Q ss_pred HHHH--HHHHHHHHHHHCCCCC-----------CCEEEEEECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHH Q ss_conf 1589--9999999996001323-----------88059997228999999970794417887058998353356802456 Q gi|254780717|r 2 KNFL--IILIFLFFILSSVARA-----------GSLQVVVSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRISEAM 68 (294) Q Consensus 2 kk~l--~~~~~~~~~l~~~~~~-----------~~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~~ 68 (294) ||++ ++++++.+.+++|++. .+.+||+..+...++...++.+-+.|.. .. + --|...+ T Consensus 1 Kk~~~~~~~~~~~~~l~~C~~~~~~~~~~~~~~~p~RIV~l~~~~~eil~~Lg~~~VGv~~---~~-----~-~~p~~~k 71 (289) T TIGR03659 1 KKILSLILLALLSLGLTGCSSSKAESKVSNKKSKEERIVATSVAVTEILDKLDLDLVGVPT---SQ-----K-TLPKRYK 71 (289) T ss_pred CCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCCHHHHHHHCCCCEEEEEC---CC-----C-CCHHHHH T ss_conf 9129999999999998432899877766777899965998686379999986997599977---87-----6-6968871 Q ss_pred HHCCCCEEEEECCCCCHHHHHHHHHCCCCCCEEEEEEC-CCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 65025789996766103568999731455767997513-54462100121112332122475677655556789999999 Q gi|254780717|r 69 MLENADIIFWLGAEMESFLVKPLHSLNKQSNVVTLSHS-PDLHRILLRDNHSHFHDSEADDLHLWLNPLNVQYIAHVIAM 147 (294) Q Consensus 69 ~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~~~~~~~~~~~dpH~Wl~p~n~~~~a~~I~~ 147 (294) .+. .+|...+|-+++++.. .++ -|+..... .+.......-+- ..--.|+ -+.+.+...+..+++ T Consensus 72 ~v~------~vG~~~~pnlE~I~aL-kPD-LIi~~~~~~~~~~~~l~~~~i----~v~~~~~---~~~~~i~~~i~~lg~ 136 (289) T TIGR03659 72 DVP------EVGNPMSPDMEKIKSL-KPT-VVLSVTTLEEDLGPKFKQLGV----EATFLNL---TSVDGMKKSITELGE 136 (289) T ss_pred CCC------CCCCCCCCCHHHHHHC-CCC-EEEECCCCHHHHHHHHHHCCC----EEEEECC---CCHHHHHHHHHHHHH T ss_conf 857------2188789999999736-997-899557425878999986097----5999837---899999999999999 Q ss_pred HHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC-CEEEE----------CCCHH-HHHHHHCCCCEECCCC-C Q ss_conf 9742186565889989999863122235899985030027-81462----------06215-7789874981200346-7 Q gi|254780717|r 148 ELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHSILQPVEK-KKIIV----------FHEAY-RYFASHYNLSIVTFPM-S 214 (294) Q Consensus 148 ~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~-~~~i~----------~H~a~-~Yf~~~~gl~~~~~~~-~ 214 (294) .+.+ .+.-++-.+.|.++++++.+.. ...++ +.++. ..++| +-+.+..|.+-+.-.. . T Consensus 137 i~g~-----e~~A~~li~~~~~~l~~i~~~~----~~~~~~~v~~~~~~~~~~~~~~~~s~~g~il~~~G~~Nv~~~~~~ 207 (289) T TIGR03659 137 KYGR-----EEQAEKLVKEINEKEAEVKKKV----KGKKKPKVLILMGVPGSYLVATENSYIGDLVKLAGGENVYKGNKQ 207 (289) T ss_pred HHCC-----HHHHHHHHHHHHHHHHHHHHHH----CCCCCCEEEEEEECCCCEEEECCCCCHHHHHHHHCCCCCCCCCCC T ss_conf 8387-----7889999999999999999873----467897499999349953884599646789998167420225688 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCCEEEEECC-CCHHHHHHHHHH---------H----CCCEEEECCCCCCCCCCCCCHHH Q ss_conf 754449788999999997549938998188-897899999997---------1----99089980533678778167899 Q gi|254780717|r 215 HSVFMGAASLRNIRSKIISDKISCLFYGPE-FDSKIIRSITND---------T----GVMSAILDPEGMLIAEGPELYFQ 280 (294) Q Consensus 215 ~~~~~s~~~l~~~~~~ik~~~v~~if~e~~-~~~~~~~~ia~e---------~----~~~~~~ld~~~~~~~~~~~~Y~~ 280 (294) .-.+.| .+.+-+.+..+||.-.. .+....+.+.++ + +-++..+|+--+... ++-.+.+ T Consensus 208 ~~~~~s-------~E~l~~~nPDvI~~~~~~~~~~~~~~~~~~~~~np~w~~l~AVKn~~Vy~ld~~~~~~~-~~~~~~~ 279 (289) T TIGR03659 208 EYLSSN-------TEYLLKANPDIILRAAHGMPDEVKKMFDEEFKTNDIWKHFEAVKNNRVYDLDEELFGMT-ANLKVAE 279 (289) T ss_pred CCCCCC-------HHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHCCCCCCCCCHHCCCCEEEECHHHHCCC-CCHHHHH T ss_conf 755448-------99997609989999557884566888999986295121698200894799787880779-7577999 Q ss_pred HHHHHHHHH Q ss_conf 999999999 Q gi|254780717|r 281 LMRSMSNSI 289 (294) Q Consensus 281 ~m~~n~~~l 289 (294) .|......| T Consensus 280 ~l~~l~k~l 288 (289) T TIGR03659 280 ALDELKKML 288 (289) T ss_pred HHHHHHHHH T ss_conf 999999974 No 30 >TIGR00289 TIGR00289 conserved hypothetical protein TIGR00289; InterPro: IPR005237 This family of conserved hypothetical proteins has no known function. Homologous proteins related to MJ0570 of Methanococcus jannaschii include both the apparent orthologs in the family, the much longer protein YLR143W from Saccharomyces cerevisiae (Baker's yeast), and second homologous proteins from Archaeoglobus fulgidus and Pyrococcus horikoshii that appear to represent a second orthologous group.. Probab=75.16 E-value=6.2 Score=19.95 Aligned_cols=43 Identities=9% Similarity=0.161 Sum_probs=18.3 Q ss_pred CCCHHHHHHHHHHHHHCCCCEEE---EECCCCHHHHHHHHHHHCCC Q ss_conf 44978899999999754993899---81888978999999971990 Q gi|254780717|r 218 FMGAASLRNIRSKIISDKISCLF---YGPEFDSKIIRSITNDTGVM 260 (294) Q Consensus 218 ~~s~~~l~~~~~~ik~~~v~~if---~e~~~~~~~~~~ia~e~~~~ 260 (294) |--..+|+.-++.+.+.++.+|. .+++.-++..+++|+++|.+ T Consensus 72 EkEVedLag~l~~lderGvea~~~Ga~~S~YQK~Rid~vCrelGlk 117 (227) T TIGR00289 72 EKEVEDLAGQLEELDERGVEAVVIGAIESEYQKSRIDKVCRELGLK 117 (227) T ss_pred CCHHHHHHHHHHHHHHCCCCEEEECCEECCHHHHHHHHHHHHHCCH T ss_conf 2117666215544553054547885341011233577887771410 No 31 >PRK03095 prsA peptidylprolyl isomerase; Reviewed Probab=75.00 E-value=4.1 Score=21.11 Aligned_cols=29 Identities=24% Similarity=0.258 Sum_probs=17.3 Q ss_pred CHHHHHH-HHHHHHHHHCCCCCCCEEEEEE Q ss_conf 9158999-9999999600132388059997 Q gi|254780717|r 1 MKNFLII-LIFLFFILSSVARAGSLQVVVS 29 (294) Q Consensus 1 Mkk~l~~-~~~~~~~l~~~~~~~~~~Vv~s 29 (294) |||.++. .+++++.|++|+....-.|+++ T Consensus 1 MKK~~la~~~~svl~LaaC~~~~~~~vat~ 30 (287) T PRK03095 1 MKKAMLALAATSVIALSACGTSSSDKIVTS 30 (287) T ss_pred CCHHHHHHHHHHHHHHHHCCCCCCCEEEEE T ss_conf 907899999999999985168998758984 No 32 >pfam00148 Oxidored_nitro Nitrogenase component 1 type Oxidoreductase. Probab=73.84 E-value=7.5 Score=19.44 Aligned_cols=172 Identities=15% Similarity=0.137 Sum_probs=89.1 Q ss_pred CEEEEEECCCCCCCCCCCCHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 41788705899835335680245665025789996766103568999731455767997513544621001211123321 Q gi|254780717|r 45 GTPALLVKGASSPHEYSLRISEAMMLENADIIFWLGAEMESFLVKPLHSLNKQSNVVTLSHSPDLHRILLRDNHSHFHDS 124 (294) Q Consensus 45 ~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 124 (294) .+|.+.+++|. +..|++++.+|.+-+..+..+-..+.+.++...+ .+-+... T Consensus 173 i~v~~~~~~~~-------~~~~i~~~~~A~~ni~~~~~~g~~~a~~L~~~~g-~P~i~~~-------------------- 224 (398) T pfam00148 173 IEVNAVFPGGT-------TLEDLRRAGNAALNIVLCPESGRYAAKYLEERFG-VPYIELG-------------------- 224 (398) T ss_pred CCEEEEECCCC-------CHHHHHHHCCCCEEEEECHHHHHHHHHHHHHHHC-CCEEECC-------------------- T ss_conf 93578818999-------8799986423859999888888999999999829-9768447-------------------- Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEE-EECCCH----HHH Q ss_conf 224756776555567899999999742186565889989999863122235899985030027814-620621----577 Q gi|254780717|r 125 EADDLHLWLNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKKI-IVFHEA----YRY 199 (294) Q Consensus 125 ~~~dpH~Wl~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~~-i~~H~a----~~Y 199 (294) .=+.+.+...+...|++.+.+- ...+.+.+...++...+.+. +..+ .++.+ |...+. +.- T Consensus 225 ------~P~G~~~T~~~l~~i~~~~g~~--~~~~~i~~e~~~~~~~~~~~----~~~l---~gkrv~v~g~~~~~~~l~~ 289 (398) T pfam00148 225 ------SPIGIEATDRFLRALAELLGKE--VAPEVIAEERGRLLDALADY----HEYL---AGKRVAIYGDPDLAWGLAR 289 (398) T ss_pred ------CCCCHHHHHHHHHHHHHHHCCC--CCHHHHHHHHHHHHHHHHHH----HHHH---CCCEEEEECCCHHHHHHHH T ss_conf ------7677789999999999996899--87789999999999999999----8873---7977999888457999999 Q ss_pred HHHHCCCCEECCCC--CCCC-CC-------------CHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCEEE Q ss_conf 89874981200346--7754-44-------------97889999999975499389981888978999999971990899 Q gi|254780717|r 200 FASHYNLSIVTFPM--SHSV-FM-------------GAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMSAI 263 (294) Q Consensus 200 f~~~~gl~~~~~~~--~~~~-~~-------------s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~~~ 263 (294) |.+++|+++..+.. .++. .. -..+..++.+.+++.+...+|..+. -+.++++.+++.+. T Consensus 290 ~L~ElG~~v~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~d~~e~~~~i~~~~pDliig~s~-----~~~~a~~~~ip~i~ 364 (398) T pfam00148 290 FLEELGMEVVAVGTGTGHPDDYERLKALLDGTLRVIDDADLEELEELIKELKPDLLIGNSK-----ERYLAKKLGIPLVR 364 (398) T ss_pred HHHHCCCEEEEEEECCCCHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHCCCCEEEECHH-----HHHHHHHCCCCEEE T ss_conf 9998799799999678987889989985478977997799999999987269999998917-----89999973998898 Q ss_pred E Q ss_conf 8 Q gi|254780717|r 264 L 264 (294) Q Consensus 264 l 264 (294) + T Consensus 365 ~ 365 (398) T pfam00148 365 I 365 (398) T ss_pred E T ss_conf 2 No 33 >cd01965 Nitrogenase_MoFe_beta_like Nitrogenase_MoFe_beta_like: Nitrogenase MoFe protein, beta subunit_like. The nitrogenase enzyme catalyzes the ATP-dependent reduction of dinitrogen (N2) to ammonia. This group contains the beta subunits of component 1 of the three known genetically distinct types of nitrogenase systems: a molybdenum-dependent nitrogenase (Mo-nitrogenase), a vanadium-dependent nitrogenase (V-nitrogenase), and an iron-only nitrogenase (Fe-nitrogenase). These nitrogenase systems consist of component 1 (MoFe protein, VFe protein or, FeFe protein respectively) and, component 2 (Fe protein). The most widespread and best characterized of these systems is the Mo-nitrogenase. MoFe is an alpha2beta2 tetramer, the alternative nitrogenases are alpha2beta2delta2 hexamers having alpha and beta subunits similar to the alpha and beta subunits of MoFe. For MoFe, each alphabeta pair contains one P-cluster (at the alphabeta interface) and, one molecule of iron molybdenum cofactor (Fe Probab=73.21 E-value=7.7 Score=19.35 Aligned_cols=117 Identities=10% Similarity=0.061 Sum_probs=66.9 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECC-C----HHHHHHHHCCCC Q ss_conf 65555678999999997421865658899899998631222358999850300278146206-2----157789874981 Q gi|254780717|r 133 LNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKKIIVFH-E----AYRYFASHYNLS 207 (294) Q Consensus 133 l~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~~i~~H-~----a~~Yf~~~~gl~ 207 (294) +-+.+...+...|++.+.+-.|+ .++.-..++.+.+.+.. .. ..++.++++- + ++.-|.++.|++ T Consensus 255 ~G~~~T~~~l~~la~~~g~~~~~---~~~~er~r~~d~~~d~~----~~---l~gkrvai~~~~~~~~~l~~~L~elG~~ 324 (428) T cd01965 255 IGLKATDEFLRALSKLSGKPIPE---ELERERGRLLDAMLDSH----FY---LGGKRVAIAGDPDLLLGLSRFLLEMGAE 324 (428) T ss_pred CCHHHHHHHHHHHHHHHCCCCHH---HHHHHHHHHHHHHHHHH----HH---CCCCEEEEECCCHHHHHHHHHHHHCCCE T ss_conf 46799999999999984899559---99999999999999999----86---0797899988818899999999985995 Q ss_pred EECC--CC-CCC----------------CCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCEEEE Q ss_conf 2003--46-775----------------444978899999999754993899818889789999999719908998 Q gi|254780717|r 208 IVTF--PM-SHS----------------VFMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMSAIL 264 (294) Q Consensus 208 ~~~~--~~-~~~----------------~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~~~l 264 (294) +..+ .. ++. ......|+.++.+.+++.+...+|..+ .-+.++++.+++.+.+ T Consensus 325 ~~~~~~~t~~~~~~~~~~~~~~~~~~~~~v~~~~D~~~le~~i~~~~~dliig~s-----~~~~~A~~l~iP~~~~ 395 (428) T cd01965 325 PVAAVTGTDNPPFEKRMELLASLEGIPAEVVFVGDLWDLESLAKEEPVDLLIGNS-----HGRYLARDLGIPLVRV 395 (428) T ss_pred EEEEEEECCCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHCCCCEEEECH-----HHHHHHHHCCCCEEEE T ss_conf 6799981799247788998766507996399769999999999747999999785-----7899999859987994 No 34 >COG1464 NlpA ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism] Probab=72.95 E-value=6 Score=20.04 Aligned_cols=143 Identities=15% Similarity=0.173 Sum_probs=62.7 Q ss_pred CHHH--HHHHHHHHHHHHCCCCC-----CCEEEEEECHHHHHHHHHHHCC---C--CEEEEEECCCCCCCCCCCCHHHHH Q ss_conf 9158--99999999996001323-----8805999722899999997079---4--417887058998353356802456 Q gi|254780717|r 1 MKNF--LIILIFLFFILSSVARA-----GSLQVVVSIKPIHSIVSCIMQG---I--GTPALLVKGASSPHEYSLRISEAM 68 (294) Q Consensus 1 Mkk~--l~~~~~~~~~l~~~~~~-----~~~~Vv~s~~pl~~lv~~I~gd---~--~~V~~L~~~g~dpH~y~~tp~d~~ 68 (294) |+++ +.+++.+++++++|..+ .+++|.+|..|-+.+.+.+... . .+++...=.+ | -.|... T Consensus 1 m~~~~~~~~~~~~~l~~~~~~~~~~~~~~~I~vg~~~~p~a~ile~~~k~~~~k~Gi~l~i~~FtD-----Y-~~PN~A- 73 (268) T COG1464 1 MKKLKKLALALVALLALAACGAAAAKATKTIKVGATPGPHAEILEVVVKPALKKKGLDLKIVEFTD-----Y-VQPNEA- 73 (268) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCEEEEEEECC-----C-CCHHHH- T ss_conf 943889999999999997522311345771799636896599999998778885596599998148-----7-665578- Q ss_pred HHCCCCEEEEECCCCCHHHHHHHHHCCCCCCEEEEEECCCCCCCC-CCCCCCC-CCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 650257899967661035689997314557679975135446210-0121112-33212247567765555678999999 Q gi|254780717|r 69 MLENADIIFWLGAEMESFLVKPLHSLNKQSNVVTLSHSPDLHRIL-LRDNHSH-FHDSEADDLHLWLNPLNVQYIAHVIA 146 (294) Q Consensus 69 ~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~i~~~~~~~~~~~~-~~~~~~~-~~~~~~~dpH~Wl~p~n~~~~a~~I~ 146 (294) |.+-|+= .|--..-|+++...+.... +++.++... +.++. ....... ..-.++--=-+=.||.|-.+....+. T Consensus 74 -L~~gdiD-aN~FQH~pyL~~~~k~~~~--~Lv~vg~~~-i~Pmg~YSkk~ksl~el~~GatIaiPNDpsN~gRAL~lL~ 148 (268) T COG1464 74 -LADGDID-ANAFQHKPYLDQFNKEHGG--KLVAVGNTH-IEPMGLYSKKYKSLAELKDGATIAIPNDPTNEGRALLLLQ 148 (268) T ss_pred -HHCCCCC-CHHHHCHHHHHHHHHHCCC--CEEEEEEEE-ECCCEECCHHCCCHHHCCCCCEEECCCCCCCHHHHHHHHH T ss_conf -7668846-0012146889999997399--789875577-6142501220086846799998988789873567999999 Q ss_pred HH-HHHCCCC Q ss_conf 99-7421865 Q gi|254780717|r 147 ME-LIKKDPR 155 (294) Q Consensus 147 ~~-L~~~dP~ 155 (294) ++ |+++.+. T Consensus 149 ~aGLIkLk~~ 158 (268) T COG1464 149 KAGLIKLKDG 158 (268) T ss_pred HCCCEEECCC T ss_conf 8796797688 No 35 >pfam11897 DUF3417 Protein of unknown function (DUF3417). This family of proteins are functionally uncharacterized. This protein is found in bacteria and archaea. Proteins in this family are typically between 145 to 860 amino acids in length. This protein is found associated with pfam00343. This protein has a conserved AYF sequence motif. Probab=72.14 E-value=2.7 Score=22.23 Aligned_cols=74 Identities=20% Similarity=0.251 Sum_probs=36.6 Q ss_pred CCCHHHHHHH-HHHHHHHHHHHHH-HHH--CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCHHHHHH Q ss_conf 2475677655-5567899999999-742--18656588998999986312223589998503002781462062157789 Q gi|254780717|r 126 ADDLHLWLNP-LNVQYIAHVIAME-LIK--KDPRNKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKKIIVFHEAYRYFA 201 (294) Q Consensus 126 ~~dpH~Wl~p-~n~~~~a~~I~~~-L~~--~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~~i~~H~a~~Yf~ 201 (294) ..||..|-.. .|-+.+...+.+. |.+ .||.....|+.=.++|..-|.. ..++... .+. ...+...||+ T Consensus 39 ~iDp~lW~~~~hNPv~~L~~v~~~rL~~la~D~~f~~~~~~v~~~~~~Yl~~-~~w~~~~-~~~------~~~~~VAYFS 110 (119) T pfam11897 39 RLDPELWEETGHNPVRLLGEVSQERLEELAADPEFLARYDRVAADFDEYLSA-PTWFQRN-PGD------EGPRPIAYFS 110 (119) T ss_pred HHCHHHHHHCCCCHHHHHHHCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHCC-CCCCCCC-CCC------CCCCCEEEEC T ss_conf 9699999876899799998689999999956999999999999999999868-8643235-777------8999569971 Q ss_pred HHCCCC Q ss_conf 874981 Q gi|254780717|r 202 SHYNLS 207 (294) Q Consensus 202 ~~~gl~ 207 (294) -+|||. T Consensus 111 ~EfGlh 116 (119) T pfam11897 111 MEFGLH 116 (119) T ss_pred CCCCCC T ss_conf 030443 No 36 >cd03800 GT1_Sucrose_synthase This family is most closely related to the GT1 family of glycosyltransferases. The sucrose-phosphate synthases in this family may be unique to plants and photosynthetic bacteria. This enzyme catalyzes the synthesis of sucrose 6-phosphate from fructose 6-phosphate and uridine 5'-diphosphate-glucose, a key regulatory step of sucrose metabolism. The activity of this enzyme is regulated by phosphorylation and moderated by the concentration of various metabolites and light. Probab=71.87 E-value=8.3 Score=19.15 Aligned_cols=18 Identities=0% Similarity=0.012 Sum_probs=7.7 Q ss_pred HHHHHHHHHCCCCEEEEE Q ss_conf 999999970794417887 Q gi|254780717|r 33 IHSIVSCIMQGIGTPALL 50 (294) Q Consensus 33 l~~lv~~I~gd~~~V~~L 50 (294) +..+++.++.--.+|+.+ T Consensus 27 v~~La~~L~~~GH~V~V~ 44 (398) T cd03800 27 VLELARALARLGHEVDIF 44 (398) T ss_pred HHHHHHHHHHCCCEEEEE T ss_conf 999999999869969999 No 37 >cd03820 GT1_amsD_like This family is most closely related to the GT1 family of glycosyltransferases. AmSD in Erwinia amylovora has been shown to be involved in the biosynthesis of amylovoran, the acidic exopolysaccharide acting as a virulence factor. This enzyme may be responsible for the formation of galactose alpha-1,6 linkages in amylovoran. Probab=71.72 E-value=5.4 Score=20.35 Aligned_cols=19 Identities=16% Similarity=0.357 Sum_probs=10.4 Q ss_pred CCHHHHHHHHHHHCCCEEEEC Q ss_conf 897899999997199089980 Q gi|254780717|r 245 FDSKIIRSITNDTGVMSAILD 265 (294) Q Consensus 245 ~~~~~~~~ia~e~~~~~~~ld 265 (294) ++-..+|.++ +|.+++..| T Consensus 265 fgl~~lEAma--~G~PvIas~ 283 (348) T cd03820 265 FPMVLLEAMA--FGLPVISFD 283 (348) T ss_pred CCCHHHHHHH--CCCCEEEEC T ss_conf 8708999998--699999967 No 38 >cd03817 GT1_UGDG_like This family is most closely related to the GT1 family of glycosyltransferases. UDP-glucose-diacylglycerol glucosyltransferase (UGDG; also known as 1,2-diacylglycerol 3-glucosyltransferase) catalyzes the transfer of glucose from UDP-glucose to 1,2-diacylglycerol forming 3-D-glucosyl-1,2-diacylglycerol. Probab=71.20 E-value=7.8 Score=19.32 Aligned_cols=10 Identities=20% Similarity=0.106 Sum_probs=4.6 Q ss_pred CCCCEEEEEC Q ss_conf 0257899967 Q gi|254780717|r 71 ENADIIFWLG 80 (294) Q Consensus 71 ~~Adlii~~G 80 (294) .+.|+|...+ T Consensus 83 ~~~DvIh~~~ 92 (374) T cd03817 83 LGPDIVHTHT 92 (374) T ss_pred CCCCEEEECC T ss_conf 6999999878 No 39 >PRK10954 periplasmic protein disulfide isomerase I; Provisional Probab=70.72 E-value=8.8 Score=18.99 Aligned_cols=54 Identities=22% Similarity=0.328 Sum_probs=31.0 Q ss_pred CHHHHHHHHHHHHHHHCCCC---CCCEEEEEECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCH Q ss_conf 91589999999999600132---38805999722899999997079441788705899835335680 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVAR---AGSLQVVVSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRI 64 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~---~~~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp 64 (294) |||++++++++++.+++.++ +|+=-.+.. .|+ .+..+|.=.. .=.|||=|+.-| T Consensus 1 mkk~~~~~~~llls~~a~Aa~~~eG~~Y~~L~-~p~--------~~~~eV~EfF-sy~CpHCy~fEp 57 (207) T PRK10954 1 MKKIWLALAGMVLAFSASAAQFTDGKQYTTLD-KPV--------AGAPQVLEFF-SFYCPHCYQFEE 57 (207) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCEECC-CCC--------CCCCCEEEEE-EECCCCHHHCCH T ss_conf 91599999999999772250354899755888-888--------9997089999-746972551070 No 40 >PRK11063 metQ DL-methionine transporter substrate-binding subunit; Provisional Probab=70.26 E-value=7.2 Score=19.55 Aligned_cols=144 Identities=11% Similarity=0.043 Sum_probs=67.4 Q ss_pred HHHHH-HHHHHHHHHHCCCCC----CCEEEEEECHHHHHHHHHHHC---CC--CEEEEEECCCCCCCCCCCCHHHHHHHC Q ss_conf 15899-999999996001323----880599972289999999707---94--417887058998353356802456650 Q gi|254780717|r 2 KNFLI-ILIFLFFILSSVARA----GSLQVVVSIKPIHSIVSCIMQ---GI--GTPALLVKGASSPHEYSLRISEAMMLE 71 (294) Q Consensus 2 kk~l~-~~~~~~~~l~~~~~~----~~~~Vv~s~~pl~~lv~~I~g---d~--~~V~~L~~~g~dpH~y~~tp~d~~~l~ 71 (294) |.|+. ..+++.+.+++|++. ..++|.+|-.|..++.+.+.. ++ .+|+...=... ..|.....=. T Consensus 6 k~~~~~~~l~~~l~l~gCg~~~~~~~~ikVG~~~gp~~ei~e~~~~~~~ek~G~~veiv~FsDy------~~pN~AL~~G 79 (271) T PRK11063 6 KTFAAVGALIGSLALVGCGQDEKDPNHIKVGVIVGAEQQVAEVAQKVAKEKYGLDVELVTFNDY------VLPNEALSKG 79 (271) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHCCCEEEEEEECCC------HHHHHHHHCC T ss_conf 9999999999999750058985679818998468987999999888888607976899993683------2155898679 Q ss_pred CCCEEEEECCCCCHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCCCC-CCCCCCCCHHHHHHHHHHHHHHHHHHHH-H Q ss_conf 2578999676610356899973145576799751354462100121112-3321224756776555567899999999-7 Q gi|254780717|r 72 NADIIFWLGAEMESFLVKPLHSLNKQSNVVTLSHSPDLHRILLRDNHSH-FHDSEADDLHLWLNPLNVQYIAHVIAME-L 149 (294) Q Consensus 72 ~Adlii~~G~~~E~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~-~~~~~~~dpH~Wl~p~n~~~~a~~I~~~-L 149 (294) +-|+=+ -.+.+||+..-+..+ .+.+.+....-.+-.....-+.. ..-..+--=-+=-||.|..+...-+.++ | T Consensus 80 ~iDaN~---fQH~~yL~~~nk~~g--~~L~~v~~~~~~P~glYS~K~ksl~elp~Ga~IaIPND~sN~~RAL~lL~~aGL 154 (271) T PRK11063 80 DIDANA---FQHKPYLDQQIKDRG--YKLVAVGNTFVYPIAGYSKKIKSLDELQDGSQVAVPNDPTNLGRSLLLLQKVGL 154 (271) T ss_pred CCCHHH---HHCHHHHHHHHHHCC--CCEEEEEEEEEEEEEEEECCCCCHHHCCCCCEEEECCCCCHHHHHHHHHHHCCC T ss_conf 844234---557999999999779--957998667784158642365886475699999804885609999999998898 Q ss_pred HHCCCCC Q ss_conf 4218656 Q gi|254780717|r 150 IKKDPRN 156 (294) Q Consensus 150 ~~~dP~~ 156 (294) +++.|.. T Consensus 155 IkLk~~~ 161 (271) T PRK11063 155 IKLKDGV 161 (271) T ss_pred EEECCCC T ss_conf 8977999 No 41 >cd03814 GT1_like_2 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=70.19 E-value=9 Score=18.92 Aligned_cols=23 Identities=9% Similarity=0.118 Sum_probs=11.6 Q ss_pred HHHHHHHHHCCCCEEEEEECCCC Q ss_conf 99999997079441788705899 Q gi|254780717|r 33 IHSIVSCIMQGIGTPALLVKGAS 55 (294) Q Consensus 33 l~~lv~~I~gd~~~V~~L~~~g~ 55 (294) +.++++.+..--.+|+.+.+... T Consensus 20 ~~~la~~L~~~Gh~V~v~t~~~~ 42 (364) T cd03814 20 LQRLVEHLRARGHEVLVIAPGPF 42 (364) T ss_pred HHHHHHHHHHCCCEEEEEECCCC T ss_conf 99999999977998999978997 No 42 >cd03801 GT1_YqgM_like This family is most closely related to the GT1 family of glycosyltransferases and named after YqgM in Bacillus licheniformis about which little is known. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. Probab=69.07 E-value=9.5 Score=18.77 Aligned_cols=20 Identities=5% Similarity=0.061 Sum_probs=9.9 Q ss_pred HHHHHHHHHCCCCEEEEEEC Q ss_conf 99999997079441788705 Q gi|254780717|r 33 IHSIVSCIMQGIGTPALLVK 52 (294) Q Consensus 33 l~~lv~~I~gd~~~V~~L~~ 52 (294) +..+++.+...-.+|+.+.+ T Consensus 20 ~~~la~~L~~~G~~V~vit~ 39 (374) T cd03801 20 VLELARALAARGHEVTVLTP 39 (374) T ss_pred HHHHHHHHHHCCCEEEEEEE T ss_conf 99999999977998999960 No 43 >cd03823 GT1_ExpE7_like This family is most closely related to the GT1 family of glycosyltransferases. ExpE7 in Sinorhizobium meliloti has been shown to be involved in the biosynthesis of galactoglucans (exopolysaccharide II). Probab=67.45 E-value=8.3 Score=19.16 Aligned_cols=20 Identities=25% Similarity=0.303 Sum_probs=9.7 Q ss_pred CCCHHHHHHHHHHHCCCEEEEC Q ss_conf 8897899999997199089980 Q gi|254780717|r 244 EFDSKIIRSITNDTGVMSAILD 265 (294) Q Consensus 244 ~~~~~~~~~ia~e~~~~~~~ld 265 (294) .++-..+|..+ +|.+++..| T Consensus 275 ~fg~~~lEAma--~G~PvIas~ 294 (359) T cd03823 275 NFPLVIREALA--AGVPVIASD 294 (359) T ss_pred CCCHHHHHHHH--CCCCEEECC T ss_conf 77479999998--299899888 No 44 >cd01976 Nitrogenase_MoFe_alpha Nitrogenase_MoFe_alpha_II: Nitrogenase MoFe protein, beta subunit. A group of proteins similar to the alpha subunit of the MoFe protein of the molybdenum (Mo-) nitrogenase. The nitrogenase enzyme catalyzes the ATP-dependent reduction of dinitrogen to ammonia. The Mo-nitrogenase is the most widespread and best characterized of these systems. Mo-nitrogenase consists of the MoFe protein (component 1) and the Fe protein (component 2). MoFe is an alpha2beta2 tetramer. Each alphabeta pair of MoFe contains one P-cluster (at the alphabeta interface) and, one molecule of iron molybdenum cofactor (FeMoco) contained within the alpha subunit. The Fe protein contains a single [4Fe-4S] cluster. Electrons are transferred from the [4Fe-4S] cluster of the Fe protein to the P-cluster of the MoFe and in turn to FeMoCo, the site of substrate reduction. Probab=66.79 E-value=11 Score=18.47 Aligned_cols=142 Identities=8% Similarity=0.006 Sum_probs=71.4 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH-HHHHHHHHHHHHCCCCCCCEEEEC---CCHH--HHHHHHCCC Q ss_conf 6555567899999999742186565889989999863-122235899985030027814620---6215--778987498 Q gi|254780717|r 133 LNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKN-QLSQLDKELHSILQPVEKKKIIVF---HEAY--RYFASHYNL 206 (294) Q Consensus 133 l~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~-~L~~l~~~~~~~~~~~~~~~~i~~---H~a~--~Yf~~~~gl 206 (294) +-+.+.......|++.+.+ + ..++++++.+ +...+...+..-...++++.++.+ ..++ .-..++.|+ T Consensus 252 ~Gi~~T~~~Lr~IA~~fG~------e-l~~~~E~lI~~e~~~~~~~l~~yr~~L~GKrv~i~~Gg~~~~~~i~~~~eLGm 324 (421) T cd01976 252 FGPTKIAESLRKIAAYFDD------E-ITAKTEEVIAEYKPAMEAVIAKYRPRLEGKTVMLYVGGLRPRHYIGAYEDLGM 324 (421) T ss_pred CCHHHHHHHHHHHHHHHCH------H-HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHCCC T ss_conf 5879999999999998590------8-88999999999999999999999987089889998998746899999998798 Q ss_pred CEECCCC--CC-----------C------CCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCEEEECCC Q ss_conf 1200346--77-----------5------444978899999999754993899818889789999999719908998053 Q gi|254780717|r 207 SIVTFPM--SH-----------S------VFMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMSAILDPE 267 (294) Q Consensus 207 ~~~~~~~--~~-----------~------~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~~~ld~~ 267 (294) ++++.+. .+ + ..+++ .++.+.+++.+...++... .-+.++.+.|++.+.++.. T Consensus 325 evV~~g~~~~~~~Dye~~~~~~~~~~~iidd~~~---~Ele~~i~~~~pDl~l~g~-----ker~ia~K~gIP~~~~~~~ 396 (421) T cd01976 325 EVVGTGYEFAHRDDYERTEVIPKEGTLLYDDVTH---YELEEFVKRLKPDLIGSGI-----KEKYVFQKMGIPFRQMHSW 396 (421) T ss_pred EEEEEEEECCCHHHHHHHHHHCCCCCEEEECCCH---HHHHHHHHHCCCCEEEECC-----CCCHHEEECCCCEEECCCC T ss_conf 8999731227476799998508988289727999---9999999853999999547-----6202004448885766886 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 3678778167899999999999874 Q gi|254780717|r 268 GMLIAEGPELYFQLMRSMSNSIAKN 292 (294) Q Consensus 268 ~~~~~~~~~~Y~~~m~~n~~~l~k~ 292 (294) ....+. .+|.-|+. .+..+..+ T Consensus 397 ~~~~py--aGy~G~v~-far~i~~~ 418 (421) T cd01976 397 DYSGPY--HGFDGFAI-FARDMDMA 418 (421) T ss_pred CCCCCC--CCHHHHHH-HHHHHHHH T ss_conf 657776--44767999-99999998 No 45 >PRK10866 outer membrane protein assembly complex subunit YfiO; Provisional Probab=66.77 E-value=5.5 Score=20.26 Aligned_cols=208 Identities=10% Similarity=0.028 Sum_probs=81.8 Q ss_pred CHHH--HHHHHHHHHHHHCCCCCCCEEEEEECHHHHHHHHHHHCC--CCE-------EEEEECCCCCCCCCCCCHHHHHH Q ss_conf 9158--999999999960013238805999722899999997079--441-------78870589983533568024566 Q gi|254780717|r 1 MKNF--LIILIFLFFILSSVARAGSLQVVVSIKPIHSIVSCIMQG--IGT-------PALLVKGASSPHEYSLRISEAMM 69 (294) Q Consensus 1 Mkk~--l~~~~~~~~~l~~~~~~~~~~Vv~s~~pl~~lv~~I~gd--~~~-------V~~L~~~g~dpH~y~~tp~d~~~ 69 (294) |+|+ ++++++++++++||++..+...-.|..-++.-+..-..+ ... +....|-+. -..-.. T Consensus 1 m~~~k~l~~~~~l~l~l~gCs~~~~~~~~~~~~~~Y~~A~~~~~~~~y~~A~~~fe~l~~~yP~s~--------~A~qA~ 72 (243) T PRK10866 1 MTRMKYLVAAATLSLFLAGCSGSKEEVPDNPPNEIYATAQQKLQDGNWKQAITQLEALDNRYPFGP--------YSQQVQ 72 (243) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCH--------HHHHHH T ss_conf 945899999999999998768998888899999999999999987899999999999998689976--------789999 Q ss_pred HCCCCEEEEECCCCCH---HHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 5025789996766103---5689997314557679975135446210012111233212247567765555678999999 Q gi|254780717|r 70 LENADIIFWLGAEMES---FLVKPLHSLNKQSNVVTLSHSPDLHRILLRDNHSHFHDSEADDLHLWLNPLNVQYIAHVIA 146 (294) Q Consensus 70 l~~Adlii~~G~~~E~---~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~dpH~Wl~p~n~~~~a~~I~ 146 (294) |.-|+..+-+| +++. -.++.++..+.+.++.=+.-..++-.....+.. -..--..| ..=.||..+....+... T Consensus 73 L~~ayayy~~~-~y~~A~~~~~rFi~lyP~~~~v~Ya~Yl~gl~~~~~~~~~--~~~~~~~~-~~~rD~~~~~~A~~~f~ 148 (243) T PRK10866 73 LDLIYAYYKNA-DLPLAQAAIDRFMRLNPTHPNIDYVMYMRGLTNMALDDSA--LQGFFGVD-RSDRDPQHARAAFSDFS 148 (243) T ss_pred HHHHHHHHHHC-CHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCHHH--HHHHHCCC-HHHCCHHHHHHHHHHHH T ss_conf 99999987307-7899999999999867899973899999889877401358--88874325-64307699999999999 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC--CCCCEEEECCCHHHHHHHHCCCCEECCCCCCCCCCCHHHH Q ss_conf 99742186565889989999863122235899985030--0278146206215778987498120034677544497889 Q gi|254780717|r 147 MELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHSILQP--VEKKKIIVFHEAYRYFASHYNLSIVTFPMSHSVFMGAASL 224 (294) Q Consensus 147 ~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~--~~~~~~i~~H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l 224 (294) .|++.+|+.+ |...+..-...+.+.-++-.-.++. .++..++..=+.|+++.+.| |+++..+.-| T Consensus 149 -~li~~yP~S~--Ya~dA~~r~~~lrn~LA~~El~Va~fY~kr~~y~AAi~R~~~vi~~Y----------p~t~~~~eAL 215 (243) T PRK10866 149 -KLVRGYPNSQ--YTTDATKRLVFLKDRLAKYEYSVAEYYTKRGAWVAVVNRVEGMLRDY----------PDTQATRDAL 215 (243) T ss_pred -HHHHHCCCCH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHC----------CCCCCHHHHH T ss_conf -9998789984--58999999999999999999999999999265699999999999768----------9996339999 Q ss_pred HHHHHHHHH Q ss_conf 999999975 Q gi|254780717|r 225 RNIRSKIIS 233 (294) Q Consensus 225 ~~~~~~ik~ 233 (294) ..+++--.. T Consensus 216 ~~l~eaY~~ 224 (243) T PRK10866 216 PLMENAYRQ 224 (243) T ss_pred HHHHHHHHH T ss_conf 999999998 No 46 >COG0075 Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase [Amino acid transport and metabolism] Probab=66.64 E-value=11 Score=18.46 Aligned_cols=56 Identities=11% Similarity=-0.001 Sum_probs=28.1 Q ss_pred CCCCEEEEEECHHHHHHHHHHHCCC-CEEEEEEC-CCCCCCCCCCCHHHHHHHCC----CCEEEEEC Q ss_conf 2388059997228999999970794-41788705-89983533568024566502----57899967 Q gi|254780717|r 20 RAGSLQVVVSIKPIHSIVSCIMQGI-GTPALLVK-GASSPHEYSLRISEAMMLEN----ADIIFWLG 80 (294) Q Consensus 20 ~~~~~~Vv~s~~pl~~lv~~I~gd~-~~V~~L~~-~g~dpH~y~~tp~d~~~l~~----Adlii~~G 80 (294) ..+..-+|..+-.+..--.+|+.-. .+|..+-. .|. .++|.+++..-+ .++++... T Consensus 78 ~pgdkVLv~~nG~FG~R~~~ia~~~g~~v~~~~~~wg~-----~v~p~~v~~~L~~~~~~~~V~~vH 139 (383) T COG0075 78 EPGDKVLVVVNGKFGERFAEIAERYGAEVVVLEVEWGE-----AVDPEEVEEALDKDPDIKAVAVVH 139 (383) T ss_pred CCCCEEEEEECCHHHHHHHHHHHHHCCCEEEEECCCCC-----CCCHHHHHHHHHCCCCCCEEEEEE T ss_conf 89986999827807779999999829966998478888-----799999999985289950899981 No 47 >PRK12450 foldase protein PrsA; Reviewed Probab=66.62 E-value=6 Score=20.02 Aligned_cols=29 Identities=14% Similarity=0.197 Sum_probs=17.8 Q ss_pred CHHHHHHHH--HHHHHHHCCCCCC-CEEEEEE Q ss_conf 915899999--9999960013238-8059997 Q gi|254780717|r 1 MKNFLIILI--FLFFILSSVARAG-SLQVVVS 29 (294) Q Consensus 1 Mkk~l~~~~--~~~~~l~~~~~~~-~~~Vv~s 29 (294) |||+++.++ +.++.|++|++.+ .-.|+++ T Consensus 4 mKK~~~~~~~~~svl~LaAC~s~~~~~~v~t~ 35 (309) T PRK12450 4 MNKLITGVVTLATVVTLSACQSSHNNTKLVSM 35 (309) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEE T ss_conf 99999999999999999860589998368983 No 48 >cd03805 GT1_ALG2_like This family is most closely related to the GT1 family of glycosyltransferases. ALG2, a 1,3-mannosyltransferase, in yeast catalyzes the mannosylation of Man(2)GlcNAc(2)-dolichol diphosphate and Man(1)GlcNAc(2)-dolichol diphosphate to form Man(3)GlcNAc(2)-dolichol diphosphate. A deficiency of this enzyme causes an abnormal accumulation of Man1GlcNAc2-PP-dolichol and Man2GlcNAc2-PP-dolichol, which is associated with a type of congenital disorders of glycosylation (CDG), designated CDG-Ii, in humans. Probab=66.37 E-value=11 Score=18.42 Aligned_cols=16 Identities=6% Similarity=0.083 Sum_probs=8.4 Q ss_pred HHHHHHHHHCCCEEEECC Q ss_conf 999999971990899805 Q gi|254780717|r 249 IIRSITNDTGVMSAILDP 266 (294) Q Consensus 249 ~~~~ia~e~~~~~~~ld~ 266 (294) .+|..+ +|.+++..|. T Consensus 316 ~lEAma--~G~PVVasd~ 331 (392) T cd03805 316 PLEAMY--AGKPVIACNS 331 (392) T ss_pred HHHHHH--CCCCEEEECC T ss_conf 999997--7999999489 No 49 >PRK03002 prsA peptidylprolyl isomerase; Reviewed Probab=66.12 E-value=7.3 Score=19.52 Aligned_cols=31 Identities=19% Similarity=0.356 Sum_probs=16.2 Q ss_pred CHH-HH--HHHHHHHHHHHCCCCCCCEEEEEECH Q ss_conf 915-89--99999999960013238805999722 Q gi|254780717|r 1 MKN-FL--IILIFLFFILSSVARAGSLQVVVSIK 31 (294) Q Consensus 1 Mkk-~l--~~~~~~~~~l~~~~~~~~~~Vv~s~~ 31 (294) ||+ .+ ++.+++++.|++|++.+.-.+|+|+. T Consensus 1 mkkK~i~~~~~~~svl~LaaC~~~~~~~~VAtv~ 34 (285) T PRK03002 1 MRGKHIFIITALISILMLSACGQKNSSATVATAT 34 (285) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEC T ss_conf 9601899999999999999845799997799868 No 50 >cd01981 Pchlide_reductase_B Pchlide_reductase_B: B protein of the NB protein complex of Protochlorophyllide (Pchlide)_reductase. Pchlide reductase catalyzes the reductive formation of chlorophyllide (chlide) from protochlorophyllide (pchlide) during biosynthesis of chlorophylls and bacteriochlorophylls. This group contains both the light-independent Pchlide reductase (DPOR) and light-dependent Pchlide reductase (LPOR). Angiosperms contain only LPOR, cyanobacteria, algae and gymnosperms contain both DPOR and LPOR, primitive anoxygenic photosynthetic bacteria contain only DPOR. NB is structurally similar to the FeMo protein of nitrogenase, forming an N2B2 heterotetramer. N and B are homologous to the FeMo alpha and beta subunits respectively. Also in common with nitrogenase in vitro DPOR activity requires ATP hydrolysis and dithoionite or ferredoxin as electron donor. The NB protein complex may serve as a catalytic site for Pchlide reduction similar to MoFe for nitrogen reduction. Probab=65.24 E-value=11 Score=18.29 Aligned_cols=150 Identities=15% Similarity=0.099 Sum_probs=76.0 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHHHHH---HHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCH-----HHH-HHHH Q ss_conf 65555678999999997421865658899---89999863122235899985030027814620621-----577-8987 Q gi|254780717|r 133 LNPLNVQYIAHVIAMELIKKDPRNKIIYE---KNEEEFKNQLSQLDKELHSILQPVEKKKIIVFHEA-----YRY-FASH 203 (294) Q Consensus 133 l~p~n~~~~a~~I~~~L~~~dP~~~~~y~---~N~~~~~~~L~~l~~~~~~~~~~~~~~~~i~~H~a-----~~Y-f~~~ 203 (294) +-+.+...+...|++.+..-.|......+ .-..++...+..-+..... .-+.++.++++-|. +.- ++++ T Consensus 246 iGi~~T~~fL~~i~~~~g~e~~~e~~~~e~~~~~~~~~~~~~~d~~~~~~~--~~l~GKrvaI~gd~~~~~~l~~fl~~E 323 (430) T cd01981 246 IGVVATARFLREIQELLGIQIIPELVNVEPYIDSQTRWVSQSARSSRSIDS--QNLTGKRAFVFGDATHVAAATRILARE 323 (430) T ss_pred CCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HCCCCCEEEEECCCHHHHHHHHHHHHH T ss_conf 678999999999999958986233455788998877655577887655565--403797799987815899999999985 Q ss_pred CCCCEECCCCC--CCCC---C----------CHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCEEEEC-CC Q ss_conf 49812003467--7544---4----------9788999999997549938998188897899999997199089980-53 Q gi|254780717|r 204 YNLSIVTFPMS--HSVF---M----------GAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMSAILD-PE 267 (294) Q Consensus 204 ~gl~~~~~~~~--~~~~---~----------s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~~~ld-~~ 267 (294) .|++++..+.. +..+ . -..|..++.+.+++.++..+|..+ .-+.++++.|++.+.+. |+ T Consensus 324 LGm~vv~~gt~~~~~~~~~~~~~~~~~~~~~i~~D~~el~~~i~~~~pDlliG~s-----~ek~ia~k~giPlirigfPi 398 (430) T cd01981 324 MGFRVVGAGTYCKEDAKWFREQATGYCDEALITDDHTEVGDMIARTEPELIFGTQ-----MERHIGKRLDIPCAVISAPV 398 (430) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCCCEEEECC-----CHHHHHHHHCCCEEEECCCC T ss_conf 6976997067766468999998743698089958999999998624999999664-----18999998696968744871 Q ss_pred C------C-CCCCCCCCHHHHHHHHHHHH Q ss_conf 3------6-78778167899999999999 Q gi|254780717|r 268 G------M-LIAEGPELYFQLMRSMSNSI 289 (294) Q Consensus 268 ~------~-~~~~~~~~Y~~~m~~n~~~l 289 (294) . . ..--|.++-..+.+..+++| T Consensus 399 ~~~~~~~~~~P~~GY~Ga~~l~~~I~nal 427 (430) T cd01981 399 HIQNFPLGYRPFLGYEGTNVIADTVYNSL 427 (430) T ss_pred EECCCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 13144667899420276999999997763 No 51 >TIGR01511 ATPase-IB1_Cu copper-translocating P-type ATPase; InterPro: IPR006403 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. P-ATPases (sometime known as E1-E2 ATPases) (3.6.3.- from EC) are found in bacteria and in a number of eukaryotic plasma membranes and organelles . P-ATPases function to transport a variety of different compounds, including ions and phospholipids, across a membrane using ATP hydrolysis for energy. There are many different classes of P-ATPases, each of which transports a specific type of ion: H^+, Na^+, K^+, Mg^2+, Ca^2+, Ag^+ and Ag^2+, Zn^2+, Co^2+, Pb^2+, Ni^2+, Cd^2+, Cu^+ and Cu^2+. P-ATPases can be composed of one or two polypeptides, and can usually assume two main conformations called E1 and E2. This entry represents P-type ATPases that specialise in cation and copper transport; those transporting copper (3.6.3.4 from EC) have been classified as belonging to the IB subfamily , , . These proteins are involved in a variety of processes in both prokaryotes and eukaryotes. Some of these P-ATPases import copper into cells, such as in Arabidopsis sp., which require copper to create functional hormone receptors for ethylene signalling. Other P-ATPases are involved in copper export out of cells, such as occurs with the efflux of hepatic copper into the bile in eukaryotes. In bacteria, some of these P-ATPases play a role in osmotic adaptation. Certain copper ATPases have been implicated in both Wilson and Menkes diseases . More information about this protein can be found at Protein of the Month: ATP Synthases .; GO: 0004008 copper-exporting ATPase activity, 0005524 ATP binding, 0046872 metal ion binding, 0046873 metal ion transmembrane transporter activity, 0030001 metal ion transport, 0016020 membrane. Probab=64.70 E-value=12 Score=18.22 Aligned_cols=44 Identities=9% Similarity=0.201 Sum_probs=34.3 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCEE-EECCC Q ss_conf 8999999997549938998188897899999997199089-98053 Q gi|254780717|r 223 SLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMSA-ILDPE 267 (294) Q Consensus 223 ~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~~-~ld~~ 267 (294) +=+++++.+|+.++.|+.. ++-+++.++.+|+++|+.+. ++-|. T Consensus 375 ~a~~~i~~Lk~~Gi~~~mL-tGDN~~~A~a~A~~lGI~v~Aev~P~ 419 (545) T TIGR01511 375 EAKEVIQALKRRGIEPVML-TGDNRKTAKAVAKELGINVRAEVLPD 419 (545) T ss_pred HHHHHHHHHHHCCCEEEEE-ECCCHHHHHHHHHHHCCEEEECCCHH T ss_conf 6899999998759879998-66987999999997283333178807 No 52 >TIGR03156 GTP_HflX GTP-binding protein HflX. This protein family is one of a number of homologous small, well-conserved GTP-binding proteins with pleiotropic effects. Bacterial members are designated HflX, following the naming convention in Escherichia coli where HflX is encoded immediately downstream of the RNA chaperone Hfq, and immediately upstream of HflKC, a membrane-associated protease pair with an important housekeeping function. Over large numbers of other bacterial genomes, the pairing with hfq is more significant than with hflK and hlfC. The gene from Homo sapiens in this family has been named PGPL (pseudoautosomal GTP-binding protein-like). Probab=64.52 E-value=12 Score=18.20 Aligned_cols=74 Identities=11% Similarity=0.156 Sum_probs=39.5 Q ss_pred EECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHH------HHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCEE Q ss_conf 9722899999997079441788705899835335680245------6650257899967661035689997314557679 Q gi|254780717|r 28 VSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRISEA------MMLENADIIFWLGAEMESFLVKPLHSLNKQSNVV 101 (294) Q Consensus 28 ~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~------~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~i 101 (294) .+..=+..+++..+.+.+....--....||-.| ....-+ .+-.++|++|.++ .+.|--.+.++..- +.+++ T Consensus 17 ~~l~El~~L~~t~g~~vv~~~~q~~~~~~~~~~-~G~GKl~ei~~~~~~~~~~~vi~d~-~Lsp~Q~rnLe~~~-~~~V~ 93 (351) T TIGR03156 17 ESLEELAELARTAGAEVVGTVTQKRSRPDPATY-IGKGKVEEIAELVEALEADLVIFNH-ELSPSQERNLEKAL-GCRVI 93 (351) T ss_pred HHHHHHHHHHHHCCCEEEEEEEEECCCCCCCCE-ECCCHHHHHHHHHHHCCCCEEEECC-CCCHHHHHHHHHHH-CCEEE T ss_conf 559999999997899899999994489974200-1501999999999855999999989-79999999999986-98897 Q ss_pred EEE Q ss_conf 975 Q gi|254780717|r 102 TLS 104 (294) Q Consensus 102 ~~~ 104 (294) +-. T Consensus 94 DRt 96 (351) T TIGR03156 94 DRT 96 (351) T ss_pred ECH T ss_conf 108 No 53 >PRK11627 hypothetical protein; Provisional Probab=64.10 E-value=4.7 Score=20.72 Aligned_cols=33 Identities=18% Similarity=0.406 Sum_probs=18.8 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCEEEEEECHHHHHHH Q ss_conf 9158999999999960013238805999722899999 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARAGSLQVVVSIKPIHSIV 37 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~~~~~Vv~s~~pl~~lv 37 (294) |||+++. ++.+++|++|++. ++ .-++.|-..+. T Consensus 2 mkk~~~~-l~a~~lL~gCas~--p~-~l~l~P~~~~~ 34 (192) T PRK11627 2 LKKILFP-LVALFMLAGCATP--PT-TLEVSPKITLP 34 (192) T ss_pred HHHHHHH-HHHHHHHHHHCCC--CC-EEEECCCCCCC T ss_conf 2879999-9999999860699--76-37867887664 No 54 >TIGR01465 cobM_cbiF precorrin-4 C11-methyltransferase; InterPro: IPR006362 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CobM and CibF precorrin-4 C11-methyltransferase (2.1.1.133 from EC), both as stand-alone enzymes and when CobJ forms part of a bifunctional enzyme. In the aerobic pathway, CobM catalyses the methylation of precorrin-4 at C-11 to yield precorrin-5. The extruded acyl group is then removed in the subsequent step catalysed by CobF. In the anaerobic pathway, CibF catalyses the methylation of cobalt-precorrin-4 to cobalt-precoriin-5 . Nomenclature note: occasionally precorrin-4 C11-methyltransferase is one of two methyltransferases often referred to as precorrin-3 methylase (the other is precorrin-3 methylase (the other is precorrin-4 C11-methyltransferase, 2.1.1.131 from EC). ; GO: 0046026 precorrin-4 C11-methyltransferase activity, 0009236 cobalamin biosynthetic process. Probab=64.05 E-value=10 Score=18.57 Aligned_cols=38 Identities=24% Similarity=0.349 Sum_probs=23.4 Q ss_pred CHHHHHHHCCCCEEEEECC-CCCHHHHHHHHHCCCCCCEEEEE Q ss_conf 8024566502578999676-61035689997314557679975 Q gi|254780717|r 63 RISEAMMLENADIIFWLGA-EMESFLVKPLHSLNKQSNVVTLS 104 (294) Q Consensus 63 tp~d~~~l~~Adlii~~G~-~~E~~~~~~~~~~~~~~~~i~~~ 104 (294) |-+=.+-|++||+|+|=|- =- +++++...+++.+.+.+ T Consensus 15 TvkG~~lle~ADvilYAGSLV~----~~~L~~~r~~Ae~~~sA 53 (252) T TIGR01465 15 TVKGRKLLESADVILYAGSLVP----PELLAHCRPGAEVVNSA 53 (252) T ss_pred HHHHHHHHHCCCEEEECCCCCH----HHHHHHCCCCCEEEECC T ss_conf 5999988631997999687781----78997278988886050 No 55 >PRK13883 conjugal transfer protein TrbH; Provisional Probab=61.72 E-value=9.3 Score=18.83 Aligned_cols=20 Identities=25% Similarity=0.419 Sum_probs=12.7 Q ss_pred CHHHHHHHHHHHHHHHCCCCC Q ss_conf 915899999999996001323 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARA 21 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~ 21 (294) |||++++. ++.+.+++|+.. T Consensus 1 MrK~~~~a-ll~laL~gCAtt 20 (156) T PRK13883 1 MRKILSLA-LLALALGGCATT 20 (156) T ss_pred CHHHHHHH-HHHHHHHCCCCC T ss_conf 92689999-999997033135 No 56 >cd03799 GT1_amsK_like This is a family of GT1 glycosyltransferases found specifically in certain bacteria. amsK in Erwinia amylovora, has been reported to be involved in the biosynthesis of amylovoran, a exopolysaccharide acting as a virulence factor. Probab=61.38 E-value=13 Score=17.84 Aligned_cols=43 Identities=16% Similarity=0.163 Sum_probs=21.9 Q ss_pred HHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCEEEEEECCCCC Q ss_conf 456650257899967661035689997314557679975135446 Q gi|254780717|r 66 EAMMLENADIIFWLGAEMESFLVKPLHSLNKQSNVVTLSHSPDLH 110 (294) Q Consensus 66 d~~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~i~~~~~~~~~ 110 (294) -..++.+||.++.+.......+.+... ....++..+..+++.. T Consensus 124 l~~~~~~ad~ii~vS~~~~~~l~~~~~--~~~~ki~vi~ngvd~~ 166 (355) T cd03799 124 LDEKLARADFVVAISEYNRQQLIRLLG--CDPDKIHVVHCGVDLE 166 (355) T ss_pred HHHHHHHCCEEEECCHHHHHHHHHHCC--CCHHHEEEECCCCCHH T ss_conf 999998399999989999999998609--9914689989964888 No 57 >cd01974 Nitrogenase_MoFe_beta Nitrogenase_MoFe_beta: Nitrogenase MoFe protein, beta subunit. The nitrogenase enzyme catalyzes the ATP-dependent reduction of dinitrogen to ammonia. The Molybdenum (Mo-) nitrogenase is the most widespread and best characterized of these systems. Mo-nitrogenase consists of the MoFe protein (component 1) and the Fe protein (component 2). MoFe is an alpha2beta2 tetramer. This group contains the beta subunit of the MoFe protein. Each alphabeta pair of MoFe contains one P-cluster (at the alphabeta interface) and, one molecule of iron molybdenum cofactor (FeMoco) contained within the alpha subunit. The Fe protein contains a single [4Fe-4S] cluster. Electrons are transferred from the [4Fe-4S] cluster of the Fe protein to the P-cluster of the MoFe and in turn to FeMoCo, the site of substrate reduction. Probab=61.36 E-value=13 Score=17.84 Aligned_cols=142 Identities=13% Similarity=0.136 Sum_probs=74.5 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEC-CC----HHHHHHHHCCCC Q ss_conf 6555567899999999742186565889989999863122235899985030027814620-62----157789874981 Q gi|254780717|r 133 LNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKKIIVF-HE----AYRYFASHYNLS 207 (294) Q Consensus 133 l~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~~i~~-H~----a~~Yf~~~~gl~ 207 (294) +-+.+...+...|++.+.+--|+ .++....++...+.+.+.. +.++.++++ .+ ++.-|.++.|++ T Consensus 259 ~G~~~Td~flr~l~~~~g~~~~~---~i~~er~r~~d~~~d~~~~-------l~gkrvai~g~~~~~~~l~~~l~elG~~ 328 (435) T cd01974 259 IGVAATDEFLMALSELTGKPIPE---ELEEERGRLVDAMTDSHQY-------LHGKKFALYGDPDFLIGLTSFLLELGME 328 (435) T ss_pred CCHHHHHHHHHHHHHHHCCCCHH---HHHHHHHHHHHHHHHHHHH-------HCCCEEEEECCHHHHHHHHHHHHHCCCE T ss_conf 45699999999999985899469---9999999999999999887-------4296799988828999999999988997 Q ss_pred EECCC--CCCC----------------CC---CCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCEEEE-- Q ss_conf 20034--6775----------------44---4978899999999754993899818889789999999719908998-- Q gi|254780717|r 208 IVTFP--MSHS----------------VF---MGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMSAIL-- 264 (294) Q Consensus 208 ~~~~~--~~~~----------------~~---~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~~~l-- 264 (294) +..+. ..+. .. ....|+.++.+.+++.++..||..+ .-+.+++++|++.+.+ T Consensus 329 ~~~vv~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~~D~~~l~~~i~~~~~dliig~s-----~~~~iA~~~giP~~~ig~ 403 (435) T cd01974 329 PVHVLTGNGGKRFEKEMQALLDASPYGAGAKVYPGKDLWHLRSLLFTEPVDLLIGNT-----YGKYIARDTDIPLVRFGF 403 (435) T ss_pred EEEEEECCCCHHHHHHHHHHHHHCCCCCCCEEEECCCHHHHHHHHHHCCCCEEEECC-----HHHHHHHHHCCCEEEEEC T ss_conf 899997999778999999998625678886799589999999999755999999480-----789999980999799806 Q ss_pred ---CCCCCCC--CCCCCCHHHHHHHHHHHH Q ss_conf ---0533678--778167899999999999 Q gi|254780717|r 265 ---DPEGMLI--AEGPELYFQLMRSMSNSI 289 (294) Q Consensus 265 ---d~~~~~~--~~~~~~Y~~~m~~n~~~l 289 (294) |-.+..- .-|.++-..++...++++ T Consensus 404 P~~dr~~~~~~~~~GY~G~l~l~e~i~N~l 433 (435) T cd01974 404 PIFDRHHHHRFPIVGYEGALRLLTTILNTL 433 (435) T ss_pred CEEECCCCCCCCCEEHHHHHHHHHHHHHHH T ss_conf 803434765676031765999999999985 No 58 >PRK09495 glnH glutamine ABC transporter periplasmic protein; Reviewed Probab=61.02 E-value=9.6 Score=18.75 Aligned_cols=34 Identities=24% Similarity=0.246 Sum_probs=19.4 Q ss_pred CHHHHHHHHHHHHHHH---CCCCCCCEEEEEE--CHHHH Q ss_conf 9158999999999960---0132388059997--22899 Q gi|254780717|r 1 MKNFLIILIFLFFILS---SVARAGSLQVVVS--IKPIH 34 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~---~~~~~~~~~Vv~s--~~pl~ 34 (294) |||++.++++++.+++ ++++.++++|.++ .+|+. T Consensus 1 mk~~~~~~~~~~~~~~~~~~~a~~~~l~Vg~~~~~pPf~ 39 (247) T PRK09495 1 MKSVLKVSLAALTLAFAVSSHAADKELVVATDTAFVPFE 39 (247) T ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCEE T ss_conf 977999999999999987665149949999899879805 No 59 >cd03825 GT1_wcfI_like This family is most closely related to the GT1 family of glycosyltransferases. wcfI in Bacteroides fragilis has been shown to be involved in the capsular polysaccharide biosynthesis. Probab=60.04 E-value=14 Score=17.69 Aligned_cols=39 Identities=15% Similarity=0.203 Sum_probs=19.0 Q ss_pred HHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHCCCCEEEE Q ss_conf 9999999707944178870589983533568024566502578999 Q gi|254780717|r 33 IHSIVSCIMQGIGTPALLVKGASSPHEYSLRISEAMMLENADIIFW 78 (294) Q Consensus 33 l~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~ 78 (294) ...+++.+...-.+|+.+...... ........++|++-. T Consensus 19 ~~~L~~~L~~~Gh~v~v~~~~~~~-------~~~~~~~~~~dvvh~ 57 (365) T cd03825 19 AYRLHRALQAAGVDSTMLVQEKKA-------LISKIEIINADIVHL 57 (365) T ss_pred HHHHHHHHHHCCCEEEEEEECCCC-------HHHHHHHCCCCEEEE T ss_conf 999999999779908999926984-------777665528998998 No 60 >PRK09701 D-allose transporter subunit; Provisional Probab=59.78 E-value=14 Score=17.67 Aligned_cols=100 Identities=15% Similarity=0.115 Sum_probs=49.9 Q ss_pred CHHHHHHHHHHHHHHHC---CCCCCCEEEEEEC--HHHHHHHHHHHCC---C--CEEEEEECCCC-CCCCCCCCHHHHHH Q ss_conf 91589999999999600---1323880599972--2899999997079---4--41788705899-83533568024566 Q gi|254780717|r 1 MKNFLIILIFLFFILSS---VARAGSLQVVVSI--KPIHSIVSCIMQG---I--GTPALLVKGAS-SPHEYSLRISEAMM 69 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~---~~~~~~~~Vv~s~--~pl~~lv~~I~gd---~--~~V~~L~~~g~-dpH~y~~tp~d~~~ 69 (294) |||++.++..+++.++. .+...++-|++.. .|++.-++.=+.+ . ++|....++.. | ..+|+.. T Consensus 1 m~k~~~~~~~~~~~~~~~~~a~a~~~iavi~k~~~npf~~~~~~Ga~~aAk~~G~~v~v~~~~~~~D------~~~Qi~~ 74 (311) T PRK09701 1 MNKYLKYFSGTLVGLMLSTSAFAAAEYAVVLKTLSNPFWVDMKKGIEDEAKTLGVSVDIFASPSEGD------FQSQLQL 74 (311) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHHHHHCCEEEEEECCCCCC------HHHHHHH T ss_conf 9369999999999999888774368499996888998999999999999998699799992798789------9999999 Q ss_pred H-----CCCCEEEEECCCCCHHHHHHHHHC-CCCCCEEEEEECC Q ss_conf 5-----025789996766103568999731-4557679975135 Q gi|254780717|r 70 L-----ENADIIFWLGAEMESFLVKPLHSL-NKQSNVVTLSHSP 107 (294) Q Consensus 70 l-----~~Adlii~~G~~~E~~~~~~~~~~-~~~~~~i~~~~~~ 107 (294) | ++.|.|+..-.+-+.. ...++.. ....+++.+...+ T Consensus 75 Ie~~I~~gvdaIii~p~d~~a~-~~~i~~A~~aGIpVV~~D~~v 117 (311) T PRK09701 75 FEDLSNKNYKGIAFAPLSSVNL-VMPVARAWKKGIYLVNLDEKI 117 (311) T ss_pred HHHHHHCCCCEEEEECCCHHHH-HHHHHHHHHCCCCEEECCCCC T ss_conf 9999975999999918987788-999999997799189636766 No 61 >PRK01326 prsA foldase protein PrsA; Reviewed Probab=58.84 E-value=13 Score=17.98 Aligned_cols=28 Identities=21% Similarity=0.297 Sum_probs=15.4 Q ss_pred CHHHHHH---HHHHHHHHHCCCCCCCEE-EEE Q ss_conf 9158999---999999960013238805-999 Q gi|254780717|r 1 MKNFLII---LIFLFFILSSVARAGSLQ-VVV 28 (294) Q Consensus 1 Mkk~l~~---~~~~~~~l~~~~~~~~~~-Vv~ 28 (294) |||-+++ .++.++.|++|+....-. |++ T Consensus 1 MKKKl~~~~~~~~svl~LaaCs~~~~~~~vat 32 (310) T PRK01326 1 MKKKLIAGAVTLLSVATLAACSKTNDNTKVIS 32 (310) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEE T ss_conf 90889999999999999987169998806898 No 62 >PRK10449 heat-inducible protein; Provisional Probab=58.69 E-value=7.1 Score=19.56 Aligned_cols=24 Identities=21% Similarity=0.508 Sum_probs=16.7 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCEE Q ss_conf 9158999999999960013238805 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARAGSLQ 25 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~~~~~ 25 (294) |||+ +.++++.+++++|++.++.. T Consensus 1 MKk~-l~l~~~~llLagC~s~~~~~ 24 (140) T PRK10449 1 MKKV-VALVALSLLMAGCVSSGKIS 24 (140) T ss_pred CCHH-HHHHHHHHHHHHCCCCCCCC T ss_conf 9034-99999999997523899887 No 63 >pfam07901 DUF1672 Protein of unknown function (DUF1672). This family is composed of hypothetical bacterial proteins of unknown function. Probab=57.82 E-value=7.6 Score=19.38 Aligned_cols=21 Identities=14% Similarity=0.467 Sum_probs=18.5 Q ss_pred CHHHHHHHHHHHHHHHCCCCC Q ss_conf 915899999999996001323 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARA 21 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~ 21 (294) |||++..+++++++|++|+.- T Consensus 1 MkK~i~~il~~~llL~GCs~m 21 (304) T pfam07901 1 MRKLIKLILIATLLLSGCSTT 21 (304) T ss_pred CCHHHHHHHHHHHHHCCCCCC T ss_conf 901899999999997133445 No 64 >pfam10933 DUF2827 Protein of unknown function (DUF2827). This is a family of uncharacterized proteins found in Burkholderia. Probab=57.59 E-value=12 Score=18.02 Aligned_cols=73 Identities=15% Similarity=0.296 Sum_probs=41.6 Q ss_pred HHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCEEEEEECC----CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 02456650257899967661035689997314557679975135----44621001211123321224756776555567 Q gi|254780717|r 64 ISEAMMLENADIIFWLGAEMESFLVKPLHSLNKQSNVVTLSHSP----DLHRILLRDNHSHFHDSEADDLHLWLNPLNVQ 139 (294) Q Consensus 64 p~d~~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~i~~~~~~----~~~~~~~~~~~~~~~~~~~~dpH~Wl~p~n~~ 139 (294) ++|+ ..+-|++|-.|..+|+-+.+.++..+ ++++..--+. +.+........++.......| -+|++|.... T Consensus 67 ~~ea--~~~lDVlIEmg~~i~~~~~~~~r~~G--~KvV~~~~Gn~Yv~~iE~~iF~~~~~~~~~~~~~D-eIW~~Pq~~~ 141 (364) T pfam10933 67 FAEV--KDALDVLIEMGAQIDVEWTAYLKARG--GKIVSYCCGNDYVIAIERMMFGRPSGHLFIGTPYD-EVWTIPQFER 141 (364) T ss_pred HHHH--HHCCCEEEEECCCCCHHHHHHHHHCC--CEEEEEECCCHHHHHHHHHHCCCCCCCCCCCCCCC-EEEECCCHHH T ss_conf 9998--33789999916754879999999759--92899966856787634765079887656799865-1784500453 Q ss_pred HH Q ss_conf 89 Q gi|254780717|r 140 YI 141 (294) Q Consensus 140 ~~ 141 (294) .- T Consensus 142 ~~ 143 (364) T pfam10933 142 TC 143 (364) T ss_pred HC T ss_conf 32 No 65 >pfam02669 KdpC K+-transporting ATPase, c chain. This family consists of K+-transporting ATPase, c chain, KdpC. KdpC forms strong interactions with the KdpA subunit, serving to assemble and stabilize the Kdp complex. It has been suggested that KdpC could be one of the connecting links between the energy providing subunit KdpB and the K+-transporting subunit KdpA. The K+ transport system actively transports K+ ions via ATP hydrolysis. Probab=57.25 E-value=0.76 Score=25.73 Aligned_cols=92 Identities=14% Similarity=0.052 Sum_probs=42.7 Q ss_pred EEEECCCCCCCCCCCCHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCEEEEE---ECCCCCCCCCCCCCCCCCCC Q ss_conf 887058998353356802456650257899967661035689997314557679975---13544621001211123321 Q gi|254780717|r 48 ALLVKGASSPHEYSLRISEAMMLENADIIFWLGAEMESFLVKPLHSLNKQSNVVTLS---HSPDLHRILLRDNHSHFHDS 124 (294) Q Consensus 48 ~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~i~~~---~~~~~~~~~~~~~~~~~~~~ 124 (294) ..|-++=.+|..|.++|+-.-- ..=|..-.-|.++-+.=.++.+....+...+... ....++...... -. T Consensus 53 ~LIgQ~F~~~~yF~~RPSa~~~-~~y~~~~SggSNl~psnp~l~~~v~~r~~~~~~~~~~~~~~vP~DlVTa------Sg 125 (188) T pfam02669 53 ALIGQTFTEEGYFHSRPSAINY-SEYPTGASGGSNLAPSNPDLLSRIAARVEAQRLENLPAAVQVPVDLVTS------SG 125 (188) T ss_pred HHCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHCC------CC T ss_conf 3227778899876789866779-9988444666577999999999999999999987789888897888406------66 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 224756776555567899999999 Q gi|254780717|r 125 EADDLHLWLNPLNVQYIAHVIAME 148 (294) Q Consensus 125 ~~~dpH~Wl~p~n~~~~a~~I~~~ 148 (294) .+.|||+ ||.++..++..|+++ T Consensus 126 SGLDPhI--S~aaA~~Qv~RVA~a 147 (188) T pfam02669 126 SGLDPHI--SPAAAQAQFPRVAKA 147 (188) T ss_pred CCCCCCC--CHHHHHHHHHHHHHH T ss_conf 6689998--999999999999998 No 66 >pfam07273 DUF1439 Protein of unknown function (DUF1439). This family consists of several hypothetical bacterial proteins of around 190 residues in length. Several members of this family are annotated as being putative lipoproteins and are often known as YceB. The function of this family is unknown. Probab=57.22 E-value=11 Score=18.37 Aligned_cols=21 Identities=33% Similarity=0.698 Sum_probs=16.6 Q ss_pred CHHHHHHHHHHHHHHHCCCCC Q ss_conf 915899999999996001323 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARA 21 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~ 21 (294) ||+++++++++++++++|++- T Consensus 1 Mk~~~~~~~~l~~~LsgC~~l 21 (177) T pfam07273 1 MKKLLLLILLLVLLLSGCASL 21 (177) T ss_pred CCHHHHHHHHHHHHHHCCCCC T ss_conf 914899999999997155756 No 67 >cd00614 CGS_like CGS_like: Cystathionine gamma-synthase is a PLP dependent enzyme and catalyzes the committed step of methionine biosynthesis. This pathway is unique to microorganisms and plants, rendering the enzyme an attractive target for the development of antimicrobials and herbicides. This subgroup also includes cystathionine gamma-lyases (CGL), O-acetylhomoserine sulfhydrylases and O-acetylhomoserine thiol lyases. CGL's are very similar to CGS's. Members of this group are widely distributed among all three forms of life. Probab=56.60 E-value=16 Score=17.33 Aligned_cols=78 Identities=8% Similarity=0.099 Sum_probs=39.9 Q ss_pred HCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH---CCCCCCC-----EEEECCCHHHHHHHHCCC--CEECCCCCCCCCCC Q ss_conf 2186565889989999863122235899985---0300278-----146206215778987498--12003467754449 Q gi|254780717|r 151 KKDPRNKIIYEKNEEEFKNQLSQLDKELHSI---LQPVEKK-----KIIVFHEAYRYFASHYNL--SIVTFPMSHSVFMG 220 (294) Q Consensus 151 ~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~---~~~~~~~-----~~i~~H~a~~Yf~~~~gl--~~~~~~~~~~~~~s 220 (294) -+.|.++-....+.+.+.-++++-.+--... +...... +-+..|+-+..+.+.|+= -+.+++.. .+ T Consensus 220 ~~sp~~a~l~lrgl~TL~lRm~~~~~nA~~lA~~L~~hp~V~~V~yPgL~s~p~~~~~~~~~~g~ggl~sf~l~----~~ 295 (369) T cd00614 220 ILSPFDAWLLLRGLKTLPLRMERHSENALKVAEFLEKHPKVERVYYPGLPSHPQHELAKKQMSGYGGVFSFELK----GG 295 (369) T ss_pred CCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCHHHHHHHCCCCCEEEEEEEC----CC T ss_conf 68978889988601759999999999999999999719998879787898880489999768999717799976----98 Q ss_pred HHHHHHHHHHHH Q ss_conf 788999999997 Q gi|254780717|r 221 AASLRNIRSKII 232 (294) Q Consensus 221 ~~~l~~~~~~ik 232 (294) .....++.+.++ T Consensus 296 ~~~~~~f~~~l~ 307 (369) T cd00614 296 LEAAKKFLNALK 307 (369) T ss_pred HHHHHHHHHHCC T ss_conf 999999998589 No 68 >PRK11058 putative GTPase HflX; Provisional Probab=56.53 E-value=16 Score=17.32 Aligned_cols=75 Identities=15% Similarity=0.143 Sum_probs=42.7 Q ss_pred EECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCC--H---HHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCEEE Q ss_conf 972289999999707944178870589983533568--0---24566502578999676610356899973145576799 Q gi|254780717|r 28 VSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLR--I---SEAMMLENADIIFWLGAEMESFLVKPLHSLNKQSNVVT 102 (294) Q Consensus 28 ~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~t--p---~d~~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~i~ 102 (294) -+..=+..+++..+.+.+...+--...-||-.|==+ . +++.+-.+||++|.++ .+.|--.+.++..- +.++++ T Consensus 25 e~l~El~~L~~tag~evv~~~~q~r~~p~p~~~iG~GK~~El~~~~~~~~~d~vifd~-~Lsp~Q~rnLe~~~-~~~V~D 102 (426) T PRK11058 25 EDLQEFESLVSSAGVEALQVITGSRKAPHPKYFVGEGKAVEIAEAVKATGASVVLFDH-ALSPAQERNLERLC-ECRVID 102 (426) T ss_pred CCHHHHHHHHHHCCCEEEEEEEEECCCCCCCCEECHHHHHHHHHHHHHCCCCEEEECC-CCCHHHHHHHHHHH-CCEEEC T ss_conf 0299999999978997999999943899753102320999999999964989999999-89989999999986-992880 Q ss_pred EE Q ss_conf 75 Q gi|254780717|r 103 LS 104 (294) Q Consensus 103 ~~ 104 (294) -. T Consensus 103 Rt 104 (426) T PRK11058 103 RT 104 (426) T ss_pred CH T ss_conf 36 No 69 >PRK08043 bifunctional acyl-[acyl carrier protein] synthetase/2-acylglycerophosphoethanolamine acyltransferase; Validated Probab=56.18 E-value=16 Score=17.28 Aligned_cols=11 Identities=18% Similarity=0.537 Sum_probs=6.3 Q ss_pred CEEEEEECCCC Q ss_conf 41788705899 Q gi|254780717|r 45 GTPALLVKGAS 55 (294) Q Consensus 45 ~~V~~L~~~g~ 55 (294) .++.+|.|-|. T Consensus 97 G~~v~iFPEG~ 107 (719) T PRK08043 97 GRPVVIFPEGR 107 (719) T ss_pred CCEEEEECCCC T ss_conf 99899908975 No 70 >pfam08139 LPAM_1 Prokaryotic membrane lipoprotein lipid attachment site. In prokaryotes, membrane lipoproteins are synthesized with a precursor signal peptide, which is cleaved by a specific lipoprotein signal peptidase (signal peptidase II). The peptidase recognizes a conserved sequence and cuts upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached. Probab=55.90 E-value=8.5 Score=19.09 Aligned_cols=21 Identities=33% Similarity=0.636 Sum_probs=14.2 Q ss_pred CHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9158999999999960013238 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARAG 22 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~~ 22 (294) |||++.++++++. +++|++.. T Consensus 1 Mkk~~~l~~~~l~-LagCas~~ 21 (26) T pfam08139 1 MKKLLLLLLALLL-LAGCASXX 21 (26) T ss_pred CHHHHHHHHHHHH-HHCCCCCC T ss_conf 9669999999999-82332001 No 71 >pfam12262 Lipase_bact_N Bacterial virulence factor lipase N-terminal. This domain family is found in bacteria, and is typically between 258 and 271 amino acids in length. There are two conserved sequence motifs: DGT and DGWST. This family is the N-terminal region of bacterial virulence factor lipase. The N-terminal region contains a potential signalling sequence. Probab=55.75 E-value=13 Score=17.91 Aligned_cols=32 Identities=19% Similarity=0.324 Sum_probs=20.4 Q ss_pred CHHHHHHHHHH-HHHHHCCCCCCCEEEEEECHH Q ss_conf 91589999999-999600132388059997228 Q gi|254780717|r 1 MKNFLIILIFL-FFILSSVARAGSLQVVVSIKP 32 (294) Q Consensus 1 Mkk~l~~~~~~-~~~l~~~~~~~~~~Vv~s~~p 32 (294) |||.++.++++ .+.|++|..........+..| T Consensus 1 MKk~~l~~~i~SAL~LaGCg~~s~~~g~~~~~~ 33 (269) T pfam12262 1 MKKKFLALLLASALLLAGCGDDTETSGTPTIDP 33 (269) T ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC T ss_conf 924799999999864111579965667755464 No 72 >PRK10780 periplasmic chaperone; Provisional Probab=55.32 E-value=10 Score=18.58 Aligned_cols=33 Identities=18% Similarity=0.169 Sum_probs=17.1 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCEEEEEECHHH Q ss_conf 915899999999996001323880599972289 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARAGSLQVVVSIKPI 33 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~~~~~Vv~s~~pl 33 (294) |||+++.+++.+.+.++..++.++-||-+-..+ T Consensus 1 mKk~l~~~~l~l~l~~~a~Aa~KIgvVd~~~i~ 33 (161) T PRK10780 1 MKKWLLAAGLGLALATSAQAADKIAIVNMGSLF 33 (161) T ss_pred CHHHHHHHHHHHHHHHHHHHHHCEEEEEHHHHH T ss_conf 958999999999858888775354553299999 No 73 >PRK10598 hypothetical protein; Provisional Probab=54.43 E-value=12 Score=18.11 Aligned_cols=22 Identities=23% Similarity=0.362 Sum_probs=15.1 Q ss_pred CHHHHH-HHHHHHHHHHCCCCCC Q ss_conf 915899-9999999960013238 Q gi|254780717|r 1 MKNFLI-ILIFLFFILSSVARAG 22 (294) Q Consensus 1 Mkk~l~-~~~~~~~~l~~~~~~~ 22 (294) ||||++ .++++..++++|++-. T Consensus 1 mkk~~l~~~l~l~~llsGC~~l~ 23 (186) T PRK10598 1 MKKFLLAAALLVSGLLVGCNQLT 23 (186) T ss_pred CCHHHHHHHHHHHHHHHCCCCCC T ss_conf 91269999999999971435336 No 74 >PRK00031 lolA outer-membrane lipoprotein carrier protein; Reviewed Probab=53.90 E-value=9.5 Score=18.76 Aligned_cols=20 Identities=30% Similarity=0.358 Sum_probs=14.1 Q ss_pred CHHHHHHHHHHHHHHHCCCC Q ss_conf 91589999999999600132 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVAR 20 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~ 20 (294) |||+++++++++.++++.+. T Consensus 1 mk~~~~~~~~~~~l~~~~~~ 20 (201) T PRK00031 1 MKKLLIAALLAAALVASSAF 20 (201) T ss_pred CCHHHHHHHHHHHHHHHHHC T ss_conf 91599999999998776645 No 75 >PRK13280 N-glycosylase/DNA lyase; Provisional Probab=53.61 E-value=18 Score=17.02 Aligned_cols=106 Identities=12% Similarity=0.108 Sum_probs=58.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCC-CCCCEEEECCCHHHHHHHHCCCCEECCCCCCCCCCCHHH--HHHHHHH--HHHCC Q ss_conf 989999863122235899985030-027814620621577898749812003467754449788--9999999--97549 Q gi|254780717|r 161 EKNEEEFKNQLSQLDKELHSILQP-VEKKKIIVFHEAYRYFASHYNLSIVTFPMSHSVFMGAAS--LRNIRSK--IISDK 235 (294) Q Consensus 161 ~~N~~~~~~~L~~l~~~~~~~~~~-~~~~~~i~~H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~--l~~~~~~--ik~~~ 235 (294) .....-| ..|..+-..+...+.. ...+.+|-.---|+|.++.-+-+..+ .|..=|-|.| +..+... +-+.. T Consensus 121 ~~~l~~y-~~m~~l~~~La~~l~~~~~~KTiVFAvKM~gYa~r~~~~~~~p---~p~eIpIPvD~Ri~~~T~~sglv~~~ 196 (265) T PRK13280 121 LNPLKYY-EDLEELLEDLAKILGAKKESKTVVFAVKMFGYAYRAVFGEFRP---YPMEIPIPVDYRIAKLTLKSGLVEGP 196 (265) T ss_pred HHHHHHH-HHHHHHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHHHCCCCCC---CCCCCCCCCHHHHHHHHHHHHCCCCC T ss_conf 3334676-2399999999999689987563533699899999994677778---98679996309999999974103475 Q ss_pred CCEEEEECCCCHHHHHHHHHHHCCCEEEECCCCCC Q ss_conf 93899818889789999999719908998053367 Q gi|254780717|r 236 ISCLFYGPEFDSKIIRSITNDTGVMSAILDPEGML 270 (294) Q Consensus 236 v~~if~e~~~~~~~~~~ia~e~~~~~~~ld~~~~~ 270 (294) ...+........+....+++++|++...||++-|. T Consensus 197 ~~~~~~~~e~~~~~W~~Va~~sgIPpLhiDsilW~ 231 (265) T PRK13280 197 PEEAMRTNEEPQEFWNKVARESGIPPLHIDSILWL 231 (265) T ss_pred HHHHHCCCCHHHHHHHHHHHHCCCCCCCCCHHHHH T ss_conf 55540353248999999988629996201014532 No 76 >COG1535 EntB Isochorismate hydrolase [Secondary metabolites biosynthesis, transport, and catabolism] Probab=53.53 E-value=16 Score=17.36 Aligned_cols=50 Identities=16% Similarity=0.274 Sum_probs=27.5 Q ss_pred CCCEEEECCCHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEE Q ss_conf 27814620621577898749812003467754449788999999997549938998 Q gi|254780717|r 186 EKKKIIVFHEAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISDKISCLFY 241 (294) Q Consensus 186 ~~~~~i~~H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~ 241 (294) .+|.++.-||--.||.+.|+=. .+..+--...+.++....++++|+++++ T Consensus 28 p~RavLLIhDMQ~YFv~~~~~~------~~~~~~li~Ni~~Lr~~~~~~giPVvyT 77 (218) T COG1535 28 PKRAVLLIHDMQNYFVSPWGEN------CPLMEQLIANIAKLRIWCKQAGIPVVYT 77 (218) T ss_pred CCCCEEEEEHHHHHHCCCCCCC------CCCHHHHHHHHHHHHHHHHHCCCCEEEE T ss_conf 6633122326578524878877------8648999999999999999829967998 No 77 >TIGR00670 asp_carb_tr aspartate carbamoyltransferase; InterPro: IPR002082 Aspartate carbamoyltransferase (ATCase) catalyses the formation of carbamoyl-aspartate in the pyrimidine biosynthesis pathway, by the association of aspartate and carbamoyl-phosphate. This is the commitment step in the Escherichia coli pathway and is regulated by feedback inhibition by CTP, the final product of the pathway. The structural organisation of the ATCase protein varies considerably between different organisms. In bacteria such as E. coli, Salmonella typhimurium and Serratia marcescens, the ATCase is a dodecamer of 2 catalytic (c) trimers and 3 regulatory (r) dimers. The catalytic domains are coded for by the pyrB gene, and the regulatory domains by pyrI. In Gram-positive bacteria such as Bacillus subtilis, ATCase exists as a trimer of catalytic subunits, but unlike in E. coli, it neither contains nor binds to regulatory subunits. In eukaryotes, ATCase is found as a single domain in a multifunctional enzyme that contains activity for glutamine amidotransferase, carbamoylphosphate synthetase, dihydroorotase, and aspartate carbamoyltransferase. ; GO: 0004070 aspartate carbamoyltransferase activity, 0006207 'de novo' pyrimidine base biosynthetic process. Probab=53.14 E-value=18 Score=16.97 Aligned_cols=73 Identities=10% Similarity=0.069 Sum_probs=39.5 Q ss_pred HHHHHHHHHC--CCCEEE-----EEECCCCCCCCCCCCHHHHHHHCCCCEEEEECCCCC---HHHHHHHHHCC----CCC Q ss_conf 9999999707--944178-----870589983533568024566502578999676610---35689997314----557 Q gi|254780717|r 33 IHSIVSCIMQ--GIGTPA-----LLVKGASSPHEYSLRISEAMMLENADIIFWLGAEME---SFLVKPLHSLN----KQS 98 (294) Q Consensus 33 l~~lv~~I~g--d~~~V~-----~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~G~~~E---~~~~~~~~~~~----~~~ 98 (294) ...-.+.+|| +.+.+. ++.. |. ++-=|-+-+...+++|+||.=. .+| ..+++++.... -+. T Consensus 60 FE~A~KRLGgPn~v~~~~~~~~sS~~K-GE---tL~DTi~~~~~y~D~D~iViRH-~~~GvCegaar~~ae~~~R~~~~v 134 (336) T TIGR00670 60 FETAMKRLGGPNDVVNFSDSETSSVAK-GE---TLADTIKTLSAYGDSDAIVIRH-PLEGVCEGAARLAAEVSDRLGIEV 134 (336) T ss_pred HHHHHHHHCCCCCEEECCCCCCCCHHC-CC---CHHHHHHHHHHCCCCCEEEEEC-CCCCHHHHHHHHHHHHHHHCCCCC T ss_conf 889888727985456225563411210-54---4788999885404755699835-865413468899988631104578 Q ss_pred C-EEEEEECCCCC Q ss_conf 6-79975135446 Q gi|254780717|r 99 N-VVTLSHSPDLH 110 (294) Q Consensus 99 ~-~i~~~~~~~~~ 110 (294) + ++++++|.+-| T Consensus 135 PsviNAGDG~~qH 147 (336) T TIGR00670 135 PSVINAGDGSGQH 147 (336) T ss_pred CEEEECCCCCCCC T ss_conf 7166368787879 No 78 >PRK03629 tolB translocation protein TolB; Provisional Probab=52.94 E-value=15 Score=17.53 Aligned_cols=20 Identities=25% Similarity=0.308 Sum_probs=14.5 Q ss_pred CHHHHHHHHHHHHHHHCCCC Q ss_conf 91589999999999600132 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVAR 20 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~ 20 (294) ||+++.+++++++++++.+. T Consensus 1 mk~~~~~~f~~lll~~~~a~ 20 (430) T PRK03629 1 MKQALRVAFGFLILWASVLH 20 (430) T ss_pred CCHHHHHHHHHHHHHHHHHC T ss_conf 90148889989998754422 No 79 >PRK09967 putative outer membrane lipoprotein; Provisional Probab=52.69 E-value=17 Score=17.12 Aligned_cols=21 Identities=29% Similarity=0.477 Sum_probs=15.2 Q ss_pred CHHHHHHHHHHHHHHHCCCCC Q ss_conf 915899999999996001323 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARA 21 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~ 21 (294) ||+++..++++.+++++|++. T Consensus 2 ~k~~~~~~l~~~l~lsgC~s~ 22 (160) T PRK09967 2 IKHLVAPLIFTSLILTGCQSP 22 (160) T ss_pred HHHHHHHHHHHHHHHHHCCCC T ss_conf 078999999999999845899 No 80 >cd03794 GT1_wbuB_like This family is most closely related to the GT1 family of glycosyltransferases. wbuB in E. coli is involved in the biosynthesis of the O26 O-antigen. It has been proposed to function as an N-acetyl-L-fucosamine (L-FucNAc) transferase. Probab=52.32 E-value=19 Score=16.89 Aligned_cols=13 Identities=15% Similarity=0.179 Sum_probs=8.0 Q ss_pred HCCCCEEEEECCC Q ss_conf 5025789996766 Q gi|254780717|r 70 LENADIIFWLGAE 82 (294) Q Consensus 70 l~~Adlii~~G~~ 82 (294) ..+.|+++..+.. T Consensus 97 ~~~~Div~~~~~~ 109 (394) T cd03794 97 RRRPDVIIATSPP 109 (394) T ss_pred CCCCCEEEECCCC T ss_conf 5899889991784 No 81 >PRK09861 cytoplasmic membrane lipoprotein-28; Provisional Probab=52.13 E-value=19 Score=16.87 Aligned_cols=141 Identities=12% Similarity=0.044 Sum_probs=65.6 Q ss_pred HHHHHHHHHHHHCCCCC----CCEEEEEECHHHHHHHHHHHC---CC--CEEEEEECCCCCCCCCCCCHHHHHHHCCCCE Q ss_conf 99999999996001323----880599972289999999707---94--4178870589983533568024566502578 Q gi|254780717|r 5 LIILIFLFFILSSVARA----GSLQVVVSIKPIHSIVSCIMQ---GI--GTPALLVKGASSPHEYSLRISEAMMLENADI 75 (294) Q Consensus 5 l~~~~~~~~~l~~~~~~----~~~~Vv~s~~pl~~lv~~I~g---d~--~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adl 75 (294) ...++++.+++++|+++ ..++|.+|-.|.+++.+.+.. ++ .+|+...=.. | -.|.....=.+-|+ T Consensus 11 ~a~ll~~~~~l~~C~~~~~~~k~ikVG~~~~p~~~i~e~~~~~~~ek~G~~leiv~FsD-----y-~~PN~AL~~G~iDa 84 (272) T PRK09861 11 GAALLLAGILLAGCDQSSSDAKHIKVGVINGAEQDVAEVAKKVAKEKYGLDVELVGFSG-----S-LLPNDATNHGELDA 84 (272) T ss_pred HHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHCCCEEEEEEECC-----C-CCHHHHHHCCCCCC T ss_conf 99999999998745897678973899967898699999999888761797689999468-----6-22628976798360 Q ss_pred EEEECCCCCHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCC-CCCCCCHHHHHHHHHHHHHHHHHHHH-HHHCC Q ss_conf 99967661035689997314557679975135446210012111233-21224756776555567899999999-74218 Q gi|254780717|r 76 IFWLGAEMESFLVKPLHSLNKQSNVVTLSHSPDLHRILLRDNHSHFH-DSEADDLHLWLNPLNVQYIAHVIAME-LIKKD 153 (294) Q Consensus 76 ii~~G~~~E~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~-~~~~~dpH~Wl~p~n~~~~a~~I~~~-L~~~d 153 (294) =+ -...+||+..-+..+ .+.+.+....-.+-......+..-. -.++--=-+=-||.|..+...-+.++ |+++. T Consensus 85 N~---fQH~pyL~~~n~~~g--~~L~~v~~~~~~P~glYS~K~ksl~dlp~Ga~IaIPnDpsN~~RAL~lL~~aGLI~Lk 159 (272) T PRK09861 85 NV---FQHRPFLEQDNQAHG--YKLVAVGNTFVFPMAGYSKKIKTVAQIKEGATVAIPNDPTNLGRALLLLQKEKLITLK 159 (272) T ss_pred HH---HHHHHHHHHHHHHCC--CCEEEEEEEEEEEEEECCCCCCCHHHCCCCCEEEECCCHHHHHHHHHHHHHCCCEEEC T ss_conf 24---557999999999869--9579973167830140344659875847999898047812699999999988978977 Q ss_pred CCC Q ss_conf 656 Q gi|254780717|r 154 PRN 156 (294) Q Consensus 154 P~~ 156 (294) |.. T Consensus 160 ~~~ 162 (272) T PRK09861 160 EGK 162 (272) T ss_pred CCC T ss_conf 999 No 82 >KOG4175 consensus Probab=51.64 E-value=19 Score=16.82 Aligned_cols=47 Identities=11% Similarity=0.220 Sum_probs=18.6 Q ss_pred HHHHHHHHHHHCCCCEE-EEECCCCHHHHHHHHHHHCCCEEEECCCCC Q ss_conf 89999999975499389-981888978999999971990899805336 Q gi|254780717|r 223 SLRNIRSKIISDKISCL-FYGPEFDSKIIRSITNDTGVMSAILDPEGM 269 (294) Q Consensus 223 ~l~~~~~~ik~~~v~~i-f~e~~~~~~~~~~ia~e~~~~~~~ld~~~~ 269 (294) .-..+.+.++++++..+ ++-|....+.++.|+.-..-.+.....||. T Consensus 136 Ea~~~Rne~~k~gislvpLvaPsTtdeRmell~~~adsFiYvVSrmG~ 183 (268) T KOG4175 136 EAETLRNEARKHGISLVPLVAPSTTDERMELLVEAADSFIYVVSRMGV 183 (268) T ss_pred HHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHHCCEEEEEEECCC T ss_conf 989999999864924897417998089999999863255999871456 No 83 >pfam00121 TIM Triosephosphate isomerase. Probab=50.82 E-value=20 Score=16.74 Aligned_cols=63 Identities=10% Similarity=0.245 Sum_probs=45.6 Q ss_pred CCEEEECCCHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHC----------CCCEEEEECCCCHHHHHHHHHH Q ss_conf 781462062157789874981200346775444978899999999754----------9938998188897899999997 Q gi|254780717|r 187 KKKIIVFHEAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISD----------KISCLFYGPEFDSKIIRSITND 256 (294) Q Consensus 187 ~~~~i~~H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~----------~v~~if~e~~~~~~~~~~ia~e 256 (294) ++-++.|+|.|. +.++..|++.++.++.+.||+. +++. ++.-..++..++.|... T Consensus 157 ~~iiIAYEPvWA--------------IGtG~~as~~~i~~~~~~Ir~~l~~~~~~~~~~i~i-LYGGSVn~~N~~~i~~~ 221 (243) T pfam00121 157 KNLIIAYEPVWA--------------IGTGKVATPEQAQEVHAFIRKYLAELSKEVAEEVRI-LYGGSVNPDNAKELLAQ 221 (243) T ss_pred CCEEEEECCHHH--------------CCCCCCCCHHHHHHHHHHHHHHHHHHCHHHCCCCCE-EEECCCCHHHHHHHHCC T ss_conf 563999678222--------------489999999999999999999999845423166428-97176899899999668 Q ss_pred HCCCEEEE Q ss_conf 19908998 Q gi|254780717|r 257 TGVMSAIL 264 (294) Q Consensus 257 ~~~~~~~l 264 (294) -++..+.+ T Consensus 222 ~~vdG~LV 229 (243) T pfam00121 222 PDIDGFLV 229 (243) T ss_pred CCCCEEEE T ss_conf 89987984 No 84 >pfam01053 Cys_Met_Meta_PP Cys/Met metabolism PLP-dependent enzyme. This family includes enzymes involved in cysteine and methionine metabolism. The following are members: Cystathionine gamma-lyase, Cystathionine gamma-synthase, Cystathionine beta-lyase, Methionine gamma-lyase, OAH/OAS sulfhydrylase, O-succinylhomoserine sulfhydrylase All of these members participate is slightly different reactions. All these enzymes use PLP (pyridoxal-5'-phosphate) as a cofactor. Probab=50.77 E-value=20 Score=16.74 Aligned_cols=77 Identities=6% Similarity=0.087 Sum_probs=36.1 Q ss_pred HCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH---CCCCCC---C--EEEECCCHHHHHHHHCCC--CEECCCCCCCCCCC Q ss_conf 2186565889989999863122235899985---030027---8--146206215778987498--12003467754449 Q gi|254780717|r 151 KKDPRNKIIYEKNEEEFKNQLSQLDKELHSI---LQPVEK---K--KIIVFHEAYRYFASHYNL--SIVTFPMSHSVFMG 220 (294) Q Consensus 151 ~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~---~~~~~~---~--~~i~~H~a~~Yf~~~~gl--~~~~~~~~~~~~~s 220 (294) -+.|..+-....+.+.+.-++++..+--... +..... . +-+..|+.+..+.|.++= -+.+++...+ T Consensus 233 ~~~p~~a~l~~rgl~TL~lR~~~~~~nA~~lA~~L~~hp~V~~V~yPgL~s~p~~~~~~r~~~g~ggl~sf~l~~~---- 308 (381) T pfam01053 233 VLSPFDAWLLLRGLKTLGLRMERHQENALKLAEFLEEHPKVEKVYYPGLPSHPGHELAKKQMSGFGGMLSFELKGG---- 308 (381) T ss_pred CCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCHHHHHHHCCCCCCEEEEEECCH---- T ss_conf 7886888998714787999999988899999999972999777957589998049999975899997318986789---- Q ss_pred HHHHHHHHHHHH Q ss_conf 788999999997 Q gi|254780717|r 221 AASLRNIRSKII 232 (294) Q Consensus 221 ~~~l~~~~~~ik 232 (294) ....++.+.++ T Consensus 309 -~~~~~f~d~l~ 319 (381) T pfam01053 309 -EAASKFLDNLK 319 (381) T ss_pred -HHHHHHHHHCC T ss_conf -99999998477 No 85 >TIGR03525 GldK gliding motility-associated lipoprotein GldK. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. There is a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=50.58 E-value=18 Score=17.07 Aligned_cols=19 Identities=21% Similarity=0.473 Sum_probs=11.7 Q ss_pred CHHHHHHHHHHHHHHHCCCC Q ss_conf 91589999999999600132 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVAR 20 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~ 20 (294) |||+++++++ ++++++|++ T Consensus 1 Mkk~~~~~~~-~~~~~SCg~ 19 (449) T TIGR03525 1 MKKYLVFAAL-VVLVYSCGS 19 (449) T ss_pred CCHHHHHHHH-HHHHHHCCC T ss_conf 9124899999-998721058 No 86 >PRK11411 fecB iron-dicitrate transporter substrate-binding subunit; Provisional Probab=49.63 E-value=21 Score=16.63 Aligned_cols=136 Identities=10% Similarity=0.026 Sum_probs=62.2 Q ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH----CCCCCCCEEE--ECCCHHHHHHHHCCCCEE-C-CCC Q ss_conf 9999999742186565889989999863122235899985----0300278146--206215778987498120-0-346 Q gi|254780717|r 142 AHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHSI----LQPVEKKKII--VFHEAYRYFASHYNLSIV-T-FPM 213 (294) Q Consensus 142 a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~----~~~~~~~~~i--~~H~a~~Yf~~~~gl~~~-~-~~~ 213 (294) +..|.+.+.+ ++.-++-.+.+.+++.++.+++.+. +.-.....+. ..+.-++.+.+..|++.. . ... T Consensus 142 ~~~lG~~~gk-----e~~a~~~i~~~~~~l~~~~~k~~~~~~~~~~~~~~~~~~v~~~~s~~g~il~~lG~~~~~~~~~~ 216 (297) T PRK11411 142 AAIIGEALGK-----KREMQARIAQHKQRMAQFASQLPKGTRVAFGVSREQQFNLHSPESYAGSVLAALGLQVPIAAMNG 216 (297) T ss_pred HHHHHHHHCC-----HHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCEEEECCCCHHHHHHHHCCCCCCCCCCCC T ss_conf 9999998690-----99999999999999999998616687089999808946996798417999998399878765578 Q ss_pred CCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHH-----HHHC----CCEEEECCCCCCCCCCCCCHHHHHHH Q ss_conf 77544497889999999975499389981888978999999-----9719----90899805336787781678999999 Q gi|254780717|r 214 SHSVFMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSIT-----NDTG----VMSAILDPEGMLIAEGPELYFQLMRS 284 (294) Q Consensus 214 ~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia-----~e~~----~~~~~ld~~~~~~~~~~~~Y~~~m~~ 284 (294) .+..+.|. +.+.+.+..+||+-....+...+.+. +.+. -++..+|+.-|....|+.+...|+.. T Consensus 217 ~~~~~iS~-------E~l~~~dpD~I~v~~~~~~~~~~~~~~~p~w~~l~AVKn~~V~~vd~~~w~~~~G~~aa~~~ldd 289 (297) T PRK11411 217 APMPSISL-------EQLLALNPDWLLVAHYREESIVKRWQQDPLWQMLTAAQKQQVAEVDSNTWARMRGIFAAERIAAD 289 (297) T ss_pred CCCCCCCH-------HHHHCCCCCEEEEEECCCHHHHHHHHCCHHHHCCCEEECCCEEEECHHHCCCCCCHHHHHHHHHH T ss_conf 87641079-------99840499989999478726888775390554098434794899882325306579999999999 Q ss_pred HHHHH Q ss_conf 99999 Q gi|254780717|r 285 MSNSI 289 (294) Q Consensus 285 n~~~l 289 (294) ...-| T Consensus 290 l~~~~ 294 (297) T PRK11411 290 TVKIF 294 (297) T ss_pred HHHHH T ss_conf 99987 No 87 >COG3275 LytS Putative regulator of cell autolysis [Signal transduction mechanisms] Probab=49.18 E-value=21 Score=16.58 Aligned_cols=37 Identities=22% Similarity=0.354 Sum_probs=23.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCH--------HHHHHHHHHH Q ss_conf 1224756776555567899999999742186565--------8899899998 Q gi|254780717|r 124 SEADDLHLWLNPLNVQYIAHVIAMELIKKDPRNK--------IIYEKNEEEF 167 (294) Q Consensus 124 ~~~~dpH~Wl~p~n~~~~a~~I~~~L~~~dP~~~--------~~y~~N~~~~ 167 (294) +-..|||+-++--|.+.+ +++.+|+.+ .+|+.|.+.- T Consensus 368 qaQvnPHFLFNaLNTIsa-------~IR~npdkAreLil~LS~yfR~NL~~~ 412 (557) T COG3275 368 QAQVNPHFLFNALNTISA-------VIRRNPDKARELILYLSTYFRYNLENN 412 (557) T ss_pred HHCCCHHHHHHHHHHHHH-------HHCCCHHHHHHHHHHHHHHHHHHHCCC T ss_conf 860370888888887788-------861782799999999999999873478 No 88 >PRK13479 2-aminoethylphosphonate--pyruvate transaminase; Provisional Probab=49.02 E-value=21 Score=16.57 Aligned_cols=51 Identities=10% Similarity=0.142 Sum_probs=26.9 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 89999999975499389981888978999999971990899805336787781678999999999998741 Q gi|254780717|r 223 SLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMSAILDPEGMLIAEGPELYFQLMRSMSNSIAKNC 293 (294) Q Consensus 223 ~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~~~ld~~~~~~~~~~~~Y~~~m~~n~~~l~k~l 293 (294) +..++.+.++++++- ++. -+.. ..++-.+-.||... .+.+...+..|.++| T Consensus 311 d~~~~~~~l~~~g~~-i~~-G~~~-----------~~~~fRIGhMG~~~-------~~di~~~l~ale~~L 361 (368) T PRK13479 311 DFKEFYERLKEQGFV-IYP-GKLT-----------QVDTFRIGCIGAVD-------AADIRRLVAAIAEAL 361 (368) T ss_pred HHHHHHHHHHHCCCE-EEC-CCCC-----------CCCEEEEECCCCCC-------HHHHHHHHHHHHHHH T ss_conf 599999999977938-968-8557-----------78979887799999-------999999999999999 No 89 >COG1927 Mtd Coenzyme F420-dependent N(5),N(10)-methenyltetrahydromethanopterin dehydrogenase [Energy production and conversion] Probab=48.51 E-value=22 Score=16.52 Aligned_cols=196 Identities=10% Similarity=0.063 Sum_probs=94.4 Q ss_pred CHHHHHHHHHHHCCC--CEEEEEEC-CCCCCCCCCCCHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCC--CCCCEEEEE Q ss_conf 228999999970794--41788705-8998353356802456650257899967661035689997314--557679975 Q gi|254780717|r 30 IKPIHSIVSCIMQGI--GTPALLVK-GASSPHEYSLRISEAMMLENADIIFWLGAEMESFLVKPLHSLN--KQSNVVTLS 104 (294) Q Consensus 30 ~~pl~~lv~~I~gd~--~~V~~L~~-~g~dpH~y~~tp~d~~~l~~Adlii~~G~~~E~~~~~~~~~~~--~~~~~i~~~ 104 (294) +.|+-++...--.|+ .+|+++-. .-.+|...+--..++..--+.|++|++|.+--..-.+.....- .....+.++ T Consensus 15 ts~v~dlllDErAdRedi~vrVvgsgaKM~Pe~veaav~~~~e~~~pDfvi~isPNpaaPGP~kARE~l~~s~~Paiiig 94 (277) T COG1927 15 TSPVVDLLLDERADREDIEVRVVGSGAKMDPECVEAAVTEMLEEFNPDFVIYISPNPAAPGPKKAREILSDSDVPAIIIG 94 (277) T ss_pred HHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHCCCCEEEEC T ss_conf 08999998775414579659996166655868888877999985189889995799899994689998743599779966 Q ss_pred ECCCCCCCCCCCCCCCCCCCCCCCHHH-----HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 135446210012111233212247567-----765555678999999997421865658899899998631222358999 Q gi|254780717|r 105 HSPDLHRILLRDNHSHFHDSEADDLHL-----WLNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDKELH 179 (294) Q Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~dpH~-----Wl~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~ 179 (294) +.+++..-..-+..+.+..--..||-+ +|||.-|..+-..+-+.|.-- - -++-.+..++++....+ T Consensus 95 DaPg~~vkdeleeqGlGYIivk~DpmiGArREFLDPvEMA~fNaDv~kVLa~t-----G----a~R~vQeaiD~~ie~vk 165 (277) T COG1927 95 DAPGLKVKDELEEQGLGYIIVKADPMIGARREFLDPVEMASFNADVMKVLAAT-----G----AFRLVQEAIDKVIEDVK 165 (277) T ss_pred CCCCCHHHHHHHHCCCEEEEECCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHC-----C----HHHHHHHHHHHHHHHHH T ss_conf 88631367899865870799627754432343048898875006899999840-----0----79999999999999985 Q ss_pred HHCCCCCCCEEEECCCHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 85030027814620621577898749812003467754449788999999997549938998188 Q gi|254780717|r 180 SILQPVEKKKIIVFHEAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISDKISCLFYGPE 244 (294) Q Consensus 180 ~~~~~~~~~~~i~~H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~ 244 (294) .--.+---+.++...-+ .++-|+ .+|=....+---..+.+++.+-+++--|.+.. T Consensus 166 ~gk~~eLP~ivi~~~k~----vE~~~f------~NPYAkaKAmAA~~iAekVadvdvkgCFm~kd 220 (277) T COG1927 166 EGKEPELPKIVIDPEKA----VEAAGF------ENPYAKAKAMAALEIAEKVADVDVKGCFMEKD 220 (277) T ss_pred CCCCCCCCCEEECHHHH----HHHHCC------CCHHHHHHHHHHHHHHHHHHCCCCCCEEEECC T ss_conf 57876787267547898----776235------78688999999999998764277641355316 No 90 >TIGR03088 stp2 sugar transferase, PEP-CTERM/EpsH1 system associated. Members of this family include a match to the pfam00534 Glycosyl transferases group 1 domain. Nearly all are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria. In particular, these transferases are found proximal to a particular variant of exosortase, EpsH1, which appears to travel with a conserved group of genes summarized by Genome Property GenProp0652. The nature of the sugar transferase reaction catalyzed by members of this clade is unknown and may conceivably be variable with respect to substrate by species, but we hypothesize a conserved substrate. Probab=48.23 E-value=22 Score=16.49 Aligned_cols=53 Identities=13% Similarity=-0.030 Sum_probs=22.3 Q ss_pred EEEEEECHHHHHHHHHHHCCCCEEEEEEC-CCCCCCCCCCCHHHHH--HHCCCCEEEEEC Q ss_conf 05999722899999997079441788705-8998353356802456--650257899967 Q gi|254780717|r 24 LQVVVSIKPIHSIVSCIMQGIGTPALLVK-GASSPHEYSLRISEAM--MLENADIIFWLG 80 (294) Q Consensus 24 ~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~-~g~dpH~y~~tp~d~~--~l~~Adlii~~G 80 (294) ..|+ +......+.+.+-...+++..+-. ++.+ +..-++-.+ +=.+-|+|-..+ T Consensus 34 ~~vi-~~~~~~~~~~~~~~~~v~~~~l~~~~~~~---~~~~~~l~~li~~~kpDiIh~~~ 89 (374) T TIGR03088 34 HAVV-ALTEVSAFRKRIQRPDVAFYALHKQPGKD---VAVYPQLYRLLRQLRPDIVHTRN 89 (374) T ss_pred EEEE-EECCCCHHHHHHHHCCCEEEECCCCCCCC---HHHHHHHHHHHHHHCCCEEEECC T ss_conf 9999-97898457899986898899907887647---99999999999983984898636 No 91 >PRK02627 acetylornithine aminotransferase; Provisional Probab=48.22 E-value=22 Score=16.49 Aligned_cols=117 Identities=14% Similarity=0.288 Sum_probs=52.8 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCHHHHHHHHCC-CCEE-------CCC-C Q ss_conf 999999742186565889989999863122235899985030027814620621577898749-8120-------034-6 Q gi|254780717|r 143 HVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKKIIVFHEAYRYFASHYN-LSIV-------TFP-M 213 (294) Q Consensus 143 ~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~~i~~H~a~~Yf~~~~g-l~~~-------~~~-~ 213 (294) ..+++.|.++.|-.+-+|-+....-.+---++...+.......++..++++..+|-= +.+| +.+. .+. . T Consensus 84 ~~lae~L~~~~~~~~v~f~~sGseAvE~AiKlAr~y~~~~~~~~r~~ii~~~~syHG--~T~ga~s~tg~~~~~~~~~p~ 161 (398) T PRK02627 84 EELAEKLVELSGMDKVFFCNSGAEANEAAIKLARKYGHTKGGIEKPEIITAENSFHG--RTLATLSATGQPKYQKGFEPL 161 (398) T ss_pred HHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCC--CCCCCCCCCCCCCCCCCCCCC T ss_conf 999999975249987998287799999999999999886478888679995378699--975100137882223676788 Q ss_pred CCC-CCCCHHHHHHHHHHHHHCCCCEEEEECCC--------CH---HHHHHHHHHHCCCE Q ss_conf 775-44497889999999975499389981888--------97---89999999719908 Q gi|254780717|r 214 SHS-VFMGAASLRNIRSKIISDKISCLFYGPEF--------DS---KIIRSITNDTGVMS 261 (294) Q Consensus 214 ~~~-~~~s~~~l~~~~~~ik~~~v~~if~e~~~--------~~---~~~~~ia~e~~~~~ 261 (294) .++ ......++..+.+.+..+++.+++.||-. ++ +.++.++++.|+-. T Consensus 162 ~~~~~~~p~~d~~~l~~~~~~~~iAAvi~EPiqg~gG~~~~~~~yl~~lr~lc~~~gill 221 (398) T PRK02627 162 VPGFIYVPFNDIEALEAAINNPRTAAVLLEPIQGEGGVNPADKEYLQALRELCDENNILL 221 (398) T ss_pred CCCEEEECCCHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCCCE T ss_conf 888056247729999998458987999982788888986588889999999986507407 No 92 >PHA01735 hypothetical protein Probab=47.82 E-value=22 Score=16.45 Aligned_cols=37 Identities=19% Similarity=0.009 Sum_probs=29.1 Q ss_pred CCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHH Q ss_conf 5444978899999999754993899818889789999 Q gi|254780717|r 216 SVFMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRS 252 (294) Q Consensus 216 ~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ 252 (294) ..+.++++|....+.+|+++|.++-+|.+.=.++++. T Consensus 27 sGeAttaDL~aA~~~LKdN~I~gva~egsPL~~La~~ 63 (76) T PHA01735 27 SGEATTADLRAACDWLKSNDITGVAVDGSPLAKLAGL 63 (76) T ss_pred CCCCCHHHHHHHHHHHHHCCCCEEECCCCHHHHHHHH T ss_conf 7888799999999999987966236799869999875 No 93 >COG1022 FAA1 Long-chain acyl-CoA synthetases (AMP-forming) [Lipid metabolism] Probab=47.73 E-value=22 Score=16.44 Aligned_cols=81 Identities=7% Similarity=0.031 Sum_probs=41.6 Q ss_pred CCCHHHHHHHHH-HHHHHHHHHHHHHHCCCCCHHHHHHHHH----HHHHHHHHH-------------HHHHHHHCCCCCC Q ss_conf 247567765555-6789999999974218656588998999----986312223-------------5899985030027 Q gi|254780717|r 126 ADDLHLWLNPLN-VQYIAHVIAMELIKKDPRNKIIYEKNEE----EFKNQLSQL-------------DKELHSILQPVEK 187 (294) Q Consensus 126 ~~dpH~Wl~p~n-~~~~a~~I~~~L~~~dP~~~~~y~~N~~----~~~~~L~~l-------------~~~~~~~~~~~~~ 187 (294) ...||++..+.. +-.+-+.|.+++.+.-+..+..|+-=+. .+...+... -.++++.+. .+ T Consensus 277 ~~rPt~m~~VPRvwE~i~~~I~~kv~~~~~~~~~if~~a~~~~~~~~~~~~~~~~~~~~~~~~d~LVf~Kir~~lG--gr 354 (613) T COG1022 277 EVRPTVMIGVPRVWEKVYKGIMEKVAKAPAVRRKLFRWALKVAYKKISRALLGGGPLSWLLVADRLVFRKIRDALG--GR 354 (613) T ss_pred HHCCEEEEECHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHC--CC T ss_conf 7299399632589999999999998746166899999999988999886402557762023345889999999717--84 Q ss_pred CEE-EEC----CCHHHHHHHHCCCCE Q ss_conf 814-620----621577898749812 Q gi|254780717|r 188 KKI-IVF----HEAYRYFASHYNLSI 208 (294) Q Consensus 188 ~~~-i~~----H~a~~Yf~~~~gl~~ 208 (294) -.+ ++. -+...+|.+..|+.+ T Consensus 355 i~~~~sGGa~l~~~~~~f~~~lGi~i 380 (613) T COG1022 355 IRYALSGGAPLSPELLHFFRSLGIPI 380 (613) T ss_pred EEEEEECCCCCCHHHHHHHHHCCCCE T ss_conf 79999568759889999999769775 No 94 >PRK01742 tolB translocation protein TolB; Provisional Probab=47.54 E-value=20 Score=16.76 Aligned_cols=26 Identities=15% Similarity=0.447 Sum_probs=12.8 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 91589999999999600132388059 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARAGSLQV 26 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~~~~~V 26 (294) |||++.++++++.++++..+...+.| T Consensus 4 ~~~~~~~~~~l~~~~~~~~a~a~L~I 29 (430) T PRK01742 4 LKRLVSVFAVLFAVISNAFAGDEVRI 29 (430) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCEEE T ss_conf 99999999999999877850575599 No 95 >PRK13601 putative ribosomal protein L7Ae-like; Provisional Probab=47.53 E-value=22 Score=16.42 Aligned_cols=45 Identities=4% Similarity=0.203 Sum_probs=35.6 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCHHHHHH---HHHHHCCCEEEECCCC Q ss_conf 99999999754993899818889789999---9997199089980533 Q gi|254780717|r 224 LRNIRSKIISDKISCLFYGPEFDSKIIRS---ITNDTGVMSAILDPEG 268 (294) Q Consensus 224 l~~~~~~ik~~~v~~if~e~~~~~~~~~~---ia~e~~~~~~~ld~~~ 268 (294) .....+-+++.+++++|.-...+++..+. ++++.|+++...|+|- T Consensus 14 ~KQT~KAl~~g~a~~V~vA~DAD~~v~~~v~~lc~e~~VpV~~VdsMk 61 (84) T PRK13601 14 AKQTLKAIMNCNVLQVYIAKDAEEHVTKKIKELCEEKSIKIVYVDTMK 61 (84) T ss_pred HHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHCCCCEEEECCHH T ss_conf 899999997498338999816868899999999997399869845599 No 96 >PRK04792 tolB translocation protein TolB; Provisional Probab=47.19 E-value=23 Score=16.39 Aligned_cols=21 Identities=33% Similarity=0.534 Sum_probs=13.6 Q ss_pred CHHHHHHHHHHHHHHHCCCCC Q ss_conf 915899999999996001323 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARA 21 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~ 21 (294) |||+++.++++++++++++.+ T Consensus 2 ~k~~~~~l~lll~~~s~~A~A 22 (450) T PRK04792 2 LKRLILGLFVLLLSFSQVANA 22 (450) T ss_pred CHHHHHHHHHHHHHHHHHHEE T ss_conf 389999999999986575202 No 97 >cd00617 Tnase_like Tryptophanase family (Tnase). This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD correspond to tryptophanase (Tnase) and tyrosine phenol-lyase (TPL). Tnase and TPL are active as tetramers and catalyze beta-elimination reactions. Tnase catalyzes degradation of L-tryptophan to yield indole, pyruvate and ammonia and TPL catalyzes degradation of L-tyrosine to yield phenol, pyruvate and ammonia. Probab=47.12 E-value=23 Score=16.38 Aligned_cols=50 Identities=12% Similarity=0.199 Sum_probs=35.0 Q ss_pred CCHHHHHHHHHHHHHCCCCEEE--------EE-----C-----CCCH-HHHHHHHHHHCCCEEEECCCC Q ss_conf 4978899999999754993899--------81-----8-----8897-899999997199089980533 Q gi|254780717|r 219 MGAASLRNIRSKIISDKISCLF--------YG-----P-----EFDS-KIIRSITNDTGVMSAILDPEG 268 (294) Q Consensus 219 ~s~~~l~~~~~~ik~~~v~~if--------~e-----~-----~~~~-~~~~~ia~e~~~~~~~ld~~~ 268 (294) -.++++.-+.+.+++.+|+++- .+ | +++. .++-.|-.+.|++.+++-++. T Consensus 295 ~Ri~qv~yL~~~L~~~GvPvv~P~GGHav~iDA~~flPhip~~~fPa~aLa~~LY~~~GIR~~E~Gs~~ 363 (431) T cd00617 295 HRVEQVRYLGDRLDEAGVPIVEPAGGHAVFIDAREFLPHIPQEQFPAQALAAELYLEAGVRAVELGIFS 363 (431) T ss_pred HHHHHHHHHHHHHHHCCCCEECCCCCEEEEEEHHHHCCCCCHHHCCHHHHHHHHHHHCCCCEEEECCEE T ss_conf 779999999999998899666689743899746774689981439189999999997094568540200 No 98 >KOG0053 consensus Probab=46.75 E-value=23 Score=16.34 Aligned_cols=18 Identities=17% Similarity=0.226 Sum_probs=11.8 Q ss_pred CCHHHHHHHHHHHHHHHC Q ss_conf 678999999999998741 Q gi|254780717|r 276 ELYFQLMRSMSNSIAKNC 293 (294) Q Consensus 276 ~~Y~~~m~~n~~~l~k~l 293 (294) +.-.++.+...++|++.. T Consensus 391 Ed~~dL~~d~~~Al~~~~ 408 (409) T KOG0053 391 EDIEDLIKDFQQALEKAK 408 (409) T ss_pred CCHHHHHHHHHHHHHHHC T ss_conf 779999999999998622 No 99 >PRK00994 F420-dependent methylenetetrahydromethanopterin dehydrogenase; Provisional Probab=46.75 E-value=23 Score=16.34 Aligned_cols=190 Identities=10% Similarity=0.044 Sum_probs=91.1 Q ss_pred CHHHHHHHHHHHCCCCEEEEE-ECCCCCCCCCCCCHHHHHH------HCCCCEEEEECCCCCHH----HHHHHHHCCCCC Q ss_conf 228999999970794417887-0589983533568024566------50257899967661035----689997314557 Q gi|254780717|r 30 IKPIHSIVSCIMQGIGTPALL-VKGASSPHEYSLRISEAMM------LENADIIFWLGAEMESF----LVKPLHSLNKQS 98 (294) Q Consensus 30 ~~pl~~lv~~I~gd~~~V~~L-~~~g~dpH~y~~tp~d~~~------l~~Adlii~~G~~~E~~----~~~~~~~~~~~~ 98 (294) ..++-+++..--.|+-++.+- +..|. .+.|.++.. --+.|++|++|.+--.. ....++. .+. T Consensus 15 ~s~~idl~LDErAdRedI~vrv~gsGa-----Km~pe~~~~v~~~~~~~~pDf~i~isPN~a~PGP~~ARE~l~~--~~i 87 (276) T PRK00994 15 MSPVIDLLLDERADREDIDVRVVGSGA-----KMGPEEVERVVKKMKEWKPDFIIVISPNPAAPGPTKAREILSA--AGI 87 (276) T ss_pred HHHHHHHHHHHHHCCCCCEEEEECCCC-----CCCHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHH--CCC T ss_conf 599999987654134685499952667-----7797899999999884089989997899889995679999975--699 Q ss_pred CEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHH-----HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 679975135446210012111233212247567-----765555678999999997421865658899899998631222 Q gi|254780717|r 99 NVVTLSHSPDLHRILLRDNHSHFHDSEADDLHL-----WLNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQ 173 (294) Q Consensus 99 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~dpH~-----Wl~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~ 173 (294) ..|.+++.++......-+..+.+..--..||-+ ||||.-|..+-..+-..|+.- --++..+.+|+. T Consensus 88 P~IvI~D~p~~k~kd~l~~~g~GYIivk~D~MIGARREFLDP~EMa~fNaD~~kVLa~t---------G~~RlvQ~elD~ 158 (276) T PRK00994 88 PCIVIGDAPGKKKKDAMEEQGFGYIIVKADPMIGARREFLDPVEMASFNADVIKVLAGT---------GAFRLVQEALDK 158 (276) T ss_pred CEEEECCCCCHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHH T ss_conf 88997488533148899864984799836754431232038799987402699999852---------089999999999 Q ss_pred HHHHHHHHCCCCCCCEEEECCCHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 358999850300278146206215778987498120034677544497889999999975499389981888 Q gi|254780717|r 174 LDKELHSILQPVEKKKIIVFHEAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISDKISCLFYGPEF 245 (294) Q Consensus 174 l~~~~~~~~~~~~~~~~i~~H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~ 245 (294) ....++.--.+.--+.+++...+-.. -++ .+|=....+.--..+.+.+-+-+++.-|.+... T Consensus 159 vi~~v~~Gke~~LPkiv~t~ekaVe~----a~F------~NPYAkAKA~AA~~iAe~VA~idvkgCFm~Ke~ 220 (276) T PRK00994 159 VIDQVKAGKEIELPKIVITPEKAVEA----AGF------ANPYAKAKAMAAFEMAEKVADIDVKGCFMTKEM 220 (276) T ss_pred HHHHHHCCCCCCCCEEEECHHHHHHH----HCC------CCHHHHHHHHHHHHHHHHHHCCCCHHEEEECCH T ss_conf 99998668987787378747786776----246------887789999999999998745874112531467 No 100 >pfam07383 DUF1496 Protein of unknown function (DUF1496). This family consists of several bacterial proteins of around 90 residues in length. Members of this family seem to be found exclusively in the Orders Vibrionales and Enterobacteriales. The function of this family is unknown. Probab=46.59 E-value=20 Score=16.71 Aligned_cols=33 Identities=27% Similarity=0.396 Sum_probs=20.6 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCEEEEEECHHH Q ss_conf 915899999999996001323880599972289 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARAGSLQVVVSIKPI 33 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~~~~~Vv~s~~pl 33 (294) ||+.++..++++..+++.+......|+++.+|- T Consensus 1 Mk~~~~~~~~~~~~~~~la~~~~~dv~~~~~~~ 33 (88) T pfam07383 1 MKRLLILCFAALLSLVALANSVAADVIVTPPPE 33 (88) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCEECCCCHH T ss_conf 921899999999999986165788666169977 No 101 >COG2102 Predicted ATPases of PP-loop superfamily [General function prediction only] Probab=46.47 E-value=22 Score=16.49 Aligned_cols=25 Identities=16% Similarity=0.090 Sum_probs=16.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 7655556789999999974218656 Q gi|254780717|r 132 WLNPLNVQYIAHVIAMELIKKDPRN 156 (294) Q Consensus 132 Wl~p~n~~~~a~~I~~~L~~~dP~~ 156 (294) |-+...+..+|..+.-.+...+... T Consensus 45 ~~n~~~~~~~Ae~~gi~l~~~~~~g 69 (223) T COG2102 45 TPNLELAELQAEAMGIPLVTFDTSG 69 (223) T ss_pred CCCHHHHHHHHHHCCCCEEEEECCC T ss_conf 2256799999874488359983476 No 102 >cd05844 GT1_like_7 Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center Probab=46.35 E-value=23 Score=16.30 Aligned_cols=41 Identities=10% Similarity=-0.015 Sum_probs=20.7 Q ss_pred HHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCEEEEEECCCCCC Q ss_conf 66502578999676610356899973145576799751354462 Q gi|254780717|r 68 MMLENADIIFWLGAEMESFLVKPLHSLNKQSNVVTLSHSPDLHR 111 (294) Q Consensus 68 ~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~i~~~~~~~~~~ 111 (294) ..+..||.+|.++..+- +++.+..-...++..+..++|... T Consensus 138 ~l~~~a~~iI~vS~~~~---~~l~~~G~~~~ki~vi~~GvD~~~ 178 (367) T cd05844 138 RLARRAALFIAVSQFIR---DRLLALGFPPEKVHVHPIGVDTAK 178 (367) T ss_pred HHHHHCCEEEECCHHHH---HHHHHCCCCHHHEEEECCCCCHHH T ss_conf 99972699996999999---999985989789999778636764 No 103 >PRK00578 prfB peptide chain release factor 2; Validated Probab=46.29 E-value=23 Score=16.30 Aligned_cols=92 Identities=18% Similarity=0.316 Sum_probs=48.5 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHH----------------------HH--HCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 224756776555567899999999----------------------74--218656588998999986312223589998 Q gi|254780717|r 125 EADDLHLWLNPLNVQYIAHVIAME----------------------LI--KKDPRNKIIYEKNEEEFKNQLSQLDKELHS 180 (294) Q Consensus 125 ~~~dpH~Wl~p~n~~~~a~~I~~~----------------------L~--~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~ 180 (294) ..-||-+|-||..+..+.+..+.- |. +-||+-.+........+.+++.++. ++. T Consensus 38 ~lsdP~~W~D~~~a~kl~kE~~~L~~~v~~~~~l~~~~~dl~~l~eL~~ee~D~el~eea~~el~~l~~~l~~lE--l~~ 115 (367) T PRK00578 38 EAEDPDFWNDQERAQKVGKELSSLKAKLDTLENLRQRLDDLEELLELAEEEDDEETLAEAEAELKALEKKLAALE--LER 115 (367) T ss_pred HHCCCHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHH--HHH T ss_conf 863970544999999999999999999999999999999999999998649998999999999999999999999--986 Q ss_pred HCC-CCCCCE-EEECCCH----------------HHHHHHHCCCCEECCCCCCCCC Q ss_conf 503-002781-4620621----------------5778987498120034677544 Q gi|254780717|r 181 ILQ-PVEKKK-IIVFHEA----------------YRYFASHYNLSIVTFPMSHSVF 218 (294) Q Consensus 181 ~~~-~~~~~~-~i~~H~a----------------~~Yf~~~~gl~~~~~~~~~~~~ 218 (294) .+. |...+. ++.-|+. |.-|++.-|+++..+..+++.+ T Consensus 116 Llsd~~D~~naileI~aGaGG~EA~~~a~~L~rMY~r~ae~~g~k~~v~~~~~~~~ 171 (367) T PRK00578 116 LLSGEYDANNAILTIHAGAGGTEAQDWASMLLRMYLRWAERHGFKVEVLDYSEGEE 171 (367) T ss_pred CCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCC T ss_conf 04898867884799984899607889999999999999987588631410377655 No 104 >COG0626 MetC Cystathionine beta-lyases/cystathionine gamma-synthases [Amino acid transport and metabolism] Probab=46.20 E-value=23 Score=16.29 Aligned_cols=129 Identities=12% Similarity=0.160 Sum_probs=61.8 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCE--------EEECCCHHHHHHHHC-CC-CEECCCCCCCCCCCH Q ss_conf 18656588998999986312223589998503002781--------462062157789874-98-120034677544497 Q gi|254780717|r 152 KDPRNKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKK--------IIVFHEAYRYFASHY-NL-SIVTFPMSHSVFMGA 221 (294) Q Consensus 152 ~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~--------~i~~H~a~~Yf~~~~-gl-~~~~~~~~~~~~~s~ 221 (294) +.|..+=....+.+.+.-+++...+--.++..-+.+.+ -...||....+.|.+ |. -+.++....+ T Consensus 246 l~p~dA~l~lRGlkTL~~Rm~~~~~nA~~IA~~L~~~p~V~~V~yPgl~shp~he~~krq~~g~gg~~Sf~l~~~----- 320 (396) T COG0626 246 LSPFDAWLLLRGLRTLALRMERHNENALKIAEFLADHPKVKKVYYPGLPSHPGHELAKRQMTGYGGLFSFELKNE----- 320 (396) T ss_pred CCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHCCCCCEEEEEEECCH----- T ss_conf 887999999817153999999999989999999862997689989999889768999974579860899996785----- Q ss_pred HHHHHHHHHHHHCC-------CCEEEEEC--CCCHHHHHHHHHHHCCC--EEEECCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 88999999997549-------93899818--88978999999971990--899805336787781678999999999998 Q gi|254780717|r 222 ASLRNIRSKIISDK-------ISCLFYGP--EFDSKIIRSITNDTGVM--SAILDPEGMLIAEGPELYFQLMRSMSNSIA 290 (294) Q Consensus 222 ~~l~~~~~~ik~~~-------v~~if~e~--~~~~~~~~~ia~e~~~~--~~~ld~~~~~~~~~~~~Y~~~m~~n~~~l~ 290 (294) .+..++.+.++--. +..+..-| ..-..+-.....+.|+. .+.| +.|. +.-.+++....++|+ T Consensus 321 ~~~~~f~~~L~l~~~a~SlGgveSLi~~pa~~th~~~~~~~r~~~Gi~~~LvRl-SVGl------Ed~eDLiaDl~~Al~ 393 (396) T COG0626 321 EAAKKFLDSLKLFKLAESLGGVESLISHPATMTHASIPLEERAKAGITDGLVRL-SVGL------EDVEDLIADLEQALA 393 (396) T ss_pred HHHHHHHHHCCCCEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCEEEE-EECC------CCHHHHHHHHHHHHH T ss_conf 899999972782378326877440001665567554898789865999982899-9567------998999999999998 Q ss_pred HH Q ss_conf 74 Q gi|254780717|r 291 KN 292 (294) Q Consensus 291 k~ 292 (294) +. T Consensus 394 ~~ 395 (396) T COG0626 394 KA 395 (396) T ss_pred HH T ss_conf 52 No 105 >pfam01026 TatD_DNase TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme. Probab=45.94 E-value=24 Score=16.26 Aligned_cols=20 Identities=10% Similarity=0.180 Sum_probs=12.0 Q ss_pred HHHHHHHHCCCCEEEEECCC Q ss_conf 99999975499389981888 Q gi|254780717|r 226 NIRSKIISDKISCLFYGPEF 245 (294) Q Consensus 226 ~~~~~ik~~~v~~if~e~~~ 245 (294) ++.+.++.-....|+.|+-. T Consensus 187 ~~~~~~~~iPl~rlllETD~ 206 (255) T pfam01026 187 ELREVVAEIPLDRLLVETDA 206 (255) T ss_pred HHHHHHHHCCCCEEEEECCC T ss_conf 99999974996259983589 No 106 >PRK07789 acetolactate synthase 1 catalytic subunit; Validated Probab=44.60 E-value=25 Score=16.13 Aligned_cols=41 Identities=20% Similarity=0.366 Sum_probs=24.3 Q ss_pred CCHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCEEE Q ss_conf 68024566502578999676610356899973145576799 Q gi|254780717|r 62 LRISEAMMLENADIIFWLGAEMESFLVKPLHSLNKQSNVVT 102 (294) Q Consensus 62 ~tp~d~~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~i~ 102 (294) -++.....+++||++|..|..|..+..........+.++|- T Consensus 287 g~~~a~~~l~~aDlvL~iG~rl~~~~tg~~~~~~p~~kvI~ 327 (612) T PRK07789 287 GTVAAVAALQRSDLLIALGARFDDRVTGKLDSFAPDAKVIH 327 (612) T ss_pred CCHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEE T ss_conf 88899976505886888625567555687343599873899 No 107 >pfam07437 YfaZ YfaZ precursor. This family contains the precursor of the bacterial protein YfaZ (approximately 180 residues long). Many members of this family are hypothetical proteins. Probab=44.10 E-value=15 Score=17.47 Aligned_cols=26 Identities=19% Similarity=0.398 Sum_probs=19.4 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 91589999999999600132388059 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARAGSLQV 26 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~~~~~V 26 (294) |||+++.+++.+++.++.+.+..+.+ T Consensus 1 mkk~~~~~a~~l~~~s~~A~A~s~~~ 26 (180) T pfam07437 1 MAKFRVASVALLLLVALSVNASSFNL 26 (180) T ss_pred CCHHHHHHHHHHHHHHHHCEEEEEEC T ss_conf 90358999999987632105788422 No 108 >PRK04405 prsA peptidylprolyl isomerase; Provisional Probab=43.99 E-value=23 Score=16.40 Aligned_cols=27 Identities=15% Similarity=0.121 Sum_probs=14.9 Q ss_pred CHHHHHHHHHHH--HHHHCCCCCCCEEEEE Q ss_conf 915899999999--9960013238805999 Q gi|254780717|r 1 MKNFLIILIFLF--FILSSVARAGSLQVVV 28 (294) Q Consensus 1 Mkk~l~~~~~~~--~~l~~~~~~~~~~Vv~ 28 (294) |||+++.+++++ +.|++|++.+ -.|++ T Consensus 5 MKK~~l~~~~~~~~l~LaaCss~~-~~Vat 33 (298) T PRK04405 5 MKKWALAAASAGLLLSLAGCSSNN-KTVAT 33 (298) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCC-CEEEE T ss_conf 899999999999999998717999-76998 No 109 >pfam03652 UPF0081 Uncharacterized protein family (UPF0081). Probab=43.53 E-value=26 Score=16.03 Aligned_cols=65 Identities=9% Similarity=0.138 Sum_probs=44.1 Q ss_pred HHHHHHHCCCCEE--CCC--CCCCCCCCHHHHHHHHHHHHHC-CCCEEEEECCCCHHHHHHHHHHHCCCE Q ss_conf 5778987498120--034--6775444978899999999754-993899818889789999999719908 Q gi|254780717|r 197 YRYFASHYNLSIV--TFP--MSHSVFMGAASLRNIRSKIISD-KISCLFYGPEFDSKIIRSITNDTGVMS 261 (294) Q Consensus 197 ~~Yf~~~~gl~~~--~~~--~~~~~~~s~~~l~~~~~~ik~~-~v~~if~e~~~~~~~~~~ia~e~~~~~ 261 (294) +.-+.++++...+ |+. .+....+..+.+..+.+.+++. ++++.|.+..+++..++...++.|.+. T Consensus 43 i~~ii~e~~i~~iVvGlP~~~dG~~~~~~~~v~~f~~~L~~~~~i~v~~~DEr~TS~~A~~~l~~~~~~~ 112 (134) T pfam03652 43 LAELIKEWQPDGIVVGLPLNMDGSEGEQTKRVRKFARRLKKRFGLPVELVDERLTTVEAERILREAGLSR 112 (134) T ss_pred HHHHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHCCCCC T ss_conf 9999998399989995589999880889999999999999861998688634327999999999737442 No 110 >cd06310 PBP1_ABC_sugar_binding_like_2 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=43.37 E-value=26 Score=16.01 Aligned_cols=34 Identities=24% Similarity=0.351 Sum_probs=17.2 Q ss_pred CCCCEEEEECCCCCHHHHHHHHHCC-CCCCEEEEEE Q ss_conf 0257899967661035689997314-5576799751 Q gi|254780717|r 71 ENADIIFWLGAEMESFLVKPLHSLN-KQSNVVTLSH 105 (294) Q Consensus 71 ~~Adlii~~G~~~E~~~~~~~~~~~-~~~~~i~~~~ 105 (294) ++.|.|+....+-+. +...++.+. ...+++.+.. T Consensus 56 ~~vDgIii~p~~~~~-~~~~l~~a~~~gIPvV~~d~ 90 (273) T cd06310 56 RGPDAILLAPTDAKA-LVPPLKEAKDAGIPVVLIDS 90 (273) T ss_pred CCCCEEEEECCCCHH-HHHHHHHHHHCCCCEEEEEC T ss_conf 499999991687144-79999999984998589803 No 111 >PRK13962 bifunctional phosphoglycerate kinase/triosephosphate isomerase; Provisional Probab=43.25 E-value=26 Score=16.00 Aligned_cols=21 Identities=24% Similarity=0.436 Sum_probs=15.2 Q ss_pred CCCHHHH----HHHCCCCEEEEECC Q ss_conf 5680245----66502578999676 Q gi|254780717|r 61 SLRISEA----MMLENADIIFWLGA 81 (294) Q Consensus 61 ~~tp~d~----~~l~~Adlii~~G~ 81 (294) ...|+.+ +.|++|.-|||||+ T Consensus 295 DIGp~Ti~~~~~~I~~a~TI~WNGP 319 (653) T PRK13962 295 DIGPETVELFKEKLSDAKTIVWNGP 319 (653) T ss_pred CCCHHHHHHHHHHHHHCCEEEEECC T ss_conf 6889999999999977999999789 No 112 >PRK12789 flgI flagellar basal body P-ring protein; Reviewed Probab=43.23 E-value=11 Score=18.34 Aligned_cols=21 Identities=14% Similarity=0.374 Sum_probs=13.6 Q ss_pred CHHHHHHHHHHHHHHHCCCCC Q ss_conf 915899999999996001323 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARA 21 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~ 21 (294) |||+++++++.+++++..+.+ T Consensus 2 ~~~~~~~~~~~l~~~~~~~~a 22 (367) T PRK12789 2 IRRLLLAVLLALLAGPAAAAA 22 (367) T ss_pred HHHHHHHHHHHHHHCCCHHHH T ss_conf 089999999999855306666 No 113 >TIGR03431 PhnD phosphonate ABC transporter, periplasmic phosphonate binding protein. Note that this model does not identify all phnD-subfamily genes with evident phosphonate context, but all sequences above the trusted context may be inferred to bind phosphonate compounds even in the absence of such context. Furthermore, there is ample evidence to suggest that many other members of the TIGR01098 subfamily have a different primary function. Probab=43.17 E-value=26 Score=16.00 Aligned_cols=75 Identities=19% Similarity=0.290 Sum_probs=39.0 Q ss_pred CHHHHHHHHHHHHHHHCCCCCC--C--EEEE-----------EECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHH Q ss_conf 9158999999999960013238--8--0599-----------97228999999970794417887058998353356802 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARAG--S--LQVV-----------VSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRIS 65 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~~--~--~~Vv-----------~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~ 65 (294) |||++++++++++++++++.+. + +++. ....|+.+..+.-.|- +|+..+..+.. .-- T Consensus 2 ~~r~l~~~~~~~~~~~~~a~a~~~p~~l~~gv~P~~~~~~~~~~~~Pl~~yL~~~lg~--~Ve~~~~~~y~------~~~ 73 (288) T TIGR03431 2 LRRLILSLVAAFMLISSNAQAEDWPKELNFGIIPTENASDLKQRWEPLADYLSKKLGV--KVKLFFATDYA------GVI 73 (288) T ss_pred CHHHHHHHHHHHHHHCCHHHHCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHHCC--CEEEEECCCHH------HHH T ss_conf 0889999999999974322430488368999806999999999999999999998789--78999689999------999 Q ss_pred HHHHHCCCCEEEEECCCC Q ss_conf 456650257899967661 Q gi|254780717|r 66 EAMMLENADIIFWLGAEM 83 (294) Q Consensus 66 d~~~l~~Adlii~~G~~~ 83 (294) +..+-.+.|+.+.++..+ T Consensus 74 eal~~g~~Dia~~~p~~y 91 (288) T TIGR03431 74 EGMRFGKVDIAWYGPSSY 91 (288) T ss_pred HHHHCCCCEEEEECCHHH T ss_conf 999859830999884788 No 114 >pfam01993 MTD methylene-5,6,7,8-tetrahydromethanopterin dehydrogenase. This enzyme family is involved in formation of methane from carbon dioxide EC:1.5.99.9. The enzyme requires coenzyme F420. Probab=42.84 E-value=26 Score=15.96 Aligned_cols=190 Identities=9% Similarity=0.043 Sum_probs=90.6 Q ss_pred CHHHHHHHHHHHCCCCEEEEE-ECCCCCCCCCCCCHHHHHH-------HCCCCEEEEECCCCCHH----HHHHHHHCCCC Q ss_conf 228999999970794417887-0589983533568024566-------50257899967661035----68999731455 Q gi|254780717|r 30 IKPIHSIVSCIMQGIGTPALL-VKGASSPHEYSLRISEAMM-------LENADIIFWLGAEMESF----LVKPLHSLNKQ 97 (294) Q Consensus 30 ~~pl~~lv~~I~gd~~~V~~L-~~~g~dpH~y~~tp~d~~~-------l~~Adlii~~G~~~E~~----~~~~~~~~~~~ 97 (294) ..++-+++..--.|+-++.+- +..|. .+.|.++.. --+.|++|++|.+--.. ....++. .. T Consensus 14 ~s~~idl~LDErAdRedI~vrv~gsGa-----Km~pe~~e~~~~~~l~~~~pDf~i~isPN~a~PGP~~ARE~l~~--~g 86 (276) T pfam01993 14 TSPVVDLLLDERADREDIEVRVVGSGA-----KMDPECVEEVVLDMLEEFEPDFVIYISPNPAAPGPKKAREMLSD--SG 86 (276) T ss_pred HHHHHHHHHHHHHCCCCCEEEEECCCC-----CCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHH--CC T ss_conf 599999987765234686499952666-----67988899999999986189989997899889995679999975--69 Q ss_pred CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHH-----HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH Q ss_conf 7679975135446210012111233212247567-----76555567899999999742186565889989999863122 Q gi|254780717|r 98 SNVVTLSHSPDLHRILLRDNHSHFHDSEADDLHL-----WLNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLS 172 (294) Q Consensus 98 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~dpH~-----Wl~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~ 172 (294) ...|.+++.++......-+..+.+..--..||-+ ||||.-|..+-..+-..|+. .--++..+.+|+ T Consensus 87 iP~IvI~D~p~~K~kd~l~~~g~GYIivk~D~MIGARREFLDP~EMa~fNaD~~kVLa~---------tG~~RlvQ~eiD 157 (276) T pfam01993 87 YPAVIIGDAPGLKVKDEMEEQGLGYILVKADPMIGARREFLDPTEMALFNADVIKVLAA---------TGAFRVVQEAID 157 (276) T ss_pred CCEEEECCCCCHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHH T ss_conf 98799837863325899986598479983676443123203879998740259999986---------318999999999 Q ss_pred HHHHHHHHHCCCCCCCEEEECCCHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 2358999850300278146206215778987498120034677544497889999999975499389981888 Q gi|254780717|r 173 QLDKELHSILQPVEKKKIIVFHEAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISDKISCLFYGPEF 245 (294) Q Consensus 173 ~l~~~~~~~~~~~~~~~~i~~H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~ 245 (294) .....++.--.+.--+.+++...+-.. -++ .+|=....+.--..+.+.+-+-+++.-|.+... T Consensus 158 ~vi~~v~~Gk~~~LPkiv~t~~kaVe~----a~F------~NPYAkAKA~AA~~iAe~VA~idvkgCFm~Ke~ 220 (276) T pfam01993 158 KMIEDVKAGKEPELPQIVIDRNKAVEA----EEF------TNPYAKAKAMAAFEIAEKVADIDVKGCFMEQDP 220 (276) T ss_pred HHHHHHHCCCCCCCCEEEECHHHHHHH----HCC------CCHHHHHHHHHHHHHHHHHHCCCCHHEEEECCH T ss_conf 999998658977787388747786776----236------887789999999999999745874102531467 No 115 >PRK07245 consensus Probab=42.56 E-value=27 Score=15.94 Aligned_cols=22 Identities=9% Similarity=0.276 Sum_probs=15.7 Q ss_pred CCCHHHHHHHHHHHHHHHHHHH Q ss_conf 2475677655556789999999 Q gi|254780717|r 126 ADDLHLWLNPLNVQYIAHVIAM 147 (294) Q Consensus 126 ~~dpH~Wl~p~n~~~~a~~I~~ 147 (294) .-||.+|-||..+..+.+.++. T Consensus 12 ~s~Pd~w~D~~ka~kl~ke~~~ 33 (337) T PRK07245 12 MTEPDFWNDNIAAQKTSQELNE 33 (337) T ss_pred HCCCHHHHCHHHHHHHHHHHHH T ss_conf 6297033099999999999999 No 116 >PRK00109 Holliday junction resolvase-like protein; Reviewed Probab=42.22 E-value=27 Score=15.90 Aligned_cols=64 Identities=8% Similarity=0.133 Sum_probs=43.4 Q ss_pred HHHHHHHCCCCEE--CCC--CCCCCCCCHHHHHHHHHHHHHC-CCCEEEEECCCCHHHHHHHHHHHCCC Q ss_conf 5778987498120--034--6775444978899999999754-99389981888978999999971990 Q gi|254780717|r 197 YRYFASHYNLSIV--TFP--MSHSVFMGAASLRNIRSKIISD-KISCLFYGPEFDSKIIRSITNDTGVM 260 (294) Q Consensus 197 ~~Yf~~~~gl~~~--~~~--~~~~~~~s~~~l~~~~~~ik~~-~v~~if~e~~~~~~~~~~ia~e~~~~ 260 (294) +.-+.++++...+ |.. .+....+..+.+..+.+.+++. ++++.|.+..++++.++...++.|.+ T Consensus 46 l~~ii~e~~i~~iViGlP~~~~g~~~~~~~~v~~F~~~L~~~~~l~v~~~DEr~TS~~A~~~l~~~~~~ 114 (141) T PRK00109 46 LEKLLKEWQPDLLVVGLPLNMDGTEGPMTERARKFANRLEGRFGLPVELVDERLTTVEAERMLFEAGVS 114 (141) T ss_pred HHHHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHCCCC T ss_conf 999999839998999447999988448899999999999997299879862432799999999973653 No 117 >TIGR01826 CofD_related conserved hypothetical protein; InterPro: IPR010119 This entry represents a subfamily of conserved hypothetical proteins that forms a sister group to the family of CofD, (IPR010115 from INTERPRO), LPPG:Fo 2-phospho-L-lactate transferase, an enzyme of cytochrome F420 biosynthesis.. Probab=42.18 E-value=20 Score=16.72 Aligned_cols=16 Identities=19% Similarity=0.227 Sum_probs=7.2 Q ss_pred CC-HHHHHHHCCCCEEE Q ss_conf 68-02456650257899 Q gi|254780717|r 62 LR-ISEAMMLENADIIF 77 (294) Q Consensus 62 ~t-p~d~~~l~~Adlii 77 (294) |+ +.-+.+|.+||+|| T Consensus 179 a~a~eAv~aI~eADlIi 195 (331) T TIGR01826 179 ALAREAVEAIREADLII 195 (331) T ss_pred CCCHHHHHHHHHCCEEE T ss_conf 75158999996669278 No 118 >PRK00315 potassium-transporting ATPase subunit C; Reviewed Probab=41.55 E-value=1.2 Score=24.59 Aligned_cols=23 Identities=22% Similarity=0.283 Sum_probs=19.0 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2247567765555678999999997 Q gi|254780717|r 125 EADDLHLWLNPLNVQYIAHVIAMEL 149 (294) Q Consensus 125 ~~~dpH~Wl~p~n~~~~a~~I~~~L 149 (294) .+.|||+ ||.++..++..|+++- T Consensus 127 SGLDPhI--S~~aA~~Qv~RVA~ar 149 (195) T PRK00315 127 SGLDPHI--SPAAAAYQIPRVAAAR 149 (195) T ss_pred CCCCCCC--CHHHHHHHHHHHHHHH T ss_conf 6689998--9999999999999983 No 119 >COG3933 Transcriptional antiterminator [Transcription] Probab=41.32 E-value=28 Score=15.82 Aligned_cols=31 Identities=10% Similarity=0.063 Sum_probs=12.4 Q ss_pred ECHHHHHHHHHHHCCCCEEEEEECCCCCCCC Q ss_conf 7228999999970794417887058998353 Q gi|254780717|r 29 SIKPIHSIVSCIMQGIGTPALLVKGASSPHE 59 (294) Q Consensus 29 s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~ 59 (294) |-.-.+..|..+-|...-.-.=+|-.+.||. T Consensus 120 TASSmaevanrLL~~~~~~aiDMPLdvsp~~ 150 (470) T COG3933 120 TASSMAEVANRLLGEEIFIAIDMPLDVSPSD 150 (470) T ss_pred HHHHHHHHHHHHHHCCCEEEECCCCCCCHHH T ss_conf 0788999999986325301346877678899 No 120 >pfam01261 AP_endonuc_2 Xylose isomerase-like TIM barrel. This TIM alpha/beta barrel structure is found in xylose isomerase and in endonuclease IV (EC:3.1.21.2). This domain is also found in the N termini of bacterial myo-inositol catabolism proteins. These are involved in the myo-inositol catabolism pathway, and is required for growth on myo-inositol in Rhizobium leguminosarum bv. viciae. Probab=41.16 E-value=28 Score=15.80 Aligned_cols=21 Identities=5% Similarity=-0.050 Sum_probs=12.1 Q ss_pred CHHHHHHHHHHHHHCCCCEEE Q ss_conf 978899999999754993899 Q gi|254780717|r 220 GAASLRNIRSKIISDKISCLF 240 (294) Q Consensus 220 s~~~l~~~~~~ik~~~v~~if 240 (294) |+.++..+++.+...++..+| T Consensus 133 ~~~~~~~l~~~v~~~~~~~~~ 153 (201) T pfam01261 133 YFEEALRLIEEVDSPNVGLCL 153 (201) T ss_pred CHHHHHHHHHHCCCCCCCCCC T ss_conf 999999999864998655110 No 121 >cd03796 GT1_PIG-A_like This family is most closely related to the GT1 family of glycosyltransferases. Phosphatidylinositol glycan-class A (PIG-A), an X-linked gene in humans, is necessary for the synthesis of N-acetylglucosaminyl-phosphatidylinositol, a very early intermediate in glycosyl phosphatidylinositol (GPI)-anchor biosynthesis. The GPI-anchor is an important cellular structure that facilitates the attachment of many proteins to cell surfaces. Somatic mutations in PIG-A have been associated with Paroxysmal Nocturnal Hemoglobinuria (PNH), an acquired hematological disorder. Probab=40.48 E-value=29 Score=15.73 Aligned_cols=19 Identities=11% Similarity=0.324 Sum_probs=8.0 Q ss_pred HHHHHHHHHCCCCEEEEEE Q ss_conf 9999999707944178870 Q gi|254780717|r 33 IHSIVSCIMQGIGTPALLV 51 (294) Q Consensus 33 l~~lv~~I~gd~~~V~~L~ 51 (294) ++.+++.+..--.+|..+. T Consensus 20 v~~La~~L~~~Gh~V~Vit 38 (398) T cd03796 20 IYQLSQCLIKRGHKVVVIT 38 (398) T ss_pred HHHHHHHHHHCCCEEEEEE T ss_conf 9999999997699899996 No 122 >COG0240 GpsA Glycerol-3-phosphate dehydrogenase [Energy production and conversion] Probab=40.40 E-value=29 Score=15.73 Aligned_cols=75 Identities=23% Similarity=0.205 Sum_probs=54.5 Q ss_pred HHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHCCCCEEEEEC--CCCCHHHHHHHHHCCCCCCEEEEEECCCC Q ss_conf 99999970794417887058998353356802456650257899967--66103568999731455767997513544 Q gi|254780717|r 34 HSIVSCIMQGIGTPALLVKGASSPHEYSLRISEAMMLENADIIFWLG--AEMESFLVKPLHSLNKQSNVVTLSHSPDL 109 (294) Q Consensus 34 ~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~G--~~~E~~~~~~~~~~~~~~~~i~~~~~~~~ 109 (294) .+++.+|--++.++.+|- .-..|.+...++.-.+.+++||+++..= ..+..++.++...+.++..++.++++... T Consensus 34 ~~~~~~i~~~~~N~~yLp-~i~lp~~l~at~Dl~~a~~~ad~iv~avPs~~~r~v~~~l~~~l~~~~~iv~~sKGie~ 110 (329) T COG0240 34 EEIVAEINETRENPKYLP-GILLPPNLKATTDLAEALDGADIIVIAVPSQALREVLRQLKPLLLKDAIIVSATKGLEP 110 (329) T ss_pred HHHHHHHHHCCCCCCCCC-CCCCCCCCCCCCCHHHHHHCCCEEEEECCHHHHHHHHHHHHHHCCCCCEEEEEECCCCC T ss_conf 999999973476701059-96288632224689999722999999787578999999876433678749997446558 No 123 >TIGR02196 GlrX_YruB Glutaredoxin-like protein, YruB-family; InterPro: IPR011911 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This glutaredoxin-like protein family contains the conserved CxxC motif and includes the Clostridium pasteurianum protein YruB which has been cloned from a rubredoxin operon . Somewhat related to NrdH, it is unknown whether this protein actually interacts with glutathione/glutathione reducatase, or, like NrdH, some other reductant system.. Probab=40.26 E-value=29 Score=15.71 Aligned_cols=43 Identities=12% Similarity=0.028 Sum_probs=36.5 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHC-CCEEEECC Q ss_conf 99999999754993899818889789999999719-90899805 Q gi|254780717|r 224 LRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTG-VMSAILDP 266 (294) Q Consensus 224 l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~-~~~~~ld~ 266 (294) =.++.++++.+||++.=.+-+.++..++.+.+.+| -+.+++=- T Consensus 13 C~~aK~yL~~~gv~f~~~dV~~d~~A~~~~~k~~gG~~gVPv~v 56 (79) T TIGR02196 13 CKKAKEYLTSKGVAFEEIDVEKDSAAREELLKKLGGQRGVPVIV 56 (79) T ss_pred HHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHCCCCCCCEEE T ss_conf 78999999954994576343369788999999838985455576 No 124 >COG2262 HflX GTPases [General function prediction only] Probab=40.18 E-value=29 Score=15.70 Aligned_cols=75 Identities=13% Similarity=0.156 Sum_probs=43.1 Q ss_pred EECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCC--HHH---HHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCEEE Q ss_conf 972289999999707944178870589983533568--024---566502578999676610356899973145576799 Q gi|254780717|r 28 VSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLR--ISE---AMMLENADIIFWLGAEMESFLVKPLHSLNKQSNVVT 102 (294) Q Consensus 28 ~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~t--p~d---~~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~i~ 102 (294) .+..=+..+++..+++.+.+..-....-||-.|==+ ... +....+||++|.++ .+.|--...+...- ..++++ T Consensus 20 ~~leEl~~La~tag~~v~~~~~q~r~~pdp~~~iG~GK~eEi~~~v~~~~ad~VIf~~-~LsP~Q~~NLe~~l-~~kVID 97 (411) T COG2262 20 ESLEELAELAETAGYEVVEVVTQKRERPDPKTYIGSGKLEEIAEAVEETGADLVIFDH-ELSPSQLRNLEKEL-GVKVID 97 (411) T ss_pred HHHHHHHHHHHHCCCEEEEEEEEECCCCCCCEECCCCHHHHHHHHHHHCCCCEEEECC-CCCHHHHHHHHHHH-CCEEEE T ss_conf 2299999999975976816678712589963000765599999999843999999999-68988999999997-988984 Q ss_pred EE Q ss_conf 75 Q gi|254780717|r 103 LS 104 (294) Q Consensus 103 ~~ 104 (294) -. T Consensus 98 Rt 99 (411) T COG2262 98 RT 99 (411) T ss_pred HH T ss_conf 18 No 125 >PRK11557 putative DNA-binding transcriptional regulator; Provisional Probab=40.14 E-value=29 Score=15.70 Aligned_cols=79 Identities=8% Similarity=-0.001 Sum_probs=32.5 Q ss_pred CCCEEEECCCHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHH--HHHHHCCCEEE Q ss_conf 2781462062157789874981200346775444978899999999754993899818889789999--99971990899 Q gi|254780717|r 186 EKKKIIVFHEAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRS--ITNDTGVMSAI 263 (294) Q Consensus 186 ~~~~~i~~H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~--ia~e~~~~~~~ 263 (294) .++.++....+-...+++|..+..-++........+.........+.++++-.+|.-++.+..+.+. ++++.|++++. T Consensus 132 A~~I~i~G~G~S~~vA~~~~~kl~rig~~~~~~~d~~~~~~~~~~l~~~Dv~I~iS~sG~t~~~~~~~~~Ak~~Ga~iIa 211 (282) T PRK11557 132 ARRIILTGIGASGLVAQNFAWKLMKIGINAVAERDMHALLATVQALSPDDLLLAISYSGERRELNLAADEALRVGGKVLA 211 (282) T ss_pred CCCEEEEECCCHHHHHHHHHHHHHHHCCCEEECCCHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHHCCCEEEE T ss_conf 89589997043699999999999980996374088689999998189999899985999978999999999987993999 Q ss_pred E Q ss_conf 8 Q gi|254780717|r 264 L 264 (294) Q Consensus 264 l 264 (294) + T Consensus 212 I 212 (282) T PRK11557 212 I 212 (282) T ss_pred E T ss_conf 7 No 126 >PRK07811 cystathionine gamma-synthase; Provisional Probab=39.67 E-value=29 Score=15.65 Aligned_cols=77 Identities=8% Similarity=0.108 Sum_probs=36.4 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHH---CCCCCCCE-----EEECCCHHHHHHHHCCC--CEECCCCCCCCCCCH Q ss_conf 186565889989999863122235899985---03002781-----46206215778987498--120034677544497 Q gi|254780717|r 152 KDPRNKIIYEKNEEEFKNQLSQLDKELHSI---LQPVEKKK-----IIVFHEAYRYFASHYNL--SIVTFPMSHSVFMGA 221 (294) Q Consensus 152 ~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~---~~~~~~~~-----~i~~H~a~~Yf~~~~gl--~~~~~~~~~~~~~s~ 221 (294) +.|..+-......+.+.-++++-.+--... +....... -+..|+.+..+.+.++- -+.+++...+ . T Consensus 239 ~sP~~a~l~lrGL~TL~lRm~~~~~nA~~lA~~L~~hp~V~~V~yPgL~s~p~h~la~~~~~g~Gg~~sf~l~g~----~ 314 (386) T PRK07811 239 PGPFDAYLTMRGLKTLAVRMDRHSENAEAVAEFLDGHPEVSTVLYPGLPSHPGHEVAARQMRGFGGMVSVRLRGG----E 314 (386) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCHHHHHHHCCCCCCEEEEEECCC----H T ss_conf 885788899865736999999999999999999972998377989999999378999986799986348996286----9 Q ss_pred HHHHHHHHHHH Q ss_conf 88999999997 Q gi|254780717|r 222 ASLRNIRSKII 232 (294) Q Consensus 222 ~~l~~~~~~ik 232 (294) ....++.+.++ T Consensus 315 ~~~~~f~~~L~ 325 (386) T PRK07811 315 EAARDFCARTK 325 (386) T ss_pred HHHHHHHHHCC T ss_conf 99999997089 No 127 >pfam06291 Lambda_Bor Bor protein. This family consists of several Bacteriophage lambda Bor and Escherichia coli Iss proteins. Expression of bor significantly increases the survival of the Escherichia coli host cell in animal serum. This property is a well known bacterial virulence determinant indeed, bor and its adjacent sequences are highly homologous to the iss serum resistance locus of the plasmid ColV2-K94, which confers virulence in animals. It has been suggested that lysogeny may generally have a role in bacterial survival in animal hosts, and perhaps in pathogenesis. Probab=39.15 E-value=21 Score=16.57 Aligned_cols=20 Identities=25% Similarity=0.433 Sum_probs=14.0 Q ss_pred CHHHHHHHHHHHHHHHCCCCC Q ss_conf 915899999999996001323 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARA 21 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~ 21 (294) |||+++. +++.+++++|+.. T Consensus 1 MKk~~~~-~~~allLtgCa~Q 20 (97) T pfam06291 1 MKKMLFA-AALALLITGCAQQ 20 (97) T ss_pred CHHHHHH-HHHHHHHCCCCCE T ss_conf 9225999-9999997213304 No 128 >COG2247 LytB Putative cell wall-binding domain [Cell envelope biogenesis, outer membrane] Probab=39.05 E-value=30 Score=15.59 Aligned_cols=88 Identities=20% Similarity=0.318 Sum_probs=42.7 Q ss_pred CHHHHHHHHHHHHHHHCC----CCCCC--EEEEEECHHHHH--HHHHHHCCCCEEEEEECC-CCCCCCCCCCHHHHHHHC Q ss_conf 915899999999996001----32388--059997228999--999970794417887058-998353356802456650 Q gi|254780717|r 1 MKNFLIILIFLFFILSSV----ARAGS--LQVVVSIKPIHS--IVSCIMQGIGTPALLVKG-ASSPHEYSLRISEAMMLE 71 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~----~~~~~--~~Vv~s~~pl~~--lv~~I~gd~~~V~~L~~~-g~dpH~y~~tp~d~~~l~ 71 (294) |||++-+++++++.+++. +++.. .-|++|. ..+| ++..++ +..+-..++.| | -|.=+..+--.=- T Consensus 1 Mrr~l~lll~~~l~l~s~~av~A~~~~~~~VIlvsD-n~aD~~lA~~ia-ellNA~Vlttpwg----~ynes~~~eI~~l 74 (337) T COG2247 1 MRRLLMLLLASLLALSSPPAVSAQSQNTTVVILVSD-NEADLLLALPIA-ELLNAPVLTTPWG----IYNESVLDEIIEL 74 (337) T ss_pred CCCHHHHHHHHHHHHHCCHHHHHHHCCCEEEEEECC-HHHHHHHHHHHH-HHHCCEEEECCCC----CCCHHHHHHHHHH T ss_conf 910889999999998230444444148159999566-077788756799-9758905726754----2139999999961 Q ss_pred CCCEEEEECCCCC--HHHHHHHHHC Q ss_conf 2578999676610--3568999731 Q gi|254780717|r 72 NADIIFWLGAEME--SFLVKPLHSL 94 (294) Q Consensus 72 ~Adlii~~G~~~E--~~~~~~~~~~ 94 (294) +-|+++.+|...- +..++.+++. T Consensus 75 npd~VLIIGGp~AVs~~yE~~Lks~ 99 (337) T COG2247 75 NPDLVLIIGGPIAVSPNYENALKSL 99 (337) T ss_pred CCCEEEEECCCCCCCHHHHHHHHHC T ss_conf 9865999789875786589999857 No 129 >PRK10812 putative metallodependent hydrolase; Provisional Probab=39.03 E-value=30 Score=15.59 Aligned_cols=11 Identities=0% Similarity=0.241 Sum_probs=5.1 Q ss_pred EEEECCCCCHH Q ss_conf 99967661035 Q gi|254780717|r 76 IFWLGAEMESF 86 (294) Q Consensus 76 ii~~G~~~E~~ 86 (294) ++.+|..++.| T Consensus 37 ~i~~g~~~~~~ 47 (265) T PRK10812 37 CLAVATTLPGY 47 (265) T ss_pred EEEECCCHHHH T ss_conf 99953998999 No 130 >COG5633 Predicted periplasmic lipoprotein [General function prediction only] Probab=38.64 E-value=19 Score=16.88 Aligned_cols=26 Identities=15% Similarity=0.335 Sum_probs=16.1 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCEEEE Q ss_conf 915899999999996001323880599 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARAGSLQVV 27 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~~~~~Vv 27 (294) |||..+ +.+..+++.+|++...+.++ T Consensus 1 Mrk~~~-~~l~~~lLvGCsS~~~i~~~ 26 (123) T COG5633 1 MRKLCL-LSLALLLLVGCSSHQEILVN 26 (123) T ss_pred CCEEHH-HHHHHHHHHCCCCCCCCCCC T ss_conf 930038-99999994204788775225 No 131 >cd00003 PNPsynthase Pyridoxine 5'-phosphate (PNP) synthase domain; pyridoxal 5'-phosphate is the active form of vitamin B6 that acts as an essential, ubiquitous coenzyme in amino acid metabolism. In bacteria, formation of pyridoxine 5'-phosphate is a step in the biosynthesis of vitamin B6. PNP synthase, a homooctameric enzyme, catalyzes the final step in PNP biosynthesis, the condensation of 1-amino-acetone 3-phosphate and 1-deoxy-D-xylulose 5-phosphate. PNP synthase adopts a TIM barrel topology, intersubunit contacts are mediated by three ''extra'' helices, generating a tetramer of symmetric dimers with shared active sites; the open state has been proposed to accept substrates and to release products, while most of the catalytic events are likely to occur in the closed state; a hydrophilic channel running through the center of the barrel was identified as the essential structural feature that enables PNP synthase to release water molecules produced during the reaction from the closed, Probab=38.19 E-value=31 Score=15.51 Aligned_cols=44 Identities=23% Similarity=0.340 Sum_probs=34.6 Q ss_pred CHHHHHHHHHHHHHCCCCE-EEEECCCCHHHHHHHHHHHCCCEEEECC Q ss_conf 9788999999997549938-9981888978999999971990899805 Q gi|254780717|r 220 GAASLRNIRSKIISDKISC-LFYGPEFDSKIIRSITNDTGVMSAILDP 266 (294) Q Consensus 220 s~~~l~~~~~~ik~~~v~~-if~e~~~~~~~~~~ia~e~~~~~~~ld~ 266 (294) ....|..+++.+++.++++ +|.||.. +.++ .++++|+..++|+. T Consensus 108 ~~~~L~~~i~~lk~~~IrvSLFIDPd~--~qi~-~a~~~Gad~VElhT 152 (234) T cd00003 108 QAEKLKPIIERLKDAGIRVSLFIDPDP--EQIE-AAKEVGADRVELHT 152 (234) T ss_pred HHHHHHHHHHHHHHCCCEEEEEECCCH--HHHH-HHHHHCCCEEEEEC T ss_conf 788999999999865982799727987--8999-99984939999824 No 132 >PRK04781 histidinol-phosphate aminotransferase; Provisional Probab=38.19 E-value=31 Score=15.51 Aligned_cols=47 Identities=9% Similarity=0.123 Sum_probs=19.7 Q ss_pred HHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHH-H--HCCCCEEEEEC Q ss_conf 28999999970794417887058998353356802456-6--50257899967 Q gi|254780717|r 31 KPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRISEAM-M--LENADIIFWLG 80 (294) Q Consensus 31 ~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~~-~--l~~Adlii~~G 80 (294) +..|....++.|-. +. -+|-..++..|++-...+. + -.+++++|.-. T Consensus 109 F~~Y~~~a~~~g~~--~~-~vpl~~~~~~f~~d~~~~~~~~~~~~~klv~l~n 158 (362) T PRK04781 109 FGMYAVCARLQNAP--LV-EVPLVDGADGFHADVPAIVAAALASNAKLVFLCS 158 (362) T ss_pred HHHHHHHHHHCCCE--EE-EEECCCCCCCCCCCHHHHHHHHHCCCCCEEEECC T ss_conf 68999999983987--99-9957555124677999999885267988899868 No 133 >COG4992 ArgD Ornithine/acetylornithine aminotransferase [Amino acid transport and metabolism] Probab=38.00 E-value=31 Score=15.49 Aligned_cols=65 Identities=20% Similarity=0.358 Sum_probs=35.1 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHCCC-CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCHHH Q ss_conf 7765555-67899999999742186-56588998999986312223589998503002781462062157 Q gi|254780717|r 131 LWLNPLN-VQYIAHVIAMELIKKDP-RNKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKKIIVFHEAYR 198 (294) Q Consensus 131 ~Wl~p~n-~~~~a~~I~~~L~~~dP-~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~~i~~H~a~~ 198 (294) +|+.... .......+++.|++.-| ..+-+|-+-.....+---++..++.. +.+++.+|+++++|- T Consensus 74 l~h~sn~~~~~~~~~la~~L~~~s~~~d~vff~NSGaEA~EaAiKlARk~~~---~~~k~~Iia~~nsFH 140 (404) T COG4992 74 LWHVSNLFYNEPQAELAEKLVELSPFADRVFFCNSGAEANEAALKLARKYTG---DPEKSKIIAFENSFH 140 (404) T ss_pred HHHCCCCCCCHHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHCC---CCCCCEEEEECCCCC T ss_conf 6211313378699999999996286666899757838999999999999708---888757999747757 No 134 >cd06312 PBP1_ABC_sugar_binding_like_4 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=37.97 E-value=31 Score=15.49 Aligned_cols=56 Identities=7% Similarity=-0.078 Sum_probs=27.1 Q ss_pred CEEEEEECCCCCCCCCCCCHHHHHH--HCCCCEEEEECCCCCHHHHHHHHHCC-CCCCEEEEE Q ss_conf 4178870589983533568024566--50257899967661035689997314-557679975 Q gi|254780717|r 45 GTPALLVKGASSPHEYSLRISEAMM--LENADIIFWLGAEMESFLVKPLHSLN-KQSNVVTLS 104 (294) Q Consensus 45 ~~V~~L~~~g~dpH~y~~tp~d~~~--l~~Adlii~~G~~~E~~~~~~~~~~~-~~~~~i~~~ 104 (294) +++..+.+.+.|+..- .++++. -++.|.|+....+-+. +...++.+. ...+++.+. T Consensus 31 v~v~~~~~~~~d~~~Q---~~~i~~~i~~~vDaIii~p~~~~~-~~~~i~~a~~agIpVv~~d 89 (271) T cd06312 31 VDVEYRGPETFDVADM---ARLIEAAIAAKPDGIVVTIPDPDA-LDPAIKRAVAAGIPVISFN 89 (271) T ss_pred CEEEEEECCCCCHHHH---HHHHHHHHHCCCCEEEEECCCCCH-HHHHHHHHHHCCCEEEEEE T ss_conf 9899996898999999---999999997599989993788300-2699999996598699994 No 135 >PRK11449 putative deoxyribonuclease YjjV; Provisional Probab=37.95 E-value=31 Score=15.48 Aligned_cols=18 Identities=6% Similarity=0.134 Sum_probs=10.8 Q ss_pred CEEEEEECCCCCCCCCCC Q ss_conf 417887058998353356 Q gi|254780717|r 45 GTPALLVKGASSPHEYSL 62 (294) Q Consensus 45 ~~V~~L~~~g~dpH~y~~ 62 (294) ..|..++-+|.||.+|+- T Consensus 31 ~gv~~i~~~g~~~~~~~~ 48 (258) T PRK11449 31 AGVGKIIVPATEAANFAR 48 (258) T ss_pred CCCCEEEEECCCHHHHHH T ss_conf 299999993499999999 No 136 >PRK00116 ruvA Holliday junction DNA helicase RuvA; Reviewed Probab=37.88 E-value=23 Score=16.35 Aligned_cols=11 Identities=18% Similarity=0.006 Sum_probs=5.2 Q ss_pred EECCCCCHHHH Q ss_conf 96766103568 Q gi|254780717|r 78 WLGAEMESFLV 88 (294) Q Consensus 78 ~~G~~~E~~~~ 88 (294) .+|-|++-++. T Consensus 21 v~GvGY~v~v~ 31 (198) T PRK00116 21 VNGVGYEVQVP 31 (198) T ss_pred ECCEEEEEEEC T ss_conf 79988999916 No 137 >PRK07810 O-succinylhomoserine sulfhydrylase; Provisional Probab=37.51 E-value=32 Score=15.44 Aligned_cols=83 Identities=13% Similarity=0.083 Sum_probs=37.1 Q ss_pred HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH---CCCCCCCE-----EEECCCHHHHHHHHCCC--CEECCCCCCCCCC Q ss_conf 42186565889989999863122235899985---03002781-----46206215778987498--1200346775444 Q gi|254780717|r 150 IKKDPRNKIIYEKNEEEFKNQLSQLDKELHSI---LQPVEKKK-----IIVFHEAYRYFASHYNL--SIVTFPMSHSVFM 219 (294) Q Consensus 150 ~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~---~~~~~~~~-----~i~~H~a~~Yf~~~~gl--~~~~~~~~~~~~~ 219 (294) .-+.|.++-......+.+.-++++..+--.+. +....... -+..|+.+..+.+.+.- -+.+++...+... T Consensus 252 ~~lsP~~a~l~lrGL~TL~lRm~r~~~nA~~vA~~L~~hp~V~~V~yPgL~shp~h~la~r~~~g~G~~~sf~l~~~~~~ 331 (406) T PRK07810 252 PALSAFNAWTLLKGLETLAIRVQHSNASALRIAEFLEGHPAVRWVRYPYLPSHPQYDLAKRQMSGGGTVVTFELDAPEDA 331 (406) T ss_pred CCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCHHHHHHHCCCCCCCEEEEEECCCHH T ss_conf 98987888998706897999999999999999999973998137980578778177899844789986328998036178 Q ss_pred CHHHHHHHHHHHH Q ss_conf 9788999999997 Q gi|254780717|r 220 GAASLRNIRSKII 232 (294) Q Consensus 220 s~~~l~~~~~~ik 232 (294) +.+..-++++.++ T Consensus 332 ~~~~~~~~~~~l~ 344 (406) T PRK07810 332 AKQRAFEVLDKLR 344 (406) T ss_pred HHHHHHHHHHHCC T ss_conf 8999999998287 No 138 >PRK09423 gldA glycerol dehydrogenase; Provisional Probab=37.36 E-value=32 Score=15.43 Aligned_cols=40 Identities=3% Similarity=0.038 Sum_probs=26.2 Q ss_pred CHHHHHHHHHHHHHCCCCEEEEECCCC---HHHHHHHHHHHCC Q ss_conf 978899999999754993899818889---7899999997199 Q gi|254780717|r 220 GAASLRNIRSKIISDKISCLFYGPEFD---SKIIRSITNDTGV 259 (294) Q Consensus 220 s~~~l~~~~~~ik~~~v~~if~e~~~~---~~~~~~ia~e~~~ 259 (294) ....+.++.+..++-++++=+.|-+.. ...++.+++..-. T Consensus 289 ~~~~i~~~~~l~~~lglP~~L~elGv~e~~~e~l~~iAe~A~~ 331 (366) T PRK09423 289 PKEEIEEVIDFCHAVGLPTTLADLGLKEDSDEELRKVAEAACA 331 (366) T ss_pred CHHHHHHHHHHHHHCCCCCCHHHCCCCCCCHHHHHHHHHHHHC T ss_conf 3999999999999879999889959999988999999999824 No 139 >pfam03740 PdxJ Pyridoxal phosphate biosynthesis protein PdxJ. Members of this family belong to the PdxJ family that catalyses the condensation of 1-deoxy-d-xylulose-5-phosphate (DXP) and 1-amino-3-oxo-4-(phosphohydroxy)propan-2-one to form pyridoxine 5'-phosphate (PNP). This reaction is involved in de novo synthesis of pyridoxine (vitamin B6) and pyridoxal phosphate. Probab=36.86 E-value=33 Score=15.38 Aligned_cols=44 Identities=18% Similarity=0.221 Sum_probs=34.3 Q ss_pred CHHHHHHHHHHHHHCCCCE-EEEECCCCHHHHHHHHHHHCCCEEEECC Q ss_conf 9788999999997549938-9981888978999999971990899805 Q gi|254780717|r 220 GAASLRNIRSKIISDKISC-LFYGPEFDSKIIRSITNDTGVMSAILDP 266 (294) Q Consensus 220 s~~~l~~~~~~ik~~~v~~-if~e~~~~~~~~~~ia~e~~~~~~~ld~ 266 (294) ....|..+++.+++.++++ +|.||.. ..++ .++++|+..++|+. T Consensus 109 ~~~~L~~~i~~lk~~girvSlFIDpd~--~~i~-~a~~~Gad~VElhT 153 (239) T pfam03740 109 QLEKLKPAIRRLKNAGIRVSLFIDPDP--EQIE-AAKIVGADRIELHT 153 (239) T ss_pred CHHHHHHHHHHHHCCCCEEEEEECCCH--HHHH-HHHHCCCCEEEEEC T ss_conf 068999999998607853899707998--9999-99980929998504 No 140 >pfam04056 Ssl1 Ssl1-like. Ssl1-like proteins are 40kDa subunits of the Transcription factor II H complex. Probab=36.38 E-value=33 Score=15.33 Aligned_cols=45 Identities=18% Similarity=0.256 Sum_probs=35.2 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCE-EEEC Q ss_conf 78899999999754993899818889789999999719908-9980 Q gi|254780717|r 221 AASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMS-AILD 265 (294) Q Consensus 221 ~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~-~~ld 265 (294) |.++.+.++.+++++|+|=+..-...-...+.|+++||... +.+| T Consensus 173 PgdI~~tI~~l~~~~IrvsvI~LaaEv~Ick~l~~~T~G~y~V~ld 218 (250) T pfam04056 173 PGDIYSTIDTLKKEKIRCSVIGLSAEVFICKELCKATNGTYSVALD 218 (250) T ss_pred CCCHHHHHHHHHHCCCEEEEEEECHHHHHHHHHHHHHCCEEEEECC T ss_conf 8659999999997590799987338999999999974998887569 No 141 >TIGR01163 rpe ribulose-phosphate 3-epimerase; InterPro: IPR000056 Ribulose-phosphate 3-epimerase (5.1.3.1 from EC) (also known as pentose-5-phosphate 3-epimerase or PPE) is the enzyme that converts D-ribulose 5-phosphate into D-xylulose 5-phosphate in Calvin's reductive pentose phosphate cycle. In Alcaligenes eutrophus two copies of the gene coding for PPE are known , one is chromosomally encoded P40117 from SWISSPROT, the other one is on a plasmid Q04539 from SWISSPROT. PPE has been found in a wide range of bacteria, archaebacteria, fungi and plants. All the proteins have from 209 to 241 amino acid residues. The enzyme has a TIM barrel structure.; GO: 0004750 ribulose-phosphate 3-epimerase activity, 0005975 carbohydrate metabolic process. Probab=36.37 E-value=33 Score=15.33 Aligned_cols=139 Identities=16% Similarity=0.163 Sum_probs=76.8 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEE-----CCCHHHHHHHHCCCC Q ss_conf 655556789999999974218656588998999986312223589998503002781462-----062157789874981 Q gi|254780717|r 133 LNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKKIIV-----FHEAYRYFASHYNLS 207 (294) Q Consensus 133 l~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~~i~-----~H~a~~Yf~~~~gl~ 207 (294) +|-..|+.=.+...+.+.+.=++.=..--++.. -+..+-..+|+ ..-+.-++ .=++..|+....++- T Consensus 61 ~DVHLMv~~pd~~~~~Fa~aGA~~I~vH~Ea~~----h~~R~l~~Ik~----~G~~AG~v~NP~TPl~~~~~~L~~~D~V 132 (216) T TIGR01163 61 IDVHLMVENPDRYIEDFAEAGADIITVHAEATE----HIHRLLQLIKE----LGAKAGIVLNPATPLEALEYVLEDVDLV 132 (216) T ss_pred EEEEECCCCHHHHHHHHHHHCCCEEEEECCCCC----CHHHHHHHHHH----CCCCEEEEECCCCCHHHHHHHHHHCCEE T ss_conf 266303578577788999708998998437762----67999999997----1897068867999987899898762989 Q ss_pred EE-C----CCCCCCCCCCHHHHHHHHHHHH--HCCCCE-EEEECCCCHHHHHHHHHHHCCCEEEECCCCCCCCCCCC--C Q ss_conf 20-0----3467754449788999999997--549938-99818889789999999719908998053367877816--7 Q gi|254780717|r 208 IV-T----FPMSHSVFMGAASLRNIRSKII--SDKISC-LFYGPEFDSKIIRSITNDTGVMSAILDPEGMLIAEGPE--L 277 (294) Q Consensus 208 ~~-~----~~~~~~~~~s~~~l~~~~~~ik--~~~v~~-if~e~~~~~~~~~~ia~e~~~~~~~ld~~~~~~~~~~~--~ 277 (294) .+ + ++-..=.+-+...++++++.++ +.+-+| |.++-+.+.+.+..+ ++.|+.+.+- |+..- +.+ . T Consensus 133 LlMSVnPGFgGQkFIP~~~~Kir~~R~~id~~~~~~~~~ieVDGGv~~~ni~~~-~~AGAD~~Va---GSaiF-~~~s~d 207 (216) T TIGR01163 133 LLMSVNPGFGGQKFIPETLEKIRELRKMIDKLELGLSILIEVDGGVNEDNIAEV-AEAGADILVA---GSAIF-GADSLD 207 (216) T ss_pred EEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHH-HHCCCCEEEE---EEEEE-CCCCCC T ss_conf 988760799884110578999999999998602799558997179897679999-9758989998---31020-888668 Q ss_pred HHHHHHH Q ss_conf 8999999 Q gi|254780717|r 278 YFQLMRS 284 (294) Q Consensus 278 Y~~~m~~ 284 (294) |.+-++. T Consensus 208 ~~~~i~~ 214 (216) T TIGR01163 208 YKEAIRS 214 (216) T ss_pred HHHHHHH T ss_conf 7999973 No 142 >PRK06176 cystathionine gamma-synthase/cystathionine beta-lyase; Validated Probab=36.17 E-value=33 Score=15.31 Aligned_cols=128 Identities=13% Similarity=0.195 Sum_probs=58.3 Q ss_pred HCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH---CCCCCCC-----EEEECCCHHHHHHHHCC-C-CEECCCCCCCCCCC Q ss_conf 2186565889989999863122235899985---0300278-----14620621577898749-8-12003467754449 Q gi|254780717|r 151 KKDPRNKIIYEKNEEEFKNQLSQLDKELHSI---LQPVEKK-----KIIVFHEAYRYFASHYN-L-SIVTFPMSHSVFMG 220 (294) Q Consensus 151 ~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~---~~~~~~~-----~~i~~H~a~~Yf~~~~g-l-~~~~~~~~~~~~~s 220 (294) .+.|..+-......+.+.-+++...+--... +...... +-+..||.+..+.+.+. . -+.+++... T Consensus 229 ~l~p~~a~ll~rGL~TL~lR~~~~~~nA~~vA~~L~~hp~V~~V~yPgL~s~p~~~la~~~~~g~gg~~sf~l~~----- 303 (379) T PRK06176 229 VLGPQDSWLLQRGIKTLGLRMEAHQKNALCVAEFLEKHPKVEKVYYPGLPTHPNHELAKKQMRGFSGMLSFTLKN----- 303 (379) T ss_pred CCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCHHHHHHHCCCCCCCEEEEECC----- T ss_conf 778467899972225289999999999999999987199826798989999943999997678999854788568----- Q ss_pred HHHHHHHHHHHHHC-------CCCEEEEECCC--CHHHHHHHHHHHCCCE--EEECCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 78899999999754-------99389981888--9789999999719908--9980533678778167899999999999 Q gi|254780717|r 221 AASLRNIRSKIISD-------KISCLFYGPEF--DSKIIRSITNDTGVMS--AILDPEGMLIAEGPELYFQLMRSMSNSI 289 (294) Q Consensus 221 ~~~l~~~~~~ik~~-------~v~~if~e~~~--~~~~~~~ia~e~~~~~--~~ld~~~~~~~~~~~~Y~~~m~~n~~~l 289 (294) .....++.+.+|=- ++..++.-|.. .....+.-.++.|++. +.| ++|. +.-.+++...-++| T Consensus 304 ~~~a~~f~~~l~lf~~a~slGg~eSLi~~P~~~th~~~~~e~r~~~Gi~~~liRl-SvGl------Ed~~DLi~Dl~~AL 376 (379) T PRK06176 304 DSEAVAFVESLKLFILGESLGGVESLVGVPAFMTHACIPKEQREAAGIRDGLVRL-SVGI------EHEQDLLEDLEQAF 376 (379) T ss_pred HHHHHHHHHHCCCCEEECCCCCCCCEEECCCCCCCCCCCHHHHHHCCCCCCEEEE-EECC------CCHHHHHHHHHHHH T ss_conf 9999999981882546216688760424763225544899999866989394999-9511------99999999999998 Q ss_pred H Q ss_conf 8 Q gi|254780717|r 290 A 290 (294) Q Consensus 290 ~ 290 (294) + T Consensus 377 ~ 377 (379) T PRK06176 377 A 377 (379) T ss_pred H T ss_conf 3 No 143 >PRK06683 hypothetical protein; Provisional Probab=36.12 E-value=33 Score=15.30 Aligned_cols=45 Identities=4% Similarity=0.046 Sum_probs=33.2 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCHHHHHH---HHHHHCCCEEEECCCC Q ss_conf 99999999754993899818889789999---9997199089980533 Q gi|254780717|r 224 LRNIRSKIISDKISCLFYGPEFDSKIIRS---ITNDTGVMSAILDPEG 268 (294) Q Consensus 224 l~~~~~~ik~~~v~~if~e~~~~~~~~~~---ia~e~~~~~~~ld~~~ 268 (294) .++..+-+++.+++.+|.-.-.+++..+. ++++.|+++...|+|- T Consensus 16 tKQTlKAlk~g~v~~v~iA~DAD~~v~~~v~~~a~~~~ipV~~VdSmk 63 (82) T PRK06683 16 HKRTLEAIKNGIVKEVVIAEDADMRLTHVIIRTALQHNIPITKVESVR 63 (82) T ss_pred HHHHHHHHHCCCEEEEEEEECCCHHHHHHHHHHHHHHCCCEEEEECHH T ss_conf 598899998498269999725868889999999998299879971298 No 144 >pfam09171 DUF1886 Domain of unknown function (DUF1886). This domain is predominantly found in the Archaeal protein N-glycosylase/DNA lyase. Probab=35.85 E-value=34 Score=15.28 Aligned_cols=123 Identities=12% Similarity=0.038 Sum_probs=65.7 Q ss_pred HHHHHHHHHHHHHHHHCCCC-CCCEEEECCCHHHHHHHHCCCCEECCCCCCCCCCCHHH--HHHHHHHHHHCCCC--EEE Q ss_conf 98631222358999850300-27814620621577898749812003467754449788--99999999754993--899 Q gi|254780717|r 166 EFKNQLSQLDKELHSILQPV-EKKKIIVFHEAYRYFASHYNLSIVTFPMSHSVFMGAAS--LRNIRSKIISDKIS--CLF 240 (294) Q Consensus 166 ~~~~~L~~l~~~~~~~~~~~-~~~~~i~~H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~--l~~~~~~ik~~~v~--~if 240 (294) .|-..|..+-..+...+..- .++.+|-.---|+|.++.-+-+. ...|..=|-|-| ++.+...+-.-+.+ ... T Consensus 117 ~yy~~m~~l~~~La~~l~~k~~~KTiVFAvKM~gYa~r~~~~~~---~~~p~eIpIPvD~Ri~~~T~~~~~~~~~~~~~~ 193 (246) T pfam09171 117 LYYENLEELLEALAKILGAKPESKTVVFAVKMFGYAYRAAFGVF---RPYPMEIPIPVDYRIAKLTKKSGLVDAPPEEAV 193 (246) T ss_pred HHHCCHHHHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHHHCCCC---CCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHH T ss_conf 65418999999999996899875635445998999999956877---789878999623999999998732358736654 Q ss_pred EECCCCHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 8188897899999997199089980533678778167899999999999874 Q gi|254780717|r 241 YGPEFDSKIIRSITNDTGVMSAILDPEGMLIAEGPELYFQLMRSMSNSIAKN 292 (294) Q Consensus 241 ~e~~~~~~~~~~ia~e~~~~~~~ld~~~~~~~~~~~~Y~~~m~~n~~~l~k~ 292 (294) .-.....+....+++++|++...||++-|.. .+.-.|.+.+...-+.+... T Consensus 194 ~~~e~~~~~W~~Va~~~gIPplhiDsilW~~-g~~~~~~~~~~~~~~~~~~~ 244 (246) T pfam09171 194 ARYEEVQEFWNRVARESGIPPLHLDSILWLL-GGAIDVGENIKELREKLIGL 244 (246) T ss_pred HHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH-CCCCHHHHHHHHHHHHHHHH T ss_conf 2027799999998735599841225789871-66403523156653999987 No 145 >PRK03202 6-phosphofructokinase; Provisional Probab=35.34 E-value=34 Score=15.22 Aligned_cols=67 Identities=13% Similarity=0.146 Sum_probs=34.1 Q ss_pred HHHHHHCCCCEEC-CCCCCCCCCCHHHHHHHHHHHH-HCCCCEE-EEECCCC----HHHHHHHHHHHCCCEEEE Q ss_conf 7789874981200-3467754449788999999997-5499389-9818889----789999999719908998 Q gi|254780717|r 198 RYFASHYNLSIVT-FPMSHSVFMGAASLRNIRSKII-SDKISCL-FYGPEFD----SKIIRSITNDTGVMSAIL 264 (294) Q Consensus 198 ~Yf~~~~gl~~~~-~~~~~~~~~s~~~l~~~~~~ik-~~~v~~i-f~e~~~~----~~~~~~ia~e~~~~~~~l 264 (294) +|++-.-++-..+ +-.-||.+.+..++.+.++... +.+-.|+ ++.+... ..+.+.|.++++.+.... T Consensus 176 G~lAl~~~lA~gad~iliPE~~~~~~~~~~~i~~~~~~~k~~~iivvaEG~~~~~g~~la~~i~~~~~~~~R~~ 249 (323) T PRK03202 176 GDLALAAGLAGGAELILIPEVPFDIEEFLAKVKAGIKKGKKHCIVVVAEGIKDADGKELAKEIEEETGLETRVT 249 (323) T ss_pred CHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHHHCCCEEEE T ss_conf 68999987632898999249999999999999999976998289999699887789999999999839837980 No 146 >cd06316 PBP1_ABC_sugar_binding_like_7 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=35.08 E-value=35 Score=15.20 Aligned_cols=34 Identities=15% Similarity=0.200 Sum_probs=17.4 Q ss_pred CCCCEEEEECCCCCHHHHHHHHHCC-CCCCEEEEEE Q ss_conf 0257899967661035689997314-5576799751 Q gi|254780717|r 71 ENADIIFWLGAEMESFLVKPLHSLN-KQSNVVTLSH 105 (294) Q Consensus 71 ~~Adlii~~G~~~E~~~~~~~~~~~-~~~~~i~~~~ 105 (294) ++.|.|+..-.+.+.. ...++.+. ...+++.+.. T Consensus 55 ~gvDaIii~p~d~~a~-~~~v~~A~~aGIpVv~~d~ 89 (294) T cd06316 55 QKPDIIISIPVDPVST-AAAYKKVAEAGIKLVFMDN 89 (294) T ss_pred CCCCEEEEECCCHHHH-HHHHHHHHHCCCCEEEECC T ss_conf 5999999938886787-9999999981996798326 No 147 >PRK10528 multifunctional acyl-CoA thioesterase I and protease I and lysophospholipase L1; Provisional Probab=35.02 E-value=35 Score=15.19 Aligned_cols=72 Identities=11% Similarity=0.115 Sum_probs=47.9 Q ss_pred CHHHHHHHHHHHHHCCCCEEEEECCCCH-----------HHHHHHHHHHCCCEEEE-------CCC---CCCCCCCCCCH Q ss_conf 9788999999997549938998188897-----------89999999719908998-------053---36787781678 Q gi|254780717|r 220 GAASLRNIRSKIISDKISCLFYGPEFDS-----------KIIRSITNDTGVMSAIL-------DPE---GMLIAEGPELY 278 (294) Q Consensus 220 s~~~l~~~~~~ik~~~v~~if~e~~~~~-----------~~~~~ia~e~~~~~~~l-------d~~---~~~~~~~~~~Y 278 (294) ..+.|.++++.+++.++++++.+...++ .....+|+|.++..+.. +|- .....+..++| T Consensus 93 ~~~NL~~ii~~~~~~g~~vlL~g~~~p~n~g~~y~~~f~~iy~~lA~e~~v~l~pf~l~~v~~~~~~~q~DgiHPN~~G~ 172 (191) T PRK10528 93 TEQTLRQIIQDVKAANAQPLLMQIRLPANYGRRYNEAFSAIYPKLAKEFDIPLLPFFMEEVYLKPQWMQDDGIHPNRDAQ 172 (191) T ss_pred HHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEHHHHHHHCCCHHHCCCCCCCCCHHHH T ss_conf 99999999999998899789970656755126678889999999999809974137887563597552888899598899 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9999999999987 Q gi|254780717|r 279 FQLMRSMSNSIAK 291 (294) Q Consensus 279 ~~~m~~n~~~l~k 291 (294) --|.......|+. T Consensus 173 ~~ia~~v~~~L~~ 185 (191) T PRK10528 173 PFIADWMAKQLQP 185 (191) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999999 No 148 >pfam09403 FadA Adhesion protein FadA. FadA (Fusobacterium adhesin A) is an adhesin which forms two alpha helices. Probab=34.91 E-value=28 Score=15.83 Aligned_cols=16 Identities=13% Similarity=0.422 Sum_probs=7.1 Q ss_pred EEECCCHHHHHHHHCC Q ss_conf 4620621577898749 Q gi|254780717|r 190 IIVFHEAYRYFASHYN 205 (294) Q Consensus 190 ~i~~H~a~~Yf~~~~g 205 (294) +--|-|-|.=+.+.|+ T Consensus 84 ~rfykdeY~~LlKkY~ 99 (126) T pfam09403 84 TRFHKDEYKKLLKRYD 99 (126) T ss_pred CCHHHHHHHHHHHHHH T ss_conf 0048988999999999 No 149 >COG4314 NosL Predicted lipoprotein involved in nitrous oxide reduction [Energy production and conversion] Probab=34.75 E-value=28 Score=15.77 Aligned_cols=22 Identities=23% Similarity=0.350 Sum_probs=18.1 Q ss_pred CHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9158999999999960013238 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARAG 22 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~~ 22 (294) |||.|+..++.+.++++|..+. T Consensus 1 mkr~Lla~la~~~llAgC~~~e 22 (176) T COG4314 1 MKRTLLAILAVTALLAGCRQAE 22 (176) T ss_pred CCHHHHHHHHHHHHHHHCCHHH T ss_conf 9403799999999987534311 No 150 >PRK10641 btuB vitamin B12/cobalamin outer membrane transporter; Provisional Probab=34.68 E-value=35 Score=15.16 Aligned_cols=22 Identities=14% Similarity=0.120 Sum_probs=16.3 Q ss_pred CHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9158999999999960013238 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARAG 22 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~~ 22 (294) |||+.++.+++..++++.|+.. T Consensus 2 ~~~~~~~~~~~~~~~~~~a~~~ 23 (615) T PRK10641 2 IKKASLLTALSVTAFSAWAQDT 23 (615) T ss_pred CHHHHHHHHHHHHCCHHHHCCC T ss_conf 4489999999875133543358 No 151 >PRK06746 peptide chain release factor 2; Provisional Probab=34.63 E-value=35 Score=15.15 Aligned_cols=19 Identities=11% Similarity=0.205 Sum_probs=11.5 Q ss_pred CHHHHHHHHHHHHHHHHHH Q ss_conf 7567765555678999999 Q gi|254780717|r 128 DLHLWLNPLNVQYIAHVIA 146 (294) Q Consensus 128 dpH~Wl~p~n~~~~a~~I~ 146 (294) +|-+|-||.++....+.++ T Consensus 3 ~p~fWdD~~~A~~i~kE~s 21 (326) T PRK06746 3 GAGFWDDQQGAQAVINEAN 21 (326) T ss_pred CCCCCCCHHHHHHHHHHHH T ss_conf 9777869999999999999 No 152 >PRK08134 O-acetylhomoserine aminocarboxypropyltransferase; Validated Probab=34.57 E-value=35 Score=15.15 Aligned_cols=131 Identities=11% Similarity=0.079 Sum_probs=54.2 Q ss_pred HCCCCCHHHHHHHHHHHHHHHHHHHHHHH---HHCCCCCCCE-----EEECCCHHHHHHHHC--CCC-EECCCCCCCCCC Q ss_conf 21865658899899998631222358999---8503002781-----462062157789874--981-200346775444 Q gi|254780717|r 151 KKDPRNKIIYEKNEEEFKNQLSQLDKELH---SILQPVEKKK-----IIVFHEAYRYFASHY--NLS-IVTFPMSHSVFM 219 (294) Q Consensus 151 ~~dP~~~~~y~~N~~~~~~~L~~l~~~~~---~~~~~~~~~~-----~i~~H~a~~Yf~~~~--gl~-~~~~~~~~~~~~ 219 (294) -+.|.++-......+.+.-+++.-.+--. +.+....... -+..|+.+..+.+.+ |.- +.+++...+ T Consensus 276 ~lsp~da~L~lRGL~TL~lRm~r~~~nA~~lA~~L~~hp~V~~V~yPgL~s~p~h~la~r~~~~G~Gg~~sf~l~g~--- 352 (433) T PRK08134 276 CLSPMNAWQLLQGIETLPLRMERHVANTRAVVEFLASHPAVERVAHPELESHPDHALAKRLLPRGAGSVFSFDLKGG--- 352 (433) T ss_pred CCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCHHHHHHHCCCCCEEEEEEEEECC--- T ss_conf 66836779987367768999999998799999999858996789898989980299999878999658999999699--- Q ss_pred CHHHHHHHHHHHHHCC-------CCEEEEECCCC--HHHHHHHHHHHCCC--EEEECCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 9788999999997549-------93899818889--78999999971990--8998053367877816789999999999 Q gi|254780717|r 220 GAASLRNIRSKIISDK-------ISCLFYGPEFD--SKIIRSITNDTGVM--SAILDPEGMLIAEGPELYFQLMRSMSNS 288 (294) Q Consensus 220 s~~~l~~~~~~ik~~~-------v~~if~e~~~~--~~~~~~ia~e~~~~--~~~ld~~~~~~~~~~~~Y~~~m~~n~~~ 288 (294) .....++.+.++=-. +..++.-|... .+.-..-.++.|+. .+.| +.|. |...+++...-++ T Consensus 353 -~~~~~~f~~~l~lf~~a~SlGg~eSLi~~Pa~~th~~~~~e~~~~~Gi~~~lvRl-SvGl------Ed~eDLi~DL~qA 424 (433) T PRK08134 353 -RAAGRKFIESLKLFSHLANVGDARSLVIHPASTTHFRMDAAALAAAGIGEGTIRL-SIGL------EDADDLIDDLKRA 424 (433) T ss_pred -HHHHHHHHHHCCCCCCCCCCCCCCCEEECCCCCCCCCCCHHHHHHCCCCCCEEEE-EECC------CCHHHHHHHHHHH T ss_conf -9999999983881301226687530346763234635899999866989394999-9411------9999999999999 Q ss_pred HHHH Q ss_conf 9874 Q gi|254780717|r 289 IAKN 292 (294) Q Consensus 289 l~k~ 292 (294) |+.+ T Consensus 425 L~aa 428 (433) T PRK08134 425 LKAA 428 (433) T ss_pred HHHH T ss_conf 9998 No 153 >PRK09859 multidrug efflux system protein MdtE; Provisional Probab=34.34 E-value=35 Score=15.18 Aligned_cols=68 Identities=19% Similarity=0.182 Sum_probs=30.5 Q ss_pred HHHHHHHHHHHHHHCCCCCCCEEEEEECHHHHHHHHHHHCCCCEEEEEECCCC-CCCCCCCCHHHHHHHCCCCEEEEECC Q ss_conf 58999999999960013238805999722899999997079441788705899-83533568024566502578999676 Q gi|254780717|r 3 NFLIILIFLFFILSSVARAGSLQVVVSIKPIHSIVSCIMQGIGTPALLVKGAS-SPHEYSLRISEAMMLENADIIFWLGA 81 (294) Q Consensus 3 k~l~~~~~~~~~l~~~~~~~~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~-dpH~y~~tp~d~~~l~~Adlii~~G~ 81 (294) ++++.++++.++|++|+......-. ...| .+.|..+.+... ..+.| |..+.....++|-+-.+. T Consensus 6 ~~li~ll~~~~lL~gC~~~~~~~~~-~~~p-----------~V~v~~v~~~~~~~~~~~---~G~v~~~~~~~l~~~VsG 70 (385) T PRK09859 6 KLLIPLLFCGAMLTACDDKSAENAA-AMTP-----------EVGVVTLSPGSVNVLSEL---PGRTVPYEVAEIRPQVGG 70 (385) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCC-CCCC-----------CEEEEEEEEEEEEEEEEE---EEEEEEEEEEEEEEECCE T ss_conf 7899999999999537999768789-9998-----------148999867510158998---799996289999814562 Q ss_pred CCCH Q ss_conf 6103 Q gi|254780717|r 82 EMES 85 (294) Q Consensus 82 ~~E~ 85 (294) -.+. T Consensus 71 ~v~~ 74 (385) T PRK09859 71 IIIK 74 (385) T ss_pred EEEE T ss_conf 9999 No 154 >PRK13602 putative ribosomal protein L7Ae-like; Provisional Probab=34.24 E-value=36 Score=15.11 Aligned_cols=45 Identities=9% Similarity=0.211 Sum_probs=33.0 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCHHHHH---HHHHHHCCCEEEECCCC Q ss_conf 9999999975499389981888978999---99997199089980533 Q gi|254780717|r 224 LRNIRSKIISDKISCLFYGPEFDSKIIR---SITNDTGVMSAILDPEG 268 (294) Q Consensus 224 l~~~~~~ik~~~v~~if~e~~~~~~~~~---~ia~e~~~~~~~ld~~~ 268 (294) ..+..+-+++.+++++|.-.-.+++.++ .++++-|+++...|+|- T Consensus 16 ~KQT~KAi~~g~~~~V~IA~Dad~~v~~~v~~l~~ek~Ipv~~V~Smk 63 (82) T PRK13602 16 TKQTVKALKRGSVKEVVVAKDADPCLTSGVINLANEQGVPVSMVESMK 63 (82) T ss_pred HHHHHHHHHCCCEEEEEEEECCCHHHHHHHHHHHHHCCCCEEEECCHH T ss_conf 399899997698129999825988899999999997299879962899 No 155 >TIGR02622 CDP_4_6_dhtase CDP-glucose 4,6-dehydratase; InterPro: IPR013445 This entry contains CDP-glucose 4,6-dehydratases from a variety of Gram-negative and Gram-positive bacteria. Members are typically encoded next to a gene that encodes a glucose-1-phosphate cytidylyltransferase, which produces the substrate CDP-D-glucose. This is used by the proteins in this entry to produce CDP-4-keto-6-deoxyglucose.. Probab=34.08 E-value=36 Score=15.10 Aligned_cols=47 Identities=17% Similarity=0.258 Sum_probs=30.6 Q ss_pred CCEEEECCCHHH-----HHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 781462062157-----78987498120034677544497889999999975 Q gi|254780717|r 187 KKKIIVFHEAYR-----YFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIIS 233 (294) Q Consensus 187 ~~~~i~~H~a~~-----Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~ 233 (294) +|.|||.|=.|- +|...+|=+|.|....|.+.|+.=++.++.+.++. T Consensus 5 KkVl~TGHTGFKGSWL~lWL~~lGA~V~GYSL~P~t~PnlFe~l~l~~~~~~ 56 (361) T TIGR02622 5 KKVLITGHTGFKGSWLSLWLLELGAEVAGYSLDPPTSPNLFELLNLAKKIKD 56 (361) T ss_pred CEEEEECCCCCCHHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHC T ss_conf 6789845786425589999984796798971688788405557525424323 No 156 >cd03821 GT1_Bme6_like This family is most closely related to the GT1 family of glycosyltransferases. Bme6 in Brucella melitensis has been shown to be involved in the biosynthesis of a polysaccharide. Probab=34.04 E-value=36 Score=15.09 Aligned_cols=10 Identities=40% Similarity=0.584 Sum_probs=5.9 Q ss_pred CCCCEEEEEC Q ss_conf 0257899967 Q gi|254780717|r 71 ENADIIFWLG 80 (294) Q Consensus 71 ~~Adlii~~G 80 (294) .+.|+++.+| T Consensus 86 ~~~Dvi~~~~ 95 (375) T cd03821 86 READIVHVHG 95 (375) T ss_pred CCCCEEEECC T ss_conf 8999999868 No 157 >cd01971 Nitrogenase_VnfN_like Nitrogenase_vnfN_like: VnfN subunit of the VnfEN complex-like. This group in addition to VnfN contains a subset of the beta subunit of the nitrogenase MoFe protein and NifN-like proteins. The nitrogenase enzyme system catalyzes the ATP-dependent reduction of dinitrogen to ammonia. NifEN participates in the synthesis of the iron-molybdenum cofactor (FeMoco) of MoFe protein of the molybdenum(Mo)-nitrogenase. NifB-co (an iron and sulfur containing precursor of the FeMoco) from NifB is transferred to NifEN where it is further processed to FeMoco. VnfEN may similarly be a scaffolding protien for the iron-vanadium cofactor (FeVco) of the vanadium-dependent (V)-nitrogenase. NifE and NifN are essential for the Mo-nitrogenase, VnfE and VnfN are not essential for the V-nitrogenase. NifE and NifN can substitute when the vnfEN genes are inactivated. Probab=33.88 E-value=36 Score=15.08 Aligned_cols=149 Identities=9% Similarity=0.067 Sum_probs=66.7 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEC-CCHHHH-----HHHHCCC Q ss_conf 6555567899999999742186565889989999863122235899985030027814620-621577-----8987498 Q gi|254780717|r 133 LNPLNVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKKIIVF-HEAYRY-----FASHYNL 206 (294) Q Consensus 133 l~p~n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~~i~~-H~a~~Y-----f~~~~gl 206 (294) +-..+..+++..|++.+..-.+.-.+..+.-..++...|+.....+... ..++.+.++ -+.+.+ +.+++|+ T Consensus 242 iGi~~Td~flr~l~~~~g~~~~~~~~~i~~er~~~~~~l~~~~~~~~~~---~~~~~vai~~d~~~~~gl~~~l~~eLGm 318 (427) T cd01971 242 IGAKATAEFLRQVAKFAGIEKAKVEAFIKAEEKRYYHYLERFSDFMARW---GLPRRFAVIADSTYALGLARFLVNELGW 318 (427) T ss_pred CCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH---CCCCEEEEECCHHHHHHHHHHHHHHCCC T ss_conf 5799999999999998299911457889999999999999999999873---4683699988758899999999985698 Q ss_pred CEEC--CCCCCCCC------------------CCHHHHHHHHHHHHHC----CCCEEEEECCCCHHHHHHHHHHHCCCEE Q ss_conf 1200--34677544------------------4978899999999754----9938998188897899999997199089 Q gi|254780717|r 207 SIVT--FPMSHSVF------------------MGAASLRNIRSKIISD----KISCLFYGPEFDSKIIRSITNDTGVMSA 262 (294) Q Consensus 207 ~~~~--~~~~~~~~------------------~s~~~l~~~~~~ik~~----~v~~if~e~~~~~~~~~~ia~e~~~~~~ 262 (294) .... +..++..+ .-..+...+.+.+++. ++..+|..+ .-+.+++++|++.+ T Consensus 319 ~p~~~v~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~D~~~i~~~l~~~~~~~~pdllig~s-----~~~~~a~~lgiP~i 393 (427) T cd01971 319 VPAKQVITDNPPEKYRSAIENEFEAEGVSAEVVFSEDGYAIGQSLRQSDFKYKPPIIFGSS-----WERDLAKELGGKIL 393 (427) T ss_pred CEEEEEEECCCHHHHHHHHHHHHHHCCCCCCEEECCCHHHHHHHHHHHCCCCCCCEEEECC-----HHHHHHHHHCCCEE T ss_conf 6348999279777789999976543178863366487799999998633447987899892-----58999999699879 Q ss_pred EEC-C----CCCCC-CCCCCCHHHHHHHHHHHH Q ss_conf 980-5----33678-778167899999999999 Q gi|254780717|r 263 ILD-P----EGMLI-AEGPELYFQLMRSMSNSI 289 (294) Q Consensus 263 ~ld-~----~~~~~-~~~~~~Y~~~m~~n~~~l 289 (294) ... | .+..- --|.++-..++...+++| T Consensus 394 r~gfPv~dr~~~~r~~~GY~G~l~l~~~i~N~l 426 (427) T cd01971 394 EVSFPVTNRVVLNRGYAGYRGALTLLEDIYTTV 426 (427) T ss_pred EECCCCEEECCCCCCEEEEHHHHHHHHHHHHHC T ss_conf 846886101678987066064999999999850 No 158 >cd03769 SR_IS607_transposase_like Serine Recombinase (SR) family, IS607-like transposase subfamily, catalytic domain; members contain a DNA binding domain with homology to MerR/SoxR located N-terminal to the catalytic domain. Serine recombinases catalyze site-specific recombination of DNA molecules by a concerted, four-strand cleavage and rejoining mechanism which involves a transient phosphoserine linkage between DNA and the enzyme. They are functionally versatile and include resolvases, invertases, integrases, and transposases. This subfamily is composed of proteins that catalyze the transposition of insertion sequence (IS) elements such as IS607 from Helicobacter and IS1535 from Mycobacterium, and similar proteins from other bacteria and several archaeal species. IS elements are DNA segments that move to new sites in prokaryotic and eukaryotic genomes causing insertion mutations and gene rearrangements. Probab=33.17 E-value=37 Score=15.00 Aligned_cols=72 Identities=11% Similarity=0.176 Sum_probs=47.4 Q ss_pred CCCCCCCCHHHHHHHHHHHHHCCCCEEEEECC-----CCHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 67754449788999999997549938998188-----8978999999971990899805336787781678999999999 Q gi|254780717|r 213 MSHSVFMGAASLRNIRSKIISDKISCLFYGPE-----FDSKIIRSITNDTGVMSAILDPEGMLIAEGPELYFQLMRSMSN 287 (294) Q Consensus 213 ~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~-----~~~~~~~~ia~e~~~~~~~ld~~~~~~~~~~~~Y~~~m~~n~~ 287 (294) ..++....-+.+.++.+.++++++.+|++..- +....++.+.+..|++++.++.-....+ -.+++...+. T Consensus 40 ~gSG~~~~Rp~l~~Ll~~~~~g~v~~vvV~~~DRL~Rfg~elle~~~~~~gv~iivin~~~~~~~-----~~EL~eDli~ 114 (134) T cd03769 40 IGSGLNEKRKGLLKLLEDVLAGKVERVVITYKDRLARFGFELLEELFKAYGVEIVVINQEENEEL-----EQELVEDLIE 114 (134) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCH-----HHHHHHHHHH T ss_conf 67776603589999999997389878999855577876899999999986988999979989986-----8999999999 Q ss_pred HH Q ss_conf 99 Q gi|254780717|r 288 SI 289 (294) Q Consensus 288 ~l 289 (294) .+ T Consensus 115 Ii 116 (134) T cd03769 115 II 116 (134) T ss_pred HH T ss_conf 99 No 159 >COG0161 BioA Adenosylmethionine-8-amino-7-oxononanoate aminotransferase [Coenzyme metabolism] Probab=32.73 E-value=38 Score=14.96 Aligned_cols=120 Identities=16% Similarity=0.247 Sum_probs=71.6 Q ss_pred HHHHHHHHHHCCC---CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCHH--------------HHHHHHC Q ss_conf 9999999742186---5658899899998631222358999850300278146206215--------------7789874 Q gi|254780717|r 142 AHVIAMELIKKDP---RNKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKKIIVFHEAY--------------RYFASHY 204 (294) Q Consensus 142 a~~I~~~L~~~dP---~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~~i~~H~a~--------------~Yf~~~~ 204 (294) +-.++++|.++-| -++-+|..+.+.-.+---++.-.|.......+++.|++-..+| ..+...| T Consensus 89 a~~LA~~L~~~aP~~~l~~vFf~~sGSeAvEtAlKma~qY~~~~G~p~r~~~Isr~~gYHG~T~ga~Sv~g~~~~~~~~~ 168 (449) T COG0161 89 AIELAEKLAELAPEGGLDHVFFTDSGSEAVETALKMALQYWRARGQPQRKKFISRRNGYHGDTLGAMSVGGPVALRHAFY 168 (449) T ss_pred HHHHHHHHHHHCCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCHHEECCCCHHHHHHHC T ss_conf 99999999986798775679980896189999999999999965997531899952674766530013568545554301 Q ss_pred CCC---EECCC-C----CCC-CCCC---HHHHHHHHHHHHH---CCCCEEEEECCCCH------------HHHHHHHHHH Q ss_conf 981---20034-6----775-4449---7889999999975---49938998188897------------8999999971 Q gi|254780717|r 205 NLS---IVTFP-M----SHS-VFMG---AASLRNIRSKIIS---DKISCLFYGPEFDS------------KIIRSITNDT 257 (294) Q Consensus 205 gl~---~~~~~-~----~~~-~~~s---~~~l~~~~~~ik~---~~v~~if~e~~~~~------------~~~~~ia~e~ 257 (294) +.- +.-+. . .+. .+.- .+.+.++.+.|.+ ++|.+++.||-... +.++.|+++. T Consensus 169 ~~ll~~~~~~~~P~~y~~~~~~~~~~~~~~~a~~le~~i~~~g~~~IAAfI~EPv~g~agG~~~pp~~Yl~~vr~iC~ky 248 (449) T COG0161 169 DPLLPEVLHLPAPYAYRRGFFGEGDEEFAEAADELEALILEHGPETIAAFIVEPVVGGAGGMLVPPPGYLKRVREICDKY 248 (449) T ss_pred CCCCCCCEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHC T ss_conf 44455762268996434577777748899999999999986095528999955200566775418768999999999983 Q ss_pred CCCE Q ss_conf 9908 Q gi|254780717|r 258 GVMS 261 (294) Q Consensus 258 ~~~~ 261 (294) ++-. T Consensus 249 ~ILl 252 (449) T COG0161 249 GILL 252 (449) T ss_pred CCEE T ss_conf 9479 No 160 >pfam06506 PrpR_N Propionate catabolism activator. This domain is found at the N terminus of several sigma54- dependent transcriptional activators including PrpR, which activates catabolism of propionate. Probab=32.68 E-value=38 Score=14.95 Aligned_cols=62 Identities=10% Similarity=0.205 Sum_probs=46.0 Q ss_pred CHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCEEEECC Q ss_conf 215778987498120034677544497889999999975499389981888978999999971990899805 Q gi|254780717|r 195 EAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMSAILDP 266 (294) Q Consensus 195 ~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~~~ld~ 266 (294) +.+..|++-+|++..-.... ++.++...++.+++.++.+++... .+..+|++.|.+.+.+.+ T Consensus 88 ~~~~~~~~il~~~i~~~~~~-----~~~e~~~~i~~l~~~G~~vvVG~~-----~~~~~A~~~Gl~~vli~s 149 (169) T pfam06506 88 PGLKALSELLGLDIVQRAYQ-----SEEEARAAVKELKAQGIKVIVGDG-----LVCDLAEQAGLQGVLIYS 149 (169) T ss_pred HHHHHHHHHHCCCEEEEEEC-----CHHHHHHHHHHHHHCCCEEEECCC-----HHHHHHHHCCCCEEEEEC T ss_conf 68999999969935999966-----889999999999986995998582-----899999983995799966 No 161 >KOG2807 consensus Probab=32.64 E-value=38 Score=14.95 Aligned_cols=48 Identities=19% Similarity=0.178 Sum_probs=38.4 Q ss_pred CCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCE-EEECC Q ss_conf 4978899999999754993899818889789999999719908-99805 Q gi|254780717|r 219 MGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMS-AILDP 266 (294) Q Consensus 219 ~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~-~~ld~ 266 (294) -.|.++.++++++|..+|+|=+..-...-.+-+.|+++||... |.+|+ T Consensus 177 ~DPgdi~~tI~~lk~~kIRvsvIgLsaEv~icK~l~kaT~G~Y~V~lDe 225 (378) T KOG2807 177 CDPGDIYETIDKLKAYKIRVSVIGLSAEVFICKELCKATGGRYSVALDE 225 (378) T ss_pred CCCCCHHHHHHHHHHHCEEEEEEEECHHHHHHHHHHHHHCCEEEEEECH T ss_conf 5852099999999861727999850055899999988618857987578 No 162 >PRK00022 lolB outer membrane lipoprotein LolB; Provisional Probab=32.43 E-value=34 Score=15.27 Aligned_cols=21 Identities=24% Similarity=0.301 Sum_probs=15.0 Q ss_pred CHHHHHHHHHHHHHHHCCCCC Q ss_conf 915899999999996001323 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARA 21 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~ 21 (294) |.|.+.+++++++++++|+.- T Consensus 1 ~~r~~~~l~~~~llLsgCat~ 21 (203) T PRK00022 1 MFRLLRLLPLAALLLAGCASL 21 (203) T ss_pred CHHHHHHHHHHHHHHHHCCCC T ss_conf 906389999999998661489 No 163 >PRK08247 cystathionine gamma-synthase; Reviewed Probab=32.38 E-value=38 Score=14.92 Aligned_cols=46 Identities=7% Similarity=0.103 Sum_probs=21.1 Q ss_pred HCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCE--EEECCCH Q ss_conf 218656588998999986312223589998503002781--4620621 Q gi|254780717|r 151 KKDPRNKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKK--IIVFHEA 196 (294) Q Consensus 151 ~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~--~i~~H~a 196 (294) .+.|..+-......+.+.-+++.-.+--.....-++..+ --++||. T Consensus 231 ~~~p~~a~l~~rGl~TL~lR~~~~~~nA~~lA~~L~~hp~V~~V~yPG 278 (366) T PRK08247 231 VLSPFDSWLLIRGMKTLALRMRQHEENAKALAAFLNEQPGVTDVLYPG 278 (366) T ss_pred CCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCC T ss_conf 779377778863525099999999999999999986199704895699 No 164 >PRK10877 thiol:disulfide interchange protein DsbC; Provisional Probab=32.08 E-value=35 Score=15.19 Aligned_cols=42 Identities=19% Similarity=0.178 Sum_probs=25.2 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCEEEEEECHHHHHHHHHHHCCCCEEEEEEC Q ss_conf 9158999999999960013238805999722899999997079441788705 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARAGSLQVVVSIKPIHSIVSCIMQGIGTPALLVK 52 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~~~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~ 52 (294) |||.++++.++++.+++.+.++. ..+..-.++++ +.|.++-+ T Consensus 1 mkk~l~~~~~l~~~~s~~a~aDe-------a~i~~~l~k~g---~~v~~V~~ 42 (232) T PRK10877 1 MKKGFMLFTLLAAAFSGFAHADD-------AAIQQTLAKLG---IQSADIQP 42 (232) T ss_pred CCHHHHHHHHHHHHHHHHHHCCH-------HHHHHHHHHCC---CCCCEECC T ss_conf 92669999999999862524379-------99999999839---98313556 No 165 >pfam10280 Med11 Mediator complex protein. Mediator is a large, modular protein complex that is conserved from yeast to human and conveys regulatory signals from DNA-binding transcription factors to RNA polymerase II. Not only are the polypeptides conserved but the structural organisation is also largely conserved. One or two subunits are either fungal or vertebral specific but Med11 is one of the subunits that is conserved from fungi to humans. Med11 appears to be necessary for the full and successful assembly of the core head sub-region. Probab=31.97 E-value=39 Score=14.88 Aligned_cols=66 Identities=23% Similarity=0.246 Sum_probs=44.8 Q ss_pred HHHHHHHHHHHHHHHHHHCCC--CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCHHHH Q ss_conf 555567899999999742186--565889989999863122235899985030027814620621577 Q gi|254780717|r 134 NPLNVQYIAHVIAMELIKKDP--RNKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKKIIVFHEAYRY 199 (294) Q Consensus 134 ~p~n~~~~a~~I~~~L~~~dP--~~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~~i~~H~a~~Y 199 (294) +..-+..-|..+-+.|++.-| .+++.|+.+...|.+-|..++..+++.+.-+.....+.-|..=+| T Consensus 14 ~I~~lL~~Ag~~i~~Ls~~k~~~~~k~~f~~~~~~F~~~L~~V~~~Lr~qI~~L~e~~~~~p~e~~~~ 81 (113) T pfam10280 14 KIVSLLQSAGQVIQELSKEKSGESSKEQFEAHSKEFYKTLSSVEVELRKQIKYLDEVIGGQPHEGSGY 81 (113) T ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 99999999999999871589975679999999999999999999999999999998822688777760 No 166 >PRK07504 O-succinylhomoserine sulfhydrylase; Reviewed Probab=31.84 E-value=39 Score=14.87 Aligned_cols=130 Identities=16% Similarity=0.162 Sum_probs=54.9 Q ss_pred HCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH---CCCCCCC-----EEEECCCHHHHHHHHCCCC--EECCCCCCCCCCC Q ss_conf 2186565889989999863122235899985---0300278-----1462062157789874981--2003467754449 Q gi|254780717|r 151 KKDPRNKIIYEKNEEEFKNQLSQLDKELHSI---LQPVEKK-----KIIVFHEAYRYFASHYNLS--IVTFPMSHSVFMG 220 (294) Q Consensus 151 ~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~---~~~~~~~-----~~i~~H~a~~Yf~~~~gl~--~~~~~~~~~~~~s 220 (294) -+.|..+-......+.+--++++-.+--... +...... +-+..||....+.+.+.=- +.+++... + T Consensus 245 ~~~p~~a~l~lRGL~TL~lRm~r~~~nA~~iA~~L~~hp~V~~V~yPgl~s~p~~~~a~k~~~g~gg~~sf~l~~----~ 320 (397) T PRK07504 245 SLSPFNAWVLLKGLETLAVRVRQQTESAAAIADFLAGHPKVARVIYPGRADHPQADIVAKQMTGGSTLVAFELKG----G 320 (397) T ss_pred CCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCHHHHHHHCCCCCEEEEEEEEC----C T ss_conf 788378899871677299999999999999999997389955797989877827889998579985245899828----8 Q ss_pred HHHHHHHHHHHHHCC-------CCEEEEECCCC--HHHHHHHHHHHCCCE--EEECCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 788999999997549-------93899818889--789999999719908--9980533678778167899999999999 Q gi|254780717|r 221 AASLRNIRSKIISDK-------ISCLFYGPEFD--SKIIRSITNDTGVMS--AILDPEGMLIAEGPELYFQLMRSMSNSI 289 (294) Q Consensus 221 ~~~l~~~~~~ik~~~-------v~~if~e~~~~--~~~~~~ia~e~~~~~--~~ld~~~~~~~~~~~~Y~~~m~~n~~~l 289 (294) .....++.+.++=-. +..++.-|... ....+.-.++.|+.. +.| +.|. +...+++...-++| T Consensus 321 ~~~~~~f~~~L~l~~~a~SlGg~eSLi~~pa~~th~~~~~e~r~~~Gi~~~lvRl-SVGl------Ed~eDLi~Dl~~AL 393 (397) T PRK07504 321 KEAAFRFLNALKIVRISNNLGDAKSLITHPATTTHKNLSPEARAELGISEGFIRL-SAGL------EDTDDLIEDLEAAL 393 (397) T ss_pred HHHHHHHHHCCCCCEECCCCCCCCCEEECCCCCCCCCCCHHHHHHCCCCCCEEEE-EECC------CCHHHHHHHHHHHH T ss_conf 9999999980891522226787650323764234311899999866979295999-9600------89999999999999 Q ss_pred HH Q ss_conf 87 Q gi|254780717|r 290 AK 291 (294) Q Consensus 290 ~k 291 (294) ++ T Consensus 394 ~~ 395 (397) T PRK07504 394 KK 395 (397) T ss_pred HH T ss_conf 85 No 167 >PRK10290 superoxide dismutase; Provisional Probab=31.81 E-value=38 Score=14.97 Aligned_cols=20 Identities=25% Similarity=0.361 Sum_probs=12.0 Q ss_pred CHHHHHHHHHHHHHHHCCCC Q ss_conf 91589999999999600132 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVAR 20 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~ 20 (294) |||+.+++++++.+.+..+. T Consensus 1 mk~~~~~~~~l~~~~~~~a~ 20 (173) T PRK10290 1 MKRFSLAILALVVCTGAQAA 20 (173) T ss_pred CCHHHHHHHHHHHHHHHHHC T ss_conf 91079999999997666514 No 168 >TIGR03449 mycothiol_MshA UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase. Members of this protein family, found exclusively in the Actinobacteria, are MshA, the glycosyltransferase of mycothiol biosynthesis. Mycothiol replaces glutathione in these species. Probab=31.81 E-value=39 Score=14.86 Aligned_cols=19 Identities=5% Similarity=-0.059 Sum_probs=8.6 Q ss_pred HHHHHHHHHCCCCEEEEEE Q ss_conf 9999999707944178870 Q gi|254780717|r 33 IHSIVSCIMQGIGTPALLV 51 (294) Q Consensus 33 l~~lv~~I~gd~~~V~~L~ 51 (294) +..+++.++.--.+|+.+. T Consensus 26 v~~La~~La~rGheV~V~t 44 (405) T TIGR03449 26 ILETATELARRGIEVDIFT 44 (405) T ss_pred HHHHHHHHHHCCCEEEEEE T ss_conf 9999999997899699993 No 169 >TIGR02329 propionate_PrpR propionate catabolism operon regulatory protein PrpR; InterPro: IPR012704 At least five distinct pathways exist for the catabolism of propionate by way of propionyl-CoA. Members of this family represent the transcriptional regulatory protein PrpR which is, in most cases, divergently transcribed from the operon that encodes the genes involved in the methylcitric acid cycle of propionate catabolism. This protein is required for the expression of the proteins involved in this pathway . 2-methylcitric acid, an intermediate in this pathway, has been proposed to be a co-activator of PrpR .; GO: 0000156 two-component response regulator activity, 0003677 DNA binding, 0005524 ATP binding, 0000160 two-component signal transduction system (phosphorelay), 0019629 propionate catabolic process 2-methylcitrate cycle, 0005737 cytoplasm. Probab=31.69 E-value=39 Score=14.85 Aligned_cols=71 Identities=15% Similarity=0.331 Sum_probs=48.8 Q ss_pred CCEEEEC---CCHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCEEE Q ss_conf 7814620---6215778987498120034677544497889999999975499389981888978999999971990899 Q gi|254780717|r 187 KKKIIVF---HEAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMSAI 263 (294) Q Consensus 187 ~~~~i~~---H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~~~ 263 (294) ...+||| -|++.-|.+.|||.+.- -.=.|..|=...+..+|+.+|.+|+-- -++--+|++.|...+. T Consensus 99 ~igvvT~~~t~p~~~~Fq~~f~l~~~~-----r~y~t~eDAr~~v~~Lra~G~~~vVG~-----GL~tDlA~~AgL~gvf 168 (658) T TIGR02329 99 RIGVVTYQDTVPALREFQKAFNLDIEQ-----RSYVTEEDARSCVNDLRASGIEVVVGA-----GLITDLAEQAGLVGVF 168 (658) T ss_pred CEEEEECCCCHHHHHHHHHHHCCCHHH-----HHHCCHHHHHHHHHHHHHCCCEEEECC-----CHHHHHHHHCCCCEEE T ss_conf 104674278846799999986341688-----750467789999999976897598876-----5688888767964787 Q ss_pred ECCC Q ss_conf 8053 Q gi|254780717|r 264 LDPE 267 (294) Q Consensus 264 ld~~ 267 (294) |++- T Consensus 169 lYSa 172 (658) T TIGR02329 169 LYSA 172 (658) T ss_pred EECH T ss_conf 6337 No 170 >PRK07671 cystathionine beta-lyase; Provisional Probab=31.50 E-value=40 Score=14.83 Aligned_cols=76 Identities=11% Similarity=0.224 Sum_probs=33.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHH---CCCCCCC-----EEEECCCHHHHHHHHCC-C-CEECCCCCCCCCCCH Q ss_conf 186565889989999863122235899985---0300278-----14620621577898749-8-120034677544497 Q gi|254780717|r 152 KDPRNKIIYEKNEEEFKNQLSQLDKELHSI---LQPVEKK-----KIIVFHEAYRYFASHYN-L-SIVTFPMSHSVFMGA 221 (294) Q Consensus 152 ~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~---~~~~~~~-----~~i~~H~a~~Yf~~~~g-l-~~~~~~~~~~~~~s~ 221 (294) +.|..+-......+.+.-+++.-.+--... +...... +-+-.||.+..+.+.+. . -+.+++.. +. T Consensus 230 lsP~~a~L~lRGL~TL~lRm~r~~~nA~~vA~~L~~hp~V~~V~yPgL~s~p~h~la~~~~~g~gg~~sf~l~-----~~ 304 (377) T PRK07671 230 LGPQDSFLLLRGLKTLGIRMEEHETNSRAIAEFLNNHPKVNKVYYPGLPSHPNHELAKEQANGFGGMISFDVG-----SE 304 (377) T ss_pred CCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCHHHHHHHCCCCCCEEEEEEEC-----CH T ss_conf 8847778987068869999999999999999998619984499676888992599998558997559999938-----78 Q ss_pred HHHHHHHHHHH Q ss_conf 88999999997 Q gi|254780717|r 222 ASLRNIRSKII 232 (294) Q Consensus 222 ~~l~~~~~~ik 232 (294) ....++.+.+| T Consensus 305 ~~a~~f~~~L~ 315 (377) T PRK07671 305 ETLNKVLERLQ 315 (377) T ss_pred HHHHHHHHHCC T ss_conf 99999998088 No 171 >PRK10190 hypothetical protein; Provisional Probab=31.49 E-value=40 Score=14.83 Aligned_cols=42 Identities=24% Similarity=0.356 Sum_probs=24.4 Q ss_pred CHHHHHHHHHHHHHHHCCCCC-------------CCE-EEEE---ECHHHHHHHHHHHC Q ss_conf 915899999999996001323-------------880-5999---72289999999707 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARA-------------GSL-QVVV---SIKPIHSIVSCIMQ 42 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~-------------~~~-~Vv~---s~~pl~~lv~~I~g 42 (294) |||+.+++.+++.+++..+.+ +.. .+.+ .+.|+.+|+++-.- T Consensus 1 ~~~~~~~~~~~~~~~s~~a~A~~y~LP~~gs~lvG~~~~~~v~~~~~~~L~~iAr~y~~ 59 (310) T PRK10190 1 MRRVNILCSFALLFASHTSLAVTYPLPPEGSRLVGQSLTVTVPDHNTQPLETFAAQYGQ 59 (310) T ss_pred CCHHHHHHHHHHHHCCCCHHEEEECCCCCCCCEECCCEEEEECCCCCCCHHHHHHHHCH T ss_conf 93136799999998065034067127999982445207999378987329999998676 No 172 >PRK02910 light-independent protochlorophyllide reductase subunit B; Provisional Probab=31.30 E-value=40 Score=14.81 Aligned_cols=76 Identities=12% Similarity=0.187 Sum_probs=42.1 Q ss_pred CCCCEEEECCC-----HHHH-HHHHCCCCEECCCCCCCC---------------CCCHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 02781462062-----1577-898749812003467754---------------44978899999999754993899818 Q gi|254780717|r 185 VEKKKIIVFHE-----AYRY-FASHYNLSIVTFPMSHSV---------------FMGAASLRNIRSKIISDKISCLFYGP 243 (294) Q Consensus 185 ~~~~~~i~~H~-----a~~Y-f~~~~gl~~~~~~~~~~~---------------~~s~~~l~~~~~~ik~~~v~~if~e~ 243 (294) +.++.++++-| +..+ +.+++|+++.+.+..... ..-..+-.++.+.|.+.....||. + T Consensus 291 ltgKr~fVfGDaTha~a~~kil~~ElG~~vvg~GTY~r~~Ar~~r~~~~~~~~e~lItdD~~eV~~~I~~~~P~lv~G-T 369 (524) T PRK02910 291 LTGKRVFVFGDATHAVAAAKILRDELGFEVVGAGTYLREDARWVRAAAKGYGDEALITDDYLEVEDAIAEAAPELVLG-T 369 (524) T ss_pred HCCCEEEEECCCHHHHHHHHHHHHHHCEEEEEECCCHHHHHHHHHHHHHHHCCCCEECCCHHHHHHHHHHCCCCEEEC-C T ss_conf 136447996560799999999888648078731355187789999999860645064067899999998559652104-0 Q ss_pred CCCHHHHHHHHHHHCCCEEEEC Q ss_conf 8897899999997199089980 Q gi|254780717|r 244 EFDSKIIRSITNDTGVMSAILD 265 (294) Q Consensus 244 ~~~~~~~~~ia~e~~~~~~~ld 265 (294) |.. +-+++.++++-.++. T Consensus 370 qME----RHi~krL~IPCaVIS 387 (524) T PRK02910 370 QME----RHSAKRLGIPCAVIS 387 (524) T ss_pred HHH----HHHHHCCCCCCCCCC T ss_conf 787----655520599855668 No 173 >TIGR01279 DPOR_bchN light-independent protochlorophyllide reductase, N subunit; InterPro: IPR005970 Synonym: dark protochlorophyllide reductase Protochlorophyllide reductase catalyzes the reductive formation of chlorophyllide from protochlorophyllide during biosynthesis of chlorophylls and bacteriochlorophylls. The light-independent (dark) form of protochlorophyllide reductase plays a key role in the ability of gymnosperms, algae, and photosynthetic bacteria to form chlorophyll in the dark. Genetic and sequence analyses have indicated that dark protochlorophyllide reductase consists of three protein subunits that exhibit significant sequence similarity to the three subunits of nitrogenase, which catalyzes the reductive formation of ammonia from dinitrogen. Dark protochlorophyllide reductase activity was shown to be dependent on the presence of all three subunits, ATP, and the reductant dithionite. This enzyme describes the N subunit of the dark form protochlorophyllide reductase, a nitrogenase-like enzyme . ; GO: 0016730 oxidoreductase activity acting on iron-sulfur proteins as donors, 0015995 chlorophyll biosynthetic process, 0019685 photosynthesis dark reaction. Probab=31.26 E-value=40 Score=14.80 Aligned_cols=54 Identities=9% Similarity=0.140 Sum_probs=25.6 Q ss_pred HHHHHHHHHHCCCCEEEEEECCCCCC-----CCCCCCHHHHHH-HC-CCCEEEEECCCCCH Q ss_conf 89999999707944178870589983-----533568024566-50-25789996766103 Q gi|254780717|r 32 PIHSIVSCIMQGIGTPALLVKGASSP-----HEYSLRISEAMM-LE-NADIIFWLGAEMES 85 (294) Q Consensus 32 pl~~lv~~I~gd~~~V~~L~~~g~dp-----H~y~~tp~d~~~-l~-~Adlii~~G~~~E~ 85 (294) -|.-++++|..++-++.+|+=-|+|+ =+++-...++.. .+ +--++++.+-|++- T Consensus 72 eL~~v~~~i~~~~P~p~vifllssCt~ev~K~Dl~~~A~~Ls~~~~Pr~PvL~~~asGlD~ 132 (458) T TIGR01279 72 ELKRVVEQIKRERPNPSVIFLLSSCTPEVIKMDLEGLAEQLSTEFGPRVPVLFAPASGLDY 132 (458) T ss_pred HHHHHHHHHHHCCCCCEEEEEECCCCCCEEECCHHHHHHHHHCCCCCCEEEEEECCCCCCC T ss_conf 9999999998508996499996168862254026776687300368950078730585310 No 174 >PRK13238 tnaA tryptophanase; Provisional Probab=31.19 E-value=40 Score=14.80 Aligned_cols=135 Identities=11% Similarity=0.232 Sum_probs=68.9 Q ss_pred HHHHHHHHHHHHHHHHHHH-------CCCCCHHHHHHHHHHHHHH-HHHHHHHHHHHCC----CCCC------CEEEECC Q ss_conf 6555567899999999742-------1865658899899998631-2223589998503----0027------8146206 Q gi|254780717|r 133 LNPLNVQYIAHVIAMELIK-------KDPRNKIIYEKNEEEFKNQ-LSQLDKELHSILQ----PVEK------KKIIVFH 194 (294) Q Consensus 133 l~p~n~~~~a~~I~~~L~~-------~dP~~~~~y~~N~~~~~~~-L~~l~~~~~~~~~----~~~~------~~~i~~H 194 (294) .|..|++... .|+++..- ..-+|+-+.+.+-..|.++ +.++..++-+.-. ..++ -.++.+. T Consensus 195 VSm~Nir~v~-~la~~~~ip~~lDaaRfaENAyFIk~RE~gY~d~si~eI~rEmfs~aDg~tmSaKKD~~vniGG~l~~~ 273 (461) T PRK13238 195 VSMANLRAVY-EIAKKYGIPVVLDAARFAENAYFIKQREPGYKDKSIKEITREMFSYADGLTMSAKKDAMVNIGGLLCFK 273 (461) T ss_pred CCHHHHHHHH-HHHHHCCCCEEEEHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHEEEEEECCCCEECCCCEEEEC T ss_conf 4689999999-999982995998656676645788732531257999999999984523799983301210442289966 Q ss_pred CHHHHHHH--H-----CCCC-----------EECCCCC--CC---CCCCHHHHHHHHHHHHHCCCCEE--------EEE- Q ss_conf 21577898--7-----4981-----------2003467--75---44497889999999975499389--------981- Q gi|254780717|r 195 EAYRYFAS--H-----YNLS-----------IVTFPMS--HS---VFMGAASLRNIRSKIISDKISCL--------FYG- 242 (294) Q Consensus 195 ~a~~Yf~~--~-----~gl~-----------~~~~~~~--~~---~~~s~~~l~~~~~~ik~~~v~~i--------f~e- 242 (294) |--..|.+ . .|+- ..+++.. -+ ..-.++++.-+.+.+++.+|+++ |.+ T Consensus 274 d~~~l~~~~~~~~i~~EGf~TYGGlaGRDmeAlAvGL~E~~d~dyl~~Ri~qv~yLg~~L~~~GVPvv~P~GGHAv~iDA 353 (461) T PRK13238 274 DDSDLYTKCRTLCILYEGFPTYGGLAGRDMEALAVGLYEGMDEDYLAYRIAQVEYLGEGLEEAGVPVVTPAGGHAVFVDA 353 (461) T ss_pred CHHHHHHHHHHHEEECCCCCCCCCCCHHHHHHHHCCHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCEECCCCCEEEEEEH T ss_conf 75999999874424205875668833877999973578760779999779999999999997799705679851899746 Q ss_pred ----C-----CCCH-HHHHHHHHHHCCCEEEECCCC Q ss_conf ----8-----8897-899999997199089980533 Q gi|254780717|r 243 ----P-----EFDS-KIIRSITNDTGVMSAILDPEG 268 (294) Q Consensus 243 ----~-----~~~~-~~~~~ia~e~~~~~~~ld~~~ 268 (294) | |++. .++-.|-.+.|++.+++-++. T Consensus 354 ~~fLPhip~~qfPa~aLa~eLY~~~GIR~vEiGs~~ 389 (461) T PRK13238 354 GKFLPHIPAEQFPAQALACELYLEAGIRAVEIGSLL 389 (461) T ss_pred HHHCCCCCHHHCCHHHHHHHHHHHCCCCEEEECCHH T ss_conf 785679981448189999999997295578750012 No 175 >PRK09221 beta alanine--pyruvate transaminase; Provisional Probab=31.09 E-value=40 Score=14.79 Aligned_cols=49 Identities=12% Similarity=0.209 Sum_probs=20.6 Q ss_pred HHHHHHHHCCCC--CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEE Q ss_conf 999997421865--6588998999986312223589998503002781462 Q gi|254780717|r 144 VIAMELIKKDPR--NKIIYEKNEEEFKNQLSQLDKELHSILQPVEKKKIIV 192 (294) Q Consensus 144 ~I~~~L~~~dP~--~~~~y~~N~~~~~~~L~~l~~~~~~~~~~~~~~~~i~ 192 (294) .++++|+++-|. .+-+|-+....-.+---++...+........+..+++ T Consensus 96 ~lae~L~~~~p~~l~~v~f~~SGsEAvE~AiKlAr~y~~~~g~~~r~~ii~ 146 (445) T PRK09221 96 ELAERLAELAPGGLDHVFFTNSGSESVDTALKIALAYHRARGQGTRTRLIG 146 (445) T ss_pred HHHHHHHHHCCCCCCEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEE T ss_conf 999999984888758799608789999999999999999719997526962 No 176 >PRK10760 murein hydrolase B; Provisional Probab=30.98 E-value=25 Score=16.15 Aligned_cols=25 Identities=12% Similarity=0.329 Sum_probs=15.6 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 91589999999999600132388059 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARAGSLQV 26 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~~~~~V 26 (294) ||+++.+ +.++++|++|++.-.+.+ T Consensus 1 ~r~~~~~-~~~~~~l~~css~p~~~~ 25 (357) T PRK10760 1 KRRYVTL-LPLFVLLAACSSKPKPTE 25 (357) T ss_pred CCCHHHH-HHHHHHHHHHCCCCCCCC T ss_conf 9404868-999999998518999999 No 177 >cd03809 GT1_mtfB_like This family is most closely related to the GT1 family of glycosyltransferases. mtfB (mannosyltransferase B) in E. coli has been shown to direct the growth of the O9-specific polysaccharide chain. It transfers two mannoses into the position 3 of the previously synthesized polysaccharide. Probab=30.84 E-value=41 Score=14.76 Aligned_cols=11 Identities=36% Similarity=0.447 Sum_probs=4.6 Q ss_pred HCCCCEEEEEC Q ss_conf 50257899967 Q gi|254780717|r 70 LENADIIFWLG 80 (294) Q Consensus 70 l~~Adlii~~G 80 (294) +..||.||.+. T Consensus 138 ~~~ad~ii~vS 148 (365) T cd03809 138 LRRADAIITVS 148 (365) T ss_pred HHHCCEEEECC T ss_conf 99699999979 No 178 >PRK09810 entericidin A; Provisional Probab=30.80 E-value=36 Score=15.07 Aligned_cols=21 Identities=19% Similarity=0.469 Sum_probs=16.0 Q ss_pred CHHHHHHHHHHHHHHHCCCCC Q ss_conf 915899999999996001323 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARA 21 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~ 21 (294) |||++.++++...++++|..+ T Consensus 2 mkrli~lil~~~~ll~gcnta 22 (41) T PRK09810 2 MKRLIVLVLLASTLLTGCNTA 22 (41) T ss_pred HHHHHHHHHHHHHHHHCCCCC T ss_conf 389999999999998256212 No 179 >PRK09465 tolC outer membrane channel protein; Reviewed Probab=30.45 E-value=37 Score=15.04 Aligned_cols=16 Identities=38% Similarity=0.262 Sum_probs=9.9 Q ss_pred CHHHHHHHHHHHHHHH Q ss_conf 9158999999999960 Q gi|254780717|r 1 MKNFLIILIFLFFILS 16 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~ 16 (294) ||||+.+++.+++..+ T Consensus 1 mkki~~~l~~l~l~~~ 16 (446) T PRK09465 1 MKKLLPLLIGLSLAGF 16 (446) T ss_pred CCHHHHHHHHHHHHHC T ss_conf 9429999999999861 No 180 >pfam06649 DUF1161 Protein of unknown function (DUF1161). This family consists of several short, hypothetical bacterial proteins of unknown function. Probab=30.42 E-value=38 Score=14.94 Aligned_cols=20 Identities=30% Similarity=0.529 Sum_probs=13.4 Q ss_pred CHHHHHHHHHHHHHHHCCCC Q ss_conf 91589999999999600132 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVAR 20 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~ 20 (294) |||+++...++++..+..++ T Consensus 1 Mkk~~l~~~l~lla~~alAA 20 (75) T pfam06649 1 MKRLLLAVALLLLAASALAA 20 (75) T ss_pred CCHHHHHHHHHHHHHHHHHC T ss_conf 92469999999971345315 No 181 >TIGR02747 TraV type IV conjugative transfer system protein TraV; InterPro: IPR014118 This entry includes TraV, which is a component of conjugative type IV secretion system. TraV is an outer membrane lipoprotein that is believed to interact with the secretin TraK , , . The alignment contains three conserved cysteines in the N-terminal half.. Probab=30.36 E-value=33 Score=15.35 Aligned_cols=42 Identities=12% Similarity=0.339 Sum_probs=19.8 Q ss_pred CHHHH-HHHH-HHHHHHHCCCCC---------------CCEEEEEECHHHHHHHHHHHC Q ss_conf 91589-9999-999996001323---------------880599972289999999707 Q gi|254780717|r 1 MKNFL-IILI-FLFFILSSVARA---------------GSLQVVVSIKPIHSIVSCIMQ 42 (294) Q Consensus 1 Mkk~l-~~~~-~~~~~l~~~~~~---------------~~~~Vv~s~~pl~~lv~~I~g 42 (294) |+|+. ++++ +..|+|+||++. ++.-+-.+..-.++-.+.... T Consensus 1 Mmr~~~L~~~~~~~f~LtGCsa~~g~i~~~~f~C~~~r~~~~~C~~md~V~~~~~~l~~ 59 (174) T TIGR02747 1 MMRLKVLLLLACVAFLLTGCSASLGLIHNSDFSCEGTRGGVGTCATMDQVNDDKRLLKA 59 (174) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEECCCCCCCCEEEEHHHHHHHHHHHCC T ss_conf 90678999999998720025774565258897554533885420028776778764227 No 182 >cd03811 GT1_WabH_like This family is most closely related to the GT1 family of glycosyltransferases. WabH in Klebsiella pneumoniae has been shown to transfer a GlcNAc residue from UDP-GlcNAc onto the acceptor GalUA residue in the cellular outer core. Probab=30.27 E-value=42 Score=14.70 Aligned_cols=20 Identities=15% Similarity=0.288 Sum_probs=10.6 Q ss_pred CCHHHHHHHHHHHCCCEEEECC Q ss_conf 8978999999971990899805 Q gi|254780717|r 245 FDSKIIRSITNDTGVMSAILDP 266 (294) Q Consensus 245 ~~~~~~~~ia~e~~~~~~~ld~ 266 (294) ++-..+|.++ +|.+++..|. T Consensus 276 fg~~~lEAma--~G~pvI~s~~ 295 (353) T cd03811 276 FPNVLLEAMA--LGTPVVATDC 295 (353) T ss_pred CCHHHHHHHH--CCCCEEECCC T ss_conf 8538999998--0998999489 No 183 >PRK00042 tpiA triosephosphate isomerase; Provisional Probab=30.20 E-value=42 Score=14.69 Aligned_cols=78 Identities=9% Similarity=0.232 Sum_probs=50.7 Q ss_pred CEEEECCCHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHC----------CCCEEEEECCCCHHHHHHHHHHH Q ss_conf 81462062157789874981200346775444978899999999754----------99389981888978999999971 Q gi|254780717|r 188 KKIIVFHEAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISD----------KISCLFYGPEFDSKIIRSITNDT 257 (294) Q Consensus 188 ~~~i~~H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~----------~v~~if~e~~~~~~~~~~ia~e~ 257 (294) +-++.|+|.|. +.++..|++.++.++...||+. +++ |++.-..++..++.+...- T Consensus 161 ~iiIAYEPvWA--------------IGTG~~as~~~i~~~~~~Ir~~l~~~~~~~~~~i~-IlYGGSVn~~N~~~i~~~~ 225 (251) T PRK00042 161 NLVIAYEPVWA--------------IGTGKTATPEQAQEVHAFIRKVLAELYAEVAEKVR-ILYGGSVKPDNAAELFAQP 225 (251) T ss_pred CEEEEECCHHH--------------CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCC-EEEECCCCHHHHHHHHCCC T ss_conf 35999675775--------------07999999999999999999999986052305753-8871778998999996688 Q ss_pred CCCEEEECCCCCCCCCCCCCHHHHHHH Q ss_conf 990899805336787781678999999 Q gi|254780717|r 258 GVMSAILDPEGMLIAEGPELYFQLMRS 284 (294) Q Consensus 258 ~~~~~~ld~~~~~~~~~~~~Y~~~m~~ 284 (294) ++..+.+-- ....++.+.++.+. T Consensus 226 ~vDG~LVGg----ASL~~~~F~~Ii~a 248 (251) T PRK00042 226 DIDGALVGG----ASLKAEDFLAIVKA 248 (251) T ss_pred CCCEEEECH----HHCCHHHHHHHHHH T ss_conf 999798515----86798999999999 No 184 >TIGR01457 HAD-SF-IIA-hyp2 HAD-superfamily subfamily IIA hydrolase, TIGR01457; InterPro: IPR006354 These sequences are all members of the Class IIA subfamily of the haloacid dehalogenase superfamily of aspartate-nucleophile hydrolases. The sequences are all from the Gram-positive (low-GC) bacteria.. Probab=30.17 E-value=42 Score=14.69 Aligned_cols=16 Identities=13% Similarity=0.274 Sum_probs=6.1 Q ss_pred HHHHHHHCCCCEEEEE Q ss_conf 9999975499389981 Q gi|254780717|r 227 IRSKIISDKISCLFYG 242 (294) Q Consensus 227 ~~~~ik~~~v~~if~e 242 (294) +.+.+++..++++|+- T Consensus 25 FV~~L~kr~~pYLFVT 40 (251) T TIGR01457 25 FVKELKKRDVPYLFVT 40 (251) T ss_pred HHHHHHHCCCCEEEEE T ss_conf 9998864798636872 No 185 >PRK12698 flgH flagellar basal body L-ring protein; Reviewed Probab=30.17 E-value=42 Score=14.70 Aligned_cols=19 Identities=11% Similarity=0.506 Sum_probs=12.8 Q ss_pred CHHHHHHHHHHHHHHHCCCCC Q ss_conf 915899999999996001323 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARA 21 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~ 21 (294) |||+++++++ ++|++|+.. T Consensus 1 M~r~~l~~~~--l~L~gCa~~ 19 (224) T PRK12698 1 MARYILLALA--LLLAGCSST 19 (224) T ss_pred CHHHHHHHHH--HHHHCCCCC T ss_conf 9779999999--998365688 No 186 >pfam07269 consensus Probab=29.99 E-value=34 Score=15.25 Aligned_cols=19 Identities=32% Similarity=0.503 Sum_probs=12.5 Q ss_pred CHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9158999999999960013238 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARAG 22 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~~ 22 (294) ||++++++++. +++|...+ T Consensus 1 MKy~ll~l~l~---LAgC~T~d 19 (55) T pfam07269 1 MKYCLLLLILA---LAGCQTND 19 (55) T ss_pred CHHHHHHHHHH---HHHCCCCC T ss_conf 90149999999---87511467 No 187 >PRK08133 O-succinylhomoserine sulfhydrylase; Validated Probab=29.77 E-value=42 Score=14.65 Aligned_cols=79 Identities=11% Similarity=0.069 Sum_probs=33.9 Q ss_pred HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH---CCCCCCC-----EEEECCCHHHHHHHHCCC--CEECCCCCCCCCC Q ss_conf 42186565889989999863122235899985---0300278-----146206215778987498--1200346775444 Q gi|254780717|r 150 IKKDPRNKIIYEKNEEEFKNQLSQLDKELHSI---LQPVEKK-----KIIVFHEAYRYFASHYNL--SIVTFPMSHSVFM 219 (294) Q Consensus 150 ~~~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~---~~~~~~~-----~~i~~H~a~~Yf~~~~gl--~~~~~~~~~~~~~ 219 (294) ..+.|.++-......+.+.-++++-.+--.+. +...... +-+..||.+..+.|.++= -+.+++...+. T Consensus 240 ~~lsP~~a~l~lrGL~TL~lRm~r~~~nA~~lA~~L~~hp~V~~V~yPgL~shp~h~la~r~~~g~Gg~~sf~l~g~~-- 317 (391) T PRK08133 240 PTLSPFNAWVFLKGLETLSLRMEAHSANALELAEWLEAHPGVERVFYPGLPSHPQHELAKRQQKGGGAIVSFEVKGGR-- 317 (391) T ss_pred CCCCCHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCHHHHHHCCCCCCCEEEEEECCCH-- T ss_conf 888932778986047769999999999999999999709991479899998997778999708999725899971999-- Q ss_pred CHHHHHHHHHHHH Q ss_conf 9788999999997 Q gi|254780717|r 220 GAASLRNIRSKII 232 (294) Q Consensus 220 s~~~l~~~~~~ik 232 (294) ....++.+.++ T Consensus 318 --~~~~~f~~~l~ 328 (391) T PRK08133 318 --EAAWRVIDATR 328 (391) T ss_pred --HHHHHHHHHCC T ss_conf --99999998288 No 188 >PRK00094 gpsA NAD(P)H-dependent glycerol-3-phosphate dehydrogenase; Validated Probab=29.77 E-value=42 Score=14.65 Aligned_cols=83 Identities=16% Similarity=0.102 Sum_probs=49.6 Q ss_pred CEEEEEECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHCCCCEEEEEC--CCCCHHHHHHHHHCCCCCCE Q ss_conf 8059997228999999970794417887058998353356802456650257899967--66103568999731455767 Q gi|254780717|r 23 SLQVVVSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRISEAMMLENADIIFWLG--AEMESFLVKPLHSLNKQSNV 100 (294) Q Consensus 23 ~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~G--~~~E~~~~~~~~~~~~~~~~ 100 (294) ...+.+.. ...++.|-..+.+..+| |.-.=|++...+..--+.+.+||++|..= ..+..++.++.....++..+ T Consensus 26 ~V~lw~r~---~~~~~~In~~~~n~~yl-p~i~L~~nI~~t~dl~~~l~~ad~iiiavPs~~~~~~l~~i~~~i~~~~~l 101 (325) T PRK00094 26 DVVLWGRD---PEHAAEINTDRENPRYL-PGIKLPDNLRATSDLAEALADADLILVAVPSHAFREVLKQLKPLLRPDAPI 101 (325) T ss_pred EEEEEEEC---HHHHHHHHHCCCCCCCC-CCCCCCCCEEEECCHHHHHHCCCEEEEECCHHHHHHHHHHHHHHCCCCCEE T ss_conf 78999838---99999999649886568-978589983897899999837984999457699999999998646899749 Q ss_pred EEEEECCCC Q ss_conf 997513544 Q gi|254780717|r 101 VTLSHSPDL 109 (294) Q Consensus 101 i~~~~~~~~ 109 (294) +.++++.+. T Consensus 102 i~~tKGle~ 110 (325) T PRK00094 102 VWATKGIEA 110 (325) T ss_pred EEEECCEEC T ss_conf 997655624 No 189 >PRK03757 hypothetical protein; Provisional Probab=29.68 E-value=43 Score=14.64 Aligned_cols=22 Identities=32% Similarity=0.279 Sum_probs=13.3 Q ss_pred CHHHHHHHHHHHHHHH-CCCCCC Q ss_conf 9158999999999960-013238 Q gi|254780717|r 1 MKNFLIILIFLFFILS-SVARAG 22 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~-~~~~~~ 22 (294) |||+++.++++.+++. +++.+. T Consensus 1 mkk~~l~~~l~a~~~~~~~A~a~ 23 (191) T PRK03757 1 MKKTLLGLTLGSLLFSAGSAVAA 23 (191) T ss_pred CCHHHHHHHHHHHHHHCCCCCCC T ss_conf 90439999999998630404575 No 190 >TIGR02127 pyrF_sub2 orotidine 5'-phosphate decarboxylase; InterPro: IPR011995 Orotidine 5'-phosphate decarboxylase (OMPdecase) , catalyzes the last step in the de novo biosynthesis of pyrimidines, the decarboxylation of OMP into UMP. In higher eukaryotes OMPdecase is part, with orotate phosphoribosyltransferase, of a bifunctional enzyme, while the prokaryotic and fungal OMPdecases are monofunctional protein. Some parts of the sequence of OMPdecase are well conserved across species. The best conserved region is located in the N-terminal half of OMPdecases and is centred around a lysine residue which is essential for the catalytic function of the enzyme. This entry represents orotidine 5'-monophosphate decarboxylase, the PyrF protein of pyrimidine nucleotide biosynthesis.; GO: 0004590 orotidine-5'-phosphate decarboxylase activity, 0006221 pyrimidine nucleotide biosynthetic process. Probab=29.64 E-value=43 Score=14.63 Aligned_cols=122 Identities=17% Similarity=0.076 Sum_probs=51.5 Q ss_pred CCHHHHHHHHHHH----------------HHHHHHHHHHHHCCCCCH---HHHHHHHHHHHHHHHHHHHHHHHHC--C-- Q ss_conf 4756776555567----------------899999999742186565---8899899998631222358999850--3-- Q gi|254780717|r 127 DDLHLWLNPLNVQ----------------YIAHVIAMELIKKDPRNK---IIYEKNEEEFKNQLSQLDKELHSIL--Q-- 183 (294) Q Consensus 127 ~dpH~Wl~p~n~~----------------~~a~~I~~~L~~~dP~~~---~~y~~N~~~~~~~L~~l~~~~~~~~--~-- 183 (294) .|||.|+=|.-.+ .+.+.|-|+-.+.-|.-| .+||.-...=.+.|+++-+.++..- . T Consensus 20 LDP~~~~l~~~~~~~~~~~~~~~~A~~~~~Fc~~iiDA~a~~~a~vKPq~AFFE~~G~~G~~~L~evi~~~~~~GPq~~V 99 (306) T TIGR02127 20 LDPRLELLPEWVKESEARKKKESVAAGLAAFCKRIIDATAEYAAVVKPQVAFFEAFGSEGFKALEEVIAHAKSAGPQLLV 99 (306) T ss_pred ECCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCHHHHCCHHHHHHHHHHHHHHHHCCCCEEE T ss_conf 56985762254111422222447889999988999986426331440460011112767899999999997415894268 Q ss_pred --CCCCCEEEECCCHHHHHHHHCC-CC-------EECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEE---EECCCCHHHH Q ss_conf --0027814620621577898749-81-------200346775444978899999999754993899---8188897899 Q gi|254780717|r 184 --PVEKKKIIVFHEAYRYFASHYN-LS-------IVTFPMSHSVFMGAASLRNIRSKIISDKISCLF---YGPEFDSKII 250 (294) Q Consensus 184 --~~~~~~~i~~H~a~~Yf~~~~g-l~-------~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~~if---~e~~~~~~~~ 250 (294) ..|+--+=. -+=+|-.-++| ++ .=++..+|= .+-..|.-+++..++++- -|| -.++.....+ T Consensus 100 ~~D~KRGDIg~--Ta~aYA~~~lg~~~~~~S~~~~DA~TvsPY--lG~dSL~Pfle~~~~~G~-giFvLv~TSN~~~~~~ 174 (306) T TIGR02127 100 LADAKRGDIGS--TAEAYAKAYLGHLKADDSPLEADALTVSPY--LGLDSLRPFLEVARANGA-GIFVLVKTSNPGGADL 174 (306) T ss_pred EECCCCCCHHH--HHHHHHHHHHCCCCCCCCCCEEEEEEECCH--HCCHHHHHHHHHHHHHCC-EEEEEEEECCCCHHHH T ss_conf 80431376777--999999998336788988410102451300--010331156898975389-6999985168553788 Q ss_pred HHH Q ss_conf 999 Q gi|254780717|r 251 RSI 253 (294) Q Consensus 251 ~~i 253 (294) +.+ T Consensus 175 Q~~ 177 (306) T TIGR02127 175 QDL 177 (306) T ss_pred HHH T ss_conf 855 No 191 >TIGR02475 CobW cobalamin biosynthesis protein CobW; InterPro: IPR012824 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CobW proteins, which are generally found proximal to the trimeric cobaltochelatase subunit CobN, an essential protein for vitamin B12 (cobalamin) biosynthesis . They contain a P-loop nucleotide-binding loop in the N-terminal domain and a histidine-rich region in the C-terminal portion suggesting a role in metal binding, possibly as an intermediary between the cobalt transport and chelation systems. CobW might be involved in cobalt reduction leading to cobalt(I) corrinoids.; GO: 0009236 cobalamin biosynthetic process. Probab=29.41 E-value=30 Score=15.62 Aligned_cols=45 Identities=18% Similarity=0.251 Sum_probs=31.4 Q ss_pred CCCHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCEEEEEECC Q ss_conf 56802456650257899967661035689997314557679975135 Q gi|254780717|r 61 SLRISEAMMLENADIIFWLGAEMESFLVKPLHSLNKQSNVVTLSHSP 107 (294) Q Consensus 61 ~~tp~d~~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~i~~~~~~ 107 (294) |+.-.|+.=|++||++=.- +++.--..+...+..+.+++..+.|. T Consensus 176 QlacADLviLnKaDlldaa--g~~~v~a~i~~~lpr~v~iv~a~~G~ 220 (349) T TIGR02475 176 QLACADLVILNKADLLDAA--GLARVRAEIAAELPRAVKIVEASHGE 220 (349) T ss_pred HHHHHHHHHHHHHCCCCHH--HHHHHHHHHHHHCCCCCEEEECCCCC T ss_conf 9998899985343004888--99999999997358875177514785 No 192 >PRK05968 hypothetical protein; Provisional Probab=29.32 E-value=43 Score=14.60 Aligned_cols=17 Identities=18% Similarity=0.108 Sum_probs=12.3 Q ss_pred CCHHHHHHHHHHHHHHH Q ss_conf 67899999999999874 Q gi|254780717|r 276 ELYFQLMRSMSNSIAKN 292 (294) Q Consensus 276 ~~Y~~~m~~n~~~l~k~ 292 (294) +...+++...-++|+++ T Consensus 371 Ed~eDLi~Dl~qAL~~a 387 (389) T PRK05968 371 EGTEALWADLEQALAAA 387 (389) T ss_pred CCHHHHHHHHHHHHHHH T ss_conf 99999999999999984 No 193 >pfam05643 DUF799 Putative bacterial lipoprotein (DUF799). This family consists of several bacterial proteins of unknown function. Some of the family members are described as putative lipoproteins. Probab=29.31 E-value=43 Score=14.60 Aligned_cols=29 Identities=7% Similarity=0.284 Sum_probs=19.4 Q ss_pred CCCCCC-CCHHHHHHHCCCCEEEEECCCCCHH Q ss_conf 835335-6802456650257899967661035 Q gi|254780717|r 56 SPHEYS-LRISEAMMLENADIIFWLGAEMESF 86 (294) Q Consensus 56 dpH~y~-~tp~d~~~l~~Adlii~~G~~~E~~ 86 (294) |+.+.+ ..|.-+..+-+||.++|.- .+.| T Consensus 88 ~a~~i~~v~~~Kl~eiFGADAvLYvt--I~~y 117 (215) T pfam05643 88 NAADIHAVRPEKLHQIFGNDAVLYIT--VTEY 117 (215) T ss_pred CHHHHHCCCHHHHHHHHCCCEEEEEE--EECC T ss_conf 98997359989999873877689987--1024 No 194 >TIGR02887 spore_ger_x_C germination protein, Ger(x)C family; InterPro: IPR008844 The GerAA, -AB, and -AC proteins of the Bacillus subtilis spore are required for the germination response to L-alanine as the sole germinant. Members of GerAC family are thought to be located in the inner spore membrane. Although the function of this family is unclear, they are likely to encode the components of the germination apparatus that respond directly to this germinant, mediating the spore's response .; GO: 0009847 spore germination, 0016020 membrane. Probab=29.23 E-value=37 Score=14.99 Aligned_cols=21 Identities=19% Similarity=0.336 Sum_probs=15.9 Q ss_pred CHHHHHHHHHHHHHHHCCCCC Q ss_conf 915899999999996001323 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARA 21 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~ 21 (294) +++.++++++++++|+||... T Consensus 1 ~~~~~~~~~~~~~lL~GCwd~ 21 (400) T TIGR02887 1 KKLKILLLILALLLLTGCWDK 21 (400) T ss_pred CHHHHHHHHHHHHHHCCCCCC T ss_conf 946678999999983266562 No 195 >PRK09934 putative fimbrial protein; Provisional Probab=29.05 E-value=44 Score=14.57 Aligned_cols=17 Identities=12% Similarity=0.190 Sum_probs=10.7 Q ss_pred CHHHHHHHHHHHHHHHC Q ss_conf 91589999999999600 Q gi|254780717|r 1 MKNFLIILIFLFFILSS 17 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~ 17 (294) |||+++.+++.+++.+. T Consensus 1 mkk~~l~~~~~l~~~~~ 17 (171) T PRK09934 1 MRRVFIAIFCGLLWSPL 17 (171) T ss_pred CHHHHHHHHHHHHHHHH T ss_conf 92689999999998634 No 196 >cd05013 SIS_RpiR RpiR-like protein. RpiR contains a SIS (Sugar ISomerase) domain, which is found in many phosphosugar isomerases and phosphosugar binding proteins. In E. coli, rpiR negatively regulates the expression of rpiB gene. Both rpiB and rpiA are ribose phosphate isomerases that catalyze the reversible reactions of ribose 5-phosphate into ribulose 5-phosphate. Probab=28.98 E-value=44 Score=14.56 Aligned_cols=26 Identities=4% Similarity=-0.130 Sum_probs=15.5 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCH Q ss_conf 88999999997549938998188897 Q gi|254780717|r 222 ASLRNIRSKIISDKISCLFYGPEFDS 247 (294) Q Consensus 222 ~~l~~~~~~ik~~~v~~if~e~~~~~ 247 (294) +++.++.+..|+.+++++.......+ T Consensus 74 ~~~~~~~~~ak~~g~~ii~IT~~~~s 99 (139) T cd05013 74 KETVEAAEIAKERGAKVIAITDSANS 99 (139) T ss_pred HHHHHHHHHHHHCCCEEEEEECCCCC T ss_conf 78999999999869979999799999 No 197 >PRK07798 acyl-CoA synthetase; Validated Probab=28.83 E-value=44 Score=14.54 Aligned_cols=20 Identities=10% Similarity=0.087 Sum_probs=7.4 Q ss_pred EEEEEECHHHHHHHHHHHCC Q ss_conf 05999722899999997079 Q gi|254780717|r 24 LQVVVSIKPIHSIVSCIMQG 43 (294) Q Consensus 24 ~~Vv~s~~pl~~lv~~I~gd 43 (294) .+++.+-.-+...+.++..+ T Consensus 101 ~~~~i~~~~~~~~~~~~~~~ 120 (532) T PRK07798 101 AVALVYHREFAPRVAEVLPR 120 (532) T ss_pred CCEEEECHHHHHHHHHHHHC T ss_conf 84999753367899988750 No 198 >COG0788 PurU Formyltetrahydrofolate hydrolase [Nucleotide transport and metabolism] Probab=28.56 E-value=45 Score=14.51 Aligned_cols=92 Identities=11% Similarity=0.131 Sum_probs=58.8 Q ss_pred CCCCCEEEECCCHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHH----HHHHHCC Q ss_conf 002781462062157789874981200346775444978899999999754993899818889789999----9997199 Q gi|254780717|r 184 PVEKKKIIVFHEAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRS----ITNDTGV 259 (294) Q Consensus 184 ~~~~~~~i~~H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~----ia~e~~~ 259 (294) +.+=..+++-|+.+...++.||+-..-+....+. -+.+-.++.+.+.+.++.+++.- |+++. +++...- T Consensus 117 ~~eI~~VIsNH~dl~~~v~~~~IPfhhip~~~~~--k~e~E~~~~~ll~~~~~DlvVLA-----RYMqILSpd~~~~~~g 189 (287) T COG0788 117 PAEIVAVISNHDDLRPLVERFDIPFHHIPVTKEN--KAEAEARLLELLEEYGADLVVLA-----RYMQILSPDFVERFPG 189 (287) T ss_pred CCCEEEEECCCHHHHHHHHHCCCCEEECCCCCCC--CHHHHHHHHHHHHHHCCCEEEEH-----HHHHHCCHHHHHHCCC T ss_conf 8725899748778899999749986654678775--34789999999998599878204-----6676578999986569 Q ss_pred CEEEECCCCCCCCCCCCCHHHHH Q ss_conf 08998053367877816789999 Q gi|254780717|r 260 MSAILDPEGMLIAEGPELYFQLM 282 (294) Q Consensus 260 ~~~~ld~~~~~~~~~~~~Y~~~m 282 (294) +++.+.+-=.-.-.|.+-|-+-. T Consensus 190 ~iINIHHSFLPaF~GA~PY~QA~ 212 (287) T COG0788 190 KIINIHHSFLPAFIGANPYHQAY 212 (287) T ss_pred CEEEECCCCCCCCCCCCHHHHHH T ss_conf 77885144456668987679998 No 199 >cd06318 PBP1_ABC_sugar_binding_like_9 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=28.46 E-value=45 Score=14.50 Aligned_cols=34 Identities=15% Similarity=0.221 Sum_probs=17.4 Q ss_pred CCCCEEEEECCCCCHHHHHHHHHCC-CCCCEEEEEE Q ss_conf 0257899967661035689997314-5576799751 Q gi|254780717|r 71 ENADIIFWLGAEMESFLVKPLHSLN-KQSNVVTLSH 105 (294) Q Consensus 71 ~~Adlii~~G~~~E~~~~~~~~~~~-~~~~~i~~~~ 105 (294) ++.|.|+....+-+. +...++.+. ...+++.+.. T Consensus 54 ~~vd~iii~~~d~~~-~~~~~~~a~~aGIPVv~~d~ 88 (282) T cd06318 54 RGVNVLIINPVDPEG-LVPAVAAAKAAGVPVVVVDS 88 (282) T ss_pred CCCCEEEEECCCHHH-HHHHHHHHHHCCCCEEEECC T ss_conf 699879981168056-69999999977998899737 No 200 >PRK12435 ferrochelatase; Provisional Probab=28.46 E-value=45 Score=14.50 Aligned_cols=213 Identities=12% Similarity=0.146 Sum_probs=99.0 Q ss_pred CCCCCCCCCCCHHHHHHHC--------CCCEEEEECCCC-CHHHHHHHHHCCCCC--CEEEEEECCCCCCCC---CCCC- Q ss_conf 8998353356802456650--------257899967661-035689997314557--679975135446210---0121- Q gi|254780717|r 53 GASSPHEYSLRISEAMMLE--------NADIIFWLGAEM-ESFLVKPLHSLNKQS--NVVTLSHSPDLHRIL---LRDN- 117 (294) Q Consensus 53 ~g~dpH~y~~tp~d~~~l~--------~Adlii~~G~~~-E~~~~~~~~~~~~~~--~~i~~~~~~~~~~~~---~~~~- 117 (294) +|.+|.. +.|-++..+|+ +.++.++.|..+ .|+++..++.+...+ +++.+.-.+--.... ..+. T Consensus 51 gg~SPl~-~~t~~qa~~L~~~L~~~~~~~~~~v~~gmRy~~P~i~~~l~~l~~~g~~~ii~lplyPqyS~~t~~s~~~~~ 129 (316) T PRK12435 51 GGISPLA-KITKEQAHKLTDSMNNMFTEYEFNCYLGLKHTAPFIEDAVEEMKRDGIEQAISIVLAPHYSTFSIKAYNERA 129 (316) T ss_pred CCCCHHH-HHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHHHH T ss_conf 8988549-999999999999971525898751677428999989999999998499569998667644542089999999 Q ss_pred ----CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCH-----------HHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf ----1123321224756776555567899999999742186565-----------8899899998631222358999850 Q gi|254780717|r 118 ----HSHFHDSEADDLHLWLNPLNVQYIAHVIAMELIKKDPRNK-----------IIYEKNEEEFKNQLSQLDKELHSIL 182 (294) Q Consensus 118 ----~~~~~~~~~~dpH~Wl~p~n~~~~a~~I~~~L~~~dP~~~-----------~~y~~N~~~~~~~L~~l~~~~~~~~ 182 (294) .......-..=+|++-.|.....+++.|.+.|.+...... .-+-...+-|..++....+.+.+.+ T Consensus 130 ~~~~~~~~~p~i~~I~~~~~~p~yI~a~a~~I~~~l~~~~~~~~~~llFSaHglP~~~i~~gDpY~~q~~~t~~~va~~l 209 (316) T PRK12435 130 IRLSEEIGGPVIEPIDQWYDEPKFISYWADQIKETFTKIEDKEKAVVIFSAHSLPEKIIAAGDPYVEQLQHTADLIAAAA 209 (316) T ss_pred HHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHCCCCCHHHHHHHHHHHHHHC T ss_conf 99998629986898268789879999999999999986636566579996277855563568980899999999999974 Q ss_pred CCCCCCEEEECCCHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEC-CCCHHHHHHH-------- Q ss_conf 3002781462062157789874981200346775444978899999999754993899818-8897899999-------- Q gi|254780717|r 183 QPVEKKKIIVFHEAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISDKISCLFYGP-EFDSKIIRSI-------- 253 (294) Q Consensus 183 ~~~~~~~~i~~H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~-~~~~~~~~~i-------- 253 (294) .- .+ ..++ |..++|. ..+=.+|+.. ..+.+..++.+.+.+++-| .|...-+|+| T Consensus 210 ~~-~~-~~~~-------yQSr~g~------p~~WL~P~t~--d~l~~l~~~~~~k~vvv~Pi~FvsDhlETL~EldiE~k 272 (316) T PRK12435 210 NI-QN-YTIG-------WQSAGNT------PDPWIGPDVQ--DLTRDLYEEHGYESFIYCPVGFVAEHLEVLYDNDYECK 272 (316) T ss_pred CC-CC-CEEE-------EECCCCC------CCCCCCCCHH--HHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHHHHH T ss_conf 99-98-5699-------8899998------9999998799--99999998279758999897430330788999999999 Q ss_pred --HHHHCCCEEEECCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf --99719908998053367877816789999999999 Q gi|254780717|r 254 --TNDTGVMSAILDPEGMLIAEGPELYFQLMRSMSNS 288 (294) Q Consensus 254 --a~e~~~~~~~ld~~~~~~~~~~~~Y~~~m~~n~~~ 288 (294) +++.|.+......++. -..|++.|...+.. T Consensus 273 e~a~e~G~~~~rvp~lN~-----~p~fI~~La~lV~~ 304 (316) T PRK12435 273 VVTDELNAAYFRPNMPNA-----QSTFIDCLATIVSR 304 (316) T ss_pred HHHHHCCCCEEECCCCCC-----CHHHHHHHHHHHHH T ss_conf 999975995796689999-----98999999999999 No 201 >cd02812 PcrB_like PcrB_like proteins. One member of this family, a protein from Archaeoglobus fulgidus, has been characterized as a (S)-3-O-geranylgeranylglyceryl phosphate synthase (AfGGGPS). AfGGGPS catalyzes the formation of an ether linkage between sn-glycerol-1-phosphate (G1P) and geranylgeranyl diphosphate (GGPP), the committed step in archaeal lipid biosynthesis. Therefore, it has been proposed that PcrB-like proteins are either prenyltransferases or are involved in lipoteichoic acid biosynthesis although the exact function is still unknown. Probab=28.42 E-value=45 Score=14.50 Aligned_cols=76 Identities=16% Similarity=0.310 Sum_probs=45.6 Q ss_pred CHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCC-HHHHHHHHHHHCCCEEEECCCCCCCCC Q ss_conf 2157789874981200346775444978899999999754993899818889-789999999719908998053367877 Q gi|254780717|r 195 EAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISDKISCLFYGPEFD-SKIIRSITNDTGVMSAILDPEGMLIAE 273 (294) Q Consensus 195 ~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~-~~~~~~ia~e~~~~~~~ld~~~~~~~~ 273 (294) .++.-.++.+|++...++ ..+..-.+.-++.+.+.+. +++ +|+.-+.. ...++.++ +.|+..+++ |+..++ T Consensus 138 ~ayAlaae~lg~~~iYLE-gSGa~v~~e~V~~vk~~l~--~~~-LivGGGIrs~e~a~~~~-~AgAD~IVv---Gn~iee 209 (219) T cd02812 138 AAYALAAEYLGMPIVYLE-YSGAYGPPEVVRAVKKVLG--DTP-LIVGGGIRSGEQAKEMA-EAGADTIVV---GNIVEE 209 (219) T ss_pred HHHHHHHHHHCCCEEEEE-CCCCCCCHHHHHHHHHHCC--CCE-EEEECCCCCHHHHHHHH-HCCCCEEEE---CCCEEE T ss_conf 999999998299389995-6899799999999998467--970-99928979999999999-869999998---872240 Q ss_pred CCCCH Q ss_conf 81678 Q gi|254780717|r 274 GPELY 278 (294) Q Consensus 274 ~~~~Y 278 (294) +++.. T Consensus 210 ~~~~~ 214 (219) T cd02812 210 DPNAA 214 (219) T ss_pred CHHHH T ss_conf 68999 No 202 >pfam00534 Glycos_transf_1 Glycosyl transferases group 1. Mutations in this domain of subunit A of phosphatidylinositol N-acetylglucosaminyltransferase lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharides. Members of this family transfer UDP, ADP, GDP or CMP linked sugars. The eukaryotic glycogen synthases may be distant members of this family. Probab=28.24 E-value=35 Score=15.16 Aligned_cols=14 Identities=21% Similarity=0.252 Sum_probs=5.0 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 55567899999999 Q gi|254780717|r 135 PLNVQYIAHVIAME 148 (294) Q Consensus 135 p~n~~~~a~~I~~~ 148 (294) |.+...+++.|.+. T Consensus 142 ~~~~~~l~~~i~~l 155 (172) T pfam00534 142 PGDAEALAEAIEKL 155 (172) T ss_pred CCCHHHHHHHHHHH T ss_conf 99999999999999 No 203 >PRK13697 cytochrome c6; Provisional Probab=27.90 E-value=43 Score=14.61 Aligned_cols=16 Identities=25% Similarity=0.658 Sum_probs=10.0 Q ss_pred CHHHHHHHHHHHHHHH Q ss_conf 9158999999999960 Q gi|254780717|r 1 MKNFLIILIFLFFILS 16 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~ 16 (294) |||++.++++.++++. T Consensus 1 Mk~l~~~~l~~~~~~~ 16 (111) T PRK13697 1 MKKILKLVLLTLLLLT 16 (111) T ss_pred CHHHHHHHHHHHHHHH T ss_conf 9789999999999999 No 204 >cd06305 PBP1_methylthioribose_binding_like Methylthioribose-binding protein-like of ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily. Methylthioribose-binding protein-like of ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily, which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. The sugar-binding domain of the periplasmic proteins in this group is also homologous to the ligand-binding domain of eukaryotic receptors such as metabotropic glutamate receptor (mGluR), DNA-binding transcriptional repressors such as LacI and GalR. Probab=27.82 E-value=46 Score=14.43 Aligned_cols=35 Identities=14% Similarity=0.064 Sum_probs=18.8 Q ss_pred CCCCEEEEECCCCCHHHHHHHHHCC-CCCCEEEEEEC Q ss_conf 0257899967661035689997314-55767997513 Q gi|254780717|r 71 ENADIIFWLGAEMESFLVKPLHSLN-KQSNVVTLSHS 106 (294) Q Consensus 71 ~~Adlii~~G~~~E~~~~~~~~~~~-~~~~~i~~~~~ 106 (294) ++.|.|+....+-+.. ...++.+. ...+++.+... T Consensus 54 ~~vDgIii~p~d~~~~-~~~l~~a~~aGIPVV~~d~~ 89 (273) T cd06305 54 QKVDAIIIQHGRAEVL-KPWVKRALDAGIPVVAFDVD 89 (273) T ss_pred CCCCEEEEECCCHHHH-HHHHHHHHHCCCCEEEECCC T ss_conf 5999999946871444-89999999859978998177 No 205 >PRK12696 flgH flagellar basal body L-ring protein; Reviewed Probab=27.74 E-value=46 Score=14.42 Aligned_cols=20 Identities=20% Similarity=0.335 Sum_probs=13.7 Q ss_pred CHHHHHHHHHHHHHHHCCCCC Q ss_conf 915899999999996001323 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARA 21 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~ 21 (294) ||| ++++++++++|++|+.. T Consensus 2 m~~-~l~~~~~~~~L~GC~~~ 21 (238) T PRK12696 2 IRK-LLAASCAVLLLSGCNAA 21 (238) T ss_pred HHH-HHHHHHHHHHHHCCCCC T ss_conf 589-99999999995165688 No 206 >pfam09711 Cas_Csn2 CRISPR-associated protein (Cas_Csn2). CRISPR loci appear to be mobile elements with a wide host range. This entry represents proteins found only in CRISPR-containing species, near other CRISPR-associated proteins (cas). The species range so far for these proteins is pathogenic bacteria only. Clusters of short DNA repeats with nonhomologous spacers, which are found at regular intervals in the genomes of phylogenetically distinct prokaryotic species, comprise a family with recognisable features. This family is known as CRISPR (short for Clustered, Regularly Interspaced Short Palindromic Repeats). Probab=27.58 E-value=46 Score=14.40 Aligned_cols=32 Identities=13% Similarity=0.295 Sum_probs=25.2 Q ss_pred CCCCCCCCHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 67754449788999999997549938998188 Q gi|254780717|r 213 MSHSVFMGAASLRNIRSKIISDKISCLFYGPE 244 (294) Q Consensus 213 ~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~ 244 (294) .+...=.+-..+.++++.+.=.++.++|.||- T Consensus 137 vN~~~Ylt~eE~~~l~eYi~l~~~~vLFlEpr 168 (188) T pfam09711 137 VNSLAYLTKDEIAQLLEYISLSQADVLFLEPR 168 (188) T ss_pred EECCCCCCHHHHHHHHHHHHHCCCEEEEECCC T ss_conf 94200068999999999860324349996252 No 207 >pfam06316 Ail_Lom Enterobacterial Ail/Lom protein. This family consists of several bacterial and phage Ail/Lom-like proteins. The Yersinia enterocolitica Ail protein is a known virulence factor. Proteins in this family are predicted to consist of eight transmembrane beta-sheets and four cell surface-exposed loops. It is thought that Ail directly promotes invasion and loop 2 contains an active site, perhaps a receptor-binding domain. The phage protein Lom is expressed during lysogeny, and encode host-cell envelope proteins. Lom is found in the bacterial outer membrane, and is homologous to virulence proteins of two other enterobacterial genera. It has been suggested that lysogeny may generally have a role in bacterial survival in animal hosts, and perhaps in pathogenesis. Probab=26.84 E-value=46 Score=14.43 Aligned_cols=19 Identities=11% Similarity=0.224 Sum_probs=15.1 Q ss_pred CHHHHHHHHHHHHHHHCCC Q ss_conf 9158999999999960013 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVA 19 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~ 19 (294) |||+...++++.++++.++ T Consensus 1 m~~~~~~~l~~~~~l~~~~ 19 (199) T pfam06316 1 MRKLCAAILSAAICLAAAG 19 (199) T ss_pred CCHHHHHHHHHHHHHHHHC T ss_conf 9017789999999999727 No 208 >PRK04922 tolB translocation protein TolB; Provisional Probab=26.78 E-value=48 Score=14.31 Aligned_cols=29 Identities=14% Similarity=0.072 Sum_probs=17.6 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 65555678999999997421865658899 Q gi|254780717|r 133 LNPLNVQYIAHVIAMELIKKDPRNKIIYE 161 (294) Q Consensus 133 l~p~n~~~~a~~I~~~L~~~dP~~~~~y~ 161 (294) ..+.++..++..|++..-+.---.+-+|. T Consensus 139 ~~~~~~R~~aH~isD~Iye~LTG~~G~F~ 167 (439) T PRK04922 139 ARANAMRDVAHQMADAIYEKITGVRGAFW 167 (439) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCEE T ss_conf 08899899999999999997409987402 No 209 >PRK07050 cystathionine beta-lyase; Provisional Probab=26.75 E-value=48 Score=14.31 Aligned_cols=134 Identities=6% Similarity=-0.020 Sum_probs=56.6 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHH---HCCCCCCCEE-----EECCCHHHHHHHHCCCC--EECCCCCCCCCCCH Q ss_conf 18656588998999986312223589998---5030027814-----62062157789874981--20034677544497 Q gi|254780717|r 152 KDPRNKIIYEKNEEEFKNQLSQLDKELHS---ILQPVEKKKI-----IVFHEAYRYFASHYNLS--IVTFPMSHSVFMGA 221 (294) Q Consensus 152 ~dP~~~~~y~~N~~~~~~~L~~l~~~~~~---~~~~~~~~~~-----i~~H~a~~Yf~~~~gl~--~~~~~~~~~~~~s~ 221 (294) +.|..+-......+.+.-+++...+--.. .+.......- +-.||.+..+.+.|+=. +.++.. +...+. T Consensus 246 ~sp~~a~l~~rGL~TL~lRm~r~~~nA~~lA~~L~~hp~V~~V~yPgL~s~p~h~~a~~~~~g~gg~~s~~~--~~~~~~ 323 (394) T PRK07050 246 VSADDCSLVLRGLPSMQVRFDAHSRSALELAQWLKARPEIATVLHPALPDCPGHAFWMRDFTGAGGLFSVVF--DERYDA 323 (394) T ss_pred CCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHCCCCCEEEEEEE--CCCCCH T ss_conf 898999999748984788899999999999999861898108989998898159999975799962899997--566799 Q ss_pred HHHHHHHHHHHHCCCCEEE--EECCCCH-HHHHHHHHHHCCC--EEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 8899999999754993899--8188897-8999999971990--8998053367877816789999999999987419 Q gi|254780717|r 222 ASLRNIRSKIISDKISCLF--YGPEFDS-KIIRSITNDTGVM--SAILDPEGMLIAEGPELYFQLMRSMSNSIAKNCS 294 (294) Q Consensus 222 ~~l~~~~~~ik~~~v~~if--~e~~~~~-~~~~~ia~e~~~~--~~~ld~~~~~~~~~~~~Y~~~m~~n~~~l~k~ls 294 (294) ..+.++.+.++=-+..+=+ +|+-.-+ .....-..+.+.+ .+.| +.|. +.-.+++...-++|+.+|| T Consensus 324 ~~~~~f~~~l~l~~~a~SlGg~eSL~~p~~~~~~~~~~~~~~~~liRl-SvGl------Ed~~DLi~DL~qAL~aa~~ 394 (394) T PRK07050 324 AQVDAFVEALELFAIGWSWGGACSLAMPYDVASMRTGKWPHRGTLVRF-YIGL------EDEADLRADLEQALQAALA 394 (394) T ss_pred HHHHHHHHHCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEE-EECC------CCHHHHHHHHHHHHHHHHC T ss_conf 999999983882655026457664745897655562002789986999-9411------9999999999999997409 No 210 >pfam07424 TrbM TrbM. This family contains the bacterial protein TrbM (approximately 180 residues long). In Comamonas testosteroni T-2, TrbM is derived from the IncP1beta plasmid pTSA, which encodes the widespread genes for p-toluenesulfonate (TSA) degradation. Probab=26.43 E-value=48 Score=14.31 Aligned_cols=54 Identities=22% Similarity=0.325 Sum_probs=25.9 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCEEEEEECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHH Q ss_conf 91589999999999600132388059997228999999970794417887058998353356802 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARAGSLQVVVSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRIS 65 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~~~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~ 65 (294) |||+++.+++.+.++.+.. +..+.+.+...-++ .+...-+..|.-|-+.+|+-+ T Consensus 1 MKK~ll~~al~~~~~~s~~-A~~~d~ltgd~~la----------Cea~lCLss~~~p~EC~~sl~ 54 (189) T pfam07424 1 MKKILLAVALAFGMFGSAS-AQKLDVLTGDTRLA----------CEAILCLSSGVKPSECSPSLA 54 (189) T ss_pred CCHHHHHHHHHHHHHHCCC-CCHHHHHCCCHHHH----------HHHHHEECCCCCCCCCCHHHH T ss_conf 9201799999999971476-42003414853756----------465430058999743346778 No 211 >PRK11548 hypothetical protein; Provisional Probab=26.37 E-value=49 Score=14.27 Aligned_cols=52 Identities=10% Similarity=-0.029 Sum_probs=24.7 Q ss_pred CHHHHHHHHHHHHHHHCCCCCC----CEEEEEECHHHHHHHHHHH--CCCCEEEEEEC Q ss_conf 9158999999999960013238----8059997228999999970--79441788705 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARAG----SLQVVVSIKPIHSIVSCIM--QGIGTPALLVK 52 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~~----~~~Vv~s~~pl~~lv~~I~--gd~~~V~~L~~ 52 (294) ||.+....+++++++++|+... ++.|.=-+..-...+.++- =.+.+|..++. T Consensus 3 ~k~l~~a~ll~~lllsgCS~~~~~vYk~dI~QGN~i~~~~v~~L~~GMTk~QV~~iLG 60 (113) T PRK11548 3 CKTLTAAAAVLLMLTAGCSTLERVVYRPDINQGNYLTPNDVAKIRVGMTQQQVAYALG 60 (113) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCHHHHHHHCC T ss_conf 5788999999999980115787756576557567669999987618999999998718 No 212 >PRK05967 cystathionine beta-lyase; Provisional Probab=26.20 E-value=49 Score=14.25 Aligned_cols=129 Identities=11% Similarity=0.068 Sum_probs=52.6 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHH---HCCCCCCCEE-----EECCCHHHHHHHHCC-C-CEECCCCCCCCCCCH Q ss_conf 18656588998999986312223589998---5030027814-----620621577898749-8-120034677544497 Q gi|254780717|r 152 KDPRNKIIYEKNEEEFKNQLSQLDKELHS---ILQPVEKKKI-----IVFHEAYRYFASHYN-L-SIVTFPMSHSVFMGA 221 (294) Q Consensus 152 ~dP~~~~~y~~N~~~~~~~L~~l~~~~~~---~~~~~~~~~~-----i~~H~a~~Yf~~~~g-l-~~~~~~~~~~~~~s~ 221 (294) +.|..+-......+.+.-+++.-.+--.. .+.......- +..||....+.+.++ . -+.++....+.+ T Consensus 244 l~P~~a~l~lrGl~TL~lRm~r~~~nA~~lA~~L~~hp~V~~V~yPgL~shp~h~la~~~~~g~gg~~sf~l~~g~~--- 320 (392) T PRK05967 244 AGPDDTYQILRGLRTMGIRLEHHRKSALEIARWLEGRPMVARVLHPALPSHPGHEIWKRDFSGASGIFSFVLAAGGH--- 320 (392) T ss_pred CCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCHHHHHHHCCCCCEEEEEEEECCCH--- T ss_conf 79788899871776099999999998999999997199976897999889944999998566897389999926839--- Q ss_pred HHHHHHHHHHHHCC-------CCEEEEECCCCHHHHHHHHHHHCCC--EEEECCCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 88999999997549-------9389981888978999999971990--89980533678778167899999999999874 Q gi|254780717|r 222 ASLRNIRSKIISDK-------ISCLFYGPEFDSKIIRSITNDTGVM--SAILDPEGMLIAEGPELYFQLMRSMSNSIAKN 292 (294) Q Consensus 222 ~~l~~~~~~ik~~~-------v~~if~e~~~~~~~~~~ia~e~~~~--~~~ld~~~~~~~~~~~~Y~~~m~~n~~~l~k~ 292 (294) ....++++.++--. +..+..-+..+. .-..+.+++ .+.| +.|. +...+++...-++|+.+ T Consensus 321 ~~~~~f~~~l~l~~~a~SlGg~eSL~~h~~~~~----~~~~~~~~~~~liRl-SVGl------Ed~~DLi~DL~qAL~aa 389 (392) T PRK05967 321 AQAHAFLDALEIFGLGYSWGGYESLALHVKLGD----RTVAKNPYAGPVIRL-QIGL------EDVPDLKADLERGFAAA 389 (392) T ss_pred HHHHHHHHHCCCCEEECCCCCCHHEEECCCCCH----HHHHHCCCCCCEEEE-EECC------CCHHHHHHHHHHHHHHH T ss_conf 999999982871657315577002026888998----999747999898999-9440------99999999999999973 Q ss_pred CC Q ss_conf 19 Q gi|254780717|r 293 CS 294 (294) Q Consensus 293 ls 294 (294) -| T Consensus 390 ~a 391 (392) T PRK05967 390 SA 391 (392) T ss_pred HC T ss_conf 35 No 213 >smart00187 INB Integrin beta subunits (N-terminal portion of extracellular region). Portion of beta integrins that lies N-terminal to their EGF-like repeats. Integrins are cell adhesion molecules that mediate cell-extracellular matrix and cell-cell interactions. They contain both alpha and beta subunits. Beta integrins are proposed to have a von Willebrand factor type-A "insert" or "I" -like domain (although this remains to be confirmed). Probab=26.09 E-value=49 Score=14.24 Aligned_cols=94 Identities=13% Similarity=0.205 Sum_probs=57.1 Q ss_pred CCCEEEECCCHHHHHHHHCCCCEECCCC-------------CCC----CCCCHHHHHHHHHHHHHCCCCEEEEECCCCHH Q ss_conf 2781462062157789874981200346-------------775----44497889999999975499389981888978 Q gi|254780717|r 186 EKKKIIVFHEAYRYFASHYNLSIVTFPM-------------SHS----VFMGAASLRNIRSKIISDKISCLFYGPEFDSK 248 (294) Q Consensus 186 ~~~~~i~~H~a~~Yf~~~~gl~~~~~~~-------------~~~----~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~ 248 (294) .+|-+|...||.-+||.+=.| .++.. ... .=|| +..+.+++++++|..||.-.+.-.. T Consensus 229 arrllVf~TDa~fH~AgDGkL--~GIv~PNDg~CHLd~~g~Yt~s~~~DYPS---i~ql~~kl~ennI~~IFAVT~~~~~ 303 (423) T smart00187 229 ARRLLVFSTDAGFHFAGDGKL--AGIVQPNDGQCHLDNNGEYTMSTTQDYPS---IGQLNQKLAENNINPIFAVTKKQVS 303 (423) T ss_pred CEEEEEEECCCCCCCCCCCCE--EEEECCCCCCEEECCCCCCCCCCCCCCCC---HHHHHHHHHHCCCEEEEEECCCHHH T ss_conf 438999983786302367624--43543788730327888524456567887---8999999985393279985220456 Q ss_pred HHHHHHHHH-CCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 999999971-9908998053367877816789999999999987 Q gi|254780717|r 249 IIRSITNDT-GVMSAILDPEGMLIAEGPELYFQLMRSMSNSIAK 291 (294) Q Consensus 249 ~~~~ia~e~-~~~~~~ld~~~~~~~~~~~~Y~~~m~~n~~~l~k 291 (294) +.+.|++-. |..+..|...+ ..-+++++...+.|.. T Consensus 304 ~Y~~Ls~~ipgs~vg~L~~DS-------sNVv~LI~~aY~ki~S 340 (423) T smart00187 304 LYKELSALIPGSSVGVLSEDS-------SNVVELIKDAYNKISS 340 (423) T ss_pred HHHHHHHHCCCCEEEEECCCC-------HHHHHHHHHHHHHHCE T ss_conf 999998757754035524575-------1389999999987501 No 214 >PRK13861 type IV secretion system protein VirB9; Provisional Probab=25.87 E-value=39 Score=14.85 Aligned_cols=11 Identities=9% Similarity=0.054 Sum_probs=6.5 Q ss_pred CCCCCCCCCHH Q ss_conf 98353356802 Q gi|254780717|r 55 SSPHEYSLRIS 65 (294) Q Consensus 55 ~dpH~y~~tp~ 65 (294) .|+-.|+.+|+ T Consensus 71 Gd~~~W~v~~~ 81 (293) T PRK13861 71 SNSKDLAALPR 81 (293) T ss_pred CCCCCEEEECC T ss_conf 88201489627 No 215 >pfam03932 CutC CutC family. Copper transport in Escherichia coli is mediated by the products of at least six genes, cutA, cutB, cutC, cutD, cutE, and cutF. A mutation in one or more of these genes results in an increased copper sensitivity. Members of this family are between 200 and 300 amino acids in length are found in both eukaryotes and bacteria. Probab=25.57 E-value=50 Score=14.18 Aligned_cols=71 Identities=15% Similarity=0.207 Sum_probs=40.7 Q ss_pred EECCCHHHHHHHH---------CCCCEECCCCCCCCCCCHHH-HHHHHHHHHHCCCC-EEEEECCCCHHHHHHHHHHHCC Q ss_conf 6206215778987---------49812003467754449788-99999999754993-8998188897899999997199 Q gi|254780717|r 191 IVFHEAYRYFASH---------YNLSIVTFPMSHSVFMGAAS-LRNIRSKIISDKIS-CLFYGPEFDSKIIRSITNDTGV 259 (294) Q Consensus 191 i~~H~a~~Yf~~~---------~gl~~~~~~~~~~~~~s~~~-l~~~~~~ik~~~v~-~if~e~~~~~~~~~~ia~e~~~ 259 (294) +|+|-||.+.... +|++- + ..++.++++-+ +..+.+.++..+-+ -|..-...++..++.|.+++|+ T Consensus 117 ~TFHRAfD~~~d~~~al~~L~~lG~~r--I-LTSGg~~~a~~g~~~L~~l~~~a~~~i~Im~GgGI~~~N~~~l~~~~g~ 193 (202) T pfam03932 117 VTFHRAFDMCPDPEEALEQLIELGCER--V-LTSGGPLSALEGLEKLASLVAQAGGRISIMAGAGVNAENIAELRQLTGV 193 (202) T ss_pred EEEECHHHCCCCHHHHHHHHHHCCCCE--E-ECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCC T ss_conf 598620430599999999999759987--8-7579978766749999999996599849995799899999999997199 Q ss_pred CEEEE Q ss_conf 08998 Q gi|254780717|r 260 MSAIL 264 (294) Q Consensus 260 ~~~~l 264 (294) +-... T Consensus 194 ~~~H~ 198 (202) T pfam03932 194 AEVHG 198 (202) T ss_pred CEEEE T ss_conf 48856 No 216 >PRK00178 tolB translocation protein TolB; Provisional Probab=25.55 E-value=50 Score=14.17 Aligned_cols=29 Identities=21% Similarity=0.140 Sum_probs=16.1 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 55556789999999974218656588998 Q gi|254780717|r 134 NPLNVQYIAHVIAMELIKKDPRNKIIYEK 162 (294) Q Consensus 134 ~p~n~~~~a~~I~~~L~~~dP~~~~~y~~ 162 (294) ++.++..++..|++.+-+.--..+-.|.. T Consensus 135 ~~~~~R~~aH~iaD~Iy~~lTG~~G~F~t 163 (433) T PRK00178 135 STDQLRDMAHYIADQSFEKLTGIKGAFST 163 (433) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCCEEE T ss_conf 78998999999999999983099743022 No 217 >PRK09578 periplasmic multidrug efflux lipoprotein precursor; Reviewed Probab=25.52 E-value=50 Score=14.17 Aligned_cols=69 Identities=12% Similarity=0.082 Sum_probs=31.7 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCEEEEEECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHCCCCEEEEECC Q ss_conf 15899999999996001323880599972289999999707944178870589983533568024566502578999676 Q gi|254780717|r 2 KNFLIILIFLFFILSSVARAGSLQVVVSIKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRISEAMMLENADIIFWLGA 81 (294) Q Consensus 2 kk~l~~~~~~~~~l~~~~~~~~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~G~ 81 (294) ||++++++++.++|++|+...+..--. .| -.+.|..+-+. .-+... --+..++....+++-+-.+. T Consensus 7 ~~~~l~~l~~~~~L~gC~~~~~~~~~~--~~----------~~V~v~~v~~~-~~~~~~-~~~G~v~a~~~v~l~~~v~G 72 (385) T PRK09578 7 RRLALAALVAAFALAGCGKGDSDAAAE--AP----------REATVVTVRPT-SVPMTV-ELPGRLDAYRQAEVRARVAG 72 (385) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCC--CC----------CEEEEEEEEEE-CCCEEE-EEEEEEEEEEEEEEEEECCE T ss_conf 999999999999981679998787899--99----------60389998773-322699-99999998799999966446 Q ss_pred CCC Q ss_conf 610 Q gi|254780717|r 82 EME 84 (294) Q Consensus 82 ~~E 84 (294) -.+ T Consensus 73 ~v~ 75 (385) T PRK09578 73 IVT 75 (385) T ss_pred EEE T ss_conf 999 No 218 >COG0540 PyrB Aspartate carbamoyltransferase, catalytic chain [Nucleotide transport and metabolism] Probab=25.44 E-value=51 Score=14.16 Aligned_cols=72 Identities=7% Similarity=-0.081 Sum_probs=32.7 Q ss_pred HHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCEEEEEECC Q ss_conf 999999970794417887058998353356802456650257899967661035689997314557679975135 Q gi|254780717|r 33 IHSIVSCIMQGIGTPALLVKGASSPHEYSLRISEAMMLENADIIFWLGAEMESFLVKPLHSLNKQSNVVTLSHSP 107 (294) Q Consensus 33 l~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~i~~~~~~ 107 (294) ...-.+.+|||.+.+..--....--.++-=|-+-+.+.. +|+++.=. ..|.....+.+..+.. .+|+.+++. T Consensus 64 FE~A~krLG~~Vv~~~~~~sSs~KGEtL~DT~~tl~ayg-~D~iViRH-~~egaa~~~a~~~~~~-pvINaGDG~ 135 (316) T COG0540 64 FETAMKRLGADVVNFSDSESSSKKGETLADTIRTLSAYG-VDAIVIRH-PEEGAARLLAEFSGVN-PVINAGDGS 135 (316) T ss_pred HHHHHHHCCCCEEEECCCCCCCCCCCCHHHHHHHHHHHC-CCEEEEEC-CCCCHHHHHHHHCCCC-CEEECCCCC T ss_conf 999999729938964587665446610999999998607-99899947-6414899998735888-547888899 No 219 >cd06302 PBP1_LsrB_Quorum_Sensing Periplasmic binding domain of autoinducer-2 (AI-2) receptor LsrB from Salmonella typhimurium and its close homologs. Periplasmic binding domain of autoinducer-2 (AI-2) receptor LsrB from Salmonella typhimurium and its close homologs from other bacteria. The members of this group are homologous to a family of periplasmic pentose/hexose sugar-binding proteins that function as the primary receptors for chemotaxis and transporters of many sugar based solutes in bacteria and archaea and that are a member of the type I periplasmic binding protein superfamily. LsrB binds a chemically distinct form of the AI-2 signal that lacks boron, in contrast to the Vibrio harveyi AI-2 signaling molecule that has an unusual furanosyl borate diester. Hence, many bacteria coordinate their gene expression according to the local density of their population by producing species specific AI-2. This process of quorum sensing allows LsrB to function as a periplasmic AI-2 binding p Probab=24.98 E-value=52 Score=14.11 Aligned_cols=69 Identities=13% Similarity=0.051 Sum_probs=32.3 Q ss_pred EECHHHHHHHHHHHCC---C--CEEEEEECCCCCCCCCCCCHHHHHH-----HCCCCEEEEECCCCCHHHHHHHHHCC-C Q ss_conf 9722899999997079---4--4178870589983533568024566-----50257899967661035689997314-5 Q gi|254780717|r 28 VSIKPIHSIVSCIMQG---I--GTPALLVKGASSPHEYSLRISEAMM-----LENADIIFWLGAEMESFLVKPLHSLN-K 96 (294) Q Consensus 28 ~s~~pl~~lv~~I~gd---~--~~V~~L~~~g~dpH~y~~tp~d~~~-----l~~Adlii~~G~~~E~~~~~~~~~~~-~ 96 (294) ++..|++.-+++=+.+ . +++..+-|...|+ .+++.. -++.|.|+....+-+. +...++.+. . T Consensus 8 ~~~~pfw~~v~~Gae~aA~e~G~~v~~~~~~~~D~------~~Q~~~ie~~Ia~gvDaIii~p~d~~a-~~~~i~~A~~a 80 (298) T cd06302 8 VTGIPYFNRMEEGAKEAAKELGVDAIYVGPTTADA------AGQVQIIEDLIAQGVDAIAVVPNDPDA-LEPVLKKAREA 80 (298) T ss_pred CCCCHHHHHHHHHHHHHHHHHCCEEEEECCCCCCH------HHHHHHHHHHHHCCCCEEEEECCCHHH-HHHHHHHHHHC T ss_conf 89986999999999999999799999988998999------999999999997399889990687788-89999999867 Q ss_pred CCCEEEE Q ss_conf 5767997 Q gi|254780717|r 97 QSNVVTL 103 (294) Q Consensus 97 ~~~~i~~ 103 (294) ..+++.+ T Consensus 81 GIpVv~~ 87 (298) T cd06302 81 GIKVVTH 87 (298) T ss_pred CCEEEEE T ss_conf 9869982 No 220 >PRK02998 prsA peptidylprolyl isomerase; Reviewed Probab=24.80 E-value=52 Score=14.08 Aligned_cols=27 Identities=30% Similarity=0.300 Sum_probs=12.8 Q ss_pred CHHH--HHHH--HHHHHHHHCCCCCCCEEEEEE Q ss_conf 9158--9999--999999600132388059997 Q gi|254780717|r 1 MKNF--LIIL--IFLFFILSSVARAGSLQVVVS 29 (294) Q Consensus 1 Mkk~--l~~~--~~~~~~l~~~~~~~~~~Vv~s 29 (294) |||. ++.+ ++.++.|++|+++. .|+++ T Consensus 1 Mkkkk~~~~~~~~~~~l~LaaC~s~~--~vat~ 31 (283) T PRK02998 1 MKKKKLFIGTIISCVVLALSACGSSD--NVVTS 31 (283) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCC--CEEEE T ss_conf 93589999999999999998668998--17984 No 221 >PRK04043 tolB translocation protein TolB; Provisional Probab=24.78 E-value=52 Score=14.08 Aligned_cols=76 Identities=11% Similarity=0.173 Sum_probs=35.4 Q ss_pred CHHHHHHHHHHHHHHHCCC-------CCCCEEEEEECHH------H-HHHHHHHHCCCCEEEEEEC--CCCCCCCCCCCH Q ss_conf 9158999999999960013-------2388059997228------9-9999997079441788705--899835335680 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVA-------RAGSLQVVVSIKP------I-HSIVSCIMQGIGTPALLVK--GASSPHEYSLRI 64 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~-------~~~~~~Vv~s~~p------l-~~lv~~I~gd~~~V~~L~~--~g~dpH~y~~tp 64 (294) |||+++++++++.++++-+ ....+.|++.... + .+|.+-|..| ....-+.. .....-.+++.. T Consensus 1 MKki~l~ll~~~~~~aae~~IeIt~~~~~~~~IAVvp~~~~~~~~l~~~i~~II~~D-L~rSG~F~~i~~~~~~~~~~~~ 79 (419) T PRK04043 1 MKKLVLFLLVSLGLFAVDATLDIVKKGQKLPKIVVEDSSDSNDANLKLKFFKILVND-LKVSSHFDVSENKTQSNYAINY 79 (419) T ss_pred CHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCEEEECCCCCCCCCHHHHHHHHHHHH-HHCCCCCCCCCCCCCCCCCCCH T ss_conf 955778899997774504799999478787658996168677641368999999875-3416585127876666677685 Q ss_pred HHHHHHCCCCEEEE Q ss_conf 24566502578999 Q gi|254780717|r 65 SEAMMLENADIIFW 78 (294) Q Consensus 65 ~d~~~l~~Adlii~ 78 (294) +|-+. .+||.++. T Consensus 80 ~~w~~-~~~d~lv~ 92 (419) T PRK04043 80 NELKD-KKVNLVAR 92 (419) T ss_pred HHHHH-HCCCEEEE T ss_conf 68775-09979999 No 222 >PRK00059 prsA peptidylprolyl isomerase; Provisional Probab=24.69 E-value=52 Score=14.07 Aligned_cols=80 Identities=14% Similarity=0.169 Sum_probs=35.9 Q ss_pred CCCCHHHHHHHHHHHHHHHH---------HHHHHHHHHCCCCCCCEEEECCCHHHHHHHHCCCCEEC------CCCCC-C Q ss_conf 86565889989999863122---------23589998503002781462062157789874981200------34677-5 Q gi|254780717|r 153 DPRNKIIYEKNEEEFKNQLS---------QLDKELHSILQPVEKKKIIVFHEAYRYFASHYNLSIVT------FPMSH-S 216 (294) Q Consensus 153 dP~~~~~y~~N~~~~~~~L~---------~l~~~~~~~~~~~~~~~~i~~H~a~~Yf~~~~gl~~~~------~~~~~-~ 216 (294) +-+-+.+|++|-..|..+-. ......++....++. -..|.=+++.|.-...+ ++.-+ + T Consensus 176 d~Ei~~yY~~nk~~f~~~p~~~~~~~Il~~~~~~A~~i~~~l~~------G~dF~~lAk~~S~D~~s~~~gGdLG~~~~~ 249 (336) T PRK00059 176 DKDAQKYYDENKSTFTEKPNTVHLSHILVKTEEEAKKVKKRLDK------GEDFAKVAKEVSQDPGSKDKGGDLGDVPYS 249 (336) T ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEEEEECCHHHHHHHHHHHHC------CCCHHHHHHHHCCCCCCCCCCCCCCCCCCC T ss_conf 89999999997887337866588998887899999999999877------998899999848995310148866640267 Q ss_pred CCCCHHHHHHHHHHHHHCCCCE Q ss_conf 4449788999999997549938 Q gi|254780717|r 217 VFMGAASLRNIRSKIISDKISC 238 (294) Q Consensus 217 ~~~s~~~l~~~~~~ik~~~v~~ 238 (294) ...-+..+....-.++...++- T Consensus 250 ~~~~~~~f~~a~f~Lk~GeiS~ 271 (336) T PRK00059 250 DSNYDKEFMDAAKALKEGEISA 271 (336) T ss_pred CCCCCHHHHHHHHHCCCCCCCC T ss_conf 6655668999997456888688 No 223 >PTZ00151 translationally controlled tumor protein; Provisional Probab=24.65 E-value=52 Score=14.07 Aligned_cols=36 Identities=14% Similarity=0.218 Sum_probs=30.6 Q ss_pred HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH Q ss_conf 567899999999742186565889989999863122 Q gi|254780717|r 137 NVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLS 172 (294) Q Consensus 137 n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~ 172 (294) ..+...+.|.+.|.+..|+.-+.|+++++++.+.+. T Consensus 92 yiK~YmK~v~~kLee~~perV~~Fk~~a~~~vK~il 127 (173) T PTZ00151 92 YIKKYIQRIVAYLEEKNPDRVEPFKTKAQPFVKHVL 127 (173) T ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999863298899999998899999997 No 224 >pfam02603 Hpr_kinase_N HPr Serine kinase N terminus. This family represents the N-terminal region of Hpr Serine/threonine kinase PtsK. This kinase is the sensor in a multicomponent phospho-relay system in control of carbon catabolic repression in bacteria. This kinase in unusual in that it recognizes the tertiary structure of its target and is a member of a novel family unrelated to any previously described protein phosphorylating enzymes. X-ray analysis of the full-length crystalline enzyme from Staphylococcus xylosus at a resolution of 1.95 A shows the enzyme to consist of two clearly separated domains that are assembled in a hexameric structure resembling a three-bladed propeller. The blades are formed by two N-terminal domains each, and the compact central hub assembles the C-terminal kinase domains. Probab=24.58 E-value=52 Score=14.06 Aligned_cols=48 Identities=10% Similarity=0.029 Sum_probs=35.6 Q ss_pred CCCCHHHHHHHHHHHHHCCCCEEEEE-CCCCHHHHHHHHHHHCCCEEEE Q ss_conf 44497889999999975499389981-8889789999999719908998 Q gi|254780717|r 217 VFMGAASLRNIRSKIISDKISCLFYG-PEFDSKIIRSITNDTGVMSAIL 264 (294) Q Consensus 217 ~~~s~~~l~~~~~~ik~~~v~~if~e-~~~~~~~~~~ia~e~~~~~~~l 264 (294) ...++....+..+.+-+.+++|++.- ....+..+-.+|++.|+++..- T Consensus 63 ~~l~~e~r~~~l~~l~~~~~P~iIvt~~~~~p~~l~~~a~~~~vPll~t 111 (127) T pfam02603 63 DQLTEEERKERLEKLFSYDTPCLIVTRGLEPPEELLEAAKKYGIPLLRS 111 (127) T ss_pred HHCCHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHHHHHHHHCCCEEEC T ss_conf 9699999999999985759988999799999999999999979957982 No 225 >PRK11251 DNA-binding transcriptional activator OsmE; Provisional Probab=24.46 E-value=53 Score=14.04 Aligned_cols=63 Identities=11% Similarity=0.071 Sum_probs=33.4 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCEEEEEECHHHHH---------HHHHHHCCC-CEEEEEECCCCCCCCCCCCHHH Q ss_conf 1589999999999600132388059997228999---------999970794-4178870589983533568024 Q gi|254780717|r 2 KNFLIILIFLFFILSSVARAGSLQVVVSIKPIHS---------IVSCIMQGI-GTPALLVKGASSPHEYSLRISE 66 (294) Q Consensus 2 kk~l~~~~~~~~~l~~~~~~~~~~Vv~s~~pl~~---------lv~~I~gd~-~~V~~L~~~g~dpH~y~~tp~d 66 (294) .+.++..++.+.+|++|+...+..+-.. .|+-. =|.+|+|.- -++..+..+|.+ ++|-++.+| T Consensus 2 ~~~~l~aaa~~~~LaGCt~~d~~~~~~~-ePlVkdV~~GMT~qqV~~igG~Pss~~~~~~~~GtC-~~YiL~~~d 74 (109) T PRK11251 2 MAGILSAAAVLTMLAGCTAYDRNPVQYV-EPVVKDVKKGMTRQQVLQIGGKPSSEVSMIHARGTC-QTYILGNRD 74 (109) T ss_pred CHHHHHHHHHHHHHHHCCCCCCCHHHHC-CHHHHHHHCCCCHHHHHHHCCCCCCEEEEECCCCCC-CEEEEECCC T ss_conf 0258899999999971447767813431-506988874757999998549986417886078522-003732468 No 226 >PRK10653 D-ribose transporter subunit RbsB; Provisional Probab=24.45 E-value=53 Score=14.04 Aligned_cols=100 Identities=13% Similarity=0.166 Sum_probs=44.5 Q ss_pred CHHHHHHHHHHHHHHHCCCCC-CCEEEEEE-----CHHHHHHHHH---HHCCCCEEEEE-ECCCCCCCCCCCCHHHHHHH Q ss_conf 915899999999996001323-88059997-----2289999999---70794417887-05899835335680245665 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARA-GSLQVVVS-----IKPIHSIVSC---IMQGIGTPALL-VKGASSPHEYSLRISEAMML 70 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~-~~~~Vv~s-----~~pl~~lv~~---I~gd~~~V~~L-~~~g~dpH~y~~tp~d~~~l 70 (294) |||++.++..+.++.+.++.+ .+.+|.+- .+.+..+++. .+.+. ....+ ...+.||.. --++++.+ T Consensus 3 ~~k~~~~~~~~~~~~~~~a~a~a~~TIgvivp~i~npff~~v~~gie~~a~~~-Gy~l~v~~s~~d~~~---q~~~i~~l 78 (295) T PRK10653 3 MKKLATLVSAVALSATVSANAMAKDTIALVVSTLNNPFFVSLKDGAQKEADKL-GYNLVVLDSQNNPAK---ELANVQDL 78 (295) T ss_pred HHHHHHHHHHHHHHHHCCCHHHCCCEEEEEECCCCCHHHHHHHHHHHHHHHHC-CCEEEEEECCCCHHH---HHHHHHHH T ss_conf 07999999999876521217752997999948987979999999999999976-998999839999999---99999999 Q ss_pred --CCCCEEEEECCCCCHHHHHHHHHCC-CCCCEEEEEE Q ss_conf --0257899967661035689997314-5576799751 Q gi|254780717|r 71 --ENADIIFWLGAEMESFLVKPLHSLN-KQSNVVTLSH 105 (294) Q Consensus 71 --~~Adlii~~G~~~E~~~~~~~~~~~-~~~~~i~~~~ 105 (294) ++.|.++.+..+.+. +...++.+. ...++|.+.. T Consensus 79 i~~~vdgiii~p~d~~~-~~~~~~~a~~~gIPvV~~d~ 115 (295) T PRK10653 79 TVRGTKILLINPTDSDA-VGNAVKMANQANIPVITLDR 115 (295) T ss_pred HHCCCCEEEECCCCHHH-HHHHHHHHHHCCCEEEEEEE T ss_conf 97699889984777065-69999999976981999951 No 227 >pfam05728 UPF0227 Uncharacterized protein family (UPF0227). Despite being classed as uncharacterized proteins, the members of this family are almost certainly enzymes that are distantly related to the pfam00561. Probab=24.41 E-value=53 Score=14.04 Aligned_cols=20 Identities=15% Similarity=-0.006 Sum_probs=8.7 Q ss_pred CHHHHHHHHHHHHHHHHHHH Q ss_conf 75677655556789999999 Q gi|254780717|r 128 DLHLWLNPLNVQYIAHVIAM 147 (294) Q Consensus 128 dpH~Wl~p~n~~~~a~~I~~ 147 (294) .|++=.+|..++.++..+-+ T Consensus 35 ~P~L~~~P~~ai~~l~~~I~ 54 (187) T pfam05728 35 TPHLPHDPQQALKELEKAVA 54 (187) T ss_pred CCCCCCCHHHHHHHHHHHHH T ss_conf 89999889999999999999 No 228 >PRK07812 O-acetylhomoserine aminocarboxypropyltransferase; Validated Probab=24.32 E-value=53 Score=14.03 Aligned_cols=130 Identities=12% Similarity=0.145 Sum_probs=56.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHH---HCCCCCCC-----EEEECCCHHHHHHHHC--CCC-EECCCCCCCCCCC Q ss_conf 18656588998999986312223589998---50300278-----1462062157789874--981-2003467754449 Q gi|254780717|r 152 KDPRNKIIYEKNEEEFKNQLSQLDKELHS---ILQPVEKK-----KIIVFHEAYRYFASHY--NLS-IVTFPMSHSVFMG 220 (294) Q Consensus 152 ~dP~~~~~y~~N~~~~~~~L~~l~~~~~~---~~~~~~~~-----~~i~~H~a~~Yf~~~~--gl~-~~~~~~~~~~~~s 220 (294) +.|..+-......+.+.-++++-.+--.+ .+...... +-+..||.+..+.+.+ |.- +.+++...+ T Consensus 281 lsP~da~L~lRGL~TL~lRm~r~~~nA~~iA~~L~~hp~V~~V~yPgL~shp~h~la~~~~~~G~G~~~sf~l~g~---- 356 (436) T PRK07812 281 ASPFNAFLIAQGLETLSLRIERHVANAQRVAEFLAARDDVLSVNYAGLPSSPWYERAKRLAPKGTGAVLSFELAGG---- 356 (436) T ss_pred CCCHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCHHHHHHHCCCCCCEEEEEEEECC---- T ss_conf 5807778987366838999999999999999999609996689898988883889999868999727999998098---- Q ss_pred HHHHHHHHHHHHHC-------CCCEEEEECCC--CHHHHHHHHHHHCCC--EEEECCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 78899999999754-------99389981888--978999999971990--89980533678778167899999999999 Q gi|254780717|r 221 AASLRNIRSKIISD-------KISCLFYGPEF--DSKIIRSITNDTGVM--SAILDPEGMLIAEGPELYFQLMRSMSNSI 289 (294) Q Consensus 221 ~~~l~~~~~~ik~~-------~v~~if~e~~~--~~~~~~~ia~e~~~~--~~~ld~~~~~~~~~~~~Y~~~m~~n~~~l 289 (294) .....++++.++=- ++..++.-|-. ...+.+.-.++.|+. .+.| +.|. +...+++...-++| T Consensus 357 ~~~~~~f~~~L~lf~~a~SlGg~eSLi~~Pa~~th~~~~~e~r~~~Gi~~~liRl-SVGl------Ed~eDLi~DL~qAL 429 (436) T PRK07812 357 VEAGKAFVNALTLHSHVANIGDVRSLVIHPASTTHAQLTPEEQLATGVTPGLVRL-AVGI------EGIDDILADLEAGF 429 (436) T ss_pred HHHHHHHHHCCCCCCEEECCCCCCEEEECCCCCCCCCCCHHHHHHCCCCCCEEEE-EECC------CCHHHHHHHHHHHH T ss_conf 9999999980895501047773661546775456432899999865989495999-9510------99999999999999 Q ss_pred HHH Q ss_conf 874 Q gi|254780717|r 290 AKN 292 (294) Q Consensus 290 ~k~ 292 (294) +.+ T Consensus 430 ~aa 432 (436) T PRK07812 430 AAA 432 (436) T ss_pred HHH T ss_conf 971 No 229 >cd04299 GT1_Glycogen_Phosphorylase_like This family is most closely related to the oligosaccharide phosphorylase domain family and other unidentified sequences. Oligosaccharide phosphorylase catalyzes the breakdown of oligosaccharides into glucose-1-phosphate units. They are important allosteric enzymes in carbohydrate metabolism. The members of this family are found in bacteria and Archaea. Probab=24.30 E-value=51 Score=14.15 Aligned_cols=43 Identities=12% Similarity=0.143 Sum_probs=29.0 Q ss_pred CCCCCCHHHHHHHHHHHHHCC--CCEEEEECCCCHHHHHHHHHHHC Q ss_conf 754449788999999997549--93899818889789999999719 Q gi|254780717|r 215 HSVFMGAASLRNIRSKIISDK--ISCLFYGPEFDSKIIRSITNDTG 258 (294) Q Consensus 215 ~~~~~s~~~l~~~~~~ik~~~--v~~if~e~~~~~~~~~~ia~e~~ 258 (294) |..+++-+-+.++.+..+.-. -+++|.| ..+-.+++.|.+++. T Consensus 524 P~D~~Gk~lI~~i~~~s~~~~~~~rivFLe-dYDm~lAr~Lv~GvD 568 (778) T cd04299 524 PADEPGKELIQEIVEFSRRPEFRGRIVFLE-DYDMALARHLVQGVD 568 (778) T ss_pred CCCHHHHHHHHHHHHHHHCCCCCCCEEEEC-CCCHHHHHHHHCCCE T ss_conf 987667999999999863805577589967-988699987506643 No 230 >KOG2045 consensus Probab=24.29 E-value=45 Score=14.47 Aligned_cols=17 Identities=18% Similarity=0.137 Sum_probs=10.2 Q ss_pred EEECHHHHHHHHHHHCC Q ss_conf 99722899999997079 Q gi|254780717|r 27 VVSIKPIHSIVSCIMQG 43 (294) Q Consensus 27 v~s~~pl~~lv~~I~gd 43 (294) +.+-+|...+|+.|+-+ T Consensus 859 ~~~~yPaqlVVedv~~e 875 (1493) T KOG2045 859 SVTGYPAQLVVEDVTVE 875 (1493) T ss_pred CCCCCCHHHHHHHHHCC T ss_conf 63456553322455322 No 231 >PRK11023 hypothetical protein; Provisional Probab=24.26 E-value=53 Score=14.02 Aligned_cols=20 Identities=15% Similarity=0.321 Sum_probs=12.4 Q ss_pred CHHHH-HHHHHHHHHHHCCCC Q ss_conf 91589-999999999600132 Q gi|254780717|r 1 MKNFL-IILIFLFFILSSVAR 20 (294) Q Consensus 1 Mkk~l-~~~~~~~~~l~~~~~ 20 (294) ||.+. +++++++++|++|+. T Consensus 1 mk~~~~l~~l~~~~~L~GC~~ 21 (191) T PRK11023 1 MKALSPLAVLISALLLQGCVA 21 (191) T ss_pred CCHHHHHHHHHHHHHHCCCHH T ss_conf 944889999999999726823 No 232 >cd01453 vWA_transcription_factor_IIH_type Transcription factors IIH type: TFIIH is a multiprotein complex that is one of the five general transcription factors that binds RNA polymerase II holoenzyme. Orthologues of these genes are found in all completed eukaryotic genomes and all these proteins contain a VWA domain. The p44 subunit of TFIIH functions as a DNA helicase in RNA polymerase II transcription initiation and DNA repair, and its transcriptional activity is dependent on its C-terminal Zn-binding domains. The function of the vWA domain is unclear, but may be involved in complex assembly. The MIDAS motif is not conserved in this sub-group. Probab=24.22 E-value=53 Score=14.02 Aligned_cols=55 Identities=18% Similarity=0.206 Sum_probs=39.7 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCEE-EECCCCCCCCCCCCCHHHHHHH Q ss_conf 788999999997549938998188897899999997199089-9805336787781678999999 Q gi|254780717|r 221 AASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMSA-ILDPEGMLIAEGPELYFQLMRS 284 (294) Q Consensus 221 ~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~~-~ld~~~~~~~~~~~~Y~~~m~~ 284 (294) |.++.+.++.+|+++|+|=+..-...-...+.|+++||..-. .+| ...|.++|.. T Consensus 122 pgdI~~ti~~lk~~~IrvsvI~l~aEv~I~k~l~~~TgG~y~V~ld---------e~H~~~ll~~ 177 (183) T cd01453 122 PGNIYETIDKLKKENIRVSVIGLSAEMHICKEICKATNGTYKVILD---------ETHLKELLLE 177 (183) T ss_pred CCCHHHHHHHHHHCCCEEEEEEECHHHHHHHHHHHHHCCEEEEECC---------HHHHHHHHHH T ss_conf 7649999999998397899997427899999999983997687539---------9999999995 No 233 >PRK10796 LPS-assembly lipoprotein RlpB; Provisional Probab=24.21 E-value=53 Score=14.01 Aligned_cols=42 Identities=12% Similarity=0.147 Sum_probs=23.6 Q ss_pred CHHHHHHHHHHHH-HHHCCCCC----------CCEEEEEECHHHHHHHHHHHC Q ss_conf 9158999999999-96001323----------880599972289999999707 Q gi|254780717|r 1 MKNFLIILIFLFF-ILSSVARA----------GSLQVVVSIKPIHSIVSCIMQ 42 (294) Q Consensus 1 Mkk~l~~~~~~~~-~l~~~~~~----------~~~~Vv~s~~pl~~lv~~I~g 42 (294) ||.+..+++.+.+ ++++|+.. -+.-.+.|..|+..+++.|-. T Consensus 1 ~r~l~~l~l~lavll~agCGFhLRg~~~lP~~l~tL~l~S~Dpys~Ltr~vr~ 53 (196) T PRK10796 1 MRYLATLLLSLAVLVTAGCGWHLRGTTQVPSEMKTMILDSGDPNGPLSRAVRN 53 (196) T ss_pred CHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCEEEEEECCCCCHHHHHHHH T ss_conf 95699999999999974767157689999864445899804878679999999 No 234 >cd07986 LPLAT_ACT14924-like Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: Unknown ACT14924. Lysophospholipid acyltransferase (LPLAT) superfamily member: acyltransferases of de novo and remodeling pathways of glycerophospholipid biosynthesis which catalyze the incorporation of an acyl group from either acylCoAs or acyl-acyl carrier proteins (acylACPs) into acceptors such as glycerol 3-phosphate, dihydroxyacetone phosphate or lyso-phosphatidic acid. Included in this subgroup are uncharacterized phospholipid/glycerol acyltransferases such as the Pectobacterium carotovorum subsp. carotovorum PC1 locus ACT14924 putative acyltransferase, and similar proteins. Probab=24.20 E-value=53 Score=14.01 Aligned_cols=53 Identities=9% Similarity=0.048 Sum_probs=37.0 Q ss_pred CCCHHHHHHHHHHHHHCCCCEEEEECCCC--------------HHHHHHHHHHHCCCEEEECCCCCC Q ss_conf 44978899999999754993899818889--------------789999999719908998053367 Q gi|254780717|r 218 FMGAASLRNIRSKIISDKISCLFYGPEFD--------------SKIIRSITNDTGVMSAILDPEGML 270 (294) Q Consensus 218 ~~s~~~l~~~~~~ik~~~v~~if~e~~~~--------------~~~~~~ia~e~~~~~~~ld~~~~~ 270 (294) ..+.+.+.+..+.+++.++-+||.|-..+ .+-+-.||++++++++++.-.|.+ T Consensus 82 ~~~~~~l~~a~~~L~~G~~l~IFPeG~vS~~~~~~~~~~d~~wk~g~~~lA~k~~~pIvPV~i~G~N 148 (210) T cd07986 82 AKNRESLREALRHLKNGGALIIFPAGRVSTASPPFGRVSDRPWNPFVARLARKAKAPVVPVYFSGRN 148 (210) T ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCEEEEEEEECC T ss_conf 5519999999999977997999658510036666776435664138999999859988989985228 No 235 >pfam08085 Entericidin Entericidin EcnA/B family. This family consists of the entericidin antidote/toxin peptides. The entericidin locus is activated in stationary phase under high osmolarity conditions by rho-S and simultaneously repressed by the osmoregulatory EnvZ/OmpR signal transduction pathway. The entericidin locus encodes tandem paralogous genes (ecnAB) and directs the synthesis of two small cell-envelope lipoproteins which can maintain plasmids in bacterial population by means of post-segregational killing. Probab=24.18 E-value=49 Score=14.22 Aligned_cols=18 Identities=28% Similarity=0.670 Sum_probs=9.7 Q ss_pred CHHHHHHHHHHHH---HHHCC Q ss_conf 9158999999999---96001 Q gi|254780717|r 1 MKNFLIILIFLFF---ILSSV 18 (294) Q Consensus 1 Mkk~l~~~~~~~~---~l~~~ 18 (294) |||.+++++++++ .+++| T Consensus 1 Mk~~~~~~~~l~~~~~~lagC 21 (42) T pfam08085 1 MKKLIALLLALLLLALVLAGC 21 (42) T ss_pred CHHHHHHHHHHHHHHHHHHCC T ss_conf 923699999999999998612 No 236 >TIGR02668 moaA_archaeal probable molybdenum cofactor biosynthesis protein A; InterPro: IPR013485 This entry consists of archaeal proteins which are predicted to be functionally equivalent to MoaA (molybdenum cofactor biosynthesis protein A) from bacteria (see IPR013483 from INTERPRO).; GO: 0046872 metal ion binding, 0006777 Mo-molybdopterin cofactor biosynthetic process. Probab=24.17 E-value=53 Score=14.01 Aligned_cols=47 Identities=13% Similarity=0.094 Sum_probs=22.6 Q ss_pred HHHHHHHHHHHHHCCCCEEEE-ECCCCHHHHHHHHHHHC-CCEEEECCCCC Q ss_conf 788999999997549938998-18889789999999719-90899805336 Q gi|254780717|r 221 AASLRNIRSKIISDKISCLFY-GPEFDSKIIRSITNDTG-VMSAILDPEGM 269 (294) Q Consensus 221 ~~~l~~~~~~ik~~~v~~if~-e~~~~~~~~~~ia~e~~-~~~~~ld~~~~ 269 (294) .-++..+.+.+++.-+++-.- +-+.-+++. |-.+.+ +.+-.+.||.. T Consensus 212 ~~d~~~ie~~l~~~A~~~~~r~~~H~R~~Y~--i~~~~~~~~VEvV~Pm~N 260 (324) T TIGR02668 212 HIDIDPIEEELEKRADRVETRREMHNRKKYF--IPGGDGIVEVEVVKPMDN 260 (324) T ss_pred CCCHHHHHHHHHHHCCCCHHCCCCCCCCEEE--ECCCCEEEEEEEEECCCC T ss_conf 1773789999997504110000046875333--028943799988747887 No 237 >COG5645 Predicted periplasmic lipoprotein [General function prediction only] Probab=24.11 E-value=38 Score=14.94 Aligned_cols=19 Identities=37% Similarity=0.686 Sum_probs=11.0 Q ss_pred CHHHHHHHHHHHHHHHCCCC Q ss_conf 91589999999999600132 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVAR 20 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~ 20 (294) ||++++.++.. +++++|++ T Consensus 1 mr~i~l~l~v~-lllSGC~S 19 (80) T COG5645 1 MRNILLSLMVL-LLLSGCGS 19 (80) T ss_pred CCEEHHHHHHH-HHHCCCCE T ss_conf 91532789999-98376410 No 238 >cd03818 GT1_ExpC_like This family is most closely related to the GT1 family of glycosyltransferases. ExpC in Rhizobium meliloti has been shown to be involved in the biosynthesis of galactoglucan (exopolysaccharide II). Probab=24.09 E-value=54 Score=14.00 Aligned_cols=15 Identities=13% Similarity=0.364 Sum_probs=6.5 Q ss_pred HHHHHHHHHCCCEEEEC Q ss_conf 99999997199089980 Q gi|254780717|r 249 IIRSITNDTGVMSAILD 265 (294) Q Consensus 249 ~~~~ia~e~~~~~~~ld 265 (294) .+|.++ +|.+++.-| T Consensus 317 llEAMA--~G~PVVas~ 331 (396) T cd03818 317 LLEAMA--CGCLVVGSD 331 (396) T ss_pred HHHHHH--CCCCEEEEC T ss_conf 899997--799899927 No 239 >COG5567 Predicted small periplasmic lipoprotein [Cell motility and secretion] Probab=23.87 E-value=54 Score=13.97 Aligned_cols=25 Identities=32% Similarity=0.436 Sum_probs=14.6 Q ss_pred CHHHHHHHH--HHHHHHHCCCCCCCEE Q ss_conf 915899999--9999960013238805 Q gi|254780717|r 1 MKNFLIILI--FLFFILSSVARAGSLQ 25 (294) Q Consensus 1 Mkk~l~~~~--~~~~~l~~~~~~~~~~ 25 (294) ||+.+..++ .+++.+++|...+++. T Consensus 1 mk~~~~s~~ala~l~sLA~CG~KGPLy 27 (58) T COG5567 1 MKNVFKSLLALATLFSLAGCGLKGPLY 27 (58) T ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 904899999999999998526678762 No 240 >cd06317 PBP1_ABC_sugar_binding_like_8 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Pperiplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=23.87 E-value=54 Score=13.97 Aligned_cols=33 Identities=15% Similarity=0.147 Sum_probs=16.6 Q ss_pred CCCCEEEEECCCCCHHHHHHHHHCC-CCCCEEEEE Q ss_conf 0257899967661035689997314-557679975 Q gi|254780717|r 71 ENADIIFWLGAEMESFLVKPLHSLN-KQSNVVTLS 104 (294) Q Consensus 71 ~~Adlii~~G~~~E~~~~~~~~~~~-~~~~~i~~~ 104 (294) ++.|.|+....+-+. +...++.+. ...+++.+. T Consensus 55 ~~vd~Iii~p~d~~~-~~~~i~~a~~agIpVv~~d 88 (275) T cd06317 55 QKVDGIILWPTDGQA-YIPGLRKAKQAGIPVVITN 88 (275) T ss_pred CCCCEEEEECCCCHH-HHHHHHHHHHCCCEEEEEC T ss_conf 599999996787124-5799999998699499976 No 241 >COG2875 CobM Precorrin-4 methylase [Coenzyme metabolism] Probab=23.83 E-value=54 Score=13.97 Aligned_cols=44 Identities=18% Similarity=0.353 Sum_probs=26.2 Q ss_pred CCCCCCCCHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCEEEEE Q ss_conf 8353356802456650257899967661035689997314557679975 Q gi|254780717|r 56 SPHEYSLRISEAMMLENADIIFWLGAEMESFLVKPLHSLNKQSNVVTLS 104 (294) Q Consensus 56 dpH~y~~tp~d~~~l~~Adlii~~G~~~E~~~~~~~~~~~~~~~~i~~~ 104 (294) ||.-- |-+-.+-|++||+++|-|.=.- .++++-..+++.+++.. T Consensus 14 dpdLi--Tvkg~~ll~~advviYAGSLV~---~elL~~~~~~aei~nSa 57 (254) T COG2875 14 DPDLI--TVKGQRLLEKADVVIYAGSLVP---PELLEYCRPDAEIVNSA 57 (254) T ss_pred CCCEE--EEHHHHHHHHCCEEEECCCCCC---HHHHHHCCCCCEEEECC T ss_conf 96525--5257888853999998887689---99996458998898247 No 242 >PRK08249 cystathionine gamma-synthase; Provisional Probab=23.61 E-value=55 Score=13.94 Aligned_cols=77 Identities=10% Similarity=0.144 Sum_probs=32.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHH---CCCCCC---C--EEEECCCHHHHHHHHCCC--CEECCCCCCCCCCCH Q ss_conf 186565889989999863122235899985---030027---8--146206215778987498--120034677544497 Q gi|254780717|r 152 KDPRNKIIYEKNEEEFKNQLSQLDKELHSI---LQPVEK---K--KIIVFHEAYRYFASHYNL--SIVTFPMSHSVFMGA 221 (294) Q Consensus 152 ~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~---~~~~~~---~--~~i~~H~a~~Yf~~~~gl--~~~~~~~~~~~~~s~ 221 (294) +.|..+-......+.+.-++++-.+--.+. +..... . +-+..||.+..+.+.+.= -+.+++...+ . T Consensus 244 lsP~~a~L~~RGL~TL~lRm~r~~~nA~~vA~~L~~hp~V~~V~yPgL~s~p~h~la~~~~~g~gg~~sf~l~g~----~ 319 (398) T PRK08249 244 LDPFAAYLILRGMKTLALRVRQQQENAMALAEYLQTEPLVEAVNYPGLPTHPNHAIAKAQMRGFGGMLSFALKGG----M 319 (398) T ss_pred CCCHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCHHHHHHHCCCCCCCEEEEEECC----H T ss_conf 891778998646897999999998899999999983998077989999999688999986799997117998189----9 Q ss_pred HHHHHHHHHHH Q ss_conf 88999999997 Q gi|254780717|r 222 ASLRNIRSKII 232 (294) Q Consensus 222 ~~l~~~~~~ik 232 (294) ....++.+.++ T Consensus 320 ~~~~~f~~~l~ 330 (398) T PRK08249 320 DTVRRLLPRLQ 330 (398) T ss_pred HHHHHHHHHCC T ss_conf 99999997188 No 243 >TIGR00514 accC acetyl-CoA carboxylase, biotin carboxylase; InterPro: IPR004549 This family represents the biotin carboxylase subunit found usually as a component of acetyl-CoA carboxylase. Acetyl-CoA carboxylase (6.4.1.2 from EC) is a heterohexamer of biotin carboxyl carrier protein, biotin carboxylase (6.3.4.14 from EC), and two subunits of carboxyl transferase in a 2:2 complex. In the first step of long-chain fatty acid synthesis, biotin carboxylase catalyses the carboxylation of the carrier protein and then the transcarboxylase transfers the carboxyl group to form malonyl-coA. Homologous domains are found in eukaryotic forms of acetyl-CoA carboxylase and in a number of other carboxylases (e.g. pyruvate carboxylase). In some systems, the biotin carboxyl carrier protein and this protein (biotin carboxylase) may be shared by different carboxyltransferases. However, this model is not intended to identify the biotin carboxylase domain of propionyl-coA carboxylase. The model should hit the full length of proteins, except for chloroplast transit peptides in plants. If it hits a domain only of a longer protein, there may be a problem with the identification.; GO: 0016874 ligase activity. Probab=23.49 E-value=55 Score=13.93 Aligned_cols=79 Identities=10% Similarity=0.073 Sum_probs=48.9 Q ss_pred CCCEEEECCCHHHHHHHHCCCC-------EECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECC-CCHH--HHHHHHH Q ss_conf 2781462062157789874981-------2003467754449788999999997549938998188-8978--9999999 Q gi|254780717|r 186 EKKKIIVFHEAYRYFASHYNLS-------IVTFPMSHSVFMGAASLRNIRSKIISDKISCLFYGPE-FDSK--IIRSITN 255 (294) Q Consensus 186 ~~~~~i~~H~a~~Yf~~~~gl~-------~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~-~~~~--~~~~ia~ 255 (294) .-+..-.-||.|+.|++--+|- +.-+++++++=-.--|=...++.+|+-+||||=-..+ .-.. -...||+ T Consensus 71 ~~tG~~AiHPGYGFLSENA~FAe~c~~~g~~FIGP~pe~Ir~MGDK~~A~~~mKkaGVP~VPGS~GP~~~~~~e~~~~A~ 150 (451) T TIGR00514 71 EITGADAIHPGYGFLSENADFAEICEDHGITFIGPSPEAIRLMGDKVTAKETMKKAGVPVVPGSDGPLVEDEEEAVRIAK 150 (451) T ss_pred HHCCCCEECCCCCHHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCHHHHHHHHHHCCCCEECCCCCCCCCCHHHHHHHHH T ss_conf 77488076288773443124788987388266678712011268868999999748876623888863102788999997 Q ss_pred HHCCCEEEE Q ss_conf 719908998 Q gi|254780717|r 256 DTGVMSAIL 264 (294) Q Consensus 256 e~~~~~~~l 264 (294) +.|-|++.= T Consensus 151 ~IGyPv~IK 159 (451) T TIGR00514 151 EIGYPVIIK 159 (451) T ss_pred HCCCCEEEE T ss_conf 478968999 No 244 >PRK04870 histidinol-phosphate aminotransferase; Provisional Probab=23.42 E-value=55 Score=13.92 Aligned_cols=14 Identities=0% Similarity=-0.114 Sum_probs=7.6 Q ss_pred HHHHHHHHHHHHHC Q ss_conf 99999999998741 Q gi|254780717|r 280 QLMRSMSNSIAKNC 293 (294) Q Consensus 280 ~~m~~n~~~l~k~l 293 (294) +-|...+++|++.| T Consensus 343 een~~ll~aLk~~L 356 (356) T PRK04870 343 EENAQFLAALKLAL 356 (356) T ss_pred HHHHHHHHHHHHHC T ss_conf 99999999999859 No 245 >KOG4549 consensus Probab=23.33 E-value=55 Score=13.91 Aligned_cols=47 Identities=11% Similarity=0.011 Sum_probs=39.0 Q ss_pred CCCCCC-HHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCE Q ss_conf 754449-78899999999754993899818889789999999719908 Q gi|254780717|r 215 HSVFMG-AASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMS 261 (294) Q Consensus 215 ~~~~~s-~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~ 261 (294) -+++++ -.|+..++..+|+.+|.|+.......++++.+..+-.+++. T Consensus 39 ~g~e~~fY~Di~rIL~dLk~~GVtl~~ASRt~ap~iA~q~L~~fkvk~ 86 (144) T KOG4549 39 KGEEMIFYDDIRRILVDLKKLGVTLIHASRTMAPQIASQGLETFKVKQ 86 (144) T ss_pred CCCEEEECCCHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHCCCC T ss_conf 760456604356789999865928999527788899999999855575 No 246 >pfam11777 DUF3316 Protein of unknown function (DUF3316). This family of bacterial proteins has no known function. Several members are, however, annotated as being putative acyl-CoA synthetase, but this could not be confirmed. Probab=23.31 E-value=45 Score=14.47 Aligned_cols=52 Identities=10% Similarity=0.079 Sum_probs=24.4 Q ss_pred CHHHHHHHHHHHHHHHCCCCC-----CCEEEEEECHHHHHHHHHHHCCCCEEEEEEC Q ss_conf 915899999999996001323-----8805999722899999997079441788705 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARA-----GSLQVVVSIKPIHSIVSCIMQGIGTPALLVK 52 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~-----~~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~ 52 (294) |||++++++++++...+.++. +.-.+.+....-.+=+-.+|.+.+.-....| T Consensus 1 mKk~~ll~~~ll~s~~AfA~~~~~~~~~~~~~~~~~~tk~~AY~~G~~~~~~l~~~s 57 (114) T pfam11777 1 MKKLILLALLLLLSATAFAGNYVSTTQTDSLQTDVYATKQAAYGAGFTNLNDLYLSS 57 (114) T ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 910899999999988887555536556258971553059999999999999886069 No 247 >PRK09028 cystathionine beta-lyase; Provisional Probab=23.18 E-value=56 Score=13.89 Aligned_cols=129 Identities=12% Similarity=0.152 Sum_probs=55.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHH---HCCCCCCC-----EEEECCCHHHHHHHHCCC--CEECCCCCCCCCCCH Q ss_conf 18656588998999986312223589998---50300278-----146206215778987498--120034677544497 Q gi|254780717|r 152 KDPRNKIIYEKNEEEFKNQLSQLDKELHS---ILQPVEKK-----KIIVFHEAYRYFASHYNL--SIVTFPMSHSVFMGA 221 (294) Q Consensus 152 ~dP~~~~~y~~N~~~~~~~L~~l~~~~~~---~~~~~~~~-----~~i~~H~a~~Yf~~~~gl--~~~~~~~~~~~~~s~ 221 (294) +.|..+-......+.+.-++++-.+--.. .+...... +-+-.||.+..+.|.+.= -+.+++...+. . T Consensus 241 ~sP~~a~L~lRGL~TL~lRm~r~~~nA~~vA~~L~~hp~V~~V~yPgL~shp~h~la~r~~~g~gg~~sf~l~~g~---~ 317 (394) T PRK09028 241 TSPDDVYLAARGLRTLGVRLAQHEKNALKVANWLATRPEVDHLRHPAFETCPGHEFFKRDFSGSNGLFSFVLKQGD---P 317 (394) T ss_pred CCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCHHHHHHHCCCCCEEEEEEEECCC---H T ss_conf 8878999998367859999999999999999998609998789888998894489999867799869999981699---9 Q ss_pred HHHHHHHHHHHHC-------CCCEEEEECCCCHHHHHHHHHHHCCC--EEEECCCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 8899999999754-------99389981888978999999971990--89980533678778167899999999999874 Q gi|254780717|r 222 ASLRNIRSKIISD-------KISCLFYGPEFDSKIIRSITNDTGVM--SAILDPEGMLIAEGPELYFQLMRSMSNSIAKN 292 (294) Q Consensus 222 ~~l~~~~~~ik~~-------~v~~if~e~~~~~~~~~~ia~e~~~~--~~~ld~~~~~~~~~~~~Y~~~m~~n~~~l~k~ 292 (294) ....++.+.++-- ++..++..|.. ....+.. .+.++. .+.| +.|. +...+++...-++|.|. T Consensus 318 ~~~~~f~~~L~lf~~a~SlGg~eSLi~~p~~-~~~~~~~-~~~~~~~~lvRl-SVGl------Ed~eDLi~DL~qAL~k~ 388 (394) T PRK09028 318 KAVTALVEGMQHFKMGFSWGGFESLILGPFG-INKLRTA-TNWDFSKPLIRL-HIGL------EDVDDLIADLEAGFERY 388 (394) T ss_pred HHHHHHHHHCCCCEEEECCCCCCCEEECCCC-CCCHHHH-HCCCCCCCEEEE-EECC------CCHHHHHHHHHHHHHHH T ss_conf 9999999848827571168987540788887-6742232-035899997999-9410------99899999999999985 No 248 >pfam05628 Borrelia_P13 Borrelia membrane protein P13. This family consists of P13 proteins from Borrelia species. P13 is a 13kDa integral membrane protein which is post-translationally processed at both ends and modified by an unknown mechanism. Probab=23.16 E-value=25 Score=16.15 Aligned_cols=23 Identities=13% Similarity=0.379 Sum_probs=16.9 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 91589999999999600132388 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARAGS 23 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~~~ 23 (294) |||++++++..++.+...|++.+ T Consensus 1 MkKi~~liLif~lt~qiFAq~d~ 23 (165) T pfam05628 1 MKKIFTLILIFSLTMQIFAQEDK 23 (165) T ss_pred CCEEEEHHHHHHHHHHHHCCCCH T ss_conf 96266205667557541200033 No 249 >TIGR00469 pheS_mito phenylalanyl-tRNA synthetase; InterPro: IPR004530 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Phenylalanyl-tRNA synthetase (6.1.1.20 from EC) is an alpha2/beta2 tetramer composed of 2 subunits that belongs to class IIc. In eubacteria, a small subunit (pheS gene) can be designated as beta (E. coli) or alpha subunit (nomenclature adopted in InterPro). Reciprocally the large subunit (pheT gene) can be designated as alpha (E. coli) or beta (see IPR004531 from INTERPRO and IPR004532 from INTERPRO). In all other kingdoms the two subunits have equivalent length in eukaryota, and can be identified by specific signatures. The enzyme from Thermus thermophilus has an alpha 2 beta 2 type quaternary structure and is one of the most complicated members of the synthetase family. Identification of phenylalanyl-tRNA synthetase as a member of class II aaRSs was based only on sequence alignment of the small alpha-subunit with other synthetases . This family describes the mitochondrial phenylalanyl-tRNA synthetases. Unlike all other known phenylalanyl-tRNA synthetases, the mitochondrial form demonstrated from yeast is monomeric. It is similar to but longer than the alpha subunit (PheS) of the alpha 2 beta 2 form found in bacteria, Archaea, and eukaryotes, and shares the characteristic motifs of class II aminoacyl-tRNA ligases.; GO: 0000166 nucleotide binding, 0004826 phenylalanine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006432 phenylalanyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=22.93 E-value=56 Score=13.86 Aligned_cols=13 Identities=8% Similarity=0.041 Sum_probs=5.4 Q ss_pred HHHHCCCCEEEEE Q ss_conf 5665025789996 Q gi|254780717|r 67 AMMLENADIIFWL 79 (294) Q Consensus 67 ~~~l~~Adlii~~ 79 (294) +.+..++|+..-+ T Consensus 164 ~~~~TK~~~~~~~ 176 (460) T TIGR00469 164 VRKVTKDDLFVKE 176 (460) T ss_pred EHHHHHHHHCCCC T ss_conf 2100234430468 No 250 >PRK07503 methionine gamma-lyase; Provisional Probab=22.88 E-value=57 Score=13.85 Aligned_cols=77 Identities=6% Similarity=0.083 Sum_probs=34.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHH---CCCCC---CC--EEEECCCHHHHHHHHCCC--CEECCCCCCCCCCCH Q ss_conf 186565889989999863122235899985---03002---78--146206215778987498--120034677544497 Q gi|254780717|r 152 KDPRNKIIYEKNEEEFKNQLSQLDKELHSI---LQPVE---KK--KIIVFHEAYRYFASHYNL--SIVTFPMSHSVFMGA 221 (294) Q Consensus 152 ~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~---~~~~~---~~--~~i~~H~a~~Yf~~~~gl--~~~~~~~~~~~~~s~ 221 (294) +.|..+-......+.+.-++++-.+--.+. +.... +. +-+..||-+..+.+.|.- -+.+++...+ . T Consensus 247 lsP~da~L~lRGL~TL~lRm~~~~~nA~~lA~~L~~hp~V~~V~yPgL~shp~h~la~~q~~g~gg~~sf~l~g~----~ 322 (403) T PRK07503 247 LSPHDASLLMRGLKTLALRMDRHCANAQQVAEFLARQPAVELVHYPGLPSFAQHALAQRQMRLPGGMIAFELKGG----I 322 (403) T ss_pred CCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCHHHHHHHCCCCCCEEEEEECCC----H T ss_conf 886888998646775999999999999999999974899558989888898018999986789983689996698----9 Q ss_pred HHHHHHHHHHH Q ss_conf 88999999997 Q gi|254780717|r 222 ASLRNIRSKII 232 (294) Q Consensus 222 ~~l~~~~~~ik 232 (294) ....++.+.+| T Consensus 323 ~~~~~f~~~L~ 333 (403) T PRK07503 323 EAGRRFMNALQ 333 (403) T ss_pred HHHHHHHHHCC T ss_conf 99999998089 No 251 >PRK09837 copper/silver efflux system outer membrane protein CusC; Provisional Probab=22.78 E-value=57 Score=13.84 Aligned_cols=20 Identities=10% Similarity=0.162 Sum_probs=16.3 Q ss_pred CHHHHHHHHHHHHHHHCCCC Q ss_conf 91589999999999600132 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVAR 20 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~ 20 (294) ||++-++.+++.+++++|+- T Consensus 1 ~~~~rll~l~~~l~L~gC~~ 20 (460) T PRK09837 1 MSPCKLLPFCVALALTGCSL 20 (460) T ss_pred CCHHHHHHHHHHHHHCCCCC T ss_conf 97578999999999824778 No 252 >COG0725 ModA ABC-type molybdate transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=22.39 E-value=58 Score=13.79 Aligned_cols=64 Identities=11% Similarity=0.076 Sum_probs=39.3 Q ss_pred CCEEEEEE---CHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHCCCCEEEEECCCCCHHHHHHHHH Q ss_conf 88059997---2289999999707944178870589983533568024566502578999676610356899973 Q gi|254780717|r 22 GSLQVVVS---IKPIHSIVSCIMQGIGTPALLVKGASSPHEYSLRISEAMMLENADIIFWLGAEMESFLVKPLHS 93 (294) Q Consensus 22 ~~~~Vv~s---~~pl~~lv~~I~gd~~~V~~L~~~g~dpH~y~~tp~d~~~l~~Adlii~~G~~~E~~~~~~~~~ 93 (294) ..++|.+. ..|+..+.++--.+. .+..-+.-|.+.. --++++.-..+|+++.-. +.|+.++.+. T Consensus 30 ~~i~VfAAaSL~~~l~~i~~~F~~~~-~~~V~~~f~gS~~----l~~qIe~Ga~~D~fiSa~---~~~~~~l~~~ 96 (258) T COG0725 30 ATITVFAAASLTDALEEIAKQFEKET-GVKVEVEFGGSGA----LARQIEQGAPADLFISAD---DAYMDKLEDK 96 (258) T ss_pred CEEEEEEEHHHHHHHHHHHHHHHHHH-CCEEEEEECCHHH----HHHHHHCCCCCCEEEECC---HHHHHHHHHC T ss_conf 40999981566899999999999987-9879999613899----999997599868799888---8889999866 No 253 >pfam01591 6PF2K 6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme contains a P-loop motif. Probab=22.29 E-value=58 Score=13.78 Aligned_cols=33 Identities=12% Similarity=0.095 Sum_probs=25.9 Q ss_pred CCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHH Q ss_conf 544497889999999975499389981888978 Q gi|254780717|r 216 SVFMGAASLRNIRSKIISDKISCLFYGPEFDSK 248 (294) Q Consensus 216 ~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~ 248 (294) .+..|.+.-..+.+.+++.+++++|.|+.-+.+ T Consensus 104 ATN~T~~RR~~i~~~~~~~~~~vlFiEsic~D~ 136 (223) T pfam01591 104 ATNTTRERRKNILDFAEENGLKVFFLESICDDP 136 (223) T ss_pred CCCCCHHHHHHHHHHHHHCCCEEEEEEEECCCH T ss_conf 887689999999999986697499999973888 No 254 >PRK10002 outer membrane protein F; Provisional Probab=22.23 E-value=58 Score=13.77 Aligned_cols=21 Identities=24% Similarity=0.382 Sum_probs=15.8 Q ss_pred CHHHHHHHHHHHHHHHCCCCC Q ss_conf 915899999999996001323 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARA 21 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~ 21 (294) |||-++.++++.+++++.+.+ T Consensus 2 MKK~~LA~ai~all~ag~a~A 22 (362) T PRK10002 2 MKRNILAVIVPALLVAGTANA 22 (362) T ss_pred CHHHHHHHHHHHHHHHCCCCE T ss_conf 308799999999986064417 No 255 >PRK08248 O-acetylhomoserine aminocarboxypropyltransferase; Validated Probab=22.22 E-value=58 Score=13.77 Aligned_cols=131 Identities=13% Similarity=0.175 Sum_probs=55.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHH---CCCCCCC-----EEEECCCHHHHHHHHC--CCC-EECCCCCCCCCCC Q ss_conf 186565889989999863122235899985---0300278-----1462062157789874--981-2003467754449 Q gi|254780717|r 152 KDPRNKIIYEKNEEEFKNQLSQLDKELHSI---LQPVEKK-----KIIVFHEAYRYFASHY--NLS-IVTFPMSHSVFMG 220 (294) Q Consensus 152 ~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~---~~~~~~~-----~~i~~H~a~~Yf~~~~--gl~-~~~~~~~~~~~~s 220 (294) +.|..+-......+.+.-+++.-.+--... +...... +-+..||.+.-+.+.+ |.- +.+++...+ T Consensus 277 ~sp~da~L~lRGLrTL~lRm~r~~~nA~~lA~~L~~hp~V~~V~yPgL~shp~h~la~~~~~~g~G~~~sf~l~~~---- 352 (431) T PRK08248 277 LSPFNAFLLLQGLETLHLRMERHSENALAVAKFLEEHPAVESVSYPGLPSHPSYELAKKYLPKGQGAILTFEIKGG---- 352 (431) T ss_pred CCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCHHHHHHHCCCCCEEEEEEEECCC---- T ss_conf 8806779997268979999999998899999999848995779899999995899999867999606999997288---- Q ss_pred HHHHHHHHHHHHHC-------CCCEEEEECCC--CHHHHHHHHHHHCCCE--EEECCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 78899999999754-------99389981888--9789999999719908--9980533678778167899999999999 Q gi|254780717|r 221 AASLRNIRSKIISD-------KISCLFYGPEF--DSKIIRSITNDTGVMS--AILDPEGMLIAEGPELYFQLMRSMSNSI 289 (294) Q Consensus 221 ~~~l~~~~~~ik~~-------~v~~if~e~~~--~~~~~~~ia~e~~~~~--~~ld~~~~~~~~~~~~Y~~~m~~n~~~l 289 (294) .....++.+.++=- ++..++.-|-. .....+.-.++.|+.. +.| +.|. +...+++...-++| T Consensus 353 ~~~~~~f~~~L~lf~~a~SlGg~eSLi~~pa~~th~~~~~~~r~~~Gi~~~liRl-SVGl------Ed~eDLi~DL~qAL 425 (431) T PRK08248 353 VEAGKKLIDSVKLFSHLANVGDSKSLIIHPASTTHEQLSEEEQRAAGVTPGLVRL-SVGT------ESIDDIIEDLRQAI 425 (431) T ss_pred HHHHHHHHHHCCCCEECCCCCCCCCEEECCCCCCCCCCCHHHHHHCCCCCCEEEE-EECC------CCHHHHHHHHHHHH T ss_conf 9999999983783611326676571645764355210899999866979495999-9410------99999999999999 Q ss_pred HHHC Q ss_conf 8741 Q gi|254780717|r 290 AKNC 293 (294) Q Consensus 290 ~k~l 293 (294) .++= T Consensus 426 ~~a~ 429 (431) T PRK08248 426 EQSQ 429 (431) T ss_pred HHHC T ss_conf 9723 No 256 >CHL00076 chlB photochlorophyllide reductase subunit B Probab=22.06 E-value=59 Score=13.75 Aligned_cols=60 Identities=10% Similarity=0.088 Sum_probs=28.7 Q ss_pred HHHHHCCCCEECCCCCCCCC---------------CCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCEEE Q ss_conf 78987498120034677544---------------497889999999975499389981888978999999971990899 Q gi|254780717|r 199 YFASHYNLSIVTFPMSHSVF---------------MGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMSAI 263 (294) Q Consensus 199 Yf~~~~gl~~~~~~~~~~~~---------------~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~~~ 263 (294) ++.+++|+++.+.+...-.. .-..+-.++.+.|.+.....||. +| +=+-+++.++++-.+ T Consensus 320 ~l~~EmG~~v~~~gty~r~~a~~~r~~~~~~~~e~litdD~~eV~~~I~~~~P~~vfG-Tq----mERh~~krl~IPc~v 394 (510) T CHL00076 320 ILSREMGIRVVCAGTYCKHDAEWFREQVQGFCDEILITDDHTEVGDMIARIEPSAIFG-TQ----MERHIGKRLDIPCGV 394 (510) T ss_pred HHHHHHCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHHHCCCCHHCC-HH----HHHHHHHHCCCCCCC T ss_conf 8777628423541242155559999999844863255064899999998719823124-48----887766640887567 No 257 >pfam04076 BOF Bacterial OB fold (BOF) protein. Proteins in this family form an OB-fold. Analysis of the predicted binding site of BOF family proteins implies that they lack nucleic acid-binding properties. They contain an predicted N-terminal signal peptide which indicates that they localize in the periplasm where they may function to bind proteins, small molecules, or other typical OB-fold ligands. As hypothesized for the distantly related OB-fold containing bacterial enterotoxins, the loss of nucleotide-binding function and the rapid evolution of the BOF ligand-binding site may be associated with the presence of BOF proteins in mobile genetic elements and their potential role in bacterial pathogenicity. Probab=22.01 E-value=50 Score=14.17 Aligned_cols=19 Identities=32% Similarity=0.275 Sum_probs=11.5 Q ss_pred CHHHHHHHHHHHHHHHCCC Q ss_conf 9158999999999960013 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVA 19 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~ 19 (294) |||.++++++++...++.+ T Consensus 1 MKK~~i~~~~~l~s~~~~A 19 (126) T pfam04076 1 MKKLAIALASALASTSALA 19 (126) T ss_pred CCHHHHHHHHHHHHCHHHH T ss_conf 9568999999998467879 No 258 >TIGR02155 PA_CoA_ligase phenylacetate-CoA ligase; InterPro: IPR011880 Phenylacetate-CoA ligase (PA-CoA ligase) catalyzes the first step in aromatic catabolism of phenylacetic acid (PA) into phenylacetyl-CoA (PA-CoA) . Often located in a conserved gene cluster with enzymes involved in phenylacetic acid activation (paaG/H/I/J), phenylacetate-CoA ligase has been found among the proteobacteria as well as in Gram-positive prokaryotes . In the B-subclass proteobacterium Azoarcus evansii, phenylacetate-CoA ligase has been shown to be induced under aerobic and anaerobic growth conditions. It remains unclear however, whether this induction is due to the same enzyme or to another isoenzyme restricted to specific anaerobic growth conditions .. Probab=21.96 E-value=59 Score=13.74 Aligned_cols=99 Identities=18% Similarity=0.228 Sum_probs=46.8 Q ss_pred CCCCEEEECCCHHHH--HHHHCCCCEECCCCCC-CCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCE Q ss_conf 027814620621577--8987498120034677-5444978899999999754993899818889789999999719908 Q gi|254780717|r 185 VEKKKIIVFHEAYRY--FASHYNLSIVTFPMSH-SVFMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMS 261 (294) Q Consensus 185 ~~~~~~i~~H~a~~Y--f~~~~gl~~~~~~~~~-~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~ 261 (294) ..+|.=...|.||+| |.-=.|....+-.+.. -++.|=-+=.+-+++|++-+...|.+.|..-=.+.+.+.+. + T Consensus 114 AGGr~g~~~H~AYGYGLFTGGLG~HyGaEkLG~~~vPiSGG~TekQ~qLI~Df~P~iI~~TPSY~L~~~e~l~r~-~--- 189 (431) T TIGR02155 114 AGGRPGDLIHNAYGYGLFTGGLGAHYGAEKLGCTVVPISGGQTEKQVQLIQDFKPDIIMVTPSYMLNILEELKRM-G--- 189 (431) T ss_pred CCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCEEEECCCCCHHHHHHHHHCCCCCEEEECHHHHHHHHHHHHHC-C--- T ss_conf 278988757873054300032456423120660787137978056777643379888985778999999988640-6--- Q ss_pred EEECCCCCCCC---CCCCCHHHHHHHHHHHH Q ss_conf 99805336787---78167899999999999 Q gi|254780717|r 262 AILDPEGMLIA---EGPELYFQLMRSMSNSI 289 (294) Q Consensus 262 ~~ld~~~~~~~---~~~~~Y~~~m~~n~~~l 289 (294) +||-..+.+ .|.+=|.+-||.++++- T Consensus 190 --~dp~~~Sl~~gilGaEPW~~~mR~~~E~~ 218 (431) T TIGR02155 190 --IDPEQTSLKVGILGAEPWTNAMRKEIEAR 218 (431) T ss_pred --CCHHHCCEEEEEECCCCCHHHHHHHHHHH T ss_conf --88645636788723771058899999987 No 259 >PTZ00333 triosephosphate isomerase; Provisional Probab=21.89 E-value=59 Score=13.73 Aligned_cols=63 Identities=11% Similarity=0.253 Sum_probs=45.0 Q ss_pred CCEEEECCCHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHC-----------CCCEEEEECCCCHHHHHHHHH Q ss_conf 781462062157789874981200346775444978899999999754-----------993899818889789999999 Q gi|254780717|r 187 KKKIIVFHEAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISD-----------KISCLFYGPEFDSKIIRSITN 255 (294) Q Consensus 187 ~~~~i~~H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~-----------~v~~if~e~~~~~~~~~~ia~ 255 (294) ++-++.|+|.|. +.++..|++.++.++.+.||+. +++.+ +.-..++..++.|.+ T Consensus 161 ~~iiIAYEPvWA--------------IGTG~~as~~~i~e~~~~Ir~~l~~~~~~~~~~~i~IL-YGGSVn~~Na~~i~~ 225 (252) T PTZ00333 161 DKVVIAYEPVWA--------------IGTGKVATPEQAQEVHAFIRKWLSEKVGADVAKKVRII-YGGSVNEKNCHELIK 225 (252) T ss_pred HCEEEEECCHHH--------------CCCCCCCCHHHHHHHHHHHHHHHHHHHCHHHHCCCCEE-EECCCCHHHHHHHHC T ss_conf 241899667653--------------58899999999999999999999987185450677488-727779999999966 Q ss_pred HHCCCEEEE Q ss_conf 719908998 Q gi|254780717|r 256 DTGVMSAIL 264 (294) Q Consensus 256 e~~~~~~~l 264 (294) .-++..+.+ T Consensus 226 ~~~vDG~LV 234 (252) T PTZ00333 226 QPDIDGFLV 234 (252) T ss_pred CCCCCEEEE T ss_conf 889997896 No 260 >pfam10566 Glyco_hydro_97 Glycoside hydrolase 97. This is the 97th family of glycosidases, in this case bacterial. The central part of the GH97 family protein sequences represents a typical and complete (beta/alpha)8-barrel or catalytic TIM-barrel type domain. The N- and C-terminal parts of the sequences, mainly consisting of beta-strands, most probably form two additional non-catalytic domains with as yet unknown functions. The non-catalytic domains of glycosidases from the alpha-galactosidase and alpha-glucosidase superfamilies are also predominantly composed of beta-strands, and at least some of these domains are involved in oligomerisation and carbohydrate binding. In all known glycosidases with the (beta-alpha)8-barrel fold, the amino acid residues at the active site are located on the C-termini of the beta-strands. Probab=21.82 E-value=59 Score=13.72 Aligned_cols=74 Identities=12% Similarity=0.123 Sum_probs=47.7 Q ss_pred HHHHHHHCCCCEECC--CCCC-C-------CCC-CHHHHHHHHHHHHHCCCCEEEEECCCC--H---HHHHHHHHHHCCC Q ss_conf 577898749812003--4677-5-------444-978899999999754993899818889--7---8999999971990 Q gi|254780717|r 197 YRYFASHYNLSIVTF--PMSH-S-------VFM-GAASLRNIRSKIISDKISCLFYGPEFD--S---KIIRSITNDTGVM 260 (294) Q Consensus 197 ~~Yf~~~~gl~~~~~--~~~~-~-------~~~-s~~~l~~~~~~ik~~~v~~if~e~~~~--~---~~~~~ia~e~~~~ 260 (294) |--||.++|++.+-+ +... + .++ ..-+|.+++++.++++|..++.-...+ . +..+ .-++.|++ T Consensus 315 yIDfAA~~G~eyvLvD~gW~~~~~~~~~d~~~~~p~~dl~~lv~Ya~~KgV~i~lw~~~~~~~~~~~~~f~-~~~~~Gv~ 393 (643) T pfam10566 315 YIDFAARNGLEYISLDEGWYTPDGGKNSDMTKVISELDLQELIAYGKSKGVGIILWTNQKALYAQLDEAFK-LYEKWGVV 393 (643) T ss_pred HHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHH-HHHHCCCC T ss_conf 99999984996899815415788876566454276458799999998679639998356678998999999-99982997 Q ss_pred EEEECCCCCCC Q ss_conf 89980533678 Q gi|254780717|r 261 SAILDPEGMLI 271 (294) Q Consensus 261 ~~~ld~~~~~~ 271 (294) .+.+|-++.+. T Consensus 394 GvKvdF~~~d~ 404 (643) T pfam10566 394 GFKVDFLDRDD 404 (643) T ss_pred EEEECCCCCCC T ss_conf 27756737652 No 261 >PRK09304 arginine exporter protein; Provisional Probab=21.76 E-value=43 Score=14.59 Aligned_cols=13 Identities=23% Similarity=0.404 Sum_probs=9.2 Q ss_pred CCHHHHHHHHHHH Q ss_conf 4756776555567 Q gi|254780717|r 127 DDLHLWLNPLNVQ 139 (294) Q Consensus 127 ~dpH~Wl~p~n~~ 139 (294) .|||+|+|..-.+ T Consensus 107 LNPHvyLDTv~ll 119 (197) T PRK09304 107 LNPHVYLDTFVVL 119 (197) T ss_pred CCCHHHHHHHHHH T ss_conf 1815888899997 No 262 >TIGR02478 6PF1K_euk 6-phosphofructokinase; InterPro: IPR009161 Members of this family are eukaryotic (with one exception) ATP-dependent 6-phosphofructokinases (2.7.1.11 from EC) in which two tandem copies of the phosphofructokinase are found . Members are found, often including several isozymes, in animals and fungi and in the bacterium Propionibacterium acnes KPA171202 (a human skin commensal). Phosphofructokinase (PFK) catalyses the phosphorylation of fructose-6-phosphate to fructose-1,6-biphosphate, which then enters the Embden-Meyerhof pathway. PFK is a key regulatory enzyme in glycolysis. For additional information please see , , .; GO: 0003872 6-phosphofructokinase activity, 0006096 glycolysis, 0005737 cytoplasm. Probab=21.75 E-value=60 Score=13.71 Aligned_cols=159 Identities=13% Similarity=0.167 Sum_probs=92.9 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCH-----HHHHHHHHHHHHHHHHHHHHHH------H----HCC------C------- Q ss_conf 6555567899999999742186565-----8899899998631222358999------8----503------0------- Q gi|254780717|r 133 LNPLNVQYIAHVIAMELIKKDPRNK-----IIYEKNEEEFKNQLSQLDKELH------S----ILQ------P------- 184 (294) Q Consensus 133 l~p~n~~~~a~~I~~~L~~~dP~~~-----~~y~~N~~~~~~~L~~l~~~~~------~----~~~------~------- 184 (294) ....-+..+.+.+.+++.+.+-+++ .-|..|+..|+. +...+..-+ + .++ | T Consensus 342 ~PL~~~V~~T~~V~~~i~~~~f~~A~~LRg~eF~~~~~~~~~-~s~~~~~~~~~~~~~~r~~~R~AIihvGAPA~GMNaA 420 (777) T TIGR02478 342 KPLVEAVRQTKTVAKAIKERRFDEAVRLRGREFVENLATFLE-LSKPDQDKKLVPSKADRLNLRVAIIHVGAPAGGMNAA 420 (777) T ss_pred CHHHHHHHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHHHHHH-HCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCHHHHH T ss_conf 318899998999899873177545872357235775444776-2033788867775435674207999648885302399 Q ss_pred ---------CCCCEEEECCCHHHHHHHHC------CCCEEC------CCCCCCCC-------CCHHHHHHHHHHHHHCCC Q ss_conf ---------02781462062157789874------981200------34677544-------497889999999975499 Q gi|254780717|r 185 ---------VEKKKIIVFHEAYRYFASHY------NLSIVT------FPMSHSVF-------MGAASLRNIRSKIISDKI 236 (294) Q Consensus 185 ---------~~~~~~i~~H~a~~Yf~~~~------gl~~~~------~~~~~~~~-------~s~~~l~~~~~~ik~~~v 236 (294) .++-.++.=|+.|.=|++-. -++.+. +....|.+ |+-++|..+...++++++ T Consensus 421 ~R~avr~~~~~Gh~v~aI~nGf~GL~~~~DDtPicsvrEL~W~dV~gW~~~GGS~lGTnR~lP~~~d~~~iA~~f~~~ki 500 (777) T TIGR02478 421 TRAAVRYALARGHTVIAIHNGFSGLARGDDDTPICSVRELTWSDVEGWVGEGGSELGTNRELPNGKDLGEIAYKFQKHKI 500 (777) T ss_pred HHHHHHHHHCCCCEEEEEECCCHHCCCCCCCCCEEEEEEECCCCCHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCC T ss_conf 99999998506987999816841002288897433122202344423342887621677897786538999999986098 Q ss_pred CEEEEECCCCH-HHHHHHH------HHHCCCEEEEC-CC-----CCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 38998188897-8999999------97199089980-53-----3678778167899999999999874 Q gi|254780717|r 237 SCLFYGPEFDS-KIIRSIT------NDTGVMSAILD-PE-----GMLIAEGPELYFQLMRSMSNSIAKN 292 (294) Q Consensus 237 ~~if~e~~~~~-~~~~~ia------~e~~~~~~~ld-~~-----~~~~~~~~~~Y~~~m~~n~~~l~k~ 292 (294) ..|+.=-.|.. ..+.+|. .+..+|.+.+- .+ |+....|.||=+.-+..-.|.|+++ T Consensus 501 DgL~IIGGFEAf~al~qL~~aR~~YP~F~IPmv~iPATISNNVPGTe~SLG~DTaLN~i~~~cD~IkqS 569 (777) T TIGR02478 501 DGLLIIGGFEAFEALLQLEQAREKYPAFRIPMVVIPATISNNVPGTEYSLGSDTALNEITEYCDKIKQS 569 (777) T ss_pred CEEEEECCHHHHHHHHHHHHHHHHCCCCCCCEEEECCEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCC T ss_conf 879997603688999999999861885578857866601077872443022015777899872632235 No 263 >cd06451 AGAT_like Alanine-glyoxylate aminotransferase (AGAT) family. This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD correspond to alanine-glyoxylate aminotransferase (AGAT), serine-glyoxylate aminotransferase (SGAT), and 3-hydroxykynurenine transaminase (HKT). AGAT is a homodimeric protein, which catalyses the transamination of glyoxylate to glycine, and SGAT converts serine and glyoxylate to hydroxypyruvate and glycine. HKT catalyzes the PLP-dependent transamination of 3-hydroxykynurenine, a potentially toxic metabolite of the kynurenine pathway. Probab=21.68 E-value=60 Score=13.70 Aligned_cols=27 Identities=4% Similarity=-0.093 Sum_probs=14.6 Q ss_pred CEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 0899805336787781678999999999998741 Q gi|254780717|r 260 MSAILDPEGMLIAEGPELYFQLMRSMSNSIAKNC 293 (294) Q Consensus 260 ~~~~ld~~~~~~~~~~~~Y~~~m~~n~~~l~k~l 293 (294) ++..+-.||.. -.+.+...++.|++.| T Consensus 329 ~~fRIg~mG~~-------~~~di~~~l~al~~~L 355 (356) T cd06451 329 KVFRIGHMGEA-------TREDVLGVLSALEEAL 355 (356) T ss_pred CEEEECCCCCC-------CHHHHHHHHHHHHHHH T ss_conf 58986189899-------9999999999999985 No 264 >TIGR01485 SPP_plant-cyano sucrose phosphatase; InterPro: IPR012847 This entry describes the sucrose phosphate phosphohydrolase from plants and cyanobacteria (SPP). However, a closely related group of sequences from bacteria and archaea may prove to catalyze the same reaction. SPP is a member of the Class IIB subfamily of the Haloacid Dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases. SPP catalyzes the final step in the biosynthesis of sucrose, a critically important molecule for plants. Sucrose phosphate synthase (SPS), the prior step in the biosynthesis of sucrose contains a domain, which exhibits considerable similarity to SPP albeit without conservation of the catalytic residues. The catalytic machinery of the synthase resides in another domain. It seems likely that the phosphatase-like domain is involved in substrate binding, possibly binding both substrates in a "product-like" orientation prior to ligation by the synthase catalytic domain.. Probab=21.66 E-value=60 Score=13.70 Aligned_cols=24 Identities=17% Similarity=0.195 Sum_probs=13.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 765555678999999997421865 Q gi|254780717|r 132 WLNPLNVQYIAHVIAMELIKKDPR 155 (294) Q Consensus 132 Wl~p~n~~~~a~~I~~~L~~~dP~ 155 (294) .||+.=-+.++.+|+..+.++.|+ T Consensus 92 ~L~~~W~R~iv~~~t~~F~~L~~Q 115 (257) T TIGR01485 92 YLSEKWERDIVVAITRKFEELKPQ 115 (257) T ss_pred HHCCCCCHHHHHHHHCCHHHHCCC T ss_conf 705887623354422350230579 No 265 >PRK05994 O-acetylhomoserine aminocarboxypropyltransferase; Validated Probab=21.57 E-value=60 Score=13.69 Aligned_cols=129 Identities=12% Similarity=0.080 Sum_probs=53.9 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHH---CCCCCCCE-----EEECCCHHHHHHHHCC--CC-EECCCCCCCCCCC Q ss_conf 186565889989999863122235899985---03002781-----4620621577898749--81-2003467754449 Q gi|254780717|r 152 KDPRNKIIYEKNEEEFKNQLSQLDKELHSI---LQPVEKKK-----IIVFHEAYRYFASHYN--LS-IVTFPMSHSVFMG 220 (294) Q Consensus 152 ~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~---~~~~~~~~-----~i~~H~a~~Yf~~~~g--l~-~~~~~~~~~~~~s 220 (294) +.|..+-......+.+.-++++-.+--.+. +....... -+..|+....+.+.+. .- +.+++...+. T Consensus 276 lsP~da~L~lRGL~TL~lRm~r~~~nA~~vA~~L~~hp~V~~V~yPgL~s~p~h~lak~~~~~g~G~~~Sf~l~g~~--- 352 (426) T PRK05994 276 ISPFNAFLILTGIETLPLRMQRHSDNALAVAEWLSGHPKVSWVNYAGLPSDKYHALAQKYSPKGAGAVFTFGLKGGY--- 352 (426) T ss_pred CCCHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCHHHHHHHCCCCCEEEEEEEECCCH--- T ss_conf 68167899972578389999999999999999996299978898989899803999998788997379999966979--- Q ss_pred HHHHHHHHHHHHHC-------CCCEEEEECCC--CHHHHHHHHHHHCCC--EEEECCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 78899999999754-------99389981888--978999999971990--89980533678778167899999999999 Q gi|254780717|r 221 AASLRNIRSKIISD-------KISCLFYGPEF--DSKIIRSITNDTGVM--SAILDPEGMLIAEGPELYFQLMRSMSNSI 289 (294) Q Consensus 221 ~~~l~~~~~~ik~~-------~v~~if~e~~~--~~~~~~~ia~e~~~~--~~~ld~~~~~~~~~~~~Y~~~m~~n~~~l 289 (294) ....++.+.++=- ++..++.-|.. ....-+.-.+..|+. .+.| +.|. +...+++...-++| T Consensus 353 -~~~~~f~~~l~lf~~a~SlGg~eSLi~~p~~~th~~~~~e~r~~~Gi~~~liRl-SVGl------Ed~eDLi~DL~qAL 424 (426) T PRK05994 353 -EAGVKLVSSLKLFSHLANIGDTRSLVIHPASTTHRQLSDEQKVAAGAGPDVVRL-SIGI------EDKDDIIADLEQAL 424 (426) T ss_pred -HHHHHHHHHCCCCCCCCCCCCCCEEEECCCCCCCCCCCHHHHHHCCCCCCEEEE-EECC------CCHHHHHHHHHHHH T ss_conf -999999982791500316677550457864234521899999965989697999-8431------99999999999985 Q ss_pred HH Q ss_conf 87 Q gi|254780717|r 290 AK 291 (294) Q Consensus 290 ~k 291 (294) +| T Consensus 425 ~k 426 (426) T PRK05994 425 AK 426 (426) T ss_pred CC T ss_conf 85 No 266 >TIGR01781 Trep_dent_lipo Treponema denticola clustered lipoprotein; InterPro: IPR010103 This entry represents a family of six predicted lipoproteins from a region of about 20 tandemly arranged genes in the Treponema denticola genome. Two other neighbouring genes share the lipoprotein signal peptide region but do not show more extensive homology. The function of this locus is unknown.. Probab=21.39 E-value=61 Score=13.66 Aligned_cols=17 Identities=29% Similarity=0.520 Sum_probs=10.9 Q ss_pred HHHHHHHHHHHHCCCCC Q ss_conf 99999999996001323 Q gi|254780717|r 5 LIILIFLFFILSSVARA 21 (294) Q Consensus 5 l~~~~~~~~~l~~~~~~ 21 (294) |+.+++|+++|+||..+ T Consensus 8 lIFIL~LAvLLFSC~ke 24 (463) T TIGR01781 8 LIFILMLAVLLFSCKKE 24 (463) T ss_pred HHHHHHHHHHHHCCCCC T ss_conf 88899999986047621 No 267 >cd06319 PBP1_ABC_sugar_binding_like_10 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=21.26 E-value=61 Score=13.64 Aligned_cols=33 Identities=15% Similarity=0.057 Sum_probs=15.1 Q ss_pred CCCCEEEEECCCCCHHHHHHHHHCCC-CCCEEEEE Q ss_conf 02578999676610356899973145-57679975 Q gi|254780717|r 71 ENADIIFWLGAEMESFLVKPLHSLNK-QSNVVTLS 104 (294) Q Consensus 71 ~~Adlii~~G~~~E~~~~~~~~~~~~-~~~~i~~~ 104 (294) ++.|.|+....+-+. +...++.+.. ..+++.+. T Consensus 54 ~~vDgIii~p~~~~~-~~~~~~~a~~~gIPvv~~d 87 (277) T cd06319 54 KGVSGIIISPTNSSA-AVTLLKLAAQAKIPVVIAD 87 (277) T ss_pred CCCCEEEECCCCCCC-HHHHHHHHHHCCCCEEEEE T ss_conf 699879964777411-0999999997699789986 No 268 >cd07983 LPLAT_DUF374-like Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: DUF374. Lysophospholipid acyltransferase (LPLAT) superfamily member: acyltransferases of de novo and remodeling pathways of glycerophospholipid biosynthesis which catalyze the incorporation of an acyl group from either acylCoAs or acyl-acyl carrier proteins (acylACPs) into acceptors such as glycerol 3-phosphate, dihydroxyacetone phosphate or lyso-phosphatidic acid. Included in this subgroup are the uncharacterized DUF374 phospholipid/glycerol acyltransferases and similar proteins. Probab=21.20 E-value=61 Score=13.64 Aligned_cols=67 Identities=15% Similarity=0.061 Sum_probs=44.5 Q ss_pred CHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCH-----HHHHHHHHHHCCCEEEE Q ss_conf 21577898749812003467754449788999999997549938998188897-----89999999719908998 Q gi|254780717|r 195 EAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISDKISCLFYGPEFDS-----KIIRSITNDTGVMSAIL 264 (294) Q Consensus 195 ~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~-----~~~~~ia~e~~~~~~~l 264 (294) +-.....+.||+..+ .-+ ...-+.+-+.++++.+++...-+|+.|---.+ .=+-.||+.+|++++++ T Consensus 61 eiia~~~~~~G~~~I--RGS-s~rgg~~al~~~~~~lk~g~~i~itpDGPrGP~~~~k~G~i~LA~~sg~pIvpv 132 (189) T cd07983 61 EIIARVLERLGIRVV--RGS-SSRGGAAALREMLRALKDGYNIAITPDGPRGPRYKVKPGVILLARKSGAPIVPV 132 (189) T ss_pred HHHHHHHHHCCCCEE--ECC-CCCCCHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCHHHHHHHHCCCCEEEE T ss_conf 389999998198156--445-555819999999999977985999468998843435732999999809988999 No 269 >PRK05586 biotin carboxylase; Validated Probab=21.13 E-value=57 Score=13.83 Aligned_cols=17 Identities=12% Similarity=0.251 Sum_probs=6.8 Q ss_pred CCCCCEEEECCCHHHHH Q ss_conf 00278146206215778 Q gi|254780717|r 184 PVEKKKIIVFHEAYRYF 200 (294) Q Consensus 184 ~~~~~~~i~~H~a~~Yf 200 (294) ......|+.-.+.=-|| T Consensus 270 gaGTvEFl~d~~~~fyF 286 (447) T PRK05586 270 NAGTIEFLLDKDGNFYF 286 (447) T ss_pred CCCEEEEEECCCCCEEE T ss_conf 33247888758997899 No 270 >cd01451 vWA_Magnesium_chelatase Magnesium chelatase: Mg-chelatase catalyses the insertion of Mg into protoporphyrin IX (Proto). In chlorophyll biosynthesis, insertion of Mg2+ into protoporphyrin IX is catalysed by magnesium chelatase in an ATP-dependent reaction. Magnesium chelatase is a three sub-unit (BchI, BchD and BchH) enzyme with a novel arrangement of domains: the C-terminal helical domain is located behind the nucleotide binding site. The BchD domain contains a AAA domain at its N-terminus and a VWA domain at its C-terminus. The VWA domain has been speculated to be involved in mediating protein-protein interactions. Probab=21.08 E-value=61 Score=13.62 Aligned_cols=53 Identities=15% Similarity=0.023 Sum_probs=39.6 Q ss_pred CCCCCHHHHHHHHHHHHHCCCCEEEEECCC---CHHHHHHHHHHHCCCEEEECCCC Q ss_conf 544497889999999975499389981888---97899999997199089980533 Q gi|254780717|r 216 SVFMGAASLRNIRSKIISDKISCLFYGPEF---DSKIIRSITNDTGVMSAILDPEG 268 (294) Q Consensus 216 ~~~~s~~~l~~~~~~ik~~~v~~if~e~~~---~~~~~~~ia~e~~~~~~~ld~~~ 268 (294) +..+......++.+.+++.+++.++.+.+. ....++.||+..|..-..+|.+. T Consensus 113 g~~~~~~~~~~~a~~~~~~gi~~~vId~~~~~~~~~~~~~LA~~~~g~Y~~id~l~ 168 (178) T cd01451 113 GPDPTADRALAAARKLRARGISALVIDTEGRPVRRGLAKDLARALGGQYVRLPDLS 168 (178) T ss_pred CCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHCCCCEEECCCCC T ss_conf 99951269999999998669978999799997674899999994299699899799 No 271 >pfam07312 DUF1459 Protein of unknown function (DUF1459). This family consists of several hypothetical Caenorhabditis elegans proteins of around 85 residues in length. The function of this family is unknown. Probab=21.05 E-value=53 Score=14.02 Aligned_cols=28 Identities=39% Similarity=0.453 Sum_probs=18.6 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCEEEEEE Q ss_conf 1589999999999600132388059997 Q gi|254780717|r 2 KNFLIILIFLFFILSSVARAGSLQVVVS 29 (294) Q Consensus 2 kk~l~~~~~~~~~l~~~~~~~~~~Vv~s 29 (294) .|-++++.+.++|+++|.---.+.||.| T Consensus 3 qk~~ivl~~a~fciss~qv~ytpevvss 30 (84) T pfam07312 3 QKSLIVLAFALFCISSCQVLYSPEVVAS 30 (84) T ss_pred CHHHHHHHHHHHHHHHCEEEECHHHCCC T ss_conf 0335699999998611106635002157 No 272 >PRK08861 cystathionine gamma-synthase; Provisional Probab=20.93 E-value=62 Score=13.60 Aligned_cols=131 Identities=14% Similarity=0.167 Sum_probs=55.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHH---CCCCCCCEE-----EECCCHHHHHHHHCC-C-CEECCCCCCCCCCCH Q ss_conf 186565889989999863122235899985---030027814-----620621577898749-8-120034677544497 Q gi|254780717|r 152 KDPRNKIIYEKNEEEFKNQLSQLDKELHSI---LQPVEKKKI-----IVFHEAYRYFASHYN-L-SIVTFPMSHSVFMGA 221 (294) Q Consensus 152 ~dP~~~~~y~~N~~~~~~~L~~l~~~~~~~---~~~~~~~~~-----i~~H~a~~Yf~~~~g-l-~~~~~~~~~~~~~s~ 221 (294) ..|..+-......+.+.-+++...+--.+. +.......- +..|+.+..+.+.++ . -+.+++... +. T Consensus 234 ~sP~~a~l~~RGL~TL~lRm~~~~~nA~~lA~~L~~hp~V~~V~yPgL~shp~h~lak~~~~g~ggm~sf~l~~----~~ 309 (388) T PRK08861 234 GTPFDSYMTLRGIRTLGARMRVHEESAQEILKYLQTQSLVGTIYHPSLPEHPGHEIAKKQQSGFGSMLSFEFAG----SF 309 (388) T ss_pred CCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCHHHHHHHCCCCCCEEEEEECC----CH T ss_conf 88677788854789788999999986999999998499927898989999926999986456986269999779----89 Q ss_pred HHHHHHHHHHHHC-------CCCEEEEECCC--CHHHHHHHHHHHCCCE--EEECCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 8899999999754-------99389981888--9789999999719908--99805336787781678999999999998 Q gi|254780717|r 222 ASLRNIRSKIISD-------KISCLFYGPEF--DSKIIRSITNDTGVMS--AILDPEGMLIAEGPELYFQLMRSMSNSIA 290 (294) Q Consensus 222 ~~l~~~~~~ik~~-------~v~~if~e~~~--~~~~~~~ia~e~~~~~--~~ld~~~~~~~~~~~~Y~~~m~~n~~~l~ 290 (294) ....++.+.++-- ++..++.-|.. ....-+.-.++.|+.. +.| ..|. +...+++...-++|+ T Consensus 310 ~~~~~f~~~l~lf~~a~SlGg~eSLi~~p~s~sh~~~~~~~~~~~gi~~~liRl-sVGl------Ed~eDLi~Dl~qAL~ 382 (388) T PRK08861 310 EQLKVFVKALKLFSLAESLGGVESLICHPASMTHRAMGEEALAEAGVSQQLLRL-SVGL------EDAQDLIADLKQAFA 382 (388) T ss_pred HHHHHHHHHCCCCEECCCCCCCCCEEECCCCCCCCCCCHHHHHHCCCCCCEEEE-EECC------CCHHHHHHHHHHHHH T ss_conf 999999983880533116687750213775455411899999866989496999-9421------999999999999999 Q ss_pred HHC Q ss_conf 741 Q gi|254780717|r 291 KNC 293 (294) Q Consensus 291 k~l 293 (294) ++- T Consensus 383 ~a~ 385 (388) T PRK08861 383 KAA 385 (388) T ss_pred HHH T ss_conf 987 No 273 >COG4047 Uncharacterized protein conserved in archaea [Function unknown] Probab=20.91 E-value=62 Score=13.60 Aligned_cols=102 Identities=19% Similarity=0.124 Sum_probs=62.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCC-CCCCCEEEECCCHHHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 8998999986312223589998503-002781462062157789874981200346775444978899999999754993 Q gi|254780717|r 159 IYEKNEEEFKNQLSQLDKELHSILQ-PVEKKKIIVFHEAYRYFASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISDKIS 237 (294) Q Consensus 159 ~y~~N~~~~~~~L~~l~~~~~~~~~-~~~~~~~i~~H~a~~Yf~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~ 237 (294) .-..|...|...+..+-..+...+. .-.++.+|-+---|+|-++--- ..+...|=.-|-|-|..-..-..+.. T Consensus 113 L~~~~~~~yyedm~~l~~~la~~lg~d~esKT~VFsVKmfgYA~r~~~---g~f~pyp~eIPIPvD~Ri~rlT~~~t--- 186 (243) T COG4047 113 LTEENIEVYYEDMSLLLEALARALGADRESKTVVFSVKMFGYAARIVL---GRFRPYPFEIPIPVDVRIARLTRKLT--- 186 (243) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEHHHHHHHHHHH---CCCCCCCCCCCCCCHHHHHHHHHHHH--- T ss_conf 656319999863899999999996778544528988467678999983---67888775788740489999999875--- Q ss_pred EEEEECCCCHHHHHHHHHHHCCCEEEECCCCCCC Q ss_conf 8998188897899999997199089980533678 Q gi|254780717|r 238 CLFYGPEFDSKIIRSITNDTGVMSAILDPEGMLI 271 (294) Q Consensus 238 ~if~e~~~~~~~~~~ia~e~~~~~~~ld~~~~~~ 271 (294) ...+.+....||+++|++...+|++-+.. T Consensus 187 -----ne~Pq~~W~~iAResGIPpLHiDSILW~~ 215 (243) T COG4047 187 -----NEAPQKFWDKIARESGIPPLHIDSILWPL 215 (243) T ss_pred -----HHHHHHHHHHHHHHCCCCCCEECCEECCC T ss_conf -----31479999999874398962130000025 No 274 >TIGR01990 bPGM beta-phosphoglucomutase; InterPro: IPR010972 This model represents the beta-phosphoglucomutase enzyme which catalyses the interconversion of beta-D-glucose-1-phosphate and beta-D-glucose-6-phosphate. The 6-phosphate is capable of non-enzymatic anomerization (alpha to beta or vice versa) while the 1-phosphate is not. A separate enzyme is responsible for the isomerisation of the alpha anomers. Beta-D-glucose-1-phosphate results from the phosphorolysis of maltose (2.4.1.8 from EC), trehalose (2.4.1.64 from EC) or trehalose-6-phosphate (2.4.1.216 from EC). Alternatively, these reactions can be run in the synthetic direction to create the disaccharides. All sequenced genomes which contain a member of this family also appear to contain at least one putative maltose or trehalose phosphorylase. Three species, Lactococcus, Enterococcus and Neisseria appear to contain a pair of paralogous beta-PGM's. Beta-phosphoglucomutase is a member of the haloacid dehalogenase superfamily of hydrolase enzymes. These enzymes are characterised by a series of three catalytic motifs positioned within an alpha-beta (Rossman) fold . Beta-PGM contains an inserted alpha helical domain in between the first and second conserved motifs and thus is a member of subfamily IA of the superfamily , . The third catalytic motif comes in three variants, the third of which, containing a conserved DD or ED, is the only one found here as well as in several other related enzymes. The enzyme from Lactococcus lactis has been extensively characterised including a remarkable crystal structure which traps the pentacoordinate transition state . . Probab=20.89 E-value=62 Score=13.60 Aligned_cols=81 Identities=19% Similarity=0.262 Sum_probs=47.8 Q ss_pred CCCHHHHHHHHCCCCEEC-C-C-CCC-CCCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCE---EEEC Q ss_conf 062157789874981200-3-4-677-5444978899999999754993899818889789999999719908---9980 Q gi|254780717|r 193 FHEAYRYFASHYNLSIVT-F-P-MSH-SVFMGAASLRNIRSKIISDKISCLFYGPEFDSKIIRSITNDTGVMS---AILD 265 (294) Q Consensus 193 ~H~a~~Yf~~~~gl~~~~-~-~-~~~-~~~~s~~~l~~~~~~ik~~~v~~if~e~~~~~~~~~~ia~e~~~~~---~~ld 265 (294) +..-..+++++=|..+.- + . +.| +.-|+ +..+++.+|++++||=+-.. +|.+..|-+.+++.- ...| T Consensus 65 s~~e~~~la~~KN~~Y~~LlD~~lTp~d~LPG---i~~lL~~Lk~~~ikialASa---SkNA~~vLekL~L~~~Fd~IvD 138 (190) T TIGR01990 65 SEEEKEELAERKNDYYVELLDKELTPEDVLPG---IKSLLEDLKKKNIKIALASA---SKNAPTVLEKLELRDYFDAIVD 138 (190) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCC---HHHHHHHHHHCCCCEEEEEH---HHHHHHHHHHHHHHHCCCEEEC T ss_conf 77899999998779999997506898660401---89999999984894887302---3448999998214220422645 Q ss_pred CCC-CCCCCCCCCHH Q ss_conf 533-67877816789 Q gi|254780717|r 266 PEG-MLIAEGPELYF 279 (294) Q Consensus 266 ~~~-~~~~~~~~~Y~ 279 (294) |-. ..++++|+=++ T Consensus 139 p~~~~~gKPdPEIFL 153 (190) T TIGR01990 139 PAEIKKGKPDPEIFL 153 (190) T ss_pred HHHHCCCCCCHHHHH T ss_conf 456217877867999 No 275 >PRK10554 outer membrane porin protein C; Provisional Probab=20.77 E-value=62 Score=13.60 Aligned_cols=21 Identities=24% Similarity=0.325 Sum_probs=15.7 Q ss_pred CHHHHHHHHHHHHHHHCCCCC Q ss_conf 915899999999996001323 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARA 21 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~ 21 (294) |||-++.++++.+++++.+.+ T Consensus 1 MKk~~LA~~i~all~ag~a~A 21 (373) T PRK10554 1 MKVKVLSLLVPALLVAGAANA 21 (373) T ss_pred CCHHHHHHHHHHHHHHCCCCE T ss_conf 954089999999986075317 No 276 >PRK11337 DNA-binding transcriptional repressor RpiR; Provisional Probab=20.64 E-value=63 Score=13.56 Aligned_cols=25 Identities=0% Similarity=0.038 Sum_probs=13.4 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 7889999999975499389981888 Q gi|254780717|r 221 AASLRNIRSKIISDKISCLFYGPEF 245 (294) Q Consensus 221 ~~~l~~~~~~ik~~~v~~if~e~~~ 245 (294) .+++.++.+..|+.++++|...... T Consensus 201 t~e~i~~~~~Ak~~Ga~vI~IT~~~ 225 (293) T PRK11337 201 TSDVKAAVELAKQNGAKIICITHSY 225 (293) T ss_pred CHHHHHHHHHHHHCCCEEEEECCCC T ss_conf 8899999999998799499976999 No 277 >COG5510 Predicted small secreted protein [Function unknown] Probab=20.48 E-value=63 Score=13.54 Aligned_cols=19 Identities=16% Similarity=0.352 Sum_probs=9.9 Q ss_pred CHHH---HHHHHHHHHHHHCCC Q ss_conf 9158---999999999960013 Q gi|254780717|r 1 MKNF---LIILIFLFFILSSVA 19 (294) Q Consensus 1 Mkk~---l~~~~~~~~~l~~~~ 19 (294) |||+ .++++++.+++++|. T Consensus 2 mk~t~l~i~~vll~s~llaaCN 23 (44) T COG5510 2 MKKTILLIALVLLASTLLAACN 23 (44) T ss_pred CHHHHHHHHHHHHHHHHHHHHH T ss_conf 0489999999999999998866 No 278 >PRK10159 outer membrane phosphoporin protein E; Provisional Probab=20.38 E-value=64 Score=13.53 Aligned_cols=21 Identities=24% Similarity=0.322 Sum_probs=15.2 Q ss_pred CHHHHHHHHHHHHHHHCCCCC Q ss_conf 915899999999996001323 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVARA 21 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~~ 21 (294) |||-++.++++.++.++.+.+ T Consensus 1 MKk~~lA~ai~a~l~s~~a~A 21 (351) T PRK10159 1 MKKSTLALVVMGIVASASVQA 21 (351) T ss_pred CCHHHHHHHHHHHHHCCCCCE T ss_conf 947389999999987265317 No 279 >TIGR02294 nickel_nikA nickel ABC transporter, periplasmic nickel-binding protein; InterPro: IPR011980 Members of this family are periplasmic nickel-binding proteins of nickel ABC transporters. Nickel is bound specifically, albeit weakly, through water molecules positioned in the binding site. The amino acids whose side chains line the binding site include Tyr-44, Met-49, Trp-122, Arg-159, Trp-420, and Tyr-424 (numbering based on the precursor sequence of Escherichia coli NikA) with the Arg contributing a hydrogen bond indirectly through a water molecule .. Probab=20.34 E-value=64 Score=13.52 Aligned_cols=18 Identities=22% Similarity=0.364 Sum_probs=9.1 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 677655556789999999 Q gi|254780717|r 130 HLWLNPLNVQYIAHVIAM 147 (294) Q Consensus 130 H~Wl~p~n~~~~a~~I~~ 147 (294) |-||...|.+.-+++.-+ T Consensus 102 H~Wl~l~n~l~~v~a~~~ 119 (513) T TIGR02294 102 HSWLELVNQLENVKALDK 119 (513) T ss_pred CCHHHHCCEEEEEEECCC T ss_conf 223321002333684360 No 280 >pfam00838 TCTP Translationally controlled tumour protein. Probab=20.33 E-value=64 Score=13.52 Aligned_cols=36 Identities=14% Similarity=0.301 Sum_probs=29.9 Q ss_pred HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH Q ss_conf 567899999999742186565889989999863122 Q gi|254780717|r 137 NVQYIAHVIAMELIKKDPRNKIIYEKNEEEFKNQLS 172 (294) Q Consensus 137 n~~~~a~~I~~~L~~~dP~~~~~y~~N~~~~~~~L~ 172 (294) ..+.+.+.|.+.|.+..|+.-+.|++|++.+.+.+. T Consensus 88 yiK~YmK~v~~~L~e~~pe~V~~Fk~~a~~~vK~il 123 (166) T pfam00838 88 YIKGYMKAVKAKLQEENPERVSLFKKNIQGWVKSLL 123 (166) T ss_pred HHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHH T ss_conf 999999999998765196789999998899999876 No 281 >COG4851 CamS Protein involved in sex pheromone biosynthesis [General function prediction only] Probab=20.25 E-value=64 Score=13.51 Aligned_cols=87 Identities=21% Similarity=0.072 Sum_probs=42.2 Q ss_pred CHHHHHHH-HHHHHHHHCCCCCCCEEEEEECHHHHHHHHHHHCCCCEEEEEECCCC-CCCCCCC----CHHHHHHHCCCC Q ss_conf 91589999-99999960013238805999722899999997079441788705899-8353356----802456650257 Q gi|254780717|r 1 MKNFLIIL-IFLFFILSSVARAGSLQVVVSIKPIHSIVSCIMQGIGTPALLVKGAS-SPHEYSL----RISEAMMLENAD 74 (294) Q Consensus 1 Mkk~l~~~-~~~~~~l~~~~~~~~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~-dpH~y~~----tp~d~~~l~~Ad 74 (294) |||.+.+. ..++++|++|+..-.=+ -.-+++.-+...-+++.++|-.. .-..|.. +.+..+-|..+. T Consensus 1 Mkktl~i~~ta~vliLs~C~~~~dd~-------edkv~qk~~~sseq~k~Iv~k~nis~n~YktvLpyk~~kaRGl~~~n 73 (382) T COG4851 1 MKKTLGIAATASVLILSGCFPFVDDT-------EDKVVQKEGKSSEQEKGIVPKANISENYYKTVLPYKAGKARGLGSAN 73 (382) T ss_pred CCHHHHHHHHHHHHHHHHCCCCCCCC-------CCHHHHCCCCCHHHCCCCCCCCCCCCCCHHEEEEECCCCCCCHHHHH T ss_conf 94015589999999986116766873-------00234303770332011213544443313224221024445500665 Q ss_pred EEE-EECCCCCHHHHHHHHHC Q ss_conf 899-96766103568999731 Q gi|254780717|r 75 IIF-WLGAEMESFLVKPLHSL 94 (294) Q Consensus 75 lii-~~G~~~E~~~~~~~~~~ 94 (294) +.= +.+..||..+.++.+.. T Consensus 74 ~~nr~dide~E~GLm~lake~ 94 (382) T COG4851 74 VYNRYDIDELELGLMRLAKEH 94 (382) T ss_pred HHCCCCHHHHHHHHHHHHHHC T ss_conf 540156047888899987740 No 282 >pfam11672 DUF3268 Protein of unknown function (DUF3268). This family of proteins has no known function. Probab=20.24 E-value=56 Score=13.86 Aligned_cols=32 Identities=25% Similarity=0.400 Sum_probs=19.8 Q ss_pred HHHHHHCCCCCCCEEEECCCHHHHHHHHCCCC Q ss_conf 89998503002781462062157789874981 Q gi|254780717|r 176 KELHSILQPVEKKKIIVFHEAYRYFASHYNLS 207 (294) Q Consensus 176 ~~~~~~~~~~~~~~~i~~H~a~~Yf~~~~gl~ 207 (294) ......|.|+-+...++-.++|.++++.+|+. T Consensus 64 ~~ah~~fd~lw~~~~~~R~~aY~wLA~~lgl~ 95 (102) T pfam11672 64 ILAHRAFDPIWRSKHMSRSEAYAWLAAKLGLP 95 (102) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCC T ss_conf 99999989998756647899999999981998 No 283 >TIGR00885 fucP L-fucose:H+ symporter permease; InterPro: IPR005275 This family describes the L-fucose permease in bacteria. L-fucose(6-deoxy-L-galactose) is a monosaccharide found in glycoproteins and cell wall polysaccharides. L-fucose is used in bacteria through an inducible pathway mediated by at least four enzymes: a permease, isomerase, kinase and an aldolase which are encoded by fucP, fucI, fucK, fucA respectively. . Probab=20.21 E-value=44 Score=14.53 Aligned_cols=63 Identities=16% Similarity=0.161 Sum_probs=53.0 Q ss_pred CCEEEEEECHHHHHHHHHHHCCCCEEEEEECCCCCCC----CCCCCHHHHHHHCCCCEEEEECCCCCHHHH Q ss_conf 8805999722899999997079441788705899835----335680245665025789996766103568 Q gi|254780717|r 22 GSLQVVVSIKPIHSIVSCIMQGIGTPALLVKGASSPH----EYSLRISEAMMLENADIIFWLGAEMESFLV 88 (294) Q Consensus 22 ~~~~Vv~s~~pl~~lv~~I~gd~~~V~~L~~~g~dpH----~y~~tp~d~~~l~~Adlii~~G~~~E~~~~ 88 (294) -+++..=|+.|+++++....+.+.--..+-+.++|.. .=.++|++..+++++|+.... +|++. T Consensus 130 rRLNlAQsFNp~Gsi~g~~va~~lil~nl~~~~~D~~Gnlif~~ls~~e~~aik~sdla~~~----~PY~i 196 (427) T TIGR00885 130 RRLNLAQSFNPLGSIIGMVVAQQLILSNLESQSQDVLGNLIFDKLSEEELSAIKESDLASVR----TPYLI 196 (427) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHH----HHHHH T ss_conf 02225630215789999999989886328876642578789884297888886465576661----23899 No 284 >COG0287 TyrA Prephenate dehydrogenase [Amino acid transport and metabolism] Probab=20.19 E-value=64 Score=13.50 Aligned_cols=120 Identities=8% Similarity=0.105 Sum_probs=69.0 Q ss_pred HHHHHHHHHHHHCCCCC--CCEEEECCCHHHH--HHHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 12223589998503002--7814620621577--8987498120034677544497889999999975499389981888 Q gi|254780717|r 170 QLSQLDKELHSILQPVE--KKKIIVFHEAYRY--FASHYNLSIVTFPMSHSVFMGAASLRNIRSKIISDKISCLFYGPEF 245 (294) Q Consensus 170 ~L~~l~~~~~~~~~~~~--~~~~i~~H~a~~Y--f~~~~gl~~~~~~~~~~~~~s~~~l~~~~~~ik~~~v~~if~e~~~ 245 (294) .+........+.+...- +..|+..||=|+- +..-|.=.++.+....+.+ ...+.++.+..+.-+.+++.+++.. T Consensus 96 Dv~S~K~~v~~a~~~~~~~~~~~vg~HPM~G~~~~~~lf~~~~~vltp~~~~~--~~~~~~~~~~~~~~ga~~v~~~~ee 173 (279) T COG0287 96 DVGSVKSSVVEAMEKYLPGDVRFVGGHPMFGPEADAGLFENAVVVLTPSEGTE--KEWVEEVKRLWEALGARLVEMDAEE 173 (279) T ss_pred CCCCHHHHHHHHHHHHCCCCCEEEECCCCCCCCCCCCCCCCCEEEEECCCCCC--HHHHHHHHHHHHHCCCEEEECCHHH T ss_conf 36420178999999746677707821899987431123379679994799987--8999999999997498899868889 Q ss_pred CHHHHHHHH--------------HHHCC--------CEEEECCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 978999999--------------97199--------08998053367877816789999999999987 Q gi|254780717|r 246 DSKIIRSIT--------------NDTGV--------MSAILDPEGMLIAEGPELYFQLMRSMSNSIAK 291 (294) Q Consensus 246 ~~~~~~~ia--------------~e~~~--------~~~~ld~~~~~~~~~~~~Y~~~m~~n~~~l~k 291 (294) ..+.+..++ .+.+. .......+.......|..|.+++.+|-+.+.+ T Consensus 174 HD~~~a~vshLpH~~a~al~~~~~~~~~~~~~~~~~as~~frd~tRia~~~P~m~~dI~~~N~~~~l~ 241 (279) T COG0287 174 HDRVMAAVSHLPHAAALALANALAKLETEELLVLKLASGGFRDITRIASSDPEMYADIQLSNKEALLE 241 (279) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCCEEHEECCCCHHHHHHHHHCCHHHHH T ss_conf 87899999889999999999998614720267886245654430011169937999999868186899 No 285 >PRK11372 lysozyme inhibitor; Provisional Probab=20.04 E-value=65 Score=13.48 Aligned_cols=18 Identities=39% Similarity=0.575 Sum_probs=11.6 Q ss_pred CHHHHHHHHHHHHHHHCCCC Q ss_conf 91589999999999600132 Q gi|254780717|r 1 MKNFLIILIFLFFILSSVAR 20 (294) Q Consensus 1 Mkk~l~~~~~~~~~l~~~~~ 20 (294) ||+++++ ++.+++++|+. T Consensus 3 MKkll~~--~~~llLsGCs~ 20 (109) T PRK11372 3 MKKLLII--CLPVLLTGCSA 20 (109) T ss_pred HHHHHHH--HHHHHHHCCCC T ss_conf 4889999--99999704543 Done!