Query gi|255764489|ref|YP_003065094.2| stationary phase survival protein SurE [Candidatus Liberibacter asiaticus str. psy62] Match_columns 250 No_of_seqs 158 out of 1614 Neff 7.7 Searched_HMMs 39220 Date Sun May 29 23:21:15 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 255764489.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK13933 stationary phase surv 100.0 0 0 610.9 25.8 248 1-250 1-253 (253) 2 PRK00346 surE stationary phase 100.0 0 0 608.5 26.1 246 1-249 1-246 (246) 3 PRK13935 stationary phase surv 100.0 0 0 609.6 24.7 246 1-249 1-250 (255) 4 PRK13932 stationary phase surv 100.0 0 0 604.7 24.9 244 1-247 7-254 (258) 5 PRK13931 stationary phase surv 100.0 0 0 592.3 23.6 248 1-248 1-260 (261) 6 PRK13934 stationary phase surv 100.0 0 0 582.1 22.9 244 1-249 1-260 (266) 7 COG0496 SurE Predicted acid ph 100.0 0 0 578.0 23.3 248 1-249 1-252 (252) 8 TIGR00087 surE 5'/3'-nucleotid 100.0 0 0 524.6 19.1 236 1-236 1-326 (326) 9 LOAD_surE consensus 100.0 0 0 471.0 18.7 188 1-189 1-191 (192) 10 pfam01975 SurE Survival protei 100.0 0 0 469.2 18.7 185 2-187 1-189 (190) 11 cd03784 GT1_Gtf_like This fami 96.2 0.028 7E-07 34.6 7.2 112 1-126 1-134 (401) 12 pfam04007 DUF354 Protein of un 95.9 0.027 6.8E-07 34.7 6.1 104 1-125 1-111 (335) 13 PRK10307 predicted glycosyl tr 95.8 0.11 2.9E-06 30.8 8.8 34 1-34 1-39 (415) 14 cd03802 GT1_AviGT4_like This f 94.3 0.28 7.3E-06 28.4 7.3 37 1-38 1-47 (335) 15 PRK09191 two-component respons 91.6 0.49 1.2E-05 26.9 5.3 45 75-125 174-218 (261) 16 cd04962 GT1_like_5 This family 89.9 1.7 4.3E-05 23.6 7.8 82 1-89 1-90 (371) 17 cd03817 GT1_UGDG_like This fam 88.9 2 5.2E-05 23.1 9.7 108 2-122 1-113 (374) 18 cd03808 GT1_cap1E_like This fa 87.6 2.2 5.6E-05 23.0 6.1 79 5-89 5-86 (359) 19 PRK13435 response regulator; P 87.5 1.9 5E-05 23.3 5.8 80 1-125 2-82 (141) 20 cd03795 GT1_like_4 This family 86.9 2.7 6.8E-05 22.4 6.7 84 2-89 1-89 (357) 21 cd03814 GT1_like_2 This family 86.8 2.7 6.9E-05 22.4 6.3 105 2-123 1-113 (364) 22 cd03799 GT1_amsK_like This is 86.2 1.7 4.4E-05 23.6 4.9 94 13-122 14-108 (355) 23 cd03132 GATase1_catalase Type 86.1 1.7 4.4E-05 23.6 4.9 35 3-39 6-40 (142) 24 pfam07355 GRDB Glycine/sarcosi 85.2 2 5.1E-05 23.2 4.9 52 69-123 65-117 (349) 25 cd03794 GT1_wbuB_like This fam 84.8 3.4 8.7E-05 21.8 10.5 112 2-122 1-129 (394) 26 cd03820 GT1_amsD_like This fam 83.8 3.8 9.6E-05 21.5 6.9 72 14-89 17-89 (348) 27 PRK06849 hypothetical protein; 82.1 2.6 6.6E-05 22.5 4.3 73 1-89 5-82 (387) 28 cd03825 GT1_wcfI_like This fam 80.2 3.1 8E-05 22.0 4.3 37 1-37 1-40 (365) 29 PRK10365 transcriptional regul 75.1 7.2 0.00018 19.8 6.7 30 1-33 6-35 (441) 30 TIGR01027 proB glutamate 5-kin 74.5 4.5 0.00012 21.0 3.8 31 4-34 11-47 (379) 31 PRK12767 carbamoyl phosphate s 74.4 7.5 0.00019 19.7 7.1 74 1-91 2-77 (325) 32 TIGR03107 glu_aminopep glutamy 73.8 3.7 9.5E-05 21.5 3.2 22 10-31 15-36 (350) 33 cd03823 GT1_ExpE7_like This fa 73.8 7.7 0.0002 19.6 6.2 41 2-42 1-47 (359) 34 COG1091 RfbD dTDP-4-dehydrorha 73.5 7.9 0.0002 19.5 5.1 97 1-130 1-105 (281) 35 cd03805 GT1_ALG2_like This fam 72.3 7 0.00018 19.9 4.3 40 1-40 1-43 (392) 36 cd03801 GT1_YqgM_like This fam 72.0 8.6 0.00022 19.3 7.5 111 2-126 1-117 (374) 37 cd03134 GATase1_PfpI_like A ty 71.8 8 0.0002 19.5 4.5 42 2-43 1-42 (165) 38 PRK09836 DNA-binding transcrip 68.8 4.4 0.00011 21.1 2.6 81 1-126 1-81 (226) 39 PRK13782 phosphocarrier protei 64.2 12 0.00032 18.3 4.2 79 2-82 5-83 (85) 40 PRK10675 UDP-galactose-4-epime 63.2 8.4 0.00021 19.4 3.2 31 1-34 1-31 (338) 41 COG1817 Uncharacterized protei 63.2 3.5 9E-05 21.7 1.2 102 1-124 1-111 (346) 42 PRK00726 murG N-acetylglucosam 62.9 13 0.00033 18.2 7.6 108 3-125 4-122 (359) 43 PRK11517 transcriptional regul 61.9 7.1 0.00018 19.8 2.6 80 1-126 1-80 (223) 44 PRK09189 uroporphyrinogen-III 61.6 14 0.00034 18.1 4.0 35 1-40 1-35 (237) 45 cd03796 GT1_PIG-A_like This fa 60.8 14 0.00036 18.0 9.5 109 2-122 1-118 (398) 46 cd03821 GT1_Bme6_like This fam 60.1 15 0.00037 17.9 5.9 28 14-41 18-45 (375) 47 TIGR03466 HpnA hopanoid-associ 59.5 10 0.00026 18.9 3.1 32 1-35 1-32 (328) 48 PRK13780 phosphocarrier protei 56.8 16 0.00042 17.6 3.7 78 2-81 5-82 (88) 49 PRK11083 DNA-binding response 56.4 17 0.00043 17.5 5.1 81 1-126 4-84 (229) 50 PRK08622 galactose-6-phosphate 56.1 14 0.00036 18.0 3.3 35 1-37 1-36 (171) 51 cd03785 GT1_MurG MurG is an N- 55.4 18 0.00045 17.4 7.9 100 11-125 12-120 (350) 52 TIGR01925 spIIAB anti-sigma F 54.8 7.9 0.0002 19.5 1.8 45 6-57 79-125 (137) 53 PRK10816 DNA-binding transcrip 54.7 11 0.00028 18.7 2.6 80 1-125 1-80 (223) 54 PRK12615 galactose-6-phosphate 54.4 15 0.00039 17.8 3.3 38 1-40 1-39 (171) 55 PRK05476 S-adenosyl-L-homocyst 54.4 18 0.00047 17.3 4.8 100 17-127 59-179 (427) 56 PRK10850 phosphohistidinoprote 54.3 18 0.00046 17.4 3.6 77 2-80 5-81 (85) 57 smart00448 REC cheY-homologous 54.1 18 0.00047 17.3 4.0 28 1-31 1-28 (55) 58 PRK10710 DNA-binding transcrip 53.5 19 0.00048 17.2 4.9 80 1-126 11-90 (240) 59 cd03818 GT1_ExpC_like This fam 53.1 18 0.00045 17.4 3.4 35 2-38 1-35 (396) 60 PRK00654 glgA glycogen synthas 52.5 20 0.0005 17.1 3.8 38 1-38 1-45 (476) 61 COG0726 CDA1 Predicted xylanas 52.3 15 0.00037 17.9 2.9 36 2-37 66-103 (267) 62 cd03791 GT1_Glycogen_synthase_ 52.1 18 0.00047 17.3 3.4 24 16-39 22-45 (476) 63 pfam03033 Glyco_transf_28 Glyc 52.0 20 0.00051 17.1 4.2 96 14-126 14-117 (136) 64 PRK10766 DNA-binding transcrip 51.1 21 0.00053 17.0 3.9 79 1-125 3-81 (224) 65 PRK06953 short chain dehydroge 50.0 21 0.00055 16.9 5.3 74 1-89 1-76 (222) 66 cd03169 GATase1_PfpI_1 Type 1 49.4 22 0.00056 16.8 4.6 37 2-38 1-37 (180) 67 PRK10336 DNA-binding transcrip 49.4 15 0.00039 17.8 2.6 80 1-125 1-80 (219) 68 PRK10643 DNA-binding transcrip 47.6 17 0.00043 17.5 2.6 80 1-125 1-80 (222) 69 cd03822 GT1_ecORF704_like This 46.8 24 0.00061 16.6 6.8 95 14-125 17-113 (366) 70 TIGR02177 PorB_KorB 2-oxoacid: 46.7 19 0.0005 17.1 2.8 34 217-250 265-302 (302) 71 pfam02951 GSH-S_N Prokaryotic 46.7 23 0.00059 16.7 3.2 38 1-38 1-42 (119) 72 cd01714 ETF_beta The electron 46.5 24 0.00062 16.5 5.7 102 16-127 38-145 (202) 73 PRK10446 ribosomal protein S6 44.4 26 0.00067 16.3 3.9 36 1-38 1-37 (300) 74 pfam07075 DUF1343 Protein of u 44.0 27 0.00068 16.3 4.0 109 3-124 2-118 (359) 75 KOG3619 consensus 43.7 6.3 0.00016 20.1 -0.1 74 50-126 154-254 (867) 76 COG2065 PyrR Pyrimidine operon 42.9 12 0.0003 18.5 1.2 28 98-126 102-131 (179) 77 PRK05884 short chain dehydroge 42.9 28 0.0007 16.2 3.9 74 1-89 1-75 (223) 78 TIGR01283 nifE nitrogenase MoF 42.9 9.7 0.00025 19.0 0.8 24 102-126 265-288 (470) 79 PRK13668 hypothetical protein; 42.6 28 0.00071 16.2 3.9 12 222-233 234-245 (266) 80 PRK06769 hypothetical protein; 41.8 24 0.00062 16.5 2.7 31 14-47 31-61 (175) 81 TIGR01862 N2-ase-Ialpha nitrog 41.5 13 0.00032 18.3 1.2 20 104-124 261-280 (510) 82 cd00367 PTS-HPr_like Histidine 40.2 31 0.00078 15.9 5.0 75 3-79 2-76 (77) 83 COG1925 FruB Phosphotransferas 40.0 31 0.00078 15.9 3.9 78 2-81 5-82 (88) 84 PRK05205 bifunctional pyrimidi 39.7 17 0.00042 17.6 1.6 22 98-119 101-124 (176) 85 pfam02057 Glyco_hydro_59 Glyco 37.0 30 0.00076 16.0 2.5 48 5-52 179-227 (669) 86 pfam02684 LpxB Lipid-A-disacch 37.0 16 0.0004 17.7 1.1 35 10-44 88-126 (373) 87 COG2110 Predicted phosphatase 36.8 35 0.00088 15.6 5.8 69 84-152 77-149 (179) 88 TIGR01317 GOGAT_sm_gam glutama 36.4 30 0.00076 16.0 2.4 18 15-33 163-180 (517) 89 TIGR02962 hdxy_isourate hydrox 36.3 12 0.00031 18.4 0.4 11 116-126 12-22 (117) 90 COG1259 Uncharacterized conser 36.1 35 0.0009 15.5 3.8 31 47-78 87-117 (151) 91 pfam04432 FrhB_FdhB_C Coenzyme 36.0 36 0.00091 15.5 3.1 36 61-98 5-45 (162) 92 pfam00201 UDPGT UDP-glucoronos 33.8 39 0.00098 15.3 4.2 38 2-39 2-39 (501) 93 PRK10125 predicted glycosyl tr 33.7 39 0.00099 15.3 4.6 34 1-34 1-37 (405) 94 pfam06406 StbA StbA protein. T 33.1 40 0.001 15.3 3.2 60 1-67 1-60 (318) 95 PRK08340 glucose-1-dehydrogena 33.1 40 0.001 15.3 3.7 79 1-89 1-82 (259) 96 pfam00381 PTS-HPr PTS HPr comp 33.1 40 0.001 15.3 4.2 77 2-80 4-80 (83) 97 TIGR01279 DPOR_bchN light-inde 33.0 11 0.00028 18.6 -0.2 17 62-78 91-107 (458) 98 cd00948 FBP_aldolase_I_a Fruct 31.3 34 0.00088 15.6 2.1 131 29-165 49-221 (330) 99 PRK08250 glutamine amidotransf 31.1 43 0.0011 15.0 4.5 36 1-38 1-36 (235) 100 PRK11249 katE hydroperoxidase 30.4 44 0.0011 15.0 4.4 27 9-35 579-605 (725) 101 cd01831 Endoglucanase_E_like E 29.8 45 0.0011 14.9 2.6 12 84-97 56-67 (169) 102 cd01833 XynB_like SGNH_hydrola 29.7 42 0.0011 15.1 2.3 44 42-93 6-52 (157) 103 cd03800 GT1_Sucrose_synthase T 29.5 46 0.0012 14.9 3.2 93 16-122 27-129 (398) 104 PRK10897 phosphohistidinoprote 29.3 46 0.0012 14.9 4.7 82 2-84 6-87 (90) 105 TIGR03449 mycothiol_MshA UDP-N 29.0 46 0.0012 14.8 3.7 105 6-122 16-129 (405) 106 cd03798 GT1_wlbH_like This fam 28.6 47 0.0012 14.8 10.6 27 14-40 18-44 (377) 107 TIGR01282 nifD nitrogenase mol 28.6 24 0.00062 16.6 0.9 30 16-46 152-182 (521) 108 pfam10841 DUF2644 Protein of u 28.4 22 0.00057 16.8 0.7 11 3-13 2-12 (60) 109 PRK09581 pleD response regulat 28.4 48 0.0012 14.8 4.8 29 2-33 4-32 (457) 110 KOG4120 consensus 28.1 32 0.00081 15.8 1.4 20 78-98 144-163 (426) 111 PRK00025 lpxB lipid-A-disaccha 28.0 30 0.00075 16.0 1.3 22 101-124 269-290 (382) 112 PRK05429 gamma-glutamyl kinase 27.9 49 0.0012 14.7 3.8 31 4-34 19-53 (372) 113 cd04256 AAK_P5CS_ProBA AAK_P5C 27.8 49 0.0012 14.7 3.5 31 4-34 19-54 (284) 114 TIGR01380 glut_syn glutathione 27.7 49 0.0013 14.7 3.8 107 18-168 22-144 (322) 115 cd03819 GT1_WavL_like This fam 27.7 49 0.0013 14.7 3.6 25 14-38 14-38 (355) 116 PRK02858 germination protease; 27.1 35 0.0009 15.6 1.5 54 64-126 165-239 (367) 117 cd06428 M1P_guanylylT_A_like_N 27.0 50 0.0013 14.6 6.5 22 144-166 121-142 (257) 118 PRK12320 hypothetical protein; 26.9 51 0.0013 14.6 3.8 44 1-47 1-44 (699) 119 PRK12362 germination protease; 26.8 36 0.00091 15.5 1.5 54 64-126 153-227 (324) 120 COG0451 WcaG Nucleoside-diphos 26.6 51 0.0013 14.6 3.9 39 1-42 1-39 (314) 121 PRK09265 aminotransferase AlaT 26.3 45 0.0012 14.9 2.0 33 2-37 97-129 (404) 122 PRK10241 hydroxyacylglutathion 26.1 52 0.0013 14.5 2.7 88 3-110 49-137 (251) 123 pfam07429 Fuc4NAc_transf 4-alp 25.6 54 0.0014 14.5 4.5 17 227-243 217-233 (361) 124 PRK08674 bifunctional phosphog 25.3 54 0.0014 14.4 2.2 25 101-125 85-113 (328) 125 pfam08323 Glyco_transf_5 Starc 25.1 55 0.0014 14.4 3.3 23 16-38 21-43 (229) 126 TIGR01382 PfpI intracellular p 24.7 55 0.0014 14.4 4.3 46 2-47 1-47 (189) 127 PRK13728 conjugal transfer pro 24.6 44 0.0011 14.9 1.7 49 113-164 95-147 (181) 128 cd02537 GT8_Glycogenin Glycoge 24.5 56 0.0014 14.3 3.1 31 3-35 5-37 (240) 129 PRK00358 pyrH uridylate kinase 24.5 56 0.0014 14.3 2.9 30 4-33 11-45 (231) 130 pfam02577 DUF151 Uncharacteriz 23.8 58 0.0015 14.2 2.5 21 58-78 91-111 (133) 131 TIGR02114 coaB_strep phosphopa 23.7 49 0.0012 14.7 1.7 108 1-156 1-122 (253) 132 PTZ00019 fructose-bisphosphate 23.6 55 0.0014 14.4 2.0 109 56-165 100-228 (356) 133 PRK12314 gamma-glutamyl kinase 23.5 59 0.0015 14.2 3.3 30 4-33 20-53 (265) 134 COG0763 LpxB Lipid A disacchar 23.5 45 0.0012 14.9 1.5 19 101-121 269-287 (381) 135 CHL00148 orf27 Ycf27; Reviewed 23.4 59 0.0015 14.2 2.6 80 1-126 7-86 (240) 136 pfam03418 Peptidase_A25 Germin 23.0 49 0.0013 14.7 1.6 39 81-126 168-227 (355) 137 cd00401 AdoHcyase S-adenosyl-L 22.6 61 0.0016 14.1 4.2 98 18-127 51-173 (413) 138 TIGR02738 TrbB type-F conjugat 22.6 43 0.0011 15.0 1.3 51 113-169 97-147 (176) 139 PRK06951 consensus 22.5 61 0.0016 14.1 3.0 48 1-52 1-64 (254) 140 PRK10017 putative pyruvyl tran 22.4 47 0.0012 14.8 1.4 34 1-34 1-40 (426) 141 pfam05221 AdoHcyase S-adenosyl 22.4 62 0.0016 14.1 3.3 100 17-127 57-181 (430) 142 PRK12342 putative electron tra 22.3 62 0.0016 14.1 7.0 103 16-128 38-147 (254) 143 TIGR01130 ER_PDI_fam protein d 22.2 44 0.0011 15.0 1.3 37 28-69 18-78 (522) 144 PRK03359 putative electron tra 21.9 63 0.0016 14.0 5.9 99 15-123 38-145 (256) 145 PRK11574 hypothetical protein; 21.9 63 0.0016 14.0 3.0 61 2-62 4-67 (196) 146 TIGR03590 PseG pseudaminic aci 21.9 45 0.0012 14.9 1.3 40 1-41 1-46 (280) 147 TIGR00213 GmhB_yaeD D,D-heptos 21.9 63 0.0016 14.0 2.9 30 13-45 28-57 (178) 148 COG0745 OmpR Response regulato 21.9 63 0.0016 14.0 2.7 31 1-34 1-31 (229) 149 PRK08942 D,D-heptose 1,7-bisph 21.8 63 0.0016 14.0 2.6 30 14-46 32-61 (181) 150 TIGR01821 5aminolev_synth 5-am 21.7 35 0.0009 15.5 0.7 19 103-124 264-282 (427) 151 PRK05928 hemD uroporphyrinogen 21.2 65 0.0017 13.9 3.9 33 1-38 2-34 (252) 152 pfam08660 Alg14 Oligosaccharid 21.2 65 0.0017 13.9 4.9 95 14-121 12-115 (166) 153 cd04955 GT1_like_6 This family 21.1 66 0.0017 13.9 3.3 24 15-38 20-43 (363) 154 TIGR02524 dot_icm_DotB Dot/Icm 21.1 66 0.0017 13.9 3.6 39 75-122 204-242 (358) 155 PRK13151 consensus 20.8 67 0.0017 13.9 4.3 31 1-34 1-31 (195) 156 COG1497 Predicted transcriptio 20.4 47 0.0012 14.8 1.1 22 104-125 213-234 (260) 157 cd05017 SIS_PGI_PMI_1 The memb 20.4 68 0.0017 13.8 2.2 25 101-125 50-78 (119) 158 PRK05246 glutathione synthetas 20.3 68 0.0017 13.8 3.0 37 1-37 2-42 (316) 159 pfam00274 Glycolytic Fructose- 20.2 68 0.0017 13.8 2.7 29 136-164 189-218 (348) 160 cd03811 GT1_WabH_like This fam 20.1 69 0.0018 13.8 6.6 93 16-124 18-111 (353) No 1 >PRK13933 stationary phase survival protein SurE; Provisional Probab=100.00 E-value=0 Score=610.90 Aligned_cols=248 Identities=31% Similarity=0.497 Sum_probs=233.4 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEE-----EEEEEEECCCHHHHHHHH Q ss_conf 926882689868856999999998548919998179985650320224897058897-----110476327279999999 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTI-----SKKRFAVHGTPVDCVVIA 75 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~-----~~~~~~v~GtPaDcv~~~ 75 (250) |||||||||||+||||++|+++|++. ++||||||++||||+|||||+++||+++++ +.++|+|+|||||||++| T Consensus 1 MrILlTNDDGi~apGi~~L~~~l~~~-~~V~VvAP~~~~Sg~s~sit~~~pl~~~~~~~~~~~~~~yav~GTPaDCV~la 79 (253) T PRK13933 1 MNILLTNDDGINAEGINTLAELLSKY-HNVIIVAPENQRSASSHSITIYEPIIIKEVKLEGYNCKAYSISGTPADCVRVA 79 (253) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHCC-CCEEEEECCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEECCCHHHHHHHH T ss_conf 96999727888977899999998618-96899944999868847756789834788521788765689359879999999 Q ss_pred HHHCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCC Q ss_conf 87504645419997148885310233345777899889871898532220013567661478888849999999867786 Q gi|255764489|r 76 LQKMSDKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQAYTYENMIPWEVSETHAPRVLRQLLKTQIP 155 (250) Q Consensus 76 l~~l~~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~~~~~~~~~~~~a~~~~~~ii~~l~~~~~p 155 (250) |+++++++|||||||||+|.|+|.|++|||||||||||+++||||||+||++......+|+.+++++.++++++.+.++| T Consensus 80 l~~l~~~~pDLVvSGIN~G~NlG~dv~ySGTVgAA~Ea~l~GiPsIAvS~~~~~~~~~d~~~a~~~~~~ii~~~~~~~~~ 159 (253) T PRK13933 80 LDKLLPDNIDMVISGINKGLNIGNDILYSGTVSAAIEGAIYKVPSIAVSADVKKGKDENYKIAAKYALEVLNILKKEYLK 159 (253) T ss_pred HHHHCCCCCCEEEECCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCC T ss_conf 85215899999997881774788382306788999999874998289997516897727699999999999999864898 Q ss_pred CCCEEECCCCCCCCHHCCCCEEEECCCCCCCCCCEEECCCCCCCEEEEECCCCCCCCCCCCCHHHHHCCCEEEECCCCCC Q ss_conf 67354314567881015883891376654355427840688651699723677778988550789968988990215356 Q gi|255764489|r 156 NTTLCNINFPRCSPEEVQKTVVTAQGKPCFSIDAKQISTNDNMSHYCLTFGDHLKNLCEKSDAFAIQHNMISVTPITTDL 235 (250) Q Consensus 156 ~~~vlNINiP~~~~~~ikg~k~t~~g~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~e~TD~~Al~~GyISVTPL~~Dl 235 (250) .+.+||||||.++.+++||+|+|+||+|.|...+.+..+++++.+||+.. ....+..++||.+|+++||||||||++|| T Consensus 160 ~~~~lNVN~P~~~~~~ikGik~t~~g~r~~~~~~~~~~d~~g~~~y~~~~-~~~~~~~~~tD~~al~~g~ISVTPL~~Dl 238 (253) T PRK13933 160 NDVVLNLNVPFCSEEEIKGLKVCKVGNKTFNTRYSEEIDEEGNKVYKLEG-DINKDIYEGTDVYYLRNGYVTLTPLHYDL 238 (253) T ss_pred CCEEEEEECCCCCHHCCCCEEEEECCCCCCCCCEEEEECCCCCCEEEECC-CCCCCCCCCCHHHHHHCCCEEEEECCCCC T ss_conf 67278864698963207876998778764677515736889986499867-65778899868999979938982354168 Q ss_pred CCHHHHHHHHHHHCC Q ss_conf 488999999987459 Q gi|255764489|r 236 TDYNSQQYISLSLET 250 (250) Q Consensus 236 T~~~~l~~L~~~l~~ 250 (250) |||+.|++|++||=| T Consensus 239 T~~~~l~~l~~~f~~ 253 (253) T PRK13933 239 TNFKILEEVEQLFLS 253 (253) T ss_pred CCHHHHHHHHHHHCC T ss_conf 698999999998649 No 2 >PRK00346 surE stationary phase survival protein SurE; Provisional Probab=100.00 E-value=0 Score=608.55 Aligned_cols=246 Identities=35% Similarity=0.542 Sum_probs=235.6 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCC Q ss_conf 92688268986885699999999854891999817998565032022489705889711047632727999999987504 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMS 80 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~ 80 (250) |||||||||||+||||++|+++|++++ +|+||||++||||+|||||+++|+++++++.+.|+|+|||||||++||++++ T Consensus 1 M~ILlTNDDGi~a~Gl~~L~~~l~~~~-~V~VvAP~~~~Sg~g~sit~~~pl~~~~~~~~~~av~GTPaDCV~~al~~l~ 79 (246) T PRK00346 1 MRILLTNDDGIHAPGIAALAEALRELA-DVTVVAPDRERSGASHSLTLTRPLRVEKFDNGFYAVDGTPTDCVHLALNGLL 79 (246) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCC-CEEEEECCCCCCCCCCCCCCCCCEEEEEECCCEEEECCCHHHHHHHHHHHHC T ss_conf 959997078889878999999997569-8999913888756645544689758999337379973878999999986414 Q ss_pred CCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEE Q ss_conf 64541999714888531023334577789988987189853222001356766147888884999999986778667354 Q gi|255764489|r 81 DKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQAYTYENMIPWEVSETHAPRVLRQLLKTQIPNTTLC 160 (250) Q Consensus 81 ~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~~~~~~~~~~~~a~~~~~~ii~~l~~~~~p~~~vl 160 (250) ++||||||||||+|.|+|.|++|||||||||||+++||||||+||+.. ...+|+.+++++.++++++++.++|++++| T Consensus 80 ~~~PDLVvSGIN~G~N~G~dv~ySGTVgAA~Ea~~~GipsIAvS~~~~--~~~~~~~a~~~~~~ii~~l~~~~~p~~~~l 157 (246) T PRK00346 80 DEKPDLVVSGINHGANLGDDVLYSGTVAAAMEGRLLGIPAIAVSLAGK--SPAHFETAARVARKLVEKLLAKPLPPGTLL 157 (246) T ss_pred CCCCCEEEECCCCCCCCCCCEEEHHHHHHHHHHHHCCCCCEEEEECCC--CCCCHHHHHHHHHHHHHHHHHCCCCCCCEE T ss_conf 899878996676777576133403577899999874998269870348--874479999999999999996599878179 Q ss_pred ECCCCCCCCHHCCCCEEEECCCCCCCCCCEEECCCCCCCEEEEECCCCCCCCCCCCCHHHHHCCCEEEECCCCCCCCHHH Q ss_conf 31456788101588389137665435542784068865169972367777898855078996898899021535648899 Q gi|255764489|r 161 NINFPRCSPEEVQKTVVTAQGKPCFSIDAKQISTNDNMSHYCLTFGDHLKNLCEKSDAFAIQHNMISVTPITTDLTDYNS 240 (250) Q Consensus 161 NINiP~~~~~~ikg~k~t~~g~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~e~TD~~Al~~GyISVTPL~~DlT~~~~ 240 (250) |||||.++..++||+|+|+||++.|...+.+..+++++.|||++......+..++||.+|+++||||||||++|||||+. T Consensus 158 NVN~P~~~~~~~~gik~t~~g~~~~~~~~~~~~~~~g~~~yw~~~~~~~~~~~~~tD~~al~~g~ISVTPL~~d~T~~~~ 237 (246) T PRK00346 158 NVNVPDLPLEEIKGIRVTRLGKRHYAEPVIKRTDPRGRPYYWIGGAGVEDDAGEGTDFHAVAEGYVSITPLQLDLTAYEA 237 (246) T ss_pred EEECCCCCCCCCCCEEEEECCCCCCCCCEEEEECCCCCEEEEECCCCCCCCCCCCCHHHHHHCCCEEEEECCCCCCCHHH T ss_conf 86479998556788499878875326661475679886079966777788899896899997990898035747759899 Q ss_pred HHHHHHHHC Q ss_conf 999998745 Q gi|255764489|r 241 QQYISLSLE 249 (250) Q Consensus 241 l~~L~~~l~ 249 (250) |++|++||| T Consensus 238 l~~l~~~l~ 246 (246) T PRK00346 238 LDVLKDWLE 246 (246) T ss_pred HHHHHHHHC T ss_conf 999999759 No 3 >PRK13935 stationary phase survival protein SurE; Provisional Probab=100.00 E-value=0 Score=609.56 Aligned_cols=246 Identities=29% Similarity=0.436 Sum_probs=231.2 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEE----EEEECCCHHHHHHHHH Q ss_conf 926882689868856999999998548919998179985650320224897058897110----4763272799999998 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKK----RFAVHGTPVDCVVIAL 76 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~----~~~v~GtPaDcv~~~l 76 (250) |||||||||||+||||++|+++|++. ++||||||++||||+|||||+++||+++++... .|+|+|||||||++|| T Consensus 1 M~ILlTNDDGi~a~Gi~~L~~~l~~~-~~V~VvAP~~~~Sg~g~siT~~~pl~~~~~~~~~~~~~y~v~GTPaDCV~lal 79 (255) T PRK13935 1 MNILVTNDDGVTSKGILCLARYLSKK-HEVFVVAPEKEQSATGHAITLRVPLWAKKVDINERFEIYAVTGTPADCVKIGY 79 (255) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHCC-CCEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEECCCHHHHHHHHH T ss_conf 96999707998977899999998529-93999927998767865642488704787404888418993698489999998 Q ss_pred HHCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 75046454199971488853102333457778998898718985322200135676614788888499999998677866 Q gi|255764489|r 77 QKMSDKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQAYTYENMIPWEVSETHAPRVLRQLLKTQIPN 156 (250) Q Consensus 77 ~~l~~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~~~~~~~~~~~~a~~~~~~ii~~l~~~~~p~ 156 (250) +++++++|||||||||+|.|+|.||+|||||||||||+++||||||||+.. . +..+|+.+++++.++++++.++++|+ T Consensus 80 ~~l~~~~pDLVvSGIN~G~NlG~dv~YSGTVgAA~Eg~l~GipsIAvS~~~-~-~~~~~~~a~~~~~~ii~~~~~~~~p~ 157 (255) T PRK13935 80 DVIMKKKPDLVISGINRGPNLGTDVLYSGTVSGALEGAIAGVPSIAISSAN-F-ENPDYETAARFLLNFLEEFDFSLLPP 157 (255) T ss_pred HHHCCCCCCEEEECCCCCCCCCCCEEECCHHHHHHHHHHCCCCEEEEECCC-C-CCCCHHHHHHHHHHHHHHHHHHCCCC T ss_conf 640589998899687488777713772003677898975499869996566-7-88787899999999999998617984 Q ss_pred CCEEECCCCCCCCHHCCCCEEEECCCCCCCCCCEEECCCCCCCEEEEECCCCCCCCCCCCCHHHHHCCCEEEECCCCCCC Q ss_conf 73543145678810158838913766543554278406886516997236777789885507899689889902153564 Q gi|255764489|r 157 TTLCNINFPRCSPEEVQKTVVTAQGKPCFSIDAKQISTNDNMSHYCLTFGDHLKNLCEKSDAFAIQHNMISVTPITTDLT 236 (250) Q Consensus 157 ~~vlNINiP~~~~~~ikg~k~t~~g~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~e~TD~~Al~~GyISVTPL~~DlT 236 (250) .++||||||.++.+++||+|+|+||+|.|...+.++.+++++.|||+.......+..++||++|+++||||||||++||| T Consensus 158 ~~~lNVN~P~~~~~~ikG~k~t~~g~r~~~~~~~~~~dp~g~~yyw~~g~~~~~~~~~~tD~~al~~gyISITPL~~dlT 237 (255) T PRK13935 158 FTALNINVPSVPYGEIKGWKLTRQSRRRYNDYFEERVDPFGNKYYWMMGEIIEDDPDDDADYKAVREGYVSVTPISVFLT 237 (255) T ss_pred CCEEEEECCCCCCCCCCCEEEEECCCCCCCCCEEEEECCCCCEEEEECCCCCCCCCCCCCHHHHHHCCEEEEEECCCCCC T ss_conf 50686427999723588549966665336765057357999878986477677789989379999799099832565775 Q ss_pred CHHHHHHHHHHHC Q ss_conf 8899999998745 Q gi|255764489|r 237 DYNSQQYISLSLE 249 (250) Q Consensus 237 ~~~~l~~L~~~l~ 249 (250) ||+.|++|++||+ T Consensus 238 ~~~~l~~L~~~~~ 250 (255) T PRK13935 238 NEECLKKLEEVYE 250 (255) T ss_pred CHHHHHHHHHHHH T ss_conf 9899999999986 No 4 >PRK13932 stationary phase survival protein SurE; Provisional Probab=100.00 E-value=0 Score=604.74 Aligned_cols=244 Identities=31% Similarity=0.517 Sum_probs=231.2 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEE----EEEECCCHHHHHHHHH Q ss_conf 926882689868856999999998548919998179985650320224897058897110----4763272799999998 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKK----RFAVHGTPVDCVVIAL 76 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~----~~~v~GtPaDcv~~~l 76 (250) |||||||||||+||||++|+++|+++| +||||||+.||||+|||||+++||+++++... .|+|+|||||||++|| T Consensus 7 p~ILlTNDDGi~a~Gl~aL~~~l~~~~-~V~VvAP~~~~Sg~s~siTl~~plr~~~~~~~~~~~~y~v~GTPaDCV~lal 85 (258) T PRK13932 7 PHILVCNDDGIEGEGIHVLAASMKKIG-RVTVVAPAEPHSGMSHAMTLGVPLRIKEYQKNNRFFGYTVSGTPVDCIKVAL 85 (258) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCC-CEEEEEECCCCCCCCCCCCCCCCEEEEEECCCCCEEEEECCCCHHHHHHHHH T ss_conf 759987478889878999999998579-9899944899847755626999717899736896567881698799999998 Q ss_pred HHCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 75046454199971488853102333457778998898718985322200135676614788888499999998677866 Q gi|255764489|r 77 QKMSDKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQAYTYENMIPWEVSETHAPRVLRQLLKTQIPN 156 (250) Q Consensus 77 ~~l~~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~~~~~~~~~~~~a~~~~~~ii~~l~~~~~p~ 156 (250) +++++++|||||||||+|.|+|.|++|||||||||||+++|||||||||... +..+|+.+++++.++++++++.++|+ T Consensus 86 ~~l~~~kPDLVvSGIN~G~NlG~dv~YSGTVgAA~Eg~~~GipsIA~S~~~~--~~~~~~~a~~~~~~l~~~~l~~~~p~ 163 (258) T PRK13932 86 SHILPEKPDLIVSGINYGSNTATNTLYSGTVAAALEGAIQGIPSLAFSLTTY--ENADFTYAGKFARKLARKVLREGLPP 163 (258) T ss_pred HHHCCCCCCEEEECCCCCCCCCCCEECCHHHHHHHHHHHCCCCEEEEEECCC--CCCCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 6215899998996876886567131040789999999975998499970246--65565999999999999998638986 Q ss_pred CCEEECCCCCCCCHHCCCCEEEECCCCCCCCCCEEECCCCCCCEEEEECCCCCCCCCCCCCHHHHHCCCEEEECCCCCCC Q ss_conf 73543145678810158838913766543554278406886516997236777789885507899689889902153564 Q gi|255764489|r 157 TTLCNINFPRCSPEEVQKTVVTAQGKPCFSIDAKQISTNDNMSHYCLTFGDHLKNLCEKSDAFAIQHNMISVTPITTDLT 236 (250) Q Consensus 157 ~~vlNINiP~~~~~~ikg~k~t~~g~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~e~TD~~Al~~GyISVTPL~~DlT 236 (250) +.+||||||.++.+++||+|+|+||++.|...+.++.+++++.|||+.......+..++||.+|+++||||||||++||| T Consensus 164 ~~~lNVN~P~~~~~~ikGik~t~~g~~~~~~~~~~~~d~~g~~yywl~g~~~~~~~~~~tD~~al~~gyISITPL~~DlT 243 (258) T PRK13932 164 DTILSVNIPNVPESDIQGVLITRQGRSRWEEDAIERHDMYGNPYYWLNGTLQLLDDSLTQDEYAVRHNYVAVTPLSCDLT 243 (258) T ss_pred CCEEEEECCCCCHHCCCCEEEEECCCCCCCCCEEEEECCCCCEEEEECCCCCCCCCCCCCHHHHHHCCEEEEEECCCCCC T ss_conf 81788536889622078849815686545664356227999867997167667788999659999799089821463786 Q ss_pred CHHHHHHHHHH Q ss_conf 88999999987 Q gi|255764489|r 237 DYNSQQYISLS 247 (250) Q Consensus 237 ~~~~l~~L~~~ 247 (250) ||+.|++|++| T Consensus 244 ~~~~l~~L~~w 254 (258) T PRK13932 244 NHDFLSSLEQW 254 (258) T ss_pred CHHHHHHHHHC T ss_conf 97999888537 No 5 >PRK13931 stationary phase survival protein SurE; Provisional Probab=100.00 E-value=0 Score=592.28 Aligned_cols=248 Identities=35% Similarity=0.544 Sum_probs=227.2 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHC---CCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHH Q ss_conf 92688268986885699999999854---891999817998565032022489705889711047632727999999987 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSI---SDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQ 77 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~---~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~ 77 (250) |||||||||||+||||++|+++++++ +|+|+||||++||||+|||||+++|+++++++.+.|+|+|||||||++||+ T Consensus 1 MrILiTNDDGi~a~Gi~aL~~~~~~~~~~~~~V~VvAP~~~~Sg~s~ait~~~pl~~~~~~~~~y~v~GTPaDCV~lal~ 80 (261) T PRK13931 1 MRILITNDDGINAPGLEVLEQIATELAGPGGEVWTVAPAFEQSGVGHCISYTHPMMIAELGPRRFAAEGSPADCVLAALH 80 (261) T ss_pred CEEEEEECCCCCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHCCCCCCCCCEEEECCCCEEECCCCHHHHHHHHHH T ss_conf 96999817999981799999999987328985999805999744542757899856776178736547882899999986 Q ss_pred HCCC-CCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC---CCCHHHHHHHHHHHHHHHHHCC Q ss_conf 5046-45419997148885310233345777899889871898532220013567---6614788888499999998677 Q gi|255764489|r 78 KMSD-KKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQAYTYEN---MIPWEVSETHAPRVLRQLLKTQ 153 (250) Q Consensus 78 ~l~~-~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~~~~~~---~~~~~~a~~~~~~ii~~l~~~~ 153 (250) ++++ +||||||||||+|.|+|.|++|||||||||||+++|||||||||.+.... ..+|+.+++++.++++++++.+ T Consensus 81 ~l~~~~~PDLVvSGIN~G~NlG~dv~ySGTVgAA~Eg~~~gipsIAvS~~~~~~~~~~~~~~~~a~~~~~~ii~~ll~~~ 160 (261) T PRK13931 81 DVMKDAPPDLVLSGVNRGNNSAENVLYSGTVGGAMEAALQGLPAIALSQYYGPRNEGLDDPFEAARTHGARVVRKLLAAG 160 (261) T ss_pred HHCCCCCCCEEEECCCCCCCCEEEEECHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCC T ss_conf 43348998889967658876545143318889999999839995788710266544544508999999999999999601 Q ss_pred C----CCCCEEECCCCCCCCHHCCCCEEEECCCCCCC-CCCEEECCCCCCCEEEEECCCCCCCCCCCCCHHHHHCCCEEE Q ss_conf 8----66735431456788101588389137665435-542784068865169972367777898855078996898899 Q gi|255764489|r 154 I----PNTTLCNINFPRCSPEEVQKTVVTAQGKPCFS-IDAKQISTNDNMSHYCLTFGDHLKNLCEKSDAFAIQHNMISV 228 (250) Q Consensus 154 ~----p~~~vlNINiP~~~~~~ikg~k~t~~g~~~~~-~~~~~~~~~~~~~~y~~~~~~~~~~~~e~TD~~Al~~GyISV 228 (250) + |.+++||||||.++.+++||+|+|+||.|.+. ...++..+++++.|||+.......+..++||.+|+++||||| T Consensus 161 ~~~~~~~~~~lNVN~P~~~~~~~kg~k~t~~g~r~~~~~~~~~~~~p~g~~~yw~~g~~~~~~~~~~tD~~al~~gyISV 240 (261) T PRK13931 161 PWEDEDYRLFYNVNFPPVPAADVKGIRVAAQGFREGTRFGVEPHMSPSGRRFLWIKGGAQQVPTAPGTDAAVNLDGYISV 240 (261) T ss_pred CCCCCCCCEEEECCCCCCCCCCCEEEEEEECCCCCCCCCEEEEEECCCCCEEEEECCCCCCCCCCCCCHHHHHHCCEEEE T ss_conf 42378745488345798985564038997712334666315642379997678845787787899897999997990897 Q ss_pred ECCCCCCCCHHHHHHHHHHH Q ss_conf 02153564889999999874 Q gi|255764489|r 229 TPITTDLTDYNSQQYISLSL 248 (250) Q Consensus 229 TPL~~DlT~~~~l~~L~~~l 248 (250) |||++|||||+.|++|+++| T Consensus 241 TPL~~dlT~~~~l~~L~~~~ 260 (261) T PRK13931 241 TPMRADLTAHDRLAELEALI 260 (261) T ss_pred CCCCCCCCCHHHHHHHHHHC T ss_conf 31555786879999999860 No 6 >PRK13934 stationary phase survival protein SurE; Provisional Probab=100.00 E-value=0 Score=582.15 Aligned_cols=244 Identities=27% Similarity=0.398 Sum_probs=221.2 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEE---EEEEEECCCHHHHHHHHHH Q ss_conf 9268826898688569999999985489199981799856503202248970588971---1047632727999999987 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTIS---KKRFAVHGTPVDCVVIALQ 77 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~---~~~~~v~GtPaDcv~~~l~ 77 (250) |||||||||||+||||++|+++|+++| +||||||++||||+|||||+++||+++++. .+.|+|+|||||||++|++ T Consensus 1 MrILlTNDDGi~a~Gi~~L~~~l~~~g-~V~VVAP~~~~Sg~g~siTl~~Plr~~~~~~~g~~~yav~GTPaDCV~lal~ 79 (266) T PRK13934 1 MKILVTNDDGVHSPGLRLLYEFVSPLG-EVDVVAPETPKSATGLGITLHKPLRMYEVDLCGFKVYATSGTPSDTIYLATY 79 (266) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCC-CEEEEEECCCCCCCCCCCCCCCCCEEEEECCCCCCEEEECCCHHHHHHHHHH T ss_conf 969998079989888999999998479-9999954899867757703799706788526885179977988999999986 Q ss_pred HCCCCCCCEEEECCCCCCCCCHHH-HHHHHHHHHHHHHHHCCCCCCCCCCCCCCC-----CCCHHHHHHHHHHHHHHHHH Q ss_conf 504645419997148885310233-345777899889871898532220013567-----66147888884999999986 Q gi|255764489|r 78 KMSDKKPDLILSGVNVGTNTSNHV-AYSGTLAAAFEGSLQGIRSFALSQAYTYEN-----MIPWEVSETHAPRVLRQLLK 151 (250) Q Consensus 78 ~l~~~~pDlViSGiN~G~N~g~~v-~ySGTvgAA~ea~~~GipsIAiS~~~~~~~-----~~~~~~a~~~~~~ii~~l~~ 151 (250) +| .++|||||||||+|.|+|.|+ +|||||||||||+++|||||||||+....+ ...|+.++.++..+++++++ T Consensus 80 ~l-~~~PDLVvSGIN~G~NlG~dvi~ySGTV~AA~Eg~~~GiPsIA~S~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~l~ 158 (266) T PRK13934 80 GL-GRKYDLVLSGINLGDNTSLQVILSSGTLGAAFQAALLGIPAVAYSAYVDDWEELLEDGEALEIMKAVVRATAEYVLK 158 (266) T ss_pred CC-CCCCCEEEECCCCCCCCCCCCEECCHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 33-78988899677578866724015145889999998569985999843278210023068999999999999999986 Q ss_pred CCCCCCC-EEECCCCCCCCHHCCCCEEEECCCCCCCCCCEEECCCCCCCEEEEECCCCCCCCCCCCCHHHH-HCCCEEEE Q ss_conf 7786673-543145678810158838913766543554278406886516997236777789885507899-68988990 Q gi|255764489|r 152 TQIPNTT-LCNINFPRCSPEEVQKTVVTAQGKPCFSIDAKQISTNDNMSHYCLTFGDHLKNLCEKSDAFAI-QHNMISVT 229 (250) Q Consensus 152 ~~~p~~~-vlNINiP~~~~~~ikg~k~t~~g~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~e~TD~~Al-~~GyISVT 229 (250) .++|++. +||||||.++.+++| .++|+||++.|...++++.+++|+.|||+.+ .. .+.+++||.+|+ ++|||||| T Consensus 159 ~~~P~~~~lLNVN~P~~~~~~ik-~~vtr~g~~~~~~~~~~r~dprG~~yyWl~G-~~-~~~~~gTD~~ail~~gyISVT 235 (266) T PRK13934 159 RGMPKGVDVISVNFPRRLRRGVK-AKLVKAAKLRFAQQVERRVDPRGRAYYWLYG-TP-LEPEPGTDVYVVLKEGNIAIT 235 (266) T ss_pred CCCCCCCCEEECCCCCCCHHCCE-EEEEEECCCCCCCCEEEEECCCCCEEECCCC-CC-CCCCCCCCHHHHHHCCEEEEE T ss_conf 48998774453368988532170-8999705744567614758879984675079-82-789999679999868969972 Q ss_pred CCCCCCCCHH-----HHHHHHHHHC Q ss_conf 2153564889-----9999998745 Q gi|255764489|r 230 PITTDLTDYN-----SQQYISLSLE 249 (250) Q Consensus 230 PL~~DlT~~~-----~l~~L~~~l~ 249 (250) ||++|||+|+ .|++|+.|++ T Consensus 236 PL~~DlT~~d~~r~~~~e~~~~~~~ 260 (266) T PRK13934 236 PLTLNLNALDGERAADLEALKRLVD 260 (266) T ss_pred ECCEECCCCCCHHHCCHHHHHHHHH T ss_conf 3520077875022302999999999 No 7 >COG0496 SurE Predicted acid phosphatase [General function prediction only] Probab=100.00 E-value=0 Score=577.95 Aligned_cols=248 Identities=34% Similarity=0.553 Sum_probs=235.9 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCC Q ss_conf 92688268986885699999999854891999817998565032022489705889711047632727999999987504 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMS 80 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~ 80 (250) |||||||||||+||||++|+++|+ .++||+||||+.||||+|||+|+++|+++++++.+.|+|+|||||||++|++.++ T Consensus 1 mrILlTNDDGi~a~Gi~aL~~al~-~~~dV~VVAP~~~qSg~s~slTl~~Plr~~~~~~~~~av~GTPaDCV~lal~~l~ 79 (252) T COG0496 1 MRILLTNDDGIHAPGIRALARALR-EGADVTVVAPDREQSGASHSLTLHEPLRVRQVDNGAYAVNGTPADCVILGLNELL 79 (252) T ss_pred CEEEEECCCCCCCHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCEEEEECCCEEEECCCHHHHHHHHHHHHC T ss_conf 949996688657777999999986-2797999826987745434400356723247236528935881899999999864 Q ss_pred CCC-CCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC---CCHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 645-4199971488853102333457778998898718985322200135676---614788888499999998677866 Q gi|255764489|r 81 DKK-PDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQAYTYENM---IPWEVSETHAPRVLRQLLKTQIPN 156 (250) Q Consensus 81 ~~~-pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~~~~~~~---~~~~~a~~~~~~ii~~l~~~~~p~ 156 (250) +++ |||||||||.|.|+|.|++|||||||||||+++||||||+||.+..... .+|+.+++++..++++++++.+|. T Consensus 80 ~~~~pDLVvSGIN~G~Nlg~dv~ySGTVaaA~Ea~~~GipsIA~S~~~~~~~~~~~~~~e~A~~~~~~lv~~l~~~p~~~ 159 (252) T COG0496 80 KEPRPDLVVSGINAGANLGDDVIYSGTVAAAMEAALLGIPAIAISLAYREAFGKQDVDFETAAKVARALVEALLANPLPP 159 (252) T ss_pred CCCCCCEEEECCCCCCCCCCCEEEEEHHHHHHHHHHCCCCCEEEEEHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 67899989967647886551134201499999998729642365410001355544367999999999999998388998 Q ss_pred CCEEECCCCCCCCHHCCCCEEEECCCCCCCCCCEEECCCCCCCEEEEECCCCCCCCCCCCCHHHHHCCCEEEECCCCCCC Q ss_conf 73543145678810158838913766543554278406886516997236777789885507899689889902153564 Q gi|255764489|r 157 TTLCNINFPRCSPEEVQKTVVTAQGKPCFSIDAKQISTNDNMSHYCLTFGDHLKNLCEKSDAFAIQHNMISVTPITTDLT 236 (250) Q Consensus 157 ~~vlNINiP~~~~~~ikg~k~t~~g~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~e~TD~~Al~~GyISVTPL~~DlT 236 (250) ..+||||||+++..+++|+++||+|++.|...+.++.+++|..|||+.......+..|+||++|+++||||||||++|+| T Consensus 160 ~~llNVNiP~~~~~~~~gi~vtr~g~~~~~~~~~~r~dprG~~yyW~~~~~~~~~~~~gtD~~a~~~g~IsITPl~~dlt 239 (252) T COG0496 160 DTLLNVNIPNLPLEEIKGIRVTRLGRRRYAEPVEERTDPRGEPYYWIGPGGLAEDAEEGTDFHAVREGYISITPLQLDLT 239 (252) T ss_pred CCEEEEECCCCCCCCCCCEEEEECHHHHCCCCCCEEECCCCCEEEEECCCCCCCCCCCCCHHHHHHCCCEECCCCCCCCH T ss_conf 82899867999722368679984236433676323368999787776687766668999509998759866644343761 Q ss_pred CHHHHHHHHHHHC Q ss_conf 8899999998745 Q gi|255764489|r 237 DYNSQQYISLSLE 249 (250) Q Consensus 237 ~~~~l~~L~~~l~ 249 (250) +|+.++.|++|++ T Consensus 240 ~~~~~~~l~~~l~ 252 (252) T COG0496 240 AYEALESLKSWLK 252 (252) T ss_pred HHHHHHHHHHHHC T ss_conf 7888889998539 No 8 >TIGR00087 surE 5'/3'-nucleotidase SurE; InterPro: IPR002828 This entry represents a SurE-like structural domain with a 3-layer alpha/bete/alpha topology that bears some topological similarity to the N-terminal domain of the glutaminase/asparaginase family. This domain is found in the stationary phase survival protein SurE, a metal ion-dependent phosphatase found in eubacteria, archaea and eukaryotes. In Escherichia coli, SurE also has activity as a nucleotidase and exopolyphosphatase, and may be involved in the stress response . E. coli cells with mutations in the surE gene survive poorly in stationary phase . The structure of SurE homologues have been determined from Thermotoga maritima and the archaea Pyrobaculum aerophilum . The T. maritima SurE homologue has phosphatase activity that is inhibited by vanadate or tungstate, both of which bind adjacent to the divalent metal ion. This domain is found in acid phosphatases (3.1.3.2 from EC), 5'-nucleotidases (3.1.3.5 from EC), 3'-nucleotidases (3.1.3.6 from EC) and exopolyphosphatases (3.6.1.11 from EC).; GO: 0016787 hydrolase activity. Probab=100.00 E-value=0 Score=524.59 Aligned_cols=236 Identities=31% Similarity=0.525 Sum_probs=219.2 Q ss_pred CEEEEECCCC----CCCHHHHHHHHHHHHCC--CCEEEECCCCCCCCEE--------------------EEECCCCCEEE Q ss_conf 9268826898----68856999999998548--9199981799856503--------------------20224897058 Q gi|255764489|r 1 MRILLTNDDG----IKSKGLITLENIARSIS--DDIWICAPEMDQSCLA--------------------NSLTMSRNIAC 54 (250) Q Consensus 1 m~ILitNDDG----~~a~gl~~l~~~l~~~~--~~v~vvAP~~~~S~~~--------------------~ait~~~~~~~ 54 (250) |||||||||| ++||||++|+++|++.| |||+||||+.||||+| |++|+.+|+|+ T Consensus 1 m~iL~TNDDGPLsD~~S~gI~Al~~aL~~~g~~~ev~vVAP~~q~Sg~Gk~hLaGknLtaqfiY~~V~~~~~Tl~~p~r~ 80 (326) T TIGR00087 1 MKILLTNDDGPLSDIFSPGIRALYQALKELGPEYEVTVVAPAVQRSGVGKAHLAGKNLTAQFIYSKVLAHSLTLFEPIRV 80 (326) T ss_pred CEEEEEECCCCCHHHCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCHHHCCCCCEEEEEEEEECCCCCCHHHCCEE T ss_conf 94789815873012214226999999998479861699825877662322000132100122433423110252011122 Q ss_pred EEE----E--------------EEEEEECCCHHHHHHHHHHHCCCCC-----CCEEEECCCCCCCCC-HHHHHHHHHHHH Q ss_conf 897----1--------------1047632727999999987504645-----419997148885310-233345777899 Q gi|255764489|r 55 RTI----S--------------KKRFAVHGTPVDCVVIALQKMSDKK-----PDLILSGVNVGTNTS-NHVAYSGTLAAA 110 (250) Q Consensus 55 ~~~----~--------------~~~~~v~GtPaDcv~~~l~~l~~~~-----pDlViSGiN~G~N~g-~~v~ySGTvgAA 110 (250) .++ + .+.|+|||||||||++|+++++++. |||||||||.|.||| .+++|||||||| T Consensus 81 ~~~~~~~~~~~~vLnaei~~~~~~~~~~dGTP~DcV~lG~~~~~~~~~nnitpDLV~SGiN~G~NlG~~~~~~SGTvgAA 160 (326) T TIGR00087 81 EQVKVENGKLGYVLNAEIDKDDIHIYAVDGTPTDCVILGIHELMPEVHNNITPDLVISGINLGENLGAEDVTYSGTVGAA 160 (326) T ss_pred EEEECCCCCCCEEEECCCCCCCCEEEEEECCHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCCHHHHHHCCHHHHH T ss_conf 14313688887033143887743589971886899999999870431234453258806668886773333102538998 Q ss_pred HHHHHHC-CCCCCCCCCCCC--------CCCCCHHHHHHHHHHHHHHHHHCCCC------CCCEEECCCCCCCCH----- Q ss_conf 8898718-985322200135--------67661478888849999999867786------673543145678810----- Q gi|255764489|r 111 FEGSLQG-IRSFALSQAYTY--------ENMIPWEVSETHAPRVLRQLLKTQIP------NTTLCNINFPRCSPE----- 170 (250) Q Consensus 111 ~ea~~~G-ipsIAiS~~~~~--------~~~~~~~~a~~~~~~ii~~l~~~~~p------~~~vlNINiP~~~~~----- 170 (250) |||+++| +||||+|+.+.. ....+|+.|++++..|+++|+++++| ...+||||+|.++.. T Consensus 161 ~E~~~~Gn~paIA~S~~~~~~~~~~~~~~~~~~f~~Aak~~~~~~~~l~~~~~p~~svLP~~~~lnvN~P~~~~~sDDtk 240 (326) T TIGR00087 161 MEAAIHGNVPAIAISLQIFNTDVKFKENSSPLDFEIAAKVTNAIVKNLLKNGLPEKSVLPGGDLLNVNVPLVPSESDDTK 240 (326) T ss_pred HHHHHHCCCCCEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCCCCCCE T ss_conf 88665178640464100023200003677643766899999999999850688545427886265766787756465614 Q ss_pred -------HCCCCEEEECCCCCCCCCCEEECCC---CCCCEEEEECCC----C-----CCCCCCCCCHHHHHCCC-EEEEC Q ss_conf -------1588389137665435542784068---865169972367----7-----77898855078996898-89902 Q gi|255764489|r 171 -------EVQKTVVTAQGKPCFSIDAKQISTN---DNMSHYCLTFGD----H-----LKNLCEKSDAFAIQHNM-ISVTP 230 (250) Q Consensus 171 -------~ikg~k~t~~g~~~~~~~~~~~~~~---~~~~~y~~~~~~----~-----~~~~~e~TD~~Al~~Gy-ISVTP 230 (250) +.+++++||++++.+...+.++.|+ +++.|||+..-. + ..+..++||+.|++++| ||||| T Consensus 241 vYYaPiW~~~~~~~tr~~~~~~~~~v~~r~Dp~~~rg~~yyW~~~~~~~~~p~~~~~~~~~~~~tD~~~~~~~~h~SITp 320 (326) T TIGR00087 241 VYYAPIWENTGIRITRLGRRMYETSVEERTDPLVFRGRSYYWIGGDSKKDAPKEYLEREDREEGTDVDAIRSGYHISITP 320 (326) T ss_pred EEEEECCCCCCCEECCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCHHHHHHCCCEECCCC T ss_conf 78730216897144168765653241001177300586333257523233776888742078753145331570026883 Q ss_pred CCCCCC Q ss_conf 153564 Q gi|255764489|r 231 ITTDLT 236 (250) Q Consensus 231 L~~DlT 236 (250) |++|+| T Consensus 321 l~~d~t 326 (326) T TIGR00087 321 LKVDLT 326 (326) T ss_pred CCCCCC T ss_conf 423679 No 9 >LOAD_surE consensus Probab=100.00 E-value=0 Score=470.98 Aligned_cols=188 Identities=36% Similarity=0.602 Sum_probs=178.3 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEE---EEEEEECCCHHHHHHHHHH Q ss_conf 9268826898688569999999985489199981799856503202248970588971---1047632727999999987 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTIS---KKRFAVHGTPVDCVVIALQ 77 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~---~~~~~v~GtPaDcv~~~l~ 77 (250) |||||||||||+||||++|+++|++.||+|+||||++||||+|||||+++|+++++++ .+.|+|+|||||||++||+ T Consensus 1 M~ILlTNDDGi~a~Gl~aL~~~l~~~g~~V~vvAP~~~~Sg~g~sit~~~pl~~~~~~~~~~~~y~v~GTPaDCV~~al~ 80 (192) T LOAD_surE 1 MRILVTNDDGIDSPGIRALAEALKEEGAEVTVVAPDRERSGVSHSLTLSEPLRVKKVELKGIGAYAVDGTPADCVHLGLS 80 (192) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCEEEEECCCCCEEEECCCHHHHHHHHHH T ss_conf 96999837988888899999999878996999877998867731345899840499704897589945988999999972 Q ss_pred HCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 50464541999714888531023334577789988987189853222001356766147888884999999986778667 Q gi|255764489|r 78 KMSDKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQAYTYENMIPWEVSETHAPRVLRQLLKTQIPNT 157 (250) Q Consensus 78 ~l~~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~~~~~~~~~~~~a~~~~~~ii~~l~~~~~p~~ 157 (250) ++++++|||||||||+|.|+|.|++||||||||+||+++||||||+|+.+......+|+.+++++..++++++++++|++ T Consensus 81 ~~~~~~pDLVvSGIN~G~N~G~dv~ySGTVgAA~ea~~~gipsiA~S~~~~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~ 160 (192) T LOAD_surE 81 ELLFEKPDLVVSGINEGANLGDDITYSGTVAAAMEAVLLGIPAIAISLAYRHEETLDFETAAKIAAALVETLLANGLPDP 160 (192) T ss_pred CCCCCCCCEEEECCCCCCCCCEEEEEHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 22579998899665488767355875788778999997499858998551788763689999999999999996599989 Q ss_pred CEEECCCCCCCCHHCCCCEEEECCCCCCCCCC Q ss_conf 35431456788101588389137665435542 Q gi|255764489|r 158 TLCNINFPRCSPEEVQKTVVTAQGKPCFSIDA 189 (250) Q Consensus 158 ~vlNINiP~~~~~~ikg~k~t~~g~~~~~~~~ 189 (250) ++||||||.++..+ +++|+|+||+|.|.+.+ T Consensus 161 ~~lNvN~P~~~~~~-~~ik~t~~g~r~~~~~~ 191 (192) T LOAD_surE 161 TLLNVNIPAVPAGD-PLIKVTRLGRRRYRDSV 191 (192) T ss_pred CEEEEEECCCCCCC-CCEEEEECCCCCCCCCC T ss_conf 79999708898436-98899979986007862 No 10 >pfam01975 SurE Survival protein SurE. E. coli cells with the surE gene disrupted are found to survive poorly in stationary phase. It is suggested that SurE may be involved in stress response. Yeast also contains a member of the family. A sequence from Yarrowia lipolytica can complement a mutation in acid phosphatase, suggesting that members of this family could be phosphatases. Probab=100.00 E-value=0 Score=469.16 Aligned_cols=185 Identities=38% Similarity=0.637 Sum_probs=175.1 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEE---EEEEECCCHHHHHHHHHHH Q ss_conf 2688268986885699999999854891999817998565032022489705889711---0476327279999999875 Q gi|255764489|r 2 RILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISK---KRFAVHGTPVDCVVIALQK 78 (250) Q Consensus 2 ~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~---~~~~v~GtPaDcv~~~l~~ 78 (250) ||||||||||+||||++|+++|++. ++|+||||++||||+|||||+++|++++++.. ++|+|+|||||||++||++ T Consensus 1 rILlTNDDGi~s~Gi~~L~~~l~~~-~~V~VvAP~~~~Sg~g~ait~~~pl~~~~~~~~~~~~~~v~GTPaDcV~~gl~~ 79 (190) T pfam01975 1 RILLTNDDGIHAPGIRALAEALKAL-GEVTVVAPDREQSGVGHSITLHRPLRVKKVDNDGAGAYAVNGTPADCVKLGLNG 79 (190) T ss_pred CEEEECCCCCCCHHHHHHHHHHHHC-CCEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCEEEEECCCHHHHHHHHHHH T ss_conf 9889748988878899999999857-968999438988688433137997278996489950699648779999998751 Q ss_pred CCC-CCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 046-4541999714888531023334577789988987189853222001356766147888884999999986778667 Q gi|255764489|r 79 MSD-KKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQAYTYENMIPWEVSETHAPRVLRQLLKTQIPNT 157 (250) Q Consensus 79 l~~-~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~~~~~~~~~~~~a~~~~~~ii~~l~~~~~p~~ 157 (250) +++ ++|||||||||+|.|+|.|++|||||||||||+++||||||+||++......+|+.+++++.++++++++.++|.+ T Consensus 80 ~~~~~~pDlVvSGIN~G~N~G~dv~ySGTVgAA~Ea~~~GipsIA~S~~~~~~~~~~~~~a~~~~~~li~~l~~~~~p~~ 159 (190) T pfam01975 80 LLDDKKPDLVVSGINHGANLGTDVLYSGTVGAAMEAALLGIPSIAVSLAGNSDEKDDFETAAKLVRRLVEKLLKNGLPPG 159 (190) T ss_pred HCCCCCCCEEEECCCCCCCCCEEEECHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 16789987899565577657455752577788999997599869999740888776789999999999999973689888 Q ss_pred CEEECCCCCCCCHHCCCCEEEECCCCCCCC Q ss_conf 354314567881015883891376654355 Q gi|255764489|r 158 TLCNINFPRCSPEEVQKTVVTAQGKPCFSI 187 (250) Q Consensus 158 ~vlNINiP~~~~~~ikg~k~t~~g~~~~~~ 187 (250) ++||||||.++.+++||+|+|+||+|.|.. T Consensus 160 ~~lNVN~P~~~~~~~~G~k~t~~g~r~~~~ 189 (190) T pfam01975 160 TLLNVNIPALPPSEIKGIKVTRLGRRRYAE 189 (190) T ss_pred CEEEEECCCCCCHHCCCEEEEECCCCEECC T ss_conf 689976498982324883994788713566 No 11 >cd03784 GT1_Gtf_like This family includes the Gtfs, a group of homologous glycosyltransferases involved in the final stages of the biosynthesis of antibiotics vancomycin and related chloroeremomycin. Gtfs transfer sugar moieties from an activated NDP-sugar donor to the oxidatively cross-linked heptapeptide core of vancomycin group antibiotics. The core structure is important for the bioactivity of the antibiotics. Probab=96.19 E-value=0.028 Score=34.58 Aligned_cols=112 Identities=19% Similarity=0.198 Sum_probs=55.2 Q ss_pred CEEEEECCC--CCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEE-EEEC---CCCCEE--EEEEEEEEEEECCCHHH-- Q ss_conf 926882689--8688569999999985489199981799856503-2022---489705--88971104763272799-- Q gi|255764489|r 1 MRILLTNDD--GIKSKGLITLENIARSISDDIWICAPEMDQSCLA-NSLT---MSRNIA--CRTISKKRFAVHGTPVD-- 70 (250) Q Consensus 1 m~ILitNDD--G~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~-~ait---~~~~~~--~~~~~~~~~~v~GtPaD-- 70 (250) ||||+.--= |--.| +..|+++|.+.||+|+++.+...+.... +.++ +..... ........+.....+.. T Consensus 1 Mril~~~~~~~GH~~P-~l~lA~~L~~rGh~Vt~~~~~~~~~~i~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (401) T cd03784 1 MRVLITTIGSRGDVQP-LVALAWALRAAGHEVRVATPPEFADLVEAAGLEFVPVGGDPDELLASPERNAGLLLLGPGLLL 79 (401) T ss_pred CEEEEECCCCHHHHHH-HHHHHHHHHHCCCEEEEEECHHHHHHHHHCCCEEEECCCCHHHHHHCCCCCCCCCCCCHHHHH T ss_conf 9799987985758999-999999999889959999387888899977986887698777764211123333454055799 Q ss_pred ------------HHHHHHHHCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf ------------99999875046454199971488853102333457778998898718985322200 Q gi|255764489|r 71 ------------CVVIALQKMSDKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQA 126 (250) Q Consensus 71 ------------cv~~~l~~l~~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~ 126 (250) .+.-.+..+...+||+||++.. .+ ++..-|..+|||.+.++.. T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~pD~vi~d~~---------~~----~~~~~A~~~giP~v~~~~~ 134 (401) T cd03784 80 GALRLLRREAEAMLDDLVAAARDWGPDLVVADPL---------AF----AGAVAAEALGIPAVRLLLG 134 (401) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCC---------HH----HHHHHHHHHCCCEEEEECC T ss_conf 9999999999999999999961679988998970---------78----9999999929998999566 No 12 >pfam04007 DUF354 Protein of unknown function (DUF354). Members of this family are around 350 amino acids in length. They are found in archaebacteria and have no known function. Probab=95.91 E-value=0.027 Score=34.68 Aligned_cols=104 Identities=13% Similarity=0.160 Sum_probs=65.0 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHH----- Q ss_conf 926882689868856999999998548919998179985650320224897058897110476327279999999----- Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIA----- 75 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~----- 75 (250) |||++.=..=-+..-.+.+.+.|++.||+|.|.|=+.++.- .+--...+....++.. .+|...=..-. T Consensus 1 MkIwiDI~~p~hvhfFk~iI~eL~k~GheV~iTaR~~~~~~---~LL~~y~i~~~~iG~~----g~s~~~Kl~~~~~R~~ 73 (335) T pfam04007 1 LKVWIDITNAPHVHFFKPIISELEKEGYEVLLTCRKFGELP---ELLRSLGFQVKSIGKH----GATLIKKLLSSAERVY 73 (335) T ss_pred CEEEEECCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCHHH---HHHHHCCCCEEEECCC----CCCHHHHHHHHHHHHH T ss_conf 93999789950888899999999868988999996135199---9999769976997588----8888999999999999 Q ss_pred -HHH-CCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf -875-04645419997148885310233345777899889871898532220 Q gi|255764489|r 76 -LQK-MSDKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQ 125 (250) Q Consensus 76 -l~~-l~~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~ 125 (250) |.. +.+.+||+.|| .|-+.||+-|...|+|+|++.- T Consensus 74 ~L~~~~~~~~PDv~is--------------~~S~~a~~va~~LgipsI~f~D 111 (335) T pfam04007 74 LLTKLIPEKKPDVAIM--------------KNSMELPRVAFGLRIPSIIVLD 111 (335) T ss_pred HHHHHHHHHCCCEEEE--------------CCCHHHHHHHHHCCCCEEEEEC T ss_conf 9999988629978994--------------4880199999882998799947 No 13 >PRK10307 predicted glycosyl transferase; Provisional Probab=95.78 E-value=0.11 Score=30.80 Aligned_cols=34 Identities=12% Similarity=0.102 Sum_probs=26.1 Q ss_pred CEEEEECCCCC-C----CHHHHHHHHHHHHCCCCEEEEC Q ss_conf 92688268986-8----8569999999985489199981 Q gi|255764489|r 1 MRILLTNDDGI-K----SKGLITLENIARSISDDIWICA 34 (250) Q Consensus 1 m~ILitNDDG~-~----a~gl~~l~~~l~~~~~~v~vvA 34 (250) ||||+..+.=+ + +.-...|++.|.+.||+|.|++ T Consensus 1 MrIl~vs~~y~P~~~G~~~~~~~La~~L~~~GheV~Vit 39 (415) T PRK10307 1 MKILVYGINYAPELTGIGKYTGEMAEWLAARGHEVRVIT 39 (415) T ss_pred CEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 989998584899788799999999999997899899997 No 14 >cd03802 GT1_AviGT4_like This family is most closely related to the GT1 family of glycosyltransferases. aviGT4 in Streptomyces viridochromogenes has been shown to be involved in biosynthesis of oligosaccharide antibiotic avilamycin A. Inactivation of aviGT4 resulted in a mutant that accumulated a novel avilamycin derivative lacking the terminal eurekanate residue. Probab=94.26 E-value=0.28 Score=28.37 Aligned_cols=37 Identities=19% Similarity=0.197 Sum_probs=27.2 Q ss_pred CEEEEECCC----------CCCCHHHHHHHHHHHHCCCCEEEECCCCC Q ss_conf 926882689----------86885699999999854891999817998 Q gi|255764489|r 1 MRILLTNDD----------GIKSKGLITLENIARSISDDIWICAPEMD 38 (250) Q Consensus 1 m~ILitNDD----------G~~a~gl~~l~~~l~~~~~~v~vvAP~~~ 38 (250) ||||+.-+- |+ .--+..|++.|.+.||+|+|++|... T Consensus 1 MkI~~v~~~~~p~pP~~~GG~-e~~~~~La~~L~~~Gh~V~v~~~~~~ 47 (335) T cd03802 1 MRIALVAPPREPVPPPAYGGT-ERVVAALTEGLVARGHEVTLFASGDS 47 (335) T ss_pred CEEEEECCCCCCCCCCCCCCH-HHHHHHHHHHHHHCCCEEEEEECCCC T ss_conf 986998884003699998979-99999999999976998999962898 No 15 >PRK09191 two-component response regulator; Provisional Probab=91.60 E-value=0.49 Score=26.94 Aligned_cols=45 Identities=18% Similarity=0.214 Sum_probs=31.1 Q ss_pred HHHHCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 987504645419997148885310233345777899889871898532220 Q gi|255764489|r 75 ALQKMSDKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQ 125 (250) Q Consensus 75 ~l~~l~~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~ 125 (250) |+......+||||+.-||.+.+. ||--.|+.--.-++||-|=+.- T Consensus 174 Al~la~~~~PDlvL~DI~L~d~~------sGIdaa~~I~~~~~iPvIFiTA 218 (261) T PRK09191 174 AVALAKKTRPGLILADIQLADGS------SGIDAVNDILKTFDVPVIFITA 218 (261) T ss_pred HHHHHHCCCCCEEEEEEECCCCC------CHHHHHHHHHHHCCCCEEEECC T ss_conf 99997425989899953479998------7999999999828999899777 No 16 >cd04962 GT1_like_5 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=89.94 E-value=1.7 Score=23.64 Aligned_cols=82 Identities=13% Similarity=0.120 Sum_probs=42.4 Q ss_pred CEEEEECCCCCCC-HH----HHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHH-- Q ss_conf 9268826898688-56----9999999985489199981799856503202248970588971104763272799999-- Q gi|255764489|r 1 MRILLTNDDGIKS-KG----LITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVV-- 73 (250) Q Consensus 1 m~ILitNDDG~~a-~g----l~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~-- 73 (250) |||++.= +-. -| ...|++.|.+.||+|+|++..... ..-.....+++..+....|.+-..|-=+.. T Consensus 1 MkI~i~~---~P~~GG~e~~v~~La~~L~~~GHeV~vit~~~~~----~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~ 73 (371) T cd04962 1 MKIGIVC---YPTYGGSGVVATELGKALARRGHEVHFITSSRPF----RLDEYSPNIFFHEVEVPQYPLFQYPPYDLALA 73 (371) T ss_pred CEEEEEC---CCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCC----CCCCCCCCEEEEEECCCCCCCCCCCCCHHHHH T ss_conf 9799989---9999869999999999999759999999568987----65556897379984687765344672137899 Q ss_pred HHHHH-CCCCCCCEEEE Q ss_conf 99875-04645419997 Q gi|255764489|r 74 IALQK-MSDKKPDLILS 89 (250) Q Consensus 74 ~~l~~-l~~~~pDlViS 89 (250) --+.. +...+||+|-. T Consensus 74 ~~l~~~~~~~~~DvvH~ 90 (371) T cd04962 74 SKIAEVAKRYKLDLLHV 90 (371) T ss_pred HHHHHHHHHCCCCEEEE T ss_conf 99999999739988997 No 17 >cd03817 GT1_UGDG_like This family is most closely related to the GT1 family of glycosyltransferases. UDP-glucose-diacylglycerol glucosyltransferase (UGDG; also known as 1,2-diacylglycerol 3-glucosyltransferase) catalyzes the transfer of glucose from UDP-glucose to 1,2-diacylglycerol forming 3-D-glucosyl-1,2-diacylglycerol. Probab=88.85 E-value=2 Score=23.14 Aligned_cols=108 Identities=12% Similarity=0.137 Sum_probs=51.2 Q ss_pred EEEEECCCCC-C----CHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHH Q ss_conf 2688268986-8----8569999999985489199981799856503202248970588971104763272799999998 Q gi|255764489|r 2 RILLTNDDGI-K----SKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIAL 76 (250) Q Consensus 2 ~ILitNDDG~-~----a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l 76 (250) ||++--|.-+ + +.-+..|++.|.+.||+|+|++|.......- .....+.........+.-...+--...... T Consensus 1 kIaivt~~y~P~~GG~~~~~~~La~~L~~~GheV~Vit~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (374) T cd03817 1 KIGIFTDTYLPQVNGVATSIRRLAEELEKRGHEVYVVAPSYPGAPEE---EEVVVVRPFRVPTFKYPDFRLPLPIPRALI 77 (374) T ss_pred CEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCC---CCCCEEEEECCCCCCCCCHHHHHHHHHHHH T ss_conf 98999589899998099999999999997799899997279887754---357628984367776521345555799999 Q ss_pred HHCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 7504645419997148885310233345777899889871898532 Q gi|255764489|r 77 QKMSDKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFA 122 (250) Q Consensus 77 ~~l~~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIA 122 (250) ..+...+||+|.+- . .++=...|..-+...++|-|. T Consensus 78 ~~~~~~~~DvIh~~---~-------~~~~~~~a~~~~~~~~ip~V~ 113 (374) T cd03817 78 IILKELGPDIVHTH---T-------PFSLGLLGLRVARKLGIPVVA 113 (374) T ss_pred HHHHHCCCCEEEEC---C-------HHHHHHHHHHHHHHCCCCEEE T ss_conf 99986699999987---8-------258899999999974995999 No 18 >cd03808 GT1_cap1E_like This family is most closely related to the GT1 family of glycosyltransferases. cap1E in Streptococcus pneumoniae is required for the synthesis of type 1 capsular polysaccharides. Probab=87.58 E-value=2.2 Score=22.96 Aligned_cols=79 Identities=16% Similarity=0.265 Sum_probs=40.7 Q ss_pred EECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHH--HHHHHH-CCC Q ss_conf 82689868856999999998548919998179985650320224897058897110476327279999--999875-046 Q gi|255764489|r 5 LTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCV--VIALQK-MSD 81 (250) Q Consensus 5 itNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv--~~~l~~-l~~ 81 (250) ++|=.|=-.--+..|+++|.+.||+|+|++|.......-. ...+++..+....... .|-.-. ...+.. +.. T Consensus 5 i~~~~GG~e~~~~~La~~L~~~Gh~V~vit~~~~~~~~~~----~~gv~~~~~~~~~~~~--~~~~~~~~~~~l~~~i~~ 78 (359) T cd03808 5 IVTVDGGLYSFRLPLIKALRAAGYEVHVVAPPGDELEELE----ALGVKVIPIPLDRRGI--NPFKDLKALLRLYRLLRK 78 (359) T ss_pred EECCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCHHH----HCCCEEEEECCCCCCC--CHHHHHHHHHHHHHHHHH T ss_conf 9758765999999999999976999999970798743367----5798899927877778--869999999999999998 Q ss_pred CCCCEEEE Q ss_conf 45419997 Q gi|255764489|r 82 KKPDLILS 89 (250) Q Consensus 82 ~~pDlViS 89 (250) .+||+|.+ T Consensus 79 ~~pDvIh~ 86 (359) T cd03808 79 ERPDIVHT 86 (359) T ss_pred HCCCEEEE T ss_conf 49989999 No 19 >PRK13435 response regulator; Provisional Probab=87.48 E-value=1.9 Score=23.26 Aligned_cols=80 Identities=18% Similarity=0.240 Sum_probs=49.2 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCC Q ss_conf 92688268986885699999999854891999817998565032022489705889711047632727999999987504 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMS 80 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~ 80 (250) |||||.-||-.-+. .|.+.|++.||+|+-+|-. ..-|+..+. T Consensus 2 mRILIVEDe~~i~~---~l~~~L~~~G~~vv~~A~~-----------------------------------~~eAl~~~~ 43 (141) T PRK13435 2 LRVLIVEDEALIAL---ELEKLLEEAGHQVVGIAST-----------------------------------SEQALALGR 43 (141) T ss_pred CEEEEECCCHHHHH---HHHHHHHHCCCEEEEEECC-----------------------------------HHHHHHHHH T ss_conf 78999899899999---9999999879979997599-----------------------------------999999976 Q ss_pred CCCCCEEEECCCCCCCC-CHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 64541999714888531-0233345777899889871898532220 Q gi|255764489|r 81 DKKPDLILSGVNVGTNT-SNHVAYSGTLAAAFEGSLQGIRSFALSQ 125 (250) Q Consensus 81 ~~~pDlViSGiN~G~N~-g~~v~ySGTvgAA~ea~~~GipsIAiS~ 125 (250) ..+||+|+.=||....+ |.++ +..-....++|-|-+|- T Consensus 44 ~~~PDlvllDi~LpdG~~G~e~-------~r~l~~~~~ipvI~lTa 82 (141) T PRK13435 44 RAQPDVALVDIRLADGPTGVEV-------ARRLSIDGGVEVIFMTA 82 (141) T ss_pred CCCCCEEEECCCCCCCCCHHHH-------HHHHHHCCCCCEEEEEC T ss_conf 5999899978878999989999-------99998759983899977 No 20 >cd03795 GT1_like_4 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP-linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=86.95 E-value=2.7 Score=22.42 Aligned_cols=84 Identities=14% Similarity=0.065 Sum_probs=42.4 Q ss_pred EEEEECCC-CCCCHH----HHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHH Q ss_conf 26882689-868856----9999999985489199981799856503202248970588971104763272799999998 Q gi|255764489|r 2 RILLTNDD-GIKSKG----LITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIAL 76 (250) Q Consensus 2 ~ILitNDD-G~~a~g----l~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l 76 (250) |||+.++. ..+.-| +..|++.|.+.||+|+|+.+........... ...++.+... ......+|.-...... T Consensus 1 kIL~i~~~f~P~~GG~e~~~~~L~~~L~~~Gh~V~v~~~~~~~~~~~~~~---~~~~~~~~~~-~~~~~~~~~~~~~~~~ 76 (357) T cd03795 1 RVLHVGKFYPPDRGGIEQVIRDLAEGLAARGIEVAVLCASPEPKGRDEER---NGHRVIRAPS-LLNVASTPFSPSFFKQ 76 (357) T ss_pred CEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCC---CCCEEEECCC-CCCCCCCHHHHHHHHH T ss_conf 99999382899898299999999999997799899998279887765025---8847998774-3333442046999999 Q ss_pred HHCCCCCCCEEEE Q ss_conf 7504645419997 Q gi|255764489|r 77 QKMSDKKPDLILS 89 (250) Q Consensus 77 ~~l~~~~pDlViS 89 (250) .....++||+|.. T Consensus 77 ~~~~~~~~Diih~ 89 (357) T cd03795 77 LKKLAKKADVIHL 89 (357) T ss_pred HHHHCCCCCEEEE T ss_conf 9997259999999 No 21 >cd03814 GT1_like_2 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=86.83 E-value=2.7 Score=22.39 Aligned_cols=105 Identities=16% Similarity=0.135 Sum_probs=50.5 Q ss_pred EEEEECCC-CCCC----HHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEE---EEEEEECCCHHHHHH Q ss_conf 26882689-8688----569999999985489199981799856503202248970588971---104763272799999 Q gi|255764489|r 2 RILLTNDD-GIKS----KGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTIS---KKRFAVHGTPVDCVV 73 (250) Q Consensus 2 ~ILitNDD-G~~a----~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~---~~~~~v~GtPaDcv~ 73 (250) |||+..+. ..+. .-+..|++.|.+.||+|+|++|.....-... ..-.++..+. ...+...-. +.. T Consensus 1 kIl~i~~~f~P~~GG~e~~~~~la~~L~~~Gh~V~v~t~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~---~~~ 73 (364) T cd03814 1 RIAIVTDTFLPQVNGVVRTLQRLVEHLRARGHEVLVIAPGPFRESEGP----ARVVPVPSVPLPGYPEIRLALP---PRR 73 (364) T ss_pred CEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCC----CCEEEEEEEECCCCCCCEECCC---HHH T ss_conf 989993888999884999999999999977998999978997655556----6346786674688876300203---299 Q ss_pred HHHHHCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 99875046454199971488853102333457778998898718985322 Q gi|255764489|r 74 IALQKMSDKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFAL 123 (250) Q Consensus 74 ~~l~~l~~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAi 123 (250) .-...+...+||+|..- +- ...| ..|.+-+...|+|.|.. T Consensus 74 ~~~~~~~~~~pDiIh~~-----~~----~~~~-~~a~~~~~~~~ip~i~~ 113 (364) T cd03814 74 RVRRLLDAFAPDVVHIA-----TP----GPLG-LAALRAARRLGIPVVTS 113 (364) T ss_pred HHHHHHHHCCCCEEEEC-----CC----CHHH-HHHHHHHHHCCCCEEEE T ss_conf 99999986599999987-----84----1678-99999999759978999 No 22 >cd03799 GT1_amsK_like This is a family of GT1 glycosyltransferases found specifically in certain bacteria. amsK in Erwinia amylovora, has been reported to be involved in the biosynthesis of amylovoran, a exopolysaccharide acting as a virulence factor. Probab=86.20 E-value=1.7 Score=23.61 Aligned_cols=94 Identities=10% Similarity=-0.014 Sum_probs=45.6 Q ss_pred CHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHH-HCCCCCCCEEEECC Q ss_conf 85699999999854891999817998565032022489705889711047632727999999987-50464541999714 Q gi|255764489|r 13 SKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQ-KMSDKKPDLILSGV 91 (250) Q Consensus 13 a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~-~l~~~~pDlViSGi 91 (250) -..+..|+++|.+.||+|+|+++...+....+.... ..........+.....+. ..-+. .+...+||+|-+= T Consensus 14 etfv~~la~~L~~~GHeV~v~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~---~~~l~~~i~~~~~DiIH~H- 86 (355) T cd03799 14 ETFILREILALEAAGHEVEIFSLRPPEDTLVHPEDR---AELARTRYLARSLALLAQ---ALVLARELRRLGIDHIHAH- 86 (355) T ss_pred CHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCC---CEECCCCCCHHHHHHHHH---HHHHHHHHHHCCCCEEEEC- T ss_conf 179999999999679849999534887773064302---121552154777999999---9999999977799899976- Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 8885310233345777899889871898532 Q gi|255764489|r 92 NVGTNTSNHVAYSGTLAAAFEGSLQGIRSFA 122 (250) Q Consensus 92 N~G~N~g~~v~ySGTvgAA~ea~~~GipsIA 122 (250) ..+.....+.+-+.+.|+|-|. T Consensus 87 ---------~~~~~~~~~~~~~~~~~ip~v~ 108 (355) T cd03799 87 ---------FGTTPATVAMLASRLGGIPYSF 108 (355) T ss_pred ---------CCCHHHHHHHHHHHHCCCCEEE T ss_conf ---------8833799999999974999999 No 23 >cd03132 GATase1_catalase Type 1 glutamine amidotransferase (GATase1)-like domain found in at the C-terminal of several large catalases. Type 1 glutamine amidotransferase (GATase1)-like domain found in at the C-terminal of several large catalases. Catalase catalyzes the dismutation of hydrogen peroxide (H2O2) to water and oxygen. This group includes the large catalases: Neurospora crassa Catalase-1 and Catalase-3 and, Escherichia coli HP-II. This GATase1-like domain has an essential role in HP-II catalase activity. However, it lacks enzymatic activity and the catalytic triad typical of GATase1 domains. Catalase-1 and -3 are homotetrameric, HP-II is homohexameric. It has been proposed that this domain may facilitate the folding and oligomerization process. The interface between this GATase1-like domain of HP-II and the core of the subunit forms part of a channel which provides access to the deeply buried catalase active sites of HPII. Catalase-1 is associated with non-growing cells; C Probab=86.13 E-value=1.7 Score=23.60 Aligned_cols=35 Identities=20% Similarity=0.460 Sum_probs=30.8 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCC Q ss_conf 6882689868856999999998548919998179985 Q gi|255764489|r 3 ILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQ 39 (250) Q Consensus 3 ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~ 39 (250) ||+++ |++..-+..++++|++.|..|.|++|.... T Consensus 6 iLv~d--g~~~~e~~~~~~al~~aG~~v~via~~~g~ 40 (142) T cd03132 6 ILVAD--GVDAAELSALKAALKAAGANVKVVAPTLGG 40 (142) T ss_pred EEECC--CCCHHHHHHHHHHHHHCCCEEEEEECCCCC T ss_conf 99669--988899999999999889989999579870 No 24 >pfam07355 GRDB Glycine/sarcosine/betaine reductase selenoprotein B (GRDB). This family represents a conserved region approximately 350 residues long within the selenoprotein B component of the bacterial glycine, sarcosine and betaine reductase complexes. Probab=85.23 E-value=2 Score=23.20 Aligned_cols=52 Identities=27% Similarity=0.441 Sum_probs=38.1 Q ss_pred HHHHHHHH-HHCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 99999998-75046454199971488853102333457778998898718985322 Q gi|255764489|r 69 VDCVVIAL-QKMSDKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFAL 123 (250) Q Consensus 69 aDcv~~~l-~~l~~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAi 123 (250) .|-+.--+ ..+.+.+|||+|.|+ +.|.|..=.--|||..|.+-.+ |||++.= T Consensus 65 ~dea~~~il~mv~~~~pDlfiAGP--AFnAGRYGvACG~i~kaV~e~l-~IP~vTg 117 (349) T pfam07355 65 IEEAVAEILEMLKEEKPDLFIAGP--AFNAGRYGVACGTIAKAVKEEL-GIPAVTG 117 (349) T ss_pred HHHHHHHHHHHHHHCCCCEEEECC--CCCCCCHHHHHHHHHHHHHHHH-CCCEEEE T ss_conf 999999999999842999898766--3256425888999999999986-9963864 No 25 >cd03794 GT1_wbuB_like This family is most closely related to the GT1 family of glycosyltransferases. wbuB in E. coli is involved in the biosynthesis of the O26 O-antigen. It has been proposed to function as an N-acetyl-L-fucosamine (L-FucNAc) transferase. Probab=84.80 E-value=3.4 Score=21.77 Aligned_cols=112 Identities=13% Similarity=0.122 Sum_probs=53.0 Q ss_pred EEEEECCCCC-C----CHHHHHHHHHHHHCCCCEEEECCCCCCCCEEE----EECCCCCEEEEEEEEEEEEECCCH---H Q ss_conf 2688268986-8----85699999999854891999817998565032----022489705889711047632727---9 Q gi|255764489|r 2 RILLTNDDGI-K----SKGLITLENIARSISDDIWICAPEMDQSCLAN----SLTMSRNIACRTISKKRFAVHGTP---V 69 (250) Q Consensus 2 ~ILitNDDG~-~----a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~----ait~~~~~~~~~~~~~~~~v~GtP---a 69 (250) |||+..+.=. . +.-+..|++.|.+.||+|+|+++..+...... .......+++.++....+.-.+.. . T Consensus 1 KIlii~~~fpP~~gG~~~~~~~la~~L~~~Gh~V~v~t~~~~~~~~~~~~~~~~~~~~gi~v~r~~~~~~~~~~~~~~~~ 80 (394) T cd03794 1 KILILSQYFPPELGGGAFRTTELAEELVKRGHEVTVITGSPNYPSGKIYKGYKREEVDGVRVHRVPLPPYKKNGLLKRLL 80 (394) T ss_pred CEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCEEEECCEEEEEECCCCCCCCCHHHHHH T ss_conf 98999177789898299999999999997799799995478776432356664466488599993377667752788999 Q ss_pred HHHHHHHH----H-CCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 99999987----5-04645419997148885310233345777899889871898532 Q gi|255764489|r 70 DCVVIALQ----K-MSDKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFA 122 (250) Q Consensus 70 Dcv~~~l~----~-l~~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIA 122 (250) .-..+... . ....+||+|++- +. -..++ ..|..-+...|+|-+- T Consensus 81 ~~~~~~~~~~~~~~~~~~~~Div~~~---~~-----~~~~~-~~~~~~~~~~~~p~v~ 129 (394) T cd03794 81 NYLSFALSALLALLKRRRRPDVIIAT---SP-----PLLIA-LAALLLARLKGAPFVL 129 (394) T ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEC---CC-----CHHHH-HHHHHHHHHCCCCEEE T ss_conf 99999999999999855899889991---78-----47889-9999999863996999 No 26 >cd03820 GT1_amsD_like This family is most closely related to the GT1 family of glycosyltransferases. AmSD in Erwinia amylovora has been shown to be involved in the biosynthesis of amylovoran, the acidic exopolysaccharide acting as a virulence factor. This enzyme may be responsible for the formation of galactose alpha-1,6 linkages in amylovoran. Probab=83.78 E-value=3.8 Score=21.50 Aligned_cols=72 Identities=18% Similarity=0.232 Sum_probs=36.5 Q ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHH-CCCCCCCEEEE Q ss_conf 56999999998548919998179985650320224897058897110476327279999999875-04645419997 Q gi|255764489|r 14 KGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQK-MSDKKPDLILS 89 (250) Q Consensus 14 ~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~-l~~~~pDlViS 89 (250) --+..|++.|.+.||+|.|+++.....- ...+...+++..+......-....-. ....+.. +...+||+|++ T Consensus 17 ~~~~~la~~L~~~G~~V~vit~~~~~~~---~~~~~~~i~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~~Dvi~~ 89 (348) T cd03820 17 RVLSNLANALAEKGHEVTIISLDKGEPP---FYELDPKIKVIDLGDKRDSKLLARFK-KLRRLRKLLKNNKPDVVIS 89 (348) T ss_pred HHHHHHHHHHHHCCCEEEEEEECCCCCC---CCCCCCCEEEEECCCCCCCCHHHHHH-HHHHHHHHHHHCCCCEEEE T ss_conf 9999999999877998999996699986---44058974999888765420567899-9999999999759999999 No 27 >PRK06849 hypothetical protein; Provisional Probab=82.07 E-value=2.6 Score=22.52 Aligned_cols=73 Identities=11% Similarity=0.195 Sum_probs=41.5 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCC-CEEEEECCCCCEEEEEEEEEEEEEC---CCHHHHHHHHH Q ss_conf 9268826898688569999999985489199981799856-5032022489705889711047632---72799999998 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQS-CLANSLTMSRNIACRTISKKRFAVH---GTPVDCVVIAL 76 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S-~~~~ait~~~~~~~~~~~~~~~~v~---GtPaDcv~~~l 76 (250) ++||||+ -.+|-=..|++.+.+.||+|+++-+..-.- ..|++++ +.|.+. --|.- -.-+| T Consensus 5 ~tvLiTg---~r~~~aL~laR~l~~~Gh~V~~aD~~~~~l~r~Sr~v~------------~~~~vP~P~~d~~~-y~~~L 68 (387) T PRK06849 5 KTVLITG---ARAPAALQLARSFHNAGHTVILADSLKYPLSRFSRAVD------------GFYTIPSPKWDPNA-YIQAL 68 (387) T ss_pred CEEEEEC---CCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHCEE------------EEEECCCCCCCHHH-HHHHH T ss_conf 7799958---86078999999998789979998489877542000011------------27986999789899-99999 Q ss_pred HHC-CCCCCCEEEE Q ss_conf 750-4645419997 Q gi|255764489|r 77 QKM-SDKKPDLILS 89 (250) Q Consensus 77 ~~l-~~~~pDlViS 89 (250) ..+ ..++.||+|. T Consensus 69 l~Iv~~e~idl~IP 82 (387) T PRK06849 69 LSIVKRHNIDLLIP 82 (387) T ss_pred HHHHHHHCCCEEEE T ss_conf 99999838999997 No 28 >cd03825 GT1_wcfI_like This family is most closely related to the GT1 family of glycosyltransferases. wcfI in Bacteroides fragilis has been shown to be involved in the capsular polysaccharide biosynthesis. Probab=80.21 E-value=3.1 Score=22.00 Aligned_cols=37 Identities=19% Similarity=0.237 Sum_probs=27.0 Q ss_pred CEEEEECCCCCC---CHHHHHHHHHHHHCCCCEEEECCCC Q ss_conf 926882689868---8569999999985489199981799 Q gi|255764489|r 1 MRILLTNDDGIK---SKGLITLENIARSISDDIWICAPEM 37 (250) Q Consensus 1 m~ILitNDDG~~---a~gl~~l~~~l~~~~~~v~vvAP~~ 37 (250) ||||..|.-... ..-+..|.+.|.+.||+|.++.+.. T Consensus 1 MKIL~v~~~~~~GGae~~~~~L~~~L~~~Gh~v~v~~~~~ 40 (365) T cd03825 1 MKVLHLNTSDISGGAARAAYRLHRALQAAGVDSTMLVQEK 40 (365) T ss_pred CEEEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEEEECC T ss_conf 9599993899992389999999999997799089999269 No 29 >PRK10365 transcriptional regulatory protein ZraR; Provisional Probab=75.14 E-value=7.2 Score=19.80 Aligned_cols=30 Identities=20% Similarity=0.314 Sum_probs=21.8 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEE Q ss_conf 926882689868856999999998548919998 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWIC 33 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vv 33 (250) |+|||.-||--. ...|.+.|+..|++|..+ T Consensus 6 ~~ILIVDDd~~~---~~~l~~~L~~~G~~v~~a 35 (441) T PRK10365 6 IDILVVDDDISH---CTILQALLRGWGYNVALA 35 (441) T ss_pred CEEEEECCCHHH---HHHHHHHHHHCCCEEEEE T ss_conf 859998398999---999999999779989998 No 30 >TIGR01027 proB glutamate 5-kinase; InterPro: IPR005715 L-glutamate 5-phosphotransferase, (gamma-glutamyl kinase, proB, 2.7.2.11 from EC), catalyzes the first step in proline biosynthesisATP + L-glutamate = ADP + L-glutamate 5-phosphate. the product of which rapidly cyclises to 5-oxoproline and phosphate. Bacterial ProB proteins hits the full length of this model, but the ProB-like domain of delta 1-pyrroline-5-carboxylate synthetase does not hit the C-terminal 100 residues. ; GO: 0004349 glutamate 5-kinase activity, 0006561 proline biosynthetic process, 0005737 cytoplasm. Probab=74.50 E-value=4.5 Score=21.01 Aligned_cols=31 Identities=16% Similarity=0.286 Sum_probs=26.6 Q ss_pred EEECCCC---CCCHHHHHHHHH---HHHCCCCEEEEC Q ss_conf 8826898---688569999999---985489199981 Q gi|255764489|r 4 LLTNDDG---IKSKGLITLENI---ARSISDDIWICA 34 (250) Q Consensus 4 LitNDDG---~~a~gl~~l~~~---l~~~~~~v~vvA 34 (250) +||+++| ++.+-|..|++. |++.||+|++|- T Consensus 11 ~Lt~~~g~h~l~~~~i~~lv~~v~~L~~~Gh~vviVS 47 (379) T TIGR01027 11 SLTGSSGSHELDRSRIAELVEQVAALHAAGHEVVIVS 47 (379) T ss_pred EEECCCCCCEECHHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 1117888741377999999999999986599899981 No 31 >PRK12767 carbamoyl phosphate synthase-like protein; Provisional Probab=74.42 E-value=7.5 Score=19.68 Aligned_cols=74 Identities=16% Similarity=0.146 Sum_probs=41.7 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEEC-CCHHHHHHHHHHHC Q ss_conf 92688268986885699999999854891999817998565032022489705889711047632-72799999998750 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVH-GTPVDCVVIALQKM 79 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~-GtPaDcv~~~l~~l 79 (250) ||||||+==| |. .+.++|+..+....|++-+.+--+.|+.+ .++.|.+. .+=-+-+ =.|..+ T Consensus 2 ~nILvt~~G~----~~-~ii~~lk~~~~~~~Vi~~D~~~~a~~~~~-----------aD~~y~~P~~~d~~y~-~~ll~i 64 (325) T PRK12767 2 MNILVTSAGR----RV-QLVKALKKSLLGGKVIGADISPLAPALYF-----------ADKFYVVPKVTDPNYI-DALLDI 64 (325) T ss_pred CEEEEECCCC----HH-HHHHHHHHCCCCCEEEEECCCCCCCCHHH-----------CCEEEECCCCCCHHHH-HHHHHH T ss_conf 4899986786----89-99999997699859999689989953445-----------4889987888987899-999999 Q ss_pred C-CCCCCEEEECC Q ss_conf 4-64541999714 Q gi|255764489|r 80 S-DKKPDLILSGV 91 (250) Q Consensus 80 ~-~~~pDlViSGi 91 (250) + ..++|+||.|+ T Consensus 65 ~~~~~id~iiP~~ 77 (325) T PRK12767 65 CKKENIDALIPLI 77 (325) T ss_pred HHHHCCCEEEECC T ss_conf 9987999999778 No 32 >TIGR03107 glu_aminopep glutamyl aminopeptidase. This model represents the M42.001 clade within MEROPS family M42. M42 includes glutamyl aminopeptidase as in the present model, deblocking aminopeptidases as from Pyrococcus horikoshii and related species, and endo-1,4-beta-glucanase (cellulase M) as from Clostridium thermocellum. The current family includes Probab=73.84 E-value=3.7 Score=21.54 Aligned_cols=22 Identities=14% Similarity=0.040 Sum_probs=15.3 Q ss_pred CCCCHHHHHHHHHHHHCCCCEE Q ss_conf 8688569999999985489199 Q gi|255764489|r 10 GIKSKGLITLENIARSISDDIW 31 (250) Q Consensus 10 G~~a~gl~~l~~~l~~~~~~v~ 31 (250) |++.+-.+.+++.+++++.+|+ T Consensus 15 G~E~~v~~~i~~~l~~~~dev~ 36 (350) T TIGR03107 15 GFEHPIRDYLRQDITPLVDQVE 36 (350) T ss_pred CCHHHHHHHHHHHHHHHCCEEE T ss_conf 6759999999999886187499 No 33 >cd03823 GT1_ExpE7_like This family is most closely related to the GT1 family of glycosyltransferases. ExpE7 in Sinorhizobium meliloti has been shown to be involved in the biosynthesis of galactoglucans (exopolysaccharide II). Probab=73.81 E-value=7.7 Score=19.59 Aligned_cols=41 Identities=17% Similarity=0.071 Sum_probs=27.4 Q ss_pred EEEEECCCCC-CC-----HHHHHHHHHHHHCCCCEEEECCCCCCCCE Q ss_conf 2688268986-88-----56999999998548919998179985650 Q gi|255764489|r 2 RILLTNDDGI-KS-----KGLITLENIARSISDDIWICAPEMDQSCL 42 (250) Q Consensus 2 ~ILitNDDG~-~a-----~gl~~l~~~l~~~~~~v~vvAP~~~~S~~ 42 (250) |||+.++.=. +. --+..|++.|.+.||+|+|+++....... T Consensus 1 rIl~vt~~~pP~~~GG~e~~~~~la~~L~~~Gh~V~v~~~~~~~~~~ 47 (359) T cd03823 1 RILVVNHLYPPRSVGGAEVVAHDLAEALAKRGHEVAVLTAGEDPPRQ 47 (359) T ss_pred CEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCC T ss_conf 99999486489997649999999999999779989999557987664 No 34 >COG1091 RfbD dTDP-4-dehydrorhamnose reductase [Cell envelope biogenesis, outer membrane] Probab=73.47 E-value=7.9 Score=19.54 Aligned_cols=97 Identities=20% Similarity=0.302 Sum_probs=61.3 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCC Q ss_conf 92688268986885699999999854891999817998565032022489705889711047632727999999987504 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMS 80 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~ 80 (250) ||||||-+-|.-.- .|.+.+. -+++ |+|..... +|-|=.|+|.--+.. T Consensus 1 M~iLi~G~~GqLG~---~L~~~l~-~~~~--v~a~~~~~------------------------~Ditd~~~v~~~i~~-- 48 (281) T COG1091 1 MKILITGANGQLGT---ELRRALP-GEFE--VIATDRAE------------------------LDITDPDAVLEVIRE-- 48 (281) T ss_pred CCEEEECCCCHHHH---HHHHHHC-CCCE--EEECCCCC------------------------CCCCCHHHHHHHHHH-- T ss_conf 95899769876799---9999717-7843--99515765------------------------555685899999986-- Q ss_pred CCCCCEEEECCCCC-----CC---CCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 64541999714888-----53---1023334577789988987189853222001356 Q gi|255764489|r 81 DKKPDLILSGVNVG-----TN---TSNHVAYSGTLAAAFEGSLQGIRSFALSQAYTYE 130 (250) Q Consensus 81 ~~~pDlViSGiN~G-----~N---~g~~v~ySGTvgAA~ea~~~GipsIAiS~~~~~~ 130 (250) .+||+||.=.=+= .+ ....+---|+---|..+.-.|.+=|=+|-+|-++ T Consensus 49 -~~PDvVIn~AAyt~vD~aE~~~e~A~~vNa~~~~~lA~aa~~~ga~lVhiSTDyVFD 105 (281) T COG1091 49 -TRPDVVINAAAYTAVDKAESEPELAFAVNATGAENLARAAAEVGARLVHISTDYVFD 105 (281) T ss_pred -HCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCEEEEEECCEEEC T ss_conf -199989987320365413389899777677999999999997197699963445743 No 35 >cd03805 GT1_ALG2_like This family is most closely related to the GT1 family of glycosyltransferases. ALG2, a 1,3-mannosyltransferase, in yeast catalyzes the mannosylation of Man(2)GlcNAc(2)-dolichol diphosphate and Man(1)GlcNAc(2)-dolichol diphosphate to form Man(3)GlcNAc(2)-dolichol diphosphate. A deficiency of this enzyme causes an abnormal accumulation of Man1GlcNAc2-PP-dolichol and Man2GlcNAc2-PP-dolichol, which is associated with a type of congenital disorders of glycosylation (CDG), designated CDG-Ii, in humans. Probab=72.27 E-value=7 Score=19.87 Aligned_cols=40 Identities=20% Similarity=0.319 Sum_probs=27.4 Q ss_pred CEEEEECCC-CC--CCHHHHHHHHHHHHCCCCEEEECCCCCCC Q ss_conf 926882689-86--88569999999985489199981799856 Q gi|255764489|r 1 MRILLTNDD-GI--KSKGLITLENIARSISDDIWICAPEMDQS 40 (250) Q Consensus 1 m~ILitNDD-G~--~a~gl~~l~~~l~~~~~~v~vvAP~~~~S 40 (250) ||||..-.+ |+ .---+..|++.|.+.||+|.|+....+++ T Consensus 1 MkI~fi~p~l~~GGaEr~v~~la~~L~~~Gh~V~v~t~~~d~~ 43 (392) T cd03805 1 LRVAFIHPDLGIGGAERLVVDAALALQSRGHEVTIYTSHHDPS 43 (392) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCC T ss_conf 9899986999998699999999999997699399997268833 No 36 >cd03801 GT1_YqgM_like This family is most closely related to the GT1 family of glycosyltransferases and named after YqgM in Bacillus licheniformis about which little is known. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. Probab=71.96 E-value=8.6 Score=19.32 Aligned_cols=111 Identities=14% Similarity=0.086 Sum_probs=51.5 Q ss_pred EEEEECCCCCCC-HH----HHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHH Q ss_conf 268826898688-56----9999999985489199981799856503202248970588971104763272799999998 Q gi|255764489|r 2 RILLTNDDGIKS-KG----LITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIAL 76 (250) Q Consensus 2 ~ILitNDDG~~a-~g----l~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l 76 (250) |||+..+.-.-. -| +..|++.|.+.||+|+|++|.....-...... ...................-.....+ T Consensus 1 kIlii~~~~~P~~GG~e~~~~~la~~L~~~G~~V~vit~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (374) T cd03801 1 KILLVTPEYPPSVGGAERHVLELARALAARGHEVTVLTPGDGGLPDEEEVG---GIVVVRPPPLLRVRRLLLLLLLALRL 77 (374) T ss_pred CEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCCCCHHHHCC---CCEEEECCCCCCCCHHHHHHHHHHHH T ss_conf 989994877999881999999999999977998999960798875034237---71699567665420024567899999 Q ss_pred H-HCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 7-5046454199971488853102333457778998898718985322200 Q gi|255764489|r 77 Q-KMSDKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQA 126 (250) Q Consensus 77 ~-~l~~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~ 126 (250) . .+...+||+|.+- + ..++..+ ..-....++|-|..... T Consensus 78 ~~~~~~~~~Dii~~~-----~-----~~~~~~~-~~~~~~~~~~~i~~~h~ 117 (374) T cd03801 78 RRLLRRERFDVVHAH-----D-----WLALLAA-ALAARLLGIPLVLTVHG 117 (374) T ss_pred HHHHHHCCCCEEEEC-----C-----CHHHHHH-HHHHHHCCCCEEEEECC T ss_conf 999985599899978-----8-----3178999-99998669978999678 No 37 >cd03134 GATase1_PfpI_like A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus. A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus. This group includes proteins similar to PfpI from P. furiosus. and PH1704 from Pyrococcus horikoshii. These enzymes are ATP-independent intracellular proteases and may hydrolyze small peptides to provide a nutritional source. Only Cys of the catalytic triad typical of GATase1 domains is conserved in this group. This Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. For PH1704, it is believed that this Cys together with a different His in one monomer and Glu (from an adjacent monomer) forms a different catalytic triad from the typical GATase1domain. PfpI is homooligomeric. Protease activity is only found for oligomeric forms of PH1704. Probab=71.85 E-value=8 Score=19.52 Aligned_cols=42 Identities=14% Similarity=0.220 Sum_probs=34.7 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEE Q ss_conf 268826898688569999999985489199981799856503 Q gi|255764489|r 2 RILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLA 43 (250) Q Consensus 2 ~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~ 43 (250) ||||-=-||++..-+..-++.|+..|++|.++.|+..++-.| T Consensus 1 KV~il~~~Gfee~E~~~p~~~l~~aG~~V~~~s~~~~~~v~~ 42 (165) T cd03134 1 KVAILAADGFEDVELTYPLYRLREAGAEVVVAGPEAGGEIQG 42 (165) T ss_pred CEEEEECCCCHHHHHHHHHHHHHHCCCEEEEEEECCCCCEEE T ss_conf 999992698209999999999997899799999749985683 No 38 >PRK09836 DNA-binding transcriptional activator CusR; Provisional Probab=68.81 E-value=4.4 Score=21.09 Aligned_cols=81 Identities=11% Similarity=0.126 Sum_probs=50.7 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCC Q ss_conf 92688268986885699999999854891999817998565032022489705889711047632727999999987504 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMS 80 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~ 80 (250) ||||+.-||-.-+. .|.+.|+..|++|..+. + ..-|+..+. T Consensus 1 MkILiVEDd~~l~~---~l~~~L~~~G~~v~~a~------------------------------~------g~~a~~~~~ 41 (226) T PRK09836 1 MKLLIVEDEKKTGE---YLTKGLTEAGFVVDLAD------------------------------N------GLNGYHLAM 41 (226) T ss_pred CEEEEEECCHHHHH---HHHHHHHHCCCEEEEEC------------------------------C------HHHHHHHHH T ss_conf 98999939999999---99999987899999989------------------------------9------999999985 Q ss_pred CCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 6454199971488853102333457778998898718985322200 Q gi|255764489|r 81 DKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQA 126 (250) Q Consensus 81 ~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~ 126 (250) ..+|||||--||...--|.++. ..+-....++|=|-+|.. T Consensus 42 ~~~~DlvilDi~lP~~~G~~l~------~~iR~~~~~~PII~Lta~ 81 (226) T PRK09836 42 TGDYDLIILDIMLPDVNGWDIV------RMLRSANKGMPILLLTAL 81 (226) T ss_pred HCCCCEEEECCCCCCCCCCCHH------HHHHHHCCCCEEEEEECC T ss_conf 1899999988999999872043------567761679609999445 No 39 >PRK13782 phosphocarrier protein Chr; Provisional Probab=64.19 E-value=12 Score=18.35 Aligned_cols=79 Identities=18% Similarity=0.161 Sum_probs=53.1 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCCC Q ss_conf 26882689868856999999998548919998179985650320224897058897110476327279999999875046 Q gi|255764489|r 2 RILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMSD 81 (250) Q Consensus 2 ~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~~ 81 (250) ++-|+|.-|+++---..|++.+..+..+|++.. .+++.-+.||----.|.+.+-..=...++|.=++-..-+|-.++. T Consensus 5 ~v~I~n~~GLHARPAa~lv~~a~~f~s~I~l~~--~~~~~naKSil~lm~Lg~~~G~~i~i~a~G~De~~Al~~l~~lie 82 (85) T PRK13782 5 RVEVSLKTGLQARPAALFVQEANRFHADIFIEK--DGKKVNAKSIMGLMSLAIGTGSMITIITEGSDEEEALEALAAYVQ 82 (85) T ss_pred EEEECCCCCCCHHHHHHHHHHHHCCCCEEEEEE--CCEEECHHHHHHHHHCCCCCCCEEEEEEECCCHHHHHHHHHHHHH T ss_conf 999868997656089999999972898699998--999983582999985177898999999978899999999999984 Q ss_pred C Q ss_conf 4 Q gi|255764489|r 82 K 82 (250) Q Consensus 82 ~ 82 (250) . T Consensus 83 ~ 83 (85) T PRK13782 83 E 83 (85) T ss_pred C T ss_conf 5 No 40 >PRK10675 UDP-galactose-4-epimerase; Provisional Probab=63.24 E-value=8.4 Score=19.38 Aligned_cols=31 Identities=23% Similarity=0.208 Sum_probs=24.2 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEEC Q ss_conf 9268826898688569999999985489199981 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICA 34 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvA 34 (250) ||||||=--|+=-. .|++.|.+.||+|+++- T Consensus 1 MkvLVTGg~GFIGs---~l~~~Ll~~g~~V~~~d 31 (338) T PRK10675 1 MRVLVTGGSGYIGS---HTCVQLLQNGHDVVILD 31 (338) T ss_pred CEEEEECCCCHHHH---HHHHHHHHCCCEEEEEE T ss_conf 91999898767999---99999997849899998 No 41 >COG1817 Uncharacterized protein conserved in archaea [Function unknown] Probab=63.22 E-value=3.5 Score=21.68 Aligned_cols=102 Identities=16% Similarity=0.155 Sum_probs=55.6 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECC-CHHH--HHHHH-- Q ss_conf 926882689868856999999998548919998179985650320224897058897110476327-2799--99999-- Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHG-TPVD--CVVIA-- 75 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~G-tPaD--cv~~~-- 75 (250) |||++-=----++.-.+.|-..+++-||+|.+.+=+.+-+.-=.- .++-+ ...++. +| ++-- -+..+ T Consensus 1 mkVwiDI~n~~hvhfFk~lI~elekkG~ev~iT~rd~~~v~~LLd-~ygf~--~~~Igk-----~g~~tl~~Kl~~~~eR 72 (346) T COG1817 1 MKVWIDIGNPPHVHFFKNLIWELEKKGHEVLITCRDFGVVTELLD-LYGFP--YKSIGK-----HGGVTLKEKLLESAER 72 (346) T ss_pred CEEEEECCCCCHHHHHHHHHHHHHHCCEEEEEEEEECCCHHHHHH-HHCCC--EEEECC-----CCCCCHHHHHHHHHHH T ss_conf 937997589610238999999998578499999851275889999-83997--076404-----5774478999999999 Q ss_pred ---HHHC-CCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf ---8750-464541999714888531023334577789988987189853222 Q gi|255764489|r 76 ---LQKM-SDKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALS 124 (250) Q Consensus 76 ---l~~l-~~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS 124 (250) |..+ ...+||+.++ +| -+-+++-+...|+|+|-+= T Consensus 73 ~~~L~ki~~~~kpdv~i~-~~-------------s~~l~rvafgLg~psIi~~ 111 (346) T COG1817 73 VYKLSKIIAEFKPDVAIG-KH-------------SPELPRVAFGLGIPSIIFV 111 (346) T ss_pred HHHHHHHHHHCCCCEEEE-CC-------------CCCHHHHHHHCCCCEEEEC T ss_conf 999999875229855752-27-------------8105567765288638964 No 42 >PRK00726 murG N-acetylglucosaminyl transferase; Provisional Probab=62.92 E-value=13 Score=18.20 Aligned_cols=108 Identities=19% Similarity=0.188 Sum_probs=50.4 Q ss_pred EEEECC-C-CCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEEC---CCHHHHHHH--- Q ss_conf 688268-9-86885699999999854891999817998565032022489705889711047632---727999999--- Q gi|255764489|r 3 ILLTND-D-GIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVH---GTPVDCVVI--- 74 (250) Q Consensus 3 ILitND-D-G~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~---GtPaDcv~~--- 74 (250) ||++=- - |---|+| +|+++|++.|++|..+.-.. |.-..+--...+....+....+.=. ..+.....+ T Consensus 4 I~i~~GGTGGHi~Pal-ala~~L~~~g~ev~~ig~~~---g~E~~~v~~~~~~~~~i~~~~~~~~~~~~~~~~~~~l~~~ 79 (359) T PRK00726 4 ILLAGGGTGGHVFPAL-ALAEELKKRGWEVLWLGTKR---GMEARLVPKAGIEFHFIPIGGLRRKGSLANLKAPFKLLKG 79 (359) T ss_pred EEEEECCCHHHHHHHH-HHHHHHHHCCCEEEEEECCC---HHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHH T ss_conf 9999588689999999-99999983879899997882---6865404414983899777888987879999999999999 Q ss_pred ---HHHHCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf ---987504645419997148885310233345777899889871898532220 Q gi|255764489|r 75 ---ALQKMSDKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQ 125 (250) Q Consensus 75 ---~l~~l~~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~ 125 (250) ++..+...+||+||+ .|-.+ |. -+.+-|.++|+|-+=--| T Consensus 80 ~~~~~~il~~~kPd~Vig-------~GGY~--s~--P~~laA~l~~iP~iiHEq 122 (359) T PRK00726 80 VLQARKILKRFKPDVVVG-------FGGYV--SG--PAGLAARLLGIPLVIHEQ 122 (359) T ss_pred HHHHHHHHHHCCCCEEEE-------CCCHH--HH--HHHHHHHHCCCCEEEECC T ss_conf 999999999749999997-------89741--28--999999982998699745 No 43 >PRK11517 transcriptional regulatory protein YedW; Provisional Probab=61.89 E-value=7.1 Score=19.82 Aligned_cols=80 Identities=14% Similarity=0.051 Sum_probs=49.8 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCC Q ss_conf 92688268986885699999999854891999817998565032022489705889711047632727999999987504 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMS 80 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~ 80 (250) ||||+.-||=--+. .|.+.|+..|++|..+. +| .-|+..+. T Consensus 1 MkILiVEDd~~l~~---~l~~~L~~~g~~V~~a~------------------------------~g------~~al~~~~ 41 (223) T PRK11517 1 MKILLIEDNQRTQE---WVTQGLSEAGYVIDAVS------------------------------DG------RDGLYLAL 41 (223) T ss_pred CEEEEEECCHHHHH---HHHHHHHHCCCEEEEEC------------------------------CH------HHHHHHHH T ss_conf 98999969899999---99999998899999989------------------------------99------99999985 Q ss_pred CCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 6454199971488853102333457778998898718985322200 Q gi|255764489|r 81 DKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQA 126 (250) Q Consensus 81 ~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~ 126 (250) ...|||||.-++...--|.++.- .-++ ...+|-|-+|.. T Consensus 42 ~~~~DlvilDi~lP~~dG~~l~~-----~iR~--~~~~pII~lta~ 80 (223) T PRK11517 42 KDDYALIILDIMLPGMDGWQILQ-----TLRT--AKQTPVICLTAR 80 (223) T ss_pred CCCCCEEEECCCCCCCCHHHHHH-----HHHH--CCCCCEEEEECC T ss_conf 28999999849998736899999-----9985--688648999565 No 44 >PRK09189 uroporphyrinogen-III synthase; Validated Probab=61.59 E-value=14 Score=18.11 Aligned_cols=35 Identities=17% Similarity=0.215 Sum_probs=27.6 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCC Q ss_conf 9268826898688569999999985489199981799856 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQS 40 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S 40 (250) ||||||== .++-..|.+.|++.||+| ++.|--+-. T Consensus 1 MrILvTRP----~~~a~~l~~~L~~~G~~~-~~~Pll~i~ 35 (237) T PRK09189 1 MRVLVTRP----EPSAERTAARLRAMGHEP-VLLPLSRTV 35 (237) T ss_pred CEEEEECC----HHHHHHHHHHHHHCCCCE-EECCCCCEE T ss_conf 93999386----777999999999779989-987861055 No 45 >cd03796 GT1_PIG-A_like This family is most closely related to the GT1 family of glycosyltransferases. Phosphatidylinositol glycan-class A (PIG-A), an X-linked gene in humans, is necessary for the synthesis of N-acetylglucosaminyl-phosphatidylinositol, a very early intermediate in glycosyl phosphatidylinositol (GPI)-anchor biosynthesis. The GPI-anchor is an important cellular structure that facilitates the attachment of many proteins to cell surfaces. Somatic mutations in PIG-A have been associated with Paroxysmal Nocturnal Hemoglobinuria (PNH), an acquired hematological disorder. Probab=60.83 E-value=14 Score=17.97 Aligned_cols=109 Identities=9% Similarity=0.091 Sum_probs=50.5 Q ss_pred EEEEECCCCC-CC----HHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEE-EEECCCHHH-HHHH Q ss_conf 2688268986-88----569999999985489199981799856503202248970588971104-763272799-9999 Q gi|255764489|r 2 RILLTNDDGI-KS----KGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKR-FAVHGTPVD-CVVI 74 (250) Q Consensus 2 ~ILitNDDG~-~a----~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~-~~v~GtPaD-cv~~ 74 (250) |||+.+|.=+ +- .-+..|++.|.+.||+|+|+.+.......-+- ....+++..+.... +.-.--|.- +..- T Consensus 1 RI~ivt~~f~P~iGG~e~~v~~La~~L~~~Gh~V~Vit~~~~~~~~~~~--~~~g~~V~~~p~~~~~~~~~~~~~~~~~~ 78 (398) T cd03796 1 RICMVSDFFYPNLGGVETHIYQLSQCLIKRGHKVVVITHAYGNRVGIRY--LTNGLKVYYLPFVVFYNQSTLPTFFGTFP 78 (398) T ss_pred CEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCEE--ECCCCEEEECCCCCCCCCCHHHHHHHHHH T ss_conf 9899948879999977999999999999769989999689998877468--53884699756633456311677988899 Q ss_pred HHHH-CCCCCCCEEEECCCCCCCCCHHHHHHHH-HHHHHHHHHHCCCCCC Q ss_conf 9875-0464541999714888531023334577-7899889871898532 Q gi|255764489|r 75 ALQK-MSDKKPDLILSGVNVGTNTSNHVAYSGT-LAAAFEGSLQGIRSFA 122 (250) Q Consensus 75 ~l~~-l~~~~pDlViSGiN~G~N~g~~v~ySGT-vgAA~ea~~~GipsIA 122 (250) .+.. +..++||+|-+ | + .+|.- ..+.+.+...|+|.+. T Consensus 79 ~~r~~~~~~~~DIIH~---H--~-----~~~~l~~~~~~~ar~~g~~~V~ 118 (398) T cd03796 79 LLRNILIRERITIVHG---H--Q-----AFSALAHEALLHARTMGLKTVF 118 (398) T ss_pred HHHHHHHHCCCCEEEE---C--C-----HHHHHHHHHHHHHHHCCCCEEE T ss_conf 9999997679988998---9--6-----2688999999998755997899 No 46 >cd03821 GT1_Bme6_like This family is most closely related to the GT1 family of glycosyltransferases. Bme6 in Brucella melitensis has been shown to be involved in the biosynthesis of a polysaccharide. Probab=60.13 E-value=15 Score=17.90 Aligned_cols=28 Identities=4% Similarity=0.189 Sum_probs=21.2 Q ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCCC Q ss_conf 5699999999854891999817998565 Q gi|255764489|r 14 KGLITLENIARSISDDIWICAPEMDQSC 41 (250) Q Consensus 14 ~gl~~l~~~l~~~~~~v~vvAP~~~~S~ 41 (250) .-+..|++.|.+.||+|+|+.+..+.+. T Consensus 18 ~~~~~la~~L~~~Gh~V~v~t~~~~~~~ 45 (375) T cd03821 18 RVVLNLSKALAKLGHEVTVATTDAGGDP 45 (375) T ss_pred HHHHHHHHHHHHCCCEEEEEECCCCCCC T ss_conf 9999999999977998999970798764 No 47 >TIGR03466 HpnA hopanoid-associated sugar epimerase. The sequences in this family are members of the pfam01370 superfamily of NAD-dependent epimerases and dehydratases typically acting on nucleotide-sugar substrates. The genes of the family modeled here are generally in the same locus with genes involved in the biosynthesis and elaboration of hopene, the cyclization product of the polyisoprenoid squalene. Probab=59.51 E-value=10 Score=18.87 Aligned_cols=32 Identities=13% Similarity=0.324 Sum_probs=24.1 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECC Q ss_conf 92688268986885699999999854891999817 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAP 35 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP 35 (250) ||||||=--|+=.. .|++.|.+.||+|.+..- T Consensus 1 MriLVTGgtGfiG~---~l~~~L~~~G~~V~~l~r 32 (328) T TIGR03466 1 MKVLVTGATGFVGS---AVVRLLLERGEEVRVLVR 32 (328) T ss_pred CEEEEECCCCHHHH---HHHHHHHHCCCEEEEEEC T ss_conf 94999867779999---999999978498999989 No 48 >PRK13780 phosphocarrier protein HPr; Provisional Probab=56.85 E-value=16 Score=17.61 Aligned_cols=78 Identities=13% Similarity=0.126 Sum_probs=52.2 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCCC Q ss_conf 26882689868856999999998548919998179985650320224897058897110476327279999999875046 Q gi|255764489|r 2 RILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMSD 81 (250) Q Consensus 2 ~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~~ 81 (250) .+-|+|.-|+++---..|++.++.+..+|++...... .-+.||----.|.+.+-..=...++|.=++-+.-+|..++. T Consensus 5 ~~~I~n~~GLHARPAa~lv~~a~~f~s~I~i~~~~k~--~naKSi~~lm~Lg~~~G~~i~i~a~G~De~~A~~~l~~~l~ 82 (88) T PRK13780 5 DFHITAETGIHARPATLLVQTASKFDSDITLEYNGKS--VNLKSIMGVMSLGVGQGADITISAEGADEADAIAAIEETMK 82 (88) T ss_pred EEEECCCCCCCCCHHHHHHHHHHHCCCEEEEEECCEE--ECCHHHHHHHHHCCCCCCEEEEEEECCCHHHHHHHHHHHHH T ss_conf 9998189877625899999999738977999989999--82174999996178898999999978899999999999998 No 49 >PRK11083 DNA-binding response regulator CreB; Provisional Probab=56.38 E-value=17 Score=17.51 Aligned_cols=81 Identities=19% Similarity=0.191 Sum_probs=52.0 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCC Q ss_conf 92688268986885699999999854891999817998565032022489705889711047632727999999987504 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMS 80 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~ 80 (250) +||||.-||=.-+. .|.+.|+..|++|..+. + ..-|+..+. T Consensus 4 ~kILiVEDd~~l~~---~l~~~L~~~g~~v~~~~------------------------------~------~~~al~~~~ 44 (229) T PRK11083 4 PTILLVEDEQAIAD---TLVYALQSEGFTVEWFE------------------------------R------GLPALDKLR 44 (229) T ss_pred CEEEEEECCHHHHH---HHHHHHHHCCCEEEEEC------------------------------C------HHHHHHHHH T ss_conf 99999969999999---99999998899999989------------------------------9------999999997 Q ss_pred CCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 6454199971488853102333457778998898718985322200 Q gi|255764489|r 81 DKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQA 126 (250) Q Consensus 81 ~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~ 126 (250) ..+|||||--|+...=-|.++. ..+-.....+|-|-+|.. T Consensus 45 ~~~~DlvilDi~LP~~~G~~l~------~~iR~~~~~~pII~lta~ 84 (229) T PRK11083 45 QQPPDLVILDVGLPDISGFELC------RQLRAFHPALPVIFLTAR 84 (229) T ss_pred HCCCCEEEECCCCCCCCHHHHH------HHHHHHCCCCEEEEEECC T ss_conf 1899899973889998768899------999970899729998367 No 50 >PRK08622 galactose-6-phosphate isomerase subunit LacB; Reviewed Probab=56.06 E-value=14 Score=17.96 Aligned_cols=35 Identities=14% Similarity=0.198 Sum_probs=24.7 Q ss_pred CEEEEECCC-CCCCHHHHHHHHHHHHCCCCEEEECCCC Q ss_conf 926882689-8688569999999985489199981799 Q gi|255764489|r 1 MRILLTNDD-GIKSKGLITLENIARSISDDIWICAPEM 37 (250) Q Consensus 1 m~ILitNDD-G~~a~gl~~l~~~l~~~~~~v~vvAP~~ 37 (250) |||.|..|. |++-. ..|++.|++.|++|.=+-|.. T Consensus 1 MKI~IgsDHaG~eLK--e~l~~~L~~~G~eV~D~G~~~ 36 (171) T PRK08622 1 MKIAIGCDHIVTDEK--MAVSDFLKSKGHEVIDVGTYD 36 (171) T ss_pred CEEEEEECCHHHHHH--HHHHHHHHHCCCEEEECCCCC T ss_conf 979998486069999--999999997899899758999 No 51 >cd03785 GT1_MurG MurG is an N-acetylglucosaminyltransferase, the last enzyme involved in the intracellular phase of peptidoglycan biosynthesis. It transfers N-acetyl-D-glucosamine (GlcNAc) from UDP-GlcNAc to the C4 hydroxyl of a lipid-linked N-acetylmuramoyl pentapeptide (NAM). The resulting disaccharide is then transported across the cell membrane, where it is polymerized into NAG-NAM cell-wall repeat structure. MurG belongs to the GT-B structural superfamily of glycoslytransferases, which have characteristic N- and C-terminal domains, each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center and permits a high degree of flexibility. Probab=55.41 E-value=18 Score=17.41 Aligned_cols=100 Identities=17% Similarity=0.147 Sum_probs=47.2 Q ss_pred CCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEE---EECCCHHHHHHH--H----HHHCCC Q ss_conf 6885699999999854891999817998565032022489705889711047---632727999999--9----875046 Q gi|255764489|r 11 IKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRF---AVHGTPVDCVVI--A----LQKMSD 81 (250) Q Consensus 11 ~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~---~v~GtPaDcv~~--~----l~~l~~ 81 (250) --.|.| +|+++|++.||+|..+.-.. |.-..+-......+..+....+ ..-....++..+ + ...+.. T Consensus 12 Hi~Pal-ala~~L~~~g~~V~~i~~~~---g~e~~~~~~~g~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 87 (350) T cd03785 12 HIFPAL-ALAEELRERGAEVLFLGTKR---GLEARLVPKAGIPLHTIPVGGLRRKGSLKKLKAPFKLLKGVLQARKILKK 87 (350) T ss_pred HHHHHH-HHHHHHHHCCCEEEEEECCC---HHHHCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999-99999997879899998783---68642344139948997688878887399999999999999999999996 Q ss_pred CCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 45419997148885310233345777899889871898532220 Q gi|255764489|r 82 KKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQ 125 (250) Q Consensus 82 ~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~ 125 (250) .+||+||+ .| -. .| +.+.+-|.++|||-+---| T Consensus 88 ~kPd~vi~---~G----GY--~s--~P~~laA~~~~iP~~ihEq 120 (350) T cd03785 88 FKPDVVVG---FG----GY--VS--GPVGLAAKLLGIPLVIHEQ 120 (350) T ss_pred CCCCEEEE---CC----CC--HH--HHHHHHHHHCCCCCEEECC T ss_conf 49999998---89----81--03--8999999972998556567 No 52 >TIGR01925 spIIAB anti-sigma F factor; InterPro: IPR010194 This entry describes the SpoIIAB anti-sigma F factor. Sigma F regulates spore development in B subtilis. SpoIIAB binds to sigma F, preventing formation of the transcription complex at the promoter . SpoIIAA (anti-anti-sigma F factor) binds to SpoIIAB to inhibit association with sigma F, however SpoIIAB can phosphorylate SpoIIAA, causing disassociation of the SpoIIAA/B complex . The SpoIIE phosphatase dephosphorylates SpoIIAA.; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0016989 sigma factor antagonist activity, 0006468 protein amino acid phosphorylation, 0030435 sporulation, 0045892 negative regulation of transcription DNA-dependent. Probab=54.77 E-value=7.9 Score=19.53 Aligned_cols=45 Identities=20% Similarity=0.494 Sum_probs=32.3 Q ss_pred ECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEE--CCCCCEEEEEE Q ss_conf 26898688569999999985489199981799856503202--24897058897 Q gi|255764489|r 6 TNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSL--TMSRNIACRTI 57 (250) Q Consensus 6 tNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ai--t~~~~~~~~~~ 57 (250) ..|.|+ ||.-|.+|.+. .+..-|+=+|||+|-.+ +|...+.+..- T Consensus 79 v~D~G~---GI~~lE~A~~P----LyTskPeLERSGMGFTvME~FMD~~~v~S~ 125 (137) T TIGR01925 79 VRDEGI---GIENLEEAREP----LYTSKPELERSGMGFTVMESFMDDVEVDSE 125 (137) T ss_pred EEECCC---CHHHHHHHCCC----CCCCCCCCCCCCCCEEEECCCCCCEEEEEC T ss_conf 864675---72337853266----457998722067860121112451268623 No 53 >PRK10816 DNA-binding transcriptional regulator PhoP; Provisional Probab=54.75 E-value=11 Score=18.67 Aligned_cols=80 Identities=11% Similarity=0.194 Sum_probs=47.8 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCC Q ss_conf 92688268986885699999999854891999817998565032022489705889711047632727999999987504 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMS 80 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~ 80 (250) ||||+.-||=--+ ..|...|+..|++|..+ .+ ..-|+..+. T Consensus 1 mkILivEdd~~~~---~~l~~~L~~~g~~V~~a-~~-----------------------------------~~~al~~~~ 41 (223) T PRK10816 1 MRVLVVEDNALLR---HHLKVQLQDAGHQVDAA-ED-----------------------------------AKEADYYLN 41 (223) T ss_pred CEEEEEECCHHHH---HHHHHHHHHCCCEEEEE-CC-----------------------------------HHHHHHHHH T ss_conf 9899995989999---99999999789999998-99-----------------------------------999999997 Q ss_pred CCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 645419997148885310233345777899889871898532220 Q gi|255764489|r 81 DKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQ 125 (250) Q Consensus 81 ~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~ 125 (250) ...|||||.-++...=-|.++. ..+-.....+|-|-+|. T Consensus 42 ~~~~dlvilD~~lp~~~G~~l~------~~ir~~~~~~piI~lta 80 (223) T PRK10816 42 EHLPDIAIVDLGLPDEDGLSLI------RRWRSNDVSLPILVLTA 80 (223) T ss_pred HCCCCEEEEECCCCCCCCCCCC------CCHHCCCCCCCEEEEEC T ss_conf 5799899997999898864001------20110489876899944 No 54 >PRK12615 galactose-6-phosphate isomerase subunit LacB; Reviewed Probab=54.41 E-value=15 Score=17.75 Aligned_cols=38 Identities=18% Similarity=0.277 Sum_probs=26.8 Q ss_pred CEEEEECCC-CCCCHHHHHHHHHHHHCCCCEEEECCCCCCC Q ss_conf 926882689-8688569999999985489199981799856 Q gi|255764489|r 1 MRILLTNDD-GIKSKGLITLENIARSISDDIWICAPEMDQS 40 (250) Q Consensus 1 m~ILitNDD-G~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S 40 (250) |||.|..|. |++-. ..|++.|++.|++|.=+-|..+.| T Consensus 1 MKI~IgsDHaG~~LK--~~i~~~L~~~G~eV~D~Gt~~~~s 39 (171) T PRK12615 1 MKIAIGCDHIVTNEK--MAVSDFLKSKGYDVIDCGTYDHTR 39 (171) T ss_pred CEEEEEECCHHHHHH--HHHHHHHHHCCCEEEECCCCCCCC T ss_conf 979999686159999--999999998899789779999887 No 55 >PRK05476 S-adenosyl-L-homocysteine hydrolase; Provisional Probab=54.41 E-value=18 Score=17.31 Aligned_cols=100 Identities=16% Similarity=0.198 Sum_probs=52.2 Q ss_pred HHHHHHHHHCCCCEEEEC--CCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCCCCCCCEEEECCCCC Q ss_conf 999999985489199981--799856503202248970588971104763272799999998750464541999714888 Q gi|255764489|r 17 ITLENIARSISDDIWICA--PEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMSDKKPDLILSGVNVG 94 (250) Q Consensus 17 ~~l~~~l~~~~~~v~vvA--P~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~~~~pDlViSGiN~G 94 (250) ..|.+.|++.|.+|.+++ |-+-|--.-.++- +-+..+|+..|--.+--.--+...++.+||+++- .| T Consensus 59 A~L~~tL~~~GAeV~~~~~NplSTQDdvaAaL~--------~~Gi~VfA~~g~t~eey~~~~~~~L~~~P~iiiD---DG 127 (427) T PRK05476 59 AVLIETLKALGAEVRWASCNPFSTQDHAAAALA--------AAGIPVFAWKGETLEEYWEAIERALDWGPNMILD---DG 127 (427) T ss_pred HHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHH--------HCCEEEEEECCCCHHHHHHHHHHHHCCCCCEEEE---CC T ss_conf 999999998499899954798764579999998--------6891699978989999999999975559987875---65 Q ss_pred CCCCHHH------HHHHHHHH------------HHHHH-HHCCCCCCCCCCC Q ss_conf 5310233------34577789------------98898-7189853222001 Q gi|255764489|r 95 TNTSNHV------AYSGTLAA------------AFEGS-LQGIRSFALSQAY 127 (250) Q Consensus 95 ~N~g~~v------~ySGTvgA------------A~ea~-~~GipsIAiS~~~ 127 (250) .-+-.-+ +.++-+|+ ||+.. -..+|.||+--.. T Consensus 128 ~Dl~~~lh~~~~~~~~~i~G~~EETTTGv~RL~am~~~g~L~~PviavNDa~ 179 (427) T PRK05476 128 GDLTLLVHTERPELLANIKGVTEETTTGVHRLYAMAKDGELKFPAINVNDSV 179 (427) T ss_pred CHHHHHHHHHHHHHHHHCEECCCCCCHHHHHHHHHHHCCCCCCCEEEECCHH T ss_conf 2289999986056675416303475263899999986697777779865534 No 56 >PRK10850 phosphohistidinoprotein-hexose phosphotransferase component of PTS system (Hpr); Provisional Probab=54.26 E-value=18 Score=17.37 Aligned_cols=77 Identities=16% Similarity=0.204 Sum_probs=51.3 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCC Q ss_conf 2688268986885699999999854891999817998565032022489705889711047632727999999987504 Q gi|255764489|r 2 RILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMS 80 (250) Q Consensus 2 ~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~ 80 (250) .+-|+|..|+++-=-..|++.++.+..+|++..- ++..-+.||----.|.+++-..=...++|.=++-+.-+|-.++ T Consensus 5 ~vtV~n~~GLHARPAa~~v~~a~~f~s~I~l~~~--~~~anakSi~~im~Lg~~~G~~i~i~a~G~De~~Al~~l~~ll 81 (85) T PRK10850 5 EVTITAPNGLHTRPAAQFVKEAKGFTSEITVTSN--GKSASAKSLFKLQTLGLTQGTVVTISAEGEDEQKAVEHLVKLM 81 (85) T ss_pred EEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEEC--CEEECHHHHHHHHHCCCCCCCEEEEEEECCCHHHHHHHHHHHH T ss_conf 9998289865352899999999628986999989--9998458499999749899899999997889999999999999 No 57 >smart00448 REC cheY-homologous receiver domain. CheY regulates the clockwise rotation of E. coli flagellar motors. This domain contains a phosphoacceptor site that is phosphorylated by histidine kinase homologues. Probab=54.12 E-value=18 Score=17.28 Aligned_cols=28 Identities=25% Similarity=0.278 Sum_probs=18.7 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEE Q ss_conf 9268826898688569999999985489199 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIW 31 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~ 31 (250) ||||+.-||-. -.+.+...++..|++|. T Consensus 1 ~kVLivdD~~~---~~~~l~~~l~~~g~~v~ 28 (55) T smart00448 1 MRILVVDDDPL---LRELLKALLEREGYEVD 28 (55) T ss_pred CEEEEEECCHH---HHHHHHHHHHHCCCEEE T ss_conf 98999937999---99999999997899999 No 58 >PRK10710 DNA-binding transcriptional regulator BaeR; Provisional Probab=53.46 E-value=19 Score=17.22 Aligned_cols=80 Identities=19% Similarity=0.196 Sum_probs=47.9 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCC Q ss_conf 92688268986885699999999854891999817998565032022489705889711047632727999999987504 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMS 80 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~ 80 (250) +||||.-||=--+. .|...|+..|++|..+. + ..-|+..+. T Consensus 11 ~kILlVEDd~~l~~---~l~~~L~~~g~~V~~a~------------------------------~------~~eal~~~~ 51 (240) T PRK10710 11 PRILIVEDEPKLGQ---LLIDYLRAASYAPTLIS------------------------------H------GDEVLPYVR 51 (240) T ss_pred CEEEEEECCHHHHH---HHHHHHHHCCCEEEEEC------------------------------C------HHHHHHHHH T ss_conf 87999929899999---99999998899999989------------------------------9------999999997 Q ss_pred CCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 6454199971488853102333457778998898718985322200 Q gi|255764489|r 81 DKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQA 126 (250) Q Consensus 81 ~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~ 126 (250) ...|||||..++...--|.+++ ..+- ...++|-|.+|.. T Consensus 52 ~~~~DliilDi~lp~~~Gl~l~------~~lr-~~~~~piI~lta~ 90 (240) T PRK10710 52 QTPPDLILLDLMLPGTDGLTLC------REIR-RFSDIPIVMVTAK 90 (240) T ss_pred HCCCCEEEECCCCCCCCCCCCH------HHHH-CCCCCCEEEECCC T ss_conf 3799899987999888776321------1221-1576468998167 No 59 >cd03818 GT1_ExpC_like This family is most closely related to the GT1 family of glycosyltransferases. ExpC in Rhizobium meliloti has been shown to be involved in the biosynthesis of galactoglucan (exopolysaccharide II). Probab=53.14 E-value=18 Score=17.39 Aligned_cols=35 Identities=11% Similarity=0.085 Sum_probs=24.8 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCC Q ss_conf 2688268986885699999999854891999817998 Q gi|255764489|r 2 RILLTNDDGIKSKGLITLENIARSISDDIWICAPEMD 38 (250) Q Consensus 2 ~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~ 38 (250) |||..+--+. --++-|+++|.+.||+|+|+.+... T Consensus 1 ~~~~~~~~~p--~q~r~LA~~La~rGHeV~Vit~~~~ 35 (396) T cd03818 1 RILFVHQNFP--GQFRHLAPALAAQGHEVVFLTEPNA 35 (396) T ss_pred CEEEECCCCC--HHHHHHHHHHHHCCCEEEEEECCCC T ss_conf 9899889988--1699999999978998999968999 No 60 >PRK00654 glgA glycogen synthase; Provisional Probab=52.46 E-value=20 Score=17.12 Aligned_cols=38 Identities=21% Similarity=0.385 Sum_probs=24.8 Q ss_pred CEEEEECCC-CC--CCHH----HHHHHHHHHHCCCCEEEECCCCC Q ss_conf 926882689-86--8856----99999999854891999817998 Q gi|255764489|r 1 MRILLTNDD-GI--KSKG----LITLENIARSISDDIWICAPEMD 38 (250) Q Consensus 1 m~ILitNDD-G~--~a~g----l~~l~~~l~~~~~~v~vvAP~~~ 38 (250) ||||.---. .. ..-| ...|.++|++.||+|.|+.|.-. T Consensus 1 MkIl~vs~E~~P~~KvGGLgdvv~~L~~aL~~~G~~V~Vi~P~Y~ 45 (476) T PRK00654 1 MKILFVASECAPLIKTGGLGDVVGALPKALAKLGHDVRVLLPGYP 45 (476) T ss_pred CEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCC T ss_conf 979999874387557574999999999999976995999947983 No 61 >COG0726 CDA1 Predicted xylanase/chitin deacetylase [Carbohydrate transport and metabolism] Probab=52.29 E-value=15 Score=17.91 Aligned_cols=36 Identities=14% Similarity=0.168 Sum_probs=23.9 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCC--CEEEECCCC Q ss_conf 268826898688569999999985489--199981799 Q gi|255764489|r 2 RILLTNDDGIKSKGLITLENIARSISD--DIWICAPEM 37 (250) Q Consensus 2 ~ILitNDDG~~a~gl~~l~~~l~~~~~--~v~vvAP~~ 37 (250) .|.||-|||+...+...+.+.|++.+- ..+|+.... T Consensus 66 ~v~lTFDDg~~~~~~~~il~iL~k~~i~ATfFv~g~~~ 103 (267) T COG0726 66 AVALTFDDGPLDGNTPRILPLLKKYGIKATFFVVGSWI 103 (267) T ss_pred EEEEEEECCCCCCCHHHHHHHHHHHCCCEEEEEECHHH T ss_conf 79999755766436599999999819976999733765 No 62 >cd03791 GT1_Glycogen_synthase_DULL1_like This family is most closely related to the GT1 family of glycosyltransferases. Glycogen synthase catalyzes the formation and elongation of the alpha-1,4-glucose backbone using ADP-glucose, the second and key step of glycogen biosynthesis. This family includes starch synthases of plants, such as DULL1 in Zea mays and glycogen synthases of various organisms. Probab=52.06 E-value=18 Score=17.29 Aligned_cols=24 Identities=13% Similarity=0.293 Sum_probs=20.1 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCC Q ss_conf 999999998548919998179985 Q gi|255764489|r 16 LITLENIARSISDDIWICAPEMDQ 39 (250) Q Consensus 16 l~~l~~~l~~~~~~v~vvAP~~~~ 39 (250) ...|.++|.+.||+|.|+.|.-.+ T Consensus 22 v~~L~~aLa~~Gh~V~Vi~P~y~~ 45 (476) T cd03791 22 VGALPKALAKLGHDVRVIMPKYGR 45 (476) T ss_pred HHHHHHHHHHCCCEEEEEECCCCC T ss_conf 999999999779969999579866 No 63 >pfam03033 Glyco_transf_28 Glycosyltransferase family 28 N-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). This N-terminal domain contains the acceptor binding site and likely membrane association site. This family also contains a large number of proteins that probably have quite distinct activities. Probab=51.99 E-value=20 Score=17.07 Aligned_cols=96 Identities=16% Similarity=0.187 Sum_probs=50.2 Q ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEE--CCCHHHH------HHHHHHHCCCCCCC Q ss_conf 569999999985489199981799856503202248970588971104763--2727999------99998750464541 Q gi|255764489|r 14 KGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAV--HGTPVDC------VVIALQKMSDKKPD 85 (250) Q Consensus 14 ~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v--~GtPaDc------v~~~l~~l~~~~pD 85 (250) |++ +|++.|++.||+|.+..+..... . + ....+....+....+-- -..+... +.-++..+...+|| T Consensus 14 Pal-ala~~L~~~g~~v~igt~~~~e~---~-v-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kp~ 87 (136) T pfam03033 14 PAV-ALAWALRRRGHEVRLGTPPGLEE---F-V-EEAGLPFVPIGGDGLRRKSLKNLKEPLEGGRALRQAKEILKEFKPD 87 (136) T ss_pred HHH-HHHHHHHHCCCEEEECCCCCHHH---H-H-HHCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 999-99999998599771215802888---8-7-5359818996279854675999999999999999999999856998 Q ss_pred EEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 99971488853102333457778998898718985322200 Q gi|255764489|r 86 LILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQA 126 (250) Q Consensus 86 lViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~ 126 (250) +|++ .|- |. ++-+.+.|.+.++|-+--.|+ T Consensus 88 ~vig-------~GG---y~-s~p~~~aa~~~~ip~~ihEqN 117 (136) T pfam03033 88 LVIG-------FGG---YV-AVPALIAAPLAGIPLIVHEQN 117 (136) T ss_pred EEEC-------CCC---CC-CHHHHHHHHHCCCCEEEECCC T ss_conf 8974-------388---54-228999999839988998798 No 64 >PRK10766 DNA-binding transcriptional regulator TorR; Provisional Probab=51.06 E-value=21 Score=16.98 Aligned_cols=79 Identities=14% Similarity=0.146 Sum_probs=50.4 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCC Q ss_conf 92688268986885699999999854891999817998565032022489705889711047632727999999987504 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMS 80 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~ 80 (250) +|||+.-||---+. .|...|+..|++|..+. + ..-|+..+. T Consensus 3 ~kILlVEDD~~l~~---~l~~~L~~~g~~V~~~~-~-----------------------------------~~~a~~~l~ 43 (224) T PRK10766 3 YHILVVEDEPVTRA---RLQGYFEQEGYRVSEAA-S-----------------------------------GAGMREIMQ 43 (224) T ss_pred CEEEEECCCHHHHH---HHHHHHHHCCCEEEEEC-C-----------------------------------HHHHHHHHH T ss_conf 71999919999999---99999998799999989-9-----------------------------------999999996 Q ss_pred CCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 645419997148885310233345777899889871898532220 Q gi|255764489|r 81 DKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQ 125 (250) Q Consensus 81 ~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~ 125 (250) ..+|||||..++...--|.++.- .+-. ...+|-|=+|- T Consensus 44 ~~~~DlvilDi~lp~~~G~el~~------~iR~-~~~~piI~lta 81 (224) T PRK10766 44 NQHVDLILLDINLPGEDGLMLTR------ELRS-RSTVGIILVTG 81 (224) T ss_pred HCCCCEEEECCCCCCCCCCCHHH------HHHH-CCCCCEEEECC T ss_conf 08999999889999887661376------7630-47855686335 No 65 >PRK06953 short chain dehydrogenase; Provisional Probab=50.04 E-value=21 Score=16.88 Aligned_cols=74 Identities=18% Similarity=0.230 Sum_probs=41.3 Q ss_pred CE-EEEECCCCCCCHHH-HHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHH Q ss_conf 92-68826898688569-99999998548919998179985650320224897058897110476327279999999875 Q gi|255764489|r 1 MR-ILLTNDDGIKSKGL-ITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQK 78 (250) Q Consensus 1 m~-ILitNDDG~~a~gl-~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~ 78 (250) || |||| |= +-|| +++++.+.+.|+.|+.++-+.+. ... ++..+.+.+.+|=+=.+.+.-.... T Consensus 1 MK~~LVT---Ga-s~GIG~a~a~~la~~G~~V~~~~R~~~~--l~~---------l~~~~~~~~~~Dv~d~~~v~~~~~~ 65 (222) T PRK06953 1 MKTVLIV---GA-SRGIGLEFVRQYRADGWRVIATARDAAG--LAA---------LRALGAEALALDVADPESIAGLGWK 65 (222) T ss_pred CCEEEEC---CC-CHHHHHHHHHHHHHCCCEEEEEECCHHH--HHH---------HHHCCCCEEEEECCCHHHHHHHHHH T ss_conf 9999994---75-7299999999999888999999688888--999---------8842151777405899999999986 Q ss_pred CCCCCCCEEEE Q ss_conf 04645419997 Q gi|255764489|r 79 MSDKKPDLILS 89 (250) Q Consensus 79 l~~~~pDlViS 89 (250) +...++|++|- T Consensus 66 ~~~~~ldili~ 76 (222) T PRK06953 66 LDGEALDAAVY 76 (222) T ss_pred HCCCCCCEEEE T ss_conf 23677678998 No 66 >cd03169 GATase1_PfpI_1 Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus. Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus. PfpI is an ATP-independent intracellular proteases which may hydrolyze small peptides to provide a nutritional source. Only Cys of the catalytic triad typical of GATase1 domains is conserved in this group. This Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. Probab=49.41 E-value=22 Score=16.82 Aligned_cols=37 Identities=14% Similarity=0.259 Sum_probs=30.6 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCC Q ss_conf 2688268986885699999999854891999817998 Q gi|255764489|r 2 RILLTNDDGIKSKGLITLENIARSISDDIWICAPEMD 38 (250) Q Consensus 2 ~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~ 38 (250) ||||-=-||++---+..-++.|++.|++|.+++|... T Consensus 1 kvlil~adg~Ee~E~~~P~~~l~~aG~eV~vaSp~~~ 37 (180) T cd03169 1 KILILTGDFVEDYEVMVPFQALQEVGHEVDVVAPGKK 37 (180) T ss_pred CEEEECCCCCHHHHHHHHHHHHHHCCCEEEEEECCCC T ss_conf 9899947883078999999999988998999956886 No 67 >PRK10336 DNA-binding transcriptional regulator QseB; Provisional Probab=49.38 E-value=15 Score=17.75 Aligned_cols=80 Identities=14% Similarity=0.140 Sum_probs=47.5 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCC Q ss_conf 92688268986885699999999854891999817998565032022489705889711047632727999999987504 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMS 80 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~ 80 (250) ||||+.-||=.-+. .|...|+..|++|..+ . +| .-|+..+. T Consensus 1 mkILlVEDd~~l~~---~l~~~L~~~g~~V~~a-~-----------------------------~g------~~a~~~~~ 41 (219) T PRK10336 1 MRILLIEDDMLIGD---GIKTGLSKMGFSVDWF-T-----------------------------QG------RQGKEALY 41 (219) T ss_pred CEEEEEECCHHHHH---HHHHHHHHCCCEEEEE-C-----------------------------CH------HHHHHHHH T ss_conf 98999938999999---9999999879999998-9-----------------------------99------99999986 Q ss_pred CCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 645419997148885310233345777899889871898532220 Q gi|255764489|r 81 DKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQ 125 (250) Q Consensus 81 ~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~ 125 (250) ..+|||||--||...--|.++. -.+-.....+|-|-+|. T Consensus 42 ~~~~DlvilDi~lP~~dG~~l~------~~iR~~~~~~PII~lta 80 (219) T PRK10336 42 SAPYDAVILDLTLPGMDGRDIL------REWREKGQREPVLILTA 80 (219) T ss_pred CCCCCEEEEECCCCCCCCCCCC------CCHHHHCCCCCEEEEEC T ss_conf 2896999997999999856310------10465278887899806 No 68 >PRK10643 DNA-binding transcriptional regulator BasR; Provisional Probab=47.63 E-value=17 Score=17.50 Aligned_cols=80 Identities=16% Similarity=0.123 Sum_probs=49.6 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCC Q ss_conf 92688268986885699999999854891999817998565032022489705889711047632727999999987504 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMS 80 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~ 80 (250) |||||.-||=.-+. .|++.|+..|++|..+. + ..-|+..+. T Consensus 1 mkILlVEDd~~~~~---~l~~~L~~~g~~V~~a~------------------------------~------~~ea~~~~~ 41 (222) T PRK10643 1 MKILIVEDDTLLLQ---GLILAAQTEGYACDGVS------------------------------T------AREAEQSLE 41 (222) T ss_pred CEEEEEECCHHHHH---HHHHHHHHCCCEEEEEC------------------------------C------HHHHHHHHH T ss_conf 97999928899999---99999997899999989------------------------------9------999999997 Q ss_pred CCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 645419997148885310233345777899889871898532220 Q gi|255764489|r 81 DKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQ 125 (250) Q Consensus 81 ~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~ 125 (250) ..+|||||--++...--|.+++ ..+-.....+|-|.+|. T Consensus 42 ~~~~DlvilDi~lp~~~G~~l~------~~ir~~~~~~pII~lt~ 80 (222) T PRK10643 42 SGHYSLVVLDLGLPDEDGLHFL------ARIRQKKYTLPVLILTA 80 (222) T ss_pred HCCCCEEEEECCCCCCCCHHHH------HHHHHCCCCCCEEEEEC T ss_conf 4899899996888998622689------99983489988999821 No 69 >cd03822 GT1_ecORF704_like This family is most closely related to the GT1 family of glycosyltransferases. ORF704 in E. coli has been shown to be involved in the biosynthesis of O-specific mannose homopolysaccharides. Probab=46.78 E-value=24 Score=16.57 Aligned_cols=95 Identities=15% Similarity=0.121 Sum_probs=46.4 Q ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCCCCCCCEEEECCCC Q ss_conf 56999999998548919998179985650320224897058897110476327279999999875046454199971488 Q gi|255764489|r 14 KGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMSDKKPDLILSGVNV 93 (250) Q Consensus 14 ~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~~~~pDlViSGiN~ 93 (250) .-+..|++.|.+.||+|.|+.+..+..... ........ .+.....|..-..+ ...+...+||+|..--+. T Consensus 17 ~~~~~La~~L~~~Gh~V~v~~~~~~~~~~~----~~~~~~~~-----~~~~~~~~~~~~~~-~~~i~~~~~Dvvh~~~~~ 86 (366) T cd03822 17 TFTTDLVNALSARGPDVLVVSVAALYPSLL----YGGEQEVV-----RVIVLDNPLDYRRA-ARAIRLSGPDVVVIQHEY 86 (366) T ss_pred HHHHHHHHHHHHCCCEEEEEECCCCCCCCC----CCCCCCEE-----EECCCCCCHHHHHH-HHHHHHCCCCEEEEECCC T ss_conf 999999999986799899995888887544----46776416-----76036673259999-999985399999993653 Q ss_pred CCCCCHHHHHHHHHH--HHHHHHHHCCCCCCCCC Q ss_conf 853102333457778--99889871898532220 Q gi|255764489|r 94 GTNTSNHVAYSGTLA--AAFEGSLQGIRSFALSQ 125 (250) Q Consensus 94 G~N~g~~v~ySGTvg--AA~ea~~~GipsIAiS~ 125 (250) | ++++..+ ++.-....|+|-++.-. T Consensus 87 ~-------~~~~~~~~~~~~~~~~~~~p~v~t~H 113 (366) T cd03822 87 G-------IFGGEAGLYLLLLLRGLGIPVVVTLH 113 (366) T ss_pred C-------CCHHHHHHHHHHHHHHCCCCEEEEEC T ss_conf 3-------21066899999999855998999977 No 70 >TIGR02177 PorB_KorB 2-oxoacid:acceptor oxidoreductase, beta subunit, pyruvate/2-ketoisovalerate family; InterPro: IPR011896 A number of anaerobic and microaerophilic species lack pyruvate dehydrogenase and have instead a four subunit, oxygen-sensitive pyruvate oxidoreductase, with either ferredoxins or flavodoxins used as the acceptor. Several related four-subunit enzymes may exist in the same species. This entry describes the beta subunits, representing mostly pyruvate and 2-ketoisovalerate specific enzymes.. Probab=46.71 E-value=19 Score=17.15 Aligned_cols=34 Identities=18% Similarity=0.173 Sum_probs=26.3 Q ss_pred CHHHHHCCC-EEEECCCCCCC--CHHHHHH-HHHHHCC Q ss_conf 078996898-89902153564--8899999-9987459 Q gi|255764489|r 217 DAFAIQHNM-ISVTPITTDLT--DYNSQQY-ISLSLET 250 (250) Q Consensus 217 D~~Al~~Gy-ISVTPL~~DlT--~~~~l~~-L~~~l~~ 250 (250) -...+..+| +++-|..-+.+ |-.-+++ |++.||. T Consensus 265 rL~~~~~~~s~~~pP~~~~~~~eDG~P~~~P~~~~l~~ 302 (302) T TIGR02177 265 RLEKILPRYSIEAPPAEQEIKSEDGKPIEKPLKELLEE 302 (302) T ss_pred HHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCC T ss_conf 99974688675467543367788768414568887339 No 71 >pfam02951 GSH-S_N Prokaryotic glutathione synthetase, N-terminal domain. Probab=46.69 E-value=23 Score=16.68 Aligned_cols=38 Identities=16% Similarity=0.211 Sum_probs=25.0 Q ss_pred CEEEEECC--CCCCCHHH--HHHHHHHHHCCCCEEEECCCCC Q ss_conf 92688268--98688569--9999999854891999817998 Q gi|255764489|r 1 MRILLTND--DGIKSKGL--ITLENIARSISDDIWICAPEMD 38 (250) Q Consensus 1 m~ILitND--DG~~a~gl--~~l~~~l~~~~~~v~vvAP~~~ 38 (250) |||+.-=| +.++-..= -+|.+++++.||+||++-|..= T Consensus 1 Mki~fimDpie~l~~~kDTT~~lm~eaq~rg~~v~~~~~~dL 42 (119) T pfam02951 1 MKLAFVMDPIESINIKKDSTFAMMLEAQRRGHELFYYEPGDL 42 (119) T ss_pred CEEEEEECCHHHCCCCCCCHHHHHHHHHHCCCEEEEECCCCE T ss_conf 969999579898787887199999999987997999800328 No 72 >cd01714 ETF_beta The electron transfer flavoprotein (ETF) serves as a specific electron acceptor for various mitochondrial dehydrogenases. ETF transfers electrons to the main respiratory chain via ETF-ubiquinone oxidoreductase. ETF is an heterodimer that consists of an alpha and a beta subunit which binds one molecule of FAD per dimer . A similar system also exists in some bacteria. The homologous pair of proteins (FixA/FixB) are essential for nitrogen fixation. The beta subunit protein is distantly related to and forms a heterodimer with the alpha subunit. Probab=46.51 E-value=24 Score=16.55 Aligned_cols=102 Identities=19% Similarity=0.279 Sum_probs=54.5 Q ss_pred HHHHHHHH---HHCCCCEEEE--CCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHH-CCCCCCCEEEE Q ss_conf 99999999---8548919998--179985650320224897058897110476327279999999875-04645419997 Q gi|255764489|r 16 LITLENIA---RSISDDIWIC--APEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQK-MSDKKPDLILS 89 (250) Q Consensus 16 l~~l~~~l---~~~~~~v~vv--AP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~-l~~~~pDlViS 89 (250) ..+|..++ .+.|.+|+++ .|...+...-.++..+-. +...+.++.++ |+-+-.+--.|.. +....||||+. T Consensus 38 ~~AlE~Alrlke~~g~~VtvlsvGp~~a~~~Lr~alAmGaD-~ai~i~d~~~~--~~D~~~tA~~La~~i~~~~~DLIl~ 114 (202) T cd01714 38 EYAVEEALRLKEKYGGEVTVVSMGPPQAEEALREALAMGAD-RAILVSDRAFA--GADTLATAKALAAAIKKIGVDLILT 114 (202) T ss_pred HHHHHHHHHHHHCCCCEEEEEEECCHHHHHHHHHHHHHCCC-EEEEECCCCCC--CCCHHHHHHHHHHHHHHCCCCEEEE T ss_conf 89999999986603988999993747889999999970898-35998065556--6698999999999998759988999 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 14888531023334577789988987189853222001 Q gi|255764489|r 90 GVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQAY 127 (250) Q Consensus 90 GiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~~ 127 (250) |--.-. .++|-||+.. |.++|+|.+..-... T Consensus 115 G~~s~D------~~~g~v~~~l-A~~Lg~P~vt~v~~i 145 (202) T cd01714 115 GKQSID------GDTGQVGPLL-AELLGWPQITYVSKI 145 (202) T ss_pred EEECCC------CCCCCHHHHH-HHHHCCCCCCEEEEE T ss_conf 545457------9977799999-999489722488899 No 73 >PRK10446 ribosomal protein S6 modification protein; Provisional Probab=44.41 E-value=26 Score=16.35 Aligned_cols=36 Identities=25% Similarity=0.200 Sum_probs=26.0 Q ss_pred CEE-EEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCC Q ss_conf 926-88268986885699999999854891999817998 Q gi|255764489|r 1 MRI-LLTNDDGIKSKGLITLENIARSISDDIWICAPEMD 38 (250) Q Consensus 1 m~I-LitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~ 38 (250) ||| +|+.+-...|. +.|.+++++.||+|.++-|..- T Consensus 1 mki~iLs~~~~~yst--~RL~eaa~~rGh~v~vidp~~~ 37 (300) T PRK10446 1 MKIAILSRDGTLYSC--KRLREAAIQRGHLVEILDPLSC 37 (300) T ss_pred CEEEEEECCCCCCHH--HHHHHHHHHCCCEEEEEEHHHC T ss_conf 979999369976079--9999999987995999614891 No 74 >pfam07075 DUF1343 Protein of unknown function (DUF1343). This family consists of several hypothetical bacterial proteins of around 400 residues in length. The function of this family is unknown. Probab=44.04 E-value=27 Score=16.31 Aligned_cols=109 Identities=17% Similarity=0.180 Sum_probs=63.6 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHC-C-CCEEEECCCCCCCCEEEE-ECCCCCEEEEEEEEEEEEECC---CHHHHHHHHH Q ss_conf 688268986885699999999854-8-919998179985650320-224897058897110476327---2799999998 Q gi|255764489|r 3 ILLTNDDGIKSKGLITLENIARSI-S-DDIWICAPEMDQSCLANS-LTMSRNIACRTISKKRFAVHG---TPVDCVVIAL 76 (250) Q Consensus 3 ILitNDDG~~a~gl~~l~~~l~~~-~-~~v~vvAP~~~~S~~~~a-it~~~~~~~~~~~~~~~~v~G---tPaDcv~~~l 76 (250) -||||--|+.+.|.+. .+.|.+. + .=+.+.+|++.--|...+ ......+ -.+.+-..|++-| .|.+-..-+ T Consensus 2 gLitN~t~v~~~~~~~-~d~L~~~~~v~l~~lF~PEHG~~G~~~ag~~v~~~~-D~~tglpv~SLYg~~~~Pt~~~L~~- 78 (359) T pfam07075 2 GLITNQTGVDRDLEHL-VDLLLAAPGVNLKALFGPEHGFRGDAQAGEKVGDYV-DPKTGLPVYSLYGKTRKPTPEMLKG- 78 (359) T ss_pred EEEECHHHCCCCCCCH-HHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCCCCC-CCCCCCEEEECCCCCCCCCHHHHHC- T ss_conf 7993602338998789-999966569868998179878740421498567876-8888990897989999999999836- Q ss_pred HHCCCCCCCEEEECCCCCCCCCH-HHHHHHHHHHHHHHHH-HCCCCCCCC Q ss_conf 75046454199971488853102-3334577789988987-189853222 Q gi|255764489|r 77 QKMSDKKPDLILSGVNVGTNTSN-HVAYSGTLAAAFEGSL-QGIRSFALS 124 (250) Q Consensus 77 ~~l~~~~pDlViSGiN~G~N~g~-~v~ySGTvgAA~ea~~-~GipsIAiS 124 (250) .|++|=-| .-+|. --+|=-|++-+|||+- .|+|=|-+= T Consensus 79 -------iDvlv~Di---QDvG~R~YTyi~Tm~~~meaaa~~~~~~iVLD 118 (359) T pfam07075 79 -------VDVLVFDI---QDVGVRFYTYISTLAYAMEAAAEAGKEFIVLD 118 (359) T ss_pred -------CCEEEEEE---ECCCEEEEEHHHHHHHHHHHHHHCCCCEEEEC T ss_conf -------99999974---22761565459999999999997499589947 No 75 >KOG3619 consensus Probab=43.71 E-value=6.3 Score=20.14 Aligned_cols=74 Identities=23% Similarity=0.381 Sum_probs=48.2 Q ss_pred CCEEEEEEEEEEEEECCCH-------HHHHHHHHHHCC---------CCCCCEEEECCCCCCCC----C-----HHHHH- Q ss_conf 9705889711047632727-------999999987504---------64541999714888531----0-----23334- Q gi|255764489|r 50 RNIACRTISKKRFAVHGTP-------VDCVVIALQKMS---------DKKPDLILSGVNVGTNT----S-----NHVAY- 103 (250) Q Consensus 50 ~~~~~~~~~~~~~~v~GtP-------aDcv~~~l~~l~---------~~~pDlViSGiN~G~N~----g-----~~v~y- 103 (250) .-+|++..++-.|.|+|-| .-||.|||+-+. +-..|+ =.||-.|.=+ | .||+. T Consensus 154 ~clRiKiLGdcyyCvsglp~~~~dHA~~~v~mgl~Mi~aI~~vr~at~~dvnm-rvGihsG~Vl~GvlG~~kwq~DVws~ 232 (867) T KOG3619 154 HCLRIKILGDCYYCVSGLPEARPDHAVCCVEMGLDMIKAIKQVREATGVDVNM-RVGIHSGSVLCGVLGLRKWQYDVWSN 232 (867) T ss_pred CEEEEEEECCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCE-EEEEECCCEEECCCCCCEEEEEECCC T ss_conf 60765451331588458877771589999998999999999999984788735-98884276534212133035663132 Q ss_pred HHHHHHHHHHHHHCCCC-CCCCCC Q ss_conf 57778998898718985-322200 Q gi|255764489|r 104 SGTLAAAFEGSLQGIRS-FALSQA 126 (250) Q Consensus 104 SGTvgAA~ea~~~Gips-IAiS~~ 126 (250) -.|++.-||+. |+|+ +.+|.. T Consensus 233 dv~lAn~mEs~--G~pgrVhis~~ 254 (867) T KOG3619 233 DVTLANHMEAG--GVPGRVHISKA 254 (867) T ss_pred HHHHHHHHHHC--CCCCEEEECHH T ss_conf 52565543425--87851684263 No 76 >COG2065 PyrR Pyrimidine operon attenuation protein/uracil phosphoribosyltransferase [Nucleotide transport and metabolism] Probab=42.94 E-value=12 Score=18.45 Aligned_cols=28 Identities=29% Similarity=0.470 Sum_probs=20.4 Q ss_pred CHHHHHHH-HHHHHHHHHHH-CCCCCCCCCC Q ss_conf 02333457-77899889871-8985322200 Q gi|255764489|r 98 SNHVAYSG-TLAAAFEGSLQ-GIRSFALSQA 126 (250) Q Consensus 98 g~~v~ySG-TvgAA~ea~~~-GipsIAiS~~ 126 (250) =+||+|+| ||-||+.|.+. |-| -++.++ T Consensus 102 VDDVLytGRTIRAAldal~d~GRP-a~I~La 131 (179) T COG2065 102 VDDVLYTGRTIRAALDALVDYGRP-AKIQLA 131 (179) T ss_pred EEEECCCCCCHHHHHHHHHHCCCC-CEEEEE T ss_conf 820111472499999999864985-358999 No 77 >PRK05884 short chain dehydrogenase; Provisional Probab=42.94 E-value=28 Score=16.21 Aligned_cols=74 Identities=15% Similarity=0.158 Sum_probs=40.1 Q ss_pred CEEEEECCCCCCCHHH-HHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHC Q ss_conf 9268826898688569-999999985489199981799856503202248970588971104763272799999998750 Q gi|255764489|r 1 MRILLTNDDGIKSKGL-ITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKM 79 (250) Q Consensus 1 m~ILitNDDG~~a~gl-~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l 79 (250) ||||||=- +.|| ++.++.+.+.|++|.++....+.=- ...+ +++.....+|=|=.+.+.-... . T Consensus 1 ~~VlVTGg----s~GIG~aiA~~la~~Ga~V~i~~r~~~~l~-----~~~~-----el~~~~~~~d~~d~~~~~~~~~-~ 65 (223) T PRK05884 1 MEVLVTGG----DTDLGRTIAEGFRNDGHKVTLVGARRDDLE-----VAAK-----ELDVDAIVCDNTDPASLEEARG-L 65 (223) T ss_pred CEEEEECC----CCHHHHHHHHHHHHCCCEEEEEECCHHHHH-----HHHH-----HCCCCEEEEECCCHHHHHHHHH-H T ss_conf 93999878----879999999999987999999959878999-----9985-----3487689985278899999999-9 Q ss_pred CCCCCCEEEE Q ss_conf 4645419997 Q gi|255764489|r 80 SDKKPDLILS 89 (250) Q Consensus 80 ~~~~pDlViS 89 (250) +.+..|.++. T Consensus 66 ~~~~~d~lvn 75 (223) T PRK05884 66 FPHHLDTIVN 75 (223) T ss_pred HHHHHCCEEE T ss_conf 9986194788 No 78 >TIGR01283 nifE nitrogenase MoFe cofactor biosynthesis protein NifE; InterPro: IPR005973 The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene. Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes. It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I. ; GO: 0005515 protein binding, 0006461 protein complex assembly, 0009399 nitrogen fixation. Probab=42.92 E-value=9.7 Score=19.00 Aligned_cols=24 Identities=29% Similarity=0.227 Sum_probs=19.8 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 3457778998898718985322200 Q gi|255764489|r 102 AYSGTLAAAFEGSLQGIRSFALSQA 126 (250) Q Consensus 102 ~ySGTvgAA~ea~~~GipsIAiS~~ 126 (250) =.+.+++..||-.. |||-|=.|.. T Consensus 265 ks~~nlA~~m~~~Y-GIPyf~~SFy 288 (470) T TIGR01283 265 KSMINLARKMEEKY-GIPYFEVSFY 288 (470) T ss_pred HHHHHHHHHHHHHC-CCCEEEECCC T ss_conf 36788999999646-8975872141 No 79 >PRK13668 hypothetical protein; Provisional Probab=42.57 E-value=28 Score=16.17 Aligned_cols=12 Identities=8% Similarity=0.423 Sum_probs=6.2 Q ss_pred HCCCEEEECCCC Q ss_conf 689889902153 Q gi|255764489|r 222 QHNMISVTPITT 233 (250) Q Consensus 222 ~~GyISVTPL~~ 233 (250) .+|.|-||.|.+ T Consensus 234 a~G~VPITaLsF 245 (266) T PRK13668 234 AKGLVPITSLSF 245 (266) T ss_pred CCCCCCEEEEEE T ss_conf 489775055457 No 80 >PRK06769 hypothetical protein; Validated Probab=41.80 E-value=24 Score=16.53 Aligned_cols=31 Identities=10% Similarity=0.025 Sum_probs=21.0 Q ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEC Q ss_conf 5699999999854891999817998565032022 Q gi|255764489|r 14 KGLITLENIARSISDDIWICAPEMDQSCLANSLT 47 (250) Q Consensus 14 ~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait 47 (250) ||-....+.|++.|..+.|| +||||.|+.+- T Consensus 31 ~~~~~ai~~L~~~g~~i~vv---TNQsgI~rG~~ 61 (175) T PRK06769 31 PFTKASLQKLKAKNIKIFSF---TNQPGIADGIA 61 (175) T ss_pred CCHHHHHHHHHHCCCEEEEE---ECCCHHCCCCC T ss_conf 88899999999869959999---68821136877 No 81 >TIGR01862 N2-ase-Ialpha nitrogenase component I, alpha chain; InterPro: IPR010143 This entry represents all three varieties (Fe-Fe, Mo-Fe and V-Fe) of the component I alpha chain of nitrogenase. Nitrogenase (1.18.6.1 from EC) is the enzyme system responsible for biological nitrogen fixation. Nitrogenase is an oligomeric complex which consists of two components: component 2 is an homodimer of an iron-sulphur protein, while component 1 which contains the active site for the reduction of nitrogen to ammonia exists in three different forms: the molybdenum-iron containing protein (MoFe) is a hetero-tetramer consisting of two pairs of alpha (nifD) and beta (nifK) subunits; the vanadium-iron containing protein (VFe) is a hexamer of two pairs each of alpha (vnfD), beta (vnfK), and delta (vnfG) subunits; the third form seems to only contain iron and is a hexamer composed of alpha (anfD), beta (anfK), and delta (anfG) subunits. The alpha and beta chains of the three types of component 1 are evolutionary related and they are also related to proteins nifE and nifN, which are most probably involved in the iron-molybdenum cofactor biosynthesis .; GO: 0016163 nitrogenase activity, 0051536 iron-sulfur cluster binding, 0009399 nitrogen fixation. Probab=41.52 E-value=13 Score=18.31 Aligned_cols=20 Identities=20% Similarity=0.341 Sum_probs=14.2 Q ss_pred HHHHHHHHHHHHHCCCCCCCC Q ss_conf 577789988987189853222 Q gi|255764489|r 104 SGTLAAAFEGSLQGIRSFALS 124 (250) Q Consensus 104 SGTvgAA~ea~~~GipsIAiS 124 (250) ++=++--||-.. |||=|-+- T Consensus 261 ~nYIa~~le~rY-GIP~~~~d 280 (510) T TIGR01862 261 ANYIANELEERY-GIPWLKID 280 (510) T ss_pred HHHHHHHHHHCC-CCCEEEEC T ss_conf 579999855307-87537754 No 82 >cd00367 PTS-HPr_like Histidine-containing phosphocarrier protein (HPr)-like proteins. HPr is a central component of the bacterial phosphoenolpyruvate sugar phosphotransferase system (PTS). The PTS catalyses the phosphorylation of sugar substrates during their translocation across the cell membrane. The phosphoryl group from phosphoenolpyruvate is transferred to HPr by enzyme I (EI). Phospho-HPr then transfers the phosphoryl group to one of several sugar-specific phosphoprotein intermediates. The conserved histidine in the N-terminus of HPr serves as an acceptor for the phosphoryl group of EI. In addition to the phosphotransferase proteins HPr and E1, this family also includes the closely related Carbon Catabolite Repressor (CCR) proteins which use the same phosphorylation mechanism and interact with transcriptional regulators to control expression of genes coding for utilization of less favored carbon sources. Probab=40.17 E-value=31 Score=15.95 Aligned_cols=75 Identities=16% Similarity=0.216 Sum_probs=47.8 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHC Q ss_conf 68826898688569999999985489199981799856503202248970588971104763272799999998750 Q gi|255764489|r 3 ILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKM 79 (250) Q Consensus 3 ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l 79 (250) +-|.|.-|+++---..|++.++.+..+|++..- +++.-+.|+-----|.+++-+.=...++|.=++-+.-+|-.+ T Consensus 2 v~I~n~~GLHARpAa~lv~~a~~f~s~i~i~~~--~~~~~akSil~ll~L~~~~G~~i~i~~~G~De~~Al~~l~~l 76 (77) T cd00367 2 VTITNPLGLHARPAALLVQLASKFKSDITLRKG--GRKANAKSILGLMSLGAKQGDEITLSAEGEDAEEALEALAEL 76 (77) T ss_pred EEEECCCCCCHHHHHHHHHHHHHCCCEEEEEEC--CEEEECCCHHHHHHCCCCCCCEEEEEEECCCHHHHHHHHHHH T ss_conf 899179985574999999999867985999989--999873158999840779999999999786899999999975 No 83 >COG1925 FruB Phosphotransferase system, HPr-related proteins [Carbohydrate transport and metabolism] Probab=40.02 E-value=31 Score=15.93 Aligned_cols=78 Identities=13% Similarity=0.236 Sum_probs=51.1 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCCC Q ss_conf 26882689868856999999998548919998179985650320224897058897110476327279999999875046 Q gi|255764489|r 2 RILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMSD 81 (250) Q Consensus 2 ~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~~ 81 (250) .+.|.|..|+++-=-..|++.++.+..++++.-. .+++-+.||-----|.+.+-..=...++|.=+.-..-+|..++. T Consensus 5 ~~~i~n~~GLHARPAa~lv~~a~~f~s~i~l~~~--g~~v~akSim~lm~Lg~~~G~~i~i~a~G~de~~Al~aL~~li~ 82 (88) T COG1925 5 TVTIKNKNGLHARPAAKLVKLASKFDSEITLTNN--GKEANAKSIMGLMALGAKKGDEIELSAEGEDEEEALEALSELIE 82 (88) T ss_pred EEEEECCCCCCHHHHHHHHHHHHCCCCEEEEEEC--CEEECHHHHHHHHHHCCCCCCEEEEEEECCCHHHHHHHHHHHHH T ss_conf 9999789865343799999998407754899869--98705385999998287789989999867069999999999998 No 84 >PRK05205 bifunctional pyrimidine regulatory protein PyrR uracil phosphoribosyltransferase; Provisional Probab=39.73 E-value=17 Score=17.58 Aligned_cols=22 Identities=32% Similarity=0.584 Sum_probs=15.7 Q ss_pred CHHHHHHH-HHHHHHHHHHH-CCC Q ss_conf 02333457-77899889871-898 Q gi|255764489|r 98 SNHVAYSG-TLAAAFEGSLQ-GIR 119 (250) Q Consensus 98 g~~v~ySG-TvgAA~ea~~~-Gip 119 (250) =+||+||| |+-||+++.+. |-| T Consensus 101 VDDVL~TGrTlraal~~l~~~grP 124 (176) T PRK05205 101 VDDVLYTGRTIRAALDALFDYGRP 124 (176) T ss_pred EECCCCCCHHHHHHHHHHHHCCCC T ss_conf 831022368999999988636988 No 85 >pfam02057 Glyco_hydro_59 Glycosyl hydrolase family 59. Probab=37.01 E-value=30 Score=16.02 Aligned_cols=48 Identities=10% Similarity=0.120 Sum_probs=38.0 Q ss_pred EECCCCCCCHHHHHHHHHHHHCCC-CEEEECCCCCCCCEEEEECCCCCE Q ss_conf 826898688569999999985489-199981799856503202248970 Q gi|255764489|r 5 LTNDDGIKSKGLITLENIARSISD-DIWICAPEMDQSCLANSLTMSRNI 52 (250) Q Consensus 5 itNDDG~~a~gl~~l~~~l~~~~~-~v~vvAP~~~~S~~~~ait~~~~~ 52 (250) +-|..++++.-++.|+++|...|. +|-|||-+.....++.++-....| T Consensus 179 ~wNEr~~~~~~ik~lR~~ld~~g~~~vKIVa~D~~w~~i~~~~~~D~~L 227 (669) T pfam02057 179 IWNERSFDANYIKELRKMLNYQGLQRVKIIASDNLWEPISASMLLDQEL 227 (669) T ss_pred CCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHCCHHH T ss_conf 1004688757999999998664436448999578876653445439988 No 86 >pfam02684 LpxB Lipid-A-disaccharide synthetase. This is a family of lipid-A-disaccharide synthetases, EC:2.4.2.128. These enzymes catalyse the reaction: UDP-2,3-bis(3-hydroxytetradecanoyl) glucosamine + 2,3-bis(3-hydroxytetradecanoyl)-beta-D-glucosaminyl 1-phosphate <= UDP + 2,3-bis(3-hydroxytetradecanoyl)-D-glucosaminyl-1,6 -beta-D-2,3-bis(3-hydroxytetradecanoyl)-beta-D-glucosaminyl 1-phosphate. These enzymes catalyse the fist disaccharide step in the synthesis of lipid-A-disaccharide. Probab=37.01 E-value=16 Score=17.74 Aligned_cols=35 Identities=14% Similarity=0.034 Sum_probs=22.0 Q ss_pred CCCCHHHHH-HHHHHHHCCC---CEEEECCCCCCCCEEE Q ss_conf 868856999-9999985489---1999817998565032 Q gi|255764489|r 10 GIKSKGLIT-LENIARSISD---DIWICAPEMDQSCLAN 44 (250) Q Consensus 10 G~~a~gl~~-l~~~l~~~~~---~v~vvAP~~~~S~~~~ 44 (250) +|++||+.- |++.+++.|. -|+.|+|+-=--..++ T Consensus 88 lID~pgFNlrlak~lkk~g~~ipvi~yV~PqvWAWr~~R 126 (373) T pfam02684 88 LIDAPDFNLRLAKKLRKLGPKLKIIHYVSPSVWAWRPKR 126 (373) T ss_pred EECCCCHHHHHHHHHHHCCCCCCEEEEECCCEEEECHHH T ss_conf 971761559999999971999878999688422127158 No 87 >COG2110 Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 [General function prediction only] Probab=36.78 E-value=35 Score=15.62 Aligned_cols=69 Identities=19% Similarity=0.186 Sum_probs=47.6 Q ss_pred CCEEEECCCC----CCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHC Q ss_conf 4199971488----85310233345777899889871898532220013567661478888849999999867 Q gi|255764489|r 84 PDLILSGVNV----GTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQAYTYENMIPWEVSETHAPRVLRQLLKT 152 (250) Q Consensus 84 pDlViSGiN~----G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~~~~~~~~~~~~a~~~~~~ii~~l~~~ 152 (250) .-+||-.+-. |.+...+.+++.+.++...+.-.|+.||||=--...--..+|+.+++...+-++.++.. T Consensus 77 a~~ViH~vgp~~~~g~~~~~e~l~~a~~~~l~~a~~~g~~SiAfPaistGv~G~p~~~aa~i~~~~v~~~~~~ 149 (179) T COG2110 77 AKYVIHTVGPSWRGGSKDEAELLAAAYRAALRLAKEAGVRSVAFPAISTGVYGFPLEEAARIAVEAVKDFLPE 149 (179) T ss_pred CCEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCC T ss_conf 7879945877645888048999999999999999985981660165338626789899999999999874323 No 88 >TIGR01317 GOGAT_sm_gam glutamate synthases, NADH/NADPH, small subunit; InterPro: IPR006005 One pathway for the assimilation of ammonia and glutamate biosynthesis involves glutamate synthase (1.4.1.13 from EC) which transfers the amide group of glutamine to 2-oxoglutarate to yield two molecules of glutamate.2 L-glutamate + NADP^+ = L-glutamine + 2-oxoglutarate + NADPH + H^+. This describes a family in several archaeal and deeply branched bacterial lineages of a homotetrameric form of the NADPH-dependent or NADH-dependent glutamate synthase (1.4.1.13 from EC and 1.4.1.14 from EC respectively) small subunit. There is no corresponding large subunit. ; GO: 0016639 oxidoreductase activity acting on the CH-NH2 group of donors NAD or NADP as acceptor, 0006537 glutamate biosynthetic process. Probab=36.35 E-value=30 Score=15.99 Aligned_cols=18 Identities=11% Similarity=0.215 Sum_probs=11.7 Q ss_pred HHHHHHHHHHHCCCCEEEE Q ss_conf 6999999998548919998 Q gi|255764489|r 15 GLITLENIARSISDDIWIC 33 (250) Q Consensus 15 gl~~l~~~l~~~~~~v~vv 33 (250) ||.+ ++.|..+||.|+|. T Consensus 163 GLAa-A~qLnrAGH~VTVf 180 (517) T TIGR01317 163 GLAA-ADQLNRAGHTVTVF 180 (517) T ss_pred HHHH-HHHHHHCCCCEEEE T ss_conf 9999-99985358838997 No 89 >TIGR02962 hdxy_isourate hydroxyisourate hydrolase; InterPro: IPR014306 Members of this entry, hydroxyisourate hydrolase, represent a distinct clade of transthyretin-related proteins. Bacterial members typically are encoded next to ureidoglycolate hydrolase and often near either xanthine dehydrogenase or xanthine/uracil permease genes.; GO: 0006810 transport. Probab=36.29 E-value=12 Score=18.39 Aligned_cols=11 Identities=9% Similarity=0.093 Sum_probs=5.2 Q ss_pred HCCCCCCCCCC Q ss_conf 18985322200 Q gi|255764489|r 116 QGIRSFALSQA 126 (250) Q Consensus 116 ~GipsIAiS~~ 126 (250) +|.||-++-.. T Consensus 12 ~G~PA~~~ki~ 22 (117) T TIGR02962 12 HGRPAANVKIE 22 (117) T ss_pred CCCCCCCCEEE T ss_conf 68887775047 No 90 >COG1259 Uncharacterized conserved protein [Function unknown] Probab=36.07 E-value=35 Score=15.55 Aligned_cols=31 Identities=16% Similarity=0.304 Sum_probs=15.3 Q ss_pred CCCCCEEEEEEEEEEEEECCCHHHHHHHHHHH Q ss_conf 24897058897110476327279999999875 Q gi|255764489|r 47 TMSRNIACRTISKKRFAVHGTPVDCVVIALQK 78 (250) Q Consensus 47 t~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~ 78 (250) ||.-.|.+++- ...-.++-.|.||+.+|+.. T Consensus 87 tyyA~L~~~~~-~~~~~iDaRPSDaI~LAlr~ 117 (151) T COG1259 87 TYYATLILEQD-DGKIQIDARPSDAIALALRV 117 (151) T ss_pred EEEEEEEEECC-CCEEEEECCCCHHHHHHHHH T ss_conf 08999999728-92588722650478999986 No 91 >pfam04432 FrhB_FdhB_C Coenzyme F420 hydrogenase/dehydrogenase, beta subunit C terminus. Coenzyme F420 hydrogenase (EC:1.12.99.1) reduces the low-potential two-electron acceptor coenzyme F420. This family contains the C termini of F420 hydrogenase and dehydrogenase beta subunits. The N terminus of Methanobacterium formicicum formate dehydrogenase beta chain (EC:1.2.1.2) is also a member of this family. This region is often found in association with the 4Fe-4S binding domain, fer4 (pfam00037). Probab=35.98 E-value=36 Score=15.54 Aligned_cols=36 Identities=42% Similarity=0.750 Sum_probs=24.4 Q ss_pred EEEECCCHHHHHHHHHHHCCCCCCC-----EEEECCCCCCCCC Q ss_conf 4763272799999998750464541-----9997148885310 Q gi|255764489|r 61 RFAVHGTPVDCVVIALQKMSDKKPD-----LILSGVNVGTNTS 98 (250) Q Consensus 61 ~~~v~GtPaDcv~~~l~~l~~~~pD-----lViSGiN~G~N~g 98 (250) .+++-||| |-..||..+..+.++ +++-|+.-+.|.. T Consensus 5 rvavvG~P--Cqi~alr~l~~~~~~~~~~i~~~Igl~C~~~~s 45 (162) T pfam04432 5 KVAVVGTP--CQIQALRKLQKRNPDLLDNIVLTIGLFCGGNFS 45 (162) T ss_pred EEEEEECC--HHHHHHHHHHHHCCCCCCCEEEEEEECCCCCCC T ss_conf 99999048--789999999861745452527999725899989 No 92 >pfam00201 UDPGT UDP-glucoronosyl and UDP-glucosyl transferase. Probab=33.78 E-value=39 Score=15.32 Aligned_cols=38 Identities=11% Similarity=0.075 Sum_probs=26.7 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCC Q ss_conf 26882689868856999999998548919998179985 Q gi|255764489|r 2 RILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQ 39 (250) Q Consensus 2 ~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~ 39 (250) ||||-==.|=+--.+..|.++|.+.||+|+++-|+... T Consensus 2 kVLVfP~~gSH~~~~~~l~~~La~RGH~VTvl~p~~~~ 39 (501) T pfam00201 2 KVLVWPMDGSHWMNMKGILLELVQRGHEVTVLRPSASI 39 (501) T ss_pred CEEEECCCCCHHHHHHHHHHHHHHCCCEEEEEEEECCC T ss_conf 58998499984999999999999889929999713512 No 93 >PRK10125 predicted glycosyl transferase; Provisional Probab=33.70 E-value=39 Score=15.31 Aligned_cols=34 Identities=18% Similarity=-0.039 Sum_probs=28.0 Q ss_pred CEEEEECC-CCCCCHHHHH--HHHHHHHCCCCEEEEC Q ss_conf 92688268-9868856999--9999985489199981 Q gi|255764489|r 1 MRILLTND-DGIKSKGLIT--LENIARSISDDIWICA 34 (250) Q Consensus 1 m~ILitND-DG~~a~gl~~--l~~~l~~~~~~v~vvA 34 (250) ||||.-|= |+.-+.|..+ |.++|++.|+++.++. T Consensus 1 MrVL~INts~~~GGAaraA~rLh~~L~~~G~~s~mlv 37 (405) T PRK10125 1 MNILQFNVRLAEGGAAGVALDLHQRALQQGLASHFVY 37 (405) T ss_pred CEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 9599997366988299999999999985799669999 No 94 >pfam06406 StbA StbA protein. This family consists of several bacterial StbA plasmid stability proteins. Probab=33.14 E-value=40 Score=15.26 Aligned_cols=60 Identities=17% Similarity=0.130 Sum_probs=34.4 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCC Q ss_conf 9268826898688569999999985489199981799856503202248970588971104763272 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGT 67 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~Gt 67 (250) |||-+ |||...-=|.. .+-|.-.+..-|.+=+++-+.++.-.+..-=.+++...|+++-+ T Consensus 1 m~i~i--DdGstniKlaW-----~e~g~i~t~~~pnsf~~~~~~~~~~~~~~~nYeidG~kyt~d~~ 60 (318) T pfam06406 1 MKIFI--DDGSTNIKLAW-----LEDGEVKTLISPNSFKPEWSVSLLGDLQPFNYEIDGEKYSFDPV 60 (318) T ss_pred CEEEE--ECCCCCEEEEE-----ECCCEEEEEECCCCCCCCCCCCCCCCCCEEEEEECCEEEEECCC T ss_conf 95999--35988536898-----51988877544200356655341278530357977789997788 No 95 >PRK08340 glucose-1-dehydrogenase; Provisional Probab=33.14 E-value=40 Score=15.26 Aligned_cols=79 Identities=10% Similarity=0.077 Sum_probs=38.7 Q ss_pred CEEEEECCCCCCCHHH-HHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHC Q ss_conf 9268826898688569-999999985489199981799856503202248970588971104763272799999998750 Q gi|255764489|r 1 MRILLTNDDGIKSKGL-ITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKM 79 (250) Q Consensus 1 m~ILitNDDG~~a~gl-~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l 79 (250) ||||||=- +-|| ++.++.|.+.|++|.++.-..++ ....+-.+.+ . -....+.+|=|-.+-+.-.+... T Consensus 1 mnVlITGa----s~GIG~aiA~~la~~Ga~V~i~~r~~~~-l~~~~~~l~~-~----g~~~~~~~Dv~~~~~v~~~v~~~ 70 (259) T PRK08340 1 MNVLVTAS----SRGIGFNVARELLKKGARVVISSRNEEN-LEKALKELKE-L----GEVYAIKADLSDKDDLKNLVKEA 70 (259) T ss_pred CEEEEECC----CCHHHHHHHHHHHHCCCEEEEEECCHHH-HHHHHHHHHH-C----CCEEEEEEECCCHHHHHHHHHHH T ss_conf 98999758----7789999999999879999999799899-9999999874-1----88799996369989999999999 Q ss_pred CC--CCCCEEEE Q ss_conf 46--45419997 Q gi|255764489|r 80 SD--KKPDLILS 89 (250) Q Consensus 80 ~~--~~pDlViS 89 (250) .+ -+.|.+|- T Consensus 71 ~~~~G~iD~LVn 82 (259) T PRK08340 71 WELLGGIDALVW 82 (259) T ss_pred HHHHCCCCEEEE T ss_conf 998599888998 No 96 >pfam00381 PTS-HPr PTS HPr component phosphorylation site. Probab=33.09 E-value=40 Score=15.25 Aligned_cols=77 Identities=14% Similarity=0.185 Sum_probs=45.8 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCC Q ss_conf 2688268986885699999999854891999817998565032022489705889711047632727999999987504 Q gi|255764489|r 2 RILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMS 80 (250) Q Consensus 2 ~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~ 80 (250) .+-|.|..|+++-=-..|++.++++..+|++.- ..+..-+.|+.----|.+++-+.=...++|.=+.-+.-.|..++ T Consensus 4 ~~~v~n~~GLHARPAa~lv~~a~~~~~~v~i~~--~~~~v~akSi~~ll~L~~~~G~~v~v~~~G~de~~Al~~l~~~~ 80 (83) T pfam00381 4 EVTIKNPLGLHARPAALLVKAASKFDSDITLVK--GGKTANAKSIMGLMSLGAKQGDEITISAEGEDAEEALEALAELL 80 (83) T ss_pred EEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEE--CCEEECCHHHHHHHHCCCCCCCEEEEEEECCCHHHHHHHHHHHH T ss_conf 999838997646299999999985897499998--99996225599998408789999999997888999999999999 No 97 >TIGR01279 DPOR_bchN light-independent protochlorophyllide reductase, N subunit; InterPro: IPR005970 Synonym: dark protochlorophyllide reductase Protochlorophyllide reductase catalyzes the reductive formation of chlorophyllide from protochlorophyllide during biosynthesis of chlorophylls and bacteriochlorophylls. The light-independent (dark) form of protochlorophyllide reductase plays a key role in the ability of gymnosperms, algae, and photosynthetic bacteria to form chlorophyll in the dark. Genetic and sequence analyses have indicated that dark protochlorophyllide reductase consists of three protein subunits that exhibit significant sequence similarity to the three subunits of nitrogenase, which catalyzes the reductive formation of ammonia from dinitrogen. Dark protochlorophyllide reductase activity was shown to be dependent on the presence of all three subunits, ATP, and the reductant dithionite. This enzyme describes the N subunit of the dark form protochlorophyllide reductase, a nitrogenase-like enzyme . ; GO: 0016730 oxidoreductase activity acting on iron-sulfur proteins as donors, 0015995 chlorophyll biosynthetic process, 0019685 photosynthesis dark reaction. Probab=32.95 E-value=11 Score=18.64 Aligned_cols=17 Identities=12% Similarity=0.325 Sum_probs=6.9 Q ss_pred EEECCCHHHHHHHHHHH Q ss_conf 76327279999999875 Q gi|255764489|r 62 FAVHGTPVDCVVIALQK 78 (250) Q Consensus 62 ~~v~GtPaDcv~~~l~~ 78 (250) |-++.=|.+-.+|-|.+ T Consensus 91 fllssCt~ev~K~Dl~~ 107 (458) T TIGR01279 91 FLLSSCTPEVIKMDLEG 107 (458) T ss_pred EEECCCCCCEEECCHHH T ss_conf 99616886225402677 No 98 >cd00948 FBP_aldolase_I_a Fructose-1,6-bisphosphate aldolase. The enzyme catalyzes the cleavage of fructose 1,6-bisphosphate to glyceraldehyde 3-phosphate and dihydroxyacetone phosphate (DHAP). This family includes proteins found in vertebrates, plants, and bacterial plant pathogens. Mutations in the aldolase genes in humans cause hemolytic anemia and hereditary fructose intolerance. The enzyme is a member of the class I aldolase family, which utilizes covalent catalysis through a Schiff base formed between a lysine residue of the enzyme and ketose substrates. Probab=31.27 E-value=34 Score=15.63 Aligned_cols=131 Identities=15% Similarity=0.178 Sum_probs=59.9 Q ss_pred CEEEECCCCCCCCEEEEECCCCCEE--------------------EEEEEEEEEEECCCHHHHHHHHHHHCCCC------ Q ss_conf 1999817998565032022489705--------------------88971104763272799999998750464------ Q gi|255764489|r 29 DIWICAPEMDQSCLANSLTMSRNIA--------------------CRTISKKRFAVHGTPVDCVVIALQKMSDK------ 82 (250) Q Consensus 29 ~v~vvAP~~~~S~~~~ait~~~~~~--------------------~~~~~~~~~~v~GtPaDcv~~~l~~l~~~------ 82 (250) +...-+|.-++ -.|..|-+.+.++ -.|++.......|+|-.-+.-||++|..+ T Consensus 49 elLftt~~~~~-yIsGvILfdETl~Q~t~dG~~~~~~L~~~givpGIKVD~G~~~l~g~~~E~~T~GLDgL~eR~~~Y~~ 127 (330) T cd00948 49 ELLFTTPGLGQ-YISGVILFEETLYQKTDDGKPFVDILKEKGIVPGIKVDKGLVPLAGTDGETTTQGLDGLAERCAKYYK 127 (330) T ss_pred HHHEECCCHHH-CEEEEECCHHHHCCCCCCCCCHHHHHHHCCCEEEEECCCCCCCCCCCCCCEECCCCCCHHHHHHHHHH T ss_conf 88510887343-52378776777434565688147999867973459737986168999834104674539999999998 Q ss_pred ------CCCEEEECCCCCCCCC-HHHHHHHHHHHHHHHHHHC---CCCC-C--CCCCCCCCCC--CCHHHHHHHHHHHHH Q ss_conf ------5419997148885310-2333457778998898718---9853-2--2200135676--614788888499999 Q gi|255764489|r 83 ------KPDLILSGVNVGTNTS-NHVAYSGTLAAAFEGSLQG---IRSF-A--LSQAYTYENM--IPWEVSETHAPRVLR 147 (250) Q Consensus 83 ------~pDlViSGiN~G~N~g-~~v~ySGTvgAA~ea~~~G---ipsI-A--iS~~~~~~~~--~~~~~a~~~~~~ii~ 147 (250) |=--|++ .|.++= ...+-.-.-+=|+-|+++. +=.| - +-+ .++.. ..++..+++...+.+ T Consensus 128 ~GArFAKWRaVi~---I~~~~PS~~~I~~Na~~LArYA~icQ~~glVPIVEPEVLm--dG~H~ie~c~~vt~~vL~~vf~ 202 (330) T cd00948 128 QGARFAKWRAVLK---IGNGTPSELAIKENAHGLARYAAICQENGLVPIVEPEVLM--DGDHDIERCQEVTEKVLAAVYK 202 (330) T ss_pred CCCCEEEEEEEEE---CCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCHHHHHHHHHHHHHHHHH T ss_conf 6998234026787---6999986999999899999999999985981321501001--6763479999999999999999 Q ss_pred HHHHCCC-CCCCEEECCCC Q ss_conf 9986778-66735431456 Q gi|255764489|r 148 QLLKTQI-PNTTLCNINFP 165 (250) Q Consensus 148 ~l~~~~~-p~~~vlNINiP 165 (250) .|.+.+. -++.+|--|+- T Consensus 203 ~L~~~~V~leG~lLKPnMV 221 (330) T cd00948 203 ALNDHHVLLEGTLLKPNMV 221 (330) T ss_pred HHHHCCCCCCCCEECCCCC T ss_conf 9998297657744026501 No 99 >PRK08250 glutamine amidotransferase; Provisional Probab=31.06 E-value=43 Score=15.05 Aligned_cols=36 Identities=17% Similarity=0.090 Sum_probs=30.6 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCC Q ss_conf 92688268986885699999999854891999817998 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMD 38 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~ 38 (250) ||||+----=++.||. +.+.|++.|+++.++-|... T Consensus 1 MrilviqH~~~E~pG~--i~~~l~~~G~~~~~~~~~~g 36 (235) T PRK08250 1 MRVHFIIHESFESPGA--YLKWAEDRGYTISWSRVYAG 36 (235) T ss_pred CEEEEEECCCCCCCCH--HHHHHHHCCCEEEEEECCCC T ss_conf 9799998189989707--89999978996999966899 No 100 >PRK11249 katE hydroperoxidase II; Provisional Probab=30.36 E-value=44 Score=14.97 Aligned_cols=27 Identities=15% Similarity=0.329 Sum_probs=9.3 Q ss_pred CCCCCHHHHHHHHHHHHCCCCEEEECC Q ss_conf 986885699999999854891999817 Q gi|255764489|r 9 DGIKSKGLITLENIARSISDDIWICAP 35 (250) Q Consensus 9 DG~~a~gl~~l~~~l~~~~~~v~vvAP 35 (250) ||++...+..|++++.+.|..+.|+|| T Consensus 579 dg~d~~~~~~l~~~l~~~g~~~~iv~p 605 (725) T PRK11249 579 DGVDAADLLAILKALKAKGAHAKLLYP 605 (725) T ss_pred CCCCHHHHHHHHHHHHHCCCEEEEEEC T ss_conf 898989999999999968998999936 No 101 >cd01831 Endoglucanase_E_like Endoglucanase E-like members of the SGNH hydrolase family; Endoglucanase E catalyzes the endohydrolysis of 1,4-beta-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans. Probab=29.77 E-value=45 Score=14.91 Aligned_cols=12 Identities=58% Similarity=1.187 Sum_probs=9.1 Q ss_pred CCEEEECCCCCCCC Q ss_conf 41999714888531 Q gi|255764489|r 84 PDLILSGVNVGTNT 97 (250) Q Consensus 84 pDlViSGiN~G~N~ 97 (250) ||+|| ||.|.|= T Consensus 56 Pd~vv--i~lGtND 67 (169) T cd01831 56 PDLVV--INLGTND 67 (169) T ss_pred CCEEE--EECCCCC T ss_conf 98899--9777667 No 102 >cd01833 XynB_like SGNH_hydrolase subfamily, similar to Ruminococcus flavefaciens XynB. Most likely a secreted hydrolase with xylanase activity. SGNH hydrolases are a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad found in other serine hydrolases. Probab=29.67 E-value=42 Score=15.11 Aligned_cols=44 Identities=27% Similarity=0.379 Sum_probs=28.0 Q ss_pred EEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHH-CCCCCCCEEE--ECCCC Q ss_conf 0320224897058897110476327279999999875-0464541999--71488 Q gi|255764489|r 42 LANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQK-MSDKKPDLIL--SGVNV 93 (250) Q Consensus 42 ~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~-l~~~~pDlVi--SGiN~ 93 (250) .|-|||++.+ .-..++|.-+.=+.--+.. +...+||+|+ -|+|. T Consensus 6 ~gdsit~gd~--------~n~G~~G~t~~~~~~r~~~~v~~~~PdiV~i~~GtND 52 (157) T cd01833 6 LGDSITWGDK--------DHEGHSGYLIDQIAAAAADWVLAAKPDVVLLHLGTND 52 (157) T ss_pred ECCCCCCCCC--------CCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 2275545789--------9999784449999998898898679989999736675 No 103 >cd03800 GT1_Sucrose_synthase This family is most closely related to the GT1 family of glycosyltransferases. The sucrose-phosphate synthases in this family may be unique to plants and photosynthetic bacteria. This enzyme catalyzes the synthesis of sucrose 6-phosphate from fructose 6-phosphate and uridine 5'-diphosphate-glucose, a key regulatory step of sucrose metabolism. The activity of this enzyme is regulated by phosphorylation and moderated by the concentration of various metabolites and light. Probab=29.50 E-value=46 Score=14.88 Aligned_cols=93 Identities=15% Similarity=0.194 Sum_probs=43.8 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHH------HHH--HHHHHC--CCCCCC Q ss_conf 9999999985489199981799856503202248970588971104763272799------999--998750--464541 Q gi|255764489|r 16 LITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVD------CVV--IALQKM--SDKKPD 85 (250) Q Consensus 16 l~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaD------cv~--~~l~~l--~~~~pD 85 (250) +..|+++|.+.||+|+|+++...... .........+++.++... ...+.+.+ +-+ -.+..+ ...+|| T Consensus 27 v~~La~~L~~~GH~V~V~t~~~~~~~-~~~~~~~~gv~v~rl~~~--~~~~~~~~~l~~~l~~~~~~~~~~~~~~~~~pD 103 (398) T cd03800 27 VLELARALARLGHEVDIFTRRIDDAL-PPIVELAPGVRVVRVPAG--PAEYLPKEELWPYLDEFADDLLRFLRREGGRPD 103 (398) T ss_pred HHHHHHHHHHCCCEEEEEECCCCCCC-CCEEEECCCEEEEEECCC--CCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 99999999986996999972477788-880682498699995578--854332777787899999999999998389988 Q ss_pred EEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9997148885310233345777899889871898532 Q gi|255764489|r 86 LILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFA 122 (250) Q Consensus 86 lViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIA 122 (250) +|.+ +. ..+|.+ +..-+..+|+|-+. T Consensus 104 vIH~--h~--------~~~~~~-~~~~~~~~~ip~V~ 129 (398) T cd03800 104 LIHA--HY--------WDSGLV-ALLLARRLGIPLVH 129 (398) T ss_pred EEEE--CC--------CCHHHH-HHHHHHHHCCCEEE T ss_conf 8998--88--------407899-99999971999999 No 104 >PRK10897 phosphohistidinoprotein-hexose phosphotransferase component of N-regulated PTS system (Npr); Provisional Probab=29.26 E-value=46 Score=14.86 Aligned_cols=82 Identities=12% Similarity=0.147 Sum_probs=51.9 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCCC Q ss_conf 26882689868856999999998548919998179985650320224897058897110476327279999999875046 Q gi|255764489|r 2 RILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMSD 81 (250) Q Consensus 2 ~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~~ 81 (250) .+-|+|--|.++-==..|++.++.+..+|++.--+ .+..-+.||-----|.+.+-..-...++|.=+.-+.-+|-.|+. T Consensus 6 ~vtI~N~~GLHARPAa~lv~~a~~f~s~I~~~~~~-g~~~nakSi~~lm~L~a~~G~~i~v~a~G~De~eAl~al~~Li~ 84 (90) T PRK10897 6 TVEITNKLGMHARPAMKLFELMQGFDAEVLLRNDE-GTEAEANSVIALLMLDSAKGRQIEVEATGPQEEEALAAVIALFN 84 (90) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHCCCEEEEEECC-CCEECCCCHHHHHHHCCCCCCEEEEEEECCCHHHHHHHHHHHHH T ss_conf 99996898756708999999997499689999299-96885746999996067999999999988379999999999998 Q ss_pred CCC Q ss_conf 454 Q gi|255764489|r 82 KKP 84 (250) Q Consensus 82 ~~p 84 (250) ..+ T Consensus 85 ~~F 87 (90) T PRK10897 85 SGF 87 (90) T ss_pred HCC T ss_conf 279 No 105 >TIGR03449 mycothiol_MshA UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase. Members of this protein family, found exclusively in the Actinobacteria, are MshA, the glycosyltransferase of mycothiol biosynthesis. Mycothiol replaces glutathione in these species. Probab=29.01 E-value=46 Score=14.83 Aligned_cols=105 Identities=13% Similarity=0.108 Sum_probs=51.7 Q ss_pred ECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEE-EC--CCHHHHHHHH--H-HHC Q ss_conf 2689868856999999998548919998179985650320224897058897110476-32--7279999999--8-750 Q gi|255764489|r 6 TNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFA-VH--GTPVDCVVIA--L-QKM 79 (250) Q Consensus 6 tNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~-v~--GtPaDcv~~~--l-~~l 79 (250) ++|=|=-.--+..|++.|.+.||+|.|+++...-+. ........-+++..+....|. ++ --|.....+. + ..+ T Consensus 16 ~gd~GG~e~~v~~La~~La~rGheV~V~t~~~~~~~-~~~~~~~~gv~v~~~~~~p~~~~~~~~l~~~l~~~~~~~l~~~ 94 (405) T TIGR03449 16 TGDAGGMNVYILETATELARRGIEVDIFTRATRPSQ-PPVVEVAPGVRVRNVVAGPYEGLDKEDLPTQLCAFTGGVLRAE 94 (405) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCC-CCEEEECCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 999588699999999999978996999935888778-8846704984999825786232456676999999999999999 Q ss_pred C---CCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 4---645419997148885310233345777899889871898532 Q gi|255764489|r 80 S---DKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFA 122 (250) Q Consensus 80 ~---~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIA 122 (250) . +..+|+|-+- ...||.++. .-+..+|+|-|. T Consensus 95 ~~~~~~~~DvIH~h----------~~~~~~~~~-~~~~~~~iP~V~ 129 (405) T TIGR03449 95 ARHEPGYYDLIHSH----------YWLSGQVGW-LLRDRWGVPLVH 129 (405) T ss_pred HHHCCCCCCEEEEC----------CCCHHHHHH-HHHHHCCCCEEE T ss_conf 98568997689988----------710789999-999864999899 No 106 >cd03798 GT1_wlbH_like This family is most closely related to the GT1 family of glycosyltransferases. wlbH in Bordetella parapertussis has been shown to be required for the biosynthesis of a trisaccharide that, when attached to the B. pertussis lipopolysaccharide (LPS) core (band B), generates band A LPS. Probab=28.60 E-value=47 Score=14.79 Aligned_cols=27 Identities=11% Similarity=-0.070 Sum_probs=21.6 Q ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCC Q ss_conf 569999999985489199981799856 Q gi|255764489|r 14 KGLITLENIARSISDDIWICAPEMDQS 40 (250) Q Consensus 14 ~gl~~l~~~l~~~~~~v~vvAP~~~~S 40 (250) --+..|++.|.+.||+|+|++|..... T Consensus 18 ~~~~~la~~L~~~G~~V~Vit~~~~~~ 44 (377) T cd03798 18 IFVKELARALAKRGVEVTVLAPGPWGP 44 (377) T ss_pred HHHHHHHHHHHHCCCEEEEEECCCCCC T ss_conf 999999999997799699995379875 No 107 >TIGR01282 nifD nitrogenase molybdenum-iron protein alpha chain; InterPro: IPR005972 The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene , . Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes . It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I .; GO: 0016163 nitrogenase activity, 0009399 nitrogen fixation, 0016612 molybdenum-iron nitrogenase complex. Probab=28.57 E-value=24 Score=16.55 Aligned_cols=30 Identities=7% Similarity=0.134 Sum_probs=15.3 Q ss_pred HHHHHH-HHHHCCCCEEEECCCCCCCCEEEEE Q ss_conf 999999-9985489199981799856503202 Q gi|255764489|r 16 LITLEN-IARSISDDIWICAPEMDQSCLANSL 46 (250) Q Consensus 16 l~~l~~-~l~~~~~~v~vvAP~~~~S~~~~ai 46 (250) |.+.++ .-++++..| |-.=.+---|.|.|+ T Consensus 152 IeaVA~~~~k~~~~~V-~PvrCEGfRGVSQSl 182 (521) T TIGR01282 152 IEAVAKKASKELGKPV-VPVRCEGFRGVSQSL 182 (521) T ss_pred HHHHHHHHHHHHCCCE-EEEEECCCCCCCCCC T ss_conf 6899999998628835-788606753732012 No 108 >pfam10841 DUF2644 Protein of unknown function (DUF2644). This family of proteins with unknown function appear to be restricted to Pasteurellaceae. Probab=28.37 E-value=22 Score=16.77 Aligned_cols=11 Identities=64% Similarity=0.730 Sum_probs=8.3 Q ss_pred EEEECCCCCCC Q ss_conf 68826898688 Q gi|255764489|r 3 ILLTNDDGIKS 13 (250) Q Consensus 3 ILitNDDG~~a 13 (250) =|+||+||--| T Consensus 2 eL~tn~dGrlS 12 (60) T pfam10841 2 ELITNDDGRLS 12 (60) T ss_pred CCEECCCCCEE T ss_conf 63356898583 No 109 >PRK09581 pleD response regulator PleD; Reviewed Probab=28.35 E-value=48 Score=14.76 Aligned_cols=29 Identities=21% Similarity=0.142 Sum_probs=19.8 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCCCEEEE Q ss_conf 26882689868856999999998548919998 Q gi|255764489|r 2 RILLTNDDGIKSKGLITLENIARSISDDIWIC 33 (250) Q Consensus 2 ~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vv 33 (250) ||||.-||-.. .+.|...|++.|++|.++ T Consensus 4 rILiVdD~~~~---~~~l~~~L~~~~y~v~~a 32 (457) T PRK09581 4 RILVVDDIPAN---VKLLEAKLLAEYYTVLTA 32 (457) T ss_pred CEEEECCCHHH---HHHHHHHHHHCCCEEEEE T ss_conf 19999498999---999999998789999998 No 110 >KOG4120 consensus Probab=28.08 E-value=32 Score=15.83 Aligned_cols=20 Identities=20% Similarity=0.561 Sum_probs=14.7 Q ss_pred HCCCCCCCEEEECCCCCCCCC Q ss_conf 504645419997148885310 Q gi|255764489|r 78 KMSDKKPDLILSGVNVGTNTS 98 (250) Q Consensus 78 ~l~~~~pDlViSGiN~G~N~g 98 (250) +|++ -+|+||-|||.|--.+ T Consensus 144 ~lc~-NldiVIVGiNPGl~aa 163 (426) T KOG4120 144 YLCE-NLDIVIVGINPGLTAA 163 (426) T ss_pred HHHC-CCCEEEEECCCHHHHH T ss_conf 6635-7667999437015555 No 111 >PRK00025 lpxB lipid-A-disaccharide synthase; Reviewed Probab=27.97 E-value=30 Score=16.03 Aligned_cols=22 Identities=32% Similarity=0.404 Sum_probs=15.5 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 334577789988987189853222 Q gi|255764489|r 101 VAYSGTLAAAFEGSLQGIRSFALS 124 (250) Q Consensus 101 v~ySGTvgAA~ea~~~GipsIAiS 124 (250) +..|||+ ++|.++.|+|.|.+- T Consensus 269 i~~SGTa--TLE~al~~~P~Vv~Y 290 (382) T PRK00025 269 LAASGTV--TLEAALLGVPMVVGY 290 (382) T ss_pred EECCCHH--HHHHHHHCCCEEEEE T ss_conf 6537779--999999719858998 No 112 >PRK05429 gamma-glutamyl kinase; Provisional Probab=27.91 E-value=49 Score=14.71 Aligned_cols=31 Identities=23% Similarity=0.451 Sum_probs=23.9 Q ss_pred EEECCCC-CCCHHHHHHHHH---HHHCCCCEEEEC Q ss_conf 8826898-688569999999---985489199981 Q gi|255764489|r 4 LLTNDDG-IKSKGLITLENI---ARSISDDIWICA 34 (250) Q Consensus 4 LitNDDG-~~a~gl~~l~~~---l~~~~~~v~vvA 34 (250) +||+++| ++..-|..|++. |+..|++|.+|- T Consensus 19 ~Lt~~~g~l~~~~i~~l~~~I~~L~~~G~evilVS 53 (372) T PRK05429 19 LLTDDGGGLDRARIAELARQIAALRAAGHEVVLVS 53 (372) T ss_pred EEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEC T ss_conf 36898888889999999999999997899999988 No 113 >cd04256 AAK_P5CS_ProBA AAK_P5CS_ProBA: Glutamate-5-kinase (G5K) domain of the bifunctional delta 1-pyrroline-5-carboxylate synthetase (P5CS), composed of an N-terminal G5K (ProB) and a C-terminal glutamyl 5- phosphate reductase (G5PR, ProA), the first and second enzyme catalyzing proline (and, in mammals, ornithine) biosynthesis. G5K transfers the terminal phosphoryl group of ATP to the gamma-carboxyl group of glutamate, and is subject to feedback allosteric inhibition by proline or ornithine. In plants, proline plays an important role as an osmoprotectant and, in mammals, ornithine biosynthesis is crucial for proper ammonia detoxification, since a G5K mutation has been shown to cause human hyperammonaemia. Probab=27.76 E-value=49 Score=14.70 Aligned_cols=31 Identities=13% Similarity=0.217 Sum_probs=22.0 Q ss_pred EEECCCCC--CCHHHHHHHHH---HHHCCCCEEEEC Q ss_conf 88268986--88569999999---985489199981 Q gi|255764489|r 4 LLTNDDGI--KSKGLITLENI---ARSISDDIWICA 34 (250) Q Consensus 4 LitNDDG~--~a~gl~~l~~~---l~~~~~~v~vvA 34 (250) +||++||. +-.-|..|++. |++.|++|.+|- T Consensus 19 ~lt~~d~~~~~~~~i~~l~~~I~~L~~~G~evvlVs 54 (284) T cd04256 19 VVTREDECGLALGRLASIVEQVSELQSQGREVILVT 54 (284) T ss_pred HEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEC T ss_conf 237999997589999999999999997899699990 No 114 >TIGR01380 glut_syn glutathione synthase; InterPro: IPR006284 These are the glutathione synthetases found in Gram-negative bacteria. This gene does not appear to be present in genomes of Gram-positive bacteria. Glutathione synthetase has an ATP-binding domain in the COOH terminus and catalyzes the second step in the glutathione biosynthesis pathway: ATP + gamma-L-glutamyl-L-cysteine + glycine = ADP + phosphate + glutathione. Glutathione is a tripeptide that functions as a reductant in many cellular reactions. ; GO: 0004363 glutathione synthase activity, 0006750 glutathione biosynthetic process. Probab=27.65 E-value=49 Score=14.68 Aligned_cols=107 Identities=17% Similarity=0.238 Sum_probs=67.2 Q ss_pred HHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEE--EEEEEEECCC---HHHHHHHHHHHCCCCCCCEEEECCC Q ss_conf 9999998548919998179985650320224897058897--1104763272---7999999987504645419997148 Q gi|255764489|r 18 TLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTI--SKKRFAVHGT---PVDCVVIALQKMSDKKPDLILSGVN 92 (250) Q Consensus 18 ~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~--~~~~~~v~Gt---PaDcv~~~l~~l~~~~pDlViSGiN 92 (250) +|-++++..||+||..-|..=-=-.|..+-.-+|+++..+ .+..|.|+.- =.|.+.+.|. .+.|.|+= T Consensus 22 alm~EaQ~RgH~~f~~~~~~Ls~~~G~~~A~~~~~~~~pv~~~~~~WQv~~~w~~~~~~~~~~L~----d~~d~vlM--- 94 (322) T TIGR01380 22 ALMEEAQKRGHELFFYEPGDLSVKNGRVFARAAPVKVLPVQLDDDRWQVKQDWYTLGEKVRLALK----DELDVVLM--- 94 (322) T ss_pred HHHHHHHHCCCEEEEEECCCEEEECCHHHEEEEEEEEEEEEECCCCCEECHHHHHHHHHHHHCCC----CCCCEEEE--- T ss_conf 99989854693899984361111023111002106686058718801011278887665420113----57429998--- Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCC-----CCCC------CEEE Q ss_conf 8853102333457778998898718985322200135676614788888499999998677-----8667------3543 Q gi|255764489|r 93 VGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQAYTYENMIPWEVSETHAPRVLRQLLKTQ-----IPNT------TLCN 161 (250) Q Consensus 93 ~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~~~~~~~~~~~~a~~~~~~ii~~l~~~~-----~p~~------~vlN 161 (250) ..+.+|+...-|+.-+++.+...+ -|++ .++- T Consensus 95 -------------------------------------RkDPPf~~~Y~yaT~lLe~~~~~g~~viN~P~~lRd~NEKl~~ 137 (322) T TIGR01380 95 -------------------------------------RKDPPFDMEYIYATYLLELAEPTGTLVINSPQGLRDANEKLFA 137 (322) T ss_pred -------------------------------------CCCCCCCHHHHHHHHHHHHHHHCCCEEEECCCHHHCCCHHHHH T ss_conf -------------------------------------4750986315446778877842796798488232246603454 Q ss_pred CCCCCCC Q ss_conf 1456788 Q gi|255764489|r 162 INFPRCS 168 (250) Q Consensus 162 INiP~~~ 168 (250) .+||.+- T Consensus 138 ~~Fp~~~ 144 (322) T TIGR01380 138 LQFPKVI 144 (322) T ss_pred HCCCCCC T ss_conf 3341235 No 115 >cd03819 GT1_WavL_like This family is most closely related to the GT1 family of glycosyltransferases. WavL in Vibrio cholerae has been shown to be involved in the biosynthesis of the lipopolysaccharide core. Probab=27.65 E-value=49 Score=14.68 Aligned_cols=25 Identities=8% Similarity=0.098 Sum_probs=19.1 Q ss_pred HHHHHHHHHHHHCCCCEEEECCCCC Q ss_conf 5699999999854891999817998 Q gi|255764489|r 14 KGLITLENIARSISDDIWICAPEMD 38 (250) Q Consensus 14 ~gl~~l~~~l~~~~~~v~vvAP~~~ 38 (250) .-+..|++.|.+.||+|+|+.+... T Consensus 14 ~~~~~La~~L~~~Gh~V~vi~~~~~ 38 (355) T cd03819 14 RGTLELARALVERGHRSLVASAGGR 38 (355) T ss_pred HHHHHHHHHHHHCCCEEEEEECCCC T ss_conf 9999999999987998999968998 No 116 >PRK02858 germination protease; Provisional Probab=27.05 E-value=35 Score=15.56 Aligned_cols=54 Identities=19% Similarity=0.465 Sum_probs=32.1 Q ss_pred ECCCHHHHHHHHHHHCCCCCCCEEE---------------------ECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 3272799999998750464541999---------------------7148885310233345777899889871898532 Q gi|255764489|r 64 VHGTPVDCVVIALQKMSDKKPDLIL---------------------SGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFA 122 (250) Q Consensus 64 v~GtPaDcv~~~l~~l~~~~pDlVi---------------------SGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIA 122 (250) ..|--+--+.-|+ +-.-|||+|| .||+.|+-+|..=. ++---..|||=|| T Consensus 165 ~TGiET~EIIkgI--Vek~kPD~VIaIDALAaRs~~Rln~TIQIsDTGI~PGSGVGN~R~-------~i~~etLGVPVIA 235 (367) T PRK02858 165 ITGIETSDIIYGI--IEKTKPDFVIAIDALAARSIERVNSTIQISDTGIHPGSGVGNKRK-------ELSKETLGIPVIA 235 (367) T ss_pred CCCCCHHHHHHHH--HHHHCCCEEEEEHHHHCCCHHHCCCEEEECCCCCCCCCCCCCCCC-------CCCHHHCCCCEEE T ss_conf 4440399999999--875089889996031227567626757854678688877564444-------2688772998799 Q ss_pred CCCC Q ss_conf 2200 Q gi|255764489|r 123 LSQA 126 (250) Q Consensus 123 iS~~ 126 (250) +-.- T Consensus 236 IGVP 239 (367) T PRK02858 236 IGVP 239 (367) T ss_pred ECCC T ss_conf 6588 No 117 >cd06428 M1P_guanylylT_A_like_N N-terminal domain of M1P_guanylyl_A_ like proteins are likely to be a isoform of GDP-mannose pyrophosphorylase. N-terminal domain of the M1P-guanylyltransferase A-isoform like proteins: The proteins of this family are likely to be a isoform of GDP-mannose pyrophosphorylase. Their sequences are highly conserved with mannose-1-phosphate guanyltransferase, but generally about 40-60 bases longer. GDP-mannose pyrophosphorylase (GTP: alpha-d-mannose-1-phosphate guanyltransferase) catalyzes the formation of GDP-d-mannose from GTP and alpha-d-mannose-1-Phosphate. It contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain. GDP-d-mannose is the activated form of mannose for formation of cell wall lipoarabinomannan and various mannose-containing glycolipids and polysaccharides. The function of GDP-mannose pyrophosphorylase is essential for cell wall integrity, morphogenesis and viability. Repre Probab=26.97 E-value=50 Score=14.61 Aligned_cols=22 Identities=18% Similarity=0.143 Sum_probs=10.0 Q ss_pred HHHHHHHHCCCCCCCEEECCCCC Q ss_conf 99999986778667354314567 Q gi|255764489|r 144 RVLRQLLKTQIPNTTLCNINFPR 166 (250) Q Consensus 144 ~ii~~l~~~~~p~~~vlNINiP~ 166 (250) ++++...+++ ...+++-+++|. T Consensus 121 ~l~~~h~~~~-~~~ti~~~~v~~ 142 (257) T cd06428 121 ELLEFHKKHG-ASGTILGTEASR 142 (257) T ss_pred HHHHHHHHCC-CCEEEEEEECCC T ss_conf 9999999779-986999998782 No 118 >PRK12320 hypothetical protein; Provisional Probab=26.93 E-value=51 Score=14.60 Aligned_cols=44 Identities=14% Similarity=0.080 Sum_probs=34.4 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEC Q ss_conf 92688268986885699999999854891999817998565032022 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLT 47 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait 47 (250) ||||+|.--|.-.-| |..-|-+.||+|+=.|-...-|+-+.+=. T Consensus 1 M~i~VT~A~G~lGR~---la~rLla~GH~V~Giar~r~~s~~~~~dF 44 (699) T PRK12320 1 MQILVTDATGAVGRS---VTRQLIAAGHTVSGIAQHPHDALDPRVDY 44 (699) T ss_pred CEEEEECCCHHHHHH---HHHHHHHCCCEEEEEECCCCCCCCCCCCE T ss_conf 947883462156778---99999866872454404798666754555 No 119 >PRK12362 germination protease; Provisional Probab=26.83 E-value=36 Score=15.53 Aligned_cols=54 Identities=19% Similarity=0.446 Sum_probs=31.9 Q ss_pred ECCCHHHHHHHHHHHCCCCCCCEEE---------------------ECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 3272799999998750464541999---------------------7148885310233345777899889871898532 Q gi|255764489|r 64 VHGTPVDCVVIALQKMSDKKPDLIL---------------------SGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFA 122 (250) Q Consensus 64 v~GtPaDcv~~~l~~l~~~~pDlVi---------------------SGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIA 122 (250) ..|--+--+.-|+ +-.-|||+|| .||+.|+-+|..=. ++---..|||=|| T Consensus 153 ~TGiET~EIIkgi--v~k~kPd~VIaIDALAaRs~~Rln~TIQIsDTGI~PGSGVGN~R~-------~l~~etLGVPVIA 223 (324) T PRK12362 153 ITGIETAEIIKGI--VEKIKPDLVIAIDALASRKVERVNTTIQISDTGISPGSGVGNKRM-------GLNEETLGVPVIA 223 (324) T ss_pred CCCCCHHHHHHHH--HHHHCCCEEEEEHHHHCCCHHHCCCEEEECCCCCCCCCCCCCCCC-------CCCHHHCCCCEEE T ss_conf 4430399999999--875089899996021137666536758865778589988675444-------2698782998899 Q ss_pred CCCC Q ss_conf 2200 Q gi|255764489|r 123 LSQA 126 (250) Q Consensus 123 iS~~ 126 (250) +-.- T Consensus 224 IGVP 227 (324) T PRK12362 224 IGVP 227 (324) T ss_pred ECCC T ss_conf 6588 No 120 >COG0451 WcaG Nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism] Probab=26.62 E-value=51 Score=14.57 Aligned_cols=39 Identities=23% Similarity=0.124 Sum_probs=29.6 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCE Q ss_conf 926882689868856999999998548919998179985650 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCL 42 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~ 42 (250) |+||||=-.|+=. +.|++.|.+.||+|+.+-...+.... T Consensus 1 ~~iLVtG~tGfiG---~~l~~~L~~~g~~V~~~~r~~~~~~~ 39 (314) T COG0451 1 MRILVTGGAGFIG---SHLVERLLAAGHDVRGLDRLRDGLDP 39 (314) T ss_pred CEEEEECCCCHHH---HHHHHHHHHCCCEEEEEECCCCCCCC T ss_conf 9699992887779---99999998589979999178754311 No 121 >PRK09265 aminotransferase AlaT; Validated Probab=26.28 E-value=45 Score=14.89 Aligned_cols=33 Identities=21% Similarity=0.378 Sum_probs=22.7 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCC Q ss_conf 268826898688569999999985489199981799 Q gi|255764489|r 2 RILLTNDDGIKSKGLITLENIARSISDDIWICAPEM 37 (250) Q Consensus 2 ~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~ 37 (250) +|+||| |- +.+|..+..++-.-|++|.|.+|.- T Consensus 97 ~I~vT~--Ga-~~al~~~~~all~pGDeVlvp~P~Y 129 (404) T PRK09265 97 DIYIGN--GV-SELIVMAMQALLNNGDEVLVPAPDY 129 (404) T ss_pred HEEECC--CH-HHHHHHHHHHHCCCCCEEEEECCCC T ss_conf 978758--88-9999999999649989899946887 No 122 >PRK10241 hydroxyacylglutathione hydrolase; Provisional Probab=26.12 E-value=52 Score=14.51 Aligned_cols=88 Identities=16% Similarity=0.202 Sum_probs=46.6 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCC-CEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCCC Q ss_conf 68826898688569999999985489199981799856-50320224897058897110476327279999999875046 Q gi|255764489|r 3 ILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQS-CLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMSD 81 (250) Q Consensus 3 ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S-~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~~ 81 (250) ||+|----=+.-|...|++ ..+ ++.|.+|...+. +..+.+.-+..+.+-....+++.+-|--.+++.+- T Consensus 49 IL~TH~H~DHi~G~~~L~~---~~~-~~~v~g~~~~~~~~~~~~v~dgd~i~~g~~~~~vi~TPGHT~Ghi~y~------ 118 (251) T PRK10241 49 IFLTHHHHDHVGGVKELVE---KFP-QIVVYGPQETQDKGTTQVVKDGETAFVLGHEFSVFATPGHTLGHICYF------ 118 (251) T ss_pred EEECCCCHHHHHHHHHHHH---HCC-CCEEECHHHCCCCCCCEEECCCCEEEECCEEEEEEECCCCCCCHHHHH------ T ss_conf 9945895576632999998---768-985867033267689878489999998998999997799983000111------ Q ss_pred CCCCEEEECCCCCCCCCHHHHHHHHHHHH Q ss_conf 45419997148885310233345777899 Q gi|255764489|r 82 KKPDLILSGVNVGTNTSNHVAYSGTLAAA 110 (250) Q Consensus 82 ~~pDlViSGiN~G~N~g~~v~ySGTvgAA 110 (250) ..+.+++| |++.+|.+|-- T Consensus 119 -~~~~lFtG---------DTLF~gGcGR~ 137 (251) T PRK10241 119 -SKPYLFCG---------DTLFSGGCGRL 137 (251) T ss_pred -CCCCEEEC---------CCCCCCCCCCC T ss_conf -37706751---------53004661478 No 123 >pfam07429 Fuc4NAc_transf 4-alpha-L-fucosyltransferase (Fuc4NAc transferase). This family contains the bacterial enzyme 4-alpha-L-fucosyltransferase (Fuc4NAc transferase) (EC 2.4.1.-) (approximately 360 residues long). This catalyses the synthesis of Fuc4NAc-ManNAcA-GlcNAc-PP-Und (lipid III) as part of the biosynthetic pathway of enterobacterial common antigen (ECA), a polysaccharide comprised of the trisaccharide repeat unit Fuc4NAc-ManNAcA-GlcNAc. Probab=25.56 E-value=54 Score=14.45 Aligned_cols=17 Identities=6% Similarity=0.069 Sum_probs=6.6 Q ss_pred EEECCCCCCCCHHHHHH Q ss_conf 99021535648899999 Q gi|255764489|r 227 SVTPITTDLTDYNSQQY 243 (250) Q Consensus 227 SVTPL~~DlT~~~~l~~ 243 (250) -|.||.+.-.+....++ T Consensus 217 iivPmgYPannq~Yi~~ 233 (361) T pfam07429 217 IIVPMGYPANNQAYIEQ 233 (361) T ss_pred EEEECCCCCCCHHHHHH T ss_conf 99978867876999999 No 124 >PRK08674 bifunctional phosphoglucose/phosphomannose isomerase; Validated Probab=25.32 E-value=54 Score=14.44 Aligned_cols=25 Identities=36% Similarity=0.582 Sum_probs=20.6 Q ss_pred HHHHH----HHHHHHHHHHHCCCCCCCCC Q ss_conf 33457----77899889871898532220 Q gi|255764489|r 101 VAYSG----TLAAAFEGSLQGIRSFALSQ 125 (250) Q Consensus 101 v~ySG----TvgAA~ea~~~GipsIAiS~ 125 (250) +-||| |+.|..+|.-.|..-|++|. T Consensus 85 vSySGnTeETL~a~~~A~~rga~vi~Its 113 (328) T PRK08674 85 VSYSGNTEETLSAVEQAKKRGAKIIAITS 113 (328) T ss_pred ECCCCCCHHHHHHHHHHHHCCCCEEEEEC T ss_conf 82899977999999999975995899948 No 125 >pfam08323 Glyco_transf_5 Starch synthase catalytic domain. Probab=25.15 E-value=55 Score=14.40 Aligned_cols=23 Identities=13% Similarity=0.265 Sum_probs=19.5 Q ss_pred HHHHHHHHHHCCCCEEEECCCCC Q ss_conf 99999999854891999817998 Q gi|255764489|r 16 LITLENIARSISDDIWICAPEMD 38 (250) Q Consensus 16 l~~l~~~l~~~~~~v~vvAP~~~ 38 (250) ...|.++|++.|++|.|+-|.-. T Consensus 21 v~~Lp~aL~~~G~~V~vi~P~Y~ 43 (229) T pfam08323 21 VGALPKALAKLGHDVRVILPRYG 43 (229) T ss_pred HHHHHHHHHHCCCEEEEEECCCC T ss_conf 99999999976994999932898 No 126 >TIGR01382 PfpI intracellular protease, PfpI family; InterPro: IPR006286 Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad . This group of cysteine peptidases belong to MEROPS peptidase family C56 (Pfp1 endopeptidase, clan PC(C)). The member of this family from Pyrococcus horikoshii has been solved to 2 Angstrom resolution. It is an ATP-independent intracellular protease that crystallises as a hexameric ring. Cys-101 is proposed as the active site residue in a catalytic triad with the adjacent His-102 and a Glu residue from an adjacent monomer. A member of this family from Bacillus subtilis, GSP18, has been shown to be expressed in response to several forms of stress. A role in the degradation of small peptides has been suggested. This family is contained in a larger one of the thiamine biosynthesis protein ThiJ and its homologs.; GO: 0016798 hydrolase activity acting on glycosyl bonds. Probab=24.74 E-value=55 Score=14.36 Aligned_cols=46 Identities=7% Similarity=0.241 Sum_probs=31.5 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCC-CCCEEEEEC Q ss_conf 2688268986885699999999854891999817998-565032022 Q gi|255764489|r 2 RILLTNDDGIKSKGLITLENIARSISDDIWICAPEMD-QSCLANSLT 47 (250) Q Consensus 2 ~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~-~S~~~~ait 47 (250) +||++=-|+++=.-|..=.+.|++.||+|.+|.|-.+ +...+.+|. T Consensus 1 ~l~~l~~~~fED~El~~P~~~l~~~G~~V~~v~~g~~~g~~v~~aih 47 (189) T TIGR01382 1 KLLVLAEDEFEDEELLEPLKRLKEAGHEVTVVSIGKEAGTIVGDAIH 47 (189) T ss_pred CEEEECCCCCCCHHHHHHHHHHHHCCCEEEEEEECCEECCEEECEEE T ss_conf 97887378862236777899998569889898316500232201101 No 127 >PRK13728 conjugal transfer protein TrbB; Provisional Probab=24.56 E-value=44 Score=14.94 Aligned_cols=49 Identities=20% Similarity=0.271 Sum_probs=34.1 Q ss_pred HHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC----CCEEECCC Q ss_conf 98718985322200135676614788888499999998677866----73543145 Q gi|255764489|r 113 GSLQGIRSFALSQAYTYENMIPWEVSETHAPRVLRQLLKTQIPN----TTLCNINF 164 (250) Q Consensus 113 a~~~GipsIAiS~~~~~~~~~~~~~a~~~~~~ii~~l~~~~~p~----~~vlNINi 164 (250) +.-+|++-.|+|++-.+ ...|..+.....+|++.++.+ +|. -.++|||- T Consensus 95 a~~~gi~V~~~SlDG~g--~~~FP~~lPaTpei~~tfF~n-~~~~tPatFLvNvnT 147 (181) T PRK13728 95 AQQYGFSVFPYTLDGQG--DTAFPEALPAPPDVMQTFFPN-IPVATPTTFLVNVNT 147 (181) T ss_pred HHHHCCEEEEEEECCCC--CCCCCCCCCCCHHHHHHHCCC-CCCCCCEEEEEECCC T ss_conf 99859717998536988--878886777987999997478-987886169973374 No 128 >cd02537 GT8_Glycogenin Glycogenin belongs the GT 8 family and initiates the biosynthesis of glycogen. Glycogenin initiates the biosynthesis of glycogen by incorporating glucose residues through a self-glucosylation reaction at a Tyr residue, and then acts as substrate for chain elongation by glycogen synthase and branching enzyme. It contains a conserved DxD motif and an N-terminal beta-alpha-beta Rossmann-like fold that are common to the nucleotide-binding domains of most glycosyltransferases. The DxD motif is essential for coordination of the catalytic divalent cation, most commonly Mn2+. Glycogenin can be classified as a retaining glycosyltransferase, based on the relative anomeric stereochemistry of the substrate and product in the reaction catalyzed. It is placed in glycosyltransferase family 8 which includes lipopolysaccharide glucose and galactose transferases and galactinol synthases. Probab=24.54 E-value=56 Score=14.33 Aligned_cols=31 Identities=32% Similarity=0.404 Sum_probs=22.6 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHCC--CCEEEECC Q ss_conf 6882689868856999999998548--91999817 Q gi|255764489|r 3 ILLTNDDGIKSKGLITLENIARSIS--DDIWICAP 35 (250) Q Consensus 3 ILitNDDG~~a~gl~~l~~~l~~~~--~~v~vvAP 35 (250) -|+|||+ ..+|..+|...|+..+ ++++|..+ T Consensus 5 Tllt~~~--Yl~GalvL~~sLr~~~s~~~lvvlvt 37 (240) T cd02537 5 TLLTNDD--YLPGALVLGYSLRKVGSSYDLVVLVT 37 (240) T ss_pred EEECCHH--HHHHHHHHHHHHHHHCCCCCEEEEEC T ss_conf 9975877--88999999999998599989999989 No 129 >PRK00358 pyrH uridylate kinase; Provisional Probab=24.53 E-value=56 Score=14.33 Aligned_cols=30 Identities=17% Similarity=0.317 Sum_probs=23.2 Q ss_pred EEECCCC--CCCHHHHHHHHHHH---HCCCCEEEE Q ss_conf 8826898--68856999999998---548919998 Q gi|255764489|r 4 LLTNDDG--IKSKGLITLENIAR---SISDDIWIC 33 (250) Q Consensus 4 LitNDDG--~~a~gl~~l~~~l~---~~~~~v~vv 33 (250) .||+++| ++..-|+.|++.+. +.|++|.+| T Consensus 11 ~L~~~~~~~l~~~~i~~la~~I~~l~~~G~evvlV 45 (231) T PRK00358 11 ALAGDEGFGIDPDVLDRIAQEIKELVELGVEVAVV 45 (231) T ss_pred HCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 50389999839999999999999999779969999 No 130 >pfam02577 DUF151 Uncharacterized ACR, COG1259. Probab=23.76 E-value=58 Score=14.24 Aligned_cols=21 Identities=19% Similarity=0.457 Sum_probs=14.8 Q ss_pred EEEEEEECCCHHHHHHHHHHH Q ss_conf 110476327279999999875 Q gi|255764489|r 58 SKKRFAVHGTPVDCVVIALQK 78 (250) Q Consensus 58 ~~~~~~v~GtPaDcv~~~l~~ 78 (250) +.+...+|..|.||+.+|+.. T Consensus 91 ~~~~~~iDaRPSDAIaLAlr~ 111 (133) T pfam02577 91 GGEIIEIDARPSDAIALALRT 111 (133) T ss_pred CCEEEEEECCHHHHHHHHHHC T ss_conf 997999844747899999971 No 131 >TIGR02114 coaB_strep phosphopantothenate--cysteine ligase; InterPro: IPR011848 In most bacteria, a single bifunctional protein catalyses phosphopantothenoylcysteine decarboxylase and phosphopantothenate--cysteine ligase activities, sequential steps in coenzyme A biosynthesis. These activities reside in separate proteins encoded by tandem genes in some bacterial lineages. This entry describes proteins from the genera Streptococcus and Enterococcus homologous to the C-terminal region of IPR005252 from INTERPRO, corresponding to phosphopantothenate--cysteine ligase activity .. Probab=23.72 E-value=49 Score=14.71 Aligned_cols=108 Identities=21% Similarity=0.298 Sum_probs=57.0 Q ss_pred CEEEEECC------CCC-----CCHHH--HHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCC Q ss_conf 92688268------986-----88569--999999985489199981799856503202248970588971104763272 Q gi|255764489|r 1 MRILLTND------DGI-----KSKGL--ITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGT 67 (250) Q Consensus 1 m~ILitND------DG~-----~a~gl--~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~Gt 67 (250) |+||||-. |-+ +|.|= +..+|-+-+.||+|+.|--.+---= +-...|+++++.. T Consensus 1 Mk~LvTsGGTsE~ID~VRSITNHStG~LGK~IaE~fL~~Gh~VtlvTTK~A~kP-----~~~~~Lsi~Eie~-------- 67 (253) T TIGR02114 1 MKILVTSGGTSEPIDNVRSITNHSTGGLGKIIAEKFLAAGHEVTLVTTKRALKP-----EPQLNLSIKEIET-------- 67 (253) T ss_pred CCEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCEEEEEECCCCCCC-----CCCCCCCEEEECC-------- T ss_conf 936881588636666635512337687217999999865878978753612188-----8888862486427-------- Q ss_pred HHHHHHHHHHHCCC-CCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 79999999875046-45419997148885310233345777899889871898532220013567661478888849999 Q gi|255764489|r 68 PVDCVVIALQKMSD-KKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQAYTYENMIPWEVSETHAPRVL 146 (250) Q Consensus 68 PaDcv~~~l~~l~~-~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~~~~~~~~~~~~a~~~~~~ii 146 (250) ++--...|..++. +..|.. |.|||+|- |...- ...-+++...+. T Consensus 68 -~~~L~~~L~~~v~~kq~d~l------------------------------iHsMAVSD-YTpvy---m~~~E~v~~Sfa 112 (253) T TIGR02114 68 -VKDLLTTLKELVAEKQHDIL------------------------------IHSMAVSD-YTPVY---MTDLEQVKDSFA 112 (253) T ss_pred -HHHHHHHHHHHCCCCCCCEE------------------------------EEEECCCC-CCHHH---HCCHHHHHHHHH T ss_conf -78999998875012554388------------------------------88511268-85100---011678999998 Q ss_pred HHHHHCCCCC Q ss_conf 9998677866 Q gi|255764489|r 147 RQLLKTQIPN 156 (250) Q Consensus 147 ~~l~~~~~p~ 156 (250) ++|-+.-+|+ T Consensus 113 qqLseqtLPk 122 (253) T TIGR02114 113 QQLSEQTLPK 122 (253) T ss_pred HHHHHHCCCC T ss_conf 7554203652 No 132 >PTZ00019 fructose-bisphosphate aldolase; Provisional Probab=23.60 E-value=55 Score=14.38 Aligned_cols=109 Identities=13% Similarity=0.103 Sum_probs=54.8 Q ss_pred EEEEEEEEECCCHHHHHHHHHHHCCCC------------CCCEEEECCCCCCCCCH-HHHHHHHHHHHHHHHHH---CC- Q ss_conf 971104763272799999998750464------------54199971488853102-33345777899889871---89- Q gi|255764489|r 56 TISKKRFAVHGTPVDCVVIALQKMSDK------------KPDLILSGVNVGTNTSN-HVAYSGTLAAAFEGSLQ---GI- 118 (250) Q Consensus 56 ~~~~~~~~v~GtPaDcv~~~l~~l~~~------------~pDlViSGiN~G~N~g~-~v~ySGTvgAA~ea~~~---Gi- 118 (250) |++.......|+|-.-+.-||++|..+ |---|++ |+.+.++=. ..+-.-.-+=|+-|++| |+ T Consensus 100 KVD~G~~~l~g~~~E~~T~GLDgL~eR~~~Y~~~GArFAKWRaV~~-I~~~~~~PS~~~I~~Na~~LArYA~icQ~~GlV 178 (356) T PTZ00019 100 KVDKGLVTIPCTDGETSTQGLDGLAERCKKYYKAGARFAKWRAVLV-IDLAKGKPSDLSITETAHTLARYAAICQENGLV 178 (356) T ss_pred EECCCCCCCCCCCCCEECCCCCCHHHHHHHHHHCCCCCEECCEEEE-ECCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 7178852189998231104645599999999986986100025889-658999986999998699999999999870873 Q ss_pred CCCCCCCCCCCCCC--CCHHHHHHHHHHHHHHHHHCCC-CCCCEEECCCC Q ss_conf 85322200135676--6147888884999999986778-66735431456 Q gi|255764489|r 119 RSFALSQAYTYENM--IPWEVSETHAPRVLRQLLKTQI-PNTTLCNINFP 165 (250) Q Consensus 119 psIAiS~~~~~~~~--~~~~~a~~~~~~ii~~l~~~~~-p~~~vlNINiP 165 (250) |=+-=-.-..++.. ..++..+++...+.+.|.+.+. -++++|--|+- T Consensus 179 PIVEPEVLmdG~H~ie~c~~VT~~vL~~vf~~L~~~~V~leG~lLKPnMV 228 (356) T PTZ00019 179 PIVEPEILADGSHTIEVCAEVTEKVLAAVFKALNDHGVLLEGALLKPNMV 228 (356) T ss_pred CCCCCHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEECCCCC T ss_conf 42140320377656799999999999999999997497406615116513 No 133 >PRK12314 gamma-glutamyl kinase; Provisional Probab=23.54 E-value=59 Score=14.21 Aligned_cols=30 Identities=13% Similarity=0.253 Sum_probs=22.0 Q ss_pred EEECCCC-CCCHHHHHHHHHH---HHCCCCEEEE Q ss_conf 8826898-6885699999999---8548919998 Q gi|255764489|r 4 LLTNDDG-IKSKGLITLENIA---RSISDDIWIC 33 (250) Q Consensus 4 LitNDDG-~~a~gl~~l~~~l---~~~~~~v~vv 33 (250) +||+++| ++..-|..|++.. ++.|++|.+| T Consensus 20 ~lt~~~g~~~~~~i~~la~~I~~l~~~G~~vvlV 53 (265) T PRK12314 20 SLSYENGKINLERIEQLVFVISDLMNKGKEVILV 53 (265) T ss_pred HEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 2678999859999999999999999789989998 No 134 >COG0763 LpxB Lipid A disaccharide synthetase [Cell envelope biogenesis, outer membrane] Probab=23.45 E-value=45 Score=14.89 Aligned_cols=19 Identities=37% Similarity=0.483 Sum_probs=14.8 Q ss_pred HHHHHHHHHHHHHHHHCCCCC Q ss_conf 334577789988987189853 Q gi|255764489|r 101 VAYSGTLAAAFEGSLQGIRSF 121 (250) Q Consensus 101 v~ySGTvgAA~ea~~~GipsI 121 (250) +.-|||+ +.|+++.|.|.| T Consensus 269 l~aSGT~--tLE~aL~g~P~V 287 (381) T COG0763 269 LAASGTA--TLEAALAGTPMV 287 (381) T ss_pred HHHCCHH--HHHHHHHCCCEE T ss_conf 9844679--999998299979 No 135 >CHL00148 orf27 Ycf27; Reviewed Probab=23.44 E-value=59 Score=14.20 Aligned_cols=80 Identities=18% Similarity=0.205 Sum_probs=50.4 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHCC Q ss_conf 92688268986885699999999854891999817998565032022489705889711047632727999999987504 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQKMS 80 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~l~ 80 (250) +||||.-||=--+. .|...|+..|++|..+. + ..-|+..+. T Consensus 7 ~kILiVEDd~~l~~---~l~~~L~~~G~~v~~a~------------------------------~------g~~a~~~~~ 47 (240) T CHL00148 7 EKILVVDDEASIRR---ILETRLSIIGYEVITAS------------------------------D------GEEALKLFH 47 (240) T ss_pred CEEEEEECCHHHHH---HHHHHHHHCCCEEEEEC------------------------------C------HHHHHHHHH T ss_conf 82999939899999---99999997899999989------------------------------9------999999997 Q ss_pred CCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 6454199971488853102333457778998898718985322200 Q gi|255764489|r 81 DKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQA 126 (250) Q Consensus 81 ~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~ 126 (250) ..+|||||--++...--|.++.- .-++ ...+|-|-+|.. T Consensus 48 ~~~~DlviLDi~LP~~dG~~l~~-----~iR~--~~~~PII~LTa~ 86 (240) T CHL00148 48 QEQPDLVILDVMMPKLDGYGVCQ-----EIRK--ESDVPIIMLTAL 86 (240) T ss_pred HCCCCEEEEECCCCCCCCCEEEE-----CCCC--CCCCCEEEEECC T ss_conf 47999999979999888663054-----1403--799548998167 No 136 >pfam03418 Peptidase_A25 Germination protease. Probab=23.01 E-value=49 Score=14.67 Aligned_cols=39 Identities=28% Similarity=0.498 Sum_probs=27.0 Q ss_pred CCCCCEEE---------------------ECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 64541999---------------------71488853102333457778998898718985322200 Q gi|255764489|r 81 DKKPDLIL---------------------SGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQA 126 (250) Q Consensus 81 ~~~pDlVi---------------------SGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~ 126 (250) .-|||+|| .||+.|+-+|..=. ++---..|||=||+-.- T Consensus 168 ~~kPd~vIaIDALAaRs~~Rln~TIQisdTGI~PGSGVGN~R~-------~i~~etLGVPVIAIGVP 227 (355) T pfam03418 168 KSKPDFIIAIDALAARAIERVNATIQISDSGIHPGSGVGNKRK-------DISKDTLGIPVIAIGIP 227 (355) T ss_pred HCCCCEEEEEHHHHCCCHHHHCCEEEECCCCCCCCCCCCCCCC-------CCCHHHCCCCEEEECCC T ss_conf 2089889996442306467616757843678688876564333-------15886729977995388 No 137 >cd00401 AdoHcyase S-adenosyl-L-homocysteine hydrolase (AdoHycase) catalyzes the hydrolysis of S-adenosyl-L-homocysteine (AdoHyc) to form adenosine (Ado) and homocysteine (Hcy). The equilibrium lies far on the side of AdoHyc synthesis, but in nature the removal of Ado and Hyc is sufficiently fast, so that the net reaction is in the direction of hydrolysis. Since AdoHyc is a potent inhibitor of S-adenosyl-L-methionine dependent methyltransferases, AdoHycase plays a critical role in the modulation of the activity of various methyltransferases. The enzyme forms homooligomers of 45-50kDa subunits, each binding one molecule of NAD+. Probab=22.62 E-value=61 Score=14.10 Aligned_cols=98 Identities=16% Similarity=0.233 Sum_probs=51.5 Q ss_pred HHHHHHHHCCCCEEEEC--CCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHH----HHHHHHHHCCCCCCCEEEECC Q ss_conf 99999985489199981--799856503202248970588971104763272799----999998750464541999714 Q gi|255764489|r 18 TLENIARSISDDIWICA--PEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVD----CVVIALQKMSDKKPDLILSGV 91 (250) Q Consensus 18 ~l~~~l~~~~~~v~vvA--P~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaD----cv~~~l~~l~~~~pDlViSGi 91 (250) .|.+.|++.|.+|.+++ |-+-|--.-.++- +.+..+|+-.|--.+ |..-+|.+ .+.+||+++- T Consensus 51 ~L~~tL~~~GA~V~~~~~NplSTQDdvaAaL~--------~~gi~VfA~~g~s~eey~~~l~~~L~~-~~~~P~iiiD-- 119 (413) T cd00401 51 VLIETLVALGAEVRWSSCNIFSTQDHAAAAIA--------AAGIPVFAWKGETLEEYWWCIEQALKF-PDGEPNMILD-- 119 (413) T ss_pred HHHHHHHHCCCEEEEECCCCCCCHHHHHHHHH--------HCCCEEEEECCCCHHHHHHHHHHHHHC-CCCCCCEEEC-- T ss_conf 99999998599899963797653289999999--------679769984799989999999999828-7999646861-- Q ss_pred CCCCCCCHH------HHHHHHHHHH------------HHHH-HHCCCCCCCCCCC Q ss_conf 888531023------3345777899------------8898-7189853222001 Q gi|255764489|r 92 NVGTNTSNH------VAYSGTLAAA------------FEGS-LQGIRSFALSQAY 127 (250) Q Consensus 92 N~G~N~g~~------v~ySGTvgAA------------~ea~-~~GipsIAiS~~~ 127 (250) .|.-+-.- -++++-+|++ |+.. -..+|.||+--+. T Consensus 120 -DG~Dl~~~~h~~~~~~~~~i~G~~EETTTGv~RL~am~~~g~L~fP~iaVNDa~ 173 (413) T cd00401 120 -DGGDLTLLIHKKHPELLPGIRGISEETTTGVHRLYKMEKEGKLKFPAINVNDSV 173 (413) T ss_pred -CCCCHHHHHHHCCHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCEEEECCHH T ss_conf -441221213405746774201245553088999999997698278678863666 No 138 >TIGR02738 TrbB type-F conjugative transfer system pilin assembly thiol-disulfide isomerase TrbB; InterPro: IPR014109 This entry includes thiol-disulphide isomerase (TrbB), a protein that is involved in conjugative transfer of plasmid DNA, specifically in the F-type system. TrbB has been predicted to contain a thioredoxin fold, it also contains a conserved pair of cysteines and has been shown to function as a thiol disulphide isomerase by complementation of Escherichia coli DsbA mutants . TtbB is believed to be involved in pilin assembly . TrbB is closely related to TraF (IPR014110 from INTERPRO) though somewhat longer lacking the cysteine motif and is apparently not functional as a disulphide bond isomerase.. Probab=22.61 E-value=43 Score=15.04 Aligned_cols=51 Identities=12% Similarity=0.178 Sum_probs=34.5 Q ss_pred HHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEECCCCCCCC Q ss_conf 987189853222001356766147888884999999986778667354314567881 Q gi|255764489|r 113 GSLQGIRSFALSQAYTYENMIPWEVSETHAPRVLRQLLKTQIPNTTLCNINFPRCSP 169 (250) Q Consensus 113 a~~~GipsIAiS~~~~~~~~~~~~~a~~~~~~ii~~l~~~~~p~~~vlNINiP~~~~ 169 (250) +.-.|++=.|+|++- .....|+.+.....++.+.++.++ ...+|++|..-. T Consensus 97 ~~~~~~~V~a~SlDG--~g~~~FP~~~P~tpE~~~~~F~~~----ey~~~~~PalFL 147 (176) T TIGR02738 97 SQQFGLSVYAFSLDG--QGLAEFPDALPATPEVMQTFFPNG----EYRPVVTPALFL 147 (176) T ss_pred HHHCCCCEEEEEECC--CCCCCCCCCCCCCHHHHHHHCCCC----CCCEECCCEEEE T ss_conf 876287537887338--978778876578765787633889----703010232233 No 139 >PRK06951 consensus Probab=22.51 E-value=61 Score=14.09 Aligned_cols=48 Identities=13% Similarity=0.191 Sum_probs=28.8 Q ss_pred CEEEEECCCCC--------------CCHHHHHHHHHHHHCCC--CEEEECCCCCCCCEEEEECCCCCE Q ss_conf 92688268986--------------88569999999985489--199981799856503202248970 Q gi|255764489|r 1 MRILLTNDDGI--------------KSKGLITLENIARSISD--DIWICAPEMDQSCLANSLTMSRNI 52 (250) Q Consensus 1 m~ILitNDDG~--------------~a~gl~~l~~~l~~~~~--~v~vvAP~~~~S~~~~ait~~~~~ 52 (250) |.||++.|||| ...-+..|.+++..+.+ +|.+|-=. |.|.+++-+-.+ T Consensus 1 m~i~~e~~~~Va~itLnrP~~~Nal~~~m~~el~~~~~~~~~d~~vrvvvl~----g~g~~FcaG~Dl 64 (254) T PRK06951 1 MDILVERAAGVLTITFARPAKKNAITAAMYQTMADALAAAQDDAAVRAILIR----GSDGIFSAGNDL 64 (254) T ss_pred CEEEEEEECCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEE----CCCCCEECCCCH T ss_conf 9589999899999997488746898999999999999998668891799998----799987689975 No 140 >PRK10017 putative pyruvyl transferase; Provisional Probab=22.40 E-value=47 Score=14.79 Aligned_cols=34 Identities=15% Similarity=0.317 Sum_probs=23.1 Q ss_pred CEEEEECCCCCCCHH----HHHHHHHHHHC--CCCEEEEC Q ss_conf 926882689868856----99999999854--89199981 Q gi|255764489|r 1 MRILLTNDDGIKSKG----LITLENIARSI--SDDIWICA 34 (250) Q Consensus 1 m~ILitNDDG~~a~g----l~~l~~~l~~~--~~~v~vvA 34 (250) |||+|++=-|..--| |+.+.+.|++. +-+++|.- T Consensus 1 MkIvI~G~yg~~N~GDeAIL~siI~~Lr~~~p~~~I~VlS 40 (426) T PRK10017 1 MKLLILGNHTCGNRGDSAILRGLLDAINILNPHAEVDVMS 40 (426) T ss_pred CEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEE T ss_conf 9799983607987427999999999999758999689995 No 141 >pfam05221 AdoHcyase S-adenosyl-L-homocysteine hydrolase. Probab=22.36 E-value=62 Score=14.07 Aligned_cols=100 Identities=14% Similarity=0.142 Sum_probs=51.2 Q ss_pred HHHHHHHHHCCCCEEEEC--CCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCH-H---HHHHHHHHHCCCCCCCEEEEC Q ss_conf 999999985489199981--7998565032022489705889711047632727-9---999999875046454199971 Q gi|255764489|r 17 ITLENIARSISDDIWICA--PEMDQSCLANSLTMSRNIACRTISKKRFAVHGTP-V---DCVVIALQKMSDKKPDLILSG 90 (250) Q Consensus 17 ~~l~~~l~~~~~~v~vvA--P~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtP-a---Dcv~~~l~~l~~~~pDlViSG 90 (250) ..|.+.|++.|.+|.+++ |-+.|--.-.++- +.+..+||..|-. - +|..-+|...-+..|+++|- T Consensus 57 A~L~~tL~~~GA~V~~~~~NplSTQDdvaAaL~--------~~Gi~VfA~~g~t~eey~~~~~~~L~~~~~~~P~iiiD- 127 (430) T pfam05221 57 AVLIETLVALGAEVRWASCNIFSTQDHAAAAIA--------AAGVPVFAWKGETLEEYWWCIERALTFKDGQGPNMILD- 127 (430) T ss_pred HHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHH--------HCCCEEEEECCCCHHHHHHHHHHHHCCCCCCCCCEEEC- T ss_conf 999999998598799965897532379999999--------67976999379998999999999846588989656542- Q ss_pred CCCCCCCCHH------HHHHHHHHHH------------HHHH-HHCCCCCCCCCCC Q ss_conf 4888531023------3345777899------------8898-7189853222001 Q gi|255764489|r 91 VNVGTNTSNH------VAYSGTLAAA------------FEGS-LQGIRSFALSQAY 127 (250) Q Consensus 91 iN~G~N~g~~------v~ySGTvgAA------------~ea~-~~GipsIAiS~~~ 127 (250) .|.-+-.- -+..+-+|++ |+.. -..+|.||+--+. T Consensus 128 --DG~Dl~~~~h~~~~~l~~~i~G~~EETTTGv~RL~am~~~g~L~fPviaVNDa~ 181 (430) T pfam05221 128 --DGGDATLLIHKKYPQYLAGIRGVSEETTTGVHRLYKMLKKGKLLFPAINVNDSV 181 (430) T ss_pred --CCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCEEEECCHH T ss_conf --541252207856288886516715321478998788986596064179967666 No 142 >PRK12342 putative electron transfer flavoprotein YdiQ; Provisional Probab=22.30 E-value=62 Score=14.06 Aligned_cols=103 Identities=16% Similarity=0.150 Sum_probs=51.0 Q ss_pred HHHHHHHH--HHCCCCEEEE--CCCCCCCC--EEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHH-CCCCCCCEEE Q ss_conf 99999999--8548919998--17998565--0320224897058897110476327279999999875-0464541999 Q gi|255764489|r 16 LITLENIA--RSISDDIWIC--APEMDQSC--LANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQK-MSDKKPDLIL 88 (250) Q Consensus 16 l~~l~~~l--~~~~~~v~vv--AP~~~~S~--~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~-l~~~~pDlVi 88 (250) ..+|.+++ ++.|.+|+++ .|...+.- .=.++..+-. +...+.++.++-.-|.+ +...|.. +....+|||+ T Consensus 38 ~~AlE~Al~lk~~gg~v~vvs~G~~~~~~~~~~r~alAmGaD-~a~li~d~~~~g~D~~~--tA~~La~~i~~~~~DLVl 114 (254) T PRK12342 38 LNAIEAASQLATDGDEIAALTVGGSLLQNSKVRKDVLSRGPH-SLYLVQDAQLEHALPLD--TAKALAAAIEKIGFDLLL 114 (254) T ss_pred HHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHCCC-EEEEEECCCCCCCCHHH--HHHHHHHHHHHHCCCEEE T ss_conf 999999999886199699999568802559999999973798-89998457656668899--999999999983969999 Q ss_pred ECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 7148885310233345777899889871898532220013 Q gi|255764489|r 89 SGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFALSQAYT 128 (250) Q Consensus 89 SGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAiS~~~~ 128 (250) .|--.-. .++|.||+.+ |.++|+|.+..-.... T Consensus 115 ~G~~s~D------~~tgqvg~~l-Ae~Lg~P~vt~v~~i~ 147 (254) T PRK12342 115 FGEGSGD------LYAQQVGLLL-GELLQLPVINAVSKIQ 147 (254) T ss_pred EECCCCC------CCCCCHHHHH-HHHCCCCCEEEEEEEE T ss_conf 9362124------8988578999-9870997166799999 No 143 >TIGR01130 ER_PDI_fam protein disulfide isomerases; InterPro: IPR005792 This family represents eukaryotic protein disulphide isomerases retained in the endoplasmic reticulum (ER) and other closely related forms. Some members have been assigned alternative or additional functions such as prolyl 4-hydroxylase and dolichyl-diphosphooligosaccharide-protein glycotransferase. Members of this family have at least two protein-disulphide domains, each similar to thioredoxin but with the redox-active disulphide in the motif PWCGHCK, and an ER retention signal at the extreme C-terminus (KDEL, HDEL, and similar motifs). ; GO: 0016853 isomerase activity, 0005783 endoplasmic reticulum. Probab=22.21 E-value=44 Score=14.96 Aligned_cols=37 Identities=16% Similarity=0.424 Sum_probs=22.3 Q ss_pred CCEEEE---CCCCCCCCEEEEECCC--------------CCEEEEEEEE-------EEEEECCCHH Q ss_conf 919998---1799856503202248--------------9705889711-------0476327279 Q gi|255764489|r 28 DDIWIC---APEMDQSCLANSLTMS--------------RNIACRTISK-------KRFAVHGTPV 69 (250) Q Consensus 28 ~~v~vv---AP~~~~S~~~~ait~~--------------~~~~~~~~~~-------~~~~v~GtPa 69 (250) |++++| || |||||=++- -++.+-+++- ..|.|+|.|+ T Consensus 18 ~~~vLVeFYAP-----WCGHCK~LAPEY~~AA~~L~~~~p~i~LAKVDaT~e~~La~kygV~GYPT 78 (522) T TIGR01130 18 NEFVLVEFYAP-----WCGHCKSLAPEYEKAATELKKKGPPIKLAKVDATEEKDLAQKYGVSGYPT 78 (522) T ss_pred CCEEEEEECCC-----CCCHHCCCCHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHCCCCCCCC T ss_conf 98589995288-----34411053436899999998558980488840761032250257664543 No 144 >PRK03359 putative electron transfer flavoprotein FixA; Reviewed Probab=21.94 E-value=63 Score=14.02 Aligned_cols=99 Identities=14% Similarity=0.136 Sum_probs=48.0 Q ss_pred HHHHHHHHHH--H-CC-CCEEEEC--CCCCCC--CEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHH-CCCCCCC Q ss_conf 6999999998--5-48-9199981--799856--50320224897058897110476327279999999875-0464541 Q gi|255764489|r 15 GLITLENIAR--S-IS-DDIWICA--PEMDQS--CLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQK-MSDKKPD 85 (250) Q Consensus 15 gl~~l~~~l~--~-~~-~~v~vvA--P~~~~S--~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~-l~~~~pD 85 (250) -..+|.++++ + .| .+|+++. |..-.+ ..-.++..+-. +...+.+..| .|+-+-++--.|.. +....+| T Consensus 38 De~AlE~AlrlkE~~gg~~VtvvsvG~~~~~~~~~lr~aLAmGaD-~ai~i~d~~~--~~~D~~~tA~~La~~i~~~~~D 114 (256) T PRK03359 38 DLNAIEAACQLKQQAAEAQVTALSVGGKALTNAKGRKDVLSRGPD-ELIVVIDDQF--EQALPQQTASALAAAAQKAGFD 114 (256) T ss_pred CHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHHHCCCC-EEEEECCCCC--CCCCHHHHHHHHHHHHHHCCCC T ss_conf 689999999988851995799999377544559999999975897-2899757654--4437799999999999872999 Q ss_pred EEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 99971488853102333457778998898718985322 Q gi|255764489|r 86 LILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFAL 123 (250) Q Consensus 86 lViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIAi 123 (250) ||+.|-..- =.++|.||+.. |.++|+|.+.. T Consensus 115 LVl~G~~s~------D~~~gqvg~~l-Ae~Lg~P~vt~ 145 (256) T PRK03359 115 LILCGDGSS------DLYAQQVGLLV-GEILNIPAING 145 (256) T ss_pred EEEEECCCC------CCCCCCCCHHH-HHHHCCCHHEE T ss_conf 999925102------59998522899-99829852200 No 145 >PRK11574 hypothetical protein; Provisional Probab=21.93 E-value=63 Score=14.02 Aligned_cols=61 Identities=7% Similarity=0.065 Sum_probs=39.6 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCC---CEEEEECCCCCEEEEEEEEEEE Q ss_conf 268826898688569999999985489199981799856---5032022489705889711047 Q gi|255764489|r 2 RILLTNDDGIKSKGLITLENIARSISDDIWICAPEMDQS---CLANSLTMSRNIACRTISKKRF 62 (250) Q Consensus 2 ~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~~S---~~~~ait~~~~~~~~~~~~~~~ 62 (250) ++||-==||++---..+.++.|+..|.+|.++....+.+ -.+|.+++.-...+..+....| T Consensus 4 k~LV~lA~GfEEiEa~~~vDvLRRagi~V~~asv~~~~~~~v~gs~gi~i~aD~~l~~v~~~~y 67 (196) T PRK11574 4 SALVCLAPGSEETEAVTTIDLLVRGGIKVTTASVASDGNLTITCSRGVKLLADAPLVEVADGEY 67 (196) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCEEEECCCCEEECCCCHHHCCCCCC T ss_conf 6999916985021251899999777984999985389973799479978957867777782558 No 146 >TIGR03590 PseG pseudaminic acid biosynthesis-associated protein PseG. This protein is found in association with enzymes involved in the biosynthesis of pseudaminic acid, a component of polysaccharide in certain Pseudomonas strains as well as a modification of flagellin in Campylobacter and Hellicobacter. The role of this protein is unclear, although it may participate in N-acetylation in conjunction with, or in the absence of PseH (TIGR03585) as it often scores above the trusted cutoff to pfam00583 representing a family of acetyltransferases. Probab=21.90 E-value=45 Score=14.90 Aligned_cols=40 Identities=23% Similarity=0.306 Sum_probs=28.0 Q ss_pred CEEEEECCCCCCCHH------HHHHHHHHHHCCCCEEEECCCCCCCC Q ss_conf 926882689868856------99999999854891999817998565 Q gi|255764489|r 1 MRILLTNDDGIKSKG------LITLENIARSISDDIWICAPEMDQSC 41 (250) Q Consensus 1 m~ILitNDDG~~a~g------l~~l~~~l~~~~~~v~vvAP~~~~S~ 41 (250) |||++--|=|- .-| -.+|++++++.|++|..+.-....+. T Consensus 1 mkI~fr~d~~~-~iG~GH~~RclaLA~~l~~~g~~v~f~~~~~~~~~ 46 (280) T TIGR03590 1 MRILFRADASS-EIGLGHVMRCLTLARELRKRGAEVAFACKTLPGDL 46 (280) T ss_pred CEEEEEEECCC-CEEHHHHHHHHHHHHHHHHCCCEEEEEEECCCHHH T ss_conf 97999996789-91320899999999999988994999992795889 No 147 >TIGR00213 GmhB_yaeD D,D-heptose 1,7-bisphosphate phosphatase; InterPro: IPR004446 In a number of species, including Escherichia coli, the histidine biosynthetic enzymes imidazole glycerol phosphate dehydratase and histidinol phosphatase are found together in the bifunctional protein HisB. This family represents a protein closely related to the histidinol phosphatase domain of HisB. The protein is found both in Helicobacter pylori, for which the histidine biosynthetic pathway appears to be absent, and in species that also have a bifunctional HisB protein.. Probab=21.90 E-value=63 Score=14.01 Aligned_cols=30 Identities=17% Similarity=0.231 Sum_probs=22.6 Q ss_pred CHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEE Q ss_conf 856999999998548919998179985650320 Q gi|255764489|r 13 SKGLITLENIARSISDDIWICAPEMDQSCLANS 45 (250) Q Consensus 13 a~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~a 45 (250) -.|+--+-+.+++.|+..++| +||||..+. T Consensus 28 ~~Gv~~~l~~~K~~Gy~L~lv---TNQSGI~RG 57 (178) T TIGR00213 28 IDGVIDALRELKKLGYKLVLV---TNQSGIARG 57 (178) T ss_pred HHHHHHHHHHHHHCCCEEEEE---ECCCCCCCC T ss_conf 566999999987458278898---745655435 No 148 >COG0745 OmpR Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription] Probab=21.86 E-value=63 Score=14.01 Aligned_cols=31 Identities=29% Similarity=0.348 Sum_probs=21.8 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEEC Q ss_conf 9268826898688569999999985489199981 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICA 34 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvA 34 (250) |||||.-||=--+. .|...|+..|++|..+. T Consensus 1 ~~ILiveDd~~i~~---~l~~~L~~~G~~v~~~~ 31 (229) T COG0745 1 MRILLVEDDPELAE---LLKEYLEEEGYEVDVAA 31 (229) T ss_pred CEEEEEECCHHHHH---HHHHHHHHCCCEEEEEC T ss_conf 98999918889999---99999987597899978 No 149 >PRK08942 D,D-heptose 1,7-bisphosphate phosphatase; Validated Probab=21.85 E-value=63 Score=14.01 Aligned_cols=30 Identities=20% Similarity=0.201 Sum_probs=15.6 Q ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEE Q ss_conf 569999999985489199981799856503202 Q gi|255764489|r 14 KGLITLENIARSISDDIWICAPEMDQSCLANSL 46 (250) Q Consensus 14 ~gl~~l~~~l~~~~~~v~vvAP~~~~S~~~~ai 46 (250) ||.....+.|++.|..+.|| +||||.|+.. T Consensus 32 ~g~~~al~~l~~~g~~~~iv---TNQsGI~rG~ 61 (181) T PRK08942 32 PGSIEAIARLKQAGYRVVVA---TNQSGIARGL 61 (181) T ss_pred CCHHHHHHHHHHCCCEEEEE---ECCHHHCCCC T ss_conf 78899999999879969999---5871342586 No 150 >TIGR01821 5aminolev_synth 5-aminolevulinic acid synthase; InterPro: IPR010961 Tetrapyrroles are large macrocyclic compounds derived from a common biosynthetic pathway . The end-product, uroporphyrinogen III, is used to synthesise a number of important molecules, including vitamin B12, haem, sirohaem, chlorophyll, coenzyme F430 and phytochromobilin . The first stage in tetrapyrrole synthesis is the synthesis of 5-aminoaevulinic acid ALA via two possible routes: (1) condensation of succinyl CoA and glycine (C4 pathway) using ALA synthase (2.3.1.37 from EC), or (2) decarboxylation of glutamate (C5 pathway) via three different enzymes, glutamyl-tRNA synthetase (6.1.1.17 from EC) to charge a tRNA with glutamate, glutamyl-tRNA reductase (1.2.1.70 from EC) to reduce glutamyl-tRNA to glutamate-1-semialdehyde (GSA), and GSA aminotransferase (5.4.3.8 from EC) to catalyse a transamination reaction to produce ALA. The second stage is to convert ALA to uroporphyrinogen III, the first macrocyclic tetrapyrrolic structure in the pathway. This is achieved by the action of three enzymes in one common pathway: porphobilinogen (PBG) synthase (or ALA dehydratase, 4.2.1.24 from EC) to condense two ALA molecules to generate porphobilinogen; hydroxymethylbilane synthase (or PBG deaminase, 2.5.1.61 from EC) to polymerise four PBG molecules into preuroporphyrinogen (tetrapyrrole structure); and uroporphyrinogen III synthase (4.2.1.75 from EC) to link two pyrrole units together (rings A and D) to yield uroporphyrinogen III. Uroporphyrinogen III is the first branch point of the pathway. To synthesise cobalamin (vitamin B12), sirohaem, and coenzyme F430, uroporphyrinogen III needs to be converted into precorrin-2 by the action of uroporphyrinogen III methyltransferase (2.1.1.107 from EC). To synthesise haem and chlorophyll, uroporphyrinogen III needs to be decarboxylated into coproporphyrinogen III by the action of uroporphyrinogen III decarboxylase (4.1.1.37 from EC) . This entry represents 5-aminoaevulinic acid (ALA) synthase (2.3.1.37 from EC), which catalyses the first stage of tetrapyrrole biosynthesis by the C4 pathway, namely the condensation of succinyl CoA and glycine. ALA synthase is a pyridoxal-phosphate-dependent enzyme. During catalysis, glycine initially binds to the enzyme cofactor, and after condensation with succinyl CoA, CoA, carbon dioxide and 5-aminolevulinic acid are produced .; GO: 0003870 5-aminolevulinate synthase activity, 0030170 pyridoxal phosphate binding, 0033014 tetrapyrrole biosynthetic process. Probab=21.68 E-value=35 Score=15.55 Aligned_cols=19 Identities=53% Similarity=0.714 Sum_probs=13.4 Q ss_pred HHHHHHHHHHHHHHCCCCCCCC Q ss_conf 4577789988987189853222 Q gi|255764489|r 103 YSGTLAAAFEGSLQGIRSFALS 124 (250) Q Consensus 103 ySGTvgAA~ea~~~GipsIAiS 124 (250) -+||+|=|+ |++=|+ ||=| T Consensus 264 i~GTLaKAF-G~~GGY--Ia~~ 282 (427) T TIGR01821 264 IEGTLAKAF-GVVGGY--IAAS 282 (427) T ss_pred CCCCCHHHC-CCCCCE--EECC T ss_conf 016511223-444530--2055 No 151 >PRK05928 hemD uroporphyrinogen-III synthase; Reviewed Probab=21.20 E-value=65 Score=13.93 Aligned_cols=33 Identities=24% Similarity=0.271 Sum_probs=25.2 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCC Q ss_conf 92688268986885699999999854891999817998 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICAPEMD 38 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvAP~~~ 38 (250) ||||||=-.+- -..|.+.|++.|+++ +..|--+ T Consensus 2 ~~IlvTRp~~~----~~~l~~~L~~~G~~~-~~~Pli~ 34 (252) T PRK05928 2 MKILLTRPSPA----GEELVELLRKLGFVA-LHLPLIE 34 (252) T ss_pred CEEEECCCHHH----HHHHHHHHHHCCCCE-EEECCEE T ss_conf 98998697788----999999999889969-9967888 No 152 >pfam08660 Alg14 Oligosaccharide biosynthesis protein Alg14 like. Alg14 is involved dolichol-linked oligosaccharide biosynthesis and anchors the catalytic subunit Alg13 to the ER membrane. Probab=21.17 E-value=65 Score=13.92 Aligned_cols=95 Identities=16% Similarity=0.215 Sum_probs=43.9 Q ss_pred HHHHHHHHHHHHCC-CCEEEECCCCCCCCEEEEECCCCC-EEEEEEEEEEEEEC-------CCHHHHHHHHHHHCCCCCC Q ss_conf 56999999998548-919998179985650320224897-05889711047632-------7279999999875046454 Q gi|255764489|r 14 KGLITLENIARSIS-DDIWICAPEMDQSCLANSLTMSRN-IACRTISKKRFAVH-------GTPVDCVVIALQKMSDKKP 84 (250) Q Consensus 14 ~gl~~l~~~l~~~~-~~v~vvAP~~~~S~~~~ait~~~~-~~~~~~~~~~~~v~-------GtPaDcv~~~l~~l~~~~p 84 (250) .-+..|.+.+.+.. +..++++=..+.|. ...-.+... ..+..+ .+.+.+. -+-.-+...++.-+...|| T Consensus 12 ~eml~L~~~l~~~~~~~~yvv~~~D~~s~-~~~~~~~~~~~~i~~~-~r~R~v~qs~~~s~~~~~~~~~~s~~il~k~kP 89 (166) T pfam08660 12 AEMLRLLPFLKNLEDPRRYVVTETDKDSL-SKASKFEKSRGGILKI-PRAREVGQSYLRSIFTTLRNLLSALKLLRRERP 89 (166) T ss_pred HHHHHHHHHHHHCCCCCEEEEECCCHHHH-HHHHHCCCCCCEEEEC-CCEEECCCHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99999999877568973899988980789-9988605555237644-631573855674599999999999999985399 Q ss_pred CEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 1999714888531023334577789988987189853 Q gi|255764489|r 85 DLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSF 121 (250) Q Consensus 85 DlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsI 121 (250) |+||| .|...+.= .++-|-+++++.+ T Consensus 90 dvii~---tG~g~~vp--------~~~~a~ll~~~~~ 115 (166) T pfam08660 90 DVILC---NGPGTCVP--------FCLAAKLLKILGL 115 (166) T ss_pred CEEEE---CCCCEEEH--------HHHHHHHHHHHHC T ss_conf 89997---79960309--------9999999986401 No 153 >cd04955 GT1_like_6 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=21.11 E-value=66 Score=13.91 Aligned_cols=24 Identities=4% Similarity=-0.132 Sum_probs=19.0 Q ss_pred HHHHHHHHHHHCCCCEEEECCCCC Q ss_conf 699999999854891999817998 Q gi|255764489|r 15 GLITLENIARSISDDIWICAPEMD 38 (250) Q Consensus 15 gl~~l~~~l~~~~~~v~vvAP~~~ 38 (250) -+..|++.|.+.||+|+|+++... T Consensus 20 ~v~~La~~L~~~Gh~V~v~t~~~~ 43 (363) T cd04955 20 FVEELAPRLVARGHEVTVYCRSPY 43 (363) T ss_pred HHHHHHHHHHHCCCEEEEEECCCC T ss_conf 999999999977997999987898 No 154 >TIGR02524 dot_icm_DotB Dot/Icm secretion system ATPase DotB; InterPro: IPR013363 Proteins in this entry are the DotB component of Dot/Icm secretion systems, as found in the obligate intracellular pathogens Legionella pneumophila and Coxiella burnetii. While this system resembles type IV secretion systems and has been called a form of type IV, the literature now seems to favor calling this the Dot/Icm system. This family is most closely related to the TraJ proteins of plasmid transfer (IPR013364 from INTERPRO), rather than to proteins of other type IV secretion systems.. Probab=21.08 E-value=66 Score=13.91 Aligned_cols=39 Identities=28% Similarity=0.486 Sum_probs=28.8 Q ss_pred HHHHCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 987504645419997148885310233345777899889871898532 Q gi|255764489|r 75 ALQKMSDKKPDLILSGVNVGTNTSNHVAYSGTLAAAFEGSLQGIRSFA 122 (250) Q Consensus 75 ~l~~l~~~~pDlViSGiN~G~N~g~~v~ySGTvgAA~ea~~~GipsIA 122 (250) |+..-+.+||.+++-|=-.- .-|+.|++||++-|-|--. T Consensus 204 GvrnalrrkP~~i~vGe~rd---------~etisa~leaaltGhPvyt 242 (358) T TIGR02524 204 GVRNALRRKPRLILVGEARD---------AETISAVLEAALTGHPVYT 242 (358) T ss_pred HHHHHHHCCCCEEEEECCCC---------HHHHHHHHHHHHCCCCCEE T ss_conf 88887522885689833644---------1579999988743884001 No 155 >PRK13151 consensus Probab=20.75 E-value=67 Score=13.87 Aligned_cols=31 Identities=16% Similarity=0.232 Sum_probs=26.2 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEEC Q ss_conf 9268826898688569999999985489199981 Q gi|255764489|r 1 MRILLTNDDGIKSKGLITLENIARSISDDIWICA 34 (250) Q Consensus 1 m~ILitNDDG~~a~gl~~l~~~l~~~~~~v~vvA 34 (250) |||-|-| +.+-.|+.+.++++.+|+++.++- T Consensus 1 MkI~IiD---yg~GNi~Sv~~al~~lg~~~~i~~ 31 (195) T PRK13151 1 MKIIIID---TACANLASLKFCLDRLGFNATISR 31 (195) T ss_pred CEEEEEE---CCCHHHHHHHHHHHHCCCCEEEEC T ss_conf 9899994---796099999999998599879967 No 156 >COG1497 Predicted transcriptional regulator [Transcription] Probab=20.39 E-value=47 Score=14.81 Aligned_cols=22 Identities=32% Similarity=0.530 Sum_probs=18.2 Q ss_pred HHHHHHHHHHHHHCCCCCCCCC Q ss_conf 5777899889871898532220 Q gi|255764489|r 104 SGTLAAAFEGSLQGIRSFALSQ 125 (250) Q Consensus 104 SGTvgAA~ea~~~GipsIAiS~ 125 (250) =||.-||+||+.+|...|.++. T Consensus 213 Fg~~~a~ieAa~rGl~vlvv~t 234 (260) T COG1497 213 FGTLEAAIEAAVRGLSVLVVIT 234 (260) T ss_pred ECCCHHHHHHHHCCCCEEEEEE T ss_conf 3461879999866883899970 No 157 >cd05017 SIS_PGI_PMI_1 The members of this protein family contain the SIS (Sugar ISomerase) domain and have both the phosphoglucose isomerase (PGI) and the phosphomannose isomerase (PMI) functions. These functions catalyze the reversible reactions of glucose 6-phosphate to fructose 6-phosphate, and mannose 6-phosphate to fructose 6-phosphate, respectively at an equal rate. This protein contains two SIS domains. This alignment is based on the first SIS domain. Probab=20.38 E-value=68 Score=13.82 Aligned_cols=25 Identities=36% Similarity=0.561 Sum_probs=21.4 Q ss_pred HHHHH----HHHHHHHHHHHCCCCCCCCC Q ss_conf 33457----77899889871898532220 Q gi|255764489|r 101 VAYSG----TLAAAFEGSLQGIRSFALSQ 125 (250) Q Consensus 101 v~ySG----TvgAA~ea~~~GipsIAiS~ 125 (250) +-||| |+.|+.+|.-.|.+-||++. T Consensus 50 ~S~SGnTeEtl~~~~~a~~~ga~vi~its 78 (119) T cd05017 50 VSYSGNTEETLSAVEQAKERGAKIVAITS 78 (119) T ss_pred ECCCCCCHHHHHHHHHHHHCCCEEEEECC T ss_conf 92897968999999999985990999849 No 158 >PRK05246 glutathione synthetase; Provisional Probab=20.32 E-value=68 Score=13.81 Aligned_cols=37 Identities=16% Similarity=0.305 Sum_probs=24.7 Q ss_pred CEEEEECC--CCCCCHH--HHHHHHHHHHCCCCEEEECCCC Q ss_conf 92688268--9868856--9999999985489199981799 Q gi|255764489|r 1 MRILLTND--DGIKSKG--LITLENIARSISDDIWICAPEM 37 (250) Q Consensus 1 m~ILitND--DG~~a~g--l~~l~~~l~~~~~~v~vvAP~~ 37 (250) |||+.-=| +.++-.. =.+|.+++++.||+||++-|.. T Consensus 2 mki~f~mDpie~l~~~kDTT~~Lm~eAq~rGhev~~~~p~d 42 (316) T PRK05246 2 MKVAFIMDPIESINIKKDSTFAMMLEAQRRGHELFYYEPDD 42 (316) T ss_pred CEEEEEECCHHHCCCCCCHHHHHHHHHHHCCCEEEEEECCC T ss_conf 47999968978779988729999999998799899997275 No 159 >pfam00274 Glycolytic Fructose-bisphosphate aldolase class-I. Probab=20.20 E-value=68 Score=13.80 Aligned_cols=29 Identities=28% Similarity=0.326 Sum_probs=11.4 Q ss_pred HHHHHHHHHHHHHHHHCCC-CCCCEEECCC Q ss_conf 7888884999999986778-6673543145 Q gi|255764489|r 136 EVSETHAPRVLRQLLKTQI-PNTTLCNINF 164 (250) Q Consensus 136 ~~a~~~~~~ii~~l~~~~~-p~~~vlNINi 164 (250) +..+++...+.+.|...+. -++++|--|. T Consensus 189 ~vt~~vl~~vf~~L~~~~V~LeG~lLKPnM 218 (348) T pfam00274 189 EVTEKVLAAVFKALNDHHVMLEGTLLKPNM 218 (348) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCE T ss_conf 999999999999999849861252235660 No 160 >cd03811 GT1_WabH_like This family is most closely related to the GT1 family of glycosyltransferases. WabH in Klebsiella pneumoniae has been shown to transfer a GlcNAc residue from UDP-GlcNAc onto the acceptor GalUA residue in the cellular outer core. Probab=20.14 E-value=69 Score=13.79 Aligned_cols=93 Identities=15% Similarity=0.216 Sum_probs=42.2 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCEEEEECCCCCEEEEEEEEEEEEECCCHHHHHHHHHHH-CCCCCCCEEEECCCCC Q ss_conf 999999998548919998179985650320224897058897110476327279999999875-0464541999714888 Q gi|255764489|r 16 LITLENIARSISDDIWICAPEMDQSCLANSLTMSRNIACRTISKKRFAVHGTPVDCVVIALQK-MSDKKPDLILSGVNVG 94 (250) Q Consensus 16 l~~l~~~l~~~~~~v~vvAP~~~~S~~~~ait~~~~~~~~~~~~~~~~v~GtPaDcv~~~l~~-l~~~~pDlViSGiN~G 94 (250) +..|++.|.+.||+|.|+++.......-. ..............+.....+. ....+.. +...+||+|++= + T Consensus 18 ~~~la~~L~~~G~~V~vi~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~--~~~~l~~~i~~~~~DiI~~~---~ 89 (353) T cd03811 18 LLNLANGLDKRGYDVTLVVLRDEGDYLEL---LPSNVKLIPVRVLKLKSLRDLL--AILRLRRLLRKEKPDVVISH---L 89 (353) T ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCHHH---CCCCCEEEEECCCCCCCHHHHH--HHHHHHHHHHHCCCCEEEEC---C T ss_conf 99999999977997999997799851333---0567338861355655332599--99999999997499899998---8 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 531023334577789988987189853222 Q gi|255764489|r 95 TNTSNHVAYSGTLAAAFEGSLQGIRSFALS 124 (250) Q Consensus 95 ~N~g~~v~ySGTvgAA~ea~~~GipsIAiS 124 (250) . ++..+ ++.-+...++|-|..- T Consensus 90 ~-------~~~~~-~~~~~~~~~~~~i~~~ 111 (353) T cd03811 90 T-------TTPNV-LALLAARLGTKLIVWE 111 (353) T ss_pred C-------CHHHH-HHHHHHHCCCCEEEEE T ss_conf 6-------27899-9999974799789997 Done!