Query gi|254780791|ref|YP_003065204.1| exodeoxyribonuclease VII large subunit [Candidatus Liberibacter asiaticus str. psy62] Match_columns 529 No_of_seqs 228 out of 1609 Neff 8.5 Searched_HMMs 39220 Date Sun May 29 19:57:09 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780791.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK00286 xseA exodeoxyribonucl 100.0 0 0 1023.1 55.5 438 11-504 2-442 (443) 2 COG1570 XseA Exonuclease VII, 100.0 0 0 1028.3 52.5 436 10-504 1-439 (440) 3 pfam02601 Exonuc_VII_L Exonucl 100.0 0 0 662.4 34.5 293 132-498 1-295 (295) 4 TIGR00237 xseA exodeoxyribonuc 100.0 0 0 563.3 31.8 434 16-498 1-441 (441) 5 cd04489 ExoVII_LU_OBF ExoVII_L 99.8 8.1E-19 2.1E-23 188.2 11.0 78 34-112 1-78 (78) 6 pfam01336 tRNA_anti OB-fold nu 97.9 0.0001 2.6E-09 62.3 8.9 74 35-110 1-74 (75) 7 COG1570 XseA Exonuclease VII, 97.0 0.061 1.6E-06 37.5 16.0 340 76-471 15-397 (440) 8 PRK00286 xseA exodeoxyribonucl 97.0 0.068 1.7E-06 37.1 17.0 224 182-452 144-383 (443) 9 TIGR03007 pepcterm_ChnLen poly 97.0 0.069 1.8E-06 37.0 13.5 224 281-512 166-424 (510) 10 PRK13480 3'-5' exoribonuclease 96.8 0.013 3.3E-07 43.5 8.4 87 23-114 4-93 (314) 11 cd04492 YhaM_OBF_like YhaM_OBF 96.7 0.023 5.9E-07 41.2 9.6 75 35-114 3-79 (83) 12 cd04482 RPA2_OBF_like RPA2_OBF 96.6 0.013 3.3E-07 43.5 7.5 72 35-111 1-74 (91) 13 PRK05673 dnaE DNA polymerase I 96.5 0.12 3E-06 35.0 11.7 17 101-117 85-101 (1143) 14 cd03524 RPA2_OBF_family RPA2_O 96.4 0.023 5.9E-07 41.3 7.7 72 37-110 2-75 (75) 15 PRK06326 consensus 96.3 0.15 3.8E-06 34.0 11.4 51 212-262 171-243 (1240) 16 PRK07892 consensus 96.2 0.025 6.3E-07 41.0 7.1 20 11-30 12-31 (1180) 17 COG1107 Archaea-specific RecJ- 96.2 0.19 4.9E-06 33.0 14.4 221 15-263 191-466 (715) 18 cd04487 RecJ_OBF2_like RecJ_OB 96.2 0.041 1.1E-06 39.0 8.1 70 35-110 1-72 (73) 19 PRK05672 dnaE2 error-prone DNA 96.1 0.032 8E-07 40.0 7.3 18 99-116 78-95 (1050) 20 PRK10917 ATP-dependent DNA hel 96.1 0.19 4.9E-06 33.0 11.2 138 34-198 60-198 (677) 21 PRK07373 DNA polymerase III su 96.0 0.04 1E-06 39.1 7.3 76 34-111 280-358 (447) 22 PRK08957 consensus 96.0 0.041 1E-06 39.1 7.4 47 68-117 52-98 (1159) 23 cd04485 DnaE_OBF DnaE_OBF: A s 96.0 0.034 8.6E-07 39.8 6.8 73 37-111 2-77 (84) 24 TIGR02169 SMC_prok_A chromosom 96.0 0.23 6E-06 32.3 16.5 35 449-483 1083-1117(1202) 25 PRK09074 consensus 96.0 0.24 6E-06 32.2 13.6 20 11-30 12-31 (1149) 26 cd04484 polC_OBF polC_OBF: A s 95.9 0.092 2.3E-06 35.9 8.9 76 34-110 1-81 (82) 27 PRK06826 dnaE DNA polymerase I 95.9 0.043 1.1E-06 38.8 7.2 12 101-112 89-100 (1157) 28 cd04488 RecG_wedge_OBF RecG_we 95.6 0.088 2.3E-06 36.0 7.8 67 37-106 2-70 (75) 29 pfam00465 Fe-ADH Iron-containi 95.4 0.19 4.8E-06 33.1 8.7 70 146-225 23-95 (312) 30 PRK07279 dnaE DNA polymerase I 95.3 0.07 1.8E-06 36.9 6.4 38 81-118 55-92 (1033) 31 PRK06658 consensus 95.2 0.11 2.7E-06 35.3 7.2 18 100-117 89-106 (1145) 32 cd04483 hOBFC1_like hOBFC1_lik 95.2 0.24 6.2E-06 32.1 8.9 68 37-108 2-89 (92) 33 PRK06920 dnaE DNA polymerase I 95.2 0.14 3.7E-06 34.1 7.6 37 82-118 57-95 (1107) 34 PRK07374 dnaE DNA polymerase I 95.1 0.11 2.9E-06 35.1 6.8 50 71-120 52-104 (1171) 35 PRK06637 consensus 95.1 0.22 5.5E-06 32.6 8.3 18 101-118 79-96 (1182) 36 PRK09860 putative alcohol dehy 94.9 0.48 1.2E-05 29.5 11.1 97 101-224 4-103 (383) 37 PRK07012 consensus 94.9 0.16 4.1E-06 33.7 7.2 60 55-117 39-98 (1173) 38 COG4085 Predicted RNA-binding 94.8 0.17 4.3E-06 33.6 7.2 75 34-110 53-132 (204) 39 PRK09423 gldA glycerol dehydro 94.4 0.39 9.9E-06 30.3 8.3 89 146-252 30-125 (366) 40 pfam01442 Apolipoprotein Apoli 94.4 0.62 1.6E-05 28.5 21.8 86 272-357 19-105 (191) 41 PRK00484 lysS lysyl-tRNA synth 94.4 0.6 1.5E-05 28.6 9.2 234 1-254 22-308 (491) 42 PRK06142 enoyl-CoA hydratase; 94.3 0.64 1.6E-05 28.4 11.0 39 231-269 99-139 (271) 43 TIGR02393 RpoD_Cterm RNA polym 94.2 0.053 1.4E-06 38.0 3.5 52 222-273 110-163 (240) 44 COG1571 Predicted DNA-binding 94.0 0.33 8.4E-06 30.9 7.2 77 32-114 266-344 (421) 45 PRK06752 single-stranded DNA-b 93.9 0.52 1.3E-05 29.2 8.1 77 34-111 4-103 (112) 46 COG1454 EutG Alcohol dehydroge 93.7 0.54 1.4E-05 29.0 7.9 70 145-224 29-101 (377) 47 cd04490 PolII_SU_OBF PolII_SU_ 93.6 0.42 1.1E-05 30.0 7.1 67 35-108 2-72 (79) 48 cd06533 Glyco_transf_WecG_TagA 93.6 0.65 1.6E-05 28.3 8.1 92 144-253 45-137 (171) 49 cd04478 RPA2_DBD_D RPA2_DBD_D: 93.5 0.87 2.2E-05 27.2 10.1 79 34-116 1-83 (95) 50 pfam03808 Glyco_tran_WecB Glyc 93.3 0.62 1.6E-05 28.5 7.6 93 144-254 47-140 (172) 51 pfam02601 Exonuc_VII_L Exonucl 93.2 0.94 2.4E-05 26.9 11.8 165 182-383 23-206 (295) 52 PRK08744 consensus 93.2 0.45 1.2E-05 29.7 6.8 44 69-117 67-110 (1195) 53 PRK06203 aroB 3-dehydroquinate 93.2 0.36 9.1E-06 30.6 6.2 104 144-259 41-151 (390) 54 PRK00448 polC DNA polymerase I 93.1 0.83 2.1E-05 27.4 8.0 79 31-110 233-317 (1436) 55 PRK11070 ssDNA exonuclease Rec 93.1 0.98 2.5E-05 26.7 10.5 27 110-136 186-212 (574) 56 PRK08763 single-stranded DNA-b 93.0 0.66 1.7E-05 28.2 7.3 80 31-111 4-108 (160) 57 cd04496 SSB_OBF SSB_OBF: A sub 92.9 0.95 2.4E-05 26.8 8.1 49 62-110 44-99 (100) 58 cd04316 ND_PkAspRS_like_N ND_P 92.8 1.1 2.7E-05 26.4 8.7 77 33-112 13-95 (108) 59 KOG3108 consensus 92.5 1.2 3E-05 26.1 8.4 97 32-132 68-168 (265) 60 COG3264 Small-conductance mech 92.4 1.2 3E-05 26.0 14.6 20 256-275 35-54 (835) 61 PRK06751 single-stranded DNA-b 92.4 0.77 2E-05 27.6 7.1 76 34-110 3-101 (172) 62 PRK05159 aspC aspartyl-tRNA sy 92.0 1.1 2.9E-05 26.2 7.5 77 33-112 17-99 (434) 63 TIGR03405 Phn_Fe-ADH phosphona 91.8 1.3 3.4E-05 25.5 8.7 70 145-224 23-95 (355) 64 KOG0994 consensus 91.6 1.4 3.6E-05 25.3 20.3 16 228-243 1493-1508(1758) 65 PRK10624 L-1,2-propanediol oxi 91.5 0.99 2.5E-05 26.6 6.8 69 146-224 30-101 (381) 66 PRK07274 single-stranded DNA-b 91.5 1.4 3.6E-05 25.3 7.6 79 32-111 2-102 (131) 67 PRK06495 enoyl-CoA hydratase; 91.4 0.61 1.5E-05 28.6 5.6 76 187-269 32-128 (257) 68 cd04322 LysRS_N LysRS_N: N-ter 91.4 1.5 3.7E-05 25.2 8.8 73 35-110 2-79 (108) 69 cd04100 Asp_Lys_Asn_RS_N Asp_L 91.4 1.5 3.7E-05 25.2 9.5 76 34-111 1-83 (85) 70 PRK12366 replication factor A; 91.4 1.5 3.7E-05 25.1 8.1 72 36-111 188-267 (649) 71 PRK05995 enoyl-CoA hydratase; 91.4 0.66 1.7E-05 28.3 5.8 77 186-269 32-130 (261) 72 COG1200 RecG RecG-like helicas 91.3 1.5 3.8E-05 25.0 8.3 94 14-110 31-137 (677) 73 cd04317 EcAspRS_like_N EcAspRS 91.0 1.6 4E-05 24.8 9.0 77 33-111 15-101 (135) 74 PRK09076 enoyl-CoA hydratase; 91.0 0.69 1.8E-05 28.1 5.5 75 187-269 31-126 (258) 75 PRK04036 DNA polymerase II sma 90.9 1.6 4.1E-05 24.8 8.2 70 34-108 148-219 (497) 76 PRK06143 enoyl-CoA hydratase; 90.9 0.93 2.4E-05 26.9 6.1 78 186-269 37-133 (260) 77 PRK08138 enoyl-CoA hydratase; 90.8 0.81 2.1E-05 27.4 5.8 77 186-269 35-128 (260) 78 TIGR01278 DPOR_BchB light-inde 90.8 0.97 2.5E-05 26.7 6.2 109 149-268 28-152 (562) 79 PRK11154 fadJ multifunctional 90.7 0.62 1.6E-05 28.5 5.1 83 183-276 32-139 (706) 80 PRK09010 single-stranded DNA-b 90.5 1.7 4.4E-05 24.5 7.7 77 31-108 5-102 (178) 81 cd06325 PBP1_ABC_uncharacteriz 90.4 1.8 4.5E-05 24.4 9.3 84 144-246 130-217 (281) 82 PRK09245 enoyl-CoA hydratase; 90.4 0.9 2.3E-05 27.0 5.7 75 188-269 34-134 (266) 83 PRK06293 single-stranded DNA-b 90.2 0.67 1.7E-05 28.2 5.0 75 36-111 5-99 (161) 84 PRK06951 consensus 90.2 0.81 2.1E-05 27.5 5.4 75 185-269 28-121 (254) 85 pfam00436 SSB Single-strand bi 90.1 1.2 2.9E-05 26.1 6.1 76 34-110 3-103 (104) 86 PRK05733 single-stranded DNA-b 90.1 1.8 4.7E-05 24.3 7.1 77 32-110 5-102 (172) 87 PRK13805 bifunctional acetalde 89.8 0.93 2.4E-05 26.9 5.4 97 145-251 482-608 (862) 88 PRK08486 single-stranded DNA-b 89.7 2 5E-05 24.0 7.4 78 33-111 3-105 (180) 89 PRK08512 consensus 89.6 2 5.1E-05 23.9 9.5 45 73-117 55-101 (1185) 90 PRK11423 methylmalonyl-CoA dec 89.6 1.1 2.8E-05 26.3 5.6 40 230-269 86-127 (261) 91 PRK05981 enoyl-CoA hydratase; 89.5 1.1 2.9E-05 26.1 5.7 39 231-269 94-134 (266) 92 PRK07510 consensus 89.5 1 2.6E-05 26.5 5.4 79 184-269 31-134 (266) 93 PRK05980 enoyl-CoA hydratase; 89.4 1.3 3.2E-05 25.7 5.9 75 187-269 32-131 (260) 94 PRK05658 RNA polymerase sigma 89.3 0.53 1.4E-05 29.1 3.9 14 155-168 41-54 (620) 95 pfam00378 ECH Enoyl-CoA hydrat 89.1 1.3 3.4E-05 25.6 5.8 76 187-269 19-113 (169) 96 TIGR02168 SMC_prok_B chromosom 89.0 2.2 5.5E-05 23.6 21.1 41 111-173 168-209 (1191) 97 PRK07275 single-stranded DNA-b 89.0 0.53 1.3E-05 29.1 3.7 77 34-111 4-103 (163) 98 PRK06213 enoyl-CoA hydratase; 88.9 1.2 3.1E-05 25.8 5.5 40 230-269 81-122 (229) 99 PRK08182 single-stranded DNA-b 88.9 0.86 2.2E-05 27.2 4.7 54 58-111 49-110 (148) 100 TIGR00655 PurU formyltetrahydr 88.8 1.4 3.7E-05 25.2 5.7 81 132-216 79-182 (294) 101 TIGR00644 recJ single-stranded 88.7 2.3 5.8E-05 23.4 7.2 58 50-107 638-704 (705) 102 PRK06127 enoyl-CoA hydratase; 88.7 1.5 3.9E-05 25.0 5.8 75 187-269 40-137 (269) 103 PRK05869 enoyl-CoA hydratase; 88.6 1.5 3.7E-05 25.2 5.7 77 186-269 35-130 (225) 104 PRK06461 single-stranded DNA-b 88.6 2.3 5.9E-05 23.4 9.4 79 30-114 12-101 (130) 105 PRK13732 single-stranded DNA-b 88.6 2.3 5.9E-05 23.4 7.3 77 32-110 6-103 (175) 106 PRK06023 enoyl-CoA hydratase; 88.6 1.4 3.6E-05 25.3 5.6 79 185-269 30-125 (248) 107 KOG1680 consensus 88.5 2 5E-05 24.0 6.3 98 144-270 43-159 (290) 108 COG0337 AroB 3-dehydroquinate 88.5 2.3 6E-05 23.3 7.2 93 144-242 32-127 (360) 109 PRK00843 egsA NAD(P)-dependent 88.5 2.3 6E-05 23.3 9.1 82 145-245 34-117 (351) 110 PRK08139 enoyl-CoA hydratase; 88.4 1.8 4.5E-05 24.4 6.0 39 231-269 94-134 (265) 111 PRK05809 3-hydroxybutyryl-CoA 88.4 1.8 4.5E-05 24.4 6.0 77 186-269 32-128 (260) 112 PRK06688 enoyl-CoA hydratase; 88.3 1.7 4.2E-05 24.7 5.8 79 185-269 32-131 (263) 113 PRK00002 aroB 3-dehydroquinate 88.1 2.3 5.7E-05 23.5 6.4 99 145-259 32-133 (360) 114 PRK07629 consensus 88.0 1.5 3.8E-05 25.1 5.4 75 187-269 33-130 (261) 115 PRK08290 enoyl-CoA hydratase; 87.9 1.9 4.8E-05 24.2 5.9 36 231-269 106-146 (284) 116 PRK06863 single-stranded DNA-b 87.9 1.3 3.2E-05 25.7 5.0 80 32-112 4-109 (168) 117 PRK03932 asnC asparaginyl-tRNA 87.9 2.5 6.4E-05 23.1 9.9 87 23-112 5-100 (462) 118 PRK07799 enoyl-CoA hydratase; 87.9 1.4 3.6E-05 25.3 5.3 77 185-269 32-131 (263) 119 PRK13010 purU formyltetrahydro 87.8 2.6 6.5E-05 23.0 8.2 89 113-217 67-178 (289) 120 PRK07468 enoyl-CoA hydratase; 87.7 1.7 4.3E-05 24.6 5.6 39 231-269 91-131 (262) 121 PRK07830 consensus 87.7 2 5E-05 24.0 5.9 40 230-269 94-135 (267) 122 PRK07511 enoyl-CoA hydratase; 87.6 2 5E-05 24.0 5.8 78 185-269 30-129 (259) 123 TIGR03210 badI 2-ketocyclohexa 87.4 2.1 5.4E-05 23.7 5.9 78 186-269 30-125 (256) 124 PRK07459 single-stranded DNA-b 87.4 2.6 6.8E-05 22.9 6.6 80 32-112 3-102 (121) 125 PRK08258 enoyl-CoA hydratase; 87.3 2.1 5.3E-05 23.8 5.9 39 231-269 97-137 (270) 126 PRK06210 enoyl-CoA hydratase; 87.2 2.3 5.8E-05 23.5 6.0 35 231-268 99-138 (272) 127 pfam04392 ABC_sub_bind ABC tra 87.2 2.7 6.9E-05 22.8 9.6 85 145-247 130-217 (292) 128 TIGR03200 dearomat_oah 6-oxocy 87.1 1.8 4.5E-05 24.4 5.4 76 186-267 56-162 (360) 129 PRK05674 gamma-carboxygeranoyl 87.0 1.8 4.6E-05 24.3 5.4 35 230-269 91-132 (271) 130 PRK06958 single-stranded DNA-b 87.0 1.2 3.1E-05 25.8 4.5 79 32-111 4-108 (181) 131 cd07025 Peptidase_S66 LD-Carbo 87.0 2.8 7.1E-05 22.7 7.1 87 163-254 180-273 (282) 132 PRK02801 primosomal replicatio 86.9 2.8 7.2E-05 22.6 8.0 75 35-110 5-99 (101) 133 PRK10481 hypothetical protein; 86.9 2.8 7.2E-05 22.6 10.6 80 146-247 129-212 (224) 134 cd06558 crotonase-like Crotona 86.8 2.4 6E-05 23.3 5.9 80 185-276 26-129 (195) 135 PRK08227 aldolase; Validated 86.8 2.8 7.3E-05 22.6 8.3 88 156-261 150-241 (291) 136 TIGR00284 TIGR00284 dihydropte 86.8 2.5 6.3E-05 23.2 5.9 52 17-76 45-96 (529) 137 PRK05901 RNA polymerase sigma 86.7 1.7 4.4E-05 24.5 5.1 48 225-272 272-321 (398) 138 PRK07772 single-stranded DNA-b 86.7 1 2.6E-05 26.5 4.0 76 34-110 6-108 (183) 139 PRK08260 enoyl-CoA hydratase; 86.6 2.5 6.4E-05 23.1 5.9 38 231-269 101-141 (293) 140 cd00288 Pyruvate_Kinase Pyruva 86.6 2.9 7.4E-05 22.5 8.7 185 49-270 87-294 (480) 141 PRK07110 polyketide biosynthes 86.6 2.3 5.9E-05 23.4 5.7 77 186-269 32-133 (257) 142 PRK06144 enoyl-CoA hydratase; 86.4 2.6 6.5E-05 23.0 5.9 75 186-269 36-133 (262) 143 COG0629 Ssb Single-stranded DN 86.4 1.6 4.1E-05 24.8 4.8 78 33-110 4-106 (167) 144 PRK09210 RNA polymerase sigma 86.2 2.2 5.7E-05 23.5 5.5 49 225-273 243-293 (368) 145 cd04491 SoSSB_OBF SoSSB_OBF: A 86.2 3 7.7E-05 22.3 9.3 49 54-107 26-75 (82) 146 pfam01761 DHQ_synthase 3-dehyd 86.1 3.1 7.8E-05 22.3 8.5 91 144-245 18-111 (310) 147 PRK07827 enoyl-CoA hydratase; 86.0 2.8 7.1E-05 22.7 5.9 39 230-269 86-127 (255) 148 PRK05617 enoyl-CoA hydratase; 86.0 2.1 5.4E-05 23.7 5.3 37 228-269 93-136 (356) 149 PRK05864 enoyl-CoA hydratase; 85.8 2.7 6.9E-05 22.7 5.8 36 231-269 93-133 (269) 150 PRK06190 enoyl-CoA hydratase; 85.8 3 7.6E-05 22.4 6.0 77 186-269 32-125 (258) 151 cd07062 Peptidase_S66_mccF_lik 85.6 3.2 8.2E-05 22.1 9.1 74 175-253 221-297 (308) 152 PRK05862 enoyl-CoA hydratase; 85.6 2.8 7.1E-05 22.7 5.7 76 187-269 33-125 (257) 153 PRK08140 enoyl-CoA hydratase; 85.4 2.6 6.5E-05 23.0 5.5 39 231-269 90-130 (262) 154 PRK07657 enoyl-CoA hydratase; 85.3 2.3 6E-05 23.3 5.2 77 186-269 33-129 (261) 155 COG0017 AsnS Aspartyl/asparagi 85.1 3.3 8.5E-05 21.9 9.8 233 23-311 6-265 (435) 156 PRK07658 enoyl-CoA hydratase; 85.1 3.1 7.8E-05 22.3 5.8 40 230-269 84-125 (257) 157 PRK06072 enoyl-CoA hydratase; 85.1 3.4 8.5E-05 21.9 6.1 76 187-269 21-112 (240) 158 cd00958 DhnA Class I fructose- 85.1 3.4 8.6E-05 21.9 8.8 90 156-261 105-202 (235) 159 cd01545 PBP1_SalR Ligand-bindi 84.8 3.5 8.8E-05 21.8 8.0 21 8-30 7-27 (270) 160 PRK07327 enoyl-CoA hydratase; 84.8 2.9 7.4E-05 22.5 5.5 71 187-269 41-137 (271) 161 TIGR01843 type_I_hlyD type I s 84.7 3.5 8.9E-05 21.8 17.5 111 396-506 216-364 (434) 162 PRK07260 enoyl-CoA hydratase; 84.4 3.5 8.9E-05 21.8 5.8 76 187-269 33-131 (260) 163 cd04497 hPOT1_OB1_like hPOT1_O 84.2 3.6 9.2E-05 21.6 10.6 80 24-106 6-94 (138) 164 PRK13371 4-hydroxy-3-methylbut 83.6 3.8 9.7E-05 21.4 6.1 139 81-243 162-324 (392) 165 cd04474 RPA1_DBD_A RPA1_DBD_A: 83.6 3.5 8.8E-05 21.8 5.5 60 34-93 11-79 (104) 166 COG0587 DnaE DNA polymerase II 83.5 1.4 3.5E-05 25.5 3.4 62 57-119 39-100 (1139) 167 PRK09674 enoyl-CoA hydratase-i 83.5 3.4 8.6E-05 21.9 5.4 76 187-269 33-125 (257) 168 PRK05853 hypothetical protein; 83.3 1.7 4.4E-05 24.5 3.9 50 62-111 50-106 (168) 169 COG2805 PilT Tfp pilus assembl 83.2 3.9 0.0001 21.3 7.1 123 87-240 80-221 (353) 170 pfam12128 DUF3584 Protein of u 83.1 4 0.0001 21.3 22.1 59 432-492 515-575 (1192) 171 PRK06731 flhF flagellar biosyn 83.1 4 0.0001 21.3 6.5 16 222-237 252-267 (270) 172 PRK05870 enoyl-CoA hydratase; 82.9 4 0.0001 21.2 5.6 39 230-269 85-126 (254) 173 PRK10826 2-deoxyglucose-6-phos 82.5 4.1 0.00011 21.1 8.1 94 146-259 109-208 (222) 174 cd00578 L-fuc_L-ara-isomerases 82.4 4.2 0.00011 21.1 9.9 25 231-255 117-141 (452) 175 cd04320 AspRS_cyto_N AspRS_cyt 82.4 4.2 0.00011 21.1 9.8 74 34-110 1-88 (102) 176 PRK02910 light-independent pro 82.4 4.2 0.00011 21.1 6.0 100 148-262 27-144 (524) 177 TIGR01420 pilT_fam twitching m 82.3 1.7 4.2E-05 24.7 3.4 125 85-241 79-224 (350) 178 TIGR00617 rpa1 replication fac 82.2 0.76 1.9E-05 27.7 1.7 67 8-76 207-276 (671) 179 PRK05657 RNA polymerase sigma 81.8 4 0.0001 21.2 5.3 132 95-271 109-250 (328) 180 cd06275 PBP1_PurR Ligand-bindi 81.6 4.5 0.00011 20.8 8.7 19 148-166 2-21 (269) 181 PRK04531 acetylglutamate kinas 81.2 1.9 4.9E-05 24.1 3.5 107 144-262 18-175 (421) 182 PRK07938 enoyl-CoA hydratase; 81.2 4.6 0.00012 20.7 5.9 35 230-269 84-125 (249) 183 PRK07226 fructose-bisphosphate 81.1 4.6 0.00012 20.7 8.8 91 155-261 121-219 (266) 184 COG1188 Ribosome-associated he 80.9 2.2 5.6E-05 23.6 3.7 44 457-503 30-73 (100) 185 PRK03580 carnitinyl-CoA dehydr 80.9 4.7 0.00012 20.7 5.8 75 188-269 33-126 (262) 186 cd04318 EcAsnRS_like_N EcAsnRS 80.4 4.8 0.00012 20.5 8.8 75 35-112 2-81 (82) 187 cd06310 PBP1_ABC_sugar_binding 80.3 4.8 0.00012 20.5 8.6 83 147-245 1-87 (273) 188 pfam10234 Cluap1 Clusterin-ass 80.2 4.9 0.00012 20.5 15.5 188 153-384 16-219 (269) 189 PRK07509 enoyl-CoA hydratase; 80.2 4.9 0.00012 20.5 5.5 36 231-269 93-133 (262) 190 PRK04539 ppnK inorganic polyph 80.0 4.9 0.00013 20.4 5.8 89 143-245 3-97 (296) 191 KOG0556 consensus 79.9 5 0.00013 20.4 7.7 68 24-93 73-148 (533) 192 PRK08272 enoyl-CoA hydratase; 79.6 5.1 0.00013 20.3 5.7 36 231-269 123-163 (308) 193 PRK10139 serine endoprotease; 79.3 5.2 0.00013 20.3 5.9 23 457-480 407-429 (455) 194 CHL00076 chlB photochlorophyll 79.3 5.2 0.00013 20.3 5.4 48 150-197 29-80 (510) 195 pfam07302 AroM AroM protein. T 79.0 5.3 0.00013 20.2 7.2 80 146-247 126-209 (221) 196 COG1190 LysU Lysyl-tRNA synthe 78.9 5.3 0.00013 20.2 10.8 234 2-254 30-317 (502) 197 PRK09701 D-allose transporter 78.9 5.3 0.00014 20.2 8.6 84 143-245 22-112 (311) 198 PRK03378 ppnK inorganic polyph 78.8 5.1 0.00013 20.3 5.0 98 144-257 4-108 (292) 199 cd06299 PBP1_LacI_like_13 Liga 78.1 5.5 0.00014 20.0 8.7 21 8-30 7-27 (265) 200 PRK10942 serine endoprotease; 77.9 5.6 0.00014 19.9 5.8 21 458-479 427-447 (474) 201 PRK10017 putative pyruvyl tran 77.9 4.3 0.00011 21.0 4.4 36 209-245 118-153 (426) 202 PRK06494 enoyl-CoA hydratase; 77.9 5.6 0.00014 19.9 5.6 73 187-269 33-125 (259) 203 PTZ00059 dynein light chain; P 77.7 5.2 0.00013 20.2 4.8 48 18-71 37-89 (90) 204 PRK00409 recombination and DNA 77.7 5.7 0.00014 19.9 18.7 22 145-166 324-345 (780) 205 COG1196 Smc Chromosome segrega 77.6 5.7 0.00015 19.9 21.9 32 452-483 1040-1071(1163) 206 pfam02016 Peptidase_S66 LD-car 77.6 5.7 0.00015 19.9 7.2 86 164-254 178-270 (281) 207 PRK07854 enoyl-CoA hydratase; 77.4 5.8 0.00015 19.8 5.2 39 230-269 76-117 (243) 208 PRK02155 ppnK inorganic polyph 77.2 5.8 0.00015 19.8 5.8 84 144-245 4-92 (291) 209 PRK07396 naphthoate synthase; 77.1 5.9 0.00015 19.8 5.8 42 228-269 95-138 (273) 210 PRK12445 lysyl-tRNA synthetase 77.1 5.9 0.00015 19.8 7.8 202 34-252 67-319 (505) 211 PRK04759 consensus 77.0 5.9 0.00015 19.7 6.1 87 143-245 3-93 (294) 212 pfam05622 HOOK HOOK protein. T 77.0 5.9 0.00015 19.7 17.5 62 389-450 362-425 (713) 213 PRK11281 potassium efflux prot 76.4 6.1 0.00016 19.6 22.9 22 432-453 311-332 (1107) 214 COG2176 PolC DNA polymerase II 76.3 6.1 0.00016 19.6 5.8 79 33-111 240-323 (1444) 215 TIGR01405 polC_Gram_pos DNA po 76.3 6.1 0.00016 19.6 8.4 80 32-111 7-94 (1264) 216 COG1311 HYS2 Archaeal DNA poly 75.9 4.5 0.00012 20.8 4.0 36 34-70 140-175 (481) 217 COG5235 RFA2 Single-stranded D 75.4 6.4 0.00016 19.4 10.3 134 32-169 66-207 (258) 218 PRK10936 periplasmic sensory p 75.2 6.5 0.00017 19.4 8.5 84 144-246 44-131 (340) 219 KOG3430 consensus 75.1 6.4 0.00016 19.4 4.7 50 18-71 37-89 (90) 220 cd02070 corrinoid_protein_B12- 74.7 6.7 0.00017 19.3 9.9 109 143-274 79-199 (201) 221 cd06298 PBP1_CcpA_like Ligand- 74.2 6.8 0.00017 19.2 8.8 78 148-245 2-83 (268) 222 cd04323 AsnRS_cyto_like_N AsnR 73.2 7.2 0.00018 19.0 8.8 76 35-112 2-83 (84) 223 PRK10416 cell division protein 73.0 7.2 0.00018 19.0 5.2 22 287-308 196-217 (499) 224 TIGR03336 IOR_alpha indolepyru 73.0 7.2 0.00018 19.0 7.3 79 145-250 230-313 (595) 225 pfam01221 Dynein_light Dynein 72.8 7.3 0.00019 18.9 5.2 48 18-71 33-85 (86) 226 PRK10422 lipopolysaccharide co 72.8 7.3 0.00019 18.9 7.5 31 142-175 2-39 (352) 227 PRK09120 p-hydroxycinnamoyl Co 72.6 7.4 0.00019 18.9 5.0 76 187-269 38-137 (277) 228 PRK03202 6-phosphofructokinase 72.5 7.4 0.00019 18.9 8.2 38 209-255 95-132 (323) 229 TIGR03189 dienoyl_CoA_hyt cycl 72.2 7.5 0.00019 18.8 5.6 40 230-269 79-120 (251) 230 pfam04539 Sigma70_r3 Sigma-70 71.9 6.4 0.00016 19.4 4.0 47 224-270 29-77 (78) 231 pfam03599 CdhD CO dehydrogenas 71.9 7.6 0.00019 18.8 10.4 161 83-264 15-197 (384) 232 TIGR02037 degP_htrA_DO proteas 71.8 6.6 0.00017 19.3 4.1 41 460-502 91-135 (484) 233 COG2957 Peptidylarginine deimi 71.8 7.3 0.00019 18.9 4.3 162 58-222 104-337 (346) 234 PRK08150 enoyl-CoA hydratase; 71.3 7.8 0.0002 18.7 5.4 37 230-269 91-132 (264) 235 PRK07659 enoyl-CoA hydratase; 71.0 7.9 0.0002 18.6 5.6 39 230-269 88-129 (260) 236 PRK10898 serine endoprotease; 70.7 8 0.0002 18.6 5.5 37 458-495 298-341 (355) 237 PRK13011 formyltetrahydrofolat 70.6 8 0.0002 18.5 10.9 90 112-217 63-175 (287) 238 COG2984 ABC-type uncharacteriz 70.5 8.1 0.00021 18.5 5.4 94 143-257 157-253 (322) 239 cd06278 PBP1_LacI_like_2 Ligan 70.4 8.1 0.00021 18.5 8.7 75 8-91 7-85 (266) 240 cd06305 PBP1_methylthioribose_ 70.3 8.2 0.00021 18.5 9.4 82 147-246 1-86 (273) 241 PRK11914 diacylglycerol kinase 70.0 8.3 0.00021 18.4 8.7 106 145-275 6-119 (304) 242 cd07021 Clp_protease_NfeD_like 69.9 8.3 0.00021 18.4 8.5 81 185-277 13-97 (178) 243 cd01391 Periplasmic_Binding_Pr 69.6 8.4 0.00021 18.4 8.6 89 147-254 1-96 (269) 244 cd07015 Clp_protease_NfeD Nodu 69.4 8.5 0.00022 18.3 5.8 78 185-274 13-100 (172) 245 PRK00771 signal recognition pa 69.2 8.5 0.00022 18.3 5.7 60 170-239 207-291 (433) 246 COG1024 CaiD Enoyl-CoA hydrata 69.1 8.6 0.00022 18.3 6.0 81 182-269 29-129 (257) 247 cd06289 PBP1_MalI_like Ligand- 68.5 8.8 0.00022 18.2 9.2 11 81-91 77-87 (268) 248 PRK06901 aspartate-semialdehyd 68.5 7.6 0.00019 18.8 3.8 48 149-196 124-177 (323) 249 COG4942 Membrane-bound metallo 68.4 8.8 0.00022 18.2 23.2 13 256-268 20-32 (420) 250 cd00363 PFK Phosphofructokinas 68.2 8.9 0.00023 18.2 8.6 43 209-255 93-135 (338) 251 TIGR02997 Sig70-cyanoRpoD RNA 67.9 9 0.00023 18.1 4.1 105 133-273 155-274 (336) 252 TIGR02280 PaaB1 phenylacetate 67.7 3.6 9.3E-05 21.6 2.0 54 187-248 28-103 (259) 253 cd06270 PBP1_GalS_like Ligand 67.7 9.1 0.00023 18.1 8.9 95 146-259 117-221 (268) 254 PRK07135 dnaE DNA polymerase I 67.7 9.1 0.00023 18.1 7.3 17 483-499 930-946 (987) 255 TIGR00615 recR recombination p 67.7 9.1 0.00023 18.1 4.5 154 12-237 35-202 (205) 256 CHL00180 rbcR LysR transcripti 67.6 7.1 0.00018 19.0 3.5 105 140-266 91-200 (307) 257 COG3199 Predicted inorganic po 67.5 9.2 0.00023 18.0 9.0 103 147-275 50-152 (355) 258 PRK10867 signal recognition pa 67.0 9.3 0.00024 18.0 4.8 43 170-222 213-261 (453) 259 PTZ00314 inosine-5'-monophosph 66.8 9.4 0.00024 17.9 5.0 71 156-259 263-334 (499) 260 TIGR03521 GldG gliding-associa 66.2 9.6 0.00024 17.9 7.3 149 12-171 30-212 (552) 261 PRK02645 ppnK inorganic polyph 66.2 9.6 0.00024 17.9 5.4 28 209-244 58-85 (304) 262 pfam01174 SNO SNO glutamine am 66.1 6.5 0.00017 19.4 3.0 61 180-244 3-72 (188) 263 pfam02729 OTCace_N Aspartate/o 65.9 9.7 0.00025 17.8 4.4 85 143-247 35-121 (140) 264 CHL00197 carA carbamoyl-phosph 65.9 9.7 0.00025 17.8 7.0 92 135-250 184-277 (383) 265 cd06274 PBP1_FruR Ligand bindi 65.8 9.8 0.00025 17.8 8.0 22 8-31 7-28 (264) 266 PRK08321 naphthoate synthase; 65.8 9.8 0.00025 17.8 6.1 18 230-247 125-142 (302) 267 pfam00117 GATase Glutamine ami 65.8 9.8 0.00025 17.8 4.0 41 208-250 41-82 (187) 268 PRK09428 pssA phosphatidylseri 65.7 9.8 0.00025 17.8 5.8 73 157-246 206-292 (444) 269 PRK12723 flagellar biosynthesi 65.6 9.8 0.00025 17.8 8.6 68 146-229 207-275 (388) 270 PRK00476 aspS aspartyl-tRNA sy 65.2 10 0.00025 17.7 8.8 80 29-110 13-103 (587) 271 pfam10649 DUF2478 Protein of u 65.2 10 0.00025 17.7 3.9 58 208-267 93-150 (159) 272 COG0568 RpoD DNA-directed RNA 65.1 10 0.00026 17.7 6.8 137 95-273 122-265 (342) 273 PRK07112 polyketide biosynthes 64.9 10 0.00026 17.7 4.9 38 231-269 83-123 (251) 274 cd04243 AAK_AK-HSDH-like AAK_A 64.8 10 0.00026 17.6 8.1 41 215-258 5-49 (293) 275 cd04104 p47_IIGP_like p47 (47- 64.6 10 0.00026 17.6 7.6 86 145-244 1-111 (197) 276 TIGR03249 KdgD 5-dehydro-4-deo 64.5 10 0.00026 17.6 5.5 25 154-178 113-137 (296) 277 KOG3857 consensus 64.3 10 0.00026 17.6 6.0 135 110-270 52-218 (465) 278 KOG1384 consensus 64.1 7.3 0.00019 18.9 3.0 32 145-176 6-37 (348) 279 PRK09791 putative DNA-binding 64.0 9.6 0.00024 17.9 3.6 88 141-250 92-180 (302) 280 pfam00365 PFK Phosphofructokin 64.0 10 0.00027 17.5 7.9 37 209-254 93-129 (279) 281 cd00951 KDGDH 5-dehydro-4-deox 63.8 11 0.00027 17.5 5.1 39 154-194 108-146 (289) 282 cd04486 YhcR_OBF_like YhcR_OBF 63.7 11 0.00027 17.5 7.5 68 36-107 1-70 (78) 283 PRK13228 consensus 63.6 11 0.00027 17.5 9.1 113 146-277 113-227 (232) 284 cd07044 CofD_YvcK Family of Co 63.3 11 0.00027 17.4 6.0 109 139-263 119-235 (309) 285 cd00881 GTP_translation_factor 63.2 11 0.00027 17.4 6.1 69 101-171 117-187 (189) 286 PRK12680 transcriptional regul 63.2 11 0.00027 17.4 7.1 102 142-267 91-198 (327) 287 cd06267 PBP1_LacI_sugar_bindin 63.1 11 0.00028 17.4 8.3 136 62-242 54-204 (264) 288 cd06281 PBP1_LacI_like_5 Ligan 62.9 11 0.00028 17.4 10.0 77 8-91 7-87 (269) 289 PRK10053 hypothetical protein; 62.8 11 0.00028 17.4 8.1 78 23-109 51-129 (130) 290 TIGR00263 trpB tryptophan synt 62.5 11 0.00028 17.3 4.1 38 177-221 197-237 (412) 291 pfam01177 Asp_Glu_race Asp/Glu 62.5 11 0.00028 17.3 7.9 90 145-246 106-203 (212) 292 cd01537 PBP1_Repressors_Sugar_ 62.1 11 0.00028 17.3 9.9 121 62-214 54-184 (264) 293 cd01139 TroA_f Periplasmic bin 62.1 11 0.00029 17.3 6.2 33 208-245 91-123 (342) 294 PRK09039 hypothetical protein; 62.0 11 0.00029 17.3 19.7 48 393-440 147-194 (343) 295 PRK06341 single-stranded DNA-b 61.4 9 0.00023 18.1 3.1 77 32-109 5-103 (165) 296 cd06306 PBP1_TorT-like TorT-li 61.3 12 0.00029 17.2 6.7 83 147-246 1-87 (268) 297 cd03522 MoeA_like MoeA_like. T 61.2 12 0.00029 17.1 7.1 107 97-224 119-233 (312) 298 PRK04346 tryptophan synthase s 61.1 4.7 0.00012 20.6 1.6 38 180-228 210-249 (392) 299 TIGR01357 aroB 3-dehydroquinat 60.9 12 0.0003 17.1 6.0 107 130-242 7-119 (361) 300 PRK10586 hypothetical protein; 60.8 12 0.0003 17.1 5.6 79 144-245 33-115 (362) 301 PRK13028 tryptophan synthase s 60.8 2.5 6.5E-05 23.0 0.1 64 156-230 178-254 (403) 302 PRK06027 purU formyltetrahydro 60.5 12 0.0003 17.0 10.4 88 113-216 62-172 (285) 303 cd04257 AAK_AK-HSDH AAK_AK-HSD 60.2 12 0.0003 17.0 9.7 41 215-258 5-50 (294) 304 PRK13055 putative lipid kinase 60.2 12 0.0003 17.0 9.3 110 144-275 1-118 (334) 305 PRK13175 consensus 60.0 12 0.00031 17.0 3.6 42 208-250 39-85 (206) 306 KOG1899 consensus 59.9 12 0.00031 17.0 18.4 93 251-350 83-177 (861) 307 cd01742 GATase1_GMP_Synthase T 59.8 11 0.00027 17.4 3.2 40 208-250 41-81 (181) 308 PRK12419 riboflavin synthase s 59.8 12 0.00031 17.0 10.0 95 140-248 5-114 (158) 309 pfam06189 5-nucleotidase 5'-nu 59.6 12 0.00031 16.9 4.7 144 35-194 67-231 (263) 310 cd07187 YvcK_like family of mo 59.3 12 0.00031 16.9 6.6 109 139-263 119-236 (308) 311 KOG1970 consensus 59.3 12 0.00031 17.0 3.4 77 95-178 63-142 (634) 312 cd03147 GATase1_Ydr533c_like T 59.2 12 0.00032 16.9 3.6 43 207-251 93-137 (231) 313 PRK12683 transcriptional regul 58.9 12 0.00032 16.8 6.0 96 141-259 90-190 (308) 314 PRK12478 enoyl-CoA hydratase; 58.7 13 0.00032 16.8 5.7 36 233-269 105-143 (298) 315 PRK01231 ppnK inorganic polyph 58.6 13 0.00032 16.8 6.1 82 145-245 4-91 (296) 316 TIGR01121 D_amino_aminoT D-ami 58.6 12 0.00031 17.0 3.3 32 8-40 57-91 (278) 317 cd06277 PBP1_LacI_like_1 Ligan 58.0 13 0.00033 16.7 8.1 20 11-31 12-31 (268) 318 pfam00318 Ribosomal_S2 Ribosom 57.9 13 0.00033 16.7 3.5 43 208-259 137-179 (205) 319 cd03576 NTR_PCOLCE NTR domain, 57.7 13 0.00033 16.7 6.3 65 26-93 14-92 (124) 320 TIGR02628 fuculo_kin_coli L-fu 57.3 13 0.00033 16.7 3.3 18 81-98 284-301 (473) 321 TIGR00763 lon ATP-dependent pr 57.2 13 0.00034 16.6 8.6 35 221-260 209-247 (941) 322 cd00764 Eukaryotic_PFK Phospho 57.0 13 0.00034 16.6 9.9 35 208-245 478-513 (762) 323 PRK13054 lipid kinase; Reviewe 56.8 13 0.00034 16.6 9.1 109 145-275 4-116 (299) 324 TIGR01415 trpB_rel pyridoxal-p 56.8 9 0.00023 18.1 2.4 154 34-221 89-261 (426) 325 PRK06642 single-stranded DNA-b 56.5 3.2 8.3E-05 22.1 0.1 79 31-110 4-109 (152) 326 PRK11637 hypothetical protein; 56.4 14 0.00035 16.5 22.1 10 283-292 30-39 (404) 327 pfam03100 CcmE CcmE. CcmE is t 56.4 14 0.00035 16.5 7.2 81 14-108 36-118 (130) 328 PRK07921 RNA polymerase sigma 56.4 14 0.00035 16.5 5.5 45 227-271 197-243 (320) 329 cd07022 S49_Sppa_36K_type Sign 55.9 14 0.00035 16.5 5.7 77 181-269 21-104 (214) 330 cd01889 SelB_euk SelB subfamil 55.8 14 0.00035 16.4 4.4 67 103-176 125-191 (192) 331 cd00763 Bacterial_PFK Phosphof 55.7 14 0.00035 16.4 9.8 50 209-270 93-142 (317) 332 cd01741 GATase1_1 Subgroup of 55.6 14 0.00035 16.4 7.0 86 147-251 1-93 (188) 333 pfam09370 TIM-br_sig_trns TIM- 55.6 7.9 0.0002 18.6 2.0 61 182-252 18-92 (268) 334 COG1619 LdcA Uncharacterized p 55.5 14 0.00036 16.4 5.4 93 159-256 190-290 (313) 335 TIGR01296 asd_B aspartate-semi 55.3 5.7 0.00015 19.9 1.2 113 62-203 63-183 (350) 336 cd05402 NT_PAP_TUTase Nucleoti 55.1 10 0.00027 17.5 2.5 88 157-267 4-101 (114) 337 cd06341 PBP1_ABC_ligand_bindin 55.0 14 0.00036 16.3 7.6 77 146-240 134-212 (341) 338 pfam08960 DUF1874 Domain of un 54.9 12 0.0003 17.1 2.7 23 228-252 25-47 (104) 339 COG0763 LpxB Lipid A disacchar 54.7 14 0.00037 16.3 8.1 50 217-269 254-309 (381) 340 PRK05416 hypothetical protein; 54.5 14 0.00037 16.3 6.0 110 142-267 2-111 (292) 341 pfam01207 Dus Dihydrouridine s 54.4 14 0.00036 16.4 3.1 15 230-244 173-187 (309) 342 PRK08259 enoyl-CoA hydratase; 54.1 15 0.00037 16.2 5.3 38 231-269 84-124 (254) 343 PRK03620 5-dehydro-4-deoxygluc 53.9 15 0.00037 16.2 5.8 74 154-247 109-184 (296) 344 PRK06490 glutamine amidotransf 53.8 15 0.00038 16.2 6.2 83 143-244 9-94 (243) 345 cd02801 DUS_like_FMN Dihydrour 53.7 13 0.00034 16.6 2.9 20 209-230 207-226 (231) 346 cd03141 GATase1_Hsp31_like Typ 53.6 13 0.00034 16.6 2.8 41 208-251 90-133 (221) 347 KOG1405 consensus 53.6 15 0.00038 16.2 6.2 90 135-232 217-331 (484) 348 PRK06246 fumarate hydratase; P 53.5 9.5 0.00024 17.9 2.1 50 208-260 133-196 (276) 349 PRK05813 single-stranded DNA-b 53.4 14 0.00036 16.4 2.9 133 34-187 10-158 (217) 350 PRK10361 DNA recombination pro 53.3 15 0.00038 16.1 20.7 20 483-503 241-260 (475) 351 pfam04076 BOF Bacterial OB fol 53.3 15 0.00038 16.1 8.1 68 33-109 58-125 (126) 352 KOG1029 consensus 53.2 15 0.00038 16.1 21.8 181 2-207 58-268 (1118) 353 PRK11151 DNA-binding transcrip 53.2 15 0.00038 16.1 6.1 95 142-260 89-188 (305) 354 PRK06843 inositol-5-monophosph 53.1 15 0.00038 16.1 6.3 58 148-221 168-228 (404) 355 cd06307 PBP1_uncharacterized_s 53.1 15 0.00038 16.1 9.6 210 8-261 7-230 (275) 356 cd01425 RPS2 Ribosomal protein 52.7 14 0.00035 16.4 2.8 44 207-259 126-169 (193) 357 KOG1885 consensus 52.5 15 0.00039 16.0 6.8 104 16-123 85-211 (560) 358 COG4019 Uncharacterized protei 52.3 15 0.00039 16.0 4.9 82 113-218 19-102 (156) 359 TIGR03316 ygeW probable carbam 52.2 16 0.0004 16.0 4.4 19 10-28 7-25 (357) 360 PRK10348 ribosome-associated h 52.1 16 0.0004 16.0 3.8 41 459-502 32-72 (133) 361 pfam00389 2-Hacid_dh D-isomer 51.9 16 0.0004 16.0 7.2 23 130-152 122-144 (313) 362 cd03134 GATase1_PfpI_like A ty 51.8 16 0.0004 16.0 4.4 40 208-251 62-104 (165) 363 cd06295 PBP1_CelR Ligand bindi 51.7 16 0.0004 15.9 7.5 19 10-30 20-38 (275) 364 cd01574 PBP1_LacI Ligand-bindi 51.7 16 0.0004 15.9 8.3 78 8-91 7-87 (264) 365 PRK10415 tRNA-dihydrouridine s 51.7 15 0.00039 16.0 2.9 14 133-146 94-107 (321) 366 PRK01045 ispH 4-hydroxy-3-meth 51.6 16 0.0004 15.9 8.4 105 143-269 153-270 (304) 367 pfam06266 HrpF HrpF protein. T 51.5 9.7 0.00025 17.8 1.9 22 211-232 25-46 (74) 368 cd04481 RPA1_DBD_B_like RPA1_D 51.5 16 0.0004 15.9 4.6 64 35-98 6-77 (106) 369 PRK12555 chemotaxis-specific m 51.4 16 0.0004 15.9 8.0 80 145-250 1-86 (340) 370 PRK10653 D-ribose transporter 51.2 16 0.00041 15.9 9.9 81 145-246 26-113 (295) 371 pfam01915 Glyco_hydro_3_C Glyc 51.1 16 0.00041 15.9 6.9 24 226-249 118-141 (223) 372 pfam00582 Usp Universal stress 51.0 14 0.00036 16.4 2.6 36 208-245 102-138 (140) 373 cd06318 PBP1_ABC_sugar_binding 50.8 16 0.00041 15.8 7.5 36 225-262 195-236 (282) 374 PHA01627 DNA binding protein 50.7 9.4 0.00024 17.9 1.7 22 228-251 27-48 (105) 375 PRK00132 rpsI 30S ribosomal pr 50.5 15 0.00038 16.2 2.7 31 187-223 45-75 (130) 376 PRK08402 replication factor A; 50.5 16 0.00042 15.8 4.8 83 23-106 64-156 (357) 377 pfam01933 UPF0052 Uncharacteri 50.4 16 0.00042 15.8 5.7 128 113-261 97-231 (291) 378 cd05776 DNA_polB_alpha_exo The 50.4 16 0.00042 15.8 6.1 52 185-249 80-131 (234) 379 PRK07455 keto-hydroxyglutarate 50.4 16 0.00042 15.8 5.9 83 147-248 15-114 (210) 380 TIGR01425 SRP54_euk signal rec 50.3 7.2 0.00018 19.0 1.1 142 76-253 112-301 (453) 381 pfam00380 Ribosomal_S9 Ribosom 50.2 16 0.00042 15.8 3.1 31 186-222 35-65 (121) 382 PRK08284 precorrin 6A synthase 50.0 17 0.00042 15.7 8.3 68 183-259 10-78 (253) 383 PRK02102 ornithine carbamoyltr 49.9 17 0.00042 15.7 4.9 10 238-247 121-130 (331) 384 PRK00197 proA gamma-glutamyl p 49.9 17 0.00042 15.7 3.2 25 209-245 197-222 (416) 385 PRK13288 pyrophosphatase PpaX; 49.8 17 0.00043 15.7 11.1 105 146-269 99-205 (214) 386 PRK13396 3-deoxy-7-phosphohept 49.6 17 0.00043 15.7 4.1 69 186-262 220-291 (352) 387 PRK13586 1-(5-phosphoribosyl)- 49.6 17 0.00043 15.7 4.2 73 159-245 147-229 (231) 388 PTZ00195 60S ribosomal protein 49.5 17 0.00043 15.7 3.2 87 8-105 55-146 (198) 389 KOG0250 consensus 49.2 17 0.00043 15.6 22.3 16 95-110 493-508 (1074) 390 PRK12820 bifunctional aspartyl 48.6 17 0.00044 15.6 9.0 76 33-110 19-106 (706) 391 pfam07411 DUF1508 Domain of un 48.6 12 0.00032 16.8 2.1 18 49-66 2-19 (49) 392 pfam04555 XhoI Restriction end 48.4 13 0.00034 16.6 2.2 127 18-160 24-188 (196) 393 pfam06148 COG2 COG (conserved 48.3 17 0.00045 15.5 13.7 68 251-328 11-78 (133) 394 cd06285 PBP1_LacI_like_7 Ligan 48.3 17 0.00045 15.5 9.5 12 18-29 15-26 (265) 395 TIGR02407 ectoine_ectB diamino 48.3 16 0.00041 15.9 2.5 58 161-228 180-250 (413) 396 pfam12004 DUF3498 Domain of un 48.2 18 0.00045 15.5 13.3 43 13-60 29-73 (489) 397 COG3206 GumC Uncharacterized p 47.9 18 0.00045 15.5 16.1 22 431-452 378-399 (458) 398 COG0205 PfkA 6-phosphofructoki 47.8 18 0.00045 15.5 7.4 47 190-252 83-129 (347) 399 PRK02228 V-type ATP synthase s 47.7 18 0.00045 15.5 6.2 61 187-259 29-90 (100) 400 PRK05599 hypothetical protein; 47.6 18 0.00046 15.5 7.8 75 150-232 27-101 (246) 401 PRK04284 ornithine carbamoyltr 47.5 18 0.00046 15.4 5.4 28 1-28 1-28 (332) 402 TIGR02636 galM_Leloir galactos 47.4 5.2 0.00013 20.3 -0.1 98 50-157 167-268 (346) 403 PRK13143 hisH imidazole glycer 47.1 18 0.00046 15.4 4.2 35 209-244 40-76 (201) 404 smart00537 DCX Domain in the D 46.9 16 0.00042 15.8 2.5 39 448-486 36-76 (89) 405 pfam04577 DUF563 Protein of un 46.5 18 0.00047 15.3 9.5 130 86-245 102-243 (246) 406 pfam11576 DUF3236 Protein of u 46.5 18 0.00047 15.3 4.5 81 113-217 18-100 (154) 407 COG0788 PurU Formyltetrahydrof 46.5 18 0.00047 15.3 11.1 89 113-217 64-175 (287) 408 COG0133 TrpB Tryptophan syntha 46.1 8.7 0.00022 18.3 0.9 38 181-229 207-246 (396) 409 COG1322 Predicted nuclease of 45.9 19 0.00048 15.2 20.1 48 444-502 200-247 (448) 410 PRK08252 enoyl-CoA hydratase; 45.8 19 0.00048 15.2 5.1 77 186-269 31-122 (254) 411 cd06271 PBP1_AglR_RafR_like Li 45.7 19 0.00048 15.2 8.1 202 9-259 12-225 (268) 412 cd06312 PBP1_ABC_sugar_binding 45.7 19 0.00048 15.2 8.8 14 232-245 74-87 (271) 413 PRK05395 3-dehydroquinate dehy 45.6 19 0.00048 15.2 7.9 71 156-245 25-98 (143) 414 COG2759 MIS1 Formyltetrahydrof 45.5 19 0.00049 15.2 5.5 153 90-253 181-383 (554) 415 cd06320 PBP1_allose_binding Pe 45.3 19 0.00049 15.2 9.4 34 147-180 1-38 (275) 416 PRK11175 universal stress prot 45.3 19 0.00049 15.2 4.7 56 181-249 90-147 (304) 417 cd06282 PBP1_GntR_like_2 Ligan 45.0 19 0.00049 15.1 9.9 22 8-31 7-28 (266) 418 PRK07765 para-aminobenzoate sy 44.9 19 0.00049 15.1 3.5 16 223-238 199-214 (221) 419 cd00466 DHQase_II Dehydroquina 44.7 20 0.0005 15.1 7.5 69 158-245 25-96 (140) 420 PRK12679 cbl transcriptional r 44.6 20 0.0005 15.1 5.1 94 143-259 92-190 (316) 421 cd06323 PBP1_ribose_binding Pe 44.5 20 0.0005 15.1 9.4 12 233-244 73-84 (268) 422 cd06290 PBP1_LacI_like_9 Ligan 44.4 20 0.0005 15.1 8.2 12 18-29 15-26 (265) 423 cd01541 PBP1_AraR Ligand-bindi 44.3 20 0.0005 15.1 9.6 21 8-30 7-27 (273) 424 pfam00885 DMRL_synthase 6,7-di 44.2 20 0.0005 15.1 10.0 90 145-248 3-107 (144) 425 KOG0970 consensus 44.0 20 0.00051 15.0 5.7 96 132-249 559-656 (1429) 426 COG3596 Predicted GTPase [Gene 44.0 20 0.00051 15.0 9.5 135 129-276 22-183 (296) 427 PRK13057 putative lipid kinase 43.9 20 0.00051 15.0 5.6 91 157-274 13-105 (287) 428 COG1379 PHP family phosphoeste 43.9 20 0.00051 15.0 3.2 16 247-262 168-183 (403) 429 pfam05681 Fumerase Fumarate hy 43.7 18 0.00046 15.4 2.2 49 208-256 128-190 (271) 430 PRK01713 ornithine carbamoyltr 43.7 20 0.00051 15.0 5.2 20 8-27 9-28 (334) 431 cd00408 DHDPS-like Dihydrodipi 43.6 20 0.00052 15.0 6.2 184 6-260 6-192 (281) 432 KOG0042 consensus 43.1 16 0.0004 15.9 1.9 136 129-276 320-495 (680) 433 KOG3598 consensus 43.1 21 0.00052 14.9 9.1 30 139-168 1357-1386(2220) 434 cd00452 KDPG_aldolase KDPG and 42.8 21 0.00053 14.9 6.1 83 147-248 6-105 (190) 435 TIGR03133 malonate_beta malona 42.7 21 0.00053 14.9 7.2 25 209-238 160-184 (274) 436 COG4607 CeuA ABC-type enteroch 42.6 19 0.00048 15.3 2.2 13 150-162 123-135 (320) 437 PRK00856 pyrB aspartate carbam 42.5 21 0.00053 14.8 5.7 27 2-28 2-28 (304) 438 PRK13254 cytochrome c-type bio 42.4 21 0.00053 14.8 7.9 104 14-135 37-144 (149) 439 cd04246 AAK_AK-DapG-like AAK_A 42.1 21 0.00054 14.8 4.8 41 212-254 2-48 (239) 440 COG3481 Predicted HD-superfami 42.0 14 0.00037 16.3 1.5 133 49-200 17-156 (287) 441 TIGR02518 EutH_ACDH acetaldehy 41.9 21 0.00054 14.8 7.4 183 18-239 226-449 (528) 442 COG2884 FtsE Predicted ATPase 41.9 21 0.00054 14.8 2.4 130 88-242 17-158 (223) 443 cd01836 FeeA_FeeB_like SGNH_hy 41.8 21 0.00054 14.8 6.0 74 145-229 2-85 (191) 444 cd01483 E1_enzyme_family Super 41.8 21 0.00054 14.8 4.4 80 164-273 60-142 (143) 445 cd04722 TIM_phosphate_binding 41.8 21 0.00054 14.8 7.5 86 159-259 101-189 (200) 446 COG4126 Hydantoin racemase [Am 41.7 21 0.00055 14.8 9.0 31 208-247 174-204 (230) 447 PRK13435 response regulator; P 41.4 22 0.00055 14.7 9.1 72 154-249 8-83 (141) 448 PRK04863 mukB cell division pr 41.3 22 0.00055 14.7 22.2 65 366-430 1028-1099(1486) 449 PRK13141 hisH imidazole glycer 41.3 22 0.00055 14.7 4.2 35 209-244 39-77 (204) 450 PRK04820 rnpA ribonuclease P; 41.2 22 0.00055 14.7 5.2 24 204-227 81-104 (145) 451 cd01748 GATase1_IGP_Synthase T 41.1 22 0.00056 14.7 5.3 42 208-250 36-82 (198) 452 PRK10014 DNA-binding transcrip 41.1 22 0.00056 14.7 10.4 113 119-251 37-154 (342) 453 PRK00742 chemotaxis-specific m 41.1 22 0.00056 14.7 7.2 73 154-250 9-87 (345) 454 TIGR00960 3a0501s02 Type II (G 40.9 15 0.00039 16.1 1.5 100 89-201 19-130 (216) 455 PRK04147 N-acetylneuraminate l 40.7 22 0.00056 14.6 3.4 42 154-195 113-155 (294) 456 KOG3091 consensus 40.6 22 0.00056 14.6 15.6 102 226-340 292-397 (508) 457 cd03791 GT1_Glycogen_synthase_ 40.5 22 0.00057 14.6 7.1 81 174-267 298-417 (476) 458 pfam04012 PspA_IM30 PspA/IM30 40.4 22 0.00057 14.6 20.1 19 434-452 166-184 (220) 459 cd07020 Clp_protease_NfeD_1 No 40.3 22 0.00057 14.6 5.6 73 185-269 13-92 (187) 460 PRK05299 rpsB 30S ribosomal pr 40.2 22 0.00057 14.6 6.7 66 208-297 157-222 (255) 461 PRK10161 transcriptional regul 40.1 22 0.00057 14.6 7.9 106 144-274 1-113 (229) 462 PRK05718 keto-hydroxyglutarate 39.9 23 0.00058 14.5 6.2 85 146-248 16-116 (212) 463 PRK02842 light-independent pro 39.8 23 0.00058 14.5 6.2 163 63-256 24-218 (425) 464 TIGR02038 protease_degS peripl 39.8 23 0.00058 14.5 3.4 42 461-502 84-126 (358) 465 PRK13951 bifunctional shikimat 39.7 23 0.00058 14.5 5.7 96 142-259 182-277 (488) 466 COG1660 Predicted P-loop-conta 39.6 23 0.00058 14.5 6.3 104 147-269 2-110 (286) 467 pfam04204 HTS Homoserine O-suc 39.5 23 0.00058 14.5 6.6 100 124-231 11-118 (298) 468 cd02115 AAK Amino Acid Kinases 39.5 16 0.00041 15.8 1.5 104 146-252 30-141 (248) 469 TIGR02093 P_ylase glycogen/sta 39.4 6.5 0.00017 19.4 -0.6 74 155-258 285-364 (822) 470 pfam06705 SF-assemblin SF-asse 39.1 23 0.00059 14.4 22.5 27 439-465 218-245 (247) 471 pfam09726 Macoilin Transmembra 39.1 23 0.00059 14.4 19.7 14 281-294 411-424 (680) 472 TIGR03597 GTPase_YqeH ribosome 38.9 23 0.00059 14.4 6.1 89 116-225 109-198 (360) 473 cd06273 PBP1_GntR_like_1 This 38.5 24 0.0006 14.4 8.3 77 8-92 7-87 (268) 474 pfam02765 Telo_bind Telomeric 38.5 24 0.0006 14.4 10.0 71 34-106 14-95 (141) 475 COG1350 Predicted alternative 38.3 24 0.0006 14.4 3.1 109 83-221 151-267 (432) 476 COG4972 PilM Tfp pilus assembl 38.3 24 0.0006 14.4 3.4 75 96-170 29-114 (354) 477 TIGR03017 EpsF chain length de 38.3 24 0.00061 14.3 22.5 21 114-134 67-87 (444) 478 PRK13226 phosphoglycolate phos 38.0 24 0.00061 14.3 5.0 102 146-268 104-210 (221) 479 cd06272 PBP1_hexuronate_repres 38.0 24 0.00061 14.3 6.9 193 8-259 7-216 (261) 480 PRK12564 carbamoyl phosphate s 37.9 24 0.00061 14.3 5.8 86 140-250 166-253 (355) 481 smart00424 STE STE like transc 37.9 24 0.00061 14.3 2.3 77 55-163 7-83 (111) 482 COG0277 GlcD FAD/FMN-containin 37.9 24 0.00061 14.3 3.0 25 440-464 432-456 (459) 483 cd00147 cPLA2_like Cytosolic p 37.7 10 0.00026 17.7 0.2 32 237-268 176-207 (438) 484 pfam01735 PLA2_B Lysophospholi 37.7 11 0.00027 17.5 0.3 22 238-259 154-175 (490) 485 cd06315 PBP1_ABC_sugar_binding 37.6 24 0.00062 14.3 10.0 79 146-245 1-86 (280) 486 PRK10696 C32 tRNA thiolase; Pr 37.6 24 0.00062 14.3 3.6 36 146-181 40-77 (311) 487 cd01967 Nitrogenase_MoFe_alpha 37.5 24 0.00062 14.3 8.8 116 144-275 87-214 (406) 488 pfam02552 CO_dh CO dehydrogena 37.5 24 0.00062 14.3 4.0 61 173-248 11-71 (168) 489 PRK08251 short chain dehydroge 37.5 24 0.00062 14.2 8.2 90 112-230 13-103 (248) 490 PRK00087 4-hydroxy-3-methylbut 37.5 24 0.00062 14.2 4.7 213 15-275 13-272 (670) 491 PRK12681 cysB transcriptional 37.4 24 0.00062 14.2 4.0 92 145-259 94-190 (324) 492 PRK07211 replication factor A; 37.4 24 0.00062 14.2 5.4 73 31-106 171-252 (473) 493 PRK13165 cytochrome c-type bio 37.3 24 0.00062 14.2 6.4 72 15-93 38-116 (161) 494 pfam01220 DHquinase_II Dehydro 37.3 25 0.00063 14.2 7.5 73 153-244 24-96 (140) 495 COG1922 WecG Teichoic acid bio 37.2 25 0.00063 14.2 8.9 90 146-253 109-199 (253) 496 PRK12595 bifunctional 3-deoxy- 37.1 25 0.00063 14.2 4.2 76 174-262 121-204 (360) 497 PRK13527 glutamine amidotransf 37.0 25 0.00063 14.2 4.3 65 180-247 7-83 (196) 498 pfam03607 DCX Doublecortin. 37.0 25 0.00063 14.2 2.1 49 439-487 5-53 (60) 499 COG5140 UFD1 Ubiquitin fusion- 36.7 25 0.00064 14.2 2.8 50 449-509 135-184 (331) 500 PRK07413 hypothetical protein; 36.6 25 0.00064 14.1 4.0 54 159-229 145-199 (382) No 1 >PRK00286 xseA exodeoxyribonuclease VII large subunit; Reviewed Probab=100.00 E-value=0 Score=1023.07 Aligned_cols=438 Identities=40% Similarity=0.669 Sum_probs=416.4 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEE Q ss_conf 89862299999999999740017189999970543568886279998748947999997352105866814598899999 Q gi|254780791|r 11 DHPEYSVSELSYHLKHIVESNLSHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIG 90 (529) Q Consensus 11 ~~~~~svs~l~~~i~~~l~~~~~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g 90 (529) ...+|||||||.+||.+||.+|++|||+||||||+ +|+||||||+|||++|+|+||||++.+.+++|.|++||+|++.| T Consensus 2 ~~~~lsVsel~~~ik~~le~~~~~v~V~GEIS~~~-~~~sGH~YF~LkD~~a~i~~v~~~~~~~~l~~~~~~G~~V~v~g 80 (443) T PRK00286 2 MDNPLSVSELNGYVKSLLERDLGQVWVRGEISNFT-RPSSGHWYFTLKDEQAQIRCVMFRGSARRLKFKPEEGMQVLVRG 80 (443) T ss_pred CCCCEEHHHHHHHHHHHHHHHCCCEEEEEEECCCE-ECCCCEEEEEEEECCCEEEEEEECCHHHHCCCCCCCCCEEEEEE T ss_conf 99863099999999999983489899999964666-28996499999818928999999284752899998999999999 Q ss_pred EEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHCC Q ss_conf 96675288437999997101680079999999999765401226100163102652899984784258999999863059 Q gi|254780791|r 91 KITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCRF 170 (529) Q Consensus 91 ~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r~ 170 (529) +++||+|+|+|||+|++|+|+|+|+|+++||++|+||++|||||++||+|||.||++||||||++|||++||++++++|| T Consensus 81 ~i~~Y~~~g~~ql~v~~i~~~g~G~L~~~~e~lk~kL~~eGlFd~~~Kk~lP~~P~~IgvITS~tgAa~~Di~~~~~~R~ 160 (443) T PRK00286 81 KVSVYEPRGDYQLIVEDIEPAGIGALAQAFEQLKEKLAAEGLFDPERKKPLPFFPKRIGVITSPTGAAIRDILTVLSRRF 160 (443) T ss_pred EEEEECCCCCEEEEEEEEEECCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHCCC T ss_conf 99898688618999978127792189999999999997777968777899987855799983684389999999985049 Q ss_pred C-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECC Q ss_conf 7-589997210011110367999999997410035767775899951688844422007699999997489048852057 Q gi|254780791|r 171 P-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGH 249 (529) Q Consensus 171 p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGH 249 (529) | +++++||++|||++|+.+|++||+.+|... .+|||||+|||||+|||||||||.||||||+|+||||||||| T Consensus 161 p~~~i~l~p~~VQG~~A~~~I~~ai~~~~~~~------~~DvIIi~RGGGS~eDL~~FNdE~varaI~~s~iPVISaIGH 234 (443) T PRK00286 161 PSVEVIIYPTLVQGEGAAASIVEAIERANARG------EVDVLIVARGGGSLEDLWAFNDEAVARAIAASKIPVISAVGH 234 (443) T ss_pred CCEEEEEEECCCCHHHHHHHHHHHHHHHHHHC------CCCEEEEECCCCCHHHHHHCCCHHHHHHHHHCCCCEEECCCC T ss_conf 96599998145626547999999999985224------888899936878888976518799999998489978951466 Q ss_pred CCCCHHHHHHHCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC--CHHHHHHHHHHH Q ss_conf 775258988641237772145676332346777669999988877899888999985567765304--978898678887 Q gi|254780791|r 250 ETDWTLADYAADLRAPTPTGAAEMAVPVKEHLQSSLINLEARLNNIIIRLIKYKINTLNSLLKALP--NSDQTLSCSRYR 327 (529) Q Consensus 250 E~D~Tl~D~VAD~Ra~TPTaAAElavp~~~EL~~~L~~l~~RL~~a~~~~l~~~~q~L~~l~r~L~--~~~~~L~~~~Qr 327 (529) |||+||+|||||+||||||||||+++|+..++.+.+.++..+|...+.+.++..+++|+.+.+++. +|...+....++ T Consensus 235 E~D~Ti~D~VAD~Ra~TPTaAAe~~~p~~~e~~~~l~~~~~~l~~~~~~~l~~~~~~L~~l~~~l~~~~p~~~l~~~~~~ 314 (443) T PRK00286 235 ETDFTIADFVADLRAPTPTAAAELVVPDQAELLQRLQQLRQRLARALRRYLKQKEQRLDQLAHRLKFQSPQRRLDQQQQR 314 (443) T ss_pred CCCCCHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 77755888865403799699999865319999999999999999999999999999999999654236999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 88865456899999987568899999841011346889999999999999999988765556788887767777788750 Q gi|254780791|r 328 LDRLPRELEHNLEIIIFRKYRHFNHIINRIKSDFLIHNIQKNRQHILEKQQHIEQIVKQHLRYIFLEKKEKIAILHMLRE 407 (529) Q Consensus 328 Ld~~~~rL~~~l~~~l~~~~~~l~~~~~rL~~~~~~~~l~~~r~~L~~~q~~L~q~~~~~l~~~~~~~~~~~~~l~~l~~ 407 (529) |+.+..+|.+.+...+...+..++.+..+|.. T Consensus 315 l~~~~~~L~~~~~~~l~~~~~~l~~l~~~L~~------------------------------------------------ 346 (443) T PRK00286 315 LDELRQRLRRALRRQLRQARQRLERLSQRLQN------------------------------------------------ 346 (443) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC------------------------------------------------ T ss_conf 99999999999999999999999999987538------------------------------------------------ Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCCEEEEECCCCCEECCHHHCCCCCEEEE Q ss_conf 26889999999999999999999999999999999999998871696777314619998489889577789299986999 Q gi|254780791|r 408 QTKNRIFYLHTHIKKLITRIEFILSHKIKSCHTSVSITTRILQSFAYKNTLKRGYTSIQDTNNNFITQKRNLATKTRILI 487 (529) Q Consensus 408 ~~~~~l~~~~~~l~~l~~rL~~~~~~~L~~~~~rL~~l~~~L~slsP~~~L~RGYaiv~~~~GkiI~s~~~l~~gd~i~i 487 (529) +...+...+.++..+..++...+.+.++.++++|..+.+.|+++||+++|+||||||++.+|++|+|+++++.||.|++ T Consensus 347 -~~~~i~~~~~~l~~~~~~l~~~~~~~l~~~~~~l~~~~~~L~~lsP~~~L~RGYaiv~~~~gkiI~s~~~l~~gd~i~i 425 (443) T PRK00286 347 -PQPRIARAQQRLEQLANRLLQAMQRRLKRKRQRLEALAQQLEALSPLATLARGYAIVRDEDGKVIRSAKQLKPGDRLTI 425 (443) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCCEEEEECCCCCEECCHHHCCCCCEEEE T ss_conf -6778999999999999999999999999999999999999995895788749559999299999748899799898999 Q ss_pred EEECEEEEEEEEECCCC Q ss_conf 99110999999505788 Q gi|254780791|r 488 NFFDGQANAIVINKAPP 504 (529) Q Consensus 488 ~l~DG~v~a~V~~k~~~ 504 (529) +|+||.+.|+|+++.|. T Consensus 426 ~~~DG~i~a~V~~i~~~ 442 (443) T PRK00286 426 RLADGEVDAEVTDVQPA 442 (443) T ss_pred EEECCEEEEEEEEEECC T ss_conf 99361899999997189 No 2 >COG1570 XseA Exonuclease VII, large subunit [DNA replication, recombination, and repair] Probab=100.00 E-value=0 Score=1028.29 Aligned_cols=436 Identities=41% Similarity=0.667 Sum_probs=416.7 Q ss_pred CCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEE Q ss_conf 88986229999999999974001718999997054356888627999874894799999735210586681459889999 Q gi|254780791|r 10 LDHPEYSVSELSYHLKHIVESNLSHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVI 89 (529) Q Consensus 10 ~~~~~~svs~l~~~i~~~l~~~~~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~ 89 (529) ++.++|||||||.+||.+||.+|++|||+||||||+ .|+|||+||+|||+.|+|+|+||++++.+++|.|++||+|+|. T Consensus 1 ~~~~~~sVSeln~~ik~llE~~~~~V~v~GEISn~t-~~~sgH~YFtLKD~~A~i~c~mf~~~~~~l~f~p~eG~~V~v~ 79 (440) T COG1570 1 SPEKILSVSELNDYIKRLLERDLGQVWVRGEISNFT-RPASGHLYFTLKDERAQIRCVMFKGNNRRLKFRPEEGMQVLVR 79 (440) T ss_pred CCCCEECHHHHHHHHHHHHHHCCCEEEEEEEECCCC-CCCCCCEEEEECCCCCEEEEEEECCCCCCCCCCCCCCCEEEEE T ss_conf 998705199999999999984078599999963776-6888607999716771689999747000268786679889999 Q ss_pred EEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHC Q ss_conf 99667528843799999710168007999999999976540122610016310265289998478425899999986305 Q gi|254780791|r 90 GKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCR 169 (529) Q Consensus 90 g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r 169 (529) |+|++|+|+|.|||+++.|+|+|+|+|+++||+||+||.+|||||+++|+|||.+|++||||||+||||++||++++++| T Consensus 80 G~is~Y~~rG~YQi~~~~~~p~G~G~L~~~~E~lK~kL~aEGlFd~~~KkpLP~~p~~IGVITS~tgAairDIl~~~~rR 159 (440) T COG1570 80 GKISLYEPRGDYQIVAESMEPAGLGALYLAFEQLKAKLAAEGLFDPERKKPLPFFPKKIGVITSPTGAALRDILHTLSRR 159 (440) T ss_pred EEEEEECCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHCCCCCCCCCEEEEECCCCHHHHHHHHHHHHHH T ss_conf 77888769885699983278287258999999999999867786832269999998868998187058999999998753 Q ss_pred CC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEEC Q ss_conf 97-58999721001111036799999999741003576777589995168884442200769999999748904885205 Q gi|254780791|r 170 FP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIG 248 (529) Q Consensus 170 ~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIG 248 (529) || ++|++||++|||++|+.+|++||+.+|.+.+ +|||||+|||||+|||||||||.||||||+|+|||||||| T Consensus 160 ~P~~~viv~pt~VQG~~A~~eIv~aI~~an~~~~------~DvlIVaRGGGSiEDLW~FNdE~vaRAi~~s~iPvISAVG 233 (440) T COG1570 160 FPSVEVIVYPTLVQGEGAAEEIVEAIERANQRGD------VDVLIVARGGGSIEDLWAFNDEIVARAIAASRIPVISAVG 233 (440) T ss_pred CCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCC------CCEEEEECCCCHHHHHHCCCHHHHHHHHHHCCCCEEEECC T ss_conf 8887299982144178709999999998634589------9889991686348888615809999999807998584066 Q ss_pred CCCCCHHHHHHHCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC--CHHHHHHHHHH Q ss_conf 7775258988641237772145676332346777669999988877899888999985567765304--97889867888 Q gi|254780791|r 249 HETDWTLADYAADLRAPTPTGAAEMAVPVKEHLQSSLINLEARLNNIIIRLIKYKINTLNSLLKALP--NSDQTLSCSRY 326 (529) Q Consensus 249 HE~D~Tl~D~VAD~Ra~TPTaAAElavp~~~EL~~~L~~l~~RL~~a~~~~l~~~~q~L~~l~r~L~--~~~~~L~~~~Q 326 (529) ||||+||+|||||+||||||||||+++|+..|+.+++..++.||.+++.+.+..+++.+..+.+++. +|...+...+| T Consensus 234 HEtD~tL~DfVAD~RApTPTaAAE~~vP~~~el~~~l~~~~~rL~~~~~~~l~~~~~~l~~l~~~l~~~~p~~~l~~~~q 313 (440) T COG1570 234 HETDFTLADFVADLRAPTPTAAAELVVPDSAELLQQLDQLQRRLHRALRRLLDQKKQRLEHLARRLQFRSPERLLSEQQQ 313 (440) T ss_pred CCCCCCHHHHHHHCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 67775287764313479966878885878999999999999999999999999999999999987764497889989999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 78886545689999998756889999984101134688999999999999999998876555678888776777778875 Q gi|254780791|r 327 RLDRLPRELEHNLEIIIFRKYRHFNHIINRIKSDFLIHNIQKNRQHILEKQQHIEQIVKQHLRYIFLEKKEKIAILHMLR 406 (529) Q Consensus 327 rLd~~~~rL~~~l~~~l~~~~~~l~~~~~rL~~~~~~~~l~~~r~~L~~~q~~L~q~~~~~l~~~~~~~~~~~~~l~~l~ 406 (529) ++|++..+|.+.+...+...+++++.+..||.. T Consensus 314 ~ld~~~~rL~~~l~~~~~~~~~~~~~l~~rl~~----------------------------------------------- 346 (440) T COG1570 314 RLDELAIRLRRALENQLALKKQRLERLTQRLNP----------------------------------------------- 346 (440) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------------------------------------------- T ss_conf 999999999999999999988888777777778----------------------------------------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCCEEEEECCCCCEECCHHHCCCCCEEE Q ss_conf 02688999999999999999999999999999999999999887169677731461999848988957778929998699 Q gi|254780791|r 407 EQTKNRIFYLHTHIKKLITRIEFILSHKIKSCHTSVSITTRILQSFAYKNTLKRGYTSIQDTNNNFITQKRNLATKTRIL 486 (529) Q Consensus 407 ~~~~~~l~~~~~~l~~l~~rL~~~~~~~L~~~~~rL~~l~~~L~slsP~~~L~RGYaiv~~~~GkiI~s~~~l~~gd~i~ 486 (529) .+..+++++..+..++..++.+.++..+++|+.+...|++|||.+||+||||+|++.+|++|+|+++++.||.|+ T Consensus 347 -----~~~~~~~~~~~~~~~l~~~~~~~l~~~~~~l~~l~~~L~~Lsp~~~L~RGyavv~~~~~~~i~s~~~l~~gd~l~ 421 (440) T COG1570 347 -----QIQRQQQRLQQLERRLDKALRRQLKRKRERLEALVEQLESLSPLATLARGYAVVRKENGQVIKSVADLKAGDRLT 421 (440) T ss_pred -----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCEEEEECCCCCEECCHHHCCCCCEEE T ss_conf -----898888888889988889999999999999999999997179145664744899826877540186669999899 Q ss_pred EEEECEEEEEEEEECCCC Q ss_conf 999110999999505788 Q gi|254780791|r 487 INFFDGQANAIVINKAPP 504 (529) Q Consensus 487 i~l~DG~v~a~V~~k~~~ 504 (529) ++|+||.+.++|+++.+. T Consensus 422 ~~~~DG~v~a~v~~~~~~ 439 (440) T COG1570 422 LRLADGEVDAEVKGVQTE 439 (440) T ss_pred EEECCCCEEEEEEECCCC T ss_conf 997377189999621468 No 3 >pfam02601 Exonuc_VII_L Exonuclease VII, large subunit. This family consist of exonuclease VII, large subunit EC:3.1.11.6 This enzyme catalyses exonucleolytic cleavage in either 5'-3' or 3'-5' direction to yield 5'-phosphomononucleotides. This exonuclease VII enzyme is composed of one large subunit and 4 small ones. Probab=100.00 E-value=0 Score=662.42 Aligned_cols=293 Identities=37% Similarity=0.549 Sum_probs=258.8 Q ss_pred CCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 2261001631026528999847842589999998630597-589997210011110367999999997410035767775 Q gi|254780791|r 132 LFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPD 210 (529) Q Consensus 132 lfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D 210 (529) |||++||+|||.||++||||||++|||++||++++++||| +++++||++|||++|+.+|++||..+|.+.. ..+|| T Consensus 1 LFd~~~Kk~lP~~p~~IgvITS~~gAa~~Di~~~~~~r~p~~~i~l~p~~VQG~~a~~~I~~ai~~~~~~~~---~~~~D 77 (295) T pfam02601 1 LFDPNRKRPLPRFPKRIAVITSATGAAYQDFLRTARRRGPLVEIEIYPTLVQGDGAAESIVSALERANERET---ALDYD 77 (295) T ss_pred CCCCCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHCCC---CCCCC T ss_conf 988212899999999899984894089999999999819996799947357650489999999999984689---89983 Q ss_pred EEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHCCCCCCCHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 89995168884442200769999999748904885205777525898864123777214567633234677766999998 Q gi|254780791|r 211 IIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYAADLRAPTPTGAAEMAVPVKEHLQSSLINLEA 290 (529) Q Consensus 211 ~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~VAD~Ra~TPTaAAElavp~~~EL~~~L~~l~~ 290 (529) ||||+|||||+|||||||||.|||+||+|+||||||||||||+||+|||||+||||||||||+++|++.++.+++.++.. T Consensus 78 viii~RGGGS~eDL~~FN~e~laraI~~~~iPvisaIGHEtD~Tl~D~VAD~Ra~TPTaAAe~ivp~~~~l~~~l~~~~~ 157 (295) T pfam02601 78 VIVIIRGGGSKEDLWVFNDEELARAIANSPIPVITGIGHETDTTIADLVADVRAATPTAAAELLVPDRTELLQKLEGLEQ 157 (295) T ss_pred EEEEECCCCCHHHHHHCCHHHHHHHHHHCCCCEEECCCCCCCCCHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 89995786888987455889999999838998780678899961888887500589899999965449999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 88778998889999855677653049-78898678887888654568999999875688999998410113468899999 Q gi|254780791|r 291 RLNNIIIRLIKYKINTLNSLLKALPN-SDQTLSCSRYRLDRLPRELEHNLEIIIFRKYRHFNHIINRIKSDFLIHNIQKN 369 (529) Q Consensus 291 RL~~a~~~~l~~~~q~L~~l~r~L~~-~~~~L~~~~QrLd~~~~rL~~~l~~~l~~~~~~l~~~~~rL~~~~~~~~l~~~ 369 (529) +|...+.+.++..+++|+.+..++.. +...+...+++|+.+..+|...+...+...+.+++... T Consensus 158 ~l~~~~~~~l~~~~~~L~~l~~~l~~~~~~~l~~~~~~L~~~~~rl~~~~~~~l~~~~~~L~~~~--------------- 222 (295) T pfam02601 158 RLSRALKNRLEKEQDRLNLLRERLKSLSRRKLEQHEERLAELYQRLESSIQNLLSRKQSRLERLK--------------- 222 (295) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------------- T ss_conf 99999999999999999999975147829999999999999999999999999999999999999--------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999988765556788887767777788750268899999999999999999999999999999999999988 Q gi|254780791|r 370 RQHILEKQQHIEQIVKQHLRYIFLEKKEKIAILHMLREQTKNRIFYLHTHIKKLITRIEFILSHKIKSCHTSVSITTRIL 449 (529) Q Consensus 370 r~~L~~~q~~L~q~~~~~l~~~~~~~~~~~~~l~~l~~~~~~~l~~~~~~l~~l~~rL~~~~~~~L~~~~~rL~~l~~~L 449 (529) .++.. ..+.+...++++..+.++| T Consensus 223 -------------------------------------------------------~~~~~-~~~~~~~~~~~l~~l~~~L 246 (295) T pfam02601 223 -------------------------------------------------------LNREL-EKNSVLENKLALENLTAQL 246 (295) T ss_pred -------------------------------------------------------HHHHH-HHHHHHHHHHHHHHHHHHH T ss_conf -------------------------------------------------------98888-9999999999999999999 Q ss_pred HHCCHHHHHCCCEEEEECCCCCEECCHHHCCCCCEEEEEEECEEEEEEE Q ss_conf 7169677731461999848988957778929998699999110999999 Q gi|254780791|r 450 QSFAYKNTLKRGYTSIQDTNNNFITQKRNLATKTRILINFFDGQANAIV 498 (529) Q Consensus 450 ~slsP~~~L~RGYaiv~~~~GkiI~s~~~l~~gd~i~i~l~DG~v~a~V 498 (529) +++||+++|+||||||++++|++|+|+++++.||.|+++|+||++.|+| T Consensus 247 ~~lsP~~vL~RGYaiv~~~~gkiI~s~~~l~~gd~l~i~~~DG~i~a~V 295 (295) T pfam02601 247 KALSPLKTLKRGFAIVRRKDGKIVTSAAELKPGDNLEIRLADGSAQAKV 295 (295) T ss_pred HHCCHHHHHHCCEEEEECCCCCEECCHHHCCCCCEEEEEEECEEEEEEC T ss_conf 9679677984860899969999974889979999799999440799879 No 4 >TIGR00237 xseA exodeoxyribonuclease VII, large subunit; InterPro: IPR003753 Exonuclease VII 3.1.11.6 from EC is composed of two nonidentical subunits; one large subunit and 4 small ones . Exonuclease VII catalyses exonucleolytic cleavage in either 5'-3' or 3'-5' direction to yield 5'-phosphomononucleotides. The large subunit also contains the OB-fold domains (IPR004365 from INTERPRO) that bind to nucleic acids at the N-terminus.; GO: 0008855 exodeoxyribonuclease VII activity, 0006308 DNA catabolic process, 0009318 exodeoxyribonuclease VII complex. Probab=100.00 E-value=0 Score=563.32 Aligned_cols=434 Identities=34% Similarity=0.594 Sum_probs=395.6 Q ss_pred CHHHHHHHHHHHHHHC--CCCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEE Q ss_conf 2999999999997400--17189999970543568886279998748947999997352105866814598899999966 Q gi|254780791|r 16 SVSELSYHLKHIVESN--LSHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKIT 93 (529) Q Consensus 16 svs~l~~~i~~~l~~~--~~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~ 93 (529) +|++++.+++..++.+ |..+|+.||++|++-.+.+|||||+|||+.+.+.|++|++...++.|.|++|++|++.|.++ T Consensus 1 ~~~~~~~~~~~~~~~~p~~~~~w~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 80 (441) T TIGR00237 1 TVSELNAYIKALLEGDPHFLQVWLKGELSNLTIQPVSGHWYFTLKDEKAQVRCVLFRGNNERLPFRPENGQEVLVRGGIS 80 (441) T ss_pred CCHHHHHHHHHHHHCCCCCEEEEEEECCCCEEEEEECCEEEEEECCCCHHHHHHHHHCCCCCCCCCCCCCCEEEEECCEE T ss_conf 90146799998750586300146510002115654045068875033103455544211223310224673566651604 Q ss_pred EECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHCCC-E Q ss_conf 752884379999971016800799999999997654012261001631026528999847842589999998630597-5 Q gi|254780791|r 94 TFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCRFP-L 172 (529) Q Consensus 94 ~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r~p-~ 172 (529) +|.|+|.|++++..++|.|.|.+...+++++.+|..+|+||.+.|+|+|.+|+.||||||++||+++|+++++.+||| + T Consensus 81 ~~~p~g~~~~~~~~~~p~~~g~~~~~~~~~~~~~~~~g~~~~~~~~~~p~~~~~~g~~~~~~g~~~~d~~~~~~~~~p~~ 160 (441) T TIGR00237 81 LYEPRGDYQLIAEELEPAGEGLLQLAYEELKEKLAAEGLFDEEYKKPLPHFPKAVGVITSPTGAALADVLHILKRRDPLL 160 (441) T ss_pred EEECCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCHHHHHCCCCCCCCHHEEEEECCCHHHHHHHHHHHHHCCCCC T ss_conf 64137761477630265420478999999998765302001231135765200112332453046899999875115542 Q ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCC Q ss_conf 89997210011110367999999997410035767775899951688844422007699999997489048852057775 Q gi|254780791|r 173 RVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETD 252 (529) Q Consensus 173 ~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D 252 (529) ++++||+.|||++++.+|+..+..++...+.. ..+|+++++|||||+||||+||+|.+++++|.+.+|++++||||+| T Consensus 161 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~--~~~d~~~~g~ggg~~~~~w~~~~~~~~~~~~~~~~~~~~~~g~~~d 238 (441) T TIGR00237 161 KVVILPTLVQGEGAPGSIVESIELANSFHDEK--NECDVLIVGRGGGSLEDLWSFNDEKVARALFLSKLPIISAVGHETD 238 (441) T ss_pred EEEEEHHHHCCCCHHHHHHHHHHHHHHHHHHH--HCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 00121001225540689999888642100011--1000557605786324554022467777777531002210144210 Q ss_pred CHHHHHHHCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC--CHHHHHHHHHHHHHH Q ss_conf 258988641237772145676332346777669999988877899888999985567765304--978898678887888 Q gi|254780791|r 253 WTLADYAADLRAPTPTGAAEMAVPVKEHLQSSLINLEARLNNIIIRLIKYKINTLNSLLKALP--NSDQTLSCSRYRLDR 330 (529) Q Consensus 253 ~Tl~D~VAD~Ra~TPTaAAElavp~~~EL~~~L~~l~~RL~~a~~~~l~~~~q~L~~l~r~L~--~~~~~L~~~~QrLd~ 330 (529) +|++|||+|+|++||++|+|++.|...++..++.....++..++...+.....++..+..++. +|...+......++. T Consensus 239 ~~~~d~~~d~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (441) T TIGR00237 239 FTLSDFVADLRAPTPSGAAELLLPNSDELLERLDGAEVRLDRAFDTLLGQKKGRLEQLAASLKRLHPENKLALKELKLEK 318 (441) T ss_pred HHHHHHHHHHHCCCCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 04666666530357420033321444789987523567888899988644467899999998752056778888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 65456899999987568899999841011346889999999999999999988765556788887767777788750268 Q gi|254780791|r 331 LPRELEHNLEIIIFRKYRHFNHIINRIKSDFLIHNIQKNRQHILEKQQHIEQIVKQHLRYIFLEKKEKIAILHMLREQTK 410 (529) Q Consensus 331 ~~~rL~~~l~~~l~~~~~~l~~~~~rL~~~~~~~~l~~~r~~L~~~q~~L~q~~~~~l~~~~~~~~~~~~~l~~l~~~~~ 410 (529) +..++...+...+............++... .+. T Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------------------------------~~~ 351 (441) T TIGR00237 319 LEKRLQAALNKLLERTGQKLTRLTKRLEQL-----------------------------------------------NPS 351 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------------------------------------------CCC T ss_conf 999999999888765556788888887640-----------------------------------------------653 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCCEEEEE-CCCCCEECCHHHCCCCCEEEEE Q ss_conf 8-9999999999999999999999999999999999998871696777314619998-4898895777892999869999 Q gi|254780791|r 411 N-RIFYLHTHIKKLITRIEFILSHKIKSCHTSVSITTRILQSFAYKNTLKRGYTSIQ-DTNNNFITQKRNLATKTRILIN 488 (529) Q Consensus 411 ~-~l~~~~~~l~~l~~rL~~~~~~~L~~~~~rL~~l~~~L~slsP~~~L~RGYaiv~-~~~GkiI~s~~~l~~gd~i~i~ 488 (529) . ++...+.+.+.+..++...+...+.....++..+...+..++|..+|+|||++.. +++|+.+.++.++..+|.++++ T Consensus 352 p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (441) T TIGR00237 352 PYKLLRAQTRLEELNNRLNALLNAQLNLLLPQFGTLLGKLNALSPLEVLARGYSLALSNPEGKALKSVKQVDEGDRLTTK 431 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHCCCCCCHHHHHHHHHHHHHHHHHEE T ss_conf 14566776678888889988877777665677788775421001456763132000035310356666643111221001 Q ss_pred EECEEEEEEE Q ss_conf 9110999999 Q gi|254780791|r 489 FFDGQANAIV 498 (529) Q Consensus 489 l~DG~v~a~V 498 (529) +.||.+.+.| T Consensus 432 ~~dg~~~~~~ 441 (441) T TIGR00237 432 LKDGWLTSEV 441 (441) T ss_pred ECCCEEEECC T ss_conf 0054112049 No 5 >cd04489 ExoVII_LU_OBF ExoVII_LU_OBF: A subfamily of OB folds corresponding to the N-terminal OB-fold domain of Escherichia coli exodeoxyribonuclease VII (ExoVII) large subunit. E. coli ExoVII is composed of two non-identical subunits. E. coli ExoVII is a single-strand-specific exonuclease which degrades ssDNA from both 3-prime and 5-prime ends. ExoVII plays a role in methyl-directed mismatch repair in vivo. ExoVII may also guard the genome from mutagenesis by removing excess ssDNA, since the build up of ssDNA would lead to SOS induction and PolIV-dependent mutagenesis. Probab=99.79 E-value=8.1e-19 Score=188.15 Aligned_cols=78 Identities=41% Similarity=0.776 Sum_probs=75.3 Q ss_pred CEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEECC Q ss_conf 1899999705435688862799987489479999973521058668145988999999667528843799999710168 Q gi|254780791|r 34 HVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLIPSG 112 (529) Q Consensus 34 ~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~g 112 (529) ++||+|||||++ .++|||+||+|||++|+|+|+||++.+.++++.|++||+|++.|++++|+|+|+|||+|++|+|+| T Consensus 1 ~vwV~GEIs~~~-~~~sGh~Yf~LkD~~a~i~~v~~~~~~~~~~~~~~~G~~V~v~g~~~~y~~~G~~ql~v~~i~~~G 78 (78) T cd04489 1 RVWVEGEISNLK-RPSSGHLYFTLKDEDASIRCVMWRSNARRLGFPLEEGMEVLVRGKVSFYEPRGGYQLIVEEIEPAG 78 (78) T ss_pred CEEEEEEECCCE-ECCCCEEEEEEECCCEEEEEEEEHHHHHHCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEECC T ss_conf 989999988558-289963999994788199999966788327999889989999999999989840999999999782 No 6 >pfam01336 tRNA_anti OB-fold nucleic acid binding domain. This family contains OB-fold domains that bind to nucleic acids. The family includes the anti-codon binding domain of lysyl, aspartyl, and asparaginyl -tRNA synthetases (See pfam00152). Aminoacyl-tRNA synthetases catalyse the addition of an amino acid to the appropriate tRNA molecule EC:6.1.1.-. This family also includes part of RecG helicase involved in DNA repair. Replication factor A is a heterotrimeric complex, that contains a subunit in this family. This domain is also found at the C-terminus of bacterial DNA polymerase III alpha chain. Probab=97.91 E-value=0.0001 Score=62.25 Aligned_cols=74 Identities=23% Similarity=0.405 Sum_probs=62.8 Q ss_pred EEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEE Q ss_conf 8999997054356888627999874894799999735210586681459889999996675288437999997101 Q gi|254780791|r 35 VCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLIP 110 (529) Q Consensus 35 ~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~ 110 (529) ++|.|+|++.+ ...+++++|+|.|..+.|.|++|..........++.|+.|.+.|++..| +.+.+++.+..+.+ T Consensus 1 v~v~G~V~~~~-~~~~~~~~~~l~D~tg~i~~~~~~~~~~~~~~~l~~g~~v~v~G~v~~~-~~~~~~i~~~~i~~ 74 (75) T pfam01336 1 VTVAGRVTSVR-RSGGKVAFLTLRDGTGSIQVVLFKEEAEKLAKKLKEGDVVLVTGKVKKR-PGGELELVVEEIEV 74 (75) T ss_pred CEEEEEEEEEE-ECCCCEEEEEEEECCEEEEEEEEHHHHHHHHHCCCCCCEEEEEEEEEEC-CCCCEEEEEEEEEE T ss_conf 99999999518-8799989999997993999999523868886328989899999999997-49989999989997 No 7 >COG1570 XseA Exonuclease VII, large subunit [DNA replication, recombination, and repair] Probab=97.04 E-value=0.061 Score=37.47 Aligned_cols=340 Identities=18% Similarity=0.204 Sum_probs=176.0 Q ss_pred CCCCCCCCC-EEEEEEEEEEE--CCCCEEEEEEEEEEECCCCHHHH-HHHHHHHHHHHHHCCCCCCCCCCCCCCC---EE Q ss_conf 866814598-89999996675--28843799999710168007999-9999999765401226100163102652---89 Q gi|254780791|r 76 IEFLPEEGI-EFLVIGKITTF--PGSSKYQIIIESLIPSGSGTLLT-ALEKRKKKLLEEGLFSDQHKNPIPFIPK---II 148 (529) Q Consensus 76 ~~~~~~~G~-~v~~~g~~~~y--~~~g~~ql~v~~i~~~g~G~l~~-~~e~lk~~L~~eGlfd~~~k~~lP~~p~---~i 148 (529) ++-.+++.+ .|.|.|.||=| ++.|-.-|.+.+=...=-..|+. ....++-+ -+||..=.-+- .+-.||. -= T Consensus 15 ik~llE~~~~~V~v~GEISn~t~~~sgH~YFtLKD~~A~i~c~mf~~~~~~l~f~-p~eG~~V~v~G-~is~Y~~rG~YQ 92 (440) T COG1570 15 IKRLLERDLGQVWVRGEISNFTRPASGHLYFTLKDERAQIRCVMFKGNNRRLKFR-PEEGMQVLVRG-KISLYEPRGDYQ 92 (440) T ss_pred HHHHHHHCCCEEEEEEEECCCCCCCCCCEEEEECCCCCEEEEEEECCCCCCCCCC-CCCCCEEEEEE-EEEEECCCCCEE T ss_conf 9999984078599999963776688860799971677168999974700026878-66798899997-788876988569 Q ss_pred EEEECCCHHHHHHHHHHHH---H-----------C------CCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9984784258999999863---0-----------5------975899972100111103679999999974100357677 Q gi|254780791|r 149 AVITSPTGAVIRDILQRIS---C-----------R------FPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPR 208 (529) Q Consensus 149 ~vits~~~a~~~D~~~~~~---~-----------r------~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~ 208 (529) -++++-.-||+++..-.+. . | ||-.|=+.-++ .-.|..+| |..+.++. +. T Consensus 93 i~~~~~~p~G~G~L~~~~E~lK~kL~aEGlFd~~~KkpLP~~p~~IGVITS~--tgAairDI---l~~~~rR~-----P~ 162 (440) T COG1570 93 IVAESMEPAGLGALYLAFEQLKAKLAAEGLFDPERKKPLPFFPKKIGVITSP--TGAALRDI---LHTLSRRF-----PS 162 (440) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHCCCCCCCCCEEEEECCC--CHHHHHHH---HHHHHHHC-----CC T ss_conf 9983278287258999999999999867786832269999998868998187--05899999---99987538-----88 Q ss_pred CCEEEE---ECCCCCHHHHHHCCHHHHHHHHHHCC-----CEEEEEECC---C-----CCCHHHHHHHCCCCCCCHHHHH Q ss_conf 758999---51688844422007699999997489-----048852057---7-----7525898864123777214567 Q gi|254780791|r 209 PDIIIL---ARGGGSIEDLWHFNDEMIVRAIANSS-----IPIISAIGH---E-----TDWTLADYAADLRAPTPTGAAE 272 (529) Q Consensus 209 ~D~iii---~RGGGS~eDL~~FN~e~laraI~~~~-----iPVisgIGH---E-----~D~Tl~D~VAD~Ra~TPTaAAE 272 (529) +.++|. +-|-|+.+ ++++||-... =-+|.|=|- | -|..++.-++..+.|-=||.- T Consensus 163 ~~viv~pt~VQG~~A~~--------eIv~aI~~an~~~~~DvlIVaRGGGSiEDLW~FNdE~vaRAi~~s~iPvISAVG- 233 (440) T COG1570 163 VEVIVYPTLVQGEGAAE--------EIVEAIERANQRGDVDVLIVARGGGSIEDLWAFNDEIVARAIAASRIPVISAVG- 233 (440) T ss_pred CEEEEEECCCCCCCCHH--------HHHHHHHHHHCCCCCCEEEEECCCCHHHHHHCCCHHHHHHHHHHCCCCEEEECC- T ss_conf 72999821441787099--------999999986345899889991686348888615809999999807998584066- Q ss_pred HHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 63323467776699999888778998889999855677653049788986788878886545689999998756889999 Q gi|254780791|r 273 MAVPVKEHLQSSLINLEARLNNIIIRLIKYKINTLNSLLKALPNSDQTLSCSRYRLDRLPRELEHNLEIIIFRKYRHFNH 352 (529) Q Consensus 273 lavp~~~EL~~~L~~l~~RL~~a~~~~l~~~~q~L~~l~r~L~~~~~~L~~~~QrLd~~~~rL~~~l~~~l~~~~~~l~~ 352 (529) .|..-.|.++-+-+....-. ..--+--|. ..+..++++....+|.+.+...+..+++.+.. T Consensus 234 ------HEtD~tL~DfVAD~RApTPT-----------aAAE~~vP~--~~el~~~l~~~~~rL~~~~~~~l~~~~~~l~~ 294 (440) T COG1570 234 ------HETDFTLADFVADLRAPTPT-----------AAAELVVPD--SAELLQQLDQLQRRLHRALRRLLDQKKQRLEH 294 (440) T ss_pred ------CCCCCCHHHHHHHCCCCCCH-----------HHHHHHCCC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf ------67775287764313479966-----------878885878--99999999999999999999999999999999 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 98410113468899999999999999999887655567888877677777887502688999999999999999999999 Q gi|254780791|r 353 IINRIKSDFLIHNIQKNRQHILEKQQHIEQIVKQHLRYIFLEKKEKIAILHMLREQTKNRIFYLHTHIKKLITRIEFILS 432 (529) Q Consensus 353 ~~~rL~~~~~~~~l~~~r~~L~~~q~~L~q~~~~~l~~~~~~~~~~~~~l~~l~~~~~~~l~~~~~~l~~l~~rL~~~~~ 432 (529) +..++....|...+...++++++...+|.+.+...+......... +..++... +.....++...-. T Consensus 295 l~~~l~~~~p~~~l~~~~q~ld~~~~rL~~~l~~~~~~~~~~~~~-----------l~~rl~~~---~~~~~~~~~~~~~ 360 (440) T COG1570 295 LARRLQFRSPERLLSEQQQRLDELAIRLRRALENQLALKKQRLER-----------LTQRLNPQ---IQRQQQRLQQLER 360 (440) T ss_pred HHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------HHHHHHHH---HHHHHHHHHHHHH T ss_conf 998776449788998999999999999999999999998888877-----------77777788---9888888888998 Q ss_pred HHHHHHHHHHHHHHHHHHHCCHHHHHCCCEEEEECCCCC Q ss_conf 999999999999999887169677731461999848988 Q gi|254780791|r 433 HKIKSCHTSVSITTRILQSFAYKNTLKRGYTSIQDTNNN 471 (529) Q Consensus 433 ~~L~~~~~rL~~l~~~L~slsP~~~L~RGYaiv~~~~Gk 471 (529) ..-...+..|.....+++.+. .....=++.-|. +.|= T Consensus 361 ~l~~~~~~~l~~~~~~l~~l~-~~L~~Lsp~~~L-~RGy 397 (440) T COG1570 361 RLDKALRRQLKRKRERLEALV-EQLESLSPLATL-ARGY 397 (440) T ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHCCCHHHHH-HCCE T ss_conf 888999999999999999999-999717914566-4744 No 8 >PRK00286 xseA exodeoxyribonuclease VII large subunit; Reviewed Probab=96.98 E-value=0.068 Score=37.06 Aligned_cols=224 Identities=18% Similarity=0.233 Sum_probs=133.6 Q ss_pred CCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEE---EECCCCCHHHHHHCCHHHHHHHHHHCC---CE--EEEEECC--C- Q ss_conf 11110367999999997410035767775899---951688844422007699999997489---04--8852057--7- Q gi|254780791|r 182 QGDECPKEIANAILQLNTLKEGRTCPRPDIII---LARGGGSIEDLWHFNDEMIVRAIANSS---IP--IISAIGH--E- 250 (529) Q Consensus 182 QG~~a~~~i~~ai~~~~~~~~~~~~~~~D~ii---i~RGGGS~eDL~~FN~e~laraI~~~~---iP--VisgIGH--E- 250 (529) ..-.|-.+|++-+.. +. +.+.+++ .+=|=|+.. ++++||..+. -| ||-|=|= - T Consensus 144 ~tgAa~~Di~~~~~~---R~-----p~~~i~l~p~~VQG~~A~~--------~I~~ai~~~~~~~~~DvIIi~RGGGS~e 207 (443) T PRK00286 144 PTGAAIRDILTVLSR---RF-----PSVEVIIYPTLVQGEGAAA--------SIVEAIERANARGEVDVLIVARGGGSLE 207 (443) T ss_pred CCHHHHHHHHHHHHC---CC-----CCEEEEEEECCCCHHHHHH--------HHHHHHHHHHHHCCCCEEEEECCCCCHH T ss_conf 843899999999850---49-----9659999814562654799--------9999999985224888899936878888 Q ss_pred -----CCCHHHHHHHCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf -----752589886412377721456763323467776699999888778998889999855677653049788986788 Q gi|254780791|r 251 -----TDWTLADYAADLRAPTPTGAAEMAVPVKEHLQSSLINLEARLNNIIIRLIKYKINTLNSLLKALPNSDQTLSCSR 325 (529) Q Consensus 251 -----~D~Tl~D~VAD~Ra~TPTaAAElavp~~~EL~~~L~~l~~RL~~a~~~~l~~~~q~L~~l~r~L~~~~~~L~~~~ 325 (529) -|..|+.-|+...-|.=||.- .|.-..|.++-+.+....-.... . + .-+-..+.. T Consensus 208 DL~~FNdE~varaI~~s~iPVISaIG-------HE~D~Ti~D~VAD~Ra~TPTaAA---------e--~--~~p~~~e~~ 267 (443) T PRK00286 208 DLWAFNDEAVARAIAASKIPVISAVG-------HETDFTIADFVADLRAPTPTAAA---------E--L--VVPDQAELL 267 (443) T ss_pred HHHHCCCHHHHHHHHHCCCCEEECCC-------CCCCCCHHHHHHHCCCCCHHHHH---------H--H--HCCCHHHHH T ss_conf 97651879999999848997895146-------67775588886540379969999---------9--8--653199999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 87888654568999999875688999998410113468899999999999999999887655567888877677777887 Q gi|254780791|r 326 YRLDRLPRELEHNLEIIIFRKYRHFNHIINRIKSDFLIHNIQKNRQHILEKQQHIEQIVKQHLRYIFLEKKEKIAILHML 405 (529) Q Consensus 326 QrLd~~~~rL~~~l~~~l~~~~~~l~~~~~rL~~~~~~~~l~~~r~~L~~~q~~L~q~~~~~l~~~~~~~~~~~~~l~~l 405 (529) ++++....+|.+.+...+...++.++.+..++....|...+....++++....++...+...+....... T Consensus 268 ~~l~~~~~~l~~~~~~~l~~~~~~L~~l~~~l~~~~p~~~l~~~~~~l~~~~~~L~~~~~~~l~~~~~~l---------- 337 (443) T PRK00286 268 QRLQQLRQRLARALRRYLKQKEQRLDQLAHRLKFQSPQRRLDQQQQRLDELRQRLRRALRRQLRQARQRL---------- 337 (443) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------- T ss_conf 9999999999999999999999999999965423699999999999999999999999999999999999---------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 50268899999999999999999999999999999999999988716 Q gi|254780791|r 406 REQTKNRIFYLHTHIKKLITRIEFILSHKIKSCHTSVSITTRILQSF 452 (529) Q Consensus 406 ~~~~~~~l~~~~~~l~~l~~rL~~~~~~~L~~~~~rL~~l~~~L~sl 452 (529) .....++......+.....++.....+.....+..++.....|+.+ T Consensus 338 -~~l~~~L~~~~~~i~~~~~~l~~~~~~l~~~~~~~l~~~~~~l~~~ 383 (443) T PRK00286 338 -ERLSQRLQNPQPRIARAQQRLEQLANRLLQAMQRRLKRKRQRLEAL 383 (443) T ss_pred -HHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf -9999875386778999999999999999999999999999999999 No 9 >TIGR03007 pepcterm_ChnLen polysaccharide chain length determinant protein, PEP-CTERM locus subfamily; InterPro: IPR014345 Members of this entry belong to the family of polysaccharide chain length determinant proteins. They are found in species that encode the PEP-CTERM/exosortase system and are predicted to act in protein sorting in a number of Gram-negative bacteria, they are also found near the epsH homologue that is the putative exosortase gene.. Probab=96.97 E-value=0.069 Score=36.99 Aligned_cols=224 Identities=14% Similarity=0.148 Sum_probs=110.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH---------------------- Q ss_conf 7766999998887789988899998556776530497889867888788865456899---------------------- Q gi|254780791|r 281 LQSSLINLEARLNNIIIRLIKYKINTLNSLLKALPNSDQTLSCSRYRLDRLPRELEHN---------------------- 338 (529) Q Consensus 281 L~~~L~~l~~RL~~a~~~~l~~~~q~L~~l~r~L~~~~~~L~~~~QrLd~~~~rL~~~---------------------- 338 (529) +.+||..++.||..+..+.-+.++.+...+...=..--..|...+++|+...-++... T Consensus 166 i~~Qik~Y~~kL~~aE~~L~~FK~~~~~~lp~~~~~y~~~l~~~~~~L~~~~l~~~e~~~~Rd~l~rQL~g~~Pv~~~~~ 245 (510) T TIGR03007 166 IDEQIKEYEKKLEAAENRLKAFKQENGGILPDQEGDYYSELSEAQEELEAARLELNEAKAQRDALKRQLAGEEPVLLAGS 245 (510) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEECCC T ss_conf 99999999999999999999876873453779737899999999999999999999888767887514588661562368 Q ss_pred ------------HHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH Q ss_conf ------------99998756889999984101134688999999999999999998876555-67888877677777887 Q gi|254780791|r 339 ------------LEIIIFRKYRHFNHIINRIKSDFLIHNIQKNRQHILEKQQHIEQIVKQHL-RYIFLEKKEKIAILHML 405 (529) Q Consensus 339 ------------l~~~l~~~~~~l~~~~~rL~~~~~~~~l~~~r~~L~~~q~~L~q~~~~~l-~~~~~~~~~~~~~l~~l 405 (529) ++..+...+..|+.+.-+....+|. +...+..+..++++......... +................ T Consensus 246 g~~~~~s~~~~~~~~Ri~~L~~~Ld~L~l~YTd~HPd--v~~~k~~Ia~L~~~~~~e~~~~~~Q~~~~~~~~~~~~~nPv 323 (510) T TIGR03007 246 GEKSASSVANSELDSRIEALEKQLDALRLRYTDKHPD--VIATKREIAQLEEQKEEELESYRKQNGEGGGSEKGSIYNPV 323 (510) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHH--HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCH T ss_conf 8866531235625689999999999863363011558--99999999999999889998898730345555566789831 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCCEEEEECCCCCEECCHHHCCCCCEE Q ss_conf 50268899999999999999999999999999999999999988716967773146199984898895777892999869 Q gi|254780791|r 406 REQTKNRIFYLHTHIKKLITRIEFILSHKIKSCHTSVSITTRILQSFAYKNTLKRGYTSIQDTNNNFITQKRNLATKTRI 485 (529) Q Consensus 406 ~~~~~~~l~~~~~~l~~l~~rL~~~~~~~L~~~~~rL~~l~~~L~slsP~~~L~RGYaiv~~~~GkiI~s~~~l~~gd~i 485 (529) -|+++..+......+..+..|... ....+++.+..++.+-+-=.-+ .-|.|+|-+.+....++++.-.+.+....+ T Consensus 324 yQql~~~l~~~eA~~asl~~r~~~-~~~~~~~l~~~~~~iP~veaEl---~~L~rdy~~~k~~Ye~Ll~R~e~A~~S~~~ 399 (510) T TIGR03007 324 YQQLQIELAEAEAEIASLEARVAE-LTARIERLESLLRTIPEVEAEL---TQLNRDYEVNKSNYEQLLARRESAEVSKQM 399 (510) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHE T ss_conf 377899999999999999999999-9999999999975327999999---999988755588899999889855553303 Q ss_pred EEEEECEEEEEEEEECCCCCCCCCCCC Q ss_conf 999911099999950578887677777 Q gi|254780791|r 486 LINFFDGQANAIVINKAPPKEQPTKKC 512 (529) Q Consensus 486 ~i~l~DG~v~a~V~~k~~~~~~~~~~~ 512 (529) +.+ |+.+.=+|+|.-..+..|..+. T Consensus 400 e~~--~~~v~FRviDPP~~P~~PsgP~ 424 (510) T TIGR03007 400 EVQ--DKAVSFRVIDPPIVPSKPSGPN 424 (510) T ss_pred EEE--CCCEEEEEECCCCCCCCCCCCC T ss_conf 320--6702467756756789999878 No 10 >PRK13480 3'-5' exoribonuclease YhaM; Provisional Probab=96.76 E-value=0.013 Score=43.53 Aligned_cols=87 Identities=15% Similarity=0.271 Sum_probs=62.0 Q ss_pred HHHHHHHHC-CCCEEEEEEECCCCCCCCCCEEE--EEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCC Q ss_conf 999997400-17189999970543568886279--998748947999997352105866814598899999966752884 Q gi|254780791|r 23 HLKHIVESN-LSHVCVRGEISGYRGIHSSGHAY--FSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPGSS 99 (529) Q Consensus 23 ~i~~~l~~~-~~~~~v~gEis~~~~~~~sGH~Y--f~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g 99 (529) .|+++-+++ +..+.+.-+..- + ..++|--| |+|.|..+.|+|.||... ....-..+.|+-|.|.|.+..| +| T Consensus 4 ~i~~l~~G~~v~~~~lv~~~~~-~-~tknG~~Yl~l~L~D~tG~I~ak~W~~~-~~~~~~~~~g~~V~v~G~v~~y--~g 78 (314) T PRK13480 4 GIENLEVGESVDHFLLIKSATK-G-VASNGKPFLTLILQDKSGDIEAKLWDVS-KEDEATYVPETIVHVKGDIINY--RG 78 (314) T ss_pred CHHHCCCCCEEEEEEEEEECHH-H-CCCCCCCEEEEEEECCCCEEEEEECCCC-HHHHHHCCCCCEEEEEEEEEEE--CC T ss_conf 1410899988889999987143-3-1899981479999837876889967898-6668637899889999999876--89 Q ss_pred EEEEEEEEEEECCCC Q ss_conf 379999971016800 Q gi|254780791|r 100 KYQIIIESLIPSGSG 114 (529) Q Consensus 100 ~~ql~v~~i~~~g~G 114 (529) +.|+.|..|+++..+ T Consensus 79 ~~Ql~i~~i~~~~~~ 93 (314) T PRK13480 79 RKQLKVNQIRLATPE 93 (314) T ss_pred CCCEEEEEEEECCCC T ss_conf 862588885767866 No 11 >cd04492 YhaM_OBF_like YhaM_OBF_like: A subfamily of OB folds similar to that found in Bacillus subtilis YhaM and Staphylococcus aureus cmp-binding factor-1 (SaCBF1). Both these proteins are 3'-to-5'exoribonucleases. YhaM requires Mn2+ or Co2+ for activity and is inactive in the presence of Mg2+. YhaM also has a Mn2+ dependent 3'-to-5'single-stranded DNA exonuclease activity. SaCBF is also a double-stranded DNA binding protein, binding specifically to cmp, the replication enhancer found in S. aureus plasmid pT181. Proteins in this group combine an N-terminal OB fold with a C-terminal HD domain. The HD domain is found in metal-dependent phosphohydrolases. Probab=96.74 E-value=0.023 Score=41.22 Aligned_cols=75 Identities=19% Similarity=0.312 Sum_probs=53.7 Q ss_pred EEEEEEECCCCCCCCCCEEE--EEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEECC Q ss_conf 89999970543568886279--9987489479999973521058668145988999999667528843799999710168 Q gi|254780791|r 35 VCVRGEISGYRGIHSSGHAY--FSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLIPSG 112 (529) Q Consensus 35 ~~v~gEis~~~~~~~sGH~Y--f~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~g 112 (529) +++.-+.+- + ..++|--| |+|.|..+.|.|.||...-.. .-..+.|.=|.|.|.+..| +|++||.|.+|.+.. T Consensus 3 ~fli~~~~~-~-~tk~G~~Yl~l~L~D~tG~I~ak~W~~~~~~-~~~~~~g~~V~v~G~v~~y--~g~~Ql~I~~i~~~~ 77 (83) T cd04492 3 FFLIKSKEL-R-TAKNGKPYLALTLQDKTGEIEAKLWDASEED-EEKFKPGDIVHVKGRVEEY--RGRLQLKIQRIRLVT 77 (83) T ss_pred EEEEEEHHH-H-CCCCCCCEEEEEEECCCCEEEEEECCCCHHH-HHCCCCCCEEEEEEEEEEE--CCCEEEEEEEEEECC T ss_conf 999998010-0-0579991789999878876888885788678-8307899999999999888--995108998968998 Q ss_pred CC Q ss_conf 00 Q gi|254780791|r 113 SG 114 (529) Q Consensus 113 ~G 114 (529) .. T Consensus 78 ~~ 79 (83) T cd04492 78 EE 79 (83) T ss_pred CC T ss_conf 34 No 12 >cd04482 RPA2_OBF_like RPA2_OBF_like: A subgroup of uncharacterized archaeal OB folds with similarity to the OB fold of the central ssDNA-binding domain (DBD)-D of human RPA2 (also called RPA32). RPA2 is a subunit of Replication protein A (RPA). RPA is a nuclear ssDNA-binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). The major DNA binding activity of RPA is associated with RPA1 DBD-A and DBD-B; RPA2 DBD-D is a weak ssDNA-binding domain. RPA2 DBD-D is also involved in trimerization. The ssDNA binding mechanism is believed to be multistep and to involve conformational change. N-terminal to human RPA2 DBD-D is a domain containing all the known phosphorylation sites of RPA. Human RPA2 is phosphorylated in a cell cycle depende Probab=96.60 E-value=0.013 Score=43.49 Aligned_cols=72 Identities=22% Similarity=0.330 Sum_probs=56.5 Q ss_pred EEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCC--CCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEEC Q ss_conf 89999970543568886279998748947999997352--105866814598899999966752884379999971016 Q gi|254780791|r 35 VCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGT--LNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLIPS 111 (529) Q Consensus 35 ~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~--~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~ 111 (529) +.+.|+|+..-..-..||+.|++.|+...+.|+.|+-+ +..+-..+.-||.|.+.|.++ +.+ .|.++.|..- T Consensus 1 y~~~G~V~~~P~~i~GGHv~~~v~~~~~~i~~~afepTg~~r~~~r~L~~GD~I~v~G~v~---~~~--~lnlEkl~v~ 74 (91) T cd04482 1 YRVTGKVVEEPRTIEGGHVFFKISDGTGEIDCAAYEPTKEFRDVVRLLIPGDEVTVYGSVR---PGT--TLNLEKLRVI 74 (91) T ss_pred CEEEEEEECCEEEECCCEEEEEEEECCCEEEEEEECCCCHHHHHHHHCCCCCEEEEEEEEC---CCC--EEEEEEEEEE T ss_conf 9899999786489528749999993998899999987750889997389999999999987---898--0889999963 No 13 >PRK05673 dnaE DNA polymerase III subunit alpha; Validated Probab=96.47 E-value=0.12 Score=34.96 Aligned_cols=17 Identities=12% Similarity=0.192 Sum_probs=8.3 Q ss_pred EEEEEEEEEECCCCHHH Q ss_conf 79999971016800799 Q gi|254780791|r 101 YQIIIESLIPSGSGTLL 117 (529) Q Consensus 101 ~ql~v~~i~~~g~G~l~ 117 (529) |.|++-.-...|-..|. T Consensus 85 ~hLvlLAkN~~GY~NL~ 101 (1143) T PRK05673 85 THLTLLAKNETGYRNLM 101 (1143) T ss_pred CEEEEEECCHHHHHHHH T ss_conf 50899958999999999 No 14 >cd03524 RPA2_OBF_family RPA2_OBF_family: A family of oligonucleotide binding (OB) folds with similarity to the OB fold of the single strand (ss) DNA-binding domain (DBD)-D of human RPA2 (also called RPA32). RPA2 is a subunit of Replication protein A (RPA). RPA is a nuclear ssDNA-binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). RPA contains six OB folds, which are involved in ssDNA binding and in trimerization. The ssDNA binding mechanism is believed to be multistep and to involve conformational change. This family also includes OB folds similar to those found in Escherichia coli SSB, the wedge domain of E. coli RecG (a branched-DNA-specific helicase), E. coli ssDNA specific exodeoxyribonuclease VII large subunit, Pyroco Probab=96.38 E-value=0.023 Score=41.25 Aligned_cols=72 Identities=24% Similarity=0.393 Sum_probs=58.8 Q ss_pred EEEEECCCCCCCC-CCEEEEEEECCC-CEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEE Q ss_conf 9999705435688-862799987489-4799999735210586681459889999996675288437999997101 Q gi|254780791|r 37 VRGEISGYRGIHS-SGHAYFSLKDNH-SRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLIP 110 (529) Q Consensus 37 v~gEis~~~~~~~-sGH~Yf~lkd~~-a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~ 110 (529) +.|-|..+..... ...+-|+|.|.. ..+.|++|+..+......++.|+-|.+.|+++.|. |.+++.+.++.+ T Consensus 2 i~g~V~~~~~~~~~~~~~~~~l~D~t~~~i~v~~~~~~~~~~~~~~~~g~~v~i~g~v~~~~--~~~~l~~~~~~~ 75 (75) T cd03524 2 IVGIVVAVEEIRTEGKVLIFTLTDGTGGTIRVTLFGELAEELENLLKEGQVVYIKGKVKKFR--GRLQLIVESIEL 75 (75) T ss_pred EEEEEEEEEEEECCCEEEEEEEECCCCCEEEEEEEEEHHHHHHHHHHCCCEEEEEEEEEECC--CEEEEEEEEECC T ss_conf 78999998420038649999997699979999999882576877874782999980365236--648999985219 No 15 >PRK06326 consensus Probab=96.30 E-value=0.15 Score=34.03 Aligned_cols=51 Identities=16% Similarity=0.210 Sum_probs=26.2 Q ss_pred EEEECCCCCHHHHHHCCH------------------HHHHHHHHHCCCEEEEEE----CCCCCCHHHHHHHCC Q ss_conf 999516888444220076------------------999999974890488520----577752589886412 Q gi|254780791|r 212 IILARGGGSIEDLWHFND------------------EMIVRAIANSSIPIISAI----GHETDWTLADYAADL 262 (529) Q Consensus 212 iii~RGGGS~eDL~~FN~------------------e~laraI~~~~iPVisgI----GHE~D~Tl~D~VAD~ 262 (529) +=|-|-|.+.|+++.|-+ +.+.+.--...+|+|..= -|.-|.-+-|-.-.. T Consensus 171 lEiq~h~~~~e~~~~~~~~~l~~~~~~~i~~q~~vn~~l~~lA~~~giplVATnDvhY~~~eD~~~hdvL~cI 243 (1240) T PRK06326 171 SEVQLHKMSEEKIALFEEEWLKQEYYQFIEKQIKVNEAVLATSKRLGIPSVATNDIHYINPDDWLAHEILLNV 243 (1240) T ss_pred EEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHH T ss_conf 8521068836654222455443333210267789999999999981989899588600786258999999998 No 16 >PRK07892 consensus Probab=96.21 E-value=0.025 Score=40.95 Aligned_cols=20 Identities=25% Similarity=0.454 Sum_probs=9.0 Q ss_pred CCCCCCHHHHHHHHHHHHHH Q ss_conf 89862299999999999740 Q gi|254780791|r 11 DHPEYSVSELSYHLKHIVES 30 (529) Q Consensus 11 ~~~~~svs~l~~~i~~~l~~ 30 (529) .+..||..+=+-.++.+++. T Consensus 12 vhS~YSlLdGa~~ieeLV~~ 31 (1180) T PRK07892 12 NHTEYSMLDGAAKIKPLFAE 31 (1180) T ss_pred CCCCCCCCCCCCCHHHHHHH T ss_conf 66637210506599999999 No 17 >COG1107 Archaea-specific RecJ-like exonuclease, contains DnaJ-type Zn finger domain [DNA replication, recombination, and repair] Probab=96.19 E-value=0.19 Score=33.04 Aligned_cols=221 Identities=19% Similarity=0.315 Sum_probs=129.5 Q ss_pred CCHHHHHHHHHHH----HHHCC-CCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCCC--CCCCCEEE Q ss_conf 2299999999999----74001-71899999705435688862799987489479999973521058668--14598899 Q gi|254780791|r 15 YSVSELSYHLKHI----VESNL-SHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEFL--PEEGIEFL 87 (529) Q Consensus 15 ~svs~l~~~i~~~----l~~~~-~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~~--~~~G~~v~ 87 (529) |-+-++...+.+. |+... ..|.|+|||...+ .+||---|||.|+.+.+.|+-|-. ..++-- .+-|+-|- T Consensus 191 Y~~~~~~ke~~r~~i~did~~ig~tV~I~GeV~qik--qT~GPTVFtltDetg~i~aAAFe~--aGvRAyP~ievGdiV~ 266 (715) T COG1107 191 YREVQVEKELPRTLIDDLDEMIGKTVRIEGEVTQIK--QTSGPTVFTLTDETGAIWAAAFEE--AGVRAYPEIEVGDIVE 266 (715) T ss_pred CHHHHHHHHCCCCCHHHHHHHCCCEEEEEEEEEEEE--ECCCCEEEEEECCCCCEEHHHHCC--CCCCCCCCCCCCCEEE T ss_conf 101112210453107777753486699977899999--758987999806888442666435--7743489987785699 Q ss_pred EEEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHH-HHHHHH Q ss_conf 9999667528843799999710168007999999999976540122610016310265289998478425899-999986 Q gi|254780791|r 88 VIGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIR-DILQRI 166 (529) Q Consensus 88 ~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~-D~~~~~ 166 (529) +.|.|+ .-.|++|+-+..|+.- .|+= -.+.|+++.++ +|. +-. | +....+|-|+.=.+++ ||+... T Consensus 267 ViG~V~--~r~g~lQiE~~~me~L-~G~e---a~eVr~rid~a--ld~--~Ae-P--~~~~~lvdse~lE~L~p~m~~vA 333 (715) T COG1107 267 VIGEVT--RRDGRLQIEIEAMEKL-TGDE---AAEVRKRIDEA--LDR--RAE-P--ADVGFLVDSEVLEALKPDMVDVA 333 (715) T ss_pred EEEEEE--ECCCCEEEEEHHHHHH-HCCH---HHHHHHHHHHH--HHH--CCC-C--CCCCCCCCHHHHHHHHHHHHHHH T ss_conf 987775--0478278762435775-1851---89999999999--862--038-7--53455567799987518899999 Q ss_pred HH--C--CC-EEEEE-EECCCCC-----------------------------------------CCHHHHHHHHHHHHHH Q ss_conf 30--5--97-58999-7210011-----------------------------------------1103679999999974 Q gi|254780791|r 167 SC--R--FP-LRVII-FPVKVQG-----------------------------------------DECPKEIANAILQLNT 199 (529) Q Consensus 167 ~~--r--~p-~~~~~-~p~~vQG-----------------------------------------~~a~~~i~~ai~~~~~ 199 (529) +. | +- -.|++ .|+-+-| +.+...|.-||+-+.+ T Consensus 334 k~irrAV~egRPIiiRHHaDaDG~~agvAlE~AilplI~d~~~d~DAeyh~~KRrPskAPfYeleDvtrDl~~aLED~~R 413 (715) T COG1107 334 KEIRRAVLEGRPIIIRHHADADGYCAGVALEKAILPLIEDVHPDEDAEYHLFKRRPSKAPFYELEDVTRDLNFALEDAHR 413 (715) T ss_pred HHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCEEHHHHHHHHHHHHHHHHH T ss_conf 99999873478369964567665200266899889999872798024567763386668730377666468899999996 Q ss_pred HCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHCCC Q ss_conf 1003576777589995168884442200769999999748904885205777525898864123 Q gi|254780791|r 200 LKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYAADLR 263 (529) Q Consensus 200 ~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~VAD~R 263 (529) ..+ +.-++|++--|+-.||--+.. ..-..-|+||+==+|-.|.-+.|---|.. T Consensus 414 hGq-----KlPL~VlvDnGsTeEDipA~~------~~k~Ygi~ivVVDHH~Pde~vvD~yvd~H 466 (715) T COG1107 414 HGQ-----KLPLLVLVDNGSTEEDIPAIK------QLKAYGIDIVVVDHHYPDEAVVDEYVDVH 466 (715) T ss_pred CCC-----CCCEEEEECCCCCCCCCHHHH------HHHHCCCCEEEECCCCCCCHHHHHHHHHC T ss_conf 087-----565699976898644627898------88754997899717889603334566631 No 18 >cd04487 RecJ_OBF2_like RecJ_OBF2_like: A subfamily of OB folds corresponding to the second OB fold (OBF2) of archaeal-specific proteins with similarity to eubacterial RecJ. RecJ is an ssDNA-specific exonuclease. Although the overall sequence similarity of these proteins to eubacterial RecJ proteins is marginal, they appear to carry motifs, which have been shown to be essential for nuclease function in Escherichia coli RecJ. In addition to this OB fold, most proteins in this subfamily contain: i) an N-terminal OB fold belonging to a different domain family (the ribosomal S1-like RNA-binding family); and ii) a domain, C-terminal to OBF2, characteristic of DHH family proteins. DHH family proteins include E. coli RecJ, and are predicted to have a phosphoesterase function. Probab=96.18 E-value=0.041 Score=38.99 Aligned_cols=70 Identities=27% Similarity=0.406 Sum_probs=57.0 Q ss_pred EEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCCCC--CCCCEEEEEEEEEEECCCCEEEEEEEEEEE Q ss_conf 8999997054356888627999874894799999735210586681--459889999996675288437999997101 Q gi|254780791|r 35 VCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEFLP--EEGIEFLVIGKITTFPGSSKYQIIIESLIP 110 (529) Q Consensus 35 ~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~~~--~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~ 110 (529) |.|+|||...+ ..||---|++.|+.+.+.|+-|-. ..++-.| +.||-|-+.|.++.+ .|.+|+-+.+|+. T Consensus 1 V~i~GeV~qik--QT~GPTIFti~Detg~v~~AAFee--aGvRAyPev~~gdiV~v~G~V~~r--~g~lQiE~~~~~~ 72 (73) T cd04487 1 VHIEGEVVQIK--QTSGPTIFTLRDETGTVWAAAFEE--AGVRAYPEVEVGDIVRVTGEVEPR--DGQLQIEVESLEV 72 (73) T ss_pred CEEEEEEEEEE--ECCCCEEEEEECCCCCEEHHHHCC--CCCCCCCCCCCCCEEEEEEEEEEE--CCEEEEEEEEEEC T ss_conf 97999999989--889999999975999698205130--674128887889889999899970--8858999730250 No 19 >PRK05672 dnaE2 error-prone DNA polymerase; Validated Probab=96.13 E-value=0.032 Score=40.03 Aligned_cols=18 Identities=17% Similarity=0.244 Sum_probs=7.9 Q ss_pred CEEEEEEEEEEECCCCHH Q ss_conf 437999997101680079 Q gi|254780791|r 99 SKYQIIIESLIPSGSGTL 116 (529) Q Consensus 99 g~~ql~v~~i~~~g~G~l 116 (529) +++.|++-.-...|--+| T Consensus 78 ~~~hLvLLAkN~~GY~NL 95 (1050) T PRK05672 78 GGPHLVVLARDREGYGRL 95 (1050) T ss_pred CCCCEEEEECCHHHHHHH T ss_conf 886689981898999999 No 20 >PRK10917 ATP-dependent DNA helicase RecG; Provisional Probab=96.10 E-value=0.19 Score=33.01 Aligned_cols=138 Identities=14% Similarity=0.169 Sum_probs=79.4 Q ss_pred CEEEEEEECCCCCCCCCC-EEEEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEECC Q ss_conf 189999970543568886-2799987489479999973521058668145988999999667528843799999710168 Q gi|254780791|r 34 HVCVRGEISGYRGIHSSG-HAYFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLIPSG 112 (529) Q Consensus 34 ~~~v~gEis~~~~~~~sG-H~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~g 112 (529) .+.|.|+|.......... -+-+.+.|+.+.|.|++|.+. ..+.-.++.|+.|+|.|+++.| +|++|+.--+++... T Consensus 60 ~vtI~g~V~~~~~~~~~r~~l~v~v~D~tG~I~lvfFn~~-pyl~k~l~~G~~v~VsGKV~~~--~~~lQm~hPe~e~~~ 136 (677) T PRK10917 60 KVTVEGTVLSAEVYFGPRRRLRVTLSDGTGVLTLVFFNNQ-PYLKKQLKVGKRVAVYGKVKRG--KYGLEIVHPEYEVLG 136 (677) T ss_pred EEEEEEEEEEEECCCCCCCEEEEEEEECCEEEEEEEECCC-HHHHHHCCCCCEEEEEEEEEEE--CCEEEEECCEEEECC T ss_conf 7999999999843778972499999989879999998984-7899638999989999999855--998999789898138 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHH Q ss_conf 00799999999997654012261001631026528999847842589999998630597589997210011110367999 Q gi|254780791|r 113 SGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIAN 192 (529) Q Consensus 113 ~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ 192 (529) .. ..-.+-+|-||.+=| -+...++-+++.+=+.++.--...|..+--+.--.+... T Consensus 137 ~~--------------------~~~~~I~PVYplT~G----Lsqk~irklI~~aL~~~~~l~E~LP~~ll~k~~L~s~~~ 192 (677) T PRK10917 137 EE--------------------TSIGRLTPVYPLTEG----LKQKTLRKLIKQALERLPALPELLPEELLEKYGLLSLAE 192 (677) T ss_pred CC--------------------CCCCCEEEECCCCCC----CCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHCCCCCHHH T ss_conf 75--------------------456854553507766----786999999999998743143339988896638998999 Q ss_pred HHHHHH Q ss_conf 999997 Q gi|254780791|r 193 AILQLN 198 (529) Q Consensus 193 ai~~~~ 198 (529) ||..+. T Consensus 193 AL~~IH 198 (677) T PRK10917 193 ALRAIH 198 (677) T ss_pred HHHHHH T ss_conf 999975 No 21 >PRK07373 DNA polymerase III subunit alpha; Reviewed Probab=96.02 E-value=0.04 Score=39.13 Aligned_cols=76 Identities=22% Similarity=0.424 Sum_probs=64.1 Q ss_pred CEEEEEEECCCCCC-CCCCE--EEEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEE Q ss_conf 18999997054356-88862--7999874894799999735210586681459889999996675288437999997101 Q gi|254780791|r 34 HVCVRGEISGYRGI-HSSGH--AYFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLIP 110 (529) Q Consensus 34 ~~~v~gEis~~~~~-~~sGH--~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~ 110 (529) .|.+.|=|++++.. .++|. .+++|-|..+++.|++|...+.+..-.+++|.-|++.|+|+.+ .|+.+++|+++.| T Consensus 280 ~V~~~g~I~~vr~~~TKkG~~MAfvtLeD~tG~iEvvvFpe~ye~~~~~L~~d~ivlv~Gkv~~r--d~~~~liv~~i~~ 357 (447) T PRK07373 280 KVSAVVMLNGVKKIVTKKGDPMAFLQLEDLSGQSEAVVFPKSYERIQELLQEDSRLIIWGKVDRR--DDQVQLIVEDAEP 357 (447) T ss_pred EEEEEEEEEEEEEEECCCCCEEEEEEEEECCCCEEEEEEHHHHHHHHHHHCCCCEEEEEEEEEEE--CCCEEEEEEECCC T ss_conf 89999999999998704896599999997897489998089999998873379989999999987--8937899800355 Q ss_pred C Q ss_conf 6 Q gi|254780791|r 111 S 111 (529) Q Consensus 111 ~ 111 (529) - T Consensus 358 l 358 (447) T PRK07373 358 I 358 (447) T ss_pred H T ss_conf 8 No 22 >PRK08957 consensus Probab=96.01 E-value=0.041 Score=39.05 Aligned_cols=47 Identities=17% Similarity=0.258 Sum_probs=20.2 Q ss_pred EECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEECCCCHHH Q ss_conf 97352105866814598899999966752884379999971016800799 Q gi|254780791|r 68 IWKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLIPSGSGTLL 117 (529) Q Consensus 68 ~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~ 117 (529) =|-....+..++|==|.++.+.+.. . ....|.|++-.-...|-.+|. T Consensus 52 eFy~ackk~GIKPIIG~El~v~~~~--~-~~~~~~LvLLAkN~~GY~NL~ 98 (1159) T PRK08957 52 KFYGAAHGAGIKPIIGADFWVQSPL--L-GDELTHLTVLAANNEGYQNLT 98 (1159) T ss_pred HHHHHHHHCCCCEEEEEEEEEECCC--C-CCCCCCEEEEECCHHHHHHHH T ss_conf 9999999869987977799973687--7-787342899958999999999 No 23 >cd04485 DnaE_OBF DnaE_OBF: A subfamily of OB folds corresponding to the C-terminal OB-fold nucleic acid binding domain of Thermus aquaticus and Escherichia coli type C replicative DNA polymerase III alpha subunit (DnaE). The DNA polymerase holoenzyme of E. coli contains two copies of this replicative polymerase, each of which copies a different DNA strand. This group also contains Bacillus subtilis DnaE. Replication in B. subtilis and Staphylococcus aureus requires two different type C polymerases, polC and DnaE, both of which are thought to be included in the DNA polymerase holoenzyme. At the B. subtilis replication fork, polC appears to be involved in leading strand synthesis and DnaE in lagging strand synthesis. Probab=95.98 E-value=0.034 Score=39.78 Aligned_cols=73 Identities=25% Similarity=0.347 Sum_probs=59.9 Q ss_pred EEEEECCCCCC-CCCC--EEEEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEEC Q ss_conf 99997054356-8886--279998748947999997352105866814598899999966752884379999971016 Q gi|254780791|r 37 VRGEISGYRGI-HSSG--HAYFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLIPS 111 (529) Q Consensus 37 v~gEis~~~~~-~~sG--H~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~ 111 (529) |-|-|++++.. .++| -.+|+|-|..+.+.|++|...+....-.++.|.-|++.|+++... ++++|.+.+|.+- T Consensus 2 iaG~I~~~k~~~tk~G~~~a~~tleD~~g~~e~~~F~~~~~~~~~~l~~~~~v~i~g~v~~~~--~~~~l~~~~i~~l 77 (84) T cd04485 2 VAGLVTSVRRRRTKKGKRMAFVTLEDLTGSIEVVVFPETYEKYRDLLKEDALLLVEGKVERRD--GGLRLIAERIEDL 77 (84) T ss_pred EEEEEEEEEEEECCCCCEEEEEEEECCCCCEEEEECHHHHHHHHHHHCCCCEEEEEEEEEEEC--CEEEEEEEEEEEH T ss_conf 899999989985579998999999437874999996999999888744698899999999969--9899999994678 No 24 >TIGR02169 SMC_prok_A chromosome segregation protein SMC; InterPro: IPR011891 The SMC (structural maintenance of chromosomes) family of proteins, exist in virtually all organisms including both bacteria and archaea. The SMC proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms and form three types of heterodimer (SMC1SMC3, SMC2SMC4, SMC5SMC6), which are core components of large multiprotein complexes. The best known complexes are cohesin, which is responsible for sister-chromatid cohesion, and condensin, which is required for full chromosome condensation in mitosis. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and share a five-domain structure, with globular N- and C-terminal (IPR003395 from INTERPRO) domains separated by a long (circa 100 nm or 900 residues) coiled coil segment in the centre of which is a globular ''hinge'' domain, characterised by a set of four highly conserved glycine residues that are typical of flexible regions in a protein. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif (XXXXD, where X is any hydrophobic residue), and a LSGG motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases . All SMC proteins appear to form dimers, either forming homodimers with themselves, as in the case of prokaryotic SMC proteins, or heterodimers between different but related SMC proteins. The dimers are arranged in an antiparallel alignment. This orientation brings the N- and C-terminal globular domains (from either different or identical protamers) together, which unites an ATP binding site (Walker A motif) within the N-terminal domain with a Walker B motif (DA box) within the C-terminal domain, to form a potentially functional ATPase. Protein interaction and microscopy data suggest that SMC dimers form a ring-like structure which might embrace DNA molecules. Non-SMC subunits associate with the SMC amino- and carboxy-terminal domains. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. SMCs share not only sequence similarity but also structural similarity with ABC proteins. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair and epigenetic silencing of gene expression . This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by IPR011890 from INTERPRO. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent .. Probab=95.97 E-value=0.23 Score=32.26 Aligned_cols=35 Identities=6% Similarity=0.014 Sum_probs=13.5 Q ss_pred HHHCCHHHHHCCCEEEEECCCCCEECCHHHCCCCC Q ss_conf 87169677731461999848988957778929998 Q gi|254780791|r 449 LQSFAYKNTLKRGYTSIQDTNNNFITQKRNLATKT 483 (529) Q Consensus 449 L~slsP~~~L~RGYaiv~~~~GkiI~s~~~l~~gd 483 (529) |.-=||..++.-|-.|.-.+.||.|.....++-|+ T Consensus 1083 L~Le~PdDPF~GGl~l~a~P~~K~vqrleAMSGGE 1117 (1202) T TIGR02169 1083 LILENPDDPFAGGLELKAKPKGKPVQRLEAMSGGE 1117 (1202) T ss_pred EECCCCCCCCCCCCEEEEEECCCCEEEHHHCCCCH T ss_conf 54358887436871788873788502022103838 No 25 >PRK09074 consensus Probab=95.97 E-value=0.24 Score=32.24 Aligned_cols=20 Identities=35% Similarity=0.424 Sum_probs=9.7 Q ss_pred CCCCCCHHHHHHHHHHHHHH Q ss_conf 89862299999999999740 Q gi|254780791|r 11 DHPEYSVSELSYHLKHIVES 30 (529) Q Consensus 11 ~~~~~svs~l~~~i~~~l~~ 30 (529) ....||..+=+-.++.+++. T Consensus 12 vhS~YSlLdg~~~i~eLv~~ 31 (1149) T PRK09074 12 VHSAYSLLEGAIKLKKIIGL 31 (1149) T ss_pred CCCCCCHHCCCCCHHHHHHH T ss_conf 57627130407599999999 No 26 >cd04484 polC_OBF polC_OBF: A subfamily of OB folds corresponding to the N-terminal OB-fold nucleic acid binding domain of Bacillus subtilis type C replicative DNA polymerase III alpha subunit (polC). Replication in B. subtilis and Staphylococcus aureus requires two different polymerases, polC and DnaE. The holoenzyme is thought to include the two different polymerases. At the B. subtilis replication fork, polC appears to be involved in leading strand synthesis and DnaE in lagging strand synthesis. Probab=95.93 E-value=0.092 Score=35.87 Aligned_cols=76 Identities=22% Similarity=0.219 Sum_probs=54.8 Q ss_pred CEEEEEEECCCCCC-CCCC-EEE-EEEECCCCEEEEEEECCCCCCCCCCC-CCCCEEEEEEEEEEECC-CCEEEEEEEEE Q ss_conf 18999997054356-8886-279-99874894799999735210586681-45988999999667528-84379999971 Q gi|254780791|r 34 HVCVRGEISGYRGI-HSSG-HAY-FSLKDNHSRIDAIIWKGTLNKIEFLP-EEGIEFLVIGKITTFPG-SSKYQIIIESL 108 (529) Q Consensus 34 ~~~v~gEis~~~~~-~~sG-H~Y-f~lkd~~a~i~~~~~~~~~~~~~~~~-~~G~~v~~~g~~~~y~~-~g~~ql~v~~i 108 (529) +|.|+|||-++-.+ -+|| ++| |.+.|...+|.|-+|........... +.|+-|.+.|++. |.. ...+.|++.+| T Consensus 1 nV~i~G~IF~~E~relk~gk~i~~~~iTD~t~Si~~K~F~~~~~~~~~~~~~~G~~v~v~G~v~-~D~f~~e~v~~~~~i 79 (82) T cd04484 1 NVVVEGEVFDLEIRELKSGRKILTFKVTDYTSSITVKKFLRKDEKDKEELKSKGDWVRVRGKVQ-YDTFSKELVLMINDI 79 (82) T ss_pred CEEEEEEEEEEEEEECCCCCEEEEEEEEECCCCEEEEEEECCCHHHHHHHHCCCCEEEEEEEEE-ECCCCCCEEEECCEE T ss_conf 9899999997898972489899999998088979999983585444566525897999999996-317877159981124 Q ss_pred EE Q ss_conf 01 Q gi|254780791|r 109 IP 110 (529) Q Consensus 109 ~~ 110 (529) +. T Consensus 80 ~~ 81 (82) T cd04484 80 EE 81 (82) T ss_pred EE T ss_conf 78 No 27 >PRK06826 dnaE DNA polymerase III DnaE; Reviewed Probab=95.92 E-value=0.043 Score=38.80 Aligned_cols=12 Identities=17% Similarity=0.429 Sum_probs=4.1 Q ss_pred EEEEEEEEEECC Q ss_conf 799999710168 Q gi|254780791|r 101 YQIIIESLIPSG 112 (529) Q Consensus 101 ~ql~v~~i~~~g 112 (529) |.|++-.-...| T Consensus 89 ~~LvLLAkN~~G 100 (1157) T PRK06826 89 YHLVLLAKNEIG 100 (1157) T ss_pred CEEEEEECCHHH T ss_conf 239999589999 No 28 >cd04488 RecG_wedge_OBF RecG_wedge_OBF: A subfamily of OB folds corresponding to the OB fold found in the N-terminal (wedge) domain of Escherichia coli RecG. RecG is a branched-DNA-specific helicase, which catalyzes the interconversion of a DNA replication fork to a four-stranded (Holliday) junction in vivo and in vitro. This interconversion provides a route to repair stalled forks. The RecG monomer contains three domains. The N-terminal domain is named for its wedge structure, and may provide the specificity of RecG for binding branched-DNA structures. During the reversal of fork to Holliday junction, the wedge domain is fixed at the junction of the fork where the leading and lagging strand duplex arms meet, and is thought to promote the unwinding of the nascent leading and lagging strands. In order to form the Holliday junction, these nascent strands would be annealed, and the parental strands reannealed. The wedge domain may also be a processivity factor of RecG on these branched cha Probab=95.62 E-value=0.088 Score=36.04 Aligned_cols=67 Identities=16% Similarity=0.249 Sum_probs=46.6 Q ss_pred EEEEECCCCCCCCC-CEEE-EEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEE Q ss_conf 99997054356888-6279-9987489479999973521058668145988999999667528843799999 Q gi|254780791|r 37 VRGEISGYRGIHSS-GHAY-FSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIE 106 (529) Q Consensus 37 v~gEis~~~~~~~s-GH~Y-f~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~ 106 (529) |.|+|......+.. +-.+ ..+.|+.+.+.|++|... ..+.-.++.|..|.+.|+++.| +|++|+.-- T Consensus 2 i~g~I~~~~~~~~~~r~~l~~~v~D~tg~i~l~~Fn~~-~~~~~~~~~G~~v~v~Gkv~~~--~~~~q~~hP 70 (75) T cd04488 2 VEGTVVSVEVVPRRGRRRLKVTLSDGTGTLTLVFFNFQ-PYLKKQLPPGTRVRVSGKVKRF--RGGLQIVHP 70 (75) T ss_pred EEEEEEEEEEECCCCCEEEEEEEEECCCEEEEEEECCC-HHHHHHCCCCCEEEEEEEEEEC--CCEEEEECC T ss_conf 99999998854589970899999969988999998997-7999538899999999999713--999999799 No 29 >pfam00465 Fe-ADH Iron-containing alcohol dehydrogenase. Probab=95.36 E-value=0.19 Score=33.11 Aligned_cols=70 Identities=23% Similarity=0.372 Sum_probs=48.0 Q ss_pred CEEEEEECCCHH---HHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHH Q ss_conf 289998478425---89999998630597589997210011110367999999997410035767775899951688844 Q gi|254780791|r 146 KIIAVITSPTGA---VIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIE 222 (529) Q Consensus 146 ~~i~vits~~~a---~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~e 222 (529) +|+-|||+++-. .+..+...|.+. .+++.+| .-|.++-...+|.++++.+...+ +| .||+=||||.- T Consensus 23 ~r~llVt~~~~~~~g~~~~l~~~L~~~-~~~~~~~-~~v~~~p~~~~v~~~~~~~~~~~-------~D-~IIaiGGGS~i 92 (312) T pfam00465 23 ARALIVTDPSLKKLGLLDRVLDSLEEA-GIEVVVF-DGVEPNPTLEEVDEAAAAAREEG-------AD-VIIAVGGGSVI 92 (312) T ss_pred CEEEEEECCCHHHCCHHHHHHHHHHHC-CCEEEEE-CCCCCCCCHHHHHHHHHHHHHCC-------CC-EEEECCCCCCC T ss_conf 979999895856676399999999974-9949998-58279999999999999999649-------98-99980897612 Q ss_pred HHH Q ss_conf 422 Q gi|254780791|r 223 DLW 225 (529) Q Consensus 223 DL~ 225 (529) |.. T Consensus 93 D~a 95 (312) T pfam00465 93 DTA 95 (312) T ss_pred HHH T ss_conf 499 No 30 >PRK07279 dnaE DNA polymerase III DnaE; Reviewed Probab=95.30 E-value=0.07 Score=36.95 Aligned_cols=38 Identities=13% Similarity=0.168 Sum_probs=21.4 Q ss_pred CCCCEEEEEEEEEEECCCCEEEEEEEEEEECCCCHHHH Q ss_conf 45988999999667528843799999710168007999 Q gi|254780791|r 81 EEGIEFLVIGKITTFPGSSKYQIIIESLIPSGSGTLLT 118 (529) Q Consensus 81 ~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~ 118 (529) +.|.|-++=..+.+......|.|++-.-...|-.+|.+ T Consensus 55 ~~gIKPIiG~E~~v~~~~~~~hl~LLAkn~~Gy~nL~k 92 (1033) T PRK07279 55 KNGLQPVLGLELNIFYQERQVLLRLIAKNTQGYKNLLK 92 (1033) T ss_pred HCCCCCEEEEEEEEECCCCCEEEEEECCCHHHHHHHHH T ss_conf 75989478889999748962369998169999999999 No 31 >PRK06658 consensus Probab=95.25 E-value=0.11 Score=35.32 Aligned_cols=18 Identities=22% Similarity=0.407 Sum_probs=9.1 Q ss_pred EEEEEEEEEEECCCCHHH Q ss_conf 379999971016800799 Q gi|254780791|r 100 KYQIIIESLIPSGSGTLL 117 (529) Q Consensus 100 ~~ql~v~~i~~~g~G~l~ 117 (529) .|.|++..-...|-.+|. T Consensus 89 ~~hLvLLAkN~~GY~NL~ 106 (1145) T PRK06658 89 YYHLILLAKNETGYRNLM 106 (1145) T ss_pred CCEEEEEECCHHHHHHHH T ss_conf 330899828999999999 No 32 >cd04483 hOBFC1_like hOBFC1_like: A subfamily of OB folds similar to that found in human OB fold containing protein 1 (hOBFC1). Members of this group belong to the Replication protein A subunit 2 (RPA2) family of OB folds. RPA is a nuclear ssDNA binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). The OB fold domain of RPA2 has dual roles in ssDNA binding and trimerization. Probab=95.21 E-value=0.24 Score=32.11 Aligned_cols=68 Identities=19% Similarity=0.266 Sum_probs=49.4 Q ss_pred EEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCC--------------------CCCCCCCCEEEEEEEEEEEC Q ss_conf 9999705435688862799987489479999973521058--------------------66814598899999966752 Q gi|254780791|r 37 VRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKI--------------------EFLPEEGIEFLVIGKITTFP 96 (529) Q Consensus 37 v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~--------------------~~~~~~G~~v~~~g~~~~y~ 96 (529) |.|-|...+.+. .-+-|++-|..+-|.|++|++....- .-.++-|+-+-|+|+|..| T Consensus 2 IlG~VVs~~ere--~f~~y~vDDgTGcI~Cv~W~~~~~~~~~s~~~~~~~~~~~~~a~~~~~~l~iG~llRVRG~I~~y- 78 (92) T cd04483 2 ILGTVVSRRERE--TFYSFGVDDGTGVVNCVCWKNLSYAEVSSRSDAARILKSALMALKQAKVLEIGDLLRVRGSIRTY- 78 (92) T ss_pred EEEEEEEEEEEC--CEEEEEEECCCCEEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEE- T ss_conf 379999997605--71699860898669989960457677666678445444457888761157752389994268887- Q ss_pred CCCEEEEEEEEE Q ss_conf 884379999971 Q gi|254780791|r 97 GSSKYQIIIESL 108 (529) Q Consensus 97 ~~g~~ql~v~~i 108 (529) ||.-|+.+..+ T Consensus 79 -Rg~rei~as~~ 89 (92) T cd04483 79 -RGEREINASVV 89 (92) T ss_pred -ECEEEEEEEEE T ss_conf -04489968999 No 33 >PRK06920 dnaE DNA polymerase III DnaE; Reviewed Probab=95.16 E-value=0.14 Score=34.14 Aligned_cols=37 Identities=19% Similarity=0.286 Sum_probs=17.6 Q ss_pred CCCEEEEEEEEEEEC--CCCEEEEEEEEEEECCCCHHHH Q ss_conf 598899999966752--8843799999710168007999 Q gi|254780791|r 82 EGIEFLVIGKITTFP--GSSKYQIIIESLIPSGSGTLLT 118 (529) Q Consensus 82 ~G~~v~~~g~~~~y~--~~g~~ql~v~~i~~~g~G~l~~ 118 (529) .|.|-++=..+.++. ....|.|++-.-...|-..|.+ T Consensus 57 ~gIKPIiG~e~~v~~~~~~~~~~LiLLAkN~~GY~nL~k 95 (1107) T PRK06920 57 HGIKPIIGLTASIFSEEEEKSYPLVLLAENEIGYQNLLK 95 (1107) T ss_pred CCCCCEEEEEEEEECCCCCCCCCEEEEECCHHHHHHHHH T ss_conf 799803542799843897775787999349999999999 No 34 >PRK07374 dnaE DNA polymerase III subunit alpha; Validated Probab=95.06 E-value=0.11 Score=35.12 Aligned_cols=50 Identities=18% Similarity=0.235 Sum_probs=23.5 Q ss_pred CCCCCCCCCCCCCCEEEE-EEEEEE--ECCCCEEEEEEEEEEECCCCHHHHHH Q ss_conf 521058668145988999-999667--52884379999971016800799999 Q gi|254780791|r 71 GTLNKIEFLPEEGIEFLV-IGKITT--FPGSSKYQIIIESLIPSGSGTLLTAL 120 (529) Q Consensus 71 ~~~~~~~~~~~~G~~v~~-~g~~~~--y~~~g~~ql~v~~i~~~g~G~l~~~~ 120 (529) ..+.....+|=-|.++.+ .|.+.- ......|.|++......|-.+|.+-. T Consensus 52 ~aa~~~gIKPIiG~e~~v~~~~~~d~~~~~~~~~hlvlLAkN~~GY~nL~kL~ 104 (1171) T PRK07374 52 KLCKSAGIKPIIGNEMYVINGSIDDPQPKKEKRYHLVVLAKNATGYKNLVKLT 104 (1171) T ss_pred HHHHHCCCCCEEEEEEEEECCCCCCCCCCCCCCCCEEEECCCHHHHHHHHHHH T ss_conf 99997699833676899737976566556567751899558989999999999 No 35 >PRK06637 consensus Probab=95.05 E-value=0.22 Score=32.59 Aligned_cols=18 Identities=28% Similarity=0.294 Sum_probs=9.2 Q ss_pred EEEEEEEEEECCCCHHHH Q ss_conf 799999710168007999 Q gi|254780791|r 101 YQIIIESLIPSGSGTLLT 118 (529) Q Consensus 101 ~ql~v~~i~~~g~G~l~~ 118 (529) +.|++......|-..|.+ T Consensus 79 ~~lvLLAkn~~GY~nL~k 96 (1182) T PRK06637 79 AQILLIAKDETGYKNLLK 96 (1182) T ss_pred CCEEEEECCHHHHHHHHH T ss_conf 438998079999999999 No 36 >PRK09860 putative alcohol dehydrogenase; Provisional Probab=94.94 E-value=0.48 Score=29.52 Aligned_cols=97 Identities=19% Similarity=0.249 Sum_probs=60.9 Q ss_pred EEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHH---HHHHHHHHHHCCCEEEEEE Q ss_conf 7999997101680079999999999765401226100163102652899984784258---9999998630597589997 Q gi|254780791|r 101 YQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAV---IRDILQRISCRFPLRVIIF 177 (529) Q Consensus 101 ~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~---~~D~~~~~~~r~p~~~~~~ 177 (529) |.|++-.--..|.|.+.+ +.+.+.+-|. +++-|||.++-.. +.++...|.+.. +.+.+| T Consensus 4 ftF~~Pt~i~fG~g~~~~----l~~~~~~~G~-------------k~~lvvt~~~~~~~g~~~~v~~~L~~~g-i~~~~f 65 (383) T PRK09860 4 STFFIPSVNVIGADSLTD----AMNMMADYGF-------------TRTLIVTDNMLTKLGMAGDVQKALEERN-IFSVIY 65 (383) T ss_pred CEEECCCCEEECCCHHHH----HHHHHHHCCC-------------CEEEEECCCCHHHCCHHHHHHHHHHHCC-CCEEEE T ss_conf 489589814988389999----9999998299-------------8799982845665746999999998769-958996 Q ss_pred ECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHH Q ss_conf 21001111036799999999741003576777589995168884442 Q gi|254780791|r 178 PVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDL 224 (529) Q Consensus 178 p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL 224 (529) .-|+.+-....+-++++.+...+ +| .||+=||||.-|. T Consensus 66 -~~V~~nP~~~~v~~~~~~~r~~~-------~D-~ivavGGGS~iD~ 103 (383) T PRK09860 66 -DGTQPNPTTENVAAGLKLLKENN-------CD-SVISLGGGSPHDC 103 (383) T ss_pred -CCCCCCCCHHHHHHHHHHHHHCC-------CC-EEEEECCCCHHHH T ss_conf -89527969999999999998739-------99-9999389622678 No 37 >PRK07012 consensus Probab=94.86 E-value=0.16 Score=33.71 Aligned_cols=60 Identities=10% Similarity=0.076 Sum_probs=23.1 Q ss_pred EEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEECCCCHHH Q ss_conf 998748947999997352105866814598899999966752884379999971016800799 Q gi|254780791|r 55 FSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLIPSGSGTLL 117 (529) Q Consensus 55 f~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~ 117 (529) .-|.|.+.--.++=|-..+.+...+|==|.++.+...- .....+.|++..-...|-..|. T Consensus 39 lAlTD~~nl~Gav~F~~aa~~~gIKPIiG~e~~v~~~~---~~~~~~~l~LLAkn~~GY~nL~ 98 (1173) T PRK07012 39 LALTDLGNAFGLVRFYKEARGAGVKPIAGCDVWITNPD---DRDKPSRLLLLVKDKRGYLNLC 98 (1173) T ss_pred EEEECCCHHHHHHHHHHHHHHCCCCCEEEEEEEEECCC---CCCCCCCEEEECCCHHHHHHHH T ss_conf 98851886878999999999779983367799976787---7777642899789889999999 No 38 >COG4085 Predicted RNA-binding protein, contains TRAM domain [General function prediction only] Probab=94.84 E-value=0.17 Score=33.56 Aligned_cols=75 Identities=19% Similarity=0.295 Sum_probs=63.0 Q ss_pred CEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCC-----CCCCCCCEEEEEEEEEEECCCCEEEEEEEEE Q ss_conf 18999997054356888627999874894799999735210586-----6814598899999966752884379999971 Q gi|254780791|r 34 HVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIE-----FLPEEGIEFLVIGKITTFPGSSKYQIIIESL 108 (529) Q Consensus 34 ~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~-----~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i 108 (529) .+-|+|||...+..+.+|-...-|-|....|..+.|++...-+. +-..+|+-|.+.|+++.| +|++++.+... T Consensus 53 ~v~vkg~V~~~~n~~~~gi~~l~lndgtGti~vva~~~tee~l~~n~~~p~~~eGe~veVtGrv~~y--rG~~eVkvnq~ 130 (204) T COG4085 53 EVTVKGEVTADQNAIGGGIESLVLNDGTGTITVVASRSTEETLELNEGMPVTVEGEIVEVTGRVEEY--RGSSEVKVNQP 130 (204) T ss_pred CCEEEEEEEEEECCCCCCEEEEEEECCCCCEEEEEECCHHHHHHHCCCCCCCCCCCEEEEEEEEEEE--CCCCEEECCCC T ss_conf 5322148975310123665999997798748999906736768644788732267589998888771--79730310684 Q ss_pred EE Q ss_conf 01 Q gi|254780791|r 109 IP 110 (529) Q Consensus 109 ~~ 110 (529) .- T Consensus 131 ~d 132 (204) T COG4085 131 ND 132 (204) T ss_pred CC T ss_conf 43 No 39 >PRK09423 gldA glycerol dehydrogenase; Provisional Probab=94.42 E-value=0.39 Score=30.30 Aligned_cols=89 Identities=21% Similarity=0.270 Sum_probs=56.3 Q ss_pred CEEEEEECCCHHH-HHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHH Q ss_conf 2899984784258-999999863059758999721001111036799999999741003576777589995168884442 Q gi|254780791|r 146 KIIAVITSPTGAV-IRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDL 224 (529) Q Consensus 146 ~~i~vits~~~a~-~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL 224 (529) +|+-|||.+.... +.|-+...-+...+++. | ..+.|+-...++-++.+.+... ++| +||+=||||.-|. T Consensus 30 ~r~liVtd~~~~~~~~~~v~~~L~~~gi~~~-~-~~~~~~pt~~~v~~~~~~~~~~-------~~D-~IiavGGGS~iD~ 99 (366) T PRK09423 30 KRALLIADEFVLGIVGDTVEASLKDAGLDVV-F-EVFNGECSDNEIDRLVAIAEEN-------GCD-VIIGIGGGKTLDT 99 (366) T ss_pred CEEEEEECCHHHHHHHHHHHHHHHHCCCEEE-E-EECCCCCCHHHHHHHHHHHHHC-------CCC-EEEEECCHHHHHH T ss_conf 9589998952899899999999986798699-9-7338999999999999999864-------999-8999378388777 Q ss_pred HHCCHHHHHHHHHHC---C---CEEEEEECCCCC Q ss_conf 200769999999748---9---048852057775 Q gi|254780791|r 225 WHFNDEMIVRAIANS---S---IPIISAIGHETD 252 (529) Q Consensus 225 ~~FN~e~laraI~~~---~---iPVisgIGHE~D 252 (529) |++|+.. | ||.++|-|=|.. T Consensus 100 --------AKaia~~~~~P~i~IPTtAgTgSe~t 125 (366) T PRK09423 100 --------AKAVADYLGVPVVIVPTIASTDAPCS 125 (366) T ss_pred --------HHHHHHHCCCCEEEECCCCCCCCCCC T ss_conf --------99999982899799568666788766 No 40 >pfam01442 Apolipoprotein Apolipoprotein A1/A4/E domain. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I, Apolipoprotein A-IV, and Apolipoprotein E. Probab=94.39 E-value=0.62 Score=28.51 Aligned_cols=86 Identities=19% Similarity=0.154 Sum_probs=48.4 Q ss_pred HHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 76332346777669999988877899888999985567765304-97889867888788865456899999987568899 Q gi|254780791|r 272 EMAVPVKEHLQSSLINLEARLNNIIIRLIKYKINTLNSLLKALP-NSDQTLSCSRYRLDRLPRELEHNLEIIIFRKYRHF 350 (529) Q Consensus 272 Elavp~~~EL~~~L~~l~~RL~~a~~~~l~~~~q~L~~l~r~L~-~~~~~L~~~~QrLd~~~~rL~~~l~~~l~~~~~~l 350 (529) +-+.|...++..++..-...|...+..-++..+.++.....-+. ...+-+...+++++.....|...+.......+..+ T Consensus 19 ~~l~P~t~~~~~~l~~~~~~lr~~l~~dleelr~~l~p~~~el~~~~~~~~eelr~~l~p~~~el~~~~~~~~eeLr~~l 98 (191) T pfam01442 19 EQLGPYAQEFWAQLSKETEALREELQKDLEEVRARLQPYLDELKAKVGQNLEELRQRLAPYAEELRERLNRDAEELRRKL 98 (191) T ss_pred HHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 77051689999888899999999998589999998789999999999989999999877789999998886599999999 Q ss_pred HHHHHHH Q ss_conf 9998410 Q gi|254780791|r 351 NHIINRI 357 (529) Q Consensus 351 ~~~~~rL 357 (529) ......+ T Consensus 99 ~~~~eel 105 (191) T pfam01442 99 APYAEEL 105 (191) T ss_pred HHHHHHH T ss_conf 9999999 No 41 >PRK00484 lysS lysyl-tRNA synthetase; Reviewed Probab=94.39 E-value=0.6 Score=28.60 Aligned_cols=234 Identities=19% Similarity=0.240 Sum_probs=139.7 Q ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCC---CCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCC Q ss_conf 99888888888986229999999999974001---718999997054356888627999874894799999735210586 Q gi|254780791|r 1 MNPFSQKNSLDHPEYSVSELSYHLKHIVESNL---SHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIE 77 (529) Q Consensus 1 ~~~~~~~~~~~~~~~svs~l~~~i~~~l~~~~---~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~ 77 (529) ++|+|.+... -.|+.++...-..+-.... ..|.|.|=|.+.|. .++-++++|+|..+.|.|++-+.....-. T Consensus 22 ~~pyp~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~V~vaGrv~~~R~--~Gk~~F~~lrD~~g~iQ~~~~~~~~~~~~ 96 (491) T PRK00484 22 INPYPNDFER---THTSAELQAKYADLEKEELEALIEVSVAGRVMLKRV--MGKASFATIQDGSGRIQLYVSKDDVGEEA 96 (491) T ss_pred CCCCCCCCCC---CCCHHHHHHHHHHCCCCCCCCCCEEEEEEEEEEEEC--CCCEEEEEEEECCCCEEEEEECCCCCHHH T ss_conf 9999997858---666999999875138533467988999987840505--69819999996994589999667589999 Q ss_pred ----CCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEEC------------CCCHHHHHH--------------------- Q ss_conf ----6814598899999966752884379999971016------------800799999--------------------- Q gi|254780791|r 78 ----FLPEEGIEFLVIGKITTFPGSSKYQIIIESLIPS------------GSGTLLTAL--------------------- 120 (529) Q Consensus 78 ----~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~------------g~G~l~~~~--------------------- 120 (529) -.+.-|+-|.|.|.+- -.++|.+++.|.+++.- |+-+...++ T Consensus 97 ~~~~~~l~~gd~i~v~G~v~-~t~~Gel~v~~~~~~iLsksl~plP~k~~g~~d~e~r~r~RyLDLi~n~~~~~~f~~Rs 175 (491) T PRK00484 97 YEAFKKLDLGDIIGVEGTLF-KTKTGELSVKVTELTLLTKSLRPLPDKFHGLSDQETRYRQRYLDLIVNEESRETFRKRS 175 (491) T ss_pred HHHHHCCCCCCEEEEEEEEE-ECCCCEEEEECCEEEEECCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHHHHHHHH T ss_conf 99996068666898787998-65885166403317993168889985444556755631133021027999999999999 Q ss_pred ---HHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEE--EE------CCCCCCCHHHH Q ss_conf ---99999765401226100163102652899984784258999999863059758999--72------10011110367 Q gi|254780791|r 121 ---EKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCRFPLRVII--FP------VKVQGDECPKE 189 (529) Q Consensus 121 ---e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r~p~~~~~--~p------~~vQG~~a~~~ 189 (529) ..+++-|...|+.+-+. | | +-+.+.||.-++|+.... -++.+.+| .| ..|=|-+-.-+ T Consensus 176 ~ii~~iR~~l~~~gF~EVeT----P-----i-L~~~~gGA~ArpF~t~~n-~l~~~~yL~~SPqLylk~l~vgG~ervfe 244 (491) T PRK00484 176 KIISAIRRFLDNRGFLEVET----P-----M-LQPIPGGAAARPFITHHN-ALDIDLYLRIAPELYLKRLIVGGFERVFE 244 (491) T ss_pred HHHHHHHHHHHHCCEEEEEC----C-----C-CCCCCCCCCCCCCCCCCC-CCCCCEEECCCHHHHHHHHHHCCCHHHEE T ss_conf 99999999998676899867----8-----7-766688756676556445-66855442068789877787627202222 Q ss_pred HHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHH--CCCEEEEEECCCCCCH Q ss_conf 9999999974100357677758999516888444220076999999974--8904885205777525 Q gi|254780791|r 190 IANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIAN--SSIPIISAIGHETDWT 254 (529) Q Consensus 190 i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~--~~iPVisgIGHE~D~T 254 (529) |....+-=+.. ..+.+.|=-|=+=......+|+|.+-++-+ +.|+. .....++--|++.|++ T Consensus 245 I~r~FR~E~~~--~rH~pEFT~lE~e~af~d~~dvm~l~E~li-~~v~~~v~g~~~~~~~~~~idl~ 308 (491) T PRK00484 245 IGRNFRNEGID--TRHNPEFTMIEFYQAYADYNDMMDLTEELI-RHLAKEVLGTTKITYGGTEIDFG 308 (491) T ss_pred HHHHHHCCCCC--CCCCCCEEEEEEEEECCCHHHHHHHHHHHH-HHHHHHHCCCCEEEECCEEECCC T ss_conf 48766257554--341840102136772479999999999999-99999961995673287763378 No 42 >PRK06142 enoyl-CoA hydratase; Provisional Probab=94.31 E-value=0.64 Score=28.37 Aligned_cols=39 Identities=21% Similarity=0.273 Sum_probs=25.3 Q ss_pred HHHHHHHHCCCEEEEEE-CCCCCC-HHHHHHHCCCCCCCHH Q ss_conf 99999974890488520-577752-5898864123777214 Q gi|254780791|r 231 MIVRAIANSSIPIISAI-GHETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 231 ~laraI~~~~iPVisgI-GHE~D~-Tl~D~VAD~Ra~TPTa 269 (529) ....+|.+||.|||++| |+-.=- .-+=+.+|+|..||.| T Consensus 99 ~~~~~i~~~~kPvIaav~G~a~GgG~~la~~cD~~ia~~~a 139 (271) T PRK06142 99 ASINAVADCRKPVLAAVQGWCIGGAVDLISACDMRYASADA 139 (271) T ss_pred HHHHHHHHCCCCEEEEECCEEEEHHHHHHHHHCCCCCCCCC T ss_conf 99999972899899998884440758999861512227985 No 43 >TIGR02393 RpoD_Cterm RNA polymerase sigma factor RpoD; InterPro: IPR012760 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , . This entry represents the C-terminal region of sigma 70 (RpoD) which contains the well-conserved regions 2, 3 and 4. Region 2 of sigma-70 is the most conserved region of the entire protein. The high conservation is due to region 2 containing both the -10 promoter recognition helix and the primary core RNA polymerase binding determinant. The core-binding helix, interacts with the clamp domain of the largest polymerase subunit, beta prime , . The aromatic residues of the recognition helix, found at the C terminus of this domain are thought to mediate strand separation, thereby allowing transcription initiation , . Region 3 forms a discrete compact three helical domain within the sigma-factor. Region is not normally involved in the recognition of promoter DNA, but in some specific bacterial promoters containing an extended -10 promoter element, residues within region 3 play an important role. Region 3 primarily is involved in binding the core RNA polymerase in the holoenzyme .Region 4 is involved in binding to the -35 promoter element via a helix-turn-helix motif .; GO: 0003677 DNA binding, 0016987 sigma factor activity, 0006352 transcription initiation, 0006355 regulation of transcription DNA-dependent. Probab=94.21 E-value=0.053 Score=38.01 Aligned_cols=52 Identities=27% Similarity=0.416 Sum_probs=41.3 Q ss_pred HHHHHCCHHHH--HHHHHHCCCEEEEEECCCCCCHHHHHHHCCCCCCCHHHHHH Q ss_conf 44220076999--99997489048852057775258988641237772145676 Q gi|254780791|r 222 EDLWHFNDEMI--VRAIANSSIPIISAIGHETDWTLADYAADLRAPTPTGAAEM 273 (529) Q Consensus 222 eDL~~FN~e~l--araI~~~~iPVisgIGHE~D~Tl~D~VAD~Ra~TPTaAAEl 273 (529) .+=|.++-+.| +..++.-||=.=+-||||-|..+.|||.|..+.+|..+|+. T Consensus 110 Ae~~g~~~~Kv~~i~~~~~~PiSLe~piG~e~dS~l~DFi~D~~~~~P~~~~~~ 163 (240) T TIGR02393 110 AERMGLPVEKVREIKKIAQEPISLETPIGEEEDSSLGDFIEDESIESPEDAAAK 163 (240) T ss_pred HHHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH T ss_conf 987089989999999862488656667788777744265257444685899999 No 44 >COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only] Probab=94.00 E-value=0.33 Score=30.94 Aligned_cols=77 Identities=18% Similarity=0.188 Sum_probs=51.8 Q ss_pred CCCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCC--CCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEE Q ss_conf 17189999970543568886279998748947999997352105--8668145988999999667528843799999710 Q gi|254780791|r 32 LSHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNK--IEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLI 109 (529) Q Consensus 32 ~~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~--~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~ 109 (529) +.++.|+|++..--..-..||+.|.+-|.+..|.|+.|.-.... +--++.-|++|.+.|++.=+ . |+++.|. T Consensus 266 ~~~~~v~g~v~~~p~~ieGghv~v~i~d~~G~I~~~A~eptk~fr~~a~~L~pGD~i~~~G~~~~~---~---~n~ek~~ 339 (421) T COG1571 266 YSKYRVVGRVEAEPRAIEGGHVVVEITDGEGEIGAVAFEPTKEFRELARKLIPGDEITVYGSVKPG---T---LNLEKFQ 339 (421) T ss_pred CCCEEEEEEEECCCEEEECCEEEEEECCCCCEEEEEEECCCCCCHHHHHHCCCCCEEEEECCCCCC---C---EEEEEEE T ss_conf 343289999812318963887999961898169999963555225789856899989980574566---2---3689999 Q ss_pred ECCCC Q ss_conf 16800 Q gi|254780791|r 110 PSGSG 114 (529) Q Consensus 110 ~~g~G 114 (529) +-.+. T Consensus 340 v~~l~ 344 (421) T COG1571 340 VLKLA 344 (421) T ss_pred EEEEE T ss_conf 99866 No 45 >PRK06752 single-stranded DNA-binding protein; Validated Probab=93.89 E-value=0.52 Score=29.16 Aligned_cols=77 Identities=18% Similarity=0.380 Sum_probs=53.2 Q ss_pred CEEEEEEECC---CCCCCCCCEEE--EEEE------CCC-----CEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEE--E Q ss_conf 1899999705---43568886279--9987------489-----479999973521058668145988999999667--5 Q gi|254780791|r 34 HVCVRGEISG---YRGIHSSGHAY--FSLK------DNH-----SRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITT--F 95 (529) Q Consensus 34 ~~~v~gEis~---~~~~~~sGH~Y--f~lk------d~~-----a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~--y 95 (529) .|-+.|-+.. ++ ...+|+.| |+|. +.. .-++|++|+..+..+.-.++-|+.|.|.|++.. | T Consensus 4 ~V~LiGrl~~dpelr-~t~~G~~~~~f~lAv~r~~~~~~g~~~tdf~~~~~w~k~Ae~~~~yl~KG~~V~v~Grl~~~~y 82 (112) T PRK06752 4 RVVLIGRLTKEPELY-YTKQGVAYARVCVAVNRGFRNSLGEQQVDFINCVVWRKSAENVTEYCTKGSLVGITGRIHTSNY 82 (112) T ss_pred EEEEEEECCCCCEEE-ECCCCCEEEEEEEEECCCEECCCCCEEEEEEEEEEECHHHHHHHHHHCCCCEEEEEEEEECCCE T ss_conf 899999678686699-9899978999999613644878897878999999996589899998479999999999984610 Q ss_pred CCC-----CEEEEEEEEEEEC Q ss_conf 288-----4379999971016 Q gi|254780791|r 96 PGS-----SKYQIIIESLIPS 111 (529) Q Consensus 96 ~~~-----g~~ql~v~~i~~~ 111 (529) ... +...++|+.|+.- T Consensus 83 ~~~~G~~~~~~eVvv~~v~fL 103 (112) T PRK06752 83 EDDQGKRIYRTEVVIESITFL 103 (112) T ss_pred ECCCCCEEEEEEEEEEEEEEC T ss_conf 999997999999999999977 No 46 >COG1454 EutG Alcohol dehydrogenase, class IV [Energy production and conversion] Probab=93.72 E-value=0.54 Score=29.02 Aligned_cols=70 Identities=21% Similarity=0.330 Sum_probs=53.2 Q ss_pred CCEEEEEECCCHH---HHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH Q ss_conf 5289998478425---8999999863059758999721001111036799999999741003576777589995168884 Q gi|254780791|r 145 PKIIAVITSPTGA---VIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI 221 (529) Q Consensus 145 p~~i~vits~~~a---~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~ 221 (529) -+|+-|||.++-+ .+.+++..|..+. +++.+| .-|+++=....+.++.+.+.... +| .||+=||||. T Consensus 29 ~~r~liVTd~~~~~~g~~~~v~~~L~~~~-i~~~if-~~v~p~P~~~~v~~~~~~~~~~~-------~D-~iIalGGGS~ 98 (377) T COG1454 29 AKRALIVTDRGLAKLGLLDKVLDSLDAAG-IEYEVF-DEVEPEPTIETVEAGAEVAREFG-------PD-TIIALGGGSV 98 (377) T ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHCC-CEEEEE-CCCCCCCCHHHHHHHHHHHHHCC-------CC-EEEEECCCCH T ss_conf 87159997986311066999999987449-828995-68899998889999999998169-------99-8999779308 Q ss_pred HHH Q ss_conf 442 Q gi|254780791|r 222 EDL 224 (529) Q Consensus 222 eDL 224 (529) -|. T Consensus 99 ~D~ 101 (377) T COG1454 99 IDA 101 (377) T ss_pred HHH T ss_conf 789 No 47 >cd04490 PolII_SU_OBF PolII_SU_OBF: A subfamily of OB folds corresponding to the OB fold found in Pyrococcus abyssi DNA polymerase II (PolII) small subunit. PolII is a family D DNA polymerase, having a 3-prime to 5-prime exonuclease activity. P. abyssi PolII is heterodimeric. The large subunit appears to be the polymerase, and the small subunit may be the exonuclease. The small subunit contains a calcineurin-like phosphatase superfamily domain C-terminal to this OB-fold domain. Probab=93.59 E-value=0.42 Score=29.97 Aligned_cols=67 Identities=24% Similarity=0.340 Sum_probs=45.2 Q ss_pred EEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCCCCCCC---CEE-EEEEEEEEECCCCEEEEEEEEE Q ss_conf 8999997054356888627999874894799999735210586681459---889-9999966752884379999971 Q gi|254780791|r 35 VCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEFLPEEG---IEF-LVIGKITTFPGSSKYQIIIESL 108 (529) Q Consensus 35 ~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G---~~v-~~~g~~~~y~~~g~~ql~v~~i 108 (529) +.|.|=||+.+ ..++||+.|+|-|..+.|+|.+.+.+-.. +...++ |+| =+.|. |...|++ |++++| T Consensus 2 v~iIGmV~d~r-~TknG~~~~~lED~TG~i~vl~~kd~~~~--~~~~~~ll~DeVIGV~G~---~s~~g~~-~f~~~i 72 (79) T cd04490 2 VSIIGMVNDVR-STKNGHRIVELEDTTGRITVLLTKDKEEL--FEEAEDILPDEVIGVSGT---VSKDGGL-IFADEI 72 (79) T ss_pred EEEEEEECCCE-ECCCCCEEEEEECCCCEEEEEEECCCHHH--HHHHHCCCCCCEEEEEEE---ECCCCCE-EEEECC T ss_conf 69999993327-61688789999879998999996893677--566752568878999999---8789988-999712 No 48 >cd06533 Glyco_transf_WecG_TagA The glycosyltransferase WecG/TagA superfamily contains Escherichia coli WecG, Bacillus subtilis TagA and related proteins. E. coli WecG is believed to be a UDP-N-acetyl-D-mannosaminuronic acid transferase, and is involved in enterobacterial common antigen (eca) synthesis. B. subtilis TagA plays a key role in the Wall Teichoic Acid (WTA) biosynthetic pathway, catalyzing the transfer of N-acetylmannosamine to the C4 hydroxyl of a membrane-anchored N-acetylglucosaminyl diphospholipid to make ManNAc-beta-(1,4)-GlcNAc-pp-undecaprenyl. This is the first committed step in this pathway. Also included in this group is Xanthomonas campestris pv. campestris GumM, a glycosyltransferase participating in the biosynthesis of the exopolysaccharide xanthan. Probab=93.57 E-value=0.65 Score=28.33 Aligned_cols=92 Identities=18% Similarity=0.318 Sum_probs=67.5 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHH Q ss_conf 6528999847842589999998630597-589997210011110367999999997410035767775899951688844 Q gi|254780791|r 144 IPKIIAVITSPTGAVIRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIE 222 (529) Q Consensus 144 ~p~~i~vits~~~a~~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~e 222 (529) -..+|.++-|..+ .+..+.+.++.+|| ++|.-+...--|.....+|++.|... ++|+|+++=|..-.| T Consensus 45 ~~~~v~llG~~~~-~~~~~~~~l~~~yp~l~i~g~~~g~f~~~~~~~i~~~I~~~----------~~div~vglG~PkQE 113 (171) T cd06533 45 KGLRVFLLGAKPE-VLEKAAERLRARYPGLKIVGYHHGYFGPEEEEEIIERINAS----------GADILFVGLGAPKQE 113 (171) T ss_pred CCCEEEEEECCHH-HHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHHHHHC----------CCCEEEEECCCCHHH T ss_conf 4974999808989-99999999997889937999878999806689999999864----------999999967982889 Q ss_pred HHHHCCHHHHHHHHHHCCCEEEEEECCCCCC Q ss_conf 4220076999999974890488520577752 Q gi|254780791|r 223 DLWHFNDEMIVRAIANSSIPIISAIGHETDW 253 (529) Q Consensus 223 DL~~FN~e~laraI~~~~iPVisgIGHE~D~ 253 (529) -++..|-..+ +.+|+-|||=--|+ T Consensus 114 ~~~~~~~~~l-------~~~~~~~vGgafd~ 137 (171) T cd06533 114 LWIARHKDRL-------PVPVAIGVGGSFDF 137 (171) T ss_pred HHHHHHHHHC-------CCCEEEECCEEEEH T ss_conf 9999999877-------99869864512110 No 49 >cd04478 RPA2_DBD_D RPA2_DBD_D: A subfamily of OB folds corresponding to the OB fold of the central ssDNA-binding domain (DBD)-D of human RPA2 (also called RPA32). RPA2 is a subunit of Replication protein A (RPA). RPA is a nuclear ssDNA-binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). The major DNA binding activity of RPA is associated with RPA1 DBD-A and DBD-B; RPA2 DBD-D is a weak ssDNA-binding domain. RPA2 DBD-D is also involved in trimerization. The ssDNA binding mechanism is believed to be multistep and to involve conformational change. N-terminal to human RPA2 DBD-D is a domain containing all the known phosphorylation sites of RPA. Human RPA2 is phosphorylated in a cell cycle dependent manner in response to DNA dam Probab=93.46 E-value=0.87 Score=27.16 Aligned_cols=79 Identities=22% Similarity=0.294 Sum_probs=60.2 Q ss_pred CEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCC---CCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEE Q ss_conf 189999970543568886279998748947999997352105---86681459889999996675288437999997101 Q gi|254780791|r 34 HVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNK---IEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLIP 110 (529) Q Consensus 34 ~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~---~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~ 110 (529) .|.+.|-|.+... .+..+=|+|-|..+.|.|..|-..... ..-..++|+-|-|.|.+..| .|.-++.+..|.| T Consensus 1 ~V~~VG~V~~v~~--~~t~~~y~idDgTG~i~~~~w~~~~~~~~~~~~~i~~g~YVrV~G~lk~f--~~~~~I~~~~irp 76 (95) T cd04478 1 QVTLVGVVRNVEE--QSTNITYTIDDGTGTIEVRQWLDDDNDDSSEVEPIEEGTYVRVFGNLKSF--QGKKSIMAFSIRP 76 (95) T ss_pred CEEEEEEEEEEEE--CCCEEEEEEECCCCCEEEEEECCCCCCCCCCCCCCCCCCEEEEEEEEEEE--CCEEEEEEEEEEE T ss_conf 9899999998777--25579999987987399999637787642224433269899999998002--8913799999999 Q ss_pred CC-CCHH Q ss_conf 68-0079 Q gi|254780791|r 111 SG-SGTL 116 (529) Q Consensus 111 ~g-~G~l 116 (529) .. .-++ T Consensus 77 v~d~NEi 83 (95) T cd04478 77 VTDFNEV 83 (95) T ss_pred ECCCCHH T ss_conf 3797577 No 50 >pfam03808 Glyco_tran_WecB Glycosyl transferase WecB/TagA/CpsF family. Probab=93.30 E-value=0.62 Score=28.50 Aligned_cols=93 Identities=20% Similarity=0.310 Sum_probs=66.9 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHH Q ss_conf 6528999847842589999998630597-589997210011110367999999997410035767775899951688844 Q gi|254780791|r 144 IPKIIAVITSPTGAVIRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIE 222 (529) Q Consensus 144 ~p~~i~vits~~~a~~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~e 222 (529) -..+|.++-+..+. +..+.+.++.+|| ++|.-+...--++....+|++.|... ++|+|+++=|..-.| T Consensus 47 ~~~~v~llG~~~~~-~~~~~~~l~~~yP~l~i~g~~~g~f~~~e~~~i~~~I~~~----------~~div~vglG~PkQE 115 (172) T pfam03808 47 RGKRVFLLGGKPGV-LEKAAARLRARYPGLRIVGTHDGYFSPEEEEAIIEAINAS----------GPDLLFVGLGAPKQE 115 (172) T ss_pred CCCEEEEEECCHHH-HHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHHHHHC----------CCCEEEEECCCCHHH T ss_conf 49838998088899-9999999998879955999879999868999999999845----------999999956981779 Q ss_pred HHHHCCHHHHHHHHHHCCCEEEEEECCCCCCH Q ss_conf 42200769999999748904885205777525 Q gi|254780791|r 223 DLWHFNDEMIVRAIANSSIPIISAIGHETDWT 254 (529) Q Consensus 223 DL~~FN~e~laraI~~~~iPVisgIGHE~D~T 254 (529) -+...|-.. .+.+|+-|||=-.|+. T Consensus 116 ~~~~~~~~~-------~~~~v~~~vGa~~d~~ 140 (172) T pfam03808 116 KWIARNRAR-------LPVPVFIGVGGSFDFL 140 (172) T ss_pred HHHHHHHHH-------CCCCEEEECCCHHHHH T ss_conf 999999987-------7998798524012466 No 51 >pfam02601 Exonuc_VII_L Exonuclease VII, large subunit. This family consist of exonuclease VII, large subunit EC:3.1.11.6 This enzyme catalyses exonucleolytic cleavage in either 5'-3' or 3'-5' direction to yield 5'-phosphomononucleotides. This exonuclease VII enzyme is composed of one large subunit and 4 small ones. Probab=93.22 E-value=0.94 Score=26.87 Aligned_cols=165 Identities=20% Similarity=0.228 Sum_probs=85.5 Q ss_pred CCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEE---EECCCCCHHHHHHCCHHHHHHHHHHCC------CE--EEE--EEC Q ss_conf 11110367999999997410035767775899---951688844422007699999997489------04--885--205 Q gi|254780791|r 182 QGDECPKEIANAILQLNTLKEGRTCPRPDIII---LARGGGSIEDLWHFNDEMIVRAIANSS------IP--IIS--AIG 248 (529) Q Consensus 182 QG~~a~~~i~~ai~~~~~~~~~~~~~~~D~ii---i~RGGGS~eDL~~FN~e~laraI~~~~------iP--Vis--gIG 248 (529) ..-.|-..|++-+.. +. +.+++++ .+=|-|+.. +++.+|.... -| ||- |=| T Consensus 23 ~~gAa~~Di~~~~~~---r~-----p~~~i~l~p~~VQG~~a~~--------~I~~ai~~~~~~~~~~~~Dviii~RGGG 86 (295) T pfam02601 23 ATGAAYQDFLRTARR---RG-----PLVEIEIYPTLVQGDGAAE--------SIVSALERANERETALDYDVIVIIRGGG 86 (295) T ss_pred CCCHHHHHHHHHHHH---HC-----CCCEEEEECCCCCCCHHHH--------HHHHHHHHHHHCCCCCCCCEEEEECCCC T ss_conf 940899999999998---19-----9967999473576504899--------9999999998468989983899957868 Q ss_pred CC------CCCHHHHHHHCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHH Q ss_conf 77------752589886412377721456763323467776699999888778998889999855677653049788986 Q gi|254780791|r 249 HE------TDWTLADYAADLRAPTPTGAAEMAVPVKEHLQSSLINLEARLNNIIIRLIKYKINTLNSLLKALPNSDQTLS 322 (529) Q Consensus 249 HE------~D~Tl~D~VAD~Ra~TPTaAAElavp~~~EL~~~L~~l~~RL~~a~~~~l~~~~q~L~~l~r~L~~~~~~L~ 322 (529) =- -|+.|+--|+.+.-|.=||.- .|.-..|.++-+.+...--... ...+ -.... T Consensus 87 S~eDL~~FN~e~laraI~~~~iPvisaIG-------HEtD~Tl~D~VAD~Ra~TPTaA----------Ae~i---vp~~~ 146 (295) T pfam02601 87 SKEDLWVFNDEELARAIANSPIPVITGIG-------HETDTTIADLVADVRAATPTAA----------AELL---VPDRT 146 (295) T ss_pred CHHHHHHCCHHHHHHHHHHCCCCEEECCC-------CCCCCCHHHHHHHHCCCCHHHH----------HHHH---CCCHH T ss_conf 88987455889999999838998780678-------8999618888875005898999----------9996---54499 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 7888788865456899999987568899999841011346889999999999999999988 Q gi|254780791|r 323 CSRYRLDRLPRELEHNLEIIIFRKYRHFNHIINRIKSDFLIHNIQKNRQHILEKQQHIEQI 383 (529) Q Consensus 323 ~~~QrLd~~~~rL~~~l~~~l~~~~~~l~~~~~rL~~~~~~~~l~~~r~~L~~~q~~L~q~ 383 (529) +..++|+....+|.+.+...+...++.++.+..++... +...++...+++.....++... T Consensus 147 ~l~~~l~~~~~~l~~~~~~~l~~~~~~L~~l~~~l~~~-~~~~l~~~~~~L~~~~~rl~~~ 206 (295) T pfam02601 147 ELLQKLEGLEQRLSRALKNRLEKEQDRLNLLRERLKSL-SRRKLEQHEERLAELYQRLESS 206 (295) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-CHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999999999999751478-2999999999999999999999 No 52 >PRK08744 consensus Probab=93.20 E-value=0.45 Score=29.68 Aligned_cols=44 Identities=11% Similarity=0.221 Sum_probs=19.1 Q ss_pred ECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEECCCCHHH Q ss_conf 7352105866814598899999966752884379999971016800799 Q gi|254780791|r 69 WKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLIPSGSGTLL 117 (529) Q Consensus 69 ~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~ 117 (529) |-..+.+...+|=-|.++.+. .+....|.|++..-...|-..|. T Consensus 67 Fy~aa~~~GIKPIiG~E~~v~-----~~~~~~~~l~LLAkN~~GY~NL~ 110 (1195) T PRK08744 67 FYKAAEGVGIKPIAGADVMIA-----TPDMTPWRMTLLCRDREGYLSLS 110 (1195) T ss_pred HHHHHHHCCCCEEEEEEEEEE-----CCCCCCCCEEEECCCHHHHHHHH T ss_conf 999999869982850089986-----58888776799808889999999 No 53 >PRK06203 aroB 3-dehydroquinate synthase; Reviewed Probab=93.16 E-value=0.36 Score=30.63 Aligned_cols=104 Identities=24% Similarity=0.314 Sum_probs=54.6 Q ss_pred CCCEEEEEECCCHHHHH-HHHHHHHH----CC-CEEEEEEECCCCCCCHHHHHHHHHHHH-HHHCCCCCCCCCCEEEEEC Q ss_conf 65289998478425899-99998630----59-758999721001111036799999999-7410035767775899951 Q gi|254780791|r 144 IPKIIAVITSPTGAVIR-DILQRISC----RF-PLRVIIFPVKVQGDECPKEIANAILQL-NTLKEGRTCPRPDIIILAR 216 (529) Q Consensus 144 ~p~~i~vits~~~a~~~-D~~~~~~~----r~-p~~~~~~p~~vQG~~a~~~i~~ai~~~-~~~~~~~~~~~~D~iii~R 216 (529) -++|+.|||-.+=+.+. |.+..+.. .. ++.+..-|..|-+-++.+.+...++.+ +.+.+ ...++ +-+||+= T Consensus 41 ~~~r~~ivtD~~V~~~y~~~~~~i~~y~~~~~~~~~l~~~~~v~p~GE~~K~~~~~~~~i~~~l~~-~~~~R-~~~viai 118 (390) T PRK06203 41 KRKKVLVVIDQGLLQFHPDLLEQIARYAQAHAEVLTLAAEPIIVPGGEAAKNDPQLLEQLLAAINE-AGIDR-HSYVLAI 118 (390) T ss_pred CCCEEEEEECCCHHHHHHHHHHHHHHHHHHCCCCCEECCCEEECCCCCCCCCHHHHHHHHHHHHHH-CCCCC-CCEEEEE T ss_conf 887799998987677655599999999984555420023347616971104779999999999997-59998-9759997 Q ss_pred CCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHH Q ss_conf 6888444220076999999974890488520577752589886 Q gi|254780791|r 217 GGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYA 259 (529) Q Consensus 217 GGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~V 259 (529) |||.+.|+..| |-++|.=-||.| -.-|||+-+| T Consensus 119 GGGvv~D~~Gf-----aAs~y~RGi~~i-----~iPTTlLa~v 151 (390) T PRK06203 119 GGGAVLDMVGY-----AAATAHRGVRLI-----RIPTTVLAQN 151 (390) T ss_pred CCHHHHHHHHH-----HHHHHHCCCCEE-----ECCCHHHHHH T ss_conf 68399789999-----999986399852-----1554698887 No 54 >PRK00448 polC DNA polymerase III PolC; Validated Probab=93.12 E-value=0.83 Score=27.38 Aligned_cols=79 Identities=16% Similarity=0.115 Sum_probs=56.0 Q ss_pred CCCCEEEEEEECCCCC-CCCCC-EEE-EEEECCCCEEEEEEECCCCCCCC--CCCCCCCEEEEEEEEEEECC-CCEEEEE Q ss_conf 0171899999705435-68886-279-99874894799999735210586--68145988999999667528-8437999 Q gi|254780791|r 31 NLSHVCVRGEISGYRG-IHSSG-HAY-FSLKDNHSRIDAIIWKGTLNKIE--FLPEEGIEFLVIGKITTFPG-SSKYQII 104 (529) Q Consensus 31 ~~~~~~v~gEis~~~~-~~~sG-H~Y-f~lkd~~a~i~~~~~~~~~~~~~--~~~~~G~~v~~~g~~~~y~~-~g~~ql~ 104 (529) .-.+|+|+|+|-+.-. .-+|| ++| |.+.|..++|.|-+|...-.... -.++.|+-|.++|++. |.. ...+.+. T Consensus 233 ~~~~v~v~G~vf~~e~~~~k~g~~i~~~~itD~t~si~~k~f~~~~~~~~~~~~i~~g~~v~v~g~~~-~d~~~~~~~~~ 311 (1436) T PRK00448 233 EERRVVVEGYVFKKEIKELKSGRHILTFKITDYTSSITVKKFLRDKEDLKKFDEIKKGDWVKVRGSVQ-NDTFTRDLVMN 311 (1436) T ss_pred CCCEEEEEEEEEEEEEEECCCCCEEEEEEEEECCCCEEEEEECCCCCHHHHHHHCCCCCEEEEEEEEE-ECCCCCCEEEE T ss_conf 25818999999866744315798799999982688689999447633278985167997899999985-12677721787 Q ss_pred EEEEEE Q ss_conf 997101 Q gi|254780791|r 105 IESLIP 110 (529) Q Consensus 105 v~~i~~ 110 (529) +.+|.. T Consensus 312 ~~~i~~ 317 (1436) T PRK00448 312 AKDIKE 317 (1436) T ss_pred HHHHHC T ss_conf 557110 No 55 >PRK11070 ssDNA exonuclease RecJ; Provisional Probab=93.06 E-value=0.98 Score=26.69 Aligned_cols=27 Identities=19% Similarity=0.364 Sum_probs=19.8 Q ss_pred ECCCCHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 168007999999999976540122610 Q gi|254780791|r 110 PSGSGTLLTALEKRKKKLLEEGLFSDQ 136 (529) Q Consensus 110 ~~g~G~l~~~~e~lk~~L~~eGlfd~~ 136 (529) .+|.|--+.-...+.++|...|+|+.. T Consensus 186 L~G~GVafkl~~al~~~l~~~~~~~~~ 212 (574) T PRK11070 186 LAGVGVAFYLMLALRTFLRDQGWFDER 212 (574) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCHHHC T ss_conf 550249999999999998763424441 No 56 >PRK08763 single-stranded DNA-binding protein; Provisional Probab=92.95 E-value=0.66 Score=28.23 Aligned_cols=80 Identities=16% Similarity=0.215 Sum_probs=57.6 Q ss_pred CCCCEEEEEEEC---CCCCCCCCCEEE--EEE------ECCC-------CEEEEEEECCCCCCCCCCCCCCCEEEEEEEE Q ss_conf 017189999970---543568886279--998------7489-------4799999735210586681459889999996 Q gi|254780791|r 31 NLSHVCVRGEIS---GYRGIHSSGHAY--FSL------KDNH-------SRIDAIIWKGTLNKIEFLPEEGIEFLVIGKI 92 (529) Q Consensus 31 ~~~~~~v~gEis---~~~~~~~sGH~Y--f~l------kd~~-------a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~ 92 (529) .+..|-+.|-|. .++ +..||+.| |+| ||.+ --++|++|...+..+.-.++.|.+|.|.|++ T Consensus 4 ~~NkV~LiG~l~~Dpelr-~t~sG~~v~~fslA~~~~~~~~~g~~~~~t~w~~vv~~gk~AE~~~~yl~KG~~V~VeGrL 82 (160) T PRK08763 4 GINKVILVGNLGNDPDIK-YTQSGMTITRISLATTSMRKDREGNTQERTEWHRVKFFGKLGEIAGEYLRKGSQCYIEGEI 82 (160) T ss_pred CCCEEEEEEECCCCCEEE-ECCCCCEEEEEEEEECCCEECCCCCCEECEEEEEEEEEHHHHHHHHHHHHCCCEEEEEEEE T ss_conf 861999987547597587-8899987999999977873568888252105999998368999999874179989998888 Q ss_pred E--EEC-----CCCEEEEEEEEEEEC Q ss_conf 6--752-----884379999971016 Q gi|254780791|r 93 T--TFP-----GSSKYQIIIESLIPS 111 (529) Q Consensus 93 ~--~y~-----~~g~~ql~v~~i~~~ 111 (529) . -|. .++...++|+.|+.. T Consensus 83 ~~~~y~dkdG~kr~~teIv~~~v~~l 108 (160) T PRK08763 83 RYDKFTGQDGQERYVTEIVADEMQML 108 (160) T ss_pred EEEEEECCCCCEEEEEEEEEEEEEEC T ss_conf 88551758998899999999888981 No 57 >cd04496 SSB_OBF SSB_OBF: A subfamily of OB folds similar to the OB fold of ssDNA-binding protein (SSB). SSBs bind with high affinity to ssDNA. They bind to and protect ssDNA intermediates during DNA metabolic pathways. All bacterial and eukaryotic SSBs studied to date oligomerize to bring together four OB folds in their active state. The majority (e.g. Escherichia coli SSB) have a single OB fold per monomer, which oligomerize to form a homotetramer. However, Deinococcus and Thermus SSB proteins have two OB folds per monomer, which oligomerize to form a homodimer. Mycobacterium tuberculosis SSB varies in quaternary structure from E. coli SSB. It forms a dimer of dimers having a unique dimer interface, which lends the protein greater stability. Included in this group are OB folds similar to Escherichia coli PriB. E.coli PriB is homodimeric with each monomer having a single OB fold. It does not appear to form higher order oligomers. PriB is an essential protein for the replication restart Probab=92.90 E-value=0.95 Score=26.85 Aligned_cols=49 Identities=8% Similarity=0.137 Sum_probs=38.6 Q ss_pred CEEEEEEECCCCCCCCCCCCCCCEEEEEEEEE--EECC-----CCEEEEEEEEEEE Q ss_conf 47999997352105866814598899999966--7528-----8437999997101 Q gi|254780791|r 62 SRIDAIIWKGTLNKIEFLPEEGIEFLVIGKIT--TFPG-----SSKYQIIIESLIP 110 (529) Q Consensus 62 a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~--~y~~-----~g~~ql~v~~i~~ 110 (529) .-++|++|...+..+.-.++.|+.|.|.|++. .|.. +..+.+.+++|.+ T Consensus 44 ~~~~v~~wg~~A~~~~~~l~KG~~V~V~G~l~~~~~~~~~g~~~~~~~i~a~~i~~ 99 (100) T cd04496 44 DWIRVVAFGKLAENAAKYLKKGDLVYVEGRLRTRSWEDKDGQKRYGTEVVADRIEF 99 (100) T ss_pred EEEEEEEECHHHHHHHHHCCCCCEEEEEEEEECCEEECCCCCEEEEEEEEEEEEEE T ss_conf 99999999708987787537998899999988123799999899999999999994 No 58 >cd04316 ND_PkAspRS_like_N ND_PkAspRS_like_N: N-terminal, anticodon recognition domain of the type found in the homodimeric non-discriminating (ND) Pyrococcus kodakaraensis aspartyl-tRNA synthetase (AspRS). This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. P. kodakaraensis AspRS is a class 2b aaRS. aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. P. kodakaraensis ND-AspRS can charge both tRNAAsp and tRNAAsn. Some of the enzymes in this group may be discriminating, based on the presence of homologs of asparaginyl-tRNA synthetase (AsnRS) in their completed genomes. Probab=92.77 E-value=1.1 Score=26.38 Aligned_cols=77 Identities=19% Similarity=0.194 Sum_probs=58.4 Q ss_pred CCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCC----CCCCCCEEEEEEEEEEECCC--CEEEEEEE Q ss_conf 7189999970543568886279998748947999997352105866----81459889999996675288--43799999 Q gi|254780791|r 33 SHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEF----LPEEGIEFLVIGKITTFPGS--SKYQIIIE 106 (529) Q Consensus 33 ~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~----~~~~G~~v~~~g~~~~y~~~--g~~ql~v~ 106 (529) ..|+|.|=|-+.|. .++=+++.|.|..+.+.|++-+.....--| .+..|.-|.+.|.|. ..|+ |+|-+.++ T Consensus 13 ~~V~i~Gwv~~~R~--~g~~~Fi~LRD~~G~vQ~v~~~~~~~~~~~~~~~~l~~EsvV~V~G~V~-~~~~~~~giEi~v~ 89 (108) T cd04316 13 EEVTVAGWVHEIRD--LGGIKFVILRDREGIVQVTAPKKKVDKELFKTVRKLSRESVISVTGTVK-AEPKAPNGVEIIPE 89 (108) T ss_pred CEEEEEEEEEEEEC--CCCEEEEEEECCCEEEEEEEECCCCCHHHHHHHHCCCCCEEEEEEEEEE-ECCCCCCCEEEEEE T ss_conf 99999899982321--7987999998098779999937888989999986599944999999999-48989996758964 Q ss_pred EEEECC Q ss_conf 710168 Q gi|254780791|r 107 SLIPSG 112 (529) Q Consensus 107 ~i~~~g 112 (529) .++.-+ T Consensus 90 ~i~ils 95 (108) T cd04316 90 EIEVLS 95 (108) T ss_pred EEEEEE T ss_conf 899985 No 59 >KOG3108 consensus Probab=92.46 E-value=1.2 Score=26.06 Aligned_cols=97 Identities=20% Similarity=0.211 Sum_probs=71.1 Q ss_pred CCCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCC--CCCCCCEEEEEEEEEEECCCCEEEEEEEEEE Q ss_conf 17189999970543568886279998748947999997352105866--8145988999999667528843799999710 Q gi|254780791|r 32 LSHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEF--LPEEGIEFLVIGKITTFPGSSKYQIIIESLI 109 (529) Q Consensus 32 ~~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~--~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~ 109 (529) +.++.+.|=|.+... +.-+.-|+|=|..++|.|..|........+ .++.|.-|.+.|.+. .+.|.-+|.+..|. T Consensus 68 v~~v~~VGivr~~e~--~~t~i~y~I~D~tg~id~r~W~~~~~~~~e~~~l~~~~yVkv~G~Lk--~f~Gk~sl~~fkI~ 143 (265) T KOG3108 68 VSAVSIVGIVRNIEK--SATNITYEIEDGTGQIDVRQWFHDNAESEEMPALETGTYVKVYGHLK--PFQGKKSLQVFKIR 143 (265) T ss_pred EEEEEEEEEEEECEE--CCCCEEEEEECCCCEEEEEECCCCCCHHHHCCCCCCCCEEEEECCCC--CCCCCEEEEEEEEE T ss_conf 888899999970202--67614999826865078887035530222174124475799640104--88995267787655 Q ss_pred ECC-CCHHH-HHHHHHHHHHHHHHC Q ss_conf 168-00799-999999997654012 Q gi|254780791|r 110 PSG-SGTLL-TALEKRKKKLLEEGL 132 (529) Q Consensus 110 ~~g-~G~l~-~~~e~lk~~L~~eGl 132 (529) |.- .-+.. ..+|-+...|..... T Consensus 144 pv~D~Nevt~h~LE~i~~hl~~s~~ 168 (265) T KOG3108 144 PVEDFNEVTTHFLEVINAHLSLSKS 168 (265) T ss_pred EEECCCCEEEEEEHHHHHHHHHHHC T ss_conf 5403772257731457899887543 No 60 >COG3264 Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane] Probab=92.40 E-value=1.2 Score=26.01 Aligned_cols=20 Identities=25% Similarity=0.134 Sum_probs=9.2 Q ss_pred HHHHHCCCCCCCHHHHHHHC Q ss_conf 98864123777214567633 Q gi|254780791|r 256 ADYAADLRAPTPTGAAEMAV 275 (529) Q Consensus 256 ~D~VAD~Ra~TPTaAAElav 275 (529) -+.+.-.|++.+..|+|... T Consensus 35 q~~~~~~~~~~~~~a~~~~~ 54 (835) T COG3264 35 QEAVNSKRQEEAEPAAEEAE 54 (835) T ss_pred HHHHHHHCCCCCCCCHHHCC T ss_conf 98754313241213211113 No 61 >PRK06751 single-stranded DNA-binding protein; Provisional Probab=92.40 E-value=0.77 Score=27.65 Aligned_cols=76 Identities=21% Similarity=0.366 Sum_probs=52.7 Q ss_pred CEEEEEEEC---CCCCCCCCCEE--EEEE------ECCC-----CEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEE--E Q ss_conf 189999970---54356888627--9998------7489-----479999973521058668145988999999667--5 Q gi|254780791|r 34 HVCVRGEIS---GYRGIHSSGHA--YFSL------KDNH-----SRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITT--F 95 (529) Q Consensus 34 ~~~v~gEis---~~~~~~~sGH~--Yf~l------kd~~-----a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~--y 95 (529) .|-+.|-|. .++ +-.+|.. -|+| +|.+ --++||+|+..+..+.-.++.|++|.|.|++.. | T Consensus 3 kV~LiGrL~~DPElr-~t~~G~~v~~f~lAv~r~~~~~~g~~~td~~~vv~wgk~AE~~~~yl~KG~~V~VeGrl~~r~y 81 (172) T PRK06751 3 RVILVGRLTKDPDLR-YTPNGVAVATFTLAVNRAFANQQGEREADFINCVIWRKQAENVANYLKKGSLAGVDGRLQTRNY 81 (172) T ss_pred EEEEEEECCCCCEEE-ECCCCCEEEEEEEEECCCEECCCCCEEEEEEEEEEECHHHHHHHHHHCCCCEEEEEEEEECCCC T ss_conf 799998768787588-8899987999999966760267897677899999728889999987578998999998624766 Q ss_pred C-----CCCEEEEEEEEEEE Q ss_conf 2-----88437999997101 Q gi|254780791|r 96 P-----GSSKYQIIIESLIP 110 (529) Q Consensus 96 ~-----~~g~~ql~v~~i~~ 110 (529) + .++...++|+.|.. T Consensus 82 edkdG~~r~~teIv~~~~~~ 101 (172) T PRK06751 82 EGQDGKRVYVTEVLAESVQF 101 (172) T ss_pred CCCCCCEEEEEEEEEEEEEE T ss_conf 87899889999999966897 No 62 >PRK05159 aspC aspartyl-tRNA synthetase; Provisional Probab=92.01 E-value=1.1 Score=26.16 Aligned_cols=77 Identities=14% Similarity=0.135 Sum_probs=57.3 Q ss_pred CCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCC----CCCCCCCCEEEEEEEEEEECCC--CEEEEEEE Q ss_conf 71899999705435688862799987489479999973521058----6681459889999996675288--43799999 Q gi|254780791|r 33 SHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKI----EFLPEEGIEFLVIGKITTFPGS--SKYQIIIE 106 (529) Q Consensus 33 ~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~----~~~~~~G~~v~~~g~~~~y~~~--g~~ql~v~ 106 (529) ..|.|.|=|.+.|. + ++=++++|.|..+.|.|++=+.....- --.+..|+-|.|.|.|. ..++ +++.+.|+ T Consensus 17 ~~V~v~G~v~~~R~-~-Gkl~Fi~LrD~~g~iQ~~~~~~~~~~~~~~~~~~l~~g~~v~v~G~v~-~~~~~~~~~ev~~~ 93 (434) T PRK05159 17 KEVTLAGWVHEIRD-L-GGIKFLLLRDRTGIIQVVVPKKKVDEELFEGIKKLSRESVVSVTGVVK-ESEKAPNGVEVQPE 93 (434) T ss_pred CEEEEEEEEEEEEC-C-CCCEEEEEEECCCCEEEEEECCCCCHHHHHHHHCCCCCEEEEEEEEEE-CCCCCCCCEEEEEE T ss_conf 99999898872881-7-985999999698028999979989989999985799962999999998-58999975899998 Q ss_pred EEEECC Q ss_conf 710168 Q gi|254780791|r 107 SLIPSG 112 (529) Q Consensus 107 ~i~~~g 112 (529) .++.-+ T Consensus 94 ~i~ils 99 (434) T PRK05159 94 EIEVLN 99 (434) T ss_pred EEEEEE T ss_conf 999983 No 63 >TIGR03405 Phn_Fe-ADH phosphonate metabolism-associated iron-containing alcohol dehydrogenase. 2-hydroxyethylphosphonate (2-HEP), the presumed product of the reaction of Pald with an alcohol dehydrogenase, is a biologically novel but reasonable analog of 2-AEP and may be a constituent of as-yet undescribed natural products. In the case of Azoarcus, downstream of the dehydrogenase is a CDP-glycerol:glycerophosphate transferase homolog that may indicate the existence of a pathway for 2-HEP-derived phosphonolipid biosynthesis. Probab=91.80 E-value=1.3 Score=25.47 Aligned_cols=70 Identities=19% Similarity=0.237 Sum_probs=43.2 Q ss_pred CCEEEEEECCCHHHH---HHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH Q ss_conf 528999847842589---99999863059758999721001111036799999999741003576777589995168884 Q gi|254780791|r 145 PKIIAVITSPTGAVI---RDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI 221 (529) Q Consensus 145 p~~i~vits~~~a~~---~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~ 221 (529) -+|+-|||.++.... ..+...+.. ..+.+| .-|.-+-....+-++++.+.... ..+| .||+=||||. T Consensus 23 ~k~~llVt~~~~~~~g~~~~v~~~L~~---~~~~vf-~~V~pnP~~~~v~~~~~~~~~~e-----~~~D-~IiavGGGS~ 92 (355) T TIGR03405 23 GRRVVVVTFPEARALGLARRLEALLGG---RLAALI-DDVAPNPDVAQLDGLYARLWGDE-----GACD-LVIALGGGSV 92 (355) T ss_pred CCEEEEEECCCHHHCCHHHHHHHHHCC---CCEEEE-ECCCCCCCHHHHHHHHHHHHHHC-----CCCC-EEEECCCCCH T ss_conf 984999979636554399999998189---955999-27468949899999999998423-----6998-8998078518 Q ss_pred HHH Q ss_conf 442 Q gi|254780791|r 222 EDL 224 (529) Q Consensus 222 eDL 224 (529) -|. T Consensus 93 iD~ 95 (355) T TIGR03405 93 IDT 95 (355) T ss_pred HHH T ss_conf 788 No 64 >KOG0994 consensus Probab=91.59 E-value=1.4 Score=25.30 Aligned_cols=16 Identities=13% Similarity=0.318 Sum_probs=10.9 Q ss_pred CHHHHHHHHHHCCCEE Q ss_conf 7699999997489048 Q gi|254780791|r 228 NDEMIVRAIANSSIPI 243 (529) Q Consensus 228 N~e~laraI~~~~iPV 243 (529) --|+||..|....||. T Consensus 1493 si~~vA~~vL~l~lp~ 1508 (1758) T KOG0994 1493 SIEEVAEEVLALELPL 1508 (1758) T ss_pred HHHHHHHHHHHCCCCC T ss_conf 9999999998532789 No 65 >PRK10624 L-1,2-propanediol oxidoreductase; Provisional Probab=91.51 E-value=0.99 Score=26.65 Aligned_cols=69 Identities=19% Similarity=0.276 Sum_probs=46.1 Q ss_pred CEEEEEECCCHH---HHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHH Q ss_conf 289998478425---89999998630597589997210011110367999999997410035767775899951688844 Q gi|254780791|r 146 KIIAVITSPTGA---VIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIE 222 (529) Q Consensus 146 ~~i~vits~~~a---~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~e 222 (529) +++-|||.++-. .+.+++..|.+ ..+.+.+| .-|..+-....|-++++.+...+ +|+| |+=||||.- T Consensus 30 k~~lvvtd~~~~k~g~~~~v~~~L~~-~~i~~~vf-~~v~pnP~~~~V~~~~~~~r~~~-------~D~I-iavGGGSvi 99 (381) T PRK10624 30 HKALIVTDKTLVQCGVVAKVTDKLDA-AGLAYEIY-DGVKPNPTISVVKEGLEVFQASG-------ADYL-IAIGGGSPQ 99 (381) T ss_pred CEEEEEECCCHHHCCHHHHHHHHHHH-CCCEEEEE-CCCCCCCCHHHHHHHHHHHHHCC-------CCEE-EEECCCHHH T ss_conf 87999969654556369999999987-69839998-89258989999999999998649-------9989-980894088 Q ss_pred HH Q ss_conf 42 Q gi|254780791|r 223 DL 224 (529) Q Consensus 223 DL 224 (529) |. T Consensus 100 D~ 101 (381) T PRK10624 100 DT 101 (381) T ss_pred HH T ss_conf 89 No 66 >PRK07274 single-stranded DNA-binding protein; Provisional Probab=91.50 E-value=1.4 Score=25.34 Aligned_cols=79 Identities=11% Similarity=0.267 Sum_probs=53.4 Q ss_pred CCCEEEEEEEC---CCCCCCCCCEEE--EEE------ECCC-----CEEEEEEECCCCCCCCCCCCCCCEEEEEEEEE-- Q ss_conf 17189999970---543568886279--998------7489-----47999997352105866814598899999966-- Q gi|254780791|r 32 LSHVCVRGEIS---GYRGIHSSGHAY--FSL------KDNH-----SRIDAIIWKGTLNKIEFLPEEGIEFLVIGKIT-- 93 (529) Q Consensus 32 ~~~~~v~gEis---~~~~~~~sGH~Y--f~l------kd~~-----a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~-- 93 (529) |..|-+.|-+. .++ +-.+|+.+ |+| |+.+ --+.|++|...+..+.-.++-|++|.|.|++. T Consensus 2 mN~V~LiGrL~~Dpel~-~t~~G~~v~~fslAv~r~~k~~~ge~~td~~~vv~wgk~Ae~~~~~~~KG~~V~V~G~L~~~ 80 (131) T PRK07274 2 YNKVILIGRLTAKPELV-KTATDKSVARVTLAVNRRFKNQNGEREADFINVVVWGKLAETLVSYASKGSLISIDGELRTR 80 (131) T ss_pred CCEEEEEEECCCCCEEE-ECCCCCEEEEEEEEEECCEECCCCCEEEEEEEEEECCHHHHHHHHHCCCCCEEEEEEEEECC T ss_conf 64699999777797699-98999889999998713068788978877999998284799998751699999999998874 Q ss_pred EECCCC----EEEEEEEEEEEC Q ss_conf 752884----379999971016 Q gi|254780791|r 94 TFPGSS----KYQIIIESLIPS 111 (529) Q Consensus 94 ~y~~~g----~~ql~v~~i~~~ 111 (529) -|...| ...++|+.++.- T Consensus 81 ~ydkdG~~~~~~eI~v~~~~~L 102 (131) T PRK07274 81 KYDKDGQTHYVTEVLCQSFQLL 102 (131) T ss_pred CCCCCCCEEEEEEEEEEEEEEC T ss_conf 8724997999999999789974 No 67 >PRK06495 enoyl-CoA hydratase; Provisional Probab=91.42 E-value=0.61 Score=28.58 Aligned_cols=76 Identities=20% Similarity=0.316 Sum_probs=43.6 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC-----------------HHHHHHCCH--HHHHHHHHHCCCEEEEEE Q ss_conf 3679999999974100357677758999516888-----------------444220076--999999974890488520 Q gi|254780791|r 187 PKEIANAILQLNTLKEGRTCPRPDIIILARGGGS-----------------IEDLWHFND--EMIVRAIANSSIPIISAI 247 (529) Q Consensus 187 ~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS-----------------~eDL~~FN~--e~laraI~~~~iPVisgI 247 (529) -.++..++..++... .+-+||| +|.|. ..+.|.++. ..+..+|..|+.|||++| T Consensus 32 ~~el~~al~~~~~d~------~vr~vvl-~g~G~~F~aG~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~kPvIAav 104 (257) T PRK06495 32 RDELIAVFDSLNDRP------DVRVVVL-TGAGKVFCAGADLKNRPDLESGPGALWAHNRRTREVFYCIRECAKPVIAAV 104 (257) T ss_pred HHHHHHHHHHHHHCC------CCEEEEE-ECCCCCEECCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 999999999997299------9169999-767981276887565146556703589999999999999995899899997 Q ss_pred -CCCCCC-HHHHHHHCCCCCCCHH Q ss_conf -577752-5898864123777214 Q gi|254780791|r 248 -GHETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 248 -GHE~D~-Tl~D~VAD~Ra~TPTa 269 (529) ||=.-- .-.=+.+|+|..++.+ T Consensus 105 ~G~a~GgG~~lala~D~~ia~~~a 128 (257) T PRK06495 105 NGPALGAGLGLVASCDIIVASENA 128 (257) T ss_pred CCEEECCCHHHHHHHHHHEECCCC T ss_conf 137741722566543241004788 No 68 >cd04322 LysRS_N LysRS_N: N-terminal, anticodon recognition domain of lysyl-tRNA synthetases (LysRS). These enzymes are homodimeric class 2b aminoacyl-tRNA synthetases (aaRSs). This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Included in this group are E. coli LysS and LysU. These two isoforms of LysRS are encoded by distinct genes which are differently regulated. Eukaryotes contain 2 sets of aaRSs, both of which encoded by the nuclear genome. One set concerns with cytoplasmic protein synthesis, whereas the other exclusively with mitochondrial protein Probab=91.41 E-value=1.5 Score=25.15 Aligned_cols=73 Identities=22% Similarity=0.234 Sum_probs=55.1 Q ss_pred EEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCC-----CCCCCCCEEEEEEEEEEECCCCEEEEEEEEEE Q ss_conf 8999997054356888627999874894799999735210586-----68145988999999667528843799999710 Q gi|254780791|r 35 VCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIE-----FLPEEGIEFLVIGKITTFPGSSKYQIIIESLI 109 (529) Q Consensus 35 ~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~-----~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~ 109 (529) |.|.|=|.+.+ .+ +|=++++|.|..+.+.+++=+.....-. -.++-|+-|-|.|.+. ..+.|.+++.++.++ T Consensus 2 V~v~GrV~~~R-~~-G~l~FidLrD~~g~iQv~~~~~~~~~~~~~~~~~~~~~gDvv~v~G~v~-~t~tGel~v~~~~~~ 78 (108) T cd04322 2 VSVAGRIMSKR-GS-GKLSFADLQDESGKIQVYVNKDDLGEEEFEDFKKLLDLGDIIGVTGTPF-KTKTGELSIFVKEFT 78 (108) T ss_pred EEEEEEEEEEE-CC-CCCEEEEEEECCCCEEEEEECCCCCHHHHHHHHHCCCCCEEEEEEEEEE-CCCCCEEEEEEEEHH T ss_conf 89999998470-67-9829999996996189999756477567777763444344999997875-179934999964912 Q ss_pred E Q ss_conf 1 Q gi|254780791|r 110 P 110 (529) Q Consensus 110 ~ 110 (529) . T Consensus 79 l 79 (108) T cd04322 79 L 79 (108) T ss_pred C T ss_conf 0 No 69 >cd04100 Asp_Lys_Asn_RS_N Asp_Lys_Asn_RS_N: N-terminal, anticodon recognition domain of class 2b aminoacyl-tRNA synthetases (aaRSs). This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. Class 2b aaRSs include the homodimeric aspartyl-, asparaginyl-, and lysyl-tRNA synthetases (AspRS, AsnRS, and LysRS). aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Eukaryotes contain 2 sets of aaRSs, both of which are encoded by the nuclear genome. One set concerns with cytoplasmic protein synthesis, whereas the other exclusively with mitochondrial protein synthesis. Included in this group are archeal and archeal-like A Probab=91.40 E-value=1.5 Score=25.15 Aligned_cols=76 Identities=17% Similarity=0.228 Sum_probs=57.3 Q ss_pred CEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCC---CCCCCCCCCEEEEEEEEEEEC----CCCEEEEEEE Q ss_conf 189999970543568886279998748947999997352105---866814598899999966752----8843799999 Q gi|254780791|r 34 HVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNK---IEFLPEEGIEFLVIGKITTFP----GSSKYQIIIE 106 (529) Q Consensus 34 ~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~---~~~~~~~G~~v~~~g~~~~y~----~~g~~ql~v~ 106 (529) .|+|.|=|.+.|. .++-.+++|.|..+.+.|++-...... .--.+..|+-|.+.|.+.-=+ +.|.|-+.++ T Consensus 1 ~V~v~Gwv~~~R~--~g~~~Fi~lrD~~g~iQvv~~~~~~~~~~~~~~~l~~es~I~v~G~v~~~~~~~~~~g~~Ei~~~ 78 (85) T cd04100 1 EVTLAGWVHSRRD--HGGLIFIDLRDGSGIVQVVVNKEELGEFFEEAEKLRTESVVGVTGTVVKRPEGNLATGEIELQAE 78 (85) T ss_pred CEEEEEEEEEEEC--CCCEEEEEEECCCCEEEEEECCCCCHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCEEEEEE T ss_conf 9999998790432--89889958503895089999177585788776358988899999999968877789988999993 Q ss_pred EEEEC Q ss_conf 71016 Q gi|254780791|r 107 SLIPS 111 (529) Q Consensus 107 ~i~~~ 111 (529) .++.- T Consensus 79 ~i~il 83 (85) T cd04100 79 ELEVL 83 (85) T ss_pred EEEEE T ss_conf 99998 No 70 >PRK12366 replication factor A; Reviewed Probab=91.39 E-value=1.5 Score=25.14 Aligned_cols=72 Identities=18% Similarity=0.262 Sum_probs=50.4 Q ss_pred EEEEEECCCCCC-------CCCCEE-EEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEE Q ss_conf 999997054356-------888627-999874894799999735210586681459889999996675288437999997 Q gi|254780791|r 36 CVRGEISGYRGI-------HSSGHA-YFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIES 107 (529) Q Consensus 36 ~v~gEis~~~~~-------~~sGH~-Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~ 107 (529) .|.|+|...-.. -+.|++ -|.|+|+..+|+.++|-..+ .+..+.|+-|.+.|.+.= .=+|+..+.+.. T Consensus 188 ti~G~V~~~~~irtF~r~DGS~G~v~sl~l~DeTGsIRvTLWdd~a---d~~v~kGD~V~V~Gy~~e-gy~g~lel~~~n 263 (649) T PRK12366 188 TINGEITKAYPKKEFTKKDGTEGQLKSFIVKDETGSIRVTLWNELT---DIEVKKGDIVEVSGYVKQ-GYSGNTELSVDN 263 (649) T ss_pred EEEEEEEECCCCEEEECCCCCCCEEEEEEEECCCCCEEEEEECCCC---CCCCCCCCEEEEEEEECC-CCCCCEEEEECC T ss_conf 6999996436863532478986467678997788858999856766---543357989999888725-632404888753 Q ss_pred EEEC Q ss_conf 1016 Q gi|254780791|r 108 LIPS 111 (529) Q Consensus 108 i~~~ 111 (529) +..- T Consensus 264 ~~I~ 267 (649) T PRK12366 264 IGIV 267 (649) T ss_pred EEEC T ss_conf 7850 No 71 >PRK05995 enoyl-CoA hydratase; Provisional Probab=91.38 E-value=0.66 Score=28.26 Aligned_cols=77 Identities=19% Similarity=0.191 Sum_probs=40.7 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCC-CH---HHHH--------------HC--CHHHHHHHHHHCCCEEEE Q ss_conf 0367999999997410035767775899951688-84---4422--------------00--769999999748904885 Q gi|254780791|r 186 CPKEIANAILQLNTLKEGRTCPRPDIIILARGGG-SI---EDLW--------------HF--NDEMIVRAIANSSIPIIS 245 (529) Q Consensus 186 a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGG-S~---eDL~--------------~F--N~e~laraI~~~~iPVis 245 (529) -..++..++..++... .+.|||| +|-| ++ -||- .+ .-..+.+.|..||.|||+ T Consensus 32 m~~el~~al~~~~~d~------~vrvvvl-~g~G~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~kPvIa 104 (261) T PRK05995 32 VIAELTAAFRALDADD------SVRAVVL-AGAGKAFCAGADLNWMKKMAGYSDDENRADARRLADMLRAIYRCPKPVIA 104 (261) T ss_pred HHHHHHHHHHHHHHCC------CCEEEEE-ECCCCCEECCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEE T ss_conf 9999999999997299------9369999-78998436576779887520378235566799999999999968998999 Q ss_pred EECC-CCC-CHHHHHHHCCCCCCCHH Q ss_conf 2057-775-25898864123777214 Q gi|254780791|r 246 AIGH-ETD-WTLADYAADLRAPTPTG 269 (529) Q Consensus 246 gIGH-E~D-~Tl~D~VAD~Ra~TPTa 269 (529) +|=- =.= =.-+=+.+|+|..++++ T Consensus 105 ai~G~a~GgG~~Lal~~D~~ia~~~a 130 (261) T PRK05995 105 RVHGDAYAGGMGLVAACDIAVAADHA 130 (261) T ss_pred EECCEEEECCHHHHHHCCEEECCCCC T ss_conf 97898860417776516653033885 No 72 >COG1200 RecG RecG-like helicase [DNA replication, recombination, and repair / Transcription] Probab=91.27 E-value=1.5 Score=25.05 Aligned_cols=94 Identities=16% Similarity=0.129 Sum_probs=60.9 Q ss_pred CCCHHHHHHHHHHHHHH-----C---C---CCEEEEEEECCCCCCCCCC--EEEEEEECCCCEEEEEEECCCCCCCCCCC Q ss_conf 62299999999999740-----0---1---7189999970543568886--27999874894799999735210586681 Q gi|254780791|r 14 EYSVSELSYHLKHIVES-----N---L---SHVCVRGEISGYRGIHSSG--HAYFSLKDNHSRIDAIIWKGTLNKIEFLP 80 (529) Q Consensus 14 ~~svs~l~~~i~~~l~~-----~---~---~~~~v~gEis~~~~~~~sG--H~Yf~lkd~~a~i~~~~~~~~~~~~~~~~ 80 (529) .+||.+|..+.=.-.|. . . ..+-|+|+|-.....+.++ -+=|.+.|....+.+++|...+ .+...+ T Consensus 31 i~tv~DLL~~~P~~YeD~~~~~~i~~~~~g~~vti~g~V~~~~~~~~~~~~~l~v~~~d~~~~l~l~fFn~~~-~l~~~~ 109 (677) T COG1200 31 IHTVQDLLLYLPRRYEDRTLLPGIAEARPGEIVTIEGTVLSHEKFPFGKRKLLKVTLSDGTGVLTLVFFNFPA-YLKKKL 109 (677) T ss_pred CCCHHHHHHHCCCCHHHCCCCCCHHHCCCCCEEEEEEEEEEEECCCCCCCCEEEEEEECCCEEEEEEEECCCH-HHHHHC T ss_conf 8759999986862142146567843537786699999997640257788734999996296899999978668-888418 Q ss_pred CCCCEEEEEEEEEEECCCCEEEEEEEEEEE Q ss_conf 459889999996675288437999997101 Q gi|254780791|r 81 EEGIEFLVIGKITTFPGSSKYQIIIESLIP 110 (529) Q Consensus 81 ~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~ 110 (529) +.|..|.+.|++.. -+|++|+.--++.. T Consensus 110 ~~G~~v~v~Gk~~~--~~~~~~~~hpe~~~ 137 (677) T COG1200 110 KVGERVIVYGKVKR--FKGGLQITHPEYIV 137 (677) T ss_pred CCCCEEEEEEEEEE--CCCCEEEECCEEEE T ss_conf 79988999998950--36835887666783 No 73 >cd04317 EcAspRS_like_N EcAspRS_like_N: N-terminal, anticodon recognition domain of the type found in Escherichia coli aspartyl-tRNA synthetase (AspRS), the human mitochondrial (mt) AspRS-2, the discriminating (D) Thermus thermophilus AspRS-1, and the nondiscriminating (ND) Helicobacter pylori AspRS. These homodimeric enzymes are class2b aminoacyl-tRNA synthetases (aaRSs). This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Eukaryotes contain 2 sets of aaRSs, both of which are encoded by the nuclear genome. One set concerns with cytoplasmic synthesis, wh Probab=91.00 E-value=1.6 Score=24.85 Aligned_cols=77 Identities=14% Similarity=0.167 Sum_probs=55.4 Q ss_pred CCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCC--CCCCCCCEEEEEEEEEE--------ECCCCEEE Q ss_conf 718999997054356888627999874894799999735210586--68145988999999667--------52884379 Q gi|254780791|r 33 SHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIE--FLPEEGIEFLVIGKITT--------FPGSSKYQ 102 (529) Q Consensus 33 ~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~--~~~~~G~~v~~~g~~~~--------y~~~g~~q 102 (529) ..|+|.|=|.+.+. .++=.++.|+|..+.|.|++=+....... -.+..+.-|.|.|.|.. -.+.|.+. T Consensus 15 ~~V~v~GwV~~~R~--~gki~Fi~LRD~tg~iQ~v~~~~~~~~~~~~~~l~~EsvV~V~G~V~~r~~~~~n~~~~tGeiE 92 (135) T cd04317 15 QEVTLCGWVQRRRD--HGGLIFIDLRDRYGIVQVVFDPEEAPEFELAEKLRNESVIQVTGKVRARPEGTVNPKLPTGEIE 92 (135) T ss_pred CEEEEEEEEEEEEE--CCCEEEEEEECCCCEEEEEECCCCCHHHHHHHHCCCCCEEEEEEEEEECCCCCCCCCCCCCCEE T ss_conf 99999999999971--7979999985288358899957651678999718976799999999967976605665689779 Q ss_pred EEEEEEEEC Q ss_conf 999971016 Q gi|254780791|r 103 IIIESLIPS 111 (529) Q Consensus 103 l~v~~i~~~ 111 (529) +.|+.++.- T Consensus 93 i~v~~i~il 101 (135) T cd04317 93 VVASELEVL 101 (135) T ss_pred EEEEEEEEE T ss_conf 999799997 No 74 >PRK09076 enoyl-CoA hydratase; Provisional Probab=90.95 E-value=0.69 Score=28.06 Aligned_cols=75 Identities=24% Similarity=0.275 Sum_probs=41.5 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH-----HHHHHCCH-------------HHHHHHHHHCCCEEEEEE- Q ss_conf 36799999999741003576777589995168884-----44220076-------------999999974890488520- Q gi|254780791|r 187 PKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI-----EDLWHFND-------------EMIVRAIANSSIPIISAI- 247 (529) Q Consensus 187 ~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~-----eDL~~FN~-------------e~laraI~~~~iPVisgI- 247 (529) -.++..+|..++... .+. +||.||-|.. -||-.|.+ ..+.+++.+|+.|||++| T Consensus 31 ~~~l~~~l~~~~~d~------~vr-vvvl~g~G~~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~pvIaav~ 103 (258) T PRK09076 31 LQALKQLVLELNANK------DVY-ALVITGDGEKFFSAGADLNLFADGDKAVAREMARRFGEAFEALSAFRGVSIAAIN 103 (258) T ss_pred HHHHHHHHHHHHHCC------CCE-EEEEECCCCCCEECCCCHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEEC T ss_conf 999999999996299------955-9999658998640670277660478445899999999999999958999999989 Q ss_pred CCCC--CCHHHHHHHCCCCCCCHH Q ss_conf 5777--525898864123777214 Q gi|254780791|r 248 GHET--DWTLADYAADLRAPTPTG 269 (529) Q Consensus 248 GHE~--D~Tl~D~VAD~Ra~TPTa 269 (529) ||-. =..| =+.+|+|..++.+ T Consensus 104 G~a~GgG~~l-al~cD~ria~~~a 126 (258) T PRK09076 104 GYAMGGGLEC-ALACDIRIAEEQA 126 (258) T ss_pred CEEEEHHHHH-HHHCCEEEECCCC T ss_conf 9076434999-8606755431565 No 75 >PRK04036 DNA polymerase II small subunit; Validated Probab=90.88 E-value=1.6 Score=24.76 Aligned_cols=70 Identities=19% Similarity=0.258 Sum_probs=44.6 Q ss_pred CEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCC--CCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEE Q ss_conf 189999970543568886279998748947999997352105--866814598899999966752884379999971 Q gi|254780791|r 34 HVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNK--IEFLPEEGIEFLVIGKITTFPGSSKYQIIIESL 108 (529) Q Consensus 34 ~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~--~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i 108 (529) .+.|.|=|++.+ ..++||+-|+|-|..+.|+|.+.+..-.. +...+-...=|-|.|.+ ...|++ |+|++| T Consensus 148 ev~IIGmV~d~r-~tknG~~~~~lED~tG~i~vli~k~~~~~~~~a~~ll~DeVIgv~G~~---s~~g~~-~f~~~i 219 (497) T PRK04036 148 EVSIIGMVSDIR-STKNGHKIVELEDPTGTFSVLIMKDREDLGELAEDLLPDEVIGVKGTL---SGDGPL-MFADEI 219 (497) T ss_pred CEEEEEEECCCE-ECCCCCEEEEEECCCCEEEEEEECCCHHHHHHHHHCCCCCEEEEEEEE---CCCCCE-EEEEEC T ss_conf 089999975413-136774799998789839999947837877777624577578999998---699877-998404 No 76 >PRK06143 enoyl-CoA hydratase; Provisional Probab=90.86 E-value=0.93 Score=26.92 Aligned_cols=78 Identities=18% Similarity=0.254 Sum_probs=43.2 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCEEEEE-CCC------CCHHHHHHCCH----------HHHHHHHHHCCCEEEEEE- Q ss_conf 036799999999741003576777589995-168------88444220076----------999999974890488520- Q gi|254780791|r 186 CPKEIANAILQLNTLKEGRTCPRPDIIILA-RGG------GSIEDLWHFND----------EMIVRAIANSSIPIISAI- 247 (529) Q Consensus 186 a~~~i~~ai~~~~~~~~~~~~~~~D~iii~-RGG------GS~eDL~~FN~----------e~laraI~~~~iPVisgI- 247 (529) -..++.+++..++... .+-||||. .|+ |.+.++..++. ..+..+|..||.|||++| T Consensus 37 m~~eL~~~~~~~~~d~------~vrvvVltg~G~~~F~aG~Dl~~~~~~~~~~~~~~~~~~~~~~~~i~~~~kPvIAav~ 110 (260) T PRK06143 37 VILALTQALRWLAADP------DVRVLVLRGAGEKAFIGGADIKEMATLDQASAEAFISRLRDLCDAVRHFPVPVIARIP 110 (260) T ss_pred HHHHHHHHHHHHHHCC------CCEEEEEECCCCCCCCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEEC T ss_conf 9999999999885389------9559999658877634885679886159656899999999999999948998899987 Q ss_pred CCCCCC-HHHHHHHCCCCCCCHH Q ss_conf 577752-5898864123777214 Q gi|254780791|r 248 GHETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 248 GHE~D~-Tl~D~VAD~Ra~TPTa 269 (529) ||=.=- .-.=+.+|.|..+++| T Consensus 111 G~a~GgG~~lal~cD~~ias~~a 133 (260) T PRK06143 111 GWCLGGGLELAAACDLRIAAHDA 133 (260) T ss_pred CEEECCCHHHHHCCCEEEECCCC T ss_conf 96110425766447765322365 No 77 >PRK08138 enoyl-CoA hydratase; Provisional Probab=90.81 E-value=0.81 Score=27.43 Aligned_cols=77 Identities=25% Similarity=0.355 Sum_probs=45.9 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH----HHH-----------HHCCHHHHHHHHHHCCCEEEEEE-CC Q ss_conf 036799999999741003576777589995168884----442-----------20076999999974890488520-57 Q gi|254780791|r 186 CPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI----EDL-----------WHFNDEMIVRAIANSSIPIISAI-GH 249 (529) Q Consensus 186 a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~----eDL-----------~~FN~e~laraI~~~~iPVisgI-GH 249 (529) -..++..++..++... .+-|||| +|+|-. -|| +.-+.+.+.++|.+||.|||.+| || T Consensus 35 m~~el~~al~~~~~d~------~vr~vVl-tg~g~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~~i~~~~kPvIaaV~G~ 107 (260) T PRK08138 35 VRQQLAEHFTSLSEDP------DIRVIVL-TGGGTVFAAGADLKEFATAGAIELYLRHTERYWEAIAQCPKPVIAAVNGY 107 (260) T ss_pred HHHHHHHHHHHHHHCC------CCEEEEE-ECCCCCEECCCCHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCE T ss_conf 9999999999997689------9669999-78998764687888884458467899999999999983899899997894 Q ss_pred CCCC-HHHHHHHCCCCCCCHH Q ss_conf 7752-5898864123777214 Q gi|254780791|r 250 ETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 250 E~D~-Tl~D~VAD~Ra~TPTa 269 (529) =.=- .-.=+.+|+|..+++| T Consensus 108 a~GgG~~lalacD~~ia~~~a 128 (260) T PRK08138 108 ALGGGCELAMHADIIVAGESA 128 (260) T ss_pred EECCCHHHHHHCCCCCCCCCC T ss_conf 033307998741011335444 No 78 >TIGR01278 DPOR_BchB light-independent protochlorophyllide reductase, B subunit; InterPro: IPR005969 Synonym: dark protochlorophyllide reductase Protochlorophyllide reductase catalyzes the reductive formation of chlorophyllide from protochlorophyllide during biosynthesis of chlorophylls and bacteriochlorophylls. The light-independent (dark) form of protochlorophyllide reductase plays a key role in the ability of gymnosperms, algae, and photosynthetic bacteria to form chlorophyll in the dark. Genetic and sequence analyses have indicated that dark protochlorophyllide reductase consists of three protein subunits that exhibit significant sequence similarity to the three subunits of nitrogenase, which catalyzes the reductive formation of ammonia from dinitrogen. Dark protochlorophyllide reductase activity was shown to be dependent on the presence of all three subunits, ATP, and the reductant dithionite .; GO: 0016730 oxidoreductase activity acting on iron-sulfur proteins as donors, 0015995 chlorophyll biosynthetic process, 0019685 photosynthesis dark reaction. Probab=90.77 E-value=0.97 Score=26.73 Aligned_cols=109 Identities=17% Similarity=0.338 Sum_probs=80.2 Q ss_pred EEEECCCHHHHHHHHHHHHHC---CC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH--- Q ss_conf 998478425899999986305---97-58999721001111036799999999741003576777589995168884--- Q gi|254780791|r 149 AVITSPTGAVIRDILQRISCR---FP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI--- 221 (529) Q Consensus 149 ~vits~~~a~~~D~~~~~~~r---~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~--- 221 (529) +|+=+|.|..|-|++.+.=+| || +.+.++...+=+.++..-++.+|++.+... +||+|||+=-=-|. T Consensus 28 ~v~HAPqGDdY~n~~~sMLER~~~fpp~T~Svv~~~~La~Gs~~rl~~tv~~~d~~~------~PdlIvv~psCss~lLQ 101 (562) T TIGR01278 28 AVLHAPQGDDYVNVMFSMLERTPNFPPVTTSVVDRRDLARGSQARLVDTVRRVDERE------KPDLIVVTPSCSSSLLQ 101 (562) T ss_pred EEEECCCCCHHHHHHHHHHCCCCCCCCCEEEEECCEECCHHHHHHHHHHHHHHHHCC------CCCEEEECCCHHHHHHH T ss_conf 688678884289999986227898594027676255754135574179898651035------88679982220147887 Q ss_pred HHHHHCCHHHHHHHHHH-CCCEEEE-EECC----C---CCCHHHHHHHCCCCCCCH Q ss_conf 44220076999999974-8904885-2057----7---752589886412377721 Q gi|254780791|r 222 EDLWHFNDEMIVRAIAN-SSIPIIS-AIGH----E---TDWTLADYAADLRAPTPT 268 (529) Q Consensus 222 eDL~~FN~e~laraI~~-~~iPVis-gIGH----E---~D~Tl~D~VAD~Ra~TPT 268 (529) ||| ..+|++-.- .+++||. .|-| | -|.+|-+.|...-.+-|+ T Consensus 102 EDL-----~~~a~~~g~~t~~~v~~~~vn~yR~~E~~A~~~~L~~lV~~~~~~~~~ 152 (562) T TIGR01278 102 EDL-----GNLAAAAGLDTKSKVLVADVNAYRVKELQAADRLLTQLVRRFAKEQPK 152 (562) T ss_pred HCH-----HHHHHHHCCCCCCCCCHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 168-----999998263356531030376578999999899999999998477888 No 79 >PRK11154 fadJ multifunctional fatty acid oxidation complex subunit alpha; Reviewed Probab=90.75 E-value=0.62 Score=28.49 Aligned_cols=83 Identities=19% Similarity=0.306 Sum_probs=43.1 Q ss_pred CCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC-H---HHHHHCC--------------HHHHHHHHHHCCCEEE Q ss_conf 11103679999999974100357677758999516888-4---4422007--------------6999999974890488 Q gi|254780791|r 183 GDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGS-I---EDLWHFN--------------DEMIVRAIANSSIPII 244 (529) Q Consensus 183 G~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS-~---eDL~~FN--------------~e~laraI~~~~iPVi 244 (529) ......++..++..++... .+-+|||..|+|- + -||-.|. -..+.++|..+|.||| T Consensus 32 ~~~~~~~L~~~l~~~~~d~------~vr~vVl~sg~g~~F~AGaDl~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~kPvI 105 (706) T PRK11154 32 KAEFAEQVRAILKQIKEDK------SLKGVVFISGKPDNFIAGADINMLAACKTAQEAEALARQGQQLFAEIEALPIPVV 105 (706) T ss_pred CHHHHHHHHHHHHHHHHCC------CCEEEEEEECCCCEEEECCCHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCEE T ss_conf 9999999999999985099------9669999517998047072977672478757799999888999999980999889 Q ss_pred EEE-------CCCCCCHHHHHHHCCCCCCCHHHHHHHCC Q ss_conf 520-------57775258988641237772145676332 Q gi|254780791|r 245 SAI-------GHETDWTLADYAADLRAPTPTGAAEMAVP 276 (529) Q Consensus 245 sgI-------GHE~D~Tl~D~VAD~Ra~TPTaAAElavp 276 (529) .+| |.| +=+.+|+|..+.++.+.+..| T Consensus 106 AAvnG~a~GgG~E-----laLacD~RiAs~~a~~~~g~P 139 (706) T PRK11154 106 AAIHGACLGGGLE-----LALACHYRVCTDDPKTVLGLP 139 (706) T ss_pred EEECCEEEHHHHH-----HHHHCCEEEECCCCCCCCCCC T ss_conf 9987864189999-----999689999617846321364 No 80 >PRK09010 single-stranded DNA-binding protein; Provisional Probab=90.48 E-value=1.7 Score=24.49 Aligned_cols=77 Identities=14% Similarity=0.242 Sum_probs=51.2 Q ss_pred CCCCEEEEEEECC---CCCCCCCCEEE--EEE------ECCC--------CEEEEEEECCCCCCCCCCCCCCCEEEEEEE Q ss_conf 0171899999705---43568886279--998------7489--------479999973521058668145988999999 Q gi|254780791|r 31 NLSHVCVRGEISG---YRGIHSSGHAY--FSL------KDNH--------SRIDAIIWKGTLNKIEFLPEEGIEFLVIGK 91 (529) Q Consensus 31 ~~~~~~v~gEis~---~~~~~~sGH~Y--f~l------kd~~--------a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~ 91 (529) .+..|-+.|-|.. ++ +..+|+.+ |+| ||.. .=++|++|...+..+.-.++-|++|.|.|+ T Consensus 5 ~~NkV~LiGnLg~DPElr-~t~~G~~va~fslAt~~~~kd~~~Ge~~e~t~w~~v~~wgk~AE~~~~yl~KG~~V~VeGr 83 (178) T PRK09010 5 GVNKVILVGNLGQDPEVR-YMPNGGAVANITLATSESWRDKATGEMKEQTEWHRVVLFGKLAEVAGEYLRKGSQVYIEGQ 83 (178) T ss_pred CCCEEEEEEECCCCCEEE-ECCCCCEEEEEEEEECCCCCCCCCCCEECCCEEEEEEEECHHHHHHHHHCCCCCEEEEEEE T ss_conf 875899988637696588-8899986999999865871126787150331289999967688998876179998999989 Q ss_pred EEE--ECCCCEEEEEEEEE Q ss_conf 667--52884379999971 Q gi|254780791|r 92 ITT--FPGSSKYQIIIESL 108 (529) Q Consensus 92 ~~~--y~~~g~~ql~v~~i 108 (529) +.. |..+-+-.-++..| T Consensus 84 L~~r~w~dk~G~~ry~tei 102 (178) T PRK09010 84 LRTRKWTDQSGQDRYTTEV 102 (178) T ss_pred EECCCCCCCCCCEEEEEEE T ss_conf 8644337589998989999 No 81 >cd06325 PBP1_ABC_uncharacterized_transporter Type I periplasmic ligand-binding domain of uncharacterized ABC-type transport systems that are predicted to be involved in the uptake of amino acids, peptides, or inorganic ions. This group includes the type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type transport systems that are predicted to be involved in the uptake of amino acids, peptides, or inorganic ions. This subgroup has high sequence similarity to members of the family of hydrophobic amino acid transporters (HAAT), such as leucine/isoleucine/valine binding protein (LIVBP); its ligand specificity has not been determined experimentally. Probab=90.40 E-value=1.8 Score=24.44 Aligned_cols=84 Identities=20% Similarity=0.316 Sum_probs=56.4 Q ss_pred CCCEEEEEECC---CHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC Q ss_conf 65289998478---425899999986305975899972100111103679999999974100357677758999516888 Q gi|254780791|r 144 IPKIIAVITSP---TGAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGS 220 (529) Q Consensus 144 ~p~~i~vits~---~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS 220 (529) -.++||||-++ ++...-+.++....+.++++..+++. ...+|-.++..+.. ++|++.+.- T Consensus 130 ~~k~igvlyn~~e~ns~~~~~~~~~~a~~~Gi~v~~~~v~-----~~~ei~~a~~~l~~--------~~Dal~~~~---- 192 (281) T cd06325 130 DAKTVGVLYNPSEANSVVQVKELKKAAAKLGIEVVEATVS-----SSNDVQQAAQSLAG--------KVDAIYVPT---- 192 (281) T ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEECC-----CHHHHHHHHHHHHH--------CCCEEEEEC---- T ss_conf 9858999957998656999999999999769889999459-----88899999997562--------589999918---- Q ss_pred HHHHHHC-CHHHHHHHHHHCCCEEEEE Q ss_conf 4442200-7699999997489048852 Q gi|254780791|r 221 IEDLWHF-NDEMIVRAIANSSIPIISA 246 (529) Q Consensus 221 ~eDL~~F-N~e~laraI~~~~iPVisg 246 (529) |-+.. +-+.++....+.+|||+++ T Consensus 193 --d~~v~s~~~~i~~~a~~~~iPv~~~ 217 (281) T cd06325 193 --DNTVASAMEAVVKVANEAKIPVIAS 217 (281) T ss_pred --CCHHHHHHHHHHHHHHHCCCCEEEC T ss_conf --8127779999999998749988936 No 82 >PRK09245 enoyl-CoA hydratase; Provisional Probab=90.36 E-value=0.9 Score=27.04 Aligned_cols=75 Identities=21% Similarity=0.303 Sum_probs=43.5 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC-H---HHH---HHCC-----------------HHHHHHHHHHCCCEE Q ss_conf 679999999974100357677758999516888-4---442---2007-----------------699999997489048 Q gi|254780791|r 188 KEIANAILQLNTLKEGRTCPRPDIIILARGGGS-I---EDL---WHFN-----------------DEMIVRAIANSSIPI 243 (529) Q Consensus 188 ~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS-~---eDL---~~FN-----------------~e~laraI~~~~iPV 243 (529) .++..++..++... .+.|||| +|.|. + -|| +.+. =..+.++|++|+.|| T Consensus 34 ~el~~~l~~~~~d~------~vrvvVi-tg~G~~F~aG~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~kPv 106 (266) T PRK09245 34 DALVAACARINADR------SVRAVIL-TGAGTAFSSGGNVKDMRARAGAFGGSPADIRQGYRRGIQRIPLALYNLEVPV 106 (266) T ss_pred HHHHHHHHHHHHCC------CEEEEEE-ECCCCEEECCCCHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCE T ss_conf 99999999884099------8269999-7888704256568999753156667677899999999999999998389988 Q ss_pred EEEE-CCCCCC-HHHHHHHCCCCCCCHH Q ss_conf 8520-577752-5898864123777214 Q gi|254780791|r 244 ISAI-GHETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 244 isgI-GHE~D~-Tl~D~VAD~Ra~TPTa 269 (529) |++| ||=.-- .-+=+.+|+|.+++.+ T Consensus 107 Iaai~G~a~GgG~~lal~cD~~va~~~a 134 (266) T PRK09245 107 IAAVNGPAIGAGCDLACMCDIRIASETA 134 (266) T ss_pred EEEECCEECCCHHHHHHCCCEEEECCCC T ss_conf 9998266625305777607754513575 No 83 >PRK06293 single-stranded DNA-binding protein; Provisional Probab=90.24 E-value=0.67 Score=28.17 Aligned_cols=75 Identities=23% Similarity=0.352 Sum_probs=49.0 Q ss_pred EEEEEEC---CCCCCCCCC--EEEEEE--------ECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEE--ECC--- Q ss_conf 9999970---543568886--279998--------7489479999973521058668145988999999667--528--- Q gi|254780791|r 36 CVRGEIS---GYRGIHSSG--HAYFSL--------KDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITT--FPG--- 97 (529) Q Consensus 36 ~v~gEis---~~~~~~~sG--H~Yf~l--------kd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~--y~~--- 97 (529) .+.|-+. .+| +..|| -+=|+| +|+.--|+||+|+..+..+.--++.|..|.|.|++.. |+. T Consensus 5 ~LvGrLt~DPElR-~T~sG~aVa~FtlAvn~~~~~~~et~fi~~v~W~k~AE~~~~yl~KGs~V~VeGrl~trsyedkdG 83 (161) T PRK06293 5 YFAGYLGADPEER-MTSKGKRVVVLRLGVKSRVGSKDETVWCRCNIWHNRYDKMLPYLKKGSGVIVAGDISVESYMSKDG 83 (161) T ss_pred EEEEECCCCCCEE-ECCCCCEEEEEEEEECCCCCCCCCCEEEEEEEHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCC T ss_conf 5456146687244-889998799999998688787568659999973478999999864789899998777347687899 Q ss_pred --CCEEEEEEEEEEEC Q ss_conf --84379999971016 Q gi|254780791|r 98 --SSKYQIIIESLIPS 111 (529) Q Consensus 98 --~g~~ql~v~~i~~~ 111 (529) ++...++++.++.. T Consensus 84 ~kry~tEVvad~~~fl 99 (161) T PRK06293 84 SPQSSLVVSVDTLKFS 99 (161) T ss_pred CEEEEEEEEEEEEEEC T ss_conf 8898999999678965 No 84 >PRK06951 consensus Probab=90.24 E-value=0.81 Score=27.46 Aligned_cols=75 Identities=25% Similarity=0.334 Sum_probs=43.7 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH----HHHHHC----------CHHHHHHHHHHCCCEEEEEE--- Q ss_conf 1036799999999741003576777589995168884----442200----------76999999974890488520--- Q gi|254780791|r 185 ECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI----EDLWHF----------NDEMIVRAIANSSIPIISAI--- 247 (529) Q Consensus 185 ~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~----eDL~~F----------N~e~laraI~~~~iPVisgI--- 247 (529) .-..++..++..++... .+-|||| +|+|.. -||-.| .-..+.++|.+||.|||++| T Consensus 28 ~m~~el~~~~~~~~~d~------~vrvvvl-~g~g~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~i~~~~kPvIaav~G~ 100 (254) T PRK06951 28 AMYQTMADALAAAQDDA------AVRAILI-RGSDGIFSAGNDLEDFLKAPPKDEDAPVFQFLRQISSAPKPIVAAVCGP 100 (254) T ss_pred HHHHHHHHHHHHHHHCC------CEEEEEE-ECCCCCEECCCCHHHHHHCCCCCCCCHHHHHHHHHHHCCCCEEEEECCE T ss_conf 99999999999986688------9179999-8799987689975988618764333289999999983899989998687 Q ss_pred --CCCCCCHHHHHHHCCCCCCCHH Q ss_conf --5777525898864123777214 Q gi|254780791|r 248 --GHETDWTLADYAADLRAPTPTG 269 (529) Q Consensus 248 --GHE~D~Tl~D~VAD~Ra~TPTa 269 (529) |.-.... +.+|+|..++++ T Consensus 101 a~GgG~~la---l~~D~~ia~~~a 121 (254) T PRK06951 101 AVGIGTTML---LHCDLVYAADTA 121 (254) T ss_pred EECCCHHHH---HHCCCHHHCCCC T ss_conf 514415776---604301220367 No 85 >pfam00436 SSB Single-strand binding protein family. This family includes single stranded binding proteins and also the primosomal replication protein N (PriB). PriB forms a complex with PriA, PriC and ssDNA. Probab=90.09 E-value=1.2 Score=26.08 Aligned_cols=76 Identities=17% Similarity=0.309 Sum_probs=52.7 Q ss_pred CEEEEEEECC---CCCCCCCCE--EEEEEEC------C-------CCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEE-- Q ss_conf 1899999705---435688862--7999874------8-------947999997352105866814598899999966-- Q gi|254780791|r 34 HVCVRGEISG---YRGIHSSGH--AYFSLKD------N-------HSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKIT-- 93 (529) Q Consensus 34 ~~~v~gEis~---~~~~~~sGH--~Yf~lkd------~-------~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~-- 93 (529) .|-+.|-|.. ++. ..+|. +-|+|-- . ..-+.|++|...+..+.-.++.|+.|.|.|++. T Consensus 3 ~v~l~G~l~~dpe~r~-~~~g~~~~~f~va~~~~~~~~~~~~~~~t~~~~v~~~g~~A~~~~~~l~KG~~V~V~G~l~~~ 81 (104) T pfam00436 3 KVILVGRLTRDPELRY-TPNGNAVANFTLAVNRRFKDQNGESDEETDFIRVVVWGKLAENAAEYLKKGSLVYVEGRLRTR 81 (104) T ss_pred EEEEEEEECCCCEEEE-CCCCCEEEEEEEEEEECCCCCCCCCCCCCEEEEEEECCHHHHHHHHCCCCCCEEEEEEEEEEE T ss_conf 9999999056989999-689988999999996021346787024558998773054465431227899799999997811 Q ss_pred EEC-CCC----EEEEEEEEEEE Q ss_conf 752-884----37999997101 Q gi|254780791|r 94 TFP-GSS----KYQIIIESLIP 110 (529) Q Consensus 94 ~y~-~~g----~~ql~v~~i~~ 110 (529) .|. +.| .+.+++++|+. T Consensus 82 ~~~~~~G~~r~~~~v~~~~i~~ 103 (104) T pfam00436 82 KYEDQEGQKRYVTEIVADNVQF 103 (104) T ss_pred EEECCCCCEEEEEEEEEEEEEE T ss_conf 3799999899999999999994 No 86 >PRK05733 single-stranded DNA-binding protein; Provisional Probab=90.07 E-value=1.8 Score=24.25 Aligned_cols=77 Identities=16% Similarity=0.205 Sum_probs=50.7 Q ss_pred CCCEEEEEEEC---CCCCCCCCCEEE--EEE------ECC--------CCEEEEEEECCCCCCCCCCCCCCCEEEEEEEE Q ss_conf 17189999970---543568886279--998------748--------94799999735210586681459889999996 Q gi|254780791|r 32 LSHVCVRGEIS---GYRGIHSSGHAY--FSL------KDN--------HSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKI 92 (529) Q Consensus 32 ~~~~~v~gEis---~~~~~~~sGH~Y--f~l------kd~--------~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~ 92 (529) +..|-+.|-|. .++. ..+|+.+ |+| +|. ..=++|++|...+..+.-.++.|++|.|.|++ T Consensus 5 ~NkV~LiG~lg~DPElr~-t~~G~~v~~fslAt~~~~~~~~~ge~~e~T~w~~v~~wgk~AE~~~~yl~KG~~V~VeGrL 83 (172) T PRK05733 5 VNKVILVGTCGQDPEVRY-LPNGNAVTNLSLATSEQWTDKQSGQKVERTEWHRVSLFGKVAEIAGEYLRKGSQVYIEGKL 83 (172) T ss_pred CCEEEEEEECCCCCEEEE-CCCCCEEEEEEEEEECCCCCCCCCCEECCCCEEEEEEECHHHHHHHHHCCCCCEEEEEEEE T ss_conf 628999755577963888-8999879999999725500167875831232289987345788888653899989999888 Q ss_pred EE--ECCCCEEEEEEEEEEE Q ss_conf 67--5288437999997101 Q gi|254780791|r 93 TT--FPGSSKYQIIIESLIP 110 (529) Q Consensus 93 ~~--y~~~g~~ql~v~~i~~ 110 (529) .. |+..|. .-++..|.. T Consensus 84 ~tr~wekdG~-kry~teiv~ 102 (172) T PRK05733 84 QTREWEKDGI-KRYTTEIVV 102 (172) T ss_pred EEECHHHCCC-EEEEEEEEE T ss_conf 8404755898-798999999 No 87 >PRK13805 bifunctional acetaldehyde-CoA/alcohol dehydrogenase; Provisional Probab=89.78 E-value=0.93 Score=26.93 Aligned_cols=97 Identities=26% Similarity=0.394 Sum_probs=63.2 Q ss_pred CCEEEEEECCCHHH---HHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH Q ss_conf 52899984784258---999999863059758999721001111036799999999741003576777589995168884 Q gi|254780791|r 145 PKIIAVITSPTGAV---IRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI 221 (529) Q Consensus 145 p~~i~vits~~~a~---~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~ 221 (529) .+|+-|||-+.=.- +..+...| +..++++.+|. -|.-+-....+-++++.++... +| .||+=||||. T Consensus 482 ~kralIVTD~~~~~~G~vd~v~~~L-~~~gi~~~vFd-~V~pdPt~~~V~~G~~~~~~~~-------~D-~IIalGGGS~ 551 (862) T PRK13805 482 KKRAFIVTDRGMVELGYVDKVTDVL-RANGVETEVFS-EVEPDPTLSTVRKGAELMRSFK-------PD-TIIALGGGSP 551 (862) T ss_pred CCEEEEECCHHHHHCCCHHHHHHHH-HHCCCEEEEEC-CCCCCCCHHHHHHHHHHHHHCC-------CC-EEEEECCCCH T ss_conf 9779998785187678899999999-98698399966-9899939799999999998649-------99-9999478348 Q ss_pred HH----HHHCCH------HH-------HHHHHHHCC----------CEEEEEECCCC Q ss_conf 44----220076------99-------999997489----------04885205777 Q gi|254780791|r 222 ED----LWHFND------EM-------IVRAIANSS----------IPIISAIGHET 251 (529) Q Consensus 222 eD----L~~FN~------e~-------laraI~~~~----------iPVisgIGHE~ 251 (529) -| .|.|.+ +. +-+.++..| ||.-||-|-|. T Consensus 552 iDaAKai~~~~e~p~~~~~~~~~~f~di~kr~~~~P~~~~k~~liaIPTTaGTGSEv 608 (862) T PRK13805 552 MDAAKIMWLFYEHPEVDFEDLAQKFMDIRKRIYKFPKLGKKAKLVAIPTTSGTGSEV 608 (862) T ss_pred HHHHHHHHHHHHCCCCCHHHHCCCCCCHHHHCCCCCCCCCCCEEEEECCCCCCHHHC T ss_conf 899999999986887415443122121122103578877766189965887313310 No 88 >PRK08486 single-stranded DNA-binding protein; Provisional Probab=89.67 E-value=2 Score=23.98 Aligned_cols=78 Identities=26% Similarity=0.299 Sum_probs=53.5 Q ss_pred CCEEEEEEEC---CCCCCCCCCEEE--EEE------ECCC-------CEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEE Q ss_conf 7189999970---543568886279--998------7489-------479999973521058668145988999999667 Q gi|254780791|r 33 SHVCVRGEIS---GYRGIHSSGHAY--FSL------KDNH-------SRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITT 94 (529) Q Consensus 33 ~~~~v~gEis---~~~~~~~sGH~Y--f~l------kd~~-------a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~ 94 (529) ..|-+.|-|. .+|. -.||+.. |+| |+.+ --++|++|...+..+.-.++.|++|.|.|++.. T Consensus 3 NkViLvGrL~~DpElr~-t~~G~~v~~f~lAv~r~~~~~~Ge~~e~t~f~~v~~wgk~AE~~~~yl~KGs~V~VeGrL~~ 81 (180) T PRK08486 3 NKVILVGNLTRDVELRY-LPSGSAIATIGLATSRRFKKQDGEKGEEVCFIDIRLFGRTAEIANQYLSKGSKVLIEGRLTF 81 (180) T ss_pred EEEEEEEECCCCCEEEE-CCCCCEEEEEEEEECCCEECCCCCCCCCCEEEEEEECCHHHHHHHHHCCCCCEEEEEEEEEC T ss_conf 06999885686863888-79998899999986686667888780223699998816789888752189999999999884 Q ss_pred --ECC-----CCEEEEEEEEEEEC Q ss_conf --528-----84379999971016 Q gi|254780791|r 95 --FPG-----SSKYQIIIESLIPS 111 (529) Q Consensus 95 --y~~-----~g~~ql~v~~i~~~ 111 (529) |.. ++...++|+.|+.- T Consensus 82 r~w~dkdG~~ry~teVv~d~~~fl 105 (180) T PRK08486 82 ESWMDQNGQKRSKHTITAESMQML 105 (180) T ss_pred CCCCCCCCCEEEEEEEEEEEEEEE T ss_conf 645157998899999999688980 No 89 >PRK08512 consensus Probab=89.59 E-value=2 Score=23.94 Aligned_cols=45 Identities=13% Similarity=0.169 Sum_probs=21.0 Q ss_pred CCCCCCCCCCCCEEEEEEEEEEE--CCCCEEEEEEEEEEECCCCHHH Q ss_conf 10586681459889999996675--2884379999971016800799 Q gi|254780791|r 73 LNKIEFLPEEGIEFLVIGKITTF--PGSSKYQIIIESLIPSGSGTLL 117 (529) Q Consensus 73 ~~~~~~~~~~G~~v~~~g~~~~y--~~~g~~ql~v~~i~~~g~G~l~ 117 (529) ..+...+|==|+++.+...-.+- ..+.+|.|++..-...|-.+|. T Consensus 55 ~kk~gIKPIiG~E~yv~~~~~~~~~~~~~~~hl~LLAkN~~GY~NL~ 101 (1185) T PRK08512 55 MKKEGIKPIIGMEAYIHNHDELDDKSSKQRFHLCLFAKNEIGYKNLM 101 (1185) T ss_pred HHHCCCCEEEEEEEEEECCCCCCCCCCCCCCEEEEEECCHHHHHHHH T ss_conf 99869982775389983687556656566412999948999999999 No 90 >PRK11423 methylmalonyl-CoA decarboxylase; Provisional Probab=89.56 E-value=1.1 Score=26.30 Aligned_cols=40 Identities=23% Similarity=0.237 Sum_probs=27.9 Q ss_pred HHHHHHHHHCCCEEEEEE-CCCCCCH-HHHHHHCCCCCCCHH Q ss_conf 999999974890488520-5777525-898864123777214 Q gi|254780791|r 230 EMIVRAIANSSIPIISAI-GHETDWT-LADYAADLRAPTPTG 269 (529) Q Consensus 230 e~laraI~~~~iPVisgI-GHE~D~T-l~D~VAD~Ra~TPTa 269 (529) ..+.++|..||.|||.+| ||=.=-- -.=+.+|+|..+|++ T Consensus 86 ~~~~~~i~~~~kPvIAav~G~a~GgG~~lal~cD~~ia~~~a 127 (261) T PRK11423 86 RQILRMIQKFPKPVIAMVEGSVWGGAFDLVMSCDLIIAASTS 127 (261) T ss_pred HHHHHHHHHCCCCEEEEECCHHHHCHHHHHHHHHHHCCCCCC T ss_conf 999999994899899996773321408999987542101162 No 91 >PRK05981 enoyl-CoA hydratase; Provisional Probab=89.49 E-value=1.1 Score=26.10 Aligned_cols=39 Identities=21% Similarity=0.263 Sum_probs=25.3 Q ss_pred HHHHHHHHCCCEEEEEE-CCCCCC-HHHHHHHCCCCCCCHH Q ss_conf 99999974890488520-577752-5898864123777214 Q gi|254780791|r 231 MIVRAIANSSIPIISAI-GHETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 231 ~laraI~~~~iPVisgI-GHE~D~-Tl~D~VAD~Ra~TPTa 269 (529) .+.+++..||.|||++| ||=.=- .-+=+.+|+|..++.+ T Consensus 94 ~~~~~i~~~~kPvIaai~G~a~GgG~~lal~cD~~ias~~a 134 (266) T PRK05981 94 PFLRRLRNLHCPIVTAVNGPAAGVGMSFALMGDLILCARSA 134 (266) T ss_pred HHHHHHHHCCCCEEEEEECEEECCCHHHHHCCCCCCCCCCC T ss_conf 99999995899989998076632636776306405657887 No 92 >PRK07510 consensus Probab=89.47 E-value=1 Score=26.51 Aligned_cols=79 Identities=18% Similarity=0.267 Sum_probs=44.8 Q ss_pred CCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH----HHHHHCCH-------------------HHHHHHHHHCC Q ss_conf 11036799999999741003576777589995168884----44220076-------------------99999997489 Q gi|254780791|r 184 DECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI----EDLWHFND-------------------EMIVRAIANSS 240 (529) Q Consensus 184 ~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~----eDL~~FN~-------------------e~laraI~~~~ 240 (529) ..-..++..++..++... .+-+||| +|.|.. -||..|.+ ..+.++|.+|| T Consensus 31 ~~m~~el~~~~~~~~~d~------~vr~vvl-tg~g~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~ 103 (266) T PRK07510 31 DDLTRALVEAARRAADDP------EVRAVLL-TGAGGAFCAGGDVKSMAAARAAPPFEGAQRVANLRRRMEVSRLLHQMP 103 (266) T ss_pred HHHHHHHHHHHHHHHHCC------CCEEEEE-ECCCCCCCCCCCHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHCC T ss_conf 999999999999986489------9459999-789986055657898861447776327789999999999999998399 Q ss_pred CEEEEEE-CCCCCC-HHHHHHHCCCCCCCHH Q ss_conf 0488520-577752-5898864123777214 Q gi|254780791|r 241 IPIISAI-GHETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 241 iPVisgI-GHE~D~-Tl~D~VAD~Ra~TPTa 269 (529) .|||.+| ||=.=- .-+=+++|.|..++++ T Consensus 104 kPvIaav~G~a~GgG~~la~~cD~rias~~a 134 (266) T PRK07510 104 KPTVAQIDGAAAGAGLSLALACDLRVAGASA 134 (266) T ss_pred CCEEEEECCEEEEHHHHHHHCCCCHHCCCCC T ss_conf 9889998797764225676504601015443 No 93 >PRK05980 enoyl-CoA hydratase; Provisional Probab=89.42 E-value=1.3 Score=25.70 Aligned_cols=75 Identities=16% Similarity=0.282 Sum_probs=43.5 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC-----HHHHHHCCH-----------------HHHHHHHHHCCCEEE Q ss_conf 3679999999974100357677758999516888-----444220076-----------------999999974890488 Q gi|254780791|r 187 PKEIANAILQLNTLKEGRTCPRPDIIILARGGGS-----IEDLWHFND-----------------EMIVRAIANSSIPII 244 (529) Q Consensus 187 ~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS-----~eDL~~FN~-----------------e~laraI~~~~iPVi 244 (529) ..+|..++..++... .+.+||| +|.|. =-||-.|++ ..+.++|..||.||| T Consensus 32 ~~~l~~~l~~~~~d~------~ir~vVl-~g~G~~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~kPvI 104 (260) T PRK05980 32 IDRLMARLDAIEVDE------SVRAVIL-TGAGERAFSAGADIHEFSASVAAGPDVALRDFVRRGQTMTARLEAFPKPVI 104 (260) T ss_pred HHHHHHHHHHHHHCC------CCEEEEE-ECCCCCEEECCCCHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEE T ss_conf 999999999986299------9339999-669983443576767641210147247899999999999999997899989 Q ss_pred EEE-CCC--CCCHHHHHHHCCCCCCCHH Q ss_conf 520-577--7525898864123777214 Q gi|254780791|r 245 SAI-GHE--TDWTLADYAADLRAPTPTG 269 (529) Q Consensus 245 sgI-GHE--~D~Tl~D~VAD~Ra~TPTa 269 (529) ++| ||= -=..|+ +.+|+|..+|.+ T Consensus 105 a~v~G~a~GgG~~la-l~cD~~ia~~~a 131 (260) T PRK05980 105 AAVNGLAFGGGCEIT-EAVHLAIASERA 131 (260) T ss_pred EEECCEEEECCCHHH-HHCCCCCCCCCC T ss_conf 997788877753544-204130147778 No 94 >PRK05658 RNA polymerase sigma factor RpoD; Validated Probab=89.34 E-value=0.53 Score=29.06 Aligned_cols=14 Identities=29% Similarity=0.358 Sum_probs=8.7 Q ss_pred CHHHHHHHHHHHHH Q ss_conf 42589999998630 Q gi|254780791|r 155 TGAVIRDILQRISC 168 (529) Q Consensus 155 ~~a~~~D~~~~~~~ 168 (529) ++.-+.|++..|.+ T Consensus 41 ~~eqId~ii~~L~~ 54 (620) T PRK05658 41 DSEQIEDIISMIND 54 (620) T ss_pred CHHHHHHHHHHHHH T ss_conf 98999999999997 No 95 >pfam00378 ECH Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase. Probab=89.12 E-value=1.3 Score=25.56 Aligned_cols=76 Identities=16% Similarity=0.201 Sum_probs=42.6 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH----HH-------------HHHCCHHHHHHHHHHCCCEEEEEE-C Q ss_conf 36799999999741003576777589995168884----44-------------220076999999974890488520-5 Q gi|254780791|r 187 PKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI----ED-------------LWHFNDEMIVRAIANSSIPIISAI-G 248 (529) Q Consensus 187 ~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~----eD-------------L~~FN~e~laraI~~~~iPVisgI-G 248 (529) ..++.++++.+.... .+- +||.+|+|.. -| .|...-..+..+|.+||.|||++| | T Consensus 19 ~~~l~~~~~~~~~d~------~i~-vvil~g~g~~FsaG~Dl~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~PvIa~v~G 91 (169) T pfam00378 19 LTELIQALEKLEQDP------SVR-AVVLTGAPGAFSAGADIKEMAAKRPAQQAQFSLEALDLWSRLEDLPKPVIAAVNG 91 (169) T ss_pred HHHHHHHHHHHHHCC------CCE-EEEEECCCCCEECCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEECC T ss_conf 999999999998689------945-9999789964225888899861683778899999999999999689989999668 Q ss_pred CCCCC-HHHHHHHCCCCCCCHH Q ss_conf 77752-5898864123777214 Q gi|254780791|r 249 HETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 249 HE~D~-Tl~D~VAD~Ra~TPTa 269 (529) |-.-- .-.=+.+|.|..|+++ T Consensus 92 ~a~GgG~~la~~~D~~va~~~a 113 (169) T pfam00378 92 YALGGGLELALACDYRIAADNA 113 (169) T ss_pred CEECCCHHHHHHCCCHHHCCCC T ss_conf 3650460888737606535798 No 96 >TIGR02168 SMC_prok_B chromosome segregation protein SMC; InterPro: IPR011890 The SMC (structural maintenance of chromosomes) family of proteins, exist in virtually all organisms including both bacteria and archaea. The SMC proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms and form three types of heterodimer (SMC1SMC3, SMC2SMC4, SMC5SMC6), which are core components of large multiprotein complexes. The best known complexes are cohesin, which is responsible for sister-chromatid cohesion, and condensin, which is required for full chromosome condensation in mitosis. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and share a five-domain structure, with globular N- and C-terminal (IPR003395 from INTERPRO) domains separated by a long (circa 100 nm or 900 residues) coiled coil segment in the centre of which is a globular ''hinge'' domain, characterised by a set of four highly conserved glycine residues that are typical of flexible regions in a protein. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif (XXXXD, where X is any hydrophobic residue), and a LSGG motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases . All SMC proteins appear to form dimers, either forming homodimers with themselves, as in the case of prokaryotic SMC proteins, or heterodimers between different but related SMC proteins. The dimers are arranged in an antiparallel alignment. This orientation brings the N- and C-terminal globular domains (from either different or identical protamers) together, which unites an ATP binding site (Walker A motif) within the N-terminal domain with a Walker B motif (DA box) within the C-terminal domain, to form a potentially functional ATPase. Protein interaction and microscopy data suggest that SMC dimers form a ring-like structure which might embrace DNA molecules. Non-SMC subunits associate with the SMC amino- and carboxy-terminal domains. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. SMCs share not only sequence similarity but also structural similarity with ABC proteins. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair and epigenetic silencing of gene expression . The smc gene is often associated with scpB (IPR005234 from INTERPRO) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle , , .; GO: 0005515 protein binding, 0005524 ATP binding, 0005694 chromosome. Probab=89.04 E-value=2.2 Score=23.63 Aligned_cols=41 Identities=17% Similarity=0.307 Sum_probs=22.4 Q ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHCCC-EE Q ss_conf 6800799999999997654012261001631026528999847842589999998630597-58 Q gi|254780791|r 111 SGSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCRFP-LR 173 (529) Q Consensus 111 ~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r~p-~~ 173 (529) +|+.-...+.++.-+||..- ..|- +=+.||++=+.+.++ ++ T Consensus 168 AGisKYK~Rk~Et~~kL~~t---~~NL-------------------~Rl~Dil~El~~ql~~L~ 209 (1191) T TIGR02168 168 AGISKYKERKKETERKLERT---RENL-------------------ERLEDILKELERQLESLE 209 (1191) T ss_pred CCHHHHHHHHHHHHHHHHHH---HHHH-------------------HHHHHHHHHHHHHHHHHH T ss_conf 13578887799999989999---9859-------------------999999999884227789 No 97 >PRK07275 single-stranded DNA-binding protein; Provisional Probab=89.01 E-value=0.53 Score=29.11 Aligned_cols=77 Identities=21% Similarity=0.377 Sum_probs=52.9 Q ss_pred CEEEEEEEC---CCCCCCCCCE--EEEEE------ECCC-----CEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEE--E Q ss_conf 189999970---5435688862--79998------7489-----479999973521058668145988999999667--5 Q gi|254780791|r 34 HVCVRGEIS---GYRGIHSSGH--AYFSL------KDNH-----SRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITT--F 95 (529) Q Consensus 34 ~~~v~gEis---~~~~~~~sGH--~Yf~l------kd~~-----a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~--y 95 (529) .|-+.|-|. .+|. -.+|- +-|+| |+.+ --++|++|+..+..+.-.++.|.+|.|.|++.. | T Consensus 4 kV~LiGrl~~DpElr~-t~~G~~v~~fslAv~r~~k~~~ge~~td~~~~v~wg~~AE~~~~yl~KG~~V~VeGrl~~~~y 82 (163) T PRK07275 4 NVVLVGRMTRDAELRY-TPSNVAVATFTLAVNRTFKSQNGEREADFINCVIWRQQAENLANWAKKGALIGVTGRIQTRNY 82 (163) T ss_pred EEEEEEECCCCCEEEE-CCCCCEEEEEEEEECCCEECCCCCCCCEEEEEEECCHHHHHHHHHHCCCCEEEEEEEEECCCC T ss_conf 6999896588863889-899988999999847855878895033089999807789999987458999999999721553 Q ss_pred CC-----CCEEEEEEEEEEEC Q ss_conf 28-----84379999971016 Q gi|254780791|r 96 PG-----SSKYQIIIESLIPS 111 (529) Q Consensus 96 ~~-----~g~~ql~v~~i~~~ 111 (529) +. ++...++|+.|+.- T Consensus 83 ~dkdG~~r~~teVv~~~~~~l 103 (163) T PRK07275 83 ENQQGQRVYVTEVVADNFQML 103 (163) T ss_pred CCCCCCEEEEEEEEEEEEEEE T ss_conf 758997899999999688970 No 98 >PRK06213 enoyl-CoA hydratase; Provisional Probab=88.93 E-value=1.2 Score=25.81 Aligned_cols=40 Identities=20% Similarity=0.269 Sum_probs=27.8 Q ss_pred HHHHHHHHHCCCEEEEEE-CCCCC-CHHHHHHHCCCCCCCHH Q ss_conf 999999974890488520-57775-25898864123777214 Q gi|254780791|r 230 EMIVRAIANSSIPIISAI-GHETD-WTLADYAADLRAPTPTG 269 (529) Q Consensus 230 e~laraI~~~~iPVisgI-GHE~D-~Tl~D~VAD~Ra~TPTa 269 (529) ..+.+++..||.|||++| ||-.= =..+-+.+|+|..++++ T Consensus 81 ~~~~~~~~~~~kPvIaav~G~a~GgG~~lala~D~ria~~~~ 122 (229) T PRK06213 81 STLARRLLSHPKPVIVACTGHAIAKGAFLLLSGDYRIGVHGP 122 (229) T ss_pred HHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCEEEEECCC T ss_conf 999999995799999998786511998998718889996686 No 99 >PRK08182 single-stranded DNA-binding protein; Provisional Probab=88.87 E-value=0.86 Score=27.23 Aligned_cols=54 Identities=13% Similarity=0.260 Sum_probs=40.7 Q ss_pred ECCCC-EEEEEEECCCCCCCCCCCCCCCEEEEEEEEE--EEC-----CCCEEEEEEEEEEEC Q ss_conf 74894-7999997352105866814598899999966--752-----884379999971016 Q gi|254780791|r 58 KDNHS-RIDAIIWKGTLNKIEFLPEEGIEFLVIGKIT--TFP-----GSSKYQIIIESLIPS 111 (529) Q Consensus 58 kd~~a-~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~--~y~-----~~g~~ql~v~~i~~~ 111 (529) +|... =++|++|...+..+.-.++.|++|+|.|++. -|. .|..+.+++++|..- T Consensus 49 ~d~~~~W~~V~~Wg~~AE~~a~~L~KG~~V~V~Grl~~~~W~dkdG~~R~~~eV~Ad~v~~l 110 (148) T PRK08182 49 EDRGGFWAPVELWHRDAEHWARLYQKGMRVLVEGRMERDEWTDNEDNERVTFKVEARRVGIL 110 (148) T ss_pred EECCCEEEEEEEEHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCEEEEEEEEEEEEEEC T ss_conf 96664799999873699999998738998999989860134779998788999998688834 No 100 >TIGR00655 PurU formyltetrahydrofolate deformylase; InterPro: IPR004810 An Escherichia coli gene designated purU has been identified and characterised. The gene codes for a 280-amino-acid protein, PurU (P37051 from SWISSPROT, 3.5.1.10 from EC). PurU is an enzyme that catalyses the hydrolysis of 10-formyltetrahydrofolate (formyl-FH4) to FH4 and formate ,.10-formyltetrahydrofolate + H(2)O = formate +tetrahydrofolate Formyl-FH4 hydrolase generates the formate that is used by purT-encoded 5'-phosphoribosylglycinamide transformylase for step three of de novo purine nucleotide synthesis. Formyl-FH4 hydrolase, a hexamer of 32kDa subunits, is activated by methionine and inhibited by glycine. Heterotropic cooperativity is observed for activation by methionine in the presence of glycine and for inhibition by glycine in the presence of methionine. These results suggest that formyl-FH4 hydrolase is a regulatory enzyme whose main function is to balance the pools of FH4 and C1-FH4 in response to changing growth conditions. The enzyme uses methionine and glycine to sense the pools of C1-FH4 and FH4, respectively. ; GO: 0008864 formyltetrahydrofolate deformylase activity, 0006189 'de novo' IMP biosynthetic process. Probab=88.76 E-value=1.4 Score=25.22 Aligned_cols=81 Identities=21% Similarity=0.247 Sum_probs=53.4 Q ss_pred CCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHH----------------------HHHHCCCEEEEEEECCCCCC-CHHH Q ss_conf 226100163102652899984784258999999----------------------86305975899972100111-1036 Q gi|254780791|r 132 LFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQ----------------------RISCRFPLRVIIFPVKVQGD-ECPK 188 (529) Q Consensus 132 lfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~----------------------~~~~r~p~~~~~~p~~vQG~-~a~~ 188 (529) -|+.+.....-..+|||+|..|+..--+.|+|- -+.+||.++.+..|+.=--. .+.. T Consensus 79 k~~~~~~~~~~~~~krvai~vsKe~HCL~DLL~r~~~GeL~~ei~~VIsNH~~lr~lve~F~iPF~~~p~~~~~r~e~Ek 158 (294) T TIGR00655 79 KFEMTWELILADKLKRVAILVSKEDHCLGDLLWRWYSGELDAEIALVISNHEDLRELVERFGIPFHYIPATKDTRLEHEK 158 (294) T ss_pred HHCCEEEEEECCCCCEEEEEEECCHHHHHHHHHHHCCCCCEEEEEEEECCCHHHHHHHHHCCCCEEEECCCCCHHHHHHH T ss_conf 41744899612558768998404120358999987389823799998649878998866538988986378850568999 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCEEEEEC Q ss_conf 7999999997410035767775899951 Q gi|254780791|r 189 EIANAILQLNTLKEGRTCPRPDIIILAR 216 (529) Q Consensus 189 ~i~~ai~~~~~~~~~~~~~~~D~iii~R 216 (529) ...+.|+... ++ +.-.+|+||||| T Consensus 159 ~~Lell~~ye-lk---h~~~~DlvVLAk 182 (294) T TIGR00655 159 KELELLKQYE-LK---HKYQVDLVVLAK 182 (294) T ss_pred HHHHHHHCCC-CC---EECCCCEEEEEH T ss_conf 9999874033-21---102754698210 No 101 >TIGR00644 recJ single-stranded-DNA-specific exonuclease RecJ; InterPro: IPR004610 All proteins in this family are 5'-3' single-strand DNA exonucleases. These proteins are used in some aspects of mismatch repair, recombination, and recombinational repair. RecJ is a single-stranded DNA-specific endonuclease which is required for many types of recombination events, although the stringency of the requirement for RecJ appears to vary with the type of recombinational event monitored, and the other recombination gene products which are available.; GO: 0008409 5'-3' exonuclease activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=88.68 E-value=2.3 Score=23.44 Aligned_cols=58 Identities=28% Similarity=0.394 Sum_probs=26.1 Q ss_pred CCEEEEEEECCC-----CEEEEEEECCCCCCC--CCCCCCCCEEEEEEEEE--EECCCCEEEEEEEE Q ss_conf 862799987489-----479999973521058--66814598899999966--75288437999997 Q gi|254780791|r 50 SGHAYFSLKDNH-----SRIDAIIWKGTLNKI--EFLPEEGIEFLVIGKIT--TFPGSSKYQIIIES 107 (529) Q Consensus 50 sGH~Yf~lkd~~-----a~i~~~~~~~~~~~~--~~~~~~G~~v~~~g~~~--~y~~~g~~ql~v~~ 107 (529) +.|+=++|+++. ..+.|+.|+..-.-+ ......|..|-+.|+++ .|.++.++||+|.+ T Consensus 638 ~~H~k~~l~~~~~~p~g~~~~ai~f~~~~~~~~~~~~~~~~~~~~~~g~l~~n~~~g~~~~ql~i~d 704 (705) T TIGR00644 638 ENHLKLSLKSGEQGPDGKNIEAIAFNAGEKPLIQLLELNLGRPLDVAGKLSINEWRGREKPQLIIQD 704 (705) T ss_pred CCCEEEEEECCCCCCCCCEEEEEEEECCCCCCCCHHHCCCCCEEEEEEEEEEEEECCCCEEEEEEEC T ss_conf 5617899850776898736778987157722112121368857899999876500686304567722 No 102 >PRK06127 enoyl-CoA hydratase; Provisional Probab=88.68 E-value=1.5 Score=24.97 Aligned_cols=75 Identities=21% Similarity=0.333 Sum_probs=41.7 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCC--CH---HHHHHCC-----H----------HHHHHHHHHCCCEEEEE Q ss_conf 367999999997410035767775899951688--84---4422007-----6----------99999997489048852 Q gi|254780791|r 187 PKEIANAILQLNTLKEGRTCPRPDIIILARGGG--SI---EDLWHFN-----D----------EMIVRAIANSSIPIISA 246 (529) Q Consensus 187 ~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGG--S~---eDL~~FN-----~----------e~laraI~~~~iPVisg 246 (529) ..++..++..++... .+.+||| +|.| ++ -||..|. . ..+..+|+.||.|||++ T Consensus 40 ~~~l~~~l~~~~~d~------~vr~vVl-~g~G~~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~kPvIaa 112 (269) T PRK06127 40 WEALPQALAAAEDDD------AIRVVVL-TGAGEKAFVSGADISQFEESRSDAEAVAAYEQAVEAAQAALADYAKPTIAC 112 (269) T ss_pred HHHHHHHHHHHHHCC------CEEEEEE-ECCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEE T ss_conf 999999999986389------9159999-689977413686457775224786788999999999999999589989999 Q ss_pred E-CCCC--CCHHHHHHHCCCCCCCHH Q ss_conf 0-5777--525898864123777214 Q gi|254780791|r 247 I-GHET--DWTLADYAADLRAPTPTG 269 (529) Q Consensus 247 I-GHE~--D~Tl~D~VAD~Ra~TPTa 269 (529) | ||-. -.. .=+.+|+|.++|.+ T Consensus 113 v~G~a~GgG~~-lal~~D~~ia~~~a 137 (269) T PRK06127 113 IRGYCIGGGMG-IALACDIRIAAEDS 137 (269) T ss_pred ECCCCCCCCHH-HHCCCCEEEECCCC T ss_conf 76812026413-32067667756995 No 103 >PRK05869 enoyl-CoA hydratase; Validated Probab=88.62 E-value=1.5 Score=25.15 Aligned_cols=77 Identities=29% Similarity=0.294 Sum_probs=45.5 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH---HHHHHC-------------CHHHHHHHHHHCCCEEEEEE-C Q ss_conf 036799999999741003576777589995168884---442200-------------76999999974890488520-5 Q gi|254780791|r 186 CPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI---EDLWHF-------------NDEMIVRAIANSSIPIISAI-G 248 (529) Q Consensus 186 a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~---eDL~~F-------------N~e~laraI~~~~iPVisgI-G 248 (529) ...++..+++.++... .+-+|||.=||+.+ -||-.| --..+..++.+||.|||.+| | T Consensus 35 ~~~~L~~al~~~~~d~------~vr~vvl~g~g~~F~aG~Dl~~~~~~~~~~~~~~~~~~~~~~~~i~~~pkPvIAav~G 108 (225) T PRK05869 35 VYREIVAAADELGRRD------DVAAVILFGGHEIFSAGDDMPELRTLSAQEADTAARVRQQAVDAVAAIPKPTVAAITG 108 (225) T ss_pred HHHHHHHHHHHHHHCC------CCEEEEEECCCCCEECCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEECC T ss_conf 9999999999996489------9459999889974772677787722683345578999999999998389989999828 Q ss_pred CCC--CCHHHHHHHCCCCCCCHH Q ss_conf 777--525898864123777214 Q gi|254780791|r 249 HET--DWTLADYAADLRAPTPTG 269 (529) Q Consensus 249 HE~--D~Tl~D~VAD~Ra~TPTa 269 (529) |=. =..|+ +.+|+|..++++ T Consensus 109 ~a~GgG~~la-l~cD~ria~~~a 130 (225) T PRK05869 109 YALGAGLTLA-LAADWRVSGDNV 130 (225) T ss_pred CCCHHHHHHH-HHHCHHHHCCCC T ss_conf 0034789999-970446307984 No 104 >PRK06461 single-stranded DNA-binding protein; Reviewed Probab=88.61 E-value=2.3 Score=23.41 Aligned_cols=79 Identities=16% Similarity=0.216 Sum_probs=54.2 Q ss_pred HCCCCEEEEEEECCC------CCCCCCCE-EEEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEE-EEEEEECCCCEE Q ss_conf 001718999997054------35688862-7999874894799999735210586681459889999-996675288437 Q gi|254780791|r 30 SNLSHVCVRGEISGY------RGIHSSGH-AYFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVI-GKITTFPGSSKY 101 (529) Q Consensus 30 ~~~~~~~v~gEis~~------~~~~~sGH-~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~-g~~~~y~~~g~~ 101 (529) ....+|-|.|.|-+. +....++. +.+.+.|+.+.|+.++|...+. .++.|+-|.+. |+++.| +|.+ T Consensus 12 pg~~~V~v~~rVl~~~~~r~~~~~~g~~~V~~~~V~DeTG~I~~tlWde~~~----~i~~GD~V~I~nayv~~~--~g~l 85 (130) T PRK06461 12 PGMRRVNVTGRVLEVGEPKVIQTKGGPRTISEAVVGDETGRVKLTLWGDQAG----TLKEGEVVKIENAWTTLY--RGKV 85 (130) T ss_pred CCCCCEEEEEEEEECCCCCEEECCCCCEEEEEEEEECCCCEEEEEEECCCCC----CCCCCCEEEEECCEEEEE--CCEE T ss_conf 9989826999999768972786289966899999987998699999456456----468999999944798888--8869 Q ss_pred EEEEE---EEEECCCC Q ss_conf 99999---71016800 Q gi|254780791|r 102 QIIIE---SLIPSGSG 114 (529) Q Consensus 102 ql~v~---~i~~~g~G 114 (529) +|+|- .|++.+.. T Consensus 86 ~L~vGk~g~I~~~~d~ 101 (130) T PRK06461 86 QLNVGRYGSISESDDE 101 (130) T ss_pred EEEECCCEEEEECCCC T ss_conf 9986896268976874 No 105 >PRK13732 single-stranded DNA-binding protein; Provisional Probab=88.57 E-value=2.3 Score=23.39 Aligned_cols=77 Identities=12% Similarity=0.245 Sum_probs=50.3 Q ss_pred CCCEEEEEEEC---CCCCCCCCCE--EEEEE------ECC--------CCEEEEEEECCCCCCCCCCCCCCCEEEEEEEE Q ss_conf 17189999970---5435688862--79998------748--------94799999735210586681459889999996 Q gi|254780791|r 32 LSHVCVRGEIS---GYRGIHSSGH--AYFSL------KDN--------HSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKI 92 (529) Q Consensus 32 ~~~~~v~gEis---~~~~~~~sGH--~Yf~l------kd~--------~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~ 92 (529) +..|-+.|-+. .++. ..+|. +-|+| ||. .--++||+|...+..+.-.++.|.+|.|.|++ T Consensus 6 ~NkViLiG~Lg~DPElr~-t~~G~~va~f~lAt~~~~~d~~~ge~~e~t~w~~vv~~gk~AE~~~~yl~KGs~V~VeGrL 84 (175) T PRK13732 6 INKVILVGRLGKDPEVRY-IPNGGAVANLQVATSESWRDKQTGEMREQTEWHRVVLFGKLAEVAGEYLRKGAQVYIEGQL 84 (175) T ss_pred CCEEEEEEECCCCCEEEE-CCCCCEEEEEEEEEECCCCCCCCCCEECCCCEEEEEEEHHHHHHHHHHHCCCCEEEEEEEE T ss_conf 758999874577961788-8999879999999836621277883641335489998648999999872799989999987 Q ss_pred EE--ECCCCEEEEEEEEEEE Q ss_conf 67--5288437999997101 Q gi|254780791|r 93 TT--FPGSSKYQIIIESLIP 110 (529) Q Consensus 93 ~~--y~~~g~~ql~v~~i~~ 110 (529) .. |+..|.=. +++.|.. T Consensus 85 ~~r~wed~G~~r-y~tevv~ 103 (175) T PRK13732 85 RTRSWEDNGITR-YVTEILV 103 (175) T ss_pred EECCCCCCCCEE-EEEEEEE T ss_conf 721501089669-8999998 No 106 >PRK06023 enoyl-CoA hydratase; Provisional Probab=88.56 E-value=1.4 Score=25.29 Aligned_cols=79 Identities=20% Similarity=0.223 Sum_probs=43.6 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH---HHHHHC------------CHHHHHHHHHHCCCEEEEEE-C Q ss_conf 1036799999999741003576777589995168884---442200------------76999999974890488520-5 Q gi|254780791|r 185 ECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI---EDLWHF------------NDEMIVRAIANSSIPIISAI-G 248 (529) Q Consensus 185 ~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~---eDL~~F------------N~e~laraI~~~~iPVisgI-G 248 (529) ....++..+++.++... .+.+|||.=.||++ -||-.| .-..+.+++..||.|||++| | T Consensus 30 ~~~~~l~~al~~~~~d~------~vr~vvltg~g~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~~~l~~~~kPvIaav~G 103 (248) T PRK06023 30 AMYAAMAKALKAGDADD------AIRAHVFLGVPGAFSSGNDMQDFMAAAMGGTSFGSEILDFLIALAETEKPIVSGVDG 103 (248) T ss_pred HHHHHHHHHHHHHHHCC------CCEEEEEECCCCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEC T ss_conf 99999999999985489------936999978998660898868886412355145799999999999589989999815 Q ss_pred CCCCC-HHHHHHHCCCCCCCHH Q ss_conf 77752-5898864123777214 Q gi|254780791|r 249 HETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 249 HE~D~-Tl~D~VAD~Ra~TPTa 269 (529) |=.=- .-.=+.+|+|..+++| T Consensus 104 ~a~GgG~~lal~cD~~ia~~~a 125 (248) T PRK06023 104 LAIGIGTTIHLHCDLTFATPRS 125 (248) T ss_pred EEEHHHHHHHHCCCEEEECCCC T ss_conf 3301468997234645634786 No 107 >KOG1680 consensus Probab=88.50 E-value=2 Score=24.02 Aligned_cols=98 Identities=20% Similarity=0.229 Sum_probs=58.2 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH-- Q ss_conf 652899984784258999999863059758999721001111036799999999741003576777589995168884-- Q gi|254780791|r 144 IPKIIAVITSPTGAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI-- 221 (529) Q Consensus 144 ~p~~i~vits~~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~-- 221 (529) --..||+||=-.--+..-|++. ...++++|+..++.... +-|||+-=.|||+ T Consensus 43 ~d~~I~lItlNRP~~~Nal~~~--------------------~m~eL~~A~~~~e~D~s------~~viVltG~gksFcs 96 (290) T KOG1680 43 EDNGIALITLNRPKALNALCRA--------------------TMLELAEAFKDFESDDS------VGVIVLTGSGKSFCS 96 (290) T ss_pred CCCCEEEEEECCHHHHCCCCHH--------------------HHHHHHHHHHHHHCCCC------CCEEEEECCCCCCCC T ss_conf 4787689983786775211589--------------------99999999997404676------547999768875236 Q ss_pred -HHH--------------HHCCHHHHHHHHHHCCCEEEEEE-CCC-CCCHHHHHHHCCCCCCCHHH Q ss_conf -442--------------20076999999974890488520-577-75258988641237772145 Q gi|254780791|r 222 -EDL--------------WHFNDEMIVRAIANSSIPIISAI-GHE-TDWTLADYAADLRAPTPTGA 270 (529) Q Consensus 222 -eDL--------------~~FN~e~laraI~~~~iPVisgI-GHE-~D~Tl~D~VAD~Ra~TPTaA 270 (529) -|| ..++.+.+ +..-+-|||.|| ||- .-=+-+-+++|+|.++|+|- T Consensus 97 G~Dl~e~~~~~~~~~~~~~~~~~~~~---~~~~~KPvIaainG~AlgGG~ELalmCDirva~~~Ak 159 (290) T KOG1680 97 GADLKEMKKDEFQDVSDGIFLRVWDL---VSRLKKPVIAAINGFALGGGLELALMCDIRVAGEGAK 159 (290) T ss_pred CCCHHHHHHCCCCCCCCCCCCCHHHH---HHHCCCCEEEEEECEEECCCHHHHHHCCEEECCCCCE T ss_conf 43789875211234454430012566---6524655137550366436122235313475267775 No 108 >COG0337 AroB 3-dehydroquinate synthetase [Amino acid transport and metabolism] Probab=88.46 E-value=2.3 Score=23.33 Aligned_cols=93 Identities=24% Similarity=0.296 Sum_probs=53.9 Q ss_pred CCCEEEEEECCCHHHH--HHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC Q ss_conf 6528999847842589--999998630597-5899972100111103679999999974100357677758999516888 Q gi|254780791|r 144 IPKIIAVITSPTGAVI--RDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGS 220 (529) Q Consensus 144 ~p~~i~vits~~~a~~--~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS 220 (529) .+.++.|||.++-+.+ .-+...+..... ...++.|.- +..+++-......+.+-+ ...+ -+-.||+=|||. T Consensus 32 ~~~k~~ivtd~~v~~~y~~~~~~~l~~~g~~v~~~~lp~G----E~~Ksl~~~~~i~~~ll~-~~~~-R~s~iialGGGv 105 (360) T COG0337 32 AGRKVAIVTDETVAPLYLEKLLATLEAAGVEVDSIVLPDG----EEYKSLETLEKIYDALLE-AGLD-RKSTLIALGGGV 105 (360) T ss_pred CCCEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEEEECCC----CCCCCHHHHHHHHHHHHH-CCCC-CCCEEEEECCHH T ss_conf 6974999977115787799999999855860579995798----450559999999999997-5999-886799978758 Q ss_pred HHHHHHCCHHHHHHHHHHCCCE Q ss_conf 4442200769999999748904 Q gi|254780791|r 221 IEDLWHFNDEMIVRAIANSSIP 242 (529) Q Consensus 221 ~eDL~~FN~e~laraI~~~~iP 242 (529) +.||..|=--...|=|---.|| T Consensus 106 igDlaGF~Aaty~RGv~fiqiP 127 (360) T COG0337 106 IGDLAGFAAATYMRGVRFIQIP 127 (360) T ss_pred HHHHHHHHHHHHHHCCCEEECC T ss_conf 7777789999997089767556 No 109 >PRK00843 egsA NAD(P)-dependent glycerol-1-phosphate dehydrogenase; Reviewed Probab=88.46 E-value=2.3 Score=23.33 Aligned_cols=82 Identities=20% Similarity=0.246 Sum_probs=50.0 Q ss_pred CCEEEEEECCCHHH--HHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHH Q ss_conf 52899984784258--9999998630597589997210011110367999999997410035767775899951688844 Q gi|254780791|r 145 PKIIAVITSPTGAV--IRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIE 222 (529) Q Consensus 145 p~~i~vits~~~a~--~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~e 222 (529) -+++.|||.++... -..+...+... ++++.. .+.|+....+|-+....+... +.| +||+=|||..- T Consensus 34 ~~k~lII~d~~v~~~~g~~v~~sL~~~-g~~v~~---~~~~e~s~~~i~~l~~~~~~~-------~~d-~vi~~GgG~~~ 101 (351) T PRK00843 34 KGNALIVTGPTTKEIAGKKVEDSLKDA-GFEVDV---VIVKDATMEEVEKVEEKARDS-------GAG-FLIGVGGGSSI 101 (351) T ss_pred CCCEEEEECCCHHHHHHHHHHHHHHHC-CCEEEE---EECCCCCHHHHHHHHHHHHHC-------CCC-EEEEECCHHHH T ss_conf 995899989527899999999999866-985999---937999999999999999845-------999-89995651684 Q ss_pred HHHHCCHHHHHHHHHHCCCEEEE Q ss_conf 42200769999999748904885 Q gi|254780791|r 223 DLWHFNDEMIVRAIANSSIPIIS 245 (529) Q Consensus 223 DL~~FN~e~laraI~~~~iPVis 245 (529) |+.-| +-+...+|+|+ T Consensus 102 D~~k~-------~a~~~~~p~i~ 117 (351) T PRK00843 102 DVAKL-------ASTRLGIPFIS 117 (351) T ss_pred HHHHH-------HHHHCCCCEEE T ss_conf 89999-------99982999899 No 110 >PRK08139 enoyl-CoA hydratase; Validated Probab=88.41 E-value=1.8 Score=24.43 Aligned_cols=39 Identities=23% Similarity=0.284 Sum_probs=25.1 Q ss_pred HHHHHHHHCCCEEEEEE-CCCCCC-HHHHHHHCCCCCCCHH Q ss_conf 99999974890488520-577752-5898864123777214 Q gi|254780791|r 231 MIVRAIANSSIPIISAI-GHETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 231 ~laraI~~~~iPVisgI-GHE~D~-Tl~D~VAD~Ra~TPTa 269 (529) .+..+|..|+.|||++| ||-.=- .-+=+.+|.|..++++ T Consensus 94 ~l~~~l~~~~kPvIaav~G~a~GgG~~lal~~D~~ias~~a 134 (265) T PRK08139 94 RVMQAIVALPQPVIARVHGIATAAGCQLVASCDLAVAADTA 134 (265) T ss_pred HHHHHHHHCCCCEEEEECCEEEEEECHHHCCCCEEEECCCC T ss_conf 99999985999899997782354140110047664532652 No 111 >PRK05809 3-hydroxybutyryl-CoA dehydratase; Validated Probab=88.39 E-value=1.8 Score=24.40 Aligned_cols=77 Identities=23% Similarity=0.352 Sum_probs=44.5 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC--H---HHHHHCCH-------------HHHHHHHHHCCCEEEEEE Q ss_conf 03679999999974100357677758999516888--4---44220076-------------999999974890488520 Q gi|254780791|r 186 CPKEIANAILQLNTLKEGRTCPRPDIIILARGGGS--I---EDLWHFND-------------EMIVRAIANSSIPIISAI 247 (529) Q Consensus 186 a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS--~---eDL~~FN~-------------e~laraI~~~~iPVisgI 247 (529) -..+|..++..++... .+.+|| .+|.|. + -||..|.+ ..+.+++..||.|||++| T Consensus 32 ~~~el~~al~~~~~d~------~vr~vv-l~g~G~~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~~~~l~~~~kPvIaav 104 (260) T PRK05809 32 TLKELDTVLDDLENDD------NVYAVI-LTGAGEKAFVAGADISEMKDLNEEEGRKFGLLGNKVFRRLENLDKPVIAAI 104 (260) T ss_pred HHHHHHHHHHHHHHCC------CCEEEE-EECCCCCCEECCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 9999999999998689------956999-966898874079974765306876789999999999999983899889997 Q ss_pred -CCCCCC-HHHHHHHCCCCCCCHH Q ss_conf -577752-5898864123777214 Q gi|254780791|r 248 -GHETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 248 -GHE~D~-Tl~D~VAD~Ra~TPTa 269 (529) ||=.=- +-+=+.+|+|..++.+ T Consensus 105 ~G~a~GgG~~lal~~D~~ia~~~a 128 (260) T PRK05809 105 NGFALGGGCELSMACDIRIASEKA 128 (260) T ss_pred CCEEEECCHHHHHHCCEEEECCCC T ss_conf 596666502765420134515897 No 112 >PRK06688 enoyl-CoA hydratase; Provisional Probab=88.33 E-value=1.7 Score=24.65 Aligned_cols=79 Identities=23% Similarity=0.316 Sum_probs=43.7 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH---HHHHHC--------------C--HHHHHHHHHHCCCEEEE Q ss_conf 1036799999999741003576777589995168884---442200--------------7--69999999748904885 Q gi|254780791|r 185 ECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI---EDLWHF--------------N--DEMIVRAIANSSIPIIS 245 (529) Q Consensus 185 ~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~---eDL~~F--------------N--~e~laraI~~~~iPVis 245 (529) .-..+|..+|..++... .+.+|||.=.|+++ -||-.| . -..+.++|.+||.|||+ T Consensus 32 ~~~~~l~~~~~~~~~d~------~v~~iVl~g~g~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~kPvIa 105 (263) T PRK06688 32 AMYQALADALEAAATDP------AVRVVLLTGAGGAFSAGGDIKDFDAAPGGGPTYPEDELAPVNRFLRAIAALPKPVVA 105 (263) T ss_pred HHHHHHHHHHHHHHHCC------CCEEEEEECCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEE T ss_conf 99999999999996389------936999977998676897679887256665055789899999999999958998899 Q ss_pred EE-CCCCC-CHHHHHHHCCCCCCCHH Q ss_conf 20-57775-25898864123777214 Q gi|254780791|r 246 AI-GHETD-WTLADYAADLRAPTPTG 269 (529) Q Consensus 246 gI-GHE~D-~Tl~D~VAD~Ra~TPTa 269 (529) +| ||-.= =+-+=+.+|.|..++.| T Consensus 106 ai~G~a~GgG~~la~~cD~ria~~~a 131 (263) T PRK06688 106 AVNGAAVGVGVSLALACDLVYASDSA 131 (263) T ss_pred EECCEECCCCHHHHHHCCCHHCCCCC T ss_conf 97576614527888623401004566 No 113 >PRK00002 aroB 3-dehydroquinate synthase; Reviewed Probab=88.06 E-value=2.3 Score=23.48 Aligned_cols=99 Identities=23% Similarity=0.334 Sum_probs=56.3 Q ss_pred CCEEEEEECCCHHHHH-HH-HHHHHH-CCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH Q ss_conf 5289998478425899-99-998630-59758999721001111036799999999741003576777589995168884 Q gi|254780791|r 145 PKIIAVITSPTGAVIR-DI-LQRISC-RFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI 221 (529) Q Consensus 145 p~~i~vits~~~a~~~-D~-~~~~~~-r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~ 221 (529) .+|+-|||..+-+-+. +- ...+.. .+++.++++|+ ||. .+++-...+.++.+.+. ...+ +-+||+=|||.+ T Consensus 32 ~~r~~iitD~~V~~l~~~~l~~~l~~~~~~~~~~~i~~---gE~-~K~l~~~~~i~~~l~~~-~~~R-~~~iiaiGGGvv 105 (360) T PRK00002 32 GRKVVIVTDETVAPLYLEKLRAALEAAGFEVDVVVLPD---GEQ-YKSLETLERIYDALLEA-GLDR-SDTLIALGGGVV 105 (360) T ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHCCCCEEEEEECC---CCC-CCCHHHHHHHHHHHHHC-CCCC-CCCEEEECCHHH T ss_conf 99799998985689999999999997699079999679---743-17798999999999976-9998-873153066376 Q ss_pred HHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHH Q ss_conf 44220076999999974890488520577752589886 Q gi|254780791|r 222 EDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYA 259 (529) Q Consensus 222 eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~V 259 (529) .||..|=--...| -||.|- .-|||.-+| T Consensus 106 ~DlaGF~As~y~R-----Gi~~i~-----iPTTLLa~v 133 (360) T PRK00002 106 GDLAGFAAATYMR-----GIRFIQ-----VPTTLLAQV 133 (360) T ss_pred HHHHHHHHHHHHC-----CCCEEE-----CCCHHHHHH T ss_conf 5889999999847-----997797-----460089998 No 114 >PRK07629 consensus Probab=87.97 E-value=1.5 Score=25.08 Aligned_cols=75 Identities=23% Similarity=0.341 Sum_probs=43.5 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCC-CH---HHHHHCC-------H---------HHHHHHHHHCCCEEEEE Q ss_conf 367999999997410035767775899951688-84---4422007-------6---------99999997489048852 Q gi|254780791|r 187 PKEIANAILQLNTLKEGRTCPRPDIIILARGGG-SI---EDLWHFN-------D---------EMIVRAIANSSIPIISA 246 (529) Q Consensus 187 ~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGG-S~---eDL~~FN-------~---------e~laraI~~~~iPVisg 246 (529) ..+|..++..++... .+.|||| +|-| ++ -||-.|+ + ..+..+|..||.|||++ T Consensus 33 ~~el~~a~~~~~~d~------~vrvvvl-tg~g~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~kPvIaa 105 (261) T PRK07629 33 IAELTAAFRALDADD------DVRAVVL-AARGPAFCAGADLNWMKRMAGYSDDENRADARKLARMLRTIYECPKPTIAR 105 (261) T ss_pred HHHHHHHHHHHHHCC------CCEEEEE-ECCCCCEECCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEE T ss_conf 999999999996299------9169999-779982146767088885113672556777999999999999589989999 Q ss_pred E-CCC--CCCHHHHHHHCCCCCCCHH Q ss_conf 0-577--7525898864123777214 Q gi|254780791|r 247 I-GHE--TDWTLADYAADLRAPTPTG 269 (529) Q Consensus 247 I-GHE--~D~Tl~D~VAD~Ra~TPTa 269 (529) | ||= -=..|+ +.+|+|..++.+ T Consensus 106 v~G~a~GgG~~la-l~~D~ria~~~a 130 (261) T PRK07629 106 VHGDAYAGGMGLV-AACDIAVAADTA 130 (261) T ss_pred ECCEEECCHHHHH-HHCCEEECCCCC T ss_conf 7895443607997-733421314787 No 115 >PRK08290 enoyl-CoA hydratase; Provisional Probab=87.94 E-value=1.9 Score=24.18 Aligned_cols=36 Identities=14% Similarity=0.153 Sum_probs=24.4 Q ss_pred HHHHHHHHCCCEEEEEE-----CCCCCCHHHHHHHCCCCCCCHH Q ss_conf 99999974890488520-----5777525898864123777214 Q gi|254780791|r 231 MIVRAIANSSIPIISAI-----GHETDWTLADYAADLRAPTPTG 269 (529) Q Consensus 231 ~laraI~~~~iPVisgI-----GHE~D~Tl~D~VAD~Ra~TPTa 269 (529) .+++.|.+||.|||.+| |=-.. +=+.+|+|..+++| T Consensus 106 ~~~~~i~~~~KPvIAaV~G~a~GgG~~---lalacDi~iAse~A 146 (284) T PRK08290 106 GYCRRWRDLPKPTIAQVQGACIAGGLM---LAWPCDLIVASDDA 146 (284) T ss_pred HHHHHHHCCCCCEEEEECCEEEEHHHH---HHHHCCEEEECCCC T ss_conf 999998529998899986875212068---78537789973998 No 116 >PRK06863 single-stranded DNA-binding protein; Provisional Probab=87.91 E-value=1.3 Score=25.73 Aligned_cols=80 Identities=18% Similarity=0.269 Sum_probs=53.9 Q ss_pred CCCEEEEEEEC---CCCCCCCCCEE--EEEE------ECCC--------CEEEEEEECCCCCCCCCCCCCCCEEEEEEEE Q ss_conf 17189999970---54356888627--9998------7489--------4799999735210586681459889999996 Q gi|254780791|r 32 LSHVCVRGEIS---GYRGIHSSGHA--YFSL------KDNH--------SRIDAIIWKGTLNKIEFLPEEGIEFLVIGKI 92 (529) Q Consensus 32 ~~~~~v~gEis---~~~~~~~sGH~--Yf~l------kd~~--------a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~ 92 (529) +..|-+.|-|. .+| +..||+. -|+| +|.+ .=++|++|+..+..+.-.++.|++|.|.|++ T Consensus 4 ~NkV~LiGnlg~DPElr-~t~sG~~v~~fslA~~~~~~~~~~ge~~e~t~w~~v~~wgk~AE~~~~yl~KG~~V~VeGrL 82 (168) T PRK06863 4 INKVIIVGHLGNDPEIR-TMPNGEAVANISVATSESWTDKNTGERREVTEWHRIVFYRRQAEVAGEYLRKGSQVYVEGRL 82 (168) T ss_pred CCEEEEEEECCCCCEEE-ECCCCCEEEEEEEEECCCCEECCCCEEEEECHHHHHHHCHHHHHHHHHHHCCCCEEEEEEEE T ss_conf 78899987468685488-87999879999999615612058873775020321200568999999875689879998888 Q ss_pred EE--EC-----CCCEEEEEEEEEEECC Q ss_conf 67--52-----8843799999710168 Q gi|254780791|r 93 TT--FP-----GSSKYQIIIESLIPSG 112 (529) Q Consensus 93 ~~--y~-----~~g~~ql~v~~i~~~g 112 (529) .. |. .++...++++.|+.-+ T Consensus 83 ~~r~w~dk~G~~r~~teV~~d~~~~l~ 109 (168) T PRK06863 83 KTRKWQDQNGQDRYTTEIQGDVLQMLG 109 (168) T ss_pred EECCCCCCCCCEEEEEEEEEEEEEECC T ss_conf 755118799998999999985799835 No 117 >PRK03932 asnC asparaginyl-tRNA synthetase; Validated Probab=87.91 E-value=2.5 Score=23.06 Aligned_cols=87 Identities=18% Similarity=0.248 Sum_probs=61.8 Q ss_pred HHHHHHHH--CC-CCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCC----CCCCCCCCEEEEEEEEEEE Q ss_conf 99999740--01-71899999705435688862799987489479999973521058----6681459889999996675 Q gi|254780791|r 23 HLKHIVES--NL-SHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKI----EFLPEEGIEFLVIGKITTF 95 (529) Q Consensus 23 ~i~~~l~~--~~-~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~----~~~~~~G~~v~~~g~~~~y 95 (529) .||+++.+ +. ..|.|.|=|.+.|. .++=++++|.|..+.+.+.+..+....- -..+..|+-|-|.|.|. - T Consensus 5 ~i~~~~~~~~~ig~~V~v~Grv~~~R~--~Gkl~Fi~LrD~tg~~qiq~~~~~~~~~~~~~~~~l~~gd~V~V~G~v~-~ 81 (462) T PRK03932 5 SIKDILKGKKAVGQEVTIRGWVRTKRD--SGKISFLELNDGSCFKQIQVVKNNELPNYGEEVFKLTTGSSVYVTGTVV-E 81 (462) T ss_pred CHHHHHCCCCCCCCEEEEEEEEEEEEC--CCCCEEEEEECCCCCEEEEEEECCCCHHHHHHHHCCCCCEEEEEEEEEE-E T ss_conf 799976582238998999897782781--7983899998289878999996797667999996699956999999999-2 Q ss_pred CCC--CEEEEEEEEEEECC Q ss_conf 288--43799999710168 Q gi|254780791|r 96 PGS--SKYQIIIESLIPSG 112 (529) Q Consensus 96 ~~~--g~~ql~v~~i~~~g 112 (529) .++ |++.+.|..++.-+ T Consensus 82 t~~~~~~~El~~~~i~vls 100 (462) T PRK03932 82 SPGAGQGYELQATKIEVIG 100 (462) T ss_pred CCCCCCCEEEEEEEEEEEE T ss_conf 5999977799983899983 No 118 >PRK07799 enoyl-CoA hydratase; Provisional Probab=87.91 E-value=1.4 Score=25.28 Aligned_cols=77 Identities=18% Similarity=0.172 Sum_probs=42.2 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC-H---HHHHHC----------------CHHHHHHHHHHCCCEEE Q ss_conf 103679999999974100357677758999516888-4---442200----------------76999999974890488 Q gi|254780791|r 185 ECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGS-I---EDLWHF----------------NDEMIVRAIANSSIPII 244 (529) Q Consensus 185 ~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS-~---eDL~~F----------------N~e~laraI~~~~iPVi 244 (529) .-..+|..++..++... .+.+||| +|.|. + -||-.| ........+..||.||| T Consensus 32 ~m~~el~~al~~~~~d~------~vr~vvl-tg~G~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~kPvI 104 (263) T PRK07799 32 EMLAIMVDAWDRVDNDP------DIRSCIL-TGAGGYFCAGMDLKAATQKPPGDSFKDGSYDPSRIDALLKGRRLTKPLI 104 (263) T ss_pred HHHHHHHHHHHHHHHCC------CEEEEEE-ECCCCCEECCCCHHHHHCCCCCHHHHHHHCCHHHHHHHHHHCCCCCCEE T ss_conf 99999999999986199------9279999-8889825358666887506851245541102789999998635999889 Q ss_pred EEE-CCCC--CCHHHHHHHCCCCCCCHH Q ss_conf 520-5777--525898864123777214 Q gi|254780791|r 245 SAI-GHET--DWTLADYAADLRAPTPTG 269 (529) Q Consensus 245 sgI-GHE~--D~Tl~D~VAD~Ra~TPTa 269 (529) ++| ||=. =..|+ +.+|+|..+|.| T Consensus 105 aav~G~a~GgG~~la-l~cD~~ia~~~a 131 (263) T PRK07799 105 AAVEGPAIAGGTEIL-QGTDIRVAGESA 131 (263) T ss_pred EEECCEEEHHHHHHH-HHCCEEEECCCC T ss_conf 998587556889998-845766630665 No 119 >PRK13010 purU formyltetrahydrofolate deformylase; Reviewed Probab=87.76 E-value=2.6 Score=22.99 Aligned_cols=89 Identities=15% Similarity=0.158 Sum_probs=50.8 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHH----------------------CC Q ss_conf 00799999999997654012261001631026528999847842589999998630----------------------59 Q gi|254780791|r 113 SGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISC----------------------RF 170 (529) Q Consensus 113 ~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~----------------------r~ 170 (529) ..+|...|+.+-+++..+--+... .-++|++|..|+-+--+.|+|-.-+. +| T Consensus 67 ~~~l~~~f~~ia~~~~m~w~l~~~------~~k~RvaImVSK~~HCL~DLL~R~~~GeL~~eI~~VISNH~dL~~lae~~ 140 (289) T PRK13010 67 VDTFRQEFQPVAEKFDMQWAIHPD------GQRPKVVIMVSKFDHCLNDLLYRWRMGELDMDIVGIISNHPDLQPLAVQH 140 (289) T ss_pred HHHHHHHHHHHHHHCCCEEEEECC------CCCCEEEEEEECHHHHHHHHHHHHHCCCCCCEEEEEEECCHHHHHHHHHC T ss_conf 999999999999764978999616------77846999996724509999999874985860149997877789999977 Q ss_pred CEEEEEEECCCC-CCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECC Q ss_conf 758999721001-11103679999999974100357677758999516 Q gi|254780791|r 171 PLRVIIFPVKVQ-GDECPKEIANAILQLNTLKEGRTCPRPDIIILARG 217 (529) Q Consensus 171 p~~~~~~p~~vQ-G~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RG 217 (529) .+..+.+|+.=. .+.+++.+.+-|+.. ++|+||++|= T Consensus 141 gIPF~hipv~~~~k~~~e~~~~~~~~~~----------~~d~vvla~y 178 (289) T PRK13010 141 DIPFHHLPVTPDTKAQQEAQILDLIETS----------GAELVVLARY 178 (289) T ss_pred CCCEEEECCCCCCHHHHHHHHHHHHHHC----------CCCEEEEECH T ss_conf 9983982687775267899999998713----------9889987054 No 120 >PRK07468 enoyl-CoA hydratase; Provisional Probab=87.74 E-value=1.7 Score=24.60 Aligned_cols=39 Identities=15% Similarity=0.221 Sum_probs=25.9 Q ss_pred HHHHHHHHCCCEEEEEE-CCCCCC-HHHHHHHCCCCCCCHH Q ss_conf 99999974890488520-577752-5898864123777214 Q gi|254780791|r 231 MIVRAIANSSIPIISAI-GHETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 231 ~laraI~~~~iPVisgI-GHE~D~-Tl~D~VAD~Ra~TPTa 269 (529) .+.++|..||.|||++| ||-.-- .-+=+.+|+|..+|+| T Consensus 91 ~~~~~i~~~~kPvIaav~G~a~GgG~~lal~~D~~ia~~~a 131 (262) T PRK07468 91 LMLKALNDLPKPLIGRIQGQAFGGGVGLISVCDVAIGVSGA 131 (262) T ss_pred HHHHHHHHCCCCEEEEECCEEEECCCEEECCCCEEEECCCC T ss_conf 99999984899899997476755554020457678973787 No 121 >PRK07830 consensus Probab=87.66 E-value=2 Score=24.00 Aligned_cols=40 Identities=23% Similarity=0.356 Sum_probs=26.9 Q ss_pred HHHHHHHHHCCCEEEEEE-CCCCC-CHHHHHHHCCCCCCCHH Q ss_conf 999999974890488520-57775-25898864123777214 Q gi|254780791|r 230 EMIVRAIANSSIPIISAI-GHETD-WTLADYAADLRAPTPTG 269 (529) Q Consensus 230 e~laraI~~~~iPVisgI-GHE~D-~Tl~D~VAD~Ra~TPTa 269 (529) ..+.++|..||+|||++| ||=.= =.-.=+.+|+|..++.+ T Consensus 94 ~~~~~~i~~~~kPvIaav~G~a~GgG~~lal~cD~~ia~~~a 135 (267) T PRK07830 94 NRAVRAIVALPVPVVAVVQGPAAGVGVSLALACDLVLASESA 135 (267) T ss_pred HHHHHHHHHCCCCEEEEECCEEEEHHHHHHHHHCCCCCCCCC T ss_conf 999999996899899997783770257999743213334574 No 122 >PRK07511 enoyl-CoA hydratase; Provisional Probab=87.64 E-value=2 Score=24.03 Aligned_cols=78 Identities=18% Similarity=0.162 Sum_probs=46.7 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH----HHHHHCC----------------HHHHHHHHHHCCCEEE Q ss_conf 1036799999999741003576777589995168884----4422007----------------6999999974890488 Q gi|254780791|r 185 ECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI----EDLWHFN----------------DEMIVRAIANSSIPII 244 (529) Q Consensus 185 ~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~----eDL~~FN----------------~e~laraI~~~~iPVi 244 (529) .-..+|..++..++... .+.+||| +|.|-. -||-.|. -..+.+++..||.||| T Consensus 30 ~m~~el~~al~~~~~d~------~vr~vvl-tg~g~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~kPvI 102 (259) T PRK07511 30 DMYAAGIEALATAERDP------SIRAVVL-TGADGFFCAGGNLNRLLENRAKPPSVQAASIDLLHDWIRAIRAFPKPVI 102 (259) T ss_pred HHHHHHHHHHHHHHHCC------CCEEEEE-ECCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEE T ss_conf 99999999999997689------9659999-8999962364368988634248988999999999999999983999899 Q ss_pred EEE-CCCCC-CHHHHHHHCCCCCCCHH Q ss_conf 520-57775-25898864123777214 Q gi|254780791|r 245 SAI-GHETD-WTLADYAADLRAPTPTG 269 (529) Q Consensus 245 sgI-GHE~D-~Tl~D~VAD~Ra~TPTa 269 (529) ++| ||-.= =+-.=+.+|+|..+++| T Consensus 103 aav~G~a~GgG~~lal~~D~~ia~~~a 129 (259) T PRK07511 103 AAVEGAAAGAGFSLALACDLLVAADDA 129 (259) T ss_pred EEECCCEEHHHHHHHHHCCCCCCCCCC T ss_conf 996570311338999832726545455 No 123 >TIGR03210 badI 2-ketocyclohexanecarboxyl-CoA hydrolase. Members of this protein family are 2-ketocyclohexanecarboxyl-CoA hydrolase, a ring-opening enzyme that acts in catabolism of molecules such as benzoyl-CoA and cyclohexane carboxylate. It converts -ketocyclohexanecarboxyl-CoA to pimelyl-CoA. It is not sensitive to oxygen. Probab=87.44 E-value=2.1 Score=23.68 Aligned_cols=78 Identities=19% Similarity=0.239 Sum_probs=45.1 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCC-CCH---HHHHHCC------------HHHHHHHHHHCCCEEEEEE-C Q ss_conf 036799999999741003576777589995168-884---4422007------------6999999974890488520-5 Q gi|254780791|r 186 CPKEIANAILQLNTLKEGRTCPRPDIIILARGG-GSI---EDLWHFN------------DEMIVRAIANSSIPIISAI-G 248 (529) Q Consensus 186 a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGG-GS~---eDL~~FN------------~e~laraI~~~~iPVisgI-G 248 (529) --.++..++..++... .+-||||...| +++ -|+..|+ -+.+..+|..||.|||++| | T Consensus 30 ~~~~l~~al~~~~~d~------~vr~vIltg~G~~~FcaG~D~~~~~~~~~~~~~~~~~~~~~~~~i~~~~kPvIaav~G 103 (256) T TIGR03210 30 TCDELIHALKDAGYDR------QIGVIVLAGAGDKAFCTGGDQSTHDGGYDGRGTIGLPMEELHSAIRDVPKPVIARVQG 103 (256) T ss_pred HHHHHHHHHHHHHHCC------CCEEEEEECCCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECC T ss_conf 9999999999873399------9629999679974110477532221475411233016999999998489989999888 Q ss_pred CCCCC-HHHHHHHCCCCCCCHH Q ss_conf 77752-5898864123777214 Q gi|254780791|r 249 HETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 249 HE~D~-Tl~D~VAD~Ra~TPTa 269 (529) |=.=- .-.=..+|+|.+++++ T Consensus 104 ~a~GgG~~lal~~D~~iA~~~a 125 (256) T TIGR03210 104 YAIGGGNVLVTICDLTIASEKA 125 (256) T ss_pred EEEHHHHHHHHHHCCCCCCCCC T ss_conf 6666869999852415234787 No 124 >PRK07459 single-stranded DNA-binding protein; Provisional Probab=87.44 E-value=2.6 Score=22.85 Aligned_cols=80 Identities=23% Similarity=0.275 Sum_probs=52.1 Q ss_pred CCCEEEEEEECC---CCCCCCCC--EEEEEEE------C-CCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEE--EECC Q ss_conf 171899999705---43568886--2799987------4-8947999997352105866814598899999966--7528 Q gi|254780791|r 32 LSHVCVRGEISG---YRGIHSSG--HAYFSLK------D-NHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKIT--TFPG 97 (529) Q Consensus 32 ~~~~~v~gEis~---~~~~~~sG--H~Yf~lk------d-~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~--~y~~ 97 (529) +..|-+.|-+.. ++. -.|| .+=|+|- + +..-+.|++|...+..+.-.++-|++|.|.|++. -|.. T Consensus 3 mN~V~LiGrLg~DpElr~-t~sG~~v~~fslAv~~~~k~~~t~w~~v~~wgk~Ae~~~~yl~KG~~V~V~G~l~~~~w~d 81 (121) T PRK07459 3 LNSVTLVGRAGRDPEVRY-FESGSVVCNLTLAVNRRSRDDEPDWFNLEIWGKTAQVAADYVKKGSLIGITGSLKFDRWTD 81 (121) T ss_pred EEEEEEEEECCCCCEEEE-CCCCCEEEEEEEEECCCCCCCCCCEEEEEEECHHHHHHHHHHCCCCEEEEEEEEECCCCEE T ss_conf 018999996676961889-7999789999998467535778749999998908999988736898999999986542475 Q ss_pred ------CCEEEEEEEEEEECC Q ss_conf ------843799999710168 Q gi|254780791|r 98 ------SSKYQIIIESLIPSG 112 (529) Q Consensus 98 ------~g~~ql~v~~i~~~g 112 (529) +....+.|+.|+.-| T Consensus 82 k~~G~~r~~~~V~v~~i~fl~ 102 (121) T PRK07459 82 RNTGEDRSKPVIRVDRLELLG 102 (121) T ss_pred CCCCEEEEEEEEEEEEEEECC T ss_conf 789989999999996999836 No 125 >PRK08258 enoyl-CoA hydratase; Provisional Probab=87.35 E-value=2.1 Score=23.77 Aligned_cols=39 Identities=31% Similarity=0.499 Sum_probs=25.6 Q ss_pred HHHHHHHHCCCEEEEEE-CCCCCC-HHHHHHHCCCCCCCHH Q ss_conf 99999974890488520-577752-5898864123777214 Q gi|254780791|r 231 MIVRAIANSSIPIISAI-GHETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 231 ~laraI~~~~iPVisgI-GHE~D~-Tl~D~VAD~Ra~TPTa 269 (529) .+.+++.+|+.|||++| ||=.=- .-+=+.+|+|..++++ T Consensus 97 ~~~~~i~~~~kPvIaav~G~a~GgG~~la~~cD~~ia~~~a 137 (270) T PRK08258 97 DLVKAMRACPQPVIAAVDGVCAGAGAILAMASDLRLATPSA 137 (270) T ss_pred HHHHHHHHCCCCEEEEECCEEEHHHHHHHHHCCCCCCCCCC T ss_conf 99999985899989998787546769999852624146666 No 126 >PRK06210 enoyl-CoA hydratase; Provisional Probab=87.23 E-value=2.3 Score=23.45 Aligned_cols=35 Identities=20% Similarity=0.245 Sum_probs=19.7 Q ss_pred HHHHHHHHCCCEEEEEE-CC----CCCCHHHHHHHCCCCCCCH Q ss_conf 99999974890488520-57----7752589886412377721 Q gi|254780791|r 231 MIVRAIANSSIPIISAI-GH----ETDWTLADYAADLRAPTPT 268 (529) Q Consensus 231 ~laraI~~~~iPVisgI-GH----E~D~Tl~D~VAD~Ra~TPT 268 (529) .+..++..||.|||++| || -... =+++|+|..++. T Consensus 99 ~~~~~~~~~~kPvIaav~G~a~GgG~~l---a~~cD~rias~~ 138 (272) T PRK06210 99 TRYHFLTALRKPVIAAINGACAGIGLTH---ALMCDVRFAADG 138 (272) T ss_pred HHHHHHHHCCCCEEEEEEEEEECCCCEE---EECCCEEEECCC T ss_conf 9999998489988999823660487512---213654342477 No 127 >pfam04392 ABC_sub_bind ABC transporter substrate binding protein. This family contains many hypothetical proteins and some ABC transporter substrate binding proteins. Probab=87.21 E-value=2.7 Score=22.75 Aligned_cols=85 Identities=15% Similarity=0.260 Sum_probs=53.4 Q ss_pred CCEEEEEECCC---HHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH Q ss_conf 52899984784---258999999863059758999721001111036799999999741003576777589995168884 Q gi|254780791|r 145 PKIIAVITSPT---GAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI 221 (529) Q Consensus 145 p~~i~vits~~---~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~ 221 (529) -++||||-+++ +...-..++....++++++..+++ ....++-.++..+.. ++|++++.-.+-- T Consensus 130 ~k~igviyn~~e~~s~~~~~~~~~~a~~~gi~l~~~~v-----~~~~ei~~a~~~l~~--------~~Dal~i~~d~~v- 195 (292) T pfam04392 130 VKSIGVYYSPSEANSVSLVEEIKKYAKKSGIKVVEASV-----PSSNDVPSAMSSMAG--------KVDAIFIPTDNLI- 195 (292) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEEC-----CCHHHHHHHHHHHHC--------CCCEEEEECCCCH- T ss_conf 85899995799865799999999999976998999966-----886679999997432--------8988999378107- Q ss_pred HHHHHCCHHHHHHHHHHCCCEEEEEE Q ss_conf 44220076999999974890488520 Q gi|254780791|r 222 EDLWHFNDEMIVRAIANSSIPIISAI 247 (529) Q Consensus 222 eDL~~FN~e~laraI~~~~iPVisgI 247 (529) . =+-+.++....+.++||+++- T Consensus 196 ~----s~~~~i~~~a~~~kiPv~~~~ 217 (292) T pfam04392 196 A----SAFTAVLQEANKAKIPVITSD 217 (292) T ss_pred H----HHHHHHHHHHHHCCCCEEECC T ss_conf 8----899999999997499989577 No 128 >TIGR03200 dearomat_oah 6-oxocyclohex-1-ene-1-carbonyl-CoA hydrolase. Members of this protein family are 6-oxocyclohex-1-ene-1-carbonyl-CoA hydrolase, a ring-hydrolyzing enzyme in the anaerobic metabolism of aromatic enzymes by way of benzoyl-CoA, as seen in Thauera aromatica, Geobacter metallireducens, and Azoarcus sp. Note that Rhodopseudomonas palustris uses a different pathway to perform a similar degradation of benzoyl-CoA to 3-hydroxpimelyl-CoA. Probab=87.14 E-value=1.8 Score=24.39 Aligned_cols=76 Identities=24% Similarity=0.391 Sum_probs=45.2 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCEEEEEC-C------CC-----------CHHHHHHCCHH--HHHHHHHHCCCEEEE Q ss_conf 0367999999997410035767775899951-6------88-----------84442200769--999999748904885 Q gi|254780791|r 186 CPKEIANAILQLNTLKEGRTCPRPDIIILAR-G------GG-----------SIEDLWHFNDE--MIVRAIANSSIPIIS 245 (529) Q Consensus 186 a~~~i~~ai~~~~~~~~~~~~~~~D~iii~R-G------GG-----------S~eDL~~FN~e--~laraI~~~~iPVis 245 (529) ...+++.|++.+.... .+-|||+.= | || -..|.+.|..- .+.+.|-.||-|||. T Consensus 56 tVkEli~AF~~A~~D~------sVgvIVLTGaGDKAFCTGGDqke~ae~yag~p~e~~~~~~l~~~m~~~Ir~cpKPVIA 129 (360) T TIGR03200 56 MVKAIILAFRRASSDR------DVVAVVFTAVGDKAFCTGGNTKEYAEYYAGNPQEYRQYMRLFNDMVSAILGCDKPVIC 129 (360) T ss_pred HHHHHHHHHHHHCCCC------CEEEEEEECCCCCCEECCCCHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEE T ss_conf 9999999999724288------5239999558761121587678754577788523577878999999998579987699 Q ss_pred EE-------CCC----CCCHHHHHHHCCCCCCC Q ss_conf 20-------577----75258988641237772 Q gi|254780791|r 246 AI-------GHE----TDWTLADYAADLRAPTP 267 (529) Q Consensus 246 gI-------GHE----~D~Tl~D~VAD~Ra~TP 267 (529) .| ||| .|.||+---|-.--.-| T Consensus 130 rVNG~AVGGG~eL~maCDLtIAsD~A~FGQ~GP 162 (360) T TIGR03200 130 RVNGMRIGGGQEIGMAADFTIAQDLANFGQAGP 162 (360) T ss_pred EECCEEECCCCEEEEEEHHHHHHHHHHHCCCCC T ss_conf 871588438717887630134544445427888 No 129 >PRK05674 gamma-carboxygeranoyl-CoA hydratase; Validated Probab=87.04 E-value=1.8 Score=24.34 Aligned_cols=35 Identities=14% Similarity=0.208 Sum_probs=25.9 Q ss_pred HHHHHHHHHCCCEEEEEE-------CCCCCCHHHHHHHCCCCCCCHH Q ss_conf 999999974890488520-------5777525898864123777214 Q gi|254780791|r 230 EMIVRAIANSSIPIISAI-------GHETDWTLADYAADLRAPTPTG 269 (529) Q Consensus 230 e~laraI~~~~iPVisgI-------GHE~D~Tl~D~VAD~Ra~TPTa 269 (529) ..+..+|.+||.|||++| |.+ +=+++|+|..+++| T Consensus 91 ~~~~~~i~~~~kPvIaaV~G~a~GgG~~-----lal~cD~~ia~~~a 132 (271) T PRK05674 91 AELMYALARLKIPTLAVVQGAAFGGALG-----LISCCDMAIGAEDA 132 (271) T ss_pred HHHHHHHHHCCCCEEEEECCHHHEHHHH-----HHHHCCEEECCCCC T ss_conf 9999999968998999977830024688-----87630622211156 No 130 >PRK06958 single-stranded DNA-binding protein; Provisional Probab=87.02 E-value=1.2 Score=25.83 Aligned_cols=79 Identities=18% Similarity=0.272 Sum_probs=53.3 Q ss_pred CCCEEEEEEEC---CCCCCCCCCEEE--EEEE------CC--------CCEEEEEEECCCCCCCCCCCCCCCEEEEEEEE Q ss_conf 17189999970---543568886279--9987------48--------94799999735210586681459889999996 Q gi|254780791|r 32 LSHVCVRGEIS---GYRGIHSSGHAY--FSLK------DN--------HSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKI 92 (529) Q Consensus 32 ~~~~~v~gEis---~~~~~~~sGH~Y--f~lk------d~--------~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~ 92 (529) +..|-+.|-|. .++ +-.+|... |+|- |. ..=++|++|...+..+.-.++.|++|.|.|++ T Consensus 4 ~NkV~LiG~L~~DPelr-~t~~G~~va~f~lA~~~~~~d~~~ge~~e~t~w~~v~~~gk~AE~~~~yl~KG~~V~VeGrL 82 (181) T PRK06958 4 VNKVILVGNLGADPEVR-YLPSGDAVANIRLATTDRYKDKASGEFKEATEWHRVAFFGRLAEIVGEYLKKGSSVYIEGRI 82 (181) T ss_pred CCEEEEEEECCCCCEEE-ECCCCCEEEEEEEEEECCEEECCCCCEEECCEEEEEECCCHHHHHHHHHCCCCCEEEEEEEE T ss_conf 78799988578696587-87999779999999863333237883763163778503356999998644799989997577 Q ss_pred EE--ECC-----CCEEEEEEEEEEEC Q ss_conf 67--528-----84379999971016 Q gi|254780791|r 93 TT--FPG-----SSKYQIIIESLIPS 111 (529) Q Consensus 93 ~~--y~~-----~g~~ql~v~~i~~~ 111 (529) .. |+. ++...++|+.|+.- T Consensus 83 ~~r~w~dkdG~~ry~tEiv~d~~~~l 108 (181) T PRK06958 83 RTRKWQGQDGQDRYSTEIVADQMQML 108 (181) T ss_pred EECCCCCCCCCEEEEEEEEEEEEEEE T ss_conf 70645978999899999998688981 No 131 >cd07025 Peptidase_S66 LD-Carboxypeptidase, a serine protease, includes microcin C7 self immunity protein. LD-carboxypeptidase (Muramoyltetrapeptide carboxypeptidase; EC 3.4.17.13; Merops family S66; initially described as Carboxypeptidase II) family also includes the microcin c7 self-immunity protein (MccF) as well as uncharacterized proteins including hypothetical proteins. LD-carboxypeptidase hydrolyzes the amide bond that links the dibasic amino acids to C-terminal D-amino acids. The physiological substrates of LD-carboxypeptidase are tetrapeptide fragments (such as UDP-MurNAc-tetrapeptides) that are produced when bacterial cell walls are degraded; they contain an L-configured residue (L-lysine or meso-diaminopimelic acid residue) as the penultimate residue and D-alanine as the ultimate residue. A possible role of LD-carboxypeptidase is in peptidoglycan recycling whereby the resulting tripeptide (precursor for murein synthesis) can be reconverted into peptidoglycan by attachment o Probab=86.98 E-value=2.8 Score=22.65 Aligned_cols=87 Identities=22% Similarity=0.280 Sum_probs=52.2 Q ss_pred HHHHHHC-CC-E-EEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC-HHHHHHCCHHHHHHHHH- Q ss_conf 9986305-97-5-899972100111103679999999974100357677758999516888-44422007699999997- Q gi|254780791|r 163 LQRISCR-FP-L-RVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGS-IEDLWHFNDEMIVRAIA- 237 (529) Q Consensus 163 ~~~~~~r-~p-~-~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS-~eDL~~FN~e~laraI~- 237 (529) .+.+-.. +| + .-+||==.|-. .+..|-++|....... ...++-.||++|=-+- .++-..+.-+++++.++ T Consensus 180 ~~l~GT~~~p~~~g~ILfLEd~~e--~~~~idR~L~~L~~aG---~f~~~~gii~G~f~~~~~~~~~~~~~~~il~~~~~ 254 (282) T cd07025 180 ASLLGTPYLPDTEGKILFLEDVGE--PPYRIDRMLTQLKLAG---VLDKVAGIILGRFTDCEDNDDFGYTLEEVLKEVLG 254 (282) T ss_pred HHHCCCCCCCCCCCCEEEEEECCC--CHHHHHHHHHHHHHHC---CCCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 976398767787781899980799--9899999999999839---75357089995687788877657339999999875 Q ss_pred HCCCEEEEE--ECCCCCCH Q ss_conf 489048852--05777525 Q gi|254780791|r 238 NSSIPIISA--IGHETDWT 254 (529) Q Consensus 238 ~~~iPVisg--IGHE~D~T 254 (529) ...+||+.+ +||..+.. T Consensus 255 ~~~iPv~~~~~~GH~~~~~ 273 (282) T cd07025 255 DLGIPVLYGLPIGHTPPNL 273 (282) T ss_pred HCCCCEEECCCCCCCCCCE T ss_conf 1798189899998999863 No 132 >PRK02801 primosomal replication protein N; Provisional Probab=86.92 E-value=2.8 Score=22.63 Aligned_cols=75 Identities=20% Similarity=0.268 Sum_probs=49.6 Q ss_pred EEEEEEECCC---CCCCCCC--EEEEEEECCC------------CEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECC Q ss_conf 8999997054---3568886--2799987489------------479999973521058668145988999999667528 Q gi|254780791|r 35 VCVRGEISGY---RGIHSSG--HAYFSLKDNH------------SRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPG 97 (529) Q Consensus 35 ~~v~gEis~~---~~~~~sG--H~Yf~lkd~~------------a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~ 97 (529) +-+.|+|... +..| +| ||.|.|.... -.|.++++...+..+.-.++-|.+|.+.|-++.-.. T Consensus 5 l~Lsg~v~~~~~lr~SP-AGIp~~~f~LeH~S~Q~EAg~~Rqv~c~i~vv~~G~~~~~~~~~l~~G~~v~v~GFLa~~~~ 83 (101) T PRK02801 5 LVLSGTVCRTPKRKVSP-SGIPHCQFVLEHRSVQEEAGFLRQAWCQMPVIVSGHQFQAITQSITVGSEITVQGFISCHKG 83 (101) T ss_pred EEEEEEEEECCCEEECC-CCCEEEEEEEEEEEEEEECCCCEEEEEEEEEEEEHHHHHHHHHCCCCCCEEEEEEEEEECCC T ss_conf 99999995067310589-97822899998778854279827899999999961678987504589988999999841256 Q ss_pred CCEE-E--EEEEEEEE Q ss_conf 8437-9--99997101 Q gi|254780791|r 98 SSKY-Q--IIIESLIP 110 (529) Q Consensus 98 ~g~~-q--l~v~~i~~ 110 (529) +-++ + |-+++|+. T Consensus 84 rng~~~LVLHa~~Ie~ 99 (101) T PRK02801 84 RNGLSKLVLHAEQIEL 99 (101) T ss_pred CCCCCCEEEEEEEEEE T ss_conf 5787439999025998 No 133 >PRK10481 hypothetical protein; Provisional Probab=86.92 E-value=2.8 Score=22.63 Aligned_cols=80 Identities=18% Similarity=0.261 Sum_probs=56.9 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCC-E---EEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH Q ss_conf 28999847842589999998630597-5---8999721001111036799999999741003576777589995168884 Q gi|254780791|r 146 KIIAVITSPTGAVIRDILQRISCRFP-L---RVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI 221 (529) Q Consensus 146 ~~i~vits~~~a~~~D~~~~~~~r~p-~---~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~ 221 (529) .++|||++. .+-+...+++|. + .+...-++.+|.. .++..|-+.+... ..|+||+ T Consensus 129 ~~lGVvvP~-----~eQ~~~~~~kW~~~~~~~~~~~asPY~g~~--~~l~~Aa~~L~~~-------gadlivL------- 187 (224) T PRK10481 129 HQLGVIVPV-----EEQLALQAKKWQVLQKPPVVSLASPYHGSE--QKLIDAGKELLAQ-------GADLIVL------- 187 (224) T ss_pred CEEEEEECC-----HHHHHHHHHHHHHCCCCEEEEEECCCCCCH--HHHHHHHHHHHHC-------CCCEEEE------- T ss_conf 869999627-----899999999887347861699837988988--9999999999865-------9999999------- Q ss_pred HHHHHCCHHHHHHHHHHCCCEEEEEE Q ss_conf 44220076999999974890488520 Q gi|254780791|r 222 EDLWHFNDEMIVRAIANSSIPIISAI 247 (529) Q Consensus 222 eDL~~FN~e~laraI~~~~iPVisgI 247 (529) |=+.|+.+.=...-..+.+|||.+. T Consensus 188 -DCiGYt~~~r~~~~~~~g~PViLsr 212 (224) T PRK10481 188 -DCLGYHQRHRDLLQKALDVPVLLSN 212 (224) T ss_pred -ECCCCCHHHHHHHHHHHCCCEEEEH T ss_conf -8777759999999998599999518 No 134 >cd06558 crotonase-like Crotonase/Enoyl-Coenzyme A (CoA) hydratase superfamily. This superfamily contains a diverse set of enzymes including enoyl-CoA hydratase, napthoate synthase, methylmalonyl-CoA decarboxylase, 3-hydoxybutyryl-CoA dehydratase, and dienoyl-CoA isomerase. Many of these play important roles in fatty acid metabolism. In addition to a conserved structural core and the formation of trimers (or dimers of trimers), a common feature in this superfamily is the stabilization of an enolate anion intermediate derived from an acyl-CoA substrate. This is accomplished by two conserved backbone NH groups in active sites that form an oxyanion hole. Probab=86.81 E-value=2.4 Score=23.30 Aligned_cols=80 Identities=24% Similarity=0.432 Sum_probs=44.4 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECC-CCC------HHHHHHCC------------HHHHHHHHHHCCCEEEE Q ss_conf 103679999999974100357677758999516-888------44422007------------69999999748904885 Q gi|254780791|r 185 ECPKEIANAILQLNTLKEGRTCPRPDIIILARG-GGS------IEDLWHFN------------DEMIVRAIANSSIPIIS 245 (529) Q Consensus 185 ~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RG-GGS------~eDL~~FN------------~e~laraI~~~~iPVis 245 (529) ....++.++++.++... .+.+||| +| |++ +.++..+. -..+.+.+..||.|||+ T Consensus 26 ~~~~~l~~~~~~~~~d~------~~~~vvl-~g~g~~FsaG~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~kPvIa 98 (195) T cd06558 26 EMLDELAAALDEAEADP------DVRVVVL-TGAGKAFCAGADLKELAALSDAGEEARAFIRELQELLRALLRLPKPVIA 98 (195) T ss_pred HHHHHHHHHHHHHHHCC------CCEEEEE-ECCCCCEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEE T ss_conf 99999999999997499------9579999-7499870479975888726666688999999999999999978999999 Q ss_pred EE-----CCCCCCHHHHHHHCCCCCCCHHHHHHHCC Q ss_conf 20-----57775258988641237772145676332 Q gi|254780791|r 246 AI-----GHETDWTLADYAADLRAPTPTGAAEMAVP 276 (529) Q Consensus 246 gI-----GHE~D~Tl~D~VAD~Ra~TPTaAAElavp 276 (529) +| |.-.... +.+|.|..+++ |.+..| T Consensus 99 ai~G~a~GgG~~la---l~~D~~ia~~~--a~f~~p 129 (195) T cd06558 99 AVNGAALGGGLELA---LACDIRIAAED--AKFGLP 129 (195) T ss_pred EECCCCCCCCCHHH---HHCCEEEEECC--CEEECC T ss_conf 86885277750646---61888998089--857466 No 135 >PRK08227 aldolase; Validated Probab=86.79 E-value=2.8 Score=22.57 Aligned_cols=88 Identities=16% Similarity=0.184 Sum_probs=47.6 Q ss_pred HHHHHHHHHHH--HHCCCEEEEEEE-CCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHH Q ss_conf 25899999986--305975899972-100111103679999999974100357677758999516888444220076999 Q gi|254780791|r 156 GAVIRDILQRI--SCRFPLRVIIFP-VKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMI 232 (529) Q Consensus 156 ~a~~~D~~~~~--~~r~p~~~~~~p-~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~l 232 (529) ..-++|+-++. .+||.+.+...+ ..-....-+.-|.-|.+.+-.+. .|+|=. .|..|.. T Consensus 150 ~~~l~~lg~v~~e~~~~GmPlla~~~~g~~~~~d~~~va~aaRia~ELG-------ADiVKt-----------~yt~e~f 211 (291) T PRK08227 150 TQSIKNIIQLVDAGLRYGMPTMAVTAVGKDMVRDARYFSLATRIAAEMG-------AQIIKT-----------YYVEKGF 211 (291) T ss_pred HHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHC-------CCEEEC-----------CCCHHHH T ss_conf 9999999999999998299879983468777777899999999999978-------998850-----------6973459 Q ss_pred HHHHHHCCCEEEEEECCCCC-CHHHHHHHC Q ss_conf 99997489048852057775-258988641 Q gi|254780791|r 233 VRAIANSSIPIISAIGHETD-WTLADYAAD 261 (529) Q Consensus 233 araI~~~~iPVisgIGHE~D-~Tl~D~VAD 261 (529) .+.++.||+||+.+=|.-+| ..+..+|.+ T Consensus 212 ~~Vv~a~pvPVliaGG~k~~d~e~L~~v~~ 241 (291) T PRK08227 212 ERITAGCPVPIVIAGGKKLPERDALEMCYQ 241 (291) T ss_pred HHHHHCCCCCEEEECCCCCCHHHHHHHHHH T ss_conf 999964899789967998986999999999 No 136 >TIGR00284 TIGR00284 dihydropteroate synthase-related protein; InterPro: IPR005236 The proteins of this family have been found so far only in the four archaeal species. The central region of the proteins shows considerable homology to the amino-terminal half of dihydropteroate synthases. . Probab=86.75 E-value=2.5 Score=23.15 Aligned_cols=52 Identities=19% Similarity=0.200 Sum_probs=27.5 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCC Q ss_conf 999999999997400171899999705435688862799987489479999973521058 Q gi|254780791|r 17 VSELSYHLKHIVESNLSHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKI 76 (529) Q Consensus 17 vs~l~~~i~~~l~~~~~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~ 76 (529) +.+-....+..|+.... ...+|--- .||+|-++|==.-....-+|+|+.... T Consensus 45 Iak~lk~~~~~L~~~~k------d~~D~~l~--pGlvrgDak~veevtG~~~fKgT~ea~ 96 (529) T TIGR00284 45 IAKELKKLKDKLEKKLK------DSVDYVLV--PGLVRGDAKVVEEVTGVKVFKGTEEAV 96 (529) T ss_pred HHHHHHHHHHHHHHHHH------CCCCEEEE--CCCHHHHHHHHHHCCCCCCCCCCCCHH T ss_conf 99999878887645431------46476884--770010023210025402224767720 No 137 >PRK05901 RNA polymerase sigma factor; Provisional Probab=86.68 E-value=1.7 Score=24.49 Aligned_cols=48 Identities=27% Similarity=0.402 Sum_probs=32.2 Q ss_pred HHCCHHHHHHH--HHHCCCEEEEEECCCCCCHHHHHHHCCCCCCCHHHHH Q ss_conf 20076999999--9748904885205777525898864123777214567 Q gi|254780791|r 225 WHFNDEMIVRA--IANSSIPIISAIGHETDWTLADYAADLRAPTPTGAAE 272 (529) Q Consensus 225 ~~FN~e~lara--I~~~~iPVisgIGHE~D~Tl~D~VAD~Ra~TPTaAAE 272 (529) +.+..+.|... +...++-.=+.||++-|.++.|+++|..+.+|-.+++ T Consensus 272 lg~~~~~v~~~~~~~~~~~SLd~~vg~d~~~~l~d~i~D~~~~~p~~~~~ 321 (398) T PRK05901 272 MDFTPEKVREIQKYAREPISLDKTIGDEGDSQFGDFIEDSEAVSPVDAVS 321 (398) T ss_pred HCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCHHHHHH T ss_conf 19999999999997289855678778998651866416999999999999 No 138 >PRK07772 single-stranded DNA-binding protein; Provisional Probab=86.66 E-value=1 Score=26.53 Aligned_cols=76 Identities=18% Similarity=0.416 Sum_probs=48.2 Q ss_pred CEEEEEEECC---CCCCCCCCEE--EEEEE------CC---------CCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEE Q ss_conf 1899999705---4356888627--99987------48---------947999997352105866814598899999966 Q gi|254780791|r 34 HVCVRGEISG---YRGIHSSGHA--YFSLK------DN---------HSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKIT 93 (529) Q Consensus 34 ~~~v~gEis~---~~~~~~sGH~--Yf~lk------d~---------~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~ 93 (529) +|-|.|-+.. +|. -.||-. =|+|- |. ..-++|++|+..+..+.-.++.|++|+|.|++. T Consensus 6 ~V~lvGnL~~DPe~r~-t~~G~~v~~f~lA~~~r~~d~~~g~~~d~~t~~~~v~~wgklAe~~~~~l~KG~~V~V~GrL~ 84 (183) T PRK07772 6 TITVVGNLTADPELRF-TPSGAAVANFTVASTPRTYDRQTNEWKDGEALFLRCSIWRQAAENVAESLTKGMRVIVTGRLK 84 (183) T ss_pred EEEEEEECCCCCEEEE-CCCCCEEEEEEEEECCCEEECCCCCEECCCEEEEEEEEEHHHHHHHHHHCCCCCEEEEEEEEE T ss_conf 8999875687970888-999996999999855852726888477162479999981799999887446998999998877 Q ss_pred E--ECC-----CCEEEEEEEEEEE Q ss_conf 7--528-----8437999997101 Q gi|254780791|r 94 T--FPG-----SSKYQIIIESLIP 110 (529) Q Consensus 94 ~--y~~-----~g~~ql~v~~i~~ 110 (529) . |.. |+..-++++.|-+ T Consensus 85 tr~w~dk~G~~r~~tei~ad~vg~ 108 (183) T PRK07772 85 QRSYETREGEKRTVVELEVDEIGP 108 (183) T ss_pred CCCCCCCCCCEEEEEEEEEEECCC T ss_conf 675486899989999999961243 No 139 >PRK08260 enoyl-CoA hydratase; Provisional Probab=86.60 E-value=2.5 Score=23.07 Aligned_cols=38 Identities=24% Similarity=0.381 Sum_probs=26.0 Q ss_pred HHHHHHHHCCCEEEEEE-CCC--CCCHHHHHHHCCCCCCCHH Q ss_conf 99999974890488520-577--7525898864123777214 Q gi|254780791|r 231 MIVRAIANSSIPIISAI-GHE--TDWTLADYAADLRAPTPTG 269 (529) Q Consensus 231 ~laraI~~~~iPVisgI-GHE--~D~Tl~D~VAD~Ra~TPTa 269 (529) .+..+|++|+.|||++| ||= --..|+ +.+|+|.++++| T Consensus 101 ~~~~~i~~~~kPvIAaV~G~a~GgG~~La-lacD~rias~~a 141 (293) T PRK08260 101 RVTLRIFDCLKPVIAAVNGPAVGVGATMT-LAMDIRLASTAA 141 (293) T ss_pred HHHHHHHHCCCCEEEEECCEEEHHHHHHH-HCCCCCCCCCCC T ss_conf 99999984899889998898512658987-603600102478 No 140 >cd00288 Pyruvate_Kinase Pyruvate kinase (PK): Large allosteric enzyme that regulates glycolysis through binding of the substrate, phosphoenolpyruvate, and one or more allosteric effectors. Like other allosteric enzymes, PK has a high substrate affinity R state and a low affinity T state. PK exists as several different isozymes, depending on organism and tissue type. In mammals, there are four PK isozymes: R, found in red blood cells, L, found in liver, M1, found in skeletal muscle, and M2, found in kidney, adipose tissue, and lung. PK forms a homotetramer, with each subunit containing three domains. The T state to R state transition of PK is more complex than in most allosteric enzymes, involving a concerted rotation of all 3 domains of each monomer in the homotetramer. Probab=86.60 E-value=2.9 Score=22.49 Aligned_cols=185 Identities=15% Similarity=0.159 Sum_probs=84.4 Q ss_pred CCCEEEEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHH Q ss_conf 88627999874894799999735210586681459889999996675288437999997101680079999999999765 Q gi|254780791|r 49 SSGHAYFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLL 128 (529) Q Consensus 49 ~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~ 128 (529) .+-+++|+..+....-+.-++.-++..+--..+.|+.|++- .|...|.|..++..+. +.-+-. T Consensus 87 ~G~~v~l~~~~~~~~~~~~~i~v~~~~l~~~v~~Gd~I~id--------DG~i~l~V~~~~~~~~---------i~~~v~ 149 (480) T cd00288 87 AGDKFLVTTDPAAKKGTKEKIYVDYKNLTKDVSPGNTILVD--------DGLLSLKVLSKDDDKT---------LVCEVL 149 (480) T ss_pred CCCEEEEEECCCCCCCCCCEEECCHHHHHHHCCCCCEEEEE--------CCEEEEEEEEECCCCE---------EEEEEE T ss_conf 89999998568767898888965728889765899889994--------7838999999858976---------999997 Q ss_pred HHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHH-------------- Q ss_conf 401226100163102652899984784258999999863059758999721001111036799999-------------- Q gi|254780791|r 129 EEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAI-------------- 194 (529) Q Consensus 129 ~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai-------------- 194 (529) .-|.+-.++-.-+|-.+..+--+|..+ +.|+.--++...- ++.-+.||-.+-..++.+.| T Consensus 150 ~gG~l~s~Kgvn~p~~~~~lp~lTekD---~~di~~a~~~~vD---~valSFVr~~~Dv~~lr~~l~~~g~~~~IiaKIE 223 (480) T cd00288 150 NGGVLGSRKGVNLPGTDVDLPALSEKD---KADLRFGVEQGVD---MIFASFVRKASDVLEIREVLGEKGKDIKIIAKIE 223 (480) T ss_pred ECCEECCCCCCCCCCCCCCCCCCCHHH---HHHHHHHHHCCCC---EEEECCCCCHHHHHHHHHHHHHCCCCCEEEEEEC T ss_conf 583973886323688867866798778---9999868875999---9998888989999999999997488666999852 Q ss_pred -----HHHHHHCCCCCCCCCCEEEEECCCCC----HHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHCCCCC Q ss_conf -----99974100357677758999516888----444220076999999974890488520577752589886412377 Q gi|254780791|r 195 -----LQLNTLKEGRTCPRPDIIILARGGGS----IEDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYAADLRAP 265 (529) Q Consensus 195 -----~~~~~~~~~~~~~~~D~iii~RGGGS----~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~VAD~Ra~ 265 (529) +-++..- ..-|.|+|+||-=+ +|++ +.--..+.+.--....|||+| |=++| +=...| T Consensus 224 ~~~al~nl~eIi-----~~sDgIMIARGDLgvEi~~e~v-p~~Qk~Ii~~c~~~gKPvIvA------TqmLe--SMi~~p 289 (480) T cd00288 224 NQEGVNNFDEIL-----EASDGIMVARGDLGVEIPAEEV-FLAQKMLIAKCNLAGKPVITA------TQMLE--SMIYNP 289 (480) T ss_pred CHHHHHCHHHHH-----HHCCEEEEECCCCCCCCCHHHH-HHHHHHHHHHHHHCCCEEEEE------CCHHH--HHHCCC T ss_conf 776664799999-----8538899977865564798898-999999999999839959996------73688--762389 Q ss_pred CCHHH Q ss_conf 72145 Q gi|254780791|r 266 TPTGA 270 (529) Q Consensus 266 TPTaA 270 (529) +||-| T Consensus 290 ~PTRA 294 (480) T cd00288 290 RPTRA 294 (480) T ss_pred CCCEE T ss_conf 98521 No 141 >PRK07110 polyketide biosynthesis enoyl-CoA hydratase; Validated Probab=86.57 E-value=2.3 Score=23.39 Aligned_cols=77 Identities=21% Similarity=0.264 Sum_probs=42.5 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCC-------CHHHHHHCC----------------HHHHHHHHHHCCCE Q ss_conf 0367999999997410035767775899951688-------844422007----------------69999999748904 Q gi|254780791|r 186 CPKEIANAILQLNTLKEGRTCPRPDIIILARGGG-------SIEDLWHFN----------------DEMIVRAIANSSIP 242 (529) Q Consensus 186 a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGG-------S~eDL~~FN----------------~e~laraI~~~~iP 242 (529) -..++..++..++... .+-|||| +|.| .+..|..+. -..+.+.+.+||.| T Consensus 32 m~~~l~~al~~~~~d~------~vrvvVl-tg~g~~FcaG~D~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~kP 104 (257) T PRK07110 32 LVRQLHGAFDTIAQDP------RCKVVIL-TGYPNYFATGGTQEGLLSLSEGKGTFTDYKTNIEHEYANLYSLALNCPIP 104 (257) T ss_pred HHHHHHHHHHHHHHCC------CCEEEEE-ECCCCCCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 9999999999997689------9169999-88998652566678887544155311467888999999999999819998 Q ss_pred EEEEE-CCCCCC-HHHHHHHCCCCCCCHH Q ss_conf 88520-577752-5898864123777214 Q gi|254780791|r 243 IISAI-GHETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 243 VisgI-GHE~D~-Tl~D~VAD~Ra~TPTa 269 (529) ||.+| ||-.=- .-.=+.+|.|..++.+ T Consensus 105 vIaavnG~a~GgG~~lal~cD~ria~~~a 133 (257) T PRK07110 105 VIAAMQGHAIGGGFVMGLYADFVVLSEES 133 (257) T ss_pred EEEEECCHHHHHHHHHHHCCCEEEECCCC T ss_conf 99997878857768997326878986997 No 142 >PRK06144 enoyl-CoA hydratase; Provisional Probab=86.38 E-value=2.6 Score=23.00 Aligned_cols=75 Identities=23% Similarity=0.345 Sum_probs=41.6 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCEEEEE-CCCCCH---HHHHHCC--------------HHHHHHHHHHCCCEEEEEE Q ss_conf 036799999999741003576777589995-168884---4422007--------------6999999974890488520 Q gi|254780791|r 186 CPKEIANAILQLNTLKEGRTCPRPDIIILA-RGGGSI---EDLWHFN--------------DEMIVRAIANSSIPIISAI 247 (529) Q Consensus 186 a~~~i~~ai~~~~~~~~~~~~~~~D~iii~-RGGGS~---eDL~~FN--------------~e~laraI~~~~iPVisgI 247 (529) -..++..+|+.++... .+-+|||. -||..+ -||-.|. -..+..+|..||.|||++| T Consensus 36 m~~el~~~l~~~~~d~------~vr~vVl~gag~~~F~aG~Dl~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~kPvIaav 109 (262) T PRK06144 36 MYEGLAAICEEIAADP------SIRAVVLRGAGDKAFVAGTDIAQFRAFSTAEDAVAYERRIDRVLGALEQLRVPTIAAI 109 (262) T ss_pred HHHHHHHHHHHHHHCC------CCEEEEEECCCCCCEECCCCHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 9999999999986389------9149999579986253476578761457704578999999999999995899889997 Q ss_pred -----CCCCCCHHHHHHHCCCCCCCHH Q ss_conf -----5777525898864123777214 Q gi|254780791|r 248 -----GHETDWTLADYAADLRAPTPTG 269 (529) Q Consensus 248 -----GHE~D~Tl~D~VAD~Ra~TPTa 269 (529) |--.... +.+|+|..+|.+ T Consensus 110 ~G~a~GgG~~la---l~cD~~ias~~a 133 (262) T PRK06144 110 AGACVGGGAAIA---AACDLRIATPSA 133 (262) T ss_pred CCEEEECCCEEH---HCCCEEEECCCC T ss_conf 587860722201---107776778997 No 143 >COG0629 Ssb Single-stranded DNA-binding protein [DNA replication, recombination, and repair] Probab=86.35 E-value=1.6 Score=24.82 Aligned_cols=78 Identities=19% Similarity=0.304 Sum_probs=50.8 Q ss_pred CCEEEEEEECC---CCCCCCCCEEE--EEEEC-------------CCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEE Q ss_conf 71899999705---43568886279--99874-------------89479999973521058668145988999999667 Q gi|254780791|r 33 SHVCVRGEISG---YRGIHSSGHAY--FSLKD-------------NHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITT 94 (529) Q Consensus 33 ~~~~v~gEis~---~~~~~~sGH~Y--f~lkd-------------~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~ 94 (529) ..|.+.|-|+. ++.-+.+|-++ |++.. ..--++|++|...+..+.-.++-|+.|+|.|++.. T Consensus 4 Nkv~LvG~l~~DPE~r~t~~g~~~v~~~~~a~~r~~~~~~~~~~~~t~~~~vv~wgk~Ae~~~~yl~KG~~V~VeG~l~~ 83 (167) T COG0629 4 NKVILVGRLTRDPELRYTPNGGAVVALFSAAVNRRFDNQSGERDEETDWIRVVIWGKLAENAAEYLKKGSLVYVEGRLQT 83 (167) T ss_pred CEEEEEEEECCCHHHEECCCCCCEEEEEEEEECCCCCCCCCCCCCCCCEEEEEEECHHHHHHHHHHCCCCEEEEEEEEEE T ss_conf 33999885046811035368881379999872242115676656765269999982388999997478878999999888 Q ss_pred --EC-CCC----EEEEEEEEEEE Q ss_conf --52-884----37999997101 Q gi|254780791|r 95 --FP-GSS----KYQIIIESLIP 110 (529) Q Consensus 95 --y~-~~g----~~ql~v~~i~~ 110 (529) |. +.| ...+.++.+-. T Consensus 84 ~~~~~~~G~~r~~~~~~~~~v~~ 106 (167) T COG0629 84 RKWEDQEGQKRYQTEIVTEIVAD 106 (167) T ss_pred EEEECCCCCCCEEEEEEEEEEEC T ss_conf 77757889853147998899611 No 144 >PRK09210 RNA polymerase sigma factor RpoD; Validated Probab=86.18 E-value=2.2 Score=23.51 Aligned_cols=49 Identities=27% Similarity=0.417 Sum_probs=32.9 Q ss_pred HHCCHHHHHHH--HHHCCCEEEEEECCCCCCHHHHHHHCCCCCCCHHHHHH Q ss_conf 20076999999--97489048852057775258988641237772145676 Q gi|254780791|r 225 WHFNDEMIVRA--IANSSIPIISAIGHETDWTLADYAADLRAPTPTGAAEM 273 (529) Q Consensus 225 ~~FN~e~lara--I~~~~iPVisgIGHE~D~Tl~D~VAD~Ra~TPTaAAEl 273 (529) +.+..+.|... ++..++-.=+-||.|.|.++.|+++|-.+++|...++. T Consensus 243 lgi~~e~V~~i~~~~~~~~SLd~pvg~e~~s~l~d~i~D~~~~~p~e~~~~ 293 (368) T PRK09210 243 MDMPPEKVREILKIAQEPVSLETPIGEEDDSHLGDFIEDQDATSPADHAAY 293 (368) T ss_pred HCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCHHHHHHH T ss_conf 499899999999853888566886788775528755248999998999999 No 145 >cd04491 SoSSB_OBF SoSSB_OBF: A subfamily of OB folds similar to the OB fold of the crenarchaeote Sulfolobus solfataricus single-stranded (ss) DNA-binding protein (SSoSSB). SSoSSB has a single OB fold, and it physically and functionally interacts with RNA polymerase. In vitro, SSoSSB can substitute for the basal transcription factor TBP, stimulating transcription from promoters under conditions in which TBP is limiting, and supporting transcription when TBP is absent. SSoSSB selectively melts the duplex DNA of promoter sequences. It also relieves transcriptional repression by the chromatin Alba. In addition, SSoSSB activates reverse gyrase activity, which involves DNA binding, DNA cleavage, strand passage and ligation. SSoSSB stimulates all these steps in the presence of the chromatin protein, Sul7d. SSoSSB antagonizes the inhibitory effect of Sul7d on reverse gyrase supercoiling activity. It also physically and functionally interacts with Mini-chromosome Maintenance (MCM), stimulating Probab=86.17 E-value=3 Score=22.32 Aligned_cols=49 Identities=12% Similarity=0.153 Sum_probs=38.3 Q ss_pred EEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEE-EEEEEECCCCEEEEEEEE Q ss_conf 999874894799999735210586681459889999-996675288437999997 Q gi|254780791|r 54 YFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVI-GKITTFPGSSKYQIIIES 107 (529) Q Consensus 54 Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~-g~~~~y~~~g~~ql~v~~ 107 (529) .+.+.|+.++|+.++|...+ ...++.|+-|.+. |++..| +|.++|.+.. T Consensus 26 ~~~i~D~TG~ir~t~W~~~~---~~~l~~Gd~v~i~~~~v~~~--~g~~el~~~~ 75 (82) T cd04491 26 SGLVGDETGTIRFTLWDEKA---ADDLEPGDVVRIENAYVREF--NGRLELSVGK 75 (82) T ss_pred EEEEECCCCEEEEEEECCCC---CCCCCCCCEEEEEEEEEEEE--CCEEEEEECC T ss_conf 99999899879999968744---45558999999968999888--9979999899 No 146 >pfam01761 DHQ_synthase 3-dehydroquinate synthase. The 3-dehydroquinate synthase EC:4.6.1.3 domain is present in isolation in various bacterial 3-dehydroquinate synthases and also present as a domain in the pentafunctional AROM polypeptide. 3-dehydroquinate (DHQ) synthase catalyses the formation of dehydroquinate (DHQ) and orthophosphate from 3-deoxy-D-arabino heptulosonic 7 phosphate. This reaction is part of the shikimate pathway which is involved in the biosynthesis of aromatic amino acids. Probab=86.08 E-value=3.1 Score=22.29 Aligned_cols=91 Identities=27% Similarity=0.436 Sum_probs=49.0 Q ss_pred CCCEEEEEECCCHHHH--HHHHHHHHHC-CCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC Q ss_conf 6528999847842589--9999986305-975899972100111103679999999974100357677758999516888 Q gi|254780791|r 144 IPKIIAVITSPTGAVI--RDILQRISCR-FPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGS 220 (529) Q Consensus 144 ~p~~i~vits~~~a~~--~D~~~~~~~r-~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS 220 (529) ..+++-|||.++-... ..+...++.. +++.++++|. |+ ..+++...-+..+.+.+ ....+.| +||+=|||. T Consensus 18 ~~~~~~iv~D~~v~~~~~~~i~~~L~~~~~~~~~~~~~~---gE-~~k~~~~~~~i~~~l~~-~~~~r~d-~iiaiGGG~ 91 (310) T pfam01761 18 KGRRLVVVTDENVAKLYGEKLEELLKAAGFEVEVVVIPD---GE-TYKTLETVARIYDALLE-KGLTRSD-VIIALGGGV 91 (310) T ss_pred CCCEEEEEECCCHHHHHHHHHHHHHHHCCCCEEEEEECC---CC-CCCCHHHHHHHHHHHHH-CCCCCCC-EEEEECCCC T ss_conf 998299998976799999999999987799579999569---86-54899999999999997-4999775-499954962 Q ss_pred HHHHHHCCHHHHHHHHHHCCCEEEE Q ss_conf 4442200769999999748904885 Q gi|254780791|r 221 IEDLWHFNDEMIVRAIANSSIPIIS 245 (529) Q Consensus 221 ~eDL~~FN~e~laraI~~~~iPVis 245 (529) .-|+.-| +-+.|.-.+|+|+ T Consensus 92 v~D~ak~-----~A~~~~rg~~~i~ 111 (310) T pfam01761 92 IGDLAGF-----AAATYMRGIPFIQ 111 (310) T ss_pred HHHHHHH-----HHHHHHCCCCEEE T ss_conf 1168999-----9999976997798 No 147 >PRK07827 enoyl-CoA hydratase; Provisional Probab=86.04 E-value=2.8 Score=22.65 Aligned_cols=39 Identities=31% Similarity=0.437 Sum_probs=27.5 Q ss_pred HHHHHHHHHCCCEEEEEE-CCCC--CCHHHHHHHCCCCCCCHH Q ss_conf 999999974890488520-5777--525898864123777214 Q gi|254780791|r 230 EMIVRAIANSSIPIISAI-GHET--DWTLADYAADLRAPTPTG 269 (529) Q Consensus 230 e~laraI~~~~iPVisgI-GHE~--D~Tl~D~VAD~Ra~TPTa 269 (529) ..+.++|.+||.|||.+| ||=. =.. +=+++|+|..+|+| T Consensus 86 ~~~~~~i~~~~kPvIaav~G~a~GgG~~-la~~cD~~ia~~~a 127 (255) T PRK07827 86 TALLRAIVELPKPVIAAIDGHVRAGGLG-LVGACDIAVAGPGS 127 (255) T ss_pred HHHHHHHHHCCCCEEEEECCEEEECCHH-HHHCCCEEECCCCC T ss_conf 9999999968998899977867634046-65516554148998 No 148 >PRK05617 enoyl-CoA hydratase; Provisional Probab=85.95 E-value=2.1 Score=23.72 Aligned_cols=37 Identities=19% Similarity=0.159 Sum_probs=29.2 Q ss_pred CHHHHHHHHHHCCCEEEEEE-------CCCCCCHHHHHHHCCCCCCCHH Q ss_conf 76999999974890488520-------5777525898864123777214 Q gi|254780791|r 228 NDEMIVRAIANSSIPIISAI-------GHETDWTLADYAADLRAPTPTG 269 (529) Q Consensus 228 N~e~laraI~~~~iPVisgI-------GHE~D~Tl~D~VAD~Ra~TPTa 269 (529) .++.+...|..+|.|||++| |=| +=+.+|.|.+||++ T Consensus 93 ~ey~l~~~i~~~~KP~IA~i~G~~mGGG~~-----Lal~~d~RIate~t 136 (356) T PRK05617 93 EEYRLNARIARYPKPYIALMDGIVMGGGVG-----ISAHGSHRIVTERT 136 (356) T ss_pred HHHHHHHHHHHCCCCEEEEEECEEEECHHH-----HHHHCCEEEECCCC T ss_conf 999999999977999899951626644899-----97608757852664 No 149 >PRK05864 enoyl-CoA hydratase; Provisional Probab=85.78 E-value=2.7 Score=22.74 Aligned_cols=36 Identities=25% Similarity=0.399 Sum_probs=25.2 Q ss_pred HHHHHHHHCCCEEEEEE-CC----CCCCHHHHHHHCCCCCCCHH Q ss_conf 99999974890488520-57----77525898864123777214 Q gi|254780791|r 231 MIVRAIANSSIPIISAI-GH----ETDWTLADYAADLRAPTPTG 269 (529) Q Consensus 231 ~laraI~~~~iPVisgI-GH----E~D~Tl~D~VAD~Ra~TPTa 269 (529) .+..++.+|+.|||++| || -.. .=+.+|+|..+++| T Consensus 93 ~~~~~l~~~~kPvIaaV~G~a~GgG~~---lal~cD~ria~~~A 133 (269) T PRK05864 93 DVILALRRLHQPVIAAVNGPAIGGGLC---LALAADIRVASSSA 133 (269) T ss_pred HHHHHHHHCCCCEEEEECCEEEECCHH---HHHCCCEEEECCCC T ss_conf 999999858998899973857646357---76417702435886 No 150 >PRK06190 enoyl-CoA hydratase; Provisional Probab=85.78 E-value=3 Score=22.40 Aligned_cols=77 Identities=19% Similarity=0.165 Sum_probs=42.2 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH----HH-----------HHHCCHHHHHHHHHHCCCEEEEEE-CC Q ss_conf 036799999999741003576777589995168884----44-----------220076999999974890488520-57 Q gi|254780791|r 186 CPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI----ED-----------LWHFNDEMIVRAIANSSIPIISAI-GH 249 (529) Q Consensus 186 a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~----eD-----------L~~FN~e~laraI~~~~iPVisgI-GH 249 (529) -..++..++..++... .+-|||| +|.|.. -| -+.+-...+..++..||.|||++| || T Consensus 32 m~~el~~al~~~~~d~------~vrvvVl-tg~G~~FcAG~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~kPvIAav~G~ 104 (258) T PRK06190 32 LRRALFAALAEADADD------DVDVVVL-TGADPAFCAGLDLKELGGDGLAYGAADALPDIVPAWPAMRKPVIGAINGA 104 (258) T ss_pred HHHHHHHHHHHHHHCC------CCEEEEE-ECCCCCCCCCCCHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCC T ss_conf 9999999999886589------9369999-78998752887978870464101356668899999996899999997770 Q ss_pred CCCC-HHHHHHHCCCCCCCHH Q ss_conf 7752-5898864123777214 Q gi|254780791|r 250 ETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 250 E~D~-Tl~D~VAD~Ra~TPTa 269 (529) =.=- .-.=+.+|+|..+|+| T Consensus 105 a~GgG~~lal~cD~~ias~~a 125 (258) T PRK06190 105 AVTGGLELALACDILIASERA 125 (258) T ss_pred HHHHHHHHHHCCCEEEECCCC T ss_conf 304438987247668855981 No 151 >cd07062 Peptidase_S66_mccF_like Microcin C7 self-immunity protein determines resistance to exogenous microcin C7. Microcin C7 self-immunity protein (mccF): MccF, a homolog of the LD-carboxypeptidase family, mediates resistance against exogenously added microcin C7 (MccC7), a ribosomally-encoded peptide antibiotic that contains a phosphoramidate linkage to adenosine monophosphate at its C-terminus. The plasmid-encoded mccF gene is transcribed in the opposite direction to the other five genes (mccA-E) and is required for the full expression of immunity but not for production. The catalytic triad residues (Ser, His, Glu) of LD-carboxypeptidase are also conserved in MccF, strongly suggesting that MccF shares the hydrolytic activity with LD-carboxypeptidases. Substrates of MccF have not been deduced, but could likely be microcin C7 precursors. The possible role of MccF is to defend producer cells against exogenous microcin from re-entering after having been exported. It is suggested that M Probab=85.62 E-value=3.2 Score=22.12 Aligned_cols=74 Identities=22% Similarity=0.248 Sum_probs=44.0 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHH-HHCCCEEEEE--ECCCC Q ss_conf 99721001111036799999999741003576777589995168884442200769999999-7489048852--05777 Q gi|254780791|r 175 IIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAI-ANSSIPIISA--IGHET 251 (529) Q Consensus 175 ~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI-~~~~iPVisg--IGHE~ 251 (529) +||==-|- +.+..|-+.|..+... +..+.+-.||++|=-+...+-+.+.-+++.+.. ....+||+.+ +||-. T Consensus 221 ILflEdv~--e~~~~idR~L~~L~~a---G~f~~~~Gii~G~f~~~~~~~~~~~~~~vi~~~~~~~~iPV~~~~~~GH~~ 295 (308) T cd07062 221 ILFLETSE--LSPATVERALRQLKLA---GVFDKISGIIFGRPQDEEDKGTEETYEDILLEVLGDLDIPIVYDVDFGHTP 295 (308) T ss_pred EEEEEECC--CCHHHHHHHHHHHHHH---CCCCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCCCCC T ss_conf 89998189--9999999999999983---985467089993377676778651899999998633898489899987999 Q ss_pred CC Q ss_conf 52 Q gi|254780791|r 252 DW 253 (529) Q Consensus 252 D~ 253 (529) +. T Consensus 296 ~~ 297 (308) T cd07062 296 PQ 297 (308) T ss_pred CC T ss_conf 86 No 152 >PRK05862 enoyl-CoA hydratase; Provisional Probab=85.62 E-value=2.8 Score=22.66 Aligned_cols=76 Identities=16% Similarity=0.183 Sum_probs=41.0 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC-H---HHHHHCC-----------HHHHHHHHHHCCCEEEEEE-CCC Q ss_conf 3679999999974100357677758999516888-4---4422007-----------6999999974890488520-577 Q gi|254780791|r 187 PKEIANAILQLNTLKEGRTCPRPDIIILARGGGS-I---EDLWHFN-----------DEMIVRAIANSSIPIISAI-GHE 250 (529) Q Consensus 187 ~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS-~---eDL~~FN-----------~e~laraI~~~~iPVisgI-GHE 250 (529) ..++..++..++... .+.|||| +|-|- + -||-.|. -..+..+|.+||.|||++| ||= T Consensus 33 ~~el~~~~~~~~~d~------~vrvvvl-~g~g~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~~l~~~~kPvIaav~G~a 105 (257) T PRK05862 33 MDELGAALAAFDADE------GIGAIVI-TGSEKAFAAGADIKEMADLSFMDVYKGDYITNWEKVARIRKPVIAAVAGYA 105 (257) T ss_pred HHHHHHHHHHHHHCC------CCEEEEE-ECCCCCEECCCCHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEECCEE T ss_conf 999999999998599------9359999-889983146878577435661557777799999999858999899976886 Q ss_pred CCCH-HHHHHHCCCCCCCHH Q ss_conf 7525-898864123777214 Q gi|254780791|r 251 TDWT-LADYAADLRAPTPTG 269 (529) Q Consensus 251 ~D~T-l~D~VAD~Ra~TPTa 269 (529) .--- -.=+.+|+|..++++ T Consensus 106 ~GgG~~lal~cD~~ia~~~a 125 (257) T PRK05862 106 LGGGCELAMMCDIIIAADTA 125 (257) T ss_pred ECCHHHHHHHCCEEECCCCC T ss_conf 04245887734531304786 No 153 >PRK08140 enoyl-CoA hydratase; Provisional Probab=85.38 E-value=2.6 Score=22.98 Aligned_cols=39 Identities=21% Similarity=0.336 Sum_probs=25.9 Q ss_pred HHHHHHHHCCCEEEEEE-CCCCCC-HHHHHHHCCCCCCCHH Q ss_conf 99999974890488520-577752-5898864123777214 Q gi|254780791|r 231 MIVRAIANSSIPIISAI-GHETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 231 ~laraI~~~~iPVisgI-GHE~D~-Tl~D~VAD~Ra~TPTa 269 (529) .+.++|.+||.|||.+| ||=.=- .-+=+.+|+|..+++| T Consensus 90 ~~~~~i~~~~kPvIaav~G~a~GgG~~lal~cD~~ia~~~a 130 (262) T PRK08140 90 PLVRRLRALPLPVIAAVNGVAAGAGANLALACDIVLAARSA 130 (262) T ss_pred HHHHHHHHCCCCEEEEECCEEECCHHHHHHHCCEEEECCCC T ss_conf 99999995899889998797641057998735543303675 No 154 >PRK07657 enoyl-CoA hydratase; Provisional Probab=85.33 E-value=2.3 Score=23.33 Aligned_cols=77 Identities=25% Similarity=0.356 Sum_probs=44.3 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCC--CH---HHHHH---CC----------HHHHHHHHHHCCCEEEEEE Q ss_conf 0367999999997410035767775899951688--84---44220---07----------6999999974890488520 Q gi|254780791|r 186 CPKEIANAILQLNTLKEGRTCPRPDIIILARGGG--SI---EDLWH---FN----------DEMIVRAIANSSIPIISAI 247 (529) Q Consensus 186 a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGG--S~---eDL~~---FN----------~e~laraI~~~~iPVisgI 247 (529) ...++..++..++... .+.+||| +|.| ++ -||-. ++ -..++++|..||.|||.+| T Consensus 33 m~~~l~~al~~~~~d~------~vr~vvl-~g~G~~~F~aG~Dl~e~~~~~~~~~~~~~~~~~~~~~~i~~~~kPvIaav 105 (261) T PRK07657 33 LLEELQNILTQINEEA------NVRVVIL-TGAGEKAFCAGADLKERAGMNEEQVRHAVSLIRTTMEMVEQLPQPVIAAI 105 (261) T ss_pred HHHHHHHHHHHHHHCC------CCEEEEE-ECCCCCCCCCCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 9999999999997589------9649999-66998764289984766038877899999999999999982999899997 Q ss_pred -CCCCCCH-HHHHHHCCCCCCCHH Q ss_conf -5777525-898864123777214 Q gi|254780791|r 248 -GHETDWT-LADYAADLRAPTPTG 269 (529) Q Consensus 248 -GHE~D~T-l~D~VAD~Ra~TPTa 269 (529) ||=.=-- -.=+.+|+|..+|.+ T Consensus 106 ~G~a~GgG~~lal~~D~~ia~~~a 129 (261) T PRK07657 106 NGIALGGGLELALACDFRIAAESA 129 (261) T ss_pred CCEEEEHHHHHHHHCCCCCCCCCC T ss_conf 796640318998844603012554 No 155 >COG0017 AsnS Aspartyl/asparaginyl-tRNA synthetases [Translation, ribosomal structure and biogenesis] Probab=85.14 E-value=3.3 Score=21.94 Aligned_cols=233 Identities=18% Similarity=0.211 Sum_probs=120.2 Q ss_pred HHHHHHHHCC-CCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCC---CCCCCCEEEEEEEEEEE-CC Q ss_conf 9999974001-7189999970543568886279998748947999997352105866---81459889999996675-28 Q gi|254780791|r 23 HLKHIVESNL-SHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEF---LPEEGIEFLVIGKITTF-PG 97 (529) Q Consensus 23 ~i~~~l~~~~-~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~---~~~~G~~v~~~g~~~~y-~~ 97 (529) +|+++.+..- ..|.|.|=|-+.+. .+|-++..|.|..+.|.||..++....--| .+..|.-|.|.|.|.-= .. T Consensus 6 ~i~di~~~~~~~~V~v~GWV~~~R~--~g~i~Fi~lrDgsg~iQ~v~~~~~~~~~~~~~~~L~~es~v~V~G~v~~~~~a 83 (435) T COG0017 6 YIKDIKPHVGGQEVTVRGWVHNKRD--LGKIIFLVLRDGSGFIQAVVPKNKVYEELFKAKKLTLESSVVVTGIVKASPKA 83 (435) T ss_pred EHHHHHCCCCCCEEEEEEEEEEECC--CCCEEEEEEECCCCEEEEEEECCCCCHHHHHHHCCCCCCEEEEEEEEECCCCC T ss_conf 3776504479857999988755001--58708999975883799999778771767666317876589999999748899 Q ss_pred CCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHH--HHHHHCCCEEEE Q ss_conf 843799999710168007999999999976540122610016310265289998478425899999--986305975899 Q gi|254780791|r 98 SSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDIL--QRISCRFPLRVI 175 (529) Q Consensus 98 ~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~--~~~~~r~p~~~~ 175 (529) .++|-+.|+.|+.-|..+ + ++|.-++.-- .+.-.+ +.+.-|-|-.-- T Consensus 84 ~~g~El~v~~i~Vl~~a~-------------------~----~~Pi~~~~~~--------~~e~lld~rhL~lR~~~~~A 132 (435) T COG0017 84 PQGFELQVEKIEVLGEAD-------------------P----PYPIDKKEHS--------ELETLLDNRHLDLRTPKIQA 132 (435) T ss_pred CCCEEEEEEEEEEEECCC-------------------C----CCCCCCCCCC--------CHHHHHHCHHEECCCCCHHH T ss_conf 987799887889940468-------------------8----7776864555--------78888744030204620377 Q ss_pred EEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEE---C-CCCCHHH------------HHHCCHHHHHHHHHHC Q ss_conf 9721001111036799999999741003576777589995---1-6888444------------2200769999999748 Q gi|254780791|r 176 IFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILA---R-GGGSIED------------LWHFNDEMIVRAIANS 239 (529) Q Consensus 176 ~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~---R-GGGS~eD------------L~~FN~e~laraI~~~ 239 (529) + -.|| ..|+.|+..+-.-+ ....++.-+|+ = |||-+=- =.-|+-|.++.+ -. T Consensus 133 v--~kir-----s~i~~a~~eff~~~---gF~eV~tP~i~~~~~EGg~elF~v~yf~~~a~LtqS~QLyke~~~~a--l~ 200 (435) T COG0017 133 V--FKIR-----SSILRAIREFFYEN---GFTEVHTPIITASATEGGGELFKVDYFDKEAYLTQSPQLYKEALAAA--LE 200 (435) T ss_pred H--HHHH-----HHHHHHHHHHHHHC---CCEEECCCEEECCCCCCCCEEEEEEECCCCEEEECCHHHHHHHHHHH--HC T ss_conf 8--9499-----99999999999758---95896594685347899850577750685667714788999999998--57 Q ss_pred C----CEEEEEECCCCCCHHHHHHHCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9----04885205777525898864123777214567633234677766999998887789988899998556776 Q gi|254780791|r 240 S----IPIISAIGHETDWTLADYAADLRAPTPTGAAEMAVPVKEHLQSSLINLEARLNNIIIRLIKYKINTLNSLL 311 (529) Q Consensus 240 ~----iPVisgIGHE~D~Tl~D~VAD~Ra~TPTaAAElavp~~~EL~~~L~~l~~RL~~a~~~~l~~~~q~L~~l~ 311 (529) + -|++=|=-|.|..-|+.|.. -=+|++.-+..++..-+.++ +...++..++.....|+.+. T Consensus 201 rVf~igP~FRAE~s~T~RHL~EF~~--------ld~Emaf~~~~d~m~l~E~~---i~~i~~~v~e~~~~el~~l~ 265 (435) T COG0017 201 RVFTIGPTFRAEKSNTRRHLSEFWM--------LDPEMAFADLNDVMDLAEEL---IKYLFKKVLEECADELEFLG 265 (435) T ss_pred CEEEECCCEECCCCCCCCHHHHHHE--------ECCEECCCCHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHH T ss_conf 6489567255477897003666733--------02111367688899999999---99999999997189888761 No 156 >PRK07658 enoyl-CoA hydratase; Provisional Probab=85.14 E-value=3.1 Score=22.28 Aligned_cols=40 Identities=20% Similarity=0.249 Sum_probs=26.0 Q ss_pred HHHHHHHHHCCCEEEEEE-CCCCCC-HHHHHHHCCCCCCCHH Q ss_conf 999999974890488520-577752-5898864123777214 Q gi|254780791|r 230 EMIVRAIANSSIPIISAI-GHETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 230 e~laraI~~~~iPVisgI-GHE~D~-Tl~D~VAD~Ra~TPTa 269 (529) ..+...|..||.|||.+| ||-.=- .-.=+.+|+|..++++ T Consensus 84 ~~~~~~i~~~~kPvIaaV~G~a~GgG~~lal~cD~~ia~~~a 125 (257) T PRK07658 84 QQTFERVEKFSKPVIAAIHGAALGGGLELAMSCHIRFATESA 125 (257) T ss_pred HHHHHHHHHCCCCEEEEECCHHHHCCHHHHHCCCEEEECCCC T ss_conf 999999995899899997672531425775326534311004 No 157 >PRK06072 enoyl-CoA hydratase; Provisional Probab=85.14 E-value=3.4 Score=21.94 Aligned_cols=76 Identities=24% Similarity=0.303 Sum_probs=46.0 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH----HHHHHCC----------HHHHHHHHHHCCCEEEEEE-CCCC Q ss_conf 36799999999741003576777589995168884----4422007----------6999999974890488520-5777 Q gi|254780791|r 187 PKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI----EDLWHFN----------DEMIVRAIANSSIPIISAI-GHET 251 (529) Q Consensus 187 ~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~----eDL~~FN----------~e~laraI~~~~iPVisgI-GHE~ 251 (529) ..+|..++..++... .+-+||| +|.|-. -||-.|. -..+.+.+..|+.|||.+| ||=. T Consensus 21 ~~eL~~a~~~~~~d~------~vr~vVl-~g~G~~FcAG~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~kp~Iaav~G~a~ 93 (240) T PRK06072 21 RNELISALREINADP------KIRVVIL-TGEGRAFSVGADLSEISEDITEDLRDSFHPIIKEIRFSNKIYISAVNGVVA 93 (240) T ss_pred HHHHHHHHHHHHHCC------CEEEEEE-ECCCCCEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCEEE T ss_conf 999999999997589------9069999-879986551588788745446889999999999998389998999888755 Q ss_pred CCHH-HHHHHCCCCCCCHH Q ss_conf 5258-98864123777214 Q gi|254780791|r 252 DWTL-ADYAADLRAPTPTG 269 (529) Q Consensus 252 D~Tl-~D~VAD~Ra~TPTa 269 (529) =--+ .=+.+|+|..||++ T Consensus 94 GgG~~la~~cD~rias~~a 112 (240) T PRK06072 94 GAGISIALSTDFRFASKDV 112 (240) T ss_pred HHHHHHHHHCCEEEECCCC T ss_conf 6879999862765530232 No 158 >cd00958 DhnA Class I fructose-1,6-bisphosphate (FBP) aldolases of the archaeal type (DhnA homologs) found in bacteria and archaea. Catalysis of the enzymes proceeds via a Schiff-base mechanism like other class I aldolases, although this subfamily is clearly divergent based on sequence similarity to other class I and class II (metal dependent) aldolase subfamilies. Probab=85.06 E-value=3.4 Score=21.92 Aligned_cols=90 Identities=17% Similarity=0.230 Sum_probs=59.4 Q ss_pred HHHHHHHHHHH--HHCCCEEEEEEE----CCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCH Q ss_conf 25899999986--305975899972----100111103679999999974100357677758999516888444220076 Q gi|254780791|r 156 GAVIRDILQRI--SCRFPLRVIIFP----VKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFND 229 (529) Q Consensus 156 ~a~~~D~~~~~--~~r~p~~~~~~p----~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~ 229 (529) ..-++|+-+.. .++|.+.+++++ -.+..+..+.-|.-|-+.+-.+. .|+|=+-=- -|. T Consensus 105 ~~~l~~l~~v~~ea~~~G~Pll~~~yprG~~~~~~~d~~~ia~aaRia~ELG-------ADiVKv~y~---------g~~ 168 (235) T cd00958 105 REMLEELARVAAEAHKYGLPLIAWMYPRGPAVKNEKDPDLIAYAARIGAELG-------ADIVKTKYT---------GDA 168 (235) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHC-------CCEEEECCC---------CCH T ss_conf 9999999999999998399789997416876556566899999999999978-------998982399---------998 Q ss_pred HHHHHHHHHCCCEEEEEECCCC--CCHHHHHHHC Q ss_conf 9999999748904885205777--5258988641 Q gi|254780791|r 230 EMIVRAIANSSIPIISAIGHET--DWTLADYAAD 261 (529) Q Consensus 230 e~laraI~~~~iPVisgIGHE~--D~Tl~D~VAD 261 (529) |...+.|..|++||+.+=|..+ |.-+..+|.+ T Consensus 169 e~f~~vv~~~~vPVviaGG~k~~~~~~~L~~v~~ 202 (235) T cd00958 169 ESFKEVVEGCPVPVVIAGGPKKDSEEEFLKMVYD 202 (235) T ss_pred HHHHHHHHHCCCCEEECCCCCCCCHHHHHHHHHH T ss_conf 9999999708998998089999999999999999 No 159 >cd01545 PBP1_SalR Ligand-binding domain of DNA transcription repressor SalR, a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor SalR, a member of the LacI-GalR family of bacterial transcription regulators. The SalR binds to glucose based compound Salicin which is chemically related to aspirin. The ligand-binding of SalR is structurally homologous to the periplasmic sugar-binding domain of ABC-transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand bind Probab=84.81 E-value=3.5 Score=21.83 Aligned_cols=21 Identities=10% Similarity=-0.007 Sum_probs=12.3 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 88889862299999999999740 Q gi|254780791|r 8 NSLDHPEYSVSELSYHLKHIVES 30 (529) Q Consensus 8 ~~~~~~~~svs~l~~~i~~~l~~ 30 (529) |+.++|-| +++...|...++. T Consensus 7 p~i~npff--~~l~~gi~~~~~~ 27 (270) T cd01545 7 DNPSPGYV--SEIQLGALDACRD 27 (270) T ss_pred CCCCCHHH--HHHHHHHHHHHHH T ss_conf 99973899--9999999999998 No 160 >PRK07327 enoyl-CoA hydratase; Provisional Probab=84.80 E-value=2.9 Score=22.50 Aligned_cols=71 Identities=27% Similarity=0.393 Sum_probs=41.5 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC-H---HHHHHCC---------------HHHHHHHHHHCCCEEEEEE Q ss_conf 3679999999974100357677758999516888-4---4422007---------------6999999974890488520 Q gi|254780791|r 187 PKEIANAILQLNTLKEGRTCPRPDIIILARGGGS-I---EDLWHFN---------------DEMIVRAIANSSIPIISAI 247 (529) Q Consensus 187 ~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS-~---eDL~~FN---------------~e~laraI~~~~iPVisgI 247 (529) ..+|..++..++... .+-|||| +|.|. + -||-.|. -..+..++..||.|||++| T Consensus 41 ~~~l~~a~~~~~~d~------~vrvvvl-tg~G~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~kPvIAav 113 (271) T PRK07327 41 HRELADIWRDVDRDP------DVRVVVI-RGEGKAFSAGGDLALVEEMAGDFEVRARVWREARDLVYNVINCSKPIVSAI 113 (271) T ss_pred HHHHHHHHHHHHHCC------CCEEEEE-ECCCCCCCCCCCHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 999999999997689------9169999-789986556756477763157878899999999999999997899889998 Q ss_pred -------CCCCCCHHHHHHHCCCCCCCHH Q ss_conf -------5777525898864123777214 Q gi|254780791|r 248 -------GHETDWTLADYAADLRAPTPTG 269 (529) Q Consensus 248 -------GHE~D~Tl~D~VAD~Ra~TPTa 269 (529) |-| .=+.+|+|..++++ T Consensus 114 ~G~a~GgG~~-----lal~cD~~ia~~~a 137 (271) T PRK07327 114 HGPAVGAGLV-----AALLADISIAAKDA 137 (271) T ss_pred ECEEECCCCH-----HHHHCCEEEECCCC T ss_conf 0333045508-----77636843684898 No 161 >TIGR01843 type_I_hlyD type I secretion membrane fusion protein, HlyD family; InterPro: IPR010129 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Type I secretion is an ABC transporter that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C terminus of the transported protein. This entry represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This entry selects a group of sequences closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins.; GO: 0015031 protein transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=84.74 E-value=3.5 Score=21.80 Aligned_cols=111 Identities=14% Similarity=0.062 Sum_probs=67.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCC------CE--EEEEC Q ss_conf 76777778875026889999999999999999999999999999999999998871696777314------61--99984 Q gi|254780791|r 396 KEKIAILHMLREQTKNRIFYLHTHIKKLITRIEFILSHKIKSCHTSVSITTRILQSFAYKNTLKR------GY--TSIQD 467 (529) Q Consensus 396 ~~~~~~l~~l~~~~~~~l~~~~~~l~~l~~rL~~~~~~~L~~~~~rL~~l~~~L~slsP~~~L~R------GY--aiv~~ 467 (529) +......+......+..+...+.++......+...+...+...+.+|..+...|...+-...+.- |- .+..+ T Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~A~~~l~~~~~~l~~a~~~~~~~~irsPvdG~v~~l~~~ 295 (434) T TIGR01843 216 QGELARLEAELEVLKAQIDELQLKLQQLEQTFREEVLEELTEAQARLAELREELSKARDRLQRLIIRSPVDGTVQSLKVH 295 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCEEEEEEEEE T ss_conf 99999879999999999999999998899999999999999999999999999999998753211442545199999998 Q ss_pred CCCCEE------------------------CCHHHCCCCCEEEEEEE------CEEEEEEEEECCCCCC Q ss_conf 898895------------------------77789299986999991------1099999950578887 Q gi|254780791|r 468 TNNNFI------------------------TQKRNLATKTRILINFF------DGQANAIVINKAPPKE 506 (529) Q Consensus 468 ~~GkiI------------------------~s~~~l~~gd~i~i~l~------DG~v~a~V~~k~~~~~ 506 (529) .-|-+| +++..|+.|+..+|+|. =|.++.+|....|+.- T Consensus 296 T~GgVv~pG~~lm~IVP~~~~l~iEa~~~~~Dig~v~~Gq~a~ik~~AF~~~ryG~~~G~V~~iS~D~~ 364 (434) T TIGR01843 296 TVGGVVQPGETLMEIVPEDDPLEIEAKLSNKDIGFVHVGQPAEIKFSAFPYTRYGLLNGKVKSISPDTV 364 (434) T ss_pred EECCEECCCCCEEEEECCCCCEEEEEEECCCCCCEEECCCEEEEEEECCCCCCCCCEEEEEEEECCCCH T ss_conf 405443278617898339996589999735465345558716999963686033518999998627511 No 162 >PRK07260 enoyl-CoA hydratase; Provisional Probab=84.43 E-value=3.5 Score=21.78 Aligned_cols=76 Identities=21% Similarity=0.244 Sum_probs=41.7 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH----HHHHHCC------H-----------HHHHHHHHHCCCEEEE Q ss_conf 36799999999741003576777589995168884----4422007------6-----------9999999748904885 Q gi|254780791|r 187 PKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI----EDLWHFN------D-----------EMIVRAIANSSIPIIS 245 (529) Q Consensus 187 ~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~----eDL~~FN------~-----------e~laraI~~~~iPVis 245 (529) ..+|..++..++... .+-|||| +|.|.. -||-.|. + -.+..++..||.|||+ T Consensus 33 ~~~l~~~~~~~~~d~------~vrvvil-~g~g~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~kPvIa 105 (260) T PRK07260 33 CQEILEALALAKEDT------SVRFLLI-NAVGKVFSVGGDLVEMKRAVAKDDVQSLVKIAELVNEISFAIKQLPKPVIM 105 (260) T ss_pred HHHHHHHHHHHHHCC------CEEEEEE-ECCCCCCCCCCCHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEE T ss_conf 999999999997689------9179999-789998889768888873225641678999999999999999848999899 Q ss_pred EE-CCCCCC-HHHHHHHCCCCCCCHH Q ss_conf 20-577752-5898864123777214 Q gi|254780791|r 246 AI-GHETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 246 gI-GHE~D~-Tl~D~VAD~Ra~TPTa 269 (529) +| ||=.-- .-.=+.+|+|..++.+ T Consensus 106 av~G~a~GgG~~la~~~D~~ia~~~a 131 (260) T PRK07260 106 CVDGAVAGAAANIAVAADFCIASTKT 131 (260) T ss_pred EECCEECCCHHHHHHHHCCCCCCCCC T ss_conf 97685401175898860522235687 No 163 >cd04497 hPOT1_OB1_like hPOT1_OB1_like: A subfamily of OB folds similar to the first OB fold (OB1) of human protection of telomeres 1 protein (hPOT1), the single OB fold of the N-terminal domain of Schizosaccharomyces pombe POT1 (SpPOT1), and the first OB fold of the N-terminal domain of the alpha subunit (OB1Nalpha) of Oxytricha nova telomere end binding protein (OnTEBP). POT1 proteins recognize single-stranded (ss) 3-prime ends of the telomere. A 3-prime ss overhang is conserved in ciliated protozoa, yeast, and mammals. SpPOT1 is essential for telomere maintenance. It binds specifically to the ss G-rich telomeric sequence (GGTTAC) of S. pombe. hPOT1 binds specifically to ss telomeric DNA repeats ending with the sequence GGTTAG. Deletion of the S. pombe pot1+ gene results in a rapid loss of telomere sequences, chromosome mis-segregation and chromosome circularization. hPOT1 is implicated in telomere length regulation. The hPOT1 monomer consists of two closely connected OB folds (OB1-OB Probab=84.25 E-value=3.6 Score=21.64 Aligned_cols=80 Identities=18% Similarity=0.194 Sum_probs=51.4 Q ss_pred HHHHHHHCCCCEEEEEEECCCCCCC-CCC---EEEEEEECCC----CEEEEEEECCCCCCCCCCCCCCCEEEEE-EEEEE Q ss_conf 9999740017189999970543568-886---2799987489----4799999735210586681459889999-99667 Q gi|254780791|r 24 LKHIVESNLSHVCVRGEISGYRGIH-SSG---HAYFSLKDNH----SRIDAIIWKGTLNKIEFLPEEGIEFLVI-GKITT 94 (529) Q Consensus 24 i~~~l~~~~~~~~v~gEis~~~~~~-~sG---H~Yf~lkd~~----a~i~~~~~~~~~~~~~~~~~~G~~v~~~-g~~~~ 94 (529) |+.+++..-..|-|.|-|..+.... ++| .+.|+|+|.. ..+.|.+|+.....|+.- +.||=|+++ .++.. T Consensus 6 L~~a~~~~~~~VnvigVV~~~~~p~~s~g~d~~~tl~I~D~S~~~~~~l~v~~F~~~~~~LP~v-~~GDiIll~~~~v~~ 84 (138) T cd04497 6 LSSALKESGGSVNVIGVVVDAGPPVRSKGTDYCCTLTITDPSLANSDGLTVKLFRPNEESLPIV-KVGDIILLRRVKIQS 84 (138) T ss_pred HHHHHHHCCCEEEEEEEEEECCCCEECCCCEEEEEEEEECCCCCCCCCEEEEEECCCHHHCCCC-CCCCEEEEEEEEEEE T ss_conf 8999863498588999998657964469972999999992899999967999989986569899-999899998889999 Q ss_pred ECCCCEEEEEEE Q ss_conf 528843799999 Q gi|254780791|r 95 FPGSSKYQIIIE 106 (529) Q Consensus 95 y~~~g~~ql~v~ 106 (529) |. |..+++.. T Consensus 85 ~~--~~~~~~~~ 94 (138) T cd04497 85 YN--GKPQGISN 94 (138) T ss_pred EC--CEEEEEEC T ss_conf 88--90799977 No 164 >PRK13371 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Provisional Probab=83.62 E-value=3.8 Score=21.44 Aligned_cols=139 Identities=16% Similarity=0.219 Sum_probs=65.2 Q ss_pred CCCCEEEEEEEEEEECCCCEE-----EEEEEEEEEC--------CCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCE Q ss_conf 459889999996675288437-----9999971016--------800799999999997654012261001631026528 Q gi|254780791|r 81 EEGIEFLVIGKITTFPGSSKY-----QIIIESLIPS--------GSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKI 147 (529) Q Consensus 81 ~~G~~v~~~g~~~~y~~~g~~-----ql~v~~i~~~--------g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~ 147 (529) +.|-.|++.|.-+.=+-.|.. .++|.+++.+ |-|+...-+++.+ .--.+| |||+++ ..+ T Consensus 162 k~gy~iII~G~~~HpEv~gt~s~a~~~~vV~~~eEa~~v~d~i~~~~~~~~fl~kf~-~a~s~g-fdPd~d------l~k 233 (392) T PRK13371 162 KHGFTSIIHGKYKHEETLATSSFAGTYLVVLDLEEAQYVADYILGGGDREEFLKRFA-KACSPG-FDPDRD------LER 233 (392) T ss_pred HCCCEEEEECCCCCCCEEEEECCCCCEEEEECHHHHHHHHHHHCCCCCHHHHHHHHH-HHCCCC-CCCCCC------CCE T ss_conf 689859997788987158842468855897058998889887507886899998753-302567-796556------432 Q ss_pred EEEEECCC-----HHHHHHHHH-HHHHCCC-EEE----EEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEC Q ss_conf 99984784-----258999999-8630597-589----997210011110367999999997410035767775899951 Q gi|254780791|r 148 IAVITSPT-----GAVIRDILQ-RISCRFP-LRV----IIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILAR 216 (529) Q Consensus 148 i~vits~~-----~a~~~D~~~-~~~~r~p-~~~----~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~R 216 (529) |||++=-| ...+.+++. ++-.+|+ .++ ..| ..||.|-..=...-..-....+|++|++- T Consensus 234 vgvv~QTTm~~~ete~I~~~l~~~~~~~~g~~~~~~hf~~f----------nTIC~ATq~RQ~A~~~L~~~~vDlmiVVG 303 (392) T PRK13371 234 VGVANQTTMLKSETEEIGKLFERTMLRKYGPAELNEHFLSF----------NTICDATQERQDAMFSLVEEPLDLMVVIG 303 (392) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCC----------CCCCHHHHHHHHHHHHHHHCCCCEEEEEC T ss_conf 89994545768999999999999987623841211234523----------88513018999999998754788899946 Q ss_pred CCCCHHHHHHCCHHHHHHHHHHCCCEE Q ss_conf 688844422007699999997489048 Q gi|254780791|r 217 GGGSIEDLWHFNDEMIVRAIANSSIPI 243 (529) Q Consensus 217 GGGS~eDL~~FN~e~laraI~~~~iPV 243 (529) |--|- |.-.|+.---...+|. T Consensus 304 G~NSS------NT~rL~EIa~~~g~pt 324 (392) T PRK13371 304 GFNSS------NTTHLQEIAIERGIPS 324 (392) T ss_pred CCCCC------CHHHHHHHHHHHCCCE T ss_conf 88884------1899999999859985 No 165 >cd04474 RPA1_DBD_A RPA1_DBD_A: A subfamily of OB folds corresponding to the second OB fold, the ssDNA-binding domain (DBD)-A, of human RPA1 (also called RPA70). RPA1 is the large subunit of Replication protein A (RPA). RPA is a nuclear ssDNA-binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). In addition to DBD-A, RPA1 contains three other OB folds: DBD-B, DBD-C, and RPA1N. The major DNA binding activity of human RPA (hRPA) and Saccharomyces cerevisiae RPA (ScRPA) is associated with DBD-A and DBD-B of RPA1. RPA1 DBD-C is involved in trimerization. The ssDNA-binding mechanism is believed to be multistep and to involve conformational change. Although ScRPA and the hRPA have similar ssDNA-binding properties, they differ funct Probab=83.57 E-value=3.5 Score=21.83 Aligned_cols=60 Identities=18% Similarity=0.345 Sum_probs=38.5 Q ss_pred CEEEEEEEC------CCCCCCCCC-EEEEEEECC-CCEEEEEEECCCCCCCCCCCCCCCEEEE-EEEEE Q ss_conf 189999970------543568886-279998748-9479999973521058668145988999-99966 Q gi|254780791|r 34 HVCVRGEIS------GYRGIHSSG-HAYFSLKDN-HSRIDAIIWKGTLNKIEFLPEEGIEFLV-IGKIT 93 (529) Q Consensus 34 ~~~v~gEis------~~~~~~~sG-H~Yf~lkd~-~a~i~~~~~~~~~~~~~~~~~~G~~v~~-~g~~~ 93 (529) +..|+|=|+ .|+.....| .+-|+|.|+ ...|+|++|...+.+..-.+++|.=..+ .|+|. T Consensus 11 ~w~Ik~RV~~Ks~ir~~~n~~g~gk~f~~~L~De~g~eI~at~f~~~~dkf~~~l~~G~vY~is~g~Vk 79 (104) T cd04474 11 KWTIKARVTNKSDIRTWSNARGEGKLFSFDLLDEDGGEIRATFFNDAVDKFYDLLEVGKVYYISKGSVK 79 (104) T ss_pred CEEEEEEEEECCCCCCEECCCCCCEEEEEEEEECCCCEEEEEEEHHHHHHHHHHCCCCCEEEEECCEEE T ss_conf 459999997157750350689973599999998998989999906999987733432879999565998 No 166 >COG0587 DnaE DNA polymerase III, alpha subunit [DNA replication, recombination, and repair] Probab=83.51 E-value=1.4 Score=25.45 Aligned_cols=62 Identities=18% Similarity=0.181 Sum_probs=28.9 Q ss_pred EECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEECCCCHHHHH Q ss_conf 874894799999735210586681459889999996675288437999997101680079999 Q gi|254780791|r 57 LKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLIPSGSGTLLTA 119 (529) Q Consensus 57 lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~ 119 (529) |.|.+.--.||=|-..+.+-.++|=-|+++.+...... ..+-.+.|++-..+..|-+.|..- T Consensus 39 iTD~~nl~Gav~Fy~~ak~~gikpIiG~e~~v~~~~~~-~~~~~~~l~llAkn~~GY~nL~~L 100 (1139) T COG0587 39 LTDHNNLYGAVEFYKAAKKAGIKPIIGCEAYVANGDGF-RGRERPHLLLLAKNNEGYKNLVKL 100 (1139) T ss_pred EECCCCCEEHHHHHHHHHHCCCEEEEEEEEEEECCCCC-CCCCCCCEEEEECCHHHHHHHHHH T ss_conf 95277632389999999986981775207997246544-556775279996788999999999 No 167 >PRK09674 enoyl-CoA hydratase-isomerase; Provisional Probab=83.47 E-value=3.4 Score=21.93 Aligned_cols=76 Identities=14% Similarity=0.193 Sum_probs=39.9 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC-H---HHHHHCCH-----------HHHHHHHHHCCCEEEEEE-CCC Q ss_conf 3679999999974100357677758999516888-4---44220076-----------999999974890488520-577 Q gi|254780791|r 187 PKEIANAILQLNTLKEGRTCPRPDIIILARGGGS-I---EDLWHFND-----------EMIVRAIANSSIPIISAI-GHE 250 (529) Q Consensus 187 ~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS-~---eDL~~FN~-----------e~laraI~~~~iPVisgI-GHE 250 (529) ..+|..++..++... .+-+||| +|.|- + -||-.|.+ ..+...|..|+.|||.+| ||- T Consensus 33 ~~el~~a~~~~~~d~------~vr~vvl-tg~g~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~~i~~~~kPvIaav~G~a 105 (257) T PRK09674 33 LAQLVNELEAAATDT------SISVCVI-TGNARFFAAGADLNEMAEKDLVATLNDPRPQLWQRIQAFNKPLIAAVNGYA 105 (257) T ss_pred HHHHHHHHHHHHHCC------CCEEEEE-ECCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEECCEE T ss_conf 999999999986389------9069999-789987707765276502451445541899999999838998999988931 Q ss_pred C-CCHHHHHHHCCCCCCCHH Q ss_conf 7-525898864123777214 Q gi|254780791|r 251 T-DWTLADYAADLRAPTPTG 269 (529) Q Consensus 251 ~-D~Tl~D~VAD~Ra~TPTa 269 (529) . -=...=+.+|+|..++++ T Consensus 106 ~GgG~~la~~cD~~ias~~a 125 (257) T PRK09674 106 LGAGCELALLCDIVIAGENA 125 (257) T ss_pred EEEEHHHHHHCCCCHHHHHH T ss_conf 23145765530210233321 No 168 >PRK05853 hypothetical protein; Validated Probab=83.27 E-value=1.7 Score=24.48 Aligned_cols=50 Identities=16% Similarity=0.280 Sum_probs=38.5 Q ss_pred CEEEEEEECCCCCCCCCCCCCCCEEEEEEEEE--EEC-CC----CEEEEEEEEEEEC Q ss_conf 47999997352105866814598899999966--752-88----4379999971016 Q gi|254780791|r 62 SRIDAIIWKGTLNKIEFLPEEGIEFLVIGKIT--TFP-GS----SKYQIIIESLIPS 111 (529) Q Consensus 62 a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~--~y~-~~----g~~ql~v~~i~~~ 111 (529) --++|.+|+.-+.++.-.++.|+.|+|.|++. -|+ +. ..+.+.++.|-|. T Consensus 50 ~f~~V~~Wg~LAenv~~sL~KG~pViV~GRL~~r~we~kdG~kRs~~eI~A~~VGpd 106 (168) T PRK05853 50 LFITVNCWGRLVTGVGAALGKGAPVIVVGHVYTSEYEDRDGNRRSSLEMRATSVGPD 106 (168) T ss_pred EEEEEEEEHHHHHHHHHHCCCCCEEEEEEEEECCCCCCCCCCEEEEEEEEEEEECCC T ss_conf 699999828999999864059998999999876512868998888999999887545 No 169 >COG2805 PilT Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion] Probab=83.20 E-value=3.9 Score=21.31 Aligned_cols=123 Identities=20% Similarity=0.336 Sum_probs=73.2 Q ss_pred EEEEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHH----HHH Q ss_conf 9999966752884379999971016800799999999997654012261001631026528999847842589----999 Q gi|254780791|r 87 LVIGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVI----RDI 162 (529) Q Consensus 87 ~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~----~D~ 162 (529) ..+-++++|..+|+|.+...-|-.. .=. ||+| |+ +.--+.+-..|+-.-+||.|||.|- .-+ T Consensus 80 ~~RfRvN~f~qr~~~a~vlR~Ip~~-i~~----~e~L-------gl--P~i~~~~~~~~~GLILVTGpTGSGKSTTlAam 145 (353) T COG2805 80 VARFRVNAFKQRGGYALVLRLIPSK-IPT----LEEL-------GL--PPIVRELAESPRGLILVTGPTGSGKSTTLAAM 145 (353) T ss_pred CCEEEEEHHHHCCCCEEEEECCCCC-CCC----HHHC-------CC--CHHHHHHHHCCCCEEEEECCCCCCHHHHHHHH T ss_conf 5168864555468837997616766-899----8781-------99--77999998287966998679999678799999 Q ss_pred HHHHHHCCCEEEE--------EEE---CCCC----CCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHC Q ss_conf 9986305975899--------972---1001----111036799999999741003576777589995168884442200 Q gi|254780791|r 163 LQRISCRFPLRVI--------IFP---VKVQ----GDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHF 227 (529) Q Consensus 163 ~~~~~~r~p~~~~--------~~p---~~vQ----G~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~F 227 (529) +.-+++..+..|+ +|+ |.++ |.+. .+-.+||+.+-+- +||||+|+ -+-|+ T Consensus 146 Id~iN~~~~~HIlTIEDPIE~vh~skkslI~QREvG~dT-~sF~~aLraALRe-------DPDVIlvG----EmRD~--- 210 (353) T COG2805 146 IDYINKHKAKHILTIEDPIEYVHESKKSLINQREVGRDT-LSFANALRAALRE-------DPDVILVG----EMRDL--- 210 (353) T ss_pred HHHHHCCCCCCEEEECCCHHHHHCCHHHHHHHHHHCCCH-HHHHHHHHHHHHC-------CCCEEEEE----CCCCH--- T ss_conf 999841477516872374686504327666687745427-8899999998602-------99979982----13469--- Q ss_pred CHHHHHHHHHHCC Q ss_conf 7699999997489 Q gi|254780791|r 228 NDEMIVRAIANSS 240 (529) Q Consensus 228 N~e~laraI~~~~ 240 (529) |.+.-|+-+.. T Consensus 211 --ETi~~ALtAAE 221 (353) T COG2805 211 --ETIRLALTAAE 221 (353) T ss_pred --HHHHHHHHHHH T ss_conf --99999998986 No 170 >pfam12128 DUF3584 Protein of unknown function (DUF3584). This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. There are two conserved sequence motifs: GKT and YLP. Probab=83.10 E-value=4 Score=21.28 Aligned_cols=59 Identities=12% Similarity=0.162 Sum_probs=27.7 Q ss_pred HHHHHHHHHHHHHHHHHH--HHCCHHHHHCCCEEEEECCCCCEECCHHHCCCCCEEEEEEECE Q ss_conf 999999999999999988--7169677731461999848988957778929998699999110 Q gi|254780791|r 432 SHKIKSCHTSVSITTRIL--QSFAYKNTLKRGYTSIQDTNNNFITQKRNLATKTRILINFFDG 492 (529) Q Consensus 432 ~~~L~~~~~rL~~l~~~L--~slsP~~~L~RGYaiv~~~~GkiI~s~~~l~~gd~i~i~l~DG 492 (529) ...+...+.++..+..+| ..-|-...|...--==.+.=|++|... +--...|.=.|.+| T Consensus 515 ~~~~~~l~~~~~~l~~~L~p~~gSL~~fL~~~~p~We~tIGKVidee--lL~rtdL~P~~~~~ 575 (1192) T pfam12128 515 ERRLLQLRQALQELELQLSPQAGSLLEFLRNEAPDWEESIGKVISPE--LLERTDLDPQLVEG 575 (1192) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCHHHHHHHHCCHH--HHHCCCCCCEECCC T ss_conf 99999999999999986278999489999758997677674018999--97068999720069 No 171 >PRK06731 flhF flagellar biosynthesis regulator FlhF; Validated Probab=83.08 E-value=4 Score=21.27 Aligned_cols=16 Identities=6% Similarity=0.212 Sum_probs=7.0 Q ss_pred HHHHHCCHHHHHHHHH Q ss_conf 4422007699999997 Q gi|254780791|r 222 EDLWHFNDEMIVRAIA 237 (529) Q Consensus 222 eDL~~FN~e~laraI~ 237 (529) |||-.||-+.+|+.|- T Consensus 252 eDi~~~~~~~la~riL 267 (270) T PRK06731 252 KNIHIATAEHLAKQML 267 (270) T ss_pred HHHHCCCHHHHHHHHH T ss_conf 2141379999999983 No 172 >PRK05870 enoyl-CoA hydratase; Provisional Probab=82.89 E-value=4 Score=21.21 Aligned_cols=39 Identities=33% Similarity=0.460 Sum_probs=26.9 Q ss_pred HHHHHHHHHCCCEEEEEE-CCC--CCCHHHHHHHCCCCCCCHH Q ss_conf 999999974890488520-577--7525898864123777214 Q gi|254780791|r 230 EMIVRAIANSSIPIISAI-GHE--TDWTLADYAADLRAPTPTG 269 (529) Q Consensus 230 e~laraI~~~~iPVisgI-GHE--~D~Tl~D~VAD~Ra~TPTa 269 (529) .....++.+||.|||++| ||= -=..|+ +.+|+|..+++| T Consensus 85 ~~~~~~i~~~~kPvIAav~G~a~GgG~~la-lacD~ria~~~a 126 (254) T PRK05870 85 YDGFMAVASCPLPTIAAVNGAAVGAGLNLA-LAADVRIAGPAA 126 (254) T ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCCHHH-HCCCEEEECCCC T ss_conf 999999996899999998771465671666-417689985997 No 173 >PRK10826 2-deoxyglucose-6-phosphatase; Provisional Probab=82.55 E-value=4.1 Score=21.11 Aligned_cols=94 Identities=15% Similarity=0.150 Sum_probs=60.5 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEC-CCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHH Q ss_conf 2899984784258999999863059758999721-001111036799999999741003576777589995168884442 Q gi|254780791|r 146 KIIAVITSPTGAVIRDILQRISCRFPLRVIIFPV-KVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDL 224 (529) Q Consensus 146 ~~i~vits~~~a~~~D~~~~~~~r~p~~~~~~p~-~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL 224 (529) -++||+||.....++-++..+.=.--|..++-.- ...|+-+|.-...|+++++-.. =++|+| || T Consensus 109 ~~l~i~Ts~~~~~~~~~L~~~gl~~~Fd~iv~~ddv~~~KP~Pe~yl~A~~~lg~~p-------~e~lvv-------eD- 173 (222) T PRK10826 109 LKIGLASASPLHMLEAVLTMLDLRDYFDALASAEKLPYSKPHPEVYLNCAAKLGVDP-------LTCVAL-------ED- 173 (222) T ss_pred CCEEEECCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHCCCH-------HHEEEE-------CC- T ss_conf 975886288489999999874996411057635325677998499999999959898-------896878-------38- Q ss_pred HHCCHHHHHHHHHHCCCEEEEEE--CCCCC---CHHHHHH Q ss_conf 20076999999974890488520--57775---2589886 Q gi|254780791|r 225 WHFNDEMIVRAIANSSIPIISAI--GHETD---WTLADYA 259 (529) Q Consensus 225 ~~FN~e~laraI~~~~iPVisgI--GHE~D---~Tl~D~V 259 (529) ...=++|--+.-+++| || ||..+ ...+|+| T Consensus 174 ----S~~gi~aA~~AGm~~v-~v~~~~~~~~~~~~~Ad~v 208 (222) T PRK10826 174 ----SFNGLIAAKAARMRSI-VVPAPEQRNDPRWALAHVQ 208 (222) T ss_pred ----CHHHHHHHHHCCCEEE-EECCCCCCCCCCCCCCCEE T ss_conf ----9999999998799799-9899888664120468989 No 174 >cd00578 L-fuc_L-ara-isomerases L-fucose isomerase (FucIase) and L-arabinose isomerase (AI) family; composed of FucIase, AI and similar proteins. FucIase converts L-fucose, an aldohexose, to its ketose form, which prepares it for aldol cleavage (similar to the isomerization of glucose in glycolysis). L-fucose (or 6-deoxy-L-galactose) is found in various oligo- and polysaccharides in mammals, bacteria and plants. AI catalyzes the isomerization of L-arabinose to L-ribulose, the first reaction in its conversion to D-xylulose-5-phosphate, an intermediate in the pentose phosphate pathway, which allows L-arabinose to be used as a carbon source. AI can also convert D-galactose to D-tagatose at elevated temperatures in the presence of divalent metal ions. D-tagatose, rarely found in nature, is of commercial interest as a low-calorie sugar substitute. Probab=82.40 E-value=4.2 Score=21.07 Aligned_cols=25 Identities=24% Similarity=0.249 Sum_probs=15.4 Q ss_pred HHHHHHHHCCCEEEEEECCCCCCHH Q ss_conf 9999997489048852057775258 Q gi|254780791|r 231 MIVRAIANSSIPIISAIGHETDWTL 255 (529) Q Consensus 231 ~laraI~~~~iPVisgIGHE~D~Tl 255 (529) +.+-+...+.+|--.=.||-.|.-+ T Consensus 117 ~~~~~l~r~gi~~~~i~g~~~d~~v 141 (452) T cd00578 117 EFGNILARLGIPFKVVYGHWKDEDV 141 (452) T ss_pred HHHHHHHHCCCCEEEEECCCCCHHH T ss_conf 9999999879983898669999899 No 175 >cd04320 AspRS_cyto_N AspRS_cyto_N: N-terminal, anticodon recognition domain of the type found in Saccharomyces cerevisiae and human cytoplasmic aspartyl-tRNA synthetase (AspRS). This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. The enzymes in this group are homodimeric class2b aminoacyl-tRNA synthetases (aaRSs). aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Eukaryotes contain 2 sets of aaRSs, both of which are encoded by the nuclear genome. One set concerns with cytoplasmic protein synthesis, whereas the other exclusively with mitochondrial protein synthesis. Probab=82.40 E-value=4.2 Score=21.07 Aligned_cols=74 Identities=16% Similarity=0.272 Sum_probs=51.1 Q ss_pred CEEEEEEECCCCCCCCCCE-EEEEEECCCCEEEEEEECCCCC--C--CCC--CCCCCCEEEEEEEEEEECC-------CC Q ss_conf 1899999705435688862-7999874894799999735210--5--866--8145988999999667528-------84 Q gi|254780791|r 34 HVCVRGEISGYRGIHSSGH-AYFSLKDNHSRIDAIIWKGTLN--K--IEF--LPEEGIEFLVIGKITTFPG-------SS 99 (529) Q Consensus 34 ~~~v~gEis~~~~~~~sGH-~Yf~lkd~~a~i~~~~~~~~~~--~--~~~--~~~~G~~v~~~g~~~~y~~-------~g 99 (529) .||++|=|.+.|. .+|+ +++.|.|....+.||++...-. . +++ .+....=|.+.|.|. =++ .+ T Consensus 1 ~V~~rGwv~~~R~--~g~kl~Fl~LRd~~g~iQ~V~~~~~~~~s~~m~~~~~~l~~ESvV~V~G~V~-~~~~~i~~~~~~ 77 (102) T cd04320 1 EVLIRARVHTSRA--QGAKLAFLVLRQQGYTIQGVLAASAEGVSKQMVKWAGSLSKESIVDVEGTVK-KPEEPIKSCTQQ 77 (102) T ss_pred CEEEEEEEEEECC--CCCEEEEEEEECCCEEEEEEEEECCCCCCHHHHHHHHCCCCCEEEEEEEEEE-CCCCCCCCCCCC T ss_conf 9899999972021--7971799999158718999998078877999999985599700999999998-899755678868 Q ss_pred EEEEEEEEEEE Q ss_conf 37999997101 Q gi|254780791|r 100 KYQIIIESLIP 110 (529) Q Consensus 100 ~~ql~v~~i~~ 110 (529) +|-+.|+.+.. T Consensus 78 ~iEi~i~~i~v 88 (102) T cd04320 78 DVELHIEKIYV 88 (102) T ss_pred CEEEEEEEEEE T ss_conf 78999999999 No 176 >PRK02910 light-independent protochlorophyllide reductase subunit B; Provisional Probab=82.35 E-value=4.2 Score=21.06 Aligned_cols=100 Identities=21% Similarity=0.314 Sum_probs=60.6 Q ss_pred EEEEECCCHHHHHHHHHHH-HHC--CC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC--- Q ss_conf 9998478425899999986-305--97-5899972100111103679999999974100357677758999516888--- Q gi|254780791|r 148 IAVITSPTGAVIRDILQRI-SCR--FP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGS--- 220 (529) Q Consensus 148 i~vits~~~a~~~D~~~~~-~~r--~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS--- 220 (529) =+|+-.|.|.-|-|.+.+. .++ +| +.+..+...+=|.+...-...++..+.... +||+||++ +| T Consensus 27 H~VlHAPqGD~Y~~~~~tMlER~~~~pPvT~s~~~~rdLgr~t~el~~~~~~~a~~~~------~P~~ivv~---~SCta 97 (524) T PRK02910 27 HYVLHAPQGDDYADLLFTMLERRGDRPPVTYSTVQARDLGRGTAELVKDTIRRADERF------QPDLIVVG---PSCTA 97 (524) T ss_pred EEEEECCCCCCCHHHHHHHHHHCCCCCCEEEEEECHHHHCCCHHHHHHHHHHHHHHCC------CCCEEEEC---CHHHH T ss_conf 5997269887424578998863489998478753144405650788899999985442------98579985---40069 Q ss_pred ---HHHHHHCCHHHHHHHHHHCCCEEEE-EECC-------CCCCHHHHHHHCC Q ss_conf ---4442200769999999748904885-2057-------7752589886412 Q gi|254780791|r 221 ---IEDLWHFNDEMIVRAIANSSIPIIS-AIGH-------ETDWTLADYAADL 262 (529) Q Consensus 221 ---~eDL~~FN~e~laraI~~~~iPVis-gIGH-------E~D~Tl~D~VAD~ 262 (529) .||+-. +|+. +..++|||. .+-| --|.|+-.+|-.+ T Consensus 98 ~liQedp~~-----la~~-~~l~~~vi~~e~~~y~~~E~~aAd~t~~qlVr~~ 144 (524) T PRK02910 98 ELLQEDPGG-----LAKG-AGLPIPVLVLELNAYRVKENWAADETFYQLVRAL 144 (524) T ss_pred HHHHHCCCC-----CCCC-CCCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 887546112-----2334-6999864455577210567888899999999998 No 177 >TIGR01420 pilT_fam twitching motility protein; InterPro: IPR006321 These represent the PilT subfamily of proteins related to GspE, a protein involved in type II secretion (also called the General Secretion Pathway). PilT is an apparent cytosolic ATPase associated with type IV pilus systems. It is not required for pilin biogenesis, but is required for twitching motility and social gliding behaviors, shown in some species, powered by pilus retraction . Members of this family may be found in some species that do not have type IV pili but have related structures for DNA uptake and natural transformation. ; GO: 0005524 ATP binding, 0006810 transport. Probab=82.30 E-value=1.7 Score=24.66 Aligned_cols=125 Identities=17% Similarity=0.243 Sum_probs=77.7 Q ss_pred EEEEEEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCC-CCCCCCCCCCCCCEEEEEECCCHHH----- Q ss_conf 8999999667528843799999710168007999999999976540122-6100163102652899984784258----- Q gi|254780791|r 85 EFLVIGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLF-SDQHKNPIPFIPKIIAVITSPTGAV----- 158 (529) Q Consensus 85 ~v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlf-d~~~k~~lP~~p~~i~vits~~~a~----- 158 (529) .=..+=+|++|.-||+|.++..-|-.. .-.|++-||= +- =+.+-.-|+=.-+||+|||.| T Consensus 79 ~~~~RfRvN~f~QRg~~a~vlR~ip~~------------Ip~fe~LGLP~~v--~~~~a~~~~GLiLVTGPTGSGKSTTl 144 (350) T TIGR01420 79 PGLARFRVNAFKQRGGVALVLRLIPSK------------IPTFEELGLPRPV--LRELAERPRGLILVTGPTGSGKSTTL 144 (350) T ss_pred CCCCEEEEHHHHHCCHHHHHHHHCCCC------------CCCHHHCCCCHHH--HHHHHHCCCCCEEEECCCCCCHHHHH T ss_conf 673221220323500064232311534------------6216663798789--99998366993898768898678999 Q ss_pred --HHHHHHHHHHC------CCEEEEEEE---CCCC----CCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHH Q ss_conf --99999986305------975899972---1001----11103679999999974100357677758999516888444 Q gi|254780791|r 159 --IRDILQRISCR------FPLRVIIFP---VKVQ----GDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIED 223 (529) Q Consensus 159 --~~D~~~~~~~r------~p~~~~~~p---~~vQ----G~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eD 223 (529) +=|-+|.-+.. .|+++ ||+ |.|+ |.++. |=.+||+.+-+- +||||+|+ =+= T Consensus 145 AsmIDyIN~~~~~HIiTIEDPIEy-vh~~~~sli~QREvG~DT~-sF~~ALraALRe-------DPDvILiG----E~R- 210 (350) T TIGR01420 145 ASMIDYINKNKAGHIITIEDPIEY-VHKNKRSLINQREVGLDTL-SFANALRAALRE-------DPDVILIG----EMR- 210 (350) T ss_pred HHHHHHHHCCCCCCCEEEECCEEE-EECCCEEEEECCCCCCCHH-HHHHHHHHHHCC-------CCCEEEEE----CCC- T ss_conf 999978740388882563177314-1047702454362467545-799997684102-------89889982----556- Q ss_pred HHHCCHHHHHHHHHHCCC Q ss_conf 220076999999974890 Q gi|254780791|r 224 LWHFNDEMIVRAIANSSI 241 (529) Q Consensus 224 L~~FN~e~laraI~~~~i 241 (529) |-|.+.-|+.+..+ T Consensus 211 ----D~ET~~~AL~AAET 224 (350) T TIGR01420 211 ----DLETVELALTAAET 224 (350) T ss_pred ----CHHHHHHHHHHHHH T ss_conf ----27899999987421 No 178 >TIGR00617 rpa1 replication factor-a protein 1 (rpa1); InterPro: IPR004591 All proteins in this family for which functions are known are part of a multiprotein complex made up of homologs of RPA1, RPA2 and RPA3 that bind single-stranded DNA and function in the recognition of DNA damage for nucleotide excision repair. The complex binds to single-stranded DNA sequences participating in DNA replication in addition to those mediating transcriptional repression and activation, and stimulates the activity of cognate strand exchange protein Sep1. It cooperates with T-AG and DNA topoisomerase I to unwind template DNA containing the Simian virus 40 origin of replication. This family is Replication factor-a protein 1 (RPA1); GO: 0003677 DNA binding, 0006260 DNA replication, 0005634 nucleus. Probab=82.21 E-value=0.76 Score=27.71 Aligned_cols=67 Identities=16% Similarity=0.229 Sum_probs=40.2 Q ss_pred CCCCC-CCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCEEE-EEEEC-CCCEEEEEEECCCCCCC Q ss_conf 88889-862299999999999740017189999970543568886279-99874-89479999973521058 Q gi|254780791|r 8 NSLDH-PEYSVSELSYHLKHIVESNLSHVCVRGEISGYRGIHSSGHAY-FSLKD-NHSRIDAIIWKGTLNKI 76 (529) Q Consensus 8 ~~~~~-~~~svs~l~~~i~~~l~~~~~~~~v~gEis~~~~~~~sGH~Y-f~lkd-~~a~i~~~~~~~~~~~~ 76 (529) ..++. +.+.+..|+=|-....=. ..|-=+|||--|.+...-|-++ |+|-| +...|+|++|.-.+.+. T Consensus 207 ~~~t~~~v~Pi~~LsPYq~~W~Ik--ARVT~K~~irtw~N~RGEGKlFn~~L~D~~sgeIRaTaFNe~ad~F 276 (671) T TIGR00617 207 SGSTRERVVPIESLSPYQNKWTIK--ARVTNKSEIRTWSNARGEGKLFNVELLDSESGEIRATAFNEQADKF 276 (671) T ss_pred HCCCCCCEEEECCCCCCCCCEEEE--EEEEEHHCCCCCCCCCCCCEEEEEEEECCCCCEEEEECHHHHHHHH T ss_conf 168977547712258877732898--9872000133023788881065557471797846774247887532 No 179 >PRK05657 RNA polymerase sigma factor RpoS; Validated Probab=81.79 E-value=4 Score=21.21 Aligned_cols=132 Identities=20% Similarity=0.195 Sum_probs=62.7 Q ss_pred ECCCCEEEEEEEEEEECC-CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHH----HHHC Q ss_conf 528843799999710168-00799999999997654012261001631026528999847842589999998----6305 Q gi|254780791|r 95 FPGSSKYQIIIESLIPSG-SGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQR----ISCR 169 (529) Q Consensus 95 y~~~g~~ql~v~~i~~~g-~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~----~~~r 169 (529) |..+| +-..++--.| +|-+.+. + -||+.+--.+..| | ++=|... +.+. T Consensus 109 Y~~~G---l~l~DLIQEGniGLikAv-e----------KFDp~rG~RFSTY------------A-~WWIRq~I~raI~~~ 161 (328) T PRK05657 109 YLNRG---LALLDLIEEGNLGLIRAV-E----------KFDPERGFRFSTY------------A-TWWIRQTIERAIMNQ 161 (328) T ss_pred CCCCC---CCHHHHHHHHHHHHHHHH-H----------HCCCCCCCCEEEE------------E-HHHHHHHHHHHHHHC T ss_conf 56899---898999988689999999-9----------7197669984230------------0-999999999999965 Q ss_pred CCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHH---CCHHHHHHHHHHCC--CEEE Q ss_conf 975899972100111103679999999974100357677758999516888444220---07699999997489--0488 Q gi|254780791|r 170 FPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWH---FNDEMIVRAIANSS--IPII 244 (529) Q Consensus 170 ~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~---FN~e~laraI~~~~--iPVi 244 (529) +. ++=.|+.|+.. ...+.++...+.... + ..-+.+++.. ...+.|......+. +-.= T Consensus 162 ~r--~IRlP~~~~~~--l~~i~ra~~~l~~~~--g------------r~pt~eEIA~~lg~~~~~v~~~l~~~~~~~SLD 223 (328) T PRK05657 162 TR--TIRLPVHVVKE--LNVYLRAARELEHKL--D------------HEPSAEEIAELLDKPVDDVSRMLALNERITSLD 223 (328) T ss_pred CC--CCCCCCHHHHH--HHHHHHHHHHHHHHH--C------------CCCCHHHHHHHHCCCHHHHHHHHHHCCCCCCCC T ss_conf 77--77756119999--999999999999984--8------------999899999996959999999997458763444 Q ss_pred EEECCCCCCHHHHHHHCCCCCCCHHHH Q ss_conf 520577752589886412377721456 Q gi|254780791|r 245 SAIGHETDWTLADYAADLRAPTPTGAA 271 (529) Q Consensus 245 sgIGHE~D~Tl~D~VAD~Ra~TPTaAA 271 (529) +-+|.+-|.|+.|+++|-..++|...+ T Consensus 224 ~pv~~d~~~tl~d~l~D~~~~~pd~~~ 250 (328) T PRK05657 224 APLGGDPEKSLLDILADDQENGPEDTT 250 (328) T ss_pred CCCCCCCCCHHHHHHCCCCCCCHHHHH T ss_conf 535899851178886277899999999 No 180 >cd06275 PBP1_PurR Ligand-binding domain of purine repressor, PurR, which functions as the master regulatory protein of de novo purine nucleotide biosynthesis in Escherichia coli. Ligand-binding domain of purine repressor, PurR, which functions as the master regulatory protein of de novo purine nucleotide biosynthesis in Escherichia coli. This dimeric PurR belongs to the LacI-GalR family of transcription regulators and is activated to bind to DNA operator sites by initially binding either of high affinity corepressors, hypoxanthine or guanine. PurR is composed of two functional domains: aan N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the purine transcription repressor undergoes a Probab=81.57 E-value=4.5 Score=20.84 Aligned_cols=19 Identities=11% Similarity=0.447 Sum_probs=7.5 Q ss_pred EEEEECC-CHHHHHHHHHHH Q ss_conf 9998478-425899999986 Q gi|254780791|r 148 IAVITSP-TGAVIRDILQRI 166 (529) Q Consensus 148 i~vits~-~~a~~~D~~~~~ 166 (529) ||||.+. +-.-+.++++.+ T Consensus 2 IGvivp~~~npf~~~l~~gi 21 (269) T cd06275 2 IGMLVTTSTNPFFAEVVRGV 21 (269) T ss_pred EEEEECCCCCHHHHHHHHHH T ss_conf 89997898759999999999 No 181 >PRK04531 acetylglutamate kinase; Provisional Probab=81.22 E-value=1.9 Score=24.06 Aligned_cols=107 Identities=24% Similarity=0.394 Sum_probs=61.2 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHCCC-E-EEEEEECCCCCCC------------------------------------ Q ss_conf 6528999847842589999998630597-5-8999721001111------------------------------------ Q gi|254780791|r 144 IPKIIAVITSPTGAVIRDILQRISCRFP-L-RVIIFPVKVQGDE------------------------------------ 185 (529) Q Consensus 144 ~p~~i~vits~~~a~~~D~~~~~~~r~p-~-~~~~~p~~vQG~~------------------------------------ 185 (529) -|+++|||-= +||++.|-+..+..--. + .+=++|+.|-|-+ T Consensus 18 ~~~~fAVIKv-GGavi~e~l~~la~dLafL~~vGl~PVVVHGgGpqI~~~L~~~gIe~~fi~G~RVTd~etl~vv~~vl~ 96 (421) T PRK04531 18 DPERFAVIKV-GGAVLREDLEALTSSLKFLQEVGLTPVVVHGAGPQLDAELSAAGIEKDTVNGLRVTSPEALAIVRKVFQ 96 (421) T ss_pred CCCEEEEEEE-CHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 8761899998-879999799999999999998799799993898778799997599976779630588889999999997 Q ss_pred -HHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCC------------CCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCC Q ss_conf -036799999999741003576777589995168------------8844422007699999997489048852057775 Q gi|254780791|r 186 -CPKEIANAILQLNTLKEGRTCPRPDIIILARGG------------GSIEDLWHFNDEMIVRAIANSSIPIISAIGHETD 252 (529) Q Consensus 186 -a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGG------------GS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D 252 (529) .-.+|+++|.... ..++.+. || |-..+--..|.+.|-.++-+-.||||+-||+..| T Consensus 97 ~~n~~lV~~L~~~G----------~~A~~i~-g~~~~a~~~d~~~~G~VG~V~~Vd~~~I~~~L~~g~IPVIapig~~~~ 165 (421) T PRK04531 97 ASNLKLVEALQQNG----------ARATSIT-GGVFEAEYLDRDTYGLVGEVKAVLLAAVRSSLRAGKIPVIASLGETPS 165 (421) T ss_pred HHHHHHHHHHHHCC----------CCEEECC-CCCEECCCCCCCCCCCCCCEEEECHHHHHHHHHCCCEEEECCCEECCC T ss_conf 65299999999679----------9757346-654350336666676433347786899999997898269815315899 Q ss_pred CHHHHHHHCC Q ss_conf 2589886412 Q gi|254780791|r 253 WTLADYAADL 262 (529) Q Consensus 253 ~Tl~D~VAD~ 262 (529) --+.--=||. T Consensus 166 Gq~lNINAD~ 175 (421) T PRK04531 166 GQILNINADV 175 (421) T ss_pred CCEEEECHHH T ss_conf 9788405999 No 182 >PRK07938 enoyl-CoA hydratase; Provisional Probab=81.16 E-value=4.6 Score=20.73 Aligned_cols=35 Identities=23% Similarity=0.304 Sum_probs=25.1 Q ss_pred HHHHHHHHHCCCEEEEEE-------CCCCCCHHHHHHHCCCCCCCHH Q ss_conf 999999974890488520-------5777525898864123777214 Q gi|254780791|r 230 EMIVRAIANSSIPIISAI-------GHETDWTLADYAADLRAPTPTG 269 (529) Q Consensus 230 e~laraI~~~~iPVisgI-------GHE~D~Tl~D~VAD~Ra~TPTa 269 (529) ..+.++|..||.|||.+| |.|. =+.+|+|..+|+| T Consensus 84 ~~~~~~i~~~~kPvIaaV~G~a~GgG~~l-----al~cD~~ias~~a 125 (249) T PRK07938 84 FAAFRAVYECPVPVIAAVHGFCLGGGIGL-----VGNADVIVASDDA 125 (249) T ss_pred HHHHHHHHHCCCCEEEEECCEEEEEEEEE-----EECCCHHHHCCCC T ss_conf 99999998299989999757576612131-----2043344442687 No 183 >PRK07226 fructose-bisphosphate aldolase; Provisional Probab=81.07 E-value=4.6 Score=20.71 Aligned_cols=91 Identities=19% Similarity=0.323 Sum_probs=61.8 Q ss_pred CHHHHHHHHHHHH--HCCCEEEE--EEE--CCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCC Q ss_conf 4258999999863--05975899--972--10011110367999999997410035767775899951688844422007 Q gi|254780791|r 155 TGAVIRDILQRIS--CRFPLRVI--IFP--VKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFN 228 (529) Q Consensus 155 ~~a~~~D~~~~~~--~r~p~~~~--~~p--~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN 228 (529) ...-++++-++.. ++|.+.++ .|| ..+..+..+.-|.-|-+.+-.+. .|+|=+- |+=| T Consensus 121 E~~~l~~l~~l~~ea~~~GlPll~~~yprG~~~~~~~d~~~ia~aaRia~ELG-------ADiVKv~---------y~gd 184 (266) T PRK07226 121 EAEMLEDLGRIARECEEWGMPLLAMMYPRGPKIKNEYDPEVVAHAARVGAELG-------ADIIKTN---------YTGD 184 (266) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHC-------CCEEEEC---------CCCC T ss_conf 79999999999999998599569997346887777875999999999999967-------9989835---------9999 Q ss_pred HHHHHHHHHHCCCEEEEEECCCC--CCHHHHHHHC Q ss_conf 69999999748904885205777--5258988641 Q gi|254780791|r 229 DEMIVRAIANSSIPIISAIGHET--DWTLADYAAD 261 (529) Q Consensus 229 ~e~laraI~~~~iPVisgIGHE~--D~Tl~D~VAD 261 (529) .|...+.|..|++||+.+=|=.+ |..+..+|.+ T Consensus 185 ~e~f~~vv~~~~vPVliaGG~k~~~~~~~L~~v~~ 219 (266) T PRK07226 185 PESFREVVEGCPVPVVIAGGPKTGSDLEFLQMVED 219 (266) T ss_pred HHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHH T ss_conf 89999999847987899578888988999999999 No 184 >COG1188 Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) [Translation, ribosomal structure and biogenesis] Probab=80.95 E-value=2.2 Score=23.60 Aligned_cols=44 Identities=11% Similarity=0.188 Sum_probs=35.8 Q ss_pred HHCCCEEEEECCCCCEECCHHHCCCCCEEEEEEECEEEEEEEEECCC Q ss_conf 73146199984898895777892999869999911099999950578 Q gi|254780791|r 457 TLKRGYTSIQDTNNNFITQKRNLATKTRILINFFDGQANAIVINKAP 503 (529) Q Consensus 457 ~L~RGYaiv~~~~GkiI~s~~~l~~gd~i~i~l~DG~v~a~V~~k~~ 503 (529) ..+.|= |. -||+.++.+++++.||.|+|+|.-....++|..... T Consensus 30 ~~~~Gr--V~-vNG~~aKpS~~VK~GD~l~i~~~~~~~~v~Vl~~~~ 73 (100) T COG1188 30 MIEGGR--VK-VNGQRAKPSKEVKVGDILTIRFGNKEFTVKVLALGE 73 (100) T ss_pred HHHCCE--EE-ECCEECCCCCCCCCCCEEEEEECCCEEEEEEEECCC T ss_conf 987796--98-999675655614779899999578189999973543 No 185 >PRK03580 carnitinyl-CoA dehydratase; Provisional Probab=80.94 E-value=4.7 Score=20.67 Aligned_cols=75 Identities=19% Similarity=0.174 Sum_probs=41.0 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC--H---HHHHHCCH------------HHHHHHHHHCCCEEEEEE-CC Q ss_conf 679999999974100357677758999516888--4---44220076------------999999974890488520-57 Q gi|254780791|r 188 KEIANAILQLNTLKEGRTCPRPDIIILARGGGS--I---EDLWHFND------------EMIVRAIANSSIPIISAI-GH 249 (529) Q Consensus 188 ~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS--~---eDL~~FN~------------e~laraI~~~~iPVisgI-GH 249 (529) .++..++..++... .+.|||| +|.|. + -||-.|.+ ....+.+.+||.|||++| || T Consensus 33 ~~l~~al~~~~~d~------~vrvvvl-tg~G~~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~~~l~~~~kPvIaaV~G~ 105 (262) T PRK03580 33 FAMGEVFLNFRDDP------ELRVAII-TGAGEKFFSAGWDLKAAAEGEAPDADFGPGGFAGLTEIFDLDKPVIAAVNGY 105 (262) T ss_pred HHHHHHHHHHHHCC------CCEEEEE-ECCCCCEECCCCHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCE T ss_conf 99999999986199------9659999-6699960426710766534786431342667899999981999999998686 Q ss_pred CCCC-HHHHHHHCCCCCCCHH Q ss_conf 7752-5898864123777214 Q gi|254780791|r 250 ETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 250 E~D~-Tl~D~VAD~Ra~TPTa 269 (529) =.=- .-+=+.+|+|..+++| T Consensus 106 a~GgG~~LalacD~~ia~~~a 126 (262) T PRK03580 106 AFGGGFELALAADFIVCADNA 126 (262) T ss_pred EEEHHHHHHHHCCEEEECCCC T ss_conf 661869998852702414787 No 186 >cd04318 EcAsnRS_like_N EcAsnRS_like_N: N-terminal, anticodon recognition domain of the type found in Escherichia coli asparaginyl-tRNA synthetase (AsnRS) and, in Arabidopsis thaliana and Saccharomyces cerevisiae mitochondrial (mt) AsnRS. This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. The enzymes in this group are homodimeric class2b aminoacyl-tRNA synthetases (aaRSs). aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Eukaryotes contain 2 sets of aaRSs, both of which are encoded by the nuclear genome. One set concerns with cytoplasmic protein synthesis, whereas the other exclusively with mitochondrial Probab=80.40 E-value=4.8 Score=20.53 Aligned_cols=75 Identities=19% Similarity=0.228 Sum_probs=53.6 Q ss_pred EEEEEEECCCCCCCCCCEEEEEEECCCC--EEEEEEECCCCCC-CCCCCCCCCEEEEEEEEEEECCC--CEEEEEEEEEE Q ss_conf 8999997054356888627999874894--7999997352105-86681459889999996675288--43799999710 Q gi|254780791|r 35 VCVRGEISGYRGIHSSGHAYFSLKDNHS--RIDAIIWKGTLNK-IEFLPEEGIEFLVIGKITTFPGS--SKYQIIIESLI 109 (529) Q Consensus 35 ~~v~gEis~~~~~~~sGH~Yf~lkd~~a--~i~~~~~~~~~~~-~~~~~~~G~~v~~~g~~~~y~~~--g~~ql~v~~i~ 109 (529) |-|.|=|-+.|. .++.+|.+|.|... .+.||+=...... .--.+..|.-|.+.|.+- -.|. ++|-|.++.++ T Consensus 2 V~i~GWV~~~R~--~~~i~Fi~L~DGS~~~~lQvV~~~~~~~~~~~~~l~~gssv~v~G~l~-~s~~~~q~~El~~~~i~ 78 (82) T cd04318 2 VTVNGWVRSVRD--SKKISFIELNDGSCLKNLQVVVDKELTNFKEILKLSTGSSIRVEGVLV-KSPGAKQPFELQAEKIE 78 (82) T ss_pred EEEEEEEEEEEC--CCCEEEEEEECCCCCCCEEEEECCCCCCHHHHHCCCCCEEEEEEEEEE-CCCCCCCCEEEEEEEEE T ss_conf 999986888864--897499999789864438999837767977984467853999999999-08999999999944999 Q ss_pred ECC Q ss_conf 168 Q gi|254780791|r 110 PSG 112 (529) Q Consensus 110 ~~g 112 (529) .-| T Consensus 79 i~g 81 (82) T cd04318 79 VLG 81 (82) T ss_pred EEE T ss_conf 995 No 187 >cd06310 PBP1_ABC_sugar_binding_like_2 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=80.33 E-value=4.8 Score=20.51 Aligned_cols=83 Identities=14% Similarity=0.103 Sum_probs=38.6 Q ss_pred EEEEEECCCHHHHH-HHHHHH---HHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHH Q ss_conf 89998478425899-999986---30597589997210011110367999999997410035767775899951688844 Q gi|254780791|r 147 IIAVITSPTGAVIR-DILQRI---SCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIE 222 (529) Q Consensus 147 ~i~vits~~~a~~~-D~~~~~---~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~e 222 (529) |||||+...+--|. .+.+-+ .+.+++++.++.+.=++ -+..-++.|+.+-.. ++|.|||.=.. T Consensus 1 kIg~i~~~~~npf~~~v~~G~e~aA~~~G~~v~~~~~~~~~--d~~~q~~~i~~~i~~-------~vDgIii~p~~---- 67 (273) T cd06310 1 KIALVPKGTTSDFWQAVKAGAEAAAKELGVKVTFQGPASET--DVAGQVNLLENAIAR-------GPDAILLAPTD---- 67 (273) T ss_pred CEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCC--CHHHHHHHHHHHHHC-------CCCEEEEECCC---- T ss_conf 98999599998599999999999999809989997289978--999999999999974-------99999991687---- Q ss_pred HHHHCCHHHHHHHHHHCCCEEEE Q ss_conf 42200769999999748904885 Q gi|254780791|r 223 DLWHFNDEMIVRAIANSSIPIIS 245 (529) Q Consensus 223 DL~~FN~e~laraI~~~~iPVis 245 (529) .-.....++...+..||||+ T Consensus 68 ---~~~~~~~l~~a~~~gIPvV~ 87 (273) T cd06310 68 ---AKALVPPLKEAKDAGIPVVL 87 (273) T ss_pred ---CHHHHHHHHHHHHCCCCEEE T ss_conf ---14479999999984998589 No 188 >pfam10234 Cluap1 Clusterin-associated protein-1. This protein is conserved from worms to humans. The protein of 413 amino acids contains a central coiled-coil domain, possibly the region that binds to clusterin. Cluap1 expression is highest in the nucleus and gradually increases during late S to G2/M phases of the cell cycle and returns to the basal level in the G0/G1 phases. In addition, it is upregulated in colon cancer tissues compared to corresponding non-cancerous mucosa. It thus plays a crucial role in the life of the cell. Probab=80.21 E-value=4.9 Score=20.49 Aligned_cols=188 Identities=15% Similarity=0.246 Sum_probs=83.4 Q ss_pred CCCHHHHHHHHHHHHHCC-CE-EEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHH Q ss_conf 784258999999863059-75-8999721001111036799999999741003576777589995168884442200769 Q gi|254780791|r 153 SPTGAVIRDILQRISCRF-PL-RVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDE 230 (529) Q Consensus 153 s~~~a~~~D~~~~~~~r~-p~-~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e 230 (529) .|+=.-+.|++.=|..|| |- .| |.-+.-+.---..++++..+-..+ .+.+.+ .--|+.=+-+ T Consensus 16 ~PNF~LVAeiL~WLv~ryeP~~~i---p~~i~tE~dRV~FIk~v~e~~~tK---a~IKLN----------~kKLY~Adg~ 79 (269) T pfam10234 16 TPNFELVAEILHWLAKRYEPSADI---PGDIETEEDRVLFIKAVTEFMVTK---AHIKLN----------PKKLYAADGY 79 (269) T ss_pred CCCHHHHHHHHHHHHHHHCCCCCC---CCCCCCHHHHHHHHHHHHHHHHHH---HHHCCC----------HHHHHHCCHH T ss_conf 998099999999999980888767---776775367999999999999987---241257----------9999870189 Q ss_pred ------HHHHHHHHCCCEEEEEECCCC------CCHHHHHHHCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf ------999999748904885205777------52589886412377721456763323467776699999888778998 Q gi|254780791|r 231 ------MIVRAIANSSIPIISAIGHET------DWTLADYAADLRAPTPTGAAEMAVPVKEHLQSSLINLEARLNNIIIR 298 (529) Q Consensus 231 ------~laraI~~~~iPVisgIGHE~------D~Tl~D~VAD~Ra~TPTaAAElavp~~~EL~~~L~~l~~RL~~a~~~ 298 (529) .|+.-+|... -.++-+.+. ++.|.|-+.|+++. .+|...|.+..+.|...+.. T Consensus 80 AVkELlKit~lLy~a~--~~~~~~~~~~~~s~~~~~l~~K~~dlk~~-------------r~LaSeit~~GA~L~dlL~k 144 (269) T pfam10234 80 AVKELLKITSVLYKAM--KSTDGDEEENTRSLAKFDLGSKISDLKAA-------------RQLASEITERGAALYDLLGK 144 (269) T ss_pred HHHHHHHHHHHHHHHH--CCCCCCCCCCCHHHHHCCCHHHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999986--18866555320036660301468879999-------------99999999978999999975 Q ss_pred HHHHHHHHHHHHHHHHC--CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 88999985567765304--9788986788878886545689999998756889999984101134688999999999999 Q gi|254780791|r 299 LIKYKINTLNSLLKALP--NSDQTLSCSRYRLDRLPRELEHNLEIIIFRKYRHFNHIINRIKSDFLIHNIQKNRQHILEK 376 (529) Q Consensus 299 ~l~~~~q~L~~l~r~L~--~~~~~L~~~~QrLd~~~~rL~~~l~~~l~~~~~~l~~~~~rL~~~~~~~~l~~~r~~L~~~ 376 (529) ..+.+..+...+.+.+. ..+..+....+.++.. +...+..++.+... ...+..++++.++.+... T Consensus 145 E~~lr~~R~~a~~r~~el~~iEk~l~~~i~~~~~~-----------~~~~~~~l~~~~~d--e~~Le~KIekkk~ELER~ 211 (269) T pfam10234 145 ELDLRESRQKALSRPLELAEVEKALKEAIKNLAAR-----------LQQLQAELDNLKSD--EANLEAKIERKKQELERT 211 (269) T ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH T ss_conf 89899999999817667899999999999999999-----------99999999987737--877998999988889999 Q ss_pred HHHHHHHH Q ss_conf 99999887 Q gi|254780791|r 377 QQHIEQIV 384 (529) Q Consensus 377 q~~L~q~~ 384 (529) ++||..+. T Consensus 212 qKRL~sLq 219 (269) T pfam10234 212 QKRLQALQ 219 (269) T ss_pred HHHHHHHH T ss_conf 99998753 No 189 >PRK07509 enoyl-CoA hydratase; Provisional Probab=80.20 E-value=4.9 Score=20.48 Aligned_cols=36 Identities=22% Similarity=0.404 Sum_probs=23.3 Q ss_pred HHHHHHHHCCCEEEEEE-----CCCCCCHHHHHHHCCCCCCCHH Q ss_conf 99999974890488520-----5777525898864123777214 Q gi|254780791|r 231 MIVRAIANSSIPIISAI-----GHETDWTLADYAADLRAPTPTG 269 (529) Q Consensus 231 ~laraI~~~~iPVisgI-----GHE~D~Tl~D~VAD~Ra~TPTa 269 (529) .++.++..||.|||++| |==.... +.+|+|..++++ T Consensus 93 ~~~~~~~~~~kPvIaav~G~a~GgG~~La---lacD~ria~~~a 133 (262) T PRK07509 93 RVSLGWRRLPVPVIAALEGVCFGGGLQIA---LGADIRIAAPDT 133 (262) T ss_pred HHHHHHHHCCCCEEEEEECEEEECCCEEE---CCCCEEEECCCC T ss_conf 99999860999889998076888640211---146555636986 No 190 >PRK04539 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=80.02 E-value=4.9 Score=20.44 Aligned_cols=89 Identities=20% Similarity=0.238 Sum_probs=49.6 Q ss_pred CCCCEEEEEECCCHHHHHHHHHHHHHC---CCEEEEEEECCCCCCCHHHHHH---HHHHHHHHHCCCCCCCCCCEEEEEC Q ss_conf 265289998478425899999986305---9758999721001111036799---9999997410035767775899951 Q gi|254780791|r 143 FIPKIIAVITSPTGAVIRDILQRISCR---FPLRVIIFPVKVQGDECPKEIA---NAILQLNTLKEGRTCPRPDIIILAR 216 (529) Q Consensus 143 ~~p~~i~vits~~~a~~~D~~~~~~~r---~p~~~~~~p~~vQG~~a~~~i~---~ai~~~~~~~~~~~~~~~D~iii~R 216 (529) ..+++||||+-++.....+..+.+.+. -++++++-+..+........-. ..+... .+ ...+|++|..= T Consensus 3 ~~fk~Igiv~k~~~~~~~~~~~~l~~~L~~~g~~v~ld~~~~~~~~~~~~~~~~~~~~~~~-~~-----~~~~Dlii~lG 76 (296) T PRK04539 3 SPFHNIGIVTRPNTPDIQDTAHTLITFLKQHGFTVYLDEVGVKEGCIYTQDTVGCHIVNKT-EL-----GQYCDLVAVLG 76 (296) T ss_pred CCCCEEEEEEECCCHHHHHHHHHHHHHHHHCCCEEEEECCHHHHCCCCCCCCCCCCCCCHH-HH-----CCCCCEEEEEC T ss_conf 9997899997089879999999999999978799999654112035464334565546867-81-----64677999978 Q ss_pred CCCCHHHHHHCCHHHHHHHHHHCCCEEEE Q ss_conf 68884442200769999999748904885 Q gi|254780791|r 217 GGGSIEDLWHFNDEMIVRAIANSSIPIIS 245 (529) Q Consensus 217 GGGS~eDL~~FN~e~laraI~~~~iPVis 245 (529) |-|.+- ..+|..+...+||+. T Consensus 77 GDGTlL--------~a~r~~~~~~~PilG 97 (296) T PRK04539 77 GDGTFL--------SVAREIAPRAVPIIG 97 (296) T ss_pred CCHHHH--------HHHHHHHHCCCCEEE T ss_conf 708999--------999986005997899 No 191 >KOG0556 consensus Probab=79.88 E-value=5 Score=20.40 Aligned_cols=68 Identities=21% Similarity=0.245 Sum_probs=43.7 Q ss_pred HHHHHHHCC-CCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCC-----CC--CCCCCEEEEEEEEE Q ss_conf 999974001-718999997054356888627999874894799999735210586-----68--14598899999966 Q gi|254780791|r 24 LKHIVESNL-SHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIE-----FL--PEEGIEFLVIGKIT 93 (529) Q Consensus 24 i~~~l~~~~-~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~-----~~--~~~G~~v~~~g~~~ 93 (529) |+++-+..- .-|||+|-|- +++..+-.|++.|.+....+.|++..+.-..+. |. +..-.-|.|.|.|. T Consensus 73 v~dl~~~~~~~~V~vRgrVh--tsr~~GK~~FlvLRq~~~tVQ~~~~~~~~~~isk~Mvkf~~~is~ESiV~v~g~v~ 148 (533) T KOG0556 73 VSDLDESNDGSEVLVRGRVH--TSRLKGKLCFLVLRQQGSTVQCLVAVNEDGTISKQMVKFAGSISKESIVDVRGVVV 148 (533) T ss_pred HHHHHHHCCCCEEEEEEEEE--ECCCCCEEEEEEEECCCCEEEEEEECCCCCHHHHHHHHHHHHCCCCEEEEEEEEEE T ss_conf 43423312783699998885--13445507999994067359999971787607778999986458120899998871 No 192 >PRK08272 enoyl-CoA hydratase; Provisional Probab=79.58 E-value=5.1 Score=20.33 Aligned_cols=36 Identities=14% Similarity=0.139 Sum_probs=25.0 Q ss_pred HHHHHHHHCCCEEEEEE-----CCCCCCHHHHHHHCCCCCCCHH Q ss_conf 99999974890488520-----5777525898864123777214 Q gi|254780791|r 231 MIVRAIANSSIPIISAI-----GHETDWTLADYAADLRAPTPTG 269 (529) Q Consensus 231 ~laraI~~~~iPVisgI-----GHE~D~Tl~D~VAD~Ra~TPTa 269 (529) .+..++..|+.|||.+| |--.. +=+.+|+|..+++| T Consensus 123 ~~~~~l~~~~kPvIAaV~G~a~GgG~~---Lal~cD~~iaa~~A 163 (308) T PRK08272 123 RGFMSLLHAHKPTVAKIHGYCVAGGTD---IALHCDQVIAADDA 163 (308) T ss_pred HHHHHHHCCCCCEEEEECCEEEEHHHH---HHHHCCCEEECCCC T ss_conf 999998639998899988865628799---99967800554888 No 193 >PRK10139 serine endoprotease; Provisional Probab=79.32 E-value=5.2 Score=20.27 Aligned_cols=23 Identities=13% Similarity=0.189 Sum_probs=13.7 Q ss_pred HHCCCEEEEECCCCCEECCHHHCC Q ss_conf 731461999848988957778929 Q gi|254780791|r 457 TLKRGYTSIQDTNNNFITQKRNLA 480 (529) Q Consensus 457 ~L~RGYaiv~~~~GkiI~s~~~l~ 480 (529) =|.+|-.|+. -+++.|+|+++++ T Consensus 407 GLr~GDVI~~-VN~~~V~sv~d~~ 429 (455) T PRK10139 407 GLQKDDVIIG-VNRDRVNSIAEMR 429 (455) T ss_pred CCCCCCEEEE-ECCEECCCHHHHH T ss_conf 9999999997-7998739999999 No 194 >CHL00076 chlB photochlorophyllide reductase subunit B Probab=79.29 E-value=5.2 Score=20.26 Aligned_cols=48 Identities=13% Similarity=0.268 Sum_probs=19.0 Q ss_pred EEECCCHHHHHHHHHHHHH---CCC-EEEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 9847842589999998630---597-58999721001111036799999999 Q gi|254780791|r 150 VITSPTGAVIRDILQRISC---RFP-LRVIIFPVKVQGDECPKEIANAILQL 197 (529) Q Consensus 150 vits~~~a~~~D~~~~~~~---r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~ 197 (529) |+-.|.|.-|-|.+.++-+ .+| +...++...+=|.++++-++++|... T Consensus 29 V~hAP~GddY~d~~~tmleR~~~~pPvT~s~~~~~~L~~gt~e~v~~~i~~~ 80 (510) T CHL00076 29 IMHAPLGDDYFNVMRSMLERERDFTPVTASIVDRHVLARGSQEKVVDNITRK 80 (510) T ss_pred EEECCCCCCCHHHHHHHHHHCCCCCCEEEECCEEEECCCCCHHHHHHHHHHH T ss_conf 9516998754125888986447999846412001223677479999999998 No 195 >pfam07302 AroM AroM protein. This family consists of several bacterial and archaeal AroM proteins. In Escherichia coli the aroM gene is cotranscribed with aroL. The function of this family is unknown. Probab=78.98 E-value=5.3 Score=20.19 Aligned_cols=80 Identities=18% Similarity=0.283 Sum_probs=56.3 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCC-E--E-EEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH Q ss_conf 28999847842589999998630597-5--8-999721001111036799999999741003576777589995168884 Q gi|254780791|r 146 KIIAVITSPTGAVIRDILQRISCRFP-L--R-VIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI 221 (529) Q Consensus 146 ~~i~vits~~~a~~~D~~~~~~~r~p-~--~-~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~ 221 (529) .++|||++.. +-+.....+|. + + +...-++.+|.. .++..|-+.+... ..|+||+ T Consensus 126 ~~lGVvvP~~-----eQ~~~~~~kW~~~~~~~~~~~asPy~~~~--~~l~~Aa~~L~~~-------gadlivL------- 184 (221) T pfam07302 126 HQLGVIVPLP-----EQISQQANKWQKLVKEVVVVAASPYHGSE--DRLEEAARELLDQ-------GADLIVL------- 184 (221) T ss_pred CEEEEEECCH-----HHHHHHHHHHHHCCCCCEEEEECCCCCCH--HHHHHHHHHHHHC-------CCCEEEE------- T ss_conf 8599995579-----99999999887458871799718988988--9999999999975-------9899999------- Q ss_pred HHHHHCCHHHHHHHHHHCCCEEEEEE Q ss_conf 44220076999999974890488520 Q gi|254780791|r 222 EDLWHFNDEMIVRAIANSSIPIISAI 247 (529) Q Consensus 222 eDL~~FN~e~laraI~~~~iPVisgI 247 (529) |=+.|+.+.=.+.--.+.+|||.+. T Consensus 185 -DCiGYt~~~r~~~~~~~g~PViLsr 209 (221) T pfam07302 185 -DCIGYTQAMKDMLQEALGKPVLLSR 209 (221) T ss_pred -ECCCCCHHHHHHHHHHHCCCEEEHH T ss_conf -8877749999999998699999508 No 196 >COG1190 LysU Lysyl-tRNA synthetase (class II) [Translation, ribosomal structure and biogenesis] Probab=78.90 E-value=5.3 Score=20.17 Aligned_cols=234 Identities=19% Similarity=0.251 Sum_probs=114.4 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHH----HHHHCCCCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCC Q ss_conf 9888888888986229999999999----974001718999997054356888627999874894799999735210586 Q gi|254780791|r 2 NPFSQKNSLDHPEYSVSELSYHLKH----IVESNLSHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIE 77 (529) Q Consensus 2 ~~~~~~~~~~~~~~svs~l~~~i~~----~l~~~~~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~ 77 (529) +|+|....-+. ++.++...-.. -++.....|.|-|-|...+. .+.-++|+|-|..++|...+=+.....-. T Consensus 30 ~~yp~~~~~~~---~~~~l~~~~~~~~~~el~~~~~~v~vAGRi~~~R~--~GK~~F~~i~d~~gkiQ~yi~k~~~~~~~ 104 (502) T COG1190 30 DPYPNDFERTH---TSADLREKYADKTKEELEALNIEVSVAGRIMTIRN--MGKASFADLQDGSGKIQLYVNKDEVGEEV 104 (502) T ss_pred CCCCCCCCCCC---CHHHHHHHHHCCCHHHHHHCCCEEEEECCEEEECC--CCCEEEEEEECCCCEEEEEEECCCCCHHH T ss_conf 88877676645---49999999753240233200550588432663202--57525899940896389999546546466 Q ss_pred C-----CCCCCCEEEEEEEEEEECCCCEEEEEEEEEEE------------CCCCHHHHHHHH------------------ Q ss_conf 6-----81459889999996675288437999997101------------680079999999------------------ Q gi|254780791|r 78 F-----LPEEGIEFLVIGKITTFPGSSKYQIIIESLIP------------SGSGTLLTALEK------------------ 122 (529) Q Consensus 78 ~-----~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~------------~g~G~l~~~~e~------------------ 122 (529) | ...-||-|-+.|.+ +..+.|.+++.|+.+.. .|+-|...++.+ T Consensus 105 ~~~~~~~~dlGDiigv~G~~-~~T~~GelSv~v~~~~lLsKsL~pLPeK~hgL~D~E~RyR~RylDLi~N~e~r~~f~~R 183 (502) T COG1190 105 FEALFKKLDLGDIIGVEGPL-FKTKTGELSVSVEELRLLSKSLRPLPEKFHGLTDKEIRYRQRYLDLIVNPESRQTFIKR 183 (502) T ss_pred HHHHHHCCCCCCEEEEEEEE-EECCCCCEEEEEEEEEEECCCCCCCCHHHCCCCCHHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 88886326657888664205-65489846899877756202077797454378638889998888986599999999999 Q ss_pred ------HHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHC-CCEEEEEEE------CCCCCCCHHHH Q ss_conf ------99976540122610016310265289998478425899999986305-975899972------10011110367 Q gi|254780791|r 123 ------RKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCR-FPLRVIIFP------VKVQGDECPKE 189 (529) Q Consensus 123 ------lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r-~p~~~~~~p------~~vQG~~a~~~ 189 (529) +++-|..+|.++-+. |. +=+=++||+-+=|.+..+.- -.+-+.+.| ..|=|-+-.-| T Consensus 184 s~ii~~iR~fl~~~gFlEVET----P~------lq~i~GGA~ArPF~ThhNald~dlyLRIApELyLKRliVGG~erVfE 253 (502) T COG1190 184 SKIIRAIREFLDDRGFLEVET----PM------LQPIPGGAAARPFITHHNALDMDLYLRIAPELYLKRLIVGGFERVFE 253 (502) T ss_pred HHHHHHHHHHHHHCCCEEECC----CC------CCCCCCCCCCCCCEEEECCCCCCEEEEECCHHHHHHHHHCCCHHHEE T ss_conf 999999999998779758416----00------03557873226501220236775687624189999887537214223 Q ss_pred HHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHC--CCEEEEEECCCCCCH Q ss_conf 99999999741003576777589995168884442200769999999748--904885205777525 Q gi|254780791|r 190 IANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIANS--SIPIISAIGHETDWT 254 (529) Q Consensus 190 i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~--~iPVisgIGHE~D~T 254 (529) |=+..+ |.--+..+.+.|..|=.==-=...+|+|.+-+ +|.+.++.. ....|.==|++.|++ T Consensus 254 Igr~FR--NEGid~tHNPEFTmlE~Y~AYaDy~D~m~ltE-~Li~~~a~~v~gt~~v~y~~~~id~~ 317 (502) T COG1190 254 IGRNFR--NEGIDTTHNPEFTMLEFYQAYADYEDLMDLTE-ELIKELAKEVNGTTKVTYGGQEIDFS 317 (502) T ss_pred ECCCCC--CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHCCCEEEEECCEEEECC T ss_conf 055200--37876664842356899999857999999999-99999999954973787899767548 No 197 >PRK09701 D-allose transporter subunit; Provisional Probab=78.87 E-value=5.3 Score=20.16 Aligned_cols=84 Identities=17% Similarity=0.128 Sum_probs=52.9 Q ss_pred CCCCEEEEEECCCHHHHHHHHHHH----HHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCC Q ss_conf 265289998478425899999986----3059758999721001111036799999999741003576777589995168 Q gi|254780791|r 143 FIPKIIAVITSPTGAVIRDILQRI----SCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGG 218 (529) Q Consensus 143 ~~p~~i~vits~~~a~~~D~~~~~----~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGG 218 (529) .-..+||||++..+.-|++-++.= ....++++.++...-+|+-+. + ++.|+.+-. ..+|.|||. T Consensus 22 ~a~~~iavi~k~~~npf~~~~~~Ga~~aAk~~G~~v~v~~~~~~~D~~~-Q-i~~Ie~~I~-------~gvdaIii~--- 89 (311) T PRK09701 22 FAAAEYAVVLKTLSNPFWVDMKKGIEDEAKTLGVSVDIFASPSEGDFQS-Q-LQLFEDLSN-------KNYKGIAFA--- 89 (311) T ss_pred HHCCCEEEEECCCCCHHHHHHHHHHHHHHHHHCCEEEEEECCCCCCHHH-H-HHHHHHHHH-------CCCCEEEEE--- T ss_conf 4368499996888998999999999999998699799992798789999-9-999999997-------599999991--- Q ss_pred CCHHHHHHCCHH---HHHHHHHHCCCEEEE Q ss_conf 884442200769---999999748904885 Q gi|254780791|r 219 GSIEDLWHFNDE---MIVRAIANSSIPIIS 245 (529) Q Consensus 219 GS~eDL~~FN~e---~laraI~~~~iPVis 245 (529) +-|.. ..++...+..||||+ T Consensus 90 -------p~d~~a~~~~i~~A~~aGIpVV~ 112 (311) T PRK09701 90 -------PLSSVNLVMPVARAWKKGIYLVN 112 (311) T ss_pred -------CCCHHHHHHHHHHHHHCCCCEEE T ss_conf -------89877889999999977991896 No 198 >PRK03378 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=78.77 E-value=5.1 Score=20.34 Aligned_cols=98 Identities=15% Similarity=0.189 Sum_probs=55.7 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHC---CCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC Q ss_conf 65289998478425899999986305---975899972100111103679999999974100357677758999516888 Q gi|254780791|r 144 IPKIIAVITSPTGAVIRDILQRISCR---FPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGS 220 (529) Q Consensus 144 ~p~~i~vits~~~a~~~D~~~~~~~r---~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS 220 (529) -.++||||+-++..........+.+. -.+++++-+-.-.--+.+..-...+..+ ...+|++|..=|-|. T Consensus 4 ~fk~IgIi~k~~~~~a~~~~~~l~~~L~~~g~~v~ld~~~a~~l~~~~~~~~~~~~~--------~~~~Dlvi~lGGDGT 75 (292) T PRK03378 4 HFKCIGIVGHPRHPTALTTHEMLYRWLCTKGYEVIVEQQIAHELQLKNVKTGTLAEI--------GQQADLAVVVGGDGN 75 (292) T ss_pred CCCEEEEEEECCCHHHHHHHHHHHHHHHHCCCEEEECHHHHHHCCCCCCCCCCHHHH--------CCCCCEEEEECCCHH T ss_conf 698999995279858999999999999978998998347876529986554786685--------305689999777889 Q ss_pred HHHHHHCCHHHHHHHHHHCCCEEEE----EECCCCCCHHHH Q ss_conf 4442200769999999748904885----205777525898 Q gi|254780791|r 221 IEDLWHFNDEMIVRAIANSSIPIIS----AIGHETDWTLAD 257 (529) Q Consensus 221 ~eDL~~FN~e~laraI~~~~iPVis----gIGHE~D~Tl~D 257 (529) +- ..+|..+...+||+. -+|-=+|+..-| T Consensus 76 ~L--------~aar~~~~~~~PilGIN~G~lGFLt~~~~~~ 108 (292) T PRK03378 76 ML--------GAARTLARYDIKVIGINRGNLGFLTDLDPDN 108 (292) T ss_pred HH--------HHHHHHCCCCCCEEEEECCCCEECCCCCHHH T ss_conf 99--------9999854369968988379836888678789 No 199 >cd06299 PBP1_LacI_like_13 Ligand-binding domain of DNA-binding regulatory protein from Corynebacterium glutamicum which has a unique ability to produce significant amounts of L-glutamate directly from cheap sugar and ammonia. This group includes the ligand-binding domain of DNA-binding regulatory protein from Corynebacterium glutamicum which has a unique ability to produce significant amounts of L-glutamate directly from cheap sugar and ammonia. This regulatory protein is a member of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial trans Probab=78.15 E-value=5.5 Score=20.00 Aligned_cols=21 Identities=10% Similarity=0.276 Sum_probs=11.7 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 88889862299999999999740 Q gi|254780791|r 8 NSLDHPEYSVSELSYHLKHIVES 30 (529) Q Consensus 8 ~~~~~~~~svs~l~~~i~~~l~~ 30 (529) |+.++|- -+++...|...+.. T Consensus 7 p~l~npf--f~~~~~gi~~~~~~ 27 (265) T cd06299 7 PDIRNPY--FASLATAIQDAASA 27 (265) T ss_pred CCCCCHH--HHHHHHHHHHHHHH T ss_conf 8987899--99999999999998 No 200 >PRK10942 serine endoprotease; Provisional Probab=77.92 E-value=5.6 Score=19.95 Aligned_cols=21 Identities=29% Similarity=0.366 Sum_probs=9.3 Q ss_pred HCCCEEEEECCCCCEECCHHHC Q ss_conf 3146199984898895777892 Q gi|254780791|r 458 LKRGYTSIQDTNNNFITQKRNL 479 (529) Q Consensus 458 L~RGYaiv~~~~GkiI~s~~~l 479 (529) |.+|-.|+. -|++.|++++++ T Consensus 427 Lr~GDVI~~-VN~~~V~s~~dl 447 (474) T PRK10942 427 LKKGDVIIG-ANQQPVKNIAEL 447 (474) T ss_pred CCCCCEEEE-ECCEECCCHHHH T ss_conf 999988997-799884999999 No 201 >PRK10017 putative pyruvyl transferase; Provisional Probab=77.92 E-value=4.3 Score=20.97 Aligned_cols=36 Identities=22% Similarity=0.215 Sum_probs=12.3 Q ss_pred CCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEE Q ss_conf 7589995168884442200769999999748904885 Q gi|254780791|r 209 PDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIIS 245 (529) Q Consensus 209 ~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVis 245 (529) +|++|-+ ||+-+.|+........+.+..-.+-||+- T Consensus 118 ~D~vIs~-GGs~~~D~yg~~~~~~~L~a~l~kKpv~~ 153 (426) T PRK10017 118 YDAIIQV-GGSFFVDLYGVPQFEHALCTFMAKKPLFM 153 (426) T ss_pred CCEEEEE-CCCCCCCCCCCCHHHHHHHHHHCCCCEEE T ss_conf 4789971-77620147685216899999973996899 No 202 >PRK06494 enoyl-CoA hydratase; Provisional Probab=77.85 E-value=5.6 Score=19.93 Aligned_cols=73 Identities=21% Similarity=0.266 Sum_probs=40.6 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC--H---HHH----------HHCCHHHHHHHHHHCCCEEEEEE---- Q ss_conf 3679999999974100357677758999516888--4---442----------20076999999974890488520---- Q gi|254780791|r 187 PKEIANAILQLNTLKEGRTCPRPDIIILARGGGS--I---EDL----------WHFNDEMIVRAIANSSIPIISAI---- 247 (529) Q Consensus 187 ~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS--~---eDL----------~~FN~e~laraI~~~~iPVisgI---- 247 (529) ..++..+++.++... .+-||| .+|.|. + -|| |..+......+++.||.|||++| T Consensus 33 ~~el~~al~~~~~d~------~vrvvv-l~g~G~~~FcaG~Dl~~~~~~~~~~~~~~~~~~~~~~~~~~kPvIaaV~G~a 105 (259) T PRK06494 33 HFELEKVFDDFAADP------EQWVAI-VTGAGDKAFSAGNDLKWQAAGGKRGWPESGFAGLTSRFDLDKPIIAAVNGVA 105 (259) T ss_pred HHHHHHHHHHHHHCC------CCEEEE-EECCCCCCEECCCCHHHHHHCCCCCCCHHHHHHHHHHHCCCCCEEEEECCEE T ss_conf 999999999986399------935999-9758998157787767664224456526678999999619998899970658 Q ss_pred -CCCCCCHHHHHHHCCCCCCCHH Q ss_conf -5777525898864123777214 Q gi|254780791|r 248 -GHETDWTLADYAADLRAPTPTG 269 (529) Q Consensus 248 -GHE~D~Tl~D~VAD~Ra~TPTa 269 (529) |.-.... +.+|+|..+|.+ T Consensus 106 ~GgG~~la---l~cD~~ias~~a 125 (259) T PRK06494 106 MGGGFEIA---LACDLIIAAENA 125 (259) T ss_pred EECCCEEE---HHHCCCCCCCCC T ss_conf 62543000---210105667783 No 203 >PTZ00059 dynein light chain; Provisional Probab=77.75 E-value=5.2 Score=20.24 Aligned_cols=48 Identities=13% Similarity=0.409 Sum_probs=35.2 Q ss_pred HHHHHHHHHHHHHCCCCEE--EEEEECCCCC--CC-CCCEEEEEEECCCCEEEEEEECC Q ss_conf 9999999999740017189--9999705435--68-88627999874894799999735 Q gi|254780791|r 18 SELSYHLKHIVESNLSHVC--VRGEISGYRG--IH-SSGHAYFSLKDNHSRIDAIIWKG 71 (529) Q Consensus 18 s~l~~~i~~~l~~~~~~~~--v~gEis~~~~--~~-~sGH~Yf~lkd~~a~i~~~~~~~ 71 (529) .++..+||.-++.-|+..| |+|+ ||-+ .| .+.++||.+ ..+...+||+ T Consensus 37 ~diA~~IK~~fD~kyg~~WhcVVGk--~FGs~Vthe~~~fiyf~~----g~~~vllfKs 89 (90) T PTZ00059 37 KDIAAYIKKEFDKKYNPTWHCIVGR--NFGSYVTHETKHFIYFYM----GQVAILLFKS 89 (90) T ss_pred HHHHHHHHHHHHCCCCCEEEEEECC--CCEEEEEEECCCEEEEEE----CCEEEEEEEC T ss_conf 9999999999757449924999967--704689980585899997----9999999954 No 204 >PRK00409 recombination and DNA strand exchange inhibitor protein; Reviewed Probab=77.69 E-value=5.7 Score=19.89 Aligned_cols=22 Identities=27% Similarity=0.314 Sum_probs=17.1 Q ss_pred CCEEEEEECCCHHHHHHHHHHH Q ss_conf 5289998478425899999986 Q gi|254780791|r 145 PKIIAVITSPTGAVIRDILQRI 166 (529) Q Consensus 145 p~~i~vits~~~a~~~D~~~~~ 166 (529) ..++-|||.|+..|--=.|+|+ T Consensus 324 ~~~~liITGPNtGGKTv~LKtv 345 (780) T PRK00409 324 DKTVLVITGPNTGGKTVTLKTL 345 (780) T ss_pred CCEEEEEECCCCCCCEEHHHHH T ss_conf 8418999689888856379999 No 205 >COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning] Probab=77.65 E-value=5.7 Score=19.89 Aligned_cols=32 Identities=9% Similarity=0.006 Sum_probs=16.1 Q ss_pred CCHHHHHCCCEEEEECCCCCEECCHHHCCCCC Q ss_conf 69677731461999848988957778929998 Q gi|254780791|r 452 FAYKNTLKRGYTSIQDTNNNFITQKRNLATKT 483 (529) Q Consensus 452 lsP~~~L~RGYaiv~~~~GkiI~s~~~l~~gd 483 (529) .+|..+|.-|-.|.-.+.||.+.+...++.|+ T Consensus 1040 ~~~dd~l~~Giei~a~ppgK~~~~l~~LSGGE 1071 (1163) T COG1196 1040 TEPDDPLTAGIEISARPPGKKLQSLSLLSGGE 1071 (1163) T ss_pred CCCCCHHHCCCEEEEECCCCCCCCCCCCCCCH T ss_conf 89986232595899738998656212148618 No 206 >pfam02016 Peptidase_S66 LD-carboxypeptidase. Muramoyl-tetrapeptide carboxypeptidase hydrolyses a peptide bond between a di-basic amino acid and the C-terminal D-alanine in the tetrapeptide moiety in peptidoglycan. This cleaves the bond between an L- and a D-amino acid. The function of this activity is in murein recycling. This family also includes the microcin c7 self-immunity protein. This family corresponds to Merops family S66. Probab=77.58 E-value=5.7 Score=19.87 Aligned_cols=86 Identities=22% Similarity=0.293 Sum_probs=52.3 Q ss_pred HHHHHCC-C-E-EEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC-HHHHHHCCHHHHHHHHH-H Q ss_conf 9863059-7-5-899972100111103679999999974100357677758999516888-44422007699999997-4 Q gi|254780791|r 164 QRISCRF-P-L-RVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGS-IEDLWHFNDEMIVRAIA-N 238 (529) Q Consensus 164 ~~~~~r~-p-~-~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS-~eDL~~FN~e~laraI~-~ 238 (529) +.+-..| | + .-+||==-| ++.+..|-+.|..+... +..+++-.||++|=.+- .++-+.+.-+++.+.++ . T Consensus 178 ~l~GT~y~p~~~g~ILflEdv--~e~~y~idR~L~~L~~a---G~f~~~~Gii~G~f~~~~~~~~~~~~~~~vl~~~~~~ 252 (281) T pfam02016 178 SLLGTPYLPDTDGKILFLEDV--GEAPYRIDRMLTQLKLA---GILDKVAGIILGRFTDCDDDEDYGDTLEEVLREVLGD 252 (281) T ss_pred HHCCCCCCCCCCCCEEEEEEC--CCCHHHHHHHHHHHHHH---CCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 663886566878838999807--99999999999999981---9744572899934777889865344299999998752 Q ss_pred CCCEEEEE--ECCCCCCH Q ss_conf 89048852--05777525 Q gi|254780791|r 239 SSIPIISA--IGHETDWT 254 (529) Q Consensus 239 ~~iPVisg--IGHE~D~T 254 (529) ..+||+.+ +||-.+.. T Consensus 253 ~~iPv~~~~~~GH~~~~~ 270 (281) T pfam02016 253 LGIPVLYGLPFGHTPPNL 270 (281) T ss_pred CCCCEEECCCCCCCCCCE T ss_conf 798589899988999874 No 207 >PRK07854 enoyl-CoA hydratase; Provisional Probab=77.41 E-value=5.8 Score=19.83 Aligned_cols=39 Identities=31% Similarity=0.506 Sum_probs=26.8 Q ss_pred HHHHHHHHHCCCEEEEEE-CCCC--CCHHHHHHHCCCCCCCHH Q ss_conf 999999974890488520-5777--525898864123777214 Q gi|254780791|r 230 EMIVRAIANSSIPIISAI-GHET--DWTLADYAADLRAPTPTG 269 (529) Q Consensus 230 e~laraI~~~~iPVisgI-GHE~--D~Tl~D~VAD~Ra~TPTa 269 (529) ..+.++|.+||.|||++| ||=. =.. .=+.+|+|..+++| T Consensus 76 ~~~~~~i~~~~kPvIAav~G~a~GgG~~-lal~cD~ria~~~a 117 (243) T PRK07854 76 IELLHAIDAAPVPVIAAINGPAIGAGLQ-LALACDLRVVAPEA 117 (243) T ss_pred HHHHHHHHHCCCCEEEEECCEECCCHHH-HHHCCCEEEECCCC T ss_conf 9999999858998899975832335157-88609999964776 No 208 >PRK02155 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=77.24 E-value=5.8 Score=19.80 Aligned_cols=84 Identities=17% Similarity=0.230 Sum_probs=46.4 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHC---CCEEEEEEECCCC--CCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCC Q ss_conf 65289998478425899999986305---9758999721001--111036799999999741003576777589995168 Q gi|254780791|r 144 IPKIIAVITSPTGAVIRDILQRISCR---FPLRVIIFPVKVQ--GDECPKEIANAILQLNTLKEGRTCPRPDIIILARGG 218 (529) Q Consensus 144 ~p~~i~vits~~~a~~~D~~~~~~~r---~p~~~~~~p~~vQ--G~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGG 218 (529) -.++||||+.+.........+.+.+- -.+++++-+...+ |..--... -.+.+ . ..+|++|..=|- T Consensus 4 ~fk~Vgiv~k~~~~~~~~~~~~l~~~L~~~g~~v~~e~~~a~~~~~~~~~~~--~~~~~---~-----~~~Dlvi~lGGD 73 (291) T PRK02155 4 QFRTVALVGRYQTPGIAEPLEALAACIAKRGFEVVFEADTARNTGLTGYPAL--TPAEI---G-----ARADVAVVLGGD 73 (291) T ss_pred CCCEEEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEEHHHHHHCCCCCCCCC--CHHHH---C-----CCCCEEEEECCC T ss_conf 7989999914898689999999999999788999995557776399887747--97994---6-----376789997678 Q ss_pred CCHHHHHHCCHHHHHHHHHHCCCEEEE Q ss_conf 884442200769999999748904885 Q gi|254780791|r 219 GSIEDLWHFNDEMIVRAIANSSIPIIS 245 (529) Q Consensus 219 GS~eDL~~FN~e~laraI~~~~iPVis 245 (529) |.+- ..+|..+...+||+. T Consensus 74 GTlL--------~~a~~~~~~~~PilG 92 (291) T PRK02155 74 GTML--------GIGRQLAPYGTPLIG 92 (291) T ss_pred HHHH--------HHHHHHHCCCCCEEE T ss_conf 8999--------999987115996899 No 209 >PRK07396 naphthoate synthase; Validated Probab=77.11 E-value=5.9 Score=19.77 Aligned_cols=42 Identities=19% Similarity=0.244 Sum_probs=27.8 Q ss_pred CHHHHHHHHHHCCCEEEEEE-CCCCCC-HHHHHHHCCCCCCCHH Q ss_conf 76999999974890488520-577752-5898864123777214 Q gi|254780791|r 228 NDEMIVRAIANSSIPIISAI-GHETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 228 N~e~laraI~~~~iPVisgI-GHE~D~-Tl~D~VAD~Ra~TPTa 269 (529) |-..+.++|..|+.|||.+| ||=.=- .-+=+.+|+|..+++| T Consensus 95 ~~~~~~~~i~~~~kPvIAaV~G~a~GgG~~lalacD~~ias~~A 138 (273) T PRK07396 95 NVLDLQRLIRTCPKPVIAMVAGYAIGGGHVLHLVCDLTIAADNA 138 (273) T ss_pred HHHHHHHHHHHCCCCEEEEECCEEEEHHHHHHHHCCCCEECCCC T ss_conf 08999999982999999998587760769999852801223798 No 210 >PRK12445 lysyl-tRNA synthetase; Reviewed Probab=77.05 E-value=5.9 Score=19.76 Aligned_cols=202 Identities=18% Similarity=0.199 Sum_probs=109.1 Q ss_pred CEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCC-----CCCCCCEEEEEEEEEEECCCCEEEEEEEEE Q ss_conf 189999970543568886279998748947999997352105866-----814598899999966752884379999971 Q gi|254780791|r 34 HVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEF-----LPEEGIEFLVIGKITTFPGSSKYQIIIESL 108 (529) Q Consensus 34 ~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~-----~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i 108 (529) .|.|.|=|.+.|. .+|-++++|.|..+.|.|++-+.....-.| ...-|+-|-+.|.+. -.+.|.+.+.|..+ T Consensus 67 ~V~vaGrv~~~R~--~Gk~~F~~LrD~~G~iQ~~~~~~~~~~~~~~~~~~~~~~gdii~v~G~~~-~t~tGe~~i~~~~~ 143 (505) T PRK12445 67 EVSVAGRMMTRRI--MGKASFVTLQDVGGRIQLYVARDSLPEGVYNDQFKKWDLGDIIGARGTLF-KTQTGELSIHCTEL 143 (505) T ss_pred EEEEEEEEEEEEC--CCCEEEEEEEECCEEEEEEEECCCCCHHHHHHHHHHCCCCEEEEEEEEEE-CCCCCCCCEEEEEE T ss_conf 8999988987874--79839999985984399999588686689999975121534999963660-47877621257899 Q ss_pred EEC------------CCCHHHHHH------------------------HHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEE Q ss_conf 016------------800799999------------------------99999765401226100163102652899984 Q gi|254780791|r 109 IPS------------GSGTLLTAL------------------------EKRKKKLLEEGLFSDQHKNPIPFIPKIIAVIT 152 (529) Q Consensus 109 ~~~------------g~G~l~~~~------------------------e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vit 152 (529) ..- |+-+-..++ ..+++-|...|+.+-+. |.+ -+ T Consensus 144 ~ilsksl~plP~k~~g~~d~e~R~r~RyLDLir~~~~~~~~~~Rs~ii~~iR~~l~~~gF~EVeT----Pil------~~ 213 (505) T PRK12445 144 RLLTKALRPLPDKFHGLQDQEVRYRQRYLDLIANDKSRQTFVVRSKILAAIRQFMVARGFMEVET----PMM------QV 213 (505) T ss_pred EEECCCCCCCCCCCCCCCCHHHHHHHHEECCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEC----CCC------CC T ss_conf 99347778989745578788776341111436599999999999999999999999779389978----875------55 Q ss_pred CCCHHHHHHHHHHHHHCCCEEEE--EEE------CCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHH Q ss_conf 78425899999986305975899--972------1001111036799999999741003576777589995168884442 Q gi|254780791|r 153 SPTGAVIRDILQRISCRFPLRVI--IFP------VKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDL 224 (529) Q Consensus 153 s~~~a~~~D~~~~~~~r~p~~~~--~~p------~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL 224 (529) .+.||+-+||+.-... ++.+.+ +.| ..|=|-+-.-+|....+-=+. ...+.+.|=-|=+=......+|+ T Consensus 214 ~~gGa~Arpf~t~~~~-~~~~~yL~qSPQLykk~LmvgGfdRvfeI~r~FRnE~~--~~~H~PEFT~lE~e~af~d~~dv 290 (505) T PRK12445 214 IPGGASARPFITHHNA-LDLDMYLRIAPELYLKRLVVGGFERVFEINRNFRNEGI--SVRHNPEFTMMELYMAYADYHDL 290 (505) T ss_pred CCCCCCCCEEEECCCC-CCCCEEECCCCHHHHHHHHHCCCCCEEEHHHHHHCCCC--CCCCCHHHHHHHHHHHCCCHHHH T ss_conf 4587666103304667-88674341693999999985797445762787607888--86654445435687632789899 Q ss_pred HHCCHHHHHHHHHH--CCCEEEEEECCCCC Q ss_conf 20076999999974--89048852057775 Q gi|254780791|r 225 WHFNDEMIVRAIAN--SSIPIISAIGHETD 252 (529) Q Consensus 225 ~~FN~e~laraI~~--~~iPVisgIGHE~D 252 (529) |-+-++- .+.|+. +..+.++==++..| T Consensus 291 m~l~E~l-i~~i~~~v~g~~~i~y~~~~~d 319 (505) T PRK12445 291 IELTESL-FRTLAQEVLGTTKVTYGEHVFD 319 (505) T ss_pred HHHHHHH-HHHHHHHHCCCCEEEECCCHHC T ss_conf 9999999-9999998448835866520120 No 211 >PRK04759 consensus Probab=77.00 E-value=5.9 Score=19.75 Aligned_cols=87 Identities=18% Similarity=0.239 Sum_probs=51.8 Q ss_pred CCCCEEEEEECCCHHHHHHHHHHHHH---CCCEEEEEEECCCCCC-CHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCC Q ss_conf 26528999847842589999998630---5975899972100111-1036799999999741003576777589995168 Q gi|254780791|r 143 FIPKIIAVITSPTGAVIRDILQRISC---RFPLRVIIFPVKVQGD-ECPKEIANAILQLNTLKEGRTCPRPDIIILARGG 218 (529) Q Consensus 143 ~~p~~i~vits~~~a~~~D~~~~~~~---r~p~~~~~~p~~vQG~-~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGG 218 (529) ..+++||||+-+......+....+.+ .-++++++-+...+.- +.+.+-...+..+ ...+|++|..=|- T Consensus 3 ~~f~~I~Iv~k~~~~~~~~~~~~l~~~L~~~g~~v~vd~~~~~~l~~~~~~~~~~~~~l--------~~~~Dlvi~lGGD 74 (294) T PRK04759 3 KPFNVIAIIGKPRDQQAIQTHKELYHWLTSLGYTVFIDDRLAAILTDVPQEHFASLVEL--------GKKADLAIVVGGD 74 (294) T ss_pred CCCCEEEEEEECCCHHHHHHHHHHHHHHHHCCCEEEECHHHHHHCCCCCCCCCCCHHHC--------CCCCCEEEEECCC T ss_conf 99979999941799799999999999998689999991688655334772223775563--------7665689998478 Q ss_pred CCHHHHHHCCHHHHHHHHHHCCCEEEE Q ss_conf 884442200769999999748904885 Q gi|254780791|r 219 GSIEDLWHFNDEMIVRAIANSSIPIIS 245 (529) Q Consensus 219 GS~eDL~~FN~e~laraI~~~~iPVis 245 (529) |.+- ..+|......+||+. T Consensus 75 GTlL--------~aar~~~~~~~Pilg 93 (294) T PRK04759 75 GNML--------GAARVLSRFDISVIG 93 (294) T ss_pred HHHH--------HHHHHHCCCCCCEEE T ss_conf 5899--------999986016996898 No 212 >pfam05622 HOOK HOOK protein. This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organisms. The different members of the human gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three human HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas the central coiled-coil motif mediates homodimerization and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes, whereas both HOOK1 and HOOK2 are localized to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagel Probab=76.99 E-value=5.9 Score=19.74 Aligned_cols=62 Identities=18% Similarity=0.169 Sum_probs=25.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHH Q ss_conf 678888776777778875026889999999999999999999999--99999999999999887 Q gi|254780791|r 389 RYIFLEKKEKIAILHMLREQTKNRIFYLHTHIKKLITRIEFILSH--KIKSCHTSVSITTRILQ 450 (529) Q Consensus 389 ~~~~~~~~~~~~~l~~l~~~~~~~l~~~~~~l~~l~~rL~~~~~~--~L~~~~~rL~~l~~~L~ 450 (529) +......++....++........+..........+..++..+... ++...+..|......|. T Consensus 362 k~Qle~yk~qi~eL~~~l~e~~~~~~~lef~~~~l~ek~~~l~~eke~l~~erd~Lre~~~eL~ 425 (713) T pfam05622 362 RGQLETYKRQVQELHAKLSEESKKADKLEFEYKRLEEKLEALQKEKERLLAERDSLRETNEELR 425 (713) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8899999999999999999999889999999999999999999989999999999999999988 No 213 >PRK11281 potassium efflux protein KefA; Provisional Probab=76.40 E-value=6.1 Score=19.62 Aligned_cols=22 Identities=5% Similarity=-0.049 Sum_probs=9.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999999887169 Q gi|254780791|r 432 SHKIKSCHTSVSITTRILQSFA 453 (529) Q Consensus 432 ~~~L~~~~~rL~~l~~~L~sls 453 (529) ++.+++..+....+.+++..+. T Consensus 311 k~~ld~l~Qt~rni~EQi~~L~ 332 (1107) T PRK11281 311 KNWLDRLLQSERNIKEQISVLQ 332 (1107) T ss_pred HHHHHHHHHHHHHHHHHHHHHC T ss_conf 9999999999999999987513 No 214 >COG2176 PolC DNA polymerase III, alpha subunit (gram-positive type) [DNA replication, recombination, and repair] Probab=76.30 E-value=6.1 Score=19.60 Aligned_cols=79 Identities=15% Similarity=0.020 Sum_probs=54.3 Q ss_pred CCEEEEEEECCCCC-CCCCC-EE-EEEEECCCCEEEEEEECCCCCC--CCCCCCCCCEEEEEEEEEEECCCCEEEEEEEE Q ss_conf 71899999705435-68886-27-9998748947999997352105--86681459889999996675288437999997 Q gi|254780791|r 33 SHVCVRGEISGYRG-IHSSG-HA-YFSLKDNHSRIDAIIWKGTLNK--IEFLPEEGIEFLVIGKITTFPGSSKYQIIIES 107 (529) Q Consensus 33 ~~~~v~gEis~~~~-~~~sG-H~-Yf~lkd~~a~i~~~~~~~~~~~--~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~ 107 (529) .+|.|+|+|.+..- .-+|| |+ -|.+.|..+++.|-||...-.- .--.++-||=|.+.|.|..=.-.+..+++|.+ T Consensus 240 ~~v~v~G~IF~~e~~~~ksGr~l~~i~vTD~t~Sl~~k~f~~~~ed~~~~~~ik~g~wvk~~g~v~~d~f~~~l~m~i~~ 319 (1444) T COG2176 240 TRVKVEGYIFKIEIKELKSGRTLLNIKVTDYTSSLILKKFLRDEEDEKKFDGIKKGMWVKARGNVQLDTFTRDLTMIIND 319 (1444) T ss_pred CCEEEEEEEEEEEEEECCCCCEEEEEEEECCCHHEEEHHHCCCCCCHHHHHHCCCCCEEEEEEEEEECCCCCCEEEEHHH T ss_conf 10589999988764322468679999986271003202122666318888623338479999788733555551787533 Q ss_pred EEEC Q ss_conf 1016 Q gi|254780791|r 108 LIPS 111 (529) Q Consensus 108 i~~~ 111 (529) |.+- T Consensus 320 I~ei 323 (1444) T COG2176 320 INEI 323 (1444) T ss_pred HHHH T ss_conf 1343 No 215 >TIGR01405 polC_Gram_pos DNA polymerase III, alpha subunit, Gram-positive type; InterPro: IPR006308 These are the polypeptide chains of DNA polymerase III. Full-length homologs of this protein are restricted to the Gram-positive lineages, including the Mycoplasmas. This protein is designated alpha chain and given the gene symbol polC, but is not a full-length homolog of other polC genes. The N-terminal region of about 200 amino acids is rich in low-complexity sequence and poorly alignable. ; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication, 0005737 cytoplasm. Probab=76.29 E-value=6.1 Score=19.60 Aligned_cols=80 Identities=21% Similarity=0.089 Sum_probs=58.4 Q ss_pred CCCEEEEEEECCCCCC-CCCC-EEE-EEEECCCC--EEEEEEECCCCC-CC-CC-CCCCCCEEEEEEEEEEECCCCEEEE Q ss_conf 1718999997054356-8886-279-99874894--799999735210-58-66-8145988999999667528843799 Q gi|254780791|r 32 LSHVCVRGEISGYRGI-HSSG-HAY-FSLKDNHS--RIDAIIWKGTLN-KI-EF-LPEEGIEFLVIGKITTFPGSSKYQI 103 (529) Q Consensus 32 ~~~~~v~gEis~~~~~-~~sG-H~Y-f~lkd~~a--~i~~~~~~~~~~-~~-~~-~~~~G~~v~~~g~~~~y~~~g~~ql 103 (529) -.++.|+|+|.+.... -+|| ++| |.+-|... ++.|-+|.-... -. .| .++-|+-|.|.|.|..=.-.+..++ T Consensus 7 e~~~~~~G~iF~~e~~~~ksgr~~~~i~~tD~~~GGSl~~k~F~~~~~eD~~~~~~~K~~~wv~~~g~~~~~~~~~~~~~ 86 (1264) T TIGR01405 7 ENRVKVEGVIFKIEIKELKSGRTLLKIKVTDYTDGGSLILKKFLKSEKEDKEKFDGIKIGKWVRVRGKIELDKFSRDLQM 86 (1264) T ss_pred CCEEEEEEEEEEEECCCCCCCCEEEEEEEEECCCCCEEEEEEECCCCCCHHHHHHHHCCCCCEEEEEEEEEECCCCCCEE T ss_conf 34489999986330233577527999998731788606887604899720577734203773599998875221200058 Q ss_pred EEEEEEEC Q ss_conf 99971016 Q gi|254780791|r 104 IIESLIPS 111 (529) Q Consensus 104 ~v~~i~~~ 111 (529) +|.+|.+- T Consensus 87 ~~~~I~~i 94 (1264) T TIGR01405 87 IIKDIEEI 94 (1264) T ss_pred EEECCCCC T ss_conf 77502124 No 216 >COG1311 HYS2 Archaeal DNA polymerase II, small subunit/DNA polymerase delta, subunit B [DNA replication, recombination, and repair] Probab=75.87 E-value=4.5 Score=20.77 Aligned_cols=36 Identities=28% Similarity=0.456 Sum_probs=31.9 Q ss_pred CEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEEC Q ss_conf 1899999705435688862799987489479999973 Q gi|254780791|r 34 HVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWK 70 (529) Q Consensus 34 ~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~ 70 (529) .+.|.||++..+. ..+||+++++-|....+.|++=+ T Consensus 140 dv~Iig~v~~~r~-t~~gh~ii~~ed~tG~v~vvl~k 175 (481) T COG1311 140 DVKIIGEVNDVRE-TKNGHFIISLEDTTGVVTVVLGK 175 (481) T ss_pred CCEEEEEECCCEE-EECCCEEEECCCCCCEEEEEECC T ss_conf 8479999712203-20464899824656659999616 No 217 >COG5235 RFA2 Single-stranded DNA-binding replication protein A (RPA), medium (30 kD) subunit [DNA replication, recombination, and repair] Probab=75.40 E-value=6.4 Score=19.42 Aligned_cols=134 Identities=16% Similarity=0.154 Sum_probs=82.9 Q ss_pred CCCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCC--CCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEE Q ss_conf 171899999705435688862799987489479999973521058--668145988999999667528843799999710 Q gi|254780791|r 32 LSHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKI--EFLPEEGIEFLVIGKITTFPGSSKYQIIIESLI 109 (529) Q Consensus 32 ~~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~--~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~ 109 (529) ..+|..+|-|-|.+. +.. ...|.+-|..++|.|..|.+...-. .-..++|.-|.|.|++-.|... -.+....|. T Consensus 66 v~~V~fVGvvrni~~-~tt-n~~~~iEDGTG~Ievr~W~~~~~~~e~~~d~~~~~yvkV~G~lk~F~GK--~~I~~~~i~ 141 (258) T COG5235 66 VTNVQFVGVVRNIKT-STT-NSMFVIEDGTGSIEVRFWPGNSYEEEQCKDLEEQNYVKVNGSLKTFNGK--RSISASHIS 141 (258) T ss_pred EEEEEEEEEEEEEEE-CCC-CEEEEEECCCCEEEEEECCCCCHHHHHCCCCCCCCEEEEECCEEEECCE--EEEEHHHEE T ss_conf 866899999984010-345-4189993487428988568872377754343326679994211220780--577410052 Q ss_pred EC--CCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC----CEEEEEECCCHHHHHHHHHHHHHC Q ss_conf 16--8007999999999976540122610016310265----289998478425899999986305 Q gi|254780791|r 110 PS--GSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIP----KIIAVITSPTGAVIRDILQRISCR 169 (529) Q Consensus 110 ~~--g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p----~~i~vits~~~a~~~D~~~~~~~r 169 (529) +- .---++--+|-++.-|-.-|-+.....-..|-+- -+.-=+||..+.-++-.+-...+| T Consensus 142 ~I~d~NeV~~HfLe~I~~Hl~~t~~~~~ple~~~~n~GqSlf~k~dNdtSsgss~lq~~~~~c~~~ 207 (258) T COG5235 142 AIEDSNEVTYHFLECIYQHLFYTRQLQRPLEEEVKNDGQSLFAKLDNDTSSGSSRLQEDILECYRR 207 (258) T ss_pred ECCCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCEEEECCCCCCCCCCCCCHHHHHHHHH T ss_conf 103544557899999999999998861401443689762013212675544555561899999997 No 218 >PRK10936 periplasmic sensory protein associated with the TorRS two-component regulatory system; Provisional Probab=75.16 E-value=6.5 Score=19.37 Aligned_cols=84 Identities=14% Similarity=0.063 Sum_probs=52.5 Q ss_pred CCCEEEEEECCCHHHHHHHHHHH----HHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCC Q ss_conf 65289998478425899999986----30597589997210011110367999999997410035767775899951688 Q gi|254780791|r 144 IPKIIAVITSPTGAVIRDILQRI----SCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGG 219 (529) Q Consensus 144 ~p~~i~vits~~~a~~~D~~~~~----~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGG 219 (529) =|+|||++.+.-...|+--++-= +++.++++.+|.+ +|+. ++...+..++..-. .++|+|||+ T Consensus 44 k~~ki~~~~p~lkd~yW~~v~~G~~~~Ak~lGv~l~v~~a--~~~~---d~~~Q~~qie~~i~----~~vDAIil~---- 110 (340) T PRK10936 44 KPWKLCALYPHLKDSYWLSVNYGLVQEAKRLGVDLKVLEA--GGYY---NLATQQQQLEQCVA----WGADAILLG---- 110 (340) T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHHHHHHHCCEEEEECC--CCCC---CHHHHHHHHHHHHH----CCCCEEEEC---- T ss_conf 7717999806888869999999999999980986999858--9988---99999999999997----599999986---- Q ss_pred CHHHHHHCCHHHHHHHHHHCCCEEEEE Q ss_conf 844422007699999997489048852 Q gi|254780791|r 220 SIEDLWHFNDEMIVRAIANSSIPIISA 246 (529) Q Consensus 220 S~eDL~~FN~e~laraI~~~~iPVisg 246 (529) ..|-. .+......-.||||+= T Consensus 111 -~vd~~-----a~~~~~~~agIPVI~~ 131 (340) T PRK10936 111 -AVTPD-----GPDLQLQAANIPVIAL 131 (340) T ss_pred -CCCCC-----CHHHHHHHCCCCEEEE T ss_conf -78834-----1156898589978999 No 219 >KOG3430 consensus Probab=75.13 E-value=6.4 Score=19.41 Aligned_cols=50 Identities=20% Similarity=0.373 Sum_probs=34.9 Q ss_pred HHHHHHHHHHHHHCCCCEE--EEEEECCCCCCCCCCEE-EEEEECCCCEEEEEEECC Q ss_conf 9999999999740017189--99997054356888627-999874894799999735 Q gi|254780791|r 18 SELSYHLKHIVESNLSHVC--VRGEISGYRGIHSSGHA-YFSLKDNHSRIDAIIWKG 71 (529) Q Consensus 18 s~l~~~i~~~l~~~~~~~~--v~gEis~~~~~~~sGH~-Yf~lkd~~a~i~~~~~~~ 71 (529) .++...||.-++.-|+.-| |+|+=-+..-.|.+||+ ||.+- .+.-.+|+. T Consensus 37 k~iA~~iKkefDkkyG~~WhcivG~~FGs~vThe~g~Fiyf~~g----~l~illfK~ 89 (90) T KOG3430 37 KDIAAFIKKEFDKKYGPTWHCIVGRNFGSYVTHETGHFIYFYLG----VLAILLFKC 89 (90) T ss_pred HHHHHHHHHHHHHHCCCCCEEEECCCCCEEEEEECCCEEEEEEC----EEEEEEEEC T ss_conf 99999999987541299518998686232788603828999963----089999961 No 220 >cd02070 corrinoid_protein_B12-BD B12 binding domain of corrinoid proteins. A family of small methanogenic corrinoid proteins that bind methyl-Co(III) 5-hydroxybenzimidazolylcobamide as a cofactor. They play a role on the methanogenesis from trimethylamine, dimethylamine or monomethylamine, which is initiated by a series of corrinoid-dependent methyltransferases. Probab=74.71 E-value=6.7 Score=19.28 Aligned_cols=109 Identities=25% Similarity=0.274 Sum_probs=65.6 Q ss_pred CCCCEEEEEECCCHHHHHHHHH-HHH---HCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCC Q ss_conf 2652899984784258999999-863---059758999721001111036799999999741003576777589995168 Q gi|254780791|r 143 FIPKIIAVITSPTGAVIRDILQ-RIS---CRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGG 218 (529) Q Consensus 143 ~~p~~i~vits~~~a~~~D~~~-~~~---~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGG 218 (529) .-++...|++.+.|- +||+=. .+. ++++++++...+-| .+.+++++++.- ++|+|.|- T Consensus 79 ~~~~g~vv~~tv~Gd-~H~lG~~mv~~~l~~~G~~V~~LG~~v----p~e~~v~~~~~~----------~~divglS--- 140 (201) T cd02070 79 SAKKGKVVIGTVEGD-IHDIGKNLVATMLEANGFEVIDLGRDV----PPEEFVEAVKEH----------KPDILGLS--- 140 (201) T ss_pred CCCCCEEEEECCCCC-HHHHHHHHHHHHHHHCCCEEEECCCCC----CHHHHHHHHHHH----------CCCEEEEE--- T ss_conf 887865999627987-738999999999987897799778999----979999999972----------98999996--- Q ss_pred CCHHHHHHCCHHH---HHHHHHHC----CCEEEEEECCCCCCHHHHHH-HCCCCCCCHHHHHHH Q ss_conf 8844422007699---99999748----90488520577752589886-412377721456763 Q gi|254780791|r 219 GSIEDLWHFNDEM---IVRAIANS----SIPIISAIGHETDWTLADYA-ADLRAPTPTGAAEMA 274 (529) Q Consensus 219 GS~eDL~~FN~e~---laraI~~~----~iPVisgIGHE~D~Tl~D~V-AD~Ra~TPTaAAEla 274 (529) =++..+-+. +++++-+. .+||+.| |.-.+.-+++.+ ||.=++++..|..++ T Consensus 141 ----~~~~~~~~~~~~~i~~lr~~~~~~~v~i~vG-G~a~~~~~a~~~GAD~~a~da~~av~~a 199 (201) T cd02070 141 ----ALMTTTMGGMKEVIEALKEAGLRDKVKVMVG-GAPVNQEFADEIGADGYAEDAAEAVAIA 199 (201) T ss_pred ----CCCCCCHHHHHHHHHHHHHCCCCCCCEEEEE-CCCCCHHHHHHHCCCEECCCHHHHHHHH T ss_conf ----2566889999999999997289889859998-8017999999929887847999999987 No 221 >cd06298 PBP1_CcpA_like Ligand-binding domain of the catabolite control protein A (CcpA), which functions as the major transcriptional regulator of carbon catabolite repression/regulation. Ligand-binding domain of the catabolite control protein A (CcpA), which functions as the major transcriptional regulator of carbon catabolite repression/regulation (CCR), a process in which enzymes necessary for the metabolism of alternative sugars are inhibited in the presence of glucose. In gram-positive bacteria, CCR is controlled by HPr, a phosphoenolpyruvate:sugar phsophotrasnferase system (PTS) and a transcriptional regulator CcpA. Moreover, CcpA can regulate sporulation and antibiotic resistance as well as play a role in virulence development of certain pathogens such as the group A streptococcus. The ligand binding domain of CcpA is a member of the LacI-GalR family of bacterial transcription regulators. Probab=74.20 E-value=6.8 Score=19.19 Aligned_cols=78 Identities=18% Similarity=0.220 Sum_probs=32.1 Q ss_pred EEEEECC-CHHHHHHHHHHHHH---CCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHH Q ss_conf 9998478-42589999998630---5975899972100111103679999999974100357677758999516888444 Q gi|254780791|r 148 IAVITSP-TGAVIRDILQRISC---RFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIED 223 (529) Q Consensus 148 i~vits~-~~a~~~D~~~~~~~---r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eD 223 (529) ||||.+. +..-+.++++-+.+ ..++.+.++.+ ++....-.+.|+.+... ++|-||+.-... T Consensus 2 IGvivp~l~npf~~~l~~gie~~~~~~gy~~li~~s----~~~~~~e~~~i~~l~~~-------~vDGiI~~~~~~---- 66 (268) T cd06298 2 VGVIIPDITNSYFAELARGIDDIATMYKYNIILSNS----DNDKEKELKVLNNLLAK-------QVDGIIFMGGKI---- 66 (268) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHHHCCCEEEEEEC----CCCHHHHHHHHHHHHHC-------CCCEEEEECCCC---- T ss_conf 899978987989999999999999987998999989----99989999999999966-------999999826779---- Q ss_pred HHHCCHHHHHHHHHHCCCEEEE Q ss_conf 2200769999999748904885 Q gi|254780791|r 224 LWHFNDEMIVRAIANSSIPIIS 245 (529) Q Consensus 224 L~~FN~e~laraI~~~~iPVis 245 (529) +..+.+.+..+.+|||. T Consensus 67 -----~~~~~~~l~~~~~PvV~ 83 (268) T cd06298 67 -----SEEHREEFKRSPTPVVL 83 (268) T ss_pred -----CHHHHHHHHHCCCCEEE T ss_conf -----99999999966998999 No 222 >cd04323 AsnRS_cyto_like_N AsnRS_cyto_like_N: N-terminal, anticodon recognition domain of the type found in human and Saccharomyces cerevisiae cytoplasmic asparaginyl-tRNA synthetase (AsnRS), in Brugia malayai AsnRs and, in various putative bacterial AsnRSs. This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. The enzymes in this group are homodimeric class2b aminoacyl-tRNA synthetases (aaRSs). aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Eukaryotes contain 2 sets of aaRSs, both of which are encoded by the nuclear genome. One set concerns with cytoplasmic synthesis, whereas the other exclusively with Probab=73.18 E-value=7.2 Score=19.00 Aligned_cols=76 Identities=17% Similarity=0.246 Sum_probs=53.6 Q ss_pred EEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCC--CCCCCCCCEEEEEEEEEEECC----CCEEEEEEEEE Q ss_conf 899999705435688862799987489479999973521058--668145988999999667528----84379999971 Q gi|254780791|r 35 VCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKI--EFLPEEGIEFLVIGKITTFPG----SSKYQIIIESL 108 (529) Q Consensus 35 ~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~--~~~~~~G~~v~~~g~~~~y~~----~g~~ql~v~~i 108 (529) |-|.|=|-+.|. .++=.++.|.|..+.+.||+-....... --.+..+.-|.+.|.+.--++ -|+|-+.++.+ T Consensus 2 V~l~GWV~~~R~--~g~l~Fi~LRD~~G~iQ~V~~~~~~~~~~~~~~l~~Es~v~V~G~v~~~~~~~~~~~~~Ei~~~~i 79 (84) T cd04323 2 VKVFGWVHRLRS--QKKLMFLVLRDGTGFLQCVLSKKLVTEFYDAKSLTQESSVEVTGEVKEDPRAKQAPGGYELQVDYL 79 (84) T ss_pred EEEEEEEEEEEC--CCCCEEEEEEECCEEEEEEECCCCCCHHHHHHHCCCCEEEEEEEEEEECCCCCCCCCCEEEEEEEE T ss_conf 999998998623--899289999508879999988865736899973888359999999997897777997599994399 Q ss_pred EECC Q ss_conf 0168 Q gi|254780791|r 109 IPSG 112 (529) Q Consensus 109 ~~~g 112 (529) +.-| T Consensus 80 ~vl~ 83 (84) T cd04323 80 EIIG 83 (84) T ss_pred EEEC T ss_conf 9963 No 223 >PRK10416 cell division protein FtsY; Provisional Probab=73.01 E-value=7.2 Score=18.97 Aligned_cols=22 Identities=14% Similarity=0.093 Sum_probs=8.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9998887789988899998556 Q gi|254780791|r 287 NLEARLNNIIIRLIKYKINTLN 308 (529) Q Consensus 287 ~l~~RL~~a~~~~l~~~~q~L~ 308 (529) .+..||...+.+.-+..-..+. T Consensus 196 g~f~rlk~gL~kt~~~l~~~~~ 217 (499) T PRK10416 196 GFFARLKRSLLKTKENLGSGFI 217 (499) T ss_pred CHHHHHHHHHHHHHHHHHHHHH T ss_conf 5899999899999999999999 No 224 >TIGR03336 IOR_alpha indolepyruvate ferredoxin oxidoreductase, alpha subunit. Indolepyruvate ferredoxin oxidoreductase (IOR) is an alpha 2/beta 2 tetramer related to ketoacid oxidoreductases for pyruvate (1.2.7.1, POR), 2-ketoglutarate (1.2.7.3, KOR), and 2-oxoisovalerate (1.2.7.7, VOR). These multi-subunit enzymes typically are found in anaerobes and are inactiviated by oxygen. IOR in Pyrococcus acts in fermentation of all three aromatic amino acids, following removal of the amino group by transamination. In Methanococcus maripaludis, by contrast, IOR acts in the opposite direction, in pathways of amino acid biosynthesis from phenylacetate, indoleacetate, and p-hydroxyphenylacetate. In M. maripaludis and many other species, iorA and iorB are found next to an apparent phenylacetate-CoA ligase. Probab=72.95 E-value=7.2 Score=18.96 Aligned_cols=79 Identities=18% Similarity=0.286 Sum_probs=49.6 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHC---CC-EEE-EEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCC Q ss_conf 5289998478425899999986305---97-589-997210011110367999999997410035767775899951688 Q gi|254780791|r 145 PKIIAVITSPTGAVIRDILQRISCR---FP-LRV-IIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGG 219 (529) Q Consensus 145 p~~i~vits~~~a~~~D~~~~~~~r---~p-~~~-~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGG 219 (529) ..+|||||| |.+|.|+...+++- .+ +++ ..||-..++ |+.|-+ .+|-|++ T Consensus 230 ~~~iGIIts--G~~y~~v~eAl~~lG~~i~ilKlgm~~PLp~~~----------i~~F~~--------g~d~VlV----- 284 (595) T TIGR03336 230 GAKIGVIAS--GIAYNYVKEALERLGVDVSVLKIGFTYPVPEGL----------VEEFLS--------GVEEVLV----- 284 (595) T ss_pred CCCEEEEEC--CCCHHHHHHHHHHCCCCCCEEEECCCCCCCHHH----------HHHHHH--------CCCEEEE----- T ss_conf 998899922--810899999999729996268843767898899----------999984--------5988999----- Q ss_pred CHHHHHHCCHHHHHHHHHHCCCEEEEEECCC Q ss_conf 8444220076999999974890488520577 Q gi|254780791|r 220 SIEDLWHFNDEMIVRAIANSSIPIISAIGHE 250 (529) Q Consensus 220 S~eDL~~FN~e~laraI~~~~iPVisgIGHE 250 (529) +|.+.+|=|+.|....+.....+ .=+|-. T Consensus 285 -VEE~~p~iE~qik~~~~~~~~~~-~v~GK~ 313 (595) T TIGR03336 285 -VEELEPVVEEQVKALAGTAGLNI-KVHGKE 313 (595) T ss_pred -EECCCHHHHHHHHHHHHHCCCCC-EEECCC T ss_conf -93583658999999975168995-477888 No 225 >pfam01221 Dynein_light Dynein light chain type 1. Probab=72.80 E-value=7.3 Score=18.93 Aligned_cols=48 Identities=13% Similarity=0.385 Sum_probs=34.2 Q ss_pred HHHHHHHHHHHHHCCCCEE--EEEEECCCC---CCCCCCEEEEEEECCCCEEEEEEECC Q ss_conf 9999999999740017189--999970543---56888627999874894799999735 Q gi|254780791|r 18 SELSYHLKHIVESNLSHVC--VRGEISGYR---GIHSSGHAYFSLKDNHSRIDAIIWKG 71 (529) Q Consensus 18 s~l~~~i~~~l~~~~~~~~--v~gEis~~~---~~~~sGH~Yf~lkd~~a~i~~~~~~~ 71 (529) .++..+||.-++..|+..| |+|+ +|- ....+.++||.+ ..+...+||+ T Consensus 33 ~diA~~IK~~fD~kyg~~WhciVG~--~Fgs~vth~~~~~i~f~~----~~~~ill~K~ 85 (86) T pfam01221 33 KDIAAHIKKEFDKKYGPTWHCIVGK--NFGSYVTHETKHFIYFYI----GQLAFLLFKT 85 (86) T ss_pred HHHHHHHHHHHHHHHCCEEEEEECC--CEEEEEEECCCCEEEEEE----CCEEEEEEEC T ss_conf 9999999999742029858999988--626899972796899998----9999999953 No 226 >PRK10422 lipopolysaccharide core biosynthesis protein; Provisional Probab=72.77 E-value=7.3 Score=18.92 Aligned_cols=31 Identities=16% Similarity=0.213 Sum_probs=16.8 Q ss_pred CCCCCEEEEEECCCHHHHHHHH------HHHHHCCC-EEEE Q ss_conf 0265289998478425899999------98630597-5899 Q gi|254780791|r 142 PFIPKIIAVITSPTGAVIRDIL------QRISCRFP-LRVI 175 (529) Q Consensus 142 P~~p~~i~vits~~~a~~~D~~------~~~~~r~p-~~~~ 175 (529) +..||||.||-.. +++|++ +.|+++|| .+|. T Consensus 2 ~~~~kkILIir~~---~iGD~il~tP~i~~Lk~~~P~a~I~ 39 (352) T PRK10422 2 DKPFRRILIIKMR---FHGDMLLTTPVISSLKKNYPDAKID 39 (352) T ss_pred CCCCCEEEEECCC---CHHHHHHHHHHHHHHHHHCCCCEEE T ss_conf 9899779997588---6049999999999999988998899 No 227 >PRK09120 p-hydroxycinnamoyl CoA hydratase/lyase; Validated Probab=72.58 E-value=7.4 Score=18.89 Aligned_cols=76 Identities=18% Similarity=0.250 Sum_probs=37.9 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCC-CH---HHHHHC-----------C-------HHHHHHHHHHCCCEEE Q ss_conf 367999999997410035767775899951688-84---442200-----------7-------6999999974890488 Q gi|254780791|r 187 PKEIANAILQLNTLKEGRTCPRPDIIILARGGG-SI---EDLWHF-----------N-------DEMIVRAIANSSIPII 244 (529) Q Consensus 187 ~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGG-S~---eDL~~F-----------N-------~e~laraI~~~~iPVi 244 (529) ..++..++..++... .+-|||| +|.| ++ -||-.| . -..+.+.+..|+.||| T Consensus 38 ~~eL~~al~~~~~d~------~vrvvVl-tG~G~~FcaG~Dl~e~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~kPvI 110 (277) T PRK09120 38 NREMIDVLDELEFDD------DAGVLVL-TGEGEAWTAGMDLKEYFRETDAQPEILQEKIRRESSGWQWRRLRMYAKPTI 110 (277) T ss_pred HHHHHHHHHHHHHCC------CCEEEEE-ECCCCCEECCCCHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEE T ss_conf 999999999997299------9559999-889997246868899852144663356899999876899999971899999 Q ss_pred EEE-CCCCCCHH-HHHHHCCCCCCCHH Q ss_conf 520-57775258-98864123777214 Q gi|254780791|r 245 SAI-GHETDWTL-ADYAADLRAPTPTG 269 (529) Q Consensus 245 sgI-GHE~D~Tl-~D~VAD~Ra~TPTa 269 (529) .+| ||=.=--+ +=+.+|+|..+++| T Consensus 111 AaVnG~a~GgG~~lalacDi~ia~~~A 137 (277) T PRK09120 111 AMVNGWCFGGGFSPLVACDLAIAADEA 137 (277) T ss_pred EEECCEEEHHHHHHHHHHHHHHHCCCC T ss_conf 998884540779999975111105582 No 228 >PRK03202 6-phosphofructokinase; Provisional Probab=72.53 E-value=7.4 Score=18.88 Aligned_cols=38 Identities=26% Similarity=0.423 Sum_probs=21.2 Q ss_pred CCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHH Q ss_conf 75899951688844422007699999997489048852057775258 Q gi|254780791|r 209 PDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWTL 255 (529) Q Consensus 209 ~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl 255 (529) .|.+|++=|-||......+.+ ++++||| ||---+|-=| T Consensus 95 Id~Li~IGGdgS~~~a~~L~~--------~~~i~vi-gIPkTIDNDl 132 (323) T PRK03202 95 IDALVVIGGDGSYDGAKKLSE--------EYGIPCI-GIPKTIDNDL 132 (323) T ss_pred CCEEEEECCCHHHHHHHHHHH--------CCCCCEE-EECCCCCCCC T ss_conf 999999379469999999984--------3797489-7214446898 No 229 >TIGR03189 dienoyl_CoA_hyt cyclohexa-1,5-dienecarbonyl-CoA hydratase. This enzyme, cyclohexa-1,5-dienecarbonyl-CoA hydratase, also called dienoyl-CoA hydratase, acts on the product of benzoyl-CoA reductase (EC 1.3.99.15). Benzoyl-CoA is a common intermediate in the degradation of many aromatic compounds, and this enzyme is part of an anaerobic pathway for dearomatization and degradation. Probab=72.23 E-value=7.5 Score=18.83 Aligned_cols=40 Identities=25% Similarity=0.339 Sum_probs=25.4 Q ss_pred HHHHHHHHHCCCEEEEEE-CCCCCC-HHHHHHHCCCCCCCHH Q ss_conf 999999974890488520-577752-5898864123777214 Q gi|254780791|r 230 EMIVRAIANSSIPIISAI-GHETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 230 e~laraI~~~~iPVisgI-GHE~D~-Tl~D~VAD~Ra~TPTa 269 (529) ..++.++..||+|||.+| ||-.=- .-+=+.+|+|..++++ T Consensus 79 ~~~~~~~~~~~~PvIaav~G~a~GgG~~lal~~D~~ia~~~a 120 (251) T TIGR03189 79 HKLVIAMLDSPVPILVAVRGQCLGGGLEVAAAGNLMFAAPDA 120 (251) T ss_pred HHHHHHHHHCCCCEEEEEEEEEECCHHHHHHHCCCCEECCCC T ss_conf 999999996899889998227862207889866745234764 No 230 >pfam04539 Sigma70_r3 Sigma-70 region 3. Region 3 forms a discrete compact three helical domain within the sigma-factor. Region is not normally involved in the recognition of promoter DNA, but as some specific bacterial promoters containing an extended -10 promoter element, residues within region 3 play an important role. Region 3 primarily is involved in binding the core RNA polymerase in the holoenzyme. Probab=71.92 E-value=6.4 Score=19.42 Aligned_cols=47 Identities=23% Similarity=0.337 Sum_probs=33.4 Q ss_pred HHHCCHHHHHHHHHHCCCEEE--EEECCCCCCHHHHHHHCCCCCCCHHH Q ss_conf 220076999999974890488--52057775258988641237772145 Q gi|254780791|r 224 LWHFNDEMIVRAIANSSIPII--SAIGHETDWTLADYAADLRAPTPTGA 270 (529) Q Consensus 224 L~~FN~e~laraI~~~~iPVi--sgIGHE~D~Tl~D~VAD~Ra~TPTaA 270 (529) .+-+..+.|-.....+.-|+- +-||++-|.|+.|+++|-.+++|-.+ T Consensus 29 ~l~~s~~~v~~~~~~~~~~~SLd~p~~~~~~~~l~d~i~D~~~~~P~e~ 77 (78) T pfam04539 29 ELGISEEKVREVLEAAREPVSLDLPVGEEEDGELGDLLEDDDAESPEDA 77 (78) T ss_pred HHCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHCCCCCCCCHHHH T ss_conf 9694999999999875898661067688998628662059999881010 No 231 >pfam03599 CdhD CO dehydrogenase/acetyl-CoA synthase delta subunit. Probab=71.92 E-value=7.6 Score=18.77 Aligned_cols=161 Identities=14% Similarity=0.145 Sum_probs=79.5 Q ss_pred CCEEEEEEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCC--CCCCCEEEEEE-CCCHHHH Q ss_conf 98899999966752884379999971016800799999999997654012261001631--02652899984-7842589 Q gi|254780791|r 83 GIEFLVIGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKNPI--PFIPKIIAVIT-SPTGAVI 159 (529) Q Consensus 83 G~~v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~l--P~~p~~i~vit-s~~~a~~ 159 (529) |..|+.+-..++|+|-+ +-+.|.+.-. .+++..+.++.++ +. |+ ++..+ -.-+.-|+|.. |.+.+.+ T Consensus 15 GE~VLyRHektf~npt~-ia~dv~D~m~--~~ei~a~~~~i~~-v~----~~--~~~rVG~~l~~D~iav~~~s~dp~~f 84 (384) T pfam03599 15 GEEVLYRFELPFPNPTA-ITIDVFDNLS--PELLKARRERIED-VM----FD--PKKRVGEKLTLDMIAIRLISTDPKEF 84 (384) T ss_pred CEEEEEECCCCCCCCCC-EEEEECCCCC--HHHHHHHHHHHHH-HH----CC--HHHHHHHHCCCCEEEEEECCCCHHHH T ss_conf 83688746625679982-6999538877--6779999999875-21----32--22222122023389999078999999 Q ss_pred HHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCC-------------CCCCCCCCEEEEECCCCCHHHHHH Q ss_conf 9999986305975899972100111103679999999974100-------------357677758999516888444220 Q gi|254780791|r 160 RDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKE-------------GRTCPRPDIIILARGGGSIEDLWH 226 (529) Q Consensus 160 ~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~-------------~~~~~~~D~iii~RGGGS~eDL~~ 226 (529) ......+.....+.++|... -+.-+-+|++.+..... ..-..++++-+++++.+++++|. T Consensus 85 a~~vk~V~~~~~~pliL~~~------dp~vl~aale~~~~~rPLlyaAt~~N~~~m~~lA~~~~~Pv~v~a~~dl~~l~- 157 (384) T pfam03599 85 AKAVEKVLQAVDVPLVIGGS------DPEVLKAALEVAEDERPLLYAATLDNYKEIAELALEYKHPVLLWSINDLNELK- 157 (384) T ss_pred HHHHHHHHHHCCCCEEEEEC------CHHHHHHHHHHHCCCCCEEEECCHHHHHHHHHHHHHCCCCEEEECCCCHHHHH- T ss_conf 99999999646998899808------99999999997465897785489989999999999749828997488899999- Q ss_pred CCHHHHHHHHHHCCCE--EEE----EECCCCCCHHHHHHHCCCC Q ss_conf 0769999999748904--885----2057775258988641237 Q gi|254780791|r 227 FNDEMIVRAIANSSIP--IIS----AIGHETDWTLADYAADLRA 264 (529) Q Consensus 227 FN~e~laraI~~~~iP--Vis----gIGHE~D~Tl~D~VAD~Ra 264 (529) .|++.+.+.-+- |+- ..|.....|+.||+.=.|+ T Consensus 158 ----~L~~~l~~~GikdlVLDPgT~~~g~~l~~t~~n~~~IRRa 197 (384) T pfam03599 158 ----NLNRKLLKAGVKDIVLDPTTEALGYGIKDTIDNFVRIRRA 197 (384) T ss_pred ----HHHHHHHHCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHH T ss_conf ----9999999769854898898866674689999999999999 No 232 >TIGR02037 degP_htrA_DO protease Do; InterPro: IPR011782 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This family consists serine peptidases belonging to MEROPS peptidase family S1, subfamily S1C (protease Do, clan PA(S)). They are variously designated DegP, DegQ, heat shock protein HtrA, MucD and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in Escherichia coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens . The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures .; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis. Probab=71.78 E-value=6.6 Score=19.31 Aligned_cols=41 Identities=20% Similarity=0.338 Sum_probs=22.6 Q ss_pred CCEEEEECCC-C--CEECCHHHCCCCCEEEEEEECEE-EEEEEEECC Q ss_conf 4619998489-8--89577789299986999991109-999995057 Q gi|254780791|r 460 RGYTSIQDTN-N--NFITQKRNLATKTRILINFFDGQ-ANAIVINKA 502 (529) Q Consensus 460 RGYaiv~~~~-G--kiI~s~~~l~~gd~i~i~l~DG~-v~a~V~~k~ 502 (529) =|| ||. +| | =+||+..=+..-+.|++.|+||. .+|+++++- T Consensus 91 SGv-Ii~-~d~Gk~YilTNnHVv~gA~~I~V~L~DgrefkAklvG~D 135 (484) T TIGR02037 91 SGV-IIS-ADKGKFYILTNNHVVDGADEITVTLSDGREFKAKLVGKD 135 (484) T ss_pred CCE-EEE-CCCCEEEEEECCEEECCCCEEEEEECCCCEEEEEEECCC T ss_conf 418-984-789869998754363685379999459948556886667 No 233 >COG2957 Peptidylarginine deiminase and related enzymes [Amino acid transport and metabolism] Probab=71.75 E-value=7.3 Score=18.94 Aligned_cols=162 Identities=24% Similarity=0.303 Sum_probs=77.9 Q ss_pred ECCCCEEEEEEECCCCCC-CCCCCCCCCEEEEE----EEEEEECCCCEEEEEEE--EEEECCCCHHHH------------ Q ss_conf 748947999997352105-86681459889999----99667528843799999--710168007999------------ Q gi|254780791|r 58 KDNHSRIDAIIWKGTLNK-IEFLPEEGIEFLVI----GKITTFPGSSKYQIIIE--SLIPSGSGTLLT------------ 118 (529) Q Consensus 58 kd~~a~i~~~~~~~~~~~-~~~~~~~G~~v~~~----g~~~~y~~~g~~ql~v~--~i~~~g~G~l~~------------ 118 (529) .++++...|+=|++++=. ++|.-+..++|-.+ +....|... -|+.+ .|+..|.|.++- T Consensus 104 i~~~g~~~a~d~~FNaWGGlkf~~e~Dd~Va~k~~~~~~k~~~~~~---~fiLEGGsi~~dG~Gt~LTT~~CLLn~nRNP 180 (346) T COG2957 104 INDKGDLRAVDFGFNAWGGLKFPWELDDQVASKLAELLFKAPLKTD---PFILEGGSIHSDGEGTLLTTEQCLLNDNRNP 180 (346) T ss_pred ECCCCCEEEECCEECCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCC---CEEEECCCEEECCCEEEEEEHHHHCCCCCCC T ss_conf 6377728874214503455346611231899999985365754558---7356367240158643885457655899898 Q ss_pred -----HHH-HHHHHHHHH-------HCCCCCCCC---CCCCC--CCEEEEE-----ECCCHHHHHHHHHHHHH-C----C Q ss_conf -----999-999976540-------122610016---31026--5289998-----47842589999998630-5----9 Q gi|254780791|r 119 -----ALE-KRKKKLLEE-------GLFSDQHKN---PIPFI--PKIIAVI-----TSPTGAVIRDILQRISC-R----F 170 (529) Q Consensus 119 -----~~e-~lk~~L~~e-------Glfd~~~k~---~lP~~--p~~i~vi-----ts~~~a~~~D~~~~~~~-r----~ 170 (529) .+| .|++-|.++ |+.....-- .+-+| |-.|+.. ..+.-|++++++..|+. | - T Consensus 181 hl~~~~ie~~L~e~Lg~kkvlWL~~Gl~~D~TDgHiDtlarFv~p~~iv~~~~dde~Dp~y~~~q~~~~~L~~~~d~~G~ 260 (346) T COG2957 181 HLNQEGIENKLKEYLGAKKVLWLEYGLKNDDTDGHIDTLARFVAPGEIVLSWCDDENDPHYAALQAMLEELKELRDAKGR 260 (346) T ss_pred CCCHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCC T ss_conf 85888999999998576278982678767766510454665427882898724799983589999999999850056897 Q ss_pred CEEEEEEECC----CC-CCCHHHHH--------------------HHHHHHHHHHCCCCCCCCCCEEEEECCCCCHH Q ss_conf 7589997210----01-11103679--------------------99999997410035767775899951688844 Q gi|254780791|r 171 PLRVIIFPVK----VQ-GDECPKEI--------------------ANAILQLNTLKEGRTCPRPDIIILARGGGSIE 222 (529) Q Consensus 171 p~~~~~~p~~----vQ-G~~a~~~i--------------------~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~e 222 (529) |++|+=.|++ -+ |+.-+++- .-|++.+..+-....+..++.-.|.+||||+- T Consensus 261 ~~~l~~Lp~Pk~~~~e~~~rL~aSY~NFlI~N~avIvP~y~D~nD~~a~~~L~~~fP~reVVGVp~r~il~ggGs~H 337 (346) T COG2957 261 PLKLHKLPIPKPVTDEDGERLPASYVNFLIINGAVIVPQYDDPNDALALDVLQQAFPGREVVGVPAREILLGGGSLH 337 (346) T ss_pred EEEEEECCCCCCCCCCCCCCCCCCCEEEEEECCEEEEECCCCCCHHHHHHHHHHHCCCCEEECCCCHHHEECCCCEE T ss_conf 34799757885342335789753114579866818963058963389999999768997375653288522688358 No 234 >PRK08150 enoyl-CoA hydratase; Provisional Probab=71.34 E-value=7.8 Score=18.67 Aligned_cols=37 Identities=22% Similarity=0.409 Sum_probs=23.6 Q ss_pred HHHHHHHHHCCCEEEEEE-CC----CCCCHHHHHHHCCCCCCCHH Q ss_conf 999999974890488520-57----77525898864123777214 Q gi|254780791|r 230 EMIVRAIANSSIPIISAI-GH----ETDWTLADYAADLRAPTPTG 269 (529) Q Consensus 230 e~laraI~~~~iPVisgI-GH----E~D~Tl~D~VAD~Ra~TPTa 269 (529) ..+...+..||+|||.+| || -.... +.+|.|..+|++ T Consensus 91 ~~~~~~i~~~~~PvIaav~G~a~GgG~~la---lacD~~ia~~~a 132 (264) T PRK08150 91 HRVFDKIQYGRVPVIAALHGAVVGGGLELA---SAAHIRVADEST 132 (264) T ss_pred HHHHHHHHHCCCCEEEEECCEEEEEEEEEE---CCCEEEECCCCC T ss_conf 999999982899889997268987024310---230455415786 No 235 >PRK07659 enoyl-CoA hydratase; Provisional Probab=70.97 E-value=7.9 Score=18.61 Aligned_cols=39 Identities=26% Similarity=0.267 Sum_probs=27.5 Q ss_pred HHHHHHHHHCCCEEEEEE-CCCC--CCHHHHHHHCCCCCCCHH Q ss_conf 999999974890488520-5777--525898864123777214 Q gi|254780791|r 230 EMIVRAIANSSIPIISAI-GHET--DWTLADYAADLRAPTPTG 269 (529) Q Consensus 230 e~laraI~~~~iPVisgI-GHE~--D~Tl~D~VAD~Ra~TPTa 269 (529) ..+..++++||.|||++| ||=. =+.| =+.+|+|..+|+| T Consensus 88 ~~~~~~~~~~~kPvIaav~G~a~GgG~~l-al~cD~~ia~~~a 129 (260) T PRK07659 88 SEIVVTLYTMPKLVISAIHGPAAGLGLSL-ALTADYVMADISA 129 (260) T ss_pred HHHHHHHHHCCCCEEEEECCEEEECHHHH-HHHCCCCCCCCCC T ss_conf 99999999689989999637066413499-8742745556776 No 236 >PRK10898 serine endoprotease; Provisional Probab=70.67 E-value=8 Score=18.56 Aligned_cols=37 Identities=11% Similarity=0.044 Sum_probs=28.2 Q ss_pred HCCCEEEEECCCCCEECCHHHC-------CCCCEEEEEEECEEEE Q ss_conf 3146199984898895777892-------9998699999110999 Q gi|254780791|r 458 LKRGYTSIQDTNNNFITQKRNL-------ATKTRILINFFDGQAN 495 (529) Q Consensus 458 L~RGYaiv~~~~GkiI~s~~~l-------~~gd~i~i~l~DG~v~ 495 (529) |+.|=-|+. -||+.|++..++ ++||.+++.+..|--. T Consensus 298 L~~GDvI~~-idg~~v~~~~~l~~~l~~~~pGd~v~l~v~R~G~~ 341 (355) T PRK10898 298 IQVNDLIIS-VNNKPAISALETMDQVAEIRPGSVIPVVVMRDDKQ 341 (355) T ss_pred CCCCCEEEE-ECCEECCCHHHHHHHHHHCCCCCEEEEEEEECCEE T ss_conf 998999999-89989389999999997189979899999999999 No 237 >PRK13011 formyltetrahydrofolate deformylase; Reviewed Probab=70.61 E-value=8 Score=18.55 Aligned_cols=90 Identities=14% Similarity=0.237 Sum_probs=54.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHH-CCCEEEE--E------------ Q ss_conf 800799999999997654012261001631026528999847842589999998630-5975899--9------------ Q gi|254780791|r 112 GSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISC-RFPLRVI--I------------ 176 (529) Q Consensus 112 g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~-r~p~~~~--~------------ 176 (529) ...++...|+++-+++.. +-+..-+.-++|++|..|+-+-.+.|+|...+. ..|++|. + T Consensus 63 ~~~~l~~~f~~ia~~~~m------~w~l~~~~~k~RvaImVSK~~HCL~DLL~R~~~GeL~~eI~~VISNH~dL~~lae~ 136 (287) T PRK13011 63 DEDALRAGFAPIAAAFGM------QWELHDPAARPKVLIMVSKFDHCLNDLLYRWRIGELPMDIVGVVSNHPDLEPLAAW 136 (287) T ss_pred CHHHHHHHHHHHHHHCCC------EEEEECCCCCCEEEEEEECCCCCHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHH T ss_conf 999999999998865396------79997056784389999685236999999976498684465873498678999997 Q ss_pred -----EECCCCCC---CHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECC Q ss_conf -----72100111---103679999999974100357677758999516 Q gi|254780791|r 177 -----FPVKVQGD---ECPKEIANAILQLNTLKEGRTCPRPDIIILARG 217 (529) Q Consensus 177 -----~p~~vQG~---~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RG 217 (529) |..+|-.+ .++.++.+-|+.. ++|+||++|= T Consensus 137 ~~IPF~~iPv~~~~K~e~E~~~~~~~~~~----------~~d~ivla~y 175 (287) T PRK13011 137 HGIPFHHFPITPDTKPQQEAQVLDVIEES----------GAELVVLARY 175 (287) T ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHC----------CCCEEEHHHH T ss_conf 69982883788776378999999998733----------9739943448 No 238 >COG2984 ABC-type uncharacterized transport system, periplasmic component [General function prediction only] Probab=70.51 E-value=8.1 Score=18.53 Aligned_cols=94 Identities=20% Similarity=0.270 Sum_probs=61.2 Q ss_pred CCCCEEEEEECCCH---HHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCC Q ss_conf 26528999847842---589999998630597589997210011110367999999997410035767775899951688 Q gi|254780791|r 143 FIPKIIAVITSPTG---AVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGG 219 (529) Q Consensus 143 ~~p~~i~vits~~~---a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGG 219 (529) ...++|||+=+|+- ....+.++...+-.++++ +++.|+- ..++-.++.-+. .++|+|.+- T Consensus 157 Pnak~Igv~Y~p~E~ns~~l~eelk~~A~~~Gl~v--ve~~v~~---~ndi~~a~~~l~--------g~~d~i~~p---- 219 (322) T COG2984 157 PNAKSIGVLYNPGEANSVSLVEELKKEARKAGLEV--VEAAVTS---VNDIPRAVQALL--------GKVDVIYIP---- 219 (322) T ss_pred CCCEEEEEEECCCCCCCHHHHHHHHHHHHHCCCEE--EEEECCC---CCCCHHHHHHHC--------CCCCEEEEE---- T ss_conf 88706999957988660899999999998779889--9983476---320089999734--------787679986---- Q ss_pred CHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHH Q ss_conf 84442200769999999748904885205777525898 Q gi|254780791|r 220 SIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLAD 257 (529) Q Consensus 220 S~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D 257 (529) ..-|-.+--+.|.......+||||+ |.+|+.--. T Consensus 220 -~dn~i~s~~~~l~~~a~~~kiPli~---sd~~~V~~G 253 (322) T COG2984 220 -TDNLIVSAIESLLQVANKAKIPLIA---SDTSSVKEG 253 (322) T ss_pred -CCHHHHHHHHHHHHHHHHHCCCEEC---CCHHHHHCC T ss_conf -6067788899999998870897354---777887668 No 239 >cd06278 PBP1_LacI_like_2 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=70.40 E-value=8.1 Score=18.51 Aligned_cols=75 Identities=21% Similarity=0.220 Sum_probs=30.7 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHC-CCCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCC---CCCCCCCC Q ss_conf 888898622999999999997400-17189999970543568886279998748947999997352105---86681459 Q gi|254780791|r 8 NSLDHPEYSVSELSYHLKHIVESN-LSHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNK---IEFLPEEG 83 (529) Q Consensus 8 ~~~~~~~~svs~l~~~i~~~l~~~-~~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~---~~~~~~~G 83 (529) |+.++|- -+++...|...++.. |. +-+ .. ++-. ...-..+-.|.+ -+++++++-+.... +....+.| T Consensus 7 p~i~npf--f~~i~~gie~~~~~~gy~-~ll-~~-~~~~--~~~e~~l~~l~~--~~vDGiIi~~~~~~~~~~~~~~~~~ 77 (266) T cd06278 7 ADLDNPF--YSELLEALSRALQARGYQ-PLL-IN-TDDD--EDLDAALRQLLQ--YRVDGVIVTSGTLSSELAEECRRNG 77 (266) T ss_pred CCCCCHH--HHHHHHHHHHHHHHCCCE-EEE-EE-CCCC--HHHHHHHHHHHH--CCCCEEEEECCCCCHHHHHHHHHCC T ss_conf 9987699--999999999999986999-999-97-9998--899999999995--5999999948989999999999769 Q ss_pred CEEEEEEE Q ss_conf 88999999 Q gi|254780791|r 84 IEFLVIGK 91 (529) Q Consensus 84 ~~v~~~g~ 91 (529) .-|++.++ T Consensus 78 iPvV~i~~ 85 (266) T cd06278 78 IPVVLINR 85 (266) T ss_pred CCEEEECC T ss_conf 98999787 No 240 >cd06305 PBP1_methylthioribose_binding_like Methylthioribose-binding protein-like of ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily. Methylthioribose-binding protein-like of ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily, which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. The sugar-binding domain of the periplasmic proteins in this group is also homologous to the ligand-binding domain of eukaryotic receptors such as metabotropic glutamate receptor (mGluR), DNA-binding transcriptional repressors such as LacI and GalR. Probab=70.26 E-value=8.2 Score=18.49 Aligned_cols=82 Identities=18% Similarity=0.192 Sum_probs=47.8 Q ss_pred EEEEEECCCHHHHHHH-HHHH---HHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHH Q ss_conf 8999847842589999-9986---30597589997210011110367999999997410035767775899951688844 Q gi|254780791|r 147 IIAVITSPTGAVIRDI-LQRI---SCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIE 222 (529) Q Consensus 147 ~i~vits~~~a~~~D~-~~~~---~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~e 222 (529) |||||...++.-|..- .+-+ .+..++++.++.+ +| -+..-++.|+.+-.. ++|.|||.=.-.+ T Consensus 1 kIa~~~~~~~~~f~~~~~~G~~~~A~~~G~~~~v~d~--~~--d~~~Q~~~i~~~i~~-------~vDgIii~p~d~~-- 67 (273) T cd06305 1 RIAVVRYGGSGDFDQAYLAGTKAEAEALGGDLRVYDA--GG--DDAKQADQIDQAIAQ-------KVDAIIIQHGRAE-- 67 (273) T ss_pred CEEEEECCCCCHHHHHHHHHHHHHHHHCCCEEEEECC--CC--CHHHHHHHHHHHHHC-------CCCEEEEECCCHH-- T ss_conf 9899971788979999999999999974998999739--99--999999999999985-------9999999468714-- Q ss_pred HHHHCCHHHHHHHHHHCCCEEEEE Q ss_conf 422007699999997489048852 Q gi|254780791|r 223 DLWHFNDEMIVRAIANSSIPIISA 246 (529) Q Consensus 223 DL~~FN~e~laraI~~~~iPVisg 246 (529) . -...++...+..||||+- T Consensus 68 ---~--~~~~l~~a~~aGIPVV~~ 86 (273) T cd06305 68 ---V--LKPWVKRALDAGIPVVAF 86 (273) T ss_pred ---H--HHHHHHHHHHCCCCEEEE T ss_conf ---4--489999999859978998 No 241 >PRK11914 diacylglycerol kinase; Reviewed Probab=69.98 E-value=8.3 Score=18.44 Aligned_cols=106 Identities=25% Similarity=0.366 Sum_probs=61.0 Q ss_pred CCEEEEEECCCH---H---HHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCC Q ss_conf 528999847842---5---8999999863059758999721001111036799999999741003576777589995168 Q gi|254780791|r 145 PKIIAVITSPTG---A---VIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGG 218 (529) Q Consensus 145 p~~i~vits~~~---a---~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGG 218 (529) .+||.||-=|.+ . .+..+...+..+. +++.+.++. +.+-+.++++... . ..+|+||++=|- T Consensus 6 m~kv~vIvNP~aG~g~~~~~~~~i~~~l~~~g-~~~~~~~t~--~~~~a~~la~~a~---~-------~g~d~vv~~GGD 72 (304) T PRK11914 6 IGKVTVLTNPLSGHGAAPHAAERAIARLHHRG-VDVVEIVGT--DAHHARHLVAAAL---A-------KGTDALVVVGGD 72 (304) T ss_pred CCEEEEEECCCCCCCCHHHHHHHHHHHHHHCC-CCEEEEECC--CHHHHHHHHHHHH---H-------CCCCEEEEEECC T ss_conf 86699999977799856889999999999879-909999327--8789999999888---6-------499699999562 Q ss_pred CCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHCCCCCC--CHHHHHHHC Q ss_conf 884442200769999999748904885205777525898864123777--214567633 Q gi|254780791|r 219 GSIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYAADLRAPT--PTGAAEMAV 275 (529) Q Consensus 219 GS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~VAD~Ra~T--PTaAAElav 275 (529) |.+- +++..++.+.+|+ |-=.==|--||+-.+.-|+ |-+|++.+. T Consensus 73 GTv~--------ev~~~l~~~~~pl----giiP~GTgNdfAr~lgip~~d~~~a~~~i~ 119 (304) T PRK11914 73 GVIS--------NALQVLAGTDIPL----GIIPAGTGNDHAREFGIPTGDPEAAADVIV 119 (304) T ss_pred HHHH--------HHHHHHCCCCCEE----EEECCCCHHHHHHHCCCCCCCHHHHHHHHH T ss_conf 5988--------9876413578608----996388726889972999878999998874 No 242 >cd07021 Clp_protease_NfeD_like Nodulation formation efficiency D (NfeD) is a membrane-bound ClpP-class protease. Nodulation formation efficiency D (NfeD; stomatin operon partner protein, STOPP; DUF107) is a member of membrane-anchored ClpP-class proteases. Currently, more than 300 NfeD homologs have been identified - all of which are bacterial or archaeal in origin. Majority of these genomes have been shown to possess operons containing a homologous NfeD/stomatin gene pair, causing NfeD to be previously named STOPP (stomatin operon partner protein). NfeD homologs can be divided into two groups: long and short forms. Long-form homologs have a putative ClpP-class serine protease domain while the short form homologs do not. Downstream from the ClpP-class domain is the so-called NfeD or DUF107 domain. N-terminal region of the NfeD homolog PH1510 (1510-N or PH1510-N) from Pyrococcus horikoshii has been shown to possess serine protease activity and has a Ser-Lys catalytic dyad, preferentiall Probab=69.95 E-value=8.3 Score=18.44 Aligned_cols=81 Identities=23% Similarity=0.306 Sum_probs=57.7 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCCEEE--EECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCC--CCCHHHHHHH Q ss_conf 10367999999997410035767775899--9516888444220076999999974890488520577--7525898864 Q gi|254780791|r 185 ECPKEIANAILQLNTLKEGRTCPRPDIII--LARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHE--TDWTLADYAA 260 (529) Q Consensus 185 ~a~~~i~~ai~~~~~~~~~~~~~~~D~ii--i~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE--~D~Tl~D~VA 260 (529) ....-+.++|+.+...+ .++|| |==.||... .-..++.+|.++++||++=|++. .-=+++=+.| T Consensus 13 ~~~~~l~r~l~~A~~~~-------a~~ivl~idTpGG~v~-----~~~~I~~~I~~~~~pvv~~V~~~AaSAG~~Ia~aa 80 (178) T cd07021 13 GLAAFVERALKEAKEEG-------ADAVVLDIDTPGGRVD-----SALEIVDLILNSPIPTIAYVNDRAASAGALIALAA 80 (178) T ss_pred HHHHHHHHHHHHHHHCC-------CCEEEEEEECCCCCHH-----HHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHC T ss_conf 89999999999999689-------9789999979998689-----99999999984899999999992077999999846 Q ss_pred CCCCCCCHHHHHHHCCC Q ss_conf 12377721456763323 Q gi|254780791|r 261 DLRAPTPTGAAEMAVPV 277 (529) Q Consensus 261 D~Ra~TPTaAAElavp~ 277 (529) |..+-+|.+---.+.|. T Consensus 81 d~i~m~p~t~iG~a~Pv 97 (178) T cd07021 81 DEIYMAPGATIGAAEPI 97 (178) T ss_pred CEEEECCCCCEECCCCC T ss_conf 70048899853015425 No 243 >cd01391 Periplasmic_Binding_Protein_Type_1 Type 1 periplasmic binding fold superfamily. Type 1 periplasmic binding fold superfamily. This model and hierarchy represent the ligand binding domains of the LacI family of transcriptional regulators, periplasmic binding proteins of the ABC-type transport systems, the family C G-protein couples receptors (GPCRs), membrane bound guanylyl cyclases including the family of natriuretic peptide receptors (NPRs), and the N-terminal leucine/isoleucine/valine- binding protein (LIVBP)-like domains of the ionotropic glutamate receptors (iGluRs). In LacI-like transcriptional regulator and the bacterial periplasmic binding proteins the ligands are monosaccharides including lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars, with a few exceptions. Periplasmic sugar binding proteins are one of the components of ABC transporters and are involved in the active transport of water-soluble ligands. The LacI family of proteins con Probab=69.63 E-value=8.4 Score=18.39 Aligned_cols=89 Identities=25% Similarity=0.239 Sum_probs=40.5 Q ss_pred EEEEEECCCH-H-HHHHHHHHHH---HC--CCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCC Q ss_conf 8999847842-5-8999999863---05--97589997210011110367999999997410035767775899951688 Q gi|254780791|r 147 IIAVITSPTG-A-VIRDILQRIS---CR--FPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGG 219 (529) Q Consensus 147 ~i~vits~~~-a-~~~D~~~~~~---~r--~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGG 219 (529) +||+|.+.+| . -...+.+-+. +. ++++++++.+ .+.+..-.++++.+... ++|.||+.=+.+ T Consensus 1 ~IG~i~p~~~~~~~~~~~~~g~~~~~~~~g~~~~l~~~~~----~~~~~~~~~~i~~l~~~-------~vd~ii~~~~~~ 69 (269) T cd01391 1 KIGVLLPLSGSAPFGAQLLAGIELAAEEIGRGLEVILADS----QSDPERALEALRDLIQQ-------GVDGIIGPPSSS 69 (269) T ss_pred CEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEEEC----CCCHHHHHHHHHHHHHC-------CCCEEEECCCCC T ss_conf 9899963887886999999999999998199769999849----99999999999999974-------999999468874 Q ss_pred CHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCH Q ss_conf 84442200769999999748904885205777525 Q gi|254780791|r 220 SIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWT 254 (529) Q Consensus 220 S~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~T 254 (529) +...+........||||+--....+.+ T Consensus 70 --------~~~~~~~~~~~~~iPvv~~~~~~~~~~ 96 (269) T cd01391 70 --------SALAVVELAAAAGIPVVSLDATAPDLT 96 (269) T ss_pred --------CHHHHHHHHHHCCCEEEEECCCCCCCC T ss_conf --------448999999976981999678878777 No 244 >cd07015 Clp_protease_NfeD Nodulation formation efficiency D (NfeD) is a membrane-bound ClpP-class protease. Nodulation formation efficiency D (NfeD; stomatin operon partner protein, STOPP; DUF107) is a member of membrane-anchored ClpP-class proteases. Currently, more than 300 NfeD homologs have been identified - all of which are bacterial or archaeal in origin. Majority of these genomes have been shown to possess operons containing a homologous NfeD/stomatin gene pair, causing NfeD to be previously named STOPP (stomatin operon partner protein). NfeD homologs can be divided into two groups: long and short forms. Long-form homologs have a putative ClpP-class serine protease domain while the short form homologs do not. Downstream from the ClpP-class domain is the so-called NfeD or DUF107 domain. N-terminal region of the NfeD homolog PH1510 (1510-N or PH1510-N) from Pyrococcus horikoshii has been shown to possess serine protease activity and has a Ser-Lys catalytic dyad, preferentially cle Probab=69.40 E-value=8.5 Score=18.35 Aligned_cols=78 Identities=23% Similarity=0.264 Sum_probs=56.5 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCCEEEE--ECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEE---CCC--CCCHHHH Q ss_conf 103679999999974100357677758999--516888444220076999999974890488520---577--7525898 Q gi|254780791|r 185 ECPKEIANAILQLNTLKEGRTCPRPDIIIL--ARGGGSIEDLWHFNDEMIVRAIANSSIPIISAI---GHE--TDWTLAD 257 (529) Q Consensus 185 ~a~~~i~~ai~~~~~~~~~~~~~~~D~iii--~RGGGS~eDL~~FN~e~laraI~~~~iPVisgI---GHE--~D~Tl~D 257 (529) ....-+.++|+.+...+ .|+||| ==-||.... -..+..+|.++++||++=| |+. .==|++= T Consensus 13 ~~~~~l~r~l~~A~~~~-------a~~vii~ldTPGG~~~a-----~~~I~~~i~~s~vPv~~yV~P~g~~A~SAGa~I~ 80 (172) T cd07015 13 YTYDQFDRYITIAEQDN-------AEAIIIELDTPGGRADA-----AGNIVQRIQQSKIPVIIYVYPPGASAASAGTYIA 80 (172) T ss_pred HHHHHHHHHHHHHHHCC-------CCEEEEEEECCCCHHHH-----HHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHH T ss_conf 89999999999999779-------98999998689628999-----9999999982999989999479962676999999 Q ss_pred HHHCCCCCCCHH---HHHHH Q ss_conf 864123777214---56763 Q gi|254780791|r 258 YAADLRAPTPTG---AAEMA 274 (529) Q Consensus 258 ~VAD~Ra~TPTa---AAEla 274 (529) +.||.-+-.|.. ||.=+ T Consensus 81 ~aa~~~~MaPgt~iGaA~PV 100 (172) T cd07015 81 LGSHLIAMAPGTSIGACRPI 100 (172) T ss_pred HHCCHHEECCCCCCCCCCEE T ss_conf 85543206799874357632 No 245 >PRK00771 signal recognition particle protein Srp54; Provisional Probab=69.15 E-value=8.5 Score=18.31 Aligned_cols=60 Identities=23% Similarity=0.382 Sum_probs=34.0 Q ss_pred CCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEE------ECCCCCH-------------------HHH Q ss_conf 975899972100111103679999999974100357677758999------5168884-------------------442 Q gi|254780791|r 170 FPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIIL------ARGGGSI-------------------EDL 224 (529) Q Consensus 170 ~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii------~RGGGS~-------------------eDL 224 (529) -|-++++.--.+.|.+|... ++ .|+..- .++-||+ +||||++ +|| T Consensus 207 ~P~e~llV~Da~~GQ~a~~~-a~---~F~~~~------~i~gvIlTKlDgdarGGaaLSi~~~t~~PI~FiG~GEk~~dl 276 (433) T PRK00771 207 KPDEVILVIDATIGQQASEQ-AK---AFKEAV------GIGGIIITKLDGTAKGGGALSAVAETGAPIKFIGTGETIDDL 276 (433) T ss_pred CCCEEEEEEECCCCHHHHHH-HH---HHHHHC------CCCEEEEECCCCCCCCCHHHHHHHHHCCCCEEEECCCCCCCC T ss_conf 79768998654422678999-99---998753------887379972567887305421898878995688617872148 Q ss_pred HHCCHHHHHHHHHHC Q ss_conf 200769999999748 Q gi|254780791|r 225 WHFNDEMIVRAIANS 239 (529) Q Consensus 225 ~~FN~e~laraI~~~ 239 (529) -+||-+.+|..|--+ T Consensus 277 e~F~p~r~asRILGm 291 (433) T PRK00771 277 EPFDPKRFISRLLGM 291 (433) T ss_pred CCCCHHHHHHHHCCC T ss_conf 866808899987089 No 246 >COG1024 CaiD Enoyl-CoA hydratase/carnithine racemase [Lipid metabolism] Probab=69.14 E-value=8.6 Score=18.31 Aligned_cols=81 Identities=22% Similarity=0.300 Sum_probs=47.7 Q ss_pred CCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC-H---HHHHHCC------------H--HHHHHHHHHCCCEE Q ss_conf 111103679999999974100357677758999516888-4---4422007------------6--99999997489048 Q gi|254780791|r 182 QGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGS-I---EDLWHFN------------D--EMIVRAIANSSIPI 243 (529) Q Consensus 182 QG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS-~---eDL~~FN------------~--e~laraI~~~~iPV 243 (529) -...--.++..|+..++... .+- +||.+|.|- + -||-.|- . ..+.+++..||.|| T Consensus 29 l~~~~~~~l~~al~~~~~d~------~vr-~vvltG~g~~FsaG~Dl~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~kPv 101 (257) T COG1024 29 LNLEMLDELAEALDEAEADP------DVR-VVVLTGAGKAFSAGADLKELLSPEDGNAAENLMQPGQDLLRALADLPKPV 101 (257) T ss_pred CCHHHHHHHHHHHHHHHHCC------CCE-EEEEECCCCCEECCCCHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCE T ss_conf 89999999999999996299------816-99998189716717498877422310468888878899999998589999 Q ss_pred EEEE-CCCCCC-HHHHHHHCCCCCCCHH Q ss_conf 8520-577752-5898864123777214 Q gi|254780791|r 244 ISAI-GHETDW-TLADYAADLRAPTPTG 269 (529) Q Consensus 244 isgI-GHE~D~-Tl~D~VAD~Ra~TPTa 269 (529) |++| ||-.-- .-+=+.+|.|..++.+ T Consensus 102 IAav~G~a~GgG~eLal~~D~ria~~~a 129 (257) T COG1024 102 IAAVNGYALGGGLELALACDIRIAAEDA 129 (257) T ss_pred EEEECCHHHHHHHHHHHHCCEEEECCCC T ss_conf 9983650146788998708879834777 No 247 >cd06289 PBP1_MalI_like Ligand-binding domain of MalI, a transcription regulator of the maltose system of Escherichia coli and its close homologs from other bacteria. This group includes the ligand-binding domain of MalI, a transcription regulator of the maltose system of Escherichia coli and its close homologs from other bacteria. They are members of the LacI-GalR family of repressor proteins which are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=68.52 E-value=8.8 Score=18.21 Aligned_cols=11 Identities=27% Similarity=0.724 Sum_probs=4.6 Q ss_pred CCCCEEEEEEE Q ss_conf 45988999999 Q gi|254780791|r 81 EEGIEFLVIGK 91 (529) Q Consensus 81 ~~G~~v~~~g~ 91 (529) +.|..|++.++ T Consensus 77 ~~~iPvV~i~~ 87 (268) T cd06289 77 ESGIPVVLVAR 87 (268) T ss_pred HCCCCEEEECC T ss_conf 75998998367 No 248 >PRK06901 aspartate-semialdehyde dehydrogenase; Provisional Probab=68.46 E-value=7.6 Score=18.77 Aligned_cols=48 Identities=19% Similarity=0.322 Sum_probs=26.1 Q ss_pred EEEECCCHHHHHHHH--HHHHHCCCE-EEEE---EECCCCCCCHHHHHHHHHHH Q ss_conf 998478425899999--986305975-8999---72100111103679999999 Q gi|254780791|r 149 AVITSPTGAVIRDIL--QRISCRFPL-RVII---FPVKVQGDECPKEIANAILQ 196 (529) Q Consensus 149 ~vits~~~a~~~D~~--~~~~~r~p~-~~~~---~p~~vQG~~a~~~i~~ai~~ 196 (529) |||+.|+...++=.+ .-|.+.|++ .|++ -+++-.|..|..++...... T Consensus 124 ~IIANPNCsTiqlvvaL~PLh~~~~ikrvvvsTYQaVSGaG~~gv~EL~~Qt~~ 177 (323) T PRK06901 124 NIVSLPDPQVSQLALALAPFLQEQPLNQIFVTSLLPASYTDAETVKKLAGQTAR 177 (323) T ss_pred CEEECCCHHHHHHHHHHHHHHHHHCCCEEEEEEEHHHHHHHHHHHHHHHHHHHH T ss_conf 988899828999999889999971977899986200455418899999999999 No 249 >COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning] Probab=68.41 E-value=8.8 Score=18.19 Aligned_cols=13 Identities=15% Similarity=0.026 Sum_probs=5.7 Q ss_pred HHHHHCCCCCCCH Q ss_conf 9886412377721 Q gi|254780791|r 256 ADYAADLRAPTPT 268 (529) Q Consensus 256 ~D~VAD~Ra~TPT 268 (529) ..+++-+-+..|+ T Consensus 20 ~~l~~~~~~~s~s 32 (420) T COG4942 20 SLLSAAVLAAAFS 32 (420) T ss_pred HHHHHCCCCCCHH T ss_conf 9986051135766 No 250 >cd00363 PFK Phosphofructokinase, a key regulatory enzyme in glycolysis, catalyzes the phosphorylation of fructose-6-phosphate to fructose-1,6-biphosphate. The members belong to PFK family that includes ATP- and pyrophosphate (PPi)- dependent phosphofructokinases. Some members evolved by gene duplication and thus have a large C-terminal/N-terminal extension comprising a second PFK domain. Generally, ATP-PFKs are allosteric homotetramers, and PPi-PFKs are dimeric and nonallosteric except for plant PPi-PFKs which are allosteric heterotetramers. Probab=68.23 E-value=8.9 Score=18.16 Aligned_cols=43 Identities=16% Similarity=0.214 Sum_probs=20.9 Q ss_pred CCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHH Q ss_conf 75899951688844422007699999997489048852057775258 Q gi|254780791|r 209 PDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWTL 255 (529) Q Consensus 209 ~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl 255 (529) .|.+|++=|-||.....-+.+|-.- ..+.+||| ||=.-+|--| T Consensus 93 Id~Li~IGGdgS~~~a~~L~~~~~~---~~~~i~vv-gIPkTIDNDl 135 (338) T cd00363 93 IDALVVIGGDGSYTGADLLTEEWPS---KYQGFNVI-GLPGTIDNDI 135 (338) T ss_pred CCEEEEECCCHHHHHHHHHHHHHHH---CCCCCCEE-EEECCCCCCC T ss_conf 9989997892699999999998885---28995189-8410347899 No 251 >TIGR02997 Sig70-cyanoRpoD RNA polymerase sigma factor, cyanobacterial RpoD-like family. Probab=67.89 E-value=9 Score=18.11 Aligned_cols=105 Identities=21% Similarity=0.268 Sum_probs=57.4 Q ss_pred CCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHH----HH-----CCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 2610016310265289998478425899999986----30-----59758999721001111036799999999741003 Q gi|254780791|r 133 FSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRI----SC-----RFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEG 203 (529) Q Consensus 133 fd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~----~~-----r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~ 203 (529) |||+|--.+.+| ||+=|..-| ++ |-|++|. |=++-|+++.+.-.. T Consensus 155 FDPtrGYKFSTY-------------AYWWIRQ~ITRAIA~qSRTIRLPiH~~-------------EKLnkiKK~qReL~q 208 (336) T TIGR02997 155 FDPTRGYKFSTY-------------AYWWIRQGITRAIANQSRTIRLPIHIT-------------EKLNKIKKVQRELSQ 208 (336) T ss_pred CCCCCCCCCCCH-------------HHHHHHHHHHHHHHCCCCCCCCCCCHH-------------HHHHHHHHHHHHHHH T ss_conf 577778863301-------------246875688999843688324741076-------------666899899899999 Q ss_pred CCCCCCCEEEEECCCCCHHHHHHC---CHHHHHHHHHHCCCE--EEEEE-CCCCCCHHHHHHHCCCCCCCHHHHHH Q ss_conf 576777589995168884442200---769999999748904--88520-57775258988641237772145676 Q gi|254780791|r 204 RTCPRPDIIILARGGGSIEDLWHF---NDEMIVRAIANSSIP--IISAI-GHETDWTLADYAADLRAPTPTGAAEM 273 (529) Q Consensus 204 ~~~~~~D~iii~RGGGS~eDL~~F---N~e~laraI~~~~iP--VisgI-GHE~D~Tl~D~VAD~Ra~TPTaAAEl 273 (529) .-.-.| |..+|.-+ +-+.|=..+..++-| -=+=| |-|-|+.|-|+-+|-...+|....+- T Consensus 209 ~lGR~P----------s~~EiAe~lel~~~qvR~~l~~~~~p~SLd~~VrG~e~dt~LgdLl~d~~~~~P~~~~~~ 274 (336) T TIGR02997 209 KLGRRP----------SEAEIAEALELEPEQVRELLQRARQPVSLDTRVRGDEEDTELGDLLEDEDGESPEEQVEQ 274 (336) T ss_pred HHCCCC----------CHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCHHHHHH T ss_conf 838897----------868999883889889999999707874457732688854226543268889982789999 No 252 >TIGR02280 PaaB1 phenylacetate degradation probable enoyl-CoA hydratase paaB; InterPro: IPR011968 This family of proteins are found within apparent operons for the degradation of phenylacetic acid . These proteins contain the enoyl-CoA hydratase domain. This activity is consistent with current hypotheses for the degradation pathway which involve the ligation of phenylacetate with coenzyme A (paaF), hydroxylation (paaGHIJK), ring-opening (paaN) and degradation of the resulting fatty acid-like compound to a Krebs cycle intermediate (paaABCDE).. Probab=67.74 E-value=3.6 Score=21.62 Aligned_cols=54 Identities=26% Similarity=0.521 Sum_probs=40.1 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEE---CC--------------CCCHHHHHH-----CCHHHHHHHHHHCCCEEE Q ss_conf 36799999999741003576777589995---16--------------888444220-----076999999974890488 Q gi|254780791|r 187 PKEIANAILQLNTLKEGRTCPRPDIIILA---RG--------------GGSIEDLWH-----FNDEMIVRAIANSSIPII 244 (529) Q Consensus 187 ~~~i~~ai~~~~~~~~~~~~~~~D~iii~---RG--------------GGS~eDL~~-----FN~e~laraI~~~~iPVi 244 (529) +.++.+|++++.+..+ +=++|+. || ||..-||=- || .|+|.+++.|.||| T Consensus 28 H~~l~~aL~~ver~~~------~R~l~LTGAGRGFcAGQDL~dr~v~~~G~~pDLG~~ve~fYn--PLvRrL~~Lp~PVv 99 (259) T TIGR02280 28 HAELREALERVERDDD------ARALLLTGAGRGFCAGQDLSDRNVTPGGAAPDLGETVEKFYN--PLVRRLRALPKPVV 99 (259) T ss_pred HHHHHHHHHHHHCCCC------EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHC--HHHHHHHHCCCCEE T ss_conf 9999999866401575------458887388874344655544678874225853221777507--48999972699779 Q ss_pred EEEC Q ss_conf 5205 Q gi|254780791|r 245 SAIG 248 (529) Q Consensus 245 sgIG 248 (529) .||= T Consensus 100 ~AVN 103 (259) T TIGR02280 100 CAVN 103 (259) T ss_pred EEEC T ss_conf 8535 No 253 >cd06270 PBP1_GalS_like Ligand binding domain of DNA transcription iso-repressor GalS, which is one of two regulatory proteins involved in galactose transport and metabolism. Ligand binding domain of DNA transcription iso-repressor GalS, which is one of two regulatory proteins involved in galactose transport and metabolism. Transcription of the galactose regulon genes is regulated by Gal iso-repressor (GalS) and Gal repressor (GalR) in different ways, but both repressors recognize the same DNA binding site in the absence of D-galactose. GalS is a dimeric protein like GalR,and its major role is in regulating expression of the high-affinity galactose transporter encoded by the mgl operon, whereas GalR is the exclusive regulator of galactose permease, the low-affinity galactose transporter. GalS and GalR are members of the LacI-GalR family of transcription regulators and both contain the type I periplasmic binding protein-like fold. Hence, they are homologous to the periplasmic sugar bindi Probab=67.74 E-value=9.1 Score=18.09 Aligned_cols=95 Identities=23% Similarity=0.235 Sum_probs=41.6 Q ss_pred CEEEEEECCCHHH-----HHHHHHHHHHCC-CE-EEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCC Q ss_conf 2899984784258-----999999863059-75-8999721001111036799999999741003576777589995168 Q gi|254780791|r 146 KIIAVITSPTGAV-----IRDILQRISCRF-PL-RVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGG 218 (529) Q Consensus 146 ~~i~vits~~~a~-----~~D~~~~~~~r~-p~-~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGG 218 (529) ++||+|+++.... +.=|...+++.. ++ ...++. |+....+-..+++.+-.. .+++|+|+- T Consensus 117 ~~i~~i~~~~~~~~~~~R~~Gf~~al~~~gl~~~~~~i~~----~~~~~~~g~~~~~~ll~~-----~~~~~ai~~---- 183 (268) T cd06270 117 RKIACITGPLTKEDARLRLQGYRDALAEAGIALDESLIIE----GDFTEEGGYAAMQELLAR-----GAPFTAVFC---- 183 (268) T ss_pred CCEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEE----CCCCHHHHHHHHHHHHHC-----CCCCCEEEE---- T ss_conf 8389845988883199999999999998599978004761----677678899988999956-----999874553---- Q ss_pred CCHHHHHHCCHHHHHHHHHHC--CCE-EEEEECCCCCCHHHHHH Q ss_conf 884442200769999999748--904-88520577752589886 Q gi|254780791|r 219 GSIEDLWHFNDEMIVRAIANS--SIP-IISAIGHETDWTLADYA 259 (529) Q Consensus 219 GS~eDL~~FN~e~laraI~~~--~iP-VisgIGHE~D~Tl~D~V 259 (529) ..|.+++ .+.++..+. +|| =|+=||.. |..+++++ T Consensus 184 --~nD~~A~---g~~~~l~~~g~~vP~disvigfd-d~~~~~~~ 221 (268) T cd06270 184 --ANDEMAA---GAISALREHGISVPQDVSIIGFD-DVLLARYL 221 (268) T ss_pred --CCHHHHH---HHHHHHHHCCCCCCCCEEEEEEC-CHHHHHHC T ss_conf --3779999---99999998299999976999989-86899714 No 254 >PRK07135 dnaE DNA polymerase III DnaE; Validated Probab=67.67 E-value=9.1 Score=18.08 Aligned_cols=17 Identities=12% Similarity=0.118 Sum_probs=7.8 Q ss_pred CEEEEEEECEEEEEEEE Q ss_conf 86999991109999995 Q gi|254780791|r 483 TRILINFFDGQANAIVI 499 (529) Q Consensus 483 d~i~i~l~DG~v~a~V~ 499 (529) ....+.|.|++-...|+ T Consensus 930 ~~af~~l~D~tg~~e~~ 946 (987) T PRK07135 930 EYKKVIFADDTVEIKIF 946 (987) T ss_pred CEEEEEEECCCCCEEEE T ss_conf 57999998688658999 No 255 >TIGR00615 recR recombination protein RecR; InterPro: IPR000093 The bacterial protein recR seems to play a role in a recombinational process of DNA repair . It may act with recF and recO. RecR is a protein of about 200 amino acid residues. This protein contains a putative C4-type zinc finger in the N-terminal section.; GO: 0006281 DNA repair, 0006310 DNA recombination. Probab=67.65 E-value=9.1 Score=18.07 Aligned_cols=154 Identities=23% Similarity=0.312 Sum_probs=83.0 Q ss_pred CCCCCHHHHHHHHHHHHHHCCCCEEEEE--EECCCCCCCCCCEEEE--EEECC-CCEEEEEEECCCCCCCCCCCCCCCEE Q ss_conf 9862299999999999740017189999--9705435688862799--98748-94799999735210586681459889 Q gi|254780791|r 12 HPEYSVSELSYHLKHIVESNLSHVCVRG--EISGYRGIHSSGHAYF--SLKDN-HSRIDAIIWKGTLNKIEFLPEEGIEF 86 (529) Q Consensus 12 ~~~~svs~l~~~i~~~l~~~~~~~~v~g--Eis~~~~~~~sGH~Yf--~lkd~-~a~i~~~~~~~~~~~~~~~~~~G~~v 86 (529) .++..+.+|+..+..+.++ ..+..+.| .||.=. + |++ +-+++ +++| ||+-.. +| | T Consensus 35 ~~~~~~~~la~aL~~a~~~-~~~C~~C~Cl~~s~~~--~----C~ICsd~~Rde~s~i-CvV~~p---------~D---v 94 (205) T TIGR00615 35 RDPSEVERLAQALLEAKEE-LRTCSVCGCLALSDQE--V----CNICSDERRDENSVI-CVVEDP---------KD---V 94 (205) T ss_pred CCHHHHHHHHHHHHHHHHH-CCCCCCCCEEECCCCC--C----CCCCCCCCCCCCCEE-EEECCH---------HH---H T ss_conf 8999999999999999871-4746567502114778--7----775788873447536-874472---------67---9 Q ss_pred EEEEEEEEECCCCEEEEEEEEEEEC-CCCHHHHHHHHHHH----HHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHH Q ss_conf 9999966752884379999971016-80079999999999----765401226100163102652899984784258999 Q gi|254780791|r 87 LVIGKITTFPGSSKYQIIIESLIPS-GSGTLLTALEKRKK----KLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRD 161 (529) Q Consensus 87 ~~~g~~~~y~~~g~~ql~v~~i~~~-g~G~l~~~~e~lk~----~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D 161 (529) .+.-++.-| +|.|-..--.|.|- |+|-=...+..|.+ ||.++ T Consensus 95 ~a~E~~~~f--~G~YhVL~G~isPl~Gigp~~~~i~~L~~riGeRl~~~------------------------------- 141 (205) T TIGR00615 95 FALEKTKEF--RGRYHVLGGLISPLDGIGPEDLKIAALLKRIGERLQEE------------------------------- 141 (205) T ss_pred HHHHHHHHH--CCCEEEECCEECCCCCCCCCCCCHHHHHHHHHCCCCCC------------------------------- T ss_conf 998765421--66014315641764576832214688998860000005------------------------------- Q ss_pred HHHHHHHCCCEEEE-EEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECC---CCCHHHHHHCCHHHHHHHHH Q ss_conf 99986305975899-972100111103679999999974100357677758999516---88844422007699999997 Q gi|254780791|r 162 ILQRISCRFPLRVI-IFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARG---GGSIEDLWHFNDEMIVRAIA 237 (529) Q Consensus 162 ~~~~~~~r~p~~~~-~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RG---GGS~eDL~~FN~e~laraI~ 237 (529) -+-+|+ -.+.+|-||..+.-|.+-|+.+- +.+-|==||.| ||.+| .=|+-.|+||.. T Consensus 142 --------~~~EVIlA~nPt~EGeaTa~Yi~~~l~~~f--------P~~~~tRiA~G~PvGg~lE---~~D~~TL~ral~ 202 (205) T TIGR00615 142 --------SVKEVILATNPTVEGEATALYIARLLRPLF--------PGVKVTRIASGLPVGGDLE---YVDEVTLARALE 202 (205) T ss_pred --------CCCEEEEECCCCCHHHHHHHHHHHHHHHCC--------CCCEEEEEEECCCCCCCHH---HHHHHHHHHHHC T ss_conf --------797788607898413479999999986238--------9837865542242255156---675799999840 No 256 >CHL00180 rbcR LysR transcriptional regulator; Provisional Probab=67.58 E-value=7.1 Score=19.04 Aligned_cols=105 Identities=17% Similarity=0.240 Sum_probs=64.4 Q ss_pred CCCCCCCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCC Q ss_conf 31026528999847842589999998630597-58999721001111036799999999741003576777589995168 Q gi|254780791|r 140 PIPFIPKIIAVITSPTGAVIRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGG 218 (529) Q Consensus 140 ~lP~~p~~i~vits~~~a~~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGG 218 (529) ..+.-.-+||+..|.....+-+++.....+|| +++.+.- +...+++..+.. ..+|+.|+. | T Consensus 91 ~~~~g~lrI~~~~s~~~~~lp~~l~~f~~~~P~v~i~l~~------~~~~~~~~~l~~----------~~~D~~i~~--~ 152 (307) T CHL00180 91 NLQGGTLIIGASQTTGTYLMPRLIGLFRQKYPQIAVQLQV------HSTRRIAWSVAN----------GQIDLAIIG--G 152 (307) T ss_pred HCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEE------CCHHHHHHHHHC----------CCEEEEEEC--C T ss_conf 1125860101040666664358899999988899727897------799999999987----------980099975--7 Q ss_pred CCHHHHHHCCHHHHHHHHHHCCCEEEEEECCC----CCCHHHHHHHCCCCCC Q ss_conf 88444220076999999974890488520577----7525898864123777 Q gi|254780791|r 219 GSIEDLWHFNDEMIVRAIANSSIPIISAIGHE----TDWTLADYAADLRAPT 266 (529) Q Consensus 219 GS~eDL~~FN~e~laraI~~~~iPVisgIGHE----~D~Tl~D~VAD~Ra~T 266 (529) .-..|+ ++.--+..++..++=+|++-+|. ...++.|+ ++.++-+ T Consensus 153 ~~~~~~---~~~l~~~~l~~~~~~lv~~~~hpla~~~~v~~~dL-~~~~~i~ 200 (307) T CHL00180 153 EVPTEL---KDNLQVTPYAEDELALILPKSHPFAKLKKIQKEDL-YRLRFIA 200 (307) T ss_pred CCCCCC---CCCEEEEEEECCCEEEEECCCCHHHCCCCCCHHHH-HCCCEEE T ss_conf 788666---78549999432427999738980223999999998-1798487 No 257 >COG3199 Predicted inorganic polyphosphate/ATP-NAD kinase [General function prediction only] Probab=67.46 E-value=9.2 Score=18.05 Aligned_cols=103 Identities=21% Similarity=0.319 Sum_probs=63.9 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHH Q ss_conf 89998478425899999986305975899972100111103679999999974100357677758999516888444220 Q gi|254780791|r 147 IIAVITSPTGAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWH 226 (529) Q Consensus 147 ~i~vits~~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~ 226 (529) .+-.+|.|.-.+..-+ ....++++++-|.-..- ++++..-+.+.+.+.... +|+|+.+=|-|-.-| T Consensus 50 ~~~flt~p~~mG~~~~---~~~~~~~~v~~~~~~~~-~tTa~DT~~~~r~~~~~g-------VdlIvfaGGDGTarD--- 115 (355) T COG3199 50 DVEFLTPPGPMGESLA---EASGFKYRVIRFQESTP-RTTAEDTINAVRRMVERG-------VDLIVFAGGDGTARD--- 115 (355) T ss_pred CEEEEECCCCCCHHHH---HHHCCCCEEEEECCCCC-CCCHHHHHHHHHHHHHCC-------CEEEEEECCCCCHHH--- T ss_conf 4399827864325688---76067413776404678-763799999999998659-------619999578852999--- Q ss_pred CCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHCCCCCCCHHHHHHHC Q ss_conf 0769999999748904885205777525898864123777214567633 Q gi|254780791|r 227 FNDEMIVRAIANSSIPIISAIGHETDWTLADYAADLRAPTPTGAAEMAV 275 (529) Q Consensus 227 FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~VAD~Ra~TPTaAAElav 275 (529) ++.++ ...+||+ ||.--+ -+++-+=|-+|++||.++. T Consensus 116 -----Va~av-~~~vPvL-GipaGv-----k~~SgvfA~~P~~aa~l~~ 152 (355) T COG3199 116 -----VAEAV-GADVPVL-GIPAGV-----KNYSGVFALSPEDAARLLG 152 (355) T ss_pred -----HHHHC-CCCCCEE-EECCCC-----CEECCCCCCCHHHHHHHHH T ss_conf -----98532-6788667-413565-----0203520148277899999 No 258 >PRK10867 signal recognition particle protein; Provisional Probab=66.96 E-value=9.3 Score=17.97 Aligned_cols=43 Identities=23% Similarity=0.421 Sum_probs=23.3 Q ss_pred CCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEE------ECCCCCHH Q ss_conf 975899972100111103679999999974100357677758999------51688844 Q gi|254780791|r 170 FPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIIL------ARGGGSIE 222 (529) Q Consensus 170 ~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii------~RGGGS~e 222 (529) -|-++++.--.+.|.+|... +.+ |+..- .+|-||| +||||++. T Consensus 213 ~P~e~llV~Da~~GQ~a~~~-a~~---F~~~~------~~~gvIlTKlDgdarGG~alS 261 (453) T PRK10867 213 NPVETLFVVDAMTGQDAANT-AKA---FNEAL------PLTGVVLTKVDGDARGGAALS 261 (453) T ss_pred CCCCEEEEEECCCCHHHHHH-HHH---HHHHC------CCCEEEEECCCCCCCCHHHHH T ss_conf 78713797432235668999-999---99855------987078750467876138989 No 259 >PTZ00314 inosine-5'-monophosphate dehydrogenase; Provisional Probab=66.78 E-value=9.4 Score=17.94 Aligned_cols=71 Identities=24% Similarity=0.389 Sum_probs=49.2 Q ss_pred HHHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHH Q ss_conf 2589999998630597-589997210011110367999999997410035767775899951688844422007699999 Q gi|254780791|r 156 GAVIRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVR 234 (529) Q Consensus 156 ~a~~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~lar 234 (529) +...-|+++.++..|| +.|+ =|+-|..+-+..|..+ ..|.|-++-|+||+ T Consensus 263 s~~v~~~ik~ik~~~p~v~vI------aGNVaT~~~a~~Li~a----------GAD~vkVGiGpGSi------------- 313 (499) T PTZ00314 263 SIYQIDFIKWIKSTYPHLEVI------AGNVVTQDQAKNLIDA----------GADGIRIGMGSGSI------------- 313 (499) T ss_pred CHHHHHHHHHHHHHCCCCCEE------EEEECHHHHHHHHHHC----------CCCEEEECCCCCCC------------- T ss_conf 278999999988527988467------6433109999999974----------99879975358855------------- Q ss_pred HHHHCCCEEEEEECCCCCCHHHHHH Q ss_conf 9974890488520577752589886 Q gi|254780791|r 235 AIANSSIPIISAIGHETDWTLADYA 259 (529) Q Consensus 235 aI~~~~iPVisgIGHE~D~Tl~D~V 259 (529) |.+.+++|||--.=+.+.|.+ T Consensus 314 ----CTTR~v~GvGvPq~tAv~~~a 334 (499) T PTZ00314 314 ----CTTQEVCAVGRPQATAVYKVA 334 (499) T ss_pred ----CCCCCCCCCCCCHHHHHHHHH T ss_conf ----104643466786056799999 No 260 >TIGR03521 GldG gliding-associated putative ABC transporter substrate-binding component GldG. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldG is a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Knockouts of GldG abolish the gliding phenotype. GldG, along with GldA and GldF are believed to compose an ABC transporter and are observed as an operon. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=66.24 E-value=9.6 Score=17.86 Aligned_cols=149 Identities=17% Similarity=0.228 Sum_probs=69.6 Q ss_pred CCCCCHHHHHHHHHHHHHHCCC-CEEEEEEE---------------CCCCCCCCCCEEEEEEECCCCEEEEE--EECCCC Q ss_conf 9862299999999999740017-18999997---------------05435688862799987489479999--973521 Q gi|254780791|r 12 HPEYSVSELSYHLKHIVESNLS-HVCVRGEI---------------SGYRGIHSSGHAYFSLKDNHSRIDAI--IWKGTL 73 (529) Q Consensus 12 ~~~~svs~l~~~i~~~l~~~~~-~~~v~gEi---------------s~~~~~~~sGH~Yf~lkd~~a~i~~~--~~~~~~ 73 (529) .--||+|+-+..+=..|+.... .|+..|++ ..|+. .|||+-|+..|....-... ...... T Consensus 30 ~k~yTLS~~T~~iL~~L~~pv~I~vf~s~~~p~~~~~l~~~i~~lLeeY~~--~s~~I~~efiDP~~~~~~~~~~~~~~l 107 (552) T TIGR03521 30 DKRYTLSPASKEVVKKLDDPVSIDIFLDGELPADFRRLQKETRQLLEEFAA--YNPNIKFRFVNPLEEEDEQGEEILDSL 107 (552) T ss_pred CCCEECCHHHHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHH--HCCCEEEEEECCCCCCCHHHHHHHHHH T ss_conf 877325888999998289987999998688873678999999999999997--589869999689988405678888789 Q ss_pred CCCCCCC------CCCC---EEEEEEEEEEECCCCEEEEEEEEEEEC-C---CCHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 0586681------4598---899999966752884379999971016-8---0079999999999765401226100163 Q gi|254780791|r 74 NKIEFLP------EEGI---EFLVIGKITTFPGSSKYQIIIESLIPS-G---SGTLLTALEKRKKKLLEEGLFSDQHKNP 140 (529) Q Consensus 74 ~~~~~~~------~~G~---~v~~~g~~~~y~~~g~~ql~v~~i~~~-g---~G~l~~~~e~lk~~L~~eGlfd~~~k~~ 140 (529) ......| ++|. +++.-|-+..|..+ .-.|.-+... | ...+....+.+ |.-|...-++= T Consensus 108 ~~~Gi~p~~~~~~e~~~~~~~~if~~~~i~y~~~---~~~i~~l~~~~~~~~~~~v~~si~~l------Ey~lt~aI~kl 178 (552) T TIGR03521 108 AQYGIKPANVTVNENGKQSQKLIFPWAIVSYRGK---ETGVPLLKNILGASPEERVNQSIQNL------EYEFADAISKL 178 (552) T ss_pred HHCCCCCCEEEEECCCCEEEEEECCCEEEEECCC---EEECCCCCCCCCCCHHHHHHHHHHHH------HHHHHHHHHHH T ss_conf 8659886135773389556788514305650685---05412223544557466666666647------99999999997 Q ss_pred CCCCCCEEEEEECCCH---HHHHHHHHHHHHCCC Q ss_conf 1026528999847842---589999998630597 Q gi|254780791|r 141 IPFIPKIIAVITSPTG---AVIRDILQRISCRFP 171 (529) Q Consensus 141 lP~~p~~i~vits~~~---a~~~D~~~~~~~r~p 171 (529) .-.-.++||++++.+- .-+.||...+++.|- T Consensus 179 ~~~~~~~I~~l~GhgE~~~~~~~~~~~~L~~~y~ 212 (552) T TIGR03521 179 INPREKRIAVLKGNGELADLQIADLVSTLKEYYF 212 (552) T ss_pred HCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCC T ss_conf 2657856999938997872036799999875132 No 261 >PRK02645 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=66.23 E-value=9.6 Score=17.86 Aligned_cols=28 Identities=36% Similarity=0.697 Sum_probs=12.6 Q ss_pred CCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEE Q ss_conf 758999516888444220076999999974890488 Q gi|254780791|r 209 PDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPII 244 (529) Q Consensus 209 ~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVi 244 (529) +|++|..=|-|.+- ..+|.++.+.+||+ T Consensus 58 ~Dlvi~lGGDGT~L--------~aar~~~~~~iPil 85 (304) T PRK02645 58 PDLAIVLGGDGTVL--------AAARHLAPHDIPIL 85 (304) T ss_pred CCEEEEECCCHHHH--------HHHHHHCCCCCCEE T ss_conf 88999978688999--------99998542699889 No 262 >pfam01174 SNO SNO glutamine amidotransferase family. This family and its amidotransferase domain was first described in. It is predicted that members of this family are involved in the pyridoxine biosynthetic pathway, based on the proximity and co-regulation of the corresponding genes and physical interaction between the members of pfam01174 and pfam01680. Probab=66.12 E-value=6.5 Score=19.39 Aligned_cols=61 Identities=21% Similarity=0.293 Sum_probs=31.4 Q ss_pred CCCCCCHHHHHHHHHHHHHHHC----CCCCCCCCCEEEEECCCCCHHHHHHCCHH-----HHHHHHHHCCCEEE Q ss_conf 0011110367999999997410----03576777589995168884442200769-----99999974890488 Q gi|254780791|r 180 KVQGDECPKEIANAILQLNTLK----EGRTCPRPDIIILARGGGSIEDLWHFNDE-----MIVRAIANSSIPII 244 (529) Q Consensus 180 ~vQG~~a~~~i~~ai~~~~~~~----~~~~~~~~D~iii~RGGGS~eDL~~FN~e-----~laraI~~~~iPVi 244 (529) ++|| .-.|.++||+++..-. +.....+.|.|||= |||| .+.+-..+. .|...|.+...|++ T Consensus 3 alqG--Nl~sv~nal~~lG~e~~~v~~p~di~~ad~LILP-Gg~s-~~~~~ll~~~gl~~~l~e~i~~~~kPiL 72 (188) T pfam01174 3 ALQG--AVEEHEEAIKKCGAENKTVKRPEDLAQCDALIIP-GGES-TAMSLLAKRYGFYEPLYEFVHNPNKPIW 72 (188) T ss_pred CCCC--CHHHHHHHHHHCCCCEEEECCHHHHHCCCEEEEC-CCCH-HHHHHHHHHCCCHHHHHHHHHHCCCCEE T ss_conf 3235--8999999999869987996899999408989988-9887-9999999986888999999980799663 No 263 >pfam02729 OTCace_N Aspartate/ornithine carbamoyltransferase, carbamoyl-P binding domain. Probab=65.95 E-value=9.7 Score=17.82 Aligned_cols=85 Identities=18% Similarity=0.140 Sum_probs=44.1 Q ss_pred CCCCEEEEEECCCHHH-HHHHHHHHHHCCCEEEEEE-ECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC Q ss_conf 2652899984784258-9999998630597589997-2100111103679999999974100357677758999516888 Q gi|254780791|r 143 FIPKIIAVITSPTGAV-IRDILQRISCRFPLRVIIF-PVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGS 220 (529) Q Consensus 143 ~~p~~i~vits~~~a~-~~D~~~~~~~r~p~~~~~~-p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS 220 (529) .-.+.|+.+=-+.+-= --=| -....+-+..++.+ +...|- +-.++|...++.++.+ +| +|++|. T Consensus 35 l~gk~i~~lF~epSTRTR~SF-e~A~~~LGg~~i~l~~~~s~~-~kgEsi~Dt~~vls~~--------~D-~iv~R~--- 100 (140) T pfam02729 35 LRGKTLALLFFEPSTRTRLSF-EAAAKRLGGHVIYLDPGSSSL-GKGESLKDTARVLSRY--------VD-AIVIRH--- 100 (140) T ss_pred CCCCEEEEEECCCCCCCEEEH-HHHHHHCCCCEEECCCCCCCC-CCCCCHHHHHHHHHHH--------CC-EEEEEC--- T ss_conf 789889998537887411220-768986698468736333557-7884799999999985--------99-999989--- Q ss_pred HHHHHHCCHHHHHHHHHHCCCEEEEEE Q ss_conf 444220076999999974890488520 Q gi|254780791|r 221 IEDLWHFNDEMIVRAIANSSIPIISAI 247 (529) Q Consensus 221 ~eDL~~FN~e~laraI~~~~iPVisgI 247 (529) +++..+-.....+++|||.|- T Consensus 101 ------~~~~~~~~~a~~~~vPVINa~ 121 (140) T pfam02729 101 ------PSHGALEELAKYSSVPVINAG 121 (140) T ss_pred ------CCCHHHHHHHHHCCCCEEECC T ss_conf ------961169999986897889899 No 264 >CHL00197 carA carbamoyl-phosphate synthase arginine-specific small subunit; Provisional Probab=65.92 E-value=9.7 Score=17.81 Aligned_cols=92 Identities=27% Similarity=0.421 Sum_probs=59.8 Q ss_pred CCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEE Q ss_conf 10016310265289998478425899999986305975899972100111103679999999974100357677758999 Q gi|254780791|r 135 DQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIIL 214 (529) Q Consensus 135 ~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii 214 (529) ...+++.|..+.+|+||-= -+-+-|+|.|.+|. ++|.++|... .+.+| ..+ +||-|.| T Consensus 184 ~~~~~~~~~~~~~VvviD~---GiK~nILr~L~~~g-~~v~VvP~~~----~~~~I-------~~~-------~PDGIfL 241 (383) T CHL00197 184 ADNKRPHSSYPLRIIVIDF---GVKYNILRRLKSFG-CEITIVPATS----TYQDI-------MAY-------QPDGILL 241 (383) T ss_pred CCCCCCCCCCCCEEEEEEC---CCCHHHHHHHHHCC-CEEEEECCCC----CCCHH-------HCC-------CCCEEEE T ss_conf 4344458888745999967---77188999999789-9899989998----51134-------256-------8888997 Q ss_pred ECCCCCHHHHHHCCHHHHHHHHH-HCCCEEEE-EECCC Q ss_conf 51688844422007699999997-48904885-20577 Q gi|254780791|r 215 ARGGGSIEDLWHFNDEMIVRAIA-NSSIPIIS-AIGHE 250 (529) Q Consensus 215 ~RGGGS~eDL~~FN~e~laraI~-~~~iPVis-gIGHE 250 (529) -=|=|-.+++- ..-..+|.+. ..++||.- ..||+ T Consensus 242 SNGPGDP~~~~--~~i~~i~~li~~~~iPifGICLGHQ 277 (383) T CHL00197 242 SNGPGDPSTVH--YGIKTVKKLIKKANIPIFGICMGHQ 277 (383) T ss_pred CCCCCCHHHHH--HHHHHHHHHHHCCCCCEEEECHHHH T ss_conf 89999978989--9999999999648997899558899 No 265 >cd06274 PBP1_FruR Ligand binding domain of DNA transcription repressor specific for fructose (FruR) and its close homologs. Ligand binding domain of DNA transcription repressor specific for fructose (FruR) and its close homologs, all of which are a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to members of the type I periplasmic binding protein superfamily. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor Probab=65.81 E-value=9.8 Score=17.80 Aligned_cols=22 Identities=9% Similarity=0.289 Sum_probs=13.7 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHC Q ss_conf 888898622999999999997400 Q gi|254780791|r 8 NSLDHPEYSVSELSYHLKHIVESN 31 (529) Q Consensus 8 ~~~~~~~~svs~l~~~i~~~l~~~ 31 (529) |+.++|- -+++...|...+... T Consensus 7 pdi~npf--f~~l~~~ie~~~~~~ 28 (264) T cd06274 7 PDLENRS--FARIAKRLEALARER 28 (264) T ss_pred CCCCCHH--HHHHHHHHHHHHHHC T ss_conf 7987879--999999999999986 No 266 >PRK08321 naphthoate synthase; Validated Probab=65.81 E-value=9.8 Score=17.80 Aligned_cols=18 Identities=17% Similarity=0.134 Sum_probs=14.2 Q ss_pred HHHHHHHHHCCCEEEEEE Q ss_conf 999999974890488520 Q gi|254780791|r 230 EMIVRAIANSSIPIISAI 247 (529) Q Consensus 230 e~laraI~~~~iPVisgI 247 (529) ..+.+.|..||.|||.+| T Consensus 125 ~~~~~~i~~~~kPvIAaV 142 (302) T PRK08321 125 LEVQRLIRFMPKVVIAVV 142 (302) T ss_pred HHHHHHHHCCCCCEEEEE T ss_conf 999999962999999998 No 267 >pfam00117 GATase Glutamine amidotransferase class-I. Probab=65.79 E-value=9.8 Score=17.79 Aligned_cols=41 Identities=32% Similarity=0.551 Sum_probs=34.5 Q ss_pred CCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEE-EEECCC Q ss_conf 7758999516888444220076999999974890488-520577 Q gi|254780791|r 208 RPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPII-SAIGHE 250 (529) Q Consensus 208 ~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVi-sgIGHE 250 (529) ++|.|||.=|-|+..|.. +...+++.+.+..+||+ .--||. T Consensus 41 ~~d~iilspGpg~p~~~~--~~~~~i~~~~~~~~PiLGIClG~Q 82 (187) T pfam00117 41 NPDGIIISPGPGSPGDAG--GAIEAIKELRENKIPILGICLGHQ 82 (187) T ss_pred CCCEEEECCCCCCHHHCC--CHHHHHHHHHHCCCCEEEEEHHHH T ss_conf 999899919986110134--589999999977998999988899 No 268 >PRK09428 pssA phosphatidylserine synthase; Provisional Probab=65.75 E-value=9.8 Score=17.79 Aligned_cols=73 Identities=22% Similarity=0.315 Sum_probs=34.1 Q ss_pred HHHHHHHHHHHHC---CC-----EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCC Q ss_conf 5899999986305---97-----589997210011110367999999997410035767775899951688844422007 Q gi|254780791|r 157 AVIRDILQRISCR---FP-----LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFN 228 (529) Q Consensus 157 a~~~D~~~~~~~r---~p-----~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN 228 (529) ..++.|...+++. +. -.+.++|..-=|.. ..+-..|...-... -+-+.|+=| .|| T Consensus 206 ~~~~~~r~~l~~~~y~~~~~~~~~~~~itPLvglG~~--n~ln~~i~~Ll~~a-------~~~l~i~Tp--------YFN 268 (444) T PRK09428 206 NDIRLFRQRLRDASYQFQGQANNDQLSVTPLVGLGKK--NLLNKTIFHLMASA-------EQKLTICTP--------YFN 268 (444) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCC--CHHHHHHHHHHHCC-------CCCEEEEEC--------CCC T ss_conf 6799999998753211566677786216632226875--35789999998556-------770799856--------878 Q ss_pred -HHHHHHHHHH-----CCCEEEEE Q ss_conf -6999999974-----89048852 Q gi|254780791|r 229 -DEMIVRAIAN-----SSIPIISA 246 (529) Q Consensus 229 -~e~laraI~~-----~~iPVisg 246 (529) .+.+.+.+.. ..+=||+| T Consensus 269 ~p~~l~~~l~~~l~~g~~v~Iiva 292 (444) T PRK09428 269 LPAILVRNIIRLLRRGKKVEIIVG 292 (444) T ss_pred CCHHHHHHHHHHHHCCCCEEEEEC T ss_conf 989999999999857996499953 No 269 >PRK12723 flagellar biosynthesis regulator FlhF; Provisional Probab=65.63 E-value=9.8 Score=17.77 Aligned_cols=68 Identities=19% Similarity=0.210 Sum_probs=33.0 Q ss_pred CEEEEEECCC-HHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHH Q ss_conf 2899984784-258999999863059758999721001111036799999999741003576777589995168884442 Q gi|254780791|r 146 KIIAVITSPT-GAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDL 224 (529) Q Consensus 146 ~~i~vits~~-~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL 224 (529) .+||+||.-+ =.|=.|-|++.++.-. ++|-....+.++..++.... .+|+|+|==-|=|.-|. T Consensus 207 ~~V~lit~DtyRigAveQLktya~il~-------vp~~v~~~~~dl~~~l~~~~---------~~D~IlIDTAGrs~~d~ 270 (388) T PRK12723 207 LNIKIITIDNYRIGAKKQIQTYGDIMG-------IPVKAIESFKDLKEEITQSK---------DFDLVLIDTIGKSPKDF 270 (388) T ss_pred CCEEEEEECCCCHHHHHHHHHHHHHHC-------CCEEEECCHHHHHHHHHHHC---------CCCEEEEECCCCCCCCH T ss_conf 737999807875889999999999978-------80698578899999999724---------99999995899885689 Q ss_pred HHCCH Q ss_conf 20076 Q gi|254780791|r 225 WHFND 229 (529) Q Consensus 225 ~~FN~ 229 (529) +-+++ T Consensus 271 ~~~~e 275 (388) T PRK12723 271 MKLAE 275 (388) T ss_pred HHHHH T ss_conf 99999 No 270 >PRK00476 aspS aspartyl-tRNA synthetase; Validated Probab=65.17 E-value=10 Score=17.70 Aligned_cols=80 Identities=15% Similarity=0.209 Sum_probs=54.5 Q ss_pred HHCC-CCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCC--CCCCCCCCEEEEEEEEEE--------ECC Q ss_conf 4001-71899999705435688862799987489479999973521058--668145988999999667--------528 Q gi|254780791|r 29 ESNL-SHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKI--EFLPEEGIEFLVIGKITT--------FPG 97 (529) Q Consensus 29 ~~~~-~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~--~~~~~~G~~v~~~g~~~~--------y~~ 97 (529) +.+. ..|.|.|=|.+.|. .++=++++|.|..+.|.|++=+...... --.+..|+-|.|.|.|-. -.+ T Consensus 13 ~~~~g~~V~v~GwV~~~R~--~Gkl~F~~LrD~~G~iQiv~~~~~~~~~~~~~~l~~e~iv~V~G~v~~rp~~~~n~~~~ 90 (587) T PRK00476 13 ESHVGQTVTLCGWVHRRRD--HGGVIFIDLRDREGIVQVVFDPDDAEAFAVAESLRSEYVIQVTGKVRARPEGTVNPNLA 90 (587) T ss_pred HHHCCCEEEEEEEEEEEEC--CCCCEEEEEEECCCCEEEEECCCCHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCC T ss_conf 4769898999785753664--79829999983897889999188778999996699987899998999357656687788 Q ss_pred CCEEEEEEEEEEE Q ss_conf 8437999997101 Q gi|254780791|r 98 SSKYQIIIESLIP 110 (529) Q Consensus 98 ~g~~ql~v~~i~~ 110 (529) .|.+.+.+..++. T Consensus 91 tGeiei~~~~i~i 103 (587) T PRK00476 91 TGEIEVLASELEI 103 (587) T ss_pred CCCEEEEEEEEEE T ss_conf 8728999708999 No 271 >pfam10649 DUF2478 Protein of unknown function (DUF2478). This is a family of hypothetical bacterial proteins found in the vicinity of Molybdenum ABC transporter ATP-binding gene-products MobA MobB and MobC. However the function could not be confirmed. This family appears to belong to the P-loop superfamily by alignment to pfam03266. However, the characteristic P-loop sequence motif appears to have diverged beyond recognition in this family. Probab=65.16 E-value=10 Score=17.70 Aligned_cols=58 Identities=21% Similarity=0.335 Sum_probs=37.8 Q ss_pred CCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHCCCCCCC Q ss_conf 775899951688844422007699999997489048852057775258988641237772 Q gi|254780791|r 208 RPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYAADLRAPTP 267 (529) Q Consensus 208 ~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~VAD~Ra~TP 267 (529) .+|++||-|=|.-..+=-.|-++ ++.|+ ..-|||+|+|...-=-.-.+|.-++-..-| T Consensus 93 g~dLlIvNkFGk~Ea~G~Glr~~-i~~Al-~~giPvLt~V~~~~~~~W~~Fagg~a~~L~ 150 (159) T pfam10649 93 GADLLIVNKFGKQEAEGRGLRAE-IAEAL-AAGIPVLTAVPPRNLEAWRAFAGGLAEELP 150 (159) T ss_pred CCCEEEECCCHHHHHCCCCHHHH-HHHHH-HCCCCEEEEECHHHHHHHHHHCCCCCCCCC T ss_conf 99899981424666149976999-99999-769988998898888999985277676579 No 272 >COG0568 RpoD DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) [Transcription] Probab=65.12 E-value=10 Score=17.69 Aligned_cols=137 Identities=23% Similarity=0.276 Sum_probs=66.9 Q ss_pred ECCCCEEEEEEEEEEECC-CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHH----HHC Q ss_conf 528843799999710168-007999999999976540122610016310265289998478425899999986----305 Q gi|254780791|r 95 FPGSSKYQIIIESLIPSG-SGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRI----SCR 169 (529) Q Consensus 95 y~~~g~~ql~v~~i~~~g-~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~----~~r 169 (529) |-.+| |-..++.-.| +| |.++.+ -||+.+--.+-.| |.+=|...| .+. T Consensus 122 Y~~rG---L~~~DLIQEGniG-LmkAVe----------kFdp~rG~kFsTY-------------A~wWIrqaI~raI~~q 174 (342) T COG0568 122 YTGRG---LPFLDLIQEGNIG-LMKAVE----------KFDPEKGFKFSTY-------------ATWWIRQAITRAIADQ 174 (342) T ss_pred HHCCC---CCHHHHHHCCCHH-HHHHHH----------HCCCCCCCCCHHH-------------HHHHHHHHHHHHHHHH T ss_conf 63379---8688988525099-999999----------6296668721109-------------9999999999999974 Q ss_pred CC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEC-CCCCHHHHHHCCHHHHHHHHHHCCCEEEEEE Q ss_conf 97-589997210011110367999999997410035767775899951-6888444220076999999974890488520 Q gi|254780791|r 170 FP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILAR-GGGSIEDLWHFNDEMIVRAIANSSIPIISAI 247 (529) Q Consensus 170 ~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~R-GGGS~eDL~~FN~e~laraI~~~~iPVisgI 247 (529) |- +.+ |+ +-.+.+.-|.++.+.-...-...++.--|++ =|++.+ +..=...+..-|+-.=+-| T Consensus 175 ~rtIRi---pv------h~~e~~nkl~r~~r~l~q~~~r~p~~eeia~~l~~~~~------~V~~m~~~~~~~~SLd~~i 239 (342) T COG0568 175 ARTIRI---PV------HQVELINKLRRVKRELLQELGREPTPEEIAEELGVSPD------KVREMLKRASEPISLDTPI 239 (342) T ss_pred CCHHHH---HH------HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCHH------HHHHHHHHCCCCCCCCCCC T ss_conf 234657---69------99999999999999999984589979999999599999------9999998653676767848 Q ss_pred CCCCCCHHHHHHHCCCCCCCHHHHHH Q ss_conf 57775258988641237772145676 Q gi|254780791|r 248 GHETDWTLADYAADLRAPTPTGAAEM 273 (529) Q Consensus 248 GHE~D~Tl~D~VAD~Ra~TPTaAAEl 273 (529) |++-|..+.||+.|-+...|...++. T Consensus 240 g~ded~~l~d~leD~~~~~p~~~~~~ 265 (342) T COG0568 240 GDDEDSELGDFLEDDKSVSPEDAVER 265 (342) T ss_pred CCCCCCHHHHHHHCCCCCCHHHHHHH T ss_conf 88852078887642687898999998 No 273 >PRK07112 polyketide biosynthesis enoyl-CoA hydratase; Validated Probab=64.93 E-value=10 Score=17.67 Aligned_cols=38 Identities=13% Similarity=0.077 Sum_probs=24.3 Q ss_pred HHHHHHHHCCCEEEEEE-CCCCC--CHHHHHHHCCCCCCCHH Q ss_conf 99999974890488520-57775--25898864123777214 Q gi|254780791|r 231 MIVRAIANSSIPIISAI-GHETD--WTLADYAADLRAPTPTG 269 (529) Q Consensus 231 ~laraI~~~~iPVisgI-GHE~D--~Tl~D~VAD~Ra~TPTa 269 (529) .+..++..+|.|||.+| |+=.= .. +=+.+|+|..+|+| T Consensus 83 ~~~~~l~~~p~pvIAaV~G~a~GgG~~-lal~cD~~ia~~~A 123 (251) T PRK07112 83 GLWHRLATGPYITIAHVQGKVNAGGIG-FVAASDIAIADENV 123 (251) T ss_pred HHHHHHHHCCCCEEEEECCEEEEHHHH-HHHHCCEEEECCCC T ss_conf 999999849982999961656661567-76521502457998 No 274 >cd04243 AAK_AK-HSDH-like AAK_AK-HSDH-like: Amino Acid Kinase Superfamily (AAK), AK-HSDH-like; this family includes the N-terminal catalytic domain of aspartokinase (AK) of the bifunctional enzyme AK- homoserine dehydrogenase (HSDH). These aspartokinases are found in such bacteria as E. coli (AKI-HSDHI, ThrA and AKII-HSDHII, MetL) and in higher plants (Z. mays AK-HSDH). AK and HSDH are the first and third enzymes in the biosynthetic pathway of the aspartate family of amino acids. AK catalyzes the phosphorylation of Asp to P-aspartyl phosphate. HSDH catalyzes the NADPH-dependent conversion of Asp 3-semialdehyde to homoserine. ThrA and MetL are involved in threonine and methionine biosynthesis, respectively. In E. coli, ThrA is subject to allosteric regulation by the end product L-threonine and the native enzyme is reported to be tetrameric. As with bacteria, plant AK and HSDH are feedback inhibited by pathway end products. Maize AK-HSDH is a Thr-sensitive 180-kD enzyme. Arabidopsis AK- Probab=64.78 E-value=10 Score=17.65 Aligned_cols=41 Identities=20% Similarity=0.289 Sum_probs=26.7 Q ss_pred ECCCCCHHHHHHCCHHHHHHHHHHC----CCEEEEEECCCCCCHHHHH Q ss_conf 5168884442200769999999748----9048852057775258988 Q gi|254780791|r 215 ARGGGSIEDLWHFNDEMIVRAIANS----SIPIISAIGHETDWTLADY 258 (529) Q Consensus 215 ~RGGGS~eDL~~FN~e~laraI~~~----~iPVisgIGHE~D~Tl~D~ 258 (529) ==||-|+.|--+|.. +++-|.+. .+=|+||.|.-||.- .++ T Consensus 5 KFGGtSv~~~~~i~~--v~~iI~~~~~~~~vVVVSA~~gvTd~L-~~~ 49 (293) T cd04243 5 KFGGTSVASAERIRR--VADIIKSRASSPVLVVVSALGGVTNRL-VAL 49 (293) T ss_pred EECCCCCCCHHHHHH--HHHHHHHCCCCCEEEEECCCCCCHHHH-HHH T ss_conf 878123699999999--999998377989599981998868999-999 No 275 >cd04104 p47_IIGP_like p47 (47-kDa) family. The p47 GTPase family consists of several highly homologous proteins, including IGTP, TGTP/Mg21, IRG-47, GTPI, LRG-47, and IIGP1. They are found in higher eukaryotes where they play a role in immune resistance against intracellular pathogens. p47 proteins exist at low resting levels in mouse cells, but are strongly induced by Type II interferon (IFN-gamma). ITGP is critical for resistance to Toxoplasma gondii infection and in involved in inhibition of Coxsackievirus-B3-induced apoptosis. TGTP was shown to limit vesicular stomatitis virus (VSV) infection of fibroblasts in vitro. IRG-47 is involved in resistance to T. gondii infection. LRG-47 has been implicated in resistance to T. gondii, Listeria monocytogenes, Leishmania, and mycobacterial infections. IIGP1 has been shown to localize to the ER and to the Golgi membranes in IFN-induced cells and inflamed tissues. In macrophages, IIGP1 interacts with hook3, a microtubule binding protei Probab=64.62 E-value=10 Score=17.62 Aligned_cols=86 Identities=20% Similarity=0.267 Sum_probs=56.3 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHH---------------------CC--C-E-EEEEEECCCCCCCHHHHHHHHHHHHHH Q ss_conf 528999847842589999998630---------------------59--7-5-899972100111103679999999974 Q gi|254780791|r 145 PKIIAVITSPTGAVIRDILQRISC---------------------RF--P-L-RVIIFPVKVQGDECPKEIANAILQLNT 199 (529) Q Consensus 145 p~~i~vits~~~a~~~D~~~~~~~---------------------r~--p-~-~~~~~p~~vQG~~a~~~i~~ai~~~~~ 199 (529) |-+|| ||+.+|||-.-|+|.++- +| | + ++.+|..+ |.+++. ..+=.+++. T Consensus 1 pl~ia-VtGesGaGKSSfINAlRGl~~~d~~aA~tGv~eTT~~~~~Y~~p~~pnv~lwDlP--G~Gt~~--f~~~~Yl~~ 75 (197) T cd04104 1 PLNIA-VTGESGAGKSSFINALRGVGHEEEGAAPTGVVETTMKRTPYPHPKFPNVTLWDLP--GIGSTA--FPPDDYLEE 75 (197) T ss_pred CEEEE-EECCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCEEECCC--CCCCCC--CCHHHHHHH T ss_conf 91799-9558998689999998688988777588888746778620479999987697289--999876--598999986 Q ss_pred HCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEE Q ss_conf 100357677758999516888444220076999999974890488 Q gi|254780791|r 200 LKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPII 244 (529) Q Consensus 200 ~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVi 244 (529) +. ...||++||+=.+- ..=||..++++|..+..|+. T Consensus 76 ~~----~~~yD~fiiiss~r-----f~~nd~~la~~i~~~gK~fy 111 (197) T cd04104 76 MK----FSEYDFFIIISSTR-----FSSNDVKLAKAIQCMGKKFY 111 (197) T ss_pred CC----CCCCCEEEEEECCC-----CCHHHHHHHHHHHHCCCCEE T ss_conf 58----55457899983886-----41426999999998099289 No 276 >TIGR03249 KdgD 5-dehydro-4-deoxyglucarate dehydratase. 5-dehydro-4-deoxyglucarate dehydratase not only catalyzes the dehydration of the substrate (diol to ketone + water), but causes the decarboxylation of the intermediate product to yield 2-oxoglutarate semialdehyde (2,5-dioxopentanoate). The gene for the enzyme is usually observed in the vicinity of transporters and dehydratases handling D-galactarate and D-gluconate as well as aldehyde dehydrogenases which convert the product to alpha-ketoglutarate. Probab=64.53 E-value=10 Score=17.61 Aligned_cols=25 Identities=12% Similarity=0.147 Sum_probs=14.6 Q ss_pred CCHHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 8425899999986305975899972 Q gi|254780791|r 154 PTGAVIRDILQRISCRFPLRVIIFP 178 (529) Q Consensus 154 ~~~a~~~D~~~~~~~r~p~~~~~~p 178 (529) ++-.++-++.+.+.+.-++.+++|. T Consensus 113 ~~~~~l~~~f~~ia~a~~~pi~lYn 137 (296) T TIGR03249 113 GEQEGLYAHVEAVCESTDLGVIVYQ 137 (296) T ss_pred CCHHHHHHHHHHHHHHCCCCEEEEC T ss_conf 9999999999999971599778730 No 277 >KOG3857 consensus Probab=64.30 E-value=10 Score=17.58 Aligned_cols=135 Identities=24% Similarity=0.260 Sum_probs=82.2 Q ss_pred ECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHH--HCCCEEEEEEECCCCCCCHH Q ss_conf 1680079999999999765401226100163102652899984784258999999863--05975899972100111103 Q gi|254780791|r 110 PSGSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRIS--CRFPLRVIIFPVKVQGDECP 187 (529) Q Consensus 110 ~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~--~r~p~~~~~~p~~vQG~~a~ 187 (529) --|.|-+.+.=..+| ..| .++.-+||-++=+.+.-+-+... .+..+++.+|. -||=+-.. T Consensus 52 rfG~gv~~Evg~dik----n~g-------------aKk~llvTDkni~~~~~~~~a~~~L~~~~I~~~vyD-~v~~ePtv 113 (465) T KOG3857 52 RFGKGVLAEVGDDIK----NLG-------------AKKTLLVTDKNIAKLGLVKVAQDSLEENGINVEVYD-KVQPEPTV 113 (465) T ss_pred HHCCHHHHHHHHHHH----HCC-------------CCCEEEEECCCHHHCCCHHHHHHHHHHCCCCEEEEC-CCCCCCCH T ss_conf 206116899999987----157-------------442589607774441417899999997598169724-76479862 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHH------HHHCCHH-HHHH----------HHHH-----CCCEEEE Q ss_conf 679999999974100357677758999516888444------2200769-9999----------9974-----8904885 Q gi|254780791|r 188 KEIANAILQLNTLKEGRTCPRPDIIILARGGGSIED------LWHFNDE-MIVR----------AIAN-----SSIPIIS 245 (529) Q Consensus 188 ~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eD------L~~FN~e-~lar----------aI~~-----~~iPVis 245 (529) .++-.||+.+..- .+|. .|+=||||.-| |.+=|-| +... .+-. .-||.-+ T Consensus 114 ~s~~~alefak~~-------~fDs-~vaiGGGSa~DtaKaaaL~Asn~~~eflDyvg~pigk~~~~s~p~lPLiAipTTa 185 (465) T KOG3857 114 GSVTAALEFAKKK-------NFDS-FVAIGGGSAHDTAKAAALLASNGEGEFLDYVGPPIGKVKQSSKPLLPLIAIPTTA 185 (465) T ss_pred HHHHHHHHHHHHC-------CCCE-EEEECCCCHHHHHHHHHHHHCCCCCCCHHCCCCCCCCCCCCCCCCCCEEECCCCC T ss_conf 3499999998733-------6655-8987685045568889886028876300023786666665555556347502578 Q ss_pred EECCC-CCCHHHHHHHC-------CCCCCCHHH Q ss_conf 20577-75258988641-------237772145 Q gi|254780791|r 246 AIGHE-TDWTLADYAAD-------LRAPTPTGA 270 (529) Q Consensus 246 gIGHE-~D~Tl~D~VAD-------~Ra~TPTaA 270 (529) |-|-| |-++|.||--. .|+-+||-| T Consensus 186 GTgSEtT~~AI~d~e~~k~K~gI~~k~ikP~la 218 (465) T KOG3857 186 GTGSETTRFAIIDYEELKIKMGIIDKNIKPTLA 218 (465) T ss_pred CCCCCCEEEEEECCHHHHEEEEEECCCCCCEEE T ss_conf 766444015885333330011221024564044 No 278 >KOG1384 consensus Probab=64.08 E-value=7.3 Score=18.92 Aligned_cols=32 Identities=28% Similarity=0.386 Sum_probs=28.9 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEE Q ss_conf 52899984784258999999863059758999 Q gi|254780791|r 145 PKIIAVITSPTGAVIRDILQRISCRFPLRVII 176 (529) Q Consensus 145 p~~i~vits~~~a~~~D~~~~~~~r~p~~~~~ 176 (529) +.++.||..+||+|-.+..--|+.|||.+|+. T Consensus 6 k~KVvvI~G~TGsGKSrLaVdLA~rf~~EIIN 37 (348) T KOG1384 6 KDKVVVIMGATGAGKSRLAVDLATRFPGEIIN 37 (348) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHCCCEEEC T ss_conf 73599995577777046678889757864651 No 279 >PRK09791 putative DNA-binding transcriptional regulator; Provisional Probab=64.01 E-value=9.6 Score=17.86 Aligned_cols=88 Identities=17% Similarity=0.180 Sum_probs=56.9 Q ss_pred CCCCCCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCC Q ss_conf 1026528999847842589999998630597-589997210011110367999999997410035767775899951688 Q gi|254780791|r 141 IPFIPKIIAVITSPTGAVIRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGG 219 (529) Q Consensus 141 lP~~p~~i~vits~~~a~~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGG 219 (529) ...-.-+||+..+.....+-+++.....+|| +++.+... ...+++..+.. ...|+.|...-.+ T Consensus 92 ~~~G~l~Ig~~~~~~~~~lp~~l~~f~~~~P~v~i~i~~~------~~~~~~~~l~~----------g~~D~~i~~~~~~ 155 (302) T PRK09791 92 QLAGQINIGMGASISRSLMPAVISRFHQQHPQVKVRIMEG------QLVSMINELRQ----------GELDFTINTYYQG 155 (302) T ss_pred CCCCEEEEEECHHHHHHHHHHHHHHHHHHCCCCEEEEEEC------CHHHHHHHHHC----------CCCCEEEECCCCC T ss_conf 7663399986268898711999999999887977999989------99999999987----------9977899656788 Q ss_pred CHHHHHHCCHHHHHHHHHHCCCEEEEEECCC Q ss_conf 8444220076999999974890488520577 Q gi|254780791|r 220 SIEDLWHFNDEMIVRAIANSSIPIISAIGHE 250 (529) Q Consensus 220 S~eDL~~FN~e~laraI~~~~iPVisgIGHE 250 (529) ... ++-....++..+.-+|..-||- T Consensus 156 ~~~------~~l~~~~l~~~~~~~v~~~~hp 180 (302) T PRK09791 156 PYD------HEFTFEKLLEKQFAIFCRPGHP 180 (302) T ss_pred CCC------CCEEEEEEEEEEEEEEECCCCC T ss_conf 788------7647999774015999838984 No 280 >pfam00365 PFK Phosphofructokinase. Probab=63.98 E-value=10 Score=17.53 Aligned_cols=37 Identities=14% Similarity=0.151 Sum_probs=18.1 Q ss_pred CCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCH Q ss_conf 7589995168884442200769999999748904885205777525 Q gi|254780791|r 209 PDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWT 254 (529) Q Consensus 209 ~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~T 254 (529) .|.+|++=|.||......+-+ ++++|| .||=.-+|-= T Consensus 93 Id~li~IGGd~S~~~a~~L~~--------~~~i~v-igIPkTIDND 129 (279) T pfam00365 93 IDALVVIGGDGSYTGADLLTS--------EHGFNC-VGLPGTIDND 129 (279) T ss_pred CCEEEEECCCHHHHHHHHHHH--------HCCCCE-EEEECCCCCC T ss_conf 997999669568999999999--------729978-9820345689 No 281 >cd00951 KDGDH 5-dehydro-4-deoxyglucarate dehydratase, also called 5-keto-4-deoxy-glucarate dehydratase (KDGDH), which is member of dihydrodipicolinate synthase (DHDPS) family that comprises several pyruvate-dependent class I aldolases. The enzyme is involved in glucarate metabolism, and its mechanism presumbly involves a Schiff-base intermediate similar to members of DHDPS family. While in the case of Pseudomonas sp. 5-dehydro-4-deoxy-D-glucarate is degraded by KDGDH to 2,5-dioxopentanoate, in certain species of Enterobacteriaceae it is degraded instead to pyruvate and glycerate. Probab=63.78 E-value=11 Score=17.50 Aligned_cols=39 Identities=8% Similarity=0.073 Sum_probs=23.0 Q ss_pred CCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHH Q ss_conf 84258999999863059758999721001111036799999 Q gi|254780791|r 154 PTGAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAI 194 (529) Q Consensus 154 ~~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai 194 (529) ++-.++-++.+.+.+.-++.+++|.. -|..-..+.+..| T Consensus 108 ~~~~~l~~~~~~ia~a~~lpi~lYn~--~~~~~~~~~l~~L 146 (289) T cd00951 108 APQEGLYAHVEAVCKSTDLGVIVYNR--ANAVLTADSLARL 146 (289) T ss_pred CCHHHHHHHHHHHHHHCCCCEEECCC--CCCCCCHHHHHHH T ss_conf 99999999999999846998661488--7767789999999 No 282 >cd04486 YhcR_OBF_like YhcR_OBF_like: A subfamily of OB-fold domains similar to the OB folds of Bacillus subtilis YhcR. YhcR is a sugar-nonspecific nuclease, which is active in the presence of Ca2+ and Mn2+. It cleaves RNA endonucleolytically, producing 3'-monophosphate nucleosides. YhcR appears to be the major Ca2+ activated nuclease of B. subtilis. YhcR may be localized in the cell wall. Probab=63.69 E-value=11 Score=17.49 Aligned_cols=68 Identities=19% Similarity=0.081 Sum_probs=41.3 Q ss_pred EEEEEECCCCCCCCCCEEEEEE--ECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEE Q ss_conf 9999970543568886279998--74894799999735210586681459889999996675288437999997 Q gi|254780791|r 36 CVRGEISGYRGIHSSGHAYFSL--KDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIES 107 (529) Q Consensus 36 ~v~gEis~~~~~~~sGH~Yf~l--kd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~ 107 (529) .|+|=|+.......-+.+|+-= .|.+..-+..||-.... .-.++.|+.|.+.|+|+=| .|..||.... T Consensus 1 tv~GvVTa~~~~~~~~GFyiQ~~~~d~~~~tS~gifV~~~~--~~~~~~Gd~V~vtG~V~Ey--~g~TqL~~~~ 70 (78) T cd04486 1 TVEGVVTAVFSGGGLGGFYIQDEDGDGDPATSEGIFVYTGS--GADVAVGDLVRVTGTVTEY--YGLTQLTAVS 70 (78) T ss_pred CEEEEEEEECCCCCCCEEEEECCCCCCCCCCCCEEEEECCC--CCCCCCCCEEEEEEEEEEE--CCCEEEECCC T ss_conf 92899999752898476999727899997642379998178--8888989999997899862--4858984451 No 283 >PRK13228 consensus Probab=63.59 E-value=11 Score=17.48 Aligned_cols=113 Identities=17% Similarity=0.169 Sum_probs=68.4 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEECC-CCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHH Q ss_conf 28999847842589999998630597589997210-01111036799999999741003576777589995168884442 Q gi|254780791|r 146 KIIAVITSPTGAVIRDILQRISCRFPLRVIIFPVK-VQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDL 224 (529) Q Consensus 146 ~~i~vits~~~a~~~D~~~~~~~r~p~~~~~~p~~-vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL 224 (529) -++||+||.+......++..+.-.--|.+++..-- -.++-.|.-+..|++.++-.. =++|+| +|= T Consensus 113 ~~l~ivT~~~~~~~~~~l~~~gl~~~Fd~iv~~d~v~~~KP~P~~~l~a~~~lg~~p-------~~~v~V-------gDs 178 (232) T PRK13228 113 YRLACITNKPEILAVPLLKQTGLADYFELILGGDSLPEKKPDPLPLLHACEVLGIDP-------ANMVLV-------GDS 178 (232) T ss_pred CCEEEEECCCHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHCCCC-------CCEEEE-------CCC T ss_conf 968998089689999987530233331055304336788987999999999929793-------408988-------488 Q ss_pred HHCCHHHHHHHHHHCCCEEE-EEECCCCCCHHHHHHHCCCCCCCHHHHHHHCCC Q ss_conf 20076999999974890488-520577752589886412377721456763323 Q gi|254780791|r 225 WHFNDEMIVRAIANSSIPII-SAIGHETDWTLADYAADLRAPTPTGAAEMAVPV 277 (529) Q Consensus 225 ~~FN~e~laraI~~~~iPVi-sgIGHE~D~Tl~D~VAD~Ra~TPTaAAElavp~ 277 (529) .+|-.=|+ +.-+++| ..=|+-....|.+.=||.-..+++..-+++++. T Consensus 179 --~~di~aA~---~AGi~~i~V~~G~~~~~~l~~~~ad~vv~~l~el~~lL~~~ 227 (232) T PRK13228 179 --ANDVIAAR---AAGCPVFCVPYGYADGHGLAALDCDAIVDSLPEAYALLTPQ 227 (232) T ss_pred --HHHHHHHH---HCCCEEEEECCCCCCHHHHHHCCCCEEECCHHHHHHHHHHC T ss_conf --87999999---95993999879999977898789999999999999997516 No 284 >cd07044 CofD_YvcK Family of CofD-like proteins and proteins related to YvcK. CofD is a 2-phospho-L-lactate transferase that catalyzes the last step in the biosynthesis of coenzyme F(420)-0 (F(420) without polyglutamate) by transferring the lactyl phosphate moiety of lactyl(2)diphospho-(5')guanosine (LPPG) to 7,8-didemethyl-8-hydroxy-5-deazariboflavin ribitol (F0). F420 is a hydride carrier, important for energy metabolism of methanogenic archaea, as well as for the biosynthesis of other natural products, like tetracycline in Streptomyces. F420 and some of its precursors are also utilized as cofactors for enzymes, like DNA photolyase in Mycobacterium tuberculosis. YvcK from Bacillus subtilis is a member of a family of mostly uncharacterized proteins and has been proposed to play a role in carbon metabolism, since its function is essential for growth on intermediates of the Krebs cycle and pentose phosphate pathway. Both families appear to have a conserved phosphate binding site, but ha Probab=63.32 E-value=11 Score=17.44 Aligned_cols=109 Identities=19% Similarity=0.202 Sum_probs=61.9 Q ss_pred CCCCCCCCEEEEEECCCHHHHH-HHHHHHHHCCCE-EEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEC Q ss_conf 6310265289998478425899-999986305975-89997210011110367999999997410035767775899951 Q gi|254780791|r 139 NPIPFIPKIIAVITSPTGAVIR-DILQRISCRFPL-RVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILAR 216 (529) Q Consensus 139 ~~lP~~p~~i~vits~~~a~~~-D~~~~~~~r~p~-~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~R 216 (529) .|.-.-|-++...+....-... ..+. ..+-++ .+.+-|+ ....+.++.++||+. -|+||++= T Consensus 119 lP~t~~~v~L~a~l~dG~~v~Ge~~i~--~~~~~I~~v~l~~~--~~~~~~~~~l~AI~~------------ADlIvlgP 182 (309) T cd07044 119 LPSSDDPVSLHAEXEDGTIVHGESFIP--KGEKKIDRVFLTPV--DEASPSREVLEAIEK------------ADNIVIGP 182 (309) T ss_pred ECCCCCCEEEEEEECCCCEEEEEEHHH--CCCCCCEEEEECCC--CCCCCCHHHHHHHHH------------CCEEEECC T ss_conf 827799647999977998898677122--38987217995378--879889999999986------------89999879 Q ss_pred CCCCHHHHHHCCHHHHHHHHHHCCCEEE------EEECCCCCCHHHHHHHCCC Q ss_conf 6888444220076999999974890488------5205777525898864123 Q gi|254780791|r 217 GGGSIEDLWHFNDEMIVRAIANSSIPII------SAIGHETDWTLADYAADLR 263 (529) Q Consensus 217 GGGS~eDL~~FN~e~laraI~~~~iPVi------sgIGHE~D~Tl~D~VAD~R 263 (529) |-==..=+-.+-=..++.||.+++-|+| |-=|.-..+|++|+|.-+. T Consensus 183 GSlyTSIiPnLlv~gI~eAi~~s~a~kvyi~NlmtqpGET~g~~~~dhv~ai~ 235 (309) T cd07044 183 GSLYTSILPNISVPGIREALKKTXAKKVYVSNIXTQPGETDEYTSSDHAEALQ 235 (309) T ss_pred CCCHHHHCCCCCCHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCHHHHHHHHH T ss_conf 84477766412555699999975998899807899998769989899999999 No 285 >cd00881 GTP_translation_factor GTP translation factor family. This family consists primarily of translation initiation, elongation, and release factors, which play specific roles in protein translation. In addition, the family includes Snu114p, a component of the U5 small nuclear riboprotein particle which is a component of the spliceosome and is involved in excision of introns, TetM, a tetracycline resistance gene that protects the ribosome from tetracycline binding, and the unusual subfamily CysN/ATPS, which has an unrelated function (ATP sulfurylase) acquired through lateral transfer of the EF1-alpha gene and development of a new function. Probab=63.24 E-value=11 Score=17.43 Aligned_cols=69 Identities=20% Similarity=0.185 Sum_probs=44.9 Q ss_pred EEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCE--EEEEECCCHHHHHHHHHHHHHCCC Q ss_conf 79999971016800799999999997654012261001631026528--999847842589999998630597 Q gi|254780791|r 101 YQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKI--IAVITSPTGAVIRDILQRISCRFP 171 (529) Q Consensus 101 ~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~--i~vits~~~a~~~D~~~~~~~r~p 171 (529) +-+.|..||-.+.-++...++++++.|...|+..... .....++. |--|++.+|.++.+.+..+-+..| T Consensus 117 ~iv~iNKiD~~~~~~~~~~~~ei~~~l~~~~~~~~~~--~~~~~~~~~piv~iSA~~G~gv~~Lld~i~~~lP 187 (189) T cd00881 117 IIVAINKIDRVGEEDLEEVLREIKELLGLIGFISTKE--EGTRNGLLVPIVPGSALTGIGVEELLEAIVEHLP 187 (189) T ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCHHHHH--HCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCC T ss_conf 7999989718775629999999999987532102321--1012588775999888678697999999997687 No 286 >PRK12680 transcriptional regulator CysB-like protein; Reviewed Probab=63.23 E-value=11 Score=17.43 Aligned_cols=102 Identities=13% Similarity=0.120 Sum_probs=64.4 Q ss_pred CCCCCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC Q ss_conf 026528999847842589999998630597-5899972100111103679999999974100357677758999516888 Q gi|254780791|r 142 PFIPKIIAVITSPTGAVIRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGS 220 (529) Q Consensus 142 P~~p~~i~vits~~~a~~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS 220 (529) ..-.-+||...+...-.+-+++.....+|| +++.+.. | .+.+|++.+.. ..+|+-|+.--++. T Consensus 91 ~~G~L~Ig~~~t~a~~~LP~vi~~f~~~yP~V~l~l~~----~--~~~~l~e~l~~----------g~~Diai~~~~~~~ 154 (327) T PRK12680 91 SQGQLTLTTTHTQARFVLPPAVAQIKQAYPQVSVHLQQ----A--AESAALDLLGQ----------GDADIAVVSTAGGE 154 (327) T ss_pred CCEEEEEEEECHHHHHHHHHHHHHHHHHCCCCEEEEEE----C--CCHHHHHHHHC----------CCCCEEEECCCCCC T ss_conf 73379999842277775689999999868996799998----9--82889999978----------99409994477779 Q ss_pred HHHHHHCCHHHHHHHHHHCCCEEEEEECCCC-----CCHHHHHHHCCCCCCC Q ss_conf 4442200769999999748904885205777-----5258988641237772 Q gi|254780791|r 221 IEDLWHFNDEMIVRAIANSSIPIISAIGHET-----DWTLADYAADLRAPTP 267 (529) Q Consensus 221 ~eDL~~FN~e~laraI~~~~iPVisgIGHE~-----D~Tl~D~VAD~Ra~TP 267 (529) ..|+..+ -.+..+.-+|..-||.- -.|+.|+ |+...-|+ T Consensus 155 ~~~~~~~-------pl~~~~~~vvvP~~HpL~~~~~~i~L~~L-a~~plI~~ 198 (327) T PRK12680 155 PSAGIAV-------PLYRWRRLVVVPRGHALDTPRTAPDMAAL-AEHPLISY 198 (327) T ss_pred CCCCEEE-------EEEECCEEEEEECCCCCCCCCCCCCHHHH-HCCCEEEC T ss_conf 9765476-------75223579999789810058999999998-08988963 No 287 >cd06267 PBP1_LacI_sugar_binding_like Ligand binding domain of the LacI tanscriptional regulator family belonging to the type I periplasmic-binding fold protein superfamily. Ligand binding domain of the LacI tanscriptional regulator family belonging to the type I periplasmic-binding fold protein superfamily. In most cases, ligands are monosaccharide including lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars. The LacI family of proteins consists of transcriptional regulators related to the lac repressor. In this case, the domain sugar binding changes the DNA binding activity of the repressor domain. Probab=63.09 E-value=11 Score=17.41 Aligned_cols=136 Identities=24% Similarity=0.277 Sum_probs=56.3 Q ss_pred CEEEEEEECCCCCC---CCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 47999997352105---866814598899999966752884379999971016800799999999997654012261001 Q gi|254780791|r 62 SRIDAIIWKGTLNK---IEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHK 138 (529) Q Consensus 62 a~i~~~~~~~~~~~---~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k 138 (529) ..+.+++.-+.... +...-+.|.-|++.++.. ..+ .+..+. .+....|.+ .-+.|.+.| + T Consensus 54 ~~vDgiIi~~~~~~~~~~~~~~~~~iPvV~~~~~~-~~~--~~~~V~--~D~~~a~~~------a~~~L~~~G-----~- 116 (264) T cd06267 54 RRVDGIILAPSRLDDELLEELAALGIPVVLVDRPL-DGL--GVDSVG--IDNRAGAYL------AVEHLIELG-----H- 116 (264) T ss_pred CCCCEEEECCCCCCHHHHHHHHHCCCCEEEEEECC-CCC--CCCEEE--ECHHHHHHH------HHHHHHHHC-----C- T ss_conf 79999996788899899999998699889996047-999--987999--787999999------999998718-----8- Q ss_pred CCCCCCCCEEEEEECCCHHH-----HHHHHHHHHHCC-CE-EEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCE Q ss_conf 63102652899984784258-----999999863059-75-899972100111103679999999974100357677758 Q gi|254780791|r 139 NPIPFIPKIIAVITSPTGAV-----IRDILQRISCRF-PL-RVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDI 211 (529) Q Consensus 139 ~~lP~~p~~i~vits~~~a~-----~~D~~~~~~~r~-p~-~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~ 211 (529) ++||+|+++.+.. +.-|...+++.. ++ ...+++ +.....+..+++..+-.. .+++|+ T Consensus 117 -------~~i~~i~~~~~~~~~~~R~~gf~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~l~~-----~~~~~a 180 (264) T cd06267 117 -------RRIAFIGGPPDLSTARERLEGYREALEEAGIPLDEELIVE----GDFSEESGYEAARELLAS-----GERPTA 180 (264) T ss_pred -------CCEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCHHEEEC----CCCCHHHHHHHHHHHHHC-----CCCCCE T ss_conf -------8568963898770699999999999998699988530752----789779999999999965-----999989 Q ss_pred EEEECCCCCHHHHHHCCHHH---HHHHHHHC--CCE Q ss_conf 99951688844422007699---99999748--904 Q gi|254780791|r 212 IILARGGGSIEDLWHFNDEM---IVRAIANS--SIP 242 (529) Q Consensus 212 iii~RGGGS~eDL~~FN~e~---laraI~~~--~iP 242 (529) |+ |.||.. +.+++.+. ++| T Consensus 181 i~------------~~~D~~A~g~~~~l~~~g~~iP 204 (264) T cd06267 181 IF------------AANDLMAIGALRALRELGLRVP 204 (264) T ss_pred EE------------ECCHHHHHHHHHHHHHCCCCCC T ss_conf 98------------7887999999999998299999 No 288 >cd06281 PBP1_LacI_like_5 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=62.92 E-value=11 Score=17.38 Aligned_cols=77 Identities=10% Similarity=0.145 Sum_probs=32.5 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCC----CCCCCCCC Q ss_conf 88889862299999999999740017189999970543568886279998748947999997352105----86681459 Q gi|254780791|r 8 NSLDHPEYSVSELSYHLKHIVESNLSHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNK----IEFLPEEG 83 (529) Q Consensus 8 ~~~~~~~~svs~l~~~i~~~l~~~~~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~----~~~~~~~G 83 (529) |++++|- -+++...|...+... ++--+... |+.. +..-.-|++.- ...+++++++-..... ....-+.| T Consensus 7 p~i~npf--f~~~~~gi~~~~~~~-Gy~~ll~~-s~~~--~~~e~~~l~~l-~~~~vdGiIi~~~~~~~~~~~~~~~~~~ 79 (269) T cd06281 7 SDITNPL--LAQLFSGAEDRLRAA-GYSLLIAN-SLND--PERELEILRSF-EQRRMDGIIIAPGDERDPELVDALASLD 79 (269) T ss_pred CCCCCHH--HHHHHHHHHHHHHHC-CCEEEEEE-CCCC--HHHHHHHHHHH-HHCCCCEEEECCCCCCCHHHHHHHHHCC T ss_conf 8987899--999999999999986-99899996-8999--89999999999-8579989997677779999999998479 Q ss_pred CEEEEEEE Q ss_conf 88999999 Q gi|254780791|r 84 IEFLVIGK 91 (529) Q Consensus 84 ~~v~~~g~ 91 (529) .-|++.++ T Consensus 80 iPvV~~~r 87 (269) T cd06281 80 LPIVLLDR 87 (269) T ss_pred CCEEEEEC T ss_conf 98899844 No 289 >PRK10053 hypothetical protein; Provisional Probab=62.77 E-value=11 Score=17.36 Aligned_cols=78 Identities=17% Similarity=0.265 Sum_probs=59.3 Q ss_pred HHHHHHHHC-CCCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEE Q ss_conf 999997400-1718999997054356888627999874894799999735210586681459889999996675288437 Q gi|254780791|r 23 HLKHIVESN-LSHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKY 101 (529) Q Consensus 23 ~i~~~l~~~-~~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ 101 (529) .++..++.. =..|-++|-|.. +-||=.|..+|....|..-|=...+....+.| -++|.+.|+|+ ...... T Consensus 51 TV~qAk~m~Dda~V~L~GnIv~-----~lg~d~Y~FrD~TG~I~VeId~~~W~G~~VtP--~d~V~I~GeVD--ke~~~~ 121 (130) T PRK10053 51 TVEQAKTMHDGATVSLRGNLID-----HKGDDRYVFRDKSGEINVIIPAAVFDGREVQP--DQMININGSLD--KKSAPP 121 (130) T ss_pred HHHHHHHCCCCCEEEEEEEEEE-----ECCCCEEEEECCCCCEEEEECHHHCCCCCCCC--CCEEEEEEEEC--CCCCCC T ss_conf 6998864669984999888886-----14883489977997599996777748978599--98789988972--566882 Q ss_pred EEEEEEEE Q ss_conf 99999710 Q gi|254780791|r 102 QIIIESLI 109 (529) Q Consensus 102 ql~v~~i~ 109 (529) .+.|..|. T Consensus 122 ~vdV~~iq 129 (130) T PRK10053 122 VVRVTHLQ 129 (130) T ss_pred EEEEEEEC T ss_conf 89988741 No 290 >TIGR00263 trpB tryptophan synthase, beta subunit; InterPro: IPR006654 Tryptophan synthase catalyzes the last step in the biosynthesis of tryptophan , :L-serine + 1-(indol-3-yl)glycerol 3-phosphate = L-tryptophan + glyceraldehyde 3-phosphate + H_2O It has two functional domains, each found in bacteria and plants on a separate subunit: alpha chain (IPR002028 from INTERPRO) is for the aldol cleavage of indoleglycerol phosphate to indole and glyceraldehyde 3-phosphate and beta chain is for the synthesis of tryptophan from indole and serine. In fungi the two domains are fused together on a single multifunctional protein . The beta chain of the enzyme, represented here, requires pyridoxal-phosphate as a cofactor. The pyridoxal-phosphate group is attached to a lysine residue. The region around this lysine residue also contains two histidine residues which are part of the pyridoxal-phosphate binding site.; GO: 0004834 tryptophan synthase activity, 0006568 tryptophan metabolic process. Probab=62.55 E-value=11 Score=17.33 Aligned_cols=38 Identities=34% Similarity=0.493 Sum_probs=22.5 Q ss_pred EECCCCCCCHH--HHHHHHHHHHH-HHCCCCCCCCCCEEEEECCCCCH Q ss_conf 72100111103--67999999997-41003576777589995168884 Q gi|254780791|r 177 FPVKVQGDECP--KEIANAILQLN-TLKEGRTCPRPDIIILARGGGSI 221 (529) Q Consensus 177 ~p~~vQG~~a~--~~i~~ai~~~~-~~~~~~~~~~~D~iii~RGGGS~ 221 (529) ||.-|-=-++. .|+...|...+ .+ -||.||=|=||||= T Consensus 197 fP~mVr~FQsvIG~E~k~Qi~~~e~gr-------LPD~viACvGGGSN 237 (412) T TIGR00263 197 FPTMVRDFQSVIGEEAKEQILEKEEGR-------LPDAVIACVGGGSN 237 (412) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHCCC-------CCCEEEEECCCCHH T ss_conf 722117887789899999999996279-------97389985167720 No 291 >pfam01177 Asp_Glu_race Asp/Glu/Hydantoin racemase. This family contains aspartate racemase, glutamate racemase, hydantoin racemase and arylmalonate decarboxylase. Probab=62.55 E-value=11 Score=17.33 Aligned_cols=90 Identities=19% Similarity=0.242 Sum_probs=39.5 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEE-------EEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECC Q ss_conf 5289998478425899999986305975899-------972100111103679999999974100357677758999516 Q gi|254780791|r 145 PKIIAVITSPTGAVIRDILQRISCRFPLRVI-------IFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARG 217 (529) Q Consensus 145 p~~i~vits~~~a~~~D~~~~~~~r~p~~~~-------~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RG 217 (529) .+||||+|......-+-+.+.+.+. ++++. +.+..--|...+..+....+.+.... ....|+||+ | T Consensus 106 ~~ri~vl~t~~t~~~~~~~~~l~~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~d~ivL--g 178 (212) T pfam01177 106 AKRVGVLATYGTIVSGLYQELLAEA-GIEVVAPAIRLGIVEVLELGRIEEASVEALAAALERLA----EDGADAVIL--G 178 (212) T ss_pred CCEEEEEECCHHHHHHHHHHHHHHC-CCCEEECCCCCCCHHHHHHCCCCHHHHHHHHHHHHHHH----HCCCCEEEE--C T ss_conf 9808999564244379999999975-99278524355325599928999999999999998875----289899998--5 Q ss_pred CCCHHHHHHCCHHHHHHHH-HHCCCEEEEE Q ss_conf 8884442200769999999-7489048852 Q gi|254780791|r 218 GGSIEDLWHFNDEMIVRAI-ANSSIPIISA 246 (529) Q Consensus 218 GGS~eDL~~FN~e~laraI-~~~~iPVisg 246 (529) +..+--+.. ....+ ....+|||+. T Consensus 179 CT~l~~~~~-----~~~~l~~~~gvpvids 203 (212) T pfam01177 179 CTELPLLLE-----LIEALEPELGVPVIDS 203 (212) T ss_pred CCCHHHHHH-----HHHHHHHHCCCCEECH T ss_conf 888287999-----9999865039919823 No 292 >cd01537 PBP1_Repressors_Sugar_Binding_like Ligand-binding domain of the LacI-GalR family of transcription regulators and the sugar-binding domain of ABC-type transport systems. Ligand-binding domain of the LacI-GalR family of transcription regulators and the sugar-binding domain of ABC-type transport systems, all of which contain the type I periplasmic binding protein-like fold. Their specific ligands include lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars. The LacI family of proteins consists of transcriptional regulators related to the lac repressor; in general the sugar binding domain in this family binds a sugar, which in turn changes the DNA binding activity of the repressor domain. The core structure of the periplasmic binding proteins is classified into two types and they differ in number and order of beta strands in each domain: type I, which has six beta strands, and type II, which has five beta strands. These two distinct structural arrangem Probab=62.12 E-value=11 Score=17.27 Aligned_cols=121 Identities=20% Similarity=0.269 Sum_probs=50.3 Q ss_pred CEEEEEEECCCCCC----CCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 47999997352105----86681459889999996675288437999997101680079999999999765401226100 Q gi|254780791|r 62 SRIDAIIWKGTLNK----IEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQH 137 (529) Q Consensus 62 a~i~~~~~~~~~~~----~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~ 137 (529) ..++++++-+.... +...-+.|..|++.++.. .+...+.. | ..+....|. .+-+.|.+.| T Consensus 54 ~~vDgiIi~~~~~~~~~~~~~~~~~~ipvV~~~~~~--~~~~~~~~-V-~~d~~~~g~------~~~~~l~~~g------ 117 (264) T cd01537 54 RGVDGIIIAPSDLTAPTIVKLARKAGIPVVLVDRDI--PDGDRVPS-V-GSDNEQAGY------LAGEHLAEKG------ 117 (264) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHCCCCEEEECCCC--CCCCCCEE-E-EECHHHHHH------HHHHHHHHCC------ T ss_conf 699999996798886899999997599799985667--88888719-9-977799999------9999998629------ Q ss_pred CCCCCCCCCEEEEEECCCHHHH-----HHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCE Q ss_conf 1631026528999847842589-----999998630597-5899972100111103679999999974100357677758 Q gi|254780791|r 138 KNPIPFIPKIIAVITSPTGAVI-----RDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDI 211 (529) Q Consensus 138 k~~lP~~p~~i~vits~~~a~~-----~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~ 211 (529) .++|++|+++.+... .-|...++...+ ....+ +.|.....+..+++..+-.. .+++|+ T Consensus 118 -------~~~i~~i~~~~~~~~~~~R~~g~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~l~~-----~~~~da 181 (264) T cd01537 118 -------HRRIALLAGPLGSSTARERVAGFKDALKEAGPIEIVLV----QEGDWDAEKGYQAAEELLTA-----HPDPTA 181 (264) T ss_pred -------CCCEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCEEE----EECCCCHHHHHHHHHHHHHH-----CCCCEE T ss_conf -------86245304887770699999999999997799985379----95799989999999999850-----899769 Q ss_pred EEE Q ss_conf 999 Q gi|254780791|r 212 IIL 214 (529) Q Consensus 212 iii 214 (529) |+. T Consensus 182 i~~ 184 (264) T cd01537 182 IFA 184 (264) T ss_pred EEE T ss_conf 998 No 293 >cd01139 TroA_f Periplasmic binding protein TroA_f. These proteins are predicted to function as initial receptors in the ABC metal ion uptake in eubacteria and archaea. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind their ligands in the cleft between these domains. Probab=62.05 E-value=11 Score=17.26 Aligned_cols=33 Identities=21% Similarity=0.411 Sum_probs=20.4 Q ss_pred CCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEE Q ss_conf 77589995168884442200769999999748904885 Q gi|254780791|r 208 RPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIIS 245 (529) Q Consensus 208 ~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVis 245 (529) +|||||+.-++ -..-++..++..+-...||||. T Consensus 91 ~PDvVi~~~~~-----~~~~~~~~~~~~l~~~GIpVv~ 123 (342) T cd01139 91 KPDLVILNIWA-----KTTAEESGILEKLEQAGIPVVF 123 (342) T ss_pred CCCEEEEECCC-----CCCCCCHHHHHHHHHCCCCEEE T ss_conf 99889994434-----5654456799999971998899 No 294 >PRK09039 hypothetical protein; Validated Probab=61.99 E-value=11 Score=17.25 Aligned_cols=48 Identities=15% Similarity=0.222 Sum_probs=26.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 887767777788750268899999999999999999999999999999 Q gi|254780791|r 393 LEKKEKIAILHMLREQTKNRIFYLHTHIKKLITRIEFILSHKIKSCHT 440 (529) Q Consensus 393 ~~~~~~~~~l~~l~~~~~~~l~~~~~~l~~l~~rL~~~~~~~L~~~~~ 440 (529) ...+..+..+.......+.+....+..+..+..+|+.++..++..... T Consensus 147 ~aLr~qL~~l~~~L~~~e~~~~~~~~~i~~Lg~~LN~aLa~rv~eL~~ 194 (343) T PRK09039 147 AALRRQLAALEAALDASEKRDRESQAKIADLGRRLNVALAQRVQELNR 194 (343) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999877888889999998998999999999999 No 295 >PRK06341 single-stranded DNA-binding protein; Provisional Probab=61.36 E-value=9 Score=18.10 Aligned_cols=77 Identities=14% Similarity=0.300 Sum_probs=44.7 Q ss_pred CCCEEEEEEEC---CCCCCCCCCEEE--EEE------ECCC--------CEEEEEEECCC-CCCCCCCCCCCCEEEEEEE Q ss_conf 17189999970---543568886279--998------7489--------47999997352-1058668145988999999 Q gi|254780791|r 32 LSHVCVRGEIS---GYRGIHSSGHAY--FSL------KDNH--------SRIDAIIWKGT-LNKIEFLPEEGIEFLVIGK 91 (529) Q Consensus 32 ~~~~~v~gEis---~~~~~~~sGH~Y--f~l------kd~~--------a~i~~~~~~~~-~~~~~~~~~~G~~v~~~g~ 91 (529) +..|-+.|-+. .++. -.+|+.. |+| +|.+ .=++|++|+.. +..+.--++.|++|.|.|+ T Consensus 5 ~NkV~LiG~lg~DPE~r~-t~~G~~v~~fslA~~~~~~~~~~ge~~e~T~w~~vv~~~~~lae~~~~yl~KG~~V~VeGr 83 (165) T PRK06341 5 VNKVILIGNLGADPEIRR-TQDGRPIANLRIATSETWRDRNSGERKEKTEWHRVVIFNEGLCKVAEQYLKKGAKVYIEGQ 83 (165) T ss_pred CEEEEEEEECCCCCEEEE-CCCCCEEEEEEEEEECCEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEECC T ss_conf 428999872677974878-6999879999999741500256775433526676666528999999997378987988322 Q ss_pred EEE--ECCCCEEEEEEEEEE Q ss_conf 667--528843799999710 Q gi|254780791|r 92 ITT--FPGSSKYQIIIESLI 109 (529) Q Consensus 92 ~~~--y~~~g~~ql~v~~i~ 109 (529) +.. |..+-+-.-++..|. T Consensus 84 L~~r~w~dk~G~~ry~teVv 103 (165) T PRK06341 84 LQTRKWTDQSGVERYSTEVV 103 (165) T ss_pred EEEEEEECCCCCEEEEEEEE T ss_conf 68731187999889899999 No 296 >cd06306 PBP1_TorT-like TorT-like proteins, a periplasmic binding protein family that activates induction of the Tor respiratory system upon trimethylamine N-oxide (TMAO) electron-acceptor binding in bacteria. TorT-like proteins, a periplasmic binding protein family that activates induction of the Tor respiratory system upon trimethylamine N-oxide (TMAO) electron-acceptor binding in bacteria. The Tor respiratory system is consists of three proteins (TorC, TorA, and TorD) and is induced in the presence of TMAO. The TMAO control is tightly regulated by three proteins: TorS, TorT, and TorR. Thus, the disruption of any of these proteins can abolish the Tor respiratory induction. TorT shares homology with the sugar-binding domain of the type I periplasmic binding proteins. The members of TorT-like family bind TMAO or related compounds and are predicted to be involved in signal transduction and/or substrate transport. Probab=61.28 E-value=12 Score=17.16 Aligned_cols=83 Identities=17% Similarity=0.171 Sum_probs=40.7 Q ss_pred EEEEEECCCHHHHHHHHHHH----HHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHH Q ss_conf 89998478425899999986----30597589997210011110367999999997410035767775899951688844 Q gi|254780791|r 147 IIAVITSPTGAVIRDILQRI----SCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIE 222 (529) Q Consensus 147 ~i~vits~~~a~~~D~~~~~----~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~e 222 (529) ||||+.+.-..-|+.-++.= ...+++++.++.+.-+++ .+.+| +.|+.+-. ..+|.|||. .- T Consensus 1 ki~v~~~~l~npfw~~~~~Ga~~aA~e~G~~v~~~~~~~~~d-~~~Qi-~~ie~~i~-------~gvDaIii~-----p~ 66 (268) T cd06306 1 KLCVLYPHLKDAYWLSVNYGMVEEAKRLGVSLKLLEAGGYPN-LAKQI-AQLEDCAA-------WGADAILLG-----AV 66 (268) T ss_pred CEEEEECCCCCHHHHHHHHHHHHHHHHHCCEEEEEECCCCCC-HHHHH-HHHHHHHH-------CCCCEEEEC-----CC T ss_conf 989995788987999999999999999699799995899999-99999-99999998-------399999986-----79 Q ss_pred HHHHCCHHHHHHHHHHCCCEEEEE Q ss_conf 422007699999997489048852 Q gi|254780791|r 223 DLWHFNDEMIVRAIANSSIPIISA 246 (529) Q Consensus 223 DL~~FN~e~laraI~~~~iPVisg 246 (529) |=-+++. +.++.. ..||||+- T Consensus 67 d~~al~~--~~~a~~-agIPVV~~ 87 (268) T cd06306 67 SPDGLNE--ILQQVA-ASIPVIAL 87 (268) T ss_pred CHHHHHH--HHHHHH-CCCEEEEE T ss_conf 8677789--999998-69829999 No 297 >cd03522 MoeA_like MoeA_like. This domain is similar to a domain found in a variety of proteins involved in biosynthesis of molybdopterin cofactor, like MoaB, MogA, and MoeA. There this domain is presumed to bind molybdopterin. The exact function of this subgroup is unknown. Probab=61.16 E-value=12 Score=17.14 Aligned_cols=107 Identities=19% Similarity=0.272 Sum_probs=60.5 Q ss_pred CCCEEEEEEEEEEECC-CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCC----HHHHHHHHHHHHHC-- Q ss_conf 8843799999710168-0079999999999765401226100163102652899984784----25899999986305-- Q gi|254780791|r 97 GSSKYQIIIESLIPSG-SGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPT----GAVIRDILQRISCR-- 169 (529) Q Consensus 97 ~~g~~ql~v~~i~~~g-~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~----~a~~~D~~~~~~~r-- 169 (529) ..|+ .+--.+|-|-+ ..+. +++...-+....+|. =.|.-++++|+|+..+ +....-|..+++.| T Consensus 119 ~~G~-~vAt~kiIPl~v~~~~---l~~a~~i~~~~p~~~-----v~p~~~~~vglI~T~~ev~~~~i~dk~~~v~~~rl~ 189 (312) T cd03522 119 EAGQ-MVATVKIIPLAVPEAL---VERAEALARDGPLLR-----VAPFRPLRVGLIVTGSEVYGGRIEDKFGPVLRARLA 189 (312) T ss_pred CCCC-EEEEEEEEEEECCHHH---HHHHHHHHCCCCEEE-----EECCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHH T ss_conf 6898-5999988643457999---999999856796199-----933754168999967866665034524789999999 Q ss_pred -CCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHH Q ss_conf -9758999721001111036799999999741003576777589995168884442 Q gi|254780791|r 170 -FPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDL 224 (529) Q Consensus 170 -~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL 224 (529) |+.++ +....|. +-...|.+||..+-... +.+||+= ||...|= T Consensus 190 ~~G~~l-~~~~~v~--dd~~~ia~aI~~~~~~g--------a~lvi~t-gsasvDp 233 (312) T cd03522 190 ALGVEL-VEQVIVP--HDEAAIAAAIAEALEAG--------AELLILT-GGASVDP 233 (312) T ss_pred HCCCEE-EEEEECC--CCHHHHHHHHHHHHHCC--------CEEEEEC-CCCCCCC T ss_conf 749846-5579558--98999999999998689--------9399944-8754288 No 298 >PRK04346 tryptophan synthase subunit beta; Validated Probab=61.09 E-value=4.7 Score=20.59 Aligned_cols=38 Identities=29% Similarity=0.424 Sum_probs=22.6 Q ss_pred CCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHH--HHHCC Q ss_conf 00111103679999999974100357677758999516888444--22007 Q gi|254780791|r 180 KVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIED--LWHFN 228 (529) Q Consensus 180 ~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eD--L~~FN 228 (529) +|=|.++-.++.+. .. ..||+||-|=||||--= ..+|= T Consensus 210 svIG~E~k~Q~~e~---~G--------~~PD~vvaCVGGGSNa~G~f~pFl 249 (392) T PRK04346 210 SVIGEEAKAQILEK---EG--------RLPDAVVACVGGGSNAIGIFHPFI 249 (392) T ss_pred HHHHHHHHHHHHHH---HC--------CCCCEEEECCCCCCHHHHHHCCCC T ss_conf 98549999999997---38--------999999987787611665321013 No 299 >TIGR01357 aroB 3-dehydroquinate synthase; InterPro: IPR016037 The 3-dehydroquinate synthase (4.2.3.4 from EC) domain is present in isolation in various bacterial 3-dehydroquinate synthases and also present as a domain in the pentafunctional AROM polypeptide (P07547 from SWISSPROT) . 3-dehydroquinate (DHQ) synthase catalyses the formation of dehydroquinate (DHQ) and orthophosphate from 3-deoxy-D-arabino heptulosonic 7 phosphate . This reaction is part of the shikimate pathway which is involved in the biosynthesis of aromatic amino acids.; GO: 0003856 3-dehydroquinate synthase activity, 0009073 aromatic amino acid family biosynthetic process. Probab=60.94 E-value=12 Score=17.11 Aligned_cols=107 Identities=24% Similarity=0.313 Sum_probs=62.1 Q ss_pred HHCCCCCCCCCCCC--CC-CEEEEEECCCHHHHH-HHHHHHHHC--CCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 01226100163102--65-289998478425899-999986305--9758999721001111036799999999741003 Q gi|254780791|r 130 EGLFSDQHKNPIPF--IP-KIIAVITSPTGAVIR-DILQRISCR--FPLRVIIFPVKVQGDECPKEIANAILQLNTLKEG 203 (529) Q Consensus 130 eGlfd~~~k~~lP~--~p-~~i~vits~~~a~~~-D~~~~~~~r--~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~ 203 (529) +|+|++.---.+.. .+ +++-|||.++-+.+. +-+.-+... |+...+++|. || ..++.-...+..+.+-+. T Consensus 7 ~g~~~~~~~~~~~~l~~~~~~~~~~~d~~v~~~~~~~~~~~~~~~g~~v~~~~~p~---GE-~~K~l~~~~~l~~~L~~~ 82 (361) T TIGR01357 7 SGLLDRQLVEELAELRKACSKLVIVTDETVADLYAEKLLEALQSAGFNVLVLTVPD---GE-ESKSLETVQRLYDQLLEA 82 (361) T ss_pred CCCCHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHHHCCCCEEEEEECC---CC-CCCCHHHHHHHHHHHHHC T ss_conf 77402467776774101365079986745688888899999985387526887078---98-745488899999999965 Q ss_pred CCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCE Q ss_conf 576777589995168884442200769999999748904 Q gi|254780791|r 204 RTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIP 242 (529) Q Consensus 204 ~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iP 242 (529) ..++ +-.||+=|||.+.||..|===.-.|=|---.+| T Consensus 83 -~~~R-~~~i~~~GGGvvgDlaGFvAaty~RGi~~iqvP 119 (361) T TIGR01357 83 -GLDR-SSTIIALGGGVVGDLAGFVAATYMRGIRFIQVP 119 (361) T ss_pred -CCCC-CCEEEEECCHHHHHHHHHHHHHHHHCCCEEECC T ss_conf -8895-647888545377878999999996155826414 No 300 >PRK10586 hypothetical protein; Provisional Probab=60.83 E-value=12 Score=17.10 Aligned_cols=79 Identities=16% Similarity=0.306 Sum_probs=44.2 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHCC--C-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC Q ss_conf 652899984784258999999863059--7-5899972100111103679999999974100357677758999516888 Q gi|254780791|r 144 IPKIIAVITSPTGAVIRDILQRISCRF--P-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGS 220 (529) Q Consensus 144 ~p~~i~vits~~~a~~~D~~~~~~~r~--p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS 220 (529) .-+|+.||++..+ +.-+...+..-+ + +.. ....|+-+..+|-+-...+. ...| +||+=|||- T Consensus 33 ~~~~~lii~g~~~--~~~~~~~~~~sl~~~~~~~----~~f~Gecs~~ei~rl~~~~~--------~~~d-~IigvGGGK 97 (362) T PRK10586 33 QLSRAVWIYGERA--IAAAQPYLPPAFELPGAKH----ILFRGHCSESDVAQLAAESG--------DDRQ-VVIGVGGGA 97 (362) T ss_pred CCCCEEEEECHHH--HHHHHHHHHHHHHCCCCEE----EEECCCCCHHHHHHHHHHHC--------CCCC-EEEEECCCH T ss_conf 4784699988789--9999888766540258569----98789458999999999735--------5887-899955719 Q ss_pred HHHHHHCCHHHHHHHHHH-CCCEEEE Q ss_conf 444220076999999974-8904885 Q gi|254780791|r 221 IEDLWHFNDEMIVRAIAN-SSIPIIS 245 (529) Q Consensus 221 ~eDL~~FN~e~laraI~~-~~iPVis 245 (529) .-|- |++++. ..+|||+ T Consensus 98 ~iDt--------aK~vA~~~~~pvv~ 115 (362) T PRK10586 98 LLDT--------AKALARRLGLPFVA 115 (362) T ss_pred HHHH--------HHHHHHHCCCCEEE T ss_conf 9999--------99999981999899 No 301 >PRK13028 tryptophan synthase subunit beta; Provisional Probab=60.80 E-value=2.5 Score=23.03 Aligned_cols=64 Identities=23% Similarity=0.386 Sum_probs=32.6 Q ss_pred HHHHHHHHHHHHH-CCCE-EE---EEEEC------CCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHH- Q ss_conf 2589999998630-5975-89---99721------00111103679999999974100357677758999516888444- Q gi|254780791|r 156 GAVIRDILQRISC-RFPL-RV---IIFPV------KVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIED- 223 (529) Q Consensus 156 ~a~~~D~~~~~~~-r~p~-~~---~~~p~------~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eD- 223 (529) ..|++|.+....+ .|.+ .+ .-||. +|=|.++..+|.+. .. ..||+||.|=||||--= T Consensus 178 ~eAir~~~~~~~~t~Y~lGSvvgphPyp~~V~~fQsvIG~E~~~Q~~e~---~G--------~~PD~vvacVGGGSNa~G 246 (403) T PRK13028 178 DSAFEAYLKDPDTTHYAIGSVVGPHPFPKMVRDFQSVIGREAREQFLEM---TG--------RLPDYVVACVGGGSNAIG 246 (403) T ss_pred HHHHHHHHHCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH---CC--------CCCCEEEECCCCCHHHHH T ss_conf 9999999838465269714656887208999999999989999999986---49--------999989977887234889 Q ss_pred -HHHCCHH Q ss_conf -2200769 Q gi|254780791|r 224 -LWHFNDE 230 (529) Q Consensus 224 -L~~FN~e 230 (529) ..+|-++ T Consensus 247 ~f~pFl~d 254 (403) T PRK13028 247 AFAAFLDD 254 (403) T ss_pred HHHHHHCC T ss_conf 87764266 No 302 >PRK06027 purU formyltetrahydrofolate deformylase; Reviewed Probab=60.47 E-value=12 Score=17.05 Aligned_cols=88 Identities=18% Similarity=0.264 Sum_probs=50.7 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHC-CCEEEEE--------------- Q ss_conf 007999999999976540122610016310265289998478425899999986305-9758999--------------- Q gi|254780791|r 113 SGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCR-FPLRVII--------------- 176 (529) Q Consensus 113 ~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r-~p~~~~~--------------- 176 (529) .-+|...|+.+-+++..+ -...-+.-+.||+|..|+-+--+.|.|..-+.. .|++|.. T Consensus 62 ~~~L~~~f~~la~~f~m~------~~l~~~~~k~rv~ImVSK~dHCL~DLL~R~~~GeL~~eI~~VISNH~dL~~lae~~ 135 (285) T PRK06027 62 LETLRADFAALAEEFGMD------WRLHDSAGRKRVVILVSKEDHCLGDLLWRWRSGELPVEIAAVISNHPDLRSLVESF 135 (285) T ss_pred HHHHHHHHHHHHHHCCCE------EEEECCCCCCEEEEEEECCCCCHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHHC T ss_conf 999999999988763986------99963567835999983645378999998756985625779942857899999986 Q ss_pred ----EECCCCCCC---HHHHHHHHHHHHHHHCCCCCCCCCCEEEEEC Q ss_conf ----721001111---0367999999997410035767775899951 Q gi|254780791|r 177 ----FPVKVQGDE---CPKEIANAILQLNTLKEGRTCPRPDIIILAR 216 (529) Q Consensus 177 ----~p~~vQG~~---a~~~i~~ai~~~~~~~~~~~~~~~D~iii~R 216 (529) |..+|-+++ +++.|.+.|+.. ++|+||++| T Consensus 136 ~IPF~~iPv~~~~K~e~E~~i~~~~~~~----------~~d~ivla~ 172 (285) T PRK06027 136 GIPFHHVPVTKETKAEAEAQLLELIDEY----------QPDLVVLAR 172 (285) T ss_pred CCCEEEECCCCCCHHHHHHHHHHHHHHC----------CCCEEEEHH T ss_conf 9982882687665377999999998734----------971997633 No 303 >cd04257 AAK_AK-HSDH AAK_AK-HSDH: Amino Acid Kinase Superfamily (AAK), AK-HSDH; this CD includes the N-terminal catalytic domain of aspartokinase (AK) of the bifunctional enzyme AK - homoserine dehydrogenase (HSDH). These aspartokinases are found in bacteria (E. coli AKI-HSDHI, ThrA and E. coli AKII-HSDHII, MetL) and higher plants (Z. mays AK-HSDH). AK and HSDH are the first and third enzymes in the biosynthetic pathway of the aspartate family of amino acids. AK catalyzes the phosphorylation of Asp to P-aspartyl phosphate. HSDH catalyzes the NADPH-dependent conversion of Asp 3-semialdehyde to homoserine. ThrA and MetL are involved in threonine and methionine biosynthesis, respectively. In E. coli, ThrA is subject to allosteric regulation by the end product L-threonine and the native enzyme is reported to be tetrameric. As with bacteria, plant AK and HSDH are feedback inhibited by pathway end products. Maize AK-HSDH is a Thr-sensitive 180-kD enzyme. Arabidopsis AK-HSDH is an alanine-act Probab=60.23 E-value=12 Score=17.02 Aligned_cols=41 Identities=24% Similarity=0.311 Sum_probs=28.0 Q ss_pred ECCCCCHHHHHHCCHHHHHHHHHHC----C-CEEEEEECCCCCCHHHHH Q ss_conf 5168884442200769999999748----9-048852057775258988 Q gi|254780791|r 215 ARGGGSIEDLWHFNDEMIVRAIANS----S-IPIISAIGHETDWTLADY 258 (529) Q Consensus 215 ~RGGGS~eDL~~FN~e~laraI~~~----~-iPVisgIGHE~D~Tl~D~ 258 (529) ==||-|+.|--+|. .+++-|-+. + +=|+||.|.-||.- .+. T Consensus 5 KFGGtSv~~~~~i~--~v~~iI~~~~~~~~~vVVVSA~ggvTd~L-~~~ 50 (294) T cd04257 5 KFGGTSLANAERIR--RVADIILNAAKQEQVAVVVSAPGKVTDLL-LEL 50 (294) T ss_pred EECCCCCCCHHHHH--HHHHHHHHHCCCCCEEEEECCCCCCHHHH-HHH T ss_conf 87811158999999--99999975324899899980999867999-999 No 304 >PRK13055 putative lipid kinase; Reviewed Probab=60.22 E-value=12 Score=17.02 Aligned_cols=110 Identities=28% Similarity=0.333 Sum_probs=58.7 Q ss_pred CCCEEEEEECCCH---HHHH---HHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECC Q ss_conf 6528999847842---5899---999986305975899972100111103679999999974100357677758999516 Q gi|254780791|r 144 IPKIIAVITSPTG---AVIR---DILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARG 217 (529) Q Consensus 144 ~p~~i~vits~~~---a~~~---D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RG 217 (529) +.||+.||-=|.+ .+-+ ++...+++ ..+++.+|.++--+. .+.++.+.... ..+|+||.+=| T Consensus 1 MmKr~~lIvNP~SG~g~~~~~~~~i~~~L~~-~g~e~~~~~tt~~~~-~a~~~a~~a~~----------~g~d~Iva~GG 68 (334) T PRK13055 1 MQKRARLIYNPTSGQEIMKKNVADILDILEQ-AGYETSAFQTTPEPN-SAKNEARRAAK----------AGFDLIIAAGG 68 (334) T ss_pred CCEEEEEEECCCCCCCCHHHHHHHHHHHHHH-CCCEEEEEEEECCCC-HHHHHHHHHHH----------CCCCEEEEECC T ss_conf 9706999999877897578899999999998-698599999441785-79999998765----------79989999877 Q ss_pred CCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHCCCCC--CCHHHHHHHC Q ss_conf 888444220076999999974890488520577752589886412377--7214567633 Q gi|254780791|r 218 GGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYAADLRAP--TPTGAAEMAV 275 (529) Q Consensus 218 GGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~VAD~Ra~--TPTaAAElav 275 (529) =|.+- +++..++....|+.-|| =.==|--||+-.+.-| .|.+|++.+. T Consensus 69 DGTin--------evvngl~~~~~~~~Lgi--IP~GTgNDfAr~Lgip~~~~~~A~~~i~ 118 (334) T PRK13055 69 DGTIN--------EVVNGIAPLEKRPKMAI--IPAGTTNDYARALKIPRGNPVEAAKVIL 118 (334) T ss_pred CCHHH--------HHHHHHHCCCCCCEEEE--EECCCCHHHHHHCCCCCCCHHHHHHHHH T ss_conf 60899--------99998734699971899--8077707899982999669999999986 No 305 >PRK13175 consensus Probab=59.99 E-value=12 Score=16.99 Aligned_cols=42 Identities=21% Similarity=0.335 Sum_probs=26.2 Q ss_pred CCCEEEEECCCCC----HHHHHHCCHHHHHHHHHHCCCEEE-EEECCC Q ss_conf 7758999516888----444220076999999974890488-520577 Q gi|254780791|r 208 RPDIIILARGGGS----IEDLWHFNDEMIVRAIANSSIPII-SAIGHE 250 (529) Q Consensus 208 ~~D~iii~RGGGS----~eDL~~FN~e~laraI~~~~iPVi-sgIGHE 250 (529) .+|.||| =|+|| .+.|...|=.+..+...+...||+ .-.||. T Consensus 39 ~~d~lIL-PGvGsf~~~~~~l~~~~l~~~i~~~~~~g~PiLGIClGmQ 85 (206) T PRK13175 39 AADALIL-PGVGAFDPAMQNLRSRGLIPPIKDAIASGKPFLGICLGLQ 85 (206) T ss_pred CCCEEEE-CCCCCCHHHHHHHHHCCCHHHHHHHHHCCCCEEEEEHHHH T ss_conf 2892896-8998526788755531871689999856997799867866 No 306 >KOG1899 consensus Probab=59.94 E-value=12 Score=16.98 Aligned_cols=93 Identities=13% Similarity=0.143 Sum_probs=44.9 Q ss_pred CCCHHHHHHHC--CCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 75258988641--2377721456763323467776699999888778998889999855677653049788986788878 Q gi|254780791|r 251 TDWTLADYAAD--LRAPTPTGAAEMAVPVKEHLQSSLINLEARLNNIIIRLIKYKINTLNSLLKALPNSDQTLSCSRYRL 328 (529) Q Consensus 251 ~D~Tl~D~VAD--~Ra~TPTaAAElavp~~~EL~~~L~~l~~RL~~a~~~~l~~~~q~L~~l~r~L~~~~~~L~~~~QrL 328 (529) |+-+|++.... -+..|||+ |...++..++..++..-....- ...-...+.......+...+-.++..+++| T Consensus 83 taa~i~eWles~~p~~~~~s~------~~~~~yQerLaRLe~dkesL~L-QvsvLteqVeaQgEKIrDLE~cie~kr~kL 155 (861) T KOG1899 83 TAARIAEWLESPSPSMSTVSC------PEYPEYQERLARLEMDKESLQL-QVSVLTEQVEAQGEKIRDLETCIEEKRNKL 155 (861) T ss_pred HHHHHHHHHHCCCCCCCCCCC------CCCHHHHHHHHHHHCCHHHHEE-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 779999998566877777667------7654799999998525043330-189999999875656787999999988520 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8865456899999987568899 Q gi|254780791|r 329 DRLPRELEHNLEIIIFRKYRHF 350 (529) Q Consensus 329 d~~~~rL~~~l~~~l~~~~~~l 350 (529) ...++-|++.+-....-..+.+ T Consensus 156 natEEmLQqellsrtsLETqKl 177 (861) T KOG1899 156 NATEEMLQQELLSRTSLETQKL 177 (861) T ss_pred CHHHHHHHHHHHHHHHHHHHHH T ss_conf 1089999999876556777776 No 307 >cd01742 GATase1_GMP_Synthase Type 1 glutamine amidotransferase (GATase1) domain found in GMP synthetase. Type 1 glutamine amidotransferase (GATase1) domain found in GMP synthetase. GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. Glutamine amidotransferase (GATase) activity catalyse the transfer of ammonia from the amide side chain of glutamine to an acceptor substrate. GMP synthetase catalyses the amination of the nucleotide precursor xanthosine 5'-monophospahte to form GMP. GMP synthetase belongs to the triad family of amidotransferases having a conserved Cys-His-Glu catalytic triad in the glutaminase active site. Probab=59.81 E-value=11 Score=17.42 Aligned_cols=40 Identities=23% Similarity=0.389 Sum_probs=28.2 Q ss_pred CCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEE-EECCC Q ss_conf 77589995168884442200769999999748904885-20577 Q gi|254780791|r 208 RPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIIS-AIGHE 250 (529) Q Consensus 208 ~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVis-gIGHE 250 (529) .+|.||+.=|-||..| .+...+.+.|.+..+||+- -.||. T Consensus 41 ~~dgvils~GP~~~~~---~~~~~~~~~i~~~~~PilGIClG~Q 81 (181) T cd01742 41 NPKGIILSGGPSSVYE---EDAPRVDPEIFELGVPVLGICYGMQ 81 (181) T ss_pred CCCEEEECCCCCCCCC---CCCCHHHHHHHHCCCCEEEEHHHHH T ss_conf 9898998999998555---7983330999847999999859999 No 308 >PRK12419 riboflavin synthase subunit beta; Provisional Probab=59.75 E-value=12 Score=16.95 Aligned_cols=95 Identities=23% Similarity=0.288 Sum_probs=55.6 Q ss_pred CCCCCCCEEEEEECCCHHHHHHH-----HHHHHHC-CC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEE Q ss_conf 31026528999847842589999-----9986305-97-58999721001111036799999999741003576777589 Q gi|254780791|r 140 PIPFIPKIIAVITSPTGAVIRDI-----LQRISCR-FP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDII 212 (529) Q Consensus 140 ~lP~~p~~i~vits~~~a~~~D~-----~~~~~~r-~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~i 212 (529) |-=.-|+|||||.|.==.-+-|= +..|.+. .+ -++.+|-++ | +-||--+.+.+-.. .+||.| T Consensus 5 p~~a~~~riaIV~s~~n~~It~~Ll~ga~~~L~~~gv~~~~i~v~~VP--G---AfEiP~aa~~l~~~------~~~dai 73 (158) T PRK12419 5 PRFATPQRIAFVQARWHADIVDQAREGFVAEIAARGGAASQVDVFEVP--G---AFEIPLHAKTLAKT------GRYAAI 73 (158) T ss_pred CCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCHHHEEEEECC--C---HHHHHHHHHHHHHC------CCCCEE T ss_conf 767789869999767758999999999999999859985545899828--6---78999999999855------999889 Q ss_pred E----EECCCCCHHHHHHCCHHHHHHHH----HHCCCEEEEEEC Q ss_conf 9----95168884442200769999999----748904885205 Q gi|254780791|r 213 I----LARGGGSIEDLWHFNDEMIVRAI----ANSSIPIISAIG 248 (529) Q Consensus 213 i----i~RGGGS~eDL~~FN~e~laraI----~~~~iPVisgIG 248 (529) | |+||+=.-=|.-| ..+++.| .+..+||+-||= T Consensus 74 IalG~VIrGeT~Hfd~V~---~~v~~gl~~lsl~~~iPV~~GVL 114 (158) T PRK12419 74 VAAALVVDGGIYRHEFVA---QAVIDGLMQVQLETEVPVFSVVL 114 (158) T ss_pred EEEEEEECCCCCCHHHHH---HHHHHHHHHHHHHCCCCEEEEEC T ss_conf 988789828975247899---99999999998624987699983 No 309 >pfam06189 5-nucleotidase 5'-nucleotidase. This family consists of both eukaryotic and prokaryotic 5'-nucleotidase sequences (EC:3.1.3.5). Probab=59.60 E-value=12 Score=16.93 Aligned_cols=144 Identities=16% Similarity=0.181 Sum_probs=79.1 Q ss_pred EEEEEEECCCCCCCC-CCEEEEEEECC-------CCEEEEEEECCCCCCCC--CCC---CCCCEEEEEEEEE-EECCCCE Q ss_conf 899999705435688-86279998748-------94799999735210586--681---4598899999966-7528843 Q gi|254780791|r 35 VCVRGEISGYRGIHS-SGHAYFSLKDN-------HSRIDAIIWKGTLNKIE--FLP---EEGIEFLVIGKIT-TFPGSSK 100 (529) Q Consensus 35 ~~v~gEis~~~~~~~-sGH~Yf~lkd~-------~a~i~~~~~~~~~~~~~--~~~---~~G~~v~~~g~~~-~y~~~g~ 100 (529) ++..|| |-++.-.+ .=|+|+|--.+ .....|.|+.+....-. -++ -|||-|++.-.-. ||..+|= T Consensus 67 ~Ft~G~-sp~~Yl~af~~~LFLS~~~~DV~~Ai~~G~aAa~v~~~~~~~~~~~~qlRIAFDgDaVLFsDesE~vyq~~GL 145 (263) T pfam06189 67 AFTGGE-SPYRYLKAFNADLFLSANAEDVREALDAGVAAATVLPSTPDDSESDDQLRIAFDGDAVLFSDESERVYKAQGL 145 (263) T ss_pred EECCCC-CHHHHHHHHCCCEEEECCHHHHHHHHHCCCCEEEEECCCCCCCCCCCCEEEEECCCEEEECCCHHHHHHHHCH T ss_conf 541898-8266677607846866888999999975997199627886777888745897368747865707888986088 Q ss_pred EEEEEE----EEEECC---CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEE Q ss_conf 799999----710168---0079999999999765401226100163102652899984784258999999863059758 Q gi|254780791|r 101 YQIIIE----SLIPSG---SGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCRFPLR 173 (529) Q Consensus 101 ~ql~v~----~i~~~g---~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r~p~~ 173 (529) =.|.-. .-.|-. .-.++.++-++.+++ .. -.-|-||++||+.++.+-.-+++||+. |++. T Consensus 146 ~aF~~~E~~~a~~Pl~~GPfk~fL~~L~~lQ~~~------~~------~~~pIRtaLVTARsApaheRvIkTLR~-Wgi~ 212 (263) T pfam06189 146 EAFQEHERENADLPLSGGPFKDFLEALSRLQQKF------PR------YNCPIRTALVTARSAPAHERVIKTLRA-WGVR 212 (263) T ss_pred HHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHC------CC------CCCCEEEEEEECCCCCCHHHHHHHHHH-CCCC T ss_conf 9899868886229777698799999999999866------86------689757898862557513799999997-4960 Q ss_pred EEEEECCCCCCCHHHHHHHHH Q ss_conf 999721001111036799999 Q gi|254780791|r 174 VIIFPVKVQGDECPKEIANAI 194 (529) Q Consensus 174 ~~~~p~~vQG~~a~~~i~~ai 194 (529) | =.+..-|--....+.+++ T Consensus 213 i--DEa~FLGG~~K~~~L~~f 231 (263) T pfam06189 213 I--DEAFFLGGLDKGEFLKAF 231 (263) T ss_pred H--HHHHHHCCCCHHHHHHHH T ss_conf 2--165761788367899865 No 310 >cd07187 YvcK_like family of mostly uncharacterized proteins similar to B.subtilis YvcK. One member of this protein family, YvcK from Bacillus subtilis, has been proposed to play a role in carbon metabolism, since its function is essential for growth on intermediates of the Krebs cycle and the pentose phosphate pathway. In general, this family of mostly uncharacterized proteins is related to the CofD-like protein family. CofD has been characterized as a 2-phospho-L-lactate transferase involved in F420 biosynthesis. This family appears to have the same conserved phosphate binding site as the other family in this hierarchy, but a different substrate binding site. Probab=59.34 E-value=12 Score=16.90 Aligned_cols=109 Identities=26% Similarity=0.253 Sum_probs=62.9 Q ss_pred CCCCCCCCEEEEEECCCHHHHHHHHHHHH--HCCCE-EEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEE Q ss_conf 63102652899984784258999999863--05975-8999721001111036799999999741003576777589995 Q gi|254780791|r 139 NPIPFIPKIIAVITSPTGAVIRDILQRIS--CRFPL-RVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILA 215 (529) Q Consensus 139 ~~lP~~p~~i~vits~~~a~~~D~~~~~~--~r~p~-~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~ 215 (529) .|.-.-|-++..... +|-.+.-=.++-. ...|+ +|.+-|..+ .|.++.++||+. -|+||++ T Consensus 119 lP~t~~~~~L~a~l~-dG~~i~Ge~~I~~~~~~~~I~~v~l~p~~~---~~~~~a~~aI~~------------ADlIv~g 182 (308) T cd07187 119 LPVTNDPLHLVAELE-DGTIVRGESRISHLELGSPIKRVFLEPPDP---KANPEALEAIEE------------ADLIVYG 182 (308) T ss_pred EECCCCCEEEEEEEC-CCCEEEEEECCCCCCCCCCCEEEEEECCCC---CCCHHHHHHHHH------------CCEEEEC T ss_conf 956389707999977-998998401025576789964899767999---989999999997------------9989987 Q ss_pred CCCCCHHHHHHCCHHHHHHHHHHCCCEE--EEEECC---CCCC-HHHHHHHCCC Q ss_conf 1688844422007699999997489048--852057---7752-5898864123 Q gi|254780791|r 216 RGGGSIEDLWHFNDEMIVRAIANSSIPI--ISAIGH---ETDW-TLADYAADLR 263 (529) Q Consensus 216 RGGGS~eDL~~FN~e~laraI~~~~iPV--isgIGH---E~D~-Tl~D~VAD~R 263 (529) =|-==..=+-.+=-..++.||++++-|. |.-+++ |||- |+.|+|.-+. T Consensus 183 PGSlyTSiiPnLlv~gi~~AI~~s~a~kV~i~Nl~tqpgeT~g~s~~dhv~~l~ 236 (308) T cd07187 183 PGSLYTSILPNLLVKGIAEAIRASKAPKVYICNLMTQPGETDGFTLSDHVRALL 236 (308) T ss_pred CCCCHHHHCCHHCCHHHHHHHHHCCCCEEEEEECCCCCHHHCCCCHHHHHHHHH T ss_conf 984066551142236699999967997899961789912316999999999999 No 311 >KOG1970 consensus Probab=59.27 E-value=12 Score=16.96 Aligned_cols=77 Identities=16% Similarity=0.240 Sum_probs=57.1 Q ss_pred ECCCCEEEEEEEEEEECCCCHHH---HHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHCCC Q ss_conf 52884379999971016800799---999999997654012261001631026528999847842589999998630597 Q gi|254780791|r 95 FPGSSKYQIIIESLIPSGSGTLL---TALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCRFP 171 (529) Q Consensus 95 y~~~g~~ql~v~~i~~~g~G~l~---~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r~p 171 (529) |++--+.++.++.-.|...-+|+ .+..+++.=|. -|+. -.|.++.+|-+||+|+|.+---.+++|..-.+ T Consensus 63 ~~~~d~~elW~eKy~P~t~eeLAVHkkKI~eVk~WL~---~~~~----~~~~l~~~iLLltGPsGcGKSTtvkvLskelg 135 (634) T KOG1970 63 NEKEDEFELWVEKYKPRTLEELAVHKKKISEVKQWLK---QVAE----FTPKLGSRILLLTGPSGCGKSTTVKVLSKELG 135 (634) T ss_pred CCCCCCCCHHHHHHCCCCHHHHHHHHHHHHHHHHHHH---HHHH----HCCCCCCEEEEEECCCCCCCHHHHHHHHHHHC T ss_conf 8773333225776085628887551774899999999---9997----45366760799857988871319999998648 Q ss_pred EEEEEEE Q ss_conf 5899972 Q gi|254780791|r 172 LRVIIFP 178 (529) Q Consensus 172 ~~~~~~p 178 (529) ++++=|- T Consensus 136 ~~~~Ew~ 142 (634) T KOG1970 136 YQLIEWS 142 (634) T ss_pred CEEEEEC T ss_conf 0212304 No 312 >cd03147 GATase1_Ydr533c_like Type 1 glutamine amidotransferase (GATase1)-like domain found in Saccharomyces cerevisiae Ydr533c protein. Type 1 glutamine amidotransferase (GATase1)-like domain found in Saccharomyces cerevisiae Ydr533c protein. This group includes proteins similar to S. cerevisiae Ydr533c. Ydr533c is upregulated in response to various stress conditions along with the heat shock family. The catalytic triad typical of GATase1domains is not conserved in this GATase1-like domain. However, in common with a typical GATase1domain, a reactive Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. This Cys together with a different His and Glu residue form a different catalytic triad from the typical GATase1domain. Ydr533c protein is a homodimer. Probab=59.18 E-value=12 Score=16.88 Aligned_cols=43 Identities=21% Similarity=0.318 Sum_probs=32.3 Q ss_pred CCCCEEEEECCCCCHHHHHHCCHH--HHHHHHHHCCCEEEEEECCCC Q ss_conf 777589995168884442200769--999999748904885205777 Q gi|254780791|r 207 PRPDIIILARGGGSIEDLWHFNDE--MIVRAIANSSIPIISAIGHET 251 (529) Q Consensus 207 ~~~D~iii~RGGGS~eDL~~FN~e--~laraI~~~~iPVisgIGHE~ 251 (529) ..||+|.|.=|.|..-||. .|+. .+.+.+++.. -+|++|.|-. T Consensus 93 ~~YdaVf~pGGhG~m~Dl~-~~~~l~~ll~~~~~~g-k~vaaVCHGp 137 (231) T cd03147 93 DDYGIFFVAGGHGTLFDFP-HATNLQKIAQQIYANG-GVVAAVCHGP 137 (231) T ss_pred HHCEEEEECCCCCCCCCCC-CCHHHHHHHHHHHHCC-CEEEEECCHH T ss_conf 7825999889885303242-3999999999999759-9799980688 No 313 >PRK12683 transcriptional regulator CysB-like protein; Reviewed Probab=58.95 E-value=12 Score=16.85 Aligned_cols=96 Identities=20% Similarity=0.244 Sum_probs=60.4 Q ss_pred CCCCCCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCC Q ss_conf 1026528999847842589999998630597-589997210011110367999999997410035767775899951688 Q gi|254780791|r 141 IPFIPKIIAVITSPTGAVIRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGG 219 (529) Q Consensus 141 lP~~p~~i~vits~~~a~~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGG 219 (529) .+.-.-+||...+...-.+-.++....++|| ++|.+.. +...++++++.. ..+|+.|..-+=. T Consensus 90 ~~~G~l~Ig~~~~~~~~~lp~~l~~f~~~~P~v~i~l~~------~~~~~~~~~L~~----------g~~Dl~i~~~~~~ 153 (308) T PRK12683 90 RDSGRLTVATTHTQARYALPKVVRQFTEVFPKVHLALRQ------GSPQEIAEMLLN----------GEADIGIATEALD 153 (308) T ss_pred CCCCEEEECCCCHHHHHCCCHHHHHHHHHCCCCEEEEEE------CCHHHHHHHHHC----------CCCCEEECCCCCC T ss_conf 777624200132011201684668999878996699960------786999999977----------9977880666778 Q ss_pred CHHHHHHCCHHHHHHHHHHCCCEEEEEECCC----CCCHHHHHH Q ss_conf 8444220076999999974890488520577----752589886 Q gi|254780791|r 220 SIEDLWHFNDEMIVRAIANSSIPIISAIGHE----TDWTLADYA 259 (529) Q Consensus 220 S~eDL~~FN~e~laraI~~~~iPVisgIGHE----~D~Tl~D~V 259 (529) ...|+-.+ -++..+.-+|.+-+|. ...|+.|+. T Consensus 154 ~~~~l~~~-------~l~~~~~~lv~~~~hpla~~~~v~l~dl~ 190 (308) T PRK12683 154 REPDLVSF-------PYYSWHHVVVVPKGHPLTGRENLTLEALA 190 (308) T ss_pred CCCCCEEE-------EECCCCEEEEECCCCCCCCCCCCCHHHHC T ss_conf 88771687-------61245158984699953369999999985 No 314 >PRK12478 enoyl-CoA hydratase; Provisional Probab=58.69 E-value=13 Score=16.81 Aligned_cols=36 Identities=22% Similarity=0.256 Sum_probs=23.4 Q ss_pred HHHHHHCCCEEEEEE-CCCC--CCHHHHHHHCCCCCCCHH Q ss_conf 999974890488520-5777--525898864123777214 Q gi|254780791|r 233 VRAIANSSIPIISAI-GHET--DWTLADYAADLRAPTPTG 269 (529) Q Consensus 233 araI~~~~iPVisgI-GHE~--D~Tl~D~VAD~Ra~TPTa 269 (529) ..+|.+|+.|||.+| |+-. =.. .=+.+|+|..+++| T Consensus 105 ~~~i~~~~kPvIAaVnG~a~GgG~~-Lal~cDi~iAs~~A 143 (298) T PRK12478 105 FMAIWRASKPVIAQVHGWCVGGASD-YALCADIVIASEDA 143 (298) T ss_pred HHHHHHCCCCEEEEECCEEEHHHHH-HHHHCCEEEECCCC T ss_conf 9999818998899987987367789-99867947853887 No 315 >PRK01231 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=58.63 E-value=13 Score=16.81 Aligned_cols=82 Identities=22% Similarity=0.286 Sum_probs=46.7 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHH---CCCEEEEEEECCC---CCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCC Q ss_conf 528999847842589999998630---5975899972100---1111036799999999741003576777589995168 Q gi|254780791|r 145 PKIIAVITSPTGAVIRDILQRISC---RFPLRVIIFPVKV---QGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGG 218 (529) Q Consensus 145 p~~i~vits~~~a~~~D~~~~~~~---r~p~~~~~~p~~v---QG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGG 218 (529) .++||||+-++.....+..+.+.+ .-.+++.+-+... -+.+.+ . .....+ ...+|++|..=|- T Consensus 4 ~k~vgIv~k~~~~~a~~~~~~l~~~L~~~gi~v~ld~~~a~~~~~~~~~--~-~~~~~~--------~~~~Dlii~lGGD 72 (296) T PRK01231 4 FRNIGLIGRLGSSQVVETLRRLKRFLLDRHLHVILEEETAEVLPGHGLQ--T-VSRKLL--------GEVCDLVIVVGGD 72 (296) T ss_pred CCEEEEEEECCCHHHHHHHHHHHHHHHHCCCEEEEEHHHHHHCCCCCCC--C-CCHHHC--------CCCEEEEEEECCC T ss_conf 9899999848987999999999999987889999932477665878866--2-453341--------6530499995787 Q ss_pred CCHHHHHHCCHHHHHHHHHHCCCEEEE Q ss_conf 884442200769999999748904885 Q gi|254780791|r 219 GSIEDLWHFNDEMIVRAIANSSIPIIS 245 (529) Q Consensus 219 GS~eDL~~FN~e~laraI~~~~iPVis 245 (529) |.+- ..+|..+...+||+. T Consensus 73 GT~L--------~~~~~~~~~~~PilG 91 (296) T PRK01231 73 GSLL--------GAARALARHNVPVLG 91 (296) T ss_pred CHHH--------HHHHHHCCCCCCEEE T ss_conf 2899--------999996036997898 No 316 >TIGR01121 D_amino_aminoT D-amino acid aminotransferase; InterPro: IPR005784 D-amino acid aminotransferase (2.6.1.21 from EC) catalyzes transamination between various D-amino acids and alpha-keto acids. This enzyme is a homodimer. The pyridoxal phosphate attachment site is a Lys in the motif Cys-Asp-Ile-Lys-Ser-Leu-Asn. Specificity is broad for various D-amino acids, and differs among members of the family .; GO: 0008483 transaminase activity, 0008152 metabolic process. Probab=58.61 E-value=12 Score=16.96 Aligned_cols=32 Identities=38% Similarity=0.481 Sum_probs=22.8 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHC-C--CCEEEEEE Q ss_conf 888898622999999999997400-1--71899999 Q gi|254780791|r 8 NSLDHPEYSVSELSYHLKHIVESN-L--SHVCVRGE 40 (529) Q Consensus 8 ~~~~~~~~svs~l~~~i~~~l~~~-~--~~~~v~gE 40 (529) -....| ||.-+|-..++.+++.+ . ++||++-- T Consensus 57 I~l~lp-ys~~~L~~~L~~Lv~~N~~~~G~vY~Q~T 91 (278) T TIGR01121 57 IRLDLP-YSKEELHELLHELVEENNINTGGVYLQVT 91 (278) T ss_pred CCCCCC-CCHHHHHHHHHHHHHHCCCCCCCEEEEEC T ss_conf 413447-67789999999999862865795799970 No 317 >cd06277 PBP1_LacI_like_1 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=58.03 E-value=13 Score=16.73 Aligned_cols=20 Identities=15% Similarity=0.311 Sum_probs=11.5 Q ss_pred CCCCCCHHHHHHHHHHHHHHC Q ss_conf 898622999999999997400 Q gi|254780791|r 11 DHPEYSVSELSYHLKHIVESN 31 (529) Q Consensus 11 ~~~~~svs~l~~~i~~~l~~~ 31 (529) +++||. +++...|...+... T Consensus 12 ~snpFf-~~l~~gie~~~~~~ 31 (268) T cd06277 12 NSPAFY-SEIYRAIEEEAKKY 31 (268) T ss_pred CCCCHH-HHHHHHHHHHHHHC T ss_conf 899779-99999999999984 No 318 >pfam00318 Ribosomal_S2 Ribosomal protein S2. Probab=57.92 E-value=13 Score=16.72 Aligned_cols=43 Identities=23% Similarity=0.514 Sum_probs=36.9 Q ss_pred CCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHH Q ss_conf 7758999516888444220076999999974890488520577752589886 Q gi|254780791|r 208 RPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYA 259 (529) Q Consensus 208 ~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~V 259 (529) .||+|||.= +-+|...++.-..+.||||+=|---.|.+++||+ T Consensus 137 ~Pd~vii~d---------~~~~~~ai~Ea~~l~IP~I~ivDTn~~p~~idyp 179 (205) T pfam00318 137 LPDLVIVVD---------PNKEHIAIKEASKLGIPVIAIVDTNCDPDLIDYP 179 (205) T ss_pred CCCEEEEEC---------CCCCHHHHHHHHHCCCCEEEECCCCCCCCCCCEE T ss_conf 898699857---------8645389999987599756540599995438778 No 319 >cd03576 NTR_PCOLCE NTR domain, PCOLCE subfamily; Procollagen C-endopeptidase enhancers (PCOLCEs) are extracellular matrix proteins that enhance the activity of procollagen C-proteases, by binding to the procollagen I C-peptide. They contain a C-terminal NTR domain, which have been suggested to possess inhibitory functions towards specific serine proteases but not towards metzincins, which are inhibited by the related TIMPs. Probab=57.69 E-value=13 Score=16.69 Aligned_cols=65 Identities=23% Similarity=0.369 Sum_probs=41.5 Q ss_pred HHHHHCC--CCEEEEEEECCCCCCCCCCEEEEEE----ECCCCEEEEEEECCCCCCCC------C--CCCCCCEEEEEEE Q ss_conf 9974001--7189999970543568886279998----74894799999735210586------6--8145988999999 Q gi|254780791|r 26 HIVESNL--SHVCVRGEISGYRGIHSSGHAYFSL----KDNHSRIDAIIWKGTLNKIE------F--LPEEGIEFLVIGK 91 (529) Q Consensus 26 ~~l~~~~--~~~~v~gEis~~~~~~~sGH~Yf~l----kd~~a~i~~~~~~~~~~~~~------~--~~~~G~~v~~~g~ 91 (529) ..++.+| +.+.|.|-|+++...+.|||.++++ ||++=.| -+.+....++ . .++-|...++-|. T Consensus 14 g~l~s~fC~S~FVItG~V~~l~~~~~s~~atvsv~~vyK~g~L~i---~~~G~~~svk~~~~C~~CP~l~~G~~YiiMG~ 90 (124) T cd03576 14 GTLESNYCSSEFVITGTVISMTRGPGSLHATVSIIKLYKEGRLAI---TQAGKTMSVKITVVCKQCPLLRRGLNYILMGQ 90 (124) T ss_pred CCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEHHHHCCCCCCC---CCCCCCEEEEEEEECCCCCCCCCCCEEEEEEE T ss_conf 860134446666899999897887995699999424013477565---44787578999850899987627873799988 Q ss_pred EE Q ss_conf 66 Q gi|254780791|r 92 IT 93 (529) Q Consensus 92 ~~ 93 (529) |+ T Consensus 91 Vd 92 (124) T cd03576 91 VD 92 (124) T ss_pred EC T ss_conf 64 No 320 >TIGR02628 fuculo_kin_coli L-fuculokinase; InterPro: IPR013450 Proteins in this entry are L-fuculokinases from the clade that includes the Escherichia coli enzyme P11553 from SWISSPROT. They catalyze the second step in fucose catabolism and belong to the FGGY family of carbohydrate kinases. These enzymes are encoded by the kinase (K) gene of the fucose (fuc) operon.; GO: 0008737 L-fuculokinase activity, 0005975 carbohydrate metabolic process. Probab=57.32 E-value=13 Score=16.68 Aligned_cols=18 Identities=6% Similarity=0.141 Sum_probs=9.7 Q ss_pred CCCCEEEEEEEEEEECCC Q ss_conf 459889999996675288 Q gi|254780791|r 81 EEGIEFLVIGKITTFPGS 98 (529) Q Consensus 81 ~~G~~v~~~g~~~~y~~~ 98 (529) ..|.-+.+=..-.+|=|- T Consensus 284 ~~G~T~ElDs~~g~yNPa 301 (473) T TIGR02628 284 SAGVTVELDSQAGLYNPA 301 (473) T ss_pred HCCCCEEEECCCCCCCCH T ss_conf 468535750325773732 No 321 >TIGR00763 lon ATP-dependent protease La; InterPro: IPR004815 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This signature defines the bacterial and eukaryotic lon proteases, which are ATP-dependent serine peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SF). This family of sequences does not include the archaeal lon homologs, IPR004663 from INTERPRO. In the eukaryotes the majority of the proteins are located in the mitochondrial matrix , . In yeast, Pim1, is located in the mitochondrial matrix, is required for mitochondrial function, is constitutively expressed but is increased after thermal stress, suggesting that Pim1 may play a role in the heat shock response .; GO: 0004176 ATP-dependent peptidase activity, 0005524 ATP binding, 0006510 ATP-dependent proteolysis. Probab=57.15 E-value=13 Score=16.62 Aligned_cols=35 Identities=17% Similarity=0.134 Sum_probs=20.5 Q ss_pred HHHHHHCCH--HHHHHHHHHCCC--EEEEEECCCCCCHHHHHHH Q ss_conf 444220076--999999974890--4885205777525898864 Q gi|254780791|r 221 IEDLWHFND--EMIVRAIANSSI--PIISAIGHETDWTLADYAA 260 (529) Q Consensus 221 ~eDL~~FN~--e~laraI~~~~i--PVisgIGHE~D~Tl~D~VA 260 (529) +.|+..||. ..=+..+-...- =|+.-- .-|||||| T Consensus 209 ~~~~~~~~~l~~~~~~~~~~~~~S~~v~~~P-----~~LaD~~A 247 (941) T TIGR00763 209 FRELISLSKLFREQPALLSALEDSANVIDEP-----GRLADLVA 247 (941) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCCCCCC-----HHHHHHHH T ss_conf 9999852312000377743254447510462-----25888999 No 322 >cd00764 Eukaryotic_PFK Phosphofructokinase, a key regulatory enzyme in glycolysis, catalyzes the phosphorylation of fructose-6-phosphate to fructose-1,6-biphosphate. The members belong to a subfamily of the PFKA family (cd00363) and include eukaryotic ATP-dependent phosphofructokinases. These have evolved from the bacterial PFKs by gene duplication and fusion events and exhibit complex allosteric behavior. Probab=56.97 E-value=13 Score=16.59 Aligned_cols=35 Identities=20% Similarity=0.298 Sum_probs=24.2 Q ss_pred CCCEEEEECCCCCHHHHHHCCHHHHHHHHH-HCCCEEEE Q ss_conf 775899951688844422007699999997-48904885 Q gi|254780791|r 208 RPDIIILARGGGSIEDLWHFNDEMIVRAIA-NSSIPIIS 245 (529) Q Consensus 208 ~~D~iii~RGGGS~eDL~~FN~e~laraI~-~~~iPVis 245 (529) .+|.+|++=|-||+.-+..+-+ +|.-| ++.||+|. T Consensus 478 ~Id~LivIGG~gs~~ga~~L~~---~r~~y~~~~IP~v~ 513 (762) T cd00764 478 GIDGLIIVGGFEAYKGLLQLRE---AREQYEEFCIPMVL 513 (762) T ss_pred CCCEEEEECCCHHHHHHHHHHH---HHHHCCCCCCCEEE T ss_conf 9988999668079999999997---56424345886797 No 323 >PRK13054 lipid kinase; Reviewed Probab=56.79 E-value=13 Score=16.57 Aligned_cols=109 Identities=18% Similarity=0.229 Sum_probs=62.2 Q ss_pred CCEEEEEE--CCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHH-HHHHHHHHCCCCCCCCCCEEEEECCCCCH Q ss_conf 52899984--7842589999998630597589997210011110367999-99999741003576777589995168884 Q gi|254780791|r 145 PKIIAVIT--SPTGAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIAN-AILQLNTLKEGRTCPRPDIIILARGGGSI 221 (529) Q Consensus 145 p~~i~vit--s~~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~-ai~~~~~~~~~~~~~~~D~iii~RGGGS~ 221 (529) +|-.-|+- |..+..++|++..+...+ +++.++.+.=+|+ +.++++ |++ ..+|+||++=|=|++ T Consensus 4 ~k~lli~N~~~~g~~~~~~~~~~l~~~~-~~~~~~~T~~~gd--a~~la~~a~~-----------~g~d~vv~~GGDGTv 69 (299) T PRK13054 4 PKSLLILNGKSAGNEELREAVGLLREEG-HTLHVRVTWEKGD--AARYVEEALA-----------LGVATVIAGGGDGTL 69 (299) T ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHCC-CEEEEEECCCCCH--HHHHHHHHHH-----------CCCCEEEEECCCHHH T ss_conf 5699998170005068999999999769-8799996489745--9999999987-----------799899998772299 Q ss_pred HHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHCCCCC-CCHHHHHHHC Q ss_conf 44220076999999974890488520577752589886412377-7214567633 Q gi|254780791|r 222 EDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYAADLRAP-TPTGAAEMAV 275 (529) Q Consensus 222 eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~VAD~Ra~-TPTaAAElav 275 (529) - +|+..++.++.|.-..+|==.==|--||+-.+--| .|.+|++.+. T Consensus 70 ~--------evvngl~~~~~~~~~~LgiiP~GTgNdfAR~lgip~d~~~a~~~i~ 116 (299) T PRK13054 70 N--------EVATALAQLEGDDRPSLGILPLGTANDFATAAGIPLEPDKALKLAI 116 (299) T ss_pred H--------HHHHHHHHCCCCCCCEEEEECCCCHHHHHHHCCCCCCHHHHHHHHH T ss_conf 9--------9999997478877863999638756899998299989999997784 No 324 >TIGR01415 trpB_rel pyridoxal-phosphate dependent TrpB-like enzyme; InterPro: IPR006316 These sequences represent a family of pyridoxal-phosphate dependent enzymes that are closely related to the beta subunit of tryptophan synthase. ; GO: 0004834 tryptophan synthase activity, 0030170 pyridoxal phosphate binding, 0000162 tryptophan biosynthetic process. Probab=56.75 E-value=9 Score=18.10 Aligned_cols=154 Identities=23% Similarity=0.328 Sum_probs=80.0 Q ss_pred CEEEEEEECCCCCCCCC----CEEEEEEECCCCEE----EEEEECCCCCCCCCCCCCCCEEEEE-EEEEEECCC---CEE Q ss_conf 18999997054356888----62799987489479----9999735210586681459889999-996675288---437 Q gi|254780791|r 34 HVCVRGEISGYRGIHSS----GHAYFSLKDNHSRI----DAIIWKGTLNKIEFLPEEGIEFLVI-GKITTFPGS---SKY 101 (529) Q Consensus 34 ~~~v~gEis~~~~~~~s----GH~Yf~lkd~~a~i----~~~~~~~~~~~~~~~~~~G~~v~~~-g~~~~y~~~---g~~ 101 (529) .|+-+-|=...+++|+= -|.||.=+|.--.+ .|==|.+..+.-.-. =|+++-|. .++|++.+= .=+ T Consensus 89 rIyyK~E~~~~tGSHK~NTA~AqAYYak~~G~k~l~TETGAGQWG~AlS~A~al--~~L~~~VfMvr~Sf~~KP~Rk~lM 166 (426) T TIGR01415 89 RIYYKYESVSPTGSHKINTAIAQAYYAKKEGAKRLVTETGAGQWGSALSLAGAL--FGLKVKVFMVRVSFQQKPYRKYLM 166 (426) T ss_pred EEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHHH--HCCEEEEEEEECCHHCCHHHHHHH T ss_conf 898740575778885244799999987554843743236886478999999987--289258988741210484689999 Q ss_pred EEEEEEEEECC--CCHHHHHHHHHHHHHHHHHCCCCCCCCCC-CCCCCEEEEEECCCHHHHHHHHHHHHHCCC----EEE Q ss_conf 99999710168--00799999999997654012261001631-026528999847842589999998630597----589 Q gi|254780791|r 102 QIIIESLIPSG--SGTLLTALEKRKKKLLEEGLFSDQHKNPI-PFIPKIIAVITSPTGAVIRDILQRISCRFP----LRV 174 (529) Q Consensus 102 ql~v~~i~~~g--~G~l~~~~e~lk~~L~~eGlfd~~~k~~l-P~~p~~i~vits~~~a~~~D~~~~~~~r~p----~~~ 174 (529) +++=-.+-||- .-+.= |+-|+ .- |.-|=..||-=|+ |+++.+..=..||= ++- T Consensus 167 ~~yGa~V~PSPS~~T~~G------R~~L~-----------e~gP~~PGSLGiAISE---A~e~al~~~~~~Y~lGSVLnh 226 (426) T TIGR01415 167 ELYGAEVIPSPSELTEFG------RKVLK-----------EDGPDHPGSLGIAISE---AIEEALSDEDTKYSLGSVLNH 226 (426) T ss_pred HHCCCEECCCCCCCCHHH------HHHHH-----------CCCCCCCCHHHHHHHH---HHHHHHHCCCCEECCCHHHHH T ss_conf 870884336887751346------88874-----------0677778616789999---999997089944446305666 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH Q ss_conf 99721001111036799999999741003576777589995168884 Q gi|254780791|r 175 IIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI 221 (529) Q Consensus 175 ~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~ 221 (529) +|.+=+|=|=+|-++ |+.+. ..|||||=|=||||- T Consensus 227 vLlHqTViGLEaKkQ----lE~~G--------e~PD~~~GcVGGGSN 261 (426) T TIGR01415 227 VLLHQTVIGLEAKKQ----LEEAG--------EKPDVIIGCVGGGSN 261 (426) T ss_pred HHHHHHHHHHHHHHH----HHHCC--------CCCCEEEEEECCCHH T ss_conf 788766787645553----24107--------999889997338544 No 325 >PRK06642 single-stranded DNA-binding protein; Provisional Probab=56.46 E-value=3.2 Score=22.07 Aligned_cols=79 Identities=18% Similarity=0.352 Sum_probs=47.9 Q ss_pred CCCCEEEEEEEC---CCCCCCCCCE--EEEEE------ECC--------CCEEEEEEECCC-CCCCCCCCCCCCEEEEEE Q ss_conf 017189999970---5435688862--79998------748--------947999997352-105866814598899999 Q gi|254780791|r 31 NLSHVCVRGEIS---GYRGIHSSGH--AYFSL------KDN--------HSRIDAIIWKGT-LNKIEFLPEEGIEFLVIG 90 (529) Q Consensus 31 ~~~~~~v~gEis---~~~~~~~sGH--~Yf~l------kd~--------~a~i~~~~~~~~-~~~~~~~~~~G~~v~~~g 90 (529) .+..|-+.|-+. .++. ..+|. +=|+| ||. ..-++||+|... +..+.-.++.|++|.|.| T Consensus 4 s~NkV~LiG~lg~DPElr~-t~~G~~v~~fslA~~~~~k~~~~ge~~e~t~w~~vv~~~k~~ae~~~~yl~KG~~V~VeG 82 (152) T PRK06642 4 SLNKVILIGNVGRDPEIRT-TGEGKKIINLSLATTETWKDRITSERKERTEWHRVVIFSEGLVSVVERYVTKGSKLYIEG 82 (152) T ss_pred CCEEEEEEEECCCCCEEEE-CCCCCEEEEEEEEEEECEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEE T ss_conf 7518999863576965888-799976899999973010112577624311688877633899999998656888899966 Q ss_pred EEEE--EC-----CCCEEEEEEEEEEE Q ss_conf 9667--52-----88437999997101 Q gi|254780791|r 91 KITT--FP-----GSSKYQIIIESLIP 110 (529) Q Consensus 91 ~~~~--y~-----~~g~~ql~v~~i~~ 110 (529) ++.. |. .|+...++|+.+.+ T Consensus 83 ~L~~r~y~dkdG~~r~~teIv~~~~~~ 109 (152) T PRK06642 83 SLQTRKWNDNSGQEKYTTEVVLQNFNS 109 (152) T ss_pred EEEECCCCCCCCCEEEEEEEEEECCCC T ss_conf 787535598999989899999967876 No 326 >PRK11637 hypothetical protein; Provisional Probab=56.43 E-value=14 Score=16.53 Aligned_cols=10 Identities=10% Similarity=0.182 Sum_probs=3.2 Q ss_pred HHHHHHHHHH Q ss_conf 6699999888 Q gi|254780791|r 283 SSLINLEARL 292 (529) Q Consensus 283 ~~L~~l~~RL 292 (529) .+|......| T Consensus 30 ~~I~~~~~~l 39 (404) T PRK11637 30 ADIAAKERAV 39 (404) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 327 >pfam03100 CcmE CcmE. CcmE is the product of one of a cluster of Ccm genes that are necessary for cytochrome c biosynthesis in eubacteria. Expression of these proteins is induced when the organisms are grown under anaerobic conditions with nitrate or nitrite as the final electron acceptor. Probab=56.40 E-value=14 Score=16.52 Aligned_cols=81 Identities=14% Similarity=0.201 Sum_probs=50.5 Q ss_pred CCCHHHHHHHHHHHHHHCCCCEEEEEEEC--CCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEE Q ss_conf 62299999999999740017189999970--5435688862799987489479999973521058668145988999999 Q gi|254780791|r 14 EYSVSELSYHLKHIVESNLSHVCVRGEIS--GYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGK 91 (529) Q Consensus 14 ~~svs~l~~~i~~~l~~~~~~~~v~gEis--~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~ 91 (529) -||+||+.. ...+. =..+.|-|-|. .+. +..+..+-|.+-|..+.|... +.+ -++=..++|..|++.|+ T Consensus 36 f~tPsEl~~---~~~~~-~~~iRvgG~V~~gSi~-~~~~~~~~F~itD~~~~i~V~-Y~G---~lPdlF~Eg~gVVveG~ 106 (130) T pfam03100 36 FYTPSEVAA---GKAPP-GRRIRLGGLVEEGSVV-RDDGLTVRFVVTDGAASVPVR-YTG---ILPDLFREGQGVVAEGR 106 (130) T ss_pred EECHHHHHC---CCCCC-CCEEEEEEEEECCCEE-ECCCCEEEEEEECCCCEEEEE-EEC---CCCHHHHCCCCEEEEEE T ss_conf 987999825---77666-8689998899608679-679977999998489469999-947---79656627980899999 Q ss_pred EEEECCCCEEEEEEEEE Q ss_conf 66752884379999971 Q gi|254780791|r 92 ITTFPGSSKYQIIIESL 108 (529) Q Consensus 92 ~~~y~~~g~~ql~v~~i 108 (529) .+ ..| -|.++.+ T Consensus 107 ~~---~~g--~F~A~~l 118 (130) T pfam03100 107 LD---ADG--VFVADEV 118 (130) T ss_pred EC---CCC--EEEEEEE T ss_conf 88---899--8999799 No 328 >PRK07921 RNA polymerase sigma factor SigB; Reviewed Probab=56.38 E-value=14 Score=16.52 Aligned_cols=45 Identities=22% Similarity=0.344 Sum_probs=28.0 Q ss_pred CCHHHHHHHHHHCCCE--EEEEECCCCCCHHHHHHHCCCCCCCHHHH Q ss_conf 0769999999748904--88520577752589886412377721456 Q gi|254780791|r 227 FNDEMIVRAIANSSIP--IISAIGHETDWTLADYAADLRAPTPTGAA 271 (529) Q Consensus 227 FN~e~laraI~~~~iP--VisgIGHE~D~Tl~D~VAD~Ra~TPTaAA 271 (529) ...+.|......+.-| .=..||.+-|.|+.|+++|..+++|-... T Consensus 197 i~~~~v~~~l~~~~~~~SLD~~vg~d~~~~l~d~i~D~~~~~~e~~~ 243 (320) T PRK07921 197 IPEEKINDLLEHSRDPVSLDMPVGSDEEAPLGDFIEDAEAMSAENAV 243 (320) T ss_pred CCHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCHHHHH T ss_conf 99999999999728883667766888753033244056789989999 No 329 >cd07022 S49_Sppa_36K_type Signal peptide peptidase A (SppA) 36K type, a serine protease, has catalytic Ser-Lys dyad. Signal peptide peptidase A (SppA; Peptidase S49; Protease IV) 36K type: SppA is found in all three domains of life and is involved in the cleavage of signal peptides after their removal from the precursor proteins by signal peptidases. Members in this subfamily are all bacterial and include sohB peptidase and protein C. These are sometimes referred to as 36K type since they contain only one domain, unlike E. coli SppA that also contains an amino-terminal domain. Site-directed mutagenesis and sequence analysis have shown these SppAs to be serine proteases. The predicted active site serine for members in this family occurs in a transmembrane domain. Mutagenesis studies also suggest that the catalytic center comprises a Ser-Lys dyad and not the usual Ser-His-Asp catalytic triad found in the majority of serine proteases. Probab=55.91 E-value=14 Score=16.46 Aligned_cols=77 Identities=25% Similarity=0.251 Sum_probs=52.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEE--EECCCCCHHHHHHCCHHHHHHHHHHCC--CEEEEEEC---CCCCC Q ss_conf 011110367999999997410035767775899--951688844422007699999997489--04885205---77752 Q gi|254780791|r 181 VQGDECPKEIANAILQLNTLKEGRTCPRPDIII--LARGGGSIEDLWHFNDEMIVRAIANSS--IPIISAIG---HETDW 253 (529) Q Consensus 181 vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~ii--i~RGGGS~eDL~~FN~e~laraI~~~~--iPVisgIG---HE~D~ 253 (529) ..|......|+++|+.+.... ++..|| |-=+||+.. -.+++..+|..++ .||++=++ +.--+ T Consensus 21 ~~g~~~~~~i~~~l~~A~~d~------~vkaivL~InSPGG~~~-----~s~~i~~~I~~~~~~KPVva~~~~~~aSggY 89 (214) T cd07022 21 SSGLTSYEGIAAAIRAALADP------DVRAIVLDIDSPGGEVA-----GVFELADAIRAARAGKPIVAFVNGLAASAAY 89 (214) T ss_pred CCCCCCHHHHHHHHHHHHCCC------CCCEEEEEEECCCCCHH-----HHHHHHHHHHHHCCCCCEEEEECCCHHHHHH T ss_conf 777558999999999995099------97589999979897689-----9999999999860899899998881156899 Q ss_pred HHHHHHHCCCCCCCHH Q ss_conf 5898864123777214 Q gi|254780791|r 254 TLADYAADLRAPTPTG 269 (529) Q Consensus 254 Tl~D~VAD~Ra~TPTa 269 (529) -|+ ..||.=+.+|+. T Consensus 90 ~iA-saad~I~a~~~s 104 (214) T cd07022 90 WIA-SAADRIVVTPTA 104 (214) T ss_pred HHH-HHCCCEEECCCC T ss_conf 998-725705877764 No 330 >cd01889 SelB_euk SelB subfamily. SelB is an elongation factor needed for the co-translational incorporation of selenocysteine. Selenocysteine is coded by a UGA stop codon in combination with a specific downstream mRNA hairpin. In bacteria, the C-terminal part of SelB recognizes this hairpin, while the N-terminal part binds GTP and tRNA in analogy with elongation factor Tu (EF-Tu). It specifically recognizes the selenocysteine charged tRNAsec, which has a UCA anticodon, in an EF-Tu like manner. This allows insertion of selenocysteine at in-frame UGA stop codons. In E. coli SelB binds GTP, selenocysteyl-tRNAsec and a stem-loop structure immediately downstream of the UGA codon (the SECIS sequence). The absence of active SelB prevents the participation of selenocysteyl-tRNAsec in translation. Archaeal and animal mechanisms of selenocysteine incorporation are more complex. Although the SECIS elements have different secondary structures and conserved elements between archaea and euk Probab=55.77 E-value=14 Score=16.44 Aligned_cols=67 Identities=16% Similarity=0.213 Sum_probs=39.0 Q ss_pred EEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEE Q ss_conf 99997101680079999999999765401226100163102652899984784258999999863059758999 Q gi|254780791|r 103 IIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCRFPLRVII 176 (529) Q Consensus 103 l~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r~p~~~~~ 176 (529) +.+..||-...-.....++++++.|.. +++...-...| |-=|++.+|.++.|....+.+.-|..+++ T Consensus 125 v~iNK~D~v~~~~~~~~~~~i~~~l~~--~l~~~~~~~~~-----iipiSA~~G~gi~eL~~~i~~lip~p~~~ 191 (192) T cd01889 125 VVLNKIDLIPEEERERKIEKMKKKLQK--TLEKTRFKNSP-----IIPVSAKPGGGEAELGKDLNNLIVLPLIL 191 (192) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHH--HHHHCCCCCCE-----EEEEECCCCCCHHHHHHHHHHHCCCCCCC T ss_conf 999741278815779999999999999--98653899984-----99957898849899998887618999636 No 331 >cd00763 Bacterial_PFK Phosphofructokinase, a key regulatory enzyme in glycolysis, catalyzes the phosphorylation of fructose-6-phosphate to fructose-1,6-biphosphate. The members belong to a subfamily of the PFKA family (cd00363) and include bacterial ATP-dependent phosphofructokinases. These are allosrterically regulated homotetramers; the subunits are of about 320 amino acids. Probab=55.73 E-value=14 Score=16.44 Aligned_cols=50 Identities=16% Similarity=0.234 Sum_probs=24.6 Q ss_pred CCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHCCCCCCCHHH Q ss_conf 75899951688844422007699999997489048852057775258988641237772145 Q gi|254780791|r 209 PDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYAADLRAPTPTGA 270 (529) Q Consensus 209 ~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~VAD~Ra~TPTaA 270 (529) .|.++++=|-||....+ .+++..+||| ||---+|--| .-.|...=-.||+ T Consensus 93 Id~Li~IGGdgS~~~a~---------~l~e~~i~vi-gIPkTIDNDi--~gtd~t~GfdTAv 142 (317) T cd00763 93 IDALVVIGGDGSYMGAM---------RLTEHGFPCV-GLPGTIDNDI--PGTDYTIGFDTAL 142 (317) T ss_pred CCEEEEECCCHHHHHHH---------HHHHCCCCEE-EECCCCCCCC--CCCCCCCCHHHHH T ss_conf 99899956945899999---------9997499779-7135446788--8878786889999 No 332 >cd01741 GATase1_1 Subgroup of proteins having the Type 1 glutamine amidotransferase (GATase1) domain. This group contains a subgroup of proteins having the Type 1 glutamine amidotransferase (GATase1) domain. GATase activity catalyses the transfer of ammonia from the amide side chain of glutamine to an acceptor substrate. Glutamine amidotransferases (GATase) includes the triad family of amidotransferases which have a conserved Cys-His-Glu catalytic triad in the glutaminase active site. In this subgroup this triad is conserved. GATase activity can be found in a range of biosynthetic enzymes, including: glutamine amidotransferase, formylglycinamide ribonucleotide, GMP synthetase , anthranilate synthase component II, glutamine-dependent carbamoyl phosphate synthase, cytidine triphosphate synthetase, gamma-glutamyl hydrolase, imidazole glycerol phosphate synthase and, cobyric acid synthase. Glutamine amidotransferase (GATase) domains can occur either as single polypeptides, as in glutamine Probab=55.64 E-value=14 Score=16.43 Aligned_cols=86 Identities=17% Similarity=0.299 Sum_probs=52.1 Q ss_pred EEEEEECCCH---HHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH-H Q ss_conf 8999847842---58999999863059758999721001111036799999999741003576777589995168884-4 Q gi|254780791|r 147 IIAVITSPTG---AVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI-E 222 (529) Q Consensus 147 ~i~vits~~~---a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~-e 222 (529) ||+||-.... ..+.|+++.... -++++.+|....... +.+ ...||.|||.=|-.|. + T Consensus 1 ~i~il~~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~~~~~~--~~~----------------~~~~dgvii~Gg~~~~~~ 61 (188) T cd01741 1 RILILQHDTPEGPGLFEDLLREAGA-ETIEIDVVDVYAGEL--LPD----------------LDDYDGLVILGGPMSVDE 61 (188) T ss_pred CEEEEECCCCCCCHHHHHHHHHCCC-CCEEEEEEECCCCCC--CCC----------------HHHCCEEEEECCCCCCCC T ss_conf 9899977999997489999996699-846899996666989--978----------------444088999699877786 Q ss_pred H--HHHCCHHHHHHHHHHCCCEEEE-EECCCC Q ss_conf 4--2200769999999748904885-205777 Q gi|254780791|r 223 D--LWHFNDEMIVRAIANSSIPIIS-AIGHET 251 (529) Q Consensus 223 D--L~~FN~e~laraI~~~~iPVis-gIGHE~ 251 (529) | -|-=..-++++.+++..+||+- --||.. T Consensus 62 ~~~pwi~~~~~~i~~~~~~~~PilGIC~G~Ql 93 (188) T cd01741 62 DDYPWLKKLKELIRQALAAGKPVLGICLGHQL 93 (188) T ss_pred CCCHHHHHHHHHHHHHHHCCCCEEEECHHHHH T ss_conf 78866999999999999879999998468999 No 333 >pfam09370 TIM-br_sig_trns TIM-barrel signal transduction protein. This domain is likely to have a TIM barrel fold related to IGPS. Although this family of proteins are functionally uncharacterized this domain is found as an N-terminal domain of sigma 54 -dependent transcriptional activators (enhancer-binding proteins) suggesting a potential role in signal recognition/receiving and signal transduction. Probab=55.62 E-value=7.9 Score=18.59 Aligned_cols=61 Identities=18% Similarity=0.391 Sum_probs=33.2 Q ss_pred CCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEC-------CCCCHHHHHHCCHH-----HHHHHHHH--CCCEEEEEE Q ss_conf 11110367999999997410035767775899951-------68884442200769-----99999974--890488520 Q gi|254780791|r 182 QGDECPKEIANAILQLNTLKEGRTCPRPDIIILAR-------GGGSIEDLWHFNDE-----MIVRAIAN--SSIPIISAI 247 (529) Q Consensus 182 QG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~R-------GGGS~eDL~~FN~e-----~laraI~~--~~iPVisgI 247 (529) -|.+|-.-|..-..... ..|.||+-- |-||+.-||+|-|- +.++.|.- -.+|||.|| T Consensus 18 igagaGtGlsAK~ae~g---------GaDlIi~ynsGrfRm~G~gSlagllpygdaN~iv~ema~Evlpvv~~tPViaGv 88 (268) T pfam09370 18 IGAGAGTGLSAKCEEAG---------GADLILIYNSGRYRMAGRGSLAGLLPYGNANEIVLEMAREVLPVVKHTPVLAGV 88 (268) T ss_pred EEEECCCCHHHHHHHHC---------CCCEEEEECCCHHHHCCCCCHHHHCCCCCHHHHHHHHHHHHHHHCCCCCEEEEC T ss_conf 99732511657899857---------986998615403443588313120135657699999998887553588758761 Q ss_pred CCCCC Q ss_conf 57775 Q gi|254780791|r 248 GHETD 252 (529) Q Consensus 248 GHE~D 252 (529) +- +| T Consensus 89 ~~-tD 92 (268) T pfam09370 89 CG-TD 92 (268) T ss_pred CC-CC T ss_conf 58-89 No 334 >COG1619 LdcA Uncharacterized proteins, homologs of microcin C7 resistance protein MccF [Defense mechanisms] Probab=55.47 E-value=14 Score=16.41 Aligned_cols=93 Identities=25% Similarity=0.299 Sum_probs=48.0 Q ss_pred HHHHHHHHHHCC-C-E-EEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH-HHHHHCCH-HHHH Q ss_conf 999999863059-7-5-8999721001111036799999999741003576777589995168884-44220076-9999 Q gi|254780791|r 159 IRDILQRISCRF-P-L-RVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI-EDLWHFND-EMIV 233 (529) Q Consensus 159 ~~D~~~~~~~r~-p-~-~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~-eDL~~FN~-e~la 233 (529) +.=+...+-.+| | . ..+|+-=-|- ..+..|-+.|..... .+...++-.||++|=.... ..-..+.+ |.|. T Consensus 190 L~~l~~l~GT~~~P~~~~~ILfLEd~~--~~~~~idR~L~qL~~---~gil~~v~gii~G~f~~~~~~~~~~~~~~~~v~ 264 (313) T COG1619 190 LSVLASLIGTPYMPDIEGKILFLEDVN--EKPYRIDRMLLQLKL---AGILEKVSGIILGRFTKCTPNEYDPSYTLEEVL 264 (313) T ss_pred HHHHHHHCCCCCCCCCCCCEEEEEECC--CCHHHHHHHHHHHHH---HCCHHHCCEEEEECCCCCCCCCCCCCHHHHHHH T ss_conf 999997358855427768689998489--887789999999987---085845858998347667777643203299999 Q ss_pred HHHHH-CCCEEEEE--ECCCCCCHHH Q ss_conf 99974-89048852--0577752589 Q gi|254780791|r 234 RAIAN-SSIPIISA--IGHETDWTLA 256 (529) Q Consensus 234 raI~~-~~iPVisg--IGHE~D~Tl~ 256 (529) +.++. ..||||++ +||+.+.-.+ T Consensus 265 ~~~~~~~~iPv~~~~~fGH~~p~~~l 290 (313) T COG1619 265 RDLALSLNIPVIYGLDFGHTQPNLTL 290 (313) T ss_pred HHHHHHCCCCEEECCCCCCCCCCEEE T ss_conf 98775259865868877788864036 No 335 >TIGR01296 asd_B aspartate-semialdehyde dehydrogenase; InterPro: IPR005986 Bacteria, plants and fungi metabolise aspartic acid to produce four amino acids - lysine, threonine, methionine and isoleucine - in a series of reactions known as the aspartate pathway. Additionally, several important metabolic intermediates are produced by these reactions, such as diaminopimelic acid, an essential component of bacterial cell wall biosynthesis, and dipicolinic acid, which is involved in sporulation in Gram-positive bacteria. Members of the animal kingdom do not posses this pathway and must therefore acquire these essential amino acids through their diet. Research into improving the metabolic flux through this pathway has the potential to increase the yield of the essential amino acids in important crops, thus improving their nutritional value. Additionally, since the enzymes are not present in animals, inhibitors of them are promising targets for the development of novel antibiotics and herbicides. For more information see . Aspartate-semialdehyde dehydrogenase, the second enzyme in the aspartate pathway, converts aspartyl phosphate to aspartate-semialdehyde, the branch point intermediate between the lysine and threonine/methionine pathways. Based on sequence alignments, the aspartate-semialdehyde dehydrogenase family appears to have two distinct subgroups, one found in most bacteria (Gram-positve and Gram-negative), while the other is found primarily in organisms lacking peptidoglycan (archaea,fungi and some bacteria). Most studies have been performed on enzymes isolated from Gram-negative bacteria , , , . The N-terminal domain forms the active site and NADP-binding pocket, while C-terminal domain is primarily involved in hydrophobic intersubunit contacts. The catalytic mechanism involves the formation of a covalent thioester acyl-enzyme intermediate mediated through nucleophilic attack by an active site cysteine residue on the substrate aspartyl phosphate. Release of inorganic phosphate is followed by hydride transfer from NADPH to yield the product. The recently described archaeal structure suggests that the two subgroups of aspartate semi-aldehyde dehydrogenase share similar structures and have an identical catalytic mechanism, despite their relatively low sequence identity . Unlike the bacterial enzymes, the archaeal enzyme utilised both NAD and NADP as cofactor. This entry represents the subgroup of aspartate-semialdehyde dehydrogenase most commonly found in bacteria. Note that some bacteria, such as Chlamydia, contain the enzyme from the other subgroup (IPR005676 from INTERPRO) instead.; GO: 0004073 aspartate-semialdehyde dehydrogenase activity, 0009086 methionine biosynthetic process, 0009088 threonine biosynthetic process. Probab=55.33 E-value=5.7 Score=19.87 Aligned_cols=113 Identities=19% Similarity=0.319 Sum_probs=54.6 Q ss_pred CEEEEEEEC-CCCCCCCCCCCCCC-EEEEEEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 479999973-52105866814598-8999999667528843799999710168007999999999976540122610016 Q gi|254780791|r 62 SRIDAIIWK-GTLNKIEFLPEEGI-EFLVIGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKN 139 (529) Q Consensus 62 a~i~~~~~~-~~~~~~~~~~~~G~-~v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~ 139 (529) -+|.=++|. |....-.|-|+--- -|+|.=+=|.|-..-..=|+|-.+-+.- +++.+. T Consensus 63 ekidIAlFSAGgsvSke~aP~a~k~g~~VIDntS~fR~D~dVPLVVPEVN~~~-------------------~~~~~~-- 121 (350) T TIGR01296 63 EKIDIALFSAGGSVSKEFAPKAAKAGVIVIDNTSAFRMDPDVPLVVPEVNLED-------------------LKEFNK-- 121 (350) T ss_pred CEEEEEEECCCHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCEEECCCCHHH-------------------HHCCCC-- T ss_conf 70334565153133488878886087579817620005789863443778477-------------------634788-- Q ss_pred CCCCCCCEEEEEECCCHHHHHHH--HHHHHHCCCE-EEEE--EE-CCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 31026528999847842589999--9986305975-8999--72-1001111036799999999741003 Q gi|254780791|r 140 PIPFIPKIIAVITSPTGAVIRDI--LQRISCRFPL-RVII--FP-VKVQGDECPKEIANAILQLNTLKEG 203 (529) Q Consensus 140 ~lP~~p~~i~vits~~~a~~~D~--~~~~~~r~p~-~~~~--~p-~~vQG~~a~~~i~~ai~~~~~~~~~ 203 (529) -|||.=|+=..++=+ |+-|.+.+.+ .|++ |- ++==|..+.+|+-+..+..-...|. T Consensus 122 --------kgIIANPNCstIqmv~~LkPl~~~~~ikrVvVSTYQAVSGAG~~g~~eL~nQtka~~~~~E~ 183 (350) T TIGR01296 122 --------KGIIANPNCSTIQMVVVLKPLHDEAKIKRVVVSTYQAVSGAGNKGVEELKNQTKAVLEGKEK 183 (350) T ss_pred --------CCCEECCCCCHHHHHHHHHHHHHHCCEEEEEEEECCCHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf --------88145886007779884046563067458999722003542799999989999998524588 No 336 >cd05402 NT_PAP_TUTase Nucleotidyltransferase (NT) domain of poly(A) polymerases and terminal uridylyl transferases. Poly(A) polymerases (PAPs) catalyze mRNA poly(A) tail synthesis, and terminal uridylyl transferases (TUTases) uridylate RNA. PAPs in this subgroup include human PAP alpha, mouse testis-specific cytoplasmic PAP beta, human nuclear PAP gamma, Saccharomyces cerevisiae PAP1, TRF4 and-5, Schizosaccharomyces pombe caffeine-induced death proteins -1, and -14, Caenorhabditis elegans Germ Line Development-2, and Chlamydomonas reinhardtii MUT68. This family also includes human U6 snRNA-specific TUTase1, and Trypanosoma brucei 3'-TUTase-1,-2, and 4. This family belongs to the Pol beta-like NT superfamily. In the majority of enzymes in this superfamily, two carboxylates, Dx[D/E], together with a third more distal carboxylate, coordinate two divalent metal cations involved in a two-metal ion mechanism of nucleotide addition. For the majority of proteins in this family, these carboxyla Probab=55.15 E-value=10 Score=17.52 Aligned_cols=88 Identities=22% Similarity=0.286 Sum_probs=52.6 Q ss_pred HHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCC-CCH-HHHHHC------ Q ss_conf 589999998630597-58999721001111036799999999741003576777589995168-884-442200------ Q gi|254780791|r 157 AVIRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGG-GSI-EDLWHF------ 227 (529) Q Consensus 157 a~~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGG-GS~-eDL~~F------ 227 (529) ..+.++...++..|| .++.+|.+.+-|-.-+.+ +.|+.|..-+. .+. +-|.-+ T Consensus 4 ~~~~~l~~~v~~~~~~~~v~~fGS~~~gl~l~~S------------------DiDl~i~~~~~~~~~~~~l~~l~~~l~~ 65 (114) T cd05402 4 EVLDRLQELIKEWFPGAKLYPFGSYVTGLGLPGS------------------DIDLCLLGPNHRVDREDFLRKLAKLLKK 65 (114) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCC------------------CCCEEEECCCCCCCHHHHHHHHHHHHHH T ss_conf 9999999999988899899986897157859998------------------8745885388631089999999999986 Q ss_pred -CHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHCCCCCCC Q ss_conf -7699999997489048852057775258988641237772 Q gi|254780791|r 228 -NDEMIVRAIANSSIPIISAIGHETDWTLADYAADLRAPTP 267 (529) Q Consensus 228 -N~e~laraI~~~~iPVisgIGHE~D~Tl~D~VAD~Ra~TP 267 (529) ....-...|...++|||.-+...++.. .|..+... T Consensus 66 ~~~~~~~~~I~~ArVPIik~~~~~~~i~-----~DIs~~n~ 101 (114) T cd05402 66 SGEVVEVEPIINARVPIIKFVDKPTGIE-----VDISFNNL 101 (114) T ss_pred CCCCEEEEEEEEECCCEEEEEECCCCEE-----EEEEECCC T ss_conf 7992778884022320799998677969-----99980785 No 337 >cd06341 PBP1_ABC_ligand_binding_like_7 Type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type active transport systems that are predicted to be involved in transport of amino acids, peptides, or inorganic ions. This subgroup includes the type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type active transport systems that are predicted to be involved in transport of amino acids, peptides, or inorganic ions. Members of this group are sequence-similar to members of the family of ABC-type hydrophobic amino acid transporters such as leucine-isoleucine-valine-binding protein (LIVBP); however their ligand specificity has not been determined experimentally. Probab=55.00 E-value=14 Score=16.35 Aligned_cols=77 Identities=16% Similarity=0.137 Sum_probs=36.8 Q ss_pred CEEEEEECCC--HHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHH Q ss_conf 2899984784--25899999986305975899972100111103679999999974100357677758999516888444 Q gi|254780791|r 146 KIIAVITSPT--GAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIED 223 (529) Q Consensus 146 ~~i~vits~~--~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eD 223 (529) +.++++...+ |..+.+..+...++..+++......-.|..--...+..|+.. .+|+|+++=.++ | T Consensus 134 k~~~~~~~~~~~g~~~~~~~~~~~~~~G~~vv~~~~~~~~~~D~~~~v~~l~~~----------~pd~V~~~~~~~---~ 200 (341) T cd06341 134 RAVALVTALSAAVSAAAALLARSLAAAGVSVAGIVVITATAPDPTPQAQQAAAA----------GADAIITVLDAA---V 200 (341) T ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHC----------CCCEEEEECCCH---H T ss_conf 489999358358899999999999975995457874489998778999999856----------969999906847---8 Q ss_pred HHHCCHHHHHHHHHHCC Q ss_conf 22007699999997489 Q gi|254780791|r 224 LWHFNDEMIVRAIANSS 240 (529) Q Consensus 224 L~~FN~e~laraI~~~~ 240 (529) . -.++++..+.- T Consensus 201 ~-----~~~~k~~~~~G 212 (341) T cd06341 201 C-----ASVLKAVRAAG 212 (341) T ss_pred H-----HHHHHHHHHCC T ss_conf 9-----99999999769 No 338 >pfam08960 DUF1874 Domain of unknown function (DUF1874). This domain is found in a set of hypothetical viral and bacterial proteins. Probab=54.86 E-value=12 Score=17.10 Aligned_cols=23 Identities=26% Similarity=0.370 Sum_probs=16.4 Q ss_pred CHHHHHHHHHHCCCEEEEEECCCCC Q ss_conf 7699999997489048852057775 Q gi|254780791|r 228 NDEMIVRAIANSSIPIISAIGHETD 252 (529) Q Consensus 228 N~e~laraI~~~~iPVisgIGHE~D 252 (529) +.++.+..|. .-++||||||+-- T Consensus 25 ~~~Eak~~l~--~~~fvSAIGH~aT 47 (104) T pfam08960 25 DIEEAKELLE--NERFVSAIGHDAT 47 (104) T ss_pred CHHHHHHHHC--CCCEEEEECCHHH T ss_conf 9999998861--6673675164989 No 339 >COG0763 LpxB Lipid A disaccharide synthetase [Cell envelope biogenesis, outer membrane] Probab=54.70 E-value=14 Score=16.31 Aligned_cols=50 Identities=14% Similarity=0.145 Sum_probs=25.6 Q ss_pred CCCCHHHHHHCCHHHHHH------HHHHCCCEEEEEECCCCCCHHHHHHHCCCCCCCHH Q ss_conf 688844422007699999------99748904885205777525898864123777214 Q gi|254780791|r 217 GGGSIEDLWHFNDEMIVR------AIANSSIPIISAIGHETDWTLADYAADLRAPTPTG 269 (529) Q Consensus 217 GGGS~eDL~~FN~e~lar------aI~~~~iPVisgIGHE~D~Tl~D~VAD~Ra~TPTa 269 (529) .+|...+.+.=-|-.++. .++-+.+|.|++==- + .+.=+.+-....+|.. T Consensus 254 ~~~~~~~a~~~aD~al~aSGT~tLE~aL~g~P~Vv~Yk~--~-~it~~iak~lvk~~yi 309 (381) T COG0763 254 IDGEKRKAFAAADAALAASGTATLEAALAGTPMVVAYKV--K-PITYFIAKRLVKLPYV 309 (381) T ss_pred ECCHHHHHHHHHHHHHHHCCHHHHHHHHHCCCEEEEEEC--C-HHHHHHHHHHCCCCCC T ss_conf 074578999984189984467999999829997999944--5-8899999986167724 No 340 >PRK05416 hypothetical protein; Provisional Probab=54.47 E-value=14 Score=16.28 Aligned_cols=110 Identities=15% Similarity=0.122 Sum_probs=60.8 Q ss_pred CCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH Q ss_conf 02652899984784258999999863059758999721001111036799999999741003576777589995168884 Q gi|254780791|r 142 PFIPKIIAVITSPTGAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI 221 (529) Q Consensus 142 P~~p~~i~vits~~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~ 221 (529) +.-|.+|-|||+.+|||---.++++.+---+.|.-.|.. ++..+-.... +.....++=+.|=+|+++.. T Consensus 2 ~~~~m~lviVTGlSGAGKStAl~~LEDlGy~ciDNlP~~---------Ll~~l~~~~~--~~~~~~~lAv~iD~R~~~~~ 70 (292) T PRK05416 2 EAAPMRLVIVTGLSGAGKSVALRALEDLGYYCVDNLPPS---------LLPKLVELLA--QSGGIDKVAVVIDVRSRPFF 70 (292) T ss_pred CCCCEEEEEEECCCCCCHHHHHHHHHHCCCEEECCCCHH---------HHHHHHHHHH--CCCCCCCEEEEEECCCCCCH T ss_conf 989868999968997879999999981794478688889---------9999999972--47887706999824665447 Q ss_pred HHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHCCCCCCC Q ss_conf 4422007699999997489048852057775258988641237772 Q gi|254780791|r 222 EDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYAADLRAPTP 267 (529) Q Consensus 222 eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~VAD~Ra~TP 267 (529) ++|-. +...+-+..+++-.=-=.-.|.+|+-==+-.|=+-| T Consensus 71 ~~l~~-----~~~~l~~~~~~~~ilFLdA~~~~LlrRy~eTRR~HP 111 (292) T PRK05416 71 LDLPE-----ALDELRERGIDVRVLFLDASDEVLIRRYSETRRRHP 111 (292) T ss_pred HHHHH-----HHHHHHHCCCCEEEEEEECCHHHHHHHHHHCCCCCC T ss_conf 78999-----999998579955999997886999998875068999 No 341 >pfam01207 Dus Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrouridines are found in the D loop of t-RNAs. The role of dihydrouridine in tRNA is currently unknown, but may increase conformational flexibility of the tRNA. It is likely that different family members have different substrate specificities, which may overlap. Dus 1 from Saccharomyces cerevisiae acts on pre-tRNA-Phe, while Dus 2 acts on pre-tRNA-Tyr and pre-tRNA-Leu. Dus 1 is active as a single subunit, requiring NADPH or NADH, and is stimulated by the presence of FAD. Some family members may be targeted to the mitochondria and even have a role in mitochondria. Probab=54.36 E-value=14 Score=16.36 Aligned_cols=15 Identities=33% Similarity=0.516 Sum_probs=6.9 Q ss_pred HHHHHHHHHCCCEEE Q ss_conf 999999974890488 Q gi|254780791|r 230 EMIVRAIANSSIPII 244 (529) Q Consensus 230 e~laraI~~~~iPVi 244 (529) +.+.+..-.++|||| T Consensus 173 ~~i~~~k~~~~ipvi 187 (309) T pfam01207 173 DAIKQVKQAVSIPVI 187 (309) T ss_pred HHHHHHHHHCCCCEE T ss_conf 899999985898289 No 342 >PRK08259 enoyl-CoA hydratase; Provisional Probab=54.10 E-value=15 Score=16.24 Aligned_cols=38 Identities=24% Similarity=0.240 Sum_probs=24.2 Q ss_pred HHHHHHHHCCCEEEEEE-CCCC--CCHHHHHHHCCCCCCCHH Q ss_conf 99999974890488520-5777--525898864123777214 Q gi|254780791|r 231 MIVRAIANSSIPIISAI-GHET--DWTLADYAADLRAPTPTG 269 (529) Q Consensus 231 ~laraI~~~~iPVisgI-GHE~--D~Tl~D~VAD~Ra~TPTa 269 (529) .+......||.|||.+| ||=. =.. +=+.+|+|..+++| T Consensus 84 ~~~~~~~~~~kPvIAaV~G~a~GgG~~-lal~cD~~ia~~~A 124 (254) T PRK08259 84 PMGPSRMRLSKPVIAAISGYAVAGGLE-LALWCDLRVAEEDA 124 (254) T ss_pred HHHHHHHHCCCCEEEEECCEEEECCHH-HHHHCCEEEECCCC T ss_conf 567999858998899977886557446-64415666855896 No 343 >PRK03620 5-dehydro-4-deoxyglucarate dehydratase; Provisional Probab=53.93 E-value=15 Score=16.21 Aligned_cols=74 Identities=15% Similarity=0.269 Sum_probs=36.9 Q ss_pred CCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHH Q ss_conf 84258999999863059758999721001111036799999999741003576777589995168884442200769999 Q gi|254780791|r 154 PTGAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIV 233 (529) Q Consensus 154 ~~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~la 233 (529) ++-.++-++.+.+.+.-++.|++|.- .|-.-..+.+..|.. . .+.++-|==..| |+ +.+. T Consensus 109 ~~~~~l~~~~~~ia~a~~lPi~lYn~--~~~~~~~~~l~~L~~--~--------~pnivgiKdss~---d~-----~~~~ 168 (296) T PRK03620 109 APQEGLAAHVEAVCKSTDLGVIVYNR--DNAVLTADTLARLAE--R--------CPNLIGFKDGVG---DI-----ELMV 168 (296) T ss_pred CCHHHHHHHHHHHHHHCCCCEEEECC--CCCCCCHHHHHHHHH--H--------CCCEEEEEECCC---CH-----HHHH T ss_conf 99999999999999831899775178--887769999999997--2--------898899995868---89-----9999 Q ss_pred HHHHHC--CCEEEEEE Q ss_conf 999748--90488520 Q gi|254780791|r 234 RAIANS--SIPIISAI 247 (529) Q Consensus 234 raI~~~--~iPVisgI 247 (529) +.+..+ ++-|++|. T Consensus 169 ~~~~~~~~~~~v~~G~ 184 (296) T PRK03620 169 RITRALGDRLLYLGGL 184 (296) T ss_pred HHHHHCCCCEEEEECC T ss_conf 9999769975998289 No 344 >PRK06490 glutamine amidotransferase; Provisional Probab=53.84 E-value=15 Score=16.20 Aligned_cols=83 Identities=16% Similarity=0.161 Sum_probs=56.9 Q ss_pred CCCCEEEEEECCCHHHHHHHHHHHHHC-CCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH Q ss_conf 265289998478425899999986305-9758999721001111036799999999741003576777589995168884 Q gi|254780791|r 143 FIPKIIAVITSPTGAVIRDILQRISCR-FPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI 221 (529) Q Consensus 143 ~~p~~i~vits~~~a~~~D~~~~~~~r-~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~ 221 (529) .-.++|.||--........|-..++++ +++.+ + .+..|+.-|..+ ..||.+||-=|--|. T Consensus 9 ~~~k~vLviqH~~~e~pG~i~~~L~~~G~~~~i--~-r~~~gd~lP~~l----------------~~~dglvVlGGpmsa 69 (243) T PRK06490 9 RDKRPILIVLHQERSTPGRVGQLLRERGYPLDI--R-RPRLGDPLPETL----------------EDHAGAVIFGGPMSA 69 (243) T ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHCCCEEEE--E-ECCCCCCCCCCC----------------CCCCEEEEECCCCCC T ss_conf 789857999668999972999999978985899--9-678999899975----------------460589992899998 Q ss_pred HHHHHC--CHHHHHHHHHHCCCEEE Q ss_conf 442200--76999999974890488 Q gi|254780791|r 222 EDLWHF--NDEMIVRAIANSSIPII 244 (529) Q Consensus 222 eDL~~F--N~e~laraI~~~~iPVi 244 (529) .|--+| .+..++|...+..+|++ T Consensus 70 ~D~~p~l~~e~~~I~~~l~~~~P~L 94 (243) T PRK06490 70 NDPDDFIRREIDWISVPLKENKPFL 94 (243) T ss_pred CCCCCHHHHHHHHHHHHHHCCCCEE T ss_conf 9987179999999999998699889 No 345 >cd02801 DUS_like_FMN Dihydrouridine synthase-like (DUS-like) FMN-binding domain. Members of this family catalyze the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archaea. Most dihydrouridines are found in the D loop of t-RNAs. The role of dihydrouridine in tRNA is currently unknown, but may increase conformational flexibility of the tRNA. It is likely that different family members have different substrate specificities, which may overlap. 1VHN, a putative flavin oxidoreductase, has high sequence similarity to DUS. The enzymatic mechanism of 1VHN is not known at the present. Probab=53.73 E-value=13 Score=16.59 Aligned_cols=20 Identities=25% Similarity=0.687 Sum_probs=10.5 Q ss_pred CCEEEEECCCCCHHHHHHCCHH Q ss_conf 7589995168884442200769 Q gi|254780791|r 209 PDIIILARGGGSIEDLWHFNDE 230 (529) Q Consensus 209 ~D~iii~RGGGS~eDL~~FN~e 230 (529) +|-|.|+|| ++.+-|.|++. T Consensus 207 ~dgvMigRg--al~nP~iF~~i 226 (231) T cd02801 207 VDGVMIGRG--ALGNPWLFREI 226 (231) T ss_pred CCEEEECHH--HHHCCHHHHHH T ss_conf 999998788--87698899999 No 346 >cd03141 GATase1_Hsp31_like Type 1 glutamine amidotransferase (GATase1)-like domain found in proteins similar to Escherichia coli Hsp31 protein. Type 1 glutamine amidotransferase (GATase1)-like domain found in proteins similar to Escherichia coli Hsp31 protein (EcHsp31). This group includes EcHsp31 and Saccharomyces cerevisiae Ydr533c protein. EcHsp31 has chaperone activity. Ydr533c is upregulated in response to various stress conditions along with the heat shock family. EcHsp31 coordinates a metal ion using a 2-His-1-carboxylate motif present in various ions that use iron as a cofactor such as Carboxypeptidase A. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, in common with a typical GATase1 domain, a reactive Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. For EcHsp31, this Cys together with a different His and, an Asp (rather than a Glu) residue form a different Probab=53.58 E-value=13 Score=16.61 Aligned_cols=41 Identities=22% Similarity=0.359 Sum_probs=30.9 Q ss_pred CCCEEEEECCCCCHHHHHHCCHHHHH---HHHHHCCCEEEEEECCCC Q ss_conf 77589995168884442200769999---999748904885205777 Q gi|254780791|r 208 RPDIIILARGGGSIEDLWHFNDEMIV---RAIANSSIPIISAIGHET 251 (529) Q Consensus 208 ~~D~iii~RGGGS~eDL~~FN~e~la---raI~~~~iPVisgIGHE~ 251 (529) .||+|+|.=|-|...||. +++.|. +.+++..- +|++|.|-. T Consensus 90 ~ydai~~pGG~g~m~dl~--~~~~l~~l~~~~~~~gk-~vaaiCHGp 133 (221) T cd03141 90 DYDAIFIPGGHGPMFDLP--DNPDLQDLLREFYENGK-VVAAVCHGP 133 (221) T ss_pred HCCEEEECCCCCHHHHHH--CCHHHHHHHHHHHHCCC-EEEEECCHH T ss_conf 883899899854587744--16999999999997498-599976307 No 347 >KOG1405 consensus Probab=53.58 E-value=15 Score=16.17 Aligned_cols=90 Identities=18% Similarity=0.364 Sum_probs=61.0 Q ss_pred CCCCCCCCCCCCEEEEEECC--------------C---HHHHHHHHHHHHHCC-CE-EEEEEECCCCCC--CHHHHHHHH Q ss_conf 10016310265289998478--------------4---258999999863059-75-899972100111--103679999 Q gi|254780791|r 135 DQHKNPIPFIPKIIAVITSP--------------T---GAVIRDILQRISCRF-PL-RVIIFPVKVQGD--ECPKEIANA 193 (529) Q Consensus 135 ~~~k~~lP~~p~~i~vits~--------------~---~a~~~D~~~~~~~r~-p~-~~~~~p~~vQG~--~a~~~i~~a 193 (529) +-||..+|.|++-||=--+- . =|-++|.+..-+++. |+ -+++-|..-.|- .|.++--+. T Consensus 217 pIHKldiPaFdWPiA~fP~ykYPLdenv~~N~ked~~Cl~~Ve~li~~~~~k~~pVaaiIvEPIQsEGGDnhaSp~Ff~k 296 (484) T KOG1405 217 PIHKLDIPAFDWPIAPFPRYKYPLDENVAENKKEDQGCLAEVEDLIVKYRKKKKPVAAIIVEPIQSEGGDNHASPDFFRK 296 (484) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECHHCCCCCCCCCHHHHHH T ss_conf 64006888888977887434587456565512213667999999999976037975799865110257886689899999 Q ss_pred HHHHHHHCCCCCCCCCCEEEEE----CCCCCHHHHHHCCHHHH Q ss_conf 9999741003576777589995----16888444220076999 Q gi|254780791|r 194 ILQLNTLKEGRTCPRPDIIILA----RGGGSIEDLWHFNDEMI 232 (529) Q Consensus 194 i~~~~~~~~~~~~~~~D~iii~----RGGGS~eDL~~FN~e~l 232 (529) |+.+.. +.+++.|+ -||||..++||-..-+| T Consensus 297 Lrdi~~--------Kh~v~fivDEVQTGgGaTGk~WaHehw~l 331 (484) T KOG1405 297 LRDITK--------KHGVAFIVDEVQTGGGATGKFWAHEHWNL 331 (484) T ss_pred HHHHHH--------HCCEEEEEEEEECCCCCCCCEEEEHHCCC T ss_conf 999998--------64958996400007886675432020389 No 348 >PRK06246 fumarate hydratase; Provisional Probab=53.47 E-value=9.5 Score=17.91 Aligned_cols=50 Identities=36% Similarity=0.438 Sum_probs=31.0 Q ss_pred CCCEEEEECCCCCH--------------HHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHH Q ss_conf 77589995168884--------------442200769999999748904885205777525898864 Q gi|254780791|r 208 RPDIIILARGGGSI--------------EDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYAA 260 (529) Q Consensus 208 ~~D~iii~RGGGS~--------------eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~VA 260 (529) .++..++.-||||- |.+.-|=-|.|..+=.+---|++.|||=--. .|.++ T Consensus 133 ~~~i~~~~KGgGSeN~s~l~ml~Ps~g~egi~~fv~e~v~~~G~~~CPP~~vGVGIGGt---~e~a~ 196 (276) T PRK06246 133 RLKITVAPKGGGSENMSALKMLKPADGLEGIKKFVLETVKEAGGNPCPPIIVGVGIGGT---FDKAA 196 (276) T ss_pred EEEEEEECCCCCCCCHHHHHEECCCCCHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCC---HHHHH T ss_conf 48999844778743554564027544577899999999997088999987068885674---99999 No 349 >PRK05813 single-stranded DNA-binding protein; Provisional Probab=53.36 E-value=14 Score=16.41 Aligned_cols=133 Identities=17% Similarity=0.336 Sum_probs=75.1 Q ss_pred CEEEEEEEC-CCCCCCC---CCEEEEEEE-----CCCCEEEEEEECCCCCCC-CCCCCCCCEEEEEEEEEEECC------ Q ss_conf 189999970-5435688---862799987-----489479999973521058-668145988999999667528------ Q gi|254780791|r 34 HVCVRGEIS-GYRGIHS---SGHAYFSLK-----DNHSRIDAIIWKGTLNKI-EFLPEEGIEFLVIGKITTFPG------ 97 (529) Q Consensus 34 ~~~v~gEis-~~~~~~~---sGH~Yf~lk-----d~~a~i~~~~~~~~~~~~-~~~~~~G~~v~~~g~~~~y~~------ 97 (529) .|.|.|||. .|..+|- -|-+-|.|. |..-.|.-++=- +-+ ......|..|.+.|.+--|-. T Consensus 10 ~V~~~G~i~s~~~fSHE~~GEgFY~~~l~V~RLS~~~D~ip~~vSE---RLi~~~~~~~g~~i~v~GQ~RSYN~~~~~~N 86 (217) T PRK05813 10 KVYLEGKVVSELEFSHEMYGEGFYTFKLEVPRLSDSKDILNITVSE---RLLADMDLKVGTEIIVEGQLRSYNKFIDGSN 86 (217) T ss_pred EEEEEEEECCCCEEEEEEECCEEEEEEEEEECCCCCCCEEEEEEEH---HHHHCCCCCCCCEEEEEEEEEECCCCCCCCC T ss_conf 8999999835735734663342589999851146874337789774---3420011357978999852440336678774 Q ss_pred CCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 84379999971016800799999999997654012261001631026528999847842589999998630597589997 Q gi|254780791|r 98 SSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCRFPLRVIIF 177 (529) Q Consensus 98 ~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r~p~~~~~~ 177 (529) +=-+++.|..|++.-+ + ..--...--.||.. |.|+ | =+-|-|--|.|++-...+.|. +-... T Consensus 87 rLiLtvF~Rei~~~~e-~-----~~~~NqIfLdGyIC---K~PV--Y------R~TPlGREIaDiLlAVNR~Y~-KSDYI 148 (217) T PRK05813 87 RLILTVFARNIEYCDE-R-----SKNPNEIFLDGFIC---KEPV--Y------RTTPFGREIADLLLAVNRPYN-KSDYI 148 (217) T ss_pred EEEEEEEEEEEEECCC-C-----CCCCCEEEEEEEEC---CCCE--E------CCCCCCHHHHHHHHHHCCCCC-CCCCC T ss_conf 4999998761162143-4-----68866498803762---7862--0------269884257777645067678-77730 Q ss_pred ECCCCCCCHH Q ss_conf 2100111103 Q gi|254780791|r 178 PVKVQGDECP 187 (529) Q Consensus 178 p~~vQG~~a~ 187 (529) ||-.=|.+|- T Consensus 149 PcI~WGrnAr 158 (217) T PRK05813 149 PTIAWGRNAR 158 (217) T ss_pred EEEEECCCHH T ss_conf 1778422256 No 350 >PRK10361 DNA recombination protein RmuC; Provisional Probab=53.30 E-value=15 Score=16.14 Aligned_cols=20 Identities=10% Similarity=0.348 Sum_probs=14.0 Q ss_pred CEEEEEEECEEEEEEEEECCC Q ss_conf 869999911099999950578 Q gi|254780791|r 483 TRILINFFDGQANAIVINKAP 503 (529) Q Consensus 483 d~i~i~l~DG~v~a~V~~k~~ 503 (529) -.+.|+|-+|+- ..|..|.| T Consensus 241 PDvIV~LP~~k~-iVIDSKvs 260 (475) T PRK10361 241 PDVIVRLPQGKD-VVIDAKMT 260 (475) T ss_pred CCEEEECCCCCE-EEEECCCC T ss_conf 976898789975-88862788 No 351 >pfam04076 BOF Bacterial OB fold (BOF) protein. Proteins in this family form an OB-fold. Analysis of the predicted binding site of BOF family proteins implies that they lack nucleic acid-binding properties. They contain an predicted N-terminal signal peptide which indicates that they localize in the periplasm where they may function to bind proteins, small molecules, or other typical OB-fold ligands. As hypothesized for the distantly related OB-fold containing bacterial enterotoxins, the loss of nucleotide-binding function and the rapid evolution of the BOF ligand-binding site may be associated with the presence of BOF proteins in mobile genetic elements and their potential role in bacterial pathogenicity. Probab=53.26 E-value=15 Score=16.13 Aligned_cols=68 Identities=19% Similarity=0.249 Sum_probs=51.9 Q ss_pred CCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEE Q ss_conf 71899999705435688862799987489479999973521058668145988999999667528843799999710 Q gi|254780791|r 33 SHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLI 109 (529) Q Consensus 33 ~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~ 109 (529) ..|-++|-|.. . -||=+|..+|..+.|..-|=...+......| .++|.+.|+|+ .......+-|..|+ T Consensus 58 ~~V~L~G~Iv~--~---l~~d~Y~F~D~TG~I~VeId~~~w~G~~v~p--~~kVrI~GevD--k~~~~~eIdV~~I~ 125 (126) T pfam04076 58 AWVSLEGNIVR--Q---IGDDEYEFRDASGTIKVDIDDRVWNGQEVQP--KDKVKITGEVD--KDLNKAEIDVKFIE 125 (126) T ss_pred CCEEEEEEEEE--E---ECCCEEEEECCCCCEEEEECHHHCCCCCCCC--CCEEEEEEEEC--CCCCCEEEEEEEEE T ss_conf 93999889996--8---4895389978996399996888838981799--99899999994--78887179888876 No 352 >KOG1029 consensus Probab=53.24 E-value=15 Score=16.13 Aligned_cols=181 Identities=14% Similarity=0.118 Sum_probs=75.2 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHHC-----CCC-EE--EEEEECCCCCCCCCCEEEEEEECCC-CEEEEEEECCC Q ss_conf 988888888898622999999999997400-----171-89--9999705435688862799987489-47999997352 Q gi|254780791|r 2 NPFSQKNSLDHPEYSVSELSYHLKHIVESN-----LSH-VC--VRGEISGYRGIHSSGHAYFSLKDNH-SRIDAIIWKGT 72 (529) Q Consensus 2 ~~~~~~~~~~~~~~svs~l~~~i~~~l~~~-----~~~-~~--v~gEis~~~~~~~sGH~Yf~lkd~~-a~i~~~~~~~~ 72 (529) +++..-.-|+.-+||+. ...|+..|.+- ++. +. --+-++.. .+.+||--.+.-+.- +.+..+|=.+- T Consensus 58 sDldkDGrmdi~EfSIA--mkLi~lkLqG~~lP~~LPPsll~~~~~~~p~~--~p~fg~Gsls~~qpL~~a~p~~m~~s~ 133 (1118) T KOG1029 58 SDLDKDGRMDIREFSIA--MKLIKLKLQGIQLPPVLPPSLLKQPPRNAPST--WPGFGMGSLSYSQPLPPAAPRRMSSSP 133 (1118) T ss_pred HHCCCCCCCHHHHHHHH--HHHHHHHHCCCCCCCCCCHHHHCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 40376666018999999--99999875378688778947750677678888--976687776767888866664467776 Q ss_pred CCCCCCCCCCCCEEEEEEEEEEE---CCCC----------EEEEEEEEEEECCCCHHHHHHHHH----HHHHHHHHCCCC Q ss_conf 10586681459889999996675---2884----------379999971016800799999999----997654012261 Q gi|254780791|r 73 LNKIEFLPEEGIEFLVIGKITTF---PGSS----------KYQIIIESLIPSGSGTLLTALEKR----KKKLLEEGLFSD 135 (529) Q Consensus 73 ~~~~~~~~~~G~~v~~~g~~~~y---~~~g----------~~ql~v~~i~~~g~G~l~~~~e~l----k~~L~~eGlfd~ 135 (529) .. |--|.+.+-+|.- -|.| .-.=.+....|++. -..+.++. -.||+---+|.. T Consensus 134 v~--------~~Pv~vatvpS~~~~sl~nGplp~~~~spl~~~ss~se~~~~~~--s~~q~~eWAVp~~~klKY~QlFNa 203 (1118) T KOG1029 134 VV--------GPPVSVATVPSSRHNSLPNGPLPPTSNSPLPHDSSVSEGRPSIE--SVNQLEEWAVPQHNKLKYRQLFNA 203 (1118) T ss_pred CC--------CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHCCCCCH--HHHHHHHCCCCCHHHHHHHHHHHH T ss_conf 67--------89731146777888878889989877898988742533276411--344444204665114579987410 Q ss_pred CCCCCCCCCCCEEEEEECCCHHHHH--HHH--HHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 0016310265289998478425899--999--98630597589997210011110367999999997410035767 Q gi|254780791|r 136 QHKNPIPFIPKIIAVITSPTGAVIR--DIL--QRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCP 207 (529) Q Consensus 136 ~~k~~lP~~p~~i~vits~~~a~~~--D~~--~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~ 207 (529) .-| .+-|-+|+..+-..= -=| ++|+.=|. |..+-=-|.-.+.+-+-|+-.++....+.+.+ T Consensus 204 ~Dk-------trsG~Lsg~qaR~aL~qS~Lpq~~LA~IW~----LsDvd~DGkL~~dEfilam~liema~sGq~lP 268 (1118) T KOG1029 204 LDK-------TRSGYLSGQQARSALGQSGLPQNQLAHIWT----LSDVDGDGKLSADEFILAMHLIEMAKSGQPLP 268 (1118) T ss_pred CCC-------CCCCCCCCHHHHHHHHHCCCCHHHHHHHEE----EECCCCCCCCCHHHHHHHHHHHHHHHCCCCCC T ss_conf 120-------103443437777788765784654746022----00137877444999999999999984589788 No 353 >PRK11151 DNA-binding transcriptional regulator OxyR; Provisional Probab=53.18 E-value=15 Score=16.12 Aligned_cols=95 Identities=16% Similarity=0.222 Sum_probs=62.1 Q ss_pred CCCCCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC Q ss_conf 026528999847842589999998630597-5899972100111103679999999974100357677758999516888 Q gi|254780791|r 142 PFIPKIIAVITSPTGAVIRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGS 220 (529) Q Consensus 142 P~~p~~i~vits~~~a~~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS 220 (529) +.-.-+||++.+-...-+-+++..++++|| +++.+.... ..+++..+.. ..+|+.|+.. T Consensus 89 ~~G~lrig~~~~~~~~llp~~l~~f~~~~P~v~i~l~~~~------~~~l~~~l~~----------g~~D~~i~~~---- 148 (305) T PRK11151 89 MSGPLHIGLIPTVGPYLLPHIIPMLHQTFPKLEMYLHEAQ------THQLLAQLDS----------GKLDCAILAL---- 148 (305) T ss_pred CCCEEEEEEHHHHHHHCCCHHHHHHHHCCCCCEEEEEECC------CHHHHHHHHC----------CCCCEEEEEC---- T ss_conf 7641431102223332382779998623898379999887------5777878657----------8853799845---- Q ss_pred HHHHHHCCHHHHHHHHHHCCCEEEEEECCC----CCCHHHHHHH Q ss_conf 444220076999999974890488520577----7525898864 Q gi|254780791|r 221 IEDLWHFNDEMIVRAIANSSIPIISAIGHE----TDWTLADYAA 260 (529) Q Consensus 221 ~eDL~~FN~e~laraI~~~~iPVisgIGHE----~D~Tl~D~VA 260 (529) ..+-..|... .++..++=++..=||. ...++.|+.. T Consensus 149 ~~~~~~~~~~----~l~~e~l~lv~~~~hpla~~~~i~l~~L~~ 188 (305) T PRK11151 149 VKESEAFIEV----PLFDEPMLLAVYEDHPWANRDRVPMSDLAG 188 (305) T ss_pred CCCCCCCEEE----EECCCCEEEEEECCCCCCCCCCCCHHHHCC T ss_conf 8899872687----620462799995898433699989999769 No 354 >PRK06843 inositol-5-monophosphate dehydrogenase; Validated Probab=53.12 E-value=15 Score=16.11 Aligned_cols=58 Identities=31% Similarity=0.544 Sum_probs=38.8 Q ss_pred EEEEECCCH--HHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH Q ss_conf 999847842--589999998630597-58999721001111036799999999741003576777589995168884 Q gi|254780791|r 148 IAVITSPTG--AVIRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI 221 (529) Q Consensus 148 i~vits~~~--a~~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~ 221 (529) +-||-+..| ...-|+++.++..|| +.| +=|+-|..+-...|..+ ..|.|.++-|+||+ T Consensus 168 ~lvID~AhGhs~~~~e~ik~ik~~~p~v~V------IaGNVaT~~~a~~Li~a----------GAD~VkVGiGpGsi 228 (404) T PRK06843 168 ILVIDSAHGHSTRIIELVKTIKNKYPNLDL------IAGNIVTKEAALDLINV----------GADCLKVGIGPGSI 228 (404) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHCCCCCE------EECCCCCHHHHHHHHHH----------CCCEEEECCCCCCC T ss_conf 999968875217899999999976799616------63030579999999981----------98999956547877 No 355 >cd06307 PBP1_uncharacterized_sugar_binding Periplasmic sugar-binding domain of uncharacterized transport systems. Periplasmic sugar-binding domain of uncharacterized transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily. The members of this group are predicted to be involved in the transport of sugar-containing molecules across cellular and organellar membranes. Probab=53.08 E-value=15 Score=16.11 Aligned_cols=210 Identities=19% Similarity=0.168 Sum_probs=94.3 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHC-CCCEEEEEEECCCCCCCCCCEEEEEEECC-CCEEEEEEECCCC-CC----CCCCC Q ss_conf 888898622999999999997400-17189999970543568886279998748-9479999973521-05----86681 Q gi|254780791|r 8 NSLDHPEYSVSELSYHLKHIVESN-LSHVCVRGEISGYRGIHSSGHAYFSLKDN-HSRIDAIIWKGTL-NK----IEFLP 80 (529) Q Consensus 8 ~~~~~~~~svs~l~~~i~~~l~~~-~~~~~v~gEis~~~~~~~sGH~Yf~lkd~-~a~i~~~~~~~~~-~~----~~~~~ 80 (529) +++++|-| .++...+....+.. ...+++...-.+ . .+..--..+-+. ..++++++..... .. +.--- T Consensus 7 ~~~~npF~--~~i~~Gie~aa~~~~~~~v~v~~~~~~--~--~d~~~~~~~l~~i~~~vDgiii~~~d~~~~~~~i~~a~ 80 (275) T cd06307 7 PKGSNAFY--RELAAALEAAAAAFPDARIRVRIHFVE--S--FDPAALAAALLRLGARSDGVALVAPDHPQVRAAVARLA 80 (275) T ss_pred CCCCCHHH--HHHHHHHHHHHHHCCCCEEEEEEEECC--C--CCHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHH T ss_conf 79999699--999999999998656864999998779--8--99999999999985589889998998177899999999 Q ss_pred CCCCEEEEEEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHH- Q ss_conf 4598899999966752884379999971016800799999999997654012261001631026528999847842589- Q gi|254780791|r 81 EEGIEFLVIGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVI- 159 (529) Q Consensus 81 ~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~- 159 (529) +.|..|+...+- -+......++- .+....|.+... -+.+.| +.-+-+|++|++..+.-- T Consensus 81 ~~GIPVV~~~~~--~~~~~~~~~Vg--~Dn~~aG~~aa~--~l~~~l--------------~~~~g~i~ii~g~~~~~~~ 140 (275) T cd06307 81 AAGVPVVTLVSD--LPGSPRAGYVG--IDNRAAGRTAAW--LIGRFL--------------GRRPGKVAVLAGSHRFRGH 140 (275) T ss_pred HCCCEEEEECCC--CCCCCCCEEEC--CCHHHHHHHHHH--HHHHHH--------------CCCCCEEEEEECCCCCHHH T ss_conf 879849997288--88877433991--487999999999--999960--------------7689779999079875329 Q ss_pred ----HHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHH Q ss_conf ----999998630597-589997210011110367999999997410035767775899951688844422007699999 Q gi|254780791|r 160 ----RDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVR 234 (529) Q Consensus 160 ----~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~lar 234 (529) .=|...+++.+| +++.- ...-.++ ...-.++...+-. .++++|+|. +=|||. ..+++ T Consensus 141 ~~R~~Gf~~~l~~~~p~~~iv~-~~~~~~~--~~~a~~~~~~lL~-----~~Pdi~ai~-~~~~~~---------~ga~~ 202 (275) T cd06307 141 EEREMGFRSVLREEFPGLRVLE-TLEGLDD--PARAYEATRKLLA-----RHPDLVGIY-NAGGGN---------RGVIR 202 (275) T ss_pred HHHHHHHHHHHHHHCCCCCEEE-EECCCCC--HHHHHHHHHHHHH-----HCCCCEEEE-ECCCCH---------HHHHH T ss_conf 9999999999986588874024-6617886--5999999999997-----499962999-848716---------99999 Q ss_pred HHHHCCCE-EEEEECCCCCCHHHHHHHC Q ss_conf 99748904-8852057775258988641 Q gi|254780791|r 235 AIANSSIP-IISAIGHETDWTLADYAAD 261 (529) Q Consensus 235 aI~~~~iP-VisgIGHE~D~Tl~D~VAD 261 (529) |+-++..| -|.-|||......++++.+ T Consensus 203 Al~~~g~~~~v~vvg~D~~~~~~~~l~~ 230 (275) T cd06307 203 ALREAGRAGKVVFVGHELTPETRAALRD 230 (275) T ss_pred HHHHCCCCCCEEEEECCCCHHHHHHHHC T ss_conf 9997399998699962799899999981 No 356 >cd01425 RPS2 Ribosomal protein S2 (RPS2), involved in formation of the translation initiation complex, where it might contact the messenger RNA and several components of the ribosome. It has been shown that in Escherichia coli RPS2 is essential for the binding of ribosomal protein S1 to the 30s ribosomal subunit. In humans, most likely in all vertebrates, and perhaps in all metazoans, the protein also functions as the 67 kDa laminin receptor (LAMR1 or 67LR), which is formed from a 37 kDa precursor, and is overexpressed in many tumors. 67LR is a cell surface receptor which interacts with a variety of ligands, laminin-1 and others. It is assumed that the ligand interactions are mediated via the conserved C-terminus, which becomes extracellular as the protein undergoes conformational changes which are not well understood. Specifically, a conserved palindromic motif, LMWWML, may participate in the interactions. 67LR plays essential roles in the adhesion of cells to the basement membrane an Probab=52.65 E-value=14 Score=16.45 Aligned_cols=44 Identities=25% Similarity=0.474 Sum_probs=37.1 Q ss_pred CCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHH Q ss_conf 77758999516888444220076999999974890488520577752589886 Q gi|254780791|r 207 PRPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYA 259 (529) Q Consensus 207 ~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~V 259 (529) ..||+|||.= +-+|...++.-..+.||||+=+---.|.+.+||. T Consensus 126 ~~Pdliiv~d---------~~~~~~ai~Ea~~l~IPvI~i~Dtn~~p~~idyp 169 (193) T cd01425 126 RLPDLVIVLD---------PRKEHQAIREASKLGIPVIAIVDTNCDPDLIDYP 169 (193) T ss_pred CCCCEEEEEC---------CCCCHHHHHHHHHHCCCEEEEECCCCCCCCCCEE T ss_conf 6999899988---------9841689999986187557885089996658778 No 357 >KOG1885 consensus Probab=52.47 E-value=15 Score=16.03 Aligned_cols=104 Identities=22% Similarity=0.317 Sum_probs=69.2 Q ss_pred CHHHHHHHHHHHHHHC-C--CCEEEEEEECCCCCCCCCC--EEEEEEECCCCEEEEEEECCCCC-CCCC-----CCCCCC Q ss_conf 2999999999997400-1--7189999970543568886--27999874894799999735210-5866-----814598 Q gi|254780791|r 16 SVSELSYHLKHIVESN-L--SHVCVRGEISGYRGIHSSG--HAYFSLKDNHSRIDAIIWKGTLN-KIEF-----LPEEGI 84 (529) Q Consensus 16 svs~l~~~i~~~l~~~-~--~~~~v~gEis~~~~~~~sG--H~Yf~lkd~~a~i~~~~~~~~~~-~~~~-----~~~~G~ 84 (529) |..+++..-..+..++ . ..+.|-|-|...+. || -++|+|.+++-++..+.=..... -..| .++-|| T Consensus 85 si~~fieky~~l~~ge~~~n~~~svaGRI~s~R~---sGsKL~Fydl~~~g~klQvm~~~~~~~~~~~F~~~~~~lkrGD 161 (560) T KOG1885 85 SIPDFIEKYLHLATGEHLDNEIVSVAGRIHSKRE---SGSKLVFYDLHGDGVKLQVMANAKKITSEEDFEQLHKFLKRGD 161 (560) T ss_pred CHHHHHHHHCCCCCCCCCCCCEEEEEEEEEEEEC---CCCCEEEEEEECCCEEEEEEEEHHHCCCHHHHHHHHHHHHCCC T ss_conf 0899999855756656566522555106754301---5774699999469868999972200478788999986642168 Q ss_pred EEEEEEEEEEECCCCEEEEEEEEE---------EE---CCCCHHHHHHHHH Q ss_conf 899999966752884379999971---------01---6800799999999 Q gi|254780791|r 85 EFLVIGKITTFPGSSKYQIIIESL---------IP---SGSGTLLTALEKR 123 (529) Q Consensus 85 ~v~~~g~~~~y~~~g~~ql~v~~i---------~~---~g~G~l~~~~e~l 123 (529) -|-+.|++. ....|.+++++..+ -| .|+-|...+|+++ T Consensus 162 iig~~G~pg-rt~~gELSi~~~~~~lLspcLh~lP~~~~gLkD~EtRyrqR 211 (560) T KOG1885 162 IIGVSGYPG-RTKSGELSIIPNEIILLSPCLHMLPHEHFGLKDKETRYRKR 211 (560) T ss_pred EEEEECCCC-CCCCCEEEEEECCHHEECCHHCCCCHHHCCCCCHHHHHHHH T ss_conf 776515778-68785177741202020523134876651777688899988 No 358 >COG4019 Uncharacterized protein conserved in archaea [Function unknown] Probab=52.26 E-value=15 Score=16.01 Aligned_cols=82 Identities=21% Similarity=0.296 Sum_probs=47.0 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCC--HHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHH Q ss_conf 0079999999999765401226100163102652899984784--25899999986305975899972100111103679 Q gi|254780791|r 113 SGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPT--GAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEI 190 (529) Q Consensus 113 ~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~--~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i 190 (529) -||-....|.+++.+. -.+||.|.|... =.|+.|+++....+- .+.+-+|+.+-.-.--+.+ T Consensus 19 RGD~eeEve~ireyi~---------------sA~r~vV~t~N~~K~~aindvlrrf~l~E-aeml~~~T~~ADlTrmPA~ 82 (156) T COG4019 19 RGDKEEEVEKIREYIV---------------SAKRIVVATNNQKKFKAINDVLRRFCLAE-AEMLDIDTRFADLTRMPAL 82 (156) T ss_pred CCCHHHHHHHHHHHHH---------------CCCEEEEECCCHHHHHHHHHHHHHHCCCH-HHHHCCCCCHHHCCCCHHH T ss_conf 3546889999999985---------------14538872497899999999999824406-7770576414424448189 Q ss_pred HHHHHHHHHHCCCCCCCCCCEEEEECCC Q ss_conf 9999999741003576777589995168 Q gi|254780791|r 191 ANAILQLNTLKEGRTCPRPDIIILARGG 218 (529) Q Consensus 191 ~~ai~~~~~~~~~~~~~~~D~iii~RGG 218 (529) .+||...+-. +.| ++|+||- T Consensus 83 tKalmaldis-------~AD-lvIARGR 102 (156) T COG4019 83 TKALMALDIS-------KAD-LVIARGR 102 (156) T ss_pred HHHHHHCCCC-------CCC-EEEEECC T ss_conf 9888841115-------783-7986144 No 359 >TIGR03316 ygeW probable carbamoyltransferase YgeW. Members of this protein family include the ygeW gene product of Escherichia coli. The function is unknown. Members show homology to ornithine carbamoyltransferase (TIGR00658) and aspartate carbamoyltransferase (carbamoyltransferase), and therefore may belong to the carbamoyltransferases in function. Members often are found in a large, conserved genomic region associated with purine catabolism. Probab=52.17 E-value=16 Score=16.00 Aligned_cols=19 Identities=21% Similarity=0.128 Sum_probs=8.9 Q ss_pred CCCCCCCHHHHHHHHHHHH Q ss_conf 8898622999999999997 Q gi|254780791|r 10 LDHPEYSVSELSYHLKHIV 28 (529) Q Consensus 10 ~~~~~~svs~l~~~i~~~l 28 (529) .+...||-.|+..-+.... T Consensus 7 L~~~dls~~ei~~ll~~A~ 25 (357) T TIGR03316 7 ILTWEWTRDELDTVLDVAF 25 (357) T ss_pred CCCHHCCHHHHHHHHHHHH T ss_conf 8811289999999999999 No 360 >PRK10348 ribosome-associated heat shock protein Hsp15; Provisional Probab=52.14 E-value=16 Score=16.00 Aligned_cols=41 Identities=10% Similarity=0.107 Sum_probs=32.3 Q ss_pred CCCEEEEECCCCCEECCHHHCCCCCEEEEEEECEEEEEEEEECC Q ss_conf 14619998489889577789299986999991109999995057 Q gi|254780791|r 459 KRGYTSIQDTNNNFITQKRNLATKTRILINFFDGQANAIVINKA 502 (529) Q Consensus 459 ~RGYaiv~~~~GkiI~s~~~l~~gd~i~i~l~DG~v~a~V~~k~ 502 (529) +.|- |+ -||+.++.+..++.||.|+|+...-....+|.+-. T Consensus 32 ~~G~--V~-vnG~~~Kps~~V~~GD~l~i~~~~~~~~i~V~~l~ 72 (133) T PRK10348 32 EGGK--VH-YNGQRSKPSKIVELNATLTLRQGNDERTVIVKAIT 72 (133) T ss_pred HCCC--EE-ECCEECCCCCCCCCCCEEEEEECCEEEEEEEEECC T ss_conf 8798--89-89988688887689999999869959999998866 No 361 >pfam00389 2-Hacid_dh D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain. Probab=51.90 E-value=16 Score=15.97 Aligned_cols=23 Identities=17% Similarity=0.174 Sum_probs=9.4 Q ss_pred HHCCCCCCCCCCCCCCCEEEEEE Q ss_conf 01226100163102652899984 Q gi|254780791|r 130 EGLFSDQHKNPIPFIPKIIAVIT 152 (529) Q Consensus 130 eGlfd~~~k~~lP~~p~~i~vit 152 (529) +|-|+...-...-..-+++|||- T Consensus 122 ~g~w~~~~~~~~~l~~~~~giiG 144 (313) T pfam00389 122 AGDWKKGGPIGLELRGKTLGVIG 144 (313) T ss_pred CCCCCCCCCCCCCCCCCEEEEEE T ss_conf 08766556555336898899994 No 362 >cd03134 GATase1_PfpI_like A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus. A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus. This group includes proteins similar to PfpI from P. furiosus. and PH1704 from Pyrococcus horikoshii. These enzymes are ATP-independent intracellular proteases and may hydrolyze small peptides to provide a nutritional source. Only Cys of the catalytic triad typical of GATase1 domains is conserved in this group. This Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. For PH1704, it is believed that this Cys together with a different His in one monomer and Glu (from an adjacent monomer) forms a different catalytic triad from the typical GATase1domain. PfpI is homooligomeric. Protease activity is only found for oligomeric forms of PH1704. Probab=51.82 E-value=16 Score=15.96 Aligned_cols=40 Identities=33% Similarity=0.551 Sum_probs=25.7 Q ss_pred CCCEEEEECCCCCHHHHHHCCHH---HHHHHHHHCCCEEEEEECCCC Q ss_conf 77589995168884442200769---999999748904885205777 Q gi|254780791|r 208 RPDIIILARGGGSIEDLWHFNDE---MIVRAIANSSIPIISAIGHET 251 (529) Q Consensus 208 ~~D~iii~RGGGS~eDL~~FN~e---~laraI~~~~iPVisgIGHE~ 251 (529) .||+|||. ||-+.++|+ +++ .+++..++...| |.+|.|-. T Consensus 62 ~yD~ivlP-GG~g~~~l~--~~~~~~~~i~~~~~~~k~-iaaIC~gp 104 (165) T cd03134 62 DYDALVIP-GGTNPDKLR--RDPDAVAFVRAFAEAGKP-VAAICHGP 104 (165) T ss_pred HCCEEEEC-CCCCHHHHH--HCHHHHHHHHHHHHHCCC-EEEECCHH T ss_conf 28889978-985888885--298999999999971992-99998757 No 363 >cd06295 PBP1_CelR Ligand binding domain of a transcription regulator of cellulose genes, CelR, which is highly homologous to the LacI-GalR family of bacterial transcription regulators. This group includes the ligand binding domain of a transcription regulator of cellulose genes, CelR, which is highly homologous to the LacI-GalR family of bacterial transcription regulators. The binding of CelR to the celE promoter is inhibited specifically by cellobiose. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn chang Probab=51.73 E-value=16 Score=15.95 Aligned_cols=19 Identities=11% Similarity=0.183 Sum_probs=11.4 Q ss_pred CCCCCCCHHHHHHHHHHHHHH Q ss_conf 889862299999999999740 Q gi|254780791|r 10 LDHPEYSVSELSYHLKHIVES 30 (529) Q Consensus 10 ~~~~~~svs~l~~~i~~~l~~ 30 (529) .++|-| +++...|...+.. T Consensus 20 i~npff--~~l~~gie~~~~~ 38 (275) T cd06295 20 FSDPFF--LSLLGGIADALAE 38 (275) T ss_pred EECHHH--HHHHHHHHHHHHH T ss_conf 637999--9999999999998 No 364 >cd01574 PBP1_LacI Ligand-binding domain of DNA transcription repressor LacI specific for lactose, a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor LacI specific for lactose, a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of LacI is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA b Probab=51.68 E-value=16 Score=15.94 Aligned_cols=78 Identities=10% Similarity=0.070 Sum_probs=31.4 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCC---CCCCCCCC Q ss_conf 888898622999999999997400171899999705435688862799987489479999973521058---66814598 Q gi|254780791|r 8 NSLDHPEYSVSELSYHLKHIVESNLSHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKI---EFLPEEGI 84 (529) Q Consensus 8 ~~~~~~~~svs~l~~~i~~~l~~~~~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~---~~~~~~G~ 84 (529) |++++| .-+++...|...+... ++--+....++-.. ..-=++.=+|+ ...++++||-...... ....+.|. T Consensus 7 p~i~~~--f~~~~~~gie~~~~~~-Gy~~~l~~~~~~~~-~~~~~~i~~l~--~~~vdGiI~~~~~~~~~~~~~~~~~~i 80 (264) T cd01574 7 TDLALH--GPSSTLAAIESAAREA-GYAVTLSMLAEADE-EALRAAVRRLL--AQRVDGVIVNAPLDDADAALAAAPADV 80 (264) T ss_pred CCCCCH--HHHHHHHHHHHHHHHC-CCEEEEEECCCCCH-HHHHHHHHHHH--HCCCCEEEEECCCCCHHHHHHHHHCCC T ss_conf 766687--9999999999999984-99799997999989-99999999999--649999999168899699999985899 Q ss_pred EEEEEEE Q ss_conf 8999999 Q gi|254780791|r 85 EFLVIGK 91 (529) Q Consensus 85 ~v~~~g~ 91 (529) -|++.+. T Consensus 81 PvV~i~~ 87 (264) T cd01574 81 PVVFVDG 87 (264) T ss_pred CEEEECC T ss_conf 6899866 No 365 >PRK10415 tRNA-dihydrouridine synthase B; Provisional Probab=51.68 E-value=15 Score=16.02 Aligned_cols=14 Identities=7% Similarity=0.050 Sum_probs=8.4 Q ss_pred CCCCCCCCCCCCCC Q ss_conf 26100163102652 Q gi|254780791|r 133 FSDQHKNPIPFIPK 146 (529) Q Consensus 133 fd~~~k~~lP~~p~ 146 (529) +|.|--.|.|..-+ T Consensus 94 IDiN~GCP~~kV~k 107 (321) T PRK10415 94 IDINMGCPAKKVNR 107 (321) T ss_pred EEEECCCCHHHHCC T ss_conf 94318999899707 No 366 >PRK01045 ispH 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Reviewed Probab=51.63 E-value=16 Score=15.93 Aligned_cols=105 Identities=21% Similarity=0.269 Sum_probs=64.3 Q ss_pred CCCCEEEEEECCC--HHHHHHHHHHHHHCCC-EEEE----EEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEE Q ss_conf 2652899984784--2589999998630597-5899----9721001111036799999999741003576777589995 Q gi|254780791|r 143 FIPKIIAVITSPT--GAVIRDILQRISCRFP-LRVI----IFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILA 215 (529) Q Consensus 143 ~~p~~i~vits~~--~a~~~D~~~~~~~r~p-~~~~----~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~ 215 (529) ..+++|+|++=.| -..+..|+..|+.+|| ..+. +..++-+=..|+.+++ ..+|++|++ T Consensus 153 ~~~~ki~vvsQTT~~~~~~~~i~~~l~~~~~~~~~~~~nTIC~aT~~RQ~a~~~La---------------~~vD~miVv 217 (304) T PRK01045 153 KDPDKLALVTQTTLSVDDTAEIIAALKARFPEIRGPRKDDICYATQNRQEAVKELA---------------KQVDLVIVV 217 (304) T ss_pred CCCCEEEEEECCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHH---------------HCCCEEEEE T ss_conf 89971899972556599999999999976836525732352077899999999877---------------418889997 Q ss_pred CCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHC------CCCCCCHH Q ss_conf 1688844422007699999997489048852057775258988641------23777214 Q gi|254780791|r 216 RGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYAAD------LRAPTPTG 269 (529) Q Consensus 216 RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~VAD------~Ra~TPTa 269 (529) =|=-|- |.-.|++-.-....|.. -|...-|....+|.-. --|+||.- T Consensus 218 Gg~nSs------NT~~L~~i~~~~~~~t~-~Ie~~~el~~~~~~~~~~VGITAGASTP~~ 270 (304) T PRK01045 218 GSKNSS------NSNRLREVAERAGAPAY-LIDDASEIDPEWFKGVKTVGVTAGASAPEW 270 (304) T ss_pred CCCCCC------CHHHHHHHHHHHCCCEE-EECCHHHCCHHHHCCCCEEEEECCCCCCHH T ss_conf 688996------48999999998799959-968816699989479998999723378899 No 367 >pfam06266 HrpF HrpF protein. The species Pseudomonas syringae encompasses plant pathogens with differing host specificities and corresponding pathovar designations. P. syringae requires the Hrp (type III protein secretion) system, encoded by a 25-kb cluster of hrp and hrc genes, in order to elicit the hypersensitive response (HR) in nonhosts or to be pathogenic in hosts. The exact function of HrpF is unknown but the protein is needed for pathogenicity. Probab=51.54 E-value=9.7 Score=17.82 Aligned_cols=22 Identities=27% Similarity=0.302 Sum_probs=10.8 Q ss_pred EEEEECCCCCHHHHHHCCHHHH Q ss_conf 8999516888444220076999 Q gi|254780791|r 211 IIILARGGGSIEDLWHFNDEMI 232 (529) Q Consensus 211 ~iii~RGGGS~eDL~~FN~e~l 232 (529) +..=+-+|||.||+.+|||..- T Consensus 25 a~~~ase~~s~~D~~Af~eAs~ 46 (74) T pfam06266 25 AALEASEGFSQDDSHAFNEASM 46 (74) T ss_pred HHHHCCCCCCHHHHHHHHHHHH T ss_conf 9974336898889999999999 No 368 >cd04481 RPA1_DBD_B_like RPA1_DBD_B_like: A subgroup of uncharacterized, plant OB folds with similarity to the third OB fold, the ssDNA-binding domain (DBD)-B, of human RPA1 (also called RPA70). RPA1 is the large subunit of Replication protein A (RPA). RPA is a nuclear ssDNA-binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). In addition to DBD-B, RPA1 contains three other OB folds: DBD-A, DBD-C, and RPA1N. The major DNA binding activity of RPA is associated with RPA1 DBD-A and DBD-B. RPA1 DBD-C is involved in trimerization. The ssDNA binding mechanism is believed to be multistep and to involve conformational change. Probab=51.48 E-value=16 Score=15.92 Aligned_cols=64 Identities=16% Similarity=0.291 Sum_probs=35.4 Q ss_pred EEEEEEECCCCCC-CCCCEEEEEEECCC-CEEEEEEECCCCCCCCCCC---CCCCEEEE--E-EEEEEECCC Q ss_conf 8999997054356-88862799987489-4799999735210586681---45988999--9-996675288 Q gi|254780791|r 35 VCVRGEISGYRGI-HSSGHAYFSLKDNH-SRIDAIIWKGTLNKIEFLP---EEGIEFLV--I-GKITTFPGS 98 (529) Q Consensus 35 ~~v~gEis~~~~~-~~sGH~Yf~lkd~~-a~i~~~~~~~~~~~~~~~~---~~G~~v~~--~-g~~~~y~~~ 98 (529) +--.|++..+... ..+--.=|+|+|.. ..+.|++|...+....-.. .++.-|++ . .+|.-|... T Consensus 6 i~~v~~~~~~~~~~~~~~kr~~~i~D~~~~~l~~tlwG~~A~~F~~~~~~~~~~~~VV~v~~~~~v~~~~g~ 77 (106) T cd04481 6 IVDVGPLEELPPVNKPSRKLDFEIRDLSDERLKCTLWGEYAEEFDAKFQSAGNGEPVVAVLRFWKIKEYKGP 77 (106) T ss_pred EEEECCCEEEECCCCCCEEEEEEEEECCCCEEEEEEEHHHHHHHHHHHHHCCCCCCEEEEEEEEEEEEECCC T ss_conf 989118479962897407899999968999899999947988877888751599868999998998875794 No 369 >PRK12555 chemotaxis-specific methylesterase; Provisional Probab=51.44 E-value=16 Score=15.91 Aligned_cols=80 Identities=29% Similarity=0.461 Sum_probs=42.7 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHH Q ss_conf 528999847842589999998630597-5899972100111103679999999974100357677758999516888444 Q gi|254780791|r 145 PKIIAVITSPTGAVIRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIED 223 (529) Q Consensus 145 p~~i~vits~~~a~~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eD 223 (529) |-||.|| .+++.++..++.+=+..| ++| ..+..-|+ +|++.+... ++|||++ | T Consensus 1 PirVLIV--DDs~~~R~~l~~~L~~~~~~eV--v~~A~~g~-------eAl~~~~~~-------~pDvVll--------D 54 (340) T PRK12555 1 PMNVGIV--DDSALAREALRRIIARRPDHRV--LGVATDGL-------QARDLCKAQ-------PPDVVLL--------D 54 (340) T ss_pred CCEEEEE--ECCHHHHHHHHHHHHHCCCCEE--EEEECCHH-------HHHHHHHHH-------CCCEEEE--------E T ss_conf 9889999--0988999999999960999489--99989999-------999998861-------9999997--------2 Q ss_pred H--HHCCHHHHHHHHHH-CCCE--EEEEECCC Q ss_conf 2--20076999999974-8904--88520577 Q gi|254780791|r 224 L--WHFNDEMIVRAIAN-SSIP--IISAIGHE 250 (529) Q Consensus 224 L--~~FN~e~laraI~~-~~iP--VisgIGHE 250 (529) + =--|-.++.|.|.+ +|+| ++|+..++ T Consensus 55 i~MP~mdGie~l~~I~~~~p~PVimvSs~~~~ 86 (340) T PRK12555 55 LEMPRMDGLTFLRRIMRRRPCPVLIVSSLTEA 86 (340) T ss_pred CCCCCCCHHHHHHHHHHHCCCCEEEEEECCCC T ss_conf 78899987999999998789986998503565 No 370 >PRK10653 D-ribose transporter subunit RbsB; Provisional Probab=51.24 E-value=16 Score=15.89 Aligned_cols=81 Identities=15% Similarity=0.298 Sum_probs=47.4 Q ss_pred CCEEEEEECC-CHHHHHHHHHHHHH---CCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC Q ss_conf 5289998478-42589999998630---5975899972100111103679999999974100357677758999516888 Q gi|254780791|r 145 PKIIAVITSP-TGAVIRDILQRISC---RFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGS 220 (529) Q Consensus 145 p~~i~vits~-~~a~~~D~~~~~~~---r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS 220 (529) .+.||||.+. +.--+..+++-+.+ ..+++++++.+ ++ +...+ .+.|+.+-.. .+|.||+.- T Consensus 26 ~~TIgvivp~i~npff~~v~~gie~~a~~~Gy~l~v~~s--~~-d~~~q-~~~i~~li~~-------~vdgiii~p---- 90 (295) T PRK10653 26 KDTIALVVSTLNNPFFVSLKDGAQKEADKLGYNLVVLDS--QN-NPAKE-LANVQDLTVR-------GTKILLINP---- 90 (295) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHHHHHCCCEEEEEEC--CC-CHHHH-HHHHHHHHHC-------CCCEEEECC---- T ss_conf 997999948987979999999999999976998999839--99-99999-9999999976-------998899847---- Q ss_pred HHHHHHCCHHH---HHHHHHHCCCEEEEE Q ss_conf 44422007699---999997489048852 Q gi|254780791|r 221 IEDLWHFNDEM---IVRAIANSSIPIISA 246 (529) Q Consensus 221 ~eDL~~FN~e~---laraI~~~~iPVisg 246 (529) .+.+. ..+...+..||||+- T Consensus 91 ------~d~~~~~~~~~~a~~~gIPvV~~ 113 (295) T PRK10653 91 ------TDSDAVGNAVKMANQANIPVITL 113 (295) T ss_pred ------CCHHHHHHHHHHHHHCCCEEEEE T ss_conf ------77065699999999769819999 No 371 >pfam01915 Glyco_hydro_3_C Glycosyl hydrolase family 3 C terminal domain. This domain is involved in catalysis and may be involved in binding beta-glucan. This domain is found associated with pfam00933. Probab=51.12 E-value=16 Score=15.87 Aligned_cols=24 Identities=17% Similarity=0.369 Sum_probs=16.9 Q ss_pred HCCHHHHHHHHHHCCCEEEEEECC Q ss_conf 007699999997489048852057 Q gi|254780791|r 226 HFNDEMIVRAIANSSIPIISAIGH 249 (529) Q Consensus 226 ~FN~e~laraI~~~~iPVisgIGH 249 (529) +-++.+|+++|++...|||.-+=. T Consensus 118 ~~~q~~Li~~v~~~~~~vVvVl~~ 141 (223) T pfam01915 118 PGNQDELIEAVAAAGKPVVVVLHS 141 (223) T ss_pred CHHHHHHHHHHHHHCCCEEEEEEC T ss_conf 877999999999879898999845 No 372 >pfam00582 Usp Universal stress protein family. The universal stress protein UspA is a small cytoplasmic bacterial protein whose expression is enhanced when the cell is exposed to stress agents. UspA enhances the rate of cell survival during prolonged exposure to such conditions, and may provide a general "stress endurance" activity. The crystal structure of Haemophilus influenzae UspA reveals an alpha/beta fold similar to that of the Methanococcus jannaschii MJ0577 protein, which binds ATP, though UspA lacks ATP-binding activity. Probab=50.97 E-value=14 Score=16.37 Aligned_cols=36 Identities=14% Similarity=0.448 Sum_probs=21.6 Q ss_pred CCCEEEEECCC-CCHHHHHHCCHHHHHHHHHHCCCEEEE Q ss_conf 77589995168-884442200769999999748904885 Q gi|254780791|r 208 RPDIIILARGG-GSIEDLWHFNDEMIVRAIANSSIPIIS 245 (529) Q Consensus 208 ~~D~iii~RGG-GS~eDL~~FN~e~laraI~~~~iPVis 245 (529) .+|+||++.-| |.+..+ |.--.--+-+-.|++||+. T Consensus 102 ~~dliV~G~~~~~~~~~~--~~Gs~s~~v~~~~~~pVlv 138 (140) T pfam00582 102 DADLIVMGSRGRSGLRRL--LLGSVAEKVLRHAPCPVLV 138 (140) T ss_pred CCCEEEEECCCCCCCCCC--CCCHHHHHHHHCCCCCEEE T ss_conf 788899928999864465--0173999999548998999 No 373 >cd06318 PBP1_ABC_sugar_binding_like_9 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=50.77 E-value=16 Score=15.83 Aligned_cols=36 Identities=14% Similarity=0.088 Sum_probs=16.9 Q ss_pred HHCCHH---HHHHHHHHCCC---EEEEEECCCCCCHHHHHHHCC Q ss_conf 200769---99999974890---488520577752589886412 Q gi|254780791|r 225 WHFNDE---MIVRAIANSSI---PIISAIGHETDWTLADYAADL 262 (529) Q Consensus 225 ~~FN~e---~laraI~~~~i---PVisgIGHE~D~Tl~D~VAD~ 262 (529) ||+||. ..++|+-+... ..|.|++-. ....+++.+- T Consensus 195 ~~~nD~~a~Ga~~Al~~~G~~~~v~vvG~D~~--~~~~~~i~~g 236 (282) T cd06318 195 YSENDDMALGAMRVLAEAGKTDDVKVAAADGQ--KEALALIKGG 236 (282) T ss_pred EECCCHHHHHHHHHHHHCCCCCCEEEEEECCC--HHHHHHHHCC T ss_conf 98896799999999996699998299998988--9999999839 No 374 >PHA01627 DNA binding protein Probab=50.67 E-value=9.4 Score=17.93 Aligned_cols=22 Identities=32% Similarity=0.489 Sum_probs=13.7 Q ss_pred CHHHHHHHHHHCCCEEEEEECCCC Q ss_conf 769999999748904885205777 Q gi|254780791|r 228 NDEMIVRAIANSSIPIISAIGHET 251 (529) Q Consensus 228 N~e~laraI~~~~iPVisgIGHE~ 251 (529) +.++-+..+..- ++||+|||+- T Consensus 27 ~~~Eake~v~~e--~~vSaIGH~a 48 (105) T PHA01627 27 DIEEAKELLTNE--EFVSAIGHDA 48 (105) T ss_pred CHHHHHHHHCCC--CEEEEECCHH T ss_conf 799999774445--6355406398 No 375 >PRK00132 rpsI 30S ribosomal protein S9; Reviewed Probab=50.53 E-value=15 Score=16.17 Aligned_cols=31 Identities=19% Similarity=0.222 Sum_probs=20.4 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHH Q ss_conf 3679999999974100357677758999516888444 Q gi|254780791|r 187 PKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIED 223 (529) Q Consensus 187 ~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eD 223 (529) -..|..-|...+.. ..||+.|-++|||-..- T Consensus 45 r~~i~~Pl~~~~~~------~~~DI~i~V~GGG~sGQ 75 (130) T PRK00132 45 RMVVRQPLELTETE------GKYDIKVNVKGGGLSGQ 75 (130) T ss_pred HHHHHHHHHHHCCC------CCEEEEEEEECCCHHHH T ss_conf 99998589874745------64038999979889899 No 376 >PRK08402 replication factor A; Reviewed Probab=50.50 E-value=16 Score=15.80 Aligned_cols=83 Identities=17% Similarity=0.194 Sum_probs=46.8 Q ss_pred HHHHHHHHCC-CCEEEEEEECC---CC----CCCCCCEE-EEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEE Q ss_conf 9999974001-71899999705---43----56888627-9998748947999997352105866814598899999966 Q gi|254780791|r 23 HLKHIVESNL-SHVCVRGEISG---YR----GIHSSGHA-YFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKIT 93 (529) Q Consensus 23 ~i~~~l~~~~-~~~~v~gEis~---~~----~~~~sGH~-Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~ 93 (529) .|+++..+-= .+|-|.|-|-. .+ ..-+.|.+ -+.+-|+..+++.|+|-......--+++.|+-|-+. ..+ T Consensus 64 kIkdI~~Gm~~rnVeivgrIi~ky~~REf~kkDGS~GkVASL~I~DeTG~aRLvLWD~~v~~yy~~l~~GDVIKv~-~~~ 142 (357) T PRK08402 64 KISDLYPGMDPREVNIVGRILKKYPPREYTKKDGSTGRVASLIIYDDTGRARVVLWDAKVAEYYNKLEVGDVIKVL-DAQ 142 (357) T ss_pred CHHHHHCCCCCCCCEEEEEEEEECCCHHHHHCCCCCCEEEEEEEECCCCCEEEEEECHHHHHHHHHCCCCCEEEEE-CHH T ss_conf 5777507667567579998733058055300369875378999973788389998338789889753768669983-313 Q ss_pred EECC-CCEEEEEEE Q ss_conf 7528-843799999 Q gi|254780791|r 94 TFPG-SSKYQIIIE 106 (529) Q Consensus 94 ~y~~-~g~~ql~v~ 106 (529) |-+. +|-++|.|. T Consensus 143 Vres~~g~peLhv~ 156 (357) T PRK08402 143 VRESLSGLPELHIN 156 (357) T ss_pred HHHHCCCCHHEEEC T ss_conf 44311797120340 No 377 >pfam01933 UPF0052 Uncharacterized protein family UPF0052. Probab=50.41 E-value=16 Score=15.79 Aligned_cols=128 Identities=20% Similarity=0.168 Sum_probs=73.3 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHCCCE-EEEEEECCCCCCCHHHHHH Q ss_conf 007999999999976540122610016310265289998478425899999986305975-8999721001111036799 Q gi|254780791|r 113 SGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCRFPL-RVIIFPVKVQGDECPKEIA 191 (529) Q Consensus 113 ~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r~p~-~~~~~p~~vQG~~a~~~i~ 191 (529) .|++....+.+.+.|.-.| +=.|.-.-|-+..+.|....-..+.. .+-+...++ .+.+-|.. ..|.++.+ T Consensus 97 ~g~~~~ai~~~~~~l~i~g-----~VlP~t~~~v~l~~~~~dG~~~~ge~-~I~~~~~~i~~v~~~~~~---~~~~p~al 167 (291) T pfam01933 97 LGSLDEAIDLLSRVLGVKG-----KVLPMSNDPVALVAELQDGREVRGES-HIAEQPGKIRRVFLEPED---PKAAPEAV 167 (291) T ss_pred CCCHHHHHHHHHHHHCCCC-----EEEECCCCCEEEEEEECCCCEEEEEE-EEECCCCCCEEEEEECCC---CCCCHHHH T ss_conf 6999999999999868996-----69946588546899977996998028-981179974089986799---99898999 Q ss_pred HHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEE--EEE----CCCCCCHHHHHHHC Q ss_conf 99999974100357677758999516888444220076999999974890488--520----57775258988641 Q gi|254780791|r 192 NAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPII--SAI----GHETDWTLADYAAD 261 (529) Q Consensus 192 ~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVi--sgI----GHE~D~Tl~D~VAD 261 (529) +||+. -|+|||+=|-==..=+-.+--..+..||.+++-|+| +-| |--..+|.+|+|-- T Consensus 168 ~AI~~------------AD~IiigPgs~~tSI~P~L~v~gi~eAi~~s~a~~V~V~ni~~~~GeT~~~~~~dhv~~ 231 (291) T pfam01933 168 EAIKD------------ADLIVLGPGSLFTSIGPNLLVPGIAEALRKSKAKKVYVSNIMTQPGETGGFSAADHVDA 231 (291) T ss_pred HHHHH------------CCEEEECCCCCHHHHCCHHCCHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCHHHHHHH T ss_conf 99985------------99899869961666431303626999998679988999668899988789888998999 No 378 >cd05776 DNA_polB_alpha_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase alpha. DNA polymerase alpha is a family-B DNA polymerase with a catalytic subunit that contains a DnaQ-like 3'-5' exonuclease domain. It is one of the three DNA-dependent type B DNA polymerases (delta and epsilon are the other two) that have been identified as essential for nuclear DNA replication in eukaryotes. DNA polymerase alpha is almost exclusively required for the initiation of DNA replication and the priming of Okazaki fragments during elongation. It associates with DNA primase and is the only enzyme able to start DNA synthesis de novo. The catalytic subunit contains both polymerase and 3'-5' exonuclease domains, but only exhibits polymerase activity. The 3'-5' exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, without the four conserved acidic residues that are crucial for metal binding and catalysis. This explains why in most organisms, that no specific repair r Probab=50.39 E-value=16 Score=15.78 Aligned_cols=52 Identities=25% Similarity=0.374 Sum_probs=40.0 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECC Q ss_conf 10367999999997410035767775899951688844422007699999997489048852057 Q gi|254780791|r 185 ECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGH 249 (529) Q Consensus 185 ~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGH 249 (529) ..+.++..++-..-..- +||+|+= =+++.|+=+-|...+..+++|-||-||- T Consensus 80 ~sEr~LL~~f~~~i~~~------DPDii~G-------hni~~fdl~~L~~R~~~lki~~wsriGR 131 (234) T cd05776 80 ENERALLNFFLAKLQKI------DPDVLVG-------HDLEGFDLDVLLSRIQELKVPHWSRIGR 131 (234) T ss_pred CCHHHHHHHHHHHHHHC------CCCEEEE-------ECCCCCCHHHHHHHHHHHCCCCCCCCCC T ss_conf 88999999999999850------9989985-------0637775999999999809985331355 No 379 >PRK07455 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional Probab=50.36 E-value=16 Score=15.78 Aligned_cols=83 Identities=24% Similarity=0.340 Sum_probs=51.8 Q ss_pred EEEEEECCCHHHHHHHHHHHHHC-CC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHH- Q ss_conf 89998478425899999986305-97-5899972100111103679999999974100357677758999516888444- Q gi|254780791|r 147 IIAVITSPTGAVIRDILQRISCR-FP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIED- 223 (529) Q Consensus 147 ~i~vits~~~a~~~D~~~~~~~r-~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eD- 223 (529) -|+||...+..-..++.+.+.+- ++ ++| +....+|.+ +|+.+... -+|++| |-|++-| T Consensus 15 vi~Vlr~~~~~~a~~~~~al~~gGi~~iEi-----Tl~t~~a~~----~I~~l~~~-------~p~~~i---GaGTV~~~ 75 (210) T PRK07455 15 AIAVIRAPDLELGLQMAEAVAAGGMRLIEI-----TWNSDQPAE----LISQLREK-------LPECII---GTGTLLTL 75 (210) T ss_pred EEEEEECCCHHHHHHHHHHHHHCCCCEEEE-----ECCCCCHHH----HHHHHHHH-------CCCCEE---EEEECCCH T ss_conf 799997599999999999999879988999-----689988999----99999987-------899689---88818789 Q ss_pred --------------HHHCCHHHHHHHHHHCCCEEEEEEC Q ss_conf --------------2200769999999748904885205 Q gi|254780791|r 224 --------------LWHFNDEMIVRAIANSSIPIISAIG 248 (529) Q Consensus 224 --------------L~~FN~e~laraI~~~~iPVisgIG 248 (529) ..+.-+++|+++-.+..+|+|-|+- T Consensus 76 e~~~~a~~aGA~FiVSP~~~~~vi~~a~~~~i~~iPGv~ 114 (210) T PRK07455 76 EDLEEAIAAGAQFCFTPHVDLELIQAAVAADIPIIPGAL 114 (210) T ss_pred HHHHHHHHCCCCEEECCCCCHHHHHHHHHCCCCEECCCC T ss_conf 999999986999998688889999999982997658869 No 380 >TIGR01425 SRP54_euk signal recognition particle protein SRP54; InterPro: IPR006325 The signal recognition particle (SRP) is a multimeric protein, which along with its conjugate receptor (SR), is involved in targeting secretory proteins to the rough endoplasmic reticulum (RER) membrane in eukaryotes, or to the plasma membrane in prokaryotes , . SRP recognises the signal sequence of the nascent polypeptide on the ribosome, retards its elongation, and docks the SRP-ribosome-polypeptide complex to the RER membrane via the SR receptor. SRP consists of six polypeptides (SRP9, SRP14, SRP19, SRP54, SRP68 and SRP72) and a single 300 nucleotide 7S RNA molecule. The RNA component catalyses the interaction of SRP with its SR receptor . In higher eukaryotes, the SRP complex consists of the Alu domain and the S domain linked by the SRP RNA. The Alu domain consists of a heterodimer of SRP9 and SRP14 bound to the 5' and 3' terminal sequences of SRP RNA. This domain is necessary for retarding the elongation of the nascent polypeptide chain, which gives SRP time to dock the ribosome-polypeptide complex to the RER membrane. This entry represents the 54 kDa SRP54 component, a GTP-binding protein that interacts with the signal sequence when it emerges from the ribosome. SRP54 of the signal recognition particle has a three-domain structure: an N-terminal helical bundle domain, a GTPase domain, and the M-domain that binds the 7s RNA and also binds the signal sequence. The extreme C-terminal region is glycine-rich and lower in complexity and poorly conserved between species.; GO: 0005525 GTP binding, 0008312 7S RNA binding, 0006614 SRP-dependent cotranslational protein targeting to membrane, 0048500 signal recognition particle. Probab=50.28 E-value=7.2 Score=18.95 Aligned_cols=142 Identities=22% Similarity=0.339 Sum_probs=75.1 Q ss_pred CCCCCCCCC-EEEEEEEE------------EEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 866814598-89999996------------67528843799999710168007999999999976540122610016310 Q gi|254780791|r 76 IEFLPEEGI-EFLVIGKI------------TTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKNPIP 142 (529) Q Consensus 76 ~~~~~~~G~-~v~~~g~~------------~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP 142 (529) -+|+|+-|- .|+..+-+ -+|..+=+|.--+---|=-=-|+ |++||+ + ..|=.|| T Consensus 112 kk~~p~Kgk~~ViMfVGLQGaGKTTtctKLA~YYk~rGfK~~lvCADTFRAGA----FdQLkq--------N-A~kA~iP 178 (453) T TIGR01425 112 KKFTPKKGKSSVIMFVGLQGAGKTTTCTKLAYYYKRRGFKPALVCADTFRAGA----FDQLKQ--------N-ATKAKIP 178 (453) T ss_pred CEEECCCCCCEEEEEEECCCCCHHHHHHHHHHHHHCCCCCEEEECCCCCCCCH----HHHHHH--------H-HHHCCCC T ss_conf 03411568821588862148871566878777763266432565177542324----899987--------4-7644897 Q ss_pred CC-------CCEEE----------------EEEC---CCHHHHHHHHHHHHHC-CC-EEEEEEECCCCCCCHHHHHHHHH Q ss_conf 26-------52899----------------9847---8425899999986305-97-58999721001111036799999 Q gi|254780791|r 143 FI-------PKIIA----------------VITS---PTGAVIRDILQRISCR-FP-LRVIIFPVKVQGDECPKEIANAI 194 (529) Q Consensus 143 ~~-------p~~i~----------------vits---~~~a~~~D~~~~~~~r-~p-~~~~~~p~~vQG~~a~~~i~~ai 194 (529) +| |=+|| |=|| ..-+.+-.=+.-+.+= =| .-|++...+. |..|. +=++|. T Consensus 179 FYGsy~E~DPVkiA~EGv~~Fk~E~~diIivDTSGRHkQe~~LF~Em~qv~~Ai~Pd~iifVMDGsI-GQAA~-~QAkAF 256 (453) T TIGR01425 179 FYGSYLESDPVKIASEGVEKFKKEKFDIIIVDTSGRHKQEEELFEEMVQVAEAIKPDSIIFVMDGSI-GQAAF-SQAKAF 256 (453) T ss_pred CCCCCCCCCCEEEECCCHHHHHCCCCCEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCEEEEECCCH-HHHHH-HHHHHH T ss_conf 1201048987078002011322127847998379873225888899876863349983699806616-67889-999986 Q ss_pred HHHHHHCCCCCCCCCCEEEE------ECCCCCHHHHHHCCHHHHHHHHHHCCCEEE-EEECCCCCC Q ss_conf 99974100357677758999------516888444220076999999974890488-520577752 Q gi|254780791|r 195 LQLNTLKEGRTCPRPDIIIL------ARGGGSIEDLWHFNDEMIVRAIANSSIPII-SAIGHETDW 253 (529) Q Consensus 195 ~~~~~~~~~~~~~~~D~iii------~RGGGS~eDL~~FN~e~laraI~~~~iPVi-sgIGHE~D~ 253 (529) +... .+=.||| +.|||++. |||+.+.||| =|-|=.+|. T Consensus 257 K~~~---------~vGSvIiTKLDGHAkGGGALS------------AVAATKsPiiFIGTGEhv~d 301 (453) T TIGR01425 257 KDSV---------EVGSVIITKLDGHAKGGGALS------------AVAATKSPIIFIGTGEHVDD 301 (453) T ss_pred HHCC---------CCEEEEEECCCCCCCCCHHHH------------HHHHHCCCEEEECCCCCCCC T ss_conf 3003---------500388751567767623788------------98753597798137750276 No 381 >pfam00380 Ribosomal_S9 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes. Probab=50.19 E-value=16 Score=15.76 Aligned_cols=31 Identities=26% Similarity=0.372 Sum_probs=20.3 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHH Q ss_conf 0367999999997410035767775899951688844 Q gi|254780791|r 186 CPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIE 222 (529) Q Consensus 186 a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~e 222 (529) .-..|..-|...+.. ..||+.|-++|||-.. T Consensus 35 ~r~~v~~Pl~~~~~~------~~~Di~i~V~GGG~sg 65 (121) T pfam00380 35 LRMKILEPLELTGTL------GKFDIVVTVKGGGISG 65 (121) T ss_pred HHHHHHHHHHHHHHC------CCCCEEEEEECCCHHH T ss_conf 999999799986004------6642699996888858 No 382 >PRK08284 precorrin 6A synthase; Provisional Probab=50.00 E-value=17 Score=15.74 Aligned_cols=68 Identities=24% Similarity=0.370 Sum_probs=43.2 Q ss_pred CCCHHHHH-HHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHH Q ss_conf 11103679-999999974100357677758999516888444220076999999974890488520577752589886 Q gi|254780791|r 183 GDECPKEI-ANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYA 259 (529) Q Consensus 183 G~~a~~~i-~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~V 259 (529) |-+.|+-| ++|++.+++ .|||++..-|.+..||..+-.+.+.+.+..-..+|+.-.=-++|..-.||. T Consensus 10 GpGdPellTlqAv~aL~~---------ADVif~p~kg~~k~~La~lRr~I~~~~i~~~~~riv~~~~P~Rd~~~~~Y~ 78 (253) T PRK08284 10 GAGDPDHLTLQAVAALNR---------ADVFFVPDKGADKDDLAALRREICARYITEPGYRVVEFDDPVRDRAPADYR 78 (253) T ss_pred CCCCHHHHHHHHHHHHHC---------CCEEEEECCCCCCHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCHHHH T ss_conf 799848879999999960---------999999478877225799999999986477883489637897666620178 No 383 >PRK02102 ornithine carbamoyltransferase; Validated Probab=49.91 E-value=17 Score=15.73 Aligned_cols=10 Identities=20% Similarity=0.581 Sum_probs=4.4 Q ss_pred HCCCEEEEEE Q ss_conf 4890488520 Q gi|254780791|r 238 NSSIPIISAI 247 (529) Q Consensus 238 ~~~iPVisgI 247 (529) .+.+|||-|- T Consensus 121 ~~~vPVIN~~ 130 (331) T PRK02102 121 YSGVPVWNGL 130 (331) T ss_pred HCCCCEECCC T ss_conf 4798645488 No 384 >PRK00197 proA gamma-glutamyl phosphate reductase; Provisional Probab=49.89 E-value=17 Score=15.72 Aligned_cols=25 Identities=24% Similarity=0.793 Sum_probs=16.6 Q ss_pred CCEEEEECCCCCHHHHHHCCHHHHHHHHH-HCCCEEEE Q ss_conf 75899951688844422007699999997-48904885 Q gi|254780791|r 209 PDIIILARGGGSIEDLWHFNDEMIVRAIA-NSSIPIIS 245 (529) Q Consensus 209 ~D~iii~RGGGS~eDL~~FN~e~laraI~-~~~iPVis 245 (529) +|+| |-|||. .|+++|. ++++|||- T Consensus 197 vdli-iprGg~-----------~li~~v~~~a~vPvi~ 222 (416) T PRK00197 197 VDVL-IPRGGA-----------GLIKRVVEEATVPVIE 222 (416) T ss_pred CCEE-EECCCH-----------HHHHHHHHHCCCCEEE T ss_conf 3389-936865-----------7889998637787896 No 385 >PRK13288 pyrophosphatase PpaX; Provisional Probab=49.77 E-value=17 Score=15.71 Aligned_cols=105 Identities=13% Similarity=0.130 Sum_probs=64.6 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEE-ECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHH Q ss_conf 28999847842589999998630597589997-21001111036799999999741003576777589995168884442 Q gi|254780791|r 146 KIIAVITSPTGAVIRDILQRISCRFPLRVIIF-PVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDL 224 (529) Q Consensus 146 ~~i~vits~~~a~~~D~~~~~~~r~p~~~~~~-p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL 224 (529) -++||+|+.......-++..+.=.--|..++- .-.-.++-.|.-+..|++.++... -++|+| |=|..| T Consensus 99 ~~l~ivTn~~~~~~~~~l~~~gl~~~Fd~iv~~ddv~~~KP~P~~~~~a~~~l~~~p-------~e~l~V---GDs~~D- 167 (214) T PRK13288 99 YKLGIVTTKARDTVEMGLKLTGLDKFFDVVVTLDDVEHAKPDPEPVQKALELLGAKP-------EEALMV---GDNYHD- 167 (214) T ss_pred CCEEEEECCCHHHHHHHHHHHCCCCCCCEEEECCCCCCCCCCHHHHHHHHHHHCCCC-------CCEEEE---ECCHHH- T ss_conf 926775357569999999972221443157861334456987999999999939597-------868999---689899- Q ss_pred HHCCHHHHHHHHHHCCCEEE-EEECCCCCCHHHHHHHCCCCCCCHH Q ss_conf 20076999999974890488-5205777525898864123777214 Q gi|254780791|r 225 WHFNDEMIVRAIANSSIPII-SAIGHETDWTLADYAADLRAPTPTG 269 (529) Q Consensus 225 ~~FN~e~laraI~~~~iPVi-sgIGHE~D~Tl~D~VAD~Ra~TPTa 269 (529) +.+--+.-+++| .+-|+..-..|.+.=||.-..+|+. T Consensus 168 --------i~aA~~aG~~~i~v~~g~~~~~~l~~~~ad~vi~~~~e 205 (214) T PRK13288 168 --------ILAGKNAGTKTAGVAWTIKGREYLEQYKPDFMLDKMSD 205 (214) T ss_pred --------HHHHHHCCCEEEEEECCCCCHHHHHHCCCCEEECCHHH T ss_conf --------99999959959999389998767976899999899999 No 386 >PRK13396 3-deoxy-7-phosphoheptulonate synthase; Provisional Probab=49.62 E-value=17 Score=15.69 Aligned_cols=69 Identities=19% Similarity=0.318 Sum_probs=40.0 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCC--HHHHHHHHHH-CCCEEEEEECCCCCCHHHHHHHCC Q ss_conf 0367999999997410035767775899951688844422007--6999999974-890488520577752589886412 Q gi|254780791|r 186 CPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFN--DEMIVRAIAN-SSIPIISAIGHETDWTLADYAADL 262 (529) Q Consensus 186 a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN--~e~laraI~~-~~iPVisgIGHE~D~Tl~D~VAD~ 262 (529) ...+...|.+++.... +.++++.=||==+.|.-+.-| |...+-.+.+ +.+|||.---|-+-. -|||..+ T Consensus 220 ti~ewl~AaEyi~~~G------n~~viLcERGirtfe~~~~RntlDl~aip~~k~~thlPVi~DPSH~~G~--r~~V~~l 291 (352) T PRK13396 220 TIDEWLMAAEYILAAG------NPNVILCERGIRTFDRQYTRNTLDLSVIPVLRSLTHLPIMIDPSHGTGK--SEYVPSM 291 (352) T ss_pred CHHHHHHHHHHHHHCC------CCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEEECCCCCCCC--HHHHHHH T ss_conf 9999986999999769------9858999489775667654677557887999748999889789864578--7279999 No 387 >PRK13586 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Provisional Probab=49.62 E-value=17 Score=15.69 Aligned_cols=73 Identities=21% Similarity=0.281 Sum_probs=34.6 Q ss_pred HHHHHHHHHHCCCEEEEEEEC----CCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCC-CCHHHHHHCCHH--- Q ss_conf 999999863059758999721----001111036799999999741003576777589995168-884442200769--- Q gi|254780791|r 159 IRDILQRISCRFPLRVIIFPV----KVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGG-GSIEDLWHFNDE--- 230 (529) Q Consensus 159 ~~D~~~~~~~r~p~~~~~~p~----~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGG-GS~eDL~~FN~e--- 230 (529) +.|++..+.+...-++++-.+ +.+|-+ .++.+.+.... ..++++ || ||++|+....+. T Consensus 147 ~~~~i~~~~~~g~~~ii~TdI~~DGt~~G~d--~~l~~~i~~~~-----------~~~i~a-GGi~s~~Di~~L~~~G~~ 212 (231) T PRK13586 147 VEDAISHVNSLESLGVIFTYVCNEGTKNGID--NNVKRYVKLVK-----------GEKEYA-GGIGSIQDLQKLKKMGFD 212 (231) T ss_pred HHHHHHHHHHCCCCEEEEEEECCHHCCCCCC--HHHHHHHHHCC-----------CCCEEE-CCCCCHHHHHHHHHCCCC T ss_conf 9999999997599889997645112036899--89999998718-----------995998-688999999999867998 Q ss_pred --HHHHHHHHCCCEEEE Q ss_conf --999999748904885 Q gi|254780791|r 231 --MIVRAIANSSIPIIS 245 (529) Q Consensus 231 --~laraI~~~~iPVis 245 (529) .+.+|+|+-.|--|+ T Consensus 213 gaivG~Aly~G~i~~~~ 229 (231) T PRK13586 213 YAIVGMSFYAGVLRGIT 229 (231) T ss_pred EEEEEHHHHCCCCCCEE T ss_conf 89999788688266073 No 388 >PTZ00195 60S ribosomal protein L18; Provisional Probab=49.47 E-value=17 Score=15.67 Aligned_cols=87 Identities=13% Similarity=0.175 Sum_probs=42.8 Q ss_pred CCCCCCCCCHHHHHHHHHHH----HHHCC-CCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCCCCCC Q ss_conf 88889862299999999999----74001-71899999705435688862799987489479999973521058668145 Q gi|254780791|r 8 NSLDHPEYSVSELSYHLKHI----VESNL-SHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEFLPEE 82 (529) Q Consensus 8 ~~~~~~~~svs~l~~~i~~~----l~~~~-~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~ 82 (529) .-.|.||+|+|.|...++.. ..+.= .-..|+|-|++=. + + |++ ..-+|-|-=|-..++. --++. T Consensus 55 Sr~nRpPlSlSrl~~~mk~~~~~~~~g~~~kiaVvVGtVTDD~-R-----l-~ev--Pkl~VcALrfT~~AR~--RIlka 123 (198) T PTZ00195 55 SRSNRAPISLSRIAVVMKRKAVFTAKGKKAPIAVVVGDVLDDV-R-----M-ARI--PAMRVCALRFSKSARQ--SIVAA 123 (198) T ss_pred CCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCEEEEEEEECCCH-H-----H-HCC--CCCEEEEEEECHHHHH--HHHHC T ss_conf 3457996119999999863123347887687799983032653-4-----3-137--6514887651388999--99963 Q ss_pred CCEEEEEEEEEEECCCCEEEEEE Q ss_conf 98899999966752884379999 Q gi|254780791|r 83 GIEFLVIGKITTFPGSSKYQIII 105 (529) Q Consensus 83 G~~v~~~g~~~~y~~~g~~ql~v 105 (529) |-+++-.-.+-+-.|.|.=.+.+ T Consensus 124 GGe~lTfDQLAl~aP~G~ntvLL 146 (198) T PTZ00195 124 GGECLTFDQLAMIAPTGKNTYLL 146 (198) T ss_pred CCEEEEHHHHHHHCCCCCCEEEE T ss_conf 98588678989858899871886 No 389 >KOG0250 consensus Probab=49.20 E-value=17 Score=15.64 Aligned_cols=16 Identities=13% Similarity=0.077 Sum_probs=7.8 Q ss_pred ECCCCEEEEEEEEEEE Q ss_conf 5288437999997101 Q gi|254780791|r 95 FPGSSKYQIIIESLIP 110 (529) Q Consensus 95 y~~~g~~ql~v~~i~~ 110 (529) ++|.|-+-.+|+-++| T Consensus 493 ~~P~GPlG~~Vtl~~~ 508 (1074) T KOG0250 493 TPPKGPLGKYVTLKEP 508 (1074) T ss_pred CCCCCCCCCEEEECCC T ss_conf 9998876543672580 No 390 >PRK12820 bifunctional aspartyl-tRNA synthetase/aspartyl/glutamyl-tRNA amidotransferase subunit C; Provisional Probab=48.59 E-value=17 Score=15.57 Aligned_cols=76 Identities=13% Similarity=0.253 Sum_probs=55.1 Q ss_pred CCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCC----CCCCCCCCCEEEEEEEEEEE--------CCCCE Q ss_conf 7189999970543568886279998748947999997352105----86681459889999996675--------28843 Q gi|254780791|r 33 SHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNK----IEFLPEEGIEFLVIGKITTF--------PGSSK 100 (529) Q Consensus 33 ~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~----~~~~~~~G~~v~~~g~~~~y--------~~~g~ 100 (529) ..|.|.|=|.+.|. .+|=++++|+|..+.+.||+=+..... .--.+..+.-|-|.|.|..= -+.|. T Consensus 19 ~~V~l~GWV~~~R~--~G~l~FidLRD~~G~vQvv~~~~~~~~d~~~~~~~l~~E~vI~V~G~V~~r~~~~~n~~~~tGe 96 (706) T PRK12820 19 REVCLAGWVDAFRD--HGELLFIHLRDRNGFIQAVFSPEAAPADVYELAASLRAEFCVALQGEVQKRLEETENPHIETGD 96 (706) T ss_pred CEEEEEEEEEEEEC--CCCEEEEEEEECCCCEEEEECCCCCCHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCC T ss_conf 98999786878966--8984999997289458999868868888999985399867899998999347656688898861 Q ss_pred EEEEEEEEEE Q ss_conf 7999997101 Q gi|254780791|r 101 YQIIIESLIP 110 (529) Q Consensus 101 ~ql~v~~i~~ 110 (529) +.+.|+.++. T Consensus 97 iEI~v~~l~i 106 (706) T PRK12820 97 IEVFVRELSI 106 (706) T ss_pred EEEEEEEEEE T ss_conf 8999769999 No 391 >pfam07411 DUF1508 Domain of unknown function (DUF1508). This family represents a series of bacterial domains of unknown function of around 50 residues in length. Members of this family are often found as tandem repeats and in some cases represent the whole protein. All member proteins are described as being hypothetical. Probab=48.56 E-value=12 Score=16.84 Aligned_cols=18 Identities=28% Similarity=0.410 Sum_probs=14.5 Q ss_pred CCCEEEEEEECCCCEEEE Q ss_conf 886279998748947999 Q gi|254780791|r 49 SSGHAYFSLKDNHSRIDA 66 (529) Q Consensus 49 ~sGH~Yf~lkd~~a~i~~ 66 (529) ++|.+||.||..+.+|=+ T Consensus 2 ~~g~~~f~Lka~NgeiI~ 19 (49) T pfam07411 2 KNGEFRFRLKAANGQVIA 19 (49) T ss_pred CCCCEEEEEEECCCCEEE T ss_conf 988889999937998998 No 392 >pfam04555 XhoI Restriction endonuclease XhoI. This family consists of type II restriction enzymes (EC:3.1.21.4) that recognize the double-stranded sequence CTCGAG and cleave after C-1. Probab=48.42 E-value=13 Score=16.63 Aligned_cols=127 Identities=20% Similarity=0.263 Sum_probs=74.2 Q ss_pred HHHHHHHHHHHHH-CCCC--EEEEE---EECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCC-CC--CCC--CCC--C- Q ss_conf 9999999999740-0171--89999---97054356888627999874894799999735210-58--668--145--9- Q gi|254780791|r 18 SELSYHLKHIVES-NLSH--VCVRG---EISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLN-KI--EFL--PEE--G- 83 (529) Q Consensus 18 s~l~~~i~~~l~~-~~~~--~~v~g---Eis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~-~~--~~~--~~~--G- 83 (529) --+...|.+.+.. .+++ |.+.+ +|-+| .+|+ --|=+-..++..-+-++=+++... .. .|. -++ | T Consensus 24 dgf~~li~~~~~~~Gl~~a~i~~~~~~l~lPGY-fRpt-K~WDllVv~~g~LvAaiElKSqvGpSfGnNfNNR~EEAlG~ 101 (196) T pfam04555 24 DGFAALVLDIIRANGLAHAEIFQQRTALTLPGY-FRPT-KLWDLLVVQKGVLIAAVELKSQVGPSFGNNFNNRAEEAIGT 101 (196) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHCCCCCCCCCC-CCCC-CCCCEEEEECCEEEEEEEECCCCCCCCCCCCCCHHHHHHCC T ss_conf 889999999999859977787524753105654-2445-65217999889388998730003865467655067887145 Q ss_pred -------CEEEEEEEEEEECCCCEEEEEEEE----------------EEECCCCHHH-HHHHHHHHHHHHHHCCCCCCCC Q ss_conf -------889999996675288437999997----------------1016800799-9999999976540122610016 Q gi|254780791|r 84 -------IEFLVIGKITTFPGSSKYQIIIES----------------LIPSGSGTLL-TALEKRKKKLLEEGLFSDQHKN 139 (529) Q Consensus 84 -------~~v~~~g~~~~y~~~g~~ql~v~~----------------i~~~g~G~l~-~~~e~lk~~L~~eGlfd~~~k~ 139 (529) ..=-+.|+ + -.|.-+|=+.+++ ++|.+.|.=| .+|+-+.+||-.|||++.. T Consensus 102 a~Dl~tA~re~~Fgk-~-~~pw~G~l~l~Ed~p~s~~pV~~~~phfpv~p~F~gaSY~~Ryei~CeRLv~e~lY~aa--- 176 (196) T pfam04555 102 AHDLWTAYREGAFGK-Q-PRPFLGWLMLVEDCPESRAAVADRSPHFPVFPEFKGASYLKRYELLCQKLVLERLYTTA--- 176 (196) T ss_pred HHHHHHHHHHHHCCC-C-CCCCEEEEEEEECCCCCCCCCCCCCCCCCCCHHHCCCCHHHHHHHHHHHHHHHCCCCCE--- T ss_conf 889999998513067-8-88715789998217322477657888888776554653999999999999984444300--- Q ss_pred CCCCCCCEEEEEECCCHHHHH Q ss_conf 310265289998478425899 Q gi|254780791|r 140 PIPFIPKIIAVITSPTGAVIR 160 (529) Q Consensus 140 ~lP~~p~~i~vits~~~a~~~ 160 (529) ++|||+.++++. T Consensus 177 ---------~~itS~~~~~~~ 188 (196) T pfam04555 177 ---------CLIASPRQAGLT 188 (196) T ss_pred ---------EEEECCCCCCCC T ss_conf ---------688626656666 No 393 >pfam06148 COG2 COG (conserved oligomeric Golgi) complex component, COG2. The COG complex comprises eight proteins COG1-8. The COG complex plays critical roles in Golgi structure and function. The proposed function of the complex is to mediate the initial physical contact between transport vesicles and their membrane targets. A comparable role in tethering vesicles has been suggested for at least six additional large multisubunit complexes, including the exocyst, a complex that mediates trafficking to the plasma membrane. COG2 structure reveals a six-helix bundle with few conserved surface features but a general resemblance to recently determined crystal structures of four different exocyst subunits. These bundles inCOG2 may act as platforms for interaction with other trafficing proteins including SNAREs (soluble N-ethylmaleimide factor attachment protein receptors) and Rabs. Probab=48.31 E-value=17 Score=15.54 Aligned_cols=68 Identities=26% Similarity=0.328 Sum_probs=44.0 Q ss_pred CCCHHHHHHHCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 752589886412377721456763323467776699999888778998889999855677653049788986788878 Q gi|254780791|r 251 TDWTLADYAADLRAPTPTGAAEMAVPVKEHLQSSLINLEARLNNIIIRLIKYKINTLNSLLKALPNSDQTLSCSRYRL 328 (529) Q Consensus 251 ~D~Tl~D~VAD~Ra~TPTaAAElavp~~~EL~~~L~~l~~RL~~a~~~~l~~~~q~L~~l~r~L~~~~~~L~~~~QrL 328 (529) .||-...||+++|--+|= ..+...+..+..-|...+...++..-+..-.+...|...+..+...+.-| T Consensus 11 ~~Fd~~~fl~~~~~~~~L----------e~L~~dL~~~~~~L~~~L~~lIN~dY~dFvsLs~~L~g~~~~i~~l~~pL 78 (133) T pfam06148 11 PDFDPDQFLSELRKGVPL----------ESLRDDLRAYLKLLKSELIELINEDYADFVSLSTNLVGLDEKLERIRVPL 78 (133) T ss_pred CCCCHHHHHHHCCCCCCH----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999775068999----------99999999999999999999998759999978887314799999999889 No 394 >cd06285 PBP1_LacI_like_7 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=48.30 E-value=17 Score=15.53 Aligned_cols=12 Identities=0% Similarity=0.044 Sum_probs=5.3 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999974 Q gi|254780791|r 18 SELSYHLKHIVE 29 (529) Q Consensus 18 s~l~~~i~~~l~ 29 (529) +++...|...+. T Consensus 15 ~~l~~~i~~~~~ 26 (265) T cd06285 15 ATMYEGIEEAAA 26 (265) T ss_pred HHHHHHHHHHHH T ss_conf 999999999999 No 395 >TIGR02407 ectoine_ectB diaminobutyrate--2-oxoglutarate aminotransferase; InterPro: IPR012773 Members of this family of class III pyridoxal-phosphate-dependent aminotransferases are diaminobutyrate--2-oxoglutarate aminotransferase (2.6.1.76 from EC) that catalyze the first step in ectoine biosynthesis from L-aspartate beta-semialdehyde. This family is readily separated phylogenetically from enzymes with the same substrate and product but involved in other process such as siderophore (Q9Z3R2 from SWISSPROT) or 1,3-diaminopropane (P44951 from SWISSPROT) biosynthesis. Ectoine is a compatible solute particularly effective in conferring salt tolerance.; GO: 0030170 pyridoxal phosphate binding, 0047307 diaminobutyrate-pyruvate transaminase activity, 0019491 ectoine biosynthetic process. Probab=48.27 E-value=16 Score=15.91 Aligned_cols=58 Identities=24% Similarity=0.403 Sum_probs=33.9 Q ss_pred HHHHHHHHC-----CCEEEEEEECCCCCCC----HHHHHHHHHHHHHHHCCCCCCCCCCEEEEEC----CCCCHHHHHHC Q ss_conf 999986305-----9758999721001111----0367999999997410035767775899951----68884442200 Q gi|254780791|r 161 DILQRISCR-----FPLRVIIFPVKVQGDE----CPKEIANAILQLNTLKEGRTCPRPDIIILAR----GGGSIEDLWHF 227 (529) Q Consensus 161 D~~~~~~~r-----~p~~~~~~p~~vQG~~----a~~~i~~ai~~~~~~~~~~~~~~~D~iii~R----GGGS~eDL~~F 227 (529) =|-+.|.|. -|.-|+| =+||||+ |-.+=.++|+++.+. +|++.|+= |=|=.-.-.+| T Consensus 180 Yfe~~L~D~sSGvd~PAAvIL--ETVQGEGGiNvAs~eWLq~l~~lCr~--------~DILLIVDDIQaGCGRTG~FFSF 249 (413) T TIGR02407 180 YFEKLLEDSSSGVDLPAAVIL--ETVQGEGGINVASKEWLQRLEKLCRR--------HDILLIVDDIQAGCGRTGTFFSF 249 (413) T ss_pred HHHHHHCCCCCCCCCCCEEEE--ECCCCCCCCCCCCHHHHHHHHHHHHH--------CCCEEEEEEEECCCCCCCCCCCC T ss_conf 999861788898476725898--11037577031235579999999986--------49868773174188887896453 Q ss_pred C Q ss_conf 7 Q gi|254780791|r 228 N 228 (529) Q Consensus 228 N 228 (529) = T Consensus 250 E 250 (413) T TIGR02407 250 E 250 (413) T ss_pred C T ss_conf 4 No 396 >pfam12004 DUF3498 Domain of unknown function (DUF3498). This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 433 to 538 amino acids in length. This domain is found associated with pfam00616, pfam00168. This domain has two conserved sequence motifs: DLQ and PLSFQNP. Probab=48.20 E-value=18 Score=15.52 Aligned_cols=43 Identities=21% Similarity=0.336 Sum_probs=20.9 Q ss_pred CCCCHHHHHHHHHHHHHHCCC-CEEEEEEECCCCCC-CCCCEEEEEEECC Q ss_conf 862299999999999740017-18999997054356-8886279998748 Q gi|254780791|r 13 PEYSVSELSYHLKHIVESNLS-HVCVRGEISGYRGI-HSSGHAYFSLKDN 60 (529) Q Consensus 13 ~~~svs~l~~~i~~~l~~~~~-~~~v~gEis~~~~~-~~sGH~Yf~lkd~ 60 (529) |-++ .-+..-+++.++.++. .| +++-+.+. +.+.++||-=|-. T Consensus 29 Ps~s-~svSsgLQki~e~dl~ssi----D~tRLpSPT~enkD~ffvtR~p 73 (489) T pfam12004 29 PSVS-CGLSSGLQKIMMDDLDSSV----DFTRLPSPTPENKDLFFVTRPS 73 (489) T ss_pred CCCC-CCCCHHHHHHHHCCCCCCC----CCCCCCCCCCCCCCCEEECCCC T ss_conf 8666-6746778887633555653----3102789984556631212676 No 397 >COG3206 GumC Uncharacterized protein involved in exopolysaccharide biosynthesis [Cell envelope biogenesis, outer membrane] Probab=47.86 E-value=18 Score=15.48 Aligned_cols=22 Identities=9% Similarity=0.122 Sum_probs=10.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHC Q ss_conf 9999999999999999988716 Q gi|254780791|r 431 LSHKIKSCHTSVSITTRILQSF 452 (529) Q Consensus 431 ~~~~L~~~~~rL~~l~~~L~sl 452 (529) +++.++..++.++.+..+.+.+ T Consensus 378 L~Re~~~~r~~ye~lL~r~qe~ 399 (458) T COG3206 378 LEREAEAARSLYETLLQRYQEL 399 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999999 No 398 >COG0205 PfkA 6-phosphofructokinase [Carbohydrate transport and metabolism] Probab=47.84 E-value=18 Score=15.48 Aligned_cols=47 Identities=23% Similarity=0.299 Sum_probs=24.8 Q ss_pred HHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCC Q ss_conf 999999997410035767775899951688844422007699999997489048852057775 Q gi|254780791|r 190 IANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETD 252 (529) Q Consensus 190 i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D 252 (529) -..+++.+.+++ .|.+|++=|=||...-.-+-+|-- +||| ||=+-+| T Consensus 83 ~~~~~~~l~~~g-------Id~LvvIGGDGS~~gA~~Lae~~~--------i~vV-GvPkTID 129 (347) T COG0205 83 RKVAAENLKKLG-------IDALVVIGGDGSYTGAALLAEEGG--------IPVV-GVPKTID 129 (347) T ss_pred HHHHHHHHHHCC-------CCEEEEECCCCHHHHHHHHHHHCC--------CCEE-ECCCCCC T ss_conf 999999999839-------988999788873899999997359--------8689-6688702 No 399 >PRK02228 V-type ATP synthase subunit F; Provisional Probab=47.74 E-value=18 Score=15.47 Aligned_cols=61 Identities=13% Similarity=0.193 Sum_probs=43.8 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCC-HHHHHH Q ss_conf 3679999999974100357677758999516888444220076999999974890488520577752-589886 Q gi|254780791|r 187 PKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETDW-TLADYA 259 (529) Q Consensus 187 ~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~-Tl~D~V 259 (529) ..+..++++.+-...+ +-+||+ -+|++.==.+.+-+.+.+|-.|+|-+||-|.|. +|-|.+ T Consensus 29 ~Ee~~~~i~~ll~~~d------vGIIvi------~~~~~~~L~~~lr~~l~~sv~P~vv~IG~~~~~~~lRe~I 90 (100) T PRK02228 29 DEELDEAVESVLADDD------VGILVM------HSDDLEHLPRRLRQTLEESVEPTVVTLGGGGGGLSLREKI 90 (100) T ss_pred HHHHHHHHHHHHCCCC------EEEEEE------EHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHHHHH T ss_conf 8999999999745687------479998------3079777679999876456677799958987716699999 No 400 >PRK05599 hypothetical protein; Provisional Probab=47.62 E-value=18 Score=15.45 Aligned_cols=75 Identities=8% Similarity=0.061 Sum_probs=50.4 Q ss_pred EEECCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCH Q ss_conf 98478425899999986305975899972100111103679999999974100357677758999516888444220076 Q gi|254780791|r 150 VITSPTGAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFND 229 (529) Q Consensus 150 vits~~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~ 229 (529) |+++.+...++.+...+..+++..+.++++-|.-.++..++++.+.... ...|++|..=|-..-.+.+.+|+ T Consensus 27 vl~~R~~e~l~~l~~~l~~~g~~~v~~~~~Dvtd~~~~~~~v~~~~~~~--------g~id~lv~naGi~~~~~~~~~d~ 98 (246) T PRK05599 27 VLAARRPEAAGGLAEDLRQLGATSVHVLSFDATDLDSHRELVKQTQELA--------GEISLAVVAFGILGDQERAETDE 98 (246) T ss_pred EEEECCHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHHHHHC--------CCCEEEEECCCCCCCCCCCCCCH T ss_conf 9999999999999999986259718997289999999999999999861--------98439998776678732011899 Q ss_pred HHH Q ss_conf 999 Q gi|254780791|r 230 EMI 232 (529) Q Consensus 230 e~l 232 (529) +.. T Consensus 99 ~~~ 101 (246) T PRK05599 99 AHA 101 (246) T ss_pred HHH T ss_conf 999 No 401 >PRK04284 ornithine carbamoyltransferase; Provisional Probab=47.53 E-value=18 Score=15.44 Aligned_cols=28 Identities=18% Similarity=0.273 Sum_probs=16.7 Q ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 9988888888898622999999999997 Q gi|254780791|r 1 MNPFSQKNSLDHPEYSVSELSYHLKHIV 28 (529) Q Consensus 1 ~~~~~~~~~~~~~~~svs~l~~~i~~~l 28 (529) |+.|..+.-.+...||-.|+..-+.... T Consensus 1 m~~l~~rh~l~~~dls~~ei~~il~~A~ 28 (332) T PRK04284 1 MKNLRNRSFLTLLDFSRQEVEFLLTLSE 28 (332) T ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 9767688868711089999999999999 No 402 >TIGR02636 galM_Leloir galactose mutarotase; InterPro: IPR013458 Galactose mutarotases (D-galactose 1-epimerase) participate in the Leloir pathway for galactose/glucose interconversion. Genes encoding the proteins found in this entry are found clustered with genes encoding other enzymes of the Leloir pathway. These proteins belong to the aldose 1-epimerase family. However, the aldose 1-epimerase itself (5.1.3.3 from EC) has a relatively broad specificity and can utilise D-glucose, L-arabinose, D-xylose, D-galactose, maltose and lactose. The restricted genomic context of the genes suggests members should act primarily on D-galactose.; GO: 0004034 aldose 1-epimerase activity, 0006012 galactose metabolic process. Probab=47.41 E-value=5.2 Score=20.25 Aligned_cols=98 Identities=19% Similarity=0.225 Sum_probs=51.2 Q ss_pred CCEEEEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEE-EECCCCEEEEEEEEEEECCCCHHHHHHHHHHH-HH Q ss_conf 86279998748947999997352105866814598899999966-75288437999997101680079999999999-76 Q gi|254780791|r 50 SGHAYFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKIT-TFPGSSKYQIIIESLIPSGSGTLLTALEKRKK-KL 127 (529) Q Consensus 50 sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~-~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~-~L 127 (529) .+|+||.|.|+.+..+|-==.-+...=.|.|-|-.-.|-.|.+- |=... |=|.-.+ .|...|..=.+ .= T Consensus 167 TNHvYFNL~g~~~~~d~~~H~L~lnAd~yLP~D~~tgIP~G~l~~V~~Ts--FDFr~~K-------~i~~~f~~~~~QqQ 237 (346) T TIGR02636 167 TNHVYFNLDGAEAGSDVLSHELQLNADRYLPLDEETGIPLGELKSVDGTS--FDFRKEK-------AIGQDFLADDDQQQ 237 (346) T ss_pred CCCEEECCCCCCCCCCHHHCEEEEECCCCCCCCCCCCCCCCCEECCCCCC--CCCCCCH-------HHHHHHHCCHHHHH T ss_conf 73301126877256864265268864843012574346456520135788--5655620-------11254525834878 Q ss_pred HHHHCCCCCCCCCCCC--CCCEEEEEECCCHH Q ss_conf 5401226100163102--65289998478425 Q gi|254780791|r 128 LEEGLFSDQHKNPIPF--IPKIIAVITSPTGA 157 (529) Q Consensus 128 ~~eGlfd~~~k~~lP~--~p~~i~vits~~~a 157 (529) .+.|| |-.-=..-+. .-+-.|++|||+.. T Consensus 238 ~~kGY-DHaF~l~~~~~d~~~~~a~l~Sp~~~ 268 (346) T TIGR02636 238 LAKGY-DHAFLLEGERCDGVKPAAILTSPDED 268 (346) T ss_pred HHCCC-CCHHHCCCCCCCCCCEEEEEECCCCC T ss_conf 64062-74451577675454407999868856 No 403 >PRK13143 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=47.10 E-value=18 Score=15.39 Aligned_cols=35 Identities=20% Similarity=0.340 Sum_probs=15.8 Q ss_pred CCEEEEECCCCCHHHHHHCCH--HHHHHHHHHCCCEEE Q ss_conf 758999516888444220076--999999974890488 Q gi|254780791|r 209 PDIIILARGGGSIEDLWHFND--EMIVRAIANSSIPII 244 (529) Q Consensus 209 ~D~iii~RGGGS~eDL~~FN~--e~laraI~~~~iPVi 244 (529) +|.||| -|.||..+-+..-. .+..+..+...+||+ T Consensus 40 ~d~iIL-PGvG~~~~~~~~l~~~~~~i~~~~~~~~PiL 76 (201) T PRK13143 40 ADGIVL-PGVGAFGDVMENLGPLKETINEAVDDGKPFL 76 (201) T ss_pred CCEEEE-CCCCCHHHHHHHHHHHHHHHHHHHHCCCCEE T ss_conf 796797-3888188999876515799999998699879 No 404 >smart00537 DCX Domain in the Doublecortin (DCX) gene product. Tandemly-repeated domain in doublin, the Doublecortin gene product. Proposed to bind tubulin. Doublecortin (DCX) is mutated in human X-linked neuronal migration defects. Probab=46.93 E-value=16 Score=15.77 Aligned_cols=39 Identities=13% Similarity=0.092 Sum_probs=23.0 Q ss_pred HHHHCCHHHHHCCCEE--EEECCCCCEECCHHHCCCCCEEE Q ss_conf 8871696777314619--99848988957778929998699 Q gi|254780791|r 448 ILQSFAYKNTLKRGYT--SIQDTNNNFITQKRNLATKTRIL 486 (529) Q Consensus 448 ~L~slsP~~~L~RGYa--iv~~~~GkiI~s~~~l~~gd~i~ 486 (529) .|+.++..--|.-|++ -+++.+|+.|++.+++..|+... T Consensus 36 lL~~lT~~~~l~~~~~Vr~lyt~~G~~V~~l~~l~~g~~YV 76 (89) T smart00537 36 LLQDLTEVVKLDLPHGVRKLYTLDGKKVTSLDELEDGGSYV 76 (89) T ss_pred HHHHHHHHHCCCCCCCEEEEECCCCCEECCHHHHHCCCEEE T ss_conf 99998875416588762489979998918999971799899 No 405 >pfam04577 DUF563 Protein of unknown function (DUF563). Family of uncharacterized proteins. Probab=46.50 E-value=18 Score=15.32 Aligned_cols=130 Identities=21% Similarity=0.312 Sum_probs=84.7 Q ss_pred EEEEEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEE---------EEEEC--C Q ss_conf 999999667528843799999710168007999999999976540122610016310265289---------99847--8 Q gi|254780791|r 86 FLVIGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKII---------AVITS--P 154 (529) Q Consensus 86 v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i---------~vits--~ 154 (529) |++--.++.-+=.|..||.|.+..|-.+| +|..+.++|..=-+.|.++....==||.-| +|-++ | T Consensus 102 ~liPLfiTa~~~~geV~llv~~~~~WW~~----Ky~~il~~LS~y~vIdld~d~~vhCFp~a~VGL~~H~el~Idp~~~p 177 (246) T pfam04577 102 VLVPLYITARHFNGEVQFVVTNYSPWWLG----KYGDVLRQLSDYDVIDLDKDDRVHCFPSAIVGLRIHKELTIDPSLSP 177 (246) T ss_pred CEEHHHHHHHHCCCEEEEEEECCCHHHHH----HHHHHHHHHCCCCCCCCCCCCCEECCCCCEEEEEECCCEEECCCCCC T ss_conf 32111000262097799999478679999----99999998567984247899855046840861266265538821088 Q ss_pred CHHHHHHHHHHHHHCCCEEEEEEECCCC-CCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHH Q ss_conf 4258999999863059758999721001-111036799999999741003576777589995168884442200769999 Q gi|254780791|r 155 TGAVIRDILQRISCRFPLRVIIFPVKVQ-GDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIV 233 (529) Q Consensus 155 ~~a~~~D~~~~~~~r~p~~~~~~p~~vQ-G~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~la 233 (529) ++-.+.||.+-|.+-|.+... .+.+ |+. ...+|-++||.|+| +-. -.|+.+|+ T Consensus 178 ~~~sm~DF~~fLr~ay~l~r~---~~~~~~~~-------------------~~~rPRL~iisR~~-~R~---ilN~~evv 231 (246) T pfam04577 178 GGYSMVDFRRFLRRAYSLPRD---APIQLGEE-------------------PGRKPRLLIISRRG-SRK---ILNEDEVV 231 (246) T ss_pred CCCCHHHHHHHHHHHHCCCCC---CCCCCCCC-------------------CCCCCEEEEEECCC-CCE---ECCHHHHH T ss_conf 996489999999998578866---55447777-------------------78998699998789-625---52899999 Q ss_pred HHHHHCCCEEEE Q ss_conf 999748904885 Q gi|254780791|r 234 RAIANSSIPIIS 245 (529) Q Consensus 234 raI~~~~iPVis 245 (529) ++..+-.-=|+. T Consensus 232 ~~ae~vGFeV~v 243 (246) T pfam04577 232 RAAEEVGFEVVV 243 (246) T ss_pred HHHHHCCCEEEE T ss_conf 999984987992 No 406 >pfam11576 DUF3236 Protein of unknown function (DUF3236). This family of proteins with unknown function appears to be restricted to Methanobacteria. Probab=46.47 E-value=18 Score=15.32 Aligned_cols=81 Identities=21% Similarity=0.315 Sum_probs=47.7 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCC--HHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHH Q ss_conf 0079999999999765401226100163102652899984784--25899999986305975899972100111103679 Q gi|254780791|r 113 SGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPT--GAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEI 190 (529) Q Consensus 113 ~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~--~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i 190 (529) .||-.+..|.+...+. -.++|.|.|.-. =.++.|+++.+... ..+.+-.|+-.-.-.--+.+ T Consensus 18 ~GD~~EEve~Iq~~I~---------------sAk~ivV~t~N~kKf~vin~il~~~~~~-~~~ml~i~Tn~ADltRmPAi 81 (154) T pfam11576 18 RGDKEEEVEEIQEYIK---------------SAKRIVVPTNNEKKFKVVNDVLRRFGLP-EAEMLQIPTNSADLTRMPAV 81 (154) T ss_pred CCCHHHHHHHHHHHHH---------------CCCEEEEECCCCHHHHHHHHHHHHCCCC-CCEEEECCCCCCCCCCCCHH T ss_conf 4653999999999985---------------1564998559834568999999973797-51046646862001138077 Q ss_pred HHHHHHHHHHCCCCCCCCCCEEEEECC Q ss_conf 999999974100357677758999516 Q gi|254780791|r 191 ANAILQLNTLKEGRTCPRPDIIILARG 217 (529) Q Consensus 191 ~~ai~~~~~~~~~~~~~~~D~iii~RG 217 (529) .+||-..+.-+ -| +||+|| T Consensus 82 ~K~liAvD~sd-------AD-lvIaRG 100 (154) T pfam11576 82 SKALMALDISD-------AD-LVIARG 100 (154) T ss_pred HHHHHEEECCC-------CC-EEEECC T ss_conf 65542131368-------84-799605 No 407 >COG0788 PurU Formyltetrahydrofolate hydrolase [Nucleotide transport and metabolism] Probab=46.47 E-value=18 Score=15.32 Aligned_cols=89 Identities=18% Similarity=0.254 Sum_probs=51.6 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHC-CCEEEE---------------- Q ss_conf 007999999999976540122610016310265289998478425899999986305-975899---------------- Q gi|254780791|r 113 SGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCR-FPLRVI---------------- 175 (529) Q Consensus 113 ~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r-~p~~~~---------------- 175 (529) .-.|.+.|+.+-+++. -+-....+..+++|+|.-|..+--+.|.|-..+.. .|++|. T Consensus 64 ~~~l~~~f~~~a~~f~------m~~~~~~~~~~~ri~i~VSK~~HCL~DLL~r~~~g~L~~eI~~VIsNH~dl~~~v~~~ 137 (287) T COG0788 64 REALRAAFAPLAEEFG------MDWRLHDAAQRKRIAILVSKEDHCLGDLLYRWRIGELPAEIVAVISNHDDLRPLVERF 137 (287) T ss_pred HHHHHHHHHHHHHHHC------CEEEEECCCCCCEEEEEEECHHHHHHHHHHHHHCCCCCCCEEEEECCCHHHHHHHHHC T ss_conf 9999988998888627------6388812355755999981457779999999761886872589974877889999974 Q ss_pred ---EEECCCCCC---CHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECC Q ss_conf ---972100111---103679999999974100357677758999516 Q gi|254780791|r 176 ---IFPVKVQGD---ECPKEIANAILQLNTLKEGRTCPRPDIIILARG 217 (529) Q Consensus 176 ---~~p~~vQG~---~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RG 217 (529) .+..++-.+ .+.+.+.+-+ ... ..|+||++|= T Consensus 138 ~IPfhhip~~~~~k~e~E~~~~~ll---~~~-------~~DlvVLARY 175 (287) T COG0788 138 DIPFHHIPVTKENKAEAEARLLELL---EEY-------GADLVVLARY 175 (287) T ss_pred CCCEEECCCCCCCCHHHHHHHHHHH---HHH-------CCCEEEEHHH T ss_conf 9986654678775347899999999---985-------9987820466 No 408 >COG0133 TrpB Tryptophan synthase beta chain [Amino acid transport and metabolism] Probab=46.09 E-value=8.7 Score=18.25 Aligned_cols=38 Identities=32% Similarity=0.464 Sum_probs=23.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHH--HHHHCCH Q ss_conf 011110367999999997410035767775899951688844--4220076 Q gi|254780791|r 181 VQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIE--DLWHFND 229 (529) Q Consensus 181 vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~e--DL~~FN~ 229 (529) |=|+++-.+|.+. .+ ..||+||-|=||||-. =..+|=+ T Consensus 207 vIG~E~k~Qile~---eg--------rlPD~vvACVGGGSNAiG~F~~Fi~ 246 (396) T COG0133 207 VIGEEAKAQILEK---EG--------RLPDAVVACVGGGSNAIGIFHPFID 246 (396) T ss_pred HHHHHHHHHHHHH---HC--------CCCCEEEEECCCCCCHHHHCCCCCC T ss_conf 9869999999998---38--------9997488831677213330311138 No 409 >COG1322 Predicted nuclease of restriction endonuclease-like fold, RmuC family [General function prediction only] Probab=45.90 E-value=19 Score=15.25 Aligned_cols=48 Identities=17% Similarity=0.251 Sum_probs=26.9 Q ss_pred HHHHHHHHCCHHHHHCCCEEEEECCCCCEECCHHHCCCCCEEEEEEECEEEEEEEEECC Q ss_conf 99998871696777314619998489889577789299986999991109999995057 Q gi|254780791|r 444 ITTRILQSFAYKNTLKRGYTSIQDTNNNFITQKRNLATKTRILINFFDGQANAIVINKA 502 (529) Q Consensus 444 ~l~~~L~slsP~~~L~RGYaiv~~~~GkiI~s~~~l~~gd~i~i~l~DG~v~a~V~~k~ 502 (529) .+.+.|+...|..-+++.+-.+. .+|.-+ --++.|+|-+| ...|++.. T Consensus 200 ~Le~ILe~~gl~~~~ey~~Q~~l-~~~~~~--------~pD~~IrLP~~--~~LvIDsK 247 (448) T COG1322 200 QLERILEDSGLREGYEYETQVSL-SNEERV--------RPDVAIRLPGG--KLLVIDAK 247 (448) T ss_pred HHHHHHHHHCCHHCCCEEEEEEC-CCCCCC--------CCCEEEECCCC--CEEEEECC T ss_conf 99999997275011550467631-677667--------89868988999--71555478 No 410 >PRK08252 enoyl-CoA hydratase; Provisional Probab=45.84 E-value=19 Score=15.24 Aligned_cols=77 Identities=23% Similarity=0.268 Sum_probs=38.2 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCC-CH---HHHHHCCH----HH----HH-HHHHHCCCEEEEEE-CCCC Q ss_conf 0367999999997410035767775899951688-84---44220076----99----99-99974890488520-5777 Q gi|254780791|r 186 CPKEIANAILQLNTLKEGRTCPRPDIIILARGGG-SI---EDLWHFND----EM----IV-RAIANSSIPIISAI-GHET 251 (529) Q Consensus 186 a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGG-S~---eDL~~FN~----e~----la-raI~~~~iPVisgI-GHE~ 251 (529) -..+|..++..++... .+.|||| +|.| ++ -||-.|.. .. +. -....|+.|||++| ||-. T Consensus 31 m~~el~~~l~~~~~d~------~vrvvvl-tg~G~~F~aG~Dl~~~~~~~~~~~~~~~~~~~~~~~~~kPvIaav~G~a~ 103 (254) T PRK08252 31 VAQGLAAALDQLDADD------DLSVGIL-TGAGGTFCAGMDLKAFARGERPSIPGRGFAGLTERPPRKPLIAAVEGYAL 103 (254) T ss_pred HHHHHHHHHHHHHHCC------CEEEEEE-ECCCCCEECCCCHHHHHCCCCCCCHHHHHHHHHHCCCCCCEEEEECCEEE T ss_conf 9999999999997399------9179999-88997355553678786389743113257999970579988999878776 Q ss_pred CC-HHHHHHHCCCCCCCHH Q ss_conf 52-5898864123777214 Q gi|254780791|r 252 DW-TLADYAADLRAPTPTG 269 (529) Q Consensus 252 D~-Tl~D~VAD~Ra~TPTa 269 (529) =- .-.=+.+|.|..++.+ T Consensus 104 GgG~~la~~~D~~ias~~a 122 (254) T PRK08252 104 AGGFELALACDLIVAARDA 122 (254) T ss_pred HHHHHHHHHCCEEEECCCC T ss_conf 1759999747899852444 No 411 >cd06271 PBP1_AglR_RafR_like Ligand-binding domain of DNA transcription repressors specific for raffinose (RafR) and alpha-glucosides (AglR) which are members of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressors specific for raffinose (RafR) and alpha-glucosides (AglR) which are members of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the represso Probab=45.75 E-value=19 Score=15.23 Aligned_cols=202 Identities=17% Similarity=0.218 Sum_probs=87.7 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHC-CCCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCC---CCCCCCCCC Q ss_conf 88898622999999999997400-17189999970543568886279998748947999997352105---866814598 Q gi|254780791|r 9 SLDHPEYSVSELSYHLKHIVESN-LSHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNK---IEFLPEEGI 84 (529) Q Consensus 9 ~~~~~~~svs~l~~~i~~~l~~~-~~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~---~~~~~~~G~ 84 (529) ..++|-| +++...|...+... | .+.|. .++- ....-..|.++.. ..+++++++.+.... +...-+.|. T Consensus 12 di~NPff--~~l~~gie~~~~~~Gy-~lll~--~~~~--~~~~~~~~~~~l~-~~~vDGiIl~~~~~~~~~~~~l~~~~i 83 (268) T cd06271 12 EEGDPFF--AEFLSGLSEALAEHGY-DLVLL--PVDP--DEDPLEVYRRLVE-SGLVDGVIISRTRPDDPRVALLLERGF 83 (268) T ss_pred CCCCHHH--HHHHHHHHHHHHHCCC-EEEEE--ECCC--CHHHHHHHHHHHH-HCCCCEEEEECCCCCHHHHHHHHHCCC T ss_conf 5568899--9999999999998499-89999--5899--9899999999998-489888999678898199999997799 Q ss_pred EEEEEEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHH----- Q ss_conf 899999966752884379999971016800799999999997654012261001631026528999847842589----- Q gi|254780791|r 85 EFLVIGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVI----- 159 (529) Q Consensus 85 ~v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~----- 159 (529) -+++.++.. .... +..+ ....... -.+.-+.|.+.|. ++||.|+++..... T Consensus 84 P~V~~~r~~-~~~~--~~~V----~~Dn~~~----~~~a~~~L~~~G~-------------~~I~~i~~~~~~~~~~~R~ 139 (268) T cd06271 84 PFVTHGRTE-LGDP--HPWV----DFDNEAA----AYQAVRRLIALGH-------------RRIALLNPPEDLTFAQHRR 139 (268) T ss_pred CEEEECCCC-CCCC--CCEE----EECHHHH----HHHHHHHHHHHCC-------------CEEEECCCCCCCHHHHHHH T ss_conf 999976656-8999--9689----9576999----9999999998598-------------7487548875460799999 Q ss_pred HHHHHHHHHC-CCE-EEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHH Q ss_conf 9999986305-975-89997210011110367999999997410035767775899951688844422007699999997 Q gi|254780791|r 160 RDILQRISCR-FPL-RVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIA 237 (529) Q Consensus 160 ~D~~~~~~~r-~p~-~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~ 237 (529) .-|...+++. -++ ...++ .|.....+..+++..+-... +.+|+|+- ..|.+++.=...++. . T Consensus 140 ~Gf~~al~~~gl~~~~~~~~----~~~~~~~~~~~~~~~ll~~~-----~~p~Ai~~------~nD~~A~g~l~~l~~-~ 203 (268) T cd06271 140 AGYRRALAEAGLPLDPALIV----SGDMTEEGGYAAAAELLALP-----DRPTAIVC------SSELMALGVLAALAE-A 203 (268) T ss_pred HHHHHHHHHCCCCCCCCEEE----ECCCCHHHHHHHHHHHHHCC-----CCCCEEEE------CCHHHHHHHHHHHHH-C T ss_conf 99999999839997711365----06875777999999999659-----99868997------777999999999998-2 Q ss_pred HCCCEE-EEEECCCCCCHHHHHH Q ss_conf 489048-8520577752589886 Q gi|254780791|r 238 NSSIPI-ISAIGHETDWTLADYA 259 (529) Q Consensus 238 ~~~iPV-isgIGHE~D~Tl~D~V 259 (529) ..+||= |+=||.. |..+++++ T Consensus 204 g~~vP~DisvigfD-~~~~~~~~ 225 (268) T cd06271 204 GLRPGRDVSVVGFD-DSPPLLFF 225 (268) T ss_pred CCCCCCCEEEEEEC-CCHHHHHC T ss_conf 99989997999978-82899827 No 412 >cd06312 PBP1_ABC_sugar_binding_like_4 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=45.71 E-value=19 Score=15.23 Aligned_cols=14 Identities=29% Similarity=0.615 Sum_probs=6.6 Q ss_pred HHHHHHHCCCEEEE Q ss_conf 99999748904885 Q gi|254780791|r 232 IVRAIANSSIPIIS 245 (529) Q Consensus 232 laraI~~~~iPVis 245 (529) +++...+..||||+ T Consensus 74 ~i~~a~~agIpVv~ 87 (271) T cd06312 74 AIKRAVAAGIPVIS 87 (271) T ss_pred HHHHHHHCCCEEEE T ss_conf 99999965986999 No 413 >PRK05395 3-dehydroquinate dehydratase; Provisional Probab=45.60 E-value=19 Score=15.21 Aligned_cols=71 Identities=21% Similarity=0.354 Sum_probs=51.1 Q ss_pred HHHHHHHHHHHHH---CCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHH Q ss_conf 2589999998630---5975899972100111103679999999974100357677758999516888444220076999 Q gi|254780791|r 156 GAVIRDILQRISC---RFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMI 232 (529) Q Consensus 156 ~a~~~D~~~~~~~---r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~l 232 (529) ..-+.|+...++. ...+++..|-+-..| +|+..|..+.. ..|.|||-=|| |..-+..+ T Consensus 25 ~~tl~~i~~~~~~~~~~~~i~~~~~QSN~Eg-----elI~~Iq~a~~--------~~dgiIiNpga------~THtS~al 85 (143) T PRK05395 25 TTTLADIEALLKEEAAELGVELEFFQSNHEG-----ELIDRIHEARD--------DVDGIIINPGA------YTHTSVAL 85 (143) T ss_pred CCCHHHHHHHHHHHHHHCCCEEEEEECCCHH-----HHHHHHHHHHH--------CCCEEEECCCH------HHHHHHHH T ss_conf 5689999999999999759759998518689-----99999999863--------37579973621------33244889 Q ss_pred HHHHHHCCCEEEE Q ss_conf 9999748904885 Q gi|254780791|r 233 VRAIANSSIPIIS 245 (529) Q Consensus 233 araI~~~~iPVis 245 (529) ..|+..+++|+|- T Consensus 86 ~DAl~~~~~P~iE 98 (143) T PRK05395 86 RDALAAVSIPVVE 98 (143) T ss_pred HHHHHHCCCCEEE T ss_conf 9899874998899 No 414 >COG2759 MIS1 Formyltetrahydrofolate synthetase [Nucleotide transport and metabolism] Probab=45.54 E-value=19 Score=15.21 Aligned_cols=153 Identities=24% Similarity=0.322 Sum_probs=74.3 Q ss_pred EEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHC-CCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHH Q ss_conf 9966752884379999971016800799999999997654012-261001631026528999847842589999998630 Q gi|254780791|r 90 GKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGL-FSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISC 168 (529) Q Consensus 90 g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGl-fd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~ 168 (529) |...=|+...+|.+.|-+=- --+=.|..-+..+|+||..-=+ ++..+ =|-+..-+++ +..=.+-+.|-++ T Consensus 181 ~~~~G~~RedgFdITvASEi-MAIlcLa~dlkDlk~Rl~~iviay~~~~---~PV~~~Dl~~-~GAma~lLkDAik---- 251 (554) T COG2759 181 GPENGVPREDGFDITVASEI-MAILCLATDLKDLKERLGRIVIAYDYDG---KPVTAGDLKV-EGAMAALLKDAIK---- 251 (554) T ss_pred CCCCCCCCCCCCEEEHHHHH-HHHHHHHHHHHHHHHHHHHEEEEEECCC---CCEEEECCCC-CHHHHHHHHHHCC---- T ss_conf 76678656778334429999-9999986107999999743588872379---9246331564-3679999986436---- Q ss_pred CCC-E--EEEEEECCCCCC------CHHHHHHH---H------------------HHHHHHHCCCCCCCCCCEEEEE--- Q ss_conf 597-5--899972100111------10367999---9------------------9999741003576777589995--- Q gi|254780791|r 169 RFP-L--RVIIFPVKVQGD------ECPKEIAN---A------------------ILQLNTLKEGRTCPRPDIIILA--- 215 (529) Q Consensus 169 r~p-~--~~~~~p~~vQG~------~a~~~i~~---a------------------i~~~~~~~~~~~~~~~D~iii~--- 215 (529) | + .+.=-|+.|-|- .-..||+. | .++|-..........+|++||+ T Consensus 252 --PNLvQTlEgtPa~VHgGPFANIAhGcnSiiAt~~AlkL~dy~VTEAGFgaDlGaEKF~dIK~r~~gl~PdavVlVATv 329 (554) T COG2759 252 --PNLVQTLEGTPAFVHGGPFANIAHGCNSIIATKTALKLADYVVTEAGFGADLGAEKFFDIKCRSSGLKPDAVVLVATV 329 (554) T ss_pred --CCCEEECCCCCEEEECCCCCHHHCCCHHHHHHHHHHHHCCEEEEECCCCCCCCHHHHCCEECCCCCCCCCEEEEEEEH T ss_conf --430321379834673576301222630477889998642758875255434353665011101469898769986136 Q ss_pred -----CCCCCHHHHHHCCHHHHH-------H---HHHHCCCEEEEEECC-CCCC Q ss_conf -----168884442200769999-------9---997489048852057-7752 Q gi|254780791|r 216 -----RGGGSIEDLWHFNDEMIV-------R---AIANSSIPIISAIGH-ETDW 253 (529) Q Consensus 216 -----RGGGS~eDL~~FN~e~la-------r---aI~~~~iPVisgIGH-E~D~ 253 (529) -||+-.+||-.=|-+.|. | .|-.+.+||+.||-+ .||+ T Consensus 330 RALK~hGG~~~~~l~~Env~avk~G~aNL~~Hi~Nikkfgvp~VVAIN~F~tDt 383 (554) T COG2759 330 RALKMHGGVPKEDLTEENVDAVKKGFANLLKHIENIKKFGVPVVVAINKFPTDT 383 (554) T ss_pred HHHHHCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCC T ss_conf 899872898857752120899998777899999888870998699961578888 No 415 >cd06320 PBP1_allose_binding Periplasmic allose-binding domain of bacterial transport systems that function as a primary receptor of active transport and chemotaxis. Periplasmic allose-binding domain of bacterial transport systems that function as a primary receptor of active transport and chemotaxis. The members of this group are belonging to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. Like other periplasmic receptors of the ABC-type transport systems, the allose-binding protein consists of two alpha/beta domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. Probab=45.33 E-value=19 Score=15.18 Aligned_cols=34 Identities=12% Similarity=0.121 Sum_probs=14.0 Q ss_pred EEEEEECC-CHHHHHHHHHHHH---HCCCEEEEEEECC Q ss_conf 89998478-4258999999863---0597589997210 Q gi|254780791|r 147 IIAVITSP-TGAVIRDILQRIS---CRFPLRVIIFPVK 180 (529) Q Consensus 147 ~i~vits~-~~a~~~D~~~~~~---~r~p~~~~~~p~~ 180 (529) |||||... +..=+..+.+-+. +..++.+.++.+. T Consensus 1 kIGviv~~~~npf~~~~~~g~e~~a~~~G~~v~v~~~~ 38 (275) T cd06320 1 KYGVVLKTLSNEFWRSLKEGYENEAKKLGVSVDIQAAP 38 (275) T ss_pred CEEEEECCCCCHHHHHHHHHHHHHHHHCCCEEEEEECC T ss_conf 98999489988899999999999999749989999689 No 416 >PRK11175 universal stress protein UspE; Provisional Probab=45.31 E-value=19 Score=15.18 Aligned_cols=56 Identities=14% Similarity=0.280 Sum_probs=34.9 Q ss_pred CCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCC-CCHHHH-HHCCHHHHHHHHHHCCCEEEEEECC Q ss_conf 01111036799999999741003576777589995168-884442-2007699999997489048852057 Q gi|254780791|r 181 VQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGG-GSIEDL-WHFNDEMIVRAIANSSIPIISAIGH 249 (529) Q Consensus 181 vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGG-GS~eDL-~~FN~e~laraI~~~~iPVisgIGH 249 (529) |-+.....+|++..... ++|+||++.=| ..+.++ ..=+++.|.| .||.||+.-=.+ T Consensus 90 ~~~~~p~~~Ii~~a~~~----------~~DLVV~g~~~~~~l~~~l~gs~~~~llR---~~~~PVllv~~~ 147 (304) T PRK11175 90 VWHNRPFEAIIQEVIKG----------KHDLVVKMTHQHDKLESVIFTPTDWHLLR---KCPCPVLMVKDH 147 (304) T ss_pred EECCCHHHHHHHHHHHC----------CCCEEEECCCCCCHHHHHHHCHHHHHHHH---CCCCCEEEEECC T ss_conf 98898799999999975----------99899987777623443300515899984---499958997256 No 417 >cd06282 PBP1_GntR_like_2 Ligand-binding domain of putative DNA transcription repressors highly similar to that of the repressor specific for gluconate (GntR) which is a member of the LacI-GalR family of bacterial transcription regulators. This group includes the ligand-binding domain of putative DNA transcription repressors highly similar to that of the repressor specific for gluconate (GntR) which is a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding Probab=45.05 E-value=19 Score=15.15 Aligned_cols=22 Identities=18% Similarity=0.220 Sum_probs=14.3 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHC Q ss_conf 888898622999999999997400 Q gi|254780791|r 8 NSLDHPEYSVSELSYHLKHIVESN 31 (529) Q Consensus 8 ~~~~~~~~svs~l~~~i~~~l~~~ 31 (529) |+.++|- -+++...|...++.. T Consensus 7 p~i~npf--~~~~~~gie~~~~~~ 28 (266) T cd06282 7 PSLANPV--FAECVQGIQEEARAA 28 (266) T ss_pred CCCCCHH--HHHHHHHHHHHHHHC T ss_conf 8986889--999999999999987 No 418 >PRK07765 para-aminobenzoate synthase component II; Provisional Probab=44.90 E-value=19 Score=15.13 Aligned_cols=16 Identities=19% Similarity=0.148 Sum_probs=10.7 Q ss_pred HHHHCCHHHHHHHHHH Q ss_conf 4220076999999974 Q gi|254780791|r 223 DLWHFNDEMIVRAIAN 238 (529) Q Consensus 223 DL~~FN~e~laraI~~ 238 (529) .+-.|=|++||-+|+. T Consensus 199 ~~~~~ie~~i~~~~~~ 214 (221) T PRK07765 199 TLVRRLENEVAAAVAP 214 (221) T ss_pred CHHHHHHHHHHHHHHH T ss_conf 7899999999999987 No 419 >cd00466 DHQase_II Dehydroquinase (DHQase), type II. Dehydroquinase (or 3-dehydroquinate dehydratase) catalyzes the reversible dehydration of 3-dehydroquinate to form 3-dehydroshikimate. This reaction is part of two metabolic pathways: the biosynthetic shikimate pathway and the catabolic quinate pathway. There are two types of DHQases, which are distinct from each other in amino acid sequence and three-dimensional structure. Type I enzymes usually catalyze the biosynthetic reaction using a syn elimination mechanism. In contrast, type II enzymes, found in the quinate pathway of fungi and in the shikimate pathway of many bacteria, are dodecameric enzymes that employ an anti elimination reaction mechanism. Probab=44.69 E-value=20 Score=15.11 Aligned_cols=69 Identities=25% Similarity=0.332 Sum_probs=50.5 Q ss_pred HHHHHHHHHHHCC---CEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHH Q ss_conf 8999999863059---7589997210011110367999999997410035767775899951688844422007699999 Q gi|254780791|r 158 VIRDILQRISCRF---PLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVR 234 (529) Q Consensus 158 ~~~D~~~~~~~r~---p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~lar 234 (529) -+.|+.+.++..+ .+++..|-+-.-| +|+..|..+.. .+|.|||-=|| |.+-+..+.. T Consensus 25 tl~~i~~~~~~~a~~~~~~l~~~QSN~Eg-----elI~~I~~a~~--------~~dgiIiNpga------~ThtSial~D 85 (140) T cd00466 25 TLADIEALLRELAAELGVEVEFFQSNHEG-----ELIDWIHEARD--------GADGIIINPGA------YTHTSIALRD 85 (140) T ss_pred CHHHHHHHHHHHHHHCCCEEEEEECCCHH-----HHHHHHHHHHC--------CCCEEEECCCH------HHHHHHHHHH T ss_conf 99999999999998749916898618699-----99999999747--------57589975612------2002377799 Q ss_pred HHHHCCCEEEE Q ss_conf 99748904885 Q gi|254780791|r 235 AIANSSIPIIS 245 (529) Q Consensus 235 aI~~~~iPVis 245 (529) |+..+++|+|- T Consensus 86 Al~~~~~P~vE 96 (140) T cd00466 86 ALAAVSIPVIE 96 (140) T ss_pred HHHHCCCCEEE T ss_conf 99864999899 No 420 >PRK12679 cbl transcriptional regulator Cbl; Reviewed Probab=44.60 E-value=20 Score=15.10 Aligned_cols=94 Identities=21% Similarity=0.226 Sum_probs=56.9 Q ss_pred CCCCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH Q ss_conf 26528999847842589999998630597-58999721001111036799999999741003576777589995168884 Q gi|254780791|r 143 FIPKIIAVITSPTGAVIRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI 221 (529) Q Consensus 143 ~~p~~i~vits~~~a~~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~ 221 (529) .-.-+||...+...-.+-+++...+.+|| +++.+. +| .+.+|...+.. ..+|+.|+.-.-... T Consensus 92 ~G~l~Ig~~~t~a~~~LP~~l~~F~~~~P~v~l~l~----~~--~~~~i~~~l~~----------g~~Dl~i~~~~~~~~ 155 (316) T PRK12679 92 SGVLTIATTHTQARYSLPEVIKAFRELFPEVRLELI----QG--TPQEIATLLQN----------GEADIGIASERLSND 155 (316) T ss_pred CCEEEEEEEECCHHCCCCHHHHHHHHHCCCCEEEEE----EC--CHHHHHHHHHC----------CCCCEEECCCCCCCC T ss_conf 764999985032010381999999986899538998----47--87999999987----------998852224677899 Q ss_pred HHHHHCCHHHHHHHHHHCCCEEEEEECCC----CCCHHHHHH Q ss_conf 44220076999999974890488520577----752589886 Q gi|254780791|r 222 EDLWHFNDEMIVRAIANSSIPIISAIGHE----TDWTLADYA 259 (529) Q Consensus 222 eDL~~FN~e~laraI~~~~iPVisgIGHE----~D~Tl~D~V 259 (529) .+|-. ..++..+.-+|..-||. ...|+.|+. T Consensus 156 p~l~~-------~p~~~~~~~lvvp~~HpLa~~~~vsl~~L~ 190 (316) T PRK12679 156 PQLVA-------FPWFRWHHSLLVPLDHPLTQITPLTLESIA 190 (316) T ss_pred CCCEE-------EEEEECCEEEEECCCCCCCCCCCCCHHHHC T ss_conf 76079-------980553413560189951248998999987 No 421 >cd06323 PBP1_ribose_binding Periplasmic sugar-binding domain of the thermophilic Thermoanaerobacter tengcongensis ribose binding protein (ttRBP) and its mesophilic homologs. Periplasmic sugar-binding domain of the thermophilic Thermoanaerobacter tengcongensis ribose binding protein (ttRBP) and its mesophilic homologs. Members of this group are belonging to the type I periplasmic binding protein superfamily, whose members are involved in chemotaxis, ATP-binding cassette transport, and intercellular communication in central nervous system. The thermophilic and mesophilic ribose-binding proteins are structurally very similar, but differ substantially in thermal stability. Probab=44.50 E-value=20 Score=15.08 Aligned_cols=12 Identities=33% Similarity=0.708 Sum_probs=5.1 Q ss_pred HHHHHHCCCEEE Q ss_conf 999974890488 Q gi|254780791|r 233 VRAIANSSIPII 244 (529) Q Consensus 233 araI~~~~iPVi 244 (529) ++......|||| T Consensus 73 i~~~~~~~iPvV 84 (268) T cd06323 73 VKAANEAGIPVF 84 (268) T ss_pred HHHHHHCCCCEE T ss_conf 999997699689 No 422 >cd06290 PBP1_LacI_like_9 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=44.44 E-value=20 Score=15.08 Aligned_cols=12 Identities=0% Similarity=0.025 Sum_probs=4.9 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999974 Q gi|254780791|r 18 SELSYHLKHIVE 29 (529) Q Consensus 18 s~l~~~i~~~l~ 29 (529) +++...|...+. T Consensus 15 ~~i~~gie~~~~ 26 (265) T cd06290 15 GRILKGMERGLN 26 (265) T ss_pred HHHHHHHHHHHH T ss_conf 999999999999 No 423 >cd01541 PBP1_AraR Ligand-binding domain of DNA transcription repressor specific for arabinose (AraR) which is a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor specific for arabinose (AraR) which is a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of AraR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which i Probab=44.30 E-value=20 Score=15.06 Aligned_cols=21 Identities=0% Similarity=0.146 Sum_probs=10.3 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 88889862299999999999740 Q gi|254780791|r 8 NSLDHPEYSVSELSYHLKHIVES 30 (529) Q Consensus 8 ~~~~~~~~svs~l~~~i~~~l~~ 30 (529) |+.++|-| +++...|...+.. T Consensus 7 p~l~n~ff--~~~~~gi~~~~~~ 27 (273) T cd01541 7 TYISDYIF--PSIIRGIESVLSE 27 (273) T ss_pred CCCCCHHH--HHHHHHHHHHHHH T ss_conf 88765799--9999999999998 No 424 >pfam00885 DMRL_synthase 6,7-dimethyl-8-ribityllumazine synthase. This family includes the beta chain of 6,7-dimethyl-8- ribityllumazine synthase EC:2.5.1.9, an enzyme involved in riboflavin biosynthesis. The family also includes a subfamily of distant archaebacterial proteins that may also have the same function. Probab=44.22 E-value=20 Score=15.05 Aligned_cols=90 Identities=20% Similarity=0.242 Sum_probs=54.3 Q ss_pred CCEEEEEECCCHHHH-----HHHHHHHHHC-CC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEE---- Q ss_conf 528999847842589-----9999986305-97-589997210011110367999999997410035767775899---- Q gi|254780791|r 145 PKIIAVITSPTGAVI-----RDILQRISCR-FP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIII---- 213 (529) Q Consensus 145 p~~i~vits~~~a~~-----~D~~~~~~~r-~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~ii---- 213 (529) ..|||||.|.=-.-+ ..+++.+... .+ .++.+|.++ | +-||--+++.+-+.. +||.+| T Consensus 3 ~~kI~IV~s~~n~~i~~~l~~~a~~~l~~~g~~~~~i~~~~VP--G---a~EiP~~~~~l~~~~------~~d~iIalG~ 71 (144) T pfam00885 3 GLRIAIVVARFNADITDRLLEGALDALKRHGVSEEDIDVVRVP--G---AFELPLAAKKLAKSG------RYDAVVALGA 71 (144) T ss_pred CCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCHHHEEEEECC--C---HHHHHHHHHHHHHCC------CCCEEEEEEE T ss_conf 9889999803759999999999999999869993105899879--6---889999999998648------9887999999 Q ss_pred EECCCCCHHHHHHCCHHHHHHHHH----HCCCEEEEEEC Q ss_conf 951688844422007699999997----48904885205 Q gi|254780791|r 214 LARGGGSIEDLWHFNDEMIVRAIA----NSSIPIISAIG 248 (529) Q Consensus 214 i~RGGGS~eDL~~FN~e~laraI~----~~~iPVisgIG 248 (529) |+||+=.--|+.+ ..+.+.+. +..+||+.||= T Consensus 72 VIkGeT~H~e~I~---~~v~~gl~~l~l~~~~PI~~gIL 107 (144) T pfam00885 72 VIRGGTYHFDYVA---QEVSRGLMQVSLETGVPVGFGVL 107 (144) T ss_pred EEECCCCHHHHHH---HHHHHHHHHHHHCCCCCEEEEEC T ss_conf 9837984899999---99999999998267980799862 No 425 >KOG0970 consensus Probab=43.98 E-value=20 Score=15.02 Aligned_cols=96 Identities=23% Similarity=0.422 Sum_probs=54.1 Q ss_pred CCCCCCCCCCCCCCCEEEEEECCCHHHH-HHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHH-HHHHHCCCCCCCCC Q ss_conf 2261001631026528999847842589-999998630597589997210011110367999999-99741003576777 Q gi|254780791|r 132 LFSDQHKNPIPFIPKIIAVITSPTGAVI-RDILQRISCRFPLRVIIFPVKVQGDECPKEIANAIL-QLNTLKEGRTCPRP 209 (529) Q Consensus 132 lfd~~~k~~lP~~p~~i~vits~~~a~~-~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~-~~~~~~~~~~~~~~ 209 (529) -|+..+--|-|.+|.+--++|-|.|-.+ -||- .+.++--..|.+. +.+..+..-+- .++. ..+ T Consensus 559 ~~~id~p~p~~~~~~~~c~l~rP~~~~fP~g~~-ela~~k~~~v~~~-------~sErALLs~fla~~~~-------~dp 623 (1429) T KOG0970 559 NFSIDKPAPAPAFPRHFCVLTRPPGTSFPLGLK-ELAKQKLSKVVLH-------NSERALLSHFLAMLNK-------EDP 623 (1429) T ss_pred CCCCCCCCCCCCCCCCCEEEECCCCCCCCCHHH-HHHHHCCCCEEEE-------CCHHHHHHHHHHHHHC-------CCC T ss_conf 543457899974567532675688776873589-9987306734883-------4889999999999634-------299 Q ss_pred CEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECC Q ss_conf 5899951688844422007699999997489048852057 Q gi|254780791|r 210 DIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGH 249 (529) Q Consensus 210 D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGH 249 (529) |+||= -|++.|.=.-|+-.+-+|+||-||-||- T Consensus 624 D~iVg-------Hn~~~~~l~VLl~R~~~~Kip~WS~IgR 656 (1429) T KOG0970 624 DVIVG-------HNIQGFYLDVLLSRLHALKIPNWSSIGR 656 (1429) T ss_pred CEEEE-------ECCCCCHHHHHHHHHHHHCCCCHHHHHH T ss_conf 87997-------1411020999999998734864434322 No 426 >COG3596 Predicted GTPase [General function prediction only] Probab=43.97 E-value=20 Score=15.02 Aligned_cols=135 Identities=13% Similarity=0.180 Sum_probs=74.8 Q ss_pred HHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHC---------------------CCE-EEEEEECCCCCC-- Q ss_conf 40122610016310265289998478425899999986305---------------------975-899972100111-- Q gi|254780791|r 129 EEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCR---------------------FPL-RVIIFPVKVQGD-- 184 (529) Q Consensus 129 ~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r---------------------~p~-~~~~~p~~vQG~-- 184 (529) .|+..+..++.++-.++.-+++|+..||||-.--+|.|=.. +.. .+.+|.++==|+ T Consensus 22 se~~l~~~~~~~l~~~~pvnvLi~G~TG~GKSSliNALF~~~~~~v~~vg~~t~~~~~~~~~~~~~~l~lwDtPG~gdg~ 101 (296) T COG3596 22 SERILEQLRMLQLTEKEPVNVLLMGATGAGKSSLINALFQGEVKEVSKVGVGTDITTRLRLSYDGENLVLWDTPGLGDGK 101 (296) T ss_pred HHHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHCCCCCEEEECCCCCCCCH T ss_conf 89999875121120368615897437777688999999702673421046688701567741266524884378855320 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCCE-EEEECCCCCHHHHHHCCHHHHHHHHHHCC--CEEEEEECCCCCCHHHHHHHC Q ss_conf 103679999999974100357677758-99951688844422007699999997489--048852057775258988641 Q gi|254780791|r 185 ECPKEIANAILQLNTLKEGRTCPRPDI-IILARGGGSIEDLWHFNDEMIVRAIANSS--IPIISAIGHETDWTLADYAAD 261 (529) Q Consensus 185 ~a~~~i~~ai~~~~~~~~~~~~~~~D~-iii~RGGGS~eDL~~FN~e~laraI~~~~--iPVisgIGHE~D~Tl~D~VAD 261 (529) .+-.+....+... + ++.|+ ++|.-||-..-- =||.+.|-|..+- .|+|.+|- +.|.+..-+==| T Consensus 102 ~~D~~~r~~~~d~--l------~~~DLvL~l~~~~draL~----~d~~f~~dVi~~~~~~~~i~~Vt-Q~D~a~p~~~W~ 168 (296) T COG3596 102 DKDAEHRQLYRDY--L------PKLDLVLWLIKADDRALG----TDEDFLRDVIILGLDKRVLFVVT-QADRAEPGREWD 168 (296) T ss_pred HHHHHHHHHHHHH--H------HHCCEEEEECCCCCCCCC----CCHHHHHHHHHHCCCCEEEEEEE-HHHHHCCCCCCC T ss_conf 2218999999988--6------322479996147770014----77999999998605760699973-665436554300 Q ss_pred CCCCCCHHHHHHHCC Q ss_conf 237772145676332 Q gi|254780791|r 262 LRAPTPTGAAEMAVP 276 (529) Q Consensus 262 ~Ra~TPTaAAElavp 276 (529) .-.--|+.|.+-.+. T Consensus 169 ~~~~~p~~a~~qfi~ 183 (296) T COG3596 169 SAGHQPSPAIKQFIE 183 (296) T ss_pred CCCCCCCHHHHHHHH T ss_conf 025999878999999 No 427 >PRK13057 putative lipid kinase; Reviewed Probab=43.95 E-value=20 Score=15.02 Aligned_cols=91 Identities=22% Similarity=0.329 Sum_probs=47.4 Q ss_pred HHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHH Q ss_conf 58999999863059758999721001111036799999999741003576777589995168884442200769999999 Q gi|254780791|r 157 AVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAI 236 (529) Q Consensus 157 a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI 236 (529) +++..+...+..+. +++.+..+.-.|+ +.++++++. ..+|+||++=|=|.+- +++..+ T Consensus 13 ~~~~~~~~~l~~~g-~~~~~~~T~~~g~--a~~~~~~~~-----------~~~d~vv~~GGDGTv~--------ev~~gl 70 (287) T PRK13057 13 AALGAARAALEAAG-LELVEPHPESPAD--LSEVIEAHA-----------DGVDLVIVGGGDGTLN--------AAAPAL 70 (287) T ss_pred HHHHHHHHHHHHCC-CEEEEEECCCHHH--HHHHHHHHH-----------CCCCEEEEECCHHHHH--------HHHHHH T ss_conf 31999999999879-9799995498789--999999986-----------6999899988589999--------999986 Q ss_pred HHCCCEE-EEEECCCCCCHHHHHHHCCCCC-CCHHHHHHH Q ss_conf 7489048-8520577752589886412377-721456763 Q gi|254780791|r 237 ANSSIPI-ISAIGHETDWTLADYAADLRAP-TPTGAAEMA 274 (529) Q Consensus 237 ~~~~iPV-isgIGHE~D~Tl~D~VAD~Ra~-TPTaAAEla 274 (529) +...+|+ |--.| |==||+-++.-| +|.+|++.+ T Consensus 71 ~~~~~~lgiiP~G-----TgNdfAr~lgip~d~~~a~~~i 105 (287) T PRK13057 71 VETGLPLGILPLG-----TANDLARTLGIPLDLEAAARVI 105 (287) T ss_pred CCCCCCEEEECCC-----CHHHHHHHCCCCCCHHHHHHHH T ss_conf 0579836997388-----6467899839998999999998 No 428 >COG1379 PHP family phosphoesterase with a Zn ribbon [General function prediction only] Probab=43.86 E-value=20 Score=15.01 Aligned_cols=16 Identities=31% Similarity=0.524 Sum_probs=9.8 Q ss_pred ECCCCCCHHHHHHHCC Q ss_conf 0577752589886412 Q gi|254780791|r 247 IGHETDWTLADYAADL 262 (529) Q Consensus 247 IGHE~D~Tl~D~VAD~ 262 (529) .|-.-|+..+|+|+-+ T Consensus 168 LGLSADtdmAD~I~el 183 (403) T COG1379 168 LGLSADTDMADMIEEL 183 (403) T ss_pred HCCCCCCHHHHHHHHH T ss_conf 4464571478999984 No 429 >pfam05681 Fumerase Fumarate hydratase (Fumerase). This family consists of several bacterial fumarate hydratase proteins FumA and FumB. Fumarase, or fumarate hydratase (EC 4.2.1.2), is a component of the citric acid cycle. In facultative anaerobes such as Escherichia coli, fumarase also engages in the reductive pathway from oxaloacetate to succinate during anaerobic growth. Three fumarases, FumA, FumB, and FumC, have been reported in E. coli. fumA and fumB genes are homologous and encode products of identical sizes which form thermolabile dimers of Mr 120,000. FumA and FumB are class I enzymes and are members of the iron-dependent hydrolases, which include aconitase and malate hydratase. The active FumA contains a 4Fe-4S centre, and it can be inactivated upon oxidation to give a 3Fe-4S centre. Probab=43.72 E-value=18 Score=15.42 Aligned_cols=49 Identities=29% Similarity=0.331 Sum_probs=29.1 Q ss_pred CCCEEEEECCCCCH--------------HHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHH Q ss_conf 77589995168884--------------44220076999999974890488520577752589 Q gi|254780791|r 208 RPDIIILARGGGSI--------------EDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLA 256 (529) Q Consensus 208 ~~D~iii~RGGGS~--------------eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~ 256 (529) .+++.++.-||||- |.+.-|=-+.|..+=.+---|+|.|||=--++--+ T Consensus 128 ~~~i~~~~KGgGsEN~s~l~mL~P~~g~egi~~fVl~~V~~aG~~~CPP~ivGVGIGGt~~~a 190 (271) T pfam05681 128 ELEITVAAKGGGSENKSKLAMLNPSDGIEGIKKFVLETVKEAGPNACPPGVVGVGIGGTSEKA 190 (271) T ss_pred EEEEEEECCCCCCCCHHHHEEECCCCCHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCHHHH T ss_conf 699999547787435424224265556678999999999971778999871688867869999 No 430 >PRK01713 ornithine carbamoyltransferase; Provisional Probab=43.72 E-value=20 Score=14.99 Aligned_cols=20 Identities=20% Similarity=0.189 Sum_probs=7.7 Q ss_pred CCCCCCCCCHHHHHHHHHHH Q ss_conf 88889862299999999999 Q gi|254780791|r 8 NSLDHPEYSVSELSYHLKHI 27 (529) Q Consensus 8 ~~~~~~~~svs~l~~~i~~~ 27 (529) .-.+...||-.||..-+... T Consensus 9 hlL~~~d~s~~ei~~il~~A 28 (334) T PRK01713 9 HLLSLVNHTEREIKYLLDLS 28 (334) T ss_pred CCCCCCCCCHHHHHHHHHHH T ss_conf 83870009999999999999 No 431 >cd00408 DHDPS-like Dihydrodipicolinate synthase family. A member of the class I aldolases, which use an active-site lysine which stablilzes a reaction intermediate via Schiff base formation, and have TIM beta/alpha barrel fold. The dihydrodipicolinate synthase family comprises several pyruvate-dependent class I aldolases that use the same catalytic step to catalyze different reactions in different pathways and includes such proteins as N-acetylneuraminate lyase, MosA protein, 5-keto-4-deoxy-glucarate dehydratase, trans-o-hydroxybenzylidenepyruvate hydratase-aldolase, trans-2'-carboxybenzalpyruvate hydratase-aldolase, and 2-keto-3-deoxy- gluconate aldolase. The family is also referred to as the N-acetylneuraminate lyase (NAL) family. Probab=43.57 E-value=20 Score=14.97 Aligned_cols=184 Identities=17% Similarity=0.216 Sum_probs=87.8 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCCCCCCCCE Q ss_conf 88888898622999999999997400171899999705435688862799987489479999973521058668145988 Q gi|254780791|r 6 QKNSLDHPEYSVSELSYHLKHIVESNLSHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIE 85 (529) Q Consensus 6 ~~~~~~~~~~svs~l~~~i~~~l~~~~~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~ 85 (529) ..|-.+.-..-...+-..++.+++.....++|-|-.+.+ +++|..|-..-++++. ... +.+ T Consensus 6 ~TPf~~dg~iD~~~~~~~i~~l~~~Gv~gi~v~G~tGE~--------~~Ls~~Er~~l~~~~~-----~~~------~~~ 66 (281) T cd00408 6 VTPFTADGEVDLDALRRLVEFLIEAGVDGLVVLGTTGEA--------PTLTDEERKEVIEAVV-----EAV------AGR 66 (281) T ss_pred ECCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECEECCCH--------HHCCHHHHHHHHHHHH-----HHC------CCC T ss_conf 688588969499999999999997699989968545243--------1389999999999999-----980------898 Q ss_pred EEEEEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHH Q ss_conf 99999966752884379999971016800799999999997654012261001631026528999847842589999998 Q gi|254780791|r 86 FLVIGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQR 165 (529) Q Consensus 86 v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~ 165 (529) +-+...++ ........ ++-+..++-|.-.. -...|.|.+ ++-..+-++.+. T Consensus 67 ~pvi~gv~-------------------~~s~~~~~-~~a~~a~~~Gad~i--~v~pP~y~~-------~~~~~i~~~~~~ 117 (281) T cd00408 67 VPVIAGVG-------------------ANSTREAI-ELARHAEEAGADGV--LVVPPYYNK-------PSQEGIVAHFKA 117 (281) T ss_pred CEEEEECC-------------------CCCHHHHH-HHHHHHHHCCCCEE--EECCCCCCC-------CCHHHHHHHHHH T ss_conf 50999578-------------------78899999-99999997599989--987997778-------999999999999 Q ss_pred HHHCCCEEEEEEECCC-CCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHC--CCE Q ss_conf 6305975899972100-1111036799999999741003576777589995168884442200769999999748--904 Q gi|254780791|r 166 ISCRFPLRVIIFPVKV-QGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIANS--SIP 242 (529) Q Consensus 166 ~~~r~p~~~~~~p~~v-QG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~--~iP 242 (529) +.+.-++.+++|..+. -|-.-..+.+..|... +.++-|==+.| |.+.+.+.+..+ .+- T Consensus 118 i~~~~~~pi~iYn~P~~~g~~l~~~~l~~L~~~-----------~nv~giK~s~~--------d~~~~~~~~~~~~~~~~ 178 (281) T cd00408 118 VADASDLPVILYNIPGRTGVDLSPETIARLAEH-----------PNIVGIKDSSG--------DLDRLTRLIALLGPDFA 178 (281) T ss_pred HHHCCCCCEEEEECCCCCCCCCCHHHHHHHHCC-----------CCEEEEECCCC--------CHHHHHHHHHHCCCCCE T ss_conf 985559977997277531677689999998489-----------99899984889--------99999999997599705 Q ss_pred EEEEECCCCCCHHHHHHH Q ss_conf 885205777525898864 Q gi|254780791|r 243 IISAIGHETDWTLADYAA 260 (529) Q Consensus 243 VisgIGHE~D~Tl~D~VA 260 (529) |++| + |..+.++.. T Consensus 179 v~~G--~--d~~~~~~l~ 192 (281) T cd00408 179 VLSG--D--DDLLLPALA 192 (281) T ss_pred EECC--C--HHHHHHHHH T ss_conf 6269--6--688999987 No 432 >KOG0042 consensus Probab=43.09 E-value=16 Score=15.93 Aligned_cols=136 Identities=20% Similarity=0.274 Sum_probs=60.6 Q ss_pred HHHCCCCCCCC-----CCCCCCCEEEEE----------ECCCHHHHHHHHHHHHHC--CCEEEE----------EEECCC Q ss_conf 40122610016-----310265289998----------478425899999986305--975899----------972100 Q gi|254780791|r 129 EEGLFSDQHKN-----PIPFIPKIIAVI----------TSPTGAVIRDILQRISCR--FPLRVI----------IFPVKV 181 (529) Q Consensus 129 ~eGlfd~~~k~-----~lP~~p~~i~vi----------ts~~~a~~~D~~~~~~~r--~p~~~~----------~~p~~v 181 (529) .-||||+...- -+|.--+.|+=- ..|+-.-|+.||+-++.= ++++|. +=|-.+ T Consensus 320 ~mGlldP~TsDgRViFflPWqg~TIaGTTD~pt~v~~~P~PtE~dIqfIL~ev~~yl~~~~~VrR~DVlsaWsGiRPLv~ 399 (680) T KOG0042 320 NMGLLDPKTSDGRVIFFLPWQGKTIAGTTDIPTSVTHSPTPTEDDIQFILKEVQHYLSFDVEVRREDVLSAWSGIRPLVR 399 (680) T ss_pred CCCCCCCCCCCCCEEEEECCCCCEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCC T ss_conf 56544678889708998425783663057899888789999778899999999986079865004536778627766566 Q ss_pred CCC--CHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCC-----EEEE-----EECC Q ss_conf 111--103679999999974100357677758999516888444220076999999974890-----4885-----2057 Q gi|254780791|r 182 QGD--ECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIANSSI-----PIIS-----AIGH 249 (529) Q Consensus 182 QG~--~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~i-----PVis-----gIGH 249 (529) +-+ ....+|++.=-. ...+--+|-|+ ||--.---..-||.|-.+|-.+.+ |-+| .=|| T Consensus 400 DP~~~~~t~sl~R~H~v--------~~~~~gLiTIa--GGKWTTyR~MAEeTVd~aI~~~~lk~~~~~cvT~~l~l~Ga~ 469 (680) T KOG0042 400 DPKKVKDTQSLVRNHFV--------FVSPSGLITIA--GGKWTTYRHMAEETVDAAIKAGDLKPARKPCVTKKLKLEGAE 469 (680) T ss_pred CCCCCCCCHHHHHHCEE--------EECCCCEEEEE--CCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEECCC T ss_conf 98754220002430168--------70699708972--684154798889999999982787777876524314884467 Q ss_pred C-CCCHHHHHHHCCCCCCCHHHHHHHCC Q ss_conf 7-75258988641237772145676332 Q gi|254780791|r 250 E-TDWTLADYAADLRAPTPTGAAEMAVP 276 (529) Q Consensus 250 E-~D~Tl~D~VAD~Ra~TPTaAAElavp 276 (529) . +-.+-+-+|-|.- -|+.+|+..+. T Consensus 470 ~wt~~~~~~LvQdyg--~e~~vA~hLs~ 495 (680) T KOG0042 470 GWTPNMYIRLVQDYG--MESDVAQHLSQ 495 (680) T ss_pred CCCHHHHHHHHHHHC--CCHHHHHHHHH T ss_conf 876788999999818--72899999987 No 433 >KOG3598 consensus Probab=43.06 E-value=21 Score=14.91 Aligned_cols=30 Identities=7% Similarity=0.048 Sum_probs=20.3 Q ss_pred CCCCCCCCEEEEEECCCHHHHHHHHHHHHH Q ss_conf 631026528999847842589999998630 Q gi|254780791|r 139 NPIPFIPKIIAVITSPTGAVIRDILQRISC 168 (529) Q Consensus 139 ~~lP~~p~~i~vits~~~a~~~D~~~~~~~ 168 (529) ..+--||.++-.-+...-.--+-|-++|+. T Consensus 1357 lqlicyph~~k~~e~g~n~qrq~ikriLqn 1386 (2220) T KOG3598 1357 LQLICYPHTHKLFEAGKNSQRQTIKRILQN 1386 (2220) T ss_pred HHHHHCCCCCHHHCCCCCHHHHHHHHHHHH T ss_conf 998853342100005653788899999987 No 434 >cd00452 KDPG_aldolase KDPG and KHG aldolase. This family belongs to the class I adolases whose reaction mechanism involves Schiff base formation between a substrate carbonyl and lysine residue in the active site. 2-keto-3-deoxy-6-phosphogluconate (KDPG) aldolase, is best known for its role in the Entner-Doudoroff pathway of bacteria, where it catalyzes the reversible cleavage of KDPG to pyruvate and glyceraldehyde-3-phosphate. 2-keto-4-hydroxyglutarate (KHG) aldolase, which has enzymatic specificity toward glyoxylate, forming KHG in the presence of pyruvate, and is capable of regulating glyoxylate levels in the glyoxylate bypass, an alternate pathway when bacteria are grown on acetate carbon sources. Probab=42.85 E-value=21 Score=14.89 Aligned_cols=83 Identities=13% Similarity=0.257 Sum_probs=38.0 Q ss_pred EEEEEECCCHHHHHHHHHHHHHC-CC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHH Q ss_conf 89998478425899999986305-97-58999721001111036799999999741003576777589995168884442 Q gi|254780791|r 147 IIAVITSPTGAVIRDILQRISCR-FP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDL 224 (529) Q Consensus 147 ~i~vits~~~a~~~D~~~~~~~r-~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL 224 (529) -|+|++..+..-...+...+.+- ++ ++| +....+|.+ .++.+..- -+|++ | |-|++-|. T Consensus 6 iiaVlr~~~~~~a~~~~~al~~~Gi~~iEi-----tl~t~~a~~-~i~~l~~~----------~~~~~-i--GaGTV~~~ 66 (190) T cd00452 6 LVAVLRGDDAEDALALAEALIEGGIRAIEI-----TLRTPGALE-AIRALRKE----------FPEAL-I--GAGTVLTP 66 (190) T ss_pred EEEEEECCCHHHHHHHHHHHHHCCCCEEEE-----ECCCCHHHH-HHHHHHHH----------CCCCE-E--EECCCCCH T ss_conf 799997799999999999999869988999-----678802999-99999986----------89808-9--65234779 Q ss_pred ---------------HHCCHHHHHHHHHHCCCEEEEEEC Q ss_conf ---------------200769999999748904885205 Q gi|254780791|r 225 ---------------WHFNDEMIVRAIANSSIPIISAIG 248 (529) Q Consensus 225 ---------------~~FN~e~laraI~~~~iPVisgIG 248 (529) .+.=+++|++.--+..+|+|-|+- T Consensus 67 ~~~~~a~~aGa~FivsP~~~~~v~~~a~~~~~~~iPGv~ 105 (190) T cd00452 67 EQADAAIAAGAQFIVSPGLDPEVVKAANRAGIPLLPGVA 105 (190) T ss_pred HHHHHHHHCCCCEEECCCCCHHHHHHHHHCCCCEECCCC T ss_conf 999999985998997377999999999982996657879 No 435 >TIGR03133 malonate_beta malonate decarboxylase, beta subunit. Members of this protein family are the beta subunit of malonate decarboxylase. Malonate decarboxylase may be a soluble enzyme, or linked to membrane subunits and active as a sodium pump. In the malonate decarboxylase complex, the beta subunit appears to act as a malonyl-CoA decarboxylase. Probab=42.67 E-value=21 Score=14.87 Aligned_cols=25 Identities=12% Similarity=0.239 Sum_probs=16.8 Q ss_pred CCEEEEECCCCCHHHHHHCCHHHHHHHHHH Q ss_conf 758999516888444220076999999974 Q gi|254780791|r 209 PDIIILARGGGSIEDLWHFNDEMIVRAIAN 238 (529) Q Consensus 209 ~D~iii~RGGGS~eDL~~FN~e~laraI~~ 238 (529) .|.||+.++ |. +..+-..|++++.- T Consensus 160 ~D~iIm~~~-~~----ifl~GP~vIk~~tG 184 (274) T TIGR03133 160 CSYLIMTEE-GR----LGLSGPEVIEQEAG 184 (274) T ss_pred CCEEEEECC-CE----EECCCCHHHHHCCC T ss_conf 738999789-57----95338067745358 No 436 >COG4607 CeuA ABC-type enterochelin transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=42.58 E-value=19 Score=15.25 Aligned_cols=13 Identities=8% Similarity=0.222 Sum_probs=5.1 Q ss_pred EEECCCHHHHHHH Q ss_conf 9847842589999 Q gi|254780791|r 150 VITSPTGAVIRDI 162 (529) Q Consensus 150 vits~~~a~~~D~ 162 (529) ||.+.++.+|.++ T Consensus 123 IiggR~ak~yd~l 135 (320) T COG4607 123 IIGGRAAKAYDKL 135 (320) T ss_pred EECCHHHHHHHHH T ss_conf 9774777779999 No 437 >PRK00856 pyrB aspartate carbamoyltransferase catalytic subunit; Provisional Probab=42.49 E-value=21 Score=14.85 Aligned_cols=27 Identities=26% Similarity=0.321 Sum_probs=16.4 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 988888888898622999999999997 Q gi|254780791|r 2 NPFSQKNSLDHPEYSVSELSYHLKHIV 28 (529) Q Consensus 2 ~~~~~~~~~~~~~~svs~l~~~i~~~l 28 (529) +|+.-+.-.+.-.||-.|+..-+.... T Consensus 2 n~~~~k~~lsi~Dls~~ei~~ll~~A~ 28 (304) T PRK00856 2 NPLKMKHLLSIEDLSREEIELLLDTAE 28 (304) T ss_pred CCCCCCCCCCHHHCCHHHHHHHHHHHH T ss_conf 988768808722199999999999999 No 438 >PRK13254 cytochrome c-type biogenesis protein CcmE; Reviewed Probab=42.41 E-value=21 Score=14.84 Aligned_cols=104 Identities=13% Similarity=0.257 Sum_probs=59.5 Q ss_pred CCCHHHHHHHHHHHHHHCC-CCEEEEEEECC--CCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEE Q ss_conf 6229999999999974001-71899999705--43568886279998748947999997352105866814598899999 Q gi|254780791|r 14 EYSVSELSYHLKHIVESNL-SHVCVRGEISG--YRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIG 90 (529) Q Consensus 14 ~~svs~l~~~i~~~l~~~~-~~~~v~gEis~--~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g 90 (529) -||.||+... . . .. ..+.|-|-|.. +. +..+..+.|.+.|.++.|.. .|.+. ++=..++|..|++.| T Consensus 37 FyTPsEi~~~--~-~--~~~~~iRvGGlV~~gSi~-r~~~~~~~F~itD~~~~i~V-~Y~G~---lPdLF~EgqgVVveG 106 (149) T PRK13254 37 FYTPSEVAEG--E-A--PAGKRFRLGGLVEKGSVQ-RGGDLTVRFVVTDGNATVPV-VYTGI---LPDLFREGQGVVAEG 106 (149) T ss_pred EECHHHHHCC--C-C--CCCCEEEEEEEEEECCEE-ECCCCEEEEEEECCCCEEEE-EECCC---CCCCCCCCCEEEEEE T ss_conf 9788999437--6-6--679679998999728478-56997599999779828999-98887---960013998499999 Q ss_pred EEEEECCCCEEEEEEEEEEECC-CCHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9667528843799999710168-00799999999997654012261 Q gi|254780791|r 91 KITTFPGSSKYQIIIESLIPSG-SGTLLTALEKRKKKLLEEGLFSD 135 (529) Q Consensus 91 ~~~~y~~~g~~ql~v~~i~~~g-~G~l~~~~e~lk~~L~~eGlfd~ 135 (529) +.. ..| -|.++.|-.-- +--| =-++.+.|++.|...+ T Consensus 107 ~~~---~~~--~F~A~~ilAKHde~Ym---P~e~~~~l~~~~~~~~ 144 (149) T PRK13254 107 RLQ---DGG--VFVADEVLAKHDENYM---PKEVADALKKAGVWKD 144 (149) T ss_pred EEC---CCC--EEEEEEEEECCCCCCC---CHHHHHHHHHCCCCCC T ss_conf 988---999--8999799986898879---9899999997499889 No 439 >cd04246 AAK_AK-DapG-like AAK_AK-DapG-like: Amino Acid Kinase Superfamily (AAK), AK-DapG-like; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the diaminopimelate-sensitive aspartokinase isoenzyme AKI (DapG), a monofunctional enzymes found in Bacilli (Bacillus subtilis 168), Clostridia, and Actinobacteria bacterial species, as well as, the catalytic AK domain of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis 168, the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, and related isoenzymes. In Bacillus subtilis, the regulation of the diaminopimelate-lysine biosynthetic pathway involves dual control by diaminopimelate and lysine, effected through separate diaminopimelate- and lysine-sensitive aspartokinase isoenzymes. The role of the AKI isoenzyme is most likely to provide a constant level of aspartyl-beta-phosphate for the biosynthesis of diaminopimelate for peptidoglycan synthesis and dipicolinate during sporulati Probab=42.12 E-value=21 Score=14.80 Aligned_cols=41 Identities=27% Similarity=0.424 Sum_probs=25.7 Q ss_pred EEEECCCCCHHHHHHCCHHHHHHHHHHC------CCEEEEEECCCCCCH Q ss_conf 9995168884442200769999999748------904885205777525 Q gi|254780791|r 212 IILARGGGSIEDLWHFNDEMIVRAIANS------SIPIISAIGHETDWT 254 (529) Q Consensus 212 iii~RGGGS~eDL~~FN~e~laraI~~~------~iPVisgIGHE~D~T 254 (529) ||+==||.|+.|--.|. .+|+-|.+. .+=|+||+|-.||.- T Consensus 2 iV~KFGGtSv~~~e~i~--~va~iI~~~~~~g~~viVVvSA~gg~Td~L 48 (239) T cd04246 2 IVQKFGGTSVADIERIK--RVAERIKKAVKKGYQVVVVVSAMGGTTDEL 48 (239) T ss_pred EEEEECCCCCCCHHHHH--HHHHHHHHHHHCCCEEEEEECCCCCCHHHH T ss_conf 89987810049999999--999999999867982999984898656779 No 440 >COG3481 Predicted HD-superfamily hydrolase [General function prediction only] Probab=41.98 E-value=14 Score=16.28 Aligned_cols=133 Identities=20% Similarity=0.228 Sum_probs=84.4 Q ss_pred CCCEEEEEE--ECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHH Q ss_conf 886279998--748947999997352105866814598899999966752884379999971016800799999999997 Q gi|254780791|r 49 SSGHAYFSL--KDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKK 126 (529) Q Consensus 49 ~sGH~Yf~l--kd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~ 126 (529) +.|--|.+| -|....|++-.|--.-.. .-..+.|+=|.+.|.+.+| +|.-|+.+..++....++=- T Consensus 17 ~~~~~~l~l~~~d~~gei~~~~wd~~~~~-~~~~~~~~Vv~~~g~~~~~--~~~~q~ki~~~r~~~~~~~~--------- 84 (287) T COG3481 17 SNGKDKLKLTLQDKTGEIEAKLWDALKND-EEAFKPGMVVHVEGVKEVY--RGRKQHKIIRIRLITDSDPV--------- 84 (287) T ss_pred HCCCHHHEEEECCCCCEECCCCCCCCCCC-HHHHCCCCEECCCCCEECC--CCCCHHEEEECCCCCCCCCC--------- T ss_conf 54871020011145543412102146342-7661767363144414301--36531014413556555766--------- Q ss_pred HHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCH-----HHHHHHHHHHHHHH Q ss_conf 654012261001631026528999847842589999998630597589997210011110-----36799999999741 Q gi|254780791|r 127 LLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCRFPLRVIIFPVKVQGDEC-----PKEIANAILQLNTL 200 (529) Q Consensus 127 L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a-----~~~i~~ai~~~~~~ 200 (529) .++.+-..+....++.+ |=|.-.++++.+++..-.++|-.+.-||+.+|--.+ +.-+..+++..... T Consensus 85 -~~~~ls~~e~~~~i~~~------i~~~e~~~i~~iv~~~~~K~~~~f~~~paa~~~Hh~~~~gLa~ht~~~~~l~~~~ 156 (287) T COG3481 85 -TKAPLSMDEMEEEITGY------IFSIENAVIQRIVRHDLEKIPEVFYKYPAATQNHHAFEGGLAEHTLTVLELYKRI 156 (287) T ss_pred -CCCCCCHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHCHHHHHCCCHHHHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf -66568989999998787------3112547899999987631347775050246666532212689999999999987 No 441 >TIGR02518 EutH_ACDH acetaldehyde dehydrogenase (acetylating); InterPro: IPR013357 Aldehyde dehydrogenases (1.2.1.3 from EC and 1.2.1.5 from EC) are enzymes which oxidize a wide variety of aliphatic and aromatic aldehydes using NAD(P) as a cofactor. In eukaryotes, at least twenty distinct aldehyde dehydrogenase families have been classified . Many aldehyde dehydrogenases have also been found in prokaryotic species. A glutamic acid and a cysteine residue have been implicated in the catalytic activity of mammalian aldehyde dehydrogenase, though these residues are not necessarily conserved in all the enzymes of this family. The acetaldehyde dehydrogenase family (1.2.1.10 from EC) of bacterial enzymes catalyse the formation of acetyl-CoA from acetaldehyde in the degradation of phenols, cresols and catechol . Proteins in this entry occur in bacterial species and are annotated as probable acetaldehyde dehydrogenases, though this function has not been experimentally verified.. Probab=41.88 E-value=21 Score=14.77 Aligned_cols=183 Identities=19% Similarity=0.232 Sum_probs=87.3 Q ss_pred HHHHHHHHHHHHH-CCC--CEE-----EEEEEC-------CCCCCCCCCEEEEEEECCCCEEEEEEECCCCCC------- Q ss_conf 9999999999740-017--189-----999970-------543568886279998748947999997352105------- Q gi|254780791|r 18 SELSYHLKHIVES-NLS--HVC-----VRGEIS-------GYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNK------- 75 (529) Q Consensus 18 s~l~~~i~~~l~~-~~~--~~~-----v~gEis-------~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~------- 75 (529) ..+-..||+.|+. .|. .++ |.-|=. .||. -| -||==+++.-++.=-+||.+-.. T Consensus 226 Anv~~Av~~Il~SKtFDnGtICASEQSvivE~~n~~aVv~Elk~---QG-~yFl~~ee~~~l~kf~lr~ngtMNp~iVGk 301 (528) T TIGR02518 226 ANVKKAVRKILDSKTFDNGTICASEQSVIVEKCNKDAVVEELKK---QG-GYFLTEEEAEKLGKFILRPNGTMNPQIVGK 301 (528) T ss_pred CCHHHHHHHHHCCCCCCCCEEEEECCCEEEECCCHHHHHHHHHH---CC-CCCCCHHHHHHHHHHHCCCCCCCCCCCCCC T ss_conf 78789998874025667761653135068615772689999985---24-223477888766432126878607632488 Q ss_pred ----C----CCC-CCCCCEEEEEEEEEEECCCCEEEEEEEEEEE--C--CCCHHHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf ----8----668-1459889999996675288437999997101--6--8007999999999976540122610016310 Q gi|254780791|r 76 ----I----EFL-PEEGIEFLVIGKITTFPGSSKYQIIIESLIP--S--GSGTLLTALEKRKKKLLEEGLFSDQHKNPIP 142 (529) Q Consensus 76 ----~----~~~-~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~--~--g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP 142 (529) + .|. .-.-.+|++--...|= .. |=+--+.+-+ + ..-++....|.-.+-|..||. .| T Consensus 302 ~~Q~iA~lAGl~gvp~~~kVL~~~~~~Vg-~~--~P~SrEKLttiLaFY~~e~~~~ace~~~elL~nEG~---GH----- 370 (528) T TIGR02518 302 SPQVIAKLAGLTGVPEDAKVLIAEENKVG-KK--NPYSREKLTTILAFYVEENWEEACELSIELLENEGA---GH----- 370 (528) T ss_pred CHHHHHHHCCCCCCCHHHEEECCCCCCCC-CC--CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC---CC----- T ss_conf 87899976166568213336401326436-78--886534455677788798799999999999963689---74----- Q ss_pred CCCCEEEEEECCCHHHHHHHHHHHHHCCCE-EEEE-EECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC Q ss_conf 265289998478425899999986305975-8999-72100111103679999999974100357677758999516888 Q gi|254780791|r 143 FIPKIIAVITSPTGAVIRDILQRISCRFPL-RVII-FPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGS 220 (529) Q Consensus 143 ~~p~~i~vits~~~a~~~D~~~~~~~r~p~-~~~~-~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS 220 (529) =-+|-|.+ .||++..+=+-|+ .+++ .|=..=|-+|...++=|+-.=. =+=||-| T Consensus 371 -----Tl~iH~~~----~~iv~eFAL~kPvSR~lvN~gGslGG~GAttnL~PAlTLGC---------------GavGGSS 426 (528) T TIGR02518 371 -----TLIIHSED----KDIVREFALKKPVSRMLVNTGGSLGGIGATTNLVPALTLGC---------------GAVGGSS 426 (528) T ss_pred -----EEEEECCC----HHHHHHHHHCCCCCEEEECCCCCCCCCCCCCCCCHHHHCCC---------------CCCCCCC T ss_conf -----47772487----47999974108841688838687864341001301322057---------------6536765 Q ss_pred HHH----HHHCCHHHHHHHHHHC Q ss_conf 444----2200769999999748 Q gi|254780791|r 221 IED----LWHFNDEMIVRAIANS 239 (529) Q Consensus 221 ~eD----L~~FN~e~laraI~~~ 239 (529) ..| +.-||--.||-=|-+| T Consensus 427 ~S~Ni~P~~L~N~rrvA~G~~el 449 (528) T TIGR02518 427 TSDNITPENLINIRRVAYGVREL 449 (528) T ss_pred CCCCCCCHHHHCCHHHHHCHHHH T ss_conf 64666705563312354121556 No 442 >COG2884 FtsE Predicted ATPase involved in cell division [Cell division and chromosome partitioning] Probab=41.88 E-value=21 Score=14.77 Aligned_cols=130 Identities=18% Similarity=0.286 Sum_probs=70.5 Q ss_pred EEEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHH--HHHHHHC-----CCC--CCCCCCCCCCCEEEEEECCCHHH Q ss_conf 99996675288437999997101680079999999999--7654012-----261--00163102652899984784258 Q gi|254780791|r 88 VIGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKK--KLLEEGL-----FSD--QHKNPIPFIPKIIAVITSPTGAV 158 (529) Q Consensus 88 ~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~--~L~~eGl-----fd~--~~k~~lP~~p~~i~vits~~~a~ 158 (529) +.-.||+--+.|.|.|++ -|+|.|.= .|-+|.- ..-..|- ||. -+++.+|.+=++||+| T Consensus 17 aL~~vs~~i~~Gef~fl~---GpSGAGKS--TllkLi~~~e~pt~G~i~~~~~dl~~l~~~~iP~LRR~IGvV------- 84 (223) T COG2884 17 ALRDVSFHIPKGEFVFLT---GPSGAGKS--TLLKLIYGEERPTRGKILVNGHDLSRLKGREIPFLRRQIGVV------- 84 (223) T ss_pred HHHCCEEEECCCEEEEEE---CCCCCCHH--HHHHHHHHHHCCCCCEEEECCEECCCCCCCCCCHHHHEEEEE------- T ss_conf 662815764686089986---78888789--999999841367876599888500325545360554412058------- Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCC---HHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHH Q ss_conf 999999863059758999721001111---03679999999974100357677758999516888444220076999999 Q gi|254780791|r 159 IRDILQRISCRFPLRVIIFPVKVQGDE---CPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRA 235 (529) Q Consensus 159 ~~D~~~~~~~r~p~~~~~~p~~vQG~~---a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~lara 235 (529) +||| +.|.++--++=+.||-.|.|.. ....+..+|+...-... ...+=.-=-||-.. --.+||| T Consensus 85 FQD~-rLL~~~tvyeNVA~pL~v~G~~~~~i~~rV~~~L~~VgL~~k------~~~lP~~LSGGEQQ------RvaIARA 151 (223) T COG2884 85 FQDF-RLLPDRTVYENVALPLRVIGKPPREIRRRVSEVLDLVGLKHK------ARALPSQLSGGEQQ------RVAIARA 151 (223) T ss_pred EEEC-CCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHH------HHCCCCCCCCHHHH------HHHHHHH T ss_conf 6412-234653066542120100178878999999999988062025------44396015823788------9999999 Q ss_pred HHHCCCE Q ss_conf 9748904 Q gi|254780791|r 236 IANSSIP 242 (529) Q Consensus 236 I~~~~iP 242 (529) |.+-|-= T Consensus 152 iV~~P~v 158 (223) T COG2884 152 IVNQPAV 158 (223) T ss_pred HCCCCCE T ss_conf 8169876 No 443 >cd01836 FeeA_FeeB_like SGNH_hydrolase subfamily, FeeA, FeeB and similar esterases/lipases. FeeA and FeeB are part of a biosynthetic gene cluster and may participate in the biosynthesis of long-chain N-acyltyrosines by providing saturated and unsaturated fatty acids, which it turn are loaded onto the acyl carrier protein FeeL. SGNH hydrolases are a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad found in other serine hydrolases. Probab=41.81 E-value=21 Score=14.77 Aligned_cols=74 Identities=16% Similarity=0.189 Sum_probs=42.6 Q ss_pred CCEEEEEECCCHHHHHH----------HHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEE Q ss_conf 52899984784258999----------99986305975899972100111103679999999974100357677758999 Q gi|254780791|r 145 PKIIAVITSPTGAVIRD----------ILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIIL 214 (529) Q Consensus 145 p~~i~vits~~~a~~~D----------~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii 214 (529) |.||.+|--..++|+++ +-+.+....+..+.+.-..+-|..+. ++.+.+.... ..++|+|+| T Consensus 2 p~ri~~lGDS~t~G~G~~~~~~g~~~~la~~l~~~~~~~v~~~n~~~~G~tt~-~~~~~l~~~~-------~~~pd~vii 73 (191) T cd01836 2 PLRLLVLGDSTAAGVGVETQDQALAGQLARGLAAITGRGVRWRLFAKTGATSA-DLLRQLAPLP-------ETRFDVAVI 73 (191) T ss_pred CEEEEEEEHHHHHEECCCCCCCCHHHHHHHHHHHHHCCCEEEEEEEECCCCHH-HHHHHHHHHH-------CCCCCEEEE T ss_conf 84999998346044078851218899999999897589848998430485168-7986504533-------459998999 Q ss_pred ECCCCCHHHHHHCCH Q ss_conf 516888444220076 Q gi|254780791|r 215 ARGGGSIEDLWHFND 229 (529) Q Consensus 215 ~RGGGS~eDL~~FN~ 229 (529) .=||- |++.+.. T Consensus 74 ~iG~N---D~~~~~~ 85 (191) T cd01836 74 SIGVN---DVTHLTS 85 (191) T ss_pred ECCCC---CCCCCCC T ss_conf 77785---2302688 No 444 >cd01483 E1_enzyme_family Superfamily of activating enzymes (E1) of the ubiquitin-like proteins. This family includes classical ubiquitin-activating enzymes E1, ubiquitin-like (ubl) activating enzymes and other mechanistic homologes, like MoeB, Thif1 and others. The common reaction mechanism catalyzed by MoeB, ThiF and the E1 enzymes begins with a nucleophilic attack of the C-terminal carboxylate of MoaD, ThiS and ubiquitin, respectively, on the alpha-phosphate of an ATP molecule bound at the active site of the activating enzymes, leading to the formation of a high-energy acyladenylate intermediate and subsequently to the formation of a thiocarboxylate at the C termini of MoaD and ThiS. Probab=41.81 E-value=21 Score=14.77 Aligned_cols=80 Identities=28% Similarity=0.301 Sum_probs=48.2 Q ss_pred HHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCE Q ss_conf 98630597-58999721001111036799999999741003576777589995168884442200769999999748904 Q gi|254780791|r 164 QRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIP 242 (529) Q Consensus 164 ~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iP 242 (529) ..+++.+| +++..|+..+..+.... .+ ..+|+||.+-. +.+. -..+.+.-....+| T Consensus 60 ~~l~~~np~v~i~~~~~~~~~~~~~~-------~~---------~~~dvvi~~~D--~~~~-----r~~l~~~~~~~~ip 116 (143) T cd01483 60 RRLNELNPGVNVTAVPEGISEDNLDD-------FL---------DGVDLVIDAID--NIAV-----RRALNRACKELGIP 116 (143) T ss_pred HHHHHHCCCCEEEEEECCCCHHHHHH-------HH---------CCCCEEEECCC--CHHH-----HHHHHHHHHHCCCC T ss_conf 99985689838999945689646999-------97---------59999998779--9999-----99999999986998 Q ss_pred EEEEECC--CCCCHHHHHHHCCCCCCCHHHHHH Q ss_conf 8852057--775258988641237772145676 Q gi|254780791|r 243 IISAIGH--ETDWTLADYAADLRAPTPTGAAEM 273 (529) Q Consensus 243 VisgIGH--E~D~Tl~D~VAD~Ra~TPTaAAEl 273 (529) .|++=++ +.+.+..+ ++++.+.|. T Consensus 117 ~i~~~~~g~~G~v~~~~-------~~~~~~~e~ 142 (143) T cd01483 117 VIDAGGLGLGGDIQVID-------IGSLSAAEA 142 (143) T ss_pred EEEEECCCCEEEEEEEC-------CCCCCHHHH T ss_conf 89963667688899996-------698545750 No 445 >cd04722 TIM_phosphate_binding TIM barrel proteins share a structurally conserved phosphate binding motif and in general share an eight beta/alpha closed barrel structure. Specific for this family is the conserved phosphate binding site at the edges of strands 7 and 8. The phosphate comes either from the substrate, as in the case of inosine monophosphate dehydrogenase (IMPDH), or from ribulose-5-phosphate 3-epimerase (RPE) or from cofactors, like FMN. Probab=41.76 E-value=21 Score=14.76 Aligned_cols=86 Identities=19% Similarity=0.221 Sum_probs=53.9 Q ss_pred HHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEC--CCCCHHHHHHCCHHHHHHH Q ss_conf 9999998630597-589997210011110367999999997410035767775899951--6888444220076999999 Q gi|254780791|r 159 IRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILAR--GGGSIEDLWHFNDEMIVRA 235 (529) Q Consensus 159 ~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~R--GGGS~eDL~~FN~e~lara 235 (529) ...+++.++.+++ ..++..+++.=-.. +.. +... ..|.+.+.. |||...+.+.-+.+.+.+. T Consensus 101 ~~~~~~~~~~~~~~~~vi~~~~~~~~~~-------~~~-a~~~-------g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (200) T cd04722 101 DLELIRELREAVPDVKVVVKLSPTGELA-------AAA-AEEA-------GVDEVGLGNGGGGGGGRDAVPIADLLLILA 165 (200) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCCHHH-------HHH-HHHC-------CCCEEEECCCCCCCCCCCCCCHHHHHHHHH T ss_conf 6899999998448964999689999999-------999-9980-------997999708746788876661168999999 Q ss_pred HHHCCCEEEEEECCCCCCHHHHHH Q ss_conf 974890488520577752589886 Q gi|254780791|r 236 IANSSIPIISAIGHETDWTLADYA 259 (529) Q Consensus 236 I~~~~iPVisgIGHE~D~Tl~D~V 259 (529) .-..++|||.+=|=-+-..+.+.. T Consensus 166 ~~~~~ipvi~~gGi~~~~~~~~~~ 189 (200) T cd04722 166 KRGSKVPVIAGGGINDPEDAAEAL 189 (200) T ss_pred HHHCCCCEEEECCCCCHHHHHHHH T ss_conf 985799989975879999999999 No 446 >COG4126 Hydantoin racemase [Amino acid transport and metabolism] Probab=41.71 E-value=21 Score=14.75 Aligned_cols=31 Identities=29% Similarity=0.618 Sum_probs=14.9 Q ss_pred CCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEE Q ss_conf 7758999516888444220076999999974890488520 Q gi|254780791|r 208 RPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAI 247 (529) Q Consensus 208 ~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgI 247 (529) ..|+||| |++...+| -+.|..+ +-||||-|| T Consensus 174 gAeaIiL--GCAGms~l----a~~Lq~~---~gvPVIDgv 204 (230) T COG4126 174 GAEAIIL--GCAGMSDL----ADQLQKA---FGVPVIDGV 204 (230) T ss_pred CCCEEEE--CCCCHHHH----HHHHHHH---HCCCCCCCH T ss_conf 8888998--58217779----9999997---199846656 No 447 >PRK13435 response regulator; Provisional Probab=41.37 E-value=22 Score=14.71 Aligned_cols=72 Identities=14% Similarity=0.184 Sum_probs=43.6 Q ss_pred CCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHH-H--HCCHH Q ss_conf 84258999999863059758999721001111036799999999741003576777589995168884442-2--00769 Q gi|254780791|r 154 PTGAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDL-W--HFNDE 230 (529) Q Consensus 154 ~~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL-~--~FN~e 230 (529) .+-..+.+.+..+-+..++++.- .+.+.-+|++.+... .||++++ |+ + ..|-. T Consensus 8 EDe~~i~~~l~~~L~~~G~~vv~---------~A~~~~eAl~~~~~~-------~PDlvll--------Di~LpdG~~G~ 63 (141) T PRK13435 8 EDEALIALELEKLLEEAGHQVVG---------IASTSEQALALGRRA-------QPDVALV--------DIRLADGPTGV 63 (141) T ss_pred CCCHHHHHHHHHHHHHCCCEEEE---------EECCHHHHHHHHHCC-------CCCEEEE--------CCCCCCCCCHH T ss_conf 99899999999999987997999---------759999999997659-------9989997--------88789999899 Q ss_pred HHHHHHH-HCCCEEEEEECC Q ss_conf 9999997-489048852057 Q gi|254780791|r 231 MIVRAIA-NSSIPIISAIGH 249 (529) Q Consensus 231 ~laraI~-~~~iPVisgIGH 249 (529) +++|.+. .+++|||.--|| T Consensus 64 e~~r~l~~~~~ipvI~lTa~ 83 (141) T PRK13435 64 EVARRLSIDGGVEVIFMTAN 83 (141) T ss_pred HHHHHHHHCCCCCEEEEECC T ss_conf 99999987599838999777 No 448 >PRK04863 mukB cell division protein MukB; Provisional Probab=41.34 E-value=22 Score=14.71 Aligned_cols=65 Identities=12% Similarity=0.241 Sum_probs=26.4 Q ss_pred HHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999988-------76555678888776777778875026889999999999999999999 Q gi|254780791|r 366 IQKNRQHILEKQQHIEQI-------VKQHLRYIFLEKKEKIAILHMLREQTKNRIFYLHTHIKKLITRIEFI 430 (529) Q Consensus 366 l~~~r~~L~~~q~~L~q~-------~~~~l~~~~~~~~~~~~~l~~l~~~~~~~l~~~~~~l~~l~~rL~~~ 430 (529) ....++.+.+..+.+..+ +..+.+....+....+.........+...+......++.+..++... T Consensus 1028 ~~~k~q~l~e~~~el~~lGv~~~~gaE~~a~~~rdeL~~~L~~~r~rr~~le~~l~~~e~e~~~l~k~~~k~ 1099 (1486) T PRK04863 1028 YDAKRELLAELKRELQDLGVPADSGAEERARARRDELHAQLSANRSRRNQLEKQLTFCEAEMDNLTKRLRKL 1099 (1486) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999975998990279897744788999999878899999999999999999999999998 No 449 >PRK13141 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=41.29 E-value=22 Score=14.70 Aligned_cols=35 Identities=23% Similarity=0.374 Sum_probs=13.7 Q ss_pred CCEEEEECCCCCHHHH----HHCCHHHHHHHHHHCCCEEE Q ss_conf 7589995168884442----20076999999974890488 Q gi|254780791|r 209 PDIIILARGGGSIEDL----WHFNDEMIVRAIANSSIPII 244 (529) Q Consensus 209 ~D~iii~RGGGS~eDL----~~FN~e~laraI~~~~iPVi 244 (529) +|.||+ =|+||...- -..+=.+..+......+||+ T Consensus 39 ~d~lIl-PGvGsf~~~~~~L~~~~l~~~i~~~~~~~~PiL 77 (204) T PRK13141 39 ADAVIL-PGVGAFPDAMQNLRERGLVEVIKEAVASGKPLL 77 (204) T ss_pred CCEEEE-CCCCCCHHHHHHHHHCCCHHHHHHHHHCCCCEE T ss_conf 898998-688750568776465392999999986599689 No 450 >PRK04820 rnpA ribonuclease P; Reviewed Probab=41.17 E-value=22 Score=14.69 Aligned_cols=24 Identities=17% Similarity=0.138 Sum_probs=16.9 Q ss_pred CCCCCCCEEEEECCCCCHHHHHHC Q ss_conf 576777589995168884442200 Q gi|254780791|r 204 RTCPRPDIIILARGGGSIEDLWHF 227 (529) Q Consensus 204 ~~~~~~D~iii~RGGGS~eDL~~F 227 (529) ...+.+|.|||+|+|=..-|-+.+ T Consensus 81 ~~Lp~~DiVVlaR~g~~~l~N~el 104 (145) T PRK04820 81 PELAGGDYVIVARSAAAKATNPQI 104 (145) T ss_pred HCCCCCCEEEEECCCHHHCCHHHH T ss_conf 138888889993677344051999 No 451 >cd01748 GATase1_IGP_Synthase Type 1 glutamine amidotransferase (GATase1) domain found in imidazole glycerol phosphate synthase (IGPS). Type 1 glutamine amidotransferase (GATase1) domain found in imidazole glycerol phosphate synthase (IGPS). IGPS incorporates ammonia derived from glutamine into N1-[(5'-phosphoribulosyl)-formimino]-5-aminoimidazole-4-carboxamide ribonucleotide (PRFAR) to form 5'-(5-aminoimidazole-4-carboxamide) ribonucleotide (AICAR) and imidazole glycerol phosphate (IGP). The glutamine amidotransferase domain generates the ammonia nucleophile which is channeled from the glutaminase active site to the PRFAR active site. IGPS belong to the triad family of amidotransferases having a conserved Cys-His-Glu catalytic triad in the glutaminase active site. Probab=41.11 E-value=22 Score=14.68 Aligned_cols=42 Identities=21% Similarity=0.321 Sum_probs=18.7 Q ss_pred CCCEEEEECCCCCH----HHHHHCCHHHHHHHHHHCCCEEE-EEECCC Q ss_conf 77589995168884----44220076999999974890488-520577 Q gi|254780791|r 208 RPDIIILARGGGSI----EDLWHFNDEMIVRAIANSSIPII-SAIGHE 250 (529) Q Consensus 208 ~~D~iii~RGGGS~----eDL~~FN~e~laraI~~~~iPVi-sgIGHE 250 (529) ++|.||| =|-||. +.|-..|=.+..+..++..+||+ .-.||. T Consensus 36 ~~d~iIl-PGvG~f~~~~~~l~~~~~~~~i~~~~~~~~PiLGIClG~Q 82 (198) T cd01748 36 SADKLIL-PGVGAFGDAMANLRERGLIEALKEAIASGKPFLGICLGMQ 82 (198) T ss_pred HCCEEEE-CCCCCCHHHHHHHHHCCHHHHHHHHHHCCCCEEEEEEEEE T ss_conf 0898998-7887403444441003819999999876996899981145 No 452 >PRK10014 DNA-binding transcriptional repressor MalI; Provisional Probab=41.11 E-value=22 Score=14.68 Aligned_cols=113 Identities=12% Similarity=0.206 Sum_probs=55.7 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCC-CCCCEEEEEECC-CHHHHHHHHHHHHHC---CCEEEEEEECCCCCCCHHHHHHHH Q ss_conf 999999976540122610016310-265289998478-425899999986305---975899972100111103679999 Q gi|254780791|r 119 ALEKRKKKLLEEGLFSDQHKNPIP-FIPKIIAVITSP-TGAVIRDILQRISCR---FPLRVIIFPVKVQGDECPKEIANA 193 (529) Q Consensus 119 ~~e~lk~~L~~eGlfd~~~k~~lP-~~p~~i~vits~-~~a~~~D~~~~~~~r---~p~~~~~~p~~vQG~~a~~~i~~a 193 (529) ..++..+-.++-||-.....+.+- .--..||+|... +..-+.+++.-+.+. .++.+.++++ ..-+.+.... T Consensus 37 Tr~rV~~aa~elgY~pn~~A~~l~~~~t~~Ig~v~~~~~~~f~~~~i~gi~~~~~~~g~~~~~~~~----~~~~~~~~~~ 112 (342) T PRK10014 37 TGERVNQAIEELGFVRNRQASALRGGQSGVIGLIVRDLSAPFYAELTAGLTEALEAQGRMVFLLQG----GKDGEQLAQR 112 (342) T ss_pred HHHHHHHHHHHHCCCCCHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEC----CCCHHHHHHH T ss_conf 999999999994899786788641387736886054457767799999999999981988999947----9998999999 Q ss_pred HHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCC Q ss_conf 9999741003576777589995168884442200769999999748904885205777 Q gi|254780791|r 194 ILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHET 251 (529) Q Consensus 194 i~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~ 251 (529) ++.+... .+|.||+.--.+ +++.+...+....+|+|. ||... T Consensus 113 ~~~l~~~-------~vdgiIi~~~~~--------~~~~~~~~~~~~~iP~V~-~~~~~ 154 (342) T PRK10014 113 FSTLLNQ-------GVDGVVIAGAAG--------SSDDLRRMAEEKAIPVVF-ASRAS 154 (342) T ss_pred HHHHHHC-------CCCEEEEECCCC--------CCHHHHHHHHHCCCCEEE-ECCCC T ss_conf 9987623-------988899957877--------888999999855997999-56877 No 453 >PRK00742 chemotaxis-specific methylesterase; Provisional Probab=41.07 E-value=22 Score=14.68 Aligned_cols=73 Identities=26% Similarity=0.404 Sum_probs=31.5 Q ss_pred CCHHHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHH--HCCHH Q ss_conf 842589999998630597-589997210011110367999999997410035767775899951688844422--00769 Q gi|254780791|r 154 PTGAVIRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLW--HFNDE 230 (529) Q Consensus 154 ~~~a~~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~--~FN~e 230 (529) .+++.++..++.+=+..| ++| ....--|+ +|++.+... +||+|++ |++ .-|-. T Consensus 9 DDs~~~R~~l~~~L~~~~~~eV--v~~A~nG~-------eAl~~~~~~-------~pDvVll--------Di~MP~mdGi 64 (345) T PRK00742 9 DDSAFMRRLLSEILNSDPDIEV--VGTARDGL-------EAVEKIKKL-------NPDVITL--------DVEMPVMDGI 64 (345) T ss_pred ECCHHHHHHHHHHHHHCCCEEE--EEEECCHH-------HHHHHHHHC-------CCCEEEE--------ECCCCCCCHH T ss_conf 2988999999999972899089--99989999-------999998860-------9999998--------3788999879 Q ss_pred HHHHHH-HHCCCE--EEEEECCC Q ss_conf 999999-748904--88520577 Q gi|254780791|r 231 MIVRAI-ANSSIP--IISAIGHE 250 (529) Q Consensus 231 ~laraI-~~~~iP--VisgIGHE 250 (529) ++.|.| ..+|+| ++|+..|+ T Consensus 65 e~l~~I~~~~p~pVimvSs~~~~ 87 (345) T PRK00742 65 EALRKIMALRPTPVVMVSSLTER 87 (345) T ss_pred HHHHHHHHHCCCCEEEEEECCCC T ss_conf 99999997589877999722555 No 454 >TIGR00960 3a0501s02 Type II (General) Secretory Pathway (IISP) Family protein; InterPro: IPR013505 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). FtsE is an ABC transporter ATP-binding protein. This protein and its permease partner FtsX, localize to the cell division site. In a number of species, the ftsEX gene pair is located next to ftsY, which encodes the signal recognition particle-docki ng protein.; GO: 0005524 ATP binding, 0051301 cell division. Probab=40.87 E-value=15 Score=16.06 Aligned_cols=100 Identities=17% Similarity=0.275 Sum_probs=58.0 Q ss_pred EEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHH--HHHHHHC-----CCCC--CCCCCCCCCCEEEEEECCCHHHH Q ss_conf 9996675288437999997101680079999999999--7654012-----2610--01631026528999847842589 Q gi|254780791|r 89 IGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKK--KLLEEGL-----FSDQ--HKNPIPFIPKIIAVITSPTGAVI 159 (529) Q Consensus 89 ~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~--~L~~eGl-----fd~~--~k~~lP~~p~~i~vits~~~a~~ 159 (529) .-.|+|--+.|.|-|.| =+||-|- --|-+|.- ..-..|= ||.+ +++.+|.+=+.||+| + T Consensus 19 L~~~~~~~~kGem~fL~---GHSGaGK--ST~lkLi~~~~~P~~G~i~~~G~d~~~L~~r~~P~LRr~iG~V-------F 86 (216) T TIGR00960 19 LDNVTFHIDKGEMVFLV---GHSGAGK--STLLKLILGIEKPTRGKIRFSGQDLTRLRGREVPFLRRHIGMV-------F 86 (216) T ss_pred HCCCEEEECCCCEEEEE---CCCCCCH--HHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCCCCEEEECCEE-------E T ss_conf 03864785385079985---6888607--8999999852289986078715421001577467300010426-------7 Q ss_pred HHHHHHHHHCCCEEEEEEE---CCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 9999986305975899972---10011110367999999997410 Q gi|254780791|r 160 RDILQRISCRFPLRVIIFP---VKVQGDECPKEIANAILQLNTLK 201 (529) Q Consensus 160 ~D~~~~~~~r~p~~~~~~p---~~vQG~~a~~~i~~ai~~~~~~~ 201 (529) ||+ +.|.+|.-++=..+| +=+.|+.+..-+..||+++--.+ T Consensus 87 QD~-~LL~drtv~dNVa~pL~iiG~~~~~~~~rv~~aL~~VGL~~ 130 (216) T TIGR00960 87 QDH-RLLSDRTVYDNVALPLRIIGVPGRDINERVSAALEKVGLKG 130 (216) T ss_pred CCH-HHHHHCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCC T ss_conf 011-55311655455243355228997426789999987306112 No 455 >PRK04147 N-acetylneuraminate lyase; Provisional Probab=40.74 E-value=22 Score=14.64 Aligned_cols=42 Identities=14% Similarity=0.115 Sum_probs=26.6 Q ss_pred CCHHHHHHHHHHHHHCCCEEEEEEECCC-CCCCHHHHHHHHHH Q ss_conf 8425899999986305975899972100-11110367999999 Q gi|254780791|r 154 PTGAVIRDILQRISCRFPLRVIIFPVKV-QGDECPKEIANAIL 195 (529) Q Consensus 154 ~~~a~~~D~~~~~~~r~p~~~~~~p~~v-QG~~a~~~i~~ai~ 195 (529) ++-..+.++.+.+.+.-++.+++|..+. =|-.-+.+.+..|. T Consensus 113 ~~~~~i~~~f~~va~a~~~pi~iYn~P~~tg~~~~~~~l~~L~ 155 (294) T PRK04147 113 FSFEEICDYYREIIDSADNPMIVYNIPALTGVNLSLDQFNELF 155 (294) T ss_pred CCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHH T ss_conf 9989999999999850499778875675416788999999995 No 456 >KOG3091 consensus Probab=40.60 E-value=22 Score=14.62 Aligned_cols=102 Identities=15% Similarity=0.122 Sum_probs=57.4 Q ss_pred HCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHCCCCCCCHHHHHHHCCC----HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 0076999999974890488520577752589886412377721456763323----467776699999888778998889 Q gi|254780791|r 226 HFNDEMIVRAIANSSIPIISAIGHETDWTLADYAADLRAPTPTGAAEMAVPV----KEHLQSSLINLEARLNNIIIRLIK 301 (529) Q Consensus 226 ~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~VAD~Ra~TPTaAAElavp~----~~EL~~~L~~l~~RL~~a~~~~l~ 301 (529) ..|++.+-++++.-|=+=| |--+=|-+- -..+|+ |.++|+ +.+|..++.-..+.....-. +++ T Consensus 292 v~~~~~q~e~~L~~kP~gV-------d~~~W~QA~--~dnp~s---~kliPVpvvGF~dL~~R~K~Q~q~~~~~r~-ri~ 358 (508) T KOG3091 292 VYLSVAQTEAYLETKPAGV-------DQRIWRQAM--KDNPPS---NKLIPVPVVGFEDLRQRLKVQDQEVKQHRI-RIN 358 (508) T ss_pred HCCCHHHHHHHHCCCCCCC-------CHHHHHHHH--HCCCCC---CCCCCEECCCHHHHHHHHHHHHHHHHHHHH-HHH T ss_conf 1267888778760899876-------989999986--438985---550331203559999998888988899999-999 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999855677653049788986788878886545689999 Q gi|254780791|r 302 YKINTLNSLLKALPNSDQTLSCSRYRLDRLPRELEHNLE 340 (529) Q Consensus 302 ~~~q~L~~l~r~L~~~~~~L~~~~QrLd~~~~rL~~~l~ 340 (529) .-.+++..+.+.-.....++++.+.|+.+++.|+-+-+. T Consensus 359 ~i~e~v~eLqk~~ad~~~KI~~~k~r~~~Ls~RiLRv~i 397 (508) T KOG3091 359 AIGERVTELQKHHADAVAKIEEAKNRHVELSHRILRVMI 397 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999899998763015788999999877779999999999 No 457 >cd03791 GT1_Glycogen_synthase_DULL1_like This family is most closely related to the GT1 family of glycosyltransferases. Glycogen synthase catalyzes the formation and elongation of the alpha-1,4-glucose backbone using ADP-glucose, the second and key step of glycogen biosynthesis. This family includes starch synthases of plants, such as DULL1 in Zea mays and glycogen synthases of various organisms. Probab=40.47 E-value=22 Score=14.61 Aligned_cols=81 Identities=25% Similarity=0.358 Sum_probs=41.0 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHH---------------HHCCHHHHHHHHHH Q ss_conf 999721001111036799999999741003576777589995168884442---------------20076999999974 Q gi|254780791|r 174 VIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDL---------------WHFNDEMIVRAIAN 238 (529) Q Consensus 174 ~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL---------------~~FN~e~laraI~~ 238 (529) +++|-...--+--..-|++|+..+-..+ .-+||++.|-...|+. -.| +|.+++.||+ T Consensus 298 l~~~vgRl~~qKG~dll~~a~~~~~~~~-------~~~vi~G~G~~~~e~~~~~l~~~~~~~~~~~~~~-~e~l~~~lya 369 (476) T cd03791 298 LFGFVGRLTEQKGIDLLLEALPELLELG-------GQLVILGSGDPEYEEALRELAARYPGRVAVLIGY-DEALAHLIYA 369 (476) T ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHCC-------CEEEEEECCCHHHHHHHHHHHHHCCCCEEEEEEC-CHHHHHHHHH T ss_conf 8999423520248999999999999639-------8899994697789999999997689959999806-8788999998 Q ss_pred ------------------------CCCEEEEEECCCCCCHHHHHHHCCCCCCC Q ss_conf ------------------------89048852057775258988641237772 Q gi|254780791|r 239 ------------------------SSIPIISAIGHETDWTLADYAADLRAPTP 267 (529) Q Consensus 239 ------------------------~~iPVisgIGHE~D~Tl~D~VAD~Ra~TP 267 (529) ..+||.+.+|= |.|-|-|....++ T Consensus 370 ~aD~~l~PS~~EP~Gl~qleAm~~GtppIa~~tGG-----L~dtV~d~~~~~~ 417 (476) T cd03791 370 GADFFLMPSRFEPCGLTQMYAMRYGTVPIVRATGG-----LADTVIDYNEDTG 417 (476) T ss_pred HCCEEECCCCCCCCCHHHHHHHHCCCCEEECCCCC-----CCCEEECCCCCCC T ss_conf 49997425545785489999986699759806999-----8651003666777 No 458 >pfam04012 PspA_IM30 PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homologue in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma(54) containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator. Probab=40.41 E-value=22 Score=14.60 Aligned_cols=19 Identities=0% Similarity=-0.046 Sum_probs=8.3 Q ss_pred HHHHHHHHHHHHHHHHHHC Q ss_conf 9999999999999988716 Q gi|254780791|r 434 KIKSCHTSVSITTRILQSF 452 (529) Q Consensus 434 ~L~~~~~rL~~l~~~L~sl 452 (529) .+++.+.+......+.++. T Consensus 166 ~fer~eekv~~~EA~aeA~ 184 (220) T pfam04012 166 SFERIEEKIEEREARADAA 184 (220) T ss_pred HHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999 No 459 >cd07020 Clp_protease_NfeD_1 Nodulation formation efficiency D (NfeD) is a membrane-bound ClpP-class protease. Nodulation formation efficiency D (NfeD; stomatin operon partner protein, STOPP; DUF107) is a member of membrane-anchored ClpP-class proteases. Currently, more than 300 NfeD homologs have been identified - all of which are bacterial or archaeal in origin. Majority of these genomes have been shown to possess operons containing a homologous NfeD/stomatin gene pair, causing NfeD to be previously named STOPP (stomatin operon partner protein). NfeD homologs can be divided into two groups: long and short forms. Long-form homologs have a putative ClpP-class serine protease domain while the short form homologs do not. Downstream from the ClpP-class domain is the so-called NfeD or DUF107 domain. N-terminal region of the NfeD homolog PH1510 (1510-N or PH1510-N) from Pyrococcus horikoshii has been shown to possess serine protease activity and has a Ser-Lys catalytic dyad, preferentially c Probab=40.29 E-value=22 Score=14.59 Aligned_cols=73 Identities=25% Similarity=0.295 Sum_probs=52.5 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCCEEEE--ECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEEC---CC--CCCHHHH Q ss_conf 103679999999974100357677758999--5168884442200769999999748904885205---77--7525898 Q gi|254780791|r 185 ECPKEIANAILQLNTLKEGRTCPRPDIIIL--ARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIG---HE--TDWTLAD 257 (529) Q Consensus 185 ~a~~~i~~ai~~~~~~~~~~~~~~~D~iii--~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIG---HE--~D~Tl~D 257 (529) .....+.++|+.+...+ .++||| ==-||.... -..+...|.++++|||+=|+ +. .==+++= T Consensus 13 ~~~~~l~r~l~~A~~~~-------a~avvl~idTpGG~v~~-----~~~I~~~i~~s~vpvi~~V~p~G~~A~SAGa~I~ 80 (187) T cd07020 13 ATADYLERAIDQAEEGG-------ADALIIELDTPGGLLDS-----TREIVQAILASPVPVVVYVYPSGARAASAGTYIL 80 (187) T ss_pred HHHHHHHHHHHHHHHCC-------CCEEEEEEECCCCHHHH-----HHHHHHHHHCCCCCEEEEECCCCCCHHHHHHHHH T ss_conf 89999999999998689-------98999998589607899-----9999999981899989998789760771899999 Q ss_pred HHHCCCCCCCHH Q ss_conf 864123777214 Q gi|254780791|r 258 YAADLRAPTPTG 269 (529) Q Consensus 258 ~VAD~Ra~TPTa 269 (529) +.||.-+-.|.. T Consensus 81 ~a~~~~~MaPgt 92 (187) T cd07020 81 LAAHIAAMAPGT 92 (187) T ss_pred HHCCHHHCCCCC T ss_conf 836075478987 No 460 >PRK05299 rpsB 30S ribosomal protein S2; Provisional Probab=40.18 E-value=22 Score=14.57 Aligned_cols=66 Identities=20% Similarity=0.395 Sum_probs=49.6 Q ss_pred CCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHCCCCCCCHHHHHHHCCCHHHHHHHHHH Q ss_conf 77589995168884442200769999999748904885205777525898864123777214567633234677766999 Q gi|254780791|r 208 RPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYAADLRAPTPTGAAEMAVPVKEHLQSSLIN 287 (529) Q Consensus 208 ~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~VAD~Ra~TPTaAAElavp~~~EL~~~L~~ 287 (529) .||+|||+ | +=+|...++.-..+.||||.=+---.|-+++||+ +|..++-...+.- T Consensus 157 lPd~viv~-------d--~~~e~~AV~EA~kl~IPvI~ivDTn~dP~~idyp---------------IP~NDDs~~SI~l 212 (255) T PRK05299 157 LPDALFVV-------D--PNKEHIAVKEARKLGIPVVAIVDTNCDPDGVDYP---------------IPGNDDAIRSIKL 212 (255) T ss_pred CCCEEEEE-------C--CCCCHHHHHHHHHCCCCEEEEECCCCCCCCCCEE---------------ECCCCCHHHHHHH T ss_conf 99879984-------7--8533799999997599888762489996658655---------------3377528999999 Q ss_pred HHHHHHHHHH Q ss_conf 9988877899 Q gi|254780791|r 288 LEARLNNIII 297 (529) Q Consensus 288 l~~RL~~a~~ 297 (529) +-.-+..++. T Consensus 213 i~~~l~~ai~ 222 (255) T PRK05299 213 YTSKLADAIL 222 (255) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 461 >PRK10161 transcriptional regulator PhoB; Provisional Probab=40.12 E-value=22 Score=14.57 Aligned_cols=106 Identities=21% Similarity=0.254 Sum_probs=59.4 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHH Q ss_conf 65289998478425899999986305975899972100111103679999999974100357677758999516888444 Q gi|254780791|r 144 IPKIIAVITSPTGAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIED 223 (529) Q Consensus 144 ~p~~i~vits~~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eD 223 (529) +.+||.||-- -..+.+.+...=+..++++... .+...|++.++.. .||+||+--+ T Consensus 1 M~~kILiVED--d~~l~~~l~~~L~~~g~~v~~a----------~~g~~a~~~l~~~-------~~DliilDi~------ 55 (229) T PRK10161 1 MARRILVVED--EAPIREMVCFVLEQNGFQPVEA----------EDYDSAVNQLNEP-------WPDLILLDWM------ 55 (229) T ss_pred CCCCEEEEEC--CHHHHHHHHHHHHHCCCEEEEE----------CCHHHHHHHHHCC-------CCCEEEEECC------ T ss_conf 9971999959--9999999999999779999998----------9999999998528-------9989999789------ Q ss_pred HHHCCHHHHHHHHHH------CCCEEEEEECCCCCCHH-HHHHHCCCCCCCHHHHHHH Q ss_conf 220076999999974------89048852057775258-9886412377721456763 Q gi|254780791|r 224 LWHFNDEMIVRAIAN------SSIPIISAIGHETDWTL-ADYAADLRAPTPTGAAEMA 274 (529) Q Consensus 224 L~~FN~e~laraI~~------~~iPVisgIGHE~D~Tl-~D~VAD~Ra~TPTaAAEla 274 (529) |=..|-.++|+.|-. +||=++||-+.+.|..- .+.=||==-.-|-.-.|++ T Consensus 56 lP~~dG~~~~~~ir~~~~~~~~PII~lta~~~~~d~v~gl~~GADDYl~KPf~~~EL~ 113 (229) T PRK10161 56 LPGGSGIQFIKHLKRESMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELV 113 (229) T ss_pred CCCCCCHHHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHCCCCEEEECCCCHHHHH T ss_conf 9887633587887750246897589995566657799998769876520899989999 No 462 >PRK05718 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional Probab=39.92 E-value=23 Score=14.54 Aligned_cols=85 Identities=14% Similarity=0.282 Sum_probs=45.0 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHC-CCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHH Q ss_conf 289998478425899999986305-9758999721001111036799999999741003576777589995168884442 Q gi|254780791|r 146 KIIAVITSPTGAVIRDILQRISCR-FPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDL 224 (529) Q Consensus 146 ~~i~vits~~~a~~~D~~~~~~~r-~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL 224 (529) +-|+||+..+..-...+...+.+- .+ ++.++..+.+|... |+.+.+. -+|++ | |=|++-|- T Consensus 16 ~ii~Vlr~~~~~~a~~i~~al~~gGi~----~iEiTl~tp~a~~~----I~~l~~~-------~p~~~-v--GaGTV~~~ 77 (212) T PRK05718 16 PVVPVIVINKLEDAVPLAKALVAGGLP----VLEVTLRTPAALEA----IRAIRKE-------VPEAL-I--GAGTVLNP 77 (212) T ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCC----EEEEECCCCHHHHH----HHHHHHH-------CCCCE-E--EEEEECCH T ss_conf 879999748999999999999987997----89995789619999----9999975-------89817-9--65331348 Q ss_pred ---------------HHCCHHHHHHHHHHCCCEEEEEEC Q ss_conf ---------------200769999999748904885205 Q gi|254780791|r 225 ---------------WHFNDEMIVRAIANSSIPIISAIG 248 (529) Q Consensus 225 ---------------~~FN~e~laraI~~~~iPVisgIG 248 (529) .+.=+++|+++-.+..+|+|-|+- T Consensus 78 e~~~~a~~aGA~FiVSP~~~~~v~~~a~~~~i~~iPGv~ 116 (212) T PRK05718 78 EQLAQAIEAGAQFIVSPGLTPPLLKACQDGPIPLIPGVN 116 (212) T ss_pred HHHHHHHHCCCCEEECCCCCHHHHHHHHHCCCCEECCCC T ss_conf 899999984998998489989999999981997657869 No 463 >PRK02842 light-independent protochlorophyllide reductase subunit N; Provisional Probab=39.81 E-value=23 Score=14.53 Aligned_cols=163 Identities=18% Similarity=0.293 Sum_probs=95.3 Q ss_pred EEEEEEECCCCCCCCCC-----------CCCCCEEEEEEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHH Q ss_conf 79999973521058668-----------1459889999996675288437999997101680079999999999765401 Q gi|254780791|r 63 RIDAIIWKGTLNKIEFL-----------PEEGIEFLVIGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEG 131 (529) Q Consensus 63 ~i~~~~~~~~~~~~~~~-----------~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eG 131 (529) -+.|++|=.+-..-.|- +++-+-|++.+ +||+.. -+.+.=|.+|+-+.+..++++..+|+. T Consensus 24 glt~i~WLhrki~DaFFLVvGsrTCahllQsA~GvMIFA-----ePRf~t-AileE~Dlag~ad~~~El~rvv~~l~r-- 95 (425) T PRK02842 24 GLTSVVWLHRKIKDSFFLVVGTRTCAHLLQSAAGVMIFA-----EPRFGT-AILEEGDLAGLADANEELDRVVEEIKR-- 95 (425) T ss_pred CCHHHHHHHHHCCCCEEEEEECCHHHHHHHHCCCCEEEC-----CCCCCE-EECCCCHHCCCCCCCHHHHHHHHHHHC-- T ss_conf 723479899864765499984502787874066771315-----761111-232620110467771889999999834-- Q ss_pred CCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHH----CCCEEEEEEE------CCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 2261001631026528999847842589999998630----5975899972------10011110367999999997410 Q gi|254780791|r 132 LFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISC----RFPLRVIIFP------VKVQGDECPKEIANAILQLNTLK 201 (529) Q Consensus 132 lfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~----r~p~~~~~~p------~~vQG~~a~~~i~~ai~~~~~~~ 201 (529) + .-|+.|-+|.|-++-+|..=+..++. ++.+.|..|+ +.-|||++. ..|+--. .. T Consensus 96 -------r---~~i~~lflvgsCpsEvIKlDL~~~A~rL~~~~~v~Vl~~sgsGiettFTQGED~~---LaAlvp~--~P 160 (425) T PRK02842 96 -------R---PNPSVIFLVGSCPSEVIKLDLEGLAERLSTEFGVPVLNYSGSGLETTFTQGEDAV---LAALVPF--CP 160 (425) T ss_pred -------C---CCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCHHHH---HHHHHHH--CC T ss_conf -------8---9982799973587666340288899986441187079722576014646307789---9998634--76 Q ss_pred CCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEE-----------EEECCCCCCHHH Q ss_conf 0357677758999516888444220076999999974890488-----------520577752589 Q gi|254780791|r 202 EGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPII-----------SAIGHETDWTLA 256 (529) Q Consensus 202 ~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVi-----------sgIGHE~D~Tl~ 256 (529) ........++++ ||+-|-+. -.+.+--...-|+++ -+||+.|-+.++ T Consensus 161 -~~~~~~~~Lllv----Gsl~D~Ve---dq~~~~~~~lGI~~v~~LP~~~~~~lP~vG~~T~v~l~ 218 (425) T PRK02842 161 -EAPADHPSLLLV----GSLADVVE---DQLRREFEKLGIGVVGFLPPRRSTELPAIGPGTVVALV 218 (425) T ss_pred -CCCCCCCCEEEE----ECCCCHHH---HHHHHHHHHCCCCCCEECCCCCCCCCCCCCCCCEEEEE T ss_conf -776788766998----35882788---99999999739962115789875546655999589995 No 464 >TIGR02038 protease_degS periplasmic serine peptidase DegS; InterPro: IPR011783 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This family consists of the periplasmic serine protease DegS (HhoB). They belong to MEROPS peptidase family S1, subfamily S1C (protease Do, clan PA(S)). They are a shorter paralogs of protease Do (HtrA, DegP) and DegQ (HhoA). They are found in Escherichia coli and several of the gammaproteobacteria. DegS contains a trypsin domain and a single copy of PDZ domain (in contrast to DegP with two copies). A critical role of this DegS is to sense stress by detecting misfolded proteins in the periplasm. DegS then cleaves the periplasmic domain of RseA, a transmembrane protein and inhibitor of sigmaE, activating the sigmaE-driven expression of periplasmic proteases/chaperones , , .; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis. Probab=39.78 E-value=23 Score=14.53 Aligned_cols=42 Identities=19% Similarity=0.358 Sum_probs=35.4 Q ss_pred CEEEEECCCCCEECCHHHCCCCCEEEEEEECEEE-EEEEEECC Q ss_conf 6199984898895777892999869999911099-99995057 Q gi|254780791|r 461 GYTSIQDTNNNFITQKRNLATKTRILINFFDGQA-NAIVINKA 502 (529) Q Consensus 461 GYaiv~~~~GkiI~s~~~l~~gd~i~i~l~DG~v-~a~V~~k~ 502 (529) |=-++++++|=+||+..=++.-|+|.+-|.||.+ .|.+.++- T Consensus 84 GSGVIms~~GYIlTN~Hvi~~ADqIvVALQdGr~f~A~lVG~D 126 (358) T TIGR02038 84 GSGVIMSKEGYILTNKHVIKKADQIVVALQDGRVFEAELVGSD 126 (358) T ss_pred CCEEEECCCCCEEECHHHHHCCCEEEEEECCCCEEEEEEECCC T ss_conf 6557972887451335754044839987138935776763577 No 465 >PRK13951 bifunctional shikimate kinase/3-dehydroquinate synthase; Provisional Probab=39.70 E-value=23 Score=14.52 Aligned_cols=96 Identities=21% Similarity=0.231 Sum_probs=51.2 Q ss_pred CCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH Q ss_conf 02652899984784258999999863059758999721001111036799999999741003576777589995168884 Q gi|254780791|r 142 PFIPKIIAVITSPTGAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI 221 (529) Q Consensus 142 P~~p~~i~vits~~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~ 221 (529) ...|....|||+..-+.+. ....|..-+++|. ||.+ +++-..-...+.+-+. ..++ .-+||+=|||.+ T Consensus 182 ~~~~~~~~v~t~~~v~~l~------~~~~~~~~iv~P~---GE~~-Ksl~~~~~~~~~l~~~-g~~R-~~~ivalGGGvv 249 (488) T PRK13951 182 KRVRNEELVFTTERVEKIY------GRYLPENRLLFPD---GEEV-KTLEHVSRAYYELVRM-DFPR-GKTIAGVGGGAL 249 (488) T ss_pred CCCCCCEEEEECHHHHHHH------HCCCCCCEEEECC---CCCC-CCHHHHHHHHHHHHHC-CCCC-CCEEEEECCHHH T ss_conf 4678745999361568877------5446765068189---8443-7899999999999976-9996-755999788288 Q ss_pred HHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHH Q ss_conf 44220076999999974890488520577752589886 Q gi|254780791|r 222 EDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYA 259 (529) Q Consensus 222 eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~V 259 (529) .||..|= -++|-=-||.|- .-|||+-+| T Consensus 250 gDl~GF~-----Ast~~RGi~~i~-----iPTTLLa~V 277 (488) T PRK13951 250 TDFTGFV-----ASTFKRGVGLSF-----YPTTLLAQV 277 (488) T ss_pred HHHHHHH-----HHHHHCCCCEEE-----CCCHHHHHH T ss_conf 7889999-----999964998787-----540599877 No 466 >COG1660 Predicted P-loop-containing kinase [General function prediction only] Probab=39.61 E-value=23 Score=14.50 Aligned_cols=104 Identities=16% Similarity=0.175 Sum_probs=63.8 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHH Q ss_conf 89998478425899999986305975899972100111103679999999974100357677758999516888444220 Q gi|254780791|r 147 IIAVITSPTGAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWH 226 (529) Q Consensus 147 ~i~vits~~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~ 226 (529) +|-|||.-+|||-.=-++++.+---|.+ ++-|.+++-++-.+.. .......++-|+|=+||+.++.+|-. T Consensus 2 ~lvIVTGlSGAGKsvAl~~lEDlGyycv---------DNLPp~Llp~~~~~~~-~~~~~~~kvAv~iDiRs~~~~~~l~~ 71 (286) T COG1660 2 RLVIVTGLSGAGKSVALRVLEDLGYYCV---------DNLPPQLLPKLADLML-TLESRITKVAVVIDVRSREFFGDLEE 71 (286) T ss_pred CEEEEECCCCCCHHHHHHHHHHCCEEEE---------CCCCHHHHHHHHHHHH-HCCCCCCEEEEEEECCCCHHHHHHHH T ss_conf 4699956888768899999974580453---------5899889999999976-30567752799996145031788999 Q ss_pred CCHHHHHHHHHHC-----CCEEEEEECCCCCCHHHHHHHCCCCCCCHH Q ss_conf 0769999999748-----904885205777525898864123777214 Q gi|254780791|r 227 FNDEMIVRAIANS-----SIPIISAIGHETDWTLADYAADLRAPTPTG 269 (529) Q Consensus 227 FN~e~laraI~~~-----~iPVisgIGHE~D~Tl~D~VAD~Ra~TPTa 269 (529) . ...+... .|=-+- -.|.||+-=-++.|=.-|=+ T Consensus 72 ~-----l~~l~~~~~~~~~iLFLe----A~~~~Lv~RY~etRR~HPL~ 110 (286) T COG1660 72 V-----LDELKDNGDIDPRVLFLE----ADDETLVRRYSETRRSHPLS 110 (286) T ss_pred H-----HHHHHHCCCCCCEEEEEE----CCHHHHHHHHHHHHHCCCCC T ss_conf 9-----999985579872499997----86457998875520028887 No 467 >pfam04204 HTS Homoserine O-succinyltransferase. Probab=39.55 E-value=23 Score=14.50 Aligned_cols=100 Identities=16% Similarity=0.224 Sum_probs=65.9 Q ss_pred HHHHHHHHCCCCCCCCCCC--CCCCEEEEEEC--CCHHHHHHHHHHHHHCCC--EEEE-EEECCCCCCCHHHHHHHH-HH Q ss_conf 9976540122610016310--26528999847--842589999998630597--5899-972100111103679999-99 Q gi|254780791|r 124 KKKLLEEGLFSDQHKNPIP--FIPKIIAVITS--PTGAVIRDILQRISCRFP--LRVI-IFPVKVQGDECPKEIANA-IL 195 (529) Q Consensus 124 k~~L~~eGlfd~~~k~~lP--~~p~~i~vits--~~~a~~~D~~~~~~~r~p--~~~~-~~p~~vQG~~a~~~i~~a-i~ 195 (529) ++.|.+||.|--...+..- -=|.+|||+-= ..-+-=..|+|.|.. -| +++. +++++-+..+.+.+-+.. -. T Consensus 11 ~~~L~~e~i~vm~~~~A~~QdIRpL~I~iLNLMP~K~~TE~Q~lRlL~n-tplqv~v~~~~~~sh~~knt~~~hl~~fY~ 89 (298) T pfam04204 11 VEILESENIFVMDEERAARQDIRPLRIGILNLMPKKIETETQLLRLLGN-TPLQVEVTLLRMSSHESKNTPAEHLESFYR 89 (298) T ss_pred HHHHHHCCCEEECHHHHHCCCCCCEEEEEEECCCCCHHHHHHHHHHHCC-CCCEEEEEEEEECCCCCCCCCHHHHHHHHH T ss_conf 7899877987746545400456431257770689838999999998578-874489999873444689987899999987 Q ss_pred HHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHH Q ss_conf 997410035767775899951688844422007699 Q gi|254780791|r 196 QLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEM 231 (529) Q Consensus 196 ~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~ 231 (529) .++.... ..+|.+||. |+.-..+.|.+.. T Consensus 90 ~f~~ik~----~~~DGlIIT---GAPvE~l~FeeV~ 118 (298) T pfam04204 90 TFDEIKD----EKFDGLIIT---GAPVEHLPFEDVD 118 (298) T ss_pred HHHHHHH----CCCCEEEEC---CCCCCCCCCCCCC T ss_conf 1999965----887569972---8987778855374 No 468 >cd02115 AAK Amino Acid Kinases (AAK) superfamily, catalytic domain; present in such enzymes like N-acetylglutamate kinase (NAGK), carbamate kinase (CK), aspartokinase (AK), glutamate-5-kinase (G5K) and UMP kinase (UMPK). The AAK superfamily includes kinases that phosphorylate a variety of amino acid substrates. These kinases catalyze the formation of phosphoric anhydrides, generally with a carboxylate, and use ATP as the source of the phosphoryl group; are involved in amino acid biosynthesis. Some of these kinases control the process via allosteric feed-back inhibition. Probab=39.48 E-value=16 Score=15.81 Aligned_cols=104 Identities=16% Similarity=0.159 Sum_probs=53.7 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCC----EEEEE---EECCCCCCCH-HHHHHHHHHHHHHHCCCCCCCCCCEEEEECC Q ss_conf 28999847842589999998630597----58999---7210011110-3679999999974100357677758999516 Q gi|254780791|r 146 KIIAVITSPTGAVIRDILQRISCRFP----LRVII---FPVKVQGDEC-PKEIANAILQLNTLKEGRTCPRPDIIILARG 217 (529) Q Consensus 146 ~~i~vits~~~a~~~D~~~~~~~r~p----~~~~~---~p~~vQG~~a-~~~i~~ai~~~~~~~~~~~~~~~D~iii~RG 217 (529) .++.||.|..+ .+.|.+..+..... ..+.- -.+.-.|+.- ...+.++|....-.. ......|..++-.. T Consensus 30 ~~~vvV~sa~~-~it~~l~~~~~~~~~~~~~~~t~~~~~~i~~~g~~~~~~~l~~~L~~~g~~a--~~l~~~~~~~~~~~ 106 (248) T cd02115 30 GRVVVVHGAGP-QITDELLAHGELLGYARGLRITDRETDALAAMGEGMSNLLIAAALEQHGIKA--VPLDLTQAGFASPN 106 (248) T ss_pred CCEEEEECCCH-HHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCE--EEEEEEECCEECCC T ss_conf 99899969844-5329999865354331367899899999999999999999999998569981--69865532331366 Q ss_pred CCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCC Q ss_conf 88844422007699999997489048852057775 Q gi|254780791|r 218 GGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETD 252 (529) Q Consensus 218 GGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D 252 (529) -|...+.+..|.+.+...+....+|||+|.|+-.+ T Consensus 107 ~~~~~~v~~~~~~~l~~~l~~~~ipVi~g~~~~~~ 141 (248) T cd02115 107 QGHVGKITKVSTDRLKSLLENGILPILSGFGGTDE 141 (248) T ss_pred CCCCCCCCCCCHHHHHHHHHCCCCEEECCCCCCCC T ss_conf 44432102268999998774167668136345556 No 469 >TIGR02093 P_ylase glycogen/starch/alpha-glucan phosphorylases; InterPro: IPR011833 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described . This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site . The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Proteins in this entry are glycosyltransferases with phosphorylase activities. Members use phosphate to break alpha 1,4 linkages between pairs of glucose residues at the end of long glucose polymers, releasing alpha-D-glucose 1-phosphate. The nomenclature convention is to preface the name according to the natural substrate, as in glycogen phosphorylase, starch phosphorylase, maltodextrin phosphorylase, etc. The main role of glycogen phosphorylase (GPase) is to provide phosphorylated glucose molecules (G-1-P) . GPase is a highly regulated allosteric enzyme. The net effect of the regulatory site allows the enzyme to operate at a variety of rates; the enzyme is not simply regulated as "on" or "off", but rather it can be thought of being set to operate at an ideal rate based on changing conditions at in the cell. The most important allosteric effector is the phosphate molecule covalently attached to Ser14. This switches GPase from the b (inactive) state to the a (active) state. Upon phosphorylation, GPase attains about 800f its Vmax. When the enzyme is not phosphorylated, GPase activity is practically non-existent at low AMP levels . There is some apparent controversy as to the structure of GPase. All sources agree that the enzyme is multimeric, but there is apparent controversy as to the enzyme being a tetramer or a dimer. Apparently, GPase (in the a form) forms tetramers in the crystal form. The consensus seems to be that 'regardless of the a or b form, GPase functions as a dimer in vivo . The GPase monomer is best described as consisting of two domains, an N-terminal domain and a C-terminal domain . The C-terminal domain is often referred to as the catalytic domain. It consists of a beta-sheet core surrounded by layers of helical segments . The vitamin cofactor pyridoxal phosphate (PLP) is covalently attached to the amino acid backbone. The N-terminal domain also consists of a central beta-sheet core and is surrounded by layers of helical segments. The N-terminal domain contains different allosteric effector sites to regulate the enzyme. Bacterial phosphorylases follow the same catalytic mechanisms as their plant and animal counterparts, but differ considerably in terms of their substrate specificity and regulation. The catalytic domains are highly conserved while the regulatory sites are only poorly conserved. For maltodextrin phosphorylase from Escherichia coli the physiological role of the enzyme in the utilisation of maltidextrins is known in detail; that of all the other bacterial phosphorylases is still unclear. Roles in regulatuon of endogenous glycogen metabolism in periods of starvation, and sporulation, stress response or quick adaptation to changing environments are possible .; GO: 0004645 phosphorylase activity, 0030170 pyridoxal phosphate binding, 0005975 carbohydrate metabolic process. Probab=39.35 E-value=6.5 Score=19.38 Aligned_cols=74 Identities=27% Similarity=0.185 Sum_probs=43.1 Q ss_pred CHHHHHHHHHHHHH--C-CCE-EEEEEE--CCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCC Q ss_conf 42589999998630--5-975-899972--10011110367999999997410035767775899951688844422007 Q gi|254780791|r 155 TGAVIRDILQRISC--R-FPL-RVIIFP--VKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFN 228 (529) Q Consensus 155 ~~a~~~D~~~~~~~--r-~p~-~~~~~p--~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN 228 (529) .+|.+|||++.... + .+- .+.-+| ++||=+++++.| ||-.+.++= .| -+++-+=. T Consensus 285 vSAsLQDIir~~~~~~~~~~~~~l~~f~~k~aiQLNDTHPal--aIPELMRLL-------~D----------e~~~~WD~ 345 (822) T TIGR02093 285 VSASLQDIIRRHKEKERTHGDKSLEDFPKKVAIQLNDTHPAL--AIPELMRLL-------ID----------EEGLDWDE 345 (822) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCHHHCCCEEEEEECCCCCCC--HHHHHHHHH-------HH----------HCCCCHHH T ss_conf 775699999988750256888750021082888736898731--146676777-------64----------30588877 Q ss_pred HHHHHHHHHHCCCEEEEEECCCCCCHHHHH Q ss_conf 699999997489048852057775258988 Q gi|254780791|r 229 DEMIVRAIANSSIPIISAIGHETDWTLADY 258 (529) Q Consensus 229 ~e~laraI~~~~iPVisgIGHE~D~Tl~D~ 258 (529) --+++..+++. |..||+=. T Consensus 346 AW~It~~tfaY-----------TNHT~LpE 364 (822) T TIGR02093 346 AWDITTKTFAY-----------TNHTLLPE 364 (822) T ss_pred HHHHHHHCCCC-----------CCCHHHHH T ss_conf 87760000165-----------46101232 No 470 >pfam06705 SF-assemblin SF-assemblin/beta giardin. This family consists of several eukaryotic SF-assemblin and related beta giardin proteins. During mitosis the SF-assemblin-based cytoskeleton is reorganized; it divides in prophase and is reduced to two dot-like structures at each spindle pole in metaphase. During anaphase, the two dots present at each pole are connected again. In telophase there is an asymmetrical outgrowth of new fibres. It has been suggested that SF-assemblin is involved in re-establishing the microtubular root system characteristic of interphase cells after mitosis. Probab=39.14 E-value=23 Score=14.45 Aligned_cols=27 Identities=19% Similarity=0.425 Sum_probs=10.3 Q ss_pred HHHHHHHHHHHHHC-CHHHHHCCCEEEE Q ss_conf 99999999988716-9677731461999 Q gi|254780791|r 439 HTSVSITTRILQSF-AYKNTLKRGYTSI 465 (529) Q Consensus 439 ~~rL~~l~~~L~sl-sP~~~L~RGYaiv 465 (529) ..|-..-....+++ .|-++|..|-.+| T Consensus 218 ~~Re~aDDdIVqAln~Ytk~LQd~Lr~v 245 (247) T pfam06705 218 EERIAEDDEIVQAVNDYTKALQEGLKLV 245 (247) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9988645789999999999998998661 No 471 >pfam09726 Macoilin Transmembrane protein. This entry is a highly conserved protein present in eukaryotes. Probab=39.13 E-value=23 Score=14.45 Aligned_cols=14 Identities=29% Similarity=0.520 Sum_probs=6.5 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 77669999988877 Q gi|254780791|r 281 LQSSLINLEARLNN 294 (529) Q Consensus 281 L~~~L~~l~~RL~~ 294 (529) |.+.|..+.+-|.. T Consensus 411 LE~dlKkLraELq~ 424 (680) T pfam09726 411 LEQDIKKLQAELQQ 424 (680) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999999999998 No 472 >TIGR03597 GTPase_YqeH ribosome biogenesis GTPase YqeH. This family describes YqeH, a member of a larger family of GTPases involved in ribosome biogenesis. Like YqlF, it shows a cyclical permutation relative to GTPases EngA (in which the GTPase domain is duplicated), Era, and others. Members of this protein family are found in a relatively small number of bacterial species, including Bacillus subtilis but not Escherichia coli. Probab=38.93 E-value=23 Score=14.42 Aligned_cols=89 Identities=27% Similarity=0.288 Sum_probs=58.1 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHH Q ss_conf 99999999997654012261001631026528999847842589999998630597-58999721001111036799999 Q gi|254780791|r 116 LLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAI 194 (529) Q Consensus 116 l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai 194 (529) +..-.+-+++.+.++|+ .|..|.+|++.+|-++.++++.+.+-+. -++++..++==|+. +++++| T Consensus 109 ~~~i~~wv~~~~~~~gl-----------~~~~V~lvSa~~g~gi~~l~~~i~~~~~~~dvyvvG~tNvGKS---TliN~L 174 (360) T TIGR03597 109 LSKIKEWMKKRAKELGL-----------KPVDIILVSAKKGNGIDELLDKIKKARNKKDVYVVGVTNVGKS---SLINKL 174 (360) T ss_pred HHHHHHHHHHHHHHCCC-----------CCCCEEEEECCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCHH---HHHHHH T ss_conf 79999999999998599-----------8366899968889899999999998716995899916866589---999999 Q ss_pred HHHHHHCCCCCCCCCCEEEEECCCCCHHHHH Q ss_conf 9997410035767775899951688844422 Q gi|254780791|r 195 LQLNTLKEGRTCPRPDIIILARGGGSIEDLW 225 (529) Q Consensus 195 ~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~ 225 (529) -..+.. ..+.+.+.+==|-.-|+. T Consensus 175 l~~~~~-------~~~~~T~S~~PGTTL~~i 198 (360) T TIGR03597 175 LKQNNG-------DKDVITTSPFPGTTLDLI 198 (360) T ss_pred HHHCCC-------CCCCEEECCCCCCCCCCE T ss_conf 876167-------777236458998533307 No 473 >cd06273 PBP1_GntR_like_1 This group includes the ligand-binding domain of putative DNA transcription repressors which are highly similar to that of the repressor specific for gluconate (GntR), a member of the LacI-GalR family of bacterial transcription regulators. This group includes the ligand-binding domain of putative DNA transcription repressors which are highly similar to that of the repressor specific for gluconate (GntR), a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational Probab=38.52 E-value=24 Score=14.37 Aligned_cols=77 Identities=13% Similarity=0.236 Sum_probs=32.0 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHC-CCCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCC---CCCCCCCC Q ss_conf 888898622999999999997400-17189999970543568886279998748947999997352105---86681459 Q gi|254780791|r 8 NSLDHPEYSVSELSYHLKHIVESN-LSHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNK---IEFLPEEG 83 (529) Q Consensus 8 ~~~~~~~~svs~l~~~i~~~l~~~-~~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~---~~~~~~~G 83 (529) |+.++|- .+++...|...++.. |.-+. +. |+... ..--.+.=.|. .-+++++++-+.... +...-+.| T Consensus 7 p~i~n~f--f~~~~~~ie~~~~~~Gy~~~i--~~-s~~d~-~~e~~~i~~l~--~~~vdGiIi~~~~~~~~~~~~l~~~~ 78 (268) T cd06273 7 PTLDNAI--FARVIQAFQETLAAHGYTLLV--AS-SGYDL-DREYAQARKLL--ERGVDGLALIGLDHSPALLDLLARRG 78 (268) T ss_pred CCCCCHH--HHHHHHHHHHHHHHCCCEEEE--EE-CCCCH-HHHHHHHHHHH--HCCCCEEEECCCCCCHHHHHHHHHCC T ss_conf 7988989--999999999999985999999--95-89998-99999999999--65999999918999989999999759 Q ss_pred CEEEEEEEE Q ss_conf 889999996 Q gi|254780791|r 84 IEFLVIGKI 92 (529) Q Consensus 84 ~~v~~~g~~ 92 (529) .-|++.++. T Consensus 79 iP~V~~~~~ 87 (268) T cd06273 79 VPYVATWNY 87 (268) T ss_pred CCEEEECCC T ss_conf 989998465 No 474 >pfam02765 Telo_bind Telomeric single stranded DNA binding POT1/CDC13. This domain binds single stranded telomeric DNA and adopts an OB fold. It includes the proteins POT1 and CDC13 which have been shown to regulate telomere length, replication and capping. Probab=38.46 E-value=24 Score=14.37 Aligned_cols=71 Identities=13% Similarity=0.111 Sum_probs=48.6 Q ss_pred CEEEEEEECCCCCC-C---CCCEEEEEEECCC------CEEEEEEECCCCCCCCCCCCCCCEEEEE-EEEEEECCCCEEE Q ss_conf 18999997054356-8---8862799987489------4799999735210586681459889999-9966752884379 Q gi|254780791|r 34 HVCVRGEISGYRGI-H---SSGHAYFSLKDNH------SRIDAIIWKGTLNKIEFLPEEGIEFLVI-GKITTFPGSSKYQ 102 (529) Q Consensus 34 ~~~v~gEis~~~~~-~---~sGH~Yf~lkd~~------a~i~~~~~~~~~~~~~~~~~~G~~v~~~-g~~~~y~~~g~~q 102 (529) .+-|.|-|..++.. + ++.-+.|+++|+. ..+.|.+|......++.--..||=|.+. .++..|. |..| T Consensus 14 ~vnv~gVV~~~~~p~~s~g~~~~~t~~i~d~s~~~~~~~~l~v~~F~~~~e~LP~v~~~GDIi~l~r~~v~~~~--~~~~ 91 (141) T pfam02765 14 VVNVIGVVVDAKFPSKSRGSDYDFTLTIVDEDNKGDSNDGLTVKLFSENFEDLPNVQSAGDIIRLHRVKVKLYN--GKPQ 91 (141) T ss_pred EEEEEEEEEECCCCEECCCCCEEEEEEEECCCCCCCCCCCCEEEEECCCHHHCCCCCCCCCEEEEEEEEEEEEC--CEEE T ss_conf 89999999563575345998579999999078887789963999989996789986777989999778999998--9678 Q ss_pred EEEE Q ss_conf 9999 Q gi|254780791|r 103 IIIE 106 (529) Q Consensus 103 l~v~ 106 (529) .+.. T Consensus 92 ~~~~ 95 (141) T pfam02765 92 GVAR 95 (141) T ss_pred EEEC T ss_conf 9872 No 475 >COG1350 Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB) [General function prediction only] Probab=38.35 E-value=24 Score=14.35 Aligned_cols=109 Identities=25% Similarity=0.409 Sum_probs=61.6 Q ss_pred CCEEEEE-EEEEEECCCCE---EEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHH Q ss_conf 9889999-99667528843---7999997101680079999999999765401226100163102652899984784258 Q gi|254780791|r 83 GIEFLVI-GKITTFPGSSK---YQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAV 158 (529) Q Consensus 83 G~~v~~~-g~~~~y~~~g~---~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~ 158 (529) |+++.|. .++|+|.+-++ ++++--.+-|+-. + +-+.-+|+.+| | |.-|-..||-- +-| T Consensus 151 ~lk~~V~Mvr~Sy~qKpyRk~lM~~yGa~V~pSPS-~----~Te~Grk~l~e---~-------p~hPGSLGIAI---SEA 212 (432) T COG1350 151 GLKATVFMVRVSYYQKPYRKYLMELYGAEVVPSPS-E----LTEFGRKILKE---D-------PDHPGSLGIAI---SEA 212 (432) T ss_pred CCEEEEEEEEHHHHCCHHHHHHHHHHCCEEECCCC-C----HHHHHHHHHHC---C-------CCCCCHHHHHH---HHH T ss_conf 85149999862321052999999985874206976-2----13788999841---9-------69986166878---999 Q ss_pred HHHHHHHHHHCCCE----EEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH Q ss_conf 99999986305975----8999721001111036799999999741003576777589995168884 Q gi|254780791|r 159 IRDILQRISCRFPL----RVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI 221 (529) Q Consensus 159 ~~D~~~~~~~r~p~----~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~ 221 (529) +.|.++.=..+|-+ +..|.+-+|=|-+|.++ ++.+. ..+|+||=|=||||- T Consensus 213 iE~al~~~~~kY~lGSVlnhvllhQTViGlEakkQ----le~~~--------e~PDv~igcvGGGSN 267 (432) T COG1350 213 IEYALKNENTKYSLGSVLNHVLLHQTVIGLEAKKQ----LEQAG--------EDPDVIIGCVGGGSN 267 (432) T ss_pred HHHHHHCCCCEECCHHHHHHHHHHHHHHHHHHHHH----HHHCC--------CCCCEEEEECCCCCC T ss_conf 99998488843521678789999888874879999----99638--------899879994168776 No 476 >COG4972 PilM Tfp pilus assembly protein, ATPase PilM [Cell motility and secretion / Intracellular trafficking and secretion] Probab=38.34 E-value=24 Score=14.35 Aligned_cols=75 Identities=16% Similarity=0.197 Sum_probs=41.1 Q ss_pred CCCCEEEEEEEEEEECC--------CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCH---HHHHHHHH Q ss_conf 28843799999710168--------00799999999997654012261001631026528999847842---58999999 Q gi|254780791|r 96 PGSSKYQIIIESLIPSG--------SGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTG---AVIRDILQ 164 (529) Q Consensus 96 ~~~g~~ql~v~~i~~~g--------~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~---a~~~D~~~ 164 (529) ...++|+|.-..++|-- +=+..+--+.||+-|++-|.+..+---.+|-----+-+|+=|.+ --++|++. T Consensus 29 ~~g~~~kLe~y~~~~lp~~iv~dg~ivd~~av~~~Lk~ala~~gi~~k~aa~AVP~s~aitk~i~vp~~lde~eL~~~V~ 108 (354) T COG4972 29 RSGNRYKLEKYASEPLPENIVADGKIVDYDAVASALKRALAKLGIKSKNAATAVPGSAAITKTIPVPDELDEKELEDQVE 108 (354) T ss_pred CCCCCEEEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHH T ss_conf 56896014343640467240036871368999999999998659633455521586400367741687646889999999 Q ss_pred HHHHCC Q ss_conf 863059 Q gi|254780791|r 165 RISCRF 170 (529) Q Consensus 165 ~~~~r~ 170 (529) .-..+| T Consensus 109 ~ea~~y 114 (354) T COG4972 109 SEASRY 114 (354) T ss_pred HHHHHC T ss_conf 987634 No 477 >TIGR03017 EpsF chain length determinant protein EpsF. Sequences in this family of proteins are members of the chain length determinant family (pfam02706) which includes the wzc protein from E.coli. This family of proteins are homologous to the EpsF protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer. Probab=38.28 E-value=24 Score=14.34 Aligned_cols=21 Identities=10% Similarity=0.194 Sum_probs=9.7 Q ss_pred CHHHHHHHHHHHHHHHHHCCC Q ss_conf 079999999999765401226 Q gi|254780791|r 114 GTLLTALEKRKKKLLEEGLFS 134 (529) Q Consensus 114 G~l~~~~e~lk~~L~~eGlfd 134 (529) +.+..+++-++.+.-.+.+.+ T Consensus 67 ~~~~t~~~il~S~~~~~~vi~ 87 (444) T TIGR03017 67 GYMATQVDIINSDRVAKKVVD 87 (444) T ss_pred HHHHHHHHHHCCHHHHHHHHH T ss_conf 779999999825999999999 No 478 >PRK13226 phosphoglycolate phosphatase; Provisional Probab=38.01 E-value=24 Score=14.31 Aligned_cols=102 Identities=17% Similarity=0.220 Sum_probs=58.6 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEECCC-CCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHH Q ss_conf 289998478425899999986305975899972100-1111036799999999741003576777589995168884442 Q gi|254780791|r 146 KIIAVITSPTGAVIRDILQRISCRFPLRVIIFPVKV-QGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDL 224 (529) Q Consensus 146 ~~i~vits~~~a~~~D~~~~~~~r~p~~~~~~p~~v-QG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL 224 (529) .++||+||........++..+.=..-|.+++-.-.| -++=.|.-+..|++.++-.. -++|+| +|= T Consensus 104 ~~~~VvTnk~~~~~~~~l~~~gl~~~F~~iv~~ddv~~~KP~Pd~~l~a~~~lgv~p-------~e~v~V-------GDs 169 (221) T PRK13226 104 CVWGIVTNKPEYLARLILPQLGWEQRCAVLIGGDTLAERKPHPLPLLTAAERIGVAP-------TQCVYV-------GDD 169 (221) T ss_pred CEEEEECCCCHHHHHHHHHHHCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHCCCH-------HHEEEE-------CCC T ss_conf 908985598188899999983985223300014445677888699999999949895-------368888-------999 Q ss_pred HHCCHHHHHHHHHHCCCEEE---EEEC-CCCCCHHHHHHHCCCCCCCH Q ss_conf 20076999999974890488---5205-77752589886412377721 Q gi|254780791|r 225 WHFNDEMIVRAIANSSIPII---SAIG-HETDWTLADYAADLRAPTPT 268 (529) Q Consensus 225 ~~FN~e~laraI~~~~iPVi---sgIG-HE~D~Tl~D~VAD~Ra~TPT 268 (529) .+|-.=|+ +.-+++| .|-+ ||.|. ...-||.=..+|. T Consensus 170 --~~Di~aA~---aAG~~~I~v~~Gy~~~~~~l--~~~~ad~~i~~p~ 210 (221) T PRK13226 170 --ERDILAAR---AAAMPSVAALWGYRLHDDDP--LTWQADVLVEQPQ 210 (221) T ss_pred --HHHHHHHH---HCCCCEEEECCCCCCCCCCH--HHCCCCEEECCHH T ss_conf --87999999---96999999778999985386--6659899988999 No 479 >cd06272 PBP1_hexuronate_repressor_like Ligand-binding domain of DNA transcription repressor for the hexuronate utilization operon from Bacillus species and its close homologs from other bacteria, all of which are a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor for the hexuronate utilization operon from Bacillus species and its close homologs from other bacteria, all of which are a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor Probab=38.01 E-value=24 Score=14.31 Aligned_cols=193 Identities=20% Similarity=0.197 Sum_probs=79.3 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHC-CCCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCC---CCCCCCCC Q ss_conf 888898622999999999997400-17189999970543568886279998748947999997352105---86681459 Q gi|254780791|r 8 NSLDHPEYSVSELSYHLKHIVESN-LSHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNK---IEFLPEEG 83 (529) Q Consensus 8 ~~~~~~~~svs~l~~~i~~~l~~~-~~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~---~~~~~~~G 83 (529) |+.++|. -+++...|...+... |. +-| . .++ .....=.. |. .-.++++|+-+.... +...-+.| T Consensus 7 p~i~npf--f~~l~~gi~~~~~~~GY~-~ll-~-~~~--~~~~~~~~---l~--~~~vDGiIi~~~~~~~~~~~~~~~~~ 74 (261) T cd06272 7 PSVSRVA--LTELVTGINQAISKNGYN-MNV-S-ITP--SLAEAEDL---FK--ENRFDGVIIFGESASDVEYLYKIKLA 74 (261) T ss_pred CCCCCHH--HHHHHHHHHHHHHHCCCE-EEE-E-ECC--CHHHHHHH---HH--HCCCCEEEEECCCCCHHHHHHHHHCC T ss_conf 7987589--999999999999984998-999-9-689--85999999---99--67998999978989979999997489 Q ss_pred CEEEEEEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHH-----H Q ss_conf 88999999667528843799999710168007999999999976540122610016310265289998478425-----8 Q gi|254780791|r 84 IEFLVIGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGA-----V 158 (529) Q Consensus 84 ~~v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a-----~ 158 (529) .-|++.++. +.. .+.. +..... ..-...-+.|.+.|. ++||.|+++... - T Consensus 75 iPvV~~~~~--~~~--~~~~----V~~Dn~----~~~~~a~~~L~~~G~-------------~~i~~i~~~~~~~~~~~R 129 (261) T cd06272 75 IPVVSYGVD--YDL--KYPI----VNVDNE----KAMELAVLYLAEKGH-------------KKIAYIGDLSLDRRQRKR 129 (261) T ss_pred CCEEEECCC--CCC--CCCE----EEECHH----HHHHHHHHHHHHCCC-------------CCEEEECCCCCCHHHHHH T ss_conf 939998367--899--9989----996889----999999999998699-------------828996698887359999 Q ss_pred HHHHHHHHHHC-CCE-EEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHH---H Q ss_conf 99999986305-975-899972100111103679999999974100357677758999516888444220076999---9 Q gi|254780791|r 159 IRDILQRISCR-FPL-RVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMI---V 233 (529) Q Consensus 159 ~~D~~~~~~~r-~p~-~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~l---a 233 (529) +.-|++.+++. .+. ...++. +.....+-..++..+-... +.+|+|+ |+||..- . T Consensus 130 ~~gf~~al~~~g~~~~~~~i~~----~~~~~~~~~~~~~~ll~~~-----~~p~Ai~------------~~nD~~A~g~~ 188 (261) T cd06272 130 FKGFLETCDENGISISDSHIDV----DGLSAEGGDNAAKKLLKES-----DLPTAII------------CGSYDIALGVL 188 (261) T ss_pred HHHHHHHHHHCCCCCCCEEEEE----CCCHHHHHHHHHHHHHHCC-----CCCCCEE------------ECCHHHHHHHH T ss_conf 9999999998599855138983----0310367999999998527-----8986101------------27689999999 Q ss_pred HHHHH--CCCE-EEEEECCCCCCHHHHHH Q ss_conf 99974--8904-88520577752589886 Q gi|254780791|r 234 RAIAN--SSIP-IISAIGHETDWTLADYA 259 (529) Q Consensus 234 raI~~--~~iP-VisgIGHE~D~Tl~D~V 259 (529) ++..+ -.|| =|+=||.. |..+++++ T Consensus 189 ~~l~~~g~~vP~dvsviGfD-d~~~~~~~ 216 (261) T cd06272 189 SALNKQGISIPEDIEIISYD-NIPQMAII 216 (261) T ss_pred HHHHHHCCCCCCCEEEEEEC-CCHHHHHH T ss_conf 99998099999955999988-81889950 No 480 >PRK12564 carbamoyl phosphate synthase small subunit; Reviewed Probab=37.89 E-value=24 Score=14.30 Aligned_cols=86 Identities=29% Similarity=0.420 Sum_probs=57.8 Q ss_pred CCCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCC Q ss_conf 31026528999847842589999998630597589997210011110367999999997410035767775899951688 Q gi|254780791|r 140 PIPFIPKIIAVITSPTGAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGG 219 (529) Q Consensus 140 ~lP~~p~~i~vits~~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGG 219 (529) ..|.-..+|+||-= -+-.-|+|.|.+|. ++|.++|.-. .+.+| .. .+||-|+|-=|=| T Consensus 166 ~~~~~~~~V~viD~---GvK~nIlr~L~~rg-~~v~VvP~~~----~~~~I-------~~-------~~pDGIflSNGPG 223 (355) T PRK12564 166 PWPGGGYHVVAIDF---GVKRNILRELAERG-CRVTVVPATT----TAEEI-------LA-------LNPDGVFLSNGPG 223 (355) T ss_pred ECCCCCEEEEEEEC---CCHHHHHHHHHHCC-CEEEEECCCC----CHHHH-------HC-------CCCCEEEECCCCC T ss_conf 72799718999937---87388999999879-8699968978----76576-------41-------4888899679999 Q ss_pred CHHHHHHCCHHHHHHHHHHCCCEEEEEE--CCC Q ss_conf 8444220076999999974890488520--577 Q gi|254780791|r 220 SIEDLWHFNDEMIVRAIANSSIPIISAI--GHE 250 (529) Q Consensus 220 S~eDL~~FN~e~laraI~~~~iPVisgI--GHE 250 (529) -.+++- ..-+.+|.+.+..+||. || ||. T Consensus 224 DP~~~~--~~i~~ik~li~~~~Pif-GICLGHQ 253 (355) T PRK12564 224 DPAALD--YAIEMIKELLEGKIPIF-GICLGHQ 253 (355) T ss_pred CHHHHH--HHHHHHHHHHCCCCCEE-EECHHHH T ss_conf 968879--99999999852698789-9888799 No 481 >smart00424 STE STE like transcription factors. Probab=37.89 E-value=24 Score=14.30 Aligned_cols=77 Identities=19% Similarity=0.406 Sum_probs=36.3 Q ss_pred EEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 99874894799999735210586681459889999996675288437999997101680079999999999765401226 Q gi|254780791|r 55 FSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVIGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFS 134 (529) Q Consensus 55 f~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd 134 (529) |-|..+..-|+||+|.+-+.- .|-.++ ++-+| .++..|- .-.-.|=-+||+|. T Consensus 7 y~l~~~~gfvsCV~Wn~l~~i------tgtdiv---r~~~~-----------rf~~fgR-------~v~~~KKfEEGiFS 59 (111) T smart00424 7 YYLPNGEGFVSCVFWNNLYHI------TGTDIV---RCLVY-----------RFQAFGR-------PVVNRKKFEEGIFS 59 (111) T ss_pred EECCCCCCEEEEEEECCEEEE------ECCCHH---HHHHH-----------HHHHCCC-------CHHHHHHHHHHHHH T ss_conf 764788856899997667687------265237---76666-----------5886196-------00023445455356 Q ss_pred CCCCCCCCCCCCEEEEEECCCHHHHHHHH Q ss_conf 10016310265289998478425899999 Q gi|254780791|r 135 DQHKNPIPFIPKIIAVITSPTGAVIRDIL 163 (529) Q Consensus 135 ~~~k~~lP~~p~~i~vits~~~a~~~D~~ 163 (529) .-|.+.. -.=|..-.|.+- +=||+ T Consensus 60 DLRnLK~----g~DA~LE~PkS~-fL~fL 83 (111) T smart00424 60 DLRNLKC----GTDATLEEPKSP-FLDFL 83 (111) T ss_pred HHHCCCC----CCCCCCCCCCCH-HHHHH T ss_conf 6642789----976422587568-89999 No 482 >COG0277 GlcD FAD/FMN-containing dehydrogenases [Energy production and conversion] Probab=37.87 E-value=24 Score=14.30 Aligned_cols=25 Identities=12% Similarity=0.151 Sum_probs=19.3 Q ss_pred HHHHHHHHHHHHCCHHHHHCCCEEE Q ss_conf 9999999988716967773146199 Q gi|254780791|r 440 TSVSITTRILQSFAYKNTLKRGYTS 464 (529) Q Consensus 440 ~rL~~l~~~L~slsP~~~L~RGYai 464 (529) ..+..+.+.-+.++|.++|..|+.+ T Consensus 432 ~~~~~~~~~k~~~DP~~i~npg~~~ 456 (459) T COG0277 432 EAWALLRAIKRAFDPNGIFNPGKLF 456 (459) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 7999999999860998667698647 No 483 >cd00147 cPLA2_like Cytosolic phospholipase A2, catalytic domain; hydrolyses arachidonyl phospholipids. Catalytic domain of cytosolic phospholipase A2 (PLA2; EC 3.1.1.4) hydrolyzes the sn-2-acyl ester bond of phospholipids to release arachidonic acid. At the active site, cPLA2 contains a serine nucleophile through which the catalytic mechanism is initiated. The active site is partially covered by a solvent-accessible flexible lid. cPLA2 displays interfacial activation as it exists in both "closed lid" and "open lid" forms. Movement of the cPLA2 lid possibly exposes a greater hydrophobic surface and the active site. cPLA2 belongs to the alpha-beta hydrolase family which is identified by a characteristic nucleophile elbow with a consensus sequence of Sm-X-Nu-Sm (Sm = small residue, X = any residue and Nu = nucleophile). Calcium is required for cPLA2 to bind with membranes or phospholipids. Group IV cPLA2 includes six intercellular enzymes: cPLA2alpha, cPLA2beta, cPLA2gamma, cPLA2delta, cP Probab=37.72 E-value=10 Score=17.69 Aligned_cols=32 Identities=22% Similarity=0.189 Sum_probs=19.8 Q ss_pred HHCCCEEEEEECCCCCCHHHHHHHCCCCCCCH Q ss_conf 74890488520577752589886412377721 Q gi|254780791|r 237 ANSSIPIISAIGHETDWTLADYAADLRAPTPT 268 (529) Q Consensus 237 ~~~~iPVisgIGHE~D~Tl~D~VAD~Ra~TPT 268 (529) ++.|.||++|+||+.|.++...=+..=-=||= T Consensus 176 g~~P~PI~~a~~~~~~~~~~~~~~~~~EftPy 207 (438) T cd00147 176 GQNPLPIYTALNVKPGETSINDFATWFEFTPY 207 (438) T ss_pred CCCCCCEEEEECCCCCCCCCCCCCEEEEECCH T ss_conf 99986779980456886325789868997732 No 484 >pfam01735 PLA2_B Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysophsopholipids and are capable in vitro of hydrolysing all phospholipids extractable form yeast cells. Cytosolic phospholipase A2 associates with natural membranes in response to physiological increases in Ca2+ and selectively hydrolyses arachidonyl phospholipids, the aligned region corresponds the the carboxy-terminal Ca2+-independent catalytic domain of the protein as discussed in. Probab=37.66 E-value=11 Score=17.48 Aligned_cols=22 Identities=27% Similarity=0.374 Sum_probs=8.2 Q ss_pred HCCCEEEEEECCCCCCHHHHHH Q ss_conf 4890488520577752589886 Q gi|254780791|r 238 NSSIPIISAIGHETDWTLADYA 259 (529) Q Consensus 238 ~~~iPVisgIGHE~D~Tl~D~V 259 (529) ++|.||++|+|...|+++.++= T Consensus 154 ~~P~PI~vA~~r~p~~~~~~~n 175 (490) T pfam01735 154 EMPFPIIVADGRKPGTTVINLN 175 (490) T ss_pred CCCCCEEEECCCCCCCCCCCCC T ss_conf 9987879974525885223577 No 485 >cd06315 PBP1_ABC_sugar_binding_like_6 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=37.63 E-value=24 Score=14.27 Aligned_cols=79 Identities=22% Similarity=0.249 Sum_probs=48.6 Q ss_pred CEEEEEEC----CCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCH Q ss_conf 28999847----84258999999863059758999721001111036799999999741003576777589995168884 Q gi|254780791|r 146 KIIAVITS----PTGAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSI 221 (529) Q Consensus 146 ~~i~vits----~~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~ 221 (529) |+|++|.| +.=.++.+=.+-..+.-+.++.++. |.+.+....++|+.+-..+ +|.||| | T Consensus 1 k~i~~va~~l~n~g~~~v~~G~~eaA~~lGw~v~v~D----~~g~~~~~~~~i~~ai~~k-------~D~Iii--~---- 63 (280) T cd06315 1 KNIIFVASDLKNGGILGVGEGVREAAKAIGWNLRILD----GRGSEAGQAAALNQAIALK-------PDGIVL--G---- 63 (280) T ss_pred CEEEEEEEECCCCCHHHHHHHHHHHHHHCCCEEEEEC----CCCCHHHHHHHHHHHHHCC-------CCEEEE--C---- T ss_conf 9399998726687415498899999997598799988----9999999999999999639-------999999--9---- Q ss_pred HHHHHCCHH---HHHHHHHHCCCEEEE Q ss_conf 442200769---999999748904885 Q gi|254780791|r 222 EDLWHFNDE---MIVRAIANSSIPIIS 245 (529) Q Consensus 222 eDL~~FN~e---~laraI~~~~iPVis 245 (529) +|+.- .-.+...+..||||+ T Consensus 64 ----~~D~~~~~~~l~~A~~agIPvv~ 86 (280) T cd06315 64 ----GVDAAELQAELELAQKAGIPVVG 86 (280) T ss_pred ----CCCHHHHHHHHHHHHHCCCCEEE T ss_conf ----82978878999999987997896 No 486 >PRK10696 C32 tRNA thiolase; Provisional Probab=37.61 E-value=24 Score=14.26 Aligned_cols=36 Identities=19% Similarity=0.291 Sum_probs=27.0 Q ss_pred CEEEEEEC--CCHHHHHHHHHHHHHCCCEEEEEEECCC Q ss_conf 28999847--8425899999986305975899972100 Q gi|254780791|r 146 KIIAVITS--PTGAVIRDILQRISCRFPLRVIIFPVKV 181 (529) Q Consensus 146 ~~i~vits--~~~a~~~D~~~~~~~r~p~~~~~~p~~v 181 (529) -||.|--| ++|-++=+++..++++.|+++.|..+.| T Consensus 40 DRVlVglSGGKDS~~LL~iL~~Lq~~api~FeLvAv~l 77 (311) T PRK10696 40 DRIMVCLSGGKDSYTMLEILRNLQQSAPINFSLVAVNL 77 (311) T ss_pred CEEEEECCCCHHHHHHHHHHHHHHHHCCCCCEEEEEEE T ss_conf 99999826788899999999999985899855999983 No 487 >cd01967 Nitrogenase_MoFe_alpha_like Nitrogenase_MoFe_alpha_like: Nitrogenase MoFe protein, alpha subunit_like. The nitrogenase enzyme catalyzes the ATP-dependent reduction of dinitrogen to ammonia. Three genetically distinct types of nitrogenase systems are known to exist: a molybdenum-dependent nitrogenase (Mo-nitrogenase), a vanadium dependent nitrogenase (V-nitrogenase), and an iron-only nitrogenase (Fe-nitrogenase). These nitrogenase systems consist of component 1 (MoFe protein, VFe protein or, FeFe protein respectively) and, component 2 (Fe protein). This group contains the alpha subunit of component 1 of all three different forms. The most widespread and best characterized of these systems is the Mo-nitrogenase. MoFe is an alpha2beta2 tetramer, the alternative nitrogenases are alpha2beta2delta2 hexamers having alpha and beta subunits similar to the alpha and beta subunits of MoFe. The role of the delta subunit is unknown. For MoFe, each alphabeta pair of subunits contains one Probab=37.55 E-value=24 Score=14.26 Aligned_cols=116 Identities=16% Similarity=0.143 Sum_probs=62.1 Q ss_pred CCCEEEEEECCCHHHHHHHH----HHHHHCCCEEEEEEECC-CCCCC-------HHHHHHHHHHHHHHHCCCCCCCCCCE Q ss_conf 65289998478425899999----98630597589997210-01111-------03679999999974100357677758 Q gi|254780791|r 144 IPKIIAVITSPTGAVIRDIL----QRISCRFPLRVIIFPVK-VQGDE-------CPKEIANAILQLNTLKEGRTCPRPDI 211 (529) Q Consensus 144 ~p~~i~vits~~~a~~~D~~----~~~~~r~p~~~~~~p~~-vQG~~-------a~~~i~~ai~~~~~~~~~~~~~~~D~ 211 (529) -|+-|+|+|+-.++-++|=+ +.+++++|..|+..+++ ..|.. |...|++.+.. ..+ ......+. T Consensus 87 ~P~~i~V~tTC~~eiIGdDi~~v~~~~~~~~~~pvi~v~tpgf~g~~~~~G~~~a~~~i~~~l~~---~~~-~~~~~~~~ 162 (406) T cd01967 87 PPKAIFVYSTCPTGLIGDDIEAVAKEASKELGIPVIPVNCEGFRGVSQSLGHHIANDAILDHLVG---TKE-PEEKTPYD 162 (406) T ss_pred CCCEEEEECCCCHHHHCCCHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHCC---CCC-CCCCCCCE T ss_conf 99789997778099858768999999888609978998789878876469999999999998444---578-76788983 Q ss_pred EEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHCCCCCCCHHHHHHHC Q ss_conf 9995168884442200769999999748904885205777525898864123777214567633 Q gi|254780791|r 212 IILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYAADLRAPTPTGAAEMAV 275 (529) Q Consensus 212 iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~VAD~Ra~TPTaAAElav 275 (529) |-|. | +. .++-|-+++-|-+-.+-++|.+-+.. ++|+.|+- ..|.|++-++. T Consensus 163 VNii-g-~~---~~~~d~~eik~ll~~~Gi~v~~~~~~--~~s~~~i~-----~~~~A~~ni~~ 214 (406) T cd01967 163 VNII-G-EY---NIGGDAWVIKPLLEELGIRVNATFTG--DGTVDELR-----RAHRAKLNLVH 214 (406) T ss_pred EEEE-C-CC---CCHHHHHHHHHHHHHCCCCEEEECCC--CCCHHHHH-----HCCCCCEEEEE T ss_conf 9998-8-87---87013999999999859926887489--99857776-----46788689996 No 488 >pfam02552 CO_dh CO dehydrogenase beta subunit/acetyl-CoA synthase epsilon subunit. This family consists of Carbon monoxide dehydrogenase I/II beta subunit EC:1.2.99.2 and acetyl-CoA synthase epsilon subunit. Carbon monoxide beta subunit catalyses the reaction: CO + H2O + acceptor <= CO2 + reduced acceptor. Probab=37.49 E-value=24 Score=14.25 Aligned_cols=61 Identities=20% Similarity=0.185 Sum_probs=44.2 Q ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEEC Q ss_conf 8999721001111036799999999741003576777589995168884442200769999999748904885205 Q gi|254780791|r 173 RVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIG 248 (529) Q Consensus 173 ~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIG 248 (529) ..++.|-.|--+.+.+.+...|.++.+ =++|++ .+=||..|+|..+.++....+|++.+-+ T Consensus 11 ~~i~~~~~a~~~~~~~~aa~~i~kAK~----------P~livG-----p~vl~~~~~~~~~k~~~~~~l~~~at~~ 71 (168) T pfam02552 11 TAIMNPKIATLEEPMKVAAMMIKKAKR----------PLLIVG-----PQVLYAENEEAKVKKILEKDLKEVATAL 71 (168) T ss_pred CCCCCCCEECHHHHHHHHHHHHHHCCC----------CEEEEC-----CHHHHHHHHHHHHHHHHHCCCCEECCCC T ss_conf 104687200132259999999985579----------869867-----1442134588899999982886430566 No 489 >PRK08251 short chain dehydrogenase; Provisional Probab=37.48 E-value=24 Score=14.25 Aligned_cols=90 Identities=14% Similarity=0.174 Sum_probs=57.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHH Q ss_conf 800799999999997654012261001631026528999847842589999998630597-5899972100111103679 Q gi|254780791|r 112 GSGTLLTALEKRKKKLLEEGLFSDQHKNPIPFIPKIIAVITSPTGAVIRDILQRISCRFP-LRVIIFPVKVQGDECPKEI 190 (529) Q Consensus 112 g~G~l~~~~e~lk~~L~~eGlfd~~~k~~lP~~p~~i~vits~~~a~~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i 190 (529) |+|. ++-++|.++|+- -++++.+...+..+...+..++| .++..+++=|--.++..++ T Consensus 13 GIG~------alA~~la~~G~~---------------v~l~~r~~~~l~~~~~el~~~~~~~~v~~~~~Dvsd~~~v~~~ 71 (248) T PRK08251 13 GLGA------GMAREFAAKGRD---------------LALCARRTDRLEELKAELLARYPGIKVAVAALDVNDHDQVFEV 71 (248) T ss_pred HHHH------HHHHHHHHCCCE---------------EEEEECCHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHH T ss_conf 9999------999999987998---------------9999898889999999998737997399997867868999999 Q ss_pred HHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHH Q ss_conf 9999999741003576777589995168884442200769 Q gi|254780791|r 191 ANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDE 230 (529) Q Consensus 191 ~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e 230 (529) +..+ ... ...+|++|..=|-|...++...+.+ T Consensus 72 ~~~~---~~~-----~g~iD~lvnNAGi~~~~~~~~~~~~ 103 (248) T PRK08251 72 FAEF---SDE-----LGGLDRVIVNAGIGKGARLGTGKFW 103 (248) T ss_pred HHHH---HHH-----HCCCCEEEECCCCCCCCCCCCCCHH T ss_conf 9999---998-----0999899985765788665559999 No 490 >PRK00087 4-hydroxy-3-methylbut-2-enyl diphosphate reductase/S1 RNA-binding domain protein; Reviewed Probab=37.48 E-value=24 Score=14.25 Aligned_cols=213 Identities=16% Similarity=0.204 Sum_probs=102.9 Q ss_pred CCHHHHHHHHHHHHHHCCCCEEEEEEEC-------CCCCCCCCC-EEEEEEECCCCEEEEEEECCCCC-----------C Q ss_conf 2299999999999740017189999970-------543568886-27999874894799999735210-----------5 Q gi|254780791|r 15 YSVSELSYHLKHIVESNLSHVCVRGEIS-------GYRGIHSSG-HAYFSLKDNHSRIDAIIWKGTLN-----------K 75 (529) Q Consensus 15 ~svs~l~~~i~~~l~~~~~~~~v~gEis-------~~~~~~~sG-H~Yf~lkd~~a~i~~~~~~~~~~-----------~ 75 (529) |-|..-...+..+++..=..+|+.|+|- .|+ ..| ++.-++.|....=+.|++|..=- + T Consensus 13 ~GV~rAI~~a~~~~~~~~~~vytlG~iIHN~~Vv~~L~---~~Gv~~v~~~~~~~~~~~~VIIrAHGv~p~v~~~~~~~g 89 (670) T PRK00087 13 FGVKRAVDTAIETLEEGKKKIYTLGPLIHNNQVVEELK---RKGIEPIEEIEDELNEGDTIIIRSHGVPPEVLEKLKDKG 89 (670) T ss_pred CCHHHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHH---HCCCEEECCCCCCCCCCCEEEEECCCCCHHHHHHHHHCC T ss_conf 08899999999999852997898288868979999999---886999167656579979999928999989999999879 Q ss_pred C-------CC----------CCCCCCEEEEEEEEEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 8-------66----------814598899999966752884379999971016800799999999997654012261001 Q gi|254780791|r 76 I-------EF----------LPEEGIEFLVIGKITTFPGSSKYQIIIESLIPSGSGTLLTALEKRKKKLLEEGLFSDQHK 138 (529) Q Consensus 76 ~-------~~----------~~~~G~~v~~~g~~~~y~~~g~~ql~v~~i~~~g~G~l~~~~e~lk~~L~~eGlfd~~~k 138 (529) + ++ .-++|-+|++.|.-+.=+-.|-+...-...-.. ...+.+ T Consensus 90 ~~vvDATCP~V~Kvh~~~~~~~~~Gy~iviiG~~~HpEV~Gi~g~~~~~~~vi------~~~e~~--------------- 148 (670) T PRK00087 90 LKVIDATCPFVKNIQKLAKKYYEEGYQIVIVGDKNHPEVIGINGWCNNSAIIS------KDGEEA--------------- 148 (670) T ss_pred CEEEECCCCCCHHHHHHHHHHHHCCCEEEEECCCCCCHHEEECCCCCCEEEEE------CCHHHH--------------- T ss_conf 95996788262899999999997879899987999954157244469817998------079998--------------- Q ss_pred CCCCCCCCEEEEEEC--CCHHHHHHHHHHHHHCCC-EEEE--EEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEE Q ss_conf 631026528999847--842589999998630597-5899--97210011110367999999997410035767775899 Q gi|254780791|r 139 NPIPFIPKIIAVITS--PTGAVIRDILQRISCRFP-LRVI--IFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIII 213 (529) Q Consensus 139 ~~lP~~p~~i~vits--~~~a~~~D~~~~~~~r~p-~~~~--~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~ii 213 (529) ..+| +++++|++= -+-.-+..|+..|..+|| +.+. +..++-+=..|+.+++ ..+|++| T Consensus 149 ~~l~--~~kv~vvsQTT~~~~~~~~iv~~lk~~~~e~~v~nTIC~AT~~RQ~a~~~LA---------------~~vD~mi 211 (670) T PRK00087 149 ANLP--YPKVCVVSQTTEKQENFEKVLKELKKKGKEVKVFNTICNATAVRQEAAEKLA---------------KKVDVMI 211 (670) T ss_pred HCCC--CCCEEEEECCCCCHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHH---------------HHCCEEE T ss_conf 3488--7678999899898899999999999878887678985442688999999998---------------6599999 Q ss_pred EECCCCCHHHHHHCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHC------CCCCCCHHHHHHHC Q ss_conf 951688844422007699999997489048852057775258988641------23777214567633 Q gi|254780791|r 214 LARGGGSIEDLWHFNDEMIVRAIANSSIPIISAIGHETDWTLADYAAD------LRAPTPTGAAEMAV 275 (529) Q Consensus 214 i~RGGGS~eDL~~FN~e~laraI~~~~iPVisgIGHE~D~Tl~D~VAD------~Ra~TPTaAAElav 275 (529) ++=|=-|- |.-.|+.-.-....|.. =|-...|..-..|-.- .-|+||.---|-++ T Consensus 212 VVGG~nSS------NT~rL~eia~~~~~~ty-~IE~~~el~~~~~~~~~~VGITAGASTP~~iI~EVi 272 (670) T PRK00087 212 VVGGKNSS------NTTKLYEICKSNCTNTI-HIENAGELPEEIFKGVNIIGVTAGASTPDWIIEEVI 272 (670) T ss_pred EECCCCCH------HHHHHHHHHHHHCCCEE-EECCHHHCCHHHHCCCCEEEEECCCCCCHHHHHHHH T ss_conf 98998972------59999999997799989-969928799989769998999551379988999999 No 491 >PRK12681 cysB transcriptional regulator CysB; Reviewed Probab=37.39 E-value=24 Score=14.24 Aligned_cols=92 Identities=18% Similarity=0.282 Sum_probs=51.6 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHH Q ss_conf 528999847842589999998630597-5899972100111103679999999974100357677758999516888444 Q gi|254780791|r 145 PKIIAVITSPTGAVIRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIED 223 (529) Q Consensus 145 p~~i~vits~~~a~~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eD 223 (529) --+||...+.....+-+++.....+|| +++.+. || .+.+|.+.+.. ..+|+.|...+-...+| T Consensus 94 ~L~Ia~~~t~a~~~LP~~i~~F~~~~P~v~l~l~----~~--~~~~i~~~l~~----------g~~DlaI~~e~~~~~~d 157 (324) T PRK12681 94 SLYIATTHTQARYALPPVIKGFIKRYPRVSLHMH----QG--SPTQIAEAAAK----------GEADFAIATEALHLYDD 157 (324) T ss_pred EEEEEECHHHHHHHHHHHHHHHHHHCCCCCEEEE----EC--CHHHHHHHHHC----------CCCCCCCCCCCCCCCCC T ss_conf 4999853588887725999999987899844653----17--87999999987----------99872312466666887 Q ss_pred HHHCCHHHHHHHHHHCCCEEEEEECCC----CCCHHHHHH Q ss_conf 220076999999974890488520577----752589886 Q gi|254780791|r 224 LWHFNDEMIVRAIANSSIPIISAIGHE----TDWTLADYA 259 (529) Q Consensus 224 L~~FN~e~laraI~~~~iPVisgIGHE----~D~Tl~D~V 259 (529) |..+ -+|.-+.-+|..-||- ...|+.|+. T Consensus 158 l~~~-------P~~~~~~~vvvp~~HPLa~~~~itl~dL~ 190 (324) T PRK12681 158 LIML-------PCYHWNRSVVVPPDHPLAKKKKLTIEELA 190 (324) T ss_pred CEEE-------EEEECCCCCCCCCCCCCCCCCCCCHHHHC T ss_conf 4686-------40003222457999965579999999986 No 492 >PRK07211 replication factor A; Reviewed Probab=37.37 E-value=24 Score=14.24 Aligned_cols=73 Identities=18% Similarity=0.385 Sum_probs=38.0 Q ss_pred CCCCEEEEEEECC------CCC-CCCCCEE-EEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEEEE-EEEEEECCCCEE Q ss_conf 0171899999705------435-6888627-999874894799999735210586681459889999-996675288437 Q gi|254780791|r 31 NLSHVCVRGEISG------YRG-IHSSGHA-YFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFLVI-GKITTFPGSSKY 101 (529) Q Consensus 31 ~~~~~~v~gEis~------~~~-~~~sGH~-Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~~~-g~~~~y~~~g~~ 101 (529) ..+.+-++|+|-. |.. .-+.|-+ -|.|+|+...|+.++|-.++..+. .+..|+-|.+. |++ -+..|+. T Consensus 171 g~~dv~l~g~VL~~~~iRtF~R~DGSeGrV~sl~lGDETG~IRvTLWDd~aDl~~-~l~~GdsVeIv~ayv--REr~g~l 247 (473) T PRK07211 171 GQSDVTLRGEVLDTDAVRTFDRDDGSEGRVSNLTVGDETGRIRVTLWDDQADRAD-DLDAGASVEIVDGYV--REREGSL 247 (473) T ss_pred CCCCCEEEEEEECCCCCCCEECCCCCCCEEEEEEEECCCCEEEEEEECCCCCCCC-CCCCCCEEEEECCEE--ECCCCCE T ss_conf 8876379999943788741145789863488889855876199998558741002-589998799932586--2247856 Q ss_pred EEEEE Q ss_conf 99999 Q gi|254780791|r 102 QIIIE 106 (529) Q Consensus 102 ql~v~ 106 (529) .|.+- T Consensus 248 EL~vg 252 (473) T PRK07211 248 ELHVG 252 (473) T ss_pred EEEEC T ss_conf 78974 No 493 >PRK13165 cytochrome c-type biogenesis protein CcmE; Reviewed Probab=37.32 E-value=24 Score=14.23 Aligned_cols=72 Identities=17% Similarity=0.319 Sum_probs=0.0 Q ss_pred CCHHHHHHHHHHHHHH-------CCCCEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEEECCCCCCCCCCCCCCCEEE Q ss_conf 2299999999999740-------017189999970543568886279998748947999997352105866814598899 Q gi|254780791|r 15 YSVSELSYHLKHIVES-------NLSHVCVRGEISGYRGIHSSGHAYFSLKDNHSRIDAIIWKGTLNKIEFLPEEGIEFL 87 (529) Q Consensus 15 ~svs~l~~~i~~~l~~-------~~~~~~v~gEis~~~~~~~sGH~Yf~lkd~~a~i~~~~~~~~~~~~~~~~~~G~~v~ 87 (529) ||.||+...-...-+. ..+..-+.|-| .....+--+.|.+.|..+.|. |-|.+- ++=..++|.-|+ T Consensus 38 ~TPsEv~~~~~~~~~~p~~g~~iRvGGlV~~gSv---~r~~~~l~v~F~iTD~~~~i~-V~Y~Gi---lPDLFrEGqGVV 110 (161) T PRK13165 38 YTPGEILYGKRETQQKPEVGQRLRVGGMVMPGSV---QRDPNSLKVSFTVYDAGGSVT-VTYEGI---LPDLFREGQGIV 110 (161) T ss_pred ECHHHHHCCCCCCCCCCCCCCEEEEEEEEECCEE---EECCCCCEEEEEEECCCCEEE-EEECCC---CCHHHHCCCEEE T ss_conf 6789985033654446666846997038863859---977996279999976886799-998477---940332698399 Q ss_pred EEEEEE Q ss_conf 999966 Q gi|254780791|r 88 VIGKIT 93 (529) Q Consensus 88 ~~g~~~ 93 (529) +.|++. T Consensus 111 aeG~~~ 116 (161) T PRK13165 111 AQGVLE 116 (161) T ss_pred EEEEEC T ss_conf 999987 No 494 >pfam01220 DHquinase_II Dehydroquinase class II. Probab=37.27 E-value=25 Score=14.22 Aligned_cols=73 Identities=23% Similarity=0.324 Sum_probs=0.0 Q ss_pred CCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCHHHH Q ss_conf 78425899999986305975899972100111103679999999974100357677758999516888444220076999 Q gi|254780791|r 153 SPTGAVIRDILQRISCRFPLRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFNDEMI 232 (529) Q Consensus 153 s~~~a~~~D~~~~~~~r~p~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~e~l 232 (529) +.|=+.+.+.++.......+++..|-+=.-| +|+..|..+.. .+|.|||-=|| |..-+..+ T Consensus 24 ~~tl~~i~~~~~~~a~~~g~~~~~~QSN~Eg-----eiId~I~~a~~--------~~dgiIiNpga------~THtS~ai 84 (140) T pfam01220 24 STTLADIEARLRELAAELGVELEFFQSNHEG-----ELIDWIHEARG--------DVDGIIINPAA------YTHTSVAL 84 (140) T ss_pred CCCHHHHHHHHHHHHHHCCCEEEEEECCCHH-----HHHHHHHHHHC--------CCCEEEECCCH------HEEHHHHH T ss_conf 3699999999999999729926899628689-----99999999754--------47689985621------01102667 Q ss_pred HHHHHHCCCEEE Q ss_conf 999974890488 Q gi|254780791|r 233 VRAIANSSIPII 244 (529) Q Consensus 233 araI~~~~iPVi 244 (529) ..|+....+|+| T Consensus 85 ~DAl~~~~~P~i 96 (140) T pfam01220 85 RDALAAVGIPVI 96 (140) T ss_pred HHHHHHCCCCEE T ss_conf 999986499989 No 495 >COG1922 WecG Teichoic acid biosynthesis proteins [Cell envelope biogenesis, outer membrane] Probab=37.21 E-value=25 Score=14.22 Aligned_cols=90 Identities=18% Similarity=0.272 Sum_probs=0.0 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHH Q ss_conf 28999847842589999998630597-58999721001111036799999999741003576777589995168884442 Q gi|254780791|r 146 KIIAVITSPTGAVIRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDL 224 (529) Q Consensus 146 ~~i~vits~~~a~~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL 224 (529) ++|.+.-+..+-+-+=- ..++..|| +.|.-+..--=...-.+.|++.|...+ +|+|.++=|=.-.| T Consensus 109 ~~vfllGgkp~V~~~a~-~~l~~~~p~l~ivg~h~GYf~~~e~~~i~~~I~~s~----------pdil~VgmG~P~QE-- 175 (253) T COG1922 109 KRVFLLGGKPGVAEQAA-AKLRAKYPGLKIVGSHDGYFDPEEEEAIVERIAASG----------PDILLVGMGVPRQE-- 175 (253) T ss_pred CEEEEECCCHHHHHHHH-HHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHCC----------CCEEEEECCCCHHH-- T ss_conf 64999648778999999-999987899569995178788356899999998559----------99899956996227-- Q ss_pred HHCCHHHHHHHHHHCCCEEEEEECCCCCC Q ss_conf 20076999999974890488520577752 Q gi|254780791|r 225 WHFNDEMIVRAIANSSIPIISAIGHETDW 253 (529) Q Consensus 225 ~~FN~e~laraI~~~~iPVisgIGHE~D~ 253 (529) .=+++....++.+|+.|||--.|+ T Consensus 176 -----~wi~~~~~~~~~~v~igVGg~fDv 199 (253) T COG1922 176 -----IWIARNRQQLPVAVAIGVGGSFDV 199 (253) T ss_pred -----HHHHHHHHHCCCCEEEECCCEEEE T ss_conf -----999876885698669960512798 No 496 >PRK12595 bifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase; Reviewed Probab=37.10 E-value=25 Score=14.20 Aligned_cols=76 Identities=22% Similarity=0.222 Sum_probs=0.0 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCC--------CCHHHHHHCCHHHHHHHHHHCCCEEEE Q ss_conf 999721001111036799999999741003576777589995168--------884442200769999999748904885 Q gi|254780791|r 174 VIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGG--------GSIEDLWHFNDEMIVRAIANSSIPIIS 245 (529) Q Consensus 174 ~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGG--------GS~eDL~~FN~e~laraI~~~~iPVis 245 (529) ++-=||+|-.++-.-++..+++... +-+-||| +|++.|-.=-=+-|.++=-+.-+|||| T Consensus 121 iIAGPCsvES~eQi~~~A~~vk~~G-------------~~~lRgGa~KPRTsPysFqGlG~eGL~~L~~a~~e~gl~vvT 187 (360) T PRK12595 121 FIFGPCAVESYEQVAAVAQALKAKG-------------LKLLRGGAFKPRTSPYDFQGLGVEGLKILKQVADEYDLAVIS 187 (360) T ss_pred EEECCCCCCCHHHHHHHHHHHHHCC-------------CCEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEE T ss_conf 8956883678999999999999759-------------755725556899999765768457999999999985997279 Q ss_pred EECCCCCCHHHHHHHCC Q ss_conf 20577752589886412 Q gi|254780791|r 246 AIGHETDWTLADYAADL 262 (529) Q Consensus 246 gIGHE~D~Tl~D~VAD~ 262 (529) -|-...|.-++--.+|+ T Consensus 188 EV~~~~~ve~~~~yvDi 204 (360) T PRK12595 188 EIVNPNDVEVALDYVDV 204 (360) T ss_pred EECCHHHHHHHHHHCCE T ss_conf 85788899999974868 No 497 >PRK13527 glutamine amidotransferase subunit PdxT; Provisional Probab=37.04 E-value=25 Score=14.20 Aligned_cols=65 Identities=18% Similarity=0.325 Sum_probs=0.0 Q ss_pred CCCCC--CHHHHHHHHHHHHHH------HCCCCCCCCCCEEEEECCCCCHHH----HHHCCHHHHHHHHHHCCCEEEEEE Q ss_conf 00111--103679999999974------100357677758999516888444----220076999999974890488520 Q gi|254780791|r 180 KVQGD--ECPKEIANAILQLNT------LKEGRTCPRPDIIILARGGGSIED----LWHFNDEMIVRAIANSSIPIISAI 247 (529) Q Consensus 180 ~vQG~--~a~~~i~~ai~~~~~------~~~~~~~~~~D~iii~RGGGS~eD----L~~FN~e~laraI~~~~iPVisgI 247 (529) ..||+ .+..++-+|++...- ..........|.|||= | |+..+ |-..+=.+..+...+...|++ || T Consensus 7 alqG~~~~hi~sl~kAl~~lg~~~e~~~v~~p~~l~~~d~LILP-G-G~f~~~m~~L~~~gl~~~i~~~i~~g~PiL-GI 83 (196) T PRK13527 7 ALQGDVEEHIDALKRALDELGINGEVVRVRRPEDLKDCDALIIP-G-GESTTIGRLMKRYGILDEIKEKIEEGLPIL-GT 83 (196) T ss_pred EECCCHHHHHHHHHHHHHHCCCCCEEEEECCHHHHHCCCEEEEC-C-CCHHHHHHHHHHCCCHHHHHHHHHCCCCEE-EE T ss_conf 72479999999999999982999059996998998319979989-9-667999999888688899999997699779-97 No 498 >pfam03607 DCX Doublecortin. Probab=37.00 E-value=25 Score=14.19 Aligned_cols=49 Identities=12% Similarity=0.149 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHCCHHHHHCCCEEEEECCCCCEECCHHHCCCCCEEEE Q ss_conf 9999999998871696777314619998489889577789299986999 Q gi|254780791|r 439 HTSVSITTRILQSFAYKNTLKRGYTSIQDTNNNFITQKRNLATKTRILI 487 (529) Q Consensus 439 ~~rL~~l~~~L~slsP~~~L~RGYaiv~~~~GkiI~s~~~l~~gd~i~i 487 (529) +.++.+....|+.++..-.|.-|=--++..+|+.|++.+++..|+.... T Consensus 5 ~r~~~sfd~lL~~lT~~v~l~~gVR~lyT~~G~~V~~l~~l~~g~~YVa 53 (60) T pfam03607 5 RRRFKSFDALLDDLTEKVKLPTGVRKLYTPDGHRVTSLDELEDGGSYVA 53 (60) T ss_pred CCCCCCHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCHHHHCCCCEEEE T ss_conf 4224899999999984048987715888799999689999627998999 No 499 >COG5140 UFD1 Ubiquitin fusion-degradation protein [Posttranslational modification, protein turnover, chaperones] Probab=36.67 E-value=25 Score=14.15 Aligned_cols=50 Identities=18% Similarity=0.141 Sum_probs=0.0 Q ss_pred HHHCCHHHHHCCCEEEEECCCCCEECCHHHCCCCCEEEEEEECEEEEEEEEECCCCCCCCC Q ss_conf 8716967773146199984898895777892999869999911099999950578887677 Q gi|254780791|r 449 LQSFAYKNTLKRGYTSIQDTNNNFITQKRNLATKTRILINFFDGQANAIVINKAPPKEQPT 509 (529) Q Consensus 449 L~slsP~~~L~RGYaiv~~~~GkiI~s~~~l~~gd~i~i~l~DG~v~a~V~~k~~~~~~~~ 509 (529) |+--+|+.+|+ ..+++-+.|..||.|+|.+.|-.-..+++...|.+.... T Consensus 135 LdI~dpkavLE-----------n~LRNfstLt~~D~iei~ynd~v~~Ik~~~v~pe~S~~~ 184 (331) T COG5140 135 LDIEDPKAVLE-----------NCLRNFSTLTEGDEIEIQYNDEVGSIKFTVVHPEPSANA 184 (331) T ss_pred EECCCHHHHHH-----------HHHHHCCCCCCCCEEEEEECCEEEEEEEEEECCCCCCCE T ss_conf 75168289999-----------987541004368789999778465799999635777654 No 500 >PRK07413 hypothetical protein; Validated Probab=36.63 E-value=25 Score=14.15 Aligned_cols=54 Identities=20% Similarity=0.366 Sum_probs=0.0 Q ss_pred HHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHCCH Q ss_conf 9999998630597-5899972100111103679999999974100357677758999516888444220076 Q gi|254780791|r 159 IRDILQRISCRFP-LRVIIFPVKVQGDECPKEIANAILQLNTLKEGRTCPRPDIIILARGGGSIEDLWHFND 229 (529) Q Consensus 159 ~~D~~~~~~~r~p-~~~~~~p~~vQG~~a~~~i~~ai~~~~~~~~~~~~~~~D~iii~RGGGS~eDL~~FN~ 229 (529) ..|++..|++|-| .+|++ =|.+||++|++.=+....|.. .|-=++.++..+||+ T Consensus 145 ~deVl~~L~~RP~~~eVVl-----TGR~AP~eLIe~ADLVTEMrp------------~~hp~~~~~~~~~~~ 199 (382) T PRK07413 145 VDEVVKTLKSRPEGLEIII-----TGRAAPQSLLDIADLHSEMRA------------HRRPEASEDGVPFNS 199 (382) T ss_pred HHHHHHHHHCCCCCCEEEE-----ECCCCCHHHHHHHCHHHHCCC------------CCCCCCCCCCCCCCC T ss_conf 9999999970999988999-----599999999986233552576------------568865534544578 Done!