Query gi|254780479|ref|YP_003064892.1| A/G-specific adenine glycosylase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 356 No_of_seqs 257 out of 2961 Neff 6.9 Searched_HMMs 39220 Date Sun May 29 17:49:10 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780479.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK10880 adenine DNA glycosyla 100.0 0 0 860.8 28.1 341 5-355 3-349 (350) 2 TIGR01084 mutY A/G-specific ad 100.0 0 0 805.4 15.1 275 7-281 1-284 (297) 3 COG1194 MutY A/G-specific DNA 100.0 0 0 764.3 23.3 330 5-350 8-340 (342) 4 PRK13910 DNA glycosylase MutY; 100.0 0 0 684.3 20.9 285 47-354 1-286 (290) 5 KOG2457 consensus 100.0 0 0 634.0 10.2 338 6-345 91-500 (555) 6 PRK10702 endonuclease III; Pro 100.0 0 0 436.3 16.0 202 12-220 6-208 (211) 7 COG0177 Nth Predicted EndoIII- 100.0 1.6E-39 4.1E-44 298.9 14.6 181 37-220 27-208 (211) 8 TIGR01083 nth endonuclease III 100.0 7.9E-38 2E-42 287.0 10.8 187 3-206 2-192 (192) 9 smart00478 ENDO3c endonuclease 100.0 2.8E-37 7.3E-42 283.0 11.1 148 48-197 1-149 (149) 10 cd00056 ENDO3c endonuclease II 100.0 6.7E-34 1.7E-38 259.3 11.2 154 40-195 1-158 (158) 11 KOG1921 consensus 99.9 6E-25 1.5E-29 196.3 8.4 181 39-220 78-262 (286) 12 pfam00730 HhH-GPD HhH-GPD supe 99.9 8.8E-23 2.2E-27 181.0 9.8 122 44-165 1-130 (144) 13 cd03431 DNA_Glycosylase_C DNA 99.8 2E-20 5E-25 164.4 10.7 112 239-350 3-118 (118) 14 PRK13913 3-methyladenine DNA g 99.6 2.5E-15 6.4E-20 128.5 10.0 141 37-181 28-180 (218) 15 COG2231 Uncharacterized protei 99.6 1.3E-14 3.4E-19 123.3 12.6 178 38-219 29-214 (215) 16 PRK08999 hypothetical protein; 99.6 4.7E-15 1.2E-19 126.5 9.4 118 237-354 4-131 (312) 17 PRK10776 nucleoside triphospha 99.6 1.4E-14 3.5E-19 123.2 9.5 113 238-350 4-126 (129) 18 cd03425 MutT_pyrophosphohydrol 99.5 5.2E-14 1.3E-18 119.2 9.5 111 240-350 3-123 (124) 19 PRK10546 pyrimidine (deoxy)nuc 99.5 4.1E-13 1E-17 112.9 9.5 110 240-350 6-125 (135) 20 TIGR00588 ogg 8-oxoguanine DNA 99.1 8.4E-11 2.1E-15 96.6 4.8 123 37-161 146-318 (379) 21 KOG2875 consensus 99.1 1.2E-10 3.1E-15 95.5 5.5 115 37-155 115-256 (323) 22 COG0122 AlkA 3-methyladenine D 99.1 3.4E-09 8.8E-14 85.2 11.8 164 11-181 78-261 (285) 23 TIGR00586 mutt mutator mutT pr 98.8 4.9E-09 1.3E-13 84.1 4.3 116 236-351 3-149 (150) 24 cd03427 MTH1 MutT homolog-1 (M 98.7 1.5E-07 3.8E-12 73.7 8.3 108 242-350 4-124 (137) 25 PRK10308 3-methyl-adenine DNA 98.6 3.9E-07 9.8E-12 70.8 9.6 136 32-180 106-262 (283) 26 cd04691 Nudix_Hydrolase_32 Mem 98.6 2.6E-07 6.7E-12 72.0 7.5 93 241-335 3-106 (117) 27 cd04696 Nudix_Hydrolase_37 Mem 98.5 8.3E-07 2.1E-11 68.5 8.6 103 241-348 5-125 (125) 28 cd04699 Nudix_Hydrolase_39 Mem 98.5 8.9E-07 2.3E-11 68.2 8.2 102 240-341 3-118 (129) 29 cd04667 Nudix_Hydrolase_10 Mem 98.4 2.2E-06 5.5E-11 65.5 8.7 101 242-347 3-111 (112) 30 cd04681 Nudix_Hydrolase_22 Mem 98.3 5.4E-06 1.4E-10 62.7 9.3 105 243-349 6-128 (130) 31 cd04683 Nudix_Hydrolase_24 Mem 98.1 1.9E-05 4.8E-10 58.9 7.9 100 241-340 2-117 (120) 32 cd04664 Nudix_Hydrolase_7 Memb 98.1 6.2E-05 1.6E-09 55.2 10.4 102 245-348 10-129 (129) 33 cd04673 Nudix_Hydrolase_15 Mem 98.1 1.8E-05 4.7E-10 59.0 7.3 102 242-344 3-121 (122) 34 cd04511 Nudix_Hydrolase_4 Memb 98.0 2.5E-05 6.5E-10 58.0 7.8 104 242-347 16-130 (130) 35 cd04690 Nudix_Hydrolase_31 Mem 98.0 1.8E-05 4.6E-10 59.0 6.7 96 241-340 3-113 (118) 36 cd04676 Nudix_Hydrolase_17 Mem 98.0 3E-05 7.6E-10 57.5 7.8 105 240-348 4-128 (129) 37 cd04682 Nudix_Hydrolase_23 Mem 98.0 2.7E-05 6.8E-10 57.8 7.3 97 243-339 5-116 (122) 38 cd04677 Nudix_Hydrolase_18 Mem 98.0 5.5E-05 1.4E-09 55.6 8.4 105 239-348 8-132 (132) 39 cd04684 Nudix_Hydrolase_25 Con 97.9 0.00013 3.3E-09 53.0 9.8 104 243-347 4-127 (128) 40 cd04679 Nudix_Hydrolase_20 Mem 97.9 8E-05 2E-09 54.5 8.3 106 240-347 4-124 (125) 41 cd03430 GDPMH GDP-mannose glyc 97.9 0.00017 4.3E-09 52.2 9.1 97 240-339 14-134 (144) 42 cd04678 Nudix_Hydrolase_19 Mem 97.8 0.00014 3.5E-09 52.8 8.6 108 240-348 4-129 (129) 43 cd02883 Nudix_Hydrolase Nudix 97.8 0.00025 6.3E-09 51.0 9.7 104 242-347 4-122 (123) 44 cd04670 Nudix_Hydrolase_12 Mem 97.8 0.0001 2.6E-09 53.8 7.5 107 239-347 3-123 (127) 45 cd03674 Nudix_Hydrolase_1 Memb 97.8 0.00017 4.4E-09 52.1 8.1 104 241-349 6-135 (138) 46 cd04695 Nudix_Hydrolase_36 Mem 97.8 0.00033 8.4E-09 50.1 9.3 107 241-350 5-127 (131) 47 cd04672 Nudix_Hydrolase_14 Mem 97.7 8.8E-05 2.3E-09 54.2 5.9 102 242-348 6-123 (123) 48 cd04669 Nudix_Hydrolase_11 Mem 97.7 0.00016 4E-09 52.5 6.9 97 241-339 3-116 (121) 49 cd04700 DR1025_like DR1025 fro 97.7 0.00023 5.9E-09 51.2 7.8 97 241-338 16-126 (142) 50 cd04671 Nudix_Hydrolase_13 Mem 97.6 0.00023 6E-09 51.2 7.2 106 239-350 1-120 (123) 51 pfam00293 NUDIX NUDIX domain. 97.6 0.00033 8.5E-09 50.1 7.4 107 242-348 6-128 (133) 52 cd03673 Ap6A_hydrolase Diadeno 97.6 0.0016 4E-08 45.4 10.5 99 248-350 14-129 (131) 53 cd04697 Nudix_Hydrolase_38 Mem 97.5 0.00055 1.4E-08 48.6 7.9 97 240-337 3-112 (126) 54 cd04680 Nudix_Hydrolase_21 Mem 97.5 0.0005 1.3E-08 48.9 7.6 104 241-349 3-119 (120) 55 cd04687 Nudix_Hydrolase_28 Mem 97.5 0.00076 1.9E-08 47.6 8.5 96 241-339 4-123 (128) 56 cd04688 Nudix_Hydrolase_29 Mem 97.5 0.0011 2.8E-08 46.5 9.1 95 242-341 5-123 (126) 57 cd04689 Nudix_Hydrolase_30 Mem 97.5 0.002 5E-08 44.7 10.4 103 242-350 4-125 (125) 58 pfam00633 HHH Helix-hairpin-he 97.5 2.9E-05 7.5E-10 57.5 1.0 30 108-137 1-30 (30) 59 cd04693 Nudix_Hydrolase_34 Mem 97.4 0.0006 1.5E-08 48.3 6.7 94 243-337 5-113 (127) 60 KOG1918 consensus 97.3 0.0011 2.7E-08 46.5 7.4 148 37-194 73-242 (254) 61 cd04694 Nudix_Hydrolase_35 Mem 97.2 0.0012 3.1E-08 46.2 6.1 44 240-283 3-47 (143) 62 PRK09438 ntpA dATP pyrophospho 97.1 0.0022 5.7E-08 44.3 6.8 106 242-352 12-145 (148) 63 cd03675 Nudix_Hydrolase_2 Cont 97.1 0.006 1.5E-07 41.2 8.8 105 242-348 3-125 (134) 64 cd03429 NADH_pyrophosphatase N 96.8 0.0086 2.2E-07 40.2 7.6 92 243-336 5-106 (131) 65 cd03671 Ap4A_hydrolase_plant_l 96.7 0.0096 2.4E-07 39.8 7.2 43 240-285 5-47 (147) 66 cd03428 Ap4A_hydrolase_human_l 96.6 0.022 5.7E-07 37.2 8.7 96 249-350 16-128 (130) 67 cd03426 CoAse Coenzyme A pyrop 96.5 0.012 2.9E-07 39.3 6.6 87 249-335 15-116 (157) 68 PRK03759 isopentenyl-diphospha 96.3 0.048 1.2E-06 34.9 8.7 101 235-337 32-152 (184) 69 cd02885 IPP_Isomerase Isopente 96.2 0.062 1.6E-06 34.1 9.0 101 236-337 29-148 (165) 70 PRK01229 N-glycosylase/DNA lya 96.1 0.0075 1.9E-07 40.6 4.1 115 40-162 34-163 (208) 71 cd04692 Nudix_Hydrolase_33 Mem 96.1 0.063 1.6E-06 34.1 8.8 99 238-337 3-127 (144) 72 TIGR03252 uncharacterized HhH- 96.0 0.033 8.4E-07 36.0 7.0 103 35-137 14-134 (177) 73 PRK00714 dinucleoside polyphos 95.9 0.026 6.6E-07 36.8 6.1 107 239-349 9-150 (156) 74 smart00525 FES FES domain. iro 95.7 0.0032 8.2E-08 43.2 0.8 23 198-220 1-23 (26) 75 cd04686 Nudix_Hydrolase_27 Mem 95.5 0.11 2.9E-06 32.3 7.9 89 243-336 5-118 (131) 76 PRK10707 hypothetical protein; 95.2 0.16 4E-06 31.3 7.8 99 237-335 29-144 (190) 77 cd04685 Nudix_Hydrolase_26 Mem 95.1 0.13 3.3E-06 31.9 7.3 96 243-338 5-124 (133) 78 PRK05379 bifunctional nicotina 95.0 0.11 2.9E-06 32.3 6.7 109 239-349 203-335 (340) 79 COG1051 ADP-ribose pyrophospha 95.0 0.37 9.3E-06 28.7 9.2 107 241-348 12-135 (145) 80 cd04674 Nudix_Hydrolase_16 Mem 94.7 0.11 2.9E-06 32.3 6.1 44 244-288 9-52 (118) 81 cd04665 Nudix_Hydrolase_8 Memb 94.5 0.18 4.5E-06 30.9 6.7 48 241-293 2-49 (118) 82 pfam05559 DUF763 Protein of un 94.5 0.041 1E-06 35.4 3.4 34 112-145 263-299 (319) 83 cd03672 Dcp2p mRNA decapping e 94.2 0.24 6.2E-06 30.0 6.9 89 243-336 6-110 (145) 84 pfam10576 EndIII_4Fe-2S Iron-s 93.4 0.025 6.3E-07 36.9 0.5 21 199-219 1-21 (26) 85 PRK00241 nudC NADH pyrophospha 93.0 0.81 2.1E-05 26.3 7.8 115 223-349 126-252 (257) 86 cd04661 MRP_L46 Mitochondrial 93.0 0.39 9.9E-06 28.5 6.2 82 264-345 25-128 (132) 87 COG1059 Thermostable 8-oxoguan 92.4 0.22 5.7E-06 30.2 4.3 90 72-164 66-167 (210) 88 COG2816 NPY1 NTP pyrophosphohy 92.3 1.1 2.8E-05 25.4 7.9 152 170-339 82-253 (279) 89 PRK13901 ruvA Holliday junctio 91.6 0.078 2E-06 33.4 1.3 76 117-195 71-155 (196) 90 COG1555 ComEA DNA uptake prote 91.0 0.093 2.4E-06 32.9 1.1 24 115-138 94-117 (149) 91 TIGR02705 nudix_YtkD nucleosid 90.9 0.36 9.1E-06 28.8 4.1 78 240-322 26-110 (158) 92 cd03424 ADPRase_NUDT5 ADP-ribo 90.5 1.1 2.7E-05 25.4 6.2 93 243-337 7-115 (137) 93 KOG3069 consensus 90.3 0.77 2E-05 26.4 5.3 88 249-336 57-162 (246) 94 TIGR00575 dnlj DNA ligase, NAD 90.2 0.46 1.2E-05 28.0 4.1 22 116-137 562-583 (706) 95 PRK00024 radC DNA repair prote 90.0 0.11 2.8E-06 32.4 0.8 70 62-138 14-86 (224) 96 COG1415 Uncharacterized conser 88.6 0.21 5.4E-06 30.4 1.4 35 113-147 273-310 (373) 97 TIGR01259 comE comEA protein; 88.6 0.15 3.8E-06 31.5 0.6 48 75-130 67-114 (124) 98 COG1443 Idi Isopentenyldiphosp 87.1 1.4 3.5E-05 24.6 4.9 99 237-337 33-153 (185) 99 TIGR00084 ruvA Holliday juncti 86.3 0.23 5.8E-06 30.1 0.5 23 115-137 115-137 (217) 100 TIGR00426 TIGR00426 competence 85.1 0.64 1.6E-05 27.0 2.3 50 74-132 11-62 (70) 101 TIGR00615 recR recombination p 84.9 0.3 7.6E-06 29.3 0.5 21 115-135 9-29 (205) 102 PRK13766 Hef nuclease; Provisi 84.7 1.6 4E-05 24.3 4.1 30 238-267 467-500 (764) 103 PRK11762 nudE ADP-ribose dipho 84.3 3.6 9.2E-05 21.7 6.7 92 244-337 53-159 (185) 104 PRK13266 Thf1-like protein; Re 84.0 3.2 8.1E-05 22.1 5.4 127 6-160 5-145 (224) 105 TIGR00084 ruvA Holliday juncti 83.8 0.48 1.2E-05 27.9 1.2 59 117-178 82-143 (217) 106 TIGR00593 pola DNA polymerase 83.5 0.46 1.2E-05 28.0 1.0 45 101-147 165-220 (1005) 107 PRK00116 ruvA Holliday junctio 82.1 0.65 1.7E-05 26.9 1.3 50 83-136 75-126 (198) 108 pfam11731 Cdd1 Pathogenicity l 81.7 0.61 1.5E-05 27.1 1.0 42 116-157 10-51 (92) 109 PRK00254 ski2-like helicase; P 81.6 1.7 4.3E-05 24.0 3.3 20 8-27 8-29 (717) 110 PRK08097 ligB NAD-dependent DN 81.5 0.68 1.7E-05 26.8 1.2 20 66-86 95-114 (563) 111 TIGR03060 PS_II_psb29 photosys 81.3 3.7 9.5E-05 21.6 5.0 126 6-160 5-144 (214) 112 KOG2841 consensus 81.1 2.7 6.9E-05 22.6 4.2 21 330-350 225-247 (254) 113 TIGR01259 comE comEA protein; 81.0 0.85 2.2E-05 26.1 1.6 24 114-137 68-91 (124) 114 PRK00116 ruvA Holliday junctio 80.2 0.58 1.5E-05 27.3 0.5 58 117-177 72-131 (198) 115 COG0632 RuvA Holliday junction 80.0 0.41 1E-05 28.3 -0.3 44 117-161 72-117 (201) 116 PRK07456 consensus 79.9 1 2.6E-05 25.5 1.7 34 102-136 179-223 (975) 117 PRK00076 recR recombination pr 79.8 0.77 2E-05 26.4 1.0 24 113-136 2-29 (197) 118 PRK06887 consensus 79.4 1.1 2.8E-05 25.3 1.7 35 101-136 161-206 (954) 119 TIGR01816 sdhA_forward succina 79.0 0.43 1.1E-05 28.2 -0.5 18 62-79 271-289 (615) 120 PRK08609 hypothetical protein; 79.0 3.1 7.8E-05 22.2 3.9 24 112-135 82-105 (570) 121 PRK05929 consensus 78.3 1.2 3.1E-05 25.1 1.6 33 103-136 160-203 (870) 122 PRK07625 consensus 78.1 1.3 3.3E-05 24.8 1.8 35 101-136 160-205 (922) 123 PRK01172 ski2-like helicase; P 77.7 1.7 4.2E-05 24.1 2.2 45 8-52 8-61 (674) 124 PRK08076 consensus 77.6 1.4 3.5E-05 24.6 1.8 34 102-136 162-206 (877) 125 smart00278 HhH1 Helix-hairpin- 77.3 0.54 1.4E-05 27.5 -0.4 23 118-140 1-23 (26) 126 PRK07956 ligA NAD-dependent DN 77.1 2.4 6.1E-05 22.9 2.9 13 342-354 506-518 (668) 127 PRK13844 recombination protein 77.0 1 2.6E-05 25.5 1.0 24 112-135 5-32 (200) 128 PRK07556 consensus 76.9 1.3 3.3E-05 24.8 1.5 33 103-136 169-212 (977) 129 PRK12911 bifunctional preprote 76.8 0.71 1.8E-05 26.7 0.1 30 118-157 958-987 (1405) 130 cd00141 NT_POLXc Nucleotidyltr 76.8 2.2 5.6E-05 23.2 2.6 26 111-136 78-103 (307) 131 PRK08835 consensus 76.6 1.5 3.7E-05 24.4 1.7 51 101-161 161-222 (931) 132 COG0353 RecR Recombinational D 76.4 0.91 2.3E-05 25.9 0.6 20 116-135 10-29 (198) 133 PRK07997 consensus 76.3 1.5 3.9E-05 24.3 1.7 35 101-136 161-206 (928) 134 PRK07898 consensus 76.3 1.5 3.9E-05 24.3 1.7 35 101-136 175-220 (902) 135 PRK08786 consensus 76.3 1.4 3.7E-05 24.5 1.6 34 102-136 157-201 (927) 136 PRK08928 consensus 75.4 1.7 4.3E-05 24.0 1.7 51 101-161 159-220 (861) 137 PRK05797 consensus 75.0 1.6 4E-05 24.2 1.5 34 102-136 162-206 (869) 138 PRK08434 consensus 74.7 1.7 4.4E-05 23.9 1.7 49 103-161 159-218 (887) 139 TIGR02789 nickel_nikB nickel A 74.6 0.54 1.4E-05 27.5 -1.0 38 107-150 245-282 (315) 140 TIGR02236 recomb_radA DNA repa 74.4 0.54 1.4E-05 27.5 -1.0 57 68-128 21-77 (333) 141 PRK13901 ruvA Holliday junctio 74.3 1.2 3E-05 25.1 0.7 98 39-140 25-129 (196) 142 PRK07300 consensus 74.2 1.9 4.7E-05 23.7 1.7 33 103-136 171-214 (880) 143 smart00483 POLXc DNA polymeras 73.9 5.5 0.00014 20.4 4.1 25 111-135 82-106 (334) 144 PRK02362 ski2-like helicase; P 71.5 3.1 7.8E-05 22.2 2.3 18 10-27 10-29 (736) 145 PRK05755 DNA polymerase I; Pro 71.2 2.5 6.4E-05 22.8 1.8 35 101-136 159-204 (889) 146 PRK07945 hypothetical protein; 70.9 0.9 2.3E-05 25.9 -0.5 61 94-162 23-85 (335) 147 KOG2534 consensus 70.8 3.7 9.4E-05 21.6 2.6 67 94-162 34-107 (353) 148 COG1796 POL4 DNA polymerase IV 70.6 2 5.1E-05 23.5 1.2 21 252-272 201-221 (326) 149 pfam02371 Transposase_20 Trans 70.4 1.6 4E-05 24.2 0.6 41 118-162 2-42 (87) 150 COG2003 RadC DNA repair protei 70.0 2.3 5.9E-05 23.1 1.4 62 71-137 21-85 (224) 151 TIGR01129 secD protein-export 69.2 1.7 4.3E-05 24.0 0.6 12 137-148 118-129 (522) 152 COG3547 Transposase and inacti 68.0 5.4 0.00014 20.5 2.9 37 97-135 169-205 (303) 153 pfam01601 Corona_S2 Coronaviru 66.5 3 7.7E-05 22.2 1.4 75 77-160 149-239 (609) 154 pfam03686 UPF0146 Uncharacteri 66.3 3.5 9E-05 21.7 1.7 92 100-199 3-95 (127) 155 TIGR02150 IPP_isom_1 isopenten 66.0 5 0.00013 20.7 2.4 32 236-268 29-61 (190) 156 COG0494 MutT NTP pyrophosphohy 65.7 5.9 0.00015 20.2 2.8 28 250-280 24-51 (161) 157 KOG3084 consensus 65.6 11 0.00028 18.3 4.2 93 242-336 191-297 (345) 158 TIGR03374 ABALDH 1-pyrroline d 64.5 9.9 0.00025 18.6 3.7 76 81-156 49-136 (472) 159 cd01701 Pol_zeta Pol_zeta, a m 62.0 13 0.00032 17.8 4.0 88 68-159 164-265 (405) 160 PRK04148 hypothetical protein; 61.9 8.5 0.00022 19.1 3.0 92 100-199 3-102 (135) 161 pfam01367 5_3_exonuc 5'-3' exo 61.4 3.2 8.2E-05 22.0 0.8 34 115-157 15-48 (100) 162 TIGR02757 TIGR02757 conserved 61.0 3.7 9.3E-05 21.6 1.0 46 141-187 210-260 (269) 163 KOG3690 consensus 60.3 14 0.00035 17.6 6.3 97 93-205 43-148 (646) 164 KOG3911 consensus 59.4 4.8 0.00012 20.8 1.3 29 63-92 29-58 (378) 165 COG4649 Uncharacterized protei 59.2 14 0.00036 17.5 3.7 74 121-194 124-213 (221) 166 PRK09482 xni exonuclease IX; P 58.4 3.3 8.3E-05 22.0 0.4 12 124-135 188-199 (256) 167 COG1255 Uncharacterized protei 58.1 11 0.00028 18.3 3.0 99 102-212 5-104 (129) 168 smart00475 53EXOc 5'-3' exonuc 57.9 5.5 0.00014 20.4 1.5 33 103-136 161-204 (259) 169 COG5071 RPN5 26S proteasome re 56.9 9.4 0.00024 18.8 2.5 23 58-80 7-29 (439) 170 PRK13482 DNA integrity scannin 56.4 16 0.0004 17.2 4.4 23 137-159 268-294 (352) 171 COG1491 Predicted RNA-binding 54.4 5.9 0.00015 20.2 1.1 84 76-163 74-171 (202) 172 pfam09674 DUF2400 Protein of u 50.8 11 0.00027 18.4 2.0 19 143-162 174-192 (230) 173 pfam05597 Phasin Poly(hydroxya 50.6 19 0.00049 16.6 3.3 82 74-161 11-106 (132) 174 KOG4368 consensus 50.0 20 0.0005 16.5 3.4 39 66-104 196-236 (757) 175 pfam04919 DUF655 Protein of un 49.5 7.6 0.00019 19.4 1.1 54 107-164 103-158 (181) 176 PRK04301 radA DNA repair and r 49.4 7.5 0.00019 19.5 1.0 40 66-107 26-65 (318) 177 pfam02889 Sec63 Sec63 Brl doma 49.1 20 0.00052 16.4 3.7 48 85-136 119-166 (309) 178 TIGR03029 EpsG chain length de 47.8 13 0.00033 17.8 2.0 41 123-163 112-156 (274) 179 PRK05812 secD preprotein trans 47.7 7.1 0.00018 19.6 0.7 20 145-164 144-165 (513) 180 COG4277 Predicted DNA-binding 47.5 6.3 0.00016 20.0 0.4 45 76-122 83-153 (404) 181 pfam11264 ThylakoidFormat Thyl 47.1 22 0.00055 16.2 4.8 119 13-160 7-141 (216) 182 TIGR02153 gatD_arch glutamyl-t 47.1 7.6 0.00019 19.4 0.8 91 56-168 87-191 (413) 183 pfam05997 Nop52 Nucleolar prot 47.0 16 0.00042 17.1 2.4 50 52-108 16-70 (212) 184 COG1561 Uncharacterized stress 47.0 13 0.00032 17.9 1.9 35 94-128 84-118 (290) 185 KOG4548 consensus 46.5 22 0.00057 16.1 3.8 90 249-340 138-250 (263) 186 pfam04669 DUF579 Protein of un 46.4 22 0.00057 16.1 4.8 98 60-163 49-176 (187) 187 pfam04904 NCD1 NAB conserved r 45.1 23 0.00059 16.0 3.8 56 53-108 17-76 (82) 188 KOG0135 consensus 44.8 24 0.0006 16.0 6.2 99 61-160 477-585 (661) 189 COG0272 Lig NAD-dependent DNA 44.5 17 0.00043 17.0 2.2 16 71-86 82-97 (667) 190 TIGR01954 nusA_Cterm_rpt trans 44.5 21 0.00055 16.2 2.7 38 66-105 12-51 (52) 191 CHL00137 rps13 ribosomal prote 44.2 10 0.00026 18.6 1.0 28 111-138 7-37 (122) 192 pfam10343 DUF2419 Protein of u 44.0 21 0.00055 16.2 2.7 41 74-114 69-115 (282) 193 COG0258 Exo 5'-3' exonuclease 43.8 15 0.00038 17.4 1.8 34 102-135 171-215 (310) 194 PTZ00221 cyclophilin; Provisio 43.7 13 0.00033 17.8 1.5 58 69-129 20-98 (249) 195 PRK05179 rpsM 30S ribosomal pr 43.0 11 0.00029 18.2 1.2 27 112-138 8-37 (122) 196 cd04666 Nudix_Hydrolase_9 Memb 42.2 26 0.00065 15.7 8.8 83 250-336 15-114 (122) 197 pfam05516 consensus 42.1 22 0.00055 16.2 2.5 55 59-124 47-102 (136) 198 PRK13023 bifunctional preprote 42.1 11 0.00028 18.3 0.9 20 145-164 156-177 (840) 199 PRK04053 rps13p 30S ribosomal 42.0 12 0.0003 18.0 1.1 47 97-150 5-51 (149) 200 TIGR02111 PQQ_syn_pqqC coenzym 42.0 26 0.00065 15.7 2.8 76 36-128 20-101 (239) 201 COG1031 Uncharacterized Fe-S o 41.0 13 0.00034 17.7 1.2 30 111-147 253-283 (560) 202 pfam05166 YcgL YcgL domain. Th 41.0 16 0.0004 17.2 1.6 21 72-92 48-68 (74) 203 PRK13024 bifunctional preprote 40.9 11 0.00029 18.2 0.9 31 118-158 309-339 (741) 204 PRK02406 DNA polymerase IV; Va 40.7 27 0.00069 15.5 4.7 132 66-201 118-277 (355) 205 TIGR01448 recD_rel helicase, R 40.7 10 0.00026 18.6 0.6 17 119-135 195-212 (769) 206 pfam09735 Nckap1 Membrane-asso 40.3 27 0.0007 15.5 3.6 17 8-24 368-384 (1118) 207 cd01700 Pol_V Pol V was discov 40.2 27 0.0007 15.5 2.9 69 70-141 112-198 (344) 208 PRK03137 1-pyrroline-5-carboxy 40.0 28 0.00071 15.5 4.9 68 83-150 86-164 (514) 209 PRK01151 rps17E 30S ribosomal 39.8 15 0.00039 17.2 1.4 29 94-122 4-32 (58) 210 cd03586 Pol_IV_kappa Pol_IV_ka 39.2 28 0.00072 15.4 3.3 82 66-151 110-205 (337) 211 cd00008 53EXOc 5'-3' exonuclea 38.9 15 0.00038 17.4 1.2 43 107-159 162-215 (240) 212 PTZ00134 40S ribosomal protein 38.8 9.9 0.00025 18.6 0.3 30 109-138 18-50 (154) 213 KOG3835 consensus 38.1 30 0.00075 15.3 4.2 81 53-133 19-119 (495) 214 TIGR03631 bact_S13 30S ribosom 37.9 12 0.00031 17.9 0.7 27 112-138 6-35 (113) 215 pfam00113 Enolase_C Enolase, C 37.8 6.5 0.00017 19.9 -0.8 29 106-140 66-96 (296) 216 pfam11372 DUF3173 Protein of u 37.6 20 0.00051 16.5 1.7 25 78-106 5-30 (59) 217 pfam08887 GAD-like GAD-like do 37.4 25 0.00064 15.8 2.2 13 60-72 3-15 (109) 218 cd00080 HhH2_motif Helix-hairp 37.1 8.2 0.00021 19.2 -0.3 19 118-136 22-40 (75) 219 PRK11820 hypothetical protein; 37.0 28 0.00071 15.4 2.4 34 94-127 84-117 (288) 220 TIGR03629 arch_S13P archaeal r 36.3 8.9 0.00023 18.9 -0.3 30 109-138 9-41 (144) 221 cd00424 Pol_Y Y-family of DNA 35.9 18 0.00047 16.7 1.3 89 67-159 111-214 (341) 222 COG0529 CysC Adenylylsulfate k 35.5 22 0.00055 16.2 1.6 42 120-161 28-73 (197) 223 PRK11241 gabD succinate-semial 35.5 32 0.00082 15.0 3.3 56 82-137 60-126 (482) 224 pfam03755 YicC_N YicC-like fam 35.3 30 0.00078 15.2 2.3 34 94-127 83-116 (159) 225 LOAD_Hrd consensus 35.1 11 0.00027 18.4 -0.0 21 112-132 41-61 (77) 226 COG1084 Predicted GTPase [Gene 34.9 33 0.00084 14.9 2.8 84 46-144 47-139 (346) 227 KOG2379 consensus 34.9 27 0.00069 15.5 2.0 36 97-135 38-74 (501) 228 KOG2705 consensus 34.8 20 0.00051 16.5 1.3 28 107-139 193-220 (471) 229 pfam00416 Ribosomal_S13 Riboso 34.7 12 0.00031 18.0 0.2 27 112-138 6-35 (106) 230 pfam02467 Whib Transcription f 34.3 12 0.00031 18.0 0.1 19 203-221 29-47 (66) 231 PRK10729 nudF ADP-ribose pyrop 34.3 34 0.00086 14.9 7.1 76 241-316 51-139 (202) 232 PRK02515 psbU photosystem II c 33.7 34 0.00088 14.8 5.0 80 99-189 51-133 (144) 233 cd07092 ALDH_ABALDH-YdcW Esche 33.6 31 0.0008 15.1 2.2 22 85-106 34-55 (450) 234 cd07093 ALDH_F8_HMSADH Human a 33.5 32 0.00081 15.1 2.2 24 84-107 33-56 (455) 235 smart00341 HRDC Helicase and R 33.5 13 0.00033 17.8 0.2 21 112-132 41-61 (81) 236 cd07090 ALDH_F9_TMBADH NAD+-de 33.0 34 0.00088 14.8 2.3 57 82-138 31-98 (457) 237 TIGR01862 N2-ase-Ialpha nitrog 32.2 5.5 0.00014 20.4 -1.9 114 82-202 231-362 (510) 238 pfam00570 HRDC HRDC domain. Th 31.8 13 0.00034 17.7 0.0 21 112-132 38-58 (68) 239 smart00279 HhH2 Helix-hairpin- 31.7 15 0.00038 17.3 0.3 19 118-136 16-34 (36) 240 PRK09407 gabD2 succinic semial 31.6 37 0.00095 14.6 2.5 55 83-137 66-131 (523) 241 PTZ00081 enolase (2-phospho-D- 31.6 8.9 0.00023 18.9 -0.9 12 147-158 132-143 (442) 242 KOG0648 consensus 31.5 37 0.00095 14.6 2.6 47 237-283 114-161 (295) 243 PHA00619 CRISPR-associated Cas 31.5 25 0.00064 15.8 1.4 12 205-216 196-207 (210) 244 KOG2171 consensus 31.5 37 0.00095 14.5 5.2 16 206-221 584-599 (1075) 245 PRK03352 DNA polymerase IV; Va 31.3 32 0.00082 15.0 1.9 86 69-159 118-217 (345) 246 TIGR00372 cas4 CRISPR-associat 30.5 18 0.00045 16.8 0.5 16 202-217 191-206 (206) 247 pfam11798 IMS_HHH IMS family H 30.5 14 0.00036 17.6 -0.0 16 120-135 14-29 (33) 248 KOG4233 consensus 30.4 19 0.00049 16.6 0.6 47 169-218 42-89 (90) 249 pfam10273 WGG Pre-rRNA-process 30.4 39 0.00099 14.4 2.8 12 5-16 33-44 (82) 250 PRK13698 plasmid-partitioning 30.4 39 0.00099 14.4 5.1 50 122-180 102-157 (323) 251 TIGR00658 orni_carb_tr ornithi 30.2 33 0.00085 14.9 1.9 50 140-194 85-142 (341) 252 PTZ00205 DNA polymerase kappa; 30.1 29 0.00073 15.3 1.5 19 120-138 311-329 (571) 253 TIGR02545 ATP_syn_fliI flagell 29.5 26 0.00067 15.6 1.2 15 145-161 331-345 (439) 254 PRK13473 gamma-aminobutyraldeh 29.5 40 0.001 14.3 4.1 56 83-138 52-119 (475) 255 KOG1338 consensus 29.3 40 0.001 14.3 4.5 93 47-144 96-196 (466) 256 cd03312 CIMS_N_terminal_like C 28.8 11 0.00029 18.2 -0.7 15 101-115 108-122 (360) 257 TIGR02535 hyp_Hser_kinase prop 28.6 3.4 8.6E-05 21.9 -3.5 23 178-200 200-223 (431) 258 cd01703 Pol_iota Pol iota is m 28.5 23 0.00059 16.0 0.8 53 85-138 163-230 (394) 259 cd07108 ALDH_MGR_2402 Magnetos 28.2 39 0.00099 14.4 1.9 53 83-135 32-96 (457) 260 TIGR02061 aprA adenylylsulfate 27.9 33 0.00083 14.9 1.5 24 62-87 562-585 (651) 261 PTZ00154 40S ribosomal protein 27.7 34 0.00086 14.8 1.5 29 94-122 5-33 (130) 262 KOG2966 consensus 27.5 22 0.00056 16.2 0.5 11 190-200 149-159 (325) 263 TIGR02418 acolac_catab acetola 27.4 17 0.00043 17.0 -0.1 30 123-152 75-104 (553) 264 cd07146 ALDH_PhpJ Streptomyces 27.3 38 0.00098 14.5 1.8 26 83-108 31-56 (451) 265 COG3600 GepA Uncharacterized p 27.2 28 0.00071 15.4 1.0 70 87-159 31-108 (154) 266 COG1468 CRISPR-associated prot 26.8 23 0.00058 16.0 0.5 17 201-217 172-188 (190) 267 cd07151 ALDH_HBenzADH NADP+-de 26.4 45 0.0012 14.0 3.3 56 83-138 45-111 (465) 268 pfam03118 RNA_pol_A_CTD Bacter 26.2 21 0.00054 16.3 0.3 47 87-135 11-57 (62) 269 PRK12308 bifunctional arginino 26.1 44 0.0011 14.0 1.9 23 110-132 202-224 (614) 270 COG1383 RPS17A Ribosomal prote 25.9 43 0.0011 14.1 1.8 29 94-122 5-33 (74) 271 cd07102 ALDH_EDX86601 Uncharac 25.9 46 0.0012 13.9 4.9 25 83-107 31-55 (452) 272 pfam06754 PhnG Phosphonate met 25.8 44 0.0011 14.0 1.8 20 72-91 11-30 (147) 273 PRK13181 hisH imidazole glycer 25.8 15 0.00038 17.4 -0.6 11 121-131 41-51 (199) 274 PRK09457 astD succinylglutamic 25.8 46 0.0012 13.9 2.1 23 84-106 39-61 (475) 275 KOG3771 consensus 25.7 47 0.0012 13.9 4.1 83 1-99 63-145 (460) 276 cd07098 ALDH_F15-22 Aldehyde d 25.5 47 0.0012 13.8 2.2 26 83-108 31-56 (465) 277 TIGR02923 AhaC ATP synthase A1 25.3 46 0.0012 13.9 1.8 19 40-58 47-65 (353) 278 pfam02961 BAF Barrier to autoi 25.2 16 0.00041 17.1 -0.5 74 114-218 15-89 (89) 279 cd04663 Nudix_Hydrolase_6 Memb 25.1 48 0.0012 13.8 4.1 36 250-290 14-49 (126) 280 pfam06424 PRP1_N PRP1 splicing 25.0 16 0.00042 17.1 -0.5 16 115-130 115-130 (132) 281 cd02027 APSK Adenosine 5'-phos 24.8 47 0.0012 13.8 1.8 32 121-152 5-39 (149) 282 cd03740 SOCS_SOCS6 SOCS (suppr 24.7 45 0.0011 14.0 1.7 22 328-349 17-38 (41) 283 cd07116 ALDH_ACDHII-AcoD Ralst 24.6 49 0.0012 13.7 2.0 27 83-109 51-77 (479) 284 cd02037 MRP-like MRP (Multiple 24.3 40 0.001 14.3 1.4 27 124-150 9-38 (169) 285 TIGR02291 rimK_rel_E_lig alpha 24.2 33 0.00084 14.9 0.9 13 259-271 119-131 (320) 286 TIGR02684 dnstrm_HI1420 probab 24.1 50 0.0013 13.7 2.6 30 73-105 24-55 (91) 287 COG3100 Uncharacterized protei 24.0 43 0.0011 14.1 1.5 20 73-92 61-80 (103) 288 COG0099 RpsM Ribosomal protein 24.0 37 0.00093 14.6 1.1 27 112-138 8-37 (121) 289 cd07147 ALDH_F21_RNP123 Aldehy 23.9 50 0.0013 13.6 2.0 21 85-105 36-56 (452) 290 TIGR00608 radc DNA repair prot 23.9 49 0.0013 13.7 1.8 42 97-138 34-83 (223) 291 PRK00558 uvrC excinuclease ABC 23.8 50 0.0013 13.6 3.6 23 187-209 376-398 (609) 292 pfam00286 Flexi_CP Viral coat 23.8 50 0.0013 13.6 3.9 78 80-175 11-90 (140) 293 pfam01583 APS_kinase Adenylyls 23.8 46 0.0012 13.9 1.6 32 121-152 8-42 (157) 294 COG0634 Hpt Hypoxanthine-guani 23.5 51 0.0013 13.6 2.7 106 95-216 18-134 (178) 295 cd07559 ALDH_ACDHII_AcoD-like 23.4 51 0.0013 13.6 2.1 58 81-138 49-118 (480) 296 KOG1362 consensus 23.4 15 0.00038 17.4 -1.0 16 40-55 213-228 (577) 297 TIGR01258 pgm_1 phosphoglycera 23.3 21 0.00053 16.3 -0.2 11 53-63 156-166 (248) 298 cd03313 enolase Enolase: Enola 23.1 12 0.00031 17.9 -1.4 49 86-137 54-109 (408) 299 TIGR02903 spore_lon_C ATP-depe 22.9 37 0.00095 14.6 1.0 30 124-153 185-224 (616) 300 cd07106 ALDH_AldA-AAD23400 Str 22.9 52 0.0013 13.5 2.0 24 84-107 33-56 (446) 301 COG1251 NirB NAD(P)H-nitrite r 22.9 52 0.0013 13.5 2.7 46 108-157 310-355 (793) 302 TIGR01203 HGPRTase hypoxanthin 22.8 52 0.0013 13.5 2.2 61 95-155 9-77 (183) 303 cd07591 BAR_Rvs161p The Bin/Am 22.7 53 0.0013 13.5 3.2 75 1-84 44-124 (224) 304 KOG0142 consensus 22.4 23 0.00059 16.0 -0.1 33 235-268 50-83 (225) 305 cd02011 TPP_PK Thiamine pyroph 22.1 48 0.0012 13.8 1.4 12 203-214 89-100 (227) 306 pfam05405 Mt_ATP-synt_B Mitoch 21.9 55 0.0014 13.4 2.7 121 68-194 35-162 (163) 307 TIGR03371 cellulose_yhjQ cellu 21.9 55 0.0014 13.4 1.8 39 125-163 12-53 (246) 308 KOG4579 consensus 21.8 53 0.0014 13.4 1.6 16 62-77 70-85 (177) 309 PRK00077 eno phosphopyruvate h 21.8 16 0.00041 17.2 -1.1 27 110-136 80-113 (427) 310 TIGR03650 violacein_E violacei 21.6 36 0.00093 14.6 0.7 15 38-52 53-67 (184) 311 TIGR01577 oligosac_amyl oligos 21.6 55 0.0014 13.3 2.0 40 97-136 402-444 (679) 312 cd07131 ALDH_AldH-CAJ73105 Unc 21.5 56 0.0014 13.3 4.3 56 83-138 50-116 (478) 313 COG1168 MalY Bifunctional PLP- 21.5 56 0.0014 13.3 2.6 109 21-158 14-124 (388) 314 pfam10391 DNA_pol_lambd_f Fing 21.4 44 0.0011 14.0 1.1 21 118-138 2-22 (52) 315 TIGR03018 pepcterm_TyrKin exop 21.3 41 0.001 14.2 1.0 41 123-163 44-89 (207) 316 pfam00506 Flu_NP Influenza vir 21.3 56 0.0014 13.3 7.1 105 92-202 156-274 (506) 317 COG4892 Predicted heme/steroid 21.2 30 0.00075 15.3 0.2 35 95-129 41-75 (81) 318 PRK04195 replication factor C 21.2 57 0.0014 13.3 3.1 27 122-148 47-73 (403) 319 pfam08440 Poty_PP Potyviridae 21.0 38 0.00097 14.5 0.7 14 132-145 1-14 (274) 320 COG2355 Zn-dependent dipeptida 20.9 57 0.0015 13.2 2.2 90 113-218 107-200 (313) 321 PRK13153 consensus 20.8 42 0.0011 14.2 0.9 11 121-131 41-51 (203) 322 cd02035 ArsA ArsA ATPase funct 20.8 58 0.0015 13.2 1.7 25 125-149 9-36 (217) 323 COG5133 Uncharacterized conser 20.7 43 0.0011 14.1 1.0 94 116-217 10-122 (181) 324 COG1313 PflX Uncharacterized F 20.6 52 0.0013 13.6 1.3 45 169-213 38-84 (335) 325 PRK09847 gamma-glutamyl-gamma- 20.6 58 0.0015 13.2 2.3 52 86-137 75-138 (494) 326 TIGR01944 rnfB electron transp 20.5 58 0.0015 13.2 2.7 54 107-161 41-96 (213) 327 pfam02764 Diphtheria_T Diphthe 20.1 42 0.0011 14.2 0.8 100 85-199 73-173 (180) 328 TIGR01837 PHA_granule_1 poly(h 20.1 59 0.0015 13.1 1.7 23 139-162 75-97 (121) 329 KOG2573 consensus 20.0 36 0.00091 14.7 0.4 31 57-96 126-157 (498) 330 cd07086 ALDH_F7_AASADH-like NA 20.0 60 0.0015 13.1 4.9 53 83-135 48-111 (478) No 1 >PRK10880 adenine DNA glycosylase; Provisional Probab=100.00 E-value=0 Score=860.79 Aligned_cols=341 Identities=32% Similarity=0.584 Sum_probs=310.5 Q ss_pred HHHHHHHHHHHHHHCCC-CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCHHHCCHHHHHHHHHCCCHHHHHCCCHHHHH Q ss_conf 68999999999996289-98875485544456787046766446731161203326589987748997898729989999 Q gi|254780479|r 5 EHIIQSKILDWYDTNHR-VLPWRTSPKTEKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQKWPTIFCLSSAKDEEIL 83 (356) Q Consensus 5 ~~~~~~~ll~w~~~~~R-~lpwr~~~~~~~~~~~~py~v~vseimlqqT~v~~v~~~~~~~~~~~P~~~~la~a~~~~vl 83 (356) ...|+++||+||+.|+| |||||++ +|||+|||||||||||||+||+|||++||++|||+++||+|++|||| T Consensus 3 ~~~f~~~ll~Wy~~~~r~~lPWr~~--------~~PY~vwvSEiMLQQTqv~tV~~yy~rf~~~fP~~~~LA~A~~~~vl 74 (350) T PRK10880 3 ASQFSAQVLDWYDKYGRKTLPWQID--------KTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVL 74 (350) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCC--------CCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCHHHHH T ss_conf 7999999999998739988998999--------98226999999983487789999999999988399999779999999 Q ss_pred HHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCC Q ss_conf 99740463458899987767543005786873145688888777877667766434963131243146665765197545 Q gi|254780479|r 84 SAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKP 163 (356) Q Consensus 84 ~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~ 163 (356) ++|+||||||||||||++|++|+++|+|.||+|+++|++|||||+||||||+|||||++.++|||||+||++|+|+++.+ T Consensus 75 ~~W~GLGYY~RArnLh~aA~~i~~~~~G~~P~~~~~L~~LPGIG~yTA~AI~siaf~~~~~ivDgNV~RVlsR~~~i~~~ 154 (350) T PRK10880 75 HLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGW 154 (350) T ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHCCCCC T ss_conf 98640693899999999999999975898982599986266887279999999976995357664403288887503368 Q ss_pred C--CHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 7--35689998741025785226789999999986320412356431100000134310110012344322222211232 Q gi|254780479|r 164 A--PLYHKTIKNYARKITSTSRPGDFVQAMMDLGALICTSNKPLCPLCPIQKNCLTFSEGKSHLLGINTIKKKRPMRTGA 241 (356) Q Consensus 164 ~--~~~~k~l~~~~~~~~~~~~~~~~nQAlMdlGa~iC~p~~P~C~~Cpl~~~C~~~~~~~~~~~P~kk~KKkk~~r~~~ 241 (356) . +..++.+|.+++.++|.++++|||||||||||+||||++|+|..|||+..|.||++|.++.||+|++|++++.+... T Consensus 155 ~~~~~~~~~l~~~a~~~~p~~~~~~~nQAlMDLGA~vCtp~~P~C~~CPl~~~C~A~~~~~~~~~P~kk~kk~~p~~~~~ 234 (350) T PRK10880 155 PGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMVCTRSKPKCELCPLQNGCIAAANHSWALYPGKKPKQTLPERTGY 234 (350) T ss_pred CCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCHHHCCCCCCCCCCCEEEEE T ss_conf 87559999999999973884327799999999734102799998788988220646427985655886788988779999 Q ss_pred EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCEEEEEEEEEEEEEEEEEECCCC- Q ss_conf 0003413760577645720132047442111134672158886301010000213735899831599999999980776- Q gi|254780479|r 242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTITHTFTHFTLTLFVWKTIVPQI- 320 (356) Q Consensus 242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~ikH~fTH~~L~i~v~~~~~~~~- 320 (356) +++++++|++||+||+++|+|+||||||+++..+....+.... .........++.|+|+||||+|+|.++.+.+... T Consensus 235 -~~ii~~~~~ill~kR~~~giw~gLw~fP~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~H~FSHf~L~I~p~~~~~~~~~ 312 (350) T PRK10880 235 -FLLLQHEDEVWLAQRPPSGLWGGLYCFPQFADEEELRQWLAQR-GINADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFT 312 (350) T ss_pred -EEEEEECCEEEEEECCCCCHHHHCCCCCCCCCHHHHHHHHHHC-CCCCCCCEECCCEEEEEECEEEEEEEEEEEECCCC T ss_conf -9999509989998379777345445488888677899999982-89810001147625841060767389999814500 Q ss_pred --CCCCCCEEECHHHHHHCCCCHHHHHHHHHCCCCCC Q ss_conf --76678133227888747998789999984482379 Q gi|254780479|r 321 --VIIPDSTWHDAQNLANAALPTVMKKALSAGGIKVP 355 (356) Q Consensus 321 --~~~~~~~Wv~~~el~~~~LPs~~kKIL~alg~~~p 355 (356) ...+++.|++++++.+++||+|++|||+.|....| T Consensus 313 ~~~~~~~~~W~~~~~~~~~GLPaPvkKiL~~L~~~~~ 349 (350) T PRK10880 313 GCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGAP 349 (350) T ss_pred CCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHCCCCC T ss_conf 1355667856444530204787899999997140489 No 2 >TIGR01084 mutY A/G-specific adenine glycosylase; InterPro: IPR005760 The DNA repair enzyme MutY plays an important role in the prevention of DNA mutations resulting from the presence of the oxidatively damaged lesion 7,8-dihydro-8-oxo-2'-deoxyguanosine (8-OxoG). 8-OxoG can mispair with 2'-deoxycytidine 5'-triphosphate or with 2'-deoxyadenosine triphosphate during DNA replication, forming C*8-oxoG and A*8-oxoG mispairs. Human MutY is responsible for recognition and removal of the inappropriately inserted adenine in an A=8-oxoG mispair. If unrepaired, the A=8-oxoG mispairs can result in deleterious C:G to A:T transversions. Multiple forms of mammalian MutY are formed by alternative splicing and locate to the nucleus or mitochondrion, where they have been shown to interact with several other proteins involved in the repair of DNA damage . The HhH-GPD domain within the protein binds the phosphate backbone of the substrate. This family represents bacterial MutY and a limited number of murine homologs. In rat, the Escherichia coli MutY DNA glycosylase homolog (MYH) is induced in response to mitochondrial DNA damage . ; GO: 0019104 DNA N-glycosylase activity, 0006284 base-excision repair, 0005622 intracellular. Probab=100.00 E-value=0 Score=805.38 Aligned_cols=275 Identities=39% Similarity=0.756 Sum_probs=259.0 Q ss_pred HHHHHHHHHHHHCCC-CCCCCCCCC----CCCCCCCCHHHHHHHHHHHHHCCHHHCCH-HHHHHHHHCCCHHHHHCCCHH Q ss_conf 999999999996289-988754855----44456787046766446731161203326-589987748997898729989 Q gi|254780479|r 7 IIQSKILDWYDTNHR-VLPWRTSPK----TEKSSLPSPYKVWISEIMLQQTTVKTVEP-YFKKFMQKWPTIFCLSSAKDE 80 (356) Q Consensus 7 ~~~~~ll~w~~~~~R-~lpwr~~~~----~~~~~~~~py~v~vseimlqqT~v~~v~~-~~~~~~~~~P~~~~la~a~~~ 80 (356) .|++.||.||+++|| +||||.+.. .+..+++|||+|||||||||||||+|||| ||+|||++||||++||+|++| T Consensus 1 ~f~~~ll~Wy~~~gRk~LPWr~~~~~~cdeslkhi~~pY~VW~SEvMLQQTqV~tV~prYf~rFle~FPTv~~LA~A~~d 80 (297) T TIGR01084 1 QFREDLLSWYDKEGRKTLPWRQNKNQRCDESLKHIDDPYRVWVSEVMLQQTQVATVIPRYFERFLERFPTVQALANAPQD 80 (297) T ss_pred CHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEEECCCCCEEEECCCCCCHHHHHHCCCHHHHHCCCHH T ss_conf 93788999998717867887478755424577522354514400211001100112671004766427885787477965 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCC Q ss_conf 99999740463458899987767543005786873145688888777877667766434963131243146665765197 Q gi|254780479|r 81 EILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDI 160 (356) Q Consensus 81 ~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~ 160 (356) |||++|+||||||||||||+||++|+++|||+||.|+++|.+|||||+||||||+|||||+|.|+|||||+|||||+|++ T Consensus 81 eVL~lW~GLGYYaRARNL~kAA~~v~~~fGG~fP~d~~~~~~L~GVG~yTAgAils~a~~~~~p~~DGNV~RVLsR~fA~ 160 (297) T TIGR01084 81 EVLKLWEGLGYYARARNLHKAAQEVVEEFGGEFPQDLEDLKALPGVGRYTAGAILSFAYNKPVPILDGNVKRVLSRLFAV 160 (297) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHH T ss_conf 79998625786788899999999999871881772379785178976217999999872687620154078899999862 Q ss_pred CCCC--CHHHHHHHHHHHCCCC-CCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 5457--3568999874102578-522678999999998632041235643110000013431011001234432222221 Q gi|254780479|r 161 IKPA--PLYHKTIKNYARKITS-TSRPGDFVQAMMDLGALICTSNKPLCPLCPIQKNCLTFSEGKSHLLGINTIKKKRPM 237 (356) Q Consensus 161 ~~~~--~~~~k~l~~~~~~~~~-~~~~~~~nQAlMdlGa~iC~p~~P~C~~Cpl~~~C~~~~~~~~~~~P~kk~KKkk~~ 237 (356) ++++ +..++.||.+++.++| ..++.+||||||||||+||++++|.|+.|||+++|.||+.|..+.||+|++|+..+. T Consensus 161 ~~~~~~k~~e~~l~~~~~~llpe~~~~~~~nqalmDlGA~iC~rk~P~C~~CPl~~~C~A~~~~~~~~yP~kk~~~~~~~ 240 (297) T TIGR01084 161 EGWPGKKKVENRLWELAESLLPEKADPEAFNQALMDLGALICTRKKPKCDLCPLQDFCLAYKQGTPEEYPVKKKKKAPPE 240 (297) T ss_pred CCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHCCCCCCCCCHHH T ss_conf 48988734889999999985886568658888998623610378478545487066555654277111678773100012 Q ss_pred EEEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHH Q ss_conf 12320003413760577645720132047442111134672158 Q gi|254780479|r 238 RTGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGN 281 (356) Q Consensus 238 r~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~ 281 (356) +...+.++.+.+++++|.||+++|||+|||+||.++..++.... T Consensus 241 ~~~~~l~l~~~d~~~~~~~r~~~~l~gGLy~fP~~~~~kG~~~~ 284 (297) T TIGR01084 241 RTTYFLVLLNEDGEVLLEQRPEKGLWGGLYCFPQFELEKGIKEE 284 (297) T ss_pred HHEEEEEEEECCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHH T ss_conf 01012234414870576663766410010036525532013578 No 3 >COG1194 MutY A/G-specific DNA glycosylase [DNA replication, recombination, and repair] Probab=100.00 E-value=0 Score=764.26 Aligned_cols=330 Identities=42% Similarity=0.729 Sum_probs=299.8 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCHHHCCHHHHHHHHHCCCHHHHHCCCHHHHHH Q ss_conf 68999999999996289988754855444567870467664467311612033265899877489978987299899999 Q gi|254780479|r 5 EHIIQSKILDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQKWPTIFCLSSAKDEEILS 84 (356) Q Consensus 5 ~~~~~~~ll~w~~~~~R~lpwr~~~~~~~~~~~~py~v~vseimlqqT~v~~v~~~~~~~~~~~P~~~~la~a~~~~vl~ 84 (356) ...|++.||.||++|+|+||||++ .+||+|||||||||||||++|+|||.+||++|||+++||+|++|||++ T Consensus 8 ~~~~~~~ll~Wy~~~~R~LPWR~~--------~~PY~VwvSEiMLQQT~v~~Vi~yy~~fl~rfPti~~LA~A~~~evl~ 79 (342) T COG1194 8 IEKFQEALLDWYDKNGRDLPWRET--------KDPYRVWVSEIMLQQTQVATVIPYYERFLERFPTIKALAAAPEDEVLK 79 (342) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCC--------CCCCEEHHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHHCCCHHHHHH T ss_conf 578888999999974874887789--------986220268887600607455456999998689989986688889999 Q ss_pred HHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCC Q ss_conf 97404634588999877675430057868731456888887778776677664349631312431466657651975457 Q gi|254780479|r 85 AWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPA 164 (356) Q Consensus 85 ~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~ 164 (356) +|+||||||||||||++|++|+++|+|.||+|.++|.+|||||+|||+||+|||||++.++|||||+||++|+|+++.+. T Consensus 80 ~W~gLGYysRArnL~~~A~~v~~~~~G~~P~~~~~l~~LpGiG~yTa~Ail~~a~~~~~~~lDgNV~RVl~R~f~i~~~~ 159 (342) T COG1194 80 AWEGLGYYSRARNLHKAAQEVVERHGGEFPDDEEELAALPGVGPYTAGAILSFAFNQPEPVLDGNVKRVLSRLFAISGDI 159 (342) T ss_pred HHHHCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHCCCCCEEECCHHEEEHHHHCCCCCC T ss_conf 98716737899999999999999819979999999986789738899999998718987533054221126653143655 Q ss_pred C--HHHHHHHHHHHCCCCCCC-HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 3--568999874102578522-6789999999986320412356431100000134310110012344322222211232 Q gi|254780479|r 165 P--LYHKTIKNYARKITSTSR-PGDFVQAMMDLGALICTSNKPLCPLCPIQKNCLTFSEGKSHLLGINTIKKKRPMRTGA 241 (356) Q Consensus 165 ~--~~~k~l~~~~~~~~~~~~-~~~~nQAlMdlGa~iC~p~~P~C~~Cpl~~~C~~~~~~~~~~~P~kk~KKkk~~r~~~ 241 (356) . ..++.+|.+++.++.+.+ +++||||||||||+||||++|+|..||++..|.+++.|.++.||+|++|++++ +... T Consensus 160 ~~~~~~~~~~~~~~~ll~p~~~~~~fnqammdlGA~ICt~~~P~C~~CPl~~~c~a~~~g~~~~~P~k~~k~~~~-~~~~ 238 (342) T COG1194 160 GKPKTKKELWELAEQLLTPDRRPGDFNQAMMDLGATICTAKKPKCSLCPLRDNCAAYRNGTPEKYPVKKPKKKLP-RRFA 238 (342) T ss_pred CCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCC-HHEE T ss_conf 654105899999997448988767999999986367615899987739466889999819964478766554551-0033 Q ss_pred EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCEEEEEEEEEEEEEEEEEECCCCC Q ss_conf 00034137605776457201320474421111346721588863010100002137358998315999999999807767 Q gi|254780479|r 242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTITHTFTHFTLTLFVWKTIVPQIV 321 (356) Q Consensus 242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~ikH~fTH~~L~i~v~~~~~~~~~ 321 (356) ++++++.+|.+++.||+++|+|+|||+||.++++.............. ..+++++|+||||+|+++ +...... . T Consensus 239 ~~~~~~~~~~~~l~kr~~~gl~~gl~~fP~~e~~~~~~~~~~~~~~~~----~~~~~~~H~fth~~l~i~-~~a~~~~-~ 312 (342) T COG1194 239 AFLILNRDGEVLLEKRPEKGLLGGLWCFPQFEDEADLLDWLAADGLAA----EPLGAFRHTFTHFRLTIE-LRASASL-V 312 (342) T ss_pred EEEEECCCCCHHHHHCCCCCCEECCCCCCCCCCCCHHHHHHHHCCCCC----CCCCCEEEEEEEEEEEEE-EEEECCC-C T ss_conf 577870685316662765672211120564343320466765413210----124312434668999999-9863246-7 Q ss_pred CCCCCEEECHHHHHHCCCCHHHHHHHHHC Q ss_conf 66781332278887479987899999844 Q gi|254780479|r 322 IIPDSTWHDAQNLANAALPTVMKKALSAG 350 (356) Q Consensus 322 ~~~~~~Wv~~~el~~~~LPs~~kKIL~al 350 (356) .. +..|++.++++.+++|++++|+++.+ T Consensus 313 ~~-~~~w~~~~~~~~~~l~~p~~k~l~~~ 340 (342) T COG1194 313 LS-DGRWYNLSDLESIGLPAPVKKLLQQL 340 (342) T ss_pred CC-CCEECCCCCCCCCCCCHHHHHHHHHH T ss_conf 88-73451634443135527999999873 No 4 >PRK13910 DNA glycosylase MutY; Provisional Probab=100.00 E-value=0 Score=684.27 Aligned_cols=285 Identities=30% Similarity=0.503 Sum_probs=248.6 Q ss_pred HHHHHCCHHHCCH-HHHHHHHHCCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH Q ss_conf 6731161203326-589987748997898729989999997404634588999877675430057868731456888887 Q gi|254780479|r 47 IMLQQTTVKTVEP-YFKKFMQKWPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPG 125 (356) Q Consensus 47 imlqqT~v~~v~~-~~~~~~~~~P~~~~la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpG 125 (356) ||||||||+||+| ||++||++|||+++||+|++||||++|+||||||||||||++|++|+++|+|+||+|+++|++||| T Consensus 1 iMLQQTqv~tvip~y~~~f~~~fP~~~~la~a~~~~vl~~W~GLGYY~RArnl~~~a~~i~~~~~g~~P~~~~~L~~LPG 80 (290) T PRK13910 1 VMSQQTQINTVVERFYSPFLEAFPTLKDLANAQLEEVLLLWRGLGYYSRAKNLKKSAEICVKEHHSQLPNDYQSLLKLPG 80 (290) T ss_pred CCCCCCCCCCCCHHHHHHHHHHCCCHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHCCC T ss_conf 96885742100578999999988399999778999999998746848999999999999999838989852999975889 Q ss_pred HHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 77877667766434963131243146665765197545735689998741025785226789999999986320412356 Q gi|254780479|r 126 IGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTSRPGDFVQAMMDLGALICTSNKPL 205 (356) Q Consensus 126 iG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~~~~~~~nQAlMdlGa~iC~p~~P~ 205 (356) ||+|||+||+|||||++.++|||||+||++|+|+++.+.. .+.+|..++.++|.++++|||||||||||+||+|+ |+ T Consensus 81 IG~yTA~AI~siaf~~~~~~vDgNv~RVl~R~~~~~~~~~--~k~~~~~~~~~~~~~~~~~~nqalMdlGa~iC~pk-P~ 157 (290) T PRK13910 81 IGAYTANAILCFGFREKSACVDANIKRVLLRLFGLDPNIH--AKDLQIKANDFLNLNESFNHNQALIDLGALICSPK-PK 157 (290) T ss_pred CCCHHHHHHHHHHCCCCCCCCCCCHHEEEEHHHCCCCCCC--HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCC-CC T ss_conf 9826999999986476332000452014102324789946--69999999982595422689999998633343799-99 Q ss_pred CCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCEEEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHH Q ss_conf 43110000013431011001234432222221123200034137605776457201320474421111346721588863 Q gi|254780479|r 206 CPLCPIQKNCLTFSEGKSHLLGINTIKKKRPMRTGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTH 285 (356) Q Consensus 206 C~~Cpl~~~C~~~~~~~~~~~P~kk~KKkk~~r~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~ 285 (356) |..||+++.|.|+ +.+..+|.+++++..+.. ..+++ +..+++++|+|+ ++++|+||||||..+.+.+ T Consensus 158 C~~CPl~~~C~a~--~~~~~~~~kkkkk~~~~~-~~~~~-i~~~~~ilL~k~-~~~~~~Glw~fP~~ke~~~-------- 224 (290) T PRK13910 158 CAICPFNPYCLGK--NHPEKHTLKKKQEIIQEE-RYLGV-VIQNNQIALEKI-EQKLYLGMHHFPNLKENLE-------- 224 (290) T ss_pred CCCCCCHHHHHCC--CCCCCCCCCCCCCCCCCE-EEEEE-EEECCEEEEEEC-CCCCCCCCCCCCCCCCCCC-------- T ss_conf 8779795552315--876446888887677620-69999-998999999817-8866677665999875556-------- Q ss_pred CCCCCCCCEECCCEEEEEEEEEEEEEEEEEECCCCCCCCCCEEECHHHHHHCCCCHHHHHHHHHCCCCC Q ss_conf 010100002137358998315999999999807767667813322788874799878999998448237 Q gi|254780479|r 286 SAPFTANWILCNTITHTFTHFTLTLFVWKTIVPQIVIIPDSTWHDAQNLANAALPTVMKKALSAGGIKV 354 (356) Q Consensus 286 ~~~~~~~~~~l~~ikH~fTH~~L~i~v~~~~~~~~~~~~~~~Wv~~~el~~~~LPs~~kKIL~alg~~~ 354 (356) .....++.++|+||||++++++|.+.+++. ....+|++++|++++|||++++|||+.+..++ T Consensus 225 -----~~~~~l~~ikH~fTH~kl~i~v~~~~~k~~--~~~~~W~~l~el~~l~lps~~kKIL~~Lk~~~ 286 (290) T PRK13910 225 -----FKLPFLGAIKHSHTKFKLNLNLYLAAIKDL--KNPIRFYSLKDLETLPISSMTLKILNFLKQKN 286 (290) T ss_pred -----CCHHHHHHCEEEEEEEEEEEEEEEEHHHCC--CCCCCEEEHHHHHHCCCCCHHHHHHHHHHHHC T ss_conf -----540233000001565779999998541058--98774316777520889828999999997745 No 5 >KOG2457 consensus Probab=100.00 E-value=0 Score=633.95 Aligned_cols=338 Identities=35% Similarity=0.587 Sum_probs=275.5 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCC-CCCCCHHHHHHHHHHHHHCCHHHCCHHHHHHHHHCCCHHHHHCCCH-HHHH Q ss_conf 8999999999996289988754855444-5678704676644673116120332658998774899789872998-9999 Q gi|254780479|r 6 HIIQSKILDWYDTNHRVLPWRTSPKTEK-SSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQKWPTIFCLSSAKD-EEIL 83 (356) Q Consensus 6 ~~~~~~ll~w~~~~~R~lpwr~~~~~~~-~~~~~py~v~vseimlqqT~v~~v~~~~~~~~~~~P~~~~la~a~~-~~vl 83 (356) +.|+..||+|||.|+||||||+.+...+ +-.|..|.|||||||||||||.||+.||.+||++|||+.+||.|+. +||+ T Consensus 91 ~~fR~sLl~wYD~~KRdLPWR~r~sEde~DwerRaYeVwVSEiMLQQTrV~TV~~YYt~WMqkwPTl~dla~Asl~~eVn 170 (555) T KOG2457 91 QKFRMSLLDWYDVNKRDLPWRNRRSEDEKDWERRAYEVWVSEIMLQQTRVQTVMKYYTRWMQKWPTLYDLAQASLEKEVN 170 (555) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf 99999898876300223864468752135688889999999999989999999999999998375088888878988999 Q ss_pred HHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHH-HHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCC Q ss_conf 997404634588999877675430057868731456888-8877787766776643496313124314666576519754 Q gi|254780479|r 84 SAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKK-LPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIK 162 (356) Q Consensus 84 ~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~-LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~ 162 (356) .+|+|||||+|||+|+++|+.+++.++|.||.+.++|++ +||||+||||||+|||||++..+|||||.||+||..+|.. T Consensus 171 ~lWaGlGyY~R~rrL~ega~~vv~~~~ge~Prta~~l~kgvpGVG~YTAGAiaSIAf~q~tGiVDGNVirvlsRalAIhs 250 (555) T KOG2457 171 ELWAGLGYYRRARRLLEGAKMVVAGTEGEFPRTASSLMKGVPGVGQYTAGAIASIAFNQVTGIVDGNVIRVLSRALAIHS 250 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCC T ss_conf 99841018988899999999999757887887389998518887742310455423047643204615777677676137 Q ss_pred CCC--HHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHCCCCC---------------- Q ss_conf 573--56899987410257852267899999999863204123564311000001343101100---------------- Q gi|254780479|r 163 PAP--LYHKTIKNYARKITSTSRPGDFVQAMMDLGALICTSNKPLCPLCPIQKNCLTFSEGKSH---------------- 224 (356) Q Consensus 163 ~~~--~~~k~l~~~~~~~~~~~~~~~~nQAlMdlGa~iC~p~~P~C~~Cpl~~~C~~~~~~~~~---------------- 224 (356) +.+ +....+|.+|+.++++.+|||||||+|||||++|||.+|.|+.||+++.|.||+...++ T Consensus 251 DcSkgk~~q~~wkLA~qLVDP~RPGDFNQalMELGAt~CTpq~P~CS~CPvss~CrA~q~~Eqn~~i~~g~t~~~dve~~ 330 (555) T KOG2457 251 DCSKGKFFQSSWKLAAQLVDPSRPGDFNQALMELGATLCTPQKPSCSSCPVSSQCRAFQLSEQNRTISVGDTYPTDVEKA 330 (555) T ss_pred CCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCEECCCCCCCCCCCCCHHHHHHHHHHHHCCEEECCCCCCCCHHHC T ss_conf 74300688999999998259889883779999836712157998767787288998876764234543488688527552 Q ss_pred --------------------------CCCCCCCCCCCCCEEEEEEEEEEC-------CCEEEEEECCCCHHHHCCCCCCC Q ss_conf --------------------------123443222222112320003413-------76057764572013204744211 Q gi|254780479|r 225 --------------------------LLGINTIKKKRPMRTGAVFIAITN-------DNRILLRKRTNTRLLEGMDELPG 271 (356) Q Consensus 225 --------------------------~~P~kk~KKkk~~r~~~~~vii~~-------~~kiLL~KRp~~gll~GLwEFP~ 271 (356) .||++..+ ..+... ...++++. .+.||+.+||..|+++|||+||+ T Consensus 331 ~P~~~~C~vCv~~ips~e~~Qs~gv~~~p~~p~~-~~~reE-~~~v~~~e~~dp~t~~~~ilv~~rp~~gllagLw~fpt 408 (555) T KOG2457 331 KPRHDFCCVCVLEIPSLERNQSGGVFVLPKRPEQ-LAGREE-FPSVILNEEADPATRRNAILVYLRPAFGLLAGLWKFPT 408 (555) T ss_pred CCCCCCEEEEECCCCCCHHHCCCCEEEECCCHHH-CCCCCC-CCEEEEECCCCCHHHCCEEEEEECCCHHHHHHHHHCCC T ss_conf 8887823686267897133146747971268554-476211-56035542578243101057785442147877643572 Q ss_pred CCCCCC-CCHHHHHHCC------------CCCCCCEECCCEEEEEEEEEEEEEEEEEE--CCCC--CCCCCCEEECHHHH Q ss_conf 113467-2158886301------------01000021373589983159999999998--0776--76678133227888 Q gi|254780479|r 272 SAWSST-KDGNIDTHSA------------PFTANWILCNTITHTFTHFTLTLFVWKTI--VPQI--VIIPDSTWHDAQNL 334 (356) Q Consensus 272 ~e~~~~-~~~~~~~~~~------------~~~~~~~~l~~ikH~fTH~~L~i~v~~~~--~~~~--~~~~~~~Wv~~~el 334 (356) +.+.+. +........- .......+.|.++|+|||+.+.-+||..- .+.. ..+.+.+|++..++ T Consensus 409 i~~~e~se~~~~~a~~q~~v~~w~~~~~~t~~~~~~~~G~~~htfshi~~ts~V~~~a~~~~~~vt~~p~~~~wi~q~~l 488 (555) T KOG2457 409 IVSRELSEFVHIFAHIQRKVYVWLLVVQLTGGTEDLFKGQAKHTFSHICVTSDVLSTAGLTSAVVTVPPFRLQWIKQLSL 488 (555) T ss_pred EECCCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHH T ss_conf 11057522778999999999999623400123133216502200335556888988751578750269733202345446 Q ss_pred HHCC-CCHHHHH Q ss_conf 7479-9878999 Q gi|254780479|r 335 ANAA-LPTVMKK 345 (356) Q Consensus 335 ~~~~-LPs~~kK 345 (356) +..+ ....+++ T Consensus 489 ~h~~~~e~~lk~ 500 (555) T KOG2457 489 DHMVEKEQILKC 500 (555) T ss_pred HHHHHHHHHHHH T ss_conf 899999999888 No 6 >PRK10702 endonuclease III; Provisional Probab=100.00 E-value=0 Score=436.32 Aligned_cols=202 Identities=23% Similarity=0.310 Sum_probs=181.6 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCHHHCCHHHHHHHHHCCCHHHHHCCCHHHHHHHHHCCCH Q ss_conf 99999962899887548554445678704676644673116120332658998774899789872998999999740463 Q gi|254780479|r 12 ILDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQKWPTIFCLSSAKDEEILSAWAGLGY 91 (356) Q Consensus 12 ll~w~~~~~R~lpwr~~~~~~~~~~~~py~v~vseimlqqT~v~~v~~~~~~~~~~~P~~~~la~a~~~~vl~~w~gLGy 91 (356) ..++++.=.+.+|+..+ +.++.|||+|||||||+|||++++|.++|++|+++|||+++||+|+++||.++|+|+|| T Consensus 6 ~~~i~~~l~~~~p~~~~----~l~~~~P~~vLVs~ILsqqTtd~~v~~~~~~L~~~~~t~e~la~a~~~el~~~i~~~G~ 81 (211) T PRK10702 6 RLEILTRLRDNNPHPTT----ELNFSSPFELLIAVLLSAQATDVSVNKATAKLYPVANTPAAMLELGVEGVKTYIKTIGL 81 (211) T ss_pred HHHHHHHHHHHCCCCCC----CCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCHHHHHHHHHHHHH T ss_conf 99999999987869999----85889858999999997418589999999999997799999870999999999998635 Q ss_pred H-HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHH Q ss_conf 4-588999877675430057868731456888887778776677664349631312431466657651975457356899 Q gi|254780479|r 92 Y-TRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKT 170 (356) Q Consensus 92 y-~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~ 170 (356) | +||+|||++|++|+++|+|.+|++.++|++|||||+|||+||+|+|||+|+++|||||.||++|++..... ..+. T Consensus 82 y~~KA~~L~~~a~~i~~~~~G~vP~~~~~L~~LpGIG~kTA~aIl~~a~~~~~~~VDtnV~RV~~Rlg~~~~~---~~~~ 158 (211) T PRK10702 82 YNSKAENVIKTCRILLEKHNGEVPEDRAALEALPGVGRKTANVVLNTAFGWPTIAVDTHIFRVCNRTQFAPGK---NVEQ 158 (211) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCC---CHHH T ss_conf 9999999999999999990998766699999876635889999999984998652573599999997657789---9999 Q ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHC Q ss_conf 98741025785226789999999986320412356431100000134310 Q gi|254780479|r 171 IKNYARKITSTSRPGDFVQAMMDLGALICTSNKPLCPLCPIQKNCLTFSE 220 (356) Q Consensus 171 l~~~~~~~~~~~~~~~~nQAlMdlGa~iC~p~~P~C~~Cpl~~~C~~~~~ 220 (356) ++..+..++|.+.+.++||+|||||++||+|++|+|..|||++.|.++++ T Consensus 159 ~~~~l~~~~p~~~~~~~~~~li~~G~~iC~~~~P~C~~Cpl~~~C~~~~K 208 (211) T PRK10702 159 VEEKLLKVVPAEFKVDCHHWLILHGRYTCIARKPRCGSCIIEDLCEYKEK 208 (211) T ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 99999982890237999999999950150699993999989144999666 No 7 >COG0177 Nth Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair] Probab=100.00 E-value=1.6e-39 Score=298.89 Aligned_cols=181 Identities=27% Similarity=0.466 Sum_probs=166.8 Q ss_pred CCHHHHHHHHHHHHHCCHHHCCHHHHHHHHHCCCHHHHHCCCHHHHHHHHHCCCHHH-HHHHHHHHHHHHHHCCCCCCCC Q ss_conf 870467664467311612033265899877489978987299899999974046345-8899987767543005786873 Q gi|254780479|r 37 PSPYKVWISEIMLQQTTVKTVEPYFKKFMQKWPTIFCLSSAKDEEILSAWAGLGYYT-RARNLKKCADIIVKKYEGNFPH 115 (356) Q Consensus 37 ~~py~v~vseimlqqT~v~~v~~~~~~~~~~~P~~~~la~a~~~~vl~~w~gLGyy~-Rar~l~~~a~~i~~~~~g~~P~ 115 (356) .|||.++|+-|+.|||+=+.|-+-+.+.+++|||+++|++|++++|-.+-...|+|+ +|+|++++|++|+++|+|.+|. T Consensus 27 ~~pf~lLva~iLSaqttD~~vn~at~~Lf~~~~t~e~l~~a~~~~l~~~I~~iGlyr~KAk~I~~~~~~l~e~~~g~vP~ 106 (211) T COG0177 27 KDPFELLVAVILSAQTTDEVVNKATPALFKRYPTPEDLLNADEEELEELIKSIGLYRNKAKNIKELARILLEKFGGEVPD 106 (211) T ss_pred CCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 88389999999944674488999999999975999999749999999999863871899999999999999974999981 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH Q ss_conf 14568888877787766776643496313124314666576519754573568999874102578522678999999998 Q gi|254780479|r 116 KVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTSRPGDFVQAMMDLG 195 (356) Q Consensus 116 ~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~~~~~~~nQAlMdlG 195 (356) +.++|++|||||++||+.|+++|||.|+++||+||.||..|+...... +-.+++.-...++|.+.+.++|.+|+.|| T Consensus 107 ~~~eL~~LPGVGrKTAnvVL~~a~g~p~i~VDTHV~Rvs~R~gl~~~~---~p~~ve~~L~~~iP~~~~~~~h~~lI~~G 183 (211) T COG0177 107 TREELLSLPGVGRKTANVVLSFAFGIPAIAVDTHVHRVSNRLGLVPGK---TPEEVEEALMKLIPKELWTDLHHWLILHG 183 (211) T ss_pred HHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHCCCCCC---CHHHHHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf 599997489966577898998655998652124299999984778899---99999999999789788999999999960 Q ss_pred HHHCCCCCCCCCCCCCCCCHHHHHC Q ss_conf 6320412356431100000134310 Q gi|254780479|r 196 ALICTSNKPLCPLCPIQKNCLTFSE 220 (356) Q Consensus 196 a~iC~p~~P~C~~Cpl~~~C~~~~~ 220 (356) ..||+|++|+|+.||+++.|.++.. T Consensus 184 R~iC~ar~P~C~~C~l~~~C~~~~~ 208 (211) T COG0177 184 RYICKARKPRCEECPLADLCPSAGK 208 (211) T ss_pred HHHCCCCCCCCCCCCCHHHCCHHCC T ss_conf 5311689998675646554811100 No 8 >TIGR01083 nth endonuclease III; InterPro: IPR005759 The spectrum of DNA damage caused by reactive oxygen species includes a wide variety of modifications of purine and pyrimidine bases. Among these modified bases, 7,8-dihydro-8-oxoguanine (8-oxoG) is an important mutagenic lesion. Base excision repair is a critical mechanism for preventing mutations by removing the oxidative lesion from the DNA. Escherichia coli Nth protein (endonuclease III) has an 8-oxoG DNA glycosylase/AP lyase activity which removes 8-oxoG preferentially from 8-oxoG/G mispairs. Human hNTH1 protein, a homolog of E. coli Nth protein, has similar DNA glycosylase/AP lyase activity that removes 8-oxoG from 8-oxoG/G mispairs . ; GO: 0003906 DNA-(apurinic or apyrimidinic site) lyase activity, 0006284 base-excision repair, 0005622 intracellular. Probab=100.00 E-value=7.9e-38 Score=286.98 Aligned_cols=187 Identities=26% Similarity=0.424 Sum_probs=163.5 Q ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCC---HHHCCHHHHHHHHHCCCHHHHHCCCH Q ss_conf 716899999999999628998875485544456787046766446731161---20332658998774899789872998 Q gi|254780479|r 3 QPEHIIQSKILDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQTT---VKTVEPYFKKFMQKWPTIFCLSSAKD 79 (356) Q Consensus 3 ~~~~~~~~~ll~w~~~~~R~lpwr~~~~~~~~~~~~py~v~vseimlqqT~---v~~v~~~~~~~~~~~P~~~~la~a~~ 79 (356) ++++.+-++|.+-| +....+..+.|||.++||-|+.+|++ |+.|.| +++++|||+++||.|++ T Consensus 2 ~~~~~il~~L~~~y-----------P~p~tEL~~~~PFeLLVAtiLSAQ~TD~~VNkaT~---~LF~~Y~tp~~~a~a~~ 67 (192) T TIGR01083 2 QKAQEILERLRKLY-----------PHPTTELDYKNPFELLVATILSAQATDKSVNKATK---KLFEVYPTPQALAAAGL 67 (192) T ss_pred CHHHHHHHHHHHHC-----------CCCEEEEEECCCHHHHHHHHHHHHHCCHHHHHCCH---HHHHCCCCHHHHHCCCH T ss_conf 40789999999738-----------99725433307078999999986531326763167---86512778689960893 Q ss_pred HHHHHHHHCCCHHH-HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHH Q ss_conf 99999974046345-88999877675430057868731456888887778776677664349631312431466657651 Q gi|254780479|r 80 EEILSAWAGLGYYT-RARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYF 158 (356) Q Consensus 80 ~~vl~~w~gLGyy~-Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~ 158 (356) |||..+-.--|||+ .|+||.++|++|+++|+|++|+|.++|++||||||.||+-|++.|||.|+++||+||.||..|+ T Consensus 68 eel~~~Ik~iGlYr~KAk~I~~~~~~LvE~y~GeVP~~~~eL~~LPGVGRKTANVVL~~aFg~P~iAVDTHv~Rv~~Rl- 146 (192) T TIGR01083 68 EELEEYIKSIGLYRNKAKNIIALCRKLVERYGGEVPEDREELVKLPGVGRKTANVVLNVAFGIPAIAVDTHVFRVSNRL- 146 (192) T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCCCHHHHHHHHHHHCCCEEEECCCHHHHHHHH- T ss_conf 1347764225864568999999999999981898775537661789987114562433442687057414346554331- Q ss_pred CCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 975457356899987410257852267899999999863204123564 Q gi|254780479|r 159 DIIKPAPLYHKTIKNYARKITSTSRPGDFVQAMMDLGALICTSNKPLC 206 (356) Q Consensus 159 ~~~~~~~~~~k~l~~~~~~~~~~~~~~~~nQAlMdlGa~iC~p~~P~C 206 (356) |+....+. ..+++-..+++|.+.+.+++-+|+-||..||++|+|.| T Consensus 147 gl~~~~dp--~~vE~~L~~l~P~~~w~~~hh~lIlHGRy~CkAr~P~C 192 (192) T TIGR01083 147 GLSKGKDP--DKVEEELLKLIPKEFWTKLHHWLILHGRYTCKARKPRC 192 (192) T ss_pred CCCCCCCH--HHHHHHHHHHCCCCCHHHCCHHHHHHCCCCCCCCCCCC T ss_conf 35777898--99999998744850012223675431111116888889 No 9 >smart00478 ENDO3c endonuclease III. includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases Probab=100.00 E-value=2.8e-37 Score=283.04 Aligned_cols=148 Identities=29% Similarity=0.428 Sum_probs=139.9 Q ss_pred HHHHCCHHHCCHHHHHHHHHCCCHHHHHCCCHHHHHHHHHCCCHH-HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHH Q ss_conf 731161203326589987748997898729989999997404634-5889998776754300578687314568888877 Q gi|254780479|r 48 MLQQTTVKTVEPYFKKFMQKWPTIFCLSSAKDEEILSAWAGLGYY-TRARNLKKCADIIVKKYEGNFPHKVEILKKLPGI 126 (356) Q Consensus 48 mlqqT~v~~v~~~~~~~~~~~P~~~~la~a~~~~vl~~w~gLGyy-~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGi 126 (356) |.|||++++|.++|++|+++|||+++||+|+++||..+|+|+||| +||++|+++|++|+++|+|.+|++.++|++|||| T Consensus 1 LSqqt~~~~v~~~~~~l~~~~pt~~~l~~a~~~~l~~~i~~~g~~~~ka~~i~~~a~~i~~~~~~~~p~~~~~L~~lpGV 80 (149) T smart00478 1 LSQQTSDEAVNKATERLFEKFPTPEDLAAADEEELEELIRPLGFYRRKAKYLIELARILVEEYGGEVPDDREELLKLPGV 80 (149) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHCCCCC T ss_conf 99865289999999999998859999986899999999998688999999999999999986655588559998758986 Q ss_pred HHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHH Q ss_conf 78776677664349631312431466657651975457356899987410257852267899999999863 Q gi|254780479|r 127 GDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTSRPGDFVQAMMDLGAL 197 (356) Q Consensus 127 G~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~~~~~~~nQAlMdlGa~ 197 (356) |+|||++|++++||++.++||+||.||++|++.++... ..+.++..++.++|.+.+++||++|||+|++ T Consensus 81 G~~tA~~vl~~~~~~~~~~vD~~v~Rv~~R~~~~~~~~--~~~~~~~~l~~~~p~~~~~~~~~~l~~~G~~ 149 (149) T smart00478 81 GRKTANAVLSFALGKPFIPVDTHVLRIAKRLGLVDKKS--TPEEVEKLLEKLLPKEDWRELNLLLIDFGRT 149 (149) T ss_pred CHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCC--CHHHHHHHHHHHCCCCCHHHHHHHHHHCCCC T ss_conf 59999999999879983513413999999984788889--8999999999878934399999999981899 No 10 >cd00056 ENDO3c endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases Probab=100.00 E-value=6.7e-34 Score=259.28 Aligned_cols=154 Identities=33% Similarity=0.482 Sum_probs=146.3 Q ss_pred HHHHHHHHHHHHCCHHHCCHHHHHHHHHC-CCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCC---CC Q ss_conf 46766446731161203326589987748-997898729989999997404634588999877675430057868---73 Q gi|254780479|r 40 YKVWISEIMLQQTTVKTVEPYFKKFMQKW-PTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNF---PH 115 (356) Q Consensus 40 y~v~vseimlqqT~v~~v~~~~~~~~~~~-P~~~~la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~---P~ 115 (356) |.+||++||+|||++++|...|.+|.++| ||+++||+++++++..+|.|+||++||++|+++|+.|.+.++|.. |+ T Consensus 1 f~~Li~~Il~qq~s~~~a~~~~~~l~~~~~pt~~~l~~~~~~~l~~~~~~~gy~~Ka~~i~~~a~~i~~~~~~~~~~~~~ 80 (158) T cd00056 1 FEVLVSEILSQQTTDKAVNKAYERLFERYGPTPEALAAADEEELRELIRSLGYRRKAKYLKELARAIVEGFGGLVLDDPD 80 (158) T ss_pred CHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHH T ss_conf 99999999981452999999999999854998999980999999999733568999999999888889860895789888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH Q ss_conf 14568888877787766776643496313124314666576519754573568999874102578522678999999998 Q gi|254780479|r 116 KVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTSRPGDFVQAMMDLG 195 (356) Q Consensus 116 ~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~~~~~~~nQAlMdlG 195 (356) +.++|++|||||+|||++|++++|+.++++||+||.||++|++++..+ ...++++..++.++|....+++|++||++| T Consensus 81 ~~~~L~~l~GIG~~TA~~vl~~~~~~~~~~vD~~v~R~~~rl~~~~~~--~~~~~~~~~~~~~~p~~~~~~~~~~L~~~g 158 (158) T cd00056 81 AREELLALPGVGRKTANVVLLFALGPDAFPVDTHVRRVLKRLGLIPKK--KTPEELEELLEELLPKPYWGEANQALMDLG 158 (158) T ss_pred HHHHHHCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCCCCC--CCHHHHHHHHHHHCCCCHHHHHHHHHHHCC T ss_conf 999987589828999999999987998362259999999995787789--999999999998589201999999998676 No 11 >KOG1921 consensus Probab=99.92 E-value=6e-25 Score=196.25 Aligned_cols=181 Identities=24% Similarity=0.380 Sum_probs=154.1 Q ss_pred HHHHHHHHHHHHHCCHHHCCHHHHHHHHHC-CCHHHHHCCCHHHHHHHHHCCCHHHH-HHHHHHHHHHHHHCCCCCCCCH Q ss_conf 046766446731161203326589987748-99789872998999999740463458-8999877675430057868731 Q gi|254780479|r 39 PYKVWISEIMLQQTTVKTVEPYFKKFMQKW-PTIFCLSSAKDEEILSAWAGLGYYTR-ARNLKKCADIIVKKYEGNFPHK 116 (356) Q Consensus 39 py~v~vseimlqqT~v~~v~~~~~~~~~~~-P~~~~la~a~~~~vl~~w~gLGyy~R-ar~l~~~a~~i~~~~~g~~P~~ 116 (356) -|.|+|+-.+.-||.=.+.-.--.+..+.- =|++++.++++..+-.+-...|||+| |.+|+++|+|+.++|+|.+|.+ T Consensus 78 RfqvLv~lmLSSQTKDevt~~Am~rL~~~~gLT~e~v~~~de~~l~~LI~~VgFy~rKA~ylkkta~IL~d~f~gDIP~~ 157 (286) T KOG1921 78 RFQVLVGLMLSSQTKDEVTAAAMLRLKEYGGLTLEAVLKIDEPTLNELIYPVGFYTRKAKYLKKTAKILQDKFDGDIPDT 157 (286) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCHH T ss_conf 59999999970100788899999999985597899986167576876500012157888999999999998707997555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCEEEE-CCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH Q ss_conf 45688888777877667766434963131-24314666576519754573568999874102578522678999999998 Q gi|254780479|r 117 VEILKKLPGIGDYTASAIVAIAFNHFAVV-VDTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTSRPGDFVQAMMDLG 195 (356) Q Consensus 117 ~~~l~~LpGiG~yta~ai~s~a~~~~~~~-vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~~~~~~~nQAlMdlG 195 (356) .++|++||||||..|.-+|++|+|.-+.+ ||+||.|+-.|+..++... +.-.+-+.-.+.++|...+...|--|.-|| T Consensus 158 v~dLlsLPGVGPKMa~L~m~~AWn~i~GI~VDtHVHRi~nrlgWv~~kt-kspE~TR~aLq~wLPk~lW~eIN~lLVGFG 236 (286) T KOG1921 158 VEDLLSLPGVGPKMAHLTMQVAWNKIVGICVDTHVHRICNRLGWVDTKT-KSPEQTRVALQQWLPKSLWVEINHLLVGFG 236 (286) T ss_pred HHHHHCCCCCCHHHHHHHHHHHHCCCEEEEEEHHHHHHHHHHCCCCCCC-CCHHHHHHHHHHHCCHHHHHHHHCEEECCC T ss_conf 9988558997659999999998566405786328999888756335666-887899999998676877756201055145 Q ss_pred HHHCCCCCCCCCCCCC-CCCHHHHHC Q ss_conf 6320412356431100-000134310 Q gi|254780479|r 196 ALICTSNKPLCPLCPI-QKNCLTFSE 220 (356) Q Consensus 196 a~iC~p~~P~C~~Cpl-~~~C~~~~~ 220 (356) .+||+|+.|+|+.|-+ +..|.|... T Consensus 237 Q~iC~p~~prC~~C~~~~~~Cpss~~ 262 (286) T KOG1921 237 QTICTPRRPRCGLCLLSRDLCPSSFK 262 (286) T ss_pred CEEEECCCCCCCCCCCCCCCCCHHHH T ss_conf 21453279986332357566842444 No 12 >pfam00730 HhH-GPD HhH-GPD superfamily base excision DNA repair protein. This family contains a diverse range of structurally related DNA repair proteins. The superfamily is called the HhH-GPD family after its hallmark Helix-hairpin-helix and Gly/Pro rich loop followed by a conserved aspartate. This includes endonuclease III, EC:4.2.99.18 and MutY an A/G-specific adenine glycosylase, both have a C terminal 4Fe-4S cluster. The family also includes 8-oxoguanine DNA glycosylases. The methyl-CPG binding protein MBD4 also contains a related domain that is a thymine DNA glycosylase. The family also includes DNA-3-methyladenine glycosylase II EC:3.2.2.21 and other members of the AlkA family. Probab=99.89 E-value=8.8e-23 Score=180.98 Aligned_cols=122 Identities=34% Similarity=0.512 Sum_probs=113.8 Q ss_pred HHHHHHHHCCHHHCCHHHHHHHHH--CCCHHHHHCCCHHHHHHHHHCCCHH-HHHHHHHHHHHHHHHCCCCCCCCHHHH- Q ss_conf 644673116120332658998774--8997898729989999997404634-588999877675430057868731456- Q gi|254780479|r 44 ISEIMLQQTTVKTVEPYFKKFMQK--WPTIFCLSSAKDEEILSAWAGLGYY-TRARNLKKCADIIVKKYEGNFPHKVEI- 119 (356) Q Consensus 44 vseimlqqT~v~~v~~~~~~~~~~--~P~~~~la~a~~~~vl~~w~gLGyy-~Rar~l~~~a~~i~~~~~g~~P~~~~~- 119 (356) |+.|+.|||+++++...|.+|+++ |||++++++++++++..+|.++||| +||++|+.+|+.+.++|+|..|.+.++ T Consensus 1 V~~IlsQq~s~~~a~~i~~rl~~~~~~pt~~~l~~~~~~~l~~~i~~~G~~~~Ka~~I~~~a~~~~~~~~~~~~~~~~~~ 80 (144) T pfam00730 1 VSAILSQQTSDKAANKITKRLFERYGFPTPEDLAEADEEELRELIKGLGFYRRKAKYIKELARILVEGYLGLVPLDLEEL 80 (144) T ss_pred CEEEECCCCHHHHHHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHH T ss_conf 95542012349999999999999828989999985999999999870897699999999999888986289788611569 Q ss_pred --HHHHHHHHHHHHHHHHHHHCCCE--EEECCCCHHHHHHHHHCCCCCCC Q ss_conf --88888777877667766434963--13124314666576519754573 Q gi|254780479|r 120 --LKKLPGIGDYTASAIVAIAFNHF--AVVVDTNIERIISRYFDIIKPAP 165 (356) Q Consensus 120 --l~~LpGiG~yta~ai~s~a~~~~--~~~vD~Nv~RVl~R~~~~~~~~~ 165 (356) |++|||||++||..++.++|+.+ .+++|+||.|++.|++.++...+ T Consensus 81 ~~L~~l~GIG~~ta~~~l~~~~~~~d~~~~~D~~v~r~~~rl~~~~~~~~ 130 (144) T pfam00730 81 EALLALPGVGRWTAEAVLLFALGRPDVFPAVDTHVRRVAKRLGLIKEKPT 130 (144) T ss_pred HHHHCCCCCCHHHHHHHHHHHCCCCCCEECCCHHHHHHHHHHCCCCCCCC T ss_conf 99860889769999999999869998732644999999999779989999 No 13 >cd03431 DNA_Glycosylase_C DNA glycosylase (MutY in bacteria and hMYH in humans) is responsible for repairing misread A*oxoG residues to C*G by removing the inappropriately paired adenine base from the DNA backbone. It belongs to the Nudix hydrolase superfamily and is important for the repair of various genotoxic lesions. Enzymes belonging to this superfamily requires a divalent cation, such as Mg2+ or Mn2+ for their activity. They are also recognized by a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V). However, DNA glycosylase does not seem to contain this signature motif. DNA glycosylase consists of 2 domains: the N-terminal domain contains the catalytic properties of the enzyme and the C-terminal domain affects substrate (oxoG) binding and enzymatic turnover. The C-terminal domain is highly similar to MutT, based on secondary structure and topology, despite low sequence identity. MutT sanitizes the nucleotide precursor pool by hydrolyzing oxo-dGTP to Probab=99.84 E-value=2e-20 Score=164.41 Aligned_cols=112 Identities=34% Similarity=0.526 Sum_probs=88.3 Q ss_pred EEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHH---HCCCCCCCCEECCCEEEEEEEEEEEEEEEEE Q ss_conf 2320003413760577645720132047442111134672158886---3010100002137358998315999999999 Q gi|254780479|r 239 TGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDT---HSAPFTANWILCNTITHTFTHFTLTLFVWKT 315 (356) Q Consensus 239 ~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~---~~~~~~~~~~~l~~ikH~fTH~~L~i~v~~~ 315 (356) ...++++++.+|+|||+|||++|+|+||||||++++++........ ...........+++|+|+||||++++++|.+ T Consensus 3 ~~~~~~ii~~~~~iLl~kRp~~Gll~gLwefP~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~H~fSH~~l~~~~~~~ 82 (118) T cd03431 3 RGIAVVVIRNDGRVLLEKRPEKGLLAGLWEFPSVEWEEEADGEEALLSALKKALRLSLEPLGTVKHTFTHFRLTLHVYLA 82 (118) T ss_pred EEEEEEEEECCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEEEEEE T ss_conf 99999999779999999889998456530078665565646279998766655301221044041342417999999999 Q ss_pred ECCCC-CCCCCCEEECHHHHHHCCCCHHHHHHHHHC Q ss_conf 80776-766781332278887479987899999844 Q gi|254780479|r 316 IVPQI-VIIPDSTWHDAQNLANAALPTVMKKALSAG 350 (356) Q Consensus 316 ~~~~~-~~~~~~~Wv~~~el~~~~LPs~~kKIL~al 350 (356) .+... ....++.|++.+++++++||++++|+++.+ T Consensus 83 ~~~~~~~~~~~~~Wv~~~~l~~~~~p~~~~Kil~~l 118 (118) T cd03431 83 RLEGDLLAPDEGRWVPLEELDEYALPTVMRKILELL 118 (118) T ss_pred EECCCCCCCCCCEEEEHHHCCCCCCCHHHHHHHHHC T ss_conf 963798789888997678854068987999999759 No 14 >PRK13913 3-methyladenine DNA glycosylase; Provisional Probab=99.63 E-value=2.5e-15 Score=128.47 Aligned_cols=141 Identities=16% Similarity=0.205 Sum_probs=116.9 Q ss_pred CCHHHHHHHHHHHHHCCHHHCCHHHHHHHHH-------CCCHHHHHCCCHHHHHHHHHCCCHHH-HHHHHHHHHHHHHHC Q ss_conf 8704676644673116120332658998774-------89978987299899999974046345-889998776754300 Q gi|254780479|r 37 PSPYKVWISEIMLQQTTVKTVEPYFKKFMQK-------WPTIFCLSSAKDEEILSAWAGLGYYT-RARNLKKCADIIVKK 108 (356) Q Consensus 37 ~~py~v~vseimlqqT~v~~v~~~~~~~~~~-------~P~~~~la~a~~~~vl~~w~gLGyy~-Rar~l~~~a~~i~~~ 108 (356) ..|+.|.|.-|+-|+|.=..|..-..+.-+. --++..+++.+++++-.+-..-|||+ +|++|+..|+.++.+ T Consensus 28 ~~~FEvivGAILtQNT~W~nVekAl~nLk~a~lL~~~~~~~l~~i~~l~~e~La~lIrPaGFy~~KA~rLk~l~~~~~~d 107 (218) T PRK13913 28 ALKFEALLGAVLTQNTKFEAVLKSLENLKNAFILENDDEINLKKIAYIEFSKLAECVRPSGFYNQKAKRLIDLSKNILKD 107 (218) T ss_pred CCCCEEEEEEEEHHCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH T ss_conf 99765534411004187889999999999766778644159999971899999999504015899999999999999987 Q ss_pred CCC----CCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 578----6873145688888777877667766434963131243146665765197545735689998741025785 Q gi|254780479|r 109 YEG----NFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKITST 181 (356) Q Consensus 109 ~~g----~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~ 181 (356) |++ .-+.++++|++++|||+-||.+|+..||++|+.+||...+|+++|+ |+... + -..++.+.+.-++. T Consensus 108 ~~~~~~~~~~~~Re~LL~lkGIG~ETADsILlYa~~~p~FVVDaYT~Ri~~rl-G~~~~-~--Ydelq~~fe~~l~e 180 (218) T PRK13913 108 FQSFENFKQEVTREWLLDQKGIGKESADAILCYVCAKEVMVVDKYSYLFLKKL-GIEIE-D--YDELQHFFEKGVQE 180 (218) T ss_pred HHHHHHCCCHHHHHHHHCCCCCCHHHHHHHHHHHCCCCEEECHHHHHHHHHHC-CCCCC-C--HHHHHHHHHHCCHH T ss_conf 52575145365899997489866333999999974998451118899999981-99857-9--99999999962277 No 15 >COG2231 Uncharacterized protein related to Endonuclease III [DNA replication, recombination, and repair] Probab=99.62 E-value=1.3e-14 Score=123.33 Aligned_cols=178 Identities=23% Similarity=0.361 Sum_probs=140.1 Q ss_pred CHHHHHHHHHHHHHCCHHHCCHHHHHHHHH-CCCHHHHHCCCHHHHHHHHHCCCHHH-HHHHHHHHHHHHHHCCC---CC Q ss_conf 704676644673116120332658998774-89978987299899999974046345-88999877675430057---86 Q gi|254780479|r 38 SPYKVWISEIMLQQTTVKTVEPYFKKFMQK-WPTIFCLSSAKDEEILSAWAGLGYYT-RARNLKKCADIIVKKYE---GN 112 (356) Q Consensus 38 ~py~v~vseimlqqT~v~~v~~~~~~~~~~-~P~~~~la~a~~~~vl~~w~gLGyy~-Rar~l~~~a~~i~~~~~---g~ 112 (356) +-+.|+++-|+-|.|.=+.|..--+..-+. --+++++..-+++++..+-.+-|||+ .|.+|....+.++..|. +. T Consensus 29 ~~~EiiigAILtQNT~WknvekAlenLk~~~~~~l~~I~~~~~~~L~elIrpsGFYnqKa~rLk~l~k~l~~~~~~~~~~ 108 (215) T COG2231 29 NKDEIIIGAILTQNTSWKNVEKALENLKNEGILNLKKILKLDEEELAELIRPSGFYNQKAKRLKALSKNLAKFFINLESF 108 (215) T ss_pred CCHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 72068998898524529999999999988156799998458999999987042408999999999999999986423111 Q ss_pred CCC-HHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCC--HHHHHH Q ss_conf 873-14568888877787766776643496313124314666576519754573568999874102578522--678999 Q gi|254780479|r 113 FPH-KVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTSR--PGDFVQ 189 (356) Q Consensus 113 ~P~-~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~~~--~~~~nQ 189 (356) -+. ..++|+++.|||.-||.||+.+||++|+.+||.-.+|+++|++++... + ..++..+.+.-+|.+- .-.|.- T Consensus 109 ~~~~~R~~LL~iKGIG~ETaDsILlYa~~rp~FVvD~YtrR~l~rlg~i~~k-~--ydeik~~fe~~l~~~~~lyqe~HA 185 (215) T COG2231 109 KSEVLREELLSIKGIGKETADSILLYALDRPVFVVDKYTRRLLSRLGGIEEK-K--YDEIKELFEENLPENLRLYQEFHA 185 (215) T ss_pred CHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCC-C--HHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 5188999987268866223999999980486446329999999994551025-4--999999998535067999999999 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCHHHHH Q ss_conf 999998632041235643110000013431 Q gi|254780479|r 190 AMMDLGALICTSNKPLCPLCPIQKNCLTFS 219 (356) Q Consensus 190 AlMdlGa~iC~p~~P~C~~Cpl~~~C~~~~ 219 (356) -+..+|-.-|.-+ |.|+.||+...|..+. T Consensus 186 lIv~~~K~f~~k~-~~~~~cpL~~~~~~~~ 214 (215) T COG2231 186 LIVEHAKHFCKKK-PLCEKCPLKEKCKKYR 214 (215) T ss_pred HHHHHHHHHCCCC-CCCCCCHHHHHHHHCC T ss_conf 9999999981588-6778765899875226 No 16 >PRK08999 hypothetical protein; Provisional Probab=99.61 E-value=4.7e-15 Score=126.55 Aligned_cols=118 Identities=18% Similarity=0.140 Sum_probs=97.8 Q ss_pred CEEEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHH------HHCCCCCCCCEECCCEEEEEEEEEEEE Q ss_conf 11232000341376057764572013204744211113467215888------630101000021373589983159999 Q gi|254780479|r 237 MRTGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNID------THSAPFTANWILCNTITHTFTHFTLTL 310 (356) Q Consensus 237 ~r~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~------~~~~~~~~~~~~l~~ikH~fTH~~L~i 310 (356) .....+.++++.+|+||+.||++++.|+|+||||+++.+.++..... ++.............+.|.|.|+.+.+ T Consensus 4 ~i~V~~gvi~~~~~~vLi~~R~~~~~~~g~WEFPGGKvE~~Et~~~aL~REl~EElgI~v~~~~~~~~~~h~Y~~~~v~L 83 (312) T PRK08999 4 RVHVAAAVIRDADGRILLARRPEGKHQGGLWEFPGGKVEPGETVEQALARELREELGIEVTAARPLITVPHDYPDKRVRL 83 (312) T ss_pred EEEEEEEEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEEECCCEEEEE T ss_conf 79999999985899699997899998788464898867999899999999988870927632138899999649849999 Q ss_pred EEEEEEC----CCCCCCCCCEEECHHHHHHCCCCHHHHHHHHHCCCCC Q ss_conf 9999980----7767667813322788874799878999998448237 Q gi|254780479|r 311 FVWKTIV----PQIVIIPDSTWHDAQNLANAALPTVMKKALSAGGIKV 354 (356) Q Consensus 311 ~v~~~~~----~~~~~~~~~~Wv~~~el~~~~LPs~~kKIL~alg~~~ 354 (356) ++|.+.. +...+.+..+|++++++.+++||.+++.||.++.++. T Consensus 84 ~~~~v~~~~G~~~~~EgQ~~~Wv~~~~l~~~~fp~AN~~Ii~~l~LP~ 131 (312) T PRK08999 84 DVREVTAWQGEPHGREGQPLAWVAPDELAVYPFPPANQPIVRALRLPD 131 (312) T ss_pred EEEEEEEECCCCCCCCCCEEEEECHHHCCCCCCCCCCHHHHHHHCCCC T ss_conf 999987307700465687566776533545899999869998503586 No 17 >PRK10776 nucleoside triphosphate pyrophosphohydrolase; Provisional Probab=99.58 E-value=1.4e-14 Score=123.24 Aligned_cols=113 Identities=15% Similarity=0.139 Sum_probs=91.7 Q ss_pred EEEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHH------CCCCCCCCEECCCEEEEEEEEEEEEE Q ss_conf 123200034137605776457201320474421111346721588863------01010000213735899831599999 Q gi|254780479|r 238 RTGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTH------SAPFTANWILCNTITHTFTHFTLTLF 311 (356) Q Consensus 238 r~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~------~~~~~~~~~~l~~ikH~fTH~~L~i~ 311 (356) ....++|+++.+|++|+.||+.++.++|+||||+++.+.++....... ..........+..+.|.|+|+.+.++ T Consensus 4 i~V~~aii~~~~~~~Li~~R~~~~~~~g~WEFPGGKvE~gEt~~~Al~REl~EElgi~v~~~~~~~~~~~~y~~~~i~l~ 83 (129) T PRK10776 4 LQIAVGIIRNENNEIFITRRAADAHMANKLEFPGGKIEAGETPEQALIRELQEEVGITPQQATLFEKLEYEFPDRHITLW 83 (129) T ss_pred EEEEEEEEEECCCEEEEEECCCCCCCCCCCCCCCEEECCCCCHHHHHHHHHHHHHCEEEEECCEEEEEEEECCCEEEEEE T ss_conf 89999999948998999996999988985919928527997989999999877517167605299999998998099999 Q ss_pred EEEEECC----CCCCCCCCEEECHHHHHHCCCCHHHHHHHHHC Q ss_conf 9999807----76766781332278887479987899999844 Q gi|254780479|r 312 VWKTIVP----QIVIIPDSTWHDAQNLANAALPTVMKKALSAG 350 (356) Q Consensus 312 v~~~~~~----~~~~~~~~~Wv~~~el~~~~LPs~~kKIL~al 350 (356) +|.+..- .....++.+|++.++|.++++|.+.++||+.| T Consensus 84 ~f~~~~~~G~~~~~e~~~~~Wv~~~eL~~~~~~~ad~~ii~~L 126 (129) T PRK10776 84 FWLVESWEGEPWGKEGQPGRWVSQVALNADDFPPANEPIIAKL 126 (129) T ss_pred EEEEEECCCEECCCCCCCCEEEEHHHCCCCCCCCCCHHHHHHH T ss_conf 9999971699975657675997788836599786849999998 No 18 >cd03425 MutT_pyrophosphohydrolase The MutT pyrophosphohydrolase is a prototypical Nudix hydrolase that catalyzes the hydrolysis of nucleoside and deoxynucleoside triphosphates (NTPs and dNTPs) by substitution at a beta-phosphorus to yield a nucleotide monophosphate (NMP) and inorganic pyrophosphate (PPi). This enzyme requires two divalent cations for activity; one coordinates the phosphoryl groups of the NTP/dNTP substrate, and the other coordinates to the enzyme. It also contains the Nudix motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), that functions as metal binding and catalytic site. MutT pyrophosphohydrolase is important in preventing errors in DNA replication by hydrolyzing mutagenic nucleotides such as 8-oxo-dGTP (a product of oxidative damage), which can mispair with template adenine during DNA replication, to guanine nucleotides. Probab=99.54 E-value=5.2e-14 Score=119.16 Aligned_cols=111 Identities=21% Similarity=0.255 Sum_probs=89.7 Q ss_pred EEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHH------HHCCCCCCCCEECCCEEEEEEEEEEEEEEE Q ss_conf 32000341376057764572013204744211113467215888------630101000021373589983159999999 Q gi|254780479|r 240 GAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNID------THSAPFTANWILCNTITHTFTHFTLTLFVW 313 (356) Q Consensus 240 ~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~------~~~~~~~~~~~~l~~ikH~fTH~~L~i~v~ 313 (356) ..+++|++++|++||.||++++.++|+|+||++..+.+++.... ++..........++.+.|.++|+++.+.+| T Consensus 3 Vv~avI~~~~g~vLl~~R~~~~~~~g~W~~PGG~ve~gE~~~~a~~RE~~EEtGl~v~~~~~~~~~~~~~~~~~~~l~~f 82 (124) T cd03425 3 VVAAIIIDDDGRILIAQRPAGKHLGGLWEFPGGKVEPGETPEQALVRELREELGIEVEVGELLATVEHDYPDKRVTLHVF 82 (124) T ss_pred EEEEEEEECCCEEEEEECCCCCCCCCEEECCEEEECCCCCHHHHHHHHHHHHHCCEEEECCEEEEEEEECCCEEEEEEEE T ss_conf 99999997999899998689999999497987679699999999999999986970467469999989889918999999 Q ss_pred EEECC----CCCCCCCCEEECHHHHHHCCCCHHHHHHHHHC Q ss_conf 99807----76766781332278887479987899999844 Q gi|254780479|r 314 KTIVP----QIVIIPDSTWHDAQNLANAALPTVMKKALSAG 350 (356) Q Consensus 314 ~~~~~----~~~~~~~~~Wv~~~el~~~~LPs~~kKIL~al 350 (356) .+... .....++++|+++++|.+++||.++++||++| T Consensus 83 ~~~~~~g~~~~~E~~~~~Wv~~~el~~l~~~~a~~~il~~L 123 (124) T cd03425 83 LVELWSGEPQLLEHQELRWVPPEELDDLDFPPADVPIVAAL 123 (124) T ss_pred EEEECCCCCCCCCCCCCEEEEHHHCCCCCCCCCHHHHHHHH T ss_conf 98970780833645365980299936499982618999974 No 19 >PRK10546 pyrimidine (deoxy)nucleoside triphosphate pyrophosphohydrolase; Provisional Probab=99.47 E-value=4.1e-13 Score=112.90 Aligned_cols=110 Identities=15% Similarity=0.079 Sum_probs=84.6 Q ss_pred EEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCC------CCCCCCEECCCEEEEEEEEEEEEEEE Q ss_conf 320003413760577645720132047442111134672158886301------01000021373589983159999999 Q gi|254780479|r 240 GAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSA------PFTANWILCNTITHTFTHFTLTLFVW 313 (356) Q Consensus 240 ~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~------~~~~~~~~l~~ikH~fTH~~L~i~v~ 313 (356) ..+.+ +.++|++|+.||+..+.++|+||||+++.+.+++.......+ ........+.++.|.++|+.+++++| T Consensus 6 Vvaai-i~~~~rvLi~~R~~~~~~~g~WEFPGGKvE~gEs~~~Al~REl~EElgi~v~~~~~l~~~~~~~~~~~i~l~~~ 84 (135) T PRK10546 6 VVAAI-IERDGKILLAQRPAHSDQAGLWEFAGGKVEPGESQPQALIRELREELGIEATVGEYIASHQREVSGRIIHLHAW 84 (135) T ss_pred EEEEE-EEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCEEEECCEEEEEEEECCCEEEEEEEE T ss_conf 99999-99599999998899999998592983216898798999988878750958984568999988779989999999 Q ss_pred EEECC----CCCCCCCCEEECHHHHHHCCCCHHHHHHHHHC Q ss_conf 99807----76766781332278887479987899999844 Q gi|254780479|r 314 KTIVP----QIVIIPDSTWHDAQNLANAALPTVMKKALSAG 350 (356) Q Consensus 314 ~~~~~----~~~~~~~~~Wv~~~el~~~~LPs~~kKIL~al 350 (356) .+..- .....++.+|++.+++.+++||.+...||+++ T Consensus 85 ~~~~~~g~~~~~e~~~~~Wv~~~el~~~~~~pAd~~il~~l 125 (135) T PRK10546 85 HVPDFHGTLQAHEHQALVWCTPEEALRYPLAPADIPLLEAF 125 (135) T ss_pred EEEEECCEEEECCCCCCEEEEHHHHCCCCCCCCCHHHHHHH T ss_conf 99988881821456756992389935499885859999999 No 20 >TIGR00588 ogg 8-oxoguanine DNA-glycosylase (ogg); InterPro: IPR004577 All proteins in this family for which functions are known are 8-oxo-guanaine DNA glycosylases that function in base excision repair. The enzyme incises DNA at 8-oxoG residues, and excises 7,8-dihydro-8-oxoguanine from damaged DNA. It has beta-lyase activity that nicks DNA 3' to the lesion.; GO: 0008534 oxidized purine base lesion DNA N-glycosylase activity, 0006281 DNA repair, 0005634 nucleus. Probab=99.11 E-value=8.4e-11 Score=96.60 Aligned_cols=123 Identities=25% Similarity=0.480 Sum_probs=93.2 Q ss_pred CCHHHHHHHHHHHHHCCHHHCCHHHHHH---------------HHHCCCHHHHHC---CCHHHHHHH------------H Q ss_conf 8704676644673116120332658998---------------774899789872---998999999------------7 Q gi|254780479|r 37 PSPYKVWISEIMLQQTTVKTVEPYFKKF---------------MQKWPTIFCLSS---AKDEEILSA------------W 86 (356) Q Consensus 37 ~~py~v~vseimlqqT~v~~v~~~~~~~---------------~~~~P~~~~la~---a~~~~vl~~------------w 86 (356) +||++.++|=|-.----|++...--++. ..-||+.+.||. .+.||+-.. - T Consensus 146 kdP~EcliSfIcSsNnni~RiTrm~e~lc~~fG~~~~~~dgvtyH~FP~~~~LtgvaeGsledl~~~E~nlPsdfsfn~L 225 (379) T TIGR00588 146 KDPFECLISFICSSNNNIARITRMVERLCQAFGPRLITLDGVTYHGFPSLHALTGVAEGSLEDLPEAEANLPSDFSFNHL 225 (379) T ss_pred CCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHHHHHCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH T ss_conf 78715667788750465212543389999862203212246211479875665212100166799987515643462657 Q ss_pred --HCCCHHHHHHHHHHHHHHHHHCCCCC-C----------------CCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEE-C Q ss_conf --40463458899987767543005786-8----------------73145688888777877667766434963131-2 Q gi|254780479|r 87 --AGLGYYTRARNLKKCADIIVKKYEGN-F----------------PHKVEILKKLPGIGDYTASAIVAIAFNHFAVV-V 146 (356) Q Consensus 87 --~gLGyy~Rar~l~~~a~~i~~~~~g~-~----------------P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~-v 146 (356) =|||| |||.+.++|+.|+++-++. + -+-.+.|+.|||||++.|.+|+..|++++.++ | T Consensus 226 R~lG~GY--RA~Yi~~tar~l~ee~~~~nitsdta~LQ~ic~~~~Yedar~~L~~l~GVG~KVADCicLmgl~k~~avPV 303 (379) T TIGR00588 226 RKLGLGY--RARYIRETARALLEEQGGRNITSDTAWLQQICKDADYEDAREALLELPGVGPKVADCICLMGLDKPQAVPV 303 (379) T ss_pred HHCCCCC--CCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCCEEE T ss_conf 5248875--40689999999884126642002357999860668867899997216999704888888652278971011 Q ss_pred CCCHHHHHHHHHCCC Q ss_conf 431466657651975 Q gi|254780479|r 147 DTNIERIISRYFDII 161 (356) Q Consensus 147 D~Nv~RVl~R~~~~~ 161 (356) |.+|.||..|-|.+. T Consensus 304 DVh~~~Ia~rdy~~s 318 (379) T TIGR00588 304 DVHVRRIAKRDYQWS 318 (379) T ss_pred HHHHHHHHHHCCCCC T ss_conf 156888864403531 No 21 >KOG2875 consensus Probab=99.10 E-value=1.2e-10 Score=95.46 Aligned_cols=115 Identities=25% Similarity=0.461 Sum_probs=93.7 Q ss_pred CCHHHHHHHHHHHHHCCHHHCCHHHHHHHH---------------HCCCHHHHHC-CCHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 870467664467311612033265899877---------------4899789872-998999999740463458899987 Q gi|254780479|r 37 PSPYKVWISEIMLQQTTVKTVEPYFKKFMQ---------------KWPTIFCLSS-AKDEEILSAWAGLGYYTRARNLKK 100 (356) Q Consensus 37 ~~py~v~vseimlqqT~v~~v~~~~~~~~~---------------~~P~~~~la~-a~~~~vl~~w~gLGyy~Rar~l~~ 100 (356) +||+..+.|=|-.+.--+++...--++|.. .|||++.|+. +.|.++.++ |+|| |||.+-. T Consensus 115 QdP~E~lfSFiCSSNNNIaRIT~Mve~fc~~fG~~i~~~dg~~~h~FPsl~~L~g~~~Ea~LR~~--gfGY--RAkYI~~ 190 (323) T KOG2875 115 QDPIECLFSFICSSNNNIARITGMVERFCQAFGPRIIQLDGVDYHGFPSLQALAGPEVEAELRKL--GFGY--RAKYISA 190 (323) T ss_pred CCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCHHHHCCHHHHHHHHHC--CCCH--HHHHHHH T ss_conf 17197889887257875999999999999861750375557120368547776474769999981--7644--6899999 Q ss_pred HHHHHHHCCCC----------CCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEE-ECCCCHHHHHH Q ss_conf 76754300578----------687314568888877787766776643496313-12431466657 Q gi|254780479|r 101 CADIIVKKYEG----------NFPHKVEILKKLPGIGDYTASAIVAIAFNHFAV-VVDTNIERIIS 155 (356) Q Consensus 101 ~a~~i~~~~~g----------~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~-~vD~Nv~RVl~ 155 (356) +|+.|+++++| .+-+-.+.|..|||||+..|.+|+.+++|.... +||.||.|+.. T Consensus 191 ta~~l~~~~g~~~wLqslr~~~yeear~~L~~lpGVG~KVADCI~Lm~l~~~~~VPVDvHi~ria~ 256 (323) T KOG2875 191 TARALQEKQGGLAWLQSLRKSSYEEAREALCSLPGVGPKVADCICLMSLDKLSAVPVDVHIWRIAQ 256 (323) T ss_pred HHHHHHHHCCCCHHHHHHHCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHH T ss_conf 999999723500599988545289999998528887614756223120588765562245888762 No 22 >COG0122 AlkA 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair] Probab=99.07 E-value=3.4e-09 Score=85.24 Aligned_cols=164 Identities=20% Similarity=0.309 Sum_probs=117.1 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCHHHCCHHHHHHHHH----------CCCHHHHHCCCHH Q ss_conf 999999962899887548554445678704676644673116120332658998774----------8997898729989 Q gi|254780479|r 11 KILDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQK----------WPTIFCLSSAKDE 80 (356) Q Consensus 11 ~ll~w~~~~~R~lpwr~~~~~~~~~~~~py~v~vseimlqqT~v~~v~~~~~~~~~~----------~P~~~~la~a~~~ 80 (356) .|-.+++..+. +|.+..++..-....|||..+||-|.-||--++.-...+.++.+. |||++.|+.++++ T Consensus 78 ~l~~i~~~~~~-~~~~~~~g~~~~~~~d~fe~lv~aI~~QqvS~~~A~~i~~rl~~~~g~~~~~~~~fptpe~l~~~~~~ 156 (285) T COG0122 78 DLAPIIDALGP-LPLLRAPGLRLPLAPDPFEALVRAILSQQVSVAAAAKIWARLVSLYGNALEIYHSFPTPEQLAAADEE 156 (285) T ss_pred CHHHHHHHCCC-CCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHCCHH T ss_conf 59999986465-65445556556666678999999999765059999999999999818766656679899999847999 Q ss_pred HHHHHHHCCCHH-HHHHHHHHHHHHHHHCC---CCCCC--C--HHHHHHHHHHHHHHHHHHHHHHHCCCE--EEECCCCH Q ss_conf 999997404634-58899987767543005---78687--3--145688888777877667766434963--13124314 Q gi|254780479|r 81 EILSAWAGLGYY-TRARNLKKCADIIVKKY---EGNFP--H--KVEILKKLPGIGDYTASAIVAIAFNHF--AVVVDTNI 150 (356) Q Consensus 81 ~vl~~w~gLGyy-~Rar~l~~~a~~i~~~~---~g~~P--~--~~~~l~~LpGiG~yta~ai~s~a~~~~--~~~vD~Nv 150 (356) ++. ++|-- +||+.|..+|+.+.+.. .---+ . -.+.|.+|+|||+.||..++.++++.+ .|+.|.-+ T Consensus 157 ~l~----~~g~s~~Ka~yi~~~A~~~~~g~~~~~~l~~~~~e~a~e~L~~i~GIG~WTAe~~llf~lgr~dvfP~~D~~l 232 (285) T COG0122 157 ALR----RCGLSGRKAEYIISLARAAAEGELDLSELKPLSDEEAIEELTALKGIGPWTAEMFLLFGLGRPDVFPADDLGL 232 (285) T ss_pred HHH----HHCCCHHHHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHH T ss_conf 998----8378577899999999999859965676625889999999873788679999999998168898677155999 Q ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 6665765197545735689998741025785 Q gi|254780479|r 151 ERIISRYFDIIKPAPLYHKTIKNYARKITST 181 (356) Q Consensus 151 ~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~ 181 (356) +|-+.++++. +....++.+..+++.+-|- T Consensus 233 r~~~~~~~~~--~~~~~~~~~~~~~e~w~p~ 261 (285) T COG0122 233 RRAIKKLYRL--PTRPTEKEVRELAERWGPY 261 (285) T ss_pred HHHHHHHHCC--CCCCHHHHHHHHHHHCCCH T ss_conf 9999997257--8785088999998505678 No 23 >TIGR00586 mutt mutator mutT protein; InterPro: IPR003561 MutT is a small bacterial protein (~12-15kDa) involved in the GO system responsible for removing an oxidatively damaged form of guanine (8-hydroxy- guanine or 7,8-dihydro-8-oxoguanine) from DNA and the nucleotide pool. 8-oxo-dGTP is inserted opposite dA and dC residues of template DNA with near equal efficiency, leading to A.T to G.C transversions. MutT specifically degrades 8-oxo-dGTP to the monophosphate, with the concomitant release of pyrophosphate. A short conserved N-terminal region of mutT (designated the MutT domain) is also found in a variety of other prokaryotic, viral, and eukaryotic proteins , , , . Recently, the generic name 'NUDIX hydrolases' (NUcleoside DIphosphate linked to some other moiety X) has been coined for this domain family . The solution structure of the MutT enzyme has been solved by NMR . The structure is globular and compact, and contains a five-stranded mixed beta-sheet. The parallel portion of the sheet is sandwiched between two alpha-helices, forming an alpha+beta fold. ; GO: 0008413 8-oxo-78-dihydroguanine triphosphatase activity, 0006281 DNA repair. Probab=98.80 E-value=4.9e-09 Score=84.14 Aligned_cols=116 Identities=15% Similarity=0.175 Sum_probs=91.2 Q ss_pred CCEEEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCC------CCCCCEECCCEEEEEEEEEEE Q ss_conf 21123200034137605776457201320474421111346721588863010------100002137358998315999 Q gi|254780479|r 236 PMRTGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAP------FTANWILCNTITHTFTHFTLT 309 (356) Q Consensus 236 ~~r~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~------~~~~~~~l~~ikH~fTH~~L~ 309 (356) +.....+.||+|.+|+|+|.||+.++-+++.||||+++.+.++.......++. .........++.|.|-..+|+ T Consensus 3 K~~QiavgiIrNe~gei~itrR~~g~~~a~~lEFPGGK~e~GETpeqa~vREl~EEigI~~~~~~l~e~l~~eyP~~~l~ 82 (150) T TIGR00586 3 KKLQIAVGIIRNENGEIFITRRAAGAHFANKLEFPGGKIEMGETPEQAVVRELQEEIGITPQHFELFEKLEYEYPQKRLT 82 (150) T ss_pred CEEEEEEEEEECCCCCEEEEECCCCCCHHCCCCCCCCCCCCCCCHHHHHHHHHHHHEEEEEECCHHEEEECCCCCCEEEH T ss_conf 60678888998188807887444562100024798981388989789776542220012320000000101306740000 Q ss_pred EEEEEEECCC------------------------CCCCCCCEEECHHHHHHCC-CCHHHHHHHHHCC Q ss_conf 9999998077------------------------6766781332278887479-9878999998448 Q gi|254780479|r 310 LFVWKTIVPQ------------------------IVIIPDSTWHDAQNLANAA-LPTVMKKALSAGG 351 (356) Q Consensus 310 i~v~~~~~~~------------------------~~~~~~~~Wv~~~el~~~~-LPs~~kKIL~alg 351 (356) +.+|.+..-. ....+...|+...+|+... +|..+.+++.+|- T Consensus 83 l~~~L~~~~~gep~~~~raada~~a~klefpggk~e~g~~~~w~~~~~L~~~~gfppan~~~i~~L~ 149 (150) T TIGR00586 83 LYVYLVQEWSGEPFGKERAADAHMANKLEFPGGKIEMGEEGEWVAITDLDEERGFPPANAELIQKLR 149 (150) T ss_pred HHHEEHHHHCCCCCHHHHHHHHHHHHHHCCCCCEEECCCCCCEEEEEECCCCCCCCCCCHHHHHHHC T ss_conf 2310014333881022455556665431278860221798517898414556788840268999961 No 24 >cd03427 MTH1 MutT homolog-1 (MTH1) is a member of the Nudix hydrolase superfamily. MTH1, the mammalian counterpart of MutT, hydrolyzes oxidized purine nucleoside triphosphates, such as 8-oxo-dGTP and 2-hydroxy-ATP, to monophosphates, thereby preventing the incorporation of such oxygen radicals during replication. This is an important step in the repair mechanism in genomic and mitochondrial DNA. Like other members of the Nudix family, it requires a divalent cation, such as Mg2+ or Mn2+, for activity, and contain the Nudix motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), that functions as a metal binding and catalytic site. MTH1 is predominantly localized in the cytoplasm and mitochondria. Structurally, this enzyme adopts a similar fold to MutT despite low sequence similarity outside the conserved nudix motif. The most distinctive structural difference between MutT and MTH1 is the presence of a beta-hairpin, which is absent in MutT. This results in a m Probab=98.66 E-value=1.5e-07 Score=73.74 Aligned_cols=108 Identities=15% Similarity=0.055 Sum_probs=73.0 Q ss_pred EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCC------CEECCCEEEEE--EEEEEEEEEE Q ss_conf 000341376057764572013204744211113467215888630101000------02137358998--3159999999 Q gi|254780479|r 242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTAN------WILCNTITHTF--THFTLTLFVW 313 (356) Q Consensus 242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~------~~~l~~ikH~f--TH~~L~i~v~ 313 (356) .++++.++|+|||.+|+.+ -..|+|+||+++.+.++.......++...+. ....+.+.+.+ .++.+.+++| T Consensus 4 tl~ii~~~~kiLl~~R~~~-~~~G~W~~PGGkvE~gEt~~~a~~REl~EEtGl~v~~~~~~g~~~~~~~~~~~~~~~~~f 82 (137) T cd03427 4 TLCFIKDPDKVLLLNRKKG-PGWGGWNGPGGKVEPGETPEECAIRELKEETGLTIDNLKLVGIIKFPFPGEEERYGVFVF 82 (137) T ss_pred EEEEEEECCEEEEEEECCC-CCCCEEECCEEECCCCCCHHHHHHHHHHHHHCEEEEEEEEEEEEEEEECCCCEEEEEEEE T ss_conf 9999998999999996999-999959795565679979999999999898692887138999999992799989999999 Q ss_pred EEECCCC-----CCCCCCEEECHHHHHHCCCCHHHHHHHHHC Q ss_conf 9980776-----766781332278887479987899999844 Q gi|254780479|r 314 KTIVPQI-----VIIPDSTWHDAQNLANAALPTVMKKALSAG 350 (356) Q Consensus 314 ~~~~~~~-----~~~~~~~Wv~~~el~~~~LPs~~kKIL~al 350 (356) .+..... ....+.+|++.+++.+++++...+.+|..+ T Consensus 83 ~~~~~~G~~~~~~~e~~l~Wv~~~el~~l~~~p~d~~~l~~l 124 (137) T cd03427 83 LATEFEGEPLKESEEGILDWFDIDDLPLLPMWPGDREWLPLM 124 (137) T ss_pred EEEEEEEEECCCCCCCEEEEEEHHHCCCCCCCCCHHHHHHHH T ss_conf 998352476798833368998498977589982368999999 No 25 >PRK10308 3-methyl-adenine DNA glycosylase II; Provisional Probab=98.63 E-value=3.9e-07 Score=70.79 Aligned_cols=136 Identities=21% Similarity=0.294 Sum_probs=97.4 Q ss_pred CCCCCCCHHHHHHHHHHHHHCCHHHCCHHHHHHHH-------------HCCCHHHHHCCCHHHHHHHHHCCCHH-HHHHH Q ss_conf 44567870467664467311612033265899877-------------48997898729989999997404634-58899 Q gi|254780479|r 32 EKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQ-------------KWPTIFCLSSAKDEEILSAWAGLGYY-TRARN 97 (356) Q Consensus 32 ~~~~~~~py~v~vseimlqqT~v~~v~~~~~~~~~-------------~~P~~~~la~a~~~~vl~~w~gLGyy-~Rar~ 97 (356) .-+..-|+|+..|=-|+=||-.|+.....-.|+++ .|||++.||.++.+++ .+||.- +|++. T Consensus 106 RvPg~~d~fE~~vrAIlGQQvSv~aA~tl~~Rlv~~~G~~~~~~~~~~~FPtp~~la~~~~~~L----~~lg~p~~ra~t 181 (283) T PRK10308 106 RLPGSVDAFEQGVRAILGQLVSVAMAAKLTAKVAQLYGERLDDFPDYVCFPTPQRLAAADPQAL----KALGMPLKRAEA 181 (283) T ss_pred CCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEECCCHHHHHCCCHHHH----HHCCCCHHHHHH T ss_conf 7888798899999999714021999999999999994895788988534799899853897566----455896689999 Q ss_pred HHHHHHHHHHC-CCCCCCCH----HHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCH--HHHHHHHHCCCCCCCHHHHH Q ss_conf 98776754300-57868731----4568888877787766776643496313124314--66657651975457356899 Q gi|254780479|r 98 LKKCADIIVKK-YEGNFPHK----VEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNI--ERIISRYFDIIKPAPLYHKT 170 (356) Q Consensus 98 l~~~a~~i~~~-~~g~~P~~----~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv--~RVl~R~~~~~~~~~~~~k~ 170 (356) |...|+.+.+. ..-.-+.| .+.|+.|||||+-||.-|+--+++.|..-..|-+ +| ++-+. ..++ T Consensus 182 l~~lA~a~~~g~l~l~~~~d~~~~~~~L~~l~GIGpWTa~Yv~mR~lg~pD~fp~~Dl~l~~---~l~~~------~~~~ 252 (283) T PRK10308 182 LIHLANAALEGTLPLTAPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ---RFPGM------TPAQ 252 (283) T ss_pred HHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCCCCHHHHH---HHCCC------CHHH T ss_conf 99999999669866777899999999987367978899999999837899878760299997---61359------9999 Q ss_pred HHHHHHCCCC Q ss_conf 9874102578 Q gi|254780479|r 171 IKNYARKITS 180 (356) Q Consensus 171 l~~~~~~~~~ 180 (356) +...++.+-| T Consensus 253 ~~~~a~~W~P 262 (283) T PRK10308 253 IRRYAERWKP 262 (283) T ss_pred HHHHHHCCCC T ss_conf 9999753588 No 26 >cd04691 Nudix_Hydrolase_32 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=98.57 E-value=2.6e-07 Score=71.95 Aligned_cols=93 Identities=14% Similarity=0.246 Sum_probs=61.9 Q ss_pred EEEEEEECCCEEEEEECC-CCHHHHCCCCCCCCCCCCCCCHHHHHHCCCC------CCCCEECCCEEEEEEEEEEEEEEE Q ss_conf 200034137605776457-2013204744211113467215888630101------000021373589983159999999 Q gi|254780479|r 241 AVFIAITNDNRILLRKRT-NTRLLEGMDELPGSAWSSTKDGNIDTHSAPF------TANWILCNTITHTFTHFTLTLFVW 313 (356) Q Consensus 241 ~~~vii~~~~kiLL~KRp-~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~------~~~~~~l~~ikH~fTH~~L~i~v~ 313 (356) .+.+++ ++|++||.||+ .++..+|+||||+++.+.++.......++.. ......+.++.|..++..+ +++| T Consensus 3 Vagii~-~~~kvLl~rR~~~~~~~~G~WefPGGkvE~gEt~~~Al~REl~EElgI~v~~~~~~~~~~~~~~~~~~-l~~~ 80 (117) T cd04691 3 VVGVLF-SDDKVLLERRSLTKNADPGKLNIPGGHIEAGESQEEALLREVQEELGVDPLSYTYLCSLYHPTSELQL-LHYY 80 (117) T ss_pred EEEEEE-ECCEEEEEECCCCCCCCCCEEECCEECCCCCCCHHHHHHHHHHHHHCEEEECCEEEEEEEECCCCEEE-EEEE T ss_conf 999999-89999999889999999996989745177988999999999889819156224589999855999699-9999 Q ss_pred EEEC----CCCCCCCCCEEECHHHHH Q ss_conf 9980----776766781332278887 Q gi|254780479|r 314 KTIV----PQIVIIPDSTWHDAQNLA 335 (356) Q Consensus 314 ~~~~----~~~~~~~~~~Wv~~~el~ 335 (356) .+.. +.....++.+|++.+||- T Consensus 81 ~~~~~~G~~~~~E~~e~~Wv~~~el~ 106 (117) T cd04691 81 VVTFWQGEIPAQEAAEVHWMTANDIV 106 (117) T ss_pred EEEEECCEECCCCCCCCEECCHHHCC T ss_conf 99998176643347705771699953 No 27 >cd04696 Nudix_Hydrolase_37 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=98.50 E-value=8.3e-07 Score=68.47 Aligned_cols=103 Identities=17% Similarity=0.185 Sum_probs=66.4 Q ss_pred EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCE------ECCCEEEEE-------EEEE Q ss_conf 200034137605776457201320474421111346721588863010100002------137358998-------3159 Q gi|254780479|r 241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWI------LCNTITHTF-------THFT 307 (356) Q Consensus 241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~------~l~~ikH~f-------TH~~ 307 (356) ..++|++.+|++||.||+. +.|+|+||++..+.++........+...+... .++...+.+ +|+ T Consensus 5 V~avI~~~~g~vLl~r~~~---~~g~W~lPGG~ve~gEs~~~aa~RE~~EEtGl~v~~~~~~~~~~~~~~~~~~~~~h~- 80 (125) T cd04696 5 VGALIYAPDGRILLVRTTK---WRGLWGVPGGKVEWGETLEEALKREFREETGLKLRDIKFAMVQEAIFSEEFHKPAHF- 80 (125) T ss_pred EEEEEEECCCEEEEEEECC---CCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEECCCCCCCCCCCEE- T ss_conf 9999994999899999369---999598986588699599999999999975957653589988822126767777659- Q ss_pred EEEEEEEEECCC-----CCCCCCCEEECHHHHHHCCCCHHHHHHHH Q ss_conf 999999998077-----67667813322788874799878999998 Q gi|254780479|r 308 LTLFVWKTIVPQ-----IVIIPDSTWHDAQNLANAALPTVMKKALS 348 (356) Q Consensus 308 L~i~v~~~~~~~-----~~~~~~~~Wv~~~el~~~~LPs~~kKIL~ 348 (356) -+..|.+.... .....+++|++++++.+++|+..+|.+|+ T Consensus 81 -~~~~~~~~~~~~~~~~~~E~~e~~Wv~~~el~~~~l~~~~r~~ie 125 (125) T cd04696 81 -VLFDFFARTDGTEVTPNEEIVEWEWVTPEEALDYPLNSFTRLLLE 125 (125) T ss_pred -EEEEEEEEEECCCCCCCCCCCEEEEECHHHHHCCCCCHHHHHHHC T ss_conf -999999998089277783542999943889565988868876519 No 28 >cd04699 Nudix_Hydrolase_39 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=98.48 E-value=8.9e-07 Score=68.22 Aligned_cols=102 Identities=15% Similarity=0.147 Sum_probs=65.0 Q ss_pred EEEEEEEECCCEEEEEECCC-CHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCE------ECCCE--EEEEEEEEEEE Q ss_conf 32000341376057764572-01320474421111346721588863010100002------13735--89983159999 Q gi|254780479|r 240 GAVFIAITNDNRILLRKRTN-TRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWI------LCNTI--THTFTHFTLTL 310 (356) Q Consensus 240 ~~~~vii~~~~kiLL~KRp~-~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~------~l~~i--kH~fTH~~L~i 310 (356) ....+|++.+|++||.||.. +..++|+|+||++..+.++.......++...+... ..... .+...+++..+ T Consensus 3 ~V~~~I~~~~g~vLl~kR~~~~~~~~G~W~~PGG~ie~gEt~~~aa~REl~EEtGl~v~~~~~~~~~~~~~~~~~~~~~~ 82 (129) T cd04699 3 AVAALIVKDVGRILILKRSKDERTAPGKWELPGGKVEEGETFEEALKREVYEETGLTVTPFLRYPSTVTHEDSGVYNVIY 82 (129) T ss_pred EEEEEEEECCCEEEEEEECCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEEECCCCCEEEEE T ss_conf 99999998999899998089999999818898050659989999999999998795778768876999972799508999 Q ss_pred EEEEEECCCC-----CCCCCCEEECHHHHHHCCCCH Q ss_conf 9999980776-----766781332278887479987 Q gi|254780479|r 311 FVWKTIVPQI-----VIIPDSTWHDAQNLANAALPT 341 (356) Q Consensus 311 ~v~~~~~~~~-----~~~~~~~Wv~~~el~~~~LPs 341 (356) .+|.+..... ....+++|+++++|.+++... T Consensus 83 ~~f~~~~~~~~~~~~~E~~e~~Wv~~eeL~~L~~d~ 118 (129) T cd04699 83 LVFVCEALSGAVKLSDEHEEYAWVTLEELAILKADI 118 (129) T ss_pred EEEEEEECCCCCCCCCCCEEEEEEEHHHHHHCCCCC T ss_conf 999986248868788652066998489975487975 No 29 >cd04667 Nudix_Hydrolase_10 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=98.41 E-value=2.2e-06 Score=65.54 Aligned_cols=101 Identities=14% Similarity=0.146 Sum_probs=65.3 Q ss_pred EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCEEEEEE--EEEEEEEEEEEECCC Q ss_conf 000341376057764572013204744211113467215888630101000021373589983--159999999998077 Q gi|254780479|r 242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTITHTFT--HFTLTLFVWKTIVPQ 319 (356) Q Consensus 242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~ikH~fT--H~~L~i~v~~~~~~~ 319 (356) +.||.+++|++||.||+ +|+|+||++..+.++....................+.+.++ +.+-..++|.+.+.. T Consensus 3 A~vi~~~~g~vLLvrr~-----~g~W~lPGG~ve~GEt~~~Aa~RE~~EEtGl~~~~~~~~~~~~~~~~~~~~f~~~~~~ 77 (112) T cd04667 3 ATVICRRGGRVLLVRKS-----GSRWALPGGKIEPGETPLQAARRELQEETGLQGLDLLYLFHVDGGSTRHHVFVASVPP 77 (112) T ss_pred EEEEEECCCEEEEEEEC-----CCCEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEECCCEEEEEEEEEECC T ss_conf 89999709999999918-----9938898748889989999999999998693765469999990898899999999758 Q ss_pred ------CCCCCCCEEECHHHHHHCCCCHHHHHHH Q ss_conf ------6766781332278887479987899999 Q gi|254780479|r 320 ------IVIIPDSTWHDAQNLANAALPTVMKKAL 347 (356) Q Consensus 320 ------~~~~~~~~Wv~~~el~~~~LPs~~kKIL 347 (356) .....+.+|++.+++.+++++...+.|+ T Consensus 78 ~~~~~~~~E~~e~rW~~~del~el~~~~~~R~iv 111 (112) T cd04667 78 SAQPKPSNEIADCRWLSLDALGDLNASAATRLIV 111 (112) T ss_pred CCCCCCCCCCEEEEEEEHHHCCCCCCCHHHHHHC T ss_conf 8641899742147998699963688994672101 No 30 >cd04681 Nudix_Hydrolase_22 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=98.33 E-value=5.4e-06 Score=62.72 Aligned_cols=105 Identities=14% Similarity=0.241 Sum_probs=65.2 Q ss_pred EEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCC------EECCCEEEEEE----EEEEEEEE Q ss_conf 003413760577645720132047442111134672158886301010000------21373589983----15999999 Q gi|254780479|r 243 FIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANW------ILCNTITHTFT----HFTLTLFV 312 (356) Q Consensus 243 ~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~------~~l~~ikH~fT----H~~L~i~v 312 (356) .+|++.+|++||.||.+. -..|+|+||++..+.++.......++...+.. ..+..+.+.++ +++.-..+ T Consensus 6 ~vI~~~~g~vLl~kR~~~-~~~g~W~lPGG~ve~gEt~~~aa~REl~EEtGl~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (130) T cd04681 6 VLILNEDGELLVVRRARE-PGKGTLDLPGGFVDPGESAEEALIREIREETGLKVTELSYLFSLPNTYPYGGMEYDTLDLF 84 (130) T ss_pred EEEEECCCEEEEEEECCC-CCCCCEECCEEECCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEECCCCCCCCEEEEEEEEE T ss_conf 999968999999996899-8899092988206599999999989999986954688689999764554698238999999 Q ss_pred EEEECCCC------CCCCCCEEECHHHHH--HCCCCHHHHHHHHH Q ss_conf 99980776------766781332278887--47998789999984 Q gi|254780479|r 313 WKTIVPQI------VIIPDSTWHDAQNLA--NAALPTVMKKALSA 349 (356) Q Consensus 313 ~~~~~~~~------~~~~~~~Wv~~~el~--~~~LPs~~kKIL~a 349 (356) |.+..... ....+.+|++++||. ++++|+ +++.|.. T Consensus 85 ~~~~~~~~~~~~~~~E~~e~~W~~~~el~~~~~afp~-~~~~i~~ 128 (130) T cd04681 85 FVCQVDDKPIVKAPDDVAELKWVVPQDIELENFAFPS-IRQAVER 128 (130) T ss_pred EEEEECCCCCCCCCCCEEEEEEECHHHCCHHHCCCHH-HHHHHHH T ss_conf 9999879874799982516899868789710259866-9999998 No 31 >cd04683 Nudix_Hydrolase_24 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=98.10 E-value=1.9e-05 Score=58.93 Aligned_cols=100 Identities=16% Similarity=0.233 Sum_probs=61.6 Q ss_pred EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECC--C--EE---EEEEE--EEEEEE Q ss_conf 200034137605776457201320474421111346721588863010100002137--3--58---99831--599999 Q gi|254780479|r 241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCN--T--IT---HTFTH--FTLTLF 311 (356) Q Consensus 241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~--~--ik---H~fTH--~~L~i~ 311 (356) .++++++.++++||.||.+.+...|+|+||++..+.++........+..++....+. . .. |..+. ...-.. T Consensus 2 av~vvl~~~~~vLL~~R~~~~~~~G~W~~PGG~ve~GEt~~~aa~REl~EEtGl~v~~~~l~~~~~~~~~~~~~~~~~~~ 81 (120) T cd04683 2 AVYVLLRRDDEVLLQRRANTGYMDGQWALPAGHLEKGEDAVTAAVREAREEIGVTLDPEDLRLAHTMHRRTEDIESRIGL 81 (120) T ss_pred EEEEEEEECCEEEEEECCCCCCCCCEEECCEEECCCCCCHHHHHHHHHHHHHCCEEEECEEEEEEEEECCCCCCCEEEEE T ss_conf 89999993999999977899999997989758877999999999999888769388734089999883268998799999 Q ss_pred EEEEECC-------CCCCCCCCEEECHHHHHHCCCC Q ss_conf 9999807-------7676678133227888747998 Q gi|254780479|r 312 VWKTIVP-------QIVIIPDSTWHDAQNLANAALP 340 (356) Q Consensus 312 v~~~~~~-------~~~~~~~~~Wv~~~el~~~~LP 340 (356) +|.+... .+....+..|+++++|-...+| T Consensus 82 ~f~~~~~~g~p~~~~p~e~~~~~Wf~~deLP~~~~~ 117 (120) T cd04683 82 FFTVRRWSGEPRNCEPDKCAELRWFPLDALPDDTVD 117 (120) T ss_pred EEEEEEECCEEECCCCCCEEEEEEECHHHCCCCCCC T ss_conf 999997478986189981458999998899886577 No 32 >cd04664 Nudix_Hydrolase_7 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate s Probab=98.09 E-value=6.2e-05 Score=55.24 Aligned_cols=102 Identities=13% Similarity=0.083 Sum_probs=65.2 Q ss_pred EEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCC--------EEE--EE--EEEEEEEEE Q ss_conf 341376057764572013204744211113467215888630101000021373--------589--98--315999999 Q gi|254780479|r 245 AITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNT--------ITH--TF--THFTLTLFV 312 (356) Q Consensus 245 ii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~--------ikH--~f--TH~~L~i~v 312 (356) .+++++++||.||+++ +.|+|+||++..+.++........+...+....... ... .+ .+....+++ T Consensus 10 ~~~~~~~vLll~Rs~~--~~G~W~~p~G~ve~gE~~~~aA~REi~EETGl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (129) T cd04664 10 RLTGEGRVLLLRRSDK--YAGFWQSVTGGIEDGESPAEAARREVAEETGLDPERLTLLDRGASIAFVEFTDNGRVWTEHP 87 (129) T ss_pred EECCCCEEEEEECCCC--CCCCCCCCEEECCCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEEECCCCEEEEEEE T ss_conf 9889998999987999--98939198732289979999999999998889830179986145899999658985899999 Q ss_pred EEEECCCC------CCCCCCEEECHHHHHHCCCCHHHHHHHH Q ss_conf 99980776------7667813322788874799878999998 Q gi|254780479|r 313 WKTIVPQI------VIIPDSTWHDAQNLANAALPTVMKKALS 348 (356) Q Consensus 313 ~~~~~~~~------~~~~~~~Wv~~~el~~~~LPs~~kKIL~ 348 (356) |.+.++.. ..+.+++|++.++..++...-..|++|. T Consensus 88 f~~~v~~~~~v~l~~EH~~~~W~~~~ea~~~l~~~~~r~a~~ 129 (129) T cd04664 88 FAFHLPSDAVVTLDWEHDAFEWVPPEEAAALLLWESNRRAWA 129 (129) T ss_pred EEEECCCCCCCCCCCCCCCEEECCHHHHHHHCCCHHHHHHHC T ss_conf 999867898776874447218836999997659933668759 No 33 >cd04673 Nudix_Hydrolase_15 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=98.06 E-value=1.8e-05 Score=58.98 Aligned_cols=102 Identities=17% Similarity=0.129 Sum_probs=61.6 Q ss_pred EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCC------EECCCEEEEE------EEEEEE Q ss_conf 0003413760577645720132047442111134672158886301010000------2137358998------315999 Q gi|254780479|r 242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANW------ILCNTITHTF------THFTLT 309 (356) Q Consensus 242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~------~~l~~ikH~f------TH~~L~ 309 (356) +..++.++|++||.||... -..|+|+||++..+.+++.......+...+.. ..++.+.+.. .+++.. T Consensus 3 v~~ii~~~~kvLL~~R~~~-~~~g~W~lPGG~ve~gEs~~~a~~REv~EEtGl~v~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (122) T cd04673 3 VGAVVFRGGRVLLVRRANP-PDAGLWSFPGGKVELGETLEQAALRELLEETGLEAEVGRLLTVVDVIERDAAGRVEFHYV 81 (122) T ss_pred EEEEEEECCEEEEEEECCC-CCCCEEECCEEEEECCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEEECCCCCCCEEEEEE T ss_conf 9999998999999996999-999979998968578999999999999998897647606999991020289996589999 Q ss_pred EEEEEEECCC-----CCCCCCCEEECHHHHHHCCCCHHHH Q ss_conf 9999998077-----6766781332278887479987899 Q gi|254780479|r 310 LFVWKTIVPQ-----IVIIPDSTWHDAQNLANAALPTVMK 344 (356) Q Consensus 310 i~v~~~~~~~-----~~~~~~~~Wv~~~el~~~~LPs~~k 344 (356) +.+|.+.... .....+.+|++.++|.+++|+-..| T Consensus 82 ~~~f~~~~~~g~~~~~~E~~e~~W~~~del~~l~l~~~~R 121 (122) T cd04673 82 LIDFLCRYLGGEPVAGDDALDARWVPLDELAALSLTESTR 121 (122) T ss_pred EEEEEEEECCCEEECCCCEEEEEEEEHHHCCCCCCCCCCC T ss_conf 9999999739988419883888998699914798993436 No 34 >cd04511 Nudix_Hydrolase_4 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, U=I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate specifici Probab=98.05 E-value=2.5e-05 Score=57.98 Aligned_cols=104 Identities=13% Similarity=0.176 Sum_probs=64.9 Q ss_pred EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEE--CCCEEEEEE--EEEEEEEEEEEEC Q ss_conf 000341376057764572013204744211113467215888630101000021--373589983--1599999999980 Q gi|254780479|r 242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWIL--CNTITHTFT--HFTLTLFVWKTIV 317 (356) Q Consensus 242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~--l~~ikH~fT--H~~L~i~v~~~~~ 317 (356) +.+++.++|+|||.||... --.|+|++|++..+.++.......++..++.... ...+..+|+ |......+|.+.. T Consensus 16 v~~ii~~~~kILLvrR~~~-P~~g~W~lPGG~vE~gEt~eea~~REv~EEtGl~v~~~~~~~v~~~p~~~~v~~~f~~~~ 94 (130) T cd04511 16 VGCVPEWEGKVLLCRRAIE-PRHGFWTLPAGFMENGETTEQGALRETWEEAGARVEIDGLYAVYSVPHISQVYMFYRARL 94 (130) T ss_pred EEEEEEECCEEEEEECCCC-CCCCCEECCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEEEEECCCCCCEEEEEEEEEE T ss_conf 9999986999999940689-999958689748658989999999899874097899626999945787767999999999 Q ss_pred CC-----CCCCCCCEEECHHHHH--HCCCCHHHHHHH Q ss_conf 77-----6766781332278887--479987899999 Q gi|254780479|r 318 PQ-----IVIIPDSTWHDAQNLA--NAALPTVMKKAL 347 (356) Q Consensus 318 ~~-----~~~~~~~~Wv~~~el~--~~~LPs~~kKIL 347 (356) .. .....+.+|++.++|. ++++|+. |+.| T Consensus 95 ~~g~~~~~~E~~e~~wf~~delP~~~lAf~s~-~~al 130 (130) T cd04511 95 LDLDFAPGPESLEVRLFTEEEIPWDELAFPTV-RRTL 130 (130) T ss_pred ECCCCCCCCCEEEEEEECHHHCCHHHCCHHHH-HHHC T ss_conf 12767999520167788788997065451655-6319 No 35 >cd04690 Nudix_Hydrolase_31 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=98.03 E-value=1.8e-05 Score=59.02 Aligned_cols=96 Identities=17% Similarity=0.150 Sum_probs=62.8 Q ss_pred EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEE--------CCCEE---EEEEEEEEE Q ss_conf 2000341376057764572013204744211113467215888630101000021--------37358---998315999 Q gi|254780479|r 241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWIL--------CNTIT---HTFTHFTLT 309 (356) Q Consensus 241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~--------l~~ik---H~fTH~~L~ 309 (356) .+.++++.+|++||.||..+ |.|+||++..+.++........+...+.... .+.+. |....+.+. T Consensus 3 ~~a~v~~~~grvLl~rr~~~----~~W~lPGG~vE~gEt~~~a~~REl~EE~Gl~v~~~~l~~~~~~~~~~~~~~~~~~~ 78 (118) T cd04690 3 AAALILVRDGRVLLVRKRGT----DVFYLPGGKIEAGETPLQALIRELSEELGLDLDPDSLEYLGTFRAPAANEPGVDVR 78 (118) T ss_pred EEEEEEECCCEEEEEEECCC----CEEECCCCCCCCCCCHHHHHHHHHHHHHCCEEECCEEEEEEEEEECCCCCCCEEEE T ss_conf 99999989998999997799----94989963765997999999999989759498602079989997216679998999 Q ss_pred EEEEEEECCCC----CCCCCCEEECHHHHHHCCCC Q ss_conf 99999980776----76678133227888747998 Q gi|254780479|r 310 LFVWKTIVPQI----VIIPDSTWHDAQNLANAALP 340 (356) Q Consensus 310 i~v~~~~~~~~----~~~~~~~Wv~~~el~~~~LP 340 (356) +++|.+..... ....+.+|++++++.+++|. T Consensus 79 ~~~~~~~~~g~~~~~~E~~e~~Wv~~~el~~~~la 113 (118) T cd04690 79 ATVYVAELTGEPVPAAEIEEIRWVDYDDPADDRLA 113 (118) T ss_pred EEEEEEEEECCCCCCCCCCCEEEECHHHCCCCCCC T ss_conf 99999998188457863100299579897689989 No 36 >cd04676 Nudix_Hydrolase_17 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=98.02 E-value=3e-05 Score=57.51 Aligned_cols=105 Identities=17% Similarity=0.202 Sum_probs=65.0 Q ss_pred EEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECC--C-------EEEEEEE----- Q ss_conf 3200034137605776457201320474421111346721588863010100002137--3-------5899831----- Q gi|254780479|r 240 GAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCN--T-------ITHTFTH----- 305 (356) Q Consensus 240 ~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~--~-------ikH~fTH----- 305 (356) ....++++.+|++||.||... |.|.||++..+.++........+...+...... . ..|.++. T Consensus 4 gv~~vv~~~~g~vLL~~r~~~----g~W~lPGG~ve~GE~~~~aa~RE~~EEtGl~v~~~~~~~~~~~~~~~~~~~~~~~ 79 (129) T cd04676 4 GVTAVVRDDEGRVLLIRRSDN----GLWALPGGAVEPGESPADTAVREVREETGLDVEVTGLVGIYTGPVHVVTYPNGDV 79 (129) T ss_pred EEEEEEEECCCEEEEEEECCC----CEEECCCCEECCCCCHHHHHHHHHHHHHCCEEEEEEEEEEECCCCCCEECCCCCE T ss_conf 799999979980999998699----9698992574199898998889999976957787279999726442162799986 Q ss_pred EEEEEEEEEEECCC------CCCCCCCEEECHHHHHHCCCCHHHHHHHH Q ss_conf 59999999998077------67667813322788874799878999998 Q gi|254780479|r 306 FTLTLFVWKTIVPQ------IVIIPDSTWHDAQNLANAALPTVMKKALS 348 (356) Q Consensus 306 ~~L~i~v~~~~~~~------~~~~~~~~Wv~~~el~~~~LPs~~kKIL~ 348 (356) .+....+|.+.... .....+.+|++++++..++++-.+|..++ T Consensus 80 ~~~~~~~f~~~~~~~~~~~~~~E~~e~~Wf~~delp~l~~~p~~R~~i~ 128 (129) T cd04676 80 RQYLDITFRCRVVGGELRVGDDESLDVAWFDPDGLPPLLMHPSMRLRID 128 (129) T ss_pred EEEEEEEEEEEEECCCCCCCCCCEEEEEEECHHHCCCCCCCCCCHHHCC T ss_conf 9999999998872884559997178889989989969999945043337 No 37 >cd04682 Nudix_Hydrolase_23 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=98.01 E-value=2.7e-05 Score=57.83 Aligned_cols=97 Identities=11% Similarity=0.017 Sum_probs=58.5 Q ss_pred EEEEECCCEEEEEECCCC-HH-HHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECC----CEEEEEEE--EEEEEEEEE Q ss_conf 003413760577645720-13-20474421111346721588863010100002137----35899831--599999999 Q gi|254780479|r 243 FIAITNDNRILLRKRTNT-RL-LEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCN----TITHTFTH--FTLTLFVWK 314 (356) Q Consensus 243 ~vii~~~~kiLL~KRp~~-gl-l~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~----~ikH~fTH--~~L~i~v~~ 314 (356) ..++..+|++||.+|++. ++ |.|+|+||++..+.++........+..++....+. ...+.|++ .....++|. T Consensus 5 ~al~~~~g~vLl~~R~~~~~~~~pg~W~~pGG~~e~~E~~~~~~~RE~~EE~gl~~~~~~~~~~~~~~~~~~~~~~~~F~ 84 (122) T cd04682 5 LALLIGDGRLLLQLRDDKPGIPYPGHWDLPGGHREGGETPLECVLRELLEEIGLTLPESRIPWFRVYPSASPPGTEHVFV 84 (122) T ss_pred EEEEECCCEEEEEECCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCHHHEEEEEEECCCCCCCEEEEEE T ss_conf 99996799799998999999998993879966667998999999999999888856700020278521578994799999 Q ss_pred EECC-------CCCCCCCCEEECHHHHHHCCC Q ss_conf 9807-------767667813322788874799 Q gi|254780479|r 315 TIVP-------QIVIIPDSTWHDAQNLANAAL 339 (356) Q Consensus 315 ~~~~-------~~~~~~~~~Wv~~~el~~~~L 339 (356) +.+. ...+.++++|++.+|+-+.+. T Consensus 85 ~~l~~~~~~~~~~~Egq~~~W~~~eEll~~~~ 116 (122) T cd04682 85 VPLTAREDAILFGDEGQALRLMTVEEFLAHED 116 (122) T ss_pred EEEECCCCCCCCCCCCCEEEEEEHHHHHHCCC T ss_conf 84005631535687330768745999964836 No 38 >cd04677 Nudix_Hydrolase_18 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=97.98 E-value=5.5e-05 Score=55.61 Aligned_cols=105 Identities=18% Similarity=0.181 Sum_probs=61.9 Q ss_pred EEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECC--CEEEEEEEE---------- Q ss_conf 23200034137605776457201320474421111346721588863010100002137--358998315---------- Q gi|254780479|r 239 TGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCN--TITHTFTHF---------- 306 (356) Q Consensus 239 ~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~--~ikH~fTH~---------- 306 (356) .....++++.+|++||.||... |.|.||++..+.++.......++...+...... .+...|++. T Consensus 8 ~g~~~vv~~~~g~vLL~~r~~~----g~W~lPGG~ve~gEt~~~aa~RE~~EEtGi~~~~~~~~~~~~~~~~~~~~~~~~ 83 (132) T cd04677 8 VGAGVILLNEQGEVLLQKRSDT----GDWGLPGGAMELGESLEETARRELKEETGLEVEELELLGVYSGKEFYVKPNGDD 83 (132) T ss_pred EEEEEEEEECCCEEEEEEECCC----CCEECCEEEECCCCCHHHHHHHHHHHHHCCEEEEEEEEEEECCCEEEECCCCCC T ss_conf 5799999989998999998899----949898427669979999999999998695535247999977864554489984 Q ss_pred --EEEEEEEEEECC------CCCCCCCCEEECHHHHHHCCCCHHHHHHHH Q ss_conf --999999999807------767667813322788874799878999998 Q gi|254780479|r 307 --TLTLFVWKTIVP------QIVIIPDSTWHDAQNLANAALPTVMKKALS 348 (356) Q Consensus 307 --~L~i~v~~~~~~------~~~~~~~~~Wv~~~el~~~~LPs~~kKIL~ 348 (356) .....+|.+... ......+.+|++++++.++-+|. ++++|+ T Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~E~~e~~Wf~~delp~l~~p~-~r~~ie 132 (132) T cd04677 84 EQYIVTLYYVTKVFGGKLVPDGDETLELKFFSLDELPELINPQ-HKEMLQ 132 (132) T ss_pred EEEEEEEEEEEEECCCCCCCCCCCEEEEEEECHHHCCCCCCHH-HHHHCC T ss_conf 7999999999862587567999701589998299983448922-575609 No 39 >cd04684 Nudix_Hydrolase_25 Contains a crystal structure of the Nudix hydrolase from Enterococcus faecalis, which has an unknown function. In general, members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity. They also contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which forms a structural motif that functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability Probab=97.94 E-value=0.00013 Score=52.99 Aligned_cols=104 Identities=14% Similarity=0.169 Sum_probs=63.4 Q ss_pred EEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCC------EECCCEEEEEE------EEEEEE Q ss_conf 003413760577645720132047442111134672158886301010000------21373589983------159999 Q gi|254780479|r 243 FIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANW------ILCNTITHTFT------HFTLTL 310 (356) Q Consensus 243 ~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~------~~l~~ikH~fT------H~~L~i 310 (356) +.++.++|++||.||... -..|+|.||++..+.++........+...+.. ..++.+.+.++ +++.-. T Consensus 4 ~avi~~~~~vLL~rr~~~-~~~g~W~lPGG~ve~gEt~~~aa~REv~EETGl~v~~~~~l~~~~~~~~~~~~~~~~~~v~ 82 (128) T cd04684 4 YAVIPRDGKLLLIQKNGG-PYEGRWDLPGGGIEPGESPEEALHREVLEETGLTVEIGRRLGSASRYFYSPDGDYDAHHLC 82 (128) T ss_pred EEEEEECCEEEEEEECCC-CCCCEEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEEEEECCCCCEEEEEEE T ss_conf 999993999999997789-9898298985607799899999999999872937886338899999997899977679999 Q ss_pred EEEEEECCCCC--------CCCCCEEECHHHHHHCCCCHHHHHHH Q ss_conf 99999807767--------66781332278887479987899999 Q gi|254780479|r 311 FVWKTIVPQIV--------IIPDSTWHDAQNLANAALPTVMKKAL 347 (356) Q Consensus 311 ~v~~~~~~~~~--------~~~~~~Wv~~~el~~~~LPs~~kKIL 347 (356) .+|.+...... ...+..|++.+++.+..++-.+++.+ T Consensus 83 ~~f~~~~~~g~~~~~~~~ee~~e~~W~~~del~e~~~~P~~~~~v 127 (128) T cd04684 83 VFYDARVVGGALPVQEPGEDSHGAAWLPLDEAIERLLSPLVLWAV 127 (128) T ss_pred EEEEEEECCCCCCCCCCCCCEEEEEEECHHHHHCCCCCHHHHHHH T ss_conf 999999868940568999830599995889942312794787863 No 40 >cd04679 Nudix_Hydrolase_20 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=97.91 E-value=8e-05 Score=54.46 Aligned_cols=106 Identities=15% Similarity=0.116 Sum_probs=64.0 Q ss_pred EEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCC------EECCCEEEEEEE--EEEEEE Q ss_conf 320003413760577645720132047442111134672158886301010000------213735899831--599999 Q gi|254780479|r 240 GAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANW------ILCNTITHTFTH--FTLTLF 311 (356) Q Consensus 240 ~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~------~~l~~ikH~fTH--~~L~i~ 311 (356) ....++++.+|++||.||... -..|+|+||++..+.++........+...+.. ..++.+.|..+. ++.... T Consensus 4 ~v~avv~~~~g~vLL~~R~~~-~~~g~W~lPGG~vE~gEt~~~aa~REv~EEtGl~v~~~~l~~~~~~~~~~~~~~~~~~ 82 (125) T cd04679 4 GCGAAILRDDGKLLLVKRLRA-PEAGHWGIPGGKVDWMEAVEDAVVREIEEETGLSIHSTRLLCVVDHIIEEPPQHWVAP 82 (125) T ss_pred EEEEEEEECCCEEEEEEECCC-CCCCEEECCEEECCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEECCCCCCCCCEEEEE T ss_conf 699999977998999993899-9999793988305899899999999999987978585379999856435899659999 Q ss_pred EEEEECC-------CCCCCCCCEEECHHHHHHCCCCHHHHHHH Q ss_conf 9999807-------76766781332278887479987899999 Q gi|254780479|r 312 VWKTIVP-------QIVIIPDSTWHDAQNLANAALPTVMKKAL 347 (356) Q Consensus 312 v~~~~~~-------~~~~~~~~~Wv~~~el~~~~LPs~~kKIL 347 (356) +|.+... .+....+..|+++++|-. +++..++.+| T Consensus 83 ~f~~~~~~g~~~~~~~~e~~e~~Wf~~d~LP~-~l~~~~r~~i 124 (125) T cd04679 83 VYLAENFSGEPRLMEPDKLLELGWFALDALPQ-PLTRATRDAV 124 (125) T ss_pred EEEEEECCCEEECCCCCEEEEEEEECHHHCCC-CCCHHHHHHH T ss_conf 99999738878228998146887898889888-6598899974 No 41 >cd03430 GDPMH GDP-mannose glycosyl hydrolase (AKA GDP-mannose mannosyl hydrolase (GDPMH)) is a member of the Nudix hydrolase superfamily. This class of enzymes is unique from other members of the superfamily in two aspects. First, it contains a modified Nudix signature sequence. The slight changes to the conserved sequence motif, GX5EX7REUXEEXGU, where U = I, L or V), are believed to contribute to the removal of all magnesium binding sites but one, retaining only the metal site that coordinates the pyrophosphate of the substrate. Secondly, it is not a pyrophosphatase that substitutes at a phosphorus; instead, it hydrolyzes nucleotide sugars such as GDP-mannose to GDP and mannose, cleaving the phosphoglycosyl bond by substituting at a carbon position. GDP-mannose provides mannosyl components for cell wall synthesis and is required for the synthesis of other glycosyl donors (such as GDP-fucose and colitose) for the cell wall. The importance of GDP-sugar hydrolase activities is thus close Probab=97.85 E-value=0.00017 Score=52.23 Aligned_cols=97 Identities=16% Similarity=0.191 Sum_probs=61.2 Q ss_pred EEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCC--------CCCEECCCEEEEEE------- Q ss_conf 320003413760577645720132047442111134672158886301010--------00021373589983------- Q gi|254780479|r 240 GAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFT--------ANWILCNTITHTFT------- 304 (356) Q Consensus 240 ~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~--------~~~~~l~~ikH~fT------- 304 (356) ..-++|.+.+|++||.||... -++|.|.+|++..+.++.......+.... .....++.+.|.|+ T Consensus 14 ~V~~vI~n~~g~iLL~kR~~~-p~~g~W~~PGG~i~~gEt~~~a~~Re~~eE~Gl~i~~~~~~~~Gv~eh~y~d~~~~~~ 92 (144) T cd03430 14 SIDLIVENEDGQYLLGKRTNR-PAQGYWFVPGGRIRKNETLTEAFERIAKDELGLEFLISDAELLGVFEHFYDDNFFGDD 92 (144) T ss_pred EEEEEEECCCCCEEEEECCCC-CCCCEEECCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCCEEEEEEEEECCCCCCCCC T ss_conf 999999979996999981699-8899498983447799799999999999974971640344897758998267777898 Q ss_pred ---EEEEEEEEEEEECCCC------CCCCCCEEECHHHHHHCCC Q ss_conf ---1599999999980776------7667813322788874799 Q gi|254780479|r 305 ---HFTLTLFVWKTIVPQI------VIIPDSTWHDAQNLANAAL 339 (356) Q Consensus 305 ---H~~L~i~v~~~~~~~~------~~~~~~~Wv~~~el~~~~L 339 (356) |+-. ..|.+.+... ...++++|++.+++-+.+. T Consensus 93 ~~~H~V~--l~y~~~~~~~e~~~~ddeh~e~~Wf~~del~~~~~ 134 (144) T cd03430 93 FSTHYVV--LGYVLKLSSNELLLPDEQHSEYQWLTSDELLADDD 134 (144) T ss_pred CCEEEEE--EEEEEEECCCCCCCCHHHCCEEEEECHHHHHCCCC T ss_conf 5309999--99999955897559945700669967999521776 No 42 >cd04678 Nudix_Hydrolase_19 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=97.85 E-value=0.00014 Score=52.83 Aligned_cols=108 Identities=15% Similarity=0.120 Sum_probs=63.7 Q ss_pred EEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECC--CEEEEEE------EEEEEEE Q ss_conf 3200034137605776457201320474421111346721588863010100002137--3589983------1599999 Q gi|254780479|r 240 GAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCN--TITHTFT------HFTLTLF 311 (356) Q Consensus 240 ~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~--~ikH~fT------H~~L~i~ 311 (356) ....++++.+|++||.||... .-.|+|.||++..+.+++.......+...+....+. .+.+.+. +++.... T Consensus 4 gV~~iv~~~~g~vLL~rR~~~-~~~g~W~lPgG~ve~GEs~~~Aa~REv~EEtGl~v~~~~~~~~~~~~~~~~~~h~i~~ 82 (129) T cd04678 4 GVGVFVLNPKGKVLLGKRKGS-HGAGTWALPGGHLEFGESFEECAAREVLEETGLHIENVQFLTVTNDVFEEEGKHYVTI 82 (129) T ss_pred EEEEEEEECCCEEEEEEECCC-CCCCEEECCEEECCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEECCCCCCCCCEEEEE T ss_conf 799999989995999994789-9999498981040589889999999999987985252499999887757789469999 Q ss_pred EEEEECCC---------CCCCCCCEEECHHHHHHC-CCCHHHHHHHH Q ss_conf 99998077---------676678133227888747-99878999998 Q gi|254780479|r 312 VWKTIVPQ---------IVIIPDSTWHDAQNLANA-ALPTVMKKALS 348 (356) Q Consensus 312 v~~~~~~~---------~~~~~~~~Wv~~~el~~~-~LPs~~kKIL~ 348 (356) +|.+.... +....+.+|++.++|-++ |+-.+.+.+|+ T Consensus 83 ~f~~~~~~g~~~~~~~ep~e~~~~~Wf~~deLP~l~pl~~p~~~~~~ 129 (129) T cd04678 83 FVKAEVDDGEAEPNKMEPEKCEGWEWFDWEELPSVDPLFLPLKNLFE 129 (129) T ss_pred EEEEEEECCCCCCCCCCCHHCCEEEEECHHHCCCCCCCCHHHHHHHC T ss_conf 99999837823647799324013089998998698634178899649 No 43 >cd02883 Nudix_Hydrolase Nudix hydrolase is a superfamily of enzymes found in all three kingdoms of life, and it catalyzes the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+ for their activity. Members of this family are recognized by a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which forms a structural motif that functions as a metal binding and catalytic site. Substrates of nudix hydrolase include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance and "house-cleaning" enzy Probab=97.83 E-value=0.00025 Score=51.04 Aligned_cols=104 Identities=18% Similarity=0.226 Sum_probs=65.7 Q ss_pred EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEEC--CCEEEEEEE------EEEEEEEE Q ss_conf 0003413760577645720132047442111134672158886301010000213--735899831------59999999 Q gi|254780479|r 242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILC--NTITHTFTH------FTLTLFVW 313 (356) Q Consensus 242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l--~~ikH~fTH------~~L~i~v~ 313 (356) ..++++.++++||.||... ..|.|.||++..+.++.......++...+....+ ..+...|++ ......+| T Consensus 4 ~~ii~~~~~~vLl~~r~~~--~~~~w~~PgG~ve~gE~~~~aa~RE~~EEtGl~~~~~~~~~~~~~~~~~~~~~~~~~~f 81 (123) T cd02883 4 GAVILDEDGRVLLVRRADS--PGGLWELPGGGVEPGETLEEAAIREVREETGLDVDVLRLLGVYEVESPDEGEHAVVFVF 81 (123) T ss_pred EEEEEECCCEEEEEEECCC--CCCEEECCCEECCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEEECCCCCCEEEEEEE T ss_conf 9999979999999994289--98959998143159989899756678898594567758999999877789966999999 Q ss_pred EEECCCC-------CCCCCCEEECHHHHHHCCCCHHHHHHH Q ss_conf 9980776-------766781332278887479987899999 Q gi|254780479|r 314 KTIVPQI-------VIIPDSTWHDAQNLANAALPTVMKKAL 347 (356) Q Consensus 314 ~~~~~~~-------~~~~~~~Wv~~~el~~~~LPs~~kKIL 347 (356) .+..... ....+..|++.+++..++++..++..+ T Consensus 82 ~~~~~~~~~~~~~~~e~~~~~W~~~~el~~~~~~~~~~~~~ 122 (123) T cd02883 82 LARLVGGEPTLLPPDEISEVRWVTLDELPALALSPALRPAL 122 (123) T ss_pred EEEECCCCCCCCCCCCEEEEEEEEHHHHHCCCCCHHHHHHH T ss_conf 99995895212898668999998499913499983675965 No 44 >cd04670 Nudix_Hydrolase_12 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=97.82 E-value=0.0001 Score=53.78 Aligned_cols=107 Identities=13% Similarity=0.186 Sum_probs=61.0 Q ss_pred EEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECC-----CEEEEEEE-E-EEEEE Q ss_conf 23200034137605776457201320474421111346721588863010100002137-----35899831-5-99999 Q gi|254780479|r 239 TGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCN-----TITHTFTH-F-TLTLF 311 (356) Q Consensus 239 ~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~-----~ikH~fTH-~-~L~i~ 311 (356) .....++++.+|++||.|+... ..|.|+||++..+.++.......++...+...... .+.|.... + ...+. T Consensus 3 igv~avi~n~~~~vLlv~~~~~--~~g~W~lPgG~ve~GEt~~eaa~REv~EEtGl~v~~~~~l~~~~~~~~~~~~~~~~ 80 (127) T cd04670 3 VGVGGLVLNEKNEVLVVQERNK--TPNGWKLPGGLVDPGEDIFDGAVREVLEETGIDTEFVSVVGFRHAHPGAFGKSDLY 80 (127) T ss_pred EEEEEEEEECCCEEEEEEECCC--CCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEECCCCCCCCEEEE T ss_conf 7899999979999999997789--99958998203159979999999999998794568975898996079888971799 Q ss_pred E-EEEEC-C-----CCCCCCCCEEECHHHHHHCCCCHHHHHHH Q ss_conf 9-99980-7-----76766781332278887479987899999 Q gi|254780479|r 312 V-WKTIV-P-----QIVIIPDSTWHDAQNLANAALPTVMKKAL 347 (356) Q Consensus 312 v-~~~~~-~-----~~~~~~~~~Wv~~~el~~~~LPs~~kKIL 347 (356) + +.... . +.....+.+|++++|+-+.|+.+.+.|+| T Consensus 81 f~~~~~~~~~~i~~~~~Ei~e~~W~~~~El~~~p~~~~~~r~~ 123 (127) T cd04670 81 FICRLKPLSFDINFDTSEIAAAKWMPLEEYISQPITSEVNRLI 123 (127) T ss_pred EEEEEECCCCCCCCCCCCEEEEEEECHHHHHCCCCCHHHHHHH T ss_conf 9999981888778898505787883699987088704666656 No 45 >cd03674 Nudix_Hydrolase_1 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity. They also contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, U=I, L or V), which forms a structural motif that functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamil Probab=97.77 E-value=0.00017 Score=52.15 Aligned_cols=104 Identities=16% Similarity=0.222 Sum_probs=63.9 Q ss_pred EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCE----------EEEE------- Q ss_conf 20003413760577645720132047442111134672158886301010000213735----------8998------- Q gi|254780479|r 241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTI----------THTF------- 303 (356) Q Consensus 241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~i----------kH~f------- 303 (356) .++|+.+.+|++||.+|.. .|.|.+|++..+.+++.......+...+.......+ .|.+ T Consensus 6 ~~~Vv~~~~g~vLLv~~~~----~g~W~lPGG~ve~gEt~~~Aa~REv~EEtGl~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (138) T cd03674 6 SAFVVNPDRGKVLLTHHRK----LGSWLQPGGHIDPDESLLEAALRELREETGIELLGLRPLSVLVDLDVHPIDGHPKRG 81 (138) T ss_pred EEEEEECCCCEEEEEEECC----CCEEECCCCEECCCCCHHHHHHHHHHHHHCCCCCCEEECCCCCEEEEEEECCCCCCC T ss_conf 9999978999899998659----992989965867988999999999999879873222560434057898766856788 Q ss_pred --EEEEEEEEEEEEECC-------CCCCCCCCEEECHHHHHHCCCCHHHHHHHHH Q ss_conf --315999999999807-------7676678133227888747998789999984 Q gi|254780479|r 304 --THFTLTLFVWKTIVP-------QIVIIPDSTWHDAQNLANAALPTVMKKALSA 349 (356) Q Consensus 304 --TH~~L~i~v~~~~~~-------~~~~~~~~~Wv~~~el~~~~LPs~~kKIL~a 349 (356) .|+++.+. |.+... ......+.+|++++++..+.++--++++|+. T Consensus 82 ~~~~~h~~v~-f~~~~~~~~~~~~~~~E~~~~~W~~~~el~~l~~~~~~rr~i~~ 135 (138) T cd03674 82 VPGHLHLDLR-FLAVAPADDVAPPKSDESDAVRWFPLDELASLELPEDVRRLVEK 135 (138) T ss_pred CCCEEEEEEE-EEEEECCCCCCCCCCCCCCEEEEEEHHHCCCCCCCHHHHHHHHH T ss_conf 8876999999-99996288866689973210299859993025899769999999 No 46 >cd04695 Nudix_Hydrolase_36 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=97.75 E-value=0.00033 Score=50.13 Aligned_cols=107 Identities=12% Similarity=0.121 Sum_probs=65.1 Q ss_pred EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEE------CCCEEEEEEEEE--E-EEE Q ss_conf 2000341376057764572013204744211113467215888630101000021------373589983159--9-999 Q gi|254780479|r 241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWIL------CNTITHTFTHFT--L-TLF 311 (356) Q Consensus 241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~------l~~ikH~fTH~~--L-~i~ 311 (356) .++..++.+.+|||-||.. -++|.|+||++..+.+++.......+........ ...+.+.+++-+ + .+. T Consensus 5 v~~~~~~~~~~~LlLkR~~--~~~g~W~~p~G~iE~gEs~~eaA~REl~EETGl~~~~l~~~~~~~~~y~~~~~~i~~~~ 82 (131) T cd04695 5 VLLRSLDKETKVLLLKRVK--TLGGFWCHVAGGVEAGETAWQAALRELKEETGISLPELYNADYLEQFYEANDNRILMAP 82 (131) T ss_pred EEEEEECCCCEEEEEECCC--CCCCEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCEEEEEEECCCCEEEEEE T ss_conf 9999989988899997689--99995869823346999999999998889778774347712426889984897699999 Q ss_pred EEEEECCC------CCCCCCCEEECHHHHHHCC-CCHHHHHHHHHC Q ss_conf 99998077------6766781332278887479-987899999844 Q gi|254780479|r 312 VWKTIVPQ------IVIIPDSTWHDAQNLANAA-LPTVMKKALSAG 350 (356) Q Consensus 312 v~~~~~~~------~~~~~~~~Wv~~~el~~~~-LPs~~kKIL~al 350 (356) +|.+.++. .....+++|++.++..++. +|. .|.+|+.+ T Consensus 83 ~f~~~~~~~~~v~ls~Eh~~y~W~~~eea~~ll~~~~-~r~~l~~~ 127 (131) T cd04695 83 VFVGFVPPHQEVVLNHEHTEYRWCSFAEALELAPFPG-QRALYDHV 127 (131) T ss_pred EEEEEECCCCCCCCCCCCCCCEECCHHHHHHHCCCCC-HHHHHHHH T ss_conf 9999938999782562032089718999996769967-89999999 No 47 >cd04672 Nudix_Hydrolase_14 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=97.72 E-value=8.8e-05 Score=54.18 Aligned_cols=102 Identities=11% Similarity=0.108 Sum_probs=59.2 Q ss_pred EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCE------ECCCEEEEEEE-----EEEEE Q ss_conf 00034137605776457201320474421111346721588863010100002------13735899831-----59999 Q gi|254780479|r 242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWI------LCNTITHTFTH-----FTLTL 310 (356) Q Consensus 242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~------~l~~ikH~fTH-----~~L~i 310 (356) ..+++ ++|++||.||+.. |+|.||++..+.++........+...+... ..+...+..++ ++... T Consensus 6 ~~vv~-~dg~vLL~rr~~~----g~W~lPGG~ve~gEt~~eaa~REv~EETGl~v~~~~l~~v~~~~~~~~~~~~~~~~~ 80 (123) T cd04672 6 RAAIF-KDGKILLVREKSD----GLWSLPGGWADVGLSPAENVVKEVKEETGLDVKVRKLAAVDDRNKHHPPPQPYQVYK 80 (123) T ss_pred EEEEE-ECCEEEEEEECCC----CEEECCCEECCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEECCCCCCCCCCCEEEEE T ss_conf 99999-8999999998899----939898466379989999999999997790999858999983543279995249999 Q ss_pred EEEEEECCC-----CCCCCCCEEECHHHHHHCCCCHHHHHHHH Q ss_conf 999998077-----67667813322788874799878999998 Q gi|254780479|r 311 FVWKTIVPQ-----IVIIPDSTWHDAQNLANAALPTVMKKALS 348 (356) Q Consensus 311 ~v~~~~~~~-----~~~~~~~~Wv~~~el~~~~LPs~~kKIL~ 348 (356) .+|.+.... .....+..|+++++|..+++..-.+++|+ T Consensus 81 ~~f~~~~~~g~~~~~~E~~e~~Wf~~deLP~L~~~~~t~~~i~ 123 (123) T cd04672 81 LFFLCEILGGEFKPNIETSEVGFFALDDLPPLSEKRNTEEQIE 123 (123) T ss_pred EEEEEEECCCCCCCCCHHHEEEEECHHHCCCCCCCCCCHHHHC T ss_conf 9999998789388894564367988889987875698979919 No 48 >cd04669 Nudix_Hydrolase_11 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=97.70 E-value=0.00016 Score=52.45 Aligned_cols=97 Identities=14% Similarity=0.140 Sum_probs=53.8 Q ss_pred EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCE--ECCCEEEEEEEEE-----EEEEEE Q ss_conf 200034137605776457201320474421111346721588863010100002--1373589983159-----999999 Q gi|254780479|r 241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWI--LCNTITHTFTHFT-----LTLFVW 313 (356) Q Consensus 241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~--~l~~ikH~fTH~~-----L~i~v~ 313 (356) .+.++++.+|++||.||...| .++|.||++..+.++........+....... ..+.+-.++.+.. ..+.++ T Consensus 3 a~~ii~~~~~~vLl~~r~~~~--~~~w~~PGG~ve~gEt~~~a~~RE~~EE~g~~v~~~~~~~~~~~~~~~~~~F~~~i~ 80 (121) T cd04669 3 ASIVIINDQGEILLIRRIKPG--KTYYVFPGGGIEEGETPEEAAKREALEELGLDVRVEEIFLIVNQNGRTEHYFLARVI 80 (121) T ss_pred EEEEEEECCCEEEEEEECCCC--CCEEECCCCEECCCCCHHHHHHHHHHHHCCCEEEEEEEEEEEEECCCEEEEEEEEEE T ss_conf 899999779999999984899--986989960605999999999999887609889965599999747980699999998 Q ss_pred EEECC--CCC----CC-C---CCEEECHHHHHHCCC Q ss_conf 99807--767----66-7---813322788874799 Q gi|254780479|r 314 KTIVP--QIV----II-P---DSTWHDAQNLANAAL 339 (356) Q Consensus 314 ~~~~~--~~~----~~-~---~~~Wv~~~el~~~~L 339 (356) ..... ... .. . +..|+++++|..++| T Consensus 81 ~g~~~~~~~~e~~~~~~~~~~e~~WvpldeL~~l~l 116 (121) T cd04669 81 SGKLGLGVGEEFERQSDDNQYHPVWVDLDQLETIPL 116 (121) T ss_pred CCEECCCCCCCCCCCCCCCEEEEEEEEHHHHCCCCC T ss_conf 147815878423577888728999977999272898 No 49 >cd04700 DR1025_like DR1025 from Deinococcus radiodurans, a member of the Nudix hydrolase superfamily, show nucleoside triphosphatase and dinucleoside polyphosphate pyrophosphatase activities. Like other enzymes belonging to this superfamily, it requires a divalent cation, in this case Mg2+, for its activity. It also contains a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. In general, substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is us Probab=97.70 E-value=0.00023 Score=51.20 Aligned_cols=97 Identities=15% Similarity=0.152 Sum_probs=60.3 Q ss_pred EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCC------CEECCCEEEEEEEEEEEE-EEE Q ss_conf 2000341376057764572013204744211113467215888630101000------021373589983159999-999 Q gi|254780479|r 241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTAN------WILCNTITHTFTHFTLTL-FVW 313 (356) Q Consensus 241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~------~~~l~~ikH~fTH~~L~i-~v~ 313 (356) ...||++.+|++||.||. .+-..|+|+||++..+.++........+...+. ...++.+...+.+-...+ ++| T Consensus 16 agavV~n~~g~vLLvrr~-~~p~~g~W~lPgG~vE~GEt~~eAa~REv~EETGl~v~~~~~l~~~~~~~~~g~~~~~~~f 94 (142) T cd04700 16 AGAVILNERNDVLLVQEK-GGPKKGLWHIPSGAVEDGEFPQDAAVREACEETGLRVRPVKFLGTYLGRFDDGVLVLRHVW 94 (142) T ss_pred EEEEEEECCCEEEEEEEC-CCCCCCEEECCEECCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEEECCCCCEEEEEEE T ss_conf 999999699989999976-8998998979833387996999999999999669788888898999468079988999999 Q ss_pred EEECCCC-------CCCCCCEEECHHHHHHCC Q ss_conf 9980776-------766781332278887479 Q gi|254780479|r 314 KTIVPQI-------VIIPDSTWHDAQNLANAA 338 (356) Q Consensus 314 ~~~~~~~-------~~~~~~~Wv~~~el~~~~ 338 (356) .++.... ....+.+|++.+|+..+. T Consensus 95 l~~~~~~~~~p~~~dE~~e~~Wf~~~El~~L~ 126 (142) T cd04700 95 LAEPEGQTLAPKFTDEIAEASFFSREDVAQLY 126 (142) T ss_pred EEEECCCCCCCCCCCCEEEEEEECHHHHHHHH T ss_conf 99974883256999876788998599955545 No 50 >cd04671 Nudix_Hydrolase_13 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=97.65 E-value=0.00023 Score=51.19 Aligned_cols=106 Identities=14% Similarity=0.127 Sum_probs=60.1 Q ss_pred EEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECC-----CEEEEEEEEEEEEEEE Q ss_conf 23200034137605776457201320474421111346721588863010100002137-----3589983159999999 Q gi|254780479|r 239 TGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCN-----TITHTFTHFTLTLFVW 313 (356) Q Consensus 239 ~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~-----~ikH~fTH~~L~i~v~ 313 (356) |..+.++++.+|+|||.||... -..|+|+||++..+.++.......++...+...... .+.... ......+| T Consensus 1 ~vV~avi~~~~g~vLLv~r~~~-~~~g~W~lPGG~vE~GEt~eeaa~REv~EETGl~v~~~~l~~v~~~~--~~~~~~~f 77 (123) T cd04671 1 YIVAAVILNNQGEVLLIQEAKR-SCRGKWYLPAGRMEPGETIEEAVKREVKEETGLDCEPTTLLSVEEQG--GSWFRFVF 77 (123) T ss_pred CEEEEEEEECCCEEEEEEECCC-CCCCEEECCEEECCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEECC--CEEEEEEE T ss_conf 9899999988998999996589-99995979801804998999999999999749888977799999269--80999999 Q ss_pred EEECCC-----C----CCCCCCEEECHHHHHHCCCCHHHHHHHHHC Q ss_conf 998077-----6----766781332278887479987899999844 Q gi|254780479|r 314 KTIVPQ-----I----VIIPDSTWHDAQNLANAALPTVMKKALSAG 350 (356) Q Consensus 314 ~~~~~~-----~----~~~~~~~Wv~~~el~~~~LPs~~kKIL~al 350 (356) .++... . ....+.+|+++++| |.|-..+-+++.+ T Consensus 78 ~~~~~~G~l~~~~~~~~E~~ea~Wf~~deL---P~pl~~~~~~~~~ 120 (123) T cd04671 78 TGNITGGDLKTEKEADSESLQARWYSNKDL---PLPLRAHDILSLI 120 (123) T ss_pred EEEEECCEECCCCCCCCCEEEEEEECHHHC---CCCCCCCHHHHHH T ss_conf 999927651168999987799899888998---6876404345463 No 51 >pfam00293 NUDIX NUDIX domain. Probab=97.60 E-value=0.00033 Score=50.12 Aligned_cols=107 Identities=14% Similarity=0.164 Sum_probs=64.6 Q ss_pred EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEE------CCCEEEEEEEEEE--EEEEE Q ss_conf 000341376057764572013204744211113467215888630101000021------3735899831599--99999 Q gi|254780479|r 242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWIL------CNTITHTFTHFTL--TLFVW 313 (356) Q Consensus 242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~------l~~ikH~fTH~~L--~i~v~ 313 (356) ..++++.++++||.||...+..+|.|+||++..+.++........+...+.... ...+.+...+... ..++| T Consensus 6 ~vvi~~~~~~vll~~r~~~~~~~~~w~~PgG~ve~gE~~~~aa~RE~~EE~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (133) T pfam00293 6 GVVLLNEDGEVLLVRRSRTPGTPGLWELPGGKVEPGESPEEAAVRELEEETGLRVLLLLLLGVLEYPAPGRDLREHVHVF 85 (133) T ss_pred EEEEEECCCEEEEEEEECCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEECCCCCCCCEEEEEE T ss_conf 99999489989999927689999849998455727988999999998897595115789999998649999982999999 Q ss_pred EEECCCC--------CCCCCCEEECHHHHHHCCCCHHHHHHHH Q ss_conf 9980776--------7667813322788874799878999998 Q gi|254780479|r 314 KTIVPQI--------VIIPDSTWHDAQNLANAALPTVMKKALS 348 (356) Q Consensus 314 ~~~~~~~--------~~~~~~~Wv~~~el~~~~LPs~~kKIL~ 348 (356) .+..... ....+..|++.+++.+.....-++.++. T Consensus 86 ~~~~~~~~~~~~~~~~E~~~~~w~~~~e~~~~~~~~~~~~~~~ 128 (133) T pfam00293 86 LAELLGGEPEPQPDPGEVSEVRWVPLEELLELLLAGEIREVIA 128 (133) T ss_pred EEEEECCCCCCCCCCCEEEEEEEEEHHHHHHHHHCCCHHHHHH T ss_conf 9998168644668978389999999999998878897389999 No 52 >cd03673 Ap6A_hydrolase Diadenosine hexaphosphate (Ap6A) hydrolase is a member of the Nudix hydrolase superfamily. Ap6A hydrolase specifically hydrolyzes diadenosine polyphosphates, but not ATP or diadenosine triphosphate, and it generates ATP as the product. Ap6A, the most preferred substrate, hydrolyzes to produce two ATP molecules, which is a novel hydrolysis mode for Ap6A. These results indicate that Ap6A hydrolase is a diadenosine polyphosphate hydrolase. It requires the presence of a divalent cation, such as Mn2+, Mg2+, Zn2+, and Co2+, for activity. Members of the Nudix superfamily are recognized by a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which forms a structural motif that functions as a metal binding and catalytic site. Probab=97.56 E-value=0.0016 Score=45.37 Aligned_cols=99 Identities=13% Similarity=0.022 Sum_probs=60.7 Q ss_pred CCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCC------CCCEECCCEEEEEEEE----EEEEEEEEEEC Q ss_conf 3760577645720132047442111134672158886301010------0002137358998315----99999999980 Q gi|254780479|r 248 NDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFT------ANWILCNTITHTFTHF----TLTLFVWKTIV 317 (356) Q Consensus 248 ~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~------~~~~~l~~ikH~fTH~----~L~i~v~~~~~ 317 (356) .+.+|||.+|+.+ |.|.||++..+.+++.......+... .....++.+.|.|.+- .-.++.|.+.. T Consensus 14 ~~~~vLLv~r~~~----~~W~lPgG~ve~gEs~~eaa~RE~~EEtGl~~~~~~~~~~~~~~~~~~~~~~~~~v~~f~~~~ 89 (131) T cd03673 14 GGIEVLLIHRPRG----DDWSLPKGKLEPGETPPEAAVREVEEETGIRAEVGDPLGTIRYWFSSSGKRVHKTVHWWLMRA 89 (131) T ss_pred CCEEEEEEEECCC----CEEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEECCEEEEEEECCCCEEEEEEEEEEEEE T ss_conf 9679999995899----979998245279979999999999998795389842711668997349960788999999999 Q ss_pred CC-------CCCCCCCEEECHHHHHHCCCCHHHHHHHHHC Q ss_conf 77-------6766781332278887479987899999844 Q gi|254780479|r 318 PQ-------IVIIPDSTWHDAQNLANAALPTVMKKALSAG 350 (356) Q Consensus 318 ~~-------~~~~~~~~Wv~~~el~~~~LPs~~kKIL~al 350 (356) .. .....+.+|++.+++.++....-.++||+++ T Consensus 90 ~~~~~~~~~~~e~~~~~W~~~~ea~~~l~~~~~r~il~~a 129 (131) T cd03673 90 LGGEFTPQPDEEVDEVRWLPPDEARDRLSYPNDRELLRAA 129 (131) T ss_pred ECCCCCCCCCCCCCEEEEEEHHHHHHHCCCHHHHHHHHHH T ss_conf 6795320788654179992199997765998799999997 No 53 >cd04697 Nudix_Hydrolase_38 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=97.54 E-value=0.00055 Score=48.58 Aligned_cols=97 Identities=8% Similarity=0.002 Sum_probs=56.0 Q ss_pred EEEEEEEECCCEEEEEECCCC-HHHHCCCCCCC-CCCCCCCCHHHH------HHCCCCCCCCEECCCEEEEEEEEEEEEE Q ss_conf 320003413760577645720-13204744211-113467215888------6301010000213735899831599999 Q gi|254780479|r 240 GAVFIAITNDNRILLRKRTNT-RLLEGMDELPG-SAWSSTKDGNID------THSAPFTANWILCNTITHTFTHFTLTLF 311 (356) Q Consensus 240 ~~~~vii~~~~kiLL~KRp~~-gll~GLwEFP~-~e~~~~~~~~~~------~~~~~~~~~~~~l~~ikH~fTH~~L~i~ 311 (356) ..+ ++.|.+|++||+||+.. ..|.|+|+... +-...+++.... ++..........+..+.+.....+.... T Consensus 3 ~~v-~i~n~~G~iLlqkRs~~K~~~Pg~wd~s~~Ghv~~GE~~~~aa~REl~EElGI~~~~l~~~~~~~~~~~~~~~~~~ 81 (126) T cd04697 3 TYI-FVFNSEGKLCVHKRTLTKDWCPGYWDIAFGGVVQAGESYLQNAQRELEEELGIDGVQLTPLGLFYYDTDGNRVWGK 81 (126) T ss_pred EEE-EEECCCCCEEEEEECCCCCCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEEECCCCCCEEEE T ss_conf 999-9992999799998579998799613457687677998999999999999868986675675899987898848999 Q ss_pred EEEEECCC-----CCCCCCCEEECHHHHHHC Q ss_conf 99998077-----676678133227888747 Q gi|254780479|r 312 VWKTIVPQ-----IVIIPDSTWHDAQNLANA 337 (356) Q Consensus 312 v~~~~~~~-----~~~~~~~~Wv~~~el~~~ 337 (356) +|.+..+. +...++.+|++++||.+. T Consensus 82 vf~~~~d~~l~~~~eEV~~~~w~~~~El~~~ 112 (126) T cd04697 82 VFSCVYDGPLKLQEEEVEEITWLSINEILQF 112 (126) T ss_pred EEEEEECCCCCCCHHHHHEEEECCHHHHHHH T ss_conf 9999999999599677311798039999989 No 54 >cd04680 Nudix_Hydrolase_21 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=97.53 E-value=0.0005 Score=48.85 Aligned_cols=104 Identities=13% Similarity=0.110 Sum_probs=62.0 Q ss_pred EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECC-------CEEEEEEEEEEEEEEE Q ss_conf 200034137605776457201320474421111346721588863010100002137-------3589983159999999 Q gi|254780479|r 241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCN-------TITHTFTHFTLTLFVW 313 (356) Q Consensus 241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~-------~ikH~fTH~~L~i~v~ 313 (356) ...++++.+|++||.||.. .|.|+||++..+.+++.......+...+...... .+.|........+.+| T Consensus 3 v~~vv~~~~g~vLL~rr~~----~~~W~lPGG~ve~gEs~~~aa~RE~~EEtGl~v~~~~~l~~~~~~~~~~~~~~~~~f 78 (120) T cd04680 3 ARAVVTDADGRVLLVRHTY----GPGWYLPGGGLERGETFAEAARRELLEELGIRLAVVAELLGVYYHSASGSWDHVIVF 78 (120) T ss_pred EEEEEECCCCEEEEEEECC----CCEEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEECCCCCCCEEEEEE T ss_conf 9999992999099998679----991999954716998999999999999869755515479999825899975399999 Q ss_pred EEECCC------CCCCCCCEEECHHHHHHCCCCHHHHHHHHH Q ss_conf 998077------676678133227888747998789999984 Q gi|254780479|r 314 KTIVPQ------IVIIPDSTWHDAQNLANAALPTVMKKALSA 349 (356) Q Consensus 314 ~~~~~~------~~~~~~~~Wv~~~el~~~~LPs~~kKIL~a 349 (356) .+.... .....+..|+++++|.. +++-.++++|.. T Consensus 79 ~~~~~~~~~~~~~~E~~e~~Wf~~d~LPe-~l~p~~~~~i~~ 119 (120) T cd04680 79 RARADTQPVIRPSHEISEARFFPPDALPE-PTTPATRRRIAE 119 (120) T ss_pred EEEECCCCCCCCCHHHHCEEEECHHHCCC-CCCHHHHHHHHC T ss_conf 99824888679983674113988889888-789779999753 No 55 >cd04687 Nudix_Hydrolase_28 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=97.53 E-value=0.00076 Score=47.59 Aligned_cols=96 Identities=19% Similarity=0.178 Sum_probs=56.3 Q ss_pred EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCE------ECCCEEE--------EEEEE Q ss_conf 200034137605776457201320474421111346721588863010100002------1373589--------98315 Q gi|254780479|r 241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWI------LCNTITH--------TFTHF 306 (356) Q Consensus 241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~------~l~~ikH--------~fTH~ 306 (356) ..++|+ ++|++||.||.+.+ +.+|.||++..+.++........+..++... .+..+.+ ...++ T Consensus 4 a~AvI~-~~~~vLL~r~~~~~--~~~w~lPGG~ve~GEt~~~Al~RE~~EEtGl~v~v~~l~~v~e~~~~~~~~~~~~~~ 80 (128) T cd04687 4 AKAVII-KNDKILLIKHHDDG--GVWYILPGGGQEPGETLEDAAHRECKEEIGIDVEIGPLLFVREYIGHNPTSELPGHF 80 (128) T ss_pred EEEEEE-ECCEEEEEEECCCC--CCEEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEECCEEEEEEECCCCCCCCCCCCE T ss_conf 999999-89999999983699--816989886676997999999998878639678856089999701325566788854 Q ss_pred EEEEEEEEEECCCCC-----CCC-----CCEEECHHHHHHCCC Q ss_conf 999999999807767-----667-----813322788874799 Q gi|254780479|r 307 TLTLFVWKTIVPQIV-----IIP-----DSTWHDAQNLANAAL 339 (356) Q Consensus 307 ~L~i~v~~~~~~~~~-----~~~-----~~~Wv~~~el~~~~L 339 (356) +.-..+|.+...... ... +.+|+++++|.++++ T Consensus 81 h~v~~~f~~~~~~g~~~~~~~~~d~~~~~~~Wv~ldeL~~~~l 123 (128) T cd04687 81 HQVELMFECKIKSGTPAKTPSKPDPNQIGVEWLKLKELGDIPL 123 (128) T ss_pred EEEEEEEEEEECCCCCCCCCCCCCCCEEEEEEECHHHHCCCCC T ss_conf 9999999999868954667778876657899804999566988 No 56 >cd04688 Nudix_Hydrolase_29 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=97.51 E-value=0.0011 Score=46.51 Aligned_cols=95 Identities=15% Similarity=0.116 Sum_probs=55.6 Q ss_pred EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCC------CCCCEECCCEEEEEEE----EEEEEE Q ss_conf 000341376057764572013204744211113467215888630101------0000213735899831----599999 Q gi|254780479|r 242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPF------TANWILCNTITHTFTH----FTLTLF 311 (356) Q Consensus 242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~------~~~~~~l~~ikH~fTH----~~L~i~ 311 (356) .++++ ++|++||.+++.. ++|.||++..+.++........+.. ......+..+.+.|++ ++--.. T Consensus 5 ~aii~-~~~~vLl~~~~~~----~~w~lPGG~ve~gEt~~eA~~RE~~EEtGl~v~~~~l~~v~e~~f~~~~~~~h~i~~ 79 (126) T cd04688 5 AAIII-HNGKLLVQKNPDE----TFYRPPGGGIEFGESSEEALIREFKEELGLKIEITRLLGVVENIFTYNGKPGHEIEF 79 (126) T ss_pred EEEEE-ECCEEEEEEECCC----CEEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEEEEECCCCCCEEEEE T ss_conf 99999-8999999997799----989998534369989999998888887596788246999997688729975479999 Q ss_pred EEEEECCCCC-------------CCCCCEEECHHHHHHCCC-CH Q ss_conf 9999807767-------------667813322788874799-87 Q gi|254780479|r 312 VWKTIVPQIV-------------IIPDSTWHDAQNLANAAL-PT 341 (356) Q Consensus 312 v~~~~~~~~~-------------~~~~~~Wv~~~el~~~~L-Ps 341 (356) .|.+...... ...+.+|++++++.++++ |. T Consensus 80 ~f~~~~~~~~~~~~~~~~~~~~~~~~~~~Wv~ldeL~~l~l~P~ 123 (126) T cd04688 80 YYLVTLLDESLYQQDIEILEEEGEKIVFRWIPIDELKEIKLYPE 123 (126) T ss_pred EEEEEECCCCCCCCCCCCCCCCCCEEEEEECCHHHHCCCCCCCC T ss_conf 99999978855778764327798799999847999276887970 No 57 >cd04689 Nudix_Hydrolase_30 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U=I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate sp Probab=97.51 E-value=0.002 Score=44.69 Aligned_cols=103 Identities=10% Similarity=0.091 Sum_probs=62.4 Q ss_pred EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHH------CCCCCCCCEECCCEEEEEEEE----EEEEE Q ss_conf 00034137605776457201320474421111346721588863------010100002137358998315----99999 Q gi|254780479|r 242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTH------SAPFTANWILCNTITHTFTHF----TLTLF 311 (356) Q Consensus 242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~------~~~~~~~~~~l~~ikH~fTH~----~L~i~ 311 (356) +..++.++|++||.||...+ .|.||++..+.++....... ..........++.+.|.|+.. +--.. T Consensus 4 a~aii~~~~~vLl~~~~~~~----~~~lPGG~ve~gE~~~~a~~RE~~EE~G~~v~~~~~l~~~e~~~~~~~~~~H~i~~ 79 (125) T cd04689 4 ARAIVRAGNKVLLARVIGQP----HYFLPGGHVEPGETAENALRRELQEELGVAVSDGRFLGAIENQWHEKGVRTHEINH 79 (125) T ss_pred EEEEEEECCEEEEEEECCCC----CEECCCCCCCCCCCHHHHHHHHHHHHHCCEEEECEEEEEEEEEEECCCCCEEEEEE T ss_conf 99999989999999988999----58999798569989999999999998799789465999997467759965189999 Q ss_pred EEEEECCC------CC---CCCCCEEECHHHHHHCCCCHHHHHHHHHC Q ss_conf 99998077------67---66781332278887479987899999844 Q gi|254780479|r 312 VWKTIVPQ------IV---IIPDSTWHDAQNLANAALPTVMKKALSAG 350 (356) Q Consensus 312 v~~~~~~~------~~---~~~~~~Wv~~~el~~~~LPs~~kKIL~al 350 (356) +|.+..+. .. ..-+..|+++++++- .|++.+-+|..| T Consensus 80 ~f~~~~~~~~~~~~~~~d~d~~~~~WvpL~eL~l--yP~~l~d~i~~~ 125 (125) T cd04689 80 IFAVESSWLASDGPPQADEDHLSFSWVPVSDLSL--YPALLRDLVKGL 125 (125) T ss_pred EEEEECCCCCCCCCCCCCCCEEEEEEECHHHHHH--CCHHHHHHHHCC T ss_conf 9999525545568988888667899971899534--877588885349 No 58 >pfam00633 HHH Helix-hairpin-helix motif. The helix-hairpin-helix DNA-binding motif is found to be duplicated in the central domain of RuvA. The HhH domain of DisA, a bacterial checkpoint control protein, is a DNA-binding domain. Probab=97.50 E-value=2.9e-05 Score=57.54 Aligned_cols=30 Identities=43% Similarity=0.605 Sum_probs=27.5 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 057868731456888887778776677664 Q gi|254780479|r 108 KYEGNFPHKVEILKKLPGIGDYTASAIVAI 137 (356) Q Consensus 108 ~~~g~~P~~~~~l~~LpGiG~yta~ai~s~ 137 (356) .++|..|.+.++|++|||||+|||.+|+++ T Consensus 1 ~~~~~~~as~eeL~~lpGVG~~tA~~I~~~ 30 (30) T pfam00633 1 SLEGLIPASREELLALPGVGPKTAEAILSY 30 (30) T ss_pred CCCCCCCCCHHHHHHCCCCCHHHHHHHHCC T ss_conf 964435235999972889776889988539 No 59 >cd04693 Nudix_Hydrolase_34 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=97.41 E-value=0.0006 Score=48.29 Aligned_cols=94 Identities=12% Similarity=0.098 Sum_probs=47.4 Q ss_pred EEEEECCCEEEEEEC-CCCHHHHCCCCCCCCCC-CCCCCHHHHHHCCCCCCCCEEC--C---CEEEEEEEE--EEEEEEE Q ss_conf 003413760577645-72013204744211113-4672158886301010000213--7---358998315--9999999 Q gi|254780479|r 243 FIAITNDNRILLRKR-TNTRLLEGMDELPGSAW-SSTKDGNIDTHSAPFTANWILC--N---TITHTFTHF--TLTLFVW 313 (356) Q Consensus 243 ~vii~~~~kiLL~KR-p~~gll~GLwEFP~~e~-~~~~~~~~~~~~~~~~~~~~~l--~---~ikH~fTH~--~L~i~v~ 313 (356) .++.|.+|++||+|| +.+.++.|+|++|.+.. ..++.... ............+ . .+.+.+... .-.+.++ T Consensus 5 v~v~n~~g~iLlqkRs~~K~~~Pg~W~~~~gg~v~~gE~~ea-a~RE~~EElGi~l~~~~~~~~~~~~~~~~~~~~~~~~ 83 (127) T cd04693 5 VCIFNSKGELLLQKRSPNKDGWPGMWDLSVGGHVQAGETSTA-AEREVKEELGLELDFSELRPLFRYFFEAEGFDDYYLF 83 (127) T ss_pred EEEECCCCCEEEEECCCCCCCCCCCEECCCCCCCCCCCCHHH-HHHHHHHHHCCCCCCCCCEEEEEEECCCCCEEEEEEE T ss_conf 999939998999966899889997276788755128985999-9999999717701766647889999269925899999 Q ss_pred EEEC------CCCCCCCCCEEECHHHHHHC Q ss_conf 9980------77676678133227888747 Q gi|254780479|r 314 KTIV------PQIVIIPDSTWHDAQNLANA 337 (356) Q Consensus 314 ~~~~------~~~~~~~~~~Wv~~~el~~~ 337 (356) .... .+.....+++|++.++|.+. T Consensus 84 ~~~~~~~~~~~~~~EV~d~~Wvs~eEl~~~ 113 (127) T cd04693 84 YADVEIGKLILQKEEVDEVKFVSKDEIDGL 113 (127) T ss_pred EEEEECCCCCCCHHHEEEEEEECHHHHHHH T ss_conf 970378987289378246899679999999 No 60 >KOG1918 consensus Probab=97.34 E-value=0.0011 Score=46.53 Aligned_cols=148 Identities=21% Similarity=0.373 Sum_probs=111.5 Q ss_pred CCHHHHHHHHHHHHHCCHHHCCHHHHHHH------HHCCCHHHHHCCCHHHHHHHHHCCCHH-HHHHHHHHHHHHHHHCC Q ss_conf 87046766446731161203326589987------748997898729989999997404634-58899987767543005 Q gi|254780479|r 37 PSPYKVWISEIMLQQTTVKTVEPYFKKFM------QKWPTIFCLSSAKDEEILSAWAGLGYY-TRARNLKKCADIIVKKY 109 (356) Q Consensus 37 ~~py~v~vseimlqqT~v~~v~~~~~~~~------~~~P~~~~la~a~~~~vl~~w~gLGyy-~Rar~l~~~a~~i~~~~ 109 (356) +.||.-+++-|..||---+..-..|.||+ ..||+++..-.++-+++.+. |+- +.|-.||..|..-.+.| T Consensus 73 q~Pf~~LiraIlsQQLs~kAansI~~Rfvsl~~g~~~~~~pe~i~~~~~~~lrkc----G~S~rK~~yLh~lA~~~~ng~ 148 (254) T KOG1918 73 QTPFERLIRAILSQQLSGKAANSIYNRFVSLCGGAEKFPTPEFIDPLDCEELRKC----GFSKRKASYLHSLAEAYTNGY 148 (254) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCCHHHHHHH----CCCHHHHHHHHHHHHHHHCCC T ss_conf 4709999999999999899999999999998479767788321276789999985----733246799999999986477 Q ss_pred CCCCCCHH-------HHH----HHHHHHHHHHHHHHHHHHCCCE--EEECCCCHHHHHHHHHCCCC-CCCHHHHHHHHHH Q ss_conf 78687314-------568----8888777877667766434963--13124314666576519754-5735689998741 Q gi|254780479|r 110 EGNFPHKV-------EIL----KKLPGIGDYTASAIVAIAFNHF--AVVVDTNIERIISRYFDIIK-PAPLYHKTIKNYA 175 (356) Q Consensus 110 ~g~~P~~~-------~~l----~~LpGiG~yta~ai~s~a~~~~--~~~vD~Nv~RVl~R~~~~~~-~~~~~~k~l~~~~ 175 (356) +|.+. ++| -...|||+.|+--.+-|+.+.+ .|+=|--|++=+.-++|.+. |. .+++.++. T Consensus 149 ---I~s~~~i~~mseEeL~~~LT~VKGIg~Wtv~MflIfsL~R~DVmp~dDlgir~g~k~l~gl~~~p~---~~evekl~ 222 (254) T KOG1918 149 ---IPSKSGIEKMSEEELIERLTNVKGIGRWTVEMFLIFSLHRPDVMPADDLGIRNGVKKLLGLKPLPL---PKEVEKLC 222 (254) T ss_pred ---CCCHHHHHHCCHHHHHHHHHHCCCCCCEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCC---HHHHHHHH T ss_conf ---776677761579999999986047551124432110147776668503668877999808999886---59999986 Q ss_pred HCCCCCCCH-HHHHHHHHHH Q ss_conf 025785226-7899999999 Q gi|254780479|r 176 RKITSTSRP-GDFVQAMMDL 194 (356) Q Consensus 176 ~~~~~~~~~-~~~nQAlMdl 194 (356) +..-|..-. .=|+.-+.|+ T Consensus 223 e~~kpyRtvaawYlWki~~~ 242 (254) T KOG1918 223 EKCKPYRTVAAWYLWKIIKL 242 (254) T ss_pred HHCCCHHHHHHHHHHHHHHH T ss_conf 20340477888898732343 No 61 >cd04694 Nudix_Hydrolase_35 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=97.17 E-value=0.0012 Score=46.17 Aligned_cols=44 Identities=18% Similarity=0.311 Sum_probs=35.2 Q ss_pred EEEEEEEECCCEEEEEECCCC-HHHHCCCCCCCCCCCCCCCHHHH Q ss_conf 320003413760577645720-13204744211113467215888 Q gi|254780479|r 240 GAVFIAITNDNRILLRKRTNT-RLLEGMDELPGSAWSSTKDGNID 283 (356) Q Consensus 240 ~~~~vii~~~~kiLL~KRp~~-gll~GLwEFP~~e~~~~~~~~~~ 283 (356) ....++.+.+|++||.||+.. +.+.|+|+||++..+.++..... T Consensus 3 gVa~ii~d~dgrvLL~rRs~~~~~~Pg~W~~PGG~ve~GEt~~ea 47 (143) T cd04694 3 GVAVLLQSSDQKLLLTRRASSLRIFPNVWVPPGGHVELGENLLEA 47 (143) T ss_pred EEEEEEECCCCEEEEEEECCCCCCCCCCEECCCCCCCCCCCHHHH T ss_conf 599999989992999985898889998171887745899898999 No 62 >PRK09438 ntpA dATP pyrophosphohydrolase; Provisional Probab=97.09 E-value=0.0022 Score=44.31 Aligned_cols=106 Identities=14% Similarity=0.106 Sum_probs=64.6 Q ss_pred EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECC-------------------CEEEE Q ss_conf 00034137605776457201320474421111346721588863010100002137-------------------35899 Q gi|254780479|r 242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCN-------------------TITHT 302 (356) Q Consensus 242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~-------------------~ikH~ 302 (356) ..||.+.++++||.||... .|+|++|++..+.++........+..++...... ..+|. T Consensus 12 lvvi~~~~~~vLl~~R~~~---~~~W~~~~G~~e~gE~~~~aa~RE~~EEtGi~~~~~~~~L~D~~~~~~yeI~~~wr~R 88 (148) T PRK09438 12 LVVIYTPDLKVLMLQRADD---PDFWQSVTGSLEEGETPAQTAAREVKEETGIDVAAEQLTLIDCQRTVEYEIFPHWRHR 88 (148) T ss_pred EEEEECCCCCEEEEEECCC---CCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEECCCCCEEEEECCHHHCCC T ss_conf 9999859998999997489---9967098312689979899999999998686225671364266520232114422220 Q ss_pred E----EEEEEEEEEEEEECCC-----CCCCCCCEEECHHHHHHCCCCHHHHHHHHHCCC Q ss_conf 8----3159999999998077-----676678133227888747998789999984482 Q gi|254780479|r 303 F----THFTLTLFVWKTIVPQ-----IVIIPDSTWHDAQNLANAALPTVMKKALSAGGI 352 (356) Q Consensus 303 f----TH~~L~i~v~~~~~~~-----~~~~~~~~Wv~~~el~~~~LPs~~kKIL~alg~ 352 (356) | +| -.-++|.+.++. ..++.+++|++.++..+.-++-.++-.|..|.+ T Consensus 89 Y~pgv~~--n~EhvF~l~l~~~~~V~LsEH~~~~Wl~~~eA~~l~~s~sN~~Ai~~l~~ 145 (148) T PRK09438 89 YAPGVTH--NTEHWFCLALPHERPVVLTEHLAYQWLPAEEAAALTKSPSNAEAILQLVI 145 (148) T ss_pred CCCCCCC--CEEEEEEEECCCCCCCCCCHHHHCCCCCHHHHHHHHCCCCCHHHHHHHHH T ss_conf 4888524--16799999858899721201432452479999998669331999999986 No 63 >cd03675 Nudix_Hydrolase_2 Contains a crystal structure of the Nudix hydrolase from Nitrosomonas europaea, which has an unknown function. In general, members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity. They also contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which forms a structural motif that functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, Probab=97.06 E-value=0.006 Score=41.25 Aligned_cols=105 Identities=13% Similarity=0.106 Sum_probs=59.7 Q ss_pred EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCC------CCEECCCEEEEEEEE--EEEEEEE Q ss_conf 00034137605776457201320474421111346721588863010100------002137358998315--9999999 Q gi|254780479|r 242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTA------NWILCNTITHTFTHF--TLTLFVW 313 (356) Q Consensus 242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~------~~~~l~~ikH~fTH~--~L~i~v~ 313 (356) +..++..+|++||.++...| ++.|.+|++..+.++........+...+ ....++.+...+... ......| T Consensus 3 Vaavv~~~gr~LLV~~~~~~--~~~~~~PgG~~e~gE~~~~a~~RE~~EEtG~~v~~~~~~g~~~~~~~~~~~~ylr~~F 80 (134) T cd03675 3 VAAVVERDGRFLLVEEETDG--GLVFNQPAGHLEPGESLIEAAVRETLEETGWHVEPTALLGIYQWTAPDSDTTYLRFAF 80 (134) T ss_pred EEEEEEECCEEEEEEEECCC--CCEEECCCEECCCCCCHHHHHHHHHHHHHCCEEEEEEEEEEEEEECCCCCCEEEEEEE T ss_conf 99999979999999996599--9729898227889989999999999997577677617999998755899961899999 Q ss_pred EEECCCC-------CCCCCCEEECHHHHHHCC--CCHH-HHHHHH Q ss_conf 9980776-------766781332278887479--9878-999998 Q gi|254780479|r 314 KTIVPQI-------VIIPDSTWHDAQNLANAA--LPTV-MKKALS 348 (356) Q Consensus 314 ~~~~~~~-------~~~~~~~Wv~~~el~~~~--LPs~-~kKIL~ 348 (356) .++.... ....+..|++++||.... +-+| +.|.|. T Consensus 81 ~~~~~~~~~~~~~D~eI~~a~W~tldEi~~~~~~lRsPlV~~~i~ 125 (134) T cd03675 81 AAELLEHLPDQPLDSGIVRAHWLTLEEILALAARLRSPLVLRCIE 125 (134) T ss_pred EEEECCCCCCCCCCCCCEEEEECCHHHHHHHHHCCCCHHHHHHHH T ss_conf 999778998889998602417278999865254116884999999 No 64 >cd03429 NADH_pyrophosphatase NADH pyrophosphatase, a member of the Nudix hydrolase superfamily, catalyzes the cleavage of NADH into reduced nicotinamide mononucleotide (NMNH) and AMP. Like other members of the Nudix family, it requires a divalent cation, such as Mg2+ or Mn2+, for activity. Members of this family are also recognized by the Nudix motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), that functions as a metal binding and catalytic site. A block of 8 conserved amino acids downstream of the nudix motif is thought to give NADH pyrophosphatase its specificity for NADH. NADH pyrophosphatase forms a dimer. Probab=96.78 E-value=0.0086 Score=40.17 Aligned_cols=92 Identities=13% Similarity=0.114 Sum_probs=60.2 Q ss_pred EEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCEEEEEEEE----EEEEEEEEEECC Q ss_conf 0034137605776457201320474421111346721588863010100002137358998315----999999999807 Q gi|254780479|r 243 FIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTITHTFTHF----TLTLFVWKTIVP 318 (356) Q Consensus 243 ~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~ikH~fTH~----~L~i~v~~~~~~ 318 (356) .++++.++++||.++.... .|+|.+|.+-.+.++........+...+.....+.+++..|.. .--|..|.+... T Consensus 5 ~~V~~~~~~iLL~r~~~~~--~g~~~~~aG~ve~GEs~e~a~~REv~EE~gv~v~~i~y~~sqpwpfp~~Lm~~f~a~~~ 82 (131) T cd03429 5 VLVIDGGDRILLARQPRFP--PGMYSLLAGFVEPGESLEEAVRREVKEEVGIRVKNIRYVGSQPWPFPSSLMLGFTAEAD 82 (131) T ss_pred EEEEECCCEEEEECCCCCC--CCCEECCEEEECCCCCHHHCEEEEEEEEECCEECCEEEEEECCCCCCCEEEEEEEEEEC T ss_conf 9999489889997056799--99594662664399997991189899871770221699887587998868999999976 Q ss_pred C------CCCCCCCEEECHHHHHH Q ss_conf 7------67667813322788874 Q gi|254780479|r 319 Q------IVIIPDSTWHDAQNLAN 336 (356) Q Consensus 319 ~------~~~~~~~~Wv~~~el~~ 336 (356) . ..+..+.+|++.+|+.. T Consensus 83 ~~e~~~~~~Ei~earWfs~del~~ 106 (131) T cd03429 83 SGEIVVDDDELEDARWFSRDEVRA 106 (131) T ss_pred CCCCCCCHHHEEEEEEECHHHHHH T ss_conf 997669931621677676999866 No 65 >cd03671 Ap4A_hydrolase_plant_like Diadenosine tetraphosphate (Ap4A) hydrolase is a member of the Nudix hydrolase superfamily. Members of this family are well represented in a variety of prokaryotic and eukaryotic organisms. Phylogenetic analysis reveals two distinct subgroups where plant enzymes fall into one group (represented by this subfamily) and fungi/animals/archaea enzymes fall into another. Bacterial enzymes are found in both subfamilies. Ap4A is a potential by-product of aminoacyl tRNA synthesis, and accumulation of Ap4A has been implicated in a range of biological events, such as DNA replication, cellular differentiation, heat shock, metabolic stress, and apoptosis. Ap4A hydrolase cleaves Ap4A asymmetrically into ATP and AMP. It is important in the invasive properties of bacteria and thus presents a potential target for the inhibition of such invasive bacteria. Besides the signature nudix motif (G[X5]E[X7]REUXEEXGU where U is Ile, Leu, or Val), Ap4A hydrolase is structurally Probab=96.67 E-value=0.0096 Score=39.84 Aligned_cols=43 Identities=14% Similarity=0.294 Sum_probs=32.5 Q ss_pred EEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHH Q ss_conf 3200034137605776457201320474421111346721588863 Q gi|254780479|r 240 GAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTH 285 (356) Q Consensus 240 ~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~ 285 (356) ....++.|.+|++|+.||... .|.|.||.+..+.++....... T Consensus 5 ~Vg~vi~n~~~~vl~~~R~~~---~~~W~~P~G~i~~gE~~~~aa~ 47 (147) T cd03671 5 NVGVVLFNEDGKVFVGRRIDT---PGAWQFPQGGIDEGEDPEQAAL 47 (147) T ss_pred EEEEEEEECCCEEEEEEECCC---CCCCCCCEEEECCCCCHHHHHH T ss_conf 789999928994999996899---8888798002479999999999 No 66 >cd03428 Ap4A_hydrolase_human_like Diadenosine tetraphosphate (Ap4A) hydrolase is a member of the Nudix hydrolase superfamily. Ap4A hydrolases are well represented in a variety of prokaryotic and eukaryotic organisms. Phylogenetic analysis reveals two distinct subgroups where plant enzymes fall into one subfamily and fungi/animals/archaea enzymes, represented by this subfamily, fall into another. Bacterial enzymes are found in both subfamilies. Ap4A is a potential by-product of aminoacyl tRNA synthesis, and accumulation of Ap4A has been implicated in a range of biological events, such as DNA replication, cellular differentiation, heat shock, metabolic stress, and apoptosis. Ap4A hydrolase cleaves Ap4A asymmetrically into ATP and AMP. It is important in the invasive properties of bacteria and thus presents a potential target for inhibition of such invasive bacteria. Besides the signature nudix motif (G[X5]E[X7]REUXEEXGU, where U is Ile, Leu, or Val) that functions as a metal binding and Probab=96.59 E-value=0.022 Score=37.25 Aligned_cols=96 Identities=11% Similarity=0.067 Sum_probs=55.0 Q ss_pred CCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCC------CCEECCCEEEEEEE----EEEEEEEEEEECC Q ss_conf 7605776457201320474421111346721588863010100------00213735899831----5999999999807 Q gi|254780479|r 249 DNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTA------NWILCNTITHTFTH----FTLTLFVWKTIVP 318 (356) Q Consensus 249 ~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~------~~~~l~~ikH~fTH----~~L~i~v~~~~~~ 318 (356) +-++||.+|.. |.|.||.+..+.++............. .......+.+.|.+ ..-.+..|.+.++ T Consensus 16 ~~~vLLi~~~~-----g~W~~P~G~ie~gEs~~~aA~REv~EEtGl~~~~~~~~~~~~~~~~~~~~~~~~~v~~fl~~~~ 90 (130) T cd03428 16 EIEYLLLQASY-----GHWDFPKGHVEPGEDDLEAALRETEEETGITAEQLFIVLGFKETLNYQVRGKLKTVTYFLAELR 90 (130) T ss_pred CEEEEEEECCC-----CCEECCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCEEEEEECCCCCEEEEEEEEEEEEE T ss_conf 48999999189-----9689990504899599999999999987987078785174789998168978999999999981 Q ss_pred CC------CCCCCCEEECHHHHHHC-CCCHHHHHHHHHC Q ss_conf 76------76678133227888747-9987899999844 Q gi|254780479|r 319 QI------VIIPDSTWHDAQNLANA-ALPTVMKKALSAG 350 (356) Q Consensus 319 ~~------~~~~~~~Wv~~~el~~~-~LPs~~kKIL~al 350 (356) .. ....+++|++.++..++ ..+. .+.||+.. T Consensus 91 ~~~~~~~~~E~~d~~W~~~~eA~~~l~~~~-~r~il~~a 128 (130) T cd03428 91 PDVEVKLSEEHQDYRWLPYEEALKLLTYED-LKAVLDKA 128 (130) T ss_pred CCCCCCCCCCEEEEEEECHHHHHHHCCCHH-HHHHHHHH T ss_conf 786789982156789983999998839973-99999996 No 67 >cd03426 CoAse Coenzyme A pyrophosphatase (CoAse), a member of the Nudix hydrolase superfamily, functions to catalyze the elimination of oxidized inactive CoA, which can inhibit CoA-utilizing enzymes. The need of CoAses mainly arises under conditions of oxidative stress. CoAse has a conserved Nudix fold and requires a single divalent cation for catalysis. In addition to a signature Nudix motif G[X5]E[X7]REUXEEXGU, where U is Ile, Leu, or Val, CoAse contains an additional motif upstream called the NuCoA motif (LLTXT(SA)X3RX3GX3FPGG) which is postulated to be involved in CoA recognition. CoA plays a central role in lipid metabolism. It is involved in the initial steps of fatty acid sythesis in the cytosol, in the oxidation of fatty acids and the citric acid cycle in the mitochondria, and in the oxidation of long-chain fatty acids in peroxisomes. CoA has the important role of activating fatty acids for further modification into key biological signalling molecules. Probab=96.48 E-value=0.012 Score=39.28 Aligned_cols=87 Identities=16% Similarity=0.143 Sum_probs=56.1 Q ss_pred CCEEEEEECCCC-HHHHCCCCCCCCCCCCC-CCHH------HHHHCCCCCCCCEECCCEEEEEEEEEEEEEEEEEECCCC Q ss_conf 760577645720-13204744211113467-2158------886301010000213735899831599999999980776 Q gi|254780479|r 249 DNRILLRKRTNT-RLLEGMDELPGSAWSST-KDGN------IDTHSAPFTANWILCNTITHTFTHFTLTLFVWKTIVPQI 320 (356) Q Consensus 249 ~~kiLL~KRp~~-gll~GLwEFP~~e~~~~-~~~~------~~~~~~~~~~~~~~l~~ikH~fTH~~L~i~v~~~~~~~~ 320 (356) +-++||.||+.. +-++|-+.||++..+.+ ++.. ..++..........++.+...+|...+.+++|.+.++.. T Consensus 15 ~~~ill~kRs~~l~~h~GqisfPGG~~e~~D~~~~~tAlREt~EEiGl~~~~~~~lg~L~~~~t~s~~~V~PfV~~~~~~ 94 (157) T cd03426 15 ELRVLLTKRASHLRSHPGQVAFPGGKVDPGDEDPVATALREAEEEIGLPPDSVEVLGRLPPYYTRSGFVVTPVVGLVPPP 94 (157) T ss_pred CEEEEEEEECCCCCCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCEEEEEEEEECCC T ss_conf 67999998267667789848379960276653389999999999868991133898634871325984899999994798 Q ss_pred C-------CCCCCEEECHHHHH Q ss_conf 7-------66781332278887 Q gi|254780479|r 321 V-------IIPDSTWHDAQNLA 335 (356) Q Consensus 321 ~-------~~~~~~Wv~~~el~ 335 (356) . +.+...|++++++- T Consensus 95 ~~~~~~~~EV~~vf~vPL~~ll 116 (157) T cd03426 95 LPLVLNPDEVAEVFEVPLSFLL 116 (157) T ss_pred CCCCCCHHHHHHHHHCCHHHHH T ss_conf 7657984897888140599973 No 68 >PRK03759 isopentenyl-diphosphate delta-isomerase; Provisional Probab=96.25 E-value=0.048 Score=34.94 Aligned_cols=101 Identities=12% Similarity=0.089 Sum_probs=52.1 Q ss_pred CCCEEEEEEEEEECCCEEEEEECCCCH-HHHCCCCCCCCCCC-CCCCH------HHHHHCCCCCCCC-EECCCEEEEEE- Q ss_conf 221123200034137605776457201-32047442111134-67215------8886301010000-21373589983- Q gi|254780479|r 235 RPMRTGAVFIAITNDNRILLRKRTNTR-LLEGMDELPGSAWS-STKDG------NIDTHSAPFTANW-ILCNTITHTFT- 304 (356) Q Consensus 235 k~~r~~~~~vii~~~~kiLL~KRp~~g-ll~GLwEFP~~e~~-~~~~~------~~~~~~~~~~~~~-~~l~~ikH~fT- 304 (356) ...|.+.++ +.+.+|++||+||...+ .+.|+|..-..... .++.. ...++........ ..++.+.+.-. T Consensus 32 ~lHRafsv~-l~~~~g~lllq~Rs~~K~~~pg~w~~s~~gH~~~gE~~~~aa~R~l~EElGi~~~~l~~~~~~f~Y~a~~ 110 (184) T PRK03759 32 PLHLAFSCY-LFDADGRLLVTRRALSKKTWPGVWTNSCCGHPQPGESLEDAIIRRCREELGVEITDLTLVLPDFRYRATD 110 (184) T ss_pred CEEEEEEEE-EECCCCCEEEEECCCCCCCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEEEEECCC T ss_conf 538899999-9959996999971589737999400354467899989999999999998698834440550038997238 Q ss_pred -----EEEEEEEEEEEECC-----CCCCCCCCEEECHHHHHHC Q ss_conf -----15999999999807-----7676678133227888747 Q gi|254780479|r 305 -----HFTLTLFVWKTIVP-----QIVIIPDSTWHDAQNLANA 337 (356) Q Consensus 305 -----H~~L~i~v~~~~~~-----~~~~~~~~~Wv~~~el~~~ 337 (356) -+. ..+||.+..+ ++.+..+.+|++.++|... T Consensus 111 ~~g~~E~E-~~~Vf~~~~d~~~~pnp~EV~e~~Wv~~~eL~~~ 152 (184) T PRK03759 111 PNGIVENE-VCPVFAARATSALQPNPDEVMDYQWVDPADLLRA 152 (184) T ss_pred CCCCCEEE-EEEEEEEEECCCCCCCHHHHEEEEEECHHHHHHH T ss_conf 99862257-9999999848989899689134689869999999 No 69 >cd02885 IPP_Isomerase Isopentenyl diphosphate (IPP) isomerase, a member of the Nudix hydrolase superfamily, is a key enzyme in the isoprenoid biosynthetic pathway. Isoprenoids comprise a large family of natural products including sterols, carotenoids, dolichols and prenylated proteins. These compounds are synthesized from two precursors: isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP). IPP isomerase catalyzes the interconversion of IPP and DMAPP by a stereoselective antarafacial transposition of hydrogen. The enzyme requires one Mn2+ or Mg2+ ion in its active site to fold into an active conformation and also contains the Nudix motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), that functions as a metal binding and catalytic site. The metal binding site is present within the active site and plays structural and catalytical roles. IPP isomerase is well represented in several bacteria, archaebacteria and eukaryotes, including fungi, mamm Probab=96.17 E-value=0.062 Score=34.10 Aligned_cols=101 Identities=11% Similarity=0.024 Sum_probs=50.8 Q ss_pred CCEEEEEEEEEECCCEEEEEECCCC-HHHHCCCCCCCCC-CCCCCCHHH------HHHCCCCCCCC-EECCCEEE--EEE Q ss_conf 2112320003413760577645720-1320474421111-346721588------86301010000-21373589--983 Q gi|254780479|r 236 PMRTGAVFIAITNDNRILLRKRTNT-RLLEGMDELPGSA-WSSTKDGNI------DTHSAPFTANW-ILCNTITH--TFT 304 (356) Q Consensus 236 ~~r~~~~~vii~~~~kiLL~KRp~~-gll~GLwEFP~~e-~~~~~~~~~------~~~~~~~~~~~-~~l~~ikH--~fT 304 (356) ..+...++ +.+.+|++||+||+.. .++.|+|..-.+. ...++.... .++........ ..+..+.+ .+. T Consensus 29 ~Hrav~v~-v~n~~g~lllqkRs~~K~~~Pg~wd~s~~Gh~~~GE~~~~aa~Rel~EElGi~~~~~~~~~~~~~y~~~~~ 107 (165) T cd02885 29 LHRAFSVF-LFNSKGRLLLQRRALSKYTFPGLWTNTCCSHPLPGEGVKDAAQRRLREELGITGDLLELVLPRFRYRAPDD 107 (165) T ss_pred EEEEEEEE-EECCCCCEEEEEECCCCCCCCCEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEEEEECCCC T ss_conf 48999999-99089979999857988689970735543467699758999999888861998543204666899932248 Q ss_pred EE---EEEEEEEEEECC-----CCCCCCCCEEECHHHHHHC Q ss_conf 15---999999999807-----7676678133227888747 Q gi|254780479|r 305 HF---TLTLFVWKTIVP-----QIVIIPDSTWHDAQNLANA 337 (356) Q Consensus 305 H~---~L~i~v~~~~~~-----~~~~~~~~~Wv~~~el~~~ 337 (356) .. +-..++|.+..+ ++....+++|++.++|... T Consensus 108 ~~~~e~E~~~vy~~~~d~~~~~~~~EV~~~~w~~~~el~~~ 148 (165) T cd02885 108 GGLVEHEIDHVFFARADVTLIPNPDEVSEYRWVSLEDLKEL 148 (165) T ss_pred CCCCCEEEEEEEEEEECCCCCCCHHHHEEEEEECHHHHHHH T ss_conf 98742379999999658987699579238899789999999 No 70 >PRK01229 N-glycosylase/DNA lyase; Provisional Probab=96.13 E-value=0.0075 Score=40.59 Aligned_cols=115 Identities=21% Similarity=0.213 Sum_probs=76.5 Q ss_pred HHHHHHHHHH----HHCCHHHCCHHHHHHHHHCCCHHHHHCCCHHHHHHHHHCC--CHH-HHHHHHHHHHHHH------H Q ss_conf 4676644673----1161203326589987748997898729989999997404--634-5889998776754------3 Q gi|254780479|r 40 YKVWISEIML----QQTTVKTVEPYFKKFMQKWPTIFCLSSAKDEEILSAWAGL--GYY-TRARNLKKCADII------V 106 (356) Q Consensus 40 y~v~vseiml----qqT~v~~v~~~~~~~~~~~P~~~~la~a~~~~vl~~w~gL--Gyy-~Rar~l~~~a~~i------~ 106 (356) +.=|-||... -||.+...+..- +-+. +.+...+++++...-... -|| .||+++..+-+++ . T Consensus 34 e~~iF~EL~FCILTanssA~~~~ka~-~~l~-----~g~~~~~~eel~~~l~~~g~RF~n~rAkyIv~aR~~~~~lk~i~ 107 (208) T PRK01229 34 EEDLFSELSFCILTANSSAEGGIKAQ-KEIG-----DGFLYLSEEELREKLKEVGHRFPNLRAEYIVEARKLIGKLKEII 107 (208) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHH-HHHH-----HHCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 88999999999938874077899999-9987-----40025999999999998496784226899999999988999999 Q ss_pred HCCCCCCCCHHHHHHH-HHHHHHHHHHHHH-HHHCCCEEEECCCCHHHHHHHHHCCCC Q ss_conf 0057868731456888-8877787766776-643496313124314666576519754 Q gi|254780479|r 107 KKYEGNFPHKVEILKK-LPGIGDYTASAIV-AIAFNHFAVVVDTNIERIISRYFDIIK 162 (356) Q Consensus 107 ~~~~g~~P~~~~~l~~-LpGiG~yta~ai~-s~a~~~~~~~vD~Nv~RVl~R~~~~~~ 162 (356) +.. ....+..+-|.+ ..|+|-..|+-.+ .++| ...+++|-||.|.+.++.-++. T Consensus 108 ~~~-~~~~e~Re~Lv~nIKG~G~KEASHFLRNiG~-~dlAIlDrHILr~l~~~g~i~~ 163 (208) T PRK01229 108 KAD-KDQFEAREFLVKNIKGIGYKEASHFLRNVGF-EDLAILDRHILRFLKRYGLIKE 163 (208) T ss_pred HCC-CCHHHHHHHHHHHCCCCCHHHHHHHHHHCCC-CHHHHHHHHHHHHHHHCCCCCC T ss_conf 706-8778999999985647667999999996585-2357769999999998289765 No 71 >cd04692 Nudix_Hydrolase_33 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=96.12 E-value=0.063 Score=34.10 Aligned_cols=99 Identities=15% Similarity=0.145 Sum_probs=54.0 Q ss_pred EEEEEEEEEEC---CCEEEEEECCCCH-HHHCCCCCCC-CCCCCCCCHH------HHHHCCCCC--CCCEECCCEEEEEE Q ss_conf 12320003413---7605776457201-3204744211-1134672158------886301010--00021373589983 Q gi|254780479|r 238 RTGAVFIAITN---DNRILLRKRTNTR-LLEGMDELPG-SAWSSTKDGN------IDTHSAPFT--ANWILCNTITHTFT 304 (356) Q Consensus 238 r~~~~~vii~~---~~kiLL~KRp~~g-ll~GLwEFP~-~e~~~~~~~~------~~~~~~~~~--~~~~~l~~ikH~fT 304 (356) +...++++ ++ ++++||+||...+ .+.|+|+... +-...++... +.++..... .....++.+.+.+. T Consensus 3 ra~h~~v~-~~~~~~g~illqkRs~~K~~~Pg~wd~s~~Ghv~~gEt~~~aa~REl~EElGl~~~~~~l~~lg~~~~~~~ 81 (144) T cd04692 3 RTFHCWII-TKDEGKGYVLLQKRSANKKTYPGLWDISSAGHILAGETPLEDGIRELEEELGLDVSADDLIPLGTFKIEYD 81 (144) T ss_pred EEEEEEEE-ECCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCHHHCEEEEEEEEEEE T ss_conf 38999999-66799528999978899878998311665750457999899999899998688767688188218999974 Q ss_pred EEE-----EEEEEEEEECC--------CCCCCCCCEEECHHHHHHC Q ss_conf 159-----99999999807--------7676678133227888747 Q gi|254780479|r 305 HFT-----LTLFVWKTIVP--------QIVIIPDSTWHDAQNLANA 337 (356) Q Consensus 305 H~~-----L~i~v~~~~~~--------~~~~~~~~~Wv~~~el~~~ 337 (356) +.. -..++|.+..+ +.....+.+|++.+++..+ T Consensus 82 ~~~~~~d~e~~~vy~~~~~~~~~~~~~~~~EV~~~~wv~~~el~~~ 127 (144) T cd04692 82 HIGKLIDREFHHVYLYELKVPLEEFTLQKEEVAGVVLIPLDEFAEL 127 (144) T ss_pred CCCCCEEEEEEEEEEEEECCCCCCCCCCHHHHHEEEEECHHHHHHH T ss_conf 4998172089999999968984567899788105999879999999 No 72 >TIGR03252 uncharacterized HhH-GPD family protein. This model describes a small, well-conserved bacterial protein family. Its sequence largely consists of a domain, HhH-GPD, found in a variety of related base excision DNA repair enzymes (see pfam00730). Probab=96.03 E-value=0.033 Score=36.05 Aligned_cols=103 Identities=17% Similarity=0.312 Sum_probs=70.4 Q ss_pred CCCCHHHHHHHHHHHHHCCHHHCCHHHHHHHHHC--CCHHHHHCCCHHHHHHHHHC---CCHH--HHHHHHHHHHHHHHH Q ss_conf 6787046766446731161203326589987748--99789872998999999740---4634--588999877675430 Q gi|254780479|r 35 SLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQKW--PTIFCLSSAKDEEILSAWAG---LGYY--TRARNLKKCADIIVK 107 (356) Q Consensus 35 ~~~~py~v~vseimlqqT~v~~v~~~~~~~~~~~--P~~~~la~a~~~~vl~~w~g---LGyy--~Rar~l~~~a~~i~~ 107 (356) -.+||...+|-=.+=||-.++..-.==.+..+|+ -+...+|.++.|+...+.+. +-=| |=|+++|+.|++|++ T Consensus 14 L~~~p~AlLiGMLLDQQvPmE~AF~GP~~l~~RlG~lD~~~IA~~Dpd~f~~l~~e~PAiHRfP~SMA~Riq~l~~~iv~ 93 (177) T TIGR03252 14 LSSDPFALLTGMLLDQQVPMERAFAGPHKIARRMGSLDAEDIAKYDPQAFVALFSERPAVHRFPGSMAKRVQALAQYVVD 93 (177) T ss_pred HCCCCHHHHHHHHHHCCCHHHHHHCCHHHHHHHHCCCCHHHHHHCCHHHHHHHHCCCCCHHHCCHHHHHHHHHHHHHHHH T ss_conf 71392899999998233429998618599998837999899981799999999767961421728899999999999999 Q ss_pred CCCCC--------CCCH---HHHHHHHHHHHHHHHHHHHHH Q ss_conf 05786--------8731---456888887778776677664 Q gi|254780479|r 108 KYEGN--------FPHK---VEILKKLPGIGDYTASAIVAI 137 (356) Q Consensus 108 ~~~g~--------~P~~---~~~l~~LpGiG~yta~ai~s~ 137 (356) +|+|. .|+- +..|..|||.|.--|.-.++. T Consensus 94 ~YdGda~~vW~~g~~dg~ell~Rl~~LPGFG~qKA~IFlAL 134 (177) T TIGR03252 94 TYDGDATAVWTEGDPDGKELLRRLKALPGFGKQKAKIFLAL 134 (177) T ss_pred HHCCCHHHHHCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHH T ss_conf 80894988740679989999999986799619999999999 No 73 >PRK00714 dinucleoside polyphosphate hydrolase; Reviewed Probab=95.94 E-value=0.026 Score=36.81 Aligned_cols=107 Identities=7% Similarity=0.096 Sum_probs=58.2 Q ss_pred EEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCC------EECCC----EEEEE----- Q ss_conf 2320003413760577645720132047442111134672158886301010000------21373----58998----- Q gi|254780479|r 239 TGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANW------ILCNT----ITHTF----- 303 (356) Q Consensus 239 ~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~------~~l~~----ikH~f----- 303 (356) -....+++|.+|++|+.||.. .|.|.||.+..+.+++.......+..++.. ..++. +.+-| T Consensus 9 ~nVg~vi~n~~g~vl~~~R~~----~~~Wq~PqGgid~gE~~~~aa~REl~EE~Gi~~~~v~ii~~~~~wl~Yd~P~~~~ 84 (156) T PRK00714 9 PNVGIILLNRQGQVFWGRRIG----QHSWQFPQGGIDPGETPEQAMYRELYEEVGLRPEDVEILAETRDWLRYDLPKRLV 84 (156) T ss_pred CCEEEEEEECCCCEEEEEECC----CCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCEEEECCHHHH T ss_conf 763999990999299999889----9976799450079989999999999998589820004576258838985817764 Q ss_pred -----EEEEEEEEEE---E-------EECC--CCCCCCCCEEECHHHHHHCCCC---HHHHHHHHH Q ss_conf -----3159999999---9-------9807--7676678133227888747998---789999984 Q gi|254780479|r 304 -----THFTLTLFVW---K-------TIVP--QIVIIPDSTWHDAQNLANAALP---TVMKKALSA 349 (356) Q Consensus 304 -----TH~~L~i~v~---~-------~~~~--~~~~~~~~~Wv~~~el~~~~LP---s~~kKIL~a 349 (356) .+|.=..+.| . ++++ ...+.++.+|++.+++-+.-+| .+.+++|+. T Consensus 85 ~~~~~~~y~GQ~QkWfl~rf~g~d~~Inl~~~~~pEF~~wkW~~~~~~~~~vv~FKr~vY~~vl~e 150 (156) T PRK00714 85 RRDWKPVCIGQKQKWFLLRLTGDDSEINLNTSHHPEFDAWRWVSYWYPLRQVVPFKRDVYRRVLKE 150 (156) T ss_pred HHHCCCCCCCCEEEEEEEEEECCCCCEECCCCCCCCEEEEEECCHHHHHHHCCCCCHHHHHHHHHH T ss_conf 243058702636899999981798517668999986233475888997776333189999999999 No 74 >smart00525 FES FES domain. iron-sulpphur binding domain in DNA-(apurinic or apyrimidinic site) lyase (subfamily of ENDO3) Probab=95.74 E-value=0.0032 Score=43.16 Aligned_cols=23 Identities=39% Similarity=1.039 Sum_probs=21.2 Q ss_pred HCCCCCCCCCCCCCCCCHHHHHC Q ss_conf 20412356431100000134310 Q gi|254780479|r 198 ICTSNKPLCPLCPIQKNCLTFSE 220 (356) Q Consensus 198 iC~p~~P~C~~Cpl~~~C~~~~~ 220 (356) ||+|++|+|+.||++++|.++.. T Consensus 1 iC~~~kP~C~~Cpl~~~C~~~~~ 23 (26) T smart00525 1 ICTARKPRCDECPLKDLCPAYXX 23 (26) T ss_pred CCCCCCCCCCCCCCHHHCCHHHC T ss_conf 96566887676821210603331 No 75 >cd04686 Nudix_Hydrolase_27 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=95.46 E-value=0.11 Score=32.27 Aligned_cols=89 Identities=15% Similarity=0.135 Sum_probs=51.9 Q ss_pred EEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCC-------CCCEECCCEEE-------EEEEEEE Q ss_conf 003413760577645720132047442111134672158886301010-------00021373589-------9831599 Q gi|254780479|r 243 FIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFT-------ANWILCNTITH-------TFTHFTL 308 (356) Q Consensus 243 ~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~-------~~~~~l~~ikH-------~fTH~~L 308 (356) .|++ ++++|||.+++.. |-|.||++..+.+++.......+... .....++.+.. -+..+++ T Consensus 5 aIi~-~~~kiLL~~~~~~----~~~~lPGG~ve~gE~~~~al~RE~~EETG~~~i~~~~~~G~~~E~r~~~~~~~~~~~m 79 (131) T cd04686 5 AIIL-QGDKILLLYTKRY----GDYKFPGGGVEKGEDHIEGLIRELQEETGATNIRVIEKFGTYTERRPWRKPDADIFHM 79 (131) T ss_pred EEEE-ECCEEEEEEECCC----CCEECCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCCCEEEEEEEEEEECCCCCEEEE T ss_conf 9999-8999999997699----9676787737899898999999999874961013131145599984100588877989 Q ss_pred EEEEEEEECCCCCC---CC--------CCEEECHHHHHH Q ss_conf 99999998077676---67--------813322788874 Q gi|254780479|r 309 TLFVWKTIVPQIVI---IP--------DSTWHDAQNLAN 336 (356) Q Consensus 309 ~i~v~~~~~~~~~~---~~--------~~~Wv~~~el~~ 336 (356) ..++|.|++..... .. .-.|+++++.-. T Consensus 80 ~s~~Y~C~v~~~~g~~~le~~E~~~G~~pvWv~i~eAi~ 118 (131) T cd04686 80 ISYYYLCEVDAELGAQQLEDYEAELGMKPIWINIHEAIE 118 (131) T ss_pred EEEEEEEEECHHCCCCCCCCHHHHCCCCEEECCHHHHHH T ss_conf 878999985122189877603442697217678999999 No 76 >PRK10707 hypothetical protein; Provisional Probab=95.16 E-value=0.16 Score=31.25 Aligned_cols=99 Identities=16% Similarity=0.110 Sum_probs=56.8 Q ss_pred CEEEEEEE--EEECCCEEEEEECCCC-HHHHCCCCCCCCCCCCCCCH-------HHHHHCCCCCCCCEECCCEEEEEEEE Q ss_conf 11232000--3413760577645720-13204744211113467215-------88863010100002137358998315 Q gi|254780479|r 237 MRTGAVFI--AITNDNRILLRKRTNT-RLLEGMDELPGSAWSSTKDG-------NIDTHSAPFTANWILCNTITHTFTHF 306 (356) Q Consensus 237 ~r~~~~~v--ii~~~~kiLL~KRp~~-gll~GLwEFP~~e~~~~~~~-------~~~~~~~~~~~~~~~l~~ikH~fTH~ 306 (356) .|.-.++| +.+.+..+||.+|+.. .-.+|...||++..+..+.. +..++..........++.+....|.. T Consensus 29 ~r~AAVLvpl~~~~~~~iLlt~Rs~~L~~H~GqIsFPGGk~e~~D~~~~~TALREt~EEiGl~~~~v~vlG~L~~~~t~s 108 (190) T PRK10707 29 QRQAAVLIPIVRRPQPGLLLTQRAIHLRKHAGQVAFPGGAVDDTDASLIAAALREAEEEVAIPPSAVEVIGVLPPVDSVT 108 (190) T ss_pred CCCEEEEEEEEECCCCEEEEEECCCCCCCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCEECCC T ss_conf 86537999999889958999975653567999884899676888888899999988898689943258986347356678 Q ss_pred EEEEEEEEEECCCCC-------CCCCCEEECHHHHH Q ss_conf 999999999807767-------66781332278887 Q gi|254780479|r 307 TLTLFVWKTIVPQIV-------IIPDSTWHDAQNLA 335 (356) Q Consensus 307 ~L~i~v~~~~~~~~~-------~~~~~~Wv~~~el~ 335 (356) ...+.++.+.++... +.++..|++++++- T Consensus 109 g~~V~P~Vg~i~~~~~~~~~~~EV~~vf~vPL~~ll 144 (190) T PRK10707 109 GYQVTPVVGIIPPNLPYRASEDEVAAVFEMPLAEAL 144 (190) T ss_pred CCEEEEEEEEECCCCCCCCCHHHHHHHEECCHHHHH T ss_conf 988999999977887778984884723554199962 No 77 >cd04685 Nudix_Hydrolase_26 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily requires a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=95.12 E-value=0.13 Score=31.86 Aligned_cols=96 Identities=15% Similarity=0.110 Sum_probs=54.5 Q ss_pred EEEEECCCEEEEEECCCC-HHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEEC---C----CEEEEEE----EEEEEE Q ss_conf 003413760577645720-132047442111134672158886301010000213---7----3589983----159999 Q gi|254780479|r 243 FIAITNDNRILLRKRTNT-RLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILC---N----TITHTFT----HFTLTL 310 (356) Q Consensus 243 ~vii~~~~kiLL~KRp~~-gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l---~----~ikH~fT----H~~L~i 310 (356) .++++.+|++||.+|... .--.+.|.+|++..+.++........+......... + .....|+ +++-.- T Consensus 5 vvl~d~~grvLL~r~~~~~~~~~~~W~~pgG~~~~gE~~~~a~~RE~~EE~G~~~~~~~~~v~~r~~~f~~~g~~~~~~E 84 (133) T cd04685 5 VVLLDPDDRVLLLRGDDPDSPGPDWWFTPGGGVEPGESPEQAARRELREETGITVADLGPPVWRRDAAFTFLGVDGRQEE 84 (133) T ss_pred EEEECCCCCEEEEEECCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHCCEECCCCCCEEEEEEEEECCCCCCCCEE T ss_conf 99986999999997447899999918599874889869899999999987597800244637999678703787100068 Q ss_pred EEEEEECCC------------CCCCCCCEEECHHHHHHCC Q ss_conf 999998077------------6766781332278887479 Q gi|254780479|r 311 FVWKTIVPQ------------IVIIPDSTWHDAQNLANAA 338 (356) Q Consensus 311 ~v~~~~~~~------------~~~~~~~~Wv~~~el~~~~ 338 (356) .+|.+.... .....+.+|.+.+||...+ T Consensus 85 ~fflvr~~~~e~~~~~~t~~E~~~i~e~RWws~~EL~~t~ 124 (133) T cd04685 85 RFFLARTPRTEPSPAGWTALERRSILGWRWWTRAELAATP 124 (133) T ss_pred EEEEEEECCCEECCCCCCHHHHHHCCCCCCCCHHHHHHCC T ss_conf 9999994686035766766664010255258899993279 No 78 >PRK05379 bifunctional nicotinamide mononucleotide adenylyltransferase/ADP-ribose pyrophosphatase; Provisional Probab=95.00 E-value=0.11 Score=32.29 Aligned_cols=109 Identities=13% Similarity=0.065 Sum_probs=58.2 Q ss_pred EEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEE------CCCE--EEEEEEE---- Q ss_conf 232000341376057764572013204744211113467215888630101000021------3735--8998315---- Q gi|254780479|r 239 TGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWIL------CNTI--THTFTHF---- 306 (356) Q Consensus 239 ~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~------l~~i--kH~fTH~---- 306 (356) ...-.|++ .+|++||.+|... -..|+|.||++-.+.++........+..++.... .+.+ .++|+|- T Consensus 203 vT~DaVVv-~~ghVLLIrR~~~-P~kG~WALPGGfv~~~Etl~~aa~REL~EETgl~~~~~~l~~~l~~~~vfd~P~Rdp 280 (340) T PRK05379 203 VTVDAVVV-QSGHVLLVRRRAE-PGKGLWALPGGFVGQDETLLDACLRELREETGLKLPEPVLRGSIRDQQVFDHPGRSL 280 (340) T ss_pred EEEEEEEE-ECCEEEEEEECCC-CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCHHHCCCCCCCCCCCCCCCC T ss_conf 65316999-8898999941899-988887589886999999899999988641268876203220012432478999999 Q ss_pred --EEEEEEEEEECCCC--------CCCCCCEEECHHHHHHCCCCH--HHHHHHHH Q ss_conf --99999999980776--------766781332278887479987--89999984 Q gi|254780479|r 307 --TLTLFVWKTIVPQI--------VIIPDSTWHDAQNLANAALPT--VMKKALSA 349 (356) Q Consensus 307 --~L~i~v~~~~~~~~--------~~~~~~~Wv~~~el~~~~LPs--~~kKIL~a 349 (356) ..-.+.|.+.++.. ....+.+|++++++...+-.- =+..||+. T Consensus 281 RGrvvt~a~~~~l~~~~lp~v~agDDA~~A~W~pl~el~~~~~~~feDH~~Il~~ 335 (340) T PRK05379 281 RGRTITHAFLFEFPAGELPAVKGGDDADKARWVPLAELAAMRDRMFEDHFQIITH 335 (340) T ss_pred CCCEEEEEEEEEECCCCCCCCCCCCCHHHCEEEEHHHHHHCHHHHHCCHHHHHHH T ss_conf 8778999999972677667878998556577653999654777564409999999 No 79 >COG1051 ADP-ribose pyrophosphatase [Nucleotide transport and metabolism] Probab=94.97 E-value=0.37 Score=28.70 Aligned_cols=107 Identities=17% Similarity=0.204 Sum_probs=55.8 Q ss_pred EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCEE--EEE-------EEEEEEEE Q ss_conf 200034137605776457201320474421111346721588863010100002137358--998-------31599999 Q gi|254780479|r 241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTIT--HTF-------THFTLTLF 311 (356) Q Consensus 241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~ik--H~f-------TH~~L~i~ 311 (356) .+.+++..++++||.||... =..|.|.||++..+.++........+..++.......+. -+| ..+.+.+. T Consensus 12 ~v~~~i~~~~~iLLvrR~~~-p~~g~WalPGG~ve~GEt~eeaa~REl~EETgL~~~~~~~~~v~~~~~rd~r~~~v~~~ 90 (145) T COG1051 12 AVGALIVRNGRILLVRRANE-PGAGYWALPGGFVEIGETLEEAARRELKEETGLRVRVLELLAVFDDPGRDPRGHHVSFL 90 (145) T ss_pred EEEEEEECCCEEEEEEECCC-CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCEEEEEEE T ss_conf 57899942885999986688-88986818866278998889999999999878865430688885488878852699999 Q ss_pred EEEEECCC-CC-----CCCCCEEECHHHHHHCCCC--HHHHHHHH Q ss_conf 99998077-67-----6678133227888747998--78999998 Q gi|254780479|r 312 VWKTIVPQ-IV-----IIPDSTWHDAQNLANAALP--TVMKKALS 348 (356) Q Consensus 312 v~~~~~~~-~~-----~~~~~~Wv~~~el~~~~LP--s~~kKIL~ 348 (356) ++.+.... .. ....-.|++.+++....+| ..+..+.. T Consensus 91 ~~~~~~~g~~~~~~~~d~~~~~~~~~~~l~~~~~~~~~~~~~l~~ 135 (145) T COG1051 91 FFAAEPEGELLAGDGDDAAEVGWFPLDELPELPLPLAFTLADLRR 135 (145) T ss_pred EEEECCCCCCCCCCHHHHHHCCEECHHHCCCCCCCCCCCHHHHHH T ss_conf 999714786666772647627526454554544431022999999 No 80 >cd04674 Nudix_Hydrolase_16 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate Probab=94.74 E-value=0.11 Score=32.31 Aligned_cols=44 Identities=16% Similarity=0.086 Sum_probs=30.6 Q ss_pred EEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCC Q ss_conf 034137605776457201320474421111346721588863010 Q gi|254780479|r 244 IAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAP 288 (356) Q Consensus 244 vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~ 288 (356) .++..++++||.||... --.|+|.||++..+.++........+. T Consensus 9 ~~~~~~d~vLl~rR~~~-P~~G~w~lPgGf~e~gEt~~~aa~RE~ 52 (118) T cd04674 9 ALLPVDDGLLVIRRGIE-PGRGKLALPGGFIELGETWQDAVAREL 52 (118) T ss_pred EEEEECCEEEEEEECCC-CCCCCCCCCEEECCCCCCHHHHHHHHH T ss_conf 99997999999985999-888858487537549999999999999 No 81 >cd04665 Nudix_Hydrolase_8 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate s Probab=94.51 E-value=0.18 Score=30.92 Aligned_cols=48 Identities=15% Similarity=0.197 Sum_probs=33.9 Q ss_pred EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCC Q ss_conf 20003413760577645720132047442111134672158886301010000 Q gi|254780479|r 241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANW 293 (356) Q Consensus 241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~ 293 (356) .+.++-..++++||.+++..| |+||++..+.++.......++...+.. T Consensus 2 sV~vi~~~~~~~Ll~~~~~rg-----w~~PGG~~e~gEt~~~aa~RE~~EEtG 49 (118) T cd04665 2 SVLVICFYDDGLLLVRHKDRG-----WEFPGGHVEPGETIEEAARREVWEETG 49 (118) T ss_pred EEEEEEEECCEEEEEEECCCC-----EECCCCCCCCCCCHHHHHHHHHHHHHC T ss_conf 599999789999999928997-----879830417999999999998877329 No 82 >pfam05559 DUF763 Protein of unknown function (DUF763). This family consists of several uncharacterized bacterial and archaeal proteins of unknown function. Probab=94.47 E-value=0.041 Score=35.39 Aligned_cols=34 Identities=26% Similarity=0.345 Sum_probs=23.1 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHH---CCCEEEE Q ss_conf 687314568888877787766776643---4963131 Q gi|254780479|r 112 NFPHKVEILKKLPGIGDYTASAIVAIA---FNHFAVV 145 (356) Q Consensus 112 ~~P~~~~~l~~LpGiG~yta~ai~s~a---~~~~~~~ 145 (356) .=|+|+++|+.+||||+.|..|++-+| ||.|.-- T Consensus 263 ~~p~dfeeLLl~~GvGp~TlRALaLvaElIyg~p~s~ 299 (319) T pfam05559 263 RNPEDFEELLLLKGVGPSTLRALALVAEVIYGTPPSF 299 (319) T ss_pred CCCCCHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCC T ss_conf 6801699997147988899999999999980899886 No 83 >cd03672 Dcp2p mRNA decapping enzyme 2 (Dcp2p), the catalytic subunit, and Dcp1p are the two components of the decapping enzyme complex. Decapping is a key step in both general and nonsense-mediated 5'-3' mRNA-decay pathways. Dcp2p contains an all-alpha helical N-terminal domain and a C-terminal domain which has the Nudix fold. While decapping is not dependent on the N-terminus of Dcp2p, it does affect its efficiency. Dcp1p binds the N-terminal domain of Dcp2p stimulating the decapping activity of Dcp2p. Decapping permits the degradation of the transcript and is a site of numerous control inputs. It is responsible for nonsense-mediated decay as well as AU-rich element (ARE)-mediated decay. In addition, it may also play a role in the levels of mRNA. Enzymes belonging to the Nudix superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and are recognized by a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V). Probab=94.20 E-value=0.24 Score=29.96 Aligned_cols=89 Identities=17% Similarity=0.208 Sum_probs=49.3 Q ss_pred EEEEECC-CEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCE------ECC-CEEEEEEEEEEEEEEEE Q ss_conf 0034137-605776457201320474421111346721588863010100002------137-35899831599999999 Q gi|254780479|r 243 FIAITND-NRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWI------LCN-TITHTFTHFTLTLFVWK 314 (356) Q Consensus 243 ~vii~~~-~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~------~l~-~ikH~fTH~~L~i~v~~ 314 (356) +++++.+ +++||.|+.. ++-|.||.+..+.++........+..++..- ... -|.+.+--..+.+.+.. T Consensus 6 aIiln~~~~kvLLVk~~~----~~~W~fPkGki~~~E~~~~~AiREv~EEtG~Dv~~~i~~~~~Ie~~~~~q~~rlyiv~ 81 (145) T cd03672 6 AIILNEDLDKVLLVKGWK----SKSWSFPKGKINKDEDDHDCAIREVYEETGFDISKYIDKDDYIELIIRGQNVKLYIVP 81 (145) T ss_pred EEEEECCCCEEEEEEECC----CCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHCCCCEEEEEEECCEEEEEEEEC T ss_conf 999918999899999438----9958599641689979899998887773275754215846069999789089999982 Q ss_pred EECC--------CCCCCCCCEEECHHHHHH Q ss_conf 9807--------767667813322788874 Q gi|254780479|r 315 TIVP--------QIVIIPDSTWHDAQNLAN 336 (356) Q Consensus 315 ~~~~--------~~~~~~~~~Wv~~~el~~ 336 (356) .++ ...+.+...|++.++|.. T Consensus 82 -gVp~dt~F~p~TRkEI~~I~W~~i~~Lp~ 110 (145) T cd03672 82 -GVPEDTPFEPKTRKEISKIEWFDIKDLPT 110 (145) T ss_pred -CCCCCCCCCCCCCCCEEEEEEEEHHHHHH T ss_conf -88888751788723276578889888444 No 84 >pfam10576 EndIII_4Fe-2S Iron-sulfur binding domain of endonuclease III. Escherichia coli endonuclease III (EC 4.2.99.18) is a DNA repair enzyme that acts both as a DNA N-glycosylase, removing oxidized pyrimidines from DNA, and as an apurinic/apyrimidinic (AP) endonuclease, introducing a single-strand nick at the site from which the damaged base was removed. Endonuclease III is an iron-sulfur protein that binds a single 4Fe-4S cluster. The 4Fe-4S cluster does not seem to be important for catalytic activity, but is probably involved in the proper positioning of the enzyme along the DNA strand. The 4Fe-4S cluster is bound by four cysteines which are all located in a 17 amino acid region at the C-terminal end of endonuclease III. A similar region is also present in the central section of mutY and in the C-terminus of ORF-10 and of the Micro-coccus UV endonuclease. Probab=93.38 E-value=0.025 Score=36.95 Aligned_cols=21 Identities=38% Similarity=0.999 Sum_probs=19.0 Q ss_pred CCCCCCCCCCCCCCCCHHHHH Q ss_conf 041235643110000013431 Q gi|254780479|r 199 CTSNKPLCPLCPIQKNCLTFS 219 (356) Q Consensus 199 C~p~~P~C~~Cpl~~~C~~~~ 219 (356) |+|++|+|+.|||++.|.... T Consensus 1 Ct~rkP~C~~Cpl~~~C~~~~ 21 (26) T pfam10576 1 CTARKPKCEECPLADLCXXXX 21 (26) T ss_pred CCCCCCCCCCCCHHHHHHHCC T ss_conf 987898766586898774101 No 85 >PRK00241 nudC NADH pyrophosphatase; Reviewed Probab=92.99 E-value=0.81 Score=26.28 Aligned_cols=115 Identities=11% Similarity=0.053 Sum_probs=69.4 Q ss_pred CCCCCCCCCCCCCCCEEEEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCEEEE Q ss_conf 00123443222222112320003413760577645720132047442111134672158886301010000213735899 Q gi|254780479|r 223 SHLLGINTIKKKRPMRTGAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTITHT 302 (356) Q Consensus 223 ~~~~P~kk~KKkk~~r~~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~ikH~ 302 (356) ...||...|- .. +++.+++++||.+.+. +=.|+|...-+-.+.++..+.....+..++....++.+++. T Consensus 126 ~~~yPri~Pa-------VI--vlV~~gd~iLLar~~~--~~~~~yS~lAGFvE~GEslE~aV~REV~EE~Gi~V~~i~Y~ 194 (257) T PRK00241 126 ERYYPRIAPC-------II--VAVRRGDEILLARHPR--HRNGVYTVLAGFVEVGESLEQCVAREVMEEVGIKVKNLRYV 194 (257) T ss_pred CCCCCCCCCE-------EE--EEEEECCEEEECCCCC--CCCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCEECCEEEE T ss_conf 6316887976-------99--9997299887614788--99985230100365785899975351023467287540175 Q ss_pred EEE-----EEEEEEEEEEEC------CCCCCCCCCEEECHHHHHHCCCC-HHHHHHHHH Q ss_conf 831-----599999999980------77676678133227888747998-789999984 Q gi|254780479|r 303 FTH-----FTLTLFVWKTIV------PQIVIIPDSTWHDAQNLANAALP-TVMKKALSA 349 (356) Q Consensus 303 fTH-----~~L~i~v~~~~~------~~~~~~~~~~Wv~~~el~~~~LP-s~~kKIL~a 349 (356) -|. ..|-+- |.+.. .+..+..+.+|++.+++..+|-+ |.-+++|++ T Consensus 195 ~SQpWPFP~sLMlG-f~A~~~~~ei~~d~~Ei~~A~Wf~~delp~lPp~~SIAr~LIe~ 252 (257) T PRK00241 195 GSQPWPFPRSLMLG-FHADYDSGEIVFDPKEIADAQWFTYDELPLLPPSGTIARRLIED 252 (257) T ss_pred EECCCCCCHHHHEE-EEEEECCCCCCCCHHHEEEEEEECHHHHHHCCCCCCHHHHHHHH T ss_conf 41588897411210-69995688632583330224745689842069976599999999 No 86 >cd04661 MRP_L46 Mitochondrial ribosomal protein L46 (MRP L46) is a component of the large subunit (39S) of the mammalian mitochondrial ribosome and a member of the Nudix hydrolase superfamily. MRPs are thought to be involved in the maintenance of the mitochondrial DNA. In general, members of the Nudix superfamily require a divalent cation, such as Mg2+ or Mn2+, for activity and contain the Nudix motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. MRP L46 appears to contain a modified nudix motif. Probab=92.99 E-value=0.39 Score=28.52 Aligned_cols=82 Identities=10% Similarity=0.019 Sum_probs=39.9 Q ss_pred HCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCC--EE-EEEEEEE--------------EEEEEEEEECC-----CCC Q ss_conf 04744211113467215888630101000021373--58-9983159--------------99999999807-----767 Q gi|254780479|r 264 EGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNT--IT-HTFTHFT--------------LTLFVWKTIVP-----QIV 321 (356) Q Consensus 264 ~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~--ik-H~fTH~~--------------L~i~v~~~~~~-----~~~ 321 (356) .+.|.||.+.+++++......+..........+.. +- +-+.|++ -.+.+|.+.+- ... T Consensus 25 ~~~W~fP~~~~~~~EtLr~~Aer~L~~~~G~~~~~~~vgnaP~g~~k~k~p~~~~~~~~~G~K~Ff~ka~~~~G~~~~~~ 104 (132) T cd04661 25 QNHWILPQGKREEGETLRQTAERTLKELCGNNLKAKFYGNAPVGFYKYKYPKAVRNEGIVGAKVFFFKARYMSGQFELSQ 104 (132) T ss_pred CCEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEECCCCEEEEEEECCCCCCCCCCCCEEEEEEEEEEECCCCCCCC T ss_conf 88802884246678548999999999860777367896268758898756852244466634799999999558646567 Q ss_pred CCCCCEEECHHHHHHCCCCHHHHH Q ss_conf 667813322788874799878999 Q gi|254780479|r 322 IIPDSTWHDAQNLANAALPTVMKK 345 (356) Q Consensus 322 ~~~~~~Wv~~~el~~~~LPs~~kK 345 (356) ...++.|++.+||.++--|..... T Consensus 105 ~~~df~Wltk~EL~~~l~~~y~~~ 128 (132) T cd04661 105 NQVDFKWLAKEELQKYLNPPYLQS 128 (132) T ss_pred CCCEEEECCHHHHHHHCCHHHHHH T ss_conf 752038708999987659999988 No 87 >COG1059 Thermostable 8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair] Probab=92.42 E-value=0.22 Score=30.20 Aligned_cols=90 Identities=23% Similarity=0.303 Sum_probs=61.6 Q ss_pred HHHHCCCHHHHHHHHHCC--CHHH-HHHHHHHHHHHH------HHCCCCCCCC-HHHHHH-HHHHHHHHHHHHHH-HHHC Q ss_conf 898729989999997404--6345-889998776754------3005786873-145688-88877787766776-6434 Q gi|254780479|r 72 FCLSSAKDEEILSAWAGL--GYYT-RARNLKKCADII------VKKYEGNFPH-KVEILK-KLPGIGDYTASAIV-AIAF 139 (356) Q Consensus 72 ~~la~a~~~~vl~~w~gL--Gyy~-Rar~l~~~a~~i------~~~~~g~~P~-~~~~l~-~LpGiG~yta~ai~-s~a~ 139 (356) +.+-..++||+-..-.-. -||+ ||+.+-.+-+.+ ++ ....+. ..+-|. ...|||=.-|+-.+ .++| T Consensus 66 ~gfly~~~eEL~e~Lk~~g~Rf~n~raeyIVeaR~~~~~lk~~v~--~~~~~~vaRE~Lv~nikGiGyKEASHFLRNVG~ 143 (210) T COG1059 66 DGFLYLSEEELREKLKEVGYRFYNVRAEYIVEAREKFDDLKIIVK--ADENEKVARELLVENIKGIGYKEASHFLRNVGF 143 (210) T ss_pred CCCCCCCHHHHHHHHHHHCCHHCCCCHHHHHHHHHHHHHHHHHHH--CCCCHHHHHHHHHHHCCCCCHHHHHHHHHHCCH T ss_conf 331028899999999981601224245999999998877899882--674267899999997015537999999995471 Q ss_pred CCEEEECCCCHHHHHHHHHCCCCCC Q ss_conf 9631312431466657651975457 Q gi|254780479|r 140 NHFAVVVDTNIERIISRYFDIIKPA 164 (356) Q Consensus 140 ~~~~~~vD~Nv~RVl~R~~~~~~~~ 164 (356) ...+++|-|+.|-+.|++.++..+ T Consensus 144 -~D~AIlDrHIlr~l~r~g~i~e~~ 167 (210) T COG1059 144 -EDLAILDRHILRWLVRYGLIDENP 167 (210) T ss_pred -HHHHHHHHHHHHHHHHHCCCCCCC T ss_conf -689999999999999836523474 No 88 >COG2816 NPY1 NTP pyrophosphohydrolases containing a Zn-finger, probably nucleic-acid-binding [DNA replication, recombination, and repair] Probab=92.26 E-value=1.1 Score=25.38 Aligned_cols=152 Identities=15% Similarity=0.145 Sum_probs=87.0 Q ss_pred HHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCC---------CCCCHHHHHCCCCCCCCCCCCCCCCCCEEE Q ss_conf 99874102578522678999999998632041235643110---------000013431011001234432222221123 Q gi|254780479|r 170 TIKNYARKITSTSRPGDFVQAMMDLGALICTSNKPLCPLCP---------IQKNCLTFSEGKSHLLGINTIKKKRPMRTG 240 (356) Q Consensus 170 ~l~~~~~~~~~~~~~~~~nQAlMdlGa~iC~p~~P~C~~Cp---------l~~~C~~~~~~~~~~~P~kk~KKkk~~r~~ 240 (356) .++.+... ++....+.++||.+=+ .=..+..-|..|- .+..|.. .| -..||+..|- . T Consensus 82 ~lR~l~~~-~~~~~~~~~~~a~~l~---~w~~~~RFCg~CG~~~~~~~~g~~~~C~~--cg-~~~fPR~dP~-------v 147 (279) T COG2816 82 DLRSLLTE-LDEGLFGLAARAVQLL---EWYRSHRFCGRCGTKTYPREGGWARVCPK--CG-HEHFPRIDPC-------V 147 (279) T ss_pred EHHHHHCC-CCHHHHHHHHHHHHHH---HHHHHCCCCCCCCCCCCCCCCCEEEECCC--CC-CCCCCCCCCE-------E T ss_conf 18877364-9878989999999999---88863767777898074356732455798--88-7007988976-------9 Q ss_pred EEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCEEEEEEE-----EEEEEEEEEE Q ss_conf 20003413760577645720132047442111134672158886301010000213735899831-----5999999999 Q gi|254780479|r 241 AVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTITHTFTH-----FTLTLFVWKT 315 (356) Q Consensus 241 ~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~ikH~fTH-----~~L~i~v~~~ 315 (356) .+ ++.+++++||-+++.. +.|||..-.+-++.++..+.....+..++....++.+.+.-|- ..|-|-++-. T Consensus 148 Iv--~v~~~~~ilLa~~~~h--~~g~yS~LAGFVE~GETlE~AV~REv~EE~Gi~V~~vrY~~SQPWPfP~SLMigf~ae 223 (279) T COG2816 148 IV--AVIRGDEILLARHPRH--FPGMYSLLAGFVEPGETLEQAVAREVFEEVGIKVKNVRYVGSQPWPFPHSLMLGFMAE 223 (279) T ss_pred EE--EEECCCCEEECCCCCC--CCCCEEEEEECCCCCCCHHHHHHHHHHHHHCEEEEEEEEEECCCCCCCHHHHHHHEEE T ss_conf 99--9961772000689887--7853000133045785499999999888647177656687326899953465240255 Q ss_pred EC-----CCCCCCCCCEEECHHH-HHHCCC Q ss_conf 80-----7767667813322788-874799 Q gi|254780479|r 316 IV-----PQIVIIPDSTWHDAQN-LANAAL 339 (356) Q Consensus 316 ~~-----~~~~~~~~~~Wv~~~e-l~~~~L 339 (356) .. .+..+..+.+|++.+| +..++= T Consensus 224 y~sgeI~~d~~Eleda~WFs~~evl~~L~~ 253 (279) T COG2816 224 YDSGEITPDEGELEDARWFSRDEVLPALPP 253 (279) T ss_pred ECCCCCCCCCCHHHHCCCCCHHHHHHHCCC T ss_conf 426664678221334113477777762688 No 89 >PRK13901 ruvA Holliday junction DNA helicase motor protein; Provisional Probab=91.64 E-value=0.078 Score=33.44 Aligned_cols=76 Identities=24% Similarity=0.363 Sum_probs=39.9 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHCCCE-EEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCC-------CHHHH Q ss_conf 45688888777877667766-434963-1312431466657651975457356899987410257852-------26789 Q gi|254780479|r 117 VEILKKLPGIGDYTASAIVA-IAFNHF-AVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTS-------RPGDF 187 (356) Q Consensus 117 ~~~l~~LpGiG~yta~ai~s-~a~~~~-~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~~-------~~~~~ 187 (356) ++.|++..||||.||=+|+| +.-++- .++..+++.+ |+++=|+-. +.+++.+-++.+++.... ...+. T Consensus 71 F~~LisVsGIGpk~Al~iLs~~~~~~l~~aI~~~D~~~-L~~vpGIG~--KtA~rIi~ELk~Kl~~~~~~~~~~~~~~e~ 147 (196) T PRK13901 71 FEELIGVDGIGPRAALRVLSGIKYNEFRDAIDREDIEL-ISKVKGIGN--KMAGKIFLKLRGKLVKNDELESSLFKFKEL 147 (196) T ss_pred HHHHHCCCCCCHHHHHHHHCCCCHHHHHHHHHHCCHHH-HHHCCCCCH--HHHHHHHHHHHHHHCCCCCCCCCCCCHHHH T ss_conf 99987658826899999975799999999999289999-831999589--999999999976531566556553448999 Q ss_pred HHHHHHHH Q ss_conf 99999998 Q gi|254780479|r 188 VQAMMDLG 195 (356) Q Consensus 188 nQAlMdlG 195 (356) -+||+-|| T Consensus 148 ~~AL~~LG 155 (196) T PRK13901 148 EQSIVNMG 155 (196) T ss_pred HHHHHHCC T ss_conf 99999849 No 90 >COG1555 ComEA DNA uptake protein and related DNA-binding proteins [DNA replication, recombination, and repair] Probab=91.02 E-value=0.093 Score=32.90 Aligned_cols=24 Identities=42% Similarity=0.584 Sum_probs=20.8 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 314568888877787766776643 Q gi|254780479|r 115 HKVEILKKLPGIGDYTASAIVAIA 138 (356) Q Consensus 115 ~~~~~l~~LpGiG~yta~ai~s~a 138 (356) .|.++|..|||||++.|.||...- T Consensus 94 As~eeL~~lpgIG~~kA~aIi~yR 117 (149) T COG1555 94 ASAEELQALPGIGPKKAQAIIDYR 117 (149) T ss_pred CCHHHHHHCCCCCHHHHHHHHHHH T ss_conf 089999886798999999999999 No 91 >TIGR02705 nudix_YtkD nucleoside triphosphatase YtkD; InterPro: IPR014078 The proteins in this entry belong to the nudix family and share some sequence identity with Escherichia coli MutT but appear not to be functionally interchangeable with it. The functional assignment of the proteins in this family is contentious. Reference challenges the findings of reference , both in interpretation and in enzyme assay results.. Probab=90.94 E-value=0.36 Score=28.76 Aligned_cols=78 Identities=12% Similarity=0.124 Sum_probs=51.7 Q ss_pred EEEEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCE------EEEEEE-EEEEEEE Q ss_conf 320003413760577645720132047442111134672158886301010000213735------899831-5999999 Q gi|254780479|r 240 GAVFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTI------THTFTH-FTLTLFV 312 (356) Q Consensus 240 ~~~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~i------kH~fTH-~~L~i~v 312 (356) ..|+||=+.+|+.||+.=+.-|+ |||+++.+.++....+..++..++.+...+.+ .=.-.| -.+-=+| T Consensus 26 ~HVlvip~~~~qWLlT~H~~RGl-----EFPGGKvE~GEt~~eAA~REv~EETGA~v~~l~Y~gQY~Ve~~~~~~f~K~v 100 (158) T TIGR02705 26 NHVLVIPRYKDQWLLTEHKRRGL-----EFPGGKVEEGETSKEAAIREVYEETGAIVKELHYIGQYEVEGEDDTDFVKDV 100 (158) T ss_pred CCEEEEEEECCEEEECCCCCCCC-----CCCCCEECCCCCHHHHHHEECCCCCCEEEEEEEEEEEEEEECCCCCEEEEEE T ss_conf 95689755689334403587870-----4788800799884676510010035807546888876787589996657776 Q ss_pred EEEECCCCCC Q ss_conf 9998077676 Q gi|254780479|r 313 WKTIVPQIVI 322 (356) Q Consensus 313 ~~~~~~~~~~ 322 (356) |.+.+..... T Consensus 101 Yfa~V~~le~ 110 (158) T TIGR02705 101 YFAEVSALES 110 (158) T ss_pred EEEEEEEEEE T ss_conf 6889987324 No 92 >cd03424 ADPRase_NUDT5 ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose and a variety of additional ADP-sugar conjugates to AMP and ribose-5-phosphate. Like other members of the Nudix hydrolase superfamily, it requires a divalent cation, such as Mg2+, for its activity. It also contains a highly conserved 23-residue Nudix motif (GX5EX7REUXEEXGU, where U = I, L or V) which functions as a metal binding site/catalytic site. In addition to the Nudix motif, there are additional conserved amino acid residues, distal from the signature sequence, that correlate with substrate specificity. In humans, there are four distinct ADPRase activities, three putative cytosolic enzymes (ADPRase-I, -II, and -Mn) and a single mitochondrial enzyme (ADPRase-m). Human ADPRase-II is also referred to as NUDT5. It lacks the N-terminal target sequence unique to mitochondrial ADPRase. The different cytosolic types are distinguished by their specificities for substrate and specific requirem Probab=90.49 E-value=1.1 Score=25.42 Aligned_cols=93 Identities=8% Similarity=0.012 Sum_probs=52.2 Q ss_pred EEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCC------CCCCCEECCCEEE--EEEEEEEEEEEEE Q ss_conf 0034137605776457201320474421111346721588863010------1000021373589--9831599999999 Q gi|254780479|r 243 FIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAP------FTANWILCNTITH--TFTHFTLTLFVWK 314 (356) Q Consensus 243 ~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~------~~~~~~~l~~ikH--~fTH~~L~i~v~~ 314 (356) .+.+..+++++|.+.=--++=.-+||||.+..+.+++......++. ....+..++++.- .+|.- .+++|. T Consensus 7 il~~~~~~~i~lv~QyR~~~~~~~~E~PaG~id~gE~~~~aA~REL~EETG~~~~~~~~l~~~~~~pg~~~~--~~~~~~ 84 (137) T cd03424 7 VLPYDDDGKVVLVRQYRPPVGGWLLELPAGLIDPGEDPEEAARRELEEETGYEAGDLEKLGSFYPSPGFSDE--RIHLFL 84 (137) T ss_pred EEEECCCCEEEEEEEEECCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCCCCC--EEEEEE T ss_conf 999869998999981442899838981554688997999999999741206513867893362588823371--799999 Q ss_pred EECCCCC--------CCCCCEEECHHHHHHC Q ss_conf 9807767--------6678133227888747 Q gi|254780479|r 315 TIVPQIV--------IIPDSTWHDAQNLANA 337 (356) Q Consensus 315 ~~~~~~~--------~~~~~~Wv~~~el~~~ 337 (356) +...... ..-+..|++.+++.+. T Consensus 85 a~~~~~~~~~~~d~~E~i~v~~~~~~e~~~~ 115 (137) T cd03424 85 AEDLSPGEEGLLDEGEDIEVVLVPLDEALEL 115 (137) T ss_pred EEECCCCCCCCCCCCCCEEEEEECHHHHHHH T ss_conf 9944045567888775268999669999999 No 93 >KOG3069 consensus Probab=90.31 E-value=0.77 Score=26.42 Aligned_cols=88 Identities=15% Similarity=0.152 Sum_probs=49.3 Q ss_pred CCEEEEEECCCC-HHHHCCCCCCCCCCCCCCCHHH-------HHHCCCCCCCCEECCCEEEEEEEEEEEEEEEEEECCCC Q ss_conf 760577645720-1320474421111346721588-------86301010000213735899831599999999980776 Q gi|254780479|r 249 DNRILLRKRTNT-RLLEGMDELPGSAWSSTKDGNI-------DTHSAPFTANWILCNTITHTFTHFTLTLFVWKTIVPQI 320 (356) Q Consensus 249 ~~kiLL~KRp~~-gll~GLwEFP~~e~~~~~~~~~-------~~~~~~~~~~~~~l~~ikH~fTH~~L~i~v~~~~~~~~ 320 (356) +-++||+||... .-+.|--.||++..+....... .++..........++...--++...+.+..+.+.+... T Consensus 57 ~l~vLltkRSr~LrshsGev~fPGG~~d~~D~s~~~tAlREt~EEIGl~~~~~~~~g~l~~~~~r~~~~v~p~v~~l~~~ 136 (246) T KOG3069 57 ELSVLLTKRSRTLRSHSGEVCFPGGRRDPHDKSDIQTALRETEEEIGLDPELVDVLGALPPFVLRSGWSVFPVVGFLSDK 136 (246) T ss_pred CEEEEEEECCCCCCCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHCCCEEECCCCCCCEEEEEEECC T ss_conf 56899983356333568715578876783445048889887898739998882564215440452676313068997314 Q ss_pred ----------CCCCCCEEECHHHHHH Q ss_conf ----------7667813322788874 Q gi|254780479|r 321 ----------VIIPDSTWHDAQNLAN 336 (356) Q Consensus 321 ----------~~~~~~~Wv~~~el~~ 336 (356) .+.....||+++++-. T Consensus 137 ~~l~~~~ln~gEv~~~F~VPL~~ll~ 162 (246) T KOG3069 137 KILPSLRLNSGEVESAFWVPLTDLLL 162 (246) T ss_pred CCCCCCCCCCHHEEEEEEEEHHHHHH T ss_conf 33421358802402332201899732 No 94 >TIGR00575 dnlj DNA ligase, NAD-dependent; InterPro: IPR001679 DNA ligase (polydeoxyribonucleotide synthase) is the enzyme that joins two DNA fragments by catalyzing the formation of an internucleotide ester bond between phosphate and deoxyribose. It is active during DNA replication, DNA repair and DNA recombination. There are two forms of DNA ligase: one requires ATP (6.5.1.1 from EC), the other NAD (6.5.1.2 from EC). This family is predominantly composed of NAD-dependent bacterial DNA ligases. They are proteins of about 75 to 85 Kd whose sequence is well conserved , . They also show similarity to yicF, an Escherichia coli hypothetical protein of 63 Kd.; GO: 0003911 DNA ligase (NAD+) activity, 0006260 DNA replication, 0006281 DNA repair. Probab=90.20 E-value=0.46 Score=28.00 Aligned_cols=22 Identities=27% Similarity=0.374 Sum_probs=16.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 1456888887778776677664 Q gi|254780479|r 116 KVEILKKLPGIGDYTASAIVAI 137 (356) Q Consensus 116 ~~~~l~~LpGiG~yta~ai~s~ 137 (356) +..+|++++|||+-+|.+|..+ T Consensus 562 ~~s~L~~~~g~G~~vA~~~~~~ 583 (706) T TIGR00575 562 DRSELLSVEGVGPKVAESIVNF 583 (706) T ss_pred CCHHHHHCCCCHHHHHHHHHHH T ss_conf 6055641014027899999998 No 95 >PRK00024 radC DNA repair protein RadC; Reviewed Probab=90.00 E-value=0.11 Score=32.40 Aligned_cols=70 Identities=21% Similarity=0.364 Sum_probs=49.2 Q ss_pred HHHHHHCCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCC---CCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99877489978987299899999974046345889998776754300578---687314568888877787766776643 Q gi|254780479|r 62 KKFMQKWPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEG---NFPHKVEILKKLPGIGDYTASAIVAIA 138 (356) Q Consensus 62 ~~~~~~~P~~~~la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g---~~P~~~~~l~~LpGiG~yta~ai~s~a 138 (356) +||++. -+.+|+ ..|+|.+- |||..+.++-...|+.+.++||| -+=.+.++|.+.+|||+-.|..|.++. T Consensus 14 ER~l~~--G~~~Ls---D~ELLall--L~~g~~~~d~~~lA~~ll~~~g~l~~l~~a~~~eL~~i~GiG~~kA~~l~a~~ 86 (224) T PRK00024 14 ERLLRR--GAAALS---DAELLAIL--LRTGTKGKSVLDLARELLERFGSLRGLLDASLEELQEIKGIGPAKAAQLKAAL 86 (224) T ss_pred HHHHHH--CCCCCC---HHHHHHHH--HHCCCCCCCHHHHHHHHHHHCCCHHHHHHCCHHHHHCCCCCCHHHHHHHHHHH T ss_conf 999860--932077---79999999--84699999989999999998599999987088988447898899999999999 No 96 >COG1415 Uncharacterized conserved protein [Function unknown] Probab=88.58 E-value=0.21 Score=30.37 Aligned_cols=35 Identities=29% Similarity=0.413 Sum_probs=25.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHH---HCCCEEEECC Q ss_conf 8731456888887778776677664---3496313124 Q gi|254780479|r 113 FPHKVEILKKLPGIGDYTASAIVAI---AFNHFAVVVD 147 (356) Q Consensus 113 ~P~~~~~l~~LpGiG~yta~ai~s~---a~~~~~~~vD 147 (356) =|.|+++|+..||||+.|..|.+-+ .||.|.---| T Consensus 273 ~p~Df~elLl~~GiGpstvRALalVAEvIyg~~~s~rD 310 (373) T COG1415 273 NPDDFEELLLVPGIGPSTVRALALVAEVIYGEPPSFRD 310 (373) T ss_pred CCCCHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 83249998740687889999999999998089988677 No 97 >TIGR01259 comE comEA protein; InterPro: IPR004787 The comE locus is obligatory for bacterial cell competence - the process of internalizing the exogenous added DNA. comEA and comEC are required for transformability, whereas the products of comEB and of the overlapping comER, which is transcribed in the reverse direction, are dispensable . ComEA has been shown to be an integral membrane protein, as predicted from hydropathy analysis, with its C-terminal domain outside the cytoplasmic membrane. This C-terminal domain possesses a sequence with similarity to those of several proteins known to be involved in nucleic acid transactions including UvrC and a human protein that binds to the replication origin of the Human herpesvirus 4 (Epstein-Barr virus) .. Probab=88.58 E-value=0.15 Score=31.45 Aligned_cols=48 Identities=33% Similarity=0.569 Sum_probs=19.7 Q ss_pred HCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 72998999999740463458899987767543005786873145688888777877 Q gi|254780479|r 75 SSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYT 130 (356) Q Consensus 75 a~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yt 130 (356) =.|+++| |+...|+| -+||.-+-. -.+-+|+| .+.|||.+.-|||+.| T Consensus 67 N~As~~E-L~~l~GiG-P~kA~aIi~-----YRe~nG~F-~SvddL~kVsGIG~k~ 114 (124) T TIGR01259 67 NKASLEE-LQALPGIG-PAKAKAIIE-----YREENGAF-KSVDDLTKVSGIGEKS 114 (124) T ss_pred HHHHHHH-HHHCCCCC-CHHHHHHHH-----HHHHCCCC-CCHHHHHCCCCCCHHH T ss_conf 0678999-86369998-133799999-----99856997-7755500357885466 No 98 >COG1443 Idi Isopentenyldiphosphate isomerase [Lipid metabolism] Probab=87.15 E-value=1.4 Score=24.62 Aligned_cols=99 Identities=13% Similarity=0.127 Sum_probs=49.5 Q ss_pred CEEEEEEEEEECCCEEEEEECCCC-HHHHCCCCCCCC----CCCCCCCHHHHHHCCCCCCCCEECCCEEEEEEEEEE--- Q ss_conf 112320003413760577645720-132047442111----134672158886301010000213735899831599--- Q gi|254780479|r 237 MRTGAVFIAITNDNRILLRKRTNT-RLLEGMDELPGS----AWSSTKDGNIDTHSAPFTANWILCNTITHTFTHFTL--- 308 (356) Q Consensus 237 ~r~~~~~vii~~~~kiLL~KRp~~-gll~GLwEFP~~----e~~~~~~~~~~~~~~~~~~~~~~l~~ikH~fTH~~L--- 308 (356) .+.+.+| +.+.+|++||+||... ..|.|.|.=-.. ..+..+..........+...... ...-+.+++|.- T Consensus 33 HrAFS~~-lFne~g~LLltrRA~~K~twP~vWTNSvCsHP~~~es~~~A~~rRl~~ELGie~~~-~d~~~il~rf~YrA~ 110 (185) T COG1443 33 HRAFSSF-LFNERGQLLLTRRALSKKTWPGVWTNSVCSHPLPGESNEDAARRRLAYELGIEPDQ-YDKLEILPRFRYRAA 110 (185) T ss_pred HHHHHEE-EECCCCCEEEEHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCC-CCCCCCCCCEEEECC T ss_conf 7431046-86788866310144532557663436542798688741889999889983998754-474021465178635 Q ss_pred --------EEE-EEEEECC-----CCCCCCCCEEECHHHHHHC Q ss_conf --------999-9999807-----7676678133227888747 Q gi|254780479|r 309 --------TLF-VWKTIVP-----QIVIIPDSTWHDAQNLANA 337 (356) Q Consensus 309 --------~i~-v~~~~~~-----~~~~~~~~~Wv~~~el~~~ 337 (356) +|. |+.+.+. ++.+..+++|++.+++... T Consensus 111 ~~~~~~E~Eic~V~~~~~~~~~~~npdEV~~~~wv~~e~l~~~ 153 (185) T COG1443 111 DPDGIVENEICPVLAARLDSALDPNPDEVMDYRWVSPEDLKEM 153 (185) T ss_pred CCCCCCEEEEEEEEEEEECCCCCCCHHHHHHEECCCHHHHHHH T ss_conf 8987530024138888613778799688624112689999875 No 99 >TIGR00084 ruvA Holliday junction DNA helicase RuvA; InterPro: IPR000085 In prokaryotes, RuvA, RuvB, and RuvC process the universal DNA intermediate of homologous recombination, termed Holliday junction. The tetrameric DNA helicase RuvA specifically binds to the Holliday junction and facilitates the isomerization of the junction from the stacked folded configuration to the square-planar structure . In the RuvA tetramer, each subunit consists of three domains, I, II and III, where I and II form the major core that is responsible for Holliday junction binding and base pair rearrangements of Holliday junction executed at the crossover point, whereas domain III regulates branch migration through direct contact with RuvB.; GO: 0003678 DNA helicase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=86.31 E-value=0.23 Score=30.12 Aligned_cols=23 Identities=39% Similarity=0.729 Sum_probs=15.8 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 31456888887778776677664 Q gi|254780479|r 115 HKVEILKKLPGIGDYTASAIVAI 137 (356) Q Consensus 115 ~~~~~l~~LpGiG~yta~ai~s~ 137 (356) +++..|.++||||..||.+++|. T Consensus 115 ~~~~~L~k~pGvGKK~A~~l~~l 137 (217) T TIGR00084 115 EEVKALVKIPGVGKKTAERLLAL 137 (217) T ss_pred HHHHHHHCCCCCCHHHHHHHHHH T ss_conf 10444204588573789999877 No 100 >TIGR00426 TIGR00426 competence protein ComEA helix-hairpin-helix repeat region; InterPro: IPR004509 This domain is found in competence protein ComEA and closely related proteins from a number of species that exhibit competence for transformation by exongenous DNA, including Streptococcus pneumoniae, Bacillus subtilis, Neisseria meningitidis, and Haemophilus influenzae. This domain represents a region of two tandem copies of a helix-hairpin-helix domain, each about 30 residues in length. Limited sequence similarity can be found among some members of this family N-terminal to this domain.. Probab=85.11 E-value=0.64 Score=27.00 Aligned_cols=50 Identities=40% Similarity=0.599 Sum_probs=36.6 Q ss_pred HHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHH--HCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 872998999999740463458899987767543--00578687314568888877787766 Q gi|254780479|r 74 LSSAKDEEILSAWAGLGYYTRARNLKKCADIIV--KKYEGNFPHKVEILKKLPGIGDYTAS 132 (356) Q Consensus 74 la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~--~~~~g~~P~~~~~l~~LpGiG~yta~ 132 (356) .=.|+.+|+-+...|.| .+=|+-|+ .+-.|.| .+.|+|.+.||||..|-- T Consensus 11 INtAtaeElq~~~~GvG--------~kKAeAIv~YREe~G~F-~t~Edl~~V~GiG~~~~E 62 (70) T TIGR00426 11 INTATAEELQKALSGVG--------AKKAEAIVAYREEYGRF-KTVEDLKKVSGIGEKLLE 62 (70) T ss_pred CCHHCHHHHHHHHCCCC--------HHHHHHHHHHHHCCCCC-CCHHHHHHCCCCCHHHHH T ss_conf 01104788887642887--------23789998875327795-762223214787624555 No 101 >TIGR00615 recR recombination protein RecR; InterPro: IPR000093 The bacterial protein recR seems to play a role in a recombinational process of DNA repair . It may act with recF and recO. RecR is a protein of about 200 amino acid residues. This protein contains a putative C4-type zinc finger in the N-terminal section.; GO: 0006281 DNA repair, 0006310 DNA recombination. Probab=84.86 E-value=0.3 Score=29.34 Aligned_cols=21 Identities=48% Similarity=0.733 Sum_probs=16.7 Q ss_pred CHHHHHHHHHHHHHHHHHHHH Q ss_conf 314568888877787766776 Q gi|254780479|r 115 HKVEILKKLPGIGDYTASAIV 135 (356) Q Consensus 115 ~~~~~l~~LpGiG~yta~ai~ 135 (356) +=.|.|.+|||||+.||.=++ T Consensus 9 ~Lie~L~kLPgiG~KsA~RlA 29 (205) T TIGR00615 9 KLIESLKKLPGIGPKSAQRLA 29 (205) T ss_pred HHHHHHHHCCCCCHHHHHHHH T ss_conf 999986407898714789999 No 102 >PRK13766 Hef nuclease; Provisional Probab=84.74 E-value=1.6 Score=24.27 Aligned_cols=30 Identities=20% Similarity=0.352 Sum_probs=16.0 Q ss_pred EEEEEEEEEEC----CCEEEEEECCCCHHHHCCC Q ss_conf 12320003413----7605776457201320474 Q gi|254780479|r 238 RTGAVFIAITN----DNRILLRKRTNTRLLEGMD 267 (356) Q Consensus 238 r~~~~~vii~~----~~kiLL~KRp~~gll~GLw 267 (356) +.+.+++++.+ ++.++..+|.++.+...|- T Consensus 467 r~G~v~vLi~~gt~de~~~~~~~~kek~m~~~l~ 500 (764) T PRK13766 467 RGGRVVVLIAKGTRDEAYYWSSRRKEKKMKEILK 500 (764) T ss_pred CCCEEEEEEECCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 8974999995686267788851120689999999 No 103 >PRK11762 nudE ADP-ribose diphosphatase NudE; Provisional Probab=84.35 E-value=3.6 Score=21.69 Aligned_cols=92 Identities=8% Similarity=0.024 Sum_probs=51.0 Q ss_pred EEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHC------CCCCCCCEECCCEEE--EEEEEEEEEEEEEE Q ss_conf 0341376057764572013204744211113467215888630------101000021373589--98315999999999 Q gi|254780479|r 244 IAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHS------APFTANWILCNTITH--TFTHFTLTLFVWKT 315 (356) Q Consensus 244 vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~------~~~~~~~~~l~~ikH--~fTH~~L~i~v~~~ 315 (356) +-+..+++++|.+.=-.++-.=.||||.+..+.+++....... ......+..++.+-- .||.- .+++|.+ T Consensus 53 vpi~~~~~vvLVrQyR~~~~~~~lEiPaG~id~gE~p~~aA~REL~EETGy~a~~~~~l~~~~~spg~~~e--~~~~flA 130 (185) T PRK11762 53 VPILDDDTLLLIREYAAGTESYELGFPKGLIDPGETPEEAANRELKEEVGFGARQLTFLKELSLAPSYFSS--KMNIVLA 130 (185) T ss_pred EEECCCCEEEEEECCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCCC--EEEEEEE T ss_conf 99759997999993368888879982430589996989999999889769774768997599837731153--2999998 Q ss_pred EC--CCCCCC-----CCCEEECHHHHHHC Q ss_conf 80--776766-----78133227888747 Q gi|254780479|r 316 IV--PQIVII-----PDSTWHDAQNLANA 337 (356) Q Consensus 316 ~~--~~~~~~-----~~~~Wv~~~el~~~ 337 (356) .- +..... -+..+++++++.+. T Consensus 131 ~~l~~~~~~~De~E~iev~~~~~~e~~~m 159 (185) T PRK11762 131 QDLYPERLEGDEPEPLEVVRWPLADLDEL 159 (185) T ss_pred CCCCCCCCCCCCCCCEEEEEECHHHHHHH T ss_conf 58605667999984149999769999999 No 104 >PRK13266 Thf1-like protein; Reviewed Probab=83.98 E-value=3.2 Score=22.07 Aligned_cols=127 Identities=17% Similarity=0.244 Sum_probs=79.3 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHC------C-------HHHCCHHHHHHHHHCCCHH Q ss_conf 89999999999962899887548554445678704676644673116------1-------2033265899877489978 Q gi|254780479|r 6 HIIQSKILDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQT------T-------VKTVEPYFKKFMQKWPTIF 72 (356) Q Consensus 6 ~~~~~~ll~w~~~~~R~lpwr~~~~~~~~~~~~py~v~vseimlqqT------~-------v~~v~~~~~~~~~~~P~~~ 72 (356) ..+++.=-..|..+.|..| .-|+-.|-|.|-++- . .--+..-|.+||+-||--+ T Consensus 5 ~TVsDsKr~F~~~~p~pI~-------------~lYrrvvdELLVElHLL~~n~~F~yD~lFAlGlvt~Fd~fm~GY~Pee 71 (224) T PRK13266 5 RTVSDSKRAFHAAFPRVIP-------------SLYRRVVDELLVELHLLSVQSDFKYDPLFALGLVTVFDRFMQGYRPEE 71 (224) T ss_pred CCHHHHHHHHHHHCCCCCC-------------HHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCHH T ss_conf 1148879999985898882-------------899999999999999987414664273778449999999976799825 Q ss_pred HHHCCCHHHHHH-HHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHH Q ss_conf 987299899999-9740463458899987767543005786873145688888777877667766434963131243146 Q gi|254780479|r 73 CLSSAKDEEILS-AWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIE 151 (356) Q Consensus 73 ~la~a~~~~vl~-~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~ 151 (356) +.. .+.. +.+.+|+ -+..+++-|+.+.+...|.=+.+..++++-.|-|.-...+++.-.- +|=. T Consensus 72 ~~~-----~IF~Alc~s~~~--dp~~~r~dA~~l~~~a~~~s~~~i~~~l~~~~~~~~~l~~~~~~~~--------~~~~ 136 (224) T PRK13266 72 HKD-----ALFEALCQAVGF--DPEQLREDAEQLLELAKGKSLDEILSWLTQKGGGANELLATLQAIA--------NNSK 136 (224) T ss_pred HHH-----HHHHHHHHHCCC--CHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHH--------CCCC T ss_conf 699-----999999984489--9999999999999998758999999999735455506889999874--------2787 Q ss_pred HHHHHHHCC Q ss_conf 665765197 Q gi|254780479|r 152 RIISRYFDI 160 (356) Q Consensus 152 RVl~R~~~~ 160 (356) =--||+++| T Consensus 137 f~YSRl~AI 145 (224) T PRK13266 137 FKYSRLFAI 145 (224) T ss_pred CHHHHHHHH T ss_conf 328999999 No 105 >TIGR00084 ruvA Holliday junction DNA helicase RuvA; InterPro: IPR000085 In prokaryotes, RuvA, RuvB, and RuvC process the universal DNA intermediate of homologous recombination, termed Holliday junction. The tetrameric DNA helicase RuvA specifically binds to the Holliday junction and facilitates the isomerization of the junction from the stacked folded configuration to the square-planar structure . In the RuvA tetramer, each subunit consists of three domains, I, II and III, where I and II form the major core that is responsible for Holliday junction binding and base pair rearrangements of Holliday junction executed at the crossover point, whereas domain III regulates branch migration through direct contact with RuvB.; GO: 0003678 DNA helicase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=83.80 E-value=0.48 Score=27.87 Aligned_cols=59 Identities=17% Similarity=0.272 Sum_probs=41.2 Q ss_pred HHHHHHHHHHHHHHHHHHH-HHHCCCEEEECC-CCHHHHHHHHHCCCCCCCHHHHHH-HHHHHCC Q ss_conf 4568888877787766776-643496313124-314666576519754573568999-8741025 Q gi|254780479|r 117 VEILKKLPGIGDYTASAIV-AIAFNHFAVVVD-TNIERIISRYFDIIKPAPLYHKTI-KNYARKI 178 (356) Q Consensus 117 ~~~l~~LpGiG~yta~ai~-s~a~~~~~~~vD-~Nv~RVl~R~~~~~~~~~~~~k~l-~~~~~~~ 178 (356) +.+|+.+-|||+.+|=||+ ++..+.-.-+|+ .||.+ |+.+-|+-. +.+++.+ =++..++ T Consensus 82 F~~Li~~nGvGpk~ALaiL~~~~~~~~~~ai~~~~~~~-L~k~pGvGK--K~A~~l~~leL~gk~ 143 (217) T TIGR00084 82 FKELIKVNGVGPKLALAILSNMSPEEFVQAIETEEVKA-LVKIPGVGK--KTAERLLALELKGKL 143 (217) T ss_pred HHHHHHCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHH-HHCCCCCCH--HHHHHHHHHHHHHHH T ss_conf 99985148802899999866788758988886410444-204588573--789999877754544 No 106 >TIGR00593 pola DNA polymerase I; InterPro: IPR002298 DNA carries the biological information that instructs cells how to exist in an ordered fashion. Accurate replication is thus one of the most important events in the cell life cycle. This function is mediated by DNA-directed DNA polymerases, which add nucleotide triphosphate (dNTP) residues to the 5'-end of the growing DNA chain, using a complementary DNA as template. Small RNA molecules are generally used as primers for chain elongation, although terminal proteins may also be used. DNA-dependent DNA polymerases have been grouped into families, denoted A, B and X, on the basis of sequence similarities , . Members of family A, which includes bacterial and bacteriophage polymerases, share significant similarity to Escherichia coli polymerase I; hence family A is also known as the pol I family. The bacterial polymerases also contain an exonuclease activity, which is coded for in the N-terminal portion. Three motifs, A, B and C , are seen to be conserved across all DNA polymerases, with motifs A and C also seen in RNA polymerases. They are centred on invariant residues, and their structural significance was implied from the Klenow (E. coli) structure. Motif A contains a strictly-conserved aspartate at the junction of a beta-strand and an alpha-helix; motif B contains an alpha-helix with positive charges; and motif C has a doublet of negative charges, located in a beta-turn-beta secondary structure .; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=83.52 E-value=0.46 Score=27.99 Aligned_cols=45 Identities=31% Similarity=0.561 Sum_probs=34.6 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHHHHCCCEEEECC Q ss_conf 76754300578687314568888-----------877787766776643496313124 Q gi|254780479|r 101 CADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVAIAFNHFAVVVD 147 (356) Q Consensus 101 ~a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s~a~~~~~~~vD 147 (356) ++..|+++| |..|.-.-++.+| ||||++||...+ -.||-=.-+.+ T Consensus 165 t~e~V~eKy-Gv~P~Q~~D~~AL~GD~SDNiPGV~GIGeKTA~kLL-~~fgsLe~iy~ 220 (1005) T TIGR00593 165 TPEYVVEKY-GVTPSQLVDLKALVGDSSDNIPGVKGIGEKTAAKLL-QEFGSLENIYE 220 (1005) T ss_pred CHHHHHHHC-CCCHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHH-HHHHHHHHHHH T ss_conf 778987541-867457887620468888785998973765689999-87210899998 No 107 >PRK00116 ruvA Holliday junction DNA helicase RuvA; Reviewed Probab=82.05 E-value=0.65 Score=26.92 Aligned_cols=50 Identities=26% Similarity=0.371 Sum_probs=24.9 Q ss_pred HHHHHCCCHHHHHHHHHH--HHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999740463458899987--767543005786873145688888777877667766 Q gi|254780479|r 83 LSAWAGLGYYTRARNLKK--CADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVA 136 (356) Q Consensus 83 l~~w~gLGyy~Rar~l~~--~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s 136 (356) |.-.+|.|. .-|-++.. ....+.+- ..=.|.+.|.++||||+.||.-|.. T Consensus 75 Li~V~GIGp-K~AL~ILs~~~~~~l~~a---I~~~D~~~L~~vpGIG~KtA~rIi~ 126 (198) T PRK00116 75 LISVSGVGP-KLALAILSGLSPEELAQA---IANGDIKALTKVPGVGKKTAERIVL 126 (198) T ss_pred HHCCCCCCH-HHHHHHHCCCCHHHHHHH---HHHCCHHHHCCCCCCCHHHHHHHHH T ss_conf 856688578-999988702999999999---9858999970688978899999999 No 108 >pfam11731 Cdd1 Pathogenicity locus. Cdd1 is expressed as part of the pathogenicity locus operon in several different orders of bacteria. Many members of the family are annotated as being putative mitomycin resistance proteins but this could not be confirmed. Probab=81.70 E-value=0.61 Score=27.14 Aligned_cols=42 Identities=24% Similarity=0.324 Sum_probs=33.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHH Q ss_conf 145688888777877667766434963131243146665765 Q gi|254780479|r 116 KVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRY 157 (356) Q Consensus 116 ~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~ 157 (356) ...+|..|||||+-||.-...+++..+.-....|-.-...|+ T Consensus 10 ~l~~L~~lPnIG~a~a~DL~~LGi~~~~~L~g~dp~elY~~l 51 (92) T pfam11731 10 ALKELTDLPNIGKATAKDLRLLGINSPAQLAGRDPLELYERL 51 (92) T ss_pred HHHHHHCCCCCCHHHHHHHHHHCCCCHHHHHCCCHHHHHHHH T ss_conf 999874189746999999999189989999179999999999 No 109 >PRK00254 ski2-like helicase; Provisional Probab=81.55 E-value=1.7 Score=23.99 Aligned_cols=20 Identities=10% Similarity=0.205 Sum_probs=11.7 Q ss_pred HHHHHHHHHHHCC-CC-CCCCC Q ss_conf 9999999999628-99-88754 Q gi|254780479|r 8 IQSKILDWYDTNH-RV-LPWRT 27 (356) Q Consensus 8 ~~~~ll~w~~~~~-R~-lpwr~ 27 (356) +.+++.++|+..| +. +||+. T Consensus 8 ~~~~~~~~~~~~gI~~l~p~Q~ 29 (717) T PRK00254 8 VDERIKEILKERGIEELYPPQA 29 (717) T ss_pred CCHHHHHHHHHCCCCCCCHHHH T ss_conf 9989999999769872689999 No 110 >PRK08097 ligB NAD-dependent DNA ligase LigB; Reviewed Probab=81.55 E-value=0.68 Score=26.78 Aligned_cols=20 Identities=15% Similarity=0.325 Sum_probs=9.2 Q ss_pred HHCCCHHHHHCCCHHHHHHHH Q ss_conf 748997898729989999997 Q gi|254780479|r 66 QKWPTIFCLSSAKDEEILSAW 86 (356) Q Consensus 66 ~~~P~~~~la~a~~~~vl~~w 86 (356) ...|- -||.++-.++-+..| T Consensus 95 H~~pM-lSL~n~~~~~el~~~ 114 (563) T PRK08097 95 HPVAH-TGVRKLADKQALARW 114 (563) T ss_pred CCCCC-CCHHHCCCHHHHHHH T ss_conf 89876-471111898999999 No 111 >TIGR03060 PS_II_psb29 photosystem II biogenesis protein Psp29. Psp29, originally designated sll1414 in Synechocystis 6803, is found universally in Cyanobacteria and in Arabidopsis. It was isolated and partially sequenced from purified photosystem II (PS II) in Synechocystis. While its function is unknown, mutant studies show an impairment in photosystem II biogenesis and/or stability, rather than in PS II core function. Probab=81.35 E-value=3.7 Score=21.59 Aligned_cols=126 Identities=15% Similarity=0.247 Sum_probs=77.0 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHC------C-------HHHCCHHHHHHHHHCCCHH Q ss_conf 89999999999962899887548554445678704676644673116------1-------2033265899877489978 Q gi|254780479|r 6 HIIQSKILDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQT------T-------VKTVEPYFKKFMQKWPTIF 72 (356) Q Consensus 6 ~~~~~~ll~w~~~~~R~lpwr~~~~~~~~~~~~py~v~vseimlqqT------~-------v~~v~~~~~~~~~~~P~~~ 72 (356) ..+++.=-..|..+.|..| .-|+-.|-|.|-++- . .--+..-|.+||+-||--+ T Consensus 5 ~TVsDsKr~F~~~~p~pI~-------------~lYrrvvdELLVElHLl~~n~~F~yD~lfAlGlvt~Fd~fm~GY~Pee 71 (214) T TIGR03060 5 RTVSDSKRAFHAAFPRVIP-------------PLYRRVVDELLVELHLLSHQSDFKYDPLFALGLVTVFDRFMEGYRPEE 71 (214) T ss_pred CCHHHHHHHHHHHCCCCCC-------------HHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCHH T ss_conf 1148879999985898881-------------899999999999999988503654373678459999999976799825 Q ss_pred HHHCCCHHHHHH-HHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHH Q ss_conf 987299899999-9740463458899987767543005786873145688888777877667766434963131243146 Q gi|254780479|r 73 CLSSAKDEEILS-AWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIE 151 (356) Q Consensus 73 ~la~a~~~~vl~-~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~ 151 (356) +.. .+.. +.+.+|. -+..+++-|+.+.+...|.=+.+..++++=.|-|.-...++.+++ +|-. T Consensus 72 ~~~-----~IF~Alc~s~~~--dp~~~r~dA~~l~~~a~~~s~~~i~~~l~~~~~~~~~~~~l~~~a---------~n~k 135 (214) T TIGR03060 72 HLD-----ALFDALCNSNGF--DPEQLREDAKQLLEQAKGKGLDEILSWLTQANLSNGGGDTLQGIA---------GRHK 135 (214) T ss_pred HHH-----HHHHHHHHHCCC--CHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCHHHHHHHHH---------CCCC T ss_conf 699-----999999873489--999999999999999974899999999850475541167999874---------1787 Q ss_pred HHHHHHHCC Q ss_conf 665765197 Q gi|254780479|r 152 RIISRYFDI 160 (356) Q Consensus 152 RVl~R~~~~ 160 (356) =--||+++| T Consensus 136 f~YSRl~AI 144 (214) T TIGR03060 136 FKYSRLFAI 144 (214) T ss_pred CCHHHHHHH T ss_conf 408999999 No 112 >KOG2841 consensus Probab=81.07 E-value=2.7 Score=22.59 Aligned_cols=21 Identities=14% Similarity=0.058 Sum_probs=8.5 Q ss_pred CHHHHHHCC--CCHHHHHHHHHC Q ss_conf 278887479--987899999844 Q gi|254780479|r 330 DAQNLANAA--LPTVMKKALSAG 350 (356) Q Consensus 330 ~~~el~~~~--LPs~~kKIL~al 350 (356) +..+++..+ =|+-.+++.+.+ T Consensus 225 S~~ele~~~G~G~~kak~l~~~l 247 (254) T KOG2841 225 SEGELEQCPGLGPAKAKRLHKFL 247 (254) T ss_pred CHHHHHHCCCCCHHHHHHHHHHH T ss_conf 77679867573789999999998 No 113 >TIGR01259 comE comEA protein; InterPro: IPR004787 The comE locus is obligatory for bacterial cell competence - the process of internalizing the exogenous added DNA. comEA and comEC are required for transformability, whereas the products of comEB and of the overlapping comER, which is transcribed in the reverse direction, are dispensable . ComEA has been shown to be an integral membrane protein, as predicted from hydropathy analysis, with its C-terminal domain outside the cytoplasmic membrane. This C-terminal domain possesses a sequence with similarity to those of several proteins known to be involved in nucleic acid transactions including UvrC and a human protein that binds to the replication origin of the Human herpesvirus 4 (Epstein-Barr virus) .. Probab=80.99 E-value=0.85 Score=26.13 Aligned_cols=24 Identities=42% Similarity=0.609 Sum_probs=21.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 731456888887778776677664 Q gi|254780479|r 114 PHKVEILKKLPGIGDYTASAIVAI 137 (356) Q Consensus 114 P~~~~~l~~LpGiG~yta~ai~s~ 137 (356) -.+.+||..||||||.-|.||.++ T Consensus 68 ~As~~EL~~l~GiGP~kA~aIi~Y 91 (124) T TIGR01259 68 KASLEELQALPGIGPAKAKAIIEY 91 (124) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHH T ss_conf 678999863699981337999999 No 114 >PRK00116 ruvA Holliday junction DNA helicase RuvA; Reviewed Probab=80.17 E-value=0.58 Score=27.27 Aligned_cols=58 Identities=16% Similarity=0.225 Sum_probs=37.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-HCCC-EEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHC Q ss_conf 456888887778776677664-3496-313124314666576519754573568999874102 Q gi|254780479|r 117 VEILKKLPGIGDYTASAIVAI-AFNH-FAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARK 177 (356) Q Consensus 117 ~~~l~~LpGiG~yta~ai~s~-a~~~-~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~ 177 (356) ++.|++..||||.||=+|+|. ..++ ..++..+|+.. |+++-|+-. +.+++.+-++..+ T Consensus 72 F~~Li~V~GIGpK~AL~ILs~~~~~~l~~aI~~~D~~~-L~~vpGIG~--KtA~rIi~ELk~K 131 (198) T PRK00116 72 FRLLISVSGVGPKLALAILSGLSPEELAQAIANGDIKA-LTKVPGVGK--KTAERIVLELKDK 131 (198) T ss_pred HHHHHCCCCCCHHHHHHHHCCCCHHHHHHHHHHCCHHH-HCCCCCCCH--HHHHHHHHHHHHH T ss_conf 99985668857899998870299999999998589999-706889788--9999999999988 No 115 >COG0632 RuvA Holliday junction resolvasome, DNA-binding subunit [DNA replication, recombination, and repair] Probab=80.02 E-value=0.41 Score=28.34 Aligned_cols=44 Identities=23% Similarity=0.283 Sum_probs=25.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-HCCCEEEE-CCCCHHHHHHHHHCCC Q ss_conf 456888887778776677664-34963131-2431466657651975 Q gi|254780479|r 117 VEILKKLPGIGDYTASAIVAI-AFNHFAVV-VDTNIERIISRYFDII 161 (356) Q Consensus 117 ~~~l~~LpGiG~yta~ai~s~-a~~~~~~~-vD~Nv~RVl~R~~~~~ 161 (356) +..|++..||||.||=||+|- ..++=+-+ -.+|+.. |+|+=|+- T Consensus 72 F~~LisVnGIGpK~ALaiLs~~~~~~l~~aI~~~d~~~-L~k~PGIG 117 (201) T COG0632 72 FRLLISVNGIGPKLALAILSNLDPEELAQAIANEDVKA-LSKIPGIG 117 (201) T ss_pred HHHHHCCCCCCHHHHHHHHCCCCHHHHHHHHHHCCHHH-HHCCCCCC T ss_conf 99987118805899999984899999999998328676-44189877 No 116 >PRK07456 consensus Probab=79.89 E-value=1 Score=25.52 Aligned_cols=34 Identities=32% Similarity=0.403 Sum_probs=28.0 Q ss_pred HHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH Q ss_conf 6754300578687314568888-----------8777877667766 Q gi|254780479|r 102 ADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA 136 (356) Q Consensus 102 a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s 136 (356) ...|.++| |.-|+.+-++++| ||||+.||...+. T Consensus 179 ~~~V~ek~-GV~P~qiiD~lAL~GDsSDNIPGVpGIG~KTA~kLL~ 223 (975) T PRK07456 179 EAGVKEKL-GVAPEQVVDLKALTGDSSDNIPGVKGVGPKTAINLLK 223 (975) T ss_pred HHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 99999986-9598998999997288656889999855999999999 No 117 >PRK00076 recR recombination protein RecR; Reviewed Probab=79.82 E-value=0.77 Score=26.43 Aligned_cols=24 Identities=42% Similarity=0.790 Sum_probs=16.7 Q ss_pred CCCHH----HHHHHHHHHHHHHHHHHHH Q ss_conf 87314----5688888777877667766 Q gi|254780479|r 113 FPHKV----EILKKLPGIGDYTASAIVA 136 (356) Q Consensus 113 ~P~~~----~~l~~LpGiG~yta~ai~s 136 (356) +|... +.|.+|||||+.||--++- T Consensus 2 ~p~~i~~LI~~l~kLPGIG~KsA~Rla~ 29 (197) T PRK00076 2 YPPPIEKLIEALRKLPGIGPKSAQRLAF 29 (197) T ss_pred CCHHHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 8789999999998789998899999999 No 118 >PRK06887 consensus Probab=79.39 E-value=1.1 Score=25.33 Aligned_cols=35 Identities=26% Similarity=0.398 Sum_probs=28.4 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH Q ss_conf 76754300578687314568888-----------8777877667766 Q gi|254780479|r 101 CADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA 136 (356) Q Consensus 101 ~a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s 136 (356) ....|.++| |.-|+.+-++++| ||||+.||...+. T Consensus 161 d~~~V~ek~-GV~P~qiiD~lAL~GDsSDNIPGVpGIG~KTA~kLL~ 206 (954) T PRK06887 161 DREGVIEKY-GIPPELIIDYLALMGDSSDNIPGVAGVGEKTALGLLQ 206 (954) T ss_pred CHHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCEEHHHHHHHHH T ss_conf 799998987-9798999999997188524778878541699999999 No 119 >TIGR01816 sdhA_forward succinate dehydrogenase, flavoprotein subunit; InterPro: IPR011281 Succinate dehydrogenase and fumarate reductase are homologous enzymes reversible in principle but favoured under different circumstances. This entry represents a narrowly defined clade of the succinate dehydrogenase flavoprotein subunit as found in mitochondria, in Rickettsia, in Escherichia coli and other proteobacteria, and in a few other lineages. However, excluded are all known fumarate reductases. It also excludes putative succinate dehydrogenases that appear to diverged before the split between E. coli succinate dehydrogenase and fumarate reductase.; GO: 0016627 oxidoreductase activity acting on the CH-CH group of donors, 0050660 FAD binding, 0006099 tricarboxylic acid cycle. Probab=79.01 E-value=0.43 Score=28.21 Aligned_cols=18 Identities=33% Similarity=0.571 Sum_probs=15.4 Q ss_pred HHHHHHC-CCHHHHHCCCH Q ss_conf 9987748-99789872998 Q gi|254780479|r 62 KKFMQKW-PTIFCLSSAKD 79 (356) Q Consensus 62 ~~~~~~~-P~~~~la~a~~ 79 (356) ||||+|| ||++|||.=|. T Consensus 271 ERFMERYAP~~KDLAsRDv 289 (615) T TIGR01816 271 ERFMERYAPTAKDLASRDV 289 (615) T ss_pred CCCHHCCCCCCCCCCCCCH T ss_conf 5102001675100368636 No 120 >PRK08609 hypothetical protein; Provisional Probab=78.96 E-value=3.1 Score=22.19 Aligned_cols=24 Identities=29% Similarity=0.511 Sum_probs=13.4 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 687314568888877787766776 Q gi|254780479|r 112 NFPHKVEILKKLPGIGDYTASAIV 135 (356) Q Consensus 112 ~~P~~~~~l~~LpGiG~yta~ai~ 135 (356) ++|...-+|+.+|||||.+|..+- T Consensus 82 ~~P~gl~eLl~IpGlGPKka~~L~ 105 (570) T PRK08609 82 EVPEGLLPLLKLPGLGGKKIAKLY 105 (570) T ss_pred HCCHHHHHHHCCCCCCHHHHHHHH T ss_conf 487779999778987789999999 No 121 >PRK05929 consensus Probab=78.30 E-value=1.2 Score=25.07 Aligned_cols=33 Identities=24% Similarity=0.458 Sum_probs=27.3 Q ss_pred HHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH Q ss_conf 754300578687314568888-----------8777877667766 Q gi|254780479|r 103 DIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA 136 (356) Q Consensus 103 ~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s 136 (356) ..|.++| |.-|+..-++++| ||||+.||...+. T Consensus 160 ~~V~ek~-Gv~P~qi~D~laL~GDsSDNIPGVpGIG~KTA~kLL~ 203 (870) T PRK05929 160 NEVIEQY-GVPPGQIADYLALVGDSSDNIPGVSGCGPKKAAALLK 203 (870) T ss_pred HHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 9999997-9698999999997287646899999760999999998 No 122 >PRK07625 consensus Probab=78.11 E-value=1.3 Score=24.81 Aligned_cols=35 Identities=23% Similarity=0.241 Sum_probs=28.0 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH Q ss_conf 76754300578687314568888-----------8777877667766 Q gi|254780479|r 101 CADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA 136 (356) Q Consensus 101 ~a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s 136 (356) ....|.++| |.-|+.+-++++| ||||+.||...+. T Consensus 160 d~~~V~ek~-GV~P~qviD~lAL~GDsSDNIPGVpGIG~KTA~kLL~ 205 (922) T PRK07625 160 DRDGVLAKF-GVPPERIVDYLSLIGDTVDNVPGVEKCGPKTAVKWLT 205 (922) T ss_pred CHHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 899999987-9798999999997197635789999856798999999 No 123 >PRK01172 ski2-like helicase; Provisional Probab=77.73 E-value=1.7 Score=24.08 Aligned_cols=45 Identities=11% Similarity=0.059 Sum_probs=27.5 Q ss_pred HHHHHHHHHHHCCCC-CCCCCCCCCCCCCCC--------CHHHHHHHHHHHHHC Q ss_conf 999999999962899-887548554445678--------704676644673116 Q gi|254780479|r 8 IQSKILDWYDTNHRV-LPWRTSPKTEKSSLP--------SPYKVWISEIMLQQT 52 (356) Q Consensus 8 ~~~~ll~w~~~~~R~-lpwr~~~~~~~~~~~--------~py~v~vseimlqqT 52 (356) +.+.+++||+.+|=+ .||+.-.-..-...+ .-=+.+|+|++..++ T Consensus 8 ~~~~~~~~~~~~g~~l~p~Q~ea~~~~~~gkNllvsaPTgsGKTlvAe~ai~~~ 61 (674) T PRK01172 8 YDDEFLNLFTGNDFELYDHQRMAIEQLRKGENVIVSVPTAAGKTLIAYSAIYET 61 (674) T ss_pred CCHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHH T ss_conf 997999999967998898999999999779959997899986999999999999 No 124 >PRK08076 consensus Probab=77.60 E-value=1.4 Score=24.63 Aligned_cols=34 Identities=32% Similarity=0.641 Sum_probs=27.3 Q ss_pred HHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH Q ss_conf 6754300578687314568888-----------8777877667766 Q gi|254780479|r 102 ADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA 136 (356) Q Consensus 102 a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s 136 (356) ...|.++| |.-|+..-++++| ||||+.||...+. T Consensus 162 ~~~v~ek~-Gv~P~q~~D~~aL~GDssDNIPGVpGiG~KtA~~ll~ 206 (877) T PRK08076 162 PEALFEKY-GLTPKQIIDMKGLMGDSSDNIPGVPGVGEKTAIKLLK 206 (877) T ss_pred HHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 99999997-9698999999997187646799999863799999999 No 125 >smart00278 HhH1 Helix-hairpin-helix DNA-binding motif class 1. Probab=77.29 E-value=0.54 Score=27.48 Aligned_cols=23 Identities=43% Similarity=0.573 Sum_probs=19.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 56888887778776677664349 Q gi|254780479|r 118 EILKKLPGIGDYTASAIVAIAFN 140 (356) Q Consensus 118 ~~l~~LpGiG~yta~ai~s~a~~ 140 (356) ++|+++||||+.||.+|+..... T Consensus 1 ~~L~~v~GIG~k~A~~ll~~~~~ 23 (26) T smart00278 1 EELLKVPGIGPKTAEKILEAXXX 23 (26) T ss_pred CCCCCCCCCCCHHHHHHHHHHHC T ss_conf 92101799881159999997620 No 126 >PRK07956 ligA NAD-dependent DNA ligase LigA; Validated Probab=77.07 E-value=2.4 Score=22.93 Aligned_cols=13 Identities=31% Similarity=0.319 Sum_probs=7.6 Q ss_pred HHHHHHHHCCCCC Q ss_conf 8999998448237 Q gi|254780479|r 342 VMKKALSAGGIKV 354 (356) Q Consensus 342 ~~kKIL~alg~~~ 354 (356) +..++|-|||+++ T Consensus 506 ~l~r~L~ALGI~~ 518 (668) T PRK07956 506 PLARFLYALGIRH 518 (668) T ss_pred CHHHHHHHCCCCC T ss_conf 5888998627864 No 127 >PRK13844 recombination protein RecR; Provisional Probab=77.04 E-value=1 Score=25.53 Aligned_cols=24 Identities=38% Similarity=0.641 Sum_probs=16.7 Q ss_pred CCCCHH----HHHHHHHHHHHHHHHHHH Q ss_conf 687314----568888877787766776 Q gi|254780479|r 112 NFPHKV----EILKKLPGIGDYTASAIV 135 (356) Q Consensus 112 ~~P~~~----~~l~~LpGiG~yta~ai~ 135 (356) .+|... +.|.+|||||+.||-=++ T Consensus 5 ~~~~~i~~LI~~l~kLPGIG~KsA~Rla 32 (200) T PRK13844 5 IFSPKISAVIESLRKLPTIGKKSSQRLA 32 (200) T ss_pred CCCHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 5889999999998168998788999999 No 128 >PRK07556 consensus Probab=76.87 E-value=1.3 Score=24.81 Aligned_cols=33 Identities=36% Similarity=0.596 Sum_probs=26.8 Q ss_pred HHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH Q ss_conf 754300578687314568888-----------8777877667766 Q gi|254780479|r 103 DIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA 136 (356) Q Consensus 103 ~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s 136 (356) ..|.++| |.-|+.+-++++| ||||+.||...+. T Consensus 169 ~~V~eK~-GV~P~qv~D~lAL~GDsSDNIPGVpGIG~KTA~kLL~ 212 (977) T PRK07556 169 PEVIEKF-GVPPEKVVDVQALAGDSVDNVPGIPGIGIKTAAQLIN 212 (977) T ss_pred HHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 9999986-9698999999997187646789899854899999999 No 129 >PRK12911 bifunctional preprotein translocase subunit SecD/SecF; Reviewed Probab=76.84 E-value=0.71 Score=26.67 Aligned_cols=30 Identities=27% Similarity=0.517 Sum_probs=17.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHH Q ss_conf 5688888777877667766434963131243146665765 Q gi|254780479|r 118 EILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRY 157 (356) Q Consensus 118 ~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~ 157 (356) +.-++|||| ||-|++|+ ++||+||. ++.|. T Consensus 958 nATLTLPGI----AGIVLTIG-----MAVDANVL-IFERI 987 (1405) T PRK12911 958 DAPLTLSGL----AGIVLAMG-----MAVDANVL-VFERI 987 (1405) T ss_pred CCCCCCCHH----HHHHHHHH-----HHHHHHHH-HHHHH T ss_conf 876455008----89997640-----01304479-99999 No 130 >cd00141 NT_POLXc Nucleotidyltransferase (NT) domain of family X DNA Polymerases. X family polymerases fill in short gaps during DNA repair. They are relatively inaccurate enzymes and play roles in base excision repair, in non-homologous end joining (NHEJ) which acts mainly to repair damage due to ionizing radiation, and in V(D)J recombination. This family includes eukaryotic Pol beta, Pol lambda, Pol mu, and terminal deoxyribonucleotidyl transferase (TdT). Pol beta and Pol lambda are primarily DNA template-dependent polymerases. TdT is a DNA template-independent polymerase. Pol mu has both template dependent and template independent activities. This subgroup belongs to the Pol beta-like NT superfamily. In the majority of enzymes in this superfamily, two carboxylates, Dx[D/E], together with a third more distal carboxylate, coordinate two divalent metal cations involved in a two-metal ion mechanism of nucleotide addition. These three carboxylate residues are fairly well conserved in this Probab=76.84 E-value=2.2 Score=23.20 Aligned_cols=26 Identities=27% Similarity=0.542 Sum_probs=21.4 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 86873145688888777877667766 Q gi|254780479|r 111 GNFPHKVEILKKLPGIGDYTASAIVA 136 (356) Q Consensus 111 g~~P~~~~~l~~LpGiG~yta~ai~s 136 (356) ...|....+|+++|||||.||..+-. T Consensus 78 ~~~p~~l~~l~~I~GiGpk~a~~l~~ 103 (307) T cd00141 78 EDVPPGLLLLLRVPGVGPKTARKLYE 103 (307) T ss_pred CCCCHHHHHHHCCCCCCHHHHHHHHH T ss_conf 65637899996478878899999998 No 131 >PRK08835 consensus Probab=76.60 E-value=1.5 Score=24.45 Aligned_cols=51 Identities=27% Similarity=0.400 Sum_probs=35.2 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCC Q ss_conf 76754300578687314568888-----------87778776677664349631312431466657651975 Q gi|254780479|r 101 CADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDII 161 (356) Q Consensus 101 ~a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~ 161 (356) ....|.++| |.-|+.+-++++| ||||+.||...+.= | |++.-++..+-.+. T Consensus 161 d~~~V~ek~-GV~P~qviD~lAL~GDsSDNIPGVpGIG~KTA~kLL~e-y--------GsLE~Iy~nld~Ik 222 (931) T PRK08835 161 DREGVVEKF-GIPPELIIDYLALMGDKVDNIPGVPGVGDKTATALLQG-I--------GGLDALYDNLDDIA 222 (931) T ss_pred CHHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHH-H--------CCHHHHHHHHHHCC T ss_conf 899999987-96989989999971886468899997458889999998-4--------86999997698614 No 132 >COG0353 RecR Recombinational DNA repair protein (RecF pathway) [DNA replication, recombination, and repair] Probab=76.45 E-value=0.91 Score=25.91 Aligned_cols=20 Identities=40% Similarity=0.747 Sum_probs=16.1 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 14568888877787766776 Q gi|254780479|r 116 KVEILKKLPGIGDYTASAIV 135 (356) Q Consensus 116 ~~~~l~~LpGiG~yta~ai~ 135 (356) =.+.|.+|||||+.||.-++ T Consensus 10 LI~~l~kLPGvG~KsA~R~A 29 (198) T COG0353 10 LIDALKKLPGVGPKSAQRLA 29 (198) T ss_pred HHHHHHHCCCCCHHHHHHHH T ss_conf 99999768998832799999 No 133 >PRK07997 consensus Probab=76.34 E-value=1.5 Score=24.33 Aligned_cols=35 Identities=34% Similarity=0.568 Sum_probs=28.3 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH Q ss_conf 76754300578687314568888-----------8777877667766 Q gi|254780479|r 101 CADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA 136 (356) Q Consensus 101 ~a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s 136 (356) ....|.++| |.-|+.+-++++| ||||+.||...+. T Consensus 161 ~~~~V~ek~-GV~P~qv~D~laL~GDsSDNIPGVpGIG~KTA~kLL~ 206 (928) T PRK07997 161 GPEEVVNKY-GVPPELIIDFLALMGDSSDNIPGVPGVGEKTAQALLQ 206 (928) T ss_pred CHHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 799999987-9798988999997288656889999754899999999 No 134 >PRK07898 consensus Probab=76.33 E-value=1.5 Score=24.32 Aligned_cols=35 Identities=31% Similarity=0.557 Sum_probs=28.0 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH Q ss_conf 76754300578687314568888-----------8777877667766 Q gi|254780479|r 101 CADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA 136 (356) Q Consensus 101 ~a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s 136 (356) ....|.++| |.-|+..-++++| ||||+.||...+. T Consensus 175 d~~~V~ek~-GV~P~qviD~lAL~GDsSDNIPGVpGIG~KTA~kLL~ 220 (902) T PRK07898 175 TPEAVEEKY-GLTPAQYPDFAALRGDPSDNLPGIPGVGEKTAAKWIA 220 (902) T ss_pred CHHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 899999986-9798999999998188645799999844788999999 No 135 >PRK08786 consensus Probab=76.31 E-value=1.4 Score=24.52 Aligned_cols=34 Identities=26% Similarity=0.348 Sum_probs=27.7 Q ss_pred HHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH Q ss_conf 6754300578687314568888-----------8777877667766 Q gi|254780479|r 102 ADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA 136 (356) Q Consensus 102 a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s 136 (356) ...|.++| |.-|+.+-++++| ||||+.||...+. T Consensus 157 ~~~V~eK~-GV~P~qviD~lAL~GDsSDNIPGVpGIG~KTA~kLL~ 201 (927) T PRK08786 157 DAAVIAKF-GVRPDQIVDLLALMGDTVDNVPGVEKCGPKTAAKWLA 201 (927) T ss_pred HHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 89999996-9698999999998087646789999856899999999 No 136 >PRK08928 consensus Probab=75.42 E-value=1.7 Score=24.05 Aligned_cols=51 Identities=29% Similarity=0.442 Sum_probs=35.1 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCC Q ss_conf 76754300578687314568888-----------87778776677664349631312431466657651975 Q gi|254780479|r 101 CADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDII 161 (356) Q Consensus 101 ~a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~ 161 (356) ....|.++| |.-|+.+-++++| ||||+.||...+.- | |++.-++..+-.+. T Consensus 159 ~~~~V~ek~-GV~P~qiiD~laL~GDsSDNIPGVpGIG~KTA~kLL~e-y--------GsLE~Iy~n~d~Ik 220 (861) T PRK08928 159 TEDDVVEKF-GVTSDKLLDVMALTGDRSDNIPGVPSIGPKTAAKLITQ-F--------GSLENILNSLDQIS 220 (861) T ss_pred CHHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHH-C--------CCHHHHHHHHHHCC T ss_conf 899999997-97989999999980876468899988562899999996-7--------97999999886113 No 137 >PRK05797 consensus Probab=74.97 E-value=1.6 Score=24.23 Aligned_cols=34 Identities=35% Similarity=0.598 Sum_probs=26.5 Q ss_pred HHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH Q ss_conf 6754300578687314568888-----------8777877667766 Q gi|254780479|r 102 ADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA 136 (356) Q Consensus 102 a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s 136 (356) ...|.++| |.-|+..-++++| ||||+.||...+. T Consensus 162 ~~~v~ek~-Gv~P~q~~D~~aL~GD~sDNIPGVpGIG~KTA~kLL~ 206 (869) T PRK05797 162 KNKMVEEY-GVTPTQFIDVKGLMGDKSDNIPGVPGIGEKTAFKLIK 206 (869) T ss_pred HHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 99989997-7899999999998088645789999878189999998 No 138 >PRK08434 consensus Probab=74.74 E-value=1.7 Score=23.93 Aligned_cols=49 Identities=24% Similarity=0.351 Sum_probs=33.5 Q ss_pred HHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCC Q ss_conf 754300578687314568888-----------87778776677664349631312431466657651975 Q gi|254780479|r 103 DIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDII 161 (356) Q Consensus 103 ~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~ 161 (356) ..|.++| |.-|+.+-++++| ||||+.||...+.- | |++.-++..+-.++ T Consensus 159 ~~V~eK~-GV~P~qv~D~lAL~GDsSDNIPGVpGIG~KTA~kLL~e-y--------GsLE~I~~n~d~Ik 218 (887) T PRK08434 159 AACLEKY-GVKPKQIRDFLALCGDSADNIPGVKGIGAKGAKKLLDE-F--------GSLEGIYENLELVR 218 (887) T ss_pred HHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHH-C--------CCHHHHHHHHHHCC T ss_conf 9999997-96989999999981886457899998568999999998-2--------88899999887124 No 139 >TIGR02789 nickel_nikB nickel ABC transporter, permease subunit NikB; InterPro: IPR014156 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family consists of the NikB family of nickel ABC transporter permeases. Operons that contain this protein also contain a homologous permease subunit NikC. Nickel is used in cells as part of urease or certain hydrogenases or superoxide dismutases.. Probab=74.59 E-value=0.54 Score=27.51 Aligned_cols=38 Identities=29% Similarity=0.433 Sum_probs=25.5 Q ss_pred HCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCH Q ss_conf 00578687314568888877787766776643496313124314 Q gi|254780479|r 107 KKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNI 150 (356) Q Consensus 107 ~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv 150 (356) +=-+|.+ =.|..=++||||||.-+||. |...|++-.-| T Consensus 245 ~Ll~GT~--viE~iFswPG~GRy~i~Aif----NRDyPviQcy~ 282 (315) T TIGR02789 245 ELLGGTL--VIENIFSWPGVGRYAISAIF----NRDYPVIQCYT 282 (315) T ss_pred HHHHCCE--EEEEECCCCCHHHHHHHHHH----HCCCHHHHHHH T ss_conf 7840711--45410126661045743356----23763899999 No 140 >TIGR02236 recomb_radA DNA repair and recombination protein RadA; InterPro: IPR011938 This family consists exclusively of archaeal RadA protein, a homolog of bacterial RecA, eukaryotic RAD51 (IPR011941 from INTERPRO), and archaeal RadB (IPR011939 from INTERPRO). This protein is involved in DNA repair and recombination. The member from Pyrococcus horikoshii contains an intein .; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0008094 DNA-dependent ATPase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=74.42 E-value=0.54 Score=27.51 Aligned_cols=57 Identities=21% Similarity=0.374 Sum_probs=34.9 Q ss_pred CCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH Q ss_conf 8997898729989999997404634588999877675430057868731456888887778 Q gi|254780479|r 68 WPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGD 128 (356) Q Consensus 68 ~P~~~~la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~ 128 (356) |-|++++|.|+..|+-.. +|.| =++|..+..+||...+= |.|-.-.+-|..=.-||+ T Consensus 21 y~t~~~iA~A~~~EL~~~-~gI~-E~~A~kiI~AAR~a~~~--~~F~~a~~vl~rR~~v~k 77 (333) T TIGR02236 21 YDTLEAIAVASPKELSEI-AGIG-EGTAAKIIQAARKAADL--GGFETADDVLERRKSVGK 77 (333) T ss_pred HHHHHHHHCCCHHHHHHH-CCCC-HHHHHHHHHHHHHHHCC--CCCHHHHHHHHHHHCCCE T ss_conf 788999844585795320-3787-77899999999998467--672148999998720750 No 141 >PRK13901 ruvA Holliday junction DNA helicase motor protein; Provisional Probab=74.25 E-value=1.2 Score=25.10 Aligned_cols=98 Identities=15% Similarity=0.177 Sum_probs=48.4 Q ss_pred HHHHHHHHHHHHHCCHHHCCHHH--HHHHHHCCCHHHHHCCCHHHH---HHHHHCCCHHHHHHHHHH--HHHHHHHCCCC Q ss_conf 04676644673116120332658--998774899789872998999---999740463458899987--76754300578 Q gi|254780479|r 39 PYKVWISEIMLQQTTVKTVEPYF--KKFMQKWPTIFCLSSAKDEEI---LSAWAGLGYYTRARNLKK--CADIIVKKYEG 111 (356) Q Consensus 39 py~v~vseimlqqT~v~~v~~~~--~~~~~~~P~~~~la~a~~~~v---l~~w~gLGyy~Rar~l~~--~a~~i~~~~~g 111 (356) -|.|.||.-.+.+-+...-+..| .-+=+--.++--+....|.++ |.-.+|.|.- -|-.+.. .+..+.+- T Consensus 25 GY~V~vs~~~~~~l~~~~~v~l~t~~~vrEd~~~LyGF~~~~Er~~F~~LisVsGIGpk-~Al~iLs~~~~~~l~~a--- 100 (196) T PRK13901 25 EFELLVSSFCLAELRLLEDVEILTYLHTREDELKLFGFLNSSEREVFEELIGVDGIGPR-AALRVLSGIKYNEFRDA--- 100 (196) T ss_pred EEEEEECHHHHHHCCCCCEEEEEEEEEEEECCCEEECCCCHHHHHHHHHHHCCCCCCHH-HHHHHHCCCCHHHHHHH--- T ss_conf 99999678999765899808999999995167713365988999999998765882689-99999757999999999--- Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 68731456888887778776677664349 Q gi|254780479|r 112 NFPHKVEILKKLPGIGDYTASAIVAIAFN 140 (356) Q Consensus 112 ~~P~~~~~l~~LpGiG~yta~ai~s~a~~ 140 (356) ..-.|.+.|.++||||+.||.-|..=--| T Consensus 101 I~~~D~~~L~~vpGIG~KtA~rIi~ELk~ 129 (196) T PRK13901 101 IDREDIELISKVKGIGNKMAGKIFLKLRG 129 (196) T ss_pred HHHCCHHHHHHCCCCCHHHHHHHHHHHHH T ss_conf 99289999831999589999999999976 No 142 >PRK07300 consensus Probab=74.15 E-value=1.9 Score=23.73 Aligned_cols=33 Identities=27% Similarity=0.387 Sum_probs=26.0 Q ss_pred HHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH Q ss_conf 754300578687314568888-----------8777877667766 Q gi|254780479|r 103 DIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA 136 (356) Q Consensus 103 ~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s 136 (356) ..|.++| |.-|+..-++++| ||||+.||...+. T Consensus 171 ~~v~ek~-GV~P~qiiD~lAL~GDsSDNIPGVpGIG~KTA~kLL~ 214 (880) T PRK07300 171 AYLMEKM-GLTPNQFIDLKALMGDKSDNIPGVTKIGEKTGLKLLH 214 (880) T ss_pred HHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 9999997-8898999999998088624677889853699999999 No 143 >smart00483 POLXc DNA polymerase X family. includes vertebrate polymerase beta and terminal deoxynucleotidyltransferases Probab=73.90 E-value=5.5 Score=20.40 Aligned_cols=25 Identities=16% Similarity=0.290 Sum_probs=20.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 8687314568888877787766776 Q gi|254780479|r 111 GNFPHKVEILKKLPGIGDYTASAIV 135 (356) Q Consensus 111 g~~P~~~~~l~~LpGiG~yta~ai~ 135 (356) ...|....+|+++|||||.||...- T Consensus 82 ~~~~~~l~el~~I~GvGpk~a~~l~ 106 (334) T smart00483 82 DEVYKSLKLFTNVFGVGPKTAAKWY 106 (334) T ss_pred CCCCHHHHHHHCCCCCCHHHHHHHH T ss_conf 8651689998538887789999999 No 144 >PRK02362 ski2-like helicase; Provisional Probab=71.51 E-value=3.1 Score=22.20 Aligned_cols=18 Identities=17% Similarity=0.477 Sum_probs=10.2 Q ss_pred HHHHHHHHHCC-CC-CCCCC Q ss_conf 99999999628-99-88754 Q gi|254780479|r 10 SKILDWYDTNH-RV-LPWRT 27 (356) Q Consensus 10 ~~ll~w~~~~~-R~-lpwr~ 27 (356) +.++++|+..| +. +||+. T Consensus 10 ~~~~~~~~~~gI~~Lyp~Q~ 29 (736) T PRK02362 10 DGVIDFYEGSGIEELYPPQA 29 (736) T ss_pred HHHHHHHHHCCCCCCCHHHH T ss_conf 89999999769975789999 No 145 >PRK05755 DNA polymerase I; Provisional Probab=71.16 E-value=2.5 Score=22.82 Aligned_cols=35 Identities=29% Similarity=0.570 Sum_probs=26.7 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH Q ss_conf 76754300578687314568888-----------8777877667766 Q gi|254780479|r 101 CADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA 136 (356) Q Consensus 101 ~a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s 136 (356) ....|.++| |.-|...-++++| ||||+.||...+. T Consensus 159 ~~~~v~ek~-Gv~P~q~~D~~aL~GD~sDNIPGVpGIG~KtA~kLl~ 204 (889) T PRK05755 159 DPEGVIEKY-GVTPEQIIDFLALMGDSSDNIPGVPGVGEKTAAKLLK 204 (889) T ss_pred CHHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 899999997-9698999999998088645789988626999999998 No 146 >PRK07945 hypothetical protein; Provisional Probab=70.88 E-value=0.9 Score=25.93 Aligned_cols=61 Identities=20% Similarity=0.286 Sum_probs=34.5 Q ss_pred HHHHHHHHHHHHHHCCCCCCCC--HHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCC Q ss_conf 8899987767543005786873--14568888877787766776643496313124314666576519754 Q gi|254780479|r 94 RARNLKKCADIIVKKYEGNFPH--KVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIK 162 (356) Q Consensus 94 Rar~l~~~a~~i~~~~~g~~P~--~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~ 162 (356) |.+-...+|..|..--.+.+-. ....|.+|||||+.||..|.- +++|-|---|.++-.-.. T Consensus 23 rv~a~r~aa~~~~~~~~~~~~~~~~~~~~~~l~gig~~ta~vi~~--------a~~g~~p~~l~~l~~~~~ 85 (335) T PRK07945 23 RVMAFRNAADVVEALDEAERARHGRAGSWQSLPGIGPKTAKVIAQ--------AWAGRVPDYLAELRAAAE 85 (335) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHCCCCEECCCCCHHHHHHHHH--------HHCCCCCHHHHHHHHHCC T ss_conf 899999999999837999999987449811278878058999999--------965898289999997446 No 147 >KOG2534 consensus Probab=70.85 E-value=3.7 Score=21.63 Aligned_cols=67 Identities=24% Similarity=0.286 Sum_probs=40.5 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCC-------HHHHHHHHHCCCC Q ss_conf 88999877675430057868731456888887778776677664349631312431-------4666576519754 Q gi|254780479|r 94 RARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTN-------IERIISRYFDIIK 162 (356) Q Consensus 94 Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~N-------v~RVl~R~~~~~~ 162 (356) ||+---+||..+ +.+.-.+ .+.+++.+|||||+..|.-|--|.=.-...-++.- +.-.|.++||+-. T Consensus 34 r~~~y~~Aasvl-k~~p~~I-~S~~ea~~lP~iG~kia~ki~EiletG~l~ele~v~~de~~~~lklFtnifGvG~ 107 (353) T KOG2534 34 RARAYRRAASVL-KSLPFPI-TSGEEAEKLPGIGPKIAEKIQEILETGVLRELEAVRNDERSQSLKLFTNIFGVGL 107 (353) T ss_pred HHHHHHHHHHHH-HHCCCCC-CCHHHHCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHHHHHCCCH T ss_conf 999999999999-8499976-6578855799977779999999997087366787754506789999998725577 No 148 >COG1796 POL4 DNA polymerase IV (family X) [DNA replication, recombination, and repair] Probab=70.63 E-value=2 Score=23.47 Aligned_cols=21 Identities=29% Similarity=0.474 Sum_probs=11.9 Q ss_pred EEEEECCCCHHHHCCCCCCCC Q ss_conf 577645720132047442111 Q gi|254780479|r 252 ILLRKRTNTRLLEGMDELPGS 272 (356) Q Consensus 252 iLL~KRp~~gll~GLwEFP~~ 272 (356) +++.--.+..++.-|.+||.. T Consensus 201 ~~~s~~~~~~v~~~~~~~~~~ 221 (326) T COG1796 201 ILISTSHPESVLEELLEMPNV 221 (326) T ss_pred EEECCCCCHHHHHHHHCCCCC T ss_conf 675069947899998667876 No 149 >pfam02371 Transposase_20 Transposase IS116/IS110/IS902 family. Transposases are needed for efficient transposition of the insertion sequence or transposon DNA. This family includes transposases for IS116, IS110 and IS902. This region is often found with pfam01548. The exact function of this region is uncertain. This family contains a HHH motif suggesting a DNA-binding function. Probab=70.40 E-value=1.6 Score=24.24 Aligned_cols=41 Identities=29% Similarity=0.387 Sum_probs=25.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCC Q ss_conf 568888877787766776643496313124314666576519754 Q gi|254780479|r 118 EILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIK 162 (356) Q Consensus 118 ~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~ 162 (356) +-|.++||||+-||..|++..-+... --| .+-+.-+.|+.- T Consensus 2 ~~L~sipGiG~~~a~~l~aeigd~~r---F~~-~~~~~s~~Gl~P 42 (87) T pfam02371 2 ELLLSIPGIGPITAAALLAEIGDISR---FKS-ARQLAAYAGLAP 42 (87) T ss_pred CHHHCCCCCCHHHHHHHHHHHCCHHH---CCC-HHHHHHHCCCCC T ss_conf 23426999529999999999298532---789-999999839999 No 150 >COG2003 RadC DNA repair proteins [DNA replication, recombination, and repair] Probab=69.96 E-value=2.3 Score=23.06 Aligned_cols=62 Identities=27% Similarity=0.244 Sum_probs=41.8 Q ss_pred HHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCC---CCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 78987299899999974046345889998776754300578---68731456888887778776677664 Q gi|254780479|r 71 IFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEG---NFPHKVEILKKLPGIGDYTASAIVAI 137 (356) Q Consensus 71 ~~~la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g---~~P~~~~~l~~LpGiG~yta~ai~s~ 137 (356) .++|+++..=+++ |+.=.+..+-...|+.+.++||+ -+..+.++|..+||||..-|.-+.++ T Consensus 21 ~~~Lsd~ELLail-----LrtG~~~~~~~~la~~lL~~fg~L~~l~~a~~~el~~v~GiG~aka~~l~a~ 85 (224) T COG2003 21 AEALSDAELLAIL-----LRTGTKGESVLDLAKELLQEFGSLAELLKASVEELSSVKGIGLAKAIQIKAA 85 (224) T ss_pred HHHCCHHHHHHHH-----HHCCCCCCCHHHHHHHHHHHCCCHHHHHHCCHHHHHHCCCCCHHHHHHHHHH T ss_conf 4534668999999-----9628999878999999999732588887379999951788338899999999 No 151 >TIGR01129 secD protein-export membrane protein SecD; InterPro: IPR005791 Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component. . From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins. The translocase protein subunits are encoded on the bacterial chromosome. The translocase itself comprises 7 proteins, including a chaperone protein (SecB), an ATPase (SecA), an integral membrane complex (SecCY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) . The chaperone protein SecB is a highly acidic homotetrameric protein that exists as a "dimer of dimers" in the bacterial cytoplasm. SecB maintains preproteins in an unfolded state after translation, and targets these to the peripheral membrane protein ATPase SecA for secretion . Together with SecY and SecG, SecE forms a multimeric channel through which preproteins are translocated, using both proton motive forces and ATP-driven secretion. The latter is mediated by SecA. The structure of the Escherichia coli SecYEG assembly revealed a sandwich of two membranes interacting through the extensive cytoplasmic domains . Each membrane is composed of dimers of SecYEG. The monomeric complex contains 15 transmembrane helices. This entry describes the SecD family of transport proteins. Members of this family are highly variable in length immediately after the well-conserved motif LGLGLXGG at the amino-terminal end of this model. Archaeal homologs are not included in the seed. SecD from Mycobacterium tuberculosis has a long Pro-rich insert. ; GO: 0015450 P-P-bond-hydrolysis-driven protein transmembrane transporter activity, 0006886 intracellular protein transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=69.15 E-value=1.7 Score=24.01 Aligned_cols=12 Identities=8% Similarity=-0.011 Sum_probs=5.8 Q ss_pred HHCCCEEEECCC Q ss_conf 434963131243 Q gi|254780479|r 137 IAFNHFAVVVDT 148 (356) Q Consensus 137 ~a~~~~~~~vD~ 148 (356) ++..+|+.-.+| T Consensus 118 lGv~EP~v~~~G 129 (522) T TIGR01129 118 LGVAEPVVQRQG 129 (522) T ss_pred HCCCCCEEEEEC T ss_conf 167776488724 No 152 >COG3547 Transposase and inactivated derivatives [DNA replication, recombination, and repair] Probab=67.97 E-value=5.4 Score=20.48 Aligned_cols=37 Identities=27% Similarity=0.367 Sum_probs=26.4 Q ss_pred HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 998776754300578687314568888877787766776 Q gi|254780479|r 97 NLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIV 135 (356) Q Consensus 97 ~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~ 135 (356) .+....+.+........+.+. |.++||||+-+|.+|. T Consensus 169 ~i~~l~~~~~~~~~~~~~~~~--l~~ipgig~~~a~~i~ 205 (303) T COG3547 169 EIAALDREIEARLGIGVVTDR--LASIPGIGELTAAAIA 205 (303) T ss_pred HHHHHHHHHHHHHHHHHHHHH--HHHCCCCCHHHHHHHH T ss_conf 898888999998500034568--8858672077899998 No 153 >pfam01601 Corona_S2 Coronavirus S2 glycoprotein. The coronavirus spike glycoprotein forms the characteristic 'corona' after which the group is named. The Spike glycoprotein is translated as a large polypeptide that is subsequently cleaved to S1 pfam01600 and S2. Probab=66.50 E-value=3 Score=22.23 Aligned_cols=75 Identities=27% Similarity=0.419 Sum_probs=42.0 Q ss_pred CCHHHHHH---HHHCCCH----HHHHHHHHHHHHHHHHC-CCCCCCCHHHHHHHHHHH------HHHHHHHHHHHHCCCE Q ss_conf 99899999---9740463----45889998776754300-578687314568888877------7877667766434963 Q gi|254780479|r 77 AKDEEILS---AWAGLGY----YTRARNLKKCADIIVKK-YEGNFPHKVEILKKLPGI------GDYTASAIVAIAFNHF 142 (356) Q Consensus 77 a~~~~vl~---~w~gLGy----y~Rar~l~~~a~~i~~~-~~g~~P~~~~~l~~LpGi------G~yta~ai~s~a~~~~ 142 (356) +-.||+|= .-.|||+ |.++-.=...+-.++.+ |+| ++-|||| +-|||+++.+++|+-- T Consensus 149 S~IEDLLF~KV~t~glgtvd~dYk~Ct~g~~i~DL~CaQ~yNG--------i~VLPpv~da~~~amYT~sL~g~ma~ggi 220 (609) T pfam01601 149 SAIEDLLFNKVVTSGLGTVDEDYKKCTGGGSIADLICAQYYNG--------IMVLPPVVDAEMMAMYTASLVGAMALGGI 220 (609) T ss_pred HHHHHHHHCEEEECCCCCCHHHHHHCCCCCCHHHHHHHHHCCC--------EEECCCCCCHHHHHHHHHHHHHHHHHCCC T ss_conf 3788763231696589853545875569963577500533175--------58548878878998899999998761451 Q ss_pred EE--ECCCCHHHHHHHHHCC Q ss_conf 13--1243146665765197 Q gi|254780479|r 143 AV--VVDTNIERIISRYFDI 160 (356) Q Consensus 143 ~~--~vD~Nv~RVl~R~~~~ 160 (356) .+ ++ -=-.-|-+|+-++ T Consensus 221 taaaaI-PFa~~vQ~RlN~l 239 (609) T pfam01601 221 TAAAAI-PFATQVQARLNYV 239 (609) T ss_pred HHHEEC-CHHHHHHHHHHHH T ss_conf 031017-4699899886334 No 154 >pfam03686 UPF0146 Uncharacterized protein family (UPF0146). The function of this family of proteins is unknown. Probab=66.30 E-value=3.5 Score=21.75 Aligned_cols=92 Identities=17% Similarity=0.164 Sum_probs=59.0 Q ss_pred HHHHHHHHC-CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCC Q ss_conf 776754300-5786873145688888777877667766434963131243146665765197545735689998741025 Q gi|254780479|r 100 KCADIIVKK-YEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKI 178 (356) Q Consensus 100 ~~a~~i~~~-~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~ 178 (356) +.|..|.++ +.|++=+ =|||.++.-|-..--.|-...++|-|-..|-.-+-.+..+.-.-.-.+.+-|+-+ T Consensus 3 ~~a~~ia~~~~~gkiVE--------VGIG~~~~vA~~L~~~g~dv~~tDi~~~av~~gl~~v~DDif~P~~~lY~~A~lI 74 (127) T pfam03686 3 EVAEYIARENARGKVVE--------VGIGFFLDVAKRLAERGFDVLATDINEKAVPEGLRFVVDDITNPNISIYEGADLI 74 (127) T ss_pred HHHHHHHHHCCCCCEEE--------EECCCCHHHHHHHHHCCCCEEEEECCCCCCCCCCCEEECCCCCCCHHHHCCCCEE T ss_conf 39999874187885899--------9136888999999985996899977765766788878725889898895587889 Q ss_pred CCCCCHHHHHHHHHHHHHHHC Q ss_conf 785226789999999986320 Q gi|254780479|r 179 TSTSRPGDFVQAMMDLGALIC 199 (356) Q Consensus 179 ~~~~~~~~~nQAlMdlGa~iC 199 (356) -.-.-|.+...++++++..|= T Consensus 75 YSIRPp~El~~~i~~lA~~v~ 95 (127) T pfam03686 75 YSIRPPPELQSAILDVAKAVG 95 (127) T ss_pred EECCCCHHHHHHHHHHHHHCC T ss_conf 981898788899999999819 No 155 >TIGR02150 IPP_isom_1 isopentenyl-diphosphate delta-isomerase, type 1; InterPro: IPR011876 This entry represents type 1 of two non-homologous families of the enzyme isopentenyl-diphosphate delta-isomerase (IPP isomerase). IPP is an essential building block for many compounds, including enzyme cofactors, sterols, and prenyl groups. This inzyme interconverts isopentenyl diphosphate and dimethylallyl diphosphate.; GO: 0004452 isopentenyl-diphosphate delta-isomerase activity, 0008299 isoprenoid biosynthetic process. Probab=66.01 E-value=5 Score=20.70 Aligned_cols=32 Identities=25% Similarity=0.471 Sum_probs=24.8 Q ss_pred CCEEEEEEEEEECCCEEEEEECCCCH-HHHCCCC Q ss_conf 21123200034137605776457201-3204744 Q gi|254780479|r 236 PMRTGAVFIAITNDNRILLRKRTNTR-LLEGMDE 268 (356) Q Consensus 236 ~~r~~~~~vii~~~~kiLL~KRp~~g-ll~GLwE 268 (356) ..+-+-+|| .+.++++||+||...+ .|-|+|. T Consensus 29 LHrAFS~fl-fn~~~~LL~qrRa~~K~twP~vWT 61 (190) T TIGR02150 29 LHRAFSVFL-FNDQGRLLLQRRAASKITWPGVWT 61 (190) T ss_pred EEEEEEEEE-ECCCCCEEEEEECCCCCCCCCCEE T ss_conf 231367677-718983688884011884378400 No 156 >COG0494 MutT NTP pyrophosphohydrolases including oxidative damage repair enzymes [DNA replication, recombination, and repair / General function prediction only] Probab=65.69 E-value=5.9 Score=20.17 Aligned_cols=28 Identities=29% Similarity=0.488 Sum_probs=24.2 Q ss_pred CEEEEEECCCCHHHHCCCCCCCCCCCCCCCH Q ss_conf 6057764572013204744211113467215 Q gi|254780479|r 250 NRILLRKRTNTRLLEGMDELPGSAWSSTKDG 280 (356) Q Consensus 250 ~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~ 280 (356) +++|+.+|...+ |.|+||++..+.++.. T Consensus 24 ~~vl~~~~~~~~---~~~~~PgG~ve~~e~~ 51 (161) T COG0494 24 GEVLLAQRRDDG---GLWELPGGKVEPGEEL 51 (161) T ss_pred CEEEEEEEECCC---CEEEECCCCCCCCCCC T ss_conf 799999984799---9599088855899987 No 157 >KOG3084 consensus Probab=65.56 E-value=11 Score=18.27 Aligned_cols=93 Identities=15% Similarity=0.194 Sum_probs=52.0 Q ss_pred EEEEEECCCEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCEEEEEEEEEEEEE---------- Q ss_conf 0003413760577645720132047442111134672158886301010000213735899831599999---------- Q gi|254780479|r 242 VFIAITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTITHTFTHFTLTLF---------- 311 (356) Q Consensus 242 ~~vii~~~~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~ikH~fTH~~L~i~---------- 311 (356) +.++++.+|+.+|-.| ..+.=.|||.-+-+-.+.++..+.....+..++.....+.|...-| ..|=+. T Consensus 191 Im~li~~d~~~~LL~R-~~r~~~gl~t~lAGFlEpGES~eeav~REtwEEtGi~V~~I~~~as-QPWP~~p~SLMIgc~a 268 (345) T KOG3084 191 IMLLIDHDGKHALLGR-QKRYPPGLWTCLAGFLEPGESIEEAVRRETWEETGIEVEVISYVAS-QPWPLMPQSLMIGCLA 268 (345) T ss_pred EEEEECCCCCEEEEEC-CCCCCCCHHHHHHCCCCCCCCHHHHHHHHHHHHHCCEEEEEEEEEC-CCCCCCCHHHHHHHHH T ss_conf 9999768887741001-4679996010000027755029999999889974954135766504-8989996689999999 Q ss_pred ---E-EEEECCCCCCCCCCEEECHHHHHH Q ss_conf ---9-999807767667813322788874 Q gi|254780479|r 312 ---V-WKTIVPQIVIIPDSTWHDAQNLAN 336 (356) Q Consensus 312 ---v-~~~~~~~~~~~~~~~Wv~~~el~~ 336 (356) . -.+.+++..+..+.+|++..++.+ T Consensus 269 la~~~~~I~vd~dlEleDaqwF~r~ev~~ 297 (345) T KOG3084 269 LAKLNGKISVDKDLELEDAQWFDREEVKS 297 (345) T ss_pred HHHHCCCCCCCCCHHHHHCCCCCHHHHHH T ss_conf 98617851237534544234424999999 No 158 >TIGR03374 ABALDH 1-pyrroline dehydrogenase. Members of this protein family are 1-pyrroline dehydrogenase (1.5.1.35), also called gamma-aminobutyraldehyde dehydrogenase. This enzyme can follow putrescine transaminase (EC 2.6.1.82) for a two-step conversion of putrescine to gamma-aminobutyric acid (GABA). The member from Escherichia coli is characterized as a homotetramer that binds one NADH per momomer. This enzyme belongs to the medium-chain aldehyde dehydrogenases, and is quite similar in sequence to the betaine aldehyde dehydrogenase (EC 1.2.1.8) family. Probab=64.53 E-value=9.9 Score=18.59 Aligned_cols=76 Identities=16% Similarity=0.161 Sum_probs=47.1 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHHCCC----------CC-CCCH-HHHHHHHHHHHHHHHHHHHHHHCCCEEEECCC Q ss_conf 999997404634588999877675430057----------86-8731-45688888777877667766434963131243 Q gi|254780479|r 81 EILSAWAGLGYYTRARNLKKCADIIVKKYE----------GN-FPHK-VEILKKLPGIGDYTASAIVAIAFNHFAVVVDT 148 (356) Q Consensus 81 ~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~----------g~-~P~~-~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~ 148 (356) +-...|+-+..+.|++-|+++|..+.++.+ |+ +... .+++..+-.+=+|.|++......-.+...++| T Consensus 49 ~A~~~W~~~~~~eR~~il~~~a~~l~~~~~~la~l~~~e~GK~~~~a~~~ev~~~~~~~~~~a~~~~~~~~~~~~~~~~g 128 (472) T TIGR03374 49 AAFAEWGQTTPKARAECLLKLADVIEENAQVFAELESRNCGKPLHSVFNDEIPAIVDVFRFFAGAARCLSGLAAGEYLEG 128 (472) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 99998712999999999999999999789999999999719469999999999999999999988987047757777788 Q ss_pred CHHHHHHH Q ss_conf 14666576 Q gi|254780479|r 149 NIERIISR 156 (356) Q Consensus 149 Nv~RVl~R 156 (356) +..++..+ T Consensus 129 ~~~~~~r~ 136 (472) T TIGR03374 129 HTSMIRRD 136 (472) T ss_pred CCEEEEEE T ss_conf 63069971 No 159 >cd01701 Pol_zeta Pol_zeta, a member of the Y-family of DNA polymerases. Y-family polymerases can transverse normal replication-blocking DNA lesions, such as cyclobutane pyrimidine dimers resulting from UV damage, at higher error rate. The Y-family has no 3'-5' exonuclease activity. Pol zeta has also been named Rev1; this subfamily is mainly present in eukaryotes. Probab=62.05 E-value=13 Score=17.83 Aligned_cols=88 Identities=23% Similarity=0.334 Sum_probs=39.2 Q ss_pred CCCHHHHHCCCHHHHHHHH-----HCCCHHHHHHHHHHHHHHHHHCCCCCC---CCHHHHH------HHHHHHHHHHHHH Q ss_conf 8997898729989999997-----404634588999877675430057868---7314568------8888777877667 Q gi|254780479|r 68 WPTIFCLSSAKDEEILSAW-----AGLGYYTRARNLKKCADIIVKKYEGNF---PHKVEIL------KKLPGIGDYTASA 133 (356) Q Consensus 68 ~P~~~~la~a~~~~vl~~w-----~gLGyy~Rar~l~~~a~~i~~~~~g~~---P~~~~~l------~~LpGiG~yta~a 133 (356) +-+++.+|..=.++|...- -|.| .-+-|=|.|--.. +-+|.+ |++..++ ..|||||+-|+.- T Consensus 164 ~~~~~~~a~~Ir~~I~~~tglt~SiGIa---~nK~lAKiAs~~~-KP~G~~~i~~~~~~~~L~~lpv~~i~GIG~~t~~k 239 (405) T cd01701 164 YELPEELAETIRNEIKETTGCSASVGIG---PNILLARLATRKA-KPNGQYYLSPEKVEEFLDTLPVKDLPGVGSSTKEK 239 (405) T ss_pred CCCHHHHHHHHHHHHHHHHCCCEEECCC---CCHHHHHHHHHHC-CCCCCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHH T ss_conf 6779999999999999987997564437---6499999988736-99873236878999999855211158717899999 Q ss_pred HHHHHCCCEEEECCCCHHHHHHHHHC Q ss_conf 76643496313124314666576519 Q gi|254780479|r 134 IVAIAFNHFAVVVDTNIERIISRYFD 159 (356) Q Consensus 134 i~s~a~~~~~~~vD~Nv~RVl~R~~~ 159 (356) ........-.-.......-.|.+.|| T Consensus 240 L~~~gi~t~~dl~~~~~~~~L~~~fG 265 (405) T cd01701 240 LKSLGIDTCGDLQLISTKEKLQKVFG 265 (405) T ss_pred HHHCCCCCHHHHHCCCHHHHHHHHHH T ss_conf 99849916899862787999999977 No 160 >PRK04148 hypothetical protein; Provisional Probab=61.86 E-value=8.5 Score=19.07 Aligned_cols=92 Identities=21% Similarity=0.234 Sum_probs=59.1 Q ss_pred HHHHHHHHCCC----CCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHH-HHHCCCC---CCCHHHHHH Q ss_conf 77675430057----868731456888887778776677664349631312431466657-6519754---573568999 Q gi|254780479|r 100 KCADIIVKKYE----GNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIIS-RYFDIIK---PAPLYHKTI 171 (356) Q Consensus 100 ~~a~~i~~~~~----g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~-R~~~~~~---~~~~~~k~l 171 (356) ..|..|.++|. |++=+ =|||.|+--|-..--.|-.+.++|-|-..|.. +-.|+.. +.-.-.-.+ T Consensus 3 ~iaeyI~~~y~~~~~~kIvE--------vGIGf~~~vA~~L~e~g~dv~~~Din~~aV~~a~~~Gl~~v~DDif~P~l~i 74 (135) T PRK04148 3 TIAEYIARNYPHLKNGKIAE--------LGIGFYFKVAKKLKESGFDVIVIDINKKAVEKAKKLGLNAFVDDIFNPNLEI 74 (135) T ss_pred HHHHHHHHHCCCCCCCEEEE--------EECCCCHHHHHHHHHCCCCEEEEECCHHHHHHHHHCCCCEEECCCCCCCHHH T ss_conf 69999998542112873899--------9236667899999874998899957655432212328763751588998888 Q ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 8741025785226789999999986320 Q gi|254780479|r 172 KNYARKITSTSRPGDFVQAMMDLGALIC 199 (356) Q Consensus 172 ~~~~~~~~~~~~~~~~nQAlMdlGa~iC 199 (356) .+-|.-+-.-.-|.+.-.++++++-.|= T Consensus 75 Y~~a~lIYSIRPp~Elq~~il~lakkv~ 102 (135) T PRK04148 75 YKNAKLIYSIRPPRDLQPFILELAKKLN 102 (135) T ss_pred HCCCCEEEEECCCHHHHHHHHHHHHHCC T ss_conf 5178879981898788899999999759 No 161 >pfam01367 5_3_exonuc 5'-3' exonuclease, C-terminal SAM fold. Probab=61.40 E-value=3.2 Score=22.05 Aligned_cols=34 Identities=24% Similarity=0.422 Sum_probs=21.7 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHH Q ss_conf 3145688888777877667766434963131243146665765 Q gi|254780479|r 115 HKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRY 157 (356) Q Consensus 115 ~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~ 157 (356) ++.+.+--.||||+.||..++.-- |++.-++... T Consensus 15 D~sDnIPGv~GiG~KtA~~Ll~~~---------gsle~i~~n~ 48 (100) T pfam01367 15 DSSDNIPGVPGIGEKTAAKLLKEY---------GSLENIYENL 48 (100) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHC---------CCHHHHHHHH T ss_conf 764588999998816899999981---------9899999819 No 162 >TIGR02757 TIGR02757 conserved hypothetical protein TIGR02757; InterPro: IPR014127 Members of this uncharacterised protein family are found sporadically, so far only among spirochetes, epsilon and delta proteobacteria, and Bacteroides. The function is unknown and its gene neighbourhoods show little conservation.. Probab=60.99 E-value=3.7 Score=21.65 Aligned_cols=46 Identities=26% Similarity=0.418 Sum_probs=23.7 Q ss_pred CEEEECCCCHHHHHHHHHCCCCCCCHHHHH---HHHHHHCCCCCCCH--HHH Q ss_conf 631312431466657651975457356899---98741025785226--789 Q gi|254780479|r 141 HFAVVVDTNIERIISRYFDIIKPAPLYHKT---IKNYARKITSTSRP--GDF 187 (356) Q Consensus 141 ~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~---l~~~~~~~~~~~~~--~~~ 187 (356) +=..++||||.||-|+.+|+..-....-|. +-+...++.| ++| +|| T Consensus 210 ~Li~PLDTH~~~~~s~~Lkl~~rk~~dlK~A~~iT~~L~~~~p-~DP~kYDF 260 (269) T TIGR02757 210 DLILPLDTHVFRIASKKLKLLKRKSYDLKAAIEITEALKKLNP-EDPIKYDF 260 (269) T ss_pred CCCCCCHHHHHHHHHHHHCHHHHHHCCHHHHHHHHHHHHHHCC-CCCCCCCH T ss_conf 1403335899999877623256652366899999999876178-85952441 No 163 >KOG3690 consensus Probab=60.34 E-value=14 Score=17.63 Aligned_cols=97 Identities=15% Similarity=0.169 Sum_probs=51.6 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEE----CCCCHHHHHHHHHCCCCCCCHHH Q ss_conf 58899987767543005786873145688888777877667766434963131----24314666576519754573568 Q gi|254780479|r 93 TRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVV----VDTNIERIISRYFDIIKPAPLYH 168 (356) Q Consensus 93 ~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~----vD~Nv~RVl~R~~~~~~~~~~~~ 168 (356) .+++....-+...--.|.-+.+.+ .++++--++..++.-+.+.|-.....- -|.-|+|.+-++ T Consensus 43 ~q~~~v~~q~~la~~~y~~~~~~~--~~~a~~ea~~~~~~~~r~~a~q~k~~~~~~l~D~~~rRql~~~----------- 109 (646) T KOG3690 43 AQARVVYRQEALAGWNYNTNGTEE--NLLALLEAEDKLVEFVRSLATQAKKFDVAELQDERLRRQLQKV----------- 109 (646) T ss_pred HHHHHHHHHHHHHHHHHHCCCCHH--HHHHHHHHHHHHHHHHHHHHHHHHHCCHHHCCCHHHHHHHHHH----------- T ss_conf 999999999999988875379849--9999998889999999999877754248640488899999988----------- Q ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHH-----HHHHCCCCCCC Q ss_conf 99987410257852267899999999-----86320412356 Q gi|254780479|r 169 KTIKNYARKITSTSRPGDFVQAMMDL-----GALICTSNKPL 205 (356) Q Consensus 169 k~l~~~~~~~~~~~~~~~~nQAlMdl-----Ga~iC~p~~P~ 205 (356) .++--.-++..+--++|+++-++ +++||.+.+|. T Consensus 110 ---s~lg~saL~~~d~~~~~~ll~~m~~~y~~~~vC~y~~p~ 148 (646) T KOG3690 110 ---SQLGMSALSPDDLKRYNQLLSSMSTNYNTATVCAYDQPS 148 (646) T ss_pred ---HHHCCCCCCHHHHHHHHHHHHHHHHHCCCCEEECCCCCC T ss_conf ---663320289789999999999998524776212888999 No 164 >KOG3911 consensus Probab=59.38 E-value=4.8 Score=20.84 Aligned_cols=29 Identities=21% Similarity=0.572 Sum_probs=19.4 Q ss_pred HHHHHCCCHHHHHC-CCHHHHHHHHHCCCHH Q ss_conf 98774899789872-9989999997404634 Q gi|254780479|r 63 KFMQKWPTIFCLSS-AKDEEILSAWAGLGYY 92 (356) Q Consensus 63 ~~~~~~P~~~~la~-a~~~~vl~~w~gLGyy 92 (356) +-+.+|=+.++-.+ =+.+|.|++|.|| || T Consensus 29 r~Lrkyi~ak~~k~~F~~~dflklWKGL-fY 58 (378) T KOG3911 29 RKLRKYISAKTQKEGFDQDDFLKLWKGL-FY 58 (378) T ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHH-HH T ss_conf 9999999987500478789999998766-88 No 165 >COG4649 Uncharacterized protein conserved in bacteria [Function unknown] Probab=59.20 E-value=14 Score=17.54 Aligned_cols=74 Identities=19% Similarity=0.137 Sum_probs=49.8 Q ss_pred HHHHHHHHHHHHHHHHHH------CC-----CEEEECCCCHHHHHHHH-HCCC----CCCCHHHHHHHHHHHCCCCCCCH Q ss_conf 888877787766776643------49-----63131243146665765-1975----45735689998741025785226 Q gi|254780479|r 121 KKLPGIGDYTASAIVAIA------FN-----HFAVVVDTNIERIISRY-FDII----KPAPLYHKTIKNYARKITSTSRP 184 (356) Q Consensus 121 ~~LpGiG~yta~ai~s~a------~~-----~~~~~vD~Nv~RVl~R~-~~~~----~~~~~~~k~l~~~~~~~~~~~~~ 184 (356) -..|-||+-+|.-=.++. |+ .--...|||-.|+.+|= +|+. ++-...++.+.++++.-..+.+. T Consensus 124 t~~P~~~rd~ARlraa~lLvD~gsy~dV~srvepLa~d~n~mR~sArEALglAa~kagd~a~A~~~F~qia~Da~aprni 203 (221) T COG4649 124 TSIPQIGRDLARLRAAYLLVDNGSYDDVSSRVEPLAGDGNPMRHSAREALGLAAYKAGDFAKAKSWFVQIANDAQAPRNI 203 (221) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHH T ss_conf 78760136799998789874356488899874130489970178899987688873224677999999997014696779 Q ss_pred HHHHHHHHHH Q ss_conf 7899999999 Q gi|254780479|r 185 GDFVQAMMDL 194 (356) Q Consensus 185 ~~~nQAlMdl 194 (356) +.=.|.|||+ T Consensus 204 rqRAq~mldl 213 (221) T COG4649 204 RQRAQIMLDL 213 (221) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 166 >PRK09482 xni exonuclease IX; Provisional Probab=58.37 E-value=3.3 Score=22.01 Aligned_cols=12 Identities=33% Similarity=0.675 Sum_probs=5.7 Q ss_pred HHHHHHHHHHHH Q ss_conf 877787766776 Q gi|254780479|r 124 PGIGDYTASAIV 135 (356) Q Consensus 124 pGiG~yta~ai~ 135 (356) ||||+.||..++ T Consensus 188 ~GIG~KtA~~LL 199 (256) T PRK09482 188 AGIGPKSAAELL 199 (256) T ss_pred CCCCHHHHHHHH T ss_conf 985888999999 No 167 >COG1255 Uncharacterized protein conserved in archaea [Function unknown] Probab=58.09 E-value=11 Score=18.29 Aligned_cols=99 Identities=17% Similarity=0.143 Sum_probs=58.9 Q ss_pred HHHHH-HCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCC Q ss_conf 67543-00578687314568888877787766776643496313124314666576519754573568999874102578 Q gi|254780479|r 102 ADIIV-KKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKITS 180 (356) Q Consensus 102 a~~i~-~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~ 180 (356) |..|. +...|++=+ =|||.|+.-|=..--.|-...++|-|-.++=.-+-....+...-.-.+..-++.+-. T Consensus 5 a~~iAre~~~gkVvE--------VGiG~~~~VA~~L~e~g~dv~atDI~~~~a~~g~~~v~DDitnP~~~iY~~A~lIYS 76 (129) T COG1255 5 AEYIARENARGKVVE--------VGIGFFLDVAKRLAERGFDVLATDINEKTAPEGLRFVVDDITNPNISIYEGADLIYS 76 (129) T ss_pred HHHHHHHHCCCCEEE--------ECCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCHHHHHCCCCEEE T ss_conf 999998705883799--------814117999999997488689985230137655338872477964777627001266 Q ss_pred CCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 52267899999999863204123564311000 Q gi|254780479|r 181 TSRPGDFVQAMMDLGALICTSNKPLCPLCPIQ 212 (356) Q Consensus 181 ~~~~~~~nQAlMdlGa~iC~p~~P~C~~Cpl~ 212 (356) -.-|.+..-++||+...|- -.|-.-||- T Consensus 77 iRpppEl~~~ildva~aVg----a~l~I~pL~ 104 (129) T COG1255 77 IRPPPELQSAILDVAKAVG----APLYIKPLT 104 (129) T ss_pred CCCCHHHHHHHHHHHHHHC----CCEEEEECC T ss_conf 2798899899999998609----978997168 No 168 >smart00475 53EXOc 5'-3' exonuclease. Probab=57.92 E-value=5.5 Score=20.39 Aligned_cols=33 Identities=36% Similarity=0.685 Sum_probs=21.6 Q ss_pred HHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHH Q ss_conf 754300578687314568888-----------8777877667766 Q gi|254780479|r 103 DIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIVA 136 (356) Q Consensus 103 ~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s 136 (356) ..+.++| |.-|+.+-+++.| ||||+.||..++. T Consensus 161 ~~v~e~~-gv~P~q~~d~~aL~GD~sDnIpGV~GIG~KtA~kLL~ 204 (259) T smart00475 161 ENVIEKY-GLTPEQIIDYKALMGDSSDNIPGVPGIGEKTAAKLLK 204 (259) T ss_pred HHHHHHH-CCCHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 9999884-8998998637640464535899999847899999999 No 169 >COG5071 RPN5 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones] Probab=56.91 E-value=9.4 Score=18.78 Aligned_cols=23 Identities=17% Similarity=0.202 Sum_probs=16.7 Q ss_pred CHHHHHHHHHCCCHHHHHCCCHH Q ss_conf 26589987748997898729989 Q gi|254780479|r 58 EPYFKKFMQKWPTIFCLSSAKDE 80 (356) Q Consensus 58 ~~~~~~~~~~~P~~~~la~a~~~ 80 (356) ..|-+.|-+-||.+.+||.-+.+ T Consensus 7 ~dys~~~~e~~~~~~~l~~~d~~ 29 (439) T COG5071 7 VDYSEKFAELQKSLNNLNTIDID 29 (439) T ss_pred CCHHHHHHHHHHHHCCHHHCCHH T ss_conf 56899999973421303223415 No 170 >PRK13482 DNA integrity scanning protein DisA; Provisional Probab=56.39 E-value=16 Score=17.19 Aligned_cols=23 Identities=17% Similarity=0.288 Sum_probs=10.5 Q ss_pred HHCCCEEEECCCCH----HHHHHHHHC Q ss_conf 43496313124314----666576519 Q gi|254780479|r 137 IAFNHFAVVVDTNI----ERIISRYFD 159 (356) Q Consensus 137 ~a~~~~~~~vD~Nv----~RVl~R~~~ 159 (356) ++|+.....+|..| .|+|+|+-. T Consensus 268 LGy~~~~~~lD~~v~pRGYR~L~kipr 294 (352) T PRK13482 268 LGYPGGSEALDTAVSPRGYRILAKIPR 294 (352) T ss_pred HCCCCCCCCCCCCCCCCHHHHHCCCCC T ss_conf 589987653346778725888605999 No 171 >COG1491 Predicted RNA-binding protein [Translation, ribosomal structure and biogenesis] Probab=54.37 E-value=5.9 Score=20.21 Aligned_cols=84 Identities=29% Similarity=0.318 Sum_probs=44.2 Q ss_pred CCCHHHHHHHHHCCCHH---HHHH-HHHHHHHHHHHCCCCC--------CCCH--HHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 29989999997404634---5889-9987767543005786--------8731--4568888877787766776643496 Q gi|254780479|r 76 SAKDEEILSAWAGLGYY---TRAR-NLKKCADIIVKKYEGN--------FPHK--VEILKKLPGIGDYTASAIVAIAFNH 141 (356) Q Consensus 76 ~a~~~~vl~~w~gLGyy---~Rar-~l~~~a~~i~~~~~g~--------~P~~--~~~l~~LpGiG~yta~ai~s~a~~~ 141 (356) ....+.|.+.=+++.|- .-|| +|-..-..|+.+-.++ =|-+ ...|..|||||..|.-+|+-=--.+ T Consensus 74 ~g~rdki~~v~rri~Y~DLT~~Ar~eLpyvve~iV~~~E~rFV~FFN~A~PIt~RLH~LELLpGiGkK~m~~IleERkkk 153 (202) T COG1491 74 PGKRDKIDKVRRRIRYEDLTDTARSELPYVVEKIVKENEDRFVKFFNEAEPITLRLHQLELLPGIGKKTMWAILEERKKK 153 (202) T ss_pred CCCCCHHHHHHHHCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHCC T ss_conf 88753079998622285456777755189999999861567888733567633878888753120499999999987428 Q ss_pred EEEECCCCHHHHHHHHHCCCCC Q ss_conf 3131243146665765197545 Q gi|254780479|r 142 FAVVVDTNIERIISRYFDIIKP 163 (356) Q Consensus 142 ~~~~vD~Nv~RVl~R~~~~~~~ 163 (356) |. ....-|-.|+-++..| T Consensus 154 pF----eSFeDi~~Rv~~~~~p 171 (202) T COG1491 154 PF----ESFEDIKERVKGLHDP 171 (202) T ss_pred CC----CCHHHHHHHHCCCCCH T ss_conf 88----6689999880567778 No 172 >pfam09674 DUF2400 Protein of unknown function (DUF2400). Members of this uncharacterized protein family are found sporadically, so far only among spirochetes, epsilon and delta proteobacteria, and Bacteroides. The function is unknown and its gene neighbourhoods show little conservation. Probab=50.77 E-value=11 Score=18.40 Aligned_cols=19 Identities=21% Similarity=0.508 Sum_probs=10.3 Q ss_pred EEECCCCHHHHHHHHHCCCC Q ss_conf 13124314666576519754 Q gi|254780479|r 143 AVVVDTNIERIISRYFDIIK 162 (356) Q Consensus 143 ~~~vD~Nv~RVl~R~~~~~~ 162 (356) ..++|+||.||--++ |+-. T Consensus 174 ~iPLDtH~~rvar~L-gL~~ 192 (230) T pfam09674 174 IIPLDTHTHRVARKL-GLLK 192 (230) T ss_pred EEECCHHHHHHHHHH-CCCC T ss_conf 320207089999992-8822 No 173 >pfam05597 Phasin Poly(hydroxyalcanoate) granule associated protein (phasin). Polyhydroxyalkanoates (PHAs) are storage polyesters synthesized by various bacteria as intracellular carbon and energy reserve material. PHAs are accumulated as water-insoluble inclusions within the cells. This family consists of the phasins PhaF and PhaI which act as a transcriptional regulator of PHA biosynthesis genes. PhaF has been proposed to repress expression of the phaC1 gene and the phaIF operon. Probab=50.60 E-value=19 Score=16.56 Aligned_cols=82 Identities=20% Similarity=0.387 Sum_probs=39.0 Q ss_pred HHCCCHHHHHHHH-HCCCHHHHHHH--------HHHHHHHHHHCCCCCCCCHHHHHH-----HHHHHHHHHHHHHHHHHC Q ss_conf 8729989999997-40463458899--------987767543005786873145688-----888777877667766434 Q gi|254780479|r 74 LSSAKDEEILSAW-AGLGYYTRARN--------LKKCADIIVKKYEGNFPHKVEILK-----KLPGIGDYTASAIVAIAF 139 (356) Q Consensus 74 la~a~~~~vl~~w-~gLGyy~Rar~--------l~~~a~~i~~~~~g~~P~~~~~l~-----~LpGiG~yta~ai~s~a~ 139 (356) ++.-=.+-.-+.| +|||=|+|+.. |-+-...+-.+-...+-+..+++. ..-.|+.-+ .-.+ T Consensus 11 ~~~~v~~~arqIWLAGLGA~aka~eEg~k~Fe~LVk~Ge~~e~~~~~~~ee~~~~~~~~~~~~~~~v~~~a-----~~~w 85 (132) T pfam05597 11 LSGEVKDSARQIWLAGLGAFAKAQEEGGKVFEALVKEGEELEKRTRKVAEEQVEAVRESVKSRVSEVKDKA-----EGQW 85 (132) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHH T ss_conf 87889999999999874599999999999999999978999999999999999999998888677799888-----7658 Q ss_pred CCEEEECCCCHHHHHHHHHCCC Q ss_conf 9631312431466657651975 Q gi|254780479|r 140 NHFAVVVDTNIERIISRYFDII 161 (356) Q Consensus 140 ~~~~~~vD~Nv~RVl~R~~~~~ 161 (356) ++-.-++|.-|.++|.|+ |+. T Consensus 86 dklE~~fe~rV~~aL~rL-Gip 106 (132) T pfam05597 86 DKLEQAFDERVAKALNRL-GVP 106 (132) T ss_pred HHHHHHHHHHHHHHHHHC-CCC T ss_conf 889999999999999856-999 No 174 >KOG4368 consensus Probab=49.97 E-value=20 Score=16.50 Aligned_cols=39 Identities=18% Similarity=0.215 Sum_probs=25.2 Q ss_pred HHCCCHHHHHCCCHHHHHHHHHCCCHHH--HHHHHHHHHHH Q ss_conf 7489978987299899999974046345--88999877675 Q gi|254780479|r 66 QKWPTIFCLSSAKDEEILSAWAGLGYYT--RARNLKKCADI 104 (356) Q Consensus 66 ~~~P~~~~la~a~~~~vl~~w~gLGyy~--Rar~l~~~a~~ 104 (356) ..|-.++.+.+--+..+|.+|+|-||+. -...|...|.+ T Consensus 196 ~~~~~~~e~~~~~~~~ll~~we~~~yf~ds~~~ql~~~~~~ 236 (757) T KOG4368 196 TSFLAVEEDKQQKIARLLQLWEKNGYFDDSIIQQLQSPALG 236 (757) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH T ss_conf 76641677777799999998760573138999987414654 No 175 >pfam04919 DUF655 Protein of unknown function, DUF655. This family includes several uncharacterized archaeal proteins. Probab=49.49 E-value=7.6 Score=19.40 Aligned_cols=54 Identities=26% Similarity=0.273 Sum_probs=30.0 Q ss_pred HCCCCCCCCH--HHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCC Q ss_conf 0057868731--456888887778776677664349631312431466657651975457 Q gi|254780479|r 107 KKYEGNFPHK--VEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPA 164 (356) Q Consensus 107 ~~~~g~~P~~--~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~ 164 (356) +=||-.-|-+ .-.|.-|||||..++-.|+----..|-- ...-+=.|+-++..|. T Consensus 103 ~FfN~A~pit~rlH~leLLPGIGkK~~~~ileeR~~~~Fe----SFedi~~Rv~~l~dp~ 158 (181) T pfam04919 103 KFFNEAEPITTRLHQLELLPGIGKKMMWAILEERKKKPFE----SFEDIKERVKGLHDPV 158 (181) T ss_pred HHHHCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCC----CHHHHHHHHCCCCCHH T ss_conf 7651368744878888753340389999999996559988----9999998835789999 No 176 >PRK04301 radA DNA repair and recombination protein RadA; Validated Probab=49.39 E-value=7.5 Score=19.47 Aligned_cols=40 Identities=18% Similarity=0.257 Sum_probs=30.2 Q ss_pred HHCCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHH Q ss_conf 748997898729989999997404634588999877675430 Q gi|254780479|r 66 QKWPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVK 107 (356) Q Consensus 66 ~~~P~~~~la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~ 107 (356) .-|-|++++|.++..|+.+. .|+. +..|..+.++|+.++. T Consensus 26 aG~~tv~~l~~~~~~~L~~~-~gis-~~~a~ki~~~a~~~~~ 65 (318) T PRK04301 26 AGYDTVEAIAVASPKELSEI-AGIS-ESTAAKIIEAAREALD 65 (318) T ss_pred CCCCCHHHHHCCCHHHHHHH-HCCC-HHHHHHHHHHHHHHCC T ss_conf 69954999874899999985-0999-9999999999998536 No 177 >pfam02889 Sec63 Sec63 Brl domain. This domain (also known as the Brl domain) is required for assembly of functional endoplasmic reticulum translocons. Probab=49.12 E-value=20 Score=16.41 Aligned_cols=48 Identities=17% Similarity=0.182 Sum_probs=37.8 Q ss_pred HHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9740463458899987767543005786873145688888777877667766 Q gi|254780479|r 85 AWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVA 136 (356) Q Consensus 85 ~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s 136 (356) ....-|+.+-|.+....+|.|+.. .+ .+..-|+.||++|+..+.++.. T Consensus 119 i~~~~~~~~~~~~~l~l~q~i~q~---~w-~~~~pL~Qlp~~~~~~i~~l~~ 166 (309) T pfam02889 119 IALSKGWLSPALNALELSKMIEQR---LW-PSDSPLRQLPHIPPEVIKKLEK 166 (309) T ss_pred HHHHCCCHHHHHHHHHHHHHHHHC---CC-CCCCHHHCCCCCCHHHHHHHHH T ss_conf 998689799999999999999833---59-9998545179999999999997 No 178 >TIGR03029 EpsG chain length determinant protein tyrosine kinase EpsG. The proteins in this family are homologs of the EpsG protein found in Methylobacillus strain 12S and are generally found in operons with other Eps homologs. The protein is believed to function as the protein tyrosine kinase component of the chain length regulator (along with the transmembrane component EpsF). Probab=47.82 E-value=13 Score=17.77 Aligned_cols=41 Identities=24% Similarity=0.388 Sum_probs=33.9 Q ss_pred HHHHHHHHHHHHHHHHC---CCEEEECCCCHHH-HHHHHHCCCCC Q ss_conf 88777877667766434---9631312431466-65765197545 Q gi|254780479|r 123 LPGIGDYTASAIVAIAF---NHFAVVVDTNIER-IISRYFDIIKP 163 (356) Q Consensus 123 LpGiG~yta~ai~s~a~---~~~~~~vD~Nv~R-Vl~R~~~~~~~ 163 (356) -||=|..|-+|=++++| |+++..||+|.+| .+.++|+++.. T Consensus 112 ~pGEGKS~vAaNLA~~~Aq~G~RvLLVDaDLRrP~lh~~f~l~~~ 156 (274) T TIGR03029 112 KSGEGCSYIAANLAIVFSQLGEKTLLIDANLRDPVQHRNFKLSEQ 156 (274) T ss_pred CCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHCCCCC T ss_conf 999989999999999999679919999588888447797599997 No 179 >PRK05812 secD preprotein translocase subunit SecD; Reviewed Probab=47.72 E-value=7.1 Score=19.61 Aligned_cols=20 Identities=20% Similarity=0.421 Sum_probs=9.7 Q ss_pred ECCCCHHHHHHHH--HCCCCCC Q ss_conf 1243146665765--1975457 Q gi|254780479|r 145 VVDTNIERIISRY--FDIIKPA 164 (356) Q Consensus 145 ~vD~Nv~RVl~R~--~~~~~~~ 164 (356) +++-++.=+=.|+ +|+.+|. T Consensus 144 av~qaieiIR~RVd~lGv~EP~ 165 (513) T PRK05812 144 ALEQAITIIRNRVNELGVAEPV 165 (513) T ss_pred HHHHHHHHHHHHHHHHCCCCCH T ss_conf 9999999999987754777674 No 180 >COG4277 Predicted DNA-binding protein with the Helix-hairpin-helix motif [General function prediction only] Probab=47.53 E-value=6.3 Score=19.98 Aligned_cols=45 Identities=27% Similarity=0.414 Sum_probs=23.0 Q ss_pred CCCHHHHHHHHHCCCHHHH-------------------HHHHHHHHHHHHHC--CCCC-----CCCHHHHHHH Q ss_conf 2998999999740463458-------------------89998776754300--5786-----8731456888 Q gi|254780479|r 76 SAKDEEILSAWAGLGYYTR-------------------ARNLKKCADIIVKK--YEGN-----FPHKVEILKK 122 (356) Q Consensus 76 ~a~~~~vl~~w~gLGyy~R-------------------ar~l~~~a~~i~~~--~~g~-----~P~~~~~l~~ 122 (356) .-+.+||..+ -|-||+| ...|.+.|+++.-+ |+|- +|..-.+|++ T Consensus 83 ~ftp~Eiv~l--tlnfYrRnYIeGLFLSSGvi~~~DyTmE~mi~var~LRle~~f~GYIHlK~IPgas~~li~ 153 (404) T COG4277 83 RFTPEEIVDL--TLNFYRRNYIEGLFLSSGVIKNPDYTMEEMIEVARILRLEHKFRGYIHLKIIPGASPDLIK 153 (404) T ss_pred CCCHHHHHHH--HHHHHHHHHHHHHEECCCCCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHH T ss_conf 0589999999--9989887424330002463368614799999999988320455757998756999989999 No 181 >pfam11264 ThylakoidFormat Thylakoid formation protein. THF1 is localized to the outer plastid membrane and the stroma. THF1 has a role in sugar signalling. THF1 is also thought to have a role in chloroplast and leaf development. THF1 has been shown to play a crucial role in vesicle-mediated thylakoid membrane biogenesis. Probab=47.11 E-value=22 Score=16.20 Aligned_cols=119 Identities=18% Similarity=0.235 Sum_probs=64.8 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHC------C-------HHHCCHHHHHHHHHCCCHHHHHCCCH Q ss_conf 9999962899887548554445678704676644673116------1-------20332658998774899789872998 Q gi|254780479|r 13 LDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQT------T-------VKTVEPYFKKFMQKWPTIFCLSSAKD 79 (356) Q Consensus 13 l~w~~~~~R~lpwr~~~~~~~~~~~~py~v~vseimlqqT------~-------v~~v~~~~~~~~~~~P~~~~la~a~~ 79 (356) -..|..+.|..| .-|+-.|-|.|-++- . .--|..-|.+||+-||.-++.. T Consensus 7 r~F~~~~p~pI~-------------~lYrrvvdELLVElHLL~~n~~F~yD~lFAlGlvt~Fd~fm~GY~Pee~~~---- 69 (216) T pfam11264 7 RAFHAAYPRPIP-------------SLYRRVVDELLVELHLLSHQSDFKYDPLFALGLVTVFDRFMEGYRPEEHKD---- 69 (216) T ss_pred HHHHHHCCCCCC-------------HHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHH---- T ss_conf 999985898882-------------899999999999999988603654273788459999999976799856799---- Q ss_pred HHHHH-HHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH-HHHHH-HHHHHHHHHCCCEEEECCCCHHHHHHH Q ss_conf 99999-9740463458899987767543005786873145688888-77787-766776643496313124314666576 Q gi|254780479|r 80 EEILS-AWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLP-GIGDY-TASAIVAIAFNHFAVVVDTNIERIISR 156 (356) Q Consensus 80 ~~vl~-~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~Lp-GiG~y-ta~ai~s~a~~~~~~~vD~Nv~RVl~R 156 (356) .+.. +.+.+|+ -+..+++-|+.+.+...|.=+.+..++++-. |=|+- .++.+.+++-+ -.=--|| T Consensus 70 -~IF~Alc~s~~~--dp~~~r~dA~~l~~~a~~~s~~~i~~~l~~~~~~~~~~l~~~l~~~a~~---------~~f~YSR 137 (216) T pfam11264 70 -AIFNALCSALGF--DPEQLRKDAEQLEEAAKSHSLEEIVSWLSQKGGGAPNPLKDILQRIAGN---------SKFHYSR 137 (216) T ss_pred -HHHHHHHHHCCC--CHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHCC---------CCCHHHH T ss_conf -999999985489--9999999999999998759999999999736667531689999987516---------8741899 Q ss_pred HHCC Q ss_conf 5197 Q gi|254780479|r 157 YFDI 160 (356) Q Consensus 157 ~~~~ 160 (356) +|+| T Consensus 138 l~AI 141 (216) T pfam11264 138 LFAI 141 (216) T ss_pred HHHH T ss_conf 9999 No 182 >TIGR02153 gatD_arch glutamyl-tRNA(Gln) amidotransferase, subunit D; InterPro: IPR011878 This peptide is found only in the Archaea. It is part of a heterodimer, with GatD (IPR004414 from INTERPRO), that acts as an amidotransferase on misacylated Glu-tRNA(Gln) to produce Gln-tRNA(Gln). The analogous amidotransferase found in bacteria is the GatABC system, although GatABC homologs in the Archaea appear to act instead on Asp-tRNA(Asn) .; GO: 0006450 regulation of translational fidelity. Probab=47.10 E-value=7.6 Score=19.40 Aligned_cols=91 Identities=22% Similarity=0.358 Sum_probs=52.1 Q ss_pred HCCHHH--HHHHHHCCCHHHHHCCCHHHHHH---------HHHCCCHHHH-HHHHHHHHHHHHHCCCCCCCCHHHHHHHH Q ss_conf 332658--99877489978987299899999---------9740463458-89998776754300578687314568888 Q gi|254780479|r 56 TVEPYF--KKFMQKWPTIFCLSSAKDEEILS---------AWAGLGYYTR-ARNLKKCADIIVKKYEGNFPHKVEILKKL 123 (356) Q Consensus 56 ~v~~~~--~~~~~~~P~~~~la~a~~~~vl~---------~w~gLGyy~R-ar~l~~~a~~i~~~~~g~~P~~~~~l~~L 123 (356) .|-|+| +..+..-|-+.+.|+=+-+.|+. .|.-++ .+ |+-|.+.|.=||--||-. =+ T Consensus 87 AV~P~ftAeel~~~~PEL~~iAni~a~~~~niLSENMkP~~W~~IA--~~V~~al~~G~~GVVvaHGTD-------TM-- 155 (413) T TIGR02153 87 AVKPAFTAEELARAVPELLEIANIKARAVFNILSENMKPEYWIKIA--EAVAKALEEGADGVVVAHGTD-------TM-- 155 (413) T ss_pred CCCCCCCHHHHHHHCCCHHEECCCCEEEEEHHCCCCCCHHHHHHHH--HHHHHHHCCCCCEEEEEECCC-------HH-- T ss_conf 6004539899986174002012431056000107788858999999--999998408995389941587-------46-- Q ss_pred HHHHHHHHHHHHHHHCCC--EEEECCCCHHHHHHHHHCCCCCCCHHH Q ss_conf 877787766776643496--313124314666576519754573568 Q gi|254780479|r 124 PGIGDYTASAIVAIAFNH--FAVVVDTNIERIISRYFDIIKPAPLYH 168 (356) Q Consensus 124 pGiG~yta~ai~s~a~~~--~~~~vD~Nv~RVl~R~~~~~~~~~~~~ 168 (356) .|||+|. ||+|.. |.|+|=.=-.| --|.|++.+. T Consensus 156 ----~YTAaAL-SFm~~~Gl~~PvVlVGAQR------SSDRPSSDAa 191 (413) T TIGR02153 156 ----AYTAAAL-SFMFKTGLPVPVVLVGAQR------SSDRPSSDAA 191 (413) T ss_pred ----HHHHHHH-HHHHHCCCCCCEEEECCCC------CCCCCCCHHH T ss_conf ----8999999-9999716999889985743------5888731379 No 183 >pfam05997 Nop52 Nucleolar protein,Nop52. Nop52 believed to be involved in the generation of 28S rRNA. Probab=46.97 E-value=16 Score=17.07 Aligned_cols=50 Identities=24% Similarity=0.474 Sum_probs=27.5 Q ss_pred CCHHHCCHHHHHHHHHCCCHHHHHCCCHHHHHHHHHCCCHHH-----HHHHHHHHHHHHHHC Q ss_conf 612033265899877489978987299899999974046345-----889998776754300 Q gi|254780479|r 52 TTVKTVEPYFKKFMQKWPTIFCLSSAKDEEILSAWAGLGYYT-----RARNLKKCADIIVKK 108 (356) Q Consensus 52 T~v~~v~~~~~~~~~~~P~~~~la~a~~~~vl~~w~gLGyy~-----Rar~l~~~a~~i~~~ 108 (356) ||-+++. -..+|+..-.+ ..++.|.+++|.|| ||. +...=++.|..|.+- T Consensus 16 ~Rd~al~-~L~~~L~~~~~-----~~~~~d~~KlWKGL-fy~mW~sDkpl~Q~~La~~la~L 70 (212) T pfam05997 16 TRDKALR-SLRKYLSKRSN-----LLDELDLLKLWKGL-FYCMWMSDKPLVQQELADKIAQL 70 (212) T ss_pred HHHHHHH-HHHHHHHHCCC-----CCCHHHHHHHHHHH-HHHHHHCCCHHHHHHHHHHHHHH T ss_conf 7999999-99999987678-----89999999999999-99999728869999999999999 No 184 >COG1561 Uncharacterized stress-induced protein [Function unknown] Probab=46.97 E-value=13 Score=17.86 Aligned_cols=35 Identities=14% Similarity=0.210 Sum_probs=29.8 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH Q ss_conf 88999877675430057868731456888887778 Q gi|254780479|r 94 RARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGD 128 (356) Q Consensus 94 Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~ 128 (356) -++.+..+++.|...|+-..|-+.++|+++|||=. T Consensus 84 ll~~~~~~~~~i~~~~~~~~~~~~~~lL~~~gVl~ 118 (290) T COG1561 84 LLKQLVTLLKELAAQFDLAAELSLDDLLRLPGVLA 118 (290) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCHHHHHCCCCEEE T ss_conf 99999999999998708667899889856766030 No 185 >KOG4548 consensus Probab=46.45 E-value=22 Score=16.13 Aligned_cols=90 Identities=14% Similarity=0.052 Sum_probs=40.6 Q ss_pred CCEEEEEECCCCHHHHCCCCCCCCCC-CCCCCHHHHHHCCCCCCCCEE-----CC--CEE---EEEEEE-------EEEE Q ss_conf 76057764572013204744211113-467215888630101000021-----37--358---998315-------9999 Q gi|254780479|r 249 DNRILLRKRTNTRLLEGMDELPGSAW-SSTKDGNIDTHSAPFTANWIL-----CN--TIT---HTFTHF-------TLTL 310 (356) Q Consensus 249 ~~kiLL~KRp~~gll~GLwEFP~~e~-~~~~~~~~~~~~~~~~~~~~~-----l~--~ik---H~fTH~-------~L~i 310 (356) +.-+||.+|+=+ = .+.|.||...+ ++++......+.......... .+ .+. |.+..- .-.+ T Consensus 138 r~LyLLV~~k~g-~-~s~w~fP~~~~s~~~~~lr~~ae~~Lk~~~ge~~~t~fvgnaP~g~~~~q~pr~~~~e~~~~sk~ 215 (263) T KOG4548 138 RKLYLLVKRKFG-K-SSVWIFPNRQFSSSEKTLRGHAERDLKVLSGENKSTWFVGNAPFGHTPLQSPREMTTEEPVSSKV 215 (263) T ss_pred CEEEEEEEECCC-C-CCEEECCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHEECCCCCCCCCCCCCCCCCCCCCCCCEE T ss_conf 017887763267-6-55205778654776522888999999987233023310205745656666841234212445315 Q ss_pred EEEEEEC-----CCCCCCCCCEEECHHHHHHCCCC Q ss_conf 9999980-----77676678133227888747998 Q gi|254780479|r 311 FVWKTIV-----PQIVIIPDSTWHDAQNLANAALP 340 (356) Q Consensus 311 ~v~~~~~-----~~~~~~~~~~Wv~~~el~~~~LP 340 (356) .++.+.+ .+..+..++.|++.+++.++--+ T Consensus 216 ff~k~~lv~~~~~kn~n~edfvWvTkdel~e~l~~ 250 (263) T KOG4548 216 FFFKASLVANSNQKNQNKEDFVWVTKDELGEKLPK 250 (263) T ss_pred EEEEEEECCCCCHHCCCCCCEEEECHHHHHHHCCH T ss_conf 77653201564122045341488655887535456 No 186 >pfam04669 DUF579 Protein of unknown function (DUF579). This is a eukaryotic family of uncharacterized proteins. Probab=46.38 E-value=22 Score=16.12 Aligned_cols=98 Identities=22% Similarity=0.338 Sum_probs=53.8 Q ss_pred HHHHHHHHCCCHH------HHHCCCHHHHHHHHHC--------CCHHHHHH--------HHHHHH--HHHHHCCCCCCCC Q ss_conf 5899877489978------9872998999999740--------46345889--------998776--7543005786873 Q gi|254780479|r 60 YFKKFMQKWPTIF------CLSSAKDEEILSAWAG--------LGYYTRAR--------NLKKCA--DIIVKKYEGNFPH 115 (356) Q Consensus 60 ~~~~~~~~~P~~~------~la~a~~~~vl~~w~g--------LGyy~Rar--------~l~~~a--~~i~~~~~g~~P~ 115 (356) +|..|-++||+++ +-...+.++++..+.+ |. +|.++ -+++.. .++++.+.|.+|+ T Consensus 49 ~~~~~~~~~P~l~~y~v~Y~t~~~~a~~L~~~~~~~~c~~~~~~~-~s~c~Lal~~lP~~vy~~~WDvi~VD~P~G~~~~ 127 (187) T pfam04669 49 IYEKFRKDFPELEAYHVVYRTKVSDADELKSAAKDPECRPVQNLL-FSDCKLALNDLPNEVYKEKWDRIFVDAPEGYVED 127 (187) T ss_pred HHHHHHHHCCCCEEEEEEEECHHCCHHHHHHHHCCCCCCCCCCCC-CCCCCHHHCCCCHHHHCCCCCEEEEECCCCCCCC T ss_conf 999887409875378988750230489998752277676665545-5756231125988995570668998089998989 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH----HCCCEEEE--CCCCHHHHHHHHHCCCCC Q ss_conf 1456888887778776677664----34963131--243146665765197545 Q gi|254780479|r 116 KVEILKKLPGIGDYTASAIVAI----AFNHFAVV--VDTNIERIISRYFDIIKP 163 (356) Q Consensus 116 ~~~~l~~LpGiG~yta~ai~s~----a~~~~~~~--vD~Nv~RVl~R~~~~~~~ 163 (356) ...-.-++ ||||.++=- ++...+.+ ||--|+|.+|+-|--++. T Consensus 128 ~PGRM~aI-----ytAg~laR~r~~~~~t~~VfVHdv~R~vE~~~s~eFLC~e~ 176 (187) T pfam04669 128 APGRMGAI-----YTAGTLARSRASGEYTTTVFVHDVDRRVEDVFSKEFLCIEI 176 (187) T ss_pred CCCHHHHH-----HHHHHHHHCCCCCCCCCEEEEECCCCHHHHHHHHHHCCHHH T ss_conf 98504889-----99999975337899875089836884899998898743477 No 187 >pfam04904 NCD1 NAB conserved region 1 (NCD1). Nab1 and Nab2 are co-repressors that specifically interact with and repress transcription mediated by the three members of the NGFI-A (Egr-1, Krox24, zif/268) family of transcription factors. This region consists of the N-terminal NAB conserved region 1, which interacts with the EGR1 inhibitory domain (R1). It may also mediate multimerisation. Probab=45.12 E-value=23 Score=15.99 Aligned_cols=56 Identities=23% Similarity=0.319 Sum_probs=42.8 Q ss_pred CHHHCCHHHHHHHHHC-CCHHHHHCCCHHHHHHHHHCCCHHH---HHHHHHHHHHHHHHC Q ss_conf 1203326589987748-9978987299899999974046345---889998776754300 Q gi|254780479|r 53 TVKTVEPYFKKFMQKW-PTIFCLSSAKDEEILSAWAGLGYYT---RARNLKKCADIIVKK 108 (356) Q Consensus 53 ~v~~v~~~~~~~~~~~-P~~~~la~a~~~~vl~~w~gLGyy~---Rar~l~~~a~~i~~~ 108 (356) |-+..+.||.-|++.= -+++.|.+|.+||.+...+=.|--+ -.|+|+|+-+.-+.+ T Consensus 17 qrAnLL~Yyd~fi~~GGDDvqQlc~a~eeEFleim~lVGM~sKPLHVrRlqKaL~ew~~~ 76 (82) T pfam04904 17 QRANLLQYYDTFIAQGGDDVQQLCEAGEEEFLEIMALVGMASKPLHVRRLQKALQEWVTN 76 (82) T ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCC T ss_conf 980079999999997676799999973899999999966577652899999999999638 No 188 >KOG0135 consensus Probab=44.79 E-value=24 Score=15.96 Aligned_cols=99 Identities=15% Similarity=0.173 Sum_probs=37.7 Q ss_pred HHHHHHHCCCHHHHHCC--CHHHHHHHHHCC-CHHH-HHHHHHHHHHHHHHCCCCCCCCHHHHHHHH---HH---HHHHH Q ss_conf 89987748997898729--989999997404-6345-889998776754300578687314568888---87---77877 Q gi|254780479|r 61 FKKFMQKWPTIFCLSSA--KDEEILSAWAGL-GYYT-RARNLKKCADIIVKKYEGNFPHKVEILKKL---PG---IGDYT 130 (356) Q Consensus 61 ~~~~~~~~P~~~~la~a--~~~~vl~~w~gL-Gyy~-Rar~l~~~a~~i~~~~~g~~P~~~~~l~~L---pG---iG~yt 130 (356) +..|+..+|+.-+..-+ ...+.+..|+=+ -|.. |.+.|.++.........+.. .+++..-.+ -+ -=.|| T Consensus 477 ~~~~ln~~~s~~~~~~~s~~~~d~l~~~~f~~~~~~l~~~lLl~t~a~~~~~l~~g~-~~fea~n~~q~h~~~~La~a~~ 555 (661) T KOG0135 477 FVGFLNEYPSSPDPVLASQLVSDTLRSSQFQLCYFWLRCHLLLETVASRERQLQSGL-SSFEARNKCQVHQLRELALAFT 555 (661) T ss_pred HHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-CCHHHHHHHHHHHHHHHHHHHH T ss_conf 777776431467743222222350107999889999999999999999999984255-2089999998850058899999 Q ss_pred HHHHHHHHCCCEEEECCCCHHHHHHHHHCC Q ss_conf 667766434963131243146665765197 Q gi|254780479|r 131 ASAIVAIAFNHFAVVVDTNIERIISRYFDI 160 (356) Q Consensus 131 a~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~ 160 (356) --+|++--.+.-.-++|+-|+-||.++.+. T Consensus 556 E~~il~~~l~h~~k~~~~s~k~vL~l~~dl 585 (661) T KOG0135 556 EHTILQRFLDHVHKLPDPSVKAVLGLVRDL 585 (661) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 999999999998628983089999998879 No 189 >COG0272 Lig NAD-dependent DNA ligase (contains BRCT domain type II) [DNA replication, recombination, and repair] Probab=44.52 E-value=17 Score=16.97 Aligned_cols=16 Identities=44% Similarity=0.586 Sum_probs=8.5 Q ss_pred HHHHHCCCHHHHHHHH Q ss_conf 7898729989999997 Q gi|254780479|r 71 IFCLSSAKDEEILSAW 86 (356) Q Consensus 71 ~~~la~a~~~~vl~~w 86 (356) .-||.+|=-++=+..| T Consensus 82 MlSL~naf~~~el~~f 97 (667) T COG0272 82 MLSLDNAFDEEELRAF 97 (667) T ss_pred CCCHHHCCCHHHHHHH T ss_conf 5434422798999999 No 190 >TIGR01954 nusA_Cterm_rpt transcription termination factor NusA, C-terminal duplication; InterPro: IPR010214 NusA, or N utilisation substance protein A, is a bacterial transcription termination factor. It binds to RNA polymerase alpha subunit and promotes termination at certain RNA hairpin structures. It is named for the interaction in Escherichia coli of phage lambda antitermination protein N with the N-utilisation substance, consisting of NusA, NusB, NusE (ribosomal protein S10), and NusG . This entry represents an acidic 50-residue region found in two copies toward the C terminus of most proteobacterial NusA proteins, spaced about 26 residues apart. Analogous C-terminal extensions in some other bacterial lineages lack apparent homology but appear similarly acidic ; GO: 0030528 transcription regulator activity. Probab=44.51 E-value=21 Score=16.24 Aligned_cols=38 Identities=26% Similarity=0.263 Sum_probs=30.0 Q ss_pred HHCCCHHHHHCCCHHHHHH--HHHCCCHHHHHHHHHHHHHHH Q ss_conf 7489978987299899999--974046345889998776754 Q gi|254780479|r 66 QKWPTIFCLSSAKDEEILS--AWAGLGYYTRARNLKKCADII 105 (356) Q Consensus 66 ~~~P~~~~la~a~~~~vl~--~w~gLGyy~Rar~l~~~a~~i 105 (356) +-|-|+++||..+.+|++. . +||= ==+|.-|+..||.. T Consensus 12 eGf~t~EdlAy~~~~EL~~fGI-eG~~-Ee~A~~L~~~Ar~~ 51 (52) T TIGR01954 12 EGFTTVEDLAYVPVDELLSFGI-EGLD-EETAKELINRARNA 51 (52) T ss_pred HCCCHHHHHHCCCHHHHHHHCC-CCCC-HHHHHHHHHHHHHH T ss_conf 3673178875045566886358-9988-78899999998752 No 191 >CHL00137 rps13 ribosomal protein S13; Validated Probab=44.23 E-value=10 Score=18.57 Aligned_cols=28 Identities=29% Similarity=0.328 Sum_probs=21.2 Q ss_pred CCCCCHH---HHHHHHHHHHHHHHHHHHHHH Q ss_conf 8687314---568888877787766776643 Q gi|254780479|r 111 GNFPHKV---EILKKLPGIGDYTASAIVAIA 138 (356) Q Consensus 111 g~~P~~~---~~l~~LpGiG~yta~ai~s~a 138 (356) -.+|.+. -.|..+.|||+.+|.+|+.-+ T Consensus 7 ~~i~~~K~v~~aLt~I~GIG~~~A~~Ic~~l 37 (122) T CHL00137 7 VDLPRNKRIEIALTYIYGIGLTSAKKILEKA 37 (122) T ss_pred CCCCCCCEEEEHHHHHCCCCHHHHHHHHHHC T ss_conf 6799997731311100061899999999984 No 192 >pfam10343 DUF2419 Protein of unknown function (DUF2419). This is a family of conserved proteins found from plants to humans. The function is not known. A few members are annotated as being cobyrinic acid a,c-diamide synthetase but this could not be confirmed. Probab=44.02 E-value=21 Score=16.25 Aligned_cols=41 Identities=20% Similarity=0.402 Sum_probs=27.2 Q ss_pred HHCCCHHHHHHHHHCC-----C-HHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 8729989999997404-----6-345889998776754300578687 Q gi|254780479|r 74 LSSAKDEEILSAWAGL-----G-YYTRARNLKKCADIIVKKYEGNFP 114 (356) Q Consensus 74 la~a~~~~vl~~w~gL-----G-yy~Rar~l~~~a~~i~~~~~g~~P 114 (356) +++-+.+++-..-.|= - .-.|.+.|+++.+.++++|+|.|- T Consensus 69 ~~~~t~~~l~~iF~~~~~~e~Pll~ER~~~L~EvG~vL~~~f~Gs~~ 115 (282) T pfam10343 69 WAKCTLEELRHIFRSATDEEIPLLEERLRCLREAGRVLLEKFDGSFV 115 (282) T ss_pred HHHCCHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHH T ss_conf 98679999999834799886878899999999999999987298799 No 193 >COG0258 Exo 5'-3' exonuclease (including N-terminal domain of PolI) [DNA replication, recombination, and repair] Probab=43.84 E-value=15 Score=17.38 Aligned_cols=34 Identities=32% Similarity=0.476 Sum_probs=19.2 Q ss_pred HHHHHHCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHH Q ss_conf 6754300578687314568888-----------877787766776 Q gi|254780479|r 102 ADIIVKKYEGNFPHKVEILKKL-----------PGIGDYTASAIV 135 (356) Q Consensus 102 a~~i~~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~ 135 (356) ...+.+.|.|.-|...-++..| ||||+.||-.++ T Consensus 171 ~~~~~e~~~g~~p~qliD~~~L~Gd~sDnipGV~GIG~ktA~~Ll 215 (310) T COG0258 171 LEEVEEKFKGLTPEQLIDLKALVGDSSDNIPGVKGIGPKTALKLL 215 (310) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHH T ss_conf 999999856999899889999829653578999983899999999 No 194 >PTZ00221 cyclophilin; Provisional Probab=43.75 E-value=13 Score=17.79 Aligned_cols=58 Identities=22% Similarity=0.465 Sum_probs=40.9 Q ss_pred CCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHH-----------------HHHHHHHCCCCCCCCHHHHHHHH----HHHH Q ss_conf 99789872998999999740463458899987-----------------76754300578687314568888----8777 Q gi|254780479|r 69 PTIFCLSSAKDEEILSAWAGLGYYTRARNLKK-----------------CADIIVKKYEGNFPHKVEILKKL----PGIG 127 (356) Q Consensus 69 P~~~~la~a~~~~vl~~w~gLGyy~Rar~l~~-----------------~a~~i~~~~~g~~P~~~~~l~~L----pGiG 127 (356) |+.++.-+--|+.-||.|+. |.|+.++.+ +-++|.+-|.-..|++.+...+| -|+| T Consensus 20 ~~~~~~y~~~e~rrlkewen---yq~~hr~~ee~~~~RvFfDIsIGg~~~GRIVfELf~DvvPkTaENFRaLCTGEkG~~ 96 (249) T PTZ00221 20 PSREQVYNDIEERRLKEWEN---YQKSHRMKEEQNSCRAFLDISIGDVLAGRLVFELFEDVVPETVENFRALITGSCGID 96 (249) T ss_pred CCHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHCCCEEEEEEEECCEECCEEEEEECCCCCHHHHHHHHHHHCCCCCCC T ss_conf 63888875799988888887---777765567523742899987799876359999758987276888898853788778 Q ss_pred HH Q ss_conf 87 Q gi|254780479|r 128 DY 129 (356) Q Consensus 128 ~y 129 (356) .. T Consensus 97 ~~ 98 (249) T PTZ00221 97 TN 98 (249) T ss_pred CC T ss_conf 88 No 195 >PRK05179 rpsM 30S ribosomal protein S13; Validated Probab=43.00 E-value=11 Score=18.15 Aligned_cols=27 Identities=33% Similarity=0.443 Sum_probs=20.4 Q ss_pred CCCCHH---HHHHHHHHHHHHHHHHHHHHH Q ss_conf 687314---568888877787766776643 Q gi|254780479|r 112 NFPHKV---EILKKLPGIGDYTASAIVAIA 138 (356) Q Consensus 112 ~~P~~~---~~l~~LpGiG~yta~ai~s~a 138 (356) .+|.+. -.|..+-|||+.+|.+|+.-+ T Consensus 8 dl~~~K~v~~ALt~I~GIG~~~A~~Ic~~l 37 (122) T PRK05179 8 DIPRNKRVVIALTYIYGIGRTRAKEILAAA 37 (122) T ss_pred CCCCCCCHHHHHHHHCCCCHHHHHHHHHHC T ss_conf 489997868477300275899999999982 No 196 >cd04666 Nudix_Hydrolase_9 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate s Probab=42.25 E-value=26 Score=15.70 Aligned_cols=83 Identities=11% Similarity=0.035 Sum_probs=45.0 Q ss_pred CEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCC-------EECCCEEEEEEE----EEEEEEEEEEECC Q ss_conf 60577645720132047442111134672158886301010000-------213735899831----5999999999807 Q gi|254780479|r 250 NRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANW-------ILCNTITHTFTH----FTLTLFVWKTIVP 318 (356) Q Consensus 250 ~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~~~~-------~~l~~ikH~fTH----~~L~i~v~~~~~~ 318 (356) -+|||...... |-|.||-+.++.++........+...+-. ..++.+.|...+ ..-.+.+|...+. T Consensus 15 ~eVLLItsr~~----~rWilPKG~~e~~es~~~aA~REa~EEAGv~g~i~~~~lG~~~y~k~~~~~~~~~~V~~f~~~V~ 90 (122) T cd04666 15 VEVLLVTSRRT----GRWIVPKGGPEKDESPAEAAAREAWEEAGVRGKIGKRPLGRFEYRKRSKNRPPRCEVAVFPLEVT 90 (122) T ss_pred EEEEEEEECCC----CEEECCCCCCCCCCCHHHHHHHHHHHHCCCEEEECCCCCEEEEEEECCCCCCCCEEEEEEEEEEE T ss_conf 89999992699----94998875367998999998888887429488886758889999982799853199999999998 Q ss_pred CC----CCC--CCCEEECHHHHHH Q ss_conf 76----766--7813322788874 Q gi|254780479|r 319 QI----VII--PDSTWHDAQNLAN 336 (356) Q Consensus 319 ~~----~~~--~~~~Wv~~~el~~ 336 (356) .. +.. -.-+|+++++-.. T Consensus 91 ~~~~~wpE~~~R~R~W~s~~eA~~ 114 (122) T cd04666 91 EELDEWPEMHQRKRKWFSPEEAAL 114 (122) T ss_pred EECCCCCCCCCEEEEECCHHHHHH T ss_conf 511259975867999938999998 No 197 >pfam05516 consensus Probab=42.14 E-value=22 Score=16.21 Aligned_cols=55 Identities=15% Similarity=0.320 Sum_probs=33.1 Q ss_pred HHHHHHHHHCCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCC-HHHHHHHHH Q ss_conf 658998774899789872998999999740463458899987767543005786873-145688888 Q gi|254780479|r 59 PYFKKFMQKWPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPH-KVEILKKLP 124 (356) Q Consensus 59 ~~~~~~~~~~P~~~~la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~-~~~~l~~Lp 124 (356) ..|+.|.+.||++..++.-+++++ |.=+| + .-=+..++.|.+++++ ++..|+.+- T Consensus 47 eIy~~f~~~FPd~~~v~~ide~el-KS~~~-------K---ekWR~F~~~fe~~veDyNfGTllR~d 102 (136) T pfam05516 47 EIYEHFREEFPEFKPVAVIDEDEL-KSKEG-------K---EKWREFCNKFEKKVEDYNFGTLLRLD 102 (136) T ss_pred HHHHHHHHHCCCCCCCCCCCHHHH-CCCHH-------H---HHHHHHHHHHHCCCCCCCCEEEEECC T ss_conf 999999987868520130699997-08331-------3---99999999985626666640346337 No 198 >PRK13023 bifunctional preprotein translocase subunit SecD/SecF; Reviewed Probab=42.10 E-value=11 Score=18.34 Aligned_cols=20 Identities=20% Similarity=0.263 Sum_probs=8.9 Q ss_pred ECCCCHHHHHHHH--HCCCCCC Q ss_conf 1243146665765--1975457 Q gi|254780479|r 145 VVDTNIERIISRY--FDIIKPA 164 (356) Q Consensus 145 ~vD~Nv~RVl~R~--~~~~~~~ 164 (356) ++|-++.-+=.|+ +|+.+|. T Consensus 156 Ai~qaieiiR~RId~lGv~EP~ 177 (840) T PRK13023 156 ALARSIEIVRRRVEELGYGNPL 177 (840) T ss_pred HHHHHHHHHHHHHHHHCCCCCE T ss_conf 9998999999866652688860 No 199 >PRK04053 rps13p 30S ribosomal protein S13P; Reviewed Probab=42.01 E-value=12 Score=18.03 Aligned_cols=47 Identities=26% Similarity=0.298 Sum_probs=29.5 Q ss_pred HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCH Q ss_conf 998776754300578687314568888877787766776643496313124314 Q gi|254780479|r 97 NLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNI 150 (356) Q Consensus 97 ~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv 150 (356) ++..++++.-.+..|.-|- .-.|..+-|||+.+|.+|+.-+ -+|.|. T Consensus 5 ~~~~mvRI~gvdIp~~K~i-~~ALt~IyGIG~~~A~~Ic~~l------gId~~~ 51 (149) T PRK04053 5 EFRYIVRIANTDLDGTKPV-EYALTGIKGIGRRTARAIARKL------GLDPHA 51 (149) T ss_pred HHHHHHHHCCCCCCCCCEE-EEECCCCCCCCHHHHHHHHHHH------CCCCCC T ss_conf 8667586508678999686-4441111484899999999991------899877 No 200 >TIGR02111 PQQ_syn_pqqC coenzyme PQQ biosynthesis protein C; InterPro: IPR011845 This entry describes the coenzyme PQQ (pyrrolo-quinoline-quinone) biosynthesis protein PqqC. Pyrroloquinoline quinone (PQQ) is the prosthetic group of several bacterial enzymes, including methanol dehydrogenase of methylotrophs and the glucose dehydrogenase of a number of bacteria . PQQC has been found to be required in the synthesis of PQQ but its function is unclear. This entry does not include related proteins likely to be functionally distinct from PqqC, such as homologs found in the Chlamydias.; GO: 0018189 pyrroloquinoline quinone biosynthetic process. Probab=41.98 E-value=26 Score=15.70 Aligned_cols=76 Identities=20% Similarity=0.332 Sum_probs=54.2 Q ss_pred CCCHHHHHHHHHHHHHCCHHH-CCH--HHHHHHHHCCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 787046766446731161203-326--58998774899789872998999999740463458899987767543005786 Q gi|254780479|r 36 LPSPYKVWISEIMLQQTTVKT-VEP--YFKKFMQKWPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGN 112 (356) Q Consensus 36 ~~~py~v~vseimlqqT~v~~-v~~--~~~~~~~~~P~~~~la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g~ 112 (356) ..-||++++=+=-+-++||.. |++ ||+.- -=.-|-.-||+-+..+|-+.|. -| +.+++|. T Consensus 20 ~~HPFH~~lh~G~~~~~Qv~AWv~NRyyYQ~~-IP~KDAA~Lar~~D~~~Rr~W~-------qR---------i~DhDG~ 82 (239) T TIGR02111 20 DKHPFHALLHEGKLTREQVQAWVLNRYYYQVN-IPLKDAAILARCPDPELRRIWR-------QR---------ILDHDGE 82 (239) T ss_pred CCCCHHHHHHCCCCCHHHHHHHHHHHHHHHCC-CCHHHHHHHCCCCCHHHHHHHH-------HH---------HHCCCCC T ss_conf 56743577516998477899999877887325-8456799860699888899998-------74---------4204867 Q ss_pred CCCH--HHHHHHHH-HHHH Q ss_conf 8731--45688888-7778 Q gi|254780479|r 113 FPHK--VEILKKLP-GIGD 128 (356) Q Consensus 113 ~P~~--~~~l~~Lp-GiG~ 128 (356) .|++ .+.++.|- +||= T Consensus 83 ~~~dGGi~rWLrLaeAvGL 101 (239) T TIGR02111 83 VEEDGGIERWLRLAEAVGL 101 (239) T ss_pred CCCCCHHHHHHHHHHCCCC T ss_conf 8887428999987631688 No 201 >COG1031 Uncharacterized Fe-S oxidoreductase [Energy production and conversion] Probab=40.96 E-value=13 Score=17.73 Aligned_cols=30 Identities=33% Similarity=0.409 Sum_probs=12.4 Q ss_pred CCCCC-HHHHHHHHHHHHHHHHHHHHHHHCCCEEEECC Q ss_conf 86873-14568888877787766776643496313124 Q gi|254780479|r 111 GNFPH-KVEILKKLPGIGDYTASAIVAIAFNHFAVVVD 147 (356) Q Consensus 111 g~~P~-~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD 147 (356) |.+|. +.++|.+|= .+|...|=+..+.-+| T Consensus 253 ~e~P~PnPealekL~-------~Gir~~AP~l~tLHiD 283 (560) T COG1031 253 GEVPRPNPEALEKLF-------RGIRNVAPNLKTLHID 283 (560) T ss_pred CCCCCCCHHHHHHHH-------HHHHHHCCCCEEEEEC T ss_conf 879999989999999-------9998618987266545 No 202 >pfam05166 YcgL YcgL domain. This family of proteins formerly called DUF709 includes the E. coli gene ycgL. Homologues of YcgL are found in gammaproteobacteria. The structure of this protein shows a novel alpha/beta/alpha sandwich structure. Probab=40.95 E-value=16 Score=17.21 Aligned_cols=21 Identities=33% Similarity=0.472 Sum_probs=17.1 Q ss_pred HHHHCCCHHHHHHHHHCCCHH Q ss_conf 898729989999997404634 Q gi|254780479|r 72 FCLSSAKDEEILSAWAGLGYY 92 (356) Q Consensus 72 ~~la~a~~~~vl~~w~gLGyy 92 (356) +.||.|+.++|+..-+--||| T Consensus 48 r~LA~~d~~~Vl~~l~~~Gfy 68 (74) T pfam05166 48 KKLARADVEKVLEALEEQGFY 68 (74) T ss_pred CHHHCCCHHHHHHHHHHCCEE T ss_conf 132139999999999868978 No 203 >PRK13024 bifunctional preprotein translocase subunit SecD/SecF; Reviewed Probab=40.92 E-value=11 Score=18.19 Aligned_cols=31 Identities=32% Similarity=0.477 Sum_probs=23.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHH Q ss_conf 56888887778776677664349631312431466657651 Q gi|254780479|r 118 EILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYF 158 (356) Q Consensus 118 ~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~ 158 (356) ..-++|||| ||-|++|+ ++||+||.= +.|+- T Consensus 309 ~~tLTLpGI----AGiiLtiG-----maVDaNVli-fERik 339 (741) T PRK13024 309 GAVLTLPGI----AGLVLGIG-----MAVDANVII-YERIK 339 (741) T ss_pred HHHCCHHHH----HHHHHHHH-----HHHCCCEEH-HHHHH T ss_conf 422269999----99999998-----763211100-29899 No 204 >PRK02406 DNA polymerase IV; Validated Probab=40.74 E-value=27 Score=15.54 Aligned_cols=132 Identities=18% Similarity=0.152 Sum_probs=61.2 Q ss_pred HHCCCHHHHHCCCHHHHHHHHHCCCHH-----HHHHHHHHHHHHHHHCCCCC---CCCHHHHHH------HHHHHHHHHH Q ss_conf 748997898729989999997404634-----58899987767543005786---873145688------8887778776 Q gi|254780479|r 66 QKWPTIFCLSSAKDEEILSAWAGLGYY-----TRARNLKKCADIIVKKYEGN---FPHKVEILK------KLPGIGDYTA 131 (356) Q Consensus 66 ~~~P~~~~la~a~~~~vl~~w~gLGyy-----~Rar~l~~~a~~i~~~~~g~---~P~~~~~l~------~LpGiG~yta 131 (356) ..|++...+|..=.++|.+. +|.. +.-+-|=|.|--.. +-+|. -|++..+.+ .+||||+-|+ T Consensus 118 ~~~~~~~~~A~~ir~~I~~~---~gl~~SiGIa~nk~lAK~As~~~-KP~G~~vv~~~~~~~~L~~lpv~~l~GIG~~~~ 193 (355) T PRK02406 118 LCIGSATLIAQEIRRDIWNE---LGLTASAGVAPNKFLAKIASDWN-KPNGQFVITPEEVPAFVATLPVEKIPGVGKVTA 193 (355) T ss_pred CCCCCHHHHHHHHHHHHHHH---HCCCEEEECCCCHHHHHHHHHHC-CCCCCEEECHHHHHHHHHHCCCCCCCCCCHHHH T ss_conf 01789999999999999998---79966870256799999988623-987663656789999996188740588588999 Q ss_pred HHHHHHHCCCEEEECCCCHHH-----------HHHHHHCCCCCCCHHHHHHHHHHHCCC---CCCCHHHHHHHHHHHHHH Q ss_conf 677664349631312431466-----------657651975457356899987410257---852267899999999863 Q gi|254780479|r 132 SAIVAIAFNHFAVVVDTNIER-----------IISRYFDIIKPAPLYHKTIKNYARKIT---STSRPGDFVQAMMDLGAL 197 (356) Q Consensus 132 ~ai~s~a~~~~~~~vD~Nv~R-----------Vl~R~~~~~~~~~~~~k~l~~~~~~~~---~~~~~~~~nQAlMdlGa~ 197 (356) .-...++...-.-..+..... +.....|++...-..+..-+.+...-+ +.....+.-++|.+|... T Consensus 194 ~~L~~~GI~ti~DL~~~~~~~L~~~fG~~g~~l~~~a~Gid~~~V~~~~~~KSI~~~~Tf~~d~~~~~~i~~~l~~l~~~ 273 (355) T PRK02406 194 EKLHALGIRTCADLQKWDLATLLRRFGKFGRRLYERARGIDERPVKPDRERKSVGVERTYAKDLYDLEDCEAELERLYPE 273 (355) T ss_pred HHHHHCCCCCHHHHHHCCHHHHHHHHCHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHH T ss_conf 99998298179997609999999997989999999965998743033667610555678799999999999999999999 Q ss_pred HCCC Q ss_conf 2041 Q gi|254780479|r 198 ICTS 201 (356) Q Consensus 198 iC~p 201 (356) ||.. T Consensus 274 v~~r 277 (355) T PRK02406 274 LEAR 277 (355) T ss_pred HHHH T ss_conf 9999 No 205 >TIGR01448 recD_rel helicase, RecD/TraA family; InterPro: IPR006345 These sequences represent a family similar to RecD, the exodeoxyribonuclease V alpha chain of IPR006344 from INTERPRO. Members of this family, however, are not found in a context of RecB and RecC and are longer by about 200 amino acids at the amino end. Chlamydia muridarum has both a member of this family and a RecD. . Probab=40.68 E-value=10 Score=18.57 Aligned_cols=17 Identities=35% Similarity=0.540 Sum_probs=7.7 Q ss_pred HHHH-HHHHHHHHHHHHH Q ss_conf 6888-8877787766776 Q gi|254780479|r 119 ILKK-LPGIGDYTASAIV 135 (356) Q Consensus 119 ~l~~-LpGiG~yta~ai~ 135 (356) .|.. ..|||=-||.+++ T Consensus 195 ~L~~d~~GiGF~~AD~lA 212 (769) T TIGR01448 195 LLAEDVKGIGFKTADQLA 212 (769) T ss_pred HHHHHHCCCCHHHHHHHH T ss_conf 136751374056899999 No 206 >pfam09735 Nckap1 Membrane-associated apoptosis protein. Expression of this protein was found to be markedly reduced in patients with Alzheimer's disease. It is involved in the regulation of actin polymerisation in the brain as part of a WAVE2 signalling complex. Probab=40.31 E-value=27 Score=15.49 Aligned_cols=17 Identities=12% Similarity=0.288 Sum_probs=8.2 Q ss_pred HHHHHHHHHHHCCCCCC Q ss_conf 99999999996289988 Q gi|254780479|r 8 IQSKILDWYDTNHRVLP 24 (356) Q Consensus 8 ~~~~ll~w~~~~~R~lp 24 (356) +.+.=+-||-.|.-..| T Consensus 368 ~ardEV~Wl~RH~~~~p 384 (1118) T pfam09735 368 LARDEVLWLLRHAGNVP 384 (1118) T ss_pred HHHHHHHHHHHCCCCCC T ss_conf 87778999884156787 No 207 >cd01700 Pol_V Pol V was discovered in Escherichia coli as Umuc and UmuD proteins induced by UV. This branch of DNA polymerases is mostly found in bacteria. Pol V enables DNA replication to bypass covalently linked cys-sin T-T photo-dimers and 6-4 T-T or T-C photoproducts, which would otherwise stall the DNA replication fork. Probab=40.24 E-value=27 Score=15.49 Aligned_cols=69 Identities=20% Similarity=0.249 Sum_probs=36.6 Q ss_pred CHHHHHCCCHHHHHHHHH-----CCCHHHHHHHHHHHHHHHHHC---C-CCCC---CCHHHH------HHHHHHHHHHHH Q ss_conf 978987299899999974-----046345889998776754300---5-7868---731456------888887778776 Q gi|254780479|r 70 TIFCLSSAKDEEILSAWA-----GLGYYTRARNLKKCADIIVKK---Y-EGNF---PHKVEI------LKKLPGIGDYTA 131 (356) Q Consensus 70 ~~~~la~a~~~~vl~~w~-----gLGyy~Rar~l~~~a~~i~~~---~-~g~~---P~~~~~------l~~LpGiG~yta 131 (356) ++.++|..=.++|.+.-. |.|+ -+-|=|.|--+.+. + +|-+ |++... +.++||||+-|+ T Consensus 112 ~~~~~a~~ir~~I~~~~Glt~SiGIa~---nk~lAKiAs~~~Kp~~~~~~~~~~~~~~~~~~~l~~lpv~~i~GIG~~t~ 188 (344) T cd01700 112 DLEDLGREIRATVLQWTGIPVGVGIGP---TKTLAKLANHAAKKWPAYGGVVVLDSEERRRKLLKRTPVGDVWGIGRRLS 188 (344) T ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECC---CHHHHHHHHHHHCCCCCCCCEEEECCHHHHHHHHHHCCHHHCCCCCHHHH T ss_conf 999999999999999879976983343---17899997887377888997144189899998755287444278369999 Q ss_pred HHHHHHHCCC Q ss_conf 6776643496 Q gi|254780479|r 132 SAIVAIAFNH 141 (356) Q Consensus 132 ~ai~s~a~~~ 141 (356) .....+...- T Consensus 189 ~kL~~~GI~T 198 (344) T cd01700 189 KRLAAMGIHT 198 (344) T ss_pred HHHHHCCCCC T ss_conf 9999869978 No 208 >PRK03137 1-pyrroline-5-carboxylate dehydrogenase; Provisional Probab=39.97 E-value=28 Score=15.46 Aligned_cols=68 Identities=15% Similarity=0.131 Sum_probs=43.0 Q ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHC-----------CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCH Q ss_conf 99974046345889998776754300-----------578687314568888877787766776643496313124314 Q gi|254780479|r 83 LSAWAGLGYYTRARNLKKCADIIVKK-----------YEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNI 150 (356) Q Consensus 83 l~~w~gLGyy~Rar~l~~~a~~i~~~-----------~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv 150 (356) .+.|+.+....|++-|+++|..|.++ -|--+.+...|....-..-+|.|+....+..+.+....+|.. T Consensus 86 f~~W~~~~~~eRa~~L~k~ad~l~~~~~ela~l~~~E~GK~~~ea~~Ev~~ai~~~~~~a~~~~~~~~~~~~~~~~g~~ 164 (514) T PRK03137 86 FETWRKWSPEDRARILFRAAAILRRRKHEFSAILVKEAGKPWDEADADTAEAIDFLEYYARQMLKLADGKPVLSRPGEH 164 (514) T ss_pred HHHHCCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCE T ss_conf 9988049999999999999999997999999999997592889999999999999999999999852884557889850 No 209 >PRK01151 rps17E 30S ribosomal protein S17e; Validated Probab=39.75 E-value=15 Score=17.24 Aligned_cols=29 Identities=31% Similarity=0.488 Sum_probs=24.7 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHHH Q ss_conf 88999877675430057868731456888 Q gi|254780479|r 94 RARNLKKCADIIVKKYEGNFPHKVEILKK 122 (356) Q Consensus 94 Rar~l~~~a~~i~~~~~g~~P~~~~~l~~ 122 (356) |-+.+.++|+.|++.|.+.|-.|++.=.. T Consensus 4 r~k~IKr~a~~lieky~~~ft~DF~~NK~ 32 (58) T PRK01151 4 RPTFIKRTAEELLEKYPDKFTDDFEHNKK 32 (58) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 62799999999999832052478888889 No 210 >cd03586 Pol_IV_kappa Pol_IV_kappa, a member of the Y-family of DNA polymerases. Pol_Y's can transverse replication-blocking DNA lesions, such as cyclobutane pyrimidine dimers resulting from UV damage, at the cost of an elevated error rate. The Y-family has no 3'-5' exonuclease activity. In addition to possessing a topology akin to a right hand, with "thumb", "fingers" and "palm" motifs, like polymerases from the A-, B-, C- and X-families, the Y-family has a unique "little finger" motif. Expression of Y-family polymerases is often induced by DNA damage. These polymerases are phylogenetically unrelated to classical DNA polymerases. Originally called the DinB family, they belong to the recently described Y-family of DNA polymerases. Pol IV is mostly found in bacteria and archaea. Although the structure of Pol IV is similar to that of Pol eta, it shows markedly differenct efficiencies and fidelities in bypassing various DNA lesions. All Pol IV-like polymerases studied to date are able to b Probab=39.23 E-value=28 Score=15.38 Aligned_cols=82 Identities=21% Similarity=0.253 Sum_probs=39.7 Q ss_pred HHCCCHHHHHCCCHHHHHHHHH-----CCCHHHHHHHHHHHHHHHHHCCCCCC---CCHHHHHH------HHHHHHHHHH Q ss_conf 7489978987299899999974-----04634588999877675430057868---73145688------8887778776 Q gi|254780479|r 66 QKWPTIFCLSSAKDEEILSAWA-----GLGYYTRARNLKKCADIIVKKYEGNF---PHKVEILK------KLPGIGDYTA 131 (356) Q Consensus 66 ~~~P~~~~la~a~~~~vl~~w~-----gLGyy~Rar~l~~~a~~i~~~~~g~~---P~~~~~l~------~LpGiG~yta 131 (356) ..|-+..++|..=.++|.+.-. |+| .-+-|=|.|--+.+. +|.+ |++..+.+ .+||||+.|+ T Consensus 110 ~~~~~~~~~a~~ir~~I~~~tgl~~SiGIa---~nk~lAKiAs~~~KP-~G~~~i~~~~~~~~L~~lpv~~l~GIG~~~~ 185 (337) T cd03586 110 RLFGSATEIAKEIRARIREETGLTCSVGIA---PNKFLAKIASDLNKP-NGLTVIPPEKVWAFLAPLPVRKIWGIGKVTA 185 (337) T ss_pred CCCCCHHHHHHHHHHHHHHHHCCCEEEEEE---CCHHHHHHHHHHHCC-CCCEEECHHHHHHHHCCCCCHHCCCCCHHHH T ss_conf 028999999999999999987997489995---739999999876100-7967608899999861276302078468999 Q ss_pred HHHHHHHCCCEEEECCCCHH Q ss_conf 67766434963131243146 Q gi|254780479|r 132 SAIVAIAFNHFAVVVDTNIE 151 (356) Q Consensus 132 ~ai~s~a~~~~~~~vD~Nv~ 151 (356) .-....+...-.-..+-+.. T Consensus 186 ~~L~~~gI~t~~dl~~~~~~ 205 (337) T cd03586 186 EKLNRLGIRTIGDLAAADLE 205 (337) T ss_pred HHHHHCCCCCHHHHHCCCHH T ss_conf 99998089689998649999 No 211 >cd00008 53EXOc 5'-3' exonuclease; T5 type 5'-3' exonuclease domains may co-occur with DNA polymerase I (Pol I) domains, or be part of Pol I containing complexes. They digest dsDNA and ssDNA, releasing mono-,di- and tri-nucleotides, as well as oligonucleotides, and have also been reported to possess RNase H activity. Also called 5' nuclease family, involved in structure-specific cleavage of flaps formed by Pol I activity (similar to mammalian flap endonuclease I, FEN-1). A single nucleic acid strand may be threaded through the 5' nuclease enzyme before cleavage occurs. The domain binds two divalent metal ions which are necessary for activity. Probab=38.90 E-value=15 Score=17.37 Aligned_cols=43 Identities=33% Similarity=0.504 Sum_probs=22.6 Q ss_pred HCCCCCCCCHHHHHHHH-----------HHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHC Q ss_conf 00578687314568888-----------877787766776643496313124314666576519 Q gi|254780479|r 107 KKYEGNFPHKVEILKKL-----------PGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFD 159 (356) Q Consensus 107 ~~~~g~~P~~~~~l~~L-----------pGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~ 159 (356) ++| |.-|+.+-+++.| ||||+.||..++.-- |++.-++..+-. T Consensus 162 ~~~-gv~p~q~~d~~aL~GD~sDnIpGV~GiG~KtA~kLl~~~---------gsle~i~~~~~~ 215 (240) T cd00008 162 EKY-GVTPAQIIDYKALMGDSSDNIPGVPGIGEKTAAKLLKEY---------GSLEGILENLDK 215 (240) T ss_pred HHC-CCCHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHH---------CCHHHHHHHHHH T ss_conf 960-989999987699728764688999985789999999980---------999999982985 No 212 >PTZ00134 40S ribosomal protein S18; Provisional Probab=38.80 E-value=9.9 Score=18.61 Aligned_cols=30 Identities=23% Similarity=0.278 Sum_probs=21.7 Q ss_pred CCCCCCCHH---HHHHHHHHHHHHHHHHHHHHH Q ss_conf 578687314---568888877787766776643 Q gi|254780479|r 109 YEGNFPHKV---EILKKLPGIGDYTASAIVAIA 138 (356) Q Consensus 109 ~~g~~P~~~---~~l~~LpGiG~yta~ai~s~a 138 (356) .|-.+|.+. -.|..+.|||+.||.+|+.-+ T Consensus 18 ~gtdi~g~K~v~~aLt~I~GIG~~~A~~Ic~~~ 50 (154) T PTZ00134 18 LNTNVDGREKVTIALTAIKGIGRRFATVVCKQA 50 (154) T ss_pred CCCCCCCCCEEEEEEEEECCCCHHHHHHHHHHC T ss_conf 387589995889985322064899999999980 No 213 >KOG3835 consensus Probab=38.06 E-value=30 Score=15.26 Aligned_cols=81 Identities=22% Similarity=0.235 Sum_probs=54.3 Q ss_pred CHHHCCHHHHHHHHHC-CCHHHHHCCCHHHHHHHHHCCCHHH---HHHHHHHHHHHHHHCCC---C-------------C Q ss_conf 1203326589987748-9978987299899999974046345---88999877675430057---8-------------6 Q gi|254780479|r 53 TVKTVEPYFKKFMQKW-PTIFCLSSAKDEEILSAWAGLGYYT---RARNLKKCADIIVKKYE---G-------------N 112 (356) Q Consensus 53 ~v~~v~~~~~~~~~~~-P~~~~la~a~~~~vl~~w~gLGyy~---Rar~l~~~a~~i~~~~~---g-------------~ 112 (356) |-+-.+.||+-||++= -||+.|.+|.|+|.|....=.|--. -.|+|+++-+.-+.+-+ - + T Consensus 19 qkANLlsYyd~FIqQGGDDvqQlceagEeEFLEIMaLVGMa~KPLHVRRlQkALrew~tnpglFnqpl~~ipvssip~yk 98 (495) T KOG3835 19 QKANLLSYYDVFIQQGGDDVQQLCEAGEEEFLEIMALVGMAPKPLHVRRLQKALREWSTNPGLFNQPLQQIPVSSIPNYK 98 (495) T ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCHHCCCCCCCCCCCCCCCC T ss_conf 87317999999998645689999875579999999986147764269999999998712950411852116999876556 Q ss_pred CCCHHHHHHHHHHHHHHHHHH Q ss_conf 873145688888777877667 Q gi|254780479|r 113 FPHKVEILKKLPGIGDYTASA 133 (356) Q Consensus 113 ~P~~~~~l~~LpGiG~yta~a 133 (356) .+++---.+-||||---|..- T Consensus 99 ~~et~pt~lgl~~~s~~~s~~ 119 (495) T KOG3835 99 PAETDPTALGLPGISAATSPK 119 (495) T ss_pred CCCCCCCCCCCCCCHHCCCCC T ss_conf 788884201475300014899 No 214 >TIGR03631 bact_S13 30S ribosomal protein S13. This model describes bacterial ribosomal protein S13, to the exclusion of the homologous archaeal S13P and eukaryotic ribosomal protein S18. This model identifies some (but not all) instances of chloroplast and mitochondrial S13, which is of bacterial type. Probab=37.88 E-value=12 Score=17.94 Aligned_cols=27 Identities=30% Similarity=0.396 Sum_probs=20.0 Q ss_pred CCCCHH---HHHHHHHHHHHHHHHHHHHHH Q ss_conf 687314---568888877787766776643 Q gi|254780479|r 112 NFPHKV---EILKKLPGIGDYTASAIVAIA 138 (356) Q Consensus 112 ~~P~~~---~~l~~LpGiG~yta~ai~s~a 138 (356) .+|.+. -.|..+-|||+.+|.+|+.-+ T Consensus 6 di~~~K~v~~ALt~I~GIG~~~A~~Ic~~l 35 (113) T TIGR03631 6 DIPNNKRVEIALTYIYGIGRTRARKILEKA 35 (113) T ss_pred CCCCCCCHHHHHHCEECCCHHHHHHHHHHC T ss_conf 499996746065200275899999999992 No 215 >pfam00113 Enolase_C Enolase, C-terminal TIM barrel domain. Probab=37.83 E-value=6.5 Score=19.90 Aligned_cols=29 Identities=17% Similarity=0.140 Sum_probs=13.0 Q ss_pred HHCCCCCCCC--HHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 3005786873--1456888887778776677664349 Q gi|254780479|r 106 VKKYEGNFPH--KVEILKKLPGIGDYTASAIVAIAFN 140 (356) Q Consensus 106 ~~~~~g~~P~--~~~~l~~LpGiG~yta~ai~s~a~~ 140 (356) +.+-||-.|. +.++-+. +...||-.-+|. T Consensus 66 vgDEGGfaP~~~~~e~aL~------li~eAi~~ag~~ 96 (296) T pfam00113 66 VGDEGGFAPNIQSNKEALD------LIVEAIEKAGYT 96 (296) T ss_pred CCCCCCCCCCCCCHHHHHH------HHHHHHHHCCCC T ss_conf 5778787999988199999------999999974898 No 216 >pfam11372 DUF3173 Protein of unknown function (DUF3173). This family of proteins with unknown function appears to be restricted to Firmicutes. Probab=37.56 E-value=20 Score=16.48 Aligned_cols=25 Identities=20% Similarity=0.513 Sum_probs=10.9 Q ss_pred CHHHHHHHHHCCCHH-HHHHHHHHHHHHHH Q ss_conf 989999997404634-58899987767543 Q gi|254780479|r 78 KDEEILSAWAGLGYY-TRARNLKKCADIIV 106 (356) Q Consensus 78 ~~~~vl~~w~gLGyy-~Rar~l~~~a~~i~ 106 (356) +-+|+++ |||- +-|+++.+-|+.+. T Consensus 5 t~~dLi~----lGf~~~~A~~IIrqAK~~l 30 (59) T pfam11372 5 TKKDLIK----LGFKPHTARDIIRQAKELL 30 (59) T ss_pred CHHHHHH----HCCCHHHHHHHHHHHHHHH T ss_conf 7999999----5798879999999999999 No 217 >pfam08887 GAD-like GAD-like domain. This domain is functionally uncharacterized, but it appears to be distantly related to the GAD domain pfam02938. Probab=37.39 E-value=25 Score=15.77 Aligned_cols=13 Identities=8% Similarity=0.376 Sum_probs=8.0 Q ss_pred HHHHHHHHCCCHH Q ss_conf 5899877489978 Q gi|254780479|r 60 YFKKFMQKWPTIF 72 (356) Q Consensus 60 ~~~~~~~~~P~~~ 72 (356) .|+.|+++|.... T Consensus 3 ~~e~fl~~fG~~~ 15 (109) T pfam08887 3 TLEEFLESFGPPI 15 (109) T ss_pred HHHHHHHHCCCCC T ss_conf 7899999728966 No 218 >cd00080 HhH2_motif Helix-hairpin-helix class 2 (Pol1 family) motif. HhH2 domains are found in Rad2 family of prokaryotic and eukaryotic replication and repair nucleases, i.e., DNA polymerase I, Taq DNA polymerase, DNA repair protein Rad2 endonuclease, flap endonuclease, exonuclease I and IX, 5'-3' exonuclease and also bacteriophage Rnase H. These nucleases degrade RNA-DNA or DNA-DNA duplexes, or both and play essential roles in DNA duplication, repair, and recombination. Probab=37.12 E-value=8.2 Score=19.17 Aligned_cols=19 Identities=32% Similarity=0.552 Sum_probs=14.4 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 5688888777877667766 Q gi|254780479|r 118 EILKKLPGIGDYTASAIVA 136 (356) Q Consensus 118 ~~l~~LpGiG~yta~ai~s 136 (356) |.+-.+||||+.||..++. T Consensus 22 DnipGV~GIG~ktA~~ll~ 40 (75) T cd00080 22 DNIPGVPGIGPKTALKLLK 40 (75) T ss_pred CCCCCCCCCCHHHHHHHHH T ss_conf 5887758637999999999 No 219 >PRK11820 hypothetical protein; Provisional Probab=37.04 E-value=28 Score=15.43 Aligned_cols=34 Identities=15% Similarity=0.158 Sum_probs=27.4 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH Q ss_conf 8899987767543005786873145688888777 Q gi|254780479|r 94 RARNLKKCADIIVKKYEGNFPHKVEILKKLPGIG 127 (356) Q Consensus 94 Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG 127 (356) .+..+++.-+.|.+.++...|-+..+|+++|||= T Consensus 84 ~~~~y~~~l~~i~~~~~~~~~~~l~~ll~~p~v~ 117 (288) T PRK11820 84 LAKQYLKALEEVKEELPEAGEISLDDLLRWPGVL 117 (288) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCHHHHHCCCCCC T ss_conf 9999999999999874998999999996386522 No 220 >TIGR03629 arch_S13P archaeal ribosomal protein S13P. This model describes exclusively the archaeal ribosomal protein S13P. It excludes the homologous eukaryotic 40S ribosomal protein S18 and bacterial 30S ribosomal protein S13. Probab=36.34 E-value=8.9 Score=18.92 Aligned_cols=30 Identities=23% Similarity=0.224 Sum_probs=22.0 Q ss_pred CCCCCCCHH---HHHHHHHHHHHHHHHHHHHHH Q ss_conf 578687314---568888877787766776643 Q gi|254780479|r 109 YEGNFPHKV---EILKKLPGIGDYTASAIVAIA 138 (356) Q Consensus 109 ~~g~~P~~~---~~l~~LpGiG~yta~ai~s~a 138 (356) .|-.+|.+. -.|..+.|||+.+|.+|+.-+ T Consensus 9 ~g~di~~~K~v~~aLt~I~GIG~~~A~~Ic~~~ 41 (144) T TIGR03629 9 ADTDLDGNKPVEYALTGIKGIGRRFARAIARKL 41 (144) T ss_pred ECCCCCCCCEEEEEEEEECCCCHHHHHHHHHHH T ss_conf 075489996898872212372899999999990 No 221 >cd00424 Pol_Y Y-family of DNA polymerases. Pol_Y's can transverse replication-blocking DNA lesions, such as cyclobutane pyrimidine dimers resulting from UV damage, at the cost of an elevated error rate. The Y-family has no 3'-5' exonuclease activity. In addition to possessing a topology akin to a right hand, with "thumb", "fingers" and "palm" motifs, like polymerases from the A-, B-, C- and X-families, the Y-family has a unique "little finger" motif. Expression of Y-family polymerases is often induced by DNA damage. These polymerases are phylogenetically unrelated to classical DNA polymerases. Probab=35.93 E-value=18 Score=16.70 Aligned_cols=89 Identities=22% Similarity=0.247 Sum_probs=41.6 Q ss_pred HCCCHHHHHCCCHHHHHHHHHCCCHHHH---HH--HHHHHHHHHHHCCCCC---CCCHHH-------HHHHHHHHHHHHH Q ss_conf 4899789872998999999740463458---89--9987767543005786---873145-------6888887778776 Q gi|254780479|r 67 KWPTIFCLSSAKDEEILSAWAGLGYYTR---AR--NLKKCADIIVKKYEGN---FPHKVE-------ILKKLPGIGDYTA 131 (356) Q Consensus 67 ~~P~~~~la~a~~~~vl~~w~gLGyy~R---ar--~l~~~a~~i~~~~~g~---~P~~~~-------~l~~LpGiG~yta 131 (356) .|.+...+|..=.++|.. -+|.-.. |. -|=|+|--+. +-+|. .|++.. .+..|||||+-|+ T Consensus 111 ~~~~~~~~a~~ir~~I~~---~~gl~~SiGIa~nk~lAKlAs~~~-Kp~g~~~~~~~~~~~~~l~~lpv~~l~GIG~~~~ 186 (341) T cd00424 111 LLEDGLDLALEIKQIIAE---ETGITASIGIAANKLLAKLAAKLA-KPNGLTILFPESLPEEFLIKLPLSDIPGIGSVTA 186 (341) T ss_pred CCCCHHHHHHHHHHHHHH---HHCCCEEEEECCCHHHHHHHHHHC-CCCCEEEECCHHHHHHHHCCCCHHHHCCCCHHHH T ss_conf 078999999999999999---849966996465199999999744-9998698385777999741388788748788999 Q ss_pred HHHHHHHCCCEEEECCCCHHHHHHHHHC Q ss_conf 6776643496313124314666576519 Q gi|254780479|r 132 SAIVAIAFNHFAVVVDTNIERIISRYFD 159 (356) Q Consensus 132 ~ai~s~a~~~~~~~vD~Nv~RVl~R~~~ 159 (356) .-..........-..+.+...+...+++ T Consensus 187 ~~L~~~gi~ti~dl~~~~~~~l~~~~~~ 214 (341) T cd00424 187 SRLEALGIIRKCDLLALQPDQLLASLGG 214 (341) T ss_pred HHHHHCCCCCHHHHHHCCHHHHHHHHCC T ss_conf 9999849935999871899999999768 No 222 >COG0529 CysC Adenylylsulfate kinase and related kinases [Inorganic ion transport and metabolism] Probab=35.50 E-value=22 Score=16.20 Aligned_cols=42 Identities=19% Similarity=0.214 Sum_probs=29.3 Q ss_pred HHHHHHHHHHHHHHHH---HHHCCCEEEECCCCHHH-HHHHHHCCC Q ss_conf 8888877787766776---64349631312431466-657651975 Q gi|254780479|r 120 LKKLPGIGDYTASAIV---AIAFNHFAVVVDTNIER-IISRYFDII 161 (356) Q Consensus 120 l~~LpGiG~yta~ai~---s~a~~~~~~~vD~Nv~R-Vl~R~~~~~ 161 (356) +.-|+|-|..|-+-.+ .++-|..+-++||-..| -|+|=+|.+ T Consensus 28 ~TGLSGsGKSTiA~ale~~L~~~G~~~y~LDGDnvR~gL~~dLgFs 73 (197) T COG0529 28 FTGLSGSGKSTIANALEEKLFAKGYHVYLLDGDNVRHGLNRDLGFS 73 (197) T ss_pred EECCCCCCHHHHHHHHHHHHHHCCCEEEEECCHHHHHCCCCCCCCC T ss_conf 6468888787999999999997597589855746765005788978 No 223 >PRK11241 gabD succinate-semialdehyde dehydrogenase I; Provisional Probab=35.50 E-value=32 Score=14.99 Aligned_cols=56 Identities=18% Similarity=0.275 Sum_probs=33.2 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHHCC----------CCC-CCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999740463458899987767543005----------786-8731456888887778776677664 Q gi|254780479|r 82 ILSAWAGLGYYTRARNLKKCADIIVKKY----------EGN-FPHKVEILKKLPGIGDYTASAIVAI 137 (356) Q Consensus 82 vl~~w~gLGyy~Rar~l~~~a~~i~~~~----------~g~-~P~~~~~l~~LpGiG~yta~ai~s~ 137 (356) -.+.|+.+..+.|++-|++.|..+.++- .|+ +-+...|+...--.-+|.|.....+ T Consensus 60 A~~~W~~~~~~~R~~iL~k~a~~l~~~~eela~l~~~E~GK~~~~A~~ev~~~~~~~~~~a~~a~~~ 126 (482) T PRK11241 60 ALPAWRALTAKERANILRRWFNLMMEHQDDLARLMTLEQGKPLAEAKGEISYAASFIEWFAEEGKRI 126 (482) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9899814999999999999999999789999999999839972455348999999999999999984 No 224 >pfam03755 YicC_N YicC-like family, N-terminal region. Family of bacterial proteins. Although poorly characterized, the members of this protein family have been demonstrated to play a role in stationary phase survival. These proteins are not essential during stationary phase. Probab=35.26 E-value=30 Score=15.17 Aligned_cols=34 Identities=15% Similarity=0.186 Sum_probs=26.6 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH Q ss_conf 8899987767543005786873145688888777 Q gi|254780479|r 94 RARNLKKCADIIVKKYEGNFPHKVEILKKLPGIG 127 (356) Q Consensus 94 Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG 127 (356) .++.+....+.|.+.++-..|-+..+++++|||= T Consensus 83 ~~~~~~~~~~~l~~~~~~~~~~~~~~ll~~~~vi 116 (159) T pfam03755 83 LLKAYLAALKELAEELGLAAPISLDDLLRLPGVL 116 (159) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCHHHHHCCCCCE T ss_conf 9999999999999972998999999996598700 No 225 >LOAD_Hrd consensus Probab=35.13 E-value=11 Score=18.39 Aligned_cols=21 Identities=24% Similarity=0.447 Sum_probs=17.7 Q ss_pred CCCCHHHHHHHHHHHHHHHHH Q ss_conf 687314568888877787766 Q gi|254780479|r 112 NFPHKVEILKKLPGIGDYTAS 132 (356) Q Consensus 112 ~~P~~~~~l~~LpGiG~yta~ 132 (356) ..|.|.++|..++|||+.... T Consensus 41 ~~P~t~~eL~~I~Gig~~k~~ 61 (77) T LOAD_Hrd 41 LLPTTVSELLAIDGVGEAKVE 61 (77) T ss_pred HCCCCHHHHCCCCCCCHHHHH T ss_conf 789999998289996999999 No 226 >COG1084 Predicted GTPase [General function prediction only] Probab=34.89 E-value=33 Score=14.92 Aligned_cols=84 Identities=15% Similarity=0.224 Sum_probs=49.7 Q ss_pred HHHHHHCCHHHCCHHHHHHHHHCCCHHHHH---------CCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCH Q ss_conf 467311612033265899877489978987---------29989999997404634588999877675430057868731 Q gi|254780479|r 46 EIMLQQTTVKTVEPYFKKFMQKWPTIFCLS---------SAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPHK 116 (356) Q Consensus 46 eimlqqT~v~~v~~~~~~~~~~~P~~~~la---------~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~ 116 (356) |+-.-+|.-+.|..++.+++++||++..|- -.+.+++-+..+.+ ++|..+ .+.|.++|-+.+. T Consensus 47 e~~rv~t~~~i~~d~l~~iv~~~P~id~LhpFY~eLidvl~d~d~~k~sLs~v---~~A~~~---i~~l~~eYi~~lk-- 118 (346) T COG1084 47 EIRRVKTASNIVRDRLDKIVERFPSLDDLHPFYRELIDVLVDIDHLKISLSAV---SWASKI---IEKLAREYIRLLK-- 118 (346) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCHHHHHHHHHHH---HHHHHH---HHHHHHHHHHHHH-- T ss_conf 99999999999999999999848884326858999999994789999999997---799999---9999999999874-- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 4568888877787766776643496313 Q gi|254780479|r 117 VEILKKLPGIGDYTASAIVAIAFNHFAV 144 (356) Q Consensus 117 ~~~l~~LpGiG~yta~ai~s~a~~~~~~ 144 (356) .+=.+++|+.++--|||.-+. T Consensus 119 -------~a~~~~~~~~lrR~a~GR~aS 139 (346) T COG1084 119 -------AAKDPKEANQLRRQAFGRVAS 139 (346) T ss_pred -------CCCCHHHHHHHHHHHHHHHHH T ss_conf -------279866899999999878999 No 227 >KOG2379 consensus Probab=34.87 E-value=27 Score=15.53 Aligned_cols=36 Identities=22% Similarity=0.283 Sum_probs=19.3 Q ss_pred HHHHHHHHHHHCCCCCCCC-HHHHHHHHHHHHHHHHHHHH Q ss_conf 9987767543005786873-14568888877787766776 Q gi|254780479|r 97 NLKKCADIIVKKYEGNFPH-KVEILKKLPGIGDYTASAIV 135 (356) Q Consensus 97 ~l~~~a~~i~~~~~g~~P~-~~~~l~~LpGiG~yta~ai~ 135 (356) -++++.+.+++. ++|- +..++-++-|+|.+-..-++ T Consensus 38 s~~ka~r~l~~~---~~~l~tl~dl~~ikG~G~~~~~k~~ 74 (501) T KOG2379 38 SLKKALRNLCKA---KDPLMTLPDLSSIKGFGKKWIVKLM 74 (501) T ss_pred CHHHHHHHHHCC---CCCCCCHHHCCCCCCCCCHHHHHHH T ss_conf 678899875316---6752211210010576503688887 No 228 >KOG2705 consensus Probab=34.82 E-value=20 Score=16.48 Aligned_cols=28 Identities=25% Similarity=0.489 Sum_probs=22.1 Q ss_pred HCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 005786873145688888777877667766434 Q gi|254780479|r 107 KKYEGNFPHKVEILKKLPGIGDYTASAIVAIAF 139 (356) Q Consensus 107 ~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~ 139 (356) -+.+|+.|++. ||||||..||+.--+.+ T Consensus 193 ~d~~G~~p~s~-----~p~i~Rf~aG~fYlv~~ 220 (471) T KOG2705 193 LDSDGQPPKSA-----LPSIGRFLAGCFYLVLH 220 (471) T ss_pred ECCCCCCCCCC-----CHHHHHHHCCCHHHHHH T ss_conf 34899997321-----32467672241888888 No 229 >pfam00416 Ribosomal_S13 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes. Probab=34.68 E-value=12 Score=17.96 Aligned_cols=27 Identities=30% Similarity=0.341 Sum_probs=20.1 Q ss_pred CCCCHH---HHHHHHHHHHHHHHHHHHHHH Q ss_conf 687314---568888877787766776643 Q gi|254780479|r 112 NFPHKV---EILKKLPGIGDYTASAIVAIA 138 (356) Q Consensus 112 ~~P~~~---~~l~~LpGiG~yta~ai~s~a 138 (356) .+|.+. -.|..+-|||+++|.+|+.-+ T Consensus 6 di~~~K~v~~ALt~I~GIG~~~A~~Ic~~l 35 (106) T pfam00416 6 NLDGNKKIEIALTYIKGIGRRKANQILKKA 35 (106) T ss_pred CCCCCCEEEEEECCCCCCCHHHHHHHHHHC T ss_conf 799996864441121052899999999991 No 230 >pfam02467 Whib Transcription factor WhiB. WhiB is a putative transcription factor in Actinobacteria, required for differentiation and sporulation. Probab=34.31 E-value=12 Score=18.00 Aligned_cols=19 Identities=26% Similarity=0.759 Sum_probs=15.1 Q ss_pred CCCCCCCCCCCCHHHHHCC Q ss_conf 3564311000001343101 Q gi|254780479|r 203 KPLCPLCPIQKNCLTFSEG 221 (356) Q Consensus 203 ~P~C~~Cpl~~~C~~~~~~ 221 (356) +..|..|||...|..|... T Consensus 29 k~iC~~CpV~~~Cl~~Al~ 47 (66) T pfam02467 29 KRICMRCPVLQECAAHALE 47 (66) T ss_pred HHHHCCCCCHHHHHHHHHH T ss_conf 9998089629999999997 No 231 >PRK10729 nudF ADP-ribose pyrophosphatase NudF; Provisional Probab=34.28 E-value=34 Score=14.86 Aligned_cols=76 Identities=4% Similarity=0.014 Sum_probs=39.4 Q ss_pred EEEEE-EE-CCCEEEEEECCCCHHHHC-----CCCCCCCCCCCCCCHHHHHHCCCCCCCCEECCCEEEEEEEE------E Q ss_conf 20003-41-376057764572013204-----74421111346721588863010100002137358998315------9 Q gi|254780479|r 241 AVFIA-IT-NDNRILLRKRTNTRLLEG-----MDELPGSAWSSTKDGNIDTHSAPFTANWILCNTITHTFTHF------T 307 (356) Q Consensus 241 ~~~vi-i~-~~~kiLL~KRp~~gll~G-----LwEFP~~e~~~~~~~~~~~~~~~~~~~~~~l~~ikH~fTH~------~ 307 (356) .+.|+ ++ ..++++|.+.=--+.+.. |||+|-+..+.+++.......+..++..-..+.+.+..+-+ + T Consensus 51 av~vL~yDp~~d~vvLveQfR~~~~~~~~~~wllEipAG~id~gE~p~~~A~REl~EEtG~~~~~l~~l~~~~~SPG~s~ 130 (202) T PRK10729 51 AAVLLPFDPVRDEVVLIEQIRIAAYDTSETPWLLEMVAGMIEEGESVEDVARREAIEEAGLIVGRTKPVLSYLASPGGTS 130 (202) T ss_pred EEEEEEEECCCCEEEEEEECCCHHCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCC T ss_conf 79999980778969999823512203799963999402033899798999999988875958052899778816897533 Q ss_pred EEEEEEEEE Q ss_conf 999999998 Q gi|254780479|r 308 LTLFVWKTI 316 (356) Q Consensus 308 L~i~v~~~~ 316 (356) =.+++|.+. T Consensus 131 E~i~lf~a~ 139 (202) T PRK10729 131 ERSSIMVGE 139 (202) T ss_pred EEEEEEEEE T ss_conf 159999998 No 232 >PRK02515 psbU photosystem II complex extrinsic protein precursor U; Provisional Probab=33.75 E-value=34 Score=14.80 Aligned_cols=80 Identities=11% Similarity=0.094 Sum_probs=46.8 Q ss_pred HHHHHHHHHCCCCCCC---CHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHH Q ss_conf 8776754300578687---3145688888777877667766434963131243146665765197545735689998741 Q gi|254780479|r 99 KKCADIIVKKYEGNFP---HKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYA 175 (356) Q Consensus 99 ~~~a~~i~~~~~g~~P---~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~ 175 (356) -.+-..+..+||+++- .+...+.++||.+|.-|+-|.+-+ |.-- | .-++.+..-+...++.++... T Consensus 51 n~aD~kl~t~~g~KIDlNNa~vr~f~q~pGmYPtlA~kIv~~~---PY~s----V----eDvl~ipgLse~qK~~l~k~~ 119 (144) T PRK02515 51 NAADAKLATERGEKIDLNNSSVRAFRQFPGMYPTLAGKIVKNA---PYDS----V----EDVLNLPGLSERQKELLEANL 119 (144) T ss_pred CHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCHHHHHHHHHCC---CCCC----H----HHHHHCCCCCHHHHHHHHHHH T ss_conf 5688888887156402562749999868884679999998479---9787----9----999717899999999999865 Q ss_pred HCCCCCCCHHHHHH Q ss_conf 02578522678999 Q gi|254780479|r 176 RKITSTSRPGDFVQ 189 (356) Q Consensus 176 ~~~~~~~~~~~~nQ 189 (356) +.++-..-.-.||+ T Consensus 120 ~~Ftvt~p~~~~~e 133 (144) T PRK02515 120 ENFTVTEPEIALNE 133 (144) T ss_pred CCCEECCCCHHHHC T ss_conf 36634687288863 No 233 >cd07092 ALDH_ABALDH-YdcW Escherichia coli NAD+-dependent gamma-aminobutyraldehyde dehydrogenase YdcW-like. NAD+-dependent, tetrameric, gamma-aminobutyraldehyde dehydrogenase (ABALDH), YdcW of Escherichia coli K12, catalyzes the oxidation of gamma-aminobutyraldehyde to gamma-aminobutyric acid. ABALDH can also oxidize n-alkyl medium-chain aldehydes, but with a lower catalytic efficiency. Probab=33.64 E-value=31 Score=15.09 Aligned_cols=22 Identities=32% Similarity=0.397 Sum_probs=9.4 Q ss_pred HHHCCCHHHHHHHHHHHHHHHH Q ss_conf 9740463458899987767543 Q gi|254780479|r 85 AWAGLGYYTRARNLKKCADIIV 106 (356) Q Consensus 85 ~w~gLGyy~Rar~l~~~a~~i~ 106 (356) .|+.+....|++-|+++|..|. T Consensus 34 ~W~~~~~~eR~~~L~k~a~~l~ 55 (450) T cd07092 34 SWRRTTPAERSKALLKLADAIE 55 (450) T ss_pred HHCCCCHHHHHHHHHHHHHHHH T ss_conf 9744999999999999999999 No 234 >cd07093 ALDH_F8_HMSADH Human aldehyde dehydrogenase family 8 member A1-like. In humans, the aldehyde dehydrogenase family 8 member A1 (ALDH8A1) protein functions to convert 9-cis-retinal to 9-cis-retinoic acid and has a preference for NAD+. Also included in this CD is the 2-hydroxymuconic semialdehyde dehydrogenase (HMSADH) which catalyzes the conversion of 2-hydroxymuconic semialdehyde to 4-oxalocrotonate, a step in the meta cleavage pathway of aromatic hydrocarbons in bacteria. Such HMSADHs seen here are: XylG of the TOL plasmid pWW0 of Pseudomonas putida, TomC of Burkholderia cepacia G4, and AphC of Comamonas testosterone. Probab=33.51 E-value=32 Score=15.06 Aligned_cols=24 Identities=38% Similarity=0.576 Sum_probs=13.3 Q ss_pred HHHHCCCHHHHHHHHHHHHHHHHH Q ss_conf 997404634588999877675430 Q gi|254780479|r 84 SAWAGLGYYTRARNLKKCADIIVK 107 (356) Q Consensus 84 ~~w~gLGyy~Rar~l~~~a~~i~~ 107 (356) +.|+.+..+.|++-|.++|..+.+ T Consensus 33 ~~W~~~~~~eR~~~L~~~a~~l~~ 56 (455) T cd07093 33 PGWSRMSPAERARILHKVADLIEA 56 (455) T ss_pred HHHHCCCHHHHHHHHHHHHHHHHH T ss_conf 988029999999999999999997 No 235 >smart00341 HRDC Helicase and RNase D C-terminal. Hypothetical role in nucleic acid binding. Mutations in the HRDC domain cause human disease. Probab=33.45 E-value=13 Score=17.80 Aligned_cols=21 Identities=29% Similarity=0.523 Sum_probs=17.0 Q ss_pred CCCCHHHHHHHHHHHHHHHHH Q ss_conf 687314568888877787766 Q gi|254780479|r 112 NFPHKVEILKKLPGIGDYTAS 132 (356) Q Consensus 112 ~~P~~~~~l~~LpGiG~yta~ 132 (356) ..|.|.++|.+++|+|++... T Consensus 41 ~~P~~~~~L~~i~g~~~~~~~ 61 (81) T smart00341 41 ALPTNVSELLAIDGVGEEKAR 61 (81) T ss_pred HCCCCHHHHHCCCCCCHHHHH T ss_conf 887999998468999999999 No 236 >cd07090 ALDH_F9_TMBADH NAD+-dependent 4-trimethylaminobutyraldehyde dehydrogenase, ALDH family 9A1. NAD+-dependent, 4-trimethylaminobutyraldehyde dehydrogenase (TMABADH, EC=1.2.1.47), also known as aldehyde dehydrogenase family 9 member A1 (ALDH9A1) in humans, is a cytosolic tetramer which catalyzes the oxidation of gamma-aminobutyraldehyde involved in 4-aminobutyric acid (GABA) biosynthesis and also oxidizes betaine aldehyde (gamma-trimethylaminobutyraldehyde) which is involved in carnitine biosynthesis. Probab=32.96 E-value=34 Score=14.80 Aligned_cols=57 Identities=18% Similarity=0.285 Sum_probs=32.7 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHHCC----------CCC-CCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999740463458899987767543005----------786-87314568888877787766776643 Q gi|254780479|r 82 ILSAWAGLGYYTRARNLKKCADIIVKKY----------EGN-FPHKVEILKKLPGIGDYTASAIVAIA 138 (356) Q Consensus 82 vl~~w~gLGyy~Rar~l~~~a~~i~~~~----------~g~-~P~~~~~l~~LpGiG~yta~ai~s~a 138 (356) ..+.|+-+....|++-|++.|..+.++- .|+ +-+...|+...-..-+|.|+....+. T Consensus 31 A~~~W~~~~~~eR~~~l~k~a~~l~~~~~~la~~~~~E~GK~~~ea~~Ev~~a~~~~~~~a~~~~~~~ 98 (457) T cd07090 31 AQKEWSATSGMERGRILRKAADLLRERNDEIARLETIDNGKPIEEARVDIDSSADCLEYYAGLAPTLS 98 (457) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 99998229999999999999999997899999999997190899999999999999999999999824 No 237 >TIGR01862 N2-ase-Ialpha nitrogenase component I, alpha chain; InterPro: IPR010143 This entry represents all three varieties (Fe-Fe, Mo-Fe and V-Fe) of the component I alpha chain of nitrogenase. Nitrogenase (1.18.6.1 from EC) is the enzyme system responsible for biological nitrogen fixation. Nitrogenase is an oligomeric complex which consists of two components: component 2 is an homodimer of an iron-sulphur protein, while component 1 which contains the active site for the reduction of nitrogen to ammonia exists in three different forms: the molybdenum-iron containing protein (MoFe) is a hetero-tetramer consisting of two pairs of alpha (nifD) and beta (nifK) subunits; the vanadium-iron containing protein (VFe) is a hexamer of two pairs each of alpha (vnfD), beta (vnfK), and delta (vnfG) subunits; the third form seems to only contain iron and is a hexamer composed of alpha (anfD), beta (anfK), and delta (anfG) subunits. The alpha and beta chains of the three types of component 1 are evolutionary related and they are also related to proteins nifE and nifN, which are most probably involved in the iron-molybdenum cofactor biosynthesis .; GO: 0016163 nitrogenase activity, 0051536 iron-sulfur cluster binding, 0009399 nitrogen fixation. Probab=32.18 E-value=5.5 Score=20.41 Aligned_cols=114 Identities=20% Similarity=0.288 Sum_probs=69.8 Q ss_pred HHHHHHCCCHHHHHHHHHHH--------------HHHHHHCCCCCCCCHHHHHHHHHHHHH-HHHHHHHHHHCCCEEEEC Q ss_conf 99997404634588999877--------------675430057868731456888887778-776677664349631312 Q gi|254780479|r 82 ILSAWAGLGYYTRARNLKKC--------------ADIIVKKYEGNFPHKVEILKKLPGIGD-YTASAIVAIAFNHFAVVV 146 (356) Q Consensus 82 vl~~w~gLGyy~Rar~l~~~--------------a~~i~~~~~g~~P~~~~~l~~LpGiG~-yta~ai~s~a~~~~~~~v 146 (356) |+..|.|=|||..-|++|++ |+.+.++|| +| +++.-++|. |++..++.||--=-.+-+ T Consensus 231 V~a~fTGdg~yd~~~~mh~A~LNlV~C~RS~nYIa~~le~rYG--IP-----~~~~dffG~~~~~e~LR~I~~~Fg~~eI 303 (510) T TIGR01862 231 VVATFTGDGTYDEIRLMHKAKLNLVECARSANYIANELEERYG--IP-----WLKIDFFGFEYIAESLRAIAAFFGDSEI 303 (510) T ss_pred EEEECCCCCCHHHHHCCCCCCCCCEEECCHHHHHHHHHHHCCC--CC-----EEEECCCCCHHHHHHHHHHHHHCCCCCH T ss_conf 9775134422266312662110111101405799998553078--75-----3775465703789989999964588542 Q ss_pred CCCHHHHHHHHHC-CCCCCCHHHHHHHHHHHCCCC-CCCHHHHHH-HHHHHHHHHCCCC Q ss_conf 4314666576519-754573568999874102578-522678999-9999986320412 Q gi|254780479|r 147 DTNIERIISRYFD-IIKPAPLYHKTIKNYARKITS-TSRPGDFVQ-AMMDLGALICTSN 202 (356) Q Consensus 147 D~Nv~RVl~R~~~-~~~~~~~~~k~l~~~~~~~~~-~~~~~~~nQ-AlMdlGa~iC~p~ 202 (356) .--.++|++-... +.......+..|+-.---|-. ..++.++.. +.=|||-.|-... T Consensus 304 ~~r~EevIae~~a~w~p~ld~yKerL~GK~v~ly~GG~r~wH~~~~~~~~lG~~VV~~~ 362 (510) T TIGR01862 304 EKRAEEVIAEEIAKWKPELDYYKERLQGKRVCLYVGGSRLWHWIKSLEEDLGVEVVAVG 362 (510) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHCCCEEEEEE T ss_conf 26888877874421041245411311498899975888001111257776394599975 No 238 >pfam00570 HRDC HRDC domain. The HRDC (Helicase and RNase D C-terminal) domain has a putative role in nucleic acid binding. Mutations in the HRDC domain cause human disease. It is interesting to note that the RecQ helicase in Deinococcus radiodurans has three tandem HRDC domains. Probab=31.83 E-value=13 Score=17.74 Aligned_cols=21 Identities=29% Similarity=0.662 Sum_probs=17.9 Q ss_pred CCCCHHHHHHHHHHHHHHHHH Q ss_conf 687314568888877787766 Q gi|254780479|r 112 NFPHKVEILKKLPGIGDYTAS 132 (356) Q Consensus 112 ~~P~~~~~l~~LpGiG~yta~ 132 (356) ..|.|.++|.+++|+|+..+. T Consensus 38 ~~P~s~~~L~~i~g~~~~~~~ 58 (68) T pfam00570 38 KLPRTLEELLRIPGVGPRKLE 58 (68) T ss_pred HCCCCHHHHHCCCCCCHHHHH T ss_conf 784999998089999999999 No 239 >smart00279 HhH2 Helix-hairpin-helix class 2 (Pol1 family) motifs. Probab=31.70 E-value=15 Score=17.35 Aligned_cols=19 Identities=26% Similarity=0.405 Sum_probs=13.9 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 5688888777877667766 Q gi|254780479|r 118 EILKKLPGIGDYTASAIVA 136 (356) Q Consensus 118 ~~l~~LpGiG~yta~ai~s 136 (356) +.+-.+||||+.||.-++. T Consensus 16 DnipGV~GIG~ktA~~ll~ 34 (36) T smart00279 16 DNIPGVKGIGPKTALKLLR 34 (36) T ss_pred CCCCCCCCCCHHHHHHHHH T ss_conf 6889999747899999998 No 240 >PRK09407 gabD2 succinic semialdehyde dehydrogenase; Reviewed Probab=31.64 E-value=37 Score=14.57 Aligned_cols=55 Identities=20% Similarity=0.268 Sum_probs=33.7 Q ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHCCC----------CC-CCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9997404634588999877675430057----------86-8731456888887778776677664 Q gi|254780479|r 83 LSAWAGLGYYTRARNLKKCADIIVKKYE----------GN-FPHKVEILKKLPGIGDYTASAIVAI 137 (356) Q Consensus 83 l~~w~gLGyy~Rar~l~~~a~~i~~~~~----------g~-~P~~~~~l~~LpGiG~yta~ai~s~ 137 (356) .+.|+.+....|++-|++.|..|.++-+ |+ +-+...|.......-+|.|+....+ T Consensus 66 ~~~W~~~~~~eR~~iL~k~ad~l~~~~eela~l~~~E~GK~~~~a~~Ev~~~~~~~~~~a~~a~~~ 131 (523) T PRK09407 66 QRDWAARPVRERAAVFLRFHDLVLENREELMDLIQLETGKARAHAFEEVLDVAINARYYARVAEKL 131 (523) T ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 898821999999999999999999779999999999729179999999999999999999999997 No 241 >PTZ00081 enolase (2-phospho-D-glycerate hydrolase); Provisional Probab=31.58 E-value=8.9 Score=18.95 Aligned_cols=12 Identities=8% Similarity=0.180 Sum_probs=4.8 Q ss_pred CCCHHHHHHHHH Q ss_conf 431466657651 Q gi|254780479|r 147 DTNIERIISRYF 158 (356) Q Consensus 147 D~Nv~RVl~R~~ 158 (356) .--..|-++.+. T Consensus 132 ~~pLy~yl~~l~ 143 (442) T PTZ00081 132 GVPLYRYLAQLA 143 (442) T ss_pred CCCHHHHHHHHH T ss_conf 993999988761 No 242 >KOG0648 consensus Probab=31.50 E-value=37 Score=14.55 Aligned_cols=47 Identities=15% Similarity=0.160 Sum_probs=32.6 Q ss_pred CEEEEEEEEEECCCEEEEEECC-CCHHHHCCCCCCCCCCCCCCCHHHH Q ss_conf 1123200034137605776457-2013204744211113467215888 Q gi|254780479|r 237 MRTGAVFIAITNDNRILLRKRT-NTRLLEGMDELPGSAWSSTKDGNID 283 (356) Q Consensus 237 ~r~~~~~vii~~~~kiLL~KRp-~~gll~GLwEFP~~e~~~~~~~~~~ 283 (356) .+....+.++|.++++|+.+.. .--...|.|-||++.+...++.+.- T Consensus 114 h~vgvg~~V~n~~~eVlVv~e~d~~~~~~~~wK~ptG~v~~~e~i~~g 161 (295) T KOG0648 114 HRVGVGAFVLNKKKEVLVVQEKDGAVKIRGGWKLPTGRVEEGEDIWHG 161 (295) T ss_pred HHEEEEEEEECCCCEEEEEEECCCCEEECCCCCCCCEEECCCCCCHHH T ss_conf 340001367368863678972255323126510561375366540134 No 243 >PHA00619 CRISPR-associated Cas4-like protein Probab=31.50 E-value=25 Score=15.77 Aligned_cols=12 Identities=33% Similarity=0.863 Sum_probs=10.9 Q ss_pred CCCCCCCCCCHH Q ss_conf 643110000013 Q gi|254780479|r 205 LCPLCPIQKNCL 216 (356) Q Consensus 205 ~C~~Cpl~~~C~ 216 (356) .|..||....|- T Consensus 196 ~C~~CP~~~~Cw 207 (210) T PHA00619 196 ECIECPFYNVCW 207 (210) T ss_pred CEEECCCEEEEE T ss_conf 102477101664 No 244 >KOG2171 consensus Probab=31.49 E-value=37 Score=14.55 Aligned_cols=16 Identities=6% Similarity=0.189 Sum_probs=6.5 Q ss_pred CCCCCCCCCHHHHHCC Q ss_conf 4311000001343101 Q gi|254780479|r 206 CPLCPIQKNCLTFSEG 221 (356) Q Consensus 206 C~~Cpl~~~C~~~~~~ 221 (356) |..|-....|..+... T Consensus 584 y~~~~warmc~ilg~~ 599 (1075) T KOG2171 584 YMIAFWARMCRILGDD 599 (1075) T ss_pred HHHHHHHHHHHHHCHH T ss_conf 8899999999996521 No 245 >PRK03352 DNA polymerase IV; Validated Probab=31.27 E-value=32 Score=15.00 Aligned_cols=86 Identities=19% Similarity=0.166 Sum_probs=44.5 Q ss_pred CCHHHHHCCCHHHHHHHHHCCCHH-----HHHHHHHHHHHHHHHCCCCCC---CCHHHHH------HHHHHHHHHHHHHH Q ss_conf 997898729989999997404634-----588999877675430057868---7314568------88887778776677 Q gi|254780479|r 69 PTIFCLSSAKDEEILSAWAGLGYY-----TRARNLKKCADIIVKKYEGNF---PHKVEIL------KKLPGIGDYTASAI 134 (356) Q Consensus 69 P~~~~la~a~~~~vl~~w~gLGyy-----~Rar~l~~~a~~i~~~~~g~~---P~~~~~l------~~LpGiG~yta~ai 134 (356) -+...+|..=.++|.+ | +|.. +.-+-|=|.|--+.+ -+|.+ |++..++ .++||||+-|+.-. T Consensus 118 ~~~~~~a~~ir~~I~~-~--tgl~~SiGIa~nk~LAKiAs~~~K-P~G~~~i~~~~~~~~l~~lpv~~i~GIG~~~~~kL 193 (345) T PRK03352 118 DDPEEVAEEIRAVVLE-E--TGLSCSVGISDNKQRAKIATGFAK-PAGVFRLTDANWMAVMGDRPVDALWGVGPKTAKKL 193 (345) T ss_pred CCHHHHHHHHHHHHHH-H--HCCCCEECCHHHHHHHHHHHHHHH-CCCEEECCHHHHHHHHHCCCCHHCCCCCHHHHHHH T ss_conf 9999999999999999-8--789824304079999999987610-48747448889999985278211278289999999 Q ss_pred HHHHCCCEEEECCCCHHHHHHHHHC Q ss_conf 6643496313124314666576519 Q gi|254780479|r 135 VAIAFNHFAVVVDTNIERIISRYFD 159 (356) Q Consensus 135 ~s~a~~~~~~~vD~Nv~RVl~R~~~ 159 (356) .+++...-.=..+.+... |.+.|| T Consensus 194 ~~~GI~Ti~DL~~~~~~~-L~~~fG 217 (345) T PRK03352 194 AALGITTVADLAATDPAV-LTATFG 217 (345) T ss_pred HHCCCCCHHHHHCCCHHH-HHHHHC T ss_conf 975997199986699999-999978 No 246 >TIGR00372 cas4 CRISPR-associated protein Cas4; InterPro: IPR013343 This entry consists of conserved proteins found in many prokaryotic genomes whose genes are associated with CRISPRs (Clustered, Regularly Interspaced Short Palidromic Repeats). The function of these proteins has not been experimentally determined, but computational analysis suggests that they be nucleases like RecB (IPR004586 from INTERPRO), functioning as part of a hypothetical DNA repair system , . . Probab=30.52 E-value=18 Score=16.82 Aligned_cols=16 Identities=31% Similarity=0.930 Sum_probs=12.8 Q ss_pred CCCCCCCCCCCCCHHH Q ss_conf 2356431100000134 Q gi|254780479|r 202 NKPLCPLCPIQKNCLT 217 (356) Q Consensus 202 ~~P~C~~Cpl~~~C~~ 217 (356) ++++|..|+.+..|.. T Consensus 191 ~~~kC~~C~y~~~C~~ 206 (206) T TIGR00372 191 KSRKCKFCPYREICLP 206 (206) T ss_pred CCCCCCCCCCCCCCCC T ss_conf 8775777555212589 No 247 >pfam11798 IMS_HHH IMS family HHH motif. These proteins are involved in UV protection, eg. Probab=30.48 E-value=14 Score=17.55 Aligned_cols=16 Identities=44% Similarity=0.769 Sum_probs=13.2 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 8888877787766776 Q gi|254780479|r 120 LKKLPGIGDYTASAIV 135 (356) Q Consensus 120 l~~LpGiG~yta~ai~ 135 (356) +..+||||+.|+...- T Consensus 14 i~~i~GIG~~~~~~L~ 29 (33) T pfam11798 14 ISKIPGIGRKTAEKLE 29 (33) T ss_pred CCCCCCCCHHHHHHHH T ss_conf 2216886667899999 No 248 >KOG4233 consensus Probab=30.42 E-value=19 Score=16.59 Aligned_cols=47 Identities=17% Similarity=0.224 Sum_probs=30.0 Q ss_pred HHHHHHHHCCCCCCCHHHHHHHHHH-HHHHHCCCCCCCCCCCCCCCCHHHH Q ss_conf 9998741025785226789999999-9863204123564311000001343 Q gi|254780479|r 169 KTIKNYARKITSTSRPGDFVQAMMD-LGALICTSNKPLCPLCPIQKNCLTF 218 (356) Q Consensus 169 k~l~~~~~~~~~~~~~~~~nQAlMd-lGa~iC~p~~P~C~~Cpl~~~C~~~ 218 (356) |..-.+.+.++-+++-..|-..|.| .||+-|.++ +|..| |++.|..| T Consensus 42 kAYvllGQfLllkKdE~lF~~Wlk~~~gat~~~a~--~~~~C-L~eWc~~F 89 (90) T KOG4233 42 KAYVLLGQFLLLKKDEDLFQEWLKETCGATAKQAQ--DCFNC-LNEWCDCF 89 (90) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHCCCCHHHHH--HHHHH-HHHHHHHH T ss_conf 78999878897226699999999998284488899--99988-99999975 No 249 >pfam10273 WGG Pre-rRNA-processing protein TSR2. This entry represents the central conserved section of a family of proteins described as pre-rRNA-processing protein TSR2. The region has a distinctive WGG motif but the function is unknown. Probab=30.39 E-value=39 Score=14.42 Aligned_cols=12 Identities=17% Similarity=0.448 Sum_probs=4.3 Q ss_pred HHHHHHHHHHHH Q ss_conf 689999999999 Q gi|254780479|r 5 EHIIQSKILDWY 16 (356) Q Consensus 5 ~~~~~~~ll~w~ 16 (356) +..+...++.|| T Consensus 33 ~~~l~~~i~~~f 44 (82) T pfam10273 33 ADWLAGAIVDLF 44 (82) T ss_pred HHHHHHHHHHHH T ss_conf 999999999998 No 250 >PRK13698 plasmid-partitioning protein; Provisional Probab=30.38 E-value=39 Score=14.42 Aligned_cols=50 Identities=8% Similarity=0.060 Sum_probs=25.5 Q ss_pred HHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCC-----C-CCHHHHHHHHHHHCCCC Q ss_conf 88877787766776643496313124314666576519754-----5-73568999874102578 Q gi|254780479|r 122 KLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIK-----P-APLYHKTIKNYARKITS 180 (356) Q Consensus 122 ~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~-----~-~~~~~k~l~~~~~~~~~ 180 (356) ..|-||+-..+- .-++||--+|-.+.+.+-+- + ....-+.+.++++..-| T Consensus 102 Q~PAigRr~~g~---------iEVlDGSRRR~a~il~~~Dy~VlV~el~Deq~~~Lsdv~N~~rp 157 (323) T PRK13698 102 QTPAFGRRVSGV---------IEIADGSRRRKAAILTESDYRVLVGELDDEQMAALSRLGNDYRP 157 (323) T ss_pred CCCCCEEECCCC---------EEEECCHHHHHHHHCCCCCEEEEEEECCHHHHHHHHHHHHCCCC T ss_conf 476210341796---------89803527778776068844899730887999999998750177 No 251 >TIGR00658 orni_carb_tr ornithine carbamoyltransferase; InterPro: IPR002292 The ornithine carbamoyltransferases are a family of enzymes that catalyse the production of citrulline from carbamoyl-phosphate and ornithine . This reaction is part of the arginine biosynthetic pathway, and in some organisms the reaction is also part of the arginine deaminase pathway . In most prokaryotes, and some lower eukaryotes, the enzyme is found in the cytoplasm, but in higher eukaryotes, such as mammals, the enzyme is found in the mitochondrial matrix and functions as part of the urea cycle. In most organisms analysed to date, the enzyme has been shown to consist of a trimer of identical or nonidentical subunits . Extensive similarity has been found within the ornithine carbamoyltransferase family, some of the conserved areas being important in catalysis . The C-terminal region of these enzymes shows a degree of similarity to the aspartate carbamoyltransferases, which are also known to bind carbamoyl- phosphate . This region contains a highly conserved Cys residue (in a His-Cys-Lys-Pro motif) implicated in ornithine binding . ; GO: 0004585 ornithine carbamoyltransferase activity, 0006520 amino acid metabolic process, 0009348 ornithine carbamoyltransferase complex. Probab=30.23 E-value=33 Score=14.88 Aligned_cols=50 Identities=32% Similarity=0.458 Sum_probs=27.0 Q ss_pred CCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHC--------CCCCCCHHHHHHHHHHH Q ss_conf 96313124314666576519754573568999874102--------57852267899999999 Q gi|254780479|r 140 NHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARK--------ITSTSRPGDFVQAMMDL 194 (356) Q Consensus 140 ~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~--------~~~~~~~~~~nQAlMdl 194 (356) |...++=|| -|||||+...-.-=...+..++.+|+. |++..|| .|+|=|| T Consensus 85 G~gEsi~DT--ARVLsRyVD~I~~R~~~h~~v~~lA~yasVPVINgLtD~~HP---cQ~LADl 142 (341) T TIGR00658 85 GRGESIKDT--ARVLSRYVDGIMARVYKHEDVEELAKYASVPVINGLTDLFHP---CQALADL 142 (341) T ss_pred CCCCCCCCH--HHHHCCEEEEEEEECCCHHHHHHHHHHCCCCEECCCCCCCCH---HHHHHHH T ss_conf 368870001--102031565677732553789999854898546055685523---7999998 No 252 >PTZ00205 DNA polymerase kappa; Provisional Probab=30.05 E-value=29 Score=15.34 Aligned_cols=19 Identities=26% Similarity=0.688 Sum_probs=10.2 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 8888877787766776643 Q gi|254780479|r 120 LKKLPGIGDYTASAIVAIA 138 (356) Q Consensus 120 l~~LpGiG~yta~ai~s~a 138 (356) +.++||||+-|+.-..+++ T Consensus 311 vRKIpGIGkVte~~L~alG 329 (571) T PTZ00205 311 LRSVPGVGKVTEALLKGLG 329 (571) T ss_pred CCCCCCCCHHHHHHHHHCC T ss_conf 7668975688899998759 No 253 >TIGR02545 ATP_syn_fliI flagellar protein export ATPase FliI; InterPro: IPR013379 FliI proteins are involved in bacterial flagellum systems, acting to drive protein export for flagellar biosynthesis. The most closely related proteins are the YscN proteins of bacterial type III secretion systems.; GO: 0016887 ATPase activity, 0001539 ciliary or flagellar motility, 0009296 flagellum biogenesis, 0009288 flagellin-based flagellum. Probab=29.46 E-value=26 Score=15.64 Aligned_cols=15 Identities=27% Similarity=0.468 Sum_probs=6.6 Q ss_pred ECCCCHHHHHHHHHCCC Q ss_conf 12431466657651975 Q gi|254780479|r 145 VVDTNIERIISRYFDII 161 (356) Q Consensus 145 ~vD~Nv~RVl~R~~~~~ 161 (356) +||||| ||+|-++-. T Consensus 331 iLDGHI--vLsR~lA~~ 345 (439) T TIGR02545 331 ILDGHI--VLSRKLAER 345 (439) T ss_pred HCCCEE--EECHHHHHC T ss_conf 102207--732267645 No 254 >PRK13473 gamma-aminobutyraldehyde dehydrogenase; Provisional Probab=29.46 E-value=40 Score=14.32 Aligned_cols=56 Identities=20% Similarity=0.277 Sum_probs=32.5 Q ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHCCC----------CC-CCCHH-HHHHHHHHHHHHHHHHHHHHH Q ss_conf 9997404634588999877675430057----------86-87314-568888877787766776643 Q gi|254780479|r 83 LSAWAGLGYYTRARNLKKCADIIVKKYE----------GN-FPHKV-EILKKLPGIGDYTASAIVAIA 138 (356) Q Consensus 83 l~~w~gLGyy~Rar~l~~~a~~i~~~~~----------g~-~P~~~-~~l~~LpGiG~yta~ai~s~a 138 (356) .+.|+-+.+..|+.-|.++|..+.++.+ |+ +.... .|+...-..=+|.|+....+. T Consensus 52 ~~~W~~~~~~~R~~~L~k~a~~l~~~~~~la~~~~~e~GKp~~~a~~~ev~~~i~~~~~~a~~~~~~~ 119 (475) T PRK13473 52 FPEWSQTTPKERAEALLKLADAIEENADEFARLESLNCGKPLHLVLNDEIPAIVDVFRFFAGAARCLE 119 (475) T ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99882599999999999999999978999999999984987999999999999999999998888534 No 255 >KOG1338 consensus Probab=29.29 E-value=40 Score=14.30 Aligned_cols=93 Identities=13% Similarity=0.294 Sum_probs=58.6 Q ss_pred HHHHHCCHHHCCHHHHHHHHHCCCHHHHHCC---CHHHHHHHHHCCCHHH---HHHHHHHHHHHHHHCCCCCCCCHHHHH Q ss_conf 6731161203326589987748997898729---9899999974046345---889998776754300578687314568 Q gi|254780479|r 47 IMLQQTTVKTVEPYFKKFMQKWPTIFCLSSA---KDEEILSAWAGLGYYT---RARNLKKCADIIVKKYEGNFPHKVEIL 120 (356) Q Consensus 47 imlqqT~v~~v~~~~~~~~~~~P~~~~la~a---~~~~vl~~w~gLGyy~---Rar~l~~~a~~i~~~~~g~~P~~~~~l 120 (356) +||.--++.+- ..|...+++||++..+-.. +++|.--+-.|++--. -+.++++--.-++..+++.+|.-+. T Consensus 96 vll~E~~~pq~-SrWrPYfs~wp~p~rm~spifWdEnEl~~Ll~stvlee~~Kd~aeI~~~~i~~i~pf~~~~p~vfs-- 172 (466) T KOG1338 96 VLLREKKMPQK-SRWRPYFSRWPQPARMHSPIFWDENELSMLLCSTVLEETVKDKAEIEKDFIFVIQPFKQHCPIVFS-- 172 (466) T ss_pred HHHHHHHCCCC-CCCCCHHHHCCCHHHCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHC-- T ss_conf 99987505320-234407773798665278755780577777523452266768999999999999999876854103-- Q ss_pred HHHHHHHH--HHHHHHHHHHCCCEEE Q ss_conf 88887778--7766776643496313 Q gi|254780479|r 121 KKLPGIGD--YTASAIVAIAFNHFAV 144 (356) Q Consensus 121 ~~LpGiG~--yta~ai~s~a~~~~~~ 144 (356) .|-.-. |.++-+++.+|+.+.- T Consensus 173 --~~slEdF~y~~Al~laysfdve~~ 196 (466) T KOG1338 173 --RPSLEDFMYAYALGLAYSFDVEFL 196 (466) T ss_pred --CCCHHHHHHHHHHHHHHHEEEEHH T ss_conf --537888999999998883210020 No 256 >cd03312 CIMS_N_terminal_like CIMS - Cobalamine-independent methonine synthase, or MetE, N-terminal domain_like. Many members have been characterized as 5-methyltetrahydropteroyltriglutamate-homocysteine methyltransferases, EC:2.1.1.14, mostly from bacteria and plants. This enzyme catalyses the last step in the production of methionine by transferring a methyl group from 5-methyltetrahydrofolate to L-homocysteine without using an intermediate methyl carrier. The active enzyme has a dual (beta-alpha)8-barrel structure, and this model covers the N-terminal barrel, and a few single-barrel sequences most similar to the N-terminal barrel. It is assumed that the homologous N-terminal barrel has evolved from the C-terminus via gene duplication and has subsequently lost binding sites, and it seems as if the two barrels forming the active enzyme may sometimes reside on different polypeptides. The C-terminal domain incorporates the Zinc ion, which binds and activates homocysteine. Side chains fro Probab=28.78 E-value=11 Score=18.17 Aligned_cols=15 Identities=13% Similarity=0.155 Sum_probs=5.3 Q ss_pred HHHHHHHCCCCCCCC Q ss_conf 767543005786873 Q gi|254780479|r 101 CADIIVKKYEGNFPH 115 (356) Q Consensus 101 ~a~~i~~~~~g~~P~ 115 (356) |-|....+|.=.+|+ T Consensus 108 MtKWFdTNYHYiVPE 122 (360) T cd03312 108 MTKWFDTNYHYIVPE 122 (360) T ss_pred HHCCCCCCCCEECCC T ss_conf 651036776247011 No 257 >TIGR02535 hyp_Hser_kinase proposed homoserine kinase; InterPro: IPR013371 The genes encoding proteins in this entry are largely adjacent to genes involved in the biosynthesis of threonine (aspartate kinase, homoserine dehydrogenase and threonine synthase) in genomes which are lacking any other known homoserine kinase, and in which the presence of a homoserine kinase would indicate a complete pathway for the biosynthesis of threonine. These proteins are homologous to the archaeal form of 2,3-bisphosphoglycerate-independent phosphoglycerate mutase and form part of a superfamily of metalloenzymes including phosphopentomutases, alkaline phosphatases and sulphatases. The proposal that these proteins encode a kinase is based on analogy to phosphomutases which are intramolecular phosphotransferases. A mutase active site could evolve to bring together homoserine and a phosphate donor such as phosphoenolpyruvate, resulting in kinase activity.; GO: 0004619 phosphoglycerate mutase activity. Probab=28.59 E-value=3.4 Score=21.89 Aligned_cols=23 Identities=9% Similarity=0.196 Sum_probs=12.0 Q ss_pred CCCCC-CHHHHHHHHHHHHHHHCC Q ss_conf 57852-267899999999863204 Q gi|254780479|r 178 ITSTS-RPGDFVQAMMDLGALICT 200 (356) Q Consensus 178 ~~~~~-~~~~~nQAlMdlGa~iC~ 200 (356) ++|.. .-.+|=+-||.-.+.|=+ T Consensus 200 lvPeg~~~~~~l~~Li~~S~~iL~ 223 (431) T TIGR02535 200 LVPEGGEEAELLKDLILKSAEILK 223 (431) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHC T ss_conf 577888734899999999999860 No 258 >cd01703 Pol_iota Pol iota is member of the DNA polymerase Y-family, and has also been called Rad30 homolog B. Unlike classic DNA polymerases,Y-family polymerases are induced by DNA damage. They can transverse normal replication-blocking DNA lesions. Unlike Pol eta, Pol iota is unable to replicate through a cis-syn T-T dimer. In human Pol iota, the base-pairing mode in the active site at the replicative end mat bee Hoogsteen instead of Watson-Click. Human Pol iota can incorporate the correct nucleotide opposite a purine much more efficiently than opposite a pyrimidine. Pol iota prefers to insert Guanosine instead of Adenosine opposite Thymidine. Probab=28.51 E-value=23 Score=15.98 Aligned_cols=53 Identities=26% Similarity=0.322 Sum_probs=28.5 Q ss_pred HHHCCCHH-----HHHHHHHHHHHHHHHCCCC---CCCCHHHH-------HHHHHHHHHHHHHHHHHHH Q ss_conf 97404634-----5889998776754300578---68731456-------8888877787766776643 Q gi|254780479|r 85 AWAGLGYY-----TRARNLKKCADIIVKKYEG---NFPHKVEI-------LKKLPGIGDYTASAIVAIA 138 (356) Q Consensus 85 ~w~gLGyy-----~Rar~l~~~a~~i~~~~~g---~~P~~~~~-------l~~LpGiG~yta~ai~s~a 138 (356) .++-|||- +.-+-|=|.|--..+ -+| .+|.+..+ +.++||||+.|+.-+.+.. T Consensus 163 I~~~lGlt~S~GIa~NKlLAKlas~~~K-Pngqt~l~p~~~~~~l~~~~~i~ki~GIG~kt~~~L~~~g 230 (394) T cd01703 163 IENELGLTCCAGIASNKLLAKLVGSVHK-PNQQTTLLPESADHLIMDLGSLRKIPGIGYKTAKKLEDHG 230 (394) T ss_pred HHHHHCCCEEEEECCCHHHHHHHHHCCC-CCCEEEECCCCHHHHHCCCCCHHHHCCCCHHHHHHHHHCC T ss_conf 9998799767766340328987753258-8980597771187784366880552781799999999838 No 259 >cd07108 ALDH_MGR_2402 Magnetospirillum NAD(P)+-dependent aldehyde dehydrogenase MSR-1-like. NAD(P)+-dependent aldehyde dehydrogenase of Magnetospirillum gryphiswaldense MSR-1 (MGR_2402) , and other similar sequences, are present in this CD. Probab=28.17 E-value=39 Score=14.41 Aligned_cols=53 Identities=21% Similarity=0.256 Sum_probs=31.8 Q ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHCCC----------CC-CC-CHHHHHHHHHHHHHHHHHHHH Q ss_conf 9997404634588999877675430057----------86-87-314568888877787766776 Q gi|254780479|r 83 LSAWAGLGYYTRARNLKKCADIIVKKYE----------GN-FP-HKVEILKKLPGIGDYTASAIV 135 (356) Q Consensus 83 l~~w~gLGyy~Rar~l~~~a~~i~~~~~----------g~-~P-~~~~~l~~LpGiG~yta~ai~ 135 (356) -+.|+.+..+.|++-|.++|..|.++.+ |+ +. +...|+...-..=+|+|+... T Consensus 32 ~~~W~~~~~~~R~~il~~~a~~l~~~~~~la~~~~~E~GK~l~~~a~~Ev~~~~~~~~~~a~~a~ 96 (457) T cd07108 32 FPEWAATPARERGKLLARIADALEARSEELARLLALETGNALRTQARPEAAVLADLFRYFGGLAG 96 (457) T ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 89870299999999999999999978999999999870841188999999999999999997788 No 260 >TIGR02061 aprA adenylylsulfate reductase, alpha subunit; InterPro: IPR011803 During dissimilatory sulphate reduction or sulphur oxidation, adenylylsulphate (APS) reductase catalyzes reversibly the two-electron reduction of APS to sulphite and AMP. Found in several bacterial lineages and in Archaeoglobales, APS reductase is a heterodimer composed of an alpha subunit containing a noncovalently bound FAD, and a beta subunit containing two [4Fe-4S] clusters. This entry describes the alpha subunit of APS reductase, which shares a common evolutionary origin with fumarate reductase/succinate dehydrogenase flavoproteins.. Probab=27.94 E-value=33 Score=14.95 Aligned_cols=24 Identities=21% Similarity=0.403 Sum_probs=15.0 Q ss_pred HHHHHHCCCHHHHHCCCHHHHHHHHH Q ss_conf 99877489978987299899999974 Q gi|254780479|r 62 KKFMQKWPTIFCLSSAKDEEILSAWA 87 (356) Q Consensus 62 ~~~~~~~P~~~~la~a~~~~vl~~w~ 87 (356) ..||+ =|.+.||+-++-|+|++|| T Consensus 562 la~le--EDleKLAArdlHELlRaWE 585 (651) T TIGR02061 562 LALLE--EDLEKLAARDLHELLRAWE 585 (651) T ss_pred HHHHH--HHHHHHHHHHHHHHHHHHH T ss_conf 87764--1177887877889998888 No 261 >PTZ00154 40S ribosomal protein S17; Provisional Probab=27.70 E-value=34 Score=14.84 Aligned_cols=29 Identities=34% Similarity=0.518 Sum_probs=24.0 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHHH Q ss_conf 88999877675430057868731456888 Q gi|254780479|r 94 RARNLKKCADIIVKKYEGNFPHKVEILKK 122 (356) Q Consensus 94 Rar~l~~~a~~i~~~~~g~~P~~~~~l~~ 122 (356) |-+.++++|+.|++.|-..|-.|++.=.. T Consensus 5 RtktvKraAr~iiEkYy~~lt~DF~~NK~ 33 (130) T PTZ00154 5 RTKTVKRAARQIVEKYYAKLTLDFQINKK 33 (130) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 00599999999999803141454777479 No 262 >KOG2966 consensus Probab=27.48 E-value=22 Score=16.17 Aligned_cols=11 Identities=36% Similarity=0.863 Sum_probs=5.7 Q ss_pred HHHHHHHHHCC Q ss_conf 99999863204 Q gi|254780479|r 190 AMMDLGALICT 200 (356) Q Consensus 190 AlMdlGa~iC~ 200 (356) -+.++|++||. T Consensus 149 ~v~vl~~~V~l 159 (325) T KOG2966 149 NVIVLGDLVCL 159 (325) T ss_pred EEEHHHHHHHH T ss_conf 11215217876 No 263 >TIGR02418 acolac_catab acetolactate synthase, catabolic; InterPro: IPR012782 Acetolactate synthase is a thiamin pyrophosphate-dependent enzyme that combines two molecules of pyruvate to yield 2-acetolactate with the release of CO2. It exists in two distinct forms that have different properties, though they are clearly related , . The biosynthetic form, found in plants, fungi and bacteria, is involved in the biosynthesis of branched-chain amino acids and contains non-catalytic FAD. The catabolic form is found only in some bacteria, participates in butanediol fermentation, and does not contain FAD. This entry represents the catabolic form of acetolactate synthase, generally found in species that posses acetolactate decarboxylase, which is often encoded in the same operon. The enzyme is thought to be a homotetramer, though this has not conclusively been shown. Each monomer consists of three distinct domains, designated, starting from the N-terminus, as alpha, beta and gamma . The core of the alpha and gamma domains is a six-stranded parallel beta sheet surrounded by alpha helices, while the the beta domain forms a double Rossman fold. The thiamin pyrophosphate cofactor has an unusual conformation that has not been seen in other enzymes using this cofactor, including the biosynthetic form of acetolactate synthase. The active site is located in a funnel at the dimer interface, bordered by residues from all three domains.; GO: 0000287 magnesium ion binding, 0003984 acetolactate synthase activity, 0030976 thiamin pyrophosphate binding, 0050660 FAD binding. Probab=27.40 E-value=17 Score=16.97 Aligned_cols=30 Identities=17% Similarity=0.305 Sum_probs=28.3 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEECCCCHHH Q ss_conf 887778776677664349631312431466 Q gi|254780479|r 123 LPGIGDYTASAIVAIAFNHFAVVVDTNIER 152 (356) Q Consensus 123 LpGiG~yta~ai~s~a~~~~~~~vD~Nv~R 152 (356) =||||.=+.|-.-+=+=|-|+.++=|+|+| T Consensus 75 GPG~sNL~TGl~TAn~EGDPVvAiGG~VkR 104 (553) T TIGR02418 75 GPGVSNLVTGLLTANSEGDPVVAIGGQVKR 104 (553) T ss_pred CCCHHHHHHHHHEECCCCCHHHHCCCCCCH T ss_conf 888144674331015677635411677632 No 264 >cd07146 ALDH_PhpJ Streptomyces putative phosphonoformaldehyde dehydrogenase PhpJ-like. Putative phosphonoformaldehyde dehydrogenase (PhpJ), an aldehyde dehydrogenase homolog reportedly involved in the biosynthesis of phosphinothricin tripeptides in Streptomyces viridochromogenes DSM 40736, and similar sequences are included in this CD. Probab=27.34 E-value=38 Score=14.46 Aligned_cols=26 Identities=27% Similarity=0.323 Sum_probs=18.5 Q ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHC Q ss_conf 99974046345889998776754300 Q gi|254780479|r 83 LSAWAGLGYYTRARNLKKCADIIVKK 108 (356) Q Consensus 83 l~~w~gLGyy~Rar~l~~~a~~i~~~ 108 (356) .+.|+.++...|++-|+++|.++.++ T Consensus 31 ~~a~~~~~~~eR~~~l~~~a~~l~~~ 56 (451) T cd07146 31 ASYRSTLTRYQRSAILNKAAALLEAR 56 (451) T ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHH T ss_conf 45532099999999999999999978 No 265 >COG3600 GepA Uncharacterized phage-associated protein [Function unknown] Probab=27.25 E-value=28 Score=15.44 Aligned_cols=70 Identities=21% Similarity=0.248 Sum_probs=41.0 Q ss_pred HCCCHHHHHHHHHHHHHHHHH------CCCCCCCCHHHHHHHHHHHHHH--HHHHHHHHHCCCEEEECCCCHHHHHHHHH Q ss_conf 404634588999877675430------0578687314568888877787--76677664349631312431466657651 Q gi|254780479|r 87 AGLGYYTRARNLKKCADIIVK------KYEGNFPHKVEILKKLPGIGDY--TASAIVAIAFNHFAVVVDTNIERIISRYF 158 (356) Q Consensus 87 ~gLGyy~Rar~l~~~a~~i~~------~~~g~~P~~~~~l~~LpGiG~y--ta~ai~s~a~~~~~~~vD~Nv~RVl~R~~ 158 (356) +-|=||.-+..|+..-+-+++ +||-.+|.-+.+..+ =|+| |-.+...--++-....+|++|.++|+|+. T Consensus 31 qKLlYyA~~~~L~~~~~pL~~~~ieAW~~GPVip~~Yn~~K~---~Gs~~I~~r~~~~~l~~~~~~~iD~~~s~~L~~Vw 107 (154) T COG3600 31 QKLLYYAHGWFLAVTGRPLFDEKIEAWKHGPVIPSLYNAFKQ---YGSNSIDERLPVRGLSNGNALPIDADVSAILARVW 107 (154) T ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHHCCCCCHHHHHHHHH---CCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHH T ss_conf 999999999999980885632378887618873999999997---08877871122117776204753214999999999 Q ss_pred C Q ss_conf 9 Q gi|254780479|r 159 D 159 (356) Q Consensus 159 ~ 159 (356) . T Consensus 108 ~ 108 (154) T COG3600 108 D 108 (154) T ss_pred H T ss_conf 9 No 266 >COG1468 CRISPR-associated protein Cas4 (RecB family exonuclease) [Defense mechanisms] Probab=26.81 E-value=23 Score=16.03 Aligned_cols=17 Identities=24% Similarity=0.682 Sum_probs=14.8 Q ss_pred CCCCCCCCCCCCCCHHH Q ss_conf 12356431100000134 Q gi|254780479|r 201 SNKPLCPLCPIQKNCLT 217 (356) Q Consensus 201 p~~P~C~~Cpl~~~C~~ 217 (356) ++++.|..|+++..|.. T Consensus 172 ~~~~~C~~C~y~~iC~~ 188 (190) T COG1468 172 KKKKKCKKCAYREICFP 188 (190) T ss_pred CCCCCCCCCCCCEECCC T ss_conf 99886999986334267 No 267 >cd07151 ALDH_HBenzADH NADP+-dependent p-hydroxybenzaldehyde dehydrogenase-like. NADP+-dependent, p-hydroxybenzaldehyde dehydrogenase (PchA, HBenzADH) which catalyzes oxidation of p-hydroxybenzaldehyde to p-hydroxybenzoic acid and other related sequences are included in this CD. Probab=26.44 E-value=45 Score=13.96 Aligned_cols=56 Identities=20% Similarity=0.129 Sum_probs=35.9 Q ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHCC----------CCC-CCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999740463458899987767543005----------786-87314568888877787766776643 Q gi|254780479|r 83 LSAWAGLGYYTRARNLKKCADIIVKKY----------EGN-FPHKVEILKKLPGIGDYTASAIVAIA 138 (356) Q Consensus 83 l~~w~gLGyy~Rar~l~~~a~~i~~~~----------~g~-~P~~~~~l~~LpGiG~yta~ai~s~a 138 (356) .+.|+-+..+.|++-|+++|..|.++. .|+ +.+...|+.....+-+|.|+...... T Consensus 45 ~~~W~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~~e~Gk~~~~a~~ev~~~~~~~~~~a~~~~~~~ 111 (465) T cd07151 45 QKEWAATLPQERAEILEKAAQILEERRDEIVEWLIRESGSTRIKANIEWGAAMAITREAATFPLRME 111 (465) T ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 8987329999999999999999997899999999997299889999999999999999987899831 No 268 >pfam03118 RNA_pol_A_CTD Bacterial RNA polymerase, alpha chain C terminal domain. The alpha subunit of RNA polymerase consists of two independently folded domains, referred to as amino-terminal and carboxyl terminal domains. The amino terminal domain is involved in the interaction with the other subunits of the RNA polymerase. The carboxyl-terminal domain interacts with the DNA and activators. The amino acid sequence of the alpha subunit is conserved in prokaryotic and chloroplast RNA polymerases. There are three regions of particularly strong conservation, two in the amino-terminal and one in the carboxyl- terminal. Probab=26.22 E-value=21 Score=16.29 Aligned_cols=47 Identities=23% Similarity=0.159 Sum_probs=33.7 Q ss_pred HCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 4046345889998776754300578687314568888877787766776 Q gi|254780479|r 87 AGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIV 135 (356) Q Consensus 87 ~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~ 135 (356) +-|+.-.|+.|-.+.+.+-- .+.-+-.+.++|+++|++|+.+-.-|- T Consensus 11 ~~L~LS~R~~N~Lk~~~I~t--v~dL~~~s~~dLl~i~N~G~kSl~EI~ 57 (62) T pfam03118 11 EELELSVRSYNCLKRAGINT--VGDLLSKSEEDLLKIKNFGKKSLEEIK 57 (62) T ss_pred HHHCCCHHHHHHHHHCCCCC--HHHHHHCCHHHHHHCCCCCHHHHHHHH T ss_conf 98168689999998949967--999985899999748898685799999 No 269 >PRK12308 bifunctional argininosuccinate lyase/N-acetylglutamate synthase; Provisional Probab=26.08 E-value=44 Score=14.03 Aligned_cols=23 Identities=22% Similarity=0.225 Sum_probs=9.8 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 78687314568888877787766 Q gi|254780479|r 110 EGNFPHKVEILKKLPGIGDYTAS 132 (356) Q Consensus 110 ~g~~P~~~~~l~~LpGiG~yta~ 132 (356) |..||-|.+..-++=|...-+.+ T Consensus 202 GT~~pIDR~~tA~~LGF~~~~~N 224 (614) T PRK12308 202 GTAYPIDREALAHNLGFRRATRN 224 (614) T ss_pred CCCCCCCHHHHHHHCCCCCCCCC T ss_conf 89898799999997599986678 No 270 >COG1383 RPS17A Ribosomal protein S17E [Translation, ribosomal structure and biogenesis] Probab=25.92 E-value=43 Score=14.13 Aligned_cols=29 Identities=28% Similarity=0.503 Sum_probs=24.4 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHHH Q ss_conf 88999877675430057868731456888 Q gi|254780479|r 94 RARNLKKCADIIVKKYEGNFPHKVEILKK 122 (356) Q Consensus 94 Rar~l~~~a~~i~~~~~g~~P~~~~~l~~ 122 (356) |-++.+.+|+.|.+.|...|-.|++.=.. T Consensus 5 R~~~vKR~a~el~ekY~~~ft~dFe~NK~ 33 (74) T COG1383 5 RPKFVKRTARELIEKYPDKFTDDFETNKR 33 (74) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 25799999999999857775232788899 No 271 >cd07102 ALDH_EDX86601 Uncharacterized aldehyde dehydrogenase of Synechococcus sp. PCC 7335 (EDX86601). Uncharacterized aldehyde dehydrogenase of Synechococcus sp. PCC 7335 (locus EDX86601) and other similar sequences, are present in this CD. Probab=25.90 E-value=46 Score=13.89 Aligned_cols=25 Identities=8% Similarity=0.329 Sum_probs=14.9 Q ss_pred HHHHHCCCHHHHHHHHHHHHHHHHH Q ss_conf 9997404634588999877675430 Q gi|254780479|r 83 LSAWAGLGYYTRARNLKKCADIIVK 107 (356) Q Consensus 83 l~~w~gLGyy~Rar~l~~~a~~i~~ 107 (356) .+.|+-+..+.|++-|++.|..+.+ T Consensus 31 ~~~W~~~~~~eR~~~L~~~a~~l~~ 55 (452) T cd07102 31 QKGWRAVPLEERKAIVTRAVELLAA 55 (452) T ss_pred HHHHCCCCHHHHHHHHHHHHHHHHH T ss_conf 9988029999999999999999996 No 272 >pfam06754 PhnG Phosphonate metabolism protein PhnG. This family consists of several bacterial phosphonate metabolism protein PhnG sequences. In Escherichia coli, the phn operon encodes proteins responsible for the uptake and breakdown of phosphonates. The exact function of PhnG is unknown, however it is thought likely that along with six other proteins PhnG makes up the the C-P (carbon-phosphorus) lyase. Probab=25.85 E-value=44 Score=14.02 Aligned_cols=20 Identities=35% Similarity=0.615 Sum_probs=17.6 Q ss_pred HHHHCCCHHHHHHHHHCCCH Q ss_conf 89872998999999740463 Q gi|254780479|r 72 FCLSSAKDEEILSAWAGLGY 91 (356) Q Consensus 72 ~~la~a~~~~vl~~w~gLGy 91 (356) .-||.|+.+++..+|++|+. T Consensus 11 s~LA~a~~~~L~~~~~~~~~ 30 (147) T pfam06754 11 AVLARAPAAELAALWAALGL 30 (147) T ss_pred HHHCCCCHHHHHHHHHHCCC T ss_conf 99814999999999985588 No 273 >PRK13181 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=25.84 E-value=15 Score=17.37 Aligned_cols=11 Identities=36% Similarity=0.836 Sum_probs=5.9 Q ss_pred HHHHHHHHHHH Q ss_conf 88887778776 Q gi|254780479|r 121 KKLPGIGDYTA 131 (356) Q Consensus 121 ~~LpGiG~yta 131 (356) +-|||||.++. T Consensus 41 lIlPGvG~~~~ 51 (199) T PRK13181 41 VILPGVGAFGQ 51 (199) T ss_pred EECCCCCCCCH T ss_conf 04579865330 No 274 >PRK09457 astD succinylglutamic semialdehyde dehydrogenase; Reviewed Probab=25.79 E-value=46 Score=13.88 Aligned_cols=23 Identities=26% Similarity=0.546 Sum_probs=10.9 Q ss_pred HHHHCCCHHHHHHHHHHHHHHHH Q ss_conf 99740463458899987767543 Q gi|254780479|r 84 SAWAGLGYYTRARNLKKCADIIV 106 (356) Q Consensus 84 ~~w~gLGyy~Rar~l~~~a~~i~ 106 (356) +.|+.+....|++-|++.|..+. T Consensus 39 ~~W~~~~~~eR~~~L~~~a~~l~ 61 (475) T PRK09457 39 PAWARLSFEARQAIVERFAALLE 61 (475) T ss_pred HHHCCCCHHHHHHHHHHHHHHHH T ss_conf 99815999999999999999999 No 275 >KOG3771 consensus Probab=25.68 E-value=47 Score=13.87 Aligned_cols=83 Identities=14% Similarity=0.221 Sum_probs=42.8 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCHHHCCHHHHHHHHHCCCHHHHHCCCHH Q ss_conf 96716899999999999628998875485544456787046766446731161203326589987748997898729989 Q gi|254780479|r 1 MPQPEHIIQSKILDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQKWPTIFCLSSAKDE 80 (356) Q Consensus 1 ~~~~~~~~~~~ll~w~~~~~R~lpwr~~~~~~~~~~~~py~v~vseimlqqT~v~~v~~~~~~~~~~~P~~~~la~a~~~ 80 (356) |+.-...|++.|++-|+- +.+=|... ..+-+-=.++..|.| |+-+++|+-=-.+|+..||+++....-... T Consensus 63 m~~a~~~l~e~l~eiy~p---~~~g~~~l----~~v~~~~d~l~~d~~--~~l~d~vl~pl~~~~~~fpdik~~i~KR~~ 133 (460) T KOG3771 63 MLAASKKLAESLQEIYEP---DWPGRDYL----QAVADNDDLLWKDLD--QKLVDQVLLPLDTYLGQFPDIKKAIAKRGR 133 (460) T ss_pred HHHHHHHHHHHHHHHCCC---CCCHHHHH----HHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHCHHHHHHHHHHCC T ss_conf 999999999999985476---32308999----999987777899998--888876321598763103458999986140 Q ss_pred HHHHHHHCCCHHHHHHHHH Q ss_conf 9999974046345889998 Q gi|254780479|r 81 EILSAWAGLGYYTRARNLK 99 (356) Q Consensus 81 ~vl~~w~gLGyy~Rar~l~ 99 (356) .++ =|.|+||=. T Consensus 134 Kl~-------DyD~~r~~~ 145 (460) T KOG3771 134 KLV-------DYDSARHSF 145 (460) T ss_pred HHH-------HHHHHHHHH T ss_conf 221-------268788899 No 276 >cd07098 ALDH_F15-22 Aldehyde dehydrogenase family 15A1 and 22A1-like. Aldehyde dehydrogenase family members ALDH15A1 (Saccharomyces cerevisiae YHR039C) and ALDH22A1 (Arabidopsis thaliana, EC=1.2.1.3), and similar sequences, are in this CD. Significant improvement of stress tolerance in tobacco plants was observed by overexpressing the ALDH22A1 gene from maize (Zea mays) and was accompanied by a reduction of malondialdehyde derived from cellular lipid peroxidation. Probab=25.52 E-value=47 Score=13.85 Aligned_cols=26 Identities=19% Similarity=0.410 Sum_probs=17.4 Q ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHC Q ss_conf 99974046345889998776754300 Q gi|254780479|r 83 LSAWAGLGYYTRARNLKKCADIIVKK 108 (356) Q Consensus 83 l~~w~gLGyy~Rar~l~~~a~~i~~~ 108 (356) .+.|+-+..+.|+.-|+++|..+.++ T Consensus 31 ~~~W~~~~~~~R~~~L~~~a~~l~~~ 56 (465) T cd07098 31 QREWAKTSFAERRKVLRSLLKYILEN 56 (465) T ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHH T ss_conf 99983299999999999999999978 No 277 >TIGR02923 AhaC ATP synthase A1, C subunit; InterPro: IPR014272 A-ATPases (or A1A0-ATPase) (3.6.3.14 from EC) are found exclusively in Archaea and display a close resemblance in structure and subunit composition to V-ATPases, although their function in both ATP synthesis and ATP hydrolysis is closer to that of F-ATPases . A-ATPases are composed of two linked complexes: the A1 complex, consisting of seven subunits, contains the catalytic core that synthesizes/hydrolyses ATP, while the A0 complex, consisting of at least two subunits, forms the membrane-spanning pore . The rotary motor in A-ATPases is composed of only two subunits, the stator subunit I and the rotor subunit C . A-ATPases may have arisen as an adaptation to the different cellular needs and the more extreme environmental conditions faced by Archaeal species. This entry represents subunit C of the A1 component. More information about this protein can be found at Protein of the Month: ATP Synthases .. Probab=25.31 E-value=46 Score=13.92 Aligned_cols=19 Identities=32% Similarity=0.305 Sum_probs=9.1 Q ss_pred HHHHHHHHHHHHCCHHHCC Q ss_conf 4676644673116120332 Q gi|254780479|r 40 YKVWISEIMLQQTTVKTVE 58 (356) Q Consensus 40 y~v~vseimlqqT~v~~v~ 58 (356) |.-.+||+.-|-+-|.-|. T Consensus 47 Y~~~~d~l~~~~~g~~~~E 65 (353) T TIGR02923 47 YKKELDELGSKYEGVDLVE 65 (353) T ss_pred HHHHHHHHHCCCCCCHHHH T ss_conf 3789988621267844899 No 278 >pfam02961 BAF Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration. Probab=25.19 E-value=16 Score=17.15 Aligned_cols=74 Identities=19% Similarity=0.288 Sum_probs=41.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHH Q ss_conf 73145688888777877667766434963131243146665765197545735689998741025785226789999999 Q gi|254780479|r 114 PHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTSRPGDFVQAMMD 193 (356) Q Consensus 114 P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~~~~~~~nQAlMd 193 (356) |-.-.....|||||+--+.-.-.-+|++ .---+-+.|+-..+-..|-.-|+| T Consensus 15 PMG~K~V~~l~GIG~~lg~~L~~kGfdK----------------------------AyvvLGqfLvl~kde~~F~~Wlk~ 66 (89) T pfam02961 15 PMGDKPVTALAGIGEVLGGRLEDKGFDK----------------------------AYVVLGQFLLLKKDEELFKEWLKD 66 (89) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHCCCCH----------------------------HHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 7778874505786289988999848539----------------------------999999999856649999999999 Q ss_pred H-HHHHCCCCCCCCCCCCCCCCHHHH Q ss_conf 9-863204123564311000001343 Q gi|254780479|r 194 L-GALICTSNKPLCPLCPIQKNCLTF 218 (356) Q Consensus 194 l-Ga~iC~p~~P~C~~Cpl~~~C~~~ 218 (356) . ||.-... -+|-.| |++.|.+| T Consensus 67 ~~gAn~kQa--~dcy~c-LkeWCd~F 89 (89) T pfam02961 67 TCGANAKQA--RDCYNC-LKEWCSCF 89 (89) T ss_pred HCCCCHHHH--HHHHHH-HHHHHHCC T ss_conf 808888899--889999-99998629 No 279 >cd04663 Nudix_Hydrolase_6 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belong to this superfamily requires a divalent cation, such as Mg2+ or Mn2+ for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, U=I, L or V) which functions as metal binding and catalytic site. Substrates of nudix hydrolase include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate specificity are Probab=25.09 E-value=48 Score=13.79 Aligned_cols=36 Identities=6% Similarity=-0.032 Sum_probs=24.6 Q ss_pred CEEEEEECCCCHHHHCCCCCCCCCCCCCCCHHHHHHCCCCC Q ss_conf 60577645720132047442111134672158886301010 Q gi|254780479|r 250 NRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFT 290 (356) Q Consensus 250 ~kiLL~KRp~~gll~GLwEFP~~e~~~~~~~~~~~~~~~~~ 290 (356) .++|+.+-|. +| |++|.+..+.++........+... T Consensus 14 ~elL~f~hp~----~g-~q~p~G~ve~gE~~~~aa~RE~~E 49 (126) T cd04663 14 LELLVFEHPL----AG-FQIVKGTVEPGETPEAAALRELQE 49 (126) T ss_pred EEEEEEECCC----CC-EECCCCCCCCCCCHHHHHHHHHHH T ss_conf 6899981788----85-476576758998989999999888 No 280 >pfam06424 PRP1_N PRP1 splicing factor, N-terminal. This domain is specific to the N-terminal part of the prp1 splicing factor, which is involved in mRNA splicing (and possibly also poly(A)+ RNA nuclear export and cell cycle progression). This domain is specific to the N terminus of the RNA splicing factor encoded by prp1. It is involved in mRNA splicing and possibly also poly(A)and RNA nuclear export and cell cycle progression. Probab=25.02 E-value=16 Score=17.08 Aligned_cols=16 Identities=31% Similarity=0.549 Sum_probs=12.3 Q ss_pred CHHHHHHHHHHHHHHH Q ss_conf 3145688888777877 Q gi|254780479|r 115 HKVEILKKLPGIGDYT 130 (356) Q Consensus 115 ~~~~~l~~LpGiG~yt 130 (356) -+.++|.+||-||+|| T Consensus 115 vt~eeW~~IPE~GD~t 130 (132) T pfam06424 115 VTEDEWANIPEAGDYT 130 (132) T ss_pred CCHHHHHCCCCCCCCC T ss_conf 8899997188532225 No 281 >cd02027 APSK Adenosine 5'-phosphosulfate kinase (APSK) catalyzes the phosphorylation of adenosine 5'-phosphosulfate to form 3'-phosphoadenosine 5'-phosphosulfate (PAPS). The end-product PAPS is a biologically "activated" sulfate form important for the assimilation of inorganic sulfate. Probab=24.79 E-value=47 Score=13.83 Aligned_cols=32 Identities=22% Similarity=0.225 Sum_probs=24.8 Q ss_pred HHHHHHHHHHHHHHHH---HHCCCEEEECCCCHHH Q ss_conf 8888777877667766---4349631312431466 Q gi|254780479|r 121 KKLPGIGDYTASAIVA---IAFNHFAVVVDTNIER 152 (356) Q Consensus 121 ~~LpGiG~yta~ai~s---~a~~~~~~~vD~Nv~R 152 (356) .-|||-|..|-+-.+. ...+.++..+||...| T Consensus 5 tGLsgsGKTTlA~~l~~~L~~~~~~~~~lDGD~iR 39 (149) T cd02027 5 TGLSGSGKSTIARALEEKLFQRGRPVYVLDGDNVR 39 (149) T ss_pred ECCCCCCHHHHHHHHHHHHHHCCCCEEEECCHHHH T ss_conf 79999999999999999999869975997748899 No 282 >cd03740 SOCS_SOCS6 SOCS (suppressors of cytokine signaling) box of SOCS6-like proteins. Together with CIS1, the CIS/SOCS family of proteins is characterized by the presence of a C-terminal SOCS box and a central SH2 domain. The general function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions. Probab=24.67 E-value=45 Score=14.00 Aligned_cols=22 Identities=23% Similarity=0.298 Sum_probs=17.2 Q ss_pred EECHHHHHHCCCCHHHHHHHHH Q ss_conf 3227888747998789999984 Q gi|254780479|r 328 WHDAQNLANAALPTVMKKALSA 349 (356) Q Consensus 328 Wv~~~el~~~~LPs~~kKIL~a 349 (356) -+..+.|..+|||+.++..|+. T Consensus 17 ~vr~d~I~~LPLP~~l~~Yl~e 38 (41) T cd03740 17 YTRIDLIQKLPLPNKMKGYLLE 38 (41) T ss_pred HHHHHHHHHCCCCHHHHHHHHC T ss_conf 7323001107996889998752 No 283 >cd07116 ALDH_ACDHII-AcoD Ralstonia eutrophus NAD+-dependent acetaldehyde dehydrogenase II-like. Included in this CD is the NAD+-dependent, acetaldehyde dehydrogenase II (AcDHII, AcoD, EC=1.2.1.3) from Ralstonia (Alcaligenes) eutrophus H16 involved in the catabolism of acetoin and ethanol, and similar proteins, such as, the dimeric dihydrolipoamide dehydrogenase of the acetoin dehydrogenase enzyme system of Klebsiella pneumonia. Also included are sequences similar to the NAD+-dependent chloroacetaldehyde dehydrogenases (AldA and AldB) of Xanthobacter autotrophicus GJ10 which are involved in the degradation of 1,2-dichloroethane. These proteins apparently require RpoN factors for expression. Probab=24.58 E-value=49 Score=13.73 Aligned_cols=27 Identities=30% Similarity=0.335 Sum_probs=20.6 Q ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 999740463458899987767543005 Q gi|254780479|r 83 LSAWAGLGYYTRARNLKKCADIIVKKY 109 (356) Q Consensus 83 l~~w~gLGyy~Rar~l~~~a~~i~~~~ 109 (356) .+.|+-+..+.|++-|+++|..+.++. T Consensus 51 ~~~W~~~~~~eR~~~l~~~a~~l~~~~ 77 (479) T cd07116 51 KEAWGKTSVAERANILNKIADRMEANL 77 (479) T ss_pred HHHHCCCCHHHHHHHHHHHHHHHHHCH T ss_conf 998804999999999999999999789 No 284 >cd02037 MRP-like MRP (Multiple Resistance and pH adaptation) is a homologue of the Fer4_NifH superfamily. Like the other members of the superfamily, MRP contains a ATP-binding domain at the N-termini. It is found in bacteria as a membrane-spanning protein and functions as a Na+/H+ antiporter. Probab=24.26 E-value=40 Score=14.33 Aligned_cols=27 Identities=26% Similarity=0.402 Sum_probs=23.0 Q ss_pred HHHHHHHHHHHHHHHC---CCEEEECCCCH Q ss_conf 8777877667766434---96313124314 Q gi|254780479|r 124 PGIGDYTASAIVAIAF---NHFAVVVDTNI 150 (356) Q Consensus 124 pGiG~yta~ai~s~a~---~~~~~~vD~Nv 150 (356) .|||..|-++-++.++ |..+.++|..+ T Consensus 9 ggvgkst~~~~la~~l~~~g~~v~~~d~di 38 (169) T cd02037 9 GGVGKSTVAVNLALALAKLGYKVGLLDADI 38 (169) T ss_pred CCCCHHHHHHHHHHHHHHCCCCEEEEEEEC T ss_conf 988199999999999998799789997137 No 285 >TIGR02291 rimK_rel_E_lig alpha-L-glutamate ligase homolog; InterPro: IPR011758 Members of this protein family contain a region of homology to the RimK family of alpha-L-glutamate ligases (IPR004666 from INTERPRO), various members of which modify the Glu-Glu C-terminus of ribosomal protein S6, or tetrahydromethanopterin, or a form of coenzyme F420 derivative. Members of this family are found so far in various Vibrio and Pseudomonas species and some other gamma and beta Proteobacteria. The function is unknown.. Probab=24.16 E-value=33 Score=14.92 Aligned_cols=13 Identities=31% Similarity=0.570 Sum_probs=8.5 Q ss_pred CCHHHHCCCCCCC Q ss_conf 2013204744211 Q gi|254780479|r 259 NTRLLEGMDELPG 271 (356) Q Consensus 259 ~~gll~GLwEFP~ 271 (356) =+.+|+|||++=+ T Consensus 119 vSniL~GLySLGG 131 (320) T TIGR02291 119 VSNILAGLYSLGG 131 (320) T ss_pred HHHHHHHHHHCCC T ss_conf 8888877765079 No 286 >TIGR02684 dnstrm_HI1420 probable addiction module antidote protein; InterPro: IPR014057 Members of this strictly bacterial protein family are small, at roughly 100 amino acids. The gene is almost invariably the downstream member of a gene pair. It is a predicted DNA-binding protein from a clade within the helix-turn-helix family IPR001387 from INTERPRO. These gene pairs, when found on the bacterial chromosome, are located often with prophage regions, but also both in integrated plasmid regions and in housekeeping gene regions. Analysis suggests that the gene pair may serve as an addiction module.. Probab=24.11 E-value=50 Score=13.67 Aligned_cols=30 Identities=30% Similarity=0.376 Sum_probs=23.6 Q ss_pred HHHCC--CHHHHHHHHHCCCHHHHHHHHHHHHHHH Q ss_conf 98729--9899999974046345889998776754 Q gi|254780479|r 73 CLSSA--KDEEILSAWAGLGYYTRARNLKKCADII 105 (356) Q Consensus 73 ~la~a--~~~~vl~~w~gLGyy~Rar~l~~~a~~i 105 (356) .|... |-..|..+ |||=.|||.|-+.|+.. T Consensus 24 a~e~gGdDp~~~~~A---LG~iArArGMtqlA~~t 55 (91) T TIGR02684 24 ALEDGGDDPALIAAA---LGVIARARGMTQLAKKT 55 (91) T ss_pred HHHCCCCCHHHHHHH---HHHHHHHCCHHHHHHHH T ss_conf 974369998899999---88998651657899982 No 287 >COG3100 Uncharacterized protein conserved in bacteria [Function unknown] Probab=24.04 E-value=43 Score=14.12 Aligned_cols=20 Identities=30% Similarity=0.415 Sum_probs=8.2 Q ss_pred HHHCCCHHHHHHHHHCCCHH Q ss_conf 98729989999997404634 Q gi|254780479|r 73 CLSSAKDEEILSAWAGLGYY 92 (356) Q Consensus 73 ~la~a~~~~vl~~w~gLGyy 92 (356) .||+|+.++|+...+--||| T Consensus 61 ~L~n~dv~kV~~~i~~QGfy 80 (103) T COG3100 61 KLVNADVEKVKQAIEEQGFY 80 (103) T ss_pred HHHHHHHHHHHHHHHHCCEE T ss_conf 88785499999999865546 No 288 >COG0099 RpsM Ribosomal protein S13 [Translation, ribosomal structure and biogenesis] Probab=23.99 E-value=37 Score=14.60 Aligned_cols=27 Identities=30% Similarity=0.347 Sum_probs=20.4 Q ss_pred CCCCHH---HHHHHHHHHHHHHHHHHHHHH Q ss_conf 687314---568888877787766776643 Q gi|254780479|r 112 NFPHKV---EILKKLPGIGDYTASAIVAIA 138 (356) Q Consensus 112 ~~P~~~---~~l~~LpGiG~yta~ai~s~a 138 (356) .+|.+. -.|..+.|||.-+|.+|+.-+ T Consensus 8 dip~~K~v~iALt~IyGIG~~~a~~I~~~~ 37 (121) T COG0099 8 DIPGNKRVVIALTYIYGIGRRRAKEICKKA 37 (121) T ss_pred CCCCCCEEEEHHHHHCCCCHHHHHHHHHHC T ss_conf 799982576504630353699999999991 No 289 >cd07147 ALDH_F21_RNP123 Aldehyde dehydrogenase family 21A1-like. Aldehyde dehydrogenase ALDH21A1 (gene name RNP123) was first described in the moss Tortula ruralis and is believed to play an important role in the detoxification of aldehydes generated in response to desiccation- and salinity-stress, and ALDH21A1 expression represents a unique stress tolerance mechanism. So far, of plants, only the bryophyte sequence has been observed, but similar protein sequences from bacteria and archaea are also present in this CD. Probab=23.88 E-value=50 Score=13.64 Aligned_cols=21 Identities=24% Similarity=0.238 Sum_probs=8.6 Q ss_pred HHHCCCHHHHHHHHHHHHHHH Q ss_conf 974046345889998776754 Q gi|254780479|r 85 AWAGLGYYTRARNLKKCADII 105 (356) Q Consensus 85 ~w~gLGyy~Rar~l~~~a~~i 105 (356) .|+.+....|++-|+++|..+ T Consensus 36 ~W~~~~~~eR~~~l~~~a~~l 56 (452) T cd07147 36 PMRALPAHRRAAILLHCVARL 56 (452) T ss_pred HHHCCCHHHHHHHHHHHHHHH T ss_conf 982599999999999999999 No 290 >TIGR00608 radc DNA repair protein RadC; InterPro: IPR001405 RadC is a DNA repair protein found in many bacteria. They share a region of similarity at the C-terminus that contains two perfectly conserved histidine residues.; GO: 0006281 DNA repair. Probab=23.86 E-value=49 Score=13.69 Aligned_cols=42 Identities=24% Similarity=0.320 Sum_probs=28.5 Q ss_pred HHHHHHHHHHHCCC--------CCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99877675430057--------8687314568888877787766776643 Q gi|254780479|r 97 NLKKCADIIVKKYE--------GNFPHKVEILKKLPGIGDYTASAIVAIA 138 (356) Q Consensus 97 ~l~~~a~~i~~~~~--------g~~P~~~~~l~~LpGiG~yta~ai~s~a 138 (356) .-...|+.+.+.+| +-.-.++++|.+.||||.=-|--|.+++ T Consensus 34 dVl~Lsk~lL~~fG~~~~d~l~~L~~~~~~~L~~v~GiG~ak~~qlka~~ 83 (223) T TIGR00608 34 DVLSLSKELLKQFGQEELDSLDLLLSAEPEELSSVPGIGLAKAIQLKAIV 83 (223) T ss_pred CHHHHHHHHHHHHCHHCCCCHHHHHCCCHHHHHCCCCCHHHHHHHHHHHH T ss_conf 62678999998707000130014321585785046885088999999999 No 291 >PRK00558 uvrC excinuclease ABC subunit C; Validated Probab=23.76 E-value=50 Score=13.62 Aligned_cols=23 Identities=17% Similarity=0.206 Sum_probs=13.7 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 99999999863204123564311 Q gi|254780479|r 187 FVQAMMDLGALICTSNKPLCPLC 209 (356) Q Consensus 187 ~nQAlMdlGa~iC~p~~P~C~~C 209 (356) ...+|=+|....=.++.|.=-+| T Consensus 376 ~~~~l~~L~~~L~L~~~p~rIE~ 398 (609) T PRK00558 376 QTAALEELAEALGLPTLPKRIEC 398 (609) T ss_pred HHHHHHHHHHHHCCCCCCCEEEE T ss_conf 99999999998388998878999 No 292 >pfam00286 Flexi_CP Viral coat protein. Family includes coat proteins from Potexviruses and carlaviruses. Probab=23.75 E-value=50 Score=13.62 Aligned_cols=78 Identities=23% Similarity=0.269 Sum_probs=38.6 Q ss_pred HHHHHHHHCCCHHH--HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHH Q ss_conf 99999974046345--8899987767543005786873145688888777877667766434963131243146665765 Q gi|254780479|r 80 EEILSAWAGLGYYT--RARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRY 157 (356) Q Consensus 80 ~~vl~~w~gLGyy~--Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~ 157 (356) ..|...|+|||--. -+.-+...|....+.-.+.+-.=...-...+|+-+..+. .++ +-+.-..|-++|+ T Consensus 11 ~~I~~~~~~lGvp~~~v~~~~~~la~~C~d~gSS~~~~~~G~~~~~~g~~~~~~a--~~v-------i~~~~TLRqfC~~ 81 (140) T pfam00286 11 VAIAALWQGLGVPAEHVAKVAIDLARACADVGSSRSTDLTGTSEWPPGITRADLA--LAV-------VKAHATLRQFCRY 81 (140) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHH--HHH-------HHCCCCHHHHHHH T ss_conf 9999999983998889999999999997077887545527867889996489999--999-------9807989999999 Q ss_pred HCCCCCCCHHHHHHHHHH Q ss_conf 197545735689998741 Q gi|254780479|r 158 FDIIKPAPLYHKTIKNYA 175 (356) Q Consensus 158 ~~~~~~~~~~~k~l~~~~ 175 (356) |+ |.+|..+ T Consensus 82 yA---------kvvWN~m 90 (140) T pfam00286 82 YA---------KVVWNYM 90 (140) T ss_pred HH---------HHHHHHH T ss_conf 83---------9999988 No 293 >pfam01583 APS_kinase Adenylylsulphate kinase. Enzyme that catalyses the phosphorylation of adenylylsulphate to 3'-phosphoadenylylsulfate. This domain contains an ATP binding P-loop motif. Probab=23.75 E-value=46 Score=13.94 Aligned_cols=32 Identities=25% Similarity=0.242 Sum_probs=25.1 Q ss_pred HHHHHHHHHHHHHHHH---HHCCCEEEECCCCHHH Q ss_conf 8888777877667766---4349631312431466 Q gi|254780479|r 121 KKLPGIGDYTASAIVA---IAFNHFAVVVDTNIER 152 (356) Q Consensus 121 ~~LpGiG~yta~ai~s---~a~~~~~~~vD~Nv~R 152 (356) .-|||-|..|-+.-+. ...+.++..+||...| T Consensus 8 TGLsGsGKTTlA~~l~~~L~~~~~~~~~LDGD~~R 42 (157) T pfam01583 8 TGLSGSGKSTIANALERKLFAQGISVYVLDGDNVR 42 (157) T ss_pred ECCCCCCHHHHHHHHHHHHHHCCCCEEEECCHHHH T ss_conf 89899999999999999999759977997688775 No 294 >COG0634 Hpt Hypoxanthine-guanine phosphoribosyltransferase [Nucleotide transport and metabolism] Probab=23.46 E-value=51 Score=13.58 Aligned_cols=106 Identities=21% Similarity=0.181 Sum_probs=58.9 Q ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCC--HHHHH-- Q ss_conf 89998776754300578687314568888877787766776643496313124314666576519754573--56899-- Q gi|254780479|r 95 ARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAP--LYHKT-- 170 (356) Q Consensus 95 ar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~--~~~k~-- 170 (356) +..+.++|++|.++|+|+.|-- +--|.|-=+.+|.-+..|.+....--++ .|++.+-...+. ...+. T Consensus 18 ~~ri~ela~~I~~~y~g~~~~v---v~iLkGs~~F~~dL~~~i~~~~e~dFm~------vSSYg~~t~ssg~v~i~kDld 88 (178) T COG0634 18 KARIKELAAQITEDYGGKDPLV---VGVLKGSFPFMADLIRAIDFPLEVDFMH------VSSYGGGTSSSGEVKILKDLD 88 (178) T ss_pred HHHHHHHHHHHHHHHCCCCEEE---EEECCCCHHHHHHHHHHCCCCCEEEEEE------EECCCCCCCCCCCEEEECCCC T ss_conf 9999999999977627996199---9974664530999998559984167899------740478763477168853566 Q ss_pred --HH----HHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCC-CCCCCCCCCHH Q ss_conf --98----741025785226789999999986320412356-43110000013 Q gi|254780479|r 171 --IK----NYARKITSTSRPGDFVQAMMDLGALICTSNKPL-CPLCPIQKNCL 216 (356) Q Consensus 171 --l~----~~~~~~~~~~~~~~~nQAlMdlGa~iC~p~~P~-C~~Cpl~~~C~ 216 (356) +. .+.+.+++. |.--+.++|+ |+-++|+ |..|-|-+.+. T Consensus 89 ~di~grdVLiVeDIiDs---G~TLs~i~~~----l~~r~a~sv~i~tLldK~~ 134 (178) T COG0634 89 EDIKGRDVLIVEDIIDS---GLTLSKVRDL----LKERGAKSVRIATLLDKPE 134 (178) T ss_pred CCCCCCEEEEEECCCCC---CHHHHHHHHH----HHHCCCCEEEEEEEEECCC T ss_conf 67789849998521036---7559999999----9756997689999860765 No 295 >cd07559 ALDH_ACDHII_AcoD-like Ralstonia eutrophus NAD+-dependent acetaldehyde dehydrogenase II and Staphylococcus aureus AldA1 (SACOL0154)-like. Included in this CD is the NAD+-dependent, acetaldehyde dehydrogenase II (AcDHII, AcoD, EC=1.2.1.3) from Ralstonia (Alcaligenes) eutrophus H16 involved in the catabolism of acetoin and ethanol, and similar proteins, such as, the dimeric dihydrolipoamide dehydrogenase of the acetoin dehydrogenase enzyme system of Klebsiella pneumonia. Also included are sequences similar to the NAD+-dependent chloroacetaldehyde dehydrogenases (AldA and AldB) of Xanthobacter autotrophicus GJ10 which are involved in the degradation of 1,2-dichloroethane, as well as, the uncharacterized aldehyde dehydrogenase from Staphylococcus aureus (AldA1, locus SACOL0154) and other similar sequences. Probab=23.39 E-value=51 Score=13.57 Aligned_cols=58 Identities=24% Similarity=0.204 Sum_probs=33.9 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHHCC----------CCCCC-CHH-HHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999740463458899987767543005----------78687-314-568888877787766776643 Q gi|254780479|r 81 EILSAWAGLGYYTRARNLKKCADIIVKKY----------EGNFP-HKV-EILKKLPGIGDYTASAIVAIA 138 (356) Q Consensus 81 ~vl~~w~gLGyy~Rar~l~~~a~~i~~~~----------~g~~P-~~~-~~l~~LpGiG~yta~ai~s~a 138 (356) +-.+.|+-++.+.|++-|+++|..+.++- .|+-. +.. .|+..--..-+|.|+...... T Consensus 49 ~A~~~W~~~~~~eR~~~L~~~a~~l~~~~~ela~~~~~E~GK~~~~a~~~ev~~a~~~~~~~a~~~~~~~ 118 (480) T cd07559 49 EAFKTWGKTSVAERANILNKIADRIEENLELLAVAETLDNGKPIRETLAADIPLAIDHFRYFAGVIRAQE 118 (480) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9999980499999999999999999968999999999970923888888799999999999999998705 No 296 >KOG1362 consensus Probab=23.35 E-value=15 Score=17.36 Aligned_cols=16 Identities=13% Similarity=-0.045 Sum_probs=9.6 Q ss_pred HHHHHHHHHHHHCCHH Q ss_conf 4676644673116120 Q gi|254780479|r 40 YKVWISEIMLQQTTVK 55 (356) Q Consensus 40 y~v~vseimlqqT~v~ 55 (356) ..-+.-++++|+|... T Consensus 213 ~~~~~~~~~~~~~~l~ 228 (577) T KOG1362 213 TSSLFVAVGNQLTLLD 228 (577) T ss_pred CHHHHHHHHHHHHHHH T ss_conf 2179999976999999 No 297 >TIGR01258 pgm_1 phosphoglycerate mutase 1 family; InterPro: IPR005952 Most members of this family are phosphoglycerate mutase (5.4.2.1 from EC). This enzyme interconverts 2-phosphoglycerate and 3-phosphoglycerate.2-phospho-D-glycerate + 2,3-diphosphoglycerate = 3-phospho-D-glycerate + 2,3-diphosphoglycerate. The enzyme is transiently phosphorylated on an active site histidine by 2,3-diphosphoglyerate, which is both substrate and product. Some members of this family have are phosphoglycerate mutase as a minor activity and act primarily as a bisphoglycerate mutase, interconverting 2,3-diphosphoglycerate and 1,3-diphosphoglycerate (5.4.2.4 from EC). ; GO: 0016868 intramolecular transferase activity phosphotransferases, 0006096 glycolysis. Probab=23.32 E-value=21 Score=16.35 Aligned_cols=11 Identities=45% Similarity=0.794 Sum_probs=4.2 Q ss_pred CHHHCCHHHHH Q ss_conf 12033265899 Q gi|254780479|r 53 TVKTVEPYFKK 63 (356) Q Consensus 53 ~v~~v~~~~~~ 63 (356) +|++|+|||+. T Consensus 156 T~~RvlPYW~d 166 (248) T TIGR01258 156 TVARVLPYWND 166 (248) T ss_pred HHHHHCCCHHH T ss_conf 99650653111 No 298 >cd03313 enolase Enolase: Enolases are homodimeric enzymes that catalyse the reversible dehydration of 2-phospho-D-glycerate to phosphoenolpyruvate as part of the glycolytic and gluconeogenesis pathways. The reaction is facilitated by the presence of metal ions. Probab=23.09 E-value=12 Score=17.94 Aligned_cols=49 Identities=24% Similarity=0.348 Sum_probs=18.3 Q ss_pred HHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHH----HHHHHHH---HHHHHHHHHHH Q ss_conf 7404634588999877675430057868731456----8888877---78776677664 Q gi|254780479|r 86 WAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEI----LKKLPGI---GDYTASAIVAI 137 (356) Q Consensus 86 w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~----l~~LpGi---G~yta~ai~s~ 137 (356) +.|.|-..-..|+.. .|....-|.=|.|..+ |+.|.|- .+-=|+||++. T Consensus 54 y~GkgV~~Av~~vn~---~I~~~L~G~~~~dQ~~iD~~LielDgT~nKs~lGaNailav 109 (408) T cd03313 54 YLGKGVLKAVKNVNE---IIAPALIGMDVTDQRAIDKLLIELDGTPNKSKLGANAILGV 109 (408) T ss_pred CCCCCHHHHHHHHHH---HHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHH T ss_conf 478159999999999---88999769986768999999997208866344651799999 No 299 >TIGR02903 spore_lon_C ATP-dependent protease, Lon family; InterPro: IPR014252 This entry shows some relation to the widely distributed ATP-dependent protease La, also called Lon or LonA (IPR004815 from INTERPRO), but is more closely related to LonB (IPR014251 from INTERPRO), a LonA paralog found only in endospore-forming bacteria. Proteins in this entry are unassigned peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SJ). They are restricted to a subset of endospore-forming species, and probably participate in the program of endospore formation. We propose the designation LonC.. Probab=22.95 E-value=37 Score=14.56 Aligned_cols=30 Identities=33% Similarity=0.367 Sum_probs=25.1 Q ss_pred HHHHHHHHHHHHH----------HHCCCEEEECCCCHHHH Q ss_conf 8777877667766----------43496313124314666 Q gi|254780479|r 124 PGIGDYTASAIVA----------IAFNHFAVVVDTNIERI 153 (356) Q Consensus 124 pGiG~yta~ai~s----------~a~~~~~~~vD~Nv~RV 153 (356) ||||.-||+=++. |+-|-+..=|||-..|= T Consensus 185 PGVGKTTaARl~LEe~K~~~~tPF~~DA~FvEVDGtTLRW 224 (616) T TIGR02903 185 PGVGKTTAARLALEEAKKLKNTPFAEDAPFVEVDGTTLRW 224 (616) T ss_pred CCCCHHHHHHHHHHHCCCCCCCCCCCCCCEEEECCCCCCC T ss_conf 3884789999987621368744761137857515762667 No 300 >cd07106 ALDH_AldA-AAD23400 Streptomyces aureofaciens putative aldehyde dehydrogenase AldA (AAD23400)-like. Putative aldehyde dehydrogenase, AldA, from Streptomyces aureofaciens (locus AAD23400) and other similar sequences are present in this CD. Probab=22.87 E-value=52 Score=13.51 Aligned_cols=24 Identities=25% Similarity=0.315 Sum_probs=12.6 Q ss_pred HHHHCCCHHHHHHHHHHHHHHHHH Q ss_conf 997404634588999877675430 Q gi|254780479|r 84 SAWAGLGYYTRARNLKKCADIIVK 107 (356) Q Consensus 84 ~~w~gLGyy~Rar~l~~~a~~i~~ 107 (356) +.|+-+....|++-|++.|..|.+ T Consensus 33 ~~W~~~~~~~R~~~L~~~a~~l~~ 56 (446) T cd07106 33 PGWSATPLEERRAALLAIADAIEA 56 (446) T ss_pred HHHCCCCHHHHHHHHHHHHHHHHH T ss_conf 987239999999999999999997 No 301 >COG1251 NirB NAD(P)H-nitrite reductase [Energy production and conversion] Probab=22.86 E-value=52 Score=13.50 Aligned_cols=46 Identities=17% Similarity=0.267 Sum_probs=26.4 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHH Q ss_conf 05786873145688888777877667766434963131243146665765 Q gi|254780479|r 108 KYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRY 157 (356) Q Consensus 108 ~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~ 157 (356) .|.|..+ ..-++++||--+.+|-+. =.-|.+..+.+-..+.|..|+ T Consensus 310 ~y~gsv~---stkLKv~Gvdl~S~GD~~-e~~~~~~iv~~D~~~~iYKrl 355 (793) T COG1251 310 AYEGSVT---STKLKVSGVDVFSAGDFQ-ETEGAESIVFRDEQRGIYKKL 355 (793) T ss_pred CCCCCCC---HHHHCCCCCCEEECCCHH-HCCCCCEEEEECCCCCCEEEE T ss_conf 5545542---454315566333011132-248994489865411300489 No 302 >TIGR01203 HGPRTase hypoxanthine phosphoribosyltransferase; InterPro: IPR005904 Phosphoribosyltransferases (PRT) are enzymes that catalyze the synthesis of beta-n-5'-monophosphates from phosphoribosylpyrophosphate (PRPP) and an enzyme specific amine. A number of PRT's are involved in the biosynthesis of purine, pyrimidine, and pyridine nucleotides, or in the salvage of purines and pyrimidines. Purine nucleotides are synthesized both via the de novo pathway and via the salvage pathway and are vital for cell functions and cell proliferation through DNA and RNA syntheses and ATP energy supply. This entry presents hypoxanthine phosphoribosyltransferase (2.4.2.8 from EC), which belongs to phosphoribosyltransferase family and is involved in purine salvage.; GO: 0004422 hypoxanthine phosphoribosyltransferase activity, 0006166 purine ribonucleoside salvage, 0005737 cytoplasm. Probab=22.84 E-value=52 Score=13.50 Aligned_cols=61 Identities=26% Similarity=0.273 Sum_probs=42.8 Q ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHHH---HHHHHHHHHHHHHHHH-----CCCEEEECCCCHHHHHH Q ss_conf 8999877675430057868731456888---8877787766776643-----49631312431466657 Q gi|254780479|r 95 ARNLKKCADIIVKKYEGNFPHKVEILKK---LPGIGDYTASAIVAIA-----FNHFAVVVDTNIERIIS 155 (356) Q Consensus 95 ar~l~~~a~~i~~~~~g~~P~~~~~l~~---LpGiG~yta~ai~s~a-----~~~~~~~vD~Nv~RVl~ 155 (356) ..+..++|++|.++|.|.-|.+-++|.. |.|==.-.|.-|..|. -+...+.+--.-.+|-| T Consensus 9 ~~r~~eLa~~I~~~Y~~~~~tt~~~l~~~gvLKGs~~F~aDL~~~i~~~~~~S~RS~~~~~~dFm~vSS 77 (183) T TIGR01203 9 KARIAELAKQITEDYQGKKPTTDEELVLVGVLKGSFPFFADLIRAIDKLNKYSDRSVVPVQVDFMAVSS 77 (183) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEEEEE T ss_conf 999999999999873567899897389999725624459999999887422068730114888898875 No 303 >cd07591 BAR_Rvs161p The Bin/Amphiphysin/Rvs (BAR) domain of Saccharomyces cerevisiae Reduced viability upon starvation protein 161 and similar proteins. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. This subfamily is composed of fungal proteins with similarity to Saccharomyces cerevisiae Reduced viability upon starvation protein 161 (Rvs161p) and Schizosaccharomyces pombe Hob3 (homolog of Bin3). S. cerevisiae Rvs161p plays a role in regulating cell polarity, actin cytoskeleton polarization, vesicle trafficking, endocytosis, bud formation, and the mating response. It forms a heterodimer with another BAR domain protein Rvs167p. Rvs161p and Rvs167p share common functions but are not interchangeable. Their BAR domains cannot be replaced with each other and the overexpression of one cannot suppress the mutant phenotypes of the other. S. pombe Hob3 is important in regulating filamentous actin localization an Probab=22.69 E-value=53 Score=13.48 Aligned_cols=75 Identities=16% Similarity=0.176 Sum_probs=42.2 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHC------CHHHCCHHHHHHHHHCCCHHHH Q ss_conf 9671689999999999962899887548554445678704676644673116------1203326589987748997898 Q gi|254780479|r 1 MPQPEHIIQSKILDWYDTNHRVLPWRTSPKTEKSSLPSPYKVWISEIMLQQT------TVKTVEPYFKKFMQKWPTIFCL 74 (356) Q Consensus 1 ~~~~~~~~~~~ll~w~~~~~R~lpwr~~~~~~~~~~~~py~v~vseimlqqT------~v~~v~~~~~~~~~~~P~~~~l 74 (356) |.+-...|.+.|..+|..++.+-++-. ..-|..-|.|+--+.. --.+|+.=-.+|...||.+... T Consensus 44 m~~~~~~lae~l~~~Y~~~~~~~~~~~---------~~~y~~~v~el~~~l~~el~~~~~~~vl~Pl~~~~s~f~~i~k~ 114 (224) T cd07591 44 LTSSQARIAETISSFYGDAGDKDGAML---------SQEYKQAVEELDAETVKELDGPYRQTVLDPIGRFNSYFPEINEA 114 (224) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999997353579644568---------89999999999999899998888999988999999985889999 Q ss_pred HCCCHHHHHH Q ss_conf 7299899999 Q gi|254780479|r 75 SSAKDEEILS 84 (356) Q Consensus 75 a~a~~~~vl~ 84 (356) ..-.....+- T Consensus 115 I~KR~~KllD 124 (224) T cd07591 115 IKKRNHKLLD 124 (224) T ss_pred HHHHHHHHHH T ss_conf 9997716877 No 304 >KOG0142 consensus Probab=22.45 E-value=23 Score=15.99 Aligned_cols=33 Identities=24% Similarity=0.495 Sum_probs=24.4 Q ss_pred CCCEEEEEEEEEECCCEEEEEECCCCHH-HHCCCC Q ss_conf 2211232000341376057764572013-204744 Q gi|254780479|r 235 RPMRTGAVFIAITNDNRILLRKRTNTRL-LEGMDE 268 (356) Q Consensus 235 k~~r~~~~~vii~~~~kiLL~KRp~~gl-l~GLwE 268 (356) .-.|.+.+|+ .+.+|+.||+||+..++ |.|+|. T Consensus 50 lLHRaFSVFl-Fns~~~lLlQqRS~~KitFP~~~T 83 (225) T KOG0142 50 LLHRAFSVFL-FNSKNELLLQQRSDEKITFPGLWT 83 (225) T ss_pred HHHHEEEEEE-ECCCCHHHHHHHCCCCCCCCCHHH T ss_conf 4432046898-658514776540354411650555 No 305 >cd02011 TPP_PK Thiamine pyrophosphate (TPP) family, Phosphoketolase (PK) subfamily, TPP-binding module; PK catalyzes the conversion of D-xylulose 5-phosphate and phosphate to acetyl phosphate, D-glyceraldehyde-3-phosphate and H2O. This enzyme requires divalent magnesium ions and TPP for activity. Probab=22.15 E-value=48 Score=13.76 Aligned_cols=12 Identities=8% Similarity=0.113 Sum_probs=4.6 Q ss_pred CCCCCCCCCCCC Q ss_conf 356431100000 Q gi|254780479|r 203 KPLCPLCPIQKN 214 (356) Q Consensus 203 ~P~C~~Cpl~~~ 214 (356) .=.|+..|+... T Consensus 89 DGEaETGPlAts 100 (227) T cd02011 89 DGEAETGPLATS 100 (227) T ss_pred CCCCCCCCCHHH T ss_conf 863235841222 No 306 >pfam05405 Mt_ATP-synt_B Mitochondrial ATP synthase B chain precursor (ATP-synt_B). The Fo sector of the ATP synthase is a membrane bound complex which mediates proton transport. It is composed of nine different polypeptide subunits (a, b, c, d, e, f, g F6, A6L). Probab=21.86 E-value=55 Score=13.37 Aligned_cols=121 Identities=15% Similarity=0.119 Sum_probs=76.4 Q ss_pred CCCHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECC Q ss_conf 89978987299899999974046345889998776754300578687314568888877787766776643496313124 Q gi|254780479|r 68 WPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVD 147 (356) Q Consensus 68 ~P~~~~la~a~~~~vl~~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD 147 (356) =|++.+.+.+..+++...+. -+|+.|..|-.-..++....=.-.+.-..|..+-+-++...+-.++-.....|- T Consensus 35 G~~i~~~lD~~~e~i~~~l~------~~r~~~~~a~~~~ie~~k~~~~~~~~~~~l~~~~ke~~al~~e~~~~~~~~~v~ 108 (163) T pfam05405 35 GPSIKEWLDKRIEKIQDELN------QSRNLHEKALKERIEYVKKLQSVVEETKVLFEVSKETVALEAEAFERELQAALA 108 (163) T ss_pred HHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 08899999998999999999------888878999999999999999999989999998246799777999999999999 Q ss_pred CCHHHHHHHHHCCCCCCCH-HHHH-----HHHHHHCCCCCC-CHHHHHHHHHHH Q ss_conf 3146665765197545735-6899-----987410257852-267899999999 Q gi|254780479|r 148 TNIERIISRYFDIIKPAPL-YHKT-----IKNYARKITSTS-RPGDFVQAMMDL 194 (356) Q Consensus 148 ~Nv~RVl~R~~~~~~~~~~-~~k~-----l~~~~~~~~~~~-~~~~~nQAlMdl 194 (356) .+|+|+|--+...+..... .++. +......+.|+. .---..|++-|| T Consensus 109 ~evk~~Ld~~v~~e~~~r~~~Q~~~v~~v~~~V~~~i~~~~~~k~~l~~~I~~l 162 (163) T pfam05405 109 REIKSKLDTLVRKESSVRQREQDHLVNWVISSVLKELSPPKFQKKSLQESIADL 162 (163) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 999999999999998999999999999999999987079167999999998860 No 307 >TIGR03371 cellulose_yhjQ cellulose synthase operon protein YhjQ. Members of this family are the YhjQ protein, found immediately upsteam of bacterial cellulose synthase (bcs) genes in a broad range of bacteria, including both copies of the bcs locus in Klebsiella pneumoniae. In several species it is seen clearly as part of the bcs operon. It is identified as a probable component of the bacterial cellulose metabolic process not only by gene location, but also by partial phylogenetic profiling, or Haft-Selengut algorithm (PMID:16930487), based on a bacterial cellulose biosynthesis genome property profile. Cellulose plays an important role in biofilm formation and structural integrity in some bacteria. Mutants in yhjQ in Escherichia coli, show altered morphology an growth, but the function of YhjQ has not yet been determined. Probab=21.85 E-value=55 Score=13.37 Aligned_cols=39 Identities=21% Similarity=0.310 Sum_probs=24.6 Q ss_pred HHHHHHHHHHHHHHC---CCEEEECCCCHHHHHHHHHCCCCC Q ss_conf 777877667766434---963131243146665765197545 Q gi|254780479|r 125 GIGDYTASAIVAIAF---NHFAVVVDTNIERIISRYFDIIKP 163 (356) Q Consensus 125 GiG~yta~ai~s~a~---~~~~~~vD~Nv~RVl~R~~~~~~~ 163 (356) |||..|.++=++.++ |.++.+||..-..-++..||.+.. T Consensus 12 GVGKTT~avNLA~~La~~G~rVLlIDlDpQ~~l~~~~g~~~~ 53 (246) T TIGR03371 12 GVGRTTLTAALASALKLLGEPVLAIDLDPQNLLRLHFGMDWS 53 (246) T ss_pred CCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCC T ss_conf 854999999999999968997899975999850322488875 No 308 >KOG4579 consensus Probab=21.85 E-value=53 Score=13.45 Aligned_cols=16 Identities=50% Similarity=0.615 Sum_probs=10.3 Q ss_pred HHHHHHCCCHHHHHCC Q ss_conf 9987748997898729 Q gi|254780479|r 62 KKFMQKWPTIFCLSSA 77 (356) Q Consensus 62 ~~~~~~~P~~~~la~a 77 (356) ++|..+|||...|--| T Consensus 70 ~kft~kf~t~t~lNl~ 85 (177) T KOG4579 70 KKFTIKFPTATTLNLA 85 (177) T ss_pred HHHHHCCCHHHHHHCC T ss_conf 9884054046555221 No 309 >PRK00077 eno phosphopyruvate hydratase; Provisional Probab=21.83 E-value=16 Score=17.15 Aligned_cols=27 Identities=26% Similarity=0.199 Sum_probs=10.1 Q ss_pred CCCCCCHHHH----HHHHHHH---HHHHHHHHHH Q ss_conf 7868731456----8888877---7877667766 Q gi|254780479|r 110 EGNFPHKVEI----LKKLPGI---GDYTASAIVA 136 (356) Q Consensus 110 ~g~~P~~~~~----l~~LpGi---G~yta~ai~s 136 (356) -|.=|.|..+ |+.|.|- .+.-|+||++ T Consensus 80 ~G~~~~dQ~~iD~~LielDgT~nKs~lGaNaila 113 (427) T PRK00077 80 IGLDAFDQAGIDKAMIELDGTENKSKLGANAILA 113 (427) T ss_pred CCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH T ss_conf 7997462899999999732985412244267899 No 310 >TIGR03650 violacein_E violacein biosynthesis enzyme VioE. This enzyme catalyzes the third step in violacein biosynthesis from a pair of Trp residues, as in Chromobacterium violaceum, but the first step that distinguishes that pathway from staurosporine (an indolocarbazole antibiotic) biosynthesis. Probab=21.60 E-value=36 Score=14.62 Aligned_cols=15 Identities=33% Similarity=0.740 Sum_probs=9.5 Q ss_pred CHHHHHHHHHHHHHC Q ss_conf 704676644673116 Q gi|254780479|r 38 SPYKVWISEIMLQQT 52 (356) Q Consensus 38 ~py~v~vseimlqqT 52 (356) .-|+.|.||||--.| T Consensus 53 ~G~rLWMSEIg~a~~ 67 (184) T TIGR03650 53 TGYRLWMSEIGNAAS 67 (184) T ss_pred CCHHHHHHHHCCCCC T ss_conf 211226766466433 No 311 >TIGR01577 oligosac_amyl oligosaccharide amylase; InterPro: IPR006465 This entry represents glycoamylase enzymes (also known as glucan 1,4-alpha-glucosidase), which catalyse the hydrolysis of terminal 1,4-linked alpha-D-glucose residues successively from non-reducing ends of the chains with the release of beta-D-glucose. This enzyme has been characterised in Thermoactinomyces vulgaris, where, unlike previously characterised glucoamylases, this enzyme hydrolyzes maltooligosaccharides (maltotetraose, maltose) more efficiently than starch, indicating this enzyme belongs to a class of glucoamylase-type enzymes with oligosaccharide-metabolizing activity , . More information about this protein can be found at Protein of the Month: alpha-Amylase .. Probab=21.58 E-value=55 Score=13.33 Aligned_cols=40 Identities=23% Similarity=0.349 Sum_probs=30.5 Q ss_pred HHHHHHHHHHHCCCC---CCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 998776754300578---6873145688888777877667766 Q gi|254780479|r 97 NLKKCADIIVKKYEG---NFPHKVEILKKLPGIGDYTASAIVA 136 (356) Q Consensus 97 ~l~~~a~~i~~~~~g---~~P~~~~~l~~LpGiG~yta~ai~s 136 (356) .+.+||+.++.--.. -+-...|-+..-=||=.||++||.| T Consensus 402 sv~kAA~yL~l~~~~~~Gl~~pc~DLWEer~GvftYt~~a~y~ 444 (679) T TIGR01577 402 SVEKAAEYLILVIKSELGLFEPCFDLWEERYGVFTYTAAAVYA 444 (679) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHH T ss_conf 9999999999873367887677530001003212223468888 No 312 >cd07131 ALDH_AldH-CAJ73105 Uncharacterized Candidatus kuenenia aldehyde dehydrogenase AldH (CAJ73105)-like. Uncharacterized aldehyde dehydrogenase of Candidatus kuenenia AldH (locus CAJ73105) and similar sequences with similarity to alpha-aminoadipic semialdehyde dehydrogenase (AASADH, human ALDH7A1, EC=1.2.1.31), Arabidopsis ALDH7B4, and Streptomyces clavuligerus delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH) are included in this CD. Probab=21.51 E-value=56 Score=13.32 Aligned_cols=56 Identities=14% Similarity=0.206 Sum_probs=35.0 Q ss_pred HHHHHCCCHHHHHHHHHHHHHHHHH-----------CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9997404634588999877675430-----------0578687314568888877787766776643 Q gi|254780479|r 83 LSAWAGLGYYTRARNLKKCADIIVK-----------KYEGNFPHKVEILKKLPGIGDYTASAIVAIA 138 (356) Q Consensus 83 l~~w~gLGyy~Rar~l~~~a~~i~~-----------~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a 138 (356) .+.|+-+..+.|++-|+++|..+.+ +-|--+.+...|+...-..-+|.|+....+. T Consensus 50 ~~~W~~~~~~eR~~~L~~~a~~l~~~~~~la~~~~~E~GKp~~~a~~ev~~~~~~~~~~a~~~~~~~ 116 (478) T cd07131 50 FPEWRKVPAPRRAEYLFRAAELLKKRKEELARLVTREMGKPLAEGRGDVQEAIDMAQYAAGEGRRLF 116 (478) T ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9988259999999999999999997799999999997393899999999999999999999999851 No 313 >COG1168 MalY Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities [Amino acid transport and metabolism] Probab=21.47 E-value=56 Score=13.32 Aligned_cols=109 Identities=19% Similarity=0.257 Sum_probs=74.4 Q ss_pred CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCHHHCCHHHHHHHHHCCCHHHHHCCCHHHHHHHHHC-CCHH-HHHHHH Q ss_conf 99887548554445678704676644673116120332658998774899789872998999999740-4634-588999 Q gi|254780479|r 21 RVLPWRTSPKTEKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQKWPTIFCLSSAKDEEILSAWAG-LGYY-TRARNL 98 (356) Q Consensus 21 R~lpwr~~~~~~~~~~~~py~v~vseimlqqT~v~~v~~~~~~~~~~~P~~~~la~a~~~~vl~~w~g-LGyy-~Rar~l 98 (356) ..+-|+..++ ..|--..|||+. -||++..+.+|=.+.|-. | .||- .-.... T Consensus 14 ~s~KWd~~~g-----~~DvlPmWVADM-------------------Df~~pp~i~~Al~~rvdh---GvfGY~~~~~~~~ 66 (388) T COG1168 14 NSLKWDAHPG-----NEDVLPMWVADM-------------------DFPTPPEIIEALRERVDH---GVFGYPYGSDELY 66 (388) T ss_pred CCEECCCCCC-----CCCCCEEEEECC-------------------CCCCCHHHHHHHHHHHHC---CCCCCCCCCHHHH T ss_conf 7500255689-----888413432004-------------------678887999999999865---8778878877899 Q ss_pred HHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHH Q ss_conf 877675430057868731456888887778776677664349631312431466657651 Q gi|254780479|r 99 KKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYF 158 (356) Q Consensus 99 ~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~ 158 (356) .+.+....++|+=.+.. +.+.--|||=+=.+-+|-.+.---...+|.+=|.-=|.+.. T Consensus 67 ~ai~~w~~~r~~~~i~~--e~i~~~p~VVpgi~~~I~~~T~~gd~Vvi~tPvY~PF~~~i 124 (388) T COG1168 67 AAIAHWFKQRHQWEIKP--EWIVFVPGVVPGISLAIRALTKPGDGVVIQTPVYPPFYNAI 124 (388) T ss_pred HHHHHHHHHHCCCCCCC--CEEEECCCCHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHH T ss_conf 99999999713777885--52898375127699999983769982696488861899987 No 314 >pfam10391 DNA_pol_lambd_f Fingers domain of DNA polymerase lambda. DNA polymerases catalyse the addition of dNMPs onto the 3-prime ends of DNA chains. There is a general polymerase fold consisting of three subdomains that have been likened to the fingers, palm, and thumb of a right hand. DNA_pol_lambd_f is the central three-helical region of DNA polymerase lambda referred to as the F and G helices of the fingers domain. Contacts with DNA involve this conserved helix-hairpin-helix motif in the fingers region which interacts with the primer strand. This motif is common to several DNA binding proteins and confers a sequence-independent interaction with the DNA backbone. Probab=21.41 E-value=44 Score=14.04 Aligned_cols=21 Identities=19% Similarity=0.281 Sum_probs=15.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 568888877787766776643 Q gi|254780479|r 118 EILKKLPGIGDYTASAIVAIA 138 (356) Q Consensus 118 ~~l~~LpGiG~yta~ai~s~a 138 (356) ..+.+.-||||.||.-..+-+ T Consensus 2 ~~f~~I~GvGp~~A~~~~~~G 22 (52) T pfam10391 2 KLFTNIWGVGPKTARKWYRQG 22 (52) T ss_pred HHHHHCCCCCHHHHHHHHHHC T ss_conf 036635440699999999947 No 315 >TIGR03018 pepcterm_TyrKin exopolysaccharide/PEPCTERM locus tyrosine autokinase. Members of this protein family are related to a known protein-tyrosine autokinase and to numerous homologs from exopolysaccharide biosynthesis region proteins, many of which are designated as chain length determinants. Most members of this family contain a short region, immediately C-terminal to the region modeled here, with an abundance of Tyr residues. These C-terminal tyrosine residues are likely to be autophosphorylation sites. Some members of this family are fusion proteins. Probab=21.31 E-value=41 Score=14.25 Aligned_cols=41 Identities=22% Similarity=0.453 Sum_probs=29.3 Q ss_pred HHHHHHHHHHHHHHHHC----CCEEEECCCCHHH-HHHHHHCCCCC Q ss_conf 88777877667766434----9631312431466-65765197545 Q gi|254780479|r 123 LPGIGDYTASAIVAIAF----NHFAVVVDTNIER-IISRYFDIIKP 163 (356) Q Consensus 123 LpGiG~yta~ai~s~a~----~~~~~~vD~Nv~R-Vl~R~~~~~~~ 163 (356) .||-|..|-++=++.++ ++.+..||++.+| -+++++++... T Consensus 44 ~pgeGKTtva~nLA~~lA~~~~~~VLLVDaDlr~p~l~~~l~~~~~ 89 (207) T TIGR03018 44 LPGEGKSFTAINLAISLAQEYDKTVLLIDADLRRPSLHRTLGLEAE 89 (207) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHCCCCCC T ss_conf 9999889999999999997249859999535789971001388999 No 316 >pfam00506 Flu_NP Influenza virus nucleoprotein. Probab=21.26 E-value=56 Score=13.29 Aligned_cols=105 Identities=20% Similarity=0.315 Sum_probs=59.6 Q ss_pred HHHHHHHHHH---HHHHHHCCCCCCCCHHHH-HHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHH------HHHHHHC-C Q ss_conf 4588999877---675430057868731456-888887778776677664349631312431466------6576519-7 Q gi|254780479|r 92 YTRARNLKKC---ADIIVKKYEGNFPHKVEI-LKKLPGIGDYTASAIVAIAFNHFAVVVDTNIER------IISRYFD-I 160 (356) Q Consensus 92 y~Rar~l~~~---a~~i~~~~~g~~P~~~~~-l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~R------Vl~R~~~-~ 160 (356) |.|.|+|... ++.+.---|..+|.-... -.+..|||.-.|.||.-|.-+ +-|-|..| -+.|... + T Consensus 156 yQR~ral~R~g~dp~l~sl~qGstlprRsga~g~a~kG~Gtlvaeair~IkRg----i~dr~~lR~~~t~~ayEr~~~~l 231 (506) T pfam00506 156 YQRTRALVRTGMDPRLCSLMQGSTLPRRSGAAGAAIKGVGTLVAEAIRMIKRG----INDRNFLRGRRTRIAYERMCNIL 231 (506) T ss_pred HHHHHHHHHCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHC----CCCCCEECCCCCCCHHHHHHHHH T ss_conf 99889998708987886553268777533332432111667799999998742----22100001253576899999986 Q ss_pred CCCCCHHHHHHHHHHHCCCCCCCHHHHH-HHHHHH--HHHHCCCC Q ss_conf 5457356899987410257852267899-999999--86320412 Q gi|254780479|r 161 IKPAPLYHKTIKNYARKITSTSRPGDFV-QAMMDL--GALICTSN 202 (356) Q Consensus 161 ~~~~~~~~k~l~~~~~~~~~~~~~~~~n-QAlMdl--Ga~iC~p~ 202 (356) +. +..-.....+++......+|++-- ++|.-| .++|-+|+ T Consensus 232 k~--K~~~~~qrAl~dqV~~srNpg~A~iEdL~~lArs~lilrps 274 (506) T pfam00506 232 KG--KFQTAAQRALMDQVRESRNPGNAEIEDLIFLARSALILRPS 274 (506) T ss_pred HC--CCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCC T ss_conf 42--26757899999999853798732278999999877740510 No 317 >COG4892 Predicted heme/steroid binding protein [General function prediction only] Probab=21.20 E-value=30 Score=15.26 Aligned_cols=35 Identities=26% Similarity=0.501 Sum_probs=23.7 Q ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 89998776754300578687314568888877787 Q gi|254780479|r 95 ARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDY 129 (356) Q Consensus 95 ar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~y 129 (356) ---||.+.+-+.++++..-|.-.+-|..+|-||.- T Consensus 41 HqglhsaG~DLs~~~~~~aphg~eil~~~PvVG~L 75 (81) T COG4892 41 HQGLHSAGKDLSSEFNSCAPHGMEILTSLPVVGAL 75 (81) T ss_pred CCCCCCCCHHHHHHHHHCCCCHHHHHHCCCHHHEE T ss_conf 10100024137788763187606788627412101 No 318 >PRK04195 replication factor C large subunit; Provisional Probab=21.16 E-value=57 Score=13.27 Aligned_cols=27 Identities=15% Similarity=0.331 Sum_probs=16.9 Q ss_pred HHHHHHHHHHHHHHHHHCCCEEEECCC Q ss_conf 888777877667766434963131243 Q gi|254780479|r 122 KLPGIGDYTASAIVAIAFNHFAVVVDT 148 (356) Q Consensus 122 ~LpGiG~yta~ai~s~a~~~~~~~vD~ 148 (356) --||||.-|+.-+++=.+|-.+.=+++ T Consensus 47 GPpGvGKTT~a~~lAk~~g~~viElNA 73 (403) T PRK04195 47 GPPGVGKTSLAHALANDYGWEVIELNA 73 (403) T ss_pred CCCCCCHHHHHHHHHHHHCCCEEEECC T ss_conf 939987999999999984998599771 No 319 >pfam08440 Poty_PP Potyviridae polyprotein. This domain is found in polyproteins of the viral Potyviridae taxon. Probab=21.00 E-value=38 Score=14.47 Aligned_cols=14 Identities=14% Similarity=0.280 Sum_probs=8.2 Q ss_pred HHHHHHHCCCEEEE Q ss_conf 67766434963131 Q gi|254780479|r 132 SAIVAIAFNHFAVV 145 (356) Q Consensus 132 ~ai~s~a~~~~~~~ 145 (356) +|.+|||||.|++. T Consensus 1 AAflcFaYGLPvmt 14 (274) T pfam08440 1 AAFLCFAYGLPVMT 14 (274) T ss_pred CCHHHHHCCCCCEE T ss_conf 92787651998633 No 320 >COG2355 Zn-dependent dipeptidase, microsomal dipeptidase homolog [Amino acid transport and metabolism] Probab=20.86 E-value=57 Score=13.23 Aligned_cols=90 Identities=14% Similarity=0.169 Sum_probs=45.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCC---CCCCCHHHHHH Q ss_conf 873145688888777877667766434963131243146665765197545735689998741025---78522678999 Q gi|254780479|r 113 FPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPAPLYHKTIKNYARKI---TSTSRPGDFVQ 189 (356) Q Consensus 113 ~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~---~~~~~~~~~nQ 189 (356) |=.+++.|..|--.|- ..+.+.+|.+-..=||=..| +-.|+ ++.++..+ ..++++ ++-.+.++ + T Consensus 107 ~~~dl~~L~~~~~~Gv----R~lgltwn~~N~~g~g~~~~---~~~GL---s~~Gk~lV-~~~N~LgIiiDlSH~s~--k 173 (313) T COG2355 107 LGDDLDKLELFHALGV----RSLGLTWNRDNLFGDGCYER---TGGGL---TPFGKELV-REMNELGIIIDLSHLSD--K 173 (313) T ss_pred CCCCHHHHHHHHHHCC----EEEEEEECCCCCCCCCCCCC---CCCCC---CHHHHHHH-HHHHHCCCEEEECCCCC--C T ss_conf 0053899999998176----47876623677656756778---89987---87899999-99986597787223587--2 Q ss_pred HHHHHHHHHCCCCCCCCC-CCCCCCCHHHH Q ss_conf 999998632041235643-11000001343 Q gi|254780479|r 190 AMMDLGALICTSNKPLCP-LCPIQKNCLTF 218 (356) Q Consensus 190 AlMdlGa~iC~p~~P~C~-~Cpl~~~C~~~ 218 (356) .++|--++ ++.|-|. .|-.+..|..- T Consensus 174 t~~Dvl~~---s~~PviaSHSN~~al~~h~ 200 (313) T COG2355 174 TFWDVLDL---SKAPVVASHSNARALVDHP 200 (313) T ss_pred CHHHHHHC---CCCCEEEECCCCHHCCCCC T ss_conf 29999854---5886687437724105888 No 321 >PRK13153 consensus Probab=20.81 E-value=42 Score=14.20 Aligned_cols=11 Identities=36% Similarity=0.821 Sum_probs=5.4 Q ss_pred HHHHHHHHHHH Q ss_conf 88887778776 Q gi|254780479|r 121 KKLPGIGDYTA 131 (356) Q Consensus 121 ~~LpGiG~yta 131 (356) +=|||||.+.. T Consensus 41 lIlPGVGsf~~ 51 (203) T PRK13153 41 LILPGVGAFGD 51 (203) T ss_pred EEECCCCCHHH T ss_conf 89878883488 No 322 >cd02035 ArsA ArsA ATPase functionas as an efflux pump located on the inner membrane of the cell. This ATP-driven oxyanion pump catalyzes the extrusion of arsenite, antimonite and arsenate. Maintenance of a low intracellular concentration of oxyanion produces resistance to the toxic agents. The pump is composed of two subunits, the catalytic ArsA subunit and the membrane subunit ArsB, which are encoded by arsA and arsB genes respectively. Arsenic efflux in bacteria is catalyzed by either ArsB alone or by ArsAB complex. The ATP-coupled pump, however, is more efficient. ArsA is composed of two homologous halves, A1 and A2, connected by a short linker sequence. Probab=20.77 E-value=58 Score=13.22 Aligned_cols=25 Identities=28% Similarity=0.484 Sum_probs=13.7 Q ss_pred HHHHHHHHHHHHHHC---CCEEEECCCC Q ss_conf 777877667766434---9631312431 Q gi|254780479|r 125 GIGDYTASAIVAIAF---NHFAVVVDTN 149 (356) Q Consensus 125 GiG~yta~ai~s~a~---~~~~~~vD~N 149 (356) |||..|.+|-++.++ |+++.+||+. T Consensus 9 GVGKTTvAaalA~~lA~~G~rvLlvs~D 36 (217) T cd02035 9 GVGKTTIAAATAVRLAEEGKKVLLVSTD 36 (217) T ss_pred CCCHHHHHHHHHHHHHHCCCEEEEEECC T ss_conf 6619999999999999689949999589 No 323 >COG5133 Uncharacterized conserved protein [Function unknown] Probab=20.72 E-value=43 Score=14.11 Aligned_cols=94 Identities=12% Similarity=0.262 Sum_probs=52.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCEEEE--CCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHH--- Q ss_conf 145688888777877667766434963131--243146665765197545735689998741025785226789999--- Q gi|254780479|r 116 KVEILKKLPGIGDYTASAIVAIAFNHFAVV--VDTNIERIISRYFDIIKPAPLYHKTIKNYARKITSTSRPGDFVQA--- 190 (356) Q Consensus 116 ~~~~l~~LpGiG~yta~ai~s~a~~~~~~~--vD~Nv~RVl~R~~~~~~~~~~~~k~l~~~~~~~~~~~~~~~~nQA--- 190 (356) +..++-.||-.-..+-.-.++-......|+ -||++.- +++.......+.++...+-++++|-.+.|- T Consensus 10 di~e~NqlP~r~e~~~dlll~ss~e~~~piese~~~~~~--------ee~d~id~qeiydLia~I~DPEHPltL~QLsVV 81 (181) T COG5133 10 DIKEMNQLPFRAESEEDLLLGSSLEERHPIESENGVLQN--------EEPDLIDQQEIYDLIADIRDPEHPLTLEQLSVV 81 (181) T ss_pred CHHHHHCCCCCCCCHHHHHCCCCHHHCCCCCCCCCCCCC--------CCCCCCCHHHHHHHHHHCCCCCCCCCHHHCCEE T ss_conf 356651387401212444314425441755334651126--------788876889999999860798887407660135 Q ss_pred --------------HHHHHHHHCCCCCCCCCCCCCCCCHHH Q ss_conf --------------999986320412356431100000134 Q gi|254780479|r 191 --------------MMDLGALICTSNKPLCPLCPIQKNCLT 217 (356) Q Consensus 191 --------------lMdlGa~iC~p~~P~C~~Cpl~~~C~~ 217 (356) .|-.--.+-||.-|.|+.|.+-..|.- T Consensus 82 ~lEdi~v~D~~~~n~i~~v~v~itPTIpHCSmatlIGLcIr 122 (181) T COG5133 82 SLEDISVPDGIAPNVIRCVKVVITPTIPHCSMATLIGLCIR 122 (181) T ss_pred EECCCCCCCCCCCCEEEEEEEEECCCCCCHHHHHHHHHHEE T ss_conf 20255346788877047899996358872068877643203 No 324 >COG1313 PflX Uncharacterized Fe-S protein PflX, homolog of pyruvate formate lyase activating proteins [General function prediction only] Probab=20.58 E-value=52 Score=13.55 Aligned_cols=45 Identities=18% Similarity=0.279 Sum_probs=24.2 Q ss_pred HHHHHHHHCCCCCCCHHHHHHH--HHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 9998741025785226789999--99998632041235643110000 Q gi|254780479|r 169 KTIKNYARKITSTSRPGDFVQA--MMDLGALICTSNKPLCPLCPIQK 213 (356) Q Consensus 169 k~l~~~~~~~~~~~~~~~~nQA--lMdlGa~iC~p~~P~C~~Cpl~~ 213 (356) ..+|.+-+.-++....+|...+ +||+-..|-...--.|..||-+. T Consensus 38 ~elw~~H~~~~~~~~~~d~~d~~~lldlK~eia~~~Le~C~lCerrC 84 (335) T COG1313 38 EELWRIHERKLRKLASGDIKDSPSLLDLKVEIAYRILEKCTLCERRC 84 (335) T ss_pred HHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 99999987640354466601206788889999998851367463004 No 325 >PRK09847 gamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase; Provisional Probab=20.57 E-value=58 Score=13.19 Aligned_cols=52 Identities=23% Similarity=0.253 Sum_probs=30.0 Q ss_pred HHCCCHHHHHHHHHHHHHHHHHCCC----------CC-CCCH-HHHHHHHHHHHHHHHHHHHHH Q ss_conf 7404634588999877675430057----------86-8731-456888887778776677664 Q gi|254780479|r 86 WAGLGYYTRARNLKKCADIIVKKYE----------GN-FPHK-VEILKKLPGIGDYTASAIVAI 137 (356) Q Consensus 86 w~gLGyy~Rar~l~~~a~~i~~~~~----------g~-~P~~-~~~l~~LpGiG~yta~ai~s~ 137 (356) |+.+....|++-|++.|..|.++-+ |+ +-.. ..++...-..=+|.|+..-.+ T Consensus 75 W~~~~~~eR~~iL~~~a~~l~~~~~ela~l~~~E~GK~~~~a~~~ev~~~~~~~~~~a~~~~~~ 138 (494) T PRK09847 75 WSLSSPAKRKAVLNKLADLMEAHAEELALLETLDTGKPIRHSLRDDIPGAARAIRWYAEAIDKV 138 (494) T ss_pred CHHCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 0109999999999999999997999999999997290679999999999999999999988873 No 326 >TIGR01944 rnfB electron transport complex, RnfABCDGE type, B subunit; InterPro: IPR010207 The six subunit complex RnfABCDGE in Rhodobacter capsulatus encodes an apparent NADH oxidoreductase responsible for electron transport to nitrogenase, necessary for nitrogen fixation. A closely related complex in Escherichia coli, RsxABCDGE (Reducer of SoxR), reduces the 2Fe-2S-containing superoxide sensor SoxR, active as a transcription factor when oxidized. This family of putative NADH oxidoreductase complexes exists in many of the same species as the related NQR, a Na(+)-translocating NADH-quinone reductase, but is distinct. This entry describes the B subunit.; GO: 0009055 electron carrier activity, 0051536 iron-sulfur cluster binding, 0006118 electron transport. Probab=20.50 E-value=58 Score=13.18 Aligned_cols=54 Identities=15% Similarity=0.132 Sum_probs=19.9 Q ss_pred HCCCCCCCCHHHHHHHHHHHHHHHHHHHHH--HHCCCEEEECCCCHHHHHHHHHCCC Q ss_conf 005786873145688888777877667766--4349631312431466657651975 Q gi|254780479|r 107 KKYEGNFPHKVEILKKLPGIGDYTASAIVA--IAFNHFAVVVDTNIERIISRYFDII 161 (356) Q Consensus 107 ~~~~g~~P~~~~~l~~LpGiG~yta~ai~s--~a~~~~~~~vD~Nv~RVl~R~~~~~ 161 (356) ++-+--+|.|.=.==--||=.|| |-||.. =+-=-+.++==-+|.|=|+-+++++ T Consensus 41 e~id~lLPQTQCGqCGypGC~PY-AeAia~NGea~In~C~PGG~a~~~~lA~~L~v~ 96 (213) T TIGR01944 41 ERIDALLPQTQCGQCGYPGCRPY-AEAIAENGEALINKCPPGGEAVILKLAELLGVE 96 (213) T ss_pred HHHHHHCCCCCCCCCCCCCCHHH-HHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCC T ss_conf 99986477554646788874158-999986078778788887078999999983889 No 327 >pfam02764 Diphtheria_T Diphtheria toxin, T domain. Central domain of diphtheria toxin is the translocation (T) domain. pH induced conformational change in this domain triggers insertion into the endosomal membrane and facilitates the transfer of the catalytic domain into the cytoplasm. Probab=20.07 E-value=42 Score=14.21 Aligned_cols=100 Identities=23% Similarity=0.374 Sum_probs=55.1 Q ss_pred HHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHCCCCCC Q ss_conf 97404634588999877675430057868731456888887778776677664349631312431466657651975457 Q gi|254780479|r 85 AWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGIGDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPA 164 (356) Q Consensus 85 ~w~gLGyy~Rar~l~~~a~~i~~~~~g~~P~~~~~l~~LpGiG~yta~ai~s~a~~~~~~~vD~Nv~RVl~R~~~~~~~~ 164 (356) ..+|-.|-.-|-| .|+.|-.+-...+-++-..|--|||||. +|.|| ..+|.-|...+.+.-.++..- T Consensus 73 vfaganyaawavn---vaqvidsetadnlekttaalsilpgigs-----vmgia----dgavhhnteeivaqsialssl- 139 (180) T pfam02764 73 VFAGANYAAWAVN---VAQVIDSETADNLEKTTAALSILPGIGS-----VMGIA----DGAVHHNTEEIVAQSIALSSL- 139 (180) T ss_pred CCCCCCHHHHHHH---HHHHHCCHHHHHHHHHHHHHHHCCCCCC-----HHEEC----CCCCCCCHHHHHHHHHHHHHH- T ss_conf 3345513898876---9999730523248877788874557521-----20003----430004789999999999999- Q ss_pred CHHHHHHHHHHHCCCCCC-CHHHHHHHHHHHHHHHC Q ss_conf 356899987410257852-26789999999986320 Q gi|254780479|r 165 PLYHKTIKNYARKITSTS-RPGDFVQAMMDLGALIC 199 (356) Q Consensus 165 ~~~~k~l~~~~~~~~~~~-~~~~~nQAlMdlGa~iC 199 (356) .. .+---+..++++.. -..+|.++++.|=..|- T Consensus 140 -mv-aqaiplvgelvdigfaaynfvesiinlfqvvh 173 (180) T pfam02764 140 -MV-AQAIPLVGELVDIGFAAYNFVESIINLFQVVH 173 (180) T ss_pred -HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf -99-86510556787631799999999999999986 No 328 >TIGR01837 PHA_granule_1 poly(hydroxyalkanoate) granule-associated protein; InterPro: IPR008769 Polyhydroxyalkanoates (PHAs) are storage polyesters synthesised by various bacteria as intracellular carbon and energy reserve material. PHAs are accumulated as water-insoluble inclusions within the cells. This family consists of the phasins PhaF and PhaI which act as a transcriptional regulator of PHA biosynthesis genes. PhaF has been proposed to repress expression of the phaC1 gene and the phaIF operon.. Probab=20.05 E-value=59 Score=13.12 Aligned_cols=23 Identities=17% Similarity=0.455 Sum_probs=17.2 Q ss_pred CCCEEEECCCCHHHHHHHHHCCCC Q ss_conf 496313124314666576519754 Q gi|254780479|r 139 FNHFAVVVDTNIERIISRYFDIIK 162 (356) Q Consensus 139 ~~~~~~~vD~Nv~RVl~R~~~~~~ 162 (356) +++=.-++|.-|.++|+|| +|.. T Consensus 75 w~~l~~~FD~R~~~~L~rL-~IPs 97 (121) T TIGR01837 75 WDKLEKAFDERVEQALNRL-NIPS 97 (121) T ss_pred HHHHHHHHHHHHHHHHHHC-CCCC T ss_conf 9889999999999999857-8998 No 329 >KOG2573 consensus Probab=20.02 E-value=36 Score=14.67 Aligned_cols=31 Identities=35% Similarity=0.517 Sum_probs=18.8 Q ss_pred CCHHHHHHHHHCCCHHHHHCCCHHHHHHHHHCCCH-HHHHH Q ss_conf 32658998774899789872998999999740463-45889 Q gi|254780479|r 57 VEPYFKKFMQKWPTIFCLSSAKDEEILSAWAGLGY-YTRAR 96 (356) Q Consensus 57 v~~~~~~~~~~~P~~~~la~a~~~~vl~~w~gLGy-y~Rar 96 (356) |--.|.++|+-++.- +.-++-=|||| ||||+ T Consensus 126 vR~hf~kl~K~L~~~---------d~~kaqLGLghsYSRaK 157 (498) T KOG2573 126 VRKHFDKLMKGLDPG---------DLEKAQLGLGHSYSRAK 157 (498) T ss_pred HHHHHHHHHCCCCCC---------CHHHHHHCCCCHHHHHH T ss_conf 999999997469975---------27888750100111545 No 330 >cd07086 ALDH_F7_AASADH-like NAD+-dependent alpha-aminoadipic semialdehyde dehydrogenase and related proteins. ALDH subfamily which includes the NAD+-dependent, alpha-aminoadipic semialdehyde dehydrogenase (AASADH, EC=1.2.1.31), also known as Antiquitin-1, ALDH7A1, ALDH7B or delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH), and other similar sequences, such as the uncharacterized aldehyde dehydrogenase of Candidatus kuenenia AldH (locus CAJ73105). Probab=20.01 E-value=60 Score=13.11 Aligned_cols=53 Identities=19% Similarity=0.344 Sum_probs=33.8 Q ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHCCC----------C-CCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 9997404634588999877675430057----------8-687314568888877787766776 Q gi|254780479|r 83 LSAWAGLGYYTRARNLKKCADIIVKKYE----------G-NFPHKVEILKKLPGIGDYTASAIV 135 (356) Q Consensus 83 l~~w~gLGyy~Rar~l~~~a~~i~~~~~----------g-~~P~~~~~l~~LpGiG~yta~ai~ 135 (356) ...|+.+....||+-|.++|+.+.++.+ | -+.+...|+......-+|.|+... T Consensus 48 ~~~W~~~~~~eRa~iL~~~a~~l~~~~~ela~l~~~e~GK~~~~a~~Ev~~~~~~~~~~a~~~~ 111 (478) T cd07086 48 FKEWRKVPAPRRGEIVRQIGEALRKKKEALGRLVSLEMGKILPEGLGEVQEMIDICDYAVGLSR 111 (478) T ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9988039999999999999999997899999999998392899999999999999999998899 Done!