Query psy9547
Match_columns 363
No_of_seqs 350 out of 2223
Neff 8.1
Searched_HMMs 46136
Date Fri Aug 16 22:21:48 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9547.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9547hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PRK10880 adenine DNA glycosyla 100.0 9.9E-83 2.2E-87 606.2 36.1 341 11-357 4-346 (350)
2 COG1194 MutY A/G-specific DNA 100.0 7.5E-80 1.6E-84 572.7 29.0 336 7-356 5-341 (342)
3 TIGR01084 mutY A/G-specific ad 100.0 2.1E-72 4.5E-77 521.3 29.4 273 13-289 2-275 (275)
4 PRK13910 DNA glycosylase MutY; 100.0 2E-67 4.3E-72 489.2 29.9 283 45-358 1-284 (289)
5 KOG2457|consensus 100.0 2.4E-63 5.2E-68 458.3 14.7 336 6-341 86-490 (555)
6 COG0177 Nth Predicted EndoIII- 100.0 7.7E-53 1.7E-57 370.9 21.7 206 6-219 2-208 (211)
7 PRK10702 endonuclease III; Pro 100.0 1.9E-51 4.1E-56 367.9 22.9 205 6-218 2-207 (211)
8 TIGR01083 nth endonuclease III 100.0 4.2E-44 9.2E-49 317.7 21.2 189 9-205 2-191 (191)
9 PRK13913 3-methyladenine DNA g 100.0 2.5E-37 5.3E-42 276.3 21.1 183 12-200 6-216 (218)
10 KOG1921|consensus 100.0 5.3E-37 1.2E-41 268.9 13.6 182 35-219 77-262 (286)
11 COG2231 Uncharacterized protei 100.0 6.5E-35 1.4E-39 251.1 19.2 205 4-218 2-214 (215)
12 smart00478 ENDO3c endonuclease 100.0 6.2E-31 1.4E-35 224.6 15.6 148 45-196 1-149 (149)
13 cd00056 ENDO3c endonuclease II 100.0 3.3E-30 7.2E-35 222.2 16.6 154 37-194 1-158 (158)
14 PRK01229 N-glycosylase/DNA lya 99.9 1E-26 2.2E-31 206.1 16.6 157 30-192 31-200 (208)
15 TIGR00588 ogg 8-oxoguanine DNA 99.9 2.5E-25 5.4E-30 211.0 17.6 156 34-194 117-304 (310)
16 TIGR03252 uncharacterized HhH- 99.9 5.5E-23 1.2E-27 176.4 12.6 103 34-136 16-136 (177)
17 TIGR00586 mutt mutator mutT pr 99.9 8.2E-23 1.8E-27 169.3 11.7 120 236-356 3-127 (128)
18 PF14815 NUDIX_4: NUDIX domain 99.9 6E-23 1.3E-27 167.3 9.9 112 241-353 1-114 (114)
19 PRK10776 nucleoside triphospha 99.9 3.7E-22 8.1E-27 165.2 12.2 121 236-357 3-128 (129)
20 PRK10546 pyrimidine (deoxy)nuc 99.9 3.9E-22 8.5E-27 167.0 11.9 122 235-358 2-128 (135)
21 cd03431 DNA_Glycosylase_C DNA 99.9 5.2E-22 1.1E-26 161.8 11.9 112 237-354 3-117 (118)
22 PRK10308 3-methyl-adenine DNA 99.9 4.8E-21 1E-25 179.2 17.9 139 33-189 110-272 (283)
23 COG0122 AlkA 3-methyladenine D 99.8 1.4E-20 3E-25 175.9 14.9 145 34-189 103-270 (285)
24 cd03425 MutT_pyrophosphohydrol 99.8 1.6E-19 3.5E-24 147.8 11.1 117 238-355 2-123 (124)
25 KOG2875|consensus 99.8 2E-19 4.2E-24 161.4 10.5 118 34-154 115-258 (323)
26 PRK08999 hypothetical protein; 99.8 3.6E-19 7.9E-24 170.0 12.4 122 234-356 2-128 (312)
27 KOG1918|consensus 99.8 6.9E-18 1.5E-22 146.5 11.6 166 17-194 57-243 (254)
28 PF00730 HhH-GPD: HhH-GPD supe 99.7 1.1E-17 2.3E-22 134.9 11.3 104 41-179 1-107 (108)
29 PRK15472 nucleoside triphospha 99.7 2.3E-16 5E-21 133.2 10.7 117 237-354 4-137 (141)
30 PRK15434 GDP-mannose mannosyl 99.6 8.3E-16 1.8E-20 132.1 9.9 114 239-354 19-150 (159)
31 cd04676 Nudix_Hydrolase_17 Mem 99.6 1.2E-15 2.6E-20 125.8 9.8 110 238-353 3-128 (129)
32 cd03427 MTH1 MutT homolog-1 (M 99.6 3.8E-15 8.2E-20 124.8 10.8 116 239-357 3-126 (137)
33 cd04669 Nudix_Hydrolase_11 Mem 99.6 1.6E-15 3.6E-20 124.7 8.3 101 240-347 3-119 (121)
34 cd04696 Nudix_Hydrolase_37 Mem 99.6 4.8E-15 1E-19 122.4 10.2 110 239-352 4-124 (125)
35 cd04673 Nudix_Hydrolase_15 Mem 99.6 5.3E-15 1.1E-19 121.2 8.1 107 240-349 3-121 (122)
36 cd03430 GDPMH GDP-mannose glyc 99.6 1.7E-14 3.8E-19 122.3 10.1 102 239-342 14-132 (144)
37 cd04691 Nudix_Hydrolase_32 Mem 99.6 1.5E-14 3.2E-19 118.3 8.8 100 239-341 2-107 (117)
38 cd04684 Nudix_Hydrolase_25 Con 99.6 3E-14 6.5E-19 117.6 10.6 110 240-352 3-127 (128)
39 PRK09438 nudB dihydroneopterin 99.5 1.9E-14 4.2E-19 122.4 9.4 119 238-359 8-146 (148)
40 cd04679 Nudix_Hydrolase_20 Mem 99.5 2.3E-14 4.9E-19 118.3 9.5 110 240-352 5-124 (125)
41 cd04699 Nudix_Hydrolase_39 Mem 99.5 2.2E-14 4.7E-19 118.6 8.8 106 239-345 3-117 (129)
42 cd04683 Nudix_Hydrolase_24 Mem 99.5 4E-14 8.6E-19 115.9 8.9 100 241-341 4-113 (120)
43 cd04682 Nudix_Hydrolase_23 Mem 99.5 2.6E-14 5.6E-19 117.6 7.8 105 239-345 3-117 (122)
44 cd03673 Ap6A_hydrolase Diadeno 99.5 6.9E-14 1.5E-18 115.9 9.5 113 239-356 3-130 (131)
45 cd04664 Nudix_Hydrolase_7 Memb 99.5 5.5E-14 1.2E-18 116.7 8.3 111 239-352 3-128 (129)
46 cd04681 Nudix_Hydrolase_22 Mem 99.5 7.2E-14 1.6E-18 116.0 8.6 112 240-354 4-128 (130)
47 cd04680 Nudix_Hydrolase_21 Mem 99.5 8.4E-14 1.8E-18 113.7 8.5 108 240-353 3-118 (120)
48 cd02883 Nudix_Hydrolase Nudix 99.5 7.1E-14 1.5E-18 113.4 7.5 111 240-352 3-122 (123)
49 PLN02325 nudix hydrolase 99.5 2E-13 4.2E-18 115.8 10.2 111 241-355 13-137 (144)
50 cd04695 Nudix_Hydrolase_36 Mem 99.5 2.5E-13 5.4E-18 113.2 10.4 109 246-357 11-129 (131)
51 cd04688 Nudix_Hydrolase_29 Mem 99.5 1.5E-13 3.2E-18 113.6 8.7 103 240-348 4-124 (126)
52 cd04671 Nudix_Hydrolase_13 Mem 99.5 1.1E-13 2.3E-18 114.2 7.8 105 239-349 2-116 (123)
53 cd04687 Nudix_Hydrolase_28 Mem 99.5 2E-13 4.3E-18 113.2 8.9 104 240-347 4-126 (128)
54 cd04678 Nudix_Hydrolase_19 Mem 99.5 5.8E-13 1.3E-17 110.4 11.6 102 239-342 4-117 (129)
55 cd04670 Nudix_Hydrolase_12 Mem 99.5 2.5E-13 5.4E-18 112.4 9.2 110 239-352 4-123 (127)
56 cd03675 Nudix_Hydrolase_2 Cont 99.5 3.7E-13 8.1E-18 112.3 10.2 114 241-358 4-130 (134)
57 cd04690 Nudix_Hydrolase_31 Mem 99.5 2.3E-13 4.9E-18 111.0 8.4 105 239-348 2-116 (118)
58 cd04511 Nudix_Hydrolase_4 Memb 99.5 4.4E-13 9.5E-18 111.5 10.0 102 240-346 16-125 (130)
59 PRK15393 NUDIX hydrolase YfcD; 99.4 4.6E-13 1E-17 117.8 10.1 120 238-358 38-167 (180)
60 cd03674 Nudix_Hydrolase_1 Memb 99.4 1.1E-12 2.4E-17 110.3 12.0 111 240-356 5-137 (138)
61 cd04667 Nudix_Hydrolase_10 Mem 99.4 3.1E-13 6.8E-18 109.4 8.2 101 242-352 4-111 (112)
62 cd04700 DR1025_like DR1025 fro 99.4 6.1E-13 1.3E-17 112.5 8.7 103 238-342 14-125 (142)
63 cd04697 Nudix_Hydrolase_38 Mem 99.4 6.6E-13 1.4E-17 109.9 7.9 101 240-341 3-111 (126)
64 cd04672 Nudix_Hydrolase_14 Mem 99.4 1.2E-12 2.6E-17 107.8 8.4 104 240-349 5-119 (123)
65 cd03426 CoAse Coenzyme A pyrop 99.4 6.8E-13 1.5E-17 114.2 6.5 103 242-345 7-121 (157)
66 cd04693 Nudix_Hydrolase_34 Mem 99.4 1.4E-12 3E-17 107.9 7.9 101 239-341 2-112 (127)
67 cd04661 MRP_L46 Mitochondrial 99.4 1.1E-12 2.5E-17 109.4 7.0 93 248-342 12-120 (132)
68 cd03429 NADH_pyrophosphatase N 99.4 2.1E-12 4.5E-17 107.7 8.1 112 240-356 3-125 (131)
69 cd03671 Ap4A_hydrolase_plant_l 99.3 4.9E-12 1.1E-16 107.6 10.0 114 239-357 5-145 (147)
70 cd03424 ADPRase_NUDT5 ADP-ribo 99.3 2.2E-12 4.8E-17 108.1 7.3 102 239-341 4-114 (137)
71 cd03428 Ap4A_hydrolase_human_l 99.3 6.6E-12 1.4E-16 104.1 9.3 110 240-355 5-128 (130)
72 cd04689 Nudix_Hydrolase_30 Mem 99.3 6.2E-12 1.4E-16 103.7 9.1 105 240-352 4-122 (125)
73 cd04694 Nudix_Hydrolase_35 Mem 99.3 3.2E-12 6.9E-17 108.2 7.5 101 240-341 4-130 (143)
74 cd03672 Dcp2p mRNA decapping e 99.3 9.5E-12 2.1E-16 105.6 9.9 116 239-360 3-143 (145)
75 PF00293 NUDIX: NUDIX domain; 99.3 2E-12 4.4E-17 107.1 4.8 117 239-356 4-133 (134)
76 cd02885 IPP_Isomerase Isopente 99.3 7.7E-12 1.7E-16 108.5 8.5 104 238-342 31-148 (165)
77 cd04677 Nudix_Hydrolase_18 Mem 99.3 8.4E-12 1.8E-16 103.7 8.3 102 239-345 9-125 (132)
78 TIGR02150 IPP_isom_1 isopenten 99.3 8.4E-12 1.8E-16 107.5 8.1 104 238-341 28-141 (158)
79 COG1051 ADP-ribose pyrophospha 99.3 2.6E-11 5.6E-16 102.8 10.3 118 237-356 9-138 (145)
80 cd04692 Nudix_Hydrolase_33 Mem 99.2 2.8E-11 6.1E-16 102.5 8.4 103 239-341 4-126 (144)
81 cd04666 Nudix_Hydrolase_9 Memb 99.2 3.6E-11 7.7E-16 99.0 8.5 88 249-341 15-114 (122)
82 PRK05379 bifunctional nicotina 99.2 5.7E-11 1.2E-15 114.6 10.7 112 242-356 208-338 (340)
83 cd04686 Nudix_Hydrolase_27 Mem 99.2 6.1E-11 1.3E-15 98.8 9.3 96 240-341 3-118 (131)
84 PRK00241 nudC NADH pyrophospha 99.2 6.7E-11 1.5E-15 109.5 10.4 99 242-346 137-242 (256)
85 cd04685 Nudix_Hydrolase_26 Mem 99.2 6.1E-11 1.3E-15 99.1 8.3 102 240-342 3-123 (133)
86 COG1059 Thermostable 8-oxoguan 99.2 1.5E-10 3.3E-15 98.9 10.6 137 34-176 37-184 (210)
87 PRK00714 RNA pyrophosphohydrol 99.2 7E-11 1.5E-15 101.5 8.3 114 239-357 10-149 (156)
88 PRK03759 isopentenyl-diphospha 99.1 2.6E-10 5.6E-15 100.7 9.4 114 240-353 37-169 (184)
89 cd04674 Nudix_Hydrolase_16 Mem 99.1 4.6E-10 1E-14 91.6 10.1 80 238-320 5-87 (118)
90 PRK10707 putative NUDIX hydrol 99.1 4.9E-10 1.1E-14 99.2 8.8 95 246-341 41-145 (190)
91 cd04663 Nudix_Hydrolase_6 Memb 99.0 1.3E-09 2.9E-14 89.7 9.2 98 240-342 3-116 (126)
92 PRK11762 nudE adenosine nucleo 99.0 7.6E-10 1.7E-14 97.8 7.3 95 245-340 55-157 (185)
93 cd04665 Nudix_Hydrolase_8 Memb 98.9 1.1E-08 2.4E-13 83.7 9.2 94 240-340 3-103 (118)
94 cd04662 Nudix_Hydrolase_5 Memb 98.8 1.8E-08 3.8E-13 82.8 9.0 45 249-293 15-65 (126)
95 cd03676 Nudix_hydrolase_3 Memb 98.8 1.8E-08 3.9E-13 88.6 8.6 99 243-341 40-157 (180)
96 PLN02709 nudix hydrolase 98.8 1.7E-08 3.6E-13 90.7 8.0 91 249-340 51-153 (222)
97 PF00633 HHH: Helix-hairpin-he 98.8 2.2E-09 4.7E-14 65.3 0.8 29 106-134 2-30 (30)
98 TIGR02705 nudix_YtkD nucleosid 98.8 3E-08 6.5E-13 84.7 8.0 96 239-341 26-126 (156)
99 PLN02552 isopentenyl-diphospha 98.6 1.3E-07 2.9E-12 86.7 9.3 119 238-357 57-225 (247)
100 TIGR00052 nudix-type nucleosid 98.6 1E-07 2.2E-12 84.2 6.8 102 239-341 46-164 (185)
101 COG0494 MutT NTP pyrophosphohy 98.5 9.9E-07 2.1E-11 73.2 9.1 96 249-348 24-140 (161)
102 cd03670 ADPRase_NUDT9 ADP-ribo 98.4 2.3E-06 5.1E-11 75.2 10.7 102 250-357 50-184 (186)
103 PRK10729 nudF ADP-ribose pyrop 98.3 1.8E-06 3.9E-11 77.3 7.5 102 239-341 51-170 (202)
104 PLN02791 Nudix hydrolase homol 98.1 7.7E-06 1.7E-10 85.8 9.1 102 239-340 34-156 (770)
105 PRK15009 GDP-mannose pyrophosp 98.1 1.1E-05 2.4E-10 71.5 7.9 102 239-341 47-165 (191)
106 PF10576 EndIII_4Fe-2S: Iron-s 98.1 9.5E-07 2.1E-11 46.1 0.5 17 198-214 1-17 (17)
107 KOG3069|consensus 97.9 1.9E-05 4.1E-10 70.4 6.2 99 249-348 58-169 (246)
108 COG2816 NPY1 NTP pyrophosphohy 97.8 2.2E-05 4.8E-10 72.4 4.6 100 239-344 146-253 (279)
109 PLN03143 nudix hydrolase; Prov 97.7 7.5E-05 1.6E-09 70.1 7.3 53 241-293 133-191 (291)
110 smart00525 FES FES domain. iro 97.5 3.9E-05 8.4E-10 45.1 1.3 22 197-218 1-22 (26)
111 KOG3084|consensus 97.1 0.00076 1.6E-08 62.8 5.4 57 239-296 189-248 (345)
112 COG1443 Idi Isopentenyldiphosp 97.0 0.0013 2.9E-08 56.2 5.1 93 242-341 38-152 (185)
113 KOG2839|consensus 96.9 0.002 4.4E-08 53.5 5.6 99 238-340 10-123 (145)
114 COG4119 Predicted NTP pyrophos 96.6 0.0096 2.1E-07 48.2 7.2 108 250-359 19-151 (161)
115 smart00278 HhH1 Helix-hairpin- 95.8 0.0067 1.5E-07 35.4 1.8 20 116-135 2-21 (26)
116 KOG0648|consensus 95.5 0.0044 9.5E-08 57.8 0.6 103 239-344 117-233 (295)
117 PF14716 HHH_8: Helix-hairpin- 94.9 0.16 3.5E-06 37.0 7.1 55 78-134 10-66 (68)
118 PF03352 Adenine_glyco: Methyl 94.6 0.28 6.2E-06 42.8 9.1 71 34-104 24-99 (179)
119 KOG3041|consensus 94.6 0.16 3.5E-06 44.5 7.5 81 239-321 76-164 (225)
120 PF12826 HHH_2: Helix-hairpin- 94.5 0.048 1E-06 39.3 3.6 36 99-134 16-54 (64)
121 PRK10353 3-methyl-adenine DNA 93.8 1.1 2.3E-05 39.5 11.1 74 31-104 26-104 (187)
122 PLN02839 nudix hydrolase 93.4 0.32 6.9E-06 47.1 7.8 112 247-358 216-349 (372)
123 TIGR00426 competence protein C 93.0 0.22 4.8E-06 36.2 4.8 57 69-134 8-66 (69)
124 PRK00024 hypothetical protein; 92.2 0.15 3.2E-06 46.4 3.6 60 72-133 22-84 (224)
125 TIGR00608 radc DNA repair prot 92.2 0.14 3E-06 46.4 3.4 59 73-133 13-78 (218)
126 PF09674 DUF2400: Protein of u 91.8 0.098 2.1E-06 47.7 1.9 55 140-195 176-230 (232)
127 PRK02515 psbU photosystem II c 90.8 0.63 1.4E-05 38.2 5.5 58 64-134 48-106 (132)
128 PRK14601 ruvA Holliday junctio 90.8 0.29 6.3E-06 43.0 3.8 27 112-138 105-131 (183)
129 TIGR02757 conserved hypothetic 90.5 0.14 3E-06 46.5 1.6 55 140-195 173-227 (229)
130 PRK13901 ruvA Holliday junctio 90.4 0.32 7E-06 43.1 3.8 35 97-138 96-130 (196)
131 COG0353 RecR Recombinational D 90.1 0.15 3.2E-06 44.8 1.4 28 111-138 8-35 (198)
132 PRK14606 ruvA Holliday junctio 90.0 0.36 7.8E-06 42.6 3.8 27 112-138 105-131 (188)
133 TIGR00624 tag DNA-3-methyladen 89.6 3.1 6.7E-05 36.4 9.2 74 31-104 25-103 (179)
134 TIGR00615 recR recombination p 89.5 0.3 6.6E-06 43.2 2.9 26 112-137 8-33 (195)
135 PRK14602 ruvA Holliday junctio 89.5 0.41 9E-06 42.8 3.8 27 112-138 106-132 (203)
136 PRK00076 recR recombination pr 89.4 0.31 6.6E-06 43.2 2.9 27 112-138 8-34 (196)
137 PRK14603 ruvA Holliday junctio 89.3 0.44 9.6E-06 42.4 3.8 27 112-138 104-130 (197)
138 PRK14604 ruvA Holliday junctio 89.2 0.46 9.9E-06 42.2 3.8 27 112-138 105-131 (195)
139 COG0632 RuvA Holliday junction 89.0 0.45 9.8E-06 42.4 3.6 27 112-138 105-131 (201)
140 PRK13844 recombination protein 88.9 0.35 7.7E-06 42.9 2.9 26 112-137 12-37 (200)
141 PRK00116 ruvA Holliday junctio 88.2 0.72 1.6E-05 40.9 4.4 23 112-134 105-127 (192)
142 smart00483 POLXc DNA polymeras 88.2 2.2 4.7E-05 41.2 8.1 31 106-138 80-110 (334)
143 KOG4548|consensus 88.1 0.87 1.9E-05 41.5 4.8 96 249-346 139-251 (263)
144 PRK14605 ruvA Holliday junctio 87.7 0.45 9.8E-06 42.3 2.8 23 112-134 105-127 (194)
145 PF12836 HHH_3: Helix-hairpin- 87.2 0.6 1.3E-05 33.6 2.8 23 112-134 11-33 (65)
146 PRK08609 hypothetical protein; 86.9 3.6 7.9E-05 42.7 9.3 101 5-132 1-105 (570)
147 PRK14600 ruvA Holliday junctio 86.8 0.7 1.5E-05 40.7 3.4 25 113-138 106-130 (186)
148 PF14520 HHH_5: Helix-hairpin- 86.8 0.65 1.4E-05 32.8 2.7 26 114-139 4-29 (60)
149 KOG4195|consensus 86.5 0.76 1.6E-05 41.0 3.4 105 250-360 140-272 (275)
150 PF11731 Cdd1: Pathogenicity l 86.3 0.9 1.9E-05 35.2 3.4 28 113-140 10-37 (93)
151 PRK14605 ruvA Holliday junctio 86.2 1 2.2E-05 40.0 4.2 21 114-134 72-92 (194)
152 PRK00116 ruvA Holliday junctio 85.9 0.72 1.6E-05 40.9 3.1 24 114-138 72-95 (192)
153 PF12836 HHH_3: Helix-hairpin- 85.8 0.79 1.7E-05 32.9 2.8 53 71-132 8-61 (65)
154 TIGR01259 comE comEA protein. 85.6 1.8 3.8E-05 35.4 5.0 61 66-134 57-117 (120)
155 COG4112 Predicted phosphoester 85.6 5 0.00011 34.2 7.7 105 239-344 63-190 (203)
156 PF05559 DUF763: Protein of un 85.5 2.1 4.6E-05 40.5 6.1 43 93-139 251-296 (319)
157 PRK02515 psbU photosystem II c 85.2 1.8 4E-05 35.6 4.8 29 105-133 48-79 (132)
158 COG1555 ComEA DNA uptake prote 84.9 0.78 1.7E-05 38.9 2.7 23 113-135 95-117 (149)
159 PRK07945 hypothetical protein; 84.8 1.8 4E-05 41.8 5.6 58 77-135 8-69 (335)
160 COG2818 Tag 3-methyladenine DN 84.8 4.3 9.4E-05 35.5 7.2 71 34-104 30-105 (188)
161 TIGR01259 comE comEA protein. 84.1 0.97 2.1E-05 36.9 2.9 23 113-135 66-88 (120)
162 PF14520 HHH_5: Helix-hairpin- 83.5 2.3 5E-05 29.9 4.3 21 113-133 36-56 (60)
163 COG2003 RadC DNA repair protei 83.3 1.8 4E-05 39.0 4.5 58 73-132 23-83 (224)
164 PF02371 Transposase_20: Trans 83.2 0.99 2.2E-05 34.4 2.4 28 115-142 2-29 (87)
165 PF11798 IMS_HHH: IMS family H 82.5 1.1 2.3E-05 27.5 1.9 15 117-131 13-27 (32)
166 TIGR00084 ruvA Holliday juncti 82.0 1.1 2.4E-05 39.6 2.6 24 112-135 104-127 (191)
167 KOG0142|consensus 81.2 2 4.3E-05 37.9 3.8 112 242-353 57-200 (225)
168 TIGR00575 dnlj DNA ligase, NAD 81.1 5.2 0.00011 42.3 7.6 69 58-133 445-516 (652)
169 PRK13482 DNA integrity scannin 80.9 2 4.3E-05 41.5 4.0 36 96-131 297-335 (352)
170 PRK07956 ligA NAD-dependent DN 80.1 5 0.00011 42.5 7.1 39 59-100 459-500 (665)
171 PRK14601 ruvA Holliday junctio 80.1 1.9 4.2E-05 37.8 3.4 44 114-157 72-116 (183)
172 TIGR00084 ruvA Holliday juncti 79.1 1.5 3.3E-05 38.8 2.5 21 114-134 71-91 (191)
173 TIGR00608 radc DNA repair prot 78.9 9 0.0002 34.7 7.4 63 38-102 17-86 (218)
174 smart00483 POLXc DNA polymeras 78.5 5.1 0.00011 38.7 6.1 43 91-135 26-68 (334)
175 COG1555 ComEA DNA uptake prote 78.0 4.6 9.9E-05 34.3 5.0 55 71-133 91-145 (149)
176 cd00141 NT_POLXc Nucleotidyltr 78.0 4.7 0.0001 38.5 5.6 24 109-132 79-102 (307)
177 smart00279 HhH2 Helix-hairpin- 77.4 1.8 4E-05 27.3 1.8 16 118-133 19-34 (36)
178 PRK00024 hypothetical protein; 77.3 8.7 0.00019 34.9 6.9 64 38-103 27-93 (224)
179 PRK14602 ruvA Holliday junctio 76.5 3.1 6.7E-05 37.2 3.7 21 114-134 73-93 (203)
180 KOG2841|consensus 76.5 9.2 0.0002 34.7 6.6 69 27-102 179-250 (254)
181 PF10391 DNA_pol_lambd_f: Fing 76.5 2.6 5.7E-05 28.9 2.5 23 114-136 1-23 (52)
182 PRK14606 ruvA Holliday junctio 75.1 3.1 6.8E-05 36.7 3.3 21 114-134 72-92 (188)
183 PRK14350 ligA NAD-dependent DN 74.6 13 0.00029 39.4 8.3 38 57-97 448-488 (669)
184 PF03118 RNA_pol_A_CTD: Bacter 74.0 2 4.4E-05 31.1 1.5 48 83-132 14-61 (66)
185 COG1796 POL4 DNA polymerase IV 73.9 5.3 0.00011 38.0 4.6 57 78-135 14-73 (326)
186 PLN03060 inositol phosphatase- 72.2 47 0.001 29.7 10.0 110 13-133 5-127 (206)
187 KOG2457|consensus 71.7 17 0.00038 35.4 7.5 153 120-275 173-378 (555)
188 TIGR00426 competence protein C 71.6 4.2 9.1E-05 29.4 2.8 22 113-134 14-36 (69)
189 KOG2534|consensus 71.5 7 0.00015 37.1 4.8 66 91-159 34-107 (353)
190 PRK13482 DNA integrity scannin 71.4 9.9 0.00021 36.8 6.0 47 56-104 298-344 (352)
191 PRK14603 ruvA Holliday junctio 71.4 3.1 6.8E-05 37.0 2.4 43 114-156 71-114 (197)
192 COG2003 RadC DNA repair protei 71.1 19 0.00042 32.6 7.3 66 38-104 27-94 (224)
193 PRK14351 ligA NAD-dependent DN 69.6 17 0.00037 38.7 7.8 66 58-133 475-546 (689)
194 cd00141 NT_POLXc Nucleotidyltr 69.2 3.7 8E-05 39.2 2.6 38 91-130 23-60 (307)
195 PF04904 NCD1: NAB conserved r 68.2 16 0.00035 27.1 5.1 52 51-102 18-73 (82)
196 PRK08097 ligB NAD-dependent DN 67.4 13 0.00028 38.5 6.2 75 57-134 437-539 (562)
197 cd00080 HhH2_motif Helix-hairp 66.9 4 8.7E-05 30.2 1.8 18 117-134 24-41 (75)
198 PRK14604 ruvA Holliday junctio 66.9 4.4 9.5E-05 36.0 2.4 44 114-157 72-116 (195)
199 PRK13901 ruvA Holliday junctio 66.1 4.6 9.9E-05 35.9 2.3 21 114-134 71-91 (196)
200 COG1415 Uncharacterized conser 65.9 21 0.00046 34.3 6.8 67 111-179 274-345 (373)
201 PF00416 Ribosomal_S13: Riboso 65.6 7.1 0.00015 31.1 3.1 22 114-135 14-35 (107)
202 PRK14600 ruvA Holliday junctio 64.7 4.9 0.00011 35.4 2.2 21 114-134 72-92 (186)
203 KOG2937|consensus 62.1 6.2 0.00013 37.6 2.5 145 183-339 35-189 (348)
204 PRK14973 DNA topoisomerase I; 61.9 25 0.00054 38.9 7.4 88 64-154 822-916 (936)
205 PF11264 ThylakoidFormat: Thyl 61.3 1.3E+02 0.0027 27.2 12.0 107 15-132 4-124 (216)
206 PRK08609 hypothetical protein; 61.1 14 0.00029 38.6 5.1 56 78-135 12-68 (570)
207 COG1379 PHP family phosphoeste 59.9 56 0.0012 31.3 8.3 49 34-82 303-352 (403)
208 PLN00047 photosystem II biogen 59.6 97 0.0021 29.1 9.8 111 11-132 56-179 (283)
209 COG0632 RuvA Holliday junction 58.1 7.3 0.00016 34.7 2.2 43 114-156 72-115 (201)
210 PF14490 HHH_4: Helix-hairpin- 57.3 23 0.0005 27.3 4.6 53 75-134 7-65 (94)
211 PF12826 HHH_2: Helix-hairpin- 57.2 18 0.00038 25.8 3.7 38 58-97 16-53 (64)
212 PRK14552 C/D box methylation g 57.2 37 0.00081 33.8 7.2 18 7-24 152-169 (414)
213 COG0272 Lig NAD-dependent DNA 56.4 12 0.00026 39.2 3.7 23 112-134 540-562 (667)
214 PF14229 DUF4332: Domain of un 55.5 31 0.00066 28.1 5.3 75 65-154 16-92 (122)
215 PRK12766 50S ribosomal protein 55.0 7 0.00015 35.5 1.5 21 117-137 5-25 (232)
216 TIGR03060 PS_II_psb29 photosys 54.7 89 0.0019 28.1 8.5 96 13-119 7-115 (214)
217 PF13174 TPR_6: Tetratricopept 54.2 22 0.00047 20.6 3.3 28 40-67 5-32 (33)
218 COG4277 Predicted DNA-binding 52.7 10 0.00022 35.8 2.3 22 113-134 328-349 (404)
219 PRK13266 Thf1-like protein; Re 52.4 99 0.0021 28.1 8.4 96 13-119 7-115 (225)
220 COG1948 MUS81 ERCC4-type nucle 52.1 34 0.00075 31.6 5.5 21 114-134 213-233 (254)
221 TIGR00375 conserved hypothetic 51.9 23 0.0005 34.7 4.7 49 34-82 296-345 (374)
222 PF00570 HRDC: HRDC domain Blo 49.6 28 0.00061 24.6 3.8 21 110-130 39-59 (68)
223 PF01367 5_3_exonuc: 5'-3' exo 49.3 2.5 5.4E-05 33.4 -2.0 19 116-134 19-37 (101)
224 PF13869 NUDIX_2: Nucleotide h 48.9 25 0.00055 30.9 4.0 48 239-291 46-98 (188)
225 PF13543 KSR1-SAM: SAM like do 46.8 1.1E+02 0.0024 25.3 7.2 87 5-98 28-124 (129)
226 TIGR01954 nusA_Cterm_rpt trans 46.1 32 0.0007 22.6 3.4 36 63-101 12-48 (50)
227 PF10343 DUF2419: Protein of u 45.1 61 0.0013 30.6 6.1 65 34-111 45-117 (287)
228 PRK07758 hypothetical protein; 43.7 19 0.00041 28.0 2.1 21 112-132 64-84 (95)
229 PRK12766 50S ribosomal protein 41.9 44 0.00096 30.4 4.5 39 59-100 17-57 (232)
230 COG1796 POL4 DNA polymerase IV 40.3 97 0.0021 29.7 6.7 65 57-135 80-148 (326)
231 COG1948 MUS81 ERCC4-type nucle 39.3 37 0.0008 31.4 3.7 21 114-134 181-201 (254)
232 PRK14976 5'-3' exonuclease; Pr 39.3 18 0.00038 34.1 1.7 18 117-134 193-210 (281)
233 PRK14667 uvrC excinuclease ABC 38.9 47 0.001 34.6 4.8 23 111-134 542-564 (567)
234 PF10440 WIYLD: Ubiquitin-bind 38.6 55 0.0012 23.6 3.7 39 6-49 24-63 (65)
235 PRK09482 flap endonuclease-lik 38.2 19 0.00042 33.4 1.7 18 117-134 184-201 (256)
236 CHL00137 rps13 ribosomal prote 37.3 21 0.00046 29.1 1.6 22 114-135 16-37 (122)
237 PTZ00134 40S ribosomal protein 37.2 19 0.0004 30.8 1.3 29 107-135 19-50 (154)
238 COG1491 Predicted RNA-binding 37.1 25 0.00054 30.8 2.1 26 114-139 129-154 (202)
239 cd00008 53EXOc 5'-3' exonuclea 35.2 23 0.0005 32.4 1.7 18 117-134 185-202 (240)
240 COG0864 NikR Predicted transcr 35.0 49 0.0011 27.6 3.4 32 77-108 17-48 (136)
241 smart00475 53EXOc 5'-3' exonuc 34.9 24 0.00052 32.8 1.8 18 117-134 188-205 (259)
242 KOG2573|consensus 34.7 85 0.0018 31.0 5.4 73 7-102 180-253 (498)
243 TIGR01448 recD_rel helicase, p 34.3 53 0.0011 35.3 4.5 62 68-133 69-135 (720)
244 KOG1338|consensus 34.2 1.9E+02 0.0041 28.7 7.6 96 35-139 88-194 (466)
245 PRK05179 rpsM 30S ribosomal pr 33.2 28 0.0006 28.5 1.7 22 114-135 16-37 (122)
246 PRK04053 rps13p 30S ribosomal 33.1 32 0.00069 29.2 2.1 29 107-135 14-45 (149)
247 TIGR01448 recD_rel helicase, p 32.6 41 0.00089 36.1 3.3 17 118-134 150-166 (720)
248 smart00611 SEC63 Domain of unk 32.4 84 0.0018 29.6 5.2 44 85-132 125-168 (312)
249 PRK00254 ski2-like helicase; P 31.5 52 0.0011 35.2 3.9 42 94-139 628-669 (720)
250 PF04919 DUF655: Protein of un 31.3 33 0.00072 29.9 2.0 26 114-139 115-140 (181)
251 PF14056 DUF4250: Domain of un 31.2 83 0.0018 21.9 3.5 30 60-89 15-47 (55)
252 KOG3835|consensus 31.1 76 0.0016 31.0 4.4 56 50-105 19-78 (495)
253 PF07803 GSG-1: GSG1-like prot 31.1 75 0.0016 25.6 3.7 53 202-267 40-92 (118)
254 PF11372 DUF3173: Domain of un 31.1 1.6E+02 0.0034 20.8 4.9 27 75-105 5-32 (59)
255 COG3547 Transposase and inacti 30.9 2E+02 0.0044 26.6 7.5 37 94-132 169-205 (303)
256 COG1255 Uncharacterized protei 30.7 57 0.0012 26.5 3.0 85 121-211 20-104 (129)
257 PF13297 Telomere_Sde2_2: Telo 30.4 96 0.0021 22.0 3.7 34 63-96 4-43 (60)
258 TIGR00596 rad1 DNA repair prot 30.3 86 0.0019 34.2 5.3 41 57-100 769-809 (814)
259 TIGR03631 bact_S13 30S ribosom 30.2 31 0.00066 27.8 1.5 21 114-134 14-34 (113)
260 COG0099 RpsM Ribosomal protein 30.0 38 0.00083 27.5 1.9 28 108-135 7-37 (121)
261 PRK14666 uvrC excinuclease ABC 29.9 99 0.0022 32.9 5.5 23 112-134 666-688 (694)
262 PF04891 NifQ: NifQ; InterPro 29.7 1.9E+02 0.0042 25.0 6.4 75 123-214 83-165 (167)
263 COG0258 Exo 5'-3' exonuclease 29.1 60 0.0013 30.8 3.6 14 120-133 203-216 (310)
264 TIGR03629 arch_S13P archaeal r 29.1 33 0.00071 28.9 1.5 22 114-135 20-41 (144)
265 PF01706 FliG_C: FliG C-termin 29.0 83 0.0018 24.8 3.8 55 57-111 11-65 (110)
266 PRK00558 uvrC excinuclease ABC 28.9 62 0.0013 33.9 3.8 20 114-133 542-561 (598)
267 PRK14672 uvrC excinuclease ABC 28.8 78 0.0017 33.6 4.5 32 112-143 637-668 (691)
268 PF13751 DDE_Tnp_1_6: Transpos 28.8 26 0.00056 28.2 0.8 16 202-217 30-45 (125)
269 PRK04460 nickel responsive reg 28.6 73 0.0016 26.6 3.5 29 77-105 15-43 (137)
270 PF09373 PMBR: Pseudomurein-bi 28.1 1E+02 0.0022 18.8 3.2 22 5-26 5-26 (33)
271 PRK00254 ski2-like helicase; P 27.9 1E+02 0.0022 33.0 5.4 23 112-134 675-697 (720)
272 COG1623 Predicted nucleic-acid 27.1 1E+02 0.0022 29.2 4.4 36 60-98 308-344 (349)
273 PRK02967 nickel responsive reg 26.9 80 0.0017 26.4 3.5 30 77-106 14-43 (139)
274 TIGR02238 recomb_DMC1 meiotic 26.7 1.5E+02 0.0032 28.4 5.7 44 58-104 14-59 (313)
275 TIGR02236 recomb_radA DNA repa 26.5 1.4E+02 0.003 28.1 5.6 41 57-100 11-53 (310)
276 TIGR02793 nikR nickel-responsi 26.5 83 0.0018 25.9 3.5 29 77-105 13-41 (129)
277 PRK00630 nickel responsive reg 26.4 84 0.0018 26.6 3.5 29 77-105 23-51 (148)
278 PRK07758 hypothetical protein; 26.0 2.1E+02 0.0045 22.3 5.3 45 55-104 44-89 (95)
279 COG1623 Predicted nucleic-acid 25.5 50 0.0011 31.1 2.2 84 49-132 247-342 (349)
280 COG1498 SIK1 Protein implicate 25.5 2.6E+02 0.0057 27.6 7.1 41 7-47 129-170 (395)
281 COG1468 CRISPR-associated prot 25.5 38 0.00083 29.9 1.4 17 200-216 172-188 (190)
282 PF03686 UPF0146: Uncharacteri 25.3 16 0.00034 30.1 -1.0 74 122-197 21-94 (127)
283 PRK01002 nickel responsive reg 24.5 94 0.002 26.0 3.5 30 77-106 17-46 (141)
284 PF11417 Inhibitor_G39P: Loade 24.2 1.5E+02 0.0034 21.6 4.1 67 5-73 1-70 (71)
285 PRK01172 ski2-like helicase; P 23.8 1.6E+02 0.0035 31.2 6.0 38 65-105 633-671 (674)
286 PRK02362 ski2-like helicase; P 23.5 1.1E+02 0.0024 32.8 4.7 38 92-133 633-670 (737)
287 PF05166 YcgL: YcgL domain; I 23.4 50 0.0011 24.5 1.4 22 68-89 47-68 (74)
288 PF14475 Mso1_Sec1_bdg: Sec1-b 23.3 92 0.002 20.3 2.4 21 9-29 18-38 (41)
289 PF02467 Whib: Transcription f 23.2 21 0.00045 25.8 -0.6 16 204-219 31-46 (66)
290 PRK14670 uvrC excinuclease ABC 23.1 1.3E+02 0.0028 31.4 4.9 23 112-134 543-565 (574)
291 KOG2841|consensus 22.8 86 0.0019 28.6 3.0 24 111-134 223-246 (254)
292 PF14579 HHH_6: Helix-hairpin- 22.7 66 0.0014 24.4 2.1 26 114-139 26-51 (90)
293 PRK14670 uvrC excinuclease ABC 21.8 1.3E+02 0.0029 31.4 4.6 36 60-98 529-565 (574)
294 PF08823 PG_binding_2: Putativ 21.8 70 0.0015 23.6 1.9 25 66-90 5-32 (74)
295 PRK01151 rps17E 30S ribosomal 21.7 71 0.0015 22.5 1.8 22 91-112 4-25 (58)
296 PHA00439 exonuclease 21.1 56 0.0012 30.9 1.6 19 115-134 188-206 (286)
297 PF03755 YicC_N: YicC-like fam 21.1 1.4E+02 0.0031 25.2 4.1 35 91-125 83-117 (159)
298 smart00540 LEM in nuclear memb 20.8 1.8E+02 0.0039 19.2 3.5 33 70-102 2-41 (44)
299 PF14443 DBC1: DBC1 20.7 3.9E+02 0.0086 21.9 6.2 46 250-296 9-62 (126)
300 PF02889 Sec63: Sec63 Brl doma 20.5 1.8E+02 0.004 27.2 5.1 44 86-133 123-166 (314)
No 1
>PRK10880 adenine DNA glycosylase; Provisional
Probab=100.00 E-value=9.9e-83 Score=606.21 Aligned_cols=341 Identities=31% Similarity=0.552 Sum_probs=303.7
Q ss_pred HHHHHHHHHHHhhcCC-CCCCCCCCChHHHHHHHHHhhhcchhhHHHHHHHHHHhCCCHHHHhcCCHHHHHHHHhccCch
Q psy9547 11 LAFQESILTWYKQNAR-QLPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAFDTEDNVLKMWEGLGYY 89 (363)
Q Consensus 11 ~~~~~~l~~~~~~~~~-~lpw~~~~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~pt~~~la~a~~~el~~~~~~lG~~ 89 (363)
..|.+.|++||..++| ++|||.+.|||++|||+||+|||++++|.++|.+|+++|||+++||++++++|+++|+++|||
T Consensus 4 ~~~~~~ll~W~~~~~r~~lpWr~~~dpy~ilVseILlQQT~v~~v~~~~~rl~~~fPt~~~La~a~~eel~~~~~glGyy 83 (350)
T PRK10880 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYY 83 (350)
T ss_pred HHHHHHHHHHHHHcCCCCCCCCCCCCHHHHHHHHHHHhhccHHHHHHHHHHHHHHCcCHHHHHCcCHHHHHHHHHcCChH
Confidence 5689999999999997 999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHHHhcCCccccccchHHHHHhhhcccccCcCCcccHHH
Q psy9547 90 SRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTKVV 169 (363)
Q Consensus 90 ~rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~~~~~~~~~~vD~~v~Rvl~Rl~~~~~~~~~~~~~~~ 169 (363)
+||++|+++|+.|+++|+|.+|.++++|++|||||+|||++||+|+||+++++||+||.||++|+++++..+......+.
T Consensus 84 ~RAr~L~~~A~~i~~~~~g~~p~~~~~L~~LpGIG~~TA~aIl~~af~~~~~iVD~nV~RV~~Rl~~i~~~~~~~~~~~~ 163 (350)
T PRK10880 84 ARARNLHKAAQQVATLHGGEFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENR 163 (350)
T ss_pred HHHHHHHHHHHHHHHHhCCCchhhHHHHhcCCCccHHHHHHHHHHHCCCCeecccHHHHHHHHHHhcccCCCChHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999988765555566677
Q ss_pred HHHHHHhcCCcccHHHHHHHHHHHhhhhCcCCCCCCCCCCCcccChhhhcCCcccCCcccCCccccceeEEEEEEEeCCc
Q psy9547 170 YETLVSKTMSQTNAREFNQALMDLGATVCLFKNPKCKECPLSRFCSAYKNNTIENFPVKLGKTKVKDVFLLTVVVKTDTN 249 (363)
Q Consensus 170 ~~~~~~~~~p~~~~~~~~~~l~~~G~~~C~~~~P~C~~Cpl~~~C~~~~~~~~~~~p~k~~k~~~~~~~v~~~ii~~~~~ 249 (363)
+++.++.++|.+.+++||++||+||+++|+|++|+|+.|||++.|.+++.+.+..+|+|++|++.+.....++++.++ |
T Consensus 164 l~~~~~~l~p~~~~~~~nqalm~lGa~vC~p~~P~C~~Cpl~~~C~~~~~~~~~~~P~k~~k~~~~~~~~~~~~~~~~-~ 242 (350)
T PRK10880 164 LWQLSEQVTPAVGVERFNQAMMDLGAMVCTRSKPKCELCPLQNGCIAYANHSWALYPGKKPKQTLPERTGYFLLLQHG-D 242 (350)
T ss_pred HHHHHHHhCChhHHHHHHHHHHHhhHHhccCCCCCCCCCccHhhhHHHHcCCHhhCCCCCCCCCCCeEEEEEEEEEEC-C
Confidence 888899999999999999999999999999999999999999999999999999999998888877777777777655 8
Q ss_pred eEEEeecCCCccccccccccccCCCCCchHHHHHHHhhCCCcccccccccccEEEEccceEEEEEEEEEEecCCCC-CCC
Q psy9547 250 KYLIQKRPTTGLLSNFYMFLSFESDTPYTSQVEFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMDVYSGTAKEKTI-PAN 328 (363)
Q Consensus 250 ~vLl~~R~~~~~~~glWEFPggk~e~~~~~alrEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~~~~~~~~~~~-~~~ 328 (363)
++|+.|||++|+|+||||||+... ++.+++..+++|+...... .+.+++|.|||++|+|++|.+....... ...
T Consensus 243 ~~~l~~r~~~gl~~gl~~fP~~~~----~~~~~~~~~~~~~~~~~~~-~~~~~~H~fTH~~~~~~~~~~~~~~~~~~~~~ 317 (350)
T PRK10880 243 EVWLEQRPPSGLWGGLFCFPQFAD----EEELRQWLAQRGIAADNLT-QLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDE 317 (350)
T ss_pred EEEEEECCccChhhccccCCCCcc----hhhHHHHHHhcCCchhhhc-ccCceEEEEeeEEEEEEEEEEEccccccccCC
Confidence 999999999999999999998632 2344444467787533334 5789999999999999999887653211 112
Q ss_pred CCCeeecccccCcCCCChHHHHHHHHHHh
Q psy9547 329 KTYKLITETQMKKYAFPVPYQKVWKLFTK 357 (363)
Q Consensus 329 ~~~~W~~~~el~~~~~~~a~~~il~~l~~ 357 (363)
.+..|++++++.+++||++++|+++.+..
T Consensus 318 ~~~~w~~~~~~~~~~~p~~~~k~l~~~~~ 346 (350)
T PRK10880 318 GNGLWYNLAQPPSVGLAAPVERLLQQLRT 346 (350)
T ss_pred cCCeEechHHhcccCCcHHHHHHHHHhcc
Confidence 34579999999999999999999987653
No 2
>COG1194 MutY A/G-specific DNA glycosylase [DNA replication, recombination, and repair]
Probab=100.00 E-value=7.5e-80 Score=572.65 Aligned_cols=336 Identities=42% Similarity=0.797 Sum_probs=304.0
Q ss_pred HHHHHHHHHHHHHHHhhcCCCCCCCCCCChHHHHHHHHHhhhcchhhHHHHHHHHHHhCCCHHHHhcCCHHHHHHHHhcc
Q psy9547 7 AKEILAFQESILTWYKQNARQLPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAFDTEDNVLKMWEGL 86 (363)
Q Consensus 7 ~~~~~~~~~~l~~~~~~~~~~lpw~~~~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~pt~~~la~a~~~el~~~~~~l 86 (363)
...+..+.+.|++||+.++|+||||.+.|||.+|||+||+|||++++|.++|.+|+++|||+++||+|+.+||.++|.|+
T Consensus 5 ~~~~~~~~~~ll~Wy~~~~R~LPWR~~~~PY~VwvSEiMLQQT~v~~Vi~yy~~fl~rfPti~~LA~A~~~evl~~W~gL 84 (342)
T COG1194 5 DGDIEKFQEALLDWYDKNGRDLPWRETKDPYRVWVSEIMLQQTQVATVIPYYERFLERFPTIKALAAAPEDEVLKAWEGL 84 (342)
T ss_pred hhhhHHHHHHHHHHHHHhCCcCCCCCCCCcceehhHHHHhhhccHhhhhhhHHHHHHhCCCHHHHhcCCHHHHHHHHHhc
Confidence 34566789999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CchHHHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHHHhcCCccccccchHHHHHhhhcccccCcCCccc
Q psy9547 87 GYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKT 166 (363)
Q Consensus 87 G~~~rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~~~~~~~~~~vD~~v~Rvl~Rl~~~~~~~~~~~~ 166 (363)
|||+||++|+++|+.|+++|+|.+|++.++|.+|||||+|||++||+|+||++.++||+||.||++|++.++.++....+
T Consensus 85 GYysRArnL~~~A~~v~~~~~G~~P~~~~~l~~LpGiG~yTa~Ail~~a~~~~~~~lDgNV~RVl~R~f~i~~~~~~~~~ 164 (342)
T COG1194 85 GYYSRARNLHKAAQEVVERHGGEFPDDEEELAALPGVGPYTAGAILSFAFNQPEPVLDGNVKRVLSRLFAISGDIGKPKT 164 (342)
T ss_pred ChHHHHHHHHHHHHHHHHHcCCCCCCCHHHHHhCCCCcHHHHHHHHHHHhCCCCceeecchheeehhhhcccccccccch
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999988777788
Q ss_pred HHHHHHHHHhcCCccc-HHHHHHHHHHHhhhhCcCCCCCCCCCCCcccChhhhcCCcccCCcccCCccccceeEEEEEEE
Q psy9547 167 KVVYETLVSKTMSQTN-AREFNQALMDLGATVCLFKNPKCKECPLSRFCSAYKNNTIENFPVKLGKTKVKDVFLLTVVVK 245 (363)
Q Consensus 167 ~~~~~~~~~~~~p~~~-~~~~~~~l~~~G~~~C~~~~P~C~~Cpl~~~C~~~~~~~~~~~p~k~~k~~~~~~~v~~~ii~ 245 (363)
.+.++..++.+++++. +++|||+|||+|+.||++++|+|+.|||++.|.+++.+.++.+|+|++|++++ ...++.+++
T Consensus 165 ~~~~~~~~~~ll~p~~~~~~fnqammdlGA~ICt~~~P~C~~CPl~~~c~a~~~g~~~~~P~k~~k~~~~-~~~~~~~~~ 243 (342)
T COG1194 165 KKELWELAEQLLTPDRRPGDFNQAMMDLGATICTAKKPKCSLCPLRDNCAAYRNGTPEKYPVKKPKKKLP-RRFAAFLIL 243 (342)
T ss_pred hHHHHHHHHHhcCCCCChHHHHHHHHHhhhHhhcCCCCCCCcCcchHHHHHHHcCCcccCCCcCcccccc-hheeeEEEE
Confidence 9999999999665555 89999999999999999999999999999999999999999999999998888 777888888
Q ss_pred eCCceEEEeecCCCccccccccccccCCCCCchHHHHHHHhhCCCcccccccccccEEEEccceEEEEEEEEEEecCCCC
Q psy9547 246 TDTNKYLIQKRPTTGLLSNFYMFLSFESDTPYTSQVEFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMDVYSGTAKEKTI 325 (363)
Q Consensus 246 ~~~~~vLl~~R~~~~~~~glWEFPggk~e~~~~~alrEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~~~~~~~~~~~ 325 (363)
+.+|++++.||+.+|+|+|||+||..+.+. ..+..+ .+.++.. + .+++++|+|||++++++ +......
T Consensus 244 ~~~~~~~l~kr~~~gl~~gl~~fP~~e~~~---~~~~~~-~~~~~~~---~-~~~~~~H~fth~~l~i~-~~a~~~~--- 311 (342)
T COG1194 244 NRDGEVLLEKRPEKGLLGGLWCFPQFEDEA---DLLDWL-AADGLAA---E-PLGAFRHTFTHFRLTIE-LRASASL--- 311 (342)
T ss_pred ccCcchhhhhCcccCceecccccccccccc---hhhhHh-hhccccc---c-cccceeeeeeEEEEEEE-EEeeccc---
Confidence 888999999999999999999999998763 222111 2334333 4 78999999999999999 4444332
Q ss_pred CCCCCCeeecccccCcCCCChHHHHHHHHHH
Q psy9547 326 PANKTYKLITETQMKKYAFPVPYQKVWKLFT 356 (363)
Q Consensus 326 ~~~~~~~W~~~~el~~~~~~~a~~~il~~l~ 356 (363)
... +..|++++++..+++|.+++++++.+.
T Consensus 312 ~~~-~~~w~~~~~~~~~~l~~p~~k~l~~~~ 341 (342)
T COG1194 312 VLS-DGRWYNLSDLESIGLPAPVKKLLQQLK 341 (342)
T ss_pred CCC-CceeccccccccccccHHHHHHHHHhc
Confidence 223 789999999999999999999998764
No 3
>TIGR01084 mutY A/G-specific adenine glycosylase. This equivalog model identifies mutY members of the pfam00730 superfamily (HhH-GPD: Helix-hairpin-helix and Gly/Pro rich loop followed by a conserved aspartate). The major members of the superfamily are nth and mutY.
Probab=100.00 E-value=2.1e-72 Score=521.29 Aligned_cols=273 Identities=44% Similarity=0.843 Sum_probs=252.7
Q ss_pred HHHHHHHHHhhcCC-CCCCCCCCChHHHHHHHHHhhhcchhhHHHHHHHHHHhCCCHHHHhcCCHHHHHHHHhccCchHH
Q psy9547 13 FQESILTWYKQNAR-QLPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAFDTEDNVLKMWEGLGYYSR 91 (363)
Q Consensus 13 ~~~~l~~~~~~~~~-~lpw~~~~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~pt~~~la~a~~~el~~~~~~lG~~~r 91 (363)
|.+.|++||..++| ++|||.+.|||++||++||+|||++++|.++|.+|+++|||+++|+++++++|.++|+++|||+|
T Consensus 2 ~~~~l~~w~~~~~r~~lpWr~~~dpy~vlvseIL~QQT~v~~v~~~~~rl~~~fpt~~~La~a~~eeL~~~~~~lG~y~R 81 (275)
T TIGR01084 2 FSEDLLSWYDKYGRKTLPWRQNKTPYRVWLSEVMLQQTQVATVIPYFERFLERFPTVQALANAPQDEVLKLWEGLGYYAR 81 (275)
T ss_pred hHHHHHHHHHHhCCCCCCCCCCCCHHHHHHHHHHHhhccHHHHHHHHHHHHHhCCCHHHHHCcCHHHHHHHHHHCCcHHH
Confidence 57899999999998 99999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHHHhcCCccccccchHHHHHhhhcccccCcCCcccHHHHH
Q psy9547 92 VRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTKVVYE 171 (363)
Q Consensus 92 A~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~~~~~~~~~~vD~~v~Rvl~Rl~~~~~~~~~~~~~~~~~ 171 (363)
|++|+++|+.|.++|+|.+|+++++|++|||||+|||++|++|+||+|.++||+||.||++|+++++.+.....+.+.++
T Consensus 82 Ar~L~~~A~~i~~~~~g~~p~~~~~L~~LpGIG~~TA~~Il~~a~~~~~~~vD~~v~RVl~Rl~~~~~~~~~~~~~~~l~ 161 (275)
T TIGR01084 82 ARNLHKAAQEVVEEFGGEFPQDFEDLAALPGVGRYTAGAILSFALNKPYPILDGNVKRVLSRLFAVEGWPGKKKVENRLW 161 (275)
T ss_pred HHHHHHHHHHHHHHcCCCCcHHHHHHHhCCCCCHHHHHHHHHHHCCCCCCcchHhHHHHHHHHccCcCCCCHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999998876555556677888
Q ss_pred HHHHhcCCcccHHHHHHHHHHHhhhhCcCCCCCCCCCCCcccChhhhcCCcccCCcccCCccccceeEEEEEEEeCCceE
Q psy9547 172 TLVSKTMSQTNAREFNQALMDLGATVCLFKNPKCKECPLSRFCSAYKNNTIENFPVKLGKTKVKDVFLLTVVVKTDTNKY 251 (363)
Q Consensus 172 ~~~~~~~p~~~~~~~~~~l~~~G~~~C~~~~P~C~~Cpl~~~C~~~~~~~~~~~p~k~~k~~~~~~~v~~~ii~~~~~~v 251 (363)
..++.++|.+.+++||++||+||+++|+|++|+|+.|||++.|.++..+.+..+|+|++|++.+.....+++|.+++|++
T Consensus 162 ~~~~~~lp~~~~~~~n~alm~lG~~vC~~~~P~C~~Cpl~~~C~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 241 (275)
T TIGR01084 162 TLAESLLPKADPEAFNQALMDLGAMICTRKKPKCDLCPLQDFCLAYQQGTWEEYPVKKPKAAPPERTTYFLVLQNYDGEV 241 (275)
T ss_pred HHHHHHCChhhHHHHHHHHHHHhHHHcCCCCCCCCCCCChhhCHHHHcCCHhhcCCCCCCCCCCeEEEEEEEEEeCCCeE
Confidence 88999999999999999999999999999999999999999999999999999999998888777777788877766899
Q ss_pred EEeecCCCccccccccccccCCCCCchHHHHHHHhhCC
Q psy9547 252 LIQKRPTTGLLSNFYMFLSFESDTPYTSQVEFLSENLP 289 (363)
Q Consensus 252 Ll~~R~~~~~~~glWEFPggk~e~~~~~alrEl~Eelg 289 (363)
|+.||+++|+|+||||||+.+.+ ++++++..++|
T Consensus 242 ~~~~r~~~~~~~gl~~~p~~~~~----~~~~~~~~~~~ 275 (275)
T TIGR01084 242 LLEQRPEKGLWGGLYCFPQFEDE----DSLAFLLAQRG 275 (275)
T ss_pred EEEeCCCCchhhccccCCCCCch----hHHHHHHHhcC
Confidence 99999999999999999997653 44444444443
No 4
>PRK13910 DNA glycosylase MutY; Provisional
Probab=100.00 E-value=2e-67 Score=489.16 Aligned_cols=283 Identities=28% Similarity=0.516 Sum_probs=245.7
Q ss_pred HhhhcchhhHHH-HHHHHHHhCCCHHHHhcCCHHHHHHHHhccCchHHHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCC
Q psy9547 45 MLQQTQVKTVLP-YYEKFIKTYPTIKDFAFDTEDNVLKMWEGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGV 123 (363)
Q Consensus 45 l~qqt~~~~v~~-~~~~l~~~~pt~~~la~a~~~el~~~~~~lG~~~rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGI 123 (363)
|+|||++++|.+ +|++|+++|||+++||+|+++||+++|+++|||+||++|+++|+.|+++|+|.+|.++++|++||||
T Consensus 1 mlQQT~v~~v~~~yy~rf~~~fPt~e~La~a~~~el~~~~~glGyy~RAr~L~~~A~~i~~~~~g~~P~~~~~L~~LpGI 80 (289)
T PRK13910 1 MSQQTQINTVVERFYSPFLEAFPTLKDLANAPLEEVLLLWRGLGYYSRAKNLKKSAEICVKEHHSQLPNDYQSLLKLPGI 80 (289)
T ss_pred CCCCCcHHHhHHHHHHHHHHHCCCHHHHHCCCHHHHHHHHHcCCcHHHHHHHHHHHHHHHHHhCCCCChhHHHHHhCCCC
Confidence 899999999986 9999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cHHHHHHHHHHhcCCccccccchHHHHHhhhcccccCcCCcccHHHHHHHHHhcCCcccHHHHHHHHHHHhhhhCcCCCC
Q psy9547 124 GDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALMDLGATVCLFKNP 203 (363)
Q Consensus 124 G~~tA~~il~~~~~~~~~~vD~~v~Rvl~Rl~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~l~~~G~~~C~~~~P 203 (363)
|+|||++||+|+||+|+++||+||+||++|++++..+ .+.+.++..++.++|...++++|++||+||+++|+| +|
T Consensus 81 G~kTA~aIl~~af~~~~~~VD~nV~RVl~Rl~g~~~~----~~~~~l~~~~~~~l~~~~~~~~nqaLm~~Ga~iC~~-~P 155 (289)
T PRK13910 81 GAYTANAILCFGFREKSACVDANIKRVLLRLFGLDPN----IHAKDLQIKANDFLNLNESFNHNQALIDLGALICSP-KP 155 (289)
T ss_pred CHHHHHHHHHHHCCCCcCcccHHHHHHHHHHhcCCCC----ccHHHHHHHHHHhCCccchHHHHHHHHHHhHHHcCC-CC
Confidence 9999999999999999999999999999999877643 345678888889999999999999999999999999 79
Q ss_pred CCCCCCCcccChhhhcCCcccCCcccCCccccceeEEEEEEEeCCceEEEeecCCCccccccccccccCCCCCchHHHHH
Q psy9547 204 KCKECPLSRFCSAYKNNTIENFPVKLGKTKVKDVFLLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDTPYTSQVEF 283 (363)
Q Consensus 204 ~C~~Cpl~~~C~~~~~~~~~~~p~k~~k~~~~~~~v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~~~~~alrE 283 (363)
+|+.|||++.|.++..+.+..+ |+||++++ .....+++.++ |++|+.||+ +|+|+||||||+...
T Consensus 156 ~C~~CPl~~~C~~~~~~~~~~~--~~kk~~~~-~~~~~~~~~~~-~~~ll~kr~-~~l~~gl~~fP~~~~---------- 220 (289)
T PRK13910 156 KCAICPLNPYCLGKNNPEKHTL--KKKQEIVQ-EERYLGVVIQN-NQIALEKIE-QKLYLGMHHFPNLKE---------- 220 (289)
T ss_pred CCCCCcChhhhhhhhcCCcccc--CCCCCCCc-eEEEEEEEEEC-CEEEEEECC-CchhcccccCCCChh----------
Confidence 9999999999999999888765 44444333 34444455555 799999995 899999999998531
Q ss_pred HHhhCCCcccccccccccEEEEccceEEEEEEEEEEecCCCCCCCCCCeeecccccCcCCCChHHHHHHHHHHhh
Q psy9547 284 LSENLPFKVNLNEKCLGNVKHVFSHLKWNMDVYSGTAKEKTIPANKTYKLITETQMKKYAFPVPYQKVWKLFTKS 358 (363)
Q Consensus 284 l~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~~~~~~~~~~~~~~~~~~W~~~~el~~~~~~~a~~~il~~l~~~ 358 (363)
+++.. .. .+++++|.|||++|++++|.+.... ..++.+|++++++.+++||++++++++.++..
T Consensus 221 ---~~~~~---~~-~~~~~~H~fTH~~~~~~~~~~~~~~----~~~~~~w~~~~~~~~~~~p~~~~k~~~~l~~~ 284 (289)
T PRK13910 221 ---NLEYK---LP-FLGAIKHSHTKFKLNLNLYLAAIKD----LKNPIRFYSLKDLETLPISSMTLKILNFLKQK 284 (289)
T ss_pred ---hhccc---cc-ccCceEEEEEeEEEEEEEEEEEecc----CCccceEecHHHhhhcCCcHHHHHHHHHHhhh
Confidence 11111 22 4679999999999999999887642 13456899999999999999999999988764
No 5
>KOG2457|consensus
Probab=100.00 E-value=2.4e-63 Score=458.28 Aligned_cols=336 Identities=34% Similarity=0.597 Sum_probs=280.3
Q ss_pred CHHHHHHHHHHHHHHHhhcCCCCCCCCC---------CChHHHHHHHHHhhhcchhhHHHHHHHHHHhCCCHHHHhcCCH
Q psy9547 6 SAKEILAFQESILTWYKQNARQLPWRES---------NNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAFDTE 76 (363)
Q Consensus 6 ~~~~~~~~~~~l~~~~~~~~~~lpw~~~---------~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~pt~~~la~a~~ 76 (363)
+..+...|...|++||+.++|+||||.. +-.|++|||+||+|||++.+|.+||.+||++|||..++++|+.
T Consensus 86 ~~~Ev~~fR~sLl~wYD~~KRdLPWR~r~sEde~DwerRaYeVwVSEiMLQQTrV~TV~~YYt~WMqkwPTl~dla~Asl 165 (555)
T KOG2457|consen 86 SENEVQKFRMSLLDWYDVNKRDLPWRNRRSEDEKDWERRAYEVWVSEIMLQQTRVQTVMKYYTRWMQKWPTLYDLAQASL 165 (555)
T ss_pred hHHHHHHHHHHHHHHhhhhcccCccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCchHHHHHHHHH
Confidence 4467888999999999999999999986 2479999999999999999999999999999999999999998
Q ss_pred -HHHHHHHhccCchHHHHHHHHHHHHhhHHhCCCCcccHHHHhc-CCCCcHHHHHHHHHHhcCCccccccchHHHHHhhh
Q psy9547 77 -DNVLKMWEGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLS-IKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRL 154 (363)
Q Consensus 77 -~el~~~~~~lG~~~rA~~l~~~a~~i~~~~~g~~p~~~~~L~~-lpGIG~~tA~~il~~~~~~~~~~vD~~v~Rvl~Rl 154 (363)
.|+..+|.|+|||+|+|+|++.|+.+++..+|.+|.+-+.|++ +||||+|||++|+++||+++.-+||+||.||++|.
T Consensus 166 ~~eVn~lWaGlGyY~R~rrL~ega~~vv~~~~ge~Prta~~l~kgvpGVG~YTAGAiaSIAf~q~tGiVDGNVirvlsRa 245 (555)
T KOG2457|consen 166 EKEVNELWAGLGYYRRARRLLEGAKMVVAGTEGEFPRTASSLMKGVPGVGQYTAGAIASIAFNQVTGIVDGNVIRVLSRA 245 (555)
T ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHhCCCCCCChHHHHHhhCCCCCccchhhhhhhhhcCcccccccchHHHhHHh
Confidence 8999999999999999999999999999999999999999999 99999999999999999999999999999999999
Q ss_pred cccccCcCCcccHHHHHHHHHhcCCcccHHHHHHHHHHHhhhhCcCCCCCCCCCCCcccChhhhcCC-------------
Q psy9547 155 FEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALMDLGATVCLFKNPKCKECPLSRFCSAYKNNT------------- 221 (363)
Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~l~~~G~~~C~~~~P~C~~Cpl~~~C~~~~~~~------------- 221 (363)
..+..+.++++....+|.++..++++.+|+||||++|+||+++|||.+|.|+.||+.+.|++|+...
T Consensus 246 lAIhsDcSkgk~~q~~wkLA~qLVDP~RPGDFNQalMELGAt~CTpq~P~CS~CPvss~CrA~q~~Eqn~~i~~g~t~~~ 325 (555)
T KOG2457|consen 246 LAIHSDCSKGKFFQSSWKLAAQLVDPSRPGDFNQALMELGATLCTPQKPSCSSCPVSSQCRAFQLSEQNRTISVGDTYPT 325 (555)
T ss_pred HhhcCCcchhhHHHHHHHHHHHhcCCCCCCcHHHHHHHhcCeeccCCCCCcCCCCcHHHHHHHhHhhhcceeecCCcCCc
Confidence 9999998888888999999999999999999999999999999999999999999999999998321
Q ss_pred -----------------------------cccCCcccCCccccceeEEEEEEEeC-----CceEEEeecCCCcccccccc
Q psy9547 222 -----------------------------IENFPVKLGKTKVKDVFLLTVVVKTD-----TNKYLIQKRPTTGLLSNFYM 267 (363)
Q Consensus 222 -----------------------------~~~~p~k~~k~~~~~~~v~~~ii~~~-----~~~vLl~~R~~~~~~~glWE 267 (363)
+-.+|++..+++.+...-.+++.... .+.||+.+||+.|+++|||+
T Consensus 326 dve~~~P~~~~C~vCv~~ips~e~~Qs~gv~~~p~~p~~~~~reE~~~v~~~e~~dp~t~~~~ilv~~rp~~gllagLw~ 405 (555)
T KOG2457|consen 326 DVEKAKPRHDFCCVCVLEIPSLERNQSGGVFVLPKRPEQLAGREEFPSVILNEEADPATRRNAILVYLRPAFGLLAGLWK 405 (555)
T ss_pred cHhhcCCCCCceeEeecCCCCchhhccCcEEEeccChhhcCcccccceeeeeccCCchhhcceeEEEeccchhHHHHhhh
Confidence 23466655554444333334343331 25899999999999999999
Q ss_pred ccccCC-CCC---chHHH--HHHHhhCCCcc-cccc--cccccEEEEccceEEEEEEEEEEecCCCC--CCCCCCeeecc
Q psy9547 268 FLSFES-DTP---YTSQV--EFLSENLPFKV-NLNE--KCLGNVKHVFSHLKWNMDVYSGTAKEKTI--PANKTYKLITE 336 (363)
Q Consensus 268 FPggk~-e~~---~~~al--rEl~Eelgl~v-~~~~--~~l~~v~h~~sh~~~~l~~~~~~~~~~~~--~~~~~~~W~~~ 336 (363)
||.... |++ .-.+. +.+..+++..- .-+. ..-+.+.|+|||+.++-+||.....+..+ .....++|++.
T Consensus 406 fpti~~~e~se~~~~~a~~q~~v~~w~~~~~~t~~~~~~~~G~~~htfshi~~ts~V~~~a~~~~~~vt~~p~~~~wi~q 485 (555)
T KOG2457|consen 406 FPTIVSRELSEFVHIFAHIQRKVYVWLLVVQLTGGTEDLFKGQAKHTFSHICVTSDVLSTAGLTSAVVTVPPFRLQWIKQ 485 (555)
T ss_pred cCceeccCcchHHHHHHHHHHHHHHHhcccccccchhhhcccchhhhhhhhhhHHHHHHHhhcCCCccccCCcccchhhh
Confidence 998766 332 11222 33555555432 2222 13388999999999988888544333222 23467899999
Q ss_pred cccCc
Q psy9547 337 TQMKK 341 (363)
Q Consensus 337 ~el~~ 341 (363)
.+|+.
T Consensus 486 ~~l~h 490 (555)
T KOG2457|consen 486 LSLDH 490 (555)
T ss_pred hhhHH
Confidence 99854
No 6
>COG0177 Nth Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]
Probab=100.00 E-value=7.7e-53 Score=370.90 Aligned_cols=206 Identities=28% Similarity=0.526 Sum_probs=195.0
Q ss_pred CHHHHHHHHHHHHHHHhhcCCCCCCCCCCChHHHHHHHHHhhhcchhhHHHHHHHHHHhCCCHHHHhcCCHHHHHHHHhc
Q psy9547 6 SAKEILAFQESILTWYKQNARQLPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAFDTEDNVLKMWEG 85 (363)
Q Consensus 6 ~~~~~~~~~~~l~~~~~~~~~~lpw~~~~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~pt~~~la~a~~~el~~~~~~ 85 (363)
...++.++.++|..-|+....++-| .|||++||+.||||||+++.+.+++.+|++.||||++++++++++|.+.|+.
T Consensus 2 ~~~~~~~i~~~l~~~~p~~~~~l~~---~~pf~lLva~iLSaqttD~~vn~at~~Lf~~~~t~e~l~~a~~~~l~~~I~~ 78 (211)
T COG0177 2 NKKKALEILDRLRELYPEPKTELDF---KDPFELLVAVILSAQTTDEVVNKATPALFKRYPTPEDLLNADEEELEELIKS 78 (211)
T ss_pred cHhhHHHHHHHHHHHCCCCCCccCc---CCcHHHHHHHHHhccCchHHHHHHHHHHHHHcCCHHHHHcCCHHHHHHHHHh
Confidence 3567788889999888877666655 7999999999999999999999999999999999999999999999999999
Q ss_pred cCchH-HHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHHHhcCCccccccchHHHHHhhhcccccCcCCc
Q psy9547 86 LGYYS-RVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISKG 164 (363)
Q Consensus 86 lG~~~-rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~~~~~~~~~~vD~~v~Rvl~Rl~~~~~~~~~~ 164 (363)
+|||+ ||++|+++|+.|+++|+|++|+++++|++|||||+|||++||+++||.|+++||+||.||..|+++.+ .
T Consensus 79 iGlyr~KAk~I~~~~~~l~e~~~g~vP~~~~eL~~LPGVGrKTAnvVL~~a~g~p~i~VDTHV~Rvs~R~gl~~-----~ 153 (211)
T COG0177 79 IGLYRNKAKNIKELARILLEKFGGEVPDTREELLSLPGVGRKTANVVLSFAFGIPAIAVDTHVHRVSNRLGLVP-----G 153 (211)
T ss_pred cCCcHHHHHHHHHHHHHHHHHcCCCCCchHHHHHhCCCcchHHHHHHHHhhcCCCcccccchHHHHHHHhCCCC-----C
Confidence 99996 99999999999999999999999999999999999999999999999999999999999999999865 5
Q ss_pred ccHHHHHHHHHhcCCcccHHHHHHHHHHHhhhhCcCCCCCCCCCCCcccChhhhc
Q psy9547 165 KTKVVYETLVSKTMSQTNAREFNQALMDLGATVCLFKNPKCKECPLSRFCSAYKN 219 (363)
Q Consensus 165 ~~~~~~~~~~~~~~p~~~~~~~~~~l~~~G~~~C~~~~P~C~~Cpl~~~C~~~~~ 219 (363)
+++.++++.+..++|.+.|.++|.+|+.|||.+|++++|+|+.||++++|+++..
T Consensus 154 ~~p~~ve~~L~~~iP~~~~~~~h~~lI~~GR~iC~ar~P~C~~C~l~~~C~~~~~ 208 (211)
T COG0177 154 KTPEEVEEALMKLIPKELWTDLHHWLILHGRYICKARKPRCEECPLADLCPSAGK 208 (211)
T ss_pred CCHHHHHHHHHHHCCHHHHHHHHHHHHHhhhhhccCCCCCcCcccchhhCchhcc
Confidence 6789999999999999999999999999999999999999999999999999874
No 7
>PRK10702 endonuclease III; Provisional
Probab=100.00 E-value=1.9e-51 Score=367.92 Aligned_cols=205 Identities=22% Similarity=0.326 Sum_probs=190.9
Q ss_pred CHHHHHHHHHHHHHHHhhcCCCCCCCCCCChHHHHHHHHHhhhcchhhHHHHHHHHHHhCCCHHHHhcCCHHHHHHHHhc
Q psy9547 6 SAKEILAFQESILTWYKQNARQLPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAFDTEDNVLKMWEG 85 (363)
Q Consensus 6 ~~~~~~~~~~~l~~~~~~~~~~lpw~~~~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~pt~~~la~a~~~el~~~~~~ 85 (363)
+..++..+.++|.+.|+. ..+|+..+|||++||++||||||+++++.+++.+|+.+||||++|+++++++|+++|++
T Consensus 2 ~~~~~~~i~~~l~~~~~~---~~~~~~~~~p~e~lvs~iLsq~t~~~~v~~~~~~L~~~~pt~e~l~~a~~~~l~~~i~~ 78 (211)
T PRK10702 2 NKAKRLEILTRLRDNNPH---PTTELNFSSPFELLIAVLLSAQATDVSVNKATAKLYPVANTPAAMLELGVEGVKTYIKT 78 (211)
T ss_pred CHHHHHHHHHHHHHHCCC---CCCCCCCCChHHHHHHHHHHhhcCHHHHHHHHHHHHHHcCCHHHHHCCCHHHHHHHHHH
Confidence 345688899999999875 34677789999999999999999999999999999999999999999999999999999
Q ss_pred cCchH-HHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHHHhcCCccccccchHHHHHhhhcccccCcCCc
Q psy9547 86 LGYYS-RVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISKG 164 (363)
Q Consensus 86 lG~~~-rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~~~~~~~~~~vD~~v~Rvl~Rl~~~~~~~~~~ 164 (363)
+|||+ ||++|+++|+.+.++|+|.+|+++++|++|||||+|||++||+|+||+|+++||+||.||++|+|+.+ .
T Consensus 79 ~G~y~~kA~~l~~~a~~i~~~~~~~~p~~~~~Ll~lpGVG~ktA~~ill~a~~~~~~~VDt~v~Rv~~r~g~~~-----~ 153 (211)
T PRK10702 79 IGLYNSKAENVIKTCRILLEQHNGEVPEDRAALEALPGVGRKTANVVLNTAFGWPTIAVDTHIFRVCNRTQFAP-----G 153 (211)
T ss_pred cCCHHHHHHHHHHHHHHHHHHcCCCCCchHHHHhcCCcccHHHHHHHHHHHcCCCcccccchHHHHHHHhCCCC-----C
Confidence 99996 99999999999999999999999999999999999999999999999999999999999999998753 2
Q ss_pred ccHHHHHHHHHhcCCcccHHHHHHHHHHHhhhhCcCCCCCCCCCCCcccChhhh
Q psy9547 165 KTKVVYETLVSKTMSQTNAREFNQALMDLGATVCLFKNPKCKECPLSRFCSAYK 218 (363)
Q Consensus 165 ~~~~~~~~~~~~~~p~~~~~~~~~~l~~~G~~~C~~~~P~C~~Cpl~~~C~~~~ 218 (363)
.++..+++.++..+|.+.+.++|++||+||+.+|++++|+|+.|||++.|+++.
T Consensus 154 ~~~~~~~~~l~~~lp~~~~~~~~~~li~~Gr~~C~~~~P~C~~Cpl~~~C~~~~ 207 (211)
T PRK10702 154 KNVEQVEEKLLKVVPAEFKVDCHHWLILHGRYTCIARKPRCGSCIIEDLCEYKE 207 (211)
T ss_pred CCHHHHHHHHHHhCCchHHHHHHHHHHHHhHHHcCCCCCCCCCCcChhhcCccc
Confidence 457788888888999999999999999999999999999999999999998754
No 8
>TIGR01083 nth endonuclease III. This equivalog model identifes nth members of the pfam00730 superfamily (HhH-GPD: Helix-hairpin-helix and Gly/Pro rich loop followed by a conserved aspartate). The major members of the superfamily are nth and mutY.
Probab=100.00 E-value=4.2e-44 Score=317.68 Aligned_cols=189 Identities=28% Similarity=0.483 Sum_probs=175.1
Q ss_pred HHHHHHHHHHHHHhhcCCCCCCCCCCChHHHHHHHHHhhhcchhhHHHHHHHHHHhCCCHHHHhcCCHHHHHHHHhccCc
Q psy9547 9 EILAFQESILTWYKQNARQLPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAFDTEDNVLKMWEGLGY 88 (363)
Q Consensus 9 ~~~~~~~~l~~~~~~~~~~lpw~~~~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~pt~~~la~a~~~el~~~~~~lG~ 88 (363)
++.+++++|.+.|+.....++| +|||++||++||+|||+++++.+++.+|++.||||++|+++++++|.++|+++||
T Consensus 2 ~~~~i~~~l~~~~~~~~~~~~~---~dpf~~Li~~ILsqqt~~~~~~~~~~~l~~~~pt~~~l~~~~~~~L~~~ir~~G~ 78 (191)
T TIGR01083 2 KAQEILERLRKNYPHPTTELDY---NNPFELLVATILSAQATDKSVNKATKKLFEVYPTPQALAQAGLEELEEYIKSIGL 78 (191)
T ss_pred hHHHHHHHHHHHCCCCCcccCC---CCHHHHHHHHHHHhhCcHHHHHHHHHHHHHHCCCHHHHHcCCHHHHHHHHHhcCC
Confidence 4678999999999864333333 8999999999999999999999999999999999999999999999999999999
Q ss_pred hH-HHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHHHhcCCccccccchHHHHHhhhcccccCcCCcccH
Q psy9547 89 YS-RVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTK 167 (363)
Q Consensus 89 ~~-rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~~~~~~~~~~vD~~v~Rvl~Rl~~~~~~~~~~~~~ 167 (363)
|+ ||++|+++|+.+.++|+|.+|++++.|++|||||+|||++||+|++|+|.++||+||.|++.|+|+.. ..++
T Consensus 79 ~~~Ka~~i~~~a~~i~~~~~~~~~~~~~~L~~l~GIG~ktA~~ill~~~~~~~~~vD~~v~Ri~~r~g~~~-----~~~~ 153 (191)
T TIGR01083 79 YRNKAKNIIALCRILVERYGGEVPEDREELVKLPGVGRKTANVVLNVAFGIPAIAVDTHVFRVSNRLGLSK-----GKDP 153 (191)
T ss_pred hHHHHHHHHHHHHHHHHHcCCCCchHHHHHHhCCCCcHHHHHHHHHHHcCCCccccchhHHHHHHHcCCCC-----CCCH
Confidence 96 99999999999999999999999999999999999999999999999999999999999999998753 2357
Q ss_pred HHHHHHHHhcCCcccHHHHHHHHHHHhhhhCcCCCCCC
Q psy9547 168 VVYETLVSKTMSQTNAREFNQALMDLGATVCLFKNPKC 205 (363)
Q Consensus 168 ~~~~~~~~~~~p~~~~~~~~~~l~~~G~~~C~~~~P~C 205 (363)
+.+++.++.++|.+.+.+||++||+||+.+|+|++|+|
T Consensus 154 ~~~~~~l~~~~p~~~~~~~h~~li~~G~~~C~~~~P~C 191 (191)
T TIGR01083 154 DKVEEELLKLIPREFWTKLHHWLILHGRYTCKARKPLC 191 (191)
T ss_pred HHHHHHHHHHCCchhHHHHHHHHHHHhHHhcCCCCCCC
Confidence 78888999999999999999999999999999999999
No 9
>PRK13913 3-methyladenine DNA glycosylase; Provisional
Probab=100.00 E-value=2.5e-37 Score=276.27 Aligned_cols=183 Identities=13% Similarity=0.176 Sum_probs=157.7
Q ss_pred HHHHHHHHHHhhcCCCCCCCCCCChHHHHHHHHHhhhcchhhHHHHHHHHHHh-------CCCHHHHhcCCHHHHHHHHh
Q psy9547 12 AFQESILTWYKQNARQLPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKT-------YPTIKDFAFDTEDNVLKMWE 84 (363)
Q Consensus 12 ~~~~~l~~~~~~~~~~lpw~~~~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~-------~pt~~~la~a~~~el~~~~~ 84 (363)
+++..|.+.--....+..||+..+||++||++||+|||++++|.+++.+|.+. +|||++|+++++++|+++|+
T Consensus 6 ~l~~~l~~~~~~~~~~~~WWp~~~~fevLV~aILsQqT~~~~v~~a~~~L~~~~~~~~~~~~t~e~L~~a~~eeL~~~Ir 85 (218)
T PRK13913 6 ELLKALKSLDLLKNAPAWWWPNALKFEALLGAVLTQNTKFEAVEKSLENLKNAFILENDDEINLKKIAYIEFSKLAECVR 85 (218)
T ss_pred HHHHHHHhhhhhhcCCCCCCcCcCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcccccccCCCHHHHHcCCHHHHHHHHH
Confidence 44555554322234456899999999999999999999999999999999874 67999999999999999999
Q ss_pred ccCchH-HHHHHHHHHHHhhHHhCCC----CcccHHHHhcCCCCcHHHHHHHHHHhcCCccccccchHHHHHhhhccccc
Q psy9547 85 GLGYYS-RVRNFQAGCRQVIEQFGGE----VPRDKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIED 159 (363)
Q Consensus 85 ~lG~~~-rA~~l~~~a~~i~~~~~g~----~p~~~~~L~~lpGIG~~tA~~il~~~~~~~~~~vD~~v~Rvl~Rl~~~~~ 159 (363)
++|||+ ||++|+++|+.+.++|+|. .|.++++|++|||||+|||++||+|++|+|+++||+|++||++|+|+..
T Consensus 86 p~Gf~~~KA~~Lk~la~~i~~~~g~~~~~~~~~~re~Ll~l~GIG~kTAd~iLlya~~rp~fvVDty~~Rv~~RlG~~~- 164 (218)
T PRK13913 86 PSGFYNQKAKRLIDLSENILKDFGSFENFKQEVTREWLLDQKGIGKESADAILCYVCAKEVMVVDKYSYLFLKKLGIEI- 164 (218)
T ss_pred hcCCHHHHHHHHHHHHHHHHHHcCCchhccCchHHHHHHcCCCccHHHHHHHHHHHcCCCccccchhHHHHHHHcCCCC-
Confidence 999997 9999999999999998772 4678999999999999999999999999999999999999999997743
Q ss_pred CcCCcccHHHHHHHHHhcC-----------Cc-----ccHHHHHHHHHHHhhhhCcC
Q psy9547 160 DISKGKTKVVYETLVSKTM-----------SQ-----TNAREFNQALMDLGATVCLF 200 (363)
Q Consensus 160 ~~~~~~~~~~~~~~~~~~~-----------p~-----~~~~~~~~~l~~~G~~~C~~ 200 (363)
.++.+++++++..+ |. ..+.+||..|++||+..|.-
T Consensus 165 -----~~y~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~Halive~~k~~~~~ 216 (218)
T PRK13913 165 -----EDYDELQHFFEKGVQENLNSALALYENTISLAQLYARFHGKIVEFSKQKLEL 216 (218)
T ss_pred -----CCHHHHHHHHHHhhhhhhhhhhccccccchHHHHHHHHHHHHHHHHHHhhhc
Confidence 25778888888765 21 24688999999999999963
No 10
>KOG1921|consensus
Probab=100.00 E-value=5.3e-37 Score=268.87 Aligned_cols=182 Identities=21% Similarity=0.352 Sum_probs=171.7
Q ss_pred ChHHHHHHHHHhhhcchhhHHHHHHHHHHhC-CCHHHHhcCCHHHHHHHHhccCchH-HHHHHHHHHHHhhHHhCCCCcc
Q psy9547 35 NPYYVWISEVMLQQTQVKTVLPYYEKFIKTY-PTIKDFAFDTEDNVLKMWEGLGYYS-RVRNFQAGCRQVIEQFGGEVPR 112 (363)
Q Consensus 35 dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~-pt~~~la~a~~~el~~~~~~lG~~~-rA~~l~~~a~~i~~~~~g~~p~ 112 (363)
--|++||+.+||.||.++....+..|+.+.. -|++++.++++..|.++|.+.|||+ ||.||++.|+.+.++|+|++|+
T Consensus 77 ~RfqvLv~lmLSSQTKDevt~~Am~rL~~~~gLT~e~v~~~de~~l~~LI~~VgFy~rKA~ylkkta~IL~d~f~gDIP~ 156 (286)
T KOG1921|consen 77 RRFQVLVGLMLSSQTKDEVTAAAMLRLKEYGGLTLEAVLKIDEPTLNELIYPVGFYTRKAKYLKKTAKILQDKFDGDIPD 156 (286)
T ss_pred HhHHHHHHHHHhcchHHHHHHHHHHHHHHhcCCCHHHHhccChHhHHhhhhhccchHHHHHHHHHHHHHHHHHhCCCCch
Confidence 4699999999999999999999999999987 7999999999999999999999995 8999999999999999999999
Q ss_pred cHHHHhcCCCCcHHHHHHHHHHhcCCc-cccccchHHHHHhhhcccccCcCCcccHHHHHHHHHhcCCcccHHHHHHHHH
Q psy9547 113 DKKQLLSIKGVGDYTAGALASICYNIP-TPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALM 191 (363)
Q Consensus 113 ~~~~L~~lpGIG~~tA~~il~~~~~~~-~~~vD~~v~Rvl~Rl~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~l~ 191 (363)
++++|++|||||||.|..+|..|+|+- .+.||+||+|+..|++|++. ..+++++.+..++.|+|.+.|..+|..|+
T Consensus 157 ~v~dLlsLPGVGPKMa~L~m~~AWn~i~GI~VDtHVHRi~nrlgWv~~---ktkspE~TR~aLq~wLPk~lW~eIN~lLV 233 (286)
T KOG1921|consen 157 TVEDLLSLPGVGPKMAHLTMQVAWNKIVGICVDTHVHRICNRLGWVDT---KTKSPEQTRVALQQWLPKSLWVEINHLLV 233 (286)
T ss_pred hHHHHhcCCCCchHHHHHHHHHHhccceeEEeehHHHHHHHHhccccc---ccCCHHHHHHHHHHhCcHHHHhhhhceee
Confidence 999999999999999999999999995 69999999999999999983 34567888889999999999999999999
Q ss_pred HHhhhhCcCCCCCCCCCCC-cccChhhhc
Q psy9547 192 DLGATVCLFKNPKCKECPL-SRFCSAYKN 219 (363)
Q Consensus 192 ~~G~~~C~~~~P~C~~Cpl-~~~C~~~~~ 219 (363)
.||++||+|++|+|+.|-+ ++.|++...
T Consensus 234 GFGQ~iC~p~~prC~~C~~~~~~Cpss~~ 262 (286)
T KOG1921|consen 234 GFGQTICTPRRPRCGLCLLSRDLCPSSFK 262 (286)
T ss_pred cccceeeecCCCCccccccCcccCchhhh
Confidence 9999999999999999999 699999653
No 11
>COG2231 Uncharacterized protein related to Endonuclease III [DNA replication, recombination, and repair]
Probab=100.00 E-value=6.5e-35 Score=251.09 Aligned_cols=205 Identities=23% Similarity=0.381 Sum_probs=182.4
Q ss_pred ccCHHHHHHHHHHHHHHHhhcCCCCCCCCCCChHHHHHHHHHhhhcchhhHHHHHHHHHHhC-CCHHHHhcCCHHHHHHH
Q psy9547 4 NLSAKEILAFQESILTWYKQNARQLPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTY-PTIKDFAFDTEDNVLKM 82 (363)
Q Consensus 4 ~~~~~~~~~~~~~l~~~~~~~~~~lpw~~~~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~-pt~~~la~a~~~el~~~ 82 (363)
|.+...+..+++.|...|+. .-||+..+-+++++++||.|+|+|++|.++.+++.... -++++|...+.++|.++
T Consensus 2 ~~~~~~~~~iy~~L~~~yg~----q~WWp~~~~~EiiigAILtQNT~WknvekAlenLk~~~~~~l~~I~~~~~~~L~el 77 (215)
T COG2231 2 MLNMENITKIYKELLRLYGD----QGWWPADNKDEIIIGAILTQNTSWKNVEKALENLKNEGILNLKKILKLDEEELAEL 77 (215)
T ss_pred ccchHHHHHHHHHHHHHcCC----ccCCCCCCchhHHHHHHHhccccHHHHHHHHHHHHHcccCCHHHHhcCCHHHHHHH
Confidence 45667788999999999986 44999999999999999999999999999999998754 58999999999999999
Q ss_pred HhccCchH-HHHHHHHHHHHhhHHhCC---CCcc-cHHHHhcCCCCcHHHHHHHHHHhcCCccccccchHHHHHhhhccc
Q psy9547 83 WEGLGYYS-RVRNFQAGCRQVIEQFGG---EVPR-DKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEI 157 (363)
Q Consensus 83 ~~~lG~~~-rA~~l~~~a~~i~~~~~g---~~p~-~~~~L~~lpGIG~~tA~~il~~~~~~~~~~vD~~v~Rvl~Rl~~~ 157 (363)
+++.|||+ ||++|+.++..++..|.+ .-+. .+++|++++|||+.|||+||++++++|+|+||.+.+|++.|+|++
T Consensus 78 IrpsGFYnqKa~rLk~l~k~l~~~~~~~~~~~~~~~R~~LL~iKGIG~ETaDsILlYa~~rp~FVvD~Yt~R~l~rlg~i 157 (215)
T COG2231 78 IRPSGFYNQKAKRLKALSKNLAKFFINLESFKSEVLREELLSIKGIGKETADSILLYALDRPVFVVDKYTRRLLSRLGGI 157 (215)
T ss_pred HhccchHHHHHHHHHHHHHHHHHHhhhhhccchHHHHHHHHccCCcchhhHHHHHHHHhcCcccchhHHHHHHHHHhccc
Confidence 99999997 999999888888886643 2222 499999999999999999999999999999999999999999987
Q ss_pred ccCcCCcccHHHHHHHHHhcCCccc--HHHHHHHHHHHhhhhCcCCCCCCCCCCCcccChhhh
Q psy9547 158 EDDISKGKTKVVYETLVSKTMSQTN--AREFNQALMDLGATVCLFKNPKCKECPLSRFCSAYK 218 (363)
Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~p~~~--~~~~~~~l~~~G~~~C~~~~P~C~~Cpl~~~C~~~~ 218 (363)
.. +++.++++..+..+|.+. +..||..++.||+.+|+. .|.|+.|||...|.+++
T Consensus 158 ~~-----k~ydeik~~fe~~l~~~~~lyqe~HAlIv~~~K~f~~k-~~~~~~cpL~~~~~~~~ 214 (215)
T COG2231 158 EE-----KKYDEIKELFEENLPENLRLYQEFHALIVEHAKHFCKK-KPLCEKCPLKEKCKKYR 214 (215)
T ss_pred cc-----ccHHHHHHHHHhcchhHHHHHHHHHHHHHHHHHHHccC-CcCCCCchHHHHHhhcc
Confidence 63 268889999998888763 689999999999999995 69999999999998765
No 12
>smart00478 ENDO3c endonuclease III. includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases
Probab=99.97 E-value=6.2e-31 Score=224.56 Aligned_cols=148 Identities=33% Similarity=0.516 Sum_probs=140.0
Q ss_pred HhhhcchhhHHHHHHHHHHhCCCHHHHhcCCHHHHHHHHhccCchH-HHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCC
Q psy9547 45 MLQQTQVKTVLPYYEKFIKTYPTIKDFAFDTEDNVLKMWEGLGYYS-RVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGV 123 (363)
Q Consensus 45 l~qqt~~~~v~~~~~~l~~~~pt~~~la~a~~~el~~~~~~lG~~~-rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGI 123 (363)
|+|||+++++..++.+|.+.||||++|+++++++|.++|+++||++ ||++|+++|+.+.+.++|.+|++++.|++||||
T Consensus 1 l~qq~~~~~a~~~~~~l~~~~~~~~~l~~~~~~eL~~~l~~~g~~~~ka~~i~~~a~~~~~~~~~~~~~~~~~L~~l~GI 80 (149)
T smart00478 1 LSQQTSDEAVNKATERLFEKFPTPEDLAAADEEELEELIRPLGFYRRKAKYLIELARILVEEYGGEVPDDREELLKLPGV 80 (149)
T ss_pred CCCcccHHHHHHHHHHHHHHCCCHHHHHCCCHHHHHHHHHHcCChHHHHHHHHHHHHHHHHHHCCCccHHHHHHHcCCCC
Confidence 6899999999999999999999999999999999999999999986 999999999999999999999999999999999
Q ss_pred cHHHHHHHHHHhcCCccccccchHHHHHhhhcccccCcCCcccHHHHHHHHHhcCCcccHHHHHHHHHHHhhh
Q psy9547 124 GDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALMDLGAT 196 (363)
Q Consensus 124 G~~tA~~il~~~~~~~~~~vD~~v~Rvl~Rl~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~l~~~G~~ 196 (363)
|+|||++|++|+++++++++|+|+.|+++|+++.+. ..+++.+++.++.++|.+.++.+|++++.+|+.
T Consensus 81 G~~tA~~~l~~~~~~~~~~~D~~v~r~~~rl~~~~~----~~~~~~~~~~~~~~~p~~~~~~~~~~~l~~g~~ 149 (149)
T smart00478 81 GRKTANAVLSFALGKPFIPVDTHVLRIAKRLGLVDK----KSTPEEVEKLLEKLLPKEDWRELNLLLIDFGRT 149 (149)
T ss_pred cHHHHHHHHHHHCCCCCCccchHHHHHHHHhCCCCC----CCCHHHHHHHHHHHCCHHHHHHHHHHHHHHcCC
Confidence 999999999999999999999999999999998762 346788999999999988899999999999973
No 13
>cd00056 ENDO3c endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases
Probab=99.97 E-value=3.3e-30 Score=222.20 Aligned_cols=154 Identities=34% Similarity=0.492 Sum_probs=143.1
Q ss_pred HHHHHHHHHhhhcchhhHHHHHHHHHHhC-CCHHHHhcCCHHHHHHHHhccCchHHHHHHHHHHHHhhHHhCCCC---cc
Q psy9547 37 YYVWISEVMLQQTQVKTVLPYYEKFIKTY-PTIKDFAFDTEDNVLKMWEGLGYYSRVRNFQAGCRQVIEQFGGEV---PR 112 (363)
Q Consensus 37 ~~~lv~~il~qqt~~~~v~~~~~~l~~~~-pt~~~la~a~~~el~~~~~~lG~~~rA~~l~~~a~~i~~~~~g~~---p~ 112 (363)
|++|+++||+|||+++.+..++.+|.+.| |||++|+.+++++|.+++.+.||.+||++|+++|+.+.+.+++.. ++
T Consensus 1 ~e~Li~~il~q~~s~~~a~~~~~~l~~~~gpt~~~l~~~~~~~l~~~~~~~G~~~kA~~i~~~a~~~~~~~~~~~~~~~~ 80 (158)
T cd00056 1 FEVLVSEILSQQTTDKAVNKAYERLFERYGPTPEALAAADEEELRELIRSLGYRRKAKYLKELARAIVEGFGGLVLDDPD 80 (158)
T ss_pred CHHHHHHHHHhcccHHHHHHHHHHHHHHhCCCHHHHHCCCHHHHHHHHHhcChHHHHHHHHHHHHHHHHHcCCccCCCcc
Confidence 68999999999999999999999999999 999999999999999999999944599999999999999988866 78
Q ss_pred cHHHHhcCCCCcHHHHHHHHHHhcCCccccccchHHHHHhhhcccccCcCCcccHHHHHHHHHhcCCcccHHHHHHHHHH
Q psy9547 113 DKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALMD 192 (363)
Q Consensus 113 ~~~~L~~lpGIG~~tA~~il~~~~~~~~~~vD~~v~Rvl~Rl~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~l~~ 192 (363)
.++.|++|||||+|||+++++|+++.+++|+|+|+.|+++|+++++ ...+++.+++.++.+.|......+|+.||+
T Consensus 81 ~~~~L~~l~GIG~~tA~~~l~~~~~~~~~pvD~~v~r~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 156 (158)
T cd00056 81 AREELLALPGVGRKTANVVLLFALGPDAFPVDTHVRRVLKRLGLIP----KKKTPEELEELLEELLPKPYWGEANQALMD 156 (158)
T ss_pred cHHHHHcCCCCCHHHHHHHHHHHCCCCCCccchhHHHHHHHhCCCC----CCCCHHHHHHHHHHHCCHHHHHHHHHHHHH
Confidence 8999999999999999999999999889999999999999999864 345788899999999987778999999999
Q ss_pred Hh
Q psy9547 193 LG 194 (363)
Q Consensus 193 ~G 194 (363)
+|
T Consensus 157 ~g 158 (158)
T cd00056 157 LG 158 (158)
T ss_pred cC
Confidence 87
No 14
>PRK01229 N-glycosylase/DNA lyase; Provisional
Probab=99.95 E-value=1e-26 Score=206.07 Aligned_cols=157 Identities=12% Similarity=0.123 Sum_probs=130.4
Q ss_pred CCCCCChHHHHHHHHHhhhcchhhHHHHHHHHHHhCCCHHHHhcCCHHHHHHHHhccC--chH-HHHHHHHHHHH---hh
Q psy9547 30 WRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAFDTEDNVLKMWEGLG--YYS-RVRNFQAGCRQ---VI 103 (363)
Q Consensus 30 w~~~~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~pt~~~la~a~~~el~~~~~~lG--~~~-rA~~l~~~a~~---i~ 103 (363)
|....|||+.||++||+|||+++++.+++.+|. ++.+ +++.++|+++|++.| ||+ ||++|+++++. +.
T Consensus 31 ~~~~~~~f~~Lv~~ILsqnT~~~~v~~a~~~L~-----~~~l-~~~~eeL~~~Ir~~Gygf~~~KAk~I~~~~~~~~~l~ 104 (208)
T PRK01229 31 KGDEEDLFSELSFCILTANSSAEGGIKAQKEIG-----DGFL-YLSEEELEEKLKEVGHRFYNKRAEYIVEARKLYGKLK 104 (208)
T ss_pred ccccCChHHHHHHHHhcCcCcHHHHHHHHHhcC-----HHHc-CCCHHHHHHHHHHhhcccHHHHHHHHHHHHHHHHHHH
Confidence 446789999999999999999999999999983 4566 899999999999995 886 99999999987 66
Q ss_pred HHh--CCCCcccHHHHh-cCCCCcHHHHHHHHHHhcCCccccccchHHHHHhhhcccccCc--CCcccHHHHHHHHHhcC
Q psy9547 104 EQF--GGEVPRDKKQLL-SIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDI--SKGKTKVVYETLVSKTM 178 (363)
Q Consensus 104 ~~~--~g~~p~~~~~L~-~lpGIG~~tA~~il~~~~~~~~~~vD~~v~Rvl~Rl~~~~~~~--~~~~~~~~~~~~~~~~~ 178 (363)
+.+ +++.|.++++|+ ++||||+|||++||+.++.+|+++||+||.|++.|+|.++... .+.+++.+++..+.++.
T Consensus 105 ~~~~~~~~~~~~R~~Ll~~lpGIG~KTAd~vL~~~~~~~~~iVDtHv~Ri~~RlG~~~~~~~~lt~~~y~~~E~~l~~~~ 184 (208)
T PRK01229 105 EIIKADKDQFEAREFLVKNIKGIGYKEASHFLRNVGYEDLAILDRHILRFLKRYGLIEEIPKTLSKKRYLEIEEILREIA 184 (208)
T ss_pred HHHhccCCchHHHHHHHHcCCCCcHHHHHHHHHHccCCCeeeeeHHHHHHHHHhCCCcccccccCcCCHHHHHHHHHHHH
Confidence 666 788999999999 9999999999999976655999999999999999999987532 23567888888777766
Q ss_pred Cccc--HHHHHHHHHH
Q psy9547 179 SQTN--AREFNQALMD 192 (363)
Q Consensus 179 p~~~--~~~~~~~l~~ 192 (363)
+.-. .+.+-..||.
T Consensus 185 ~~~~~~~~~Ldl~~w~ 200 (208)
T PRK01229 185 EELGISLGELDLYIWY 200 (208)
T ss_pred HHcCCCHHHHHHHHHH
Confidence 5422 3445455544
No 15
>TIGR00588 ogg 8-oxoguanine DNA-glycosylase (ogg). All proteins in this family for which functions are known are 8-oxo-guanaine DNA glycosylases that function in base excision repair. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). This family is distantly realted to the Nth-MutY superfamily.
Probab=99.93 E-value=2.5e-25 Score=210.97 Aligned_cols=156 Identities=17% Similarity=0.295 Sum_probs=125.9
Q ss_pred CChHHHHHHHHHhhhcchhhHHHHHHHHHH---------------hCCCHHHHhcCCHHHHHHHHhccCchHHHHHHHHH
Q psy9547 34 NNPYYVWISEVMLQQTQVKTVLPYYEKFIK---------------TYPTIKDFAFDTEDNVLKMWEGLGYYSRVRNFQAG 98 (363)
Q Consensus 34 ~dp~~~lv~~il~qqt~~~~v~~~~~~l~~---------------~~pt~~~la~a~~~el~~~~~~lG~~~rA~~l~~~ 98 (363)
.||||++|++|++||++++++.+++.+|.+ .||||++|++++.++ .|+.+|+..||++|+++
T Consensus 117 ~d~fE~lv~~IlsQq~si~~a~~~~~rL~~~~G~~~~~~~g~~~~~FPtp~~La~~~~e~---~Lr~~G~g~Ra~~I~~~ 193 (310)
T TIGR00588 117 QDPFECLISFICSSNNNIARITRMVERLCQAFGPRLITLDGVTYHGFPSLHALTGPEAEA---HLRKLGLGYRARYIRET 193 (310)
T ss_pred CCHHHHHHHHHHhCCCCHHHHHHHHHHHHHHhCCCcccCCCcccccCCCHHHHhCCChHH---HHHHcCCHHHHHHHHHH
Confidence 499999999999999999999999999965 389999999876543 45568886699999999
Q ss_pred HHHhhHHhCCC----------CcccHHHHhcCCCCcHHHHHHHHHHhcCCc-cccccchHHHHHhhhcccccCc-----C
Q psy9547 99 CRQVIEQFGGE----------VPRDKKQLLSIKGVGDYTAGALASICYNIP-TPAVDGNVFRIYGRLFEIEDDI-----S 162 (363)
Q Consensus 99 a~~i~~~~~g~----------~p~~~~~L~~lpGIG~~tA~~il~~~~~~~-~~~vD~~v~Rvl~Rl~~~~~~~-----~ 162 (363)
|+.+.+++++. .++.+++|++|||||+|||+||++|+++++ ++|+|+||+|+++|+|+..... .
T Consensus 194 A~~i~~~~~~~~~l~~l~~~~~~~~~~~L~~l~GIG~~tAd~vll~~l~~~d~~PvD~~v~r~~~r~y~~~~~~~~~~~~ 273 (310)
T TIGR00588 194 ARALLEEQGGRAWLQQIRGASYEDAREALCELPGVGPKVADCICLMGLDKPQAVPVDVHVWRIANRDYPWHPKTSRAKGP 273 (310)
T ss_pred HHHHHhccCCchhHHhhccCChHHHHHHHHhCCCccHHHHHHHHHHhCCCCCceeecHHHHHHHHHHhcccccccccccC
Confidence 99999977654 356799999999999999999999999997 6889999999999998764332 2
Q ss_pred CcccHHHHHHHHHhcCCcccHHHHHH-HHHHHh
Q psy9547 163 KGKTKVVYETLVSKTMSQTNAREFNQ-ALMDLG 194 (363)
Q Consensus 163 ~~~~~~~~~~~~~~~~p~~~~~~~~~-~l~~~G 194 (363)
...+++.++..+.+.+++ ++.+.| .|..+.
T Consensus 274 ~~~~~~~i~~~~~~~~g~--~ag~aq~~lf~~~ 304 (310)
T TIGR00588 274 SPFARKELGNFFRSLWGP--YAGWAQAVLFSAD 304 (310)
T ss_pred ChhHHHHHHHHHHHHhcC--cchHHHHHHHHHH
Confidence 245677788877776653 333443 344433
No 16
>TIGR03252 uncharacterized HhH-GPD family protein. This model describes a small, well-conserved bacterial protein family. Its sequence largely consists of a domain, HhH-GPD, found in a variety of related base excision DNA repair enzymes (see pfam00730).
Probab=99.89 E-value=5.5e-23 Score=176.40 Aligned_cols=103 Identities=15% Similarity=0.267 Sum_probs=95.6
Q ss_pred CChHHHHHHHHHhhhcchhhHHHHHHHHHHhC--CCHHHHhcCCHHHHHHHHhcc----CchH-HHHHHHHHHHHhhHHh
Q psy9547 34 NNPYYVWISEVMLQQTQVKTVLPYYEKFIKTY--PTIKDFAFDTEDNVLKMWEGL----GYYS-RVRNFQAGCRQVIEQF 106 (363)
Q Consensus 34 ~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~--pt~~~la~a~~~el~~~~~~l----G~~~-rA~~l~~~a~~i~~~~ 106 (363)
++||++||++||+|||++++|.+++.++++++ +||++|++++.++|..+|++. |||+ ||++|+++|+.|+++|
T Consensus 16 ~~pFelLVa~ILSQqTtd~nv~kA~~~L~~~~g~~tp~~La~a~~eeL~~lI~~~pal~Gfy~~KAk~Lk~~a~~iie~y 95 (177)
T TIGR03252 16 SDPFALLTGMLLDQQVPMERAFAGPHKIARRMGSLDAEDIAKYDPQAFVALFSERPAVHRFPGSMAKRVQALAQYVVDTY 95 (177)
T ss_pred CChHHHHHHHHHhccCcHHHHHHHHHHHHHHhCCCCHHHHHcCCHHHHHHHHhcCccccCchHHHHHHHHHHHHHHHHHh
Confidence 69999999999999999999999999998765 699999999999999999865 9997 9999999999999999
Q ss_pred CCCC--------ccc---HHHHhcCCCCcHHHHHHHHHHhc
Q psy9547 107 GGEV--------PRD---KKQLLSIKGVGDYTAGALASICY 136 (363)
Q Consensus 107 ~g~~--------p~~---~~~L~~lpGIG~~tA~~il~~~~ 136 (363)
+|++ |++ +++|++|||||++||++||.+..
T Consensus 96 ~G~v~~L~~~~~p~t~~lre~Ll~LpGVG~KTAnvVL~~l~ 136 (177)
T TIGR03252 96 DGDATAVWTEGDPDGKELLRRLKALPGFGKQKAKIFLALLG 136 (177)
T ss_pred CCChhhhhcccCCCcHHHHHHHHcCCCCCHHHHHHHHHHHH
Confidence 9986 766 78999999999999999999643
No 17
>TIGR00586 mutt mutator mutT protein. All proteins in this family for which functions are known are involved in repairing oxidative damage to dGTP (they are 8-oxo-dGTPases). This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=99.89 E-value=8.2e-23 Score=169.27 Aligned_cols=120 Identities=9% Similarity=0.044 Sum_probs=104.8
Q ss_pred ceeEEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEE
Q psy9547 236 DVFLLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWN 312 (363)
Q Consensus 236 ~~~v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~ 312 (363)
...+++++|.+++|++|+.||+.+++++|+|+||||+++. ++++++ ||+.||+|+++.... .+..+.|.|+|+++.
T Consensus 3 ~~~~~~~ii~~~~~~vLl~~R~~~~~~~g~w~~Pgg~ve~ge~~~~~~~RE~~EE~g~~~~~~~-~~~~~~h~~~~~~~~ 81 (128)
T TIGR00586 3 LQQIAVGIIRNENGEIIITRRADGHMFAKLLEFPGGKEEGGETPEQAVVRELEEEIGIPQHFSE-FEKLEYEFYPRHITL 81 (128)
T ss_pred eEEEEEEEEECCCCEEEEEEEeCCCCCCCeEECCCcccCCCCCHHHHHHHHHHHHHCCcceeee-EEEEEEEECCCcEEE
Confidence 4667888887776899999999999999999999999987 778888 999999999987776 788999999999999
Q ss_pred EEEEEEEecCCCC--CCCCCCeeecccccCcCCCChHHHHHHHHHH
Q psy9547 313 MDVYSGTAKEKTI--PANKTYKLITETQMKKYAFPVPYQKVWKLFT 356 (363)
Q Consensus 313 l~~~~~~~~~~~~--~~~~~~~W~~~~el~~~~~~~a~~~il~~l~ 356 (363)
+++|.+...++.. .+..++.|++++++.+++||+++.++++.++
T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~W~~~~~l~~~~~p~~~~~~~~~~~ 127 (128)
T TIGR00586 82 WFWLLERWEGGPPGKEGQPEEWWVLVGLLADDFFPAANPVIIKLLR 127 (128)
T ss_pred EEEEEEEEcCCCcCcccccccEEeCHHHCCccCCCCCCHHHHHHHh
Confidence 9999998765432 2456789999999999999999999998764
No 18
>PF14815 NUDIX_4: NUDIX domain; PDB: 1VRL_A 1RRQ_A 3G0Q_A 3FSQ_A 1RRS_A 3FSP_A.
Probab=99.89 E-value=6e-23 Score=167.26 Aligned_cols=112 Identities=29% Similarity=0.568 Sum_probs=81.2
Q ss_pred EEEEEeCCceEEEeecCCCccccccccccccCCCC-CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEEEEEE
Q psy9547 241 TVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT-PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMDVYSG 318 (363)
Q Consensus 241 ~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~-~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~~~~ 318 (363)
+++|++.+|++|+.|||++|+|+||||||....+. ...+++ +.+.+.+|+.+...+ .+++++|+|||++|++++|.+
T Consensus 1 ~~~i~~~~~~~Ll~kRp~~gll~GLwefP~~e~~~~~~~~~l~~~~~~~~~~~~~~~~-~~~~v~H~fSH~~~~~~~~~~ 79 (114)
T PF14815_consen 1 VLLIIRSQGRVLLEKRPEKGLLAGLWEFPLIESDEEDDEEELEEWLEEQLGLSIRSVE-PLGTVKHVFSHRRWTIHVYEV 79 (114)
T ss_dssp EEEEEETTSEEEEEE--SSSTTTT-EE--EEE-SSS-CHHHHHHHTCCSSS-EEEE-S--SEEEEEE-SSEEEEEEEEEE
T ss_pred CEEEEEeCCEEEEEECCCCChhhcCcccCEeCccCCCCHHHHHHHHHHHcCCChhhhe-ecCcEEEEccceEEEEEEEEE
Confidence 35566677999999999999999999999998876 456667 778788999888777 899999999999999999999
Q ss_pred EecCCCCCCCCCCeeecccccCcCCCChHHHHHHH
Q psy9547 319 TAKEKTIPANKTYKLITETQMKKYAFPVPYQKVWK 353 (363)
Q Consensus 319 ~~~~~~~~~~~~~~W~~~~el~~~~~~~a~~~il~ 353 (363)
++.........+.+|++.+++.++++|++++|+++
T Consensus 80 ~~~~~~~~~~~~~~W~~~~~l~~~~~p~~~~kil~ 114 (114)
T PF14815_consen 80 EVSADPPAEPEEGQWVSLEELDQYPLPTPMRKILE 114 (114)
T ss_dssp EEE-SS----TTEEEEEGGGGGGS---HHHHHHH-
T ss_pred EecCCCCCCCCCcEEEEHHHHhhCCCCHHHHHHhC
Confidence 99874333467889999999999999999999985
No 19
>PRK10776 nucleoside triphosphate pyrophosphohydrolase; Provisional
Probab=99.88 E-value=3.7e-22 Score=165.19 Aligned_cols=121 Identities=9% Similarity=0.152 Sum_probs=104.3
Q ss_pred ceeEEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEE
Q psy9547 236 DVFLLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWN 312 (363)
Q Consensus 236 ~~~v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~ 312 (363)
.+.+++++|.+++|++|+.||+.++.++|+|+||||++++ +.++++ ||+.||+|+++.... .+..++|.|+|+.++
T Consensus 3 ~~~~~~~ii~~~~~~vll~rR~~~~~~~g~w~~PgG~~~~gE~~~~a~~Re~~EE~gl~~~~~~-~~~~~~~~~~~~~~~ 81 (129)
T PRK10776 3 KLQIAVGIIRNPNNEIFITRRAADAHMAGKWEFPGGKIEAGETPEQALIRELQEEVGITVQHAT-LFEKLEYEFPDRHIT 81 (129)
T ss_pred eeEEEEEEEECCCCEEEEEEecCCCCCCCeEECCceecCCCCCHHHHHHHHHHHHHCCceecce-EEEEEEeeCCCcEEE
Confidence 4677888888877899999999999999999999999987 667888 999999999876665 678899999999999
Q ss_pred EEEEEEEecCCCC--CCCCCCeeecccccCcCCCChHHHHHHHHHHh
Q psy9547 313 MDVYSGTAKEKTI--PANKTYKLITETQMKKYAFPVPYQKVWKLFTK 357 (363)
Q Consensus 313 l~~~~~~~~~~~~--~~~~~~~W~~~~el~~~~~~~a~~~il~~l~~ 357 (363)
+.+|.++...+.. .+..+++|++++++..++||++++++++.+.+
T Consensus 82 ~~~~~~~~~~~~~~~~e~~~~~W~~~~~l~~~~~p~~~~~~~~~~~~ 128 (129)
T PRK10776 82 LWFWLVESWEGEPWGKEGQPGRWVSQVALNADEFPPANEPIIAKLKA 128 (129)
T ss_pred EEEEEEEEECCccCCccCCccEEecHHHCccCCCCcccHHHHHHHHh
Confidence 9999888664422 35677899999999999999999999998753
No 20
>PRK10546 pyrimidine (deoxy)nucleoside triphosphate pyrophosphohydrolase; Provisional
Probab=99.88 E-value=3.9e-22 Score=166.96 Aligned_cols=122 Identities=12% Similarity=0.205 Sum_probs=105.3
Q ss_pred cceeEEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEE
Q psy9547 235 KDVFLLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKW 311 (363)
Q Consensus 235 ~~~~v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~ 311 (363)
+.+.+++++|.++ |+|||.||+.++.++|+|+||||+++. ++++|+ ||+.||+|+++.... .+..+.|.|+|+.+
T Consensus 2 ~~~~~~~~ii~~~-~~vLL~~R~~~~~~~g~w~~PgG~ve~gE~~~~a~~RE~~EE~Gl~~~~~~-~~~~~~~~~~~~~~ 79 (135)
T PRK10546 2 KMIDVVAAIIERD-GKILLAQRPAHSDQAGLWEFAGGKVEPGESQPQALIRELREELGIEATVGE-YVASHQREVSGRRI 79 (135)
T ss_pred CeEEEEEEEEecC-CEEEEEEccCCCCCCCcEECCcccCCCCCCHHHHHHHHHHHHHCCccccce-eEEEEEEecCCcEE
Confidence 3566777777654 899999999989999999999999988 777888 999999999988776 78889999999999
Q ss_pred EEEEEEEEecCCCC--CCCCCCeeecccccCcCCCChHHHHHHHHHHhh
Q psy9547 312 NMDVYSGTAKEKTI--PANKTYKLITETQMKKYAFPVPYQKVWKLFTKS 358 (363)
Q Consensus 312 ~l~~~~~~~~~~~~--~~~~~~~W~~~~el~~~~~~~a~~~il~~l~~~ 358 (363)
++++|.+....+.+ .++.+++|++++++.+++++++++++++.+.+.
T Consensus 80 ~~~~~~~~~~~~~~~~~e~~~~~W~~~~el~~~~~~~~~~~~l~~~~~~ 128 (135)
T PRK10546 80 HLHAWHVPDFHGELQAHEHQALVWCTPEEALRYPLAPADIPLLEAFMAL 128 (135)
T ss_pred EEEEEEEEEecCcccccccceeEEcCHHHcccCCCCcCcHHHHHHHHHh
Confidence 99999888765433 346778999999999999999999999988764
No 21
>cd03431 DNA_Glycosylase_C DNA glycosylase (MutY in bacteria and hMYH in humans) is responsible for repairing misread A*oxoG residues to C*G by removing the inappropriately paired adenine base from the DNA backbone. It belongs to the Nudix hydrolase superfamily and is important for the repair of various genotoxic lesions. Enzymes belonging to this superfamily requires a divalent cation, such as Mg2+ or Mn2+ for their activity. They are also recognized by a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V). However, DNA glycosylase does not seem to contain this signature motif. DNA glycosylase consists of 2 domains: the N-terminal domain contains the catalytic properties of the enzyme and the C-terminal domain affects substrate (oxoG) binding and enzymatic turnover. The C-terminal domain is highly similar to MutT, based on secondary structure and topology, despite low sequence identity. MutT sanitizes the nucleotide precursor pool by hydrolyzing oxo-dGTP to
Probab=99.88 E-value=5.2e-22 Score=161.78 Aligned_cols=112 Identities=27% Similarity=0.453 Sum_probs=92.7
Q ss_pred eeEEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEE
Q psy9547 237 VFLLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNM 313 (363)
Q Consensus 237 ~~v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l 313 (363)
+..++++|.+ +|++||.|||.+|+|+||||||+++++. +.++++ +++.+++++ ... .+++++|.|||++++|
T Consensus 3 ~~~~~~ii~~-~~~~ll~kR~~~gl~~glwefP~~~~~~~~~~~~~~~~~~~~~~~~---~~~-~~~~~~H~fth~~~~~ 77 (118)
T cd03431 3 RGIAVVVIRN-DGRVLLEKRPEKGLLAGLWEFPSVEWEEEADGEEALLSALKKALRL---SLE-PLGTVKHTFTHFRLTL 77 (118)
T ss_pred EEEEEEEEec-CCeEEEEECCCCCCCCcceeCCCccccCCcCHHHHHHHHHHHHhCc---ccc-cceeEEEecCCeEEEE
Confidence 4566666655 4899999999999999999999999876 556777 888777654 233 6789999999999999
Q ss_pred EEEEEEecCCCCCCCCCCeeecccccCcCCCChHHHHHHHH
Q psy9547 314 DVYSGTAKEKTIPANKTYKLITETQMKKYAFPVPYQKVWKL 354 (363)
Q Consensus 314 ~~~~~~~~~~~~~~~~~~~W~~~~el~~~~~~~a~~~il~~ 354 (363)
++|.++...+. .+.++++|++++++.++++|++++++++.
T Consensus 78 ~~~~~~~~~~~-~~~~~~~W~~~eel~~~~~p~~~~kil~~ 117 (118)
T cd03431 78 HVYLARLEGDL-LAPDEGRWVPLEELDEYALPTVMRKILEL 117 (118)
T ss_pred EEEEEEEeCCC-cCccccEEccHHHHhhCCCCHHHHHHHHh
Confidence 99999887532 34567899999999999999999999865
No 22
>PRK10308 3-methyl-adenine DNA glycosylase II; Provisional
Probab=99.87 E-value=4.8e-21 Score=179.21 Aligned_cols=139 Identities=17% Similarity=0.167 Sum_probs=117.4
Q ss_pred CCChHHHHHHHHHhhhcchhhHHHHHHHHHHh-------------CCCHHHHhcCCHHHHHHHHhccCchH-HHHHHHHH
Q psy9547 33 SNNPYYVWISEVMLQQTQVKTVLPYYEKFIKT-------------YPTIKDFAFDTEDNVLKMWEGLGYYS-RVRNFQAG 98 (363)
Q Consensus 33 ~~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~-------------~pt~~~la~a~~~el~~~~~~lG~~~-rA~~l~~~ 98 (363)
..||||.+|++|++||.++..+.....++.+. ||||++|+++++++|. ++|+++ ||++|+++
T Consensus 110 ~~d~fE~lv~aIigQqisv~~a~~~~~rlv~~~G~~l~~~~~~~~FPtpe~La~~~~~eL~----~~Gl~~~Ra~~L~~l 185 (283)
T PRK10308 110 SVDAFEQGVRAILGQLVSVAMAAKLTAKVAQLYGERLDDFPEYVCFPTPERLAAADPQALK----ALGMPLKRAEALIHL 185 (283)
T ss_pred CCCHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhCccccCCCCccCCCCHHHHHcCCHHHHH----HCCCCHHHHHHHHHH
Confidence 36999999999999999999999999988653 6899999999998875 489985 99999999
Q ss_pred HHHhhHHhCCCCc--------ccHHHHhcCCCCcHHHHHHHHHHhcCCc-c-ccccchHHHHHhhhcccccCcCCcccHH
Q psy9547 99 CRQVIEQFGGEVP--------RDKKQLLSIKGVGDYTAGALASICYNIP-T-PAVDGNVFRIYGRLFEIEDDISKGKTKV 168 (363)
Q Consensus 99 a~~i~~~~~g~~p--------~~~~~L~~lpGIG~~tA~~il~~~~~~~-~-~~vD~~v~Rvl~Rl~~~~~~~~~~~~~~ 168 (363)
|+.+.+ |.++ +.+++|++|||||+|||++|++|++|++ + ++.|.+++|.+ . ..+.+
T Consensus 186 A~~i~~---g~l~l~~~~~~~~~~~~L~~LpGIGpwTA~~vllr~lg~~D~fp~~D~~l~~~~---~--------~~~~~ 251 (283)
T PRK10308 186 ANAALE---GTLPLTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF---P--------GMTPA 251 (283)
T ss_pred HHHHHc---CCCCccccCCHHHHHHHHhcCCCcCHHHHHHHHHHhCCCCCCCCcccHHHHHhc---c--------cCCHH
Confidence 999987 4332 2488999999999999999999999996 3 49999998754 1 12567
Q ss_pred HHHHHHHhcCCcccHHHHHHH
Q psy9547 169 VYETLVSKTMSQTNAREFNQA 189 (363)
Q Consensus 169 ~~~~~~~~~~p~~~~~~~~~~ 189 (363)
++++.++.|.|...+..+|.+
T Consensus 252 ~~~~~a~~w~P~rsya~~~LW 272 (283)
T PRK10308 252 QIRRYAERWKPWRSYALLHIW 272 (283)
T ss_pred HHHHHHHhcCCHHHHHHHHHH
Confidence 788899999998877777755
No 23
>COG0122 AlkA 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair]
Probab=99.85 E-value=1.4e-20 Score=175.89 Aligned_cols=145 Identities=17% Similarity=0.261 Sum_probs=120.0
Q ss_pred CChHHHHHHHHHhhhcchhhHHHHHHHHHHh----------CCCHHHHhcCCHHHHHHHHhccCchH-HHHHHHHHHHHh
Q psy9547 34 NNPYYVWISEVMLQQTQVKTVLPYYEKFIKT----------YPTIKDFAFDTEDNVLKMWEGLGYYS-RVRNFQAGCRQV 102 (363)
Q Consensus 34 ~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~----------~pt~~~la~a~~~el~~~~~~lG~~~-rA~~l~~~a~~i 102 (363)
.|||+.+|++|++||.|++.+.+++++|.+. ||||++|++++++.+. .+|+.. ||++|+++|+++
T Consensus 103 ~d~fe~lv~aI~~QqvS~~~A~~i~~rl~~~~g~~~~~~~~fptpe~l~~~~~~~l~----~~g~s~~Ka~yi~~~A~~~ 178 (285)
T COG0122 103 PDPFEALVRAILSQQVSVAAAAKIWARLVSLYGNALEIYHSFPTPEQLAAADEEALR----RCGLSGRKAEYIISLARAA 178 (285)
T ss_pred CCHHHHHHHHHHHhHhhHHHHHHHHHHHHHHhCCccccccCCCCHHHHHhcCHHHHH----HhCCcHHHHHHHHHHHHHH
Confidence 7999999999999999999999999999862 8999999999997665 588874 999999999999
Q ss_pred hHHhCCC--C--------cccHHHHhcCCCCcHHHHHHHHHHhcCCc-ccc-ccchHHHHHhhhcccccCcCCcccHHHH
Q psy9547 103 IEQFGGE--V--------PRDKKQLLSIKGVGDYTAGALASICYNIP-TPA-VDGNVFRIYGRLFEIEDDISKGKTKVVY 170 (363)
Q Consensus 103 ~~~~~g~--~--------p~~~~~L~~lpGIG~~tA~~il~~~~~~~-~~~-vD~~v~Rvl~Rl~~~~~~~~~~~~~~~~ 170 (363)
.+ |. + .+.++.|++|+|||+|||+|+|+|++|++ ++| .|.++++.+++++..+ ...+.+.+
T Consensus 179 ~~---g~~~~~~l~~~~~e~a~e~L~~i~GIG~WTAe~~llf~lgr~dvfP~~D~~lr~~~~~~~~~~----~~~~~~~~ 251 (285)
T COG0122 179 AE---GELDLSELKPLSDEEAIEELTALKGIGPWTAEMFLLFGLGRPDVFPADDLGLRRAIKKLYRLP----TRPTEKEV 251 (285)
T ss_pred Hc---CCccHHHhccCCHHHHHHHHHcCCCcCHHHHHHHHHHcCCCCCCCChHHHHHHHHHHHHhcCC----CCchHHHH
Confidence 87 42 1 13589999999999999999999999985 555 8889999999998433 23455667
Q ss_pred HHHHHhcCCcccHHHHHHH
Q psy9547 171 ETLVSKTMSQTNAREFNQA 189 (363)
Q Consensus 171 ~~~~~~~~p~~~~~~~~~~ 189 (363)
...++.|-|...+..++.+
T Consensus 252 ~~~~e~w~p~rs~A~~yLw 270 (285)
T COG0122 252 RELAERWGPYRSYAALYLW 270 (285)
T ss_pred HHHHhcccCHHHHHHHHHH
Confidence 8888999886655554433
No 24
>cd03425 MutT_pyrophosphohydrolase The MutT pyrophosphohydrolase is a prototypical Nudix hydrolase that catalyzes the hydrolysis of nucleoside and deoxynucleoside triphosphates (NTPs and dNTPs) by substitution at a beta-phosphorus to yield a nucleotide monophosphate (NMP) and inorganic pyrophosphate (PPi). This enzyme requires two divalent cations for activity; one coordinates the phosphoryl groups of the NTP/dNTP substrate, and the other coordinates to the enzyme. It also contains the Nudix motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), that functions as metal binding and catalytic site. MutT pyrophosphohydrolase is important in preventing errors in DNA replication by hydrolyzing mutagenic nucleotides such as 8-oxo-dGTP (a product of oxidative damage), which can mispair with template adenine during DNA replication, to guanine nucleotides.
Probab=99.81 E-value=1.6e-19 Score=147.82 Aligned_cols=117 Identities=15% Similarity=0.265 Sum_probs=102.1
Q ss_pred eEEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEE
Q psy9547 238 FLLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMD 314 (363)
Q Consensus 238 ~v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~ 314 (363)
.+++++|.+++|++|+.||+.++.++|+|+||||.++. ++++++ ||+.||+|+++.... .++.++|.|+|.++.+.
T Consensus 2 ~~~~~~i~~~~~~~Ll~~r~~~~~~~g~w~~p~G~~~~~e~~~~~a~Re~~EE~g~~~~~~~-~~~~~~~~~~~~~~~~~ 80 (124)
T cd03425 2 EVVAAIIIDDDGRILIAQRPAGKHLGGLWEFPGGKVEPGETPEQALVRELREELGIEVEVGE-LLATVEHDYPDKRVTLH 80 (124)
T ss_pred eEEEEEEECCCCEEEEEEeCCCCCCCCeEeCCCcccCCCCCHHHHHHHHHHHhhCcEEeccc-eEEEEEeeCCCCeEEEE
Confidence 46778887776899999998888999999999999987 678888 999999999887666 78899999999999999
Q ss_pred EEEEEecCCCC--CCCCCCeeecccccCcCCCChHHHHHHHHH
Q psy9547 315 VYSGTAKEKTI--PANKTYKLITETQMKKYAFPVPYQKVWKLF 355 (363)
Q Consensus 315 ~~~~~~~~~~~--~~~~~~~W~~~~el~~~~~~~a~~~il~~l 355 (363)
+|.+....+.. .+..++.|++++++.+++|+++++++++.+
T Consensus 81 ~~~~~~~~~~~~~~e~~~~~W~~~~el~~~~~~~~~~~~l~~~ 123 (124)
T cd03425 81 VFLVELWSGEPQLLEHQELRWVPPEELDDLDFPPADVPIVAAL 123 (124)
T ss_pred EEEEeeeCCCcccccCceEEEeeHHHcccCCCCcccHHHHHhh
Confidence 99998765432 456788999999999999999999999876
No 25
>KOG2875|consensus
Probab=99.80 E-value=2e-19 Score=161.38 Aligned_cols=118 Identities=21% Similarity=0.337 Sum_probs=106.0
Q ss_pred CChHHHHHHHHHhhhcchhhHHHHHHHHHHh---------------CCCHHHHhcCCHHHHHHHHhccCchHHHHHHHHH
Q psy9547 34 NNPYYVWISEVMLQQTQVKTVLPYYEKFIKT---------------YPTIKDFAFDTEDNVLKMWEGLGYYSRVRNFQAG 98 (363)
Q Consensus 34 ~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~---------------~pt~~~la~a~~~el~~~~~~lG~~~rA~~l~~~ 98 (363)
+||++.|+|.|+|++.+++++..+.++|... |||.++|+. .++++.+|.+||..|||||...
T Consensus 115 QdP~E~lfSFiCSSNNNIaRIT~Mve~fc~~fG~~i~~~dg~~~h~FPsl~~L~g---~~~Ea~LR~~gfGYRAkYI~~t 191 (323)
T KOG2875|consen 115 QDPIECLFSFICSSNNNIARITGMVERFCQAFGPRIIQLDGVDYHGFPSLQALAG---PEVEAELRKLGFGYRAKYISAT 191 (323)
T ss_pred cCcHHHHHHHHhcCCCcHHHHHHHHHHHHHhhCcceEeecCcccccCccHHHhcC---cHhHHHHHHcCcchhHHHHHHH
Confidence 7999999999999999999999999988653 799999886 6677888889988899999999
Q ss_pred HHHhhHHhCCC----------CcccHHHHhcCCCCcHHHHHHHHHHhcCCc-cccccchHHHHHhhh
Q psy9547 99 CRQVIEQFGGE----------VPRDKKQLLSIKGVGDYTAGALASICYNIP-TPAVDGNVFRIYGRL 154 (363)
Q Consensus 99 a~~i~~~~~g~----------~p~~~~~L~~lpGIG~~tA~~il~~~~~~~-~~~vD~~v~Rvl~Rl 154 (363)
|++|.++++|. +.+.++.|+.|||||+|+||||++++++.. ++|||+|+.|+++-+
T Consensus 192 a~~l~~~~g~~~wLqsl~~~~yeear~~L~~lpGVG~KVADCI~Lm~l~~~~~VPVDvHi~ria~~y 258 (323)
T KOG2875|consen 192 ARALQEKQGGLAWLQSLRKSSYEEAREALCSLPGVGPKVADCICLMSLDKLSAVPVDVHIWRIAQDY 258 (323)
T ss_pred HHHHHHhcccchHHHHHhcccHHHHHHHHhcCCCCcchHhhhhhhhhcCCCCcccchhhHHHHhhcc
Confidence 99999998872 335799999999999999999999999995 799999999999944
No 26
>PRK08999 hypothetical protein; Provisional
Probab=99.80 E-value=3.6e-19 Score=169.96 Aligned_cols=122 Identities=17% Similarity=0.275 Sum_probs=105.9
Q ss_pred ccceeEEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceE
Q psy9547 234 VKDVFLLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLK 310 (363)
Q Consensus 234 ~~~~~v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~ 310 (363)
++.+++++++|.+++|++|+.||+.+++|+|+|+||||+++. ++.+++ ||+.||+|+.+.... .+.++.|.|+|.+
T Consensus 2 ~~~~~~~~~vi~~~~~~vLL~kR~~~~~~~g~w~~PgG~ve~gE~~~~aa~RE~~EE~Gl~~~~~~-~l~~~~h~~~~~~ 80 (312)
T PRK08999 2 MKRIHVAAGVIRDADGRILLARRPEGKHQGGLWEFPGGKVEPGETVEQALARELQEELGIEVTAAR-PLITVRHDYPDKR 80 (312)
T ss_pred CceeEEEEEEEECCCCeEEEEEecCCCCCCCeEECCccCCCCCCCHHHHHHHHHHHHhCCceecce-eEEEEEEEcCCCe
Confidence 355678888888777899999999999999999999999987 678888 999999999887666 6788999999999
Q ss_pred EEEEEEEEEecCCCC--CCCCCCeeecccccCcCCCChHHHHHHHHHH
Q psy9547 311 WNMDVYSGTAKEKTI--PANKTYKLITETQMKKYAFPVPYQKVWKLFT 356 (363)
Q Consensus 311 ~~l~~~~~~~~~~~~--~~~~~~~W~~~~el~~~~~~~a~~~il~~l~ 356 (363)
+++++|.+....+.. .++.+++|++++++.+++||++++++++.+.
T Consensus 81 ~~i~~y~~~~~~~~~~~~e~~~~~Wv~~~el~~~~~~~~~~~i~~~l~ 128 (312)
T PRK08999 81 VRLDVRRVTAWQGEPHGREGQPLAWVAPDELAVYPFPPANQPIVRALR 128 (312)
T ss_pred EEEEEEEEEEecCcccCccCCccEEecHHHcccCCCCcchHHHHHHhc
Confidence 999999988765432 3567889999999999999999999999874
No 27
>KOG1918|consensus
Probab=99.75 E-value=6.9e-18 Score=146.54 Aligned_cols=166 Identities=17% Similarity=0.281 Sum_probs=131.3
Q ss_pred HHHHHhhcCCCCCCCCCCChHHHHHHHHHhhhcchhhHHHHHHHHHH------hCCCHHHHhcCCHHHHHHHHhccCch-
Q psy9547 17 ILTWYKQNARQLPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIK------TYPTIKDFAFDTEDNVLKMWEGLGYY- 89 (363)
Q Consensus 17 l~~~~~~~~~~lpw~~~~dp~~~lv~~il~qqt~~~~v~~~~~~l~~------~~pt~~~la~a~~~el~~~~~~lG~~- 89 (363)
|....+.+. ++-....+.||+-|+++|++||.+.+++..+|+||.. .||+|+.+..++.+++.+ +||+
T Consensus 57 L~~lv~~~~-p~~~~~~q~Pf~~LiraIlsQQLs~kAansI~~Rfvsl~~g~~~~~~pe~i~~~~~~~lrk----cG~S~ 131 (254)
T KOG1918|consen 57 LVKLVGNHE-PLTFKETQTPFERLIRAILSQQLSGKAANSIYNRFVSLCGGAEKFPTPEFIDPLDCEELRK----CGFSK 131 (254)
T ss_pred HHHHhcCCC-CCCcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCCCchhcCcCCHHHHHH----hCcch
Confidence 334444443 3445677899999999999999999999999999985 489999999999988765 9999
Q ss_pred HHHHHHHHHHHHhhHHhCCCCcc-----------cHHHHhcCCCCcHHHHHHHHHHhcCCc-c-ccccchHHHHHhhhcc
Q psy9547 90 SRVRNFQAGCRQVIEQFGGEVPR-----------DKKQLLSIKGVGDYTAGALASICYNIP-T-PAVDGNVFRIYGRLFE 156 (363)
Q Consensus 90 ~rA~~l~~~a~~i~~~~~g~~p~-----------~~~~L~~lpGIG~~tA~~il~~~~~~~-~-~~vD~~v~Rvl~Rl~~ 156 (363)
+|+.+|+.+|.+..+ |.+|. .++.|..++|||+||+.+++.|+++++ + ++-|..+++-.+.+++
T Consensus 132 rK~~yLh~lA~~~~n---g~I~s~~~i~~mseEeL~~~LT~VKGIg~Wtv~MflIfsL~R~DVmp~dDlgir~g~k~l~g 208 (254)
T KOG1918|consen 132 RKASYLHSLAEAYTN---GYIPSKSGIEKMSEEELIERLTNVKGIGRWTVEMFLIFSLHRPDVMPADDLGIRNGVKKLLG 208 (254)
T ss_pred hhHHHHHHHHHHHhc---CCCCchHHHhhcCHHHHHHHHHhccCccceeeeeeeeeccCCCcccCchhhhHHHHHHHHhC
Confidence 599999999999887 44553 256788899999999999999999996 4 4556789999999998
Q ss_pred cccCcCCcccHHHHHHHHHhcCCcccHHHHH-HHHHHHh
Q psy9547 157 IEDDISKGKTKVVYETLVSKTMSQTNAREFN-QALMDLG 194 (363)
Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~-~~l~~~G 194 (363)
.+. -....+++++.+.+-|.+..+.+. .-+.++.
T Consensus 209 l~~----~p~~~evekl~e~~kpyRtvaawYlWki~~~~ 243 (254)
T KOG1918|consen 209 LKP----LPLPKEVEKLCEKCKPYRTVAAWYLWKIIKLD 243 (254)
T ss_pred CCC----CCchHHHHHHhhhccchHHHHHHHHHhhhhhh
Confidence 763 245678888888888876554433 3354443
No 28
>PF00730 HhH-GPD: HhH-GPD superfamily base excision DNA repair protein This entry corresponds to Endonuclease III This entry corresponds to Alkylbase DNA glycosidase; InterPro: IPR003265 Endonuclease III (4.2.99.18 from EC) is a DNA repair enzyme which removes a number of damaged pyrimidines from DNA via its glycosylase activity and also cleaves the phosphodiester backbone at apurinic / apyrimidinic sites via a beta-elimination mechanism [, ]. The structurally related DNA glycosylase MutY recognises and excises the mutational intermediate 8-oxoguanine-adenine mispair []. The 3-D structures of Escherichia coli endonuclease III [] and catalytic domain of MutY [] have been determined. The structures contain two all-alpha domains: a sequence-continuous, six-helix domain (residues 22-132) and a Greek-key, four-helix domain formed by one N-terminal and three C-terminal helices (residues 1-21 and 133-211) together with the [Fe4S4] cluster. The cluster is bound entirely within the C-terminal loop by four cysteine residues with a ligation pattern Cys-(Xaa)6-Cys-(Xaa)2-Cys-(Xaa)5-Cys which is distinct from all other known Fe4S4 proteins. This structural motif is referred to as a [Fe4S4] cluster loop (FCL) []. Two DNA-binding motifs have been proposed, one at either end of the interdomain groove: the helix-hairpin-helix (HhH) and FCL motifs (see IPR003651 from INTERPRO). The primary role of the iron-sulphur cluster appears to involve positioning conserved basic residues for interaction with the DNA phosphate backbone by forming the loop of the FCL motif [, ]. The HhH-GPD domain gets its name from its hallmark helix-hairpin-helix and Gly/Pro rich loop followed by a conserved aspartate. This domain is found in a diverse range of structurally related DNA repair proteins that include: endonuclease III, 4.2.99.18 from EC and DNA glycosylase MutY, an A/G-specific adenine glycosylase. Both of these enzymes have a C-terminal iron-sulphur cluster loop (FCL). The methyl-CPG binding protein (MBD4) also contain a related domain that is a thymine DNA glycosylase. The family also includes DNA-3-methyladenine glycosylase II 3.2.2.21 from EC, 8-oxoguanine DNA glycosylases and other members of the AlkA family.; GO: 0006284 base-excision repair; PDB: 3F0Z_A 3I0X_A 3F10_A 3I0W_A 3S6I_D 3N5N_Y 1PU7_A 1PU8_B 1PU6_B 1NGN_A ....
Probab=99.75 E-value=1.1e-17 Score=134.87 Aligned_cols=104 Identities=24% Similarity=0.370 Sum_probs=88.7
Q ss_pred HHHHHhhhcchhhHHHHHHHHHHh--CCCHHHHhcCCHHHHHHHHhccCch-HHHHHHHHHHHHhhHHhCCCCcccHHHH
Q psy9547 41 ISEVMLQQTQVKTVLPYYEKFIKT--YPTIKDFAFDTEDNVLKMWEGLGYY-SRVRNFQAGCRQVIEQFGGEVPRDKKQL 117 (363)
Q Consensus 41 v~~il~qqt~~~~v~~~~~~l~~~--~pt~~~la~a~~~el~~~~~~lG~~-~rA~~l~~~a~~i~~~~~g~~p~~~~~L 117 (363)
|++||+|||+++++.+++.+|.+. ||||++|+++++++|.++|+++||+ +||++|+++|+.++
T Consensus 1 V~~Il~qq~s~~~a~~~~~~l~~~~g~pt~~~l~~~~~~el~~~i~~~G~~~~ka~~i~~~a~~~~-------------- 66 (108)
T PF00730_consen 1 VRAILSQQTSIKAARKIYRRLFERYGFPTPEALAEASEEELRELIRPLGFSRRKAKYIIELARAIL-------------- 66 (108)
T ss_dssp HHHHHCTTS-HHHHHHHHHHHHHHHSCSSHHHHHCSHHHHHHHHHTTSTSHHHHHHHHHHHHHHHH--------------
T ss_pred CeeeecCcCcHHHHHHHHHHHHHHhcCCCHHHHHhCCHHHHHHHhhccCCCHHHHHHHHHHHHHhh--------------
Confidence 789999999999999999999998 5999999999999999999999999 59999999999998
Q ss_pred hcCCCCcHHHHHHHHHHhcCCccccccchHHHHHhhhcccccCcCCcccHHHHHHHHHhcCC
Q psy9547 118 LSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETLVSKTMS 179 (363)
Q Consensus 118 ~~lpGIG~~tA~~il~~~~~~~~~~vD~~v~Rvl~Rl~~~~~~~~~~~~~~~~~~~~~~~~p 179 (363)
+...+++++|+|+.|++.|+++++.. .+++.+++.+++.+|
T Consensus 67 -----------------~~~d~~~~~D~~v~r~~~r~~~~~~~----~~~~~~~~~~~e~~~ 107 (108)
T PF00730_consen 67 -----------------GRPDPFPPVDTHVRRVLQRLGGIPEK----KTKEETEKKLEELWP 107 (108)
T ss_dssp -----------------C-SSSS-TTSHHHHHHHHHHTSSSSS----TTHHHHHHHHHHHGT
T ss_pred -----------------hcccceecCcHHHHHHHHHHcCCCCC----CCHHHHHHHHHhhCc
Confidence 22236899999999999999987743 456677777765554
No 29
>PRK15472 nucleoside triphosphatase NudI; Provisional
Probab=99.68 E-value=2.3e-16 Score=133.19 Aligned_cols=117 Identities=12% Similarity=0.204 Sum_probs=83.1
Q ss_pred eeEEEEEEEeCCceEEEeecCC-CccccccccccccCCCC--CchHHH-HHHHhhCCCccccccc-c--c-c-cEEEEcc
Q psy9547 237 VFLLTVVVKTDTNKYLIQKRPT-TGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEK-C--L-G-NVKHVFS 307 (363)
Q Consensus 237 ~~v~~~ii~~~~~~vLl~~R~~-~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~-~--l-~-~v~h~~s 307 (363)
+.+++++|.++ |+|||.||+. ++.++|.|+||||++++ ++++|+ ||++||+|+++..... . + . ...|.|+
T Consensus 4 r~~~~~ii~~~-~~vLl~~R~~~~~~~~g~W~lPgG~ve~gEs~~~aa~REl~EEtGl~~~~~~~~~~~~~~~~~~~~~~ 82 (141)
T PRK15472 4 RTIVCPLIQND-GAYLLCKMADDRGVFPGQWALSGGGVEPGERIEEALRREIREELGEQLLLTEITPWTFRDDIRTKTYA 82 (141)
T ss_pred eeEEEEEEecC-CEEEEEEecccCCCCCCceeCCcccCCCCCCHHHHHHHHHHHHHCCceeeeeeccccccccceeEEec
Confidence 45667777665 8999999976 45799999999999999 889999 9999999998643320 1 0 0 1124443
Q ss_pred ceE---E-E-EEEEEEEecCCC---CCCCCCCeeecccccCcCCCChHHHHHHHH
Q psy9547 308 HLK---W-N-MDVYSGTAKEKT---IPANKTYKLITETQMKKYAFPVPYQKVWKL 354 (363)
Q Consensus 308 h~~---~-~-l~~~~~~~~~~~---~~~~~~~~W~~~~el~~~~~~~a~~~il~~ 354 (363)
+.. + . ..+|.|....+. ..++.+++|++++|+.+++++++++.++..
T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~w~~~~el~~l~~~~~~~~~~~~ 137 (141)
T PRK15472 83 DGRKEEIYMIYLIFDCVSANRDVKINEEFQDYAWVKPEDLVHYDLNVATRKTLRL 137 (141)
T ss_pred CCCceeEEEEEEEEEeecCCCcccCChhhheEEEccHHHhccccccHHHHHHHHH
Confidence 321 1 1 233455443332 235688999999999999999999998864
No 30
>PRK15434 GDP-mannose mannosyl hydrolase NudD; Provisional
Probab=99.64 E-value=8.3e-16 Score=132.15 Aligned_cols=114 Identities=10% Similarity=0.216 Sum_probs=87.2
Q ss_pred EEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccc--cccccccEEEEcc------
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNL--NEKCLGNVKHVFS------ 307 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~--~~~~l~~v~h~~s------ 307 (363)
.+.++|.+++|+||+.||. .+..+|+|+||||++|. ++++|+ ||++||+|+.+.. .. .++...|.|+
T Consensus 19 ~v~~vI~~~~g~VLL~kR~-~~~~~g~W~lPGG~VE~GEt~~~Aa~REl~EEtGl~v~~~~~~-~~~~~~~~~~~~~~~~ 96 (159)
T PRK15434 19 SLDFIVENSRGEFLLGKRT-NRPAQGYWFVPGGRVQKDETLEAAFERLTMAELGLRLPITAGQ-FYGVWQHFYDDNFSGT 96 (159)
T ss_pred EEEEEEECCCCEEEEEEcc-CCCCCCcEECCceecCCCCCHHHHHHHHHHHHHCCccccccce-EEEEEEeecccccCCC
Confidence 4555666666899999996 45678999999999999 889999 9999999998643 33 4554556554
Q ss_pred --ceEEEEEEEEEEecCCCC----CCCCCCeeecccccCcC-CCChHHHHHHHH
Q psy9547 308 --HLKWNMDVYSGTAKEKTI----PANKTYKLITETQMKKY-AFPVPYQKVWKL 354 (363)
Q Consensus 308 --h~~~~l~~~~~~~~~~~~----~~~~~~~W~~~~el~~~-~~~~a~~~il~~ 354 (363)
+.++.+.+|.++...+.+ .++.+++|++++++..+ .+.+.++.++..
T Consensus 97 ~~~~~~i~~~f~~~~~~g~~~~~~~E~~~~~W~~~~el~~~~~~~~~~~~~~~~ 150 (159)
T PRK15434 97 DFTTHYVVLGFRLRVAEEDLLLPDEQHDDYRWLTPDALLASDNVHANSRAYFLA 150 (159)
T ss_pred ccceEEEEEEEEEEecCCcccCChHHeeEEEEEeHHHhhhccccCHHHHHHhcc
Confidence 235777889898876532 25789999999999886 777777777643
No 31
>cd04676 Nudix_Hydrolase_17 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.63 E-value=1.2e-15 Score=125.85 Aligned_cols=110 Identities=9% Similarity=0.067 Sum_probs=81.9
Q ss_pred eEEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCccccccccc---ccEEEEcc----
Q psy9547 238 FLLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCL---GNVKHVFS---- 307 (363)
Q Consensus 238 ~v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l---~~v~h~~s---- 307 (363)
..+.++|.+++|++|+.||+.. |+|+||||+++. ++++|+ ||++||+|+++.... .+ ....|.++
T Consensus 3 ~~v~~ii~~~~~~vLl~~r~~~----~~w~lPgG~v~~~E~~~~aa~REl~EE~Gl~~~~~~-~~~~~~~~~~~~~~~~~ 77 (129)
T cd04676 3 PGVTAVVRDDEGRVLLIRRSDN----GLWALPGGAVEPGESPADTAVREVREETGLDVEVTG-LVGIYTGPVHVVTYPNG 77 (129)
T ss_pred ceEEEEEECCCCeEEEEEecCC----CcEECCeeccCCCCCHHHHHHHHHHHHhCceeEeeE-EEEEeecccceeecCCC
Confidence 3566677777689999999654 999999999998 788899 999999999876544 32 23445554
Q ss_pred --ceEEEEEEEEEEecCC----CCCCCCCCeeecccccCcCCCChHHHHHHH
Q psy9547 308 --HLKWNMDVYSGTAKEK----TIPANKTYKLITETQMKKYAFPVPYQKVWK 353 (363)
Q Consensus 308 --h~~~~l~~~~~~~~~~----~~~~~~~~~W~~~~el~~~~~~~a~~~il~ 353 (363)
|..+. .+|.+...++ ...+..+++|++++++.+++++++++++++
T Consensus 78 ~~~~~~~-~~~~~~~~~~~~~~~~~e~~~~~w~~~~el~~~~~~~~~~~~~~ 128 (129)
T cd04676 78 DVRQYLD-ITFRCRVVGGELRVGDDESLDVAWFDPDGLPPLLMHPSMRLRID 128 (129)
T ss_pred CcEEEEE-EEEEEEeeCCeecCCCCceeEEEEEChhhCccccCCHhHHHHhc
Confidence 33333 4455555443 223556789999999999999999999875
No 32
>cd03427 MTH1 MutT homolog-1 (MTH1) is a member of the Nudix hydrolase superfamily. MTH1, the mammalian counterpart of MutT, hydrolyzes oxidized purine nucleoside triphosphates, such as 8-oxo-dGTP and 2-hydroxy-ATP, to monophosphates, thereby preventing the incorporation of such oxygen radicals during replication. This is an important step in the repair mechanism in genomic and mitochondrial DNA. Like other members of the Nudix family, it requires a divalent cation, such as Mg2+ or Mn2+, for activity, and contain the Nudix motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), that functions as a metal binding and catalytic site. MTH1 is predominantly localized in the cytoplasm and mitochondria. Structurally, this enzyme adopts a similar fold to MutT despite low sequence similarity outside the conserved nudix motif. The most distinctive structural difference between MutT and MTH1 is the presence of a beta-hairpin, which is absent in MutT. This results in a m
Probab=99.61 E-value=3.8e-15 Score=124.83 Aligned_cols=116 Identities=11% Similarity=0.028 Sum_probs=94.9
Q ss_pred EEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccc--eEEEE
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSH--LKWNM 313 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh--~~~~l 313 (363)
++.++|.++ |++|+.+|..++ ..|.|+||||+++. ++.+|+ ||+.||+|+.+.... .++.+.+.++. .+..+
T Consensus 3 ~~~~~i~~~-~~vLL~~r~~~~-~~~~w~~PgG~ve~gEs~~~aa~RE~~EEtGl~~~~~~-~~~~~~~~~~~~~~~~~~ 79 (137)
T cd03427 3 TTLCFIKDP-DKVLLLNRKKGP-GWGGWNGPGGKVEPGETPEECAIRELKEETGLTIDNLK-LVGIIKFPFPGEEERYGV 79 (137)
T ss_pred EEEEEEEEC-CEEEEEEecCCC-CCCeEeCCceeCCCCCCHHHHHHHHHHHhhCeEeecce-EEEEEEEEcCCCCcEEEE
Confidence 455667666 899999997655 88999999999998 788889 999999999988776 67777766664 56778
Q ss_pred EEEEEEecCCCC---CCCCCCeeecccccCcCCCChHHHHHHHHHHh
Q psy9547 314 DVYSGTAKEKTI---PANKTYKLITETQMKKYAFPVPYQKVWKLFTK 357 (363)
Q Consensus 314 ~~~~~~~~~~~~---~~~~~~~W~~~~el~~~~~~~a~~~il~~l~~ 357 (363)
++|.+....+.. .+..+.+|++++++.++++++.++.+++.+.+
T Consensus 80 ~~f~~~~~~~~~~~~~e~~~~~W~~~~el~~~~~~~~~~~~l~~~~~ 126 (137)
T cd03427 80 FVFLATEFEGEPLKESEEGILDWFDIDDLPLLPMWPGDREWLPLMLE 126 (137)
T ss_pred EEEEECCcccccCCCCccccceEEcHhhcccccCCCCcHHHHHHHhC
Confidence 888887665422 34567899999999999999999999998875
No 33
>cd04669 Nudix_Hydrolase_11 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.61 E-value=1.6e-15 Score=124.66 Aligned_cols=101 Identities=13% Similarity=0.165 Sum_probs=76.1
Q ss_pred EEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEEEE
Q psy9547 240 LTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMDVY 316 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~~ 316 (363)
++++|++++|+|||.||..++ .++|+||||+++. ++++|+ ||++||+|+++.... .+..+.+ .+ ...++|
T Consensus 3 ~~~ii~~~~~~vLL~~r~~~~--~~~w~lPGG~ve~gEs~~~a~~REl~EEtGl~~~~~~-~~~~~~~--~~--~~~~~f 75 (121)
T cd04669 3 ASIVIINDQGEILLIRRIKPG--KTYYVFPGGGIEEGETPEEAAKREALEELGLDVRVEE-IFLIVNQ--NG--RTEHYF 75 (121)
T ss_pred eEEEEEeCCCEEEEEEEecCC--CCcEECCceeccCCCCHHHHHHHHHHHhhCeeEeeee-EEEEEee--CC--cEEEEE
Confidence 455666665899999997654 6899999999999 889999 999999999986655 4444333 33 346788
Q ss_pred EEEecCCCCC-------------CCCCCeeecccccCcCCCChH
Q psy9547 317 SGTAKEKTIP-------------ANKTYKLITETQMKKYAFPVP 347 (363)
Q Consensus 317 ~~~~~~~~~~-------------~~~~~~W~~~~el~~~~~~~a 347 (363)
.|+..+|.+. ++.+++|++++++..+++.|.
T Consensus 76 ~~~~~~g~~~~~~~~e~~~~~~~~~~~~~Wv~~~el~~l~~~p~ 119 (121)
T cd04669 76 LARVISGKLGLGVGEEFERQSDDNQYHPVWVDLDQLETIPLRPE 119 (121)
T ss_pred EEEEECCeecCCCchhhcccCCCCceEEEEEEHHHcccCCCCCC
Confidence 8887765221 123468999999999998773
No 34
>cd04696 Nudix_Hydrolase_37 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.60 E-value=4.8e-15 Score=122.39 Aligned_cols=110 Identities=6% Similarity=0.069 Sum_probs=82.9
Q ss_pred EEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEcc-----ceE
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFS-----HLK 310 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~s-----h~~ 310 (363)
.+.++|.+.+|++|+.||.. ++|.|+||||+++. ++++++ ||++||+|+++.... .+....+.|+ +.+
T Consensus 4 ~v~~~i~~~~~~iLL~r~~~---~~~~w~lPGG~ve~gEs~~~aa~REl~EEtGl~~~~~~-~~~~~~~~~~~~~~~~~~ 79 (125)
T cd04696 4 TVGALIYAPDGRILLVRTTK---WRGLWGVPGGKVEWGETLEEALKREFREETGLKLRDIK-FAMVQEAIFSEEFHKPAH 79 (125)
T ss_pred EEEEEEECCCCCEEEEEccC---CCCcEeCCceeccCCCCHHHHHHHHHHHHhCCcccccc-eEEEEEEeccCCCCCccE
Confidence 45666777668999999843 57999999999998 888999 999999999887554 4443344442 234
Q ss_pred EEEEEEEEEecCCCC---CCCCCCeeecccccCcCCCChHHHHHH
Q psy9547 311 WNMDVYSGTAKEKTI---PANKTYKLITETQMKKYAFPVPYQKVW 352 (363)
Q Consensus 311 ~~l~~~~~~~~~~~~---~~~~~~~W~~~~el~~~~~~~a~~~il 352 (363)
+.+..|.+....+.. .++.+++|++++++.++++++++..++
T Consensus 80 ~~~~~~~~~~~~~~~~~~~e~~~~~W~~~~el~~~~~~~~~~~~~ 124 (125)
T cd04696 80 FVLFDFFARTDGTEVTPNEEIVEWEWVTPEEALDYPLNSFTRLLL 124 (125)
T ss_pred EEEEEEEEEecCCcccCCcccceeEEECHHHHhcCCCCHHHHHHh
Confidence 455667776554322 356788999999999999999998876
No 35
>cd04673 Nudix_Hydrolase_15 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.58 E-value=5.3e-15 Score=121.16 Aligned_cols=107 Identities=8% Similarity=0.068 Sum_probs=83.3
Q ss_pred EEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEcc------ceE
Q psy9547 240 LTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFS------HLK 310 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~s------h~~ 310 (363)
++++|.++ +++|+.||. .+.+.|.|+||||+++. ++++|+ ||++||+|+++.... .++...+.++ ..+
T Consensus 3 v~~ii~~~-~~vLl~~r~-~~~~~~~w~~PgG~ie~gE~~~~aa~RE~~EEtGl~~~~~~-~~~~~~~~~~~~~~~~~~~ 79 (122)
T cd04673 3 VGAVVFRG-GRVLLVRRA-NPPDAGLWSFPGGKVELGETLEQAALRELLEETGLEAEVGR-LLTVVDVIERDAAGRVEFH 79 (122)
T ss_pred EEEEEEEC-CEEEEEEEc-CCCCCCeEECCCcccCCCCCHHHHHHHHHHHhhCcEeeece-eEEEEEEeeccCCCccceE
Confidence 34455565 899999995 45578999999999998 888999 999999999987666 6666666552 345
Q ss_pred EEEEEEEEEecCCCC---CCCCCCeeecccccCcCCCChHHH
Q psy9547 311 WNMDVYSGTAKEKTI---PANKTYKLITETQMKKYAFPVPYQ 349 (363)
Q Consensus 311 ~~l~~~~~~~~~~~~---~~~~~~~W~~~~el~~~~~~~a~~ 349 (363)
+.+++|.+....+.. .+..+++|++++++.++++++.++
T Consensus 80 ~~~~~~~~~~~~~~~~~~~E~~~~~w~~~~el~~~~~~~~~~ 121 (122)
T cd04673 80 YVLIDFLCRYLGGEPVAGDDALDARWVPLDELAALSLTESTR 121 (122)
T ss_pred EEEEEEEEEeCCCcccCCcccceeEEECHHHHhhCcCCcccc
Confidence 677788887665422 355778999999999999988764
No 36
>cd03430 GDPMH GDP-mannose glycosyl hydrolase (AKA GDP-mannose mannosyl hydrolase (GDPMH)) is a member of the Nudix hydrolase superfamily. This class of enzymes is unique from other members of the superfamily in two aspects. First, it contains a modified Nudix signature sequence. The slight changes to the conserved sequence motif, GX5EX7REUXEEXGU, where U = I, L or V), are believed to contribute to the removal of all magnesium binding sites but one, retaining only the metal site that coordinates the pyrophosphate of the substrate. Secondly, it is not a pyrophosphatase that substitutes at a phosphorus; instead, it hydrolyzes nucleotide sugars such as GDP-mannose to GDP and mannose, cleaving the phosphoglycosyl bond by substituting at a carbon position. GDP-mannose provides mannosyl components for cell wall synthesis and is required for the synthesis of other glycosyl donors (such as GDP-fucose and colitose) for the cell wall. The importance of GDP-sugar hydrolase activities is thus close
Probab=99.56 E-value=1.7e-14 Score=122.28 Aligned_cols=102 Identities=14% Similarity=0.264 Sum_probs=78.7
Q ss_pred EEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCccccc--ccccccEEEEccc-----
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLN--EKCLGNVKHVFSH----- 308 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~--~~~l~~v~h~~sh----- 308 (363)
.+.++|.+.+|++|+.||.. ....|+|+||||+++. +.++|+ ||++||+|+++... . .++...|.|++
T Consensus 14 ~v~~vI~~~~g~vLl~~R~~-~p~~g~w~lPGG~ve~gEs~~~aa~RE~~EE~Gl~v~~~~~~-~l~~~~~~~~~~~~~~ 91 (144)
T cd03430 14 SIDLIVENEDGQYLLGKRTN-RPAQGYWFVPGGRIRKNETLTEAFERIAKDELGLEFLISDAE-LLGVFEHFYDDNFFGD 91 (144)
T ss_pred EEEEEEEeCCCeEEEEEccC-CCCCCcEECCCceecCCCCHHHHHHHHHHHHHCCCcccccce-EEEEEEEEeccccccC
Confidence 45556667668999999964 4579999999999999 888999 99999999988766 5 66666665542
Q ss_pred ---eEEEEEEEEEEecCCC----CCCCCCCeeecccccCcC
Q psy9547 309 ---LKWNMDVYSGTAKEKT----IPANKTYKLITETQMKKY 342 (363)
Q Consensus 309 ---~~~~l~~~~~~~~~~~----~~~~~~~~W~~~~el~~~ 342 (363)
.++...+|.|....+. ..++.+++|++++++...
T Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~W~~~~el~~~ 132 (144)
T cd03430 92 DFSTHYVVLGYVLKLSSNELLLPDEQHSEYQWLTSDELLAD 132 (144)
T ss_pred CCccEEEEEEEEEEEcCCcccCCchhccEeEEecHHHHhcC
Confidence 2456677888776543 235688999999999865
No 37
>cd04691 Nudix_Hydrolase_32 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.55 E-value=1.5e-14 Score=118.25 Aligned_cols=100 Identities=15% Similarity=0.043 Sum_probs=77.6
Q ss_pred EEEEEEEeCCceEEEeecCCCc-cccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEE
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTG-LLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMD 314 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~-~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~ 314 (363)
+++++|.++ ++||+.||...+ ...|.|+||||+++. ++++|+ ||++||+|+++.... .+..+.+..+ ....++
T Consensus 2 ~v~~vi~~~-~~vLL~rR~~~~~~~~g~w~lPgG~ve~gE~~~~aa~REl~EEtGl~~~~~~-~l~~~~~~~~-~~~~~~ 78 (117)
T cd04691 2 GVVGVLFSD-DKVLLERRSLTKNADPGKLNIPGGHIEAGESQEEALLREVQEELGVDPLSYT-YLCSLYHPTS-ELQLLH 78 (117)
T ss_pred eEEEEEEEC-CEEEEEEeCCCCCCCCCeEECcceeecCCCCHHHHHHHHHHHHHCCCcccce-EEEEEeccCC-CeEEEE
Confidence 567777776 899999997655 589999999999998 788999 999999999865444 5555555444 346678
Q ss_pred EEEEEecCCCC--CCCCCCeeecccccCc
Q psy9547 315 VYSGTAKEKTI--PANKTYKLITETQMKK 341 (363)
Q Consensus 315 ~~~~~~~~~~~--~~~~~~~W~~~~el~~ 341 (363)
+|.+...++.+ .+..++.|++++++..
T Consensus 79 ~~~~~~~~~~~~~~E~~~~~W~~~~~l~~ 107 (117)
T cd04691 79 YYVVTFWQGEIPAQEAAEVHWMTANDIVL 107 (117)
T ss_pred EEEEEEecCCCCcccccccEEcCHHHcch
Confidence 88888766532 4667899999999863
No 38
>cd04684 Nudix_Hydrolase_25 Contains a crystal structure of the Nudix hydrolase from Enterococcus faecalis, which has an unknown function. In general, members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity. They also contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which forms a structural motif that functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability
Probab=99.55 E-value=3e-14 Score=117.58 Aligned_cols=110 Identities=9% Similarity=0.042 Sum_probs=86.5
Q ss_pred EEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccc------eE
Q psy9547 240 LTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSH------LK 310 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh------~~ 310 (363)
+.++|.++ ++||+.||...+ .+|.|+||||+++. ++++|+ ||+.||+|+++.... .++...|.|.. .+
T Consensus 3 ~~~ii~~~-~~vLl~~~~~~~-~~~~w~lPgG~ve~gE~~~~aa~RE~~EEtGl~~~~~~-~~~~~~~~~~~~~~~~~~~ 79 (128)
T cd04684 3 AYAVIPRD-GKLLLIQKNGGP-YEGRWDLPGGGIEPGESPEEALHREVLEETGLTVEIGR-RLGSASRYFYSPDGDYDAH 79 (128)
T ss_pred eEEEEEeC-CEEEEEEccCCC-CCCeEECCCcccCCCCCHHHHHHHHHHHHhCcEeecce-eeeEEEEEEECCCCCeecc
Confidence 45667776 999999996655 88999999999999 888999 999999999987766 67776665532 24
Q ss_pred EEEEEEEEEecCCCC------CCCCCCeeecccccCcCCCChHHHHHH
Q psy9547 311 WNMDVYSGTAKEKTI------PANKTYKLITETQMKKYAFPVPYQKVW 352 (363)
Q Consensus 311 ~~l~~~~~~~~~~~~------~~~~~~~W~~~~el~~~~~~~a~~~il 352 (363)
...++|.+....+.. .+..+.+|++++++....+.+...+.+
T Consensus 80 ~~~~~f~~~~~~~~~~~~~~~~e~~~~~W~~~~~l~~~~~~~~~~~a~ 127 (128)
T cd04684 80 HLCVFYDARVVGGALPVQEPGEDSHGAAWLPLDEAIERLLSPLVLWAV 127 (128)
T ss_pred EEEEEEEEEEecCccccCCCCCCceeeEEECHHHhhccCCCHHHHHhh
Confidence 567788888765431 244678999999999999988887654
No 39
>PRK09438 nudB dihydroneopterin triphosphate pyrophosphatase; Provisional
Probab=99.55 E-value=1.9e-14 Score=122.43 Aligned_cols=119 Identities=8% Similarity=0.094 Sum_probs=86.3
Q ss_pred eEEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCccccccc-ccc-----------cE
Q psy9547 238 FLLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEK-CLG-----------NV 302 (363)
Q Consensus 238 ~v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~-~l~-----------~v 302 (363)
..++++|.+.+|++|+.||..+ .|.|+||||++|. ++.+|+ ||++||+|+++..... .+. ..
T Consensus 8 ~~v~~vi~~~~~~vLl~~r~~~---~~~W~lPgG~ve~gEs~~~aa~REl~EEtGl~~~~~~~~~~~~~~~~~~~~~~~~ 84 (148)
T PRK09438 8 VSVLVVIYTPDLGVLMLQRADD---PDFWQSVTGSLEEGETPAQTAIREVKEETGIDVLAEQLTLIDCQRSIEYEIFPHW 84 (148)
T ss_pred eEEEEEEEeCCCeEEEEEecCC---CCcEeCCcccCCCCCCHHHHHHHHHHHHhCcCccccceeecccccccccccchhh
Confidence 3455666666789999998543 4899999999998 888999 9999999998732210 111 01
Q ss_pred EEEcc--ceEEEEEEEEEEecCC---CCCCCCCCeeecccccCcCCCChHHHHHHHHHHhhh
Q psy9547 303 KHVFS--HLKWNMDVYSGTAKEK---TIPANKTYKLITETQMKKYAFPVPYQKVWKLFTKSK 359 (363)
Q Consensus 303 ~h~~s--h~~~~l~~~~~~~~~~---~~~~~~~~~W~~~~el~~~~~~~a~~~il~~l~~~~ 359 (363)
.|.++ ..+...++|.+....+ ...++.+++|++++++.++.+.+.++.+++.+.+..
T Consensus 85 ~~~~~~~~~~~~~~~f~~~~~~~~~~~~~E~~~~~W~~~~e~~~~~~~~~~~~~l~~~~~~~ 146 (148)
T PRK09438 85 RHRYAPGVTRNTEHWFCLALPHERPVVLTEHLAYQWLDAREAAALTKSWSNAEAIEQLVIRL 146 (148)
T ss_pred hhccccccCCceeEEEEEecCCCCccccCcccceeeCCHHHHHHHhcChhHHHHHHHHHHHh
Confidence 11222 1245567888876543 234778899999999999999999999999887654
No 40
>cd04679 Nudix_Hydrolase_20 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.55 E-value=2.3e-14 Score=118.32 Aligned_cols=110 Identities=13% Similarity=0.108 Sum_probs=84.1
Q ss_pred EEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEcc--ceEEEEE
Q psy9547 240 LTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFS--HLKWNMD 314 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~s--h~~~~l~ 314 (363)
++++|.+.+|++|+.+|.. ....|.|+||||+++. ++++|+ ||++||+|+++...+ .+....|.++ ..++...
T Consensus 5 ~~~~i~~~~~~vLL~~r~~-~~~~~~w~lPgG~ve~gEt~~eaa~RE~~EEtGl~~~~~~-~~~~~~~~~~~~~~~~~~~ 82 (125)
T cd04679 5 CGAAILRDDGKLLLVKRLR-APEAGHWGIPGGKVDWMEAVEDAVVREIEEETGLSIHSTR-LLCVVDHIIEEPPQHWVAP 82 (125)
T ss_pred EEEEEECCCCEEEEEEecC-CCCCCeEeCCeeeccCCCCHHHHHHHHHHHHHCCCcccce-EEEEEeecccCCCCeEEEE
Confidence 4455667668999999954 3457999999999999 888999 999999999987766 6766666554 4566677
Q ss_pred EEEEEecCCCC-----CCCCCCeeecccccCcCCCChHHHHHH
Q psy9547 315 VYSGTAKEKTI-----PANKTYKLITETQMKKYAFPVPYQKVW 352 (363)
Q Consensus 315 ~~~~~~~~~~~-----~~~~~~~W~~~~el~~~~~~~a~~~il 352 (363)
+|.+....+.. .++.+++|++++++.+ ++.++++..+
T Consensus 83 ~f~~~~~~~~~~~~~~~E~~~~~W~~~~~l~~-~l~~~~~~~~ 124 (125)
T cd04679 83 VYLAENFSGEPRLMEPDKLLELGWFALDALPQ-PLTRATRDAV 124 (125)
T ss_pred EEEEeecCCccccCCCccccEEEEeCHHHCCc-hhHHHHHHHh
Confidence 88888765422 2457789999999987 7777766543
No 41
>cd04699 Nudix_Hydrolase_39 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.54 E-value=2.2e-14 Score=118.57 Aligned_cols=106 Identities=13% Similarity=0.081 Sum_probs=77.9
Q ss_pred EEEEEEEeCCceEEEeecCCCcc-ccccccccccCCCC--CchHHH-HHHHhhCCCcccccccc-cccEEEEccc-eEEE
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTGL-LSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKC-LGNVKHVFSH-LKWN 312 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~~-~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~-l~~v~h~~sh-~~~~ 312 (363)
+++++|.+++|++|+.||+..+. ++|+|+||||.++. ++.+|+ ||++||+|+++.... . ...+.|.++. .+..
T Consensus 3 ~v~~vv~~~~~~iLl~kr~~~~~~~~g~w~~PgG~ve~gEs~~~aa~RE~~EE~Gl~~~~~~-~~~~~~~~~~~~~~~~~ 81 (129)
T cd04699 3 AVAALIVKDVGRILILKRSKDERTAPGKWELPGGKVEEGETFEEALKREVYEETGLTVTPFL-RYPSTVTHEDSGVYNVI 81 (129)
T ss_pred eEEEEEECCCCcEEEEEecCCCCCCCCcCcCCccCccCCCCHHHHHHHHHHHhhCcEEEeee-eeeEEEEEcCCCEEEEE
Confidence 34556666658999999987775 59999999999998 788888 999999999887655 3 3456676663 3344
Q ss_pred EEEEEEEecCC---CCCCCCCCeeecccccCcCCCC
Q psy9547 313 MDVYSGTAKEK---TIPANKTYKLITETQMKKYAFP 345 (363)
Q Consensus 313 l~~~~~~~~~~---~~~~~~~~~W~~~~el~~~~~~ 345 (363)
..+|.+....+ ...+..+++|++++++..++++
T Consensus 82 ~~~~~~~~~~~~~~~~~e~~~~~w~~~~el~~~~~~ 117 (129)
T cd04699 82 YLVFVCEALSGAVKLSDEHEEYAWVTLEELAILKAD 117 (129)
T ss_pred EEEEEeeecCCcccCChhheEEEEecHHHhhhhhcc
Confidence 45566655432 1235577899999998766665
No 42
>cd04683 Nudix_Hydrolase_24 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.52 E-value=4e-14 Score=115.86 Aligned_cols=100 Identities=8% Similarity=0.089 Sum_probs=71.8
Q ss_pred EEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccc-cccccEEEEccc-eEEEEEE
Q psy9547 241 TVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNE-KCLGNVKHVFSH-LKWNMDV 315 (363)
Q Consensus 241 ~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~-~~l~~v~h~~sh-~~~~l~~ 315 (363)
.++|.++ |+|||.||...+...|+|+||||+++. ++++|+ ||++||+|+.+.... ..+..+.+.+.. .+....+
T Consensus 4 ~~vi~~~-~~vLL~~r~~~~~~~~~w~lPgG~ve~gE~~~~aa~REl~EEtGl~v~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (120)
T cd04683 4 YVLLRRD-DEVLLQRRANTGYMDGQWALPAGHLEKGEDAVTAAVREAREEIGVTLDPEDLRLAHTMHRRTEDIESRIGLF 82 (120)
T ss_pred EEEEEEC-CEEEEEEccCCCCCCCeEeCCccccCCCCCHHHHHHHHHHHHHCCccChhheEEEEEEEecCCCCceEEEEE
Confidence 3445555 899999998778889999999999988 788888 999999999876322 034444444432 3444556
Q ss_pred EEEEecCCCC-----CCCCCCeeecccccCc
Q psy9547 316 YSGTAKEKTI-----PANKTYKLITETQMKK 341 (363)
Q Consensus 316 ~~~~~~~~~~-----~~~~~~~W~~~~el~~ 341 (363)
|.+....+.+ .+..+++|++++++..
T Consensus 83 f~~~~~~~~~~~~~~~e~~~~~W~~~~~l~~ 113 (120)
T cd04683 83 FTVRRWSGEPRNCEPDKCAELRWFPLDALPD 113 (120)
T ss_pred EEEEeecCccccCCCCcEeeEEEEchHHCcc
Confidence 7777654322 2456789999999975
No 43
>cd04682 Nudix_Hydrolase_23 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.52 E-value=2.6e-14 Score=117.55 Aligned_cols=105 Identities=10% Similarity=-0.022 Sum_probs=77.6
Q ss_pred EEEEEEEeCCceEEEeecCCC--ccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEE
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTT--GLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNM 313 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~--~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l 313 (363)
++++++.++ |++|+.+|... ..+.|+|+||||+++. ++++|+ ||+.||+|+++.... ......+.+......+
T Consensus 3 v~~~~~~~~-g~vLl~~r~~~~~~~~~g~w~~PgG~ve~gE~~~~aa~RE~~EE~Gl~~~~~~-~~~~~~~~~~~~~~~~ 80 (122)
T cd04682 3 VALALLIGD-GRLLLQLRDDKPGIPYPGHWDLPGGHREGGETPLECVLRELLEEIGLTLPESR-IPWFRVYPSASPPGTE 80 (122)
T ss_pred eEEEEEEcC-CEEEEEEccCCCCCCCCCcEeCCCccccCCCCHHHHHHHHHHHHhCCcccccc-cceeEecccCCCCceE
Confidence 566666665 89999999764 5789999999999999 888999 999999999876433 2223334444445667
Q ss_pred EEEEEEecCCC-----CCCCCCCeeecccccCcCCCC
Q psy9547 314 DVYSGTAKEKT-----IPANKTYKLITETQMKKYAFP 345 (363)
Q Consensus 314 ~~~~~~~~~~~-----~~~~~~~~W~~~~el~~~~~~ 345 (363)
++|.+....+. ..++.++.|++++|+.+...+
T Consensus 81 ~~f~~~~~~~~~~~~~~~E~~~~~W~~~~el~~~~~~ 117 (122)
T cd04682 81 HVFVVPLTAREDAILFGDEGQALRLMTVEEFLAHEDA 117 (122)
T ss_pred EEEEEEEecCCCccccCchhheeecccHHHHhhcccc
Confidence 78888766431 235678999999999775444
No 44
>cd03673 Ap6A_hydrolase Diadenosine hexaphosphate (Ap6A) hydrolase is a member of the Nudix hydrolase superfamily. Ap6A hydrolase specifically hydrolyzes diadenosine polyphosphates, but not ATP or diadenosine triphosphate, and it generates ATP as the product. Ap6A, the most preferred substrate, hydrolyzes to produce two ATP molecules, which is a novel hydrolysis mode for Ap6A. These results indicate that Ap6A hydrolase is a diadenosine polyphosphate hydrolase. It requires the presence of a divalent cation, such as Mn2+, Mg2+, Zn2+, and Co2+, for activity. Members of the Nudix superfamily are recognized by a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which forms a structural motif that functions as a metal binding and catalytic site.
Probab=99.51 E-value=6.9e-14 Score=115.85 Aligned_cols=113 Identities=10% Similarity=0.071 Sum_probs=88.2
Q ss_pred EEEEEEEeCC---ceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEcc----c
Q psy9547 239 LLTVVVKTDT---NKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFS----H 308 (363)
Q Consensus 239 v~~~ii~~~~---~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~s----h 308 (363)
.+.++|.+.+ ++||+.+|..+ |.|+||||.+++ +.++|+ ||++||+|+++.... .+..+.+.++ .
T Consensus 3 ~a~~ii~~~~~~~~~vLl~~~~~~----~~w~~PgG~v~~gEs~~~aa~REl~EEtGl~~~~~~-~~~~~~~~~~~~~~~ 77 (131)
T cd03673 3 AAGGVVFRGSDGGIEVLLIHRPRG----DDWSLPKGKLEPGETPPEAAVREVEEETGIRAEVGD-PLGTIRYWFSSSGKR 77 (131)
T ss_pred eEEEEEEEccCCCeEEEEEEcCCC----CcccCCCCccCCCCCHHHHHHHHHhhhhCCceEecc-eEEEEEEeccCCCCC
Confidence 3455566554 79999999543 899999999998 888888 999999999988776 7777777666 4
Q ss_pred eEEEEEEEEEEecCCCC-----CCCCCCeeecccccCcCCCChHHHHHHHHHH
Q psy9547 309 LKWNMDVYSGTAKEKTI-----PANKTYKLITETQMKKYAFPVPYQKVWKLFT 356 (363)
Q Consensus 309 ~~~~l~~~~~~~~~~~~-----~~~~~~~W~~~~el~~~~~~~a~~~il~~l~ 356 (363)
....+++|.+....+.. .+..++.|++++++.++...+.++.+++...
T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~E~~~~~W~~~~el~~~~~~~~~~~~l~~~~ 130 (131)
T cd03673 78 VHKTVHWWLMRALGGEFTPQPDEEVDEVRWLPPDEARDRLSYPNDRELLRAAL 130 (131)
T ss_pred cceEEEEEEEEEcCCCcccCCCCcEEEEEEcCHHHHHHHcCCHhHHHHHHHhh
Confidence 56677888887665422 2446789999999999888899998887653
No 45
>cd04664 Nudix_Hydrolase_7 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate s
Probab=99.50 E-value=5.5e-14 Score=116.65 Aligned_cols=111 Identities=9% Similarity=0.121 Sum_probs=83.5
Q ss_pred EEEEEEEeC--CceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEE----EEccc-
Q psy9547 239 LLTVVVKTD--TNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVK----HVFSH- 308 (363)
Q Consensus 239 v~~~ii~~~--~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~----h~~sh- 308 (363)
++.+++.+. +++||+.||... +.|.|+||||+++. ++.+|+ ||++||+|+.+.... .+.... +.+++
T Consensus 3 ~~~v~~~~~~~~~~vLL~~r~~~--~~~~w~~PgG~ve~~Es~~~aa~RE~~EE~Gl~~~~~~-~~~~~~~~~~~~~~~~ 79 (129)
T cd04664 3 SVLVVPYRLTGEGRVLLLRRSDK--YAGFWQSVTGGIEDGESPAEAARREVAEETGLDPERLT-LLDRGASIAFVEFTDN 79 (129)
T ss_pred EEEEEEEEeCCCCEEEEEEeCCC--CCCcccccCcccCCCCCHHHHHHHHHHHHHCCChhheE-EEeecccccccccCCC
Confidence 455555655 689999999665 89999999999999 888999 999999999875443 333221 22322
Q ss_pred -eEEEEEEEEEEecCCC----CCCCCCCeeecccccCcCCCChHHHHHH
Q psy9547 309 -LKWNMDVYSGTAKEKT----IPANKTYKLITETQMKKYAFPVPYQKVW 352 (363)
Q Consensus 309 -~~~~l~~~~~~~~~~~----~~~~~~~~W~~~~el~~~~~~~a~~~il 352 (363)
.++.+++|.+....+. ..++.++.|++++++.++...+.+++++
T Consensus 80 ~~~~~~~~f~~~~~~~~~~~~~~E~~~~~W~~~~e~~~~~~~~~~~~~~ 128 (129)
T cd04664 80 GRVWTEHPFAFHLPSDAVVTLDWEHDAFEWVPPEEAAALLLWESNRRAW 128 (129)
T ss_pred ceEEEEeEEEEEcCCCCcccCCccccccEecCHHHHHHHHcChhhhhhh
Confidence 4567888998876532 1356789999999999998888888765
No 46
>cd04681 Nudix_Hydrolase_22 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.50 E-value=7.2e-14 Score=115.96 Aligned_cols=112 Identities=11% Similarity=0.103 Sum_probs=77.8
Q ss_pred EEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEcc--ceE--EE
Q psy9547 240 LTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFS--HLK--WN 312 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~s--h~~--~~ 312 (363)
+.++|.+.+|++|+.+|...+ ..|.|+||||+++. +..+++ ||++||+|+++.... .+++..+.|+ ... ..
T Consensus 4 v~~~i~~~~~~vLL~~r~~~~-~~~~w~~PgG~ve~gEs~~~aa~RE~~EEtGl~~~~~~-~~~~~~~~~~~~~~~~~~~ 81 (130)
T cd04681 4 VGVLILNEDGELLVVRRAREP-GKGTLDLPGGFVDPGESAEEALIREIREETGLKVTELS-YLFSLPNTYPYGGMEYDTL 81 (130)
T ss_pred EEEEEEcCCCcEEEEEecCCC-CCCcEeCCceeecCCCCHHHHHHHHHHHHhCCccccee-EEEeecceeeeCCceeEEE
Confidence 445666767899999996544 68999999999998 788888 999999999887655 5555544443 222 22
Q ss_pred EEEEEEEecCCC----CCCCCCCeeecccccC--cCCCChHHHHHHHH
Q psy9547 313 MDVYSGTAKEKT----IPANKTYKLITETQMK--KYAFPVPYQKVWKL 354 (363)
Q Consensus 313 l~~~~~~~~~~~----~~~~~~~~W~~~~el~--~~~~~~a~~~il~~ 354 (363)
..+|.+.+..+. ..+..+++|++++++. ++.+ +..+..++.
T Consensus 82 ~~~~~~~~~~~~~~~~~~e~~~~~W~~~~el~~~~~~~-~~~~~~~~~ 128 (130)
T cd04681 82 DLFFVCQVDDKPIVKAPDDVAELKWVVPQDIELENFAF-PSIRQAVER 128 (130)
T ss_pred EEEEEEEeCCCCCcCChHHhheeEEecHHHCCcccCCc-HHHHHHHHh
Confidence 235677765431 1345678999999995 3554 345555554
No 47
>cd04680 Nudix_Hydrolase_21 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.49 E-value=8.4e-14 Score=113.71 Aligned_cols=108 Identities=13% Similarity=0.112 Sum_probs=81.7
Q ss_pred EEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCccc-ccccccccEEEEccceEEEEEE
Q psy9547 240 LTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVN-LNEKCLGNVKHVFSHLKWNMDV 315 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~-~~~~~l~~v~h~~sh~~~~l~~ 315 (363)
+.++|.+.+|++|+.||...+ .|+||||+++. ++++|+ ||++||+|+.+. ... .++.+.+.++.....+++
T Consensus 3 ~~~~i~~~~~~vLL~~r~~~~----~w~~PgG~ve~gEt~~~aa~REl~EEtG~~~~~~~~-~~~~~~~~~~~~~~~~~~ 77 (120)
T cd04680 3 ARAVVTDADGRVLLVRHTYGP----GWYLPGGGLERGETFAEAARRELLEELGIRLAVVAE-LLGVYYHSASGSWDHVIV 77 (120)
T ss_pred eEEEEECCCCeEEEEEECCCC----cEeCCCCcCCCCCCHHHHHHHHHHHHHCCccccccc-eEEEEecCCCCCceEEEE
Confidence 345666766899999996543 89999999999 889999 999999999987 655 666666666555556788
Q ss_pred EEEEecCCC-C---CCCCCCeeecccccCcCCCChHHHHHHH
Q psy9547 316 YSGTAKEKT-I---PANKTYKLITETQMKKYAFPVPYQKVWK 353 (363)
Q Consensus 316 ~~~~~~~~~-~---~~~~~~~W~~~~el~~~~~~~a~~~il~ 353 (363)
|.+....+. . .+..+++|++++++.+ .+.++.+..++
T Consensus 78 f~~~~~~~~~~~~~~E~~~~~w~~~~~l~~-~~~~~~~~~~~ 118 (120)
T cd04680 78 FRARADTQPVIRPSHEISEARFFPPDALPE-PTTPATRRRIA 118 (120)
T ss_pred EEecccCCCccCCcccEEEEEEECHHHCcc-cCChHHHHHhh
Confidence 888876542 1 2446789999999998 55566655543
No 48
>cd02883 Nudix_Hydrolase Nudix hydrolase is a superfamily of enzymes found in all three kingdoms of life, and it catalyzes the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+ for their activity. Members of this family are recognized by a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which forms a structural motif that functions as a metal binding and catalytic site. Substrates of nudix hydrolase include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance and "house-cleaning" enzy
Probab=99.49 E-value=7.1e-14 Score=113.39 Aligned_cols=111 Identities=8% Similarity=0.054 Sum_probs=85.8
Q ss_pred EEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccc-cccccEEEEccceEEEEEE
Q psy9547 240 LTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNE-KCLGNVKHVFSHLKWNMDV 315 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~-~~l~~v~h~~sh~~~~l~~ 315 (363)
+.+++.+.++++|+.||+.+ ++|+|+||||.++. ++.+++ ||+.||+|+.+.... .......|.++|....+.+
T Consensus 3 ~~~i~~~~~~~ill~kr~~~--~~~~~~~p~G~~~~~e~~~~~a~RE~~EE~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (123)
T cd02883 3 VGAVILDEDGRVLLVRRADS--PGGLWELPGGGVEPGETLEEAAIREVREETGLDVDVLRLLGVYEVESPDEGEHAVVFV 80 (123)
T ss_pred eEEEEECCCCCEEEEEEcCC--CCCeEeCCcccccCCCCHHHHHHHHHHHhhCccceeeeEEEEEEeeccCCCceEEEEE
Confidence 44555666589999999766 89999999999988 667888 999999999875333 0233455555688999999
Q ss_pred EEEEecCCCC-----CCCCCCeeecccccCcCCCChHHHHHH
Q psy9547 316 YSGTAKEKTI-----PANKTYKLITETQMKKYAFPVPYQKVW 352 (363)
Q Consensus 316 ~~~~~~~~~~-----~~~~~~~W~~~~el~~~~~~~a~~~il 352 (363)
|.+....+.. .+..+.+|++++++.+++.+++.++++
T Consensus 81 ~~~~~~~~~~~~~~~~e~~~~~w~~~~~l~~~~~~~~~~~~~ 122 (123)
T cd02883 81 FLARLVGGEPTLLPPDEISEVRWVTLDELPALALSPALRPAL 122 (123)
T ss_pred EEEEeCCCCcCCCCCCccceEEEEcHHHCccccccccccccc
Confidence 9998765422 344667899999999999998887765
No 49
>PLN02325 nudix hydrolase
Probab=99.48 E-value=2e-13 Score=115.81 Aligned_cols=111 Identities=10% Similarity=0.082 Sum_probs=77.7
Q ss_pred EEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEcc----ceEEEE
Q psy9547 241 TVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFS----HLKWNM 313 (363)
Q Consensus 241 ~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~s----h~~~~l 313 (363)
.++|.++ |++||.||.. +...|.|.+|||+++. +..+|+ ||++||+|+++.... .+..+.+.|. ..++..
T Consensus 13 ~~vi~~~-~~vLL~rr~~-~~~~g~W~lPGG~ve~gEs~~~aa~REv~EEtGl~v~~~~-~l~~~~~~~~~~~~~~~~i~ 89 (144)
T PLN02325 13 VVFLLKG-NSVLLGRRRS-SIGDSTFALPGGHLEFGESFEECAAREVKEETGLEIEKIE-LLTVTNNVFLEEPKPSHYVT 89 (144)
T ss_pred EEEEEcC-CEEEEEEecC-CCCCCeEECCceeCCCCCCHHHHHHHHHHHHHCCCCcceE-EEEEecceeecCCCCcEEEE
Confidence 3444454 8999999954 4567999999999999 888899 999999999988776 6655555442 234445
Q ss_pred EEEEEEecCCC--C-----CCCCCCeeecccccCcCCCChHHHHHHHHH
Q psy9547 314 DVYSGTAKEKT--I-----PANKTYKLITETQMKKYAFPVPYQKVWKLF 355 (363)
Q Consensus 314 ~~~~~~~~~~~--~-----~~~~~~~W~~~~el~~~~~~~a~~~il~~l 355 (363)
.+|.+...++. . .++.+++|+++++|.. .+..+.+.+++.+
T Consensus 90 ~~f~~~~~~~~~~~~~~e~~e~~~~~W~~~d~Lp~-~~~~p~~~~~~~~ 137 (144)
T PLN02325 90 VFMRAVLADPSQVPQNLEPEKCYGWDWYEWDNLPE-PLFWPLEKLVGSG 137 (144)
T ss_pred EEEEEEECCCCCCCCcCCchhcCceEEEChHHCCh-hhhHHHHHHHHcC
Confidence 56677654431 1 1246789999999987 4455555555443
No 50
>cd04695 Nudix_Hydrolase_36 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.48 E-value=2.5e-13 Score=113.18 Aligned_cols=109 Identities=12% Similarity=0.086 Sum_probs=82.2
Q ss_pred eCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccc---eEEEEEEEEEE
Q psy9547 246 TDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSH---LKWNMDVYSGT 319 (363)
Q Consensus 246 ~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh---~~~~l~~~~~~ 319 (363)
++++++|+.||+.. +.|.|+||||+++. ++.+|+ ||++||+|+++.... ....+.+.|++ ....+.+|.+.
T Consensus 11 ~~~~~vLl~~r~~~--~~g~w~~PgG~ve~gEs~~~aa~RE~~EEtGl~~~~~~-~~~~~~~~~~~~~~~~~~~~~f~~~ 87 (131)
T cd04695 11 DKETKVLLLKRVKT--LGGFWCHVAGGVEAGETAWQAALRELKEETGISLPELY-NADYLEQFYEANDNRILMAPVFVGF 87 (131)
T ss_pred CCCCEEEEEEecCC--CCCcEECCcccccCCCCHHHHHHHHHHHHhCCCccccc-cccceeeEeecCCceEEEEEEEEEE
Confidence 35589999999755 78999999999999 888999 999999999876443 32333444442 23445667766
Q ss_pred ecCCC----CCCCCCCeeecccccCcCCCChHHHHHHHHHHh
Q psy9547 320 AKEKT----IPANKTYKLITETQMKKYAFPVPYQKVWKLFTK 357 (363)
Q Consensus 320 ~~~~~----~~~~~~~~W~~~~el~~~~~~~a~~~il~~l~~ 357 (363)
...+. ..++.+++|++++++.+....+.++.+++.+..
T Consensus 88 ~~~~~~~~~~~E~~~~~W~~~~e~~~~~~~~~~~~~~~~~~~ 129 (131)
T cd04695 88 VPPHQEVVLNHEHTEYRWCSFAEALELAPFPGQRALYDHVWR 129 (131)
T ss_pred ecCCCccccCchhcccEecCHHHHHHhcCChhHHHHHHHHHh
Confidence 54331 236788999999999999999999999887653
No 51
>cd04688 Nudix_Hydrolase_29 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.47 E-value=1.5e-13 Score=113.62 Aligned_cols=103 Identities=17% Similarity=0.256 Sum_probs=79.6
Q ss_pred EEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEcc--c--eEEE
Q psy9547 240 LTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFS--H--LKWN 312 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~s--h--~~~~ 312 (363)
+.++|+++ ++||+.||.. .+.|+||||+++. ++.+|+ ||+.||+|+++.... .++...|.|+ + .+..
T Consensus 4 v~~vi~~~-~~vLl~~~~~----~~~w~lPgG~ve~gEs~~~aa~RE~~EEtGl~~~~~~-~~~~~~~~~~~~~~~~~~~ 77 (126)
T cd04688 4 AAAIIIHN-GKLLVQKNPD----ETFYRPPGGGIEFGESSEEALIREFKEELGLKIEITR-LLGVVENIFTYNGKPGHEI 77 (126)
T ss_pred EEEEEEEC-CEEEEEEeCC----CCeEECCCccccCCCCHHHHHHHHHHHHhCCceecce-eeEEEEEeeccCCcccEEE
Confidence 55666666 6999999854 6899999999999 888899 999999999987776 6666665443 2 2455
Q ss_pred EEEEEEEecCCCC-----------CCCCCCeeecccccCcCCCChHH
Q psy9547 313 MDVYSGTAKEKTI-----------PANKTYKLITETQMKKYAFPVPY 348 (363)
Q Consensus 313 l~~~~~~~~~~~~-----------~~~~~~~W~~~~el~~~~~~~a~ 348 (363)
.++|.|+..++.. .++.+++|++++++..+++.|.+
T Consensus 78 ~~~f~~~~~~~~~~~~~~~~~~~~~e~~~~~W~~~~~l~~~~~~p~~ 124 (126)
T cd04688 78 EFYYLVTLLDESLYQQDIEILEEEGEKIVFRWIPIDELKEIKLYPEF 124 (126)
T ss_pred EEEEEEEeCCCcccccccceeccCCCEEEEEEeeHHHcccCccCCcc
Confidence 7889898776422 23457899999999999888754
No 52
>cd04671 Nudix_Hydrolase_13 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.47 E-value=1.1e-13 Score=114.23 Aligned_cols=105 Identities=9% Similarity=0.115 Sum_probs=77.7
Q ss_pred EEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEEE
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMDV 315 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~ 315 (363)
++++++.+.+|++|+.+|.. +...|.|+||||+++. ++++|+ ||++||+|+++.... .+. +.+.. .++...+
T Consensus 2 ~~~~vv~~~~~~vLl~~r~~-~~~~~~w~lPgG~ve~gEt~~~aa~REl~EEtG~~~~~~~-~~~-~~~~~--~~~~~~~ 76 (123)
T cd04671 2 IVAAVILNNQGEVLLIQEAK-RSCRGKWYLPAGRMEPGETIEEAVKREVKEETGLDCEPTT-LLS-VEEQG--GSWFRFV 76 (123)
T ss_pred EEEEEEEcCCCEEEEEEecC-CCCCCeEECceeecCCCCCHHHHHHHHHHHHHCCeeecce-EEE-EEccC--CeEEEEE
Confidence 67777787779999999964 4468999999999998 889999 999999999987665 443 22222 3345677
Q ss_pred EEEEecCCCCC-------CCCCCeeecccccCcCCCChHHH
Q psy9547 316 YSGTAKEKTIP-------ANKTYKLITETQMKKYAFPVPYQ 349 (363)
Q Consensus 316 ~~~~~~~~~~~-------~~~~~~W~~~~el~~~~~~~a~~ 349 (363)
|.++..++... +..+++|++++++ ..++.+++.
T Consensus 77 f~a~~~~g~~~~~~~~~~e~~~~~W~~~~el-~~~~~~~~~ 116 (123)
T cd04671 77 FTGNITGGDLKTEKEADSESLQARWYSNKDL-PLPLRAHDI 116 (123)
T ss_pred EEEEEeCCeEccCCCCCcceEEEEEECHHHC-CCccchhee
Confidence 88887665221 2236899999999 567766654
No 53
>cd04687 Nudix_Hydrolase_28 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.47 E-value=2e-13 Score=113.19 Aligned_cols=104 Identities=13% Similarity=0.141 Sum_probs=76.1
Q ss_pred EEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEc--------cc
Q psy9547 240 LTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVF--------SH 308 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~--------sh 308 (363)
+.+||.++ |+||+.||...+ ++.|+||||.++. +.++|+ ||+.||+|+++...+ .+....+.+ ..
T Consensus 4 a~~iv~~~-~~vLl~~r~~~~--~~~~~lPGG~ve~gEt~~~aa~RE~~EEtGl~v~~~~-~~~~~~~~~~~~~~~~~~~ 79 (128)
T cd04687 4 AKAVIIKN-DKILLIKHHDDG--GVWYILPGGGQEPGETLEDAAHRECKEEIGIDVEIGP-LLFVREYIGHNPTSELPGH 79 (128)
T ss_pred EEEEEEEC-CEEEEEEEEcCC--CCeEECCCcccCCCCCHHHHHHHHHHHHHCCccccCc-EEEEEEEeccCccccCCCc
Confidence 45556665 899999996443 5899999999999 888999 999999999987654 333333331 23
Q ss_pred eEEEEEEEEEEecCCCC--------CCCCCCeeecccccCcCCCChH
Q psy9547 309 LKWNMDVYSGTAKEKTI--------PANKTYKLITETQMKKYAFPVP 347 (363)
Q Consensus 309 ~~~~l~~~~~~~~~~~~--------~~~~~~~W~~~~el~~~~~~~a 347 (363)
.+....+|.|+...+.. .+..+++|++++++.++++.|+
T Consensus 80 ~~~i~~~f~~~~~~~~~~~~~~~~~~~~~~~~W~~~~~l~~~~~~p~ 126 (128)
T cd04687 80 FHQVELMFECKIKSGTPAKTPSKPDPNQIGVEWLKLKELGDIPLYPS 126 (128)
T ss_pred eeEEEEEEEEEECCCCcccccCCCCCCEEeeEEEcHHHhCcccccCC
Confidence 45666788888765421 1224789999999999988764
No 54
>cd04678 Nudix_Hydrolase_19 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.46 E-value=5.8e-13 Score=110.41 Aligned_cols=102 Identities=10% Similarity=0.058 Sum_probs=77.6
Q ss_pred EEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEc--cceEEEE
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVF--SHLKWNM 313 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~--sh~~~~l 313 (363)
.+.++|.+.+|++|+.||.. +..+|.|.||||.++. +.++|+ ||++||+|+++.... .+..+.+.+ ++.+...
T Consensus 4 ~v~~ii~~~~~~iLl~~r~~-~~~~~~w~~PGG~ve~gEt~~~Aa~REl~EE~Gl~~~~~~-~~~~~~~~~~~~~~~~~~ 81 (129)
T cd04678 4 GVGVFVLNPKGKVLLGKRKG-SHGAGTWALPGGHLEFGESFEECAAREVLEETGLHIENVQ-FLTVTNDVFEEEGKHYVT 81 (129)
T ss_pred EEEEEEECCCCeEEEEeccC-CCCCCeEECCcccccCCCCHHHHHHHHHHHHhCCcccceE-EEEEEeEEeCCCCcEEEE
Confidence 34455666668999999964 5689999999999999 888999 999999999987765 565555544 3455666
Q ss_pred EEEEEEecCCC-------CCCCCCCeeecccccCcC
Q psy9547 314 DVYSGTAKEKT-------IPANKTYKLITETQMKKY 342 (363)
Q Consensus 314 ~~~~~~~~~~~-------~~~~~~~~W~~~~el~~~ 342 (363)
.+|.|....+. ..++++++|++++++.+.
T Consensus 82 ~~~~~~~~~~~~~~~~~~~~e~~~~~W~~~~~l~~~ 117 (129)
T cd04678 82 IFVKAEVDDGEAEPNKMEPEKCEGWEWFDWEELPSV 117 (129)
T ss_pred EEEEEEeCCCCcccCCCCCceeCceEEeCHHHCCCc
Confidence 77888776532 123467799999999985
No 55
>cd04670 Nudix_Hydrolase_12 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.46 E-value=2.5e-13 Score=112.39 Aligned_cols=110 Identities=10% Similarity=-0.013 Sum_probs=75.3
Q ss_pred EEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCccccccccccc-EEEEccceEEEEE
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGN-VKHVFSHLKWNMD 314 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~-v~h~~sh~~~~l~ 314 (363)
-++++|.+.+++||+.||... +.|.|+||||+++. ++.+++ ||++||+|+.+.... .+.. ..|.++..... .
T Consensus 4 ~~~~~v~~~~~~vLl~~r~~~--~~~~w~~PGG~ve~gEt~~~aa~RE~~EE~Gl~~~~~~-~~~~~~~~~~~~~~~~-~ 79 (127)
T cd04670 4 GVGGLVLNEKNEVLVVQERNK--TPNGWKLPGGLVDPGEDIFDGAVREVLEETGIDTEFVS-VVGFRHAHPGAFGKSD-L 79 (127)
T ss_pred EEEEEEEcCCCeEEEEEccCC--CCCcEECCCccCCCCCCHHHHHHHHHHHHHCCCcceeE-EEEEEecCCCCcCcee-E
Confidence 345566777789999998554 78999999999998 888899 999999999886544 3321 12222211122 2
Q ss_pred EEEEEecC--CC----CCCCCCCeeecccccCcCCCChHHHHHH
Q psy9547 315 VYSGTAKE--KT----IPANKTYKLITETQMKKYAFPVPYQKVW 352 (363)
Q Consensus 315 ~~~~~~~~--~~----~~~~~~~~W~~~~el~~~~~~~a~~~il 352 (363)
+|.+.... .. ..+..+.+|++++++.+.+++......+
T Consensus 80 ~~~~~~~~~~~~~~~~~~E~~~~~w~~~~el~~~~~~~~~~~~~ 123 (127)
T cd04670 80 YFICRLKPLSFDINFDTSEIAAAKWMPLEEYISQPITSEVNRLI 123 (127)
T ss_pred EEEEEEccCcCcCCCChhhhheeEEEcHHHHhcchhHHHHHHHH
Confidence 24444432 11 1245678999999999999988865544
No 56
>cd03675 Nudix_Hydrolase_2 Contains a crystal structure of the Nudix hydrolase from Nitrosomonas europaea, which has an unknown function. In general, members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity. They also contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which forms a structural motif that functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability,
Probab=99.46 E-value=3.7e-13 Score=112.33 Aligned_cols=114 Identities=10% Similarity=0.030 Sum_probs=81.9
Q ss_pred EEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEcc--ceEEEEEE
Q psy9547 241 TVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFS--HLKWNMDV 315 (363)
Q Consensus 241 ~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~s--h~~~~l~~ 315 (363)
+++|.+ ++++|+.||... -++.|+||||++++ ++.+|+ ||++||+|+++.... .+....+.++ ...+...+
T Consensus 4 ~~ii~~-~~~vLlv~r~~~--~~~~w~~PgG~ve~gEs~~~aa~REl~EEtGl~~~~~~-~~~~~~~~~~~~~~~~~~~~ 79 (134)
T cd03675 4 AAVVER-DGRFLLVEEETD--GGLVFNQPAGHLEPGESLIEAAVRETLEETGWHVEPTA-LLGIYQWTAPDSDTTYLRFA 79 (134)
T ss_pred EEEEEE-CCEEEEEEEccC--CCceEECCCccCCCCCCHHHHHHHHHHHHHCcccccce-EEEEEEeecCCCCeeEEEEE
Confidence 344444 489999998554 57899999999998 888999 999999999986655 4444433333 34455567
Q ss_pred EEEEecCCCC-----CCCCCCeeecccccCcCC---CChHHHHHHHHHHhh
Q psy9547 316 YSGTAKEKTI-----PANKTYKLITETQMKKYA---FPVPYQKVWKLFTKS 358 (363)
Q Consensus 316 ~~~~~~~~~~-----~~~~~~~W~~~~el~~~~---~~~a~~~il~~l~~~ 358 (363)
|.+....+.. .+..+..|++++++..+. .++...++++.+...
T Consensus 80 f~~~~~~~~~~~~~~~e~~~~~w~~~~el~~~~~~~~~~~~~~~i~~~l~~ 130 (134)
T cd03675 80 FAAELLEHLPDQPLDSGIVRAHWLTLEEILALAARLRSPLVLRCIEDYLAG 130 (134)
T ss_pred EEEEECCCCCCCCCCCCceeeEEEeHHHHHhhhhhhcCchHHHHHHHHHhc
Confidence 7777665321 245678899999999876 377788887776543
No 57
>cd04690 Nudix_Hydrolase_31 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.46 E-value=2.3e-13 Score=110.95 Aligned_cols=105 Identities=10% Similarity=0.135 Sum_probs=78.5
Q ss_pred EEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccc--cccccccEEEE---ccceE
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNL--NEKCLGNVKHV---FSHLK 310 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~--~~~~l~~v~h~---~sh~~ 310 (363)
++++++.+.+|++|+.||.. .|.|+||||.+++ +.++++ ||++||+|+.+.. .. .+..+.+. .++.+
T Consensus 2 ~~~~~v~~~~~~vLl~~r~~----~~~w~~PgG~ve~~Es~~~aa~REl~EEtGl~~~~~~~~-~~~~~~~~~~~~~~~~ 76 (118)
T cd04690 2 IAAALILVRDGRVLLVRKRG----TDVFYLPGGKIEAGETPLQALIRELSEELGLDLDPDSLE-YLGTFRAPAANEPGVD 76 (118)
T ss_pred eEEEEEEecCCeEEEEEECC----CCcEECCCCccCCCCCHHHHHHHHHHHHHCCccChhheE-EEEEEecccccCCCcE
Confidence 35556666668999998853 4789999999999 788888 9999999998766 44 45555442 23346
Q ss_pred EEEEEEEEEecCC--CCCCCCCCeeecccccCcCCCChHH
Q psy9547 311 WNMDVYSGTAKEK--TIPANKTYKLITETQMKKYAFPVPY 348 (363)
Q Consensus 311 ~~l~~~~~~~~~~--~~~~~~~~~W~~~~el~~~~~~~a~ 348 (363)
+.+++|.+..... ...+..+++|++++++...++++.+
T Consensus 77 ~~~~~f~~~~~~~~~~~~e~~~~~W~~~~e~~~~~~~~~~ 116 (118)
T cd04690 77 VRATVYVAELTGEPVPAAEIEEIRWVDYDDPADDRLAPLL 116 (118)
T ss_pred EEEEEEEEcccCCcCCCchhhccEEecHHHccccccCccc
Confidence 7788888887542 1135677899999999988888765
No 58
>cd04511 Nudix_Hydrolase_4 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, U=I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate specifici
Probab=99.45 E-value=4.4e-13 Score=111.54 Aligned_cols=102 Identities=16% Similarity=0.192 Sum_probs=75.6
Q ss_pred EEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEEEE
Q psy9547 240 LTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMDVY 316 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~~ 316 (363)
+.++|.++ +++|+.||... ...|.|+||||+++. ++++++ ||++||+|+++.... .+.. +.+++.+..+++|
T Consensus 16 v~~ii~~~-~~vLL~kr~~~-~~~g~w~lPgG~ve~gE~~~~a~~REl~EEtGl~~~~~~-~~~~--~~~~~~~~~~~~f 90 (130)
T cd04511 16 VGCVPEWE-GKVLLCRRAIE-PRHGFWTLPAGFMENGETTEQGALRETWEEAGARVEIDG-LYAV--YSVPHISQVYMFY 90 (130)
T ss_pred EEEEEecC-CEEEEEEecCC-CCCCeEECCcccccCCCCHHHHHHHHHHHHhCCEEEeee-EEEE--EecCCceEEEEEE
Confidence 34455555 89999999654 478999999999998 788888 999999999986654 4432 3345555667888
Q ss_pred EEEecCCCC---CCCCCCeeecccccC--cCCCCh
Q psy9547 317 SGTAKEKTI---PANKTYKLITETQMK--KYAFPV 346 (363)
Q Consensus 317 ~~~~~~~~~---~~~~~~~W~~~~el~--~~~~~~ 346 (363)
.++...+.. .++.+.+|+++++|. ++.+++
T Consensus 91 ~~~~~~~~~~~~~e~~~~~~~~~~~l~~~~l~~~~ 125 (130)
T cd04511 91 RARLLDLDFAPGPESLEVRLFTEEEIPWDELAFPT 125 (130)
T ss_pred EEEEcCCcccCCcchhceEEECHHHCCchhccccc
Confidence 998876422 356778999999996 455543
No 59
>PRK15393 NUDIX hydrolase YfcD; Provisional
Probab=99.44 E-value=4.6e-13 Score=117.76 Aligned_cols=120 Identities=13% Similarity=0.032 Sum_probs=86.0
Q ss_pred eEEEEEEEeCCceEEEeecCCCc-ccccccc-ccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEE
Q psy9547 238 FLLTVVVKTDTNKYLIQKRPTTG-LLSNFYM-FLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWN 312 (363)
Q Consensus 238 ~v~~~ii~~~~~~vLl~~R~~~~-~~~glWE-FPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~ 312 (363)
..++++|.+.+|++|+.+|...+ .++|.|+ ||||++++ ++.+|+ ||+.||+|+.+.... .++.+.+......+.
T Consensus 38 ~~~~v~v~~~~g~iLL~~R~~~~~~~pg~~~~~pGG~ve~GEs~~eAA~REL~EEtGl~~~~~~-~~~~~~~~~~~~~~~ 116 (180)
T PRK15393 38 RATYIVVHDGMGKILVQRRTETKDFLPGMLDATAGGVVQAGEQLLESARREAEEELGIAGVPFA-EHGQFYFEDENCRVW 116 (180)
T ss_pred EEEEEEEECCCCeEEEEEeCCCCCCCCCcccccCCCcCCCCCCHHHHHHHHHHHHHCCCCccce-eceeEEecCCCceEE
Confidence 45556667777999999997654 5678885 89999999 888999 999999999865544 555555555555555
Q ss_pred EEEEEEEecCC---CCCCCCCCeeecccccCcC--CCChHHHHHHHHHHhh
Q psy9547 313 MDVYSGTAKEK---TIPANKTYKLITETQMKKY--AFPVPYQKVWKLFTKS 358 (363)
Q Consensus 313 l~~~~~~~~~~---~~~~~~~~~W~~~~el~~~--~~~~a~~~il~~l~~~ 358 (363)
.++|.+..... ...++.++.|++++++.+. .|++.....+..+...
T Consensus 117 ~~~f~~~~~~~~~~~~~E~~~~~W~~~~el~~~~~~~~~~~~~~l~~~l~~ 167 (180)
T PRK15393 117 GALFSCVSHGPFALQEEEVSEVCWMTPEEITARCDEFTPDSLKALALWLTR 167 (180)
T ss_pred EEEEEEEeCCCCCCChHHeeEEEECCHHHHhhhhhhcCccHHHHHHHHHHh
Confidence 66777655321 1235678899999999986 6876666666555433
No 60
>cd03674 Nudix_Hydrolase_1 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity. They also contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, U=I, L or V), which forms a structural motif that functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamil
Probab=99.44 E-value=1.1e-12 Score=110.27 Aligned_cols=111 Identities=11% Similarity=0.050 Sum_probs=79.7
Q ss_pred EEEEEEeCC-ceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccc----cEEEE-c----
Q psy9547 240 LTVVVKTDT-NKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLG----NVKHV-F---- 306 (363)
Q Consensus 240 ~~~ii~~~~-~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~----~v~h~-~---- 306 (363)
+.++|.+++ +++|+.||.. .|.|.+|||++++ ++++|+ ||++||+|+++.... ... ...|. +
T Consensus 5 ~~~~v~~~~~~~vLLv~r~~----~~~w~lPgG~ve~gE~~~~aa~REl~EEtGl~~~~~~-~~~~~~~~~~~~~~~~~~ 79 (138)
T cd03674 5 ASAFVVNPDRGKVLLTHHRK----LGSWLQPGGHIDPDESLLEAALRELREETGIELLGLR-PLSVLVDLDVHPIDGHPK 79 (138)
T ss_pred EEEEEEeCCCCeEEEEEEcC----CCcEECCceecCCCCCHHHHHHHHHHHHHCCCcccce-eccccccceeEeecCCCC
Confidence 445556665 8999999854 4899999999999 889999 999999999865443 322 11221 1
Q ss_pred ----cceEEEEEEEEEEecCCCC-----CCCCCCeeecccccCcCCCChHHHHHHHHHH
Q psy9547 307 ----SHLKWNMDVYSGTAKEKTI-----PANKTYKLITETQMKKYAFPVPYQKVWKLFT 356 (363)
Q Consensus 307 ----sh~~~~l~~~~~~~~~~~~-----~~~~~~~W~~~~el~~~~~~~a~~~il~~l~ 356 (363)
.+.++ ..+|.+....+.. .+..+++|++++++..+++++.++.+++.+.
T Consensus 80 ~~~~~~~~~-~~~y~~~~~~~~~~~~~~~E~~~~~W~~~~el~~~~~~~~~~~~i~~~~ 137 (138)
T cd03674 80 RGVPGHLHL-DLRFLAVAPADDVAPPKSDESDAVRWFPLDELASLELPEDVRRLVEKAL 137 (138)
T ss_pred CCCCCcEEE-EEEEEEEccCccccCCCCCcccccEEEcHHHhhhccCCHHHHHHHHHHh
Confidence 12233 3457777654322 2557889999999999999999999987653
No 61
>cd04667 Nudix_Hydrolase_10 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.44 E-value=3.1e-13 Score=109.37 Aligned_cols=101 Identities=6% Similarity=-0.024 Sum_probs=74.1
Q ss_pred EEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEEEEEE
Q psy9547 242 VVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMDVYSG 318 (363)
Q Consensus 242 ~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~~~~ 318 (363)
++|...++++|+.||.. |.|+||||.+++ ++++|+ ||+.||+|+++.... .+.. |.+.+...++|.+
T Consensus 4 ~~i~~~~~~vLlv~r~~-----~~w~~PgG~ve~gE~~~~aa~REl~EEtGl~~~~~~-~~~~----~~~~~~~~~~f~~ 73 (112)
T cd04667 4 TVICRRGGRVLLVRKSG-----SRWALPGGKIEPGETPLQAARRELQEETGLQGLDLL-YLFH----VDGGSTRHHVFVA 73 (112)
T ss_pred EEEEecCCEEEEEEcCC-----CcEeCCCCcCCCCCCHHHHHHHHHHHHhCCcccceE-EEEE----EeCCCEEEEEEEE
Confidence 34444458999999953 899999999998 788889 999999999865443 3322 2222334467777
Q ss_pred EecCC-C---CCCCCCCeeecccccCcCCCChHHHHHH
Q psy9547 319 TAKEK-T---IPANKTYKLITETQMKKYAFPVPYQKVW 352 (363)
Q Consensus 319 ~~~~~-~---~~~~~~~~W~~~~el~~~~~~~a~~~il 352 (363)
....+ . ..+..+++|++++++.++.+++++++|.
T Consensus 74 ~~~~~~~~~~~~e~~~~~W~~~~el~~~~~~~~~~~~~ 111 (112)
T cd04667 74 SVPPSAQPKPSNEIADCRWLSLDALGDLNASAATRLIV 111 (112)
T ss_pred EcCCcCCCCCchheeEEEEecHHHhhhcccchhhhhhc
Confidence 76543 1 1345678999999999999999998874
No 62
>cd04700 DR1025_like DR1025 from Deinococcus radiodurans, a member of the Nudix hydrolase superfamily, show nucleoside triphosphatase and dinucleoside polyphosphate pyrophosphatase activities. Like other enzymes belonging to this superfamily, it requires a divalent cation, in this case Mg2+, for its activity. It also contains a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. In general, substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is us
Probab=99.42 E-value=6.1e-13 Score=112.51 Aligned_cols=103 Identities=11% Similarity=0.057 Sum_probs=76.0
Q ss_pred eEEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceE-EEE
Q psy9547 238 FLLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLK-WNM 313 (363)
Q Consensus 238 ~v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~-~~l 313 (363)
..++++|.+.++++||.||. .+...|.|+||||.+++ ++.+|+ ||++||+|+++.... .+..+.+.+++.. ...
T Consensus 14 ~av~~vv~~~~~~vLL~~r~-~~~~~~~w~lPgG~ve~gEt~~~aa~REl~EEtGl~~~~~~-~~~~~~~~~~~~~~~~~ 91 (142)
T cd04700 14 RAAGAVILNERNDVLLVQEK-GGPKKGLWHIPSGAVEDGEFPQDAAVREACEETGLRVRPVK-FLGTYLGRFDDGVLVLR 91 (142)
T ss_pred eeEEEEEEeCCCcEEEEEEc-CCCCCCeEECCceecCCCCCHHHHHHHHHHHhhCceeeccE-EEEEEEEEcCCCcEEEE
Confidence 34445666766899998874 44568999999999998 888999 999999999987766 6666665565433 334
Q ss_pred EEEEEEecCC--CC---CCCCCCeeecccccCcC
Q psy9547 314 DVYSGTAKEK--TI---PANKTYKLITETQMKKY 342 (363)
Q Consensus 314 ~~~~~~~~~~--~~---~~~~~~~W~~~~el~~~ 342 (363)
++|.++...+ .+ .+..++.|++++++.+.
T Consensus 92 ~~f~~~~~~~~~~~~~~~E~~~~~w~~~~el~~~ 125 (142)
T cd04700 92 HVWLAEPEGQTLAPKFTDEIAEASFFSREDVAQL 125 (142)
T ss_pred EEEEEEecCCccccCCCCCEEEEEEECHHHhhhc
Confidence 6787876443 11 35567899999998764
No 63
>cd04697 Nudix_Hydrolase_38 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.40 E-value=6.6e-13 Score=109.88 Aligned_cols=101 Identities=13% Similarity=0.050 Sum_probs=74.7
Q ss_pred EEEEEEeCCceEEEeecCCCc-cccccccc-cccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEE
Q psy9547 240 LTVVVKTDTNKYLIQKRPTTG-LLSNFYMF-LSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMD 314 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~~-~~~glWEF-Pggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~ 314 (363)
+.++|.+.+|++|+.+|...+ .+.|+|+| |||+++. ++++++ ||+.||+|+++.... .++++.+......+..+
T Consensus 3 ~~v~i~~~~~~iLl~~R~~~~~~~~g~w~~~~GG~ve~gE~~~~aa~REl~EEtGl~~~~l~-~~~~~~~~~~~~~~~~~ 81 (126)
T cd04697 3 TYIFVFNSEGKLCVHKRTLTKDWCPGYWDIAFGGVVQAGESYLQNAQRELEEELGIDGVQLT-PLGLFYYDTDGNRVWGK 81 (126)
T ss_pred EEEEEEcCCCeEEEEECCCCCCCCCCcccCcCCcccCCCCCHHHHHHHHHHHHHCCCccccE-EeeEEEecCCCceEEEE
Confidence 456677777999999997765 57999999 6999988 788888 999999999886444 55555554443455567
Q ss_pred EEEEEecCC-C--CCCCCCCeeecccccCc
Q psy9547 315 VYSGTAKEK-T--IPANKTYKLITETQMKK 341 (363)
Q Consensus 315 ~~~~~~~~~-~--~~~~~~~~W~~~~el~~ 341 (363)
+|.+..... . ..+..+++|++++++.+
T Consensus 82 ~f~~~~~~~~~~~~~E~~~~~w~~~~el~~ 111 (126)
T cd04697 82 VFSCVYDGPLKLQEEEVEEITWLSINEILQ 111 (126)
T ss_pred EEEEEECCCCCCCHhHhhheEEcCHHHHHH
Confidence 777765432 1 23567889999999875
No 64
>cd04672 Nudix_Hydrolase_14 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.39 E-value=1.2e-12 Score=107.79 Aligned_cols=104 Identities=7% Similarity=0.010 Sum_probs=75.5
Q ss_pred EEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEE---Ec--cceEE
Q psy9547 240 LTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKH---VF--SHLKW 311 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h---~~--sh~~~ 311 (363)
++++|.++ |++|+.||.. .|.|+||||+++. ++++|+ ||++||+|+.+.... .++...+ .+ ...++
T Consensus 5 v~~~i~~~-~~vLL~~~~~----~~~w~~PGG~ve~gEs~~~aa~REl~EEtG~~~~~~~-~~~~~~~~~~~~~~~~~~~ 78 (123)
T cd04672 5 VRAAIFKD-GKILLVREKS----DGLWSLPGGWADVGLSPAENVVKEVKEETGLDVKVRK-LAAVDDRNKHHPPPQPYQV 78 (123)
T ss_pred EEEEEEEC-CEEEEEEEcC----CCcEeCCccccCCCCCHHHHHHHHHHHHhCCeeeEeE-EEEEeccccccCCCCceEE
Confidence 44555666 8999999854 6999999999999 888999 999999999885544 3333222 22 23345
Q ss_pred EEEEEEEEecCCCC---CCCCCCeeecccccCcCCCChHHH
Q psy9547 312 NMDVYSGTAKEKTI---PANKTYKLITETQMKKYAFPVPYQ 349 (363)
Q Consensus 312 ~l~~~~~~~~~~~~---~~~~~~~W~~~~el~~~~~~~a~~ 349 (363)
...+|.|....+.. .+..++.|++++++.++.+|..+.
T Consensus 79 ~~~~f~~~~~~~~~~~~~E~~~~~W~~~~el~~l~~~~~~~ 119 (123)
T cd04672 79 YKLFFLCEILGGEFKPNIETSEVGFFALDDLPPLSEKRNTE 119 (123)
T ss_pred EEEEEEEEecCCcccCCCceeeeEEECHHHCcccccCCcch
Confidence 55678888765432 345678899999999998887654
No 65
>cd03426 CoAse Coenzyme A pyrophosphatase (CoAse), a member of the Nudix hydrolase superfamily, functions to catalyze the elimination of oxidized inactive CoA, which can inhibit CoA-utilizing enzymes. The need of CoAses mainly arises under conditions of oxidative stress. CoAse has a conserved Nudix fold and requires a single divalent cation for catalysis. In addition to a signature Nudix motif G[X5]E[X7]REUXEEXGU, where U is Ile, Leu, or Val, CoAse contains an additional motif upstream called the NuCoA motif (LLTXT(SA)X3RX3GX3FPGG) which is postulated to be involved in CoA recognition. CoA plays a central role in lipid metabolism. It is involved in the initial steps of fatty acid sythesis in the cytosol, in the oxidation of fatty acids and the citric acid cycle in the mitochondria, and in the oxidation of long-chain fatty acids in peroxisomes. CoA has the important role of activating fatty acids for further modification into key biological signalling molecules.
Probab=99.38 E-value=6.8e-13 Score=114.15 Aligned_cols=103 Identities=13% Similarity=0.067 Sum_probs=72.7
Q ss_pred EEEEeCC--ceEEEeecCCCc-cccccccccccCCCC---CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEE
Q psy9547 242 VVVKTDT--NKYLIQKRPTTG-LLSNFYMFLSFESDT---PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMD 314 (363)
Q Consensus 242 ~ii~~~~--~~vLl~~R~~~~-~~~glWEFPggk~e~---~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~ 314 (363)
.++.+.+ ++||+.||...+ .++|.|+||||+++. ++++|+ ||++||+|+++.... .+..+.+.++.....++
T Consensus 7 v~l~~~~~~~~vLL~~R~~~~~~~~g~w~lPGG~ve~gdEs~~eaa~REl~EEtGl~~~~~~-~l~~~~~~~~~~~~~v~ 85 (157)
T cd03426 7 VLLVEREGELRVLLTKRASHLRSHPGQVAFPGGKVDPGDEDPVATALREAEEEIGLPPDSVE-VLGRLPPYYTRSGFVVT 85 (157)
T ss_pred EEEEeCCCceEEEEEEcccccccCCCcEECCCCCcCCCcCCHHHHHHHHHHHHhCCCccceE-EEEECCCccccCCCEEE
Confidence 3444443 589999997764 589999999999987 578888 999999999887655 55544433333344556
Q ss_pred EEEEEecCC---C--CCCCCCCeeecccccCcCCCC
Q psy9547 315 VYSGTAKEK---T--IPANKTYKLITETQMKKYAFP 345 (363)
Q Consensus 315 ~~~~~~~~~---~--~~~~~~~~W~~~~el~~~~~~ 345 (363)
+|.+....+ . ..+.+++.|++++++.+..++
T Consensus 86 ~~~~~~~~~~~~~~~~~E~~~~~W~~~~el~~~~~~ 121 (157)
T cd03426 86 PVVGLVPPPLPLVLNPDEVAEVFEVPLSFLLDPANP 121 (157)
T ss_pred EEEEEECCCCCCCCCHHHhheeEEEcHHHHhCcCCc
Confidence 666655432 1 225678899999999887543
No 66
>cd04693 Nudix_Hydrolase_34 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.37 E-value=1.4e-12 Score=107.91 Aligned_cols=101 Identities=10% Similarity=0.079 Sum_probs=67.3
Q ss_pred EEEEEEEeCCceEEEeecCCCc-ccccccccc-ccCCCC--CchHHH-HHHHhhCCCccccccc-ccccEEEEccceEEE
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTG-LLSNFYMFL-SFESDT--PYTSQV-EFLSENLPFKVNLNEK-CLGNVKHVFSHLKWN 312 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~-~~~glWEFP-ggk~e~--~~~~al-rEl~Eelgl~v~~~~~-~l~~v~h~~sh~~~~ 312 (363)
++.+++.+.+|+||+.||...+ .+.|.|+|| ||+++. ++ +|+ ||++||+|+.+..... .+..+.+... .+..
T Consensus 2 ~v~v~~~~~~g~vLl~~R~~~~~~~pg~w~~p~GG~ve~gE~~-~aa~REl~EEtGl~~~~~~~~~~~~~~~~~~-~~~~ 79 (127)
T cd04693 2 VVHVCIFNSKGELLLQKRSPNKDGWPGMWDLSVGGHVQAGETS-TAAEREVKEELGLELDFSELRPLFRYFFEAE-GFDD 79 (127)
T ss_pred eEEEEEEeCCCeEEEEEccCCCCCCCCcccccCCCcCCCCCCH-HHHHHHHHHHhCCCcChhhcEEEEEEEeecC-CeEE
Confidence 3455666766899999998654 578999998 999998 77 888 9999999998764330 2233333221 2222
Q ss_pred EEEEEEEecCCC----CCCCCCCeeecccccCc
Q psy9547 313 MDVYSGTAKEKT----IPANKTYKLITETQMKK 341 (363)
Q Consensus 313 l~~~~~~~~~~~----~~~~~~~~W~~~~el~~ 341 (363)
+.+|.+....+. ..+..+++|++++|+..
T Consensus 80 ~~~~~~~~~~~~~~~~~~E~~~~~w~~~~el~~ 112 (127)
T cd04693 80 YYLFYADVEIGKLILQKEEVDEVKFVSKDEIDG 112 (127)
T ss_pred EEEEEecCcccccccCHHHhhhEEEeCHHHHHH
Confidence 334444433221 13567889999998864
No 67
>cd04661 MRP_L46 Mitochondrial ribosomal protein L46 (MRP L46) is a component of the large subunit (39S) of the mammalian mitochondrial ribosome and a member of the Nudix hydrolase superfamily. MRPs are thought to be involved in the maintenance of the mitochondrial DNA. In general, members of the Nudix superfamily require a divalent cation, such as Mg2+ or Mn2+, for activity and contain the Nudix motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. MRP L46 appears to contain a modified nudix motif.
Probab=99.37 E-value=1.1e-12 Score=109.43 Aligned_cols=93 Identities=10% Similarity=0.145 Sum_probs=69.2
Q ss_pred CceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccc---cccccEEEEccc-------eEEEEE
Q psy9547 248 TNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNE---KCLGNVKHVFSH-------LKWNMD 314 (363)
Q Consensus 248 ~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~---~~l~~v~h~~sh-------~~~~l~ 314 (363)
+++||+.||..++ +|.|+||||++|+ |+.+|+ ||++||+|+.+...- ..+..+.+.++. ....++
T Consensus 12 ~~~~Llvk~~~~~--~g~W~fPgG~ve~gEt~~eaa~REl~EEtGl~v~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (132)
T cd04661 12 DTLVLLVQQKVGS--QNHWILPQGKREEGETLRQTAERTLKELCGNNLKAKFYGNAPVGFYKYKYPKAVRNEGIVGAKVF 89 (132)
T ss_pred CcEEEEEEeecCC--CCeeECCcccccCCCCHHHHHHHHHHHhhCCCceEEEEEecCcEEEEEecCcccccccCcccEEE
Confidence 3789999996543 7999999999999 899999 999999999766421 022333444432 224678
Q ss_pred EEEEEecCCCC---CCCCCCeeecccccCcC
Q psy9547 315 VYSGTAKEKTI---PANKTYKLITETQMKKY 342 (363)
Q Consensus 315 ~~~~~~~~~~~---~~~~~~~W~~~~el~~~ 342 (363)
+|.+...+|.+ .++.+++|++++|+.++
T Consensus 90 ~f~~~~~~g~~~~~~e~~~~~W~~~~el~~~ 120 (132)
T cd04661 90 FFKARYMSGQFELSQNQVDFKWLAKEELQKY 120 (132)
T ss_pred EEEEEEecCccccCCCcceeEecCHHHHHhh
Confidence 89998877633 36788999999999874
No 68
>cd03429 NADH_pyrophosphatase NADH pyrophosphatase, a member of the Nudix hydrolase superfamily, catalyzes the cleavage of NADH into reduced nicotinamide mononucleotide (NMNH) and AMP. Like other members of the Nudix family, it requires a divalent cation, such as Mg2+ or Mn2+, for activity. Members of this family are also recognized by the Nudix motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), that functions as a metal binding and catalytic site. A block of 8 conserved amino acids downstream of the nudix motif is thought to give NADH pyrophosphatase its specificity for NADH. NADH pyrophosphatase forms a dimer.
Probab=99.36 E-value=2.1e-12 Score=107.74 Aligned_cols=112 Identities=13% Similarity=0.184 Sum_probs=79.1
Q ss_pred EEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEEEE
Q psy9547 240 LTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMDVY 316 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~~ 316 (363)
+.+.|.++++++|+.||... -.|.|+||||+++. +.++|+ ||++||+|+++.... .++...+.+.+ ..+.+|
T Consensus 3 v~i~l~~~~~~vLL~~r~~~--~~~~w~lPgG~ie~gEt~~~aA~REl~EEtGl~~~~~~-~l~~~~~~~~~--~~~~~f 77 (131)
T cd03429 3 VIVLVIDGGDRILLARQPRF--PPGMYSLLAGFVEPGESLEEAVRREVKEEVGIRVKNIR-YVGSQPWPFPS--SLMLGF 77 (131)
T ss_pred EEEEEEeCCCEEEEEEecCC--CCCcCcCCcccccCCCCHHHHHhhhhhhccCceeeeeE-EEeecCCCCCc--eEEEEE
Confidence 34445566589999999543 27999999999998 788888 999999999886555 55443333333 335667
Q ss_pred EEEecCCCC----CCCCCCeeecccccCc----CCCChHHHHHHHHHH
Q psy9547 317 SGTAKEKTI----PANKTYKLITETQMKK----YAFPVPYQKVWKLFT 356 (363)
Q Consensus 317 ~~~~~~~~~----~~~~~~~W~~~~el~~----~~~~~a~~~il~~l~ 356 (363)
.+....+.. .+..++.|++++++.+ ++..++..++.+.+.
T Consensus 78 ~~~~~~~~~~~~~~E~~~~~w~~~~el~~~~~~~~~~~~~~~~~~~~~ 125 (131)
T cd03429 78 TAEADSGEIVVDDDELEDARWFSRDEVRAAGEGLPELPPPGSIARQLI 125 (131)
T ss_pred EEEEcCCcccCCchhhhccEeecHHHHhhcccCCccCCCcchHHHHHH
Confidence 777664422 3557789999999887 566666666655544
No 69
>cd03671 Ap4A_hydrolase_plant_like Diadenosine tetraphosphate (Ap4A) hydrolase is a member of the Nudix hydrolase superfamily. Members of this family are well represented in a variety of prokaryotic and eukaryotic organisms. Phylogenetic analysis reveals two distinct subgroups where plant enzymes fall into one group (represented by this subfamily) and fungi/animals/archaea enzymes fall into another. Bacterial enzymes are found in both subfamilies. Ap4A is a potential by-product of aminoacyl tRNA synthesis, and accumulation of Ap4A has been implicated in a range of biological events, such as DNA replication, cellular differentiation, heat shock, metabolic stress, and apoptosis. Ap4A hydrolase cleaves Ap4A asymmetrically into ATP and AMP. It is important in the invasive properties of bacteria and thus presents a potential target for the inhibition of such invasive bacteria. Besides the signature nudix motif (G[X5]E[X7]REUXEEXGU where U is Ile, Leu, or Val), Ap4A hydrolase is structurally
Probab=99.35 E-value=4.9e-12 Score=107.55 Aligned_cols=114 Identities=12% Similarity=0.130 Sum_probs=77.4
Q ss_pred EEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccE----EEEccc---
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNV----KHVFSH--- 308 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v----~h~~sh--- 308 (363)
.+.++|.+.+++|||.||...+ |.|+||||.+++ +..+++ ||++||+|+++.... .+... ++.++.
T Consensus 5 ~v~~ii~~~~~~vLL~~r~~~~---~~W~~PgG~~e~gE~~~~aA~REv~EEtGl~~~~~~-~l~~~~~~~~y~~~~~~~ 80 (147)
T cd03671 5 NVGVVLFNEDGKVFVGRRIDTP---GAWQFPQGGIDEGEDPEQAALRELEEETGLDPDSVE-IIAEIPDWLRYDLPPELK 80 (147)
T ss_pred eEEEEEEeCCCEEEEEEEcCCC---CCEECCcCCCCCCcCHHHHHHHHHHHHHCCCcCceE-EEEEcCCeeEeeChhhhh
Confidence 5566677777899999997665 999999999998 788888 999999999875544 33322 222220
Q ss_pred --------eEEEEEEEEEEecC--C--CC-----CCCCCCeeecccccCcCCCChHHHHHHHHHHh
Q psy9547 309 --------LKWNMDVYSGTAKE--K--TI-----PANKTYKLITETQMKKYAFPVPYQKVWKLFTK 357 (363)
Q Consensus 309 --------~~~~l~~~~~~~~~--~--~~-----~~~~~~~W~~~~el~~~~~~~a~~~il~~l~~ 357 (363)
.....++|.+.... + .. .+..+++|++++|+.++..+ -.+.|+.++.+
T Consensus 81 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~l~~~~~~E~~~~~W~~~~el~~~~~~-~~~~~~~~~~~ 145 (147)
T cd03671 81 LKIWGGRYRGQEQKWFLFRFTGDDSEIDLNAPEHPEFDEWRWVPLEELPDLIVP-FKRPVYEAVLK 145 (147)
T ss_pred ccccCCcCCCEEEEEEEEEecCCCccccCCCCCCCCEeeEEeCCHHHHHHhchh-hhHHHHHHHHH
Confidence 01234455554432 1 11 25677899999999987655 55556666543
No 70
>cd03424 ADPRase_NUDT5 ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose and a variety of additional ADP-sugar conjugates to AMP and ribose-5-phosphate. Like other members of the Nudix hydrolase superfamily, it requires a divalent cation, such as Mg2+, for its activity. It also contains a highly conserved 23-residue Nudix motif (GX5EX7REUXEEXGU, where U = I, L or V) which functions as a metal binding site/catalytic site. In addition to the Nudix motif, there are additional conserved amino acid residues, distal from the signature sequence, that correlate with substrate specificity. In humans, there are four distinct ADPRase activities, three putative cytosolic enzymes (ADPRase-I, -II, and -Mn) and a single mitochondrial enzyme (ADPRase-m). Human ADPRase-II is also referred to as NUDT5. It lacks the N-terminal target sequence unique to mitochondrial ADPRase. The different cytosolic types are distinguished by their specificities for substrate and specific requirem
Probab=99.34 E-value=2.2e-12 Score=108.07 Aligned_cols=102 Identities=10% Similarity=0.006 Sum_probs=74.1
Q ss_pred EEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEEE
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMDV 315 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~ 315 (363)
.+.+++.+.++++|+.+|...+..+++|+||||+++. ++++|+ ||+.||+|+++.... .+.++.+........+++
T Consensus 4 ~v~v~~~~~~~~iLl~~~~~~~~~~~~w~~PgG~ve~gEs~~~aa~RE~~EE~Gl~~~~~~-~~~~~~~~~~~~~~~~~~ 82 (137)
T cd03424 4 AVAVLPYDDDGKVVLVRQYRPPVGGWLLELPAGLIDPGEDPEEAARRELEEETGYEAGDLE-KLGSFYPSPGFSDERIHL 82 (137)
T ss_pred EEEEEEEcCCCeEEEEEeeecCCCCEEEEeCCccCCCCCCHHHHHHHHHHHHHCCCccceE-EEeeEecCCcccCccEEE
Confidence 4566667777899998876566678899999999999 788889 999999999986544 555544322222345677
Q ss_pred EEEEecCCC------CCCCCCCeeecccccCc
Q psy9547 316 YSGTAKEKT------IPANKTYKLITETQMKK 341 (363)
Q Consensus 316 ~~~~~~~~~------~~~~~~~~W~~~~el~~ 341 (363)
|.+...... ..+..+++|++++|+..
T Consensus 83 ~~~~~~~~~~~~~~~~~E~~~~~w~~~~el~~ 114 (137)
T cd03424 83 FLAEDLSPGEEGLLDEGEDIEVVLVPLDEALE 114 (137)
T ss_pred EEEEcccccccCCCCCCCeeEEEEecHHHHHH
Confidence 777765431 23557789999999875
No 71
>cd03428 Ap4A_hydrolase_human_like Diadenosine tetraphosphate (Ap4A) hydrolase is a member of the Nudix hydrolase superfamily. Ap4A hydrolases are well represented in a variety of prokaryotic and eukaryotic organisms. Phylogenetic analysis reveals two distinct subgroups where plant enzymes fall into one subfamily and fungi/animals/archaea enzymes, represented by this subfamily, fall into another. Bacterial enzymes are found in both subfamilies. Ap4A is a potential by-product of aminoacyl tRNA synthesis, and accumulation of Ap4A has been implicated in a range of biological events, such as DNA replication, cellular differentiation, heat shock, metabolic stress, and apoptosis. Ap4A hydrolase cleaves Ap4A asymmetrically into ATP and AMP. It is important in the invasive properties of bacteria and thus presents a potential target for inhibition of such invasive bacteria. Besides the signature nudix motif (G[X5]E[X7]REUXEEXGU, where U is Ile, Leu, or Val) that functions as a metal binding and
Probab=99.33 E-value=6.6e-12 Score=104.06 Aligned_cols=110 Identities=7% Similarity=0.017 Sum_probs=79.0
Q ss_pred EEEEEEeCCc---eEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccE----EEEccce
Q psy9547 240 LTVVVKTDTN---KYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNV----KHVFSHL 309 (363)
Q Consensus 240 ~~~ii~~~~~---~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v----~h~~sh~ 309 (363)
+.+||.+.++ +||+.||+. |.|+||||.++. ++.+++ ||++||+|+++.... .+... .+.+.+.
T Consensus 5 ~g~vi~~~~~~~~~vLl~~~~~-----~~w~~PgG~ve~gEs~~~aa~REl~EEtGl~~~~~~-~~~~~~~~~~~~~~~~ 78 (130)
T cd03428 5 AGAIIYRRLNNEIEYLLLQASY-----GHWDFPKGHVEPGEDDLEAALRETEEETGITAEQLF-IVLGFKETLNYQVRGK 78 (130)
T ss_pred EEEEEEEecCCCceEEEEEccC-----CcCcCCcCCCCCCCCHHHHHHHHHHHHHCCChhhhh-hhccceeEEEccccCc
Confidence 3444444333 699999964 899999999998 888888 999999999987665 43222 2222233
Q ss_pred EEEEEEEEEEecCC-C--C-CCCCCCeeecccccCcCCCChHHHHHHHHH
Q psy9547 310 KWNMDVYSGTAKEK-T--I-PANKTYKLITETQMKKYAFPVPYQKVWKLF 355 (363)
Q Consensus 310 ~~~l~~~~~~~~~~-~--~-~~~~~~~W~~~~el~~~~~~~a~~~il~~l 355 (363)
...+++|.+....+ . . .+..++.|++++++.++..++.++.+++..
T Consensus 79 ~~~~~~f~~~~~~~~~~~~~~E~~~~~W~~~~e~~~~~~~~~~~~~~~~~ 128 (130)
T cd03428 79 LKTVTYFLAELRPDVEVKLSEEHQDYRWLPYEEALKLLTYEDLKAVLDKA 128 (130)
T ss_pred ceEEEEEEEEeCCCCccccccceeeEEeecHHHHHHHcCchhHHHHHHHh
Confidence 45567788877632 1 1 356778999999999988888888887654
No 72
>cd04689 Nudix_Hydrolase_30 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U=I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate sp
Probab=99.33 E-value=6.2e-12 Score=103.66 Aligned_cols=105 Identities=12% Similarity=0.199 Sum_probs=73.8
Q ss_pred EEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccc----eEEE
Q psy9547 240 LTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSH----LKWN 312 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh----~~~~ 312 (363)
+.++|.++ |++|+.+|.. .+.|+||||.++. ++.+|+ ||++||+|+.+.... .++.+.+.|.. .+..
T Consensus 4 ~~~vi~~~-~~vLlv~~~~----~~~~~lPGG~ve~gEt~~~aa~REl~EEtGl~~~~~~-~l~~~~~~~~~~~~~~~~~ 77 (125)
T cd04689 4 ARAIVRAG-NKVLLARVIG----QPHYFLPGGHVEPGETAENALRRELQEELGVAVSDGR-FLGAIENQWHEKGVRTHEI 77 (125)
T ss_pred EEEEEEeC-CEEEEEEecC----CCCEECCCCcCCCCCCHHHHHHHHHHHHhCceeeccE-EEEEEeeeeccCCceEEEE
Confidence 44555555 8999999853 3689999999999 888999 999999999987766 66666654421 2334
Q ss_pred EEEEEEEecCCC----C---CCCCCCeeecccccCcCCCChHHHHHH
Q psy9547 313 MDVYSGTAKEKT----I---PANKTYKLITETQMKKYAFPVPYQKVW 352 (363)
Q Consensus 313 l~~~~~~~~~~~----~---~~~~~~~W~~~~el~~~~~~~a~~~il 352 (363)
.++|.+....+. . .+..+++|++++++..+ |....+++
T Consensus 78 ~~~f~~~~~~~~~~~~~~~~~e~~~~~W~~~~el~~~--p~~l~~~~ 122 (125)
T cd04689 78 NHIFAVESSWLASDGPPQADEDHLSFSWVPVSDLSLY--PALLRDLV 122 (125)
T ss_pred EEEEEEEcccccccCCccCccceEEEEEccHHHcccC--cHHHHHHh
Confidence 567777765321 1 13457889999998755 44454444
No 73
>cd04694 Nudix_Hydrolase_35 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.33 E-value=3.2e-12 Score=108.16 Aligned_cols=101 Identities=11% Similarity=0.151 Sum_probs=69.6
Q ss_pred EEEEEEeCCceEEEeecCCCc-cccccccccccCCCC--CchHHH-HHHHhhCCCccccc----ccccccEEEEccc---
Q psy9547 240 LTVVVKTDTNKYLIQKRPTTG-LLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLN----EKCLGNVKHVFSH--- 308 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~~-~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~----~~~l~~v~h~~sh--- 308 (363)
+.+++++.++++|+.||...+ .++|+|+||||.+++ +..+++ ||++||+|+.+... + .++...|.|++
T Consensus 4 v~viv~~~~~~vLl~rr~~~~~~~~g~w~~PgG~v~~~E~~~~aa~RE~~EE~gi~~~~~~~~~~-~l~~~~~~~~~~~~ 82 (143)
T cd04694 4 VAVLLQSSDQKLLLTRRASSLRIFPNVWVPPGGHVELGENLLEAGLRELNEETGLTLDPIDKSWQ-VLGLWESVYPPLLS 82 (143)
T ss_pred EEEEEEcCCCEEEEEEECCCCCCCCCeEECcccccCCCCCHHHHHHHHHHHHHCCCcccccccee-EEeeeccccccccC
Confidence 334446677899999997654 799999999999998 778888 99999999987653 3 45555555542
Q ss_pred ------eEEEEEEEEEEecCC---------CCCCCCCCeeecccccCc
Q psy9547 309 ------LKWNMDVYSGTAKEK---------TIPANKTYKLITETQMKK 341 (363)
Q Consensus 309 ------~~~~l~~~~~~~~~~---------~~~~~~~~~W~~~~el~~ 341 (363)
.++.+.++.+....+ .+.+.++.+|++++++..
T Consensus 83 ~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~Ev~~~~Wv~~~~a~~ 130 (143)
T cd04694 83 RGLPKRHHIVVYILVKSSETHQQLQARLQPDPNEVSAAAWLDKSLAKA 130 (143)
T ss_pred CCcccceeEEEEEEEEeccccccccccccCChhhccceEeeCHHHHHH
Confidence 233334333332110 124678889999988753
No 74
>cd03672 Dcp2p mRNA decapping enzyme 2 (Dcp2p), the catalytic subunit, and Dcp1p are the two components of the decapping enzyme complex. Decapping is a key step in both general and nonsense-mediated 5'-3' mRNA-decay pathways. Dcp2p contains an all-alpha helical N-terminal domain and a C-terminal domain which has the Nudix fold. While decapping is not dependent on the N-terminus of Dcp2p, it does affect its efficiency. Dcp1p binds the N-terminal domain of Dcp2p stimulating the decapping activity of Dcp2p. Decapping permits the degradation of the transcript and is a site of numerous control inputs. It is responsible for nonsense-mediated decay as well as AU-rich element (ARE)-mediated decay. In addition, it may also play a role in the levels of mRNA. Enzymes belonging to the Nudix superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and are recognized by a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V).
Probab=99.32 E-value=9.5e-12 Score=105.58 Aligned_cols=116 Identities=9% Similarity=0.048 Sum_probs=73.6
Q ss_pred EEEEEEEeCC-ceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEE
Q psy9547 239 LLTVVVKTDT-NKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMD 314 (363)
Q Consensus 239 v~~~ii~~~~-~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~ 314 (363)
+++++|.+++ ++||+.||... +.|+||||+++. ++.+|+ ||++||+|+++...........+.+++.. ++
T Consensus 3 ~~gaii~~~~~~~vLLvr~~~~----~~W~lPGG~ve~gEs~~~AA~REl~EETGl~v~~~~~~~~~~~~~~~~~~--~~ 76 (145)
T cd03672 3 VYGAIILNEDLDKVLLVKGWKS----KSWSFPKGKINKDEDDHDCAIREVYEETGFDISKYIDKDDYIELIIRGQN--VK 76 (145)
T ss_pred eeEEEEEeCCCCEEEEEEecCC----CCEECCCccCCCCcCHHHHHHHHHHHhhCccceeccccceeeecccCCcE--EE
Confidence 5667777753 69999998533 489999999998 888999 99999999987542201111122233333 44
Q ss_pred EEEEEecCC----CC---CCCCCCeeecccccCcCCC--------------ChHHHHHHHHHHhhhc
Q psy9547 315 VYSGTAKEK----TI---PANKTYKLITETQMKKYAF--------------PVPYQKVWKLFTKSKG 360 (363)
Q Consensus 315 ~~~~~~~~~----~~---~~~~~~~W~~~~el~~~~~--------------~~a~~~il~~l~~~~~ 360 (363)
+|.+....+ .+ .+..+++|++++++.+..- .+-..+|.+.+...+|
T Consensus 77 ~f~~~~~~~~~~~~~~~~~E~~~~~Wv~~~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 143 (145)
T cd03672 77 LYIVPGVPEDTPFEPKTRKEISKIEWFDIKDLPTKKNKKIPGLNSNKFFMVIPFIKPLKKWINRQKG 143 (145)
T ss_pred EEEEecCCCCcccCcCChhhhheEEEeeHHHhhhhhhhccccccccceEEEhHHHHHHHHHHHHhhc
Confidence 555543221 11 2567889999999986322 3344555555555555
No 75
>PF00293 NUDIX: NUDIX domain; InterPro: IPR000086 The generic name 'NUDIX hydrolases' (NUcleoside DIphosphate linked to some other moiety X) has been coined for this domain family []. The family can be divided into a number of subgroups, of which MutT anti- mutagenic activity represents only one type; most of the rest hydrolyse diverse nucleoside diphosphate derivatives (including ADP-ribose, GDP- mannose, TDP-glucose, NADH, UDP-sugars, dNTP and NTP).; GO: 0016787 hydrolase activity; PDB: 3FJY_A 3MGM_A 2XSQ_A 3COU_A 2O5F_A 1Q27_A 3F6A_A 3E57_B 3SON_B 2GT4_C ....
Probab=99.30 E-value=2e-12 Score=107.08 Aligned_cols=117 Identities=11% Similarity=0.067 Sum_probs=87.4
Q ss_pred EEEEEEEeCCceEEEeecCCCc-cccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccce---EE
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTG-LLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHL---KW 311 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~-~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~---~~ 311 (363)
.+.++|.+.+++||+.||...+ ..+|.|+||||+++. ++.+|+ ||+.||+|+++.... ......+..... ..
T Consensus 4 ~v~~ii~~~~~~vLl~~r~~~~~~~~~~~~~pgG~i~~~E~~~~aa~REl~EE~g~~~~~~~-~~~~~~~~~~~~~~~~~ 82 (134)
T PF00293_consen 4 AVGVIIFNEDGKVLLIKRSRSPITFPGYWELPGGGIEPGESPEEAARRELKEETGLDVSPLE-LLGLFSYPSPSGDPEGE 82 (134)
T ss_dssp EEEEEEEETTTEEEEEEESTTSSSSTTEEESSEEEECTTSHHHHHHHHHHHHHHSEEEEEEE-EEEEEEEEETTTESSEE
T ss_pred EEEEEEEeCCcEEEEEEecCCCCCCCCeEecceeeEEcCCchhhhHHhhhhhcccceecccc-cceeeeecccCCCcccE
Confidence 4455667776799999997665 489999999999999 788888 999999999985544 444455544433 25
Q ss_pred EEEEEEEEecCCC----C--CCCCCCeeecccccCcCCCChHHHHHHHHHH
Q psy9547 312 NMDVYSGTAKEKT----I--PANKTYKLITETQMKKYAFPVPYQKVWKLFT 356 (363)
Q Consensus 312 ~l~~~~~~~~~~~----~--~~~~~~~W~~~~el~~~~~~~a~~~il~~l~ 356 (363)
..++|.+....+. . .+..++.|++++++.++.....+..++..+.
T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~e~~~~~W~~~~el~~~~~~~~~~~~i~~~~ 133 (134)
T PF00293_consen 83 IVIFFIAELPSEQSEIQPQDEEISEVKWVPPDELLELLLNGRIRKIIPWLY 133 (134)
T ss_dssp EEEEEEEEEEEEESECHTTTTTEEEEEEEEHHHHHHHHHTTHHHHHHHHHH
T ss_pred EEEEEEEEEeCCccccCCCCccEEEEEEEEHHHhhhchhCcchhhhhcccc
Confidence 5666666654421 1 2567789999999999999999888887664
No 76
>cd02885 IPP_Isomerase Isopentenyl diphosphate (IPP) isomerase, a member of the Nudix hydrolase superfamily, is a key enzyme in the isoprenoid biosynthetic pathway. Isoprenoids comprise a large family of natural products including sterols, carotenoids, dolichols and prenylated proteins. These compounds are synthesized from two precursors: isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP). IPP isomerase catalyzes the interconversion of IPP and DMAPP by a stereoselective antarafacial transposition of hydrogen. The enzyme requires one Mn2+ or Mg2+ ion in its active site to fold into an active conformation and also contains the Nudix motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), that functions as a metal binding and catalytic site. The metal binding site is present within the active site and plays structural and catalytical roles. IPP isomerase is well represented in several bacteria, archaebacteria and eukaryotes, including fungi, mamm
Probab=99.30 E-value=7.7e-12 Score=108.47 Aligned_cols=104 Identities=13% Similarity=0.113 Sum_probs=74.1
Q ss_pred eEEEEEEEeCCceEEEeecCCCc-ccccccccc-ccCCCC--CchHHH-HHHHhhCCCccccccccc-ccEEEEccc---
Q psy9547 238 FLLTVVVKTDTNKYLIQKRPTTG-LLSNFYMFL-SFESDT--PYTSQV-EFLSENLPFKVNLNEKCL-GNVKHVFSH--- 308 (363)
Q Consensus 238 ~v~~~ii~~~~~~vLl~~R~~~~-~~~glWEFP-ggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l-~~v~h~~sh--- 308 (363)
.+++++|.+++|++|+.||+.++ .+.|.|++| ||.++. ++++|+ ||++||+|+++.... .+ ..+.+..++
T Consensus 31 ~~v~v~i~~~~~~iLl~kR~~~~~~~Pg~w~~~~gG~ie~GEt~~eaa~REl~EEtGl~~~~~~-~~~~~~~~~~~~~~~ 109 (165)
T cd02885 31 RAFSVFLFNSKGRLLLQRRALSKYTFPGLWTNTCCSHPLPGEGVKDAAQRRLREELGITGDLLE-LVLPRFRYRAPDDGG 109 (165)
T ss_pred eEEEEEEEcCCCcEEEEeccCCCccCCCcccccccCCCCCCCCHHHHHHHHHHHHhCCCccchh-hccceEEEEEEcCCC
Confidence 34455667777899999998765 789999997 899988 888999 999999999987655 43 444443221
Q ss_pred --eEEEEEEEEEEecCC---CCCCCCCCeeecccccCcC
Q psy9547 309 --LKWNMDVYSGTAKEK---TIPANKTYKLITETQMKKY 342 (363)
Q Consensus 309 --~~~~l~~~~~~~~~~---~~~~~~~~~W~~~~el~~~ 342 (363)
.+...++|.+....+ ...+..+++|++++++.++
T Consensus 110 ~~~~~i~~~f~~~~~~~~~~~~~Ev~~~~w~~~~el~~~ 148 (165)
T cd02885 110 LVEHEIDHVFFARADVTLIPNPDEVSEYRWVSLEDLKEL 148 (165)
T ss_pred ceeeEEEEEEEEEeCCCCCCCccceeEEEEECHHHHHHH
Confidence 122345677765433 1235678899999999763
No 77
>cd04677 Nudix_Hydrolase_18 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.30 E-value=8.4e-12 Score=103.65 Aligned_cols=102 Identities=15% Similarity=0.088 Sum_probs=68.1
Q ss_pred EEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEE--EEcc---c--
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVK--HVFS---H-- 308 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~--h~~s---h-- 308 (363)
.+++++.++++++|+.+|... |.|+||||.++. ++.+|+ ||++||+|+++.... .+.... +.|. +
T Consensus 9 ~~~~~v~~~~~~vLL~~r~~~----~~w~~PgG~v~~gEt~~~aa~REl~EE~Gi~~~~~~-~~~~~~~~~~~~~~~~~~ 83 (132)
T cd04677 9 GAGVILLNEQGEVLLQKRSDT----GDWGLPGGAMELGESLEETARRELKEETGLEVEELE-LLGVYSGKEFYVKPNGDD 83 (132)
T ss_pred ceEEEEEeCCCCEEEEEecCC----CcEECCeeecCCCCCHHHHHHHHHHHHhCCeeeeeE-EEEEecCCceeecCCCCc
Confidence 344555666689999999643 889999999998 888899 999999999887654 443221 1111 1
Q ss_pred -eEEEEEEEEEEecCCC----CCCCCCCeeecccccCcCCCC
Q psy9547 309 -LKWNMDVYSGTAKEKT----IPANKTYKLITETQMKKYAFP 345 (363)
Q Consensus 309 -~~~~l~~~~~~~~~~~----~~~~~~~~W~~~~el~~~~~~ 345 (363)
..+.+.+|.+...++. ..+..+++|++++++.+...+
T Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~e~~~~~W~~~~e~~~~~~~ 125 (132)
T cd04677 84 EQYIVTLYYVTKVFGGKLVPDGDETLELKFFSLDELPELINP 125 (132)
T ss_pred EEEEEEEEEEEeccCCcccCCCCceeeEEEEChhHCccchhH
Confidence 1233344444433322 124567899999999876554
No 78
>TIGR02150 IPP_isom_1 isopentenyl-diphosphate delta-isomerase, type 1. This model represents type 1 of two non-homologous families of the enzyme isopentenyl-diphosphate delta-isomerase (IPP isomerase). IPP is an essential building block for many compounds, including enzyme cofactors, sterols, and prenyl groups. This inzyme interconverts isopentenyl diphosphate and dimethylallyl diphosphate.
Probab=99.29 E-value=8.4e-12 Score=107.47 Aligned_cols=104 Identities=12% Similarity=0.055 Sum_probs=73.4
Q ss_pred eEEEEEEEeCCceEEEeecCCCc-ccccccccc-ccCCCCCchHHH-HHHHhhCCCcccccc-cccccEEEEcc---ceE
Q psy9547 238 FLLTVVVKTDTNKYLIQKRPTTG-LLSNFYMFL-SFESDTPYTSQV-EFLSENLPFKVNLNE-KCLGNVKHVFS---HLK 310 (363)
Q Consensus 238 ~v~~~ii~~~~~~vLl~~R~~~~-~~~glWEFP-ggk~e~~~~~al-rEl~Eelgl~v~~~~-~~l~~v~h~~s---h~~ 310 (363)
..+.++|.+++|++|++||+.++ .+.|+|++| ||+++..-.+|+ ||++||+|+++.... ..+..+.|.++ ..+
T Consensus 28 ~~v~v~v~~~~g~vLl~kR~~~k~~~PG~W~~~~gG~v~~GE~eaa~REl~EE~Gl~~~~~~l~~~~~~~~~~~~~~g~~ 107 (158)
T TIGR02150 28 RAFSVFLFNEEGQLLLQRRALSKITWPGVWTNSCCSHPLPGELEAAIRRLREELGIPADDVPLTVLPRFSYRARDAWGEH 107 (158)
T ss_pred EEEEEEEEcCCCeEEEEeccCCCcCCCCCccccccCCCCcccHHHHHHHHHHHHCCCccccceEEcceEEEEEecCCCcE
Confidence 34455666777899999997764 789999998 788888324777 999999999876542 02444444432 234
Q ss_pred EEEEEEEEEecCC-CC--CCCCCCeeecccccCc
Q psy9547 311 WNMDVYSGTAKEK-TI--PANKTYKLITETQMKK 341 (363)
Q Consensus 311 ~~l~~~~~~~~~~-~~--~~~~~~~W~~~~el~~ 341 (363)
..+++|.++.... .+ .+.+++.|++++|+.+
T Consensus 108 ~~~~~f~~~~~~~~~~~~~Ev~~~~W~~~~el~~ 141 (158)
T TIGR02150 108 ELCPVFFARAPVPLNPNPEEVAEYRWVSLEELKE 141 (158)
T ss_pred EEEEEEEEecCCcccCChhHeeeEEEeCHHHHHH
Confidence 5668888876542 22 2678899999999865
No 79
>COG1051 ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]
Probab=99.28 E-value=2.6e-11 Score=102.84 Aligned_cols=118 Identities=8% Similarity=0.087 Sum_probs=78.3
Q ss_pred eeEEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEc---cceE
Q psy9547 237 VFLLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVF---SHLK 310 (363)
Q Consensus 237 ~~v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~---sh~~ 310 (363)
..++++++...+|++|+.|| .+.+.+|.|.+|||.+|. |.++|+ ||++||+|+++...+ .+..+.-.. ...+
T Consensus 9 p~~~v~~~i~~~~~iLLvrR-~~~p~~g~WalPGG~ve~GEt~eeaa~REl~EETgL~~~~~~-~~~v~~~~~rd~r~~~ 86 (145)
T COG1051 9 PLVAVGALIVRNGRILLVRR-ANEPGAGYWALPGGFVEIGETLEEAARRELKEETGLRVRVLE-LLAVFDDPGRDPRGHH 86 (145)
T ss_pred cceeeeEEEEeCCEEEEEEe-cCCCCCCcEeCCCccCCCCCCHHHHHHHHHHHHhCCccccee-EEEEecCCCCCCceeE
Confidence 34444444433479999999 667789999999999999 889999 999999999976655 333222221 1245
Q ss_pred EEEEEEEEEecCC-CC-C--CCCCCeeecccccCcC--CCChHHHHHHHHHH
Q psy9547 311 WNMDVYSGTAKEK-TI-P--ANKTYKLITETQMKKY--AFPVPYQKVWKLFT 356 (363)
Q Consensus 311 ~~l~~~~~~~~~~-~~-~--~~~~~~W~~~~el~~~--~~~~a~~~il~~l~ 356 (363)
+.+.+|.+...++ .. . +.....|++.+++... +.+.....+.....
T Consensus 87 v~~~~~~~~~~g~~~~~~~~d~~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~ 138 (145)
T COG1051 87 VSFLFFAAEPEGELLAGDGDDAAEVGWFPLDELPELPLPLAFTLADLRRLFL 138 (145)
T ss_pred EEEEEEEEecCCCcccCChhhHhhcceecHhHcccccccccccHHHHHHHHh
Confidence 6666666554332 12 1 3466789999999984 44444444544443
No 80
>cd04692 Nudix_Hydrolase_33 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.23 E-value=2.8e-11 Score=102.45 Aligned_cols=103 Identities=17% Similarity=0.130 Sum_probs=72.5
Q ss_pred EEEEEEEeCC---ceEEEeecCCC-ccccccccc-cccCCCC--CchHHH-HHHHhhCCCcccccc-cccccEEEEccc-
Q psy9547 239 LLTVVVKTDT---NKYLIQKRPTT-GLLSNFYMF-LSFESDT--PYTSQV-EFLSENLPFKVNLNE-KCLGNVKHVFSH- 308 (363)
Q Consensus 239 v~~~ii~~~~---~~vLl~~R~~~-~~~~glWEF-Pggk~e~--~~~~al-rEl~Eelgl~v~~~~-~~l~~v~h~~sh- 308 (363)
++.++|.+++ +++|+++|... ..+.|.|+| |||+++. ++++++ ||++||+|+.+.... ..++.+.+.+.+
T Consensus 4 ~v~~~v~~~~~~~~~vLl~~R~~~~~~~pg~W~~~~gG~ve~gEt~~~aa~REl~EEtGl~~~~~~l~~~~~~~~~~~~~ 83 (144)
T cd04692 4 TFHCWIITKDEGKGYVLLQKRSANKKTYPGLWDISSAGHILAGETPLEDGIRELEEELGLDVSADDLIPLGTFKIEYDHI 83 (144)
T ss_pred EEEEEEEEccCCCCEEEEEecCCCCCCCCCccccccCcccCCCCCHHHHHHHHHHHHhCCCCChHHeEEeeEEEEecccc
Confidence 4455566665 89999999775 478999999 5999998 888999 999999999865322 146667776651
Q ss_pred ----eEEEEEEEEEEecC--CC--C--CCCCCCeeecccccCc
Q psy9547 309 ----LKWNMDVYSGTAKE--KT--I--PANKTYKLITETQMKK 341 (363)
Q Consensus 309 ----~~~~l~~~~~~~~~--~~--~--~~~~~~~W~~~~el~~ 341 (363)
.....++|.+.... +. + .+..+++|++++++.+
T Consensus 84 ~~~~~~~~~~~f~~~~~~~~~~~~~~~~E~~~~~W~~~~el~~ 126 (144)
T cd04692 84 GKLIDREFHHVYLYELKVPLEEFTLQKEEVAGVVLIPLDEFAE 126 (144)
T ss_pred CCCccceEEEEEEEeccCChhhcCCChhHhheEEEECHHHHHH
Confidence 11234566666543 21 1 3567889999998853
No 81
>cd04666 Nudix_Hydrolase_9 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate s
Probab=99.23 E-value=3.6e-11 Score=99.03 Aligned_cols=88 Identities=9% Similarity=0.035 Sum_probs=67.7
Q ss_pred ceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCccccc-ccccccEEEEccc----eEEEEEEEEEEe
Q psy9547 249 NKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLN-EKCLGNVKHVFSH----LKWNMDVYSGTA 320 (363)
Q Consensus 249 ~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~-~~~l~~v~h~~sh----~~~~l~~~~~~~ 320 (363)
+++|+.+|... |.|.||||+++. ++++|+ ||+.||+|+++... . .++++.|.+++ .+...++|.+..
T Consensus 15 ~~vLLv~~~~~----~~w~~PgG~ve~~E~~~~aa~RE~~EEtG~~~~~~~~-~l~~~~~~~~~~~~~~~~~~~~f~~~~ 89 (122)
T cd04666 15 VEVLLVTSRRT----GRWIVPKGGPEKDESPAEAAAREAWEEAGVRGKIGKR-PLGRFEYRKRSKNRPPRCEVAVFPLEV 89 (122)
T ss_pred eEEEEEEecCC----CeEECCCCCcCCCCCHHHHHHHHHHHHhCCcccccce-EEEEEEeeecCCCCCceEEEEEEEEEE
Confidence 58999988543 899999999998 788888 99999999988766 6 78888888775 366677787776
Q ss_pred cCCC---C-CCCCCCeeecccccCc
Q psy9547 321 KEKT---I-PANKTYKLITETQMKK 341 (363)
Q Consensus 321 ~~~~---~-~~~~~~~W~~~~el~~ 341 (363)
.... . .+..+.+|++++++..
T Consensus 90 ~~~~~~~~~~e~~~~~W~~~~ea~~ 114 (122)
T cd04666 90 TEELDEWPEMHQRKRKWFSPEEAAL 114 (122)
T ss_pred eccccCCcccCceEEEEecHHHHHH
Confidence 5421 1 1335689999998843
No 82
>PRK05379 bifunctional nicotinamide mononucleotide adenylyltransferase/ADP-ribose pyrophosphatase; Provisional
Probab=99.22 E-value=5.7e-11 Score=114.61 Aligned_cols=112 Identities=10% Similarity=0.015 Sum_probs=80.0
Q ss_pred EEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCccccccccccc--EEEEc--cce----E
Q psy9547 242 VVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGN--VKHVF--SHL----K 310 (363)
Q Consensus 242 ~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~--v~h~~--sh~----~ 310 (363)
++|.++ |+|||.+|.. +...|+|.||||++++ +.++|+ ||++||+|+++.... ..+. ..+.| ++. +
T Consensus 208 avv~~~-g~VLLvrR~~-~p~~g~W~lPGG~ve~gEt~~~Aa~REl~EETGl~v~~~~-l~~~~~~~~~f~~p~r~~~~~ 284 (340)
T PRK05379 208 AVVVQS-GHVLLVRRRA-EPGKGLWALPGGFLEQDETLLDACLRELREETGLKLPEPV-LRGSIRDQQVFDHPGRSLRGR 284 (340)
T ss_pred EEEEEC-CEEEEEEecC-CCCCCeEECCcccCCCCCCHHHHHHHHHHHHHCCcccccc-cceeeeeeEEEcCCCCCCCCc
Confidence 444454 8999999954 4568999999999999 888999 999999999875543 2222 12333 322 3
Q ss_pred EEEEEEEEEecCCC------CCCCCCCeeecccccCcC--CCChHHHHHHHHHH
Q psy9547 311 WNMDVYSGTAKEKT------IPANKTYKLITETQMKKY--AFPVPYQKVWKLFT 356 (363)
Q Consensus 311 ~~l~~~~~~~~~~~------~~~~~~~~W~~~~el~~~--~~~~a~~~il~~l~ 356 (363)
...++|.+....+. ..+..+++|++++++... .+...+..||+.+.
T Consensus 285 ~i~~~f~~~~~~~~~~~~~~~de~~~~~W~~~~el~~~~~~~~~dh~~ii~~~~ 338 (340)
T PRK05379 285 TITHAFLFEFPAGELPRVKGGDDADKARWVPLAELLAMRDRMFEDHFQIITHFL 338 (340)
T ss_pred EEEEEEEEEecCCccCccCCCCceeeEEEEEHHHhhhhhhhhhhHHHHHHHHHh
Confidence 44567777665331 135678899999999875 57788888887765
No 83
>cd04686 Nudix_Hydrolase_27 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.22 E-value=6.1e-11 Score=98.82 Aligned_cols=96 Identities=10% Similarity=0.013 Sum_probs=67.3
Q ss_pred EEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcc-cccccccccEEEEc----c---c
Q psy9547 240 LTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKV-NLNEKCLGNVKHVF----S---H 308 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v-~~~~~~l~~v~h~~----s---h 308 (363)
+.+||.++ |+||+.+|.. .|.|+||||+++. ++.+|+ ||++||+|+.+ .... .++.+.+.+ + .
T Consensus 3 ~~~ii~~~-~~vLLv~~~~----~~~w~lPgG~ve~gEt~~~aa~REl~EEtGl~~~~~~~-~l~~~~~~~~~~~~~~~~ 76 (131)
T cd04686 3 VRAIILQG-DKILLLYTKR----YGDYKFPGGGVEKGEDHIEGLIRELQEETGATNIRVIE-KFGTYTERRPWRKPDADI 76 (131)
T ss_pred EEEEEEEC-CEEEEEEEcC----CCcEECccccCCCCCCHHHHHHHHHHHHHCCcccccce-EEEEEEeeccccCCCCce
Confidence 56677776 8999999854 2579999999998 888999 99999999986 3444 555554322 2 2
Q ss_pred eEEEEEEEEEEecCC--CC-CCC------CCCeeecccccCc
Q psy9547 309 LKWNMDVYSGTAKEK--TI-PAN------KTYKLITETQMKK 341 (363)
Q Consensus 309 ~~~~l~~~~~~~~~~--~~-~~~------~~~~W~~~~el~~ 341 (363)
++...++|.|....+ .. .++ -...|++++++..
T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~W~~~~ea~~ 118 (131)
T cd04686 77 FHMISYYYLCEVDAELGAQQLEDYEAELGMKPIWINIHEAIE 118 (131)
T ss_pred eEEEEEEEEEEEcCCcCCcccchhhHhcCCCcEEecHHHHHH
Confidence 344567888887543 11 111 2368999998854
No 84
>PRK00241 nudC NADH pyrophosphatase; Reviewed
Probab=99.21 E-value=6.7e-11 Score=109.46 Aligned_cols=99 Identities=15% Similarity=0.139 Sum_probs=75.1
Q ss_pred EEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEEEEEE
Q psy9547 242 VVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMDVYSG 318 (363)
Q Consensus 242 ~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~~~~ 318 (363)
++|.++ +++|+.+|...+ .|+|++|||.+|. +.++|+ ||++||+|+++.... ++++..+.|++. .+..|.+
T Consensus 137 v~V~~~-~~iLL~rr~~~~--~g~wslPgG~vE~GEs~eeAa~REv~EEtGl~v~~~~-~~~s~~~~~p~~--lm~~f~a 210 (256)
T PRK00241 137 VAVRRG-DEILLARHPRHR--NGVYTVLAGFVEVGETLEQCVAREVMEESGIKVKNLR-YVGSQPWPFPHS--LMLGFHA 210 (256)
T ss_pred EEEEeC-CEEEEEEccCCC--CCcEeCcccCCCCCCCHHHHhhhhhhhccCceeeeeE-EEEeEeecCCCe--EEEEEEE
Confidence 344444 899999985443 7999999999998 889999 999999999987666 676666667764 4677888
Q ss_pred EecCCCC----CCCCCCeeecccccCcCCCCh
Q psy9547 319 TAKEKTI----PANKTYKLITETQMKKYAFPV 346 (363)
Q Consensus 319 ~~~~~~~----~~~~~~~W~~~~el~~~~~~~ 346 (363)
+..++.. .|..+.+|++++|+..++-..
T Consensus 211 ~~~~~~~~~~~~Ei~~a~W~~~del~~lp~~~ 242 (256)
T PRK00241 211 DYDSGEIVFDPKEIADAQWFRYDELPLLPPSG 242 (256)
T ss_pred EecCCcccCCcccEEEEEEECHHHCcccCCch
Confidence 7765432 244677999999998765443
No 85
>cd04685 Nudix_Hydrolase_26 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily requires a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.19 E-value=6.1e-11 Score=99.12 Aligned_cols=102 Identities=9% Similarity=0.070 Sum_probs=70.9
Q ss_pred EEEEEEeCCceEEEeecCCC-ccccccccccccCCCC--CchHHH-HHHHhhCCCcc-cccccccccEEEEccc----eE
Q psy9547 240 LTVVVKTDTNKYLIQKRPTT-GLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKV-NLNEKCLGNVKHVFSH----LK 310 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~-~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v-~~~~~~l~~v~h~~sh----~~ 310 (363)
+-++|.+.+|+||+.+|... +..++.|+||||.++. +..+|+ ||++||+|+++ .... .+....+.|++ .+
T Consensus 3 ~~~~i~~~~g~vLl~r~~~~~~~~~~~w~~PgG~ve~gE~~~~a~~Re~~EE~G~~~~~~~~-~~~~~~~~f~~~~~~~~ 81 (133)
T cd04685 3 ARVVLLDPDDRVLLLRGDDPDSPGPDWWFTPGGGVEPGESPEQAARRELREETGITVADLGP-PVWRRDAAFTFLGVDGR 81 (133)
T ss_pred EEEEEEcCCCeEEEEEEeCCCCCCCCEEECCcCCCCCCCCHHHHHHHHHHHHHCCccccccc-eEEEEEEEEEecCccce
Confidence 34556666699999988654 2578999999999988 788999 99999999988 4444 44444444422 12
Q ss_pred EEEEEEEEEecCCCC-------C---CCCCCeeecccccCcC
Q psy9547 311 WNMDVYSGTAKEKTI-------P---ANKTYKLITETQMKKY 342 (363)
Q Consensus 311 ~~l~~~~~~~~~~~~-------~---~~~~~~W~~~~el~~~ 342 (363)
-..++|.+....+.. . ....++|+++++|.++
T Consensus 82 ~~~~~f~~~~~~~~~~~~~~~~~E~~~~~~~~W~~~~el~~~ 123 (133)
T cd04685 82 QEERFFLARTPRTEPSPAGWTALERRSILGWRWWTRAELAAT 123 (133)
T ss_pred eeEEEEEEEcCCccccCCCCChhhhhhcccccCCCHHHHhhC
Confidence 234577777653211 1 1246799999999886
No 86
>COG1059 Thermostable 8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair]
Probab=99.19 E-value=1.5e-10 Score=98.86 Aligned_cols=137 Identities=14% Similarity=0.178 Sum_probs=104.4
Q ss_pred CChHHHHHHHHHhhhcchhhHHHHHHHHHHhCCCHHHHhcCCHHHHHHHHhccCc--hH-HHHHHHHHHHHhhHHh----
Q psy9547 34 NNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAFDTEDNVLKMWEGLGY--YS-RVRNFQAGCRQVIEQF---- 106 (363)
Q Consensus 34 ~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~pt~~~la~a~~~el~~~~~~lG~--~~-rA~~l~~~a~~i~~~~---- 106 (363)
.+-|..|.-+|+..|++...+.++-..+-+.| ..++.+||.+.++.+|| |+ ||++|.++=+.+-+--
T Consensus 37 e~lf~ELsFCILTANsSA~~~~~~q~~lG~gf------ly~~~eEL~e~Lk~~g~Rf~n~raeyIVeaR~~~~~lk~~v~ 110 (210)
T COG1059 37 EDLFKELSFCILTANSSATMGLRAQNELGDGF------LYLSEEELREKLKEVGYRFYNVRAEYIVEAREKFDDLKIIVK 110 (210)
T ss_pred HHHHHHHHHHhccccchHHHHHHHHHHhcccc------ccCCHHHHHHHHHHhcchhcccchHHHHHHHHHHHHHHHHHh
Confidence 57799999999999999999999877776554 44688999999999985 56 9999988544442100
Q ss_pred CCCCcc-cHHHHh-cCCCCcHHHHHHHHHHhcCCccccccchHHHHHhhhcccccCcC--CcccHHHHHHHHHh
Q psy9547 107 GGEVPR-DKKQLL-SIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDIS--KGKTKVVYETLVSK 176 (363)
Q Consensus 107 ~g~~p~-~~~~L~-~lpGIG~~tA~~il~~~~~~~~~~vD~~v~Rvl~Rl~~~~~~~~--~~~~~~~~~~~~~~ 176 (363)
....+. .++.|. .++|||.|-|+.+|....-....++|.|+.|.+.|++.++..+. +++.+-.++..++.
T Consensus 111 ~~~~~~vaRE~Lv~nikGiGyKEASHFLRNVG~~D~AIlDrHIlr~l~r~g~i~e~~kt~t~K~YLe~E~ilr~ 184 (210)
T COG1059 111 ADENEKVARELLVENIKGIGYKEASHFLRNVGFEDLAILDRHILRWLVRYGLIDENPKTLTRKLYLEIEEILRS 184 (210)
T ss_pred cCcchHHHHHHHHHHcccccHHHHHHHHHhcChhHHHHHHHHHHHHHHHhcccccCcccccHHHHHHHHHHHHH
Confidence 111233 588888 79999999999999987667899999999999999999886542 23334445555444
No 87
>PRK00714 RNA pyrophosphohydrolase; Reviewed
Probab=99.18 E-value=7e-11 Score=101.52 Aligned_cols=114 Identities=10% Similarity=0.095 Sum_probs=75.0
Q ss_pred EEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccE----EEEccce--
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNV----KHVFSHL-- 309 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v----~h~~sh~-- 309 (363)
.+.++|.+.+|++|+.||... .|.|+||||.+++ ++++|+ ||+.||+|+++.... .++.. .+.|+..
T Consensus 10 ~v~~~i~~~~g~vLL~~r~~~---~~~w~~P~G~~~~gE~~~~aa~REl~EEtG~~~~~~~-~~~~~~~~~~y~~~~~~~ 85 (156)
T PRK00714 10 NVGIILLNRQGQVFWGRRIGQ---GHSWQFPQGGIDPGETPEQAMYRELYEEVGLRPEDVE-ILAETRDWLRYDLPKRLV 85 (156)
T ss_pred eEEEEEEecCCEEEEEEEcCC---CCeEECCcccCCCCcCHHHHHHHHHHHHhCCCccceE-EEEEcCCeEEecCcHHHh
Confidence 455566776799999999642 4899999999998 788888 999999999875443 33332 2222210
Q ss_pred --------EEEEEEEEEEecCC----CC-----CCCCCCeeecccccCcCCCChHHHHHHHHHHh
Q psy9547 310 --------KWNMDVYSGTAKEK----TI-----PANKTYKLITETQMKKYAFPVPYQKVWKLFTK 357 (363)
Q Consensus 310 --------~~~l~~~~~~~~~~----~~-----~~~~~~~W~~~~el~~~~~~~a~~~il~~l~~ 357 (363)
.-..++|.+....+ .+ .+..+++|++++|+.+.-++. .+++++.+.+
T Consensus 86 ~~~~~~~~~~~~~~fl~~~~~~~~~~~l~~~~~~E~~~~~W~~~del~~~~~~~-~r~~~~~~~~ 149 (156)
T PRK00714 86 RRSKGVYRGQKQKWFLLRLTGDDSEINLNTTSHPEFDAWRWVSYWYPLDQVVPF-KRDVYRRVLK 149 (156)
T ss_pred hccCCcccCcEEEEEEEEecCCCccccCCCCCCCCeeeeEeCCHHHHHHhchhh-hHHHHHHHHH
Confidence 01345676665321 11 245678999999998854443 3566655544
No 88
>PRK03759 isopentenyl-diphosphate delta-isomerase; Provisional
Probab=99.12 E-value=2.6e-10 Score=100.72 Aligned_cols=114 Identities=10% Similarity=0.080 Sum_probs=75.2
Q ss_pred EEEEEEeCCceEEEeecCCCc-ccccccccc-ccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEc--cc---e
Q psy9547 240 LTVVVKTDTNKYLIQKRPTTG-LLSNFYMFL-SFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVF--SH---L 309 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~~-~~~glWEFP-ggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~--sh---~ 309 (363)
+.++|.+++|+||+.||+.++ .+.|.|.+| ||++++ ++++|+ ||+.||+|+++......+..+.+.+ +. .
T Consensus 37 v~v~i~~~~g~vLL~rR~~~~~~~PG~w~~~~gG~ve~GEt~~~aa~REl~EEtGl~~~~~~~~~~~~~~~~~~~~~~~~ 116 (184)
T PRK03759 37 FSCYLFDADGRLLVTRRALSKKTWPGVWTNSCCGHPQPGESLEDAVIRRCREELGVEITDLELVLPDFRYRATDPNGIVE 116 (184)
T ss_pred EEEEEEcCCCeEEEEEccCCCCCCCCcccccccCCCCCCCCHHHHHHHHHHHHhCCCccccccccceEEEEEecCCCcee
Confidence 344566767899999997654 578999997 899888 888999 9999999998853321233333221 11 1
Q ss_pred EEEEEEEEEEecCC-C--CCCCCCCeeecccccCc------CCCChHHHHHHH
Q psy9547 310 KWNMDVYSGTAKEK-T--IPANKTYKLITETQMKK------YAFPVPYQKVWK 353 (363)
Q Consensus 310 ~~~l~~~~~~~~~~-~--~~~~~~~~W~~~~el~~------~~~~~a~~~il~ 353 (363)
+...++|.+..... . ..+..++.|++++++.+ ..+++..+.++.
T Consensus 117 ~~~~~vf~~~~~~~~~~~~~Ev~~~~W~~~~el~~~i~~~~~~~~~~l~~~~~ 169 (184)
T PRK03759 117 NEVCPVFAARVTSALQPNPDEVMDYQWVDPADLLRAVDATPWAFSPWMVLQAA 169 (184)
T ss_pred eEEEEEEEEEECCCCCCChhHeeeEEEECHHHHHHHHHhCCcccChHHHHHHH
Confidence 22356777776532 1 23557889999999953 345555555443
No 89
>cd04674 Nudix_Hydrolase_16 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=99.12 E-value=4.6e-10 Score=91.63 Aligned_cols=80 Identities=9% Similarity=-0.107 Sum_probs=53.8
Q ss_pred eEEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEE
Q psy9547 238 FLLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMD 314 (363)
Q Consensus 238 ~v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~ 314 (363)
.+++++|..++ .+|+.+|.. ....|.|+||||+++. +.++|+ ||+.||+|+++.... ...-..|..+...+.+.
T Consensus 5 ~~av~vl~~~~-~~lL~~r~~-~~~~~~w~lPgG~ve~~E~~~~aa~REl~EE~g~~~~~~~-l~~~~~~~~~~~~~~~~ 81 (118)
T cd04674 5 PVVVALLPVDD-GLLVIRRGI-EPGRGKLALPGGFIELGETWQDAVARELLEETGVAVDPAD-IRLFDVRSAPDGTLLVF 81 (118)
T ss_pred EEEEEEEEECC-CEEEEEeec-CCCCCeEECCceecCCCCCHHHHHHHHHHHHHCCcccccE-EEEEEEEecCCCeEEEE
Confidence 45666666664 566666643 3468999999999999 788888 999999999876432 22223355555555555
Q ss_pred EEEEEe
Q psy9547 315 VYSGTA 320 (363)
Q Consensus 315 ~~~~~~ 320 (363)
.|....
T Consensus 82 ~~~~~~ 87 (118)
T cd04674 82 GLLPER 87 (118)
T ss_pred EEEecc
Confidence 555543
No 90
>PRK10707 putative NUDIX hydrolase; Provisional
Probab=99.07 E-value=4.9e-10 Score=99.24 Aligned_cols=95 Identities=11% Similarity=0.035 Sum_probs=68.2
Q ss_pred eCCceEEEeecCCC-ccccccccccccCCCC---CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEEEEEEEe
Q psy9547 246 TDTNKYLIQKRPTT-GLLSNFYMFLSFESDT---PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMDVYSGTA 320 (363)
Q Consensus 246 ~~~~~vLl~~R~~~-~~~~glWEFPggk~e~---~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~~~~~~ 320 (363)
++++.+|+.||... ....|.|+||||++|+ ++.+++ ||+.||+|+.+.... .++++...++...+.++.|.+.+
T Consensus 41 ~~~~~vLl~~R~~~~r~~~G~~~~PGG~~e~~de~~~~tA~REl~EEtGl~~~~~~-~lg~l~~~~~~~~~~~~~~v~~~ 119 (190)
T PRK10707 41 RPQPTLLLTQRSIHLRKHAGQVAFPGGAVDPTDASLIATALREAQEEVAIPPSAVE-VIGVLPPVDSSTGYQVTPVVGII 119 (190)
T ss_pred CCCCEEEEEEeCCcccCCCCcEEcCCcccCCCcccHHHHHHHHHHHHHCCCccceE-EEEEeeeeeccCCcEEEEEEEEE
Confidence 34469999998755 4678999999999996 467788 999999999887666 77777665554455566666554
Q ss_pred cCC---C--CCCCCCCeeecccccCc
Q psy9547 321 KEK---T--IPANKTYKLITETQMKK 341 (363)
Q Consensus 321 ~~~---~--~~~~~~~~W~~~~el~~ 341 (363)
..+ . ..|..+..|++++++.+
T Consensus 120 ~~~~~~~~d~~Ev~~v~~vpl~e~~~ 145 (190)
T PRK10707 120 PPDLPYRANEDEVAAVFEMPLAEALH 145 (190)
T ss_pred CCCCCCCCChhhhheEEEEeHHHHhC
Confidence 432 1 12345668999888755
No 91
>cd04663 Nudix_Hydrolase_6 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belong to this superfamily requires a divalent cation, such as Mg2+ or Mn2+ for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, U=I, L or V) which functions as metal binding and catalytic site. Substrates of nudix hydrolase include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate specificity are
Probab=99.03 E-value=1.3e-09 Score=89.70 Aligned_cols=98 Identities=8% Similarity=0.107 Sum_probs=64.7
Q ss_pred EEEEEEeCCc--eEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccE--EEEccceEEE
Q psy9547 240 LTVVVKTDTN--KYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNV--KHVFSHLKWN 312 (363)
Q Consensus 240 ~~~ii~~~~~--~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v--~h~~sh~~~~ 312 (363)
++++|.++++ ++|+.|.+. +.|+||||++++ ++++|+ ||++||+|+++.+........ .+.+++..+.
T Consensus 3 ~~~~~~~~~~~~~ll~~r~~~-----~~~~lPgG~ve~~E~~~~aa~Rel~EEtGl~~~~~~~~~~~~~~~~~~~~~~~~ 77 (126)
T cd04663 3 CPAVLRRNGEVLELLVFEHPL-----AGFQIVKGTVEPGETPEAAALRELQEESGLPSFLSDYILHVWERRFYQKRHFWH 77 (126)
T ss_pred EEEEEEeCCceEEEEEEEcCC-----CcEECCCccCCCCCCHHHHHHHHHHHHHCCeeeeeeecceeeeCCEeeccEEEE
Confidence 5566666544 777776653 349999999999 888888 999999999873322122211 1225677777
Q ss_pred EEEEEEE-----ecCCCC--CCCCCCe--eecccccCcC
Q psy9547 313 MDVYSGT-----AKEKTI--PANKTYK--LITETQMKKY 342 (363)
Q Consensus 313 l~~~~~~-----~~~~~~--~~~~~~~--W~~~~el~~~ 342 (363)
+.++.+. .+.+.+ .+++.++ |++++++.+.
T Consensus 78 l~~~~~~~~~~~~~~~~~~~~E~~~i~~~Wv~l~~~~~~ 116 (126)
T cd04663 78 LTLCEVDQDLPDSWVHFVQDDGGHEFRFFWVDLASCLDE 116 (126)
T ss_pred EEEEEecCCCcccccCcccCCCCceEEEEEEcccccccc
Confidence 7777662 222222 3566666 9999999654
No 92
>PRK11762 nudE adenosine nucleotide hydrolase NudE; Provisional
Probab=99.01 E-value=7.6e-10 Score=97.78 Aligned_cols=95 Identities=12% Similarity=-0.023 Sum_probs=71.4
Q ss_pred EeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEEEEEEEec
Q psy9547 245 KTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMDVYSGTAK 321 (363)
Q Consensus 245 ~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~~~~~~~ 321 (363)
.+++++||+.++...+...+.||||||.+++ ++++++ ||++||+|+.+.... .++.+.+..+.....+++|.+...
T Consensus 55 ~~~~~~vlLvrq~r~~~~~~~~elPaG~ve~gE~~~~aA~REl~EEtG~~~~~l~-~l~~~~~~~~~~~~~~~~f~a~~~ 133 (185)
T PRK11762 55 ILDDDTLLLIREYAAGTERYELGFPKGLIDPGETPLEAANRELKEEVGFGARQLT-FLKELSLAPSYFSSKMNIVLAEDL 133 (185)
T ss_pred EeCCCEEEEEEeecCCCCCcEEEccceeCCCCCCHHHHHHHHHHHHHCCCCcceE-EEEEEecCCCccCcEEEEEEEEcc
Confidence 4455789998886667778899999999998 788888 999999999988777 777777666666677888887754
Q ss_pred CCC---CC--CCCCCeeecccccC
Q psy9547 322 EKT---IP--ANKTYKLITETQMK 340 (363)
Q Consensus 322 ~~~---~~--~~~~~~W~~~~el~ 340 (363)
.+. .. +..+..|++++++.
T Consensus 134 ~~~~~~~~e~E~i~~~~~~~~e~~ 157 (185)
T PRK11762 134 YPERLEGDEPEPLEVVRWPLADLD 157 (185)
T ss_pred ccccCCCCCCceeEEEEEcHHHHH
Confidence 321 11 22345788888774
No 93
>cd04665 Nudix_Hydrolase_8 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate s
Probab=98.88 E-value=1.1e-08 Score=83.65 Aligned_cols=94 Identities=10% Similarity=0.052 Sum_probs=64.5
Q ss_pred EEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEEEE
Q psy9547 240 LTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMDVY 316 (363)
Q Consensus 240 ~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~~ 316 (363)
+++++.++ |++|+.++. .+-|+||||+++. ++++|+ ||+.||+|+.+.... .++...-..........+|
T Consensus 3 v~vi~~~~-~~vLl~~~~-----~~~w~lPgG~ve~gE~~~~aa~REl~EE~G~~~~~~~-~l~~~~~~~~~~~~~~~~y 75 (118)
T cd04665 3 VLVICFYD-DGLLLVRHK-----DRGWEFPGGHVEPGETIEEAARREVWEETGAELGSLT-LVGYYQVDLFESGFETLVY 75 (118)
T ss_pred EEEEEEEC-CEEEEEEeC-----CCEEECCccccCCCCCHHHHHHHHHHHHHCCccCceE-EEEEEEecCCCCcEEEEEE
Confidence 34444555 899999884 3569999999998 788888 999999999886555 5554332222224455667
Q ss_pred EEEecCCC----CCCCCCCeeecccccC
Q psy9547 317 SGTAKEKT----IPANKTYKLITETQMK 340 (363)
Q Consensus 317 ~~~~~~~~----~~~~~~~~W~~~~el~ 340 (363)
.+....+. ..+.....|++.+...
T Consensus 76 ~a~~~~~~~~~~~~E~~~~~~~~~~~~~ 103 (118)
T cd04665 76 PAVSAQLEEKASYLETDGPVLFKNEPEE 103 (118)
T ss_pred EEEEEecccccccccccCcEEeccCCcc
Confidence 77766542 1356778899866553
No 94
>cd04662 Nudix_Hydrolase_5 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate s
Probab=98.84 E-value=1.8e-08 Score=82.85 Aligned_cols=45 Identities=13% Similarity=-0.051 Sum_probs=36.5
Q ss_pred ceEEEeecCCC---ccccccccccccCCCC--CchHHH-HHHHhhCCCccc
Q psy9547 249 NKYLIQKRPTT---GLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVN 293 (363)
Q Consensus 249 ~~vLl~~R~~~---~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~ 293 (363)
.+|||.+|... ..-.|.|+||||+++. ++.+++ ||+.||+|+.+.
T Consensus 15 ~~vlL~~~~~~~~~~~~~~~W~lPgG~ie~~E~~~~aA~REl~EEtGl~~~ 65 (126)
T cd04662 15 IEVLLVHPGGPFWANKDLGAWSIPKGEYTEGEDPLLAAKREFSEETGFCVD 65 (126)
T ss_pred EEEEEEEccCccccCCCCCEEECCcccCCCCcCHHHHHHHHHHHHhCCcce
Confidence 36999987332 2446899999999998 788888 999999999875
No 95
>cd03676 Nudix_hydrolase_3 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belong to this superfamily requires a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate spe
Probab=98.80 E-value=1.8e-08 Score=88.58 Aligned_cols=99 Identities=12% Similarity=0.077 Sum_probs=68.2
Q ss_pred EEEeCC--ceEEEeecCCCc-cccccc-cccccCCCC--CchHHH-HHHHhhCCCccccccc--ccccEEEEcc--c---
Q psy9547 243 VVKTDT--NKYLIQKRPTTG-LLSNFY-MFLSFESDT--PYTSQV-EFLSENLPFKVNLNEK--CLGNVKHVFS--H--- 308 (363)
Q Consensus 243 ii~~~~--~~vLl~~R~~~~-~~~glW-EFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~--~l~~v~h~~s--h--- 308 (363)
++.+.+ +++|++||...+ .+.|+| ++|+|.++. ++.+++ ||++||+|+.+..... .++.+...|. .
T Consensus 40 ~~~~~~~~~~l~lqrRs~~K~~~Pg~wd~~~~G~v~~gE~~~~aA~REl~EE~Gl~~~~~~~l~~~g~~~~~~~~~~~~~ 119 (180)
T cd03676 40 YVRDEDGGLRIWIPRRSPTKATWPGMLDNLVAGGLGHGEGPEETLVKECDEEAGLPEDLVRQLKPVGVVSYLREGEAGGL 119 (180)
T ss_pred EEEcCCCCeEEEEEeccCCCCCCCCceeeecccCCCCCCCHHHHHHHHHHHHhCCCHHHHhhceeccEEEEEEEcCCCcE
Confidence 344554 799999998765 589999 699999988 788888 9999999998764320 2333444432 1
Q ss_pred eEEEEEEEEEEecCC-----CCCCCCCCeeecccccCc
Q psy9547 309 LKWNMDVYSGTAKEK-----TIPANKTYKLITETQMKK 341 (363)
Q Consensus 309 ~~~~l~~~~~~~~~~-----~~~~~~~~~W~~~~el~~ 341 (363)
.+..+++|.+....+ ...|..++.|++++|+.+
T Consensus 120 ~~e~~~~f~~~~~~~~~~~~~~~Ev~~~~~~~~~el~~ 157 (180)
T cd03676 120 QPEVEYVYDLELPPDFIPAPQDGEVESFRLLTIDEVLR 157 (180)
T ss_pred eeeEEEEEEEEcCCCCeeCCCCCcEeEEEEECHHHHHH
Confidence 124467777765322 112457789999999863
No 96
>PLN02709 nudix hydrolase
Probab=98.79 E-value=1.7e-08 Score=90.73 Aligned_cols=91 Identities=9% Similarity=0.016 Sum_probs=63.8
Q ss_pred ceEEEeecCCC-ccccccccccccCCCC---CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEEEEEEEecCC
Q psy9547 249 NKYLIQKRPTT-GLLSNFYMFLSFESDT---PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMDVYSGTAKEK 323 (363)
Q Consensus 249 ~~vLl~~R~~~-~~~~glWEFPggk~e~---~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~~~~~~~~~ 323 (363)
.++|+.+|... +.++|.|.||||++|+ +..+++ ||+.||+|+.....+ .++.....++...+.+++|.+.+...
T Consensus 51 ~~vLl~~Rs~~l~~h~GqiafPGG~~e~~D~~~~~tAlRE~~EEiGl~~~~v~-vlg~L~~~~t~sg~~V~P~V~~~~~~ 129 (222)
T PLN02709 51 LRVILTKRSSTLSSHPGEVALPGGKRDEEDKDDIATALREAREEIGLDPSLVT-IISVLEPFVNKKGMSVAPVIGFLHDK 129 (222)
T ss_pred eEEEEEEcCCCCCCCCCCccCCCcccCCCCCCHHHHHHHHHHHHHCCCchheE-EeeecCCeECCCCCEEEEEEEEecCC
Confidence 38999999765 3689999999999998 345777 999999999876555 55555555555556678887765421
Q ss_pred ---C----CCCCCCCeeecccccC
Q psy9547 324 ---T----IPANKTYKLITETQMK 340 (363)
Q Consensus 324 ---~----~~~~~~~~W~~~~el~ 340 (363)
. ..|-+.+.|++++++-
T Consensus 130 ~~~~~~~np~EV~~vf~vPL~~ll 153 (222)
T PLN02709 130 KAFKPLPNPAEVEEIFDVPLEMFL 153 (222)
T ss_pred CCccccCChhhhheeEEecHHHHh
Confidence 1 1244555666666553
No 97
>PF00633 HHH: Helix-hairpin-helix motif; InterPro: IPR000445 The HhH motif is an around 20 amino acids domain present in prokaryotic and eukaryotic non-sequence-specific DNA binding proteins [, , ]. The HhH motif is similar to, but distinct from, the HtH motif. Both of these motifs have two helices connected by a short turn. In the HtH motif the second helix binds to DNA with the helix in the major groove. This allows the contact between specific base and residues throughout the protein. In the HhH motif the second helix does not protrude from the surface of the protein and therefore cannot lie in the major groove of the DNA. Crystallographic studies suggest that the interaction of the HhH domain with DNA is mediated by amino acids located in the strongly conserved loop (L-P-G-V) and at the N-terminal end of the second helix []. This interaction could involve the formation of hydrogen bonds between protein backbone nitrogens and DNA phosphate groups []. The structural difference between the HtH and HhH domains is reflected at the functional level: whereas the HtH domain, found primarily in gene regulatory proteins, binds DNA in a sequence specific manner, the HhH domain is rather found in proteins involved in enzymatic activities and binds DNA with no sequence specificity []. The HhH domain of DisA, a bacterial checkpoint control protein, is a DNA-binding domain [].; GO: 0003677 DNA binding; PDB: 3C1Z_A 3C23_A 3C1Y_A 3C21_A 1Z00_A 2A1J_B 1KEA_A 1VRL_A 1RRQ_A 3G0Q_A ....
Probab=98.76 E-value=2.2e-09 Score=65.27 Aligned_cols=29 Identities=31% Similarity=0.603 Sum_probs=25.5
Q ss_pred hCCCCcccHHHHhcCCCCcHHHHHHHHHH
Q psy9547 106 FGGEVPRDKKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 106 ~~g~~p~~~~~L~~lpGIG~~tA~~il~~ 134 (363)
|+|.+|.++++|++|||||+|||++|+.|
T Consensus 2 ~~g~~pas~eeL~~lpGIG~~tA~~I~~~ 30 (30)
T PF00633_consen 2 LDGLIPASIEELMKLPGIGPKTANAILSF 30 (30)
T ss_dssp HHHHHTSSHHHHHTSTT-SHHHHHHHHHH
T ss_pred CCCcCCCCHHHHHhCCCcCHHHHHHHHhC
Confidence 45778999999999999999999999986
No 98
>TIGR02705 nudix_YtkD nucleoside triphosphatase YtkD. The functional assignment to the proteins of this family is contentious. Reference challenges the findings of reference, both in interpretation and in enzyme assay results. This protein belongs to the nudix family and shares some sequence identity with E. coli MutT but appears not to be functionally interchangeable with it.
Probab=98.75 E-value=3e-08 Score=84.72 Aligned_cols=96 Identities=10% Similarity=0.052 Sum_probs=66.1
Q ss_pred EEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEEE
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMDV 315 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~ 315 (363)
.++++.+.+ +++|+.++.. ..||||||++|+ ++++|+ ||+.||+|+.+.... .++.+...........++
T Consensus 26 ~V~ii~~~~-~~~LL~~~~~-----~~~elPgG~vE~gEt~~eaA~REl~EETG~~~~~~~-~lg~~~~~~~~~~~~~~v 98 (156)
T TIGR02705 26 HVLVIPRYK-DQWLLTEHKR-----RGLEFPGGKVEPGETSKEAAIREVMEETGAIVKELH-YIGQYEVEGESTDFVKDV 98 (156)
T ss_pred EEEEEEEEC-CEEEEEEEcC-----CcEECCceecCCCCCHHHHHHHHHHHHhCcEeeeeE-EEEEEEecCCCcEEEEEE
Confidence 344444455 6898887742 349999999998 788999 999999999887666 666655544445677888
Q ss_pred EEEEecCCC-CCCCCCCe-eecccccCc
Q psy9547 316 YSGTAKEKT-IPANKTYK-LITETQMKK 341 (363)
Q Consensus 316 ~~~~~~~~~-~~~~~~~~-W~~~~el~~ 341 (363)
|.++..... ..+..+.. ++.++++.+
T Consensus 99 f~A~~~~~~~~~e~~E~~~~~~~~~~~~ 126 (156)
T TIGR02705 99 YFAEVSALESKDDYLETKGPVLLQEIPD 126 (156)
T ss_pred EEEEEeccccCCCceeeEeEEEHHHHHH
Confidence 888877432 22323334 577776643
No 99
>PLN02552 isopentenyl-diphosphate delta-isomerase
Probab=98.64 E-value=1.3e-07 Score=86.71 Aligned_cols=119 Identities=10% Similarity=0.104 Sum_probs=74.1
Q ss_pred eEEEEEEEeCCceEEEeecCCCc-cccccc-----cccccCCCC---------------CchHHH-HHHHhhCCCcccc-
Q psy9547 238 FLLTVVVKTDTNKYLIQKRPTTG-LLSNFY-----MFLSFESDT---------------PYTSQV-EFLSENLPFKVNL- 294 (363)
Q Consensus 238 ~v~~~ii~~~~~~vLl~~R~~~~-~~~glW-----EFPggk~e~---------------~~~~al-rEl~Eelgl~v~~- 294 (363)
+++.++|.+.+|++||+||...+ .+.|+| ..|++-.+. +..+|+ ||+.||+|+.+..
T Consensus 57 ra~~v~i~n~~g~lLLQkRs~~K~~~Pg~Wd~s~~GHp~~ge~~~e~~~e~~~~~~~~~~~~eAA~REL~EElGI~~~~~ 136 (247)
T PLN02552 57 RAFSVFLFNSKYELLLQQRAATKVTFPLVWTNTCCSHPLYGQDPNEVDRESELIDGNVLGVKNAAQRKLLHELGIPAEDV 136 (247)
T ss_pred EEEEEEEEcCCCeEEEEEecCCCCCCCcceecccCCccccccccccccccccccccchhhHHHHHHhHHHHHhCCCcccc
Confidence 34455667777999999997755 699999 555543111 145677 9999999998543
Q ss_pred ----cccccccEEEEccc------------eEEEEEEEEEEecCCC----CCCCCCCeeecccccCcC-------CCChH
Q psy9547 295 ----NEKCLGNVKHVFSH------------LKWNMDVYSGTAKEKT----IPANKTYKLITETQMKKY-------AFPVP 347 (363)
Q Consensus 295 ----~~~~l~~v~h~~sh------------~~~~l~~~~~~~~~~~----~~~~~~~~W~~~~el~~~-------~~~~a 347 (363)
.. .++.+.+.... ..+...+|......+. ..|-.+++|++++|+... .|+|.
T Consensus 137 ~~~~l~-~~~~~~y~~~~~~~~~~~~~~~E~e~~~v~~~~~~~~~~l~lq~eEV~~~~wvs~~el~~~~~~~~~~~~tpw 215 (247)
T PLN02552 137 PVDQFT-FLTRLHYKAADDVTHGPDGKWGEHELDYLLFIRPVRDVKVNPNPDEVADVKYVNREELKEMMRKESGLKLSPW 215 (247)
T ss_pred ccccce-eeeEEEEecccccccccCCCccceEEEEEEEEEecCCCcccCCHHHhheEEEEeHHHHHHHHhhcCCcccCHH
Confidence 12 33444443332 1222222322222221 235578899999999874 28899
Q ss_pred HHHHHHHHHh
Q psy9547 348 YQKVWKLFTK 357 (363)
Q Consensus 348 ~~~il~~l~~ 357 (363)
.+.+++.+..
T Consensus 216 ~~~~~~~~l~ 225 (247)
T PLN02552 216 FRLIVDNFLM 225 (247)
T ss_pred HHHHHHHHHH
Confidence 9988877654
No 100
>TIGR00052 nudix-type nucleoside diphosphatase, YffH/AdpP family.
Probab=98.59 E-value=1e-07 Score=84.17 Aligned_cols=102 Identities=5% Similarity=-0.071 Sum_probs=64.2
Q ss_pred EEEEEEEeC-CceEEEeecCCCcc-----ccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccce
Q psy9547 239 LLTVVVKTD-TNKYLIQKRPTTGL-----LSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHL 309 (363)
Q Consensus 239 v~~~ii~~~-~~~vLl~~R~~~~~-----~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~ 309 (363)
.++.++.+. +++||+.+.-..+. -.++||||+|+++. ++++|+ ||+.||+|+.+.... .+..+.-.....
T Consensus 46 ~v~vl~~~~~~~~vlLvrq~R~~~~~~~~~~~~lelPaG~ve~gE~~~~aA~REl~EEtG~~~~~~~-~~~~~~~~~g~~ 124 (185)
T TIGR00052 46 AAAVLLYDPKKDTVVLIEQFRIAAYVNGEEPWLLELSAGMVEKGESPEDVARREAIEEAGYQVKNLR-KLLSFYSSPGGV 124 (185)
T ss_pred eEEEEEEECCCCEEEEEECceeeeeecCCcceEEEECcEecCCCCCHHHHHHHHccccccceecceE-EEEEEEcCCCCC
Confidence 344444543 47899987643333 36899999999998 788888 999999999886554 444332222233
Q ss_pred EEEEEEEEEEecCCC------C--CCCCCCeeecccccCc
Q psy9547 310 KWNMDVYSGTAKEKT------I--PANKTYKLITETQMKK 341 (363)
Q Consensus 310 ~~~l~~~~~~~~~~~------~--~~~~~~~W~~~~el~~ 341 (363)
...+++|.++...+. . .+.....|++++++.+
T Consensus 125 ~~~~~~f~a~~~~~~~~~~~~~~~~E~ie~~~~~~~e~~~ 164 (185)
T TIGR00052 125 TELIHLFIAEVDDNQAAGIGGGADEEEIEVLHLVFSQALQ 164 (185)
T ss_pred cEEEEEEEEEEchhhcCCCCCCCCccceEEEEeCHHHHHH
Confidence 456788888754321 0 1122345777666643
No 101
>COG0494 MutT NTP pyrophosphohydrolases including oxidative damage repair enzymes [DNA replication, recombination, and repair / General function prediction only]
Probab=98.46 E-value=9.9e-07 Score=73.15 Aligned_cols=96 Identities=9% Similarity=0.089 Sum_probs=60.7
Q ss_pred ceEEEeecCCCccccccccccccCCCC--CchH-HH-HHHHhhCCCcccc--cccccccEEEEccc-----eEEEEEEEE
Q psy9547 249 NKYLIQKRPTTGLLSNFYMFLSFESDT--PYTS-QV-EFLSENLPFKVNL--NEKCLGNVKHVFSH-----LKWNMDVYS 317 (363)
Q Consensus 249 ~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~-al-rEl~Eelgl~v~~--~~~~l~~v~h~~sh-----~~~~l~~~~ 317 (363)
+++|+.+|...+ |.|+||||+++. ++.+ |+ ||+.||+|+.+.. .. .++.+...+.. ......++.
T Consensus 24 ~~vl~~~~~~~~---~~~~~PgG~ve~~e~~~~~aa~RE~~EEtGl~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 99 (161)
T COG0494 24 GEVLLAQRRDDG---GLWELPGGKVEPGEELPEEAAARELEEETGLRVKDERLE-LLGEFPPSPGDGSSVGGREHRVFFV 99 (161)
T ss_pred CEEeEEEccccC---CceecCCcccCCCCchHHHHHHHHHHHHhCCeeeeecce-eeeeccCcccCcccccceEEEEEEe
Confidence 689999997766 899999999997 4455 77 9999999998863 22 22222222211 122333333
Q ss_pred EEec--CC-C-------CCCCCCCeeecccccCcCCCChHH
Q psy9547 318 GTAK--EK-T-------IPANKTYKLITETQMKKYAFPVPY 348 (363)
Q Consensus 318 ~~~~--~~-~-------~~~~~~~~W~~~~el~~~~~~~a~ 348 (363)
.... .. . ..+...+.|+..+++.........
T Consensus 100 ~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~ 140 (161)
T COG0494 100 AEVDDSLAVAIEGLSAPSEELEDLEWVPLDELAALVLAEDQ 140 (161)
T ss_pred eeccccccccccccCCCcchhhceeeeeHHHcccccccchh
Confidence 3221 11 1 124577889999999887666543
No 102
>cd03670 ADPRase_NUDT9 ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose to AMP and ribose-5-P. Like other members of the Nudix hydrolase superfamily of enzymes, it is thought to require a divalent cation, such as Mg2+, for its activity. It also contains a 23-residue Nudix motif (GX5EX7REUXEEXGU, where U = I, L or V) which functions as a metal binding site/catalytic site. In addition to the Nudix motif, there are additional conserved amino acid residues, distal from the signature sequence, that correlate with substrate specificity. In humans, there are four distinct ADPRase activities, three putative cytosolic (ADPRase-I, -II, and -Mn) and a single mitochondrial enzyme (ADPRase-m). ADPRase-m is also known as NUDT9. It can be distinugished from the cytosolic ADPRase by a N-terminal target sequence unique to mitochondrial ADPRase. NUDT9 functions as a monomer.
Probab=98.41 E-value=2.3e-06 Score=75.18 Aligned_cols=102 Identities=8% Similarity=0.049 Sum_probs=63.2
Q ss_pred eEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCccccccc----------ccc-cEEEEc------cce
Q psy9547 250 KYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEK----------CLG-NVKHVF------SHL 309 (363)
Q Consensus 250 ~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~----------~l~-~v~h~~------sh~ 309 (363)
+||+.||+. .|.|.||||.+++ +.++|+ ||+.||+|+.+..... .+. .-.-+| +..
T Consensus 50 ~vLl~~r~~----~g~walPGG~v~~~E~~~~aa~Rel~EEt~l~l~~~~~~~~~l~~l~~~~~~~~~~vy~~~~~dpr~ 125 (186)
T cd03670 50 QFVAIKRPD----SGEWAIPGGMVDPGEKISATLKREFGEEALNSLQKSDEEKEEIKKLVELFSKDGVEVYKGYVDDPRN 125 (186)
T ss_pred EEEEEEeCC----CCcCcCCeeeccCCCCHHHHHHHHHHHHHcccccccchhhhhhcchhhhhcccccEEEeccccCCCC
Confidence 789999965 4899999999999 778888 9999999764321100 000 001122 111
Q ss_pred ----EEEEEEEEEEecCC------CC---CCCCCCeeecccccCcCCCChHHHHHHHHHHh
Q psy9547 310 ----KWNMDVYSGTAKEK------TI---PANKTYKLITETQMKKYAFPVPYQKVWKLFTK 357 (363)
Q Consensus 310 ----~~~l~~~~~~~~~~------~~---~~~~~~~W~~~~el~~~~~~~a~~~il~~l~~ 357 (363)
.+....|.+...++ .+ .+..+.+|++++++. ++...+..||+...+
T Consensus 126 td~~w~~Tva~~f~~~~~~~~~~~~~~a~dda~~a~W~~v~~l~--~L~~dH~~Il~~a~~ 184 (186)
T cd03670 126 TDNAWMETVAVNFHDEDGNDVENLPLEAGDDAGSVRWQDIDSKL--PLYANHSQFLKKVAE 184 (186)
T ss_pred CCcceEEEEEEEEEecCcccccccccCCCCchheeEEEEccccc--ccccCHHHHHHHHHH
Confidence 11223333322211 11 244677899999987 678888888887654
No 103
>PRK10729 nudF ADP-ribose pyrophosphatase NudF; Provisional
Probab=98.31 E-value=1.8e-06 Score=77.29 Aligned_cols=102 Identities=5% Similarity=-0.054 Sum_probs=64.9
Q ss_pred EEEEEEEeC-CceEEEeecCCCcc-----ccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccce
Q psy9547 239 LLTVVVKTD-TNKYLIQKRPTTGL-----LSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHL 309 (363)
Q Consensus 239 v~~~ii~~~-~~~vLl~~R~~~~~-----~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~ 309 (363)
.++.+..+. +++|++.+--.-+. -+-+||||+|.+++ ++++++ |||.||+|+.+.... .+.++.......
T Consensus 51 ~V~il~~~~~~~~vlLvrQyR~~~~~~~~~~~~lE~PAG~vd~gE~p~~aA~REL~EETGy~a~~~~-~l~~~~~spg~~ 129 (202)
T PRK10729 51 AAVLLPFDPVRDEVVLIEQIRIAAYDTSETPWLLEMVAGMIEEGESVEDVARREAIEEAGLIVGRTK-PVLSYLASPGGT 129 (202)
T ss_pred eEEEEEEECCCCEEEEEEeeecccccCCCCCeEEEccceEcCCCCCHHHHHHHHHHHHhCceeeEEE-EEEEEEcCCCcC
Confidence 444444544 36877755322222 23689999999987 788888 999999999876555 555555544445
Q ss_pred EEEEEEEEEEecC----C---CCCCC--CCCeeecccccCc
Q psy9547 310 KWNMDVYSGTAKE----K---TIPAN--KTYKLITETQMKK 341 (363)
Q Consensus 310 ~~~l~~~~~~~~~----~---~~~~~--~~~~W~~~~el~~ 341 (363)
.-.+++|.++... + ...+. -+..|++.+++..
T Consensus 130 ~e~~~~fla~~~~~~~~~~~~~~de~E~i~v~~~~~~e~~~ 170 (202)
T PRK10729 130 SERSSIMVGEVDATTASGIHGLADENEDIRVHVVSREQAYQ 170 (202)
T ss_pred ceEEEEEEEEEcchhcccCCCCCCCCCceEEEEEcHHHHHH
Confidence 5677888877421 1 11122 2357898888753
No 104
>PLN02791 Nudix hydrolase homolog
Probab=98.14 E-value=7.7e-06 Score=85.78 Aligned_cols=102 Identities=15% Similarity=0.077 Sum_probs=66.2
Q ss_pred EEEEEEEeC-CceEEEeecCCC-ccccccccc-cccCCCC--CchHHH-HHHHhhCCCcccccc-cccccEEEEc--cc-
Q psy9547 239 LLTVVVKTD-TNKYLIQKRPTT-GLLSNFYMF-LSFESDT--PYTSQV-EFLSENLPFKVNLNE-KCLGNVKHVF--SH- 308 (363)
Q Consensus 239 v~~~ii~~~-~~~vLl~~R~~~-~~~~glWEF-Pggk~e~--~~~~al-rEl~Eelgl~v~~~~-~~l~~v~h~~--sh- 308 (363)
.+-++|.+. ++++||+||... ..+.|+|++ |||.++. +..+++ ||++||+|+.+.... ..++.+.+.+ .+
T Consensus 34 AvhVwIfn~~~gelLLQkRS~~K~~~PG~WDiS~gGHv~aGEs~~eAA~REL~EELGI~l~~~~l~~l~~~~~~~~~~~g 113 (770)
T PLN02791 34 AVHVWIYSESTQELLLQRRADCKDSWPGQWDISSAGHISAGDTSLLSAQRELEEELGIILPKDAFELLFVFLQECVINDG 113 (770)
T ss_pred EEEEEEEECCCCeEEEEEecCCCCCCCCcccCcCCCCCCCCCCHHHHHHHHHHHHhCCCCChhheeeeeeEEEEeeccCC
Confidence 333445554 589999999765 579999999 7999988 666778 999999999763221 0233322211 11
Q ss_pred ---eEEEEEEEEEEecCC--------CCCCCCCCeeecccccC
Q psy9547 309 ---LKWNMDVYSGTAKEK--------TIPANKTYKLITETQMK 340 (363)
Q Consensus 309 ---~~~~l~~~~~~~~~~--------~~~~~~~~~W~~~~el~ 340 (363)
-+...++|.+..... +..|.++++|++.+|+.
T Consensus 114 ~~~e~E~~~VYlv~~~~~~p~~~~~lq~eEV~~v~wvsl~El~ 156 (770)
T PLN02791 114 KFINNEYNDVYLVTTLDPIPLEAFTLQESEVSAVKYMSIEEYK 156 (770)
T ss_pred CcceeeEEEEEEEEECCCCCcccCCCChhhhheeEEEcHHHHH
Confidence 112345676653221 12366889999999985
No 105
>PRK15009 GDP-mannose pyrophosphatase NudK; Provisional
Probab=98.09 E-value=1.1e-05 Score=71.53 Aligned_cols=102 Identities=6% Similarity=-0.090 Sum_probs=62.2
Q ss_pred EEEEEEEeC-CceEEEeecCCCcc------ccccccccccCCCC-CchHHH-HHHHhhCCCcccccccccccEEEEccce
Q psy9547 239 LLTVVVKTD-TNKYLIQKRPTTGL------LSNFYMFLSFESDT-PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHL 309 (363)
Q Consensus 239 v~~~ii~~~-~~~vLl~~R~~~~~------~~glWEFPggk~e~-~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~ 309 (363)
.+++++.+. ++++++.+.-..+. -+=.||||+|.++. ++++++ |||.||+|+.+.... .++++.-.-...
T Consensus 47 ~v~Vl~~~~~~~~vvLvrQyR~~v~~~~~~~~~~lElPAG~vd~~~p~~aA~REL~EETGy~a~~~~-~l~~~~~spG~s 125 (191)
T PRK15009 47 GATILLYNAKKKTVVLIRQFRVATWVNGNESGQLIETCAGLLDNDEPEVCIRKEAIEETGYEVGEVR-KLFELYMSPGGV 125 (191)
T ss_pred EEEEEEEECCCCEEEEEEcccccccccCCCCceEEEEeccccCCCCHHHHHHHHHHHhhCCccceEE-EeeEEEcCCccc
Confidence 344444553 46888766433333 35679999999988 778888 999999999876554 444332111222
Q ss_pred EEEEEEEEEEecCC------CCC--CCCCCeeecccccCc
Q psy9547 310 KWNMDVYSGTAKEK------TIP--ANKTYKLITETQMKK 341 (363)
Q Consensus 310 ~~~l~~~~~~~~~~------~~~--~~~~~~W~~~~el~~ 341 (363)
.-.+++|.++.... ... |.-+..|++.+|+.+
T Consensus 126 ~e~~~lf~a~~~~~~~~~~~~~de~E~iev~~~~~~e~~~ 165 (191)
T PRK15009 126 TELIHFFIAEYSDSQRANAGGGVEDEDIEVLELPFSQALE 165 (191)
T ss_pred CcEEEEEEEEECchhcccCCCCCCCceEEEEEEcHHHHHH
Confidence 34567777775311 111 223457888888754
No 106
>PF10576 EndIII_4Fe-2S: Iron-sulfur binding domain of endonuclease III; InterPro: IPR003651 Endonuclease III (4.2.99.18 from EC) is a DNA repair enzyme which removes a number of damaged pyrimidines from DNA via its glycosylase activity and also cleaves the phosphodiester backbone at apurinic / apyrimidinic sites via a beta-elimination mechanism [, ]. The structurally related DNA glycosylase MutY recognises and excises the mutational intermediate 8-oxoguanine-adenine mispair []. The 3-D structures of Escherichia coli endonuclease III [] and catalytic domain of MutY [] have been determined. The structures contain two all-alpha domains: a sequence-continuous, six-helix domain (residues 22-132) and a Greek-key, four-helix domain formed by one N-terminal and three C-terminal helices (residues 1-21 and 133-211) together with the [Fe4S4] cluster. The cluster is bound entirely within the C-terminal loop by four cysteine residues with a ligation pattern Cys-(Xaa)6-Cys-(Xaa)2-Cys-(Xaa)5-Cys which is distinct from all other known Fe4S4 proteins. This structural motif is referred to as a [Fe4S4] cluster loop (FCL) []. Two DNA-binding motifs have been proposed, one at either end of the interdomain groove: the helix-hairpin-helix (HhH) and FCL motifs. The primary role of the iron-sulphur cluster appears to involve positioning conserved basic residues for interaction with the DNA phosphate backbone by forming the loop of the FCL motif [, ]. The iron-sulphur cluster loop (FCL) is also found in DNA-(apurinic or apyrimidinic site) lyase, a subfamily of endonuclease III. The enzyme has both apurinic and apyrimidinic endonuclease activity and a DNA N-glycosylase activity. It cuts damaged DNA at cytosines, thymines and guanines, and acts on the damaged strand 5' of the damaged site. The enzyme binds a 4Fe-4S cluster which is not important for the catalytic activity, but is probably involved in the alignment of the enzyme along the DNA strand.; GO: 0004519 endonuclease activity, 0051539 4 iron, 4 sulfur cluster binding; PDB: 1VRL_A 1RRQ_A 3G0Q_A 3FSQ_A 1RRS_A 3FSP_A 2ABK_A 1KG7_A 1KG2_A 1MUN_A ....
Probab=98.08 E-value=9.5e-07 Score=46.13 Aligned_cols=17 Identities=53% Similarity=1.546 Sum_probs=11.6
Q ss_pred CcCCCCCCCCCCCcccC
Q psy9547 198 CLFKNPKCKECPLSRFC 214 (363)
Q Consensus 198 C~~~~P~C~~Cpl~~~C 214 (363)
|++++|+|+.|||++.|
T Consensus 1 Ctar~P~C~~Cpl~~~C 17 (17)
T PF10576_consen 1 CTARKPKCEECPLADYC 17 (17)
T ss_dssp S-SSS--GGG-TTGGG-
T ss_pred CcCCCCccccCCCcccC
Confidence 89999999999999988
No 107
>KOG3069|consensus
Probab=97.93 E-value=1.9e-05 Score=70.39 Aligned_cols=99 Identities=15% Similarity=0.114 Sum_probs=68.4
Q ss_pred ceEEEeecCCC-ccccccccccccCCCC---CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEEEEEEEecCC
Q psy9547 249 NKYLIQKRPTT-GLLSNFYMFLSFESDT---PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMDVYSGTAKEK 323 (363)
Q Consensus 249 ~~vLl~~R~~~-~~~~glWEFPggk~e~---~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~~~~~~~~~ 323 (363)
-+||++||... ..++|--.||||+.+. +..+++ ||-.||.|+...... .++...--+.-..+.+..+.+-....
T Consensus 58 l~vLltkRSr~LrshsGev~fPGG~~d~~D~s~~~tAlREt~EEIGl~~~~~~-~~g~l~~~~~r~~~~v~p~v~~l~~~ 136 (246)
T KOG3069|consen 58 LSVLLTKRSRTLRSHSGEVCFPGGRRDPHDKSDIQTALRETEEEIGLDPELVD-VLGALPPFVLRSGWSVFPVVGFLSDK 136 (246)
T ss_pred eEEEEEeccccccccCCceeCCCCcCCccccchHHHHHHHHHHHhCCCHHHhh-hhhhccceeeccCcccceeEEEEecc
Confidence 37999999765 6899999999999998 556667 999999999876554 55555544444556666665544331
Q ss_pred --------CCCCCCCCeeecccccCcCCCChHH
Q psy9547 324 --------TIPANKTYKLITETQMKKYAFPVPY 348 (363)
Q Consensus 324 --------~~~~~~~~~W~~~~el~~~~~~~a~ 348 (363)
+..|.+.+.||.++++-.-.-..++
T Consensus 137 ~~l~~~~ln~gEv~~~F~VPL~~ll~~~~~~~~ 169 (246)
T KOG3069|consen 137 KILPSLRLNSGEVESAFWVPLTDLLLPKHQTAF 169 (246)
T ss_pred cccccccCCchheeeeeeeeHHHHhhhhcchHH
Confidence 1124467789998888654444443
No 108
>COG2816 NPY1 NTP pyrophosphohydrolases containing a Zn-finger, probably nucleic-acid-binding [DNA replication, recombination, and repair]
Probab=97.81 E-value=2.2e-05 Score=72.45 Aligned_cols=100 Identities=15% Similarity=0.168 Sum_probs=77.1
Q ss_pred EEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEEE
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMDV 315 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~ 315 (363)
++..+|+++ +++||.+++... .|+|..=-|-+|+ |.|+|. ||+.||.|+.|.-.. ++++--+.|+|-- |.-
T Consensus 146 ~vIv~v~~~-~~ilLa~~~~h~--~g~yS~LAGFVE~GETlE~AV~REv~EE~Gi~V~~vr-Y~~SQPWPfP~SL--Mig 219 (279)
T COG2816 146 CVIVAVIRG-DEILLARHPRHF--PGMYSLLAGFVEPGETLEQAVAREVFEEVGIKVKNVR-YVGSQPWPFPHSL--MLG 219 (279)
T ss_pred eEEEEEecC-CceeecCCCCCC--CcceeeeeecccCCccHHHHHHHHHHHhhCeEEeeee-EEeccCCCCchhh--hhh
Confidence 344445555 679999986543 9999999999999 889999 999999999998877 8888888888863 455
Q ss_pred EEEEecCCCC----CCCCCCeeecccc-cCcCCC
Q psy9547 316 YSGTAKEKTI----PANKTYKLITETQ-MKKYAF 344 (363)
Q Consensus 316 ~~~~~~~~~~----~~~~~~~W~~~~e-l~~~~~ 344 (363)
|.....+|.. .|-.+.+|++.+| |+.++-
T Consensus 220 f~aey~sgeI~~d~~Eleda~WFs~~evl~~L~~ 253 (279)
T COG2816 220 FMAEYDSGEITPDEGELEDARWFSRDEVLPALPP 253 (279)
T ss_pred heeeeccccccCCcchhhhccccCHhHHhhhcCC
Confidence 5566665532 3557789999999 777665
No 109
>PLN03143 nudix hydrolase; Provisional
Probab=97.75 E-value=7.5e-05 Score=70.14 Aligned_cols=53 Identities=6% Similarity=0.056 Sum_probs=37.4
Q ss_pred EEEEEeCCce--EEEeecCCCccccccccccccCCCC---CchHHH-HHHHhhCCCccc
Q psy9547 241 TVVVKTDTNK--YLIQKRPTTGLLSNFYMFLSFESDT---PYTSQV-EFLSENLPFKVN 293 (363)
Q Consensus 241 ~~ii~~~~~~--vLl~~R~~~~~~~glWEFPggk~e~---~~~~al-rEl~Eelgl~v~ 293 (363)
+.++.+.+++ +++.++..-+...-.||||+|.++. ++.+++ ||++||+|+.+.
T Consensus 133 VL~~l~~~ge~~VlLVrQ~R~pvg~~~lE~PAG~lD~~~edp~~aA~REL~EETG~~~~ 191 (291)
T PLN03143 133 VLILLESEGETYAVLTEQVRVPVGKFVLELPAGMLDDDKGDFVGTAVREVEEETGIKLK 191 (291)
T ss_pred EEEEEeCCCCEEEEEEEeEecCCCcEEEEecccccCCCCCCHHHHHHHHHHHHHCCccc
Confidence 3333343344 7777775433444589999999985 567777 999999999764
No 110
>smart00525 FES FES domain. iron-sulpphur binding domain in DNA-(apurinic or apyrimidinic site) lyase (subfamily of ENDO3)
Probab=97.53 E-value=3.9e-05 Score=45.06 Aligned_cols=22 Identities=45% Similarity=1.312 Sum_probs=20.1
Q ss_pred hCcCCCCCCCCCCCcccChhhh
Q psy9547 197 VCLFKNPKCKECPLSRFCSAYK 218 (363)
Q Consensus 197 ~C~~~~P~C~~Cpl~~~C~~~~ 218 (363)
+|++++|+|+.||+++.|.++.
T Consensus 1 vC~ar~P~C~~Cpl~~~C~~~~ 22 (26)
T smart00525 1 ICTARKPRCDECPLKDLCPAYX 22 (26)
T ss_pred CccCCCCCCCCCcCHHHhhhhh
Confidence 6999999999999999999754
No 111
>KOG3084|consensus
Probab=97.11 E-value=0.00076 Score=62.82 Aligned_cols=57 Identities=11% Similarity=0.076 Sum_probs=45.7
Q ss_pred EEEEEEEeCCceEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccc
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNE 296 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~ 296 (363)
+|..+|.+.+|+.++-.| ..+.-.|||.-+-|-.|+ +.+||. ||..||.|++|....
T Consensus 189 vVIm~li~~d~~~~LL~R-~~r~~~gl~t~lAGFlEpGES~eeav~REtwEEtGi~V~~I~ 248 (345)
T KOG3084|consen 189 VVIMLLIDHDGKHALLGR-QKRYPPGLWTCLAGFLEPGESIEEAVRRETWEETGIEVEVIS 248 (345)
T ss_pred eEEEEEEcCCCCEeeeec-ccCCCCchhhhhhccCCccccHHHHHHHHHHHHhCceeeeEe
Confidence 555556666676666666 567788999999999999 888999 999999999987644
No 112
>COG1443 Idi Isopentenyldiphosphate isomerase [Lipid metabolism]
Probab=96.97 E-value=0.0013 Score=56.21 Aligned_cols=93 Identities=8% Similarity=0.191 Sum_probs=58.5
Q ss_pred EEEEeCCceEEEeecCC-Ccccccccc-----ccccCCCCCchHHH-HHHHhhCCCcccccccccccEEEEccceEE---
Q psy9547 242 VVVKTDTNKYLIQKRPT-TGLLSNFYM-----FLSFESDTPYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKW--- 311 (363)
Q Consensus 242 ~ii~~~~~~vLl~~R~~-~~~~~glWE-----FPggk~e~~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~--- 311 (363)
..|.|.+|++||+||.. +..|.|.|. -|+ ..| +.+.|+ |-+..|||+++... ...+.++++++
T Consensus 38 ~~lFne~g~LLltrRA~~K~twP~vWTNSvCsHP~-~~e-s~~~A~~rRl~~ELGie~~~~-----d~~~il~rf~YrA~ 110 (185)
T COG1443 38 SFLFNERGQLLLTRRALSKKTWPGVWTNSVCSHPL-PGE-SNEDAARRRLAYELGIEPDQY-----DKLEILPRFRYRAA 110 (185)
T ss_pred eeEECCCCceeeehhhhhcccCcccccccccCCCc-CCC-chHHHHHHHHHHHhCCCCccc-----CccccccceEEecc
Confidence 45678889999999965 468999996 233 111 345777 89999999987632 22233444432
Q ss_pred --------EE-EEEEEEecCC---CCCCCCCCeeecccccCc
Q psy9547 312 --------NM-DVYSGTAKEK---TIPANKTYKLITETQMKK 341 (363)
Q Consensus 312 --------~l-~~~~~~~~~~---~~~~~~~~~W~~~~el~~ 341 (363)
.| .++.++..+. .+.|-.+++|++++++..
T Consensus 111 ~~~~~~E~Eic~V~~~~~~~~~~~npdEV~~~~wv~~e~l~~ 152 (185)
T COG1443 111 DPDGIVENEICPVLAARLDSALDPNPDEVMDYRWVSPEDLKE 152 (185)
T ss_pred CCCCcceeeeeeEEEEeecCCCCCChHHhhheeccCHHHHHH
Confidence 22 1222333321 223558899999999865
No 113
>KOG2839|consensus
Probab=96.91 E-value=0.002 Score=53.48 Aligned_cols=99 Identities=9% Similarity=0.019 Sum_probs=58.5
Q ss_pred eEEEEEEEeCC-c--eEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEE--Eccce
Q psy9547 238 FLLTVVVKTDT-N--KYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKH--VFSHL 309 (363)
Q Consensus 238 ~v~~~ii~~~~-~--~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h--~~sh~ 309 (363)
.++.+|..+-+ + +||+.+-+.. +..|-||+|-+|+ +..+|+ ||..||-|+.=.... .++.+.- .++|.
T Consensus 10 ~vagCi~~r~~~~~ieVLlvsSs~~---~~~wi~PKGGwE~dE~~~eAA~REt~EEAGv~G~l~~-~~~g~~~~~~~~~~ 85 (145)
T KOG2839|consen 10 LVAGCICYRSDKEKIEVLLVSSSKK---PHRWIVPKGGWEPDESVEEAALRETWEEAGVKGKLGR-LLGGFEDFLSKKHR 85 (145)
T ss_pred EEEEeeeeeecCcceEEEEEecCCC---CCCccCCCCCCCCCCCHHHHHHHHHHHHhCceeeeec-cccchhhccChhhc
Confidence 34545544432 3 7999886543 3679999999999 777788 999999998755544 3333331 22333
Q ss_pred EE-EEEEEEEEecCC---CC---CCCCCCeeecccccC
Q psy9547 310 KW-NMDVYSGTAKEK---TI---PANKTYKLITETQMK 340 (363)
Q Consensus 310 ~~-~l~~~~~~~~~~---~~---~~~~~~~W~~~~el~ 340 (363)
.. .=..|...+... .+ .++.+.+|++++|--
T Consensus 86 ~~~k~~~~~l~v~e~le~wp~~~~~~r~r~W~~ledA~ 123 (145)
T KOG2839|consen 86 TKPKGVMYVLAVTEELEDWPESEHEFREREWLKLEDAI 123 (145)
T ss_pred ccccceeehhhhhhhcccChhhhcccceeEEeeHHHHH
Confidence 22 112233322221 11 135778899988754
No 114
>COG4119 Predicted NTP pyrophosphohydrolase [DNA replication, recombination, and repair / General function prediction only]
Probab=96.61 E-value=0.0096 Score=48.16 Aligned_cols=108 Identities=9% Similarity=-0.007 Sum_probs=68.9
Q ss_pred eEEEeecCCCcccc----ccccccccCCCC--CchHHH-HHHHhhCCCccccccc-------ccccEEEEccceE-----
Q psy9547 250 KYLIQKRPTTGLLS----NFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEK-------CLGNVKHVFSHLK----- 310 (363)
Q Consensus 250 ~vLl~~R~~~~~~~----glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~-------~l~~v~h~~sh~~----- 310 (363)
.||++. |.+.+|+ |-|..|-|.... .+..|+ ||+.||+|+.|.-... .=+.+.-.|+|..
T Consensus 19 ~VLLvH-PGGPFWa~kD~GAWSIPKGey~~gEdp~~AArREf~EE~Gi~vdGP~~~lG~~kQ~GGKvVta~~veae~Dva 97 (161)
T COG4119 19 DVLLVH-PGGPFWAGKDDGAWSIPKGEYTGGEDPWLAARREFSEEIGICVDGPRIDLGSLKQSGGKVVTAFGVEAELDVA 97 (161)
T ss_pred EEEEec-CCCCccccCCCCcccccccccCCCcCHHHHHHHHhhhhhceeecCchhhhhhhccCCCcEEEEEeeeeeeehh
Confidence 566664 5667775 679999998877 455566 9999999999853221 2233444454432
Q ss_pred -EEEEEEEEEecC--CCC---CCCCCCeeecccccCcCCCChHHHHHHHHHHhhh
Q psy9547 311 -WNMDVYSGTAKE--KTI---PANKTYKLITETQMKKYAFPVPYQKVWKLFTKSK 359 (363)
Q Consensus 311 -~~l~~~~~~~~~--~~~---~~~~~~~W~~~~el~~~~~~~a~~~il~~l~~~~ 359 (363)
++-..|..++.. |.. .+-+...|+++.+.. ..+....+++|..|.++.
T Consensus 98 ~~rSntFe~eWPprSG~M~~FPEVDRagWF~l~eAr-~Kil~gQRpfldrL~a~~ 151 (161)
T COG4119 98 DARSNTFELEWPPRSGKMRKFPEVDRAGWFPLAEAR-TKILKGQRPFLDRLMAHA 151 (161)
T ss_pred hhhcceeeeecCCCCCccccCcccccccceecHHHH-hHHhhccchHHHHHHHHh
Confidence 222344455543 222 344666799888864 367778888888887764
No 115
>smart00278 HhH1 Helix-hairpin-helix DNA-binding motif class 1.
Probab=95.78 E-value=0.0067 Score=35.43 Aligned_cols=20 Identities=30% Similarity=0.506 Sum_probs=17.5
Q ss_pred HHhcCCCCcHHHHHHHHHHh
Q psy9547 116 QLLSIKGVGDYTAGALASIC 135 (363)
Q Consensus 116 ~L~~lpGIG~~tA~~il~~~ 135 (363)
.|+++||||+++|+.++.+.
T Consensus 2 ~L~~i~GiG~k~A~~il~~~ 21 (26)
T smart00278 2 ELLKVPGIGPKTAEKILEAX 21 (26)
T ss_pred hhhhCCCCCHHHHHHHHHhc
Confidence 57899999999999998753
No 116
>KOG0648|consensus
Probab=95.54 E-value=0.0044 Score=57.77 Aligned_cols=103 Identities=6% Similarity=0.018 Sum_probs=61.0
Q ss_pred EEEEEEEeCCceEEEeecC-CCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccce----E
Q psy9547 239 LLTVVVKTDTNKYLIQKRP-TTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHL----K 310 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~-~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~----~ 310 (363)
.+.+.++|.+++||+.|-. ..--..|.|-||+|.+++ ..-..+ ||++||+|++....+ +...+|..+.. .
T Consensus 117 gvg~~V~n~~~eVlVv~e~d~~~~~~~~wK~ptG~v~~~e~i~~gavrEvkeetgid~ef~e--Vla~r~~H~~~~~~~k 194 (295)
T KOG0648|consen 117 GVGAFVLNKKKEVLVVQEKDGAVKIRGGWKLPTGRVEEGEDIWHGAVREVKEETGIDTEFVE--VLAFRRAHNATFGLIK 194 (295)
T ss_pred eeeeeEecCCceeEEEEecccceeecccccccceEecccccchhhhhhhhHHHhCcchhhhh--HHHHHhhhcchhhccc
Confidence 4445666775799998742 234568999999999988 455666 999999998543322 11222222211 1
Q ss_pred EEEEEEEEEecCC--CC----CCCCCCeeecccccCcCCC
Q psy9547 311 WNMDVYSGTAKEK--TI----PANKTYKLITETQMKKYAF 344 (363)
Q Consensus 311 ~~l~~~~~~~~~~--~~----~~~~~~~W~~~~el~~~~~ 344 (363)
-+ .++.|....- .. .+...+.|+..++..+.++
T Consensus 195 sd-~f~~c~L~p~s~~i~~~~~ei~~~~Wmp~~e~v~qp~ 233 (295)
T KOG0648|consen 195 SD-MFFTCELRPRSLDITKCKREIEAAAWMPIEEYVSQPL 233 (295)
T ss_pred cc-ceeEEEeeccccccchhHHHHHHHhcccHHHhhcccc
Confidence 11 1233443221 11 1224458999998887776
No 117
>PF14716 HHH_8: Helix-hairpin-helix domain; PDB: 2W9M_A 1HUZ_A 1HUO_A 2BPG_B 2BPF_A 1DK3_A 1BPE_A 1BNO_A 1BNP_A 1DK2_A ....
Probab=94.88 E-value=0.16 Score=36.97 Aligned_cols=55 Identities=16% Similarity=0.292 Sum_probs=37.1
Q ss_pred HHHHHHhcc-CchHHHHHHHHHHHHhhHHhCCCCcccHHH-HhcCCCCcHHHHHHHHHH
Q psy9547 78 NVLKMWEGL-GYYSRVRNFQAGCRQVIEQFGGEVPRDKKQ-LLSIKGVGDYTAGALASI 134 (363)
Q Consensus 78 el~~~~~~l-G~~~rA~~l~~~a~~i~~~~~g~~p~~~~~-L~~lpGIG~~tA~~il~~ 134 (363)
++....+-. |-..|++...+++..|.. +...+ .+.++ +.+|||||+.+|..|--|
T Consensus 10 ~la~~~~~~~~~~~r~~aY~~Aa~~i~~-l~~~i-~~~~~~~~~l~gIG~~ia~kI~E~ 66 (68)
T PF14716_consen 10 ELADLYELQGGDPFRARAYRRAAAAIKA-LPYPI-TSGEEDLKKLPGIGKSIAKKIDEI 66 (68)
T ss_dssp HHHHHHHHTSTSHHHHHHHHHHHHHHHH-SSS-H-HSHHHHHCTSTTTTHHHHHHHHHH
T ss_pred HHHHHHHHhhCCcHHHHHHHHHHHHHHh-CCHhH-hhHHHHHhhCCCCCHHHHHHHHHH
Confidence 344444444 333488889998988876 33333 34565 999999999999988644
No 118
>PF03352 Adenine_glyco: Methyladenine glycosylase; InterPro: IPR005019 This family of methyladenine glycosylases includes DNA-3-methyladenine glycosylase I (3.2.2.20 from EC) which acts as a base excision repair enzyme by severing the glycosylic bond of numerous damaged bases. The enzyme is constitutively expressed and is specific for the alkylated 3-methyladenine DNA.; GO: 0008725 DNA-3-methyladenine glycosylase I activity, 0006284 base-excision repair; PDB: 2OFI_A 2OFK_A 2JG6_A 4AIA_E 4AI5_C 4AI4_A 1LMZ_A 1P7M_A 1NKU_A.
Probab=94.57 E-value=0.28 Score=42.83 Aligned_cols=71 Identities=14% Similarity=0.230 Sum_probs=57.3
Q ss_pred CChHHHHHHHHHhhhcchhhHHHHHHHHHHhCC--CHHHHhcCCHHHHHHHHhccCch--H-HHHHHHHHHHHhhH
Q psy9547 34 NNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYP--TIKDFAFDTEDNVLKMWEGLGYY--S-RVRNFQAGCRQVIE 104 (363)
Q Consensus 34 ~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~p--t~~~la~a~~~el~~~~~~lG~~--~-rA~~l~~~a~~i~~ 104 (363)
+.-|+.|+-+++-.-.++.++.+-...|.+.|- +|+.+|..++++|.+++..-|.- + |.+.++.=|+.+.+
T Consensus 24 ~~LFe~L~Le~fQaGLsW~~Il~Kr~~~r~aF~~Fd~~~vA~~~e~~ie~l~~d~~iIRnr~KI~Avi~NA~~~l~ 99 (179)
T PF03352_consen 24 RKLFEMLTLEGFQAGLSWSTILKKREAFREAFAGFDPEKVAKMDEEDIERLMQDPGIIRNRRKIRAVINNARAILK 99 (179)
T ss_dssp HHHHHHHHHHHHTTTS-HHHHHHTHHHHHHHTGGGHHHHHHT--HHHHHHHTTSTTSS--HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHCCCHHHHHcCCHHHHHHHhcCcchhhhHHHHHHHHHHHHHHHH
Confidence 456999999999999999999999999999985 79999999999999999988874 2 66777776766654
No 119
>KOG3041|consensus
Probab=94.57 E-value=0.16 Score=44.46 Aligned_cols=81 Identities=14% Similarity=0.108 Sum_probs=50.9
Q ss_pred EEEEEEEeCCce--EEEee--cCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcccccc-cccccEEEEccceE
Q psy9547 239 LLTVVVKTDTNK--YLIQK--RPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNE-KCLGNVKHVFSHLK 310 (363)
Q Consensus 239 v~~~ii~~~~~~--vLl~~--R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~-~~l~~v~h~~sh~~ 310 (363)
|++.+|...+|+ +++++ ||+-| +=--|||-|-++. +.+.|+ |||+||+|..=.+.. +...-..--|++.+
T Consensus 76 VaIl~il~~dG~~~ivL~kQfRpP~G--k~ciElPAGLiD~ge~~~~aAiREl~EEtGy~gkv~~~s~~~f~DPGltn~~ 153 (225)
T KOG3041|consen 76 VAILAILESDGKPYIVLVKQFRPPTG--KICIELPAGLIDDGEDFEGAAIRELEEETGYKGKVDMVSPTVFLDPGLTNCN 153 (225)
T ss_pred EEEEEEEecCCcEEEEEEEeecCCCC--cEEEEcccccccCCCchHHHHHHHHHHHhCccceeeeccccEEcCCCCCCCc
Confidence 455555555564 45544 44433 2236999999988 777777 999999998622221 02222333678888
Q ss_pred EEEEEEEEEec
Q psy9547 311 WNMDVYSGTAK 321 (363)
Q Consensus 311 ~~l~~~~~~~~ 321 (363)
+.|.++.++..
T Consensus 154 ~~iv~v~idg~ 164 (225)
T KOG3041|consen 154 LCIVVVDIDGD 164 (225)
T ss_pred eEEEEEEecCC
Confidence 88888776643
No 120
>PF12826 HHH_2: Helix-hairpin-helix motif; PDB: 1X2I_B 1DGS_A 1V9P_B.
Probab=94.55 E-value=0.048 Score=39.29 Aligned_cols=36 Identities=19% Similarity=0.386 Sum_probs=23.4
Q ss_pred HHHhhHHhCC---CCcccHHHHhcCCCCcHHHHHHHHHH
Q psy9547 99 CRQVIEQFGG---EVPRDKKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 99 a~~i~~~~~g---~~p~~~~~L~~lpGIG~~tA~~il~~ 134 (363)
|+.+.+.|+. ..-.+.++|.++||||+.+|+.|..|
T Consensus 16 ak~L~~~f~sl~~l~~a~~e~L~~i~gIG~~~A~si~~f 54 (64)
T PF12826_consen 16 AKLLAKHFGSLEALMNASVEELSAIPGIGPKIAQSIYEF 54 (64)
T ss_dssp HHHHHHCCSCHHHHCC--HHHHCTSTT--HHHHHHHHHH
T ss_pred HHHHHHHcCCHHHHHHcCHHHHhccCCcCHHHHHHHHHH
Confidence 3444555543 12357899999999999999999876
No 121
>PRK10353 3-methyl-adenine DNA glycosylase I; Provisional
Probab=93.79 E-value=1.1 Score=39.54 Aligned_cols=74 Identities=15% Similarity=0.163 Sum_probs=61.7
Q ss_pred CCCCChHHHHHHHHHhhhcchhhHHHHHHHHHHhCC--CHHHHhcCCHHHHHHHHhccCchH---HHHHHHHHHHHhhH
Q psy9547 31 RESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYP--TIKDFAFDTEDNVLKMWEGLGYYS---RVRNFQAGCRQVIE 104 (363)
Q Consensus 31 ~~~~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~p--t~~~la~a~~~el~~~~~~lG~~~---rA~~l~~~a~~i~~ 104 (363)
...+.-||.|+-+.+-.-.++.++.+-.+.|.+.|- +++.+|..++++|+.++..-|.-+ |.+.+++=|+.+.+
T Consensus 26 ~dd~~LFE~L~Le~~QAGLSW~tIL~Kre~fr~aF~~Fd~~~VA~~~e~die~Ll~d~~IIRnr~KI~Avi~NA~~~l~ 104 (187)
T PRK10353 26 TDSKKLFEMICLEGQQAGLSWITVLKKRENYRACFHQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNARAYLQ 104 (187)
T ss_pred CCcHHHHHHHHHHHhcccccHHHHHHHHHHHHHHHcCCCHHHHhCCCHHHHHHHhcCchhHHhHHHHHHHHHHHHHHHH
Confidence 344668999999999999999999999999999874 899999999999999999888753 55567666666554
No 122
>PLN02839 nudix hydrolase
Probab=93.41 E-value=0.32 Score=47.11 Aligned_cols=112 Identities=12% Similarity=0.072 Sum_probs=67.8
Q ss_pred CCceEEEeecCCC-ccccccccc-cccCCCC--CchHHH-HHHHhhCCCcccccc--cccccEEEEccce----EEEEEE
Q psy9547 247 DTNKYLIQKRPTT-GLLSNFYMF-LSFESDT--PYTSQV-EFLSENLPFKVNLNE--KCLGNVKHVFSHL----KWNMDV 315 (363)
Q Consensus 247 ~~~~vLl~~R~~~-~~~~glWEF-Pggk~e~--~~~~al-rEl~Eelgl~v~~~~--~~l~~v~h~~sh~----~~~l~~ 315 (363)
++.++++.||... .-+.|+|.= -+|-+.. ++.+++ ||..||.|+...... ...+.+.+.+... .=.+.+
T Consensus 216 g~~~lWV~RRS~tK~t~PGmLDn~VAGGi~aGesp~etliREa~EEAgLp~~l~~~~~~~G~VsY~~~~~~g~~~evly~ 295 (372)
T PLN02839 216 GQKFLWIGKRSLSKSTYPGMLDHLVAGGLPHGISCGENLVKECEEEAGISKAIADRAIAVGAVSYMDIDQYCFKRDVLFC 295 (372)
T ss_pred CCeEEEeeccCCCCCCCCChhhhccccCccCCCCHHHHHHHHHHHHcCCCHHHHhcceEeEEEEEEEEcCCccccCEEEE
Confidence 4457999999775 457888852 2343433 788999 999999999754221 1566666654211 123456
Q ss_pred EEEEecCC-C--CC--CCCCCeeecccccC----cC-CCCh-HHHHHHHHHHhh
Q psy9547 316 YSGTAKEK-T--IP--ANKTYKLITETQMK----KY-AFPV-PYQKVWKLFTKS 358 (363)
Q Consensus 316 ~~~~~~~~-~--~~--~~~~~~W~~~~el~----~~-~~~~-a~~~il~~l~~~ 358 (363)
|.+.+..+ . +. |-+++.+++++|+. +- .|.+ ...-+++.|..+
T Consensus 296 YDLeLP~df~P~~qDGEVe~F~Lm~v~EV~~~l~~~~~fKpn~aLViiDFLiRh 349 (372)
T PLN02839 296 YDLELPQDFVPKNQDGEVESFKLIPVAQVANVIRKTSFFKANCSLVIIDFLFRH 349 (372)
T ss_pred eeeecCCccccCCCccceeEEEEecHHHHHHHHHcCCCCCcccHHHHHHHHHHc
Confidence 77776653 1 12 34778899988885 22 2422 444455555544
No 123
>TIGR00426 competence protein ComEA helix-hairpin-helix repeat region. Members of the subfamily recognized by this model include competence protein ComEA and closely related proteins from a number of species that exhibit competence for transformation by exongenous DNA, including Streptococcus pneumoniae, Bacillus subtilis, Neisseria meningitidis, and Haemophilus influenzae. This model represents a region of two tandem copies of a helix-hairpin-helix domain (pfam00633), each about 30 residues in length. Limited sequence similarity can be found among some members of this family N-terminal to the region covered by this model.
Probab=92.98 E-value=0.22 Score=36.21 Aligned_cols=57 Identities=12% Similarity=0.272 Sum_probs=39.5
Q ss_pred HHHhcCCHHHHHHHHhccCchHHHHHHHHHHHHhhHHh--CCCCcccHHHHhcCCCCcHHHHHHHHHH
Q psy9547 69 KDFAFDTEDNVLKMWEGLGYYSRVRNFQAGCRQVIEQF--GGEVPRDKKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 69 ~~la~a~~~el~~~~~~lG~~~rA~~l~~~a~~i~~~~--~g~~p~~~~~L~~lpGIG~~tA~~il~~ 134 (363)
-+|-.|+.++|..++.++|.. +|+.| ++.. .|.+ .+.++|.+++|||.++++.|..+
T Consensus 8 invNta~~~~L~~~ipgig~~-~a~~I-------l~~R~~~g~~-~s~~dL~~v~gi~~~~~~~i~~~ 66 (69)
T TIGR00426 8 VNINTATAEELQRAMNGVGLK-KAEAI-------VSYREEYGPF-KTVEDLKQVPGIGNSLVEKNLAV 66 (69)
T ss_pred eECcCCCHHHHHhHCCCCCHH-HHHHH-------HHHHHHcCCc-CCHHHHHcCCCCCHHHHHHHHhh
Confidence 345567888888877777752 34333 3321 2223 48999999999999999998865
No 124
>PRK00024 hypothetical protein; Reviewed
Probab=92.18 E-value=0.15 Score=46.39 Aligned_cols=60 Identities=23% Similarity=0.356 Sum_probs=43.5
Q ss_pred hcCCHHHHHHHHhccCchHHHHHHHHHHHHhhHHhCCC---CcccHHHHhcCCCCcHHHHHHHHH
Q psy9547 72 AFDTEDNVLKMWEGLGYYSRVRNFQAGCRQVIEQFGGE---VPRDKKQLLSIKGVGDYTAGALAS 133 (363)
Q Consensus 72 a~a~~~el~~~~~~lG~~~rA~~l~~~a~~i~~~~~g~---~p~~~~~L~~lpGIG~~tA~~il~ 133 (363)
..++..||.+++ +|+..+-+....+|+.+.++||+. +..+.++|.+++|||+..|..++.
T Consensus 22 ~~Lsd~ELLa~l--L~~g~~~~~~~~LA~~LL~~fgsL~~l~~as~~eL~~i~GIG~akA~~L~a 84 (224)
T PRK00024 22 AALSDAELLAIL--LRTGTKGKSVLDLARELLQRFGSLRGLLDASLEELQSIKGIGPAKAAQLKA 84 (224)
T ss_pred ccCCHHHHHHHH--HcCCCCCCCHHHHHHHHHHHcCCHHHHHhCCHHHHhhccCccHHHHHHHHH
Confidence 345777888876 444434456677899999888752 235689999999999998876654
No 125
>TIGR00608 radc DNA repair protein radc. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=92.18 E-value=0.14 Score=46.38 Aligned_cols=59 Identities=15% Similarity=0.224 Sum_probs=43.0
Q ss_pred cCCHHHHHHHHhccCchHHHH-HHHHHHHHhhHHh---CCC---CcccHHHHhcCCCCcHHHHHHHHH
Q psy9547 73 FDTEDNVLKMWEGLGYYSRVR-NFQAGCRQVIEQF---GGE---VPRDKKQLLSIKGVGDYTAGALAS 133 (363)
Q Consensus 73 ~a~~~el~~~~~~lG~~~rA~-~l~~~a~~i~~~~---~g~---~p~~~~~L~~lpGIG~~tA~~il~ 133 (363)
.++..||.+++=+.|. +-+ ....+|+.+.++| |+. ...+.++|++++|||+..|..++.
T Consensus 13 ~Lsd~ELLailL~~g~--~~~~~~~~lA~~ll~~f~~~g~l~~l~~a~~~eL~~i~GiG~aka~~l~a 78 (218)
T TIGR00608 13 ALSDYELLAIILRTGT--PKGLDVLSLSKRLLDVFGRQDSLGHLLSAPPEELSSVPGIGEAKAIQLKA 78 (218)
T ss_pred cCCHHHHHHHHHhCCC--CCCCCHHHHHHHHHHHhcccCCHHHHHhCCHHHHHhCcCCcHHHHHHHHH
Confidence 4567788887633443 444 7888899999988 442 235789999999999977776654
No 126
>PF09674 DUF2400: Protein of unknown function (DUF2400); InterPro: IPR014127 Members of this uncharacterised protein family are found sporadically, so far only among spirochetes, epsilon and delta proteobacteria, and Bacteroides. The function is unknown and its gene neighbourhoods show little conservation.
Probab=91.78 E-value=0.098 Score=47.68 Aligned_cols=55 Identities=18% Similarity=0.221 Sum_probs=41.7
Q ss_pred cccccchHHHHHhhhcccccCcCCcccHHHHHHHHHhcCCcccHHHHHHHHHHHhh
Q psy9547 140 TPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALMDLGA 195 (363)
Q Consensus 140 ~~~vD~~v~Rvl~Rl~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~l~~~G~ 195 (363)
.+|+|+||.|+..++|.+.....+-++..++.+.+..+.|.+ |-.+--||..+|.
T Consensus 176 iiPLDtHv~~var~LGL~~rk~~d~k~A~elT~~lr~~~p~D-PvKYDFAL~~~Gi 230 (232)
T PF09674_consen 176 IIPLDTHVFRVARKLGLLKRKSADWKAARELTEALREFDPDD-PVKYDFALFRLGI 230 (232)
T ss_pred cccchHhHHHHHHHcCCccCCCccHHHHHHHHHHHHhcCCCC-CcchhhhcccCCc
Confidence 489999999999999998765444566677777777777654 7666667777664
No 127
>PRK02515 psbU photosystem II complex extrinsic protein precursor U; Provisional
Probab=90.82 E-value=0.63 Score=38.23 Aligned_cols=58 Identities=10% Similarity=0.204 Sum_probs=39.8
Q ss_pred hCCCHHHHhcCCHHHHHHHHhccCch-HHHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHHH
Q psy9547 64 TYPTIKDFAFDTEDNVLKMWEGLGYY-SRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 64 ~~pt~~~la~a~~~el~~~~~~lG~~-~rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~~ 134 (363)
.+...-+|-.|+.+++.. + -|.| .+|+.|. . +|.+ .+.++|.++||||+++.+.+--+
T Consensus 48 ~~~~kIdiN~A~~~el~~-l--pGigP~~A~~IV-------~--nGpf-~sveDL~~V~GIgekqk~~l~k~ 106 (132)
T PRK02515 48 EFGEKIDLNNSSVRAFRQ-F--PGMYPTLAGKIV-------K--NAPY-DSVEDVLNLPGLSERQKELLEAN 106 (132)
T ss_pred hcCCcccCCccCHHHHHH-C--CCCCHHHHHHHH-------H--CCCC-CCHHHHHcCCCCCHHHHHHHHHh
Confidence 456667777888888766 3 3544 4666665 2 4644 57889999999998877765543
No 128
>PRK14601 ruvA Holliday junction DNA helicase RuvA; Provisional
Probab=90.75 E-value=0.29 Score=43.01 Aligned_cols=27 Identities=19% Similarity=0.274 Sum_probs=22.8
Q ss_pred ccHHHHhcCCCCcHHHHHHHHHHhcCC
Q psy9547 112 RDKKQLLSIKGVGDYTAGALASICYNI 138 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il~~~~~~ 138 (363)
.+.+.|.++||||+|||+-|.+---++
T Consensus 105 ~D~~~L~~vpGIGkKtAeRIilELkdK 131 (183)
T PRK14601 105 GDESVLKKVPGIGPKSAKRIIAELSDA 131 (183)
T ss_pred CCHHHHhhCCCCCHHHHHHHHHHHHHH
Confidence 478999999999999999998765444
No 129
>TIGR02757 conserved hypothetical protein TIGR02757. Members of this uncharacterized protein family are found sporadically, so far only among spirochetes, epsilon and delta proteobacteria, and Bacteroides. The function is unknown and its gene neighborhoods show little conservation.
Probab=90.54 E-value=0.14 Score=46.53 Aligned_cols=55 Identities=20% Similarity=0.274 Sum_probs=40.0
Q ss_pred cccccchHHHHHhhhcccccCcCCcccHHHHHHHHHhcCCcccHHHHHHHHHHHhh
Q psy9547 140 TPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALMDLGA 195 (363)
Q Consensus 140 ~~~vD~~v~Rvl~Rl~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~l~~~G~ 195 (363)
.+|+|+||.||..++|.+.....+-++..++.+.+..+-|. +|-.+--||..+|.
T Consensus 173 iiPLDtH~~rvar~LgL~~Rk~~d~kaa~ElT~~Lr~~dp~-DPvKYDFAL~~lGi 227 (229)
T TIGR02757 173 ILPLDTHVFRIAKKLKLLKRKSYDLKAAIEITEALRELNPE-DPIKYDFALFRLGQ 227 (229)
T ss_pred eeechHhHHHHHHHhCCcccCchhHHHHHHHHHHHHhcCCC-CCcchhhhcccCCc
Confidence 58999999999999988775433345566677777776664 47667677777764
No 130
>PRK13901 ruvA Holliday junction DNA helicase RuvA; Provisional
Probab=90.42 E-value=0.32 Score=43.12 Aligned_cols=35 Identities=17% Similarity=0.166 Sum_probs=26.7
Q ss_pred HHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHHHhcCC
Q psy9547 97 AGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALASICYNI 138 (363)
Q Consensus 97 ~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~~~~~~ 138 (363)
+++++|.+ .+.+.|.++||||+|||+-|.+---++
T Consensus 96 el~~aI~~-------~D~~~L~~vpGIGkKtAeRIIlELkdK 130 (196)
T PRK13901 96 EFRDAIDR-------EDIELISKVKGIGNKMAGKIFLKLRGK 130 (196)
T ss_pred HHHHHHHh-------CCHHHHhhCCCCCHHHHHHHHHHHHHh
Confidence 44555544 378999999999999999998765444
No 131
>COG0353 RecR Recombinational DNA repair protein (RecF pathway) [DNA replication, recombination, and repair]
Probab=90.07 E-value=0.15 Score=44.80 Aligned_cols=28 Identities=25% Similarity=0.179 Sum_probs=22.0
Q ss_pred cccHHHHhcCCCCcHHHHHHHHHHhcCC
Q psy9547 111 PRDKKQLLSIKGVGDYTAGALASICYNI 138 (363)
Q Consensus 111 p~~~~~L~~lpGIG~~tA~~il~~~~~~ 138 (363)
...++.|.+|||||+|+|.-++.+-+++
T Consensus 8 ~~LI~~l~kLPGvG~KsA~R~AfhLL~~ 35 (198)
T COG0353 8 EKLIDALKKLPGVGPKSAQRLAFHLLQR 35 (198)
T ss_pred HHHHHHHhhCCCCChhHHHHHHHHHHcc
Confidence 3457889999999999999777665443
No 132
>PRK14606 ruvA Holliday junction DNA helicase RuvA; Provisional
Probab=90.05 E-value=0.36 Score=42.65 Aligned_cols=27 Identities=19% Similarity=0.195 Sum_probs=22.5
Q ss_pred ccHHHHhcCCCCcHHHHHHHHHHhcCC
Q psy9547 112 RDKKQLLSIKGVGDYTAGALASICYNI 138 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il~~~~~~ 138 (363)
.+.+.|.++||||+|||+-|..---++
T Consensus 105 ~D~~~L~~vpGIGkKtAerIilELkdK 131 (188)
T PRK14606 105 QDVEGLSKLPGISKKTAERIVMELKDE 131 (188)
T ss_pred CCHHHHhhCCCCCHHHHHHHHHHHHHh
Confidence 378999999999999999998664443
No 133
>TIGR00624 tag DNA-3-methyladenine glycosylase I. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=89.58 E-value=3.1 Score=36.39 Aligned_cols=74 Identities=19% Similarity=0.196 Sum_probs=62.6
Q ss_pred CCCCChHHHHHHHHHhhhcchhhHHHHHHHHHHhCC--CHHHHhcCCHHHHHHHHhccCchH---HHHHHHHHHHHhhH
Q psy9547 31 RESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYP--TIKDFAFDTEDNVLKMWEGLGYYS---RVRNFQAGCRQVIE 104 (363)
Q Consensus 31 ~~~~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~p--t~~~la~a~~~el~~~~~~lG~~~---rA~~l~~~a~~i~~ 104 (363)
...+.-|+.|+-+++-.-.++.++.+-.+.|.+.|- +|+.+|..++++++.++..-|.-+ |.+.+++=|+.+.+
T Consensus 25 ~dd~~LFE~L~Le~fQAGLSW~tIL~Kr~~fr~aF~~Fd~~~VA~~~e~~ie~L~~d~~IIRnr~KI~Avi~NA~~~l~ 103 (179)
T TIGR00624 25 RDSVALFERMSLEGFQAGLSWITVLRKRENYRRAFSGFDIVKVARMTDADVERLLQDDGIIRNRGKIEATIANARAALQ 103 (179)
T ss_pred cCCHHHHHHHHHHHHhCcCCHHHHHHhHHHHHHHHcCCCHHHHhCCCHHHHHHHhcCccchhhHHHHHHHHHHHHHHHH
Confidence 344678999999999999999999999999998874 899999999999999999888753 45567777777765
No 134
>TIGR00615 recR recombination protein RecR. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=89.53 E-value=0.3 Score=43.16 Aligned_cols=26 Identities=23% Similarity=0.241 Sum_probs=20.9
Q ss_pred ccHHHHhcCCCCcHHHHHHHHHHhcC
Q psy9547 112 RDKKQLLSIKGVGDYTAGALASICYN 137 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il~~~~~ 137 (363)
...+.|.+|||||+|||.-++.+-+.
T Consensus 8 ~Li~~l~~LPGIG~KsA~RlA~~ll~ 33 (195)
T TIGR00615 8 KLIESLKKLPGIGPKSAQRLAFHLLK 33 (195)
T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHc
Confidence 45678899999999999987766543
No 135
>PRK14602 ruvA Holliday junction DNA helicase RuvA; Provisional
Probab=89.52 E-value=0.41 Score=42.80 Aligned_cols=27 Identities=22% Similarity=0.186 Sum_probs=22.3
Q ss_pred ccHHHHhcCCCCcHHHHHHHHHHhcCC
Q psy9547 112 RDKKQLLSIKGVGDYTAGALASICYNI 138 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il~~~~~~ 138 (363)
.+.+.|.++||||+|||+-|..---++
T Consensus 106 ~D~~~L~~ipGIGkKtAerIilELkdK 132 (203)
T PRK14602 106 EDVAALTRVSGIGKKTAQHIFLELKYK 132 (203)
T ss_pred CCHHHHhcCCCcCHHHHHHHHHHHHHh
Confidence 478999999999999999998654333
No 136
>PRK00076 recR recombination protein RecR; Reviewed
Probab=89.44 E-value=0.31 Score=43.23 Aligned_cols=27 Identities=22% Similarity=0.196 Sum_probs=21.5
Q ss_pred ccHHHHhcCCCCcHHHHHHHHHHhcCC
Q psy9547 112 RDKKQLLSIKGVGDYTAGALASICYNI 138 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il~~~~~~ 138 (363)
...+.|.+|||||+|||.-+..+-+..
T Consensus 8 ~Li~~l~~LPGIG~KsA~Rla~~ll~~ 34 (196)
T PRK00076 8 KLIEALRKLPGIGPKSAQRLAFHLLQR 34 (196)
T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHcC
Confidence 457788999999999999887765533
No 137
>PRK14603 ruvA Holliday junction DNA helicase RuvA; Provisional
Probab=89.27 E-value=0.44 Score=42.40 Aligned_cols=27 Identities=30% Similarity=0.229 Sum_probs=22.3
Q ss_pred ccHHHHhcCCCCcHHHHHHHHHHhcCC
Q psy9547 112 RDKKQLLSIKGVGDYTAGALASICYNI 138 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il~~~~~~ 138 (363)
.+.+.|.++||||+|||+-|..---++
T Consensus 104 ~D~~~L~kvpGIGkKtAerIilELkdK 130 (197)
T PRK14603 104 GDARLLTSASGVGKKLAERIALELKGK 130 (197)
T ss_pred CCHHHHhhCCCCCHHHHHHHHHHHHHH
Confidence 378999999999999999998654433
No 138
>PRK14604 ruvA Holliday junction DNA helicase RuvA; Provisional
Probab=89.17 E-value=0.46 Score=42.24 Aligned_cols=27 Identities=22% Similarity=0.239 Sum_probs=22.5
Q ss_pred ccHHHHhcCCCCcHHHHHHHHHHhcCC
Q psy9547 112 RDKKQLLSIKGVGDYTAGALASICYNI 138 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il~~~~~~ 138 (363)
.+.+.|.++||||+|||+-|..---++
T Consensus 105 ~D~~~L~kvpGIGkKtAerIilELk~K 131 (195)
T PRK14604 105 GDVARLARVPGIGKKTAERIVLELKGK 131 (195)
T ss_pred CCHHHHhhCCCCCHHHHHHHHHHHHHH
Confidence 478999999999999999998764443
No 139
>COG0632 RuvA Holliday junction resolvasome, DNA-binding subunit [DNA replication, recombination, and repair]
Probab=89.03 E-value=0.45 Score=42.39 Aligned_cols=27 Identities=30% Similarity=0.283 Sum_probs=22.8
Q ss_pred ccHHHHhcCCCCcHHHHHHHHHHhcCC
Q psy9547 112 RDKKQLLSIKGVGDYTAGALASICYNI 138 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il~~~~~~ 138 (363)
.+...|.++||||+|||.-+++---|+
T Consensus 105 ~d~~~L~k~PGIGkKtAerivleLk~K 131 (201)
T COG0632 105 EDVKALSKIPGIGKKTAERIVLELKGK 131 (201)
T ss_pred cChHhhhcCCCCCHHHHHHHHHHHhhh
Confidence 478999999999999999998865443
No 140
>PRK13844 recombination protein RecR; Provisional
Probab=88.90 E-value=0.35 Score=42.89 Aligned_cols=26 Identities=15% Similarity=0.158 Sum_probs=20.9
Q ss_pred ccHHHHhcCCCCcHHHHHHHHHHhcC
Q psy9547 112 RDKKQLLSIKGVGDYTAGALASICYN 137 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il~~~~~ 137 (363)
...+.|.+|||||+|+|.-+..+-+.
T Consensus 12 ~LI~~l~~LPGIG~KsA~Rla~~lL~ 37 (200)
T PRK13844 12 AVIESLRKLPTIGKKSSQRLALYLLD 37 (200)
T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHc
Confidence 45677889999999999988776543
No 141
>PRK00116 ruvA Holliday junction DNA helicase RuvA; Reviewed
Probab=88.22 E-value=0.72 Score=40.86 Aligned_cols=23 Identities=30% Similarity=0.387 Sum_probs=19.7
Q ss_pred ccHHHHhcCCCCcHHHHHHHHHH
Q psy9547 112 RDKKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il~~ 134 (363)
.+.+.|.++||||+++|+.|+..
T Consensus 105 ~d~~~L~~v~Gig~k~A~~I~~~ 127 (192)
T PRK00116 105 GDVKALTKVPGIGKKTAERIVLE 127 (192)
T ss_pred CCHHHHHhCCCCCHHHHHHHHHH
Confidence 46788999999999999998865
No 142
>smart00483 POLXc DNA polymerase X family. includes vertebrate polymerase beta and terminal deoxynucleotidyltransferases
Probab=88.15 E-value=2.2 Score=41.25 Aligned_cols=31 Identities=29% Similarity=0.308 Sum_probs=25.0
Q ss_pred hCCCCcccHHHHhcCCCCcHHHHHHHHHHhcCC
Q psy9547 106 FGGEVPRDKKQLLSIKGVGDYTAGALASICYNI 138 (363)
Q Consensus 106 ~~g~~p~~~~~L~~lpGIG~~tA~~il~~~~~~ 138 (363)
.....|..+.+|+++|||||+||..+-. +|.
T Consensus 80 l~~~~p~~l~~l~~i~GiGpk~a~~l~~--lGi 110 (334)
T smart00483 80 LNDEVYKSLKLFTNVFGVGPKTAAKWYR--KGI 110 (334)
T ss_pred hcCcHHHHHHHHHccCCcCHHHHHHHHH--hCC
Confidence 3456788899999999999999987755 554
No 143
>KOG4548|consensus
Probab=88.13 E-value=0.87 Score=41.50 Aligned_cols=96 Identities=9% Similarity=0.077 Sum_probs=59.4
Q ss_pred ceEEEeecCCCccccccccccccCC-CC--CchHHH-HHHHhhCCCccc---ccccccccEEEEccceE-------EEEE
Q psy9547 249 NKYLIQKRPTTGLLSNFYMFLSFES-DT--PYTSQV-EFLSENLPFKVN---LNEKCLGNVKHVFSHLK-------WNMD 314 (363)
Q Consensus 249 ~~vLl~~R~~~~~~~glWEFPggk~-e~--~~~~al-rEl~Eelgl~v~---~~~~~l~~v~h~~sh~~-------~~l~ 314 (363)
.-|||++|+-+. .+.|-||.+.. ++ +....+ +.|+.-.|-... ++..+++...|.++.-. -.+.
T Consensus 139 ~LyLLV~~k~g~--~s~w~fP~~~~s~~~~~lr~~ae~~Lk~~~ge~~~t~fvgnaP~g~~~~q~pr~~~~e~~~~sk~f 216 (263)
T KOG4548|consen 139 KLYLLVKRKFGK--SSVWIFPNRQFSSSEKTLRGHAERDLKVLSGENKSTWFVGNAPFGHTPLQSPREMTTEEPVSSKVF 216 (263)
T ss_pred eEEEEEeeccCc--cceeeCCCcccCCccchHHHHHHHHHHHHhcchhhhheeccCccccccccCcccccccccccceeE
Confidence 578888875333 35899999998 43 444445 777665553322 22236777777776442 2345
Q ss_pred EEEEEecCC---CCCCCCCCeeecccccCcCCCCh
Q psy9547 315 VYSGTAKEK---TIPANKTYKLITETQMKKYAFPV 346 (363)
Q Consensus 315 ~~~~~~~~~---~~~~~~~~~W~~~~el~~~~~~~ 346 (363)
+|.+....+ ......++.|++-++|.++--+.
T Consensus 217 f~k~~lv~~~~~kn~n~edfvWvTkdel~e~l~~~ 251 (263)
T KOG4548|consen 217 FFKASLVANSNQKNQNKEDFVWVTKDELGEKLPKF 251 (263)
T ss_pred EeeeeeccccchhcccccceEEechHHHhhhcchH
Confidence 566666554 22234568999999998854333
No 144
>PRK14605 ruvA Holliday junction DNA helicase RuvA; Provisional
Probab=87.72 E-value=0.45 Score=42.25 Aligned_cols=23 Identities=26% Similarity=0.359 Sum_probs=19.6
Q ss_pred ccHHHHhcCCCCcHHHHHHHHHH
Q psy9547 112 RDKKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il~~ 134 (363)
.+.+.|.++||||+|||+-|..-
T Consensus 105 ~D~~~L~~vpGIGkKtAerIilE 127 (194)
T PRK14605 105 GNAELLSTIPGIGKKTASRIVLE 127 (194)
T ss_pred CCHHHHHhCCCCCHHHHHHHHHH
Confidence 46888999999999999996644
No 145
>PF12836 HHH_3: Helix-hairpin-helix motif; PDB: 2EDU_A 2OCE_A 3BZK_A 3BZC_A 2DUY_A.
Probab=87.20 E-value=0.6 Score=33.56 Aligned_cols=23 Identities=22% Similarity=0.403 Sum_probs=18.3
Q ss_pred ccHHHHhcCCCCcHHHHHHHHHH
Q psy9547 112 RDKKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il~~ 134 (363)
.+.++|.++||||++.|..|+.+
T Consensus 11 as~~eL~~lpgi~~~~A~~Iv~~ 33 (65)
T PF12836_consen 11 ASAEELQALPGIGPKQAKAIVEY 33 (65)
T ss_dssp S-HHHHHTSTT--HHHHHHHHHH
T ss_pred CCHHHHHHcCCCCHHHHHHHHHH
Confidence 36899999999999999999987
No 146
>PRK08609 hypothetical protein; Provisional
Probab=86.88 E-value=3.6 Score=42.75 Aligned_cols=101 Identities=19% Similarity=0.308 Sum_probs=53.7
Q ss_pred cCHHHHHHHHHHHHHHHhhcCCCCCCCCCCChHHHHHHHHHhhhcchhhHHHHHHHHHHhCCCHHHHhcCCHHHHHHHHh
Q psy9547 5 LSAKEILAFQESILTWYKQNARQLPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAFDTEDNVLKMWE 84 (363)
Q Consensus 5 ~~~~~~~~~~~~l~~~~~~~~~~lpw~~~~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~pt~~~la~a~~~el~~~~~ 84 (363)
|+..++..+++.+...++..+ .|||.+--- .+|.. -++..|. +|... .++ .-+.
T Consensus 1 m~n~~ia~~l~~~A~~le~~g--------~n~fr~~aY---------r~Aa~----~i~~l~~--~i~~~--~~l-~~ip 54 (570)
T PRK08609 1 MNKKDVIKLLETIATYMELKG--------ENPFKISAF---------RKAAQ----ALELDER--SLSEI--DDF-TKLK 54 (570)
T ss_pred CChHHHHHHHHHHHHHHHhcC--------CCcHHHHHH---------HHHHH----HHHhCch--hhhhh--hhh-ccCC
Confidence 356788888888888888644 356654211 11111 1233332 12222 122 2234
Q ss_pred ccCchHHHHHHHHHHHH----hhHHhCCCCcccHHHHhcCCCCcHHHHHHHH
Q psy9547 85 GLGYYSRVRNFQAGCRQ----VIEQFGGEVPRDKKQLLSIKGVGDYTAGALA 132 (363)
Q Consensus 85 ~lG~~~rA~~l~~~a~~----i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il 132 (363)
++|=. =|+.|.++.+. -+++...++|....+|+++|||||+||..+-
T Consensus 55 gIG~~-ia~kI~Eil~tG~~~~le~l~~~~p~~~~~l~~i~GiGpk~a~~l~ 105 (570)
T PRK08609 55 GIGKG-TAEVIQEYRETGESSVLQELKKEVPEGLLPLLKLPGLGGKKIAKLY 105 (570)
T ss_pred CcCHH-HHHHHHHHHHhCChHHHHHHHhhCcHHHHHHhcCCCCCHHHHHHHH
Confidence 55521 13344443321 1122234578888899999999999997654
No 147
>PRK14600 ruvA Holliday junction DNA helicase RuvA; Provisional
Probab=86.77 E-value=0.7 Score=40.75 Aligned_cols=25 Identities=20% Similarity=0.321 Sum_probs=20.5
Q ss_pred cHHHHhcCCCCcHHHHHHHHHHhcCC
Q psy9547 113 DKKQLLSIKGVGDYTAGALASICYNI 138 (363)
Q Consensus 113 ~~~~L~~lpGIG~~tA~~il~~~~~~ 138 (363)
+.+.| ++||||+|||+-|+.---++
T Consensus 106 D~~~L-~vpGIGkKtAerIilELk~K 130 (186)
T PRK14600 106 DKAAL-KVNGIGEKLINRIITELQYK 130 (186)
T ss_pred CHhhe-ECCCCcHHHHHHHHHHHHHH
Confidence 57889 99999999999998654433
No 148
>PF14520 HHH_5: Helix-hairpin-helix domain; PDB: 3AUO_B 3AU6_A 3AU2_A 3B0X_A 3B0Y_A 1SZP_C 3LDA_A 1WCN_A 2JZB_B 2ZTC_A ....
Probab=86.76 E-value=0.65 Score=32.75 Aligned_cols=26 Identities=31% Similarity=0.291 Sum_probs=21.9
Q ss_pred HHHHhcCCCCcHHHHHHHHHHhcCCc
Q psy9547 114 KKQLLSIKGVGDYTAGALASICYNIP 139 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~~~~~~ 139 (363)
..+|+++||||+++|..+...+++-.
T Consensus 4 ~~~L~~I~Gig~~~a~~L~~~G~~t~ 29 (60)
T PF14520_consen 4 FDDLLSIPGIGPKRAEKLYEAGIKTL 29 (60)
T ss_dssp HHHHHTSTTCHHHHHHHHHHTTCSSH
T ss_pred HHhhccCCCCCHHHHHHHHhcCCCcH
Confidence 56899999999999999988866654
No 149
>KOG4195|consensus
Probab=86.47 E-value=0.76 Score=41.02 Aligned_cols=105 Identities=10% Similarity=0.126 Sum_probs=57.4
Q ss_pred eEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCcc-ccccc-----cc---ccEEEEc------cceE-
Q psy9547 250 KYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKV-NLNEK-----CL---GNVKHVF------SHLK- 310 (363)
Q Consensus 250 ~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v-~~~~~-----~l---~~v~h~~------sh~~- 310 (363)
+++..||+. .|.|-.|||-+++ .....| ||+.||.=-.. ...+. .. .+|..-| |+--
T Consensus 140 e~vavkr~d----~~~WAiPGGmvdpGE~vs~tLkRef~eEa~ns~~k~~e~~l~~~~~~~g~evYkGYvDDpRNTDNaW 215 (275)
T KOG4195|consen 140 EFVAVKRPD----NGEWAIPGGMVDPGEKVSATLKREFGEEAMNSLQKSNEEMLHKLLSQCGMEVYKGYVDDPRNTDNAW 215 (275)
T ss_pred EEEEEecCC----CCcccCCCCcCCchhhhhHHHHHHHHHHHHHHHhhhhHHHHHHHHHhCCeEEeeeecCCCCcccccc
Confidence 577889976 4788999999998 345677 88888731111 11000 01 1111112 1111
Q ss_pred EEEEEEEEEecCCC------C---CCCCCCeeecccccCcCCCChHHHHHHHHHHhhhc
Q psy9547 311 WNMDVYSGTAKEKT------I---PANKTYKLITETQMKKYAFPVPYQKVWKLFTKSKG 360 (363)
Q Consensus 311 ~~l~~~~~~~~~~~------~---~~~~~~~W~~~~el~~~~~~~a~~~il~~l~~~~~ 360 (363)
++..++.+.-.+|. + .....++|++.+. ++++=+....+|+.+.+..+
T Consensus 216 mET~avn~hde~g~~~~nl~L~AgDda~~vrW~dVn~--~l~LyAshs~fi~lvae~r~ 272 (275)
T KOG4195|consen 216 METVAVNFHDEDGNNMDNLNLEAGDDAKKVRWVDVNS--NLPLYASHSQFIQLVAEKRG 272 (275)
T ss_pred eeEEEEeeeccccchhccccccccCcccceeEEEecC--CccchhhHHHHHHHHHHHhc
Confidence 22222222222221 1 1236679998775 56778888888888876543
No 150
>PF11731 Cdd1: Pathogenicity locus; InterPro: IPR021725 Cdd1 is expressed as part of the pathogenicity locus operon in several different orders of bacteria []. Many members of the family are annotated as being putative mitomycin resistance proteins but this could not be confirmed.
Probab=86.29 E-value=0.9 Score=35.24 Aligned_cols=28 Identities=25% Similarity=0.310 Sum_probs=23.5
Q ss_pred cHHHHhcCCCCcHHHHHHHHHHhcCCcc
Q psy9547 113 DKKQLLSIKGVGDYTAGALASICYNIPT 140 (363)
Q Consensus 113 ~~~~L~~lpGIG~~tA~~il~~~~~~~~ 140 (363)
...+|..|||||+.+|.-+...++..+.
T Consensus 10 ~~~~L~~iP~IG~a~a~DL~~LGi~s~~ 37 (93)
T PF11731_consen 10 GLSDLTDIPNIGKATAEDLRLLGIRSPA 37 (93)
T ss_pred HHHHHhcCCCccHHHHHHHHHcCCCCHH
Confidence 4678999999999999998887776654
No 151
>PRK14605 ruvA Holliday junction DNA helicase RuvA; Provisional
Probab=86.20 E-value=1 Score=40.00 Aligned_cols=21 Identities=24% Similarity=0.352 Sum_probs=19.1
Q ss_pred HHHHhcCCCCcHHHHHHHHHH
Q psy9547 114 KKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~ 134 (363)
.+.|++++||||++|-.||+.
T Consensus 72 F~~Li~V~GIGpK~Al~ILs~ 92 (194)
T PRK14605 72 FETLIDVSGIGPKLGLAMLSA 92 (194)
T ss_pred HHHHhCCCCCCHHHHHHHHHh
Confidence 677999999999999999983
No 152
>PRK00116 ruvA Holliday junction DNA helicase RuvA; Reviewed
Probab=85.90 E-value=0.72 Score=40.86 Aligned_cols=24 Identities=33% Similarity=0.321 Sum_probs=19.1
Q ss_pred HHHHhcCCCCcHHHHHHHHHHhcCC
Q psy9547 114 KKQLLSIKGVGDYTAGALASICYNI 138 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~~~~~ 138 (363)
...|.++|||||+||..++. .|+-
T Consensus 72 f~~L~~i~GIGpk~A~~il~-~fg~ 95 (192)
T PRK00116 72 FRLLISVSGVGPKLALAILS-GLSP 95 (192)
T ss_pred HHHHhcCCCCCHHHHHHHHH-hCCH
Confidence 55788999999999999876 3443
No 153
>PF12836 HHH_3: Helix-hairpin-helix motif; PDB: 2EDU_A 2OCE_A 3BZK_A 3BZC_A 2DUY_A.
Probab=85.85 E-value=0.79 Score=32.94 Aligned_cols=53 Identities=15% Similarity=0.245 Sum_probs=30.6
Q ss_pred HhcCCHHHHHHHHhccCchH-HHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHH
Q psy9547 71 FAFDTEDNVLKMWEGLGYYS-RVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALA 132 (363)
Q Consensus 71 la~a~~~el~~~~~~lG~~~-rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il 132 (363)
|-.|+.+||..+ -|+.. .|+.|.+. .+. +|.+ .+.++|..++|||+.+.+-+.
T Consensus 8 iN~as~~eL~~l---pgi~~~~A~~Iv~~----R~~-~G~f-~s~~dL~~v~gi~~~~~~~l~ 61 (65)
T PF12836_consen 8 INTASAEELQAL---PGIGPKQAKAIVEY----REK-NGPF-KSLEDLKEVPGIGPKTYEKLK 61 (65)
T ss_dssp TTTS-HHHHHTS---TT--HHHHHHHHHH----HHH-H-S--SSGGGGGGSTT--HHHHHHHC
T ss_pred CccCCHHHHHHc---CCCCHHHHHHHHHH----HHh-CcCC-CCHHHHhhCCCCCHHHHHHHH
Confidence 446777777652 26654 67777552 222 2444 579999999999999988654
No 154
>TIGR01259 comE comEA protein. This model describes the ComEA protein in bacteria. The com E locus is obligatory for bacterial cell competence - the process of internalizing the exogenous added DNA. Lesions in the loci has been variously described for the appearance of competence-related pheonotypes and impairment of competence, suggesting their intimate functional role in bacterial transformation.
Probab=85.64 E-value=1.8 Score=35.35 Aligned_cols=61 Identities=13% Similarity=0.267 Sum_probs=40.4
Q ss_pred CCHHHHhcCCHHHHHHHHhccCchHHHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHHH
Q psy9547 66 PTIKDFAFDTEDNVLKMWEGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 66 pt~~~la~a~~~el~~~~~~lG~~~rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~~ 134 (363)
+.+-+|-.|+.++|.. +.++|. .+|+.|.+-=. + +|.+ .+.++|.+++|||+++++-+.-+
T Consensus 57 ~~~iniNtA~~~eL~~-lpGIG~-~~A~~Ii~~R~---~--~g~f-~s~eeL~~V~GIg~k~~~~i~~~ 117 (120)
T TIGR01259 57 LAAVNINAASLEELQA-LPGIGP-AKAKAIIEYRE---E--NGAF-KSVDDLTKVSGIGEKSLEKLKDY 117 (120)
T ss_pred CCCEeCCcCCHHHHhc-CCCCCH-HHHHHHHHHHH---h--cCCc-CCHHHHHcCCCCCHHHHHHHHhc
Confidence 3455677788888865 345553 35555544211 1 2433 67999999999999999987654
No 155
>COG4112 Predicted phosphoesterase (MutT family) [General function prediction only]
Probab=85.62 E-value=5 Score=34.19 Aligned_cols=105 Identities=20% Similarity=0.245 Sum_probs=55.7
Q ss_pred EEEEEEEeCCceEEEeecCCCc---ccccccccc-ccCCCC----CchHH-----H-HHHHhhCCCcccccc--cccccE
Q psy9547 239 LLTVVVKTDTNKYLIQKRPTTG---LLSNFYMFL-SFESDT----PYTSQ-----V-EFLSENLPFKVNLNE--KCLGNV 302 (363)
Q Consensus 239 v~~~ii~~~~~~vLl~~R~~~~---~~~glWEFP-ggk~e~----~~~~a-----l-rEl~Eelgl~v~~~~--~~l~~v 302 (363)
+..++|.++ ++||+..|..+| -+.|++..= ||.... ++.++ + ||+.||+++.-...+ ..++-+
T Consensus 63 IpYvvi~~e-devliyeRltgggE~RLHn~~SlG~GGHmn~~~GA~s~~evLk~n~~REleEEv~vseqd~q~~e~lGlI 141 (203)
T COG4112 63 IPYVVIMDE-DEVLIYERLTGGGEKRLHNLYSLGIGGHMNEGDGATSREEVLKGNLERELEEEVDVSEQDLQELEFLGLI 141 (203)
T ss_pred ccEEEEecC-CEEEEEEeccCcchhhhccccccccccccccCCCcccHHHHHccchHHHHHHHhCcCHHHhhhheeeeee
Confidence 345667777 799999998765 566777654 233322 22222 2 899999998733222 022222
Q ss_pred EEEcc-ceEEEEEE-EEEEecCC--CCCC--CCCCeeecccccCc-CCC
Q psy9547 303 KHVFS-HLKWNMDV-YSGTAKEK--TIPA--NKTYKLITETQMKK-YAF 344 (363)
Q Consensus 303 ~h~~s-h~~~~l~~-~~~~~~~~--~~~~--~~~~~W~~~~el~~-~~~ 344 (363)
...-. -.+++|-+ |..+.... ...+ .-.+.|+..++|.+ |+-
T Consensus 142 Ndd~neVgkVHiG~lf~~~~k~ndvevKEkd~~~~kwik~~ele~~y~~ 190 (203)
T COG4112 142 NDDTNEVGKVHIGALFLGRGKFNDVEVKEKDLFEWKWIKLEELEKFYGV 190 (203)
T ss_pred cCCCcccceEEEEEEEEeeccccceeeeecceeeeeeeeHHHHHHHhhh
Confidence 21111 11233322 22232221 1222 34568999999988 443
No 156
>PF05559 DUF763: Protein of unknown function (DUF763); InterPro: IPR008482 This family consists of several uncharacterised bacterial and archaeal proteins of unknown function.
Probab=85.46 E-value=2.1 Score=40.53 Aligned_cols=43 Identities=40% Similarity=0.553 Sum_probs=32.1
Q ss_pred HHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHHHh---cCCc
Q psy9547 93 RNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALASIC---YNIP 139 (363)
Q Consensus 93 ~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~~~---~~~~ 139 (363)
+.+.++...+.+ .-|.++++|+.++||||.|.-++.+.+ +|.|
T Consensus 251 ~~~~~~l~~~~e----~~p~~feeLL~~~GvGp~TlRALaLvaelIyg~p 296 (319)
T PF05559_consen 251 RRLWKVLEKAYE----RQPSDFEELLLIKGVGPSTLRALALVAELIYGVP 296 (319)
T ss_pred HHHHHHHHHHhh----CCccCHHHHHhcCCCCHHHHHHHHHHHHHHcCCC
Confidence 344444444443 248899999999999999999988886 6665
No 157
>PRK02515 psbU photosystem II complex extrinsic protein precursor U; Provisional
Probab=85.17 E-value=1.8 Score=35.57 Aligned_cols=29 Identities=17% Similarity=0.363 Sum_probs=23.2
Q ss_pred HhCCCCc---ccHHHHhcCCCCcHHHHHHHHH
Q psy9547 105 QFGGEVP---RDKKQLLSIKGVGDYTAGALAS 133 (363)
Q Consensus 105 ~~~g~~p---~~~~~L~~lpGIG~~tA~~il~ 133 (363)
.+++.+. .+.++|.++|||||..|..|..
T Consensus 48 ~~~~kIdiN~A~~~el~~lpGigP~~A~~IV~ 79 (132)
T PRK02515 48 EFGEKIDLNNSSVRAFRQFPGMYPTLAGKIVK 79 (132)
T ss_pred hcCCcccCCccCHHHHHHCCCCCHHHHHHHHH
Confidence 3455554 3688999999999999999994
No 158
>COG1555 ComEA DNA uptake protein and related DNA-binding proteins [DNA replication, recombination, and repair]
Probab=84.91 E-value=0.78 Score=38.93 Aligned_cols=23 Identities=22% Similarity=0.374 Sum_probs=21.0
Q ss_pred cHHHHhcCCCCcHHHHHHHHHHh
Q psy9547 113 DKKQLLSIKGVGDYTAGALASIC 135 (363)
Q Consensus 113 ~~~~L~~lpGIG~~tA~~il~~~ 135 (363)
+.++|..|||||++.|.+|..+-
T Consensus 95 s~eeL~~lpgIG~~kA~aIi~yR 117 (149)
T COG1555 95 SAEELQALPGIGPKKAQAIIDYR 117 (149)
T ss_pred CHHHHHHCCCCCHHHHHHHHHHH
Confidence 58999999999999999999884
No 159
>PRK07945 hypothetical protein; Provisional
Probab=84.85 E-value=1.8 Score=41.76 Aligned_cols=58 Identities=24% Similarity=0.308 Sum_probs=37.7
Q ss_pred HHHHHHHhccCchH-HHHHHHHHHHHhhHHhCCC-Cccc-HH-HHhcCCCCcHHHHHHHHHHh
Q psy9547 77 DNVLKMWEGLGYYS-RVRNFQAGCRQVIEQFGGE-VPRD-KK-QLLSIKGVGDYTAGALASIC 135 (363)
Q Consensus 77 ~el~~~~~~lG~~~-rA~~l~~~a~~i~~~~~g~-~p~~-~~-~L~~lpGIG~~tA~~il~~~ 135 (363)
+++..++.-.|=.. |++..+++|+.|.. .+.+ +..- .+ +|.+|||||.-||+.|.-+.
T Consensus 8 ~~~a~lle~~~~n~frv~ayr~aa~~~~~-~~~~~~~~~~~~g~l~~~~giG~~~a~~i~e~~ 69 (335)
T PRK07945 8 RRIAFLLERARADTYRVRAFRRAADVVEA-LDAAERARRARAGSLTSLPGIGPKTAKVIAQAL 69 (335)
T ss_pred HHHHHHHHHcCCChhhHHHHHHHHHHHHh-cChhHHHHHHhcCCcccCCCcCHHHHHHHHHHH
Confidence 34445555455332 88888888888865 1222 2111 11 68899999999999988774
No 160
>COG2818 Tag 3-methyladenine DNA glycosylase [DNA replication, recombination, and repair]
Probab=84.80 E-value=4.3 Score=35.50 Aligned_cols=71 Identities=20% Similarity=0.209 Sum_probs=58.2
Q ss_pred CChHHHHHHHHHhhhcchhhHHHHHHHHHHhC--CCHHHHhcCCHHHHHHHHhccCchH---HHHHHHHHHHHhhH
Q psy9547 34 NNPYYVWISEVMLQQTQVKTVLPYYEKFIKTY--PTIKDFAFDTEDNVLKMWEGLGYYS---RVRNFQAGCRQVIE 104 (363)
Q Consensus 34 ~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~--pt~~~la~a~~~el~~~~~~lG~~~---rA~~l~~~a~~i~~ 104 (363)
+.-|+.++-++.-.-.+|.++.+-.++|.+.| .+|+.+|..++++++.++...|.-+ |-+.++.-|+.+.+
T Consensus 30 ~~LFE~l~Le~fQAGLSW~tVL~KRe~freaF~~Fd~~kVA~~~~~dverLl~d~gIIR~r~KI~A~i~NA~~~l~ 105 (188)
T COG2818 30 QRLFELLCLEGFQAGLSWLTVLKKREAFREAFHGFDPEKVAAMTEEDVERLLADAGIIRNRGKIKATINNARAVLE 105 (188)
T ss_pred HHHHHHHHHHHHhccchHHHHHHhHHHHHHHHhcCCHHHHHcCCHHHHHHHHhCcchhhhHHHHHHHHHHHHHHHH
Confidence 34688888888877789999999999999987 4999999999999999999999753 45556665666554
No 161
>TIGR01259 comE comEA protein. This model describes the ComEA protein in bacteria. The com E locus is obligatory for bacterial cell competence - the process of internalizing the exogenous added DNA. Lesions in the loci has been variously described for the appearance of competence-related pheonotypes and impairment of competence, suggesting their intimate functional role in bacterial transformation.
Probab=84.09 E-value=0.97 Score=36.87 Aligned_cols=23 Identities=22% Similarity=0.342 Sum_probs=20.9
Q ss_pred cHHHHhcCCCCcHHHHHHHHHHh
Q psy9547 113 DKKQLLSIKGVGDYTAGALASIC 135 (363)
Q Consensus 113 ~~~~L~~lpGIG~~tA~~il~~~ 135 (363)
+.++|.+|||||+++|..|+.+-
T Consensus 66 ~~~eL~~lpGIG~~~A~~Ii~~R 88 (120)
T TIGR01259 66 SLEELQALPGIGPAKAKAIIEYR 88 (120)
T ss_pred CHHHHhcCCCCCHHHHHHHHHHH
Confidence 58999999999999999999873
No 162
>PF14520 HHH_5: Helix-hairpin-helix domain; PDB: 3AUO_B 3AU6_A 3AU2_A 3B0X_A 3B0Y_A 1SZP_C 3LDA_A 1WCN_A 2JZB_B 2ZTC_A ....
Probab=83.47 E-value=2.3 Score=29.88 Aligned_cols=21 Identities=33% Similarity=0.575 Sum_probs=13.3
Q ss_pred cHHHHhcCCCCcHHHHHHHHH
Q psy9547 113 DKKQLLSIKGVGDYTAGALAS 133 (363)
Q Consensus 113 ~~~~L~~lpGIG~~tA~~il~ 133 (363)
+.++|.++||||+++|+.+..
T Consensus 36 ~~~~L~~i~Gig~~~a~~i~~ 56 (60)
T PF14520_consen 36 DPEELAEIPGIGEKTAEKIIE 56 (60)
T ss_dssp HHHHHHTSTTSSHHHHHHHHH
T ss_pred CHHHHhcCCCCCHHHHHHHHH
Confidence 345566677777777766654
No 163
>COG2003 RadC DNA repair proteins [DNA replication, recombination, and repair]
Probab=83.28 E-value=1.8 Score=39.02 Aligned_cols=58 Identities=17% Similarity=0.271 Sum_probs=42.7
Q ss_pred cCCHHHHHHHHhccCchHHHHHHHHHHHHhhHHhCCC---CcccHHHHhcCCCCcHHHHHHHH
Q psy9547 73 FDTEDNVLKMWEGLGYYSRVRNFQAGCRQVIEQFGGE---VPRDKKQLLSIKGVGDYTAGALA 132 (363)
Q Consensus 73 ~a~~~el~~~~~~lG~~~rA~~l~~~a~~i~~~~~g~---~p~~~~~L~~lpGIG~~tA~~il 132 (363)
.++..||.+++=.-|- |-.....+|+.+.++||+- ...+.++|+++||||+..|--+.
T Consensus 23 ~Lsd~ELLailLrtG~--~~~~~~~la~~lL~~fg~L~~l~~a~~~el~~v~GiG~aka~~l~ 83 (224)
T COG2003 23 ALSDAELLAILLRTGT--KGESVLDLAKELLQEFGSLAELLKASVEELSSVKGIGLAKAIQIK 83 (224)
T ss_pred hcchHHHHHHHHhcCC--CCCCHHHHHHHHHHHcccHHHHHhCCHHHHhhCCCccHHHHHHHH
Confidence 3566788888744444 4456788899999998762 45679999999999987655443
No 164
>PF02371 Transposase_20: Transposase IS116/IS110/IS902 family; InterPro: IPR003346 Transposases are needed for efficient transposition of the insertion sequence or transposon DNA. This family includes transposases for IS116, IS110 and IS902. It is often found with the transposase IS111A/IS1328/IS1533 family (see IPR002525 from INTERPRO) [, ]. More information about these proteins can be found at Protein of the Month: Transposase [].; GO: 0003677 DNA binding, 0004803 transposase activity, 0006313 transposition, DNA-mediated
Probab=83.21 E-value=0.99 Score=34.43 Aligned_cols=28 Identities=32% Similarity=0.324 Sum_probs=21.3
Q ss_pred HHHhcCCCCcHHHHHHHHHHhcCCcccc
Q psy9547 115 KQLLSIKGVGDYTAGALASICYNIPTPA 142 (363)
Q Consensus 115 ~~L~~lpGIG~~tA~~il~~~~~~~~~~ 142 (363)
+.|+++||||+-||..++....+...|.
T Consensus 2 ~~l~sipGig~~~a~~llaeigd~~rF~ 29 (87)
T PF02371_consen 2 ELLTSIPGIGPITAATLLAEIGDISRFK 29 (87)
T ss_pred chhcCCCCccHHHHHHHHHHHcCchhcc
Confidence 4588999999999999998773333343
No 165
>PF11798 IMS_HHH: IMS family HHH motif; InterPro: IPR024728 This helix-hairpin-helix motif is found in proteins belonging to the type-Y family of DNA polymerases []. This type of polymerases are thought to be involved in UV protection and mutation [, ]. ; PDB: 3PZP_B 2OH2_B 2W7O_B 3IN5_B 1T94_A 2W7P_B 2W8K_A 2AGQ_A 1RYR_A 3RAX_A ....
Probab=82.46 E-value=1.1 Score=27.46 Aligned_cols=15 Identities=33% Similarity=0.452 Sum_probs=11.8
Q ss_pred HhcCCCCcHHHHHHH
Q psy9547 117 LLSIKGVGDYTAGAL 131 (363)
Q Consensus 117 L~~lpGIG~~tA~~i 131 (363)
+.+++|||++|+.-+
T Consensus 13 i~~~~GIG~kt~~kL 27 (32)
T PF11798_consen 13 IRKFWGIGKKTAKKL 27 (32)
T ss_dssp GGGSTTS-HHHHHHH
T ss_pred HHhhCCccHHHHHHH
Confidence 567999999999864
No 166
>TIGR00084 ruvA Holliday junction DNA helicase, RuvA subunit. RuvA specifically binds Holliday junctions as a sandwich of two tetramers and maintains the configuration of the junction. It forms a complex with two hexameric rings of RuvB, the subunit that contains helicase activity. The complex drives ATP-dependent branch migration of the Holliday junction recombination intermediate. The endonuclease RuvC resolves junctions.
Probab=81.99 E-value=1.1 Score=39.62 Aligned_cols=24 Identities=38% Similarity=0.400 Sum_probs=20.4
Q ss_pred ccHHHHhcCCCCcHHHHHHHHHHh
Q psy9547 112 RDKKQLLSIKGVGDYTAGALASIC 135 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il~~~ 135 (363)
.+...|.++||||+|||+-|++--
T Consensus 104 ~d~~~L~~ipGiGkKtAerIileL 127 (191)
T TIGR00084 104 EEVKALVKIPGVGKKTAERLLLEL 127 (191)
T ss_pred CCHHHHHhCCCCCHHHHHHHHHHH
Confidence 367889999999999999998543
No 167
>KOG0142|consensus
Probab=81.24 E-value=2 Score=37.93 Aligned_cols=112 Identities=10% Similarity=0.128 Sum_probs=62.5
Q ss_pred EEEEeCCceEEEeecCCCc-ccccccc-----ccccCCCC---C----chHHH-HHHHhhCCCcccccc----cccccEE
Q psy9547 242 VVVKTDTNKYLIQKRPTTG-LLSNFYM-----FLSFESDT---P----YTSQV-EFLSENLPFKVNLNE----KCLGNVK 303 (363)
Q Consensus 242 ~ii~~~~~~vLl~~R~~~~-~~~glWE-----FPggk~e~---~----~~~al-rEl~Eelgl~v~~~~----~~l~~v~ 303 (363)
+++.+.+|++|+.||...+ .+.|+|. -|.++... + ...|+ |.|+-||||...... .+++.+.
T Consensus 57 VFlFns~~~lLlQqRS~~KitFP~~~TNtccSHPL~~~~el~~~d~lGVr~AAqRkL~~ELGIp~e~v~pee~~~ltrih 136 (225)
T KOG0142|consen 57 VFLFNSKNELLLQQRSDEKITFPGLWTNTCCSHPLYNPGELEENDALGVRRAAQRKLKAELGIPLEEVPPEEFNFLTRIH 136 (225)
T ss_pred EEEecCcchHHHhhhccccccccchhhhhhhcCcCCChhhhccCchHHHHHHHHHHHHHhhCCCccccCHHHcccceeee
Confidence 3455777999999997765 4677773 45533322 1 24556 899999999765332 1333333
Q ss_pred EEc-cce-----EEEEEEEEE-EecCC-CCCCCCCCeeecccccCc------CCCChHHHHHHH
Q psy9547 304 HVF-SHL-----KWNMDVYSG-TAKEK-TIPANKTYKLITETQMKK------YAFPVPYQKVWK 353 (363)
Q Consensus 304 h~~-sh~-----~~~l~~~~~-~~~~~-~~~~~~~~~W~~~~el~~------~~~~~a~~~il~ 353 (363)
|.= |+. .+....|.. ++.-. .+.|-++++|++.+||.. ++|.|=.+-|.+
T Consensus 137 YkA~sdg~wGEhEiDYiL~~~~~~~~nPnpnEv~e~ryvs~eelkel~~~~~~~~TPWfkli~~ 200 (225)
T KOG0142|consen 137 YKAPSDGIWGEHEIDYILFLVKDVTLNPNPNEVSEIRYVSREELKELVAKASAGFTPWFKLISE 200 (225)
T ss_pred eecCCCCCcccceeeEEEEEeccCCCCCChhhhhHhheecHHHHHHHHhccccCCChHHHHHHH
Confidence 321 222 222222222 22111 335668899999999964 455554444433
No 168
>TIGR00575 dnlj DNA ligase, NAD-dependent. The member of this family from Treponema pallidum differs in having three rather than just one copy of the BRCT (BRCA1 C Terminus) domain (pfam00533) at the C-terminus. It is included in the seed.
Probab=81.08 E-value=5.2 Score=42.31 Aligned_cols=69 Identities=20% Similarity=0.286 Sum_probs=39.3
Q ss_pred HHHHHHhC--CCHHHHhcCCHHHHHHHHhccCchH-HHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHH
Q psy9547 58 YEKFIKTY--PTIKDFAFDTEDNVLKMWEGLGYYS-RVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALAS 133 (363)
Q Consensus 58 ~~~l~~~~--pt~~~la~a~~~el~~~~~~lG~~~-rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~ 133 (363)
+..|.+.. -++.+|..+..++|.++ -||.. .|.+|.+..+.-.. ..+ ...-..+.|||||+++|..++.
T Consensus 445 i~~L~~~g~I~~~~Dl~~L~~~~L~~L---~GfG~Ksa~nIl~~Ie~sk~---~~l-~r~L~aLgIpgVG~~~ak~L~~ 516 (652)
T TIGR00575 445 IEQLFEKKLVRSVADLYALKKEDLLEL---EGFGEKSAQNLLNAIEKSKE---KPL-ARLLFALGIRHVGEVTAKNLAK 516 (652)
T ss_pred HHHHHHcCCcCCHHHHHhcCHHHHhhc---cCccHHHHHHHHHHHHHhcc---CcH-HHHHhhccCCCcCHHHHHHHHH
Confidence 34455432 48999998888777653 35654 56677654443221 111 1123334577778777776654
No 169
>PRK13482 DNA integrity scanning protein DisA; Provisional
Probab=80.86 E-value=2 Score=41.51 Aligned_cols=36 Identities=25% Similarity=0.473 Sum_probs=28.8
Q ss_pred HHHHHHhhHHhCCC---CcccHHHHhcCCCCcHHHHHHH
Q psy9547 96 QAGCRQVIEQFGGE---VPRDKKQLLSIKGVGDYTAGAL 131 (363)
Q Consensus 96 ~~~a~~i~~~~~g~---~p~~~~~L~~lpGIG~~tA~~i 131 (363)
+.+|+.++++||+. +..+.++|.+++|||+..|..|
T Consensus 297 k~iAk~Ll~~FGSL~~Il~As~eeL~~VeGIGe~rA~~I 335 (352)
T PRK13482 297 SAVIENLVEHFGSLQGLLAASIEDLDEVEGIGEVRARAI 335 (352)
T ss_pred HHHHHHHHHHcCCHHHHHcCCHHHHhhCCCcCHHHHHHH
Confidence 56788899988752 3457889999999999998874
No 170
>PRK07956 ligA NAD-dependent DNA ligase LigA; Validated
Probab=80.15 E-value=5 Score=42.47 Aligned_cols=39 Identities=15% Similarity=0.231 Sum_probs=23.5
Q ss_pred HHHHHh--CCCHHHHhcCCHHHHHHHHhccCchH-HHHHHHHHHH
Q psy9547 59 EKFIKT--YPTIKDFAFDTEDNVLKMWEGLGYYS-RVRNFQAGCR 100 (363)
Q Consensus 59 ~~l~~~--~pt~~~la~a~~~el~~~~~~lG~~~-rA~~l~~~a~ 100 (363)
..|.+. ..++.+|..+..++|..+ -||.. .|.+|.+..+
T Consensus 459 ~~L~~~g~I~~i~DL~~L~~~~L~~l---~gfG~Ksa~~ll~~Ie 500 (665)
T PRK07956 459 EQLFEKGLIHDPADLFKLTAEDLLGL---EGFGEKSAQNLLDAIE 500 (665)
T ss_pred HHHHHcCCCCCHHHHHhcCHHHHhcC---cCcchHHHHHHHHHHH
Confidence 444443 358888888887776553 36654 4666655433
No 171
>PRK14601 ruvA Holliday junction DNA helicase RuvA; Provisional
Probab=80.05 E-value=1.9 Score=37.84 Aligned_cols=44 Identities=25% Similarity=0.161 Sum_probs=27.4
Q ss_pred HHHHhcCCCCcHHHHHHHHHH-hcCCccccccchHHHHHhhhccc
Q psy9547 114 KKQLLSIKGVGDYTAGALASI-CYNIPTPAVDGNVFRIYGRLFEI 157 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~-~~~~~~~~vD~~v~Rvl~Rl~~~ 157 (363)
++.|+++.||||++|=+||+- ..+.-.-++..+=...+.++-++
T Consensus 72 F~~Li~VsGIGpK~Al~ILs~~~~~el~~aI~~~D~~~L~~vpGI 116 (183)
T PRK14601 72 FEMLLKVNGIGANTAMAVCSSLDVNSFYKALSLGDESVLKKVPGI 116 (183)
T ss_pred HHHHhccCCccHHHHHHHHcCCCHHHHHHHHHhCCHHHHhhCCCC
Confidence 677999999999999999974 22222223333333444555443
No 172
>TIGR00084 ruvA Holliday junction DNA helicase, RuvA subunit. RuvA specifically binds Holliday junctions as a sandwich of two tetramers and maintains the configuration of the junction. It forms a complex with two hexameric rings of RuvB, the subunit that contains helicase activity. The complex drives ATP-dependent branch migration of the Holliday junction recombination intermediate. The endonuclease RuvC resolves junctions.
Probab=79.14 E-value=1.5 Score=38.77 Aligned_cols=21 Identities=38% Similarity=0.419 Sum_probs=18.6
Q ss_pred HHHHhcCCCCcHHHHHHHHHH
Q psy9547 114 KKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~ 134 (363)
...|++++||||++|-.+|..
T Consensus 71 F~~L~~V~GIGpK~Al~iL~~ 91 (191)
T TIGR00084 71 FKELIKVNGVGPKLALAILSN 91 (191)
T ss_pred HHHHhCCCCCCHHHHHHHHhc
Confidence 677999999999999999764
No 173
>TIGR00608 radc DNA repair protein radc. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=78.89 E-value=9 Score=34.65 Aligned_cols=63 Identities=10% Similarity=0.220 Sum_probs=47.1
Q ss_pred HHHHHHHHhhhcchh-hHHHHHHHHHHhC---CCHHHHhcCCHHHHHHHHhccCchHHHHHH---HHHHHHh
Q psy9547 38 YVWISEVMLQQTQVK-TVLPYYEKFIKTY---PTIKDFAFDTEDNVLKMWEGLGYYSRVRNF---QAGCRQV 102 (363)
Q Consensus 38 ~~lv~~il~qqt~~~-~v~~~~~~l~~~~---pt~~~la~a~~~el~~~~~~lG~~~rA~~l---~~~a~~i 102 (363)
..|++.+|..-+... .+.+..++++++| ++...|.+++.+|+.+ ++|+|-. ||..| .++++.+
T Consensus 17 ~ELLailL~~g~~~~~~~~~lA~~ll~~f~~~g~l~~l~~a~~~eL~~-i~GiG~a-ka~~l~a~~El~rR~ 86 (218)
T TIGR00608 17 YELLAIILRTGTPKGLDVLSLSKRLLDVFGRQDSLGHLLSAPPEELSS-VPGIGEA-KAIQLKAAVELAKRY 86 (218)
T ss_pred HHHHHHHHhCCCCCCCCHHHHHHHHHHHhcccCCHHHHHhCCHHHHHh-CcCCcHH-HHHHHHHHHHHHHHH
Confidence 446777777777777 8999999999999 9999999999999877 4666643 44444 4444444
No 174
>smart00483 POLXc DNA polymerase X family. includes vertebrate polymerase beta and terminal deoxynucleotidyltransferases
Probab=78.53 E-value=5.1 Score=38.72 Aligned_cols=43 Identities=16% Similarity=0.285 Sum_probs=32.1
Q ss_pred HHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHHHh
Q psy9547 91 RVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALASIC 135 (363)
Q Consensus 91 rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~~~ 135 (363)
|+++..++|..|.. +...+ .+.++|.+|||||+.+|+.|--+.
T Consensus 26 k~~ay~~Aa~~i~~-l~~~i-~~~~~l~~lpgIG~~ia~kI~Eil 68 (334)
T smart00483 26 KCSYFRKAASVLKS-LPFPI-NSMKDLKGLPGIGDKIKKKIEEII 68 (334)
T ss_pred HHHHHHHHHHHHHh-CCCCC-CCHHHHhcCCCccHHHHHHHHHHH
Confidence 67777777777754 33333 456789999999999999988763
No 175
>COG1555 ComEA DNA uptake protein and related DNA-binding proteins [DNA replication, recombination, and repair]
Probab=77.98 E-value=4.6 Score=34.27 Aligned_cols=55 Identities=18% Similarity=0.357 Sum_probs=37.1
Q ss_pred HhcCCHHHHHHHHhccCchHHHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHH
Q psy9547 71 FAFDTEDNVLKMWEGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALAS 133 (363)
Q Consensus 71 la~a~~~el~~~~~~lG~~~rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~ 133 (363)
|-.|+.+|| ..+.++|- .+|+.|++ ..+. +|.+ .+.++|.+.+|||++|.+-..-
T Consensus 91 iNtAs~eeL-~~lpgIG~-~kA~aIi~----yRe~-~G~f-~sv~dL~~v~GiG~~~~ekl~~ 145 (149)
T COG1555 91 INTASAEEL-QALPGIGP-KKAQAIID----YREE-NGPF-KSVDDLAKVKGIGPKTLEKLKD 145 (149)
T ss_pred ccccCHHHH-HHCCCCCH-HHHHHHHH----HHHH-cCCC-CcHHHHHhccCCCHHHHHHHHh
Confidence 446788888 66666774 24555543 1222 3433 6899999999999999876543
No 176
>cd00141 NT_POLXc Nucleotidyltransferase (NT) domain of family X DNA Polymerases. X family polymerases fill in short gaps during DNA repair. They are relatively inaccurate enzymes and play roles in base excision repair, in non-homologous end joining (NHEJ) which acts mainly to repair damage due to ionizing radiation, and in V(D)J recombination. This family includes eukaryotic Pol beta, Pol lambda, Pol mu, and terminal deoxyribonucleotidyl transferase (TdT). Pol beta and Pol lambda are primarily DNA template-dependent polymerases. TdT is a DNA template-independent polymerase. Pol mu has both template dependent and template independent activities. This subgroup belongs to the Pol beta-like NT superfamily. In the majority of enzymes in this superfamily, two carboxylates, Dx[D/E], together with a third more distal carboxylate, coordinate two divalent metal cations involved in a two-metal ion mechanism of nucleotide addition. These three carboxylate residues are fairly well conserved in this
Probab=77.96 E-value=4.7 Score=38.45 Aligned_cols=24 Identities=42% Similarity=0.508 Sum_probs=14.4
Q ss_pred CCcccHHHHhcCCCCcHHHHHHHH
Q psy9547 109 EVPRDKKQLLSIKGVGDYTAGALA 132 (363)
Q Consensus 109 ~~p~~~~~L~~lpGIG~~tA~~il 132 (363)
+.|..+.+|+++|||||+||..+-
T Consensus 79 ~~~~~l~~l~~i~GiGpk~a~~l~ 102 (307)
T cd00141 79 DVPPGLLLLLRVPGVGPKTARKLY 102 (307)
T ss_pred cchHHHHHHHcCCCCCHHHHHHHH
Confidence 345555666666666666666554
No 177
>smart00279 HhH2 Helix-hairpin-helix class 2 (Pol1 family) motifs.
Probab=77.38 E-value=1.8 Score=27.26 Aligned_cols=16 Identities=38% Similarity=0.430 Sum_probs=13.3
Q ss_pred hcCCCCcHHHHHHHHH
Q psy9547 118 LSIKGVGDYTAGALAS 133 (363)
Q Consensus 118 ~~lpGIG~~tA~~il~ 133 (363)
-.+||||++||--++.
T Consensus 19 ~Gv~giG~ktA~~ll~ 34 (36)
T smart00279 19 PGVKGIGPKTALKLLR 34 (36)
T ss_pred CCCCcccHHHHHHHHH
Confidence 4689999999987764
No 178
>PRK00024 hypothetical protein; Reviewed
Probab=77.26 E-value=8.7 Score=34.89 Aligned_cols=64 Identities=11% Similarity=0.242 Sum_probs=48.5
Q ss_pred HHHHHHHHhhhcchhhHHHHHHHHHHhCCCHHHHhcCCHHHHHHHHhccCchHHHHHH---HHHHHHhh
Q psy9547 38 YVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAFDTEDNVLKMWEGLGYYSRVRNF---QAGCRQVI 103 (363)
Q Consensus 38 ~~lv~~il~qqt~~~~v~~~~~~l~~~~pt~~~la~a~~~el~~~~~~lG~~~rA~~l---~~~a~~i~ 103 (363)
..|++.+|..-+....+.+..+++++.|++...+..++.+|+.+ ++|+|-. +|..| .++++.+.
T Consensus 27 ~ELLa~lL~~g~~~~~~~~LA~~LL~~fgsL~~l~~as~~eL~~-i~GIG~a-kA~~L~a~~El~~R~~ 93 (224)
T PRK00024 27 AELLAILLRTGTKGKSVLDLARELLQRFGSLRGLLDASLEELQS-IKGIGPA-KAAQLKAALELARRIL 93 (224)
T ss_pred HHHHHHHHcCCCCCCCHHHHHHHHHHHcCCHHHHHhCCHHHHhh-ccCccHH-HHHHHHHHHHHHHHHH
Confidence 45677788877888889999999999999999999999999977 4666642 44444 44444443
No 179
>PRK14602 ruvA Holliday junction DNA helicase RuvA; Provisional
Probab=76.54 E-value=3.1 Score=37.22 Aligned_cols=21 Identities=43% Similarity=0.279 Sum_probs=19.0
Q ss_pred HHHHhcCCCCcHHHHHHHHHH
Q psy9547 114 KKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~ 134 (363)
+..|+++.||||++|=.||+.
T Consensus 73 F~~Li~V~GIGpK~Al~iLs~ 93 (203)
T PRK14602 73 FIVLISISKVGAKTALAILSQ 93 (203)
T ss_pred HHHHhCCCCcCHHHHHHHHhh
Confidence 677899999999999999984
No 180
>KOG2841|consensus
Probab=76.49 E-value=9.2 Score=34.75 Aligned_cols=69 Identities=9% Similarity=0.134 Sum_probs=46.3
Q ss_pred CCCC-CCCCChHHHHHHHHHhhhcchhhHHHH-HHHHHHhCCCHHHHhcCCHHHHHHHHhccCch-HHHHHHHHHHHHh
Q psy9547 27 QLPW-RESNNPYYVWISEVMLQQTQVKTVLPY-YEKFIKTYPTIKDFAFDTEDNVLKMWEGLGYY-SRVRNFQAGCRQV 102 (363)
Q Consensus 27 ~lpw-~~~~dp~~~lv~~il~qqt~~~~v~~~-~~~l~~~~pt~~~la~a~~~el~~~~~~lG~~-~rA~~l~~~a~~i 102 (363)
+++. +..+|+...+.+++. ++.-|.+. ...|...|+|.+.+..|+.+|++.+ -|.. +||++|++.....
T Consensus 179 dli~~~~~~d~ls~~~~~Lt----~i~~VnKtda~~LL~~FgsLq~~~~AS~~ele~~---~G~G~~kak~l~~~l~~p 250 (254)
T KOG2841|consen 179 DLIMERKDRDLLSSLLGFLT----TIPGVNKTDAQLLLQKFGSLQQISNASEGELEQC---PGLGPAKAKRLHKFLHQP 250 (254)
T ss_pred hhhhhcccccHHHHHHHHHH----hCCCCCcccHHHHHHhcccHHHHHhcCHhHHHhC---cCcCHHHHHHHHHHHhcc
Confidence 5565 445666665555543 22222221 1257789999999999999999886 6666 5999998876543
No 181
>PF10391 DNA_pol_lambd_f: Fingers domain of DNA polymerase lambda; InterPro: IPR018944 DNA polymerases catalyse the addition of dNMPs onto the 3-prime ends of DNA chains. There is a general polymerase fold consisting of three subdomains that have been likened to the fingers, palm, and thumb of a right hand. This entry represents the central three-helical region of DNA polymerase lambda referred to as the F and G helices of the fingers domain. Contacts with DNA involve this conserved helix-hairpin-helix motif in the fingers region which interacts with the primer strand. This motif is common to several DNA binding proteins and confers a sequence-independent interaction with the DNA backbone []. ; GO: 0016779 nucleotidyltransferase activity; PDB: 1KDH_A 1KEJ_A 1JMS_A 2IHM_A 3OGU_A 1MQ2_A 2P66_A 7ICI_A 1ZQN_A 1ZQK_A ....
Probab=76.48 E-value=2.6 Score=28.92 Aligned_cols=23 Identities=30% Similarity=0.173 Sum_probs=16.1
Q ss_pred HHHHhcCCCCcHHHHHHHHHHhc
Q psy9547 114 KKQLLSIKGVGDYTAGALASICY 136 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~~~ 136 (363)
++.++++.|||+.||.-....++
T Consensus 1 l~~f~~I~GVG~~tA~~w~~~G~ 23 (52)
T PF10391_consen 1 LKLFTGIWGVGPKTARKWYAKGI 23 (52)
T ss_dssp HHHHHTSTT--HHHHHHHHHTT-
T ss_pred CcchhhcccccHHHHHHHHHhCC
Confidence 35788999999999998776544
No 182
>PRK14606 ruvA Holliday junction DNA helicase RuvA; Provisional
Probab=75.10 E-value=3.1 Score=36.72 Aligned_cols=21 Identities=24% Similarity=0.112 Sum_probs=18.8
Q ss_pred HHHHhcCCCCcHHHHHHHHHH
Q psy9547 114 KKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~ 134 (363)
++.|+++.||||++|=+||+.
T Consensus 72 F~~Li~V~GIGpK~AL~iLs~ 92 (188)
T PRK14606 72 FLSLTKVSRLGPKTALKIISN 92 (188)
T ss_pred HHHHhccCCccHHHHHHHHcC
Confidence 677899999999999999964
No 183
>PRK14350 ligA NAD-dependent DNA ligase LigA; Provisional
Probab=74.59 E-value=13 Score=39.35 Aligned_cols=38 Identities=16% Similarity=0.254 Sum_probs=20.0
Q ss_pred HHHHHHHh--CCCHHHHhcCCHHHHHHHHhccCchH-HHHHHHH
Q psy9547 57 YYEKFIKT--YPTIKDFAFDTEDNVLKMWEGLGYYS-RVRNFQA 97 (363)
Q Consensus 57 ~~~~l~~~--~pt~~~la~a~~~el~~~~~~lG~~~-rA~~l~~ 97 (363)
.++.|++. .-++.+|..+..++|.++ =||.. .|.+|.+
T Consensus 448 ~i~~L~~~G~i~~~~Dly~L~~~~l~~l---~g~geksa~nl~~ 488 (669)
T PRK14350 448 TIEFLFEKKFISSEIDLYTFNFDRLINL---KGFKDKRINNLKR 488 (669)
T ss_pred HHHHHHHcCCcCCHHHHhhCCHHHHhhc---cCccHHHHHHHHH
Confidence 34445543 246777777776665553 24543 4555533
No 184
>PF03118 RNA_pol_A_CTD: Bacterial RNA polymerase, alpha chain C terminal domain; InterPro: IPR011260 The core of the bacterial RNA polymerase (RNAP) consists of four subunits, two alpha, a beta and a beta', which are conserved from bacteria to mammals. The alpha subunit (RpoA) initiates RNAP assembly by dimerising to form a platform on which the beta subunits can interact. The alpha subunit consists of a N-terminal domain (NTD) and a C-terminal domain (CTD), connected by a short linker. The NTD is essential for RNAP assembly, while the CTD is necessary for transcription regulation, interacting with transcription factors and promoter upstream elements. In Escherichia coli, the catabolite activator protein (CAP or CRP) was shown to exert its effect through its interactions with the CTD, where CAP binding to CTD promotes RNAP binding to promoter DNA, thereby stimulating transcription initiation at class I CAP-dependent promoters. At class II CAP-dependent promoters, the interaction of CAP with CTD is one of multiple interactions involved in activation []. The CTD has a compact structure of four helices and two long arms enclosing its hydrophobic core, making its folding topology distinct from most other binding proteins. The upstream promoter element-binding site is formed from helices 1 and 4 [].; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 3N97_B 1XS9_D 3K4G_A 3N4M_B 1COO_A 1LB2_E 3IYD_A 1Z3E_B 3GFK_B 3IHQ_B ....
Probab=73.96 E-value=2 Score=31.05 Aligned_cols=48 Identities=25% Similarity=0.277 Sum_probs=28.2
Q ss_pred HhccCchHHHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHH
Q psy9547 83 WEGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALA 132 (363)
Q Consensus 83 ~~~lG~~~rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il 132 (363)
+..+|++.||.+...-+..- . .+..+--+.++|+++||+|+++.+-|.
T Consensus 14 I~~L~LS~Ra~n~L~~~~I~-t-v~dL~~~s~~~L~~i~n~G~ksl~EI~ 61 (66)
T PF03118_consen 14 IEDLGLSVRAYNCLKRAGIH-T-VGDLVKYSEEDLLKIKNFGKKSLEEIK 61 (66)
T ss_dssp GGGSTSBHHHHHHHHCTT---B-HHHHHCS-HHHHHTSTTSHHHHHHHHH
T ss_pred HHHhCCCHHHHHHHHHhCCc-C-HHHHHhCCHHHHHhCCCCCHhHHHHHH
Confidence 44588887876653321110 0 000112357899999999999988664
No 185
>COG1796 POL4 DNA polymerase IV (family X) [DNA replication, recombination, and repair]
Probab=73.88 E-value=5.3 Score=37.96 Aligned_cols=57 Identities=12% Similarity=0.264 Sum_probs=38.8
Q ss_pred HHHHHHhccCchH-HHHHHHHHHHHhhHHhCCCCcccHHH--HhcCCCCcHHHHHHHHHHh
Q psy9547 78 NVLKMWEGLGYYS-RVRNFQAGCRQVIEQFGGEVPRDKKQ--LLSIKGVGDYTAGALASIC 135 (363)
Q Consensus 78 el~~~~~~lG~~~-rA~~l~~~a~~i~~~~~g~~p~~~~~--L~~lpGIG~~tA~~il~~~ 135 (363)
++...+.-.|=-. |++.-+++|+.+.. -+-++|...+. +++|||||+.+|+.|.-+.
T Consensus 14 ~iA~~me~~Gen~fk~~aYr~Aa~sle~-~~e~~~ei~e~~~~t~l~gIGk~ia~~I~e~l 73 (326)
T COG1796 14 RIADYMELEGENPFKIRAYRKAAQSLEN-LTEDLEEIEERGRLTELPGIGKGIAEKISEYL 73 (326)
T ss_pred HHHHHHHhcCCCccchHHHHHHHHhhhh-cccchHHHHhhcccCCCCCccHHHHHHHHHHH
Confidence 3334444344332 77777888888755 24456655555 8899999999999998774
No 186
>PLN03060 inositol phosphatase-like protein; Provisional
Probab=72.24 E-value=47 Score=29.71 Aligned_cols=110 Identities=16% Similarity=0.240 Sum_probs=71.7
Q ss_pred HHHHHHHHHhhcCCCCCCCCCCChHHHHHHHHHhhhcc-------------hhhHHHHHHHHHHhCCCHHHHhcCCHHHH
Q psy9547 13 FQESILTWYKQNARQLPWRESNNPYYVWISEVMLQQTQ-------------VKTVLPYYEKFIKTYPTIKDFAFDTEDNV 79 (363)
Q Consensus 13 ~~~~l~~~~~~~~~~lpw~~~~dp~~~lv~~il~qqt~-------------~~~v~~~~~~l~~~~pt~~~la~a~~~el 79 (363)
+.+.=.++|..+.++.| .-|..+|-++|.++-- .--+..+|.+|++.||..++... -.
T Consensus 5 VsDtKr~F~~~~p~pI~-----siYrrvv~ELLVE~HLl~~n~~f~yD~lfAlGlvt~fd~fm~GY~Pee~~~~----IF 75 (206)
T PLN03060 5 VADTKASFLKAYRKPIP-----SIYSNVIQELLVQQHLMRYNATYKYDPIFALGFVTVYDQLMDGYPNATDRDA----IF 75 (206)
T ss_pred HHHHHHHHHHhCCCCCc-----hHHHHHHHHHHHHHHHHHhccCceeCchHHhhHHHHHHHHHcCCCChHHHHH----HH
Confidence 44455567777766554 6788999998887751 11355789999999998776543 34
Q ss_pred HHHHhccCchHHHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHH
Q psy9547 80 LKMWEGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALAS 133 (363)
Q Consensus 80 ~~~~~~lG~~~rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~ 133 (363)
.+++..+|+ -+..+++-|+.+.+...|.-..++..+++-.|-|+-....++.
T Consensus 76 ~Alc~a~~~--dp~~~r~dA~~l~~~a~~~s~~~l~~~l~~~~~~~~~l~~~~~ 127 (206)
T PLN03060 76 KAYIEALGE--DPDQYRKDAKKLEEWASSQSASGIADFNSGDGEVEAVLKDIAE 127 (206)
T ss_pred HHHHHHcCC--CHHHHHHHHHHHHHHHhcCCHHHHHHHHhcccccchHHHHHHH
Confidence 467778898 4455666666666655565556666666666655544444433
No 187
>KOG2457|consensus
Probab=71.70 E-value=17 Score=35.41 Aligned_cols=153 Identities=18% Similarity=0.237 Sum_probs=84.0
Q ss_pred CCCCcHHHHHHHHHHhcCCccccccchH-------------------HHHHhhhcccccCcCCcccHHHHHHHHHhcCCc
Q psy9547 120 IKGVGDYTAGALASICYNIPTPAVDGNV-------------------FRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQ 180 (363)
Q Consensus 120 lpGIG~~tA~~il~~~~~~~~~~vD~~v-------------------~Rvl~Rl~~~~~~~~~~~~~~~~~~~~~~~~p~ 180 (363)
..|.|.|+-.-.|.-+...-+.-..+|+ .-|++-.++...++.++...+.+-+...-..+
T Consensus 173 WaGlGyY~R~rrL~ega~~vv~~~~ge~Prta~~l~kgvpGVG~YTAGAiaSIAf~q~tGiVDGNVirvlsRalAIhsD- 251 (555)
T KOG2457|consen 173 WAGLGYYRRARRLLEGAKMVVAGTEGEFPRTASSLMKGVPGVGQYTAGAIASIAFNQVTGIVDGNVIRVLSRALAIHSD- 251 (555)
T ss_pred HhhhhHHHHHHHHHHHHHHHHHhCCCCCCChHHHHHhhCCCCCccchhhhhhhhhcCcccccccchHHHhHHhHhhcCC-
Confidence 6799999988777765433222222222 23344444444444445555555443333222
Q ss_pred ccHHHHHHHHHHHhhhhCcCCCCC-------------C-CCCCCcccChhhhcCCcc---c----------CCcccCCcc
Q psy9547 181 TNAREFNQALMDLGATVCLFKNPK-------------C-KECPLSRFCSAYKNNTIE---N----------FPVKLGKTK 233 (363)
Q Consensus 181 ~~~~~~~~~l~~~G~~~C~~~~P~-------------C-~~Cpl~~~C~~~~~~~~~---~----------~p~k~~k~~ 233 (363)
-..+.+++.+|.|++..--|.+|. | ..=|.-+.|+-...-+.- . +|.+-.+-+
T Consensus 252 cSkgk~~q~~wkLA~qLVDP~RPGDFNQalMELGAt~CTpq~P~CS~CPvss~CrA~q~~Eqn~~i~~g~t~~~dve~~~ 331 (555)
T KOG2457|consen 252 CSKGKFFQSSWKLAAQLVDPSRPGDFNQALMELGATLCTPQKPSCSSCPVSSQCRAFQLSEQNRTISVGDTYPTDVEKAK 331 (555)
T ss_pred cchhhHHHHHHHHHHHhcCCCCCCcHHHHHHHhcCeeccCCCCCcCCCCcHHHHHHHhHhhhcceeecCCcCCccHhhcC
Confidence 346889999999999999988876 4 334666677754322221 2 222111212
Q ss_pred ccceeEEEEEE--Ee-----CCceEEEeecCCCccccccccccccCCCC
Q psy9547 234 VKDVFLLTVVV--KT-----DTNKYLIQKRPTTGLLSNFYMFLSFESDT 275 (363)
Q Consensus 234 ~~~~~v~~~ii--~~-----~~~~vLl~~R~~~~~~~glWEFPggk~e~ 275 (363)
+.+-.-.+++. -+ ..|.+-+-+||.. .+|.||||..-+-+
T Consensus 332 P~~~~C~vCv~~ips~e~~Qs~gv~~~p~~p~~--~~~reE~~~v~~~e 378 (555)
T KOG2457|consen 332 PRHDFCCVCVLEIPSLERNQSGGVFVLPKRPEQ--LAGREEFPSVILNE 378 (555)
T ss_pred CCCCceeEeecCCCCchhhccCcEEEeccChhh--cCcccccceeeeec
Confidence 22111222232 00 1256777888766 79999999865543
No 188
>TIGR00426 competence protein ComEA helix-hairpin-helix repeat region. Members of the subfamily recognized by this model include competence protein ComEA and closely related proteins from a number of species that exhibit competence for transformation by exongenous DNA, including Streptococcus pneumoniae, Bacillus subtilis, Neisseria meningitidis, and Haemophilus influenzae. This model represents a region of two tandem copies of a helix-hairpin-helix domain (pfam00633), each about 30 residues in length. Limited sequence similarity can be found among some members of this family N-terminal to the region covered by this model.
Probab=71.57 E-value=4.2 Score=29.38 Aligned_cols=22 Identities=32% Similarity=0.410 Sum_probs=19.8
Q ss_pred cHHHHhc-CCCCcHHHHHHHHHH
Q psy9547 113 DKKQLLS-IKGVGDYTAGALASI 134 (363)
Q Consensus 113 ~~~~L~~-lpGIG~~tA~~il~~ 134 (363)
..++|.. +||||+.+|..|+.+
T Consensus 14 ~~~~L~~~ipgig~~~a~~Il~~ 36 (69)
T TIGR00426 14 TAEELQRAMNGVGLKKAEAIVSY 36 (69)
T ss_pred CHHHHHhHCCCCCHHHHHHHHHH
Confidence 4678999 999999999999987
No 189
>KOG2534|consensus
Probab=71.47 E-value=7 Score=37.06 Aligned_cols=66 Identities=14% Similarity=0.319 Sum_probs=44.1
Q ss_pred HHHHHHHHHHHhhHHhCCCCc-ccHHHHhcCCCCcHHHHHHHHHHh-cCC-c---cccccc--hHHHHHhhhccccc
Q psy9547 91 RVRNFQAGCRQVIEQFGGEVP-RDKKQLLSIKGVGDYTAGALASIC-YNI-P---TPAVDG--NVFRIYGRLFEIED 159 (363)
Q Consensus 91 rA~~l~~~a~~i~~~~~g~~p-~~~~~L~~lpGIG~~tA~~il~~~-~~~-~---~~~vD~--~v~Rvl~Rl~~~~~ 159 (363)
|++....+|..+.. + .+| .+.+++.+|||||+++|..|--|. -|. + .+.-|- -+...+.+.+++..
T Consensus 34 r~~~y~~Aasvlk~-~--p~~I~S~~ea~~lP~iG~kia~ki~EiletG~l~ele~v~~de~~~~lklFtnifGvG~ 107 (353)
T KOG2534|consen 34 RARAYRRAASVLKS-L--PFPITSGEEAEKLPGIGPKIAEKIQEILETGVLRELEAVRNDERSQSLKLFTNIFGVGL 107 (353)
T ss_pred HHHHHHHHHHHHHh-C--CCCcccHHHhcCCCCCCHHHHHHHHHHHHcCCchhHHHHhcchhHHHHHHHHHHhccCH
Confidence 77777777777654 2 223 578999999999999999988774 232 1 233222 35667777777653
No 190
>PRK13482 DNA integrity scanning protein DisA; Provisional
Probab=71.45 E-value=9.9 Score=36.82 Aligned_cols=47 Identities=17% Similarity=0.422 Sum_probs=38.0
Q ss_pred HHHHHHHHhCCCHHHHhcCCHHHHHHHHhccCchHHHHHHHHHHHHhhH
Q psy9547 56 PYYEKFIKTYPTIKDFAFDTEDNVLKMWEGLGYYSRVRNFQAGCRQVIE 104 (363)
Q Consensus 56 ~~~~~l~~~~pt~~~la~a~~~el~~~~~~lG~~~rA~~l~~~a~~i~~ 104 (363)
+..++++++|+|...+.+|+.+++.++ +|+| ..||+.|++.++.+.+
T Consensus 298 ~iAk~Ll~~FGSL~~Il~As~eeL~~V-eGIG-e~rA~~I~e~l~Rl~e 344 (352)
T PRK13482 298 AVIENLVEHFGSLQGLLAASIEDLDEV-EGIG-EVRARAIREGLSRLAE 344 (352)
T ss_pred HHHHHHHHHcCCHHHHHcCCHHHHhhC-CCcC-HHHHHHHHHHHHHHHH
Confidence 566788999999999999999998664 5555 2478889998888766
No 191
>PRK14603 ruvA Holliday junction DNA helicase RuvA; Provisional
Probab=71.37 E-value=3.1 Score=37.00 Aligned_cols=43 Identities=26% Similarity=0.116 Sum_probs=26.8
Q ss_pred HHHHhcCCCCcHHHHHHHHHH-hcCCccccccchHHHHHhhhcc
Q psy9547 114 KKQLLSIKGVGDYTAGALASI-CYNIPTPAVDGNVFRIYGRLFE 156 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~-~~~~~~~~vD~~v~Rvl~Rl~~ 156 (363)
+..|+++.||||++|=+||+. ..+.-.-++..+=...++++-+
T Consensus 71 F~~L~~V~GIGpK~AL~iLs~~~~~~l~~aI~~~D~~~L~kvpG 114 (197)
T PRK14603 71 FELLLGVSGVGPKLALALLSALPPALLARALLEGDARLLTSASG 114 (197)
T ss_pred HHHHhCcCCcCHHHHHHHHcCCCHHHHHHHHHhCCHHHHhhCCC
Confidence 677999999999999999983 3433222333322333444433
No 192
>COG2003 RadC DNA repair proteins [DNA replication, recombination, and repair]
Probab=71.12 E-value=19 Score=32.56 Aligned_cols=66 Identities=8% Similarity=0.222 Sum_probs=52.7
Q ss_pred HHHHHHHHhhhcchhhHHHHHHHHHHhCCCHHHHhcCCHHHHHHHHhccCchH--HHHHHHHHHHHhhH
Q psy9547 38 YVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAFDTEDNVLKMWEGLGYYS--RVRNFQAGCRQVIE 104 (363)
Q Consensus 38 ~~lv~~il~qqt~~~~v~~~~~~l~~~~pt~~~la~a~~~el~~~~~~lG~~~--rA~~l~~~a~~i~~ 104 (363)
..|++.+|---|.-..|.+.-+++++.|.+...|..++.+++.. ++|+|... ..+.+.++++.+..
T Consensus 27 ~ELLailLrtG~~~~~~~~la~~lL~~fg~L~~l~~a~~~el~~-v~GiG~aka~~l~a~~El~~R~~~ 94 (224)
T COG2003 27 AELLAILLRTGTKGESVLDLAKELLQEFGSLAELLKASVEELSS-VKGIGLAKAIQIKAAIELGKRILA 94 (224)
T ss_pred HHHHHHHHhcCCCCCCHHHHHHHHHHHcccHHHHHhCCHHHHhh-CCCccHHHHHHHHHHHHHHHHHHH
Confidence 56788888888899999999999999999999999999999866 57777542 34455666666655
No 193
>PRK14351 ligA NAD-dependent DNA ligase LigA; Provisional
Probab=69.58 E-value=17 Score=38.66 Aligned_cols=66 Identities=24% Similarity=0.280 Sum_probs=36.0
Q ss_pred HHHHHHh--CCCHHHHhcCCHHHHHHHHhccCchH-HHHHHHHHHHHhhHHhCCCCcccHHH-H--hcCCCCcHHHHHHH
Q psy9547 58 YEKFIKT--YPTIKDFAFDTEDNVLKMWEGLGYYS-RVRNFQAGCRQVIEQFGGEVPRDKKQ-L--LSIKGVGDYTAGAL 131 (363)
Q Consensus 58 ~~~l~~~--~pt~~~la~a~~~el~~~~~~lG~~~-rA~~l~~~a~~i~~~~~g~~p~~~~~-L--~~lpGIG~~tA~~i 131 (363)
+..|.+. .-++.+|..+..++|..+ -||.. .|.+|.+..+.-. ..| ... | +.|||||+.+|..+
T Consensus 475 i~~L~~~g~V~~~~Dl~~L~~~~L~~l---~g~g~Ksa~~Ll~~Ie~sk-----~~~--l~r~l~ALgIpgIG~~~ak~L 544 (689)
T PRK14351 475 VQQLVDAGLVESLADLYDLTVADLAEL---EGWGETSAENLLAELEASR-----EPP--LADFLVALGIPEVGPTTARNL 544 (689)
T ss_pred HHHHHHcCCCCCHHHHHHcCHHHHhcC---cCcchhHHHHHHHHHHHHc-----cCC--HHHHHHHcCCCCcCHHHHHHH
Confidence 3445543 248888888888775543 35654 5666644332211 112 222 2 23777777777766
Q ss_pred HH
Q psy9547 132 AS 133 (363)
Q Consensus 132 l~ 133 (363)
+.
T Consensus 545 ~~ 546 (689)
T PRK14351 545 AR 546 (689)
T ss_pred HH
Confidence 53
No 194
>cd00141 NT_POLXc Nucleotidyltransferase (NT) domain of family X DNA Polymerases. X family polymerases fill in short gaps during DNA repair. They are relatively inaccurate enzymes and play roles in base excision repair, in non-homologous end joining (NHEJ) which acts mainly to repair damage due to ionizing radiation, and in V(D)J recombination. This family includes eukaryotic Pol beta, Pol lambda, Pol mu, and terminal deoxyribonucleotidyl transferase (TdT). Pol beta and Pol lambda are primarily DNA template-dependent polymerases. TdT is a DNA template-independent polymerase. Pol mu has both template dependent and template independent activities. This subgroup belongs to the Pol beta-like NT superfamily. In the majority of enzymes in this superfamily, two carboxylates, Dx[D/E], together with a third more distal carboxylate, coordinate two divalent metal cations involved in a two-metal ion mechanism of nucleotide addition. These three carboxylate residues are fairly well conserved in this
Probab=69.24 E-value=3.7 Score=39.16 Aligned_cols=38 Identities=18% Similarity=0.367 Sum_probs=28.6
Q ss_pred HHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHH
Q psy9547 91 RVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGA 130 (363)
Q Consensus 91 rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~ 130 (363)
|++...++|..|.. +.-.+ .+.+++.+|||||+.+|+.
T Consensus 23 r~~aY~~Aa~~l~~-l~~~i-~~~~~~~~ipgiG~~ia~k 60 (307)
T cd00141 23 RVRAYRKAARALES-LPEPI-ESLEEAKKLPGIGKKIAEK 60 (307)
T ss_pred hHHHHHHHHHHHHh-CCccc-CCHHHhcCCCCccHHHHHH
Confidence 77777777777655 45555 4577889999999999876
No 195
>PF04904 NCD1: NAB conserved region 1 (NCD1); InterPro: IPR006988 Nab1 and Nab2 are co-repressors that specifically interact with and repress transcription mediated by the three members of the NGFI-A (Egr-1, Krox24, zif/268) family of eukaryotic (metazoa) transcription factors []. This entry represents the N-terminal NAB domain, which interacts with the EGR1 inhibitory domain (R1) []. It may also mediate multimerisation.; GO: 0045892 negative regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=68.18 E-value=16 Score=27.07 Aligned_cols=52 Identities=23% Similarity=0.408 Sum_probs=40.1
Q ss_pred hhhHHHHHHHHHHhC-CCHHHHhcCCHHHHHHHHhccCchHH---HHHHHHHHHHh
Q psy9547 51 VKTVLPYYEKFIKTY-PTIKDFAFDTEDNVLKMWEGLGYYSR---VRNFQAGCRQV 102 (363)
Q Consensus 51 ~~~v~~~~~~l~~~~-pt~~~la~a~~~el~~~~~~lG~~~r---A~~l~~~a~~i 102 (363)
.++-..||..|++.- .+.+.|..+.++|+.+++.-+|..+| .++++++.+..
T Consensus 18 rAnLl~Yyd~fi~~GgDDvqQL~~~~e~eF~eim~lvGM~sKPLHVrRlqKAL~ew 73 (82)
T PF04904_consen 18 RANLLQYYDTFIAQGGDDVQQLCEAGEEEFLEIMALVGMASKPLHVRRLQKALQEW 73 (82)
T ss_pred HhhHHHHHHHHHHHcChhHHHHHhcChHHHHHHHHHhCccCccHHHHHHHHHHHHH
Confidence 355677899998764 48999999999999999999999765 55665544443
No 196
>PRK08097 ligB NAD-dependent DNA ligase LigB; Reviewed
Probab=67.38 E-value=13 Score=38.53 Aligned_cols=75 Identities=13% Similarity=0.181 Sum_probs=45.2
Q ss_pred HHHHHHHh--CCCHHHHhcCCHHHHHHHHhccCchH-HHHHHHHHHHHhhH----------------------HhCC---
Q psy9547 57 YYEKFIKT--YPTIKDFAFDTEDNVLKMWEGLGYYS-RVRNFQAGCRQVIE----------------------QFGG--- 108 (363)
Q Consensus 57 ~~~~l~~~--~pt~~~la~a~~~el~~~~~~lG~~~-rA~~l~~~a~~i~~----------------------~~~g--- 108 (363)
....|++. .-++.+|-.+..+++.++ =||.. .|.+|.+.-+.-.+ .|+.
T Consensus 437 ~i~~L~~~G~i~~~~Diy~L~~~~l~~l---~gfgeks~~nll~aIe~sk~~~l~r~l~aLGI~~vG~~~ak~~~~~i~~ 513 (562)
T PRK08097 437 TWRALHQTGLFEHLFSWLALTPEQLANT---PGIGKARAEQLWHQFNLARQQPFSRWLKALGIPLPQAALNALDDRSWQQ 513 (562)
T ss_pred HHHHHHHcCCcCCHHHHhcCCHHHHhcC---cCccHHHHHHHHHHHHHHcCCCHHHHHHHcCCccHHHHHHHHhcCCHHH
Confidence 34455543 358888888887776653 36654 56666443221111 1110
Q ss_pred CCcccHHHHhcCCCCcHHHHHHHHHH
Q psy9547 109 EVPRDKKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 109 ~~p~~~~~L~~lpGIG~~tA~~il~~ 134 (363)
....+.++|.+++|||+.+|+.|..|
T Consensus 514 l~~a~~e~l~~i~gIG~~~a~si~~~ 539 (562)
T PRK08097 514 LLSRSEQQWQQLPGIGEGRARQLIAF 539 (562)
T ss_pred HHcCCHHHHhcCCCchHHHHHHHHHH
Confidence 01235678888999999999988877
No 197
>cd00080 HhH2_motif Helix-hairpin-helix class 2 (Pol1 family) motif. HhH2 domains are found in Rad2 family of prokaryotic and eukaryotic replication and repair nucleases, i.e., DNA polymerase I, Taq DNA polymerase, DNA repair protein Rad2 endonuclease, flap endonuclease, exonuclease I and IX, 5'-3' exonuclease and also bacteriophage Rnase H. These nucleases degrade RNA-DNA or DNA-DNA duplexes, or both and play essential roles in DNA duplication, repair, and recombination.
Probab=66.93 E-value=4 Score=30.22 Aligned_cols=18 Identities=28% Similarity=0.320 Sum_probs=15.1
Q ss_pred HhcCCCCcHHHHHHHHHH
Q psy9547 117 LLSIKGVGDYTAGALASI 134 (363)
Q Consensus 117 L~~lpGIG~~tA~~il~~ 134 (363)
+-.+||||++||.-++.-
T Consensus 24 i~gv~giG~k~A~~ll~~ 41 (75)
T cd00080 24 IPGVPGIGPKTALKLLKE 41 (75)
T ss_pred CCCCCcccHHHHHHHHHH
Confidence 347999999999988864
No 198
>PRK14604 ruvA Holliday junction DNA helicase RuvA; Provisional
Probab=66.88 E-value=4.4 Score=36.00 Aligned_cols=44 Identities=25% Similarity=0.200 Sum_probs=27.4
Q ss_pred HHHHhcCCCCcHHHHHHHHHH-hcCCccccccchHHHHHhhhccc
Q psy9547 114 KKQLLSIKGVGDYTAGALASI-CYNIPTPAVDGNVFRIYGRLFEI 157 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~-~~~~~~~~vD~~v~Rvl~Rl~~~ 157 (363)
...|+++.||||++|=.||+. ..+.-.-++..+=...+.++-++
T Consensus 72 F~~Li~V~GIGpK~Al~iLs~~~~~el~~aI~~~D~~~L~kvpGI 116 (195)
T PRK14604 72 FELLIGVSGVGPKAALNLLSSGTPDELQLAIAGGDVARLARVPGI 116 (195)
T ss_pred HHHHhCcCCcCHHHHHHHHcCCCHHHHHHHHHhCCHHHHhhCCCC
Confidence 677999999999999999984 23332233333333344444433
No 199
>PRK13901 ruvA Holliday junction DNA helicase RuvA; Provisional
Probab=66.14 E-value=4.6 Score=35.89 Aligned_cols=21 Identities=24% Similarity=0.327 Sum_probs=18.9
Q ss_pred HHHHhcCCCCcHHHHHHHHHH
Q psy9547 114 KKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~ 134 (363)
.+.|+++.||||++|=+||+-
T Consensus 71 F~~LisVsGIGPK~ALaILs~ 91 (196)
T PRK13901 71 FEELIGVDGIGPRAALRVLSG 91 (196)
T ss_pred HHHHhCcCCcCHHHHHHHHcC
Confidence 677999999999999999974
No 200
>COG1415 Uncharacterized conserved protein [Function unknown]
Probab=65.85 E-value=21 Score=34.31 Aligned_cols=67 Identities=22% Similarity=0.185 Sum_probs=41.2
Q ss_pred cccHHHHhcCCCCcHHHHHHHHHHh---cCCccccccchHHHHHhhhcccccCc--CCcccHHHHHHHHHhcCC
Q psy9547 111 PRDKKQLLSIKGVGDYTAGALASIC---YNIPTPAVDGNVFRIYGRLFEIEDDI--SKGKTKVVYETLVSKTMS 179 (363)
Q Consensus 111 p~~~~~L~~lpGIG~~tA~~il~~~---~~~~~~~vD~~v~Rvl~Rl~~~~~~~--~~~~~~~~~~~~~~~~~p 179 (363)
|.+.++|+-.|||||.|+-+..+.| +|.|.---|--. ..--+|+-+..+ -...++.++-..+++.+.
T Consensus 274 p~Df~elLl~~GiGpstvRALalVAEvIyg~~~s~rDP~~--yafA~GGKDgvP~pV~~~~yde~I~~l~~~ve 345 (373)
T COG1415 274 PDDFEELLLVPGIGPSTVRALALVAEVIYGEPPSFRDPVK--YAFAVGGKDGVPFPVNRKTYDELIEFLEELVE 345 (373)
T ss_pred cccHHHHHhccCCCHHHHHHHHHHHHHHhCCCCCcCCchh--hhhhhcCCCCCCCCcchhhHHHHHHHHHHHHH
Confidence 6789999999999999999988876 776532222111 444445433222 124455555555555554
No 201
>PF00416 Ribosomal_S13: Ribosomal protein S13/S18; InterPro: IPR001892 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Ribosomal protein S13 is one of the proteins from the small ribosomal subunit. In Escherichia coli, S13 is known to be involved in binding fMet-tRNA and, hence, in the initiation of translation. It is a basic protein of 115 to 177 amino-acid residues that contains thee helices and a beta-hairpin in the core of the protein, forming a helix-two turns-helix (H2TH) motif, and a non-globular C-terminal extension. This family of ribosomal proteins is present in prokaryotes, eukaryotes and archaea [, ].; GO: 0003723 RNA binding, 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 3BBN_M 2QBB_M 3I1M_M 3OFP_M 3OFX_M 3OFO_M 1VS5_M 3OAQ_M 2QAL_M 3J18_M ....
Probab=65.58 E-value=7.1 Score=31.07 Aligned_cols=22 Identities=23% Similarity=0.237 Sum_probs=19.7
Q ss_pred HHHHhcCCCCcHHHHHHHHHHh
Q psy9547 114 KKQLLSIKGVGDYTAGALASIC 135 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~~ 135 (363)
.-.|.++.|||+.+|..|+...
T Consensus 14 ~~aLt~IyGIG~~~A~~Ic~~l 35 (107)
T PF00416_consen 14 YIALTKIYGIGRRKAKQICKKL 35 (107)
T ss_dssp HHHHTTSTTBCHHHHHHHHHHT
T ss_pred HhHHhhhhccCHHHHHHHHHHc
Confidence 6789999999999999998764
No 202
>PRK14600 ruvA Holliday junction DNA helicase RuvA; Provisional
Probab=64.70 E-value=4.9 Score=35.44 Aligned_cols=21 Identities=29% Similarity=0.294 Sum_probs=19.0
Q ss_pred HHHHhcCCCCcHHHHHHHHHH
Q psy9547 114 KKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~ 134 (363)
.+.|+++.||||++|=+||+-
T Consensus 72 F~~LisV~GIGpK~Al~iLs~ 92 (186)
T PRK14600 72 LRMLVKVSGVNYKTAMSILSK 92 (186)
T ss_pred HHHHhCcCCcCHHHHHHHHcc
Confidence 677999999999999999984
No 203
>KOG2937|consensus
Probab=62.13 E-value=6.2 Score=37.56 Aligned_cols=145 Identities=10% Similarity=0.041 Sum_probs=78.1
Q ss_pred HHHHHHHHHHHhhhhCcCCCCCCCCCCCcccCh-hhhcCCcccCCcccCCccccceeEEEEEEEeCC-ceEEEeecCCCc
Q psy9547 183 AREFNQALMDLGATVCLFKNPKCKECPLSRFCS-AYKNNTIENFPVKLGKTKVKDVFLLTVVVKTDT-NKYLIQKRPTTG 260 (363)
Q Consensus 183 ~~~~~~~l~~~G~~~C~~~~P~C~~Cpl~~~C~-~~~~~~~~~~p~k~~k~~~~~~~v~~~ii~~~~-~~vLl~~R~~~~ 260 (363)
...+.+|++-+-.-+ ..++--..||+.--=. ...+..+...|.| ..+.|-.++|.+.. .++|+++-..
T Consensus 35 ~f~ve~a~w~y~d~~--~~ndpsl~c~~~~~~~~~~~e~~~~f~~yk------~~iPv~ga~ild~~~sr~llv~g~q-- 104 (348)
T KOG2937|consen 35 LFSVEKALWFYEDFI--NKNDPSLQCPLLWPWKNEKDEAFVDFAPYK------ARIPVRGAIILDEKRSRCLLVKGWQ-- 104 (348)
T ss_pred hHHHHHHHHHHHHHH--hcCCcccCCccccchhhHHHHHHHhhcccc------CCCCCchHhhhhhhhhhhheeecee--
Confidence 455666666666666 3466556788753221 1122223333432 11224445555542 4788887532
Q ss_pred cccccccccccCCCC--CchHHH-HHHHhhCCCcccccccccccEEEEccceEEEEEEE---EEEecC-CC-CCCCCCCe
Q psy9547 261 LLSNFYMFLSFESDT--PYTSQV-EFLSENLPFKVNLNEKCLGNVKHVFSHLKWNMDVY---SGTAKE-KT-IPANKTYK 332 (363)
Q Consensus 261 ~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~v~~~~~~l~~v~h~~sh~~~~l~~~---~~~~~~-~~-~~~~~~~~ 332 (363)
+.=|.||.||+.. +...+. ||..||.|.+....-..-.-+.-.+-+..+.+.+. .+.... +. ..+-+.+.
T Consensus 105 --a~sw~fprgK~~kdesd~~caiReV~eetgfD~skql~~~e~Ie~nI~dq~~~~fIi~gvs~d~~f~~~v~~eis~ih 182 (348)
T KOG2937|consen 105 --ASSWSFPRGKISKDESDSDCAIREVTEETGFDYSKQLQDNEGIETNIRDQLVRLFIINGVSEDTNFNPRVRKEISKIH 182 (348)
T ss_pred --cccccccCccccccchhhhcchhcccchhhcCHHHHhccccCcccchhhceeeeeeeccceeeeecchhhhcccccee
Confidence 2339999999977 555666 99999999987542111111333333333433222 222211 11 12457778
Q ss_pred eeccccc
Q psy9547 333 LITETQM 339 (363)
Q Consensus 333 W~~~~el 339 (363)
|..++++
T Consensus 183 W~~l~~l 189 (348)
T KOG2937|consen 183 WHYLDHL 189 (348)
T ss_pred eeehhhh
Confidence 9999999
No 204
>PRK14973 DNA topoisomerase I; Provisional
Probab=61.88 E-value=25 Score=38.86 Aligned_cols=88 Identities=15% Similarity=0.077 Sum_probs=53.6
Q ss_pred hCCCHHHHhcCCHHHHHHHHhccCch-HHHHHHHHHHH-HhhHH-h----CCCCcccHHHHhcCCCCcHHHHHHHHHHhc
Q psy9547 64 TYPTIKDFAFDTEDNVLKMWEGLGYY-SRVRNFQAGCR-QVIEQ-F----GGEVPRDKKQLLSIKGVGDYTAGALASICY 136 (363)
Q Consensus 64 ~~pt~~~la~a~~~el~~~~~~lG~~-~rA~~l~~~a~-~i~~~-~----~g~~p~~~~~L~~lpGIG~~tA~~il~~~~ 136 (363)
.|-++++++.+++++|... -|+. ..+..+...|. .+... . ...+...+.+|++++|||+.|++..-.-+.
T Consensus 822 G~~~~~d~~~a~p~~La~~---~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~el~~vkg~ge~t~~~l~~ag~ 898 (936)
T PRK14973 822 GFDTPEDFCSVHPAYLALK---TGISPETICRHAKLVCEKLGRPVPEKISKAAFERGRAELLSVPGLGETTLEKLYLAGV 898 (936)
T ss_pred cCCCHHHHHhcCHHHHhcC---CCCChhhHHHHHHHHHHHhcCCCchhhhhhhhcccchhhhhccCCCHHHHHHHHHcCC
Confidence 5889999999999998774 6876 35555544443 33221 0 011223466699999999999976555444
Q ss_pred CCccccccchHHHHHhhh
Q psy9547 137 NIPTPAVDGNVFRIYGRL 154 (363)
Q Consensus 137 ~~~~~~vD~~v~Rvl~Rl 154 (363)
-.+.-..-....++..+-
T Consensus 899 ~~~e~l~~~d~~~la~~~ 916 (936)
T PRK14973 899 YDGDLLVSADPKKLAKVT 916 (936)
T ss_pred CCHHHhccCCHHHHhhhc
Confidence 333222222455555544
No 205
>PF11264 ThylakoidFormat: Thylakoid formation protein; InterPro: IPR017499 Psp29, originally designated sll1414 (P73956 from SWISSPROT) in Synechocystis sp. (strain PCC 6803), is found universally in Cyanobacteria and in Arabidopsis. It was isolated and partially sequenced from purified photosystem II (PS II) in Synechocystis. While its function is unknown, mutant studies show an impairment in photosystem II biogenesis and/or stability, rather than in PS II core function.; GO: 0010027 thylakoid membrane organization, 0015979 photosynthesis, 0009523 photosystem II
Probab=61.26 E-value=1.3e+02 Score=27.24 Aligned_cols=107 Identities=20% Similarity=0.364 Sum_probs=63.1
Q ss_pred HHHHHHHhhcCCCCCCCCCCChHHHHHHHHHhhhcch-------------hhHHHHHHHHHHhCCCHHHHhcCCHHHHHH
Q psy9547 15 ESILTWYKQNARQLPWRESNNPYYVWISEVMLQQTQV-------------KTVLPYYEKFIKTYPTIKDFAFDTEDNVLK 81 (363)
Q Consensus 15 ~~l~~~~~~~~~~lpw~~~~dp~~~lv~~il~qqt~~-------------~~v~~~~~~l~~~~pt~~~la~a~~~el~~ 81 (363)
+.=.++|..+.++.| .-|..+|-++|.++--. =-+..+|.+|++.||..++... -..+
T Consensus 4 DtKr~F~~~~~~pI~-----siYrrvv~ELLVe~HLl~~n~~F~yD~lfalG~vt~fd~fm~GY~p~~~~~~----If~A 74 (216)
T PF11264_consen 4 DTKRAFYKAFPRPIP-----SIYRRVVDELLVELHLLSVNKDFQYDPLFALGLVTVFDRFMQGYPPEEDKDS----IFNA 74 (216)
T ss_pred HHHHHHHHhCCCCCc-----HHHHHHHHHHHHHHHHHHhccCceeCchHHhhHHHHHHHHhcCCCChhHHHH----HHHH
Confidence 444567777776654 77899999998887511 1355788999999987665432 3446
Q ss_pred HHhccCchHHHHHHHHHHHHhhHHhCCCCcccHHHHhcC-CCCcHHHHHHHH
Q psy9547 82 MWEGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSI-KGVGDYTAGALA 132 (363)
Q Consensus 82 ~~~~lG~~~rA~~l~~~a~~i~~~~~g~~p~~~~~L~~l-pGIG~~tA~~il 132 (363)
++..+|+ -+..+++-|+.+.+...|.-..++..+++- .|-|+..-..++
T Consensus 75 lc~a~~~--dp~~~r~dA~~l~~~a~~~s~~~l~~~l~~~~~~~~~~l~~~~ 124 (216)
T PF11264_consen 75 LCQALGF--DPEQYRQDAEKLEEWAKGKSIEDLLSWLSQKGGEGDNPLAAIL 124 (216)
T ss_pred HHHHcCC--CHHHHHHHHHHHHHHHHcCCHHHHHHHHhccccccchHHHHHH
Confidence 7778898 333444555555544334333444444432 444444433333
No 206
>PRK08609 hypothetical protein; Provisional
Probab=61.15 E-value=14 Score=38.57 Aligned_cols=56 Identities=14% Similarity=0.220 Sum_probs=37.1
Q ss_pred HHHHHHhccCch-HHHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHHHh
Q psy9547 78 NVLKMWEGLGYY-SRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALASIC 135 (363)
Q Consensus 78 el~~~~~~lG~~-~rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~~~ 135 (363)
++..++.-.|=. -|++.-+++|+.|... ..++ .+..+|.+|||||+.+|+.|--+.
T Consensus 12 ~~A~~le~~g~n~fr~~aYr~Aa~~i~~l-~~~i-~~~~~l~~ipgIG~~ia~kI~Eil 68 (570)
T PRK08609 12 TIATYMELKGENPFKISAFRKAAQALELD-ERSL-SEIDDFTKLKGIGKGTAEVIQEYR 68 (570)
T ss_pred HHHHHHHhcCCCcHHHHHHHHHHHHHHhC-chhh-hhhhhhccCCCcCHHHHHHHHHHH
Confidence 344444434421 2777788888877652 2223 245678899999999999988774
No 207
>COG1379 PHP family phosphoesterase with a Zn ribbon [General function prediction only]
Probab=59.94 E-value=56 Score=31.31 Aligned_cols=49 Identities=16% Similarity=0.202 Sum_probs=41.5
Q ss_pred CChHHHHHHHHHhhhcchhhHHHHHHHHHHhCCC-HHHHhcCCHHHHHHH
Q psy9547 34 NNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPT-IKDFAFDTEDNVLKM 82 (363)
Q Consensus 34 ~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~pt-~~~la~a~~~el~~~ 82 (363)
.=|.+.+++..+.--.+.+.|...|++|.+.|.| .+.|-+++.|+|..+
T Consensus 303 liPLaeIi~~~~g~gi~tK~V~~~we~lv~~FGtEi~vLi~a~~e~La~V 352 (403)
T COG1379 303 LIPLAEIISMALGKGITTKAVKRTWERLVRAFGTEIDVLIDAPIEELARV 352 (403)
T ss_pred cccHHHHHHHHhccceechhHHHHHHHHHHHhcchhhhHhcCCHHHHhhh
Confidence 4677777877777778999999999999999986 788889999888664
No 208
>PLN00047 photosystem II biogenesis protein Psb29; Provisional
Probab=59.61 E-value=97 Score=29.08 Aligned_cols=111 Identities=16% Similarity=0.222 Sum_probs=69.5
Q ss_pred HHHHHHHHHHHhhcCCCCCCCCCCChHHHHHHHHHhhhcch-------------hhHHHHHHHHHHhCCCHHHHhcCCHH
Q psy9547 11 LAFQESILTWYKQNARQLPWRESNNPYYVWISEVMLQQTQV-------------KTVLPYYEKFIKTYPTIKDFAFDTED 77 (363)
Q Consensus 11 ~~~~~~l~~~~~~~~~~lpw~~~~dp~~~lv~~il~qqt~~-------------~~v~~~~~~l~~~~pt~~~la~a~~~ 77 (363)
..+.+.=.++|..++++.| .-|..+|-++|.|+--. --+..+|.+|++.||..++...
T Consensus 56 ~TVSDTKr~F~~~yp~pIp-----siYrrvvdELLVElHLLs~n~~F~yDplFALGlVtvfd~fm~GY~Pee~~~~---- 126 (283)
T PLN00047 56 PTVAETKAKFLKSYKRPIP-----SIYSTVLQELLVQQHLMRYKKTYRYDPVFALGFVTVYDQLMEGYPSDEDRDA---- 126 (283)
T ss_pred CcHHHHHHHHHHhCCCCCc-----HHHHHHHHHHHHHHHHHHhccCceeCchhhhhhHHHHHHHHccCCChHHHHH----
Confidence 3456666777887777664 77899999998887621 1345789999999998776543
Q ss_pred HHHHHHhccCchHHHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHH
Q psy9547 78 NVLKMWEGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALA 132 (363)
Q Consensus 78 el~~~~~~lG~~~rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il 132 (363)
-..+++..+|+ -+..+++-|+.+.+...|.-..++..++...|-...+...|.
T Consensus 127 IF~Alc~a~g~--Dp~qyr~dA~~l~~~A~~~s~~~l~~~l~~~~~l~~~l~~IA 179 (283)
T PLN00047 127 IFKAYIKALGE--DPEQYRKDAAKLEEWARSQTGSSLVDFSSKEGEIEGILKDIA 179 (283)
T ss_pred HHHHHHHHcCC--CHHHHHHHHHHHHHHHhcCCHHHHHHHHhcchHHHHHHHHHH
Confidence 34467778898 344455555555554444434455554444454444444443
No 209
>COG0632 RuvA Holliday junction resolvasome, DNA-binding subunit [DNA replication, recombination, and repair]
Probab=58.10 E-value=7.3 Score=34.74 Aligned_cols=43 Identities=19% Similarity=0.166 Sum_probs=28.8
Q ss_pred HHHHhcCCCCcHHHHHHHHHH-hcCCccccccchHHHHHhhhcc
Q psy9547 114 KKQLLSIKGVGDYTAGALASI-CYNIPTPAVDGNVFRIYGRLFE 156 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~-~~~~~~~~vD~~v~Rvl~Rl~~ 156 (363)
...|+++.||||++|=+||+. ..+.-.-+++..=...+.++=+
T Consensus 72 F~~LisVnGIGpK~ALaiLs~~~~~~l~~aI~~~d~~~L~k~PG 115 (201)
T COG0632 72 FRLLISVNGIGPKLALAILSNLDPEELAQAIANEDVKALSKIPG 115 (201)
T ss_pred HHHHHccCCccHHHHHHHHcCCCHHHHHHHHHhcChHhhhcCCC
Confidence 677999999999999999986 3333334444444444555533
No 210
>PF14490 HHH_4: Helix-hairpin-helix containing domain; PDB: 3GPL_A 3E1S_A 3GP8_A.
Probab=57.28 E-value=23 Score=27.28 Aligned_cols=53 Identities=17% Similarity=0.263 Sum_probs=30.7
Q ss_pred CHHHHHHHHhccCchH-HHHHHHHHHHHhhHHhCCCCcccH----HHHhc-CCCCcHHHHHHHHHH
Q psy9547 75 TEDNVLKMWEGLGYYS-RVRNFQAGCRQVIEQFGGEVPRDK----KQLLS-IKGVGDYTAGALASI 134 (363)
Q Consensus 75 ~~~el~~~~~~lG~~~-rA~~l~~~a~~i~~~~~g~~p~~~----~~L~~-lpGIG~~tA~~il~~ 134 (363)
+..++...+...|+.. .|..| .+.||.+.-+.+ =.|+. ++|||-++||.|...
T Consensus 7 ~~~~~~~~L~~~gl~~~~a~kl-------~~~yg~~ai~~l~~nPY~L~~~i~gi~F~~aD~iA~~ 65 (94)
T PF14490_consen 7 GLRELMAFLQEYGLSPKLAMKL-------YKKYGDDAIEILKENPYRLIEDIDGIGFKTADKIALK 65 (94)
T ss_dssp --HHHHHHHHHTT--HHHHHHH-------HHHH-TTHHHHHHH-STCCCB-SSSSBHHHHHHHHHT
T ss_pred HHHHHHHHHHHcCCCHHHHHHH-------HHHHhHHHHHHHHHChHHHHHHccCCCHHHHHHHHHH
Confidence 4456677778888864 45444 444554322212 22555 899999999998875
No 211
>PF12826 HHH_2: Helix-hairpin-helix motif; PDB: 1X2I_B 1DGS_A 1V9P_B.
Probab=57.21 E-value=18 Score=25.81 Aligned_cols=38 Identities=8% Similarity=0.179 Sum_probs=24.3
Q ss_pred HHHHHHhCCCHHHHhcCCHHHHHHHHhccCchHHHHHHHH
Q psy9547 58 YEKFIKTYPTIKDFAFDTEDNVLKMWEGLGYYSRVRNFQA 97 (363)
Q Consensus 58 ~~~l~~~~pt~~~la~a~~~el~~~~~~lG~~~rA~~l~~ 97 (363)
-..|.+.|+|++.|.+++.+++.. +.++|- ..|+.|.+
T Consensus 16 ak~L~~~f~sl~~l~~a~~e~L~~-i~gIG~-~~A~si~~ 53 (64)
T PF12826_consen 16 AKLLAKHFGSLEALMNASVEELSA-IPGIGP-KIAQSIYE 53 (64)
T ss_dssp HHHHHHCCSCHHHHCC--HHHHCT-STT--H-HHHHHHHH
T ss_pred HHHHHHHcCCHHHHHHcCHHHHhc-cCCcCH-HHHHHHHH
Confidence 356788999999999999999855 445553 24555544
No 212
>PRK14552 C/D box methylation guide ribonucleoprotein complex aNOP56 subunit; Provisional
Probab=57.15 E-value=37 Score=33.82 Aligned_cols=18 Identities=33% Similarity=0.656 Sum_probs=15.6
Q ss_pred HHHHHHHHHHHHHHHhhc
Q psy9547 7 AKEILAFQESILTWYKQN 24 (363)
Q Consensus 7 ~~~~~~~~~~l~~~~~~~ 24 (363)
+.+++.+..+|.+||+.+
T Consensus 152 dk~in~~~~RLrewY~~~ 169 (414)
T PRK14552 152 DKTINLFSERLREWYSLH 169 (414)
T ss_pred HHHHHHHHHHHHHHHHhh
Confidence 367889999999999985
No 213
>COG0272 Lig NAD-dependent DNA ligase (contains BRCT domain type II) [DNA replication, recombination, and repair]
Probab=56.39 E-value=12 Score=39.22 Aligned_cols=23 Identities=26% Similarity=0.523 Sum_probs=17.4
Q ss_pred ccHHHHhcCCCCcHHHHHHHHHH
Q psy9547 112 RDKKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il~~ 134 (363)
.+.++|.+++|||...|..|..|
T Consensus 540 a~~e~l~~i~giG~~vA~si~~f 562 (667)
T COG0272 540 ASEEELASIPGIGEVVARSIIEF 562 (667)
T ss_pred cCHHHHhhccchhHHHHHHHHHH
Confidence 45777888888888888877765
No 214
>PF14229 DUF4332: Domain of unknown function (DUF4332)
Probab=55.50 E-value=31 Score=28.10 Aligned_cols=75 Identities=19% Similarity=0.180 Sum_probs=42.9
Q ss_pred CCCHHHHhcCCHHHHHH--HHhccCchHHHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHHHhcCCcccc
Q psy9547 65 YPTIKDFAFDTEDNVLK--MWEGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALASICYNIPTPA 142 (363)
Q Consensus 65 ~pt~~~la~a~~~el~~--~~~~lG~~~rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~~~~~~~~~~ 142 (363)
.-|.++|.+++...... +....|.. +++|.+. ..+.+|..+||||+..|..+..-++.-..-.
T Consensus 16 I~t~~~Ll~~~~~~~~r~~La~~~~i~--~~~l~~w-------------~~~AdL~ri~gi~~~~a~LL~~AGv~Tv~~L 80 (122)
T PF14229_consen 16 IKTTGDLLEAGDTPLGRKALAKKLGIS--ERNLLKW-------------VNQADLMRIPGIGPQYAELLEHAGVDTVEEL 80 (122)
T ss_pred CCcHHHHHHcCCCHHHHHHHHHhcCCC--HHHHHHH-------------HhHHHhhhcCCCCHHHHHHHHHhCcCcHHHH
Confidence 45777777776655444 33444442 1222221 1367788999999999998777665443333
Q ss_pred ccchHHHHHhhh
Q psy9547 143 VDGNVFRIYGRL 154 (363)
Q Consensus 143 vD~~v~Rvl~Rl 154 (363)
...+..++..++
T Consensus 81 A~~~p~~L~~~l 92 (122)
T PF14229_consen 81 AQRNPQNLHQKL 92 (122)
T ss_pred HhCCHHHHHHHH
Confidence 334444444443
No 215
>PRK12766 50S ribosomal protein L32e; Provisional
Probab=55.00 E-value=7 Score=35.50 Aligned_cols=21 Identities=38% Similarity=0.488 Sum_probs=9.8
Q ss_pred HhcCCCCcHHHHHHHHHHhcC
Q psy9547 117 LLSIKGVGDYTAGALASICYN 137 (363)
Q Consensus 117 L~~lpGIG~~tA~~il~~~~~ 137 (363)
|..|||||+.++..++..+|+
T Consensus 5 L~~IpGIG~krakkLl~~GF~ 25 (232)
T PRK12766 5 LEDISGVGPSKAEALREAGFE 25 (232)
T ss_pred cccCCCcCHHHHHHHHHcCCC
Confidence 444455555555444444333
No 216
>TIGR03060 PS_II_psb29 photosystem II biogenesis protein Psp29. Psp29, originally designated sll1414 in Synechocystis 6803, is found universally in Cyanobacteria and in Arabidopsis. It was isolated and partially sequenced from purified photosystem II (PS II) in Synechocystis. While its function is unknown, mutant studies show an impairment in photosystem II biogenesis and/or stability, rather than in PS II core function.
Probab=54.74 E-value=89 Score=28.15 Aligned_cols=96 Identities=14% Similarity=0.258 Sum_probs=61.8
Q ss_pred HHHHHHHHHhhcCCCCCCCCCCChHHHHHHHHHhhhcc-------------hhhHHHHHHHHHHhCCCHHHHhcCCHHHH
Q psy9547 13 FQESILTWYKQNARQLPWRESNNPYYVWISEVMLQQTQ-------------VKTVLPYYEKFIKTYPTIKDFAFDTEDNV 79 (363)
Q Consensus 13 ~~~~l~~~~~~~~~~lpw~~~~dp~~~lv~~il~qqt~-------------~~~v~~~~~~l~~~~pt~~~la~a~~~el 79 (363)
+.+.=.++|..+.++.| .-|..+|-++|.++-- .--+..+|.+|++.|+..++... -.
T Consensus 7 VSDtKr~F~~~~p~pI~-----siYrrvv~ELLVElHLl~~n~~F~yDplfAlGlvt~fd~fm~GY~Pee~~~~----IF 77 (214)
T TIGR03060 7 VSDSKRAFHAAFPRVIP-----PLYRRVVDELLVELHLLSHQSDFKYDPLFALGLVTVFDRFMEGYRPEEHLDA----LF 77 (214)
T ss_pred HHHHHHHHHHhCCCCCc-----hHHHHHHHHHHHHHHHHHhccCceeCchHHhhHHHHHHHHHcCCCChHHHHH----HH
Confidence 44555567777766654 6788899888887651 11355789999999988776543 34
Q ss_pred HHHHhccCchHHHHHHHHHHHHhhHHhCCCCcccHHHHhc
Q psy9547 80 LKMWEGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLS 119 (363)
Q Consensus 80 ~~~~~~lG~~~rA~~l~~~a~~i~~~~~g~~p~~~~~L~~ 119 (363)
.+++..+|+ -+..+++-|+.+.+...|.-.+++..+++
T Consensus 78 ~Alc~a~~~--dp~~~r~dA~~l~~~a~~~s~~~i~~~l~ 115 (214)
T TIGR03060 78 DALCNSNGF--DPEQLREDAKQLLEQAKGKGLDEILSWLT 115 (214)
T ss_pred HHHHHhcCC--CHHHHHHHHHHHHHHHhcCCHHHHHHHHh
Confidence 467778898 44556666666666555543444444443
No 217
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=54.22 E-value=22 Score=20.62 Aligned_cols=28 Identities=18% Similarity=0.544 Sum_probs=22.9
Q ss_pred HHHHHHhhhcchhhHHHHHHHHHHhCCC
Q psy9547 40 WISEVMLQQTQVKTVLPYYEKFIKTYPT 67 (363)
Q Consensus 40 lv~~il~qqt~~~~v~~~~~~l~~~~pt 67 (363)
.++.+..++.....+..+++++++.||+
T Consensus 5 ~~a~~~~~~g~~~~A~~~~~~~~~~~P~ 32 (33)
T PF13174_consen 5 RLARCYYKLGDYDEAIEYFQRLIKRYPD 32 (33)
T ss_dssp HHHHHHHHHCHHHHHHHHHHHHHHHSTT
T ss_pred HHHHHHHHccCHHHHHHHHHHHHHHCcC
Confidence 3456666677889999999999999995
No 218
>COG4277 Predicted DNA-binding protein with the Helix-hairpin-helix motif [General function prediction only]
Probab=52.69 E-value=10 Score=35.80 Aligned_cols=22 Identities=27% Similarity=0.409 Sum_probs=19.7
Q ss_pred cHHHHhcCCCCcHHHHHHHHHH
Q psy9547 113 DKKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 113 ~~~~L~~lpGIG~~tA~~il~~ 134 (363)
.+++|+.+||||.++|.-|++.
T Consensus 328 ~~~~llRVPGiG~ksa~rIv~~ 349 (404)
T COG4277 328 PYKELLRVPGIGVKSARRIVMT 349 (404)
T ss_pred CHHHhcccCCCChHHHHHHHHH
Confidence 4899999999999999988764
No 219
>PRK13266 Thf1-like protein; Reviewed
Probab=52.38 E-value=99 Score=28.11 Aligned_cols=96 Identities=11% Similarity=0.269 Sum_probs=63.0
Q ss_pred HHHHHHHHHhhcCCCCCCCCCCChHHHHHHHHHhhhcc-------------hhhHHHHHHHHHHhCCCHHHHhcCCHHHH
Q psy9547 13 FQESILTWYKQNARQLPWRESNNPYYVWISEVMLQQTQ-------------VKTVLPYYEKFIKTYPTIKDFAFDTEDNV 79 (363)
Q Consensus 13 ~~~~l~~~~~~~~~~lpw~~~~dp~~~lv~~il~qqt~-------------~~~v~~~~~~l~~~~pt~~~la~a~~~el 79 (363)
+.+.=.++|..+.++.| .-|..+|-++|.++-- .--+..+|.+|++.|+..++... -.
T Consensus 7 VSDtKr~F~~~~p~pI~-----siYrrvv~ELLVElHLl~~n~~F~yDplfAlGlvt~fd~fm~GY~Pee~~~~----IF 77 (225)
T PRK13266 7 VSDSKRAFYAAFPRPIN-----SIYRRVVDELLVELHLLSVNSDFKYDPLFALGLVTVFDRFMQGYRPEEHKDS----IF 77 (225)
T ss_pred HHHHHHHHHHhCCCCCc-----hHHHHHHHHHHHHHHHHHhccCceeCchHHhhHHHHHHHHHcCCCChHHHHH----HH
Confidence 44555677777766654 6788999998887651 11355789999999988776443 34
Q ss_pred HHHHhccCchHHHHHHHHHHHHhhHHhCCCCcccHHHHhc
Q psy9547 80 LKMWEGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLS 119 (363)
Q Consensus 80 ~~~~~~lG~~~rA~~l~~~a~~i~~~~~g~~p~~~~~L~~ 119 (363)
.+++..+|+ -+..+++-|+.+.+...|.-..++..+++
T Consensus 78 ~Alc~a~~~--dp~~~r~dA~~l~~~a~~~s~~~i~~~l~ 115 (225)
T PRK13266 78 NALCQAVGF--DPEQLRQDAERLLELAKGKSLKEILSWLT 115 (225)
T ss_pred HHHHHHcCC--CHHHHHHHHHHHHHHHhcCCHHHHHHHHh
Confidence 467778898 45556666666666555544444444444
No 220
>COG1948 MUS81 ERCC4-type nuclease [DNA replication, recombination, and repair]
Probab=52.06 E-value=34 Score=31.60 Aligned_cols=21 Identities=24% Similarity=0.589 Sum_probs=11.0
Q ss_pred HHHHhcCCCCcHHHHHHHHHH
Q psy9547 114 KKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~ 134 (363)
..+|++++|||.+.|..|-.|
T Consensus 213 ~~eL~~v~gig~k~A~~I~~~ 233 (254)
T COG1948 213 EEELMKVKGIGEKKAREIYRF 233 (254)
T ss_pred HHHHHHhcCccHHHHHHHHHH
Confidence 455555555555555555443
No 221
>TIGR00375 conserved hypothetical protein TIGR00375. The member of this family from Methanococcus jannaschii, MJ0043, is considerably longer and appears to contain an intein N-terminal to the region of homology.
Probab=51.86 E-value=23 Score=34.73 Aligned_cols=49 Identities=20% Similarity=0.175 Sum_probs=42.7
Q ss_pred CChHHHHHHHHHhhhcchhhHHHHHHHHHHhCCC-HHHHhcCCHHHHHHH
Q psy9547 34 NNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPT-IKDFAFDTEDNVLKM 82 (363)
Q Consensus 34 ~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~pt-~~~la~a~~~el~~~ 82 (363)
.=|.+.++++++.--...+.|...|++|+++|.| .+-|.+++.+||.++
T Consensus 296 ~iPL~ei~~~~~~~~~~~k~v~~~~~~l~~~fG~E~~iL~~~~~eel~~~ 345 (374)
T TIGR00375 296 LIPLAEVIGVGPKKGIFTKAVQSLWEKLKKAFGTEIAVLHEAAEEDLARV 345 (374)
T ss_pred eCCHHHHHhhhcCCCCccHHHHHHHHHHHHHhccHHHHHhcCCHHHHHHH
Confidence 5688999999998777899999999999999986 778889999998764
No 222
>PF00570 HRDC: HRDC domain Bloom syndrome. Werner syndrome.; InterPro: IPR002121 The HRDC (Helicase and RNase D C-terminal) domain has a putative role in nucleic acid binding. Mutations in the HRDC domain associated with the human BLM gene result in Bloom Syndrome (BS), an autosomal recessive disorder characterised by proportionate pre- and postnatal growth deficiency; sun-sensitive, telangiectatic, hypo- and hyperpigmented skin; predisposition to malignancy; and chromosomal instability [].; GO: 0003676 nucleic acid binding, 0005622 intracellular; PDB: 3SAG_B 3SAH_B 2CPR_A 3SAF_B 3CYM_A 1WUD_A 2HBK_A 2HBJ_A 2HBM_A 2HBL_A ....
Probab=49.61 E-value=28 Score=24.59 Aligned_cols=21 Identities=29% Similarity=0.547 Sum_probs=17.1
Q ss_pred CcccHHHHhcCCCCcHHHHHH
Q psy9547 110 VPRDKKQLLSIKGVGDYTAGA 130 (363)
Q Consensus 110 ~p~~~~~L~~lpGIG~~tA~~ 130 (363)
.|.+.++|.+++|+|++-+.-
T Consensus 39 ~P~s~~~L~~i~g~~~~~~~~ 59 (68)
T PF00570_consen 39 LPTSIEELLQIPGMGKRKVRK 59 (68)
T ss_dssp --SSHHHHHTSTTCGHHHHHH
T ss_pred CCCCHHHHHHccCCCHHHHHH
Confidence 489999999999999987763
No 223
>PF01367 5_3_exonuc: 5'-3' exonuclease, C-terminal SAM fold; InterPro: IPR020045 This entry represents the C-terminal domain of 5' to 3' exonucleases. The 5'-3' exonucleases are conserved in organisms as diverse as bacteriophage and mammals. It adopts a SAM fold consisting of 4-5 helices packed into a bundle of two orthogonally packed alpha-hairpins. This domain is involved in interactions with DNA and proteins. 5' to 3' exonucleases that contain this domain include: Bacteriophage T4 RNase H, which has sequence similarity to the RAD2 family of eukaryotic proteins []. 5' to 3' exonuclease domain of DNA polymerase Taq, which is homologous to Escherichia coli DNA polymerase I (pol I) [, ]. Bacteriophage T5 5'-exonuclease, which are structure-specific endonucleases []. Flap endonuclease-1 (Fen-1 nuclease), a structure specific nuclease that is an essential enzyme for eukaryotic DNA replication and repair []. ; GO: 0003677 DNA binding, 0003824 catalytic activity; PDB: 1TAQ_A 1BGX_T 1TAU_A 1XO1_B 1EXN_A 1UT8_A 1UT5_B.
Probab=49.30 E-value=2.5 Score=33.36 Aligned_cols=19 Identities=26% Similarity=0.403 Sum_probs=13.0
Q ss_pred HHhcCCCCcHHHHHHHHHH
Q psy9547 116 QLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 116 ~L~~lpGIG~~tA~~il~~ 134 (363)
.+-.+||||+|||..++.-
T Consensus 19 NIPGV~GIG~KtA~~LL~~ 37 (101)
T PF01367_consen 19 NIPGVPGIGPKTAAKLLQE 37 (101)
T ss_dssp TB---TTSTCHCCCCCHHH
T ss_pred CCCCCCCCCHHHHHHHHHH
Confidence 3456899999999887764
No 224
>PF13869 NUDIX_2: Nucleotide hydrolase; PDB: 3MDG_B 2J8Q_B 3Q2S_A 3P5T_D 3BAP_A 2CL3_A 3P6Y_A 3Q2T_B 3BHO_A 3N9U_A ....
Probab=48.93 E-value=25 Score=30.93 Aligned_cols=48 Identities=15% Similarity=0.144 Sum_probs=28.9
Q ss_pred EEEEEEEeCCc--eEEEeecCCCccccccccccccCCCC--CchHHH-HHHHhhCCCc
Q psy9547 239 LLTVVVKTDTN--KYLIQKRPTTGLLSNFYMFLSFESDT--PYTSQV-EFLSENLPFK 291 (363)
Q Consensus 239 v~~~ii~~~~~--~vLl~~R~~~~~~~glWEFPggk~e~--~~~~al-rEl~Eelgl~ 291 (363)
|.+.+++...+ .||+-|.... .|-+|||+..+ ...+.+ |.|.+-++..
T Consensus 46 Ve~Vllvh~h~~PHvLLLq~~~~-----~fkLPGg~l~~gE~e~~gLkrkL~~~l~~~ 98 (188)
T PF13869_consen 46 VEGVLLVHEHGHPHVLLLQIGNT-----FFKLPGGRLRPGEDEIEGLKRKLTEKLSPE 98 (188)
T ss_dssp EEEEEEEEETTEEEEEEEEETTT-----EEE-SEEE--TT--HHHHHHHHHHHHHB-S
T ss_pred EEEEEEEecCCCcEEEEEeccCc-----cccCCccEeCCCCChhHHHHHHHHHHcCCC
Confidence 33334444433 5888886332 68999999988 556777 9998887754
No 225
>PF13543 KSR1-SAM: SAM like domain present in kinase suppressor RAS 1
Probab=46.78 E-value=1.1e+02 Score=25.26 Aligned_cols=87 Identities=14% Similarity=0.310 Sum_probs=57.9
Q ss_pred cCHHHHHHHHHHHHHHHhhc---CCCCCCCC------CCChHHHHHHHHHhhhcchhhHHHHHHHHHHhCCCHHHHhcCC
Q psy9547 5 LSAKEILAFQESILTWYKQN---ARQLPWRE------SNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAFDT 75 (363)
Q Consensus 5 ~~~~~~~~~~~~l~~~~~~~---~~~lpw~~------~~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~pt~~~la~a~ 75 (363)
++..+|..+-.+|+.+|... +..++=+. .-.-+..|+..|=.++.++..+ ....-|.++|.+.+
T Consensus 28 ltqqeIr~lE~KLvK~fSkQL~~K~k~~~~~~~~~l~~yP~l~~WL~vVgl~~~~i~~i-------~~~~~tLe~Llems 100 (129)
T PF13543_consen 28 LTQQEIRTLEGKLVKYFSKQLQCKAKVAERERAAELNSYPSLRQWLRVVGLRPESIQAI-------LSKVLTLEALLEMS 100 (129)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCchhcccCCcHHHHhhhcCCCHHHHHHH-------HHhhcCHHHHHhCC
Confidence 45567777788888888653 11121110 1245778888776666665555 34567999999999
Q ss_pred HHHHHHHHhccCchH-HHHHHHHH
Q psy9547 76 EDNVLKMWEGLGYYS-RVRNFQAG 98 (363)
Q Consensus 76 ~~el~~~~~~lG~~~-rA~~l~~~ 98 (363)
++||..++..+|++. =.++|..+
T Consensus 101 d~el~~~l~~~g~~~EE~rRL~~A 124 (129)
T PF13543_consen 101 DEELKEILNRCGAREEECRRLCRA 124 (129)
T ss_pred HHHHHHHHHHhCCCHHHHHHHHHH
Confidence 999999999999874 33444443
No 226
>TIGR01954 nusA_Cterm_rpt transcription termination factor NusA, C-terminal duplication. NusA is a bacterial transcription termination factor. It is named for its interaction with phage lambda protein N, as part of the N utilization substance. Some members of the NusA family have a long C-terminal extension. This model represents an acidic 50-residue region found in two copies toward the C-terminus of most Proteobacterial NusA proteins, spaced about 26 residues apart. Analogous C-terminal extensions in some other bacterial lineages lack apparent homology but appear similarly acidic.
Probab=46.11 E-value=32 Score=22.62 Aligned_cols=36 Identities=19% Similarity=0.457 Sum_probs=28.2
Q ss_pred HhCCCHHHHhcCCHHHHHHHHhccCchH-HHHHHHHHHHH
Q psy9547 63 KTYPTIKDFAFDTEDNVLKMWEGLGYYS-RVRNFQAGCRQ 101 (363)
Q Consensus 63 ~~~pt~~~la~a~~~el~~~~~~lG~~~-rA~~l~~~a~~ 101 (363)
..|-|.++++.++++++..+ -|+.. ++..|+..++.
T Consensus 12 ~G~~s~e~la~~~~~eL~~i---~g~~~e~a~~ii~~a~~ 48 (50)
T TIGR01954 12 EGFTTVEDLAYVPIDELLSI---EGFDEETAKELINRARN 48 (50)
T ss_pred cCCCCHHHHHccCHHHHhcC---CCCCHHHHHHHHHHHHH
Confidence 46889999999999999774 56764 88888776653
No 227
>PF10343 DUF2419: Protein of unknown function (DUF2419); InterPro: IPR019438 This entry contains proteins that have no known function.
Probab=45.06 E-value=61 Score=30.61 Aligned_cols=65 Identities=17% Similarity=0.285 Sum_probs=50.5
Q ss_pred CChHHHHHHHHHhhhcchhhHHHHHHHHHHhCC--CHHHHhcCCHHHHHHHHhccC-----ch-HHHHHHHHHHHHhhHH
Q psy9547 34 NNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYP--TIKDFAFDTEDNVLKMWEGLG-----YY-SRVRNFQAGCRQVIEQ 105 (363)
Q Consensus 34 ~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~p--t~~~la~a~~~el~~~~~~lG-----~~-~rA~~l~~~a~~i~~~ 105 (363)
.+.|..|+++|= +.+.+..| +|+-+++.+.+++..++++-. +- .|++.|+++.+.+.++
T Consensus 45 ~tgY~~L~aal~-------------rAl~~~~pi~~~~~~~~~t~~~l~~if~s~~~~~iPll~eR~~~L~E~G~vL~~~ 111 (287)
T PF10343_consen 45 YTGYWSLCAALK-------------RALDEGIPITDPKYYAKMTDEELRHIFRSDTEEEIPLLEERARLLREVGRVLLEK 111 (287)
T ss_pred cchHHHHHHHHH-------------HHHhcCCCCcCHHHHHhCCHHHHHHHhcCCCCCcCCCHHHHHHHHHHHHHHHHHH
Confidence 367777776652 23445666 899999999999999998652 22 4999999999999999
Q ss_pred hCCCCc
Q psy9547 106 FGGEVP 111 (363)
Q Consensus 106 ~~g~~p 111 (363)
|+|.+-
T Consensus 112 f~gs~~ 117 (287)
T PF10343_consen 112 FDGSFA 117 (287)
T ss_pred HhhhHH
Confidence 999754
No 228
>PRK07758 hypothetical protein; Provisional
Probab=43.66 E-value=19 Score=27.95 Aligned_cols=21 Identities=24% Similarity=0.457 Sum_probs=17.5
Q ss_pred ccHHHHhcCCCCcHHHHHHHH
Q psy9547 112 RDKKQLLSIKGVGDYTAGALA 132 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il 132 (363)
.+.++|++|+|+|+++.+-|-
T Consensus 64 ~te~ELl~iknlGkKSL~EIk 84 (95)
T PRK07758 64 YSEKEILKLHGMGPASLPKLR 84 (95)
T ss_pred CCHHHHHHccCCCHHHHHHHH
Confidence 468899999999999887654
No 229
>PRK12766 50S ribosomal protein L32e; Provisional
Probab=41.88 E-value=44 Score=30.40 Aligned_cols=39 Identities=10% Similarity=0.203 Sum_probs=24.6
Q ss_pred HHHHHh-CCCHHHHhcCCHHHHHHHHhccCchH-HHHHHHHHHH
Q psy9547 59 EKFIKT-YPTIKDFAFDTEDNVLKMWEGLGYYS-RVRNFQAGCR 100 (363)
Q Consensus 59 ~~l~~~-~pt~~~la~a~~~el~~~~~~lG~~~-rA~~l~~~a~ 100 (363)
.+|.+. |.|+++|..|+.++|.++ -|+.. +|..|.+.+.
T Consensus 17 kkLl~~GF~Sve~Ik~AS~eEL~~V---~GIg~k~AekI~e~l~ 57 (232)
T PRK12766 17 EALREAGFESVEDVRAADQSELAEV---DGIGNALAARIKADVG 57 (232)
T ss_pred HHHHHcCCCCHHHHHhCCHHHHHHc---cCCCHHHHHHHHHHhc
Confidence 345565 778888888877777663 45553 5666655443
No 230
>COG1796 POL4 DNA polymerase IV (family X) [DNA replication, recombination, and repair]
Probab=40.30 E-value=97 Score=29.66 Aligned_cols=65 Identities=14% Similarity=0.271 Sum_probs=36.9
Q ss_pred HHHHHHHhCC----CHHHHhcCCHHHHHHHHhccCchHHHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHH
Q psy9547 57 YYEKFIKTYP----TIKDFAFDTEDNVLKMWEGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALA 132 (363)
Q Consensus 57 ~~~~l~~~~p----t~~~la~a~~~el~~~~~~lG~~~rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il 132 (363)
.+..|...+| +.-.+...++..+..+|+.+|.-+. ..++++++ ++. +..|+|+|.+.+.-||
T Consensus 80 ~le~lk~~~P~gl~~Ll~v~GlGpkKi~~Ly~elgi~~~-e~l~~a~~------~~~-------~~~l~GfG~kse~~il 145 (326)
T COG1796 80 KLEALKKEVPEGLEPLLKVPGLGPKKIVSLYKELGIKDL-EELQEALE------NGK-------IRGLRGFGKKSEAKIL 145 (326)
T ss_pred HHHHHHHhCCcchHHHhhCCCCCcHHHHHHHHHHCcccH-HHHHHHHH------hCC-------ccccCCccchhHHHHH
Confidence 3444555566 2333334456666666666665322 22333332 122 3569999999999998
Q ss_pred HHh
Q psy9547 133 SIC 135 (363)
Q Consensus 133 ~~~ 135 (363)
-+.
T Consensus 146 ~~i 148 (326)
T COG1796 146 ENI 148 (326)
T ss_pred HHH
Confidence 774
No 231
>COG1948 MUS81 ERCC4-type nuclease [DNA replication, recombination, and repair]
Probab=39.31 E-value=37 Score=31.41 Aligned_cols=21 Identities=33% Similarity=0.235 Sum_probs=17.4
Q ss_pred HHHHhcCCCCcHHHHHHHHHH
Q psy9547 114 KKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~ 134 (363)
..-|.++||||+..|.-++..
T Consensus 181 ~~il~s~pgig~~~a~~ll~~ 201 (254)
T COG1948 181 LYILESIPGIGPKLAERLLKK 201 (254)
T ss_pred HHHHHcCCCccHHHHHHHHHH
Confidence 455778999999999988754
No 232
>PRK14976 5'-3' exonuclease; Provisional
Probab=39.30 E-value=18 Score=34.10 Aligned_cols=18 Identities=33% Similarity=0.418 Sum_probs=14.4
Q ss_pred HhcCCCCcHHHHHHHHHH
Q psy9547 117 LLSIKGVGDYTAGALASI 134 (363)
Q Consensus 117 L~~lpGIG~~tA~~il~~ 134 (363)
+-.+||||++||..++.-
T Consensus 193 ipGVpGIG~KtA~~LL~~ 210 (281)
T PRK14976 193 IKGVKGIGPKTAIKLLNK 210 (281)
T ss_pred CCCCCcccHHHHHHHHHH
Confidence 345899999999988753
No 233
>PRK14667 uvrC excinuclease ABC subunit C; Provisional
Probab=38.85 E-value=47 Score=34.57 Aligned_cols=23 Identities=13% Similarity=0.242 Sum_probs=18.7
Q ss_pred cccHHHHhcCCCCcHHHHHHHHHH
Q psy9547 111 PRDKKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 111 p~~~~~L~~lpGIG~~tA~~il~~ 134 (363)
..+.++|.++ |||..+|..|..+
T Consensus 542 ~As~eeL~~v-gi~~~~A~~I~~~ 564 (567)
T PRK14667 542 KADDEELKKL-GIPPSVKQEVKKY 564 (567)
T ss_pred hCCHHHHHHc-CCCHHHHHHHHHH
Confidence 3578888888 9999999888765
No 234
>PF10440 WIYLD: Ubiquitin-binding WIYLD domain; InterPro: IPR018848 This entry represents a presumed domain which has been predicted to contain three alpha helices. It was named the WIYLD domain based on the pattern of the ost conserved residues []. This domain appears to be specific to plant SET-domain proteins. ; GO: 0018024 histone-lysine N-methyltransferase activity
Probab=38.57 E-value=55 Score=23.59 Aligned_cols=39 Identities=23% Similarity=0.451 Sum_probs=31.9
Q ss_pred CHHHHHHHHHHHHHHHhhcCCCCCCCCC-CChHHHHHHHHHhhhc
Q psy9547 6 SAKEILAFQESILTWYKQNARQLPWRES-NNPYYVWISEVMLQQT 49 (363)
Q Consensus 6 ~~~~~~~~~~~l~~~~~~~~~~lpw~~~-~dp~~~lv~~il~qqt 49 (363)
.+..+..++..|+.-|+.+ |-.. .|-|.+|+-.|+..|-
T Consensus 24 ~~~~v~~vl~~LL~lY~~n-----W~lIEed~Y~~L~dai~e~~e 63 (65)
T PF10440_consen 24 SKKQVRPVLKNLLKLYDGN-----WELIEEDNYRVLADAIFEEQE 63 (65)
T ss_pred CHHHHHHHHHHHHHHHcCC-----chhhhcccHHHHHHHHHHHhh
Confidence 6678999999999999864 6544 4899999999987663
No 235
>PRK09482 flap endonuclease-like protein; Provisional
Probab=38.17 E-value=19 Score=33.35 Aligned_cols=18 Identities=22% Similarity=0.433 Sum_probs=14.6
Q ss_pred HhcCCCCcHHHHHHHHHH
Q psy9547 117 LLSIKGVGDYTAGALASI 134 (363)
Q Consensus 117 L~~lpGIG~~tA~~il~~ 134 (363)
+-.+||||++||.-++.-
T Consensus 184 IpGVpGIG~KtA~~LL~~ 201 (256)
T PRK09482 184 IPGVAGIGPKSAAELLNQ 201 (256)
T ss_pred CCCCCCcChHHHHHHHHH
Confidence 455899999999988764
No 236
>CHL00137 rps13 ribosomal protein S13; Validated
Probab=37.26 E-value=21 Score=29.14 Aligned_cols=22 Identities=23% Similarity=0.229 Sum_probs=19.1
Q ss_pred HHHHhcCCCCcHHHHHHHHHHh
Q psy9547 114 KKQLLSIKGVGDYTAGALASIC 135 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~~ 135 (363)
.-.|.++.|||+.+|..|+..+
T Consensus 16 ~~aLt~i~GIG~~~A~~ic~~l 37 (122)
T CHL00137 16 EYALTYIYGIGLTSAKEILEKA 37 (122)
T ss_pred eeeecccccccHHHHHHHHHHc
Confidence 5678999999999999998863
No 237
>PTZ00134 40S ribosomal protein S18; Provisional
Probab=37.18 E-value=19 Score=30.75 Aligned_cols=29 Identities=24% Similarity=0.237 Sum_probs=22.2
Q ss_pred CCCCccc---HHHHhcCCCCcHHHHHHHHHHh
Q psy9547 107 GGEVPRD---KKQLLSIKGVGDYTAGALASIC 135 (363)
Q Consensus 107 ~g~~p~~---~~~L~~lpGIG~~tA~~il~~~ 135 (363)
|-++|.+ .-.|..|.|||+.+|..|+.-+
T Consensus 19 ~~~l~~~K~v~~aLt~I~GIG~~~A~~I~~~l 50 (154)
T PTZ00134 19 NTNVDGKRKVPYALTAIKGIGRRFAYLVCKKA 50 (154)
T ss_pred CccCCCCCEEEEeecccccccHHHHHHHHHHc
Confidence 3445543 5678999999999999998763
No 238
>COG1491 Predicted RNA-binding protein [Translation, ribosomal structure and biogenesis]
Probab=37.06 E-value=25 Score=30.76 Aligned_cols=26 Identities=27% Similarity=0.239 Sum_probs=21.7
Q ss_pred HHHHhcCCCCcHHHHHHHHHHhcCCc
Q psy9547 114 KKQLLSIKGVGDYTAGALASICYNIP 139 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~~~~~~ 139 (363)
+-.|.-|||||.|+...||----.+|
T Consensus 129 LH~LELLpGiGkK~m~~ILeERkkkp 154 (202)
T COG1491 129 LHQLELLPGIGKKTMWAILEERKKKP 154 (202)
T ss_pred HHHHHhcccccHHHHHHHHHHHhcCC
Confidence 67788899999999999998655555
No 239
>cd00008 53EXOc 5'-3' exonuclease; T5 type 5'-3' exonuclease domains may co-occur with DNA polymerase I (Pol I) domains, or be part of Pol I containing complexes. They digest dsDNA and ssDNA, releasing mono-,di- and tri-nucleotides, as well as oligonucleotides, and have also been reported to possess RNase H activity. Also called 5' nuclease family, involved in structure-specific cleavage of flaps formed by Pol I activity (similar to mammalian flap endonuclease I, FEN-1). A single nucleic acid strand may be threaded through the 5' nuclease enzyme before cleavage occurs. The domain binds two divalent metal ions which are necessary for activity.
Probab=35.22 E-value=23 Score=32.41 Aligned_cols=18 Identities=28% Similarity=0.473 Sum_probs=14.2
Q ss_pred HhcCCCCcHHHHHHHHHH
Q psy9547 117 LLSIKGVGDYTAGALASI 134 (363)
Q Consensus 117 L~~lpGIG~~tA~~il~~ 134 (363)
+-.+||||++||.-++.-
T Consensus 185 ipGv~GiG~ktA~~Ll~~ 202 (240)
T cd00008 185 IPGVPGIGEKTAAKLLKE 202 (240)
T ss_pred CCCCCccCHHHHHHHHHH
Confidence 456899999999877653
No 240
>COG0864 NikR Predicted transcriptional regulators containing the CopG/Arc/MetJ DNA-binding domain and a metal-binding domain [Transcription]
Probab=34.97 E-value=49 Score=27.60 Aligned_cols=32 Identities=22% Similarity=0.363 Sum_probs=27.2
Q ss_pred HHHHHHHhccCchHHHHHHHHHHHHhhHHhCC
Q psy9547 77 DNVLKMWEGLGYYSRVRNFQAGCRQVIEQFGG 108 (363)
Q Consensus 77 ~el~~~~~~lG~~~rA~~l~~~a~~i~~~~~g 108 (363)
+++-++++.-||.+|++.|.++.+..++++..
T Consensus 17 ~elD~~i~~rg~~sRSE~IrdAir~yl~e~~~ 48 (136)
T COG0864 17 EELDELIEERGYSSRSELIRDALREYLEEYRW 48 (136)
T ss_pred HHHHHHHHHcCCCcHHHHHHHHHHHHHHHhhh
Confidence 46667777899999999999999999998643
No 241
>smart00475 53EXOc 5'-3' exonuclease.
Probab=34.87 E-value=24 Score=32.80 Aligned_cols=18 Identities=28% Similarity=0.473 Sum_probs=14.5
Q ss_pred HhcCCCCcHHHHHHHHHH
Q psy9547 117 LLSIKGVGDYTAGALASI 134 (363)
Q Consensus 117 L~~lpGIG~~tA~~il~~ 134 (363)
+-.+||||++||.-++.-
T Consensus 188 ipGV~GIG~KtA~~Ll~~ 205 (259)
T smart00475 188 IPGVPGIGEKTAAKLLKE 205 (259)
T ss_pred CCCCCCCCHHHHHHHHHH
Confidence 455899999999987753
No 242
>KOG2573|consensus
Probab=34.73 E-value=85 Score=30.97 Aligned_cols=73 Identities=22% Similarity=0.353 Sum_probs=40.1
Q ss_pred HHHHHHHHHHHHHHHhhcCCCCCCCCCCChHHHHHHHHHhhhcchhhHHHHHHHHHHhCCCHHHHhcCCHHHHHHHHhcc
Q psy9547 7 AKEILAFQESILTWYKQNARQLPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAFDTEDNVLKMWEGL 86 (363)
Q Consensus 7 ~~~~~~~~~~l~~~~~~~~~~lpw~~~~dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~pt~~~la~a~~~el~~~~~~l 86 (363)
+.+|+.|.-++.+||++| |..|+..+=-+.+ |.++..-+.+-+.+ .++.+...|..+
T Consensus 180 DKDINtfaMRirEwYswh------------FPEL~kiv~DNy~--------ya~~~~~i~dk~~l---~ed~~~~~~e~l 236 (498)
T KOG2573|consen 180 DKDINTFAMRIREWYSWH------------FPELVKIVPDNYK--------YAKVVKFIVDKEKL---NEDGLHELLEDL 236 (498)
T ss_pred hhhhhhhHHHHHHHHhhc------------cHHHHHhccchHH--------HHHHHHHHhchhhc---cccchhHHHHHh
Confidence 468899999999999986 2333333322222 12222222233332 334444555567
Q ss_pred CchH-HHHHHHHHHHHh
Q psy9547 87 GYYS-RVRNFQAGCRQV 102 (363)
Q Consensus 87 G~~~-rA~~l~~~a~~i 102 (363)
|... ||+.|+++++.=
T Consensus 237 ~~d~~ka~~Iiea~k~S 253 (498)
T KOG2573|consen 237 GVDSEKAQEIIEAAKNS 253 (498)
T ss_pred cCcHHHHHHHHHHHhhc
Confidence 7654 777777766553
No 243
>TIGR01448 recD_rel helicase, putative, RecD/TraA family. This model describes a family similar to RecD, the exodeoxyribonuclease V alpha chain of TIGR01447. Members of this family, however, are not found in a context of RecB and RecC and are longer by about 200 amino acids at the amino end. Chlamydia muridarum has both a member of this family and a RecD.
Probab=34.34 E-value=53 Score=35.30 Aligned_cols=62 Identities=13% Similarity=0.269 Sum_probs=38.9
Q ss_pred HHHHhcCCHHHHHHHHhccCchHHHHHH-HHHHHHhhHHhCCC----CcccHHHHhcCCCCcHHHHHHHHH
Q psy9547 68 IKDFAFDTEDNVLKMWEGLGYYSRVRNF-QAGCRQVIEQFGGE----VPRDKKQLLSIKGVGDYTAGALAS 133 (363)
Q Consensus 68 ~~~la~a~~~el~~~~~~lG~~~rA~~l-~~~a~~i~~~~~g~----~p~~~~~L~~lpGIG~~tA~~il~ 133 (363)
.+.+.-.+.+.+...|.+--+.. | ...|+.|++.||.+ +.++.+.|.++||||+++++.|..
T Consensus 69 ~~~~~p~~~~~i~~yL~s~~~~G----IG~~~A~~iv~~fg~~~~~~i~~~~~~L~~v~gi~~~~~~~i~~ 135 (720)
T TIGR01448 69 MELEAPTSKEGIVAYLSSRSIKG----VGKKLAQRIVKTFGEAAFDVLDDDPEKLLEVPGISKANLEKFVS 135 (720)
T ss_pred EEEeCCCCHHHHHHHHhcCCCCC----cCHHHHHHHHHHhCHhHHHHHHhCHHHHhcCCCCCHHHHHHHHH
Confidence 34444456677777776322210 0 22355556777654 234577899999999999998876
No 244
>KOG1338|consensus
Probab=34.23 E-value=1.9e+02 Score=28.67 Aligned_cols=96 Identities=7% Similarity=0.154 Sum_probs=61.2
Q ss_pred ChHHHHHHHHHhhhcchhhHHHHHHHHHHhCCCHHHHhcC---CHHHHHHHHhccCchH--H-HHHHHHHHHHhhHHhCC
Q psy9547 35 NPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAFD---TEDNVLKMWEGLGYYS--R-VRNFQAGCRQVIEQFGG 108 (363)
Q Consensus 35 dp~~~lv~~il~qqt~~~~v~~~~~~l~~~~pt~~~la~a---~~~el~~~~~~lG~~~--r-A~~l~~~a~~i~~~~~g 108 (363)
..|..|+.+++.-.-.-.. ..|.-.+..||+|+++-.. +++|+..++++.+... | +..|.+--..++..+++
T Consensus 88 gsw~~Lllvll~E~~~pq~--SrWrPYfs~wp~p~rm~spifWdEnEl~~Ll~stvlee~~Kd~aeI~~~~i~~i~pf~~ 165 (466)
T KOG1338|consen 88 GSWGMLLLVLLREKKMPQK--SRWRPYFSRWPQPARMHSPIFWDENELSMLLCSTVLEETVKDKAEIEKDFIFVIQPFKQ 165 (466)
T ss_pred CcHHHHHHHHHHHhhcccc--cccccHHHhCCChhhcCCCccCCchHHHHHhhcccchhhHhHHHHHHHHHHHHHHHHHH
Confidence 3456666665544332211 2333446789999998765 7899999998999872 3 56666665566666666
Q ss_pred CCcc-----cHHHHhcCCCCcHHHHHHHHHHhcCCc
Q psy9547 109 EVPR-----DKKQLLSIKGVGDYTAGALASICYNIP 139 (363)
Q Consensus 109 ~~p~-----~~~~L~~lpGIG~~tA~~il~~~~~~~ 139 (363)
..|. +.+.. -|.++.++.+.|+.+
T Consensus 166 ~~p~vfs~~slEdF-------~y~~Al~laysfdve 194 (466)
T KOG1338|consen 166 HCPIVFSRPSLEDF-------MYAYALGLAYSFDVE 194 (466)
T ss_pred hCcchhcccCHHHH-------HHHHHHHHHHheeee
Confidence 6664 24444 357778888887654
No 245
>PRK05179 rpsM 30S ribosomal protein S13; Validated
Probab=33.23 E-value=28 Score=28.48 Aligned_cols=22 Identities=23% Similarity=0.249 Sum_probs=19.0
Q ss_pred HHHHhcCCCCcHHHHHHHHHHh
Q psy9547 114 KKQLLSIKGVGDYTAGALASIC 135 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~~ 135 (363)
.-.|.++.|||+.+|..|+..+
T Consensus 16 ~~aL~~I~GIG~~~a~~i~~~l 37 (122)
T PRK05179 16 VIALTYIYGIGRTRAKEILAAA 37 (122)
T ss_pred EeeecccccccHHHHHHHHHHh
Confidence 5678999999999999998763
No 246
>PRK04053 rps13p 30S ribosomal protein S13P; Reviewed
Probab=33.09 E-value=32 Score=29.18 Aligned_cols=29 Identities=34% Similarity=0.454 Sum_probs=22.3
Q ss_pred CCCCccc---HHHHhcCCCCcHHHHHHHHHHh
Q psy9547 107 GGEVPRD---KKQLLSIKGVGDYTAGALASIC 135 (363)
Q Consensus 107 ~g~~p~~---~~~L~~lpGIG~~tA~~il~~~ 135 (363)
+-++|.. .-.|.++.|||+.+|..|+..+
T Consensus 14 ~~~i~~~k~i~~aLt~IyGIG~~~a~~Ic~~l 45 (149)
T PRK04053 14 GTDLDGTKPVEYALTGIKGIGRRTARAIARKL 45 (149)
T ss_pred CccCCCCCEEeeeccccccccHHHHHHHHHHc
Confidence 3455543 5678999999999999998753
No 247
>TIGR01448 recD_rel helicase, putative, RecD/TraA family. This model describes a family similar to RecD, the exodeoxyribonuclease V alpha chain of TIGR01447. Members of this family, however, are not found in a context of RecB and RecC and are longer by about 200 amino acids at the amino end. Chlamydia muridarum has both a member of this family and a RecD.
Probab=32.55 E-value=41 Score=36.12 Aligned_cols=17 Identities=24% Similarity=0.182 Sum_probs=12.8
Q ss_pred hcCCCCcHHHHHHHHHH
Q psy9547 118 LSIKGVGDYTAGALASI 134 (363)
Q Consensus 118 ~~lpGIG~~tA~~il~~ 134 (363)
++=-||++..|.-|..+
T Consensus 150 L~~~gi~~~~a~ki~~~ 166 (720)
T TIGR01448 150 LQGLGIGIKLAQRIYKF 166 (720)
T ss_pred HHHcCCCHHHHHHHHHH
Confidence 34479999998887764
No 248
>smart00611 SEC63 Domain of unknown function in Sec63p, Brr2p and other proteins.
Probab=32.36 E-value=84 Score=29.59 Aligned_cols=44 Identities=18% Similarity=0.190 Sum_probs=33.3
Q ss_pred ccCchHHHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHH
Q psy9547 85 GLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALA 132 (363)
Q Consensus 85 ~lG~~~rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il 132 (363)
..|+.+-|.++.++++.|.. |..++ ...|.+|||||+..+..+-
T Consensus 125 ~~~~~~~~~~~l~L~q~i~q---~~w~~-~~~L~Qlp~i~~~~~~~l~ 168 (312)
T smart00611 125 ERGWLSTALNALNLSQMIIQ---ALWPT-DSPLLQLPHLPEEILKRLE 168 (312)
T ss_pred hcchHHHHHHHHHHHHHHHH---hhCCC-CCccccCCCCCHHHHHHHH
Confidence 45776678889999999987 44443 3469999999998877654
No 249
>PRK00254 ski2-like helicase; Provisional
Probab=31.52 E-value=52 Score=35.24 Aligned_cols=42 Identities=19% Similarity=0.200 Sum_probs=28.5
Q ss_pred HHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHHHhcCCc
Q psy9547 94 NFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALASICYNIP 139 (363)
Q Consensus 94 ~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~~~~~~~ 139 (363)
.|.++++.|.. | ++++.-.|.++||||+.+|...+..+|+-.
T Consensus 628 ~l~~l~~rl~~---g-~~~~~~~L~~ipgig~~~~~~l~~~g~~s~ 669 (720)
T PRK00254 628 YLETLHLRVKH---G-VREELLELMRLPMIGRKRARALYNAGFRSI 669 (720)
T ss_pred HHHHHHHHHHc---C-CCHHHhhhhcCCCCCHHHHHHHHHccCCCH
Confidence 44566667664 4 455566677888888888888777766654
No 250
>PF04919 DUF655: Protein of unknown function (DUF655); InterPro: IPR007003 This family includes several uncharacterised archaeal proteins.; PDB: 2I5H_A.
Probab=31.31 E-value=33 Score=29.94 Aligned_cols=26 Identities=19% Similarity=0.171 Sum_probs=17.5
Q ss_pred HHHHhcCCCCcHHHHHHHHHHhcCCc
Q psy9547 114 KKQLLSIKGVGDYTAGALASICYNIP 139 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~~~~~~ 139 (363)
+-.|.-|||||.|+...|+----..|
T Consensus 115 lH~LeLLPGIGKK~m~~ILeERkkkp 140 (181)
T PF04919_consen 115 LHSLELLPGIGKKTMWKILEERKKKP 140 (181)
T ss_dssp SBGGGGSTT--HHHHHHHHHHHHHS-
T ss_pred HHHHhhcccccHHHHHHHHHHHccCC
Confidence 55677799999999999997543333
No 251
>PF14056 DUF4250: Domain of unknown function (DUF4250)
Probab=31.16 E-value=83 Score=21.87 Aligned_cols=30 Identities=13% Similarity=0.345 Sum_probs=26.1
Q ss_pred HHHHhCCCHHHHh---cCCHHHHHHHHhccCch
Q psy9547 60 KFIKTYPTIKDFA---FDTEDNVLKMWEGLGYY 89 (363)
Q Consensus 60 ~l~~~~pt~~~la---~a~~~el~~~~~~lG~~ 89 (363)
+|+..|+|.++|. .++.++|.+-+..+||.
T Consensus 15 kLRD~~~sLd~Lc~~~~id~~~l~~kL~~~Gy~ 47 (55)
T PF14056_consen 15 KLRDEYSSLDELCYDYDIDKEELEEKLASIGYE 47 (55)
T ss_pred HHHhccCCHHHHHHHhCCCHHHHHHHHHHcCCe
Confidence 5778899999987 56999999999999994
No 252
>KOG3835|consensus
Probab=31.14 E-value=76 Score=30.98 Aligned_cols=56 Identities=23% Similarity=0.390 Sum_probs=44.7
Q ss_pred chhhHHHHHHHHHHhC-CCHHHHhcCCHHHHHHHHhccCchH---HHHHHHHHHHHhhHH
Q psy9547 50 QVKTVLPYYEKFIKTY-PTIKDFAFDTEDNVLKMWEGLGYYS---RVRNFQAGCRQVIEQ 105 (363)
Q Consensus 50 ~~~~v~~~~~~l~~~~-pt~~~la~a~~~el~~~~~~lG~~~---rA~~l~~~a~~i~~~ 105 (363)
+-++-..||..|++.- .+++.|.++.++|.+++..-+|+-. -.|+++++.+....+
T Consensus 19 qkANLlsYyd~FIqQGGDDvqQlceagEeEFLEIMaLVGMa~KPLHVRRlQkALrew~tn 78 (495)
T KOG3835|consen 19 QKANLLSYYDVFIQQGGDDVQQLCEAGEEEFLEIMALVGMAPKPLHVRRLQKALREWSTN 78 (495)
T ss_pred HHhhHHHHHHHHHHhcchHHHHHHHhhHHHHHHHHHHhccCCcchhHHHHHHHHHHHhcC
Confidence 4567788999999754 5899999999999999999899864 477887776665554
No 253
>PF07803 GSG-1: GSG1-like protein; InterPro: IPR012478 This family contains sequences bearing similarity to a region of GSG1 (Q9Z1H7 from SWISSPROT), a protein specifically expressed in testicular germ cells []. It is possible that over expression of the human homologue may be involved in tumourigenesis of human testicular germ cell tumours []. The region in question has four highly conserved cysteine residues.
Probab=31.14 E-value=75 Score=25.64 Aligned_cols=53 Identities=13% Similarity=0.359 Sum_probs=36.3
Q ss_pred CCCCCCCCCcccChhhhcCCcccCCcccCCccccceeEEEEEEEeCCceEEEeecCCCcccccccc
Q psy9547 202 NPKCKECPLSRFCSAYKNNTIENFPVKLGKTKVKDVFLLTVVVKTDTNKYLIQKRPTTGLLSNFYM 267 (363)
Q Consensus 202 ~P~C~~Cpl~~~C~~~~~~~~~~~p~k~~k~~~~~~~v~~~ii~~~~~~vLl~~R~~~~~~~glWE 267 (363)
+|.|+. +....|-.+.......- ......+|-.+...+||++++.+ +..|+|-
T Consensus 40 KPlC~~-~~~~~Ci~~~~~~~~~~-------~~~~~~~VqY~WETGDDkF~fr~-----FHtG~W~ 92 (118)
T PF07803_consen 40 KPLCGK-GKGTNCIHFPSNSDSGS-------NTSDSNVVQYIWETGDDKFIFRY-----FHTGIWL 92 (118)
T ss_pred CCCCCc-cccCcCcCCCCCCcccc-------cCCCCceeEEEEecCCceeehhh-----hhcchhh
Confidence 799955 77788988765443311 11234677788889989998764 6788885
No 254
>PF11372 DUF3173: Domain of unknown function (DUF3173); InterPro: IPR021512 This family of proteins with unknown function appears to be restricted to Firmicutes.
Probab=31.09 E-value=1.6e+02 Score=20.82 Aligned_cols=27 Identities=19% Similarity=0.481 Sum_probs=19.2
Q ss_pred CHHHHHHHHhccCchH-HHHHHHHHHHHhhHH
Q psy9547 75 TEDNVLKMWEGLGYYS-RVRNFQAGCRQVIEQ 105 (363)
Q Consensus 75 ~~~el~~~~~~lG~~~-rA~~l~~~a~~i~~~ 105 (363)
+-++|.+ +||.. .|+.|++-|+.+.=+
T Consensus 5 ~k~dLi~----lGf~~~tA~~IIrqAK~~lV~ 32 (59)
T PF11372_consen 5 TKKDLIE----LGFSESTARDIIRQAKALLVQ 32 (59)
T ss_pred CHHHHHH----cCCCHHHHHHHHHHHHHHHHH
Confidence 3455555 79985 899988888776544
No 255
>COG3547 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=30.89 E-value=2e+02 Score=26.63 Aligned_cols=37 Identities=38% Similarity=0.517 Sum_probs=24.8
Q ss_pred HHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHH
Q psy9547 94 NFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALA 132 (363)
Q Consensus 94 ~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il 132 (363)
.+..+.+.+........+... |.++||||+-+|..|.
T Consensus 169 ~i~~l~~~~~~~~~~~~~~~~--l~~~pgig~~~a~~i~ 205 (303)
T COG3547 169 EIAALDREIEARLGIGVVTDR--LASIPGIGELTAAAIA 205 (303)
T ss_pred HHHHHHHHHHHhhcccchHHH--HHhCCCccHHHHHHHH
Confidence 344445555544334344444 8899999999999988
No 256
>COG1255 Uncharacterized protein conserved in archaea [Function unknown]
Probab=30.73 E-value=57 Score=26.47 Aligned_cols=85 Identities=20% Similarity=0.108 Sum_probs=49.4
Q ss_pred CCCcHHHHHHHHHHhcCCccccccchHHHHHhhhcccccCcCCcccHHHHHHHHHhcCCcccHHHHHHHHHHHhhhhCcC
Q psy9547 121 KGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALMDLGATVCLF 200 (363)
Q Consensus 121 pGIG~~tA~~il~~~~~~~~~~vD~~v~Rvl~Rl~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~l~~~G~~~C~~ 200 (363)
=|||.++.=+=.+.-.|-.++.+|.|-.+.=.-+-.+..++.+ .+ ..+.+-+..+..-+-+.+++.+++++++.+=
T Consensus 20 VGiG~~~~VA~~L~e~g~dv~atDI~~~~a~~g~~~v~DDitn-P~-~~iY~~A~lIYSiRpppEl~~~ildva~aVg-- 95 (129)
T COG1255 20 VGIGFFLDVAKRLAERGFDVLATDINEKTAPEGLRFVVDDITN-PN-ISIYEGADLIYSIRPPPELQSAILDVAKAVG-- 95 (129)
T ss_pred EccchHHHHHHHHHHcCCcEEEEecccccCcccceEEEccCCC-cc-HHHhhCccceeecCCCHHHHHHHHHHHHhhC--
Confidence 3999987655555556667888898866333222222223322 12 2233334444433446889999999999884
Q ss_pred CCCCCCCCCCc
Q psy9547 201 KNPKCKECPLS 211 (363)
Q Consensus 201 ~~P~C~~Cpl~ 211 (363)
-.|-.-||.
T Consensus 96 --a~l~I~pL~ 104 (129)
T COG1255 96 --APLYIKPLT 104 (129)
T ss_pred --CCEEEEecC
Confidence 346555553
No 257
>PF13297 Telomere_Sde2_2: Telomere stability C-terminal
Probab=30.39 E-value=96 Score=21.96 Aligned_cols=34 Identities=12% Similarity=0.177 Sum_probs=27.1
Q ss_pred HhCCCHHHHhcCCHHHHHHHHhccCc------hHHHHHHH
Q psy9547 63 KTYPTIKDFAFDTEDNVLKMWEGLGY------YSRVRNFQ 96 (363)
Q Consensus 63 ~~~pt~~~la~a~~~el~~~~~~lG~------~~rA~~l~ 96 (363)
..|-|+++|.....|.+...+..+|+ ..||.+|.
T Consensus 4 ~~f~sa~eLe~lGldrLK~~L~a~GLKcGGTl~ERA~RLf 43 (60)
T PF13297_consen 4 DAFSSAEELEALGLDRLKSALMALGLKCGGTLQERAARLF 43 (60)
T ss_pred hhcCCHHHHHHhCHHHHHHHHHHcCCccCCCHHHHHHHHH
Confidence 46889999999999999999988876 12666664
No 258
>TIGR00596 rad1 DNA repair protein (rad1). This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford Universit
Probab=30.28 E-value=86 Score=34.20 Aligned_cols=41 Identities=15% Similarity=0.250 Sum_probs=31.2
Q ss_pred HHHHHHHhCCCHHHHhcCCHHHHHHHHhccCchHHHHHHHHHHH
Q psy9547 57 YYEKFIKTYPTIKDFAFDTEDNVLKMWEGLGYYSRVRNFQAGCR 100 (363)
Q Consensus 57 ~~~~l~~~~pt~~~la~a~~~el~~~~~~lG~~~rA~~l~~~a~ 100 (363)
.+.+++.+|.|.++|++++.+||.+. +|-..+|+.|.++..
T Consensus 769 ~a~~ll~~f~si~~l~~as~eeL~~~---iG~~~~A~~i~~fl~ 809 (814)
T TIGR00596 769 NYRNLRKKVKSIRELAKLSQNELNEL---IGDEEAAKRLYDFLR 809 (814)
T ss_pred HHHHHHHHcCCHHHHHhCCHHHHHHH---hCCHHHHHHHHHHhc
Confidence 34567788999999999999998885 773347888877543
No 259
>TIGR03631 bact_S13 30S ribosomal protein S13. This model describes bacterial ribosomal protein S13, to the exclusion of the homologous archaeal S13P and eukaryotic ribosomal protein S18. This model identifies some (but not all) instances of chloroplast and mitochondrial S13, which is of bacterial type.
Probab=30.24 E-value=31 Score=27.79 Aligned_cols=21 Identities=24% Similarity=0.259 Sum_probs=18.5
Q ss_pred HHHHhcCCCCcHHHHHHHHHH
Q psy9547 114 KKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~ 134 (363)
.-.|.++.|||+.+|..|+..
T Consensus 14 ~~aL~~i~GIG~~~a~~i~~~ 34 (113)
T TIGR03631 14 EIALTYIYGIGRTRARKILEK 34 (113)
T ss_pred eeeeeeeecccHHHHHHHHHH
Confidence 566889999999999999876
No 260
>COG0099 RpsM Ribosomal protein S13 [Translation, ribosomal structure and biogenesis]
Probab=29.97 E-value=38 Score=27.50 Aligned_cols=28 Identities=21% Similarity=0.324 Sum_probs=21.8
Q ss_pred CCCccc---HHHHhcCCCCcHHHHHHHHHHh
Q psy9547 108 GEVPRD---KKQLLSIKGVGDYTAGALASIC 135 (363)
Q Consensus 108 g~~p~~---~~~L~~lpGIG~~tA~~il~~~ 135 (363)
-++|.+ .-.|..|.|||..+|..|+.-+
T Consensus 7 vdip~~K~v~iALt~IyGIG~~~a~~I~~~~ 37 (121)
T COG0099 7 VDIPGNKRVVIALTYIYGIGRRRAKEICKKA 37 (121)
T ss_pred cCCCCCceEeehhhhhccccHHHHHHHHHHc
Confidence 345643 4568899999999999999764
No 261
>PRK14666 uvrC excinuclease ABC subunit C; Provisional
Probab=29.87 E-value=99 Score=32.94 Aligned_cols=23 Identities=26% Similarity=0.393 Sum_probs=19.0
Q ss_pred ccHHHHhcCCCCcHHHHHHHHHH
Q psy9547 112 RDKKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il~~ 134 (363)
.+.++|.+++|||+++|..|+.+
T Consensus 666 AS~eELa~V~Gig~k~Ae~I~~~ 688 (694)
T PRK14666 666 AGEEGLAAVPGIGPARAAALHEH 688 (694)
T ss_pred cCHHHHHhcCCcCHHHHHHHHHH
Confidence 46778888999999999988765
No 262
>PF04891 NifQ: NifQ; InterPro: IPR006975 NifQ is involved in early stages of the biosynthesis of the iron-molybdenum cofactor (FeMo-co) [], which is an integral part of the active site of dinitrogenase []. The conserved C-terminal cysteine residues may be involved in metal binding [].; GO: 0030151 molybdenum ion binding, 0009399 nitrogen fixation
Probab=29.68 E-value=1.9e+02 Score=24.95 Aligned_cols=75 Identities=21% Similarity=0.369 Sum_probs=43.6
Q ss_pred CcHHHHHHHHHHhcCCccccccc------hHHHHHhhhcccccCcCCcccHHHHHHHHHhcCCcccHHHH--HHHHHHHh
Q psy9547 123 VGDYTAGALASICYNIPTPAVDG------NVFRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTNAREF--NQALMDLG 194 (363)
Q Consensus 123 IG~~tA~~il~~~~~~~~~~vD~------~v~Rvl~Rl~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~--~~~l~~~G 194 (363)
.+.|.|.+|..-+++-.-.=-|- .+.+.+.|+|- .++......=.|..| .+.--.-|
T Consensus 83 ~~~wlA~iiA~~~l~~~HLWqDLGL~~R~eLs~Lm~r~Fp---------------~Laa~N~~~MrWKKFfYrqlCe~eG 147 (167)
T PF04891_consen 83 EARWLAHIIARACLGPDHLWQDLGLRSRAELSALMRRHFP---------------PLAARNTRNMRWKKFFYRQLCEREG 147 (167)
T ss_pred HHHHHHHHHHHHHcCCCccHHhcCCCCHHHHHHHHHHHhH---------------HHHHhccCCCcHHHHHHHHHHHHcC
Confidence 45688888888888754222221 23344444431 122222222235544 45556778
Q ss_pred hhhCcCCCCCCCCCCCcccC
Q psy9547 195 ATVCLFKNPKCKECPLSRFC 214 (363)
Q Consensus 195 ~~~C~~~~P~C~~Cpl~~~C 214 (363)
-.+|+ .|.|+.|.=...|
T Consensus 148 ~~~C~--aPsC~~C~D~~~C 165 (167)
T PF04891_consen 148 LYLCR--APSCEECSDYAVC 165 (167)
T ss_pred CCcCC--CCCCCCcCCHhhc
Confidence 89996 7999999877666
No 263
>COG0258 Exo 5'-3' exonuclease (including N-terminal domain of PolI) [DNA replication, recombination, and repair]
Probab=29.15 E-value=60 Score=30.85 Aligned_cols=14 Identities=43% Similarity=0.546 Sum_probs=12.1
Q ss_pred CCCCcHHHHHHHHH
Q psy9547 120 IKGVGDYTAGALAS 133 (363)
Q Consensus 120 lpGIG~~tA~~il~ 133 (363)
+||||++||--++.
T Consensus 203 V~GIG~ktA~~Ll~ 216 (310)
T COG0258 203 VKGIGPKTALKLLQ 216 (310)
T ss_pred CCCcCHHHHHHHHH
Confidence 89999999987765
No 264
>TIGR03629 arch_S13P archaeal ribosomal protein S13P. This model describes exclusively the archaeal ribosomal protein S13P. It excludes the homologous eukaryotic 40S ribosomal protein S18 and bacterial 30S ribosomal protein S13.
Probab=29.11 E-value=33 Score=28.90 Aligned_cols=22 Identities=36% Similarity=0.401 Sum_probs=18.7
Q ss_pred HHHHhcCCCCcHHHHHHHHHHh
Q psy9547 114 KKQLLSIKGVGDYTAGALASIC 135 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~~ 135 (363)
.-.|.++.|||+.+|..|+..+
T Consensus 20 ~~aLt~I~GIG~~~a~~I~~~l 41 (144)
T TIGR03629 20 EYALTGIKGIGRRFARAIARKL 41 (144)
T ss_pred EEeecceeccCHHHHHHHHHHc
Confidence 5678999999999999997753
No 265
>PF01706 FliG_C: FliG C-terminal domain; InterPro: IPR023087 The flagellar motor switch in Escherichia coli and Salmonella typhimurium regulates the direction of flagellar rotation and hence controls swimming behaviour []. The switch is a complex apparatus that responds to signals transduced by the chemotaxis sensory signalling system during chemotactic behaviour []. CheY, the chemotaxis response regulator, is believed to act directly on the switch to induce tumbles in the swimming pattern, but no physical interactions of CheY and switch proteins have yet been demonstrated. The switch complex comprises at least three proteins - FliG, FliM and FliN. It has been shown that FliG interacts with FliM, FliM interacts with itself, and FliM interacts with FliN []. Several residues within the middle third of FliG appear to be strongly involved in the FliG-FliM interaction, with residues near the N- or C-termini being less important []. Such clustering suggests that FliG-FliM interaction plays a central role in switching. Analysis of the FliG, FliM and FliN sequences shows that none are especially hydrophobic or appear to be integral membrane proteins []. This result is consistent with other evidence suggesting that the proteins may be peripheral to the membrane, possibly mounted on the basal body M ring [, ]. FliG is present in about 25 copies per flagellum. This entry represents the C-terminal domain of FliG, the structure of which is known. This domain functions specifically in motor rotation [].; PDB: 3USY_B 3USW_A 3HJL_A 3AJC_A 1LKV_X 1QC7_B.
Probab=29.05 E-value=83 Score=24.85 Aligned_cols=55 Identities=11% Similarity=0.104 Sum_probs=35.2
Q ss_pred HHHHHHHhCCCHHHHhcCCHHHHHHHHhccCchHHHHHHHHHHHHhhHHhCCCCc
Q psy9547 57 YYEKFIKTYPTIKDFAFDTEDNVLKMWEGLGYYSRVRNFQAGCRQVIEQFGGEVP 111 (363)
Q Consensus 57 ~~~~l~~~~pt~~~la~a~~~el~~~~~~lG~~~rA~~l~~~a~~i~~~~~g~~p 111 (363)
...++.+...|.++|...+...+..+++.++...=|..|+.+...+.+.+-+.+|
T Consensus 11 la~~ir~~~f~F~dl~~l~~~~l~~ll~~v~~~~la~ALkga~~e~~~~il~nms 65 (110)
T PF01706_consen 11 LAEKIREKMFTFDDLVRLDDRDLQKLLREVDPDDLALALKGASEELREKILSNMS 65 (110)
T ss_dssp HHHHHHHHCS-GGGGGGS-HHHHHHHHTTS-HHHHHHHHCTS-HHHHHHHHTTS-
T ss_pred HHHHHHHHhhhHHHHHcCCHHHHHHHHHHCCHhHHHHHHccCCHHHHHHHHHHcC
Confidence 3456678899999999999999999999888644455555555555554444444
No 266
>PRK00558 uvrC excinuclease ABC subunit C; Validated
Probab=28.87 E-value=62 Score=33.94 Aligned_cols=20 Identities=30% Similarity=0.348 Sum_probs=12.7
Q ss_pred HHHHhcCCCCcHHHHHHHHH
Q psy9547 114 KKQLLSIKGVGDYTAGALAS 133 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~ 133 (363)
.-.|..|||||++++..++.
T Consensus 542 ~s~L~~IpGIG~k~~k~Ll~ 561 (598)
T PRK00558 542 TSALDDIPGIGPKRRKALLK 561 (598)
T ss_pred hhhHhhCCCcCHHHHHHHHH
Confidence 34556677777777775554
No 267
>PRK14672 uvrC excinuclease ABC subunit C; Provisional
Probab=28.81 E-value=78 Score=33.65 Aligned_cols=32 Identities=9% Similarity=0.018 Sum_probs=24.8
Q ss_pred ccHHHHhcCCCCcHHHHHHHHHHhcCCccccc
Q psy9547 112 RDKKQLLSIKGVGDYTAGALASICYNIPTPAV 143 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il~~~~~~~~~~v 143 (363)
.+.++|.++||||..+|..|..-+-.-...||
T Consensus 637 As~eel~~v~gi~~~~A~~i~~~~~~~~~~~~ 668 (691)
T PRK14672 637 ATPQDIATAIHIPLTQAHTILHAATRSTTAPV 668 (691)
T ss_pred CCHHHHHhCCCCCHHHHHHHHHHhhcccccch
Confidence 57888999999999999999887765544443
No 268
>PF13751 DDE_Tnp_1_6: Transposase DDE domain
Probab=28.76 E-value=26 Score=28.20 Aligned_cols=16 Identities=38% Similarity=0.995 Sum_probs=14.2
Q ss_pred CCCCCCCCCcccChhh
Q psy9547 202 NPKCKECPLSRFCSAY 217 (363)
Q Consensus 202 ~P~C~~Cpl~~~C~~~ 217 (363)
...|..||+++.|...
T Consensus 30 ~~~C~~Cp~r~~C~~~ 45 (125)
T PF13751_consen 30 KEDCKNCPLREQCTKS 45 (125)
T ss_pred chhccCCCccccCCCc
Confidence 4689999999999876
No 269
>PRK04460 nickel responsive regulator; Provisional
Probab=28.65 E-value=73 Score=26.58 Aligned_cols=29 Identities=17% Similarity=0.288 Sum_probs=23.6
Q ss_pred HHHHHHHhccCchHHHHHHHHHHHHhhHH
Q psy9547 77 DNVLKMWEGLGYYSRVRNFQAGCRQVIEQ 105 (363)
Q Consensus 77 ~el~~~~~~lG~~~rA~~l~~~a~~i~~~ 105 (363)
+++.+.++..||.+|++.|.++.+..+..
T Consensus 15 ~~lD~~~~~~gy~sRSe~ird~ir~~l~e 43 (137)
T PRK04460 15 EKFDELIEEKGYQNRSEAIRDLIRDFLVE 43 (137)
T ss_pred HHHHHHHHHcCCCCHHHHHHHHHHHHHHH
Confidence 46667788899999999999998866654
No 270
>PF09373 PMBR: Pseudomurein-binding repeat; InterPro: IPR018975 Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum) is a methanogenic Gram-positive microorganism with a cell wall consisting of pseudomurein. This repeat specifically binds to pseudomurein. This repeat is found at the N terminus of PeiW and PeiP which are pseudomurein binding phage proteins.
Probab=28.12 E-value=1e+02 Score=18.77 Aligned_cols=22 Identities=27% Similarity=0.549 Sum_probs=19.4
Q ss_pred cCHHHHHHHHHHHHHHHhhcCC
Q psy9547 5 LSAKEILAFQESILTWYKQNAR 26 (363)
Q Consensus 5 ~~~~~~~~~~~~l~~~~~~~~~ 26 (363)
++.+++..+..++.+|+..++|
T Consensus 5 i~~~~~~d~a~rv~~f~~~ngR 26 (33)
T PF09373_consen 5 ISKEEYLDMASRVNNFYESNGR 26 (33)
T ss_pred ecHHHHHHHHHHHHHHHHHcCC
Confidence 4667888999999999999998
No 271
>PRK00254 ski2-like helicase; Provisional
Probab=27.95 E-value=1e+02 Score=33.00 Aligned_cols=23 Identities=17% Similarity=0.364 Sum_probs=19.5
Q ss_pred ccHHHHhcCCCCcHHHHHHHHHH
Q psy9547 112 RDKKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il~~ 134 (363)
.+.++|.+++|||+++|..|..+
T Consensus 675 a~~~el~~~~gi~~~~a~~i~~~ 697 (720)
T PRK00254 675 AKPSELLKVEGIGAKIVEGIFKH 697 (720)
T ss_pred CCHHHHhcCCCCCHHHHHHHHHH
Confidence 35788999999999999988776
No 272
>COG1623 Predicted nucleic-acid-binding protein (contains the HHH domain) [General function prediction only]
Probab=27.06 E-value=1e+02 Score=29.18 Aligned_cols=36 Identities=19% Similarity=0.535 Sum_probs=19.1
Q ss_pred HHHHhCCCHHHHhcCCHHHHHHHHhccCchH-HHHHHHHH
Q psy9547 60 KFIKTYPTIKDFAFDTEDNVLKMWEGLGYYS-RVRNFQAG 98 (363)
Q Consensus 60 ~l~~~~pt~~~la~a~~~el~~~~~~lG~~~-rA~~l~~~ 98 (363)
++...|.+...|.+|+.+++.++ .|+| . |||.|++-
T Consensus 308 nlV~~F~~l~~il~As~edL~~V-eGIG--e~rAr~i~~G 344 (349)
T COG1623 308 NLVRAFGTLDGILEASAEDLDAV-EGIG--EARARAIKEG 344 (349)
T ss_pred HHHHHHhhHHHHHHhcHhHHhhh-cchh--HHHHHHHHHh
Confidence 34445666666666666666443 3333 2 55555543
No 273
>PRK02967 nickel responsive regulator; Provisional
Probab=26.85 E-value=80 Score=26.39 Aligned_cols=30 Identities=17% Similarity=0.211 Sum_probs=24.1
Q ss_pred HHHHHHHhccCchHHHHHHHHHHHHhhHHh
Q psy9547 77 DNVLKMWEGLGYYSRVRNFQAGCRQVIEQF 106 (363)
Q Consensus 77 ~el~~~~~~lG~~~rA~~l~~~a~~i~~~~ 106 (363)
+++.+.+...||.+|++.|+++.+..+..+
T Consensus 14 ~~lD~~~~~~gy~sRSeaIrd~iR~~l~~~ 43 (139)
T PRK02967 14 ETLDSLIARRGYQNRSEAIRDLLRAALAQE 43 (139)
T ss_pred HHHHHHHHHcCCCCHhHHHHHHHHHHHHHh
Confidence 456677788999999999999988776643
No 274
>TIGR02238 recomb_DMC1 meiotic recombinase Dmc1. This model describes DMC1, a subfamily of a larger family of DNA repair and recombination proteins. It is eukaryotic only and most closely related to eukaryotic RAD51. It also resembles archaeal RadA (TIGR02236) and RadB (TIGR02237) and bacterial RecA (TIGR02012). It has been characterized for human as a recombinase active only in meiosis.
Probab=26.68 E-value=1.5e+02 Score=28.39 Aligned_cols=44 Identities=16% Similarity=0.280 Sum_probs=34.3
Q ss_pred HHHHHH-hCCCHHHHhcCCHHHHHHHHhccCchH-HHHHHHHHHHHhhH
Q psy9547 58 YEKFIK-TYPTIKDFAFDTEDNVLKMWEGLGYYS-RVRNFQAGCRQVIE 104 (363)
Q Consensus 58 ~~~l~~-~~pt~~~la~a~~~el~~~~~~lG~~~-rA~~l~~~a~~i~~ 104 (363)
.++|.+ .|-|.++++.+++.+|.+. .|++. +|..|++.|.....
T Consensus 14 ~~~L~~~g~~t~~~~~~~~~~~L~~~---~gls~~~~~~i~~~~~~~~~ 59 (313)
T TIGR02238 14 IKKLKSAGICTVNGVIMTTRRALCKI---KGLSEAKVDKIKEAASKIIN 59 (313)
T ss_pred HHHHHHcCCCcHHHHHhCCHHHHHHh---cCCCHHHHHHHHHHHHhhhc
Confidence 345654 5889999999999999876 78874 88888887776654
No 275
>TIGR02236 recomb_radA DNA repair and recombination protein RadA. This family consists exclusively of archaeal RadA protein, a homolog of bacterial RecA (TIGR02012), eukaryotic RAD51 (TIGR02239), and archaeal RadB (TIGR02237). This protein is involved in DNA repair and recombination. The member from Pyrococcus horikoshii contains an intein.
Probab=26.55 E-value=1.4e+02 Score=28.12 Aligned_cols=41 Identities=17% Similarity=0.264 Sum_probs=31.1
Q ss_pred HHHHHHHh-CCCHHHHhcCCHHHHHHHHhccCchH-HHHHHHHHHH
Q psy9547 57 YYEKFIKT-YPTIKDFAFDTEDNVLKMWEGLGYYS-RVRNFQAGCR 100 (363)
Q Consensus 57 ~~~~l~~~-~pt~~~la~a~~~el~~~~~~lG~~~-rA~~l~~~a~ 100 (363)
...+|.+. |-|.++|+.+++++|.+. .|... .|..|.+.+.
T Consensus 11 ~~~~L~~~Gi~ti~dl~~~~~~~L~~~---~g~~~~~a~~l~~~~~ 53 (310)
T TIGR02236 11 TAEKLREAGYDTFEAIAVASPKELSEI---AGISEGTAAKIIQAAR 53 (310)
T ss_pred HHHHHHHcCCCCHHHHHcCCHHHHHhc---cCCCHHHHHHHHHHHH
Confidence 34556665 899999999999999875 57753 7777777665
No 276
>TIGR02793 nikR nickel-responsive transcriptional regulator NikR. Three members of the seed for this model, from Escherichia coli, Pseudomonas putida, and Brucella melitensis, are found associated with a nickel ABC transporter operon that acts to import nickel for use as a cofactor in urease or hydrogenase. These proteins, with characterized nickel-binding and DNA-binding domains, act as nickel-responsive transcriptional regulators. In the larger family of full-length homologs, most others both lack proximity to the nickel ABC transporter operon and form a separate clade. Several of the homologs not within the scope of this model, but rather scoring between the trusted and noise cutoffs, have been shown to bind nickel, copper, or both, and to regulate genes in response to nickel.
Probab=26.45 E-value=83 Score=25.92 Aligned_cols=29 Identities=17% Similarity=0.277 Sum_probs=23.0
Q ss_pred HHHHHHHhccCchHHHHHHHHHHHHhhHH
Q psy9547 77 DNVLKMWEGLGYYSRVRNFQAGCRQVIEQ 105 (363)
Q Consensus 77 ~el~~~~~~lG~~~rA~~l~~~a~~i~~~ 105 (363)
+++.+.+...||.+|++.|.++.+..+.+
T Consensus 13 ~~lD~~~~~~g~~~RSe~ir~~ir~~l~e 41 (129)
T TIGR02793 13 ETLDRLIARRGYQNRSEAIRDLLRSGLQQ 41 (129)
T ss_pred HHHHHHHHHcCCCCHHHHHHHHHHHHHHH
Confidence 45667778899999999999998866554
No 277
>PRK00630 nickel responsive regulator; Provisional
Probab=26.41 E-value=84 Score=26.61 Aligned_cols=29 Identities=21% Similarity=0.149 Sum_probs=23.2
Q ss_pred HHHHHHHhccCchHHHHHHHHHHHHhhHH
Q psy9547 77 DNVLKMWEGLGYYSRVRNFQAGCRQVIEQ 105 (363)
Q Consensus 77 ~el~~~~~~lG~~~rA~~l~~~a~~i~~~ 105 (363)
+++.+.++..||.+|++.|+++.+..+.+
T Consensus 23 ~~lD~~~~~~gy~sRSe~Ir~~iR~~l~e 51 (148)
T PRK00630 23 DELDNRIIKNGYSSRSELVRDLIREKLVE 51 (148)
T ss_pred HHHHHHHHHcCCCCHHHHHHHHHHHHHHH
Confidence 46667777889989999999998866543
No 278
>PRK07758 hypothetical protein; Provisional
Probab=26.03 E-value=2.1e+02 Score=22.31 Aligned_cols=45 Identities=18% Similarity=0.303 Sum_probs=29.5
Q ss_pred HHHHHHHH-HhCCCHHHHhcCCHHHHHHHHhccCchHHHHHHHHHHHHhhH
Q psy9547 55 LPYYEKFI-KTYPTIKDFAFDTEDNVLKMWEGLGYYSRVRNFQAGCRQVIE 104 (363)
Q Consensus 55 ~~~~~~l~-~~~pt~~~la~a~~~el~~~~~~lG~~~rA~~l~~~a~~i~~ 104 (363)
.+.++.|. +.+-|+++|+..+++||.++ +++| .|+ |.++-+.+.+
T Consensus 44 vRA~N~Lk~AGI~TL~dLv~~te~ELl~i-knlG--kKS--L~EIkekL~E 89 (95)
T PRK07758 44 APARRALEHHGIHTVEELSKYSEKEILKL-HGMG--PAS--LPKLRKALEE 89 (95)
T ss_pred HHHHHHHHHcCCCcHHHHHcCCHHHHHHc-cCCC--HHH--HHHHHHHHHH
Confidence 34455565 46889999999999999774 5555 343 4444344443
No 279
>COG1623 Predicted nucleic-acid-binding protein (contains the HHH domain) [General function prediction only]
Probab=25.53 E-value=50 Score=31.12 Aligned_cols=84 Identities=17% Similarity=0.248 Sum_probs=49.4
Q ss_pred cchhhHHHHHHHHH-HhCCCHHHHhcC----CHHHHH-HHHhccCch--HHH-HHHHHHHHHhhHHhCC---CCcccHHH
Q psy9547 49 TQVKTVLPYYEKFI-KTYPTIKDFAFD----TEDNVL-KMWEGLGYY--SRV-RNFQAGCRQVIEQFGG---EVPRDKKQ 116 (363)
Q Consensus 49 t~~~~v~~~~~~l~-~~~pt~~~la~a----~~~el~-~~~~~lG~~--~rA-~~l~~~a~~i~~~~~g---~~p~~~~~ 116 (363)
+....+....+.+. +.+++++.++.+ ...++. ..+.+-||. +|. |--......++..|+. .+..+.++
T Consensus 247 ~d~~~~l~~l~~ls~~~lL~~~~i~kvlGY~~~~~~~d~~vspRGYR~l~kIpRlp~~iv~nlV~~F~~l~~il~As~ed 326 (349)
T COG1623 247 VDIEAVLEELQSLSDEELLDPENIAKVLGYPKTTEADDSLVSPRGYRLLNKIPRLPFAIVENLVRAFGTLDGILEASAED 326 (349)
T ss_pred cCHHHHHHHHHhccchhhCCHHHHHHHhCCCCcchhcccccCchhhHHHhcCcCccHHHHHHHHHHHhhHHHHHHhcHhH
Confidence 45556666666554 346677666632 111121 245567774 232 2223445566666643 23457899
Q ss_pred HhcCCCCcHHHHHHHH
Q psy9547 117 LLSIKGVGDYTAGALA 132 (363)
Q Consensus 117 L~~lpGIG~~tA~~il 132 (363)
|.++.|||..-|.+|-
T Consensus 327 L~~VeGIGe~rAr~i~ 342 (349)
T COG1623 327 LDAVEGIGEARARAIK 342 (349)
T ss_pred HhhhcchhHHHHHHHH
Confidence 9999999999988764
No 280
>COG1498 SIK1 Protein implicated in ribosomal biogenesis, Nop56p homolog [Translation, ribosomal structure and biogenesis]
Probab=25.51 E-value=2.6e+02 Score=27.63 Aligned_cols=41 Identities=22% Similarity=0.293 Sum_probs=27.0
Q ss_pred HHHHHHHHHHHHHHHhhcCCCCC-CCCCCChHHHHHHHHHhh
Q psy9547 7 AKEILAFQESILTWYKQNARQLP-WRESNNPYYVWISEVMLQ 47 (363)
Q Consensus 7 ~~~~~~~~~~l~~~~~~~~~~lp-w~~~~dp~~~lv~~il~q 47 (363)
+++++.+..+|.+||+.|=+.+- --....-|-.+|+++-.-
T Consensus 129 Dk~iN~~~~RlrEWY~~hFPEL~~lv~~~~~Y~~~V~~~g~~ 170 (395)
T COG1498 129 DKEINLLAMRLREWYGWHFPELSSLVPDNEQYAKLVSALGNR 170 (395)
T ss_pred HHHHHHHHHHHHHHHhhcCcchhhhcccHHHHHHHHHHHcch
Confidence 46888999999999998744331 111244577777777443
No 281
>COG1468 CRISPR-associated protein Cas4 (RecB family exonuclease) [Defense mechanisms]
Probab=25.46 E-value=38 Score=29.91 Aligned_cols=17 Identities=35% Similarity=0.807 Sum_probs=14.8
Q ss_pred CCCCCCCCCCCcccChh
Q psy9547 200 FKNPKCKECPLSRFCSA 216 (363)
Q Consensus 200 ~~~P~C~~Cpl~~~C~~ 216 (363)
++++.|..|+++..|..
T Consensus 172 ~~~~~C~~C~y~~iC~~ 188 (190)
T COG1468 172 KKKKKCKKCAYREICFP 188 (190)
T ss_pred CCCCcCCCCCcceeccC
Confidence 36899999999999974
No 282
>PF03686 UPF0146: Uncharacterised protein family (UPF0146); InterPro: IPR005353 The function of this family of proteins is unknown.; PDB: 2K4M_A.
Probab=25.31 E-value=16 Score=30.07 Aligned_cols=74 Identities=18% Similarity=0.059 Sum_probs=37.4
Q ss_pred CCcHHHHHHHHHHhcCCccccccchHHHHHhhhcccccCcCCcccHHHHHHHHHhcCCcccHHHHHHHHHHHhhhh
Q psy9547 122 GVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALMDLGATV 197 (363)
Q Consensus 122 GIG~~tA~~il~~~~~~~~~~vD~~v~Rvl~Rl~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~l~~~G~~~ 197 (363)
|||.++--+-.+.-.|..+..+|.|-..+-..+-.+..++-. .+ ..+-+-+.-+..-+-+.++|..|+++++.+
T Consensus 21 GiG~~~~vA~~L~~~G~dV~~tDi~~~~a~~g~~~v~DDif~-P~-l~iY~~a~lIYSiRPP~El~~~il~lA~~v 94 (127)
T PF03686_consen 21 GIGFNPEVAKKLKERGFDVIATDINPRKAPEGVNFVVDDIFN-PN-LEIYEGADLIYSIRPPPELQPPILELAKKV 94 (127)
T ss_dssp T-TT--HHHHHHHHHS-EEEEE-SS-S----STTEE---SSS----HHHHTTEEEEEEES--TTSHHHHHHHHHHH
T ss_pred CcCCCHHHHHHHHHcCCcEEEEECcccccccCcceeeecccC-CC-HHHhcCCcEEEEeCCChHHhHHHHHHHHHh
Confidence 999988777777777889999999987554333333333322 11 122222222322234678999999999988
No 283
>PRK01002 nickel responsive regulator; Provisional
Probab=24.47 E-value=94 Score=25.99 Aligned_cols=30 Identities=23% Similarity=0.362 Sum_probs=24.0
Q ss_pred HHHHHHHhccCchHHHHHHHHHHHHhhHHh
Q psy9547 77 DNVLKMWEGLGYYSRVRNFQAGCRQVIEQF 106 (363)
Q Consensus 77 ~el~~~~~~lG~~~rA~~l~~~a~~i~~~~ 106 (363)
+++.++++..||.+|++.|+++.+..+.++
T Consensus 17 ~~lD~~~~~~g~~sRSe~Ir~air~~l~~~ 46 (141)
T PRK01002 17 GEFDEIIEERGYASRSEGIRDAIRDYIIKY 46 (141)
T ss_pred HHHHHHHHHcCCCCHHHHHHHHHHHHHHHh
Confidence 466677788999899999999988766643
No 284
>PF11417 Inhibitor_G39P: Loader and inhibitor of phage G40P; InterPro: IPR024424 G39P inhibits the initiation of DNA replication by blocking G40P replicative helicase. G39P has a bipartite stricture consisting of a folded N-terminal domain and an unfolded C-terminal domain. The C-terminal is essential for helicase interaction [].; PDB: 1NO1_B.
Probab=24.16 E-value=1.5e+02 Score=21.63 Aligned_cols=67 Identities=21% Similarity=0.313 Sum_probs=32.9
Q ss_pred cCHHHHHHHHHHHHHHHhhcCCCCCCCCCCChHHHHHHHHHhhhcchhhHHHHHHHHHH--hCC-CHHHHhc
Q psy9547 5 LSAKEILAFQESILTWYKQNARQLPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIK--TYP-TIKDFAF 73 (363)
Q Consensus 5 ~~~~~~~~~~~~l~~~~~~~~~~lpw~~~~dp~~~lv~~il~qqt~~~~v~~~~~~l~~--~~p-t~~~la~ 73 (363)
|+.++.-+++..+.+-|+.....+.-...++-..+|. -+++.-.-+.|.....+.+. .|| |+.+|.+
T Consensus 1 Mtk~E~~~ll~~I~~aYP~~~~~f~~~~~k~~v~~W~--~~L~d~~ye~v~~al~~~i~~~kfPPsiaeii~ 70 (71)
T PF11417_consen 1 MTKEETAKLLKLIKAAYPQWAGNFKPTDSKETVDLWY--DMLKDYDYEIVMKALKKHIATNKFPPSIAEIIK 70 (71)
T ss_dssp --HHHHHHHHHHHHHHST---TT---STHHHHHHHHH--HHHTTS-HHHHHHHHHHHHHH-SS---GGGG--
T ss_pred CCHHHHHHHHHHHHHHCCcchhccchhhHHHHHHHHH--HHHHhCCHHHHHHHHHHHHHhCCCCcCHHHHhh
Confidence 5678888999999999985322221111112233333 34466666666666666654 463 7766654
No 285
>PRK01172 ski2-like helicase; Provisional
Probab=23.76 E-value=1.6e+02 Score=31.17 Aligned_cols=38 Identities=13% Similarity=0.299 Sum_probs=26.1
Q ss_pred CCCHHHHhcCCHHHHHHHHhccCchH-HHHHHHHHHHHhhHH
Q psy9547 65 YPTIKDFAFDTEDNVLKMWEGLGYYS-RVRNFQAGCRQVIEQ 105 (363)
Q Consensus 65 ~pt~~~la~a~~~el~~~~~~lG~~~-rA~~l~~~a~~i~~~ 105 (363)
+-|+.+|+.++++++.++ +|+.. .|+.|.+-|+.+++-
T Consensus 633 ~~~~~di~~~~~~~~~~i---~~~~~~~~~~i~~~~~~~~~~ 671 (674)
T PRK01172 633 FKTVDDIARSSPERIKKI---YGFSDTLANAIVNRAMKISSM 671 (674)
T ss_pred CCCHHHHHhCCHHHHHHH---hccCHHHHHHHHHHHHHHHHH
Confidence 457777777777777665 45654 677777777776654
No 286
>PRK02362 ski2-like helicase; Provisional
Probab=23.50 E-value=1.1e+02 Score=32.84 Aligned_cols=38 Identities=26% Similarity=0.331 Sum_probs=27.8
Q ss_pred HHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHH
Q psy9547 92 VRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALAS 133 (363)
Q Consensus 92 A~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~ 133 (363)
+..+.++++.+.. | ++.+.-.|++|||||+.+|.....
T Consensus 633 ~~~~~~l~~~l~~---g-v~~~~~~L~~ip~i~~~~a~~l~~ 670 (737)
T PRK02362 633 ARAARELEKRVEY---G-VREELLDLVGLRGVGRVRARRLYN 670 (737)
T ss_pred HHHHHHHHHHHHh---C-CCHHHHHHhCCCCCCHHHHHHHHH
Confidence 4555666666654 3 667788899999999998876554
No 287
>PF05166 YcgL: YcgL domain; InterPro: IPR007840 This family of proteins formerly called DUF709 includes the Escherichia coli gene ycgL. Homologues of YcgL are found in gammaproteobacteria. The structure of this protein shows a novel alpha/beta/alpha sandwich structure []. The proteins in this entry are functionally uncharacterised.; PDB: 2H7A_A.
Probab=23.37 E-value=50 Score=24.49 Aligned_cols=22 Identities=27% Similarity=0.474 Sum_probs=16.3
Q ss_pred HHHHhcCCHHHHHHHHhccCch
Q psy9547 68 IKDFAFDTEDNVLKMWEGLGYY 89 (363)
Q Consensus 68 ~~~la~a~~~el~~~~~~lG~~ 89 (363)
-..||.++.++|.+.|+.-|||
T Consensus 47 ~r~La~~d~~~V~~~l~~~Gfy 68 (74)
T PF05166_consen 47 ERKLARADAEKVLAALEEQGFY 68 (74)
T ss_dssp ----SSS-HHHHHHHHHHTSEE
T ss_pred CceeccCCHHHHHHHHHhCCEE
Confidence 4569999999999999999997
No 288
>PF14475 Mso1_Sec1_bdg: Sec1-binding region of Mso1
Probab=23.30 E-value=92 Score=20.27 Aligned_cols=21 Identities=14% Similarity=0.434 Sum_probs=16.9
Q ss_pred HHHHHHHHHHHHHhhcCCCCC
Q psy9547 9 EILAFQESILTWYKQNARQLP 29 (363)
Q Consensus 9 ~~~~~~~~l~~~~~~~~~~lp 29 (363)
+--.+...|.++|...++.+|
T Consensus 18 ddT~v~r~l~~yY~~k~~~~P 38 (41)
T PF14475_consen 18 DDTHVHRVLRKYYTEKGRPFP 38 (41)
T ss_pred chhHHHHHHHHHHHHcCCCCC
Confidence 334588899999999988876
No 289
>PF02467 Whib: Transcription factor WhiB; InterPro: IPR003482 WhiB is a putative transcription factor in Actinobacteria, required for differentiation and sporulation. The process of mycelium formation in Streptomyces, which occurs in response to nutrient limitation, is controlled by a number of whi genes, named for the white colour of aerial hyphae when mutations occur in these genes. The normal colour is grey. The exact role of WhiB is not clear, but a mutation in the gene results in white, tightly coiled aerial hyphae.
Probab=23.17 E-value=21 Score=25.76 Aligned_cols=16 Identities=31% Similarity=0.820 Sum_probs=13.1
Q ss_pred CCCCCCCcccChhhhc
Q psy9547 204 KCKECPLSRFCSAYKN 219 (363)
Q Consensus 204 ~C~~Cpl~~~C~~~~~ 219 (363)
.|..||++..|..+..
T Consensus 31 iC~~CPVr~~C~~~Al 46 (66)
T PF02467_consen 31 ICARCPVRAECLEYAL 46 (66)
T ss_pred HHcCChhHHHHHHHHH
Confidence 3567999999999864
No 290
>PRK14670 uvrC excinuclease ABC subunit C; Provisional
Probab=23.14 E-value=1.3e+02 Score=31.38 Aligned_cols=23 Identities=4% Similarity=0.031 Sum_probs=18.5
Q ss_pred ccHHHHhcCCCCcHHHHHHHHHH
Q psy9547 112 RDKKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 112 ~~~~~L~~lpGIG~~tA~~il~~ 134 (363)
.+.++|.++||||.++|..|..+
T Consensus 543 As~eeL~~v~gi~~~~A~~I~~~ 565 (574)
T PRK14670 543 LNEDEIAEKMKINIKMAKKIKKF 565 (574)
T ss_pred CCHHHHHhCCCCCHHHHHHHHHH
Confidence 46888888888888888888765
No 291
>KOG2841|consensus
Probab=22.75 E-value=86 Score=28.65 Aligned_cols=24 Identities=21% Similarity=0.248 Sum_probs=20.0
Q ss_pred cccHHHHhcCCCCcHHHHHHHHHH
Q psy9547 111 PRDKKQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 111 p~~~~~L~~lpGIG~~tA~~il~~ 134 (363)
-.++++|.++||+|+-.|.-+..|
T Consensus 223 ~AS~~ele~~~G~G~~kak~l~~~ 246 (254)
T KOG2841|consen 223 NASEGELEQCPGLGPAKAKRLHKF 246 (254)
T ss_pred hcCHhHHHhCcCcCHHHHHHHHHH
Confidence 357899999999999999877665
No 292
>PF14579 HHH_6: Helix-hairpin-helix motif; PDB: 2HPM_A 2HPI_A 3E0D_A 3F2C_A 3F2B_A 3F2D_A 2HQA_A 2HNH_A.
Probab=22.74 E-value=66 Score=24.41 Aligned_cols=26 Identities=31% Similarity=0.365 Sum_probs=18.4
Q ss_pred HHHHhcCCCCcHHHHHHHHHHhcCCc
Q psy9547 114 KKQLLSIKGVGDYTAGALASICYNIP 139 (363)
Q Consensus 114 ~~~L~~lpGIG~~tA~~il~~~~~~~ 139 (363)
+--|..|+|||..+|..|+.---+.|
T Consensus 26 r~gl~~Ikglg~~~a~~I~~~R~~g~ 51 (90)
T PF14579_consen 26 RLGLSAIKGLGEEVAEKIVEERENGP 51 (90)
T ss_dssp E-BGGGSTTS-HHHHHHHHHHHHCSS
T ss_pred eehHhhcCCCCHHHHHHHHHhHhcCC
Confidence 44588999999999998887543334
No 293
>PRK14670 uvrC excinuclease ABC subunit C; Provisional
Probab=21.83 E-value=1.3e+02 Score=31.36 Aligned_cols=36 Identities=19% Similarity=0.239 Sum_probs=27.8
Q ss_pred HHHHhCCCHHHHhcCCHHHHHHHHhccCchH-HHHHHHHH
Q psy9547 60 KFIKTYPTIKDFAFDTEDNVLKMWEGLGYYS-RVRNFQAG 98 (363)
Q Consensus 60 ~l~~~~pt~~~la~a~~~el~~~~~~lG~~~-rA~~l~~~ 98 (363)
.|.+.|.|.+.|.+|+.+||.++ -|... -|+.|.+.
T Consensus 529 ~LL~~Fgs~~~I~~As~eeL~~v---~gi~~~~A~~I~~~ 565 (574)
T PRK14670 529 KILKSLGTYKDILLLNEDEIAEK---MKINIKMAKKIKKF 565 (574)
T ss_pred HHHHHhCCHHHHHhCCHHHHHhC---CCCCHHHHHHHHHH
Confidence 35667999999999999999774 47765 57777554
No 294
>PF08823 PG_binding_2: Putative peptidoglycan binding domain; InterPro: IPR014927 This entry may be a peptidoglycan binding domain.
Probab=21.79 E-value=70 Score=23.61 Aligned_cols=25 Identities=28% Similarity=0.462 Sum_probs=18.6
Q ss_pred CCHHHHhcCCHH---HHHHHHhccCchH
Q psy9547 66 PTIKDFAFDTED---NVLKMWEGLGYYS 90 (363)
Q Consensus 66 pt~~~la~a~~~---el~~~~~~lG~~~ 90 (363)
|.|+++..++.+ +|...|+.+|||+
T Consensus 5 ~~p~~~~~l~~~~~~evq~~L~~lGyy~ 32 (74)
T PF08823_consen 5 PRPEELLPLDGDVAREVQEALKRLGYYK 32 (74)
T ss_pred CCchhhccCcHHHHHHHHHHHHHcCCcc
Confidence 567777766654 7888899999964
No 295
>PRK01151 rps17E 30S ribosomal protein S17e; Validated
Probab=21.73 E-value=71 Score=22.48 Aligned_cols=22 Identities=9% Similarity=0.276 Sum_probs=17.6
Q ss_pred HHHHHHHHHHHhhHHhCCCCcc
Q psy9547 91 RVRNFQAGCRQVIEQFGGEVPR 112 (363)
Q Consensus 91 rA~~l~~~a~~i~~~~~g~~p~ 112 (363)
|.++|+++|+.|+++|.+.+-.
T Consensus 4 r~k~IKr~a~~lieky~~~ft~ 25 (58)
T PRK01151 4 RPKYIKRTAEELLEKYPDLFTT 25 (58)
T ss_pred chHHHHHHHHHHHHHhHHHhcc
Confidence 6789999999999998655443
No 296
>PHA00439 exonuclease
Probab=21.14 E-value=56 Score=30.85 Aligned_cols=19 Identities=32% Similarity=0.242 Sum_probs=14.9
Q ss_pred HHHhcCCCCcHHHHHHHHHH
Q psy9547 115 KQLLSIKGVGDYTAGALASI 134 (363)
Q Consensus 115 ~~L~~lpGIG~~tA~~il~~ 134 (363)
+.+-.+|||| +||..+|.-
T Consensus 188 DNIPGVpGIG-KTA~kLL~~ 206 (286)
T PHA00439 188 DGYSGIPGWG-DTAEAFLEN 206 (286)
T ss_pred cCCCCCCCcC-HHHHHHHhC
Confidence 3345689999 999998865
No 297
>PF03755 YicC_N: YicC-like family, N-terminal region ; InterPro: IPR013527 Proteins in this entry are homologues of YicC (P23839 from SWISSPROT) from Escherichia coli. Although it is relatively poorly characterised YicC has been shown to be important for cells in the stationary phase, and essential for growth at high temperatures []. This domain is found at the N-terminal region of these proteins.
Probab=21.11 E-value=1.4e+02 Score=25.24 Aligned_cols=35 Identities=20% Similarity=0.339 Sum_probs=27.0
Q ss_pred HHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcH
Q psy9547 91 RVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGD 125 (363)
Q Consensus 91 rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~ 125 (363)
.++.+.+..+.+.+.++-..+-+...|+++|||=.
T Consensus 83 l~~~y~~~l~~l~~~~~~~~~~~~~~ll~~p~v~~ 117 (159)
T PF03755_consen 83 LAKAYYEALKELAEELGLAGPISLDDLLRLPGVLK 117 (159)
T ss_pred HHHHHHHHHHHHHHHcCCCCCCCHHHHHcCCCccc
Confidence 56677777788888776656778999999999743
No 298
>smart00540 LEM in nuclear membrane-associated proteins. LEM, domain in nuclear membrane-associated proteins, including lamino-associated polypeptide 2 and emerin.
Probab=20.78 E-value=1.8e+02 Score=19.19 Aligned_cols=33 Identities=9% Similarity=-0.069 Sum_probs=25.2
Q ss_pred HHhcCCHHHHHHHHhccCch-------HHHHHHHHHHHHh
Q psy9547 70 DFAFDTEDNVLKMWEGLGYY-------SRVRNFQAGCRQV 102 (363)
Q Consensus 70 ~la~a~~~el~~~~~~lG~~-------~rA~~l~~~a~~i 102 (363)
++..++.+||.+.+...|+. .|.-+++++.+.+
T Consensus 2 d~~~LSd~eL~~~L~~~G~~~gPIt~sTR~vy~kkL~~~~ 41 (44)
T smart00540 2 DVDRLSDAELRAELKQYGLPPGPITDTTRKLYEKKLRKLR 41 (44)
T ss_pred chhHcCHHHHHHHHHHcCCCCCCcCcchHHHHHHHHHHHH
Confidence 45678889999999988862 3777887777665
No 299
>PF14443 DBC1: DBC1
Probab=20.66 E-value=3.9e+02 Score=21.93 Aligned_cols=46 Identities=13% Similarity=0.126 Sum_probs=27.1
Q ss_pred eEEEeecCC-CccccccccccccCC-CC-CchHHH-----HHHHhhCCCcccccc
Q psy9547 250 KYLIQKRPT-TGLLSNFYMFLSFES-DT-PYTSQV-----EFLSENLPFKVNLNE 296 (363)
Q Consensus 250 ~vLl~~R~~-~~~~~glWEFPggk~-e~-~~~~al-----rEl~Eelgl~v~~~~ 296 (363)
+||+.+|.. -.-.+|-|. |.-.. .+ ...++| |-.++-+|++.....
T Consensus 9 kFlv~~k~ke~~aiGG~Ws-psLDG~DP~~dp~~LI~TAiR~~K~~tgiDLS~Ct 62 (126)
T PF14443_consen 9 KFLVGKKDKEIMAIGGPWS-PSLDGGDPSSDPSVLIRTAIRTCKALTGIDLSNCT 62 (126)
T ss_pred eeEEeecCceEEecCCcCC-cccCCCCCCCCcHHHHHHHHHHHHHHhccchhhcC
Confidence 466666533 245578888 65222 22 222333 889998999876554
No 300
>PF02889 Sec63: Sec63 Brl domain; InterPro: IPR004179 This domain was named after the yeast Sec63 (or NPL1) (also known as the Brl domain) protein in which it was found. This protein is required for assembly of functional endoplasmic reticulum translocons [, ]. Other yeast proteins containing this domain include pre-mRNA splicing helicase BRR2, HFM1 protein and putative helicases. ; PDB: 3IM2_A 3IM1_A 3HIB_A 2Q0Z_X.
Probab=20.49 E-value=1.8e+02 Score=27.19 Aligned_cols=44 Identities=16% Similarity=0.217 Sum_probs=30.1
Q ss_pred cCchHHHHHHHHHHHHhhHHhCCCCcccHHHHhcCCCCcHHHHHHHHH
Q psy9547 86 LGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALAS 133 (363)
Q Consensus 86 lG~~~rA~~l~~~a~~i~~~~~g~~p~~~~~L~~lpGIG~~tA~~il~ 133 (363)
.|+.+=+..+.++++.|.. +.. .+...|++|||||+..+..+..
T Consensus 123 ~~~~~~~~~~l~l~q~i~q---~~w-~~~~~L~Qlp~i~~~~~~~l~~ 166 (314)
T PF02889_consen 123 KGWLSTALNALELSQCIVQ---ALW-DSDSPLLQLPHIGEESLKKLEK 166 (314)
T ss_dssp TTBCCHHHHHHHHHHHHHH---TS--TTS-GGGGSTT--HHHHHHHHH
T ss_pred cccHHHHHHHHHHHHHHHh---hcC-CCCChhhcCCCCCHHHHHHHhc
Confidence 3444456788888888887 444 4668899999999999887665
Done!