Query psy11980
Match_columns 831
No_of_seqs 895 out of 4839
Neff 8.7
Searched_HMMs 46136
Date Fri Aug 16 15:34:08 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11980.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11980hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1074|consensus 100.0 6E-30 1.3E-34 283.1 7.6 243 131-385 605-935 (958)
2 KOG3710|consensus 99.9 1.6E-26 3.6E-31 219.7 12.1 184 633-824 36-253 (280)
3 KOG2462|consensus 99.9 2.4E-27 5.3E-32 234.3 5.7 135 242-379 128-265 (279)
4 KOG3608|consensus 99.9 1.8E-25 3.9E-30 224.8 13.4 225 131-384 134-379 (467)
5 KOG2462|consensus 99.9 3.2E-25 6.9E-30 219.4 6.0 137 191-351 129-265 (279)
6 KOG1074|consensus 99.9 1.8E-25 4E-30 247.7 3.4 210 244-472 605-926 (958)
7 KOG3608|consensus 99.9 5.1E-23 1.1E-27 207.2 10.6 265 73-368 70-397 (467)
8 KOG3623|consensus 99.9 5.4E-22 1.2E-26 216.1 9.1 78 301-378 894-971 (1007)
9 KOG3623|consensus 99.8 8.1E-22 1.8E-26 214.7 5.8 111 42-152 210-330 (1007)
10 KOG3576|consensus 99.6 9.1E-17 2E-21 150.6 2.1 118 269-387 114-242 (267)
11 KOG3576|consensus 99.6 1.8E-16 3.9E-21 148.6 1.7 124 297-451 113-239 (267)
12 KOG3710|consensus 99.6 1.5E-15 3.2E-20 145.4 3.9 44 788-831 143-186 (280)
13 COG3751 EGL-9 Predicted prolin 99.3 5.5E-13 1.2E-17 134.1 1.2 196 590-807 33-237 (252)
14 PLN03086 PRLI-interacting fact 99.3 1.1E-11 2.3E-16 139.1 8.6 145 218-382 409-565 (567)
15 PLN03086 PRLI-interacting fact 99.1 3.8E-10 8.3E-15 126.7 9.3 140 193-351 408-562 (567)
16 PHA00733 hypothetical protein 99.0 2.3E-10 4.9E-15 105.4 5.0 83 299-383 38-125 (128)
17 PHA00733 hypothetical protein 98.9 2.2E-09 4.8E-14 98.8 5.4 55 298-354 70-124 (128)
18 KOG3993|consensus 98.8 1.3E-09 2.8E-14 113.9 0.6 59 330-388 431-489 (500)
19 PHA02768 hypothetical protein; 98.7 6.2E-09 1.3E-13 78.5 2.1 42 330-373 6-47 (55)
20 smart00702 P4Hc Prolyl 4-hydro 98.7 1.4E-08 3E-13 100.8 4.0 171 601-807 2-176 (178)
21 KOG3993|consensus 98.6 4E-09 8.7E-14 110.4 -0.6 192 104-297 267-483 (500)
22 PHA02768 hypothetical protein; 98.6 1.7E-08 3.6E-13 76.2 2.3 44 301-346 5-48 (55)
23 PRK05467 Fe(II)-dependent oxyg 98.5 1.4E-07 3E-12 95.3 4.8 170 603-808 3-176 (226)
24 PF13465 zf-H2C2_2: Zinc-finge 98.3 2.3E-07 4.9E-12 59.9 1.9 25 344-368 1-25 (26)
25 PF13465 zf-H2C2_2: Zinc-finge 98.2 4.3E-07 9.3E-12 58.6 1.5 26 316-341 1-26 (26)
26 PF13640 2OG-FeII_Oxy_3: 2OG-F 98.2 1.4E-06 3.1E-11 77.4 4.7 67 737-807 22-98 (100)
27 PHA00616 hypothetical protein 98.0 2.1E-06 4.6E-11 61.7 1.4 33 329-361 1-33 (44)
28 PHA00732 hypothetical protein 98.0 4.2E-06 9.1E-11 69.8 3.0 45 301-351 1-46 (79)
29 PHA00616 hypothetical protein 98.0 3.2E-06 6.8E-11 60.8 1.3 34 301-334 1-34 (44)
30 COG3751 EGL-9 Predicted prolin 97.9 2.8E-05 6E-10 79.0 8.6 144 639-829 32-177 (252)
31 PHA00732 hypothetical protein 97.9 9.2E-06 2E-10 67.8 3.4 46 329-380 1-47 (79)
32 KOG3844|consensus 97.8 8.4E-05 1.8E-09 78.2 9.6 70 736-807 137-214 (476)
33 smart00702 P4Hc Prolyl 4-hydro 97.7 0.00014 3.1E-09 72.0 8.9 117 651-824 2-121 (178)
34 PF05605 zf-Di19: Drought indu 97.6 7.7E-05 1.7E-09 57.7 4.5 51 329-382 2-54 (54)
35 PF05605 zf-Di19: Drought indu 97.5 8.7E-05 1.9E-09 57.4 3.6 36 273-309 3-39 (54)
36 KOG1146|consensus 97.3 0.00038 8.3E-09 83.9 7.2 110 43-153 437-611 (1406)
37 PRK05467 Fe(II)-dependent oxyg 97.2 0.0012 2.6E-08 67.0 8.1 39 788-827 80-126 (226)
38 PF00096 zf-C2H2: Zinc finger, 97.1 0.00027 5.8E-09 44.1 1.7 22 358-379 1-22 (23)
39 PF12756 zf-C2H2_2: C2H2 type 97.1 0.0003 6.5E-09 62.2 2.3 74 303-381 1-74 (100)
40 PF13661 2OG-FeII_Oxy_4: 2OG-F 97.1 0.00066 1.4E-08 55.7 4.0 42 787-829 10-54 (70)
41 PF00096 zf-C2H2: Zinc finger, 97.1 0.00024 5.1E-09 44.4 1.0 22 330-351 1-22 (23)
42 PF13894 zf-C2H2_4: C2H2-type 97.0 0.00039 8.5E-09 43.7 2.0 24 358-381 1-24 (24)
43 PF12756 zf-C2H2_2: C2H2 type 96.9 0.00039 8.4E-09 61.5 1.9 71 275-351 2-72 (100)
44 COG5189 SFP1 Putative transcri 96.9 0.00024 5.3E-09 72.3 0.5 70 299-378 347-419 (423)
45 KOG2231|consensus 96.7 0.0018 3.9E-08 74.4 5.4 103 133-239 117-235 (669)
46 COG5189 SFP1 Putative transcri 96.7 0.00047 1E-08 70.3 0.6 71 269-349 346-418 (423)
47 PF13912 zf-C2H2_6: C2H2-type 96.7 0.00075 1.6E-08 44.0 1.3 25 357-381 1-25 (27)
48 KOG2231|consensus 96.6 0.002 4.3E-08 74.1 4.8 107 106-223 117-245 (669)
49 PF13894 zf-C2H2_4: C2H2-type 96.6 0.0012 2.7E-08 41.3 1.7 23 330-352 1-23 (24)
50 PLN00052 prolyl 4-hydroxylase; 96.5 0.0073 1.6E-07 64.5 7.8 147 650-808 54-250 (310)
51 PF13912 zf-C2H2_6: C2H2-type 96.3 0.0017 3.6E-08 42.3 1.2 24 329-352 1-24 (27)
52 KOG1146|consensus 96.3 0.002 4.4E-08 77.9 2.7 101 275-384 439-546 (1406)
53 PF09237 GAGA: GAGA factor; I 96.1 0.0037 8E-08 46.1 2.0 35 295-329 18-52 (54)
54 PF09237 GAGA: GAGA factor; I 95.9 0.0051 1.1E-07 45.3 2.1 32 354-385 21-52 (54)
55 PF13909 zf-H2C2_5: C2H2-type 95.8 0.0062 1.3E-07 38.3 2.0 24 358-382 1-24 (24)
56 smart00355 ZnF_C2H2 zinc finge 95.4 0.013 2.8E-07 37.2 2.4 20 359-378 2-21 (26)
57 PRK04860 hypothetical protein; 95.3 0.0076 1.6E-07 57.8 1.4 36 301-340 119-154 (160)
58 COG5236 Uncharacterized conser 95.3 0.025 5.4E-07 58.7 5.1 127 245-382 152-306 (493)
59 smart00355 ZnF_C2H2 zinc finge 95.2 0.015 3.2E-07 36.9 2.2 23 330-352 1-23 (26)
60 PF13640 2OG-FeII_Oxy_3: 2OG-F 95.1 0.024 5.2E-07 50.1 3.8 34 792-826 3-37 (100)
61 PRK04860 hypothetical protein; 94.8 0.016 3.5E-07 55.6 2.0 39 329-371 119-157 (160)
62 COG5048 FOG: Zn-finger [Genera 94.3 0.02 4.3E-07 65.1 1.9 55 42-98 33-89 (467)
63 PF13909 zf-H2C2_5: C2H2-type 94.3 0.023 5E-07 35.7 1.3 22 330-352 1-22 (24)
64 KOG2785|consensus 93.8 0.13 2.8E-06 54.9 6.4 74 244-323 166-242 (390)
65 KOG2785|consensus 93.8 0.11 2.4E-06 55.4 5.8 53 41-93 2-89 (390)
66 COG5048 FOG: Zn-finger [Genera 93.6 0.023 5E-07 64.6 0.6 139 243-382 288-443 (467)
67 PF12874 zf-met: Zinc-finger o 93.5 0.041 8.8E-07 34.9 1.3 22 358-379 1-22 (25)
68 PF12874 zf-met: Zinc-finger o 93.5 0.034 7.3E-07 35.3 0.9 22 330-351 1-22 (25)
69 KOG2482|consensus 93.3 0.2 4.2E-06 52.4 6.6 23 104-126 195-217 (423)
70 COG5236 Uncharacterized conser 92.3 0.14 3E-06 53.4 3.8 135 43-185 152-306 (493)
71 KOG2482|consensus 92.3 0.14 3E-06 53.5 3.8 22 330-351 335-356 (423)
72 PF12171 zf-C2H2_jaz: Zinc-fin 91.3 0.079 1.7E-06 34.4 0.6 21 302-322 2-22 (27)
73 PF12171 zf-C2H2_jaz: Zinc-fin 90.6 0.12 2.6E-06 33.5 0.9 22 43-64 2-23 (27)
74 COG3128 PiuC Uncharacterized i 89.7 0.57 1.2E-05 45.0 5.0 55 746-808 125-179 (229)
75 KOG2893|consensus 88.8 0.13 2.7E-06 50.8 0.0 50 332-385 13-62 (341)
76 KOG2893|consensus 88.6 0.15 3.2E-06 50.3 0.3 42 74-123 12-53 (341)
77 PLN02997 flavonol synthase 88.2 7.7 0.00017 42.2 13.4 56 766-830 229-286 (325)
78 KOG4173|consensus 87.6 0.3 6.5E-06 47.3 1.7 69 78-153 87-168 (253)
79 TIGR00622 ssl1 transcription f 87.3 0.91 2E-05 40.4 4.4 83 272-364 15-110 (112)
80 TIGR02408 ectoine_ThpD ectoine 87.0 6.5 0.00014 41.8 11.7 43 635-677 13-55 (277)
81 PLN02704 flavonol synthase 87.0 17 0.00036 39.9 15.2 65 754-830 236-302 (335)
82 PF13913 zf-C2HC_2: zinc-finge 86.9 0.52 1.1E-05 29.9 1.9 19 359-378 4-22 (25)
83 PLN02912 oxidoreductase, 2OG-F 86.0 19 0.00041 39.7 14.9 64 755-830 235-300 (348)
84 PF13913 zf-C2HC_2: zinc-finge 85.2 0.74 1.6E-05 29.2 2.0 19 331-350 4-22 (25)
85 PF12013 DUF3505: Protein of u 84.7 1.3 2.7E-05 39.9 4.1 72 300-382 10-109 (109)
86 KOG4173|consensus 84.2 0.47 1E-05 46.0 1.2 89 272-382 79-171 (253)
87 smart00451 ZnF_U1 U1-like zinc 84.2 0.74 1.6E-05 31.7 1.9 23 357-379 3-25 (35)
88 cd00350 rubredoxin_like Rubred 82.9 0.73 1.6E-05 31.5 1.4 10 356-365 16-25 (33)
89 COG4049 Uncharacterized protei 82.2 0.64 1.4E-05 35.0 1.0 32 352-383 12-43 (65)
90 PLN02216 protein SRG1 81.9 44 0.00096 36.9 15.7 57 765-830 256-314 (357)
91 PLN02639 oxidoreductase, 2OG-F 81.8 23 0.0005 38.8 13.4 63 756-830 230-294 (337)
92 TIGR00622 ssl1 transcription f 81.5 2.5 5.4E-05 37.7 4.5 48 161-215 57-104 (112)
93 PF01753 zf-MYND: MYND finger; 80.8 0.17 3.7E-06 35.6 -2.4 28 428-455 10-37 (37)
94 cd00350 rubredoxin_like Rubred 80.5 0.93 2E-05 31.0 1.2 10 328-337 16-25 (33)
95 smart00451 ZnF_U1 U1-like zinc 80.1 1.3 2.7E-05 30.5 1.8 23 329-351 3-25 (35)
96 PLN02904 oxidoreductase 80.0 42 0.0009 37.1 14.7 64 756-830 246-311 (357)
97 TIGR02466 conserved hypothetic 79.9 3.3 7.1E-05 41.6 5.4 24 767-792 161-184 (201)
98 PF12013 DUF3505: Protein of u 77.7 2.9 6.3E-05 37.5 3.9 72 272-354 11-109 (109)
99 PLN02984 oxidoreductase, 2OG-F 77.5 4 8.6E-05 44.8 5.7 66 752-830 236-304 (341)
100 PLN00052 prolyl 4-hydroxylase; 76.2 1.8 3.9E-05 46.5 2.5 34 790-824 134-174 (310)
101 PLN02758 oxidoreductase, 2OG-F 75.2 87 0.0019 34.7 15.5 56 766-830 259-316 (361)
102 PF13759 2OG-FeII_Oxy_5: Putat 73.8 8 0.00017 34.0 5.7 27 765-793 63-89 (101)
103 PF04959 ARS2: Arsenite-resist 71.0 2.3 5E-05 42.9 1.7 35 355-389 75-109 (214)
104 cd00729 rubredoxin_SM Rubredox 69.3 2.7 5.9E-05 28.9 1.2 10 356-365 17-26 (34)
105 PTZ00273 oxidase reductase; Pr 68.9 5.2 0.00011 43.5 4.0 64 756-830 217-281 (320)
106 PF09986 DUF2225: Uncharacteri 68.8 1.7 3.7E-05 44.2 0.2 15 272-286 5-19 (214)
107 PRK04023 DNA polymerase II lar 68.7 4.3 9.4E-05 49.2 3.5 53 39-143 623-675 (1121)
108 PLN03002 oxidoreductase, 2OG-F 67.4 6.2 0.00013 43.2 4.2 72 749-830 216-290 (332)
109 PF09986 DUF2225: Uncharacteri 67.3 1.3 2.9E-05 45.0 -0.9 13 358-370 49-61 (214)
110 PLN02254 gibberellin 3-beta-di 67.3 5.9 0.00013 43.8 4.1 66 754-830 247-314 (358)
111 PLN03001 oxidoreductase, 2OG-F 66.0 7.1 0.00015 41.1 4.2 75 740-830 143-219 (262)
112 PF03171 2OG-FeII_Oxy: 2OG-Fe( 64.5 9.6 0.00021 33.1 4.2 62 739-824 27-98 (98)
113 PF12851 Tet_JBP: Oxygenase do 64.4 14 0.0003 36.2 5.6 58 751-822 111-169 (171)
114 COG4049 Uncharacterized protei 62.9 2.8 6E-05 31.8 0.3 28 269-296 14-41 (65)
115 PLN02947 oxidoreductase 62.7 14 0.0003 41.1 5.9 65 754-830 262-328 (374)
116 PF10571 UPF0547: Uncharacteri 62.6 4.7 0.0001 25.9 1.3 10 359-368 16-25 (26)
117 PLN03178 leucoanthocyanidin di 62.5 8.6 0.00019 42.6 4.2 64 754-829 248-313 (360)
118 PF02892 zf-BED: BED zinc fing 62.4 5 0.00011 29.4 1.6 26 356-381 15-44 (45)
119 PLN02515 naringenin,2-oxogluta 62.0 14 0.00031 40.8 5.8 66 755-830 233-300 (358)
120 PLN02276 gibberellin 20-oxidas 61.9 7.8 0.00017 42.9 3.8 63 756-830 245-309 (361)
121 PLN02485 oxidoreductase 60.8 9.5 0.00021 41.7 4.1 66 754-830 226-293 (329)
122 PF09538 FYDLN_acid: Protein o 60.7 5.5 0.00012 35.5 1.8 30 330-370 10-39 (108)
123 PLN00417 oxidoreductase, 2OG-F 60.2 11 0.00024 41.5 4.5 56 766-830 250-307 (348)
124 KOG1591|consensus 59.4 45 0.00096 35.5 8.6 37 790-827 178-225 (289)
125 COG1592 Rubrerythrin [Energy p 59.1 5.7 0.00012 38.3 1.7 11 354-364 146-156 (166)
126 PLN02156 gibberellin 2-beta-di 59.1 13 0.00028 40.7 4.7 64 756-830 219-284 (335)
127 PRK00464 nrdR transcriptional 58.7 2.8 6.2E-05 39.9 -0.3 12 358-369 29-40 (154)
128 COG1198 PriA Primosomal protei 58.6 6.3 0.00014 47.3 2.4 41 301-366 444-484 (730)
129 KOG2071|consensus 58.1 6.2 0.00013 45.2 2.1 24 425-448 511-534 (579)
130 PLN02393 leucoanthocyanidin di 56.4 17 0.00038 40.2 5.3 56 766-830 260-317 (362)
131 PF09538 FYDLN_acid: Protein o 55.2 6.6 0.00014 35.1 1.4 15 300-314 25-39 (108)
132 KOG2186|consensus 54.2 8.2 0.00018 39.2 2.0 45 105-151 4-48 (276)
133 PF09723 Zn-ribbon_8: Zinc rib 53.7 7.8 0.00017 28.1 1.3 30 427-460 5-34 (42)
134 PRK04023 DNA polymerase II lar 53.0 12 0.00026 45.6 3.5 49 272-368 626-674 (1121)
135 TIGR02098 MJ0042_CXXC MJ0042 f 52.7 7.1 0.00015 27.4 1.0 10 274-283 4-13 (38)
136 COG3491 PcbC Isopenicillin N s 52.4 2.3E+02 0.0049 30.4 12.2 71 750-830 196-278 (322)
137 KOG2186|consensus 52.4 7.6 0.00017 39.4 1.5 50 330-382 4-53 (276)
138 PLN02299 1-aminocyclopropane-1 52.1 15 0.00033 39.9 3.9 63 756-830 198-262 (321)
139 PLN02365 2-oxoglutarate-depend 51.5 18 0.00038 39.0 4.2 67 754-830 187-255 (300)
140 PRK00464 nrdR transcriptional 51.4 4.3 9.3E-05 38.8 -0.4 14 330-343 29-42 (154)
141 smart00659 RPOLCX RNA polymera 51.0 9.2 0.0002 28.1 1.3 11 330-340 3-13 (44)
142 smart00614 ZnF_BED BED zinc fi 50.7 11 0.00025 28.3 1.9 23 359-381 20-47 (50)
143 PF05443 ROS_MUCR: ROS/MUCR tr 50.6 12 0.00026 34.6 2.4 33 358-393 73-105 (132)
144 COG2888 Predicted Zn-ribbon RN 49.9 9.9 0.00022 29.5 1.4 8 329-336 50-57 (61)
145 COG1592 Rubrerythrin [Energy p 49.3 8.9 0.00019 36.9 1.4 13 296-308 144-156 (166)
146 PHA00626 hypothetical protein 49.2 7 0.00015 29.8 0.5 10 302-311 24-33 (59)
147 TIGR00373 conserved hypothetic 49.0 14 0.0003 35.6 2.6 33 325-366 105-137 (158)
148 smart00531 TFIIE Transcription 48.6 13 0.00028 35.4 2.3 14 328-341 98-111 (147)
149 PRK09678 DNA-binding transcrip 48.0 5.3 0.00012 32.7 -0.3 40 330-371 2-43 (72)
150 PF06524 NOA36: NOA36 protein; 46.6 7.1 0.00015 39.7 0.2 25 355-379 207-231 (314)
151 PF13717 zinc_ribbon_4: zinc-r 46.0 12 0.00026 26.1 1.3 12 274-285 4-15 (36)
152 TIGR02300 FYDLN_acid conserved 46.0 14 0.00031 33.6 2.0 32 330-372 10-41 (129)
153 TIGR00373 conserved hypothetic 45.3 12 0.00027 36.0 1.6 34 239-281 104-137 (158)
154 smart00834 CxxC_CXXC_SSSS Puta 44.2 6.7 0.00015 27.9 -0.3 11 330-340 6-16 (41)
155 TIGR02605 CxxC_CxxC_SSSS putat 43.5 7.1 0.00015 29.6 -0.2 11 330-340 6-16 (52)
156 PRK06266 transcription initiat 43.3 17 0.00038 35.7 2.4 36 325-369 113-148 (178)
157 smart00531 TFIIE Transcription 41.8 26 0.00057 33.3 3.3 14 103-116 98-111 (147)
158 PLN02403 aminocyclopropanecarb 41.1 38 0.00081 36.5 4.8 63 756-830 193-258 (303)
159 PRK14890 putative Zn-ribbon RN 41.0 15 0.00033 28.6 1.2 8 329-336 48-55 (59)
160 smart00614 ZnF_BED BED zinc fi 41.0 19 0.00042 27.1 1.8 20 331-350 20-44 (50)
161 PLN02750 oxidoreductase, 2OG-F 39.8 41 0.0009 37.0 5.0 67 754-830 230-298 (345)
162 KOG0717|consensus 39.7 20 0.00042 39.9 2.3 32 358-389 293-325 (508)
163 PRK06266 transcription initiat 39.5 17 0.00037 35.8 1.7 33 241-282 114-146 (178)
164 PRK09678 DNA-binding transcrip 39.5 8.9 0.00019 31.4 -0.3 9 274-282 3-11 (72)
165 PF13824 zf-Mss51: Zinc-finger 37.9 4.4 9.4E-05 31.1 -2.1 31 425-455 12-42 (55)
166 TIGR02300 FYDLN_acid conserved 37.7 25 0.00053 32.1 2.2 30 302-342 10-39 (129)
167 PRK00398 rpoP DNA-directed RNA 36.9 20 0.00043 26.5 1.3 28 427-461 3-30 (46)
168 COG1997 RPL43A Ribosomal prote 36.7 18 0.0004 30.5 1.2 13 357-369 53-65 (89)
169 PRK14714 DNA polymerase II lar 36.7 30 0.00065 43.5 3.4 39 303-367 681-719 (1337)
170 smart00734 ZnF_Rad18 Rad18-lik 36.4 27 0.00059 22.4 1.7 19 359-378 3-21 (26)
171 COG5151 SSL1 RNA polymerase II 35.8 19 0.00041 37.5 1.4 24 329-352 388-411 (421)
172 PF02176 zf-TRAF: TRAF-type zi 34.9 21 0.00046 27.8 1.3 21 314-334 23-43 (60)
173 PF05443 ROS_MUCR: ROS/MUCR tr 34.9 19 0.00042 33.3 1.2 25 43-70 73-97 (132)
174 PRK14873 primosome assembly pr 33.9 24 0.00052 42.4 2.0 44 275-364 386-429 (665)
175 KOG2807|consensus 33.4 39 0.00085 35.8 3.2 31 328-364 344-374 (378)
176 COG1996 RPC10 DNA-directed RNA 33.0 22 0.00048 26.7 1.0 30 425-461 4-33 (49)
177 PF03604 DNA_RNApol_7kD: DNA d 32.4 21 0.00047 24.2 0.8 9 357-365 17-25 (32)
178 TIGR00595 priA primosomal prot 31.9 31 0.00067 40.1 2.5 45 275-364 216-260 (505)
179 COG5151 SSL1 RNA polymerase II 31.4 41 0.00088 35.2 2.9 85 243-336 321-417 (421)
180 PF15269 zf-C2H2_7: Zinc-finge 31.4 30 0.00065 25.1 1.4 22 358-379 21-42 (54)
181 KOG4167|consensus 30.6 21 0.00047 41.7 0.9 24 73-96 793-816 (907)
182 COG1198 PriA Primosomal protei 30.5 35 0.00077 41.1 2.7 47 427-482 435-484 (730)
183 PF05721 PhyH: Phytanoyl-CoA d 29.3 43 0.00094 33.0 2.8 27 648-674 2-28 (211)
184 TIGR02408 ectoine_ThpD ectoine 29.2 9.8 0.00021 40.4 -2.0 32 593-624 21-52 (277)
185 PF15269 zf-C2H2_7: Zinc-finge 29.2 40 0.00087 24.5 1.7 25 69-93 17-41 (54)
186 COG5188 PRP9 Splicing factor 3 29.2 39 0.00085 36.0 2.4 27 39-65 235-261 (470)
187 PRK14714 DNA polymerase II lar 29.0 43 0.00093 42.2 3.1 54 245-341 668-721 (1337)
188 PF02176 zf-TRAF: TRAF-type zi 28.9 22 0.00049 27.6 0.5 40 328-368 8-53 (60)
189 PF14353 CpXC: CpXC protein 28.8 45 0.00097 30.7 2.6 12 132-143 39-50 (128)
190 PF04959 ARS2: Arsenite-resist 28.5 21 0.00046 36.0 0.3 27 43-69 78-104 (214)
191 PF13661 2OG-FeII_Oxy_4: 2OG-F 27.9 63 0.0014 26.2 3.0 24 738-761 38-64 (70)
192 COG4957 Predicted transcriptio 27.8 44 0.00096 30.6 2.1 31 358-391 77-107 (148)
193 PF14446 Prok-RING_1: Prokaryo 27.5 24 0.00052 27.1 0.4 28 43-84 6-33 (54)
194 KOG4124|consensus 27.4 16 0.00035 38.8 -0.8 70 270-349 347-418 (442)
195 KOG2807|consensus 27.1 72 0.0016 33.9 3.8 18 159-177 290-307 (378)
196 KOG2593|consensus 27.0 56 0.0012 36.2 3.2 40 324-366 123-162 (436)
197 KOG4124|consensus 26.6 22 0.00048 37.8 0.1 50 129-180 176-232 (442)
198 KOG2461|consensus 26.6 45 0.00097 37.3 2.4 70 295-364 325-394 (396)
199 PF12760 Zn_Tnp_IS1595: Transp 25.8 50 0.0011 24.3 1.8 11 355-365 35-45 (46)
200 TIGR03826 YvyF flagellar opero 25.2 72 0.0016 29.9 3.1 14 425-438 79-92 (137)
201 smart00440 ZnF_C2C2 C2C2 Zinc 25.1 21 0.00045 25.6 -0.3 10 73-82 29-38 (40)
202 PF05721 PhyH: Phytanoyl-CoA d 25.1 17 0.00036 36.0 -1.1 25 599-623 3-27 (211)
203 PF09845 DUF2072: Zn-ribbon co 24.9 43 0.00093 30.8 1.6 15 329-343 1-15 (131)
204 PRK05580 primosome assembly pr 24.4 47 0.001 40.2 2.3 47 273-364 382-428 (679)
205 PF06524 NOA36: NOA36 protein; 24.3 51 0.0011 33.8 2.1 24 270-293 207-230 (314)
206 PF08274 PhnA_Zn_Ribbon: PhnA 24.0 30 0.00065 23.1 0.3 10 356-365 18-27 (30)
207 COG5188 PRP9 Splicing factor 3 24.0 38 0.00082 36.1 1.2 20 302-321 239-258 (470)
208 PF13240 zinc_ribbon_2: zinc-r 24.0 40 0.00086 21.0 0.8 6 360-365 16-21 (23)
209 KOG4167|consensus 23.9 21 0.00046 41.7 -0.7 24 105-128 793-816 (907)
210 COG4530 Uncharacterized protei 23.6 42 0.0009 29.5 1.2 27 331-368 11-37 (129)
211 KOG2907|consensus 23.5 59 0.0013 28.9 2.0 46 37-84 69-114 (116)
212 PTZ00255 60S ribosomal protein 23.4 32 0.0007 29.5 0.4 12 357-368 54-65 (90)
213 TIGR01762 chlorin-enz chlorina 22.8 98 0.0021 33.1 4.1 34 640-673 4-37 (288)
214 KOG3290|consensus 22.5 38 0.00083 34.9 0.9 30 593-622 8-37 (286)
215 COG3357 Predicted transcriptio 21.9 50 0.0011 28.1 1.3 12 329-340 58-69 (97)
216 COG4957 Predicted transcriptio 21.5 61 0.0013 29.8 1.8 25 43-70 77-101 (148)
217 PF12907 zf-met2: Zinc-binding 21.5 44 0.00094 24.0 0.7 12 287-298 19-30 (40)
218 TIGR00280 L37a ribosomal prote 21.5 32 0.0007 29.5 0.1 13 357-369 53-65 (91)
219 COG3091 SprT Zn-dependent meta 20.9 44 0.00096 31.5 0.9 35 425-461 115-149 (156)
220 PF09416 UPF1_Zn_bind: RNA hel 20.7 73 0.0016 30.2 2.3 17 295-311 8-24 (152)
221 KOG1280|consensus 20.3 1E+02 0.0022 33.1 3.4 64 296-377 74-139 (381)
222 COG1660 Predicted P-loop-conta 20.1 41 0.00089 34.9 0.5 57 594-651 18-80 (286)
No 1
>KOG1074|consensus
Probab=99.96 E-value=6e-30 Score=283.07 Aligned_cols=243 Identities=26% Similarity=0.479 Sum_probs=163.1
Q ss_pred ccccCcCCCcCCChHHHHHHHHHhcCCCccccccccCccccCchhHHHHHhhhcCcccCCCccccC---cccccccCchh
Q psy11980 131 LYECDICKNRFPSSGAMKKHRRKHTGERPYECKEVCGRVFNRKDNLREHLRAHAGETKRKKKFKCD---KCEKQFYGLTL 207 (831)
Q Consensus 131 ~~~C~~C~k~F~~~~~L~~H~~~h~~~~~~~C~~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~C~---~C~k~f~~~~~ 207 (831)
+.+|-+|.+.....+.|+.|.++|+||+||+|+ +||+.|.++.+|+.|+-.|.....-+.+|.|+ .|.+.|.+...
T Consensus 605 PNqCiiC~rVlSC~saLqmHyrtHtGERPFkCK-iCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~ 683 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHYRTHTGERPFKCK-ICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVT 683 (958)
T ss_pred ccceeeeeecccchhhhhhhhhcccCcCccccc-cccchhccccchhhcccccccCccccccccCCchhhhccccccccc
Confidence 578999999999999999999999999999998 99999999999999999998877767889999 99999999999
Q ss_pred hhhhhhhcccccccccccccchHHhhhhcccCCCCCccccccchhhhcChhhHHhhhhcc----------------cccC
Q psy11980 208 LRIHERLHGLIERTCYARFAAKETLNRHMKTHSGVKPHVCEYCEKAFIQASQLKAHMFHH----------------TGEN 271 (831)
Q Consensus 208 L~~H~~~h~~~c~~C~~~f~~~~~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~H----------------~~~k 271 (831)
|..|+++|..--.--+-.-. . ...-.-+|..|.+.|.....+..++..| +.+.
T Consensus 684 lpQhIriH~~~~~s~g~~a~--e---------~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~ 752 (958)
T KOG1074|consen 684 LPQHIRIHLGGQISNGGTAA--E---------GILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEEL 752 (958)
T ss_pred ccceEEeecCCCCCCCcccc--c---------ccchhcccchhhhcccccccchhhhhccCCcccCCccccccccccccc
Confidence 99999998621000000000 0 0000123444444444444444333322 2222
Q ss_pred ----CccCcccccccccchHHHhhhh----------------------hcccCCCCc-cCcccccccCChhHHH----HH
Q psy11980 272 ----GHTCELCNKTFNRKARLELHMK----------------------YIHEGADPY-QCDICRKTFIRKEDLT----RH 320 (831)
Q Consensus 272 ----~~~C~~C~k~F~~~~~L~~H~~----------------------~~H~~~k~~-~C~~C~k~F~~~~~L~----~H 320 (831)
+..+..|+..+.....+..+-. ..++.+++. .+..|+..-...-... .-
T Consensus 753 ~~tp~~~e~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~ 832 (958)
T KOG1074|consen 753 DVTPPPPENSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQ 832 (958)
T ss_pred ccCCCccccccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhcc
Confidence 3445555555544433332211 123445555 5555554332221100 00
Q ss_pred H--------------hhhcC------------------------CCCccCCCCCCccCChhHHHHHhhhccCCCCccccc
Q psy11980 321 C--------------LLHTG------------------------VKPHKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEV 362 (831)
Q Consensus 321 ~--------------~~H~~------------------------~k~~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~ 362 (831)
. .++.+ ...+.|..|++.|...+.|..|+++|+++|||.|.+
T Consensus 833 ~~~~l~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~f 912 (958)
T KOG1074|consen 833 ETSMLNEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHF 912 (958)
T ss_pred cccccccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchh
Confidence 0 00000 012789999999999999999999999999999999
Q ss_pred ccccccChhHHHHHHHHhccCch
Q psy11980 363 CGRAFIRQDCLIRHLRQKHRSHL 385 (831)
Q Consensus 363 C~~~F~~~~~L~~H~r~~H~~~~ 385 (831)
|++.|..+.+|+.||.+|+...+
T Consensus 913 C~~aFttrgnLKvHMgtH~w~q~ 935 (958)
T KOG1074|consen 913 CEEAFTTRGNLKVHMGTHMWVQP 935 (958)
T ss_pred hhhhhhhhhhhhhhhccccccCC
Confidence 99999999999999999887543
No 2
>KOG3710|consensus
Probab=99.94 E-value=1.6e-26 Score=219.69 Aligned_cols=184 Identities=43% Similarity=0.705 Sum_probs=158.9
Q ss_pred cchhhhhHHHHHHhhhcCCeEEEcCCCChhHHHHHHHHHHhhhhCCCccCCeeecCCcCCcccCccccCceeEEecCCcc
Q psy11980 633 DRVHSDMIENVIRDMDQYGLCVVDNFLGPERGMAVLHEVLGMYHSGVFKDGQLVSNKSTNKQDLKTIRGDQITWIDGRET 712 (831)
Q Consensus 633 ~~~~~~~~~~~~~~l~~~g~~~id~~l~~~~~~~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~r~d~~~~~~~~~~ 712 (831)
....+.+.+.|+..+.+.||+|+|+||+.+....+..|+..++..|.|+.+++...... ..+.||+|.|.|++|.+.
T Consensus 36 ~~~~~~~~e~ivp~mne~g~~vvd~flg~~~g~~v~~ev~~l~~~G~f~dgql~~~~~~---~~k~iRgd~i~wi~G~e~ 112 (280)
T KOG3710|consen 36 LLRLRLALEYIVPCMNEYGICVVDNFLGSETGKFILKEVEALYETGAFRDGQLVSPDAF---HSKDIRGDKITWVGGNEP 112 (280)
T ss_pred chHHHHHHHHhhhhhhhcceEEEechhhHHHHHHHHHHHHHHHhccCccCceeccCcCC---cchhhccCCceEecCCCC
Confidence 34556788999999999999999999999999999999999999999999999764333 344899999999999999
Q ss_pred cchhHHHHHHHHHHHHHHhhhcc------------------------------cCCcceeEEe----cCcccCCCceeEe
Q psy11980 713 YCSNIGRLISEVDAIIMRANRMV------------------------------NNGRMGDFVI----NGRTKQNGGLLRI 758 (831)
Q Consensus 713 ~~~~~~~~~~~~d~~~~~~n~~~------------------------------~~~R~~s~v~----~~W~~~~GGeL~l 758 (831)
.+..+..+...+|.+....|..+ -+||-+|+|+ +|..+.+||.|+|
T Consensus 113 gc~~i~~L~s~~d~~i~h~~~r~~~~~~gRtkAMVAcYPGNGtgYVrHVDNP~gDGRcITcIYYlNqNWD~kv~Gg~Lri 192 (280)
T KOG3710|consen 113 GCETIMLLPSPIDSVILHCNGRLGSYIIGRTKAMVACYPGNGTGYVRHVDNPHGDGRCITCIYYLNQNWDVKVHGGILRI 192 (280)
T ss_pred CccceeeecccchhhhhhhccccccccccceeEEEEEecCCCceeeEeccCCCCCceEEEEEEEcccCcceeeccceeEe
Confidence 88888888888888877765444 6789999998 6888999999999
Q ss_pred ccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecCCCCCCceEEEEEecCC
Q psy11980 759 FPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDNPNRDGRCITAIYYLNR 824 (831)
Q Consensus 759 y~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~~~~~~R~~t~~~Yln~ 824 (831)
||++.. .+++|+|+||||+|||||+.|||||+ |+..+||+.+||.++..+|.-...=|.+.
T Consensus 193 ~pe~~~-~~adieP~fdrLlffwSdrrnPhev~----Pa~~tryaitvwyfda~era~a~~k~~~~ 253 (280)
T KOG3710|consen 193 FPEGST-TFADIEPKFDRLLFFWSDRRNPHEVQ----PAYATRYAITVWYFDAKERAEAKGKGKLC 253 (280)
T ss_pred ccCCCC-cccccCcCCCeEEEEEecCCCccccc----cccccceEEEEEEeccccchhhhhccccc
Confidence 998853 68899999999999999999999999 66889999999999888876555555443
No 3
>KOG2462|consensus
Probab=99.94 E-value=2.4e-27 Score=234.33 Aligned_cols=135 Identities=32% Similarity=0.651 Sum_probs=127.9
Q ss_pred CCccccccchhhhcChhhHHhhhhcccc---cCCccCcccccccccchHHHhhhhhcccCCCCccCcccccccCChhHHH
Q psy11980 242 VKPHVCEYCEKAFIQASQLKAHMFHHTG---ENGHTCELCNKTFNRKARLELHMKYIHEGADPYQCDICRKTFIRKEDLT 318 (831)
Q Consensus 242 ~k~~~C~~C~k~f~~~~~L~~H~~~H~~---~k~~~C~~C~k~F~~~~~L~~H~~~~H~~~k~~~C~~C~k~F~~~~~L~ 318 (831)
...|+|+.||+.+.+.++|.+|.++|.. .+.+.|++|+|.|.+...|+.|++ .|+ -+++|.+|||.|.....|+
T Consensus 128 ~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHir-TH~--l~c~C~iCGKaFSRPWLLQ 204 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIR-THT--LPCECGICGKAFSRPWLLQ 204 (279)
T ss_pred CCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhh-ccC--CCcccccccccccchHHhh
Confidence 3459999999999999999999999964 577999999999999999999999 675 6899999999999999999
Q ss_pred HHHhhhcCCCCccCCCCCCccCChhHHHHHhhhccCCCCcccccccccccChhHHHHHHHH
Q psy11980 319 RHCLLHTGVKPHKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAFIRQDCLIRHLRQ 379 (831)
Q Consensus 319 ~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~F~~~~~L~~H~r~ 379 (831)
.|+|+|+|||||.|+.|+|+|..+++|+.||++|++.|+|+|..|+|+|..++.|.+|...
T Consensus 205 GHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 205 GHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred cccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence 9999999999999999999999999999999999999999999999999999999999754
No 4
>KOG3608|consensus
Probab=99.93 E-value=1.8e-25 Score=224.85 Aligned_cols=225 Identities=30% Similarity=0.568 Sum_probs=193.1
Q ss_pred ccccC--cCCCcCCChHHHHHHHHHhcC------------CCc-cccc-cccCccccCchhHHHHHhhhcCcccCCCccc
Q psy11980 131 LYECD--ICKNRFPSSGAMKKHRRKHTG------------ERP-YECK-EVCGRVFNRKDNLREHLRAHAGETKRKKKFK 194 (831)
Q Consensus 131 ~~~C~--~C~k~F~~~~~L~~H~~~h~~------------~~~-~~C~-~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~ 194 (831)
.|.|. .|+..|.+...|..|...|.. ++| +.|. ..|-+.|.++..|++|++.|+++ |...
T Consensus 134 ~f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~e----KvvA 209 (467)
T KOG3608|consen 134 NFRCGWEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNE----KVVA 209 (467)
T ss_pred hhccChhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCC----eEEe
Confidence 36674 488999998888888877642 112 3443 24888899999999999999888 8889
Q ss_pred cCcccccccCchhhhhhhhhcccccccccccccchHHhhhhcccCCCCCccccccchhhhcChhhHHhhhhcccccCCcc
Q psy11980 195 CDKCEKQFYGLTLLRIHERLHGLIERTCYARFAAKETLNRHMKTHSGVKPHVCEYCEKAFIQASQLKAHMFHHTGENGHT 274 (831)
Q Consensus 195 C~~C~k~f~~~~~L~~H~~~h~~~c~~C~~~f~~~~~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~H~~~k~~~ 274 (831)
|+.||..|.+...|-.|.+..+ -....+|.|..|.|.|.+...|..|+..|- .-|+
T Consensus 210 Cp~Cg~~F~~~tkl~DH~rRqt----------------------~l~~n~fqC~~C~KrFaTeklL~~Hv~rHv--n~yk 265 (467)
T KOG3608|consen 210 CPHCGELFRTKTKLFDHLRRQT----------------------ELNTNSFQCAQCFKRFATEKLLKSHVVRHV--NCYK 265 (467)
T ss_pred cchHHHHhccccHHHHHHHhhh----------------------hhcCCchHHHHHHHHHhHHHHHHHHHHHhh--hccc
Confidence 9999999988888888876321 112348999999999999999999999986 5699
Q ss_pred CcccccccccchHHHhhhhhcccCCCCccCcccccccCChhHHHHHHhhhcCCCCccCCC--CCCccCChhHHHHHhhhc
Q psy11980 275 CELCNKTFNRKARLELHMKYIHEGADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPI--CTKGFTMKSSLKIHLLTH 352 (831)
Q Consensus 275 C~~C~k~F~~~~~L~~H~~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~--C~k~F~~~~~L~~H~~~H 352 (831)
|+.|..+....++|..|++..|...+||+|+.|.+.|.+.+.|.+|...|. +..|.|.. |..+|.+...|++|++.+
T Consensus 266 CplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~ev 344 (467)
T KOG3608|consen 266 CPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEV 344 (467)
T ss_pred ccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHHHHHHHHHHHh
Confidence 999999999999999999999999999999999999999999999999998 67799998 999999999999998876
Q ss_pred c-C--CCCcccccccccccChhHHHHHHHHhccCc
Q psy11980 353 T-K--EPPKSCEVCGRAFIRQDCLIRHLRQKHRSH 384 (831)
Q Consensus 353 ~-~--~~py~C~~C~~~F~~~~~L~~H~r~~H~~~ 384 (831)
+ | +.+|.|..|++.|++-.+|.+|+++.|+-+
T Consensus 345 hEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~ 379 (467)
T KOG3608|consen 345 HEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGFR 379 (467)
T ss_pred ccCCCCCceeeecchhhhccchhHHHHHHHhhccc
Confidence 5 4 567999999999999999999999999854
No 5
>KOG2462|consensus
Probab=99.91 E-value=3.2e-25 Score=219.36 Aligned_cols=137 Identities=34% Similarity=0.708 Sum_probs=124.7
Q ss_pred CccccCcccccccCchhhhhhhhhcccccccccccccchHHhhhhcccCCCCCccccccchhhhcChhhHHhhhhccccc
Q psy11980 191 KKFKCDKCEKQFYGLTLLRIHERLHGLIERTCYARFAAKETLNRHMKTHSGVKPHVCEYCEKAFIQASQLKAHMFHHTGE 270 (831)
Q Consensus 191 ~~~~C~~C~k~f~~~~~L~~H~~~h~~~c~~C~~~f~~~~~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~H~~~ 270 (831)
..|+|+.||+.+.+.+.|.+|...|-.. ...+.+.|++|||.|.+...|+.|+++|+
T Consensus 129 ~r~~c~eCgk~ysT~snLsrHkQ~H~~~---------------------~s~ka~~C~~C~K~YvSmpALkMHirTH~-- 185 (279)
T KOG2462|consen 129 PRYKCPECGKSYSTSSNLSRHKQTHRSL---------------------DSKKAFSCKYCGKVYVSMPALKMHIRTHT-- 185 (279)
T ss_pred Cceeccccccccccccccchhhcccccc---------------------cccccccCCCCCceeeehHHHhhHhhccC--
Confidence 4688999999988888888887766321 12567899999999999999999999997
Q ss_pred CCccCcccccccccchHHHhhhhhcccCCCCccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHHHHhh
Q psy11980 271 NGHTCELCNKTFNRKARLELHMKYIHEGADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLKIHLL 350 (831)
Q Consensus 271 k~~~C~~C~k~F~~~~~L~~H~~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~ 350 (831)
-+++|.+|||.|....-|+.|+| .|+|||||.|+.|+++|.++++|+.||++|.+.|+|+|..|+|+|..++.|.+|..
T Consensus 186 l~c~C~iCGKaFSRPWLLQGHiR-THTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~E 264 (279)
T KOG2462|consen 186 LPCECGICGKAFSRPWLLQGHIR-THTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSE 264 (279)
T ss_pred CCcccccccccccchHHhhcccc-cccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhh
Confidence 68999999999999999999999 89999999999999999999999999999999999999999999999999999976
Q ss_pred h
Q psy11980 351 T 351 (831)
Q Consensus 351 ~ 351 (831)
.
T Consensus 265 S 265 (279)
T KOG2462|consen 265 S 265 (279)
T ss_pred h
Confidence 4
No 6
>KOG1074|consensus
Probab=99.91 E-value=1.8e-25 Score=247.75 Aligned_cols=210 Identities=24% Similarity=0.458 Sum_probs=154.8
Q ss_pred ccccccchhhhcChhhHHhhhhcccccCCccCcccccccccchHHHhhhhhcccCC----CCccCc---ccccccCChhH
Q psy11980 244 PHVCEYCEKAFIQASQLKAHMFHHTGENGHTCELCNKTFNRKARLELHMKYIHEGA----DPYQCD---ICRKTFIRKED 316 (831)
Q Consensus 244 ~~~C~~C~k~f~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~~H~~~----k~~~C~---~C~k~F~~~~~ 316 (831)
|-+|-+|-+..+-++.|+.|.++|+||+||+|.+||++|.++.+|+.|+- .|... -+|.|+ +|-+.|.+.-.
T Consensus 605 PNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~-vHka~p~~R~q~ScP~~~ic~~kftn~V~ 683 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMS-VHKAKPPARVQFSCPSTFICQKKFTNAVT 683 (958)
T ss_pred ccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhccc-ccccCccccccccCCchhhhccccccccc
Confidence 56899999999999999999999999999999999999999999999997 66543 358899 99999999999
Q ss_pred HHHHHhhhcCC-CC------------ccCCCCCCccCChhHHHHHhhhcc----------------CCC----Ccccccc
Q psy11980 317 LTRHCLLHTGV-KP------------HKCPICTKGFTMKSSLKIHLLTHT----------------KEP----PKSCEVC 363 (831)
Q Consensus 317 L~~H~~~H~~~-k~------------~~C~~C~k~F~~~~~L~~H~~~H~----------------~~~----py~C~~C 363 (831)
|..|+++|.+. .+ -+|..|.+.|.....+..|+.-|. .+. +..|..|
T Consensus 684 lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~~ 763 (958)
T KOG1074|consen 684 LPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPENSC 763 (958)
T ss_pred ccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCcccccc
Confidence 99999999842 11 469999999988888888876662 222 5678888
Q ss_pred cccccChhHHHHHHHHh--c---------------------cCchh----------------------------------
Q psy11980 364 GRAFIRQDCLIRHLRQK--H---------------------RSHLD---------------------------------- 386 (831)
Q Consensus 364 ~~~F~~~~~L~~H~r~~--H---------------------~~~~~---------------------------------- 386 (831)
+..+.....+..+--.+ + .++..
T Consensus 764 ~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg~~t 843 (958)
T KOG1074|consen 764 GRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEGLAT 843 (958)
T ss_pred ccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhccccccccccccc
Confidence 88888776665543222 1 01100
Q ss_pred ------------HHHhhcchhHHHHHHHHHHhhhhcccccCCCccCccccccccccCcCCcccccCC---ccccccCCCc
Q psy11980 387 ------------EVLAQGDKKKLEAQLLAAAATELTSVLEDGEEEGDEGEENLLRCSKCKVTFYCSK---QHQTLDWKAH 451 (831)
Q Consensus 387 ------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~ee~~~~C~~C~~~f~~~~---~h~~~h~~~~ 451 (831)
++..+......... ..... .-.+...|..||+.|..+. .|+|.|+++|
T Consensus 844 ~~n~~t~~~~~~sv~qs~~~p~l~p~-----------l~~~~------pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~K 906 (958)
T KOG1074|consen 844 KTNEITPEGPADSVIQSGGVPTLEPS-----------LGRPG------PVNNAHVCNVCGKQFSSSAALEIHMRTHTGPK 906 (958)
T ss_pred ccccccCCCcchhhhhhccccccCCC-----------CCCCC------cccchhhhccchhcccchHHHHHhhhcCCCCC
Confidence 01111111111100 00000 1133478999999999885 9999999999
Q ss_pred ccccccCccCcccCCccccCC
Q psy11980 452 KINCNILSNQSATNNTSLANN 472 (831)
Q Consensus 452 ~~~c~~c~~~~~~~~~~~~~~ 472 (831)
||.|..|+++ |....+|-.|
T Consensus 907 PF~C~fC~~a-FttrgnLKvH 926 (958)
T KOG1074|consen 907 PFFCHFCEEA-FTTRGNLKVH 926 (958)
T ss_pred Cccchhhhhh-hhhhhhhhhh
Confidence 9999999987 7777777666
No 7
>KOG3608|consensus
Probab=99.88 E-value=5.1e-23 Score=207.24 Aligned_cols=265 Identities=26% Similarity=0.500 Sum_probs=208.7
Q ss_pred ccc--cccCcccCC-hhHHHHHHHHhcCCC----------------C--------------CCcccccC--ccccccCCh
Q psy11980 73 TYI--KVCGRVFNR-KDNLREHLRAHAGET----------------K--------------RKKKFKCD--KCEKQFYGL 117 (831)
Q Consensus 73 ~~C--~~C~k~f~~-~~~L~~H~~~h~~~~----------------~--------------~~k~~~C~--~C~k~f~~~ 117 (831)
+.| ..|++...+ ...|.+|.-.|.-.. . ....|.|. .|+..|.+.
T Consensus 70 ~qC~w~~C~f~~~~~s~~l~RHvy~H~y~~~l~q~G~~al~~~~dig~c~~~f~~~~~ip~~g~~f~C~WedCe~~F~s~ 149 (467)
T KOG3608|consen 70 HQCTWNSCDFRTENSSADLIRHVYFHCYHTKLKQQGKLALDLHPDIGACTAPFRLMEKIPALGQNFRCGWEDCEREFVSI 149 (467)
T ss_pred eeEEeccCCccccchHHHHHhhhhhhhhHHHHHHHHHHHHhcCCCcCcccCCcchhhccccchhhhccChhhcCCcccCH
Confidence 445 468887766 588999986653110 0 11346774 599999999
Q ss_pred hHHHHHHHHhc-------------C-CccccCc--CCCcCCChHHHHHHHHHhcCCCccccccccCccccCchhHHHHHh
Q psy11980 118 TLLRIHERLHA-------------E-KLYECDI--CKNRFPSSGAMKKHRRKHTGERPYECKEVCGRVFNRKDNLREHLR 181 (831)
Q Consensus 118 ~~L~~H~~~H~-------------~-k~~~C~~--C~k~F~~~~~L~~H~~~h~~~~~~~C~~~C~~~f~~~~~L~~H~~ 181 (831)
..|..|+..|. + ..+.|.. |-+.|.++..|++|++.|++++...|+ .||..|.++..|..|++
T Consensus 150 ~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp-~Cg~~F~~~tkl~DH~r 228 (467)
T KOG3608|consen 150 VEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACP-HCGELFRTKTKLFDHLR 228 (467)
T ss_pred HHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecc-hHHHHhccccHHHHHHH
Confidence 99999998774 1 1366765 999999999999999999999999999 89999999999999998
Q ss_pred hhcCcccCCCccccCcccccccCchhhhhhhhhcccccccccccccchHHhhhhcccCCCCCccccccchhhhcChhhHH
Q psy11980 182 AHAGETKRKKKFKCDKCEKQFYGLTLLRIHERLHGLIERTCYARFAAKETLNRHMKTHSGVKPHVCEYCEKAFIQASQLK 261 (831)
Q Consensus 182 ~h~~~~~~~~~~~C~~C~k~f~~~~~L~~H~~~h~~~c~~C~~~f~~~~~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~ 261 (831)
.-+... ..+|.|..|.|.|.++..|..|+..|- .-|+|+.|+.+.+..+.|.
T Consensus 229 Rqt~l~--~n~fqC~~C~KrFaTeklL~~Hv~rHv--------------------------n~ykCplCdmtc~~~ssL~ 280 (467)
T KOG3608|consen 229 RQTELN--TNSFQCAQCFKRFATEKLLKSHVVRHV--------------------------NCYKCPLCDMTCSSASSLT 280 (467)
T ss_pred hhhhhc--CCchHHHHHHHHHhHHHHHHHHHHHhh--------------------------hcccccccccCCCChHHHH
Confidence 765442 268999999999988888877776442 3488999999999999999
Q ss_pred hhhh-cccccCCccCcccccccccchHHHhhhhhcccCCCCccCcc--cccccCChhHHHHHHhhhc-C--CCCccCCCC
Q psy11980 262 AHMF-HHTGENGHTCELCNKTFNRKARLELHMKYIHEGADPYQCDI--CRKTFIRKEDLTRHCLLHT-G--VKPHKCPIC 335 (831)
Q Consensus 262 ~H~~-~H~~~k~~~C~~C~k~F~~~~~L~~H~~~~H~~~k~~~C~~--C~k~F~~~~~L~~H~~~H~-~--~k~~~C~~C 335 (831)
.|++ .|...|||+|+.|.+.|.+.+.|.+|.. .|+ +..|.|.. |...|.+...+++|++.++ | +-+|.|..|
T Consensus 281 ~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~-~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~C 358 (467)
T KOG3608|consen 281 THIRYRHSKDKPFKCDECDTRCVRESDLAKHVQ-VHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCC 358 (467)
T ss_pred HHHHhhhccCCCccccchhhhhccHHHHHHHHH-hcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecc
Confidence 9987 4777889999999999999999999988 777 56799987 9999999999999988766 4 456999999
Q ss_pred CCccCChhHHHHHhhhccC------CCCccccccccccc
Q psy11980 336 TKGFTMKSSLKIHLLTHTK------EPPKSCEVCGRAFI 368 (831)
Q Consensus 336 ~k~F~~~~~L~~H~~~H~~------~~py~C~~C~~~F~ 368 (831)
++.|++-.+|..|++..++ -+.|.=.+|.-+|-
T Consensus 359 dr~ft~G~~L~~HL~kkH~f~~PsGh~RFtYk~~edG~m 397 (467)
T KOG3608|consen 359 DRFFTSGKSLSAHLMKKHGFRLPSGHKRFTYKVDEDGFM 397 (467)
T ss_pred hhhhccchhHHHHHHHhhcccCCCCCCceeeeeccCcee
Confidence 9999999999999765433 33344444544443
No 8
>KOG3623|consensus
Probab=99.86 E-value=5.4e-22 Score=216.10 Aligned_cols=78 Identities=35% Similarity=0.708 Sum_probs=76.4
Q ss_pred CccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHHHHhhhccCCCCcccccccccccChhHHHHHHH
Q psy11980 301 PYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAFIRQDCLIRHLR 378 (831)
Q Consensus 301 ~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~F~~~~~L~~H~r 378 (831)
.|.|+.|+|+|...+.|.+|.-.|+|.+||+|.+|.|+|..+-.|..|+|.|+|||||+|..|+|.|....+...||.
T Consensus 894 myaCDqCDK~FqKqSSLaRHKYEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHSGSYSQHMN 971 (1007)
T KOG3623|consen 894 MYACDQCDKAFQKQSSLARHKYEHSGQRPYQCIICKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHSGSYSQHMN 971 (1007)
T ss_pred cchHHHHHHHHHhhHHHHHhhhhhcCCCCcccchhhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccccchHhhhc
Confidence 489999999999999999999999999999999999999999999999999999999999999999999999999994
No 9
>KOG3623|consensus
Probab=99.84 E-value=8.1e-22 Score=214.71 Aligned_cols=111 Identities=29% Similarity=0.482 Sum_probs=98.1
Q ss_pred CcccCcccccccCchHHHHHhhhccccccccccccccCcccCChhHHHHHHHHhcCCC---------CCCcccccCcccc
Q psy11980 42 GHLCKGCYRIFTSQGYMLLSYKSKEEEKQICTYIKVCGRVFNRKDNLREHLRAHAGET---------KRKKKFKCDKCEK 112 (831)
Q Consensus 42 ~~~C~~C~~~F~~~~~l~~H~~~~~~~~~~~~~C~~C~k~f~~~~~L~~H~~~h~~~~---------~~~k~~~C~~C~k 112 (831)
...|++|.+.+.....|..|.+..+...+.-|.|..|..+|..+..|.+||.+|.... ...+.|+|.+|||
T Consensus 210 lltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgK 289 (1007)
T KOG3623|consen 210 LLTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECGK 289 (1007)
T ss_pred hhcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccch
Confidence 4679999999999999999998776655555899999999999999999999886432 1236799999999
Q ss_pred ccCChhHHHHHHHHhc-CCccccCcCCCcCCChHHHHHHHH
Q psy11980 113 QFYGLTLLRIHERLHA-EKLYECDICKNRFPSSGAMKKHRR 152 (831)
Q Consensus 113 ~f~~~~~L~~H~~~H~-~k~~~C~~C~k~F~~~~~L~~H~~ 152 (831)
.|..+..|+.|+|+|. +|||.|+.|+|.|.....+..||.
T Consensus 290 AFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 290 AFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred hhhhHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence 9999999999999997 999999999999999999999974
No 10
>KOG3576|consensus
Probab=99.62 E-value=9.1e-17 Score=150.60 Aligned_cols=118 Identities=34% Similarity=0.588 Sum_probs=102.9
Q ss_pred ccCCccCcccccccccchHHHhhhhhcccCCCCccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHHHH
Q psy11980 269 GENGHTCELCNKTFNRKARLELHMKYIHEGADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLKIH 348 (831)
Q Consensus 269 ~~k~~~C~~C~k~F~~~~~L~~H~~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H 348 (831)
+...|.|.+|+|.|.-...|.+|++ .|...+.|-|..||+.|...-.|++|+|+|+|.+||+|..|+|+|+++-+|..|
T Consensus 114 d~d~ftCrvCgK~F~lQRmlnrh~k-ch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcslesh 192 (267)
T KOG3576|consen 114 DQDSFTCRVCGKKFGLQRMLNRHLK-CHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESH 192 (267)
T ss_pred CCCeeeeehhhhhhhHHHHHHHHhh-hccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHH
Confidence 3456889999999999999999998 788888899999999999999999999999999999999999999999999999
Q ss_pred hhhccC-----------CCCcccccccccccChhHHHHHHHHhccCchhH
Q psy11980 349 LLTHTK-----------EPPKSCEVCGRAFIRQDCLIRHLRQKHRSHLDE 387 (831)
Q Consensus 349 ~~~H~~-----------~~py~C~~C~~~F~~~~~L~~H~r~~H~~~~~~ 387 (831)
++.-+| ++.|.|+.||++-.....+..|++.+|...+.-
T Consensus 193 l~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Spal 242 (267)
T KOG3576|consen 193 LKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSPAL 242 (267)
T ss_pred HHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCCHHH
Confidence 765443 467999999999999999999999999876543
No 11
>KOG3576|consensus
Probab=99.60 E-value=1.8e-16 Score=148.64 Aligned_cols=124 Identities=32% Similarity=0.541 Sum_probs=103.0
Q ss_pred cCCCCccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHHHHhhhccCCCCcccccccccccChhHHHHH
Q psy11980 297 EGADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAFIRQDCLIRH 376 (831)
Q Consensus 297 ~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~F~~~~~L~~H 376 (831)
.+...|.|.+|+|.|.-...|.+|++-|...|.|-|..|||.|...-.|++|+++|+|.+||+|..|+|+|+++-+|..|
T Consensus 113 sd~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcslesh 192 (267)
T KOG3576|consen 113 SDQDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESH 192 (267)
T ss_pred CCCCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHH
Confidence 34567999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHhccCchhHHHhhcchhHHHHHHHHHHhhhhcccccCCCccCccccccccccCcCCcccccCC---ccccccCCCc
Q psy11980 377 LRQKHRSHLDEVLAQGDKKKLEAQLLAAAATELTSVLEDGEEEGDEGEENLLRCSKCKVTFYCSK---QHQTLDWKAH 451 (831)
Q Consensus 377 ~r~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~ee~~~~C~~C~~~f~~~~---~h~~~h~~~~ 451 (831)
++..|+....... + +-.++.|.|+.||.+-.... .|...|.+..
T Consensus 193 l~kvhgv~~~yay------k-------------------------err~kl~vcedcg~t~~~~e~~~~h~~~~hp~S 239 (267)
T KOG3576|consen 193 LKKVHGVQHQYAY------K-------------------------ERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFS 239 (267)
T ss_pred HHHHcCchHHHHH------H-------------------------HhhhheeeecccCCCCCChhHHHHHHHhcCCCC
Confidence 9999985321110 0 00167899999998765442 5555555443
No 12
>KOG3710|consensus
Probab=99.56 E-value=1.5e-15 Score=145.38 Aligned_cols=44 Identities=70% Similarity=1.425 Sum_probs=41.7
Q ss_pred ccccccccCCCCceeEEeecCCCCCCceEEEEEecCCCcccccC
Q psy11980 788 HEAMVACYPGHGSHYVKHVDNPNRDGRCITAIYYLNRDWDIKVS 831 (831)
Q Consensus 788 HEVl~a~yp~~~~ry~~h~d~~~~~~R~~t~~~Yln~~w~~~~~ 831 (831)
-.+|.||||++|++||+|||||+.|||+||+|||||.|||++++
T Consensus 143 tkAMVAcYPGNGtgYVrHVDNP~gDGRcITcIYYlNqNWD~kv~ 186 (280)
T KOG3710|consen 143 TKAMVACYPGNGTGYVRHVDNPHGDGRCITCIYYLNQNWDVKVH 186 (280)
T ss_pred eeEEEEEecCCCceeeEeccCCCCCceEEEEEEEcccCcceeec
Confidence 35899999999999999999999999999999999999999874
No 13
>COG3751 EGL-9 Predicted proline hydroxylase [Posttranslational modification, protein turnover, chaperones]
Probab=99.29 E-value=5.5e-13 Score=134.14 Aligned_cols=196 Identities=16% Similarity=0.089 Sum_probs=127.9
Q ss_pred hhhccccCCccceeeecCCCCcccccccccceeeeCCCCCCCCcchhhhhHHHHHHhhhcCCeEEEcCCCChhHHHHHHH
Q psy11980 590 CHNVIRDMDQYGLCVLDDFLGPERDMHLDTNVVVVPPFHHTNNDRVHSDMIENVIRDMDQYGLCVVDNFLGPERGMAVLH 669 (831)
Q Consensus 590 ~~~i~~~l~~~g~~v~d~fl~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~id~~l~~~~~~~l~~ 669 (831)
..++|++..+.+|.|+|.|...+++..+..+...-.....+..+.+.|.....+...|..+-+-|.+.-..+..+..-+.
T Consensus 33 l~~~i~~~~~~~~~~~~~~~~~~li~r~~~~~~~e~i~~~~~~~~i~r~~~~~~~~~l~g~~~~~~~~~~~~~~~~~~~~ 112 (252)
T COG3751 33 LDREIDDSLDWNWFVIDEFSNLELITRQVREELGEEIAEAFKEAAILRGGQIQVFAFLSGDLKDELDPTRSPNLPVQRYL 112 (252)
T ss_pred HHhhhhhhccccccccccccchhhcCHHHHHHHHhhhhhhccccceeccccceeeeecchhhhhhhcccccccchHHHHH
Confidence 56779999999999999999966555444333322223345667777777777777777777777664333444444444
Q ss_pred HHHhhhhCCCccCCeeecCCcCCcccCccccCceeEEecCCcccchhHHHHHHHHHHHHHHhhhcccCCcceeEEe---c
Q psy11980 670 EVLGMYHSGVFKDGQLVSNKSTNKQDLKTIRGDQITWIDGRETYCSNIGRLISEVDAIIMRANRMVNNGRMGDFVI---N 746 (831)
Q Consensus 670 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~r~d~~~~~~~~~~~~~~~~~~~~~~d~~~~~~n~~~~~~R~~s~v~---~ 746 (831)
+..+..+.-.+....+ |.. .+.-++.....+..+.+|+.|.|.+..... |.+++++ +
T Consensus 113 ~~~es~r~~~~~~~~l--g~l------------~~~~ve~~~~~y~~G~~l~~H~D~~~~~~~------R~~~yv~y~~r 172 (252)
T COG3751 113 EFSESIRFILGLPQLL--GIL------------DVSEVEGQITVYNPGCFLLKHDDNGRDKDI------RLATYVYYLTR 172 (252)
T ss_pred HHHHHHHHHhchhhhc--Ccc------------ceeeeeeeeeEecCCceeEeecccCCCccc------eEEEEEeccCC
Confidence 4444433333333333 110 112222233334445789999999987555 9999998 7
Q ss_pred CcccCCCceeEeccCCC-----CCcceeecCCCCeEEEEecCCC-CCccccccccCCCCceeEEeec
Q psy11980 747 GRTKQNGGLLRIFPEGG-----GDKVADIEPMFDRILFFWSDRR-NPHEAMVACYPGHGSHYVKHVD 807 (831)
Q Consensus 747 ~W~~~~GGeL~ly~~~~-----~~~~~~i~P~~grlV~F~S~~~-~pHEVl~a~yp~~~~ry~~h~d 807 (831)
.|++++||+|+||+... .+...+|+|.+|+||+|.|+.. ++|+|.+- | ..+.|....=|
T Consensus 173 ~wkpe~GGeL~l~~s~~~~~~~~~~~~ti~P~fn~lv~F~s~~~Hs~h~V~~~-~-~~~~RlsV~GW 237 (252)
T COG3751 173 EWKPEYGGELRLFHSLQKNNTAADSFKTIAPVFNSLVFFKSRPSHSVHSVEEP-Y-AAADRLSVTGW 237 (252)
T ss_pred CCCcCCCCceeecccccccccccccccccCCCCceEEEEEecCCccceecccc-c-cccceEEEeeE
Confidence 99999999999998765 3677899999999999999986 34555532 2 23555555444
No 14
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.25 E-value=1.1e-11 Score=139.07 Aligned_cols=145 Identities=23% Similarity=0.450 Sum_probs=109.2
Q ss_pred ccccccccccchHHhhhhcccCCCCCcccccc--chhhhcChhhHHhhhhcccccCCccCcccccccccchHHHhhhhhc
Q psy11980 218 IERTCYARFAAKETLNRHMKTHSGVKPHVCEY--CEKAFIQASQLKAHMFHHTGENGHTCELCNKTFNRKARLELHMKYI 295 (831)
Q Consensus 218 ~c~~C~~~f~~~~~L~~H~~~h~~~k~~~C~~--C~k~f~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~~ 295 (831)
.|..|..... ...|..|...-. -..-.|+. |+..|. +..+..| +.|+.|++.|. ...|..|++..
T Consensus 409 ~C~NC~~~i~-l~~l~lHe~~C~-r~~V~Cp~~~Cg~v~~-r~el~~H---------~~C~~Cgk~f~-~s~LekH~~~~ 475 (567)
T PLN03086 409 ECRNCKHYIP-SRSIALHEAYCS-RHNVVCPHDGCGIVLR-VEEAKNH---------VHCEKCGQAFQ-QGEMEKHMKVF 475 (567)
T ss_pred ECCCCCCccc-hhHHHHHHhhCC-CcceeCCcccccceee-ccccccC---------ccCCCCCCccc-hHHHHHHHHhc
Confidence 3444444432 333445543221 12346874 999883 3344444 58999999996 67899999965
Q ss_pred ccCCCCccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccC----------ChhHHHHHhhhccCCCCcccccccc
Q psy11980 296 HEGADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFT----------MKSSLKIHLLTHTKEPPKSCEVCGR 365 (831)
Q Consensus 296 H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~----------~~~~L~~H~~~H~~~~py~C~~C~~ 365 (831)
| +++.|+ ||+.+ .+..|..|++.|.+++++.|++|++.|. ..+.|..|+.++ |.+++.|..||+
T Consensus 476 H---kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk 549 (567)
T PLN03086 476 H---EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGR 549 (567)
T ss_pred C---CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCC
Confidence 4 789999 99765 6799999999999999999999999995 245899999986 899999999999
Q ss_pred cccChhHHHHHHHHhcc
Q psy11980 366 AFIRQDCLIRHLRQKHR 382 (831)
Q Consensus 366 ~F~~~~~L~~H~r~~H~ 382 (831)
.|..+ .|..|+...|.
T Consensus 550 ~Vrlr-dm~~H~~~~h~ 565 (567)
T PLN03086 550 SVMLK-EMDIHQIAVHQ 565 (567)
T ss_pred eeeeh-hHHHHHHHhhc
Confidence 88865 58999988875
No 15
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.06 E-value=3.8e-10 Score=126.70 Aligned_cols=140 Identities=19% Similarity=0.409 Sum_probs=102.6
Q ss_pred cccCcccccccCchhhhhhhhhcc---cccc--cccccccchHHhhhhcccCCCCCccccccchhhhcChhhHHhhhhcc
Q psy11980 193 FKCDKCEKQFYGLTLLRIHERLHG---LIER--TCYARFAAKETLNRHMKTHSGVKPHVCEYCEKAFIQASQLKAHMFHH 267 (831)
Q Consensus 193 ~~C~~C~k~f~~~~~L~~H~~~h~---~~c~--~C~~~f~~~~~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~H 267 (831)
..|+.|..... ...|..|..... ..|+ .|+..|. +..+.. .+.|+.|++.|. ...|..|+..|
T Consensus 408 V~C~NC~~~i~-l~~l~lHe~~C~r~~V~Cp~~~Cg~v~~-r~el~~---------H~~C~~Cgk~f~-~s~LekH~~~~ 475 (567)
T PLN03086 408 VECRNCKHYIP-SRSIALHEAYCSRHNVVCPHDGCGIVLR-VEEAKN---------HVHCEKCGQAFQ-QGEMEKHMKVF 475 (567)
T ss_pred EECCCCCCccc-hhHHHHHHhhCCCcceeCCcccccceee-cccccc---------CccCCCCCCccc-hHHHHHHHHhc
Confidence 45666665543 233445543322 2344 3666652 222333 358999999996 67899999998
Q ss_pred cccCCccCcccccccccchHHHhhhhhcccCCCCccCcccccccCC----------hhHHHHHHhhhcCCCCccCCCCCC
Q psy11980 268 TGENGHTCELCNKTFNRKARLELHMKYIHEGADPYQCDICRKTFIR----------KEDLTRHCLLHTGVKPHKCPICTK 337 (831)
Q Consensus 268 ~~~k~~~C~~C~k~F~~~~~L~~H~~~~H~~~k~~~C~~C~k~F~~----------~~~L~~H~~~H~~~k~~~C~~C~k 337 (831)
+ +++.|+ |++.+ .+..|..|+. .|...+++.|..|++.|.. .+.|..|...+ |.+++.|..|++
T Consensus 476 H--kpv~Cp-Cg~~~-~R~~L~~H~~-thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk 549 (567)
T PLN03086 476 H--EPLQCP-CGVVL-EKEQMVQHQA-STCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGR 549 (567)
T ss_pred C--CCccCC-CCCCc-chhHHHhhhh-ccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCC
Confidence 6 789999 99765 6689999987 7899999999999999952 45899999885 999999999999
Q ss_pred ccCChhHHHHHhhh
Q psy11980 338 GFTMKSSLKIHLLT 351 (831)
Q Consensus 338 ~F~~~~~L~~H~~~ 351 (831)
.+..+ .|..|+..
T Consensus 550 ~Vrlr-dm~~H~~~ 562 (567)
T PLN03086 550 SVMLK-EMDIHQIA 562 (567)
T ss_pred eeeeh-hHHHHHHH
Confidence 98864 78888654
No 16
>PHA00733 hypothetical protein
Probab=99.03 E-value=2.3e-10 Score=105.37 Aligned_cols=83 Identities=28% Similarity=0.497 Sum_probs=63.6
Q ss_pred CCCccCcccccccCChhHHHHH--H---hhhcCCCCccCCCCCCccCChhHHHHHhhhccCCCCcccccccccccChhHH
Q psy11980 299 ADPYQCDICRKTFIRKEDLTRH--C---LLHTGVKPHKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAFIRQDCL 373 (831)
Q Consensus 299 ~k~~~C~~C~k~F~~~~~L~~H--~---~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~F~~~~~L 373 (831)
.+++.|.+|.+.|.....|..| + ..+++.+||.|+.|++.|.+...|..|++.| +.+|.|+.|++.|.....|
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL 115 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDST 115 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHH
Confidence 4456666666666555555444 1 1334578999999999999999999999877 4579999999999999999
Q ss_pred HHHHHHhccC
Q psy11980 374 IRHLRQKHRS 383 (831)
Q Consensus 374 ~~H~r~~H~~ 383 (831)
.+|+...|+.
T Consensus 116 ~~H~~~~h~~ 125 (128)
T PHA00733 116 LDHVCKKHNI 125 (128)
T ss_pred HHHHHHhcCc
Confidence 9999998874
No 17
>PHA00733 hypothetical protein
Probab=98.87 E-value=2.2e-09 Score=98.81 Aligned_cols=55 Identities=25% Similarity=0.540 Sum_probs=37.7
Q ss_pred CCCCccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHHHHhhhccC
Q psy11980 298 GADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLKIHLLTHTK 354 (831)
Q Consensus 298 ~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~ 354 (831)
+.+||.|+.|++.|.+...|..|++.| +.+|.|+.|++.|.....|..|+...++
T Consensus 70 ~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 70 AVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred CCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhcC
Confidence 356677777777777777777777655 3457777777777777777777665544
No 18
>KOG3993|consensus
Probab=98.78 E-value=1.3e-09 Score=113.92 Aligned_cols=59 Identities=22% Similarity=0.186 Sum_probs=45.2
Q ss_pred ccCCCCCCccCChhHHHHHhhhccCCCCcccccccccccChhHHHHHHHHhccCchhHH
Q psy11980 330 HKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAFIRQDCLIRHLRQKHRSHLDEV 388 (831)
Q Consensus 330 ~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~F~~~~~L~~H~r~~H~~~~~~~ 388 (831)
..|+.|+..+.++..--.|.+.-.-+.-|.|.+|.-+|.+...|.+|+...|......+
T Consensus 431 ~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hpse~rqv 489 (500)
T KOG3993|consen 431 ELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHPSELRQV 489 (500)
T ss_pred cCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcChHHhhhh
Confidence 46788887777776655555544456679999999999999999999999997554443
No 19
>PHA02768 hypothetical protein; Provisional
Probab=98.71 E-value=6.2e-09 Score=78.50 Aligned_cols=42 Identities=26% Similarity=0.490 Sum_probs=28.0
Q ss_pred ccCCCCCCccCChhHHHHHhhhccCCCCcccccccccccChhHH
Q psy11980 330 HKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAFIRQDCL 373 (831)
Q Consensus 330 ~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~F~~~~~L 373 (831)
|+|+.||+.|...++|..|+++|+ +||+|..|++.|.+.+.|
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence 566666666666666666666666 566666677666665554
No 20
>smart00702 P4Hc Prolyl 4-hydroxylase alpha subunit homologues. Mammalian enzymes catalyse hydroxylation of collagen, for example. Prokaryotic enzymes might catalyse hydroxylation of antibiotic peptides. These are 2-oxoglutarate-dependent dioxygenases, requiring 2-oxoglutarate and dioxygen as cosubstrates and ferrous iron as a cofactor.
Probab=98.67 E-value=1.4e-08 Score=100.77 Aligned_cols=171 Identities=20% Similarity=0.166 Sum_probs=106.5
Q ss_pred ceeeecCCCCcccccccccceeeeCCCCCCCCcchhhhhHH-HHHHhhhcCCeEEEcCCCChhHHHHHHHHHHhhhhCCC
Q psy11980 601 GLCVLDDFLGPERDMHLDTNVVVVPPFHHTNNDRVHSDMIE-NVIRDMDQYGLCVVDNFLGPERGMAVLHEVLGMYHSGV 679 (831)
Q Consensus 601 g~~v~d~fl~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~l~~~g~~~id~~l~~~~~~~l~~~~~~~~~~~~ 679 (831)
++.|++|||+++.+++|.+.+..... .+.+...... .....+|.+...|++.--....+..+...+..+.....
T Consensus 2 ~i~~~~~~ls~~ec~~li~~~~~~~~-----~~~~~~~~~~~~~~~~~R~~~~~~l~~~~~~~~~~~l~~~i~~~~~~~~ 76 (178)
T smart00702 2 GVVVFHDFLSPAECQKLLEEAEPLGW-----RGEVTRGDTNPNHDSKYRQSNGTWLELLKGDLVIERIRQRLADFLGLLR 76 (178)
T ss_pred cEEEECCCCCHHHHHHHHHHhhhhcc-----cceeecCCCCccccCCCEeecceecCCCCCCHHHHHHHHHHHHHHCCCc
Confidence 57899999999999999887753221 1222211111 13456788888998853224556666666665543321
Q ss_pred ccCCeeecCCcCCcccCccccCceeEEecCCcccchhHHHHHHHHHHHHHHhhhcccCCcceeEEe--cCcccCCCceeE
Q psy11980 680 FKDGQLVSNKSTNKQDLKTIRGDQITWIDGRETYCSNIGRLISEVDAIIMRANRMVNNGRMGDFVI--NGRTKQNGGLLR 757 (831)
Q Consensus 680 ~~~~~~~~g~~~~~~~~~~~r~d~~~~~~~~~~~~~~~~~~~~~~d~~~~~~n~~~~~~R~~s~v~--~~W~~~~GGeL~ 757 (831)
+ + ......+..+. |...++|..|.|.+..... ..|.+|+++ ++ +.+||+|+
T Consensus 77 ~----~------------~~~~~~~~~~~-----Y~~g~~~~~H~D~~~~~~~----~~r~~T~~~yLn~--~~~GG~~~ 129 (178)
T smart00702 77 G----L------------PLSAEDAQVAR-----YGPGGHYGPHVDNFEDDEN----GDRIATFLLYLND--VEEGGELV 129 (178)
T ss_pred h----h------------hccCcceEEEE-----ECCCCcccCcCCCCCCCCC----CCeEEEEEEEecc--CCcCceEE
Confidence 1 0 00001111111 1123567788887643221 248899988 63 78999999
Q ss_pred eccCCCCCcceeecCCCCeEEEEecCC-CCCccccccccCCCCceeEEeec
Q psy11980 758 IFPEGGGDKVADIEPMFDRILFFWSDR-RNPHEAMVACYPGHGSHYVKHVD 807 (831)
Q Consensus 758 ly~~~~~~~~~~i~P~~grlV~F~S~~-~~pHEVl~a~yp~~~~ry~~h~d 807 (831)
++..+. .....|.|..|++|+|.+.. .++|+|.|.. .|.||+.-.|
T Consensus 130 f~~~~~-~~~~~v~P~~G~~v~f~~~~~~~~H~v~pv~---~G~r~~~~~W 176 (178)
T smart00702 130 FPGLGL-MVCATVKPKKGDLLFFPSGRGRSLHGVCPVT---RGSRWAITGW 176 (178)
T ss_pred ecCCCC-ccceEEeCCCCcEEEEeCCCCCccccCCcce---eCCEEEEEEE
Confidence 997653 24579999999999999973 3389999542 2788888776
No 21
>KOG3993|consensus
Probab=98.65 E-value=4e-09 Score=110.37 Aligned_cols=192 Identities=20% Similarity=0.340 Sum_probs=104.1
Q ss_pred ccccCccccccCChhHHHHHHHHhc-CCccccCcCCCcCCChHHHHHHHHHhcCCCccccc--cccCccccCchhHHHHH
Q psy11980 104 KFKCDKCEKQFYGLTLLRIHERLHA-EKLYECDICKNRFPSSGAMKKHRRKHTGERPYECK--EVCGRVFNRKDNLREHL 180 (831)
Q Consensus 104 ~~~C~~C~k~f~~~~~L~~H~~~H~-~k~~~C~~C~k~F~~~~~L~~H~~~h~~~~~~~C~--~~C~~~f~~~~~L~~H~ 180 (831)
-|.|..|...|.....|.+|.-.-. .--|+|+.|+|.|+-..+|..|.|.|.....-.-. .-=.+...++...+.-.
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~ 346 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAE 346 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhcc
Confidence 3888888888888888888852111 13488888888888888888888887543211000 00000011111111111
Q ss_pred hhhcCcccCCCccccCcccccccCchhhhhhhhhccccccccc--ccccc---------hHHhhhhccc---CCC-----
Q psy11980 181 RAHAGETKRKKKFKCDKCEKQFYGLTLLRIHERLHGLIERTCY--ARFAA---------KETLNRHMKT---HSG----- 241 (831)
Q Consensus 181 ~~h~~~~~~~~~~~C~~C~k~f~~~~~L~~H~~~h~~~c~~C~--~~f~~---------~~~L~~H~~~---h~~----- 241 (831)
+ .|.......|.|..|+|.|.....|+.|+.+|+..-..-+ ..|.. ...+..|... |..
T Consensus 347 r--sg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~ 424 (500)
T KOG3993|consen 347 R--SGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYV 424 (500)
T ss_pred c--cCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeee
Confidence 1 2333334579999999999999999999888764322111 11110 0111111111 000
Q ss_pred ---CCccccccchhhhcChhhHHhhhhcccccCCccCcccccccccchHHHhhhhhccc
Q psy11980 242 ---VKPHVCEYCEKAFIQASQLKAHMFHHTGENGHTCELCNKTFNRKARLELHMKYIHE 297 (831)
Q Consensus 242 ---~k~~~C~~C~k~f~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~~H~ 297 (831)
.....|+.|+-.+.++..-..|.+.-..+..|.|.+|.-+|.+...|.+|+...|.
T Consensus 425 a~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hp 483 (500)
T KOG3993|consen 425 AGSAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHP 483 (500)
T ss_pred eccccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcCh
Confidence 11134556665555555444454444445556666666666666666666665553
No 22
>PHA02768 hypothetical protein; Provisional
Probab=98.62 E-value=1.7e-08 Score=76.22 Aligned_cols=44 Identities=20% Similarity=0.440 Sum_probs=36.5
Q ss_pred CccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHH
Q psy11980 301 PYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLK 346 (831)
Q Consensus 301 ~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~ 346 (831)
-|+|+.||+.|...++|..|+++|+ ++|+|..|++.|...+.|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 3788888888888888888888888 6888888888888776653
No 23
>PRK05467 Fe(II)-dependent oxygenase superfamily protein; Provisional
Probab=98.46 E-value=1.4e-07 Score=95.32 Aligned_cols=170 Identities=11% Similarity=0.005 Sum_probs=100.1
Q ss_pred eeecCCCCcccccccccceeeeCCCCCCCCcchhhhhHHHHHHhhhcCCeEEEcCCCChhHHHHHHHHHHhhhhC-CCcc
Q psy11980 603 CVLDDFLGPERDMHLDTNVVVVPPFHHTNNDRVHSDMIENVIRDMDQYGLCVVDNFLGPERGMAVLHEVLGMYHS-GVFK 681 (831)
Q Consensus 603 ~v~d~fl~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~id~~l~~~~~~~l~~~~~~~~~~-~~~~ 681 (831)
.+|+|+|+++.+.++.+.+... .|.++++..... ...++++.-+.. ....++++...+...+.. ..|.
T Consensus 3 ~~I~~vLs~eec~~~~~~le~~----~~~dg~~taG~~---~~~vKnN~ql~~----d~~~a~~l~~~i~~~L~~~~l~~ 71 (226)
T PRK05467 3 LHIPDVLSPEEVAQIRELLDAA----EWVDGRVTAGAQ---AAQVKNNQQLPE----DSPLARELGNLILDALTRNPLFF 71 (226)
T ss_pred eeecccCCHHHHHHHHHHHHhc----CCccCCcCcCcc---chhcccccccCC----CCHHHHHHHHHHHHHHhcCchhh
Confidence 5788999999888887766432 133333222211 123333222222 233556666666655432 3343
Q ss_pred CCeeecCCcCCcccCccccCceeEEecCCcccchhHHHHHHHHHHHHHHhhhc-ccCCcceeEEe--cCcccCCCceeEe
Q psy11980 682 DGQLVSNKSTNKQDLKTIRGDQITWIDGRETYCSNIGRLISEVDAIIMRANRM-VNNGRMGDFVI--NGRTKQNGGLLRI 758 (831)
Q Consensus 682 ~~~~~~g~~~~~~~~~~~r~d~~~~~~~~~~~~~~~~~~~~~~d~~~~~~n~~-~~~~R~~s~v~--~~W~~~~GGeL~l 758 (831)
.+.+... ..+--+..| ...++|..|+|......... ...++.+|+++ ++..+.+||||++
T Consensus 72 sa~lp~~---------i~~~~f~rY--------~~G~~y~~H~D~~~~~~~~~~~~~rs~lS~~lyLnd~~~yeGGEl~~ 134 (226)
T PRK05467 72 SAALPRK---------IHPPLFNRY--------EGGMSYGFHVDNAVRSLPGTGGRVRTDLSATLFLSDPDDYDGGELVI 134 (226)
T ss_pred hhccccc---------cccceEEEE--------CCCCccCccccCCcccCCCCCcceeEEEEEEEEeCCCCCCcCCceEE
Confidence 3333110 011112222 23477999999876542211 11224677776 6666669999999
Q ss_pred ccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC
Q psy11980 759 FPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN 808 (831)
Q Consensus 759 y~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~ 808 (831)
+.... ...|.|..|++|+|.|.. +|||.|. ..|.||+...|-
T Consensus 135 ~~~~g---~~~Vkp~aG~~vlfps~~--lH~v~pV---t~G~R~~~~~Wi 176 (226)
T PRK05467 135 EDTYG---EHRVKLPAGDLVLYPSTS--LHRVTPV---TRGVRVASFFWI 176 (226)
T ss_pred ecCCC---cEEEecCCCeEEEECCCC--ceeeeec---cCccEEEEEecH
Confidence 86543 368999999999999986 5999953 359999999985
No 24
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.34 E-value=2.3e-07 Score=59.86 Aligned_cols=25 Identities=44% Similarity=0.897 Sum_probs=16.0
Q ss_pred HHHHHhhhccCCCCccccccccccc
Q psy11980 344 SLKIHLLTHTKEPPKSCEVCGRAFI 368 (831)
Q Consensus 344 ~L~~H~~~H~~~~py~C~~C~~~F~ 368 (831)
+|..|+++|++++||.|++|+++|.
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEES
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeC
Confidence 3566666666666666666666664
No 25
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.24 E-value=4.3e-07 Score=58.57 Aligned_cols=26 Identities=54% Similarity=0.973 Sum_probs=21.5
Q ss_pred HHHHHHhhhcCCCCccCCCCCCccCC
Q psy11980 316 DLTRHCLLHTGVKPHKCPICTKGFTM 341 (831)
Q Consensus 316 ~L~~H~~~H~~~k~~~C~~C~k~F~~ 341 (831)
+|.+|+++|++++||+|++|+++|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 47888888888888888888888863
No 26
>PF13640 2OG-FeII_Oxy_3: 2OG-Fe(II) oxygenase superfamily; PDB: 3DKQ_B 3GZE_D 3HQR_A 2Y34_A 2G1M_A 2G19_A 3OUI_A 3OUJ_A 2HBU_A 2Y33_A ....
Probab=98.21 E-value=1.4e-06 Score=77.37 Aligned_cols=67 Identities=22% Similarity=0.320 Sum_probs=46.4
Q ss_pred CCcceeEEe----cCcccCCCceeEeccCC-CCCcceeec-----CCCCeEEEEecCCCCCccccccccCCCCceeEEee
Q psy11980 737 NGRMGDFVI----NGRTKQNGGLLRIFPEG-GGDKVADIE-----PMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHV 806 (831)
Q Consensus 737 ~~R~~s~v~----~~W~~~~GGeL~ly~~~-~~~~~~~i~-----P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~ 806 (831)
..+.+|+|+ ++| +.+||+|++|+.. .......|+ |..|++|+|.++.. +|+|.|. +..+.||+...
T Consensus 22 ~~~~~t~llyL~~~~~-~~~GG~l~~~~~~~~~~~~~~~~~~~~~p~~g~~v~F~~~~~-~H~v~~v--~~~~~R~~l~~ 97 (100)
T PF13640_consen 22 PHRRVTLLLYLNDPEW-EFEGGELEFYPSKDSDDVSREVEDFDIVPKPGRLVIFPSDNS-LHGVTPV--GEGGRRYSLTF 97 (100)
T ss_dssp CSEEEEEEEESS-CS--HCEE--EEETTTS-TSSTCEEEGGGSEE-BTTEEEEEESCTC-EEEEEEE---EESEEEEEEE
T ss_pred CcceEEEEEEECCCCc-ccCCCEEEEeccccCCCcceEEEeccccCCCCEEEEEeCCCC-eecCccc--CCCCCEEEEEE
Confidence 358889987 578 9999999999754 223455666 99999999999666 9999987 54466665544
Q ss_pred c
Q psy11980 807 D 807 (831)
Q Consensus 807 d 807 (831)
|
T Consensus 98 ~ 98 (100)
T PF13640_consen 98 W 98 (100)
T ss_dssp E
T ss_pred E
Confidence 3
No 27
>PHA00616 hypothetical protein
Probab=98.02 E-value=2.1e-06 Score=61.67 Aligned_cols=33 Identities=27% Similarity=0.330 Sum_probs=17.2
Q ss_pred CccCCCCCCccCChhHHHHHhhhccCCCCcccc
Q psy11980 329 PHKCPICTKGFTMKSSLKIHLLTHTKEPPKSCE 361 (831)
Q Consensus 329 ~~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~ 361 (831)
||+|+.||+.|..++.|..|++.|+|++|+.|+
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 345555555555555555555555555555443
No 28
>PHA00732 hypothetical protein
Probab=97.99 E-value=4.2e-06 Score=69.81 Aligned_cols=45 Identities=33% Similarity=0.702 Sum_probs=23.3
Q ss_pred CccCcccccccCChhHHHHHHhh-hcCCCCccCCCCCCccCChhHHHHHhhh
Q psy11980 301 PYQCDICRKTFIRKEDLTRHCLL-HTGVKPHKCPICTKGFTMKSSLKIHLLT 351 (831)
Q Consensus 301 ~~~C~~C~k~F~~~~~L~~H~~~-H~~~k~~~C~~C~k~F~~~~~L~~H~~~ 351 (831)
||.|..|++.|.+...|..|++. |. ++.|+.|++.|. .|..|.++
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~ 46 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYS 46 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhcc
Confidence 35555555555555555555553 32 235555555554 34555543
No 29
>PHA00616 hypothetical protein
Probab=97.95 E-value=3.2e-06 Score=60.79 Aligned_cols=34 Identities=24% Similarity=0.346 Sum_probs=28.6
Q ss_pred CccCcccccccCChhHHHHHHhhhcCCCCccCCC
Q psy11980 301 PYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPI 334 (831)
Q Consensus 301 ~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~ 334 (831)
||+|+.||+.|..++.|..|++.|+|++++.|+.
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 5788888888888888888888888888888764
No 30
>COG3751 EGL-9 Predicted proline hydroxylase [Posttranslational modification, protein turnover, chaperones]
Probab=97.95 E-value=2.8e-05 Score=79.01 Aligned_cols=144 Identities=22% Similarity=0.301 Sum_probs=95.1
Q ss_pred hHHHHHHhhhcCCeEEEcCCCC-hhHHHHHHHHHHhhhhCCCccCCeeecCCcCCcccCccccCceeEEecCCcccchhH
Q psy11980 639 MIENVIRDMDQYGLCVVDNFLG-PERGMAVLHEVLGMYHSGVFKDGQLVSNKSTNKQDLKTIRGDQITWIDGRETYCSNI 717 (831)
Q Consensus 639 ~~~~~~~~l~~~g~~~id~~l~-~~~~~~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~r~d~~~~~~~~~~~~~~~ 717 (831)
.....++....-+|.++|.|.. .-..++++.++.+.. ...+..+.+.+|...... ..+++|.+.|++.........
T Consensus 32 ~l~~~i~~~~~~~~~~~~~~~~~~li~r~~~~~~~e~i-~~~~~~~~i~r~~~~~~~--~~l~g~~~~~~~~~~~~~~~~ 108 (252)
T COG3751 32 QLDREIDDSLDWNWFVIDEFSNLELITRQVREELGEEI-AEAFKEAAILRGGQIQVF--AFLSGDLKDELDPTRSPNLPV 108 (252)
T ss_pred HHHhhhhhhccccccccccccchhhcCHHHHHHHHhhh-hhhccccceeccccceee--eecchhhhhhhcccccccchH
Confidence 3344556666678888888877 333444555555432 334566666555433222 378899999998433334445
Q ss_pred HHHHHHHHHHHHHhhhcccCCcceeEEecCcccCCCceeEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCC
Q psy11980 718 GRLISEVDAIIMRANRMVNNGRMGDFVINGRTKQNGGLLRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPG 797 (831)
Q Consensus 718 ~~~~~~~d~~~~~~n~~~~~~R~~s~v~~~W~~~~GGeL~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~ 797 (831)
..|...++.++.-++..... |.|-+ . .-|...++|+.
T Consensus 109 ~~~~~~~es~r~~~~~~~~l----------------g~l~~--------------------------~-~ve~~~~~y~~ 145 (252)
T COG3751 109 QRYLEFSESIRFILGLPQLL----------------GILDV--------------------------S-EVEGQITVYNP 145 (252)
T ss_pred HHHHHHHHHHHHHhchhhhc----------------Cccce--------------------------e-eeeeeeeEecC
Confidence 67888888887666622111 11111 1 34788999996
Q ss_pred CCceeEEeecCC-CCCCceEEEEEecCCCcccc
Q psy11980 798 HGSHYVKHVDNP-NRDGRCITAIYYLNRDWDIK 829 (831)
Q Consensus 798 ~~~ry~~h~d~~-~~~~R~~t~~~Yln~~w~~~ 829 (831)
|.+|..|.|+. .++.|.+|.+||+|++|.+.
T Consensus 146 -G~~l~~H~D~~~~~~~R~~~yv~y~~r~wkpe 177 (252)
T COG3751 146 -GCFLLKHDDNGRDKDIRLATYVYYLTREWKPE 177 (252)
T ss_pred -CceeEeecccCCCccceEEEEEeccCCCCCcC
Confidence 99999999996 59999999999999999764
No 31
>PHA00732 hypothetical protein
Probab=97.90 E-value=9.2e-06 Score=67.77 Aligned_cols=46 Identities=33% Similarity=0.569 Sum_probs=34.5
Q ss_pred CccCCCCCCccCChhHHHHHhhh-ccCCCCcccccccccccChhHHHHHHHHh
Q psy11980 329 PHKCPICTKGFTMKSSLKIHLLT-HTKEPPKSCEVCGRAFIRQDCLIRHLRQK 380 (831)
Q Consensus 329 ~~~C~~C~k~F~~~~~L~~H~~~-H~~~~py~C~~C~~~F~~~~~L~~H~r~~ 380 (831)
||.|+.|++.|.+.+.|..|++. |. ++.|+.|+++|. .|..|++++
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~~ 47 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYSQ 47 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhccc
Confidence 57788888888888888888874 54 357888888887 477777543
No 32
>KOG3844|consensus
Probab=97.82 E-value=8.4e-05 Score=78.23 Aligned_cols=70 Identities=20% Similarity=0.307 Sum_probs=52.0
Q ss_pred cCCcceeEEe----cCcccCCCceeEeccCCCCC----cceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeec
Q psy11980 736 NNGRMGDFVI----NGRTKQNGGLLRIFPEGGGD----KVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVD 807 (831)
Q Consensus 736 ~~~R~~s~v~----~~W~~~~GGeL~ly~~~~~~----~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d 807 (831)
.-+|.+++|+ +||.++.||+|+|||.+... ..+.|.|..++||+|.=--+++|+|--.- - .+.|....=|
T Consensus 137 ietRriaFilYL~~~Dwds~~GG~L~Lf~~d~~~~P~s~~asl~P~~Nql~fFeVsp~SFH~V~Ev~-s-de~RlSIsGW 214 (476)
T KOG3844|consen 137 IETRRIAFILYLVDPDWDSEYGGELRLFPDDCPSQPKSVAASLEPQWNQLVFFEVSPISFHDVEEVL-S-DEPRLSISGW 214 (476)
T ss_pred ccceEEEEEEEecCcccccccCceeEecccccccCccchhhccCcccceEEEEEecccchhhHHHHh-c-cCcceeEeee
Confidence 3458899998 69999999999999987543 34459999999999976668899985322 2 2345555444
No 33
>smart00702 P4Hc Prolyl 4-hydroxylase alpha subunit homologues. Mammalian enzymes catalyse hydroxylation of collagen, for example. Prokaryotic enzymes might catalyse hydroxylation of antibiotic peptides. These are 2-oxoglutarate-dependent dioxygenases, requiring 2-oxoglutarate and dioxygen as cosubstrates and ferrous iron as a cofactor.
Probab=97.69 E-value=0.00014 Score=71.97 Aligned_cols=117 Identities=26% Similarity=0.356 Sum_probs=72.8
Q ss_pred CeEEEcCCCChhHHHHHHHHHHhhhhCCCccCCeeecCCcCCcccCccccCceeEEecCCcccchhHHHHHHHHHHHHHH
Q psy11980 651 GLCVVDNFLGPERGMAVLHEVLGMYHSGVFKDGQLVSNKSTNKQDLKTIRGDQITWIDGRETYCSNIGRLISEVDAIIMR 730 (831)
Q Consensus 651 g~~~id~~l~~~~~~~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~r~d~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 730 (831)
++.++++||+++....|+..+..... .+.+..+... ......+|.+...|++..+ ..+....+.+.+..+.+.
T Consensus 2 ~i~~~~~~ls~~ec~~li~~~~~~~~-----~~~~~~~~~~-~~~~~~~R~~~~~~l~~~~-~~~~~~~l~~~i~~~~~~ 74 (178)
T smart00702 2 GVVVFHDFLSPAECQKLLEEAEPLGW-----RGEVTRGDTN-PNHDSKYRQSNGTWLELLK-GDLVIERIRQRLADFLGL 74 (178)
T ss_pred cEEEECCCCCHHHHHHHHHHhhhhcc-----cceeecCCCC-ccccCCCEeecceecCCCC-CCHHHHHHHHHHHHHHCC
Confidence 57899999999999999888876532 3333222211 1123468999999998543 112223333333222211
Q ss_pred hhhcccCCcceeEEecCcccCCCceeEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecCCC
Q psy11980 731 ANRMVNNGRMGDFVINGRTKQNGGLLRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDNPN 810 (831)
Q Consensus 731 ~n~~~~~~R~~s~v~~~W~~~~GGeL~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~~~ 810 (831)
.. + ... ...+.+.+.|++ |.+|..|+|+..
T Consensus 75 ~~---------------------~--------------------------~~~--~~~~~~~~~Y~~-g~~~~~H~D~~~ 104 (178)
T smart00702 75 LR---------------------G--------------------------LPL--SAEDAQVARYGP-GGHYGPHVDNFE 104 (178)
T ss_pred Cc---------------------h--------------------------hhc--cCcceEEEEECC-CCcccCcCCCCC
Confidence 00 0 000 024578899996 999999999963
Q ss_pred C---CCceEEEEEecCC
Q psy11980 811 R---DGRCITAIYYLNR 824 (831)
Q Consensus 811 ~---~~R~~t~~~Yln~ 824 (831)
. ..|.+|+++|||+
T Consensus 105 ~~~~~~r~~T~~~yLn~ 121 (178)
T smart00702 105 DDENGDRIATFLLYLND 121 (178)
T ss_pred CCCCCCeEEEEEEEecc
Confidence 3 3799999999996
No 34
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.62 E-value=7.7e-05 Score=57.73 Aligned_cols=51 Identities=39% Similarity=0.646 Sum_probs=33.5
Q ss_pred CccCCCCCCccCChhHHHHHhhh-ccCC-CCcccccccccccChhHHHHHHHHhcc
Q psy11980 329 PHKCPICTKGFTMKSSLKIHLLT-HTKE-PPKSCEVCGRAFIRQDCLIRHLRQKHR 382 (831)
Q Consensus 329 ~~~C~~C~k~F~~~~~L~~H~~~-H~~~-~py~C~~C~~~F~~~~~L~~H~r~~H~ 382 (831)
.|.||+|++ ..+...|..|... |..+ +.+.|++|...+. .+|.+|+...|+
T Consensus 2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence 367777777 4445677777554 4433 4577888877544 378888877764
No 35
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.53 E-value=8.7e-05 Score=57.44 Aligned_cols=36 Identities=28% Similarity=0.484 Sum_probs=15.0
Q ss_pred ccCcccccccccchHHHhhhhhcccCC-CCccCccccc
Q psy11980 273 HTCELCNKTFNRKARLELHMKYIHEGA-DPYQCDICRK 309 (831)
Q Consensus 273 ~~C~~C~k~F~~~~~L~~H~~~~H~~~-k~~~C~~C~k 309 (831)
|.|++|++ ..+...|..|+...|..+ +.+.|++|..
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~ 39 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSS 39 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchh
Confidence 44444444 223344444444444332 2344444444
No 36
>KOG1146|consensus
Probab=97.32 E-value=0.00038 Score=83.91 Aligned_cols=110 Identities=14% Similarity=0.180 Sum_probs=81.0
Q ss_pred cccCcccccccCchHHHHHhhhccccccccccccccCcccCChhHHHHHHHHhcCCCC---------------------C
Q psy11980 43 HLCKGCYRIFTSQGYMLLSYKSKEEEKQICTYIKVCGRVFNRKDNLREHLRAHAGETK---------------------R 101 (831)
Q Consensus 43 ~~C~~C~~~F~~~~~l~~H~~~~~~~~~~~~~C~~C~k~f~~~~~L~~H~~~h~~~~~---------------------~ 101 (831)
-.|.-|+..|.+...+..|+..-.... ..+.|+.|+..|+....|..|||..+.+.. .
T Consensus 437 ~e~~~~e~~~~s~r~~~~~t~~L~S~~-kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~ 515 (1406)
T KOG1146|consen 437 VELTKAEPLLESKRSLEGQTVVLHSFF-KTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCP 515 (1406)
T ss_pred ccccchhhhhhhhcccccceeeeeccc-ccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCC
Confidence 446667777777766666665544333 348999999999999999999987333211 2
Q ss_pred CcccccCccccccCChhHHHHHHHHh--c------------------------------------------CCccccCcC
Q psy11980 102 KKKFKCDKCEKQFYGLTLLRIHERLH--A------------------------------------------EKLYECDIC 137 (831)
Q Consensus 102 ~k~~~C~~C~k~f~~~~~L~~H~~~H--~------------------------------------------~k~~~C~~C 137 (831)
.++|.|..|..++.+...|.+|+..- . .-.+.|.+|
T Consensus 516 ~~p~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc 595 (1406)
T KOG1146|consen 516 GKPYPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVC 595 (1406)
T ss_pred CCcccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhh
Confidence 47899999999999999999997531 0 014789999
Q ss_pred CCcCCChHHHHHHHHH
Q psy11980 138 KNRFPSSGAMKKHRRK 153 (831)
Q Consensus 138 ~k~F~~~~~L~~H~~~ 153 (831)
+..-.-...|+.||..
T Consensus 596 ~yetniarnlrihmts 611 (1406)
T KOG1146|consen 596 SYETNIARNLRIHMTA 611 (1406)
T ss_pred cchhhhhhcccccccc
Confidence 9988888888888764
No 37
>PRK05467 Fe(II)-dependent oxygenase superfamily protein; Provisional
Probab=97.17 E-value=0.0012 Score=67.01 Aligned_cols=39 Identities=33% Similarity=0.491 Sum_probs=32.1
Q ss_pred ccccccccCCCCceeEEeecCCCC-----C---CceEEEEEecCCCcc
Q psy11980 788 HEAMVACYPGHGSHYVKHVDNPNR-----D---GRCITAIYYLNRDWD 827 (831)
Q Consensus 788 HEVl~a~yp~~~~ry~~h~d~~~~-----~---~R~~t~~~Yln~~w~ 827 (831)
++.+.++|.+ |.+|..|+||+-. . +|.+|+++|||+.|+
T Consensus 80 ~~~~f~rY~~-G~~y~~H~D~~~~~~~~~~~~~rs~lS~~lyLnd~~~ 126 (226)
T PRK05467 80 HPPLFNRYEG-GMSYGFHVDNAVRSLPGTGGRVRTDLSATLFLSDPDD 126 (226)
T ss_pred ccceEEEECC-CCccCccccCCcccCCCCCcceeEEEEEEEEeCCCCC
Confidence 5788999996 9999999999522 1 347999999999875
No 38
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.11 E-value=0.00027 Score=44.11 Aligned_cols=22 Identities=36% Similarity=0.921 Sum_probs=15.6
Q ss_pred cccccccccccChhHHHHHHHH
Q psy11980 358 KSCEVCGRAFIRQDCLIRHLRQ 379 (831)
Q Consensus 358 y~C~~C~~~F~~~~~L~~H~r~ 379 (831)
|.|+.|++.|.++..|.+|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 5677777777777777777765
No 39
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.07 E-value=0.0003 Score=62.20 Aligned_cols=74 Identities=26% Similarity=0.428 Sum_probs=20.7
Q ss_pred cCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHHHHhhhccCCCCcccccccccccChhHHHHHHHHhc
Q psy11980 303 QCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAFIRQDCLIRHLRQKH 381 (831)
Q Consensus 303 ~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~F~~~~~L~~H~r~~H 381 (831)
+|..|+..|.+...|..|+...++...- ....+.....+..+++.- -...+.|..|++.|.+...|..||+.++
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~----~~~~l~~~~~~~~~~~~~-~~~~~~C~~C~~~f~s~~~l~~Hm~~~~ 74 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIP----DQKYLVDPNRLLNYLRKK-VKESFRCPYCNKTFRSREALQEHMRSKH 74 (100)
T ss_dssp -----------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHTT
T ss_pred Cccccccccccccccccccccccccccc----cccccccccccccccccc-cCCCCCCCccCCCCcCHHHHHHHHcCcc
Confidence 4777777777777777777655543211 112222334444443322 1226889999999999999999998653
No 40
>PF13661 2OG-FeII_Oxy_4: 2OG-Fe(II) oxygenase superfamily
Probab=97.06 E-value=0.00066 Score=55.73 Aligned_cols=42 Identities=31% Similarity=0.495 Sum_probs=34.0
Q ss_pred CccccccccCCCCceeEEeecCCC---CCCceEEEEEecCCCcccc
Q psy11980 787 PHEAMVACYPGHGSHYVKHVDNPN---RDGRCITAIYYLNRDWDIK 829 (831)
Q Consensus 787 pHEVl~a~yp~~~~ry~~h~d~~~---~~~R~~t~~~Yln~~w~~~ 829 (831)
.+.+....|. .|..|..|+|+-. ...|.+|+|+|||++|+..
T Consensus 10 ~~~~~~~~~~-~g~~~~~H~D~~~~~~~~~r~~t~llYLn~~w~~d 54 (70)
T PF13661_consen 10 RPNFRFYRYR-RGDFFGWHVDADPSSSGKRRFLTLLLYLNEDWDED 54 (70)
T ss_pred CcceeEEEcC-CCCEeeeeEcCCccccccceeEEEEEEecccccCc
Confidence 5667777788 4999999999942 2459999999999888764
No 41
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.05 E-value=0.00024 Score=44.37 Aligned_cols=22 Identities=55% Similarity=0.949 Sum_probs=11.1
Q ss_pred ccCCCCCCccCChhHHHHHhhh
Q psy11980 330 HKCPICTKGFTMKSSLKIHLLT 351 (831)
Q Consensus 330 ~~C~~C~k~F~~~~~L~~H~~~ 351 (831)
|+|+.|++.|.++..|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 3455555555555555555444
No 42
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.04 E-value=0.00039 Score=43.68 Aligned_cols=24 Identities=33% Similarity=0.826 Sum_probs=15.2
Q ss_pred cccccccccccChhHHHHHHHHhc
Q psy11980 358 KSCEVCGRAFIRQDCLIRHLRQKH 381 (831)
Q Consensus 358 y~C~~C~~~F~~~~~L~~H~r~~H 381 (831)
|.|++|++.|.+...|..|++++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 567777777777777777776665
No 43
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.94 E-value=0.00039 Score=61.48 Aligned_cols=71 Identities=25% Similarity=0.471 Sum_probs=13.8
Q ss_pred CcccccccccchHHHhhhhhcccCCCCccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHHHHhhh
Q psy11980 275 CELCNKTFNRKARLELHMKYIHEGADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLKIHLLT 351 (831)
Q Consensus 275 C~~C~k~F~~~~~L~~H~~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~ 351 (831)
|..|+..|.+...|..|+...|.-..+ ....+.....+..+.+... ...+.|..|++.|.+...|..|++.
T Consensus 2 C~~C~~~f~~~~~l~~H~~~~H~~~~~-----~~~~l~~~~~~~~~~~~~~-~~~~~C~~C~~~f~s~~~l~~Hm~~ 72 (100)
T PF12756_consen 2 CLFCDESFSSVDDLLQHMKKKHGFDIP-----DQKYLVDPNRLLNYLRKKV-KESFRCPYCNKTFRSREALQEHMRS 72 (100)
T ss_dssp -----------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHH
T ss_pred ccccccccccccccccccccccccccc-----ccccccccccccccccccc-CCCCCCCccCCCCcCHHHHHHHHcC
Confidence 555555555555555555544432211 1111122333333332211 1245666666666666666666654
No 44
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.94 E-value=0.00024 Score=72.29 Aligned_cols=70 Identities=30% Similarity=0.610 Sum_probs=39.7
Q ss_pred CCCccCcc--cccccCChhHHHHHHhh-hcCCCCccCCCCCCccCChhHHHHHhhhccCCCCcccccccccccChhHHHH
Q psy11980 299 ADPYQCDI--CRKTFIRKEDLTRHCLL-HTGVKPHKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAFIRQDCLIR 375 (831)
Q Consensus 299 ~k~~~C~~--C~k~F~~~~~L~~H~~~-H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~F~~~~~L~~ 375 (831)
+|||+|++ |.|.+++...|+.|+.. |...+...=+ .-..|.-.-...|||.|++|+|.|.....|+-
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p----------~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKY 416 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENP----------SPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKY 416 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCC----------CccccccccccCCceeccccchhhccCcccee
Confidence 46677765 77777777777776642 2111111100 00001111234688999999999988888888
Q ss_pred HHH
Q psy11980 376 HLR 378 (831)
Q Consensus 376 H~r 378 (831)
|+.
T Consensus 417 Hr~ 419 (423)
T COG5189 417 HRK 419 (423)
T ss_pred ccc
Confidence 864
No 45
>KOG2231|consensus
Probab=96.72 E-value=0.0018 Score=74.40 Aligned_cols=103 Identities=22% Similarity=0.391 Sum_probs=51.5
Q ss_pred ccCcCCCcCCChHHHHHHHHHhcCCCcccccccc---------CccccCchhHHHHHhhhcC-cccCCCccccCcccccc
Q psy11980 133 ECDICKNRFPSSGAMKKHRRKHTGERPYECKEVC---------GRVFNRKDNLREHLRAHAG-ETKRKKKFKCDKCEKQF 202 (831)
Q Consensus 133 ~C~~C~k~F~~~~~L~~H~~~h~~~~~~~C~~~C---------~~~f~~~~~L~~H~~~h~~-~~~~~~~~~C~~C~k~f 202 (831)
.|..| ..|.+...|+.|+...|. .+.|. +| .....++..|..|+..-.. +..-.---.|..|...|
T Consensus 117 ~~~~c-~~~~s~~~Lk~H~~~~H~--~~~c~-lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp~C~~C~~~f 192 (669)
T KOG2231|consen 117 ECLHC-TEFKSVENLKNHMRDQHK--LHLCS-LCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHPLCKFCHERF 192 (669)
T ss_pred CCccc-cchhHHHHHHHHHHHhhh--hhccc-cccccceeeeeeeehehHHHHHHHHhcCCCccccccCCccchhhhhhh
Confidence 45566 556666666666533221 12222 12 1222345556666553221 11111123677777777
Q ss_pred cCchhhhhhhhhccccccccc------ccccchHHhhhhcccC
Q psy11980 203 YGLTLLRIHERLHGLIERTCY------ARFAAKETLNRHMKTH 239 (831)
Q Consensus 203 ~~~~~L~~H~~~h~~~c~~C~------~~f~~~~~L~~H~~~h 239 (831)
.....|..|++.++..|..|. ..|.....|..|.+.+
T Consensus 193 ld~~el~rH~~~~h~~chfC~~~~~~neyy~~~~dLe~HfR~~ 235 (669)
T KOG2231|consen 193 LDDDELYRHLRFDHEFCHFCDYKTGQNEYYNDYDDLEEHFRKG 235 (669)
T ss_pred ccHHHHHHhhccceeheeecCcccccchhcccchHHHHHhhhc
Confidence 777777777777766665553 2344444555554443
No 46
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.72 E-value=0.00047 Score=70.29 Aligned_cols=71 Identities=23% Similarity=0.445 Sum_probs=45.1
Q ss_pred ccCCccCcc--cccccccchHHHhhhhhcccCCCCccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHH
Q psy11980 269 GENGHTCEL--CNKTFNRKARLELHMKYIHEGADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLK 346 (831)
Q Consensus 269 ~~k~~~C~~--C~k~F~~~~~L~~H~~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~ 346 (831)
++|||+|++ |.|+|.+...|+.|+..-|...+...=+ .-..|.-.-...|||+|++|+|+|.....|+
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p----------~p~~~~~F~~~~KPYrCevC~KRYKNlNGLK 415 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENP----------SPEKMNIFSAKDKPYRCEVCDKRYKNLNGLK 415 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCC----------CccccccccccCCceeccccchhhccCccce
Confidence 358999987 9999999999999987445332221111 0011111123457888888888888877777
Q ss_pred HHh
Q psy11980 347 IHL 349 (831)
Q Consensus 347 ~H~ 349 (831)
-|.
T Consensus 416 YHr 418 (423)
T COG5189 416 YHR 418 (423)
T ss_pred ecc
Confidence 774
No 47
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.70 E-value=0.00075 Score=43.95 Aligned_cols=25 Identities=32% Similarity=0.703 Sum_probs=17.4
Q ss_pred CcccccccccccChhHHHHHHHHhc
Q psy11980 357 PKSCEVCGRAFIRQDCLIRHLRQKH 381 (831)
Q Consensus 357 py~C~~C~~~F~~~~~L~~H~r~~H 381 (831)
||.|..|++.|.+...|..|++.|+
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 4667777777777777777776654
No 48
>KOG2231|consensus
Probab=96.62 E-value=0.002 Score=74.09 Aligned_cols=107 Identities=24% Similarity=0.416 Sum_probs=65.8
Q ss_pred ccCccccccCChhHHHHHHHHhcCCccccCcCC---------CcCCChHHHHHHHHHhcC-CC----ccccccccCcccc
Q psy11980 106 KCDKCEKQFYGLTLLRIHERLHAEKLYECDICK---------NRFPSSGAMKKHRRKHTG-ER----PYECKEVCGRVFN 171 (831)
Q Consensus 106 ~C~~C~k~f~~~~~L~~H~~~H~~k~~~C~~C~---------k~F~~~~~L~~H~~~h~~-~~----~~~C~~~C~~~f~ 171 (831)
.|..| ..|.+...|+.|+..-+ +.+.|.+|- ...-+...|..|+..--. ++ .-.|. .|...|-
T Consensus 117 ~~~~c-~~~~s~~~Lk~H~~~~H-~~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp~C~-~C~~~fl 193 (669)
T KOG2231|consen 117 ECLHC-TEFKSVENLKNHMRDQH-KLHLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHPLCK-FCHERFL 193 (669)
T ss_pred CCccc-cchhHHHHHHHHHHHhh-hhhccccccccceeeeeeeehehHHHHHHHHhcCCCccccccCCccch-hhhhhhc
Confidence 46666 66667777777763222 234444442 112234556666543211 11 23576 7888888
Q ss_pred CchhHHHHHhhhcCcccCCCccccCcc------cccccCchhhhhhhhhcccccc--ccc
Q psy11980 172 RKDNLREHLRAHAGETKRKKKFKCDKC------EKQFYGLTLLRIHERLHGLIER--TCY 223 (831)
Q Consensus 172 ~~~~L~~H~~~h~~~~~~~~~~~C~~C------~k~f~~~~~L~~H~~~h~~~c~--~C~ 223 (831)
....|..|++.++ |.|..| +.-|.....|..|.+.+|+.|. .|.
T Consensus 194 d~~el~rH~~~~h--------~~chfC~~~~~~neyy~~~~dLe~HfR~~HflCE~~~C~ 245 (669)
T KOG2231|consen 194 DDDELYRHLRFDH--------EFCHFCDYKTGQNEYYNDYDDLEEHFRKGHFLCEEEFCR 245 (669)
T ss_pred cHHHHHHhhccce--------eheeecCcccccchhcccchHHHHHhhhcCccccccccc
Confidence 8888888887765 446555 3556777889999998888886 553
No 49
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.59 E-value=0.0012 Score=41.34 Aligned_cols=23 Identities=52% Similarity=1.005 Sum_probs=11.1
Q ss_pred ccCCCCCCccCChhHHHHHhhhc
Q psy11980 330 HKCPICTKGFTMKSSLKIHLLTH 352 (831)
Q Consensus 330 ~~C~~C~k~F~~~~~L~~H~~~H 352 (831)
|.|++|++.|.+...|..|+++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 34555555555555555555544
No 50
>PLN00052 prolyl 4-hydroxylase; Provisional
Probab=96.50 E-value=0.0073 Score=64.51 Aligned_cols=147 Identities=16% Similarity=0.234 Sum_probs=88.4
Q ss_pred CCeEEEcCCCChhHHHHHHHHHHhhhhCCCccCCeeecCCcCCcccCccccCceeEEecCCcc-c---------------
Q psy11980 650 YGLCVVDNFLGPERGMAVLHEVLGMYHSGVFKDGQLVSNKSTNKQDLKTIRGDQITWIDGRET-Y--------------- 713 (831)
Q Consensus 650 ~g~~~id~~l~~~~~~~l~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~r~d~~~~~~~~~~-~--------------- 713 (831)
-.++++++||.+++...|+.-... .+....+..+.. .......+|.....|++..+. .
T Consensus 54 P~i~~~~nfLs~~Ecd~Li~la~~-----~l~~S~v~~~~~-g~~~~s~~RTS~~~~l~~~~dpvv~~I~~Ria~~t~lp 127 (310)
T PLN00052 54 PRIFVYKGFLSDAECDHLVKLAKK-----KIQRSMVADNKS-GKSVMSEVRTSSGMFLDKRQDPVVSRIEERIAAWTFLP 127 (310)
T ss_pred CCEEEECCcCCHHHHHHHHHhccc-----ccccceeecCCC-CccccCCCEEecceeecCCCCHHHHHHHHHHHHHhCCC
Confidence 478999999999999888765543 334433322211 111223466666667643210 0
Q ss_pred -----------chhHHHHHHHHHHHHHHhhhcccCCcceeEEe--cCcccCCCceeEeccCCCC--------------Cc
Q psy11980 714 -----------CSNIGRLISEVDAIIMRANRMVNNGRMGDFVI--NGRTKQNGGLLRIFPEGGG--------------DK 766 (831)
Q Consensus 714 -----------~~~~~~~~~~~d~~~~~~n~~~~~~R~~s~v~--~~W~~~~GGeL~ly~~~~~--------------~~ 766 (831)
|...+.|..|.|.|....+......|+.|+++ ++ .+.||+ -+||.... ..
T Consensus 128 ~~~~E~lQVlrY~~Gq~Y~~H~D~~~~~~~~~~gg~R~aTvL~YLnd--v~~GGe-T~FP~~~~~~~~~~~~~~s~c~~~ 204 (310)
T PLN00052 128 EENAENIQILRYEHGQKYEPHFDYFHDKINQALGGHRYATVLMYLST--VDKGGE-TVFPNAEGWENQPKDDTFSECAHK 204 (310)
T ss_pred cccCcceEEEecCCCCCCCCCCCccccccccccCCceeEEEEEEecc--CCCCCc-eecCCcccccccccccchhhhhcC
Confidence 11114566666766543333233457777776 44 246999 55775310 12
Q ss_pred ceeecCCCCeEEEEecC-------CCCCccccccccCCCCceeEEeecC
Q psy11980 767 VADIEPMFDRILFFWSD-------RRNPHEAMVACYPGHGSHYVKHVDN 808 (831)
Q Consensus 767 ~~~i~P~~grlV~F~S~-------~~~pHEVl~a~yp~~~~ry~~h~d~ 808 (831)
-+.|.|..|+.|||++- ..+.|+++|.. .|..|+..+|=
T Consensus 205 gl~VkPkkG~ALlF~nl~~dG~~D~~SlHagcPVi---~G~Kw~atkWi 250 (310)
T PLN00052 205 GLAVKPVKGDAVLFFSLHIDGVPDPLSLHGSCPVI---EGEKWSAPKWI 250 (310)
T ss_pred CeEeccCcceEEEEeccCCCCCCCcccccCCCeee---cCeEEEEEEee
Confidence 47899999999999993 34479988653 48899888873
No 51
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.35 E-value=0.0017 Score=42.29 Aligned_cols=24 Identities=38% Similarity=0.706 Sum_probs=13.5
Q ss_pred CccCCCCCCccCChhHHHHHhhhc
Q psy11980 329 PHKCPICTKGFTMKSSLKIHLLTH 352 (831)
Q Consensus 329 ~~~C~~C~k~F~~~~~L~~H~~~H 352 (831)
||+|..|++.|.+...|..|++.|
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h 24 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSH 24 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTT
T ss_pred CCCCCccCCccCChhHHHHHhHHh
Confidence 355555555555555555555554
No 52
>KOG1146|consensus
Probab=96.32 E-value=0.002 Score=77.89 Aligned_cols=101 Identities=18% Similarity=0.246 Sum_probs=67.8
Q ss_pred CcccccccccchHHHhhhhhcccCCCCccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHHHHhhh---
Q psy11980 275 CELCNKTFNRKARLELHMKYIHEGADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLKIHLLT--- 351 (831)
Q Consensus 275 C~~C~k~F~~~~~L~~H~~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~--- 351 (831)
|..|+..+.+...+..|+...|...+.|+|+.|+..|+....|..|||..|.+-.- .+|. ..+.|.+.
T Consensus 439 ~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~--~~c~-------~gq~~~~~arg 509 (1406)
T KOG1146|consen 439 LTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQS--AYCK-------AGQNHPRLARG 509 (1406)
T ss_pred ccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccch--hHhH-------hcccccccccc
Confidence 44556666666777777777777778888888888888888888888864433211 2221 11122111
Q ss_pred ---ccCCCCcccccccccccChhHHHHHHHH-hccCc
Q psy11980 352 ---HTKEPPKSCEVCGRAFIRQDCLIRHLRQ-KHRSH 384 (831)
Q Consensus 352 ---H~~~~py~C~~C~~~F~~~~~L~~H~r~-~H~~~ 384 (831)
-.+.+||.|..|..+++.+.+|..|+.. .|...
T Consensus 510 ~~~~~~~~p~~C~~C~~stttng~LsihlqS~~h~~~ 546 (1406)
T KOG1146|consen 510 EVYRCPGKPYPCRACNYSTTTNGNLSIHLQSDLHRNE 546 (1406)
T ss_pred ccccCCCCcccceeeeeeeecchHHHHHHHHHhhHHH
Confidence 1245789999999999999999999974 44443
No 53
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.09 E-value=0.0037 Score=46.06 Aligned_cols=35 Identities=26% Similarity=0.487 Sum_probs=16.1
Q ss_pred cccCCCCccCcccccccCChhHHHHHHhhhcCCCC
Q psy11980 295 IHEGADPYQCDICRKTFIRKEDLTRHCLLHTGVKP 329 (831)
Q Consensus 295 ~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~ 329 (831)
.+..+.|-.|++|+..+....+|++|+.++|+.||
T Consensus 18 ~~~S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 18 KSQSEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp CCTTS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred hhccCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 34444555566666666665666666555555443
No 54
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.93 E-value=0.0051 Score=45.34 Aligned_cols=32 Identities=31% Similarity=0.634 Sum_probs=20.8
Q ss_pred CCCCcccccccccccChhHHHHHHHHhccCch
Q psy11980 354 KEPPKSCEVCGRAFIRQDCLIRHLRQKHRSHL 385 (831)
Q Consensus 354 ~~~py~C~~C~~~F~~~~~L~~H~r~~H~~~~ 385 (831)
.+.|-.|++|+..+.+..+|++|+...|+.++
T Consensus 21 S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 21 SEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred cCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 46778888888888888888888888887664
No 55
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.83 E-value=0.0062 Score=38.33 Aligned_cols=24 Identities=29% Similarity=0.569 Sum_probs=16.3
Q ss_pred cccccccccccChhHHHHHHHHhcc
Q psy11980 358 KSCEVCGRAFIRQDCLIRHLRQKHR 382 (831)
Q Consensus 358 y~C~~C~~~F~~~~~L~~H~r~~H~ 382 (831)
|+|+.|++... +..|.+|++.+|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 56777777777 7777788777764
No 56
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.41 E-value=0.013 Score=37.22 Aligned_cols=20 Identities=35% Similarity=0.879 Sum_probs=8.5
Q ss_pred ccccccccccChhHHHHHHH
Q psy11980 359 SCEVCGRAFIRQDCLIRHLR 378 (831)
Q Consensus 359 ~C~~C~~~F~~~~~L~~H~r 378 (831)
.|+.|++.|.....|..|++
T Consensus 2 ~C~~C~~~f~~~~~l~~H~~ 21 (26)
T smart00355 2 RCPECGKVFKSKSALKEHMR 21 (26)
T ss_pred CCCCCcchhCCHHHHHHHHH
Confidence 34444444444444444443
No 57
>PRK04860 hypothetical protein; Provisional
Probab=95.30 E-value=0.0076 Score=57.77 Aligned_cols=36 Identities=25% Similarity=0.685 Sum_probs=16.8
Q ss_pred CccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccC
Q psy11980 301 PYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFT 340 (831)
Q Consensus 301 ~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~ 340 (831)
+|.|. |+. ....+.+|.++|+++++|.|..|+..|.
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~ 154 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLV 154 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeE
Confidence 34454 444 3344444444444444444444444443
No 58
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.28 E-value=0.025 Score=58.68 Aligned_cols=127 Identities=26% Similarity=0.489 Sum_probs=84.0
Q ss_pred ccccc--chhhhcChhhHHhhhhcccccCCccCccccc---cc------ccchHHHhhhhhcccCCCCc----cCccccc
Q psy11980 245 HVCEY--CEKAFIQASQLKAHMFHHTGENGHTCELCNK---TF------NRKARLELHMKYIHEGADPY----QCDICRK 309 (831)
Q Consensus 245 ~~C~~--C~k~f~~~~~L~~H~~~H~~~k~~~C~~C~k---~F------~~~~~L~~H~~~~H~~~k~~----~C~~C~k 309 (831)
|.|+. |.........|..|.+..++ .+.|.+|-+ .| -++..|+.|... -..+.-| .|..|..
T Consensus 152 F~CP~skc~~~C~~~k~lk~H~K~~H~--~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~-G~~e~GFKGHP~C~FC~~ 228 (493)
T COG5236 152 FKCPKSKCHRRCGSLKELKKHYKAQHG--FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNG-GLEEEGFKGHPLCIFCKI 228 (493)
T ss_pred hcCCchhhhhhhhhHHHHHHHHHhhcC--cEEhHhhhcCcccCccceeeeecccccccccC-CccccCcCCCchhhhccc
Confidence 56654 55555556677788776543 246666632 23 345677777652 2112123 5999999
Q ss_pred ccCChhHHHHHHhhhcCCCCccCCCCCC-------ccCChhHHHHHhhhccCCCCccccc--cc----ccccChhHHHHH
Q psy11980 310 TFIRKEDLTRHCLLHTGVKPHKCPICTK-------GFTMKSSLKIHLLTHTKEPPKSCEV--CG----RAFIRQDCLIRH 376 (831)
Q Consensus 310 ~F~~~~~L~~H~~~H~~~k~~~C~~C~k-------~F~~~~~L~~H~~~H~~~~py~C~~--C~----~~F~~~~~L~~H 376 (831)
.|-+-..|.+|+|..|. +|-+|++ -|.+...|..|.+.- -|.|.+ |- ..|.....|..|
T Consensus 229 ~FYdDDEL~~HcR~~HE----~ChICD~v~p~~~QYFK~Y~~Le~HF~~~----hy~ct~qtc~~~k~~vf~~~~el~~h 300 (493)
T COG5236 229 YFYDDDELRRHCRLRHE----ACHICDMVGPIRYQYFKSYEDLEAHFRNA----HYCCTFQTCRVGKCYVFPYHTELLEH 300 (493)
T ss_pred eecChHHHHHHHHhhhh----hhhhhhccCccchhhhhCHHHHHHHhhcC----ceEEEEEEEecCcEEEeccHHHHHHH
Confidence 99999999999987652 4666665 467778888886542 266643 32 568889999999
Q ss_pred HHHhcc
Q psy11980 377 LRQKHR 382 (831)
Q Consensus 377 ~r~~H~ 382 (831)
+.+.|+
T Consensus 301 ~~~~h~ 306 (493)
T COG5236 301 LTRFHK 306 (493)
T ss_pred HHHHhh
Confidence 988886
No 59
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.23 E-value=0.015 Score=36.92 Aligned_cols=23 Identities=52% Similarity=0.855 Sum_probs=21.2
Q ss_pred ccCCCCCCccCChhHHHHHhhhc
Q psy11980 330 HKCPICTKGFTMKSSLKIHLLTH 352 (831)
Q Consensus 330 ~~C~~C~k~F~~~~~L~~H~~~H 352 (831)
|+|+.|++.|.+...|..|++.|
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHh
Confidence 68999999999999999999865
No 60
>PF13640 2OG-FeII_Oxy_3: 2OG-Fe(II) oxygenase superfamily; PDB: 3DKQ_B 3GZE_D 3HQR_A 2Y34_A 2G1M_A 2G19_A 3OUI_A 3OUJ_A 2HBU_A 2Y33_A ....
Probab=95.06 E-value=0.024 Score=50.05 Aligned_cols=34 Identities=35% Similarity=0.727 Sum_probs=28.9
Q ss_pred ccccCCCCceeEEeecCCCCCCceEEEEEecC-CCc
Q psy11980 792 VACYPGHGSHYVKHVDNPNRDGRCITAIYYLN-RDW 826 (831)
Q Consensus 792 ~a~yp~~~~ry~~h~d~~~~~~R~~t~~~Yln-~~w 826 (831)
...|+ .|..|..|.|+.....|.+|+||||| ++|
T Consensus 3 ~~~y~-~G~~~~~H~D~~~~~~~~~t~llyL~~~~~ 37 (100)
T PF13640_consen 3 LNRYP-PGGFFGPHTDNSYDPHRRVTLLLYLNDPEW 37 (100)
T ss_dssp EEEEE-TTEEEEEEESSSCCCSEEEEEEEESS-CS-
T ss_pred EEEEC-cCCEEeeeECCCCCCcceEEEEEEECCCCc
Confidence 34676 49999999998778899999999999 888
No 61
>PRK04860 hypothetical protein; Provisional
Probab=94.77 E-value=0.016 Score=55.56 Aligned_cols=39 Identities=21% Similarity=0.541 Sum_probs=32.1
Q ss_pred CccCCCCCCccCChhHHHHHhhhccCCCCcccccccccccChh
Q psy11980 329 PHKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAFIRQD 371 (831)
Q Consensus 329 ~~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~F~~~~ 371 (831)
+|.|. |++ ....+.+|.++|+++++|.|..|+..|....
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~ 157 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKG 157 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEec
Confidence 68897 887 6778888999999999999999988876543
No 62
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=94.34 E-value=0.02 Score=65.12 Aligned_cols=55 Identities=25% Similarity=0.440 Sum_probs=44.0
Q ss_pred CcccCcccccccCchHHHHHhhhcccccccccccc--ccCcccCChhHHHHHHHHhcCC
Q psy11980 42 GHLCKGCYRIFTSQGYMLLSYKSKEEEKQICTYIK--VCGRVFNRKDNLREHLRAHAGE 98 (831)
Q Consensus 42 ~~~C~~C~~~F~~~~~l~~H~~~~~~~~~~~~~C~--~C~k~f~~~~~L~~H~~~h~~~ 98 (831)
.+.|..|...|........|.+.+.++++ +.|. .|...|.....+..|...++..
T Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (467)
T COG5048 33 PDSCPNCTDSFSRLEHLTRHIRSHTGEKP--SQCSYSGCDKSFSRPLELSRHLRTHHNN 89 (467)
T ss_pred hhhcccccccccccchhhhhcccccccCC--ccccccccccccCCcchhhhhccccccc
Confidence 46799999999998888999999988887 4453 5777888888888888777654
No 63
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=94.27 E-value=0.023 Score=35.68 Aligned_cols=22 Identities=50% Similarity=0.807 Sum_probs=11.2
Q ss_pred ccCCCCCCccCChhHHHHHhhhc
Q psy11980 330 HKCPICTKGFTMKSSLKIHLLTH 352 (831)
Q Consensus 330 ~~C~~C~k~F~~~~~L~~H~~~H 352 (831)
|+|+.|++... +..|..|++.|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~ 22 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRH 22 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhh
Confidence 45555555555 55555555554
No 64
>KOG2785|consensus
Probab=93.80 E-value=0.13 Score=54.92 Aligned_cols=74 Identities=19% Similarity=0.264 Sum_probs=38.4
Q ss_pred ccccccchhhhcChhhHHhhhhcccccCCccCcccccccccchHHHhhhhhcccCCCCccCcccc---cccCChhHHHHH
Q psy11980 244 PHVCEYCEKAFIQASQLKAHMFHHTGENGHTCELCNKTFNRKARLELHMKYIHEGADPYQCDICR---KTFIRKEDLTRH 320 (831)
Q Consensus 244 ~~~C~~C~k~f~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~~H~~~k~~~C~~C~---k~F~~~~~L~~H 320 (831)
|-.|-.|++.+.+...-..||..++|-. . +.=. -......|..-+.. .-..-|.|-.|+ +.|.+-.+.+.|
T Consensus 166 Pt~CLfC~~~~k~~e~~~~HM~~~Hgff--I-Pdre-YL~D~~GLl~YLge--KV~~~~~CL~CN~~~~~f~sleavr~H 239 (390)
T KOG2785|consen 166 PTDCLFCDKKSKSLEENLKHMFKEHGFF--I-PDRE-YLTDEKGLLKYLGE--KVGIGFICLFCNELGRPFSSLEAVRAH 239 (390)
T ss_pred CcceeecCCCcccHHHHHHHHhhccCCc--C-CchH-hhhchhHHHHHHHH--HhccCceEEEeccccCcccccHHHHHH
Confidence 3556667766666666666666665421 0 0000 00111111111110 012347788888 888888888888
Q ss_pred Hhh
Q psy11980 321 CLL 323 (831)
Q Consensus 321 ~~~ 323 (831)
|..
T Consensus 240 M~~ 242 (390)
T KOG2785|consen 240 MRD 242 (390)
T ss_pred Hhh
Confidence 864
No 65
>KOG2785|consensus
Probab=93.78 E-value=0.11 Score=55.40 Aligned_cols=53 Identities=26% Similarity=0.520 Sum_probs=36.8
Q ss_pred CCcccCcccccccCchHHHHHhhhc-c----------------------------------ccccccccccccCcccCCh
Q psy11980 41 PGHLCKGCYRIFTSQGYMLLSYKSK-E----------------------------------EEKQICTYIKVCGRVFNRK 85 (831)
Q Consensus 41 ~~~~C~~C~~~F~~~~~l~~H~~~~-~----------------------------------~~~~~~~~C~~C~k~f~~~ 85 (831)
..|.|.-|...|.+...-+.|+++- | .+....+.|..|.+.|.+.
T Consensus 2 t~ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~c~k~~~s~ 81 (390)
T KOG2785|consen 2 TGFTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEACNKSFASP 81 (390)
T ss_pred CcceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHHhhccccCh
Confidence 3589999999999987777777532 0 1122346677777777777
Q ss_pred hHHHHHHH
Q psy11980 86 DNLREHLR 93 (831)
Q Consensus 86 ~~L~~H~~ 93 (831)
.....|+.
T Consensus 82 ~a~~~hl~ 89 (390)
T KOG2785|consen 82 KAHENHLK 89 (390)
T ss_pred hhHHHHHH
Confidence 77777765
No 66
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=93.60 E-value=0.023 Score=64.55 Aligned_cols=139 Identities=24% Similarity=0.426 Sum_probs=91.9
Q ss_pred CccccccchhhhcChhhHHhhhh--ccccc--CCccCc--ccccccccchHHHhhhhhcccCCCCccCcc--cccccCCh
Q psy11980 243 KPHVCEYCEKAFIQASQLKAHMF--HHTGE--NGHTCE--LCNKTFNRKARLELHMKYIHEGADPYQCDI--CRKTFIRK 314 (831)
Q Consensus 243 k~~~C~~C~k~f~~~~~L~~H~~--~H~~~--k~~~C~--~C~k~F~~~~~L~~H~~~~H~~~k~~~C~~--C~k~F~~~ 314 (831)
.++.|..|...|.....|..|.+ .|.++ +++.|+ .|++.|.....+..|.. .|.+..++.|.. |...+...
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 366 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHIL-LHTSISPAKEKLLNSSSKFSPL 366 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcc-cccCCCccccccccCccccccc
Confidence 46788888888888888888888 78888 888888 68888888888888887 677777666654 33333333
Q ss_pred hHHH-----HHHhhhcCCCCccCC--CCCCccCChhHHHHHhhhccCCC--CcccccccccccChhHHHHHHHHhcc
Q psy11980 315 EDLT-----RHCLLHTGVKPHKCP--ICTKGFTMKSSLKIHLLTHTKEP--PKSCEVCGRAFIRQDCLIRHLRQKHR 382 (831)
Q Consensus 315 ~~L~-----~H~~~H~~~k~~~C~--~C~k~F~~~~~L~~H~~~H~~~~--py~C~~C~~~F~~~~~L~~H~r~~H~ 382 (831)
..-. .....-...+.+.|. .|...+.....+..|...|.... .+.|..|.+.|.....+..|++.+..
T Consensus 367 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 443 (467)
T COG5048 367 LNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTN 443 (467)
T ss_pred cCCCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCccccccccccccc
Confidence 2211 111111222334443 26666666677777766666555 35667788888888877777766543
No 67
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=93.48 E-value=0.041 Score=34.89 Aligned_cols=22 Identities=32% Similarity=0.830 Sum_probs=14.6
Q ss_pred cccccccccccChhHHHHHHHH
Q psy11980 358 KSCEVCGRAFIRQDCLIRHLRQ 379 (831)
Q Consensus 358 y~C~~C~~~F~~~~~L~~H~r~ 379 (831)
|.|.+|++.|.+...|..|++.
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 4567777777777777777653
No 68
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=93.46 E-value=0.034 Score=35.27 Aligned_cols=22 Identities=41% Similarity=0.813 Sum_probs=13.4
Q ss_pred ccCCCCCCccCChhHHHHHhhh
Q psy11980 330 HKCPICTKGFTMKSSLKIHLLT 351 (831)
Q Consensus 330 ~~C~~C~k~F~~~~~L~~H~~~ 351 (831)
|.|.+|++.|.+...|..|++.
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 4566666666666666666554
No 69
>KOG2482|consensus
Probab=93.33 E-value=0.2 Score=52.37 Aligned_cols=23 Identities=35% Similarity=0.612 Sum_probs=15.7
Q ss_pred ccccCccccccCChhHHHHHHHH
Q psy11980 104 KFKCDKCEKQFYGLTLLRIHERL 126 (831)
Q Consensus 104 ~~~C~~C~k~f~~~~~L~~H~~~ 126 (831)
.+.|-.|.+.|..+..|+.||+.
T Consensus 195 r~~CLyCekifrdkntLkeHMrk 217 (423)
T KOG2482|consen 195 RLRCLYCEKIFRDKNTLKEHMRK 217 (423)
T ss_pred hheeeeeccccCCcHHHHHHHHh
Confidence 36677777777777777777664
No 70
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=92.27 E-value=0.14 Score=53.36 Aligned_cols=135 Identities=30% Similarity=0.418 Sum_probs=61.7
Q ss_pred cccCc--ccccccCchHHHHHhhhccccccccccccccC---cccC------ChhHHHHHHHHhcCCCCCCcccccCccc
Q psy11980 43 HLCKG--CYRIFTSQGYMLLSYKSKEEEKQICTYIKVCG---RVFN------RKDNLREHLRAHAGETKRKKKFKCDKCE 111 (831)
Q Consensus 43 ~~C~~--C~~~F~~~~~l~~H~~~~~~~~~~~~~C~~C~---k~f~------~~~~L~~H~~~h~~~~~~~k~~~C~~C~ 111 (831)
|.|+. |.........|..|.+..++. +.|.+|- +.|. ++..|..|...-..+..-+.--.|..|.
T Consensus 152 F~CP~skc~~~C~~~k~lk~H~K~~H~~----~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~ 227 (493)
T COG5236 152 FKCPKSKCHRRCGSLKELKKHYKAQHGF----VLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCK 227 (493)
T ss_pred hcCCchhhhhhhhhHHHHHHHHHhhcCc----EEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhcc
Confidence 55554 455544455566666655432 2343332 2232 2344444443221111111112466666
Q ss_pred cccCChhHHHHHHHHhcCCccccCcCC----CcCCChHHHHHHHHHhcCCCccccc-ccc--C--ccccCchhHHHHHhh
Q psy11980 112 KQFYGLTLLRIHERLHAEKLYECDICK----NRFPSSGAMKKHRRKHTGERPYECK-EVC--G--RVFNRKDNLREHLRA 182 (831)
Q Consensus 112 k~f~~~~~L~~H~~~H~~k~~~C~~C~----k~F~~~~~L~~H~~~h~~~~~~~C~-~~C--~--~~f~~~~~L~~H~~~ 182 (831)
+.|.+...|.+|+|..+++-|.|+.=+ .-|.+...|..|.+..+ |.|. +.| + .+|.....|..|+..
T Consensus 228 ~~FYdDDEL~~HcR~~HE~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h----y~ct~qtc~~~k~~vf~~~~el~~h~~~ 303 (493)
T COG5236 228 IYFYDDDELRRHCRLRHEACHICDMVGPIRYQYFKSYEDLEAHFRNAH----YCCTFQTCRVGKCYVFPYHTELLEHLTR 303 (493)
T ss_pred ceecChHHHHHHHHhhhhhhhhhhccCccchhhhhCHHHHHHHhhcCc----eEEEEEEEecCcEEEeccHHHHHHHHHH
Confidence 666666666666666555433332221 23555556666654322 3332 112 1 235555566666544
Q ss_pred hcC
Q psy11980 183 HAG 185 (831)
Q Consensus 183 h~~ 185 (831)
.++
T Consensus 304 ~h~ 306 (493)
T COG5236 304 FHK 306 (493)
T ss_pred Hhh
Confidence 443
No 71
>KOG2482|consensus
Probab=92.25 E-value=0.14 Score=53.49 Aligned_cols=22 Identities=27% Similarity=0.635 Sum_probs=18.7
Q ss_pred ccCCCCCCccCChhHHHHHhhh
Q psy11980 330 HKCPICTKGFTMKSSLKIHLLT 351 (831)
Q Consensus 330 ~~C~~C~k~F~~~~~L~~H~~~ 351 (831)
-.|-.|.-.|-....|..||.-
T Consensus 335 ~~c~~cd~~F~~e~~l~~hm~e 356 (423)
T KOG2482|consen 335 SRCAECDLSFWKEPGLLIHMVE 356 (423)
T ss_pred cccccccccccCcchhhhhccc
Confidence 4688889999999999999864
No 72
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=91.27 E-value=0.079 Score=34.37 Aligned_cols=21 Identities=33% Similarity=0.760 Sum_probs=10.2
Q ss_pred ccCcccccccCChhHHHHHHh
Q psy11980 302 YQCDICRKTFIRKEDLTRHCL 322 (831)
Q Consensus 302 ~~C~~C~k~F~~~~~L~~H~~ 322 (831)
|.|..|++.|.+...|..|++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTT
T ss_pred CCcccCCCCcCCHHHHHHHHc
Confidence 344455555555555544443
No 73
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=90.57 E-value=0.12 Score=33.51 Aligned_cols=22 Identities=27% Similarity=0.581 Sum_probs=14.9
Q ss_pred cccCcccccccCchHHHHHhhh
Q psy11980 43 HLCKGCYRIFTSQGYMLLSYKS 64 (831)
Q Consensus 43 ~~C~~C~~~F~~~~~l~~H~~~ 64 (831)
|.|..|++.|.+...|..|+++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 5577777777777777666654
No 74
>COG3128 PiuC Uncharacterized iron-regulated protein [Function unknown]
Probab=89.71 E-value=0.57 Score=44.99 Aligned_cols=55 Identities=16% Similarity=0.184 Sum_probs=41.3
Q ss_pred cCcccCCCceeEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC
Q psy11980 746 NGRTKQNGGLLRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN 808 (831)
Q Consensus 746 ~~W~~~~GGeL~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~ 808 (831)
+|-..-|||||+|-..-.+ ...-.|. |-||+|-|... |||-|. ..|.||+.-.|-
T Consensus 125 ~DPedYdGGeLVv~dtYg~--h~VklPA-GdLVlypStSl--H~VtPV---TRg~R~asffW~ 179 (229)
T COG3128 125 SDPEDYDGGELVVNDTYGN--HRVKLPA-GDLVLYPSTSL--HEVTPV---TRGERFASFFWI 179 (229)
T ss_pred CCccccCCceEEEeccccc--eEEeccC-CCEEEcccccc--eecccc---ccCceEEEeeeh
Confidence 7778899999999865442 1222343 88999999986 999954 247889888886
No 75
>KOG2893|consensus
Probab=88.80 E-value=0.13 Score=50.82 Aligned_cols=50 Identities=22% Similarity=0.419 Sum_probs=22.1
Q ss_pred CCCCCCccCChhHHHHHhhhccCCCCcccccccccccChhHHHHHHHHhccCch
Q psy11980 332 CPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAFIRQDCLIRHLRQKHRSHL 385 (831)
Q Consensus 332 C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~F~~~~~L~~H~r~~H~~~~ 385 (831)
|=+|++.|....-|..|++. |-|+|.+|.|...+--.|.-|-...|.+..
T Consensus 13 cwycnrefddekiliqhqka----khfkchichkkl~sgpglsihcmqvhketi 62 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIHCMQVHKETI 62 (341)
T ss_pred eeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceeehhhhhhhhh
Confidence 33444444444444444332 224444444444444444444444444433
No 76
>KOG2893|consensus
Probab=88.60 E-value=0.15 Score=50.35 Aligned_cols=42 Identities=36% Similarity=0.688 Sum_probs=23.1
Q ss_pred cccccCcccCChhHHHHHHHHhcCCCCCCcccccCccccccCChhHHHHH
Q psy11980 74 YIKVCGRVFNRKDNLREHLRAHAGETKRKKKFKCDKCEKQFYGLTLLRIH 123 (831)
Q Consensus 74 ~C~~C~k~f~~~~~L~~H~~~h~~~~~~~k~~~C~~C~k~f~~~~~L~~H 123 (831)
.|-.|++.|....-|.+|++. |-|+|.+|.|...+--.|..|
T Consensus 12 wcwycnrefddekiliqhqka--------khfkchichkkl~sgpglsih 53 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKA--------KHFKCHICHKKLFSGPGLSIH 53 (341)
T ss_pred eeeecccccchhhhhhhhhhh--------ccceeeeehhhhccCCCceee
Confidence 344566666666666666653 235666665554444444444
No 77
>PLN02997 flavonol synthase
Probab=88.24 E-value=7.7 Score=42.24 Aligned_cols=56 Identities=11% Similarity=0.189 Sum_probs=40.6
Q ss_pred cceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEEEEEecCCCccccc
Q psy11980 766 KVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 766 ~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t~~~Yln~~w~~~~ 830 (831)
..++|.|+.|.|||..-|.+ |++ .+++|...+=. .....-++++.|+.++++|+.+
T Consensus 229 ~Wi~V~p~pgalvVNiGD~L---e~~------TNG~~kSt~HRVv~~~~~~R~Si~fF~~P~~d~~i 286 (325)
T PLN02997 229 QWLDLNYINSAVVVIIGDQL---MRM------TNGRFKNVLHRAKTDKERLRISWPVFVAPRADMSV 286 (325)
T ss_pred cEEECCCCCCeEEEEechHH---HHH------hCCccccccceeeCCCCCCEEEEEEEecCCCCCeE
Confidence 57899999999999988887 555 35555432222 2234458999999999998754
No 78
>KOG4173|consensus
Probab=87.57 E-value=0.3 Score=47.35 Aligned_cols=69 Identities=29% Similarity=0.629 Sum_probs=37.2
Q ss_pred cCcccCChhHHHHHHHHhcCCCCCCcccccCccccccCChhHHHHHHHH-h----------cCCccccCc--CCCcCCCh
Q psy11980 78 CGRVFNRKDNLREHLRAHAGETKRKKKFKCDKCEKQFYGLTLLRIHERL-H----------AEKLYECDI--CKNRFPSS 144 (831)
Q Consensus 78 C~k~f~~~~~L~~H~~~h~~~~~~~k~~~C~~C~k~f~~~~~L~~H~~~-H----------~~k~~~C~~--C~k~F~~~ 144 (831)
|.+.|........|..+-++. .|..|.+.|.+...|..|+.. | +...|.|-+ |+..|.+.
T Consensus 87 c~~~~d~lD~~E~hY~~~h~~-------sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~KFkT~ 159 (253)
T KOG4173|consen 87 CCQVFDALDDYEHHYHTLHGN-------SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEKFKTS 159 (253)
T ss_pred hHHHHhhhhhHHHhhhhcccc-------hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhhhhhh
Confidence 455555555555555443332 355555555555555555421 1 134566643 77777777
Q ss_pred HHHHHHHHH
Q psy11980 145 GAMKKHRRK 153 (831)
Q Consensus 145 ~~L~~H~~~ 153 (831)
..-..|+..
T Consensus 160 r~RkdH~I~ 168 (253)
T KOG4173|consen 160 RDRKDHMIR 168 (253)
T ss_pred hhhhhHHHH
Confidence 666666543
No 79
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=87.25 E-value=0.91 Score=40.37 Aligned_cols=83 Identities=18% Similarity=0.294 Sum_probs=46.6
Q ss_pred CccCcccccccccchHHHhhhhhccc-------------CCCCccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCc
Q psy11980 272 GHTCELCNKTFNRKARLELHMKYIHE-------------GADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKG 338 (831)
Q Consensus 272 ~~~C~~C~k~F~~~~~L~~H~~~~H~-------------~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~ 338 (831)
|-.|+.|+.+..+...|.+-. +|- ....-.|--|...|........= .-.....|+|+.|...
T Consensus 15 P~~CpiCgLtLVss~HLARSy--HHLfPl~~f~ev~~~~~~~~~~C~~C~~~f~~~~~~~~~--~~~~~~~y~C~~C~~~ 90 (112)
T TIGR00622 15 PVECPICGLTLILSTHLARSY--HHLFPLKAFQEIPLEEYNGSRFCFGCQGPFPKPPVSPFD--ELKDSHRYVCAVCKNV 90 (112)
T ss_pred CCcCCcCCCEEeccchHHHhh--hccCCCcccccccccccCCCCcccCcCCCCCCccccccc--ccccccceeCCCCCCc
Confidence 456666666666666665542 221 00112377788777654311100 0122346888888888
Q ss_pred cCChhHHHHHhhhccCCCCccccccc
Q psy11980 339 FTMKSSLKIHLLTHTKEPPKSCEVCG 364 (831)
Q Consensus 339 F~~~~~L~~H~~~H~~~~py~C~~C~ 364 (831)
|-..-+...|...|. |+-|.
T Consensus 91 FC~dCD~fiHe~Lh~------CPGC~ 110 (112)
T TIGR00622 91 FCVDCDVFVHESLHC------CPGCI 110 (112)
T ss_pred cccccchhhhhhccC------CcCCC
Confidence 888777778866653 66664
No 80
>TIGR02408 ectoine_ThpD ectoine hydroxylase. Both ectoine and hydroxyectoine are compatible solvents that serve as protectants against osmotic and thermal stresses. A number of genomes synthesize ectoine. This enzyme allows conversion of ectoine to hydroxyectoine, which may be more effective for some purposes, and is found in a subset of ectoine-producing organisms.
Probab=87.02 E-value=6.5 Score=41.81 Aligned_cols=43 Identities=16% Similarity=0.307 Sum_probs=37.1
Q ss_pred hhhhhHHHHHHhhhcCCeEEEcCCCChhHHHHHHHHHHhhhhC
Q psy11980 635 VHSDMIENVIRDMDQYGLCVVDNFLGPERGMAVLHEVLGMYHS 677 (831)
Q Consensus 635 ~~~~~~~~~~~~l~~~g~~~id~~l~~~~~~~l~~~~~~~~~~ 677 (831)
.+..+.++.++.++++||++|.++|+++.+.+|+.++..+...
T Consensus 13 ~~~~Lt~eqi~~f~~dGyvvl~~vls~eev~~lr~~i~~~~~~ 55 (277)
T TIGR02408 13 AAGPLSAKQLQSYERDGFLLLENLFSDDEVAALLAEVERMTRD 55 (277)
T ss_pred CCCCCCHHHHHHHHHCCEEECcccCCHHHHHHHHHHHHHHHhc
Confidence 4445667788899999999999999999999999999887654
No 81
>PLN02704 flavonol synthase
Probab=86.99 E-value=17 Score=39.88 Aligned_cols=65 Identities=15% Similarity=0.279 Sum_probs=45.1
Q ss_pred ceeEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEEEEEecCCCccccc
Q psy11980 754 GLLRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 754 GeL~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t~~~Yln~~w~~~~ 830 (831)
|-|.|+.. ...++|.|..|.|||.--|.+ +++ .+++|...+=. .+.+..++++.|++++++|+.+
T Consensus 236 ~GLQV~~~---g~Wi~V~p~pg~lvVNvGD~L---~~~------TNg~~kSt~HRVv~~~~~~R~Si~~F~~p~~d~~i 302 (335)
T PLN02704 236 QGLQVFRD---DHWFDVKYIPNALVIHIGDQI---EIL------SNGKYKSVLHRTTVNKEKTRMSWPVFLEPPSELAV 302 (335)
T ss_pred CceeEeEC---CEEEeCCCCCCeEEEEechHH---HHH------hCCeeecccceeecCCCCCeEEEEEEecCCCCceE
Confidence 33555532 357899999999999988877 555 45555433322 2234568999999999999754
No 82
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=86.86 E-value=0.52 Score=29.95 Aligned_cols=19 Identities=42% Similarity=0.989 Sum_probs=10.4
Q ss_pred ccccccccccChhHHHHHHH
Q psy11980 359 SCEVCGRAFIRQDCLIRHLR 378 (831)
Q Consensus 359 ~C~~C~~~F~~~~~L~~H~r 378 (831)
.|+.||+.| ....|.+|++
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 455666666 4455555543
No 83
>PLN02912 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=85.95 E-value=19 Score=39.69 Aligned_cols=64 Identities=16% Similarity=0.263 Sum_probs=45.8
Q ss_pred eeEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEEEEEecCCCccccc
Q psy11980 755 LLRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 755 eL~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t~~~Yln~~w~~~~ 830 (831)
-|.+..+ ...++|.|..|.|||-.-|.+ |++ .+++|...+-. .+...-++++.|++++++|+.+
T Consensus 235 GLQV~~~---g~Wi~V~p~pgalvVNiGD~L---~~~------TNG~~kSt~HRVv~~~~~~R~Sia~F~~p~~d~~i 300 (348)
T PLN02912 235 GLQVFKD---GKWIAVNPIPNTFIVNLGDQM---QVI------SNDKYKSVLHRAVVNTDKERISIPTFYCPSEDAVI 300 (348)
T ss_pred ceEEEEC---CcEEECCCcCCeEEEEcCHHH---HHH------hCCEEEcccccccCCCCCCEEEEEEEecCCCCCeE
Confidence 3555532 358899999999999988877 555 45666555444 1233458999999999999764
No 84
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=85.15 E-value=0.74 Score=29.25 Aligned_cols=19 Identities=37% Similarity=0.758 Sum_probs=10.9
Q ss_pred cCCCCCCccCChhHHHHHhh
Q psy11980 331 KCPICTKGFTMKSSLKIHLL 350 (831)
Q Consensus 331 ~C~~C~k~F~~~~~L~~H~~ 350 (831)
.|+.||+.| ....|..|+.
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 466666666 4555556554
No 85
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=84.66 E-value=1.3 Score=39.86 Aligned_cols=72 Identities=13% Similarity=0.258 Sum_probs=46.4
Q ss_pred CCccCcccccccCChhHHHHHHhhhcCCC-CccCCCCCCccCChhHHHHHhhhcc--------------CC---------
Q psy11980 300 DPYQCDICRKTFIRKEDLTRHCLLHTGVK-PHKCPICTKGFTMKSSLKIHLLTHT--------------KE--------- 355 (831)
Q Consensus 300 k~~~C~~C~k~F~~~~~L~~H~~~H~~~k-~~~C~~C~k~F~~~~~L~~H~~~H~--------------~~--------- 355 (831)
+...|..|+..... +.+..|++..+... .. ....+..-.+.-. ..
T Consensus 10 ~vlIC~~C~~av~~-~~v~~HL~~~H~~~~~~----------~~~~i~~~~~~~~~l~~~~~~~~~p~~~~~Pi~gLp~~ 78 (109)
T PF12013_consen 10 RVLICRQCQYAVQP-SEVESHLRKRHHILKSQ----------ERQRIVEAIRQWPDLLPDPDDLQIPPDPSPPIPGLPVY 78 (109)
T ss_pred CEEEeCCCCcccCc-hHHHHHHHHhcccccHH----------HHHHHHHHHHhhhhcccCccccCCCCCCCCcCCCCCCC
Confidence 45689999988766 88999998443321 10 0011111111100 00
Q ss_pred CCccc----ccccccccChhHHHHHHHHhcc
Q psy11980 356 PPKSC----EVCGRAFIRQDCLIRHLRQKHR 382 (831)
Q Consensus 356 ~py~C----~~C~~~F~~~~~L~~H~r~~H~ 382 (831)
.-|.| ..|++.+.+...|.+|++.+|+
T Consensus 79 ~G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 79 DGYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CCeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 12899 9999999999999999999985
No 86
>KOG4173|consensus
Probab=84.25 E-value=0.47 Score=46.02 Aligned_cols=89 Identities=21% Similarity=0.449 Sum_probs=48.0
Q ss_pred CccCcc--cccccccchHHHhhhhhcccCCCCccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHHHHh
Q psy11980 272 GHTCEL--CNKTFNRKARLELHMKYIHEGADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLKIHL 349 (831)
Q Consensus 272 ~~~C~~--C~k~F~~~~~L~~H~~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~ 349 (831)
.|.|++ |...|........|....|.. .|..|.+.|.+...|..|+..-|.. .++ .
T Consensus 79 ~~~cqvagc~~~~d~lD~~E~hY~~~h~~----sCs~C~r~~Pt~hLLd~HI~E~HDs----------------~Fq--a 136 (253)
T KOG4173|consen 79 AFACQVAGCCQVFDALDDYEHHYHTLHGN----SCSFCKRAFPTGHLLDAHILEWHDS----------------LFQ--A 136 (253)
T ss_pred cccccccchHHHHhhhhhHHHhhhhcccc----hhHHHHHhCCchhhhhHHHHHHHHH----------------HHH--H
Confidence 355655 556666666666665544532 3666666666666666665432210 000 0
Q ss_pred hhccCCCCccc--ccccccccChhHHHHHHHHhcc
Q psy11980 350 LTHTKEPPKSC--EVCGRAFIRQDCLIRHLRQKHR 382 (831)
Q Consensus 350 ~~H~~~~py~C--~~C~~~F~~~~~L~~H~r~~H~ 382 (831)
.+-.|..-|+| +-|+..|.+...-..|+-..|.
T Consensus 137 ~veRG~dMy~ClvEgCt~KFkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 137 LVERGQDMYQCLVEGCTEKFKTSRDRKDHMIRMHK 171 (253)
T ss_pred HHHcCccHHHHHHHhhhhhhhhhhhhhhHHHHhcc
Confidence 11223445666 3477777777777777666664
No 87
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=84.16 E-value=0.74 Score=31.67 Aligned_cols=23 Identities=22% Similarity=0.472 Sum_probs=18.9
Q ss_pred CcccccccccccChhHHHHHHHH
Q psy11980 357 PKSCEVCGRAFIRQDCLIRHLRQ 379 (831)
Q Consensus 357 py~C~~C~~~F~~~~~L~~H~r~ 379 (831)
+|.|.+|++.|.....+..|++.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 57888888888888888888763
No 88
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=82.92 E-value=0.73 Score=31.49 Aligned_cols=10 Identities=50% Similarity=1.175 Sum_probs=5.5
Q ss_pred CCcccccccc
Q psy11980 356 PPKSCEVCGR 365 (831)
Q Consensus 356 ~py~C~~C~~ 365 (831)
.++.|++||.
T Consensus 16 ~~~~CP~Cg~ 25 (33)
T cd00350 16 APWVCPVCGA 25 (33)
T ss_pred CCCcCcCCCC
Confidence 4555666653
No 89
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=82.20 E-value=0.64 Score=35.03 Aligned_cols=32 Identities=31% Similarity=0.518 Sum_probs=27.3
Q ss_pred ccCCCCcccccccccccChhHHHHHHHHhccC
Q psy11980 352 HTKEPPKSCEVCGRAFIRQDCLIRHLRQKHRS 383 (831)
Q Consensus 352 H~~~~py~C~~C~~~F~~~~~L~~H~r~~H~~ 383 (831)
-.||--+.|+-||+.|.......+|+...|+-
T Consensus 12 RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~~ 43 (65)
T COG4049 12 RDGEEFLRCPRCGMVFRRRKDYIRHVNKAHGW 43 (65)
T ss_pred cCCceeeeCCchhHHHHHhHHHHHHhhHHhhh
Confidence 45677789999999999999999999988863
No 90
>PLN02216 protein SRG1
Probab=81.90 E-value=44 Score=36.93 Aligned_cols=57 Identities=12% Similarity=0.194 Sum_probs=43.0
Q ss_pred CcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEEEEEecCCCccccc
Q psy11980 765 DKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 765 ~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t~~~Yln~~w~~~~ 830 (831)
...++|.|..|.|||-.-|.+ |++ .+++|...+=. .+...-++++.|+.++++|+.+
T Consensus 256 g~Wi~V~p~pgalvVNiGD~L---~~~------TNG~~kS~~HRVv~~~~~~R~Si~~F~~P~~d~~i 314 (357)
T PLN02216 256 GKWVSVKPLPNALVVNVGDIL---EII------TNGTYRSIEHRGVVNSEKERLSVATFHNTGMGKEI 314 (357)
T ss_pred CEEEECCCCCCeEEEEcchhh---Hhh------cCCeeeccCceeecCCCCCEEEEEEEecCCCCCeE
Confidence 357899999999999998888 666 45666554443 1334458999999999999864
No 91
>PLN02639 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=81.82 E-value=23 Score=38.80 Aligned_cols=63 Identities=16% Similarity=0.220 Sum_probs=43.9
Q ss_pred eEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEEEEEecCCCccccc
Q psy11980 756 LRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 756 L~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t~~~Yln~~w~~~~ 830 (831)
|.|..+ ...++|.|..|.|||-.-|.+ |++ .+++|...+=. ......++++.|+.++++|+.+
T Consensus 230 LQV~~~---g~Wi~V~p~pg~lVVNiGD~L---~~~------TNG~~kSt~HRVv~~~~~~R~Sia~F~~p~~d~~i 294 (337)
T PLN02639 230 LQVLKD---GKWVAVNPHPGAFVINIGDQL---QAL------SNGRYKSVWHRAVVNTDKERMSVASFLCPCDDAVI 294 (337)
T ss_pred eEeecC---CeEEeccCCCCeEEEechhHH---HHH------hCCeeeccCcccccCCCCCEEEEEEEecCCCCceE
Confidence 555532 368899999999999988887 555 45666333222 1223458999999999998754
No 92
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=81.54 E-value=2.5 Score=37.67 Aligned_cols=48 Identities=25% Similarity=0.405 Sum_probs=25.7
Q ss_pred ccccccCccccCchhHHHHHhhhcCcccCCCccccCcccccccCchhhhhhhhhc
Q psy11980 161 ECKEVCGRVFNRKDNLREHLRAHAGETKRKKKFKCDKCEKQFYGLTLLRIHERLH 215 (831)
Q Consensus 161 ~C~~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~C~~C~k~f~~~~~L~~H~~~h 215 (831)
.|- .|...|....... .++......|+|+.|...|-..=++-.|..+|
T Consensus 57 ~C~-~C~~~f~~~~~~~------~~~~~~~~~y~C~~C~~~FC~dCD~fiHe~Lh 104 (112)
T TIGR00622 57 FCF-GCQGPFPKPPVSP------FDELKDSHRYVCAVCKNVFCVDCDVFVHESLH 104 (112)
T ss_pred ccc-CcCCCCCCccccc------ccccccccceeCCCCCCccccccchhhhhhcc
Confidence 365 5777776543111 11112234678888887775555555555444
No 93
>PF01753 zf-MYND: MYND finger; InterPro: IPR002893 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents MYND-type zinc finger domains. The MYND domain (myeloid, Nervy, and DEAF-1) is present in a large group of proteins that includes RP-8 (PDCD2), Nervy, and predicted proteins from Drosophila, mammals, Caenorhabditis elegans, yeast, and plants [, , ]. The MYND domain consists of a cluster of cysteine and histidine residues, arranged with an invariant spacing to form a potential zinc-binding motif []. Mutating conserved cysteine residues in the DEAF-1 MYND domain does not abolish DNA binding, which suggests that the MYND domain might be involved in protein-protein interactions []. Indeed, the MYND domain of ETO/MTG8 interacts directly with the N-CoR and SMRT co-repressors [, ]. Aberrant recruitment of co-repressor complexes and inappropriate transcriptional repression is believed to be a general mechanism of leukemogenesis caused by the t(8;21) translocations that fuse ETO with the acute myelogenous leukemia 1 (AML1) protein. ETO has been shown to be a co-repressor recruited by the promyelocytic leukemia zinc finger (PLZF) protein []. A divergent MYND domain present in the adenovirus E1A binding protein BS69 was also shown to interact with N-CoR and mediate transcriptional repression []. The current evidence suggests that the MYND motif in mammalian proteins constitutes a protein-protein interaction domain that functions as a co-repressor-recruiting interface. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 3QWW_A 3QWV_A 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3RU0_A ....
Probab=80.76 E-value=0.17 Score=35.62 Aligned_cols=28 Identities=50% Similarity=1.200 Sum_probs=24.9
Q ss_pred cccCcCCcccccCCccccccCCCccccc
Q psy11980 428 LRCSKCKVTFYCSKQHQTLDWKAHKINC 455 (831)
Q Consensus 428 ~~C~~C~~~f~~~~~h~~~h~~~~~~~c 455 (831)
..|+.|+..++++..+|+.||+.|...|
T Consensus 10 ~~C~~C~~~~YCs~~Cq~~~w~~Hk~~C 37 (37)
T PF01753_consen 10 KRCSRCKSVYYCSEECQRADWPYHKFEC 37 (37)
T ss_dssp EEETTTSSSEESSHHHHHHHHHHHCCTH
T ss_pred CcCCCCCCEEecCHHHHHHHHHHHhhhC
Confidence 3999999999999999999998887654
No 94
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=80.50 E-value=0.93 Score=30.97 Aligned_cols=10 Identities=40% Similarity=1.228 Sum_probs=5.8
Q ss_pred CCccCCCCCC
Q psy11980 328 KPHKCPICTK 337 (831)
Q Consensus 328 k~~~C~~C~k 337 (831)
.++.|++|+.
T Consensus 16 ~~~~CP~Cg~ 25 (33)
T cd00350 16 APWVCPVCGA 25 (33)
T ss_pred CCCcCcCCCC
Confidence 4556666654
No 95
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=80.09 E-value=1.3 Score=30.49 Aligned_cols=23 Identities=30% Similarity=0.525 Sum_probs=17.3
Q ss_pred CccCCCCCCccCChhHHHHHhhh
Q psy11980 329 PHKCPICTKGFTMKSSLKIHLLT 351 (831)
Q Consensus 329 ~~~C~~C~k~F~~~~~L~~H~~~ 351 (831)
+|.|.+|++.|.+...+..|+..
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 46788888888877777777653
No 96
>PLN02904 oxidoreductase
Probab=79.96 E-value=42 Score=37.11 Aligned_cols=64 Identities=16% Similarity=0.147 Sum_probs=43.8
Q ss_pred eEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEEEEEecCCCccccc
Q psy11980 756 LRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 756 L~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t~~~Yln~~w~~~~ 830 (831)
|.|...+ ...++|.|..|.|||-.-|.+ |++ .+++|...+=. .+.+.-++++.|++++++|+.+
T Consensus 246 LQV~~~~--g~Wi~V~p~pgalVVNiGD~L---e~~------TNG~~kSt~HRVv~~~~~~R~Si~~F~~p~~d~~i 311 (357)
T PLN02904 246 LQIMDCN--KNWVCVPYIEGALIVQLGDQV---EVM------SNGIYKSVVHRVTVNKDYKRLSFASLHSLPLHKKI 311 (357)
T ss_pred eeEEeCC--CCEEECCCCCCeEEEEccHHH---HHH------hCCeeeccCCcccCCCCCCEEEEEEeecCCCCCeE
Confidence 5555432 257899999999999988877 555 45555544433 1223348899999999998754
No 97
>TIGR02466 conserved hypothetical protein. This family consists of uncharacterized proteins in Caulobacter crescentus CB15, Bdellovibrio bacteriovorus HD100, Synechococcus sp. WH 8102 (2), Silicibacter pomeroyi DSS-3 (2), and Hyphomonas neptunium ATCC 15444. The context of nearby genes differs substantially between members and does point to any specific biological role.
Probab=79.94 E-value=3.3 Score=41.60 Aligned_cols=24 Identities=33% Similarity=0.339 Sum_probs=21.4
Q ss_pred ceeecCCCCeEEEEecCCCCCccccc
Q psy11980 767 VADIEPMFDRILFFWSDRRNPHEAMV 792 (831)
Q Consensus 767 ~~~i~P~~grlV~F~S~~~~pHEVl~ 792 (831)
.+.|.|..|+||||-|-.+ |+|+|
T Consensus 161 ~~~v~P~~G~lvlFPS~L~--H~v~p 184 (201)
T TIGR02466 161 FVYVPPQEGRVLLFESWLR--HEVPP 184 (201)
T ss_pred cEEECCCCCeEEEECCCCc--eecCC
Confidence 4579999999999999986 99994
No 98
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=77.65 E-value=2.9 Score=37.48 Aligned_cols=72 Identities=14% Similarity=0.249 Sum_probs=45.8
Q ss_pred CccCcccccccccchHHHhhhhhcccCCCCccCcccccccCChhHHHHHHhhhc--------------CC---------C
Q psy11980 272 GHTCELCNKTFNRKARLELHMKYIHEGADPYQCDICRKTFIRKEDLTRHCLLHT--------------GV---------K 328 (831)
Q Consensus 272 ~~~C~~C~k~F~~~~~L~~H~~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~--------------~~---------k 328 (831)
-..|..|+..... +.+..|++..|...+... ...+..-.+.-. +. .
T Consensus 11 vlIC~~C~~av~~-~~v~~HL~~~H~~~~~~~----------~~~i~~~~~~~~~l~~~~~~~~~p~~~~~Pi~gLp~~~ 79 (109)
T PF12013_consen 11 VLICRQCQYAVQP-SEVESHLRKRHHILKSQE----------RQRIVEAIRQWPDLLPDPDDLQIPPDPSPPIPGLPVYD 79 (109)
T ss_pred EEEeCCCCcccCc-hHHHHHHHHhcccccHHH----------HHHHHHHHHhhhhcccCccccCCCCCCCCcCCCCCCCC
Confidence 3569999988776 889999986664321110 011111111100 00 2
Q ss_pred CccC----CCCCCccCChhHHHHHhhhccC
Q psy11980 329 PHKC----PICTKGFTMKSSLKIHLLTHTK 354 (831)
Q Consensus 329 ~~~C----~~C~k~F~~~~~L~~H~~~H~~ 354 (831)
-|.| ..|++.+.+...+..|++.++|
T Consensus 80 G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 80 GYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 2889 9999999999999999988765
No 99
>PLN02984 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=77.55 E-value=4 Score=44.77 Aligned_cols=66 Identities=14% Similarity=0.332 Sum_probs=45.7
Q ss_pred CCceeEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC---CCCCCceEEEEEecCCCccc
Q psy11980 752 NGGLLRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN---PNRDGRCITAIYYLNRDWDI 828 (831)
Q Consensus 752 ~GGeL~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~---~~~~~R~~t~~~Yln~~w~~ 828 (831)
-|| |.|..+ ...++|.|..|.|||..-|.+ |++ .+++|...+-. ++....++++.|++++++|+
T Consensus 236 v~G-LQV~~~---g~Wv~V~p~pgalVVNiGD~L---e~w------TNg~~kSt~HRVv~~~~~~~R~Sia~F~~P~~d~ 302 (341)
T PLN02984 236 VGG-LEVMKD---GEWFNVKPIANTLVVNLGDMM---QVI------SDDEYKSVLHRVGKRNKKKERYSICYFVFPEEDC 302 (341)
T ss_pred CCC-eeEeeC---CceEECCCCCCeEEEECChhh---hhh------cCCeeeCCCCccccCCCCCCeEEEEEEecCCCCC
Confidence 344 566532 358999999999999988887 555 34555333222 23445699999999999987
Q ss_pred cc
Q psy11980 829 KV 830 (831)
Q Consensus 829 ~~ 830 (831)
.+
T Consensus 303 ~i 304 (341)
T PLN02984 303 VI 304 (341)
T ss_pred EE
Confidence 54
No 100
>PLN00052 prolyl 4-hydroxylase; Provisional
Probab=76.19 E-value=1.8 Score=46.51 Aligned_cols=34 Identities=26% Similarity=0.414 Sum_probs=28.7
Q ss_pred ccccccCCCCceeEEeecCCC-------CCCceEEEEEecCC
Q psy11980 790 AMVACYPGHGSHYVKHVDNPN-------RDGRCITAIYYLNR 824 (831)
Q Consensus 790 Vl~a~yp~~~~ry~~h~d~~~-------~~~R~~t~~~Yln~ 824 (831)
.+...|.. |..|..|.|.+. ...|..|+++|||+
T Consensus 134 lQVlrY~~-Gq~Y~~H~D~~~~~~~~~~gg~R~aTvL~YLnd 174 (310)
T PLN00052 134 IQILRYEH-GQKYEPHFDYFHDKINQALGGHRYATVLMYLST 174 (310)
T ss_pred eEEEecCC-CCCCCCCCCccccccccccCCceeEEEEEEecc
Confidence 67789995 999999999753 34699999999996
No 101
>PLN02758 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=75.18 E-value=87 Score=34.69 Aligned_cols=56 Identities=9% Similarity=0.294 Sum_probs=41.4
Q ss_pred cceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEEEEEecCCCccccc
Q psy11980 766 KVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 766 ~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t~~~Yln~~w~~~~ 830 (831)
..++|.|..|.|||..-|.+ |++ .+++|...+=. .+...-++++.|++++++|+.+
T Consensus 259 ~Wi~V~p~pgalVVNiGD~L---~~~------SNG~~kS~~HRVv~~~~~~R~Sia~F~~P~~d~~i 316 (361)
T PLN02758 259 TWVPVHPVPNALVINIGDTL---EVL------TNGKYKSVEHRAVTNKEKDRLSIVTFYAPSYEVEL 316 (361)
T ss_pred EEEeCCCCCCeEEEEccchh---hhh------cCCeeecccceeecCCCCCEEEEEEEecCCCCCeE
Confidence 57899999999999999888 666 45666544333 1223348999999999998754
No 102
>PF13759 2OG-FeII_Oxy_5: Putative 2OG-Fe(II) oxygenase; PDB: 3BVC_B 2RG4_A.
Probab=73.82 E-value=8 Score=34.04 Aligned_cols=27 Identities=19% Similarity=0.047 Sum_probs=17.7
Q ss_pred CcceeecCCCCeEEEEecCCCCCcccccc
Q psy11980 765 DKVADIEPMFDRILFFWSDRRNPHEAMVA 793 (831)
Q Consensus 765 ~~~~~i~P~~grlV~F~S~~~~pHEVl~a 793 (831)
.....|.|..|.||||=|-.. |.|.|-
T Consensus 63 ~~~~~~~p~~G~lvlFPs~l~--H~v~p~ 89 (101)
T PF13759_consen 63 SPYYIVEPEEGDLVLFPSWLW--HGVPPN 89 (101)
T ss_dssp -SEEEE---TTEEEEEETTSE--EEE---
T ss_pred CceEEeCCCCCEEEEeCCCCE--EeccCc
Confidence 445689999999999999886 999844
No 103
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=71.04 E-value=2.3 Score=42.86 Aligned_cols=35 Identities=29% Similarity=0.605 Sum_probs=27.3
Q ss_pred CCCcccccccccccChhHHHHHHHHhccCchhHHH
Q psy11980 355 EPPKSCEVCGRAFIRQDCLIRHLRQKHRSHLDEVL 389 (831)
Q Consensus 355 ~~py~C~~C~~~F~~~~~L~~H~r~~H~~~~~~~~ 389 (831)
+..|.|+.|+|.|.-..-..+|+...|.++..++.
T Consensus 75 ~~K~~C~lc~KlFkg~eFV~KHI~nKH~e~ve~~~ 109 (214)
T PF04959_consen 75 EDKWRCPLCGKLFKGPEFVRKHIFNKHPEKVEEVK 109 (214)
T ss_dssp SEEEEE-SSS-EESSHHHHHHHHHHH-HHHHHHHH
T ss_pred CCEECCCCCCcccCChHHHHHHHhhcCHHHHHHHH
Confidence 44699999999999999999999999987766543
No 104
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=69.35 E-value=2.7 Score=28.92 Aligned_cols=10 Identities=40% Similarity=1.358 Sum_probs=5.5
Q ss_pred CCcccccccc
Q psy11980 356 PPKSCEVCGR 365 (831)
Q Consensus 356 ~py~C~~C~~ 365 (831)
.|..|++||.
T Consensus 17 ~p~~CP~Cg~ 26 (34)
T cd00729 17 APEKCPICGA 26 (34)
T ss_pred CCCcCcCCCC
Confidence 3555666654
No 105
>PTZ00273 oxidase reductase; Provisional
Probab=68.86 E-value=5.2 Score=43.51 Aligned_cols=64 Identities=11% Similarity=0.146 Sum_probs=40.1
Q ss_pred eEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecCC-CCCCceEEEEEecCCCccccc
Q psy11980 756 LRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDNP-NRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 756 L~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~~-~~~~R~~t~~~Yln~~w~~~~ 830 (831)
|.|...+ ...++|.|..|.|||..-|.+ |++ .+++|...+=.- ....-++++.|++++++|+.+
T Consensus 217 LqV~~~~--g~Wi~V~p~pg~lvVNvGD~l---~~~------TnG~~kSt~HRVv~~~~~R~Si~~F~~p~~d~~i 281 (320)
T PTZ00273 217 LQVRNLS--GEWMDVPPLEGSFVVNIGDMM---EMW------SNGRYRSTPHRVVNTGVERYSMPFFCEPNPNVII 281 (320)
T ss_pred eEEECCC--CCEEeCCCCCCeEEEEHHHHH---HHH------HCCeeeCCCccccCCCCCeEEEEEEEcCCCCceE
Confidence 6665432 257899999999999866655 333 233333222210 111348999999999999754
No 106
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=68.77 E-value=1.7 Score=44.16 Aligned_cols=15 Identities=40% Similarity=0.831 Sum_probs=7.7
Q ss_pred CccCcccccccccch
Q psy11980 272 GHTCELCNKTFNRKA 286 (831)
Q Consensus 272 ~~~C~~C~k~F~~~~ 286 (831)
.+.|++|++.|.++.
T Consensus 5 ~~~CPvC~~~F~~~~ 19 (214)
T PF09986_consen 5 KITCPVCGKEFKTKK 19 (214)
T ss_pred ceECCCCCCeeeeeE
Confidence 345555555555543
No 107
>PRK04023 DNA polymerase II large subunit; Validated
Probab=68.68 E-value=4.3 Score=49.17 Aligned_cols=53 Identities=19% Similarity=0.510 Sum_probs=35.2
Q ss_pred ccCCcccCcccccccCchHHHHHhhhccccccccccccccCcccCChhHHHHHHHHhcCCCCCCcccccCccccccCChh
Q psy11980 39 TVPGHLCKGCYRIFTSQGYMLLSYKSKEEEKQICTYIKVCGRVFNRKDNLREHLRAHAGETKRKKKFKCDKCEKQFYGLT 118 (831)
Q Consensus 39 ~~~~~~C~~C~~~F~~~~~l~~H~~~~~~~~~~~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~k~~~C~~C~k~f~~~~ 118 (831)
++....|+.||... .-+.|+.||.. + + ..+.|+.|+...
T Consensus 623 EVg~RfCpsCG~~t------------------~~frCP~CG~~--------------T-e----~i~fCP~CG~~~---- 661 (1121)
T PRK04023 623 EIGRRKCPSCGKET------------------FYRRCPFCGTH--------------T-E----PVYRCPRCGIEV---- 661 (1121)
T ss_pred cccCccCCCCCCcC------------------CcccCCCCCCC--------------C-C----cceeCccccCcC----
Confidence 33446799999863 11589999987 1 1 457888886543
Q ss_pred HHHHHHHHhcCCccccCcCCCcCCC
Q psy11980 119 LLRIHERLHAEKLYECDICKNRFPS 143 (831)
Q Consensus 119 ~L~~H~~~H~~k~~~C~~C~k~F~~ 143 (831)
.++.|+.|+..-..
T Consensus 662 -----------~~y~CPKCG~El~~ 675 (1121)
T PRK04023 662 -----------EEDECEKCGREPTP 675 (1121)
T ss_pred -----------CCCcCCCCCCCCCc
Confidence 24778888865443
No 108
>PLN03002 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=67.35 E-value=6.2 Score=43.17 Aligned_cols=72 Identities=14% Similarity=0.187 Sum_probs=44.6
Q ss_pred ccCCCceeEeccCCC--CCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC-CCCCCceEEEEEecCCC
Q psy11980 749 TKQNGGLLRIFPEGG--GDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN-PNRDGRCITAIYYLNRD 825 (831)
Q Consensus 749 ~~~~GGeL~ly~~~~--~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~-~~~~~R~~t~~~Yln~~ 825 (831)
+.+-|| |.++..+. +.+.++|.|..|.+||-.-|.+ +++ .+++|...+=. ......++++.|+.+++
T Consensus 216 qd~v~G-LQV~~~~~~~~g~Wi~Vpp~pg~~VVNiGD~L---~~w------Tng~~kSt~HRVv~~~~~R~Sia~F~~p~ 285 (332)
T PLN03002 216 TDGVMG-LQICKDKNAMPQKWEYVPPIKGAFIVNLGDML---ERW------SNGFFKSTLHRVLGNGQERYSIPFFVEPN 285 (332)
T ss_pred eCCCCc-eEEecCCCCCCCcEEECCCCCCeEEEEHHHHH---HHH------hCCeeECcCCeecCCCCCeeEEEEEecCC
Confidence 333344 67765431 3468999999999999876665 444 23344333222 01123588999999999
Q ss_pred ccccc
Q psy11980 826 WDIKV 830 (831)
Q Consensus 826 w~~~~ 830 (831)
+|+.+
T Consensus 286 ~d~~i 290 (332)
T PLN03002 286 HDCLV 290 (332)
T ss_pred CCeeE
Confidence 98753
No 109
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=67.33 E-value=1.3 Score=44.95 Aligned_cols=13 Identities=31% Similarity=0.616 Sum_probs=10.4
Q ss_pred cccccccccccCh
Q psy11980 358 KSCEVCGRAFIRQ 370 (831)
Q Consensus 358 y~C~~C~~~F~~~ 370 (831)
..|+.||++|...
T Consensus 49 ~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 49 WVCPHCGYAAFEE 61 (214)
T ss_pred EECCCCCCccccc
Confidence 4699999988765
No 110
>PLN02254 gibberellin 3-beta-dioxygenase
Probab=67.29 E-value=5.9 Score=43.78 Aligned_cols=66 Identities=15% Similarity=0.290 Sum_probs=46.8
Q ss_pred ceeEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEEEEEecCCCccccc
Q psy11980 754 GLLRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 754 GeL~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t~~~Yln~~w~~~~ 830 (831)
|-|.|+..+ ...++|.|..|.|||-.-|.+ |++ .+++|...+=. .+...-++++.|++++++|+.+
T Consensus 247 ~GLQV~~~~--~~Wi~V~p~pgalVVNiGD~l---q~~------SNg~~kS~~HRVv~~~~~~R~Sia~F~~P~~d~~i 314 (358)
T PLN02254 247 SGLQVFREG--VGWVTVPPVPGSLVVNVGDLL---HIL------SNGRFPSVLHRAVVNKTRHRISVAYFYGPPSDVQI 314 (358)
T ss_pred CCceEECCC--CEEEEcccCCCCEEEEhHHHH---HHH------hCCeeccccceeecCCCCCEEEEEEEecCCCCcEE
Confidence 346776543 258999999999999988877 565 34555544333 1234458999999999999764
No 111
>PLN03001 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=65.98 E-value=7.1 Score=41.11 Aligned_cols=75 Identities=16% Similarity=0.293 Sum_probs=48.1
Q ss_pred ceeEEecCcccCCCceeEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEE
Q psy11980 740 MGDFVINGRTKQNGGLLRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCIT 817 (831)
Q Consensus 740 ~~s~v~~~W~~~~GGeL~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t 817 (831)
.+|.++ +.+-|| |.|..++ ..++|.|+.|.|||-.-|.+ +++ .+++|...+=. -.....+++
T Consensus 143 ~lTlL~---qd~v~G-LqV~~~g---~Wi~V~p~p~a~vVNiGD~l---~~~------tng~~~S~~HRVv~~~~~~R~S 206 (262)
T PLN03001 143 AITLLI---QDDVEG-LQLLKDA---EWLMVPPISDAILIIIADQT---EII------TNGNYKSAQHRAIANANKARLS 206 (262)
T ss_pred eeEEEE---eCCCCc-eEEeeCC---eEEECCCCCCcEEEEccHHH---HHH------hCCccccccceEEcCCCCCEEE
Confidence 455544 223344 6776432 58999999999999988877 555 34455332222 123345889
Q ss_pred EEEecCCCccccc
Q psy11980 818 AIYYLNRDWDIKV 830 (831)
Q Consensus 818 ~~~Yln~~w~~~~ 830 (831)
+.|+++++.|+.+
T Consensus 207 ia~F~~p~~d~~i 219 (262)
T PLN03001 207 VATFHDPAKTAKI 219 (262)
T ss_pred EEEEEcCCCCCEE
Confidence 9999999988743
No 112
>PF03171 2OG-FeII_Oxy: 2OG-Fe(II) oxygenase superfamily Entry for Lysyl hydrolases This Prosite entry is a sub-family of the Pfam entry; InterPro: IPR005123 This domain is found in members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily [], as well as the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity (1.14.11.2 from EC) catalysing the reaction: Procollagen L-proline + 2-oxoglutarate + O2 = procollagen trans-4-hydroxy-L-proline + succinate + CO2. The full enzyme consists of a alpha2 beta2 complex with the alpha subunit contributing most of the parts of the active site []. The family also includes lysyl hydrolases, isopenicillin synthases and AlkB. ; GO: 0016491 oxidoreductase activity, 0016706 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors, 0055114 oxidation-reduction process; PDB: 3ON7_D 1BK0_A 1IPS_B 1QIQ_A 1OC1_A 2Y86_A 2Y60_A 1W03_A 2VE1_A 1QJF_A ....
Probab=64.50 E-value=9.6 Score=33.09 Aligned_cols=62 Identities=16% Similarity=0.338 Sum_probs=37.6
Q ss_pred cceeEEecCcccCCCceeEeccCCCCCcceeecCCCCeEEEEecCCC----------CCccccccccCCCCceeEEeecC
Q psy11980 739 RMGDFVINGRTKQNGGLLRIFPEGGGDKVADIEPMFDRILFFWSDRR----------NPHEAMVACYPGHGSHYVKHVDN 808 (831)
Q Consensus 739 R~~s~v~~~W~~~~GGeL~ly~~~~~~~~~~i~P~~grlV~F~S~~~----------~pHEVl~a~yp~~~~ry~~h~d~ 808 (831)
+.+|+++. .+||.|.++..+ ..++|.|..+.++++..+.+ .+|+|. +
T Consensus 27 ~~~Til~~----~~~~gL~~~~~~---~~~~v~~~~~~~~v~~G~~l~~~t~g~~~~~~HrV~----~------------ 83 (98)
T PF03171_consen 27 GLLTILFQ----DEVGGLQVRDDG---EWVDVPPPPGGFIVNFGDALEILTNGRYPATLHRVV----P------------ 83 (98)
T ss_dssp SSEEEEEE----TSTS-EEEEETT---EEEE----TTCEEEEEBHHHHHHTTTSS----EEEE-----------------
T ss_pred CeEEEEec----ccchheeccccc---cccCccCccceeeeeceeeeecccCCccCCceeeeE----c------------
Confidence 67777775 789999998665 46788888888888887733 125555 2
Q ss_pred CCCCCceEEEEEecCC
Q psy11980 809 PNRDGRCITAIYYLNR 824 (831)
Q Consensus 809 ~~~~~R~~t~~~Yln~ 824 (831)
...+++++++|.+||
T Consensus 84 -~~~~~R~s~~~f~~p 98 (98)
T PF03171_consen 84 -PTEGERYSLTFFLRP 98 (98)
T ss_dssp --STS-EEEEEEEEE-
T ss_pred -CCCCCEEEEEEEECC
Confidence 246777888887775
No 113
>PF12851 Tet_JBP: Oxygenase domain of the 2OGFeDO superfamily ; InterPro: IPR024779 TETs are 2OG- and Fe(II)-dependent oxygenases that catalyse the conversion of 5 methyl-Cytosine (5-MC) to 5-hydroxymethyl-cytosine (hmC) in cultured cells and in vitro []. Interestingly TET2 is considered as an oncogene, as it is found mutated in some types of cancer []. This entry represents the double-stranded beta helix (DSBH) fold of the 2-oxoglutarate (2OG) - Fe(II) oxygenases. DSBH comprises a part of the catalytic domain in TETS. It is found in many organisms including fruit fly, African malaria mosquito, zebrafish, mouse and human.
Probab=64.37 E-value=14 Score=36.16 Aligned_cols=58 Identities=19% Similarity=0.254 Sum_probs=42.8
Q ss_pred CCCceeEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecCC-CCCCceEEEEEec
Q psy11980 751 QNGGLLRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDNP-NRDGRCITAIYYL 822 (831)
Q Consensus 751 ~~GGeL~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~~-~~~~R~~t~~~Yl 822 (831)
.+||.|.|=.-+.+-.-+.|.|..|.+|+|.+... .|-|-|.- +| ..++.+++.+||.
T Consensus 111 ~~~g~l~lp~~~~~~~g~~~~~~~GtVl~~~~~~~-~Hgvtpv~-------------~~~~~~~~R~slvfy~ 169 (171)
T PF12851_consen 111 YDGGRLELPGLDPNILGVAFAYQPGTVLIFCAKRE-LHGVTPVE-------------SPNRNHGTRISLVFYQ 169 (171)
T ss_pred ccCceEeccccccccCCEEEecCCCcEEEEcccce-eeecCccc-------------CCCCCCCeEEEEEEEe
Confidence 77888888442222234789999999999999988 89998432 23 2457899999996
No 114
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=62.87 E-value=2.8 Score=31.75 Aligned_cols=28 Identities=29% Similarity=0.521 Sum_probs=15.3
Q ss_pred ccCCccCcccccccccchHHHhhhhhcc
Q psy11980 269 GENGHTCELCNKTFNRKARLELHMKYIH 296 (831)
Q Consensus 269 ~~k~~~C~~C~k~F~~~~~L~~H~~~~H 296 (831)
||.-+.|+.|+..|.......+|+...|
T Consensus 14 GE~~lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 14 GEEFLRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred CceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence 4444555555555555555555555444
No 115
>PLN02947 oxidoreductase
Probab=62.70 E-value=14 Score=41.09 Aligned_cols=65 Identities=15% Similarity=0.172 Sum_probs=48.7
Q ss_pred ceeEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecCC--CCCCceEEEEEecCCCccccc
Q psy11980 754 GLLRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDNP--NRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 754 GeL~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~~--~~~~R~~t~~~Yln~~w~~~~ 830 (831)
|-|.+..+ ...++|.|..|.|||-.-|.+ +++ .+++|...+=.. +...-++++.|+++++||+.+
T Consensus 262 ~GLQV~~~---g~Wi~V~p~pga~VVNvGD~L---q~~------SNG~~kS~~HRVv~~~~~~R~Sia~F~~P~~d~~i 328 (374)
T PLN02947 262 EGLQIMHA---GRWVTVEPIPGSFVVNVGDHL---EIF------SNGRYKSVLHRVRVNSTKPRISVASLHSLPFERVV 328 (374)
T ss_pred CCeeEeEC---CEEEeCCCCCCeEEEEeCcee---eee------eCCEEeccccccccCCCCCEEEEEEEecCCCCCEE
Confidence 34666542 358999999999999999988 676 567776665542 334458999999999999754
No 116
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=62.63 E-value=4.7 Score=25.92 Aligned_cols=10 Identities=40% Similarity=0.883 Sum_probs=5.3
Q ss_pred cccccccccc
Q psy11980 359 SCEVCGRAFI 368 (831)
Q Consensus 359 ~C~~C~~~F~ 368 (831)
.|+.||+.|.
T Consensus 16 ~Cp~CG~~F~ 25 (26)
T PF10571_consen 16 FCPHCGYDFE 25 (26)
T ss_pred cCCCCCCCCc
Confidence 4555555553
No 117
>PLN03178 leucoanthocyanidin dioxygenase; Provisional
Probab=62.47 E-value=8.6 Score=42.58 Aligned_cols=64 Identities=16% Similarity=0.135 Sum_probs=44.2
Q ss_pred ceeEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEEEEEecCCCcccc
Q psy11980 754 GLLRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCITAIYYLNRDWDIK 829 (831)
Q Consensus 754 GeL~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t~~~Yln~~w~~~ 829 (831)
|-|.|+.+ ...++|.|..|.|||-.-|.+ |++ .+++|...+=. .+....++++.|++++++|+.
T Consensus 248 ~GLQV~~~---g~Wi~V~p~pg~lvVNiGD~L---~~~------TNG~~kSt~HRVv~~~~~~R~Si~~F~~P~~d~~ 313 (360)
T PLN03178 248 PGLQVLYE---GKWVTAKCVPDSIVVHIGDTL---EIL------SNGRYKSILHRGLVNKEKVRISWAVFCEPPKEKI 313 (360)
T ss_pred CceeEeEC---CEEEEcCCCCCeEEEEccHHH---HHH------hCCccccccceeecCCCCCeEEEEEEecCCcccc
Confidence 34666542 358999999999999888876 555 34555443332 123445899999999999864
No 118
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=62.38 E-value=5 Score=29.36 Aligned_cols=26 Identities=35% Similarity=0.731 Sum_probs=13.0
Q ss_pred CCcccccccccccCh----hHHHHHHHHhc
Q psy11980 356 PPKSCEVCGRAFIRQ----DCLIRHLRQKH 381 (831)
Q Consensus 356 ~py~C~~C~~~F~~~----~~L~~H~r~~H 381 (831)
.-..|..|++.+... +.|.+|++..|
T Consensus 15 ~~a~C~~C~~~~~~~~~~ts~l~~HL~~~h 44 (45)
T PF02892_consen 15 KKAKCKYCGKVIKYSSGGTSNLKRHLKKKH 44 (45)
T ss_dssp S-EEETTTTEE-----SSTHHHHHHHHHTT
T ss_pred CeEEeCCCCeEEeeCCCcHHHHHHhhhhhC
Confidence 345666666666553 66777765544
No 119
>PLN02515 naringenin,2-oxoglutarate 3-dioxygenase
Probab=61.96 E-value=14 Score=40.76 Aligned_cols=66 Identities=14% Similarity=0.186 Sum_probs=47.9
Q ss_pred eeEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEEEEEecCCCccccc
Q psy11980 755 LLRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 755 eL~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t~~~Yln~~w~~~~ 830 (831)
-|.|...+ +...++|.|..|.|||-.-|.+ |++ .+++|...+=. .+...-++++.|+.++++|+.+
T Consensus 233 GLQV~~~~-~~~Wi~Vpp~pgalVVNiGD~L---~~~------TNG~~kSt~HRVv~~~~~~R~Si~~F~~P~~d~~i 300 (358)
T PLN02515 233 GLQATRDG-GKTWITVQPVEGAFVVNLGDHG---HYL------SNGRFKNADHQAVVNSNCSRLSIATFQNPAPDATV 300 (358)
T ss_pred ceEEEECC-CCeEEECCCCCCeEEEEccHHH---HHH------hCCeeeeecceEECCCCCCEEEEEEEecCCCCCEE
Confidence 35665432 2358999999999999988887 666 46666655544 2334569999999999999754
No 120
>PLN02276 gibberellin 20-oxidase
Probab=61.90 E-value=7.8 Score=42.91 Aligned_cols=63 Identities=17% Similarity=0.258 Sum_probs=42.2
Q ss_pred eEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEEEEEecCCCccccc
Q psy11980 756 LRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 756 L~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t~~~Yln~~w~~~~ 830 (831)
|.+..+ ...++|.|..|.|||..-|.+ +++ .+++|...+=. .+...-++++.|++++++|+.+
T Consensus 245 LQV~~~---g~Wi~V~p~pgalVVNiGD~L---~~~------TNG~~kSt~HRVv~~~~~~R~Sia~F~~P~~d~~i 309 (361)
T PLN02276 245 LQVFVD---NKWRSVRPRPGALVVNIGDTF---MAL------SNGRYKSCLHRAVVNSERERRSLAFFLCPKEDKVV 309 (361)
T ss_pred eEEEEC---CEEEEcCCCCCeEEEEcHHHH---HHH------hCCccccccceeecCCCCCEEEEEEEecCCCCCEE
Confidence 555532 357899999999999987766 444 34444322221 1223458999999999999754
No 121
>PLN02485 oxidoreductase
Probab=60.77 E-value=9.5 Score=41.68 Aligned_cols=66 Identities=11% Similarity=0.116 Sum_probs=41.4
Q ss_pred ceeEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEEEEEecCCCccccc
Q psy11980 754 GLLRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 754 GeL~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t~~~Yln~~w~~~~ 830 (831)
|-|.+...+ ...++|.|..|.|||-.-|.+ +++ .+++|...+=. ......++++.|++++++|+.+
T Consensus 226 ~GLqV~~~~--g~Wi~V~p~pg~~vVNiGD~L---~~~------TnG~~~St~HRVv~~~~~~R~Si~~F~~p~~d~~i 293 (329)
T PLN02485 226 TALQVRNLS--GEWIWAIPIPGTFVCNIGDML---KIW------SNGVYQSTLHRVINNSPKYRVCVAFFYETNFDAAV 293 (329)
T ss_pred CeeeEEcCC--CcEEECCCCCCcEEEEhHHHH---HHH------HCCEeeCCCceecCCCCCCeEEEEEEecCCCCcee
Confidence 345665432 257899999999999866665 333 23333222221 1122348999999999999754
No 122
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=60.66 E-value=5.5 Score=35.55 Aligned_cols=30 Identities=30% Similarity=0.723 Sum_probs=23.2
Q ss_pred ccCCCCCCccCChhHHHHHhhhccCCCCcccccccccccCh
Q psy11980 330 HKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAFIRQ 370 (831)
Q Consensus 330 ~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~F~~~ 370 (831)
..|+.||++|-.. +..|-.|+.||..|.-.
T Consensus 10 R~Cp~CG~kFYDL-----------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCPSCGAKFYDL-----------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred ccCCCCcchhccC-----------CCCCccCCCCCCccCcc
Confidence 5788899888653 34688899999888866
No 123
>PLN00417 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=60.19 E-value=11 Score=41.54 Aligned_cols=56 Identities=23% Similarity=0.209 Sum_probs=40.3
Q ss_pred cceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEEEEEecCCCccccc
Q psy11980 766 KVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 766 ~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t~~~Yln~~w~~~~ 830 (831)
..++|.|..|.|||-.-|.+ |++ .+++|...+=. .....-++++.|+.++++|+.+
T Consensus 250 ~Wi~V~p~pg~lVVNiGD~L---e~~------Tng~~kSt~HRVv~~~~~~R~Si~fF~~P~~d~~i 307 (348)
T PLN00417 250 KWYKAPIVPDTILINVGDQM---EIM------SNGIYKSPVHRVVTNREKERISVATFCIPGADKEI 307 (348)
T ss_pred eEEECCCCCCcEEEEcChHH---HHH------hCCeecccceEEecCCCCCEEEEEEEecCCCCcee
Confidence 57899999999999988777 444 34555444432 1223458999999999999765
No 124
>KOG1591|consensus
Probab=59.39 E-value=45 Score=35.54 Aligned_cols=37 Identities=30% Similarity=0.302 Sum_probs=30.7
Q ss_pred ccccccCCCCceeEEeecCCC-----------CCCceEEEEEecCCCcc
Q psy11980 790 AMVACYPGHGSHYVKHVDNPN-----------RDGRCITAIYYLNRDWD 827 (831)
Q Consensus 790 Vl~a~yp~~~~ry~~h~d~~~-----------~~~R~~t~~~Yln~~w~ 827 (831)
.+.+.|.- |..|..|.|.+. ...|.-|+|+||.+.|.
T Consensus 178 lqVlnYg~-Gg~Y~~H~D~~~~~~~~~~~~~~~g~RiaT~l~yls~v~~ 225 (289)
T KOG1591|consen 178 LQVLNYGL-GGHYEPHYDYFLPEEDETFNGLNGGNRIATVLMYLSDVEQ 225 (289)
T ss_pred ceEEEecC-CccccccccccccccchhhhhcccCCcceeEEEEecccCC
Confidence 56778985 999999999852 33699999999999885
No 125
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=59.07 E-value=5.7 Score=38.28 Aligned_cols=11 Identities=45% Similarity=1.295 Sum_probs=6.0
Q ss_pred CCCCccccccc
Q psy11980 354 KEPPKSCEVCG 364 (831)
Q Consensus 354 ~~~py~C~~C~ 364 (831)
|+.|-+||+||
T Consensus 146 ge~P~~CPiCg 156 (166)
T COG1592 146 GEAPEVCPICG 156 (166)
T ss_pred CCCCCcCCCCC
Confidence 34555555555
No 126
>PLN02156 gibberellin 2-beta-dioxygenase
Probab=59.06 E-value=13 Score=40.73 Aligned_cols=64 Identities=17% Similarity=0.285 Sum_probs=45.0
Q ss_pred eEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEEEEEecCCCccccc
Q psy11980 756 LRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 756 L~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t~~~Yln~~w~~~~ 830 (831)
|.+...+ ...++|.|..|.+||-.-|.+ +++ .+++|...+=. .+...-++++.|++.++.|+.+
T Consensus 219 LQV~~~~--g~Wi~Vpp~pga~VVNiGD~l---~~w------TNg~~kSt~HRVv~~~~~~R~SiafF~~P~~d~~i 284 (335)
T PLN02156 219 LQICVKD--GTWVDVPPDHSSFFVLVGDTL---QVM------TNGRFKSVKHRVVTNTKRSRISMIYFAGPPLSEKI 284 (335)
T ss_pred eEEEeCC--CCEEEccCCCCcEEEEhHHHH---HHH------hCCeeeccceeeecCCCCCEEEEEEeecCCCCCEE
Confidence 6665322 258999999999999988887 666 45566544443 2223348999999999998754
No 127
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=58.67 E-value=2.8 Score=39.94 Aligned_cols=12 Identities=33% Similarity=1.057 Sum_probs=6.2
Q ss_pred cccccccccccC
Q psy11980 358 KSCEVCGRAFIR 369 (831)
Q Consensus 358 y~C~~C~~~F~~ 369 (831)
|+|+.||++|.+
T Consensus 29 ~~c~~c~~~f~~ 40 (154)
T PRK00464 29 RECLACGKRFTT 40 (154)
T ss_pred eeccccCCcceE
Confidence 455555555544
No 128
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=58.63 E-value=6.3 Score=47.26 Aligned_cols=41 Identities=27% Similarity=0.655 Sum_probs=29.1
Q ss_pred CccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHHHHhhhccCCCCccccccccc
Q psy11980 301 PYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRA 366 (831)
Q Consensus 301 ~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~ 366 (831)
.++|+.|+..+ ..|...+...|.+||+. ...|..|+.||-.
T Consensus 444 v~~Cp~Cd~~l----------t~H~~~~~L~CH~Cg~~---------------~~~p~~Cp~Cgs~ 484 (730)
T COG1198 444 IAECPNCDSPL----------TLHKATGQLRCHYCGYQ---------------EPIPQSCPECGSE 484 (730)
T ss_pred cccCCCCCcce----------EEecCCCeeEeCCCCCC---------------CCCCCCCCCCCCC
Confidence 35788887653 23445567899999965 2468899999965
No 129
>KOG2071|consensus
Probab=58.15 E-value=6.2 Score=45.19 Aligned_cols=24 Identities=21% Similarity=0.443 Sum_probs=17.1
Q ss_pred ccccccCcCCcccccCCccccccC
Q psy11980 425 ENLLRCSKCKVTFYCSKQHQTLDW 448 (831)
Q Consensus 425 e~~~~C~~C~~~f~~~~~h~~~h~ 448 (831)
+....|.+|+..|---..+.+--|
T Consensus 511 e~~~~C~IC~EkFe~v~d~e~~~W 534 (579)
T KOG2071|consen 511 ERQASCPICQEKFEVVFDQEEDLW 534 (579)
T ss_pred ccccCCcccccccceeecchhhhe
Confidence 778899999999976554444444
No 130
>PLN02393 leucoanthocyanidin dioxygenase like protein
Probab=56.41 E-value=17 Score=40.19 Aligned_cols=56 Identities=13% Similarity=0.248 Sum_probs=41.9
Q ss_pred cceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEEEEEecCCCccccc
Q psy11980 766 KVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 766 ~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t~~~Yln~~w~~~~ 830 (831)
..++|.|..|.|||-.-|.+ |++ .+++|...+=. .+....++++.|++++++|+.+
T Consensus 260 ~W~~V~p~pgalVVNiGD~l---~~~------Tng~~kSt~HRVv~~~~~~R~SiafF~~P~~d~~i 317 (362)
T PLN02393 260 AWITVKPVPDAFIVNIGDQI---QVL------SNAIYKSVEHRVIVNSAKERVSLAFFYNPKSDLPI 317 (362)
T ss_pred EEEECCCCCCeEEEEcchhh---Hhh------cCCeeeccceecccCCCCCEEEEEEEecCCCCceE
Confidence 57899999999999999888 666 46666443332 1233458999999999998754
No 131
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=55.16 E-value=6.6 Score=35.06 Aligned_cols=15 Identities=33% Similarity=0.755 Sum_probs=7.6
Q ss_pred CCccCcccccccCCh
Q psy11980 300 DPYQCDICRKTFIRK 314 (831)
Q Consensus 300 k~~~C~~C~k~F~~~ 314 (831)
.|..|+.||..|.-.
T Consensus 25 ~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 25 DPIVCPKCGTEFPPE 39 (108)
T ss_pred CCccCCCCCCccCcc
Confidence 345555555555443
No 132
>KOG2186|consensus
Probab=54.20 E-value=8.2 Score=39.18 Aligned_cols=45 Identities=20% Similarity=0.434 Sum_probs=19.8
Q ss_pred cccCccccccCChhHHHHHHHHhcCCccccCcCCCcCCChHHHHHHH
Q psy11980 105 FKCDKCEKQFYGLTLLRIHERLHAEKLYECDICKNRFPSSGAMKKHR 151 (831)
Q Consensus 105 ~~C~~C~k~f~~~~~L~~H~~~H~~k~~~C~~C~k~F~~~~~L~~H~ 151 (831)
|.|..||...... .+..|+...+..-|.|-.|++.|.. ..+..|.
T Consensus 4 FtCnvCgEsvKKp-~vekH~srCrn~~fSCIDC~k~F~~-~sYknH~ 48 (276)
T KOG2186|consen 4 FTCNVCGESVKKP-QVEKHMSRCRNAYFSCIDCGKTFER-VSYKNHT 48 (276)
T ss_pred Eehhhhhhhcccc-chHHHHHhccCCeeEEeeccccccc-chhhhhh
Confidence 4455554443322 2333444444444555555555544 3444443
No 133
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=53.71 E-value=7.8 Score=28.07 Aligned_cols=30 Identities=20% Similarity=0.422 Sum_probs=21.2
Q ss_pred ccccCcCCcccccCCccccccCCCcccccccCcc
Q psy11980 427 LLRCSKCKVTFYCSKQHQTLDWKAHKINCNILSN 460 (831)
Q Consensus 427 ~~~C~~C~~~f~~~~~h~~~h~~~~~~~c~~c~~ 460 (831)
.|+|..||..|.-. +.+.. ...-.|+.||.
T Consensus 5 ey~C~~Cg~~fe~~---~~~~~-~~~~~CP~Cg~ 34 (42)
T PF09723_consen 5 EYRCEECGHEFEVL---QSISE-DDPVPCPECGS 34 (42)
T ss_pred EEEeCCCCCEEEEE---EEcCC-CCCCcCCCCCC
Confidence 39999999999622 23333 56678999985
No 134
>PRK04023 DNA polymerase II large subunit; Validated
Probab=53.04 E-value=12 Score=45.59 Aligned_cols=49 Identities=27% Similarity=0.589 Sum_probs=32.1
Q ss_pred CccCcccccccccchHHHhhhhhcccCCCCccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHHHHhhh
Q psy11980 272 GHTCELCNKTFNRKARLELHMKYIHEGADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLKIHLLT 351 (831)
Q Consensus 272 ~~~C~~C~k~F~~~~~L~~H~~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~ 351 (831)
...|+.||... -.+.|+.||.. .+..+.|+.|+...
T Consensus 626 ~RfCpsCG~~t-----------------~~frCP~CG~~---------------Te~i~fCP~CG~~~------------ 661 (1121)
T PRK04023 626 RRKCPSCGKET-----------------FYRRCPFCGTH---------------TEPVYRCPRCGIEV------------ 661 (1121)
T ss_pred CccCCCCCCcC-----------------CcccCCCCCCC---------------CCcceeCccccCcC------------
Confidence 45788888652 23678888865 24457888885442
Q ss_pred ccCCCCccccccccccc
Q psy11980 352 HTKEPPKSCEVCGRAFI 368 (831)
Q Consensus 352 H~~~~py~C~~C~~~F~ 368 (831)
.+|.|+.||..-.
T Consensus 662 ----~~y~CPKCG~El~ 674 (1121)
T PRK04023 662 ----EEDECEKCGREPT 674 (1121)
T ss_pred ----CCCcCCCCCCCCC
Confidence 2477999986543
No 135
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=52.68 E-value=7.1 Score=27.42 Aligned_cols=10 Identities=30% Similarity=0.972 Sum_probs=4.3
Q ss_pred cCcccccccc
Q psy11980 274 TCELCNKTFN 283 (831)
Q Consensus 274 ~C~~C~k~F~ 283 (831)
.|+.|+..|.
T Consensus 4 ~CP~C~~~~~ 13 (38)
T TIGR02098 4 QCPNCKTSFR 13 (38)
T ss_pred ECCCCCCEEE
Confidence 3444444443
No 136
>COG3491 PcbC Isopenicillin N synthase and related dioxygenases [General function prediction only]
Probab=52.43 E-value=2.3e+02 Score=30.40 Aligned_cols=71 Identities=17% Similarity=0.297 Sum_probs=44.9
Q ss_pred cCCCceeEeccCCC-C--------CcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC---CCCCCceEE
Q psy11980 750 KQNGGLLRIFPEGG-G--------DKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN---PNRDGRCIT 817 (831)
Q Consensus 750 ~~~GGeL~ly~~~~-~--------~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~---~~~~~R~~t 817 (831)
-.|+|.|.|--.+. + ...++|.|+.|.|||+.-|.+ |.+ .+.+|..++=. |.. .=++|
T Consensus 196 HtD~G~lTLl~Qd~~~GLqv~~~~g~Wl~v~P~pgtlvVNiGdmL---e~~------Tng~lrST~HRV~~~~~-~~R~S 265 (322)
T COG3491 196 HTDYGLLTLLFQDDVGGLEVRPPNGGWLDVPPIPGTLVVNIGDML---ERW------TNGRLRSTVHRVRNPPG-VDRYS 265 (322)
T ss_pred ccCCCeEEEEEecccCCeEEecCCCCeeECCCCCCeEEEeHHHHH---HHH------hCCeeccccceeecCCC-cccee
Confidence 45677776653221 1 356799999999999988876 555 46666665554 211 23566
Q ss_pred EEEecCCCccccc
Q psy11980 818 AIYYLNRDWDIKV 830 (831)
Q Consensus 818 ~~~Yln~~w~~~~ 830 (831)
.=|++-++-|+.+
T Consensus 266 ipfF~~p~~Da~I 278 (322)
T COG3491 266 IPFFLEPNFDAEI 278 (322)
T ss_pred eeeeccCCCCccc
Confidence 6677777666544
No 137
>KOG2186|consensus
Probab=52.36 E-value=7.6 Score=39.41 Aligned_cols=50 Identities=22% Similarity=0.449 Sum_probs=36.1
Q ss_pred ccCCCCCCccCChhHHHHHhhhccCCCCcccccccccccChhHHHHHHHHhcc
Q psy11980 330 HKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAFIRQDCLIRHLRQKHR 382 (831)
Q Consensus 330 ~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~F~~~~~L~~H~r~~H~ 382 (831)
|.|..||.... +..+.+|+...++ .-|.|-.|++.|.. .+...|...-..
T Consensus 4 FtCnvCgEsvK-Kp~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kCITE 53 (276)
T KOG2186|consen 4 FTCNVCGESVK-KPQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKCITE 53 (276)
T ss_pred Eehhhhhhhcc-ccchHHHHHhccC-CeeEEeeccccccc-chhhhhhhhcch
Confidence 66777777765 3456668877777 56888888888887 677888765443
No 138
>PLN02299 1-aminocyclopropane-1-carboxylate oxidase
Probab=52.14 E-value=15 Score=39.90 Aligned_cols=63 Identities=19% Similarity=0.353 Sum_probs=43.0
Q ss_pred eEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecCC--CCCCceEEEEEecCCCccccc
Q psy11980 756 LRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDNP--NRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 756 L~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~~--~~~~R~~t~~~Yln~~w~~~~ 830 (831)
|.+..+ ...++|.|..|.|||-.-|.+ |++ .+++|...+-.- +..+-++++.|++++++|+.+
T Consensus 198 LQV~~~---g~Wi~V~p~pg~lvVNiGD~l---~~~------Tng~~kS~~HRVv~~~~~~R~Si~~F~~p~~d~~i 262 (321)
T PLN02299 198 LQLLKD---GEWVDVPPMRHSIVVNLGDQL---EVI------TNGKYKSVMHRVVAQTDGNRMSIASFYNPGSDAVI 262 (321)
T ss_pred cCcccC---CeEEECCCCCCeEEEEeCHHH---HHH------hCCceecccceeecCCCCCEEEEEEEecCCCCceE
Confidence 555432 357899999999999888877 555 345554444331 223348999999999998754
No 139
>PLN02365 2-oxoglutarate-dependent dioxygenase
Probab=51.48 E-value=18 Score=39.01 Aligned_cols=67 Identities=15% Similarity=0.153 Sum_probs=43.6
Q ss_pred ceeEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEEEEEecCCCccccc
Q psy11980 754 GLLRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 754 GeL~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t~~~Yln~~w~~~~ 830 (831)
|-|.+...+ +...++|.|..|.|||-.-|.+ |++ .+++|...+=. .....-++++.|++++++|+.+
T Consensus 187 ~GLqV~~~~-~g~Wi~V~p~pga~vVNiGD~l---~~~------TNG~~~St~HRVv~~~~~~R~Si~~F~~p~~d~~i 255 (300)
T PLN02365 187 GGLEVMDPS-SGEFVPVDPLPGTLLVNLGDVA---TAW------SNGRLCNVKHRVQCKEATMRISIASFLLGPKDDDV 255 (300)
T ss_pred CceEEEECC-CCeEEecCCCCCeEEEEhhHHH---HHH------hCCceecccceeEcCCCCCEEEEEEEecCCCCCeE
Confidence 447776431 1258999999999999887776 555 34444433332 1122348888889899988743
No 140
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=51.36 E-value=4.3 Score=38.76 Aligned_cols=14 Identities=36% Similarity=0.681 Sum_probs=8.3
Q ss_pred ccCCCCCCccCChh
Q psy11980 330 HKCPICTKGFTMKS 343 (831)
Q Consensus 330 ~~C~~C~k~F~~~~ 343 (831)
++|+.||++|.+..
T Consensus 29 ~~c~~c~~~f~~~e 42 (154)
T PRK00464 29 RECLACGKRFTTFE 42 (154)
T ss_pred eeccccCCcceEeE
Confidence 56666666666543
No 141
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=51.02 E-value=9.2 Score=28.07 Aligned_cols=11 Identities=18% Similarity=0.365 Sum_probs=5.3
Q ss_pred ccCCCCCCccC
Q psy11980 330 HKCPICTKGFT 340 (831)
Q Consensus 330 ~~C~~C~k~F~ 340 (831)
|.|..||..|.
T Consensus 3 Y~C~~Cg~~~~ 13 (44)
T smart00659 3 YICGECGRENE 13 (44)
T ss_pred EECCCCCCEee
Confidence 44555554444
No 142
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=50.69 E-value=11 Score=28.31 Aligned_cols=23 Identities=48% Similarity=0.979 Sum_probs=12.4
Q ss_pred ccccccccccCh-----hHHHHHHHHhc
Q psy11980 359 SCEVCGRAFIRQ-----DCLIRHLRQKH 381 (831)
Q Consensus 359 ~C~~C~~~F~~~-----~~L~~H~r~~H 381 (831)
.|..|++.+... ++|.+|++..|
T Consensus 20 ~C~~C~~~l~~~~~~gTs~L~rHl~~~h 47 (50)
T smart00614 20 KCKYCGKKLSRSSKGGTSNLRRHLRRKH 47 (50)
T ss_pred EecCCCCEeeeCCCCCcHHHHHHHHhHC
Confidence 455555555443 46666666443
No 143
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=50.61 E-value=12 Score=34.63 Aligned_cols=33 Identities=36% Similarity=0.538 Sum_probs=22.2
Q ss_pred cccccccccccChhHHHHHHHHhccCchhHHHhhcc
Q psy11980 358 KSCEVCGRAFIRQDCLIRHLRQKHRSHLDEVLAQGD 393 (831)
Q Consensus 358 y~C~~C~~~F~~~~~L~~H~r~~H~~~~~~~~~~~~ 393 (831)
..|-+||+.|.. |.+|++.||+..+.+.....+
T Consensus 73 i~clecGk~~k~---LkrHL~~~~gltp~eYR~kwG 105 (132)
T PF05443_consen 73 IICLECGKKFKT---LKRHLRTHHGLTPEEYRAKWG 105 (132)
T ss_dssp EE-TBT--EESB---HHHHHHHTT-S-HHHHHHHTT
T ss_pred eEEccCCcccch---HHHHHHHccCCCHHHHHHHhC
Confidence 579999998875 599999999988887765544
No 144
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=49.90 E-value=9.9 Score=29.49 Aligned_cols=8 Identities=50% Similarity=1.476 Sum_probs=3.4
Q ss_pred CccCCCCC
Q psy11980 329 PHKCPICT 336 (831)
Q Consensus 329 ~~~C~~C~ 336 (831)
+|+|+.||
T Consensus 50 ~Y~Cp~CG 57 (61)
T COG2888 50 PYRCPKCG 57 (61)
T ss_pred ceECCCcC
Confidence 34444444
No 145
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=49.27 E-value=8.9 Score=36.95 Aligned_cols=13 Identities=54% Similarity=1.142 Sum_probs=7.0
Q ss_pred ccCCCCccCcccc
Q psy11980 296 HEGADPYQCDICR 308 (831)
Q Consensus 296 H~~~k~~~C~~C~ 308 (831)
|.++.|-+||+||
T Consensus 144 ~~ge~P~~CPiCg 156 (166)
T COG1592 144 HEGEAPEVCPICG 156 (166)
T ss_pred ccCCCCCcCCCCC
Confidence 4445555555555
No 146
>PHA00626 hypothetical protein
Probab=49.15 E-value=7 Score=29.84 Aligned_cols=10 Identities=30% Similarity=0.507 Sum_probs=4.2
Q ss_pred ccCccccccc
Q psy11980 302 YQCDICRKTF 311 (831)
Q Consensus 302 ~~C~~C~k~F 311 (831)
|+|+.||..|
T Consensus 24 YkCkdCGY~f 33 (59)
T PHA00626 24 YVCCDCGYND 33 (59)
T ss_pred eEcCCCCCee
Confidence 4444444443
No 147
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=48.99 E-value=14 Score=35.64 Aligned_cols=33 Identities=24% Similarity=0.440 Sum_probs=19.5
Q ss_pred cCCCCccCCCCCCccCChhHHHHHhhhccCCCCccccccccc
Q psy11980 325 TGVKPHKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRA 366 (831)
Q Consensus 325 ~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~ 366 (831)
.+..-|.|+.|+..|+.-..+. .-|.|+.||..
T Consensus 105 ~~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~ 137 (158)
T TIGR00373 105 TNNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAM 137 (158)
T ss_pred cCCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCE
Confidence 3444566666666666655553 24667777654
No 148
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=48.64 E-value=13 Score=35.40 Aligned_cols=14 Identities=36% Similarity=0.797 Sum_probs=7.4
Q ss_pred CCccCCCCCCccCC
Q psy11980 328 KPHKCPICTKGFTM 341 (831)
Q Consensus 328 k~~~C~~C~k~F~~ 341 (831)
.-|.|+.|+..|..
T Consensus 98 ~~Y~Cp~C~~~y~~ 111 (147)
T smart00531 98 AYYKCPNCQSKYTF 111 (147)
T ss_pred cEEECcCCCCEeeH
Confidence 34555555555553
No 149
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=48.02 E-value=5.3 Score=32.71 Aligned_cols=40 Identities=30% Similarity=0.609 Sum_probs=21.1
Q ss_pred ccCCCCCCccCChhHHHHHhhhccCCCCcccc--cccccccChh
Q psy11980 330 HKCPICTKGFTMKSSLKIHLLTHTKEPPKSCE--VCGRAFIRQD 371 (831)
Q Consensus 330 ~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~--~C~~~F~~~~ 371 (831)
+.|+.||.......+-..+.. ..+.-++|. .||.+|....
T Consensus 2 m~CP~Cg~~a~irtSr~~s~~--~~~~Y~qC~N~eCg~tF~t~e 43 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITDT--TKERYHQCQNVNCSATFITYE 43 (72)
T ss_pred ccCCCCCCccEEEEChhcChh--hheeeeecCCCCCCCEEEEEE
Confidence 456666665543333322221 334456776 6777776543
No 150
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=46.56 E-value=7.1 Score=39.67 Aligned_cols=25 Identities=24% Similarity=0.369 Sum_probs=15.9
Q ss_pred CCCcccccccccccChhHHHHHHHH
Q psy11980 355 EPPKSCEVCGRAFIRQDCLIRHLRQ 379 (831)
Q Consensus 355 ~~py~C~~C~~~F~~~~~L~~H~r~ 379 (831)
.+++.|+.||+.......|..-.|+
T Consensus 207 ~k~~PCPKCg~et~eTkdLSmStR~ 231 (314)
T PF06524_consen 207 GKPIPCPKCGYETQETKDLSMSTRS 231 (314)
T ss_pred CCCCCCCCCCCcccccccceeeeec
Confidence 3677777777776666666554443
No 151
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=45.97 E-value=12 Score=26.09 Aligned_cols=12 Identities=25% Similarity=0.736 Sum_probs=5.4
Q ss_pred cCcccccccccc
Q psy11980 274 TCELCNKTFNRK 285 (831)
Q Consensus 274 ~C~~C~k~F~~~ 285 (831)
.|+.|+..|.-.
T Consensus 4 ~Cp~C~~~y~i~ 15 (36)
T PF13717_consen 4 TCPNCQAKYEID 15 (36)
T ss_pred ECCCCCCEEeCC
Confidence 344444444433
No 152
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=45.97 E-value=14 Score=33.58 Aligned_cols=32 Identities=19% Similarity=0.334 Sum_probs=22.5
Q ss_pred ccCCCCCCccCChhHHHHHhhhccCCCCcccccccccccChhH
Q psy11980 330 HKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAFIRQDC 372 (831)
Q Consensus 330 ~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~F~~~~~ 372 (831)
..|+.||++|-.. +..|-.|+.||..|.-...
T Consensus 10 r~Cp~cg~kFYDL-----------nk~p~vcP~cg~~~~~~~~ 41 (129)
T TIGR02300 10 RICPNTGSKFYDL-----------NRRPAVSPYTGEQFPPEEA 41 (129)
T ss_pred ccCCCcCcccccc-----------CCCCccCCCcCCccCcchh
Confidence 5788888888642 3467888888888766533
No 153
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=45.27 E-value=12 Score=35.99 Aligned_cols=34 Identities=15% Similarity=0.279 Sum_probs=20.8
Q ss_pred CCCCCccccccchhhhcChhhHHhhhhcccccCCccCcccccc
Q psy11980 239 HSGVKPHVCEYCEKAFIQASQLKAHMFHHTGENGHTCELCNKT 281 (831)
Q Consensus 239 h~~~k~~~C~~C~k~f~~~~~L~~H~~~H~~~k~~~C~~C~k~ 281 (831)
.....-|.|+.|+..|+....+. ..|.|+.||..
T Consensus 104 e~~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~ 137 (158)
T TIGR00373 104 ETNNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAM 137 (158)
T ss_pred ccCCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCE
Confidence 34445577777777776665553 24677777654
No 154
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=44.16 E-value=6.7 Score=27.95 Aligned_cols=11 Identities=27% Similarity=0.742 Sum_probs=5.2
Q ss_pred ccCCCCCCccC
Q psy11980 330 HKCPICTKGFT 340 (831)
Q Consensus 330 ~~C~~C~k~F~ 340 (831)
|+|+.||..|.
T Consensus 6 y~C~~Cg~~fe 16 (41)
T smart00834 6 YRCEDCGHTFE 16 (41)
T ss_pred EEcCCCCCEEE
Confidence 44444544443
No 155
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=43.53 E-value=7.1 Score=29.64 Aligned_cols=11 Identities=27% Similarity=0.854 Sum_probs=5.1
Q ss_pred ccCCCCCCccC
Q psy11980 330 HKCPICTKGFT 340 (831)
Q Consensus 330 ~~C~~C~k~F~ 340 (831)
|+|..|+..|.
T Consensus 6 y~C~~Cg~~fe 16 (52)
T TIGR02605 6 YRCTACGHRFE 16 (52)
T ss_pred EEeCCCCCEeE
Confidence 44444444444
No 156
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=43.31 E-value=17 Score=35.70 Aligned_cols=36 Identities=22% Similarity=0.460 Sum_probs=24.7
Q ss_pred cCCCCccCCCCCCccCChhHHHHHhhhccCCCCcccccccccccC
Q psy11980 325 TGVKPHKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAFIR 369 (831)
Q Consensus 325 ~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~F~~ 369 (831)
....-|.|+.|+..|+.-..+. .-|.|+.||.....
T Consensus 113 ~~~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L~~ 148 (178)
T PRK06266 113 ENNMFFFCPNCHIRFTFDEAME---------YGFRCPQCGEMLEE 148 (178)
T ss_pred cCCCEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCCee
Confidence 3445688888888887766652 35888888876543
No 157
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=41.75 E-value=26 Score=33.27 Aligned_cols=14 Identities=21% Similarity=0.719 Sum_probs=7.7
Q ss_pred cccccCccccccCC
Q psy11980 103 KKFKCDKCEKQFYG 116 (831)
Q Consensus 103 k~~~C~~C~k~f~~ 116 (831)
.-|.|+.|+..|..
T Consensus 98 ~~Y~Cp~C~~~y~~ 111 (147)
T smart00531 98 AYYKCPNCQSKYTF 111 (147)
T ss_pred cEEECcCCCCEeeH
Confidence 34556666655553
No 158
>PLN02403 aminocyclopropanecarboxylate oxidase
Probab=41.13 E-value=38 Score=36.53 Aligned_cols=63 Identities=17% Similarity=0.314 Sum_probs=41.7
Q ss_pred eEeccCCCCCcceeecCCC-CeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEEEEEecCCCccccc
Q psy11980 756 LRIFPEGGGDKVADIEPMF-DRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 756 L~ly~~~~~~~~~~i~P~~-grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t~~~Yln~~w~~~~ 830 (831)
|.+..+ ...+.|.|.. |.|||-.-|.+ +++ .+++|...+=. ......++++.|+++++.|+.+
T Consensus 193 LqV~~~---g~Wi~V~p~p~~~lvVNvGD~L---~~~------Tng~~~S~~HRVv~~~~~~R~Si~~F~~p~~d~~i 258 (303)
T PLN02403 193 LEFLKD---GKWVPIPPSKNNTIFVNTGDQL---EVL------SNGRYKSTLHRVMADKNGSRLSIATFYNPAGDAII 258 (303)
T ss_pred eEeccC---CeEEECCCCCCCEEEEEehHHH---HHH------hCCeeecccceeecCCCCCEEEEEEEEcCCCCCeE
Confidence 666432 3588999997 69999877766 444 34444443332 1234458999999999998754
No 159
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=41.05 E-value=15 Score=28.62 Aligned_cols=8 Identities=50% Similarity=1.368 Sum_probs=3.5
Q ss_pred CccCCCCC
Q psy11980 329 PHKCPICT 336 (831)
Q Consensus 329 ~~~C~~C~ 336 (831)
+|.|+.||
T Consensus 48 ~Y~CP~CG 55 (59)
T PRK14890 48 PYTCPKCG 55 (59)
T ss_pred ceECCCCC
Confidence 34444444
No 160
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=40.98 E-value=19 Score=27.06 Aligned_cols=20 Identities=40% Similarity=0.717 Sum_probs=10.2
Q ss_pred cCCCCCCccCCh-----hHHHHHhh
Q psy11980 331 KCPICTKGFTMK-----SSLKIHLL 350 (831)
Q Consensus 331 ~C~~C~k~F~~~-----~~L~~H~~ 350 (831)
.|.+|++.++.. +.|.+|++
T Consensus 20 ~C~~C~~~l~~~~~~gTs~L~rHl~ 44 (50)
T smart00614 20 KCKYCGKKLSRSSKGGTSNLRRHLR 44 (50)
T ss_pred EecCCCCEeeeCCCCCcHHHHHHHH
Confidence 455555554433 45666655
No 161
>PLN02750 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=39.85 E-value=41 Score=36.95 Aligned_cols=67 Identities=10% Similarity=0.167 Sum_probs=42.6
Q ss_pred ceeEeccCCCCCcceeecCCCCeEEEEecCCCCCccccccccCCCCceeEEeecC--CCCCCceEEEEEecCCCccccc
Q psy11980 754 GLLRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAMVACYPGHGSHYVKHVDN--PNRDGRCITAIYYLNRDWDIKV 830 (831)
Q Consensus 754 GeL~ly~~~~~~~~~~i~P~~grlV~F~S~~~~pHEVl~a~yp~~~~ry~~h~d~--~~~~~R~~t~~~Yln~~w~~~~ 830 (831)
|-|.|.... ....++|.|..|.|||-.-|.+ +++ .+++|...+=. .....-++++.|++++++|+.+
T Consensus 230 ~GLQV~~~~-~g~Wi~V~p~pg~~vVNiGD~L---~~~------Tng~~~St~HRVv~~~~~~R~Si~~F~~P~~d~~i 298 (345)
T PLN02750 230 GGLQISRRS-DGEWIPVKPIPDAFIINIGNCM---QVW------TNDLYWSAEHRVVVNSQKERFSIPFFFFPSHYVNI 298 (345)
T ss_pred CceEEeecC-CCeEEEccCCCCeEEEEhHHHH---HHH------hCCeeecccceeccCCCCCEEEEEEeecCCCCCee
Confidence 346665311 1257899999999999877766 444 34444322221 1123447999999999999754
No 162
>KOG0717|consensus
Probab=39.67 E-value=20 Score=39.92 Aligned_cols=32 Identities=28% Similarity=0.567 Sum_probs=24.3
Q ss_pred cccccccccccChhHHHHHHH-HhccCchhHHH
Q psy11980 358 KSCEVCGRAFIRQDCLIRHLR-QKHRSHLDEVL 389 (831)
Q Consensus 358 y~C~~C~~~F~~~~~L~~H~r-~~H~~~~~~~~ 389 (831)
+.|.+|.|+|.+...|..|.. ..|.+...++.
T Consensus 293 lyC~vCnKsFKseKq~kNHEnSKKHkenv~eLr 325 (508)
T KOG0717|consen 293 LYCVVCNKSFKSEKQLKNHENSKKHKENVAELR 325 (508)
T ss_pred eEEeeccccccchHHHHhhHHHHHHHHHHHHHH
Confidence 779999999999999998876 35665555554
No 163
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=39.52 E-value=17 Score=35.76 Aligned_cols=33 Identities=18% Similarity=0.487 Sum_probs=19.7
Q ss_pred CCCccccccchhhhcChhhHHhhhhcccccCCccCccccccc
Q psy11980 241 GVKPHVCEYCEKAFIQASQLKAHMFHHTGENGHTCELCNKTF 282 (831)
Q Consensus 241 ~~k~~~C~~C~k~f~~~~~L~~H~~~H~~~k~~~C~~C~k~F 282 (831)
...-|.|+.|+..|+....+. ..|.|+.||...
T Consensus 114 ~~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L 146 (178)
T PRK06266 114 NNMFFFCPNCHIRFTFDEAME---------YGFRCPQCGEML 146 (178)
T ss_pred CCCEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCC
Confidence 344577777777766655542 246777776543
No 164
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=39.50 E-value=8.9 Score=31.43 Aligned_cols=9 Identities=33% Similarity=0.955 Sum_probs=4.3
Q ss_pred cCccccccc
Q psy11980 274 TCELCNKTF 282 (831)
Q Consensus 274 ~C~~C~k~F 282 (831)
.|+.||...
T Consensus 3 ~CP~Cg~~a 11 (72)
T PRK09678 3 HCPLCQHAA 11 (72)
T ss_pred cCCCCCCcc
Confidence 455555433
No 165
>PF13824 zf-Mss51: Zinc-finger of mitochondrial splicing suppressor 51
Probab=37.92 E-value=4.4 Score=31.13 Aligned_cols=31 Identities=26% Similarity=0.649 Sum_probs=25.1
Q ss_pred ccccccCcCCcccccCCccccccCCCccccc
Q psy11980 425 ENLLRCSKCKVTFYCSKQHQTLDWKAHKINC 455 (831)
Q Consensus 425 e~~~~C~~C~~~f~~~~~h~~~h~~~~~~~c 455 (831)
...|.|+.||-.++++..|++-...+|...|
T Consensus 12 ~v~~~Cp~cGipthcS~ehw~~D~e~H~~~c 42 (55)
T PF13824_consen 12 HVNFECPDCGIPTHCSEEHWEDDYEEHRQLC 42 (55)
T ss_pred ccCCcCCCCCCcCccCHHHHHHhHHHHHHHH
Confidence 3459999999999999999888876665555
No 166
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=37.72 E-value=25 Score=32.08 Aligned_cols=30 Identities=17% Similarity=0.275 Sum_probs=20.7
Q ss_pred ccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCCh
Q psy11980 302 YQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMK 342 (831)
Q Consensus 302 ~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~ 342 (831)
..|+.||+.|-.. +..|-.|+.||..|.-.
T Consensus 10 r~Cp~cg~kFYDL-----------nk~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 10 RICPNTGSKFYDL-----------NRRPAVSPYTGEQFPPE 39 (129)
T ss_pred ccCCCcCcccccc-----------CCCCccCCCcCCccCcc
Confidence 4678888777542 34677888888877654
No 167
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=36.92 E-value=20 Score=26.47 Aligned_cols=28 Identities=11% Similarity=0.345 Sum_probs=20.2
Q ss_pred ccccCcCCcccccCCccccccCCCcccccccCccC
Q psy11980 427 LLRCSKCKVTFYCSKQHQTLDWKAHKINCNILSNQ 461 (831)
Q Consensus 427 ~~~C~~C~~~f~~~~~h~~~h~~~~~~~c~~c~~~ 461 (831)
.|+|+.||..|.....- ....|+.||.+
T Consensus 3 ~y~C~~CG~~~~~~~~~-------~~~~Cp~CG~~ 30 (46)
T PRK00398 3 EYKCARCGREVELDEYG-------TGVRCPYCGYR 30 (46)
T ss_pred EEECCCCCCEEEECCCC-------CceECCCCCCe
Confidence 48999999988643221 26789999965
No 168
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=36.75 E-value=18 Score=30.48 Aligned_cols=13 Identities=31% Similarity=0.680 Sum_probs=7.8
Q ss_pred CcccccccccccC
Q psy11980 357 PKSCEVCGRAFIR 369 (831)
Q Consensus 357 py~C~~C~~~F~~ 369 (831)
-+.|..||..|.-
T Consensus 53 IW~C~kCg~~fAG 65 (89)
T COG1997 53 IWKCRKCGAKFAG 65 (89)
T ss_pred eEEcCCCCCeecc
Confidence 4666666666653
No 169
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=36.65 E-value=30 Score=43.53 Aligned_cols=39 Identities=18% Similarity=0.428 Sum_probs=24.9
Q ss_pred cCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHHHHhhhccCCCCcccccccccc
Q psy11980 303 QCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAF 367 (831)
Q Consensus 303 ~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~F 367 (831)
.|+.||... +.+|.|+.||....... ++ ...|+.|+...
T Consensus 681 fCP~CGs~t---------------e~vy~CPsCGaev~~de---------s~--a~~CP~CGtpl 719 (1337)
T PRK14714 681 RCPDCGTHT---------------EPVYVCPDCGAEVPPDE---------SG--RVECPRCDVEL 719 (1337)
T ss_pred cCcccCCcC---------------CCceeCccCCCccCCCc---------cc--cccCCCCCCcc
Confidence 688888653 23578999988654311 11 44699998543
No 170
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=36.39 E-value=27 Score=22.39 Aligned_cols=19 Identities=32% Similarity=0.644 Sum_probs=13.0
Q ss_pred ccccccccccChhHHHHHHH
Q psy11980 359 SCEVCGRAFIRQDCLIRHLR 378 (831)
Q Consensus 359 ~C~~C~~~F~~~~~L~~H~r 378 (831)
.|++|++.+ ....+..|+.
T Consensus 3 ~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCPVCFREV-PENLINSHLD 21 (26)
T ss_pred cCCCCcCcc-cHHHHHHHHH
Confidence 577787777 5566677764
No 171
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=35.78 E-value=19 Score=37.54 Aligned_cols=24 Identities=25% Similarity=0.530 Sum_probs=14.2
Q ss_pred CccCCCCCCccCChhHHHHHhhhc
Q psy11980 329 PHKCPICTKGFTMKSSLKIHLLTH 352 (831)
Q Consensus 329 ~~~C~~C~k~F~~~~~L~~H~~~H 352 (831)
.|.|+.|...|-.--+.-.|...|
T Consensus 388 rY~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 388 RYQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred ceechhhhhhhhhhhHHHHHHHHh
Confidence 466666666666665655555444
No 172
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=34.94 E-value=21 Score=27.75 Aligned_cols=21 Identities=24% Similarity=0.504 Sum_probs=9.2
Q ss_pred hhHHHHHHhhhcCCCCccCCC
Q psy11980 314 KEDLTRHCLLHTGVKPHKCPI 334 (831)
Q Consensus 314 ~~~L~~H~~~H~~~k~~~C~~ 334 (831)
+..|..|+...-..++..|++
T Consensus 23 r~~l~~H~~~~C~~~~v~C~~ 43 (60)
T PF02176_consen 23 RKELDDHLENECPKRPVPCPY 43 (60)
T ss_dssp CCCHHHHHHTTSTTSEEE-SS
T ss_pred HHHHHHHHHccCCCCcEECCC
Confidence 334455554444444444444
No 173
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=34.92 E-value=19 Score=33.33 Aligned_cols=25 Identities=12% Similarity=0.344 Sum_probs=15.6
Q ss_pred cccCcccccccCchHHHHHhhhcccccc
Q psy11980 43 HLCKGCYRIFTSQGYMLLSYKSKEEEKQ 70 (831)
Q Consensus 43 ~~C~~C~~~F~~~~~l~~H~~~~~~~~~ 70 (831)
..|-+||+.|.. |.+|++.|+|-.+
T Consensus 73 i~clecGk~~k~---LkrHL~~~~gltp 97 (132)
T PF05443_consen 73 IICLECGKKFKT---LKRHLRTHHGLTP 97 (132)
T ss_dssp EE-TBT--EESB---HHHHHHHTT-S-H
T ss_pred eEEccCCcccch---HHHHHHHccCCCH
Confidence 568899998876 5888888877554
No 174
>PRK14873 primosome assembly protein PriA; Provisional
Probab=33.89 E-value=24 Score=42.38 Aligned_cols=44 Identities=25% Similarity=0.708 Sum_probs=0.0
Q ss_pred CcccccccccchHHHhhhhhcccCCCCccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHHHHhhhccC
Q psy11980 275 CELCNKTFNRKARLELHMKYIHEGADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLKIHLLTHTK 354 (831)
Q Consensus 275 C~~C~k~F~~~~~L~~H~~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~ 354 (831)
|..|| ...+|+.|+-.... |...+...|.+||..
T Consensus 386 C~~Cg--------------------~~~~C~~C~~~L~~----------h~~~~~l~Ch~CG~~---------------- 419 (665)
T PRK14873 386 CARCR--------------------TPARCRHCTGPLGL----------PSAGGTPRCRWCGRA---------------- 419 (665)
T ss_pred hhhCc--------------------CeeECCCCCCceeE----------ecCCCeeECCCCcCC----------------
Q ss_pred CCCccccccc
Q psy11980 355 EPPKSCEVCG 364 (831)
Q Consensus 355 ~~py~C~~C~ 364 (831)
..|+.|+.||
T Consensus 420 ~~p~~Cp~Cg 429 (665)
T PRK14873 420 APDWRCPRCG 429 (665)
T ss_pred CcCccCCCCc
No 175
>KOG2807|consensus
Probab=33.41 E-value=39 Score=35.76 Aligned_cols=31 Identities=26% Similarity=0.590 Sum_probs=20.0
Q ss_pred CCccCCCCCCccCChhHHHHHhhhccCCCCccccccc
Q psy11980 328 KPHKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCG 364 (831)
Q Consensus 328 k~~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~ 364 (831)
..|.|..|...|-.--+...|...| .|+-|.
T Consensus 344 ~~y~C~~Ck~~FCldCDv~iHesLh------~CpgCe 374 (378)
T KOG2807|consen 344 GRYRCESCKNVFCLDCDVFIHESLH------NCPGCE 374 (378)
T ss_pred CcEEchhccceeeccchHHHHhhhh------cCCCcC
Confidence 3477777777777776666665544 455554
No 176
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=33.01 E-value=22 Score=26.67 Aligned_cols=30 Identities=20% Similarity=0.440 Sum_probs=21.1
Q ss_pred ccccccCcCCcccccCCccccccCCCcccccccCccC
Q psy11980 425 ENLLRCSKCKVTFYCSKQHQTLDWKAHKINCNILSNQ 461 (831)
Q Consensus 425 e~~~~C~~C~~~f~~~~~h~~~h~~~~~~~c~~c~~~ 461 (831)
...|+|..||+.|. .........|..||..
T Consensus 4 ~~~Y~C~~Cg~~~~-------~~~~~~~irCp~Cg~r 33 (49)
T COG1996 4 MMEYKCARCGREVE-------LDQETRGIRCPYCGSR 33 (49)
T ss_pred eEEEEhhhcCCeee-------hhhccCceeCCCCCcE
Confidence 45699999999992 2223455679999854
No 177
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=32.38 E-value=21 Score=24.19 Aligned_cols=9 Identities=44% Similarity=1.224 Sum_probs=3.8
Q ss_pred Ccccccccc
Q psy11980 357 PKSCEVCGR 365 (831)
Q Consensus 357 py~C~~C~~ 365 (831)
+.+|+.||.
T Consensus 17 ~irC~~CG~ 25 (32)
T PF03604_consen 17 PIRCPECGH 25 (32)
T ss_dssp TSSBSSSS-
T ss_pred cEECCcCCC
Confidence 344555543
No 178
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=31.94 E-value=31 Score=40.13 Aligned_cols=45 Identities=27% Similarity=0.627 Sum_probs=0.0
Q ss_pred CcccccccccchHHHhhhhhcccCCCCccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHHHHhhhccC
Q psy11980 275 CELCNKTFNRKARLELHMKYIHEGADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLKIHLLTHTK 354 (831)
Q Consensus 275 C~~C~k~F~~~~~L~~H~~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~ 354 (831)
|..|| ...+|+.|+-.... |.......|.+||...
T Consensus 216 C~~Cg--------------------~~~~C~~C~~~l~~----------h~~~~~l~Ch~Cg~~~--------------- 250 (505)
T TIGR00595 216 CRSCG--------------------YILCCPNCDVSLTY----------HKKEGKLRCHYCGYQE--------------- 250 (505)
T ss_pred hhhCc--------------------CccCCCCCCCceEE----------ecCCCeEEcCCCcCcC---------------
Q ss_pred CCCccccccc
Q psy11980 355 EPPKSCEVCG 364 (831)
Q Consensus 355 ~~py~C~~C~ 364 (831)
..|..|+.|+
T Consensus 251 ~~~~~Cp~C~ 260 (505)
T TIGR00595 251 PIPKTCPQCG 260 (505)
T ss_pred CCCCCCCCCC
No 179
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=31.38 E-value=41 Score=35.24 Aligned_cols=85 Identities=25% Similarity=0.506 Sum_probs=52.0
Q ss_pred CccccccchhhhcChhhHHhhhh----------ccccc--CCccCcccccccccchHHHhhhhhcccCCCCccCcccccc
Q psy11980 243 KPHVCEYCEKAFIQASQLKAHMF----------HHTGE--NGHTCELCNKTFNRKARLELHMKYIHEGADPYQCDICRKT 310 (831)
Q Consensus 243 k~~~C~~C~k~f~~~~~L~~H~~----------~H~~~--k~~~C~~C~k~F~~~~~L~~H~~~~H~~~k~~~C~~C~k~ 310 (831)
-|..|+.|.........|.+-.. ...+. +.-.|-.|.-.|..... |..-.-+....|+|+.|...
T Consensus 321 LPi~CP~Csl~LilsthLarSyhhL~PLk~f~E~p~~~~~ks~~Cf~CQ~~fp~~~~---~~~~~~~ss~rY~Ce~CK~~ 397 (421)
T COG5151 321 LPISCPICSLQLILSTHLARSYHHLYPLKPFVEKPEGTNPKSTHCFVCQGPFPKPPV---SPFDESTSSGRYQCELCKST 397 (421)
T ss_pred CCccCcchhHHHHHHHHHHHHHHhhccCcccccccCCCCCCCccceeccCCCCCCCC---Ccccccccccceechhhhhh
Confidence 35678888776655544443211 11122 22348888888876432 21111223457999999999
Q ss_pred cCChhHHHHHHhhhcCCCCccCCCCC
Q psy11980 311 FIRKEDLTRHCLLHTGVKPHKCPICT 336 (831)
Q Consensus 311 F~~~~~L~~H~~~H~~~k~~~C~~C~ 336 (831)
|-..-..-.|...| .|..|.
T Consensus 398 FC~dCdvfiHe~Lh------~C~gCe 417 (421)
T COG5151 398 FCSDCDVFIHETLH------FCIGCE 417 (421)
T ss_pred hhhhhHHHHHHHHh------hCCCCc
Confidence 99999988898877 366664
No 180
>PF15269 zf-C2H2_7: Zinc-finger
Probab=31.37 E-value=30 Score=25.07 Aligned_cols=22 Identities=18% Similarity=0.365 Sum_probs=18.3
Q ss_pred cccccccccccChhHHHHHHHH
Q psy11980 358 KSCEVCGRAFIRQDCLIRHLRQ 379 (831)
Q Consensus 358 y~C~~C~~~F~~~~~L~~H~r~ 379 (831)
|+|-+|..+...++.|-.||+-
T Consensus 21 ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred ceeecCCcccchHHHHHHHHHH
Confidence 6788888888888888888864
No 181
>KOG4167|consensus
Probab=30.65 E-value=21 Score=41.68 Aligned_cols=24 Identities=25% Similarity=0.557 Sum_probs=15.5
Q ss_pred ccccccCcccCChhHHHHHHHHhc
Q psy11980 73 TYIKVCGRVFNRKDNLREHLRAHA 96 (831)
Q Consensus 73 ~~C~~C~k~f~~~~~L~~H~~~h~ 96 (831)
|.|.+|+|.|.....+..||++|-
T Consensus 793 FpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 793 FPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred eehHHHHHHHHHHhhhhHHHHHHH
Confidence 566666666666666666666664
No 182
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=30.53 E-value=35 Score=41.13 Aligned_cols=47 Identities=21% Similarity=0.428 Sum_probs=35.6
Q ss_pred ccccCcCCcccccCC--ccccccCCCcccccccCccCcccCCccccCCCc-cccccccc
Q psy11980 427 LLRCSKCKVTFYCSK--QHQTLDWKAHKINCNILSNQSATNNTSLANNTP-QVAQIQNQ 482 (831)
Q Consensus 427 ~~~C~~C~~~f~~~~--~h~~~h~~~~~~~c~~c~~~~~~~~~~~~~~~~-~~~~~~~~ 482 (831)
...|..||..+.|.. .-...|...+...|..||.. ...| .|.+||..
T Consensus 435 ~l~C~~Cg~v~~Cp~Cd~~lt~H~~~~~L~CH~Cg~~---------~~~p~~Cp~Cgs~ 484 (730)
T COG1198 435 LLLCRDCGYIAECPNCDSPLTLHKATGQLRCHYCGYQ---------EPIPQSCPECGSE 484 (730)
T ss_pred eeecccCCCcccCCCCCcceEEecCCCeeEeCCCCCC---------CCCCCCCCCCCCC
Confidence 367999999999885 55666777788899999954 4456 78888764
No 183
>PF05721 PhyH: Phytanoyl-CoA dioxygenase (PhyH); InterPro: IPR008775 This family is made up of several eukaryotic phytanoyl-CoA dioxygenase (PhyH) proteins as well as a number of bacterial deoxygenases. PhyH is a peroxisomal enzyme catalysing the first step of phytanic acid alpha-oxidation. PhyH deficiency causes Refsum's disease (RD) which is an inherited neurological syndrome biochemically characterised by the accumulation of phytanic acid in plasma and tissues [].; PDB: 3GJA_A 3EMR_A 3OBZ_A 2OPW_A 3NNL_B 3NNF_A 3NNM_B 3NNJ_A 2FCV_B 2FCU_A ....
Probab=29.35 E-value=43 Score=33.00 Aligned_cols=27 Identities=26% Similarity=0.352 Sum_probs=23.9
Q ss_pred hcCCeEEEcCCCChhHHHHHHHHHHhh
Q psy11980 648 DQYGLCVVDNFLGPERGMAVLHEVLGM 674 (831)
Q Consensus 648 ~~~g~~~id~~l~~~~~~~l~~~~~~~ 674 (831)
+++||++|.++|+++.+.+++.++..+
T Consensus 2 ~~~Gyvvi~~~l~~~~~~~l~~~~~~~ 28 (211)
T PF05721_consen 2 RRDGYVVIRNVLSPEEVERLREELDRL 28 (211)
T ss_dssp HHHSEEEETTSS-HHHHHHHHHHHHHH
T ss_pred cCCcEEEECCcCCHHHHHHHHHHHHHH
Confidence 468999999999999999999999886
No 184
>TIGR02408 ectoine_ThpD ectoine hydroxylase. Both ectoine and hydroxyectoine are compatible solvents that serve as protectants against osmotic and thermal stresses. A number of genomes synthesize ectoine. This enzyme allows conversion of ectoine to hydroxyectoine, which may be more effective for some purposes, and is found in a subset of ectoine-producing organisms.
Probab=29.24 E-value=9.8 Score=40.43 Aligned_cols=32 Identities=13% Similarity=0.286 Sum_probs=27.6
Q ss_pred ccccCCccceeeecCCCCcccccccccceeee
Q psy11980 593 VIRDMDQYGLCVLDDFLGPERDMHLDTNVVVV 624 (831)
Q Consensus 593 i~~~l~~~g~~v~d~fl~~~~~~~l~~~~~~~ 624 (831)
.++.+.+.||.|+++||+++.+++|.+++..+
T Consensus 21 qi~~f~~dGyvvl~~vls~eev~~lr~~i~~~ 52 (277)
T TIGR02408 21 QLQSYERDGFLLLENLFSDDEVAALLAEVERM 52 (277)
T ss_pred HHHHHHHCCEEECcccCCHHHHHHHHHHHHHH
Confidence 36688899999999999999999998887543
No 185
>PF15269 zf-C2H2_7: Zinc-finger
Probab=29.23 E-value=40 Score=24.46 Aligned_cols=25 Identities=24% Similarity=0.262 Sum_probs=15.6
Q ss_pred ccccccccccCcccCChhHHHHHHH
Q psy11980 69 KQICTYIKVCGRVFNRKDNLREHLR 93 (831)
Q Consensus 69 ~~~~~~C~~C~k~f~~~~~L~~H~~ 93 (831)
+++-|+|-.|..+...++.|-.||+
T Consensus 17 kp~~ykcfqcpftc~~kshl~nhmk 41 (54)
T PF15269_consen 17 KPFKYKCFQCPFTCNEKSHLFNHMK 41 (54)
T ss_pred CCccceeecCCcccchHHHHHHHHH
Confidence 3444666666666666666666664
No 186
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=29.22 E-value=39 Score=35.99 Aligned_cols=27 Identities=22% Similarity=0.359 Sum_probs=21.4
Q ss_pred ccCCcccCcccccccCchHHHHHhhhc
Q psy11980 39 TVPGHLCKGCYRIFTSQGYMLLSYKSK 65 (831)
Q Consensus 39 ~~~~~~C~~C~~~F~~~~~l~~H~~~~ 65 (831)
..+.+-|..|++.|....-+..|....
T Consensus 235 ~~~~~YC~~C~r~f~~~~VFe~Hl~gK 261 (470)
T COG5188 235 WFPKVYCVKCGREFSRSKVFEYHLEGK 261 (470)
T ss_pred hccceeeHhhhhHhhhhHHHHHHHhhh
Confidence 345678999999999988888887543
No 187
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=29.04 E-value=43 Score=42.24 Aligned_cols=54 Identities=24% Similarity=0.544 Sum_probs=33.5
Q ss_pred cccccchhhhcChhhHHhhhhcccccCCccCcccccccccchHHHhhhhhcccCCCCccCcccccccCChhHHHHHHhhh
Q psy11980 245 HVCEYCEKAFIQASQLKAHMFHHTGENGHTCELCNKTFNRKARLELHMKYIHEGADPYQCDICRKTFIRKEDLTRHCLLH 324 (831)
Q Consensus 245 ~~C~~C~k~f~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H 324 (831)
++|+.||.... ...|+.||... +.+|.|+.||.......
T Consensus 668 rkCPkCG~~t~----------------~~fCP~CGs~t----------------e~vy~CPsCGaev~~de--------- 706 (1337)
T PRK14714 668 RRCPSCGTETY----------------ENRCPDCGTHT----------------EPVYVCPDCGAEVPPDE--------- 706 (1337)
T ss_pred EECCCCCCccc----------------cccCcccCCcC----------------CCceeCccCCCccCCCc---------
Confidence 67888886421 12788888432 23578999998654311
Q ss_pred cCCCCccCCCCCCccCC
Q psy11980 325 TGVKPHKCPICTKGFTM 341 (831)
Q Consensus 325 ~~~k~~~C~~C~k~F~~ 341 (831)
++ ...|+.|+.....
T Consensus 707 s~--a~~CP~CGtplv~ 721 (1337)
T PRK14714 707 SG--RVECPRCDVELTP 721 (1337)
T ss_pred cc--cccCCCCCCcccc
Confidence 12 4579999865443
No 188
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=28.87 E-value=22 Score=27.63 Aligned_cols=40 Identities=25% Similarity=0.581 Sum_probs=18.4
Q ss_pred CCccCCC--CCCccCChhHHHHHhhhccCCCCccccc----cccccc
Q psy11980 328 KPHKCPI--CTKGFTMKSSLKIHLLTHTKEPPKSCEV----CGRAFI 368 (831)
Q Consensus 328 k~~~C~~--C~k~F~~~~~L~~H~~~H~~~~py~C~~----C~~~F~ 368 (831)
.+..|+. |...+. +..|..|+...-..++..|++ |+..+.
T Consensus 8 ~~v~C~~~cc~~~i~-r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~ 53 (60)
T PF02176_consen 8 RPVPCPNGCCNEMIP-RKELDDHLENECPKRPVPCPYSPYGCKERVP 53 (60)
T ss_dssp SEEE-TT--S-BEEE-CCCHHHHHHTTSTTSEEE-SS----S--EEE
T ss_pred CEeeCCCCCccccee-HHHHHHHHHccCCCCcEECCCCCCCCCCccc
Confidence 3445655 333333 446666666544455556666 665554
No 189
>PF14353 CpXC: CpXC protein
Probab=28.83 E-value=45 Score=30.73 Aligned_cols=12 Identities=25% Similarity=0.897 Sum_probs=5.9
Q ss_pred cccCcCCCcCCC
Q psy11980 132 YECDICKNRFPS 143 (831)
Q Consensus 132 ~~C~~C~k~F~~ 143 (831)
|.|+.||..|.-
T Consensus 39 ~~CP~Cg~~~~~ 50 (128)
T PF14353_consen 39 FTCPSCGHKFRL 50 (128)
T ss_pred EECCCCCCceec
Confidence 445555554443
No 190
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=28.48 E-value=21 Score=36.03 Aligned_cols=27 Identities=22% Similarity=0.396 Sum_probs=18.2
Q ss_pred cccCcccccccCchHHHHHhhhccccc
Q psy11980 43 HLCKGCYRIFTSQGYMLLSYKSKEEEK 69 (831)
Q Consensus 43 ~~C~~C~~~F~~~~~l~~H~~~~~~~~ 69 (831)
|.|+.|+|.|........|+...|.++
T Consensus 78 ~~C~lc~KlFkg~eFV~KHI~nKH~e~ 104 (214)
T PF04959_consen 78 WRCPLCGKLFKGPEFVRKHIFNKHPEK 104 (214)
T ss_dssp EEE-SSS-EESSHHHHHHHHHHH-HHH
T ss_pred ECCCCCCcccCChHHHHHHHhhcCHHH
Confidence 778888888888777777877766654
No 191
>PF13661 2OG-FeII_Oxy_4: 2OG-Fe(II) oxygenase superfamily
Probab=27.94 E-value=63 Score=26.20 Aligned_cols=24 Identities=21% Similarity=0.316 Sum_probs=16.1
Q ss_pred CcceeEEe--c-CcccCCCceeEeccC
Q psy11980 738 GRMGDFVI--N-GRTKQNGGLLRIFPE 761 (831)
Q Consensus 738 ~R~~s~v~--~-~W~~~~GGeL~ly~~ 761 (831)
.|.+|+++ + +|.++.||...+|++
T Consensus 38 ~r~~t~llYLn~~w~~d~~Gg~~~f~~ 64 (70)
T PF13661_consen 38 RRFLTLLLYLNEDWDEDFGGGELFFDD 64 (70)
T ss_pred ceeEEEEEEecccccCccCCcEEEEeC
Confidence 37777777 4 999998555555443
No 192
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=27.84 E-value=44 Score=30.64 Aligned_cols=31 Identities=32% Similarity=0.341 Sum_probs=24.0
Q ss_pred cccccccccccChhHHHHHHHHhccCchhHHHhh
Q psy11980 358 KSCEVCGRAFIRQDCLIRHLRQKHRSHLDEVLAQ 391 (831)
Q Consensus 358 y~C~~C~~~F~~~~~L~~H~r~~H~~~~~~~~~~ 391 (831)
..|-+||+.|. +|++|+.+||+..+++.-+.
T Consensus 77 IicLEDGkkfK---SLKRHL~t~~gmTPd~YR~K 107 (148)
T COG4957 77 IICLEDGKKFK---SLKRHLTTHYGLTPDEYRAK 107 (148)
T ss_pred EEEeccCcchH---HHHHHHhcccCCCHHHHHHh
Confidence 36888888886 58899999998887766543
No 193
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=27.49 E-value=24 Score=27.09 Aligned_cols=28 Identities=21% Similarity=0.485 Sum_probs=19.7
Q ss_pred cccCcccccccCchHHHHHhhhccccccccccccccCcccCC
Q psy11980 43 HLCKGCYRIFTSQGYMLLSYKSKEEEKQICTYIKVCGRVFNR 84 (831)
Q Consensus 43 ~~C~~C~~~F~~~~~l~~H~~~~~~~~~~~~~C~~C~k~f~~ 84 (831)
-+|..||+.|.....+ ..|+.|+..+.+
T Consensus 6 ~~C~~Cg~~~~~~dDi--------------VvCp~CgapyHR 33 (54)
T PF14446_consen 6 CKCPVCGKKFKDGDDI--------------VVCPECGAPYHR 33 (54)
T ss_pred ccChhhCCcccCCCCE--------------EECCCCCCcccH
Confidence 4699999988754332 478888887643
No 194
>KOG4124|consensus
Probab=27.35 E-value=16 Score=38.77 Aligned_cols=70 Identities=23% Similarity=0.380 Sum_probs=37.1
Q ss_pred cCCccCcc--cccccccchHHHhhhhhcccCCCCccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHHH
Q psy11980 270 ENGHTCEL--CNKTFNRKARLELHMKYIHEGADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLKI 347 (831)
Q Consensus 270 ~k~~~C~~--C~k~F~~~~~L~~H~~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~ 347 (831)
.++|+|++ |.+.+.....|..|....|...- ...++ .-+-|.-.-...|+|+|++|.+++..-..|.-
T Consensus 347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i---------~~~s~-~~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~~ 416 (442)
T KOG4124|consen 347 DKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPI---------TTPTP-APIPHQGFVVENKPYRCEVCSKRYKNLNGLKY 416 (442)
T ss_pred cCCCCCCCCcchhhcccCcceeeccccCcCCCC---------CCCCC-CCCCcceeeeccCcccChhhhhhhccCCCCCc
Confidence 36777754 88888877777777654442110 00000 00112111123567777777777776655555
Q ss_pred Hh
Q psy11980 348 HL 349 (831)
Q Consensus 348 H~ 349 (831)
|.
T Consensus 417 ~~ 418 (442)
T KOG4124|consen 417 HR 418 (442)
T ss_pred ee
Confidence 43
No 195
>KOG2807|consensus
Probab=27.09 E-value=72 Score=33.91 Aligned_cols=18 Identities=28% Similarity=0.700 Sum_probs=9.3
Q ss_pred ccccccccCccccCchhHH
Q psy11980 159 PYECKEVCGRVFNRKDNLR 177 (831)
Q Consensus 159 ~~~C~~~C~~~f~~~~~L~ 177 (831)
|..|+ +|+-.......|.
T Consensus 290 P~eCp-iC~ltLVss~hLA 307 (378)
T KOG2807|consen 290 PIECP-ICSLTLVSSPHLA 307 (378)
T ss_pred CccCC-ccceeEecchHHH
Confidence 45565 5655554444443
No 196
>KOG2593|consensus
Probab=27.05 E-value=56 Score=36.23 Aligned_cols=40 Identities=30% Similarity=0.580 Sum_probs=26.4
Q ss_pred hcCCCCccCCCCCCccCChhHHHHHhhhccCCCCccccccccc
Q psy11980 324 HTGVKPHKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRA 366 (831)
Q Consensus 324 H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~ 366 (831)
-+...-|.|+.|.++|+.-..++. .-.....|.|..|+-.
T Consensus 123 ~t~~~~Y~Cp~C~kkyt~Lea~~L---~~~~~~~F~C~~C~ge 162 (436)
T KOG2593|consen 123 DTNVAGYVCPNCQKKYTSLEALQL---LDNETGEFHCENCGGE 162 (436)
T ss_pred ccccccccCCccccchhhhHHHHh---hcccCceEEEecCCCc
Confidence 445567899999988876555432 2222446889999743
No 197
>KOG4124|consensus
Probab=26.61 E-value=22 Score=37.79 Aligned_cols=50 Identities=12% Similarity=0.099 Sum_probs=31.1
Q ss_pred CCccccCc--CCCcCCChHHHHH-----HHHHhcCCCccccccccCccccCchhHHHHH
Q psy11980 129 EKLYECDI--CKNRFPSSGAMKK-----HRRKHTGERPYECKEVCGRVFNRKDNLREHL 180 (831)
Q Consensus 129 ~k~~~C~~--C~k~F~~~~~L~~-----H~~~h~~~~~~~C~~~C~~~f~~~~~L~~H~ 180 (831)
|+++.|.. |.+.......... |+-.-+..+||+|+ |++...++..|..|-
T Consensus 176 E~~~S~~vp~~~~~~~~~Ns~~~~S~~~~~T~~t~~~p~k~~--~~~~~~T~~~l~~HS 232 (442)
T KOG4124|consen 176 EYRVSVVVPAAAAAAAAANSSDMSSDEASSTAETTGTPKKMP--ESLVMDTSSPLSDHS 232 (442)
T ss_pred cccccccCchhhhhhhccccccccccccccccccccCCccCc--ccccccccchhhhcc
Confidence 45566654 5555544333332 33334456788885 888888888888874
No 198
>KOG2461|consensus
Probab=26.57 E-value=45 Score=37.25 Aligned_cols=70 Identities=9% Similarity=-0.110 Sum_probs=33.9
Q ss_pred cccCCCCccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHHHHhhhccCCCCccccccc
Q psy11980 295 IHEGADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCG 364 (831)
Q Consensus 295 ~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~ 364 (831)
.++.....-++++.+.+.....+..|...+.++.++.+..+...+.....+..+..+|...+.+.+..|+
T Consensus 325 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (396)
T KOG2461|consen 325 PATVSVWTGETIPVRTPAGQLIYTQSHSMEVAEPTDMAPNQIWKIYHTGVLGFLIITTDESECNNMSFVC 394 (396)
T ss_pred cccccccCcCcccccccccccchhhhhhcccCCCCcccccccccceeccccceeeeeccccccccccccC
Confidence 3444444445555555555555555555555555544444444444444444444455554555544443
No 199
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=25.82 E-value=50 Score=24.33 Aligned_cols=11 Identities=18% Similarity=0.655 Sum_probs=5.8
Q ss_pred CCCcccccccc
Q psy11980 355 EPPKSCEVCGR 365 (831)
Q Consensus 355 ~~py~C~~C~~ 365 (831)
...|+|..|++
T Consensus 35 ~~~~~C~~C~~ 45 (46)
T PF12760_consen 35 RGRYRCKACRK 45 (46)
T ss_pred CCeEECCCCCC
Confidence 34456665553
No 200
>TIGR03826 YvyF flagellar operon protein TIGR03826. This gene is found in flagellar operons of Bacillus-related organisms. Its function has not been determined and an official gene symbol has not been assigned, although the gene is designated yvyF in B. subtilus. A tentative assignment as a regulator is suggested in the NCBI record GI:16080597.
Probab=25.24 E-value=72 Score=29.87 Aligned_cols=14 Identities=14% Similarity=0.432 Sum_probs=10.7
Q ss_pred ccccccCcCCcccc
Q psy11980 425 ENLLRCSKCKVTFY 438 (831)
Q Consensus 425 e~~~~C~~C~~~f~ 438 (831)
.--|.|..||..-.
T Consensus 79 nl~~~CE~CG~~I~ 92 (137)
T TIGR03826 79 NLGYPCERCGTSIR 92 (137)
T ss_pred CCcCcccccCCcCC
Confidence 33499999998765
No 201
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=25.06 E-value=21 Score=25.61 Aligned_cols=10 Identities=20% Similarity=0.488 Sum_probs=6.2
Q ss_pred ccccccCccc
Q psy11980 73 TYIKVCGRVF 82 (831)
Q Consensus 73 ~~C~~C~k~f 82 (831)
|.|..|+..+
T Consensus 29 y~C~~C~~~w 38 (40)
T smart00440 29 YVCTKCGHRW 38 (40)
T ss_pred EEeCCCCCEe
Confidence 5666666654
No 202
>PF05721 PhyH: Phytanoyl-CoA dioxygenase (PhyH); InterPro: IPR008775 This family is made up of several eukaryotic phytanoyl-CoA dioxygenase (PhyH) proteins as well as a number of bacterial deoxygenases. PhyH is a peroxisomal enzyme catalysing the first step of phytanic acid alpha-oxidation. PhyH deficiency causes Refsum's disease (RD) which is an inherited neurological syndrome biochemically characterised by the accumulation of phytanic acid in plasma and tissues [].; PDB: 3GJA_A 3EMR_A 3OBZ_A 2OPW_A 3NNL_B 3NNF_A 3NNM_B 3NNJ_A 2FCV_B 2FCU_A ....
Probab=25.05 E-value=17 Score=36.05 Aligned_cols=25 Identities=24% Similarity=0.356 Sum_probs=20.9
Q ss_pred ccceeeecCCCCcccccccccceee
Q psy11980 599 QYGLCVLDDFLGPERDMHLDTNVVV 623 (831)
Q Consensus 599 ~~g~~v~d~fl~~~~~~~l~~~~~~ 623 (831)
++||+|+.++|+++.+++|++++..
T Consensus 3 ~~Gyvvi~~~l~~~~~~~l~~~~~~ 27 (211)
T PF05721_consen 3 RDGYVVIRNVLSPEEVERLREELDR 27 (211)
T ss_dssp HHSEEEETTSS-HHHHHHHHHHHHH
T ss_pred CCcEEEECCcCCHHHHHHHHHHHHH
Confidence 5899999999999999999777753
No 203
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=24.93 E-value=43 Score=30.80 Aligned_cols=15 Identities=40% Similarity=0.875 Sum_probs=11.4
Q ss_pred CccCCCCCCccCChh
Q psy11980 329 PHKCPICTKGFTMKS 343 (831)
Q Consensus 329 ~~~C~~C~k~F~~~~ 343 (831)
|++|..||+.|..-+
T Consensus 1 PH~Ct~Cg~~f~dgs 15 (131)
T PF09845_consen 1 PHQCTKCGRVFEDGS 15 (131)
T ss_pred CcccCcCCCCcCCCc
Confidence 578888888887654
No 204
>PRK05580 primosome assembly protein PriA; Validated
Probab=24.40 E-value=47 Score=40.24 Aligned_cols=47 Identities=26% Similarity=0.557 Sum_probs=0.0
Q ss_pred ccCcccccccccchHHHhhhhhcccCCCCccCcccccccCChhHHHHHHhhhcCCCCccCCCCCCccCChhHHHHHhhhc
Q psy11980 273 HTCELCNKTFNRKARLELHMKYIHEGADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKGFTMKSSLKIHLLTH 352 (831)
Q Consensus 273 ~~C~~C~k~F~~~~~L~~H~~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~C~~C~k~F~~~~~L~~H~~~H 352 (831)
..|..|| ...+|+.|+-.... |...+...|..||+..
T Consensus 382 ~~C~~Cg--------------------~~~~C~~C~~~l~~----------h~~~~~l~Ch~Cg~~~------------- 418 (679)
T PRK05580 382 LLCRDCG--------------------WVAECPHCDASLTL----------HRFQRRLRCHHCGYQE------------- 418 (679)
T ss_pred eEhhhCc--------------------CccCCCCCCCceeE----------ECCCCeEECCCCcCCC-------------
Q ss_pred cCCCCccccccc
Q psy11980 353 TKEPPKSCEVCG 364 (831)
Q Consensus 353 ~~~~py~C~~C~ 364 (831)
..|..|+.||
T Consensus 419 --~~~~~Cp~Cg 428 (679)
T PRK05580 419 --PIPKACPECG 428 (679)
T ss_pred --CCCCCCCCCc
No 205
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=24.31 E-value=51 Score=33.79 Aligned_cols=24 Identities=13% Similarity=0.308 Sum_probs=16.9
Q ss_pred cCCccCcccccccccchHHHhhhh
Q psy11980 270 ENGHTCELCNKTFNRKARLELHMK 293 (831)
Q Consensus 270 ~k~~~C~~C~k~F~~~~~L~~H~~ 293 (831)
.+++.|+.|+........|..-.+
T Consensus 207 ~k~~PCPKCg~et~eTkdLSmStR 230 (314)
T PF06524_consen 207 GKPIPCPKCGYETQETKDLSMSTR 230 (314)
T ss_pred CCCCCCCCCCCcccccccceeeee
Confidence 367788888877776666665555
No 206
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=24.04 E-value=30 Score=23.13 Aligned_cols=10 Identities=30% Similarity=0.730 Sum_probs=5.6
Q ss_pred CCcccccccc
Q psy11980 356 PPKSCEVCGR 365 (831)
Q Consensus 356 ~py~C~~C~~ 365 (831)
..|.|+.|+.
T Consensus 18 ~~~vCp~C~~ 27 (30)
T PF08274_consen 18 ELLVCPECGH 27 (30)
T ss_dssp SSEEETTTTE
T ss_pred CEEeCCcccc
Confidence 3456666654
No 207
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=24.02 E-value=38 Score=36.11 Aligned_cols=20 Identities=25% Similarity=0.496 Sum_probs=10.7
Q ss_pred ccCcccccccCChhHHHHHH
Q psy11980 302 YQCDICRKTFIRKEDLTRHC 321 (831)
Q Consensus 302 ~~C~~C~k~F~~~~~L~~H~ 321 (831)
+-|+.|++.|.....+..|.
T Consensus 239 ~YC~~C~r~f~~~~VFe~Hl 258 (470)
T COG5188 239 VYCVKCGREFSRSKVFEYHL 258 (470)
T ss_pred eeeHhhhhHhhhhHHHHHHH
Confidence 44555555555555555554
No 208
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=23.96 E-value=40 Score=20.97 Aligned_cols=6 Identities=50% Similarity=1.481 Sum_probs=2.7
Q ss_pred cccccc
Q psy11980 360 CEVCGR 365 (831)
Q Consensus 360 C~~C~~ 365 (831)
|+.||.
T Consensus 16 C~~CG~ 21 (23)
T PF13240_consen 16 CPNCGT 21 (23)
T ss_pred hhhhCC
Confidence 444443
No 209
>KOG4167|consensus
Probab=23.89 E-value=21 Score=41.69 Aligned_cols=24 Identities=29% Similarity=0.710 Sum_probs=22.1
Q ss_pred cccCccccccCChhHHHHHHHHhc
Q psy11980 105 FKCDKCEKQFYGLTLLRIHERLHA 128 (831)
Q Consensus 105 ~~C~~C~k~f~~~~~L~~H~~~H~ 128 (831)
|.|.+|+|.|.....+..||+.|.
T Consensus 793 FpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 793 FPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred eehHHHHHHHHHHhhhhHHHHHHH
Confidence 789999999999999999999996
No 210
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.61 E-value=42 Score=29.51 Aligned_cols=27 Identities=19% Similarity=0.199 Sum_probs=20.5
Q ss_pred cCCCCCCccCChhHHHHHhhhccCCCCccccccccccc
Q psy11980 331 KCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAFI 368 (831)
Q Consensus 331 ~C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~F~ 368 (831)
.|+.|++.|-.. +..|..|+.||++|.
T Consensus 11 idPetg~KFYDL-----------NrdPiVsPytG~s~P 37 (129)
T COG4530 11 IDPETGKKFYDL-----------NRDPIVSPYTGKSYP 37 (129)
T ss_pred cCccccchhhcc-----------CCCccccCcccccch
Confidence 588888888542 356888999999884
No 211
>KOG2907|consensus
Probab=23.45 E-value=59 Score=28.93 Aligned_cols=46 Identities=22% Similarity=0.291 Sum_probs=24.9
Q ss_pred ccccCCcccCcccccccCchHHHHHhhhccccccccccccccCcccCC
Q psy11980 37 EKTVPGHLCKGCYRIFTSQGYMLLSYKSKEEEKQICTYIKVCGRVFNR 84 (831)
Q Consensus 37 e~~~~~~~C~~C~~~F~~~~~l~~H~~~~~~~~~~~~~C~~C~k~f~~ 84 (831)
+.++...+|+.||-.=-+.. .+-+|+-.+....-|.|+.|+..|..
T Consensus 69 ~ga~I~~kCpkCghe~m~Y~--T~QlRSADEGQTVFYTC~kC~~k~~e 114 (116)
T KOG2907|consen 69 DGAVIKHKCPKCGHEEMSYH--TLQLRSADEGQTVFYTCPKCKYKFTE 114 (116)
T ss_pred cccchhccCcccCCchhhhh--hhhcccccCCceEEEEcCccceeeec
Confidence 44556678888885322221 22233333333334778888877753
No 212
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=23.40 E-value=32 Score=29.47 Aligned_cols=12 Identities=17% Similarity=0.448 Sum_probs=7.1
Q ss_pred Cccccccccccc
Q psy11980 357 PKSCEVCGRAFI 368 (831)
Q Consensus 357 py~C~~C~~~F~ 368 (831)
.+.|..|++.|.
T Consensus 54 IW~C~~C~~~~A 65 (90)
T PTZ00255 54 IWRCKGCKKTVA 65 (90)
T ss_pred EEEcCCCCCEEe
Confidence 456666666554
No 213
>TIGR01762 chlorin-enz chlorinating enzymes. This model represents a a group of highly homologous enzymes related to dioxygenases which chlorinate amino acid methyl groups. BarB1 and BarB2 are proposed to trichlorinate one of the methyl groups of a leucine residue in the biosynthesis of barbamide in the cyanobacterium Lyngbya majuscula. SyrB2 is proposed to chlorinate the methyl group of threonine in the biosynthesis of syringomycin in Pseudomonas syringae. CmaB is proposed to chlorinate the beta-methyl group of alloisoleucine in the process of ring closure in the biosynthesis of coronamic acid, a component of coronatine also in Pseudomonas syringae.
Probab=22.84 E-value=98 Score=33.06 Aligned_cols=34 Identities=9% Similarity=0.068 Sum_probs=29.8
Q ss_pred HHHHHHhhhcCCeEEEcCCCChhHHHHHHHHHHh
Q psy11980 640 IENVIRDMDQYGLCVVDNFLGPERGMAVLHEVLG 673 (831)
Q Consensus 640 ~~~~~~~l~~~g~~~id~~l~~~~~~~l~~~~~~ 673 (831)
.++.++..+++||+++.++|+++.+.+|+.++..
T Consensus 4 t~eqi~~y~e~Gyv~~~~~~s~eei~~L~~~~~~ 37 (288)
T TIGR01762 4 TAEERQSFEKNGFIGPFTLYSPEEMKETWKRIRL 37 (288)
T ss_pred CHHHHHHHHhCCEEeCcCCCCHHHHHHHHHHHHH
Confidence 3556888999999999999999999999988864
No 214
>KOG3290|consensus
Probab=22.49 E-value=38 Score=34.92 Aligned_cols=30 Identities=20% Similarity=0.273 Sum_probs=25.1
Q ss_pred ccccCCccceeeecCCCCccccccccccee
Q psy11980 593 VIRDMDQYGLCVLDDFLGPERDMHLDTNVV 622 (831)
Q Consensus 593 i~~~l~~~g~~v~d~fl~~~~~~~l~~~~~ 622 (831)
..+.+.+.||.|+.+|+.++.+..|..+..
T Consensus 8 ~~~~f~~~Gflv~e~f~~~~e~~~L~~~~~ 37 (286)
T KOG3290|consen 8 QLVKFQTDGFLVIEGFLSAEEIDVLSQEQR 37 (286)
T ss_pred HHhhcccCceEEEeccccHHHHHHHHHHHH
Confidence 457788999999999999998888866654
No 215
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=21.93 E-value=50 Score=28.09 Aligned_cols=12 Identities=33% Similarity=0.872 Sum_probs=5.7
Q ss_pred CccCCCCCCccC
Q psy11980 329 PHKCPICTKGFT 340 (831)
Q Consensus 329 ~~~C~~C~k~F~ 340 (831)
|-.|..||+.|.
T Consensus 58 Pa~CkkCGfef~ 69 (97)
T COG3357 58 PARCKKCGFEFR 69 (97)
T ss_pred ChhhcccCcccc
Confidence 344555555444
No 216
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=21.52 E-value=61 Score=29.75 Aligned_cols=25 Identities=12% Similarity=0.082 Sum_probs=20.5
Q ss_pred cccCcccccccCchHHHHHhhhcccccc
Q psy11980 43 HLCKGCYRIFTSQGYMLLSYKSKEEEKQ 70 (831)
Q Consensus 43 ~~C~~C~~~F~~~~~l~~H~~~~~~~~~ 70 (831)
..|-+||+.|.+ |++|+.+|.+-.+
T Consensus 77 IicLEDGkkfKS---LKRHL~t~~gmTP 101 (148)
T COG4957 77 IICLEDGKKFKS---LKRHLTTHYGLTP 101 (148)
T ss_pred EEEeccCcchHH---HHHHHhcccCCCH
Confidence 679999999976 8899999877544
No 217
>PF12907 zf-met2: Zinc-binding
Probab=21.49 E-value=44 Score=24.03 Aligned_cols=12 Identities=25% Similarity=0.315 Sum_probs=5.1
Q ss_pred HHHhhhhhcccC
Q psy11980 287 RLELHMKYIHEG 298 (831)
Q Consensus 287 ~L~~H~~~~H~~ 298 (831)
.|..|....|..
T Consensus 19 ~L~eH~enKHpK 30 (40)
T PF12907_consen 19 QLKEHAENKHPK 30 (40)
T ss_pred HHHHHHHccCCC
Confidence 344444444433
No 218
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=21.48 E-value=32 Score=29.48 Aligned_cols=13 Identities=31% Similarity=0.716 Sum_probs=8.0
Q ss_pred CcccccccccccC
Q psy11980 357 PKSCEVCGRAFIR 369 (831)
Q Consensus 357 py~C~~C~~~F~~ 369 (831)
.+.|..|++.|.-
T Consensus 53 IW~C~~C~~~~AG 65 (91)
T TIGR00280 53 IWTCRKCGAKFAG 65 (91)
T ss_pred EEEcCCCCCEEeC
Confidence 4666666666653
No 219
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=20.90 E-value=44 Score=31.49 Aligned_cols=35 Identities=17% Similarity=0.435 Sum_probs=29.4
Q ss_pred ccccccCcCCcccccCCccccccCCCcccccccCccC
Q psy11980 425 ENLLRCSKCKVTFYCSKQHQTLDWKAHKINCNILSNQ 461 (831)
Q Consensus 425 e~~~~C~~C~~~f~~~~~h~~~h~~~~~~~c~~c~~~ 461 (831)
.-+|.|. |+.-|-....|-.+--|+ .|.|..|+..
T Consensus 115 ~~~Y~C~-C~q~~l~~RRhn~~~~g~-~YrC~~C~gk 149 (156)
T COG3091 115 TYPYRCQ-CQQHYLRIRRHNTVRRGE-VYRCGKCGGK 149 (156)
T ss_pred ceeEEee-cCCccchhhhcccccccc-eEEeccCCce
Confidence 5679999 999998888887777777 8999999854
No 220
>PF09416 UPF1_Zn_bind: RNA helicase (UPF2 interacting domain); InterPro: IPR018999 UPF1 (or regulator of nonsense transcripts 1 homologue) is an essential RNA helicase that detects mRNAs containing premature stop codons and triggers their degradation. This domain contains 3 zinc binding motifs and forms interactions with another protein (UPF2) that is also involved nonsense-mediated mRNA decay (NMD) []. ; GO: 0003677 DNA binding, 0004386 helicase activity, 0005524 ATP binding, 0008270 zinc ion binding, 0000184 nuclear-transcribed mRNA catabolic process, nonsense-mediated decay, 0005737 cytoplasm; PDB: 2IYK_B 2WJY_A 2WJV_A 2XZL_A.
Probab=20.71 E-value=73 Score=30.25 Aligned_cols=17 Identities=35% Similarity=0.768 Sum_probs=8.2
Q ss_pred cccCCCCccCccccccc
Q psy11980 295 IHEGADPYQCDICRKTF 311 (831)
Q Consensus 295 ~H~~~k~~~C~~C~k~F 311 (831)
.|...-..+|..|++-|
T Consensus 8 ~~~p~~vv~C~~c~kWF 24 (152)
T PF09416_consen 8 IHDPSCVVKCNTCNKWF 24 (152)
T ss_dssp ---CCCEEEETTTTEEE
T ss_pred CCCcccEeEcCCCCcEe
Confidence 34444455666666655
No 221
>KOG1280|consensus
Probab=20.33 E-value=1e+02 Score=33.12 Aligned_cols=64 Identities=14% Similarity=0.355 Sum_probs=0.0
Q ss_pred ccCCCCccCcccccccCChhHHHHHHhhhcCCCCcc--CCCCCCccCChhHHHHHhhhccCCCCcccccccccccChhHH
Q psy11980 296 HEGADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHK--CPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAFIRQDCL 373 (831)
Q Consensus 296 H~~~k~~~C~~C~k~F~~~~~L~~H~~~H~~~k~~~--C~~C~k~F~~~~~L~~H~~~H~~~~py~C~~C~~~F~~~~~L 373 (831)
|.....|.|++|++.-.+...|..|...-|.+-++. |++|. -.++.|+.|.+.-......
T Consensus 74 ~y~~qSftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~------------------~~~~~qp~~~~~~~~~~~~ 135 (381)
T KOG1280|consen 74 HYDPQSFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA------------------ANPEMQPIHSKETENLSVH 135 (381)
T ss_pred ccccccccCCcccccccchhHHHHHhhhcCcccCcceeeeccc------------------cCcccCchhhhhhhhhhhh
Q ss_pred HHHH
Q psy11980 374 IRHL 377 (831)
Q Consensus 374 ~~H~ 377 (831)
..|+
T Consensus 136 ~~~~ 139 (381)
T KOG1280|consen 136 WTEI 139 (381)
T ss_pred hhhh
No 222
>COG1660 Predicted P-loop-containing kinase [General function prediction only]
Probab=20.10 E-value=41 Score=34.91 Aligned_cols=57 Identities=18% Similarity=0.320 Sum_probs=35.2
Q ss_pred cccCCccceeeecCCCCcccccccccceee----eCCCCCC--CCcchhhhhHHHHHHhhhcCC
Q psy11980 594 IRDMDQYGLCVLDDFLGPERDMHLDTNVVV----VPPFHHT--NNDRVHSDMIENVIRDMDQYG 651 (831)
Q Consensus 594 ~~~l~~~g~~v~d~fl~~~~~~~l~~~~~~----~~~~~~~--~~~~~~~~~~~~~~~~l~~~g 651 (831)
+..|.+-||.++|| ||+++.-++.+-+.. ....... .+.+.....+.++++.|++.+
T Consensus 18 l~~lEDlGyycvDN-LPp~Llp~~~~~~~~~~~~~~kvAv~iDiRs~~~~~~l~~~l~~l~~~~ 80 (286)
T COG1660 18 LRVLEDLGYYCVDN-LPPQLLPKLADLMLTLESRITKVAVVIDVRSREFFGDLEEVLDELKDNG 80 (286)
T ss_pred HHHHHhcCeeeecC-CCHHHHHHHHHHHhhcccCCceEEEEEecccchhHHHHHHHHHHHHhcC
Confidence 56677889999999 788888777662210 1111111 122344456677888888774
Done!