Query psy679
Match_columns 510
No_of_seqs 338 out of 1804
Neff 5.1
Searched_HMMs 46136
Date Fri Aug 16 18:25:51 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy679.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/679hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2462|consensus 99.8 3.8E-19 8.2E-24 177.1 7.3 110 371-484 159-268 (279)
2 KOG2462|consensus 99.7 1.1E-17 2.3E-22 166.8 2.6 83 369-453 183-265 (279)
3 KOG3576|consensus 99.5 4.9E-15 1.1E-19 142.5 4.8 96 370-467 114-220 (267)
4 KOG3623|consensus 99.5 1.6E-14 3.4E-19 157.8 2.3 90 371-462 892-982 (1007)
5 KOG3576|consensus 99.3 3.3E-12 7.2E-17 123.2 4.8 101 358-460 130-241 (267)
6 KOG3623|consensus 99.2 3.4E-12 7.3E-17 140.0 2.0 80 371-452 238-330 (1007)
7 KOG1074|consensus 99.0 2.9E-10 6.3E-15 126.8 6.1 53 373-427 353-405 (958)
8 KOG1074|consensus 98.9 1.4E-10 3.1E-15 129.2 0.1 84 371-456 603-693 (958)
9 PHA00733 hypothetical protein 98.8 2.9E-09 6.2E-14 96.9 3.6 85 368-456 35-124 (128)
10 KOG3608|consensus 98.7 2.6E-09 5.6E-14 110.1 1.0 101 363-467 197-300 (467)
11 KOG3608|consensus 98.7 5.7E-09 1.2E-13 107.7 2.9 102 369-475 233-337 (467)
12 PLN03086 PRLI-interacting fact 98.6 3.2E-08 6.9E-13 108.9 5.4 95 370-472 450-554 (567)
13 PHA02768 hypothetical protein; 98.5 2.5E-08 5.5E-13 78.2 0.7 42 374-419 6-47 (55)
14 PHA02768 hypothetical protein; 98.3 2.7E-07 5.9E-12 72.5 2.6 44 403-448 5-48 (55)
15 PF13465 zf-H2C2_2: Zinc-finge 98.1 7.1E-07 1.5E-11 59.7 1.1 24 391-414 2-25 (26)
16 PF13465 zf-H2C2_2: Zinc-finge 98.1 2.1E-06 4.5E-11 57.5 2.5 26 418-443 1-26 (26)
17 PLN03086 PRLI-interacting fact 98.1 4.1E-06 8.8E-11 92.6 5.7 78 372-455 477-564 (567)
18 PHA00733 hypothetical protein 98.0 6.6E-06 1.4E-10 75.0 4.0 56 369-428 69-124 (128)
19 KOG3993|consensus 97.9 2.3E-06 4.9E-11 90.5 0.2 87 373-461 267-386 (500)
20 COG5189 SFP1 Putative transcri 97.9 6.2E-06 1.3E-10 84.6 2.4 73 370-452 346-419 (423)
21 PHA00732 hypothetical protein 97.7 1.8E-05 4E-10 66.5 1.9 47 373-427 1-48 (79)
22 PHA00616 hypothetical protein 97.6 1.9E-05 4.2E-10 59.5 1.2 31 431-461 1-31 (44)
23 KOG3993|consensus 97.4 4.6E-05 1E-09 80.8 1.1 53 373-427 295-380 (500)
24 PHA00616 hypothetical protein 97.3 9.9E-05 2.1E-09 55.7 1.6 34 373-408 1-34 (44)
25 PHA00732 hypothetical protein 97.3 0.00016 3.4E-09 61.0 2.9 48 403-456 1-49 (79)
26 PF00096 zf-C2H2: Zinc finger, 97.2 0.0003 6.6E-09 44.9 2.3 23 432-454 1-23 (23)
27 PF05605 zf-Di19: Drought indu 97.0 0.00059 1.3E-08 53.0 3.0 49 404-455 3-53 (54)
28 PF13894 zf-C2H2_4: C2H2-type 96.7 0.0012 2.7E-08 41.5 2.5 24 432-455 1-24 (24)
29 PF12756 zf-C2H2_2: C2H2 type 96.7 0.0013 2.8E-08 55.0 3.1 74 375-455 1-74 (100)
30 PF05605 zf-Di19: Drought indu 96.5 0.0028 6E-08 49.2 3.3 51 373-428 2-54 (54)
31 PF13912 zf-C2H2_6: C2H2-type 96.4 0.0015 3.2E-08 43.3 1.1 26 431-456 1-26 (27)
32 PF00096 zf-C2H2: Zinc finger, 96.1 0.0036 7.8E-08 39.9 1.9 23 374-398 1-23 (23)
33 PRK04860 hypothetical protein; 95.8 0.0038 8.2E-08 59.3 1.2 37 403-443 119-155 (160)
34 smart00355 ZnF_C2H2 zinc finge 95.7 0.0099 2.1E-07 37.6 2.5 24 432-455 1-24 (26)
35 PF09237 GAGA: GAGA factor; I 95.3 0.019 4E-07 44.9 3.3 33 427-459 20-52 (54)
36 COG5048 FOG: Zn-finger [Genera 94.8 0.0089 1.9E-07 60.7 0.3 73 372-446 288-368 (467)
37 PF13894 zf-C2H2_4: C2H2-type 94.7 0.026 5.7E-07 35.3 2.3 23 374-398 1-23 (24)
38 COG5048 FOG: Zn-finger [Genera 94.3 0.022 4.7E-07 57.9 1.8 92 367-458 313-415 (467)
39 PRK04860 hypothetical protein; 94.2 0.031 6.7E-07 53.1 2.4 40 372-417 118-157 (160)
40 PF13912 zf-C2H2_6: C2H2-type 93.9 0.028 6E-07 37.1 1.2 25 373-399 1-25 (27)
41 PF12874 zf-met: Zinc-finger o 93.0 0.052 1.1E-06 35.2 1.3 23 432-454 1-23 (25)
42 KOG1146|consensus 93.0 0.23 4.9E-06 59.9 7.3 80 368-453 460-540 (1406)
43 PF13909 zf-H2C2_5: C2H2-type 92.7 0.12 2.5E-06 33.4 2.6 24 432-456 1-24 (24)
44 smart00355 ZnF_C2H2 zinc finge 92.5 0.13 2.9E-06 32.2 2.7 24 374-399 1-24 (26)
45 KOG4173|consensus 92.4 0.039 8.3E-07 54.4 0.1 82 371-455 77-171 (253)
46 PF09237 GAGA: GAGA factor; I 90.5 0.35 7.5E-06 38.0 3.5 32 370-403 21-52 (54)
47 PF12171 zf-C2H2_jaz: Zinc-fin 89.0 0.1 2.2E-06 34.8 -0.4 22 432-453 2-23 (27)
48 PF12874 zf-met: Zinc-finger o 87.2 0.33 7.2E-06 31.3 1.2 23 374-398 1-23 (25)
49 COG5189 SFP1 Putative transcri 86.5 0.26 5.6E-06 51.6 0.7 40 429-468 347-407 (423)
50 COG5236 Uncharacterized conser 86.5 0.58 1.3E-05 49.5 3.2 80 373-454 151-243 (493)
51 KOG2186|consensus 86.4 0.44 9.5E-06 48.4 2.2 50 404-456 4-53 (276)
52 KOG4124|consensus 84.5 0.61 1.3E-05 49.4 2.2 72 370-451 346-418 (442)
53 KOG2231|consensus 84.5 0.76 1.7E-05 52.5 3.2 18 381-398 188-205 (669)
54 PF12171 zf-C2H2_jaz: Zinc-fin 83.9 0.51 1.1E-05 31.4 0.9 22 404-425 2-23 (27)
55 smart00451 ZnF_U1 U1-like zinc 83.5 0.95 2.1E-05 31.3 2.2 23 431-453 3-25 (35)
56 PF13913 zf-C2HC_2: zinc-finge 82.9 1.1 2.3E-05 29.8 2.2 21 432-453 3-23 (25)
57 KOG2231|consensus 80.3 1.3 2.9E-05 50.6 3.1 13 386-398 125-138 (669)
58 PF13909 zf-H2C2_5: C2H2-type 80.2 1.5 3.3E-05 28.1 2.2 23 374-399 1-23 (24)
59 PF09986 DUF2225: Uncharacteri 76.9 1.2 2.7E-05 43.9 1.4 42 402-443 4-60 (214)
60 KOG2893|consensus 75.6 1.5 3.3E-05 44.4 1.6 46 371-423 9-54 (341)
61 KOG1146|consensus 72.1 1.6 3.4E-05 53.1 0.9 78 373-457 1260-1354(1406)
62 PF09538 FYDLN_acid: Protein o 71.5 2.5 5.4E-05 37.9 1.8 30 404-444 10-39 (108)
63 KOG2893|consensus 70.0 1.1 2.3E-05 45.5 -0.9 44 402-450 10-53 (341)
64 KOG4167|consensus 67.7 2.6 5.7E-05 48.5 1.4 26 431-456 792-817 (907)
65 PF12013 DUF3505: Protein of u 67.2 7.3 0.00016 34.1 3.9 25 432-456 81-109 (109)
66 COG5236 Uncharacterized conser 65.2 3 6.4E-05 44.4 1.1 68 381-453 226-303 (493)
67 PRK00464 nrdR transcriptional 64.1 3.4 7.4E-05 39.2 1.2 14 404-417 29-42 (154)
68 cd00350 rubredoxin_like Rubred 64.0 4.6 0.0001 28.4 1.6 10 430-439 16-25 (33)
69 COG4049 Uncharacterized protei 63.2 4 8.6E-05 32.8 1.2 29 425-453 11-39 (65)
70 smart00451 ZnF_U1 U1-like zinc 62.7 6.3 0.00014 27.1 2.1 23 373-397 3-25 (35)
71 COG1997 RPL43A Ribosomal prote 62.5 4.5 9.7E-05 35.1 1.5 32 402-443 34-65 (89)
72 PF12756 zf-C2H2_2: C2H2 type 62.4 6.2 0.00014 32.6 2.4 24 373-398 50-73 (100)
73 TIGR00622 ssl1 transcription f 61.2 12 0.00026 33.9 4.0 80 372-455 14-105 (112)
74 TIGR02300 FYDLN_acid conserved 57.8 6.9 0.00015 36.2 2.0 33 404-447 10-42 (129)
75 PF02892 zf-BED: BED zinc fing 56.7 7.4 0.00016 28.5 1.7 26 428-453 13-42 (45)
76 smart00614 ZnF_BED BED zinc fi 54.3 9 0.0002 29.1 1.8 24 432-455 19-48 (50)
77 KOG2482|consensus 53.8 8.3 0.00018 41.1 2.0 81 373-455 195-303 (423)
78 smart00531 TFIIE Transcription 53.3 13 0.00028 34.6 3.1 38 401-442 97-134 (147)
79 TIGR02098 MJ0042_CXXC MJ0042 f 52.0 11 0.00024 26.8 1.9 34 404-442 3-36 (38)
80 PF05443 ROS_MUCR: ROS/MUCR tr 52.0 8.2 0.00018 35.8 1.5 27 429-458 70-96 (132)
81 PF13719 zinc_ribbon_5: zinc-r 47.9 15 0.00032 26.6 2.0 34 404-442 3-36 (37)
82 TIGR00373 conserved hypothetic 46.9 15 0.00033 34.7 2.6 34 400-442 106-139 (158)
83 PRK09678 DNA-binding transcrip 46.0 9.7 0.00021 31.8 0.9 14 429-442 25-40 (72)
84 PRK13185 chlL protochlorophyll 45.8 3.4 7.3E-05 41.2 -2.2 18 29-46 7-24 (270)
85 PF04959 ARS2: Arsenite-resist 44.7 20 0.00042 35.9 3.0 33 428-460 74-106 (214)
86 PF09723 Zn-ribbon_8: Zinc rib 43.3 13 0.00029 27.5 1.2 29 404-439 6-34 (42)
87 PF13717 zinc_ribbon_4: zinc-r 43.1 22 0.00047 25.6 2.2 33 404-441 3-35 (36)
88 smart00834 CxxC_CXXC_SSSS Puta 43.1 12 0.00026 26.7 0.9 30 404-440 6-35 (41)
89 KOG2186|consensus 42.8 11 0.00025 38.5 1.0 47 374-425 4-50 (276)
90 KOG2785|consensus 42.0 18 0.00039 39.1 2.4 73 374-453 167-242 (390)
91 cd00729 rubredoxin_SM Rubredox 41.5 17 0.00038 25.8 1.5 25 403-439 2-26 (34)
92 COG1592 Rubrerythrin [Energy p 41.2 17 0.00038 35.0 2.0 25 402-439 133-157 (166)
93 KOG0978|consensus 40.9 6.6 0.00014 45.4 -1.1 21 430-450 677-697 (698)
94 cd02032 Bchl_like This family 40.8 4.4 9.6E-05 40.3 -2.2 18 29-46 5-22 (267)
95 TIGR02605 CxxC_CxxC_SSSS putat 40.3 14 0.0003 28.0 1.0 11 404-414 6-16 (52)
96 KOG2593|consensus 40.3 22 0.00047 39.0 2.7 38 400-440 125-162 (436)
97 PRK13231 nitrogenase reductase 40.2 6 0.00013 39.3 -1.4 18 29-46 7-24 (264)
98 COG4957 Predicted transcriptio 40.1 15 0.00033 34.4 1.3 25 432-459 77-101 (148)
99 PRK00398 rpoP DNA-directed RNA 40.0 18 0.00039 27.1 1.5 12 403-414 3-14 (46)
100 PF14353 CpXC: CpXC protein 38.3 8.1 0.00018 34.7 -0.7 19 404-422 39-57 (128)
101 PRK06266 transcription initiat 37.7 23 0.00049 34.3 2.2 34 401-443 115-148 (178)
102 PRK13235 nifH nitrogenase redu 37.6 5.2 0.00011 40.1 -2.3 38 29-85 6-43 (274)
103 KOG2461|consensus 37.3 1.8E+02 0.0039 31.8 9.1 55 384-438 340-394 (396)
104 COG4049 Uncharacterized protei 36.9 9.6 0.00021 30.7 -0.4 27 369-397 13-39 (65)
105 COG4957 Predicted transcriptio 36.2 37 0.00081 31.9 3.2 26 372-402 75-100 (148)
106 cd02117 NifH_like This family 36.0 5.6 0.00012 38.3 -2.3 18 29-46 5-22 (212)
107 PF01780 Ribosomal_L37ae: Ribo 36.0 15 0.00032 32.1 0.5 31 402-442 34-64 (90)
108 KOG2071|consensus 35.1 32 0.00068 39.1 3.0 27 371-399 416-442 (579)
109 cd02040 NifH NifH gene encodes 34.9 6 0.00013 39.0 -2.3 17 30-46 7-23 (270)
110 smart00659 RPOLCX RNA polymera 34.3 32 0.00068 26.0 2.0 11 404-414 3-13 (44)
111 PF09986 DUF2225: Uncharacteri 33.3 8.8 0.00019 38.0 -1.4 44 371-416 3-61 (214)
112 COG1192 Soj ATPases involved i 32.8 6.9 0.00015 38.7 -2.3 36 29-82 8-43 (259)
113 COG4530 Uncharacterized protei 32.6 25 0.00055 31.9 1.5 28 405-443 11-38 (129)
114 KOG2482|consensus 32.4 45 0.00098 35.8 3.5 23 431-453 195-217 (423)
115 PRK13230 nitrogenase reductase 32.3 7.3 0.00016 39.2 -2.2 18 29-46 6-23 (279)
116 PRK13233 nifH nitrogenase redu 32.1 7.9 0.00017 38.7 -2.0 17 30-46 8-24 (275)
117 PF12760 Zn_Tnp_IS1595: Transp 31.8 73 0.0016 23.8 3.6 38 393-439 8-45 (46)
118 TIGR01287 nifH nitrogenase iro 31.8 9.7 0.00021 38.1 -1.5 18 29-46 5-22 (275)
119 COG1198 PriA Primosomal protei 31.1 32 0.00069 40.3 2.4 13 428-440 472-484 (730)
120 KOG2785|consensus 30.5 38 0.00083 36.6 2.6 24 402-425 216-242 (390)
121 CHL00072 chlL photochlorophyll 29.9 8.9 0.00019 39.4 -2.1 18 29-46 5-22 (290)
122 PF04959 ARS2: Arsenite-resist 29.5 11 0.00024 37.6 -1.4 28 402-429 76-103 (214)
123 COG1996 RPC10 DNA-directed RNA 29.3 34 0.00073 26.7 1.5 11 403-413 6-16 (49)
124 COG5151 SSL1 RNA polymerase II 29.0 52 0.0011 34.9 3.2 49 405-455 364-412 (421)
125 PF09538 FYDLN_acid: Protein o 28.7 50 0.0011 29.6 2.7 30 374-416 10-39 (108)
126 TIGR03371 cellulose_yhjQ cellu 28.4 9.3 0.0002 37.1 -2.2 18 29-46 7-24 (246)
127 PF01656 CbiA: CobQ/CobB/MinD/ 28.4 11 0.00024 34.7 -1.5 18 29-46 4-21 (195)
128 PRK10037 cell division protein 28.3 9.6 0.00021 37.7 -2.1 18 29-46 7-24 (250)
129 PF05443 ROS_MUCR: ROS/MUCR tr 28.2 39 0.00084 31.4 2.0 30 368-402 67-96 (132)
130 KOG0782|consensus 27.5 20 0.00044 40.6 -0.0 54 387-445 237-290 (1004)
131 COG3677 Transposase and inacti 27.0 62 0.0013 29.8 3.1 47 391-444 20-66 (129)
132 KOG3598|consensus 26.1 19 0.0004 44.8 -0.6 20 33-52 1846-1865(2220)
133 PHA02518 ParA-like protein; Pr 25.7 12 0.00025 35.3 -2.0 18 29-46 6-23 (211)
134 PHA00626 hypothetical protein 25.4 43 0.00093 27.0 1.5 14 430-443 22-35 (59)
135 PF09818 ABC_ATPase: Predicted 25.0 35 0.00077 37.7 1.3 80 25-113 245-330 (448)
136 KOG4377|consensus 24.7 34 0.00074 37.4 1.1 81 374-457 314-429 (480)
137 COG1327 Predicted transcriptio 24.4 36 0.00078 32.5 1.0 29 444-472 63-91 (156)
138 cd02037 MRP-like MRP (Multiple 24.3 18 0.00039 33.4 -1.0 18 29-46 5-22 (169)
139 COG3091 SprT Zn-dependent meta 24.1 43 0.00094 32.0 1.5 31 403-438 117-147 (156)
140 TIGR00373 conserved hypothetic 24.1 40 0.00086 31.9 1.3 33 371-414 107-139 (158)
141 PF10263 SprT-like: SprT-like 23.9 35 0.00076 31.2 0.9 32 403-442 123-154 (157)
142 KOG2071|consensus 23.6 46 0.001 37.8 1.9 31 428-458 415-445 (579)
143 smart00440 ZnF_C2C2 C2C2 Zinc 23.4 36 0.00077 25.1 0.6 12 431-442 28-39 (40)
144 PTZ00255 60S ribosomal protein 23.2 41 0.00089 29.4 1.1 32 402-443 35-66 (90)
145 PRK14890 putative Zn-ribbon RN 23.2 60 0.0013 26.3 1.9 31 403-439 25-56 (59)
146 TIGR01281 DPOR_bchL light-inde 23.1 17 0.00037 36.1 -1.5 17 30-46 6-22 (268)
147 PF01927 Mut7-C: Mut7-C RNAse 23.1 64 0.0014 29.9 2.4 47 404-451 92-144 (147)
148 PF15135 UPF0515: Uncharacteri 22.8 62 0.0013 33.3 2.4 62 366-442 105-166 (278)
149 cd02036 MinD Bacterial cell di 22.7 18 0.00039 32.9 -1.3 18 29-46 5-22 (179)
150 TIGR01969 minD_arch cell divis 22.6 19 0.0004 34.9 -1.3 18 29-46 6-23 (251)
151 smart00734 ZnF_Rad18 Rad18-lik 22.5 76 0.0016 21.3 2.0 20 433-453 3-22 (26)
152 PRK06266 transcription initiat 22.5 44 0.00096 32.3 1.3 33 372-415 116-148 (178)
153 TIGR00244 transcriptional regu 22.4 43 0.00093 31.8 1.1 15 402-416 27-41 (147)
154 COG3357 Predicted transcriptio 22.3 45 0.00098 29.3 1.1 27 402-438 57-83 (97)
155 TIGR00280 L37a ribosomal prote 22.1 41 0.00088 29.5 0.8 32 402-443 34-65 (91)
156 COG0489 Mrp ATPases involved i 21.6 20 0.00043 36.5 -1.3 18 28-45 62-79 (265)
157 cd03110 Fer4_NifH_child This p 21.3 20 0.00043 33.2 -1.3 17 29-45 5-21 (179)
158 PF06524 NOA36: NOA36 protein; 21.0 38 0.00083 35.0 0.5 84 368-455 137-233 (314)
159 PRK14873 primosome assembly pr 20.7 29 0.00062 40.2 -0.5 27 398-440 405-431 (665)
160 PRK03824 hypA hydrogenase nick 20.7 55 0.0012 30.2 1.5 13 403-415 70-82 (135)
161 PF05191 ADK_lid: Adenylate ki 20.6 40 0.00086 24.4 0.4 31 404-442 2-32 (36)
162 COG1655 Uncharacterized protei 20.6 38 0.00082 34.5 0.4 40 401-440 17-71 (267)
163 PRK13234 nifH nitrogenase redu 20.2 17 0.00037 37.3 -2.2 37 30-85 10-46 (295)
164 PF01363 FYVE: FYVE zinc finge 20.0 56 0.0012 26.0 1.2 13 402-414 24-36 (69)
165 PRK13849 putative crown gall t 20.0 17 0.00037 36.2 -2.2 18 29-46 7-24 (231)
No 1
>KOG2462|consensus
Probab=99.77 E-value=3.8e-19 Score=177.07 Aligned_cols=110 Identities=32% Similarity=0.639 Sum_probs=100.6
Q ss_pred CCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHHHH
Q psy679 371 ERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKKRH 450 (510)
Q Consensus 371 EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLkrH 450 (510)
.+-|.|+ .|+|.|.+-..|+.|+|+|+ -+++|.+|||.|.+.--|..|+|+|||||||.|..|+|+|+.+++|+.|
T Consensus 159 ~ka~~C~--~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRAH 234 (279)
T KOG2462|consen 159 KKAFSCK--YCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRAH 234 (279)
T ss_pred cccccCC--CCCceeeehHHHhhHhhccC--CCcccccccccccchHHhhcccccccCCCCccCCcccchhcchHHHHHH
Confidence 5678895 59999999999999999997 5899999999999999999999999999999999999999999999999
Q ss_pred HHHhhcccchhHHHHHHHHHHHHHHHHHHHhhhh
Q psy679 451 AKVHLKQRSKKESKMAVMMSQQQAQQQQQQNHVH 484 (510)
Q Consensus 451 lRtHtgeK~~k~~~~~~~~~QQQ~qQQqQQN~~~ 484 (510)
|++|.+.|.|+|..|.+.|+..--...+......
T Consensus 235 mQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES~C~ 268 (279)
T KOG2462|consen 235 MQTHSDVKKHQCPRCGKSFALKSYLNKHSESACL 268 (279)
T ss_pred HHhhcCCccccCcchhhHHHHHHHHHHhhhhccc
Confidence 9999999999999999999988766666554433
No 2
>KOG2462|consensus
Probab=99.68 E-value=1.1e-17 Score=166.79 Aligned_cols=83 Identities=37% Similarity=0.779 Sum_probs=77.7
Q ss_pred CCCCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHH
Q psy679 369 VHERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKK 448 (510)
Q Consensus 369 ~~EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLk 448 (510)
++.-+++|.+ |||.|.+..-|+.|+|+|+|||||.|..|+|+|..+.+|+.|+++|.+.|+|.|..|+|.|++.+.|.
T Consensus 183 TH~l~c~C~i--CGKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLn 260 (279)
T KOG2462|consen 183 THTLPCECGI--CGKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLN 260 (279)
T ss_pred ccCCCccccc--ccccccchHHhhcccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHH
Confidence 3446778855 99999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHH
Q psy679 449 RHAKV 453 (510)
Q Consensus 449 rHlRt 453 (510)
+|...
T Consensus 261 KH~ES 265 (279)
T KOG2462|consen 261 KHSES 265 (279)
T ss_pred Hhhhh
Confidence 99754
No 3
>KOG3576|consensus
Probab=99.53 E-value=4.9e-15 Score=142.50 Aligned_cols=96 Identities=26% Similarity=0.582 Sum_probs=85.8
Q ss_pred CCCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHHH
Q psy679 370 HERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKKR 449 (510)
Q Consensus 370 ~EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLkr 449 (510)
+...|.|.+ |+|.|.-..-|.+|++.|...|.|-|..|+|.|....+|++|+|+|+|.+||+|..|+|+|.++..|..
T Consensus 114 d~d~ftCrv--CgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsles 191 (267)
T KOG3576|consen 114 DQDSFTCRV--CGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLES 191 (267)
T ss_pred CCCeeeeeh--hhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHH
Confidence 456799977 999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHhhc-----------ccchhHHHHHH
Q psy679 450 HAKVHLK-----------QRSKKESKMAV 467 (510)
Q Consensus 450 HlRtHtg-----------eK~~k~~~~~~ 467 (510)
|++.-++ .|.|.|++|..
T Consensus 192 hl~kvhgv~~~yaykerr~kl~vcedcg~ 220 (267)
T KOG3576|consen 192 HLKKVHGVQHQYAYKERRAKLYVCEDCGY 220 (267)
T ss_pred HHHHHcCchHHHHHHHhhhheeeecccCC
Confidence 9975443 45566777643
No 4
>KOG3623|consensus
Probab=99.46 E-value=1.6e-14 Score=157.84 Aligned_cols=90 Identities=40% Similarity=0.844 Sum_probs=81.4
Q ss_pred CCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHHHH
Q psy679 371 ERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKKRH 450 (510)
Q Consensus 371 EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLkrH 450 (510)
+..|.| +.|.|.|...+.|.||.--|+|.|||+|.+|.|+|+.+.+|..|+|.|.|||||.|++|+|+|.....+..|
T Consensus 892 ~gmyaC--DqCDK~FqKqSSLaRHKYEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHSGSYSQH 969 (1007)
T KOG3623|consen 892 DGMYAC--DQCDKAFQKQSSLARHKYEHSGQRPYQCIICKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHSGSYSQH 969 (1007)
T ss_pred cccchH--HHHHHHHHhhHHHHHhhhhhcCCCCcccchhhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccccchHhh
Confidence 467999 679999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HH-HhhcccchhH
Q psy679 451 AK-VHLKQRSKKE 462 (510)
Q Consensus 451 lR-tHtgeK~~k~ 462 (510)
|. ...--|+|++
T Consensus 970 MNHRYSYCKpyrE 982 (1007)
T KOG3623|consen 970 MNHRYSYCKPYRE 982 (1007)
T ss_pred hccchhcccchhh
Confidence 84 3334455543
No 5
>KOG3576|consensus
Probab=99.27 E-value=3.3e-12 Score=123.18 Aligned_cols=101 Identities=28% Similarity=0.597 Sum_probs=90.4
Q ss_pred CCCCCCCCCCCCCCCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccC---------
Q psy679 358 RKYPNRPSKTPVHERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTG--------- 428 (510)
Q Consensus 358 ~~~p~~pskth~~EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtG--------- 428 (510)
.++.+++.+-|...|.|-|.+ |||.|...-+|++|+|+|+|.+||+|..|+|+|..+..|..|.+.-+|
T Consensus 130 QRmlnrh~kch~~vkr~lct~--cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~kvhgv~~~yayke 207 (267)
T KOG3576|consen 130 QRMLNRHLKCHSDVKRHLCTF--CGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLKKVHGVQHQYAYKE 207 (267)
T ss_pred HHHHHHHhhhccHHHHHHHhh--ccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHHHHcCchHHHHHHH
Confidence 346688889999999999977 999999999999999999999999999999999999999999875333
Q ss_pred --CCceecCcCCCccCChHHHHHHHHHhhcccch
Q psy679 429 --EKPFSCDVCGRKFARSDEKKRHAKVHLKQRSK 460 (510)
Q Consensus 429 --eKPfkC~~CgKaFsrsssLkrHlRtHtgeK~~ 460 (510)
+|-|.|+.||..-.+...+..|++.|+..-+.
T Consensus 208 rr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Spa 241 (267)
T KOG3576|consen 208 RRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSPA 241 (267)
T ss_pred hhhheeeecccCCCCCChhHHHHHHHhcCCCCHH
Confidence 57899999999999999999999999876554
No 6
>KOG3623|consensus
Probab=99.21 E-value=3.4e-12 Score=139.97 Aligned_cols=80 Identities=34% Similarity=0.807 Sum_probs=72.9
Q ss_pred CCCcccCCCCCCCccCChHHHHHHHHHhcC-------------CCceecccCCCccCCchhhhhhhhcccCCCceecCcC
Q psy679 371 ERPYACPVENCDRRFSRSDELTRHIRIHTG-------------QKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVC 437 (510)
Q Consensus 371 EKPy~C~~e~CgKsFssss~LkrH~rtHtg-------------eKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~C 437 (510)
+..|.|.. |..+|..+..|.+||.+|.. -|.|+|..|+|+|+.+.+|+.|+|+|.|||||.|+.|
T Consensus 238 e~nfsC~l--CsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgKAFKfKHHLKEHlRIHSGEKPfeCpnC 315 (1007)
T KOG3623|consen 238 EPNFSCML--CSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECGKAFKFKHHLKEHLRIHSGEKPFECPNC 315 (1007)
T ss_pred CCCCcchh--hhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccchhhhhHHHHHhhheeecCCCCcCCccc
Confidence 34577954 99999999999999999863 3679999999999999999999999999999999999
Q ss_pred CCccCChHHHHHHHH
Q psy679 438 GRKFARSDEKKRHAK 452 (510)
Q Consensus 438 gKaFsrsssLkrHlR 452 (510)
+|+|.....+..||.
T Consensus 316 kKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 316 KKRFSHSGSYSSHMS 330 (1007)
T ss_pred ccccccCCccccccc
Confidence 999999999999984
No 7
>KOG1074|consensus
Probab=99.01 E-value=2.9e-10 Score=126.80 Aligned_cols=53 Identities=36% Similarity=0.813 Sum_probs=42.8
Q ss_pred CcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhccc
Q psy679 373 PYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHT 427 (510)
Q Consensus 373 Py~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHt 427 (510)
+++|.+ |.|.|...+.|+.|.|.|+|+|||+|.+|+.+|..+.+|+.|...|+
T Consensus 353 khkCr~--CakvfgS~SaLqiHlRSHTGERPfqCnvCG~~FSTkGNLKvH~~rH~ 405 (958)
T KOG1074|consen 353 KHKCRF--CAKVFGSDSALQIHLRSHTGERPFQCNVCGNRFSTKGNLKVHFQRHR 405 (958)
T ss_pred cchhhh--hHhhcCchhhhhhhhhccCCCCCeeecccccccccccceeeeeeecc
Confidence 356755 88888888888888888888888888888888888888888865543
No 8
>KOG1074|consensus
Probab=98.94 E-value=1.4e-10 Score=129.17 Aligned_cols=84 Identities=36% Similarity=0.805 Sum_probs=77.2
Q ss_pred CCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCC----CceecC---cCCCccCC
Q psy679 371 ERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGE----KPFSCD---VCGRKFAR 443 (510)
Q Consensus 371 EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGe----KPfkC~---~CgKaFsr 443 (510)
..|..|.+ |-|....++.|+.|.|+|+|+|||+|.+|+++|..+.+|+.|+.+|... -.|.|. +|-+.|..
T Consensus 603 TdPNqCii--C~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn 680 (958)
T KOG1074|consen 603 TDPNQCII--CLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTN 680 (958)
T ss_pred CCccceee--eeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccc
Confidence 35788977 9999999999999999999999999999999999999999999998543 357899 99999999
Q ss_pred hHHHHHHHHHhhc
Q psy679 444 SDEKKRHAKVHLK 456 (510)
Q Consensus 444 sssLkrHlRtHtg 456 (510)
.-.|..|+|+|.+
T Consensus 681 ~V~lpQhIriH~~ 693 (958)
T KOG1074|consen 681 AVTLPQHIRIHLG 693 (958)
T ss_pred cccccceEEeecC
Confidence 9999999999994
No 9
>PHA00733 hypothetical protein
Probab=98.80 E-value=2.9e-09 Score=96.87 Aligned_cols=85 Identities=24% Similarity=0.445 Sum_probs=70.5
Q ss_pred CCCCCCcccCCCCCCCccCChHHHHHH--HH---HhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccC
Q psy679 368 PVHERPYACPVENCDRRFSRSDELTRH--IR---IHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFA 442 (510)
Q Consensus 368 h~~EKPy~C~~e~CgKsFssss~LkrH--~r---tHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFs 442 (510)
....+++.|.+ |.+.|.....|..| ++ .+.+.++|.|..|++.|.....|..|++.| +++|.|..|++.|.
T Consensus 35 ~~~~~~~~~~~--~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~ 110 (128)
T PHA00733 35 TPEQKRLIRAV--VKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFR 110 (128)
T ss_pred ChhhhhHHHHH--HhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccC
Confidence 34567899965 99998888777665 22 234588999999999999999999999987 46799999999999
Q ss_pred ChHHHHHHHHHhhc
Q psy679 443 RSDEKKRHAKVHLK 456 (510)
Q Consensus 443 rsssLkrHlRtHtg 456 (510)
....|.+|++..++
T Consensus 111 ~~~sL~~H~~~~h~ 124 (128)
T PHA00733 111 NTDSTLDHVCKKHN 124 (128)
T ss_pred CHHHHHHHHHHhcC
Confidence 99999999977654
No 10
>KOG3608|consensus
Probab=98.73 E-value=2.6e-09 Score=110.15 Aligned_cols=101 Identities=25% Similarity=0.523 Sum_probs=49.0
Q ss_pred CCCCCCCCCCCcccCCCCCCCccCChHHHHHHHHHhcC--CCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCc
Q psy679 363 RPSKTPVHERPYACPVENCDRRFSRSDELTRHIRIHTG--QKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRK 440 (510)
Q Consensus 363 ~pskth~~EKPy~C~~e~CgKsFssss~LkrH~rtHtg--eKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKa 440 (510)
++.++|++||...|+. ||..|+++..|-.|+|..+. ..+|.|..|-|.|.....|..|++.|. .-|+|+.|...
T Consensus 197 eH~r~Hs~eKvvACp~--Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHv--n~ykCplCdmt 272 (467)
T KOG3608|consen 197 EHIRTHSNEKVVACPH--CGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHV--NCYKCPLCDMT 272 (467)
T ss_pred HHHHhcCCCeEEecch--HHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhh--hcccccccccC
Confidence 4456666666666633 66666666666666654332 234444444444444444444443332 12444444444
Q ss_pred cCChHHHHHHHH-HhhcccchhHHHHHH
Q psy679 441 FARSDEKKRHAK-VHLKQRSKKESKMAV 467 (510)
Q Consensus 441 FsrsssLkrHlR-tHtgeK~~k~~~~~~ 467 (510)
....+.|.+|+| .|.+.|++||..|..
T Consensus 273 c~~~ssL~~H~r~rHs~dkpfKCd~Cd~ 300 (467)
T KOG3608|consen 273 CSSASSLTTHIRYRHSKDKPFKCDECDT 300 (467)
T ss_pred CCChHHHHHHHHhhhccCCCccccchhh
Confidence 444444444443 344444444444444
No 11
>KOG3608|consensus
Probab=98.71 E-value=5.7e-09 Score=107.68 Aligned_cols=102 Identities=26% Similarity=0.573 Sum_probs=80.2
Q ss_pred CCCCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhc-ccCCCceecCcCCCccCChHHH
Q psy679 369 VHERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRT-HTGEKPFSCDVCGRKFARSDEK 447 (510)
Q Consensus 369 ~~EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rt-HtGeKPfkC~~CgKaFsrsssL 447 (510)
....+|.| ..|-|+|.....|+.|++.|.. -|+|+.|.......+.|.+|++. |..+|||+|+.|.+.|.+.+.|
T Consensus 233 l~~n~fqC--~~C~KrFaTeklL~~Hv~rHvn--~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL 308 (467)
T KOG3608|consen 233 LNTNSFQC--AQCFKRFATEKLLKSHVVRHVN--CYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDL 308 (467)
T ss_pred hcCCchHH--HHHHHHHhHHHHHHHHHHHhhh--cccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHH
Confidence 34568999 5599999999999999988863 58899998888888889988874 7778899999999999998899
Q ss_pred HHHHHHhhcccchhHHH--HHHHHHHHHHH
Q psy679 448 KRHAKVHLKQRSKKESK--MAVMMSQQQAQ 475 (510)
Q Consensus 448 krHlRtHtgeK~~k~~~--~~~~~~QQQ~q 475 (510)
.+|+.+|. +-.|.|+. |...+....+.
T Consensus 309 ~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~ 337 (467)
T KOG3608|consen 309 AKHVQVHS-KTVYQCEHPDCHYSVRTYTQM 337 (467)
T ss_pred HHHHHhcc-ccceecCCCCCcHHHHHHHHH
Confidence 99988887 66677766 55444433333
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.62 E-value=3.2e-08 Score=108.87 Aligned_cols=95 Identities=16% Similarity=0.431 Sum_probs=79.5
Q ss_pred CCCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCC------
Q psy679 370 HERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFAR------ 443 (510)
Q Consensus 370 ~EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsr------ 443 (510)
.++.+.|. .|++.|. ...|..|++.|+ +++.|+ |++.+ .+..|..|+++|..+|++.|..|++.|..
T Consensus 450 l~~H~~C~--~Cgk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d 522 (567)
T PLN03086 450 AKNHVHCE--KCGQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMD 522 (567)
T ss_pred cccCccCC--CCCCccc-hHHHHHHHHhcC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccc
Confidence 35567894 5999996 678999999985 899999 99765 66899999999999999999999999952
Q ss_pred ----hHHHHHHHHHhhcccchhHHHHHHHHHHH
Q psy679 444 ----SDEKKRHAKVHLKQRSKKESKMAVMMSQQ 472 (510)
Q Consensus 444 ----sssLkrHlRtHtgeK~~k~~~~~~~~~QQ 472 (510)
...|..|++++ +.+++.|..|.+.....
T Consensus 523 ~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrlr 554 (567)
T PLN03086 523 VRDRLRGMSEHESIC-GSRTAPCDSCGRSVMLK 554 (567)
T ss_pred hhhhhhhHHHHHHhc-CCcceEccccCCeeeeh
Confidence 45899999986 99999999997654433
No 13
>PHA02768 hypothetical protein; Provisional
Probab=98.51 E-value=2.5e-08 Score=78.23 Aligned_cols=42 Identities=21% Similarity=0.517 Sum_probs=25.2
Q ss_pred cccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhh
Q psy679 374 YACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHL 419 (510)
Q Consensus 374 y~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L 419 (510)
|.| +.||+.|.+..+|.+|+++|+ ++|+|..|++.|.+...|
T Consensus 6 y~C--~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 6 YEC--PICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred cCc--chhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence 556 346666666666666666665 456666666666555444
No 14
>PHA02768 hypothetical protein; Provisional
Probab=98.34 E-value=2.7e-07 Score=72.46 Aligned_cols=44 Identities=25% Similarity=0.532 Sum_probs=40.8
Q ss_pred ceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHH
Q psy679 403 PFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKK 448 (510)
Q Consensus 403 PfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLk 448 (510)
-|.|+.|++.|.+..+|.+|+++|+ ++|+|..|+|.|.+.+.|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 4899999999999999999999998 7999999999999887764
No 15
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.15 E-value=7.1e-07 Score=59.70 Aligned_cols=24 Identities=58% Similarity=1.211 Sum_probs=13.3
Q ss_pred HHHHHHHhcCCCceecccCCCccC
Q psy679 391 LTRHIRIHTGQKPFQCRICMRSFS 414 (510)
Q Consensus 391 LkrH~rtHtgeKPfkC~~CgKsF~ 414 (510)
|.+|+++|+++|||.|+.|++.|.
T Consensus 2 l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 2 LRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHSSSSSEEESSSSEEES
T ss_pred HHHHhhhcCCCCCCCCCCCcCeeC
Confidence 455555555555555555555553
No 16
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.10 E-value=2.1e-06 Score=57.46 Aligned_cols=26 Identities=54% Similarity=1.206 Sum_probs=24.0
Q ss_pred hhhhhhhcccCCCceecCcCCCccCC
Q psy679 418 HLTTHIRTHTGEKPFSCDVCGRKFAR 443 (510)
Q Consensus 418 ~L~rH~rtHtGeKPfkC~~CgKaFsr 443 (510)
+|.+|+++|+|+|||.|+.|++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 47899999999999999999999974
No 17
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.07 E-value=4.1e-06 Score=92.55 Aligned_cols=78 Identities=21% Similarity=0.449 Sum_probs=65.7
Q ss_pred CCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCC----------chhhhhhhhcccCCCceecCcCCCcc
Q psy679 372 RPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSR----------SDHLTTHIRTHTGEKPFSCDVCGRKF 441 (510)
Q Consensus 372 KPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~----------ks~L~rH~rtHtGeKPfkC~~CgKaF 441 (510)
+++.| . |++.| .+..|..|+++|...|++.|.+|++.|.. ...|..|...+ |.+++.|..||+.|
T Consensus 477 kpv~C--p-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~V 551 (567)
T PLN03086 477 EPLQC--P-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGRSV 551 (567)
T ss_pred CCccC--C-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCCee
Confidence 78999 4 99765 66899999999999999999999999852 34789999986 89999999999998
Q ss_pred CChHHHHHHHHHhh
Q psy679 442 ARSDEKKRHAKVHL 455 (510)
Q Consensus 442 srsssLkrHlRtHt 455 (510)
..+ .+..|+...+
T Consensus 552 rlr-dm~~H~~~~h 564 (567)
T PLN03086 552 MLK-EMDIHQIAVH 564 (567)
T ss_pred eeh-hHHHHHHHhh
Confidence 875 5677875544
No 18
>PHA00733 hypothetical protein
Probab=97.97 E-value=6.6e-06 Score=74.99 Aligned_cols=56 Identities=29% Similarity=0.550 Sum_probs=50.0
Q ss_pred CCCCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccC
Q psy679 369 VHERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTG 428 (510)
Q Consensus 369 ~~EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtG 428 (510)
..+++|.| +.|++.|.....|..|++.| .++|.|..|++.|.....|.+|+..+++
T Consensus 69 ~~~kPy~C--~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 69 KAVSPYVC--PLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred CCCCCccC--CCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhcC
Confidence 34789999 55999999999999999987 4579999999999999999999987765
No 19
>KOG3993|consensus
Probab=97.92 E-value=2.3e-06 Score=90.45 Aligned_cols=87 Identities=29% Similarity=0.604 Sum_probs=70.1
Q ss_pred CcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhccc--------CCC--------------
Q psy679 373 PYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHT--------GEK-------------- 430 (510)
Q Consensus 373 Py~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHt--------GeK-------------- 430 (510)
-|.|.. |...|.+.-.|..|.-...-.-.|+|+.|+|.|.-..+|..|.|=|. +.+
T Consensus 267 dyiCqL--CK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~e 344 (500)
T KOG3993|consen 267 DYICQL--CKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQE 344 (500)
T ss_pred HHHHHH--HHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhh
Confidence 388955 99999999999999754333346999999999999999999987663 111
Q ss_pred -----------ceecCcCCCccCChHHHHHHHHHhhcccchh
Q psy679 431 -----------PFSCDVCGRKFARSDEKKRHAKVHLKQRSKK 461 (510)
Q Consensus 431 -----------PfkC~~CgKaFsrsssLkrHlRtHtgeK~~k 461 (510)
.|.|.+|+|.|++...|+.|+-+|.....-+
T Consensus 345 a~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k 386 (500)
T KOG3993|consen 345 AERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAK 386 (500)
T ss_pred ccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccch
Confidence 3899999999999999999998887654433
No 20
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.87 E-value=6.2e-06 Score=84.65 Aligned_cols=73 Identities=27% Similarity=0.626 Sum_probs=53.9
Q ss_pred CCCCcccCCCCCCCccCChHHHHHHHHH-hcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHH
Q psy679 370 HERPYACPVENCDRRFSRSDELTRHIRI-HTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKK 448 (510)
Q Consensus 370 ~EKPy~C~~e~CgKsFssss~LkrH~rt-HtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLk 448 (510)
++|||+|++++|.|.|+....|+.|+.- |...|-..-+. -..|...-...|||+|++|+|+|....-|+
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~----------p~~~~~F~~~~KPYrCevC~KRYKNlNGLK 415 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPS----------PEKMNIFSAKDKPYRCEVCDKRYKNLNGLK 415 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCCC----------ccccccccccCCceeccccchhhccCccce
Confidence 3699999999999999999999999874 65544333221 111111133579999999999999999999
Q ss_pred HHHH
Q psy679 449 RHAK 452 (510)
Q Consensus 449 rHlR 452 (510)
.|++
T Consensus 416 YHr~ 419 (423)
T COG5189 416 YHRK 419 (423)
T ss_pred eccc
Confidence 9864
No 21
>PHA00732 hypothetical protein
Probab=97.68 E-value=1.8e-05 Score=66.54 Aligned_cols=47 Identities=26% Similarity=0.506 Sum_probs=37.0
Q ss_pred CcccCCCCCCCccCChHHHHHHHHH-hcCCCceecccCCCccCCchhhhhhhhccc
Q psy679 373 PYACPVENCDRRFSRSDELTRHIRI-HTGQKPFQCRICMRSFSRSDHLTTHIRTHT 427 (510)
Q Consensus 373 Py~C~~e~CgKsFssss~LkrH~rt-HtgeKPfkC~~CgKsF~~ks~L~rH~rtHt 427 (510)
||.| +.|++.|.+...|+.|++. |. ++.|..|++.|. .+..|.+++.
T Consensus 1 py~C--~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~~~ 48 (79)
T PHA00732 1 MFKC--PICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYSQY 48 (79)
T ss_pred CccC--CCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---ChhhhhcccC
Confidence 5788 5599999999999999984 65 357999999987 4778886654
No 22
>PHA00616 hypothetical protein
Probab=97.64 E-value=1.9e-05 Score=59.47 Aligned_cols=31 Identities=26% Similarity=0.382 Sum_probs=16.5
Q ss_pred ceecCcCCCccCChHHHHHHHHHhhcccchh
Q psy679 431 PFSCDVCGRKFARSDEKKRHAKVHLKQRSKK 461 (510)
Q Consensus 431 PfkC~~CgKaFsrsssLkrHlRtHtgeK~~k 461 (510)
||.|..||+.|....+|.+|++.|++++++.
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~ 31 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLT 31 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccc
Confidence 3455555555555555555555555555443
No 23
>KOG3993|consensus
Probab=97.42 E-value=4.6e-05 Score=80.82 Aligned_cols=53 Identities=36% Similarity=0.690 Sum_probs=45.6
Q ss_pred CcccCCCCCCCccCChHHHHHHHHHhcCC---------------------------------CceecccCCCccCCchhh
Q psy679 373 PYACPVENCDRRFSRSDELTRHIRIHTGQ---------------------------------KPFQCRICMRSFSRSDHL 419 (510)
Q Consensus 373 Py~C~~e~CgKsFssss~LkrH~rtHtge---------------------------------KPfkC~~CgKsF~~ks~L 419 (510)
.|+| .+|+|.|....||..|+|+|.-. --|.|.+|+|.|.+...|
T Consensus 295 EYrC--PEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~rsg~dss~gi~~C~~C~KkFrRqAYL 372 (500)
T KOG3993|consen 295 EYRC--PECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAERSGDDSSSGIFSCHTCGKKFRRQAYL 372 (500)
T ss_pred eecC--CcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhccccCCcccCceeecHHhhhhhHHHHHH
Confidence 4889 77999999999999999999521 138999999999999999
Q ss_pred hhhhhccc
Q psy679 420 TTHIRTHT 427 (510)
Q Consensus 420 ~rH~rtHt 427 (510)
+.|..+|.
T Consensus 373 rKHqlthq 380 (500)
T KOG3993|consen 373 RKHQLTHQ 380 (500)
T ss_pred HHhHHhhh
Confidence 99976654
No 24
>PHA00616 hypothetical protein
Probab=97.33 E-value=9.9e-05 Score=55.69 Aligned_cols=34 Identities=21% Similarity=0.415 Sum_probs=26.0
Q ss_pred CcccCCCCCCCccCChHHHHHHHHHhcCCCceeccc
Q psy679 373 PYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRI 408 (510)
Q Consensus 373 Py~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~ 408 (510)
||+| ..||+.|..+.+|.+|++.|++++++.|+.
T Consensus 1 pYqC--~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQC--LRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCcc--chhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 4677 458888888888888888888888877765
No 25
>PHA00732 hypothetical protein
Probab=97.31 E-value=0.00016 Score=60.98 Aligned_cols=48 Identities=27% Similarity=0.475 Sum_probs=40.8
Q ss_pred ceecccCCCccCCchhhhhhhhc-ccCCCceecCcCCCccCChHHHHHHHHHhhc
Q psy679 403 PFQCRICMRSFSRSDHLTTHIRT-HTGEKPFSCDVCGRKFARSDEKKRHAKVHLK 456 (510)
Q Consensus 403 PfkC~~CgKsF~~ks~L~rH~rt-HtGeKPfkC~~CgKaFsrsssLkrHlRtHtg 456 (510)
||.|..|++.|.....|..|++. |.+ +.|+.|++.|. .|..|.+++..
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~~---~~C~~CgKsF~---~l~~H~~~~~~ 49 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHTL---TKCPVCNKSYR---RLNQHFYSQYD 49 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccCC---CccCCCCCEeC---ChhhhhcccCC
Confidence 58999999999999999999985 653 68999999998 47888866543
No 26
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.15 E-value=0.0003 Score=44.91 Aligned_cols=23 Identities=35% Similarity=0.973 Sum_probs=20.5
Q ss_pred eecCcCCCccCChHHHHHHHHHh
Q psy679 432 FSCDVCGRKFARSDEKKRHAKVH 454 (510)
Q Consensus 432 fkC~~CgKaFsrsssLkrHlRtH 454 (510)
|+|..|++.|.+...|++|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 78999999999999999999875
No 27
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=96.99 E-value=0.00059 Score=52.98 Aligned_cols=49 Identities=24% Similarity=0.517 Sum_probs=23.3
Q ss_pred eecccCCCccCCchhhhhhhhc-ccC-CCceecCcCCCccCChHHHHHHHHHhh
Q psy679 404 FQCRICMRSFSRSDHLTTHIRT-HTG-EKPFSCDVCGRKFARSDEKKRHAKVHL 455 (510)
Q Consensus 404 fkC~~CgKsF~~ks~L~rH~rt-HtG-eKPfkC~~CgKaFsrsssLkrHlRtHt 455 (510)
|.|++|++ ......|..|... |.. .+.+.|++|...+. .+|.+|++.++
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 45555555 2333445555433 222 23455666655433 25666665544
No 28
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.75 E-value=0.0012 Score=41.51 Aligned_cols=24 Identities=33% Similarity=0.924 Sum_probs=19.2
Q ss_pred eecCcCCCccCChHHHHHHHHHhh
Q psy679 432 FSCDVCGRKFARSDEKKRHAKVHL 455 (510)
Q Consensus 432 fkC~~CgKaFsrsssLkrHlRtHt 455 (510)
|.|+.|++.|.....|+.|+++|+
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 689999999999999999998874
No 29
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.69 E-value=0.0013 Score=54.98 Aligned_cols=74 Identities=22% Similarity=0.512 Sum_probs=22.5
Q ss_pred ccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHHHHHHHh
Q psy679 375 ACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKKRHAKVH 454 (510)
Q Consensus 375 ~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLkrHlRtH 454 (510)
.|.+ |+..|.....|..|+....+... . ....+.....+..+.+... ...+.|..|++.|.....|..|++.+
T Consensus 1 ~C~~--C~~~f~~~~~l~~H~~~~H~~~~---~-~~~~l~~~~~~~~~~~~~~-~~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 1 QCLF--CDESFSSVDDLLQHMKKKHGFDI---P-DQKYLVDPNRLLNYLRKKV-KESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp -------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred Cccc--ccccccccccccccccccccccc---c-ccccccccccccccccccc-CCCCCCCccCCCCcCHHHHHHHHcCc
Confidence 3755 99999999999999976443211 1 1122223344444444322 23699999999999999999999975
Q ss_pred h
Q psy679 455 L 455 (510)
Q Consensus 455 t 455 (510)
.
T Consensus 74 ~ 74 (100)
T PF12756_consen 74 H 74 (100)
T ss_dssp T
T ss_pred c
Confidence 4
No 30
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=96.47 E-value=0.0028 Score=49.24 Aligned_cols=51 Identities=25% Similarity=0.597 Sum_probs=39.6
Q ss_pred CcccCCCCCCCccCChHHHHHHHHH-hcCC-CceecccCCCccCCchhhhhhhhcccC
Q psy679 373 PYACPVENCDRRFSRSDELTRHIRI-HTGQ-KPFQCRICMRSFSRSDHLTTHIRTHTG 428 (510)
Q Consensus 373 Py~C~~e~CgKsFssss~LkrH~rt-Htge-KPfkC~~CgKsF~~ks~L~rH~rtHtG 428 (510)
.|.|++ |++ ..+...|..|... |..+ +.+.|++|...+. .+|.+|++.+++
T Consensus 2 ~f~CP~--C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 2 SFTCPY--CGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred CcCCCC--CCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence 488966 999 5567889999875 6654 5799999998755 489999987653
No 31
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.37 E-value=0.0015 Score=43.32 Aligned_cols=26 Identities=35% Similarity=0.887 Sum_probs=21.0
Q ss_pred ceecCcCCCccCChHHHHHHHHHhhc
Q psy679 431 PFSCDVCGRKFARSDEKKRHAKVHLK 456 (510)
Q Consensus 431 PfkC~~CgKaFsrsssLkrHlRtHtg 456 (510)
+|.|..|++.|.....|..|++.|.+
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhcC
Confidence 57888888888888888888877753
No 32
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=96.11 E-value=0.0036 Score=39.88 Aligned_cols=23 Identities=43% Similarity=0.953 Sum_probs=17.5
Q ss_pred cccCCCCCCCccCChHHHHHHHHHh
Q psy679 374 YACPVENCDRRFSRSDELTRHIRIH 398 (510)
Q Consensus 374 y~C~~e~CgKsFssss~LkrH~rtH 398 (510)
|.| +.|++.|.+...|.+|++.|
T Consensus 1 y~C--~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKC--PICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEE--TTTTEEESSHHHHHHHHHHH
T ss_pred CCC--CCCCCccCCHHHHHHHHhHC
Confidence 567 44888888888888888765
No 33
>PRK04860 hypothetical protein; Provisional
Probab=95.78 E-value=0.0038 Score=59.25 Aligned_cols=37 Identities=27% Similarity=0.743 Sum_probs=19.2
Q ss_pred ceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCC
Q psy679 403 PFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFAR 443 (510)
Q Consensus 403 PfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsr 443 (510)
+|.|. |++ ....+.+|.++|+++++|+|..|++.|..
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~ 155 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVF 155 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEE
Confidence 35554 554 34445555555555555555555555543
No 34
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.66 E-value=0.0099 Score=37.59 Aligned_cols=24 Identities=29% Similarity=0.764 Sum_probs=20.6
Q ss_pred eecCcCCCccCChHHHHHHHHHhh
Q psy679 432 FSCDVCGRKFARSDEKKRHAKVHL 455 (510)
Q Consensus 432 fkC~~CgKaFsrsssLkrHlRtHt 455 (510)
|.|..|++.|.....|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 578899999999999999988765
No 35
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.32 E-value=0.019 Score=44.90 Aligned_cols=33 Identities=24% Similarity=0.618 Sum_probs=23.4
Q ss_pred cCCCceecCcCCCccCChHHHHHHHHHhhcccc
Q psy679 427 TGEKPFSCDVCGRKFARSDEKKRHAKVHLKQRS 459 (510)
Q Consensus 427 tGeKPfkC~~CgKaFsrsssLkrHlRtHtgeK~ 459 (510)
..+.|..|++|+..+.+..+|++|+..+++.|+
T Consensus 20 ~S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 20 QSEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TTS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred ccCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 346889999999999999999999998888765
No 36
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=94.79 E-value=0.0089 Score=60.75 Aligned_cols=73 Identities=41% Similarity=0.716 Sum_probs=61.4
Q ss_pred CCcccCCCCCCCccCChHHHHHHHH--HhcCC--Cceecc--cCCCccCCchhhhhhhhcccCCCceecCc--CCCccCC
Q psy679 372 RPYACPVENCDRRFSRSDELTRHIR--IHTGQ--KPFQCR--ICMRSFSRSDHLTTHIRTHTGEKPFSCDV--CGRKFAR 443 (510)
Q Consensus 372 KPy~C~~e~CgKsFssss~LkrH~r--tHtge--KPfkC~--~CgKsF~~ks~L~rH~rtHtGeKPfkC~~--CgKaFsr 443 (510)
.++.| ..|...|.+...|.+|.+ .|.++ +++.|+ .|++.|.+...+.+|..+|++.++++|.. |.+.+..
T Consensus 288 ~~~~~--~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 365 (467)
T COG5048 288 LPIKS--KQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSP 365 (467)
T ss_pred cCCCC--ccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccc
Confidence 46777 569999999999999999 79999 999999 79999999999999999999988888864 6665554
Q ss_pred hHH
Q psy679 444 SDE 446 (510)
Q Consensus 444 sss 446 (510)
...
T Consensus 366 ~~~ 368 (467)
T COG5048 366 LLN 368 (467)
T ss_pred ccC
Confidence 433
No 37
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=94.73 E-value=0.026 Score=35.31 Aligned_cols=23 Identities=39% Similarity=1.099 Sum_probs=15.2
Q ss_pred cccCCCCCCCccCChHHHHHHHHHh
Q psy679 374 YACPVENCDRRFSRSDELTRHIRIH 398 (510)
Q Consensus 374 y~C~~e~CgKsFssss~LkrH~rtH 398 (510)
|.|.+ |++.|.+...|..|+++|
T Consensus 1 ~~C~~--C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPI--CGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SS--TS-EESSHHHHHHHHHHH
T ss_pred CCCcC--CCCcCCcHHHHHHHHHhh
Confidence 56744 888888888888777765
No 38
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=94.30 E-value=0.022 Score=57.95 Aligned_cols=92 Identities=29% Similarity=0.465 Sum_probs=55.4
Q ss_pred CCCCC--CCcccCCCCCCCccCChHHHHHHHHHhcCCCceeccc--CCCccCCchhhhhhhh-----cccCCCceecCc-
Q psy679 367 TPVHE--RPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRI--CMRSFSRSDHLTTHIR-----THTGEKPFSCDV- 436 (510)
Q Consensus 367 th~~E--KPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~--CgKsF~~ks~L~rH~r-----tHtGeKPfkC~~- 436 (510)
.|..+ +++.|.+..|++.|.+...+.+|...|.+.+++.|.. |.+.+.....-..+.. .....+.+.|..
T Consensus 313 ~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 392 (467)
T COG5048 313 NHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSN 392 (467)
T ss_pred ccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCccccccccCCCCccchhhccCccCCcccccccc
Confidence 57777 7777763357777777777777777777777766654 4444433333111111 122334555543
Q ss_pred -CCCccCChHHHHHHHHHhhccc
Q psy679 437 -CGRKFARSDEKKRHAKVHLKQR 458 (510)
Q Consensus 437 -CgKaFsrsssLkrHlRtHtgeK 458 (510)
|.+.+.+...+..|...|...+
T Consensus 393 ~~~~~~~~~~~~~~~~~~~~~~~ 415 (467)
T COG5048 393 SCIRNFKRDSNLSLHIITHLSFR 415 (467)
T ss_pred chhhhhccccccccccccccccC
Confidence 6677777777777777766655
No 39
>PRK04860 hypothetical protein; Provisional
Probab=94.15 E-value=0.031 Score=53.12 Aligned_cols=40 Identities=28% Similarity=0.694 Sum_probs=33.8
Q ss_pred CCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCch
Q psy679 372 RPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSD 417 (510)
Q Consensus 372 KPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks 417 (510)
-+|.| . |++ ....+.+|.++|+++++|.|..|++.|....
T Consensus 118 ~~Y~C--~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~ 157 (160)
T PRK04860 118 FPYRC--K-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKG 157 (160)
T ss_pred EEEEc--C-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEec
Confidence 36899 4 887 6677999999999999999999999887543
No 40
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=93.94 E-value=0.028 Score=37.12 Aligned_cols=25 Identities=36% Similarity=0.877 Sum_probs=15.5
Q ss_pred CcccCCCCCCCccCChHHHHHHHHHhc
Q psy679 373 PYACPVENCDRRFSRSDELTRHIRIHT 399 (510)
Q Consensus 373 Py~C~~e~CgKsFssss~LkrH~rtHt 399 (510)
+|.| ..|++.|.+...|..|++.|.
T Consensus 1 ~~~C--~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFEC--DECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEE--TTTTEEESSHHHHHHHHCTTT
T ss_pred CCCC--CccCCccCChhHHHHHhHHhc
Confidence 3566 336666666666666666653
No 41
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=92.99 E-value=0.052 Score=35.15 Aligned_cols=23 Identities=26% Similarity=0.852 Sum_probs=18.8
Q ss_pred eecCcCCCccCChHHHHHHHHHh
Q psy679 432 FSCDVCGRKFARSDEKKRHAKVH 454 (510)
Q Consensus 432 fkC~~CgKaFsrsssLkrHlRtH 454 (510)
|.|..|++.|.....|+.|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 67888888888888888888654
No 42
>KOG1146|consensus
Probab=92.97 E-value=0.23 Score=59.86 Aligned_cols=80 Identities=18% Similarity=0.398 Sum_probs=52.7
Q ss_pred CCCCCCcccCCCCCCCccCChHHHHHHHHH-hcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHH
Q psy679 368 PVHERPYACPVENCDRRFSRSDELTRHIRI-HTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDE 446 (510)
Q Consensus 368 h~~EKPy~C~~e~CgKsFssss~LkrH~rt-HtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsss 446 (510)
+...|.|+|+ .|+..|+....|..|||. |...+- .+|... .....+.+-..---+.++|.|..|...|..+.+
T Consensus 460 ~S~~kt~~cp--kc~~~yk~a~~L~vhmRskhp~~~~---~~c~~g-q~~~~~arg~~~~~~~~p~~C~~C~~stttng~ 533 (1406)
T KOG1146|consen 460 HSFFKTLKCP--KCNWHYKLAQTLGVHMRSKHPESQS---AYCKAG-QNHPRLARGEVYRCPGKPYPCRACNYSTTTNGN 533 (1406)
T ss_pred ecccccccCC--ccchhhhhHHHhhhcccccccccch---hHhHhc-cccccccccccccCCCCcccceeeeeeeecchH
Confidence 4445789995 599999999999999997 433222 333211 111111111111234689999999999999999
Q ss_pred HHHHHHH
Q psy679 447 KKRHAKV 453 (510)
Q Consensus 447 LkrHlRt 453 (510)
|..|+..
T Consensus 534 LsihlqS 540 (1406)
T KOG1146|consen 534 LSIHLQS 540 (1406)
T ss_pred HHHHHHH
Confidence 9999864
No 43
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=92.70 E-value=0.12 Score=33.39 Aligned_cols=24 Identities=29% Similarity=0.555 Sum_probs=17.1
Q ss_pred eecCcCCCccCChHHHHHHHHHhhc
Q psy679 432 FSCDVCGRKFARSDEKKRHAKVHLK 456 (510)
Q Consensus 432 fkC~~CgKaFsrsssLkrHlRtHtg 456 (510)
|+|..|..... ...|.+|+++|++
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 67888887777 7788888887653
No 44
>smart00355 ZnF_C2H2 zinc finger.
Probab=92.45 E-value=0.13 Score=32.22 Aligned_cols=24 Identities=33% Similarity=0.811 Sum_probs=17.7
Q ss_pred cccCCCCCCCccCChHHHHHHHHHhc
Q psy679 374 YACPVENCDRRFSRSDELTRHIRIHT 399 (510)
Q Consensus 374 y~C~~e~CgKsFssss~LkrH~rtHt 399 (510)
|.| ..|++.|.....|..|++.|.
T Consensus 1 ~~C--~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRC--PECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCC--CCCcchhCCHHHHHHHHHHhc
Confidence 457 348888888888888887664
No 45
>KOG4173|consensus
Probab=92.42 E-value=0.039 Score=54.39 Aligned_cols=82 Identities=29% Similarity=0.622 Sum_probs=66.4
Q ss_pred CCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhccc----------CCCceecC--cCC
Q psy679 371 ERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHT----------GEKPFSCD--VCG 438 (510)
Q Consensus 371 EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHt----------GeKPfkC~--~Cg 438 (510)
.+.|.|.+.+|...|....+...|..+-++ -.|.+|.+.|....-|..|+...+ |...|.|- .|+
T Consensus 77 ~~~~~cqvagc~~~~d~lD~~E~hY~~~h~---~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt 153 (253)
T KOG4173|consen 77 VPAFACQVAGCCQVFDALDDYEHHYHTLHG---NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCT 153 (253)
T ss_pred cccccccccchHHHHhhhhhHHHhhhhccc---chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhh
Confidence 456889999999999998888888865333 269999999999999999986533 45579994 599
Q ss_pred CccCChHHHHHHH-HHhh
Q psy679 439 RKFARSDEKKRHA-KVHL 455 (510)
Q Consensus 439 KaFsrsssLkrHl-RtHt 455 (510)
-.|......+.|+ ++|.
T Consensus 154 ~KFkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 154 EKFKTSRDRKDHMIRMHK 171 (253)
T ss_pred hhhhhhhhhhhHHHHhcc
Confidence 9999999999998 5664
No 46
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=90.49 E-value=0.35 Score=37.99 Aligned_cols=32 Identities=38% Similarity=0.795 Sum_probs=18.3
Q ss_pred CCCCcccCCCCCCCccCChHHHHHHHHHhcCCCc
Q psy679 370 HERPYACPVENCDRRFSRSDELTRHIRIHTGQKP 403 (510)
Q Consensus 370 ~EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKP 403 (510)
.+.|-.|++ |+..+....+|++|+..+++.||
T Consensus 21 S~~PatCP~--C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 21 SEQPATCPI--CGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TS--EE-TT--T--EESSHHHHHHHHHHHTTTS-
T ss_pred cCCCCCCCc--chhhccchhhHHHHHHHHhcccC
Confidence 356677744 88888888888888776666654
No 47
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=88.96 E-value=0.1 Score=34.83 Aligned_cols=22 Identities=36% Similarity=0.844 Sum_probs=15.8
Q ss_pred eecCcCCCccCChHHHHHHHHH
Q psy679 432 FSCDVCGRKFARSDEKKRHAKV 453 (510)
Q Consensus 432 fkC~~CgKaFsrsssLkrHlRt 453 (510)
|.|..|++.|.....|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 5677777777777777777643
No 48
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=87.16 E-value=0.33 Score=31.30 Aligned_cols=23 Identities=30% Similarity=0.901 Sum_probs=15.5
Q ss_pred cccCCCCCCCccCChHHHHHHHHHh
Q psy679 374 YACPVENCDRRFSRSDELTRHIRIH 398 (510)
Q Consensus 374 y~C~~e~CgKsFssss~LkrH~rtH 398 (510)
|.|.+ |++.|.+...|..|++.+
T Consensus 1 ~~C~~--C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDI--CNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETT--TTEEESSHHHHHHHHTTH
T ss_pred CCCCC--CCCCcCCHHHHHHHHCcC
Confidence 45644 777777777777777654
No 49
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=86.51 E-value=0.26 Score=51.56 Aligned_cols=40 Identities=23% Similarity=0.415 Sum_probs=31.8
Q ss_pred CCceecCc--CCCccCChHHHHHHHHHhh-------------------cccchhHHHHHHH
Q psy679 429 EKPFSCDV--CGRKFARSDEKKRHAKVHL-------------------KQRSKKESKMAVM 468 (510)
Q Consensus 429 eKPfkC~~--CgKaFsrsssLkrHlRtHt-------------------geK~~k~~~~~~~ 468 (510)
+|||+|++ |.|.|....-|+.|+.--+ +.|+|+|+.|.+-
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KR 407 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKR 407 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchh
Confidence 58999986 9999999999999975422 3589999998764
No 50
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=86.45 E-value=0.58 Score=49.49 Aligned_cols=80 Identities=31% Similarity=0.630 Sum_probs=55.5
Q ss_pred CcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCC---cc------CCchhhhhhhhcccCCCce----ecCcCCC
Q psy679 373 PYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMR---SF------SRSDHLTTHIRTHTGEKPF----SCDVCGR 439 (510)
Q Consensus 373 Py~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgK---sF------~~ks~L~rH~rtHtGeKPf----kC~~CgK 439 (510)
.|.|+...|..+......|+.|.+..++ .+.|.+|-+ .| .++..|+.|...-..+.-| .|..|.+
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~--~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~ 228 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHG--FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKI 228 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcC--cEEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccc
Confidence 3889888888887778889999987443 367777642 33 3455566665432222122 5999999
Q ss_pred ccCChHHHHHHHHHh
Q psy679 440 KFARSDEKKRHAKVH 454 (510)
Q Consensus 440 aFsrsssLkrHlRtH 454 (510)
.|-..+.|.+|+|..
T Consensus 229 ~FYdDDEL~~HcR~~ 243 (493)
T COG5236 229 YFYDDDELRRHCRLR 243 (493)
T ss_pred eecChHHHHHHHHhh
Confidence 999999999988753
No 51
>KOG2186|consensus
Probab=86.37 E-value=0.44 Score=48.42 Aligned_cols=50 Identities=30% Similarity=0.610 Sum_probs=36.3
Q ss_pred eecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHHHHHHHhhc
Q psy679 404 FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKKRHAKVHLK 456 (510)
Q Consensus 404 fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLkrHlRtHtg 456 (510)
|.|..|+.... +..+.+|+...++ ..|.|-.|++.|.+ ..++.|.+.-+.
T Consensus 4 FtCnvCgEsvK-Kp~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kCITE 53 (276)
T KOG2186|consen 4 FTCNVCGESVK-KPQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKCITE 53 (276)
T ss_pred Eehhhhhhhcc-ccchHHHHHhccC-CeeEEeeccccccc-chhhhhhhhcch
Confidence 67888887664 4556678877766 66888888888887 677788765443
No 52
>KOG4124|consensus
Probab=84.52 E-value=0.61 Score=49.39 Aligned_cols=72 Identities=26% Similarity=0.599 Sum_probs=48.1
Q ss_pred CCCCcccCCCCCCCccCChHHHHHHHHH-hcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHH
Q psy679 370 HERPYACPVENCDRRFSRSDELTRHIRI-HTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKK 448 (510)
Q Consensus 370 ~EKPy~C~~e~CgKsFssss~LkrH~rt-HtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLk 448 (510)
..++|+|+++.|.+.++....|+.|... |.... -.....-.-|.....-.|+|+|++|.+++.....|+
T Consensus 346 ~~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i----------~~~s~~~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~ 415 (442)
T KOG4124|consen 346 VDKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPI----------TTPTPAPIPHQGFVVENKPYRCEVCSKRYKNLNGLK 415 (442)
T ss_pred ecCCCCCCCCcchhhcccCcceeeccccCcCCCC----------CCCCCCCCCcceeeeccCcccChhhhhhhccCCCCC
Confidence 3678999999999999998888877653 32211 111111223444444568999999999888777776
Q ss_pred HHH
Q psy679 449 RHA 451 (510)
Q Consensus 449 rHl 451 (510)
.|+
T Consensus 416 ~~~ 418 (442)
T KOG4124|consen 416 YHR 418 (442)
T ss_pred cee
Confidence 664
No 53
>KOG2231|consensus
Probab=84.48 E-value=0.76 Score=52.48 Aligned_cols=18 Identities=50% Similarity=0.964 Sum_probs=10.3
Q ss_pred CCCccCChHHHHHHHHHh
Q psy679 381 CDRRFSRSDELTRHIRIH 398 (510)
Q Consensus 381 CgKsFssss~LkrH~rtH 398 (510)
|...|.....|.+|++.+
T Consensus 188 C~~~fld~~el~rH~~~~ 205 (669)
T KOG2231|consen 188 CHERFLDDDELYRHLRFD 205 (669)
T ss_pred hhhhhccHHHHHHhhccc
Confidence 555555555555555543
No 54
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=83.87 E-value=0.51 Score=31.37 Aligned_cols=22 Identities=32% Similarity=0.824 Sum_probs=13.4
Q ss_pred eecccCCCccCCchhhhhhhhc
Q psy679 404 FQCRICMRSFSRSDHLTTHIRT 425 (510)
Q Consensus 404 fkC~~CgKsF~~ks~L~rH~rt 425 (510)
|.|..|++.|.....+..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 4566666666666666666553
No 55
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=83.46 E-value=0.95 Score=31.30 Aligned_cols=23 Identities=26% Similarity=0.505 Sum_probs=20.0
Q ss_pred ceecCcCCCccCChHHHHHHHHH
Q psy679 431 PFSCDVCGRKFARSDEKKRHAKV 453 (510)
Q Consensus 431 PfkC~~CgKaFsrsssLkrHlRt 453 (510)
+|.|+.|++.|.....+..|++.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 58899999999999999999854
No 56
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=82.93 E-value=1.1 Score=29.84 Aligned_cols=21 Identities=43% Similarity=1.067 Sum_probs=16.0
Q ss_pred eecCcCCCccCChHHHHHHHHH
Q psy679 432 FSCDVCGRKFARSDEKKRHAKV 453 (510)
Q Consensus 432 fkC~~CgKaFsrsssLkrHlRt 453 (510)
..|..||+.| ..+.|.+|++.
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHh
Confidence 3688888888 66788888764
No 57
>KOG2231|consensus
Probab=80.28 E-value=1.3 Score=50.59 Aligned_cols=13 Identities=31% Similarity=0.621 Sum_probs=9.2
Q ss_pred CChHHHHHHHH-Hh
Q psy679 386 SRSDELTRHIR-IH 398 (510)
Q Consensus 386 ssss~LkrH~r-tH 398 (510)
.....|+.|++ .|
T Consensus 125 ~s~~~Lk~H~~~~H 138 (669)
T KOG2231|consen 125 KSVENLKNHMRDQH 138 (669)
T ss_pred hHHHHHHHHHHHhh
Confidence 36778888885 35
No 58
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=80.24 E-value=1.5 Score=28.07 Aligned_cols=23 Identities=39% Similarity=0.909 Sum_probs=13.2
Q ss_pred cccCCCCCCCccCChHHHHHHHHHhc
Q psy679 374 YACPVENCDRRFSRSDELTRHIRIHT 399 (510)
Q Consensus 374 y~C~~e~CgKsFssss~LkrH~rtHt 399 (510)
|.|. .|..... ...|.+|++.|.
T Consensus 1 y~C~--~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCP--HCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-S--SSS-EES-HHHHHHHHHHHH
T ss_pred CCCC--CCCCcCC-HHHHHHHHHhhC
Confidence 4563 3776666 667777776643
No 59
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=76.91 E-value=1.2 Score=43.94 Aligned_cols=42 Identities=26% Similarity=0.598 Sum_probs=25.4
Q ss_pred CceecccCCCccCCchhhhhhhhc----------ccCCCc-----eecCcCCCccCC
Q psy679 402 KPFQCRICMRSFSRSDHLTTHIRT----------HTGEKP-----FSCDVCGRKFAR 443 (510)
Q Consensus 402 KPfkC~~CgKsF~~ks~L~rH~rt----------HtGeKP-----fkC~~CgKaFsr 443 (510)
|.+.|++|++.|..+.-+....+. ..+..| ..|+.||.+|..
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~ 60 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFE 60 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCcccc
Confidence 456677777777666555544432 112333 479999988764
No 60
>KOG2893|consensus
Probab=75.57 E-value=1.5 Score=44.44 Aligned_cols=46 Identities=30% Similarity=0.753 Sum_probs=35.3
Q ss_pred CCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhh
Q psy679 371 ERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHI 423 (510)
Q Consensus 371 EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~ 423 (510)
.|++ |- .|++-|....-|..|.+. |.|+|-+|.|....-..|..|.
T Consensus 9 ~kpw-cw--ycnrefddekiliqhqka----khfkchichkkl~sgpglsihc 54 (341)
T KOG2893|consen 9 DKPW-CW--YCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIHC 54 (341)
T ss_pred CCce-ee--ecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceeeh
Confidence 4555 53 399999998888888764 5699999998887777777774
No 61
>KOG1146|consensus
Probab=72.07 E-value=1.6 Score=53.06 Aligned_cols=78 Identities=18% Similarity=0.351 Sum_probs=58.0
Q ss_pred CcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhc-----------------ccCCCceecC
Q psy679 373 PYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRT-----------------HTGEKPFSCD 435 (510)
Q Consensus 373 Py~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rt-----------------HtGeKPfkC~ 435 (510)
.+.| ..|.+.|...-.+. |+-. ..+|.|..|...|.....|..|.+. |...++| |.
T Consensus 1260 e~~c--~~~~~~~~~~~~~~-~l~~---~~~~~~~~~~~~~~~~~~l~~~~~k~~~~~~~~~~~~~~~l~~~d~~~~-c~ 1332 (1406)
T KOG1146|consen 1260 EGEC--GAVDELLTPSFGIS-TLDV---THRYLCRQCKMAFDGEAPLTAHQRKFCFAGRGSGGSMPPPLRVPDCTYH-CL 1332 (1406)
T ss_pred cchh--hhccccccCcccee-eccc---chhHHHHHHHhhhcchhHHHHHHHHHHhccCccccCCCCcccCcccccc-ch
Confidence 3678 44888888776666 5543 3468899999888888888888642 2223456 99
Q ss_pred cCCCccCChHHHHHHHHHhhcc
Q psy679 436 VCGRKFARSDEKKRHAKVHLKQ 457 (510)
Q Consensus 436 ~CgKaFsrsssLkrHlRtHtge 457 (510)
.|...|.....|..|+|.-+++
T Consensus 1333 ~c~~~~~~~~alqihm~~~~~~ 1354 (1406)
T KOG1146|consen 1333 ACEVLLSGREALQIHMRSSAHR 1354 (1406)
T ss_pred HHHhhcchhHHHHHHHHHhhhc
Confidence 9999999999999999864443
No 62
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=71.48 E-value=2.5 Score=37.86 Aligned_cols=30 Identities=30% Similarity=0.738 Sum_probs=18.7
Q ss_pred eecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCCh
Q psy679 404 FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARS 444 (510)
Q Consensus 404 fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrs 444 (510)
..|+.||++|+. | +..|..|++||..|.-.
T Consensus 10 R~Cp~CG~kFYD---L--------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCPSCGAKFYD---L--------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred ccCCCCcchhcc---C--------CCCCccCCCCCCccCcc
Confidence 467777777753 2 12566777777777654
No 63
>KOG2893|consensus
Probab=69.99 E-value=1.1 Score=45.47 Aligned_cols=44 Identities=32% Similarity=0.668 Sum_probs=34.6
Q ss_pred CceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHHHH
Q psy679 402 KPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKKRH 450 (510)
Q Consensus 402 KPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLkrH 450 (510)
|+ -|-+|++-|....-|..|++. |-|+|.+|.|.....--|..|
T Consensus 10 kp-wcwycnrefddekiliqhqka----khfkchichkkl~sgpglsih 53 (341)
T KOG2893|consen 10 KP-WCWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIH 53 (341)
T ss_pred Cc-eeeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceee
Confidence 44 388899999999999988864 569999999876665556555
No 64
>KOG4167|consensus
Probab=67.74 E-value=2.6 Score=48.47 Aligned_cols=26 Identities=31% Similarity=0.721 Sum_probs=24.1
Q ss_pred ceecCcCCCccCChHHHHHHHHHhhc
Q psy679 431 PFSCDVCGRKFARSDEKKRHAKVHLK 456 (510)
Q Consensus 431 PfkC~~CgKaFsrsssLkrHlRtHtg 456 (510)
-|.|..|+|.|.....+..|||+|.-
T Consensus 792 iFpCreC~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 792 IFPCRECGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred eeehHHHHHHHHHHhhhhHHHHHHHH
Confidence 48999999999999999999999964
No 65
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=67.19 E-value=7.3 Score=34.13 Aligned_cols=25 Identities=16% Similarity=0.404 Sum_probs=23.1
Q ss_pred eec----CcCCCccCChHHHHHHHHHhhc
Q psy679 432 FSC----DVCGRKFARSDEKKRHAKVHLK 456 (510)
Q Consensus 432 fkC----~~CgKaFsrsssLkrHlRtHtg 456 (510)
|.| ..|+..+.....+++|.+.++|
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 899 8999999999999999998775
No 66
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=65.15 E-value=3 Score=44.38 Aligned_cols=68 Identities=28% Similarity=0.534 Sum_probs=33.6
Q ss_pred CCCccCChHHHHHHHHHhcCCCceecccCCC----ccCCchhhhhhhhcccCCCceecCc--C--C--CccCChHHHHHH
Q psy679 381 CDRRFSRSDELTRHIRIHTGQKPFQCRICMR----SFSRSDHLTTHIRTHTGEKPFSCDV--C--G--RKFARSDEKKRH 450 (510)
Q Consensus 381 CgKsFssss~LkrH~rtHtgeKPfkC~~CgK----sF~~ks~L~rH~rtHtGeKPfkC~~--C--g--KaFsrsssLkrH 450 (510)
|.+.|-....|.+|+|..+ ++-|.|..-+. -|..-..|.+|.+. .-|.|.+ | | ..|.....|..|
T Consensus 226 C~~~FYdDDEL~~HcR~~H-E~ChICD~v~p~~~QYFK~Y~~Le~HF~~----~hy~ct~qtc~~~k~~vf~~~~el~~h 300 (493)
T COG5236 226 CKIYFYDDDELRRHCRLRH-EACHICDMVGPIRYQYFKSYEDLEAHFRN----AHYCCTFQTCRVGKCYVFPYHTELLEH 300 (493)
T ss_pred ccceecChHHHHHHHHhhh-hhhhhhhccCccchhhhhCHHHHHHHhhc----CceEEEEEEEecCcEEEeccHHHHHHH
Confidence 6666766667766666532 22233322211 24444555555442 2255543 3 2 246666666666
Q ss_pred HHH
Q psy679 451 AKV 453 (510)
Q Consensus 451 lRt 453 (510)
+-.
T Consensus 301 ~~~ 303 (493)
T COG5236 301 LTR 303 (493)
T ss_pred HHH
Confidence 643
No 67
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=64.09 E-value=3.4 Score=39.20 Aligned_cols=14 Identities=21% Similarity=0.612 Sum_probs=9.3
Q ss_pred eecccCCCccCCch
Q psy679 404 FQCRICMRSFSRSD 417 (510)
Q Consensus 404 fkC~~CgKsF~~ks 417 (510)
+.|+.|++.|....
T Consensus 29 ~~c~~c~~~f~~~e 42 (154)
T PRK00464 29 RECLACGKRFTTFE 42 (154)
T ss_pred eeccccCCcceEeE
Confidence 67777777776543
No 68
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=64.05 E-value=4.6 Score=28.41 Aligned_cols=10 Identities=50% Similarity=1.381 Sum_probs=6.3
Q ss_pred CceecCcCCC
Q psy679 430 KPFSCDVCGR 439 (510)
Q Consensus 430 KPfkC~~CgK 439 (510)
.++.|+.||.
T Consensus 16 ~~~~CP~Cg~ 25 (33)
T cd00350 16 APWVCPVCGA 25 (33)
T ss_pred CCCcCcCCCC
Confidence 5566777764
No 69
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=63.22 E-value=4 Score=32.80 Aligned_cols=29 Identities=31% Similarity=0.613 Sum_probs=23.7
Q ss_pred cccCCCceecCcCCCccCChHHHHHHHHH
Q psy679 425 THTGEKPFSCDVCGRKFARSDEKKRHAKV 453 (510)
Q Consensus 425 tHtGeKPfkC~~CgKaFsrsssLkrHlRt 453 (510)
.-.||--++|+-|++.|.....+.+|...
T Consensus 11 ~RDGE~~lrCPRC~~~FR~~K~Y~RHVNK 39 (65)
T COG4049 11 DRDGEEFLRCPRCGMVFRRRKDYIRHVNK 39 (65)
T ss_pred ccCCceeeeCCchhHHHHHhHHHHHHhhH
Confidence 34577778999999999999999999854
No 70
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=62.71 E-value=6.3 Score=27.10 Aligned_cols=23 Identities=17% Similarity=0.557 Sum_probs=17.6
Q ss_pred CcccCCCCCCCccCChHHHHHHHHH
Q psy679 373 PYACPVENCDRRFSRSDELTRHIRI 397 (510)
Q Consensus 373 Py~C~~e~CgKsFssss~LkrH~rt 397 (510)
+|.|.+ |++.|.+...+..|++.
T Consensus 3 ~~~C~~--C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKL--CNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccc--cCCccCCHHHHHHHHCh
Confidence 467844 88888888888888764
No 71
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=62.47 E-value=4.5 Score=35.11 Aligned_cols=32 Identities=38% Similarity=0.688 Sum_probs=23.1
Q ss_pred CceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCC
Q psy679 402 KPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFAR 443 (510)
Q Consensus 402 KPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsr 443 (510)
..|.|+.|++. .+.|+-+ .-+.|..||..|+-
T Consensus 34 ~~~~Cp~C~~~--------~VkR~a~--GIW~C~kCg~~fAG 65 (89)
T COG1997 34 AKHVCPFCGRT--------TVKRIAT--GIWKCRKCGAKFAG 65 (89)
T ss_pred cCCcCCCCCCc--------ceeeecc--CeEEcCCCCCeecc
Confidence 46899999875 2444444 45899999998864
No 72
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=62.39 E-value=6.2 Score=32.60 Aligned_cols=24 Identities=29% Similarity=0.897 Sum_probs=20.3
Q ss_pred CcccCCCCCCCccCChHHHHHHHHHh
Q psy679 373 PYACPVENCDRRFSRSDELTRHIRIH 398 (510)
Q Consensus 373 Py~C~~e~CgKsFssss~LkrH~rtH 398 (510)
.+.|.+ |++.|.+...|..||+.+
T Consensus 50 ~~~C~~--C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 50 SFRCPY--CNKTFRSREALQEHMRSK 73 (100)
T ss_dssp SEEBSS--SS-EESSHHHHHHHHHHT
T ss_pred CCCCCc--cCCCCcCHHHHHHHHcCc
Confidence 688966 999999999999999975
No 73
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=61.23 E-value=12 Score=33.93 Aligned_cols=80 Identities=21% Similarity=0.357 Sum_probs=50.7
Q ss_pred CCcccCCCCCCCccCChHHHHHHHHHhcCCCc------------eecccCCCccCCchhhhhhhhcccCCCceecCcCCC
Q psy679 372 RPYACPVENCDRRFSRSDELTRHIRIHTGQKP------------FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGR 439 (510)
Q Consensus 372 KPy~C~~e~CgKsFssss~LkrH~rtHtgeKP------------fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgK 439 (510)
-|-.|++ |+-..-.+.+|.|...---.-++ ..|--|.+.|........-. -.....|+|..|..
T Consensus 14 LP~~Cpi--CgLtLVss~HLARSyHHLfPl~~f~ev~~~~~~~~~~C~~C~~~f~~~~~~~~~~--~~~~~~y~C~~C~~ 89 (112)
T TIGR00622 14 LPVECPI--CGLTLILSTHLARSYHHLFPLKAFQEIPLEEYNGSRFCFGCQGPFPKPPVSPFDE--LKDSHRYVCAVCKN 89 (112)
T ss_pred CCCcCCc--CCCEEeccchHHHhhhccCCCcccccccccccCCCCcccCcCCCCCCcccccccc--cccccceeCCCCCC
Confidence 3556866 99998888888764211001111 23888888887543211100 12235799999999
Q ss_pred ccCChHHHHHHHHHhh
Q psy679 440 KFARSDEKKRHAKVHL 455 (510)
Q Consensus 440 aFsrsssLkrHlRtHt 455 (510)
.|-..-....|...|.
T Consensus 90 ~FC~dCD~fiHe~Lh~ 105 (112)
T TIGR00622 90 VFCVDCDVFVHESLHC 105 (112)
T ss_pred ccccccchhhhhhccC
Confidence 9998888888887775
No 74
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=57.80 E-value=6.9 Score=36.17 Aligned_cols=33 Identities=18% Similarity=0.291 Sum_probs=21.5
Q ss_pred eecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHH
Q psy679 404 FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEK 447 (510)
Q Consensus 404 fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssL 447 (510)
..|+.|+++|+. | +..|..|++||..|.....+
T Consensus 10 r~Cp~cg~kFYD---L--------nk~p~vcP~cg~~~~~~~~~ 42 (129)
T TIGR02300 10 RICPNTGSKFYD---L--------NRRPAVSPYTGEQFPPEEAL 42 (129)
T ss_pred ccCCCcCccccc---c--------CCCCccCCCcCCccCcchhh
Confidence 468888887754 2 23577788888777655333
No 75
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=56.71 E-value=7.4 Score=28.50 Aligned_cols=26 Identities=35% Similarity=0.593 Sum_probs=15.8
Q ss_pred CCCceecCcCCCccCCh----HHHHHHHHH
Q psy679 428 GEKPFSCDVCGRKFARS----DEKKRHAKV 453 (510)
Q Consensus 428 GeKPfkC~~CgKaFsrs----ssLkrHlRt 453 (510)
++...+|..|++.+... +.|.+|++.
T Consensus 13 ~~~~a~C~~C~~~~~~~~~~ts~l~~HL~~ 42 (45)
T PF02892_consen 13 DKKKAKCKYCGKVIKYSSGGTSNLKRHLKK 42 (45)
T ss_dssp CSS-EEETTTTEE-----SSTHHHHHHHHH
T ss_pred CcCeEEeCCCCeEEeeCCCcHHHHHHhhhh
Confidence 45567888898887764 778888843
No 76
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=54.28 E-value=9 Score=29.13 Aligned_cols=24 Identities=38% Similarity=0.818 Sum_probs=17.3
Q ss_pred eecCcCCCccCCh-----HHHHHHHH-Hhh
Q psy679 432 FSCDVCGRKFARS-----DEKKRHAK-VHL 455 (510)
Q Consensus 432 fkC~~CgKaFsrs-----ssLkrHlR-tHt 455 (510)
-.|..|++.+... ++|.+|++ +|.
T Consensus 19 a~C~~C~~~l~~~~~~gTs~L~rHl~~~h~ 48 (50)
T smart00614 19 AKCKYCGKKLSRSSKGGTSNLRRHLRRKHP 48 (50)
T ss_pred EEecCCCCEeeeCCCCCcHHHHHHHHhHCc
Confidence 4588888887665 58888887 453
No 77
>KOG2482|consensus
Probab=53.80 E-value=8.3 Score=41.10 Aligned_cols=81 Identities=16% Similarity=0.315 Sum_probs=46.2
Q ss_pred CcccCCCCCCCccCChHHHHHHHHH--hcCCCc--------eeccc--CCCccCC-chhhhhhhh----cc---------
Q psy679 373 PYACPVENCDRRFSRSDELTRHIRI--HTGQKP--------FQCRI--CMRSFSR-SDHLTTHIR----TH--------- 426 (510)
Q Consensus 373 Py~C~~e~CgKsFssss~LkrH~rt--HtgeKP--------fkC~~--CgKsF~~-ks~L~rH~r----tH--------- 426 (510)
.+.|-+ |.|.|+.+..|+.|||. |....| |.=.. -+|+... ...+.+-.. .+
T Consensus 195 r~~CLy--CekifrdkntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~e 272 (423)
T KOG2482|consen 195 RLRCLY--CEKIFRDKNTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNE 272 (423)
T ss_pred hheeee--eccccCCcHHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhc
Confidence 467966 99999999999999985 544322 11110 2222111 011100000 00
Q ss_pred cCCCc--eecCcCCCccCChHHHHHHHHHhh
Q psy679 427 TGEKP--FSCDVCGRKFARSDEKKRHAKVHL 455 (510)
Q Consensus 427 tGeKP--fkC~~CgKaFsrsssLkrHlRtHt 455 (510)
.+..+ .+|-.|.+..-....|..||++.+
T Consensus 273 d~a~a~~v~CLfC~~~~en~~~l~eHmk~vH 303 (423)
T KOG2482|consen 273 DDAEALSVVCLFCTNFYENPVFLFEHMKIVH 303 (423)
T ss_pred CCCCccceEEEeeccchhhHHHHHHHHHHHH
Confidence 01122 478889888888888999998644
No 78
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=53.27 E-value=13 Score=34.56 Aligned_cols=38 Identities=16% Similarity=0.564 Sum_probs=25.9
Q ss_pred CCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccC
Q psy679 401 QKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFA 442 (510)
Q Consensus 401 eKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFs 442 (510)
..-|.|+.|+..|.....+.. . +. +..|.|+.||....
T Consensus 97 ~~~Y~Cp~C~~~y~~~ea~~~-~--d~-~~~f~Cp~Cg~~l~ 134 (147)
T smart00531 97 NAYYKCPNCQSKYTFLEANQL-L--DM-DGTFTCPRCGEELE 134 (147)
T ss_pred CcEEECcCCCCEeeHHHHHHh-c--CC-CCcEECCCCCCEEE
Confidence 456999999999886544332 0 11 33499999998754
No 79
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=52.03 E-value=11 Score=26.83 Aligned_cols=34 Identities=26% Similarity=0.536 Sum_probs=19.0
Q ss_pred eecccCCCccCCchhhhhhhhcccCCCceecCcCCCccC
Q psy679 404 FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFA 442 (510)
Q Consensus 404 fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFs 442 (510)
+.|+.|+..|.-...... .......|..|+..|.
T Consensus 3 ~~CP~C~~~~~v~~~~~~-----~~~~~v~C~~C~~~~~ 36 (38)
T TIGR02098 3 IQCPNCKTSFRVVDSQLG-----ANGGKVRCGKCGHVWY 36 (38)
T ss_pred EECCCCCCEEEeCHHHcC-----CCCCEEECCCCCCEEE
Confidence 467777777655443221 1112467777777664
No 80
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=51.98 E-value=8.2 Score=35.85 Aligned_cols=27 Identities=33% Similarity=0.576 Sum_probs=16.4
Q ss_pred CCceecCcCCCccCChHHHHHHHHHhhccc
Q psy679 429 EKPFSCDVCGRKFARSDEKKRHAKVHLKQR 458 (510)
Q Consensus 429 eKPfkC~~CgKaFsrsssLkrHlRtHtgeK 458 (510)
+.-..|-+|||.|.. |++|++.|+|-.
T Consensus 70 ~d~i~clecGk~~k~---LkrHL~~~~glt 96 (132)
T PF05443_consen 70 PDYIICLECGKKFKT---LKRHLRTHHGLT 96 (132)
T ss_dssp SS-EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred cCeeEEccCCcccch---HHHHHHHccCCC
Confidence 344678899998875 589999997743
No 81
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=47.88 E-value=15 Score=26.57 Aligned_cols=34 Identities=26% Similarity=0.643 Sum_probs=19.7
Q ss_pred eecccCCCccCCchhhhhhhhcccCCCceecCcCCCccC
Q psy679 404 FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFA 442 (510)
Q Consensus 404 fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFs 442 (510)
..|+.|+..|.-..+-. ..+.+..+|..|+..|.
T Consensus 3 i~CP~C~~~f~v~~~~l-----~~~~~~vrC~~C~~~f~ 36 (37)
T PF13719_consen 3 ITCPNCQTRFRVPDDKL-----PAGGRKVRCPKCGHVFR 36 (37)
T ss_pred EECCCCCceEEcCHHHc-----ccCCcEEECCCCCcEee
Confidence 35777777776554321 12334567777777664
No 82
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=46.89 E-value=15 Score=34.66 Aligned_cols=34 Identities=21% Similarity=0.468 Sum_probs=26.3
Q ss_pred CCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccC
Q psy679 400 GQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFA 442 (510)
Q Consensus 400 geKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFs 442 (510)
...-|.|+.|+..|.....+. .-|.|+.||....
T Consensus 106 ~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~L~ 139 (158)
T TIGR00373 106 NNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAMLD 139 (158)
T ss_pred CCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCEee
Confidence 456699999999888777664 2599999997654
No 83
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=46.01 E-value=9.7 Score=31.85 Aligned_cols=14 Identities=29% Similarity=0.740 Sum_probs=6.8
Q ss_pred CCceecC--cCCCccC
Q psy679 429 EKPFSCD--VCGRKFA 442 (510)
Q Consensus 429 eKPfkC~--~CgKaFs 442 (510)
++.+.|. .||..|.
T Consensus 25 ~~Y~qC~N~eCg~tF~ 40 (72)
T PRK09678 25 ERYHQCQNVNCSATFI 40 (72)
T ss_pred eeeeecCCCCCCCEEE
Confidence 3444554 4555554
No 84
>PRK13185 chlL protochlorophyllide reductase iron-sulfur ATP-binding protein; Provisional
Probab=45.76 E-value=3.4 Score=41.20 Aligned_cols=18 Identities=44% Similarity=0.590 Sum_probs=15.8
Q ss_pred cccCCCCCceeeeeecCC
Q psy679 29 ISNSGGDGKTTSFVNLSH 46 (510)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~ 46 (510)
|.+-||-||||+-+||.+
T Consensus 7 v~~KGGVGKTT~~~nLA~ 24 (270)
T PRK13185 7 VYGKGGIGKSTTSSNLSA 24 (270)
T ss_pred EECCCCCCHHHHHHHHHH
Confidence 347899999999999986
No 85
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=44.73 E-value=20 Score=35.94 Aligned_cols=33 Identities=18% Similarity=0.498 Sum_probs=24.7
Q ss_pred CCCceecCcCCCccCChHHHHHHHHHhhcccch
Q psy679 428 GEKPFSCDVCGRKFARSDEKKRHAKVHLKQRSK 460 (510)
Q Consensus 428 GeKPfkC~~CgKaFsrsssLkrHlRtHtgeK~~ 460 (510)
.+..|.|..|+|.|.-..-.++|+...|.++..
T Consensus 74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH~e~ve 106 (214)
T PF04959_consen 74 DEDKWRCPLCGKLFKGPEFVRKHIFNKHPEKVE 106 (214)
T ss_dssp SSEEEEE-SSS-EESSHHHHHHHHHHH-HHHHH
T ss_pred cCCEECCCCCCcccCChHHHHHHHhhcCHHHHH
Confidence 455699999999999999999999877766543
No 86
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=43.29 E-value=13 Score=27.47 Aligned_cols=29 Identities=24% Similarity=0.754 Sum_probs=17.3
Q ss_pred eecccCCCccCCchhhhhhhhcccCCCceecCcCCC
Q psy679 404 FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGR 439 (510)
Q Consensus 404 fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgK 439 (510)
|+|..|+..|..... ... ...-.|+.||.
T Consensus 6 y~C~~Cg~~fe~~~~------~~~-~~~~~CP~Cg~ 34 (42)
T PF09723_consen 6 YRCEECGHEFEVLQS------ISE-DDPVPCPECGS 34 (42)
T ss_pred EEeCCCCCEEEEEEE------cCC-CCCCcCCCCCC
Confidence 777778777753222 222 35567877875
No 87
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=43.12 E-value=22 Score=25.64 Aligned_cols=33 Identities=21% Similarity=0.598 Sum_probs=20.0
Q ss_pred eecccCCCccCCchhhhhhhhcccCCCceecCcCCCcc
Q psy679 404 FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKF 441 (510)
Q Consensus 404 fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaF 441 (510)
..|+.|...|.-.+.... ...+..+|..|+..|
T Consensus 3 i~Cp~C~~~y~i~d~~ip-----~~g~~v~C~~C~~~f 35 (36)
T PF13717_consen 3 ITCPNCQAKYEIDDEKIP-----PKGRKVRCSKCGHVF 35 (36)
T ss_pred EECCCCCCEEeCCHHHCC-----CCCcEEECCCCCCEe
Confidence 457777777766554321 223456788887766
No 88
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=43.11 E-value=12 Score=26.70 Aligned_cols=30 Identities=20% Similarity=0.639 Sum_probs=16.2
Q ss_pred eecccCCCccCCchhhhhhhhcccCCCceecCcCCCc
Q psy679 404 FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRK 440 (510)
Q Consensus 404 fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKa 440 (510)
|+|..|++.|...... .. ...-.|+.||..
T Consensus 6 y~C~~Cg~~fe~~~~~------~~-~~~~~CP~Cg~~ 35 (41)
T smart00834 6 YRCEDCGHTFEVLQKI------SD-DPLATCPECGGD 35 (41)
T ss_pred EEcCCCCCEEEEEEec------CC-CCCCCCCCCCCc
Confidence 6777777766432221 11 234567777753
No 89
>KOG2186|consensus
Probab=42.75 E-value=11 Score=38.54 Aligned_cols=47 Identities=21% Similarity=0.590 Sum_probs=36.5
Q ss_pred cccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhc
Q psy679 374 YACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRT 425 (510)
Q Consensus 374 y~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rt 425 (510)
|.|.+ ||....... +.+|+..-++ .-|.|..|++.|.+ .....|..-
T Consensus 4 FtCnv--CgEsvKKp~-vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kC 50 (276)
T KOG2186|consen 4 FTCNV--CGESVKKPQ-VEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKC 50 (276)
T ss_pred Eehhh--hhhhccccc-hHHHHHhccC-CeeEEeeccccccc-chhhhhhhh
Confidence 77954 998877654 6779987666 56999999999988 667778653
No 90
>KOG2785|consensus
Probab=42.03 E-value=18 Score=39.06 Aligned_cols=73 Identities=21% Similarity=0.387 Sum_probs=47.8
Q ss_pred cccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCC---CccCChHHHHHH
Q psy679 374 YACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCG---RKFARSDEKKRH 450 (510)
Q Consensus 374 y~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~Cg---KaFsrsssLkrH 450 (510)
-.|-+ |++.|+.-.....||..|+|- |.=.. .-......|...+....+ .-|.|-.|+ +.|......+.|
T Consensus 167 t~CLf--C~~~~k~~e~~~~HM~~~Hgf--fIPdr--eYL~D~~GLl~YLgeKV~-~~~~CL~CN~~~~~f~sleavr~H 239 (390)
T KOG2785|consen 167 TDCLF--CDKKSKSLEENLKHMFKEHGF--FIPDR--EYLTDEKGLLKYLGEKVG-IGFICLFCNELGRPFSSLEAVRAH 239 (390)
T ss_pred cceee--cCCCcccHHHHHHHHhhccCC--cCCch--HhhhchhHHHHHHHHHhc-cCceEEEeccccCcccccHHHHHH
Confidence 34855 999999999999999987762 11100 001223334444433332 348899998 999999999999
Q ss_pred HHH
Q psy679 451 AKV 453 (510)
Q Consensus 451 lRt 453 (510)
|..
T Consensus 240 M~~ 242 (390)
T KOG2785|consen 240 MRD 242 (390)
T ss_pred Hhh
Confidence 963
No 91
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=41.49 E-value=17 Score=25.84 Aligned_cols=25 Identities=28% Similarity=0.748 Sum_probs=13.9
Q ss_pred ceecccCCCccCCchhhhhhhhcccCCCceecCcCCC
Q psy679 403 PFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGR 439 (510)
Q Consensus 403 PfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgK 439 (510)
.|+|..||..+... +.|.+|+.||.
T Consensus 2 ~~~C~~CG~i~~g~------------~~p~~CP~Cg~ 26 (34)
T cd00729 2 VWVCPVCGYIHEGE------------EAPEKCPICGA 26 (34)
T ss_pred eEECCCCCCEeECC------------cCCCcCcCCCC
Confidence 36677776554221 14557777764
No 92
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=41.22 E-value=17 Score=34.98 Aligned_cols=25 Identities=40% Similarity=0.995 Sum_probs=20.0
Q ss_pred CceecccCCCccCCchhhhhhhhcccCCCceecCcCCC
Q psy679 402 KPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGR 439 (510)
Q Consensus 402 KPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgK 439 (510)
+-|.|.+||.. |.|+-|-+|++||.
T Consensus 133 ~~~vC~vCGy~-------------~~ge~P~~CPiCga 157 (166)
T COG1592 133 KVWVCPVCGYT-------------HEGEAPEVCPICGA 157 (166)
T ss_pred CEEEcCCCCCc-------------ccCCCCCcCCCCCC
Confidence 35999999865 44688999999993
No 93
>KOG0978|consensus
Probab=40.92 E-value=6.6 Score=45.37 Aligned_cols=21 Identities=24% Similarity=0.586 Sum_probs=15.9
Q ss_pred CceecCcCCCccCChHHHHHH
Q psy679 430 KPFSCDVCGRKFARSDEKKRH 450 (510)
Q Consensus 430 KPfkC~~CgKaFsrsssLkrH 450 (510)
+.-+|+.|+.+|...+-+..|
T Consensus 677 RqRKCP~Cn~aFganDv~~I~ 697 (698)
T KOG0978|consen 677 RQRKCPKCNAAFGANDVHRIH 697 (698)
T ss_pred hcCCCCCCCCCCCcccccccC
Confidence 445799999999888776555
No 94
>cd02032 Bchl_like This family of proteins contains bchL and chlL. Protochlorophyllide reductase catalyzes the reductive formation of chlorophyllide from protochlorophyllide during biosynthesis of chlorophylls and bacteriochlorophylls. Three genes, bchL, bchN and bchB, are involved in light-independent protochlorophyllide reduction in bacteriochlorophyll biosynthesis. In cyanobacteria, algae, and gymnosperms, three similar genes, chlL, chlN and chlB are involved in protochlorophyllide reduction during chlorophylls biosynthesis. BchL/chlL, bchN/chlN and bchB/chlB exhibit significant sequence similarity to the nifH, nifD and nifK subunits of nitrogenase, respectively. Nitrogenase catalyzes the reductive formation of ammonia from dinitrogen.
Probab=40.76 E-value=4.4 Score=40.34 Aligned_cols=18 Identities=44% Similarity=0.567 Sum_probs=16.0
Q ss_pred cccCCCCCceeeeeecCC
Q psy679 29 ISNSGGDGKTTSFVNLSH 46 (510)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~ 46 (510)
+++-||-||||+-+||.+
T Consensus 5 v~gKGGvGKTT~a~nLA~ 22 (267)
T cd02032 5 VYGKGGIGKSTTSSNLSV 22 (267)
T ss_pred EecCCCCCHHHHHHHHHH
Confidence 457899999999999986
No 95
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=40.35 E-value=14 Score=28.01 Aligned_cols=11 Identities=27% Similarity=0.990 Sum_probs=6.2
Q ss_pred eecccCCCccC
Q psy679 404 FQCRICMRSFS 414 (510)
Q Consensus 404 fkC~~CgKsF~ 414 (510)
|+|..|+..|.
T Consensus 6 y~C~~Cg~~fe 16 (52)
T TIGR02605 6 YRCTACGHRFE 16 (52)
T ss_pred EEeCCCCCEeE
Confidence 55666665554
No 96
>KOG2593|consensus
Probab=40.25 E-value=22 Score=39.03 Aligned_cols=38 Identities=18% Similarity=0.568 Sum_probs=26.5
Q ss_pred CCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCc
Q psy679 400 GQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRK 440 (510)
Q Consensus 400 geKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKa 440 (510)
...-|.|+.|.+.|.....+.- .-...-.|.|..|+-.
T Consensus 125 ~~~~Y~Cp~C~kkyt~Lea~~L---~~~~~~~F~C~~C~ge 162 (436)
T KOG2593|consen 125 NVAGYVCPNCQKKYTSLEALQL---LDNETGEFHCENCGGE 162 (436)
T ss_pred ccccccCCccccchhhhHHHHh---hcccCceEEEecCCCc
Confidence 4567999999999987666542 2222346999999744
No 97
>PRK13231 nitrogenase reductase-like protein; Reviewed
Probab=40.19 E-value=6 Score=39.31 Aligned_cols=18 Identities=39% Similarity=0.617 Sum_probs=15.8
Q ss_pred cccCCCCCceeeeeecCC
Q psy679 29 ISNSGGDGKTTSFVNLSH 46 (510)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~ 46 (510)
|.+-||-||||+-+||.+
T Consensus 7 v~~KGGvGKTT~a~nLA~ 24 (264)
T PRK13231 7 IYGKGGIGKSTTVSNMAA 24 (264)
T ss_pred EECCCCCcHHHHHHHHhc
Confidence 347899999999999987
No 98
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=40.08 E-value=15 Score=34.38 Aligned_cols=25 Identities=32% Similarity=0.469 Sum_probs=19.1
Q ss_pred eecCcCCCccCChHHHHHHHHHhhcccc
Q psy679 432 FSCDVCGRKFARSDEKKRHAKVHLKQRS 459 (510)
Q Consensus 432 fkC~~CgKaFsrsssLkrHlRtHtgeK~ 459 (510)
..|-+|||.|. .|++|+.+|.+--+
T Consensus 77 IicLEDGkkfK---SLKRHL~t~~gmTP 101 (148)
T COG4957 77 IICLEDGKKFK---SLKRHLTTHYGLTP 101 (148)
T ss_pred EEEeccCcchH---HHHHHHhcccCCCH
Confidence 56888888886 58899988887433
No 99
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=39.98 E-value=18 Score=27.06 Aligned_cols=12 Identities=25% Similarity=0.772 Sum_probs=7.2
Q ss_pred ceecccCCCccC
Q psy679 403 PFQCRICMRSFS 414 (510)
Q Consensus 403 PfkC~~CgKsF~ 414 (510)
.|+|..|+..|.
T Consensus 3 ~y~C~~CG~~~~ 14 (46)
T PRK00398 3 EYKCARCGREVE 14 (46)
T ss_pred EEECCCCCCEEE
Confidence 466666666553
No 100
>PF14353 CpXC: CpXC protein
Probab=38.33 E-value=8.1 Score=34.72 Aligned_cols=19 Identities=32% Similarity=0.565 Sum_probs=9.0
Q ss_pred eecccCCCccCCchhhhhh
Q psy679 404 FQCRICMRSFSRSDHLTTH 422 (510)
Q Consensus 404 fkC~~CgKsF~~ks~L~rH 422 (510)
|.|+.||+.|.-...+.-|
T Consensus 39 ~~CP~Cg~~~~~~~p~lY~ 57 (128)
T PF14353_consen 39 FTCPSCGHKFRLEYPLLYH 57 (128)
T ss_pred EECCCCCCceecCCCEEEE
Confidence 4455555555444444333
No 101
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=37.66 E-value=23 Score=34.26 Aligned_cols=34 Identities=24% Similarity=0.565 Sum_probs=25.8
Q ss_pred CCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCC
Q psy679 401 QKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFAR 443 (510)
Q Consensus 401 eKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsr 443 (510)
..-|.|+.|++.|.....+. .-|.|+.||.....
T Consensus 115 ~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L~~ 148 (178)
T PRK06266 115 NMFFFCPNCHIRFTFDEAME---------YGFRCPQCGEMLEE 148 (178)
T ss_pred CCEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCCee
Confidence 35699999999887766653 25999999977554
No 102
>PRK13235 nifH nitrogenase reductase; Reviewed
Probab=37.63 E-value=5.2 Score=40.11 Aligned_cols=38 Identities=34% Similarity=0.401 Sum_probs=26.1
Q ss_pred cccCCCCCceeeeeecCCCCCCCccceeeeeecCceeeecCCCCceeeeeccchhhc
Q psy679 29 ISNSGGDGKTTSFVNLSHPPTRSIPEYLILVEMPKLRVLKRPKGCSCIVDISPYLES 85 (510)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (510)
|.+-||-||||+-|||.+-- -++ ..|| ++||..|=...
T Consensus 6 v~~KGGVGKTT~~~nLA~~L----------a~~-G~rV--------LlID~Dpq~~~ 43 (274)
T PRK13235 6 IYGKGGIGKSTTTQNTVAGL----------AEM-GKKV--------MVVGCDPKADS 43 (274)
T ss_pred EeCCCCccHHHHHHHHHHHH----------HHC-CCcE--------EEEecCCcccc
Confidence 34789999999999998632 222 2244 47888886543
No 103
>KOG2461|consensus
Probab=37.30 E-value=1.8e+02 Score=31.75 Aligned_cols=55 Identities=7% Similarity=-0.116 Sum_probs=23.5
Q ss_pred ccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCC
Q psy679 384 RFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCG 438 (510)
Q Consensus 384 sFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~Cg 438 (510)
.+.....+..|...|.++..+.|..+...+.....+..+..+|+..+.+.|..|+
T Consensus 340 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (396)
T KOG2461|consen 340 TPAGQLIYTQSHSMEVAEPTDMAPNQIWKIYHTGVLGFLIITTDESECNNMSFVC 394 (396)
T ss_pred ccccccchhhhhhcccCCCCcccccccccceeccccceeeeeccccccccccccC
Confidence 3333333444444444444444444333333333334444444444445444443
No 104
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=36.92 E-value=9.6 Score=30.66 Aligned_cols=27 Identities=30% Similarity=0.599 Sum_probs=14.6
Q ss_pred CCCCCcccCCCCCCCccCChHHHHHHHHH
Q psy679 369 VHERPYACPVENCDRRFSRSDELTRHIRI 397 (510)
Q Consensus 369 ~~EKPy~C~~e~CgKsFssss~LkrH~rt 397 (510)
.+|.-+.|+ -|++.|....+..+|...
T Consensus 13 DGE~~lrCP--RC~~~FR~~K~Y~RHVNK 39 (65)
T COG4049 13 DGEEFLRCP--RCGMVFRRRKDYIRHVNK 39 (65)
T ss_pred CCceeeeCC--chhHHHHHhHHHHHHhhH
Confidence 344555553 366666655555555543
No 105
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=36.20 E-value=37 Score=31.86 Aligned_cols=26 Identities=27% Similarity=0.512 Sum_probs=19.7
Q ss_pred CCcccCCCCCCCccCChHHHHHHHHHhcCCC
Q psy679 372 RPYACPVENCDRRFSRSDELTRHIRIHTGQK 402 (510)
Q Consensus 372 KPy~C~~e~CgKsFssss~LkrH~rtHtgeK 402 (510)
.--.| -+|||.|++ |+||.++|.+--
T Consensus 75 D~Iic--LEDGkkfKS---LKRHL~t~~gmT 100 (148)
T COG4957 75 DYIIC--LEDGKKFKS---LKRHLTTHYGLT 100 (148)
T ss_pred CeEEE--eccCcchHH---HHHHHhcccCCC
Confidence 33467 579999976 999999987743
No 106
>cd02117 NifH_like This family contains the NifH (iron protein) of nitrogenase, L subunit (BchL/ChlL) of the protochlorophyllide reductase and the BchX subunit of the Chlorophyllide reductase. Members of this family use energey from ATP hydrolysis and transfer electrons through a Fe4-S4 cluster to other subunit for reduction of substrate.
Probab=36.03 E-value=5.6 Score=38.32 Aligned_cols=18 Identities=50% Similarity=0.597 Sum_probs=15.6
Q ss_pred cccCCCCCceeeeeecCC
Q psy679 29 ISNSGGDGKTTSFVNLSH 46 (510)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~ 46 (510)
|.+-||.||||+-+||.+
T Consensus 5 v~gKGGvGKTt~~~nLA~ 22 (212)
T cd02117 5 IYGKGGIGKSTTSQNLSA 22 (212)
T ss_pred EECCCcCcHHHHHHHHHH
Confidence 346899999999999976
No 107
>PF01780 Ribosomal_L37ae: Ribosomal L37ae protein family; InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=36.02 E-value=15 Score=32.14 Aligned_cols=31 Identities=35% Similarity=0.888 Sum_probs=21.4
Q ss_pred CceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccC
Q psy679 402 KPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFA 442 (510)
Q Consensus 402 KPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFs 442 (510)
..|.|++|++.- + .|.-.| -+.|..|++.|+
T Consensus 34 ~ky~Cp~Cgk~~-----v---kR~a~G--IW~C~~C~~~~A 64 (90)
T PF01780_consen 34 AKYTCPFCGKTS-----V---KRVATG--IWKCKKCGKKFA 64 (90)
T ss_dssp S-BEESSSSSSE-----E---EEEETT--EEEETTTTEEEE
T ss_pred CCCcCCCCCCce-----e---EEeeeE--EeecCCCCCEEe
Confidence 469999999753 1 233334 499999999885
No 108
>KOG2071|consensus
Probab=35.09 E-value=32 Score=39.14 Aligned_cols=27 Identities=37% Similarity=0.844 Sum_probs=21.2
Q ss_pred CCCcccCCCCCCCccCChHHHHHHHHHhc
Q psy679 371 ERPYACPVENCDRRFSRSDELTRHIRIHT 399 (510)
Q Consensus 371 EKPy~C~~e~CgKsFssss~LkrH~rtHt 399 (510)
.++..|.. ||++|.+.....+||..|.
T Consensus 416 ~~pnqC~~--CG~R~~~~ee~sk~md~H~ 442 (579)
T KOG2071|consen 416 DSPNQCKS--CGLRFDDSEERSKHMDIHD 442 (579)
T ss_pred CCcchhcc--cccccccchhhhhHhhhhh
Confidence 56788955 9999999988877776653
No 109
>cd02040 NifH NifH gene encodes component II (iron protein) of nitrogenase. Nitrogenase is responsible for the biological nitrogen fixation, i.e. reduction of molecular nitrogen to ammonia. NifH consists of two oxygen-sensitive metallosulfur proteins: the mollybdenum-iron (alternatively, vanadium-iron or iron-iron) protein (commonly referred to as component 1), and the iron protein (commonly referred to as component 2). The iron protein is a homodimer, with an Fe4S4 cluster bound between the subunits and two ATP-binding domains. It supplies energy by ATP hydrolysis, and transfers electrons from reduced ferredoxin or flavodoxin to component 1 for the reduction of molecular nitrogen to ammonia.
Probab=34.87 E-value=6 Score=39.05 Aligned_cols=17 Identities=47% Similarity=0.565 Sum_probs=15.3
Q ss_pred ccCCCCCceeeeeecCC
Q psy679 30 SNSGGDGKTTSFVNLSH 46 (510)
Q Consensus 30 ~~~~~~~~~~~~~~~~~ 46 (510)
.+-||-||||+-+||.+
T Consensus 7 ~~KGGvGKTT~~~nLA~ 23 (270)
T cd02040 7 YGKGGIGKSTTTQNLSA 23 (270)
T ss_pred EeCCcCCHHHHHHHHHH
Confidence 46899999999999986
No 110
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=34.26 E-value=32 Score=26.04 Aligned_cols=11 Identities=27% Similarity=0.637 Sum_probs=6.3
Q ss_pred eecccCCCccC
Q psy679 404 FQCRICMRSFS 414 (510)
Q Consensus 404 fkC~~CgKsF~ 414 (510)
|.|..|+..|.
T Consensus 3 Y~C~~Cg~~~~ 13 (44)
T smart00659 3 YICGECGRENE 13 (44)
T ss_pred EECCCCCCEee
Confidence 55666665554
No 111
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=33.31 E-value=8.8 Score=38.00 Aligned_cols=44 Identities=25% Similarity=0.648 Sum_probs=29.1
Q ss_pred CCCcccCCCCCCCccCChHHHHHHHHH---h-------cCCCc-----eecccCCCccCCc
Q psy679 371 ERPYACPVENCDRRFSRSDELTRHIRI---H-------TGQKP-----FQCRICMRSFSRS 416 (510)
Q Consensus 371 EKPy~C~~e~CgKsFssss~LkrH~rt---H-------tgeKP-----fkC~~CgKsF~~k 416 (510)
+|.+.|++ |++.|....-+....|. . .+..| ..|+.|+.+|...
T Consensus 3 ~k~~~CPv--C~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 3 DKKITCPV--CGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CCceECCC--CCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 46678977 99999887655444432 1 12223 6799999987654
No 112
>COG1192 Soj ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]
Probab=32.79 E-value=6.9 Score=38.65 Aligned_cols=36 Identities=36% Similarity=0.558 Sum_probs=25.4
Q ss_pred cccCCCCCceeeeeecCCCCCCCccceeeeeecCceeeecCCCCceeeeeccch
Q psy679 29 ISNSGGDGKTTSFVNLSHPPTRSIPEYLILVEMPKLRVLKRPKGCSCIVDISPY 82 (510)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (510)
.+..||-||||+-+||.+- |-..-+.|| .+||+.|-
T Consensus 8 ~n~KGGvGKTT~a~nLa~~----------La~~~~~kV--------LliDlDpQ 43 (259)
T COG1192 8 ANQKGGVGKTTTAVNLAAA----------LAKRGGKKV--------LLIDLDPQ 43 (259)
T ss_pred EecCCCccHHHHHHHHHHH----------HHHhcCCcE--------EEEeCCCc
Confidence 4678999999999999862 222222455 47999985
No 113
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=32.56 E-value=25 Score=31.91 Aligned_cols=28 Identities=18% Similarity=0.300 Sum_probs=21.4
Q ss_pred ecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCC
Q psy679 405 QCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFAR 443 (510)
Q Consensus 405 kC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsr 443 (510)
.|+.|++.|.. | +..|..|++||+.|.+
T Consensus 11 idPetg~KFYD---L--------NrdPiVsPytG~s~P~ 38 (129)
T COG4530 11 IDPETGKKFYD---L--------NRDPIVSPYTGKSYPR 38 (129)
T ss_pred cCccccchhhc---c--------CCCccccCcccccchH
Confidence 68889988864 2 2478899999999943
No 114
>KOG2482|consensus
Probab=32.38 E-value=45 Score=35.76 Aligned_cols=23 Identities=22% Similarity=0.504 Sum_probs=21.1
Q ss_pred ceecCcCCCccCChHHHHHHHHH
Q psy679 431 PFSCDVCGRKFARSDEKKRHAKV 453 (510)
Q Consensus 431 PfkC~~CgKaFsrsssLkrHlRt 453 (510)
.+.|-.|.|.|+.+..|+.|||.
T Consensus 195 r~~CLyCekifrdkntLkeHMrk 217 (423)
T KOG2482|consen 195 RLRCLYCEKIFRDKNTLKEHMRK 217 (423)
T ss_pred hheeeeeccccCCcHHHHHHHHh
Confidence 47899999999999999999975
No 115
>PRK13230 nitrogenase reductase-like protein; Reviewed
Probab=32.25 E-value=7.3 Score=39.17 Aligned_cols=18 Identities=33% Similarity=0.580 Sum_probs=15.8
Q ss_pred cccCCCCCceeeeeecCC
Q psy679 29 ISNSGGDGKTTSFVNLSH 46 (510)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~ 46 (510)
|.+-||-||||+-+||.+
T Consensus 6 ~~gKGGVGKTT~a~nLA~ 23 (279)
T PRK13230 6 FYGKGGIGKSTTVCNIAA 23 (279)
T ss_pred EECCCCCcHHHHHHHHHH
Confidence 446899999999999986
No 116
>PRK13233 nifH nitrogenase reductase; Reviewed
Probab=32.12 E-value=7.9 Score=38.71 Aligned_cols=17 Identities=35% Similarity=0.440 Sum_probs=15.1
Q ss_pred ccCCCCCceeeeeecCC
Q psy679 30 SNSGGDGKTTSFVNLSH 46 (510)
Q Consensus 30 ~~~~~~~~~~~~~~~~~ 46 (510)
.+-||-||||+-|||.+
T Consensus 8 ~~KGGVGKTT~a~nLA~ 24 (275)
T PRK13233 8 YGKGGIGKSTTTQNTAA 24 (275)
T ss_pred EcCCCCcHHHHHHHHHH
Confidence 37899999999999975
No 117
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=31.76 E-value=73 Score=23.83 Aligned_cols=38 Identities=16% Similarity=0.406 Sum_probs=21.3
Q ss_pred HHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCC
Q psy679 393 RHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGR 439 (510)
Q Consensus 393 rH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgK 439 (510)
.++....-..-+.|+.|+.. ....+. +.+.|+|..|++
T Consensus 8 ~~l~~~RW~~g~~CP~Cg~~--~~~~~~-------~~~~~~C~~C~~ 45 (46)
T PF12760_consen 8 EYLEEIRWPDGFVCPHCGST--KHYRLK-------TRGRYRCKACRK 45 (46)
T ss_pred HHHHHhcCCCCCCCCCCCCe--eeEEeC-------CCCeEECCCCCC
Confidence 34433333334789999864 111111 156799999975
No 118
>TIGR01287 nifH nitrogenase iron protein. This model describes nitrogenase (EC 1.18.6.1) iron protein, also called nitrogenase reductase or nitrogenase component II. This model includes molybdenum-iron nitrogenase reductase (nifH), vanadium-iron nitrogenase reductase (vnfH), and iron-iron nitrogenase reductase (anfH). The model excludes the homologous protein from the light-independent protochlorophyllide reductase.
Probab=31.75 E-value=9.7 Score=38.10 Aligned_cols=18 Identities=39% Similarity=0.566 Sum_probs=15.8
Q ss_pred cccCCCCCceeeeeecCC
Q psy679 29 ISNSGGDGKTTSFVNLSH 46 (510)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~ 46 (510)
|.+-||-||||+-+||.+
T Consensus 5 ~~gKGGVGKTT~a~nLA~ 22 (275)
T TIGR01287 5 IYGKGGIGKSTTTQNIAA 22 (275)
T ss_pred EeCCCcCcHHHHHHHHHH
Confidence 467899999999999974
No 119
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=31.09 E-value=32 Score=40.28 Aligned_cols=13 Identities=38% Similarity=0.815 Sum_probs=10.3
Q ss_pred CCCceecCcCCCc
Q psy679 428 GEKPFSCDVCGRK 440 (510)
Q Consensus 428 GeKPfkC~~CgKa 440 (510)
...|+.|+.||..
T Consensus 472 ~~~p~~Cp~Cgs~ 484 (730)
T COG1198 472 EPIPQSCPECGSE 484 (730)
T ss_pred CCCCCCCCCCCCC
Confidence 4578999999865
No 120
>KOG2785|consensus
Probab=30.49 E-value=38 Score=36.63 Aligned_cols=24 Identities=33% Similarity=0.699 Sum_probs=17.8
Q ss_pred CceecccCC---CccCCchhhhhhhhc
Q psy679 402 KPFQCRICM---RSFSRSDHLTTHIRT 425 (510)
Q Consensus 402 KPfkC~~Cg---KsF~~ks~L~rH~rt 425 (510)
+-|.|-.|+ +.|.+....+.||.-
T Consensus 216 ~~~~CL~CN~~~~~f~sleavr~HM~~ 242 (390)
T KOG2785|consen 216 IGFICLFCNELGRPFSSLEAVRAHMRD 242 (390)
T ss_pred cCceEEEeccccCcccccHHHHHHHhh
Confidence 347788887 788888888888753
No 121
>CHL00072 chlL photochlorophyllide reductase subunit L
Probab=29.90 E-value=8.9 Score=39.39 Aligned_cols=18 Identities=39% Similarity=0.560 Sum_probs=16.2
Q ss_pred cccCCCCCceeeeeecCC
Q psy679 29 ISNSGGDGKTTSFVNLSH 46 (510)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~ 46 (510)
|.+-||-||||+-+||.+
T Consensus 5 ~~gKGGVGKTTta~nLA~ 22 (290)
T CHL00072 5 VYGKGGIGKSTTSCNISI 22 (290)
T ss_pred EECCCCCcHHHHHHHHHH
Confidence 578899999999999986
No 122
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=29.52 E-value=11 Score=37.60 Aligned_cols=28 Identities=21% Similarity=0.565 Sum_probs=17.7
Q ss_pred CceecccCCCccCCchhhhhhhhcccCC
Q psy679 402 KPFQCRICMRSFSRSDHLTTHIRTHTGE 429 (510)
Q Consensus 402 KPfkC~~CgKsF~~ks~L~rH~rtHtGe 429 (510)
..|.|..|+|.|....-..+|+...|.+
T Consensus 76 ~K~~C~lc~KlFkg~eFV~KHI~nKH~e 103 (214)
T PF04959_consen 76 DKWRCPLCGKLFKGPEFVRKHIFNKHPE 103 (214)
T ss_dssp EEEEE-SSS-EESSHHHHHHHHHHH-HH
T ss_pred CEECCCCCCcccCChHHHHHHHhhcCHH
Confidence 4577878888887777777777665543
No 123
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=29.29 E-value=34 Score=26.75 Aligned_cols=11 Identities=27% Similarity=0.836 Sum_probs=6.1
Q ss_pred ceecccCCCcc
Q psy679 403 PFQCRICMRSF 413 (510)
Q Consensus 403 PfkC~~CgKsF 413 (510)
-|+|..|++.|
T Consensus 6 ~Y~C~~Cg~~~ 16 (49)
T COG1996 6 EYKCARCGREV 16 (49)
T ss_pred EEEhhhcCCee
Confidence 35555555555
No 124
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=29.03 E-value=52 Score=34.93 Aligned_cols=49 Identities=18% Similarity=0.491 Sum_probs=31.9
Q ss_pred ecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHHHHHHHhh
Q psy679 405 QCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKKRHAKVHL 455 (510)
Q Consensus 405 kC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLkrHlRtHt 455 (510)
.|-.|...|.....-..-.. +..-.|.|+.|...|-..-..-.|...|.
T Consensus 364 ~Cf~CQ~~fp~~~~~~~~~~--~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh~ 412 (421)
T COG5151 364 HCFVCQGPFPKPPVSPFDES--TSSGRYQCELCKSTFCSDCDVFIHETLHF 412 (421)
T ss_pred cceeccCCCCCCCCCccccc--ccccceechhhhhhhhhhhHHHHHHHHhh
Confidence 37777777754332221111 22346999999999988888888877764
No 125
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=28.73 E-value=50 Score=29.62 Aligned_cols=30 Identities=27% Similarity=0.732 Sum_probs=23.1
Q ss_pred cccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCc
Q psy679 374 YACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRS 416 (510)
Q Consensus 374 y~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~k 416 (510)
..| ..||++|-. | +..|..|+.||..|...
T Consensus 10 R~C--p~CG~kFYD---L--------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTC--PSCGAKFYD---L--------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred ccC--CCCcchhcc---C--------CCCCccCCCCCCccCcc
Confidence 458 569999954 2 34688999999999766
No 126
>TIGR03371 cellulose_yhjQ cellulose synthase operon protein YhjQ. Members of this family are the YhjQ protein, found immediately upsteam of bacterial cellulose synthase (bcs) genes in a broad range of bacteria, including both copies of the bcs locus in Klebsiella pneumoniae. In several species it is seen clearly as part of the bcs operon. It is identified as a probable component of the bacterial cellulose metabolic process not only by gene location, but also by partial phylogenetic profiling, or Haft-Selengut algorithm (PubMed:16930487), based on a bacterial cellulose biosynthesis genome property profile. Cellulose plays an important role in biofilm formation and structural integrity in some bacteria. Mutants in yhjQ in Escherichia coli, show altered morphology an growth, but the function of YhjQ has not yet been determined.
Probab=28.41 E-value=9.3 Score=37.06 Aligned_cols=18 Identities=44% Similarity=0.575 Sum_probs=15.6
Q ss_pred cccCCCCCceeeeeecCC
Q psy679 29 ISNSGGDGKTTSFVNLSH 46 (510)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~ 46 (510)
.+..||.||||+-+||.+
T Consensus 7 ~s~KGGvGKTt~a~nla~ 24 (246)
T TIGR03371 7 VGVKGGVGKTTLTANLAS 24 (246)
T ss_pred EeCCCCccHHHHHHHHHH
Confidence 356899999999999975
No 127
>PF01656 CbiA: CobQ/CobB/MinD/ParA nucleotide binding domain; InterPro: IPR002586 This entry consists of various cobyrinic acid a,c-diamide synthases. These include CbiA and CbiP from Salmonella typhimurium []., and CobQ from Rhodobacter capsulatus []. These amidases catalyse amidations to various side chains of hydrogenobyrinic acid or cobyrinic acid a,c-diamide in the biosynthesis of cobalamin (vitamin B12) from uroporphyrinogen III. Vitamin B12 is an important cofactor and an essential nutrient for many plants and animals and is primarily produced by bacteria [].; PDB: 3K9G_A 3K9H_B 3EZ9_B 3EZF_A 3EZ2_B 3EZ6_A 3EZ7_A 1G3Q_A 1G3R_A 1DTS_A ....
Probab=28.40 E-value=11 Score=34.72 Aligned_cols=18 Identities=50% Similarity=0.617 Sum_probs=15.1
Q ss_pred cccCCCCCceeeeeecCC
Q psy679 29 ISNSGGDGKTTSFVNLSH 46 (510)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~ 46 (510)
.+..||-||||+-+||..
T Consensus 4 ~~~kGG~GKTt~a~~la~ 21 (195)
T PF01656_consen 4 TSGKGGVGKTTIAANLAQ 21 (195)
T ss_dssp EESSTTSSHHHHHHHHHH
T ss_pred EcCCCCccHHHHHHHHHh
Confidence 467899999999998863
No 128
>PRK10037 cell division protein; Provisional
Probab=28.28 E-value=9.6 Score=37.75 Aligned_cols=18 Identities=28% Similarity=0.320 Sum_probs=15.7
Q ss_pred cccCCCCCceeeeeecCC
Q psy679 29 ISNSGGDGKTTSFVNLSH 46 (510)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~ 46 (510)
.++-||-||||+-+||.+
T Consensus 7 ~n~KGGvGKTT~a~nLA~ 24 (250)
T PRK10037 7 QGVRGGVGTTSITAALAW 24 (250)
T ss_pred ecCCCCccHHHHHHHHHH
Confidence 467899999999999975
No 129
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=28.23 E-value=39 Score=31.42 Aligned_cols=30 Identities=30% Similarity=0.709 Sum_probs=17.3
Q ss_pred CCCCCCcccCCCCCCCccCChHHHHHHHHHhcCCC
Q psy679 368 PVHERPYACPVENCDRRFSRSDELTRHIRIHTGQK 402 (510)
Q Consensus 368 h~~EKPy~C~~e~CgKsFssss~LkrH~rtHtgeK 402 (510)
...+.--.| -.|||.|+. |++|.+.|+|--
T Consensus 67 SI~~d~i~c--lecGk~~k~---LkrHL~~~~glt 96 (132)
T PF05443_consen 67 SITPDYIIC--LECGKKFKT---LKRHLRTHHGLT 96 (132)
T ss_dssp TB-SS-EE---TBT--EESB---HHHHHHHTT-S-
T ss_pred ccccCeeEE--ccCCcccch---HHHHHHHccCCC
Confidence 334455678 459999988 799999997753
No 130
>KOG0782|consensus
Probab=27.53 E-value=20 Score=40.64 Aligned_cols=54 Identities=19% Similarity=0.415 Sum_probs=34.5
Q ss_pred ChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChH
Q psy679 387 RSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSD 445 (510)
Q Consensus 387 sss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrss 445 (510)
+...+.+|--+|.....=+|..|+|.|..+..+..-. .....|.-|..+|..+.
T Consensus 237 rE~~fvrHHWVHrrRqeGkC~~CgKgFQQKf~FhsKE-----ivAisCSWCKqayH~Kv 290 (1004)
T KOG0782|consen 237 RESGFVRHHWVHRRRQEGKCNTCGKGFQQKFFFHSKE-----IVAISCSWCKQAYHLKV 290 (1004)
T ss_pred ccccchHHhHhhHhhhccccchhhhhhhhheeecccc-----EEEEEehHHHHHhhcch
Confidence 3446777777776655668999999997765543211 12356877877775543
No 131
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=27.00 E-value=62 Score=29.78 Aligned_cols=47 Identities=21% Similarity=0.487 Sum_probs=27.3
Q ss_pred HHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCCh
Q psy679 391 LTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARS 444 (510)
Q Consensus 391 LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrs 444 (510)
...+++.+... -.|+.|+... ..++-..-.|...|+|..|++.|...
T Consensus 20 ~~~~~~~~~~~--~~cP~C~s~~-----~~k~g~~~~~~qRyrC~~C~~tf~~~ 66 (129)
T COG3677 20 AAYAIRMQITK--VNCPRCKSSN-----VVKIGGIRRGHQRYKCKSCGSTFTVE 66 (129)
T ss_pred HHHHHhhhccc--CcCCCCCccc-----eeeECCccccccccccCCcCcceeee
Confidence 34566665442 3688886543 11222222235679999999999753
No 132
>KOG3598|consensus
Probab=26.15 E-value=19 Score=44.82 Aligned_cols=20 Identities=25% Similarity=0.212 Sum_probs=11.0
Q ss_pred CCCCceeeeeecCCCCCCCc
Q psy679 33 GGDGKTTSFVNLSHPPTRSI 52 (510)
Q Consensus 33 ~~~~~~~~~~~~~~~~~~~~ 52 (510)
+-.|-+|+--|+-||-..++
T Consensus 1846 ~p~~~~~s~~~~hhp~~~~~ 1865 (2220)
T KOG3598|consen 1846 SPKDDVTSEKNEHHPRASDA 1865 (2220)
T ss_pred CCCCCCCChHhhcCCCchhh
Confidence 33444556666666655555
No 133
>PHA02518 ParA-like protein; Provisional
Probab=25.73 E-value=12 Score=35.34 Aligned_cols=18 Identities=44% Similarity=0.697 Sum_probs=15.3
Q ss_pred cccCCCCCceeeeeecCC
Q psy679 29 ISNSGGDGKTTSFVNLSH 46 (510)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~ 46 (510)
.+.-||-||||+-+||.+
T Consensus 6 ~~~KGGvGKTT~a~~la~ 23 (211)
T PHA02518 6 LNQKGGAGKTTVATNLAS 23 (211)
T ss_pred EcCCCCCCHHHHHHHHHH
Confidence 356799999999999875
No 134
>PHA00626 hypothetical protein
Probab=25.41 E-value=43 Score=27.00 Aligned_cols=14 Identities=21% Similarity=0.612 Sum_probs=9.1
Q ss_pred CceecCcCCCccCC
Q psy679 430 KPFSCDVCGRKFAR 443 (510)
Q Consensus 430 KPfkC~~CgKaFsr 443 (510)
..|+|..||..|+.
T Consensus 22 nrYkCkdCGY~ft~ 35 (59)
T PHA00626 22 DDYVCCDCGYNDSK 35 (59)
T ss_pred cceEcCCCCCeech
Confidence 45777777766654
No 135
>PF09818 ABC_ATPase: Predicted ATPase of the ABC class; InterPro: IPR019195 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energise diverse biological systems. ABC transporters minimally consist of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain []. The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyse ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarise the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site [, , ]. The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis [, , , , , ]. The ATP-Binding Cassette (ABC) superfamily forms one of the largest of all protein families with a diversity of physiological functions []. Several studies have shown that there is a correlation between the functional characterisation and the phylogenetic classification of the ABC cassette [, ]. More than 50 subfamilies have been described based on a phylogenetic and functional classification [, , ]; (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry consists of various predicted ABC transporter class ATPases.
Probab=24.97 E-value=35 Score=37.68 Aligned_cols=80 Identities=31% Similarity=0.420 Sum_probs=50.7
Q ss_pred CccccccCCCCCceeeeeecCCCCCCCcc----ceeeeeec-CceeeecCCCC-ceeeeeccchhhcccCCccccCCCcc
Q psy679 25 GADQISNSGGDGKTTSFVNLSHPPTRSIP----EYLILVEM-PKLRVLKRPKG-CSCIVDISPYLESLTGTRGSLSGNSF 98 (510)
Q Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~-~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 98 (510)
|.--|...|--||||+.--|..---.-|| ||+|-+.- -|+|- -.| .---||||||+..|-+- .+=
T Consensus 245 GvTlIvGGGyHGKSTLL~Ale~GVYnHipGDGRE~VVT~~~avkirA---EDGR~V~~vDISpFI~~LP~g------~dT 315 (448)
T PF09818_consen 245 GVTLIVGGGYHGKSTLLEALERGVYNHIPGDGREFVVTDPDAVKIRA---EDGRSVEGVDISPFINNLPGG------KDT 315 (448)
T ss_pred cEEEEECCCCccHHHHHHHHHhcccCCCCCCCceEEEECCCceEEEe---cCCceEeCccchHHHhhCCCC------CCC
Confidence 44557777889999997666544444444 78877642 23433 122 22348999999999753 222
Q ss_pred CCCCCCCCCCCcccc
Q psy679 99 ERPSTKDDSGELCQP 113 (510)
Q Consensus 99 ~~~~~~~~~~~~~~~ 113 (510)
..=+|.|-||---|-
T Consensus 316 ~~FsT~~ASGSTSqA 330 (448)
T PF09818_consen 316 TCFSTENASGSTSQA 330 (448)
T ss_pred CcccccCCCchHHHH
Confidence 345899999976553
No 136
>KOG4377|consensus
Probab=24.68 E-value=34 Score=37.40 Aligned_cols=81 Identities=26% Similarity=0.568 Sum_probs=0.0
Q ss_pred cccCCCCCCCccCChHHHHHHHHHhcCCCc-------eecccCC--CccCCchhhhhhhhcccCCCc-------------
Q psy679 374 YACPVENCDRRFSRSDELTRHIRIHTGQKP-------FQCRICM--RSFSRSDHLTTHIRTHTGEKP------------- 431 (510)
Q Consensus 374 y~C~~e~CgKsFssss~LkrH~rtHtgeKP-------fkC~~Cg--KsF~~ks~L~rH~rtHtGeKP------------- 431 (510)
|.|....|.| +-++...|...|+..+. |.|..|+ ..|....+-..|.+-+.+++-
T Consensus 314 yhC~~~~C~k---sTsdV~~h~nFht~~~n~GfrrthfhC~r~gCTdtfK~~khk~yh~kdda~~~dGfkkf~k~e~cay 390 (480)
T KOG4377|consen 314 YHCTGQICEK---STSDVLLHDNFHTDKRNNGFRRTHFHCQRIGCTDTFKDSKHKPYHYKDDAGEIDGFKKFFKDENCAY 390 (480)
T ss_pred chhhhcccCc---ccccccccCccccccccCceecceeEEeccCCccccccccccccccCcchhhhhhhhhhhccccCCc
Q ss_pred -----------eecCc--CCCccCChHHHHHHHHHhhcc
Q psy679 432 -----------FSCDV--CGRKFARSDEKKRHAKVHLKQ 457 (510)
Q Consensus 432 -----------fkC~~--CgKaFsrsssLkrHlRtHtge 457 (510)
|.|.. |+..|...+.+..|.|.|.++
T Consensus 391 ~gCkys~~cnhfhc~r~Gc~~tl~s~sqm~shkrkheRq 429 (480)
T KOG4377|consen 391 TGCKYSGICNHFHCDRLGCEATLYSVSQMASHKRKHERQ 429 (480)
T ss_pred cCcccccceeeeeecccCCceEEEehhhhhhhhhhhhhh
No 137
>COG1327 Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]
Probab=24.37 E-value=36 Score=32.49 Aligned_cols=29 Identities=10% Similarity=0.130 Sum_probs=16.4
Q ss_pred hHHHHHHHHHhhcccchhHHHHHHHHHHH
Q psy679 444 SDEKKRHAKVHLKQRSKKESKMAVMMSQQ 472 (510)
Q Consensus 444 sssLkrHlRtHtgeK~~k~~~~~~~~~QQ 472 (510)
.+.|.+-+..-...++........+....
T Consensus 63 r~Kl~~gl~~A~~KRpVs~e~ie~~v~~i 91 (156)
T COG1327 63 REKLRRGLIRACEKRPVSSEQIEEAVSHI 91 (156)
T ss_pred HHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Confidence 45566666666666666555555444443
No 138
>cd02037 MRP-like MRP (Multiple Resistance and pH adaptation) is a homologue of the Fer4_NifH superfamily. Like the other members of the superfamily, MRP contains a ATP-binding domain at the N-termini. It is found in bacteria as a membrane-spanning protein and functions as a Na+/H+ antiporter.
Probab=24.29 E-value=18 Score=33.44 Aligned_cols=18 Identities=50% Similarity=0.674 Sum_probs=15.3
Q ss_pred cccCCCCCceeeeeecCC
Q psy679 29 ISNSGGDGKTTSFVNLSH 46 (510)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~ 46 (510)
.+..||.||||+-+||..
T Consensus 5 ~s~kgG~GKTt~a~~LA~ 22 (169)
T cd02037 5 MSGKGGVGKSTVAVNLAL 22 (169)
T ss_pred ecCCCcCChhHHHHHHHH
Confidence 467899999999999874
No 139
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=24.12 E-value=43 Score=31.99 Aligned_cols=31 Identities=32% Similarity=1.008 Sum_probs=13.6
Q ss_pred ceecccCCCccCCchhhhhhhhcccCCCceecCcCC
Q psy679 403 PFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCG 438 (510)
Q Consensus 403 PfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~Cg 438 (510)
+|.|. |+..|.+ .++|-.+-.|+ .|.|..|+
T Consensus 117 ~Y~C~-C~q~~l~---~RRhn~~~~g~-~YrC~~C~ 147 (156)
T COG3091 117 PYRCQ-CQQHYLR---IRRHNTVRRGE-VYRCGKCG 147 (156)
T ss_pred eEEee-cCCccch---hhhcccccccc-eEEeccCC
Confidence 45555 5554432 12233333333 45555554
No 140
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=24.06 E-value=40 Score=31.91 Aligned_cols=33 Identities=21% Similarity=0.459 Sum_probs=25.8
Q ss_pred CCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccC
Q psy679 371 ERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFS 414 (510)
Q Consensus 371 EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~ 414 (510)
..-|.| +.|+..|+....+. .-|.|+.||....
T Consensus 107 ~~~Y~C--p~c~~r~tf~eA~~---------~~F~Cp~Cg~~L~ 139 (158)
T TIGR00373 107 NMFFIC--PNMCVRFTFNEAME---------LNFTCPRCGAMLD 139 (158)
T ss_pred CCeEEC--CCCCcEeeHHHHHH---------cCCcCCCCCCEee
Confidence 456899 56999998888774 2699999998653
No 141
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=23.91 E-value=35 Score=31.21 Aligned_cols=32 Identities=28% Similarity=0.791 Sum_probs=17.5
Q ss_pred ceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccC
Q psy679 403 PFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFA 442 (510)
Q Consensus 403 PfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFs 442 (510)
.|.|..|+..+. +|.+. ..+.|.|..|+..|.
T Consensus 123 ~~~C~~C~~~~~------r~~~~--~~~~~~C~~C~~~l~ 154 (157)
T PF10263_consen 123 VYRCPSCGREYK------RHRRS--KRKRYRCGRCGGPLV 154 (157)
T ss_pred EEEcCCCCCEee------eeccc--chhhEECCCCCCEEE
Confidence 366777766552 23333 223477777776554
No 142
>KOG2071|consensus
Probab=23.61 E-value=46 Score=37.85 Aligned_cols=31 Identities=29% Similarity=0.707 Sum_probs=24.8
Q ss_pred CCCceecCcCCCccCChHHHHHHHHHhhccc
Q psy679 428 GEKPFSCDVCGRKFARSDEKKRHAKVHLKQR 458 (510)
Q Consensus 428 GeKPfkC~~CgKaFsrsssLkrHlRtHtgeK 458 (510)
..+|..|..||++|.......+||-.|...+
T Consensus 415 ~~~pnqC~~CG~R~~~~ee~sk~md~H~dwh 445 (579)
T KOG2071|consen 415 KDSPNQCKSCGLRFDDSEERSKHMDIHDDWH 445 (579)
T ss_pred cCCcchhcccccccccchhhhhHhhhhhhhh
Confidence 3567889999999999888888888776543
No 143
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=23.38 E-value=36 Score=25.07 Aligned_cols=12 Identities=25% Similarity=0.958 Sum_probs=9.0
Q ss_pred ceecCcCCCccC
Q psy679 431 PFSCDVCGRKFA 442 (510)
Q Consensus 431 PfkC~~CgKaFs 442 (510)
-|.|..|++.|.
T Consensus 28 fy~C~~C~~~w~ 39 (40)
T smart00440 28 FYVCTKCGHRWR 39 (40)
T ss_pred EEEeCCCCCEeC
Confidence 478888887764
No 144
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=23.25 E-value=41 Score=29.43 Aligned_cols=32 Identities=19% Similarity=0.517 Sum_probs=21.9
Q ss_pred CceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCC
Q psy679 402 KPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFAR 443 (510)
Q Consensus 402 KPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsr 443 (510)
..|.|++|++.- +. |. +...+.|..|++.|+-
T Consensus 35 a~y~CpfCgk~~-----vk---R~--a~GIW~C~~C~~~~AG 66 (90)
T PTZ00255 35 AKYFCPFCGKHA-----VK---RQ--AVGIWRCKGCKKTVAG 66 (90)
T ss_pred CCccCCCCCCCc-----ee---ee--eeEEEEcCCCCCEEeC
Confidence 469999998642 11 12 2356899999998864
No 145
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=23.24 E-value=60 Score=26.35 Aligned_cols=31 Identities=29% Similarity=0.739 Sum_probs=16.8
Q ss_pred ceecccCCCc-cCCchhhhhhhhcccCCCceecCcCCC
Q psy679 403 PFQCRICMRS-FSRSDHLTTHIRTHTGEKPFSCDVCGR 439 (510)
Q Consensus 403 PfkC~~CgKs-F~~ks~L~rH~rtHtGeKPfkC~~CgK 439 (510)
.|.|+.|++. -.+-..- |. -..+|+|+.||.
T Consensus 25 ~F~CPnCG~~~I~RC~~C----Rk--~~~~Y~CP~CGF 56 (59)
T PRK14890 25 KFLCPNCGEVIIYRCEKC----RK--QSNPYTCPKCGF 56 (59)
T ss_pred EeeCCCCCCeeEeechhH----Hh--cCCceECCCCCC
Confidence 4777777765 2221111 11 125788888873
No 146
>TIGR01281 DPOR_bchL light-independent protochlorophyllide reductase, iron-sulfur ATP-binding protein. The BchL peptide (ChlL in chloroplast and cyanobacteria) is an ATP-binding iron-sulfur protein of the dark form protochlorophyllide reductase, an enzyme similar to nitrogenase. This subunit resembles the nitrogenase NifH subunit.
Probab=23.07 E-value=17 Score=36.14 Aligned_cols=17 Identities=47% Similarity=0.540 Sum_probs=14.7
Q ss_pred ccCCCCCceeeeeecCC
Q psy679 30 SNSGGDGKTTSFVNLSH 46 (510)
Q Consensus 30 ~~~~~~~~~~~~~~~~~ 46 (510)
..-||-||||+-+||.+
T Consensus 6 ~gKGGVGKTT~~~nLA~ 22 (268)
T TIGR01281 6 YGKGGIGKSTTSSNLSV 22 (268)
T ss_pred EcCCcCcHHHHHHHHHH
Confidence 35799999999999875
No 147
>PF01927 Mut7-C: Mut7-C RNAse domain; InterPro: IPR002782 This prokaryotic family of proteins have no known function. The proteins contain four conserved cysteines that may be involved in metal binding or disulphide bridges.
Probab=23.05 E-value=64 Score=29.90 Aligned_cols=47 Identities=15% Similarity=0.369 Sum_probs=27.9
Q ss_pred eecccCCCccCCchh--hh----hhhhcccCCCceecCcCCCccCChHHHHHHH
Q psy679 404 FQCRICMRSFSRSDH--LT----THIRTHTGEKPFSCDVCGRKFARSDEKKRHA 451 (510)
Q Consensus 404 fkC~~CgKsF~~ks~--L~----rH~rtHtGeKPfkC~~CgKaFsrsssLkrHl 451 (510)
-.|..|+..+..-.. .. ..... ..+.-+.|..|||.|-..+++++-.
T Consensus 92 sRC~~CN~~L~~v~~~~v~~~vp~~v~~-~~~~f~~C~~C~kiyW~GsH~~~~~ 144 (147)
T PF01927_consen 92 SRCPKCNGPLRPVSKEEVKDRVPPYVYE-TYDEFWRCPGCGKIYWEGSHWRRME 144 (147)
T ss_pred CccCCCCcEeeechhhccccccCccccc-cCCeEEECCCCCCEecccccHHHHH
Confidence 469999875533211 10 01111 1234578999999998887776644
No 148
>PF15135 UPF0515: Uncharacterised protein UPF0515
Probab=22.80 E-value=62 Score=33.33 Aligned_cols=62 Identities=24% Similarity=0.594 Sum_probs=34.2
Q ss_pred CCCCCCCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccC
Q psy679 366 KTPVHERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFA 442 (510)
Q Consensus 366 kth~~EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFs 442 (510)
.++...+-|.| ..|...+ ..|.-....--+|..|.+.|---.. -.--|.--|.|..|++.|.
T Consensus 105 ~ip~~drqFaC--~~Cd~~W--------wRrvp~rKeVSRCr~C~~rYDPVP~-----dkmwG~aef~C~~C~h~F~ 166 (278)
T PF15135_consen 105 LIPSVDRQFAC--SSCDHMW--------WRRVPQRKEVSRCRKCRKRYDPVPC-----DKMWGIAEFHCPKCRHNFR 166 (278)
T ss_pred cccccceeeec--cccchHH--------HhccCcccccccccccccccCCCcc-----ccccceeeeecccccccch
Confidence 44556688999 5586443 1122222334567778776643221 0112344588888887775
No 149
>cd02036 MinD Bacterial cell division requires the formation of a septum at mid-cell. The site is determined by the min operon products MinC, MinD and MinE. MinC is a nonspecific inhibitor of the septum protein FtsZ. MinE is the supressor of MinC. MinD plays a pivotal role, selecting the mid-cell over other sites through the activation and regulation of MinC and MinE. MinD is a membrane-associated ATPase, related to nitrogenase iron protein. More distantly related proteins include flagellar biosynthesis proteins and ParA chromosome partitioning proteins. MinD is a monomer.
Probab=22.66 E-value=18 Score=32.93 Aligned_cols=18 Identities=50% Similarity=0.639 Sum_probs=15.1
Q ss_pred cccCCCCCceeeeeecCC
Q psy679 29 ISNSGGDGKTTSFVNLSH 46 (510)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~ 46 (510)
.+..||-||||+-+||..
T Consensus 5 ~~~kgG~GKtt~a~~la~ 22 (179)
T cd02036 5 TSGKGGVGKTTTTANLGT 22 (179)
T ss_pred eeCCCCCCHHHHHHHHHH
Confidence 356899999999999864
No 150
>TIGR01969 minD_arch cell division ATPase MinD, archaeal. This model represents the archaeal branch of the MinD family. MinD, a weak ATPase, works in bacteria with MinC as a generalized cell division inhibitor and, through interaction with MinE, prevents septum placement inappropriate sites. Often several members of this family are found in archaeal genomes, and the function is uncharacterized. More distantly related proteins include flagellar biosynthesis proteins and ParA chromosome partitioning proteins. The exact roles of the various archaeal MinD homologs are unknown.
Probab=22.59 E-value=19 Score=34.93 Aligned_cols=18 Identities=50% Similarity=0.597 Sum_probs=15.7
Q ss_pred cccCCCCCceeeeeecCC
Q psy679 29 ISNSGGDGKTTSFVNLSH 46 (510)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~ 46 (510)
++..||-||||+-+||..
T Consensus 6 ~~~KGGvGKTt~a~~LA~ 23 (251)
T TIGR01969 6 ASGKGGTGKTTITANLGV 23 (251)
T ss_pred EcCCCCCcHHHHHHHHHH
Confidence 467899999999999975
No 151
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=22.51 E-value=76 Score=21.30 Aligned_cols=20 Identities=25% Similarity=0.471 Sum_probs=14.8
Q ss_pred ecCcCCCccCChHHHHHHHHH
Q psy679 433 SCDVCGRKFARSDEKKRHAKV 453 (510)
Q Consensus 433 kC~~CgKaFsrsssLkrHlRt 453 (510)
.|++|++.+ ....+.+|+..
T Consensus 3 ~CPiC~~~v-~~~~in~HLD~ 22 (26)
T smart00734 3 QCPVCFREV-PENLINSHLDS 22 (26)
T ss_pred cCCCCcCcc-cHHHHHHHHHH
Confidence 588888887 55677788753
No 152
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=22.47 E-value=44 Score=32.30 Aligned_cols=33 Identities=24% Similarity=0.612 Sum_probs=25.4
Q ss_pred CCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCC
Q psy679 372 RPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSR 415 (510)
Q Consensus 372 KPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ 415 (510)
.-|.| +.|++.|+....+. .-|.|+.||.....
T Consensus 116 ~~Y~C--p~C~~rytf~eA~~---------~~F~Cp~Cg~~L~~ 148 (178)
T PRK06266 116 MFFFC--PNCHIRFTFDEAME---------YGFRCPQCGEMLEE 148 (178)
T ss_pred CEEEC--CCCCcEEeHHHHhh---------cCCcCCCCCCCCee
Confidence 46899 55999998877653 36999999986543
No 153
>TIGR00244 transcriptional regulator NrdR. Members of this almost entirely bacterial family contain an ATP cone domain (PFAM:PF03477). There is never more than one member per genome. Common gene symbols given include nrdR, ybaD, ribX and ytcG. The member from Streptomyces coelicolor is found upstream in the operon of the class II oxygen-independent ribonucleotide reductase gene nrdJ and was shown to repress nrdJ expression. Many members of this family are found near genes for riboflavin biosynthesis in Gram-negative bacteria, suggesting a role in that pathway. However, a phylogenetic profiling study associates members of this family with the presence of a palindromic signal with consensus acaCwAtATaTwGtgt, termed the NrdR-box, an upstream element for most operons for ribonucleotide reductase of all three classes in bacterial genomes.
Probab=22.45 E-value=43 Score=31.79 Aligned_cols=15 Identities=20% Similarity=0.507 Sum_probs=9.5
Q ss_pred CceecccCCCccCCc
Q psy679 402 KPFQCRICMRSFSRS 416 (510)
Q Consensus 402 KPfkC~~CgKsF~~k 416 (510)
+.-.|..|++.|..-
T Consensus 27 RRReC~~C~~RFTTy 41 (147)
T TIGR00244 27 RRRECLECHERFTTF 41 (147)
T ss_pred ecccCCccCCcccee
Confidence 334677777777653
No 154
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=22.27 E-value=45 Score=29.33 Aligned_cols=27 Identities=33% Similarity=0.860 Sum_probs=18.9
Q ss_pred CceecccCCCccCCchhhhhhhhcccCCCceecCcCC
Q psy679 402 KPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCG 438 (510)
Q Consensus 402 KPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~Cg 438 (510)
+|-.|..||..|.. ..+ .+|-+|+.|.
T Consensus 57 ~Pa~CkkCGfef~~-~~i---------k~pSRCP~CK 83 (97)
T COG3357 57 RPARCKKCGFEFRD-DKI---------KKPSRCPKCK 83 (97)
T ss_pred cChhhcccCccccc-ccc---------CCcccCCcch
Confidence 57788899888865 111 2577888885
No 155
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=22.11 E-value=41 Score=29.51 Aligned_cols=32 Identities=28% Similarity=0.656 Sum_probs=21.8
Q ss_pred CceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCC
Q psy679 402 KPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFAR 443 (510)
Q Consensus 402 KPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsr 443 (510)
..|.|++|++.-. .|.- ...+.|..|++.|+-
T Consensus 34 a~y~CpfCgk~~v--------kR~a--~GIW~C~~C~~~~AG 65 (91)
T TIGR00280 34 AKYVCPFCGKKTV--------KRGS--TGIWTCRKCGAKFAG 65 (91)
T ss_pred cCccCCCCCCCce--------EEEe--eEEEEcCCCCCEEeC
Confidence 4699999986421 1222 346899999998863
No 156
>COG0489 Mrp ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]
Probab=21.58 E-value=20 Score=36.49 Aligned_cols=18 Identities=50% Similarity=0.625 Sum_probs=15.6
Q ss_pred ccccCCCCCceeeeeecC
Q psy679 28 QISNSGGDGKTTSFVNLS 45 (510)
Q Consensus 28 ~~~~~~~~~~~~~~~~~~ 45 (510)
-.|..||+||||..+||.
T Consensus 62 V~S~kgGvGKStva~nLA 79 (265)
T COG0489 62 VTSGKGGVGKSTVAVNLA 79 (265)
T ss_pred EEeCCCCCcHHHHHHHHH
Confidence 357889999999999985
No 157
>cd03110 Fer4_NifH_child This protein family's function is unkown. It contains nucleotide binding site. It uses NTP as energy source to transfer electron or ion.
Probab=21.30 E-value=20 Score=33.23 Aligned_cols=17 Identities=53% Similarity=0.714 Sum_probs=14.9
Q ss_pred cccCCCCCceeeeeecC
Q psy679 29 ISNSGGDGKTTSFVNLS 45 (510)
Q Consensus 29 ~~~~~~~~~~~~~~~~~ 45 (510)
.|..||-||||.-+||.
T Consensus 5 ~s~kgG~GKSt~a~nLA 21 (179)
T cd03110 5 ISGKGGTGKTTVTAALA 21 (179)
T ss_pred EcCCCCCCHHHHHHHHH
Confidence 46789999999999995
No 158
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=21.02 E-value=38 Score=35.05 Aligned_cols=84 Identities=21% Similarity=0.455 Sum_probs=43.9
Q ss_pred CCCCCCcccCCCCCCCccCChHHHHHHHHHh--cCCCceecccCCCccCCchh-------hhhhhhc----ccCCCceec
Q psy679 368 PVHERPYACPVENCDRRFSRSDELTRHIRIH--TGQKPFQCRICMRSFSRSDH-------LTTHIRT----HTGEKPFSC 434 (510)
Q Consensus 368 h~~EKPy~C~~e~CgKsFssss~LkrH~rtH--tgeKPfkC~~CgKsF~~ks~-------L~rH~rt----HtGeKPfkC 434 (510)
..+.+.|.|.+ |.........+ .|+..- .....|+|.-|++.-. -+- ...|+|. ....+++.|
T Consensus 137 ~hGGrif~Csf--C~~flCEDDQF-EHQAsCQvLe~E~~KC~SCNrlGq-~sCLRCK~cfCddHvrrKg~ky~k~k~~PC 212 (314)
T PF06524_consen 137 DHGGRIFKCSF--CDNFLCEDDQF-EHQASCQVLESETFKCQSCNRLGQ-YSCLRCKICFCDDHVRRKGFKYEKGKPIPC 212 (314)
T ss_pred cCCCeEEEeec--CCCeeeccchh-hhhhhhhhhhcccccccccccccc-hhhhheeeeehhhhhhhcccccccCCCCCC
Confidence 34567788855 87544433333 354321 1223577766664211 111 1123332 233478889
Q ss_pred CcCCCccCChHHHHHHHHHhh
Q psy679 435 DVCGRKFARSDEKKRHAKVHL 455 (510)
Q Consensus 435 ~~CgKaFsrsssLkrHlRtHt 455 (510)
+.||..-.....|..-.|+|.
T Consensus 213 PKCg~et~eTkdLSmStR~hk 233 (314)
T PF06524_consen 213 PKCGYETQETKDLSMSTRSHK 233 (314)
T ss_pred CCCCCcccccccceeeeecch
Confidence 889877666666655555554
No 159
>PRK14873 primosome assembly protein PriA; Provisional
Probab=20.69 E-value=29 Score=40.19 Aligned_cols=27 Identities=26% Similarity=0.668 Sum_probs=17.2
Q ss_pred hcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCc
Q psy679 398 HTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRK 440 (510)
Q Consensus 398 HtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKa 440 (510)
|...+...|.+|+.. ..++.|..||..
T Consensus 405 h~~~~~l~Ch~CG~~----------------~~p~~Cp~Cgs~ 431 (665)
T PRK14873 405 PSAGGTPRCRWCGRA----------------APDWRCPRCGSD 431 (665)
T ss_pred ecCCCeeECCCCcCC----------------CcCccCCCCcCC
Confidence 444455667777632 246789999865
No 160
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=20.67 E-value=55 Score=30.20 Aligned_cols=13 Identities=31% Similarity=0.697 Sum_probs=7.2
Q ss_pred ceecccCCCccCC
Q psy679 403 PFQCRICMRSFSR 415 (510)
Q Consensus 403 PfkC~~CgKsF~~ 415 (510)
.+.|..|+..|..
T Consensus 70 ~~~C~~CG~~~~~ 82 (135)
T PRK03824 70 VLKCRNCGNEWSL 82 (135)
T ss_pred EEECCCCCCEEec
Confidence 3566666655543
No 161
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=20.64 E-value=40 Score=24.44 Aligned_cols=31 Identities=23% Similarity=0.451 Sum_probs=16.6
Q ss_pred eecccCCCccCCchhhhhhhhcccCCCceecCcCCCccC
Q psy679 404 FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFA 442 (510)
Q Consensus 404 fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFs 442 (510)
+.|+.|++.|.. ........=+|+.||-.+.
T Consensus 2 r~C~~Cg~~Yh~--------~~~pP~~~~~Cd~cg~~L~ 32 (36)
T PF05191_consen 2 RICPKCGRIYHI--------EFNPPKVEGVCDNCGGELV 32 (36)
T ss_dssp EEETTTTEEEET--------TTB--SSTTBCTTTTEBEB
T ss_pred cCcCCCCCcccc--------ccCCCCCCCccCCCCCeeE
Confidence 457777777632 2222233456777876544
No 162
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.58 E-value=38 Score=34.55 Aligned_cols=40 Identities=28% Similarity=0.665 Sum_probs=25.3
Q ss_pred CCceecccCCCccCCchhhhhhhhcccCCC----------c-----eecCcCCCc
Q psy679 401 QKPFQCRICMRSFSRSDHLTTHIRTHTGEK----------P-----FSCDVCGRK 440 (510)
Q Consensus 401 eKPfkC~~CgKsF~~ks~L~rH~rtHtGeK----------P-----fkC~~CgKa 440 (510)
++.+.|++|+..|....-+..-.|+-.|+- | ..|+.|..+
T Consensus 17 kk~ieCPvC~tkFkkeev~tgsiRiiagDld~~lkygninP~fY~VvvCP~C~yA 71 (267)
T COG1655 17 KKTIECPVCNTKFKKEEVKTGSIRIIAGDLDFFLKYGNINPYFYDVVVCPICYYA 71 (267)
T ss_pred hceeccCcccchhhhhheeccceeEecccccceeeccccCCceeEEEEcchhhHH
Confidence 456778888877776665555555544421 2 369999753
No 163
>PRK13234 nifH nitrogenase reductase; Reviewed
Probab=20.17 E-value=17 Score=37.29 Aligned_cols=37 Identities=38% Similarity=0.425 Sum_probs=25.4
Q ss_pred ccCCCCCceeeeeecCCCCCCCccceeeeeecCceeeecCCCCceeeeeccchhhc
Q psy679 30 SNSGGDGKTTSFVNLSHPPTRSIPEYLILVEMPKLRVLKRPKGCSCIVDISPYLES 85 (510)
Q Consensus 30 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (510)
+.-||-||||+-+||.+- |.++ ..| .+|||..|-..+
T Consensus 10 ~~KGGvGKTt~~~nLa~~----------la~~-g~k--------VLliD~D~q~~~ 46 (295)
T PRK13234 10 YGKGGIGKSTTSQNTLAA----------LVEM-GQK--------ILIVGCDPKADS 46 (295)
T ss_pred ECCCCccHHHHHHHHHHH----------HHHC-CCe--------EEEEeccccccc
Confidence 468999999999999752 2232 223 468898886433
No 164
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=20.05 E-value=56 Score=25.97 Aligned_cols=13 Identities=31% Similarity=0.723 Sum_probs=5.3
Q ss_pred CceecccCCCccC
Q psy679 402 KPFQCRICMRSFS 414 (510)
Q Consensus 402 KPfkC~~CgKsF~ 414 (510)
+.+.|..||+.|-
T Consensus 24 rrhhCr~CG~~vC 36 (69)
T PF01363_consen 24 RRHHCRNCGRVVC 36 (69)
T ss_dssp -EEE-TTT--EEE
T ss_pred eeEccCCCCCEEC
Confidence 4456666666554
No 165
>PRK13849 putative crown gall tumor protein VirC1; Provisional
Probab=20.01 E-value=17 Score=36.16 Aligned_cols=18 Identities=44% Similarity=0.634 Sum_probs=15.6
Q ss_pred cccCCCCCceeeeeecCC
Q psy679 29 ISNSGGDGKTTSFVNLSH 46 (510)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~ 46 (510)
.+.-||-||||+-+||.+
T Consensus 7 ~n~KGGvGKTT~a~nLA~ 24 (231)
T PRK13849 7 CSFKGGAGKTTALMGLCA 24 (231)
T ss_pred ECCCCCccHHHHHHHHHH
Confidence 466799999999999875
Done!