Query psy16294
Match_columns 486
No_of_seqs 638 out of 3673
Neff 7.9
Searched_HMMs 46136
Date Fri Aug 16 17:41:31 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16294.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16294hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2462|consensus 99.9 7.9E-24 1.7E-28 199.8 6.4 135 218-390 128-265 (279)
2 KOG2462|consensus 99.9 2E-23 4.4E-28 197.0 4.3 126 250-410 129-257 (279)
3 KOG3608|consensus 99.7 4.2E-19 9.1E-24 171.5 1.9 237 184-440 142-399 (467)
4 KOG1074|consensus 99.7 9.2E-18 2E-22 177.7 11.1 85 337-423 604-692 (958)
5 KOG3623|consensus 99.7 6.9E-19 1.5E-23 183.0 2.5 72 336-409 892-963 (1007)
6 KOG3608|consensus 99.7 2.5E-18 5.3E-23 166.2 0.8 172 185-395 186-379 (467)
7 KOG1074|consensus 99.6 1.5E-16 3.3E-21 168.6 5.7 57 339-397 880-936 (958)
8 KOG3576|consensus 99.6 4.3E-15 9.3E-20 134.0 5.9 127 218-397 115-241 (267)
9 KOG3623|consensus 99.4 1.4E-13 3.1E-18 143.9 2.1 121 220-390 210-331 (1007)
10 KOG3576|consensus 99.4 1.4E-13 3.1E-18 124.2 1.2 86 248-394 114-199 (267)
11 PLN03086 PRLI-interacting fact 98.9 5E-09 1.1E-13 111.1 8.7 135 220-406 407-551 (567)
12 PLN03086 PRLI-interacting fact 98.7 3.1E-08 6.6E-13 105.2 7.8 137 259-439 410-553 (567)
13 PHA00733 hypothetical protein 98.7 1.5E-08 3.3E-13 88.3 4.0 56 335-394 70-125 (128)
14 PHA02768 hypothetical protein; 98.6 1.3E-08 2.8E-13 73.9 1.9 44 338-385 5-48 (55)
15 PF13465 zf-H2C2_2: Zinc-finge 98.4 1.7E-07 3.7E-12 58.1 1.9 26 353-380 1-26 (26)
16 PHA00733 hypothetical protein 98.3 3.4E-07 7.3E-12 79.9 1.9 84 218-324 38-123 (128)
17 PHA02768 hypothetical protein; 98.1 1.2E-06 2.5E-11 63.8 1.7 42 221-272 6-47 (55)
18 PHA00616 hypothetical protein 98.0 2.1E-06 4.6E-11 59.4 1.8 33 366-400 1-33 (44)
19 PF13465 zf-H2C2_2: Zinc-finge 98.0 2E-06 4.3E-11 53.3 1.1 22 271-292 1-22 (26)
20 PHA00616 hypothetical protein 97.9 3.8E-06 8.2E-11 58.2 1.3 34 338-371 1-34 (44)
21 KOG3993|consensus 97.9 2E-06 4.4E-11 86.2 -0.5 59 334-394 291-382 (500)
22 KOG3993|consensus 97.8 2.4E-06 5.2E-11 85.7 -1.9 147 219-392 266-482 (500)
23 PHA00732 hypothetical protein 97.7 1.9E-05 4.2E-10 62.7 2.5 47 338-392 1-48 (79)
24 PF05605 zf-Di19: Drought indu 97.6 6.3E-05 1.4E-09 55.3 4.1 51 338-393 2-54 (54)
25 COG5189 SFP1 Putative transcri 97.5 5.1E-05 1.1E-09 73.7 2.1 27 362-390 394-420 (423)
26 PF00096 zf-C2H2: Zinc finger, 97.3 0.00014 3E-09 43.4 1.9 23 367-391 1-23 (23)
27 COG5189 SFP1 Putative transcri 97.2 0.00015 3.3E-09 70.5 1.8 31 362-392 345-375 (423)
28 PHA00732 hypothetical protein 97.1 0.00041 9E-09 55.1 3.2 42 366-411 1-42 (79)
29 PF13894 zf-C2H2_4: C2H2-type 97.1 0.00034 7.3E-09 41.8 2.1 24 367-392 1-24 (24)
30 PF00096 zf-C2H2: Zinc finger, 97.0 0.00032 7E-09 41.8 1.4 23 339-361 1-23 (23)
31 PF13912 zf-C2H2_6: C2H2-type 96.9 0.0006 1.3E-08 42.3 1.7 26 366-393 1-26 (27)
32 COG5048 FOG: Zn-finger [Genera 96.8 0.00051 1.1E-08 71.3 1.3 161 219-402 288-452 (467)
33 PF05605 zf-Di19: Drought indu 96.7 0.0022 4.9E-08 47.0 4.1 51 220-280 2-53 (54)
34 PF13894 zf-C2H2_4: C2H2-type 96.2 0.0026 5.7E-08 37.7 1.6 23 339-361 1-23 (24)
35 PF12756 zf-C2H2_2: C2H2 type 96.2 0.003 6.5E-08 51.9 2.4 25 366-392 50-74 (100)
36 PF09237 GAGA: GAGA factor; I 95.9 0.005 1.1E-07 43.8 2.1 33 362-396 20-52 (54)
37 KOG2231|consensus 95.9 0.015 3.2E-07 63.3 6.4 95 339-437 183-283 (669)
38 PF13912 zf-C2H2_6: C2H2-type 95.8 0.0057 1.2E-07 37.8 1.7 25 338-362 1-25 (27)
39 COG5048 FOG: Zn-finger [Genera 95.7 0.0081 1.8E-07 62.3 3.7 70 337-406 288-361 (467)
40 smart00355 ZnF_C2H2 zinc finge 95.6 0.01 2.2E-07 35.6 2.3 24 367-392 1-24 (26)
41 PF12756 zf-C2H2_2: C2H2 type 95.5 0.0087 1.9E-07 49.1 2.2 24 338-361 50-73 (100)
42 PRK04860 hypothetical protein; 95.1 0.0095 2.1E-07 53.9 1.5 40 337-382 118-157 (160)
43 smart00355 ZnF_C2H2 zinc finge 94.9 0.02 4.4E-07 34.2 2.0 24 339-362 1-24 (26)
44 PF13909 zf-H2C2_5: C2H2-type 94.8 0.02 4.3E-07 34.3 1.8 24 367-393 1-24 (24)
45 PF12874 zf-met: Zinc-finger o 94.1 0.031 6.8E-07 33.7 1.6 23 367-391 1-23 (25)
46 PF09237 GAGA: GAGA factor; I 93.4 0.054 1.2E-06 38.7 1.9 33 334-366 20-52 (54)
47 PF12874 zf-met: Zinc-finger o 93.4 0.042 9.1E-07 33.1 1.2 23 339-361 1-23 (25)
48 PRK04860 hypothetical protein; 92.4 0.082 1.8E-06 47.9 2.2 29 219-254 118-146 (160)
49 PF12171 zf-C2H2_jaz: Zinc-fin 91.3 0.059 1.3E-06 33.3 0.0 22 367-390 2-23 (27)
50 PF12171 zf-C2H2_jaz: Zinc-fin 91.3 0.1 2.3E-06 32.2 1.1 21 221-241 2-22 (27)
51 COG5236 Uncharacterized conser 90.5 0.14 3E-06 50.9 1.7 121 252-392 152-275 (493)
52 PF13909 zf-H2C2_5: C2H2-type 90.0 0.14 3.1E-06 30.5 0.8 23 339-362 1-23 (24)
53 KOG4377|consensus 87.9 0.37 8.1E-06 49.1 2.6 134 220-393 271-428 (480)
54 KOG2231|consensus 86.7 0.34 7.3E-06 53.1 1.7 81 309-394 123-208 (669)
55 KOG1146|consensus 86.4 0.41 8.9E-06 55.6 2.3 50 222-278 438-487 (1406)
56 COG5236 Uncharacterized conser 86.2 0.45 9.7E-06 47.4 2.1 133 221-394 152-307 (493)
57 PF13913 zf-C2HC_2: zinc-finge 84.5 0.78 1.7E-05 27.9 1.9 21 367-390 3-23 (25)
58 smart00451 ZnF_U1 U1-like zinc 83.7 1 2.2E-05 29.3 2.3 22 220-241 3-24 (35)
59 smart00451 ZnF_U1 U1-like zinc 83.1 0.81 1.8E-05 29.8 1.7 23 366-390 3-25 (35)
60 KOG4173|consensus 81.6 0.5 1.1E-05 43.8 0.3 54 340-393 108-171 (253)
61 PF13913 zf-C2HC_2: zinc-finge 81.5 1.1 2.4E-05 27.2 1.7 21 339-360 3-23 (25)
62 KOG2482|consensus 80.1 1.6 3.4E-05 43.7 3.1 58 335-394 192-305 (423)
63 KOG2893|consensus 77.6 0.72 1.6E-05 43.7 -0.1 47 341-393 13-59 (341)
64 KOG1146|consensus 74.4 1.7 3.7E-05 50.7 1.8 58 335-395 1281-1355(1406)
65 KOG2893|consensus 74.2 1.1 2.3E-05 42.5 0.2 49 219-280 10-59 (341)
66 KOG2482|consensus 71.9 3.4 7.3E-05 41.4 3.0 50 339-390 280-356 (423)
67 COG4049 Uncharacterized protei 69.7 1.8 3.9E-05 31.5 0.4 26 217-242 14-39 (65)
68 KOG4173|consensus 69.4 1.2 2.6E-05 41.3 -0.6 94 334-433 75-181 (253)
69 cd00350 rubredoxin_like Rubred 66.4 3.7 7.9E-05 26.7 1.3 24 339-376 2-25 (33)
70 COG4049 Uncharacterized protei 63.4 3.6 7.8E-05 30.0 0.9 32 362-395 13-44 (65)
71 PF09986 DUF2225: Uncharacteri 60.1 2.6 5.7E-05 40.1 -0.3 24 218-241 3-26 (214)
72 PF09538 FYDLN_acid: Protein o 56.5 6.6 0.00014 33.1 1.5 30 339-381 10-39 (108)
73 cd00729 rubredoxin_SM Rubredox 55.7 7 0.00015 25.6 1.2 25 338-376 2-26 (34)
74 PF02892 zf-BED: BED zinc fing 55.1 10 0.00022 26.1 2.1 26 364-391 14-43 (45)
75 PRK09678 DNA-binding transcrip 53.7 3.7 8.1E-05 31.9 -0.4 42 339-382 2-43 (72)
76 PRK00464 nrdR transcriptional 53.6 5.3 0.00012 35.9 0.6 16 367-384 29-44 (154)
77 KOG2186|consensus 52.9 7.1 0.00015 37.6 1.3 57 338-399 3-59 (276)
78 PF12013 DUF3505: Protein of u 49.1 12 0.00026 31.3 2.0 25 367-393 81-109 (109)
79 PRK00464 nrdR transcriptional 49.1 4.3 9.4E-05 36.5 -0.8 12 339-350 29-40 (154)
80 COG2331 Uncharacterized protei 46.7 6.6 0.00014 30.6 0.1 34 219-267 11-44 (82)
81 PF09986 DUF2225: Uncharacteri 45.6 5.8 0.00013 37.7 -0.5 44 336-381 3-61 (214)
82 smart00614 ZnF_BED BED zinc fi 44.3 18 0.00039 25.7 2.0 22 367-390 19-45 (50)
83 TIGR02300 FYDLN_acid conserved 43.5 15 0.00032 31.8 1.6 34 338-384 9-42 (129)
84 PF05443 ROS_MUCR: ROS/MUCR tr 43.2 13 0.00029 32.5 1.4 26 365-395 71-96 (132)
85 COG1592 Rubrerythrin [Energy p 42.2 14 0.0003 33.7 1.4 23 338-375 134-156 (166)
86 TIGR00622 ssl1 transcription f 41.7 30 0.00065 29.3 3.2 22 336-357 13-34 (112)
87 PF06524 NOA36: NOA36 protein; 39.5 23 0.0005 34.3 2.5 101 279-427 137-237 (314)
88 PRK00398 rpoP DNA-directed RNA 38.8 17 0.00038 25.3 1.2 30 338-379 3-32 (46)
89 COG1592 Rubrerythrin [Energy p 38.2 16 0.00035 33.3 1.2 13 334-346 145-157 (166)
90 COG1997 RPL43A Ribosomal prote 37.7 16 0.00035 29.3 1.0 34 336-381 33-66 (89)
91 PF07800 DUF1644: Protein of u 37.7 22 0.00048 31.9 1.9 57 336-394 78-134 (162)
92 PF04959 ARS2: Arsenite-resist 36.9 14 0.0003 35.2 0.6 31 363-395 74-104 (214)
93 COG2888 Predicted Zn-ribbon RN 34.0 25 0.00054 26.2 1.3 33 337-376 26-58 (61)
94 KOG2785|consensus 33.1 31 0.00067 35.4 2.3 50 339-390 4-90 (390)
95 PTZ00255 60S ribosomal protein 33.0 17 0.00036 29.6 0.4 34 335-380 33-66 (90)
96 TIGR00373 conserved hypothetic 32.8 47 0.001 30.0 3.3 43 360-413 103-145 (158)
97 PF09538 FYDLN_acid: Protein o 32.7 37 0.0008 28.6 2.4 10 221-230 10-19 (108)
98 COG4957 Predicted transcriptio 32.1 25 0.00054 30.7 1.3 25 367-396 77-101 (148)
99 KOG4167|consensus 31.6 41 0.0009 37.3 3.1 27 218-247 790-816 (907)
100 TIGR00280 L37a ribosomal prote 30.4 17 0.00037 29.5 0.1 34 335-380 32-65 (91)
101 KOG4167|consensus 30.0 15 0.00033 40.6 -0.5 26 366-393 792-817 (907)
102 smart00531 TFIIE Transcription 29.8 23 0.00049 31.5 0.7 13 338-350 99-111 (147)
103 TIGR02098 MJ0042_CXXC MJ0042 f 29.5 27 0.00058 23.1 0.9 34 339-379 3-36 (38)
104 PF01780 Ribosomal_L37ae: Ribo 29.0 16 0.00035 29.7 -0.3 34 335-380 32-65 (90)
105 PF04959 ARS2: Arsenite-resist 28.6 29 0.00064 33.0 1.3 25 218-242 75-99 (214)
106 COG4530 Uncharacterized protei 28.0 39 0.00085 28.3 1.7 33 339-385 10-42 (129)
107 PF02176 zf-TRAF: TRAF-type zi 26.3 35 0.00076 24.8 1.1 42 338-379 9-53 (60)
108 PRK03976 rpl37ae 50S ribosomal 25.9 22 0.00047 28.9 -0.1 34 335-380 33-66 (90)
109 PRK06266 transcription initiat 25.6 68 0.0015 29.6 3.1 38 363-411 114-151 (178)
110 PHA00626 hypothetical protein 25.5 31 0.00067 25.3 0.6 15 365-381 22-36 (59)
111 COG1198 PriA Primosomal protei 25.5 47 0.001 37.5 2.4 38 285-375 445-482 (730)
112 PF10571 UPF0547: Uncharacteri 25.4 40 0.00086 20.7 1.0 11 367-379 15-25 (26)
113 PRK04023 DNA polymerase II lar 24.6 65 0.0014 37.4 3.2 11 366-378 663-673 (1121)
114 KOG4124|consensus 23.8 20 0.00043 36.2 -0.8 25 217-242 210-234 (442)
115 KOG2461|consensus 23.7 90 0.0019 32.7 3.9 46 352-399 317-362 (396)
116 smart00154 ZnF_AN1 AN1-like Zi 23.1 48 0.001 22.4 1.2 12 366-379 12-23 (39)
117 PF13719 zinc_ribbon_5: zinc-r 22.7 46 0.001 22.1 1.0 34 339-379 3-36 (37)
118 PRK06266 transcription initiat 22.1 39 0.00085 31.1 0.8 34 336-380 115-148 (178)
119 PF04606 Ogr_Delta: Ogr/Delta- 22.0 21 0.00046 25.1 -0.8 39 340-380 1-39 (47)
120 TIGR00373 conserved hypothetic 22.0 41 0.00088 30.4 0.9 34 335-379 106-139 (158)
121 PF05443 ROS_MUCR: ROS/MUCR tr 21.8 54 0.0012 28.7 1.5 29 335-366 69-97 (132)
122 COG0447 MenB Dihydroxynaphthoi 20.9 40 0.00087 32.1 0.6 20 79-99 106-125 (282)
123 PF12907 zf-met2: Zinc-binding 20.5 62 0.0014 22.1 1.3 26 367-394 2-30 (40)
124 smart00734 ZnF_Rad18 Rad18-lik 20.5 77 0.0017 19.3 1.6 19 368-389 3-21 (26)
125 COG1327 Predicted transcriptio 20.2 27 0.00059 31.1 -0.6 16 365-382 27-42 (156)
126 PRK03564 formate dehydrogenase 20.1 38 0.00083 34.1 0.3 13 279-291 207-219 (309)
No 1
>KOG2462|consensus
Probab=99.89 E-value=7.9e-24 Score=199.79 Aligned_cols=135 Identities=24% Similarity=0.303 Sum_probs=118.4
Q ss_pred ccceecCCCCCCCCCHHHHHHhhhcccccC---CCCcccccCCCCCCCCCcCCChHHHHHHhhhhcCCCCccCCCCCCCC
Q psy16294 218 ETNCHWKDCGLEFGTQDILVKHINNDHIHA---NKKSFICRWEDCTRDEKPFKAQYMLVVHMRRHTGEKPHKCTDLEAFD 294 (486)
Q Consensus 218 ~~~~~C~~C~k~F~~~~~L~~H~~~~~~H~---~ek~~~C~~~~C~~C~k~F~~~~~L~~H~r~Htgekpy~C~~C~c~~ 294 (486)
...|+|.+|||.|.+..+|.+|.. +|- ..+.+.| .+|+|.|..-..|..|+|+|+ -+++|.+||
T Consensus 128 ~~r~~c~eCgk~ysT~snLsrHkQ---~H~~~~s~ka~~C-----~~C~K~YvSmpALkMHirTH~--l~c~C~iCG--- 194 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSNLSRHKQ---THRSLDSKKAFSC-----KYCGKVYVSMPALKMHIRTHT--LPCECGICG--- 194 (279)
T ss_pred CCceeccccccccccccccchhhc---ccccccccccccC-----CCCCceeeehHHHhhHhhccC--CCccccccc---
Confidence 457999999999999999999998 564 4566777 555999999999999999998 689999988
Q ss_pred cchhhhhceecCccccCCccccccccccccchhhhhhhhccccccccccccccccChhhHHHHHhhcCCCCccccCcccc
Q psy16294 295 TSSRVAARRIRGWKISRRISDLIPVRNRTRVNILAVVKRLVMRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENC 374 (486)
Q Consensus 295 ~~~~~~~~~H~~ek~F~~~~~L~~H~~~~H~~~~~~~~~~~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gC 374 (486)
|.|....-|+.|+ |+| .+||||.|+.|+|.|..+++|+.||++|.+.|+|+|.. |
T Consensus 195 -------------KaFSRPWLLQGHi-RTH---------TGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~--C 249 (279)
T KOG2462|consen 195 -------------KAFSRPWLLQGHI-RTH---------TGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPR--C 249 (279)
T ss_pred -------------ccccchHHhhccc-ccc---------cCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcc--h
Confidence 5777888888888 888 88999999999999999999999999999999999988 9
Q ss_pred cccccChhhHHHHHHh
Q psy16294 375 TKRYTDPSSLRKHIKT 390 (486)
Q Consensus 375 gK~F~~~s~L~~H~k~ 390 (486)
+|+|..++.|.+|...
T Consensus 250 ~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 250 GKSFALKSYLNKHSES 265 (279)
T ss_pred hhHHHHHHHHHHhhhh
Confidence 9999999999998865
No 2
>KOG2462|consensus
Probab=99.88 E-value=2e-23 Score=197.03 Aligned_cols=126 Identities=21% Similarity=0.245 Sum_probs=113.7
Q ss_pred CcccccCCCCCCCCCcCCChHHHHHHhhhhcC---CCCccCCCCCCCCcchhhhhceecCccccCCccccccccccccch
Q psy16294 250 KSFICRWEDCTRDEKPFKAQYMLVVHMRRHTG---EKPHKCTDLEAFDTSSRVAARRIRGWKISRRISDLIPVRNRTRVN 326 (486)
Q Consensus 250 k~~~C~~~~C~~C~k~F~~~~~L~~H~r~Htg---ekpy~C~~C~c~~~~~~~~~~~H~~ek~F~~~~~L~~H~~~~H~~ 326 (486)
..|+| +.|||.+.+..+|.+|..+|.. .+-|.|++|+ |.|.....|+.|+ ++|
T Consensus 129 ~r~~c-----~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~----------------K~YvSmpALkMHi-rTH-- 184 (279)
T KOG2462|consen 129 PRYKC-----PECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCG----------------KVYVSMPALKMHI-RTH-- 184 (279)
T ss_pred Cceec-----cccccccccccccchhhcccccccccccccCCCCC----------------ceeeehHHHhhHh-hcc--
Confidence 34666 5559999999999999999963 5779999998 5788888999999 888
Q ss_pred hhhhhhhccccccccccccccccChhhHHHHHhhcCCCCccccCcccccccccChhhHHHHHHhhcCCCCCCCCCCCCcc
Q psy16294 327 ILAVVKRLVMRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDPSSLRKHIKTVHAPASLNPVPQHKDM 406 (486)
Q Consensus 327 ~~~~~~~~~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~~~~~~c~~c~~~~ 406 (486)
.-+++|.+|||.|.+.+.|+-|+|+|+|||||.|.. |+|+|.++++|+.||+||.+.|.|+|..|.+.|
T Consensus 185 ---------~l~c~C~iCGKaFSRPWLLQGHiRTHTGEKPF~C~h--C~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsF 253 (279)
T KOG2462|consen 185 ---------TLPCECGICGKAFSRPWLLQGHIRTHTGEKPFSCPH--CGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSF 253 (279)
T ss_pred ---------CCCcccccccccccchHHhhcccccccCCCCccCCc--ccchhcchHHHHHHHHhhcCCccccCcchhhHH
Confidence 468999999999999999999999999999999999 999999999999999999999999999999987
Q ss_pred hhch
Q psy16294 407 QDIQ 410 (486)
Q Consensus 407 ~~~~ 410 (486)
....
T Consensus 254 sl~S 257 (279)
T KOG2462|consen 254 ALKS 257 (279)
T ss_pred HHHH
Confidence 6443
No 3
>KOG3608|consensus
Probab=99.74 E-value=4.2e-19 Score=171.47 Aligned_cols=237 Identities=15% Similarity=0.213 Sum_probs=163.9
Q ss_pred CCCCCCCccccccccccCCCCCCCC-CCCCCCCccccceecCCCCCCCCCHHHHHHhhhcccccCCCCcccccCCCCCCC
Q psy16294 184 GGGASWKPMKIKRETKQHSPESPAG-DKYEPGDFVETNCHWKDCGLEFGTQDILVKHINNDHIHANKKSFICRWEDCTRD 262 (486)
Q Consensus 184 ~~~~~~~~~~l~~~~~~hs~~~~~~-~~~~~~~~~~~~~~C~~C~k~F~~~~~L~~H~~~~~~H~~ek~~~C~~~~C~~C 262 (486)
|+..|.....+-+|...|.---.++ ............|.|..|.+.|.++..|++|++ .|++||...| +.|
T Consensus 142 Ce~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r---~Hs~eKvvAC-----p~C 213 (467)
T KOG3608|consen 142 CEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIR---THSNEKVVAC-----PHC 213 (467)
T ss_pred cCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHH---hcCCCeEEec-----chH
Confidence 5566777777777776665431111 111122224568999999999999999999999 9999999888 444
Q ss_pred CCcCCChHHHHHHhhhhc--CCCCccCCCCCCCCcchhhhh---ceecC-cc------ccCCccccccccccccchhhhh
Q psy16294 263 EKPFKAQYMLVVHMRRHT--GEKPHKCTDLEAFDTSSRVAA---RRIRG-WK------ISRRISDLIPVRNRTRVNILAV 330 (486)
Q Consensus 263 ~k~F~~~~~L~~H~r~Ht--gekpy~C~~C~c~~~~~~~~~---~~H~~-ek------~F~~~~~L~~H~~~~H~~~~~~ 330 (486)
|.-|.++..|..|.+..+ ...+|.|..|-++|....++. ..|.. -| .....+.|..|++..|
T Consensus 214 g~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn~ykCplCdmtc~~~ssL~~H~r~rH------ 287 (467)
T KOG3608|consen 214 GELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVNCYKCPLCDMTCSSASSLTTHIRYRH------ 287 (467)
T ss_pred HHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCCChHHHHHHHHhhh------
Confidence 999999999999988654 346899987654444444433 11111 11 4556677777876666
Q ss_pred hhhccccccccccccccccChhhHHHHHhhcCCCCccccCcccccccccChhhHHHHHHhhc-C--CCCCCCCCCCCcch
Q psy16294 331 VKRLVMRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDPSSLRKHIKTVH-A--PASLNPVPQHKDMQ 407 (486)
Q Consensus 331 ~~~~~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh-~--~~~~~c~~c~~~~~ 407 (486)
...++|+|+.|.+.|.+.+.|.+|..+|+ +--|.|+.+.|...|.+...|++|++.|| | +.+|.|-.|.+.|+
T Consensus 288 ---s~dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft 363 (467)
T KOG3608|consen 288 ---SKDKPFKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFT 363 (467)
T ss_pred ---ccCCCccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhc
Confidence 55788888888888888888888888887 56688888888888888888888888887 3 35678877777665
Q ss_pred hchhhhhhcccccccc-----ccccchhhhccccccCC
Q psy16294 408 DIQSNEVTQSNMRQST-----YQRTLEYVEQCQNWVSS 440 (486)
Q Consensus 408 ~~~~~~~~~~h~~~h~-----~~~~~~c~~c~~~f~~s 440 (486)
.-.+ +..+.++.|. +-+.|.+..|..+|-.-
T Consensus 364 ~G~~--L~~HL~kkH~f~~PsGh~RFtYk~~edG~mRL 399 (467)
T KOG3608|consen 364 SGKS--LSAHLMKKHGFRLPSGHKRFTYKVDEDGFMRL 399 (467)
T ss_pred cchh--HHHHHHHhhcccCCCCCCceeeeeccCceeee
Confidence 4444 3333444444 33446677776666433
No 4
>KOG1074|consensus
Probab=99.73 E-value=9.2e-18 Score=177.71 Aligned_cols=85 Identities=12% Similarity=0.187 Sum_probs=70.8
Q ss_pred cccccccccccccChhhHHHHHhhcCCCCccccCcccccccccChhhHHHHHHhhcCCCC----CCCCCCCCcchhchhh
Q psy16294 337 RRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDPSSLRKHIKTVHAPAS----LNPVPQHKDMQDIQSN 412 (486)
Q Consensus 337 ~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~~~~----~~c~~c~~~~~~~~~~ 412 (486)
.+-+|-+|-++...++.|+.|+|+|+|||||+|++ ||++|+++.+|+.||-+|...-+ +.|....-+.+.+.+.
T Consensus 604 dPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKi--CgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~ 681 (958)
T KOG1074|consen 604 DPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKI--CGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNA 681 (958)
T ss_pred CccceeeeeecccchhhhhhhhhcccCcCcccccc--ccchhccccchhhcccccccCccccccccCCchhhhccccccc
Confidence 45789999999999999999999999999999999 99999999999999998876544 4555222455566666
Q ss_pred hhhcccccccc
Q psy16294 413 EVTQSNMRQST 423 (486)
Q Consensus 413 ~~~~~h~~~h~ 423 (486)
..++.|++.|.
T Consensus 682 V~lpQhIriH~ 692 (958)
T KOG1074|consen 682 VTLPQHIRIHL 692 (958)
T ss_pred ccccceEEeec
Confidence 66778888887
No 5
>KOG3623|consensus
Probab=99.73 E-value=6.9e-19 Score=183.01 Aligned_cols=72 Identities=11% Similarity=0.104 Sum_probs=67.5
Q ss_pred ccccccccccccccChhhHHHHHhhcCCCCccccCcccccccccChhhHHHHHHhhcCCCCCCCCCCCCcchhc
Q psy16294 336 MRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDPSSLRKHIKTVHAPASLNPVPQHKDMQDI 409 (486)
Q Consensus 336 ~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~~~~~~c~~c~~~~~~~ 409 (486)
...|.|+.|+|+|...+.|-+|..-|+|.+||+|.+ |.|+|+.+..|..|+|.|.|||||.|..|.+.|.-.
T Consensus 892 ~gmyaCDqCDK~FqKqSSLaRHKYEHsGqRPyqC~i--CkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHS 963 (1007)
T KOG3623|consen 892 DGMYACDQCDKAFQKQSSLARHKYEHSGQRPYQCII--CKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHS 963 (1007)
T ss_pred cccchHHHHHHHHHhhHHHHHhhhhhcCCCCcccch--hhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccc
Confidence 446999999999999999999999999999999999 999999999999999999999999999998887543
No 6
>KOG3608|consensus
Probab=99.70 E-value=2.5e-18 Score=166.18 Aligned_cols=172 Identities=19% Similarity=0.247 Sum_probs=129.6
Q ss_pred CCCCCCccccccccccCCCCCCCCCCCCCCC----------------ccccceecCCCCCCCCCHHHHHHhhhcccccCC
Q psy16294 185 GGASWKPMKIKRETKQHSPESPAGDKYEPGD----------------FVETNCHWKDCGLEFGTQDILVKHINNDHIHAN 248 (486)
Q Consensus 185 ~~~~~~~~~l~~~~~~hs~~~~~~~~~~~~~----------------~~~~~~~C~~C~k~F~~~~~L~~H~~~~~~H~~ 248 (486)
...+..+.+|.+|.+.|+.++...+--..+. ....+|.|..|.|.|.+...|+.|++ .|-
T Consensus 186 t~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~---rHv- 261 (467)
T KOG3608|consen 186 TKHMGNKYRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVV---RHV- 261 (467)
T ss_pred hhhhccHHHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHH---Hhh-
Confidence 3455677899999999998664322111111 11245677777777777777777765 332
Q ss_pred CCcccccCCCCCCCCCcCCChHHHHHHhh-hhcCCCCccCCCCCCCCcchhhhhceecCccccCCccccccccccccchh
Q psy16294 249 KKSFICRWEDCTRDEKPFKAQYMLVVHMR-RHTGEKPHKCTDLEAFDTSSRVAARRIRGWKISRRISDLIPVRNRTRVNI 327 (486)
Q Consensus 249 ek~~~C~~~~C~~C~k~F~~~~~L~~H~r-~Htgekpy~C~~C~c~~~~~~~~~~~H~~ek~F~~~~~L~~H~~~~H~~~ 327 (486)
.-|+| +.|..+....+.|.+||+ .|...|||+|+.|. +.|.+.++|.+|. .+|
T Consensus 262 -n~ykC-----plCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd----------------~~c~~esdL~kH~-~~H--- 315 (467)
T KOG3608|consen 262 -NCYKC-----PLCDMTCSSASSLTTHIRYRHSKDKPFKCDECD----------------TRCVRESDLAKHV-QVH--- 315 (467)
T ss_pred -hcccc-----cccccCCCChHHHHHHHHhhhccCCCccccchh----------------hhhccHHHHHHHH-Hhc---
Confidence 23444 555888888888999988 68888999999886 4778889999999 577
Q ss_pred hhhhhhcccccccccc--ccccccChhhHHHHHhhcC-C--CCccccCcccccccccChhhHHHHHHhhcCCC
Q psy16294 328 LAVVKRLVMRRIELNI--RTELTLMSLNVQFEILSRP-K--LKPYVCGEENCTKRYTDPSSLRKHIKTVHAPA 395 (486)
Q Consensus 328 ~~~~~~~~~~~~~C~~--C~k~F~~~~~L~~H~~~H~-~--ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~~~ 395 (486)
.+-.|.|.. |...|.+...+++|++.+. | +-+|.|.. |++.|++-.+|..|+...|+-+
T Consensus 316 -------S~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~--Cdr~ft~G~~L~~HL~kkH~f~ 379 (467)
T KOG3608|consen 316 -------SKTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHC--CDRFFTSGKSLSAHLMKKHGFR 379 (467)
T ss_pred -------cccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeec--chhhhccchhHHHHHHHhhccc
Confidence 366789988 9999999999999998744 5 45699999 9999999999999998888754
No 7
>KOG1074|consensus
Probab=99.64 E-value=1.5e-16 Score=168.58 Aligned_cols=57 Identities=18% Similarity=0.253 Sum_probs=54.2
Q ss_pred cccccccccccChhhHHHHHhhcCCCCccccCcccccccccChhhHHHHHHhhcCCCCC
Q psy16294 339 IELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDPSSLRKHIKTVHAPASL 397 (486)
Q Consensus 339 ~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~~~~~ 397 (486)
+.|.+|++.|.....|+.||++|+++|||.|.+ |++.|..+.+|+.||.+|+...++
T Consensus 880 h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~f--C~~aFttrgnLKvHMgtH~w~q~~ 936 (958)
T KOG1074|consen 880 HVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHF--CEEAFTTRGNLKVHMGTHMWVQPP 936 (958)
T ss_pred hhhccchhcccchHHHHHhhhcCCCCCCccchh--hhhhhhhhhhhhhhhccccccCCC
Confidence 679999999999999999999999999999999 999999999999999999987665
No 8
>KOG3576|consensus
Probab=99.55 E-value=4.3e-15 Score=133.97 Aligned_cols=127 Identities=26% Similarity=0.417 Sum_probs=105.4
Q ss_pred ccceecCCCCCCCCCHHHHHHhhhcccccCCCCcccccCCCCCCCCCcCCChHHHHHHhhhhcCCCCccCCCCCCCCcch
Q psy16294 218 ETNCHWKDCGLEFGTQDILVKHINNDHIHANKKSFICRWEDCTRDEKPFKAQYMLVVHMRRHTGEKPHKCTDLEAFDTSS 297 (486)
Q Consensus 218 ~~~~~C~~C~k~F~~~~~L~~H~~~~~~H~~ek~~~C~~~~C~~C~k~F~~~~~L~~H~r~Htgekpy~C~~C~c~~~~~ 297 (486)
...|.|..|+|.|.-+.-|.+|++ .|...+.|.| .. |||.|.....|++|+|+|+|.+||+|..|+
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~k---ch~~vkr~lc--t~---cgkgfndtfdlkrh~rthtgvrpykc~~c~------ 180 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLK---CHSDVKRHLC--TF---CGKGFNDTFDLKRHTRTHTGVRPYKCSLCE------ 180 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhh---hccHHHHHHH--hh---ccCcccchhhhhhhhccccCccccchhhhh------
Confidence 346999999999999999999999 9999999988 44 499999999999999999999999999987
Q ss_pred hhhhceecCccccCCccccccccccccchhhhhhhhccccccccccccccccChhhHHHHHhhcCCCCccccCccccccc
Q psy16294 298 RVAARRIRGWKISRRISDLIPVRNRTRVNILAVVKRLVMRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKR 377 (486)
Q Consensus 298 ~~~~~~H~~ek~F~~~~~L~~H~~~~H~~~~~~~~~~~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~ 377 (486)
|.|.....|..|.+++|.....+.-+ -..+|-|.|+. ||.+
T Consensus 181 ----------kaftqrcsleshl~kvhgv~~~yayk---------------------------err~kl~vced--cg~t 221 (267)
T KOG3576|consen 181 ----------KAFTQRCSLESHLKKVHGVQHQYAYK---------------------------ERRAKLYVCED--CGYT 221 (267)
T ss_pred ----------HHHHhhccHHHHHHHHcCchHHHHHH---------------------------Hhhhheeeecc--cCCC
Confidence 68888899999998888322211100 02357899988 9999
Q ss_pred ccChhhHHHHHHhhcCCCCC
Q psy16294 378 YTDPSSLRKHIKTVHAPASL 397 (486)
Q Consensus 378 F~~~s~L~~H~k~hh~~~~~ 397 (486)
-.....+..|++.||...++
T Consensus 222 ~~~~e~~~~h~~~~hp~Spa 241 (267)
T KOG3576|consen 222 SERPEVYYLHLKLHHPFSPA 241 (267)
T ss_pred CCChhHHHHHHHhcCCCCHH
Confidence 99999999999998876543
No 9
>KOG3623|consensus
Probab=99.37 E-value=1.4e-13 Score=143.90 Aligned_cols=121 Identities=18% Similarity=0.308 Sum_probs=98.2
Q ss_pred ceecCCCCCCCCCHHHHHHhhhcccccC-CCCcccccCCCCCCCCCcCCChHHHHHHhhhhcCCCCccCCCCCCCCcchh
Q psy16294 220 NCHWKDCGLEFGTQDILVKHINNDHIHA-NKKSFICRWEDCTRDEKPFKAQYMLVVHMRRHTGEKPHKCTDLEAFDTSSR 298 (486)
Q Consensus 220 ~~~C~~C~k~F~~~~~L~~H~~~~~~H~-~ek~~~C~~~~C~~C~k~F~~~~~L~~H~r~Htgekpy~C~~C~c~~~~~~ 298 (486)
...|..|++.|.....|+.|++. .|. .+..|.| ..|..+|..+..|.+||.+|.--.+-
T Consensus 210 lltcpycdrgykrltslkeHiky--rhekne~nfsC-----~lCsytFAyRtQLErhm~~hkpg~dq------------- 269 (1007)
T KOG3623|consen 210 LLTCPYCDRGYKRLTSLKEHIKY--RHEKNEPNFSC-----MLCSYTFAYRTQLERHMQLHKPGGDQ------------- 269 (1007)
T ss_pred hhcchhHHHHHHHHHHHHHHHHH--HHhhCCCCCcc-----hhhhhhhhhHHHHHHHHHhhcCCCcc-------------
Confidence 46799999999999999999984 343 3455777 44599999999999999999632110
Q ss_pred hhhceecCccccCCccccccccccccchhhhhhhhccccccccccccccccChhhHHHHHhhcCCCCccccCcccccccc
Q psy16294 299 VAARRIRGWKISRRISDLIPVRNRTRVNILAVVKRLVMRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRY 378 (486)
Q Consensus 299 ~~~~~H~~ek~F~~~~~L~~H~~~~H~~~~~~~~~~~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F 378 (486)
| .+ .......+.|+|..|+|.|+.+.+|+.|+|+|.|||||.|+. |+|+|
T Consensus 270 --------------------a---~s-----ltqsa~lRKFKCtECgKAFKfKHHLKEHlRIHSGEKPfeCpn--CkKRF 319 (1007)
T KOG3623|consen 270 --------------------A---IS-----LTQSALLRKFKCTECGKAFKFKHHLKEHLRIHSGEKPFECPN--CKKRF 319 (1007)
T ss_pred --------------------c---cc-----ccchhhhccccccccchhhhhHHHHHhhheeecCCCCcCCcc--ccccc
Confidence 0 00 012246788999999999999999999999999999999998 99999
Q ss_pred cChhhHHHHHHh
Q psy16294 379 TDPSSLRKHIKT 390 (486)
Q Consensus 379 ~~~s~L~~H~k~ 390 (486)
....++-.||-.
T Consensus 320 SHSGSySSHmSS 331 (1007)
T KOG3623|consen 320 SHSGSYSSHMSS 331 (1007)
T ss_pred ccCCcccccccc
Confidence 999999999865
No 10
>KOG3576|consensus
Probab=99.36 E-value=1.4e-13 Score=124.18 Aligned_cols=86 Identities=29% Similarity=0.508 Sum_probs=72.7
Q ss_pred CCCcccccCCCCCCCCCcCCChHHHHHHhhhhcCCCCccCCCCCCCCcchhhhhceecCccccCCccccccccccccchh
Q psy16294 248 NKKSFICRWEDCTRDEKPFKAQYMLVVHMRRHTGEKPHKCTDLEAFDTSSRVAARRIRGWKISRRISDLIPVRNRTRVNI 327 (486)
Q Consensus 248 ~ek~~~C~~~~C~~C~k~F~~~~~L~~H~r~Htgekpy~C~~C~c~~~~~~~~~~~H~~ek~F~~~~~L~~H~~~~H~~~ 327 (486)
+...|.| ..| +|.|.-+..|.+||+.|...|.|
T Consensus 114 d~d~ftC--rvC---gK~F~lQRmlnrh~kch~~vkr~------------------------------------------ 146 (267)
T KOG3576|consen 114 DQDSFTC--RVC---GKKFGLQRMLNRHLKCHSDVKRH------------------------------------------ 146 (267)
T ss_pred CCCeeee--ehh---hhhhhHHHHHHHHhhhccHHHHH------------------------------------------
Confidence 3456777 445 99999999999999888654444
Q ss_pred hhhhhhccccccccccccccccChhhHHHHHhhcCCCCccccCcccccccccChhhHHHHHHhhcCC
Q psy16294 328 LAVVKRLVMRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDPSSLRKHIKTVHAP 394 (486)
Q Consensus 328 ~~~~~~~~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~~ 394 (486)
.|.+|||.|.....|++|+|+|+|.+||+|.. |+|.|++.-+|..|.+..||.
T Consensus 147 ------------lct~cgkgfndtfdlkrh~rthtgvrpykc~~--c~kaftqrcsleshl~kvhgv 199 (267)
T KOG3576|consen 147 ------------LCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSL--CEKAFTQRCSLESHLKKVHGV 199 (267)
T ss_pred ------------HHhhccCcccchhhhhhhhccccCccccchhh--hhHHHHhhccHHHHHHHHcCc
Confidence 45567778888889999999999999999999 999999999999999998874
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.87 E-value=5e-09 Score=111.12 Aligned_cols=135 Identities=7% Similarity=0.044 Sum_probs=85.4
Q ss_pred ceecCCCCCCCCCHHHHHHhhhcccccCCCCcccccCCCCCCCCCcCCChHHHHHHhhhhcCCCCccCCCCCCCCcchhh
Q psy16294 220 NCHWKDCGLEFGTQDILVKHINNDHIHANKKSFICRWEDCTRDEKPFKAQYMLVVHMRRHTGEKPHKCTDLEAFDTSSRV 299 (486)
Q Consensus 220 ~~~C~~C~k~F~~~~~L~~H~~~~~~H~~ek~~~C~~~~C~~C~k~F~~~~~L~~H~r~Htgekpy~C~~C~c~~~~~~~ 299 (486)
.-.|..|.+... ...|..|.. +-....-.| .. ..|+..|. +..+.. .+.|++|+.
T Consensus 407 ~V~C~NC~~~i~-l~~l~lHe~----~C~r~~V~C--p~-~~Cg~v~~-r~el~~---------H~~C~~Cgk------- 461 (567)
T PLN03086 407 TVECRNCKHYIP-SRSIALHEA----YCSRHNVVC--PH-DGCGIVLR-VEEAKN---------HVHCEKCGQ------- 461 (567)
T ss_pred eEECCCCCCccc-hhHHHHHHh----hCCCcceeC--Cc-ccccceee-cccccc---------CccCCCCCC-------
Confidence 346888987754 556778874 333333344 21 02577663 223333 357887763
Q ss_pred hhceecCccccCCccccccccccccchhhhhhhhccccccccccccccccChhhHHHHHhhcCCCCccccCccccccccc
Q psy16294 300 AARRIRGWKISRRISDLIPVRNRTRVNILAVVKRLVMRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYT 379 (486)
Q Consensus 300 ~~~~H~~ek~F~~~~~L~~H~~~~H~~~~~~~~~~~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~ 379 (486)
.|. ...|.+|+ ..| .+++.|+ |++.+ .+..|..|+.+|..++++.|.. |++.|.
T Consensus 462 ---------~f~-~s~LekH~-~~~-----------Hkpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~f--C~~~v~ 515 (567)
T PLN03086 462 ---------AFQ-QGEMEKHM-KVF-----------HEPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRF--CGDMVQ 515 (567)
T ss_pred ---------ccc-hHHHHHHH-Hhc-----------CCCccCC-CCCCc-chhHHHhhhhccCCCCceeCCC--CCCccc
Confidence 332 34566666 333 1567787 88644 5678888888888888888887 888884
Q ss_pred ----------ChhhHHHHHHhhcCCCCCCCCCCCCcc
Q psy16294 380 ----------DPSSLRKHIKTVHAPASLNPVPQHKDM 406 (486)
Q Consensus 380 ----------~~s~L~~H~k~hh~~~~~~c~~c~~~~ 406 (486)
..+.|..|..+ .|.+++.|..|++.+
T Consensus 516 ~g~~~~d~~d~~s~Lt~HE~~-CG~rt~~C~~Cgk~V 551 (567)
T PLN03086 516 AGGSAMDVRDRLRGMSEHESI-CGSRTAPCDSCGRSV 551 (567)
T ss_pred cCccccchhhhhhhHHHHHHh-cCCcceEccccCCee
Confidence 23578888877 478888887776654
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.70 E-value=3.1e-08 Score=105.21 Aligned_cols=137 Identities=7% Similarity=-0.008 Sum_probs=96.4
Q ss_pred CCCCCCcCCChHHHHHHhhhhcCCCCccCCCCCCCCcchhhhhceecCccccCCccccccccccccchhhhhhhhccccc
Q psy16294 259 CTRDEKPFKAQYMLVVHMRRHTGEKPHKCTDLEAFDTSSRVAARRIRGWKISRRISDLIPVRNRTRVNILAVVKRLVMRR 338 (486)
Q Consensus 259 C~~C~k~F~~~~~L~~H~r~Htgekpy~C~~C~c~~~~~~~~~~~H~~ek~F~~~~~L~~H~~~~H~~~~~~~~~~~~~~ 338 (486)
|..|.+.. ....|..|.... ....-.|+.++|+..|. +.++ ++.
T Consensus 410 C~NC~~~i-~l~~l~lHe~~C-~r~~V~Cp~~~Cg~v~~---------------r~el-------------------~~H 453 (567)
T PLN03086 410 CRNCKHYI-PSRSIALHEAYC-SRHNVVCPHDGCGIVLR---------------VEEA-------------------KNH 453 (567)
T ss_pred CCCCCCcc-chhHHHHHHhhC-CCcceeCCcccccceee---------------cccc-------------------ccC
Confidence 45556544 455677787433 23445788655654442 1222 233
Q ss_pred cccccccccccChhhHHHHHhhcCCCCccccCcccccccccChhhHHHHHHhhcCCCCCCCCCCCCcchhc------h-h
Q psy16294 339 IELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDPSSLRKHIKTVHAPASLNPVPQHKDMQDI------Q-S 411 (486)
Q Consensus 339 ~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~~~~~~c~~c~~~~~~~------~-~ 411 (486)
+.|+.|++.|. ...|..|+.+|+ ++|.|+ ||+.| .+..|..|+.+|.+++++.|..|...+..- . .
T Consensus 454 ~~C~~Cgk~f~-~s~LekH~~~~H--kpv~Cp---Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~ 526 (567)
T PLN03086 454 VHCEKCGQAFQ-QGEMEKHMKVFH--EPLQCP---CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDR 526 (567)
T ss_pred ccCCCCCCccc-hHHHHHHHHhcC--CCccCC---CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhh
Confidence 58999999996 688999999975 899996 99765 679999999999999999999998877411 0 1
Q ss_pred hhhhccccccccccccchhhhccccccC
Q psy16294 412 NEVTQSNMRQSTYQRTLEYVEQCQNWVS 439 (486)
Q Consensus 412 ~~~~~~h~~~h~~~~~~~c~~c~~~f~~ 439 (486)
......|.... +.+++.|..||+.+..
T Consensus 527 ~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrl 553 (567)
T PLN03086 527 LRGMSEHESIC-GSRTAPCDSCGRSVML 553 (567)
T ss_pred hhhHHHHHHhc-CCcceEccccCCeeee
Confidence 11233566664 8999999999987743
No 13
>PHA00733 hypothetical protein
Probab=98.68 E-value=1.5e-08 Score=88.31 Aligned_cols=56 Identities=16% Similarity=0.161 Sum_probs=49.3
Q ss_pred cccccccccccccccChhhHHHHHhhcCCCCccccCcccccccccChhhHHHHHHhhcCC
Q psy16294 335 VMRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDPSSLRKHIKTVHAP 394 (486)
Q Consensus 335 ~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~~ 394 (486)
..++|.|+.|++.|.....|..|++.| +.+|.|.+ |++.|.....|.+|+...|+.
T Consensus 70 ~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~--CgK~F~~~~sL~~H~~~~h~~ 125 (128)
T PHA00733 70 AVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPV--CGKEFRNTDSTLDHVCKKHNI 125 (128)
T ss_pred CCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCC--CCCccCCHHHHHHHHHHhcCc
Confidence 356778888899999999999999987 46799999 999999999999999998874
No 14
>PHA02768 hypothetical protein; Provisional
Probab=98.64 E-value=1.3e-08 Score=73.91 Aligned_cols=44 Identities=9% Similarity=0.055 Sum_probs=36.7
Q ss_pred ccccccccccccChhhHHHHHhhcCCCCccccCcccccccccChhhHH
Q psy16294 338 RIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDPSSLR 385 (486)
Q Consensus 338 ~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~s~L~ 385 (486)
.|+|+.||+.|....+|..|+++|+ ++|+|.. |+|.|.+.+.|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~--C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSN--CKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCc--ccceecccceeE
Confidence 4788888888888888888888888 6888887 888888877664
No 15
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.38 E-value=1.7e-07 Score=58.09 Aligned_cols=26 Identities=27% Similarity=0.631 Sum_probs=23.9
Q ss_pred hHHHHHhhcCCCCccccCcccccccccC
Q psy16294 353 NVQFEILSRPKLKPYVCGEENCTKRYTD 380 (486)
Q Consensus 353 ~L~~H~~~H~~ekpy~C~~~gCgK~F~~ 380 (486)
+|+.||++|+++|||.|++ |+++|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~--C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPY--CGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESS--SSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCC--CcCeeCc
Confidence 5889999999999999999 9999974
No 16
>PHA00733 hypothetical protein
Probab=98.27 E-value=3.4e-07 Score=79.85 Aligned_cols=84 Identities=14% Similarity=0.178 Sum_probs=63.9
Q ss_pred ccceecCCCCCCCCCHHHHHHh--hhcccccCCCCcccccCCCCCCCCCcCCChHHHHHHhhhhcCCCCccCCCCCCCCc
Q psy16294 218 ETNCHWKDCGLEFGTQDILVKH--INNDHIHANKKSFICRWEDCTRDEKPFKAQYMLVVHMRRHTGEKPHKCTDLEAFDT 295 (486)
Q Consensus 218 ~~~~~C~~C~k~F~~~~~L~~H--~~~~~~H~~ek~~~C~~~~C~~C~k~F~~~~~L~~H~r~Htgekpy~C~~C~c~~~ 295 (486)
.+.+.|.+|.+.|.....|..| ++....+.++++|.| ..| ++.|.....|..|++.| +.+|.|++|+
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C--~~C---gk~Fss~s~L~~H~r~h--~~~~~C~~Cg---- 106 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVC--PLC---LMPFSSSVSLKQHIRYT--EHSKVCPVCG---- 106 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccC--CCC---CCcCCCHHHHHHHHhcC--CcCccCCCCC----
Confidence 4678899999999888777766 221113445889988 455 99999999999999987 4689999997
Q ss_pred chhhhhceecCccccCCcccccccccccc
Q psy16294 296 SSRVAARRIRGWKISRRISDLIPVRNRTR 324 (486)
Q Consensus 296 ~~~~~~~~H~~ek~F~~~~~L~~H~~~~H 324 (486)
+.|.....|..|+.+.|
T Consensus 107 ------------K~F~~~~sL~~H~~~~h 123 (128)
T PHA00733 107 ------------KEFRNTDSTLDHVCKKH 123 (128)
T ss_pred ------------CccCCHHHHHHHHHHhc
Confidence 46777777777775555
No 17
>PHA02768 hypothetical protein; Provisional
Probab=98.11 E-value=1.2e-06 Score=63.77 Aligned_cols=42 Identities=12% Similarity=0.238 Sum_probs=26.9
Q ss_pred eecCCCCCCCCCHHHHHHhhhcccccCCCCcccccCCCCCCCCCcCCChHHH
Q psy16294 221 CHWKDCGLEFGTQDILVKHINNDHIHANKKSFICRWEDCTRDEKPFKAQYML 272 (486)
Q Consensus 221 ~~C~~C~k~F~~~~~L~~H~~~~~~H~~ek~~~C~~~~C~~C~k~F~~~~~L 272 (486)
|.|+.||+.|....+|..|++ +|+ ++|+| ..| ++.|.+...|
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r---~H~--k~~kc--~~C---~k~f~~~s~l 47 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLR---KHN--TNLKL--SNC---KRISLRTGEY 47 (55)
T ss_pred cCcchhCCeeccHHHHHHHHH---hcC--CcccC--Ccc---cceeccccee
Confidence 567777777777777777776 566 46655 344 6666655544
No 18
>PHA00616 hypothetical protein
Probab=98.04 E-value=2.1e-06 Score=59.40 Aligned_cols=33 Identities=21% Similarity=0.416 Sum_probs=25.8
Q ss_pred ccccCcccccccccChhhHHHHHHhhcCCCCCCCC
Q psy16294 366 PYVCGEENCTKRYTDPSSLRKHIKTVHAPASLNPV 400 (486)
Q Consensus 366 py~C~~~gCgK~F~~~s~L~~H~k~hh~~~~~~c~ 400 (486)
||+|.. ||+.|.+++.|.+|++.||+++++.|+
T Consensus 1 pYqC~~--CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 1 MYQCLR--CGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CCccch--hhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 577877 888888888888888888888777665
No 19
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.02 E-value=2e-06 Score=53.27 Aligned_cols=22 Identities=55% Similarity=0.745 Sum_probs=17.6
Q ss_pred HHHHHhhhhcCCCCccCCCCCC
Q psy16294 271 MLVVHMRRHTGEKPHKCTDLEA 292 (486)
Q Consensus 271 ~L~~H~r~Htgekpy~C~~C~c 292 (486)
+|.+||++|+|++||+|++|+.
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k 22 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGK 22 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSE
T ss_pred CHHHHhhhcCCCCCCCCCCCcC
Confidence 4778888888888888887763
No 20
>PHA00616 hypothetical protein
Probab=97.93 E-value=3.8e-06 Score=58.15 Aligned_cols=34 Identities=12% Similarity=-0.145 Sum_probs=32.3
Q ss_pred ccccccccccccChhhHHHHHhhcCCCCccccCc
Q psy16294 338 RIELNIRTELTLMSLNVQFEILSRPKLKPYVCGE 371 (486)
Q Consensus 338 ~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~ 371 (486)
+|+|+.||+.|..++.|..|++.|+|++++.|+.
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 5899999999999999999999999999999875
No 21
>KOG3993|consensus
Probab=97.90 E-value=2e-06 Score=86.23 Aligned_cols=59 Identities=17% Similarity=0.172 Sum_probs=51.0
Q ss_pred ccccccccccccccccChhhHHHHHhhcC--------C-------------------------CCccccCcccccccccC
Q psy16294 334 LVMRRIELNIRTELTLMSLNVQFEILSRP--------K-------------------------LKPYVCGEENCTKRYTD 380 (486)
Q Consensus 334 ~~~~~~~C~~C~k~F~~~~~L~~H~~~H~--------~-------------------------ekpy~C~~~gCgK~F~~ 380 (486)
...-.|+|+.|+|+|....+|-.|.|-|. + +..|.|.+ |+|.|.+
T Consensus 291 IV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~rsg~dss~gi~~C~~--C~KkFrR 368 (500)
T KOG3993|consen 291 IVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAERSGDDSSSGIFSCHT--CGKKFRR 368 (500)
T ss_pred eEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhccccCCcccCceeecHH--hhhhhHH
Confidence 44568999999999999999999999874 1 12499999 9999999
Q ss_pred hhhHHHHHHhhcCC
Q psy16294 381 PSSLRKHIKTVHAP 394 (486)
Q Consensus 381 ~s~L~~H~k~hh~~ 394 (486)
...|++|+.+|+..
T Consensus 369 qAYLrKHqlthq~~ 382 (500)
T KOG3993|consen 369 QAYLRKHQLTHQRA 382 (500)
T ss_pred HHHHHHhHHhhhcc
Confidence 99999999999864
No 22
>KOG3993|consensus
Probab=97.80 E-value=2.4e-06 Score=85.69 Aligned_cols=147 Identities=16% Similarity=0.137 Sum_probs=97.0
Q ss_pred cceecCCCCCCCCCHHHHHHhhhcccccCCCCcccccCCCCCCCCCcCCChHHHHHHhhhhcC--------C--------
Q psy16294 219 TNCHWKDCGLEFGTQDILVKHINNDHIHANKKSFICRWEDCTRDEKPFKAQYMLVVHMRRHTG--------E-------- 282 (486)
Q Consensus 219 ~~~~C~~C~k~F~~~~~L~~H~~~~~~H~~ek~~~C~~~~C~~C~k~F~~~~~L~~H~r~Htg--------e-------- 282 (486)
..|.|..|...|.....|.+|.- .---...|+| .+| +|.|+...+|..|.|.|.- .
T Consensus 266 GdyiCqLCK~kYeD~F~LAQHrC---~RIV~vEYrC--PEC---~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~ 337 (500)
T KOG3993|consen 266 GDYICQLCKEKYEDAFALAQHRC---PRIVHVEYRC--PEC---DKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVE 337 (500)
T ss_pred HHHHHHHHHHhhhhHHHHhhccC---CeeEEeeecC--Ccc---cccccCchhhhhhhcccCCchhhhhcCCCChhhhhh
Confidence 45999999999999999999975 2222245777 455 9999999999999999851 1
Q ss_pred -----------------CCccCCCCCCCCcchhhhhceecCccccCCccccccccccccchhhhhhh-------------
Q psy16294 283 -----------------KPHKCTDLEAFDTSSRVAARRIRGWKISRRISDLIPVRNRTRVNILAVVK------------- 332 (486)
Q Consensus 283 -----------------kpy~C~~C~c~~~~~~~~~~~H~~ek~F~~~~~L~~H~~~~H~~~~~~~~------------- 332 (486)
..|.|.+|+ |.|.....|++|+ .+|........
T Consensus 338 ~rae~~ea~rsg~dss~gi~~C~~C~----------------KkFrRqAYLrKHq-lthq~~~~~k~~a~~f~~s~~~~l 400 (500)
T KOG3993|consen 338 TRAEVQEAERSGDDSSSGIFSCHTCG----------------KKFRRQAYLRKHQ-LTHQRAPLAKEKAPKFLLSRVIPL 400 (500)
T ss_pred hhhhhhhccccCCcccCceeecHHhh----------------hhhHHHHHHHHhH-HhhhccccchhcccCcchhhcccc
Confidence 237888776 3455555555554 33311111000
Q ss_pred ------------------------hccccccccccccccccChhhHHHHHhhcCCCCccccCcccccccccChhhHHHHH
Q psy16294 333 ------------------------RLVMRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDPSSLRKHI 388 (486)
Q Consensus 333 ------------------------~~~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~s~L~~H~ 388 (486)
........|+.|+..+..+..--.+.+.-.-+.-|.|.+ |--+|.....|.+|+
T Consensus 401 ~~~~~~~a~h~~a~~~~g~~vl~~a~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky--~~atfyss~~ltrhi 478 (500)
T KOG3993|consen 401 MHFNQAVATHSSASDSHGDEVLYVAGSAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKY--CPATFYSSPGLTRHI 478 (500)
T ss_pred cccccccccccccccccccceeeeeccccccCCCCCCCCcccCCCCCccccccchhhcccccc--chHhhhcCcchHhHh
Confidence 001123457777777766655555554444456688988 999999999999988
Q ss_pred Hhhc
Q psy16294 389 KTVH 392 (486)
Q Consensus 389 k~hh 392 (486)
..-|
T Consensus 479 n~~H 482 (500)
T KOG3993|consen 479 NKCH 482 (500)
T ss_pred hhcC
Confidence 7655
No 23
>PHA00732 hypothetical protein
Probab=97.73 E-value=1.9e-05 Score=62.67 Aligned_cols=47 Identities=17% Similarity=0.204 Sum_probs=39.0
Q ss_pred ccccccccccccChhhHHHHHhh-cCCCCccccCcccccccccChhhHHHHHHhhc
Q psy16294 338 RIELNIRTELTLMSLNVQFEILS-RPKLKPYVCGEENCTKRYTDPSSLRKHIKTVH 392 (486)
Q Consensus 338 ~~~C~~C~k~F~~~~~L~~H~~~-H~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh 392 (486)
+|.|..|++.|.....|+.|++. |. ++.|+. |++.|. .|..|++++.
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~--CgKsF~---~l~~H~~~~~ 48 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPV--CNKSYR---RLNQHFYSQY 48 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCC--CCCEeC---ChhhhhcccC
Confidence 47899999999999999999985 65 468988 999998 5888886644
No 24
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.64 E-value=6.3e-05 Score=55.29 Aligned_cols=51 Identities=18% Similarity=0.238 Sum_probs=42.5
Q ss_pred ccccccccccccChhhHHHHHhh-cCCC-CccccCcccccccccChhhHHHHHHhhcC
Q psy16294 338 RIELNIRTELTLMSLNVQFEILS-RPKL-KPYVCGEENCTKRYTDPSSLRKHIKTVHA 393 (486)
Q Consensus 338 ~~~C~~C~k~F~~~~~L~~H~~~-H~~e-kpy~C~~~gCgK~F~~~s~L~~H~k~hh~ 393 (486)
.|.|++|++ ..+...|..|... |..+ +.+.|++ |...+. .+|.+|+..+|+
T Consensus 2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPi--C~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPI--CSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCC--chhhhh--hHHHHHHHHhcC
Confidence 488999999 5667899999776 7664 6799999 999766 499999999875
No 25
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.48 E-value=5.1e-05 Score=73.69 Aligned_cols=27 Identities=44% Similarity=0.796 Sum_probs=23.0
Q ss_pred CCCCccccCcccccccccChhhHHHHHHh
Q psy16294 362 PKLKPYVCGEENCTKRYTDPSSLRKHIKT 390 (486)
Q Consensus 362 ~~ekpy~C~~~gCgK~F~~~s~L~~H~k~ 390 (486)
...|||.|++ |+|+|+....|+-|.+-
T Consensus 394 ~~~KPYrCev--C~KRYKNlNGLKYHr~H 420 (423)
T COG5189 394 AKDKPYRCEV--CDKRYKNLNGLKYHRKH 420 (423)
T ss_pred ccCCceeccc--cchhhccCccceecccc
Confidence 3459999999 99999999999988764
No 26
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.32 E-value=0.00014 Score=43.41 Aligned_cols=23 Identities=30% Similarity=0.792 Sum_probs=18.6
Q ss_pred cccCcccccccccChhhHHHHHHhh
Q psy16294 367 YVCGEENCTKRYTDPSSLRKHIKTV 391 (486)
Q Consensus 367 y~C~~~gCgK~F~~~s~L~~H~k~h 391 (486)
|+|+. |++.|.++..|++|++.|
T Consensus 1 y~C~~--C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPI--CGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETT--TTEEESSHHHHHHHHHHH
T ss_pred CCCCC--CCCccCCHHHHHHHHhHC
Confidence 67877 888888888888888774
No 27
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.20 E-value=0.00015 Score=70.47 Aligned_cols=31 Identities=35% Similarity=0.719 Sum_probs=27.3
Q ss_pred CCCCccccCcccccccccChhhHHHHHHhhc
Q psy16294 362 PKLKPYVCGEENCTKRYTDPSSLRKHIKTVH 392 (486)
Q Consensus 362 ~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh 392 (486)
.++|||+|+++||+|+|+....|+-|+.-=|
T Consensus 345 ~d~KpykCpV~gC~K~YknqnGLKYH~lhGH 375 (423)
T COG5189 345 KDGKPYKCPVEGCNKKYKNQNGLKYHMLHGH 375 (423)
T ss_pred ecCceecCCCCCchhhhccccchhhhhhccc
Confidence 3569999999999999999999999997544
No 28
>PHA00732 hypothetical protein
Probab=97.11 E-value=0.00041 Score=55.08 Aligned_cols=42 Identities=14% Similarity=0.219 Sum_probs=34.1
Q ss_pred ccccCcccccccccChhhHHHHHHhhcCCCCCCCCCCCCcchhchh
Q psy16294 366 PYVCGEENCTKRYTDPSSLRKHIKTVHAPASLNPVPQHKDMQDIQS 411 (486)
Q Consensus 366 py~C~~~gCgK~F~~~s~L~~H~k~hh~~~~~~c~~c~~~~~~~~~ 411 (486)
||.|+. |++.|.+...|++|++.+|. ++.|..|++.|.....
T Consensus 1 py~C~~--Cgk~F~s~s~Lk~H~r~~H~--~~~C~~CgKsF~~l~~ 42 (79)
T PHA00732 1 MFKCPI--CGFTTVTLFALKQHARRNHT--LTKCPVCNKSYRRLNQ 42 (79)
T ss_pred CccCCC--CCCccCCHHHHHHHhhcccC--CCccCCCCCEeCChhh
Confidence 689999 99999999999999996443 3579999888874333
No 29
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.11 E-value=0.00034 Score=41.75 Aligned_cols=24 Identities=38% Similarity=0.823 Sum_probs=18.7
Q ss_pred cccCcccccccccChhhHHHHHHhhc
Q psy16294 367 YVCGEENCTKRYTDPSSLRKHIKTVH 392 (486)
Q Consensus 367 y~C~~~gCgK~F~~~s~L~~H~k~hh 392 (486)
|.|++ |++.|.+...|++|+++||
T Consensus 1 ~~C~~--C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPI--CGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SS--TS-EESSHHHHHHHHHHHS
T ss_pred CCCcC--CCCcCCcHHHHHHHHHhhC
Confidence 67888 8888888888888888876
No 30
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.02 E-value=0.00032 Score=41.75 Aligned_cols=23 Identities=9% Similarity=-0.067 Sum_probs=21.6
Q ss_pred cccccccccccChhhHHHHHhhc
Q psy16294 339 IELNIRTELTLMSLNVQFEILSR 361 (486)
Q Consensus 339 ~~C~~C~k~F~~~~~L~~H~~~H 361 (486)
|.|+.|++.|..+..|+.|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 68999999999999999999875
No 31
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.86 E-value=0.0006 Score=42.31 Aligned_cols=26 Identities=35% Similarity=0.811 Sum_probs=22.0
Q ss_pred ccccCcccccccccChhhHHHHHHhhcC
Q psy16294 366 PYVCGEENCTKRYTDPSSLRKHIKTVHA 393 (486)
Q Consensus 366 py~C~~~gCgK~F~~~s~L~~H~k~hh~ 393 (486)
||.|.. |++.|.....|..|++.|+.
T Consensus 1 ~~~C~~--C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDE--CGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETT--TTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCc--cCCccCChhHHHHHhHHhcC
Confidence 588888 99999999999999888764
No 32
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=96.76 E-value=0.00051 Score=71.31 Aligned_cols=161 Identities=22% Similarity=0.296 Sum_probs=114.1
Q ss_pred cceecCCCCCCCCCHHHHHHhhhcccccCCC--CcccccCCCCCCCCCcCCChHHHHHHhhhhcCCCCccCCCCCCCCcc
Q psy16294 219 TNCHWKDCGLEFGTQDILVKHINNDHIHANK--KSFICRWEDCTRDEKPFKAQYMLVVHMRRHTGEKPHKCTDLEAFDTS 296 (486)
Q Consensus 219 ~~~~C~~C~k~F~~~~~L~~H~~~~~~H~~e--k~~~C~~~~C~~C~k~F~~~~~L~~H~r~Htgekpy~C~~C~c~~~~ 296 (486)
..+.|..|...|.....|..|.+. ..|.++ +++.|.+..| ++.|.+...+..|..+|++.++++|....+...+
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~-~~h~~~~~~~~~~p~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 363 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRS-VNHSGESLKPFSCPYSLC---GKLFSRNDALKRHILLHTSISPAKEKLLNSSSKF 363 (467)
T ss_pred cCCCCccccCCccccccccccccc-cccccccCCceeeeccCC---CccccccccccCCcccccCCCccccccccCcccc
Confidence 478999999999999999999871 258999 9999944345 9999999999999999999999999876554443
Q ss_pred hhhhhceecCccccCCccccccccccccchhhhhhhhcccccccc--ccccccccChhhHHHHHhhcCCCCccccCcccc
Q psy16294 297 SRVAARRIRGWKISRRISDLIPVRNRTRVNILAVVKRLVMRRIEL--NIRTELTLMSLNVQFEILSRPKLKPYVCGEENC 374 (486)
Q Consensus 297 ~~~~~~~H~~ek~F~~~~~L~~H~~~~H~~~~~~~~~~~~~~~~C--~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gC 374 (486)
......- .+. ... ..........+.| ..|-..+.....+..|...|...+++.|...+|
T Consensus 364 ~~~~~~~--------------~~~-~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (467)
T COG5048 364 SPLLNNE--------------PPQ-SLQ----QYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPC 424 (467)
T ss_pred ccccCCC--------------Ccc-chh----hccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcc
Confidence 3222200 000 000 0000112222223 347777788888888888888877766666669
Q ss_pred cccccChhhHHHHHHhhcCCCCCCCCCC
Q psy16294 375 TKRYTDPSSLRKHIKTVHAPASLNPVPQ 402 (486)
Q Consensus 375 gK~F~~~s~L~~H~k~hh~~~~~~c~~c 402 (486)
.+.|.....|..|++.|....++.|..+
T Consensus 425 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (467)
T COG5048 425 SKSFNRHYNLIPHKKIHTNHAPLLCSIL 452 (467)
T ss_pred hhhccCcccccccccccccCCceeeccc
Confidence 9999999999999998887777665533
No 33
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=96.71 E-value=0.0022 Score=47.01 Aligned_cols=51 Identities=20% Similarity=0.338 Sum_probs=37.2
Q ss_pred ceecCCCCCCCCCHHHHHHhhhcccccCCC-CcccccCCCCCCCCCcCCChHHHHHHhhhhc
Q psy16294 220 NCHWKDCGLEFGTQDILVKHINNDHIHANK-KSFICRWEDCTRDEKPFKAQYMLVVHMRRHT 280 (486)
Q Consensus 220 ~~~C~~C~k~F~~~~~L~~H~~~~~~H~~e-k~~~C~~~~C~~C~k~F~~~~~L~~H~r~Ht 280 (486)
.|.|+.|++. -+...|..|+.. .|..+ +.+.| ++|...+. .+|..|+..+.
T Consensus 2 ~f~CP~C~~~-~~~~~L~~H~~~--~H~~~~~~v~C-----PiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 2 SFTCPYCGKG-FSESSLVEHCED--EHRSESKNVVC-----PICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred CcCCCCCCCc-cCHHHHHHHHHh--HCcCCCCCccC-----CCchhhhh--hHHHHHHHHhc
Confidence 4889999995 557889999874 56654 45766 55577654 48999998654
No 34
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.22 E-value=0.0026 Score=37.75 Aligned_cols=23 Identities=4% Similarity=-0.045 Sum_probs=19.7
Q ss_pred cccccccccccChhhHHHHHhhc
Q psy16294 339 IELNIRTELTLMSLNVQFEILSR 361 (486)
Q Consensus 339 ~~C~~C~k~F~~~~~L~~H~~~H 361 (486)
|.|++|++.|.+...|+.|+++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 68999999999999999999986
No 35
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.21 E-value=0.003 Score=51.88 Aligned_cols=25 Identities=24% Similarity=0.736 Sum_probs=21.8
Q ss_pred ccccCcccccccccChhhHHHHHHhhc
Q psy16294 366 PYVCGEENCTKRYTDPSSLRKHIKTVH 392 (486)
Q Consensus 366 py~C~~~gCgK~F~~~s~L~~H~k~hh 392 (486)
.+.|.. |++.|.....|..||+.++
T Consensus 50 ~~~C~~--C~~~f~s~~~l~~Hm~~~~ 74 (100)
T PF12756_consen 50 SFRCPY--CNKTFRSREALQEHMRSKH 74 (100)
T ss_dssp SEEBSS--SS-EESSHHHHHHHHHHTT
T ss_pred CCCCCc--cCCCCcCHHHHHHHHcCcc
Confidence 599999 9999999999999999753
No 36
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.95 E-value=0.005 Score=43.79 Aligned_cols=33 Identities=21% Similarity=0.427 Sum_probs=24.4
Q ss_pred CCCCccccCcccccccccChhhHHHHHHhhcCCCC
Q psy16294 362 PKLKPYVCGEENCTKRYTDPSSLRKHIKTVHAPAS 396 (486)
Q Consensus 362 ~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~~~~ 396 (486)
..+.|-.|++ |+..+.+..+|++|+.++|+.+|
T Consensus 20 ~S~~PatCP~--C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 20 QSEQPATCPI--CGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TTS--EE-TT--T--EESSHHHHHHHHHHHTTTS-
T ss_pred ccCCCCCCCc--chhhccchhhHHHHHHHHhcccC
Confidence 4578999999 99999999999999999998876
No 37
>KOG2231|consensus
Probab=95.90 E-value=0.015 Score=63.33 Aligned_cols=95 Identities=11% Similarity=-0.031 Sum_probs=57.9
Q ss_pred cccccccccccChhhHHHHHhhcCCCCccccCc----ccccccccChhhHHHHHHhhcCCCCCCCCCCCCcchhchhhhh
Q psy16294 339 IELNIRTELTLMSLNVQFEILSRPKLKPYVCGE----ENCTKRYTDPSSLRKHIKTVHAPASLNPVPQHKDMQDIQSNEV 414 (486)
Q Consensus 339 ~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~----~gCgK~F~~~s~L~~H~k~hh~~~~~~c~~c~~~~~~~~~~~~ 414 (486)
-.|.+|...|.....|.+|++.+. |.|.. ++++.-|.....|..|-+.+|--....++.+.+.+.+++.-..
T Consensus 183 p~C~~C~~~fld~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~HflCE~~~C~~~~f~~~~~~ei~ 258 (669)
T KOG2231|consen 183 PLCKFCHERFLDDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGHFLCEEEFCRTKKFYVAFELEIE 258 (669)
T ss_pred ccchhhhhhhccHHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcCccccccccccceeeehhHHHHH
Confidence 468888888888888888887643 44544 1234567778888888888774444333334333334344334
Q ss_pred hccccccccccccchhh--hccccc
Q psy16294 415 TQSNMRQSTYQRTLEYV--EQCQNW 437 (486)
Q Consensus 415 ~~~h~~~h~~~~~~~c~--~c~~~f 437 (486)
++.|.+.++.+..|.|. .-|+.+
T Consensus 259 lk~~~~~~~~e~~~~~~~~r~Gr~s 283 (669)
T KOG2231|consen 259 LKAHNRFIQHEKCYICRPSRPGRPS 283 (669)
T ss_pred HHhhccccchheeccCCcccCCCCc
Confidence 44455566666666664 555544
No 38
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=95.79 E-value=0.0057 Score=37.81 Aligned_cols=25 Identities=8% Similarity=-0.219 Sum_probs=23.2
Q ss_pred ccccccccccccChhhHHHHHhhcC
Q psy16294 338 RIELNIRTELTLMSLNVQFEILSRP 362 (486)
Q Consensus 338 ~~~C~~C~k~F~~~~~L~~H~~~H~ 362 (486)
+|+|..|++.|.....|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 4799999999999999999999875
No 39
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.75 E-value=0.0081 Score=62.28 Aligned_cols=70 Identities=14% Similarity=0.138 Sum_probs=61.3
Q ss_pred cccccccccccccChhhHHHHHh--hcCCC--CccccCcccccccccChhhHHHHHHhhcCCCCCCCCCCCCcc
Q psy16294 337 RRIELNIRTELTLMSLNVQFEIL--SRPKL--KPYVCGEENCTKRYTDPSSLRKHIKTVHAPASLNPVPQHKDM 406 (486)
Q Consensus 337 ~~~~C~~C~k~F~~~~~L~~H~~--~H~~e--kpy~C~~~gCgK~F~~~s~L~~H~k~hh~~~~~~c~~c~~~~ 406 (486)
.++.|..|...|.....|..|.+ .|.++ +|+.|+...|++.|.+...+..|..+|.+..++.+.......
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSS 361 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcc
Confidence 46899999999999999999999 89999 999999223999999999999999999999988876444443
No 40
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.61 E-value=0.01 Score=35.64 Aligned_cols=24 Identities=38% Similarity=0.807 Sum_probs=17.9
Q ss_pred cccCcccccccccChhhHHHHHHhhc
Q psy16294 367 YVCGEENCTKRYTDPSSLRKHIKTVH 392 (486)
Q Consensus 367 y~C~~~gCgK~F~~~s~L~~H~k~hh 392 (486)
|+|.. |++.|.....|..|++.|.
T Consensus 1 ~~C~~--C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPE--CGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCC--CcchhCCHHHHHHHHHHhc
Confidence 56777 8888888888888877553
No 41
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=95.47 E-value=0.0087 Score=49.10 Aligned_cols=24 Identities=8% Similarity=-0.089 Sum_probs=20.9
Q ss_pred ccccccccccccChhhHHHHHhhc
Q psy16294 338 RIELNIRTELTLMSLNVQFEILSR 361 (486)
Q Consensus 338 ~~~C~~C~k~F~~~~~L~~H~~~H 361 (486)
.+.|.+|++.|.....|..||+.+
T Consensus 50 ~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 50 SFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp SEEBSSSS-EESSHHHHHHHHHHT
T ss_pred CCCCCccCCCCcCHHHHHHHHcCc
Confidence 689999999999999999999974
No 42
>PRK04860 hypothetical protein; Provisional
Probab=95.13 E-value=0.0095 Score=53.92 Aligned_cols=40 Identities=15% Similarity=0.087 Sum_probs=33.1
Q ss_pred cccccccccccccChhhHHHHHhhcCCCCccccCcccccccccChh
Q psy16294 337 RRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDPS 382 (486)
Q Consensus 337 ~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~s 382 (486)
.+|.|. |++ ....+++|.++|+++++|.|.. |++.|....
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~--C~~~l~~~~ 157 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRR--CGETLVFKG 157 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCC--CCceeEEec
Confidence 478897 887 7778899999999999999988 998887543
No 43
>smart00355 ZnF_C2H2 zinc finger.
Probab=94.87 E-value=0.02 Score=34.24 Aligned_cols=24 Identities=0% Similarity=-0.141 Sum_probs=22.2
Q ss_pred cccccccccccChhhHHHHHhhcC
Q psy16294 339 IELNIRTELTLMSLNVQFEILSRP 362 (486)
Q Consensus 339 ~~C~~C~k~F~~~~~L~~H~~~H~ 362 (486)
|.|..|++.|.....|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 579999999999999999999875
No 44
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=94.78 E-value=0.02 Score=34.33 Aligned_cols=24 Identities=33% Similarity=0.760 Sum_probs=18.1
Q ss_pred cccCcccccccccChhhHHHHHHhhcC
Q psy16294 367 YVCGEENCTKRYTDPSSLRKHIKTVHA 393 (486)
Q Consensus 367 y~C~~~gCgK~F~~~s~L~~H~k~hh~ 393 (486)
|+|+. |+.... ...|++|+++||+
T Consensus 1 y~C~~--C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPH--CSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SS--SS-EES-HHHHHHHHHHHHS
T ss_pred CCCCC--CCCcCC-HHHHHHHHHhhCc
Confidence 67888 888887 8888999888774
No 45
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=94.13 E-value=0.031 Score=33.73 Aligned_cols=23 Identities=30% Similarity=0.856 Sum_probs=17.4
Q ss_pred cccCcccccccccChhhHHHHHHhh
Q psy16294 367 YVCGEENCTKRYTDPSSLRKHIKTV 391 (486)
Q Consensus 367 y~C~~~gCgK~F~~~s~L~~H~k~h 391 (486)
|.|++ |++.|.....|+.|++.+
T Consensus 1 ~~C~~--C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDI--CNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETT--TTEEESSHHHHHHHHTTH
T ss_pred CCCCC--CCCCcCCHHHHHHHHCcC
Confidence 56777 888888888888887653
No 46
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=93.42 E-value=0.054 Score=38.66 Aligned_cols=33 Identities=18% Similarity=0.032 Sum_probs=23.8
Q ss_pred ccccccccccccccccChhhHHHHHhhcCCCCc
Q psy16294 334 LVMRRIELNIRTELTLMSLNVQFEILSRPKLKP 366 (486)
Q Consensus 334 ~~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekp 366 (486)
..+.+-.|++|+..+....+|++|+.++++.||
T Consensus 20 ~S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 20 QSEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TTS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred ccCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 456688999999999999999999998777775
No 47
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=93.39 E-value=0.042 Score=33.15 Aligned_cols=23 Identities=9% Similarity=-0.006 Sum_probs=21.1
Q ss_pred cccccccccccChhhHHHHHhhc
Q psy16294 339 IELNIRTELTLMSLNVQFEILSR 361 (486)
Q Consensus 339 ~~C~~C~k~F~~~~~L~~H~~~H 361 (486)
|.|.+|++.|.....|+.|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 67999999999999999999864
No 48
>PRK04860 hypothetical protein; Provisional
Probab=92.39 E-value=0.082 Score=47.91 Aligned_cols=29 Identities=17% Similarity=0.296 Sum_probs=16.3
Q ss_pred cceecCCCCCCCCCHHHHHHhhhcccccCCCCcccc
Q psy16294 219 TNCHWKDCGLEFGTQDILVKHINNDHIHANKKSFIC 254 (486)
Q Consensus 219 ~~~~C~~C~k~F~~~~~L~~H~~~~~~H~~ek~~~C 254 (486)
.+|.|. |++ ....+++|.+ +|+++++|.|
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~r---i~~g~~~YrC 146 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNR---VVRGEAVYRC 146 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHH---HhcCCccEEC
Confidence 356665 555 4445556655 5666666655
No 49
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=91.33 E-value=0.059 Score=33.32 Aligned_cols=22 Identities=27% Similarity=0.872 Sum_probs=16.8
Q ss_pred cccCcccccccccChhhHHHHHHh
Q psy16294 367 YVCGEENCTKRYTDPSSLRKHIKT 390 (486)
Q Consensus 367 y~C~~~gCgK~F~~~s~L~~H~k~ 390 (486)
|.|.. |++.|.+...|..|+++
T Consensus 2 ~~C~~--C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDA--CDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTT--TTBBBSSHHHHHCCTTS
T ss_pred CCccc--CCCCcCCHHHHHHHHcc
Confidence 66777 88888888888777764
No 50
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=91.32 E-value=0.1 Score=32.20 Aligned_cols=21 Identities=19% Similarity=0.430 Sum_probs=15.5
Q ss_pred eecCCCCCCCCCHHHHHHhhh
Q psy16294 221 CHWKDCGLEFGTQDILVKHIN 241 (486)
Q Consensus 221 ~~C~~C~k~F~~~~~L~~H~~ 241 (486)
|.|..|++.|.+...|..|++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTT
T ss_pred CCcccCCCCcCCHHHHHHHHc
Confidence 567777777777777777765
No 51
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=90.54 E-value=0.14 Score=50.89 Aligned_cols=121 Identities=13% Similarity=0.052 Sum_probs=62.0
Q ss_pred ccccCCCCCCCCCcCCChHHHHHHhhhhcCCCCccCCCCC-CCCcchhhhhceecCccccCCccccccccccccchhhhh
Q psy16294 252 FICRWEDCTRDEKPFKAQYMLVVHMRRHTGEKPHKCTDLE-AFDTSSRVAARRIRGWKISRRISDLIPVRNRTRVNILAV 330 (486)
Q Consensus 252 ~~C~~~~C~~C~k~F~~~~~L~~H~r~Htgekpy~C~~C~-c~~~~~~~~~~~H~~ek~F~~~~~L~~H~~~~H~~~~~~ 330 (486)
|.|....| ..+-.....|..|.+..++ .+-|..|- -.+.|... ...-++..|..|+..--
T Consensus 152 F~CP~skc---~~~C~~~k~lk~H~K~~H~--~~~C~~C~~nKk~F~~E--------~~lF~~~~Lr~H~~~G~------ 212 (493)
T COG5236 152 FKCPKSKC---HRRCGSLKELKKHYKAQHG--FVLCSECIGNKKDFWNE--------IRLFRSSTLRDHKNGGL------ 212 (493)
T ss_pred hcCCchhh---hhhhhhHHHHHHHHHhhcC--cEEhHhhhcCcccCccc--------eeeeecccccccccCCc------
Confidence 55543334 3333345567777664332 36677663 22222211 12223445666652111
Q ss_pred hhhccccccccccccccccChhhHHHHHhhcCCCCccccCccc--ccccccChhhHHHHHHhhc
Q psy16294 331 VKRLVMRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEEN--CTKRYTDPSSLRKHIKTVH 392 (486)
Q Consensus 331 ~~~~~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~g--CgK~F~~~s~L~~H~k~hh 392 (486)
.....+.--.|.+|...|-....|..|+|..+ |+-|.|+.-+ =-.-|....+|..|-+.-|
T Consensus 213 ~e~GFKGHP~C~FC~~~FYdDDEL~~HcR~~H-E~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h 275 (493)
T COG5236 213 EEEGFKGHPLCIFCKIYFYDDDELRRHCRLRH-EACHICDMVGPIRYQYFKSYEDLEAHFRNAH 275 (493)
T ss_pred cccCcCCCchhhhccceecChHHHHHHHHhhh-hhhhhhhccCccchhhhhCHHHHHHHhhcCc
Confidence 00011122368999999999999999988632 2334433200 0123677778888877655
No 52
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=89.99 E-value=0.14 Score=30.55 Aligned_cols=23 Identities=9% Similarity=-0.097 Sum_probs=18.4
Q ss_pred cccccccccccChhhHHHHHhhcC
Q psy16294 339 IELNIRTELTLMSLNVQFEILSRP 362 (486)
Q Consensus 339 ~~C~~C~k~F~~~~~L~~H~~~H~ 362 (486)
|+|+.|+.... +..|..|++.|.
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 68999999998 999999999853
No 53
>KOG4377|consensus
Probab=87.91 E-value=0.37 Score=49.08 Aligned_cols=134 Identities=16% Similarity=0.226 Sum_probs=79.1
Q ss_pred ceec--CCCCCCCCCHHHHHHhhhcccccCCC------------CcccccCCCCCCCCCcCCChHHHHHHhhhhcCCC--
Q psy16294 220 NCHW--KDCGLEFGTQDILVKHINNDHIHANK------------KSFICRWEDCTRDEKPFKAQYMLVVHMRRHTGEK-- 283 (486)
Q Consensus 220 ~~~C--~~C~k~F~~~~~L~~H~~~~~~H~~e------------k~~~C~~~~C~~C~k~F~~~~~L~~H~r~Htgek-- 283 (486)
-|+| +.|+..+-.+.+..+|.. +|... ..|.|....| +| +-.....|-..|+..+
T Consensus 271 hyhcl~e~C~ykr~~k~DvirH~~---~hkkrdnsL~dgf~rfs~syhC~~~~C---~k---sTsdV~~h~nFht~~~n~ 341 (480)
T KOG4377|consen 271 HYHCLNEYCFYKRGQKNDVIRHVE---IHKKRDNSLIDGFHRFSNSYHCTGQIC---EK---STSDVLLHDNFHTDKRNN 341 (480)
T ss_pred hhcccCccccccccchhhhHHHHH---HHhhcccccccchhhcCccchhhhccc---Cc---ccccccccCccccccccC
Confidence 3555 568777777888999987 66432 1367866666 66 3445666777775321
Q ss_pred -----CccCCCCCCCCcchhhhhceecCccccCCccccccccccccchhhhhhhhccccccccccccc---cccChhhHH
Q psy16294 284 -----PHKCTDLEAFDTSSRVAARRIRGWKISRRISDLIPVRNRTRVNILAVVKRLVMRRIELNIRTE---LTLMSLNVQ 355 (486)
Q Consensus 284 -----py~C~~C~c~~~~~~~~~~~H~~ek~F~~~~~L~~H~~~~H~~~~~~~~~~~~~~~~C~~C~k---~F~~~~~L~ 355 (486)
-|.|..|+|...|... +|. .-|............+.|+-..|.. .|...
T Consensus 342 GfrrthfhC~r~gCTdtfK~~------------------khk-~yh~kdda~~~dGfkkf~k~e~cay~gCkys~~---- 398 (480)
T KOG4377|consen 342 GFRRTHFHCQRIGCTDTFKDS------------------KHK-PYHYKDDAGEIDGFKKFFKDENCAYTGCKYSGI---- 398 (480)
T ss_pred ceecceeEEeccCCccccccc------------------ccc-ccccCcchhhhhhhhhhhccccCCccCcccccc----
Confidence 2778888887766521 222 2222222222222333333333322 11111
Q ss_pred HHHhhcCCCCccccCcccccccccChhhHHHHHHhhcC
Q psy16294 356 FEILSRPKLKPYVCGEENCTKRYTDPSSLRKHIKTVHA 393 (486)
Q Consensus 356 ~H~~~H~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~ 393 (486)
-.-|.|...||+.+|...+.+..|.+.|..
T Consensus 399 --------cnhfhc~r~Gc~~tl~s~sqm~shkrkheR 428 (480)
T KOG4377|consen 399 --------CNHFHCDRLGCEATLYSVSQMASHKRKHER 428 (480)
T ss_pred --------eeeeeecccCCceEEEehhhhhhhhhhhhh
Confidence 134889999999999999999999888653
No 54
>KOG2231|consensus
Probab=86.72 E-value=0.34 Score=53.07 Aligned_cols=81 Identities=14% Similarity=0.194 Sum_probs=48.9
Q ss_pred ccCCccccccccccccchhhhhhhhccccccccccccccccChhhHHHHHhhc-CCCC----ccccCcccccccccChhh
Q psy16294 309 ISRRISDLIPVRNRTRVNILAVVKRLVMRRIELNIRTELTLMSLNVQFEILSR-PKLK----PYVCGEENCTKRYTDPSS 383 (486)
Q Consensus 309 ~F~~~~~L~~H~~~~H~~~~~~~~~~~~~~~~C~~C~k~F~~~~~L~~H~~~H-~~ek----py~C~~~gCgK~F~~~s~ 383 (486)
.|.....|+.|+...|............+ .+..+.+.| ++..|..|+..- .+++ --.|.. |...|.....
T Consensus 123 ~~~s~~~Lk~H~~~~H~~~~c~lC~~~~k--if~~e~k~Y-t~~el~~h~~~gd~d~~s~rGhp~C~~--C~~~fld~~e 197 (669)
T KOG2231|consen 123 EFKSVENLKNHMRDQHKLHLCSLCLQNLK--IFINERKLY-TRAELNLHLMFGDPDDESCRGHPLCKF--CHERFLDDDE 197 (669)
T ss_pred chhHHHHHHHHHHHhhhhhccccccccce--eeeeeeehe-hHHHHHHHHhcCCCccccccCCccchh--hhhhhccHHH
Confidence 56667788889877772111111111111 222333333 456777777641 1221 257988 9999999999
Q ss_pred HHHHHHhhcCC
Q psy16294 384 LRKHIKTVHAP 394 (486)
Q Consensus 384 L~~H~k~hh~~ 394 (486)
|.+|++.+|-.
T Consensus 198 l~rH~~~~h~~ 208 (669)
T KOG2231|consen 198 LYRHLRFDHEF 208 (669)
T ss_pred HHHhhccceeh
Confidence 99999988743
No 55
>KOG1146|consensus
Probab=86.43 E-value=0.41 Score=55.56 Aligned_cols=50 Identities=20% Similarity=0.223 Sum_probs=38.5
Q ss_pred ecCCCCCCCCCHHHHHHhhhcccccCCCCcccccCCCCCCCCCcCCChHHHHHHhhh
Q psy16294 222 HWKDCGLEFGTQDILVKHINNDHIHANKKSFICRWEDCTRDEKPFKAQYMLVVHMRR 278 (486)
Q Consensus 222 ~C~~C~k~F~~~~~L~~H~~~~~~H~~ek~~~C~~~~C~~C~k~F~~~~~L~~H~r~ 278 (486)
.|.-|+..+..+..+.-|+.. +|...+.|+| +.|+..|+....|..|||.
T Consensus 438 e~~~~e~~~~s~r~~~~~t~~--L~S~~kt~~c-----pkc~~~yk~a~~L~vhmRs 487 (1406)
T KOG1146|consen 438 ELTKAEPLLESKRSLEGQTVV--LHSFFKTLKC-----PKCNWHYKLAQTLGVHMRS 487 (1406)
T ss_pred cccchhhhhhhhcccccceee--eecccccccC-----CccchhhhhHHHhhhcccc
Confidence 456677777777778777753 6777788877 4449999999999999995
No 56
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=86.23 E-value=0.45 Score=47.40 Aligned_cols=133 Identities=17% Similarity=0.207 Sum_probs=80.7
Q ss_pred eecC--CCCCCCCCHHHHHHhhhcccccCCCCcccccCCCCCCCCCcC------CChHHHHHHhhhhcCCCCc----cCC
Q psy16294 221 CHWK--DCGLEFGTQDILVKHINNDHIHANKKSFICRWEDCTRDEKPF------KAQYMLVVHMRRHTGEKPH----KCT 288 (486)
Q Consensus 221 ~~C~--~C~k~F~~~~~L~~H~~~~~~H~~ek~~~C~~~~C~~C~k~F------~~~~~L~~H~r~Htgekpy----~C~ 288 (486)
|.|+ .|.........|+.|.+. .|. .+.| ..|..-.+.| .++..|..|...-..+.-| .|.
T Consensus 152 F~CP~skc~~~C~~~k~lk~H~K~--~H~---~~~C--~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~ 224 (493)
T COG5236 152 FKCPKSKCHRRCGSLKELKKHYKA--QHG---FVLC--SECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCI 224 (493)
T ss_pred hcCCchhhhhhhhhHHHHHHHHHh--hcC---cEEh--HhhhcCcccCccceeeeecccccccccCCccccCcCCCchhh
Confidence 4453 467777778889999873 222 2344 3343223333 3345566665543322222 577
Q ss_pred CCCCCCcchhhhhceecCccccCCccccccccccccchhhhhhhhccccccccccccc-------cccChhhHHHHHhhc
Q psy16294 289 DLEAFDTSSRVAARRIRGWKISRRISDLIPVRNRTRVNILAVVKRLVMRRIELNIRTE-------LTLMSLNVQFEILSR 361 (486)
Q Consensus 289 ~C~c~~~~~~~~~~~H~~ek~F~~~~~L~~H~~~~H~~~~~~~~~~~~~~~~C~~C~k-------~F~~~~~L~~H~~~H 361 (486)
+|. +.|..-..|.+|++..| + .|-+|++ -|+.-..|..|.+.-
T Consensus 225 FC~----------------~~FYdDDEL~~HcR~~H-----------E---~ChICD~v~p~~~QYFK~Y~~Le~HF~~~ 274 (493)
T COG5236 225 FCK----------------IYFYDDDELRRHCRLRH-----------E---ACHICDMVGPIRYQYFKSYEDLEAHFRNA 274 (493)
T ss_pred hcc----------------ceecChHHHHHHHHhhh-----------h---hhhhhhccCccchhhhhCHHHHHHHhhcC
Confidence 765 35666777778876666 2 3445544 367777788887642
Q ss_pred CCCCccccCccccc----ccccChhhHHHHHHhhcCC
Q psy16294 362 PKLKPYVCGEENCT----KRYTDPSSLRKHIKTVHAP 394 (486)
Q Consensus 362 ~~ekpy~C~~~gCg----K~F~~~s~L~~H~k~hh~~ 394 (486)
-|.|.+..|- ..|.....|..|+...|+.
T Consensus 275 ----hy~ct~qtc~~~k~~vf~~~~el~~h~~~~h~~ 307 (493)
T COG5236 275 ----HYCCTFQTCRVGKCYVFPYHTELLEHLTRFHKV 307 (493)
T ss_pred ----ceEEEEEEEecCcEEEeccHHHHHHHHHHHhhc
Confidence 2788766563 4688999999999887765
No 57
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=84.54 E-value=0.78 Score=27.89 Aligned_cols=21 Identities=33% Similarity=0.774 Sum_probs=15.2
Q ss_pred cccCcccccccccChhhHHHHHHh
Q psy16294 367 YVCGEENCTKRYTDPSSLRKHIKT 390 (486)
Q Consensus 367 y~C~~~gCgK~F~~~s~L~~H~k~ 390 (486)
..|+. ||+.| ..+.|.+|+++
T Consensus 3 ~~C~~--CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPI--CGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCC--CCCEE-CHHHHHHHHHh
Confidence 46777 88888 66777788764
No 58
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=83.73 E-value=1 Score=29.28 Aligned_cols=22 Identities=18% Similarity=0.309 Sum_probs=17.9
Q ss_pred ceecCCCCCCCCCHHHHHHhhh
Q psy16294 220 NCHWKDCGLEFGTQDILVKHIN 241 (486)
Q Consensus 220 ~~~C~~C~k~F~~~~~L~~H~~ 241 (486)
+|.|+.|++.|.....+..|++
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 4778888888888888888875
No 59
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=83.07 E-value=0.81 Score=29.75 Aligned_cols=23 Identities=30% Similarity=0.652 Sum_probs=19.7
Q ss_pred ccccCcccccccccChhhHHHHHHh
Q psy16294 366 PYVCGEENCTKRYTDPSSLRKHIKT 390 (486)
Q Consensus 366 py~C~~~gCgK~F~~~s~L~~H~k~ 390 (486)
+|.|++ |++.|.....+..|++.
T Consensus 3 ~~~C~~--C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKL--CNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccc--cCCccCCHHHHHHHHCh
Confidence 588988 99999999999988864
No 60
>KOG4173|consensus
Probab=81.65 E-value=0.5 Score=43.76 Aligned_cols=54 Identities=17% Similarity=0.285 Sum_probs=45.4
Q ss_pred ccccccccccChhhHHHHHhh-c---------CCCCccccCcccccccccChhhHHHHHHhhcC
Q psy16294 340 ELNIRTELTLMSLNVQFEILS-R---------PKLKPYVCGEENCTKRYTDPSSLRKHIKTVHA 393 (486)
Q Consensus 340 ~C~~C~k~F~~~~~L~~H~~~-H---------~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~ 393 (486)
.|.+|.+.|.+..-|..|+.- | .|.--|+|-++||+..|.+...-+.|+-.-|.
T Consensus 108 sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~KFkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 108 SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEKFKTSRDRKDHMIRMHK 171 (253)
T ss_pred hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhhhhhhhhhhhHHHHhcc
Confidence 699999999999999999864 3 24456999999999999999999999866553
No 61
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=81.46 E-value=1.1 Score=27.23 Aligned_cols=21 Identities=5% Similarity=-0.227 Sum_probs=17.8
Q ss_pred cccccccccccChhhHHHHHhh
Q psy16294 339 IELNIRTELTLMSLNVQFEILS 360 (486)
Q Consensus 339 ~~C~~C~k~F~~~~~L~~H~~~ 360 (486)
..|+.|++.| ....|..|+.+
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHh
Confidence 4799999999 77889999764
No 62
>KOG2482|consensus
Probab=80.14 E-value=1.6 Score=43.70 Aligned_cols=58 Identities=21% Similarity=0.303 Sum_probs=47.4
Q ss_pred cccccccccccccccChhhHHHHHhh--cCCC------------------------------------------------
Q psy16294 335 VMRRIELNIRTELTLMSLNVQFEILS--RPKL------------------------------------------------ 364 (486)
Q Consensus 335 ~~~~~~C~~C~k~F~~~~~L~~H~~~--H~~e------------------------------------------------ 364 (486)
.-.++.|-.|.+.|..+..|+.|||. |...
T Consensus 192 kL~r~~CLyCekifrdkntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ 271 (423)
T KOG2482|consen 192 KLERLRCLYCEKIFRDKNTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWN 271 (423)
T ss_pred HHhhheeeeeccccCCcHHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhh
Confidence 34578999999999999999999984 4311
Q ss_pred ----Cc--cccCcccccccccChhhHHHHHHhhcCC
Q psy16294 365 ----KP--YVCGEENCTKRYTDPSSLRKHIKTVHAP 394 (486)
Q Consensus 365 ----kp--y~C~~~gCgK~F~~~s~L~~H~k~hh~~ 394 (486)
-+ .+|-. |.+.+-+...|..||++.|.-
T Consensus 272 ed~a~a~~v~CLf--C~~~~en~~~l~eHmk~vHe~ 305 (423)
T KOG2482|consen 272 EDDAEALSVVCLF--CTNFYENPVFLFEHMKIVHEF 305 (423)
T ss_pred cCCCCccceEEEe--eccchhhHHHHHHHHHHHHHh
Confidence 11 48988 999999999999999998853
No 63
>KOG2893|consensus
Probab=77.57 E-value=0.72 Score=43.69 Aligned_cols=47 Identities=17% Similarity=0.092 Sum_probs=38.0
Q ss_pred cccccccccChhhHHHHHhhcCCCCccccCcccccccccChhhHHHHHHhhcC
Q psy16294 341 LNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDPSSLRKHIKTVHA 393 (486)
Q Consensus 341 C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~ 393 (486)
|.+|++.|....-|..|++. |-|+|.+ |.|...+--.|..|-..+|.
T Consensus 13 cwycnrefddekiliqhqka----khfkchi--chkkl~sgpglsihcmqvhk 59 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKA----KHFKCHI--CHKKLFSGPGLSIHCMQVHK 59 (341)
T ss_pred eeecccccchhhhhhhhhhh----ccceeee--ehhhhccCCCceeehhhhhh
Confidence 99999999999999998764 5599999 99988777777777443343
No 64
>KOG1146|consensus
Probab=74.39 E-value=1.7 Score=50.72 Aligned_cols=58 Identities=12% Similarity=0.051 Sum_probs=47.0
Q ss_pred cccccccccccccccChhhHHHHHhh-----------------cCCCCccccCcccccccccChhhHHHHHHhhcCCC
Q psy16294 335 VMRRIELNIRTELTLMSLNVQFEILS-----------------RPKLKPYVCGEENCTKRYTDPSSLRKHIKTVHAPA 395 (486)
Q Consensus 335 ~~~~~~C~~C~k~F~~~~~L~~H~~~-----------------H~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~~~ 395 (486)
...+|.|..|...|.....|..|.+. |...++| |.+ |...|.....|..||+..+++.
T Consensus 1281 ~~~~~~~~~~~~~~~~~~~l~~~~~k~~~~~~~~~~~~~~~l~~~d~~~~-c~~--c~~~~~~~~alqihm~~~~~~~ 1355 (1406)
T KOG1146|consen 1281 VTHRYLCRQCKMAFDGEAPLTAHQRKFCFAGRGSGGSMPPPLRVPDCTYH-CLA--CEVLLSGREALQIHMRSSAHRR 1355 (1406)
T ss_pred cchhHHHHHHHhhhcchhHHHHHHHHHHhccCccccCCCCcccCcccccc-chH--HHhhcchhHHHHHHHHHhhhcc
Confidence 44578999999999999999988752 2334567 999 9999999999999999876654
No 65
>KOG2893|consensus
Probab=74.18 E-value=1.1 Score=42.52 Aligned_cols=49 Identities=31% Similarity=0.513 Sum_probs=36.1
Q ss_pred cceecCCCCCCCCCHHHHHHhhhcccccCCCCcccccCCCCCCCCCcCCChHHHHHH-hhhhc
Q psy16294 219 TNCHWKDCGLEFGTQDILVKHINNDHIHANKKSFICRWEDCTRDEKPFKAQYMLVVH-MRRHT 280 (486)
Q Consensus 219 ~~~~C~~C~k~F~~~~~L~~H~~~~~~H~~ek~~~C~~~~C~~C~k~F~~~~~L~~H-~r~Ht 280 (486)
+++ |-.|++.|.....|.+|++ .|-|+| ..| .|...+--.|..| |.+|.
T Consensus 10 kpw-cwycnrefddekiliqhqk-------akhfkc--hic---hkkl~sgpglsihcmqvhk 59 (341)
T KOG2893|consen 10 KPW-CWYCNREFDDEKILIQHQK-------AKHFKC--HIC---HKKLFSGPGLSIHCMQVHK 59 (341)
T ss_pred Cce-eeecccccchhhhhhhhhh-------hcccee--eee---hhhhccCCCceeehhhhhh
Confidence 444 7789999999999999986 355878 455 7777776677777 44553
No 66
>KOG2482|consensus
Probab=71.95 E-value=3.4 Score=41.38 Aligned_cols=50 Identities=14% Similarity=0.098 Sum_probs=36.7
Q ss_pred cccccccccccChhhHHHHHhh-cCC--------------------------CCccccCcccccccccChhhHHHHHHh
Q psy16294 339 IELNIRTELTLMSLNVQFEILS-RPK--------------------------LKPYVCGEENCTKRYTDPSSLRKHIKT 390 (486)
Q Consensus 339 ~~C~~C~k~F~~~~~L~~H~~~-H~~--------------------------ekpy~C~~~gCgK~F~~~s~L~~H~k~ 390 (486)
..|-+|....-+...|..||.+ |.- .+.-.|-. |.-.|.....|..||.-
T Consensus 280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~--cd~~F~~e~~l~~hm~e 356 (423)
T KOG2482|consen 280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAE--CDLSFWKEPGLLIHMVE 356 (423)
T ss_pred eEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhcccccc--ccccccCcchhhhhccc
Confidence 6899999999999999999986 531 01234555 77778888888877753
No 67
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=69.65 E-value=1.8 Score=31.54 Aligned_cols=26 Identities=23% Similarity=0.473 Sum_probs=22.5
Q ss_pred cccceecCCCCCCCCCHHHHHHhhhc
Q psy16294 217 VETNCHWKDCGLEFGTQDILVKHINN 242 (486)
Q Consensus 217 ~~~~~~C~~C~k~F~~~~~L~~H~~~ 242 (486)
+|..+.|+-||+.|..+..+.+|++.
T Consensus 14 GE~~lrCPRC~~~FR~~K~Y~RHVNK 39 (65)
T COG4049 14 GEEFLRCPRCGMVFRRRKDYIRHVNK 39 (65)
T ss_pred CceeeeCCchhHHHHHhHHHHHHhhH
Confidence 46678999999999999999999874
No 68
>KOG4173|consensus
Probab=69.40 E-value=1.2 Score=41.32 Aligned_cols=94 Identities=10% Similarity=0.109 Sum_probs=74.2
Q ss_pred cccccccccc--ccccccChhhHHHHHhh-cCCCCccccCcccccccccChhhHHHHHHhhc----------CCCCCCCC
Q psy16294 334 LVMRRIELNI--RTELTLMSLNVQFEILS-RPKLKPYVCGEENCTKRYTDPSSLRKHIKTVH----------APASLNPV 400 (486)
Q Consensus 334 ~~~~~~~C~~--C~k~F~~~~~L~~H~~~-H~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh----------~~~~~~c~ 400 (486)
.....|.|++ |...|..-..+..|..+ |+ -.|.+ |.+.|.+..-|..|+..-| |...|.|.
T Consensus 75 ~~~~~~~cqvagc~~~~d~lD~~E~hY~~~h~----~sCs~--C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~Cl 148 (253)
T KOG4173|consen 75 PRVPAFACQVAGCCQVFDALDDYEHHYHTLHG----NSCSF--CKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCL 148 (253)
T ss_pred cccccccccccchHHHHhhhhhHHHhhhhccc----chhHH--HHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHH
Confidence 3456678877 88899988888888876 44 37999 9999999999999986544 55678998
Q ss_pred CCCCcchhchhhhhhccccccccccccchhhhc
Q psy16294 401 PQHKDMQDIQSNEVTQSNMRQSTYQRTLEYVEQ 433 (486)
Q Consensus 401 ~c~~~~~~~~~~~~~~~h~~~h~~~~~~~c~~c 433 (486)
.-+|..++.......+|-.+.|.+--.|+...-
T Consensus 149 vEgCt~KFkT~r~RkdH~I~~Hk~Pa~frFdk~ 181 (253)
T KOG4173|consen 149 VEGCTEKFKTSRDRKDHMIRMHKYPADFRFDKP 181 (253)
T ss_pred HHhhhhhhhhhhhhhhHHHHhccCCcceeecCc
Confidence 778888888877778777888888877766555
No 69
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=66.35 E-value=3.7 Score=26.67 Aligned_cols=24 Identities=17% Similarity=0.218 Sum_probs=15.7
Q ss_pred cccccccccccChhhHHHHHhhcCCCCccccCcccccc
Q psy16294 339 IELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTK 376 (486)
Q Consensus 339 ~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK 376 (486)
|+|..||..+.-. +.++.|++ ||.
T Consensus 2 ~~C~~CGy~y~~~------------~~~~~CP~--Cg~ 25 (33)
T cd00350 2 YVCPVCGYIYDGE------------EAPWVCPV--CGA 25 (33)
T ss_pred EECCCCCCEECCC------------cCCCcCcC--CCC
Confidence 6777887765422 26778877 764
No 70
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=63.37 E-value=3.6 Score=29.98 Aligned_cols=32 Identities=13% Similarity=0.242 Sum_probs=24.6
Q ss_pred CCCCccccCcccccccccChhhHHHHHHhhcCCC
Q psy16294 362 PKLKPYVCGEENCTKRYTDPSSLRKHIKTVHAPA 395 (486)
Q Consensus 362 ~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~~~ 395 (486)
.||.-+.|+. ||+.|.+..+..+|+..-|+-.
T Consensus 13 DGE~~lrCPR--C~~~FR~~K~Y~RHVNKaH~~~ 44 (65)
T COG4049 13 DGEEFLRCPR--CGMVFRRRKDYIRHVNKAHGWL 44 (65)
T ss_pred CCceeeeCCc--hhHHHHHhHHHHHHhhHHhhhh
Confidence 5677788877 8888888888888887766643
No 71
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=60.12 E-value=2.6 Score=40.11 Aligned_cols=24 Identities=25% Similarity=0.437 Sum_probs=18.7
Q ss_pred ccceecCCCCCCCCCHHHHHHhhh
Q psy16294 218 ETNCHWKDCGLEFGTQDILVKHIN 241 (486)
Q Consensus 218 ~~~~~C~~C~k~F~~~~~L~~H~~ 241 (486)
++.+.|+.|++.|.++.-.....+
T Consensus 3 ~k~~~CPvC~~~F~~~~vrs~~~r 26 (214)
T PF09986_consen 3 DKKITCPVCGKEFKTKKVRSGKIR 26 (214)
T ss_pred CCceECCCCCCeeeeeEEEcCCce
Confidence 467889999999998876665554
No 72
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=56.51 E-value=6.6 Score=33.13 Aligned_cols=30 Identities=13% Similarity=0.178 Sum_probs=20.6
Q ss_pred cccccccccccChhhHHHHHhhcCCCCccccCcccccccccCh
Q psy16294 339 IELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDP 381 (486)
Q Consensus 339 ~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~ 381 (486)
..|+.||+.|-. .+..|-.|+. ||..|.-.
T Consensus 10 R~Cp~CG~kFYD-----------Lnk~PivCP~--CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCPSCGAKFYD-----------LNKDPIVCPK--CGTEFPPE 39 (108)
T ss_pred ccCCCCcchhcc-----------CCCCCccCCC--CCCccCcc
Confidence 457788877765 3346777877 88777654
No 73
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=55.69 E-value=7 Score=25.61 Aligned_cols=25 Identities=12% Similarity=0.050 Sum_probs=16.2
Q ss_pred ccccccccccccChhhHHHHHhhcCCCCccccCcccccc
Q psy16294 338 RIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTK 376 (486)
Q Consensus 338 ~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK 376 (486)
.|.|.+||..+.. .+.|..|++ ||.
T Consensus 2 ~~~C~~CG~i~~g------------~~~p~~CP~--Cg~ 26 (34)
T cd00729 2 VWVCPVCGYIHEG------------EEAPEKCPI--CGA 26 (34)
T ss_pred eEECCCCCCEeEC------------CcCCCcCcC--CCC
Confidence 4678888875532 225678877 875
No 74
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=55.08 E-value=10 Score=26.07 Aligned_cols=26 Identities=31% Similarity=0.460 Sum_probs=15.5
Q ss_pred CCccccCcccccccccCh----hhHHHHHHhh
Q psy16294 364 LKPYVCGEENCTKRYTDP----SSLRKHIKTV 391 (486)
Q Consensus 364 ekpy~C~~~gCgK~F~~~----s~L~~H~k~h 391 (486)
..-..|.. |++.+... ++|.+|++..
T Consensus 14 ~~~a~C~~--C~~~~~~~~~~ts~l~~HL~~~ 43 (45)
T PF02892_consen 14 KKKAKCKY--CGKVIKYSSGGTSNLKRHLKKK 43 (45)
T ss_dssp SS-EEETT--TTEE-----SSTHHHHHHHHHT
T ss_pred cCeEEeCC--CCeEEeeCCCcHHHHHHhhhhh
Confidence 45577887 88887764 7888888543
No 75
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=53.70 E-value=3.7 Score=31.91 Aligned_cols=42 Identities=10% Similarity=0.097 Sum_probs=23.8
Q ss_pred cccccccccccChhhHHHHHhhcCCCCccccCcccccccccChh
Q psy16294 339 IELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDPS 382 (486)
Q Consensus 339 ~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~s 382 (486)
+.|+.|+..-.....-..+.. ..++-++|....||.+|....
T Consensus 2 m~CP~Cg~~a~irtSr~~s~~--~~~~Y~qC~N~eCg~tF~t~e 43 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITDT--TKERYHQCQNVNCSATFITYE 43 (72)
T ss_pred ccCCCCCCccEEEEChhcChh--hheeeeecCCCCCCCEEEEEE
Confidence 467888776533322222211 345668887445888887644
No 76
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=53.58 E-value=5.3 Score=35.95 Aligned_cols=16 Identities=31% Similarity=0.569 Sum_probs=9.3
Q ss_pred cccCcccccccccChhhH
Q psy16294 367 YVCGEENCTKRYTDPSSL 384 (486)
Q Consensus 367 y~C~~~gCgK~F~~~s~L 384 (486)
|+|+. ||++|.+...+
T Consensus 29 ~~c~~--c~~~f~~~e~~ 44 (154)
T PRK00464 29 RECLA--CGKRFTTFERV 44 (154)
T ss_pred eeccc--cCCcceEeEec
Confidence 66665 66666654433
No 77
>KOG2186|consensus
Probab=52.94 E-value=7.1 Score=37.63 Aligned_cols=57 Identities=16% Similarity=0.237 Sum_probs=45.4
Q ss_pred ccccccccccccChhhHHHHHhhcCCCCccccCcccccccccChhhHHHHHHhhcCCCCCCC
Q psy16294 338 RIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDPSSLRKHIKTVHAPASLNP 399 (486)
Q Consensus 338 ~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~~~~~~c 399 (486)
.|.|..||..... ..+.+|+-.-.+ .-|.|-. |++.|.+ .+.+.|.+.-+....|.-
T Consensus 3 ~FtCnvCgEsvKK-p~vekH~srCrn-~~fSCID--C~k~F~~-~sYknH~kCITEaQKYg~ 59 (276)
T KOG2186|consen 3 FFTCNVCGESVKK-PQVEKHMSRCRN-AYFSCID--CGKTFER-VSYKNHTKCITEAQKYGK 59 (276)
T ss_pred EEehhhhhhhccc-cchHHHHHhccC-CeeEEee--ccccccc-chhhhhhhhcchHHHhhh
Confidence 4789999998764 467779988666 6799987 9999999 788999988776665643
No 78
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=49.11 E-value=12 Score=31.34 Aligned_cols=25 Identities=28% Similarity=0.583 Sum_probs=21.6
Q ss_pred ccc----CcccccccccChhhHHHHHHhhcC
Q psy16294 367 YVC----GEENCTKRYTDPSSLRKHIKTVHA 393 (486)
Q Consensus 367 y~C----~~~gCgK~F~~~s~L~~H~k~hh~ 393 (486)
|.| .. |+..+.+...+++|.+.+||
T Consensus 81 ~~C~~~~~~--C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPH--CGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCC--CCcEeccHHHHHHHHHHhcC
Confidence 888 66 99999999999999988885
No 79
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=49.07 E-value=4.3 Score=36.52 Aligned_cols=12 Identities=8% Similarity=-0.344 Sum_probs=6.1
Q ss_pred cccccccccccC
Q psy16294 339 IELNIRTELTLM 350 (486)
Q Consensus 339 ~~C~~C~k~F~~ 350 (486)
++|+.||++|.+
T Consensus 29 ~~c~~c~~~f~~ 40 (154)
T PRK00464 29 RECLACGKRFTT 40 (154)
T ss_pred eeccccCCcceE
Confidence 455555555544
No 80
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=46.75 E-value=6.6 Score=30.56 Aligned_cols=34 Identities=18% Similarity=0.426 Sum_probs=22.4
Q ss_pred cceecCCCCCCCCCHHHHHHhhhcccccCCCCcccccCCCCCCCCCcCC
Q psy16294 219 TNCHWKDCGLEFGTQDILVKHINNDHIHANKKSFICRWEDCTRDEKPFK 267 (486)
Q Consensus 219 ~~~~C~~C~k~F~~~~~L~~H~~~~~~H~~ek~~~C~~~~C~~C~k~F~ 267 (486)
..|+|..|+..| .+.+||+ +.|+ ..|+.|+-.|+
T Consensus 11 Y~Y~c~~cg~~~----dvvq~~~-------ddpl----t~ce~c~a~~k 44 (82)
T COG2331 11 YSYECTECGNRF----DVVQAMT-------DDPL----TTCEECGARLK 44 (82)
T ss_pred eEEeecccchHH----HHHHhcc-------cCcc----ccChhhChHHH
Confidence 568999998766 5777774 3444 45666676543
No 81
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=45.55 E-value=5.8 Score=37.75 Aligned_cols=44 Identities=14% Similarity=0.157 Sum_probs=31.0
Q ss_pred ccccccccccccccChhhHHHHHhh---cC-------CCCc-----cccCcccccccccCh
Q psy16294 336 MRRIELNIRTELTLMSLNVQFEILS---RP-------KLKP-----YVCGEENCTKRYTDP 381 (486)
Q Consensus 336 ~~~~~C~~C~k~F~~~~~L~~H~~~---H~-------~ekp-----y~C~~~gCgK~F~~~ 381 (486)
.+.+.|++|++.|.++.-.....+. .+ +..| ..|+. ||.+|...
T Consensus 3 ~k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~--CgyA~~~~ 61 (214)
T PF09986_consen 3 DKKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPH--CGYAAFEE 61 (214)
T ss_pred CCceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCC--CCCccccc
Confidence 4568899999999988666555542 11 2333 58988 99998754
No 82
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=44.29 E-value=18 Score=25.73 Aligned_cols=22 Identities=32% Similarity=0.720 Sum_probs=15.2
Q ss_pred cccCcccccccccCh-----hhHHHHHHh
Q psy16294 367 YVCGEENCTKRYTDP-----SSLRKHIKT 390 (486)
Q Consensus 367 y~C~~~gCgK~F~~~-----s~L~~H~k~ 390 (486)
-.|.. |++.+... ++|.+|++.
T Consensus 19 a~C~~--C~~~l~~~~~~gTs~L~rHl~~ 45 (50)
T smart00614 19 AKCKY--CGKKLSRSSKGGTSNLRRHLRR 45 (50)
T ss_pred EEecC--CCCEeeeCCCCCcHHHHHHHHh
Confidence 46766 77777654 588888873
No 83
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=43.51 E-value=15 Score=31.81 Aligned_cols=34 Identities=9% Similarity=-0.017 Sum_probs=24.7
Q ss_pred ccccccccccccChhhHHHHHhhcCCCCccccCcccccccccChhhH
Q psy16294 338 RIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDPSSL 384 (486)
Q Consensus 338 ~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~s~L 384 (486)
...|+.||+.|-. .+..|-.|+. ||..|.-...+
T Consensus 9 Kr~Cp~cg~kFYD-----------Lnk~p~vcP~--cg~~~~~~~~~ 42 (129)
T TIGR02300 9 KRICPNTGSKFYD-----------LNRRPAVSPY--TGEQFPPEEAL 42 (129)
T ss_pred cccCCCcCccccc-----------cCCCCccCCC--cCCccCcchhh
Confidence 3568899988876 3457888988 99888665343
No 84
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=43.21 E-value=13 Score=32.49 Aligned_cols=26 Identities=31% Similarity=0.746 Sum_probs=16.8
Q ss_pred CccccCcccccccccChhhHHHHHHhhcCCC
Q psy16294 365 KPYVCGEENCTKRYTDPSSLRKHIKTVHAPA 395 (486)
Q Consensus 365 kpy~C~~~gCgK~F~~~s~L~~H~k~hh~~~ 395 (486)
.-..|-+ |||.|+. |++|+++|||-.
T Consensus 71 d~i~cle--cGk~~k~---LkrHL~~~~glt 96 (132)
T PF05443_consen 71 DYIICLE--CGKKFKT---LKRHLRTHHGLT 96 (132)
T ss_dssp S-EE-TB--T--EESB---HHHHHHHTT-S-
T ss_pred CeeEEcc--CCcccch---HHHHHHHccCCC
Confidence 4478988 9999985 599999998864
No 85
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=42.17 E-value=14 Score=33.65 Aligned_cols=23 Identities=17% Similarity=0.125 Sum_probs=19.5
Q ss_pred ccccccccccccChhhHHHHHhhcCCCCccccCccccc
Q psy16294 338 RIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCT 375 (486)
Q Consensus 338 ~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCg 375 (486)
.|.|++||..+ .|+-|-+|++ ||
T Consensus 134 ~~vC~vCGy~~-------------~ge~P~~CPi--Cg 156 (166)
T COG1592 134 VWVCPVCGYTH-------------EGEAPEVCPI--CG 156 (166)
T ss_pred EEEcCCCCCcc-------------cCCCCCcCCC--CC
Confidence 69999999754 6788999999 98
No 86
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=41.75 E-value=30 Score=29.35 Aligned_cols=22 Identities=14% Similarity=0.015 Sum_probs=13.8
Q ss_pred ccccccccccccccChhhHHHH
Q psy16294 336 MRRIELNIRTELTLMSLNVQFE 357 (486)
Q Consensus 336 ~~~~~C~~C~k~F~~~~~L~~H 357 (486)
+-|..|++|+-+.....+|.+-
T Consensus 13 ~LP~~CpiCgLtLVss~HLARS 34 (112)
T TIGR00622 13 ELPVECPICGLTLILSTHLARS 34 (112)
T ss_pred CCCCcCCcCCCEEeccchHHHh
Confidence 4456677777766666666543
No 87
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=39.54 E-value=23 Score=34.33 Aligned_cols=101 Identities=18% Similarity=0.208 Sum_probs=54.7
Q ss_pred hcCCCCccCCCCCCCCcchhhhhceecCccccCCccccccccccccchhhhhhhhccccccccccccccccChhhHHHHH
Q psy16294 279 HTGEKPHKCTDLEAFDTSSRVAARRIRGWKISRRISDLIPVRNRTRVNILAVVKRLVMRRIELNIRTELTLMSLNVQFEI 358 (486)
Q Consensus 279 Htgekpy~C~~C~c~~~~~~~~~~~H~~ek~F~~~~~L~~H~~~~H~~~~~~~~~~~~~~~~C~~C~k~F~~~~~L~~H~ 358 (486)
.+|-+.|+|.+|.. |.-..+-..|+ +.+.......|+|.-|++.
T Consensus 137 ~hGGrif~CsfC~~-----------------flCEDDQFEHQ--------AsCQvLe~E~~KC~SCNrl----------- 180 (314)
T PF06524_consen 137 DHGGRIFKCSFCDN-----------------FLCEDDQFEHQ--------ASCQVLESETFKCQSCNRL----------- 180 (314)
T ss_pred cCCCeEEEeecCCC-----------------eeeccchhhhh--------hhhhhhhcccccccccccc-----------
Confidence 34668899998862 22233333344 2233445667888888762
Q ss_pred hhcCCCCccccCcccccccccChhhHHHHHHhhcCCCCCCCCCCCCcchhchhhhhhcccccccccccc
Q psy16294 359 LSRPKLKPYVCGEENCTKRYTDPSSLRKHIKTVHAPASLNPVPQHKDMQDIQSNEVTQSNMRQSTYQRT 427 (486)
Q Consensus 359 ~~H~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~~~~~~c~~c~~~~~~~~~~~~~~~h~~~h~~~~~ 427 (486)
..|.|-. |--.|-.-...++=.+. ...+++.|+.|+-..+.... +-...|.|.+.+.
T Consensus 181 ------Gq~sCLR--CK~cfCddHvrrKg~ky-~k~k~~PCPKCg~et~eTkd---LSmStR~hkyGRQ 237 (314)
T PF06524_consen 181 ------GQYSCLR--CKICFCDDHVRRKGFKY-EKGKPIPCPKCGYETQETKD---LSMSTRSHKYGRQ 237 (314)
T ss_pred ------cchhhhh--eeeeehhhhhhhccccc-ccCCCCCCCCCCCccccccc---ceeeeecchhccc
Confidence 2466655 65555543332222222 34477888877776665555 2233455555443
No 88
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=38.78 E-value=17 Score=25.29 Aligned_cols=30 Identities=10% Similarity=-0.044 Sum_probs=18.2
Q ss_pred ccccccccccccChhhHHHHHhhcCCCCccccCccccccccc
Q psy16294 338 RIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYT 379 (486)
Q Consensus 338 ~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~ 379 (486)
.|.|+.||..|.... ....+.|+. ||..+.
T Consensus 3 ~y~C~~CG~~~~~~~----------~~~~~~Cp~--CG~~~~ 32 (46)
T PRK00398 3 EYKCARCGREVELDE----------YGTGVRCPY--CGYRIL 32 (46)
T ss_pred EEECCCCCCEEEECC----------CCCceECCC--CCCeEE
Confidence 467777777665431 112577777 776554
No 89
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=38.23 E-value=16 Score=33.27 Aligned_cols=13 Identities=8% Similarity=-0.404 Sum_probs=10.8
Q ss_pred ccccccccccccc
Q psy16294 334 LVMRRIELNIRTE 346 (486)
Q Consensus 334 ~~~~~~~C~~C~k 346 (486)
.++-|-.||+||.
T Consensus 145 ~ge~P~~CPiCga 157 (166)
T COG1592 145 EGEAPEVCPICGA 157 (166)
T ss_pred cCCCCCcCCCCCC
Confidence 4588999999984
No 90
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=37.71 E-value=16 Score=29.32 Aligned_cols=34 Identities=9% Similarity=0.143 Sum_probs=24.2
Q ss_pred ccccccccccccccChhhHHHHHhhcCCCCccccCcccccccccCh
Q psy16294 336 MRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDP 381 (486)
Q Consensus 336 ~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~ 381 (486)
...|.|+.|++.-.. | ....-+.|.. ||..|.--
T Consensus 33 ~~~~~Cp~C~~~~Vk--------R--~a~GIW~C~k--Cg~~fAGg 66 (89)
T COG1997 33 RAKHVCPFCGRTTVK--------R--IATGIWKCRK--CGAKFAGG 66 (89)
T ss_pred hcCCcCCCCCCccee--------e--eccCeEEcCC--CCCeeccc
Confidence 456899999986322 2 3345699998 99999753
No 91
>PF07800 DUF1644: Protein of unknown function (DUF1644); InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain.
Probab=37.71 E-value=22 Score=31.93 Aligned_cols=57 Identities=19% Similarity=0.131 Sum_probs=37.5
Q ss_pred ccccccccccccccChhhHHHHHhhcCCCCccccCcccccccccChhhHHHHHHhhcCC
Q psy16294 336 MRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDPSSLRKHIKTVHAP 394 (486)
Q Consensus 336 ~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~~ 394 (486)
.....|+.|.-.-.-..-. .--|.+.+.|+-.|...+|...= +...|++|.|..|..
T Consensus 78 ~~~L~CPLCRG~V~GWtvv-e~AR~~LN~K~RsC~~e~C~F~G-tY~eLrKHar~~HP~ 134 (162)
T PF07800_consen 78 QPELACPLCRGEVKGWTVV-EPARRFLNAKKRSCSQESCSFSG-TYSELRKHARSEHPS 134 (162)
T ss_pred cccccCccccCceeceEEc-hHHHHHhccCCccCccccccccc-CHHHHHHHHHhhCCC
Confidence 3457799985432222222 23566677889999987786443 567899999987754
No 92
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=36.95 E-value=14 Score=35.15 Aligned_cols=31 Identities=29% Similarity=0.529 Sum_probs=23.3
Q ss_pred CCCccccCcccccccccChhhHHHHHHhhcCCC
Q psy16294 363 KLKPYVCGEENCTKRYTDPSSLRKHIKTVHAPA 395 (486)
Q Consensus 363 ~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~~~ 395 (486)
++..|.|.. |+|.|.-..-.++|+..-|.++
T Consensus 74 ~~~K~~C~l--c~KlFkg~eFV~KHI~nKH~e~ 104 (214)
T PF04959_consen 74 DEDKWRCPL--CGKLFKGPEFVRKHIFNKHPEK 104 (214)
T ss_dssp SSEEEEE-S--SS-EESSHHHHHHHHHHH-HHH
T ss_pred cCCEECCCC--CCcccCChHHHHHHHhhcCHHH
Confidence 455699988 9999999999999998888754
No 93
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=34.01 E-value=25 Score=26.17 Aligned_cols=33 Identities=15% Similarity=0.170 Sum_probs=18.4
Q ss_pred cccccccccccccChhhHHHHHhhcCCCCccccCcccccc
Q psy16294 337 RRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTK 376 (486)
Q Consensus 337 ~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK 376 (486)
-.|.|+.||+.-..+..--+ .-..+|.|+. ||.
T Consensus 26 v~F~CPnCGe~~I~Rc~~CR-----k~g~~Y~Cp~--CGF 58 (61)
T COG2888 26 VKFPCPNCGEVEIYRCAKCR-----KLGNPYRCPK--CGF 58 (61)
T ss_pred eEeeCCCCCceeeehhhhHH-----HcCCceECCC--cCc
Confidence 35778888865544332111 1125788876 773
No 94
>KOG2785|consensus
Probab=33.12 E-value=31 Score=35.37 Aligned_cols=50 Identities=16% Similarity=0.193 Sum_probs=0.0
Q ss_pred cccccccccccChhhHHHHHhh--cC-----------------------------------CCCccccCcccccccccCh
Q psy16294 339 IELNIRTELTLMSLNVQFEILS--RP-----------------------------------KLKPYVCGEENCTKRYTDP 381 (486)
Q Consensus 339 ~~C~~C~k~F~~~~~L~~H~~~--H~-----------------------------------~ekpy~C~~~gCgK~F~~~ 381 (486)
|.|..|...|.....-+.|+++ |. .+-++.|.+ |.|.|...
T Consensus 4 ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~--c~k~~~s~ 81 (390)
T KOG2785|consen 4 FTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEA--CNKSFASP 81 (390)
T ss_pred ceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHH--hhccccCh
Q ss_pred hhHHHHHHh
Q psy16294 382 SSLRKHIKT 390 (486)
Q Consensus 382 s~L~~H~k~ 390 (486)
....+|++.
T Consensus 82 ~a~~~hl~S 90 (390)
T KOG2785|consen 82 KAHENHLKS 90 (390)
T ss_pred hhHHHHHHH
No 95
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=32.98 E-value=17 Score=29.55 Aligned_cols=34 Identities=9% Similarity=0.134 Sum_probs=23.9
Q ss_pred cccccccccccccccChhhHHHHHhhcCCCCccccCcccccccccC
Q psy16294 335 VMRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTD 380 (486)
Q Consensus 335 ~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~ 380 (486)
....|.|++|++.-..+ .....+.|.. |++.|.-
T Consensus 33 q~a~y~CpfCgk~~vkR----------~a~GIW~C~~--C~~~~AG 66 (90)
T PTZ00255 33 QHAKYFCPFCGKHAVKR----------QAVGIWRCKG--CKKTVAG 66 (90)
T ss_pred HhCCccCCCCCCCceee----------eeeEEEEcCC--CCCEEeC
Confidence 34578999998643322 2345699988 9999874
No 96
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=32.76 E-value=47 Score=29.97 Aligned_cols=43 Identities=9% Similarity=0.132 Sum_probs=28.6
Q ss_pred hcCCCCccccCcccccccccChhhHHHHHHhhcCCCCCCCCCCCCcchhchhhh
Q psy16294 360 SRPKLKPYVCGEENCTKRYTDPSSLRKHIKTVHAPASLNPVPQHKDMQDIQSNE 413 (486)
Q Consensus 360 ~H~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~~~~~~c~~c~~~~~~~~~~~ 413 (486)
...+..-|.|+. |+.+|+....+. .-|.|+.|+..+....+..
T Consensus 103 ~e~~~~~Y~Cp~--c~~r~tf~eA~~---------~~F~Cp~Cg~~L~~~dn~~ 145 (158)
T TIGR00373 103 FETNNMFFICPN--MCVRFTFNEAME---------LNFTCPRCGAMLDYLDNSE 145 (158)
T ss_pred hccCCCeEECCC--CCcEeeHHHHHH---------cCCcCCCCCCEeeeccCHH
Confidence 334456688877 888887777764 2577887777766555533
No 97
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=32.70 E-value=37 Score=28.63 Aligned_cols=10 Identities=30% Similarity=0.716 Sum_probs=5.4
Q ss_pred eecCCCCCCC
Q psy16294 221 CHWKDCGLEF 230 (486)
Q Consensus 221 ~~C~~C~k~F 230 (486)
..|..||+.|
T Consensus 10 R~Cp~CG~kF 19 (108)
T PF09538_consen 10 RTCPSCGAKF 19 (108)
T ss_pred ccCCCCcchh
Confidence 3455555555
No 98
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=32.08 E-value=25 Score=30.65 Aligned_cols=25 Identities=28% Similarity=0.506 Sum_probs=20.4
Q ss_pred cccCcccccccccChhhHHHHHHhhcCCCC
Q psy16294 367 YVCGEENCTKRYTDPSSLRKHIKTVHAPAS 396 (486)
Q Consensus 367 y~C~~~gCgK~F~~~s~L~~H~k~hh~~~~ 396 (486)
..|-+ |||.|+ +|++|+.+|++-.|
T Consensus 77 IicLE--DGkkfK---SLKRHL~t~~gmTP 101 (148)
T COG4957 77 IICLE--DGKKFK---SLKRHLTTHYGLTP 101 (148)
T ss_pred EEEec--cCcchH---HHHHHHhcccCCCH
Confidence 57888 999996 68999999987543
No 99
>KOG4167|consensus
Probab=31.59 E-value=41 Score=37.35 Aligned_cols=27 Identities=19% Similarity=0.256 Sum_probs=23.1
Q ss_pred ccceecCCCCCCCCCHHHHHHhhhcccccC
Q psy16294 218 ETNCHWKDCGLEFGTQDILVKHINNDHIHA 247 (486)
Q Consensus 218 ~~~~~C~~C~k~F~~~~~L~~H~~~~~~H~ 247 (486)
...|.|.+|+|.|.....+..||+ .|.
T Consensus 790 ~giFpCreC~kvF~KiKSrNAHMK---~Hr 816 (907)
T KOG4167|consen 790 TGIFPCRECGKVFFKIKSRNAHMK---THR 816 (907)
T ss_pred CceeehHHHHHHHHHHhhhhHHHH---HHH
Confidence 457999999999999999999998 564
No 100
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=30.39 E-value=17 Score=29.54 Aligned_cols=34 Identities=9% Similarity=0.213 Sum_probs=23.7
Q ss_pred cccccccccccccccChhhHHHHHhhcCCCCccccCcccccccccC
Q psy16294 335 VMRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTD 380 (486)
Q Consensus 335 ~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~ 380 (486)
....|.|++|++.-..+ .....+.|.. |++.|.-
T Consensus 32 q~a~y~CpfCgk~~vkR----------~a~GIW~C~~--C~~~~AG 65 (91)
T TIGR00280 32 QKAKYVCPFCGKKTVKR----------GSTGIWTCRK--CGAKFAG 65 (91)
T ss_pred HhcCccCCCCCCCceEE----------EeeEEEEcCC--CCCEEeC
Confidence 34578999998643222 2335699998 9999964
No 101
>KOG4167|consensus
Probab=29.95 E-value=15 Score=40.59 Aligned_cols=26 Identities=31% Similarity=0.511 Sum_probs=23.7
Q ss_pred ccccCcccccccccChhhHHHHHHhhcC
Q psy16294 366 PYVCGEENCTKRYTDPSSLRKHIKTVHA 393 (486)
Q Consensus 366 py~C~~~gCgK~F~~~s~L~~H~k~hh~ 393 (486)
.|.|.. |+|.|-.-.++..|||+|.-
T Consensus 792 iFpCre--C~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 792 IFPCRE--CGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred eeehHH--HHHHHHHHhhhhHHHHHHHH
Confidence 499999 99999999999999999863
No 102
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=29.76 E-value=23 Score=31.53 Aligned_cols=13 Identities=0% Similarity=-0.309 Sum_probs=6.2
Q ss_pred ccccccccccccC
Q psy16294 338 RIELNIRTELTLM 350 (486)
Q Consensus 338 ~~~C~~C~k~F~~ 350 (486)
.|.|+.|+..|..
T Consensus 99 ~Y~Cp~C~~~y~~ 111 (147)
T smart00531 99 YYKCPNCQSKYTF 111 (147)
T ss_pred EEECcCCCCEeeH
Confidence 3445555544443
No 103
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=29.49 E-value=27 Score=23.09 Aligned_cols=34 Identities=12% Similarity=0.036 Sum_probs=21.1
Q ss_pred cccccccccccChhhHHHHHhhcCCCCccccCccccccccc
Q psy16294 339 IELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYT 379 (486)
Q Consensus 339 ~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~ 379 (486)
+.|+.|+..|.-...... .......|+. |+..|.
T Consensus 3 ~~CP~C~~~~~v~~~~~~-----~~~~~v~C~~--C~~~~~ 36 (38)
T TIGR02098 3 IQCPNCKTSFRVVDSQLG-----ANGGKVRCGK--CGHVWY 36 (38)
T ss_pred EECCCCCCEEEeCHHHcC-----CCCCEEECCC--CCCEEE
Confidence 568888887766543211 1223578887 888774
No 104
>PF01780 Ribosomal_L37ae: Ribosomal L37ae protein family; InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=29.03 E-value=16 Score=29.67 Aligned_cols=34 Identities=9% Similarity=0.240 Sum_probs=22.9
Q ss_pred cccccccccccccccChhhHHHHHhhcCCCCccccCcccccccccC
Q psy16294 335 VMRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTD 380 (486)
Q Consensus 335 ~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~ 380 (486)
....|.|++|++.-..+ ...--+.|.. |++.|.-
T Consensus 32 q~~ky~Cp~Cgk~~vkR----------~a~GIW~C~~--C~~~~AG 65 (90)
T PF01780_consen 32 QHAKYTCPFCGKTSVKR----------VATGIWKCKK--CGKKFAG 65 (90)
T ss_dssp HHS-BEESSSSSSEEEE----------EETTEEEETT--TTEEEE-
T ss_pred HhCCCcCCCCCCceeEE----------eeeEEeecCC--CCCEEeC
Confidence 44678999999864332 2224599998 9999863
No 105
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=28.58 E-value=29 Score=32.97 Aligned_cols=25 Identities=32% Similarity=0.449 Sum_probs=19.9
Q ss_pred ccceecCCCCCCCCCHHHHHHhhhc
Q psy16294 218 ETNCHWKDCGLEFGTQDILVKHINN 242 (486)
Q Consensus 218 ~~~~~C~~C~k~F~~~~~L~~H~~~ 242 (486)
+..|.|..|+|.|.-..-.+.|+.+
T Consensus 75 ~~K~~C~lc~KlFkg~eFV~KHI~n 99 (214)
T PF04959_consen 75 EDKWRCPLCGKLFKGPEFVRKHIFN 99 (214)
T ss_dssp SEEEEE-SSS-EESSHHHHHHHHHH
T ss_pred CCEECCCCCCcccCChHHHHHHHhh
Confidence 4579999999999999999999874
No 106
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=27.97 E-value=39 Score=28.32 Aligned_cols=33 Identities=15% Similarity=-0.059 Sum_probs=24.8
Q ss_pred cccccccccccChhhHHHHHhhcCCCCccccCcccccccccChhhHH
Q psy16294 339 IELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTDPSSLR 385 (486)
Q Consensus 339 ~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~s~L~ 385 (486)
-.|+.|++.|-. .+..|..|+. |||.|. .+.|.
T Consensus 10 ridPetg~KFYD-----------LNrdPiVsPy--tG~s~P-~s~fe 42 (129)
T COG4530 10 RIDPETGKKFYD-----------LNRDPIVSPY--TGKSYP-RSYFE 42 (129)
T ss_pred ccCccccchhhc-----------cCCCccccCc--ccccch-HHHHH
Confidence 368899998865 4558999988 999994 44443
No 107
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=26.32 E-value=35 Score=24.84 Aligned_cols=42 Identities=10% Similarity=0.107 Sum_probs=18.7
Q ss_pred cccccc-ccccccChhhHHHHHhhcCCCCccccCc--cccccccc
Q psy16294 338 RIELNI-RTELTLMSLNVQFEILSRPKLKPYVCGE--ENCTKRYT 379 (486)
Q Consensus 338 ~~~C~~-C~k~F~~~~~L~~H~~~H~~ekpy~C~~--~gCgK~F~ 379 (486)
...|+. |+..-..+..|..|+...-..++..|.. .||+..+.
T Consensus 9 ~v~C~~~cc~~~i~r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~ 53 (60)
T PF02176_consen 9 PVPCPNGCCNEMIPRKELDDHLENECPKRPVPCPYSPYGCKERVP 53 (60)
T ss_dssp EEE-TT--S-BEEECCCHHHHHHTTSTTSEEE-SS----S--EEE
T ss_pred EeeCCCCCcccceeHHHHHHHHHccCCCCcEECCCCCCCCCCccc
Confidence 445665 3322223556777776555556666655 45666554
No 108
>PRK03976 rpl37ae 50S ribosomal protein L37Ae; Reviewed
Probab=25.85 E-value=22 Score=28.91 Aligned_cols=34 Identities=6% Similarity=0.180 Sum_probs=23.5
Q ss_pred cccccccccccccccChhhHHHHHhhcCCCCccccCcccccccccC
Q psy16294 335 VMRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTD 380 (486)
Q Consensus 335 ~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~ 380 (486)
....|.|++|++.-..+ .....+.|.. |++.|.-
T Consensus 33 q~a~y~CpfCgk~~vkR----------~a~GIW~C~~--C~~~~AG 66 (90)
T PRK03976 33 MRAKHVCPVCGRPKVKR----------VGTGIWECRK--CGAKFAG 66 (90)
T ss_pred HhcCccCCCCCCCceEE----------EEEEEEEcCC--CCCEEeC
Confidence 34578999997643322 2335699988 9999864
No 109
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=25.60 E-value=68 Score=29.59 Aligned_cols=38 Identities=11% Similarity=0.242 Sum_probs=24.4
Q ss_pred CCCccccCcccccccccChhhHHHHHHhhcCCCCCCCCCCCCcchhchh
Q psy16294 363 KLKPYVCGEENCTKRYTDPSSLRKHIKTVHAPASLNPVPQHKDMQDIQS 411 (486)
Q Consensus 363 ~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~~~~~~c~~c~~~~~~~~~ 411 (486)
+..-|.|+. |+++|+....+. .-|.|+.|+..+....+
T Consensus 114 ~~~~Y~Cp~--C~~rytf~eA~~---------~~F~Cp~Cg~~L~~~dn 151 (178)
T PRK06266 114 NNMFFFCPN--CHIRFTFDEAME---------YGFRCPQCGEMLEEYDN 151 (178)
T ss_pred CCCEEECCC--CCcEEeHHHHhh---------cCCcCCCCCCCCeeccc
Confidence 345577766 777777766652 35777777776655544
No 110
>PHA00626 hypothetical protein
Probab=25.48 E-value=31 Score=25.35 Aligned_cols=15 Identities=27% Similarity=0.674 Sum_probs=12.3
Q ss_pred CccccCcccccccccCh
Q psy16294 365 KPYVCGEENCTKRYTDP 381 (486)
Q Consensus 365 kpy~C~~~gCgK~F~~~ 381 (486)
..|+|+. ||..|+..
T Consensus 22 nrYkCkd--CGY~ft~~ 36 (59)
T PHA00626 22 DDYVCCD--CGYNDSKD 36 (59)
T ss_pred cceEcCC--CCCeechh
Confidence 5699988 99999754
No 111
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=25.47 E-value=47 Score=37.54 Aligned_cols=38 Identities=21% Similarity=0.376 Sum_probs=0.0
Q ss_pred ccCCCCCCCCcchhhhhceecCccccCCccccccccccccchhhhhhhhccccccccccccccccChhhHHHHHhhcCCC
Q psy16294 285 HKCTDLEAFDTSSRVAARRIRGWKISRRISDLIPVRNRTRVNILAVVKRLVMRRIELNIRTELTLMSLNVQFEILSRPKL 364 (486)
Q Consensus 285 y~C~~C~c~~~~~~~~~~~H~~ek~F~~~~~L~~H~~~~H~~~~~~~~~~~~~~~~C~~C~k~F~~~~~L~~H~~~H~~e 364 (486)
++|+.|+ ..|.-|+ ..+ ...|.+||. ...
T Consensus 445 ~~Cp~Cd----------------------~~lt~H~-~~~-------------~L~CH~Cg~---------------~~~ 473 (730)
T COG1198 445 AECPNCD----------------------SPLTLHK-ATG-------------QLRCHYCGY---------------QEP 473 (730)
T ss_pred ccCCCCC----------------------cceEEec-CCC-------------eeEeCCCCC---------------CCC
Q ss_pred CccccCccccc
Q psy16294 365 KPYVCGEENCT 375 (486)
Q Consensus 365 kpy~C~~~gCg 375 (486)
.|..|+. ||
T Consensus 474 ~p~~Cp~--Cg 482 (730)
T COG1198 474 IPQSCPE--CG 482 (730)
T ss_pred CCCCCCC--CC
No 112
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=25.36 E-value=40 Score=20.70 Aligned_cols=11 Identities=18% Similarity=0.646 Sum_probs=8.0
Q ss_pred cccCccccccccc
Q psy16294 367 YVCGEENCTKRYT 379 (486)
Q Consensus 367 y~C~~~gCgK~F~ 379 (486)
-.|+. ||..|.
T Consensus 15 ~~Cp~--CG~~F~ 25 (26)
T PF10571_consen 15 KFCPH--CGYDFE 25 (26)
T ss_pred CcCCC--CCCCCc
Confidence 46877 888774
No 113
>PRK04023 DNA polymerase II large subunit; Validated
Probab=24.59 E-value=65 Score=37.39 Aligned_cols=11 Identities=18% Similarity=0.519 Sum_probs=6.3
Q ss_pred ccccCcccccccc
Q psy16294 366 PYVCGEENCTKRY 378 (486)
Q Consensus 366 py~C~~~gCgK~F 378 (486)
+|.|+. ||..-
T Consensus 663 ~y~CPK--CG~El 673 (1121)
T PRK04023 663 EDECEK--CGREP 673 (1121)
T ss_pred CCcCCC--CCCCC
Confidence 466766 66443
No 114
>KOG4124|consensus
Probab=23.79 E-value=20 Score=36.18 Aligned_cols=25 Identities=24% Similarity=0.110 Sum_probs=19.7
Q ss_pred cccceecCCCCCCCCCHHHHHHhhhc
Q psy16294 217 VETNCHWKDCGLEFGTQDILVKHINN 242 (486)
Q Consensus 217 ~~~~~~C~~C~k~F~~~~~L~~H~~~ 242 (486)
..++|+|. |++.+.++..|..|-.+
T Consensus 210 ~~~p~k~~-~~~~~~T~~~l~~HS~N 234 (442)
T KOG4124|consen 210 TGTPKKMP-ESLVMDTSSPLSDHSMN 234 (442)
T ss_pred ccCCccCc-ccccccccchhhhcccc
Confidence 35678885 89999999999888643
No 115
>KOG2461|consensus
Probab=23.69 E-value=90 Score=32.67 Aligned_cols=46 Identities=4% Similarity=-0.259 Sum_probs=21.5
Q ss_pred hhHHHHHhhcCCCCccccCcccccccccChhhHHHHHHhhcCCCCCCC
Q psy16294 352 LNVQFEILSRPKLKPYVCGEENCTKRYTDPSSLRKHIKTVHAPASLNP 399 (486)
Q Consensus 352 ~~L~~H~~~H~~ekpy~C~~~gCgK~F~~~s~L~~H~k~hh~~~~~~c 399 (486)
..+..|...+++..++-+.+ +.+.+.....+..|...+.++..+.+
T Consensus 317 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (396)
T KOG2461|consen 317 LVLDQSEVPATVSVWTGETI--PVRTPAGQLIYTQSHSMEVAEPTDMA 362 (396)
T ss_pred cccccccccccccccCcCcc--cccccccccchhhhhhcccCCCCccc
Confidence 33344444445555544444 44444444455555544444444433
No 116
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=23.10 E-value=48 Score=22.39 Aligned_cols=12 Identities=17% Similarity=0.731 Sum_probs=7.4
Q ss_pred ccccCccccccccc
Q psy16294 366 PYVCGEENCTKRYT 379 (486)
Q Consensus 366 py~C~~~gCgK~F~ 379 (486)
||+|.. |++.|=
T Consensus 12 ~f~C~~--C~~~FC 23 (39)
T smart00154 12 GFKCRH--CGNLFC 23 (39)
T ss_pred CeECCc--cCCccc
Confidence 566666 666664
No 117
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=22.73 E-value=46 Score=22.09 Aligned_cols=34 Identities=9% Similarity=0.043 Sum_probs=21.4
Q ss_pred cccccccccccChhhHHHHHhhcCCCCccccCccccccccc
Q psy16294 339 IELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYT 379 (486)
Q Consensus 339 ~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~ 379 (486)
..|+.|+..|.-...- .-.+.+..+|.. |+..|.
T Consensus 3 i~CP~C~~~f~v~~~~-----l~~~~~~vrC~~--C~~~f~ 36 (37)
T PF13719_consen 3 ITCPNCQTRFRVPDDK-----LPAGGRKVRCPK--CGHVFR 36 (37)
T ss_pred EECCCCCceEEcCHHH-----cccCCcEEECCC--CCcEee
Confidence 4688888877655431 113345688887 887774
No 118
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=22.08 E-value=39 Score=31.14 Aligned_cols=34 Identities=6% Similarity=-0.000 Sum_probs=27.3
Q ss_pred ccccccccccccccChhhHHHHHhhcCCCCccccCcccccccccC
Q psy16294 336 MRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTD 380 (486)
Q Consensus 336 ~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~ 380 (486)
...|.|+.|+..|.....+. .-|.|+. ||.....
T Consensus 115 ~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~--Cg~~L~~ 148 (178)
T PRK06266 115 NMFFFCPNCHIRFTFDEAME---------YGFRCPQ--CGEMLEE 148 (178)
T ss_pred CCEEECCCCCcEEeHHHHhh---------cCCcCCC--CCCCCee
Confidence 45799999999999887764 3599999 9987654
No 119
>PF04606 Ogr_Delta: Ogr/Delta-like zinc finger; InterPro: IPR007684 This entry is represented by Bacteriophage P2, Ogr. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a viral family of phage zinc-binding transcriptional activators, which also contains cryptic members in some bacterial genomes []. The P4 phage delta protein contains two such domains attached covalently, while the P2 phage Ogr proteins possess one domain but function as dimers. All the members of this family have the following consensus sequence: C-X(2)-C-X(3)-A-(X)2-R-X(15)-C-X(4)-C-X(3)-F [].; GO: 0006355 regulation of transcription, DNA-dependent
Probab=22.04 E-value=21 Score=25.11 Aligned_cols=39 Identities=10% Similarity=0.133 Sum_probs=23.0
Q ss_pred ccccccccccChhhHHHHHhhcCCCCccccCcccccccccC
Q psy16294 340 ELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYTD 380 (486)
Q Consensus 340 ~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~~ 380 (486)
.|+.||..-.-+.....+.. ..+.-|+|..+.||.+|..
T Consensus 1 ~CP~Cg~~a~ir~S~~~s~~--~~~~Y~qC~N~~Cg~tfv~ 39 (47)
T PF04606_consen 1 RCPHCGSKARIRTSRQLSPL--TRELYCQCTNPECGHTFVA 39 (47)
T ss_pred CcCCCCCeeEEEEchhhCcc--eEEEEEEECCCcCCCEEEE
Confidence 37777765544433332211 2334588887789988864
No 120
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=22.02 E-value=41 Score=30.36 Aligned_cols=34 Identities=6% Similarity=-0.023 Sum_probs=27.7
Q ss_pred cccccccccccccccChhhHHHHHhhcCCCCccccCccccccccc
Q psy16294 335 VMRRIELNIRTELTLMSLNVQFEILSRPKLKPYVCGEENCTKRYT 379 (486)
Q Consensus 335 ~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekpy~C~~~gCgK~F~ 379 (486)
...-|.|+.|+..|....++. .-|.|+. ||....
T Consensus 106 ~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~--Cg~~L~ 139 (158)
T TIGR00373 106 NNMFFICPNMCVRFTFNEAME---------LNFTCPR--CGAMLD 139 (158)
T ss_pred CCCeEECCCCCcEeeHHHHHH---------cCCcCCC--CCCEee
Confidence 456799999999999888875 2599999 998754
No 121
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=21.77 E-value=54 Score=28.73 Aligned_cols=29 Identities=10% Similarity=-0.076 Sum_probs=18.0
Q ss_pred cccccccccccccccChhhHHHHHhhcCCCCc
Q psy16294 335 VMRRIELNIRTELTLMSLNVQFEILSRPKLKP 366 (486)
Q Consensus 335 ~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ekp 366 (486)
......|-+||+.|+. |++|++.|.|-.|
T Consensus 69 ~~d~i~clecGk~~k~---LkrHL~~~~gltp 97 (132)
T PF05443_consen 69 TPDYIICLECGKKFKT---LKRHLRTHHGLTP 97 (132)
T ss_dssp -SS-EE-TBT--EESB---HHHHHHHTT-S-H
T ss_pred ccCeeEEccCCcccch---HHHHHHHccCCCH
Confidence 3455789999999986 5999999987654
No 122
>COG0447 MenB Dihydroxynaphthoic acid synthase [Coenzyme metabolism]
Probab=20.87 E-value=40 Score=32.08 Aligned_cols=20 Identities=25% Similarity=0.605 Sum_probs=18.5
Q ss_pred hhhhhhhhcCCCcccccccCC
Q psy16294 79 ADIHTMIRFSPNSLVNSLVNS 99 (486)
Q Consensus 79 ~d~~~~ir~sp~sl~~~~~n~ 99 (486)
+|||.+||+=|.+.+ |.+||
T Consensus 106 LdlQrlIR~~PKpVi-A~V~G 125 (282)
T COG0447 106 LDLQRLIRTMPKPVI-AMVAG 125 (282)
T ss_pred hhHHHHHHhCCcceE-EEEee
Confidence 499999999999999 89887
No 123
>PF12907 zf-met2: Zinc-binding
Probab=20.51 E-value=62 Score=22.12 Aligned_cols=26 Identities=19% Similarity=0.603 Sum_probs=12.9
Q ss_pred cccCcccccccccC---hhhHHHHHHhhcCC
Q psy16294 367 YVCGEENCTKRYTD---PSSLRKHIKTVHAP 394 (486)
Q Consensus 367 y~C~~~gCgK~F~~---~s~L~~H~k~hh~~ 394 (486)
+.|.+ |-.+|.. ...|+.|...-|..
T Consensus 2 i~C~i--C~qtF~~t~~~~~L~eH~enKHpK 30 (40)
T PF12907_consen 2 IICKI--CRQTFMQTTNEPQLKEHAENKHPK 30 (40)
T ss_pred cCcHH--hhHHHHhcCCHHHHHHHHHccCCC
Confidence 34555 5544432 34466666544443
No 124
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=20.46 E-value=77 Score=19.32 Aligned_cols=19 Identities=16% Similarity=0.490 Sum_probs=13.1
Q ss_pred ccCcccccccccChhhHHHHHH
Q psy16294 368 VCGEENCTKRYTDPSSLRKHIK 389 (486)
Q Consensus 368 ~C~~~gCgK~F~~~s~L~~H~k 389 (486)
.|++ |++.+ ....+..|+.
T Consensus 3 ~CPi--C~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCPV--CFREV-PENLINSHLD 21 (26)
T ss_pred cCCC--CcCcc-cHHHHHHHHH
Confidence 5777 87777 5566777764
No 125
>COG1327 Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]
Probab=20.18 E-value=27 Score=31.09 Aligned_cols=16 Identities=31% Similarity=0.607 Sum_probs=12.3
Q ss_pred CccccCcccccccccChh
Q psy16294 365 KPYVCGEENCTKRYTDPS 382 (486)
Q Consensus 365 kpy~C~~~gCgK~F~~~s 382 (486)
+.-.|.. |+++|++..
T Consensus 27 RRReC~~--C~~RFTTfE 42 (156)
T COG1327 27 RRRECLE--CGERFTTFE 42 (156)
T ss_pred hhhcccc--cccccchhh
Confidence 4468988 999998754
No 126
>PRK03564 formate dehydrogenase accessory protein FdhE; Provisional
Probab=20.15 E-value=38 Score=34.09 Aligned_cols=13 Identities=23% Similarity=0.307 Sum_probs=7.7
Q ss_pred hcCCCCccCCCCC
Q psy16294 279 HTGEKPHKCTDLE 291 (486)
Q Consensus 279 Htgekpy~C~~C~ 291 (486)
..|.|-..|..|+
T Consensus 207 ~~G~RyL~CslC~ 219 (309)
T PRK03564 207 TQGLRYLHCNLCE 219 (309)
T ss_pred CCCceEEEcCCCC
Confidence 4455666666665
Done!