Query psy1773
Match_columns 359
No_of_seqs 371 out of 2402
Neff 10.6
Searched_HMMs 46136
Date Fri Aug 16 21:30:15 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1773.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1773hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3608|consensus 100.0 4.5E-31 9.7E-36 217.8 13.8 264 78-354 69-377 (467)
2 KOG3608|consensus 100.0 5.2E-29 1.1E-33 205.7 12.9 233 73-324 129-378 (467)
3 KOG1074|consensus 100.0 2.4E-29 5.3E-34 228.9 5.9 196 149-356 608-934 (958)
4 KOG2462|consensus 99.9 2.2E-28 4.7E-33 197.1 6.8 136 203-351 128-265 (279)
5 KOG2462|consensus 99.9 8.5E-27 1.8E-31 188.0 4.5 133 175-320 131-265 (279)
6 KOG3623|consensus 99.9 5.6E-22 1.2E-26 178.4 7.6 112 76-198 208-331 (1007)
7 KOG3623|consensus 99.8 4.4E-22 9.6E-27 179.1 5.0 80 264-351 893-972 (1007)
8 KOG1074|consensus 99.8 2E-20 4.3E-25 171.3 3.4 244 64-326 591-935 (958)
9 KOG3576|consensus 99.7 1.7E-17 3.7E-22 127.4 3.1 114 234-355 115-238 (267)
10 KOG3576|consensus 99.6 1.4E-16 2.9E-21 122.5 1.4 112 206-325 118-239 (267)
11 PHA00733 hypothetical protein 99.1 6.7E-11 1.4E-15 88.3 5.0 81 264-355 39-125 (128)
12 PLN03086 PRLI-interacting fact 99.1 7.1E-10 1.5E-14 101.7 10.3 98 176-289 455-562 (567)
13 PLN03086 PRLI-interacting fact 99.1 5.2E-10 1.1E-14 102.6 9.2 149 175-355 408-566 (567)
14 PHA00733 hypothetical protein 99.0 2.7E-10 5.9E-15 85.1 4.9 84 233-323 37-124 (128)
15 PHA02768 hypothetical protein; 98.8 2E-09 4.2E-14 65.8 2.2 44 296-346 5-48 (55)
16 PHA02768 hypothetical protein; 98.7 1E-08 2.2E-13 62.7 2.3 43 237-284 6-48 (55)
17 KOG3993|consensus 98.6 1.3E-08 2.8E-13 87.7 1.0 199 145-354 268-483 (500)
18 PF13465 zf-H2C2_2: Zinc-finge 98.6 4.2E-08 9.2E-13 51.1 2.5 26 313-341 1-26 (26)
19 KOG3993|consensus 98.4 4.3E-08 9.4E-13 84.5 -1.4 197 108-322 267-482 (500)
20 PF13465 zf-H2C2_2: Zinc-finge 98.4 2E-07 4.4E-12 48.5 1.6 26 95-127 1-26 (26)
21 PHA00616 hypothetical protein 98.3 4.1E-07 9E-12 52.8 1.7 34 296-332 1-34 (44)
22 PHA00616 hypothetical protein 98.2 5.4E-07 1.2E-11 52.3 1.6 31 327-359 1-31 (44)
23 PHA00732 hypothetical protein 98.2 1.2E-06 2.5E-11 59.2 3.0 49 265-323 1-49 (79)
24 PF05605 zf-Di19: Drought indu 98.1 4.4E-06 9.5E-11 52.4 4.5 52 296-354 2-54 (54)
25 PHA00732 hypothetical protein 98.0 3.3E-06 7.1E-11 57.1 2.5 45 236-288 1-45 (79)
26 PF05605 zf-Di19: Drought indu 98.0 6.4E-06 1.4E-10 51.6 3.0 50 266-322 3-53 (54)
27 PF00096 zf-C2H2: Zinc finger, 97.8 2.1E-05 4.5E-10 39.7 2.0 22 328-351 1-22 (23)
28 PF00096 zf-C2H2: Zinc finger, 97.7 1.8E-05 3.9E-10 39.9 1.4 23 297-321 1-23 (23)
29 COG5189 SFP1 Putative transcri 97.7 1.8E-05 3.9E-10 66.1 1.9 75 264-353 348-422 (423)
30 PF13894 zf-C2H2_4: C2H2-type 97.7 3.1E-05 6.8E-10 39.4 2.2 24 328-353 1-24 (24)
31 PF12756 zf-C2H2_2: C2H2 type 97.6 3.4E-05 7.4E-10 55.6 2.1 73 238-321 1-73 (100)
32 PF13912 zf-C2H2_6: C2H2-type 97.5 5.8E-05 1.3E-09 39.7 1.9 26 327-354 1-26 (27)
33 PF12756 zf-C2H2_2: C2H2 type 97.5 6.8E-05 1.5E-09 54.0 2.5 74 267-353 1-74 (100)
34 PF13894 zf-C2H2_4: C2H2-type 97.5 8.5E-05 1.8E-09 37.7 1.9 24 297-322 1-24 (24)
35 PF13912 zf-C2H2_6: C2H2-type 97.3 9.5E-05 2.1E-09 38.9 1.3 26 296-323 1-26 (27)
36 COG5189 SFP1 Putative transcri 97.3 4E-05 8.6E-10 64.1 -0.4 26 203-228 347-372 (423)
37 PF09237 GAGA: GAGA factor; I 96.9 0.00074 1.6E-08 40.2 2.2 40 284-325 12-51 (54)
38 PF13909 zf-H2C2_5: C2H2-type 96.9 0.00084 1.8E-08 34.1 1.9 24 328-354 1-24 (24)
39 KOG4173|consensus 96.7 0.00055 1.2E-08 53.7 0.6 87 205-295 79-174 (253)
40 smart00355 ZnF_C2H2 zinc finge 96.7 0.0022 4.7E-08 33.0 2.8 23 297-321 1-23 (26)
41 COG5236 Uncharacterized conser 96.5 0.0018 3.8E-08 55.2 2.4 129 175-320 152-303 (493)
42 COG5236 Uncharacterized conser 96.4 0.0051 1.1E-07 52.5 4.4 132 145-292 152-306 (493)
43 PF09237 GAGA: GAGA factor; I 96.3 0.0029 6.3E-08 37.7 1.9 40 315-356 12-51 (54)
44 smart00355 ZnF_C2H2 zinc finge 96.3 0.0035 7.6E-08 32.1 2.1 24 328-353 1-24 (26)
45 KOG4173|consensus 96.3 0.0013 2.8E-08 51.7 0.5 71 242-318 87-166 (253)
46 PF12874 zf-met: Zinc-finger o 96.1 0.0027 5.9E-08 32.5 1.1 16 304-319 6-21 (25)
47 PF12874 zf-met: Zinc-finger o 96.0 0.0049 1.1E-07 31.5 1.8 23 328-352 1-23 (25)
48 PF13909 zf-H2C2_5: C2H2-type 96.0 0.0045 9.8E-08 31.3 1.5 24 297-323 1-24 (24)
49 KOG2231|consensus 95.8 0.017 3.7E-07 54.8 5.6 128 206-350 100-260 (669)
50 COG5048 FOG: Zn-finger [Genera 95.7 0.0026 5.6E-08 59.1 -0.1 61 73-140 28-88 (467)
51 KOG2231|consensus 95.6 0.03 6.4E-07 53.2 6.3 139 175-331 100-275 (669)
52 KOG1146|consensus 95.6 0.0044 9.6E-08 62.2 0.8 104 242-351 442-611 (1406)
53 KOG1146|consensus 95.2 0.011 2.3E-07 59.7 2.2 74 273-351 442-540 (1406)
54 COG5048 FOG: Zn-finger [Genera 95.2 0.0042 9E-08 57.7 -0.6 64 204-268 385-449 (467)
55 KOG2482|consensus 95.1 0.026 5.7E-07 48.3 3.9 159 189-352 128-357 (423)
56 PF12171 zf-C2H2_jaz: Zinc-fin 95.1 0.0081 1.7E-07 31.4 0.5 22 328-351 2-23 (27)
57 PRK04860 hypothetical protein; 95.1 0.012 2.6E-07 45.8 1.6 40 295-343 118-157 (160)
58 PF12171 zf-C2H2_jaz: Zinc-fin 94.8 0.009 2E-07 31.2 0.2 21 297-319 2-22 (27)
59 PRK04860 hypothetical protein; 94.2 0.027 5.8E-07 43.9 1.7 37 236-279 119-155 (160)
60 KOG2482|consensus 93.7 0.15 3.3E-06 43.8 5.3 54 173-230 143-218 (423)
61 smart00451 ZnF_U1 U1-like zinc 92.1 0.11 2.5E-06 28.8 1.8 23 327-351 3-25 (35)
62 PF13913 zf-C2HC_2: zinc-finge 91.9 0.18 3.9E-06 25.7 2.2 19 298-319 4-22 (25)
63 smart00451 ZnF_U1 U1-like zinc 91.6 0.14 3.1E-06 28.3 1.8 23 296-320 3-25 (35)
64 PF13913 zf-C2HC_2: zinc-finge 90.7 0.21 4.6E-06 25.4 1.7 21 328-351 3-23 (25)
65 KOG4377|consensus 87.5 0.41 9E-06 42.3 2.3 28 327-354 401-428 (480)
66 PF12013 DUF3505: Protein of u 86.9 1.3 2.8E-05 32.2 4.4 26 327-354 80-109 (109)
67 KOG2785|consensus 85.9 3 6.5E-05 36.9 6.6 22 116-137 4-25 (390)
68 COG4049 Uncharacterized protei 85.6 0.46 9.9E-06 29.0 1.1 28 233-260 14-41 (65)
69 COG4049 Uncharacterized protei 84.3 0.53 1.1E-05 28.7 1.0 24 294-319 15-38 (65)
70 PF12013 DUF3505: Protein of u 83.5 1.2 2.5E-05 32.5 2.7 82 235-323 10-109 (109)
71 KOG2893|consensus 82.5 0.52 1.1E-05 38.4 0.6 38 213-255 16-53 (341)
72 KOG2785|consensus 81.9 2.2 4.7E-05 37.8 4.1 22 145-168 4-25 (390)
73 KOG2893|consensus 81.6 0.46 9.9E-06 38.7 0.0 38 273-317 16-53 (341)
74 PF01352 KRAB: KRAB box; Inte 78.4 0.5 1.1E-05 27.4 -0.5 33 15-53 4-36 (41)
75 cd00350 rubredoxin_like Rubred 71.7 2 4.4E-05 23.4 0.8 9 327-337 17-25 (33)
76 PF02892 zf-BED: BED zinc fing 70.6 3 6.6E-05 24.5 1.5 25 327-353 16-44 (45)
77 TIGR00622 ssl1 transcription f 69.0 7.8 0.00017 28.1 3.5 25 295-321 80-104 (112)
78 KOG4377|consensus 68.7 3.8 8.3E-05 36.6 2.3 26 297-322 402-427 (480)
79 KOG4124|consensus 67.7 1.5 3.2E-05 38.1 -0.4 29 263-291 347-375 (442)
80 smart00614 ZnF_BED BED zinc fi 66.6 5.6 0.00012 24.1 2.1 25 328-354 19-48 (50)
81 PF09538 FYDLN_acid: Protein o 63.9 5.4 0.00012 28.8 1.9 15 235-249 25-39 (108)
82 PF12907 zf-met2: Zinc-binding 61.5 4.8 0.0001 23.1 1.0 32 237-268 2-36 (40)
83 COG1592 Rubrerythrin [Energy p 61.0 5.4 0.00012 31.3 1.6 13 260-274 144-156 (166)
84 PF09986 DUF2225: Uncharacteri 60.6 1.7 3.6E-05 36.0 -1.3 13 328-342 49-61 (214)
85 KOG2186|consensus 59.7 5.8 0.00013 33.0 1.6 46 206-257 4-49 (276)
86 cd00729 rubredoxin_SM Rubredox 58.8 4.6 0.0001 22.2 0.7 24 297-337 3-26 (34)
87 PF07800 DUF1644: Protein of u 56.9 25 0.00054 27.2 4.5 38 259-297 101-138 (162)
88 smart00734 ZnF_Rad18 Rad18-lik 54.1 14 0.00031 18.8 2.0 19 298-319 3-21 (26)
89 PF09538 FYDLN_acid: Protein o 54.1 9.3 0.0002 27.7 1.7 31 78-128 9-39 (108)
90 KOG2636|consensus 53.6 21 0.00044 32.6 4.1 25 105-136 398-423 (497)
91 PRK09678 DNA-binding transcrip 51.1 5.2 0.00011 26.5 0.1 15 327-341 27-41 (72)
92 PF09986 DUF2225: Uncharacteri 49.1 7.5 0.00016 32.2 0.7 13 297-311 49-61 (214)
93 TIGR00622 ssl1 transcription f 49.0 21 0.00046 25.9 2.9 48 298-352 57-104 (112)
94 PF04959 ARS2: Arsenite-resist 45.2 10 0.00022 31.2 0.9 23 296-320 77-99 (214)
95 PF05443 ROS_MUCR: ROS/MUCR tr 45.1 14 0.0003 27.9 1.5 22 328-354 73-94 (132)
96 PF13717 zinc_ribbon_4: zinc-r 44.9 18 0.00039 20.2 1.6 6 238-243 27-32 (36)
97 PF04959 ARS2: Arsenite-resist 44.4 18 0.00038 29.9 2.2 28 234-261 75-102 (214)
98 COG4957 Predicted transcriptio 37.9 22 0.00047 26.6 1.5 22 328-354 77-98 (148)
99 KOG0717|consensus 37.3 1.6E+02 0.0034 27.5 7.0 22 79-102 293-314 (508)
100 TIGR02300 FYDLN_acid conserved 36.6 25 0.00054 26.1 1.7 32 78-129 9-40 (129)
101 PRK00464 nrdR transcriptional 36.4 13 0.00028 28.9 0.3 13 297-311 29-41 (154)
102 TIGR00373 conserved hypothetic 36.2 19 0.00041 28.2 1.1 17 296-314 109-125 (158)
103 PF15269 zf-C2H2_7: Zinc-finge 36.1 49 0.0011 19.3 2.5 23 77-101 19-41 (54)
104 smart00531 TFIIE Transcription 34.5 26 0.00056 27.0 1.6 36 295-339 98-133 (147)
105 COG1592 Rubrerythrin [Energy p 33.9 36 0.00079 26.8 2.3 22 77-113 133-154 (166)
106 PF04606 Ogr_Delta: Ogr/Delta- 33.1 9.2 0.0002 22.8 -0.8 14 265-278 25-38 (47)
107 PRK06266 transcription initiat 32.5 20 0.00043 28.7 0.7 16 297-314 118-133 (178)
108 PF05443 ROS_MUCR: ROS/MUCR tr 32.3 33 0.00072 25.9 1.8 33 71-108 65-97 (132)
109 KOG4167|consensus 32.0 10 0.00022 36.8 -1.2 26 295-322 791-816 (907)
110 COG4957 Predicted transcriptio 30.2 63 0.0014 24.2 2.9 33 71-108 69-101 (148)
111 PF07800 DUF1644: Protein of u 29.7 68 0.0015 24.9 3.1 35 319-355 100-134 (162)
112 KOG4124|consensus 29.2 31 0.00066 30.4 1.3 52 263-317 176-231 (442)
113 COG1997 RPL43A Ribosomal prote 27.2 28 0.00062 23.8 0.6 12 296-309 53-64 (89)
114 KOG4167|consensus 27.0 22 0.00047 34.7 0.1 29 106-141 790-818 (907)
115 PF11789 zf-Nse: Zinc-finger o 27.0 27 0.00058 21.9 0.5 9 327-335 48-56 (57)
116 KOG2186|consensus 26.9 36 0.00079 28.5 1.3 50 78-139 3-52 (276)
117 PF03145 Sina: Seven in absent 26.1 73 0.0016 25.9 3.0 28 327-355 44-73 (198)
118 smart00659 RPOLCX RNA polymera 25.7 43 0.00094 19.7 1.2 27 297-339 3-29 (44)
119 KOG1842|consensus 24.9 42 0.0009 30.7 1.4 27 236-262 15-41 (505)
120 COG2879 Uncharacterized small 24.9 70 0.0015 20.4 2.0 16 341-356 24-39 (65)
121 KOG3002|consensus 24.8 40 0.00087 29.5 1.3 79 235-320 79-161 (299)
122 KOG1280|consensus 24.6 63 0.0014 28.6 2.4 32 121-153 85-116 (381)
123 PF10571 UPF0547: Uncharacteri 24.3 36 0.00079 17.4 0.6 8 330-339 17-24 (26)
124 KOG2593|consensus 23.3 65 0.0014 29.5 2.3 20 293-314 125-144 (436)
125 PF07754 DUF1610: Domain of un 22.9 42 0.00091 16.9 0.6 6 327-332 16-21 (24)
126 COG2888 Predicted Zn-ribbon RN 22.2 79 0.0017 20.0 1.8 9 327-337 50-58 (61)
127 TIGR00244 transcriptional regu 22.0 49 0.0011 25.4 1.1 11 121-131 34-44 (147)
128 TIGR00373 conserved hypothetic 22.0 1.1E+02 0.0024 23.9 3.2 15 237-251 110-124 (158)
129 KOG3408|consensus 21.9 50 0.0011 24.3 1.1 52 49-102 28-79 (129)
130 KOG2593|consensus 21.5 91 0.002 28.6 2.9 17 173-191 127-143 (436)
131 KOG2295|consensus 21.3 54 0.0012 31.0 1.4 42 313-355 493-537 (648)
132 PF04780 DUF629: Protein of un 20.1 77 0.0017 29.6 2.2 27 237-263 58-84 (466)
133 PF11567 PfUIS3: Plasmodium fa 20.1 2.3E+02 0.005 19.4 3.8 39 4-42 57-95 (101)
No 1
>KOG3608|consensus
Probab=99.97 E-value=4.5e-31 Score=217.82 Aligned_cols=264 Identities=24% Similarity=0.436 Sum_probs=226.7
Q ss_pred eeeccCCCCCCcccC-hhHHHhHHhhhC-----------------CCCccccccCC------------CCCccchhhccc
Q psy1773 78 LWSCPIVNCNFTSVN-LNTFKIHLLKHF-----------------DKRPFKCKLTT------------ANNTPCMWGFFS 127 (359)
Q Consensus 78 ~~~C~~~~C~~~~~~-~~~l~~H~~~h~-----------------~~~~~~C~~~~------------~~~~~C~~~f~~ 127 (359)
.++|.|..|++...+ ...|.+|.-.|- +..+-.|++-. +.|.+|+..|.+
T Consensus 69 e~qC~w~~C~f~~~~~s~~l~RHvy~H~y~~~l~q~G~~al~~~~dig~c~~~f~~~~~ip~~g~~f~C~WedCe~~F~s 148 (467)
T KOG3608|consen 69 EHQCTWNSCDFRTENSSADLIRHVYFHCYHTKLKQQGKLALDLHPDIGACTAPFRLMEKIPALGQNFRCGWEDCEREFVS 148 (467)
T ss_pred ceeEEeccCCccccchHHHHHhhhhhhhhHHHHHHHHHHHHhcCCCcCcccCCcchhhccccchhhhccChhhcCCcccC
Confidence 589999999998877 578999976552 11111111111 124679999999
Q ss_pred HHHHHhhhhhcCC------------CCCccccCCCccchhhcCHHHHHHHHHHhCCCCccccccccchhhcCChHHHHHH
Q psy1773 128 KHKLLRHMTSHDE------------DKKCYPCDQKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTH 195 (359)
Q Consensus 128 ~~~l~~H~~~~~~------------~~~~~~c~c~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H 195 (359)
...+..|+..|.. ++..+.|.|..|.+.+.+++.|+.|++.|++++...|.. |+..|.++..|..|
T Consensus 149 ~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~--Cg~~F~~~tkl~DH 226 (467)
T KOG3608|consen 149 IVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPH--CGELFRTKTKLFDH 226 (467)
T ss_pred HHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecch--HHHHhccccHHHHH
Confidence 9999999877642 225689999999999999999999999999999999999 99999999999999
Q ss_pred HHHcCC-CCCCccccCCCCCccccChHHHHHHhhhhcCCCCccccCCcCccCCChHHHHHHHHHHccCCCccccccCCcC
Q psy1773 196 LKEHSN-VSAPYMCDYKGCGKSFYLMASLKSHQRVHVTNPEDLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTYDGCK 274 (359)
Q Consensus 196 ~~~h~~-~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~ 274 (359)
.+..+. ...+|.|.. |.|.|.+...|..|+..|. .-|+|+.|+.+....++|..|++..|+..+||+| +.|+
T Consensus 227 ~rRqt~l~~n~fqC~~--C~KrFaTeklL~~Hv~rHv---n~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKC--d~Cd 299 (467)
T KOG3608|consen 227 LRRQTELNTNSFQCAQ--CFKRFATEKLLKSHVVRHV---NCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKC--DECD 299 (467)
T ss_pred HHhhhhhcCCchHHHH--HHHHHhHHHHHHHHHHHhh---hcccccccccCCCChHHHHHHHHhhhccCCCccc--cchh
Confidence 876542 246899986 9999999999999999994 4599999999999999999999999999999999 4699
Q ss_pred ccccChHhhhccccccccCCcccCCCCCCcCcccCChHHHHHHHHhhc-CC-CCceeccCCCccccccchhhHHHHHhhh
Q psy1773 275 LLFSTKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHN-EN-KPKFTCPYQDCLMSYVAKSSLYAHLKHV 352 (359)
Q Consensus 275 ~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~-~~-~~~~~C~~~~C~~~f~~~~~l~~H~~~h 352 (359)
+.|.+.+.|.+|.. .|+ +..|.|..+.|..+|.+...+.+|++.++ |. .++|.|-. |++.|++..+|.+|+...
T Consensus 300 ~~c~~esdL~kH~~-~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~--Cdr~ft~G~~L~~HL~kk 375 (467)
T KOG3608|consen 300 TRCVRESDLAKHVQ-VHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHC--CDRFFTSGKSLSAHLMKK 375 (467)
T ss_pred hhhccHHHHHHHHH-hcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeec--chhhhccchhHHHHHHHh
Confidence 99999999999998 688 67799999899999999999999998877 33 33799999 999999999999999988
Q ss_pred cC
Q psy1773 353 HS 354 (359)
Q Consensus 353 h~ 354 (359)
|+
T Consensus 376 H~ 377 (467)
T KOG3608|consen 376 HG 377 (467)
T ss_pred hc
Confidence 85
No 2
>KOG3608|consensus
Probab=99.96 E-value=5.2e-29 Score=205.67 Aligned_cols=233 Identities=24% Similarity=0.478 Sum_probs=205.1
Q ss_pred CCCCCeeeccCCCCCCcccChhHHHhHHhhhCC------------CCc-cccccCCCCCccchhhcccHHHHHhhhhhcC
Q psy1773 73 SKKQQLWSCPIVNCNFTSVNLNTFKIHLLKHFD------------KRP-FKCKLTTANNTPCMWGFFSKHKLLRHMTSHD 139 (359)
Q Consensus 73 ~~~~~~~~C~~~~C~~~~~~~~~l~~H~~~h~~------------~~~-~~C~~~~~~~~~C~~~f~~~~~l~~H~~~~~ 139 (359)
......|.|.|-.|+..|.++..|..|+..|.. ++| +.|.+ ..|-+.|.++..|++|+++|+
T Consensus 129 p~~g~~f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W-----~~Ct~~~~~k~~LreH~r~Hs 203 (467)
T KOG3608|consen 129 PALGQNFRCGWEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNW-----AMCTKHMGNKYRLREHIRTHS 203 (467)
T ss_pred ccchhhhccChhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccc-----hhhhhhhccHHHHHHHHHhcC
Confidence 334677999999999999999999999988852 222 44554 569999999999999999999
Q ss_pred CCCCccccCCCccchhhcCHHHHHHHHHHh--CCCCccccccccchhhcCChHHHHHHHHHcCCCCCCccccCCCCCccc
Q psy1773 140 EDKKCYPCDQKNCNKRFTTLSNLKMHMVRH--GKPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGCGKSF 217 (359)
Q Consensus 140 ~~~~~~~c~c~~C~~~f~~~~~l~~H~~~h--~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f 217 (359)
+++ ...| +.||..|+++..|..|++.. ....+|.|.. |.+.|.+...|..|+..|. ..|.|++ |..+.
T Consensus 204 ~eK-vvAC--p~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~--C~KrFaTeklL~~Hv~rHv---n~ykCpl--Cdmtc 273 (467)
T KOG3608|consen 204 NEK-VVAC--PHCGELFRTKTKLFDHLRRQTELNTNSFQCAQ--CFKRFATEKLLKSHVVRHV---NCYKCPL--CDMTC 273 (467)
T ss_pred CCe-EEec--chHHHHhccccHHHHHHHhhhhhcCCchHHHH--HHHHHhHHHHHHHHHHHhh---hcccccc--cccCC
Confidence 994 4445 59999999999999998864 3456899999 9999999999999999996 4699997 99999
Q ss_pred cChHHHHHHhhhhcCCCCccccCCcCccCCChHHHHHHHHHHccCCCccccccCCcCccccChHhhhccccccccCC--c
Q psy1773 218 YLMASLKSHQRVHVTNPEDLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTYDGCKLLFSTKSALKRHNKSKHLNM--R 295 (359)
Q Consensus 218 ~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~--~ 295 (359)
...++|.+|++.-|...+||+|+.|++.|.+.+.|.+|.. +|+ +..|+|..+.|.++|.+...|++|++.+|.|. .
T Consensus 274 ~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~-~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~ 351 (467)
T KOG3608|consen 274 SSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQ-VHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPI 351 (467)
T ss_pred CChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHH-hcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCC
Confidence 9999999999997788999999999999999999999988 898 77899998899999999999999999888654 4
Q ss_pred ccCCCCCCcCcccCChHHHHHHHHhhcCC
Q psy1773 296 MFPCPLVTCGKSFLRSEHLQEHMLQHNEN 324 (359)
Q Consensus 296 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~ 324 (359)
+|.|-. |++.|++..+|..|++..++-
T Consensus 352 ~Y~CH~--Cdr~ft~G~~L~~HL~kkH~f 378 (467)
T KOG3608|consen 352 LYACHC--CDRFFTSGKSLSAHLMKKHGF 378 (467)
T ss_pred ceeeec--chhhhccchhHHHHHHHhhcc
Confidence 699999 999999999999998776653
No 3
>KOG1074|consensus
Probab=99.95 E-value=2.4e-29 Score=228.94 Aligned_cols=196 Identities=22% Similarity=0.418 Sum_probs=150.6
Q ss_pred CCccchhhcCHHHHHHHHHHhCCCCccccccccchhhcCChHHHHHHHHHcCCC---CCCcccc---CCCCCccccChHH
Q psy1773 149 QKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSNV---SAPYMCD---YKGCGKSFYLMAS 222 (359)
Q Consensus 149 c~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~---~~~~~C~---~~~C~~~f~~~~~ 222 (359)
|-+|.++.+.++.|+.|+++|++++||+|.+ |++.|.++.+|+.|+-.|... ..++.|+ + |-+.|...-.
T Consensus 608 CiiC~rVlSC~saLqmHyrtHtGERPFkCKi--CgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~i--c~~kftn~V~ 683 (958)
T KOG1074|consen 608 CIICLRVLSCPSALQMHYRTHTGERPFKCKI--CGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFI--CQKKFTNAVT 683 (958)
T ss_pred eeeeeecccchhhhhhhhhcccCcCcccccc--ccchhccccchhhcccccccCccccccccCCchhh--hccccccccc
Confidence 8889999999999999999999999999999 999999999999999888641 2457888 7 9999999999
Q ss_pred HHHHhhhhcCCCCc------------cccCCcCccCCChHHHHHHHHHHcc----------------CCC----cccccc
Q psy1773 223 LKSHQRVHVTNPED------------LVCKGCGKQFKVPCRLREHYKAVHE----------------STK----NFKCTY 270 (359)
Q Consensus 223 l~~H~~~h~~~~~~------------~~C~~C~~~f~~~~~l~~H~~~~H~----------------~~~----~~~C~~ 270 (359)
|..|+++|.++..+ -.|..|.+.|.....+..++. .|. ++. +..+-
T Consensus 684 lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~s-e~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~- 761 (958)
T KOG1074|consen 684 LPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQIS-EQPSPESEPDEQMDERTETEELDVTPPPPEN- 761 (958)
T ss_pred ccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhh-ccCCcccCCcccccccccccccccCCCcccc-
Confidence 99999998533222 468899999988888877765 341 111 22232
Q ss_pred CCcCccccChHhhhcccccc----------------------ccC-----------------------------------
Q psy1773 271 DGCKLLFSTKSALKRHNKSK----------------------HLN----------------------------------- 293 (359)
Q Consensus 271 ~~C~~~f~~~~~l~~H~~~~----------------------H~~----------------------------------- 293 (359)
.|+..+.....+..+-... +++
T Consensus 762 -~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg 840 (958)
T KOG1074|consen 762 -SCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEG 840 (958)
T ss_pred -ccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhcccccccccc
Confidence 2555544433333222110 000
Q ss_pred ------------------------------------CcccCCCCCCcCcccCChHHHHHHHHhhcCCCCceeccCCCccc
Q psy1773 294 ------------------------------------MRMFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLM 337 (359)
Q Consensus 294 ------------------------------------~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~ 337 (359)
.....|.+ |++.|...++|..|+|+|+++| ||.|.+ |++
T Consensus 841 ~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~v--Cgk~FsSSsALqiH~rTHtg~K-PF~C~f--C~~ 915 (958)
T KOG1074|consen 841 LATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNV--CGKQFSSSAALEIHMRTHTGPK-PFFCHF--CEE 915 (958)
T ss_pred cccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhcc--chhcccchHHHHHhhhcCCCCC-Cccchh--hhh
Confidence 01166998 9999999999999999999999 699999 999
Q ss_pred cccchhhHHHHHhhhcCCC
Q psy1773 338 SYVAKSSLYAHLKHVHSTN 356 (359)
Q Consensus 338 ~f~~~~~l~~H~~~hh~~~ 356 (359)
.|..+.+|+.||.+|+..+
T Consensus 916 aFttrgnLKvHMgtH~w~q 934 (958)
T KOG1074|consen 916 AFTTRGNLKVHMGTHMWVQ 934 (958)
T ss_pred hhhhhhhhhhhhccccccC
Confidence 9999999999999998654
No 4
>KOG2462|consensus
Probab=99.95 E-value=2.2e-28 Score=197.14 Aligned_cols=136 Identities=33% Similarity=0.640 Sum_probs=123.1
Q ss_pred CCCccccCCCCCccccChHHHHHHhhhhcC--CCCccccCCcCccCCChHHHHHHHHHHccCCCccccccCCcCccccCh
Q psy1773 203 SAPYMCDYKGCGKSFYLMASLKSHQRVHVT--NPEDLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTYDGCKLLFSTK 280 (359)
Q Consensus 203 ~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~--~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~ 280 (359)
...|.|+ +||+.+.+.++|.+|..+|-. ..+.+.|+.|+|+|.+...|..|++ +|+ -+++|.+ ||+.|..+
T Consensus 128 ~~r~~c~--eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHir-TH~--l~c~C~i--CGKaFSRP 200 (279)
T KOG2462|consen 128 HPRYKCP--ECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIR-THT--LPCECGI--CGKAFSRP 200 (279)
T ss_pred CCceecc--ccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhh-ccC--CCccccc--ccccccch
Confidence 4558887 599999999999999999842 3467999999999999999999988 786 6789987 99999999
Q ss_pred HhhhccccccccCCcccCCCCCCcCcccCChHHHHHHHHhhcCCCCceeccCCCccccccchhhHHHHHhh
Q psy1773 281 SALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSYVAKSSLYAHLKH 351 (359)
Q Consensus 281 ~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~ 351 (359)
+-|+.|+| .|+|++||.|+. |++.|.++++|+.||++|.+.| +|.|.. |+++|..++.|.+|...
T Consensus 201 WLLQGHiR-THTGEKPF~C~h--C~kAFADRSNLRAHmQTHS~~K-~~qC~~--C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 201 WLLQGHIR-THTGEKPFSCPH--CGKAFADRSNLRAHMQTHSDVK-KHQCPR--CGKSFALKSYLNKHSES 265 (279)
T ss_pred HHhhcccc-cccCCCCccCCc--ccchhcchHHHHHHHHhhcCCc-cccCcc--hhhHHHHHHHHHHhhhh
Confidence 99999999 699999999998 9999999999999999999999 799999 99999999999999753
No 5
>KOG2462|consensus
Probab=99.93 E-value=8.5e-27 Score=187.98 Aligned_cols=133 Identities=35% Similarity=0.707 Sum_probs=106.9
Q ss_pred cccccccchhhcCChHHHHHHHHHcCC--CCCCccccCCCCCccccChHHHHHHhhhhcCCCCccccCCcCccCCChHHH
Q psy1773 175 LCCEELGCGRKFQTMKQYSTHLKEHSN--VSAPYMCDYKGCGKSFYLMASLKSHQRVHVTNPEDLVCKGCGKQFKVPCRL 252 (359)
Q Consensus 175 ~~C~~~~C~~~f~~~~~l~~H~~~h~~--~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l 252 (359)
|.|.. |++.+.+..+|.+|..+|.. ..+.+.|.+ |++.+.+...|..|+++|. -+++|.+|||.|.....|
T Consensus 131 ~~c~e--Cgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~--C~K~YvSmpALkMHirTH~---l~c~C~iCGKaFSRPWLL 203 (279)
T KOG2462|consen 131 YKCPE--CGKSYSTSSNLSRHKQTHRSLDSKKAFSCKY--CGKVYVSMPALKMHIRTHT---LPCECGICGKAFSRPWLL 203 (279)
T ss_pred eeccc--cccccccccccchhhcccccccccccccCCC--CCceeeehHHHhhHhhccC---CCcccccccccccchHHh
Confidence 55555 55555555555555555532 145688886 9999999999999999992 578899999999999999
Q ss_pred HHHHHHHccCCCccccccCCcCccccChHhhhccccccccCCcccCCCCCCcCcccCChHHHHHHHHh
Q psy1773 253 REHYKAVHESTKNFKCTYDGCKLLFSTKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQ 320 (359)
Q Consensus 253 ~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~ 320 (359)
+-|++ .|+|||||.|+. |+++|..+++|+.||+ .|.+.++|+|.. |+|+|...+-|.+|...
T Consensus 204 QGHiR-THTGEKPF~C~h--C~kAFADRSNLRAHmQ-THS~~K~~qC~~--C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 204 QGHIR-THTGEKPFSCPH--CGKAFADRSNLRAHMQ-THSDVKKHQCPR--CGKSFALKSYLNKHSES 265 (279)
T ss_pred hcccc-cccCCCCccCCc--ccchhcchHHHHHHHH-hhcCCccccCcc--hhhHHHHHHHHHHhhhh
Confidence 99988 899999999976 9999999999999998 599999999988 99999999999998654
No 6
>KOG3623|consensus
Probab=99.86 E-value=5.6e-22 Score=178.39 Aligned_cols=112 Identities=29% Similarity=0.556 Sum_probs=95.0
Q ss_pred CCeeeccCCCCCCcccChhHHHhHHhh-h-CCCCccccccCCCCCccchhhcccHHHHHhhhhhcCCCCC----------
Q psy1773 76 QQLWSCPIVNCNFTSVNLNTFKIHLLK-H-FDKRPFKCKLTTANNTPCMWGFFSKHKLLRHMTSHDEDKK---------- 143 (359)
Q Consensus 76 ~~~~~C~~~~C~~~~~~~~~l~~H~~~-h-~~~~~~~C~~~~~~~~~C~~~f~~~~~l~~H~~~~~~~~~---------- 143 (359)
.....|+ .|...++....|+.|++- | ..+.-|.|.+ |.++|..+..|.+||.+|.....
T Consensus 208 sqlltcp--ycdrgykrltslkeHikyrhekne~nfsC~l-------CsytFAyRtQLErhm~~hkpg~dqa~sltqsa~ 278 (1007)
T KOG3623|consen 208 SQLLTCP--YCDRGYKRLTSLKEHIKYRHEKNEPNFSCML-------CSYTFAYRTQLERHMQLHKPGGDQAISLTQSAL 278 (1007)
T ss_pred hhhhcch--hHHHHHHHHHHHHHHHHHHHhhCCCCCcchh-------hhhhhhhHHHHHHHHHhhcCCCcccccccchhh
Confidence 4567899 799999999999999874 3 3466699999 99999999999999998864321
Q ss_pred ccccCCCccchhhcCHHHHHHHHHHhCCCCccccccccchhhcCChHHHHHHHHH
Q psy1773 144 CYPCDQKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTHLKE 198 (359)
Q Consensus 144 ~~~c~c~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~ 198 (359)
..++.|.+|||.|..+..|+.|+++|.|++||.|.. |.+.|.....+-.|+..
T Consensus 279 lRKFKCtECgKAFKfKHHLKEHlRIHSGEKPfeCpn--CkKRFSHSGSySSHmSS 331 (1007)
T KOG3623|consen 279 LRKFKCTECGKAFKFKHHLKEHLRIHSGEKPFECPN--CKKRFSHSGSYSSHMSS 331 (1007)
T ss_pred hccccccccchhhhhHHHHHhhheeecCCCCcCCcc--cccccccCCcccccccc
Confidence 123334499999999999999999999999999999 99999999999999854
No 7
>KOG3623|consensus
Probab=99.85 E-value=4.4e-22 Score=179.05 Aligned_cols=80 Identities=33% Similarity=0.679 Sum_probs=75.2
Q ss_pred CccccccCCcCccccChHhhhccccccccCCcccCCCCCCcCcccCChHHHHHHHHhhcCCCCceeccCCCccccccchh
Q psy1773 264 KNFKCTYDGCKLLFSTKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSYVAKS 343 (359)
Q Consensus 264 ~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~~~~ 343 (359)
.+|.|. .|+++|...++|.+|.- .|+|.+||+|.+ |.|.|+.+-.|..|+|.|.|+| ||+|.. |+++|....
T Consensus 893 gmyaCD--qCDK~FqKqSSLaRHKY-EHsGqRPyqC~i--CkKAFKHKHHLtEHkRLHSGEK-PfQCdK--ClKRFSHSG 964 (1007)
T KOG3623|consen 893 GMYACD--QCDKAFQKQSSLARHKY-EHSGQRPYQCII--CKKAFKHKHHLTEHKRLHSGEK-PFQCDK--CLKRFSHSG 964 (1007)
T ss_pred ccchHH--HHHHHHHhhHHHHHhhh-hhcCCCCcccch--hhHhhhhhhhhhhhhhhccCCC-cchhhh--hhhhccccc
Confidence 468995 59999999999999988 799999999999 9999999999999999999999 799999 999999999
Q ss_pred hHHHHHhh
Q psy1773 344 SLYAHLKH 351 (359)
Q Consensus 344 ~l~~H~~~ 351 (359)
++.+||..
T Consensus 965 SYSQHMNH 972 (1007)
T KOG3623|consen 965 SYSQHMNH 972 (1007)
T ss_pred chHhhhcc
Confidence 99999863
No 8
>KOG1074|consensus
Probab=99.80 E-value=2e-20 Score=171.33 Aligned_cols=244 Identities=23% Similarity=0.374 Sum_probs=175.4
Q ss_pred eeeCCCCCCCCCCCeeeccCCCCCCcccChhHHHhHHhhhCCCCccccccCCCCCccchhhcccHHHHHhhhhhcCCCCC
Q psy1773 64 FISSTGGKNSKKQQLWSCPIVNCNFTSVNLNTFKIHLLKHFDKRPFKCKLTTANNTPCMWGFFSKHKLLRHMTSHDEDKK 143 (359)
Q Consensus 64 ~~~~~~~~~~~~~~~~~C~~~~C~~~~~~~~~l~~H~~~h~~~~~~~C~~~~~~~~~C~~~f~~~~~l~~H~~~~~~~~~ 143 (359)
..+.++........+-+|. .|.++.+-.+.|+.|+++|+|++||+|++ |++.|.++.+|+.||..|... .
T Consensus 591 l~slv~~~~~~~TdPNqCi--iC~rVlSC~saLqmHyrtHtGERPFkCKi-------CgRAFtTkGNLkaH~~vHka~-p 660 (958)
T KOG1074|consen 591 LTSLVENSENKRTDPNQCI--ICLRVLSCPSALQMHYRTHTGERPFKCKI-------CGRAFTTKGNLKAHMSVHKAK-P 660 (958)
T ss_pred ccccccccccccCCcccee--eeeecccchhhhhhhhhcccCcCcccccc-------ccchhccccchhhcccccccC-c
Confidence 3344444455667788999 59999999999999999999999999999 999999999999999998765 2
Q ss_pred c--cccCCC---ccchhhcCHHHHHHHHHHhCCCC-------------ccccccccchhhcCChHHHHHHHHHcCCC---
Q psy1773 144 C--YPCDQK---NCNKRFTTLSNLKMHMVRHGKPL-------------TLCCEELGCGRKFQTMKQYSTHLKEHSNV--- 202 (359)
Q Consensus 144 ~--~~c~c~---~C~~~f~~~~~l~~H~~~h~~~~-------------~~~C~~~~C~~~f~~~~~l~~H~~~h~~~--- 202 (359)
+ ....|+ +|.+.|...-.|..|+++|.+.. .-+|.. |.+.|.....+..++..+.+.
T Consensus 661 ~~R~q~ScP~~~ic~~kftn~V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~--~qk~~~~a~~f~~~~se~~~~~s~ 738 (958)
T KOG1074|consen 661 PARVQFSCPSTFICQKKFTNAVTLPQHIRIHLGGQISNGGTAAEGILAADQCSS--CQKTFSDARSFSQQISEQPSPESE 738 (958)
T ss_pred cccccccCCchhhhcccccccccccceEEeecCCCCCCCcccccccchhcccch--hhhcccccccchhhhhccCCcccC
Confidence 2 244577 89999999999999999987321 246777 999998888888888776320
Q ss_pred ------------CC----CccccCCCCCccccChHHHHHHhhhh----------------------cCCCCcc-ccCCcC
Q psy1773 203 ------------SA----PYMCDYKGCGKSFYLMASLKSHQRVH----------------------VTNPEDL-VCKGCG 243 (359)
Q Consensus 203 ------------~~----~~~C~~~~C~~~f~~~~~l~~H~~~h----------------------~~~~~~~-~C~~C~ 243 (359)
+. +..+.. |+..+.....+..+-..+ ...+++. .+.+++
T Consensus 739 ~~~~~~~~t~t~~~~~tp~~~e~~--~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~ 816 (958)
T KOG1074|consen 739 PDEQMDERTETEELDVTPPPPENS--CGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPG 816 (958)
T ss_pred CcccccccccccccccCCCccccc--cccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCC
Confidence 11 344443 776666554443332111 0223444 566655
Q ss_pred ccCCChHH----HHHHHH-------------HHccC------------------------CCccccccCCcCccccChHh
Q psy1773 244 KQFKVPCR----LREHYK-------------AVHES------------------------TKNFKCTYDGCKLLFSTKSA 282 (359)
Q Consensus 244 ~~f~~~~~----l~~H~~-------------~~H~~------------------------~~~~~C~~~~C~~~f~~~~~ 282 (359)
......-. |..-.. ..+.+ .....|.+ |++.|.+.++
T Consensus 817 ~~~~~~v~~~pvl~~~~~~~l~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~v--Cgk~FsSSsA 894 (958)
T KOG1074|consen 817 EILAPSVNMDPVLWNQETSMLNEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNV--CGKQFSSSAA 894 (958)
T ss_pred cCCccccccCchhhcccccccccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhcc--chhcccchHH
Confidence 43322110 000000 00000 01267987 9999999999
Q ss_pred hhccccccccCCcccCCCCCCcCcccCChHHHHHHHHhhcCCCC
Q psy1773 283 LKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHNENKP 326 (359)
Q Consensus 283 l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~ 326 (359)
|..|++ .|++++||.|.+ |++.|..+..|+.||.+|....+
T Consensus 895 LqiH~r-THtg~KPF~C~f--C~~aFttrgnLKvHMgtH~w~q~ 935 (958)
T KOG1074|consen 895 LEIHMR-THTGPKPFFCHF--CEEAFTTRGNLKVHMGTHMWVQP 935 (958)
T ss_pred HHHhhh-cCCCCCCccchh--hhhhhhhhhhhhhhhccccccCC
Confidence 999999 699999999999 99999999999999999987763
No 9
>KOG3576|consensus
Probab=99.68 E-value=1.7e-17 Score=127.41 Aligned_cols=114 Identities=30% Similarity=0.604 Sum_probs=90.1
Q ss_pred CCccccCCcCccCCChHHHHHHHHHHccCCCccccccCCcCccccChHhhhccccccccCCcccCCCCCCcCcccCChHH
Q psy1773 234 PEDLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTYDGCKLLFSTKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEH 313 (359)
Q Consensus 234 ~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~ 313 (359)
...|.|.+|+|.|.....|.+|++ -|...+.|-|.. ||+.|...-.|++|++ .|+|.+||+|.. |+++|..+-+
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~k-ch~~vkr~lct~--cgkgfndtfdlkrh~r-thtgvrpykc~~--c~kaftqrcs 188 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLK-CHSDVKRHLCTF--CGKGFNDTFDLKRHTR-THTGVRPYKCSL--CEKAFTQRCS 188 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhh-hccHHHHHHHhh--ccCcccchhhhhhhhc-cccCccccchhh--hhHHHHhhcc
Confidence 355788888888888888888877 788888888876 8888888888888888 588888888887 8888888888
Q ss_pred HHHHHHhhcCCC----------CceeccCCCccccccchhhHHHHHhhhcCC
Q psy1773 314 LQEHMLQHNENK----------PKFTCPYQDCLMSYVAKSSLYAHLKHVHST 355 (359)
Q Consensus 314 l~~H~~~h~~~~----------~~~~C~~~~C~~~f~~~~~l~~H~~~hh~~ 355 (359)
|..|++..+|.. +.|.|+. ||..-.....+..|++.||+.
T Consensus 189 leshl~kvhgv~~~yaykerr~kl~vced--cg~t~~~~e~~~~h~~~~hp~ 238 (267)
T KOG3576|consen 189 LESHLKKVHGVQHQYAYKERRAKLYVCED--CGYTSERPEVYYLHLKLHHPF 238 (267)
T ss_pred HHHHHHHHcCchHHHHHHHhhhheeeecc--cCCCCCChhHHHHHHHhcCCC
Confidence 888876655421 1588887 888888888888888888865
No 10
>KOG3576|consensus
Probab=99.61 E-value=1.4e-16 Score=122.50 Aligned_cols=112 Identities=29% Similarity=0.612 Sum_probs=56.7
Q ss_pred ccccCCCCCccccChHHHHHHhhhhcCCCCccccCCcCccCCChHHHHHHHHHHccCCCccccccCCcCccccChHhhhc
Q psy1773 206 YMCDYKGCGKSFYLMASLKSHQRVHVTNPEDLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTYDGCKLLFSTKSALKR 285 (359)
Q Consensus 206 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~ 285 (359)
|.|.+ |++.|.....|.+|++.| ...+.+.|..||+.|.....|++|++ +|+|.+||+|.. |+++|+.+-+|..
T Consensus 118 ftCrv--CgK~F~lQRmlnrh~kch-~~vkr~lct~cgkgfndtfdlkrh~r-thtgvrpykc~~--c~kaftqrcsles 191 (267)
T KOG3576|consen 118 FTCRV--CGKKFGLQRMLNRHLKCH-SDVKRHLCTFCGKGFNDTFDLKRHTR-THTGVRPYKCSL--CEKAFTQRCSLES 191 (267)
T ss_pred eeeeh--hhhhhhHHHHHHHHhhhc-cHHHHHHHhhccCcccchhhhhhhhc-cccCccccchhh--hhHHHHhhccHHH
Confidence 44443 444444444444444444 23344444444444444444444444 444444444433 4444444444444
Q ss_pred ccccccc----------CCcccCCCCCCcCcccCChHHHHHHHHhhcCCC
Q psy1773 286 HNKSKHL----------NMRMFPCPLVTCGKSFLRSEHLQEHMLQHNENK 325 (359)
Q Consensus 286 H~~~~H~----------~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~ 325 (359)
|++.+|. ..+.|.|.. ||.+-.....+..|+..|+..-
T Consensus 192 hl~kvhgv~~~yaykerr~kl~vced--cg~t~~~~e~~~~h~~~~hp~S 239 (267)
T KOG3576|consen 192 HLKKVHGVQHQYAYKERRAKLYVCED--CGYTSERPEVYYLHLKLHHPFS 239 (267)
T ss_pred HHHHHcCchHHHHHHHhhhheeeecc--cCCCCCChhHHHHHHHhcCCCC
Confidence 4444443 234566766 7777777777777777666544
No 11
>PHA00733 hypothetical protein
Probab=99.13 E-value=6.7e-11 Score=88.34 Aligned_cols=81 Identities=22% Similarity=0.363 Sum_probs=55.3
Q ss_pred CccccccCCcCccccChHhhhcc------ccccccCCcccCCCCCCcCcccCChHHHHHHHHhhcCCCCceeccCCCccc
Q psy1773 264 KNFKCTYDGCKLLFSTKSALKRH------NKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLM 337 (359)
Q Consensus 264 ~~~~C~~~~C~~~f~~~~~l~~H------~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~ 337 (359)
+++.|.+ |+..|..+..|..+ +. +.+.+||.|+. |++.|.+...|..|++.| +. +|.|+. |++
T Consensus 39 ~~~~~~~--~~~~~~~~~~l~~~~~l~~~~~--~~~~kPy~C~~--Cgk~Fss~s~L~~H~r~h--~~-~~~C~~--CgK 107 (128)
T PHA00733 39 KRLIRAV--VKTLIYNPQLLDESSYLYKLLT--SKAVSPYVCPL--CLMPFSSSVSLKQHIRYT--EH-SKVCPV--CGK 107 (128)
T ss_pred hhHHHHH--HhhhccChhhhcchHHHHhhcc--cCCCCCccCCC--CCCcCCCHHHHHHHHhcC--Cc-CccCCC--CCC
Confidence 4455554 55555554444333 22 33467788887 888888888888888776 23 588887 888
Q ss_pred cccchhhHHHHHhhhcCC
Q psy1773 338 SYVAKSSLYAHLKHVHST 355 (359)
Q Consensus 338 ~f~~~~~l~~H~~~hh~~ 355 (359)
+|.....|.+|+...|+.
T Consensus 108 ~F~~~~sL~~H~~~~h~~ 125 (128)
T PHA00733 108 EFRNTDSTLDHVCKKHNI 125 (128)
T ss_pred ccCCHHHHHHHHHHhcCc
Confidence 888888888888887764
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.08 E-value=7.1e-10 Score=101.71 Aligned_cols=98 Identities=23% Similarity=0.494 Sum_probs=44.3
Q ss_pred ccccccchhhcCChHHHHHHHHHcCCCCCCccccCCCCCccccChHHHHHHhhhhcCCCCccccCCcCccCC--------
Q psy1773 176 CCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGCGKSFYLMASLKSHQRVHVTNPEDLVCKGCGKQFK-------- 247 (359)
Q Consensus 176 ~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~-------- 247 (359)
.|+. |++.|. ...|..|+..++ .++.|+ ||+.+ .+..|..|+.+| ...+++.|..|++.|.
T Consensus 455 ~C~~--Cgk~f~-~s~LekH~~~~H---kpv~Cp---Cg~~~-~R~~L~~H~~th-Cp~Kpi~C~fC~~~v~~g~~~~d~ 523 (567)
T PLN03086 455 HCEK--CGQAFQ-QGEMEKHMKVFH---EPLQCP---CGVVL-EKEQMVQHQAST-CPLRLITCRFCGDMVQAGGSAMDV 523 (567)
T ss_pred cCCC--CCCccc-hHHHHHHHHhcC---CCccCC---CCCCc-chhHHHhhhhcc-CCCCceeCCCCCCccccCccccch
Confidence 4444 555443 344455555443 345553 55433 334555555544 2234555555555542
Q ss_pred --ChHHHHHHHHHHccCCCccccccCCcCccccChHhhhccccc
Q psy1773 248 --VPCRLREHYKAVHESTKNFKCTYDGCKLLFSTKSALKRHNKS 289 (359)
Q Consensus 248 --~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~ 289 (359)
..+.|..|.. .+ |.+++.|.. ||+.+..+ .|..|+..
T Consensus 524 ~d~~s~Lt~HE~-~C-G~rt~~C~~--Cgk~Vrlr-dm~~H~~~ 562 (567)
T PLN03086 524 RDRLRGMSEHES-IC-GSRTAPCDS--CGRSVMLK-EMDIHQIA 562 (567)
T ss_pred hhhhhhHHHHHH-hc-CCcceEccc--cCCeeeeh-hHHHHHHH
Confidence 1234555543 22 455555533 55554433 34444433
No 13
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.07 E-value=5.2e-10 Score=102.60 Aligned_cols=149 Identities=23% Similarity=0.515 Sum_probs=105.8
Q ss_pred cccccccchhhcCChHHHHHHHHHcCCCCCCccccCCCCCccccChHHHHHHhhhhcCCCCccccCCcCccCCChHHHHH
Q psy1773 175 LCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGCGKSFYLMASLKSHQRVHVTNPEDLVCKGCGKQFKVPCRLRE 254 (359)
Q Consensus 175 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~~ 254 (359)
-.|.. |..... ...|..|...-. .....|+...|+..|.. ..+. ..+.|+.|++.|. ...|..
T Consensus 408 V~C~N--C~~~i~-l~~l~lHe~~C~--r~~V~Cp~~~Cg~v~~r-~el~----------~H~~C~~Cgk~f~-~s~Lek 470 (567)
T PLN03086 408 VECRN--CKHYIP-SRSIALHEAYCS--RHNVVCPHDGCGIVLRV-EEAK----------NHVHCEKCGQAFQ-QGEMEK 470 (567)
T ss_pred EECCC--CCCccc-hhHHHHHHhhCC--CcceeCCcccccceeec-cccc----------cCccCCCCCCccc-hHHHHH
Confidence 35666 666544 344456664432 23455753348888733 2223 3357999999986 677899
Q ss_pred HHHHHccCCCccccccCCcCccccChHhhhccccccccCCcccCCCCCCcCcccC----------ChHHHHHHHHhhcCC
Q psy1773 255 HYKAVHESTKNFKCTYDGCKLLFSTKSALKRHNKSKHLNMRMFPCPLVTCGKSFL----------RSEHLQEHMLQHNEN 324 (359)
Q Consensus 255 H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~----------~~~~l~~H~~~h~~~ 324 (359)
|++ +|. +++.|+ |+..+ .+..|..|+. .|.+.+++.|+. |++.|. ..+.|..|..+. |.
T Consensus 471 H~~-~~H--kpv~Cp---Cg~~~-~R~~L~~H~~-thCp~Kpi~C~f--C~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~ 539 (567)
T PLN03086 471 HMK-VFH--EPLQCP---CGVVL-EKEQMVQHQA-STCPLRLITCRF--CGDMVQAGGSAMDVRDRLRGMSEHESIC-GS 539 (567)
T ss_pred HHH-hcC--CCccCC---CCCCc-chhHHHhhhh-ccCCCCceeCCC--CCCccccCccccchhhhhhhHHHHHHhc-CC
Confidence 987 553 789995 98655 6688999987 688999999998 999885 245789998885 77
Q ss_pred CCceeccCCCccccccchhhHHHHHhhhcCC
Q psy1773 325 KPKFTCPYQDCLMSYVAKSSLYAHLKHVHST 355 (359)
Q Consensus 325 ~~~~~C~~~~C~~~f~~~~~l~~H~~~hh~~ 355 (359)
+ ++.|.. ||+.+..+ .|..|+...|..
T Consensus 540 r-t~~C~~--Cgk~Vrlr-dm~~H~~~~h~~ 566 (567)
T PLN03086 540 R-TAPCDS--CGRSVMLK-EMDIHQIAVHQK 566 (567)
T ss_pred c-ceEccc--cCCeeeeh-hHHHHHHHhhcC
Confidence 7 699988 99988765 478888887764
No 14
>PHA00733 hypothetical protein
Probab=99.04 E-value=2.7e-10 Score=85.09 Aligned_cols=84 Identities=21% Similarity=0.424 Sum_probs=64.2
Q ss_pred CCCccccCCcCccCCChHHHHHH--HH--HHccCCCccccccCCcCccccChHhhhccccccccCCcccCCCCCCcCccc
Q psy1773 233 NPEDLVCKGCGKQFKVPCRLREH--YK--AVHESTKNFKCTYDGCKLLFSTKSALKRHNKSKHLNMRMFPCPLVTCGKSF 308 (359)
Q Consensus 233 ~~~~~~C~~C~~~f~~~~~l~~H--~~--~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f 308 (359)
..+++.|.+|...|.....|..+ +. ..+.+.+||.|+. |++.|.+...|..|++ .| ..+|.|+. |++.|
T Consensus 37 ~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~--Cgk~Fss~s~L~~H~r-~h--~~~~~C~~--CgK~F 109 (128)
T PHA00733 37 EQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPL--CLMPFSSSVSLKQHIR-YT--EHSKVCPV--CGKEF 109 (128)
T ss_pred hhhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCC--CCCcCCCHHHHHHHHh-cC--CcCccCCC--CCCcc
Confidence 35678888888888777666655 11 1234578899965 9999999999999988 35 35689988 99999
Q ss_pred CChHHHHHHHHhhcC
Q psy1773 309 LRSEHLQEHMLQHNE 323 (359)
Q Consensus 309 ~~~~~l~~H~~~h~~ 323 (359)
.....|..|+...++
T Consensus 110 ~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 110 RNTDSTLDHVCKKHN 124 (128)
T ss_pred CCHHHHHHHHHHhcC
Confidence 999999999887765
No 15
>PHA02768 hypothetical protein; Provisional
Probab=98.83 E-value=2e-09 Score=65.78 Aligned_cols=44 Identities=18% Similarity=0.579 Sum_probs=37.8
Q ss_pred ccCCCCCCcCcccCChHHHHHHHHhhcCCCCceeccCCCccccccchhhHH
Q psy1773 296 MFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSYVAKSSLY 346 (359)
Q Consensus 296 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~l~ 346 (359)
.|.|+. ||+.|+..++|..|+++|+ + +|.|.. |++.|...+.|.
T Consensus 5 ~y~C~~--CGK~Fs~~~~L~~H~r~H~--k-~~kc~~--C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPI--CGEIYIKRKSMITHLRKHN--T-NLKLSN--CKRISLRTGEYI 48 (55)
T ss_pred ccCcch--hCCeeccHHHHHHHHHhcC--C-cccCCc--ccceecccceeE
Confidence 488988 9999999999999999998 4 689988 999998877664
No 16
>PHA02768 hypothetical protein; Provisional
Probab=98.69 E-value=1e-08 Score=62.70 Aligned_cols=43 Identities=21% Similarity=0.387 Sum_probs=29.5
Q ss_pred cccCCcCccCCChHHHHHHHHHHccCCCccccccCCcCccccChHhhh
Q psy1773 237 LVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTYDGCKLLFSTKSALK 284 (359)
Q Consensus 237 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~ 284 (359)
|.|+.||+.|...+.|..|++ +|+ ++|+|.. |++.|...+.|.
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r-~H~--k~~kc~~--C~k~f~~~s~l~ 48 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLR-KHN--TNLKLSN--CKRISLRTGEYI 48 (55)
T ss_pred cCcchhCCeeccHHHHHHHHH-hcC--CcccCCc--ccceecccceeE
Confidence 667777777777777777766 566 5677754 777777666554
No 17
>KOG3993|consensus
Probab=98.59 E-value=1.3e-08 Score=87.68 Aligned_cols=199 Identities=17% Similarity=0.235 Sum_probs=114.4
Q ss_pred cccCCCccchhhcCHHHHHHHHHHhCCCCccccccccchhhcCChHHHHHHHHHcCCCCCCccccCCCCCccccChHHHH
Q psy1773 145 YPCDQKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGCGKSFYLMASLK 224 (359)
Q Consensus 145 ~~c~c~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~l~ 224 (359)
|.|. .|...|.+.-.|.+|.-.-.-...|+|.+ |++.|.-..+|..|.+.|......-.-..+.=.+...+....+
T Consensus 268 yiCq--LCK~kYeD~F~LAQHrC~RIV~vEYrCPE--C~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ 343 (500)
T KOG3993|consen 268 YICQ--LCKEKYEDAFALAQHRCPRIVHVEYRCPE--CDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQ 343 (500)
T ss_pred HHHH--HHHHhhhhHHHHhhccCCeeEEeeecCCc--ccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhh
Confidence 4555 78888888888888854333334588888 8888888888888888875311100000000000000000111
Q ss_pred HHhhh-hcCCCCccccCCcCccCCChHHHHHHHHHHccCCCc----------------cccccCCcCccccChHhhhccc
Q psy1773 225 SHQRV-HVTNPEDLVCKGCGKQFKVPCRLREHYKAVHESTKN----------------FKCTYDGCKLLFSTKSALKRHN 287 (359)
Q Consensus 225 ~H~~~-h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~----------------~~C~~~~C~~~f~~~~~l~~H~ 287 (359)
.-.+. -......|.|..|++.|.....|+.|..+.|..... +.|. .|+-.+.....--.+.
T Consensus 344 ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~--~~a~h~~a~~~~g~~v 421 (500)
T KOG3993|consen 344 EAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQ--AVATHSSASDSHGDEV 421 (500)
T ss_pred hccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhccccccccc--ccccccccccccccce
Confidence 11100 001224689999999999999999997633332211 2232 2443333333222222
Q ss_pred cccccCCcccCCCCCCcCcccCChHHHHHHHHhhcCCCCceeccCCCccccccchhhHHHHHhhhcC
Q psy1773 288 KSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSYVAKSSLYAHLKHVHS 354 (359)
Q Consensus 288 ~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~hh~ 354 (359)
..+-.....-.|+. |+..+.++..--.+.+.-..+. .|.|.+ |...|.+...|.+|+..-|.
T Consensus 422 l~~a~sael~~pp~--~~~ppsss~~sgg~~rlg~~~q-~f~~ky--~~atfyss~~ltrhin~~Hp 483 (500)
T KOG3993|consen 422 LYVAGSAELELPPY--DGSPPSSSGSSGGYGRLGIAEQ-GFTCKY--CPATFYSSPGLTRHINKCHP 483 (500)
T ss_pred eeeeccccccCCCC--CCCCcccCCCCCccccccchhh-cccccc--chHhhhcCcchHhHhhhcCh
Confidence 21111122345777 8877777766666666555666 699999 99999999999999887764
No 18
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.59 E-value=4.2e-08 Score=51.11 Aligned_cols=26 Identities=38% Similarity=0.874 Sum_probs=22.1
Q ss_pred HHHHHHHhhcCCCCceeccCCCccccccc
Q psy1773 313 HLQEHMLQHNENKPKFTCPYQDCLMSYVA 341 (359)
Q Consensus 313 ~l~~H~~~h~~~~~~~~C~~~~C~~~f~~ 341 (359)
+|.+|+++|++++ ||.|+. |+++|.+
T Consensus 1 ~l~~H~~~H~~~k-~~~C~~--C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEK-PYKCPY--CGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSS-SEEESS--SSEEESS
T ss_pred CHHHHhhhcCCCC-CCCCCC--CcCeeCc
Confidence 5788999999998 699988 9998863
No 19
>KOG3993|consensus
Probab=98.37 E-value=4.3e-08 Score=84.53 Aligned_cols=197 Identities=19% Similarity=0.246 Sum_probs=111.9
Q ss_pred ccccccCCCCCccchhhcccHHHHHhhhhhcCCCCCccccCCCccchhhcCHHHHHHHHHHhCCCCccc-cccccchhhc
Q psy1773 108 PFKCKLTTANNTPCMWGFFSKHKLLRHMTSHDEDKKCYPCDQKNCNKRFTTLSNLKMHMVRHGKPLTLC-CEELGCGRKF 186 (359)
Q Consensus 108 ~~~C~~~~~~~~~C~~~f~~~~~l~~H~~~~~~~~~~~~c~c~~C~~~f~~~~~l~~H~~~h~~~~~~~-C~~~~C~~~f 186 (359)
.|.|.+ |...|.+...|.+|.-.---. .-|.|. +|+|+|+...+|..|.+-|.....-- -...-=.+..
T Consensus 267 dyiCqL-------CK~kYeD~F~LAQHrC~RIV~-vEYrCP--EC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~ 336 (500)
T KOG3993|consen 267 DYICQL-------CKEKYEDAFALAQHRCPRIVH-VEYRCP--ECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAV 336 (500)
T ss_pred HHHHHH-------HHHhhhhHHHHhhccCCeeEE-eeecCC--cccccccCchhhhhhhcccCCchhhhhcCCCChhhhh
Confidence 388998 999999999999986332211 225665 99999999999999988875432110 0000000000
Q ss_pred CChHHHHHHHHH-cCCCCCCccccCCCCCccccChHHHHHHhhhhcCCCC----------------ccccCCcCccCCCh
Q psy1773 187 QTMKQYSTHLKE-HSNVSAPYMCDYKGCGKSFYLMASLKSHQRVHVTNPE----------------DLVCKGCGKQFKVP 249 (359)
Q Consensus 187 ~~~~~l~~H~~~-h~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~----------------~~~C~~C~~~f~~~ 249 (359)
.+....+.-.+. ....+.-|.|.+ |++.|.....|+.|+.+|+.... -+-|..|.-.+...
T Consensus 337 ~~rae~~ea~rsg~dss~gi~~C~~--C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~ 414 (500)
T KOG3993|consen 337 ETRAEVQEAERSGDDSSSGIFSCHT--CGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSAS 414 (500)
T ss_pred hhhhhhhhccccCCcccCceeecHH--hhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccc
Confidence 011000000000 001134688987 99999999999999888743211 13355555444432
Q ss_pred HHHHHHHHHHcc-CCCccccccCCcCccccChHhhhccccccccCCcccCCCCCCcCcccCChHHHHHHHHhhc
Q psy1773 250 CRLREHYKAVHE-STKNFKCTYDGCKLLFSTKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHN 322 (359)
Q Consensus 250 ~~l~~H~~~~H~-~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~ 322 (359)
..--.+.. .+. ...-..|++ |+..+.++..--.+.+. -..+..|.|.+ |.-.|.+...|.+|+..-|
T Consensus 415 ~~~g~~vl-~~a~sael~~pp~--~~~ppsss~~sgg~~rl-g~~~q~f~~ky--~~atfyss~~ltrhin~~H 482 (500)
T KOG3993|consen 415 DSHGDEVL-YVAGSAELELPPY--DGSPPSSSGSSGGYGRL-GIAEQGFTCKY--CPATFYSSPGLTRHINKCH 482 (500)
T ss_pred ccccccee-eeeccccccCCCC--CCCCcccCCCCCccccc-cchhhcccccc--chHhhhcCcchHhHhhhcC
Confidence 21111111 011 111234655 77777776665555542 33445689998 9999999999999886644
No 20
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.36 E-value=2e-07 Score=48.54 Aligned_cols=26 Identities=27% Similarity=0.694 Sum_probs=23.1
Q ss_pred HHHhHHhhhCCCCccccccCCCCCccchhhccc
Q psy1773 95 TFKIHLLKHFDKRPFKCKLTTANNTPCMWGFFS 127 (359)
Q Consensus 95 ~l~~H~~~h~~~~~~~C~~~~~~~~~C~~~f~~ 127 (359)
+|..|+++|++++||.|++ |++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~-------C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPY-------CGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESS-------SSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCC-------CcCeeCc
Confidence 4889999999999999998 9998863
No 21
>PHA00616 hypothetical protein
Probab=98.27 E-value=4.1e-07 Score=52.81 Aligned_cols=34 Identities=32% Similarity=0.711 Sum_probs=25.9
Q ss_pred ccCCCCCCcCcccCChHHHHHHHHhhcCCCCceeccC
Q psy1773 296 MFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPY 332 (359)
Q Consensus 296 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~ 332 (359)
||+|+. ||+.|...+.|..|++.||+++ ++.|+.
T Consensus 1 pYqC~~--CG~~F~~~s~l~~H~r~~hg~~-~~~~~~ 34 (44)
T PHA00616 1 MYQCLR--CGGIFRKKKEVIEHLLSVHKQN-KLTLEY 34 (44)
T ss_pred CCccch--hhHHHhhHHHHHHHHHHhcCCC-ccceeE
Confidence 577877 8888888888888888888877 577654
No 22
>PHA00616 hypothetical protein
Probab=98.23 E-value=5.4e-07 Score=52.34 Aligned_cols=31 Identities=32% Similarity=0.535 Sum_probs=28.6
Q ss_pred ceeccCCCccccccchhhHHHHHhhhcCCCCCC
Q psy1773 327 KFTCPYQDCLMSYVAKSSLYAHLKHVHSTNKIT 359 (359)
Q Consensus 327 ~~~C~~~~C~~~f~~~~~l~~H~~~hh~~~~~~ 359 (359)
||+|+. ||+.|..++.|.+|++.||+++..+
T Consensus 1 pYqC~~--CG~~F~~~s~l~~H~r~~hg~~~~~ 31 (44)
T PHA00616 1 MYQCLR--CGGIFRKKKEVIEHLLSVHKQNKLT 31 (44)
T ss_pred CCccch--hhHHHhhHHHHHHHHHHhcCCCccc
Confidence 599999 9999999999999999999998753
No 23
>PHA00732 hypothetical protein
Probab=98.20 E-value=1.2e-06 Score=59.22 Aligned_cols=49 Identities=35% Similarity=0.732 Sum_probs=31.7
Q ss_pred ccccccCCcCccccChHhhhccccccccCCcccCCCCCCcCcccCChHHHHHHHHhhcC
Q psy1773 265 NFKCTYDGCKLLFSTKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHNE 323 (359)
Q Consensus 265 ~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~ 323 (359)
||.|.. |++.|.+...|..|++..|. ++.|+. |++.|. .|..|.+++.+
T Consensus 1 py~C~~--Cgk~F~s~s~Lk~H~r~~H~---~~~C~~--CgKsF~---~l~~H~~~~~~ 49 (79)
T PHA00732 1 MFKCPI--CGFTTVTLFALKQHARRNHT---LTKCPV--CNKSYR---RLNQHFYSQYD 49 (79)
T ss_pred CccCCC--CCCccCCHHHHHHHhhcccC---CCccCC--CCCEeC---ChhhhhcccCC
Confidence 466754 77777777777777763344 256776 777776 46677765544
No 24
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.14 E-value=4.4e-06 Score=52.36 Aligned_cols=52 Identities=31% Similarity=0.632 Sum_probs=38.4
Q ss_pred ccCCCCCCcCcccCChHHHHHHHHhh-cCCCCceeccCCCccccccchhhHHHHHhhhcC
Q psy1773 296 MFPCPLVTCGKSFLRSEHLQEHMLQH-NENKPKFTCPYQDCLMSYVAKSSLYAHLKHVHS 354 (359)
Q Consensus 296 ~~~C~~~~C~~~f~~~~~l~~H~~~h-~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~hh~ 354 (359)
.|.|++ |++ .-+...|..|.... ..+...+.|++ |...+. .+|.+|+..+|+
T Consensus 2 ~f~CP~--C~~-~~~~~~L~~H~~~~H~~~~~~v~CPi--C~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 2 SFTCPY--CGK-GFSESSLVEHCEDEHRSESKNVVCPI--CSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred CcCCCC--CCC-ccCHHHHHHHHHhHCcCCCCCccCCC--chhhhh--hHHHHHHHHhcC
Confidence 488888 998 45567888887654 44433688988 988655 488889888875
No 25
>PHA00732 hypothetical protein
Probab=98.03 E-value=3.3e-06 Score=57.05 Aligned_cols=45 Identities=24% Similarity=0.429 Sum_probs=24.4
Q ss_pred ccccCCcCccCCChHHHHHHHHHHccCCCccccccCCcCccccChHhhhcccc
Q psy1773 236 DLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTYDGCKLLFSTKSALKRHNK 288 (359)
Q Consensus 236 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~ 288 (359)
||.|..|++.|.+...|..|++..|.+ +.|+. |++.|. .|..|+.
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~~---~~C~~--CgKsF~---~l~~H~~ 45 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHTL---TKCPV--CNKSYR---RLNQHFY 45 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccCC---CccCC--CCCEeC---Chhhhhc
Confidence 355666666666666666665422432 35643 666665 3555554
No 26
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.99 E-value=6.4e-06 Score=51.61 Aligned_cols=50 Identities=26% Similarity=0.610 Sum_probs=23.1
Q ss_pred cccccCCcCccccChHhhhccccccccCC-cccCCCCCCcCcccCChHHHHHHHHhhc
Q psy1773 266 FKCTYDGCKLLFSTKSALKRHNKSKHLNM-RMFPCPLVTCGKSFLRSEHLQEHMLQHN 322 (359)
Q Consensus 266 ~~C~~~~C~~~f~~~~~l~~H~~~~H~~~-~~~~C~~~~C~~~f~~~~~l~~H~~~h~ 322 (359)
|.||+ |++ ..+...|..|....|..+ +.+.|++ |...+. .+|..|+..++
T Consensus 3 f~CP~--C~~-~~~~~~L~~H~~~~H~~~~~~v~CPi--C~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPY--CGK-GFSESSLVEHCEDEHRSESKNVVCPI--CSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCC--CCC-ccCHHHHHHHHHhHCcCCCCCccCCC--chhhhh--hHHHHHHHHhc
Confidence 44554 555 233444555544444432 2355555 554333 25555555443
No 27
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.75 E-value=2.1e-05 Score=39.66 Aligned_cols=22 Identities=36% Similarity=0.854 Sum_probs=13.2
Q ss_pred eeccCCCccccccchhhHHHHHhh
Q psy1773 328 FTCPYQDCLMSYVAKSSLYAHLKH 351 (359)
Q Consensus 328 ~~C~~~~C~~~f~~~~~l~~H~~~ 351 (359)
|.|+. |++.|.+...|.+|++.
T Consensus 1 y~C~~--C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPI--CGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETT--TTEEESSHHHHHHHHHH
T ss_pred CCCCC--CCCccCCHHHHHHHHhH
Confidence 45655 66666666666666655
No 28
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.71 E-value=1.8e-05 Score=39.90 Aligned_cols=23 Identities=48% Similarity=1.123 Sum_probs=17.6
Q ss_pred cCCCCCCcCcccCChHHHHHHHHhh
Q psy1773 297 FPCPLVTCGKSFLRSEHLQEHMLQH 321 (359)
Q Consensus 297 ~~C~~~~C~~~f~~~~~l~~H~~~h 321 (359)
|.|+. |++.|.+...|..|++.|
T Consensus 1 y~C~~--C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPI--CGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETT--TTEEESSHHHHHHHHHHH
T ss_pred CCCCC--CCCccCCHHHHHHHHhHC
Confidence 56777 888888888888887764
No 29
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.69 E-value=1.8e-05 Score=66.14 Aligned_cols=75 Identities=28% Similarity=0.514 Sum_probs=40.3
Q ss_pred CccccccCCcCccccChHhhhccccccccCCcccCCCCCCcCcccCChHHHHHHHHhhcCCCCceeccCCCccccccchh
Q psy1773 264 KNFKCTYDGCKLLFSTKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSYVAKS 343 (359)
Q Consensus 264 ~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~~~~ 343 (359)
+||+|++++|.+.+.+...|+.|+..-|...+...=+-+.=-..| -..+| ||+|++ |++++....
T Consensus 348 KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F------------~~~~K-PYrCev--C~KRYKNlN 412 (423)
T COG5189 348 KPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIF------------SAKDK-PYRCEV--CDKRYKNLN 412 (423)
T ss_pred ceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccc------------cccCC-ceeccc--cchhhccCc
Confidence 566666666666666666666665422221111111100000111 12334 799999 999999988
Q ss_pred hHHHHHhhhc
Q psy1773 344 SLYAHLKHVH 353 (359)
Q Consensus 344 ~l~~H~~~hh 353 (359)
.|+-|...-|
T Consensus 413 GLKYHr~Hsh 422 (423)
T COG5189 413 GLKYHRKHSH 422 (423)
T ss_pred cceecccccC
Confidence 8888865433
No 30
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.69 E-value=3.1e-05 Score=39.39 Aligned_cols=24 Identities=42% Similarity=0.837 Sum_probs=14.1
Q ss_pred eeccCCCccccccchhhHHHHHhhhc
Q psy1773 328 FTCPYQDCLMSYVAKSSLYAHLKHVH 353 (359)
Q Consensus 328 ~~C~~~~C~~~f~~~~~l~~H~~~hh 353 (359)
|.|++ |++.|.+...|..|+++||
T Consensus 1 ~~C~~--C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPI--CGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SS--TS-EESSHHHHHHHHHHHS
T ss_pred CCCcC--CCCcCCcHHHHHHHHHhhC
Confidence 45666 6666666666666666655
No 31
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.60 E-value=3.4e-05 Score=55.56 Aligned_cols=73 Identities=27% Similarity=0.387 Sum_probs=18.6
Q ss_pred ccCCcCccCCChHHHHHHHHHHccCCCccccccCCcCccccChHhhhccccccccCCcccCCCCCCcCcccCChHHHHHH
Q psy1773 238 VCKGCGKQFKVPCRLREHYKAVHESTKNFKCTYDGCKLLFSTKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEH 317 (359)
Q Consensus 238 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H 317 (359)
.|..|+..|.+...|..|+...|.-..+ ....+.....+..+++. .....+.|.+ |++.|.+...|..|
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~-------~~~~l~~~~~~~~~~~~--~~~~~~~C~~--C~~~f~s~~~l~~H 69 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIP-------DQKYLVDPNRLLNYLRK--KVKESFRCPY--CNKTFRSREALQEH 69 (100)
T ss_dssp ---------------------------------------------------------SSEEBSS--SS-EESSHHHHHHH
T ss_pred Cccccccccccccccccccccccccccc-------ccccccccccccccccc--ccCCCCCCCc--cCCCCcCHHHHHHH
Confidence 3777888888888888887766653322 11122244444444431 2223578887 88888888888888
Q ss_pred HHhh
Q psy1773 318 MLQH 321 (359)
Q Consensus 318 ~~~h 321 (359)
|+.+
T Consensus 70 m~~~ 73 (100)
T PF12756_consen 70 MRSK 73 (100)
T ss_dssp HHHT
T ss_pred HcCc
Confidence 8764
No 32
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.53 E-value=5.8e-05 Score=39.70 Aligned_cols=26 Identities=27% Similarity=0.456 Sum_probs=16.7
Q ss_pred ceeccCCCccccccchhhHHHHHhhhcC
Q psy1773 327 KFTCPYQDCLMSYVAKSSLYAHLKHVHS 354 (359)
Q Consensus 327 ~~~C~~~~C~~~f~~~~~l~~H~~~hh~ 354 (359)
||.|.. |++.|.+...|..|++.|++
T Consensus 1 ~~~C~~--C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDE--CGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETT--TTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCc--cCCccCChhHHHHHhHHhcC
Confidence 366666 66666666666666666654
No 33
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.49 E-value=6.8e-05 Score=53.97 Aligned_cols=74 Identities=26% Similarity=0.506 Sum_probs=22.1
Q ss_pred ccccCCcCccccChHhhhccccccccCCcccCCCCCCcCcccCChHHHHHHHHhhcCCCCceeccCCCccccccchhhHH
Q psy1773 267 KCTYDGCKLLFSTKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSYVAKSSLY 346 (359)
Q Consensus 267 ~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~l~ 346 (359)
+|.+ |+..|.+...|..|+...|.-..+ ....+.....+..+++.-... .+.|.+ |++.|.+...|.
T Consensus 1 ~C~~--C~~~f~~~~~l~~H~~~~H~~~~~-------~~~~l~~~~~~~~~~~~~~~~--~~~C~~--C~~~f~s~~~l~ 67 (100)
T PF12756_consen 1 QCLF--CDESFSSVDDLLQHMKKKHGFDIP-------DQKYLVDPNRLLNYLRKKVKE--SFRCPY--CNKTFRSREALQ 67 (100)
T ss_dssp ---------------------------------------------------------S--SEEBSS--SS-EESSHHHHH
T ss_pred Cccc--cccccccccccccccccccccccc-------cccccccccccccccccccCC--CCCCCc--cCCCCcCHHHHH
Confidence 4877 999999999999999877764332 112233445555555443332 499999 999999999999
Q ss_pred HHHhhhc
Q psy1773 347 AHLKHVH 353 (359)
Q Consensus 347 ~H~~~hh 353 (359)
.|++.++
T Consensus 68 ~Hm~~~~ 74 (100)
T PF12756_consen 68 EHMRSKH 74 (100)
T ss_dssp HHHHHTT
T ss_pred HHHcCcc
Confidence 9999753
No 34
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.46 E-value=8.5e-05 Score=37.72 Aligned_cols=24 Identities=50% Similarity=1.110 Sum_probs=17.7
Q ss_pred cCCCCCCcCcccCChHHHHHHHHhhc
Q psy1773 297 FPCPLVTCGKSFLRSEHLQEHMLQHN 322 (359)
Q Consensus 297 ~~C~~~~C~~~f~~~~~l~~H~~~h~ 322 (359)
|.|++ |++.|.+...|..|+++|+
T Consensus 1 ~~C~~--C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPI--CGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SS--TS-EESSHHHHHHHHHHHS
T ss_pred CCCcC--CCCcCCcHHHHHHHHHhhC
Confidence 56887 8888888888888888764
No 35
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.35 E-value=9.5e-05 Score=38.87 Aligned_cols=26 Identities=38% Similarity=0.680 Sum_probs=20.1
Q ss_pred ccCCCCCCcCcccCChHHHHHHHHhhcC
Q psy1773 296 MFPCPLVTCGKSFLRSEHLQEHMLQHNE 323 (359)
Q Consensus 296 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~ 323 (359)
||.|.. |++.|.+...|..|++.|.+
T Consensus 1 ~~~C~~--C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDE--CGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETT--TTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCc--cCCccCChhHHHHHhHHhcC
Confidence 477877 88888888888888877754
No 36
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.34 E-value=4e-05 Score=64.14 Aligned_cols=26 Identities=35% Similarity=0.691 Sum_probs=17.2
Q ss_pred CCCccccCCCCCccccChHHHHHHhh
Q psy1773 203 SAPYMCDYKGCGKSFYLMASLKSHQR 228 (359)
Q Consensus 203 ~~~~~C~~~~C~~~f~~~~~l~~H~~ 228 (359)
++||.|++..|.|.+.+...|+-|+.
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~l 372 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHML 372 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhh
Confidence 36677766667777666666666654
No 37
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.92 E-value=0.00074 Score=40.19 Aligned_cols=40 Identities=28% Similarity=0.498 Sum_probs=20.8
Q ss_pred hccccccccCCcccCCCCCCcCcccCChHHHHHHHHhhcCCC
Q psy1773 284 KRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHNENK 325 (359)
Q Consensus 284 ~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~ 325 (359)
..+.+.++..+.|-.|++ |+..+.+..+|++|+.++|+.+
T Consensus 12 ~~~~k~~~~S~~PatCP~--C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 12 TKKPKSKSQSEQPATCPI--CGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp ----CCCCTTS--EE-TT--T--EESSHHHHHHHHHHHTTTS
T ss_pred hhHHHHhhccCCCCCCCc--chhhccchhhHHHHHHHHhccc
Confidence 334443445566667777 7777777777777777766665
No 38
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.86 E-value=0.00084 Score=34.05 Aligned_cols=24 Identities=46% Similarity=0.783 Sum_probs=15.9
Q ss_pred eeccCCCccccccchhhHHHHHhhhcC
Q psy1773 328 FTCPYQDCLMSYVAKSSLYAHLKHVHS 354 (359)
Q Consensus 328 ~~C~~~~C~~~f~~~~~l~~H~~~hh~ 354 (359)
|.|+. |++... +..|.+|++.+|+
T Consensus 1 y~C~~--C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPH--CSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SS--SS-EES-HHHHHHHHHHHHS
T ss_pred CCCCC--CCCcCC-HHHHHHHHHhhCc
Confidence 56777 777776 7777777777764
No 39
>KOG4173|consensus
Probab=96.69 E-value=0.00055 Score=53.75 Aligned_cols=87 Identities=23% Similarity=0.434 Sum_probs=57.7
Q ss_pred CccccCCCCCccccChHHHHHHhhhhcCCCCccccCCcCccCCChHHHHHHHHHHc---------cCCCccccccCCcCc
Q psy1773 205 PYMCDYKGCGKSFYLMASLKSHQRVHVTNPEDLVCKGCGKQFKVPCRLREHYKAVH---------ESTKNFKCTYDGCKL 275 (359)
Q Consensus 205 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H---------~~~~~~~C~~~~C~~ 275 (359)
.+.|++..|...|.+-..+..|..+-|++ .|.+|.+.|.+...|..|+...| .|...|+|-+++|+.
T Consensus 79 ~~~cqvagc~~~~d~lD~~E~hY~~~h~~----sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~ 154 (253)
T KOG4173|consen 79 AFACQVAGCCQVFDALDDYEHHYHTLHGN----SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTE 154 (253)
T ss_pred cccccccchHHHHhhhhhHHHhhhhcccc----hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhh
Confidence 35566656666666666666665443333 47777777777777777765555 245568888888888
Q ss_pred cccChHhhhccccccccCCc
Q psy1773 276 LFSTKSALKRHNKSKHLNMR 295 (359)
Q Consensus 276 ~f~~~~~l~~H~~~~H~~~~ 295 (359)
.|.+...-..|+-..|.-..
T Consensus 155 KFkT~r~RkdH~I~~Hk~Pa 174 (253)
T KOG4173|consen 155 KFKTSRDRKDHMIRMHKYPA 174 (253)
T ss_pred hhhhhhhhhhHHHHhccCCc
Confidence 88888888888776675433
No 40
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.68 E-value=0.0022 Score=32.95 Aligned_cols=23 Identities=48% Similarity=0.988 Sum_probs=12.4
Q ss_pred cCCCCCCcCcccCChHHHHHHHHhh
Q psy1773 297 FPCPLVTCGKSFLRSEHLQEHMLQH 321 (359)
Q Consensus 297 ~~C~~~~C~~~f~~~~~l~~H~~~h 321 (359)
|.|+. |++.|.....|..|++.|
T Consensus 1 ~~C~~--C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPE--CGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCC--CcchhCCHHHHHHHHHHh
Confidence 34554 555555555555555544
No 41
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.48 E-value=0.0018 Score=55.20 Aligned_cols=129 Identities=26% Similarity=0.538 Sum_probs=64.8
Q ss_pred cccccccchhhcCChHHHHHHHHHcCCCCCCccccCCCCC---cccc------ChHHHHHHhhhhcCC---CCccccCCc
Q psy1773 175 LCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGCG---KSFY------LMASLKSHQRVHVTN---PEDLVCKGC 242 (359)
Q Consensus 175 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~---~~f~------~~~~l~~H~~~h~~~---~~~~~C~~C 242 (359)
|.|+...|......-..|..|.+..++ .+.|.+ |- +.|. ++..|..|...-..+ ..--.|..|
T Consensus 152 F~CP~skc~~~C~~~k~lk~H~K~~H~---~~~C~~--C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC 226 (493)
T COG5236 152 FKCPKSKCHRRCGSLKELKKHYKAQHG---FVLCSE--CIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFC 226 (493)
T ss_pred hcCCchhhhhhhhhHHHHHHHHHhhcC---cEEhHh--hhcCcccCccceeeeecccccccccCCccccCcCCCchhhhc
Confidence 666665566666666777777776654 245553 42 2222 334455554432111 011246666
Q ss_pred CccCCChHHHHHHHHHHccCCCccccccCCcCcc-------ccChHhhhccccccccCCcccCCCCCCcC----cccCCh
Q psy1773 243 GKQFKVPCRLREHYKAVHESTKNFKCTYDGCKLL-------FSTKSALKRHNKSKHLNMRMFPCPLVTCG----KSFLRS 311 (359)
Q Consensus 243 ~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~-------f~~~~~l~~H~~~~H~~~~~~~C~~~~C~----~~f~~~ 311 (359)
...|-+-..|.+|++..|. .|-+ |+.. |.+..+|..|.+..| |.|.++.|- ..|...
T Consensus 227 ~~~FYdDDEL~~HcR~~HE-----~ChI--CD~v~p~~~QYFK~Y~~Le~HF~~~h-----y~ct~qtc~~~k~~vf~~~ 294 (493)
T COG5236 227 KIYFYDDDELRRHCRLRHE-----ACHI--CDMVGPIRYQYFKSYEDLEAHFRNAH-----YCCTFQTCRVGKCYVFPYH 294 (493)
T ss_pred cceecChHHHHHHHHhhhh-----hhhh--hhccCccchhhhhCHHHHHHHhhcCc-----eEEEEEEEecCcEEEeccH
Confidence 6666666666666665553 3443 4432 444555555554322 344332231 245555
Q ss_pred HHHHHHHHh
Q psy1773 312 EHLQEHMLQ 320 (359)
Q Consensus 312 ~~l~~H~~~ 320 (359)
..|..|+..
T Consensus 295 ~el~~h~~~ 303 (493)
T COG5236 295 TELLEHLTR 303 (493)
T ss_pred HHHHHHHHH
Confidence 555555533
No 42
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.35 E-value=0.0051 Score=52.47 Aligned_cols=132 Identities=24% Similarity=0.522 Sum_probs=76.5
Q ss_pred cccCCCccchhhcCHHHHHHHHHHhCCCCccccccccch---h------hcCChHHHHHHHHHcCCC---CCCccccCCC
Q psy1773 145 YPCDQKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCG---R------KFQTMKQYSTHLKEHSNV---SAPYMCDYKG 212 (359)
Q Consensus 145 ~~c~c~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~---~------~f~~~~~l~~H~~~h~~~---~~~~~C~~~~ 212 (359)
|.|+-.-|..+......|..|.+..++. +.|.. |- + ...++..|+.|...-..+ ..--.|.+
T Consensus 152 F~CP~skc~~~C~~~k~lk~H~K~~H~~--~~C~~--C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~F-- 225 (493)
T COG5236 152 FKCPKSKCHRRCGSLKELKKHYKAQHGF--VLCSE--CIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIF-- 225 (493)
T ss_pred hcCCchhhhhhhhhHHHHHHHHHhhcCc--EEhHh--hhcCcccCccceeeeecccccccccCCccccCcCCCchhhh--
Confidence 5666556666666677777887764432 33444 32 2 223445555555432211 11235776
Q ss_pred CCccccChHHHHHHhhhhcCCCCccccCCcCc-------cCCChHHHHHHHHHHccCCCccccccCCcC----ccccChH
Q psy1773 213 CGKSFYLMASLKSHQRVHVTNPEDLVCKGCGK-------QFKVPCRLREHYKAVHESTKNFKCTYDGCK----LLFSTKS 281 (359)
Q Consensus 213 C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~-------~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~----~~f~~~~ 281 (359)
|...|.+-..|..|.+..|. .|.+|++ -|.+...|..|.+..| |.|.+..|- ..|....
T Consensus 226 C~~~FYdDDEL~~HcR~~HE-----~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h-----y~ct~qtc~~~k~~vf~~~~ 295 (493)
T COG5236 226 CKIYFYDDDELRRHCRLRHE-----ACHICDMVGPIRYQYFKSYEDLEAHFRNAH-----YCCTFQTCRVGKCYVFPYHT 295 (493)
T ss_pred ccceecChHHHHHHHHhhhh-----hhhhhhccCccchhhhhCHHHHHHHhhcCc-----eEEEEEEEecCcEEEeccHH
Confidence 77788888888888776532 2555543 4667777777766433 555443232 4577777
Q ss_pred hhhcccccccc
Q psy1773 282 ALKRHNKSKHL 292 (359)
Q Consensus 282 ~l~~H~~~~H~ 292 (359)
.|+.|+...|.
T Consensus 296 el~~h~~~~h~ 306 (493)
T COG5236 296 ELLEHLTRFHK 306 (493)
T ss_pred HHHHHHHHHhh
Confidence 78888765564
No 43
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.30 E-value=0.0029 Score=37.73 Aligned_cols=40 Identities=20% Similarity=0.306 Sum_probs=26.2
Q ss_pred HHHHHhhcCCCCceeccCCCccccccchhhHHHHHhhhcCCC
Q psy1773 315 QEHMLQHNENKPKFTCPYQDCLMSYVAKSSLYAHLKHVHSTN 356 (359)
Q Consensus 315 ~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~hh~~~ 356 (359)
..+.+.+.....|-.|++ |+..+.+..+|.+|+...|+.+
T Consensus 12 ~~~~k~~~~S~~PatCP~--C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 12 TKKPKSKSQSEQPATCPI--CGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp ----CCCCTTS--EE-TT--T--EESSHHHHHHHHHHHTTTS
T ss_pred hhHHHHhhccCCCCCCCc--chhhccchhhHHHHHHHHhccc
Confidence 455666655555899999 9999999999999999998754
No 44
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.28 E-value=0.0035 Score=32.12 Aligned_cols=24 Identities=29% Similarity=0.657 Sum_probs=21.3
Q ss_pred eeccCCCccccccchhhHHHHHhhhc
Q psy1773 328 FTCPYQDCLMSYVAKSSLYAHLKHVH 353 (359)
Q Consensus 328 ~~C~~~~C~~~f~~~~~l~~H~~~hh 353 (359)
|.|.. |++.|.+...|..|++.|.
T Consensus 1 ~~C~~--C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPE--CGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCC--CcchhCCHHHHHHHHHHhc
Confidence 57988 9999999999999999553
No 45
>KOG4173|consensus
Probab=96.27 E-value=0.0013 Score=51.70 Aligned_cols=71 Identities=28% Similarity=0.581 Sum_probs=33.7
Q ss_pred cCccCCChHHHHHHHHHHccCCCccccccCCcCccccChHhhhcccccccc---------CCcccCCCCCCcCcccCChH
Q psy1773 242 CGKQFKVPCRLREHYKAVHESTKNFKCTYDGCKLLFSTKSALKRHNKSKHL---------NMRMFPCPLVTCGKSFLRSE 312 (359)
Q Consensus 242 C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~---------~~~~~~C~~~~C~~~f~~~~ 312 (359)
|...|.+...+..|+...|++ .|.. |.+.|.+..-|..|+..-|. |...|.|-+.+|+..|.+..
T Consensus 87 c~~~~d~lD~~E~hY~~~h~~----sCs~--C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~KFkT~r 160 (253)
T KOG4173|consen 87 CCQVFDALDDYEHHYHTLHGN----SCSF--CKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEKFKTSR 160 (253)
T ss_pred hHHHHhhhhhHHHhhhhcccc----hhHH--HHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhhhhhhh
Confidence 444555555555554434432 3443 55555555555555443231 23345555445555555555
Q ss_pred HHHHHH
Q psy1773 313 HLQEHM 318 (359)
Q Consensus 313 ~l~~H~ 318 (359)
.-..|+
T Consensus 161 ~RkdH~ 166 (253)
T KOG4173|consen 161 DRKDHM 166 (253)
T ss_pred hhhhHH
Confidence 555554
No 46
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.14 E-value=0.0027 Score=32.47 Aligned_cols=16 Identities=38% Similarity=0.764 Sum_probs=7.4
Q ss_pred cCcccCChHHHHHHHH
Q psy1773 304 CGKSFLRSEHLQEHML 319 (359)
Q Consensus 304 C~~~f~~~~~l~~H~~ 319 (359)
|++.|.+...|..|++
T Consensus 6 C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 6 CNKSFSSENSLRQHLR 21 (25)
T ss_dssp TTEEESSHHHHHHHHT
T ss_pred CCCCcCCHHHHHHHHC
Confidence 4444444444444443
No 47
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.04 E-value=0.0049 Score=31.48 Aligned_cols=23 Identities=35% Similarity=0.678 Sum_probs=21.0
Q ss_pred eeccCCCccccccchhhHHHHHhhh
Q psy1773 328 FTCPYQDCLMSYVAKSSLYAHLKHV 352 (359)
Q Consensus 328 ~~C~~~~C~~~f~~~~~l~~H~~~h 352 (359)
|.|.+ |+..|.+...|..|++.+
T Consensus 1 ~~C~~--C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDI--CNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETT--TTEEESSHHHHHHHHTTH
T ss_pred CCCCC--CCCCcCCHHHHHHHHCcC
Confidence 68999 999999999999999864
No 48
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.99 E-value=0.0045 Score=31.28 Aligned_cols=24 Identities=29% Similarity=0.718 Sum_probs=16.2
Q ss_pred cCCCCCCcCcccCChHHHHHHHHhhcC
Q psy1773 297 FPCPLVTCGKSFLRSEHLQEHMLQHNE 323 (359)
Q Consensus 297 ~~C~~~~C~~~f~~~~~l~~H~~~h~~ 323 (359)
|+|+. |+.... ...|.+|++.|++
T Consensus 1 y~C~~--C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPH--CSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SS--SS-EES-HHHHHHHHHHHHS
T ss_pred CCCCC--CCCcCC-HHHHHHHHHhhCc
Confidence 56777 887777 7788888877653
No 49
>KOG2231|consensus
Probab=95.83 E-value=0.017 Score=54.79 Aligned_cols=128 Identities=22% Similarity=0.386 Sum_probs=76.0
Q ss_pred ccccCCCCCcccc---------------ChHHHHHHhhhhcCCCCccccCCcCc---------cCCChHHHHHHHHHHcc
Q psy1773 206 YMCDYKGCGKSFY---------------LMASLKSHQRVHVTNPEDLVCKGCGK---------QFKVPCRLREHYKAVHE 261 (359)
Q Consensus 206 ~~C~~~~C~~~f~---------------~~~~l~~H~~~h~~~~~~~~C~~C~~---------~f~~~~~l~~H~~~~H~ 261 (359)
+.|.+ |++.|. +...|++|+.-.| +.+.|.+|-. ..-+...|..|++.--.
T Consensus 100 ~~C~~--C~~~~~~~~~~~~~~~c~~~~s~~~Lk~H~~~~H---~~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~ 174 (669)
T KOG2231|consen 100 HSCHI--CDRRFRALYNKKECLHCTEFKSVENLKNHMRDQH---KLHLCSLCLQNLKIFINERKLYTRAELNLHLMFGDP 174 (669)
T ss_pred hhcCc--cccchhhhcccCCCccccchhHHHHHHHHHHHhh---hhhccccccccceeeeeeeehehHHHHHHHHhcCCC
Confidence 67886 888773 6677888884432 3355555532 12234567777552111
Q ss_pred CCC----ccccccCCcCccccChHhhhccccccccCCcccCCCC----CCcCcccCChHHHHHHHHhhcCCCCceeccCC
Q psy1773 262 STK----NFKCTYDGCKLLFSTKSALKRHNKSKHLNMRMFPCPL----VTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQ 333 (359)
Q Consensus 262 ~~~----~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~----~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~ 333 (359)
+++ --.|.+ |...|.....|..|++..| |.|.+ .+++..|.....|..|-|.+| |.|+.+
T Consensus 175 d~~s~rGhp~C~~--C~~~fld~~el~rH~~~~h-----~~chfC~~~~~~neyy~~~~dLe~HfR~~H-----flCE~~ 242 (669)
T KOG2231|consen 175 DDESCRGHPLCKF--CHERFLDDDELYRHLRFDH-----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH-----FLCEEE 242 (669)
T ss_pred ccccccCCccchh--hhhhhccHHHHHHhhccce-----eheeecCcccccchhcccchHHHHHhhhcC-----cccccc
Confidence 121 135776 8889999999999988655 45555 113567888888999888764 566633
Q ss_pred Cccc-cccchhhHHHHHh
Q psy1773 334 DCLM-SYVAKSSLYAHLK 350 (359)
Q Consensus 334 ~C~~-~f~~~~~l~~H~~ 350 (359)
-|.- .|..-..+..|++
T Consensus 243 ~C~~~~f~~~~~~ei~lk 260 (669)
T KOG2231|consen 243 FCRTKKFYVAFELEIELK 260 (669)
T ss_pred ccccceeeehhHHHHHHH
Confidence 3553 2332224444444
No 50
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.72 E-value=0.0026 Score=59.11 Aligned_cols=61 Identities=28% Similarity=0.526 Sum_probs=52.1
Q ss_pred CCCCCeeeccCCCCCCcccChhHHHhHHhhhCCCCccccccCCCCCccchhhcccHHHHHhhhhhcCC
Q psy1773 73 SKKQQLWSCPIVNCNFTSVNLNTFKIHLLKHFDKRPFKCKLTTANNTPCMWGFFSKHKLLRHMTSHDE 140 (359)
Q Consensus 73 ~~~~~~~~C~~~~C~~~~~~~~~l~~H~~~h~~~~~~~C~~~~~~~~~C~~~f~~~~~l~~H~~~~~~ 140 (359)
......+.|+ .|...|........|.+.+.++.++.|...+ |...+.....+..|...++.
T Consensus 28 ~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~ 88 (467)
T COG5048 28 SNAPRPDSCP--NCTDSFSRLEHLTRHIRSHTGEKPSQCSYSG-----CDKSFSRPLELSRHLRTHHN 88 (467)
T ss_pred ccCCchhhcc--cccccccccchhhhhcccccccCCccccccc-----cccccCCcchhhhhcccccc
Confidence 3456678899 7999999999999999999999999999855 89999999999888877654
No 51
>KOG2231|consensus
Probab=95.62 E-value=0.03 Score=53.23 Aligned_cols=139 Identities=23% Similarity=0.478 Sum_probs=89.9
Q ss_pred cccccccchhhcC---------------ChHHHHHHHHHcCCCCCCccccCCCCC---------ccccChHHHHHHhhhh
Q psy1773 175 LCCEELGCGRKFQ---------------TMKQYSTHLKEHSNVSAPYMCDYKGCG---------KSFYLMASLKSHQRVH 230 (359)
Q Consensus 175 ~~C~~~~C~~~f~---------------~~~~l~~H~~~h~~~~~~~~C~~~~C~---------~~f~~~~~l~~H~~~h 230 (359)
+.|.. |++.|. +...|+.|+...|. -+.|.+ |- ....+...|.+|+..-
T Consensus 100 ~~C~~--C~~~~~~~~~~~~~~~c~~~~s~~~Lk~H~~~~H~---~~~c~l--C~~~~kif~~e~k~Yt~~el~~h~~~g 172 (669)
T KOG2231|consen 100 HSCHI--CDRRFRALYNKKECLHCTEFKSVENLKNHMRDQHK---LHLCSL--CLQNLKIFINERKLYTRAELNLHLMFG 172 (669)
T ss_pred hhcCc--cccchhhhcccCCCccccchhHHHHHHHHHHHhhh---hhcccc--ccccceeeeeeeehehHHHHHHHHhcC
Confidence 67999 998884 78899999966653 255554 32 2233566788887654
Q ss_pred cCCCC----ccccCCcCccCCChHHHHHHHHHHccCCCcccccc----CCcCccccChHhhhccccccccCCcccCCCCC
Q psy1773 231 VTNPE----DLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTY----DGCKLLFSTKSALKRHNKSKHLNMRMFPCPLV 302 (359)
Q Consensus 231 ~~~~~----~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~----~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~ 302 (359)
-.+++ --.|..|...|.....|.+|++..| |.|.+ ++++.-|.....|..|.+..| |.|...
T Consensus 173 d~d~~s~rGhp~C~~C~~~fld~~el~rH~~~~h-----~~chfC~~~~~~neyy~~~~dLe~HfR~~H-----flCE~~ 242 (669)
T KOG2231|consen 173 DPDDESCRGHPLCKFCHERFLDDDELYRHLRFDH-----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH-----FLCEEE 242 (669)
T ss_pred CCccccccCCccchhhhhhhccHHHHHHhhccce-----eheeecCcccccchhcccchHHHHHhhhcC-----cccccc
Confidence 22222 2469999999999999999998555 45554 124566788889999998766 677743
Q ss_pred CcC-cccCChHHHHHHHH----hhcCCCCceecc
Q psy1773 303 TCG-KSFLRSEHLQEHML----QHNENKPKFTCP 331 (359)
Q Consensus 303 ~C~-~~f~~~~~l~~H~~----~h~~~~~~~~C~ 331 (359)
.|. +.|...-.+..|++ .+.-++ .|.|.
T Consensus 243 ~C~~~~f~~~~~~ei~lk~~~~~~~~e~-~~~~~ 275 (669)
T KOG2231|consen 243 FCRTKKFYVAFELEIELKAHNRFIQHEK-CYICR 275 (669)
T ss_pred ccccceeeehhHHHHHHHhhccccchhe-eccCC
Confidence 364 44544444444444 333344 46663
No 52
>KOG1146|consensus
Probab=95.56 E-value=0.0044 Score=62.23 Aligned_cols=104 Identities=22% Similarity=0.409 Sum_probs=60.7
Q ss_pred cCccCCChHHHHHHHHHHccCCCccccccCCcCccccChHhhhcccccccc------------------------CCccc
Q psy1773 242 CGKQFKVPCRLREHYKAVHESTKNFKCTYDGCKLLFSTKSALKRHNKSKHL------------------------NMRMF 297 (359)
Q Consensus 242 C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~------------------------~~~~~ 297 (359)
|+..+.....+..|+...|+-.+.|+|+. |+..|+....|-.|++.+|. +-++|
T Consensus 442 ~e~~~~s~r~~~~~t~~L~S~~kt~~cpk--c~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~ 519 (1406)
T KOG1146|consen 442 AEPLLESKRSLEGQTVVLHSFFKTLKCPK--CNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPY 519 (1406)
T ss_pred hhhhhhhhcccccceeeeecccccccCCc--cchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCcc
Confidence 33344444444444444444455555543 55555555555555554332 23467
Q ss_pred CCCCCCcCcccCChHHHHHHHHhh--c----------------------------------------CCCCceeccCCCc
Q psy1773 298 PCPLVTCGKSFLRSEHLQEHMLQH--N----------------------------------------ENKPKFTCPYQDC 335 (359)
Q Consensus 298 ~C~~~~C~~~f~~~~~l~~H~~~h--~----------------------------------------~~~~~~~C~~~~C 335 (359)
.|.. |..++..+..|.+|+..- - ..++.+.|.+ |
T Consensus 520 ~C~~--C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~v--c 595 (1406)
T KOG1146|consen 520 PCRA--CNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEV--C 595 (1406)
T ss_pred ccee--eeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhh--h
Confidence 7777 888888888887776532 1 1123577888 8
Q ss_pred cccccchhhHHHHHhh
Q psy1773 336 LMSYVAKSSLYAHLKH 351 (359)
Q Consensus 336 ~~~f~~~~~l~~H~~~ 351 (359)
++.-.-..+|..|+..
T Consensus 596 ~yetniarnlrihmts 611 (1406)
T KOG1146|consen 596 SYETNIARNLRIHMTA 611 (1406)
T ss_pred cchhhhhhcccccccc
Confidence 8877777777777653
No 53
>KOG1146|consensus
Probab=95.24 E-value=0.011 Score=59.66 Aligned_cols=74 Identities=23% Similarity=0.442 Sum_probs=64.0
Q ss_pred cCccccChHhhhccccccccCCcccCCCCCCcCcccCChHHHHHHHHhhc-------------------------CCCCc
Q psy1773 273 CKLLFSTKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHN-------------------------ENKPK 327 (359)
Q Consensus 273 C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~-------------------------~~~~~ 327 (359)
|+..+.+...+..|+...|...+.|+|+. |+..|+....|..|||..| +.+ +
T Consensus 442 ~e~~~~s~r~~~~~t~~L~S~~kt~~cpk--c~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~-p 518 (1406)
T KOG1146|consen 442 AEPLLESKRSLEGQTVVLHSFFKTLKCPK--CNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGK-P 518 (1406)
T ss_pred hhhhhhhhcccccceeeeecccccccCCc--cchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCC-c
Confidence 56667777888888888888889999999 9999999999999999833 223 7
Q ss_pred eeccCCCccccccchhhHHHHHhh
Q psy1773 328 FTCPYQDCLMSYVAKSSLYAHLKH 351 (359)
Q Consensus 328 ~~C~~~~C~~~f~~~~~l~~H~~~ 351 (359)
|.|.. |..++..+.+|..|++.
T Consensus 519 ~~C~~--C~~stttng~LsihlqS 540 (1406)
T KOG1146|consen 519 YPCRA--CNYSTTTNGNLSIHLQS 540 (1406)
T ss_pred cccee--eeeeeecchHHHHHHHH
Confidence 99999 99999999999999864
No 54
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.23 E-value=0.0042 Score=57.73 Aligned_cols=64 Identities=22% Similarity=0.312 Sum_probs=38.2
Q ss_pred CCccccCCCCCccccChHHHHHHhhhhcCC-CCccccCCcCccCCChHHHHHHHHHHccCCCcccc
Q psy1773 204 APYMCDYKGCGKSFYLMASLKSHQRVHVTN-PEDLVCKGCGKQFKVPCRLREHYKAVHESTKNFKC 268 (359)
Q Consensus 204 ~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~-~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C 268 (359)
..+.+....|...+.....+..|...+... ...+.+..|.+.|.....+..|++ .|....++.|
T Consensus 385 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 449 (467)
T COG5048 385 KKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKK-IHTNHAPLLC 449 (467)
T ss_pred ccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCccccccccc-ccccCCceee
Confidence 444444444666666666666666555221 124566777777777777777766 6666666555
No 55
>KOG2482|consensus
Probab=95.13 E-value=0.026 Score=48.27 Aligned_cols=159 Identities=21% Similarity=0.305 Sum_probs=94.0
Q ss_pred hHHHHHHHHHcCCCCCCccccCCCCCcccc-ChHHHHHHhh-hhcCC--------------------CCccccCCcCccC
Q psy1773 189 MKQYSTHLKEHSNVSAPYMCDYKGCGKSFY-LMASLKSHQR-VHVTN--------------------PEDLVCKGCGKQF 246 (359)
Q Consensus 189 ~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~-~~~~l~~H~~-~h~~~--------------------~~~~~C~~C~~~f 246 (359)
+..|..+++.-.+.....+|-+ |...+. .++....|+- +|.-. -..+.|-.|.+.|
T Consensus 128 ~eaLeqqQ~Eredt~fslqClF--Cn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekif 205 (423)
T KOG2482|consen 128 KEALEQQQKEREDTIFSLQCLF--CNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIF 205 (423)
T ss_pred HHHHHHHHHHhcCCeeeeEEEE--ecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeecccc
Confidence 4566667666655456778988 886654 4555666653 34211 1237899999999
Q ss_pred CChHHHHHHHHHH-ccCCCcccccc--------CCcCccccCh-Hhh-----hccccc--------cccCCcc--cCCCC
Q psy1773 247 KVPCRLREHYKAV-HESTKNFKCTY--------DGCKLLFSTK-SAL-----KRHNKS--------KHLNMRM--FPCPL 301 (359)
Q Consensus 247 ~~~~~l~~H~~~~-H~~~~~~~C~~--------~~C~~~f~~~-~~l-----~~H~~~--------~H~~~~~--~~C~~ 301 (359)
+.+..|+.||+.. |....|-.=.+ -+=|++..-. +.+ ..+... .+ +..+ ..|-.
T Consensus 206 rdkntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed-~a~a~~v~CLf 284 (423)
T KOG2482|consen 206 RDKNTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNED-DAEALSVVCLF 284 (423)
T ss_pred CCcHHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcC-CCCccceEEEe
Confidence 9999999999743 43333311111 1122221111 111 011110 00 1112 58988
Q ss_pred CCcCcccCChHHHHHHHHhhcC--------C----------------CCceeccCCCccccccchhhHHHHHhhh
Q psy1773 302 VTCGKSFLRSEHLQEHMLQHNE--------N----------------KPKFTCPYQDCLMSYVAKSSLYAHLKHV 352 (359)
Q Consensus 302 ~~C~~~f~~~~~l~~H~~~h~~--------~----------------~~~~~C~~~~C~~~f~~~~~l~~H~~~h 352 (359)
|.....+...|..||.+.|. + +-.+.|...+|.-.|.....|..||..+
T Consensus 285 --C~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e~ 357 (423)
T KOG2482|consen 285 --CTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVED 357 (423)
T ss_pred --eccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhcccc
Confidence 99999999999999987651 0 0023333344999999999999998654
No 56
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=95.07 E-value=0.0081 Score=31.36 Aligned_cols=22 Identities=27% Similarity=0.590 Sum_probs=13.4
Q ss_pred eeccCCCccccccchhhHHHHHhh
Q psy1773 328 FTCPYQDCLMSYVAKSSLYAHLKH 351 (359)
Q Consensus 328 ~~C~~~~C~~~f~~~~~l~~H~~~ 351 (359)
|.|.. |++.|.+...|..|++.
T Consensus 2 ~~C~~--C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDA--CDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTT--TTBBBSSHHHHHCCTTS
T ss_pred CCccc--CCCCcCCHHHHHHHHcc
Confidence 55666 66666666666666543
No 57
>PRK04860 hypothetical protein; Provisional
Probab=95.05 E-value=0.012 Score=45.83 Aligned_cols=40 Identities=18% Similarity=0.368 Sum_probs=32.7
Q ss_pred cccCCCCCCcCcccCChHHHHHHHHhhcCCCCceeccCCCccccccchh
Q psy1773 295 RMFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSYVAKS 343 (359)
Q Consensus 295 ~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~~~~ 343 (359)
-+|.|. |+. ....+.+|.++|++++ +|.|.. |+..|....
T Consensus 118 ~~Y~C~---C~~---~~~~~rrH~ri~~g~~-~YrC~~--C~~~l~~~~ 157 (160)
T PRK04860 118 FPYRCK---CQE---HQLTVRRHNRVVRGEA-VYRCRR--CGETLVFKG 157 (160)
T ss_pred EEEEcC---CCC---eeCHHHHHHHHhcCCc-cEECCC--CCceeEEec
Confidence 368895 887 7788899999999988 699988 998886543
No 58
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.81 E-value=0.009 Score=31.17 Aligned_cols=21 Identities=38% Similarity=0.774 Sum_probs=10.7
Q ss_pred cCCCCCCcCcccCChHHHHHHHH
Q psy1773 297 FPCPLVTCGKSFLRSEHLQEHML 319 (359)
Q Consensus 297 ~~C~~~~C~~~f~~~~~l~~H~~ 319 (359)
|.|.. |++.|.+...|..|++
T Consensus 2 ~~C~~--C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDA--CDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTT--TTBBBSSHHHHHCCTT
T ss_pred CCccc--CCCCcCCHHHHHHHHc
Confidence 34554 5555555555555544
No 59
>PRK04860 hypothetical protein; Provisional
Probab=94.18 E-value=0.027 Score=43.94 Aligned_cols=37 Identities=22% Similarity=0.504 Sum_probs=22.6
Q ss_pred ccccCCcCccCCChHHHHHHHHHHccCCCccccccCCcCccccC
Q psy1773 236 DLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTYDGCKLLFST 279 (359)
Q Consensus 236 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~ 279 (359)
+|.|. |+. ....+.+|.+ +|+++++|.|.. |+..|..
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~r-i~~g~~~YrC~~--C~~~l~~ 155 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNR-VVRGEAVYRCRR--CGETLVF 155 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHH-HhcCCccEECCC--CCceeEE
Confidence 46665 665 4555666655 666666666653 6666543
No 60
>KOG2482|consensus
Probab=93.70 E-value=0.15 Score=43.83 Aligned_cols=54 Identities=26% Similarity=0.378 Sum_probs=38.9
Q ss_pred CccccccccchhhcC-ChHHHHHHHHHcCCC---------------------CCCccccCCCCCccccChHHHHHHhhhh
Q psy1773 173 LTLCCEELGCGRKFQ-TMKQYSTHLKEHSNV---------------------SAPYMCDYKGCGKSFYLMASLKSHQRVH 230 (359)
Q Consensus 173 ~~~~C~~~~C~~~f~-~~~~l~~H~~~h~~~---------------------~~~~~C~~~~C~~~f~~~~~l~~H~~~h 230 (359)
...+|-. |...+. +.+....|+-..++- -..++|-+ |.+.|+.+..|+.||+.-
T Consensus 143 fslqClF--Cn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLy--CekifrdkntLkeHMrkK 218 (423)
T KOG2482|consen 143 FSLQCLF--CNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLY--CEKIFRDKNTLKEHMRKK 218 (423)
T ss_pred eeeEEEE--ecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeee--eccccCCcHHHHHHHHhc
Confidence 3467888 977664 466677776554320 12378988 999999999999999863
No 61
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=92.12 E-value=0.11 Score=28.76 Aligned_cols=23 Identities=35% Similarity=0.670 Sum_probs=17.3
Q ss_pred ceeccCCCccccccchhhHHHHHhh
Q psy1773 327 KFTCPYQDCLMSYVAKSSLYAHLKH 351 (359)
Q Consensus 327 ~~~C~~~~C~~~f~~~~~l~~H~~~ 351 (359)
+|.|.+ |+..|.+...+..|++.
T Consensus 3 ~~~C~~--C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKL--CNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccc--cCCccCCHHHHHHHHCh
Confidence 477887 88888877777777654
No 62
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=91.93 E-value=0.18 Score=25.66 Aligned_cols=19 Identities=42% Similarity=1.230 Sum_probs=9.7
Q ss_pred CCCCCCcCcccCChHHHHHHHH
Q psy1773 298 PCPLVTCGKSFLRSEHLQEHML 319 (359)
Q Consensus 298 ~C~~~~C~~~f~~~~~l~~H~~ 319 (359)
.|+. ||+.| ....|..|+.
T Consensus 4 ~C~~--CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPI--CGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCC--CCCEE-CHHHHHHHHH
Confidence 4554 55555 4445555543
No 63
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=91.62 E-value=0.14 Score=28.35 Aligned_cols=23 Identities=26% Similarity=0.564 Sum_probs=19.3
Q ss_pred ccCCCCCCcCcccCChHHHHHHHHh
Q psy1773 296 MFPCPLVTCGKSFLRSEHLQEHMLQ 320 (359)
Q Consensus 296 ~~~C~~~~C~~~f~~~~~l~~H~~~ 320 (359)
+|.|++ |++.|.+...+..|+..
T Consensus 3 ~~~C~~--C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKL--CNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccc--cCCccCCHHHHHHHHCh
Confidence 578988 99999988888888754
No 64
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=90.66 E-value=0.21 Score=25.41 Aligned_cols=21 Identities=29% Similarity=0.536 Sum_probs=17.7
Q ss_pred eeccCCCccccccchhhHHHHHhh
Q psy1773 328 FTCPYQDCLMSYVAKSSLYAHLKH 351 (359)
Q Consensus 328 ~~C~~~~C~~~f~~~~~l~~H~~~ 351 (359)
..|+. ||+.| ....|.+|+..
T Consensus 3 ~~C~~--CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPI--CGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCC--CCCEE-CHHHHHHHHHh
Confidence 57999 99999 67889999763
No 65
>KOG4377|consensus
Probab=87.51 E-value=0.41 Score=42.35 Aligned_cols=28 Identities=18% Similarity=0.383 Sum_probs=24.3
Q ss_pred ceeccCCCccccccchhhHHHHHhhhcC
Q psy1773 327 KFTCPYQDCLMSYVAKSSLYAHLKHVHS 354 (359)
Q Consensus 327 ~~~C~~~~C~~~f~~~~~l~~H~~~hh~ 354 (359)
-|.|...||+..|.+.+.+..|.+.|-.
T Consensus 401 hfhc~r~Gc~~tl~s~sqm~shkrkheR 428 (480)
T KOG4377|consen 401 HFHCDRLGCEATLYSVSQMASHKRKHER 428 (480)
T ss_pred eeeecccCCceEEEehhhhhhhhhhhhh
Confidence 4789999999999999999999887743
No 66
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=86.93 E-value=1.3 Score=32.24 Aligned_cols=26 Identities=23% Similarity=0.533 Sum_probs=23.4
Q ss_pred ceec----cCCCccccccchhhHHHHHhhhcC
Q psy1773 327 KFTC----PYQDCLMSYVAKSSLYAHLKHVHS 354 (359)
Q Consensus 327 ~~~C----~~~~C~~~f~~~~~l~~H~~~hh~ 354 (359)
-|.| .. |++.+.+...+.+|++.+|+
T Consensus 80 G~~C~~~~~~--C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 80 GYRCQCDPPH--CGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CeeeecCCCC--CCcEeccHHHHHHHHHHhcC
Confidence 3889 77 99999999999999999986
No 67
>KOG2785|consensus
Probab=85.86 E-value=3 Score=36.93 Aligned_cols=22 Identities=18% Similarity=0.166 Sum_probs=17.6
Q ss_pred CCCccchhhcccHHHHHhhhhh
Q psy1773 116 ANNTPCMWGFFSKHKLLRHMTS 137 (359)
Q Consensus 116 ~~~~~C~~~f~~~~~l~~H~~~ 137 (359)
++|..|...|.+...-+.|+++
T Consensus 4 ftC~tC~v~F~~ad~Qr~HyKS 25 (390)
T KOG2785|consen 4 FTCNTCNVEFDDADEQRAHYKS 25 (390)
T ss_pred ceeeceeeeeccHHHHHHHhhh
Confidence 3444599999999999999875
No 68
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=85.56 E-value=0.46 Score=28.98 Aligned_cols=28 Identities=29% Similarity=0.468 Sum_probs=18.2
Q ss_pred CCCccccCCcCccCCChHHHHHHHHHHc
Q psy1773 233 NPEDLVCKGCGKQFKVPCRLREHYKAVH 260 (359)
Q Consensus 233 ~~~~~~C~~C~~~f~~~~~l~~H~~~~H 260 (359)
++.-+.|+.|+..|....++.+|+...|
T Consensus 14 GE~~lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 14 GEEFLRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred CceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence 4455667777777777777777765444
No 69
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=84.32 E-value=0.53 Score=28.73 Aligned_cols=24 Identities=29% Similarity=0.663 Sum_probs=11.7
Q ss_pred CcccCCCCCCcCcccCChHHHHHHHH
Q psy1773 294 MRMFPCPLVTCGKSFLRSEHLQEHML 319 (359)
Q Consensus 294 ~~~~~C~~~~C~~~f~~~~~l~~H~~ 319 (359)
+.-+.|+- |+..|....++.+|..
T Consensus 15 E~~lrCPR--C~~~FR~~K~Y~RHVN 38 (65)
T COG4049 15 EEFLRCPR--CGMVFRRRKDYIRHVN 38 (65)
T ss_pred ceeeeCCc--hhHHHHHhHHHHHHhh
Confidence 33444544 5555555555555543
No 70
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=83.49 E-value=1.2 Score=32.47 Aligned_cols=82 Identities=16% Similarity=0.293 Sum_probs=48.3
Q ss_pred CccccCCcCccCCChHHHHHHHHHHccCCCccccc-c----CCcCccccChHhhh---------ccccccccCCcccCC-
Q psy1773 235 EDLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCT-Y----DGCKLLFSTKSALK---------RHNKSKHLNMRMFPC- 299 (359)
Q Consensus 235 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~-~----~~C~~~f~~~~~l~---------~H~~~~H~~~~~~~C- 299 (359)
+...|..|+....- +.+..|++..|...+...-. + ..-.........+. ..+. .+ ..|.|
T Consensus 10 ~vlIC~~C~~av~~-~~v~~HL~~~H~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~p~~~~~Pi~gLp-~~---~G~~C~ 84 (109)
T PF12013_consen 10 RVLICRQCQYAVQP-SEVESHLRKRHHILKSQERQRIVEAIRQWPDLLPDPDDLQIPPDPSPPIPGLP-VY---DGYRCQ 84 (109)
T ss_pred CEEEeCCCCcccCc-hHHHHHHHHhcccccHHHHHHHHHHHHhhhhcccCccccCCCCCCCCcCCCCC-CC---CCeeee
Confidence 56789999988776 88999999666533211100 0 00000000111110 0011 11 33899
Q ss_pred ---CCCCcCcccCChHHHHHHHHhhcC
Q psy1773 300 ---PLVTCGKSFLRSEHLQEHMLQHNE 323 (359)
Q Consensus 300 ---~~~~C~~~f~~~~~l~~H~~~h~~ 323 (359)
.. |+..+.+...+..|++.+||
T Consensus 85 ~~~~~--C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 85 CDPPH--CGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred cCCCC--CCcEeccHHHHHHHHHHhcC
Confidence 76 99999999999999998875
No 71
>KOG2893|consensus
Probab=82.55 E-value=0.52 Score=38.44 Aligned_cols=38 Identities=26% Similarity=0.420 Sum_probs=17.5
Q ss_pred CCccccChHHHHHHhhhhcCCCCccccCCcCccCCChHHHHHH
Q psy1773 213 CGKSFYLMASLKSHQRVHVTNPEDLVCKGCGKQFKVPCRLREH 255 (359)
Q Consensus 213 C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~~H 255 (359)
|++.|....-|.+|++. +-|+|.+|.+.+-+-..|..|
T Consensus 16 cnrefddekiliqhqka-----khfkchichkkl~sgpglsih 53 (341)
T KOG2893|consen 16 CNREFDDEKILIQHQKA-----KHFKCHICHKKLFSGPGLSIH 53 (341)
T ss_pred cccccchhhhhhhhhhh-----ccceeeeehhhhccCCCceee
Confidence 45555555545444432 124455554444444444444
No 72
>KOG2785|consensus
Probab=81.86 E-value=2.2 Score=37.77 Aligned_cols=22 Identities=23% Similarity=0.680 Sum_probs=18.1
Q ss_pred cccCCCccchhhcCHHHHHHHHHH
Q psy1773 145 YPCDQKNCNKRFTTLSNLKMHMVR 168 (359)
Q Consensus 145 ~~c~c~~C~~~f~~~~~l~~H~~~ 168 (359)
|.|. .|...|.+....+.|+++
T Consensus 4 ftC~--tC~v~F~~ad~Qr~HyKS 25 (390)
T KOG2785|consen 4 FTCN--TCNVEFDDADEQRAHYKS 25 (390)
T ss_pred ceee--ceeeeeccHHHHHHHhhh
Confidence 5666 699999999888999875
No 73
>KOG2893|consensus
Probab=81.64 E-value=0.46 Score=38.73 Aligned_cols=38 Identities=34% Similarity=0.717 Sum_probs=20.1
Q ss_pred cCccccChHhhhccccccccCCcccCCCCCCcCcccCChHHHHHH
Q psy1773 273 CKLLFSTKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEH 317 (359)
Q Consensus 273 C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H 317 (359)
|.+.|.....|.+|++..| |+|.+ |.+..-+...|..|
T Consensus 16 cnrefddekiliqhqkakh-----fkchi--chkkl~sgpglsih 53 (341)
T KOG2893|consen 16 CNREFDDEKILIQHQKAKH-----FKCHI--CHKKLFSGPGLSIH 53 (341)
T ss_pred cccccchhhhhhhhhhhcc-----ceeee--ehhhhccCCCceee
Confidence 5555555555555554333 55555 55555444445444
No 74
>PF01352 KRAB: KRAB box; InterPro: IPR001909 The Krueppel-associated box (KRAB) is a domain of around 75 amino acids that is found in the N-terminal part of about one third of eukaryotic Krueppel-type C2H2 zinc finger proteins (ZFPs) []. It is enriched in charged amino acids and can be divided into subregions A and B, which are predicted to fold into two amphipathic alpha-helices. The KRAB A and B boxes can be separated by variable spacer segments and many KRAB proteins contain only the A box []. The functions currently known for members of the KRAB-containing protein family include transcriptional repression of RNA polymerase I, II, and III promoters, binding and splicing of RNA, and control of nucleolus function. The KRAB domain functions as a transcriptional repressor when tethered to the template DNA by a DNA-binding domain. A sequence of 45 amino acids in the KRAB A subdomain has been shown to be necessary and sufficient for transcriptional repression. The B box does not repress by itself but does potentiate the repression exerted by the KRAB A subdomain [, ]. Gene silencing requires the binding of the KRAB domain to the RING-B box-coiled coil (RBCC) domain of the KAP-1/TIF1-beta corepressor. As KAP-1 binds to the heterochromatin proteins HP1, it has been proposed that the KRAB-ZFP-bound target gene could be silenced following recruitment to heterochromatin [, ]. KRAB-ZFPs probably constitute the single largest class of transcription factors within the human genome []. Although the function of KRAB-ZFPs is largely unknown, they appear to play important roles during cell differentiation and development. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1V65_A.
Probab=78.39 E-value=0.5 Score=27.39 Aligned_cols=33 Identities=15% Similarity=0.093 Sum_probs=16.3
Q ss_pred CCchhHHHHHHHHHHHHHHHHHhhhhhhhcChhHHHHHH
Q psy1773 15 DDVSARNVLTKLAVLYSTWSLEKHLATFYEDEEDAITEA 53 (359)
Q Consensus 15 ~~v~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 53 (359)
.||.+....++|. .|++.++.+|.+...++..+
T Consensus 4 ~Dvav~fs~eEW~------~L~~~Qk~ly~dvm~Eny~~ 36 (41)
T PF01352_consen 4 EDVAVYFSQEEWE------LLDPAQKNLYRDVMLENYRN 36 (41)
T ss_dssp ---TT---HHHHH------TS-HHHHHHHHHHHHHTTTS
T ss_pred EEEEEEcChhhcc------cccceecccchhHHHHhhcc
Confidence 5666666666666 66666777776666554443
No 75
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=71.70 E-value=2 Score=23.44 Aligned_cols=9 Identities=33% Similarity=1.014 Sum_probs=4.6
Q ss_pred ceeccCCCccc
Q psy1773 327 KFTCPYQDCLM 337 (359)
Q Consensus 327 ~~~C~~~~C~~ 337 (359)
++.|++ ||.
T Consensus 17 ~~~CP~--Cg~ 25 (33)
T cd00350 17 PWVCPV--CGA 25 (33)
T ss_pred CCcCcC--CCC
Confidence 455555 553
No 76
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=70.62 E-value=3 Score=24.51 Aligned_cols=25 Identities=40% Similarity=0.566 Sum_probs=13.2
Q ss_pred ceeccCCCccccccch----hhHHHHHhhhc
Q psy1773 327 KFTCPYQDCLMSYVAK----SSLYAHLKHVH 353 (359)
Q Consensus 327 ~~~C~~~~C~~~f~~~----~~l~~H~~~hh 353 (359)
...|.. |++.+... +.|.+|++..|
T Consensus 16 ~a~C~~--C~~~~~~~~~~ts~l~~HL~~~h 44 (45)
T PF02892_consen 16 KAKCKY--CGKVIKYSSGGTSNLKRHLKKKH 44 (45)
T ss_dssp -EEETT--TTEE-----SSTHHHHHHHHHTT
T ss_pred eEEeCC--CCeEEeeCCCcHHHHHHhhhhhC
Confidence 466777 77666543 56777775544
No 77
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=69.03 E-value=7.8 Score=28.09 Aligned_cols=25 Identities=20% Similarity=0.414 Sum_probs=16.1
Q ss_pred cccCCCCCCcCcccCChHHHHHHHHhh
Q psy1773 295 RMFPCPLVTCGKSFLRSEHLQEHMLQH 321 (359)
Q Consensus 295 ~~~~C~~~~C~~~f~~~~~l~~H~~~h 321 (359)
..|.|+. |...|-..-++-.|...|
T Consensus 80 ~~y~C~~--C~~~FC~dCD~fiHe~Lh 104 (112)
T TIGR00622 80 HRYVCAV--CKNVFCVDCDVFVHESLH 104 (112)
T ss_pred cceeCCC--CCCccccccchhhhhhcc
Confidence 3577776 777776666666665444
No 78
>KOG4377|consensus
Probab=68.68 E-value=3.8 Score=36.56 Aligned_cols=26 Identities=19% Similarity=0.549 Sum_probs=23.1
Q ss_pred cCCCCCCcCcccCChHHHHHHHHhhc
Q psy1773 297 FPCPLVTCGKSFLRSEHLQEHMLQHN 322 (359)
Q Consensus 297 ~~C~~~~C~~~f~~~~~l~~H~~~h~ 322 (359)
|-|...+|+..+.+.+.+..|.|.|.
T Consensus 402 fhc~r~Gc~~tl~s~sqm~shkrkhe 427 (480)
T KOG4377|consen 402 FHCDRLGCEATLYSVSQMASHKRKHE 427 (480)
T ss_pred eeecccCCceEEEehhhhhhhhhhhh
Confidence 66888899999999999999998884
No 79
>KOG4124|consensus
Probab=67.69 E-value=1.5 Score=38.14 Aligned_cols=29 Identities=28% Similarity=0.608 Sum_probs=23.4
Q ss_pred CCccccccCCcCccccChHhhhccccccc
Q psy1773 263 TKNFKCTYDGCKLLFSTKSALKRHNKSKH 291 (359)
Q Consensus 263 ~~~~~C~~~~C~~~f~~~~~l~~H~~~~H 291 (359)
.++|+|+++.|++.++....|..|...-|
T Consensus 347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~~h 375 (442)
T KOG4124|consen 347 DKPYKCPVPNCDKAYKNQNGLKYHKLHGH 375 (442)
T ss_pred cCCCCCCCCcchhhcccCcceeeccccCc
Confidence 46899999999999998888888876433
No 80
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=66.59 E-value=5.6 Score=24.10 Aligned_cols=25 Identities=32% Similarity=0.580 Sum_probs=13.6
Q ss_pred eeccCCCccccccch-----hhHHHHHhhhcC
Q psy1773 328 FTCPYQDCLMSYVAK-----SSLYAHLKHVHS 354 (359)
Q Consensus 328 ~~C~~~~C~~~f~~~-----~~l~~H~~~hh~ 354 (359)
-.|.. |++.+... +.|.+|++..|+
T Consensus 19 a~C~~--C~~~l~~~~~~gTs~L~rHl~~~h~ 48 (50)
T smart00614 19 AKCKY--CGKKLSRSSKGGTSNLRRHLRRKHP 48 (50)
T ss_pred EEecC--CCCEeeeCCCCCcHHHHHHHHhHCc
Confidence 44655 66655443 466666664443
No 81
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=63.93 E-value=5.4 Score=28.85 Aligned_cols=15 Identities=33% Similarity=0.933 Sum_probs=7.5
Q ss_pred CccccCCcCccCCCh
Q psy1773 235 EDLVCKGCGKQFKVP 249 (359)
Q Consensus 235 ~~~~C~~C~~~f~~~ 249 (359)
.|..|+.||..|.-.
T Consensus 25 ~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 25 DPIVCPKCGTEFPPE 39 (108)
T ss_pred CCccCCCCCCccCcc
Confidence 445555555555433
No 82
>PF12907 zf-met2: Zinc-binding
Probab=61.50 E-value=4.8 Score=23.13 Aligned_cols=32 Identities=28% Similarity=0.585 Sum_probs=17.1
Q ss_pred cccCCcCccC---CChHHHHHHHHHHccCCCcccc
Q psy1773 237 LVCKGCGKQF---KVPCRLREHYKAVHESTKNFKC 268 (359)
Q Consensus 237 ~~C~~C~~~f---~~~~~l~~H~~~~H~~~~~~~C 268 (359)
+.|.+|..+| .+...|..|....|....+-.|
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~~~~~C 36 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKNTFEEC 36 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCCCHHHc
Confidence 4566666333 3445666666656655433333
No 83
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=61.01 E-value=5.4 Score=31.30 Aligned_cols=13 Identities=31% Similarity=0.623 Sum_probs=6.8
Q ss_pred ccCCCccccccCCcC
Q psy1773 260 HESTKNFKCTYDGCK 274 (359)
Q Consensus 260 H~~~~~~~C~~~~C~ 274 (359)
|.++.|-+||+ ||
T Consensus 144 ~~ge~P~~CPi--Cg 156 (166)
T COG1592 144 HEGEAPEVCPI--CG 156 (166)
T ss_pred ccCCCCCcCCC--CC
Confidence 34455555655 55
No 84
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=60.64 E-value=1.7 Score=36.00 Aligned_cols=13 Identities=23% Similarity=0.708 Sum_probs=9.8
Q ss_pred eeccCCCccccccch
Q psy1773 328 FTCPYQDCLMSYVAK 342 (359)
Q Consensus 328 ~~C~~~~C~~~f~~~ 342 (359)
..|+. ||.+|...
T Consensus 49 ~vCP~--CgyA~~~~ 61 (214)
T PF09986_consen 49 WVCPH--CGYAAFEE 61 (214)
T ss_pred EECCC--CCCccccc
Confidence 56888 99888654
No 85
>KOG2186|consensus
Probab=59.67 E-value=5.8 Score=33.04 Aligned_cols=46 Identities=28% Similarity=0.626 Sum_probs=22.9
Q ss_pred ccccCCCCCccccChHHHHHHhhhhcCCCCccccCCcCccCCChHHHHHHHH
Q psy1773 206 YMCDYKGCGKSFYLMASLKSHQRVHVTNPEDLVCKGCGKQFKVPCRLREHYK 257 (359)
Q Consensus 206 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~ 257 (359)
|.|.+ ||....-. .+.+|+... .+ .-|.|-.|++.|.. ..+..|..
T Consensus 4 FtCnv--CgEsvKKp-~vekH~srC-rn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNV--CGESVKKP-QVEKHMSRC-RN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhh--hhhhcccc-chHHHHHhc-cC-CeeEEeeccccccc-chhhhhhh
Confidence 44554 55544332 344455554 22 44566666666655 44555543
No 86
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=58.81 E-value=4.6 Score=22.24 Aligned_cols=24 Identities=29% Similarity=0.775 Sum_probs=15.0
Q ss_pred cCCCCCCcCcccCChHHHHHHHHhhcCCCCceeccCCCccc
Q psy1773 297 FPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLM 337 (359)
Q Consensus 297 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~ 337 (359)
|.|.+ ||..+.. +.+|..|++ ||.
T Consensus 3 ~~C~~--CG~i~~g-------------~~~p~~CP~--Cg~ 26 (34)
T cd00729 3 WVCPV--CGYIHEG-------------EEAPEKCPI--CGA 26 (34)
T ss_pred EECCC--CCCEeEC-------------CcCCCcCcC--CCC
Confidence 67777 7765432 223677887 875
No 87
>PF07800 DUF1644: Protein of unknown function (DUF1644); InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain.
Probab=56.93 E-value=25 Score=27.21 Aligned_cols=38 Identities=26% Similarity=0.448 Sum_probs=22.5
Q ss_pred HccCCCccccccCCcCccccChHhhhccccccccCCccc
Q psy1773 259 VHESTKNFKCTYDGCKLLFSTKSALKRHNKSKHLNMRMF 297 (359)
Q Consensus 259 ~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~ 297 (359)
.+.+.|+-.|..+.|.+. .+...|..|.+..|...+|-
T Consensus 101 ~~LN~K~RsC~~e~C~F~-GtY~eLrKHar~~HP~~rP~ 138 (162)
T PF07800_consen 101 RFLNAKKRSCSQESCSFS-GTYSELRKHARSEHPSARPS 138 (162)
T ss_pred HHhccCCccCcccccccc-cCHHHHHHHHHhhCCCCCCc
Confidence 444556666666666533 24456777777667666553
No 88
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=54.08 E-value=14 Score=18.83 Aligned_cols=19 Identities=21% Similarity=0.646 Sum_probs=9.9
Q ss_pred CCCCCCcCcccCChHHHHHHHH
Q psy1773 298 PCPLVTCGKSFLRSEHLQEHML 319 (359)
Q Consensus 298 ~C~~~~C~~~f~~~~~l~~H~~ 319 (359)
.|++ |++.+ ....+..|+.
T Consensus 3 ~CPi--C~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCPV--CFREV-PENLINSHLD 21 (26)
T ss_pred cCCC--CcCcc-cHHHHHHHHH
Confidence 4555 65555 4445555543
No 89
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=54.07 E-value=9.3 Score=27.65 Aligned_cols=31 Identities=29% Similarity=0.636 Sum_probs=23.0
Q ss_pred eeeccCCCCCCcccChhHHHhHHhhhCCCCccccccCCCCCccchhhcccH
Q psy1773 78 LWSCPIVNCNFTSVNLNTFKIHLLKHFDKRPFKCKLTTANNTPCMWGFFSK 128 (359)
Q Consensus 78 ~~~C~~~~C~~~~~~~~~l~~H~~~h~~~~~~~C~~~~~~~~~C~~~f~~~ 128 (359)
+..|+ .||.+|-.+ +..|..|+. ||..|.-.
T Consensus 9 KR~Cp--~CG~kFYDL-----------nk~PivCP~-------CG~~~~~~ 39 (108)
T PF09538_consen 9 KRTCP--SCGAKFYDL-----------NKDPIVCPK-------CGTEFPPE 39 (108)
T ss_pred cccCC--CCcchhccC-----------CCCCccCCC-------CCCccCcc
Confidence 35788 899988765 457888888 77777655
No 90
>KOG2636|consensus
Probab=53.57 E-value=21 Score=32.61 Aligned_cols=25 Identities=16% Similarity=0.530 Sum_probs=15.3
Q ss_pred CCCccccccCCCCCccch-hhcccHHHHHhhhh
Q psy1773 105 DKRPFKCKLTTANNTPCM-WGFFSKHKLLRHMT 136 (359)
Q Consensus 105 ~~~~~~C~~~~~~~~~C~-~~f~~~~~l~~H~~ 136 (359)
-...|.|.+ || +++.-...+.+|..
T Consensus 398 L~~ey~CEI-------CGNy~Y~GrkaF~RHF~ 423 (497)
T KOG2636|consen 398 LDIEYNCEI-------CGNYVYKGRKAFDRHFN 423 (497)
T ss_pred CCcccceee-------ccCccccCcHHHHHHhH
Confidence 345566776 66 56666666666654
No 91
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=51.06 E-value=5.2 Score=26.47 Aligned_cols=15 Identities=13% Similarity=0.609 Sum_probs=6.9
Q ss_pred ceeccCCCccccccc
Q psy1773 327 KFTCPYQDCLMSYVA 341 (359)
Q Consensus 327 ~~~C~~~~C~~~f~~ 341 (359)
.++|...+||.+|..
T Consensus 27 Y~qC~N~eCg~tF~t 41 (72)
T PRK09678 27 YHQCQNVNCSATFIT 41 (72)
T ss_pred eeecCCCCCCCEEEE
Confidence 345543335555543
No 92
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=49.08 E-value=7.5 Score=32.19 Aligned_cols=13 Identities=31% Similarity=0.933 Sum_probs=9.6
Q ss_pred cCCCCCCcCcccCCh
Q psy1773 297 FPCPLVTCGKSFLRS 311 (359)
Q Consensus 297 ~~C~~~~C~~~f~~~ 311 (359)
..|+. ||.+|...
T Consensus 49 ~vCP~--CgyA~~~~ 61 (214)
T PF09986_consen 49 WVCPH--CGYAAFEE 61 (214)
T ss_pred EECCC--CCCccccc
Confidence 45988 99887644
No 93
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=48.98 E-value=21 Score=25.91 Aligned_cols=48 Identities=13% Similarity=0.324 Sum_probs=30.9
Q ss_pred CCCCCCcCcccCChHHHHHHHHhhcCCCCceeccCCCccccccchhhHHHHHhhh
Q psy1773 298 PCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSYVAKSSLYAHLKHV 352 (359)
Q Consensus 298 ~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h 352 (359)
.|-- |...|........ .-..+...|+|+. |...|=..-+.-.|...|
T Consensus 57 ~C~~--C~~~f~~~~~~~~---~~~~~~~~y~C~~--C~~~FC~dCD~fiHe~Lh 104 (112)
T TIGR00622 57 FCFG--CQGPFPKPPVSPF---DELKDSHRYVCAV--CKNVFCVDCDVFVHESLH 104 (112)
T ss_pred cccC--cCCCCCCcccccc---cccccccceeCCC--CCCccccccchhhhhhcc
Confidence 4876 9999986542110 0011122699999 999998777777776554
No 94
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=45.15 E-value=10 Score=31.24 Aligned_cols=23 Identities=39% Similarity=0.972 Sum_probs=10.6
Q ss_pred ccCCCCCCcCcccCChHHHHHHHHh
Q psy1773 296 MFPCPLVTCGKSFLRSEHLQEHMLQ 320 (359)
Q Consensus 296 ~~~C~~~~C~~~f~~~~~l~~H~~~ 320 (359)
.|.|++ |+|.|.-..-...|+..
T Consensus 77 K~~C~l--c~KlFkg~eFV~KHI~n 99 (214)
T PF04959_consen 77 KWRCPL--CGKLFKGPEFVRKHIFN 99 (214)
T ss_dssp EEEE-S--SS-EESSHHHHHHHHHH
T ss_pred EECCCC--CCcccCChHHHHHHHhh
Confidence 355554 55555555555555443
No 95
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=45.10 E-value=14 Score=27.88 Aligned_cols=22 Identities=27% Similarity=0.528 Sum_probs=12.0
Q ss_pred eeccCCCccccccchhhHHHHHhhhcC
Q psy1773 328 FTCPYQDCLMSYVAKSSLYAHLKHVHS 354 (359)
Q Consensus 328 ~~C~~~~C~~~f~~~~~l~~H~~~hh~ 354 (359)
..|-+ ||+.|.. |.+|++.|||
T Consensus 73 i~cle--cGk~~k~---LkrHL~~~~g 94 (132)
T PF05443_consen 73 IICLE--CGKKFKT---LKRHLRTHHG 94 (132)
T ss_dssp EE-TB--T--EESB---HHHHHHHTT-
T ss_pred eEEcc--CCcccch---HHHHHHHccC
Confidence 56766 7777743 4777777765
No 96
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=44.90 E-value=18 Score=20.16 Aligned_cols=6 Identities=50% Similarity=1.475 Sum_probs=2.2
Q ss_pred ccCCcC
Q psy1773 238 VCKGCG 243 (359)
Q Consensus 238 ~C~~C~ 243 (359)
+|+.|+
T Consensus 27 ~C~~C~ 32 (36)
T PF13717_consen 27 RCSKCG 32 (36)
T ss_pred ECCCCC
Confidence 333333
No 97
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=44.44 E-value=18 Score=29.88 Aligned_cols=28 Identities=36% Similarity=0.568 Sum_probs=21.7
Q ss_pred CCccccCCcCccCCChHHHHHHHHHHcc
Q psy1773 234 PEDLVCKGCGKQFKVPCRLREHYKAVHE 261 (359)
Q Consensus 234 ~~~~~C~~C~~~f~~~~~l~~H~~~~H~ 261 (359)
+..|.|..|+|.|.-..-+.+|+...|.
T Consensus 75 ~~K~~C~lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 75 EDKWRCPLCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp SEEEEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred CCEECCCCCCcccCChHHHHHHHhhcCH
Confidence 4569999999999999999999987785
No 98
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=37.87 E-value=22 Score=26.58 Aligned_cols=22 Identities=23% Similarity=0.167 Sum_probs=12.5
Q ss_pred eeccCCCccccccchhhHHHHHhhhcC
Q psy1773 328 FTCPYQDCLMSYVAKSSLYAHLKHVHS 354 (359)
Q Consensus 328 ~~C~~~~C~~~f~~~~~l~~H~~~hh~ 354 (359)
..|-. +|+.|. +|++|+.+|++
T Consensus 77 IicLE--DGkkfK---SLKRHL~t~~g 98 (148)
T COG4957 77 IICLE--DGKKFK---SLKRHLTTHYG 98 (148)
T ss_pred EEEec--cCcchH---HHHHHHhcccC
Confidence 44655 666663 45666666654
No 99
>KOG0717|consensus
Probab=37.32 E-value=1.6e+02 Score=27.46 Aligned_cols=22 Identities=23% Similarity=0.237 Sum_probs=18.7
Q ss_pred eeccCCCCCCcccChhHHHhHHhh
Q psy1773 79 WSCPIVNCNFTSVNLNTFKIHLLK 102 (359)
Q Consensus 79 ~~C~~~~C~~~~~~~~~l~~H~~~ 102 (359)
+.|. .|++.|.+...|..|..+
T Consensus 293 lyC~--vCnKsFKseKq~kNHEnS 314 (508)
T KOG0717|consen 293 LYCV--VCNKSFKSEKQLKNHENS 314 (508)
T ss_pred eEEe--eccccccchHHHHhhHHH
Confidence 7899 699999999999888754
No 100
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=36.64 E-value=25 Score=26.09 Aligned_cols=32 Identities=22% Similarity=0.267 Sum_probs=23.2
Q ss_pred eeeccCCCCCCcccChhHHHhHHhhhCCCCccccccCCCCCccchhhcccHH
Q psy1773 78 LWSCPIVNCNFTSVNLNTFKIHLLKHFDKRPFKCKLTTANNTPCMWGFFSKH 129 (359)
Q Consensus 78 ~~~C~~~~C~~~~~~~~~l~~H~~~h~~~~~~~C~~~~~~~~~C~~~f~~~~ 129 (359)
+..|+ .|+.+|-.. +..|..|+. ||..|.-..
T Consensus 9 Kr~Cp--~cg~kFYDL-----------nk~p~vcP~-------cg~~~~~~~ 40 (129)
T TIGR02300 9 KRICP--NTGSKFYDL-----------NRRPAVSPY-------TGEQFPPEE 40 (129)
T ss_pred cccCC--CcCcccccc-----------CCCCccCCC-------cCCccCcch
Confidence 35788 899888765 457888888 777776553
No 101
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=36.43 E-value=13 Score=28.88 Aligned_cols=13 Identities=38% Similarity=0.720 Sum_probs=6.4
Q ss_pred cCCCCCCcCcccCCh
Q psy1773 297 FPCPLVTCGKSFLRS 311 (359)
Q Consensus 297 ~~C~~~~C~~~f~~~ 311 (359)
++|+. ||++|...
T Consensus 29 ~~c~~--c~~~f~~~ 41 (154)
T PRK00464 29 RECLA--CGKRFTTF 41 (154)
T ss_pred eeccc--cCCcceEe
Confidence 45554 55555444
No 102
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=36.19 E-value=19 Score=28.20 Aligned_cols=17 Identities=29% Similarity=0.516 Sum_probs=8.9
Q ss_pred ccCCCCCCcCcccCChHHH
Q psy1773 296 MFPCPLVTCGKSFLRSEHL 314 (359)
Q Consensus 296 ~~~C~~~~C~~~f~~~~~l 314 (359)
-|.|+. |+..|+...++
T Consensus 109 ~Y~Cp~--c~~r~tf~eA~ 125 (158)
T TIGR00373 109 FFICPN--MCVRFTFNEAM 125 (158)
T ss_pred eEECCC--CCcEeeHHHHH
Confidence 355554 55555555555
No 103
>PF15269 zf-C2H2_7: Zinc-finger
Probab=36.07 E-value=49 Score=19.34 Aligned_cols=23 Identities=22% Similarity=0.405 Sum_probs=16.7
Q ss_pred CeeeccCCCCCCcccChhHHHhHHh
Q psy1773 77 QLWSCPIVNCNFTSVNLNTFKIHLL 101 (359)
Q Consensus 77 ~~~~C~~~~C~~~~~~~~~l~~H~~ 101 (359)
-.|+|- .|.++....+.|-.||+
T Consensus 19 ~~ykcf--qcpftc~~kshl~nhmk 41 (54)
T PF15269_consen 19 FKYKCF--QCPFTCNEKSHLFNHMK 41 (54)
T ss_pred ccceee--cCCcccchHHHHHHHHH
Confidence 446777 78888777777777775
No 104
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=34.50 E-value=26 Score=27.03 Aligned_cols=36 Identities=28% Similarity=0.540 Sum_probs=17.9
Q ss_pred cccCCCCCCcCcccCChHHHHHHHHhhcCCCCceeccCCCccccc
Q psy1773 295 RMFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSY 339 (359)
Q Consensus 295 ~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f 339 (359)
.-|.|+. |+..|.....+..- .. .. .|.|+. ||...
T Consensus 98 ~~Y~Cp~--C~~~y~~~ea~~~~---d~-~~-~f~Cp~--Cg~~l 133 (147)
T smart00531 98 AYYKCPN--CQSKYTFLEANQLL---DM-DG-TFTCPR--CGEEL 133 (147)
T ss_pred cEEECcC--CCCEeeHHHHHHhc---CC-CC-cEECCC--CCCEE
Confidence 3466665 66666644333220 01 12 366766 66654
No 105
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=33.92 E-value=36 Score=26.77 Aligned_cols=22 Identities=32% Similarity=1.017 Sum_probs=18.0
Q ss_pred CeeeccCCCCCCcccChhHHHhHHhhhCCCCcccccc
Q psy1773 77 QLWSCPIVNCNFTSVNLNTFKIHLLKHFDKRPFKCKL 113 (359)
Q Consensus 77 ~~~~C~~~~C~~~~~~~~~l~~H~~~h~~~~~~~C~~ 113 (359)
+.|.|+ .||.++. |+.|-.||+
T Consensus 133 ~~~vC~--vCGy~~~-------------ge~P~~CPi 154 (166)
T COG1592 133 KVWVCP--VCGYTHE-------------GEAPEVCPI 154 (166)
T ss_pred CEEEcC--CCCCccc-------------CCCCCcCCC
Confidence 389999 7998654 678999999
No 106
>PF04606 Ogr_Delta: Ogr/Delta-like zinc finger; InterPro: IPR007684 This entry is represented by Bacteriophage P2, Ogr. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a viral family of phage zinc-binding transcriptional activators, which also contains cryptic members in some bacterial genomes []. The P4 phage delta protein contains two such domains attached covalently, while the P2 phage Ogr proteins possess one domain but function as dimers. All the members of this family have the following consensus sequence: C-X(2)-C-X(3)-A-(X)2-R-X(15)-C-X(4)-C-X(3)-F [].; GO: 0006355 regulation of transcription, DNA-dependent
Probab=33.10 E-value=9.2 Score=22.84 Aligned_cols=14 Identities=29% Similarity=0.562 Sum_probs=7.0
Q ss_pred ccccccCCcCcccc
Q psy1773 265 NFKCTYDGCKLLFS 278 (359)
Q Consensus 265 ~~~C~~~~C~~~f~ 278 (359)
-++|.-.+||..|.
T Consensus 25 Y~qC~N~~Cg~tfv 38 (47)
T PF04606_consen 25 YCQCTNPECGHTFV 38 (47)
T ss_pred EEEECCCcCCCEEE
Confidence 34555445555554
No 107
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=32.51 E-value=20 Score=28.70 Aligned_cols=16 Identities=38% Similarity=0.743 Sum_probs=7.1
Q ss_pred cCCCCCCcCcccCChHHH
Q psy1773 297 FPCPLVTCGKSFLRSEHL 314 (359)
Q Consensus 297 ~~C~~~~C~~~f~~~~~l 314 (359)
|.|+. |+..|+...++
T Consensus 118 Y~Cp~--C~~rytf~eA~ 133 (178)
T PRK06266 118 FFCPN--CHIRFTFDEAM 133 (178)
T ss_pred EECCC--CCcEEeHHHHh
Confidence 44443 44444444433
No 108
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=32.34 E-value=33 Score=25.85 Aligned_cols=33 Identities=24% Similarity=0.381 Sum_probs=19.1
Q ss_pred CCCCCCCeeeccCCCCCCcccChhHHHhHHhhhCCCCc
Q psy1773 71 KNSKKQQLWSCPIVNCNFTSVNLNTFKIHLLKHFDKRP 108 (359)
Q Consensus 71 ~~~~~~~~~~C~~~~C~~~~~~~~~l~~H~~~h~~~~~ 108 (359)
+++.......|- .||+.|+. |++|++.|+|-.|
T Consensus 65 ~kSI~~d~i~cl--ecGk~~k~---LkrHL~~~~gltp 97 (132)
T PF05443_consen 65 RKSITPDYIICL--ECGKKFKT---LKRHLRTHHGLTP 97 (132)
T ss_dssp STTB-SS-EE-T--BT--EESB---HHHHHHHTT-S-H
T ss_pred hhccccCeeEEc--cCCcccch---HHHHHHHccCCCH
Confidence 445566677888 69999875 5899999977544
No 109
>KOG4167|consensus
Probab=31.98 E-value=10 Score=36.81 Aligned_cols=26 Identities=38% Similarity=0.873 Sum_probs=23.7
Q ss_pred cccCCCCCCcCcccCChHHHHHHHHhhc
Q psy1773 295 RMFPCPLVTCGKSFLRSEHLQEHMLQHN 322 (359)
Q Consensus 295 ~~~~C~~~~C~~~f~~~~~l~~H~~~h~ 322 (359)
..|.|.. |++.|-...++..||++|.
T Consensus 791 giFpCre--C~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 791 GIFPCRE--CGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred ceeehHH--HHHHHHHHhhhhHHHHHHH
Confidence 3499998 9999999999999999995
No 110
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=30.23 E-value=63 Score=24.25 Aligned_cols=33 Identities=24% Similarity=0.314 Sum_probs=24.4
Q ss_pred CCCCCCCeeeccCCCCCCcccChhHHHhHHhhhCCCCc
Q psy1773 71 KNSKKQQLWSCPIVNCNFTSVNLNTFKIHLLKHFDKRP 108 (359)
Q Consensus 71 ~~~~~~~~~~C~~~~C~~~~~~~~~l~~H~~~h~~~~~ 108 (359)
+++....-..|- .+|+.|+ +|++|+.+|.+-.|
T Consensus 69 kKSVtpD~IicL--EDGkkfK---SLKRHL~t~~gmTP 101 (148)
T COG4957 69 KKSVTPDYIICL--EDGKKFK---SLKRHLTTHYGLTP 101 (148)
T ss_pred ccccCCCeEEEe--ccCcchH---HHHHHHhcccCCCH
Confidence 445556667888 8999885 68999999877544
No 111
>PF07800 DUF1644: Protein of unknown function (DUF1644); InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain.
Probab=29.70 E-value=68 Score=24.94 Aligned_cols=35 Identities=26% Similarity=0.380 Sum_probs=17.2
Q ss_pred HhhcCCCCceeccCCCccccccchhhHHHHHhhhcCC
Q psy1773 319 LQHNENKPKFTCPYQDCLMSYVAKSSLYAHLKHVHST 355 (359)
Q Consensus 319 ~~h~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~hh~~ 355 (359)
|.+-+.| +-.|...+|.++ .++..|.+|.+..|+.
T Consensus 100 R~~LN~K-~RsC~~e~C~F~-GtY~eLrKHar~~HP~ 134 (162)
T PF07800_consen 100 RRFLNAK-KRSCSQESCSFS-GTYSELRKHARSEHPS 134 (162)
T ss_pred HHHhccC-CccCcccccccc-cCHHHHHHHHHhhCCC
Confidence 3344444 345555445544 3445566666655543
No 112
>KOG4124|consensus
Probab=29.20 E-value=31 Score=30.40 Aligned_cols=52 Identities=8% Similarity=-0.029 Sum_probs=30.0
Q ss_pred CCccccccCCcCccccChHhhhccccccc----cCCcccCCCCCCcCcccCChHHHHHH
Q psy1773 263 TKNFKCTYDGCKLLFSTKSALKRHNKSKH----LNMRMFPCPLVTCGKSFLRSEHLQEH 317 (359)
Q Consensus 263 ~~~~~C~~~~C~~~f~~~~~l~~H~~~~H----~~~~~~~C~~~~C~~~f~~~~~l~~H 317 (359)
|.++.|+++.|.+..........|-...| +-.+||+|+ |++.+.+...|+.|
T Consensus 176 E~~~S~~vp~~~~~~~~~Ns~~~~S~~~~~T~~t~~~p~k~~---~~~~~~T~~~l~~H 231 (442)
T KOG4124|consen 176 EYRVSVVVPAAAAAAAAANSSDMSSDEASSTAETTGTPKKMP---ESLVMDTSSPLSDH 231 (442)
T ss_pred cccccccCchhhhhhhccccccccccccccccccccCCccCc---ccccccccchhhhc
Confidence 45666666666666655544444433222 234567776 67777776666666
No 113
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=27.16 E-value=28 Score=23.83 Aligned_cols=12 Identities=33% Similarity=1.043 Sum_probs=7.0
Q ss_pred ccCCCCCCcCcccC
Q psy1773 296 MFPCPLVTCGKSFL 309 (359)
Q Consensus 296 ~~~C~~~~C~~~f~ 309 (359)
.+.|.. |+..|.
T Consensus 53 IW~C~k--Cg~~fA 64 (89)
T COG1997 53 IWKCRK--CGAKFA 64 (89)
T ss_pred eEEcCC--CCCeec
Confidence 356665 666654
No 114
>KOG4167|consensus
Probab=26.98 E-value=22 Score=34.69 Aligned_cols=29 Identities=28% Similarity=0.491 Sum_probs=22.6
Q ss_pred CCccccccCCCCCccchhhcccHHHHHhhhhhcCCC
Q psy1773 106 KRPFKCKLTTANNTPCMWGFFSKHKLLRHMTSHDED 141 (359)
Q Consensus 106 ~~~~~C~~~~~~~~~C~~~f~~~~~l~~H~~~~~~~ 141 (359)
...|.|.. |++.|.....+..||++|...
T Consensus 790 ~giFpCre-------C~kvF~KiKSrNAHMK~Hr~q 818 (907)
T KOG4167|consen 790 TGIFPCRE-------CGKVFFKIKSRNAHMKTHRQQ 818 (907)
T ss_pred CceeehHH-------HHHHHHHHhhhhHHHHHHHHH
Confidence 45688887 888888888888888888643
No 115
>PF11789 zf-Nse: Zinc-finger of the MIZ type in Nse subunit; PDB: 2YU4_A 3HTK_C.
Probab=26.96 E-value=27 Score=21.92 Aligned_cols=9 Identities=33% Similarity=0.962 Sum_probs=3.9
Q ss_pred ceeccCCCc
Q psy1773 327 KFTCPYQDC 335 (359)
Q Consensus 327 ~~~C~~~~C 335 (359)
...|++.||
T Consensus 48 ~~~CPv~GC 56 (57)
T PF11789_consen 48 SKRCPVAGC 56 (57)
T ss_dssp -EE-SCCC-
T ss_pred CCCCCCCCC
Confidence 466766665
No 116
>KOG2186|consensus
Probab=26.93 E-value=36 Score=28.55 Aligned_cols=50 Identities=16% Similarity=0.312 Sum_probs=33.5
Q ss_pred eeeccCCCCCCcccChhHHHhHHhhhCCCCccccccCCCCCccchhhcccHHHHHhhhhhcC
Q psy1773 78 LWSCPIVNCNFTSVNLNTFKIHLLKHFDKRPFKCKLTTANNTPCMWGFFSKHKLLRHMTSHD 139 (359)
Q Consensus 78 ~~~C~~~~C~~~~~~~~~l~~H~~~h~~~~~~~C~~~~~~~~~C~~~f~~~~~l~~H~~~~~ 139 (359)
.|.|. .||..... -.+..|+..-++ .-|.|- +|++.|.. ..+..|..--+
T Consensus 3 ~FtCn--vCgEsvKK-p~vekH~srCrn-~~fSCI-------DC~k~F~~-~sYknH~kCIT 52 (276)
T KOG2186|consen 3 FFTCN--VCGESVKK-PQVEKHMSRCRN-AYFSCI-------DCGKTFER-VSYKNHTKCIT 52 (276)
T ss_pred EEehh--hhhhhccc-cchHHHHHhccC-CeeEEe-------eccccccc-chhhhhhhhcc
Confidence 37788 68877663 356668776655 446664 48888887 67777765444
No 117
>PF03145 Sina: Seven in absentia protein family; InterPro: IPR018121 The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results in the transformation of the R7 precursor cell to a non-neuronal cell type. The Sina protein contains an N-terminal RING finger domain C3HC4-type. Through this domain, Sina binds E2 ubiquitin-conjugating enzymes (UbcD1) Sina also interacts with Tramtrack (TTK88) via PHYL. Tramtrack is a transcriptional repressor that blocks photoreceptor determination, while PHYL down-regulates the activity of TTK88. In turn, the activity of PHYL requires the activation of the Sevenless receptor tyrosine kinase, a process essential for R7 determination. It is thought that Sina targets TTK88 for degradation, therefore promoting the R7 pathway. Murine and human homologues of Sina have also been identified. The human homologue Siah-1 [] also binds E2 enzymes (UbcH5) and through a series of physical interactions, targets beta-catenin for ubiquitin degradation. Siah-1 expression is enhanced by p53, itself promoted by DNA damage. Thus this pathway links DNA damage to beta-catenin degradation [, ]. Sina proteins, therefore, physically interact with a variety of proteins. The N-terminal RING finger domain that binds ubiquitin conjugating enzymes is a C3HC4-type, and does not form part of the alignment for this family. The remainder C-terminal part is involved in interactions with other proteins, and is included in this alignment. In addition to the Drosophila protein and mammalian homologues, whose similarity was noted previously, this family also includes putative homologues from Caenorhabditis elegans, Arabidopsis thaliana.; GO: 0006511 ubiquitin-dependent protein catabolic process, 0007275 multicellular organismal development, 0005634 nucleus; PDB: 2A25_A 1K2F_B 2AN6_C.
Probab=26.11 E-value=73 Score=25.89 Aligned_cols=28 Identities=29% Similarity=0.553 Sum_probs=11.4
Q ss_pred ceeccC--CCccccccchhhHHHHHhhhcCC
Q psy1773 327 KFTCPY--QDCLMSYVAKSSLYAHLKHVHST 355 (359)
Q Consensus 327 ~~~C~~--~~C~~~f~~~~~l~~H~~~hh~~ 355 (359)
||.|+. .+|+..- ....|..|+...|+.
T Consensus 44 p~~CP~~~~~C~~~G-~~~~l~~Hl~~~H~~ 73 (198)
T PF03145_consen 44 PCSCPFPGSGCDWQG-SYKELLDHLRDKHSW 73 (198)
T ss_dssp EEE-SSSSTT---EE-ECCCHHHHHHHHTTT
T ss_pred CCcCCCCCCCccccC-CHHHHHHHHHHHCCC
Confidence 455555 4454332 233455555555544
No 118
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=25.65 E-value=43 Score=19.69 Aligned_cols=27 Identities=19% Similarity=0.445 Sum_probs=16.0
Q ss_pred cCCCCCCcCcccCChHHHHHHHHhhcCCCCceeccCCCccccc
Q psy1773 297 FPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSY 339 (359)
Q Consensus 297 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f 339 (359)
|.|.. ||..|... ... +..|+. ||..-
T Consensus 3 Y~C~~--Cg~~~~~~-----------~~~-~irC~~--CG~rI 29 (44)
T smart00659 3 YICGE--CGRENEIK-----------SKD-VVRCRE--CGYRI 29 (44)
T ss_pred EECCC--CCCEeecC-----------CCC-ceECCC--CCceE
Confidence 66766 77766533 122 577777 77543
No 119
>KOG1842|consensus
Probab=24.95 E-value=42 Score=30.73 Aligned_cols=27 Identities=19% Similarity=0.330 Sum_probs=16.9
Q ss_pred ccccCCcCccCCChHHHHHHHHHHccC
Q psy1773 236 DLVCKGCGKQFKVPCRLREHYKAVHES 262 (359)
Q Consensus 236 ~~~C~~C~~~f~~~~~l~~H~~~~H~~ 262 (359)
-|.|++|...|.+...|..|+-..|.+
T Consensus 15 gflCPiC~~dl~~~~~L~~H~d~eH~~ 41 (505)
T KOG1842|consen 15 GFLCPICLLDLPNLSALNDHLDVEHFE 41 (505)
T ss_pred cccCchHhhhhhhHHHHHHHHhhhccc
Confidence 366666666666666666666655543
No 120
>COG2879 Uncharacterized small protein [Function unknown]
Probab=24.89 E-value=70 Score=20.39 Aligned_cols=16 Identities=13% Similarity=0.324 Sum_probs=9.0
Q ss_pred chhhHHHHHhhhcCCC
Q psy1773 341 AKSSLYAHLKHVHSTN 356 (359)
Q Consensus 341 ~~~~l~~H~~~hh~~~ 356 (359)
+..+...|++.+|++.
T Consensus 24 dYdnYVehmr~~hPd~ 39 (65)
T COG2879 24 DYDNYVEHMRKKHPDK 39 (65)
T ss_pred cHHHHHHHHHHhCcCC
Confidence 3455556666666554
No 121
>KOG3002|consensus
Probab=24.82 E-value=40 Score=29.51 Aligned_cols=79 Identities=19% Similarity=0.290 Sum_probs=40.8
Q ss_pred CccccCCcCccCCChHHHHHHHHHHccCCCcccccc--CCcCccccChHhhhccccccccCCcccCCCCC--CcCcccCC
Q psy1773 235 EDLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTY--DGCKLLFSTKSALKRHNKSKHLNMRMFPCPLV--TCGKSFLR 310 (359)
Q Consensus 235 ~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~--~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~--~C~~~f~~ 310 (359)
....|+.|...+.....+..- .+ -....+.|++ -+|.+.|..... ..|.+ .- .-+||.|+.+ .|... ..
T Consensus 79 ~~~~CP~Cr~~~g~~R~~amE--kV-~e~~~vpC~~~~~GC~~~~~Y~~~-~~HE~-~C-~f~~~~CP~p~~~C~~~-G~ 151 (299)
T KOG3002|consen 79 VSNKCPTCRLPIGNIRCRAME--KV-AEAVLVPCKNAKLGCTKSFPYGEK-SKHEK-VC-EFRPCSCPVPGAECKYT-GS 151 (299)
T ss_pred hcccCCccccccccHHHHHHH--HH-HHhceecccccccCCceeeccccc-ccccc-cc-ccCCcCCCCCcccCCcc-Cc
Confidence 456688887777755443332 12 2233455653 257777766655 55554 12 2266667654 44332 23
Q ss_pred hHHHHHHHHh
Q psy1773 311 SEHLQEHMLQ 320 (359)
Q Consensus 311 ~~~l~~H~~~ 320 (359)
...|..|.+.
T Consensus 152 ~~~l~~H~~~ 161 (299)
T KOG3002|consen 152 YKDLYAHLND 161 (299)
T ss_pred HHHHHHHHHh
Confidence 3455566543
No 122
>KOG1280|consensus
Probab=24.59 E-value=63 Score=28.57 Aligned_cols=32 Identities=16% Similarity=0.196 Sum_probs=21.4
Q ss_pred chhhcccHHHHHhhhhhcCCCCCccccCCCccc
Q psy1773 121 CMWGFFSKHKLLRHMTSHDEDKKCYPCDQKNCN 153 (359)
Q Consensus 121 C~~~f~~~~~l~~H~~~~~~~~~~~~c~c~~C~ 153 (359)
|+..-.+...|..|....|.+ ..+.+.|+.|+
T Consensus 85 C~~~Gfte~~f~~Hv~s~Hpd-a~~~~icp~c~ 116 (381)
T KOG1280|consen 85 CGIMGFTERQFGTHVLSQHPE-ASTSVICPLCA 116 (381)
T ss_pred ccccccchhHHHHHhhhcCcc-cCcceeeeccc
Confidence 877777777888887777665 33455555553
No 123
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=24.33 E-value=36 Score=17.40 Aligned_cols=8 Identities=38% Similarity=1.223 Sum_probs=4.0
Q ss_pred ccCCCccccc
Q psy1773 330 CPYQDCLMSY 339 (359)
Q Consensus 330 C~~~~C~~~f 339 (359)
|+. ||..|
T Consensus 17 Cp~--CG~~F 24 (26)
T PF10571_consen 17 CPH--CGYDF 24 (26)
T ss_pred CCC--CCCCC
Confidence 544 55544
No 124
>KOG2593|consensus
Probab=23.34 E-value=65 Score=29.47 Aligned_cols=20 Identities=35% Similarity=0.713 Sum_probs=10.7
Q ss_pred CCcccCCCCCCcCcccCChHHH
Q psy1773 293 NMRMFPCPLVTCGKSFLRSEHL 314 (359)
Q Consensus 293 ~~~~~~C~~~~C~~~f~~~~~l 314 (359)
....|.|+. |.+.|+....+
T Consensus 125 ~~~~Y~Cp~--C~kkyt~Lea~ 144 (436)
T KOG2593|consen 125 NVAGYVCPN--CQKKYTSLEAL 144 (436)
T ss_pred ccccccCCc--cccchhhhHHH
Confidence 344466665 66666554444
No 125
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=22.90 E-value=42 Score=16.86 Aligned_cols=6 Identities=50% Similarity=1.342 Sum_probs=3.0
Q ss_pred ceeccC
Q psy1773 327 KFTCPY 332 (359)
Q Consensus 327 ~~~C~~ 332 (359)
+|.|+.
T Consensus 16 ~f~CPn 21 (24)
T PF07754_consen 16 PFPCPN 21 (24)
T ss_pred eEeCCC
Confidence 355554
No 126
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=22.22 E-value=79 Score=19.97 Aligned_cols=9 Identities=33% Similarity=1.110 Sum_probs=5.8
Q ss_pred ceeccCCCccc
Q psy1773 327 KFTCPYQDCLM 337 (359)
Q Consensus 327 ~~~C~~~~C~~ 337 (359)
+|.|+. ||+
T Consensus 50 ~Y~Cp~--CGF 58 (61)
T COG2888 50 PYRCPK--CGF 58 (61)
T ss_pred ceECCC--cCc
Confidence 577766 664
No 127
>TIGR00244 transcriptional regulator NrdR. Members of this almost entirely bacterial family contain an ATP cone domain (PFAM:PF03477). There is never more than one member per genome. Common gene symbols given include nrdR, ybaD, ribX and ytcG. The member from Streptomyces coelicolor is found upstream in the operon of the class II oxygen-independent ribonucleotide reductase gene nrdJ and was shown to repress nrdJ expression. Many members of this family are found near genes for riboflavin biosynthesis in Gram-negative bacteria, suggesting a role in that pathway. However, a phylogenetic profiling study associates members of this family with the presence of a palindromic signal with consensus acaCwAtATaTwGtgt, termed the NrdR-box, an upstream element for most operons for ribonucleotide reductase of all three classes in bacterial genomes.
Probab=22.04 E-value=49 Score=25.36 Aligned_cols=11 Identities=18% Similarity=0.298 Sum_probs=6.2
Q ss_pred chhhcccHHHH
Q psy1773 121 CMWGFFSKHKL 131 (359)
Q Consensus 121 C~~~f~~~~~l 131 (359)
|+++|++-..+
T Consensus 34 C~~RFTTyErv 44 (147)
T TIGR00244 34 CHERFTTFERA 44 (147)
T ss_pred cCCccceeeec
Confidence 66666655443
No 128
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=21.98 E-value=1.1e+02 Score=23.91 Aligned_cols=15 Identities=13% Similarity=0.357 Sum_probs=6.7
Q ss_pred cccCCcCccCCChHH
Q psy1773 237 LVCKGCGKQFKVPCR 251 (359)
Q Consensus 237 ~~C~~C~~~f~~~~~ 251 (359)
|.|+.|+..|+....
T Consensus 110 Y~Cp~c~~r~tf~eA 124 (158)
T TIGR00373 110 FICPNMCVRFTFNEA 124 (158)
T ss_pred EECCCCCcEeeHHHH
Confidence 444444444444333
No 129
>KOG3408|consensus
Probab=21.94 E-value=50 Score=24.25 Aligned_cols=52 Identities=12% Similarity=0.205 Sum_probs=0.0
Q ss_pred HHHHHHHhcCcCccceeeCCCCCCCCCCCeeeccCCCCCCcccChhHHHhHHhh
Q psy1773 49 AITEALQSLGINKEYFISSTGGKNSKKQQLWSCPIVNCNFTSVNLNTFKIHLLK 102 (359)
Q Consensus 49 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~C~~~~~~~~~l~~H~~~ 102 (359)
+++.....+......+.........++...|.|. .|.+-|.+...|..|.++
T Consensus 28 dLDqi~~dl~~~~~kll~~~~D~dlPG~GqfyCi--~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 28 DLDQIDEDLETQKGKLLNQEIDPDLPGGGQFYCI--ECARYFIDAKALKTHFKT 79 (129)
T ss_pred cccccccccccccchhhcCcCCCCCCCCceeehh--hhhhhhcchHHHHHHHhc
No 130
>KOG2593|consensus
Probab=21.53 E-value=91 Score=28.58 Aligned_cols=17 Identities=18% Similarity=0.714 Sum_probs=8.1
Q ss_pred CccccccccchhhcCChHH
Q psy1773 173 LTLCCEELGCGRKFQTMKQ 191 (359)
Q Consensus 173 ~~~~C~~~~C~~~f~~~~~ 191 (359)
..|.|+. |.+.|.....
T Consensus 127 ~~Y~Cp~--C~kkyt~Lea 143 (436)
T KOG2593|consen 127 AGYVCPN--CQKKYTSLEA 143 (436)
T ss_pred ccccCCc--cccchhhhHH
Confidence 3455555 5555544333
No 131
>KOG2295|consensus
Probab=21.25 E-value=54 Score=30.97 Aligned_cols=42 Identities=14% Similarity=0.448 Sum_probs=32.7
Q ss_pred HHHHHHHhhc---CCCCceeccCCCccccccchhhHHHHHhhhcCC
Q psy1773 313 HLQEHMLQHN---ENKPKFTCPYQDCLMSYVAKSSLYAHLKHVHST 355 (359)
Q Consensus 313 ~l~~H~~~h~---~~~~~~~C~~~~C~~~f~~~~~l~~H~~~hh~~ 355 (359)
.+..|++..+ ++. .|.|+..||.+.|.-..-..+|+...|.+
T Consensus 493 eve~~v~s~t~e~~kd-Ky~C~lsgc~KlF~gpEFvrKHi~~KH~d 537 (648)
T KOG2295|consen 493 EVENFVDSNTMELDKD-KYLCPLSGCAKLFKGPEFVRKHINKKHKD 537 (648)
T ss_pred HHHHHHHHHHHHhhcc-cccCCCcchHhhccCHHHHHHHHHHHHHH
Confidence 4566665544 333 59999999999999999999999988854
No 132
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=20.14 E-value=77 Score=29.62 Aligned_cols=27 Identities=22% Similarity=0.459 Sum_probs=0.0
Q ss_pred cccCCcCccCCChHHHHHHHHHHccCC
Q psy1773 237 LVCKGCGKQFKVPCRLREHYKAVHEST 263 (359)
Q Consensus 237 ~~C~~C~~~f~~~~~l~~H~~~~H~~~ 263 (359)
+.|+.|.+.|.+...+..|+...|.+.
T Consensus 58 WiCp~CskkF~d~~~~~~H~~~eH~~~ 84 (466)
T PF04780_consen 58 WICPRCSKKFSDAESCLSHMEQEHPAG 84 (466)
T ss_pred eeCCcccceeCCHHHHHHHHHHhhhhh
No 133
>PF11567 PfUIS3: Plasmodium falciparum UIS3 membrane protein; InterPro: IPR021626 UIS3 is a membrane protein essential for sporozoite development in infected hepatocytes. This family is 130-229 of the Plasmodium falciparum UIS3 protein which is compact and has an all alpha-helical structure.PfUIS3(130-229) interacts with lipids, phospholipid lysosomes, the human liver fatty acid-binding protein and with the lipid phosphatidylethanolamine. The interaction with liver fatty acid-binding protein provides the parasite with a method to import essential fatty acids/lipids during rapid growth phases of sporozoites []. ; PDB: 2VWA_C.
Probab=20.11 E-value=2.3e+02 Score=19.43 Aligned_cols=39 Identities=15% Similarity=0.352 Sum_probs=23.7
Q ss_pred HHHHHHHhhcCCCchhHHHHHHHHHHHHHHHHHhhhhhh
Q psy1773 4 VKKLLEFAQDTDDVSARNVLTKLAVLYSTWSLEKHLATF 42 (359)
Q Consensus 4 ~~~l~~~~q~~~~v~~e~~~~~~~~~~~~~~~~~~~~~~ 42 (359)
+|.+++--+..++|.+-...++.+|++-+.-|.+.....
T Consensus 57 ~rkiVQ~L~E~rnv~lSr~QEd~AVl~~e~fLq~~y~~~ 95 (101)
T PF11567_consen 57 IRKIVQTLDENRNVNLSRMQEDMAVLNVEHFLQEQYQQY 95 (101)
T ss_dssp HHHHHHHHHHHTT----HHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHhcccccHHHHHHHHHHhhHHHHHHHHHHHH
Confidence 455566666668888888888888888766665555443
Done!