Query psy11069
Match_columns 391
No_of_seqs 467 out of 2543
Neff 10.1
Searched_HMMs 46136
Date Fri Aug 16 17:13:37 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11069.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11069hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3608|consensus 100.0 2.5E-31 5.4E-36 227.4 12.7 261 108-386 67-377 (467)
2 KOG3608|consensus 100.0 2.2E-29 4.9E-34 215.5 10.3 233 106-357 130-379 (467)
3 KOG1074|consensus 100.0 2.4E-29 5.2E-34 237.3 5.7 196 181-388 608-934 (958)
4 KOG2462|consensus 99.9 2.8E-27 6E-32 198.1 7.2 134 237-383 130-265 (279)
5 KOG2462|consensus 99.9 3E-26 6.5E-31 191.9 5.4 106 235-351 159-264 (279)
6 KOG3623|consensus 99.9 2E-22 4.3E-27 187.7 4.9 111 107-230 207-331 (1007)
7 KOG3623|consensus 99.9 4.7E-22 1E-26 185.3 7.0 109 140-262 210-332 (1007)
8 KOG1074|consensus 99.8 4.4E-20 9.5E-25 175.2 2.9 108 269-384 606-732 (958)
9 KOG3576|consensus 99.7 7.2E-18 1.6E-22 134.3 3.2 114 266-387 115-238 (267)
10 KOG3576|consensus 99.6 2.2E-16 4.8E-21 125.9 1.8 115 236-358 116-240 (267)
11 PLN03086 PRLI-interacting fact 99.1 9.8E-10 2.1E-14 104.5 10.7 150 206-387 407-566 (567)
12 PLN03086 PRLI-interacting fact 99.0 8E-10 1.7E-14 105.1 8.8 146 181-355 410-565 (567)
13 PHA00733 hypothetical protein 99.0 3.4E-10 7.4E-15 87.7 5.1 82 295-387 38-125 (128)
14 PHA00733 hypothetical protein 99.0 6.9E-10 1.5E-14 86.0 5.6 85 265-356 37-125 (128)
15 PHA02768 hypothetical protein; 98.7 6.9E-09 1.5E-13 65.9 2.4 44 328-378 5-48 (55)
16 KOG3993|consensus 98.7 5.7E-09 1.2E-13 93.1 0.7 200 177-387 268-484 (500)
17 PHA02768 hypothetical protein; 98.6 1.3E-08 2.8E-13 64.7 1.9 42 269-315 6-47 (55)
18 KOG3993|consensus 98.5 1.2E-08 2.7E-13 91.0 -0.3 25 206-232 356-380 (500)
19 PF13465 zf-H2C2_2: Zinc-finge 98.5 1.3E-07 2.9E-12 51.2 2.7 26 345-373 1-26 (26)
20 PF13465 zf-H2C2_2: Zinc-finge 98.3 4.9E-07 1.1E-11 48.9 2.0 26 127-159 1-26 (26)
21 PHA00616 hypothetical protein 98.2 1.1E-06 2.3E-11 53.1 1.8 34 328-364 1-34 (44)
22 PHA00616 hypothetical protein 98.2 9.4E-07 2E-11 53.3 1.5 31 359-391 1-31 (44)
23 PF05605 zf-Di19: Drought indu 98.0 9.5E-06 2.1E-10 52.8 4.7 51 329-386 3-54 (54)
24 PHA00732 hypothetical protein 98.0 4E-06 8.7E-11 58.8 3.0 56 297-362 1-56 (79)
25 PHA00732 hypothetical protein 98.0 7.3E-06 1.6E-10 57.5 3.2 46 268-321 1-46 (79)
26 PF05605 zf-Di19: Drought indu 97.9 1.4E-05 3E-10 52.0 4.3 50 298-354 3-53 (54)
27 COG5189 SFP1 Putative transcri 97.6 1.6E-05 3.4E-10 68.8 0.9 72 204-288 347-418 (423)
28 PF00096 zf-C2H2: Zinc finger, 97.6 4.2E-05 9.1E-10 40.0 1.9 22 360-383 1-22 (23)
29 COG5189 SFP1 Putative transcri 97.6 3.4E-05 7.3E-10 66.9 2.3 75 295-384 347-421 (423)
30 PF13894 zf-C2H2_4: C2H2-type 97.5 6E-05 1.3E-09 39.8 2.1 24 360-385 1-24 (24)
31 PF00096 zf-C2H2: Zinc finger, 97.5 5.1E-05 1.1E-09 39.7 1.5 23 329-353 1-23 (23)
32 PF12756 zf-C2H2_2: C2H2 type 97.5 5.8E-05 1.3E-09 56.3 2.4 73 270-353 1-73 (100)
33 PF12756 zf-C2H2_2: C2H2 type 97.5 5.9E-05 1.3E-09 56.3 2.1 72 300-384 2-73 (100)
34 PF13894 zf-C2H2_4: C2H2-type 97.3 0.00016 3.4E-09 38.1 2.0 24 329-354 1-24 (24)
35 PF13912 zf-C2H2_6: C2H2-type 97.3 0.00015 3.3E-09 39.6 1.8 26 359-386 1-26 (27)
36 COG5236 Uncharacterized conser 97.2 0.00021 4.5E-09 62.8 2.1 131 206-353 151-304 (493)
37 COG5236 Uncharacterized conser 97.1 0.00079 1.7E-08 59.2 5.0 74 299-386 222-306 (493)
38 PF13912 zf-C2H2_6: C2H2-type 97.0 0.00034 7.3E-09 38.2 1.0 23 329-353 2-24 (27)
39 PF09237 GAGA: GAGA factor; I 96.9 0.00073 1.6E-08 41.8 1.9 40 317-358 13-52 (54)
40 PF13909 zf-H2C2_5: C2H2-type 96.7 0.001 2.2E-08 35.0 1.8 24 360-386 1-24 (24)
41 PF09237 GAGA: GAGA factor; I 96.5 0.0016 3.5E-08 40.3 1.5 41 346-388 11-51 (54)
42 smart00355 ZnF_C2H2 zinc finge 96.3 0.0052 1.1E-07 32.7 2.8 18 336-353 6-23 (26)
43 KOG2231|consensus 96.2 0.01 2.2E-07 58.2 6.0 127 238-382 100-260 (669)
44 KOG4173|consensus 96.1 0.0017 3.6E-08 52.9 0.5 84 237-324 79-171 (253)
45 KOG1146|consensus 96.1 0.0023 5E-08 66.3 1.2 56 326-385 516-613 (1406)
46 KOG4173|consensus 95.9 0.0053 1.1E-07 50.1 2.3 95 108-233 77-171 (253)
47 smart00355 ZnF_C2H2 zinc finge 95.9 0.0052 1.1E-07 32.7 1.6 24 360-385 1-24 (26)
48 PF13909 zf-H2C2_5: C2H2-type 95.8 0.0071 1.5E-07 31.7 1.8 23 329-354 1-23 (24)
49 PF12874 zf-met: Zinc-finger o 95.8 0.0046 1E-07 32.8 1.0 15 336-350 6-20 (25)
50 PF12874 zf-met: Zinc-finger o 95.6 0.0082 1.8E-07 31.8 1.7 23 360-384 1-23 (25)
51 KOG2231|consensus 95.6 0.022 4.7E-07 56.0 5.5 18 245-262 188-205 (669)
52 COG5048 FOG: Zn-finger [Genera 95.5 0.0034 7.4E-08 60.3 -0.2 154 110-300 289-449 (467)
53 KOG1146|consensus 95.4 0.0033 7.2E-08 65.2 -0.6 132 245-382 442-639 (1406)
54 COG5048 FOG: Zn-finger [Genera 94.9 0.0088 1.9E-07 57.4 0.7 163 180-348 291-463 (467)
55 KOG2482|consensus 94.8 0.085 1.8E-06 46.9 6.2 77 269-352 280-356 (423)
56 PRK04860 hypothetical protein; 94.4 0.027 5.8E-07 45.5 2.2 38 268-312 119-156 (160)
57 PF12171 zf-C2H2_jaz: Zinc-fin 94.0 0.015 3.2E-07 31.5 -0.0 22 360-383 2-23 (27)
58 PRK04860 hypothetical protein; 93.8 0.038 8.2E-07 44.7 2.0 40 327-375 118-157 (160)
59 PF12171 zf-C2H2_jaz: Zinc-fin 93.7 0.05 1.1E-06 29.4 1.7 21 298-320 2-22 (27)
60 KOG2482|consensus 92.4 0.12 2.7E-06 45.8 3.2 156 221-384 128-357 (423)
61 PF13913 zf-C2HC_2: zinc-finge 91.5 0.16 3.6E-06 26.8 1.8 20 360-382 3-22 (25)
62 PF13913 zf-C2HC_2: zinc-finge 90.5 0.3 6.5E-06 25.8 2.3 19 330-351 4-22 (25)
63 smart00451 ZnF_U1 U1-like zinc 90.3 0.2 4.4E-06 28.8 1.7 22 360-383 4-25 (35)
64 KOG4377|consensus 90.0 0.33 7.1E-06 44.4 3.4 128 206-354 271-427 (480)
65 KOG4124|consensus 89.6 0.098 2.1E-06 46.6 -0.1 56 203-262 175-235 (442)
66 smart00451 ZnF_U1 U1-like zinc 89.6 0.27 5.8E-06 28.3 1.8 23 328-352 3-25 (35)
67 PF12013 DUF3505: Protein of u 89.2 0.65 1.4E-05 35.0 4.2 26 359-386 80-109 (109)
68 KOG2893|consensus 88.8 0.12 2.7E-06 43.4 -0.0 43 336-385 16-58 (341)
69 KOG2785|consensus 88.5 1.3 2.9E-05 40.3 6.2 140 237-384 3-243 (390)
70 COG4049 Uncharacterized protei 86.5 0.35 7.6E-06 30.6 1.0 28 265-292 14-41 (65)
71 KOG2785|consensus 85.7 1.7 3.6E-05 39.7 5.2 139 206-352 3-242 (390)
72 PF12013 DUF3505: Protein of u 85.2 1.3 2.9E-05 33.3 3.9 85 266-355 9-109 (109)
73 KOG2893|consensus 84.9 0.24 5.1E-06 41.7 -0.4 42 300-350 13-54 (341)
74 COG4049 Uncharacterized protei 83.3 0.59 1.3E-05 29.6 0.9 30 324-355 13-42 (65)
75 cd00350 rubredoxin_like Rubred 78.5 1.2 2.6E-05 25.3 1.1 9 269-277 2-10 (33)
76 KOG4377|consensus 77.2 1.5 3.2E-05 40.4 1.8 28 359-386 401-428 (480)
77 PF02892 zf-BED: BED zinc fing 75.3 2 4.3E-05 26.3 1.6 25 359-385 16-44 (45)
78 smart00614 ZnF_BED BED zinc fi 66.9 5.5 0.00012 25.0 2.2 26 359-386 18-48 (50)
79 PF09986 DUF2225: Uncharacteri 65.3 3.3 7.1E-05 35.5 1.2 44 236-281 4-61 (214)
80 PF12907 zf-met2: Zinc-binding 63.7 4.2 9.1E-05 24.3 1.1 26 270-295 3-31 (40)
81 PF04959 ARS2: Arsenite-resist 59.5 3.9 8.5E-05 34.8 0.6 27 359-387 77-103 (214)
82 PF09986 DUF2225: Uncharacteri 56.1 2.5 5.5E-05 36.2 -1.0 44 327-374 4-61 (214)
83 PF04959 ARS2: Arsenite-resist 55.9 7.5 0.00016 33.1 1.8 29 265-293 74-102 (214)
84 TIGR00622 ssl1 transcription f 55.8 16 0.00034 27.5 3.2 20 268-287 15-34 (112)
85 PF09538 FYDLN_acid: Protein o 55.8 7.7 0.00017 29.1 1.6 15 267-281 25-39 (108)
86 PF15269 zf-C2H2_7: Zinc-finge 55.1 10 0.00023 22.9 1.7 26 175-200 17-42 (54)
87 smart00734 ZnF_Rad18 Rad18-lik 52.9 10 0.00023 20.1 1.4 20 360-382 2-21 (26)
88 PF13719 zinc_ribbon_5: zinc-r 51.9 11 0.00023 22.1 1.5 9 269-277 26-34 (37)
89 COG1592 Rubrerythrin [Energy p 51.9 9.1 0.0002 31.1 1.6 23 268-306 134-156 (166)
90 KOG4124|consensus 51.4 4 8.7E-05 36.8 -0.5 27 235-261 347-373 (442)
91 TIGR00622 ssl1 transcription f 49.7 38 0.00082 25.5 4.4 90 269-384 2-104 (112)
92 PF07800 DUF1644: Protein of u 49.6 21 0.00045 28.6 3.2 59 268-329 80-138 (162)
93 KOG2186|consensus 48.1 9.5 0.00021 32.9 1.2 49 111-171 4-52 (276)
94 PF09538 FYDLN_acid: Protein o 46.9 14 0.0003 27.7 1.8 32 109-160 8-39 (108)
95 PF13717 zinc_ribbon_4: zinc-r 46.7 15 0.00033 21.3 1.6 10 269-278 26-35 (36)
96 PRK09678 DNA-binding transcrip 44.8 7.9 0.00017 26.6 0.2 16 359-374 27-42 (72)
97 COG2888 Predicted Zn-ribbon RN 43.3 17 0.00037 23.7 1.5 10 268-277 27-36 (61)
98 cd00729 rubredoxin_SM Rubredox 40.6 26 0.00056 20.0 1.9 22 110-145 2-23 (34)
99 PF05443 ROS_MUCR: ROS/MUCR tr 40.0 18 0.00038 28.3 1.4 24 359-387 72-95 (132)
100 PRK00464 nrdR transcriptional 39.8 13 0.00028 29.9 0.7 13 269-281 29-41 (154)
101 PF04606 Ogr_Delta: Ogr/Delta- 38.3 7.9 0.00017 24.0 -0.6 13 298-310 26-38 (47)
102 KOG2186|consensus 36.6 23 0.00049 30.7 1.7 48 141-201 4-51 (276)
103 COG1592 Rubrerythrin [Energy p 32.0 33 0.00072 27.9 1.9 23 297-337 134-156 (166)
104 PRK14890 putative Zn-ribbon RN 31.2 25 0.00054 23.0 0.9 10 268-277 25-34 (59)
105 TIGR02605 CxxC_CxxC_SSSS putat 30.8 13 0.00029 23.4 -0.4 7 269-275 27-33 (52)
106 TIGR02300 FYDLN_acid conserved 30.7 38 0.00082 26.1 1.9 34 109-162 8-41 (129)
107 smart00531 TFIIE Transcription 30.5 37 0.00079 27.1 1.9 17 327-345 98-114 (147)
108 PRK00398 rpoP DNA-directed RNA 30.2 42 0.00091 20.5 1.8 30 110-158 3-32 (46)
109 COG4957 Predicted transcriptio 29.8 37 0.00079 26.3 1.7 22 360-386 77-98 (148)
110 PF05443 ROS_MUCR: ROS/MUCR tr 29.4 32 0.00069 26.9 1.4 25 328-357 72-96 (132)
111 PRK06266 transcription initiat 29.4 24 0.00052 29.2 0.8 16 297-314 117-132 (178)
112 KOG2295|consensus 29.1 30 0.00064 33.6 1.4 43 344-387 492-537 (648)
113 COG1997 RPL43A Ribosomal prote 28.2 37 0.0008 24.1 1.4 12 268-279 53-64 (89)
114 PF07800 DUF1644: Protein of u 27.9 73 0.0016 25.6 3.1 36 350-387 99-134 (162)
115 PF02176 zf-TRAF: TRAF-type zi 27.4 27 0.00058 22.6 0.6 9 252-260 23-31 (60)
116 TIGR00373 conserved hypothetic 27.4 52 0.0011 26.6 2.3 20 326-347 107-126 (158)
117 KOG4167|consensus 26.7 18 0.0004 36.3 -0.5 26 359-386 792-817 (907)
118 PF03145 Sina: Seven in absent 26.7 67 0.0014 27.0 3.0 13 375-387 61-73 (198)
119 KOG4167|consensus 26.2 15 0.00032 36.9 -1.2 27 327-355 791-817 (907)
120 PF09723 Zn-ribbon_8: Zinc rib 26.2 39 0.00085 20.3 1.1 9 268-276 26-34 (42)
121 PF01352 KRAB: KRAB box; Inte 25.1 8.2 0.00018 23.2 -2.0 15 39-53 20-34 (41)
122 TIGR00373 conserved hypothetic 24.5 93 0.002 25.2 3.3 33 294-340 106-138 (158)
123 PRK06266 transcription initiat 24.3 45 0.00097 27.6 1.5 18 327-346 116-133 (178)
124 COG4530 Uncharacterized protei 23.8 48 0.001 24.6 1.3 13 266-278 24-36 (129)
125 PHA00626 hypothetical protein 23.6 41 0.0009 21.6 0.8 13 359-373 23-35 (59)
126 PF10571 UPF0547: Uncharacteri 23.4 38 0.00082 18.0 0.6 9 361-371 16-24 (26)
127 KOG3408|consensus 23.3 45 0.00098 25.3 1.1 26 203-230 54-79 (129)
128 PF04780 DUF629: Protein of un 22.3 59 0.0013 31.3 2.0 27 269-295 58-84 (466)
129 KOG3002|consensus 22.0 60 0.0013 29.4 1.9 80 266-352 78-161 (299)
130 smart00659 RPOLCX RNA polymera 20.8 68 0.0015 19.6 1.4 26 329-370 3-28 (44)
131 PTZ00255 60S ribosomal protein 20.6 45 0.00097 24.0 0.6 13 267-279 53-65 (90)
132 PF07754 DUF1610: Domain of un 20.4 49 0.0011 17.3 0.6 7 360-368 17-23 (24)
133 COG1996 RPC10 DNA-directed RNA 20.1 51 0.0011 20.7 0.7 29 327-370 5-33 (49)
134 PF13451 zf-trcl: Probable zin 20.1 62 0.0013 20.3 1.1 14 268-281 4-17 (49)
135 PF10013 DUF2256: Uncharacteri 20.0 67 0.0015 19.4 1.2 17 269-285 9-25 (42)
No 1
>KOG3608|consensus
Probab=99.97 E-value=2.5e-31 Score=227.43 Aligned_cols=261 Identities=25% Similarity=0.469 Sum_probs=229.4
Q ss_pred CCceecCCCCCCCcccC-hHHHHHHHHhcC-----------------C-----------------CCcccCccCCCCCCc
Q psy11069 108 QQLWSCPIVNCNFTSVN-LNTFKIHLLKHF-----------------D-----------------KKPFKCKLTTANNTP 152 (391)
Q Consensus 108 ~~~~~C~~~~C~~~f~~-~~~l~~H~~~h~-----------------~-----------------~~~~~C~~~~~~~~~ 152 (391)
...+.|.|..|++...+ ...|.+|+-.|- + ...|.|.+ ..
T Consensus 67 ~de~qC~w~~C~f~~~~~s~~l~RHvy~H~y~~~l~q~G~~al~~~~dig~c~~~f~~~~~ip~~g~~f~C~W-----ed 141 (467)
T KOG3608|consen 67 TDEHQCTWNSCDFRTENSSADLIRHVYFHCYHTKLKQQGKLALDLHPDIGACTAPFRLMEKIPALGQNFRCGW-----ED 141 (467)
T ss_pred ccceeEEeccCCccccchHHHHHhhhhhhhhHHHHHHHHHHHHhcCCCcCcccCCcchhhccccchhhhccCh-----hh
Confidence 44799999999999887 588999987652 0 01244544 55
Q ss_pred cccccCCHHHHHHHHhhcCC------------CCCcccccccccccccCCHHHHHHHHHHcCCCCccccccccccccccC
Q psy11069 153 CMWGFFSKHKLLRHMTSHDE------------DKKCYPCDQKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQT 220 (391)
Q Consensus 153 C~~~f~~~~~l~~H~~~h~~------------~~~~~~c~c~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~ 220 (391)
|+..|.+...|..|+..|.. ++..+.|.|..|-+.|.+++.|+.|++.|.+++...|+. |+..|++
T Consensus 142 Ce~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~--Cg~~F~~ 219 (467)
T KOG3608|consen 142 CEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPH--CGELFRT 219 (467)
T ss_pred cCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecch--HHHHhcc
Confidence 99999999999999876642 224578999999999999999999999999999999999 9999999
Q ss_pred chHHHHHHhHhcC-CCCCeeccCCCCCccccchhhHhccccccCCCCCccccCcccccCCChhHHHHHHHHhcCCCCccc
Q psy11069 221 MKQYSTHLKEHSN-VSAPYMCDYKGCGKSFYLMASLKSHQRVHVTNPEDLVCKGCGKQFKVPCRLREHYKAVHESTKNFK 299 (391)
Q Consensus 221 ~~~l~~H~~~h~~-~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~ 299 (391)
...|..|++..+. ...+|.|.. |.|.|.+...|..|+..|. ..|+|+.|+.+....++|..|++..|...+||+
T Consensus 220 ~tkl~DH~rRqt~l~~n~fqC~~--C~KrFaTeklL~~Hv~rHv---n~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfK 294 (467)
T KOG3608|consen 220 KTKLFDHLRRQTELNTNSFQCAQ--CFKRFATEKLLKSHVVRHV---NCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFK 294 (467)
T ss_pred ccHHHHHHHhhhhhcCCchHHHH--HHHHHhHHHHHHHHHHHhh---hcccccccccCCCChHHHHHHHHhhhccCCCcc
Confidence 9999999986542 245899996 9999999999999999994 469999999999999999999999999999999
Q ss_pred cccCCcCccccCHHHHHHhhhccCCCCCccCCCCCCCCcccCChHHHHHHHHHhc-CC-CCceeccCcCcccccccchhH
Q psy11069 300 CTHDGCKLLFSSKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHN-EN-KPKFTCPYQDCLMSYVAKSSL 377 (391)
Q Consensus 300 C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~L~~H~~~h~-~~-~~~~~C~~~~C~~~f~~~~~L 377 (391)
|.. |++.|.+.+.|.+|.. .|+ +-.|.|..++|.++|.+...|.+|++.++ |. .++|.|-. |++.|++-.+|
T Consensus 295 Cd~--Cd~~c~~esdL~kH~~-~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~--Cdr~ft~G~~L 368 (467)
T KOG3608|consen 295 CDE--CDTRCVRESDLAKHVQ-VHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHC--CDRFFTSGKSL 368 (467)
T ss_pred ccc--hhhhhccHHHHHHHHH-hcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeec--chhhhccchhH
Confidence 977 9999999999999999 688 77899999999999999999999999887 32 34799999 99999999999
Q ss_pred HHHHHhhcC
Q psy11069 378 YAHLKHVHS 386 (391)
Q Consensus 378 ~~H~~~~H~ 386 (391)
.+|++..|+
T Consensus 369 ~~HL~kkH~ 377 (467)
T KOG3608|consen 369 SAHLMKKHG 377 (467)
T ss_pred HHHHHHhhc
Confidence 999999986
No 2
>KOG3608|consensus
Probab=99.96 E-value=2.2e-29 Score=215.51 Aligned_cols=233 Identities=23% Similarity=0.489 Sum_probs=206.2
Q ss_pred CCCCceecCCCCCCCcccChHHHHHHHHhcCC------------CC-cccCccCCCCCCccccccCCHHHHHHHHhhcCC
Q psy11069 106 KKQQLWSCPIVNCNFTSVNLNTFKIHLLKHFD------------KK-PFKCKLTTANNTPCMWGFFSKHKLLRHMTSHDE 172 (391)
Q Consensus 106 ~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~------------~~-~~~C~~~~~~~~~C~~~f~~~~~l~~H~~~h~~ 172 (391)
.....|.|.|.+|+..|.+...|..|+..|.. ++ .+.|.+.+ |-+.|.++..|++|++.|.+
T Consensus 130 ~~g~~f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~-----Ct~~~~~k~~LreH~r~Hs~ 204 (467)
T KOG3608|consen 130 ALGQNFRCGWEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAM-----CTKHMGNKYRLREHIRTHSN 204 (467)
T ss_pred cchhhhccChhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchh-----hhhhhccHHHHHHHHHhcCC
Confidence 34567999999999999999999999988742 22 25676544 99999999999999999999
Q ss_pred CCCcccccccccccccCCHHHHHHHHHHcC--CCCccccccccccccccCchHHHHHHhHhcCCCCCeeccCCCCCcccc
Q psy11069 173 DKKCYPCDQKNCNKRFTTLSNLKMHMVRHG--KPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGCGKSFY 250 (391)
Q Consensus 173 ~~~~~~c~c~~C~~~f~~~~~l~~H~~~h~--~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~ 250 (391)
+ +...|. .||..|.++..|..|++..+ ...+|.|.. |.+.|.+...|..|+..|.. .|+|++ |+.+..
T Consensus 205 e-KvvACp--~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~--C~KrFaTeklL~~Hv~rHvn---~ykCpl--Cdmtc~ 274 (467)
T KOG3608|consen 205 E-KVVACP--HCGELFRTKTKLFDHLRRQTELNTNSFQCAQ--CFKRFATEKLLKSHVVRHVN---CYKCPL--CDMTCS 274 (467)
T ss_pred C-eEEecc--hHHHHhccccHHHHHHHhhhhhcCCchHHHH--HHHHHhHHHHHHHHHHHhhh---cccccc--cccCCC
Confidence 8 544555 99999999999999998754 456899999 99999999999999999974 799997 999999
Q ss_pred chhhHhccccccCCCCCccccCcccccCCChhHHHHHHHHhcCCCCccccccCCcCccccCHHHHHHhhhccCCCC--Cc
Q psy11069 251 LMASLKSHQRVHVTNPEDLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTHDGCKLLFSSKSALKRHNKSKHLNM--RM 328 (391)
Q Consensus 251 ~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~--~~ 328 (391)
..++|..|++..|...+||+|..|++.|.+.+.|.+|.. +|+ +..|+|..++|.++|.+...|++|++.+|.+. .+
T Consensus 275 ~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~-~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~ 352 (467)
T KOG3608|consen 275 SASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQ-VHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPIL 352 (467)
T ss_pred ChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHH-hcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCc
Confidence 999999999998788999999999999999999999988 898 66899999999999999999999999888654 56
Q ss_pred cCCCCCCCCcccCChHHHHHHHHHhcCCC
Q psy11069 329 FPCPLVTCGKSFLRSEHLQEHMLQHNENK 357 (391)
Q Consensus 329 ~~C~~~~C~~~f~~~~~L~~H~~~h~~~~ 357 (391)
|.|.+ |++.|++..+|.+|++.-++-+
T Consensus 353 Y~CH~--Cdr~ft~G~~L~~HL~kkH~f~ 379 (467)
T KOG3608|consen 353 YACHC--CDRFFTSGKSLSAHLMKKHGFR 379 (467)
T ss_pred eeeec--chhhhccchhHHHHHHHhhccc
Confidence 99998 9999999999999998766543
No 3
>KOG1074|consensus
Probab=99.95 E-value=2.4e-29 Score=237.32 Aligned_cols=196 Identities=23% Similarity=0.432 Sum_probs=156.0
Q ss_pred ccccccccCCHHHHHHHHHHcCCCCccccccccccccccCchHHHHHHhHhcCC---CCCeecc---CCCCCccccchhh
Q psy11069 181 QKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSNV---SAPYMCD---YKGCGKSFYLMAS 254 (391)
Q Consensus 181 c~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~---~~~~~C~---~~~C~~~f~~~~~ 254 (391)
|-+|.++...+++|+.|+++|+|++||+|.+ |++.|.++.+|+.|+-.|... ..++.|+ + |-+.|...-.
T Consensus 608 CiiC~rVlSC~saLqmHyrtHtGERPFkCKi--CgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~i--c~~kftn~V~ 683 (958)
T KOG1074|consen 608 CIICLRVLSCPSALQMHYRTHTGERPFKCKI--CGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFI--CQKKFTNAVT 683 (958)
T ss_pred eeeeeecccchhhhhhhhhcccCcCcccccc--ccchhccccchhhcccccccCccccccccCCchhh--hccccccccc
Confidence 8899999999999999999999999999999 999999999999999988642 2458888 7 9999999999
Q ss_pred HhccccccCCCCC------------ccccCcccccCCChhHHHHHHHHhcC----------------CCC----cccccc
Q psy11069 255 LKSHQRVHVTNPE------------DLVCKGCGKQFKVPCRLREHYKAVHE----------------STK----NFKCTH 302 (391)
Q Consensus 255 l~~H~~~h~~~~~------------~~~C~~C~~~f~~~~~l~~H~~~~H~----------------~~~----~~~C~~ 302 (391)
|..|+++|.++.. .-+|..|.+.|.....+..++. .|. .+. +..+..
T Consensus 684 lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~s-e~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~ 762 (958)
T KOG1074|consen 684 LPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQIS-EQPSPESEPDEQMDERTETEELDVTPPPPENS 762 (958)
T ss_pred ccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhh-ccCCcccCCcccccccccccccccCCCccccc
Confidence 9999999963322 2469999999988888888865 441 111 334433
Q ss_pred CCcCccccCHHHHHHhhhcc----------------------CCC-----------------------------------
Q psy11069 303 DGCKLLFSSKSALKRHNKSK----------------------HLN----------------------------------- 325 (391)
Q Consensus 303 ~~C~~~f~~~~~l~~H~~~~----------------------H~~----------------------------------- 325 (391)
|+..+.....+..+-.+. +++
T Consensus 763 --~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg 840 (958)
T KOG1074|consen 763 --CGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEG 840 (958)
T ss_pred --cccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhcccccccccc
Confidence 666665544443332110 000
Q ss_pred ------------------------------------CCccCCCCCCCCcccCChHHHHHHHHHhcCCCCceeccCcCccc
Q psy11069 326 ------------------------------------MRMFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLM 369 (391)
Q Consensus 326 ------------------------------------~~~~~C~~~~C~~~f~~~~~L~~H~~~h~~~~~~~~C~~~~C~~ 369 (391)
.....|.+ |++.|...++|..|+|+|+++|| |.|.. |++
T Consensus 841 ~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~v--Cgk~FsSSsALqiH~rTHtg~KP-F~C~f--C~~ 915 (958)
T KOG1074|consen 841 LATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNV--CGKQFSSSAALEIHMRTHTGPKP-FFCHF--CEE 915 (958)
T ss_pred cccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhcc--chhcccchHHHHHhhhcCCCCCC-ccchh--hhh
Confidence 01267998 99999999999999999999995 99999 999
Q ss_pred ccccchhHHHHHHhhcCCC
Q psy11069 370 SYVAKSSLYAHLKHVHSTN 388 (391)
Q Consensus 370 ~f~~~~~L~~H~~~~H~~~ 388 (391)
.|.++.+|+.||.+|+...
T Consensus 916 aFttrgnLKvHMgtH~w~q 934 (958)
T KOG1074|consen 916 AFTTRGNLKVHMGTHMWVQ 934 (958)
T ss_pred hhhhhhhhhhhhccccccC
Confidence 9999999999999998654
No 4
>KOG2462|consensus
Probab=99.94 E-value=2.8e-27 Score=198.05 Aligned_cols=134 Identities=34% Similarity=0.649 Sum_probs=105.0
Q ss_pred CeeccCCCCCccccchhhHhccccccCC--CCCccccCcccccCCChhHHHHHHHHhcCCCCccccccCCcCccccCHHH
Q psy11069 237 PYMCDYKGCGKSFYLMASLKSHQRVHVT--NPEDLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTHDGCKLLFSSKSA 314 (391)
Q Consensus 237 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~--~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~ 314 (391)
.|.|. +|++.+.+..+|.+|..+|-. ..+.+.|+.||+.|.+...|..|++ +|+ -+++|.+ ||+.|...+-
T Consensus 130 r~~c~--eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHir-TH~--l~c~C~i--CGKaFSRPWL 202 (279)
T KOG2462|consen 130 RYKCP--ECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIR-THT--LPCECGI--CGKAFSRPWL 202 (279)
T ss_pred ceecc--ccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhh-ccC--CCccccc--ccccccchHH
Confidence 35554 355555555555555555522 3466888888888888888888887 776 5688877 8888888888
Q ss_pred HHHhhhccCCCCCccCCCCCCCCcccCChHHHHHHHHHhcCCCCceeccCcCcccccccchhHHHHHHh
Q psy11069 315 LKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSYVAKSSLYAHLKH 383 (391)
Q Consensus 315 l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~L~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~L~~H~~~ 383 (391)
|+-|+|+ |+|||||.|+. |+++|.++++|+.||++|.+.| .|+|.. |+|+|..++.|.+|...
T Consensus 203 LQGHiRT-HTGEKPF~C~h--C~kAFADRSNLRAHmQTHS~~K-~~qC~~--C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 203 LQGHIRT-HTGEKPFSCPH--CGKAFADRSNLRAHMQTHSDVK-KHQCPR--CGKSFALKSYLNKHSES 265 (279)
T ss_pred hhccccc-ccCCCCccCCc--ccchhcchHHHHHHHHhhcCCc-cccCcc--hhhHHHHHHHHHHhhhh
Confidence 8888885 88888888887 9999999889999999888888 588988 99999888888888753
No 5
>KOG2462|consensus
Probab=99.92 E-value=3e-26 Score=191.86 Aligned_cols=106 Identities=37% Similarity=0.752 Sum_probs=78.8
Q ss_pred CCCeeccCCCCCccccchhhHhccccccCCCCCccccCcccccCCChhHHHHHHHHhcCCCCccccccCCcCccccCHHH
Q psy11069 235 SAPYMCDYKGCGKSFYLMASLKSHQRVHVTNPEDLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTHDGCKLLFSSKSA 314 (391)
Q Consensus 235 ~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~ 314 (391)
.+.+.|.+ |++.|.+...|..|+++|. -+.+|.+|||.|.....|..|+| .|+|||||.|+. |++.|.++++
T Consensus 159 ~ka~~C~~--C~K~YvSmpALkMHirTH~---l~c~C~iCGKaFSRPWLLQGHiR-THTGEKPF~C~h--C~kAFADRSN 230 (279)
T KOG2462|consen 159 KKAFSCKY--CGKVYVSMPALKMHIRTHT---LPCECGICGKAFSRPWLLQGHIR-THTGEKPFSCPH--CGKAFADRSN 230 (279)
T ss_pred cccccCCC--CCceeeehHHHhhHhhccC---CCcccccccccccchHHhhcccc-cccCCCCccCCc--ccchhcchHH
Confidence 34567765 7777777777777777772 46777777777777777777777 777777777776 7777777777
Q ss_pred HHHhhhccCCCCCccCCCCCCCCcccCChHHHHHHHH
Q psy11069 315 LKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHML 351 (391)
Q Consensus 315 l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~L~~H~~ 351 (391)
|+.||++ |.+.|.|+|.. |+|.|...+.|.+|..
T Consensus 231 LRAHmQT-HS~~K~~qC~~--C~KsFsl~SyLnKH~E 264 (279)
T KOG2462|consen 231 LRAHMQT-HSDVKKHQCPR--CGKSFALKSYLNKHSE 264 (279)
T ss_pred HHHHHHh-hcCCccccCcc--hhhHHHHHHHHHHhhh
Confidence 7777774 77777777776 7777777777777754
No 6
>KOG3623|consensus
Probab=99.86 E-value=2e-22 Score=187.72 Aligned_cols=111 Identities=30% Similarity=0.619 Sum_probs=96.1
Q ss_pred CCCceecCCCCCCCcccChHHHHHHHHh-c-CCCCcccCccCCCCCCccccccCCHHHHHHHHhhcCCCC----------
Q psy11069 107 KQQLWSCPIVNCNFTSVNLNTFKIHLLK-H-FDKKPFKCKLTTANNTPCMWGFFSKHKLLRHMTSHDEDK---------- 174 (391)
Q Consensus 107 ~~~~~~C~~~~C~~~f~~~~~l~~H~~~-h-~~~~~~~C~~~~~~~~~C~~~f~~~~~l~~H~~~h~~~~---------- 174 (391)
-..+.+|++ |...+.....|+.|++- | ..+..|.|.+ |...|..+..|.+||.+|....
T Consensus 207 fsqlltcpy--cdrgykrltslkeHikyrhekne~nfsC~l-------CsytFAyRtQLErhm~~hkpg~dqa~sltqsa 277 (1007)
T KOG3623|consen 207 FSQLLTCPY--CDRGYKRLTSLKEHIKYRHEKNEPNFSCML-------CSYTFAYRTQLERHMQLHKPGGDQAISLTQSA 277 (1007)
T ss_pred hhhhhcchh--HHHHHHHHHHHHHHHHHHHhhCCCCCcchh-------hhhhhhhHHHHHHHHHhhcCCCcccccccchh
Confidence 356689996 99999999999999875 4 3456699999 9999999999999999886421
Q ss_pred --CcccccccccccccCCHHHHHHHHHHcCCCCccccccccccccccCchHHHHHHhH
Q psy11069 175 --KCYPCDQKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTHLKE 230 (391)
Q Consensus 175 --~~~~c~c~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~ 230 (391)
+.|+|. .|+|.|..+..|+.|+|+|.|++||.|.. |.+.|.....+..|+-.
T Consensus 278 ~lRKFKCt--ECgKAFKfKHHLKEHlRIHSGEKPfeCpn--CkKRFSHSGSySSHmSS 331 (1007)
T KOG3623|consen 278 LLRKFKCT--ECGKAFKFKHHLKEHLRIHSGEKPFECPN--CKKRFSHSGSYSSHMSS 331 (1007)
T ss_pred hhcccccc--ccchhhhhHHHHHhhheeecCCCCcCCcc--cccccccCCcccccccc
Confidence 225555 99999999999999999999999999999 99999999999999754
No 7
>KOG3623|consensus
Probab=99.86 E-value=4.7e-22 Score=185.28 Aligned_cols=109 Identities=28% Similarity=0.549 Sum_probs=94.2
Q ss_pred cccCccCCCCCCccccccCCHHHHHHHHhhcCCC-CCcccccccccccccCCHHHHHHHHHHcCC-------------CC
Q psy11069 140 PFKCKLTTANNTPCMWGFFSKHKLLRHMTSHDED-KKCYPCDQKNCNKRFTTLSNLKMHMVRHGK-------------PL 205 (391)
Q Consensus 140 ~~~C~~~~~~~~~C~~~f~~~~~l~~H~~~h~~~-~~~~~c~c~~C~~~f~~~~~l~~H~~~h~~-------------~~ 205 (391)
...|++ |++.+.....|+.|++.-+.. ...|.|. .|.++|..+..|.+|+.+|.. .+
T Consensus 210 lltcpy-------cdrgykrltslkeHikyrhekne~nfsC~--lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lR 280 (1007)
T KOG3623|consen 210 LLTCPY-------CDRGYKRLTSLKEHIKYRHEKNEPNFSCM--LCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLR 280 (1007)
T ss_pred hhcchh-------HHHHHHHHHHHHHHHHHHHhhCCCCCcch--hhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhc
Confidence 367888 999999999999999765443 3446655 999999999999999999853 35
Q ss_pred ccccccccccccccCchHHHHHHhHhcCCCCCeeccCCCCCccccchhhHhcccccc
Q psy11069 206 TLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGCGKSFYLMASLKSHQRVH 262 (391)
Q Consensus 206 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h 262 (391)
.|+|.+ |++.|..+..|+.|+++|.| ++||.|+ .|+|.|....++..|+...
T Consensus 281 KFKCtE--CgKAFKfKHHLKEHlRIHSG-EKPfeCp--nCkKRFSHSGSySSHmSSK 332 (1007)
T KOG3623|consen 281 KFKCTE--CGKAFKFKHHLKEHLRIHSG-EKPFECP--NCKKRFSHSGSYSSHMSSK 332 (1007)
T ss_pred cccccc--cchhhhhHHHHHhhheeecC-CCCcCCc--ccccccccCCccccccccc
Confidence 699999 99999999999999999997 9999998 5999999999999997643
No 8
>KOG1074|consensus
Probab=99.78 E-value=4.4e-20 Score=175.19 Aligned_cols=108 Identities=27% Similarity=0.555 Sum_probs=79.9
Q ss_pred cccCcccccCCChhHHHHHHHHhcCCCCccccccCCcCccccCHHHHHHhhhccCCCC----CccCCC---CCCCCcccC
Q psy11069 269 LVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTHDGCKLLFSSKSALKRHNKSKHLNM----RMFPCP---LVTCGKSFL 341 (391)
Q Consensus 269 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~----~~~~C~---~~~C~~~f~ 341 (391)
-.|-+|.++..-++.|+.|++ .|+|+|||+|.+ ||+.|.++.+|+.||- +|... -.|.|+ + |-+.|.
T Consensus 606 NqCiiC~rVlSC~saLqmHyr-tHtGERPFkCKi--CgRAFtTkGNLkaH~~-vHka~p~~R~q~ScP~~~i--c~~kft 679 (958)
T KOG1074|consen 606 NQCIICLRVLSCPSALQMHYR-THTGERPFKCKI--CGRAFTTKGNLKAHMS-VHKAKPPARVQFSCPSTFI--CQKKFT 679 (958)
T ss_pred cceeeeeecccchhhhhhhhh-cccCcCcccccc--ccchhccccchhhccc-ccccCccccccccCCchhh--hccccc
Confidence 568888888888888888877 788888888877 8888888888888887 46543 346677 7 888888
Q ss_pred ChHHHHHHHHHhcCCC-C-----------ceeccCcCcccccccchhHHHHHHhh
Q psy11069 342 RSEHLQEHMLQHNENK-P-----------KFTCPYQDCLMSYVAKSSLYAHLKHV 384 (391)
Q Consensus 342 ~~~~L~~H~~~h~~~~-~-----------~~~C~~~~C~~~f~~~~~L~~H~~~~ 384 (391)
....|..|++.|.+.. + .=+|+. |.+.|.....+..++-.+
T Consensus 680 n~V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~--~qk~~~~a~~f~~~~se~ 732 (958)
T KOG1074|consen 680 NAVTLPQHIRIHLGGQISNGGTAAEGILAADQCSS--CQKTFSDARSFSQQISEQ 732 (958)
T ss_pred ccccccceEEeecCCCCCCCcccccccchhcccch--hhhcccccccchhhhhcc
Confidence 8888888888876321 1 235777 888887777777776665
No 9
>KOG3576|consensus
Probab=99.70 E-value=7.2e-18 Score=134.32 Aligned_cols=114 Identities=30% Similarity=0.595 Sum_probs=94.8
Q ss_pred CCccccCcccccCCChhHHHHHHHHhcCCCCccccccCCcCccccCHHHHHHhhhccCCCCCccCCCCCCCCcccCChHH
Q psy11069 266 PEDLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTHDGCKLLFSSKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEH 345 (391)
Q Consensus 266 ~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~ 345 (391)
...|.|.+|++.|.....|.+|++ -|...+.|-|.. ||+.|.....|++|+++ |++.+||+|.. |+++|+++-+
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~k-ch~~vkr~lct~--cgkgfndtfdlkrh~rt-htgvrpykc~~--c~kaftqrcs 188 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLK-CHSDVKRHLCTF--CGKGFNDTFDLKRHTRT-HTGVRPYKCSL--CEKAFTQRCS 188 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhh-hccHHHHHHHhh--ccCcccchhhhhhhhcc-ccCccccchhh--hhHHHHhhcc
Confidence 346888888888888888888887 888888888887 88888888888888884 88888888887 8888888888
Q ss_pred HHHHHHHhcCCC----------CceeccCcCcccccccchhHHHHHHhhcCC
Q psy11069 346 LQEHMLQHNENK----------PKFTCPYQDCLMSYVAKSSLYAHLKHVHST 387 (391)
Q Consensus 346 L~~H~~~h~~~~----------~~~~C~~~~C~~~f~~~~~L~~H~~~~H~~ 387 (391)
|..|++..+|.. ..|.|.. ||.+-.....+..|++.||+.
T Consensus 189 leshl~kvhgv~~~yaykerr~kl~vced--cg~t~~~~e~~~~h~~~~hp~ 238 (267)
T KOG3576|consen 189 LESHLKKVHGVQHQYAYKERRAKLYVCED--CGYTSERPEVYYLHLKLHHPF 238 (267)
T ss_pred HHHHHHHHcCchHHHHHHHhhhheeeecc--cCCCCCChhHHHHHHHhcCCC
Confidence 888887766532 1588977 998888888888899888875
No 10
>KOG3576|consensus
Probab=99.60 E-value=2.2e-16 Score=125.90 Aligned_cols=115 Identities=29% Similarity=0.614 Sum_probs=74.8
Q ss_pred CCeeccCCCCCccccchhhHhccccccCCCCCccccCcccccCCChhHHHHHHHHhcCCCCccccccCCcCccccCHHHH
Q psy11069 236 APYMCDYKGCGKSFYLMASLKSHQRVHVTNPEDLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTHDGCKLLFSSKSAL 315 (391)
Q Consensus 236 ~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l 315 (391)
..|.|.+ |++.|.....|.+|++-| ...+.|.|..||+.|...-.|++|++ .|+|.+||+|.. |+++|..+-+|
T Consensus 116 d~ftCrv--CgK~F~lQRmlnrh~kch-~~vkr~lct~cgkgfndtfdlkrh~r-thtgvrpykc~~--c~kaftqrcsl 189 (267)
T KOG3576|consen 116 DSFTCRV--CGKKFGLQRMLNRHLKCH-SDVKRHLCTFCGKGFNDTFDLKRHTR-THTGVRPYKCSL--CEKAFTQRCSL 189 (267)
T ss_pred Ceeeeeh--hhhhhhHHHHHHHHhhhc-cHHHHHHHhhccCcccchhhhhhhhc-cccCccccchhh--hhHHHHhhccH
Confidence 3466665 666666666666666666 44455666666666666666666665 666666666655 66666666666
Q ss_pred HHhhhccCC----------CCCccCCCCCCCCcccCChHHHHHHHHHhcCCCC
Q psy11069 316 KRHNKSKHL----------NMRMFPCPLVTCGKSFLRSEHLQEHMLQHNENKP 358 (391)
Q Consensus 316 ~~H~~~~H~----------~~~~~~C~~~~C~~~f~~~~~L~~H~~~h~~~~~ 358 (391)
..|++.+|. ..+.|.|.- ||..-.....+..|+..|+..-|
T Consensus 190 eshl~kvhgv~~~yaykerr~kl~vced--cg~t~~~~e~~~~h~~~~hp~Sp 240 (267)
T KOG3576|consen 190 ESHLKKVHGVQHQYAYKERRAKLYVCED--CGYTSERPEVYYLHLKLHHPFSP 240 (267)
T ss_pred HHHHHHHcCchHHHHHHHhhhheeeecc--cCCCCCChhHHHHHHHhcCCCCH
Confidence 666665554 245677874 88887778888888877766543
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.06 E-value=9.8e-10 Score=104.53 Aligned_cols=150 Identities=23% Similarity=0.523 Sum_probs=114.9
Q ss_pred ccccccccccccccCchHHHHHHhHhcCCCCCeeccCCCCCccccchhhHhccccccCCCCCccccCcccccCCChhHHH
Q psy11069 206 TLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGCGKSFYLMASLKSHQRVHVTNPEDLVCKGCGKQFKVPCRLR 285 (391)
Q Consensus 206 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~ 285 (391)
...|.. |..... ...|..|..... ...-.|+..+|+..|. +..+..| +.|+.|++.|. ...|.
T Consensus 407 ~V~C~N--C~~~i~-l~~l~lHe~~C~--r~~V~Cp~~~Cg~v~~-r~el~~H----------~~C~~Cgk~f~-~s~Le 469 (567)
T PLN03086 407 TVECRN--CKHYIP-SRSIALHEAYCS--RHNVVCPHDGCGIVLR-VEEAKNH----------VHCEKCGQAFQ-QGEME 469 (567)
T ss_pred eEECCC--CCCccc-hhHHHHHHhhCC--CcceeCCcccccceee-ccccccC----------ccCCCCCCccc-hHHHH
Confidence 346887 877765 456668875543 3445676335999883 3333333 58999999996 67899
Q ss_pred HHHHHhcCCCCccccccCCcCccccCHHHHHHhhhccCCCCCccCCCCCCCCcccC----------ChHHHHHHHHHhcC
Q psy11069 286 EHYKAVHESTKNFKCTHDGCKLLFSSKSALKRHNKSKHLNMRMFPCPLVTCGKSFL----------RSEHLQEHMLQHNE 355 (391)
Q Consensus 286 ~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~----------~~~~L~~H~~~h~~ 355 (391)
.|++ .|. +++.|+ |++.+ .+..|..|+. .|...+++.|.+ |++.|. ....|..|.... +
T Consensus 470 kH~~-~~H--kpv~Cp---Cg~~~-~R~~L~~H~~-thCp~Kpi~C~f--C~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G 538 (567)
T PLN03086 470 KHMK-VFH--EPLQCP---CGVVL-EKEQMVQHQA-STCPLRLITCRF--CGDMVQAGGSAMDVRDRLRGMSEHESIC-G 538 (567)
T ss_pred HHHH-hcC--CCccCC---CCCCc-chhHHHhhhh-ccCCCCceeCCC--CCCccccCccccchhhhhhhHHHHHHhc-C
Confidence 9988 553 789995 99755 6799999998 599999999998 999995 245899999986 8
Q ss_pred CCCceeccCcCcccccccchhHHHHHHhhcCC
Q psy11069 356 NKPKFTCPYQDCLMSYVAKSSLYAHLKHVHST 387 (391)
Q Consensus 356 ~~~~~~C~~~~C~~~f~~~~~L~~H~~~~H~~ 387 (391)
.+ ++.|.. ||+.+.. ..|..|+-..|..
T Consensus 539 ~r-t~~C~~--Cgk~Vrl-rdm~~H~~~~h~~ 566 (567)
T PLN03086 539 SR-TAPCDS--CGRSVML-KEMDIHQIAVHQK 566 (567)
T ss_pred Cc-ceEccc--cCCeeee-hhHHHHHHHhhcC
Confidence 88 599988 9999875 5688999988864
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.03 E-value=8e-10 Score=105.12 Aligned_cols=146 Identities=20% Similarity=0.484 Sum_probs=109.8
Q ss_pred ccccccccCCHHHHHHHHHHcCCCCccccccccccccccCchHHHHHHhHhcCCCCCeeccCCCCCccccchhhHhcccc
Q psy11069 181 QKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGCGKSFYLMASLKSHQR 260 (391)
Q Consensus 181 c~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~l~~H~~ 260 (391)
|+.|..... ...|..|...... ....|+..+|+..|.. ..+ +..+.|.. |++.|. ...|..|+.
T Consensus 410 C~NC~~~i~-l~~l~lHe~~C~r-~~V~Cp~~~Cg~v~~r-~el----------~~H~~C~~--Cgk~f~-~s~LekH~~ 473 (567)
T PLN03086 410 CRNCKHYIP-SRSIALHEAYCSR-HNVVCPHDGCGIVLRV-EEA----------KNHVHCEK--CGQAFQ-QGEMEKHMK 473 (567)
T ss_pred CCCCCCccc-hhHHHHHHhhCCC-cceeCCcccccceeec-ccc----------ccCccCCC--CCCccc-hHHHHHHHH
Confidence 447877654 4556688765432 3456774459998843 222 34468986 999996 678999999
Q ss_pred ccCCCCCccccCcccccCCChhHHHHHHHHhcCCCCccccccCCcCcccc----------CHHHHHHhhhccCCCCCccC
Q psy11069 261 VHVTNPEDLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTHDGCKLLFS----------SKSALKRHNKSKHLNMRMFP 330 (391)
Q Consensus 261 ~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~----------~~~~l~~H~~~~H~~~~~~~ 330 (391)
+++ .++.|+ ||+.+ .+..|..|+. .|...+++.|.+ |++.|. ....|..|... .+.+++.
T Consensus 474 ~~H---kpv~Cp-Cg~~~-~R~~L~~H~~-thCp~Kpi~C~f--C~~~v~~g~~~~d~~d~~s~Lt~HE~~--CG~rt~~ 543 (567)
T PLN03086 474 VFH---EPLQCP-CGVVL-EKEQMVQHQA-STCPLRLITCRF--CGDMVQAGGSAMDVRDRLRGMSEHESI--CGSRTAP 543 (567)
T ss_pred hcC---CCccCC-CCCCc-chhHHHhhhh-ccCCCCceeCCC--CCCccccCccccchhhhhhhHHHHHHh--cCCcceE
Confidence 984 689999 99755 6789999976 899999999998 999985 24589999884 4899999
Q ss_pred CCCCCCCcccCChHHHHHHHHHhcC
Q psy11069 331 CPLVTCGKSFLRSEHLQEHMLQHNE 355 (391)
Q Consensus 331 C~~~~C~~~f~~~~~L~~H~~~h~~ 355 (391)
|.. ||+.|..+ +|..|+...|.
T Consensus 544 C~~--Cgk~Vrlr-dm~~H~~~~h~ 565 (567)
T PLN03086 544 CDS--CGRSVMLK-EMDIHQIAVHQ 565 (567)
T ss_pred ccc--cCCeeeeh-hHHHHHHHhhc
Confidence 987 99988765 57888876543
No 13
>PHA00733 hypothetical protein
Probab=99.02 E-value=3.4e-10 Score=87.74 Aligned_cols=82 Identities=22% Similarity=0.357 Sum_probs=52.4
Q ss_pred CCccccccCCcCccccCHHHHHHh------hhccCCCCCccCCCCCCCCcccCChHHHHHHHHHhcCCCCceeccCcCcc
Q psy11069 295 TKNFKCTHDGCKLLFSSKSALKRH------NKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCL 368 (391)
Q Consensus 295 ~~~~~C~~~~C~~~f~~~~~l~~H------~~~~H~~~~~~~C~~~~C~~~f~~~~~L~~H~~~h~~~~~~~~C~~~~C~ 368 (391)
.+++.|.+ |+..|.....|..| +. +.+.+||.|+. |++.|.+...|..|++.| +. +|.|+. |+
T Consensus 38 ~~~~~~~~--~~~~~~~~~~l~~~~~l~~~~~--~~~~kPy~C~~--Cgk~Fss~s~L~~H~r~h--~~-~~~C~~--Cg 106 (128)
T PHA00733 38 QKRLIRAV--VKTLIYNPQLLDESSYLYKLLT--SKAVSPYVCPL--CLMPFSSSVSLKQHIRYT--EH-SKVCPV--CG 106 (128)
T ss_pred hhhHHHHH--HhhhccChhhhcchHHHHhhcc--cCCCCCccCCC--CCCcCCCHHHHHHHHhcC--Cc-CccCCC--CC
Confidence 34455544 55555544443333 22 23466777776 777777777777777765 23 377777 77
Q ss_pred cccccchhHHHHHHhhcCC
Q psy11069 369 MSYVAKSSLYAHLKHVHST 387 (391)
Q Consensus 369 ~~f~~~~~L~~H~~~~H~~ 387 (391)
+.|.....|..|++..|+.
T Consensus 107 K~F~~~~sL~~H~~~~h~~ 125 (128)
T PHA00733 107 KEFRNTDSTLDHVCKKHNI 125 (128)
T ss_pred CccCCHHHHHHHHHHhcCc
Confidence 7777777777777777765
No 14
>PHA00733 hypothetical protein
Probab=98.99 E-value=6.9e-10 Score=86.04 Aligned_cols=85 Identities=22% Similarity=0.405 Sum_probs=65.6
Q ss_pred CCCccccCcccccCCChhHHHHH--HH--HhcCCCCccccccCCcCccccCHHHHHHhhhccCCCCCccCCCCCCCCccc
Q psy11069 265 NPEDLVCKGCGKQFKVPCRLREH--YK--AVHESTKNFKCTHDGCKLLFSSKSALKRHNKSKHLNMRMFPCPLVTCGKSF 340 (391)
Q Consensus 265 ~~~~~~C~~C~~~f~~~~~l~~H--~~--~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f 340 (391)
..+++.|.+|...|.....|..| +. ..+.+.+||.|+. |++.|.+...|..|++. | ..+|.|.. |++.|
T Consensus 37 ~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~--Cgk~Fss~s~L~~H~r~-h--~~~~~C~~--CgK~F 109 (128)
T PHA00733 37 EQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPL--CLMPFSSSVSLKQHIRY-T--EHSKVCPV--CGKEF 109 (128)
T ss_pred hhhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCC--CCCcCCCHHHHHHHHhc-C--CcCccCCC--CCCcc
Confidence 34678888888888877766665 11 1334578899977 99999999999999883 5 35689997 99999
Q ss_pred CChHHHHHHHHHhcCC
Q psy11069 341 LRSEHLQEHMLQHNEN 356 (391)
Q Consensus 341 ~~~~~L~~H~~~h~~~ 356 (391)
.....|.+|+...|+.
T Consensus 110 ~~~~sL~~H~~~~h~~ 125 (128)
T PHA00733 110 RNTDSTLDHVCKKHNI 125 (128)
T ss_pred CCHHHHHHHHHHhcCc
Confidence 9999999999887653
No 15
>PHA02768 hypothetical protein; Provisional
Probab=98.72 E-value=6.9e-09 Score=65.86 Aligned_cols=44 Identities=18% Similarity=0.579 Sum_probs=36.2
Q ss_pred ccCCCCCCCCcccCChHHHHHHHHHhcCCCCceeccCcCcccccccchhHH
Q psy11069 328 MFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSYVAKSSLY 378 (391)
Q Consensus 328 ~~~C~~~~C~~~f~~~~~L~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~L~ 378 (391)
.|.|+. ||+.|+....|..|+++|+ + +|+|.. |++.|.+.+.|.
T Consensus 5 ~y~C~~--CGK~Fs~~~~L~~H~r~H~--k-~~kc~~--C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPI--CGEIYIKRKSMITHLRKHN--T-NLKLSN--CKRISLRTGEYI 48 (55)
T ss_pred ccCcch--hCCeeccHHHHHHHHHhcC--C-cccCCc--ccceecccceeE
Confidence 478887 9999999999999999988 4 488887 999888877664
No 16
>KOG3993|consensus
Probab=98.66 E-value=5.7e-09 Score=93.09 Aligned_cols=200 Identities=17% Similarity=0.232 Sum_probs=115.6
Q ss_pred ccccccccccccCCHHHHHHHHHHcCCCCccccccccccccccCchHHHHHHhHhcCCCCCeeccCCCCCccccchhhHh
Q psy11069 177 YPCDQKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGCGKSFYLMASLK 256 (391)
Q Consensus 177 ~~c~c~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~l~ 256 (391)
|.|. +|...|.+...|.+|.-...-...|+|.+ |++.|.-..+|..|.+.|......-.=..+-=.+...+....+
T Consensus 268 yiCq--LCK~kYeD~F~LAQHrC~RIV~vEYrCPE--C~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ 343 (500)
T KOG3993|consen 268 YICQ--LCKEKYEDAFALAQHRCPRIVHVEYRCPE--CDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQ 343 (500)
T ss_pred HHHH--HHHHhhhhHHHHhhccCCeeEEeeecCCc--ccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhh
Confidence 5565 88888888888888864444445688888 8888888888888888886421111000000000001111111
Q ss_pred cccccc-CCCCCccccCcccccCCChhHHHHHHHHhcCCCCc----------------cccccCCcCccccCHHHHHHhh
Q psy11069 257 SHQRVH-VTNPEDLVCKGCGKQFKVPCRLREHYKAVHESTKN----------------FKCTHDGCKLLFSSKSALKRHN 319 (391)
Q Consensus 257 ~H~~~h-~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~----------------~~C~~~~C~~~f~~~~~l~~H~ 319 (391)
.-.+.- -.....|.|..|++.|+....|+.|+...|..... +-|.. |.-.+.....--.+.
T Consensus 344 ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~--~a~h~~a~~~~g~~v 421 (500)
T KOG3993|consen 344 EAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQA--VATHSSASDSHGDEV 421 (500)
T ss_pred hccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccc--cccccccccccccce
Confidence 111100 01234799999999999999999997633322211 11222 332222221111111
Q ss_pred hccCCCCCccCCCCCCCCcccCChHHHHHHHHHhcCCCCceeccCcCcccccccchhHHHHHHhhcCC
Q psy11069 320 KSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSYVAKSSLYAHLKHVHST 387 (391)
Q Consensus 320 ~~~H~~~~~~~C~~~~C~~~f~~~~~L~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~L~~H~~~~H~~ 387 (391)
..+-.......|++ |+..+.++..--.+.+.-..+. .|.|.+ |.-+|.+...|.+|+..-|..
T Consensus 422 l~~a~sael~~pp~--~~~ppsss~~sgg~~rlg~~~q-~f~~ky--~~atfyss~~ltrhin~~Hps 484 (500)
T KOG3993|consen 422 LYVAGSAELELPPY--DGSPPSSSGSSGGYGRLGIAEQ-GFTCKY--CPATFYSSPGLTRHINKCHPS 484 (500)
T ss_pred eeeeccccccCCCC--CCCCcccCCCCCccccccchhh-cccccc--chHhhhcCcchHhHhhhcChH
Confidence 11001122345777 8888887776666666555556 599999 999999999999999887753
No 17
>PHA02768 hypothetical protein; Provisional
Probab=98.64 E-value=1.3e-08 Score=64.67 Aligned_cols=42 Identities=21% Similarity=0.434 Sum_probs=27.1
Q ss_pred cccCcccccCCChhHHHHHHHHhcCCCCccccccCCcCccccCHHHH
Q psy11069 269 LVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTHDGCKLLFSSKSAL 315 (391)
Q Consensus 269 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l 315 (391)
|.|+.||+.|...+.|..|++ +|+ ++|+|.. |++.|...+.|
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r-~H~--k~~kc~~--C~k~f~~~s~l 47 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLR-KHN--TNLKLSN--CKRISLRTGEY 47 (55)
T ss_pred cCcchhCCeeccHHHHHHHHH-hcC--CcccCCc--ccceeccccee
Confidence 566666666666666666666 565 4666655 66666665554
No 18
>KOG3993|consensus
Probab=98.55 E-value=1.2e-08 Score=90.98 Aligned_cols=25 Identities=28% Similarity=0.677 Sum_probs=13.1
Q ss_pred ccccccccccccccCchHHHHHHhHhc
Q psy11069 206 TLCCEELGCGRKFQTMKQYSTHLKEHS 232 (391)
Q Consensus 206 ~~~C~~~~C~~~f~~~~~l~~H~~~h~ 232 (391)
.|.|.. |++.|.....|+.|+.+|+
T Consensus 356 i~~C~~--C~KkFrRqAYLrKHqlthq 380 (500)
T KOG3993|consen 356 IFSCHT--CGKKFRRQAYLRKHQLTHQ 380 (500)
T ss_pred eeecHH--hhhhhHHHHHHHHhHHhhh
Confidence 355555 5555555555555555444
No 19
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.48 E-value=1.3e-07 Score=51.17 Aligned_cols=26 Identities=42% Similarity=0.979 Sum_probs=21.6
Q ss_pred HHHHHHHHhcCCCCceeccCcCccccccc
Q psy11069 345 HLQEHMLQHNENKPKFTCPYQDCLMSYVA 373 (391)
Q Consensus 345 ~L~~H~~~h~~~~~~~~C~~~~C~~~f~~ 373 (391)
+|.+|+++|++++| |.|+. |+++|.+
T Consensus 1 ~l~~H~~~H~~~k~-~~C~~--C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKP-YKCPY--CGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSS-EEESS--SSEEESS
T ss_pred CHHHHhhhcCCCCC-CCCCC--CcCeeCc
Confidence 47889999999884 99988 9988863
No 20
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.28 E-value=4.9e-07 Score=48.93 Aligned_cols=26 Identities=31% Similarity=0.700 Sum_probs=22.6
Q ss_pred HHHHHHHhcCCCCcccCccCCCCCCccccccCC
Q psy11069 127 TFKIHLLKHFDKKPFKCKLTTANNTPCMWGFFS 159 (391)
Q Consensus 127 ~l~~H~~~h~~~~~~~C~~~~~~~~~C~~~f~~ 159 (391)
+|.+|+++|++++||.|++ |++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~-------C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPY-------CGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESS-------SSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCC-------CcCeeCc
Confidence 4789999999999999998 9998863
No 21
>PHA00616 hypothetical protein
Probab=98.15 E-value=1.1e-06 Score=53.09 Aligned_cols=34 Identities=29% Similarity=0.674 Sum_probs=25.2
Q ss_pred ccCCCCCCCCcccCChHHHHHHHHHhcCCCCceeccC
Q psy11069 328 MFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPY 364 (391)
Q Consensus 328 ~~~C~~~~C~~~f~~~~~L~~H~~~h~~~~~~~~C~~ 364 (391)
||+|+. ||+.|.+.+.|.+|++.||++++ +.|+.
T Consensus 1 pYqC~~--CG~~F~~~s~l~~H~r~~hg~~~-~~~~~ 34 (44)
T PHA00616 1 MYQCLR--CGGIFRKKKEVIEHLLSVHKQNK-LTLEY 34 (44)
T ss_pred CCccch--hhHHHhhHHHHHHHHHHhcCCCc-cceeE
Confidence 577776 88888888888888877777773 77654
No 22
>PHA00616 hypothetical protein
Probab=98.15 E-value=9.4e-07 Score=53.34 Aligned_cols=31 Identities=32% Similarity=0.535 Sum_probs=28.6
Q ss_pred ceeccCcCcccccccchhHHHHHHhhcCCCCCC
Q psy11069 359 KFTCPYQDCLMSYVAKSSLYAHLKHVHSTNKIT 391 (391)
Q Consensus 359 ~~~C~~~~C~~~f~~~~~L~~H~~~~H~~~~~~ 391 (391)
+|+|+. ||+.|.+++.|.+|++.||++++.+
T Consensus 1 pYqC~~--CG~~F~~~s~l~~H~r~~hg~~~~~ 31 (44)
T PHA00616 1 MYQCLR--CGGIFRKKKEVIEHLLSVHKQNKLT 31 (44)
T ss_pred CCccch--hhHHHhhHHHHHHHHHHhcCCCccc
Confidence 499999 9999999999999999999998753
No 23
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.04 E-value=9.5e-06 Score=52.78 Aligned_cols=51 Identities=33% Similarity=0.690 Sum_probs=33.1
Q ss_pred cCCCCCCCCcccCChHHHHHHHHH-hcCCCCceeccCcCcccccccchhHHHHHHhhcC
Q psy11069 329 FPCPLVTCGKSFLRSEHLQEHMLQ-HNENKPKFTCPYQDCLMSYVAKSSLYAHLKHVHS 386 (391)
Q Consensus 329 ~~C~~~~C~~~f~~~~~L~~H~~~-h~~~~~~~~C~~~~C~~~f~~~~~L~~H~~~~H~ 386 (391)
|.|++ |++ .-+...|..|... |..+...+.||+ |...+. .+|..|+..+|+
T Consensus 3 f~CP~--C~~-~~~~~~L~~H~~~~H~~~~~~v~CPi--C~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 3 FTCPY--CGK-GFSESSLVEHCEDEHRSESKNVVCPI--CSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred cCCCC--CCC-ccCHHHHHHHHHhHCcCCCCCccCCC--chhhhh--hHHHHHHHHhcC
Confidence 67777 777 4445677777654 333333477777 777644 377778777774
No 24
>PHA00732 hypothetical protein
Probab=98.03 E-value=4e-06 Score=58.79 Aligned_cols=56 Identities=30% Similarity=0.660 Sum_probs=40.9
Q ss_pred ccccccCCcCccccCHHHHHHhhhccCCCCCccCCCCCCCCcccCChHHHHHHHHHhcCCCCceec
Q psy11069 297 NFKCTHDGCKLLFSSKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTC 362 (391)
Q Consensus 297 ~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~L~~H~~~h~~~~~~~~C 362 (391)
||.|.. |++.|.+...|..|++..|. ++.|+. |++.|. .|..|++++.+..-...|
T Consensus 1 py~C~~--Cgk~F~s~s~Lk~H~r~~H~---~~~C~~--CgKsF~---~l~~H~~~~~~~~~~~~~ 56 (79)
T PHA00732 1 MFKCPI--CGFTTVTLFALKQHARRNHT---LTKCPV--CNKSYR---RLNQHFYSQYDIEHLIYC 56 (79)
T ss_pred CccCCC--CCCccCCHHHHHHHhhcccC---CCccCC--CCCEeC---ChhhhhcccCCccceEEe
Confidence 578877 99999999999999874455 357887 999887 578888776654323334
No 25
>PHA00732 hypothetical protein
Probab=97.96 E-value=7.3e-06 Score=57.50 Aligned_cols=46 Identities=26% Similarity=0.423 Sum_probs=31.0
Q ss_pred ccccCcccccCCChhHHHHHHHHhcCCCCccccccCCcCccccCHHHHHHhhhc
Q psy11069 268 DLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTHDGCKLLFSSKSALKRHNKS 321 (391)
Q Consensus 268 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~ 321 (391)
||.|..||+.|.+...|..|++..|. ++.|+. |++.|. .|..|+.+
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~--CgKsF~---~l~~H~~~ 46 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPV--CNKSYR---RLNQHFYS 46 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCC--CCCEeC---Chhhhhcc
Confidence 46777777777777777777653354 356766 777776 36667654
No 26
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.95 E-value=1.4e-05 Score=51.98 Aligned_cols=50 Identities=26% Similarity=0.548 Sum_probs=25.1
Q ss_pred cccccCCcCccccCHHHHHHhhhccCCCC-CccCCCCCCCCcccCChHHHHHHHHHhc
Q psy11069 298 FKCTHDGCKLLFSSKSALKRHNKSKHLNM-RMFPCPLVTCGKSFLRSEHLQEHMLQHN 354 (391)
Q Consensus 298 ~~C~~~~C~~~f~~~~~l~~H~~~~H~~~-~~~~C~~~~C~~~f~~~~~L~~H~~~h~ 354 (391)
|.|++ |++ ..+...|..|....|..+ +.+.|++ |...+. .+|..|+..++
T Consensus 3 f~CP~--C~~-~~~~~~L~~H~~~~H~~~~~~v~CPi--C~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPY--CGK-GFSESSLVEHCEDEHRSESKNVVCPI--CSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCC--CCC-ccCHHHHHHHHHhHCcCCCCCccCCC--chhhhh--hHHHHHHHHhc
Confidence 45555 555 333445555555555443 3455555 555433 25555555543
No 27
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.64 E-value=1.6e-05 Score=68.83 Aligned_cols=72 Identities=17% Similarity=0.246 Sum_probs=37.8
Q ss_pred CCccccccccccccccCchHHHHHHhHhcCCCCCeeccCCCCCccccchhhHhccccccCCCCCccccCcccccCCChhH
Q psy11069 204 PLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGCGKSFYLMASLKSHQRVHVTNPEDLVCKGCGKQFKVPCR 283 (391)
Q Consensus 204 ~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~ 283 (391)
++||+|+++||++.+.+...|+.|+.--|...+...=+-++=-..| ....+||+|++|+|.|.+...
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F-------------~~~~KPYrCevC~KRYKNlNG 413 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIF-------------SAKDKPYRCEVCDKRYKNLNG 413 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccc-------------cccCCceeccccchhhccCcc
Confidence 3666666666666666666666666533211111100000000111 123467777777777777777
Q ss_pred HHHHH
Q psy11069 284 LREHY 288 (391)
Q Consensus 284 l~~H~ 288 (391)
|+-|.
T Consensus 414 LKYHr 418 (423)
T COG5189 414 LKYHR 418 (423)
T ss_pred ceecc
Confidence 77774
No 28
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.60 E-value=4.2e-05 Score=40.04 Aligned_cols=22 Identities=36% Similarity=0.854 Sum_probs=14.0
Q ss_pred eeccCcCcccccccchhHHHHHHh
Q psy11069 360 FTCPYQDCLMSYVAKSSLYAHLKH 383 (391)
Q Consensus 360 ~~C~~~~C~~~f~~~~~L~~H~~~ 383 (391)
|.|+. |++.|.++..|.+|++.
T Consensus 1 y~C~~--C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPI--CGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETT--TTEEESSHHHHHHHHHH
T ss_pred CCCCC--CCCccCCHHHHHHHHhH
Confidence 45665 66666666666666665
No 29
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.60 E-value=3.4e-05 Score=66.85 Aligned_cols=75 Identities=27% Similarity=0.480 Sum_probs=42.0
Q ss_pred CCccccccCCcCccccCHHHHHHhhhccCCCCCccCCCCCCCCcccCChHHHHHHHHHhcCCCCceeccCcCcccccccc
Q psy11069 295 TKNFKCTHDGCKLLFSSKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSYVAK 374 (391)
Q Consensus 295 ~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~L~~H~~~h~~~~~~~~C~~~~C~~~f~~~ 374 (391)
+|||+|++++|.+.++....|+.|+..-|...+...=+-++=-..| -...| ||+|++ |+|+|...
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F------------~~~~K-PYrCev--C~KRYKNl 411 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIF------------SAKDK-PYRCEV--CDKRYKNL 411 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccc------------cccCC-ceeccc--cchhhccC
Confidence 3666666666666666666666666533322222111110011111 12345 599988 99999988
Q ss_pred hhHHHHHHhh
Q psy11069 375 SSLYAHLKHV 384 (391)
Q Consensus 375 ~~L~~H~~~~ 384 (391)
..|+-|+..-
T Consensus 412 NGLKYHr~Hs 421 (423)
T COG5189 412 NGLKYHRKHS 421 (423)
T ss_pred ccceeccccc
Confidence 8888886543
No 30
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.54 E-value=6e-05 Score=39.80 Aligned_cols=24 Identities=42% Similarity=0.837 Sum_probs=13.7
Q ss_pred eeccCcCcccccccchhHHHHHHhhc
Q psy11069 360 FTCPYQDCLMSYVAKSSLYAHLKHVH 385 (391)
Q Consensus 360 ~~C~~~~C~~~f~~~~~L~~H~~~~H 385 (391)
|.|++ |++.|.+...|..|++++|
T Consensus 1 ~~C~~--C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPI--CGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SS--TS-EESSHHHHHHHHHHHS
T ss_pred CCCcC--CCCcCCcHHHHHHHHHhhC
Confidence 45666 6666666666666666654
No 31
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.52 E-value=5.1e-05 Score=39.69 Aligned_cols=23 Identities=48% Similarity=1.123 Sum_probs=14.3
Q ss_pred cCCCCCCCCcccCChHHHHHHHHHh
Q psy11069 329 FPCPLVTCGKSFLRSEHLQEHMLQH 353 (391)
Q Consensus 329 ~~C~~~~C~~~f~~~~~L~~H~~~h 353 (391)
|.|+. |++.|.+...|.+|++.|
T Consensus 1 y~C~~--C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPI--CGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETT--TTEEESSHHHHHHHHHHH
T ss_pred CCCCC--CCCccCCHHHHHHHHhHC
Confidence 45664 666666666666666653
No 32
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.51 E-value=5.8e-05 Score=56.32 Aligned_cols=73 Identities=27% Similarity=0.389 Sum_probs=16.0
Q ss_pred ccCcccccCCChhHHHHHHHHhcCCCCccccccCCcCccccCHHHHHHhhhccCCCCCccCCCCCCCCcccCChHHHHHH
Q psy11069 270 VCKGCGKQFKVPCRLREHYKAVHESTKNFKCTHDGCKLLFSSKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEH 349 (391)
Q Consensus 270 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~L~~H 349 (391)
.|..|+..|.+...|..|+...|.-..+ ....+.....+..+++. .....+.|.+ |++.|.+...|..|
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~-------~~~~l~~~~~~~~~~~~--~~~~~~~C~~--C~~~f~s~~~l~~H 69 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIP-------DQKYLVDPNRLLNYLRK--KVKESFRCPY--CNKTFRSREALQEH 69 (100)
T ss_dssp ---------------------------------------------------------SSEEBSS--SS-EESSHHHHHHH
T ss_pred Cccccccccccccccccccccccccccc-------ccccccccccccccccc--ccCCCCCCCc--cCCCCcCHHHHHHH
Confidence 3667777777777777776655543221 11122233444444331 1222466776 77777777777777
Q ss_pred HHHh
Q psy11069 350 MLQH 353 (391)
Q Consensus 350 ~~~h 353 (391)
|+.+
T Consensus 70 m~~~ 73 (100)
T PF12756_consen 70 MRSK 73 (100)
T ss_dssp HHHT
T ss_pred HcCc
Confidence 7654
No 33
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.49 E-value=5.9e-05 Score=56.27 Aligned_cols=72 Identities=26% Similarity=0.465 Sum_probs=14.0
Q ss_pred cccCCcCccccCHHHHHHhhhccCCCCCccCCCCCCCCcccCChHHHHHHHHHhcCCCCceeccCcCcccccccchhHHH
Q psy11069 300 CTHDGCKLLFSSKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSYVAKSSLYA 379 (391)
Q Consensus 300 C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~L~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~L~~ 379 (391)
|.. |+..|.+...|..|+...|.-..+ ....+.....+..+++.-... .+.|.+ |++.|.+...|..
T Consensus 2 C~~--C~~~f~~~~~l~~H~~~~H~~~~~-------~~~~l~~~~~~~~~~~~~~~~--~~~C~~--C~~~f~s~~~l~~ 68 (100)
T PF12756_consen 2 CLF--CDESFSSVDDLLQHMKKKHGFDIP-------DQKYLVDPNRLLNYLRKKVKE--SFRCPY--CNKTFRSREALQE 68 (100)
T ss_dssp --------------------------------------------------------S--SEEBSS--SS-EESSHHHHHH
T ss_pred ccc--cccccccccccccccccccccccc-------cccccccccccccccccccCC--CCCCCc--cCCCCcCHHHHHH
Confidence 545 666666666666666555543221 111122333333443332222 266666 6666666666666
Q ss_pred HHHhh
Q psy11069 380 HLKHV 384 (391)
Q Consensus 380 H~~~~ 384 (391)
|++.+
T Consensus 69 Hm~~~ 73 (100)
T PF12756_consen 69 HMRSK 73 (100)
T ss_dssp HHHHT
T ss_pred HHcCc
Confidence 66644
No 34
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.33 E-value=0.00016 Score=38.13 Aligned_cols=24 Identities=50% Similarity=1.110 Sum_probs=14.9
Q ss_pred cCCCCCCCCcccCChHHHHHHHHHhc
Q psy11069 329 FPCPLVTCGKSFLRSEHLQEHMLQHN 354 (391)
Q Consensus 329 ~~C~~~~C~~~f~~~~~L~~H~~~h~ 354 (391)
|.|++ |++.|.+...|..|+++|+
T Consensus 1 ~~C~~--C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPI--CGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SS--TS-EESSHHHHHHHHHHHS
T ss_pred CCCcC--CCCcCCcHHHHHHHHHhhC
Confidence 45666 7777777777777776654
No 35
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.30 E-value=0.00015 Score=39.56 Aligned_cols=26 Identities=27% Similarity=0.456 Sum_probs=19.8
Q ss_pred ceeccCcCcccccccchhHHHHHHhhcC
Q psy11069 359 KFTCPYQDCLMSYVAKSSLYAHLKHVHS 386 (391)
Q Consensus 359 ~~~C~~~~C~~~f~~~~~L~~H~~~~H~ 386 (391)
+|+|.. |++.|.+..+|..|++.|++
T Consensus 1 ~~~C~~--C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDE--CGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETT--TTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCc--cCCccCChhHHHHHhHHhcC
Confidence 377877 88888888888888877764
No 36
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=97.16 E-value=0.00021 Score=62.78 Aligned_cols=131 Identities=26% Similarity=0.532 Sum_probs=74.6
Q ss_pred ccccccccccccccCchHHHHHHhHhcCCCCCeeccCCCCC---cccc------chhhHhccccccCCC---CCccccCc
Q psy11069 206 TLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGCG---KSFY------LMASLKSHQRVHVTN---PEDLVCKG 273 (391)
Q Consensus 206 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~---~~f~------~~~~l~~H~~~h~~~---~~~~~C~~ 273 (391)
.|.|+...|.........|+.|.+..|+ .+.|.+ |- +.|. ++..|..|...-..+ ..-=.|..
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~---~~~C~~--C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~F 225 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHG---FVLCSE--CIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIF 225 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcC---cEEhHh--hhcCcccCccceeeeecccccccccCCccccCcCCCchhhh
Confidence 4778877788888888889999888775 466765 52 2333 334455554432111 11124777
Q ss_pred ccccCCChhHHHHHHHHhcCCCCccccccCCcCcc-------ccCHHHHHHhhhccCCCCCccCCCCCCCC----cccCC
Q psy11069 274 CGKQFKVPCRLREHYKAVHESTKNFKCTHDGCKLL-------FSSKSALKRHNKSKHLNMRMFPCPLVTCG----KSFLR 342 (391)
Q Consensus 274 C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~-------f~~~~~l~~H~~~~H~~~~~~~C~~~~C~----~~f~~ 342 (391)
|...|-+-..|.+|++..|. .|-+ |++. |.+...|..|.+..| |.|.+..|- ..|..
T Consensus 226 C~~~FYdDDEL~~HcR~~HE-----~ChI--CD~v~p~~~QYFK~Y~~Le~HF~~~h-----y~ct~qtc~~~k~~vf~~ 293 (493)
T COG5236 226 CKIYFYDDDELRRHCRLRHE-----ACHI--CDMVGPIRYQYFKSYEDLEAHFRNAH-----YCCTFQTCRVGKCYVFPY 293 (493)
T ss_pred ccceecChHHHHHHHHhhhh-----hhhh--hhccCccchhhhhCHHHHHHHhhcCc-----eEEEEEEEecCcEEEecc
Confidence 77777777777777765553 3333 4332 555566666655322 455443342 24556
Q ss_pred hHHHHHHHHHh
Q psy11069 343 SEHLQEHMLQH 353 (391)
Q Consensus 343 ~~~L~~H~~~h 353 (391)
...|..|+..-
T Consensus 294 ~~el~~h~~~~ 304 (493)
T COG5236 294 HTELLEHLTRF 304 (493)
T ss_pred HHHHHHHHHHH
Confidence 66666665443
No 37
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=97.10 E-value=0.00079 Score=59.24 Aligned_cols=74 Identities=28% Similarity=0.683 Sum_probs=34.9
Q ss_pred ccccCCcCccccCHHHHHHhhhccCCCCCccCCCCCCCCc-------ccCChHHHHHHHHHhcCCCCceeccCcCcc---
Q psy11069 299 KCTHDGCKLLFSSKSALKRHNKSKHLNMRMFPCPLVTCGK-------SFLRSEHLQEHMLQHNENKPKFTCPYQDCL--- 368 (391)
Q Consensus 299 ~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~-------~f~~~~~L~~H~~~h~~~~~~~~C~~~~C~--- 368 (391)
.|.+ |...|.+-..|.+|++..|. .|-+ |++ -|.+..+|.+|.+.- .|.|.++.|.
T Consensus 222 ~C~F--C~~~FYdDDEL~~HcR~~HE-----~ChI--CD~v~p~~~QYFK~Y~~Le~HF~~~-----hy~ct~qtc~~~k 287 (493)
T COG5236 222 LCIF--CKIYFYDDDELRRHCRLRHE-----ACHI--CDMVGPIRYQYFKSYEDLEAHFRNA-----HYCCTFQTCRVGK 287 (493)
T ss_pred hhhh--ccceecChHHHHHHHHhhhh-----hhhh--hhccCccchhhhhCHHHHHHHhhcC-----ceEEEEEEEecCc
Confidence 3544 55555555555555554332 2332 322 344555555554421 2444444443
Q ss_pred -cccccchhHHHHHHhhcC
Q psy11069 369 -MSYVAKSSLYAHLKHVHS 386 (391)
Q Consensus 369 -~~f~~~~~L~~H~~~~H~ 386 (391)
..|.....|..|+.+.|+
T Consensus 288 ~~vf~~~~el~~h~~~~h~ 306 (493)
T COG5236 288 CYVFPYHTELLEHLTRFHK 306 (493)
T ss_pred EEEeccHHHHHHHHHHHhh
Confidence 345555555555555554
No 38
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.97 E-value=0.00034 Score=38.16 Aligned_cols=23 Identities=43% Similarity=0.826 Sum_probs=11.9
Q ss_pred cCCCCCCCCcccCChHHHHHHHHHh
Q psy11069 329 FPCPLVTCGKSFLRSEHLQEHMLQH 353 (391)
Q Consensus 329 ~~C~~~~C~~~f~~~~~L~~H~~~h 353 (391)
|.|.. |++.|.+...|..|++.|
T Consensus 2 ~~C~~--C~~~F~~~~~l~~H~~~h 24 (27)
T PF13912_consen 2 FECDE--CGKTFSSLSALREHKRSH 24 (27)
T ss_dssp EEETT--TTEEESSHHHHHHHHCTT
T ss_pred CCCCc--cCCccCChhHHHHHhHHh
Confidence 44554 555555555555555444
No 39
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.86 E-value=0.00073 Score=41.77 Aligned_cols=40 Identities=30% Similarity=0.561 Sum_probs=18.8
Q ss_pred HhhhccCCCCCccCCCCCCCCcccCChHHHHHHHHHhcCCCC
Q psy11069 317 RHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHNENKP 358 (391)
Q Consensus 317 ~H~~~~H~~~~~~~C~~~~C~~~f~~~~~L~~H~~~h~~~~~ 358 (391)
.+.+.++..+.|-.|++ |+..+.+..+|.+|+.++|+.+|
T Consensus 13 ~~~k~~~~S~~PatCP~--C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 13 KKPKSKSQSEQPATCPI--CGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp ---CCCCTTS--EE-TT--T--EESSHHHHHHHHHHHTTTS-
T ss_pred hHHHHhhccCCCCCCCc--chhhccchhhHHHHHHHHhcccC
Confidence 33443444555666665 66666666666666666665553
No 40
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.74 E-value=0.001 Score=35.03 Aligned_cols=24 Identities=46% Similarity=0.783 Sum_probs=15.9
Q ss_pred eeccCcCcccccccchhHHHHHHhhcC
Q psy11069 360 FTCPYQDCLMSYVAKSSLYAHLKHVHS 386 (391)
Q Consensus 360 ~~C~~~~C~~~f~~~~~L~~H~~~~H~ 386 (391)
|+|+. |++... +..|.+|++.+|+
T Consensus 1 y~C~~--C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPH--CSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SS--SS-EES-HHHHHHHHHHHHS
T ss_pred CCCCC--CCCcCC-HHHHHHHHHhhCc
Confidence 56777 777776 7777777777764
No 41
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.47 E-value=0.0016 Score=40.30 Aligned_cols=41 Identities=20% Similarity=0.282 Sum_probs=26.6
Q ss_pred HHHHHHHhcCCCCceeccCcCcccccccchhHHHHHHhhcCCC
Q psy11069 346 LQEHMLQHNENKPKFTCPYQDCLMSYVAKSSLYAHLKHVHSTN 388 (391)
Q Consensus 346 L~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~L~~H~~~~H~~~ 388 (391)
+..+.+.+.....|-.|++ |+..+.+..+|.+|+...|+.+
T Consensus 11 ~~~~~k~~~~S~~PatCP~--C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 11 STKKPKSKSQSEQPATCPI--CGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp -----CCCCTTS--EE-TT--T--EESSHHHHHHHHHHHTTTS
T ss_pred hhhHHHHhhccCCCCCCCc--chhhccchhhHHHHHHHHhccc
Confidence 3556666655555799999 9999999999999999999865
No 42
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.28 E-value=0.0052 Score=32.66 Aligned_cols=18 Identities=50% Similarity=0.883 Sum_probs=8.9
Q ss_pred CCcccCChHHHHHHHHHh
Q psy11069 336 CGKSFLRSEHLQEHMLQH 353 (391)
Q Consensus 336 C~~~f~~~~~L~~H~~~h 353 (391)
|++.|.+...|..|++.|
T Consensus 6 C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 6 CGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CcchhCCHHHHHHHHHHh
Confidence 555555555555554443
No 43
>KOG2231|consensus
Probab=96.18 E-value=0.01 Score=58.22 Aligned_cols=127 Identities=23% Similarity=0.382 Sum_probs=72.5
Q ss_pred eeccCCCCCcccc---------------chhhHhccccccCCCCCccccCccc----------ccCCChhHHHHHHHHhc
Q psy11069 238 YMCDYKGCGKSFY---------------LMASLKSHQRVHVTNPEDLVCKGCG----------KQFKVPCRLREHYKAVH 292 (391)
Q Consensus 238 ~~C~~~~C~~~f~---------------~~~~l~~H~~~h~~~~~~~~C~~C~----------~~f~~~~~l~~H~~~~H 292 (391)
+.|.+ |+..|. +...|+.|+...|. .+.|.+|- +.| +...|..|++.--
T Consensus 100 ~~C~~--C~~~~~~~~~~~~~~~c~~~~s~~~Lk~H~~~~H~---~~~c~lC~~~~kif~~e~k~Y-t~~el~~h~~~gd 173 (669)
T KOG2231|consen 100 HSCHI--CDRRFRALYNKKECLHCTEFKSVENLKNHMRDQHK---LHLCSLCLQNLKIFINERKLY-TRAELNLHLMFGD 173 (669)
T ss_pred hhcCc--cccchhhhcccCCCccccchhHHHHHHHHHHHhhh---hhccccccccceeeeeeeehe-hHHHHHHHHhcCC
Confidence 77886 888774 56677777744322 34454442 333 3556666654211
Q ss_pred CCCC----ccccccCCcCccccCHHHHHHhhhccCCCCCccCCCC----CCCCcccCChHHHHHHHHHhcCCCCceeccC
Q psy11069 293 ESTK----NFKCTHDGCKLLFSSKSALKRHNKSKHLNMRMFPCPL----VTCGKSFLRSEHLQEHMLQHNENKPKFTCPY 364 (391)
Q Consensus 293 ~~~~----~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~----~~C~~~f~~~~~L~~H~~~h~~~~~~~~C~~ 364 (391)
.+++ --.|.. |...|.....|.+|++..| |.|.+ ..++.-|.+..+|..|.|.+| |.|..
T Consensus 174 ~d~~s~rGhp~C~~--C~~~fld~~el~rH~~~~h-----~~chfC~~~~~~neyy~~~~dLe~HfR~~H-----flCE~ 241 (669)
T KOG2231|consen 174 PDDESCRGHPLCKF--CHERFLDDDELYRHLRFDH-----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH-----FLCEE 241 (669)
T ss_pred CccccccCCccchh--hhhhhccHHHHHHhhccce-----eheeecCcccccchhcccchHHHHHhhhcC-----ccccc
Confidence 1111 135766 8888888888888887544 45555 013456777788888887664 55552
Q ss_pred cCcc-cccccchhHHHHHH
Q psy11069 365 QDCL-MSYVAKSSLYAHLK 382 (391)
Q Consensus 365 ~~C~-~~f~~~~~L~~H~~ 382 (391)
+-|. +.|.....+..|++
T Consensus 242 ~~C~~~~f~~~~~~ei~lk 260 (669)
T KOG2231|consen 242 EFCRTKKFYVAFELEIELK 260 (669)
T ss_pred cccccceeeehhHHHHHHH
Confidence 2253 34444334455555
No 44
>KOG4173|consensus
Probab=96.13 E-value=0.0017 Score=52.94 Aligned_cols=84 Identities=23% Similarity=0.460 Sum_probs=51.9
Q ss_pred CeeccCCCCCccccchhhHhccccccCCCCCccccCcccccCCChhHHHHHHHHhc---------CCCCccccccCCcCc
Q psy11069 237 PYMCDYKGCGKSFYLMASLKSHQRVHVTNPEDLVCKGCGKQFKVPCRLREHYKAVH---------ESTKNFKCTHDGCKL 307 (391)
Q Consensus 237 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H---------~~~~~~~C~~~~C~~ 307 (391)
.|.|++.+|...|.+...+..|-.+.|++ .|.+|.+.|.+...|..|+...| .|.--|+|-.++|+.
T Consensus 79 ~~~cqvagc~~~~d~lD~~E~hY~~~h~~----sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~ 154 (253)
T KOG4173|consen 79 AFACQVAGCCQVFDALDDYEHHYHTLHGN----SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTE 154 (253)
T ss_pred cccccccchHHHHhhhhhHHHhhhhcccc----hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhh
Confidence 35555555666666555555555443332 46666666666666666665444 234457777777888
Q ss_pred cccCHHHHHHhhhccCC
Q psy11069 308 LFSSKSALKRHNKSKHL 324 (391)
Q Consensus 308 ~f~~~~~l~~H~~~~H~ 324 (391)
.|.+...-..|+...|.
T Consensus 155 KFkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 155 KFKTSRDRKDHMIRMHK 171 (253)
T ss_pred hhhhhhhhhhHHHHhcc
Confidence 88777777777765554
No 45
>KOG1146|consensus
Probab=96.07 E-value=0.0023 Score=66.26 Aligned_cols=56 Identities=23% Similarity=0.488 Sum_probs=42.3
Q ss_pred CCccCCCCCCCCcccCChHHHHHHHHH--hc----------------------------------------CCCCceecc
Q psy11069 326 MRMFPCPLVTCGKSFLRSEHLQEHMLQ--HN----------------------------------------ENKPKFTCP 363 (391)
Q Consensus 326 ~~~~~C~~~~C~~~f~~~~~L~~H~~~--h~----------------------------------------~~~~~~~C~ 363 (391)
.++|.|.. |...++....|..||.. |- ..+|.+.|.
T Consensus 516 ~~p~~C~~--C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~ 593 (1406)
T KOG1146|consen 516 GKPYPCRA--CNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCE 593 (1406)
T ss_pred CCccccee--eeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchh
Confidence 45688887 88888888888888754 11 124579999
Q ss_pred CcCcccccccchhHHHHHHhhc
Q psy11069 364 YQDCLMSYVAKSSLYAHLKHVH 385 (391)
Q Consensus 364 ~~~C~~~f~~~~~L~~H~~~~H 385 (391)
+ |++.-.-..+|..|+..-+
T Consensus 594 v--c~yetniarnlrihmtss~ 613 (1406)
T KOG1146|consen 594 V--CSYETNIARNLRIHMTASP 613 (1406)
T ss_pred h--hcchhhhhhccccccccCC
Confidence 9 9999988899999876544
No 46
>KOG4173|consensus
Probab=95.91 E-value=0.0053 Score=50.12 Aligned_cols=95 Identities=19% Similarity=0.373 Sum_probs=58.6
Q ss_pred CCceecCCCCCCCcccChHHHHHHHHhcCCCCcccCccCCCCCCccccccCCHHHHHHHHhhcCCCCCcccccccccccc
Q psy11069 108 QQLWSCPIVNCNFTSVNLNTFKIHLLKHFDKKPFKCKLTTANNTPCMWGFFSKHKLLRHMTSHDEDKKCYPCDQKNCNKR 187 (391)
Q Consensus 108 ~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~~~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~c~c~~C~~~ 187 (391)
-..|.|++..|...|.....+..|..+-++ -.|.. |.+.|.+...|..|+...|+.-
T Consensus 77 ~~~~~cqvagc~~~~d~lD~~E~hY~~~h~---~sCs~-------C~r~~Pt~hLLd~HI~E~HDs~------------- 133 (253)
T KOG4173|consen 77 VPAFACQVAGCCQVFDALDDYEHHYHTLHG---NSCSF-------CKRAFPTGHLLDAHILEWHDSL------------- 133 (253)
T ss_pred cccccccccchHHHHhhhhhHHHhhhhccc---chhHH-------HHHhCCchhhhhHHHHHHHHHH-------------
Confidence 345778777777777777777766644222 24666 7777777777777765544320
Q ss_pred cCCHHHHHHHHHHcCCCCccccccccccccccCchHHHHHHhHhcC
Q psy11069 188 FTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSN 233 (391)
Q Consensus 188 f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~ 233 (391)
| ....-.|...|.|-..||+..|.+...-..|+...|.
T Consensus 134 F--------qa~veRG~dMy~ClvEgCt~KFkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 134 F--------QALVERGQDMYQCLVEGCTEKFKTSRDRKDHMIRMHK 171 (253)
T ss_pred H--------HHHHHcCccHHHHHHHhhhhhhhhhhhhhhHHHHhcc
Confidence 0 0011224456777777788888887777777776654
No 47
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.90 E-value=0.0052 Score=32.67 Aligned_cols=24 Identities=29% Similarity=0.657 Sum_probs=21.2
Q ss_pred eeccCcCcccccccchhHHHHHHhhc
Q psy11069 360 FTCPYQDCLMSYVAKSSLYAHLKHVH 385 (391)
Q Consensus 360 ~~C~~~~C~~~f~~~~~L~~H~~~~H 385 (391)
|.|+. |++.|.+...|..|++.|.
T Consensus 1 ~~C~~--C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPE--CGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCC--CcchhCCHHHHHHHHHHhc
Confidence 57988 9999999999999999553
No 48
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.77 E-value=0.0071 Score=31.74 Aligned_cols=23 Identities=30% Similarity=0.756 Sum_probs=13.6
Q ss_pred cCCCCCCCCcccCChHHHHHHHHHhc
Q psy11069 329 FPCPLVTCGKSFLRSEHLQEHMLQHN 354 (391)
Q Consensus 329 ~~C~~~~C~~~f~~~~~L~~H~~~h~ 354 (391)
|+|+. |++... ...|.+|++.|+
T Consensus 1 y~C~~--C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPH--CSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SS--SS-EES-HHHHHHHHHHHH
T ss_pred CCCCC--CCCcCC-HHHHHHHHHhhC
Confidence 45665 776666 666777776654
No 49
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.75 E-value=0.0046 Score=32.83 Aligned_cols=15 Identities=40% Similarity=0.864 Sum_probs=6.3
Q ss_pred CCcccCChHHHHHHH
Q psy11069 336 CGKSFLRSEHLQEHM 350 (391)
Q Consensus 336 C~~~f~~~~~L~~H~ 350 (391)
|++.|.+...|..|+
T Consensus 6 C~~~f~s~~~~~~H~ 20 (25)
T PF12874_consen 6 CNKSFSSENSLRQHL 20 (25)
T ss_dssp TTEEESSHHHHHHHH
T ss_pred CCCCcCCHHHHHHHH
Confidence 444444444444443
No 50
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.62 E-value=0.0082 Score=31.82 Aligned_cols=23 Identities=35% Similarity=0.678 Sum_probs=21.0
Q ss_pred eeccCcCcccccccchhHHHHHHhh
Q psy11069 360 FTCPYQDCLMSYVAKSSLYAHLKHV 384 (391)
Q Consensus 360 ~~C~~~~C~~~f~~~~~L~~H~~~~ 384 (391)
|.|.+ |++.|.+...|..|++.+
T Consensus 1 ~~C~~--C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDI--CNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETT--TTEEESSHHHHHHHHTTH
T ss_pred CCCCC--CCCCcCCHHHHHHHHCcC
Confidence 67999 999999999999999864
No 51
>KOG2231|consensus
Probab=95.57 E-value=0.022 Score=55.97 Aligned_cols=18 Identities=28% Similarity=0.414 Sum_probs=8.5
Q ss_pred CCccccchhhHhcccccc
Q psy11069 245 CGKSFYLMASLKSHQRVH 262 (391)
Q Consensus 245 C~~~f~~~~~l~~H~~~h 262 (391)
|...|.....|.+|++.+
T Consensus 188 C~~~fld~~el~rH~~~~ 205 (669)
T KOG2231|consen 188 CHERFLDDDELYRHLRFD 205 (669)
T ss_pred hhhhhccHHHHHHhhccc
Confidence 444444444444444433
No 52
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.51 E-value=0.0034 Score=60.30 Aligned_cols=154 Identities=23% Similarity=0.298 Sum_probs=79.3
Q ss_pred ceecCCCCCCCcccChHHHHHHHH--hcCCC--CcccCc--cCCCCCCccccccCCHHHHHHHHhhcCCCCCcccccccc
Q psy11069 110 LWSCPIVNCNFTSVNLNTFKIHLL--KHFDK--KPFKCK--LTTANNTPCMWGFFSKHKLLRHMTSHDEDKKCYPCDQKN 183 (391)
Q Consensus 110 ~~~C~~~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~~~~~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~c~c~~ 183 (391)
++.|. .|...|.....|..|.+ .|.++ +++.|+ . |++.|.....+..|...|.+. ..+.+.-..
T Consensus 289 ~~~~~--~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~-------~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 358 (467)
T COG5048 289 PIKSK--QCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSL-------CGKLFSRNDALKRHILLHTSI-SPAKEKLLN 358 (467)
T ss_pred CCCCc--cccCCccccccccccccccccccccCCceeeeccC-------CCccccccccccCCcccccCC-Ccccccccc
Confidence 45555 36666666666666666 56666 666666 4 666666666666666666555 333333223
Q ss_pred cccccCCHHHHHHHHHHcCCCCccccccccccccccCchHHHHHHhHhcCCCCCeeccCCCCCccccchhhHhccccccC
Q psy11069 184 CNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGCGKSFYLMASLKSHQRVHV 263 (391)
Q Consensus 184 C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~ 263 (391)
+...+.....-..+ ..............+.+....|...+.....+..|...+.
T Consensus 359 ~~~~~~~~~~~~~~--------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 412 (467)
T COG5048 359 SSSKFSPLLNNEPP--------------------------QSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHL 412 (467)
T ss_pred CccccccccCCCCc--------------------------cchhhccCccCCccccccccchhhhhcccccccccccccc
Confidence 33333222111000 0000000011133444554456666666666666655552
Q ss_pred C-CCCccccCcccccCCChhHHHHHHHHhcCCCCcccc
Q psy11069 264 T-NPEDLVCKGCGKQFKVPCRLREHYKAVHESTKNFKC 300 (391)
Q Consensus 264 ~-~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C 300 (391)
. ....+.+..|.+.|.....+..|++ .|....++.|
T Consensus 413 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 449 (467)
T COG5048 413 SFRPYNCKNPPCSKSFNRHYNLIPHKK-IHTNHAPLLC 449 (467)
T ss_pred ccCCcCCCCCcchhhccCccccccccc-ccccCCceee
Confidence 2 1124556677777777777777765 5655555555
No 53
>KOG1146|consensus
Probab=95.43 E-value=0.0033 Score=65.16 Aligned_cols=132 Identities=19% Similarity=0.259 Sum_probs=88.9
Q ss_pred CCccccchhhHhccccccCCCCCccccCcccccCCChhHHHHHHHHhcC------------------------CCCcccc
Q psy11069 245 CGKSFYLMASLKSHQRVHVTNPEDLVCKGCGKQFKVPCRLREHYKAVHE------------------------STKNFKC 300 (391)
Q Consensus 245 C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~------------------------~~~~~~C 300 (391)
|+..+.....+..|+..-+...+.|+|+.|+..|.....|..|+|.+|. +.++|.|
T Consensus 442 ~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~~C 521 (1406)
T KOG1146|consen 442 AEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPYPC 521 (1406)
T ss_pred hhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCcccc
Confidence 5555666666666665544555677777777777777777777776552 2357888
Q ss_pred ccCCcCccccCHHHHHHhhhcc-CC-----------------------------------------CCCccCCCCCCCCc
Q psy11069 301 THDGCKLLFSSKSALKRHNKSK-HL-----------------------------------------NMRMFPCPLVTCGK 338 (391)
Q Consensus 301 ~~~~C~~~f~~~~~l~~H~~~~-H~-----------------------------------------~~~~~~C~~~~C~~ 338 (391)
.. |...+....+|.+|+.+- |. ....+.|.+ |++
T Consensus 522 ~~--C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~v--c~y 597 (1406)
T KOG1146|consen 522 RA--CNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEV--CSY 597 (1406)
T ss_pred ee--eeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhh--hcc
Confidence 76 888888888888887531 10 012488998 999
Q ss_pred ccCChHHHHHHHHHhcCCCCceeccCcCcccccccchhHHHHHH
Q psy11069 339 SFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSYVAKSSLYAHLK 382 (391)
Q Consensus 339 ~f~~~~~L~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~L~~H~~ 382 (391)
.-+-...|..||..-....++..|-. |+..+.....+..+.+
T Consensus 598 etniarnlrihmtss~~s~~p~~~Lq--~~it~~l~~~~~~~~~ 639 (1406)
T KOG1146|consen 598 ETNIARNLRIHMTASPSSSPPSLVLQ--QNITSSLASLLGGQGR 639 (1406)
T ss_pred hhhhhhccccccccCCCCCChHHHhh--hcchhhccccccCcCC
Confidence 99999999999976444444466666 7666665555554433
No 54
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=94.94 E-value=0.0088 Score=57.42 Aligned_cols=163 Identities=24% Similarity=0.371 Sum_probs=108.1
Q ss_pred cccccccccCCHHHHHHHHH--HcCCC--CccccccccccccccCchHHHHHHhHhcCCCCCeeccCCCCCccccchhhH
Q psy11069 180 DQKNCNKRFTTLSNLKMHMV--RHGKP--LTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGCGKSFYLMASL 255 (391)
Q Consensus 180 ~c~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~l 255 (391)
.|..|...|.....|..|.+ .|.++ +++.|....|++.|.....+..|...|.+ ..++.+....+...+.....-
T Consensus 291 ~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 369 (467)
T COG5048 291 KSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTS-ISPAKEKLLNSSSKFSPLLNN 369 (467)
T ss_pred CCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccC-CCccccccccCccccccccCC
Confidence 33378888888888888888 68888 78888822388888888888888888876 555666554565555544332
Q ss_pred hcccccc----CCCCCccccC--cccccCCChhHHHHHHHHhcCCCCccccccCCcCccccCHHHHHHhhhccCCCCCcc
Q psy11069 256 KSHQRVH----VTNPEDLVCK--GCGKQFKVPCRLREHYKAVHESTKNFKCTHDGCKLLFSSKSALKRHNKSKHLNMRMF 329 (391)
Q Consensus 256 ~~H~~~h----~~~~~~~~C~--~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~ 329 (391)
..+.... ......+.+. .|-..+.....+..|.. .|....++.|....|...|.....+..|++ .|....++
T Consensus 370 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 447 (467)
T COG5048 370 EPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHII-THLSFRPYNCKNPPCSKSFNRHYNLIPHKK-IHTNHAPL 447 (467)
T ss_pred CCccchhhccCccCCccccccccchhhhhccccccccccc-cccccCCcCCCCCcchhhccCccccccccc-ccccCCce
Confidence 1111111 1122334443 46677778888888865 777766555555569999999999999999 58777777
Q ss_pred CCCCCCCCcccCChHHHHH
Q psy11069 330 PCPLVTCGKSFLRSEHLQE 348 (391)
Q Consensus 330 ~C~~~~C~~~f~~~~~L~~ 348 (391)
.|. +-+.|.....+..
T Consensus 448 ~~~---~~~~~~~~~~~~~ 463 (467)
T COG5048 448 LCS---ILKSFRRDLDLSN 463 (467)
T ss_pred eec---cccccchhhhhhc
Confidence 775 4445554444443
No 55
>KOG2482|consensus
Probab=94.78 E-value=0.085 Score=46.86 Aligned_cols=77 Identities=27% Similarity=0.395 Sum_probs=42.6
Q ss_pred cccCcccccCCChhHHHHHHHHhcCCCCccccccCCcCccccCHHHHHHhhhccCCCCCccCCCCCCCCcccCChHHHHH
Q psy11069 269 LVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTHDGCKLLFSSKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQE 348 (391)
Q Consensus 269 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~L~~ 348 (391)
.+|-.|.....+...|..||+.+|.-.-. +=. ..=+..|..+-.+..-++.. ...-.|.. |.-.|.....|..
T Consensus 280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~-Ki~-sd~~Ln~YqrvrviNyiRkq---~~~~~c~~--cd~~F~~e~~l~~ 352 (423)
T KOG2482|consen 280 VVCLFCTNFYENPVFLFEHMKIVHEFDLL-KIQ-SDYSLNFYQRVRVINYIRKQ---KKKSRCAE--CDLSFWKEPGLLI 352 (423)
T ss_pred eEEEeeccchhhHHHHHHHHHHHHHhhHH-hhc-cccccchhhhhhHHHHHHHH---hhcccccc--ccccccCcchhhh
Confidence 57888888888888888888877742100 000 00122233333333332210 11234554 7778888888888
Q ss_pred HHHH
Q psy11069 349 HMLQ 352 (391)
Q Consensus 349 H~~~ 352 (391)
||..
T Consensus 353 hm~e 356 (423)
T KOG2482|consen 353 HMVE 356 (423)
T ss_pred hccc
Confidence 8864
No 56
>PRK04860 hypothetical protein; Provisional
Probab=94.43 E-value=0.027 Score=45.53 Aligned_cols=38 Identities=24% Similarity=0.535 Sum_probs=30.6
Q ss_pred ccccCcccccCCChhHHHHHHHHhcCCCCccccccCCcCccccCH
Q psy11069 268 DLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTHDGCKLLFSSK 312 (391)
Q Consensus 268 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~ 312 (391)
+|.|. |+. ....+++|.+ +|+++++|.|.. |+..|...
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~r-i~~g~~~YrC~~--C~~~l~~~ 156 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNR-VVRGEAVYRCRR--CGETLVFK 156 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHH-HhcCCccEECCC--CCceeEEe
Confidence 68897 887 6677888877 888888999977 88887654
No 57
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.01 E-value=0.015 Score=31.54 Aligned_cols=22 Identities=27% Similarity=0.590 Sum_probs=12.7
Q ss_pred eeccCcCcccccccchhHHHHHHh
Q psy11069 360 FTCPYQDCLMSYVAKSSLYAHLKH 383 (391)
Q Consensus 360 ~~C~~~~C~~~f~~~~~L~~H~~~ 383 (391)
|.|.. |++.|.+...|..|++.
T Consensus 2 ~~C~~--C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDA--CDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTT--TTBBBSSHHHHHCCTTS
T ss_pred CCccc--CCCCcCCHHHHHHHHcc
Confidence 45555 66666666666655543
No 58
>PRK04860 hypothetical protein; Provisional
Probab=93.84 E-value=0.038 Score=44.67 Aligned_cols=40 Identities=18% Similarity=0.368 Sum_probs=31.5
Q ss_pred CccCCCCCCCCcccCChHHHHHHHHHhcCCCCceeccCcCcccccccch
Q psy11069 327 RMFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSYVAKS 375 (391)
Q Consensus 327 ~~~~C~~~~C~~~f~~~~~L~~H~~~h~~~~~~~~C~~~~C~~~f~~~~ 375 (391)
-+|.|. |+. ....+.+|.++|++++ +|.|.. |+..|....
T Consensus 118 ~~Y~C~---C~~---~~~~~rrH~ri~~g~~-~YrC~~--C~~~l~~~~ 157 (160)
T PRK04860 118 FPYRCK---CQE---HQLTVRRHNRVVRGEA-VYRCRR--CGETLVFKG 157 (160)
T ss_pred EEEEcC---CCC---eeCHHHHHHHHhcCCc-cEECCC--CCceeEEec
Confidence 468886 877 6777889999999888 499988 988876543
No 59
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.67 E-value=0.05 Score=29.42 Aligned_cols=21 Identities=48% Similarity=0.865 Sum_probs=11.6
Q ss_pred cccccCCcCccccCHHHHHHhhh
Q psy11069 298 FKCTHDGCKLLFSSKSALKRHNK 320 (391)
Q Consensus 298 ~~C~~~~C~~~f~~~~~l~~H~~ 320 (391)
|.|.. |++.|.+...|..|++
T Consensus 2 ~~C~~--C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDA--CDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTT--TTBBBSSHHHHHCCTT
T ss_pred CCccc--CCCCcCCHHHHHHHHc
Confidence 44544 5555555555555554
No 60
>KOG2482|consensus
Probab=92.44 E-value=0.12 Score=45.85 Aligned_cols=156 Identities=22% Similarity=0.296 Sum_probs=90.6
Q ss_pred chHHHHHHhHhcCCCCCeeccCCCCCcccc-chhhHhcccc-ccCCC--------------------CCccccCcccccC
Q psy11069 221 MKQYSTHLKEHSNVSAPYMCDYKGCGKSFY-LMASLKSHQR-VHVTN--------------------PEDLVCKGCGKQF 278 (391)
Q Consensus 221 ~~~l~~H~~~h~~~~~~~~C~~~~C~~~f~-~~~~l~~H~~-~h~~~--------------------~~~~~C~~C~~~f 278 (391)
+..|.++++.-.+......|-+ |...+. .++....|+- .|.-. -..+.|-.|.+.|
T Consensus 128 ~eaLeqqQ~Eredt~fslqClF--Cn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekif 205 (423)
T KOG2482|consen 128 KEALEQQQKEREDTIFSLQCLF--CNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIF 205 (423)
T ss_pred HHHHHHHHHHhcCCeeeeEEEE--ecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeecccc
Confidence 3455666666555456678988 987655 4556666653 33211 1237899999999
Q ss_pred CChhHHHHHHHHh-cCCCCc--------cccccCCcCcccc-CHHHHH-----HhhhccCC---------CCCc--cCCC
Q psy11069 279 KVPCRLREHYKAV-HESTKN--------FKCTHDGCKLLFS-SKSALK-----RHNKSKHL---------NMRM--FPCP 332 (391)
Q Consensus 279 ~~~~~l~~H~~~~-H~~~~~--------~~C~~~~C~~~f~-~~~~l~-----~H~~~~H~---------~~~~--~~C~ 332 (391)
+.+..|+.||+.. |....| |.=.+..=|++.. ....+. .+.. +. +..+ ..|-
T Consensus 206 rdkntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E--~D~~wsDw~ed~a~a~~v~CL 283 (423)
T KOG2482|consen 206 RDKNTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINE--TDDTWSDWNEDDAEALSVVCL 283 (423)
T ss_pred CCcHHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccc--cccchhhhhcCCCCccceEEE
Confidence 9999999998742 322222 1111100111110 001110 1100 10 1112 5788
Q ss_pred CCCCCcccCChHHHHHHHHHhcCC--------------------------CCceeccCcCcccccccchhHHHHHHhh
Q psy11069 333 LVTCGKSFLRSEHLQEHMLQHNEN--------------------------KPKFTCPYQDCLMSYVAKSSLYAHLKHV 384 (391)
Q Consensus 333 ~~~C~~~f~~~~~L~~H~~~h~~~--------------------------~~~~~C~~~~C~~~f~~~~~L~~H~~~~ 384 (391)
+ |.....+...|..||.+.|.- +..-.|-. |...|.....|..||..+
T Consensus 284 f--C~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~--cd~~F~~e~~l~~hm~e~ 357 (423)
T KOG2482|consen 284 F--CTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAE--CDLSFWKEPGLLIHMVED 357 (423)
T ss_pred e--eccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhcccccc--ccccccCcchhhhhcccc
Confidence 8 999888899999999875420 00234655 999999999999998643
No 61
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=91.48 E-value=0.16 Score=26.83 Aligned_cols=20 Identities=30% Similarity=0.599 Sum_probs=11.3
Q ss_pred eeccCcCcccccccchhHHHHHH
Q psy11069 360 FTCPYQDCLMSYVAKSSLYAHLK 382 (391)
Q Consensus 360 ~~C~~~~C~~~f~~~~~L~~H~~ 382 (391)
..|+. ||+.| ....|.+|++
T Consensus 3 ~~C~~--CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 3 VPCPI--CGRKF-NPDRLEKHEK 22 (25)
T ss_pred CcCCC--CCCEE-CHHHHHHHHH
Confidence 44666 66666 3455666654
No 62
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=90.50 E-value=0.3 Score=25.83 Aligned_cols=19 Identities=42% Similarity=1.230 Sum_probs=11.3
Q ss_pred CCCCCCCCcccCChHHHHHHHH
Q psy11069 330 PCPLVTCGKSFLRSEHLQEHML 351 (391)
Q Consensus 330 ~C~~~~C~~~f~~~~~L~~H~~ 351 (391)
.|++ ||+.| ....|.+|+.
T Consensus 4 ~C~~--CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPI--CGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCC--CCCEE-CHHHHHHHHH
Confidence 4665 66666 5555666654
No 63
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=90.33 E-value=0.2 Score=28.81 Aligned_cols=22 Identities=36% Similarity=0.717 Sum_probs=14.2
Q ss_pred eeccCcCcccccccchhHHHHHHh
Q psy11069 360 FTCPYQDCLMSYVAKSSLYAHLKH 383 (391)
Q Consensus 360 ~~C~~~~C~~~f~~~~~L~~H~~~ 383 (391)
|.|.+ |++.|.+...+..|++.
T Consensus 4 ~~C~~--C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 4 FYCKL--CNVTFTDEISVEAHLKG 25 (35)
T ss_pred eEccc--cCCccCCHHHHHHHHCh
Confidence 66666 66666666666666543
No 64
>KOG4377|consensus
Probab=89.99 E-value=0.33 Score=44.42 Aligned_cols=128 Identities=20% Similarity=0.315 Sum_probs=72.7
Q ss_pred ccccccccccccccCchHHHHHHhHhcCCC-----------CCeeccCCCCCccccchhhHhccccccCC----C--CCc
Q psy11069 206 TLCCEELGCGRKFQTMKQYSTHLKEHSNVS-----------APYMCDYKGCGKSFYLMASLKSHQRVHVT----N--PED 268 (391)
Q Consensus 206 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~-----------~~~~C~~~~C~~~f~~~~~l~~H~~~h~~----~--~~~ 268 (391)
-|.|-...|...+-.+.++.+|.+.|...+ ..|.|-...|.+ +-++...|-..|.. + ..-
T Consensus 271 hyhcl~e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rfs~syhC~~~~C~k---sTsdV~~h~nFht~~~n~Gfrrth 347 (480)
T KOG4377|consen 271 HYHCLNEYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRFSNSYHCTGQICEK---STSDVLLHDNFHTDKRNNGFRRTH 347 (480)
T ss_pred hhcccCccccccccchhhhHHHHHHHhhcccccccchhhcCccchhhhcccCc---ccccccccCccccccccCceecce
Confidence 466666668887777888888888775422 236787777887 33445555555521 1 123
Q ss_pred cccCcccccCCChhHHHHHHHHhcCCC---------Cc---cccccCCcCccccCHHHHHHhhhccCCCCCccCCCCCCC
Q psy11069 269 LVCKGCGKQFKVPCRLREHYKAVHEST---------KN---FKCTHDGCKLLFSSKSALKRHNKSKHLNMRMFPCPLVTC 336 (391)
Q Consensus 269 ~~C~~C~~~f~~~~~l~~H~~~~H~~~---------~~---~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C 336 (391)
|-|..||-++..+ ...|.. .|-.. +. -.|.+++|.+.-. -..|.|.+++|
T Consensus 348 fhC~r~gCTdtfK--~~khk~-yh~kdda~~~dGfkkf~k~e~cay~gCkys~~---------------cnhfhc~r~Gc 409 (480)
T KOG4377|consen 348 FHCQRIGCTDTFK--DSKHKP-YHYKDDAGEIDGFKKFFKDENCAYTGCKYSGI---------------CNHFHCDRLGC 409 (480)
T ss_pred eEEeccCCccccc--cccccc-cccCcchhhhhhhhhhhccccCCccCcccccc---------------eeeeeecccCC
Confidence 6677777444433 333422 33211 11 1355555553311 11256777778
Q ss_pred CcccCChHHHHHHHHHhc
Q psy11069 337 GKSFLRSEHLQEHMLQHN 354 (391)
Q Consensus 337 ~~~f~~~~~L~~H~~~h~ 354 (391)
+..|.+.+.+..|.|.|.
T Consensus 410 ~~tl~s~sqm~shkrkhe 427 (480)
T KOG4377|consen 410 EATLYSVSQMASHKRKHE 427 (480)
T ss_pred ceEEEehhhhhhhhhhhh
Confidence 888888888888777764
No 65
>KOG4124|consensus
Probab=89.64 E-value=0.098 Score=46.59 Aligned_cols=56 Identities=5% Similarity=-0.037 Sum_probs=39.7
Q ss_pred CCCccccccccccccccCchHHHHHHh-----HhcCCCCCeeccCCCCCccccchhhHhcccccc
Q psy11069 203 KPLTLCCEELGCGRKFQTMKQYSTHLK-----EHSNVSAPYMCDYKGCGKSFYLMASLKSHQRVH 262 (391)
Q Consensus 203 ~~~~~~C~~~~C~~~f~~~~~l~~H~~-----~h~~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h 262 (391)
.|.++.|..++|.+..........|.. ... ..++|.|+ |++...++..|+.|-..-
T Consensus 175 EE~~~S~~vp~~~~~~~~~Ns~~~~S~~~~~T~~t-~~~p~k~~---~~~~~~T~~~l~~HS~N~ 235 (442)
T KOG4124|consen 175 EEYRVSVVVPAAAAAAAAANSSDMSSDEASSTAET-TGTPKKMP---ESLVMDTSSPLSDHSMNI 235 (442)
T ss_pred ccccccccCchhhhhhhcccccccccccccccccc-ccCCccCc---ccccccccchhhhccccC
Confidence 567888998888887665544444332 222 35789998 999999999998886543
No 66
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=89.61 E-value=0.27 Score=28.28 Aligned_cols=23 Identities=26% Similarity=0.564 Sum_probs=17.8
Q ss_pred ccCCCCCCCCcccCChHHHHHHHHH
Q psy11069 328 MFPCPLVTCGKSFLRSEHLQEHMLQ 352 (391)
Q Consensus 328 ~~~C~~~~C~~~f~~~~~L~~H~~~ 352 (391)
+|.|.+ |++.|.+...+..|++.
T Consensus 3 ~~~C~~--C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKL--CNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccc--cCCccCCHHHHHHHHCh
Confidence 477887 88888888888888753
No 67
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=89.19 E-value=0.65 Score=35.02 Aligned_cols=26 Identities=23% Similarity=0.533 Sum_probs=23.6
Q ss_pred ceec----cCcCcccccccchhHHHHHHhhcC
Q psy11069 359 KFTC----PYQDCLMSYVAKSSLYAHLKHVHS 386 (391)
Q Consensus 359 ~~~C----~~~~C~~~f~~~~~L~~H~~~~H~ 386 (391)
-|.| +. |++.+.+...+.+|++.+||
T Consensus 80 G~~C~~~~~~--C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 80 GYRCQCDPPH--CGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CeeeecCCCC--CCcEeccHHHHHHHHHHhcC
Confidence 4889 88 99999999999999999986
No 68
>KOG2893|consensus
Probab=88.84 E-value=0.12 Score=43.37 Aligned_cols=43 Identities=26% Similarity=0.508 Sum_probs=22.2
Q ss_pred CCcccCChHHHHHHHHHhcCCCCceeccCcCcccccccchhHHHHHHhhc
Q psy11069 336 CGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSYVAKSSLYAHLKHVH 385 (391)
Q Consensus 336 C~~~f~~~~~L~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~L~~H~~~~H 385 (391)
|++.|.+..-|.+|++.. .|+|-+ |.+...+-..|..|.-.+|
T Consensus 16 cnrefddekiliqhqkak-----hfkchi--chkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 16 CNREFDDEKILIQHQKAK-----HFKCHI--CHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred cccccchhhhhhhhhhhc-----cceeee--ehhhhccCCCceeehhhhh
Confidence 555555555555554432 255555 5555555555555544444
No 69
>KOG2785|consensus
Probab=88.50 E-value=1.3 Score=40.34 Aligned_cols=140 Identities=17% Similarity=0.334 Sum_probs=91.8
Q ss_pred CeeccCCCCCccccchhhHhccccc--cC----------------------------------CCCCccccCcccccCCC
Q psy11069 237 PYMCDYKGCGKSFYLMASLKSHQRV--HV----------------------------------TNPEDLVCKGCGKQFKV 280 (391)
Q Consensus 237 ~~~C~~~~C~~~f~~~~~l~~H~~~--h~----------------------------------~~~~~~~C~~C~~~f~~ 280 (391)
.|.|.. |...|.+....+.|.++ |. ..+.++.|..|.+.|.+
T Consensus 3 ~ftC~t--C~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~c~k~~~s 80 (390)
T KOG2785|consen 3 GFTCNT--CNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEACNKSFAS 80 (390)
T ss_pred cceeec--eeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHHhhccccC
Confidence 488986 99999999888888765 31 11235889999999999
Q ss_pred hhHHHHHHHHhcCCC----------------Cccc-------------cccCCcCccccCHHHHHHhhhc----------
Q psy11069 281 PCRLREHYKAVHEST----------------KNFK-------------CTHDGCKLLFSSKSALKRHNKS---------- 321 (391)
Q Consensus 281 ~~~l~~H~~~~H~~~----------------~~~~-------------C~~~~C~~~f~~~~~l~~H~~~---------- 321 (391)
......|+...-..+ .... +.. +...+........+...
T Consensus 81 ~~a~~~hl~Sk~h~~~~~~~~r~~e~d~a~~~q~~~~~p~~l~~~~e~e~~--~~E~~~~~d~~~e~~~dd~~Edi~~d~ 158 (390)
T KOG2785|consen 81 PKAHENHLKSKKHVENLSNHQRSEEGDSAKISQLPSRRPSNLQNKGESELK--WYEVDSDEDSSEEEEEDDEEEDIEEDG 158 (390)
T ss_pred hhhHHHHHHHhhcchhhhhhhccccccchhhhhccccCccccccCCCcccc--hhhcccccccchhhccCcchhhhhhcc
Confidence 999999976321100 0111 111 22222221222222110
Q ss_pred -cCCCCCccCCCCCCCCcccCChHHHHHHHHHhcCCC----------------------CceeccCcCcc---cccccch
Q psy11069 322 -KHLNMRMFPCPLVTCGKSFLRSEHLQEHMLQHNENK----------------------PKFTCPYQDCL---MSYVAKS 375 (391)
Q Consensus 322 -~H~~~~~~~C~~~~C~~~f~~~~~L~~H~~~h~~~~----------------------~~~~C~~~~C~---~~f~~~~ 375 (391)
.-...-|-.|-+ |++.|.+...-..||..+|+-- .-|.|-. |+ +.|.+..
T Consensus 159 ~~e~e~~Pt~CLf--C~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~--CN~~~~~f~sle 234 (390)
T KOG2785|consen 159 DDEDELIPTDCLF--CDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLF--CNELGRPFSSLE 234 (390)
T ss_pred chhcccCCcceee--cCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEE--eccccCcccccH
Confidence 001123466887 9999999999999999988621 1488998 99 9999999
Q ss_pred hHHHHHHhh
Q psy11069 376 SLYAHLKHV 384 (391)
Q Consensus 376 ~L~~H~~~~ 384 (391)
+.+.||...
T Consensus 235 avr~HM~~K 243 (390)
T KOG2785|consen 235 AVRAHMRDK 243 (390)
T ss_pred HHHHHHhhc
Confidence 999998754
No 70
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=86.47 E-value=0.35 Score=30.61 Aligned_cols=28 Identities=29% Similarity=0.468 Sum_probs=18.7
Q ss_pred CCCccccCcccccCCChhHHHHHHHHhc
Q psy11069 265 NPEDLVCKGCGKQFKVPCRLREHYKAVH 292 (391)
Q Consensus 265 ~~~~~~C~~C~~~f~~~~~l~~H~~~~H 292 (391)
++.-+.|+-||..|.....+.+|....|
T Consensus 14 GE~~lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 14 GEEFLRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred CceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence 4455677777777777777777765444
No 71
>KOG2785|consensus
Probab=85.67 E-value=1.7 Score=39.73 Aligned_cols=139 Identities=19% Similarity=0.273 Sum_probs=93.3
Q ss_pred ccccccccccccccCchHHHHHHhHhc------------------------------------CCCCCeeccCCCCCccc
Q psy11069 206 TLCCEELGCGRKFQTMKQYSTHLKEHS------------------------------------NVSAPYMCDYKGCGKSF 249 (391)
Q Consensus 206 ~~~C~~~~C~~~f~~~~~l~~H~~~h~------------------------------------~~~~~~~C~~~~C~~~f 249 (391)
.|.|.. |...|.+...-+.|.++.. ....++.|.+ |.+.|
T Consensus 3 ~ftC~t--C~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~--c~k~~ 78 (390)
T KOG2785|consen 3 GFTCNT--CNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEA--CNKSF 78 (390)
T ss_pred cceeec--eeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHH--hhccc
Confidence 478999 9999999999999987521 0234689997 99999
Q ss_pred cchhhHhcccccc--CCC--------------CCccc-------------cCcccccCCChhHHHHHHHH-----h----
Q psy11069 250 YLMASLKSHQRVH--VTN--------------PEDLV-------------CKGCGKQFKVPCRLREHYKA-----V---- 291 (391)
Q Consensus 250 ~~~~~l~~H~~~h--~~~--------------~~~~~-------------C~~C~~~f~~~~~l~~H~~~-----~---- 291 (391)
.+......|+..- ... ...+. +..+...+........+... +
T Consensus 79 ~s~~a~~~hl~Sk~h~~~~~~~~r~~e~d~a~~~q~~~~~p~~l~~~~e~e~~~~E~~~~~d~~~e~~~dd~~Edi~~d~ 158 (390)
T KOG2785|consen 79 ASPKAHENHLKSKKHVENLSNHQRSEEGDSAKISQLPSRRPSNLQNKGESELKWYEVDSDEDSSEEEEEDDEEEDIEEDG 158 (390)
T ss_pred cChhhHHHHHHHhhcchhhhhhhccccccchhhhhccccCccccccCCCcccchhhcccccccchhhccCcchhhhhhcc
Confidence 9998888876532 100 00122 33333333332222222110 0
Q ss_pred --cCCCCccccccCCcCccccCHHHHHHhhhccCCC----------------------CCccCCCCCCCC---cccCChH
Q psy11069 292 --HESTKNFKCTHDGCKLLFSSKSALKRHNKSKHLN----------------------MRMFPCPLVTCG---KSFLRSE 344 (391)
Q Consensus 292 --H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~----------------------~~~~~C~~~~C~---~~f~~~~ 344 (391)
-...-|-.|-. |+..|.+...-..||...|+- ...|.|-+ |+ +.|.+..
T Consensus 159 ~~e~e~~Pt~CLf--C~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~--CN~~~~~f~sle 234 (390)
T KOG2785|consen 159 DDEDELIPTDCLF--CDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLF--CNELGRPFSSLE 234 (390)
T ss_pred chhcccCCcceee--cCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEE--eccccCcccccH
Confidence 01123567888 999999999999999976652 23488988 99 9999999
Q ss_pred HHHHHHHH
Q psy11069 345 HLQEHMLQ 352 (391)
Q Consensus 345 ~L~~H~~~ 352 (391)
+.+.||+.
T Consensus 235 avr~HM~~ 242 (390)
T KOG2785|consen 235 AVRAHMRD 242 (390)
T ss_pred HHHHHHhh
Confidence 99999975
No 72
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=85.23 E-value=1.3 Score=33.32 Aligned_cols=85 Identities=14% Similarity=0.221 Sum_probs=48.2
Q ss_pred CCccccCcccccCCChhHHHHHHHHhcCCCCccccccCCcCccccCHHHHHH---hhhccCC---------CCCccCC--
Q psy11069 266 PEDLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTHDGCKLLFSSKSALKR---HNKSKHL---------NMRMFPC-- 331 (391)
Q Consensus 266 ~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~---H~~~~H~---------~~~~~~C-- 331 (391)
-+...|..|+..... +.+..|++..|...+...-.. =...+.....|.. .+..-.. ....|.|
T Consensus 9 ~~vlIC~~C~~av~~-~~v~~HL~~~H~~~~~~~~~~--i~~~~~~~~~l~~~~~~~~~p~~~~~Pi~gLp~~~G~~C~~ 85 (109)
T PF12013_consen 9 YRVLICRQCQYAVQP-SEVESHLRKRHHILKSQERQR--IVEAIRQWPDLLPDPDDLQIPPDPSPPIPGLPVYDGYRCQC 85 (109)
T ss_pred CCEEEeCCCCcccCc-hHHHHHHHHhcccccHHHHHH--HHHHHHhhhhcccCccccCCCCCCCCcCCCCCCCCCeeeec
Confidence 356789999988766 899999986664332111100 0000000000000 0000000 0134889
Q ss_pred --CCCCCCcccCChHHHHHHHHHhcC
Q psy11069 332 --PLVTCGKSFLRSEHLQEHMLQHNE 355 (391)
Q Consensus 332 --~~~~C~~~f~~~~~L~~H~~~h~~ 355 (391)
.. |++.+.+...+.+|++.+||
T Consensus 86 ~~~~--C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 86 DPPH--CGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CCCC--CCcEeccHHHHHHHHHHhcC
Confidence 75 99999999999999998875
No 73
>KOG2893|consensus
Probab=84.91 E-value=0.24 Score=41.75 Aligned_cols=42 Identities=33% Similarity=0.691 Sum_probs=29.2
Q ss_pred cccCCcCccccCHHHHHHhhhccCCCCCccCCCCCCCCcccCChHHHHHHH
Q psy11069 300 CTHDGCKLLFSSKSALKRHNKSKHLNMRMFPCPLVTCGKSFLRSEHLQEHM 350 (391)
Q Consensus 300 C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~L~~H~ 350 (391)
|.+ |.+.|....-|.+|.+..| |+|.+ |.+..-+...|..|-
T Consensus 13 cwy--cnrefddekiliqhqkakh-----fkchi--chkkl~sgpglsihc 54 (341)
T KOG2893|consen 13 CWY--CNREFDDEKILIQHQKAKH-----FKCHI--CHKKLFSGPGLSIHC 54 (341)
T ss_pred eee--cccccchhhhhhhhhhhcc-----ceeee--ehhhhccCCCceeeh
Confidence 666 7777777777777766533 77777 777766666666663
No 74
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=83.27 E-value=0.59 Score=29.62 Aligned_cols=30 Identities=23% Similarity=0.503 Sum_probs=17.4
Q ss_pred CCCCccCCCCCCCCcccCChHHHHHHHHHhcC
Q psy11069 324 LNMRMFPCPLVTCGKSFLRSEHLQEHMLQHNE 355 (391)
Q Consensus 324 ~~~~~~~C~~~~C~~~f~~~~~L~~H~~~h~~ 355 (391)
.++--+.|+- |+..|....++.+|+..-|+
T Consensus 13 DGE~~lrCPR--C~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 13 DGEEFLRCPR--CGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CCceeeeCCc--hhHHHHHhHHHHHHhhHHhh
Confidence 3444566664 66666666666666654444
No 75
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=78.47 E-value=1.2 Score=25.31 Aligned_cols=9 Identities=44% Similarity=0.929 Sum_probs=3.8
Q ss_pred cccCccccc
Q psy11069 269 LVCKGCGKQ 277 (391)
Q Consensus 269 ~~C~~C~~~ 277 (391)
|+|..||..
T Consensus 2 ~~C~~CGy~ 10 (33)
T cd00350 2 YVCPVCGYI 10 (33)
T ss_pred EECCCCCCE
Confidence 344444433
No 76
>KOG4377|consensus
Probab=77.15 E-value=1.5 Score=40.37 Aligned_cols=28 Identities=18% Similarity=0.383 Sum_probs=24.3
Q ss_pred ceeccCcCcccccccchhHHHHHHhhcC
Q psy11069 359 KFTCPYQDCLMSYVAKSSLYAHLKHVHS 386 (391)
Q Consensus 359 ~~~C~~~~C~~~f~~~~~L~~H~~~~H~ 386 (391)
.|.|...||+.++.+.+.+..|.|.|-.
T Consensus 401 hfhc~r~Gc~~tl~s~sqm~shkrkheR 428 (480)
T KOG4377|consen 401 HFHCDRLGCEATLYSVSQMASHKRKHER 428 (480)
T ss_pred eeeecccCCceEEEehhhhhhhhhhhhh
Confidence 4778888999999999999999887754
No 77
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=75.32 E-value=2 Score=26.27 Aligned_cols=25 Identities=40% Similarity=0.566 Sum_probs=15.8
Q ss_pred ceeccCcCccccccc----chhHHHHHHhhc
Q psy11069 359 KFTCPYQDCLMSYVA----KSSLYAHLKHVH 385 (391)
Q Consensus 359 ~~~C~~~~C~~~f~~----~~~L~~H~~~~H 385 (391)
..+|.+ |++.+.. .+.|.+|++..|
T Consensus 16 ~a~C~~--C~~~~~~~~~~ts~l~~HL~~~h 44 (45)
T PF02892_consen 16 KAKCKY--CGKVIKYSSGGTSNLKRHLKKKH 44 (45)
T ss_dssp -EEETT--TTEE-----SSTHHHHHHHHHTT
T ss_pred eEEeCC--CCeEEeeCCCcHHHHHHhhhhhC
Confidence 477888 8877765 378888886655
No 78
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=66.85 E-value=5.5 Score=25.05 Aligned_cols=26 Identities=31% Similarity=0.593 Sum_probs=16.9
Q ss_pred ceeccCcCcccccccc-----hhHHHHHHhhcC
Q psy11069 359 KFTCPYQDCLMSYVAK-----SSLYAHLKHVHS 386 (391)
Q Consensus 359 ~~~C~~~~C~~~f~~~-----~~L~~H~~~~H~ 386 (391)
.-.|.+ |++.+... ++|.+|++..|+
T Consensus 18 ~a~C~~--C~~~l~~~~~~gTs~L~rHl~~~h~ 48 (50)
T smart00614 18 RAKCKY--CGKKLSRSSKGGTSNLRRHLRRKHP 48 (50)
T ss_pred EEEecC--CCCEeeeCCCCCcHHHHHHHHhHCc
Confidence 356777 77776554 577777775444
No 79
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=65.28 E-value=3.3 Score=35.48 Aligned_cols=44 Identities=25% Similarity=0.376 Sum_probs=24.6
Q ss_pred CCeeccCCCCCccccchhhHhccccccC---------CCCCc-----cccCcccccCCCh
Q psy11069 236 APYMCDYKGCGKSFYLMASLKSHQRVHV---------TNPED-----LVCKGCGKQFKVP 281 (391)
Q Consensus 236 ~~~~C~~~~C~~~f~~~~~l~~H~~~h~---------~~~~~-----~~C~~C~~~f~~~ 281 (391)
+.+.|++ |++.|.+..-+....+.-. .+..| ..|+.||.+|...
T Consensus 4 k~~~CPv--C~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 4 KKITCPV--CGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CceECCC--CCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 4567776 7777777655544443310 11112 4588888776543
No 80
>PF12907 zf-met2: Zinc-binding
Probab=63.69 E-value=4.2 Score=24.26 Aligned_cols=26 Identities=31% Similarity=0.609 Sum_probs=11.7
Q ss_pred ccCcccccC---CChhHHHHHHHHhcCCC
Q psy11069 270 VCKGCGKQF---KVPCRLREHYKAVHEST 295 (391)
Q Consensus 270 ~C~~C~~~f---~~~~~l~~H~~~~H~~~ 295 (391)
.|.+|..+| .+...|..|....|...
T Consensus 3 ~C~iC~qtF~~t~~~~~L~eH~enKHpK~ 31 (40)
T PF12907_consen 3 ICKICRQTFMQTTNEPQLKEHAENKHPKN 31 (40)
T ss_pred CcHHhhHHHHhcCCHHHHHHHHHccCCCC
Confidence 455555322 33344555554445443
No 81
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=59.49 E-value=3.9 Score=34.81 Aligned_cols=27 Identities=22% Similarity=0.506 Sum_probs=18.6
Q ss_pred ceeccCcCcccccccchhHHHHHHhhcCC
Q psy11069 359 KFTCPYQDCLMSYVAKSSLYAHLKHVHST 387 (391)
Q Consensus 359 ~~~C~~~~C~~~f~~~~~L~~H~~~~H~~ 387 (391)
.|.|+. |+|.|.-.....+|+...|++
T Consensus 77 K~~C~l--c~KlFkg~eFV~KHI~nKH~e 103 (214)
T PF04959_consen 77 KWRCPL--CGKLFKGPEFVRKHIFNKHPE 103 (214)
T ss_dssp EEEE-S--SS-EESSHHHHHHHHHHH-HH
T ss_pred EECCCC--CCcccCChHHHHHHHhhcCHH
Confidence 478877 888888888888888887754
No 82
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=56.06 E-value=2.5 Score=36.18 Aligned_cols=44 Identities=23% Similarity=0.591 Sum_probs=24.3
Q ss_pred CccCCCCCCCCcccCChHHHHHHHHH---hc-------CCCC----ceeccCcCcccccccc
Q psy11069 327 RMFPCPLVTCGKSFLRSEHLQEHMLQ---HN-------ENKP----KFTCPYQDCLMSYVAK 374 (391)
Q Consensus 327 ~~~~C~~~~C~~~f~~~~~L~~H~~~---h~-------~~~~----~~~C~~~~C~~~f~~~ 374 (391)
+.+.|++ |+..|.+..-+....+. .+ +..| ...||. ||++|...
T Consensus 4 k~~~CPv--C~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~--CgyA~~~~ 61 (214)
T PF09986_consen 4 KKITCPV--CGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPH--CGYAAFEE 61 (214)
T ss_pred CceECCC--CCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCC--CCCccccc
Confidence 3456666 66666665444444332 11 1121 356998 99888754
No 83
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=55.94 E-value=7.5 Score=33.13 Aligned_cols=29 Identities=34% Similarity=0.574 Sum_probs=22.8
Q ss_pred CCCccccCcccccCCChhHHHHHHHHhcC
Q psy11069 265 NPEDLVCKGCGKQFKVPCRLREHYKAVHE 293 (391)
Q Consensus 265 ~~~~~~C~~C~~~f~~~~~l~~H~~~~H~ 293 (391)
.+..|.|+.|+|.|.-..-.++|+...|.
T Consensus 74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 74 DEDKWRCPLCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp SSEEEEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred cCCEECCCCCCcccCChHHHHHHHhhcCH
Confidence 34569999999999999999999998885
No 84
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=55.84 E-value=16 Score=27.55 Aligned_cols=20 Identities=20% Similarity=0.458 Sum_probs=12.7
Q ss_pred ccccCcccccCCChhHHHHH
Q psy11069 268 DLVCKGCGKQFKVPCRLREH 287 (391)
Q Consensus 268 ~~~C~~C~~~f~~~~~l~~H 287 (391)
|..|+.||.+..+...|.+-
T Consensus 15 P~~CpiCgLtLVss~HLARS 34 (112)
T TIGR00622 15 PVECPICGLTLILSTHLARS 34 (112)
T ss_pred CCcCCcCCCEEeccchHHHh
Confidence 55666666666666666654
No 85
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=55.78 E-value=7.7 Score=29.10 Aligned_cols=15 Identities=33% Similarity=0.933 Sum_probs=9.4
Q ss_pred CccccCcccccCCCh
Q psy11069 267 EDLVCKGCGKQFKVP 281 (391)
Q Consensus 267 ~~~~C~~C~~~f~~~ 281 (391)
.|..|+.||..|.-.
T Consensus 25 ~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 25 DPIVCPKCGTEFPPE 39 (108)
T ss_pred CCccCCCCCCccCcc
Confidence 466677777666544
No 86
>PF15269 zf-C2H2_7: Zinc-finger
Probab=55.08 E-value=10 Score=22.89 Aligned_cols=26 Identities=23% Similarity=0.167 Sum_probs=19.0
Q ss_pred CcccccccccccccCCHHHHHHHHHH
Q psy11069 175 KCYPCDQKNCNKRFTTLSNLKMHMVR 200 (391)
Q Consensus 175 ~~~~c~c~~C~~~f~~~~~l~~H~~~ 200 (391)
++|...|-.|......++.|..||+.
T Consensus 17 kp~~ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 17 KPFKYKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred CCccceeecCCcccchHHHHHHHHHH
Confidence 55666666788887778888888764
No 87
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=52.93 E-value=10 Score=20.13 Aligned_cols=20 Identities=25% Similarity=0.705 Sum_probs=13.4
Q ss_pred eeccCcCcccccccchhHHHHHH
Q psy11069 360 FTCPYQDCLMSYVAKSSLYAHLK 382 (391)
Q Consensus 360 ~~C~~~~C~~~f~~~~~L~~H~~ 382 (391)
..||+ |++.+ ....+..|+.
T Consensus 2 v~CPi--C~~~v-~~~~in~HLD 21 (26)
T smart00734 2 VQCPV--CFREV-PENLINSHLD 21 (26)
T ss_pred CcCCC--CcCcc-cHHHHHHHHH
Confidence 35777 77777 5566777764
No 88
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=51.95 E-value=11 Score=22.07 Aligned_cols=9 Identities=33% Similarity=0.992 Sum_probs=3.6
Q ss_pred cccCccccc
Q psy11069 269 LVCKGCGKQ 277 (391)
Q Consensus 269 ~~C~~C~~~ 277 (391)
.+|+.|+..
T Consensus 26 vrC~~C~~~ 34 (37)
T PF13719_consen 26 VRCPKCGHV 34 (37)
T ss_pred EECCCCCcE
Confidence 334444433
No 89
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=51.93 E-value=9.1 Score=31.11 Aligned_cols=23 Identities=35% Similarity=0.670 Sum_probs=13.8
Q ss_pred ccccCcccccCCChhHHHHHHHHhcCCCCccccccCCcC
Q psy11069 268 DLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTHDGCK 306 (391)
Q Consensus 268 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~ 306 (391)
.|.|++||. +|.++.|-+||. ||
T Consensus 134 ~~vC~vCGy--------------~~~ge~P~~CPi--Cg 156 (166)
T COG1592 134 VWVCPVCGY--------------THEGEAPEVCPI--CG 156 (166)
T ss_pred EEEcCCCCC--------------cccCCCCCcCCC--CC
Confidence 566666663 344466666666 66
No 90
>KOG4124|consensus
Probab=51.44 E-value=4 Score=36.77 Aligned_cols=27 Identities=30% Similarity=0.583 Sum_probs=21.3
Q ss_pred CCCeeccCCCCCccccchhhHhccccc
Q psy11069 235 SAPYMCDYKGCGKSFYLMASLKSHQRV 261 (391)
Q Consensus 235 ~~~~~C~~~~C~~~f~~~~~l~~H~~~ 261 (391)
.++|.|.++.|++.+.....|..|...
T Consensus 347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~ 373 (442)
T KOG4124|consen 347 DKPYKCPVPNCDKAYKNQNGLKYHKLH 373 (442)
T ss_pred cCCCCCCCCcchhhcccCcceeecccc
Confidence 567889888899888888887777543
No 91
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=49.68 E-value=38 Score=25.52 Aligned_cols=90 Identities=17% Similarity=0.399 Sum_probs=57.6
Q ss_pred cccCcccccCCChhHHHHHHHHhcCCCCccccccCCcCccccCHHHHHHhhhccCCC-CC------------ccCCCCCC
Q psy11069 269 LVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTHDGCKLLFSSKSALKRHNKSKHLN-MR------------MFPCPLVT 335 (391)
Q Consensus 269 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~-~~------------~~~C~~~~ 335 (391)
|.|+.|+-.. -+-|-.|+. |+....+...|.+-. +|.- -+ .-.|--
T Consensus 2 Y~CPrC~skv---------------C~LP~~Cpi--CgLtLVss~HLARSy--HHLfPl~~f~ev~~~~~~~~~~C~~-- 60 (112)
T TIGR00622 2 YFCPQCRAKV---------------CELPVECPI--CGLTLILSTHLARSY--HHLFPLKAFQEIPLEEYNGSRFCFG-- 60 (112)
T ss_pred ccCCCCCCCc---------------cCCCCcCCc--CCCEEeccchHHHhh--hccCCCcccccccccccCCCCcccC--
Confidence 6777776432 245789988 999999988888764 3421 11 123875
Q ss_pred CCcccCChHHHHHHHHHhcCCCCceeccCcCcccccccchhHHHHHHhh
Q psy11069 336 CGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMSYVAKSSLYAHLKHV 384 (391)
Q Consensus 336 C~~~f~~~~~L~~H~~~h~~~~~~~~C~~~~C~~~f~~~~~L~~H~~~~ 384 (391)
|...|........= .-.... .|+|+. |...|--.=++-.|...|
T Consensus 61 C~~~f~~~~~~~~~--~~~~~~-~y~C~~--C~~~FC~dCD~fiHe~Lh 104 (112)
T TIGR00622 61 CQGPFPKPPVSPFD--ELKDSH-RYVCAV--CKNVFCVDCDVFVHESLH 104 (112)
T ss_pred cCCCCCCccccccc--cccccc-ceeCCC--CCCccccccchhhhhhcc
Confidence 99999875421100 011122 599999 999998777777776554
No 92
>PF07800 DUF1644: Protein of unknown function (DUF1644); InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain.
Probab=49.60 E-value=21 Score=28.60 Aligned_cols=59 Identities=22% Similarity=0.405 Sum_probs=36.4
Q ss_pred ccccCcccccCCChhHHHHHHHHhcCCCCccccccCCcCccccCHHHHHHhhhccCCCCCcc
Q psy11069 268 DLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTHDGCKLLFSSKSALKRHNKSKHLNMRMF 329 (391)
Q Consensus 268 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~ 329 (391)
...|+.|.-.-.-..-.. -.+ .+...|+-.|..+.|.+. .+...|.+|.+..|...+|-
T Consensus 80 ~L~CPLCRG~V~GWtvve-~AR-~~LN~K~RsC~~e~C~F~-GtY~eLrKHar~~HP~~rP~ 138 (162)
T PF07800_consen 80 ELACPLCRGEVKGWTVVE-PAR-RFLNAKKRSCSQESCSFS-GTYSELRKHARSEHPSARPS 138 (162)
T ss_pred cccCccccCceeceEEch-HHH-HHhccCCccCcccccccc-cCHHHHHHHHHhhCCCCCCc
Confidence 467888853322211111 123 455566778887778765 56677888888888877663
No 93
>KOG2186|consensus
Probab=48.11 E-value=9.5 Score=32.91 Aligned_cols=49 Identities=16% Similarity=0.310 Sum_probs=26.2
Q ss_pred eecCCCCCCCcccChHHHHHHHHhcCCCCcccCccCCCCCCccccccCCHHHHHHHHhhcC
Q psy11069 111 WSCPIVNCNFTSVNLNTFKIHLLKHFDKKPFKCKLTTANNTPCMWGFFSKHKLLRHMTSHD 171 (391)
Q Consensus 111 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~~~~~~C~~~f~~~~~l~~H~~~h~ 171 (391)
|.|.+ ||.....+ .+.+|+-..++ .-|.|-- |++.|.. ..+..|..--+
T Consensus 4 FtCnv--CgEsvKKp-~vekH~srCrn-~~fSCID-------C~k~F~~-~sYknH~kCIT 52 (276)
T KOG2186|consen 4 FTCNV--CGESVKKP-QVEKHMSRCRN-AYFSCID-------CGKTFER-VSYKNHTKCIT 52 (276)
T ss_pred Eehhh--hhhhcccc-chHHHHHhccC-CeeEEee-------ccccccc-chhhhhhhhcc
Confidence 56664 66655442 44556655444 4455544 6666665 45555554333
No 94
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=46.94 E-value=14 Score=27.71 Aligned_cols=32 Identities=28% Similarity=0.624 Sum_probs=23.1
Q ss_pred CceecCCCCCCCcccChHHHHHHHHhcCCCCcccCccCCCCCCccccccCCH
Q psy11069 109 QLWSCPIVNCNFTSVNLNTFKIHLLKHFDKKPFKCKLTTANNTPCMWGFFSK 160 (391)
Q Consensus 109 ~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~~~~~~C~~~f~~~ 160 (391)
.+..|+ .||.+|... +..|..|+. ||..|.-.
T Consensus 8 tKR~Cp--~CG~kFYDL-----------nk~PivCP~-------CG~~~~~~ 39 (108)
T PF09538_consen 8 TKRTCP--SCGAKFYDL-----------NKDPIVCPK-------CGTEFPPE 39 (108)
T ss_pred CcccCC--CCcchhccC-----------CCCCccCCC-------CCCccCcc
Confidence 456788 588888664 457788888 88877655
No 95
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=46.71 E-value=15 Score=21.25 Aligned_cols=10 Identities=40% Similarity=1.149 Sum_probs=4.6
Q ss_pred cccCcccccC
Q psy11069 269 LVCKGCGKQF 278 (391)
Q Consensus 269 ~~C~~C~~~f 278 (391)
.+|+.|+..|
T Consensus 26 v~C~~C~~~f 35 (36)
T PF13717_consen 26 VRCSKCGHVF 35 (36)
T ss_pred EECCCCCCEe
Confidence 4444444443
No 96
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=44.80 E-value=7.9 Score=26.58 Aligned_cols=16 Identities=13% Similarity=0.528 Sum_probs=8.3
Q ss_pred ceeccCcCcccccccc
Q psy11069 359 KFTCPYQDCLMSYVAK 374 (391)
Q Consensus 359 ~~~C~~~~C~~~f~~~ 374 (391)
.++|...+||.+|...
T Consensus 27 Y~qC~N~eCg~tF~t~ 42 (72)
T PRK09678 27 YHQCQNVNCSATFITY 42 (72)
T ss_pred eeecCCCCCCCEEEEE
Confidence 3566523366666543
No 97
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=43.28 E-value=17 Score=23.67 Aligned_cols=10 Identities=30% Similarity=0.992 Sum_probs=5.4
Q ss_pred ccccCccccc
Q psy11069 268 DLVCKGCGKQ 277 (391)
Q Consensus 268 ~~~C~~C~~~ 277 (391)
.|.|+.||..
T Consensus 27 ~F~CPnCGe~ 36 (61)
T COG2888 27 KFPCPNCGEV 36 (61)
T ss_pred EeeCCCCCce
Confidence 3556666643
No 98
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=40.59 E-value=26 Score=20.01 Aligned_cols=22 Identities=32% Similarity=1.067 Sum_probs=14.7
Q ss_pred ceecCCCCCCCcccChHHHHHHHHhcCCCCcccCcc
Q psy11069 110 LWSCPIVNCNFTSVNLNTFKIHLLKHFDKKPFKCKL 145 (391)
Q Consensus 110 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~ 145 (391)
.|.|.+ ||..+... +.|..|++
T Consensus 2 ~~~C~~--CG~i~~g~------------~~p~~CP~ 23 (34)
T cd00729 2 VWVCPV--CGYIHEGE------------EAPEKCPI 23 (34)
T ss_pred eEECCC--CCCEeECC------------cCCCcCcC
Confidence 478885 88776542 25668888
No 99
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=40.01 E-value=18 Score=28.28 Aligned_cols=24 Identities=25% Similarity=0.437 Sum_probs=13.7
Q ss_pred ceeccCcCcccccccchhHHHHHHhhcCC
Q psy11069 359 KFTCPYQDCLMSYVAKSSLYAHLKHVHST 387 (391)
Q Consensus 359 ~~~C~~~~C~~~f~~~~~L~~H~~~~H~~ 387 (391)
...|-. ||+.|.. |.+|++.|||-
T Consensus 72 ~i~cle--cGk~~k~---LkrHL~~~~gl 95 (132)
T PF05443_consen 72 YIICLE--CGKKFKT---LKRHLRTHHGL 95 (132)
T ss_dssp -EE-TB--T--EESB---HHHHHHHTT-S
T ss_pred eeEEcc--CCcccch---HHHHHHHccCC
Confidence 367876 8888754 57888888763
No 100
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=39.83 E-value=13 Score=29.90 Aligned_cols=13 Identities=38% Similarity=0.818 Sum_probs=7.1
Q ss_pred cccCcccccCCCh
Q psy11069 269 LVCKGCGKQFKVP 281 (391)
Q Consensus 269 ~~C~~C~~~f~~~ 281 (391)
+.|+.||++|.+.
T Consensus 29 ~~c~~c~~~f~~~ 41 (154)
T PRK00464 29 RECLACGKRFTTF 41 (154)
T ss_pred eeccccCCcceEe
Confidence 5555555555543
No 101
>PF04606 Ogr_Delta: Ogr/Delta-like zinc finger; InterPro: IPR007684 This entry is represented by Bacteriophage P2, Ogr. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a viral family of phage zinc-binding transcriptional activators, which also contains cryptic members in some bacterial genomes []. The P4 phage delta protein contains two such domains attached covalently, while the P2 phage Ogr proteins possess one domain but function as dimers. All the members of this family have the following consensus sequence: C-X(2)-C-X(3)-A-(X)2-R-X(15)-C-X(4)-C-X(3)-F [].; GO: 0006355 regulation of transcription, DNA-dependent
Probab=38.33 E-value=7.9 Score=24.01 Aligned_cols=13 Identities=31% Similarity=0.708 Sum_probs=6.0
Q ss_pred cccccCCcCcccc
Q psy11069 298 FKCTHDGCKLLFS 310 (391)
Q Consensus 298 ~~C~~~~C~~~f~ 310 (391)
|+|....||..|.
T Consensus 26 ~qC~N~~Cg~tfv 38 (47)
T PF04606_consen 26 CQCTNPECGHTFV 38 (47)
T ss_pred EEECCCcCCCEEE
Confidence 4454444554443
No 102
>KOG2186|consensus
Probab=36.56 E-value=23 Score=30.70 Aligned_cols=48 Identities=25% Similarity=0.520 Sum_probs=37.6
Q ss_pred ccCccCCCCCCccccccCCHHHHHHHHhhcCCCCCcccccccccccccCCHHHHHHHHHHc
Q psy11069 141 FKCKLTTANNTPCMWGFFSKHKLLRHMTSHDEDKKCYPCDQKNCNKRFTTLSNLKMHMVRH 201 (391)
Q Consensus 141 ~~C~~~~~~~~~C~~~f~~~~~l~~H~~~h~~~~~~~~c~c~~C~~~f~~~~~l~~H~~~h 201 (391)
|.|.. ||.... +..+.+|+-..++. .|.|- .|++.|.. ..+..|..--
T Consensus 4 FtCnv-------CgEsvK-Kp~vekH~srCrn~--~fSCI--DC~k~F~~-~sYknH~kCI 51 (276)
T KOG2186|consen 4 FTCNV-------CGESVK-KPQVEKHMSRCRNA--YFSCI--DCGKTFER-VSYKNHTKCI 51 (276)
T ss_pred Eehhh-------hhhhcc-ccchHHHHHhccCC--eeEEe--eccccccc-chhhhhhhhc
Confidence 67877 888876 55688899888874 37777 99999998 7788887643
No 103
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=32.01 E-value=33 Score=27.94 Aligned_cols=23 Identities=30% Similarity=0.743 Sum_probs=15.6
Q ss_pred ccccccCCcCccccCHHHHHHhhhccCCCCCccCCCCCCCC
Q psy11069 297 NFKCTHDGCKLLFSSKSALKRHNKSKHLNMRMFPCPLVTCG 337 (391)
Q Consensus 297 ~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~ 337 (391)
.|.|++ ||.. |.++.|-+||+ ||
T Consensus 134 ~~vC~v--CGy~--------------~~ge~P~~CPi--Cg 156 (166)
T COG1592 134 VWVCPV--CGYT--------------HEGEAPEVCPI--CG 156 (166)
T ss_pred EEEcCC--CCCc--------------ccCCCCCcCCC--CC
Confidence 577776 7753 45577777777 76
No 104
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=31.16 E-value=25 Score=22.95 Aligned_cols=10 Identities=30% Similarity=1.112 Sum_probs=5.5
Q ss_pred ccccCccccc
Q psy11069 268 DLVCKGCGKQ 277 (391)
Q Consensus 268 ~~~C~~C~~~ 277 (391)
.|.|+.||..
T Consensus 25 ~F~CPnCG~~ 34 (59)
T PRK14890 25 KFLCPNCGEV 34 (59)
T ss_pred EeeCCCCCCe
Confidence 3556666554
No 105
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=30.78 E-value=13 Score=23.39 Aligned_cols=7 Identities=43% Similarity=1.284 Sum_probs=3.4
Q ss_pred cccCccc
Q psy11069 269 LVCKGCG 275 (391)
Q Consensus 269 ~~C~~C~ 275 (391)
-.|+.||
T Consensus 27 ~~CP~Cg 33 (52)
T TIGR02605 27 ATCPECG 33 (52)
T ss_pred CCCCCCC
Confidence 3455555
No 106
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=30.66 E-value=38 Score=26.06 Aligned_cols=34 Identities=18% Similarity=0.228 Sum_probs=23.5
Q ss_pred CceecCCCCCCCcccChHHHHHHHHhcCCCCcccCccCCCCCCccccccCCHHH
Q psy11069 109 QLWSCPIVNCNFTSVNLNTFKIHLLKHFDKKPFKCKLTTANNTPCMWGFFSKHK 162 (391)
Q Consensus 109 ~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~~~~~~C~~~f~~~~~ 162 (391)
.+..|+ .|+.+|... +..|..|+. ||..|.....
T Consensus 8 tKr~Cp--~cg~kFYDL-----------nk~p~vcP~-------cg~~~~~~~~ 41 (129)
T TIGR02300 8 TKRICP--NTGSKFYDL-----------NRRPAVSPY-------TGEQFPPEEA 41 (129)
T ss_pred ccccCC--CcCcccccc-----------CCCCccCCC-------cCCccCcchh
Confidence 456788 588888654 457788888 8877765533
No 107
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=30.48 E-value=37 Score=27.10 Aligned_cols=17 Identities=24% Similarity=0.624 Sum_probs=9.6
Q ss_pred CccCCCCCCCCcccCChHH
Q psy11069 327 RMFPCPLVTCGKSFLRSEH 345 (391)
Q Consensus 327 ~~~~C~~~~C~~~f~~~~~ 345 (391)
..|.|+. |+..|.....
T Consensus 98 ~~Y~Cp~--C~~~y~~~ea 114 (147)
T smart00531 98 AYYKCPN--CQSKYTFLEA 114 (147)
T ss_pred cEEECcC--CCCEeeHHHH
Confidence 3466665 6666664443
No 108
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=30.22 E-value=42 Score=20.52 Aligned_cols=30 Identities=13% Similarity=0.396 Sum_probs=18.3
Q ss_pred ceecCCCCCCCcccChHHHHHHHHhcCCCCcccCccCCCCCCccccccC
Q psy11069 110 LWSCPIVNCNFTSVNLNTFKIHLLKHFDKKPFKCKLTTANNTPCMWGFF 158 (391)
Q Consensus 110 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~~~~~~C~~~f~ 158 (391)
.|.|+ .|+..|.... ....+.|+. ||..+.
T Consensus 3 ~y~C~--~CG~~~~~~~----------~~~~~~Cp~-------CG~~~~ 32 (46)
T PRK00398 3 EYKCA--RCGREVELDE----------YGTGVRCPY-------CGYRIL 32 (46)
T ss_pred EEECC--CCCCEEEECC----------CCCceECCC-------CCCeEE
Confidence 57888 5887765431 112578888 766543
No 109
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=29.79 E-value=37 Score=26.32 Aligned_cols=22 Identities=23% Similarity=0.167 Sum_probs=14.9
Q ss_pred eeccCcCcccccccchhHHHHHHhhcC
Q psy11069 360 FTCPYQDCLMSYVAKSSLYAHLKHVHS 386 (391)
Q Consensus 360 ~~C~~~~C~~~f~~~~~L~~H~~~~H~ 386 (391)
..|-. +|+.|. +|.+|+.+|++
T Consensus 77 IicLE--DGkkfK---SLKRHL~t~~g 98 (148)
T COG4957 77 IICLE--DGKKFK---SLKRHLTTHYG 98 (148)
T ss_pred EEEec--cCcchH---HHHHHHhcccC
Confidence 55755 777774 46777777765
No 110
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=29.37 E-value=32 Score=26.87 Aligned_cols=25 Identities=28% Similarity=0.542 Sum_probs=15.3
Q ss_pred ccCCCCCCCCcccCChHHHHHHHHHhcCCC
Q psy11069 328 MFPCPLVTCGKSFLRSEHLQEHMLQHNENK 357 (391)
Q Consensus 328 ~~~C~~~~C~~~f~~~~~L~~H~~~h~~~~ 357 (391)
...|-. ||+.|... ++|++.|+|-.
T Consensus 72 ~i~cle--cGk~~k~L---krHL~~~~glt 96 (132)
T PF05443_consen 72 YIICLE--CGKKFKTL---KRHLRTHHGLT 96 (132)
T ss_dssp -EE-TB--T--EESBH---HHHHHHTT-S-
T ss_pred eeEEcc--CCcccchH---HHHHHHccCCC
Confidence 356875 99999765 89999998754
No 111
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=29.35 E-value=24 Score=29.20 Aligned_cols=16 Identities=31% Similarity=0.727 Sum_probs=7.3
Q ss_pred ccccccCCcCccccCHHH
Q psy11069 297 NFKCTHDGCKLLFSSKSA 314 (391)
Q Consensus 297 ~~~C~~~~C~~~f~~~~~ 314 (391)
-|.|+. |+..|+...+
T Consensus 117 ~Y~Cp~--C~~rytf~eA 132 (178)
T PRK06266 117 FFFCPN--CHIRFTFDEA 132 (178)
T ss_pred EEECCC--CCcEEeHHHH
Confidence 344543 5544444444
No 112
>KOG2295|consensus
Probab=29.14 E-value=30 Score=33.60 Aligned_cols=43 Identities=14% Similarity=0.458 Sum_probs=33.4
Q ss_pred HHHHHHHHHhc---CCCCceeccCcCcccccccchhHHHHHHhhcCC
Q psy11069 344 EHLQEHMLQHN---ENKPKFTCPYQDCLMSYVAKSSLYAHLKHVHST 387 (391)
Q Consensus 344 ~~L~~H~~~h~---~~~~~~~C~~~~C~~~f~~~~~L~~H~~~~H~~ 387 (391)
..+..|++.-+ ++. .|.|+..+|.|-|.-.....+|+...|.+
T Consensus 492 ~eve~~v~s~t~e~~kd-Ky~C~lsgc~KlF~gpEFvrKHi~~KH~d 537 (648)
T KOG2295|consen 492 DEVENFVDSNTMELDKD-KYLCPLSGCAKLFKGPEFVRKHINKKHKD 537 (648)
T ss_pred HHHHHHHHHHHHHhhcc-cccCCCcchHhhccCHHHHHHHHHHHHHH
Confidence 34556665543 233 59999999999999999999999999865
No 113
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=28.21 E-value=37 Score=24.14 Aligned_cols=12 Identities=33% Similarity=0.908 Sum_probs=7.0
Q ss_pred ccccCcccccCC
Q psy11069 268 DLVCKGCGKQFK 279 (391)
Q Consensus 268 ~~~C~~C~~~f~ 279 (391)
.+.|..||..|.
T Consensus 53 IW~C~kCg~~fA 64 (89)
T COG1997 53 IWKCRKCGAKFA 64 (89)
T ss_pred eEEcCCCCCeec
Confidence 456666666554
No 114
>PF07800 DUF1644: Protein of unknown function (DUF1644); InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain.
Probab=27.88 E-value=73 Score=25.60 Aligned_cols=36 Identities=25% Similarity=0.363 Sum_probs=19.4
Q ss_pred HHHhcCCCCceeccCcCcccccccchhHHHHHHhhcCC
Q psy11069 350 MLQHNENKPKFTCPYQDCLMSYVAKSSLYAHLKHVHST 387 (391)
Q Consensus 350 ~~~h~~~~~~~~C~~~~C~~~f~~~~~L~~H~~~~H~~ 387 (391)
.|.+.+.| +--|..+.|.+. .+...|.+|.+..|+.
T Consensus 99 AR~~LN~K-~RsC~~e~C~F~-GtY~eLrKHar~~HP~ 134 (162)
T PF07800_consen 99 ARRFLNAK-KRSCSQESCSFS-GTYSELRKHARSEHPS 134 (162)
T ss_pred HHHHhccC-CccCcccccccc-cCHHHHHHHHHhhCCC
Confidence 44444455 255555555554 3445566666666654
No 115
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=27.44 E-value=27 Score=22.65 Aligned_cols=9 Identities=22% Similarity=0.294 Sum_probs=4.1
Q ss_pred hhhHhcccc
Q psy11069 252 MASLKSHQR 260 (391)
Q Consensus 252 ~~~l~~H~~ 260 (391)
+..|..|+.
T Consensus 23 r~~l~~H~~ 31 (60)
T PF02176_consen 23 RKELDDHLE 31 (60)
T ss_dssp CCCHHHHHH
T ss_pred HHHHHHHHH
Confidence 344444444
No 116
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=27.43 E-value=52 Score=26.63 Aligned_cols=20 Identities=25% Similarity=0.403 Sum_probs=11.0
Q ss_pred CCccCCCCCCCCcccCChHHHH
Q psy11069 326 MRMFPCPLVTCGKSFLRSEHLQ 347 (391)
Q Consensus 326 ~~~~~C~~~~C~~~f~~~~~L~ 347 (391)
..-|.|+. |+..|+...++.
T Consensus 107 ~~~Y~Cp~--c~~r~tf~eA~~ 126 (158)
T TIGR00373 107 NMFFICPN--MCVRFTFNEAME 126 (158)
T ss_pred CCeEECCC--CCcEeeHHHHHH
Confidence 34455654 666666555553
No 117
>KOG4167|consensus
Probab=26.70 E-value=18 Score=36.25 Aligned_cols=26 Identities=27% Similarity=0.410 Sum_probs=23.4
Q ss_pred ceeccCcCcccccccchhHHHHHHhhcC
Q psy11069 359 KFTCPYQDCLMSYVAKSSLYAHLKHVHS 386 (391)
Q Consensus 359 ~~~C~~~~C~~~f~~~~~L~~H~~~~H~ 386 (391)
.|-|.. |+|.|....++..||++|--
T Consensus 792 iFpCre--C~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 792 IFPCRE--CGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred eeehHH--HHHHHHHHhhhhHHHHHHHH
Confidence 589988 99999999999999999853
No 118
>PF03145 Sina: Seven in absentia protein family; InterPro: IPR018121 The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results in the transformation of the R7 precursor cell to a non-neuronal cell type. The Sina protein contains an N-terminal RING finger domain C3HC4-type. Through this domain, Sina binds E2 ubiquitin-conjugating enzymes (UbcD1) Sina also interacts with Tramtrack (TTK88) via PHYL. Tramtrack is a transcriptional repressor that blocks photoreceptor determination, while PHYL down-regulates the activity of TTK88. In turn, the activity of PHYL requires the activation of the Sevenless receptor tyrosine kinase, a process essential for R7 determination. It is thought that Sina targets TTK88 for degradation, therefore promoting the R7 pathway. Murine and human homologues of Sina have also been identified. The human homologue Siah-1 [] also binds E2 enzymes (UbcH5) and through a series of physical interactions, targets beta-catenin for ubiquitin degradation. Siah-1 expression is enhanced by p53, itself promoted by DNA damage. Thus this pathway links DNA damage to beta-catenin degradation [, ]. Sina proteins, therefore, physically interact with a variety of proteins. The N-terminal RING finger domain that binds ubiquitin conjugating enzymes is a C3HC4-type, and does not form part of the alignment for this family. The remainder C-terminal part is involved in interactions with other proteins, and is included in this alignment. In addition to the Drosophila protein and mammalian homologues, whose similarity was noted previously, this family also includes putative homologues from Caenorhabditis elegans, Arabidopsis thaliana.; GO: 0006511 ubiquitin-dependent protein catabolic process, 0007275 multicellular organismal development, 0005634 nucleus; PDB: 2A25_A 1K2F_B 2AN6_C.
Probab=26.69 E-value=67 Score=27.02 Aligned_cols=13 Identities=38% Similarity=0.516 Sum_probs=6.0
Q ss_pred hhHHHHHHhhcCC
Q psy11069 375 SSLYAHLKHVHST 387 (391)
Q Consensus 375 ~~L~~H~~~~H~~ 387 (391)
..|..|+...|+.
T Consensus 61 ~~l~~Hl~~~H~~ 73 (198)
T PF03145_consen 61 KELLDHLRDKHSW 73 (198)
T ss_dssp CCHHHHHHHHTTT
T ss_pred HHHHHHHHHHCCC
Confidence 3444454444443
No 119
>KOG4167|consensus
Probab=26.22 E-value=15 Score=36.89 Aligned_cols=27 Identities=37% Similarity=0.861 Sum_probs=23.8
Q ss_pred CccCCCCCCCCcccCChHHHHHHHHHhcC
Q psy11069 327 RMFPCPLVTCGKSFLRSEHLQEHMLQHNE 355 (391)
Q Consensus 327 ~~~~C~~~~C~~~f~~~~~L~~H~~~h~~ 355 (391)
.-|.|.. |++.|....++..||++|.-
T Consensus 791 giFpCre--C~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 791 GIFPCRE--CGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred ceeehHH--HHHHHHHHhhhhHHHHHHHH
Confidence 4599997 99999999999999999853
No 120
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=26.19 E-value=39 Score=20.27 Aligned_cols=9 Identities=33% Similarity=1.021 Sum_probs=5.0
Q ss_pred ccccCcccc
Q psy11069 268 DLVCKGCGK 276 (391)
Q Consensus 268 ~~~C~~C~~ 276 (391)
...|+.||.
T Consensus 26 ~~~CP~Cg~ 34 (42)
T PF09723_consen 26 PVPCPECGS 34 (42)
T ss_pred CCcCCCCCC
Confidence 445666654
No 121
>PF01352 KRAB: KRAB box; InterPro: IPR001909 The Krueppel-associated box (KRAB) is a domain of around 75 amino acids that is found in the N-terminal part of about one third of eukaryotic Krueppel-type C2H2 zinc finger proteins (ZFPs) []. It is enriched in charged amino acids and can be divided into subregions A and B, which are predicted to fold into two amphipathic alpha-helices. The KRAB A and B boxes can be separated by variable spacer segments and many KRAB proteins contain only the A box []. The functions currently known for members of the KRAB-containing protein family include transcriptional repression of RNA polymerase I, II, and III promoters, binding and splicing of RNA, and control of nucleolus function. The KRAB domain functions as a transcriptional repressor when tethered to the template DNA by a DNA-binding domain. A sequence of 45 amino acids in the KRAB A subdomain has been shown to be necessary and sufficient for transcriptional repression. The B box does not repress by itself but does potentiate the repression exerted by the KRAB A subdomain [, ]. Gene silencing requires the binding of the KRAB domain to the RING-B box-coiled coil (RBCC) domain of the KAP-1/TIF1-beta corepressor. As KAP-1 binds to the heterochromatin proteins HP1, it has been proposed that the KRAB-ZFP-bound target gene could be silenced following recruitment to heterochromatin [, ]. KRAB-ZFPs probably constitute the single largest class of transcription factors within the human genome []. Although the function of KRAB-ZFPs is largely unknown, they appear to play important roles during cell differentiation and development. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1V65_A.
Probab=25.14 E-value=8.2 Score=23.19 Aligned_cols=15 Identities=7% Similarity=0.264 Sum_probs=9.0
Q ss_pred cccccchhhhhhhhh
Q psy11069 39 LIPRTVCGDILENMF 53 (391)
Q Consensus 39 ~~~~~~~~d~~~~~~ 53 (391)
+++++++.|++.++.
T Consensus 20 ~~Qk~ly~dvm~Eny 34 (41)
T PF01352_consen 20 PAQKNLYRDVMLENY 34 (41)
T ss_dssp HHHHHHHHHHHHHTT
T ss_pred ceecccchhHHHHhh
Confidence 456777777744333
No 122
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=24.46 E-value=93 Score=25.17 Aligned_cols=33 Identities=27% Similarity=0.621 Sum_probs=20.7
Q ss_pred CCCccccccCCcCccccCHHHHHHhhhccCCCCCccCCCCCCCCccc
Q psy11069 294 STKNFKCTHDGCKLLFSSKSALKRHNKSKHLNMRMFPCPLVTCGKSF 340 (391)
Q Consensus 294 ~~~~~~C~~~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~~C~~~f 340 (391)
...-|.|+. |+..|+...++. ..|.|+. ||...
T Consensus 106 ~~~~Y~Cp~--c~~r~tf~eA~~----------~~F~Cp~--Cg~~L 138 (158)
T TIGR00373 106 NNMFFICPN--MCVRFTFNEAME----------LNFTCPR--CGAML 138 (158)
T ss_pred CCCeEECCC--CCcEeeHHHHHH----------cCCcCCC--CCCEe
Confidence 344577765 777777766663 1477776 77653
No 123
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=24.28 E-value=45 Score=27.63 Aligned_cols=18 Identities=33% Similarity=0.659 Sum_probs=9.7
Q ss_pred CccCCCCCCCCcccCChHHH
Q psy11069 327 RMFPCPLVTCGKSFLRSEHL 346 (391)
Q Consensus 327 ~~~~C~~~~C~~~f~~~~~L 346 (391)
.-|.|+. |+..|+...++
T Consensus 116 ~~Y~Cp~--C~~rytf~eA~ 133 (178)
T PRK06266 116 MFFFCPN--CHIRFTFDEAM 133 (178)
T ss_pred CEEECCC--CCcEEeHHHHh
Confidence 3455654 66555555554
No 124
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.82 E-value=48 Score=24.58 Aligned_cols=13 Identities=23% Similarity=0.447 Sum_probs=8.9
Q ss_pred CCccccCcccccC
Q psy11069 266 PEDLVCKGCGKQF 278 (391)
Q Consensus 266 ~~~~~C~~C~~~f 278 (391)
+.|..|+.||++|
T Consensus 24 rdPiVsPytG~s~ 36 (129)
T COG4530 24 RDPIVSPYTGKSY 36 (129)
T ss_pred CCccccCcccccc
Confidence 3566777777777
No 125
>PHA00626 hypothetical protein
Probab=23.62 E-value=41 Score=21.63 Aligned_cols=13 Identities=15% Similarity=0.381 Sum_probs=10.7
Q ss_pred ceeccCcCccccccc
Q psy11069 359 KFTCPYQDCLMSYVA 373 (391)
Q Consensus 359 ~~~C~~~~C~~~f~~ 373 (391)
.|.|+. ||+.|+.
T Consensus 23 rYkCkd--CGY~ft~ 35 (59)
T PHA00626 23 DYVCCD--CGYNDSK 35 (59)
T ss_pred ceEcCC--CCCeech
Confidence 599988 9998874
No 126
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=23.44 E-value=38 Score=18.02 Aligned_cols=9 Identities=33% Similarity=1.006 Sum_probs=5.0
Q ss_pred eccCcCccccc
Q psy11069 361 TCPYQDCLMSY 371 (391)
Q Consensus 361 ~C~~~~C~~~f 371 (391)
.|+. ||..|
T Consensus 16 ~Cp~--CG~~F 24 (26)
T PF10571_consen 16 FCPH--CGYDF 24 (26)
T ss_pred cCCC--CCCCC
Confidence 3555 66555
No 127
>KOG3408|consensus
Probab=23.35 E-value=45 Score=25.32 Aligned_cols=26 Identities=35% Similarity=0.545 Sum_probs=20.7
Q ss_pred CCCccccccccccccccCchHHHHHHhH
Q psy11069 203 KPLTLCCEELGCGRKFQTMKQYSTHLKE 230 (391)
Q Consensus 203 ~~~~~~C~~~~C~~~f~~~~~l~~H~~~ 230 (391)
|...|.|-. |.+-|.+...|..|.++
T Consensus 54 G~GqfyCi~--CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 54 GGGQFYCIE--CARYFIDAKALKTHFKT 79 (129)
T ss_pred CCceeehhh--hhhhhcchHHHHHHHhc
Confidence 334577888 99999999999988875
No 128
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=22.30 E-value=59 Score=31.34 Aligned_cols=27 Identities=22% Similarity=0.459 Sum_probs=0.0
Q ss_pred cccCcccccCCChhHHHHHHHHhcCCC
Q psy11069 269 LVCKGCGKQFKVPCRLREHYKAVHEST 295 (391)
Q Consensus 269 ~~C~~C~~~f~~~~~l~~H~~~~H~~~ 295 (391)
+.|+.|.+.|.....+..|+...|.+.
T Consensus 58 WiCp~CskkF~d~~~~~~H~~~eH~~~ 84 (466)
T PF04780_consen 58 WICPRCSKKFSDAESCLSHMEQEHPAG 84 (466)
T ss_pred eeCCcccceeCCHHHHHHHHHHhhhhh
No 129
>KOG3002|consensus
Probab=21.95 E-value=60 Score=29.41 Aligned_cols=80 Identities=19% Similarity=0.296 Sum_probs=46.8
Q ss_pred CCccccCcccccCCChhHHHHHHHHhcCCCCcccccc--CCcCccccCHHHHHHhhhccCCCCCccCCCCC--CCCcccC
Q psy11069 266 PEDLVCKGCGKQFKVPCRLREHYKAVHESTKNFKCTH--DGCKLLFSSKSALKRHNKSKHLNMRMFPCPLV--TCGKSFL 341 (391)
Q Consensus 266 ~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~--~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~~--~C~~~f~ 341 (391)
+...+|+.|...+.....+.. ..+ .....+.|+. -+|.+.|..... ..|.+ .-.-.+|.|+.+ .|... .
T Consensus 78 ~~~~~CP~Cr~~~g~~R~~am--EkV-~e~~~vpC~~~~~GC~~~~~Y~~~-~~HE~--~C~f~~~~CP~p~~~C~~~-G 150 (299)
T KOG3002|consen 78 KVSNKCPTCRLPIGNIRCRAM--EKV-AEAVLVPCKNAKLGCTKSFPYGEK-SKHEK--VCEFRPCSCPVPGAECKYT-G 150 (299)
T ss_pred hhcccCCccccccccHHHHHH--HHH-HHhceecccccccCCceeeccccc-ccccc--ccccCCcCCCCCcccCCcc-C
Confidence 456778888888775544433 222 2334566764 368888877766 45554 122367777765 55432 3
Q ss_pred ChHHHHHHHHH
Q psy11069 342 RSEHLQEHMLQ 352 (391)
Q Consensus 342 ~~~~L~~H~~~ 352 (391)
....|..|.+.
T Consensus 151 ~~~~l~~H~~~ 161 (299)
T KOG3002|consen 151 SYKDLYAHLND 161 (299)
T ss_pred cHHHHHHHHHh
Confidence 45567777654
No 130
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=20.77 E-value=68 Score=19.57 Aligned_cols=26 Identities=19% Similarity=0.471 Sum_probs=16.4
Q ss_pred cCCCCCCCCcccCChHHHHHHHHHhcCCCCceeccCcCcccc
Q psy11069 329 FPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMS 370 (391)
Q Consensus 329 ~~C~~~~C~~~f~~~~~L~~H~~~h~~~~~~~~C~~~~C~~~ 370 (391)
|.|.- ||..|.-. ... +.+|+. ||..
T Consensus 3 Y~C~~--Cg~~~~~~-----------~~~-~irC~~--CG~r 28 (44)
T smart00659 3 YICGE--CGRENEIK-----------SKD-VVRCRE--CGYR 28 (44)
T ss_pred EECCC--CCCEeecC-----------CCC-ceECCC--CCce
Confidence 67775 88777643 123 478877 8754
No 131
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=20.55 E-value=45 Score=24.01 Aligned_cols=13 Identities=38% Similarity=0.700 Sum_probs=8.4
Q ss_pred CccccCcccccCC
Q psy11069 267 EDLVCKGCGKQFK 279 (391)
Q Consensus 267 ~~~~C~~C~~~f~ 279 (391)
..+.|..|++.|.
T Consensus 53 GIW~C~~C~~~~A 65 (90)
T PTZ00255 53 GIWRCKGCKKTVA 65 (90)
T ss_pred EEEEcCCCCCEEe
Confidence 3567777776664
No 132
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=20.44 E-value=49 Score=17.27 Aligned_cols=7 Identities=57% Similarity=1.516 Sum_probs=4.3
Q ss_pred eeccCcCcc
Q psy11069 360 FTCPYQDCL 368 (391)
Q Consensus 360 ~~C~~~~C~ 368 (391)
|.||. ||
T Consensus 17 f~CPn--CG 23 (24)
T PF07754_consen 17 FPCPN--CG 23 (24)
T ss_pred EeCCC--CC
Confidence 66666 65
No 133
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=20.07 E-value=51 Score=20.69 Aligned_cols=29 Identities=28% Similarity=0.777 Sum_probs=17.8
Q ss_pred CccCCCCCCCCcccCChHHHHHHHHHhcCCCCceeccCcCcccc
Q psy11069 327 RMFPCPLVTCGKSFLRSEHLQEHMLQHNENKPKFTCPYQDCLMS 370 (391)
Q Consensus 327 ~~~~C~~~~C~~~f~~~~~L~~H~~~h~~~~~~~~C~~~~C~~~ 370 (391)
..|.|.. ||+.|.... ... ..+|++ ||..
T Consensus 5 ~~Y~C~~--Cg~~~~~~~----------~~~-~irCp~--Cg~r 33 (49)
T COG1996 5 MEYKCAR--CGREVELDQ----------ETR-GIRCPY--CGSR 33 (49)
T ss_pred EEEEhhh--cCCeeehhh----------ccC-ceeCCC--CCcE
Confidence 3578876 888882111 122 488888 8854
No 134
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=20.06 E-value=62 Score=20.33 Aligned_cols=14 Identities=43% Similarity=1.039 Sum_probs=7.1
Q ss_pred ccccCcccccCCCh
Q psy11069 268 DLVCKGCGKQFKVP 281 (391)
Q Consensus 268 ~~~C~~C~~~f~~~ 281 (391)
.+.|..||..|...
T Consensus 4 ~l~C~dCg~~FvfT 17 (49)
T PF13451_consen 4 TLTCKDCGAEFVFT 17 (49)
T ss_pred eEEcccCCCeEEEe
Confidence 44555555555433
No 135
>PF10013 DUF2256: Uncharacterized protein conserved in bacteria (DUF2256); InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=20.04 E-value=67 Score=19.37 Aligned_cols=17 Identities=24% Similarity=0.616 Sum_probs=12.6
Q ss_pred cccCcccccCCChhHHH
Q psy11069 269 LVCKGCGKQFKVPCRLR 285 (391)
Q Consensus 269 ~~C~~C~~~f~~~~~l~ 285 (391)
-.|..||+.|..+....
T Consensus 9 K~C~~C~rpf~WRKKW~ 25 (42)
T PF10013_consen 9 KICPVCGRPFTWRKKWA 25 (42)
T ss_pred CcCcccCCcchHHHHHH
Confidence 45888888888776554
Done!