Query psy1991
Match_columns 366
No_of_seqs 361 out of 1653
Neff 5.2
Searched_HMMs 46136
Date Fri Aug 16 21:24:16 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1991.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1991hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2462|consensus 99.9 1.5E-24 3.2E-29 206.9 4.6 133 229-364 128-264 (279)
2 KOG2462|consensus 99.7 1.1E-18 2.3E-23 166.9 4.8 104 259-366 128-238 (279)
3 KOG3576|consensus 99.4 2.7E-13 5.9E-18 125.6 7.1 109 230-366 116-235 (267)
4 KOG3623|consensus 99.4 2.8E-13 6E-18 142.7 6.0 85 276-364 887-971 (1007)
5 KOG1074|consensus 99.3 5.7E-13 1.2E-17 142.2 4.5 51 314-366 880-930 (958)
6 KOG3623|consensus 99.3 1.9E-13 4.2E-18 143.9 0.3 120 232-364 211-330 (1007)
7 KOG3608|consensus 99.3 7.6E-13 1.6E-17 130.6 1.2 104 232-339 208-316 (467)
8 KOG3576|consensus 99.3 6.2E-13 1.4E-17 123.2 0.4 79 282-364 116-194 (267)
9 KOG1074|consensus 99.3 3.7E-12 8.1E-17 136.0 5.4 50 231-282 353-402 (958)
10 KOG3608|consensus 99.2 8E-12 1.7E-16 123.5 3.2 129 229-364 235-373 (467)
11 PHA00733 hypothetical protein 98.7 1.1E-08 2.5E-13 88.9 3.1 85 275-365 32-121 (128)
12 PLN03086 PRLI-interacting fact 98.6 1.1E-07 2.5E-12 100.2 7.5 106 231-353 407-514 (567)
13 PLN03086 PRLI-interacting fact 98.5 1E-07 2.3E-12 100.5 5.7 112 232-364 434-561 (567)
14 PHA02768 hypothetical protein; 98.5 6.6E-08 1.4E-12 72.7 1.9 44 313-360 5-48 (55)
15 PF13465 zf-H2C2_2: Zinc-finge 98.4 2E-07 4.3E-12 59.7 2.4 26 330-355 1-26 (26)
16 PHA00733 hypothetical protein 98.1 1.8E-06 3.8E-11 75.2 3.6 85 231-340 40-124 (128)
17 PHA02768 hypothetical protein; 98.0 3.3E-06 7.1E-11 63.6 1.6 44 283-332 5-48 (55)
18 PF13465 zf-H2C2_2: Zinc-finge 97.9 8.7E-06 1.9E-10 52.0 2.5 26 300-327 1-26 (26)
19 KOG3993|consensus 97.8 3.5E-06 7.7E-11 85.5 -0.8 81 282-366 266-379 (500)
20 PHA00732 hypothetical protein 97.7 2.9E-05 6.3E-10 62.5 3.1 40 291-338 7-47 (79)
21 COG5189 SFP1 Putative transcri 97.6 1.9E-05 4.1E-10 78.0 1.3 54 311-364 347-419 (423)
22 KOG3993|consensus 97.5 4E-05 8.7E-10 78.1 2.1 78 283-364 356-479 (500)
23 PHA00616 hypothetical protein 97.5 3.7E-05 8.1E-10 55.5 1.3 34 313-348 1-34 (44)
24 PF00096 zf-C2H2: Zinc finger, 97.5 4.4E-05 9.5E-10 46.6 1.1 23 344-366 1-23 (23)
25 PF13894 zf-C2H2_4: C2H2-type 96.9 0.00041 8.9E-09 41.8 1.3 23 344-366 1-23 (24)
26 PF12756 zf-C2H2_2: C2H2 type 96.9 0.00062 1.3E-08 54.3 2.3 70 286-364 2-71 (100)
27 PF13912 zf-C2H2_6: C2H2-type 96.9 0.0006 1.3E-08 43.1 1.8 24 343-366 1-24 (27)
28 PF05605 zf-Di19: Drought indu 96.8 0.0015 3.2E-08 48.4 3.5 47 314-365 3-51 (54)
29 PHA00616 hypothetical protein 96.8 0.0005 1.1E-08 49.7 0.9 33 284-318 2-34 (44)
30 PHA00732 hypothetical protein 96.6 0.0015 3.3E-08 52.6 2.5 45 313-365 1-46 (79)
31 PF00096 zf-C2H2: Zinc finger, 96.5 0.0021 4.7E-08 39.0 2.1 23 314-338 1-23 (23)
32 COG5189 SFP1 Putative transcri 96.4 0.0011 2.3E-08 66.0 0.9 54 281-336 347-419 (423)
33 smart00355 ZnF_C2H2 zinc finge 96.2 0.0027 5.8E-08 38.4 1.7 23 344-366 1-23 (26)
34 PF05605 zf-Di19: Drought indu 96.2 0.0057 1.2E-07 45.3 3.6 49 284-339 3-53 (54)
35 PF12756 zf-C2H2_2: C2H2 type 95.9 0.0049 1.1E-07 49.0 2.4 25 233-257 1-25 (100)
36 PF13894 zf-C2H2_4: C2H2-type 95.9 0.0072 1.6E-07 36.2 2.5 23 314-338 1-23 (24)
37 PF13912 zf-C2H2_6: C2H2-type 95.2 0.015 3.2E-07 36.7 2.1 25 313-339 1-25 (27)
38 PRK04860 hypothetical protein; 94.7 0.018 3.9E-07 52.2 2.2 38 313-356 119-156 (160)
39 PF12874 zf-met: Zinc-finger o 94.6 0.017 3.7E-07 35.7 1.1 22 344-365 1-22 (25)
40 KOG2231|consensus 94.4 0.033 7.2E-07 60.4 3.5 67 286-364 185-260 (669)
41 PF09237 GAGA: GAGA factor; I 93.6 0.043 9.3E-07 41.1 1.8 40 302-343 12-52 (54)
42 smart00355 ZnF_C2H2 zinc finge 93.4 0.07 1.5E-06 31.9 2.2 24 314-339 1-24 (26)
43 COG5048 FOG: Zn-finger [Genera 92.8 0.054 1.2E-06 52.3 1.7 55 231-285 289-347 (467)
44 PF12171 zf-C2H2_jaz: Zinc-fin 92.5 0.052 1.1E-06 34.5 0.8 22 344-365 2-23 (27)
45 PF09237 GAGA: GAGA factor; I 91.8 0.067 1.5E-06 40.1 0.7 34 332-365 12-46 (54)
46 PF13909 zf-H2C2_5: C2H2-type 91.4 0.11 2.3E-06 32.0 1.2 21 344-365 1-21 (24)
47 PF12874 zf-met: Zinc-finger o 91.0 0.17 3.7E-06 31.1 1.9 23 314-338 1-23 (25)
48 PF13913 zf-C2HC_2: zinc-finge 89.5 0.24 5.1E-06 31.4 1.6 20 344-364 3-22 (25)
49 COG5048 FOG: Zn-finger [Genera 88.8 0.21 4.5E-06 48.3 1.5 132 230-362 320-464 (467)
50 PRK04860 hypothetical protein; 87.2 0.54 1.2E-05 42.7 3.0 40 284-329 118-157 (160)
51 PF13909 zf-H2C2_5: C2H2-type 86.7 0.59 1.3E-05 28.6 2.1 23 314-339 1-23 (24)
52 smart00451 ZnF_U1 U1-like zinc 86.3 0.56 1.2E-05 30.9 2.0 22 343-364 3-24 (35)
53 PF12171 zf-C2H2_jaz: Zinc-fin 84.8 0.39 8.5E-06 30.4 0.7 22 314-337 2-23 (27)
54 KOG1146|consensus 83.2 0.42 9.2E-06 55.3 0.5 72 291-364 442-539 (1406)
55 KOG2482|consensus 82.6 1.4 3E-05 44.8 3.7 49 314-364 280-355 (423)
56 KOG2231|consensus 77.4 2.5 5.5E-05 46.3 4.0 113 232-365 100-234 (669)
57 KOG4173|consensus 73.5 1.5 3.2E-05 41.8 0.9 48 286-335 109-166 (253)
58 KOG2893|consensus 71.8 1.3 2.7E-05 43.1 0.0 46 286-339 13-59 (341)
59 KOG4173|consensus 70.3 1.5 3.2E-05 41.8 0.1 77 282-363 78-166 (253)
60 smart00451 ZnF_U1 U1-like zinc 67.5 4.8 0.0001 26.3 2.1 23 313-337 3-25 (35)
61 COG5236 Uncharacterized conser 66.9 3.4 7.4E-05 42.1 1.9 67 286-364 223-302 (493)
62 PF02892 zf-BED: BED zinc fing 65.6 5.3 0.00012 27.8 2.2 23 341-363 14-40 (45)
63 COG5236 Uncharacterized conser 60.2 4.4 9.5E-05 41.4 1.3 23 285-307 153-175 (493)
64 smart00614 ZnF_BED BED zinc fi 60.2 5.6 0.00012 28.8 1.5 20 345-364 20-44 (50)
65 KOG1146|consensus 59.2 2 4.4E-05 50.0 -1.4 72 232-306 1229-1305(1406)
66 COG4049 Uncharacterized protei 57.8 4.6 9.9E-05 31.0 0.7 25 339-363 13-37 (65)
67 TIGR00622 ssl1 transcription f 57.7 22 0.00049 30.7 5.0 96 231-350 15-110 (112)
68 KOG4124|consensus 55.5 2.3 5.1E-05 43.3 -1.5 53 311-363 347-418 (442)
69 cd00350 rubredoxin_like Rubred 54.8 8.9 0.00019 25.6 1.7 24 314-351 2-25 (33)
70 TIGR00373 conserved hypothetic 43.8 23 0.00051 31.8 3.1 34 309-353 105-138 (158)
71 smart00531 TFIIE Transcription 42.0 26 0.00057 30.9 3.1 38 310-353 96-133 (147)
72 KOG2575|consensus 39.4 10 0.00022 39.7 0.1 37 64-110 52-89 (510)
73 KOG4167|consensus 38.4 6.7 0.00015 43.5 -1.4 24 343-366 792-815 (907)
74 PF02176 zf-TRAF: TRAF-type zi 37.6 21 0.00045 26.1 1.5 44 311-355 7-54 (60)
75 TIGR02098 MJ0042_CXXC MJ0042 f 36.8 27 0.00059 23.5 1.9 34 314-354 3-36 (38)
76 PRK06266 transcription initiat 34.9 34 0.00075 31.5 2.8 34 310-354 114-147 (178)
77 smart00734 ZnF_Rad18 Rad18-lik 33.0 32 0.00069 22.0 1.6 19 345-364 3-21 (26)
78 PF09538 FYDLN_acid: Protein o 32.6 28 0.00061 29.7 1.7 15 312-328 25-39 (108)
79 COG4049 Uncharacterized protei 32.1 23 0.0005 27.3 1.0 29 308-338 12-40 (65)
80 cd00729 rubredoxin_SM Rubredox 31.4 33 0.00072 23.2 1.6 25 313-351 2-26 (34)
81 PF05443 ROS_MUCR: ROS/MUCR tr 29.6 26 0.00057 31.0 1.1 20 344-366 73-92 (132)
82 KOG2785|consensus 29.0 55 0.0012 33.9 3.3 49 314-364 167-241 (390)
83 COG1592 Rubrerythrin [Energy p 27.5 41 0.00088 31.0 1.9 24 313-351 134-157 (166)
84 TIGR00622 ssl1 transcription f 26.7 77 0.0017 27.4 3.4 73 286-366 18-104 (112)
85 KOG2785|consensus 25.8 67 0.0015 33.3 3.3 48 285-336 168-241 (390)
86 TIGR02605 CxxC_CxxC_SSSS putat 25.4 26 0.00055 25.2 0.2 11 314-326 6-16 (52)
87 smart00834 CxxC_CXXC_SSSS Puta 24.8 27 0.00059 23.5 0.3 29 314-351 6-34 (41)
88 KOG2482|consensus 24.5 84 0.0018 32.4 3.7 52 312-365 143-217 (423)
89 PF13717 zinc_ribbon_4: zinc-r 23.8 58 0.0012 22.3 1.7 14 233-246 4-17 (36)
90 COG2888 Predicted Zn-ribbon RN 23.3 80 0.0017 24.5 2.5 32 313-351 27-58 (61)
91 PF05443 ROS_MUCR: ROS/MUCR tr 23.2 43 0.00093 29.7 1.2 22 286-312 75-96 (132)
92 KOG4377|consensus 22.9 46 0.00099 34.9 1.5 68 284-354 272-360 (480)
93 PRK00398 rpoP DNA-directed RNA 22.8 37 0.0008 24.1 0.6 29 313-353 3-31 (46)
94 KOG2186|consensus 22.7 54 0.0012 32.3 1.9 50 283-339 3-52 (276)
95 smart00659 RPOLCX RNA polymera 22.7 71 0.0015 22.9 2.1 26 314-352 3-28 (44)
96 PF09432 THP2: Tho complex sub 22.7 34 0.00075 30.3 0.5 16 55-70 99-114 (132)
97 PF11756 YgbA_NO: Nitrous oxid 22.3 27 0.0006 30.1 -0.2 37 37-73 45-87 (112)
98 PF04959 ARS2: Arsenite-resist 21.8 52 0.0011 31.4 1.6 20 344-363 78-97 (214)
99 PF15269 zf-C2H2_7: Zinc-finge 20.8 61 0.0013 23.9 1.4 21 344-364 21-41 (54)
100 PF12013 DUF3505: Protein of u 20.5 80 0.0017 26.2 2.3 12 232-243 12-23 (109)
No 1
>KOG2462|consensus
Probab=99.90 E-value=1.5e-24 Score=206.89 Aligned_cols=133 Identities=26% Similarity=0.453 Sum_probs=120.6
Q ss_pred CCCcccccccccccchhHhHhhhhccccC--CCCccccCCCCCccccchhhhcccceee--eecCCCCCcccCchhHHHH
Q psy1991 229 VNSSSHGNYNKLHIHPYYQTHHHTQIFQT--PSITIKSSLLSKPKRRQNWSRRKIIMHT--CSHNGCGKTYTKSSHLKAH 304 (366)
Q Consensus 229 ~~P~iC~~CGK~F~~~~~L~hHh~~iH~~--p~~~~kC~~Csk~f~~k~~l~~H~k~H~--C~~~~CgK~F~skssLk~H 304 (366)
+..+.|++|||.+.+..+|.+| ++.|.. ....+.|..|+|.+....+++.|+++|. |+|.+|||.|.+.+-|+.|
T Consensus 128 ~~r~~c~eCgk~ysT~snLsrH-kQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~l~c~C~iCGKaFSRPWLLQGH 206 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSNLSRH-KQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHTLPCECGICGKAFSRPWLLQGH 206 (279)
T ss_pred CCceeccccccccccccccchh-hcccccccccccccCCCCCceeeehHHHhhHhhccCCCcccccccccccchHHhhcc
Confidence 3458899999999999999955 444432 2357899999999999999999999998 8888899999999999999
Q ss_pred HHhhcCCCceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCccccccCChhHHHHHhh
Q psy1991 305 LRTHTGEKPYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCERAFSRSDHLSLHMK 364 (366)
Q Consensus 305 ~rtH~~eKpf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgKsFs~kssLk~H~K 364 (366)
+|+|+|||||.| +.|+|.|..+++|+.||++|.+.|+|+|..|+|+|++++.|.+|..
T Consensus 207 iRTHTGEKPF~C--~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~E 264 (279)
T KOG2462|consen 207 IRTHTGEKPFSC--PHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSE 264 (279)
T ss_pred cccccCCCCccC--CcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhh
Confidence 999999999999 4599999999999999999999999999999999999999999975
No 2
>KOG2462|consensus
Probab=99.74 E-value=1.1e-18 Score=166.92 Aligned_cols=104 Identities=38% Similarity=0.647 Sum_probs=94.5
Q ss_pred CCccccCCCCCccccchhhhcccceee-------eecCCCCCcccCchhHHHHHHhhcCCCceecCCCCCCCcccChHHH
Q psy1991 259 SITIKSSLLSKPKRRQNWSRRKIIMHT-------CSHNGCGKTYTKSSHLKAHLRTHTGEKPYQCGWKGCGWKFARSDEL 331 (366)
Q Consensus 259 ~~~~kC~~Csk~f~~k~~l~~H~k~H~-------C~~~~CgK~F~skssLk~H~rtH~~eKpf~C~i~~CgKsF~sk~~L 331 (366)
...++|..|++.+.....|.+|...|. +.|+.|||.|.+..+|+.|+|+|+ -+++|.+ |||.|.+...|
T Consensus 128 ~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~i--CGKaFSRPWLL 203 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGI--CGKAFSRPWLL 203 (279)
T ss_pred CCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCccccc--ccccccchHHh
Confidence 346889999999999999999987664 333449999999999999999997 6799977 99999999999
Q ss_pred HHHhhhhcCCCCeecCccccccCChhHHHHHhhcC
Q psy1991 332 TRHFRKHTGDRPFQCTLCERAFSRSDHLSLHMKRH 366 (366)
Q Consensus 332 ~~H~rtHtgeKpykC~~CgKsFs~kssLk~H~KtH 366 (366)
+.|+|+|+|||||.|..|+|.|+.+++|+.||+||
T Consensus 204 QGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTH 238 (279)
T KOG2462|consen 204 QGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTH 238 (279)
T ss_pred hcccccccCCCCccCCcccchhcchHHHHHHHHhh
Confidence 99999999999999999999999999999999998
No 3
>KOG3576|consensus
Probab=99.42 E-value=2.7e-13 Score=125.61 Aligned_cols=109 Identities=29% Similarity=0.531 Sum_probs=86.6
Q ss_pred CCcccccccccccchhHhHhhhhccccCCCCccccCCCCCccccchhhhcccceeeeecCCCCCcccCchhHHHHHHhhc
Q psy1991 230 NSSSHGNYNKLHIHPYYQTHHHTQIFQTPSITIKSSLLSKPKRRQNWSRRKIIMHTCSHNGCGKTYTKSSHLKAHLRTHT 309 (366)
Q Consensus 230 ~P~iC~~CGK~F~~~~~L~hHh~~iH~~p~~~~kC~~Csk~f~~k~~l~~H~k~H~C~~~~CgK~F~skssLk~H~rtH~ 309 (366)
..+.|.+|||.|.-.+.|.+|.+. | ...++|.|.+ |||.|....+|++|+|+|+
T Consensus 116 d~ftCrvCgK~F~lQRmlnrh~kc-h-----------------------~~vkr~lct~--cgkgfndtfdlkrh~rtht 169 (267)
T KOG3576|consen 116 DSFTCRVCGKKFGLQRMLNRHLKC-H-----------------------SDVKRHLCTF--CGKGFNDTFDLKRHTRTHT 169 (267)
T ss_pred CeeeeehhhhhhhHHHHHHHHhhh-c-----------------------cHHHHHHHhh--ccCcccchhhhhhhhcccc
Confidence 356777777777777666644211 1 1123455999 9999999999999999999
Q ss_pred CCCceecCCCCCCCcccChHHHHHHhhhhcC-----------CCCeecCccccccCChhHHHHHhhcC
Q psy1991 310 GEKPYQCGWKGCGWKFARSDELTRHFRKHTG-----------DRPFQCTLCERAFSRSDHLSLHMKRH 366 (366)
Q Consensus 310 ~eKpf~C~i~~CgKsF~sk~~L~~H~rtHtg-----------eKpykC~~CgKsFs~kssLk~H~KtH 366 (366)
|.+||+|.. |+|.|..+-.|..|.+.-+| +|-|.|+.||.+-.+...+..|++.|
T Consensus 170 gvrpykc~~--c~kaftqrcsleshl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~ 235 (267)
T KOG3576|consen 170 GVRPYKCSL--CEKAFTQRCSLESHLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLH 235 (267)
T ss_pred Cccccchhh--hhHHHHhhccHHHHHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhc
Confidence 999999955 99999999999999875444 45699999999998888888888764
No 4
>KOG3623|consensus
Probab=99.39 E-value=2.8e-13 Score=142.74 Aligned_cols=85 Identities=39% Similarity=0.752 Sum_probs=78.5
Q ss_pred hhhcccceeeeecCCCCCcccCchhHHHHHHhhcCCCceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCccccccCC
Q psy1991 276 WSRRKIIMHTCSHNGCGKTYTKSSHLKAHLRTHTGEKPYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCERAFSR 355 (366)
Q Consensus 276 ~l~~H~k~H~C~~~~CgK~F~skssLk~H~rtH~~eKpf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgKsFs~ 355 (366)
.++....++.|.. |+|+|...++|.+|.-.|+|.|||+|.| |.|.|+.+..|..|+|.|.|||||+|+.|+|+|+.
T Consensus 887 ~~kte~gmyaCDq--CDK~FqKqSSLaRHKYEHsGqRPyqC~i--CkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSH 962 (1007)
T KOG3623|consen 887 HAKTEDGMYACDQ--CDKAFQKQSSLARHKYEHSGQRPYQCII--CKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSH 962 (1007)
T ss_pred cccCccccchHHH--HHHHHHhhHHHHHhhhhhcCCCCcccch--hhHhhhhhhhhhhhhhhccCCCcchhhhhhhhccc
Confidence 4444556777877 9999999999999999999999999977 99999999999999999999999999999999999
Q ss_pred hhHHHHHhh
Q psy1991 356 SDHLSLHMK 364 (366)
Q Consensus 356 kssLk~H~K 364 (366)
..+...||-
T Consensus 963 SGSYSQHMN 971 (1007)
T KOG3623|consen 963 SGSYSQHMN 971 (1007)
T ss_pred ccchHhhhc
Confidence 999999983
No 5
>KOG1074|consensus
Probab=99.34 E-value=5.7e-13 Score=142.19 Aligned_cols=51 Identities=43% Similarity=0.888 Sum_probs=48.5
Q ss_pred eecCCCCCCCcccChHHHHHHhhhhcCCCCeecCccccccCChhHHHHHhhcC
Q psy1991 314 YQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCERAFSRSDHLSLHMKRH 366 (366)
Q Consensus 314 f~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgKsFs~kssLk~H~KtH 366 (366)
..|.+ |++.|....+|..|+|+|+|+|||.|.+|++.|..+.+|+.||.+|
T Consensus 880 h~C~v--Cgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttrgnLKvHMgtH 930 (958)
T KOG1074|consen 880 HVCNV--CGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTRGNLKVHMGTH 930 (958)
T ss_pred hhhcc--chhcccchHHHHHhhhcCCCCCCccchhhhhhhhhhhhhhhhhccc
Confidence 46855 9999999999999999999999999999999999999999999987
No 6
>KOG3623|consensus
Probab=99.33 E-value=1.9e-13 Score=143.89 Aligned_cols=120 Identities=22% Similarity=0.342 Sum_probs=91.0
Q ss_pred cccccccccccchhHhHhhhhccccCCCCccccCCCCCccccchhhhcccceeeeecCCCCCcccCchhHHHHHHhhcCC
Q psy1991 232 SSHGNYNKLHIHPYYQTHHHTQIFQTPSITIKSSLLSKPKRRQNWSRRKIIMHTCSHNGCGKTYTKSSHLKAHLRTHTGE 311 (366)
Q Consensus 232 ~iC~~CGK~F~~~~~L~hHh~~iH~~p~~~~kC~~Csk~f~~k~~l~~H~k~H~C~~~~CgK~F~skssLk~H~rtH~~e 311 (366)
..|..|++.+.....|+.|.+..|......+.|..|...|..+..|.+|+..|+= |+-.- .|.-.-...
T Consensus 211 ltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkp----g~dqa-------~sltqsa~l 279 (1007)
T KOG3623|consen 211 LTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKP----GGDQA-------ISLTQSALL 279 (1007)
T ss_pred hcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcC----CCccc-------ccccchhhh
Confidence 6799999999999999999888888777677766666666655555555555430 11000 010111234
Q ss_pred CceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCccccccCChhHHHHHhh
Q psy1991 312 KPYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCERAFSRSDHLSLHMK 364 (366)
Q Consensus 312 Kpf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgKsFs~kssLk~H~K 364 (366)
|.|+| .+|||.|+.+..|+.|+|+|.|||||.|+.|+|+|+...++..||.
T Consensus 280 RKFKC--tECgKAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 280 RKFKC--TECGKAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred ccccc--cccchhhhhHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence 67999 7799999999999999999999999999999999999999999974
No 7
>KOG3608|consensus
Probab=99.28 E-value=7.6e-13 Score=130.60 Aligned_cols=104 Identities=19% Similarity=0.395 Sum_probs=61.8
Q ss_pred cccccccccccchhHhHhhhhccccCCCCccccCCCCCccccchhhhcccceee----eecCCCCCcccCchhHHHHHHh
Q psy1991 232 SSHGNYNKLHIHPYYQTHHHTQIFQTPSITIKSSLLSKPKRRQNWSRRKIIMHT----CSHNGCGKTYTKSSHLKAHLRT 307 (366)
Q Consensus 232 ~iC~~CGK~F~~~~~L~hHh~~iH~~p~~~~kC~~Csk~f~~k~~l~~H~k~H~----C~~~~CgK~F~skssLk~H~rt 307 (366)
..|+.||..|+++..|--|-++.-......+.|..|.|.|.....|+.|+..|. |+- |+.+....++|.+|++.
T Consensus 208 vACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn~ykCpl--Cdmtc~~~ssL~~H~r~ 285 (467)
T KOG3608|consen 208 VACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVNCYKCPL--CDMTCSSASSLTTHIRY 285 (467)
T ss_pred EecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhhcccccc--cccCCCChHHHHHHHHh
Confidence 456666666666666554544333333345666666666666666666655443 433 66666666666666664
Q ss_pred -hcCCCceecCCCCCCCcccChHHHHHHhhhhc
Q psy1991 308 -HTGEKPYQCGWKGCGWKFARSDELTRHFRKHT 339 (366)
Q Consensus 308 -H~~eKpf~C~i~~CgKsF~sk~~L~~H~rtHt 339 (366)
|...|||+| +.|++.|.+...|.+|..+|.
T Consensus 286 rHs~dkpfKC--d~Cd~~c~~esdL~kH~~~HS 316 (467)
T KOG3608|consen 286 RHSKDKPFKC--DECDTRCVRESDLAKHVQVHS 316 (467)
T ss_pred hhccCCCccc--cchhhhhccHHHHHHHHHhcc
Confidence 666666666 446666666666666655554
No 8
>KOG3576|consensus
Probab=99.28 E-value=6.2e-13 Score=123.23 Aligned_cols=79 Identities=41% Similarity=0.797 Sum_probs=74.8
Q ss_pred ceeeeecCCCCCcccCchhHHHHHHhhcCCCceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCccccccCChhHHHH
Q psy1991 282 IMHTCSHNGCGKTYTKSSHLKAHLRTHTGEKPYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCERAFSRSDHLSL 361 (366)
Q Consensus 282 k~H~C~~~~CgK~F~skssLk~H~rtH~~eKpf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgKsFs~kssLk~ 361 (366)
..+.|.. |+|.|.-..-|.+|++-|...+.+-|.+ |||.|...-+|++|+|+|+|.+||+|..|+|.|.++-+|..
T Consensus 116 d~ftCrv--CgK~F~lQRmlnrh~kch~~vkr~lct~--cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsles 191 (267)
T KOG3576|consen 116 DSFTCRV--CGKKFGLQRMLNRHLKCHSDVKRHLCTF--CGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLES 191 (267)
T ss_pred Ceeeeeh--hhhhhhHHHHHHHHhhhccHHHHHHHhh--ccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHH
Confidence 4466888 9999999999999999999999999977 99999999999999999999999999999999999999999
Q ss_pred Hhh
Q psy1991 362 HMK 364 (366)
Q Consensus 362 H~K 364 (366)
|++
T Consensus 192 hl~ 194 (267)
T KOG3576|consen 192 HLK 194 (267)
T ss_pred HHH
Confidence 975
No 9
>KOG1074|consensus
Probab=99.26 E-value=3.7e-12 Score=136.04 Aligned_cols=50 Identities=4% Similarity=-0.112 Sum_probs=34.9
Q ss_pred CcccccccccccchhHhHhhhhccccCCCCccccCCCCCccccchhhhcccc
Q psy1991 231 SSSHGNYNKLHIHPYYQTHHHTQIFQTPSITIKSSLLSKPKRRQNWSRRKII 282 (366)
Q Consensus 231 P~iC~~CGK~F~~~~~L~hHh~~iH~~p~~~~kC~~Csk~f~~k~~l~~H~k 282 (366)
++.|..|.|.|...+.|+ .|.+.|... +.|+|..|+..|..+..|+.|..
T Consensus 353 khkCr~CakvfgS~SaLq-iHlRSHTGE-RPfqCnvCG~~FSTkGNLKvH~~ 402 (958)
T KOG1074|consen 353 KHKCRFCAKVFGSDSALQ-IHLRSHTGE-RPFQCNVCGNRFSTKGNLKVHFQ 402 (958)
T ss_pred cchhhhhHhhcCchhhhh-hhhhccCCC-CCeeecccccccccccceeeeee
Confidence 456777777777777777 555666544 36777777777777777777653
No 10
>KOG3608|consensus
Probab=99.19 E-value=8e-12 Score=123.47 Aligned_cols=129 Identities=22% Similarity=0.391 Sum_probs=107.3
Q ss_pred CCCcccccccccccchhHhHhhhhccccCCCCccccCCCCCccccchhhhccc-------ceeeeecCCCCCcccCchhH
Q psy1991 229 VNSSSHGNYNKLHIHPYYQTHHHTQIFQTPSITIKSSLLSKPKRRQNWSRRKI-------IMHTCSHNGCGKTYTKSSHL 301 (366)
Q Consensus 229 ~~P~iC~~CGK~F~~~~~L~hHh~~iH~~p~~~~kC~~Csk~f~~k~~l~~H~-------k~H~C~~~~CgK~F~skssL 301 (366)
.++|.|.+|-|.|.++..|..|..+... .++|+.|...-...+.|++|+ +.++|.+ |++.|.+.+.|
T Consensus 235 ~n~fqC~~C~KrFaTeklL~~Hv~rHvn----~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~--Cd~~c~~esdL 308 (467)
T KOG3608|consen 235 TNSFQCAQCFKRFATEKLLKSHVVRHVN----CYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDE--CDTRCVRESDL 308 (467)
T ss_pred CCchHHHHHHHHHhHHHHHHHHHHHhhh----cccccccccCCCChHHHHHHHHhhhccCCCccccc--hhhhhccHHHH
Confidence 3479999999999999999866544433 356667776666667776665 5667888 99999999999
Q ss_pred HHHHHhhcCCCceecCCCCCCCcccChHHHHHHhhhhc-CC--CCeecCccccccCChhHHHHHhh
Q psy1991 302 KAHLRTHTGEKPYQCGWKGCGWKFARSDELTRHFRKHT-GD--RPFQCTLCERAFSRSDHLSLHMK 364 (366)
Q Consensus 302 k~H~rtH~~eKpf~C~i~~CgKsF~sk~~L~~H~rtHt-ge--KpykC~~CgKsFs~kssLk~H~K 364 (366)
.+|..+|. +-.|.|..+.|..+|++...+++|++.++ |. .+|.|..|++.|.+..+|.+|++
T Consensus 309 ~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~ 373 (467)
T KOG3608|consen 309 AKHVQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLM 373 (467)
T ss_pred HHHHHhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHH
Confidence 99999987 77899988889999999999999988654 54 45999999999999999999965
No 11
>PHA00733 hypothetical protein
Probab=98.67 E-value=1.1e-08 Score=88.94 Aligned_cols=85 Identities=20% Similarity=0.291 Sum_probs=66.8
Q ss_pred hhhhcccceeeeecCCCCCcccCchhHHHH--HH---hhcCCCceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCcc
Q psy1991 275 NWSRRKIIMHTCSHNGCGKTYTKSSHLKAH--LR---THTGEKPYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLC 349 (366)
Q Consensus 275 ~~l~~H~k~H~C~~~~CgK~F~skssLk~H--~r---tH~~eKpf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~C 349 (366)
..+..+.+.+.|.. |.+.|.....|..| ++ .+.++++|.| +.|++.|.+...|..|++.| +.+|.|+.|
T Consensus 32 ~~~~~~~~~~~~~~--~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C--~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~C 105 (128)
T PHA00733 32 HSLTPEQKRLIRAV--VKTLIYNPQLLDESSYLYKLLTSKAVSPYVC--PLCLMPFSSSVSLKQHIRYT--EHSKVCPVC 105 (128)
T ss_pred hcCChhhhhHHHHH--HhhhccChhhhcchHHHHhhcccCCCCCccC--CCCCCcCCCHHHHHHHHhcC--CcCccCCCC
Confidence 33444455566777 88888777776665 22 2345889999 55999999999999999987 457999999
Q ss_pred ccccCChhHHHHHhhc
Q psy1991 350 ERAFSRSDHLSLHMKR 365 (366)
Q Consensus 350 gKsFs~kssLk~H~Kt 365 (366)
++.|.....|.+|+..
T Consensus 106 gK~F~~~~sL~~H~~~ 121 (128)
T PHA00733 106 GKEFRNTDSTLDHVCK 121 (128)
T ss_pred CCccCCHHHHHHHHHH
Confidence 9999999999999864
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.56 E-value=1.1e-07 Score=100.19 Aligned_cols=106 Identities=15% Similarity=0.372 Sum_probs=49.8
Q ss_pred CcccccccccccchhHhHhhhhccccCCCCccccC--CCCCccccchhhhcccceeeeecCCCCCcccCchhHHHHHHhh
Q psy1991 231 SSSHGNYNKLHIHPYYQTHHHTQIFQTPSITIKSS--LLSKPKRRQNWSRRKIIMHTCSHNGCGKTYTKSSHLKAHLRTH 308 (366)
Q Consensus 231 P~iC~~CGK~F~~~~~L~hHh~~iH~~p~~~~kC~--~Csk~f~~k~~l~~H~k~H~C~~~~CgK~F~skssLk~H~rtH 308 (366)
--.|..|.+......... |... |..-...|+ .|+..++ +..+..|. .|+. |++.|. ...|..|+++|
T Consensus 407 ~V~C~NC~~~i~l~~l~l-He~~---C~r~~V~Cp~~~Cg~v~~-r~el~~H~---~C~~--Cgk~f~-~s~LekH~~~~ 475 (567)
T PLN03086 407 TVECRNCKHYIPSRSIAL-HEAY---CSRHNVVCPHDGCGIVLR-VEEAKNHV---HCEK--CGQAFQ-QGEMEKHMKVF 475 (567)
T ss_pred eEECCCCCCccchhHHHH-HHhh---CCCcceeCCcccccceee-ccccccCc---cCCC--CCCccc-hHHHHHHHHhc
Confidence 346999997766655443 3211 222223333 2444442 22223332 2444 555553 34455555554
Q ss_pred cCCCceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCcccccc
Q psy1991 309 TGEKPYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCERAF 353 (366)
Q Consensus 309 ~~eKpf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgKsF 353 (366)
+ +++.| + |++.+ .+..|..|+++|.+++++.|..|++.|
T Consensus 476 H--kpv~C--p-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v 514 (567)
T PLN03086 476 H--EPLQC--P-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMV 514 (567)
T ss_pred C--CCccC--C-CCCCc-chhHHHhhhhccCCCCceeCCCCCCcc
Confidence 2 44555 3 55433 334555555555555555555555555
No 13
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.52 E-value=1e-07 Score=100.49 Aligned_cols=112 Identities=15% Similarity=0.301 Sum_probs=86.0
Q ss_pred ccccc--ccccccchhHhHhhhhccccCCCCccccCCCCCccccchhhhccccee----eeecCCCCCcccCchhHHHHH
Q psy1991 232 SSHGN--YNKLHIHPYYQTHHHTQIFQTPSITIKSSLLSKPKRRQNWSRRKIIMH----TCSHNGCGKTYTKSSHLKAHL 305 (366)
Q Consensus 232 ~iC~~--CGK~F~~~~~L~hHh~~iH~~p~~~~kC~~Csk~f~~k~~l~~H~k~H----~C~~~~CgK~F~skssLk~H~ 305 (366)
-.|+. ||..|... .+..| +.|..|++.|. ...+..|++.+ .|+ ||+.+ .+..|..|+
T Consensus 434 V~Cp~~~Cg~v~~r~-el~~H-----------~~C~~Cgk~f~-~s~LekH~~~~Hkpv~Cp---Cg~~~-~R~~L~~H~ 496 (567)
T PLN03086 434 VVCPHDGCGIVLRVE-EAKNH-----------VHCEKCGQAFQ-QGEMEKHMKVFHEPLQCP---CGVVL-EKEQMVQHQ 496 (567)
T ss_pred eeCCcccccceeecc-ccccC-----------ccCCCCCCccc-hHHHHHHHHhcCCCccCC---CCCCc-chhHHHhhh
Confidence 35774 99988543 23323 37888988885 56678887653 364 99766 668999999
Q ss_pred HhhcCCCceecCCCCCCCccc----------ChHHHHHHhhhhcCCCCeecCccccccCChhHHHHHhh
Q psy1991 306 RTHTGEKPYQCGWKGCGWKFA----------RSDELTRHFRKHTGDRPFQCTLCERAFSRSDHLSLHMK 364 (366)
Q Consensus 306 rtH~~eKpf~C~i~~CgKsF~----------sk~~L~~H~rtHtgeKpykC~~CgKsFs~kssLk~H~K 364 (366)
++|.+++++.|.+ |++.|. ....|..|..++ |.+++.|..||+.+..+ .|..|+.
T Consensus 497 ~thCp~Kpi~C~f--C~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrlr-dm~~H~~ 561 (567)
T PLN03086 497 ASTCPLRLITCRF--CGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGRSVMLK-EMDIHQI 561 (567)
T ss_pred hccCCCCceeCCC--CCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCCeeeeh-hHHHHHH
Confidence 9999999999977 999995 235899999886 89999999999998776 5666753
No 14
>PHA02768 hypothetical protein; Provisional
Probab=98.47 E-value=6.6e-08 Score=72.72 Aligned_cols=44 Identities=25% Similarity=0.560 Sum_probs=36.4
Q ss_pred ceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCccccccCChhHHH
Q psy1991 313 PYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCERAFSRSDHLS 360 (366)
Q Consensus 313 pf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgKsFs~kssLk 360 (366)
-|.| +.||+.|.+..+|..|+++|+ ++|+|..|+|.|.+.+.|.
T Consensus 5 ~y~C--~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYEC--PICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCc--chhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 3788 559999999999999999887 6888999999888877764
No 15
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.39 E-value=2e-07 Score=59.67 Aligned_cols=26 Identities=46% Similarity=1.179 Sum_probs=19.9
Q ss_pred HHHHHhhhhcCCCCeecCccccccCC
Q psy1991 330 ELTRHFRKHTGDRPFQCTLCERAFSR 355 (366)
Q Consensus 330 ~L~~H~rtHtgeKpykC~~CgKsFs~ 355 (366)
+|.+|+++|+|++||+|++|+++|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 46778888888888888888887763
No 16
>PHA00733 hypothetical protein
Probab=98.13 E-value=1.8e-06 Score=75.23 Aligned_cols=85 Identities=20% Similarity=0.287 Sum_probs=62.4
Q ss_pred CcccccccccccchhHhHhhhhccccCCCCccccCCCCCccccchhhhcccceeeeecCCCCCcccCchhHHHHHHhhcC
Q psy1991 231 SSSHGNYNKLHIHPYYQTHHHTQIFQTPSITIKSSLLSKPKRRQNWSRRKIIMHTCSHNGCGKTYTKSSHLKAHLRTHTG 310 (366)
Q Consensus 231 P~iC~~CGK~F~~~~~L~hHh~~iH~~p~~~~kC~~Csk~f~~k~~l~~H~k~H~C~~~~CgK~F~skssLk~H~rtH~~ 310 (366)
+.+|.+|++.|.+...|.++........ ....+.+.|.. ||+.|.+...|..|++.|
T Consensus 40 ~~~~~~~~~~~~~~~~l~~~~~l~~~~~-------------------~~~~kPy~C~~--Cgk~Fss~s~L~~H~r~h-- 96 (128)
T PHA00733 40 RLIRAVVKTLIYNPQLLDESSYLYKLLT-------------------SKAVSPYVCPL--CLMPFSSSVSLKQHIRYT-- 96 (128)
T ss_pred hHHHHHHhhhccChhhhcchHHHHhhcc-------------------cCCCCCccCCC--CCCcCCCHHHHHHHHhcC--
Confidence 5789999999998877774411000000 00123455766 999999999999999987
Q ss_pred CCceecCCCCCCCcccChHHHHHHhhhhcC
Q psy1991 311 EKPYQCGWKGCGWKFARSDELTRHFRKHTG 340 (366)
Q Consensus 311 eKpf~C~i~~CgKsF~sk~~L~~H~rtHtg 340 (366)
+.+|.|. .|++.|.....|.+|++..++
T Consensus 97 ~~~~~C~--~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 97 EHSKVCP--VCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred CcCccCC--CCCCccCCHHHHHHHHHHhcC
Confidence 4579995 499999999999999987654
No 17
>PHA02768 hypothetical protein; Provisional
Probab=97.96 E-value=3.3e-06 Score=63.61 Aligned_cols=44 Identities=27% Similarity=0.508 Sum_probs=38.1
Q ss_pred eeeeecCCCCCcccCchhHHHHHHhhcCCCceecCCCCCCCcccChHHHH
Q psy1991 283 MHTCSHNGCGKTYTKSSHLKAHLRTHTGEKPYQCGWKGCGWKFARSDELT 332 (366)
Q Consensus 283 ~H~C~~~~CgK~F~skssLk~H~rtH~~eKpf~C~i~~CgKsF~sk~~L~ 332 (366)
.+.|+. ||+.|.+..+|..||++|+ ++++| ..|++.|.+.+.|.
T Consensus 5 ~y~C~~--CGK~Fs~~~~L~~H~r~H~--k~~kc--~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPI--CGEIYIKRKSMITHLRKHN--TNLKL--SNCKRISLRTGEYI 48 (55)
T ss_pred ccCcch--hCCeeccHHHHHHHHHhcC--CcccC--CcccceecccceeE
Confidence 456777 9999999999999999998 78999 56999999877664
No 18
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=97.89 E-value=8.7e-06 Score=52.03 Aligned_cols=26 Identities=58% Similarity=1.270 Sum_probs=23.2
Q ss_pred hHHHHHHhhcCCCceecCCCCCCCcccC
Q psy1991 300 HLKAHLRTHTGEKPYQCGWKGCGWKFAR 327 (366)
Q Consensus 300 sLk~H~rtH~~eKpf~C~i~~CgKsF~s 327 (366)
+|.+||++|+++++|.|++ |++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~--C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPY--CGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESS--SSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCC--CcCeeCc
Confidence 5899999999999999955 9999974
No 19
>KOG3993|consensus
Probab=97.80 E-value=3.5e-06 Score=85.53 Aligned_cols=81 Identities=26% Similarity=0.450 Sum_probs=56.4
Q ss_pred ceeeeecCCCCCcccCchhHHHHHHhhcCCCceecCCCCCCCcccChHHHHHHhhhhcC---------------------
Q psy1991 282 IMHTCSHNGCGKTYTKSSHLKAHLRTHTGEKPYQCGWKGCGWKFARSDELTRHFRKHTG--------------------- 340 (366)
Q Consensus 282 k~H~C~~~~CgK~F~skssLk~H~rtH~~eKpf~C~i~~CgKsF~sk~~L~~H~rtHtg--------------------- 340 (366)
..+.|.- |...|.....|.+|.-.-.-.-.|+| ++|+|.|.-..+|..|+|.|..
T Consensus 266 GdyiCqL--CK~kYeD~F~LAQHrC~RIV~vEYrC--PEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae 341 (500)
T KOG3993|consen 266 GDYICQL--CKEKYEDAFALAQHRCPRIVHVEYRC--PECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAE 341 (500)
T ss_pred HHHHHHH--HHHhhhhHHHHhhccCCeeEEeeecC--CcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhh
Confidence 3344555 77777777777777532222234777 5677777777777777776631
Q ss_pred ------------CCCeecCccccccCChhHHHHHhhcC
Q psy1991 341 ------------DRPFQCTLCERAFSRSDHLSLHMKRH 366 (366)
Q Consensus 341 ------------eKpykC~~CgKsFs~kssLk~H~KtH 366 (366)
+.-|.|.+|+|+|.++..|+.|+.+|
T Consensus 342 ~~ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlth 379 (500)
T KOG3993|consen 342 VQEAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTH 379 (500)
T ss_pred hhhccccCCcccCceeecHHhhhhhHHHHHHHHhHHhh
Confidence 12499999999999999999998776
No 20
>PHA00732 hypothetical protein
Probab=97.69 E-value=2.9e-05 Score=62.47 Aligned_cols=40 Identities=35% Similarity=0.578 Sum_probs=21.1
Q ss_pred CCCcccCchhHHHHHHh-hcCCCceecCCCCCCCcccChHHHHHHhhhh
Q psy1991 291 CGKTYTKSSHLKAHLRT-HTGEKPYQCGWKGCGWKFARSDELTRHFRKH 338 (366)
Q Consensus 291 CgK~F~skssLk~H~rt-H~~eKpf~C~i~~CgKsF~sk~~L~~H~rtH 338 (366)
||+.|.+...|+.|++. |.+ +.| +.|++.|. .|..|.+++
T Consensus 7 Cgk~F~s~s~Lk~H~r~~H~~---~~C--~~CgKsF~---~l~~H~~~~ 47 (79)
T PHA00732 7 CGFTTVTLFALKQHARRNHTL---TKC--PVCNKSYR---RLNQHFYSQ 47 (79)
T ss_pred CCCccCCHHHHHHHhhcccCC---Ccc--CCCCCEeC---Chhhhhccc
Confidence 66666666666666553 332 356 33666665 355555433
No 21
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.62 E-value=1.9e-05 Score=78.00 Aligned_cols=54 Identities=31% Similarity=0.823 Sum_probs=41.9
Q ss_pred CCceecCCCCCCCcccChHHHHHHhhh-h------------------cCCCCeecCccccccCChhHHHHHhh
Q psy1991 311 EKPYQCGWKGCGWKFARSDELTRHFRK-H------------------TGDRPFQCTLCERAFSRSDHLSLHMK 364 (366)
Q Consensus 311 eKpf~C~i~~CgKsF~sk~~L~~H~rt-H------------------tgeKpykC~~CgKsFs~kssLk~H~K 364 (366)
+|||+|++++|+|.++....|+-|+.. | ...|||.|++|+|+|+....|+-|++
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~ 419 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRK 419 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceeccc
Confidence 467777777777777777777777652 2 13589999999999999999999975
No 22
>KOG3993|consensus
Probab=97.52 E-value=4e-05 Score=78.06 Aligned_cols=78 Identities=23% Similarity=0.384 Sum_probs=52.1
Q ss_pred eeeeecCCCCCcccCchhHHHHHHhhcCCC----------------------------------------------ceec
Q psy1991 283 MHTCSHNGCGKTYTKSSHLKAHLRTHTGEK----------------------------------------------PYQC 316 (366)
Q Consensus 283 ~H~C~~~~CgK~F~skssLk~H~rtH~~eK----------------------------------------------pf~C 316 (366)
.+.|.. |+|.|.+..-|+.|+.+|+... ...|
T Consensus 356 i~~C~~--C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~a~sael~~p 433 (500)
T KOG3993|consen 356 IFSCHT--CGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYVAGSAELELP 433 (500)
T ss_pred eeecHH--hhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeeeeccccccCC
Confidence 455666 9999999888888887764210 0123
Q ss_pred CCCCCCCcccChHHHHHHhhhhcCCCCeecCccccccCChhHHHHHhh
Q psy1991 317 GWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCERAFSRSDHLSLHMK 364 (366)
Q Consensus 317 ~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgKsFs~kssLk~H~K 364 (366)
++ |+..+.++..--.|.+.-..+.-|.|.+|...|.....|.+|+.
T Consensus 434 p~--~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin 479 (500)
T KOG3993|consen 434 PY--DGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHIN 479 (500)
T ss_pred CC--CCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhh
Confidence 22 45455555444444443334556899999999999999999964
No 23
>PHA00616 hypothetical protein
Probab=97.52 E-value=3.7e-05 Score=55.46 Aligned_cols=34 Identities=24% Similarity=0.461 Sum_probs=27.0
Q ss_pred ceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCc
Q psy1991 313 PYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTL 348 (366)
Q Consensus 313 pf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~ 348 (366)
+|+| +.||+.|..+..|.+|++.|+|++++.|+.
T Consensus 1 pYqC--~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQC--LRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCcc--chhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 5778 558888888888888888888888887764
No 24
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.48 E-value=4.4e-05 Score=46.64 Aligned_cols=23 Identities=39% Similarity=1.041 Sum_probs=15.7
Q ss_pred eecCccccccCChhHHHHHhhcC
Q psy1991 344 FQCTLCERAFSRSDHLSLHMKRH 366 (366)
Q Consensus 344 ykC~~CgKsFs~kssLk~H~KtH 366 (366)
|+|+.|++.|.++..|.+|+++|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 56777777777777777776654
No 25
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.92 E-value=0.00041 Score=41.75 Aligned_cols=23 Identities=39% Similarity=0.973 Sum_probs=14.3
Q ss_pred eecCccccccCChhHHHHHhhcC
Q psy1991 344 FQCTLCERAFSRSDHLSLHMKRH 366 (366)
Q Consensus 344 ykC~~CgKsFs~kssLk~H~KtH 366 (366)
|.|++|++.|.+...|+.|+++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 56777777777777777776653
No 26
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.88 E-value=0.00062 Score=54.30 Aligned_cols=70 Identities=21% Similarity=0.526 Sum_probs=12.5
Q ss_pred eecCCCCCcccCchhHHHHHHhhcCCCceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCccccccCChhHHHHHhh
Q psy1991 286 CSHNGCGKTYTKSSHLKAHLRTHTGEKPYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCERAFSRSDHLSLHMK 364 (366)
Q Consensus 286 C~~~~CgK~F~skssLk~H~rtH~~eKpf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgKsFs~kssLk~H~K 364 (366)
|.. |+..|.+...|..||...++-.. . ....+.....+..+.+.-. ...+.|..|++.|.....|..||+
T Consensus 2 C~~--C~~~f~~~~~l~~H~~~~H~~~~-~-----~~~~l~~~~~~~~~~~~~~-~~~~~C~~C~~~f~s~~~l~~Hm~ 71 (100)
T PF12756_consen 2 CLF--CDESFSSVDDLLQHMKKKHGFDI-P-----DQKYLVDPNRLLNYLRKKV-KESFRCPYCNKTFRSREALQEHMR 71 (100)
T ss_dssp --------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHH
T ss_pred ccc--ccccccccccccccccccccccc-c-----ccccccccccccccccccc-CCCCCCCccCCCCcCHHHHHHHHc
Confidence 444 77777777777777655332110 0 0011112222323322111 114556666666666666666654
No 27
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.87 E-value=0.0006 Score=43.14 Aligned_cols=24 Identities=38% Similarity=0.829 Sum_probs=17.4
Q ss_pred CeecCccccccCChhHHHHHhhcC
Q psy1991 343 PFQCTLCERAFSRSDHLSLHMKRH 366 (366)
Q Consensus 343 pykC~~CgKsFs~kssLk~H~KtH 366 (366)
+|.|..|++.|.....|..|++.|
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h 24 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSH 24 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTT
T ss_pred CCCCCccCCccCChhHHHHHhHHh
Confidence 467777777777777777777665
No 28
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=96.77 E-value=0.0015 Score=48.45 Aligned_cols=47 Identities=23% Similarity=0.636 Sum_probs=25.0
Q ss_pred eecCCCCCCCcccChHHHHHHhh-hhcCC-CCeecCccccccCChhHHHHHhhc
Q psy1991 314 YQCGWKGCGWKFARSDELTRHFR-KHTGD-RPFQCTLCERAFSRSDHLSLHMKR 365 (366)
Q Consensus 314 f~C~i~~CgKsF~sk~~L~~H~r-tHtge-KpykC~~CgKsFs~kssLk~H~Kt 365 (366)
|.|++ |++ ..+...|..|.. .|..+ +.+.|++|...+. .+|.+|++.
T Consensus 3 f~CP~--C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~ 51 (54)
T PF05605_consen 3 FTCPY--CGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNS 51 (54)
T ss_pred cCCCC--CCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHH
Confidence 56655 666 333455666644 34433 3466666665433 366666543
No 29
>PHA00616 hypothetical protein
Probab=96.77 E-value=0.0005 Score=49.66 Aligned_cols=33 Identities=18% Similarity=0.427 Sum_probs=29.6
Q ss_pred eeeecCCCCCcccCchhHHHHHHhhcCCCceecCC
Q psy1991 284 HTCSHNGCGKTYTKSSHLKAHLRTHTGEKPYQCGW 318 (366)
Q Consensus 284 H~C~~~~CgK~F~skssLk~H~rtH~~eKpf~C~i 318 (366)
++|.. ||+.|..+++|.+|++.|++++++.|++
T Consensus 2 YqC~~--CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 2 YQCLR--CGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred Cccch--hhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 45766 9999999999999999999999998854
No 30
>PHA00732 hypothetical protein
Probab=96.57 E-value=0.0015 Score=52.58 Aligned_cols=45 Identities=27% Similarity=0.647 Sum_probs=37.8
Q ss_pred ceecCCCCCCCcccChHHHHHHhhh-hcCCCCeecCccccccCChhHHHHHhhc
Q psy1991 313 PYQCGWKGCGWKFARSDELTRHFRK-HTGDRPFQCTLCERAFSRSDHLSLHMKR 365 (366)
Q Consensus 313 pf~C~i~~CgKsF~sk~~L~~H~rt-HtgeKpykC~~CgKsFs~kssLk~H~Kt 365 (366)
||.| ..|++.|.+...|+.|++. |.+ +.|+.||+.|. .|..|+++
T Consensus 1 py~C--~~Cgk~F~s~s~Lk~H~r~~H~~---~~C~~CgKsF~---~l~~H~~~ 46 (79)
T PHA00732 1 MFKC--PICGFTTVTLFALKQHARRNHTL---TKCPVCNKSYR---RLNQHFYS 46 (79)
T ss_pred CccC--CCCCCccCCHHHHHHHhhcccCC---CccCCCCCEeC---Chhhhhcc
Confidence 5889 5599999999999999984 653 58999999998 47888765
No 31
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=96.45 E-value=0.0021 Score=39.04 Aligned_cols=23 Identities=43% Similarity=0.993 Sum_probs=15.0
Q ss_pred eecCCCCCCCcccChHHHHHHhhhh
Q psy1991 314 YQCGWKGCGWKFARSDELTRHFRKH 338 (366)
Q Consensus 314 f~C~i~~CgKsF~sk~~L~~H~rtH 338 (366)
|.|+ .|++.|.++..|.+|++.|
T Consensus 1 y~C~--~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCP--ICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEET--TTTEEESSHHHHHHHHHHH
T ss_pred CCCC--CCCCccCCHHHHHHHHhHC
Confidence 5663 3777777777777776654
No 32
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.43 E-value=0.0011 Score=65.96 Aligned_cols=54 Identities=30% Similarity=0.617 Sum_probs=47.1
Q ss_pred cceeeeecCCCCCcccCchhHHHHHHh-hc------------------CCCceecCCCCCCCcccChHHHHHHhh
Q psy1991 281 IIMHTCSHNGCGKTYTKSSHLKAHLRT-HT------------------GEKPYQCGWKGCGWKFARSDELTRHFR 336 (366)
Q Consensus 281 ~k~H~C~~~~CgK~F~skssLk~H~rt-H~------------------~eKpf~C~i~~CgKsF~sk~~L~~H~r 336 (366)
.+.++|+.++|.|.+++...|+.|+.- |. ..|||+|++ |+|+++.-..|+-|+.
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCev--C~KRYKNlNGLKYHr~ 419 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEV--CDKRYKNLNGLKYHRK 419 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccc--cchhhccCccceeccc
Confidence 378999999999999999999999875 41 248999977 9999999999999865
No 33
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.24 E-value=0.0027 Score=38.41 Aligned_cols=23 Identities=30% Similarity=0.807 Sum_probs=15.6
Q ss_pred eecCccccccCChhHHHHHhhcC
Q psy1991 344 FQCTLCERAFSRSDHLSLHMKRH 366 (366)
Q Consensus 344 ykC~~CgKsFs~kssLk~H~KtH 366 (366)
|+|..|++.|.....|..|++.|
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHh
Confidence 45777777777777777776654
No 34
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=96.21 E-value=0.0057 Score=45.31 Aligned_cols=49 Identities=35% Similarity=0.608 Sum_probs=37.9
Q ss_pred eeeecCCCCCcccCchhHHHHHHh-hcCC-CceecCCCCCCCcccChHHHHHHhhhhc
Q psy1991 284 HTCSHNGCGKTYTKSSHLKAHLRT-HTGE-KPYQCGWKGCGWKFARSDELTRHFRKHT 339 (366)
Q Consensus 284 H~C~~~~CgK~F~skssLk~H~rt-H~~e-Kpf~C~i~~CgKsF~sk~~L~~H~rtHt 339 (366)
+.|+. |++ -.+...|..|... |..+ +.+.|++ |...+. .+|.+|++.++
T Consensus 3 f~CP~--C~~-~~~~~~L~~H~~~~H~~~~~~v~CPi--C~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPY--CGK-GFSESSLVEHCEDEHRSESKNVVCPI--CSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCC--CCC-ccCHHHHHHHHHhHCcCCCCCccCCC--chhhhh--hHHHHHHHHhc
Confidence 56777 999 4557899999776 6554 5799977 998755 49999998764
No 35
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=95.95 E-value=0.0049 Score=49.04 Aligned_cols=25 Identities=4% Similarity=-0.238 Sum_probs=0.0
Q ss_pred ccccccccccchhHhHhhhhccccC
Q psy1991 233 SHGNYNKLHIHPYYQTHHHTQIFQT 257 (366)
Q Consensus 233 iC~~CGK~F~~~~~L~hHh~~iH~~ 257 (366)
.|..|+..|.+...+..|-...|.-
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~ 25 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGF 25 (100)
T ss_dssp -------------------------
T ss_pred Ccccccccccccccccccccccccc
Confidence 3888999988888877676666653
No 36
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=95.91 E-value=0.0072 Score=36.16 Aligned_cols=23 Identities=43% Similarity=0.937 Sum_probs=13.8
Q ss_pred eecCCCCCCCcccChHHHHHHhhhh
Q psy1991 314 YQCGWKGCGWKFARSDELTRHFRKH 338 (366)
Q Consensus 314 f~C~i~~CgKsF~sk~~L~~H~rtH 338 (366)
|.|++ |++.|.+...|+.|+++|
T Consensus 1 ~~C~~--C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPI--CGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SS--TS-EESSHHHHHHHHHHH
T ss_pred CCCcC--CCCcCCcHHHHHHHHHhh
Confidence 46644 777777777777776655
No 37
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=95.19 E-value=0.015 Score=36.69 Aligned_cols=25 Identities=36% Similarity=0.850 Sum_probs=19.4
Q ss_pred ceecCCCCCCCcccChHHHHHHhhhhc
Q psy1991 313 PYQCGWKGCGWKFARSDELTRHFRKHT 339 (366)
Q Consensus 313 pf~C~i~~CgKsF~sk~~L~~H~rtHt 339 (366)
+|.| ..|++.|.+...|..|++.|.
T Consensus 1 ~~~C--~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFEC--DECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEE--TTTTEEESSHHHHHHHHCTTT
T ss_pred CCCC--CccCCccCChhHHHHHhHHhc
Confidence 4778 448888888888888887774
No 38
>PRK04860 hypothetical protein; Provisional
Probab=94.73 E-value=0.018 Score=52.20 Aligned_cols=38 Identities=29% Similarity=0.673 Sum_probs=26.7
Q ss_pred ceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCccccccCCh
Q psy1991 313 PYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCERAFSRS 356 (366)
Q Consensus 313 pf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgKsFs~k 356 (366)
+|.| . |++ ....+++|.++|+++++|.|..|++.|...
T Consensus 119 ~Y~C--~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 119 PYRC--K-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEc--C-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEe
Confidence 5777 4 766 566677777777777777777777776544
No 39
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=94.57 E-value=0.017 Score=35.72 Aligned_cols=22 Identities=32% Similarity=0.916 Sum_probs=14.3
Q ss_pred eecCccccccCChhHHHHHhhc
Q psy1991 344 FQCTLCERAFSRSDHLSLHMKR 365 (366)
Q Consensus 344 ykC~~CgKsFs~kssLk~H~Kt 365 (366)
|.|.+|++.|.....|..|++.
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 4566666666666666666653
No 40
>KOG2231|consensus
Probab=94.37 E-value=0.033 Score=60.40 Aligned_cols=67 Identities=30% Similarity=0.676 Sum_probs=42.4
Q ss_pred eecCCCCCcccCchhHHHHHHhhcCCCceecCCCCCC------CcccChHHHHHHhhhhcCCCCeecC--ccc-cccCCh
Q psy1991 286 CSHNGCGKTYTKSSHLKAHLRTHTGEKPYQCGWKGCG------WKFARSDELTRHFRKHTGDRPFQCT--LCE-RAFSRS 356 (366)
Q Consensus 286 C~~~~CgK~F~skssLk~H~rtH~~eKpf~C~i~~Cg------KsF~sk~~L~~H~rtHtgeKpykC~--~Cg-KsFs~k 356 (366)
|.. |...|-....|.+|++.++ |.|.+ |+ .-|.....|..|-|.++ |.|+ .|- +.|...
T Consensus 185 C~~--C~~~fld~~el~rH~~~~h----~~chf--C~~~~~~neyy~~~~dLe~HfR~~H----flCE~~~C~~~~f~~~ 252 (669)
T KOG2231|consen 185 CKF--CHERFLDDDELYRHLRFDH----EFCHF--CDYKTGQNEYYNDYDDLEEHFRKGH----FLCEEEFCRTKKFYVA 252 (669)
T ss_pred chh--hhhhhccHHHHHHhhccce----eheee--cCcccccchhcccchHHHHHhhhcC----ccccccccccceeeeh
Confidence 555 8888888888888877643 44544 43 34666777888877544 6776 563 445544
Q ss_pred hHHHHHhh
Q psy1991 357 DHLSLHMK 364 (366)
Q Consensus 357 ssLk~H~K 364 (366)
..+..|++
T Consensus 253 ~~~ei~lk 260 (669)
T KOG2231|consen 253 FELEIELK 260 (669)
T ss_pred hHHHHHHH
Confidence 45555544
No 41
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=93.60 E-value=0.043 Score=41.07 Aligned_cols=40 Identities=25% Similarity=0.494 Sum_probs=21.2
Q ss_pred HHHHHh-hcCCCceecCCCCCCCcccChHHHHHHhhhhcCCCC
Q psy1991 302 KAHLRT-HTGEKPYQCGWKGCGWKFARSDELTRHFRKHTGDRP 343 (366)
Q Consensus 302 k~H~rt-H~~eKpf~C~i~~CgKsF~sk~~L~~H~rtHtgeKp 343 (366)
..+.+. +..+.|-.|++ |+..+.+..+|++|+..+++.||
T Consensus 12 ~~~~k~~~~S~~PatCP~--C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 12 TKKPKSKSQSEQPATCPI--CGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp ----CCCCTTS--EE-TT--T--EESSHHHHHHHHHHHTTTS-
T ss_pred hhHHHHhhccCCCCCCCc--chhhccchhhHHHHHHHHhcccC
Confidence 334433 34567788855 88888888888888877666665
No 42
>smart00355 ZnF_C2H2 zinc finger.
Probab=93.36 E-value=0.07 Score=31.94 Aligned_cols=24 Identities=38% Similarity=0.814 Sum_probs=15.2
Q ss_pred eecCCCCCCCcccChHHHHHHhhhhc
Q psy1991 314 YQCGWKGCGWKFARSDELTRHFRKHT 339 (366)
Q Consensus 314 f~C~i~~CgKsF~sk~~L~~H~rtHt 339 (366)
|.| ..|++.|.....|..|++.|.
T Consensus 1 ~~C--~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRC--PECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCC--CCCcchhCCHHHHHHHHHHhc
Confidence 356 337777777777777766553
No 43
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=92.84 E-value=0.054 Score=52.34 Aligned_cols=55 Identities=20% Similarity=0.140 Sum_probs=37.0
Q ss_pred CcccccccccccchhHhHhhhhc-cccCCC-CccccC--CCCCccccchhhhcccceee
Q psy1991 231 SSSHGNYNKLHIHPYYQTHHHTQ-IFQTPS-ITIKSS--LLSKPKRRQNWSRRKIIMHT 285 (366)
Q Consensus 231 P~iC~~CGK~F~~~~~L~hHh~~-iH~~p~-~~~kC~--~Csk~f~~k~~l~~H~k~H~ 285 (366)
+..|..|.+.|.....+..|... .|.... ..+.|. .|++.+.+...+.+|...|.
T Consensus 289 ~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 347 (467)
T COG5048 289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHT 347 (467)
T ss_pred CCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCccccc
Confidence 57788888888888777755543 555442 455666 67777777777777765543
No 44
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=92.52 E-value=0.052 Score=34.50 Aligned_cols=22 Identities=41% Similarity=0.889 Sum_probs=15.7
Q ss_pred eecCccccccCChhHHHHHhhc
Q psy1991 344 FQCTLCERAFSRSDHLSLHMKR 365 (366)
Q Consensus 344 ykC~~CgKsFs~kssLk~H~Kt 365 (366)
|.|..|++.|.....|..|+++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 5677777777777777777654
No 45
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=91.77 E-value=0.067 Score=40.05 Aligned_cols=34 Identities=21% Similarity=0.527 Sum_probs=20.6
Q ss_pred HHHhhh-hcCCCCeecCccccccCChhHHHHHhhc
Q psy1991 332 TRHFRK-HTGDRPFQCTLCERAFSRSDHLSLHMKR 365 (366)
Q Consensus 332 ~~H~rt-HtgeKpykC~~CgKsFs~kssLk~H~Kt 365 (366)
..+.+. +..+.|-.|++|+..+.+..+|++|+.+
T Consensus 12 ~~~~k~~~~S~~PatCP~C~a~~~~srnLrRHle~ 46 (54)
T PF09237_consen 12 TKKPKSKSQSEQPATCPICGAVIRQSRNLRRHLEI 46 (54)
T ss_dssp ----CCCCTTS--EE-TTT--EESSHHHHHHHHHH
T ss_pred hhHHHHhhccCCCCCCCcchhhccchhhHHHHHHH
Confidence 344433 4457889999999999999999999853
No 46
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=91.40 E-value=0.11 Score=32.00 Aligned_cols=21 Identities=33% Similarity=0.757 Sum_probs=11.5
Q ss_pred eecCccccccCChhHHHHHhhc
Q psy1991 344 FQCTLCERAFSRSDHLSLHMKR 365 (366)
Q Consensus 344 ykC~~CgKsFs~kssLk~H~Kt 365 (366)
|+|+.|+-... +..|.+|+++
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~ 21 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKR 21 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHh
Confidence 45666666555 6666666654
No 47
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=90.96 E-value=0.17 Score=31.10 Aligned_cols=23 Identities=26% Similarity=0.738 Sum_probs=17.3
Q ss_pred eecCCCCCCCcccChHHHHHHhhhh
Q psy1991 314 YQCGWKGCGWKFARSDELTRHFRKH 338 (366)
Q Consensus 314 f~C~i~~CgKsF~sk~~L~~H~rtH 338 (366)
|.|.+ |++.|.+...|+.|++.+
T Consensus 1 ~~C~~--C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDI--CNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETT--TTEEESSHHHHHHHHTTH
T ss_pred CCCCC--CCCCcCCHHHHHHHHCcC
Confidence 56755 888888888888887643
No 48
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=89.54 E-value=0.24 Score=31.44 Aligned_cols=20 Identities=40% Similarity=0.860 Sum_probs=13.2
Q ss_pred eecCccccccCChhHHHHHhh
Q psy1991 344 FQCTLCERAFSRSDHLSLHMK 364 (366)
Q Consensus 344 ykC~~CgKsFs~kssLk~H~K 364 (366)
..|+.||++| ..+.|.+|++
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHH
Confidence 3577777777 5566667764
No 49
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=88.84 E-value=0.21 Score=48.31 Aligned_cols=132 Identities=23% Similarity=0.262 Sum_probs=89.0
Q ss_pred CCcccc--cccccccchhHhHhhhhccccCCCCccccCCCCCccccchhhh-----------cccceeeeecCCCCCccc
Q psy1991 230 NSSSHG--NYNKLHIHPYYQTHHHTQIFQTPSITIKSSLLSKPKRRQNWSR-----------RKIIMHTCSHNGCGKTYT 296 (366)
Q Consensus 230 ~P~iC~--~CGK~F~~~~~L~hHh~~iH~~p~~~~kC~~Csk~f~~k~~l~-----------~H~k~H~C~~~~CgK~F~ 296 (366)
.|+.|+ .|++.|.....+..|.............-..+.+.+....... .......+....|.+.+.
T Consensus 320 ~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 399 (467)
T COG5048 320 KPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFK 399 (467)
T ss_pred CceeeeccCCCccccccccccCCcccccCCCccccccccCccccccccCCCCccchhhccCccCCccccccccchhhhhc
Confidence 688999 8999999998887665554444433333333333333222211 111233366667999999
Q ss_pred CchhHHHHHHhhcCCCceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCccccccCChhHHHHH
Q psy1991 297 KSSHLKAHLRTHTGEKPYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCERAFSRSDHLSLH 362 (366)
Q Consensus 297 skssLk~H~rtH~~eKpf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgKsFs~kssLk~H 362 (366)
+...+..|...|...+...|....|.+.|.....+..|++.|....++.|..+ +.|.....+..|
T Consensus 400 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 464 (467)
T COG5048 400 RDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPLLCSIL-KSFRRDLDLSNH 464 (467)
T ss_pred cccccccccccccccCCcCCCCCcchhhccCcccccccccccccCCceeeccc-cccchhhhhhcc
Confidence 99999999988888776665557799999999999999999987777766644 444444444443
No 50
>PRK04860 hypothetical protein; Provisional
Probab=87.18 E-value=0.54 Score=42.69 Aligned_cols=40 Identities=23% Similarity=0.516 Sum_probs=31.3
Q ss_pred eeeecCCCCCcccCchhHHHHHHhhcCCCceecCCCCCCCcccChH
Q psy1991 284 HTCSHNGCGKTYTKSSHLKAHLRTHTGEKPYQCGWKGCGWKFARSD 329 (366)
Q Consensus 284 H~C~~~~CgK~F~skssLk~H~rtH~~eKpf~C~i~~CgKsF~sk~ 329 (366)
|...|. |++ ....+++|.++|.++++|.| ..|++.|....
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC--~~C~~~l~~~~ 157 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRC--RRCGETLVFKG 157 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEEC--CCCCceeEEec
Confidence 333343 776 66789999999999999999 66999887543
No 51
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=86.71 E-value=0.59 Score=28.60 Aligned_cols=23 Identities=30% Similarity=0.910 Sum_probs=14.4
Q ss_pred eecCCCCCCCcccChHHHHHHhhhhc
Q psy1991 314 YQCGWKGCGWKFARSDELTRHFRKHT 339 (366)
Q Consensus 314 f~C~i~~CgKsF~sk~~L~~H~rtHt 339 (366)
|+|.. |+.... +..|.+|++.|+
T Consensus 1 y~C~~--C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPH--CSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SS--SS-EES-HHHHHHHHHHHH
T ss_pred CCCCC--CCCcCC-HHHHHHHHHhhC
Confidence 56744 777776 777888877654
No 52
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=86.27 E-value=0.56 Score=30.92 Aligned_cols=22 Identities=32% Similarity=0.720 Sum_probs=15.5
Q ss_pred CeecCccccccCChhHHHHHhh
Q psy1991 343 PFQCTLCERAFSRSDHLSLHMK 364 (366)
Q Consensus 343 pykC~~CgKsFs~kssLk~H~K 364 (366)
+|.|+.|++.|.....+..|++
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 4667777777777777777764
No 53
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=84.80 E-value=0.39 Score=30.41 Aligned_cols=22 Identities=23% Similarity=0.688 Sum_probs=17.6
Q ss_pred eecCCCCCCCcccChHHHHHHhhh
Q psy1991 314 YQCGWKGCGWKFARSDELTRHFRK 337 (366)
Q Consensus 314 f~C~i~~CgKsF~sk~~L~~H~rt 337 (366)
|.|.+ |++.|.+...|..|++.
T Consensus 2 ~~C~~--C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDA--CDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTT--TTBBBSSHHHHHCCTTS
T ss_pred CCccc--CCCCcCCHHHHHHHHcc
Confidence 67854 88888888888888774
No 54
>KOG1146|consensus
Probab=83.20 E-value=0.42 Score=55.31 Aligned_cols=72 Identities=22% Similarity=0.559 Sum_probs=51.9
Q ss_pred CCCcccCchhHHHHHHh-hcCCCceecCCCCCCCcccChHHHHHHhhh-hc------------------------CCCCe
Q psy1991 291 CGKTYTKSSHLKAHLRT-HTGEKPYQCGWKGCGWKFARSDELTRHFRK-HT------------------------GDRPF 344 (366)
Q Consensus 291 CgK~F~skssLk~H~rt-H~~eKpf~C~i~~CgKsF~sk~~L~~H~rt-Ht------------------------geKpy 344 (366)
|+..+.++..+..|+.. |.-.+.++| +.|+..|+....|..|||. |. +.++|
T Consensus 442 ~e~~~~s~r~~~~~t~~L~S~~kt~~c--pkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~ 519 (1406)
T KOG1146|consen 442 AEPLLESKRSLEGQTVVLHSFFKTLKC--PKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPY 519 (1406)
T ss_pred hhhhhhhhcccccceeeeecccccccC--CccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCcc
Confidence 55555566666666554 555567777 4477777777777777775 21 24679
Q ss_pred ecCccccccCChhHHHHHhh
Q psy1991 345 QCTLCERAFSRSDHLSLHMK 364 (366)
Q Consensus 345 kC~~CgKsFs~kssLk~H~K 364 (366)
.|..|...+..+.+|.+|++
T Consensus 520 ~C~~C~~stttng~Lsihlq 539 (1406)
T KOG1146|consen 520 PCRACNYSTTTNGNLSIHLQ 539 (1406)
T ss_pred cceeeeeeeecchHHHHHHH
Confidence 99999999999999999986
No 55
>KOG2482|consensus
Probab=82.60 E-value=1.4 Score=44.80 Aligned_cols=49 Identities=22% Similarity=0.431 Sum_probs=36.2
Q ss_pred eecCCCCCCCcccChHHHHHHhhh-hcC--------------------------CCCeecCccccccCChhHHHHHhh
Q psy1991 314 YQCGWKGCGWKFARSDELTRHFRK-HTG--------------------------DRPFQCTLCERAFSRSDHLSLHMK 364 (366)
Q Consensus 314 f~C~i~~CgKsF~sk~~L~~H~rt-Htg--------------------------eKpykC~~CgKsFs~kssLk~H~K 364 (366)
.+|-+ |.....+...|..||.. |.- .+.-.|-.|.-.|.....|..||.
T Consensus 280 v~CLf--C~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~ 355 (423)
T KOG2482|consen 280 VVCLF--CTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMV 355 (423)
T ss_pred eEEEe--eccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhcc
Confidence 47866 88888888899999874 420 112357778889999999999875
No 56
>KOG2231|consensus
Probab=77.44 E-value=2.5 Score=46.30 Aligned_cols=113 Identities=19% Similarity=0.287 Sum_probs=75.8
Q ss_pred cccccccccccchhHhHhhhhccccCCCCccccCCCCCccccchhhhcccce-e---eeecCC-------CCCcccCchh
Q psy1991 232 SSHGNYNKLHIHPYYQTHHHTQIFQTPSITIKSSLLSKPKRRQNWSRRKIIM-H---TCSHNG-------CGKTYTKSSH 300 (366)
Q Consensus 232 ~iC~~CGK~F~~~~~L~hHh~~iH~~p~~~~kC~~Csk~f~~k~~l~~H~k~-H---~C~~~~-------CgK~F~skss 300 (366)
+.|.+|++.|..... ...|..| ..|+....|+.|+.. | .|..-. |....-+...
T Consensus 100 ~~C~~C~~~~~~~~~--------------~~~~~~c-~~~~s~~~Lk~H~~~~H~~~~c~lC~~~~kif~~e~k~Yt~~e 164 (669)
T KOG2231|consen 100 HSCHICDRRFRALYN--------------KKECLHC-TEFKSVENLKNHMRDQHKLHLCSLCLQNLKIFINERKLYTRAE 164 (669)
T ss_pred hhcCccccchhhhcc--------------cCCCccc-cchhHHHHHHHHHHHhhhhhccccccccceeeeeeeehehHHH
Confidence 568888877654321 3356677 777888888888832 2 233211 2333445667
Q ss_pred HHHHHHhhcC-CC----ceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCccc------cccCChhHHHHHhhc
Q psy1991 301 LKAHLRTHTG-EK----PYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCE------RAFSRSDHLSLHMKR 365 (366)
Q Consensus 301 Lk~H~rtH~~-eK----pf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~Cg------KsFs~kssLk~H~Kt 365 (366)
|..|++.-.. ++ .-.|.+ |...|.....|.+|++.++ |.|..|+ --|.....|..|-|.
T Consensus 165 l~~h~~~gd~d~~s~rGhp~C~~--C~~~fld~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~ 234 (669)
T KOG2231|consen 165 LNLHLMFGDPDDESCRGHPLCKF--CHERFLDDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRK 234 (669)
T ss_pred HHHHHhcCCCccccccCCccchh--hhhhhccHHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhh
Confidence 8888775322 22 246855 9999999999999999765 5677774 458888899999764
No 57
>KOG4173|consensus
Probab=73.47 E-value=1.5 Score=41.77 Aligned_cols=48 Identities=35% Similarity=0.728 Sum_probs=28.3
Q ss_pred eecCCCCCcccCchhHHHHHHh-h---------cCCCceecCCCCCCCcccChHHHHHHh
Q psy1991 286 CSHNGCGKTYTKSSHLKAHLRT-H---------TGEKPYQCGWKGCGWKFARSDELTRHF 335 (366)
Q Consensus 286 C~~~~CgK~F~skssLk~H~rt-H---------~~eKpf~C~i~~CgKsF~sk~~L~~H~ 335 (366)
|.. |.+.|.+...|..|+.. | .|..-|.|-+++|+-.|.+...-+.|+
T Consensus 109 Cs~--C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~KFkT~r~RkdH~ 166 (253)
T KOG4173|consen 109 CSF--CKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEKFKTSRDRKDHM 166 (253)
T ss_pred hHH--HHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhhhhhhhhhhhHH
Confidence 555 66666666666666542 3 233446666666666666666666664
No 58
>KOG2893|consensus
Probab=71.82 E-value=1.3 Score=43.14 Aligned_cols=46 Identities=24% Similarity=0.547 Sum_probs=36.7
Q ss_pred eecCCCCCcccCchhHHHHHHhhcCCCceecCCCCCCCcccChHHHHHH-hhhhc
Q psy1991 286 CSHNGCGKTYTKSSHLKAHLRTHTGEKPYQCGWKGCGWKFARSDELTRH-FRKHT 339 (366)
Q Consensus 286 C~~~~CgK~F~skssLk~H~rtH~~eKpf~C~i~~CgKsF~sk~~L~~H-~rtHt 339 (366)
|=+ |++.|....-|.+|++. |.|+|.| |-|...+--.|..| |.+|.
T Consensus 13 cwy--cnrefddekiliqhqka----khfkchi--chkkl~sgpglsihcmqvhk 59 (341)
T KOG2893|consen 13 CWY--CNREFDDEKILIQHQKA----KHFKCHI--CHKKLFSGPGLSIHCMQVHK 59 (341)
T ss_pred eee--cccccchhhhhhhhhhh----ccceeee--ehhhhccCCCceeehhhhhh
Confidence 666 99999999999988765 3489988 99887777778777 56664
No 59
>KOG4173|consensus
Probab=70.27 E-value=1.5 Score=41.78 Aligned_cols=77 Identities=26% Similarity=0.569 Sum_probs=61.0
Q ss_pred ceeeeecCCCCCcccCchhHHHHHHhhcCCCceecCCCCCCCcccChHHHHHHhhh-h---------cCCCCeec--Ccc
Q psy1991 282 IMHTCSHNGCGKTYTKSSHLKAHLRTHTGEKPYQCGWKGCGWKFARSDELTRHFRK-H---------TGDRPFQC--TLC 349 (366)
Q Consensus 282 k~H~C~~~~CgK~F~skssLk~H~rtH~~eKpf~C~i~~CgKsF~sk~~L~~H~rt-H---------tgeKpykC--~~C 349 (366)
..+.|...+|.+.|........|-.+-++. .|.+ |.+.|.+...|..|+.- | .|.--|+| +-|
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~~---sCs~--C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgC 152 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHGN---SCSF--CKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGC 152 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhcccc---hhHH--HHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhh
Confidence 345688889999999988888887763332 5965 99999999999999764 3 23334899 459
Q ss_pred ccccCChhHHHHHh
Q psy1991 350 ERAFSRSDHLSLHM 363 (366)
Q Consensus 350 gKsFs~kssLk~H~ 363 (366)
+-+|++...-+.|+
T Consensus 153 t~KFkT~r~RkdH~ 166 (253)
T KOG4173|consen 153 TEKFKTSRDRKDHM 166 (253)
T ss_pred hhhhhhhhhhhhHH
Confidence 99999999999996
No 60
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=67.54 E-value=4.8 Score=26.34 Aligned_cols=23 Identities=17% Similarity=0.515 Sum_probs=19.3
Q ss_pred ceecCCCCCCCcccChHHHHHHhhh
Q psy1991 313 PYQCGWKGCGWKFARSDELTRHFRK 337 (366)
Q Consensus 313 pf~C~i~~CgKsF~sk~~L~~H~rt 337 (366)
+|.|.+ |++.|.....+..|++.
T Consensus 3 ~~~C~~--C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKL--CNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccc--cCCccCCHHHHHHHHCh
Confidence 578966 99999999999999763
No 61
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=66.93 E-value=3.4 Score=42.14 Aligned_cols=67 Identities=28% Similarity=0.656 Sum_probs=49.7
Q ss_pred eecCCCCCcccCchhHHHHHHhhcCCCceecCCCCCCCc-------ccChHHHHHHhhhhcCCCCeecCc--c--cc--c
Q psy1991 286 CSHNGCGKTYTKSSHLKAHLRTHTGEKPYQCGWKGCGWK-------FARSDELTRHFRKHTGDRPFQCTL--C--ER--A 352 (366)
Q Consensus 286 C~~~~CgK~F~skssLk~H~rtH~~eKpf~C~i~~CgKs-------F~sk~~L~~H~rtHtgeKpykC~~--C--gK--s 352 (366)
|.. |.+.|-....|.+|+|..+ + .|.| |++. |....+|.+|-+.- -|.|.+ | || .
T Consensus 223 C~F--C~~~FYdDDEL~~HcR~~H-E---~ChI--CD~v~p~~~QYFK~Y~~Le~HF~~~----hy~ct~qtc~~~k~~v 290 (493)
T COG5236 223 CIF--CKIYFYDDDELRRHCRLRH-E---ACHI--CDMVGPIRYQYFKSYEDLEAHFRNA----HYCCTFQTCRVGKCYV 290 (493)
T ss_pred hhh--ccceecChHHHHHHHHhhh-h---hhhh--hhccCccchhhhhCHHHHHHHhhcC----ceEEEEEEEecCcEEE
Confidence 888 9999999999999998733 2 4666 6653 78888898887642 277764 4 33 5
Q ss_pred cCChhHHHHHhh
Q psy1991 353 FSRSDHLSLHMK 364 (366)
Q Consensus 353 Fs~kssLk~H~K 364 (366)
|.....|..|+.
T Consensus 291 f~~~~el~~h~~ 302 (493)
T COG5236 291 FPYHTELLEHLT 302 (493)
T ss_pred eccHHHHHHHHH
Confidence 888888888864
No 62
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=65.59 E-value=5.3 Score=27.84 Aligned_cols=23 Identities=22% Similarity=0.592 Sum_probs=11.0
Q ss_pred CCCeecCccccccCCh----hHHHHHh
Q psy1991 341 DRPFQCTLCERAFSRS----DHLSLHM 363 (366)
Q Consensus 341 eKpykC~~CgKsFs~k----ssLk~H~ 363 (366)
+...+|..|++.++.. +.|.+|+
T Consensus 14 ~~~a~C~~C~~~~~~~~~~ts~l~~HL 40 (45)
T PF02892_consen 14 KKKAKCKYCGKVIKYSSGGTSNLKRHL 40 (45)
T ss_dssp SS-EEETTTTEE-----SSTHHHHHHH
T ss_pred cCeEEeCCCCeEEeeCCCcHHHHHHhh
Confidence 3445666666665553 5666666
No 63
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=60.23 E-value=4.4 Score=41.40 Aligned_cols=23 Identities=22% Similarity=0.551 Sum_probs=12.6
Q ss_pred eeecCCCCCcccCchhHHHHHHh
Q psy1991 285 TCSHNGCGKTYTKSSHLKAHLRT 307 (366)
Q Consensus 285 ~C~~~~CgK~F~skssLk~H~rt 307 (366)
.|+...|..+......|+.|.++
T Consensus 153 ~CP~skc~~~C~~~k~lk~H~K~ 175 (493)
T COG5236 153 KCPKSKCHRRCGSLKELKKHYKA 175 (493)
T ss_pred cCCchhhhhhhhhHHHHHHHHHh
Confidence 35555555555555566666555
No 64
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=60.15 E-value=5.6 Score=28.78 Aligned_cols=20 Identities=35% Similarity=0.866 Sum_probs=11.2
Q ss_pred ecCccccccCCh-----hHHHHHhh
Q psy1991 345 QCTLCERAFSRS-----DHLSLHMK 364 (366)
Q Consensus 345 kC~~CgKsFs~k-----ssLk~H~K 364 (366)
.|..|++.++.. +.|.+|++
T Consensus 20 ~C~~C~~~l~~~~~~gTs~L~rHl~ 44 (50)
T smart00614 20 KCKYCGKKLSRSSKGGTSNLRRHLR 44 (50)
T ss_pred EecCCCCEeeeCCCCCcHHHHHHHH
Confidence 455566555444 36666655
No 65
>KOG1146|consensus
Probab=59.21 E-value=2 Score=50.05 Aligned_cols=72 Identities=11% Similarity=0.009 Sum_probs=45.6
Q ss_pred cccccccccccchhHhHhhhh-c-cccCCCCccccCCCCCccccchhhhccc---ceeeeecCCCCCcccCchhHHHHHH
Q psy1991 232 SSHGNYNKLHIHPYYQTHHHT-Q-IFQTPSITIKSSLLSKPKRRQNWSRRKI---IMHTCSHNGCGKTYTKSSHLKAHLR 306 (366)
Q Consensus 232 ~iC~~CGK~F~~~~~L~hHh~-~-iH~~p~~~~kC~~Csk~f~~k~~l~~H~---k~H~C~~~~CgK~F~skssLk~H~r 306 (366)
..|..|-..|.+...+..+-. . ..........|..|.+.++....+. ++ ..|.|.. |...|..+..|..|.+
T Consensus 1229 l~~~~~e~~f~~~~~~~~~a~~~~~~~~~sGe~~c~~~~~~~~~~~~~~-~l~~~~~~~~~~--~~~~~~~~~~l~~~~~ 1305 (1406)
T KOG1146|consen 1229 LLPNALEQPFPQEPEPTATAPPKPPELPASGEGECGAVDELLTPSFGIS-TLDVTHRYLCRQ--CKMAFDGEAPLTAHQR 1305 (1406)
T ss_pred ccHHhhhcCccCcccccccCCCCCCcCcCCCcchhhhccccccCcccee-ecccchhHHHHH--HHhhhcchhHHHHHHH
Confidence 568888888888766554321 1 1222234566777777777665555 43 3344666 8888888888888873
No 66
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=57.78 E-value=4.6 Score=31.02 Aligned_cols=25 Identities=24% Similarity=0.583 Sum_probs=17.3
Q ss_pred cCCCCeecCccccccCChhHHHHHh
Q psy1991 339 TGDRPFQCTLCERAFSRSDHLSLHM 363 (366)
Q Consensus 339 tgeKpykC~~CgKsFs~kssLk~H~ 363 (366)
.||.-+.|+.||+.|..+.+..+|.
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHV 37 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHV 37 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHh
Confidence 4556677777777777777777764
No 67
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=57.69 E-value=22 Score=30.68 Aligned_cols=96 Identities=15% Similarity=0.120 Sum_probs=56.2
Q ss_pred CcccccccccccchhHhHhhhhccccCCCCccccCCCCCccccchhhhcccceeeeecCCCCCcccCchhHHHHHHhhcC
Q psy1991 231 SSSHGNYNKLHIHPYYQTHHHTQIFQTPSITIKSSLLSKPKRRQNWSRRKIIMHTCSHNGCGKTYTKSSHLKAHLRTHTG 310 (366)
Q Consensus 231 P~iC~~CGK~F~~~~~L~hHh~~iH~~p~~~~kC~~Csk~f~~k~~l~~H~k~H~C~~~~CgK~F~skssLk~H~rtH~~ 310 (366)
|..|+.||-+......|.+.- .|.-|...+.--.-. +......|.- |++.|........- .-..
T Consensus 15 P~~CpiCgLtLVss~HLARSy--HHLfPl~~f~ev~~~----------~~~~~~~C~~--C~~~f~~~~~~~~~--~~~~ 78 (112)
T TIGR00622 15 PVECPICGLTLILSTHLARSY--HHLFPLKAFQEIPLE----------EYNGSRFCFG--CQGPFPKPPVSPFD--ELKD 78 (112)
T ss_pred CCcCCcCCCEEeccchHHHhh--hccCCCccccccccc----------ccCCCCcccC--cCCCCCCccccccc--cccc
Confidence 678999999988888877532 222232222211000 0111123655 99988765321100 0123
Q ss_pred CCceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCccc
Q psy1991 311 EKPYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCE 350 (366)
Q Consensus 311 eKpf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~Cg 350 (366)
...|+| +.|...|-..-+.-.|...|. |+-|.
T Consensus 79 ~~~y~C--~~C~~~FC~dCD~fiHe~Lh~------CPGC~ 110 (112)
T TIGR00622 79 SHRYVC--AVCKNVFCVDCDVFVHESLHC------CPGCI 110 (112)
T ss_pred ccceeC--CCCCCccccccchhhhhhccC------CcCCC
Confidence 456999 669999999999989887774 66665
No 68
>KOG4124|consensus
Probab=55.52 E-value=2.3 Score=43.27 Aligned_cols=53 Identities=25% Similarity=0.701 Sum_probs=42.2
Q ss_pred CCceecCCCCCCCcccChHHHHHHhhh-h--------------c----CCCCeecCccccccCChhHHHHHh
Q psy1991 311 EKPYQCGWKGCGWKFARSDELTRHFRK-H--------------T----GDRPFQCTLCERAFSRSDHLSLHM 363 (366)
Q Consensus 311 eKpf~C~i~~CgKsF~sk~~L~~H~rt-H--------------t----geKpykC~~CgKsFs~kssLk~H~ 363 (366)
.++++|+++.|++.++....|+.|..+ | + ..|+|+|++|.++++....|+-|+
T Consensus 347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~~~~ 418 (442)
T KOG4124|consen 347 DKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVVENKPYRCEVCSKRYKNLNGLKYHR 418 (442)
T ss_pred cCCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeeeccCcccChhhhhhhccCCCCCcee
Confidence 468999999999999998888888642 3 1 257899999999998877776653
No 69
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=54.81 E-value=8.9 Score=25.63 Aligned_cols=24 Identities=29% Similarity=1.013 Sum_probs=15.5
Q ss_pred eecCCCCCCCcccChHHHHHHhhhhcCCCCeecCcccc
Q psy1991 314 YQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCER 351 (366)
Q Consensus 314 f~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgK 351 (366)
|+|.+ ||..+.... .++.|++||.
T Consensus 2 ~~C~~--CGy~y~~~~------------~~~~CP~Cg~ 25 (33)
T cd00350 2 YVCPV--CGYIYDGEE------------APWVCPVCGA 25 (33)
T ss_pred EECCC--CCCEECCCc------------CCCcCcCCCC
Confidence 67744 876654432 5678888875
No 70
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=43.81 E-value=23 Score=31.83 Aligned_cols=34 Identities=18% Similarity=0.449 Sum_probs=26.9
Q ss_pred cCCCceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCcccccc
Q psy1991 309 TGEKPYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCERAF 353 (366)
Q Consensus 309 ~~eKpf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgKsF 353 (366)
.+..-|.| +.|+.+|+.-.++. .-|.|+.||...
T Consensus 105 ~~~~~Y~C--p~c~~r~tf~eA~~---------~~F~Cp~Cg~~L 138 (158)
T TIGR00373 105 TNNMFFIC--PNMCVRFTFNEAME---------LNFTCPRCGAML 138 (158)
T ss_pred cCCCeEEC--CCCCcEeeHHHHHH---------cCCcCCCCCCEe
Confidence 34566999 66999999988885 259999999763
No 71
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=41.98 E-value=26 Score=30.93 Aligned_cols=38 Identities=21% Similarity=0.543 Sum_probs=25.7
Q ss_pred CCCceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCcccccc
Q psy1991 310 GEKPYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCERAF 353 (366)
Q Consensus 310 ~eKpf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgKsF 353 (366)
+..-|.| +.|++.|.....+.. . +. +..|.|+.||...
T Consensus 96 ~~~~Y~C--p~C~~~y~~~ea~~~-~--d~-~~~f~Cp~Cg~~l 133 (147)
T smart00531 96 NNAYYKC--PNCQSKYTFLEANQL-L--DM-DGTFTCPRCGEEL 133 (147)
T ss_pred CCcEEEC--cCCCCEeeHHHHHHh-c--CC-CCcEECCCCCCEE
Confidence 4456999 669999986655433 1 11 3349999999864
No 72
>KOG2575|consensus
Probab=39.37 E-value=10 Score=39.72 Aligned_cols=37 Identities=49% Similarity=0.886 Sum_probs=28.4
Q ss_pred hcCCC-CceEEeecchhhhhhhhccCccCCCCCCccceeeEeeccCCC
Q psy1991 64 MSGEG-PPVFIDYTAASDLVQDIIPNVESSTSNTPVSNYYIDFNTSNN 110 (366)
Q Consensus 64 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 110 (366)
-||+| ||.|=||.|-.-- |+| |-|.||+.+| ||.++|
T Consensus 52 YSG~~~PPmyGDyEAQRHW-mEI-------T~nLPv~qWY--~n~t~N 89 (510)
T KOG2575|consen 52 YSGAGSPPMYGDYEAQRHW-MEI-------TVNLPVSQWY--FNGTHN 89 (510)
T ss_pred CCCCCCCCCCCCHHHHHHH-HHH-------hhcCcHHHHh--hcCCCC
Confidence 57887 8999999886654 444 7899999999 565555
No 73
>KOG4167|consensus
Probab=38.37 E-value=6.7 Score=43.48 Aligned_cols=24 Identities=33% Similarity=0.672 Sum_probs=22.1
Q ss_pred CeecCccccccCChhHHHHHhhcC
Q psy1991 343 PFQCTLCERAFSRSDHLSLHMKRH 366 (366)
Q Consensus 343 pykC~~CgKsFs~kssLk~H~KtH 366 (366)
-|.|..|+|.|-.-.++..|||+|
T Consensus 792 iFpCreC~kvF~KiKSrNAHMK~H 815 (907)
T KOG4167|consen 792 IFPCRECGKVFFKIKSRNAHMKTH 815 (907)
T ss_pred eeehHHHHHHHHHHhhhhHHHHHH
Confidence 399999999999999999999987
No 74
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=37.61 E-value=21 Score=26.11 Aligned_cols=44 Identities=27% Similarity=0.622 Sum_probs=20.9
Q ss_pred CCceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCc----cccccCC
Q psy1991 311 EKPYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTL----CERAFSR 355 (366)
Q Consensus 311 eKpf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~----CgKsFs~ 355 (366)
..+..|+. .|+..-..+..|..|....-..++..|.. |+..+.+
T Consensus 7 ~~~v~C~~-~cc~~~i~r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~~ 54 (60)
T PF02176_consen 7 FRPVPCPN-GCCNEMIPRKELDDHLENECPKRPVPCPYSPYGCKERVPR 54 (60)
T ss_dssp TSEEE-TT---S-BEEECCCHHHHHHTTSTTSEEE-SS----S--EEEH
T ss_pred CCEeeCCC-CCcccceeHHHHHHHHHccCCCCcEECCCCCCCCCCccch
Confidence 34556642 24433333556777777555556667777 7766544
No 75
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=36.78 E-value=27 Score=23.51 Aligned_cols=34 Identities=21% Similarity=0.464 Sum_probs=18.8
Q ss_pred eecCCCCCCCcccChHHHHHHhhhhcCCCCeecCccccccC
Q psy1991 314 YQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCERAFS 354 (366)
Q Consensus 314 f~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgKsFs 354 (366)
+.| +.|+..|.-...... .+.....|+.|+..|.
T Consensus 3 ~~C--P~C~~~~~v~~~~~~-----~~~~~v~C~~C~~~~~ 36 (38)
T TIGR02098 3 IQC--PNCKTSFRVVDSQLG-----ANGGKVRCGKCGHVWY 36 (38)
T ss_pred EEC--CCCCCEEEeCHHHcC-----CCCCEEECCCCCCEEE
Confidence 467 557777765554321 1122467777777654
No 76
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=34.91 E-value=34 Score=31.45 Aligned_cols=34 Identities=18% Similarity=0.578 Sum_probs=25.5
Q ss_pred CCCceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCccccccC
Q psy1991 310 GEKPYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCERAFS 354 (366)
Q Consensus 310 ~eKpf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgKsFs 354 (366)
...-|.| +.|+++|+.-.++. .-|.|+.||....
T Consensus 114 ~~~~Y~C--p~C~~rytf~eA~~---------~~F~Cp~Cg~~L~ 147 (178)
T PRK06266 114 NNMFFFC--PNCHIRFTFDEAME---------YGFRCPQCGEMLE 147 (178)
T ss_pred CCCEEEC--CCCCcEEeHHHHhh---------cCCcCCCCCCCCe
Confidence 3456899 66999998888763 3599999997643
No 77
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=33.01 E-value=32 Score=21.96 Aligned_cols=19 Identities=26% Similarity=0.684 Sum_probs=12.8
Q ss_pred ecCccccccCChhHHHHHhh
Q psy1991 345 QCTLCERAFSRSDHLSLHMK 364 (366)
Q Consensus 345 kC~~CgKsFs~kssLk~H~K 364 (366)
.|++|++.+ ....+.+|+.
T Consensus 3 ~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCPVCFREV-PENLINSHLD 21 (26)
T ss_pred cCCCCcCcc-cHHHHHHHHH
Confidence 577777777 5566677754
No 78
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=32.64 E-value=28 Score=29.74 Aligned_cols=15 Identities=33% Similarity=0.846 Sum_probs=6.9
Q ss_pred CceecCCCCCCCcccCh
Q psy1991 312 KPYQCGWKGCGWKFARS 328 (366)
Q Consensus 312 Kpf~C~i~~CgKsF~sk 328 (366)
.|..| +.||..|.-.
T Consensus 25 ~PivC--P~CG~~~~~~ 39 (108)
T PF09538_consen 25 DPIVC--PKCGTEFPPE 39 (108)
T ss_pred CCccC--CCCCCccCcc
Confidence 44455 2255554433
No 79
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=32.10 E-value=23 Score=27.25 Aligned_cols=29 Identities=34% Similarity=0.713 Sum_probs=21.7
Q ss_pred hcCCCceecCCCCCCCcccChHHHHHHhhhh
Q psy1991 308 HTGEKPYQCGWKGCGWKFARSDELTRHFRKH 338 (366)
Q Consensus 308 H~~eKpf~C~i~~CgKsF~sk~~L~~H~rtH 338 (366)
-.|+.-+.| +.||+.|....+..+|....
T Consensus 12 RDGE~~lrC--PRC~~~FR~~K~Y~RHVNKa 40 (65)
T COG4049 12 RDGEEFLRC--PRCGMVFRRRKDYIRHVNKA 40 (65)
T ss_pred cCCceeeeC--CchhHHHHHhHHHHHHhhHH
Confidence 346667888 66888888888888887643
No 80
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=31.41 E-value=33 Score=23.17 Aligned_cols=25 Identities=24% Similarity=0.788 Sum_probs=16.0
Q ss_pred ceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCcccc
Q psy1991 313 PYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCER 351 (366)
Q Consensus 313 pf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgK 351 (366)
.|+|.+ ||..+... +.|..|++||.
T Consensus 2 ~~~C~~--CG~i~~g~------------~~p~~CP~Cg~ 26 (34)
T cd00729 2 VWVCPV--CGYIHEGE------------EAPEKCPICGA 26 (34)
T ss_pred eEECCC--CCCEeECC------------cCCCcCcCCCC
Confidence 477855 88765432 24568888875
No 81
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=29.64 E-value=26 Score=31.00 Aligned_cols=20 Identities=30% Similarity=0.760 Sum_probs=9.4
Q ss_pred eecCccccccCChhHHHHHhhcC
Q psy1991 344 FQCTLCERAFSRSDHLSLHMKRH 366 (366)
Q Consensus 344 ykC~~CgKsFs~kssLk~H~KtH 366 (366)
..|-+|||.|+. |++|+++|
T Consensus 73 i~clecGk~~k~---LkrHL~~~ 92 (132)
T PF05443_consen 73 IICLECGKKFKT---LKRHLRTH 92 (132)
T ss_dssp EE-TBT--EESB---HHHHHHHT
T ss_pred eEEccCCcccch---HHHHHHHc
Confidence 456666666655 25555544
No 82
>KOG2785|consensus
Probab=28.95 E-value=55 Score=33.92 Aligned_cols=49 Identities=31% Similarity=0.596 Sum_probs=31.6
Q ss_pred eecCCCCCCCcccChHHHHHHhhhhcC-----------------------CCCeecCccc---cccCChhHHHHHhh
Q psy1991 314 YQCGWKGCGWKFARSDELTRHFRKHTG-----------------------DRPFQCTLCE---RAFSRSDHLSLHMK 364 (366)
Q Consensus 314 f~C~i~~CgKsF~sk~~L~~H~rtHtg-----------------------eKpykC~~Cg---KsFs~kssLk~H~K 364 (366)
-.|-+ |++.|..-..-..||..++| ..-|.|-.|+ +.|......+.||.
T Consensus 167 t~CLf--C~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~ 241 (390)
T KOG2785|consen 167 TDCLF--CDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMR 241 (390)
T ss_pred cceee--cCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHh
Confidence 45644 66666666666666665554 1236677777 77888888888874
No 83
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=27.49 E-value=41 Score=30.97 Aligned_cols=24 Identities=33% Similarity=0.995 Sum_probs=19.3
Q ss_pred ceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCcccc
Q psy1991 313 PYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCER 351 (366)
Q Consensus 313 pf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgK 351 (366)
.|.|++ ||.. |.|+-|-+|++||.
T Consensus 134 ~~vC~v--CGy~-------------~~ge~P~~CPiCga 157 (166)
T COG1592 134 VWVCPV--CGYT-------------HEGEAPEVCPICGA 157 (166)
T ss_pred EEEcCC--CCCc-------------ccCCCCCcCCCCCC
Confidence 699966 9864 55788999999984
No 84
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=26.72 E-value=77 Score=27.43 Aligned_cols=73 Identities=23% Similarity=0.459 Sum_probs=44.5
Q ss_pred eecCCCCCcccCchhHHHHHHhhc--------------CCCceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCcccc
Q psy1991 286 CSHNGCGKTYTKSSHLKAHLRTHT--------------GEKPYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCER 351 (366)
Q Consensus 286 C~~~~CgK~F~skssLk~H~rtH~--------------~eKpf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgK 351 (366)
|+. ||-...+...|.+-- |+ ..+.-.| -.|.+.|........ ..-.....|+|+.|..
T Consensus 18 Cpi--CgLtLVss~HLARSy--HHLfPl~~f~ev~~~~~~~~~~C--~~C~~~f~~~~~~~~--~~~~~~~~y~C~~C~~ 89 (112)
T TIGR00622 18 CPI--CGLTLILSTHLARSY--HHLFPLKAFQEIPLEEYNGSRFC--FGCQGPFPKPPVSPF--DELKDSHRYVCAVCKN 89 (112)
T ss_pred CCc--CCCEEeccchHHHhh--hccCCCcccccccccccCCCCcc--cCcCCCCCCcccccc--cccccccceeCCCCCC
Confidence 444 777777666665431 21 0111248 569999976542110 0012245799999999
Q ss_pred ccCChhHHHHHhhcC
Q psy1991 352 AFSRSDHLSLHMKRH 366 (366)
Q Consensus 352 sFs~kssLk~H~KtH 366 (366)
.|--.-+.-.|...|
T Consensus 90 ~FC~dCD~fiHe~Lh 104 (112)
T TIGR00622 90 VFCVDCDVFVHESLH 104 (112)
T ss_pred ccccccchhhhhhcc
Confidence 999888888887655
No 85
>KOG2785|consensus
Probab=25.82 E-value=67 Score=33.27 Aligned_cols=48 Identities=21% Similarity=0.457 Sum_probs=40.2
Q ss_pred eeecCCCCCcccCchhHHHHHHhhcCC-----------------------CceecCCCCCC---CcccChHHHHHHhh
Q psy1991 285 TCSHNGCGKTYTKSSHLKAHLRTHTGE-----------------------KPYQCGWKGCG---WKFARSDELTRHFR 336 (366)
Q Consensus 285 ~C~~~~CgK~F~skssLk~H~rtH~~e-----------------------Kpf~C~i~~Cg---KsF~sk~~L~~H~r 336 (366)
.|-. |++.+++-..-..||..++|- .-|.|-+ |+ +.|.+-...+.||+
T Consensus 168 ~CLf--C~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~--CN~~~~~f~sleavr~HM~ 241 (390)
T KOG2785|consen 168 DCLF--CDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLF--CNELGRPFSSLEAVRAHMR 241 (390)
T ss_pred ceee--cCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEE--eccccCcccccHHHHHHHh
Confidence 3777 999999999999999987661 2478866 98 99999999999997
No 86
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=25.40 E-value=26 Score=25.22 Aligned_cols=11 Identities=45% Similarity=1.464 Sum_probs=5.7
Q ss_pred eecCCCCCCCccc
Q psy1991 314 YQCGWKGCGWKFA 326 (366)
Q Consensus 314 f~C~i~~CgKsF~ 326 (366)
|+| .+|+..|.
T Consensus 6 y~C--~~Cg~~fe 16 (52)
T TIGR02605 6 YRC--TACGHRFE 16 (52)
T ss_pred EEe--CCCCCEeE
Confidence 455 44555554
No 87
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=24.80 E-value=27 Score=23.54 Aligned_cols=29 Identities=28% Similarity=0.761 Sum_probs=14.3
Q ss_pred eecCCCCCCCcccChHHHHHHhhhhcCCCCeecCcccc
Q psy1991 314 YQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCER 351 (366)
Q Consensus 314 f~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgK 351 (366)
|+| .+||+.|........ ...-.|+.||.
T Consensus 6 y~C--~~Cg~~fe~~~~~~~-------~~~~~CP~Cg~ 34 (41)
T smart00834 6 YRC--EDCGHTFEVLQKISD-------DPLATCPECGG 34 (41)
T ss_pred EEc--CCCCCEEEEEEecCC-------CCCCCCCCCCC
Confidence 566 446666653332211 23345666665
No 88
>KOG2482|consensus
Probab=24.49 E-value=84 Score=32.36 Aligned_cols=52 Identities=27% Similarity=0.595 Sum_probs=36.2
Q ss_pred CceecCCCCCCCccc-ChHHHHHHhh-hhc---C----------------C--CCeecCccccccCChhHHHHHhhc
Q psy1991 312 KPYQCGWKGCGWKFA-RSDELTRHFR-KHT---G----------------D--RPFQCTLCERAFSRSDHLSLHMKR 365 (366)
Q Consensus 312 Kpf~C~i~~CgKsF~-sk~~L~~H~r-tHt---g----------------e--KpykC~~CgKsFs~kssLk~H~Kt 365 (366)
...+|.+ |...+. .++....|+- .|. | + ..+.|-.|.|.|+.+..|+.|||.
T Consensus 143 fslqClF--Cn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrdkntLkeHMrk 217 (423)
T KOG2482|consen 143 FSLQCLF--CNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRDKNTLKEHMRK 217 (423)
T ss_pred eeeEEEE--ecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCCcHHHHHHHHh
Confidence 3467877 877654 4455666654 342 1 1 237899999999999999999973
No 89
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=23.77 E-value=58 Score=22.26 Aligned_cols=14 Identities=7% Similarity=-0.311 Sum_probs=8.7
Q ss_pred ccccccccccchhH
Q psy1991 233 SHGNYNKLHIHPYY 246 (366)
Q Consensus 233 iC~~CGK~F~~~~~ 246 (366)
.|+.|++.|.-...
T Consensus 4 ~Cp~C~~~y~i~d~ 17 (36)
T PF13717_consen 4 TCPNCQAKYEIDDE 17 (36)
T ss_pred ECCCCCCEEeCCHH
Confidence 47777776665544
No 90
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=23.27 E-value=80 Score=24.54 Aligned_cols=32 Identities=25% Similarity=0.551 Sum_probs=17.5
Q ss_pred ceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCcccc
Q psy1991 313 PYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCER 351 (366)
Q Consensus 313 pf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgK 351 (366)
.|.| +.||..-..+..--+ .+ ..+|.|+.||.
T Consensus 27 ~F~C--PnCGe~~I~Rc~~CR---k~--g~~Y~Cp~CGF 58 (61)
T COG2888 27 KFPC--PNCGEVEIYRCAKCR---KL--GNPYRCPKCGF 58 (61)
T ss_pred EeeC--CCCCceeeehhhhHH---Hc--CCceECCCcCc
Confidence 4778 558765443332211 11 34688888874
No 91
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=23.23 E-value=43 Score=29.67 Aligned_cols=22 Identities=55% Similarity=1.052 Sum_probs=14.6
Q ss_pred eecCCCCCcccCchhHHHHHHhhcCCC
Q psy1991 286 CSHNGCGKTYTKSSHLKAHLRTHTGEK 312 (366)
Q Consensus 286 C~~~~CgK~F~skssLk~H~rtH~~eK 312 (366)
|-. |||.|+ .|++|++.|+|-.
T Consensus 75 cle--cGk~~k---~LkrHL~~~~glt 96 (132)
T PF05443_consen 75 CLE--CGKKFK---TLKRHLRTHHGLT 96 (132)
T ss_dssp -TB--T--EES---BHHHHHHHTT-S-
T ss_pred Ecc--CCcccc---hHHHHHHHccCCC
Confidence 666 999999 5799999997754
No 92
>KOG4377|consensus
Probab=22.91 E-value=46 Score=34.92 Aligned_cols=68 Identities=21% Similarity=0.388 Sum_probs=47.1
Q ss_pred eeeecCCCCCcccCchhHHHHHHhhcCCC------------ceecCCCCCCCcccChHHHHHHhhhhcCCC-------Ce
Q psy1991 284 HTCSHNGCGKTYTKSSHLKAHLRTHTGEK------------PYQCGWKGCGWKFARSDELTRHFRKHTGDR-------PF 344 (366)
Q Consensus 284 H~C~~~~CgK~F~skssLk~H~rtH~~eK------------pf~C~i~~CgKsF~sk~~L~~H~rtHtgeK-------py 344 (366)
+.|.-+.|+..+-++..+.+|...|...+ .|.|....|.| +.++...|-..|+..+ -|
T Consensus 272 yhcl~e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rfs~syhC~~~~C~k---sTsdV~~h~nFht~~~n~Gfrrthf 348 (480)
T KOG4377|consen 272 YHCLNEYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRFSNSYHCTGQICEK---STSDVLLHDNFHTDKRNNGFRRTHF 348 (480)
T ss_pred hcccCccccccccchhhhHHHHHHHhhcccccccchhhcCccchhhhcccCc---ccccccccCccccccccCceeccee
Confidence 44777779877777999999998885432 36888778988 5556667766665321 27
Q ss_pred ecCccc--cccC
Q psy1991 345 QCTLCE--RAFS 354 (366)
Q Consensus 345 kC~~Cg--KsFs 354 (366)
.|..|| -.|+
T Consensus 349 hC~r~gCTdtfK 360 (480)
T KOG4377|consen 349 HCQRIGCTDTFK 360 (480)
T ss_pred EEeccCCccccc
Confidence 788777 4444
No 93
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=22.85 E-value=37 Score=24.05 Aligned_cols=29 Identities=21% Similarity=0.590 Sum_probs=15.1
Q ss_pred ceecCCCCCCCcccChHHHHHHhhhhcCCCCeecCcccccc
Q psy1991 313 PYQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCERAF 353 (366)
Q Consensus 313 pf~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgKsF 353 (366)
.|.| +.||..|...... ....|+.||..+
T Consensus 3 ~y~C--~~CG~~~~~~~~~----------~~~~Cp~CG~~~ 31 (46)
T PRK00398 3 EYKC--ARCGREVELDEYG----------TGVRCPYCGYRI 31 (46)
T ss_pred EEEC--CCCCCEEEECCCC----------CceECCCCCCeE
Confidence 4666 5576665542211 145677776543
No 94
>KOG2186|consensus
Probab=22.74 E-value=54 Score=32.33 Aligned_cols=50 Identities=26% Similarity=0.589 Sum_probs=33.4
Q ss_pred eeeeecCCCCCcccCchhHHHHHHhhcCCCceecCCCCCCCcccChHHHHHHhhhhc
Q psy1991 283 MHTCSHNGCGKTYTKSSHLKAHLRTHTGEKPYQCGWKGCGWKFARSDELTRHFRKHT 339 (366)
Q Consensus 283 ~H~C~~~~CgK~F~skssLk~H~rtH~~eKpf~C~i~~CgKsF~sk~~L~~H~rtHt 339 (366)
.+.|.. ||..-+. ..+.+|+-.-++ .-|.| -.|++.|.+ ...+.|..--+
T Consensus 3 ~FtCnv--CgEsvKK-p~vekH~srCrn-~~fSC--IDC~k~F~~-~sYknH~kCIT 52 (276)
T KOG2186|consen 3 FFTCNV--CGESVKK-PQVEKHMSRCRN-AYFSC--IDCGKTFER-VSYKNHTKCIT 52 (276)
T ss_pred EEehhh--hhhhccc-cchHHHHHhccC-CeeEE--eeccccccc-chhhhhhhhcc
Confidence 355655 8876654 467778777555 56888 568888887 66777765433
No 95
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=22.72 E-value=71 Score=22.93 Aligned_cols=26 Identities=23% Similarity=0.578 Sum_probs=14.5
Q ss_pred eecCCCCCCCcccChHHHHHHhhhhcCCCCeecCccccc
Q psy1991 314 YQCGWKGCGWKFARSDELTRHFRKHTGDRPFQCTLCERA 352 (366)
Q Consensus 314 f~C~i~~CgKsF~sk~~L~~H~rtHtgeKpykC~~CgKs 352 (366)
|.| .+||..|... ...+.+|+.||..
T Consensus 3 Y~C--~~Cg~~~~~~-----------~~~~irC~~CG~r 28 (44)
T smart00659 3 YIC--GECGRENEIK-----------SKDVVRCRECGYR 28 (44)
T ss_pred EEC--CCCCCEeecC-----------CCCceECCCCCce
Confidence 666 5577666533 1234667777643
No 96
>PF09432 THP2: Tho complex subunit THP2; InterPro: IPR018557 The THO complex plays a role in coupling transcription elongation to mRNA export. It is composed of subunits THP2, HPR1, THO2 and MFT1 [].
Probab=22.70 E-value=34 Score=30.29 Aligned_cols=16 Identities=44% Similarity=1.026 Sum_probs=14.3
Q ss_pred cchhhhhhhhcCCCCc
Q psy1991 55 QVWHDIETVMSGEGPP 70 (366)
Q Consensus 55 ~~~~~~~~~~~~~~~~ 70 (366)
+-|.+||.+|-|+||-
T Consensus 99 ~ewd~iE~~mFGD~pn 114 (132)
T PF09432_consen 99 KEWDNIEMLMFGDGPN 114 (132)
T ss_pred HHHHHHHHHHhcCChH
Confidence 5699999999999984
No 97
>PF11756 YgbA_NO: Nitrous oxide-stimulated promoter; InterPro: IPR020483 This entry contains proteins with no known function.
Probab=22.33 E-value=27 Score=30.07 Aligned_cols=37 Identities=38% Similarity=0.896 Sum_probs=26.4
Q ss_pred CCCCCCc----chhhhhhhcc--ccchhhhhhhhcCCCCceEE
Q psy1991 37 GLDKCPF----PYILFCFLCC--PQVWHDIETVMSGEGPPVFI 73 (366)
Q Consensus 37 ~~~~~~~----~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 73 (366)
-||.||| |.=--|-.-| |+.|-+|.+||.=.||-..+
T Consensus 45 Rl~~Cp~~e~Kp~C~~C~iHCY~p~~re~Ir~VMrysGpRMl~ 87 (112)
T PF11756_consen 45 RLDRCPFGEEKPFCSKCPIHCYKPEMREKIREVMRYSGPRMLF 87 (112)
T ss_pred HHHcCCCCCCCCCccccCchhCCHHHHHHHHHHHHHcCchHHH
Confidence 3688888 4333333334 88999999999999986543
No 98
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=21.82 E-value=52 Score=31.43 Aligned_cols=20 Identities=25% Similarity=0.833 Sum_probs=8.8
Q ss_pred eecCccccccCChhHHHHHh
Q psy1991 344 FQCTLCERAFSRSDHLSLHM 363 (366)
Q Consensus 344 ykC~~CgKsFs~kssLk~H~ 363 (366)
|.|..|+|.|+-..-.++|+
T Consensus 78 ~~C~lc~KlFkg~eFV~KHI 97 (214)
T PF04959_consen 78 WRCPLCGKLFKGPEFVRKHI 97 (214)
T ss_dssp EEE-SSS-EESSHHHHHHHH
T ss_pred ECCCCCCcccCChHHHHHHH
Confidence 44555555555444444443
No 99
>PF15269 zf-C2H2_7: Zinc-finger
Probab=20.81 E-value=61 Score=23.95 Aligned_cols=21 Identities=33% Similarity=0.659 Sum_probs=15.1
Q ss_pred eecCccccccCChhHHHHHhh
Q psy1991 344 FQCTLCERAFSRSDHLSLHMK 364 (366)
Q Consensus 344 ykC~~CgKsFs~kssLk~H~K 364 (366)
|+|-.|..+..-++.|-.||+
T Consensus 21 ykcfqcpftc~~kshl~nhmk 41 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMK 41 (54)
T ss_pred ceeecCCcccchHHHHHHHHH
Confidence 667777777777777777765
No 100
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=20.49 E-value=80 Score=26.18 Aligned_cols=12 Identities=0% Similarity=-0.374 Sum_probs=9.4
Q ss_pred cccccccccccc
Q psy1991 232 SSHGNYNKLHIH 243 (366)
Q Consensus 232 ~iC~~CGK~F~~ 243 (366)
.+|..|+-...-
T Consensus 12 lIC~~C~~av~~ 23 (109)
T PF12013_consen 12 LICRQCQYAVQP 23 (109)
T ss_pred EEeCCCCcccCc
Confidence 589999977655
Done!