Query psy4686
Match_columns 2028
No_of_seqs 773 out of 7242
Neff 10.1
Searched_HMMs 46136
Date Fri Aug 16 23:17:33 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4686.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4686hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1074|consensus 100.0 1.8E-29 3.9E-34 298.0 9.6 233 1791-2026 601-932 (958)
2 KOG1074|consensus 99.9 2.3E-27 5E-32 280.3 11.8 254 1293-1564 602-933 (958)
3 KOG2462|consensus 99.9 5.9E-27 1.3E-31 245.2 4.8 132 1885-2022 129-264 (279)
4 KOG2462|consensus 99.9 4.2E-24 9.2E-29 223.9 5.3 133 1822-1994 128-264 (279)
5 KOG3623|consensus 99.9 5.8E-22 1.2E-26 229.4 14.3 77 1946-2022 891-971 (1007)
6 KOG3608|consensus 99.9 9.3E-22 2E-26 208.9 12.2 244 599-907 135-392 (467)
7 KOG3608|consensus 99.9 1.3E-21 2.8E-26 207.9 12.2 236 503-805 134-376 (467)
8 KOG3623|consensus 99.8 1.2E-19 2.6E-24 210.5 16.0 82 1506-1594 890-971 (1007)
9 KOG3576|consensus 99.6 1.1E-15 2.3E-20 150.3 2.5 110 1883-2025 114-236 (267)
10 KOG3576|consensus 99.5 9.2E-15 2E-19 143.8 2.2 118 1477-1600 118-239 (267)
11 PLN03086 PRLI-interacting fact 99.1 5E-11 1.1E-15 144.4 5.3 94 1922-2021 454-560 (567)
12 KOG1146|consensus 99.0 8.2E-10 1.8E-14 140.7 8.3 675 100-863 463-1353(1406)
13 PLN03086 PRLI-interacting fact 98.9 5.9E-10 1.3E-14 135.2 5.7 72 1921-1995 478-562 (567)
14 KOG1146|consensus 98.9 1.3E-09 2.8E-14 139.0 4.1 188 1299-1495 439-637 (1406)
15 PHA00733 hypothetical protein 98.8 3.7E-09 8.1E-14 104.3 4.1 83 1921-2026 40-124 (128)
16 PF13465 zf-H2C2_2: Zinc-finge 98.6 1.4E-08 3E-13 68.5 1.7 26 1988-2013 1-26 (26)
17 PHA02768 hypothetical protein; 98.6 1.2E-08 2.6E-13 80.4 1.5 42 1975-2018 5-48 (55)
18 PHA00733 hypothetical protein 98.5 1.1E-07 2.3E-12 94.1 4.6 81 1508-1598 38-124 (128)
19 KOG3993|consensus 98.4 7.4E-08 1.6E-12 107.4 1.2 188 569-776 267-483 (500)
20 KOG3993|consensus 98.4 1.5E-07 3.2E-12 105.1 2.0 196 596-805 265-482 (500)
21 PHA02768 hypothetical protein; 98.3 1.8E-07 4E-12 73.9 1.6 43 1510-1554 5-47 (55)
22 PF13465 zf-H2C2_2: Zinc-finge 98.1 2E-06 4.3E-11 58.1 1.7 25 1060-1084 1-25 (26)
23 PHA00616 hypothetical protein 98.0 3.9E-06 8.4E-11 63.0 2.0 32 1975-2006 1-34 (44)
24 PHA00616 hypothetical protein 97.8 3.6E-06 7.9E-11 63.1 -0.0 35 1921-1955 1-35 (44)
25 PHA00732 hypothetical protein 97.7 1.8E-05 3.9E-10 70.2 2.5 43 1975-2023 1-46 (79)
26 PHA00732 hypothetical protein 97.7 2.2E-05 4.7E-10 69.7 2.0 22 1886-1907 1-22 (79)
27 PF05605 zf-Di19: Drought indu 97.5 8.7E-05 1.9E-09 61.3 3.6 50 103-155 3-53 (54)
28 PF05605 zf-Di19: Drought indu 97.4 0.00018 3.8E-09 59.5 4.4 53 255-310 2-54 (54)
29 PF00096 zf-C2H2: Zinc finger, 97.3 0.0001 2.3E-09 48.6 1.4 23 2002-2024 1-23 (23)
30 PF13894 zf-C2H2_4: C2H2-type 96.6 0.0011 2.4E-08 44.4 1.8 24 2002-2025 1-24 (24)
31 PF13912 zf-C2H2_6: C2H2-type 96.5 0.0011 2.3E-08 45.9 1.2 26 2001-2026 1-26 (27)
32 COG5189 SFP1 Putative transcri 96.4 0.0011 2.4E-08 71.7 1.0 58 1787-1844 341-418 (423)
33 PF00096 zf-C2H2: Zinc finger, 96.4 0.0024 5.1E-08 42.1 2.1 21 635-655 2-22 (23)
34 PF12756 zf-C2H2_2: C2H2 type 96.3 0.0018 3.9E-08 62.9 1.9 73 104-186 1-73 (100)
35 COG5189 SFP1 Putative transcri 96.1 0.0021 4.6E-08 69.6 1.0 70 454-525 347-420 (423)
36 PF12756 zf-C2H2_2: C2H2 type 95.9 0.0049 1.1E-07 59.8 2.8 73 1298-1378 1-73 (100)
37 KOG2482|consensus 95.9 0.015 3.3E-07 64.1 6.4 189 131-340 144-356 (423)
38 KOG2231|consensus 95.9 0.019 4.1E-07 71.9 7.9 48 256-309 183-236 (669)
39 PF13894 zf-C2H2_4: C2H2-type 95.8 0.0069 1.5E-07 40.5 2.1 21 133-153 2-22 (24)
40 smart00355 ZnF_C2H2 zinc finge 95.7 0.0077 1.7E-07 41.1 2.3 25 2002-2026 1-25 (26)
41 PF13912 zf-C2H2_6: C2H2-type 95.6 0.0044 9.5E-08 42.8 0.6 23 1922-1944 2-24 (27)
42 COG5236 Uncharacterized conser 95.4 0.035 7.6E-07 61.2 7.0 85 100-187 149-244 (493)
43 KOG2785|consensus 95.1 0.045 9.8E-07 62.5 6.7 79 102-185 3-90 (390)
44 KOG2231|consensus 95.0 0.041 8.9E-07 69.0 7.0 126 1449-1594 116-260 (669)
45 COG5236 Uncharacterized conser 95.0 0.059 1.3E-06 59.5 7.2 135 366-531 149-309 (493)
46 PF13909 zf-H2C2_5: C2H2-type 94.9 0.013 2.9E-07 39.0 1.2 24 2002-2026 1-24 (24)
47 PRK04860 hypothetical protein; 94.7 0.013 2.8E-07 60.6 1.2 38 1975-2014 119-156 (160)
48 KOG2785|consensus 94.1 0.13 2.9E-06 58.8 7.6 74 569-655 166-242 (390)
49 PF12874 zf-met: Zinc-finger o 93.8 0.029 6.2E-07 37.9 1.1 23 2002-2024 1-23 (25)
50 PF09237 GAGA: GAGA factor; I 93.8 0.067 1.5E-06 41.5 3.0 40 119-158 12-51 (54)
51 PRK04860 hypothetical protein; 93.7 0.019 4.2E-07 59.3 0.0 34 1950-1985 120-153 (160)
52 PF09237 GAGA: GAGA factor; I 93.6 0.054 1.2E-06 42.0 2.3 28 1999-2026 22-49 (54)
53 KOG2482|consensus 93.1 0.18 3.8E-06 56.2 6.1 27 503-529 279-305 (423)
54 smart00355 ZnF_C2H2 zinc finge 92.9 0.083 1.8E-06 35.9 2.3 23 1539-1561 1-23 (26)
55 PF13909 zf-H2C2_5: C2H2-type 92.9 0.056 1.2E-06 36.0 1.4 23 1574-1597 1-23 (24)
56 PF12874 zf-met: Zinc-finger o 91.7 0.11 2.3E-06 35.1 1.6 19 133-151 2-20 (25)
57 KOG2893|consensus 90.7 0.072 1.6E-06 55.6 -0.0 52 428-483 10-61 (341)
58 COG5048 FOG: Zn-finger [Genera 90.4 0.19 4.1E-06 65.2 3.6 57 1794-1851 288-349 (467)
59 KOG4173|consensus 90.4 0.11 2.4E-06 53.2 0.9 82 397-483 78-173 (253)
60 COG5048 FOG: Zn-finger [Genera 90.3 0.19 4E-06 65.3 3.3 54 1509-1562 32-87 (467)
61 PF12171 zf-C2H2_jaz: Zinc-fin 89.7 0.18 4E-06 34.7 1.4 22 103-124 2-23 (27)
62 PF12171 zf-C2H2_jaz: Zinc-fin 89.6 0.19 4.1E-06 34.6 1.3 22 286-307 2-23 (27)
63 KOG2893|consensus 87.0 0.15 3.3E-06 53.2 -0.6 48 105-157 13-60 (341)
64 PF13913 zf-C2HC_2: zinc-finge 81.8 0.82 1.8E-05 30.7 1.3 19 1977-1995 4-23 (25)
65 PF13913 zf-C2HC_2: zinc-finge 81.7 1.1 2.3E-05 30.2 1.8 21 2002-2023 3-23 (25)
66 smart00451 ZnF_U1 U1-like zinc 79.9 1.1 2.3E-05 33.2 1.5 23 2001-2023 3-25 (35)
67 KOG4173|consensus 79.6 1.5 3.3E-05 45.3 2.9 85 101-189 78-172 (253)
68 TIGR00622 ssl1 transcription f 78.1 3.5 7.5E-05 39.4 4.5 19 1823-1841 14-32 (112)
69 PF12013 DUF3505: Protein of u 72.9 5.1 0.00011 39.2 4.6 29 1445-1473 77-109 (109)
70 smart00451 ZnF_U1 U1-like zinc 71.8 3 6.6E-05 30.8 2.1 23 1649-1671 2-24 (35)
71 cd00350 rubredoxin_like Rubred 70.7 2.3 5E-05 31.0 1.1 8 1923-1930 3-10 (33)
72 cd00350 rubredoxin_like Rubred 70.1 2.5 5.4E-05 30.8 1.2 10 1974-1983 16-25 (33)
73 COG4049 Uncharacterized protei 69.6 2.2 4.8E-05 33.8 0.9 29 128-156 14-42 (65)
74 cd00729 rubredoxin_SM Rubredox 67.4 3 6.6E-05 30.6 1.2 9 2001-2009 18-26 (34)
75 TIGR00622 ssl1 transcription f 66.6 8.8 0.00019 36.8 4.3 21 1475-1495 14-34 (112)
76 KOG2186|consensus 62.6 2.5 5.5E-05 45.8 0.0 44 1950-1994 4-49 (276)
77 COG4049 Uncharacterized protei 61.5 3.3 7.1E-05 32.9 0.5 30 1791-1820 13-42 (65)
78 PF12013 DUF3505: Protein of u 57.9 27 0.00058 34.2 6.3 79 503-593 11-108 (109)
79 KOG1280|consensus 56.5 7 0.00015 44.6 2.2 40 366-405 77-116 (381)
80 PF09538 FYDLN_acid: Protein o 55.6 7.3 0.00016 37.5 1.8 27 1922-1959 10-36 (108)
81 KOG2907|consensus 53.1 6.5 0.00014 37.0 1.0 32 1976-2014 75-115 (116)
82 COG2888 Predicted Zn-ribbon RN 52.9 9.2 0.0002 31.4 1.7 14 1886-1899 27-40 (61)
83 COG1592 Rubrerythrin [Energy p 52.0 7.8 0.00017 40.4 1.5 12 1971-1982 145-156 (166)
84 PF09986 DUF2225: Uncharacteri 48.2 9.8 0.00021 42.4 1.7 10 1976-1985 49-58 (214)
85 PF09986 DUF2225: Uncharacteri 47.7 7.7 0.00017 43.2 0.8 10 1950-1959 49-58 (214)
86 PF02176 zf-TRAF: TRAF-type zi 47.2 7 0.00015 33.3 0.3 33 1960-1992 23-60 (60)
87 COG1592 Rubrerythrin [Energy p 45.9 11 0.00024 39.3 1.4 24 1921-1957 134-157 (166)
88 COG2331 Uncharacterized protei 45.1 7.3 0.00016 33.6 0.1 36 1920-1963 11-47 (82)
89 PF09538 FYDLN_acid: Protein o 44.7 13 0.00027 36.0 1.6 15 963-977 25-39 (108)
90 KOG2186|consensus 44.2 10 0.00023 41.3 1.0 48 1019-1069 4-52 (276)
91 COG1997 RPL43A Ribosomal prote 43.0 9.2 0.0002 34.3 0.4 30 1948-1985 34-63 (89)
92 smart00614 ZnF_BED BED zinc fi 41.4 18 0.00038 29.5 1.7 20 2003-2022 20-44 (50)
93 TIGR02300 FYDLN_acid conserved 39.9 18 0.00039 35.3 1.8 27 1922-1959 10-36 (129)
94 smart00734 ZnF_Rad18 Rad18-lik 39.9 21 0.00046 24.4 1.6 21 164-185 2-22 (26)
95 PF13717 zinc_ribbon_4: zinc-r 39.8 21 0.00045 26.7 1.7 14 869-882 4-17 (36)
96 KOG1280|consensus 39.0 19 0.00041 41.4 2.0 59 100-159 21-107 (381)
97 TIGR02098 MJ0042_CXXC MJ0042 f 37.3 20 0.00044 27.1 1.4 11 869-879 4-14 (38)
98 PF15269 zf-C2H2_7: Zinc-finge 37.0 19 0.00041 27.3 1.1 22 2002-2023 21-42 (54)
99 COG4957 Predicted transcriptio 36.8 11 0.00023 36.7 -0.2 24 1950-1974 77-100 (148)
100 smart00834 CxxC_CXXC_SSSS Puta 36.6 15 0.00033 28.3 0.6 12 1921-1932 5-16 (41)
101 COG1198 PriA Primosomal protei 36.3 21 0.00046 47.1 2.3 10 1922-1931 436-445 (730)
102 PF06524 NOA36: NOA36 protein; 35.6 12 0.00026 40.8 -0.1 180 1818-2024 32-232 (314)
103 PF03604 DNA_RNApol_7kD: DNA d 35.3 28 0.0006 25.3 1.7 11 930-940 1-11 (32)
104 COG4530 Uncharacterized protei 35.2 25 0.00054 32.8 1.8 26 1923-1959 11-36 (129)
105 PF06524 NOA36: NOA36 protein; 34.8 18 0.0004 39.5 1.1 91 986-1097 136-233 (314)
106 PRK14890 putative Zn-ribbon RN 33.3 26 0.00057 29.1 1.5 10 1886-1895 25-34 (59)
107 PF12907 zf-met2: Zinc-binding 33.0 32 0.0007 26.3 1.8 27 103-129 2-31 (40)
108 TIGR02605 CxxC_CxxC_SSSS putat 31.9 20 0.00043 29.5 0.6 12 1921-1932 5-16 (52)
109 PRK04023 DNA polymerase II lar 31.7 35 0.00077 45.5 3.1 35 1921-1984 638-672 (1121)
110 PRK00464 nrdR transcriptional 31.1 14 0.0003 38.3 -0.5 12 1922-1933 29-40 (154)
111 COG4530 Uncharacterized protei 30.9 27 0.00059 32.5 1.4 28 1887-1932 10-37 (129)
112 smart00659 RPOLCX RNA polymera 30.8 30 0.00064 27.3 1.4 11 930-940 3-13 (44)
113 TIGR02300 FYDLN_acid conserved 30.2 32 0.00069 33.7 1.8 13 930-942 10-22 (129)
114 PRK00464 nrdR transcriptional 30.2 23 0.00049 36.8 0.9 10 1950-1959 29-38 (154)
115 PF13719 zinc_ribbon_5: zinc-r 30.1 34 0.00074 25.8 1.5 10 902-911 26-35 (37)
116 COG1996 RPC10 DNA-directed RNA 29.9 23 0.0005 28.4 0.7 28 1921-1958 6-33 (49)
117 PF05290 Baculo_IE-1: Baculovi 29.5 52 0.0011 32.5 3.0 60 1941-2016 72-136 (140)
118 PF02176 zf-TRAF: TRAF-type zi 29.0 51 0.0011 27.9 2.7 27 1003-1029 23-53 (60)
119 PHA00626 hypothetical protein 28.7 25 0.00054 28.6 0.6 14 2000-2013 22-35 (59)
120 PF02892 zf-BED: BED zinc fing 28.6 36 0.00077 26.9 1.6 22 427-448 15-40 (45)
121 smart00531 TFIIE Transcription 28.2 27 0.00058 36.4 1.0 12 1886-1897 99-110 (147)
122 PF10571 UPF0547: Uncharacteri 27.5 31 0.00066 23.7 0.8 9 2003-2011 16-24 (26)
123 PRK00398 rpoP DNA-directed RNA 27.1 35 0.00076 27.2 1.3 11 1477-1487 4-14 (46)
124 TIGR00373 conserved hypothetic 26.8 27 0.00059 36.8 0.8 18 1885-1902 108-125 (158)
125 PF09723 Zn-ribbon_8: Zinc rib 26.7 26 0.00057 27.3 0.5 13 1921-1933 5-17 (42)
126 PRK09678 DNA-binding transcrip 26.4 29 0.00063 30.6 0.7 13 1948-1960 26-40 (72)
127 TIGR00373 conserved hypothetic 26.3 40 0.00087 35.5 1.9 32 1792-1833 106-137 (158)
128 PF02892 zf-BED: BED zinc fing 25.7 44 0.00095 26.4 1.6 11 1975-1985 16-26 (45)
129 PRK06266 transcription initiat 25.2 29 0.00063 37.3 0.7 14 1922-1935 118-131 (178)
130 COG5151 SSL1 RNA polymerase II 24.8 48 0.001 37.2 2.2 97 1725-1893 309-417 (421)
131 PF14353 CpXC: CpXC protein 24.3 32 0.0007 34.8 0.8 33 1951-1985 3-48 (128)
132 PRK04023 DNA polymerase II lar 23.6 57 0.0012 43.7 2.9 47 1859-1959 627-673 (1121)
133 PF15269 zf-C2H2_7: Zinc-finge 23.5 50 0.0011 25.3 1.3 22 1356-1377 21-42 (54)
134 smart00531 TFIIE Transcription 22.8 50 0.0011 34.4 1.8 35 1858-1897 99-134 (147)
135 PF12760 Zn_Tnp_IS1595: Transp 22.6 52 0.0011 26.2 1.5 10 1974-1983 36-45 (46)
136 PF05191 ADK_lid: Adenylate ki 22.3 36 0.00078 25.5 0.5 17 1998-2014 18-34 (36)
137 PF04959 ARS2: Arsenite-resist 22.1 43 0.00093 36.9 1.2 30 98-127 73-102 (214)
138 KOG2071|consensus 21.9 72 0.0016 40.2 3.2 98 1536-1660 416-523 (579)
139 PRK06266 transcription initiat 20.9 41 0.00088 36.2 0.7 33 252-295 114-146 (178)
140 COG1198 PriA Primosomal protei 20.3 63 0.0014 42.9 2.4 41 1921-1984 444-484 (730)
141 PF12907 zf-met2: Zinc-binding 20.0 78 0.0017 24.4 1.8 32 570-601 2-36 (40)
No 1
>KOG1074|consensus
Probab=99.96 E-value=1.8e-29 Score=297.97 Aligned_cols=233 Identities=21% Similarity=0.477 Sum_probs=181.2
Q ss_pred cCCCCccCCCCCCcCCCHHHHHhhhhhhcCCCCccccCcchhhhcCchhHhhhhcccccCCCCCCcCccccC---ccCCC
Q psy4686 1791 VAEKTKLCPVCNKEYATPVTMRKHLREVHVSKKKFKCDLCQKQFKASRSLQCHKQFVHLGVHKPKFLNMECD---YCSRK 1867 (2028)
Q Consensus 1791 ~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~p~~~~~~C~---~C~~~ 1867 (2028)
....|..|-+|-++..=+..|+.|.| .|+||+||+|.+||+.|.++-+|+.| +.+|..- .|-...+.|+ +|-+.
T Consensus 601 ~~TdPNqCiiC~rVlSC~saLqmHyr-tHtGERPFkCKiCgRAFtTkGNLkaH-~~vHka~-p~~R~q~ScP~~~ic~~k 677 (958)
T KOG1074|consen 601 KRTDPNQCIICLRVLSCPSALQMHYR-THTGERPFKCKICGRAFTTKGNLKAH-MSVHKAK-PPARVQFSCPSTFICQKK 677 (958)
T ss_pred ccCCccceeeeeecccchhhhhhhhh-cccCcCccccccccchhccccchhhc-ccccccC-ccccccccCCchhhhccc
Confidence 34567899999999999999999999 99999999999999999999999999 6688654 4444679999 99999
Q ss_pred CCChhHhhhcccccCCC-c------------cccCCCCcccccchhhHHHHHhhccc-------------ccCCCCCCCc
Q psy4686 1868 FPSKNEVTNHIKSHMGV-R------------DILCPICKKGFIALKHMKTHLKKHMW-------------KAGEIPLEDT 1921 (2028)
Q Consensus 1868 f~~~~~l~~H~~~h~~~-~------------~~~C~~C~~~f~~~~~l~~H~~~h~~-------------~~~~~~~~~~ 1921 (2028)
|...-.|.+|+++|.+. . .-.|..|.+.|.....+..++--|.. .+++...-.+
T Consensus 678 ftn~V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~ 757 (958)
T KOG1074|consen 678 FTNAVTLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPP 757 (958)
T ss_pred ccccccccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCC
Confidence 99999999999999832 1 24699999999988888888776611 0111112236
Q ss_pred cccCcccccccCchhhhcccc-----------------------cccCCccc-cccccccccccc---------------
Q psy4686 1922 YLCDLCSKVFLEHKDMIRHRE-----------------------WVHGDKCH-VCKVCGAKIKGN--------------- 1962 (2028)
Q Consensus 1922 ~~C~~C~k~f~~~~~L~~H~~-----------------------~h~~~~~~-~C~~C~~~~~~~--------------- 1962 (2028)
..+..|+..+.....+..+-. .++++++. .+..++-.....
T Consensus 758 ~~e~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l 837 (958)
T KOG1074|consen 758 PPENSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSML 837 (958)
T ss_pred ccccccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhccccccc
Confidence 789999999988777665532 23456777 677777542110
Q ss_pred -----cccccccccCC------------------------CCcccCCcchhh--hhhHHHHhhhhcCCCcccCCccCccc
Q psy4686 1963 -----MKRHMLSHTGE------------------------KPFCCDICGKSL--KGNLKAHILRCQGERPYKCDVCGSSF 2011 (2028)
Q Consensus 1963 -----l~~H~~~h~~~------------------------k~~~C~~C~~~f--~~~l~~H~~~h~~~~p~~C~~C~~~f 2011 (2028)
+..-..++-+. ....|.+||+.| .++|..|||+|+|+|||.|.+|+++|
T Consensus 838 ~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aF 917 (958)
T KOG1074|consen 838 NEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAF 917 (958)
T ss_pred ccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchhhhhhh
Confidence 11111111111 117899999999 67899999999999999999999999
Q ss_pred CCchhHHHHHHhhCC
Q psy4686 2012 KDKKHFSVHIRNHGE 2026 (2028)
Q Consensus 2012 ~~~~~l~~H~~~h~~ 2026 (2028)
.++++|+.||.+|..
T Consensus 918 ttrgnLKvHMgtH~w 932 (958)
T KOG1074|consen 918 TTRGNLKVHMGTHMW 932 (958)
T ss_pred hhhhhhhhhhccccc
Confidence 999999999999975
No 2
>KOG1074|consensus
Probab=99.94 E-value=2.3e-27 Score=280.28 Aligned_cols=254 Identities=22% Similarity=0.510 Sum_probs=160.7
Q ss_pred CCCCCCCCccccccCChHHHHHHHhhhcCCCCcccccccccccccchHHHhhhccccCCCC--Cceecc---cchhhccC
Q psy4686 1293 ASGSKDCPVCHKTYSTPKTMRRHLRQVHTSQKRYLCDICGKQFTSTNRVNIHKACVHSSTG--NKFECI---YCKKKYRR 1367 (2028)
Q Consensus 1293 ~~~~~~C~~C~k~f~~~~~L~~H~~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~--~~~~C~---~C~~~f~~ 1367 (2028)
.++..+|-+|.++.+.++.|+.|+| .|+||+||+|.+||+.|.++.+|..|+-.+...-. ..|.|+ +|-+.|.+
T Consensus 602 ~TdPNqCiiC~rVlSC~saLqmHyr-tHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn 680 (958)
T KOG1074|consen 602 RTDPNQCIICLRVLSCPSALQMHYR-THTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTN 680 (958)
T ss_pred cCCccceeeeeecccchhhhhhhhh-cccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccc
Confidence 3456679999999999999999999 99999999999999999999999999975443222 339999 99999999
Q ss_pred hhhHHHHHhhcCCC-CC------------cccccccccccchhhhhhHHHhhhhhhhhhhhhcccccccccccCCCCCCC
Q psy4686 1368 KFDLKEHINKHTGN-KP------------YHCQICKESFYTLKTYRGDLKRHEQLAEQSDEMHSTSKFQNKIKKPLDTGD 1434 (2028)
Q Consensus 1368 ~~~L~~H~~~H~~~-~~------------~~C~~C~~~f~~~~~~~~~l~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1434 (2028)
.-.|..|+++|.+. .+ -.|..|.+.|.....+..+...|....... . .....+.
T Consensus 681 ~V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~---~------~~~~~~t---- 747 (958)
T KOG1074|consen 681 AVTLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEP---D------EQMDERT---- 747 (958)
T ss_pred cccccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCC---c------ccccccc----
Confidence 99999999999842 22 469999999976666655554442110000 0 0000000
Q ss_pred CchhhhhhhccCCCCcCCCCCCcccCHHHHHHH-----------------------HhhhcCCCcc-ccCccCccccchh
Q psy4686 1435 LSWEEWNELVLSGTKKCPVCHKTYSTPRQMRIH-----------------------LREAHSQKKY-ACDVCGKQFTSTN 1490 (2028)
Q Consensus 1435 ~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H-----------------------~~~h~~~~~~-~C~~C~~~f~~~~ 1490 (2028)
+..+.......-+..|+..+.....+..+ -...+++++. .+..++......-
T Consensus 748 ----~t~~~~~tp~~~e~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v 823 (958)
T KOG1074|consen 748 ----ETEELDVTPPPPENSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSV 823 (958)
T ss_pred ----cccccccCCCccccccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCcccc
Confidence 00111111222333344333333222222 1122344444 4444443322111
Q ss_pred ----hhhhhc---c-----ccccccc------------------------ccccccccccccccccCChHhHHHHHhhcC
Q psy4686 1491 ----RVSQHK---A-----HSHFGII------------------------KTIQRKFECDFCKKKFYRNFDLQEHINTHT 1534 (2028)
Q Consensus 1491 ----~l~~H~---~-----~~h~~~~------------------------~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~ 1534 (2028)
.|..-. . ..-..+. -.......|.+|++.|...++|+.|||+|+
T Consensus 824 ~~~pvl~~~~~~~l~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHt 903 (958)
T KOG1074|consen 824 NMDPVLWNQETSMLNEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHT 903 (958)
T ss_pred ccCchhhcccccccccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCC
Confidence 010000 0 0000000 001112569999999999999999999999
Q ss_pred CCCCcCCcchhhhcCCHHHHHHHHHHhccc
Q psy4686 1535 GNKPYQCQICNKSFGTRRNYRLHLKRHKRS 1564 (2028)
Q Consensus 1535 ~~k~~~C~~C~~~f~~~~~l~~H~~~H~~~ 1564 (2028)
++|||.|.+|++.|.++.+|+.||.+|.+.
T Consensus 904 g~KPF~C~fC~~aFttrgnLKvHMgtH~w~ 933 (958)
T KOG1074|consen 904 GPKPFFCHFCEEAFTTRGNLKVHMGTHMWV 933 (958)
T ss_pred CCCCccchhhhhhhhhhhhhhhhhcccccc
Confidence 999999999999999999999999988753
No 3
>KOG2462|consensus
Probab=99.93 E-value=5.9e-27 Score=245.20 Aligned_cols=132 Identities=28% Similarity=0.577 Sum_probs=112.7
Q ss_pred ccccCCCCcccccchhhHHHHHhhcccccCCCCCCCccccCcccccccCchhhhcccccccCCcccccccccccc--ccc
Q psy4686 1885 RDILCPICKKGFIALKHMKTHLKKHMWKAGEIPLEDTYLCDLCSKVFLEHKDMIRHREWVHGDKCHVCKVCGAKI--KGN 1962 (2028)
Q Consensus 1885 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~~--~~~ 1962 (2028)
..|+|+.||+.+.+.++|.+|.++|..-...+ .+.|.+|+|.|.+...|..|+|+|+ -|++|.+|||.| .=-
T Consensus 129 ~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~k----a~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWL 202 (279)
T KOG2462|consen 129 PRYKCPECGKSYSTSSNLSRHKQTHRSLDSKK----AFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWL 202 (279)
T ss_pred Cceeccccccccccccccchhhcccccccccc----cccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHH
Confidence 35777777777777777777777774433322 6888888888888888888888886 688899999999 346
Q ss_pred cccccccccCCCCcccCCcchhh--hhhHHHHhhhhcCCCcccCCccCcccCCchhHHHHHH
Q psy4686 1963 MKRHMLSHTGEKPFCCDICGKSL--KGNLKAHILRCQGERPYKCDVCGSSFKDKKHFSVHIR 2022 (2028)
Q Consensus 1963 l~~H~~~h~~~k~~~C~~C~~~f--~~~l~~H~~~h~~~~p~~C~~C~~~f~~~~~l~~H~~ 2022 (2028)
|+-|+|+|||||||.|+.|+|+| +++||+||++|.+.|+|+|+.|+|+|..++-|++|..
T Consensus 203 LQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~E 264 (279)
T KOG2462|consen 203 LQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSE 264 (279)
T ss_pred hhcccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhh
Confidence 88999999999999999999999 9999999999999999999999999999999999964
No 4
>KOG2462|consensus
Probab=99.89 E-value=4.2e-24 Score=223.90 Aligned_cols=133 Identities=28% Similarity=0.579 Sum_probs=95.2
Q ss_pred CCccccCcchhhhcCchhHhhhhcccccCCCCCCcCccccCccCCCCCChhHhhhcccccCCCccccCCCCcccccchhh
Q psy4686 1822 KKKFKCDLCQKQFKASRSLQCHKQFVHLGVHKPKFLNMECDYCSRKFPSKNEVTNHIKSHMGVRDILCPICKKGFIALKH 1901 (2028)
Q Consensus 1822 ~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~p~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~ 1901 (2028)
...|+|+.||+.|.+..+|.+| +..|.... ..+.+.|++|||.|.+...
T Consensus 128 ~~r~~c~eCgk~ysT~snLsrH-kQ~H~~~~------------------------------s~ka~~C~~C~K~YvSmpA 176 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSNLSRH-KQTHRSLD------------------------------SKKAFSCKYCGKVYVSMPA 176 (279)
T ss_pred CCceeccccccccccccccchh-hccccccc------------------------------ccccccCCCCCceeeehHH
Confidence 3456777777777776666666 33442211 1345666666666666666
Q ss_pred HHHHHhhcccccCCCCCCCccccCcccccccCchhhhcccccccCCcccccccccccc--ccccccccccccCCCCcccC
Q psy4686 1902 MKTHLKKHMWKAGEIPLEDTYLCDLCSKVFLEHKDMIRHREWVHGDKCHVCKVCGAKI--KGNMKRHMLSHTGEKPFCCD 1979 (2028)
Q Consensus 1902 l~~H~~~h~~~~~~~~~~~~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~~--~~~l~~H~~~h~~~k~~~C~ 1979 (2028)
|+.|+++|. -++.|.+|||.|...=-|+.|+|+|||||||.|+.|||.| +++|+.||++|++.|+|.|.
T Consensus 177 LkMHirTH~---------l~c~C~iCGKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~ 247 (279)
T KOG2462|consen 177 LKMHIRTHT---------LPCECGICGKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCP 247 (279)
T ss_pred HhhHhhccC---------CCcccccccccccchHHhhcccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCc
Confidence 666666662 1577777777777777778888888888888888888888 78888888888888888888
Q ss_pred Ccchhh--hhhHHHHhh
Q psy4686 1980 ICGKSL--KGNLKAHIL 1994 (2028)
Q Consensus 1980 ~C~~~f--~~~l~~H~~ 1994 (2028)
.|+|+| ++.|..|..
T Consensus 248 ~C~KsFsl~SyLnKH~E 264 (279)
T KOG2462|consen 248 RCGKSFALKSYLNKHSE 264 (279)
T ss_pred chhhHHHHHHHHHHhhh
Confidence 888888 788888864
No 5
>KOG3623|consensus
Probab=99.87 E-value=5.8e-22 Score=229.41 Aligned_cols=77 Identities=31% Similarity=0.732 Sum_probs=72.1
Q ss_pred CCcccccccccccc--ccccccccccccCCCCcccCCcchhh--hhhHHHHhhhhcCCCcccCCccCcccCCchhHHHHH
Q psy4686 1946 GDKCHVCKVCGAKI--KGNMKRHMLSHTGEKPFCCDICGKSL--KGNLKAHILRCQGERPYKCDVCGSSFKDKKHFSVHI 2021 (2028)
Q Consensus 1946 ~~~~~~C~~C~~~~--~~~l~~H~~~h~~~k~~~C~~C~~~f--~~~l~~H~~~h~~~~p~~C~~C~~~f~~~~~l~~H~ 2021 (2028)
.+-+|.|+.|+|.| .+.|.+|..-|+|.|||.|.||.|+| +..|..|+|.|.|||||+|+.|+|+|++.+++..||
T Consensus 891 e~gmyaCDqCDK~FqKqSSLaRHKYEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHSGSYSQHM 970 (1007)
T KOG3623|consen 891 EDGMYACDQCDKAFQKQSSLARHKYEHSGQRPYQCIICKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHSGSYSQHM 970 (1007)
T ss_pred ccccchHHHHHHHHHhhHHHHHhhhhhcCCCCcccchhhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccccchHhhh
Confidence 45689999999999 57899999999999999999999999 778999999999999999999999999999999998
Q ss_pred H
Q psy4686 2022 R 2022 (2028)
Q Consensus 2022 ~ 2022 (2028)
-
T Consensus 971 N 971 (1007)
T KOG3623|consen 971 N 971 (1007)
T ss_pred c
Confidence 3
No 6
>KOG3608|consensus
Probab=99.86 E-value=9.3e-22 Score=208.95 Aligned_cols=244 Identities=27% Similarity=0.471 Sum_probs=170.4
Q ss_pred cccC--ccccccCCHHHHHHHHhhcccc-----cccccCC-Ccccc--cccccccCChHHHHHHHHHhcCCCccccCccc
Q psy4686 599 YMCD--ICGKQFTTLSYLDLHRKSHNET-----VKRKEED-KKYEC--DSCNKKFWSKRALSEHMIIHTGIKEHQCHVCN 668 (2028)
Q Consensus 599 ~~C~--~C~~~f~~~~~L~~H~~~h~~~-----~~~~~~~-~~~~C--~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~ 668 (2028)
|.|. .|+..|.+...|..|+-.|..- ....+.+ ..+.| .-|-+.|.+++.|++|++.|+++|...|+.||
T Consensus 135 f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg 214 (467)
T KOG3608|consen 135 FRCGWEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCG 214 (467)
T ss_pred hccChhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHH
Confidence 5553 6888888888888888776532 1112222 23445 46888888888888888888888888888888
Q ss_pred cccccchhhhhhhhhccccCccccCCcccccccCCHHHHHhhhccccCCCcccccccCcccccccchhhhccccccccCC
Q psy4686 669 TAFYHIRSLSRHLKIHEYNESKMYKCPVCSKMFTELYEMKRHRDHKHGGKCHVCKICGATIKLFTELEEKEDSSFERKKS 748 (2028)
Q Consensus 669 ~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~~~~~~~~~l~~h~~~~~~~~~~ 748 (2028)
..|.+...|..|++........+|.|..|.+.|.+...|+.|+..|-.
T Consensus 215 ~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn-------------------------------- 262 (467)
T KOG3608|consen 215 ELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVN-------------------------------- 262 (467)
T ss_pred HHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhh--------------------------------
Confidence 888888888888877655556678888888888888888888765532
Q ss_pred CCccccCCccCcccchhHHHhhhcccccCCCCcccCCcchhhhcChHHHhhhhccccCCCcccCCCCCCCCCCCCCcchh
Q psy4686 749 KDSFHCDTCGKYFRSRQICNKHIKRVHLNPTKTFKCDLCSDRFSTSAKLKKHVDKIHFTPEFQEPSKKNEPLDTGDYTLE 828 (2028)
Q Consensus 749 ~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~~~~~~~~~~~~~~~~~~ 828 (2028)
-|+|+.|+.+...++.|..||+-.|. ..++|+|+.|++.|.+.+.|.+|+. .|.
T Consensus 263 --~ykCplCdmtc~~~ssL~~H~r~rHs-~dkpfKCd~Cd~~c~~esdL~kH~~-~HS---------------------- 316 (467)
T KOG3608|consen 263 --CYKCPLCDMTCSSASSLTTHIRYRHS-KDKPFKCDECDTRCVRESDLAKHVQ-VHS---------------------- 316 (467)
T ss_pred --cccccccccCCCChHHHHHHHHhhhc-cCCCccccchhhhhccHHHHHHHHH-hcc----------------------
Confidence 36777777778888888888876776 4778888888888888888888874 441
Q ss_pred hhhhhhhcCCCCCCC--CCCCCCChHHHHHHHHhhhcCC--CccccccccccccCchHHHhhhhhcccCccccccccccc
Q psy4686 829 QWNELVTAQSKDCPV--CHKTYSTPKSMRKHLREVHSSQ--KKYVCDMCGKQFTSNNRVSQHKAYSHFGIIKTIERKFEC 904 (2028)
Q Consensus 829 ~~~~~~~~~~~~C~~--C~~~f~~~~~L~~H~~~~H~~~--~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~~~~~~C 904 (2028)
...|.|+. |...|.+...|++|++++|.|. -+|.|-.|++.|++-..|..|+...|.-.+..|-+.|.-
T Consensus 317 -------~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~~PsGh~RFtY 389 (467)
T KOG3608|consen 317 -------KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGFRLPSGHKRFTY 389 (467)
T ss_pred -------ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhcccCCCCCCceee
Confidence 13466766 7777777777777777777543 346677777777666666666665554434444333433
Q ss_pred ccc
Q psy4686 905 DYC 907 (2028)
Q Consensus 905 ~~C 907 (2028)
..|
T Consensus 390 k~~ 392 (467)
T KOG3608|consen 390 KVD 392 (467)
T ss_pred eec
Confidence 333
No 7
>KOG3608|consensus
Probab=99.86 E-value=1.3e-21 Score=207.87 Aligned_cols=236 Identities=23% Similarity=0.503 Sum_probs=191.8
Q ss_pred ceec--ccchhhcCCHHHHHHHHhhccccccccCCCCCCCcccCCCCCcccccCCCChhHHhhhhcCCCCccC--cchhc
Q psy4686 503 QIKC--RYCARTFSTQNDLKEHVSSVHMFMTESFENPNEPRVLNSNRYPLAIDGGLSLEEYQQIVATKSKQCP--ICEKI 578 (2028)
Q Consensus 503 ~~~C--~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~--~C~~~ 578 (2028)
.|.| +.|+..|.+...+..|+..+-....-. .. .. ++ .....+.|. .|.+.
T Consensus 134 ~f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd--~~-------------~~-~~---------D~~pv~~C~W~~Ct~~ 188 (467)
T KOG3608|consen 134 NFRCGWEDCEREFVSIVEFQDHVVKHALFCEYD--IQ-------------KT-PE---------DERPVTMCNWAMCTKH 188 (467)
T ss_pred hhccChhhcCCcccCHHHHHHHHHHhhhhhhhh--hh-------------hC-CC---------CCCceeeccchhhhhh
Confidence 4556 569999999999999985433221000 00 00 00 001224463 59999
Q ss_pred CCCHHHHHHHHhhhcCCCCCcccCccccccCCHHHHHHHHhhcccccccccCCCcccccccccccCChHHHHHHHHHhcC
Q psy4686 579 FAAPKQMRMHLRHVHSSIKRYMCDICGKQFTTLSYLDLHRKSHNETVKRKEEDKKYECDSCNKKFWSKRALSEHMIIHTG 658 (2028)
Q Consensus 579 f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~ 658 (2028)
|.++..|+.|++ .|++++...|+.||..|.++..|..|.+..+.. ...+|+|..|.|.|.+...|..|+..|..
T Consensus 189 ~~~k~~LreH~r-~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l-----~~n~fqC~~C~KrFaTeklL~~Hv~rHvn 262 (467)
T KOG3608|consen 189 MGNKYRLREHIR-THSNEKVVACPHCGELFRTKTKLFDHLRRQTEL-----NTNSFQCAQCFKRFATEKLLKSHVVRHVN 262 (467)
T ss_pred hccHHHHHHHHH-hcCCCeEEecchHHHHhccccHHHHHHHhhhhh-----cCCchHHHHHHHHHhHHHHHHHHHHHhhh
Confidence 999999999999 999999999999999999999999999877643 34589999999999999999999999964
Q ss_pred CCccccCccccccccchhhhhhhhhccccCccccCCcccccccCCHHHHHhhhccccCCCcccccccCcccccccchhhh
Q psy4686 659 IKEHQCHVCNTAFYHIRSLSRHLKIHEYNESKMYKCPVCSKMFTELYEMKRHRDHKHGGKCHVCKICGATIKLFTELEEK 738 (2028)
Q Consensus 659 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~~~~~~~~~l~~h 738 (2028)
-|+|++|+.+....++|..||+..+ ...++|+|+.|++.|.+...|.+|...|. +.
T Consensus 263 --~ykCplCdmtc~~~ssL~~H~r~rH-s~dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~-------------------- 318 (467)
T KOG3608|consen 263 --CYKCPLCDMTCSSASSLTTHIRYRH-SKDKPFKCDECDTRCVRESDLAKHVQVHS-KT-------------------- 318 (467)
T ss_pred --cccccccccCCCChHHHHHHHHhhh-ccCCCccccchhhhhccHHHHHHHHHhcc-cc--------------------
Confidence 6999999999999999999999866 45999999999999999999999988654 33
Q ss_pred ccccccccCCCCccccCC--ccCcccchhHHHhhhcccc-cCCCCcccCCcchhhhcChHHHhhhhcccc
Q psy4686 739 EDSSFERKKSKDSFHCDT--CGKYFRSRQICNKHIKRVH-LNPTKTFKCDLCSDRFSTSAKLKKHVDKIH 805 (2028)
Q Consensus 739 ~~~~~~~~~~~~~~~C~~--C~~~f~~~~~l~~H~~~~h-~~~~~~~~C~~C~~~f~~~~~L~~H~~~~h 805 (2028)
.|.|+. |..+|++...+++|++.+| .+++.+|.|-.|++.|++-.+|..|+.+.|
T Consensus 319 ------------~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH 376 (467)
T KOG3608|consen 319 ------------VYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKH 376 (467)
T ss_pred ------------ceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhh
Confidence 356655 8888888888999998888 445678999999999999999999988877
No 8
>KOG3623|consensus
Probab=99.82 E-value=1.2e-19 Score=210.46 Aligned_cols=82 Identities=34% Similarity=0.705 Sum_probs=78.4
Q ss_pred cccccccccccccccCChHhHHHHHhhcCCCCCcCCcchhhhcCCHHHHHHHHHHhcccCCCCCCCCccccccccccccC
Q psy4686 1506 TIQRKFECDFCKKKFYRNFDLQEHINTHTGNKPYQCQICNKSFGTRRNYRLHLKRHKRSAGQLKPEDIHECKICHKIFLE 1585 (2028)
Q Consensus 1506 ~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~k~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~~~~~~~~C~~C~~~f~~ 1585 (2028)
+.+..|.|+.|+|+|...+.|.+|.-.|+|.+||+|.+|.|+|..+..|..|+|.|. ||+||+|+.|++.|..
T Consensus 890 te~gmyaCDqCDK~FqKqSSLaRHKYEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHS-------GEKPfQCdKClKRFSH 962 (1007)
T KOG3623|consen 890 TEDGMYACDQCDKAFQKQSSLARHKYEHSGQRPYQCIICKKAFKHKHHLTEHKRLHS-------GEKPFQCDKCLKRFSH 962 (1007)
T ss_pred CccccchHHHHHHHHHhhHHHHHhhhhhcCCCCcccchhhHhhhhhhhhhhhhhhcc-------CCCcchhhhhhhhccc
Confidence 444579999999999999999999999999999999999999999999999999998 9999999999999999
Q ss_pred hhHHHHHhh
Q psy4686 1586 NSRLTRHMN 1594 (2028)
Q Consensus 1586 ~~~l~~H~~ 1594 (2028)
......||.
T Consensus 963 SGSYSQHMN 971 (1007)
T KOG3623|consen 963 SGSYSQHMN 971 (1007)
T ss_pred ccchHhhhc
Confidence 999999995
No 9
>KOG3576|consensus
Probab=99.55 E-value=1.1e-15 Score=150.28 Aligned_cols=110 Identities=30% Similarity=0.685 Sum_probs=69.2
Q ss_pred CCccccCCCCcccccchhhHHHHHhhcccccCCCCCCCccccCcccccccCchhhhcccccccCCccccccccccccccc
Q psy4686 1883 GVRDILCPICKKGFIALKHMKTHLKKHMWKAGEIPLEDTYLCDLCSKVFLEHKDMIRHREWVHGDKCHVCKVCGAKIKGN 1962 (2028)
Q Consensus 1883 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~~~~~ 1962 (2028)
+...|.|.+|||+|.....|.+|++-| ..-+ .|.|..|||.|....+
T Consensus 114 d~d~ftCrvCgK~F~lQRmlnrh~kch---~~vk----r~lct~cgkgfndtfd-------------------------- 160 (267)
T KOG3576|consen 114 DQDSFTCRVCGKKFGLQRMLNRHLKCH---SDVK----RHLCTFCGKGFNDTFD-------------------------- 160 (267)
T ss_pred CCCeeeeehhhhhhhHHHHHHHHhhhc---cHHH----HHHHhhccCcccchhh--------------------------
Confidence 344566777777777666667776666 2222 5666666666665555
Q ss_pred cccccccccCCCCcccCCcchhh--hhhHHHHhhhhcC-----------CCcccCCccCcccCCchhHHHHHHhhC
Q psy4686 1963 MKRHMLSHTGEKPFCCDICGKSL--KGNLKAHILRCQG-----------ERPYKCDVCGSSFKDKKHFSVHIRNHG 2025 (2028)
Q Consensus 1963 l~~H~~~h~~~k~~~C~~C~~~f--~~~l~~H~~~h~~-----------~~p~~C~~C~~~f~~~~~l~~H~~~h~ 2025 (2028)
|++|+|+|||-+||+|..|+|+| +-.|..|++.=+| +|.|+|..||++-.....+..|++.|+
T Consensus 161 lkrh~rthtgvrpykc~~c~kaftqrcsleshl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~h 236 (267)
T KOG3576|consen 161 LKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHH 236 (267)
T ss_pred hhhhhccccCccccchhhhhHHHHhhccHHHHHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcC
Confidence 55555666666666666666666 4556666653222 466777777777777777777777765
No 10
>KOG3576|consensus
Probab=99.48 E-value=9.2e-15 Score=143.83 Aligned_cols=118 Identities=29% Similarity=0.591 Sum_probs=92.3
Q ss_pred cccCccCccccchhhhhhhcccccccccccccccccccccccccCChHhHHHHHhhcCCCCCcCCcchhhhcCCHHHHHH
Q psy4686 1477 YACDVCGKQFTSTNRVSQHKAHSHFGIIKTIQRKFECDFCKKKFYRNFDLQEHINTHTGNKPYQCQICNKSFGTRRNYRL 1556 (2028)
Q Consensus 1477 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~k~~~C~~C~~~f~~~~~l~~ 1556 (2028)
|.|.+|++.|.-...|.+|++ -|...+.+-|..||+.|....+|++|+|+|+|-+||+|..|+++|+++-+|..
T Consensus 118 ftCrvCgK~F~lQRmlnrh~k------ch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsles 191 (267)
T KOG3576|consen 118 FTCRVCGKKFGLQRMLNRHLK------CHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLES 191 (267)
T ss_pred eeeehhhhhhhHHHHHHHHhh------hccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHH
Confidence 444444555554555555555 44555667799999999999999999999999999999999999999999999
Q ss_pred HHHHhcccC----CCCCCCCccccccccccccChhHHHHHhhhhcCCC
Q psy4686 1557 HLKRHKRSA----GQLKPEDIHECKICHKIFLENSRLTRHMNFTHGDK 1600 (2028)
Q Consensus 1557 H~~~H~~~~----~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~ 1600 (2028)
|++.-++.. .....++.|.|+.||++-.....+..|++.||...
T Consensus 192 hl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~S 239 (267)
T KOG3576|consen 192 HLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFS 239 (267)
T ss_pred HHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCC
Confidence 997543222 12235678999999999999999999999998754
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.11 E-value=5e-11 Score=144.36 Aligned_cols=94 Identities=29% Similarity=0.614 Sum_probs=60.2
Q ss_pred cccCcccccccCchhhhcccccccCCcccccccccccc-ccccccccccccCCCCcccCCcchhh-h-----------hh
Q psy4686 1922 YLCDLCSKVFLEHKDMIRHREWVHGDKCHVCKVCGAKI-KGNMKRHMLSHTGEKPFCCDICGKSL-K-----------GN 1988 (2028)
Q Consensus 1922 ~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~~-~~~l~~H~~~h~~~k~~~C~~C~~~f-~-----------~~ 1988 (2028)
+.|++|++.|. ..+|..|+++|| +|+.|+ ||+.+ +..|..|+++|.++||+.|..|++.| . +.
T Consensus 454 ~~C~~Cgk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~ 529 (567)
T PLN03086 454 VHCEKCGQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVLEKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRG 529 (567)
T ss_pred ccCCCCCCccc-hHHHHHHHHhcC--CCccCC-CCCCcchhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhh
Confidence 45666666664 455666666654 566666 66555 55666677777777777777777766 2 25
Q ss_pred HHHHhhhhcCCCcccCCccCcccCCchhHHHHH
Q psy4686 1989 LKAHILRCQGERPYKCDVCGSSFKDKKHFSVHI 2021 (2028)
Q Consensus 1989 l~~H~~~h~~~~p~~C~~C~~~f~~~~~l~~H~ 2021 (2028)
|..|..++ |.+++.|..||+.|..+ .|..|+
T Consensus 530 Lt~HE~~C-G~rt~~C~~Cgk~Vrlr-dm~~H~ 560 (567)
T PLN03086 530 MSEHESIC-GSRTAPCDSCGRSVMLK-EMDIHQ 560 (567)
T ss_pred HHHHHHhc-CCcceEccccCCeeeeh-hHHHHH
Confidence 77777775 77777777777776654 344444
No 12
>KOG1146|consensus
Probab=98.97 E-value=8.2e-10 Score=140.70 Aligned_cols=675 Identities=13% Similarity=0.085 Sum_probs=317.9
Q ss_pred CCcccCcchhhhccCHHHHHhhhcccCCCCCcccccccCccccChHHHHHHHHhhccC--CCCCCCccCCcchhhccChH
Q psy4686 100 PDRRTCRICTEVFENLTLLRRHMRVKHPGEQNLPCRLCDMSFSNKYQKKKHYSAFHKG--MPFIPSFTCPICQKIFKNKE 177 (2028)
Q Consensus 100 ~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~~~~~~~l~~H~~~~h~~--~~~~~~~~C~~C~~~f~~~~ 177 (2028)
.+.|.|+.|++.|.....|-.|||.+|+..+. .+|- ..+.|.+.-.+. ......|.|..|++.+..+.
T Consensus 463 ~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~---~~c~-------~gq~~~~~arg~~~~~~~~p~~C~~C~~stttng 532 (1406)
T KOG1146|consen 463 FKTLKCPKCNWHYKLAQTLGVHMRSKHPESQS---AYCK-------AGQNHPRLARGEVYRCPGKPYPCRACNYSTTTNG 532 (1406)
T ss_pred cccccCCccchhhhhHHHhhhcccccccccch---hHhH-------hccccccccccccccCCCCcccceeeeeeeecch
Confidence 37777777777777777777777777765443 2221 122222211110 11133489999999999999
Q ss_pred HHHHHhhh--hhhhhhhccCCCCccccccccCCCCCCCCcccchhhhhcccCcccccccccccccCCCCCCccc-cCCCC
Q psy4686 178 WYLDHLSL--HEEQLISLNRRKPIHVRSVRLPSVRGQDIFTTTTKFMRRKAGNVLHVARWFTDCQSIEDPSLEC-KNNKA 254 (2028)
Q Consensus 178 ~l~~Hl~~--h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 254 (2028)
+|..||++ |...+....+....++...... +....+.......+.... ...+-
T Consensus 533 ~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~------------------------v~s~~P~~ag~~~~ags~~pktkP 588 (1406)
T KOG1146|consen 533 NLSIHLQSDLHRNELEEAEENAGEQVRLLPAS------------------------VTSAVPEEAGLGPSAGSSGPKTKP 588 (1406)
T ss_pred HHHHHHHHHhhHHHHHHHHhccccchhhhhhh------------------------hcccCcccccCCCCCCCCCCCCCC
Confidence 99999998 4444333333222222111111 011111111112222222 33456
Q ss_pred ccccCCCCCCCCCHHHHHHHHhhhccCC----------------------------------------------------
Q psy4686 255 SFTCDTCDKPFDTIEKCRRHAIRMHMNP---------------------------------------------------- 282 (2028)
Q Consensus 255 ~~~C~~C~~~~~~~~~l~~H~~~~h~~~---------------------------------------------------- 282 (2028)
.|.|.+|++-..-...|+.|+...+...
T Consensus 589 ~~~C~vc~yetniarnlrihmtss~~s~~p~~~Lq~~it~~l~~~~~~~~~lp~tps~~~~s~~~l~qkl~l~~~~~e~~ 668 (1406)
T KOG1146|consen 589 SWRCEVCSYETNIARNLRIHMTASPSSSPPSLVLQQNITSSLASLLGGQGRLPFTPSSPPSSDYELSQKLELPDLSLEVA 668 (1406)
T ss_pred CcchhhhcchhhhhhccccccccCCCCCChHHHhhhcchhhccccccCcCCCCCCCCCCccccchhhhhhcccccccccc
Confidence 7999999999998889998873322100
Q ss_pred ---------------------------------------------------------------CCccccccccc-cccCh
Q psy4686 283 ---------------------------------------------------------------CKMFKCDICVA-SFLTP 298 (2028)
Q Consensus 283 ---------------------------------------------------------------~~~~~C~~C~~-~f~~~ 298 (2028)
.+.|.|-.|.. +-.+.
T Consensus 669 a~~as~~~~~~~~~~~~~~~aPt~s~~g~~~~~~~~~~~~~~~~~~~aa~~l~~a~~~~~sps~k~~~civcd~~st~~l 748 (1406)
T KOG1146|consen 669 AKPASGLDGQLQLIRADPFLAPTGSEVGGGESLSPRDKLLRLTILPEAAMILGRAYMQDNSPSLKVFDCIVCDVFSTDRL 748 (1406)
T ss_pred ccccccccchhhhccCCcccCCCCCCCCCCCCCCcccccCcccccHHHHhhhhhcccCCCCHHHHHHHHhhhhhhhhhhH
Confidence 05578999997 45567
Q ss_pred hhhhhhhhhhccccc-ccccceecCCCcccCCChhHhhhcccccccc-CCCC-CCC-CCCCChhhhhhhh----ccCCCC
Q psy4686 299 HKLKTHIKTKHRTQL-KELYTFECQHCEDKFSTPFELNKHNESKHEK-GRKL-NPE-TGGYTYEEYKEVV----VSKSKQ 370 (2028)
Q Consensus 299 ~~l~~H~~~~h~~~~-~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~-~~~~-~~~-~~~~~~~~~~~~~----~~~~~~ 370 (2028)
..|+.|+........ .-...|.|.+|.+.-..+..++.|.+..-.. ...+ ... ..++..+....++ ......
T Consensus 749 ~~l~~h~~~~rs~ke~v~g~~~~c~l~~y~t~~kanfqlh~Ktdkh~qk~~~~~hikegg~a~~~~t~~la~~~~P~~l~ 828 (1406)
T KOG1146|consen 749 DQLWFHNTRERSRKEQVPGDVPSCKLKPYATNTKANFQLHNKTDKHVQKYQLRAHIKEGGPANEYPTKTLASGPNPVHLK 828 (1406)
T ss_pred HHHhhcchhhhhhhhcccCCCCcceecccccccchhhhhhcccchhhhccchhhhhhccCCccCCcchhhhcCCCccccc
Confidence 778888876543211 1124689999999999999999986542211 0000 000 0111101011111 234568
Q ss_pred CCccccccCCHHHHHHHhhhhc-CCCcccee-----------------cccccccccChhhhhhcccccccCCCcceecc
Q psy4686 371 CPLCTKIFTTAKHMRVHLRSVH-NGKERKFI-----------------CDICGKQFTSNIHASRHKNYAHSRDRQKMECD 432 (2028)
Q Consensus 371 C~~C~~~f~~~~~l~~H~~~~h-~~~~~~~~-----------------C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~C~ 432 (2028)
|..|+.-..+...|..|-+... .+...+|+ |..|...-.... +. .......|.
T Consensus 829 cn~cd~~Tns~~~L~lh~~~~~he~~~~~~ks~~~l~~~ls~~~~s~~~as~~~s~~~~~-~~--------~s~~s~~~~ 899 (1406)
T KOG1146|consen 829 CNACDTPTNSPEPLLLHTPPSEHEGPEEIYKSLLSLACSLSAGDLSDSSASSLASPESPG-GG--------PSGGSGVPD 899 (1406)
T ss_pred cccccCCCCCccccccCCCCCccccccccchhHHHHHHHhhccccccccccccccccCCC-CC--------CCCCcccch
Confidence 9999988888878888765322 22222222 111111111100 00 011223345
Q ss_pred cccccccChhHHHHHHHHc---------------------------------------------------------cCCC
Q psy4686 433 YCKRKFTCKRYLAEHINAH---------------------------------------------------------TGNT 455 (2028)
Q Consensus 433 ~C~~~f~~~~~L~~H~~~h---------------------------------------------------------~~~~ 455 (2028)
.|++.-..-.....|+++- .+-.
T Consensus 900 ~~~r~a~~~~~~d~qlk~i~~~~~~q~~~~~~~~E~l~~~~~~~~~~i~vw~qna~~~s~k~~~n~~~~ss~~~s~sp~s 979 (1406)
T KOG1146|consen 900 GMGRRAYRTQESDLQLKIIKACYEAQRTPTMQECEVLEEPIGLPKRVIQVWFQNARAKSKKAKLNGTAASSTGGSSSPLS 979 (1406)
T ss_pred hhhhhhhccchhHHHHHHHHHHHhhccCChHHHHHhhcccccCCcchhHHhhhhhhhhhhhhhhccccccccccccCccc
Confidence 5555443333333333211 0011
Q ss_pred ccccccccccccChhHHHHHHhhhccCcchhhhhhccccCcCccCccceecccchhhcCCHHHHHHHHhhccccccccCC
Q psy4686 456 IYGCRICKKTFLYTNGLRRHILSRHKDTDVVILNEVKDIFPSHLQSYQIKCRYCARTFSTQNDLKEHVSSVHMFMTESFE 535 (2028)
Q Consensus 456 ~~~C~~C~k~F~~~~~L~~H~~~~h~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~ 535 (2028)
++.|..|.+.|.....+..|....-....+..+.+... +.....++|..|...-..+..|..|....|..... .
T Consensus 980 ~~q~k~~~~~~~~~~~~~~~~~k~~s~~~l~~l~~~~~----~v~~s~~kc~~~q~~~q~k~~l~a~~~~l~s~~p~--a 1053 (1406)
T KOG1146|consen 980 AAQPKDCPYCDVKYDFYVSCCGKLFSRQHLAKLKEAVR----AVLKSESKCYLCQPAPQAKPALKAHPATLPSSAPG--A 1053 (1406)
T ss_pred ccCCCcccccCcchhhhhhhccchhhHHHHHHHHHhhh----ccccchhhhhhhhhhhhCchhhhcCcccccccCcc--c
Confidence 36677777777666666555433222211111111111 22233578999999999999999999888866652 1
Q ss_pred CCCCCcccCCCCC-cccc---cCCCChhHHhhhhcCCCCccCcchhcCCCHHHHHHHHhhhcCCCC---CcccCcccccc
Q psy4686 536 NPNEPRVLNSNRY-PLAI---DGGLSLEEYQQIVATKSKQCPICEKIFAAPKQMRMHLRHVHSSIK---RYMCDICGKQF 608 (2028)
Q Consensus 536 ~~~~~~~~~~~~~-~~~~---~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~---~~~C~~C~~~f 608 (2028)
....++....... +..+ .+................-|..|.--|.+......|++-.|.... .+.=+.|+..+
T Consensus 1054 ~t~~~~~aa~s~~~~a~~~~~l~~s~~~~s~~~~ss~~~l~~~~~~d~~~~~~~~~~~~~~~pq~m~~~t~~~~~a~~a~ 1133 (1406)
T KOG1146|consen 1054 ATTPPRLAAVSLFGPALAQPPLGNSAPFNSGPAPSSGLLLLATSVLDTTTVVQTAGPGRPLPPQPMPDQTNTKTAATQAP 1133 (1406)
T ss_pred cCCCccchhhhhcCcccccccccccccccCCCCCcchhhhhhhccCCccccccccCCCCCCCcccccCccccccccccCc
Confidence 1111111110000 0000 000000000000001112355566666666666555554443221 12233344444
Q ss_pred CCHHHHHHHHhhcccccccccCCCcccccccccccCChHHHHHHHHHhcCCCccccCccccccccchhhhhhhhhccccC
Q psy4686 609 TTLSYLDLHRKSHNETVKRKEEDKKYECDSCNKKFWSKRALSEHMIIHTGIKEHQCHVCNTAFYHIRSLSRHLKIHEYNE 688 (2028)
Q Consensus 609 ~~~~~L~~H~~~h~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~ 688 (2028)
..-.-+.-|...-+ .+. +| -.+.+...+.-|.+.+.-.-.-.|-.|+ .|.....+..|..+.+..-
T Consensus 1134 ~~~~~~~~~~de~~-------s~v--~~----la~~~se~~~l~s~l~~l~~a~~~~~~~-s~~~~~~~~p~~nt~~~~~ 1199 (1406)
T KOG1146|consen 1134 VPGAPTEPLGDEVS-------SEV--KP----LAGPTSESNDLLSNLKALKTAVPALLGG-SFLPFPLPPPGGNTPPLEF 1199 (1406)
T ss_pred cCCCcccccccccc-------ccc--cc----cCCccchhHHHHHHHHHHHHHHHHHhcc-cccCCCCCCcccCCChhhh
Confidence 33332222221111 111 12 4455555555554444221112222233 3322222222211110000
Q ss_pred ccccCCcccccccCCH-----HHHHhhhccccCCCcccccccCcccccccchhhhccccccccCCCCccccCCccCcccc
Q psy4686 689 SKMYKCPVCSKMFTEL-----YEMKRHRDHKHGGKCHVCKICGATIKLFTELEEKEDSSFERKKSKDSFHCDTCGKYFRS 763 (2028)
Q Consensus 689 ~~~~~C~~C~~~f~~~-----~~l~~H~~~~h~~~~~~C~~C~~~~~~~~~l~~h~~~~~~~~~~~~~~~C~~C~~~f~~ 763 (2028)
...-.=.+|+..|... .-+..|+ | ..|..|...|....+...+-.-..........+.|..|++.|..
T Consensus 1200 ~~~~~~q~~~~~~~~~~manp~l~~~~~----g---l~~~~~e~~f~~~~~~~~~a~~~~~~~~~sGe~~c~~~~~~~~~ 1272 (1406)
T KOG1146|consen 1200 GPQLQGQYFQQLLGMKFMANPMLPQTLI----G---LLPNALEQPFPQEPEPTATAPPKPPELPASGEGECGAVDELLTP 1272 (1406)
T ss_pred CchHHHHHHHHHHhccccCCcccchhhh----c---ccHHhhhcCccCcccccccCCCCCCcCcCCCcchhhhccccccC
Confidence 0000001111111111 0011111 1 12333334444443333332221111222345889999999998
Q ss_pred hhHHHhhhcccccCCCCcccCCcchhhhcChHHHhhhhccccCCCcccCCCCCCCCCCCCCcchhhhhhhhhcCCCCCCC
Q psy4686 764 RQICNKHIKRVHLNPTKTFKCDLCSDRFSTSAKLKKHVDKIHFTPEFQEPSKKNEPLDTGDYTLEQWNELVTAQSKDCPV 843 (2028)
Q Consensus 764 ~~~l~~H~~~~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~ 843 (2028)
.-.+. |+ ....+|.|.+|...|.....|..|.++.-....-.+.. ...++.. ...+.+ |..
T Consensus 1273 ~~~~~-~l-----~~~~~~~~~~~~~~~~~~~~l~~~~~k~~~~~~~~~~~-~~~~l~~-----------~d~~~~-c~~ 1333 (1406)
T KOG1146|consen 1273 SFGIS-TL-----DVTHRYLCRQCKMAFDGEAPLTAHQRKFCFAGRGSGGS-MPPPLRV-----------PDCTYH-CLA 1333 (1406)
T ss_pred cccee-ec-----ccchhHHHHHHHhhhcchhHHHHHHHHHHhccCccccC-CCCcccC-----------cccccc-chH
Confidence 88776 65 35667999999999999999999974322111100000 0011111 112445 888
Q ss_pred CCCCCCChHHHHHHHHhhhc
Q psy4686 844 CHKTYSTPKSMRKHLREVHS 863 (2028)
Q Consensus 844 C~~~f~~~~~L~~H~~~~H~ 863 (2028)
|...|.....|..||+.++.
T Consensus 1334 c~~~~~~~~alqihm~~~~~ 1353 (1406)
T KOG1146|consen 1334 CEVLLSGREALQIHMRSSAH 1353 (1406)
T ss_pred HHhhcchhHHHHHHHHHhhh
Confidence 88888888888888765553
No 13
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.95 E-value=5.9e-10 Score=135.19 Aligned_cols=72 Identities=25% Similarity=0.577 Sum_probs=40.0
Q ss_pred ccccCcccccccCchhhhcccccccCCccccccccccccc------------cccccccccccCCCCcccCCcchhh-hh
Q psy4686 1921 TYLCDLCSKVFLEHKDMIRHREWVHGDKCHVCKVCGAKIK------------GNMKRHMLSHTGEKPFCCDICGKSL-KG 1987 (2028)
Q Consensus 1921 ~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~~~------------~~l~~H~~~h~~~k~~~C~~C~~~f-~~ 1987 (2028)
+|.|+ ||+.+ .+..|..|+++|.+++|+.|+.|++.|. +.|..|..+. |.+++.|..||+.| ..
T Consensus 478 pv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrlr 554 (567)
T PLN03086 478 PLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGRSVMLK 554 (567)
T ss_pred CccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCCeeeeh
Confidence 34444 44322 3344444444444455555555554441 2466676664 77777777777777 56
Q ss_pred hHHHHhhh
Q psy4686 1988 NLKAHILR 1995 (2028)
Q Consensus 1988 ~l~~H~~~ 1995 (2028)
.|..|+..
T Consensus 555 dm~~H~~~ 562 (567)
T PLN03086 555 EMDIHQIA 562 (567)
T ss_pred hHHHHHHH
Confidence 67777653
No 14
>KOG1146|consensus
Probab=98.85 E-value=1.3e-09 Score=138.96 Aligned_cols=188 Identities=18% Similarity=0.243 Sum_probs=107.9
Q ss_pred CCccccccCChHHHHHHHhhhcCCCCcccccccccccccchHHHhhhccccCCCCCceecccchhhccChhhHHHHHhhc
Q psy4686 1299 CPVCHKTYSTPKTMRRHLRQVHTSQKRYLCDICGKQFTSTNRVNIHKACVHSSTGNKFECIYCKKKYRRKFDLKEHINKH 1378 (2028)
Q Consensus 1299 C~~C~k~f~~~~~L~~H~~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~H 1378 (2028)
|.-|+-.+.....+..|+...|+-.+.|.|+.|+..|+....|..|++..|...+. .+|.. +..-..+.+-.-.-
T Consensus 439 ~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~----~~c~~-gq~~~~~arg~~~~ 513 (1406)
T KOG1146|consen 439 LTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQS----AYCKA-GQNHPRLARGEVYR 513 (1406)
T ss_pred ccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccch----hHhHh-cccccccccccccc
Confidence 66777778888888888887888889999999999999999999999977655433 22221 11111111111122
Q ss_pred CCCCCcccccccccccchhhhhhHHHhhhhhhhhhhhh---cccccccc-------cccCCCCCCCCchhhhhhhccCCC
Q psy4686 1379 TGNKPYHCQICKESFYTLKTYRGDLKRHEQLAEQSDEM---HSTSKFQN-------KIKKPLDTGDLSWEEWNELVLSGT 1448 (2028)
Q Consensus 1379 ~~~~~~~C~~C~~~f~~~~~~~~~l~~H~~~~~~~~~~---~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~ 1448 (2028)
.+.+||.|..|.+++ ..+..|.+|++......++ ......+. ....|...|........+......
T Consensus 514 ~~~~p~~C~~C~~st----ttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~ 589 (1406)
T KOG1146|consen 514 CPGKPYPCRACNYST----TTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPS 589 (1406)
T ss_pred CCCCcccceeeeeee----ecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCC
Confidence 345688888888888 5555555555432211111 00000000 000111111110001111223346
Q ss_pred CcCCCCCCcccCHHHHHHHHhhhcCCC-ccccCccCccccchhhhhhh
Q psy4686 1449 KKCPVCHKTYSTPRQMRIHLREAHSQK-KYACDVCGKQFTSTNRVSQH 1495 (2028)
Q Consensus 1449 ~~C~~C~~~f~~~~~l~~H~~~h~~~~-~~~C~~C~~~f~~~~~l~~H 1495 (2028)
+.|.+|++...-..+|+.||..-.... |..|..|+-.+.....+..+
T Consensus 590 ~~C~vc~yetniarnlrihmtss~~s~~p~~~Lq~~it~~l~~~~~~~ 637 (1406)
T KOG1146|consen 590 WRCEVCSYETNIARNLRIHMTASPSSSPPSLVLQQNITSSLASLLGGQ 637 (1406)
T ss_pred cchhhhcchhhhhhccccccccCCCCCChHHHhhhcchhhccccccCc
Confidence 889999999999999999997654333 35566666555555444444
No 15
>PHA00733 hypothetical protein
Probab=98.79 E-value=3.7e-09 Score=104.33 Aligned_cols=83 Identities=18% Similarity=0.384 Sum_probs=66.1
Q ss_pred ccccCcccccccCchhhhcccccccCCccccccccccccccccccccccccCCCCcccCCcchhh--hhhHHHHhhhhcC
Q psy4686 1921 TYLCDLCSKVFLEHKDMIRHREWVHGDKCHVCKVCGAKIKGNMKRHMLSHTGEKPFCCDICGKSL--KGNLKAHILRCQG 1998 (2028)
Q Consensus 1921 ~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~~~~~l~~H~~~h~~~k~~~C~~C~~~f--~~~l~~H~~~h~~ 1998 (2028)
++.|.+|.+.|.....|..|. .|.+|+.+| +++||.|+.||+.| ...|..|+++|
T Consensus 40 ~~~~~~~~~~~~~~~~l~~~~--------------------~l~~~~~~~-~~kPy~C~~Cgk~Fss~s~L~~H~r~h-- 96 (128)
T PHA00733 40 RLIRAVVKTLIYNPQLLDESS--------------------YLYKLLTSK-AVSPYVCPLCLMPFSSSVSLKQHIRYT-- 96 (128)
T ss_pred hHHHHHHhhhccChhhhcchH--------------------HHHhhcccC-CCCCccCCCCCCcCCCHHHHHHHHhcC--
Confidence 567777777777655555543 266676555 48999999999999 67899999987
Q ss_pred CCcccCCccCcccCCchhHHHHHHhhCC
Q psy4686 1999 ERPYKCDVCGSSFKDKKHFSVHIRNHGE 2026 (2028)
Q Consensus 1999 ~~p~~C~~C~~~f~~~~~l~~H~~~h~~ 2026 (2028)
+.||.|+.|+++|....+|..|+...++
T Consensus 97 ~~~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 97 EHSKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred CcCccCCCCCCccCCHHHHHHHHHHhcC
Confidence 4689999999999999999999987654
No 16
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.62 E-value=1.4e-08 Score=68.52 Aligned_cols=26 Identities=50% Similarity=1.096 Sum_probs=23.4
Q ss_pred hHHHHhhhhcCCCcccCCccCcccCC
Q psy4686 1988 NLKAHILRCQGERPYKCDVCGSSFKD 2013 (2028)
Q Consensus 1988 ~l~~H~~~h~~~~p~~C~~C~~~f~~ 2013 (2028)
+|..||++|+|++||+|++|+++|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 58899999999999999999999974
No 17
>PHA02768 hypothetical protein; Provisional
Probab=98.62 E-value=1.2e-08 Score=80.38 Aligned_cols=42 Identities=17% Similarity=0.399 Sum_probs=37.5
Q ss_pred CcccCCcchhh--hhhHHHHhhhhcCCCcccCCccCcccCCchhHH
Q psy4686 1975 PFCCDICGKSL--KGNLKAHILRCQGERPYKCDVCGSSFKDKKHFS 2018 (2028)
Q Consensus 1975 ~~~C~~C~~~f--~~~l~~H~~~h~~~~p~~C~~C~~~f~~~~~l~ 2018 (2028)
.|.|++||+.| .++|..||++|+ +||+|..|++.|.+++.|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 48999999999 778999999999 7999999999999888763
No 18
>PHA00733 hypothetical protein
Probab=98.49 E-value=1.1e-07 Score=94.09 Aligned_cols=81 Identities=21% Similarity=0.448 Sum_probs=44.2
Q ss_pred cccccccccccccCChHhHHHH------HhhcCCCCCcCCcchhhhcCCHHHHHHHHHHhcccCCCCCCCCccccccccc
Q psy4686 1508 QRKFECDFCKKKFYRNFDLQEH------INTHTGNKPYQCQICNKSFGTRRNYRLHLKRHKRSAGQLKPEDIHECKICHK 1581 (2028)
Q Consensus 1508 ~~~~~C~~C~~~f~~~~~l~~H------~~~H~~~k~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~~~~~~~~C~~C~~ 1581 (2028)
++++.|.+|...|.....|..| +.. .+.+||.|+.|++.|.+...|..|++.|. .+|.|+.|++
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~-~~~kPy~C~~Cgk~Fss~s~L~~H~r~h~---------~~~~C~~CgK 107 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTS-KAVSPYVCPLCLMPFSSSVSLKQHIRYTE---------HSKVCPVCGK 107 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhccc-CCCCCccCCCCCCcCCCHHHHHHHHhcCC---------cCccCCCCCC
Confidence 3444555555544444433333 222 23556666666666666666666666442 2356666666
Q ss_pred cccChhHHHHHhhhhcC
Q psy4686 1582 IFLENSRLTRHMNFTHG 1598 (2028)
Q Consensus 1582 ~f~~~~~l~~H~~~~~~ 1598 (2028)
.|.....|.+|+...|+
T Consensus 108 ~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 108 EFRNTDSTLDHVCKKHN 124 (128)
T ss_pred ccCCHHHHHHHHHHhcC
Confidence 66666666666665554
No 19
>KOG3993|consensus
Probab=98.40 E-value=7.4e-08 Score=107.44 Aligned_cols=188 Identities=19% Similarity=0.280 Sum_probs=107.4
Q ss_pred CCccCcchhcCCCHHHHHHHH--hhhcCCCCCcccCccccccCCHHHHHHHHhhcccccccc------------------
Q psy4686 569 SKQCPICEKIFAAPKQMRMHL--RHVHSSIKRYMCDICGKQFTTLSYLDLHRKSHNETVKRK------------------ 628 (2028)
Q Consensus 569 ~~~C~~C~~~f~~~~~l~~H~--~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~------------------ 628 (2028)
.|.|..|...|.+...|.+|. +.+|. -|.|+.|+|.|.-..+|..|+|+|.......
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV~v---EYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ 343 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIVHV---EYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQ 343 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeEEe---eecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhh
Confidence 488999999999999999995 44554 3999999999999999999999996321100
Q ss_pred --------cCCCcccccccccccCChHHHHHHHHHhcCCCccccCccccccccchhhhhhhhhccccCccccCCcccccc
Q psy4686 629 --------EEDKKYECDSCNKKFWSKRALSEHMIIHTGIKEHQCHVCNTAFYHIRSLSRHLKIHEYNESKMYKCPVCSKM 700 (2028)
Q Consensus 629 --------~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~ 700 (2028)
..+..|.|.+|+|.|.....|+.|+..|.....-.=. .-.|..... ..-.+-|..|+-.
T Consensus 344 ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~--a~~f~~s~~-----------~~l~~~~~~~a~h 410 (500)
T KOG3993|consen 344 EAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEK--APKFLLSRV-----------IPLMHFNQAVATH 410 (500)
T ss_pred hccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhc--ccCcchhhc-----------ccccccccccccc
Confidence 1223466666666666666666665555422100000 000000000 0011334444443
Q ss_pred cCCHHHHHhhhccccC-CCcccccccCcccccccchhhhccccccccCCCCccccCCccCcccchhHHHhhhccccc
Q psy4686 701 FTELYEMKRHRDHKHG-GKCHVCKICGATIKLFTELEEKEDSSFERKKSKDSFHCDTCGKYFRSRQICNKHIKRVHL 776 (2028)
Q Consensus 701 f~~~~~l~~H~~~~h~-~~~~~C~~C~~~~~~~~~l~~h~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~ 776 (2028)
+.....--.+.-.+-+ .....|+.||..+.....-..+.+. ......|.|.+|.-.|.+...|.+|++..|.
T Consensus 411 ~~a~~~~g~~vl~~a~sael~~pp~~~~ppsss~~sgg~~rl----g~~~q~f~~ky~~atfyss~~ltrhin~~Hp 483 (500)
T KOG3993|consen 411 SSASDSHGDEVLYVAGSAELELPPYDGSPPSSSGSSGGYGRL----GIAEQGFTCKYCPATFYSSPGLTRHINKCHP 483 (500)
T ss_pred cccccccccceeeeeccccccCCCCCCCCcccCCCCCccccc----cchhhccccccchHhhhcCcchHhHhhhcCh
Confidence 3332221111111111 1123455666544444333333322 1223468999999999999999999988886
No 20
>KOG3993|consensus
Probab=98.35 E-value=1.5e-07 Score=105.14 Aligned_cols=196 Identities=19% Similarity=0.281 Sum_probs=110.1
Q ss_pred CCCcccCccccccCCHHHHHHHHh--hcccccccccCCCcccccccccccCChHHHHHHHHHhcCCCcccc---Cccccc
Q psy4686 596 IKRYMCDICGKQFTTLSYLDLHRK--SHNETVKRKEEDKKYECDSCNKKFWSKRALSEHMIIHTGIKEHQC---HVCNTA 670 (2028)
Q Consensus 596 ~~~~~C~~C~~~f~~~~~L~~H~~--~h~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C---~~C~~~ 670 (2028)
...|+|..|...|.+.-.|..|+- +-+ -.|+|+.|+|+|+-..+|..|.+.|.....-.= .-=.+.
T Consensus 265 iGdyiCqLCK~kYeD~F~LAQHrC~RIV~---------vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~ 335 (500)
T KOG3993|consen 265 IGDYICQLCKEKYEDAFALAQHRCPRIVH---------VEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQA 335 (500)
T ss_pred HHHHHHHHHHHhhhhHHHHhhccCCeeEE---------eeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhh
Confidence 345788888888888888888762 211 137788888888877788888777743211000 000000
Q ss_pred cccchhhhhhhhhccccCccccCCcccccccCCHHHHHhhhccccCCCc-----------------ccccccCccccccc
Q psy4686 671 FYHIRSLSRHLKIHEYNESKMYKCPVCSKMFTELYEMKRHRDHKHGGKC-----------------HVCKICGATIKLFT 733 (2028)
Q Consensus 671 f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~-----------------~~C~~C~~~~~~~~ 733 (2028)
-.+....+.--+.-....+..|.|.+|+++|.....|+.|+.+|+.... +.|..|+..+....
T Consensus 336 ~~~rae~~ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~ 415 (500)
T KOG3993|consen 336 VETRAEVQEAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASD 415 (500)
T ss_pred hhhhhhhhhccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhccccccccccccccccccc
Confidence 0000000011111111234579999999999999999999887763210 11222221111100
Q ss_pred chhhhccccccccCCCCccccCCccCcccchhHHHhhhcccccCCCCcccCCcchhhhcChHHHhhhhcccc
Q psy4686 734 ELEEKEDSSFERKKSKDSFHCDTCGKYFRSRQICNKHIKRVHLNPTKTFKCDLCSDRFSTSAKLKKHVDKIH 805 (2028)
Q Consensus 734 ~l~~h~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~~h 805 (2028)
.|.......-.....-.|..|+..+.++..--.+.+.-+ .+..|.|.+|..+|.+.-.|.+|+.+.|
T Consensus 416 ---~~g~~vl~~a~sael~~pp~~~~ppsss~~sgg~~rlg~--~~q~f~~ky~~atfyss~~ltrhin~~H 482 (500)
T KOG3993|consen 416 ---SHGDEVLYVAGSAELELPPYDGSPPSSSGSSGGYGRLGI--AEQGFTCKYCPATFYSSPGLTRHINKCH 482 (500)
T ss_pred ---ccccceeeeeccccccCCCCCCCCcccCCCCCccccccc--hhhccccccchHhhhcCcchHhHhhhcC
Confidence 000000000111223457888877777665555542222 3567999999999999999999998888
No 21
>PHA02768 hypothetical protein; Provisional
Probab=98.34 E-value=1.8e-07 Score=73.89 Aligned_cols=43 Identities=16% Similarity=0.465 Sum_probs=32.6
Q ss_pred cccccccccccCChHhHHHHHhhcCCCCCcCCcchhhhcCCHHHH
Q psy4686 1510 KFECDFCKKKFYRNFDLQEHINTHTGNKPYQCQICNKSFGTRRNY 1554 (2028)
Q Consensus 1510 ~~~C~~C~~~f~~~~~l~~H~~~H~~~k~~~C~~C~~~f~~~~~l 1554 (2028)
.|.|+.||+.|....+|..||++|+ ++|+|..|++.|.+.+.|
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence 3678888888888888888888877 577788888877766654
No 22
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.05 E-value=2e-06 Score=58.14 Aligned_cols=25 Identities=60% Similarity=1.313 Sum_probs=20.4
Q ss_pred HHHHHhhhccCCccccccccchhcc
Q psy4686 1060 YLGVHMRKHNGEKPYNCDYCGQTFA 1084 (2028)
Q Consensus 1060 ~L~~H~~~h~~~~~~~C~~C~~~f~ 1084 (2028)
+|+.||++|+|++||.|++|+++|.
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEES
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeC
Confidence 4778888888888888888888875
No 23
>PHA00616 hypothetical protein
Probab=97.96 E-value=3.9e-06 Score=62.98 Aligned_cols=32 Identities=19% Similarity=0.414 Sum_probs=26.2
Q ss_pred CcccCCcchhh--hhhHHHHhhhhcCCCcccCCc
Q psy4686 1975 PFCCDICGKSL--KGNLKAHILRCQGERPYKCDV 2006 (2028)
Q Consensus 1975 ~~~C~~C~~~f--~~~l~~H~~~h~~~~p~~C~~ 2006 (2028)
||.|+.||+.| ++.|..|++.|+|++||.|+.
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 67888888888 677888888888888888865
No 24
>PHA00616 hypothetical protein
Probab=97.84 E-value=3.6e-06 Score=63.14 Aligned_cols=35 Identities=26% Similarity=0.421 Sum_probs=24.7
Q ss_pred ccccCcccccccCchhhhcccccccCCcccccccc
Q psy4686 1921 TYLCDLCSKVFLEHKDMIRHREWVHGDKCHVCKVC 1955 (2028)
Q Consensus 1921 ~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C 1955 (2028)
||+|..||+.|...++|..|++.|||++++.|+.-
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~~ 35 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEYF 35 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeEE
Confidence 46777777777777777777777777777766543
No 25
>PHA00732 hypothetical protein
Probab=97.72 E-value=1.8e-05 Score=70.18 Aligned_cols=43 Identities=35% Similarity=0.626 Sum_probs=25.2
Q ss_pred CcccCCcchhh--hhhHHHHhhh-hcCCCcccCCccCcccCCchhHHHHHHh
Q psy4686 1975 PFCCDICGKSL--KGNLKAHILR-CQGERPYKCDVCGSSFKDKKHFSVHIRN 2023 (2028)
Q Consensus 1975 ~~~C~~C~~~f--~~~l~~H~~~-h~~~~p~~C~~C~~~f~~~~~l~~H~~~ 2023 (2028)
||.|++||++| .++|+.|++. |+ ++.|+.||++|. .|..|+++
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~ 46 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYS 46 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhcc
Confidence 45666666666 4556666663 43 346666666665 35555544
No 26
>PHA00732 hypothetical protein
Probab=97.65 E-value=2.2e-05 Score=69.69 Aligned_cols=22 Identities=32% Similarity=0.605 Sum_probs=11.9
Q ss_pred cccCCCCcccccchhhHHHHHh
Q psy4686 1886 DILCPICKKGFIALKHMKTHLK 1907 (2028)
Q Consensus 1886 ~~~C~~C~~~f~~~~~l~~H~~ 1907 (2028)
||.|+.||++|.+..+|+.|++
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r 22 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHAR 22 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhh
Confidence 3455555555555555555554
No 27
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.51 E-value=8.7e-05 Score=61.27 Aligned_cols=50 Identities=24% Similarity=0.421 Sum_probs=24.3
Q ss_pred ccCcchhhhccCHHHHHhhhcccCCCC-CcccccccCccccChHHHHHHHHhhc
Q psy4686 103 RTCRICTEVFENLTLLRRHMRVKHPGE-QNLPCRLCDMSFSNKYQKKKHYSAFH 155 (2028)
Q Consensus 103 ~~C~~C~~~f~~~~~l~~H~~~~h~~~-~~~~C~~C~~~~~~~~~l~~H~~~~h 155 (2028)
|.||+|++ .-+...|..|+...|.++ +.+.|++|...+. .+|..||...|
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 45555555 233445555555555433 2355555554332 24555554444
No 28
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.43 E-value=0.00018 Score=59.47 Aligned_cols=53 Identities=30% Similarity=0.495 Sum_probs=40.5
Q ss_pred ccccCCCCCCCCCHHHHHHHHhhhccCCCCccccccccccccChhhhhhhhhhhcc
Q psy4686 255 SFTCDTCDKPFDTIEKCRRHAIRMHMNPCKMFKCDICVASFLTPHKLKTHIKTKHR 310 (2028)
Q Consensus 255 ~~~C~~C~~~~~~~~~l~~H~~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~ 310 (2028)
.|.|+.|++ .-+...|..|+...|....+.+.|++|...+. .+|..||...|+
T Consensus 2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence 478888888 45567888888888887777788888887654 388888877663
No 29
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.31 E-value=0.0001 Score=48.63 Aligned_cols=23 Identities=48% Similarity=1.115 Sum_probs=20.7
Q ss_pred ccCCccCcccCCchhHHHHHHhh
Q psy4686 2002 YKCDVCGSSFKDKKHFSVHIRNH 2024 (2028)
Q Consensus 2002 ~~C~~C~~~f~~~~~l~~H~~~h 2024 (2028)
|+|++|++.|++++.|..|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 78999999999999999999876
No 30
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.64 E-value=0.0011 Score=44.37 Aligned_cols=24 Identities=38% Similarity=1.018 Sum_probs=19.2
Q ss_pred ccCCccCcccCCchhHHHHHHhhC
Q psy4686 2002 YKCDVCGSSFKDKKHFSVHIRNHG 2025 (2028)
Q Consensus 2002 ~~C~~C~~~f~~~~~l~~H~~~h~ 2025 (2028)
|+|++|++.|.++..|..|+++|+
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 789999999999999999998874
No 31
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.54 E-value=0.0011 Score=45.87 Aligned_cols=26 Identities=35% Similarity=0.850 Sum_probs=22.2
Q ss_pred cccCCccCcccCCchhHHHHHHhhCC
Q psy4686 2001 PYKCDVCGSSFKDKKHFSVHIRNHGE 2026 (2028)
Q Consensus 2001 p~~C~~C~~~f~~~~~l~~H~~~h~~ 2026 (2028)
||.|+.|++.|.+...|..|++.|.+
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhcC
Confidence 68899999999999999999888754
No 32
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.40 E-value=0.0011 Score=71.71 Aligned_cols=58 Identities=28% Similarity=0.655 Sum_probs=47.3
Q ss_pred hhhhcCCCCccCC--CCCCcCCCHHHHHhhhhhhc------------------CCCCccccCcchhhhcCchhHhhhh
Q psy4686 1787 WRQLVAEKTKLCP--VCNKEYATPVTMRKHLREVH------------------VSKKKFKCDLCQKQFKASRSLQCHK 1844 (2028)
Q Consensus 1787 ~~~~~~~~~~~C~--~C~~~f~~~~~l~~H~~~~h------------------~~~~~~~C~~C~~~f~~~~~l~~H~ 1844 (2028)
+..+..++||+|+ -|+++|+....|+.|+..-| ...|||.|++|+++|++...|+.|+
T Consensus 341 ~l~v~d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr 418 (423)
T COG5189 341 MLKVKDGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHR 418 (423)
T ss_pred eeEeecCceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecc
Confidence 3445557999994 59999999999999987555 1247899999999999999999884
No 33
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=96.38 E-value=0.0024 Score=42.14 Aligned_cols=21 Identities=43% Similarity=0.698 Sum_probs=9.8
Q ss_pred cccccccccCChHHHHHHHHH
Q psy4686 635 ECDSCNKKFWSKRALSEHMII 655 (2028)
Q Consensus 635 ~C~~C~~~f~~~~~L~~H~~~ 655 (2028)
.|+.|++.|.++..|..|++.
T Consensus 2 ~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 2 KCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCccCCHHHHHHHHhH
Confidence 444444444444444444443
No 34
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.33 E-value=0.0018 Score=62.88 Aligned_cols=73 Identities=25% Similarity=0.430 Sum_probs=21.5
Q ss_pred cCcchhhhccCHHHHHhhhcccCCCCCcccccccCccccChHHHHHHHHhhccCCCCCCCccCCcchhhccChHHHHHHh
Q psy4686 104 TCRICTEVFENLTLLRRHMRVKHPGEQNLPCRLCDMSFSNKYQKKKHYSAFHKGMPFIPSFTCPICQKIFKNKEWYLDHL 183 (2028)
Q Consensus 104 ~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~~~~~~~l~~H~~~~h~~~~~~~~~~C~~C~~~f~~~~~l~~Hl 183 (2028)
+|.+|+..|.+...|+.||...|....+ ....+.....+..++...- ...+.|.+|++.|.+...|..||
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~-----~~~~l~~~~~~~~~~~~~~-----~~~~~C~~C~~~f~s~~~l~~Hm 70 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIP-----DQKYLVDPNRLLNYLRKKV-----KESFRCPYCNKTFRSREALQEHM 70 (100)
T ss_dssp ----------------------------------------------------------SSEEBSSSS-EESSHHHHHHHH
T ss_pred Cccccccccccccccccccccccccccc-----ccccccccccccccccccc-----CCCCCCCccCCCCcCHHHHHHHH
Confidence 5999999999999999999998853222 2222223333444433211 12489999999999999999999
Q ss_pred hhh
Q psy4686 184 SLH 186 (2028)
Q Consensus 184 ~~h 186 (2028)
+.+
T Consensus 71 ~~~ 73 (100)
T PF12756_consen 71 RSK 73 (100)
T ss_dssp HHT
T ss_pred cCc
Confidence 984
No 35
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.07 E-value=0.0021 Score=69.64 Aligned_cols=70 Identities=27% Similarity=0.574 Sum_probs=52.0
Q ss_pred CCcccccc--ccccccChhHHHHHHhhhccCcchhhh--hhccccCcCccCccceecccchhhcCCHHHHHHHHhh
Q psy4686 454 NTIYGCRI--CKKTFLYTNGLRRHILSRHKDTDVVIL--NEVKDIFPSHLQSYQIKCRYCARTFSTQNDLKEHVSS 525 (2028)
Q Consensus 454 ~~~~~C~~--C~k~F~~~~~L~~H~~~~h~~~~~~~~--~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~ 525 (2028)
.+||+|++ |+|.|.....|+-|+..-|....+... ++..+.| -..++||.|++|++.+.+...|+.|...
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F--~~~~KPYrCevC~KRYKNlNGLKYHr~H 420 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIF--SAKDKPYRCEVCDKRYKNLNGLKYHRKH 420 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccc--cccCCceeccccchhhccCccceecccc
Confidence 58999998 999999999999999876643222111 1111222 3456899999999999999999999753
No 36
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=95.92 E-value=0.0049 Score=59.83 Aligned_cols=73 Identities=23% Similarity=0.511 Sum_probs=19.9
Q ss_pred CCCccccccCChHHHHHHHhhhcCCCCcccccccccccccchHHHhhhccccCCCCCceecccchhhccChhhHHHHHhh
Q psy4686 1298 DCPVCHKTYSTPKTMRRHLRQVHTSQKRYLCDICGKQFTSTNRVNIHKACVHSSTGNKFECIYCKKKYRRKFDLKEHINK 1377 (2028)
Q Consensus 1298 ~C~~C~k~f~~~~~L~~H~~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~ 1377 (2028)
+|.+|+..|.+...|..||...|.-.-+ ....+.....+..+.. ......+.|.+|++.|.+...|..||+.
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~-----~~~~l~~~~~~~~~~~---~~~~~~~~C~~C~~~f~s~~~l~~Hm~~ 72 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIP-----DQKYLVDPNRLLNYLR---KKVKESFRCPYCNKTFRSREALQEHMRS 72 (100)
T ss_dssp --------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHH
T ss_pred Cccccccccccccccccccccccccccc-----cccccccccccccccc---cccCCCCCCCccCCCCcCHHHHHHHHcC
Confidence 3899999999999999999877754322 1111223333333332 1222358899999999999999999876
Q ss_pred c
Q psy4686 1378 H 1378 (2028)
Q Consensus 1378 H 1378 (2028)
+
T Consensus 73 ~ 73 (100)
T PF12756_consen 73 K 73 (100)
T ss_dssp T
T ss_pred c
Confidence 4
No 37
>KOG2482|consensus
Probab=95.88 E-value=0.015 Score=64.14 Aligned_cols=189 Identities=17% Similarity=0.228 Sum_probs=93.9
Q ss_pred cccccccCccc-cChHHHHHHHHhhccC---CCC---------------CCCccCCcchhhccChHHHHHHhhhhhhhhh
Q psy4686 131 NLPCRLCDMSF-SNKYQKKKHYSAFHKG---MPF---------------IPSFTCPICQKIFKNKEWYLDHLSLHEEQLI 191 (2028)
Q Consensus 131 ~~~C~~C~~~~-~~~~~l~~H~~~~h~~---~~~---------------~~~~~C~~C~~~f~~~~~l~~Hl~~h~~~~~ 191 (2028)
..+|-+|...+ -+++..-.|+-..|+- .+. ..++.|-+|.+.|.++..|+.||+.......
T Consensus 144 slqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrdkntLkeHMrkK~Hrri 223 (423)
T KOG2482|consen 144 SLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRDKNTLKEHMRKKRHRRI 223 (423)
T ss_pred eeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCCcHHHHHHHHhccCccc
Confidence 36677776555 5566666676666651 110 1237999999999999999999988332222
Q ss_pred hccC--CCCccccccccCCCCCCCCcccchhhhhcccCcccccccccccccCCCCCCc---cccCCCCccccCCCCCCCC
Q psy4686 192 SLNR--RKPIHVRSVRLPSVRGQDIFTTTTKFMRRKAGNVLHVARWFTDCQSIEDPSL---ECKNNKASFTCDTCDKPFD 266 (2028)
Q Consensus 192 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~C~~C~~~~~ 266 (2028)
.+++ .....+.+...- |..+ ........ .+. .++.+..++... +-.......+|-.|...+.
T Consensus 224 nPknreYDkfyiINY~ev---~ks~-t~~~~e~d--------ret-~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~~~~e 290 (423)
T KOG2482|consen 224 NPKNREYDKFYIINYLEV---GKSW-TIVHSEDD--------RET-NEDINETDDTWSDWNEDDAEALSVVCLFCTNFYE 290 (423)
T ss_pred CCCccccceEEEEeHhhc---CCcc-chhhhhhh--------hhh-hccccccccchhhhhcCCCCccceEEEeeccchh
Confidence 1111 111111111100 0000 00000000 000 011111111111 1111223379999999999
Q ss_pred CHHHHHHHHhhhccCCCCccccccccccccChhhhhhhhhhhcccccccccceecCCCcccCCChhHhhhcccc
Q psy4686 267 TIEKCRRHAIRMHMNPCKMFKCDICVASFLTPHKLKTHIKTKHRTQLKELYTFECQHCEDKFSTPFELNKHNES 340 (2028)
Q Consensus 267 ~~~~l~~H~~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~ 340 (2028)
+...|..||..+|.-.-..-+=+ -+.-|-....+..-++. ......|-.|...|.....|..||..
T Consensus 291 n~~~l~eHmk~vHe~Dl~Ki~sd-~~Ln~YqrvrviNyiRk-------q~~~~~c~~cd~~F~~e~~l~~hm~e 356 (423)
T KOG2482|consen 291 NPVFLFEHMKIVHEFDLLKIQSD-YSLNFYQRVRVINYIRK-------QKKKSRCAECDLSFWKEPGLLIHMVE 356 (423)
T ss_pred hHHHHHHHHHHHHHhhHHhhccc-cccchhhhhhHHHHHHH-------HhhccccccccccccCcchhhhhccc
Confidence 99999999988885321110000 01112222233322222 12245678888888888888888754
No 38
>KOG2231|consensus
Probab=95.85 E-value=0.019 Score=71.88 Aligned_cols=48 Identities=23% Similarity=0.388 Sum_probs=34.4
Q ss_pred cccCCCCCCCCCHHHHHHHHhhhccCCCCcccccccc------ccccChhhhhhhhhhhc
Q psy4686 256 FTCDTCDKPFDTIEKCRRHAIRMHMNPCKMFKCDICV------ASFLTPHKLKTHIKTKH 309 (2028)
Q Consensus 256 ~~C~~C~~~~~~~~~l~~H~~~~h~~~~~~~~C~~C~------~~f~~~~~l~~H~~~~h 309 (2028)
-.|..|...|-....|.+|+...| |.|..|. .-|.....|..|.+.+|
T Consensus 183 p~C~~C~~~fld~~el~rH~~~~h------~~chfC~~~~~~neyy~~~~dLe~HfR~~H 236 (669)
T KOG2231|consen 183 PLCKFCHERFLDDDELYRHLRFDH------EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH 236 (669)
T ss_pred ccchhhhhhhccHHHHHHhhccce------eheeecCcccccchhcccchHHHHHhhhcC
Confidence 367778888877777878776555 6666663 45777778888887776
No 39
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=95.77 E-value=0.0069 Score=40.46 Aligned_cols=21 Identities=29% Similarity=0.708 Sum_probs=8.7
Q ss_pred cccccCccccChHHHHHHHHh
Q psy4686 133 PCRLCDMSFSNKYQKKKHYSA 153 (2028)
Q Consensus 133 ~C~~C~~~~~~~~~l~~H~~~ 153 (2028)
.|++|++.|.+...|++|+..
T Consensus 2 ~C~~C~~~~~~~~~l~~H~~~ 22 (24)
T PF13894_consen 2 QCPICGKSFRSKSELRQHMRT 22 (24)
T ss_dssp E-SSTS-EESSHHHHHHHHHH
T ss_pred CCcCCCCcCCcHHHHHHHHHh
Confidence 344444444444444444443
No 40
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.73 E-value=0.0077 Score=41.14 Aligned_cols=25 Identities=40% Similarity=0.925 Sum_probs=21.5
Q ss_pred ccCCccCcccCCchhHHHHHHhhCC
Q psy4686 2002 YKCDVCGSSFKDKKHFSVHIRNHGE 2026 (2028)
Q Consensus 2002 ~~C~~C~~~f~~~~~l~~H~~~h~~ 2026 (2028)
|+|+.|+++|.....|..|++.|..
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~~ 25 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHXX 25 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhcc
Confidence 6899999999999999999988764
No 41
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=95.56 E-value=0.0044 Score=42.79 Aligned_cols=23 Identities=26% Similarity=0.557 Sum_probs=11.8
Q ss_pred cccCcccccccCchhhhcccccc
Q psy4686 1922 YLCDLCSKVFLEHKDMIRHREWV 1944 (2028)
Q Consensus 1922 ~~C~~C~k~f~~~~~L~~H~~~h 1944 (2028)
|.|..|++.|.+..+|..|++.|
T Consensus 2 ~~C~~C~~~F~~~~~l~~H~~~h 24 (27)
T PF13912_consen 2 FECDECGKTFSSLSALREHKRSH 24 (27)
T ss_dssp EEETTTTEEESSHHHHHHHHCTT
T ss_pred CCCCccCCccCChhHHHHHhHHh
Confidence 45555555555555555555444
No 42
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.41 E-value=0.035 Score=61.19 Aligned_cols=85 Identities=18% Similarity=0.271 Sum_probs=56.2
Q ss_pred CCcccCcc--hhhhccCHHHHHhhhcccCCCCCcccccccC---c------cccChHHHHHHHHhhccCCCCCCCccCCc
Q psy4686 100 PDRRTCRI--CTEVFENLTLLRRHMRVKHPGEQNLPCRLCD---M------SFSNKYQKKKHYSAFHKGMPFIPSFTCPI 168 (2028)
Q Consensus 100 ~~~~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~---~------~~~~~~~l~~H~~~~h~~~~~~~~~~C~~ 168 (2028)
...|.||. |.........|..|.+..|. .+.|.+|- + ..-+...|+.|...--.+..+..--.|..
T Consensus 149 LL~F~CP~skc~~~C~~~k~lk~H~K~~H~---~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~F 225 (493)
T COG5236 149 LLSFKCPKSKCHRRCGSLKELKKHYKAQHG---FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIF 225 (493)
T ss_pred HHHhcCCchhhhhhhhhHHHHHHHHHhhcC---cEEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhh
Confidence 35688975 88888889999999998773 47788883 2 22345566666432111112222246888
Q ss_pred chhhccChHHHHHHhhhhh
Q psy4686 169 CQKIFKNKEWYLDHLSLHE 187 (2028)
Q Consensus 169 C~~~f~~~~~l~~Hl~~h~ 187 (2028)
|+..|-+-+.|..|++..|
T Consensus 226 C~~~FYdDDEL~~HcR~~H 244 (493)
T COG5236 226 CKIYFYDDDELRRHCRLRH 244 (493)
T ss_pred ccceecChHHHHHHHHhhh
Confidence 8888888888888887743
No 43
>KOG2785|consensus
Probab=95.06 E-value=0.045 Score=62.52 Aligned_cols=79 Identities=19% Similarity=0.251 Sum_probs=43.9
Q ss_pred cccCcchhhhccCHHHHHhhhcc-cCC--------CCCcccccccCccccChHHHHHHHHhhccCCCCCCCccCCcchhh
Q psy4686 102 RRTCRICTEVFENLTLLRRHMRV-KHP--------GEQNLPCRLCDMSFSNKYQKKKHYSAFHKGMPFIPSFTCPICQKI 172 (2028)
Q Consensus 102 ~~~C~~C~~~f~~~~~l~~H~~~-~h~--------~~~~~~C~~C~~~~~~~~~l~~H~~~~h~~~~~~~~~~C~~C~~~ 172 (2028)
.|+|--|...|.+.+.-+.|++. -|. .-.|+.= ..|..+..-..--... ........+.|.+|.+.
T Consensus 3 ~ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPIta----E~F~~k~~s~~~~~~~-~~e~~~~~~~c~~c~k~ 77 (390)
T KOG2785|consen 3 GFTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITA----EEFNEKVLSDDSEKEE-NLEEAESVVYCEACNKS 77 (390)
T ss_pred cceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCH----HHHhHHHhhhhhhhhh-hhhhcccceehHHhhcc
Confidence 58999999999999999999987 231 0011110 0110000000000000 00111223778888888
Q ss_pred ccChHHHHHHhhh
Q psy4686 173 FKNKEWYLDHLSL 185 (2028)
Q Consensus 173 f~~~~~l~~Hl~~ 185 (2028)
|........||.+
T Consensus 78 ~~s~~a~~~hl~S 90 (390)
T KOG2785|consen 78 FASPKAHENHLKS 90 (390)
T ss_pred ccChhhHHHHHHH
Confidence 8888888888877
No 44
>KOG2231|consensus
Probab=95.04 E-value=0.041 Score=68.96 Aligned_cols=126 Identities=18% Similarity=0.466 Sum_probs=84.2
Q ss_pred CcCCCCCCcccCHHHHHHHHhhhcCCCccccCccCcc---------ccchhhhhhhcccccc-ccccccccccccccccc
Q psy4686 1449 KKCPVCHKTYSTPRQMRIHLREAHSQKKYACDVCGKQ---------FTSTNRVSQHKAHSHF-GIIKTIQRKFECDFCKK 1518 (2028)
Q Consensus 1449 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~---------f~~~~~l~~H~~~~h~-~~~~~~~~~~~C~~C~~ 1518 (2028)
..|..| -.|.+...|+.|+...|+ .+.|..|-.. .-+...|.+|+..--. ...+.| .-.|.+|..
T Consensus 116 ~~~~~c-~~~~s~~~Lk~H~~~~H~--~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rG--hp~C~~C~~ 190 (669)
T KOG2231|consen 116 KECLHC-TEFKSVENLKNHMRDQHK--LHLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRG--HPLCKFCHE 190 (669)
T ss_pred CCCccc-cchhHHHHHHHHHHHhhh--hhccccccccceeeeeeeehehHHHHHHHHhcCCCccccccC--Cccchhhhh
Confidence 468888 888899999999965553 3555555422 2245667777652111 111111 246999999
Q ss_pred ccCChHhHHHHHhhcCCCCCcCCcch------hhhcCCHHHHHHHHHHhcccCCCCCCCCccccc--cccc-cccChhHH
Q psy4686 1519 KFYRNFDLQEHINTHTGNKPYQCQIC------NKSFGTRRNYRLHLKRHKRSAGQLKPEDIHECK--ICHK-IFLENSRL 1589 (2028)
Q Consensus 1519 ~f~~~~~l~~H~~~H~~~k~~~C~~C------~~~f~~~~~l~~H~~~H~~~~~~~~~~~~~~C~--~C~~-~f~~~~~l 1589 (2028)
.|.....|.+||+.+. |.|.+| +..|.....|..|.+.++ |.|. .|.- .|.....+
T Consensus 191 ~fld~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~H-----------flCE~~~C~~~~f~~~~~~ 255 (669)
T KOG2231|consen 191 RFLDDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH-----------FLCEEEFCRTKKFYVAFEL 255 (669)
T ss_pred hhccHHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcC-----------ccccccccccceeeehhHH
Confidence 9999999999998765 667777 467888999999999877 7787 5643 33333344
Q ss_pred HHHhh
Q psy4686 1590 TRHMN 1594 (2028)
Q Consensus 1590 ~~H~~ 1594 (2028)
..|++
T Consensus 256 ei~lk 260 (669)
T KOG2231|consen 256 EIELK 260 (669)
T ss_pred HHHHH
Confidence 44544
No 45
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.02 E-value=0.059 Score=59.51 Aligned_cols=135 Identities=24% Similarity=0.459 Sum_probs=96.8
Q ss_pred cCCCCCCc--cccccCCHHHHHHHhhhhcCCCccceecccccc---ccc------ChhhhhhcccccccC--CCcceecc
Q psy4686 366 SKSKQCPL--CTKIFTTAKHMRVHLRSVHNGKERKFICDICGK---QFT------SNIHASRHKNYAHSR--DRQKMECD 432 (2028)
Q Consensus 366 ~~~~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~~C~~C~~---~f~------~~~~l~~H~~~~h~~--~~~~~~C~ 432 (2028)
-..+.||. |.........|..|++..|. .+.|.+|-. .|. +...|+.|...--.. -+.--.|.
T Consensus 149 LL~F~CP~skc~~~C~~~k~lk~H~K~~H~----~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~ 224 (493)
T COG5236 149 LLSFKCPKSKCHRRCGSLKELKKHYKAQHG----FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCI 224 (493)
T ss_pred HHHhcCCchhhhhhhhhHHHHHHHHHhhcC----cEEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhh
Confidence 34678875 78887889999999998775 488998843 333 456677776522111 11223599
Q ss_pred cccccccChhHHHHHHHHccCCCcccccccccc-------ccChhHHHHHHhhhccCcchhhhhhccccCcCccCcccee
Q psy4686 433 YCKRKFTCKRYLAEHINAHTGNTIYGCRICKKT-------FLYTNGLRRHILSRHKDTDVVILNEVKDIFPSHLQSYQIK 505 (2028)
Q Consensus 433 ~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~-------F~~~~~L~~H~~~~h~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (2028)
.|+..|-+-..|..|++.-+. .|-+|++. |.+..+|..|.+..| |-
T Consensus 225 FC~~~FYdDDEL~~HcR~~HE----~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h-----------------------y~ 277 (493)
T COG5236 225 FCKIYFYDDDELRRHCRLRHE----ACHICDMVGPIRYQYFKSYEDLEAHFRNAH-----------------------YC 277 (493)
T ss_pred hccceecChHHHHHHHHhhhh----hhhhhhccCccchhhhhCHHHHHHHhhcCc-----------------------eE
Confidence 999999999999999987553 58888754 788889999987654 33
Q ss_pred ccc--ch----hhcCCHHHHHHHHhhcccccc
Q psy4686 506 CRY--CA----RTFSTQNDLKEHVSSVHMFMT 531 (2028)
Q Consensus 506 C~~--C~----~~f~~~~~l~~H~~~~h~~~~ 531 (2028)
|.+ |- .+|....+|..|+-..|....
T Consensus 278 ct~qtc~~~k~~vf~~~~el~~h~~~~h~~~~ 309 (493)
T COG5236 278 CTFQTCRVGKCYVFPYHTELLEHLTRFHKVNA 309 (493)
T ss_pred EEEEEEecCcEEEeccHHHHHHHHHHHhhccc
Confidence 332 21 469999999999998886543
No 46
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=94.86 E-value=0.013 Score=39.01 Aligned_cols=24 Identities=33% Similarity=0.700 Sum_probs=18.6
Q ss_pred ccCCccCcccCCchhHHHHHHhhCC
Q psy4686 2002 YKCDVCGSSFKDKKHFSVHIRNHGE 2026 (2028)
Q Consensus 2002 ~~C~~C~~~f~~~~~l~~H~~~h~~ 2026 (2028)
|+|+.|+++.+ +..|..|++.|++
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 78999999888 8899999988754
No 47
>PRK04860 hypothetical protein; Provisional
Probab=94.69 E-value=0.013 Score=60.58 Aligned_cols=38 Identities=29% Similarity=0.707 Sum_probs=25.5
Q ss_pred CcccCCcchhhhhhHHHHhhhhcCCCcccCCccCcccCCc
Q psy4686 1975 PFCCDICGKSLKGNLKAHILRCQGERPYKCDVCGSSFKDK 2014 (2028)
Q Consensus 1975 ~~~C~~C~~~f~~~l~~H~~~h~~~~p~~C~~C~~~f~~~ 2014 (2028)
+|.|. |++ ....++.|.++|+|++||+|..|+..|...
T Consensus 119 ~Y~C~-C~~-~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 119 PYRCK-CQE-HQLTVRRHNRVVRGEAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEcC-CCC-eeCHHHHHHHHhcCCccEECCCCCceeEEe
Confidence 56676 665 455667777777777777777777776544
No 48
>KOG2785|consensus
Probab=94.09 E-value=0.13 Score=58.85 Aligned_cols=74 Identities=22% Similarity=0.234 Sum_probs=44.7
Q ss_pred CCccCcchhcCCCHHHHHHHHhhhcCCCCCcccCccccccCCHHHHHHHHhhcccccccccCCCcccccccc---cccCC
Q psy4686 569 SKQCPICEKIFAAPKQMRMHLRHVHSSIKRYMCDICGKQFTTLSYLDLHRKSHNETVKRKEEDKKYECDSCN---KKFWS 645 (2028)
Q Consensus 569 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~C~~C~---~~f~~ 645 (2028)
|-.|-.|++.|.+...-..||...|+---| + -.-......|..-+..- ....|.|-.|+ +.|.+
T Consensus 166 Pt~CLfC~~~~k~~e~~~~HM~~~HgffIP---d--reYL~D~~GLl~YLgeK--------V~~~~~CL~CN~~~~~f~s 232 (390)
T KOG2785|consen 166 PTDCLFCDKKSKSLEENLKHMFKEHGFFIP---D--REYLTDEKGLLKYLGEK--------VGIGFICLFCNELGRPFSS 232 (390)
T ss_pred CcceeecCCCcccHHHHHHHHhhccCCcCC---c--hHhhhchhHHHHHHHHH--------hccCceEEEeccccCcccc
Confidence 356777888877777777777755542111 0 00112223333322211 23458899999 99999
Q ss_pred hHHHHHHHHH
Q psy4686 646 KRALSEHMII 655 (2028)
Q Consensus 646 ~~~L~~H~~~ 655 (2028)
..+.+.||..
T Consensus 233 leavr~HM~~ 242 (390)
T KOG2785|consen 233 LEAVRAHMRD 242 (390)
T ss_pred cHHHHHHHhh
Confidence 9999999963
No 49
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=93.82 E-value=0.029 Score=37.87 Aligned_cols=23 Identities=30% Similarity=0.886 Sum_probs=20.0
Q ss_pred ccCCccCcccCCchhHHHHHHhh
Q psy4686 2002 YKCDVCGSSFKDKKHFSVHIRNH 2024 (2028)
Q Consensus 2002 ~~C~~C~~~f~~~~~l~~H~~~h 2024 (2028)
|.|++|++.|.+...|..|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 67899999999999999998765
No 50
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=93.75 E-value=0.067 Score=41.49 Aligned_cols=40 Identities=18% Similarity=0.330 Sum_probs=23.3
Q ss_pred HhhhcccCCCCCcccccccCccccChHHHHHHHHhhccCC
Q psy4686 119 RRHMRVKHPGEQNLPCRLCDMSFSNKYQKKKHYSAFHKGM 158 (2028)
Q Consensus 119 ~~H~~~~h~~~~~~~C~~C~~~~~~~~~l~~H~~~~h~~~ 158 (2028)
..+.+.++..+++-+|++|+..+....+|++||...|...
T Consensus 12 ~~~~k~~~~S~~PatCP~C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 12 TKKPKSKSQSEQPATCPICGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp ----CCCCTTS--EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred hhHHHHhhccCCCCCCCcchhhccchhhHHHHHHHHhccc
Confidence 3444555556678888888888888888888888777643
No 51
>PRK04860 hypothetical protein; Provisional
Probab=93.66 E-value=0.019 Score=59.32 Aligned_cols=34 Identities=32% Similarity=0.670 Sum_probs=16.4
Q ss_pred ccccccccccccccccccccccCCCCcccCCcchhh
Q psy4686 1950 HVCKVCGAKIKGNMKRHMLSHTGEKPFCCDICGKSL 1985 (2028)
Q Consensus 1950 ~~C~~C~~~~~~~l~~H~~~h~~~k~~~C~~C~~~f 1985 (2028)
|.|. |++ ....+++|.++|+|+++|.|..|+..|
T Consensus 120 Y~C~-C~~-~~~~~rrH~ri~~g~~~YrC~~C~~~l 153 (160)
T PRK04860 120 YRCK-CQE-HQLTVRRHNRVVRGEAVYRCRRCGETL 153 (160)
T ss_pred EEcC-CCC-eeCHHHHHHHHhcCCccEECCCCCcee
Confidence 4443 443 333344555555555555555555444
No 52
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=93.60 E-value=0.054 Score=42.00 Aligned_cols=28 Identities=21% Similarity=0.629 Sum_probs=12.2
Q ss_pred CCcccCCccCcccCCchhHHHHHHhhCC
Q psy4686 1999 ERPYKCDVCGSSFKDKKHFSVHIRNHGE 2026 (2028)
Q Consensus 1999 ~~p~~C~~C~~~f~~~~~l~~H~~~h~~ 2026 (2028)
+.|-.||+|+..+++..+|++|+..+++
T Consensus 22 ~~PatCP~C~a~~~~srnLrRHle~~H~ 49 (54)
T PF09237_consen 22 EQPATCPICGAVIRQSRNLRRHLEIRHF 49 (54)
T ss_dssp S--EE-TTT--EESSHHHHHHHHHHHTT
T ss_pred CCCCCCCcchhhccchhhHHHHHHHHhc
Confidence 4455555555555555555555544444
No 53
>KOG2482|consensus
Probab=93.06 E-value=0.18 Score=56.20 Aligned_cols=27 Identities=30% Similarity=0.614 Sum_probs=24.2
Q ss_pred ceecccchhhcCCHHHHHHHHhhcccc
Q psy4686 503 QIKCRYCARTFSTQNDLKEHVSSVHMF 529 (2028)
Q Consensus 503 ~~~C~~C~~~f~~~~~l~~H~~~~h~~ 529 (2028)
...|-.|.....+...|..||+.+|..
T Consensus 279 ~v~CLfC~~~~en~~~l~eHmk~vHe~ 305 (423)
T KOG2482|consen 279 SVVCLFCTNFYENPVFLFEHMKIVHEF 305 (423)
T ss_pred ceEEEeeccchhhHHHHHHHHHHHHHh
Confidence 368999999999999999999999854
No 54
>smart00355 ZnF_C2H2 zinc finger.
Probab=92.94 E-value=0.083 Score=35.88 Aligned_cols=23 Identities=30% Similarity=0.852 Sum_probs=14.5
Q ss_pred cCCcchhhhcCCHHHHHHHHHHh
Q psy4686 1539 YQCQICNKSFGTRRNYRLHLKRH 1561 (2028)
Q Consensus 1539 ~~C~~C~~~f~~~~~l~~H~~~H 1561 (2028)
|+|..|++.|.+...|..|++.|
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHh
Confidence 45666666666666666666655
No 55
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=92.93 E-value=0.056 Score=35.99 Aligned_cols=23 Identities=30% Similarity=0.508 Sum_probs=11.6
Q ss_pred cccccccccccChhHHHHHhhhhc
Q psy4686 1574 HECKICHKIFLENSRLTRHMNFTH 1597 (2028)
Q Consensus 1574 ~~C~~C~~~f~~~~~l~~H~~~~~ 1597 (2028)
|+|+.|++... +..|.+|++.+|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 45555555555 555555555544
No 56
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=91.67 E-value=0.11 Score=35.07 Aligned_cols=19 Identities=32% Similarity=0.765 Sum_probs=8.5
Q ss_pred cccccCccccChHHHHHHH
Q psy4686 133 PCRLCDMSFSNKYQKKKHY 151 (2028)
Q Consensus 133 ~C~~C~~~~~~~~~l~~H~ 151 (2028)
.|++|++.|.+...|+.|+
T Consensus 2 ~C~~C~~~f~s~~~~~~H~ 20 (25)
T PF12874_consen 2 YCDICNKSFSSENSLRQHL 20 (25)
T ss_dssp EETTTTEEESSHHHHHHHH
T ss_pred CCCCCCCCcCCHHHHHHHH
Confidence 3444444444444444444
No 57
>KOG2893|consensus
Probab=90.73 E-value=0.072 Score=55.57 Aligned_cols=52 Identities=37% Similarity=0.704 Sum_probs=45.8
Q ss_pred ceecccccccccChhHHHHHHHHccCCCccccccccccccChhHHHHHHhhhccCc
Q psy4686 428 KMECDYCKRKFTCKRYLAEHINAHTGNTIYGCRICKKTFLYTNGLRRHILSRHKDT 483 (2028)
Q Consensus 428 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~~h~~~ 483 (2028)
+-+|-+|++.|....-|.+|.+. +-|+|-+|.|...+--.|..|.+.+|+++
T Consensus 10 kpwcwycnrefddekiliqhqka----khfkchichkkl~sgpglsihcmqvhket 61 (341)
T KOG2893|consen 10 KPWCWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIHCMQVHKET 61 (341)
T ss_pred Cceeeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceeehhhhhhhh
Confidence 34699999999999999999875 34999999999999999999999888765
No 58
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=90.43 E-value=0.19 Score=65.16 Aligned_cols=57 Identities=28% Similarity=0.582 Sum_probs=40.8
Q ss_pred CCccCCCCCCcCCCHHHHHhhhhh-hcCCC--CccccC--cchhhhcCchhHhhhhcccccCC
Q psy4686 1794 KTKLCPVCNKEYATPVTMRKHLRE-VHVSK--KKFKCD--LCQKQFKASRSLQCHKQFVHLGV 1851 (2028)
Q Consensus 1794 ~~~~C~~C~~~f~~~~~l~~H~~~-~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~~H~~~ 1851 (2028)
.++.|..|...|.....|.+|.+. .|.++ +++.|+ .|++.|.+...|..| ...|.+.
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~-~~~~~~~ 349 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRH-ILLHTSI 349 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCC-cccccCC
Confidence 467788888888888888888774 67777 777777 677777777777777 4455544
No 59
>KOG4173|consensus
Probab=90.36 E-value=0.11 Score=53.24 Aligned_cols=82 Identities=27% Similarity=0.626 Sum_probs=65.7
Q ss_pred cceeccc--ccccccChhhhhhcccccccCCCcceecccccccccChhHHHHHHHHc----------cCCCcccccc--c
Q psy4686 397 RKFICDI--CGKQFTSNIHASRHKNYAHSRDRQKMECDYCKRKFTCKRYLAEHINAH----------TGNTIYGCRI--C 462 (2028)
Q Consensus 397 ~~~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~h----------~~~~~~~C~~--C 462 (2028)
..|.|++ |.+.|.+..+...|....|.. .|.+|.+.|.+...|..|+..- -|...|.|-+ |
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~~-----sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgC 152 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHGN-----SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGC 152 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhcccc-----hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhh
Confidence 4577776 778888888888887766654 4999999999999999998543 3567899977 9
Q ss_pred cccccChhHHHHHHhhhccCc
Q psy4686 463 KKTFLYTNGLRRHILSRHKDT 483 (2028)
Q Consensus 463 ~k~F~~~~~L~~H~~~~h~~~ 483 (2028)
...|.+...-+.|+...|+-.
T Consensus 153 t~KFkT~r~RkdH~I~~Hk~P 173 (253)
T KOG4173|consen 153 TEKFKTSRDRKDHMIRMHKYP 173 (253)
T ss_pred hhhhhhhhhhhhHHHHhccCC
Confidence 999999999999998877643
No 60
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=90.33 E-value=0.19 Score=65.29 Aligned_cols=54 Identities=41% Similarity=0.732 Sum_probs=38.8
Q ss_pred ccccccccccccCChHhHHHHHhhcCCCCCcCCcc--hhhhcCCHHHHHHHHHHhc
Q psy4686 1509 RKFECDFCKKKFYRNFDLQEHINTHTGNKPYQCQI--CNKSFGTRRNYRLHLKRHK 1562 (2028)
Q Consensus 1509 ~~~~C~~C~~~f~~~~~l~~H~~~H~~~k~~~C~~--C~~~f~~~~~l~~H~~~H~ 1562 (2028)
..+.|+.|...|.....+..|.+.|++++++.|.. |...|.....+..|...+.
T Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (467)
T COG5048 32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHH 87 (467)
T ss_pred chhhcccccccccccchhhhhcccccccCCccccccccccccCCcchhhhhccccc
Confidence 45667777777777777777777777777777776 5556666777777766665
No 61
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=89.75 E-value=0.18 Score=34.67 Aligned_cols=22 Identities=27% Similarity=0.626 Sum_probs=15.1
Q ss_pred ccCcchhhhccCHHHHHhhhcc
Q psy4686 103 RTCRICTEVFENLTLLRRHMRV 124 (2028)
Q Consensus 103 ~~C~~C~~~f~~~~~l~~H~~~ 124 (2028)
|.|..|++.|.+...|..||+.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 5677777777777777777664
No 62
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=89.62 E-value=0.19 Score=34.63 Aligned_cols=22 Identities=41% Similarity=0.842 Sum_probs=15.5
Q ss_pred cccccccccccChhhhhhhhhh
Q psy4686 286 FKCDICVASFLTPHKLKTHIKT 307 (2028)
Q Consensus 286 ~~C~~C~~~f~~~~~l~~H~~~ 307 (2028)
|.|..|++.|.+...|..|+++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 5677777777777777777664
No 63
>KOG2893|consensus
Probab=87.04 E-value=0.15 Score=53.23 Aligned_cols=48 Identities=25% Similarity=0.569 Sum_probs=40.9
Q ss_pred CcchhhhccCHHHHHhhhcccCCCCCcccccccCccccChHHHHHHHHhhccC
Q psy4686 105 CRICTEVFENLTLLRRHMRVKHPGEQNLPCRLCDMSFSNKYQKKKHYSAFHKG 157 (2028)
Q Consensus 105 C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~~~~~~~l~~H~~~~h~~ 157 (2028)
|=+|++.|++..-|..|.+.+| |+|.||.+.+.+--.|..|...+|..
T Consensus 13 cwycnrefddekiliqhqkakh-----fkchichkkl~sgpglsihcmqvhke 60 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKAKH-----FKCHICHKKLFSGPGLSIHCMQVHKE 60 (341)
T ss_pred eeecccccchhhhhhhhhhhcc-----ceeeeehhhhccCCCceeehhhhhhh
Confidence 7889999999999999998877 89999999888888888886666653
No 64
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=81.80 E-value=0.82 Score=30.73 Aligned_cols=19 Identities=32% Similarity=0.667 Sum_probs=11.7
Q ss_pred ccCCcchhh-hhhHHHHhhh
Q psy4686 1977 CCDICGKSL-KGNLKAHILR 1995 (2028)
Q Consensus 1977 ~C~~C~~~f-~~~l~~H~~~ 1995 (2028)
.|++||+.| .+.|..|+..
T Consensus 4 ~C~~CgR~F~~~~l~~H~~~ 23 (25)
T PF13913_consen 4 PCPICGRKFNPDRLEKHEKI 23 (25)
T ss_pred cCCCCCCEECHHHHHHHHHh
Confidence 566666666 5566666543
No 65
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=81.71 E-value=1.1 Score=30.21 Aligned_cols=21 Identities=24% Similarity=0.692 Sum_probs=18.6
Q ss_pred ccCCccCcccCCchhHHHHHHh
Q psy4686 2002 YKCDVCGSSFKDKKHFSVHIRN 2023 (2028)
Q Consensus 2002 ~~C~~C~~~f~~~~~l~~H~~~ 2023 (2028)
..|++||..| ....|.+|++.
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHh
Confidence 5799999999 88889999875
No 66
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=79.94 E-value=1.1 Score=33.20 Aligned_cols=23 Identities=22% Similarity=0.733 Sum_probs=20.0
Q ss_pred cccCCccCcccCCchhHHHHHHh
Q psy4686 2001 PYKCDVCGSSFKDKKHFSVHIRN 2023 (2028)
Q Consensus 2001 p~~C~~C~~~f~~~~~l~~H~~~ 2023 (2028)
+|-|++|++.|.....+..|++.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 58899999999999999999865
No 67
>KOG4173|consensus
Probab=79.60 E-value=1.5 Score=45.33 Aligned_cols=85 Identities=21% Similarity=0.411 Sum_probs=71.1
Q ss_pred CcccCcc--hhhhccCHHHHHhhhcccCCCCCcccccccCccccChHHHHHHHHhhccC------CCCCCCccC--Ccch
Q psy4686 101 DRRTCRI--CTEVFENLTLLRRHMRVKHPGEQNLPCRLCDMSFSNKYQKKKHYSAFHKG------MPFIPSFTC--PICQ 170 (2028)
Q Consensus 101 ~~~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~~~~~~~l~~H~~~~h~~------~~~~~~~~C--~~C~ 170 (2028)
-.|.|++ |.+.|++.+....|...-|. -.|.+|.+.|++-..|..|+..-|.. .--...|.| ..|.
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~----~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt 153 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHG----NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCT 153 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhccc----chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhh
Confidence 5678876 99999999999999988783 57999999999999999999888872 111456999 5699
Q ss_pred hhccChHHHHHHhhhhhhh
Q psy4686 171 KIFKNKEWYLDHLSLHEEQ 189 (2028)
Q Consensus 171 ~~f~~~~~l~~Hl~~h~~~ 189 (2028)
..|.+.+.-.+||...|..
T Consensus 154 ~KFkT~r~RkdH~I~~Hk~ 172 (253)
T KOG4173|consen 154 EKFKTSRDRKDHMIRMHKY 172 (253)
T ss_pred hhhhhhhhhhhHHHHhccC
Confidence 9999999999999886654
No 68
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=78.10 E-value=3.5 Score=39.42 Aligned_cols=19 Identities=21% Similarity=0.438 Sum_probs=9.2
Q ss_pred CccccCcchhhhcCchhHh
Q psy4686 1823 KKFKCDLCQKQFKASRSLQ 1841 (2028)
Q Consensus 1823 ~~~~C~~C~~~f~~~~~l~ 1841 (2028)
-|..|++||........|.
T Consensus 14 LP~~CpiCgLtLVss~HLA 32 (112)
T TIGR00622 14 LPVECPICGLTLILSTHLA 32 (112)
T ss_pred CCCcCCcCCCEEeccchHH
Confidence 3445555555544444444
No 69
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=72.94 E-value=5.1 Score=39.23 Aligned_cols=29 Identities=28% Similarity=0.575 Sum_probs=24.6
Q ss_pred cCCCCcC----CCCCCcccCHHHHHHHHhhhcC
Q psy4686 1445 LSGTKKC----PVCHKTYSTPRQMRIHLREAHS 1473 (2028)
Q Consensus 1445 ~~~~~~C----~~C~~~f~~~~~l~~H~~~h~~ 1473 (2028)
..+.|.| +.|++.+.+...|++|++.+|+
T Consensus 77 ~~~G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 77 VYDGYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CCCCeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 3356899 9999999999999999988775
No 70
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=71.78 E-value=3 Score=30.77 Aligned_cols=23 Identities=26% Similarity=0.719 Sum_probs=20.8
Q ss_pred CceeccCCCcccccHHHHHHHHH
Q psy4686 1649 GSFKCGICGKCYKTREKCRRHIK 1671 (2028)
Q Consensus 1649 ~~~~C~~C~~~f~~~~~L~~H~~ 1671 (2028)
++|.|++|+..|.+...+..|+.
T Consensus 2 ~~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 2 GGFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred cCeEccccCCccCCHHHHHHHHC
Confidence 36899999999999999999985
No 71
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=70.73 E-value=2.3 Score=30.97 Aligned_cols=8 Identities=25% Similarity=1.136 Sum_probs=3.3
Q ss_pred ccCccccc
Q psy4686 1923 LCDLCSKV 1930 (2028)
Q Consensus 1923 ~C~~C~k~ 1930 (2028)
+|..||..
T Consensus 3 ~C~~CGy~ 10 (33)
T cd00350 3 VCPVCGYI 10 (33)
T ss_pred ECCCCCCE
Confidence 34444433
No 72
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=70.14 E-value=2.5 Score=30.78 Aligned_cols=10 Identities=40% Similarity=1.411 Sum_probs=5.7
Q ss_pred CCcccCCcch
Q psy4686 1974 KPFCCDICGK 1983 (2028)
Q Consensus 1974 k~~~C~~C~~ 1983 (2028)
.|+.|++||.
T Consensus 16 ~~~~CP~Cg~ 25 (33)
T cd00350 16 APWVCPVCGA 25 (33)
T ss_pred CCCcCcCCCC
Confidence 4556666653
No 73
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=69.56 E-value=2.2 Score=33.79 Aligned_cols=29 Identities=31% Similarity=0.579 Sum_probs=20.9
Q ss_pred CCCcccccccCccccChHHHHHHHHhhcc
Q psy4686 128 GEQNLPCRLCDMSFSNKYQKKKHYSAFHK 156 (2028)
Q Consensus 128 ~~~~~~C~~C~~~~~~~~~l~~H~~~~h~ 156 (2028)
|+..++|+-|+..|....++.+|+...|+
T Consensus 14 GE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 14 GEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 55667777777777777777777776665
No 74
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=67.40 E-value=3 Score=30.55 Aligned_cols=9 Identities=56% Similarity=1.523 Sum_probs=4.2
Q ss_pred cccCCccCc
Q psy4686 2001 PYKCDVCGS 2009 (2028)
Q Consensus 2001 p~~C~~C~~ 2009 (2028)
|.+||+||.
T Consensus 18 p~~CP~Cg~ 26 (34)
T cd00729 18 PEKCPICGA 26 (34)
T ss_pred CCcCcCCCC
Confidence 444544443
No 75
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=66.63 E-value=8.8 Score=36.81 Aligned_cols=21 Identities=14% Similarity=0.512 Sum_probs=14.1
Q ss_pred CccccCccCccccchhhhhhh
Q psy4686 1475 KKYACDVCGKQFTSTNRVSQH 1495 (2028)
Q Consensus 1475 ~~~~C~~C~~~f~~~~~l~~H 1495 (2028)
-|..|+.|+....+...|.+-
T Consensus 14 LP~~CpiCgLtLVss~HLARS 34 (112)
T TIGR00622 14 LPVECPICGLTLILSTHLARS 34 (112)
T ss_pred CCCcCCcCCCEEeccchHHHh
Confidence 466777777777776666653
No 76
>KOG2186|consensus
Probab=62.56 E-value=2.5 Score=45.77 Aligned_cols=44 Identities=32% Similarity=0.685 Sum_probs=21.3
Q ss_pred cccccccccc-ccccccccccccCCCCcccCCcchhh-hhhHHHHhh
Q psy4686 1950 HVCKVCGAKI-KGNMKRHMLSHTGEKPFCCDICGKSL-KGNLKAHIL 1994 (2028)
Q Consensus 1950 ~~C~~C~~~~-~~~l~~H~~~h~~~k~~~C~~C~~~f-~~~l~~H~~ 1994 (2028)
|.|+.||-+. +-.|-+|+..=+| .-|.|--||+.| +..++.|..
T Consensus 4 FtCnvCgEsvKKp~vekH~srCrn-~~fSCIDC~k~F~~~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVKKPQVEKHMSRCRN-AYFSCIDCGKTFERVSYKNHTK 49 (276)
T ss_pred EehhhhhhhccccchHHHHHhccC-CeeEEeecccccccchhhhhhh
Confidence 4455555554 3444445444444 335555555555 444555543
No 77
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=61.48 E-value=3.3 Score=32.91 Aligned_cols=30 Identities=20% Similarity=0.396 Sum_probs=24.6
Q ss_pred cCCCCccCCCCCCcCCCHHHHHhhhhhhcC
Q psy4686 1791 VAEKTKLCPVCNKEYATPVTMRKHLREVHV 1820 (2028)
Q Consensus 1791 ~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~ 1820 (2028)
.++.-+.||-|+..|....++.+|+...|.
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 456667899999999999999999886663
No 78
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=57.92 E-value=27 Score=34.23 Aligned_cols=79 Identities=22% Similarity=0.478 Sum_probs=46.1
Q ss_pred ceecccchhhcCCHHHHHHHHhhcccccccc-----------C----CCCCCCcccCCCCCcccccCCCChhHHhhhhcC
Q psy4686 503 QIKCRYCARTFSTQNDLKEHVSSVHMFMTES-----------F----ENPNEPRVLNSNRYPLAIDGGLSLEEYQQIVAT 567 (2028)
Q Consensus 503 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~-----------~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 567 (2028)
-..|..|...... .++..|++..|...... . .++.+........ ....+ ....
T Consensus 11 vlIC~~C~~av~~-~~v~~HL~~~H~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~p~~~~---~Pi~g--------Lp~~ 78 (109)
T PF12013_consen 11 VLICRQCQYAVQP-SEVESHLRKRHHILKSQERQRIVEAIRQWPDLLPDPDDLQIPPDPS---PPIPG--------LPVY 78 (109)
T ss_pred EEEeCCCCcccCc-hHHHHHHHHhcccccHHHHHHHHHHHHhhhhcccCccccCCCCCCC---CcCCC--------CCCC
Confidence 3569999988876 88999999766433110 0 0111100000000 00011 1112
Q ss_pred CCCcc----CcchhcCCCHHHHHHHHhhhc
Q psy4686 568 KSKQC----PICEKIFAAPKQMRMHLRHVH 593 (2028)
Q Consensus 568 ~~~~C----~~C~~~f~~~~~l~~H~~~~H 593 (2028)
..|.| +.|+..+.+...|+.|++..|
T Consensus 79 ~G~~C~~~~~~C~y~~~~~~~m~~H~~~~H 108 (109)
T PF12013_consen 79 DGYRCQCDPPHCGYITRSKKTMRKHWRKEH 108 (109)
T ss_pred CCeeeecCCCCCCcEeccHHHHHHHHHHhc
Confidence 45889 899999999999999998766
No 79
>KOG1280|consensus
Probab=56.53 E-value=7 Score=44.59 Aligned_cols=40 Identities=28% Similarity=0.586 Sum_probs=33.6
Q ss_pred cCCCCCCccccccCCHHHHHHHhhhhcCCCccceeccccc
Q psy4686 366 SKSKQCPLCTKIFTTAKHMRVHLRSVHNGKERKFICDICG 405 (2028)
Q Consensus 366 ~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~C~~C~ 405 (2028)
..+|.||+|+..=.+...|..|+...|.......+|++|+
T Consensus 77 ~qSftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~ 116 (381)
T KOG1280|consen 77 PQSFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA 116 (381)
T ss_pred cccccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence 3478899999999999999999999998766667888885
No 80
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=55.62 E-value=7.3 Score=37.53 Aligned_cols=27 Identities=26% Similarity=0.710 Sum_probs=16.2
Q ss_pred cccCcccccccCchhhhcccccccCCcccccccccccc
Q psy4686 1922 YLCDLCSKVFLEHKDMIRHREWVHGDKCHVCKVCGAKI 1959 (2028)
Q Consensus 1922 ~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~~ 1959 (2028)
..|+-||+.|.. | +..|-+|+.||..|
T Consensus 10 R~Cp~CG~kFYD---L--------nk~PivCP~CG~~~ 36 (108)
T PF09538_consen 10 RTCPSCGAKFYD---L--------NKDPIVCPKCGTEF 36 (108)
T ss_pred ccCCCCcchhcc---C--------CCCCccCCCCCCcc
Confidence 466666666643 2 23466677776666
No 81
>KOG2907|consensus
Probab=53.11 E-value=6.5 Score=37.00 Aligned_cols=32 Identities=25% Similarity=0.635 Sum_probs=21.9
Q ss_pred cccCCcchhhhhhHHHHhhhhc--------CC-CcccCCccCcccCCc
Q psy4686 1976 FCCDICGKSLKGNLKAHILRCQ--------GE-RPYKCDVCGSSFKDK 2014 (2028)
Q Consensus 1976 ~~C~~C~~~f~~~l~~H~~~h~--------~~-~p~~C~~C~~~f~~~ 2014 (2028)
.+|+.||- .+|..|| |. .-|.|+.|++.|+.-
T Consensus 75 ~kCpkCgh-------e~m~Y~T~QlRSADEGQTVFYTC~kC~~k~~e~ 115 (116)
T KOG2907|consen 75 HKCPKCGH-------EEMSYHTLQLRSADEGQTVFYTCPKCKYKFTEN 115 (116)
T ss_pred ccCcccCC-------chhhhhhhhcccccCCceEEEEcCccceeeecc
Confidence 68999995 3444443 22 459999999998753
No 82
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=52.91 E-value=9.2 Score=31.43 Aligned_cols=14 Identities=29% Similarity=0.465 Sum_probs=9.1
Q ss_pred cccCCCCcccccch
Q psy4686 1886 DILCPICKKGFIAL 1899 (2028)
Q Consensus 1886 ~~~C~~C~~~f~~~ 1899 (2028)
.|.|+.||+..+..
T Consensus 27 ~F~CPnCGe~~I~R 40 (61)
T COG2888 27 KFPCPNCGEVEIYR 40 (61)
T ss_pred EeeCCCCCceeeeh
Confidence 46777777665543
No 83
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=51.98 E-value=7.8 Score=40.36 Aligned_cols=12 Identities=58% Similarity=1.284 Sum_probs=6.5
Q ss_pred cCCCCcccCCcc
Q psy4686 1971 TGEKPFCCDICG 1982 (2028)
Q Consensus 1971 ~~~k~~~C~~C~ 1982 (2028)
-|+-|-.||+||
T Consensus 145 ~ge~P~~CPiCg 156 (166)
T COG1592 145 EGEAPEVCPICG 156 (166)
T ss_pred cCCCCCcCCCCC
Confidence 345555555555
No 84
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=48.19 E-value=9.8 Score=42.42 Aligned_cols=10 Identities=30% Similarity=0.936 Sum_probs=6.8
Q ss_pred cccCCcchhh
Q psy4686 1976 FCCDICGKSL 1985 (2028)
Q Consensus 1976 ~~C~~C~~~f 1985 (2028)
..|+.||.+|
T Consensus 49 ~vCP~CgyA~ 58 (214)
T PF09986_consen 49 WVCPHCGYAA 58 (214)
T ss_pred EECCCCCCcc
Confidence 3677777765
No 85
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=47.66 E-value=7.7 Score=43.24 Aligned_cols=10 Identities=40% Similarity=0.873 Sum_probs=8.5
Q ss_pred cccccccccc
Q psy4686 1950 HVCKVCGAKI 1959 (2028)
Q Consensus 1950 ~~C~~C~~~~ 1959 (2028)
.+||.||.+|
T Consensus 49 ~vCP~CgyA~ 58 (214)
T PF09986_consen 49 WVCPHCGYAA 58 (214)
T ss_pred EECCCCCCcc
Confidence 5799999986
No 86
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=47.20 E-value=7 Score=33.28 Aligned_cols=33 Identities=18% Similarity=0.584 Sum_probs=13.7
Q ss_pred ccccccccccccCCCCcccCC----cchhh-hhhHHHH
Q psy4686 1960 KGNMKRHMLSHTGEKPFCCDI----CGKSL-KGNLKAH 1992 (2028)
Q Consensus 1960 ~~~l~~H~~~h~~~k~~~C~~----C~~~f-~~~l~~H 1992 (2028)
+..|..|....=..++..|+. |+..+ +..|..|
T Consensus 23 r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~~~~l~~H 60 (60)
T PF02176_consen 23 RKELDDHLENECPKRPVPCPYSPYGCKERVPREDLEEH 60 (60)
T ss_dssp CCCHHHHHHTTSTTSEEE-SS----S--EEEHHHHHHC
T ss_pred HHHHHHHHHccCCCCcEECCCCCCCCCCccchhHHhCC
Confidence 444444444444444445555 55444 4444443
No 87
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=45.87 E-value=11 Score=39.34 Aligned_cols=24 Identities=33% Similarity=0.964 Sum_probs=16.7
Q ss_pred ccccCcccccccCchhhhcccccccCCcccccccccc
Q psy4686 1921 TYLCDLCSKVFLEHKDMIRHREWVHGDKCHVCKVCGA 1957 (2028)
Q Consensus 1921 ~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~ 1957 (2028)
.|.|..||..+. |+-|-+||+||-
T Consensus 134 ~~vC~vCGy~~~-------------ge~P~~CPiCga 157 (166)
T COG1592 134 VWVCPVCGYTHE-------------GEAPEVCPICGA 157 (166)
T ss_pred EEEcCCCCCccc-------------CCCCCcCCCCCC
Confidence 477888876643 577777888773
No 88
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=45.12 E-value=7.3 Score=33.56 Aligned_cols=36 Identities=28% Similarity=0.626 Sum_probs=22.7
Q ss_pred CccccCcccccccCchhhhcccccccCCcccc-cccccccccccc
Q psy4686 1920 DTYLCDLCSKVFLEHKDMIRHREWVHGDKCHV-CKVCGAKIKGNM 1963 (2028)
Q Consensus 1920 ~~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~-C~~C~~~~~~~l 1963 (2028)
+.|.|..|+..|. +..|| .+.|++ |+.||-.++..|
T Consensus 11 Y~Y~c~~cg~~~d----vvq~~----~ddplt~ce~c~a~~kk~l 47 (82)
T COG2331 11 YSYECTECGNRFD----VVQAM----TDDPLTTCEECGARLKKLL 47 (82)
T ss_pred eEEeecccchHHH----HHHhc----ccCccccChhhChHHHHhh
Confidence 3577888887764 44444 355654 888887664444
No 89
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=44.65 E-value=13 Score=35.98 Aligned_cols=15 Identities=27% Similarity=0.496 Sum_probs=8.3
Q ss_pred CccccCcccccccCc
Q psy4686 963 DIHECKICHKTFLEK 977 (2028)
Q Consensus 963 ~~~~C~~C~~~f~~~ 977 (2028)
.|-.|+.||..|.-.
T Consensus 25 ~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 25 DPIVCPKCGTEFPPE 39 (108)
T ss_pred CCccCCCCCCccCcc
Confidence 455566666655443
No 90
>KOG2186|consensus
Probab=44.18 E-value=10 Score=41.30 Aligned_cols=48 Identities=27% Similarity=0.713 Sum_probs=25.5
Q ss_pred cccCCCCcccc-hhhHHHHhhcccCCCcccccccccccCChhHHHHHhhhcc
Q psy4686 1019 FCCHICGKSLK-GNLKDHILKCHTGERPYKCDVCGSSFKDKWYLGVHMRKHN 1069 (2028)
Q Consensus 1019 ~~C~~C~~~f~-~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~ 1069 (2028)
|.|..||.+.+ ..|..|+.+.+ + .-|.|-.|++.|.. .+...|..--+
T Consensus 4 FtCnvCgEsvKKp~vekH~srCr-n-~~fSCIDC~k~F~~-~sYknH~kCIT 52 (276)
T KOG2186|consen 4 FTCNVCGESVKKPQVEKHMSRCR-N-AYFSCIDCGKTFER-VSYKNHTKCIT 52 (276)
T ss_pred EehhhhhhhccccchHHHHHhcc-C-CeeEEeeccccccc-chhhhhhhhcc
Confidence 55666666655 45555654322 2 44666666666655 44555544333
No 91
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=42.96 E-value=9.2 Score=34.28 Aligned_cols=30 Identities=33% Similarity=0.691 Sum_probs=19.8
Q ss_pred ccccccccccccccccccccccccCCCCcccCCcchhh
Q psy4686 1948 KCHVCKVCGAKIKGNMKRHMLSHTGEKPFCCDICGKSL 1985 (2028)
Q Consensus 1948 ~~~~C~~C~~~~~~~l~~H~~~h~~~k~~~C~~C~~~f 1985 (2028)
.+|+|+.|++.- +.|+-+| .+.|..||..|
T Consensus 34 ~~~~Cp~C~~~~------VkR~a~G--IW~C~kCg~~f 63 (89)
T COG1997 34 AKHVCPFCGRTT------VKRIATG--IWKCRKCGAKF 63 (89)
T ss_pred cCCcCCCCCCcc------eeeeccC--eEEcCCCCCee
Confidence 468888887752 2233333 58888888877
No 92
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=41.38 E-value=18 Score=29.51 Aligned_cols=20 Identities=30% Similarity=0.815 Sum_probs=9.4
Q ss_pred cCCccCcccCCc-----hhHHHHHH
Q psy4686 2003 KCDVCGSSFKDK-----KHFSVHIR 2022 (2028)
Q Consensus 2003 ~C~~C~~~f~~~-----~~l~~H~~ 2022 (2028)
.|.+|++.++.. ++|.+|++
T Consensus 20 ~C~~C~~~l~~~~~~gTs~L~rHl~ 44 (50)
T smart00614 20 KCKYCGKKLSRSSKGGTSNLRRHLR 44 (50)
T ss_pred EecCCCCEeeeCCCCCcHHHHHHHH
Confidence 344444444433 35555555
No 93
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=39.94 E-value=18 Score=35.30 Aligned_cols=27 Identities=15% Similarity=0.358 Sum_probs=15.4
Q ss_pred cccCcccccccCchhhhcccccccCCcccccccccccc
Q psy4686 1922 YLCDLCSKVFLEHKDMIRHREWVHGDKCHVCKVCGAKI 1959 (2028)
Q Consensus 1922 ~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~~ 1959 (2028)
..|+.||+.|.. | +..|-+|+.||..|
T Consensus 10 r~Cp~cg~kFYD---L--------nk~p~vcP~cg~~~ 36 (129)
T TIGR02300 10 RICPNTGSKFYD---L--------NRRPAVSPYTGEQF 36 (129)
T ss_pred ccCCCcCccccc---c--------CCCCccCCCcCCcc
Confidence 456666666642 1 23466666666665
No 94
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=39.93 E-value=21 Score=24.43 Aligned_cols=21 Identities=24% Similarity=0.588 Sum_probs=15.5
Q ss_pred ccCCcchhhccChHHHHHHhhh
Q psy4686 164 FTCPICQKIFKNKEWYLDHLSL 185 (2028)
Q Consensus 164 ~~C~~C~~~f~~~~~l~~Hl~~ 185 (2028)
..||+|++.+ ....+..|+..
T Consensus 2 v~CPiC~~~v-~~~~in~HLD~ 22 (26)
T smart00734 2 VQCPVCFREV-PENLINSHLDS 22 (26)
T ss_pred CcCCCCcCcc-cHHHHHHHHHH
Confidence 4688898888 55677888753
No 95
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=39.78 E-value=21 Score=26.73 Aligned_cols=14 Identities=14% Similarity=0.672 Sum_probs=7.0
Q ss_pred ccccccccccCchH
Q psy4686 869 VCDMCGKQFTSNNR 882 (2028)
Q Consensus 869 ~C~~C~~~f~~~~~ 882 (2028)
.|+.|+..|.....
T Consensus 4 ~Cp~C~~~y~i~d~ 17 (36)
T PF13717_consen 4 TCPNCQAKYEIDDE 17 (36)
T ss_pred ECCCCCCEEeCCHH
Confidence 45555555544433
No 96
>KOG1280|consensus
Probab=39.04 E-value=19 Score=41.37 Aligned_cols=59 Identities=24% Similarity=0.362 Sum_probs=44.4
Q ss_pred CCcccC------cchhhhccCHHHHHhhhcccCC----------------------CCCcccccccCccccChHHHHHHH
Q psy4686 100 PDRRTC------RICTEVFENLTLLRRHMRVKHP----------------------GEQNLPCRLCDMSFSNKYQKKKHY 151 (2028)
Q Consensus 100 ~~~~~C------~~C~~~f~~~~~l~~H~~~~h~----------------------~~~~~~C~~C~~~~~~~~~l~~H~ 151 (2028)
+..|+| ++|..-|.+....-.|-.. |+ ..+.|+|++|++.=.+...|..|+
T Consensus 21 ~rrYkCL~C~DyDlC~sCyen~~tt~~H~~d-HPmqcil~~~dfeL~f~Ge~i~~y~~qSftCPyC~~~Gfte~~f~~Hv 99 (381)
T KOG1280|consen 21 FRRYKCLRCSDYDLCFSCYENGATTPIHDED-HPMQCILSRVDFELYFGGEPISHYDPQSFTCPYCGIMGFTERQFGTHV 99 (381)
T ss_pred eeeeEeeeecchhHHHHHhhcCCCCcccCCC-CceeEEeeccceeeEecCccccccccccccCCcccccccchhHHHHHh
Confidence 467777 5677778887776677653 44 224699999999988999999999
Q ss_pred HhhccCCC
Q psy4686 152 SAFHKGMP 159 (2028)
Q Consensus 152 ~~~h~~~~ 159 (2028)
...|....
T Consensus 100 ~s~Hpda~ 107 (381)
T KOG1280|consen 100 LSQHPEAS 107 (381)
T ss_pred hhcCcccC
Confidence 98887543
No 97
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=37.33 E-value=20 Score=27.08 Aligned_cols=11 Identities=27% Similarity=0.715 Sum_probs=5.4
Q ss_pred ccccccccccC
Q psy4686 869 VCDMCGKQFTS 879 (2028)
Q Consensus 869 ~C~~C~~~f~~ 879 (2028)
.|+.|+..|..
T Consensus 4 ~CP~C~~~~~v 14 (38)
T TIGR02098 4 QCPNCKTSFRV 14 (38)
T ss_pred ECCCCCCEEEe
Confidence 45555555443
No 98
>PF15269 zf-C2H2_7: Zinc-finger
Probab=37.03 E-value=19 Score=27.33 Aligned_cols=22 Identities=32% Similarity=0.665 Sum_probs=17.6
Q ss_pred ccCCccCcccCCchhHHHHHHh
Q psy4686 2002 YKCDVCGSSFKDKKHFSVHIRN 2023 (2028)
Q Consensus 2002 ~~C~~C~~~f~~~~~l~~H~~~ 2023 (2028)
|+|-.|+.+..-+++|-.||+.
T Consensus 21 ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred ceeecCCcccchHHHHHHHHHH
Confidence 6788888888888888888864
No 99
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=36.78 E-value=11 Score=36.73 Aligned_cols=24 Identities=38% Similarity=0.652 Sum_probs=17.2
Q ss_pred ccccccccccccccccccccccCCC
Q psy4686 1950 HVCKVCGAKIKGNMKRHMLSHTGEK 1974 (2028)
Q Consensus 1950 ~~C~~C~~~~~~~l~~H~~~h~~~k 1974 (2028)
.+|-++||+||+ |++|+++|-|--
T Consensus 77 IicLEDGkkfKS-LKRHL~t~~gmT 100 (148)
T COG4957 77 IICLEDGKKFKS-LKRHLTTHYGLT 100 (148)
T ss_pred EEEeccCcchHH-HHHHHhcccCCC
Confidence 467777777765 788888877643
No 100
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=36.57 E-value=15 Score=28.28 Aligned_cols=12 Identities=33% Similarity=0.899 Sum_probs=6.5
Q ss_pred ccccCccccccc
Q psy4686 1921 TYLCDLCSKVFL 1932 (2028)
Q Consensus 1921 ~~~C~~C~k~f~ 1932 (2028)
.|+|..||..|.
T Consensus 5 ~y~C~~Cg~~fe 16 (41)
T smart00834 5 EYRCEDCGHTFE 16 (41)
T ss_pred EEEcCCCCCEEE
Confidence 355555555554
No 101
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=36.33 E-value=21 Score=47.11 Aligned_cols=10 Identities=30% Similarity=0.727 Sum_probs=6.6
Q ss_pred cccCcccccc
Q psy4686 1922 YLCDLCSKVF 1931 (2028)
Q Consensus 1922 ~~C~~C~k~f 1931 (2028)
..|..||..+
T Consensus 436 l~C~~Cg~v~ 445 (730)
T COG1198 436 LLCRDCGYIA 445 (730)
T ss_pred eecccCCCcc
Confidence 3577777764
No 102
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=35.55 E-value=12 Score=40.79 Aligned_cols=180 Identities=23% Similarity=0.465 Sum_probs=92.7
Q ss_pred hcCCCCccccCcchhhhcCchhHhhhhcccccCCCCCCcCccccCccCCC---CCChhHhhhcccccC-C--CccccCCC
Q psy4686 1818 VHVSKKKFKCDLCQKQFKASRSLQCHKQFVHLGVHKPKFLNMECDYCSRK---FPSKNEVTNHIKSHM-G--VRDILCPI 1891 (2028)
Q Consensus 1818 ~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~p~~~~~~C~~C~~~---f~~~~~l~~H~~~h~-~--~~~~~C~~ 1891 (2028)
.|+-+....|+.|++.=.++.--..= .+-...| .|..||+. ..+..-+..|--+.+ | .---+|++
T Consensus 32 ~~pcN~~MeCdkC~r~QKnRAFCYFC----~s~qrlp-----~Ca~Cgk~KCm~k~gdCvvkH~g~~~tGl~mvGaiCDf 102 (314)
T PF06524_consen 32 KHPCNALMECDKCQRKQKNRAFCYFC----QSVQRLP-----MCAHCGKTKCMLKTGDCVVKHPGVFTTGLGMVGAICDF 102 (314)
T ss_pred cCCCCccccchhhhhhccccceeehh----hhhhcCc-----hhhhcCCeeeeccCCCeEEecCceeecccchhhhhhcc
Confidence 34555667788887765554331100 0001133 68888876 344445555654432 2 12236777
Q ss_pred CcccccchhhHHHHHhhcccccCCCCCCCccccCcccccccCchhhhcccccccCCcccccccccccc-ccccccccccc
Q psy4686 1892 CKKGFIALKHMKTHLKKHMWKAGEIPLEDTYLCDLCSKVFLEHKDMIRHREWVHGDKCHVCKVCGAKI-KGNMKRHMLSH 1970 (2028)
Q Consensus 1892 C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~~-~~~l~~H~~~h 1970 (2028)
|..-.- +=+.=+.+| +-.-|+. .-.|..|.+. .|.||.+.|.|..|+.-. .-.--.|+-.=
T Consensus 103 CEawvC---HgrkCl~~H---aC~Cpl~-da~C~EC~R~-----------vw~hGGrif~CsfC~~flCEDDQFEHQAsC 164 (314)
T PF06524_consen 103 CEAWVC---HGRKCLSTH---ACTCPLQ-DAVCIECERG-----------VWDHGGRIFKCSFCDNFLCEDDQFEHQASC 164 (314)
T ss_pred chhhee---ccccccccc---cccCcCC-CcEeeeeecc-----------cccCCCeEEEeecCCCeeeccchhhhhhhh
Confidence 753111 111112233 2222221 1356677665 367888999999997643 11111222110
Q ss_pred --cCCCCcccCCcchhh-h-------hhHHHHhhhhc----CCCcccCCccCcccCCchhHHHHHHhh
Q psy4686 1971 --TGEKPFCCDICGKSL-K-------GNLKAHILRCQ----GERPYKCDVCGSSFKDKKHFSVHIRNH 2024 (2028)
Q Consensus 1971 --~~~k~~~C~~C~~~f-~-------~~l~~H~~~h~----~~~p~~C~~C~~~f~~~~~l~~H~~~h 2024 (2028)
-....|+|.-|++-= . .--..|+|+-. ..+|+.||.||+-......|.+=.|+|
T Consensus 165 QvLe~E~~KC~SCNrlGq~sCLRCK~cfCddHvrrKg~ky~k~k~~PCPKCg~et~eTkdLSmStR~h 232 (314)
T PF06524_consen 165 QVLESETFKCQSCNRLGQYSCLRCKICFCDDHVRRKGFKYEKGKPIPCPKCGYETQETKDLSMSTRSH 232 (314)
T ss_pred hhhhcccccccccccccchhhhheeeeehhhhhhhcccccccCCCCCCCCCCCcccccccceeeeecc
Confidence 012347777777521 1 12334666532 348999999999888877776665555
No 103
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=35.32 E-value=28 Score=25.26 Aligned_cols=11 Identities=27% Similarity=0.727 Sum_probs=6.4
Q ss_pred cccCCCCcccc
Q psy4686 930 YQCQICKKSFA 940 (2028)
Q Consensus 930 ~~C~~C~~~f~ 940 (2028)
|.|..|+..+.
T Consensus 1 Y~C~~Cg~~~~ 11 (32)
T PF03604_consen 1 YICGECGAEVE 11 (32)
T ss_dssp EBESSSSSSE-
T ss_pred CCCCcCCCeeE
Confidence 55666766655
No 104
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=35.23 E-value=25 Score=32.76 Aligned_cols=26 Identities=15% Similarity=0.158 Sum_probs=14.7
Q ss_pred ccCcccccccCchhhhcccccccCCcccccccccccc
Q psy4686 1923 LCDLCSKVFLEHKDMIRHREWVHGDKCHVCKVCGAKI 1959 (2028)
Q Consensus 1923 ~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~~ 1959 (2028)
.|+.||+.|.. | +..|.+|++||++|
T Consensus 11 idPetg~KFYD---L--------NrdPiVsPytG~s~ 36 (129)
T COG4530 11 IDPETGKKFYD---L--------NRDPIVSPYTGKSY 36 (129)
T ss_pred cCccccchhhc---c--------CCCccccCcccccc
Confidence 46666666642 2 23456666666665
No 105
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=34.79 E-value=18 Score=39.46 Aligned_cols=91 Identities=25% Similarity=0.594 Sum_probs=53.4
Q ss_pred cccCCCccccCcCcccc-cchHHHHHhhc--cCCCCcccCCCCcccchhhHHHHhhcccCCCcccccccccccCChhHHH
Q psy4686 986 WVHGDKCHICKVCGAKI-KGNLQKHMLSH--TGEKPFCCHICGKSLKGNLKDHILKCHTGERPYKCDVCGSSFKDKWYLG 1062 (2028)
Q Consensus 986 ~~~~~~~~~C~~C~~~~-~~~l~~H~~~h--~~~~~~~C~~C~~~f~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~L~ 1062 (2028)
+.||.+.|.|..|...+ ..+--.|+.+- -....|+|..|++. ..|.|-.|.--|-.
T Consensus 136 w~hGGrif~CsfC~~flCEDDQFEHQAsCQvLe~E~~KC~SCNrl----------------Gq~sCLRCK~cfCd----- 194 (314)
T PF06524_consen 136 WDHGGRIFKCSFCDNFLCEDDQFEHQASCQVLESETFKCQSCNRL----------------GQYSCLRCKICFCD----- 194 (314)
T ss_pred ccCCCeEEEeecCCCeeeccchhhhhhhhhhhhcccccccccccc----------------cchhhhheeeeehh-----
Confidence 46778888888887655 33334455321 22346888888642 23445555444432
Q ss_pred HHhhh----ccCCccccccccchhccChHHHHHHHhhcc
Q psy4686 1063 VHMRK----HNGEKPYNCDYCGQTFAARSTFTFHLKKHE 1097 (2028)
Q Consensus 1063 ~H~~~----h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 1097 (2028)
.|.+. -...+++.|+.|+........|..-.++|.
T Consensus 195 dHvrrKg~ky~k~k~~PCPKCg~et~eTkdLSmStR~hk 233 (314)
T PF06524_consen 195 DHVRRKGFKYEKGKPIPCPKCGYETQETKDLSMSTRSHK 233 (314)
T ss_pred hhhhhcccccccCCCCCCCCCCCcccccccceeeeecch
Confidence 23332 123378889999888777777766666654
No 106
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=33.29 E-value=26 Score=29.13 Aligned_cols=10 Identities=40% Similarity=1.002 Sum_probs=6.7
Q ss_pred cccCCCCccc
Q psy4686 1886 DILCPICKKG 1895 (2028)
Q Consensus 1886 ~~~C~~C~~~ 1895 (2028)
.|.|+.||+.
T Consensus 25 ~F~CPnCG~~ 34 (59)
T PRK14890 25 KFLCPNCGEV 34 (59)
T ss_pred EeeCCCCCCe
Confidence 4667777765
No 107
>PF12907 zf-met2: Zinc-binding
Probab=33.02 E-value=32 Score=26.34 Aligned_cols=27 Identities=37% Similarity=0.763 Sum_probs=15.8
Q ss_pred ccCcchhhhc---cCHHHHHhhhcccCCCC
Q psy4686 103 RTCRICTEVF---ENLTLLRRHMRVKHPGE 129 (2028)
Q Consensus 103 ~~C~~C~~~f---~~~~~l~~H~~~~h~~~ 129 (2028)
+.|.+|...| .+...|..|...+|+..
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~ 31 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKN 31 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCC
Confidence 4566776433 44455667766666543
No 108
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=31.94 E-value=20 Score=29.48 Aligned_cols=12 Identities=33% Similarity=0.847 Sum_probs=7.8
Q ss_pred ccccCccccccc
Q psy4686 1921 TYLCDLCSKVFL 1932 (2028)
Q Consensus 1921 ~~~C~~C~k~f~ 1932 (2028)
.|.|..||..|.
T Consensus 5 ey~C~~Cg~~fe 16 (52)
T TIGR02605 5 EYRCTACGHRFE 16 (52)
T ss_pred EEEeCCCCCEeE
Confidence 466666766665
No 109
>PRK04023 DNA polymerase II large subunit; Validated
Probab=31.67 E-value=35 Score=45.47 Aligned_cols=35 Identities=23% Similarity=0.672 Sum_probs=20.7
Q ss_pred ccccCcccccccCchhhhcccccccCCccccccccccccccccccccccccCCCCcccCCcchh
Q psy4686 1921 TYLCDLCSKVFLEHKDMIRHREWVHGDKCHVCKVCGAKIKGNMKRHMLSHTGEKPFCCDICGKS 1984 (2028)
Q Consensus 1921 ~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~~~~~l~~H~~~h~~~k~~~C~~C~~~ 1984 (2028)
.+.|+.||.. .+..+.|+.||... .++.|+.||..
T Consensus 638 ~frCP~CG~~---------------Te~i~fCP~CG~~~--------------~~y~CPKCG~E 672 (1121)
T PRK04023 638 YRRCPFCGTH---------------TEPVYRCPRCGIEV--------------EEDECEKCGRE 672 (1121)
T ss_pred cccCCCCCCC---------------CCcceeCccccCcC--------------CCCcCCCCCCC
Confidence 3677777766 13346677774432 13567777753
No 110
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=31.07 E-value=14 Score=38.34 Aligned_cols=12 Identities=33% Similarity=0.567 Sum_probs=6.6
Q ss_pred cccCcccccccC
Q psy4686 1922 YLCDLCSKVFLE 1933 (2028)
Q Consensus 1922 ~~C~~C~k~f~~ 1933 (2028)
|+|+.||++|.+
T Consensus 29 ~~c~~c~~~f~~ 40 (154)
T PRK00464 29 RECLACGKRFTT 40 (154)
T ss_pred eeccccCCcceE
Confidence 555555555554
No 111
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=30.88 E-value=27 Score=32.51 Aligned_cols=28 Identities=21% Similarity=0.287 Sum_probs=20.4
Q ss_pred ccCCCCcccccchhhHHHHHhhcccccCCCCCCCccccCccccccc
Q psy4686 1887 ILCPICKKGFIALKHMKTHLKKHMWKAGEIPLEDTYLCDLCSKVFL 1932 (2028)
Q Consensus 1887 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~C~~C~k~f~ 1932 (2028)
-+|+.||++|.... .+|..|++||++|+
T Consensus 10 ridPetg~KFYDLN------------------rdPiVsPytG~s~P 37 (129)
T COG4530 10 RIDPETGKKFYDLN------------------RDPIVSPYTGKSYP 37 (129)
T ss_pred ccCccccchhhccC------------------CCccccCcccccch
Confidence 46888888886421 23788899999885
No 112
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=30.77 E-value=30 Score=27.28 Aligned_cols=11 Identities=27% Similarity=0.724 Sum_probs=5.1
Q ss_pred cccCCCCcccc
Q psy4686 930 YQCQICKKSFA 940 (2028)
Q Consensus 930 ~~C~~C~~~f~ 940 (2028)
|.|..||..|.
T Consensus 3 Y~C~~Cg~~~~ 13 (44)
T smart00659 3 YICGECGRENE 13 (44)
T ss_pred EECCCCCCEee
Confidence 44445544443
No 113
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=30.24 E-value=32 Score=33.69 Aligned_cols=13 Identities=15% Similarity=0.215 Sum_probs=7.1
Q ss_pred cccCCCCccccCh
Q psy4686 930 YQCQICKKSFANK 942 (2028)
Q Consensus 930 ~~C~~C~~~f~~~ 942 (2028)
..|+.||+.|...
T Consensus 10 r~Cp~cg~kFYDL 22 (129)
T TIGR02300 10 RICPNTGSKFYDL 22 (129)
T ss_pred ccCCCcCcccccc
Confidence 3566666666543
No 114
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=30.17 E-value=23 Score=36.83 Aligned_cols=10 Identities=30% Similarity=0.986 Sum_probs=9.1
Q ss_pred cccccccccc
Q psy4686 1950 HVCKVCGAKI 1959 (2028)
Q Consensus 1950 ~~C~~C~~~~ 1959 (2028)
++|+.||++|
T Consensus 29 ~~c~~c~~~f 38 (154)
T PRK00464 29 RECLACGKRF 38 (154)
T ss_pred eeccccCCcc
Confidence 7899999998
No 115
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=30.06 E-value=34 Score=25.79 Aligned_cols=10 Identities=30% Similarity=0.786 Sum_probs=4.3
Q ss_pred cccccccccc
Q psy4686 902 FECDYCKMKF 911 (2028)
Q Consensus 902 ~~C~~C~~~f 911 (2028)
.+|+.|+..|
T Consensus 26 vrC~~C~~~f 35 (37)
T PF13719_consen 26 VRCPKCGHVF 35 (37)
T ss_pred EECCCCCcEe
Confidence 3444444433
No 116
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=29.91 E-value=23 Score=28.35 Aligned_cols=28 Identities=21% Similarity=0.605 Sum_probs=17.9
Q ss_pred ccccCcccccccCchhhhcccccccCCccccccccccc
Q psy4686 1921 TYLCDLCSKVFLEHKDMIRHREWVHGDKCHVCKVCGAK 1958 (2028)
Q Consensus 1921 ~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~ 1958 (2028)
.|.|..||+.|... .......|+.||..
T Consensus 6 ~Y~C~~Cg~~~~~~----------~~~~~irCp~Cg~r 33 (49)
T COG1996 6 EYKCARCGREVELD----------QETRGIRCPYCGSR 33 (49)
T ss_pred EEEhhhcCCeeehh----------hccCceeCCCCCcE
Confidence 57888888887211 12345678888765
No 117
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=29.48 E-value=52 Score=32.48 Aligned_cols=60 Identities=28% Similarity=0.462 Sum_probs=0.0
Q ss_pred cccccCCcccccccccccc-ccccccccccccCCCCcccCCcchhh----hhhHHHHhhhhcCCCcccCCccCcccCCch
Q psy4686 1941 REWVHGDKCHVCKVCGAKI-KGNMKRHMLSHTGEKPFCCDICGKSL----KGNLKAHILRCQGERPYKCDVCGSSFKDKK 2015 (2028)
Q Consensus 1941 ~~~h~~~~~~~C~~C~~~~-~~~l~~H~~~h~~~k~~~C~~C~~~f----~~~l~~H~~~h~~~~p~~C~~C~~~f~~~~ 2015 (2028)
|..-...+.|.|++|.... ...+. ||-+| ||.+- -.+|..|-.+| =+||.|..+|++.+
T Consensus 72 mnvF~d~~lYeCnIC~etS~ee~FL---------KPneC--CgY~iCn~Cya~LWK~~~~y-----pvCPvCkTSFKss~ 135 (140)
T PF05290_consen 72 MNVFLDPKLYECNICKETSAEERFL---------KPNEC--CGYSICNACYANLWKFCNLY-----PVCPVCKTSFKSSS 135 (140)
T ss_pred heeecCCCceeccCcccccchhhcC---------Ccccc--cchHHHHHHHHHHHHHcccC-----CCCCcccccccccc
Q ss_pred h
Q psy4686 2016 H 2016 (2028)
Q Consensus 2016 ~ 2016 (2028)
.
T Consensus 136 ~ 136 (140)
T PF05290_consen 136 S 136 (140)
T ss_pred c
No 118
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=29.01 E-value=51 Score=27.95 Aligned_cols=27 Identities=19% Similarity=0.543 Sum_probs=11.3
Q ss_pred cchHHHHHhhccCCCCcccCC----CCcccc
Q psy4686 1003 KGNLQKHMLSHTGEKPFCCHI----CGKSLK 1029 (2028)
Q Consensus 1003 ~~~l~~H~~~h~~~~~~~C~~----C~~~f~ 1029 (2028)
+..|..|....=..++..|.+ |+..+.
T Consensus 23 r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~ 53 (60)
T PF02176_consen 23 RKELDDHLENECPKRPVPCPYSPYGCKERVP 53 (60)
T ss_dssp CCCHHHHHHTTSTTSEEE-SS----S--EEE
T ss_pred HHHHHHHHHccCCCCcEECCCCCCCCCCccc
Confidence 444555554333344445555 555444
No 119
>PHA00626 hypothetical protein
Probab=28.70 E-value=25 Score=28.59 Aligned_cols=14 Identities=29% Similarity=0.565 Sum_probs=11.6
Q ss_pred CcccCCccCcccCC
Q psy4686 2000 RPYKCDVCGSSFKD 2013 (2028)
Q Consensus 2000 ~p~~C~~C~~~f~~ 2013 (2028)
..|+|+.||+.|+-
T Consensus 22 nrYkCkdCGY~ft~ 35 (59)
T PHA00626 22 DDYVCCDCGYNDSK 35 (59)
T ss_pred cceEcCCCCCeech
Confidence 46999999998864
No 120
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=28.57 E-value=36 Score=26.89 Aligned_cols=22 Identities=27% Similarity=0.732 Sum_probs=9.0
Q ss_pred cceecccccccccCh----hHHHHHH
Q psy4686 427 QKMECDYCKRKFTCK----RYLAEHI 448 (2028)
Q Consensus 427 ~~~~C~~C~~~f~~~----~~L~~H~ 448 (2028)
....|.+|++.+... ..|..|+
T Consensus 15 ~~a~C~~C~~~~~~~~~~ts~l~~HL 40 (45)
T PF02892_consen 15 KKAKCKYCGKVIKYSSGGTSNLKRHL 40 (45)
T ss_dssp S-EEETTTTEE-----SSTHHHHHHH
T ss_pred CeEEeCCCCeEEeeCCCcHHHHHHhh
Confidence 345566666555543 3455554
No 121
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=28.23 E-value=27 Score=36.36 Aligned_cols=12 Identities=25% Similarity=0.755 Sum_probs=5.6
Q ss_pred cccCCCCccccc
Q psy4686 1886 DILCPICKKGFI 1897 (2028)
Q Consensus 1886 ~~~C~~C~~~f~ 1897 (2028)
-|.|+.|+..|.
T Consensus 99 ~Y~Cp~C~~~y~ 110 (147)
T smart00531 99 YYKCPNCQSKYT 110 (147)
T ss_pred EEECcCCCCEee
Confidence 344444444444
No 122
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=27.51 E-value=31 Score=23.67 Aligned_cols=9 Identities=44% Similarity=1.080 Sum_probs=4.8
Q ss_pred cCCccCccc
Q psy4686 2003 KCDVCGSSF 2011 (2028)
Q Consensus 2003 ~C~~C~~~f 2011 (2028)
.|++||+.|
T Consensus 16 ~Cp~CG~~F 24 (26)
T PF10571_consen 16 FCPHCGYDF 24 (26)
T ss_pred cCCCCCCCC
Confidence 455555555
No 123
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=27.13 E-value=35 Score=27.20 Aligned_cols=11 Identities=36% Similarity=1.183 Sum_probs=5.8
Q ss_pred cccCccCcccc
Q psy4686 1477 YACDVCGKQFT 1487 (2028)
Q Consensus 1477 ~~C~~C~~~f~ 1487 (2028)
|.|+.||..|.
T Consensus 4 y~C~~CG~~~~ 14 (46)
T PRK00398 4 YKCARCGREVE 14 (46)
T ss_pred EECCCCCCEEE
Confidence 55555555443
No 124
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=26.77 E-value=27 Score=36.75 Aligned_cols=18 Identities=22% Similarity=0.248 Sum_probs=8.7
Q ss_pred ccccCCCCcccccchhhH
Q psy4686 1885 RDILCPICKKGFIALKHM 1902 (2028)
Q Consensus 1885 ~~~~C~~C~~~f~~~~~l 1902 (2028)
.-|.|+.|+..|+....+
T Consensus 108 ~~Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 108 MFFICPNMCVRFTFNEAM 125 (158)
T ss_pred CeEECCCCCcEeeHHHHH
Confidence 345555555555544433
No 125
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=26.68 E-value=26 Score=27.27 Aligned_cols=13 Identities=31% Similarity=0.764 Sum_probs=7.2
Q ss_pred ccccCcccccccC
Q psy4686 1921 TYLCDLCSKVFLE 1933 (2028)
Q Consensus 1921 ~~~C~~C~k~f~~ 1933 (2028)
.|.|..||..|-.
T Consensus 5 ey~C~~Cg~~fe~ 17 (42)
T PF09723_consen 5 EYRCEECGHEFEV 17 (42)
T ss_pred EEEeCCCCCEEEE
Confidence 3556666655543
No 126
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=26.45 E-value=29 Score=30.62 Aligned_cols=13 Identities=31% Similarity=0.789 Sum_probs=6.7
Q ss_pred cccccc--ccccccc
Q psy4686 1948 KCHVCK--VCGAKIK 1960 (2028)
Q Consensus 1948 ~~~~C~--~C~~~~~ 1960 (2028)
+-++|. .||.+|.
T Consensus 26 ~Y~qC~N~eCg~tF~ 40 (72)
T PRK09678 26 RYHQCQNVNCSATFI 40 (72)
T ss_pred eeeecCCCCCCCEEE
Confidence 344554 5555553
No 127
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=26.26 E-value=40 Score=35.51 Aligned_cols=32 Identities=19% Similarity=0.406 Sum_probs=18.3
Q ss_pred CCCCccCCCCCCcCCCHHHHHhhhhhhcCCCCccccCcchhh
Q psy4686 1792 AEKTKLCPVCNKEYATPVTMRKHLREVHVSKKKFKCDLCQKQ 1833 (2028)
Q Consensus 1792 ~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~ 1833 (2028)
...-|.|+.|+..|..-..+. ..|.|+.||..
T Consensus 106 ~~~~Y~Cp~c~~r~tf~eA~~----------~~F~Cp~Cg~~ 137 (158)
T TIGR00373 106 NNMFFICPNMCVRFTFNEAME----------LNFTCPRCGAM 137 (158)
T ss_pred CCCeEECCCCCcEeeHHHHHH----------cCCcCCCCCCE
Confidence 344566666666666555552 14666666654
No 128
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=25.71 E-value=44 Score=26.39 Aligned_cols=11 Identities=36% Similarity=0.869 Sum_probs=4.4
Q ss_pred CcccCCcchhh
Q psy4686 1975 PFCCDICGKSL 1985 (2028)
Q Consensus 1975 ~~~C~~C~~~f 1985 (2028)
-..|.+|++.+
T Consensus 16 ~a~C~~C~~~~ 26 (45)
T PF02892_consen 16 KAKCKYCGKVI 26 (45)
T ss_dssp -EEETTTTEE-
T ss_pred eEEeCCCCeEE
Confidence 34455555444
No 129
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=25.17 E-value=29 Score=37.29 Aligned_cols=14 Identities=21% Similarity=0.607 Sum_probs=6.4
Q ss_pred cccCcccccccCch
Q psy4686 1922 YLCDLCSKVFLEHK 1935 (2028)
Q Consensus 1922 ~~C~~C~k~f~~~~ 1935 (2028)
|.|+.|+..|+...
T Consensus 118 Y~Cp~C~~rytf~e 131 (178)
T PRK06266 118 FFCPNCHIRFTFDE 131 (178)
T ss_pred EECCCCCcEEeHHH
Confidence 44444444444433
No 130
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=24.82 E-value=48 Score=37.19 Aligned_cols=97 Identities=25% Similarity=0.471 Sum_probs=0.0
Q ss_pred ccccccccccccCCcccccccccccCCCCCCCCccCCCCccccccccccCCCCCCCchhhhhhhhhcCCCCccCCCCCCc
Q psy4686 1725 KECPVCHKTSMESSSSDINKQTFPKTSKTSTSENENHKPSCGNSFMVHKNQPIDTGDFSLEQWRQLVAEKTKLCPVCNKE 1804 (2028)
Q Consensus 1725 ~~c~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~ 1804 (2028)
|.||.|....... |-.||.|...
T Consensus 309 y~CP~CktkVCsL---------------------------------------------------------Pi~CP~Csl~ 331 (421)
T COG5151 309 YECPVCKTKVCSL---------------------------------------------------------PISCPICSLQ 331 (421)
T ss_pred eeCCcccceeecC---------------------------------------------------------CccCcchhHH
Q ss_pred CCCHHHHHhhhhhhcCCCCccc------------cCcchhhhcCchhHhhhhcccccCCCCCCcCccccCccCCCCCChh
Q psy4686 1805 YATPVTMRKHLREVHVSKKKFK------------CDLCQKQFKASRSLQCHKQFVHLGVHKPKFLNMECDYCSRKFPSKN 1872 (2028)
Q Consensus 1805 f~~~~~l~~H~~~~h~~~~~~~------------C~~C~~~f~~~~~l~~H~~~~H~~~~~p~~~~~~C~~C~~~f~~~~ 1872 (2028)
......|.+-.. +=..-++|. |..|.-.|+. -.+|.....-....|.|+.|...|-.-.
T Consensus 332 LilsthLarSyh-hL~PLk~f~E~p~~~~~ks~~Cf~CQ~~fp~--------~~~~~~~~~~ss~rY~Ce~CK~~FC~dC 402 (421)
T COG5151 332 LILSTHLARSYH-HLYPLKPFVEKPEGTNPKSTHCFVCQGPFPK--------PPVSPFDESTSSGRYQCELCKSTFCSDC 402 (421)
T ss_pred HHHHHHHHHHHH-hhccCcccccccCCCCCCCccceeccCCCCC--------CCCCcccccccccceechhhhhhhhhhh
Q ss_pred HhhhcccccCCCccccCCCCc
Q psy4686 1873 EVTNHIKSHMGVRDILCPICK 1893 (2028)
Q Consensus 1873 ~l~~H~~~h~~~~~~~C~~C~ 1893 (2028)
..-.|...| .|.-|.
T Consensus 403 dvfiHe~Lh------~C~gCe 417 (421)
T COG5151 403 DVFIHETLH------FCIGCE 417 (421)
T ss_pred HHHHHHHHh------hCCCCc
No 131
>PF14353 CpXC: CpXC protein
Probab=24.32 E-value=32 Score=34.82 Aligned_cols=33 Identities=27% Similarity=0.699 Sum_probs=0.0
Q ss_pred cccccccccccccccccccccCCCC-------------cccCCcchhh
Q psy4686 1951 VCKVCGAKIKGNMKRHMLSHTGEKP-------------FCCDICGKSL 1985 (2028)
Q Consensus 1951 ~C~~C~~~~~~~l~~H~~~h~~~k~-------------~~C~~C~~~f 1985 (2028)
.|+.||..|...+.. .+.+.+.| |.|+.||..|
T Consensus 3 tCP~C~~~~~~~v~~--~I~~~~~p~l~e~il~g~l~~~~CP~Cg~~~ 48 (128)
T PF14353_consen 3 TCPHCGHEFEFEVWT--SINADEDPELKEKILDGSLFSFTCPSCGHKF 48 (128)
T ss_pred CCCCCCCeeEEEEEe--EEcCcCCHHHHHHHHcCCcCEEECCCCCCce
No 132
>PRK04023 DNA polymerase II large subunit; Validated
Probab=23.63 E-value=57 Score=43.69 Aligned_cols=47 Identities=23% Similarity=0.588 Sum_probs=34.6
Q ss_pred cccCccCCCCCChhHhhhcccccCCCccccCCCCcccccchhhHHHHHhhcccccCCCCCCCccccCcccccccCchhhh
Q psy4686 1859 MECDYCSRKFPSKNEVTNHIKSHMGVRDILCPICKKGFIALKHMKTHLKKHMWKAGEIPLEDTYLCDLCSKVFLEHKDMI 1938 (2028)
Q Consensus 1859 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~C~~C~k~f~~~~~L~ 1938 (2028)
..|..||... -.+.|+.||... . ..|.|+.|+...
T Consensus 627 RfCpsCG~~t----------------~~frCP~CG~~T-----------------e-----~i~fCP~CG~~~------- 661 (1121)
T PRK04023 627 RKCPSCGKET----------------FYRRCPFCGTHT-----------------E-----PVYRCPRCGIEV------- 661 (1121)
T ss_pred ccCCCCCCcC----------------CcccCCCCCCCC-----------------C-----cceeCccccCcC-------
Confidence 6899999872 357999999861 1 158999996553
Q ss_pred cccccccCCcccccccccccc
Q psy4686 1939 RHREWVHGDKCHVCKVCGAKI 1959 (2028)
Q Consensus 1939 ~H~~~h~~~~~~~C~~C~~~~ 1959 (2028)
.++.|+.||...
T Consensus 662 ---------~~y~CPKCG~El 673 (1121)
T PRK04023 662 ---------EEDECEKCGREP 673 (1121)
T ss_pred ---------CCCcCCCCCCCC
Confidence 247899999754
No 133
>PF15269 zf-C2H2_7: Zinc-finger
Probab=23.52 E-value=50 Score=25.25 Aligned_cols=22 Identities=23% Similarity=0.525 Sum_probs=18.1
Q ss_pred eecccchhhccChhhHHHHHhh
Q psy4686 1356 FECIYCKKKYRRKFDLKEHINK 1377 (2028)
Q Consensus 1356 ~~C~~C~~~f~~~~~L~~H~~~ 1377 (2028)
|.|-.|.+....++.|-.||+.
T Consensus 21 ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred ceeecCCcccchHHHHHHHHHH
Confidence 7888888888888888888865
No 134
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=22.76 E-value=50 Score=34.37 Aligned_cols=35 Identities=17% Similarity=0.431 Sum_probs=20.8
Q ss_pred ccccCccCCCCCChhHhhh-cccccCCCccccCCCCccccc
Q psy4686 1858 NMECDYCSRKFPSKNEVTN-HIKSHMGVRDILCPICKKGFI 1897 (2028)
Q Consensus 1858 ~~~C~~C~~~f~~~~~l~~-H~~~h~~~~~~~C~~C~~~f~ 1897 (2028)
.|.|+.|+..|.....+.. .. ...|.||.||....
T Consensus 99 ~Y~Cp~C~~~y~~~ea~~~~d~-----~~~f~Cp~Cg~~l~ 134 (147)
T smart00531 99 YYKCPNCQSKYTFLEANQLLDM-----DGTFTCPRCGEELE 134 (147)
T ss_pred EEECcCCCCEeeHHHHHHhcCC-----CCcEECCCCCCEEE
Confidence 4777777777775544332 21 23477777776553
No 135
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=22.63 E-value=52 Score=26.22 Aligned_cols=10 Identities=30% Similarity=0.860 Sum_probs=5.2
Q ss_pred CCcccCCcch
Q psy4686 1974 KPFCCDICGK 1983 (2028)
Q Consensus 1974 k~~~C~~C~~ 1983 (2028)
..|.|..|++
T Consensus 36 ~~~~C~~C~~ 45 (46)
T PF12760_consen 36 GRYRCKACRK 45 (46)
T ss_pred CeEECCCCCC
Confidence 3455555543
No 136
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=22.30 E-value=36 Score=25.49 Aligned_cols=17 Identities=24% Similarity=0.550 Sum_probs=8.1
Q ss_pred CCCcccCCccCcccCCc
Q psy4686 1998 GERPYKCDVCGSSFKDK 2014 (2028)
Q Consensus 1998 ~~~p~~C~~C~~~f~~~ 2014 (2028)
...+=+|+.||-.+.++
T Consensus 18 P~~~~~Cd~cg~~L~qR 34 (36)
T PF05191_consen 18 PKVEGVCDNCGGELVQR 34 (36)
T ss_dssp -SSTTBCTTTTEBEBEE
T ss_pred CCCCCccCCCCCeeEeC
Confidence 33445566665554443
No 137
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=22.06 E-value=43 Score=36.91 Aligned_cols=30 Identities=33% Similarity=0.764 Sum_probs=22.5
Q ss_pred cCCCcccCcchhhhccCHHHHHhhhcccCC
Q psy4686 98 ELPDRRTCRICTEVFENLTLLRRHMRVKHP 127 (2028)
Q Consensus 98 ~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~ 127 (2028)
.....|.|++|++.|.-..-+++|+..+|+
T Consensus 73 ~~~~K~~C~lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 73 EDEDKWRCPLCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp SSSEEEEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred HcCCEECCCCCCcccCChHHHHHHHhhcCH
Confidence 345789999999999999999999998884
No 138
>KOG2071|consensus
Probab=21.88 E-value=72 Score=40.20 Aligned_cols=98 Identities=12% Similarity=0.149 Sum_probs=0.0
Q ss_pred CCCcCCcchhhhcCCHHHHHHHHHHhcccCCCCCCCCccccccccccccChhHHHHHhhhhcCCCCCCCCCCchhhhhcc
Q psy4686 1536 NKPYQCQICNKSFGTRRNYRLHLKRHKRSAGQLKPEDIHECKICHKIFLENSRLTRHMNFTHGDKCHGATNSTDKIFRDA 1615 (2028)
Q Consensus 1536 ~k~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~~~~~~~~~~ 1615 (2028)
..|-+|..||.+|........||-.| +..|+++....+..........-+...
T Consensus 416 ~~pnqC~~CG~R~~~~ee~sk~md~H---------------------------~dwh~r~n~~~r~s~~~~~sR~Wf~s~ 468 (579)
T KOG2071|consen 416 DSPNQCKSCGLRFDDSEERSKHMDIH---------------------------DDWHRRKNTTIRGSSKWQKSRSWFPSK 468 (579)
T ss_pred CCcchhcccccccccchhhhhHhhhh---------------------------hhhhhhhhhhcccccccccccceeech
Q ss_pred ccccCCCChhhHHHHhhhhc---------cchhhhhhhccC-CCceeccCCCccc
Q psy4686 1616 DSIVHPNSTVWAKRLITELK---------EDEYSSLKRAIS-KGSFKCGICGKCY 1660 (2028)
Q Consensus 1616 ~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~-~~~~~C~~C~~~f 1660 (2028)
..=+.+.............. +........... +..+.|++|+..|
T Consensus 469 s~W~~~K~~e~t~e~~~~~~~~~~~~~~~s~~~k~~~Vp~d~e~~~~C~IC~EkF 523 (579)
T KOG2071|consen 469 SGWLAAKAGEETDEKEKVEHEELQIKKELSLRSKYELVPADSERQASCPICQEKF 523 (579)
T ss_pred hhhccccccccccccccccchhhcchhhhhhhccceecccCcccccCCccccccc
No 139
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=20.94 E-value=41 Score=36.21 Aligned_cols=33 Identities=27% Similarity=0.522 Sum_probs=24.2
Q ss_pred CCCccccCCCCCCCCCHHHHHHHHhhhccCCCCccccccccccc
Q psy4686 252 NKASFTCDTCDKPFDTIEKCRRHAIRMHMNPCKMFKCDICVASF 295 (2028)
Q Consensus 252 ~~~~~~C~~C~~~~~~~~~l~~H~~~~h~~~~~~~~C~~C~~~f 295 (2028)
+...|.|+.|+.+|+...++. .-|.|+.||...
T Consensus 114 ~~~~Y~Cp~C~~rytf~eA~~-----------~~F~Cp~Cg~~L 146 (178)
T PRK06266 114 NNMFFFCPNCHIRFTFDEAME-----------YGFRCPQCGEML 146 (178)
T ss_pred CCCEEECCCCCcEEeHHHHhh-----------cCCcCCCCCCCC
Confidence 345788988998888877652 248899888743
No 140
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=20.32 E-value=63 Score=42.94 Aligned_cols=41 Identities=22% Similarity=0.483 Sum_probs=26.6
Q ss_pred ccccCcccccccCchhhhcccccccCCccccccccccccccccccccccccCCCCcccCCcchh
Q psy4686 1921 TYLCDLCSKVFLEHKDMIRHREWVHGDKCHVCKVCGAKIKGNMKRHMLSHTGEKPFCCDICGKS 1984 (2028)
Q Consensus 1921 ~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~~~~~l~~H~~~h~~~k~~~C~~C~~~ 1984 (2028)
.+.|+.|+..+. +|...+...|.+||+ ....|..|+.||..
T Consensus 444 v~~Cp~Cd~~lt----------~H~~~~~L~CH~Cg~-------------~~~~p~~Cp~Cgs~ 484 (730)
T COG1198 444 IAECPNCDSPLT----------LHKATGQLRCHYCGY-------------QEPIPQSCPECGSE 484 (730)
T ss_pred cccCCCCCcceE----------EecCCCeeEeCCCCC-------------CCCCCCCCCCCCCC
Confidence 468999988743 233445667777753 34567788888853
No 141
>PF12907 zf-met2: Zinc-binding
Probab=20.04 E-value=78 Score=24.38 Aligned_cols=32 Identities=22% Similarity=0.394 Sum_probs=20.9
Q ss_pred CccCcchh---cCCCHHHHHHHHhhhcCCCCCccc
Q psy4686 570 KQCPICEK---IFAAPKQMRMHLRHVHSSIKRYMC 601 (2028)
Q Consensus 570 ~~C~~C~~---~f~~~~~l~~H~~~~H~~~~~~~C 601 (2028)
++|.+|-. .-.+...|..|....|....+-.|
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~~~~~C 36 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKNTFEEC 36 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCCCHHHc
Confidence 46888874 445557788888877776544333
Done!