Query psy11585
Match_columns 2227
No_of_seqs 1113 out of 8155
Neff 9.5
Searched_HMMs 46136
Date Fri Aug 16 22:19:40 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11585.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11585hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1074|consensus 99.9 1.5E-27 3.3E-32 287.4 9.3 246 1777-2029 604-932 (958)
2 KOG1074|consensus 99.9 1.3E-27 2.9E-32 287.9 5.7 271 462-761 605-931 (958)
3 KOG2462|consensus 99.9 2.2E-24 4.8E-29 231.7 3.4 134 1917-2053 128-265 (279)
4 KOG3608|consensus 99.9 1.9E-22 4.2E-27 219.1 14.9 282 271-604 69-380 (467)
5 KOG3608|consensus 99.9 2.6E-23 5.6E-28 225.8 6.8 225 1750-2060 133-381 (467)
6 KOG2462|consensus 99.9 1.2E-22 2.6E-27 218.4 6.3 132 682-816 131-265 (279)
7 KOG3623|consensus 99.8 2.8E-19 6E-24 211.6 8.8 81 1314-1394 891-971 (1007)
8 KOG3623|consensus 99.8 9.5E-19 2.1E-23 207.1 12.1 105 350-454 211-330 (1007)
9 KOG3576|consensus 99.5 8.8E-15 1.9E-19 147.7 2.0 84 1917-2001 115-198 (267)
10 KOG3576|consensus 99.4 1.9E-14 4.1E-19 145.3 1.3 113 1945-2058 115-239 (267)
11 PLN03086 PRLI-interacting fact 98.8 2.7E-09 5.7E-14 132.4 6.2 133 1920-2072 408-553 (567)
12 PLN03086 PRLI-interacting fact 98.8 4.3E-09 9.2E-14 130.6 5.7 120 1920-2056 434-565 (567)
13 KOG1146|consensus 98.6 6.9E-08 1.5E-12 126.0 7.2 134 68-201 459-638 (1406)
14 PHA00733 hypothetical protein 98.5 3.8E-08 8.2E-13 99.9 3.3 78 1945-2024 38-119 (128)
15 PHA00733 hypothetical protein 98.5 1.2E-07 2.6E-12 96.2 5.3 83 735-819 38-124 (128)
16 KOG3993|consensus 98.4 6.1E-08 1.3E-12 110.5 -0.8 165 350-514 268-482 (500)
17 PHA02768 hypothetical protein; 98.4 1.5E-07 3.3E-12 76.7 1.6 44 1637-1682 5-48 (55)
18 PHA02768 hypothetical protein; 98.3 3.2E-07 7E-12 74.8 2.2 42 1977-2020 6-47 (55)
19 KOG3993|consensus 98.2 6.6E-07 1.4E-11 102.3 3.4 28 574-601 354-381 (500)
20 KOG1146|consensus 98.2 1.3E-05 2.8E-10 105.6 14.7 175 411-595 442-637 (1406)
21 PF13465 zf-H2C2_2: Zinc-finge 98.2 8.9E-07 1.9E-11 61.8 1.7 26 1991-2016 1-26 (26)
22 PF13465 zf-H2C2_2: Zinc-finge 98.1 1.1E-06 2.3E-11 61.4 2.0 26 1652-1677 1-26 (26)
23 PHA00616 hypothetical protein 97.6 1.7E-05 3.6E-10 61.6 1.0 34 1637-1670 1-34 (44)
24 PHA00616 hypothetical protein 97.6 2.8E-05 6.2E-10 60.4 2.0 35 1778-1812 1-35 (44)
25 PHA00732 hypothetical protein 97.5 7.8E-05 1.7E-09 68.2 3.2 48 1976-2029 1-49 (79)
26 PHA00732 hypothetical protein 97.4 9.5E-05 2E-09 67.6 2.7 47 1513-1565 1-48 (79)
27 PF05605 zf-Di19: Drought indu 97.2 0.00023 5E-09 60.5 2.9 49 1948-1999 3-52 (54)
28 PF05605 zf-Di19: Drought indu 96.8 0.0019 4.2E-08 54.9 4.7 48 738-788 3-51 (54)
29 PF00096 zf-C2H2: Zinc finger, 96.4 0.0017 3.6E-08 44.3 1.6 20 1780-1799 2-21 (23)
30 PF00096 zf-C2H2: Zinc finger, 96.3 0.0029 6.2E-08 43.1 2.0 23 1403-1425 1-23 (23)
31 PF12756 zf-C2H2_2: C2H2 type 96.1 0.0025 5.4E-08 63.4 1.4 71 104-174 1-72 (100)
32 KOG2482|consensus 96.0 0.0042 9E-08 70.0 3.1 214 71-298 25-306 (423)
33 PF12756 zf-C2H2_2: C2H2 type 96.0 0.0029 6.3E-08 62.9 1.5 26 2029-2054 48-73 (100)
34 COG5189 SFP1 Putative transcri 95.9 0.0044 9.5E-08 68.9 2.5 71 573-645 346-418 (423)
35 PF13894 zf-C2H2_4: C2H2-type 95.8 0.0039 8.4E-08 43.0 0.9 24 2032-2055 1-24 (24)
36 COG5189 SFP1 Putative transcri 95.6 0.0053 1.1E-07 68.3 1.7 71 100-173 347-419 (423)
37 PF09237 GAGA: GAGA factor; I 95.4 0.011 2.5E-07 47.0 2.4 41 1964-2004 12-52 (54)
38 PF13912 zf-C2H2_6: C2H2-type 95.3 0.0087 1.9E-07 42.7 1.3 25 1806-1830 1-25 (27)
39 PF13912 zf-C2H2_6: C2H2-type 95.3 0.0096 2.1E-07 42.4 1.5 25 1665-1689 1-25 (27)
40 KOG2482|consensus 95.1 0.021 4.6E-07 64.6 4.3 186 88-292 129-355 (423)
41 PF13894 zf-C2H2_4: C2H2-type 95.1 0.014 3E-07 40.2 1.9 23 103-125 1-23 (24)
42 PF09237 GAGA: GAGA factor; I 95.1 0.021 4.6E-07 45.5 2.9 42 1389-1430 11-52 (54)
43 KOG2785|consensus 95.0 0.028 6.1E-07 65.6 4.9 178 102-296 3-245 (390)
44 KOG2231|consensus 94.8 0.057 1.2E-06 69.1 7.5 47 435-486 184-236 (669)
45 KOG2231|consensus 94.6 0.052 1.1E-06 69.5 6.6 39 549-600 100-138 (669)
46 PRK04860 hypothetical protein; 93.2 0.043 9.3E-07 58.3 1.7 39 1636-1678 118-156 (160)
47 PRK04860 hypothetical protein; 93.0 0.057 1.2E-06 57.4 2.2 40 1975-2018 118-157 (160)
48 PF13909 zf-H2C2_5: C2H2-type 92.9 0.049 1.1E-06 37.5 1.0 24 2092-2116 1-24 (24)
49 smart00355 ZnF_C2H2 zinc finge 92.5 0.095 2.1E-06 36.7 2.3 22 1780-1801 2-23 (26)
50 smart00355 ZnF_C2H2 zinc finge 92.1 0.13 2.9E-06 35.9 2.6 24 1403-1426 1-24 (26)
51 KOG2785|consensus 91.9 0.23 4.9E-06 58.4 5.4 75 348-427 165-242 (390)
52 PF13909 zf-H2C2_5: C2H2-type 91.2 0.14 3E-06 35.3 1.8 24 891-915 1-24 (24)
53 PF12874 zf-met: Zinc-finger o 90.5 0.14 3E-06 35.7 1.3 22 1977-1998 1-22 (25)
54 PF12874 zf-met: Zinc-finger o 90.5 0.15 3.2E-06 35.5 1.4 20 1780-1799 2-21 (25)
55 COG5048 FOG: Zn-finger [Genera 90.1 0.2 4.3E-06 66.0 3.4 149 1918-2068 288-454 (467)
56 COG5048 FOG: Zn-finger [Genera 89.9 0.33 7.2E-06 63.9 5.2 158 1946-2127 288-454 (467)
57 COG5236 Uncharacterized conser 89.6 0.34 7.4E-06 55.0 4.0 37 472-512 261-303 (493)
58 PF12171 zf-C2H2_jaz: Zinc-fin 88.5 0.26 5.7E-06 35.0 1.5 22 1948-1969 2-23 (27)
59 COG5236 Uncharacterized conser 88.2 0.55 1.2E-05 53.5 4.4 35 151-188 150-186 (493)
60 PF12171 zf-C2H2_jaz: Zinc-fin 87.5 0.32 7E-06 34.6 1.4 22 2032-2053 2-23 (27)
61 KOG4173|consensus 84.5 0.65 1.4E-05 49.1 2.4 92 315-427 75-168 (253)
62 TIGR00622 ssl1 transcription f 78.3 3.8 8.3E-05 40.2 5.0 47 712-760 58-104 (112)
63 PF13913 zf-C2HC_2: zinc-finge 77.0 1.1 2.5E-05 31.1 0.8 18 1949-1967 4-21 (25)
64 KOG2893|consensus 76.6 0.8 1.7E-05 49.3 -0.0 51 76-130 12-62 (341)
65 PF13913 zf-C2HC_2: zinc-finge 76.3 2.1 4.6E-05 29.8 2.0 21 244-265 3-23 (25)
66 COG4049 Uncharacterized protei 75.9 1.3 2.8E-05 36.2 1.0 30 98-127 13-42 (65)
67 KOG2893|consensus 74.5 1.2 2.5E-05 48.2 0.5 42 578-644 12-53 (341)
68 PF12013 DUF3505: Protein of u 73.9 5.7 0.00012 39.9 5.3 27 889-915 79-109 (109)
69 KOG4173|consensus 73.7 1 2.2E-05 47.8 -0.2 93 73-180 78-174 (253)
70 COG4049 Uncharacterized protei 72.3 1.4 3E-05 36.0 0.4 37 63-99 6-42 (65)
71 cd00350 rubredoxin_like Rubred 69.9 2.6 5.7E-05 31.7 1.3 10 2005-2014 2-11 (33)
72 smart00451 ZnF_U1 U1-like zinc 64.8 5 0.00011 30.6 2.0 21 378-398 4-24 (35)
73 cd00350 rubredoxin_like Rubred 64.4 4.6 0.0001 30.4 1.7 12 2032-2043 2-13 (33)
74 smart00451 ZnF_U1 U1-like zinc 64.1 4.7 0.0001 30.7 1.8 24 2030-2053 2-25 (35)
75 PF10571 UPF0547: Uncharacteri 60.9 5.8 0.00013 28.0 1.5 24 2062-2102 2-25 (26)
76 PF09986 DUF2225: Uncharacteri 60.6 4.5 9.7E-05 46.1 1.6 46 2059-2104 4-61 (214)
77 TIGR00622 ssl1 transcription f 60.6 11 0.00024 37.2 4.0 20 1288-1307 14-33 (112)
78 PF09986 DUF2225: Uncharacteri 57.1 5.1 0.00011 45.7 1.2 25 1945-1969 3-27 (214)
79 PF12013 DUF3505: Protein of u 53.2 12 0.00027 37.5 3.2 20 1190-1209 88-107 (109)
80 cd00729 rubredoxin_SM Rubredox 52.5 8.3 0.00018 29.3 1.3 10 2005-2014 3-12 (34)
81 PF09538 FYDLN_acid: Protein o 48.5 11 0.00025 37.3 1.9 13 2060-2072 26-38 (108)
82 PRK14890 putative Zn-ribbon RN 47.4 21 0.00045 30.6 3.0 10 2031-2040 25-34 (59)
83 COG2888 Predicted Zn-ribbon RN 40.2 18 0.00038 30.9 1.5 36 71-111 24-59 (61)
84 PRK00398 rpoP DNA-directed RNA 39.6 18 0.00038 29.8 1.5 13 2004-2016 3-15 (46)
85 PRK00464 nrdR transcriptional 38.0 12 0.00027 39.7 0.4 12 2061-2072 29-40 (154)
86 smart00659 RPOLCX RNA polymera 37.5 23 0.00049 28.8 1.7 11 2005-2015 3-13 (44)
87 PF03604 DNA_RNApol_7kD: DNA d 37.2 24 0.00051 26.4 1.6 11 2005-2015 1-11 (32)
88 PF09538 FYDLN_acid: Protein o 37.2 21 0.00045 35.5 1.8 31 1919-1960 9-39 (108)
89 smart00834 CxxC_CXXC_SSSS Puta 35.0 20 0.00043 28.5 1.1 11 2005-2015 6-16 (41)
90 COG5188 PRP9 Splicing factor 3 34.4 16 0.00034 42.5 0.6 21 1485-1505 239-259 (470)
91 COG1592 Rubrerythrin [Energy p 34.2 23 0.00051 37.9 1.8 11 2001-2011 146-156 (166)
92 PRK04023 DNA polymerase II lar 33.9 35 0.00075 46.4 3.5 10 1976-1985 626-635 (1121)
93 TIGR02300 FYDLN_acid conserved 33.3 29 0.00063 34.9 2.1 12 2005-2016 10-21 (129)
94 COG2888 Predicted Zn-ribbon RN 32.5 25 0.00055 30.0 1.3 35 150-190 25-59 (61)
95 PHA00626 hypothetical protein 32.3 19 0.00041 30.1 0.5 12 2060-2071 23-34 (59)
96 COG1996 RPC10 DNA-directed RNA 32.0 23 0.00051 29.2 1.0 12 2003-2014 5-16 (49)
97 PF13717 zinc_ribbon_4: zinc-r 31.0 41 0.00089 26.0 2.1 35 74-113 2-36 (36)
98 TIGR02605 CxxC_CxxC_SSSS putat 30.8 25 0.00053 29.8 1.0 11 2032-2042 6-16 (52)
99 PF15269 zf-C2H2_7: Zinc-finge 30.6 14 0.00029 29.0 -0.5 22 2173-2194 20-41 (54)
100 TIGR00373 conserved hypothetic 30.6 35 0.00077 36.8 2.5 32 2001-2040 106-137 (158)
101 KOG2186|consensus 29.9 25 0.00054 39.5 1.2 46 2005-2052 4-49 (276)
102 TIGR01384 TFS_arch transcripti 29.8 16 0.00036 36.3 -0.2 36 243-282 62-101 (104)
103 TIGR02098 MJ0042_CXXC MJ0042 f 29.8 32 0.00069 26.9 1.5 35 243-282 2-36 (38)
104 KOG2807|consensus 28.7 69 0.0015 37.5 4.4 33 653-699 276-308 (378)
105 PF01096 TFIIS_C: Transcriptio 27.6 20 0.00044 28.2 0.0 35 2174-2212 1-39 (39)
106 PF00301 Rubredoxin: Rubredoxi 27.5 43 0.00093 27.7 1.8 12 2005-2016 2-13 (47)
107 smart00531 TFIIE Transcription 27.4 52 0.0011 35.1 3.0 14 2003-2016 98-111 (147)
108 PRK09678 DNA-binding transcrip 27.3 20 0.00044 32.5 -0.0 41 1638-1680 2-44 (72)
109 PRK00464 nrdR transcriptional 27.1 37 0.00079 36.2 1.8 43 549-591 1-43 (154)
110 COG1997 RPL43A Ribosomal prote 26.7 37 0.00081 31.6 1.5 33 2060-2105 35-67 (89)
111 smart00614 ZnF_BED BED zinc fi 26.1 38 0.00082 28.4 1.4 25 2032-2056 19-48 (50)
112 PF09723 Zn-ribbon_8: Zinc rib 25.8 34 0.00074 27.5 1.0 11 2005-2015 6-16 (42)
113 PRK09678 DNA-binding transcrip 25.7 32 0.0007 31.3 0.9 46 2174-2221 2-49 (72)
114 COG1594 RPB9 DNA-directed RNA 24.9 20 0.00043 36.1 -0.6 106 77-192 5-111 (113)
115 COG1592 Rubrerythrin [Energy p 24.5 43 0.00092 36.1 1.7 23 1484-1520 134-156 (166)
116 PRK06266 transcription initiat 24.1 49 0.0011 36.5 2.1 31 2002-2040 115-145 (178)
117 TIGR02300 FYDLN_acid conserved 23.8 51 0.0011 33.3 1.9 35 1919-1964 9-43 (129)
118 KOG4124|consensus 23.3 40 0.00086 39.5 1.3 24 2030-2053 348-373 (442)
119 COG1198 PriA Primosomal protei 23.1 56 0.0012 44.2 2.8 9 2060-2068 475-483 (730)
120 PF13719 zinc_ribbon_5: zinc-r 22.9 65 0.0014 25.1 2.0 8 656-663 5-12 (37)
121 KOG2186|consensus 22.4 41 0.00089 37.9 1.1 46 1752-1799 4-49 (276)
122 TIGR00373 conserved hypothetic 22.0 75 0.0016 34.4 3.0 33 345-386 105-137 (158)
123 cd00730 rubredoxin Rubredoxin; 21.6 54 0.0012 27.6 1.4 11 2005-2015 2-12 (50)
124 PF02892 zf-BED: BED zinc fing 21.4 88 0.0019 25.4 2.7 28 573-600 13-44 (45)
125 KOG2071|consensus 21.2 90 0.002 40.2 3.8 26 1512-1537 417-442 (579)
126 PF12907 zf-met2: Zinc-binding 21.1 24 0.00052 27.9 -0.6 32 1948-1979 2-36 (40)
127 smart00734 ZnF_Rad18 Rad18-lik 20.8 72 0.0016 22.7 1.7 19 272-291 2-20 (26)
128 COG5151 SSL1 RNA polymerase II 20.2 76 0.0016 36.6 2.6 24 737-760 388-411 (421)
No 1
>KOG1074|consensus
Probab=99.94 E-value=1.5e-27 Score=287.41 Aligned_cols=246 Identities=23% Similarity=0.405 Sum_probs=159.1
Q ss_pred CcccccchhhcccChHHHHHHHhhhcCCCcccccccccccccchHHHHHHHhccCCCC-CCccccC---cchhhccCccc
Q psy11585 1777 KNHICEICGKSFKKKDLLREHMIVHSTLRPFLCEFCNAGFKCRKHLLQHYRTHTKPKA-TNSFSSS---KCEESFDNCNN 1852 (2227)
Q Consensus 1777 ~~~~C~~C~k~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~-~~~~~C~---~C~~~f~~~~~ 1852 (2227)
.|-.|-+|-+...-++.|+.|.|+|+||+||+|.+||++|.++.+|..||-+|....+ ...++|| +|...|.+...
T Consensus 604 dPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~ 683 (958)
T KOG1074|consen 604 DPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVT 683 (958)
T ss_pred CccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhccccccccc
Confidence 4678999999999999999999999999999999999999999999999998865433 3578899 99999999999
Q ss_pred ccccccccccCC-c------------cccCCCCCchhHHHHHHHHHHHHhhhcccccccccccccccccccccc-cCCcc
Q psy11585 1853 LWSHMFIKHENS-D------------FVCNLCPPDSKIVIKYAHLLVRHMKKHHTMQLSISSVSKHIKSKTQIF-VDGAI 1918 (2227)
Q Consensus 1853 L~~H~~~~~~~~-~------------~~C~~C~~~~~~~~~~~~~l~~h~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 1918 (2227)
|..|+++|.++. + -+|..|...+.. +..+..++.-+-. +.+ ..+..+..++... .+..-
T Consensus 684 lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~----a~~f~~~~se~~~-~~s--~~~~~~~~~t~t~~~~~tp 756 (958)
T KOG1074|consen 684 LPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSD----ARSFSQQISEQPS-PES--EPDEQMDERTETEELDVTP 756 (958)
T ss_pred ccceEEeecCCCCCCCcccccccchhcccchhhhcccc----cccchhhhhccCC-ccc--CCcccccccccccccccCC
Confidence 999999877321 1 123333332211 1111222221110 011 1111122222211 22233
Q ss_pred ccCCCCCccccCChHHHHHHH-----------------------HhccCCCcc-ccccCccccCCchhhh----hc---h
Q psy11585 1919 RFKCPDCPTILQTFRGLKAHL-----------------------DIHSGEKDY-ACHICNKVFVRHSTLE----NH---M 1967 (2227)
Q Consensus 1919 ~~~C~~C~~~f~~~~~l~~H~-----------------------~~h~~~~~~-~C~~C~~~f~~~~~L~----~H---~ 1967 (2227)
++.+..|+..+.....+..+- ..++.+++. .+.+++-.-...-... .= +
T Consensus 757 ~~~e~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~ 836 (958)
T KOG1074|consen 757 PPPENSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSM 836 (958)
T ss_pred CccccccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhcccccc
Confidence 567888888877765554432 133556666 6777765544332111 00 0
Q ss_pred -----h-----cccCC------------------------CCcccCCCCCccccchhhHhhHhhhhcCCCcccCCCcCcc
Q psy11585 1968 -----K-----AVHEK------------------------IRDFQCKVCDRAFFDVYNLKLHMRIHTGEKKYVCETCGAS 2013 (2227)
Q Consensus 1968 -----~-----~~H~~------------------------~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 2013 (2227)
. ..+.+ .....|.+||+.|.+.+.|++|||+|+|+|||.|.+|++.
T Consensus 837 l~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~a 916 (958)
T KOG1074|consen 837 LNEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEA 916 (958)
T ss_pred cccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchhhhhh
Confidence 0 00000 0115699999999999999999999999999999999999
Q ss_pred cccchhhhhhcccccC
Q psy11585 2014 FVHWGSLNIHNYSHIN 2029 (2227)
Q Consensus 2014 f~~~~~l~~H~~~h~~ 2029 (2227)
|..+++|+.||.+|..
T Consensus 917 FttrgnLKvHMgtH~w 932 (958)
T KOG1074|consen 917 FTTRGNLKVHMGTHMW 932 (958)
T ss_pred hhhhhhhhhhhccccc
Confidence 9999988888888764
No 2
>KOG1074|consensus
Probab=99.94 E-value=1.3e-27 Score=287.93 Aligned_cols=271 Identities=20% Similarity=0.395 Sum_probs=174.9
Q ss_pred ccccCcccCcccChhHHHHHHHHcCCCCccccccccccccChHHHHHHHHhhcccchhhcccCCCCCCcccccchhhhhh
Q psy11585 462 RHICQTCGSEFHTRKNLLTHIRTHNTDRTHVCELCNANLKTRRSLLRHYTTHGTQLAAIAFNNSQSSSSDHRLVKSEVQI 541 (2227)
Q Consensus 462 ~~~C~~C~~~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 541 (2227)
+-+|-+|.++...+++|+-|+|+|+||+||+|.+||+.|.++.+|+.||-.|....+
T Consensus 605 PNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~----------------------- 661 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPP----------------------- 661 (958)
T ss_pred ccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCcc-----------------------
Confidence 578999999999999999999999999999999999999999999999999974322
Q ss_pred hccCcccccCC---CCCCcccCccccccccccccCcc-------------ceecCCCCccccchhHHHHHHHHhcccccc
Q psy11585 542 LEGDRIKYKCP---LCDRIYTSFSETKRHFEVHSGER-------------KYTCSICSKCFFIKNRLSEHYRRVHKMRVS 605 (2227)
Q Consensus 542 ~~~~~~~~~C~---~C~~~f~~~~~l~~H~~~H~~~~-------------~~~C~~C~k~f~~~~~L~~H~~~~H~~~~~ 605 (2227)
-+..+.|+ +|-+.|.....|..|+++|.+.. .-+|..|.+.|.....+..++- .|....+
T Consensus 662 ---~R~q~ScP~~~ic~~kftn~V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~s-e~~~~~s 737 (958)
T KOG1074|consen 662 ---ARVQFSCPSTFICQKKFTNAVTLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQIS-EQPSPES 737 (958)
T ss_pred ---ccccccCCchhhhcccccccccccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhh-ccCCccc
Confidence 23479999 99999999999999999998421 2479999999999999999887 3432222
Q ss_pred ccccccccccccccccCcCccccCCCCccccchhhHHhhhh-----------------------cccCCCcc-cCCCCCc
Q psy11585 606 MARTNDVKKSAEISVDGVTKYKCHICDSIFTRYDSLRLHVR-----------------------THTGDRPY-TCDVCGK 661 (2227)
Q Consensus 606 ~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~-----------------------~H~~~~~~-~C~~C~~ 661 (2227)
.....+.........+...+..+..|+..+.....+..+-. .++++++. .+.+++.
T Consensus 738 ~~~~~~~~~t~t~~~~~tp~~~e~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~ 817 (958)
T KOG1074|consen 738 EPDEQMDERTETEELDVTPPPPENSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGE 817 (958)
T ss_pred CCcccccccccccccccCCCccccccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCc
Confidence 21111111111111223335677888888777666555422 23445666 5666654
Q ss_pred cccChhhhhh-hhcccccC--CcccCCCCCCccc--C-hhHHHHHHHhcC--------CC--CcccCCCCCCccCChhHH
Q psy11585 662 SFVAKKHLNR-HYNCSHAG--FGYQCNICGRVMS--D-STNFKDHLDNHK--------GE--KKYTCEICGTGFMYKSSL 725 (2227)
Q Consensus 662 ~f~~~~~L~~-H~~~~h~~--~~~~C~~C~~~f~--~-~~~l~~H~~~h~--------~~--~~~~C~~C~~~F~~~~~L 725 (2227)
.-...-...- =+...... ....=.. -.+. . ...+..--..-. +. ....|.+|++.|.+.+.|
T Consensus 818 ~~~~~v~~~pvl~~~~~~~l~eg~~t~~--n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsAL 895 (958)
T KOG1074|consen 818 ILAPSVNMDPVLWNQETSMLNEGLATKT--NEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAAL 895 (958)
T ss_pred CCccccccCchhhccccccccccccccc--ccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHH
Confidence 3322211100 00000000 0000000 0000 0 000000000000 00 125688888888888888
Q ss_pred HHHHHhcCCCCccccccCccccCCHHHHHHHHHHhc
Q psy11585 726 HHHKFSHSKERMFQCSFCEKKYMSPKTLKEHEQTHR 761 (2227)
Q Consensus 726 ~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h~ 761 (2227)
..|+++|++++||.|.+|++.|..+.+|+.||.+|+
T Consensus 896 qiH~rTHtg~KPF~C~fC~~aFttrgnLKvHMgtH~ 931 (958)
T KOG1074|consen 896 EIHMRTHTGPKPFFCHFCEEAFTTRGNLKVHMGTHM 931 (958)
T ss_pred HHhhhcCCCCCCccchhhhhhhhhhhhhhhhhcccc
Confidence 888888888888888888888888888888888874
No 3
>KOG2462|consensus
Probab=99.89 E-value=2.2e-24 Score=231.68 Aligned_cols=134 Identities=31% Similarity=0.704 Sum_probs=126.5
Q ss_pred ccccCCCCCccccCChHHHHHHHHhccC---CCccccccCccccCCchhhhhchhcccCCCCcccCCCCCccccchhhHh
Q psy11585 1917 AIRFKCPDCPTILQTFRGLKAHLDIHSG---EKDYACHICNKVFVRHSTLENHMKAVHEKIRDFQCKVCDRAFFDVYNLK 1993 (2227)
Q Consensus 1917 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~---~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~ 1993 (2227)
...|+|+.||+.|.+..+|.+|..+|.. .+.+.|++|||+|.+...|+.||++ |+ -+++|.+|||.|.+.|.|+
T Consensus 128 ~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirT-H~--l~c~C~iCGKaFSRPWLLQ 204 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRT-HT--LPCECGICGKAFSRPWLLQ 204 (279)
T ss_pred CCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhc-cC--CCcccccccccccchHHhh
Confidence 3469999999999999999999999954 4679999999999999999999998 77 5899999999999999999
Q ss_pred hHhhhhcCCCcccCCCcCcccccchhhhhhcccccC-CceecCcCcCcCCCchHHHhhhcc
Q psy11585 1994 LHMRIHTGEKKYVCETCGASFVHWGSLNIHNYSHIN-AQFVCSFCGNTYKNPKSLDSHIRN 2053 (2227)
Q Consensus 1994 ~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~-~~~~C~~C~~~f~~~~~l~~H~~~ 2053 (2227)
-|+|+|||||||.|+.|++.|..+++|+.||.+|.+ ++|.|+.|+|.|..++.|.+|...
T Consensus 205 GHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 205 GHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred cccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence 999999999999999999999999999999999998 899999999999999999999864
No 4
>KOG3608|consensus
Probab=99.88 E-value=1.9e-22 Score=219.07 Aligned_cols=282 Identities=24% Similarity=0.478 Sum_probs=153.0
Q ss_pred cccccc--cccccCC-hHHHHHHHHHhcccccc------------c---CccccccccccccCCcccccCCCCCCCcccc
Q psy11585 271 HFVCSV--CQRGFFM-KNRLNEHYKRVHHMNFT------------S---RDHDLRRETETNVDGVRKYKCPHPGCPSSFQ 332 (2227)
Q Consensus 271 ~~~C~~--C~~~f~~-~~~l~~H~~~~H~~~~~------------~---~~~~~~~~~~~~~~~~~~~~C~~~~C~~~f~ 332 (2227)
.++|.+ |++...+ ...|.+|+ -.|.-... . ......+..+........|.|.+..|+..|.
T Consensus 69 e~qC~w~~C~f~~~~~s~~l~RHv-y~H~y~~~l~q~G~~al~~~~dig~c~~~f~~~~~ip~~g~~f~C~WedCe~~F~ 147 (467)
T KOG3608|consen 69 EHQCTWNSCDFRTENSSADLIRHV-YFHCYHTKLKQQGKLALDLHPDIGACTAPFRLMEKIPALGQNFRCGWEDCEREFV 147 (467)
T ss_pred ceeEEeccCCccccchHHHHHhhh-hhhhhHHHHHHHHHHHHhcCCCcCcccCCcchhhccccchhhhccChhhcCCccc
Confidence 477765 7766555 46788887 44421100 0 0000111112233444568888888999999
Q ss_pred ChhHHHHHHhhcCCCCCccCCccccccCChHHHHHHHHHhcCCC--CcccC--CCCCccCChhHHHHHHhhhcCCCcccc
Q psy11585 333 RFNALQEHMLSHTGEKPYTCEACGKSFPLKRRLNAHYNKWHLGK--GYRCH--ICGSTMSNAANFKDHLDSHRGEKKYTC 408 (2227)
Q Consensus 333 ~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~--~~~C~--~C~~~f~~~~~l~~H~~~h~~~k~~~C 408 (2227)
+...|..|+-.|..- | ..- ++..-+++ .+.|. .|-+.|.+++.|+.|++.|+++|...|
T Consensus 148 s~~ef~dHV~~H~l~----c---eyd----------~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvAC 210 (467)
T KOG3608|consen 148 SIVEFQDHVVKHALF----C---EYD----------IQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVAC 210 (467)
T ss_pred CHHHHHHHHHHhhhh----h---hhh----------hhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEec
Confidence 999999988776410 0 000 00000011 12332 245555555555555555555555555
Q ss_pred CCcCCccccHhHHHHHHhh--cCCCCccccCcCCCccCChhHHHHHhhhcCCCCcccccCcccCcccChhHHHHHHHH-c
Q psy11585 409 ETCGTGFAYKSSLYHHRFT--HIKDRTYPCTYCERKYQSPKTLKEHLKVHTSGDVRHICQTCGSEFHTRKNLLTHIRT-H 485 (2227)
Q Consensus 409 ~~C~~~f~~~~~L~~H~~~--H~~~~~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~~C~~C~~~F~~~~~L~~H~~~-H 485 (2227)
+.||.-|+++..|..|.+. -....+|.|..|.|.|.+...|..|+..|. .-|+|+.|+.....++.|..||+. |
T Consensus 211 p~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHv---n~ykCplCdmtc~~~ssL~~H~r~rH 287 (467)
T KOG3608|consen 211 PHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHV---NCYKCPLCDMTCSSASSLTTHIRYRH 287 (467)
T ss_pred chHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhh---hcccccccccCCCChHHHHHHHHhhh
Confidence 5555555555555555332 223345555555555555555555555553 235555555555555555555542 3
Q ss_pred CCCCccccccccccccChHHHHHHHHhhcccchhhcccCCCCCCcccccchhhhhhhccCcccccCCC--CCCcccCccc
Q psy11585 486 NTDRTHVCELCNANLKTRRSLLRHYTTHGTQLAAIAFNNSQSSSSDHRLVKSEVQILEGDRIKYKCPL--CDRIYTSFSE 563 (2227)
Q Consensus 486 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~--C~~~f~~~~~ 563 (2227)
..++||+|+.|++.|.+.++|.+|...|. ...|+|+. |.+.|.+...
T Consensus 288 s~dkpfKCd~Cd~~c~~esdL~kH~~~HS-------------------------------~~~y~C~h~~C~~s~r~~~q 336 (467)
T KOG3608|consen 288 SKDKPFKCDECDTRCVRESDLAKHVQVHS-------------------------------KTVYQCEHPDCHYSVRTYTQ 336 (467)
T ss_pred ccCCCccccchhhhhccHHHHHHHHHhcc-------------------------------ccceecCCCCCcHHHHHHHH
Confidence 34455555555555555555555555543 11355555 5555555555
Q ss_pred ccccccccc-C--ccceecCCCCccccchhHHHHHHHHhccccc
Q psy11585 564 TKRHFEVHS-G--ERKYTCSICSKCFFIKNRLSEHYRRVHKMRV 604 (2227)
Q Consensus 564 l~~H~~~H~-~--~~~~~C~~C~k~f~~~~~L~~H~~~~H~~~~ 604 (2227)
|++|++.+. | +-+|.|-.|++.|.+-.+|..|+.+.|+.+.
T Consensus 337 ~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~~ 380 (467)
T KOG3608|consen 337 MRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGFRL 380 (467)
T ss_pred HHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhcccC
Confidence 555554322 3 3468888888888888889999888887654
No 5
>KOG3608|consensus
Probab=99.88 E-value=2.6e-23 Score=225.76 Aligned_cols=225 Identities=27% Similarity=0.555 Sum_probs=195.8
Q ss_pred cceec--ccccccccCcchhhhhhcccC-------------CCc-cccc--chhhcccChHHHHHHHhhhcCCCcccccc
Q psy11585 1750 QRCVC--SYCGNSYANPGNLRTHMVVHS-------------NKN-HICE--ICGKSFKKKDLLREHMIVHSTLRPFLCEF 1811 (2227)
Q Consensus 1750 ~~~~C--~~C~~~f~~~~~L~~H~~~h~-------------~~~-~~C~--~C~k~f~~~~~l~~H~~~H~~~~~~~C~~ 1811 (2227)
..|.| +.|+..|.+...|..|+..|. ++| +.|. .|-+.|.+++.|++|++.|++++...|+.
T Consensus 133 ~~f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~ 212 (467)
T KOG3608|consen 133 QNFRCGWEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPH 212 (467)
T ss_pred hhhccChhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecch
Confidence 34666 689999999999999988774 122 4565 58899999999999999999999999999
Q ss_pred cccccccchHHHHHHHhccCCCCCCccccCcchhhccCcccccccccccccCCccccCCCCCchhHHHHHHHHHHHHhhh
Q psy11585 1812 CNAGFKCRKHLLQHYRTHTKPKATNSFSSSKCEESFDNCNNLWSHMFIKHENSDFVCNLCPPDSKIVIKYAHLLVRHMKK 1891 (2227)
Q Consensus 1812 C~~~f~~~~~l~~H~~~h~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~~~~~~~~C~~C~~~~~~~~~~~~~l~~h~~~ 1891 (2227)
||..|.++..|..|.+.-+...
T Consensus 213 Cg~~F~~~tkl~DH~rRqt~l~---------------------------------------------------------- 234 (467)
T KOG3608|consen 213 CGELFRTKTKLFDHLRRQTELN---------------------------------------------------------- 234 (467)
T ss_pred HHHHhccccHHHHHHHhhhhhc----------------------------------------------------------
Confidence 9999999999999988644211
Q ss_pred cccccccccccccccccccccccCCccccCCCCCccccCChHHHHHHHHhccCCCccccccCccccCCchhhhhchhccc
Q psy11585 1892 HHTMQLSISSVSKHIKSKTQIFVDGAIRFKCPDCPTILQTFRGLKAHLDIHSGEKDYACHICNKVFVRHSTLENHMKAVH 1971 (2227)
Q Consensus 1892 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~H 1971 (2227)
..+|.|..|.|.|.+...|..|+..|- ..|+|+.|+.....++.|..||+..|
T Consensus 235 -------------------------~n~fqC~~C~KrFaTeklL~~Hv~rHv--n~ykCplCdmtc~~~ssL~~H~r~rH 287 (467)
T KOG3608|consen 235 -------------------------TNSFQCAQCFKRFATEKLLKSHVVRHV--NCYKCPLCDMTCSSASSLTTHIRYRH 287 (467)
T ss_pred -------------------------CCchHHHHHHHHHhHHHHHHHHHHHhh--hcccccccccCCCChHHHHHHHHhhh
Confidence 114999999999999999999999985 46999999999999999999999999
Q ss_pred CCCCcccCCCCCccccchhhHhhHhhhhcCCCcccCCC--cCcccccchhhhhhcccccC----CceecCcCcCcCCCch
Q psy11585 1972 EKIRDFQCKVCDRAFFDVYNLKLHMRIHTGEKKYVCET--CGASFVHWGSLNIHNYSHIN----AQFVCSFCGNTYKNPK 2045 (2227)
Q Consensus 1972 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~~----~~~~C~~C~~~f~~~~ 2045 (2227)
++.|||+|+.|++.|.+...|..|..+|+ +.-|.|+. |.++|++...+++|++.+.+ .+|.|..|++.|++-.
T Consensus 288 s~dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~ 366 (467)
T KOG3608|consen 288 SKDKPFKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGK 366 (467)
T ss_pred ccCCCccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccch
Confidence 99999999999999999999999999998 77899998 99999999999999887662 4789999999999999
Q ss_pred HHHhhhcccCCCCCc
Q psy11585 2046 SLDSHIRNSHTNRKK 2060 (2227)
Q Consensus 2046 ~l~~H~~~~H~~~~~ 2060 (2227)
+|..|++..|.-.-|
T Consensus 367 ~L~~HL~kkH~f~~P 381 (467)
T KOG3608|consen 367 SLSAHLMKKHGFRLP 381 (467)
T ss_pred hHHHHHHHhhcccCC
Confidence 999998888874433
No 6
>KOG2462|consensus
Probab=99.86 E-value=1.2e-22 Score=218.42 Aligned_cols=132 Identities=31% Similarity=0.660 Sum_probs=113.4
Q ss_pred ccCCCCCCcccChhHHHHHHHhcC---CCCcccCCCCCCccCChhHHHHHHHhcCCCCccccccCccccCCHHHHHHHHH
Q psy11585 682 YQCNICGRVMSDSTNFKDHLDNHK---GEKKYTCEICGTGFMYKSSLHHHKFSHSKERMFQCSFCEKKYMSPKTLKEHEQ 758 (2227)
Q Consensus 682 ~~C~~C~~~f~~~~~l~~H~~~h~---~~~~~~C~~C~~~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~ 758 (2227)
|+|+.||+.+++.++|.+|...|- ..+.+.|.+|++.|.+...|..|+++|+ -+++|.+|||.|..++.|+-|+|
T Consensus 131 ~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQGHiR 208 (279)
T KOG2462|consen 131 YKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQGHIR 208 (279)
T ss_pred eeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHHhhcccc
Confidence 455555555555555555555553 2567899999999999999999999997 68999999999999999999999
Q ss_pred HhcCCCcccccCCCccccCChHHHHHHHHHhCCCCCcccccccccccChHhHHHHHhh
Q psy11585 759 THRSGDIKHICDTCGSEFNTRKNMLRHTKVHSTERPYICEYCNVSFKEKKSLVRHYKI 816 (2227)
Q Consensus 759 ~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~ 816 (2227)
+| +||+||.|+.|+|+|..+++|+.||++|.+.|+|+|..|+++|..++.|.+|...
T Consensus 209 TH-TGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 209 TH-TGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred cc-cCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence 99 9999999999999999999999999999999999999999999999999999864
No 7
>KOG3623|consensus
Probab=99.78 E-value=2.8e-19 Score=211.63 Aligned_cols=81 Identities=30% Similarity=0.603 Sum_probs=76.7
Q ss_pred cCCCcccCCCccccCChhHHHHHHhhhcCCcccccccCccccccchhhhhhhhcccCCCccccccccccccCcchhhchh
Q psy11585 1314 MKVGYQCNVCGRVLTDSSNLKVHMRNHTGEKKYVCEICGKGFTQWASHYYHKFTHSEERSFKCSYCAMTFRCPRTLTEHK 1393 (2227)
Q Consensus 1314 ~~~~~~C~~C~k~f~~~~~l~~H~~~h~~ek~~~C~~C~k~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~ 1393 (2227)
.+..|.|+.|+|+|.-.++|.+|.-.|+|.+||+|.+|.|+|+.+..|..|+|.|+|||||+|+.|+|+|...+..-.||
T Consensus 891 e~gmyaCDqCDK~FqKqSSLaRHKYEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHSGSYSQHM 970 (1007)
T KOG3623|consen 891 EDGMYACDQCDKAFQKQSSLARHKYEHSGQRPYQCIICKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHSGSYSQHM 970 (1007)
T ss_pred ccccchHHHHHHHHHhhHHHHHhhhhhcCCCCcccchhhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccccchHhhh
Confidence 34579999999999999999999999999999999999999999999999999999999999999999999999999998
Q ss_pred h
Q psy11585 1394 K 1394 (2227)
Q Consensus 1394 ~ 1394 (2227)
.
T Consensus 971 N 971 (1007)
T KOG3623|consen 971 N 971 (1007)
T ss_pred c
Confidence 5
No 8
>KOG3623|consensus
Probab=99.77 E-value=9.5e-19 Score=207.13 Aligned_cols=105 Identities=22% Similarity=0.625 Sum_probs=66.3
Q ss_pred ccCCccccccCChHHHHHHHHHhcC--CCCcccCCCCCccCChhHHHHHHhhhcC-------------CCccccCCcCCc
Q psy11585 350 YTCEACGKSFPLKRRLNAHYNKWHL--GKGYRCHICGSTMSNAANFKDHLDSHRG-------------EKKYTCETCGTG 414 (2227)
Q Consensus 350 ~~C~~C~~~f~~~~~L~~H~~~~h~--~~~~~C~~C~~~f~~~~~l~~H~~~h~~-------------~k~~~C~~C~~~ 414 (2227)
.+|++|++.+.....|..|++-.|. +..|.|..|.++|..+..|.+||.+|.. -+.|+|.+|||.
T Consensus 211 ltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgKA 290 (1007)
T KOG3623|consen 211 LTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECGKA 290 (1007)
T ss_pred hcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccchh
Confidence 3555555555555555555544432 2335555555555555555555555532 256777777777
Q ss_pred cccHhHHHHHHhhcCCCCccccCcCCCccCChhHHHHHhh
Q psy11585 415 FAYKSSLYHHRFTHIKDRTYPCTYCERKYQSPKTLKEHLK 454 (2227)
Q Consensus 415 f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~~~L~~H~~ 454 (2227)
|..+-.|+.|+|+|.|++||.|+.|+|+|...++...||-
T Consensus 291 FKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 291 FKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred hhhHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence 7777777777777777777777777777777777777764
No 9
>KOG3576|consensus
Probab=99.48 E-value=8.8e-15 Score=147.67 Aligned_cols=84 Identities=30% Similarity=0.588 Sum_probs=71.2
Q ss_pred ccccCCCCCccccCChHHHHHHHHhccCCCccccccCccccCCchhhhhchhcccCCCCcccCCCCCccccchhhHhhHh
Q psy11585 1917 AIRFKCPDCPTILQTFRGLKAHLDIHSGEKDYACHICNKVFVRHSTLENHMKAVHEKIRDFQCKVCDRAFFDVYNLKLHM 1996 (2227)
Q Consensus 1917 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~ 1996 (2227)
...|.|..|+|.|.-..-|.+||+-|+..+.|-|..|||.|.....|++|+++ |+|.+||+|..|+++|+.+-.|..|+
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rt-htgvrpykc~~c~kaftqrcsleshl 193 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRT-HTGVRPYKCSLCEKAFTQRCSLESHL 193 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhcc-ccCccccchhhhhHHHHhhccHHHHH
Confidence 44588888888888888888888888888888888888888888888888887 88888888888888888888888888
Q ss_pred hhhcC
Q psy11585 1997 RIHTG 2001 (2227)
Q Consensus 1997 ~~h~~ 2001 (2227)
+.-+|
T Consensus 194 ~kvhg 198 (267)
T KOG3576|consen 194 KKVHG 198 (267)
T ss_pred HHHcC
Confidence 64443
No 10
>KOG3576|consensus
Probab=99.44 E-value=1.9e-14 Score=145.30 Aligned_cols=113 Identities=30% Similarity=0.652 Sum_probs=102.2
Q ss_pred CCccccccCccccCCchhhhhchhcccCCCCcccCCCCCccccchhhHhhHhhhhcCCCcccCCCcCcccccchhhhhhc
Q psy11585 1945 EKDYACHICNKVFVRHSTLENHMKAVHEKIRDFQCKVCDRAFFDVYNLKLHMRIHTGEKKYVCETCGASFVHWGSLNIHN 2024 (2227)
Q Consensus 1945 ~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~ 2024 (2227)
...|.|.+|+|.|....-|.+||+- |...+.+-|..||+.|...-+|++|+|+|+|.+||.|..|++.|+.+-+|..|.
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~kc-h~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl 193 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLKC-HSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHL 193 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhhh-ccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHH
Confidence 3459999999999999999999997 999999999999999999999999999999999999999999999999999997
Q ss_pred ccccC------------CceecCcCcCcCCCchHHHhhhcccCCCC
Q psy11585 2025 YSHIN------------AQFVCSFCGNTYKNPKSLDSHIRNSHTNR 2058 (2227)
Q Consensus 2025 ~~h~~------------~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 2058 (2227)
+.-+| +-|+|..||++-.....+..|++.+|+..
T Consensus 194 ~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~S 239 (267)
T KOG3576|consen 194 KKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFS 239 (267)
T ss_pred HHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCC
Confidence 64332 46899999999999999999999888753
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.84 E-value=2.7e-09 Score=132.41 Aligned_cols=133 Identities=22% Similarity=0.537 Sum_probs=107.4
Q ss_pred cCCCCCccccCChHHHHHHHHhccCCCcccccc--CccccCCchhhhhchhcccCCCCcccCCCCCccccchhhHhhHhh
Q psy11585 1920 FKCPDCPTILQTFRGLKAHLDIHSGEKDYACHI--CNKVFVRHSTLENHMKAVHEKIRDFQCKVCDRAFFDVYNLKLHMR 1997 (2227)
Q Consensus 1920 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~--C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~ 1997 (2227)
-.|+.|........ |..|...-. -..-.|+. |+..| .+..|..| +.|+.|++.|. ...|..|++
T Consensus 408 V~C~NC~~~i~l~~-l~lHe~~C~-r~~V~Cp~~~Cg~v~-~r~el~~H----------~~C~~Cgk~f~-~s~LekH~~ 473 (567)
T PLN03086 408 VECRNCKHYIPSRS-IALHEAYCS-RHNVVCPHDGCGIVL-RVEEAKNH----------VHCEKCGQAFQ-QGEMEKHMK 473 (567)
T ss_pred EECCCCCCccchhH-HHHHHhhCC-CcceeCCccccccee-eccccccC----------ccCCCCCCccc-hHHHHHHHH
Confidence 47999988876554 778885542 23457985 99999 34555555 48999999996 688999999
Q ss_pred hhcCCCcccCCCcCcccccchhhhhhcccccC-CceecCcCcCcCC----------CchHHHhhhcccCCCCCccCCCcc
Q psy11585 1998 IHTGEKKYVCETCGASFVHWGSLNIHNYSHIN-AQFVCSFCGNTYK----------NPKSLDSHIRNSHTNRKKSICDDC 2066 (2227)
Q Consensus 1998 ~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~-~~~~C~~C~~~f~----------~~~~l~~H~~~~H~~~~~~~C~~C 2066 (2227)
+|+ +||.|+ ||+.+ .+..|..|+.+|.+ +++.|++|++.|. ..+.|..|+.. .+.+++.|..|
T Consensus 474 ~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~--CG~rt~~C~~C 547 (567)
T PLN03086 474 VFH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESI--CGSRTAPCDSC 547 (567)
T ss_pred hcC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHh--cCCcceEcccc
Confidence 986 899999 99765 66899999999987 8999999999995 24589999885 48999999999
Q ss_pred CCccCC
Q psy11585 2067 TKAMST 2072 (2227)
Q Consensus 2067 ~~~f~~ 2072 (2227)
|+.+..
T Consensus 548 gk~Vrl 553 (567)
T PLN03086 548 GRSVML 553 (567)
T ss_pred CCeeee
Confidence 999876
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.79 E-value=4.3e-09 Score=130.61 Aligned_cols=120 Identities=23% Similarity=0.476 Sum_probs=99.9
Q ss_pred cCCCC--CccccCChHHHHHHHHhccCCCccccccCccccCCchhhhhchhcccCCCCcccCCCCCccccchhhHhhHhh
Q psy11585 1920 FKCPD--CPTILQTFRGLKAHLDIHSGEKDYACHICNKVFVRHSTLENHMKAVHEKIRDFQCKVCDRAFFDVYNLKLHMR 1997 (2227)
Q Consensus 1920 ~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~ 1997 (2227)
-.|+. |+..|.... +++++.|+.|++.|. ...|..|+...| +|+.|+ ||+.+ .+..|..|++
T Consensus 434 V~Cp~~~Cg~v~~r~e----------l~~H~~C~~Cgk~f~-~s~LekH~~~~H---kpv~Cp-Cg~~~-~R~~L~~H~~ 497 (567)
T PLN03086 434 VVCPHDGCGIVLRVEE----------AKNHVHCEKCGQAFQ-QGEMEKHMKVFH---EPLQCP-CGVVL-EKEQMVQHQA 497 (567)
T ss_pred eeCCcccccceeeccc----------cccCccCCCCCCccc-hHHHHHHHHhcC---CCccCC-CCCCc-chhHHHhhhh
Confidence 46885 999995332 344568999999996 688999999855 789999 99765 6799999999
Q ss_pred hhcCCCcccCCCcCcccc----------cchhhhhhcccccCCceecCcCcCcCCCchHHHhhhcccCC
Q psy11585 1998 IHTGEKKYVCETCGASFV----------HWGSLNIHNYSHINAQFVCSFCGNTYKNPKSLDSHIRNSHT 2056 (2227)
Q Consensus 1998 ~h~~~~~~~C~~C~~~f~----------~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~~H~ 2056 (2227)
+|.+++|+.|+.|++.|. ....|..|......+++.|..||+.+.. .+|..|+..+|.
T Consensus 498 thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~CG~rt~~C~~Cgk~Vrl-rdm~~H~~~~h~ 565 (567)
T PLN03086 498 STCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESICGSRTAPCDSCGRSVML-KEMDIHQIAVHQ 565 (567)
T ss_pred ccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhcCCcceEccccCCeeee-hhHHHHHHHhhc
Confidence 999999999999999995 2358999999987799999999988755 589999988875
No 13
>KOG1146|consensus
Probab=98.57 E-value=6.9e-08 Score=125.99 Aligned_cols=134 Identities=20% Similarity=0.266 Sum_probs=94.4
Q ss_pred cccCcccccCCcccccccCHHHHHhhhhccC-------------------------CCceeecccccccccchhHHHHHH
Q psy11585 68 EIDGEIKFQCPDCHTMMKNFAYLKKHVRDNH-------------------------SGETFSCDECSKSFTTKKCLREHY 122 (2227)
Q Consensus 68 ~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~~-------------------------~~~~~~C~~C~~~F~~~~~l~~H~ 122 (2227)
.......|+|+.|+..|+-...|..|||..| +.++|.|..|..++.++.+|..|+
T Consensus 459 L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~~C~~C~~stttng~Lsihl 538 (1406)
T KOG1146|consen 459 LHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPYPCRACNYSTTTNGNLSIHL 538 (1406)
T ss_pred eecccccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCcccceeeeeeeecchHHHHHH
Confidence 3344578999999999999999999999855 246799999999999999999998
Q ss_pred hhhccccccCchhhhc---------------cc-----ccceee-ecccccccCCcccccccchhhhhhhhhhcccccCC
Q psy11585 123 KKLHTIRIRSSREEND---------------MK-----KKTMVY-VEGVVKYKCPECGFMVKRFQGLREHIVSVHAQVKD 181 (2227)
Q Consensus 123 ~~~H~~~~~~~~~~~~---------------~~-----~~~~~~-~~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~ 181 (2227)
+..-........+++. .. ...... ....-.+.|.+|+....-..+|+.||...+....|
T Consensus 539 qS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc~yetniarnlrihmtss~~s~~p 618 (1406)
T KOG1146|consen 539 QSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVCSYETNIARNLRIHMTASPSSSPP 618 (1406)
T ss_pred HHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhhcchhhhhhccccccccCCCCCCh
Confidence 7532211100000000 00 000000 11234589999999999999999999876666666
Q ss_pred CcCccccccccchhhhhhcc
Q psy11585 182 HVCIVCGAAFGLARRLKTHY 201 (2227)
Q Consensus 182 ~~C~~C~~~f~~~~~l~~H~ 201 (2227)
..|..++-.+.....+..+-
T Consensus 619 ~~~Lq~~it~~l~~~~~~~~ 638 (1406)
T KOG1146|consen 619 SLVLQQNITSSLASLLGGQG 638 (1406)
T ss_pred HHHhhhcchhhccccccCcC
Confidence 77888888888777777664
No 14
>PHA00733 hypothetical protein
Probab=98.55 E-value=3.8e-08 Score=99.87 Aligned_cols=78 Identities=22% Similarity=0.385 Sum_probs=37.8
Q ss_pred CCccccccCccccCCchhhhhch--hc--ccCCCCcccCCCCCccccchhhHhhHhhhhcCCCcccCCCcCcccccchhh
Q psy11585 1945 EKDYACHICNKVFVRHSTLENHM--KA--VHEKIRDFQCKVCDRAFFDVYNLKLHMRIHTGEKKYVCETCGASFVHWGSL 2020 (2227)
Q Consensus 1945 ~~~~~C~~C~~~f~~~~~L~~H~--~~--~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l 2020 (2227)
.+++.|.+|.+.|..+..|..|. +. .+.+.+||.|+.|++.|.+...|..|++.| +.+|.|.+|++.|.....|
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL 115 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDST 115 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHH
Confidence 45667777777776666665551 11 122244455555555555555555555443 2234444444444444444
Q ss_pred hhhc
Q psy11585 2021 NIHN 2024 (2227)
Q Consensus 2021 ~~H~ 2024 (2227)
..|+
T Consensus 116 ~~H~ 119 (128)
T PHA00733 116 LDHV 119 (128)
T ss_pred HHHH
Confidence 4333
No 15
>PHA00733 hypothetical protein
Probab=98.50 E-value=1.2e-07 Score=96.24 Aligned_cols=83 Identities=16% Similarity=0.327 Sum_probs=52.7
Q ss_pred CCccccccCccccCCHHHHHHH--HHH--hcCCCcccccCCCccccCChHHHHHHHHHhCCCCCcccccccccccChHhH
Q psy11585 735 ERMFQCSFCEKKYMSPKTLKEH--EQT--HRSGDIKHICDTCGSEFNTRKNMLRHTKVHSTERPYICEYCNVSFKEKKSL 810 (2227)
Q Consensus 735 ~~~~~C~~C~~~f~~~~~L~~H--~~~--h~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L 810 (2227)
.+++.|.+|...|.++..|..+ +.. ...++.+|.|+.|++.|.+...|..|++.| +.+|.|+.|++.|.....|
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL 115 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDST 115 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHH
Confidence 3445555555555555444444 111 112456777777777777777777777765 3467777777777777777
Q ss_pred HHHHhhhcC
Q psy11585 811 VRHYKIHKG 819 (2227)
Q Consensus 811 ~~H~~~h~~ 819 (2227)
..|+...++
T Consensus 116 ~~H~~~~h~ 124 (128)
T PHA00733 116 LDHVCKKHN 124 (128)
T ss_pred HHHHHHhcC
Confidence 777776665
No 16
>KOG3993|consensus
Probab=98.36 E-value=6.1e-08 Score=110.53 Aligned_cols=165 Identities=19% Similarity=0.301 Sum_probs=109.1
Q ss_pred ccCCccccccCChHHHHHHHHHhcCCCCcccCCCCCccCChhHHHHHHhhhcC---------------------------
Q psy11585 350 YTCEACGKSFPLKRRLNAHYNKWHLGKGYRCHICGSTMSNAANFKDHLDSHRG--------------------------- 402 (2227)
Q Consensus 350 ~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~h~~--------------------------- 402 (2227)
|.|.+|...|...-.|.+|.-.-...-.|+|++|+|+|+-..+|..|.+.|..
T Consensus 268 yiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~r 347 (500)
T KOG3993|consen 268 YICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAER 347 (500)
T ss_pred HHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhccc
Confidence 77777777777777777775322222347788888888888888888777742
Q ss_pred ------CCccccCCcCCccccHhHHHHHHhhcCCCCc-----------------cccCcCCCccCChhHHHHHhhhcCCC
Q psy11585 403 ------EKKYTCETCGTGFAYKSSLYHHRFTHIKDRT-----------------YPCTYCERKYQSPKTLKEHLKVHTSG 459 (2227)
Q Consensus 403 ------~k~~~C~~C~~~f~~~~~L~~H~~~H~~~~~-----------------~~C~~C~k~f~~~~~L~~H~~~h~~~ 459 (2227)
+..|.|.+|++.|.+...|+.|+.+|..... +-|..|.-.+.....--.+...+...
T Consensus 348 sg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~a~s 427 (500)
T KOG3993|consen 348 SGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYVAGS 427 (500)
T ss_pred cCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeeeecc
Confidence 1247888888888888888888777743210 22333333333222222222222212
Q ss_pred CcccccCcccCcccChhHHHHHHHHcCCCCccccccccccccChHHHHHHHHhhc
Q psy11585 460 DVRHICQTCGSEFHTRKNLLTHIRTHNTDRTHVCELCNANLKTRRSLLRHYTTHG 514 (2227)
Q Consensus 460 ~~~~~C~~C~~~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h~ 514 (2227)
.....|+.|+..+.+...--.+.+.-..+..|.|.+|..+|.+...|.+|+..-+
T Consensus 428 ael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~H 482 (500)
T KOG3993|consen 428 AELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCH 482 (500)
T ss_pred ccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcC
Confidence 2245688888888887776666666667788999999999999999999986544
No 17
>PHA02768 hypothetical protein; Provisional
Probab=98.35 E-value=1.5e-07 Score=76.67 Aligned_cols=44 Identities=25% Similarity=0.431 Sum_probs=37.4
Q ss_pred ccccccCcCccCCcchHHHHHhhhcCCCccccccccccccChhhHh
Q psy11585 1637 PYECDTCGHGLSSKKSLDDHYRIHTGEKKYVCQQCGASFTQWASLF 1682 (2227)
Q Consensus 1637 ~~~C~~C~~~f~~~~~L~~H~r~H~~ek~~~C~~C~k~F~~~~~L~ 1682 (2227)
.|+|++||+.|.....|..|+|+|+ ++|+|..|++.|...+.|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 4789999999999999999999988 6888999999888777664
No 18
>PHA02768 hypothetical protein; Provisional
Probab=98.30 E-value=3.2e-07 Score=74.84 Aligned_cols=42 Identities=17% Similarity=0.493 Sum_probs=27.4
Q ss_pred ccCCCCCccccchhhHhhHhhhhcCCCcccCCCcCcccccchhh
Q psy11585 1977 FQCKVCDRAFFDVYNLKLHMRIHTGEKKYVCETCGASFVHWGSL 2020 (2227)
Q Consensus 1977 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l 2020 (2227)
|.|+.||+.|.....|..||++|+ +||.|..|++.|+..+.|
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence 566666666666666666666666 566666666666655544
No 19
>KOG3993|consensus
Probab=98.22 E-value=6.6e-07 Score=102.31 Aligned_cols=28 Identities=21% Similarity=0.433 Sum_probs=22.2
Q ss_pred ccceecCCCCccccchhHHHHHHHHhcc
Q psy11585 574 ERKYTCSICSKCFFIKNRLSEHYRRVHK 601 (2227)
Q Consensus 574 ~~~~~C~~C~k~f~~~~~L~~H~~~~H~ 601 (2227)
+..|.|.+|++.|.....|+.|+..+|.
T Consensus 354 ~gi~~C~~C~KkFrRqAYLrKHqlthq~ 381 (500)
T KOG3993|consen 354 SGIFSCHTCGKKFRRQAYLRKHQLTHQR 381 (500)
T ss_pred CceeecHHhhhhhHHHHHHHHhHHhhhc
Confidence 3468888888888888888888876664
No 20
>KOG1146|consensus
Probab=98.20 E-value=1.3e-05 Score=105.57 Aligned_cols=175 Identities=17% Similarity=0.191 Sum_probs=89.9
Q ss_pred cCCccccHhHHHHHHh-hcCCCCccccCcCCCccCChhHHHHHhhhcCCCCcccccCcccCcccChhHHHHHHHHc----
Q psy11585 411 CGTGFAYKSSLYHHRF-THIKDRTYPCTYCERKYQSPKTLKEHLKVHTSGDVRHICQTCGSEFHTRKNLLTHIRTH---- 485 (2227)
Q Consensus 411 C~~~f~~~~~L~~H~~-~H~~~~~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~~C~~C~~~F~~~~~L~~H~~~H---- 485 (2227)
|+..+.++..+..|+. .|.-.+.|+|+.|+..|.....|..|||..+..-.. .+|. .-+.|.+.-
T Consensus 442 ~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~---~~c~-------~gq~~~~~arg~~ 511 (1406)
T KOG1146|consen 442 AEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQS---AYCK-------AGQNHPRLARGEV 511 (1406)
T ss_pred hhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccch---hHhH-------hcccccccccccc
Confidence 3334444444444432 244456777888888888888888887764321111 1111 011122111
Q ss_pred --CCCCccccccccccccChHHHHHHHHhh--cccchhhcccCC-----------CCCCcccccchhhhhhhccCccccc
Q psy11585 486 --NTDRTHVCELCNANLKTRRSLLRHYTTH--GTQLAAIAFNNS-----------QSSSSDHRLVKSEVQILEGDRIKYK 550 (2227)
Q Consensus 486 --~~~~~~~C~~C~~~f~~~~~L~~H~~~h--~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~ 550 (2227)
-+.++|.|..|..++.++.+|.+||..- ...+.....+.. .....................-.+.
T Consensus 512 ~~~~~~p~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~ 591 (1406)
T KOG1146|consen 512 YRCPGKPYPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWR 591 (1406)
T ss_pred ccCCCCcccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcc
Confidence 1346788888888888888888887542 211111000000 0000000000000000012233689
Q ss_pred CCCCCCcccCcccccccccc-ccCccceecCCCCccccchhHHHHH
Q psy11585 551 CPLCDRIYTSFSETKRHFEV-HSGERKYTCSICSKCFFIKNRLSEH 595 (2227)
Q Consensus 551 C~~C~~~f~~~~~l~~H~~~-H~~~~~~~C~~C~k~f~~~~~L~~H 595 (2227)
|.+|++..+-..+|+.||.. ++-..|..+..++-.+.....+..+
T Consensus 592 C~vc~yetniarnlrihmtss~~s~~p~~~Lq~~it~~l~~~~~~~ 637 (1406)
T KOG1146|consen 592 CEVCSYETNIARNLRIHMTASPSSSPPSLVLQQNITSSLASLLGGQ 637 (1406)
T ss_pred hhhhcchhhhhhccccccccCCCCCChHHHhhhcchhhccccccCc
Confidence 99999999999999999864 3333346666666655554444443
No 21
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.15 E-value=8.9e-07 Score=61.84 Aligned_cols=26 Identities=62% Similarity=1.105 Sum_probs=22.1
Q ss_pred hHhhHhhhhcCCCcccCCCcCccccc
Q psy11585 1991 NLKLHMRIHTGEKKYVCETCGASFVH 2016 (2227)
Q Consensus 1991 ~l~~H~~~h~~~~~~~C~~C~~~f~~ 2016 (2227)
+|.+||++|+|++||.|++|+++|++
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 47889999999999999999988863
No 22
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.15 E-value=1.1e-06 Score=61.42 Aligned_cols=26 Identities=54% Similarity=1.028 Sum_probs=23.4
Q ss_pred hHHHHHhhhcCCCccccccccccccC
Q psy11585 1652 SLDDHYRIHTGEKKYVCQQCGASFTQ 1677 (2227)
Q Consensus 1652 ~L~~H~r~H~~ek~~~C~~C~k~F~~ 1677 (2227)
+|..|+++|+|++||.|++|+++|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 48899999999999999999999964
No 23
>PHA00616 hypothetical protein
Probab=97.65 E-value=1.7e-05 Score=61.62 Aligned_cols=34 Identities=24% Similarity=0.380 Sum_probs=28.5
Q ss_pred ccccccCcCccCCcchHHHHHhhhcCCCcccccc
Q psy11585 1637 PYECDTCGHGLSSKKSLDDHYRIHTGEKKYVCQQ 1670 (2227)
Q Consensus 1637 ~~~C~~C~~~f~~~~~L~~H~r~H~~ek~~~C~~ 1670 (2227)
||+|+.||+.|..+++|..|++.|+|++++.|+.
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 5788888888888888888888888888888764
No 24
>PHA00616 hypothetical protein
Probab=97.62 E-value=2.8e-05 Score=60.39 Aligned_cols=35 Identities=26% Similarity=0.436 Sum_probs=29.2
Q ss_pred cccccchhhcccChHHHHHHHhhhcCCCccccccc
Q psy11585 1778 NHICEICGKSFKKKDLLREHMIVHSTLRPFLCEFC 1812 (2227)
Q Consensus 1778 ~~~C~~C~k~f~~~~~l~~H~~~H~~~~~~~C~~C 1812 (2227)
||.|+.||+.|..++.|.+|++.|+|++++.|+.=
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~~ 35 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEYF 35 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeEE
Confidence 57888888888888888888888888888888753
No 25
>PHA00732 hypothetical protein
Probab=97.48 E-value=7.8e-05 Score=68.15 Aligned_cols=48 Identities=35% Similarity=0.681 Sum_probs=30.3
Q ss_pred cccCCCCCccccchhhHhhHhhh-hcCCCcccCCCcCcccccchhhhhhcccccC
Q psy11585 1976 DFQCKVCDRAFFDVYNLKLHMRI-HTGEKKYVCETCGASFVHWGSLNIHNYSHIN 2029 (2227)
Q Consensus 1976 ~~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~ 2029 (2227)
||.|++|++.|.+...|+.|++. |+ ++.|+.||+.|. .|..|.+++.+
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~~~~ 49 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYSQYD 49 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---ChhhhhcccCC
Confidence 46677777777777777777763 44 246777777775 46666655544
No 26
>PHA00732 hypothetical protein
Probab=97.40 E-value=9.5e-05 Score=67.63 Aligned_cols=47 Identities=30% Similarity=0.546 Sum_probs=33.7
Q ss_pred cccCCccCCcccCchhhhhhhcc-ccCCcccccCccccccCChhhHHhHHhhcc
Q psy11585 1513 PYECDTCGHGLSSKKSLDDHYRI-HTGEKKYVCQQCGASFTQWASLFYHKFSHS 1565 (2227)
Q Consensus 1513 ~~~C~~C~k~f~~~~~L~~H~~~-H~~~k~~~C~~C~k~f~~~~~L~~H~~~H~ 1565 (2227)
||.|+.||+.|.+..+|..|++. |+ ++.|++||++|. .|..|.++..
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~~~ 48 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYSQY 48 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---ChhhhhcccC
Confidence 57788888888888888888774 54 357888888887 4667776554
No 27
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.21 E-value=0.00023 Score=60.53 Aligned_cols=49 Identities=24% Similarity=0.518 Sum_probs=22.8
Q ss_pred cccccCccccCCchhhhhchhcccCCC-CcccCCCCCccccchhhHhhHhhhh
Q psy11585 1948 YACHICNKVFVRHSTLENHMKAVHEKI-RDFQCKVCDRAFFDVYNLKLHMRIH 1999 (2227)
Q Consensus 1948 ~~C~~C~~~f~~~~~L~~H~~~~H~~~-~~~~C~~C~~~f~~~~~l~~H~~~h 1999 (2227)
|+||+|++ ..+...|..|+...|..+ +.+.|++|...+. .+|..||..+
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~ 52 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQ 52 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHh
Confidence 45555555 233344555555444433 3355555554433 2555555443
No 28
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=96.78 E-value=0.0019 Score=54.95 Aligned_cols=48 Identities=29% Similarity=0.677 Sum_probs=21.1
Q ss_pred cccccCccccCCHHHHHHHHHHhcCCC-cccccCCCccccCChHHHHHHHHH
Q psy11585 738 FQCSFCEKKYMSPKTLKEHEQTHRSGD-IKHICDTCGSEFNTRKNMLRHTKV 788 (2227)
Q Consensus 738 ~~C~~C~~~f~~~~~L~~H~~~h~~~~-~~~~C~~C~~~f~~~~~L~~H~~~ 788 (2227)
|.|++|++ ..+...|..|....|..+ +.+.|++|...+. .+|.+|+..
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~ 51 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNS 51 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHH
Confidence 44555555 333444555544333322 3355555554322 244444443
No 29
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=96.42 E-value=0.0017 Score=44.29 Aligned_cols=20 Identities=55% Similarity=0.987 Sum_probs=8.6
Q ss_pred cccchhhcccChHHHHHHHh
Q psy11585 1780 ICEICGKSFKKKDLLREHMI 1799 (2227)
Q Consensus 1780 ~C~~C~k~f~~~~~l~~H~~ 1799 (2227)
.|+.|++.|.++..|..|++
T Consensus 2 ~C~~C~~~f~~~~~l~~H~~ 21 (23)
T PF00096_consen 2 KCPICGKSFSSKSNLKRHMR 21 (23)
T ss_dssp EETTTTEEESSHHHHHHHHH
T ss_pred CCCCCCCccCCHHHHHHHHh
Confidence 34444444444444444443
No 30
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=96.26 E-value=0.0029 Score=43.14 Aligned_cols=23 Identities=35% Similarity=0.834 Sum_probs=17.3
Q ss_pred cccCccccccCCHHHHHHHHHhh
Q psy11585 1403 HVCNTCGNEYNTRKNLLSHMKIH 1425 (2227)
Q Consensus 1403 ~~C~~C~~~f~~~~~L~~H~~~H 1425 (2227)
|.|+.||+.|.+.+.|..|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 56777778888888887777764
No 31
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.06 E-value=0.0025 Score=63.40 Aligned_cols=71 Identities=25% Similarity=0.415 Sum_probs=13.5
Q ss_pred ecccccccccchhHHHHHHhhhccccccCchhhhcccccceeeecc-cccccCCcccccccchhhhhhhhhh
Q psy11585 104 SCDECSKSFTTKKCLREHYKKLHTIRIRSSREENDMKKKTMVYVEG-VVKYKCPECGFMVKRFQGLREHIVS 174 (2227)
Q Consensus 104 ~C~~C~~~F~~~~~l~~H~~~~H~~~~~~~~~~~~~~~~~~~~~~~-~~~~~C~~C~~~f~~~~~L~~H~~~ 174 (2227)
.|.+|+..|.+...|..|+...|...+.......+...-....... ...+.|..|++.|.+..+|..|+..
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~l~~~~~~~~~~~~~~~~~~~C~~C~~~f~s~~~l~~Hm~~ 72 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIPDQKYLVDPNRLLNYLRKKVKESFRCPYCNKTFRSREALQEHMRS 72 (100)
T ss_dssp ------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHH
T ss_pred CccccccccccccccccccccccccccccccccccccccccccccccCCCCCCCccCCCCcCHHHHHHHHcC
Confidence 3777777787778888887777765433111111100000000000 1135666666666666666666653
No 32
>KOG2482|consensus
Probab=96.04 E-value=0.0042 Score=69.98 Aligned_cols=214 Identities=14% Similarity=0.200 Sum_probs=119.4
Q ss_pred CcccccCCccccccc--CHHHHHhhhhccCCCceeecccccccccchhHHHHHHhhhccccccCchhhhcccccceeeec
Q psy11585 71 GEIKFQCPDCHTMMK--NFAYLKKHVRDNHSGETFSCDECSKSFTTKKCLREHYKKLHTIRIRSSREENDMKKKTMVYVE 148 (2227)
Q Consensus 71 ~~~~~~C~~C~~~f~--~~~~L~~H~~~~~~~~~~~C~~C~~~F~~~~~l~~H~~~~H~~~~~~~~~~~~~~~~~~~~~~ 148 (2227)
....-+|-.|.++|. ....+..|+.+.|.--.-.|+.=...-.....|+.|..+.|....-.....+. +.......
T Consensus 25 ~tl~g~C~~C~e~f~~peq~~~l~Hl~~ehkiVIadv~~iadf~~yi~~w~k~ftt~~~~~~~t~i~~~~--~d~~fee~ 102 (423)
T KOG2482|consen 25 KTLAGKCLKCDEVFIFPEQKDCLAHLMSEHKIVIADVPLIADFARYIIYWRKRFTTIHISSVCTLIKNAY--PDCAFEEA 102 (423)
T ss_pred ccccceeeehhhcccCcchHHHHHHHHhhccEEEEecchhhhHHHHHHHHHHHhhccchhhheeeeecCC--CCchhhhh
Confidence 344455999997765 56669999988775322222211111123445566655555322111000000 00000000
Q ss_pred ccccccCCcccccc---------cchhhhhhhhhhcccccCCCcCccccccc-cchhhhhhcccccccccccccccCCCC
Q psy11585 149 GVVKYKCPECGFMV---------KRFQGLREHIVSVHAQVKDHVCIVCGAAF-GLARRLKTHYIRRHTVNILTQANHDNE 218 (2227)
Q Consensus 149 ~~~~~~C~~C~~~f---------~~~~~L~~H~~~~H~~~~~~~C~~C~~~f-~~~~~l~~H~~~~H~~~~~~~~~~~~~ 218 (2227)
.....-|..=+... .-...|..+++..-......+|-.|...+ ++++..-.|+...|..++-...+...-
T Consensus 103 e~yyll~d~l~qd~~Lrqrlqe~rL~eaLeqqQ~Eredt~fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyv 182 (423)
T KOG2482|consen 103 EEYYLLCDLLPQDRLLRQRLQENRLKEALEQQQKEREDTIFSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYV 182 (423)
T ss_pred hHHHhccccchHhHHHHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeH
Confidence 11112233222211 12355666766554555678899999877 578888899999997655322211110
Q ss_pred CccchhhhcccCccccccccCcccccccCCCCCcccChhhHHHHHhh--ccC----------------------------
Q psy11585 219 DKLDVTKIFNVNKEDCQIMQGEKVKFKCPECPRSYGNFSELKKHLAV--HTG---------------------------- 268 (2227)
Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~--h~~---------------------------- 268 (2227)
..... .. ..+--.+.|-.|.+.|+.+..|+.||+. |..
T Consensus 183 -----nelLe------hL-kekL~r~~CLyCekifrdkntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~ 250 (423)
T KOG2482|consen 183 -----NELLE------HL-KEKLERLRCLYCEKIFRDKNTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHS 250 (423)
T ss_pred -----HHHHH------HH-HHHHhhheeeeeccccCCcHHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhh
Confidence 00000 00 0011158999999999999999999974 210
Q ss_pred ------------------------CCc--ccccccccccCChHHHHHHHHHhcccc
Q psy11585 269 ------------------------EKH--FVCSVCQRGFFMKNRLNEHYKRVHHMN 298 (2227)
Q Consensus 269 ------------------------~~~--~~C~~C~~~f~~~~~l~~H~~~~H~~~ 298 (2227)
..+ ..|-.|.....+...|..||+.+|..+
T Consensus 251 e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~~~~en~~~l~eHmk~vHe~D 306 (423)
T KOG2482|consen 251 EDDRETNEDINETDDTWSDWNEDDAEALSVVCLFCTNFYENPVFLFEHMKIVHEFD 306 (423)
T ss_pred hhhhhhhccccccccchhhhhcCCCCccceEEEeeccchhhHHHHHHHHHHHHHhh
Confidence 011 489999999999999999999999544
No 33
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.00 E-value=0.0029 Score=62.94 Aligned_cols=26 Identities=27% Similarity=0.866 Sum_probs=17.5
Q ss_pred CCceecCcCcCcCCCchHHHhhhccc
Q psy11585 2029 NAQFVCSFCGNTYKNPKSLDSHIRNS 2054 (2227)
Q Consensus 2029 ~~~~~C~~C~~~f~~~~~l~~H~~~~ 2054 (2227)
...+.|.+|++.|.+...|..||+..
T Consensus 48 ~~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 48 KESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp -SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred CCCCCCCccCCCCcCHHHHHHHHcCc
Confidence 34577777777777777788887753
No 34
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=95.90 E-value=0.0044 Score=68.92 Aligned_cols=71 Identities=21% Similarity=0.459 Sum_probs=41.4
Q ss_pred CccceecCC--CCccccchhHHHHHHHHhccccccccccccccccccccccCcCccccCCCCccccchhhHHhhh
Q psy11585 573 GERKYTCSI--CSKCFFIKNRLSEHYRRVHKMRVSMARTNDVKKSAEISVDGVTKYKCHICDSIFTRYDSLRLHV 645 (2227)
Q Consensus 573 ~~~~~~C~~--C~k~f~~~~~L~~H~~~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~ 645 (2227)
+++||+|++ |.|.+.....|+-|+..-|........ .......+.....+||+|++|+|.+.....|+-|+
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~--p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr 418 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHEN--PSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHR 418 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCC--CCccccccccccCCceeccccchhhccCccceecc
Confidence 358999987 999999999999998755533221111 11111112223345566666666665555555553
No 35
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=95.76 E-value=0.0039 Score=43.04 Aligned_cols=24 Identities=33% Similarity=0.980 Sum_probs=17.2
Q ss_pred eecCcCcCcCCCchHHHhhhcccC
Q psy11585 2032 FVCSFCGNTYKNPKSLDSHIRNSH 2055 (2227)
Q Consensus 2032 ~~C~~C~~~f~~~~~l~~H~~~~H 2055 (2227)
|.|++|++.|.+..+|..|++++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 578888888888888888887655
No 36
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=95.62 E-value=0.0053 Score=68.28 Aligned_cols=71 Identities=24% Similarity=0.468 Sum_probs=50.0
Q ss_pred Cceeeccc--ccccccchhHHHHHHhhhccccccCchhhhcccccceeeecccccccCCcccccccchhhhhhhhh
Q psy11585 100 GETFSCDE--CSKSFTTKKCLREHYKKLHTIRIRSSREENDMKKKTMVYVEGVVKYKCPECGFMVKRFQGLREHIV 173 (2227)
Q Consensus 100 ~~~~~C~~--C~~~F~~~~~l~~H~~~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~ 173 (2227)
++||+|++ |++.+.+...|.-|+..-|-..+ ..++....+........++|+|++|++.++...+|+-|..
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~---~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~ 419 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQK---LHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRK 419 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcc---cCCCCCccccccccccCCceeccccchhhccCccceeccc
Confidence 58999987 99999999999999877664332 2222222333334445677888888888888888887754
No 37
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.40 E-value=0.011 Score=46.96 Aligned_cols=41 Identities=20% Similarity=0.329 Sum_probs=23.6
Q ss_pred hhchhcccCCCCcccCCCCCccccchhhHhhHhhhhcCCCc
Q psy11585 1964 ENHMKAVHEKIRDFQCKVCDRAFFDVYNLKLHMRIHTGEKK 2004 (2227)
Q Consensus 1964 ~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~ 2004 (2227)
..+.+..+..+.|-.|++|+..+.+..+|++||.++++.+|
T Consensus 12 ~~~~k~~~~S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 12 TKKPKSKSQSEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp ----CCCCTTS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred hhHHHHhhccCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 45555555566777888888888888888888887777766
No 38
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=95.27 E-value=0.0087 Score=42.68 Aligned_cols=25 Identities=36% Similarity=0.745 Sum_probs=14.8
Q ss_pred cccccccccccccchHHHHHHHhcc
Q psy11585 1806 PFLCEFCNAGFKCRKHLLQHYRTHT 1830 (2227)
Q Consensus 1806 ~~~C~~C~~~f~~~~~l~~H~~~h~ 1830 (2227)
||.|+.|++.|.+...|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 4556666666666666666665543
No 39
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=95.26 E-value=0.0096 Score=42.45 Aligned_cols=25 Identities=36% Similarity=0.780 Sum_probs=16.9
Q ss_pred ccccccccccccChhhHhhhhhccC
Q psy11585 1665 KYVCQQCGASFTQWASLFYHKFSHS 1689 (2227)
Q Consensus 1665 ~~~C~~C~k~F~~~~~L~~H~~~H~ 1689 (2227)
||.|.+|++.|.+..+|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 4667777777777777777766654
No 40
>KOG2482|consensus
Probab=95.09 E-value=0.021 Score=64.56 Aligned_cols=186 Identities=19% Similarity=0.323 Sum_probs=102.8
Q ss_pred HHHHhhhhccCC-Cceeeccccccccc-chhHHHHHHhhhccccccCchhhhcccccceeee--------cccccccCCc
Q psy11585 88 AYLKKHVRDNHS-GETFSCDECSKSFT-TKKCLREHYKKLHTIRIRSSREENDMKKKTMVYV--------EGVVKYKCPE 157 (2227)
Q Consensus 88 ~~L~~H~~~~~~-~~~~~C~~C~~~F~-~~~~l~~H~~~~H~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~C~~ 157 (2227)
..|.++++...+ ...-.|-.|...+. +++.+..|+-..|...+-..-+ .++. +....++|-.
T Consensus 129 eaLeqqQ~Eredt~fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDn--------iVyvnelLehLkekL~r~~CLy 200 (423)
T KOG2482|consen 129 EALEQQQKEREDTIFSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDN--------IVYVNELLEHLKEKLERLRCLY 200 (423)
T ss_pred HHHHHHHHHhcCCeeeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcc--------eeeHHHHHHHHHHHHhhheeee
Confidence 445566654422 12468999998775 8999999998888765432111 1211 1223478888
Q ss_pred ccccccchhhhhhhhhh-cccccCCCcCccccccccchhhhhh--cccccccccccccccC-CCCCccchhhhcccCccc
Q psy11585 158 CGFMVKRFQGLREHIVS-VHAQVKDHVCIVCGAAFGLARRLKT--HYIRRHTVNILTQANH-DNEDKLDVTKIFNVNKED 233 (2227)
Q Consensus 158 C~~~f~~~~~L~~H~~~-~H~~~~~~~C~~C~~~f~~~~~l~~--H~~~~H~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 233 (2227)
|.++|..+..|+.||+. .|....|-. ..-++ |.--..|.. -..-+|...- .+.. +.+...+....|+.++.
T Consensus 201 CekifrdkntLkeHMrkK~HrrinPkn-reYDk-fyiINY~ev~ks~t~~~~e~d--ret~~d~~E~D~~wsDw~ed~a- 275 (423)
T KOG2482|consen 201 CEKIFRDKNTLKEHMRKKRHRRINPKN-REYDK-FYIINYLEVGKSWTIVHSEDD--RETNEDINETDDTWSDWNEDDA- 275 (423)
T ss_pred eccccCCcHHHHHHHHhccCcccCCCc-cccce-EEEEeHhhcCCccchhhhhhh--hhhhccccccccchhhhhcCCC-
Confidence 88888888888888874 354433310 00011 110000000 0000110000 0000 01111111222222211
Q ss_pred cccccCcccccccCCCCCcccChhhHHHHHhhccC---------------------------CCcccccccccccCChHH
Q psy11585 234 CQIMQGEKVKFKCPECPRSYGNFSELKKHLAVHTG---------------------------EKHFVCSVCQRGFFMKNR 286 (2227)
Q Consensus 234 ~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~---------------------------~~~~~C~~C~~~f~~~~~ 286 (2227)
+.....|-.|.....+...|..||.+-+. ...-.|..|+-.|.....
T Consensus 276 ------~a~~v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~ 349 (423)
T KOG2482|consen 276 ------EALSVVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPG 349 (423)
T ss_pred ------CccceEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcch
Confidence 12236899999999999999999976432 112369999999999999
Q ss_pred HHHHHH
Q psy11585 287 LNEHYK 292 (2227)
Q Consensus 287 l~~H~~ 292 (2227)
|..||.
T Consensus 350 l~~hm~ 355 (423)
T KOG2482|consen 350 LLIHMV 355 (423)
T ss_pred hhhhcc
Confidence 999984
No 41
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=95.08 E-value=0.014 Score=40.20 Aligned_cols=23 Identities=43% Similarity=0.844 Sum_probs=10.3
Q ss_pred eecccccccccchhHHHHHHhhh
Q psy11585 103 FSCDECSKSFTTKKCLREHYKKL 125 (2227)
Q Consensus 103 ~~C~~C~~~F~~~~~l~~H~~~~ 125 (2227)
|.|++|++.|.+...|..|+.++
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 34555555555555555554433
No 42
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.07 E-value=0.021 Score=45.52 Aligned_cols=42 Identities=26% Similarity=0.486 Sum_probs=26.8
Q ss_pred hhchhhccccCccccccCccccccCCHHHHHHHHHhhcCCCC
Q psy11585 1389 LTEHKKTHVLSDVKHVCNTCGNEYNTRKNLLSHMKIHSTGRP 1430 (2227)
Q Consensus 1389 L~~H~~~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~ 1430 (2227)
+..+.+.+...+.|..|++|+..+....+|++|+.++++.+|
T Consensus 11 ~~~~~k~~~~S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 11 STKKPKSKSQSEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp -----CCCCTTS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred hhhHHHHhhccCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 355666666678889999999999999999999988877665
No 43
>KOG2785|consensus
Probab=94.95 E-value=0.028 Score=65.58 Aligned_cols=178 Identities=22% Similarity=0.360 Sum_probs=100.2
Q ss_pred eeecccccccccchhHHHHHHhhh-cccccc---------Cch--hhh---cccccceeeecccccccCCcccccccchh
Q psy11585 102 TFSCDECSKSFTTKKCLREHYKKL-HTIRIR---------SSR--EEN---DMKKKTMVYVEGVVKYKCPECGFMVKRFQ 166 (2227)
Q Consensus 102 ~~~C~~C~~~F~~~~~l~~H~~~~-H~~~~~---------~~~--~~~---~~~~~~~~~~~~~~~~~C~~C~~~f~~~~ 166 (2227)
.|.|..|...|.+...-+.|+++. |.+++. ... +.. +...+......++.++.|.+|.+.|.+..
T Consensus 3 ~ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~c~k~~~s~~ 82 (390)
T KOG2785|consen 3 GFTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEACNKSFASPK 82 (390)
T ss_pred cceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHHhhccccChh
Confidence 488889999998888888888752 222111 000 000 00111111234566799999999999999
Q ss_pred hhhhhhhh-cccccCCCcCccccccccchhhhhhccccccccc--ccc---------ccc-------CCCCCccchhhhc
Q psy11585 167 GLREHIVS-VHAQVKDHVCIVCGAAFGLARRLKTHYIRRHTVN--ILT---------QAN-------HDNEDKLDVTKIF 227 (2227)
Q Consensus 167 ~L~~H~~~-~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~--~~~---------~~~-------~~~~~~~~~~~~~ 227 (2227)
.-..|+.. .|. ..+..|.+..+... +.. +.. .+.....+.....
T Consensus 83 a~~~hl~Sk~h~-----------------~~~~~~~r~~e~d~a~~~q~~~~~p~~l~~~~e~e~~~~E~~~~~d~~~e~ 145 (390)
T KOG2785|consen 83 AHENHLKSKKHV-----------------ENLSNHQRSEEGDSAKISQLPSRRPSNLQNKGESELKWYEVDSDEDSSEEE 145 (390)
T ss_pred hHHHHHHHhhcc-----------------hhhhhhhccccccchhhhhccccCccccccCCCcccchhhcccccccchhh
Confidence 88888864 121 12222222111100 000 000 0000000111111
Q ss_pred ccCccccccc-----cCcccccccCCCCCcccChhhHHHHHhhccCC-----------------------Ccccccccc-
Q psy11585 228 NVNKEDCQIM-----QGEKVKFKCPECPRSYGNFSELKKHLAVHTGE-----------------------KHFVCSVCQ- 278 (2227)
Q Consensus 228 ~~~~~~~~~~-----~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~-----------------------~~~~C~~C~- 278 (2227)
..+++...+. ..+..|-.|-.|++.+.+...-..||..++|. .-|.|-.|+
T Consensus 146 ~~dd~~Edi~~d~~~e~e~~Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~ 225 (390)
T KOG2785|consen 146 EEDDEEEDIEEDGDDEDELIPTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNE 225 (390)
T ss_pred ccCcchhhhhhccchhcccCCcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEecc
Confidence 1111111111 12234678999999999999999999988762 237899999
Q ss_pred --cccCChHHHHHHHHHhcc
Q psy11585 279 --RGFFMKNRLNEHYKRVHH 296 (2227)
Q Consensus 279 --~~f~~~~~l~~H~~~~H~ 296 (2227)
+.|.+....+.||...-+
T Consensus 226 ~~~~f~sleavr~HM~~K~H 245 (390)
T KOG2785|consen 226 LGRPFSSLEAVRAHMRDKGH 245 (390)
T ss_pred ccCcccccHHHHHHHhhccC
Confidence 999999999999976544
No 44
>KOG2231|consensus
Probab=94.79 E-value=0.057 Score=69.15 Aligned_cols=47 Identities=21% Similarity=0.633 Sum_probs=24.1
Q ss_pred ccCcCCCccCChhHHHHHhhhcCCCCcccccCccc------CcccChhHHHHHHHHcC
Q psy11585 435 PCTYCERKYQSPKTLKEHLKVHTSGDVRHICQTCG------SEFHTRKNLLTHIRTHN 486 (2227)
Q Consensus 435 ~C~~C~k~f~~~~~L~~H~~~h~~~~~~~~C~~C~------~~F~~~~~L~~H~~~H~ 486 (2227)
.|..|...|.....|..|++.++ |.|-+|. .-|.....|..|.+.+|
T Consensus 184 ~C~~C~~~fld~~el~rH~~~~h-----~~chfC~~~~~~neyy~~~~dLe~HfR~~H 236 (669)
T KOG2231|consen 184 LCKFCHERFLDDDELYRHLRFDH-----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH 236 (669)
T ss_pred cchhhhhhhccHHHHHHhhccce-----eheeecCcccccchhcccchHHHHHhhhcC
Confidence 35555555555555555555544 3344442 33445555555555543
No 45
>KOG2231|consensus
Probab=94.63 E-value=0.052 Score=69.51 Aligned_cols=39 Identities=31% Similarity=0.632 Sum_probs=26.8
Q ss_pred ccCCCCCCcccCccccccccccccCccceecCCCCccccchhHHHHHHHHhc
Q psy11585 549 YKCPLCDRIYTSFSETKRHFEVHSGERKYTCSICSKCFFIKNRLSEHYRRVH 600 (2227)
Q Consensus 549 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~k~f~~~~~L~~H~~~~H 600 (2227)
+.|.+|++.|.... ..-.|.+| -.|.+...|++|+...|
T Consensus 100 ~~C~~C~~~~~~~~------------~~~~~~~c-~~~~s~~~Lk~H~~~~H 138 (669)
T KOG2231|consen 100 HSCHICDRRFRALY------------NKKECLHC-TEFKSVENLKNHMRDQH 138 (669)
T ss_pred hhcCccccchhhhc------------ccCCCccc-cchhHHHHHHHHHHHhh
Confidence 57888887764221 12357777 77778888888887777
No 46
>PRK04860 hypothetical protein; Provisional
Probab=93.19 E-value=0.043 Score=58.27 Aligned_cols=39 Identities=31% Similarity=0.781 Sum_probs=33.2
Q ss_pred cccccccCcCccCCcchHHHHHhhhcCCCccccccccccccCh
Q psy11585 1636 KPYECDTCGHGLSSKKSLDDHYRIHTGEKKYVCQQCGASFTQW 1678 (2227)
Q Consensus 1636 ~~~~C~~C~~~f~~~~~L~~H~r~H~~ek~~~C~~C~k~F~~~ 1678 (2227)
-+|.|. |+. ....+.+|.++|+|+++|.|..|+..|...
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEe
Confidence 468997 887 778889999999999999999999888654
No 47
>PRK04860 hypothetical protein; Provisional
Probab=92.95 E-value=0.057 Score=57.38 Aligned_cols=40 Identities=33% Similarity=0.791 Sum_probs=33.7
Q ss_pred CcccCCCCCccccchhhHhhHhhhhcCCCcccCCCcCcccccch
Q psy11585 1975 RDFQCKVCDRAFFDVYNLKLHMRIHTGEKKYVCETCGASFVHWG 2018 (2227)
Q Consensus 1975 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~ 2018 (2227)
-+|.|. |+. ....+++|+++|+|+++|.|..|+..|....
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~ 157 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKG 157 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEec
Confidence 368898 887 7788899999999999999999998887654
No 48
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=92.86 E-value=0.049 Score=37.53 Aligned_cols=24 Identities=29% Similarity=0.427 Sum_probs=16.8
Q ss_pred cccCcccccccChhhHhhhhhhccc
Q psy11585 2092 HSCQKCEESFDNCNNLWSHMFIKHE 2116 (2227)
Q Consensus 2092 ~~C~~C~~~f~~~~~l~~H~~~~~~ 2116 (2227)
|+|+.|++... ++.|.+|++.+|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 57888888887 7788888887664
No 49
>smart00355 ZnF_C2H2 zinc finger.
Probab=92.54 E-value=0.095 Score=36.72 Aligned_cols=22 Identities=55% Similarity=0.957 Sum_probs=10.2
Q ss_pred cccchhhcccChHHHHHHHhhh
Q psy11585 1780 ICEICGKSFKKKDLLREHMIVH 1801 (2227)
Q Consensus 1780 ~C~~C~k~f~~~~~l~~H~~~H 1801 (2227)
.|+.|+++|.....|..|++.|
T Consensus 2 ~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 2 RCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCcchhCCHHHHHHHHHHh
Confidence 3444444444444444444433
No 50
>smart00355 ZnF_C2H2 zinc finger.
Probab=92.09 E-value=0.13 Score=35.93 Aligned_cols=24 Identities=29% Similarity=0.754 Sum_probs=16.9
Q ss_pred cccCccccccCCHHHHHHHHHhhc
Q psy11585 1403 HVCNTCGNEYNTRKNLLSHMKIHS 1426 (2227)
Q Consensus 1403 ~~C~~C~~~f~~~~~L~~H~~~H~ 1426 (2227)
|.|+.|++.|.....|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 457777777777777777777654
No 51
>KOG2785|consensus
Probab=91.87 E-value=0.23 Score=58.39 Aligned_cols=75 Identities=16% Similarity=0.111 Sum_probs=41.9
Q ss_pred CCccCCccccccCChHHHHHHHHHhcCCCCcccCCCCCccCChhHHHHHHhhhcCCCccccCCcC---CccccHhHHHHH
Q psy11585 348 KPYTCEACGKSFPLKRRLNAHYNKWHLGKGYRCHICGSTMSNAANFKDHLDSHRGEKKYTCETCG---TGFAYKSSLYHH 424 (2227)
Q Consensus 348 ~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~---~~f~~~~~L~~H 424 (2227)
-|-.|-.|++.|.+...-..||..+|+-- =+.=.+ ......|..=+..- -..-|.|-.|+ +.|.+-.+.+.|
T Consensus 165 ~Pt~CLfC~~~~k~~e~~~~HM~~~Hgff---IPdreY-L~D~~GLl~YLgeK-V~~~~~CL~CN~~~~~f~sleavr~H 239 (390)
T KOG2785|consen 165 IPTDCLFCDKKSKSLEENLKHMFKEHGFF---IPDREY-LTDEKGLLKYLGEK-VGIGFICLFCNELGRPFSSLEAVRAH 239 (390)
T ss_pred CCcceeecCCCcccHHHHHHHHhhccCCc---CCchHh-hhchhHHHHHHHHH-hccCceEEEeccccCcccccHHHHHH
Confidence 35678888888888888888888777521 000000 01111111111111 11346777777 888888888888
Q ss_pred Hhh
Q psy11585 425 RFT 427 (2227)
Q Consensus 425 ~~~ 427 (2227)
|..
T Consensus 240 M~~ 242 (390)
T KOG2785|consen 240 MRD 242 (390)
T ss_pred Hhh
Confidence 865
No 52
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=91.25 E-value=0.14 Score=35.32 Aligned_cols=24 Identities=29% Similarity=0.387 Sum_probs=19.8
Q ss_pred ccccccccccCCHHHHHHHHhhhcC
Q psy11585 891 HSCIYCEESFSDSKFLDAHVNIEHG 915 (2227)
Q Consensus 891 ~~C~~C~~~f~~~~~L~~H~~~~h~ 915 (2227)
|.|..|+++.. ...|.+|++++|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 68999999998 9999999998874
No 53
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=90.50 E-value=0.14 Score=35.67 Aligned_cols=22 Identities=32% Similarity=0.863 Sum_probs=12.4
Q ss_pred ccCCCCCccccchhhHhhHhhh
Q psy11585 1977 FQCKVCDRAFFDVYNLKLHMRI 1998 (2227)
Q Consensus 1977 ~~C~~C~~~f~~~~~l~~H~~~ 1998 (2227)
|.|++|++.|.+...|+.|++.
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 3455555555555555555553
No 54
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=90.48 E-value=0.15 Score=35.54 Aligned_cols=20 Identities=45% Similarity=1.001 Sum_probs=9.7
Q ss_pred cccchhhcccChHHHHHHHh
Q psy11585 1780 ICEICGKSFKKKDLLREHMI 1799 (2227)
Q Consensus 1780 ~C~~C~k~f~~~~~l~~H~~ 1799 (2227)
.|.+|++.|.+...|+.|++
T Consensus 2 ~C~~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 2 YCDICNKSFSSENSLRQHLR 21 (25)
T ss_dssp EETTTTEEESSHHHHHHHHT
T ss_pred CCCCCCCCcCCHHHHHHHHC
Confidence 34444444444444444444
No 55
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=90.10 E-value=0.2 Score=66.02 Aligned_cols=149 Identities=24% Similarity=0.410 Sum_probs=103.4
Q ss_pred cccCCCCCccccCChHHHHHHHH--hccCC--Cccccc--cCccccCCchhhhhchhcccCCCCcccCCC--CCccccch
Q psy11585 1918 IRFKCPDCPTILQTFRGLKAHLD--IHSGE--KDYACH--ICNKVFVRHSTLENHMKAVHEKIRDFQCKV--CDRAFFDV 1989 (2227)
Q Consensus 1918 ~~~~C~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~--C~~~f~~~ 1989 (2227)
.++.|..|...|.....|..|.+ .|+++ +++.|+ .|++.|.+...|..|+.. |.+..++.+.. +...+...
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 366 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILL-HTSISPAKEKLLNSSSKFSPL 366 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCccc-ccCCCccccccccCccccccc
Confidence 35888999999999999999998 78888 889998 799999999999999887 88887777665 44444433
Q ss_pred hhH-----hhHhhhhcCCCcccCCC--cCcccccchhhhhhcccccC---CceecCcCcCcCCCchHHHhhhcccCCCCC
Q psy11585 1990 YNL-----KLHMRIHTGEKKYVCET--CGASFVHWGSLNIHNYSHIN---AQFVCSFCGNTYKNPKSLDSHIRNSHTNRK 2059 (2227)
Q Consensus 1990 ~~l-----~~H~~~h~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~~---~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~ 2059 (2227)
..- ......-.....+.+.. |-..+.....+..|...|.. ..+.+..|.+.|.....|..|++ .|....
T Consensus 367 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 445 (467)
T COG5048 367 LNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKK-IHTNHA 445 (467)
T ss_pred cCCCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCccccccccc-ccccCC
Confidence 331 11111222345555543 66777777777777777764 35677778888888888888877 455555
Q ss_pred ccCCCccCC
Q psy11585 2060 KSICDDCTK 2068 (2227)
Q Consensus 2060 ~~~C~~C~~ 2068 (2227)
++.|..+..
T Consensus 446 ~~~~~~~~~ 454 (467)
T COG5048 446 PLLCSILKS 454 (467)
T ss_pred ceeeccccc
Confidence 554444333
No 56
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=89.89 E-value=0.33 Score=63.91 Aligned_cols=158 Identities=22% Similarity=0.328 Sum_probs=87.5
Q ss_pred CccccccCccccCCchhhhhchhc-ccCCC--CcccCC--CCCccccchhhHhhHhhhhcCCCcccCCCc--Ccccccch
Q psy11585 1946 KDYACHICNKVFVRHSTLENHMKA-VHEKI--RDFQCK--VCDRAFFDVYNLKLHMRIHTGEKKYVCETC--GASFVHWG 2018 (2227)
Q Consensus 1946 ~~~~C~~C~~~f~~~~~L~~H~~~-~H~~~--~~~~C~--~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C--~~~f~~~~ 2018 (2227)
.++.|..|...|.....|..|.+. .|+++ +|+.|+ .|++.|.....|..|..+|++-.++.+..- ...+....
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence 356667777777777777777663 46666 666666 566666666666666666666666555432 22222211
Q ss_pred hhhhhcccccCCceecCcCcCcCCCchHHHhhhcccCCCCCccCCC--ccCCccCCCCCCCccccccccCCCCCccccCc
Q psy11585 2019 SLNIHNYSHINAQFVCSFCGNTYKNPKSLDSHIRNSHTNRKKSICD--DCTKAMSTPAPSSKSVCIEHSNLIPKCHSCQK 2096 (2227)
Q Consensus 2019 ~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~--~C~~~f~~~~~~~~h~~~~h~~~~~~~~~C~~ 2096 (2227)
.-..+ ........-...+.+.+. .|...+.....+..|. ..|.......+.++.
T Consensus 368 ~~~~~-----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 423 (467)
T COG5048 368 NNEPP-----------------------QSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHI-ITHLSFRPYNCKNPP 423 (467)
T ss_pred CCCCc-----------------------cchhhccCccCCccccccccchhhhhcccccccccc-ccccccCCcCCCCCc
Confidence 10000 001111111122233333 2444444333333332 234444455677788
Q ss_pred ccccccChhhHhhhhhhcccCCccccCCCCC
Q psy11585 2097 CEESFDNCNNLWSHMFIKHENSDFVCNLCPP 2127 (2227)
Q Consensus 2097 C~~~f~~~~~l~~H~~~~~~~~~~~c~~c~~ 2127 (2227)
|...|.....|..|++.+....++.|.-+..
T Consensus 424 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (467)
T COG5048 424 CSKSFNRHYNLIPHKKIHTNHAPLLCSILKS 454 (467)
T ss_pred chhhccCcccccccccccccCCceeeccccc
Confidence 8888888888888888888888888776654
No 57
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=89.57 E-value=0.34 Score=55.01 Aligned_cols=37 Identities=27% Similarity=0.532 Sum_probs=19.0
Q ss_pred ccChhHHHHHHHHcCCCCcccccc--cc----ccccChHHHHHHHHh
Q psy11585 472 FHTRKNLLTHIRTHNTDRTHVCEL--CN----ANLKTRRSLLRHYTT 512 (2227)
Q Consensus 472 F~~~~~L~~H~~~H~~~~~~~C~~--C~----~~f~~~~~L~~H~~~ 512 (2227)
|.....|..|.+.- -|.|.. |- ..|+....|..|+..
T Consensus 261 FK~Y~~Le~HF~~~----hy~ct~qtc~~~k~~vf~~~~el~~h~~~ 303 (493)
T COG5236 261 FKSYEDLEAHFRNA----HYCCTFQTCRVGKCYVFPYHTELLEHLTR 303 (493)
T ss_pred hhCHHHHHHHhhcC----ceEEEEEEEecCcEEEeccHHHHHHHHHH
Confidence 55555565555432 233432 21 356666777777644
No 58
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=88.49 E-value=0.26 Score=35.05 Aligned_cols=22 Identities=36% Similarity=0.869 Sum_probs=13.8
Q ss_pred cccccCccccCCchhhhhchhc
Q psy11585 1948 YACHICNKVFVRHSTLENHMKA 1969 (2227)
Q Consensus 1948 ~~C~~C~~~f~~~~~L~~H~~~ 1969 (2227)
|.|..|++.|.+...|+.|+++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 4566666666666666666654
No 59
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=88.19 E-value=0.55 Score=53.47 Aligned_cols=35 Identities=26% Similarity=0.636 Sum_probs=27.4
Q ss_pred ccccCC--cccccccchhhhhhhhhhcccccCCCcCcccc
Q psy11585 151 VKYKCP--ECGFMVKRFQGLREHIVSVHAQVKDHVCIVCG 188 (2227)
Q Consensus 151 ~~~~C~--~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~ 188 (2227)
..|.|| .|..+......|..|++..|.. +.|.+|-
T Consensus 150 L~F~CP~skc~~~C~~~k~lk~H~K~~H~~---~~C~~C~ 186 (493)
T COG5236 150 LSFKCPKSKCHRRCGSLKELKKHYKAQHGF---VLCSECI 186 (493)
T ss_pred HHhcCCchhhhhhhhhHHHHHHHHHhhcCc---EEhHhhh
Confidence 459998 5888888899999999977763 5677773
No 60
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=87.54 E-value=0.32 Score=34.61 Aligned_cols=22 Identities=23% Similarity=0.750 Sum_probs=16.1
Q ss_pred eecCcCcCcCCCchHHHhhhcc
Q psy11585 2032 FVCSFCGNTYKNPKSLDSHIRN 2053 (2227)
Q Consensus 2032 ~~C~~C~~~f~~~~~l~~H~~~ 2053 (2227)
|.|..|++.|.+...|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 6677777777777777777763
No 61
>KOG4173|consensus
Probab=84.51 E-value=0.65 Score=49.11 Aligned_cols=92 Identities=27% Similarity=0.603 Sum_probs=69.8
Q ss_pred CCcccccCCCCCCCccccChhHHHHHHhhcCCCCCccCCccccccCChHHHHHHHHHhcCCCCcccCCCCCccCChhHHH
Q psy11585 315 DGVRKYKCPHPGCPSSFQRFNALQEHMLSHTGEKPYTCEACGKSFPLKRRLNAHYNKWHLGKGYRCHICGSTMSNAANFK 394 (2227)
Q Consensus 315 ~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~l~ 394 (2227)
..++.+.|+..+|...|.+......|...-++. .|..|.+.|++...|..|+..+|.. -++
T Consensus 75 ~~~~~~~cqvagc~~~~d~lD~~E~hY~~~h~~---sCs~C~r~~Pt~hLLd~HI~E~HDs----------------~Fq 135 (253)
T KOG4173|consen 75 PRVPAFACQVAGCCQVFDALDDYEHHYHTLHGN---SCSFCKRAFPTGHLLDAHILEWHDS----------------LFQ 135 (253)
T ss_pred cccccccccccchHHHHhhhhhHHHhhhhcccc---hhHHHHHhCCchhhhhHHHHHHHHH----------------HHH
Confidence 445778999999999999999888887654433 6999999999999999999887721 111
Q ss_pred HHHhhhcCCCcccc--CCcCCccccHhHHHHHHhh
Q psy11585 395 DHLDSHRGEKKYTC--ETCGTGFAYKSSLYHHRFT 427 (2227)
Q Consensus 395 ~H~~~h~~~k~~~C--~~C~~~f~~~~~L~~H~~~ 427 (2227)
..+-.|.-.|+| +.|+..|.+...-..|+..
T Consensus 136 --a~veRG~dMy~ClvEgCt~KFkT~r~RkdH~I~ 168 (253)
T KOG4173|consen 136 --ALVERGQDMYQCLVEGCTEKFKTSRDRKDHMIR 168 (253)
T ss_pred --HHHHcCccHHHHHHHhhhhhhhhhhhhhhHHHH
Confidence 122345556888 7788888888888888754
No 62
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=78.28 E-value=3.8 Score=40.24 Aligned_cols=47 Identities=17% Similarity=0.272 Sum_probs=21.1
Q ss_pred CCCCCCccCChhHHHHHHHhcCCCCccccccCccccCCHHHHHHHHHHh
Q psy11585 712 CEICGTGFMYKSSLHHHKFSHSKERMFQCSFCEKKYMSPKTLKEHEQTH 760 (2227)
Q Consensus 712 C~~C~~~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h 760 (2227)
|--|...|....... ...-.....|+|+.|...|=..-++-.|...|
T Consensus 58 C~~C~~~f~~~~~~~--~~~~~~~~~y~C~~C~~~FC~dCD~fiHe~Lh 104 (112)
T TIGR00622 58 CFGCQGPFPKPPVSP--FDELKDSHRYVCAVCKNVFCVDCDVFVHESLH 104 (112)
T ss_pred ccCcCCCCCCccccc--ccccccccceeCCCCCCccccccchhhhhhcc
Confidence 666666665432110 00011123455555555555555555554444
No 63
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=77.00 E-value=1.1 Score=31.12 Aligned_cols=18 Identities=39% Similarity=0.861 Sum_probs=8.3
Q ss_pred ccccCccccCCchhhhhch
Q psy11585 1949 ACHICNKVFVRHSTLENHM 1967 (2227)
Q Consensus 1949 ~C~~C~~~f~~~~~L~~H~ 1967 (2227)
+|+.||+.| ....|..|+
T Consensus 4 ~C~~CgR~F-~~~~l~~H~ 21 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHE 21 (25)
T ss_pred cCCCCCCEE-CHHHHHHHH
Confidence 344444444 344444443
No 64
>KOG2893|consensus
Probab=76.58 E-value=0.8 Score=49.32 Aligned_cols=51 Identities=25% Similarity=0.472 Sum_probs=45.6
Q ss_pred cCCcccccccCHHHHHhhhhccCCCceeecccccccccchhHHHHHHhhhccccc
Q psy11585 76 QCPDCHTMMKNFAYLKKHVRDNHSGETFSCDECSKSFTTKKCLREHYKKLHTIRI 130 (2227)
Q Consensus 76 ~C~~C~~~f~~~~~L~~H~~~~~~~~~~~C~~C~~~F~~~~~l~~H~~~~H~~~~ 130 (2227)
=|=+|++.|.....|++|++..| |+|-+|.+...+--.|..|....|...|
T Consensus 12 wcwycnrefddekiliqhqkakh----fkchichkkl~sgpglsihcmqvhketi 62 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKAKH----FKCHICHKKLFSGPGLSIHCMQVHKETI 62 (341)
T ss_pred eeeecccccchhhhhhhhhhhcc----ceeeeehhhhccCCCceeehhhhhhhhh
Confidence 38999999999999999999766 9999999999899999999998886543
No 65
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=76.34 E-value=2.1 Score=29.83 Aligned_cols=21 Identities=38% Similarity=0.808 Sum_probs=15.5
Q ss_pred cccCCCCCcccChhhHHHHHhh
Q psy11585 244 FKCPECPRSYGNFSELKKHLAV 265 (2227)
Q Consensus 244 ~~C~~C~~~f~~~~~L~~H~~~ 265 (2227)
..|+.|++.| ....|.+|+++
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHh
Confidence 4688888888 66777788653
No 66
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=75.90 E-value=1.3 Score=36.16 Aligned_cols=30 Identities=27% Similarity=0.487 Sum_probs=18.4
Q ss_pred CCCceeecccccccccchhHHHHHHhhhcc
Q psy11585 98 HSGETFSCDECSKSFTTKKCLREHYKKLHT 127 (2227)
Q Consensus 98 ~~~~~~~C~~C~~~F~~~~~l~~H~~~~H~ 127 (2227)
.|+..+.|+-|+..|..+.+...|+.+.|.
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 345556666666666666666666666553
No 67
>KOG2893|consensus
Probab=74.48 E-value=1.2 Score=48.15 Aligned_cols=42 Identities=31% Similarity=0.608 Sum_probs=27.0
Q ss_pred ecCCCCccccchhHHHHHHHHhccccccccccccccccccccccCcCccccCCCCccccchhhHHhh
Q psy11585 578 TCSICSKCFFIKNRLSEHYRRVHKMRVSMARTNDVKKSAEISVDGVTKYKCHICDSIFTRYDSLRLH 644 (2227)
Q Consensus 578 ~C~~C~k~f~~~~~L~~H~~~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H 644 (2227)
.|.+|++.|....-|.+|++..| |+|.+|.+..-+--.|..|
T Consensus 12 wcwycnrefddekiliqhqkakh-------------------------fkchichkkl~sgpglsih 53 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKAKH-------------------------FKCHICHKKLFSGPGLSIH 53 (341)
T ss_pred eeeecccccchhhhhhhhhhhcc-------------------------ceeeeehhhhccCCCceee
Confidence 47777777777777777766444 6677776665555555555
No 68
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=73.94 E-value=5.7 Score=39.92 Aligned_cols=27 Identities=26% Similarity=0.359 Sum_probs=25.1
Q ss_pred Ccccc----cccccccCCHHHHHHHHhhhcC
Q psy11585 889 KTHSC----IYCEESFSDSKFLDAHVNIEHG 915 (2227)
Q Consensus 889 ~~~~C----~~C~~~f~~~~~L~~H~~~~h~ 915 (2227)
..|.| ..|++.+.+...|+.|++.+||
T Consensus 79 ~G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 79 DGYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CCeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 56999 9999999999999999999885
No 69
>KOG4173|consensus
Probab=73.69 E-value=1 Score=47.78 Aligned_cols=93 Identities=20% Similarity=0.427 Sum_probs=69.2
Q ss_pred ccccCCc--ccccccCHHHHHhhhhccCCCceeecccccccccchhHHHHHHhhhccccccCchhhhcccccceeeeccc
Q psy11585 73 IKFQCPD--CHTMMKNFAYLKKHVRDNHSGETFSCDECSKSFTTKKCLREHYKKLHTIRIRSSREENDMKKKTMVYVEGV 150 (2227)
Q Consensus 73 ~~~~C~~--C~~~f~~~~~L~~H~~~~~~~~~~~C~~C~~~F~~~~~l~~H~~~~H~~~~~~~~~~~~~~~~~~~~~~~~ 150 (2227)
-.|.|+. |..+|++.+....|+..-|++. |.+|...|++..-|..|+..-|.-.--.. +.-|.
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~~s---Cs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~------------veRG~ 142 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHGNS---CSFCKRAFPTGHLLDAHILEWHDSLFQAL------------VERGQ 142 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhcccch---hHHHHHhCCchhhhhHHHHHHHHHHHHHH------------HHcCc
Confidence 3577875 8899999999999999888864 99999999999999999877764221000 01122
Q ss_pred ccccCC--cccccccchhhhhhhhhhcccccC
Q psy11585 151 VKYKCP--ECGFMVKRFQGLREHIVSVHAQVK 180 (2227)
Q Consensus 151 ~~~~C~--~C~~~f~~~~~L~~H~~~~H~~~~ 180 (2227)
-.|+|- .|+..|+...+-..|+...|.-.-
T Consensus 143 dMy~ClvEgCt~KFkT~r~RkdH~I~~Hk~Pa 174 (253)
T KOG4173|consen 143 DMYQCLVEGCTEKFKTSRDRKDHMIRMHKYPA 174 (253)
T ss_pred cHHHHHHHhhhhhhhhhhhhhhHHHHhccCCc
Confidence 348884 688888888888888887775443
No 70
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=72.29 E-value=1.4 Score=35.97 Aligned_cols=37 Identities=24% Similarity=0.486 Sum_probs=31.5
Q ss_pred hcccccccCcccccCCcccccccCHHHHHhhhhccCC
Q psy11585 63 EKSAVEIDGEIKFQCPDCHTMMKNFAYLKKHVRDNHS 99 (2227)
Q Consensus 63 ~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~~~ 99 (2227)
++......|+.-++||-|+++|.......+|+..-|+
T Consensus 6 A~Kv~~RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 6 AIKVRDRDGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred eeEeeccCCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 3444567889999999999999999999999987775
No 71
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=69.85 E-value=2.6 Score=31.67 Aligned_cols=10 Identities=50% Similarity=1.378 Sum_probs=4.3
Q ss_pred ccCCCcCccc
Q psy11585 2005 YVCETCGASF 2014 (2227)
Q Consensus 2005 ~~C~~C~~~f 2014 (2227)
|+|.+||+.+
T Consensus 2 ~~C~~CGy~y 11 (33)
T cd00350 2 YVCPVCGYIY 11 (33)
T ss_pred EECCCCCCEE
Confidence 3444444443
No 72
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=64.80 E-value=5 Score=30.57 Aligned_cols=21 Identities=24% Similarity=0.770 Sum_probs=11.7
Q ss_pred cccCCCCCccCChhHHHHHHh
Q psy11585 378 YRCHICGSTMSNAANFKDHLD 398 (2227)
Q Consensus 378 ~~C~~C~~~f~~~~~l~~H~~ 398 (2227)
|.|++|++.|.+...+..|++
T Consensus 4 ~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 4 FYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred eEccccCCccCCHHHHHHHHC
Confidence 455555555555555555554
No 73
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=64.36 E-value=4.6 Score=30.39 Aligned_cols=12 Identities=42% Similarity=1.257 Sum_probs=6.9
Q ss_pred eecCcCcCcCCC
Q psy11585 2032 FVCSFCGNTYKN 2043 (2227)
Q Consensus 2032 ~~C~~C~~~f~~ 2043 (2227)
|+|..||+.+..
T Consensus 2 ~~C~~CGy~y~~ 13 (33)
T cd00350 2 YVCPVCGYIYDG 13 (33)
T ss_pred EECCCCCCEECC
Confidence 566666665543
No 74
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=64.12 E-value=4.7 Score=30.74 Aligned_cols=24 Identities=25% Similarity=0.773 Sum_probs=18.2
Q ss_pred CceecCcCcCcCCCchHHHhhhcc
Q psy11585 2030 AQFVCSFCGNTYKNPKSLDSHIRN 2053 (2227)
Q Consensus 2030 ~~~~C~~C~~~f~~~~~l~~H~~~ 2053 (2227)
++|.|.+|+..|.....+..|++.
T Consensus 2 ~~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 2 GGFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred cCeEccccCCccCCHHHHHHHHCh
Confidence 357788888888888888888764
No 75
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=60.94 E-value=5.8 Score=27.97 Aligned_cols=24 Identities=25% Similarity=0.771 Sum_probs=16.7
Q ss_pred CCCccCCccCCCCCCCccccccccCCCCCccccCccccccc
Q psy11585 2062 ICDDCTKAMSTPAPSSKSVCIEHSNLIPKCHSCQKCEESFD 2102 (2227)
Q Consensus 2062 ~C~~C~~~f~~~~~~~~h~~~~h~~~~~~~~~C~~C~~~f~ 2102 (2227)
.||.|++.....+ -.||.||+.|.
T Consensus 2 ~CP~C~~~V~~~~-----------------~~Cp~CG~~F~ 25 (26)
T PF10571_consen 2 TCPECGAEVPESA-----------------KFCPHCGYDFE 25 (26)
T ss_pred cCCCCcCCchhhc-----------------CcCCCCCCCCc
Confidence 4788888765432 37888888885
No 76
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=60.64 E-value=4.5 Score=46.12 Aligned_cols=46 Identities=22% Similarity=0.328 Sum_probs=25.4
Q ss_pred CccCCCccCCccCCCCCCCccccccccCCC---------C---CccccCcccccccCh
Q psy11585 2059 KKSICDDCTKAMSTPAPSSKSVCIEHSNLI---------P---KCHSCQKCEESFDNC 2104 (2227)
Q Consensus 2059 ~~~~C~~C~~~f~~~~~~~~h~~~~h~~~~---------~---~~~~C~~C~~~f~~~ 2104 (2227)
|.+.||.|++.|.++..+...++....... | .+..||.|++.|...
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 455566666666655544444443222210 1 236899999888744
No 77
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=60.62 E-value=11 Score=37.18 Aligned_cols=20 Identities=35% Similarity=0.861 Sum_probs=12.0
Q ss_pred CCcccccchhhccCHHHHHH
Q psy11585 1288 KPFSCQVCGKSFAAREHLKR 1307 (2227)
Q Consensus 1288 ~~~~C~~C~~~f~~~~~L~~ 1307 (2227)
-|-.|+.||-......+|.+
T Consensus 14 LP~~CpiCgLtLVss~HLAR 33 (112)
T TIGR00622 14 LPVECPICGLTLILSTHLAR 33 (112)
T ss_pred CCCcCCcCCCEEeccchHHH
Confidence 45666666666665555554
No 78
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=57.10 E-value=5.1 Score=45.71 Aligned_cols=25 Identities=20% Similarity=0.575 Sum_probs=15.9
Q ss_pred CCccccccCccccCCchhhhhchhc
Q psy11585 1945 EKDYACHICNKVFVRHSTLENHMKA 1969 (2227)
Q Consensus 1945 ~~~~~C~~C~~~f~~~~~L~~H~~~ 1969 (2227)
++.++||+|++.|.+..-+...++.
T Consensus 3 ~k~~~CPvC~~~F~~~~vrs~~~r~ 27 (214)
T PF09986_consen 3 DKKITCPVCGKEFKTKKVRSGKIRV 27 (214)
T ss_pred CCceECCCCCCeeeeeEEEcCCceE
Confidence 3456777777777776655555543
No 79
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=53.18 E-value=12 Score=37.50 Aligned_cols=20 Identities=10% Similarity=0.117 Sum_probs=11.6
Q ss_pred ccccccccChhhHHHHhhcc
Q psy11585 1190 SDCDKTYTRFYELKCHLMVH 1209 (2227)
Q Consensus 1190 ~~C~k~f~~~~~L~~H~~~h 1209 (2227)
..|+..+.+...+..|++.+
T Consensus 88 ~~C~y~~~~~~~m~~H~~~~ 107 (109)
T PF12013_consen 88 PHCGYITRSKKTMRKHWRKE 107 (109)
T ss_pred CCCCcEeccHHHHHHHHHHh
Confidence 66666666666666555544
No 80
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=52.54 E-value=8.3 Score=29.28 Aligned_cols=10 Identities=40% Similarity=1.082 Sum_probs=4.4
Q ss_pred ccCCCcCccc
Q psy11585 2005 YVCETCGASF 2014 (2227)
Q Consensus 2005 ~~C~~C~~~f 2014 (2227)
|+|.+||+.+
T Consensus 3 ~~C~~CG~i~ 12 (34)
T cd00729 3 WVCPVCGYIH 12 (34)
T ss_pred EECCCCCCEe
Confidence 3444444443
No 81
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=48.46 E-value=11 Score=37.27 Aligned_cols=13 Identities=15% Similarity=0.631 Sum_probs=6.3
Q ss_pred ccCCCccCCccCC
Q psy11585 2060 KSICDDCTKAMST 2072 (2227)
Q Consensus 2060 ~~~C~~C~~~f~~ 2072 (2227)
|..||.||..|..
T Consensus 26 PivCP~CG~~~~~ 38 (108)
T PF09538_consen 26 PIVCPKCGTEFPP 38 (108)
T ss_pred CccCCCCCCccCc
Confidence 4445555555443
No 82
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=47.39 E-value=21 Score=30.62 Aligned_cols=10 Identities=40% Similarity=1.268 Sum_probs=5.4
Q ss_pred ceecCcCcCc
Q psy11585 2031 QFVCSFCGNT 2040 (2227)
Q Consensus 2031 ~~~C~~C~~~ 2040 (2227)
.|.||.||..
T Consensus 25 ~F~CPnCG~~ 34 (59)
T PRK14890 25 KFLCPNCGEV 34 (59)
T ss_pred EeeCCCCCCe
Confidence 4555555553
No 83
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=40.25 E-value=18 Score=30.85 Aligned_cols=36 Identities=22% Similarity=0.612 Sum_probs=23.9
Q ss_pred CcccccCCcccccccCHHHHHhhhhccCCCceeeccccccc
Q psy11585 71 GEIKFQCPDCHTMMKNFAYLKKHVRDNHSGETFSCDECSKS 111 (2227)
Q Consensus 71 ~~~~~~C~~C~~~f~~~~~L~~H~~~~~~~~~~~C~~C~~~ 111 (2227)
.-..|.||.||.+-..+-. +-..-+.+|.|+.||..
T Consensus 24 ~~v~F~CPnCGe~~I~Rc~-----~CRk~g~~Y~Cp~CGF~ 59 (61)
T COG2888 24 TAVKFPCPNCGEVEIYRCA-----KCRKLGNPYRCPKCGFE 59 (61)
T ss_pred ceeEeeCCCCCceeeehhh-----hHHHcCCceECCCcCcc
Confidence 3468999999975544222 22223478999999964
No 84
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=39.59 E-value=18 Score=29.79 Aligned_cols=13 Identities=31% Similarity=0.836 Sum_probs=7.9
Q ss_pred cccCCCcCccccc
Q psy11585 2004 KYVCETCGASFVH 2016 (2227)
Q Consensus 2004 ~~~C~~C~~~f~~ 2016 (2227)
.|.|+.||..|..
T Consensus 3 ~y~C~~CG~~~~~ 15 (46)
T PRK00398 3 EYKCARCGREVEL 15 (46)
T ss_pred EEECCCCCCEEEE
Confidence 4666666666644
No 85
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=37.96 E-value=12 Score=39.72 Aligned_cols=12 Identities=33% Similarity=0.636 Sum_probs=6.2
Q ss_pred cCCCccCCccCC
Q psy11585 2061 SICDDCTKAMST 2072 (2227)
Q Consensus 2061 ~~C~~C~~~f~~ 2072 (2227)
+.|+.||++|.+
T Consensus 29 ~~c~~c~~~f~~ 40 (154)
T PRK00464 29 RECLACGKRFTT 40 (154)
T ss_pred eeccccCCcceE
Confidence 445555555543
No 86
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=37.50 E-value=23 Score=28.80 Aligned_cols=11 Identities=36% Similarity=1.035 Sum_probs=6.0
Q ss_pred ccCCCcCcccc
Q psy11585 2005 YVCETCGASFV 2015 (2227)
Q Consensus 2005 ~~C~~C~~~f~ 2015 (2227)
|.|..||..|.
T Consensus 3 Y~C~~Cg~~~~ 13 (44)
T smart00659 3 YICGECGRENE 13 (44)
T ss_pred EECCCCCCEee
Confidence 55555555554
No 87
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=37.21 E-value=24 Score=26.45 Aligned_cols=11 Identities=45% Similarity=1.223 Sum_probs=5.6
Q ss_pred ccCCCcCcccc
Q psy11585 2005 YVCETCGASFV 2015 (2227)
Q Consensus 2005 ~~C~~C~~~f~ 2015 (2227)
|.|..||..+.
T Consensus 1 Y~C~~Cg~~~~ 11 (32)
T PF03604_consen 1 YICGECGAEVE 11 (32)
T ss_dssp EBESSSSSSE-
T ss_pred CCCCcCCCeeE
Confidence 45555655554
No 88
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=37.15 E-value=21 Score=35.48 Aligned_cols=31 Identities=19% Similarity=0.578 Sum_probs=22.8
Q ss_pred ccCCCCCccccCChHHHHHHHHhccCCCccccccCccccCCc
Q psy11585 1919 RFKCPDCPTILQTFRGLKAHLDIHSGEKDYACHICNKVFVRH 1960 (2227)
Q Consensus 1919 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 1960 (2227)
...|+.||+.|--.. ..|.+||.||..|.-.
T Consensus 9 KR~Cp~CG~kFYDLn-----------k~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 9 KRTCPSCGAKFYDLN-----------KDPIVCPKCGTEFPPE 39 (108)
T ss_pred cccCCCCcchhccCC-----------CCCccCCCCCCccCcc
Confidence 357888888886433 3577888888888765
No 89
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=35.00 E-value=20 Score=28.45 Aligned_cols=11 Identities=55% Similarity=1.355 Sum_probs=5.2
Q ss_pred ccCCCcCcccc
Q psy11585 2005 YVCETCGASFV 2015 (2227)
Q Consensus 2005 ~~C~~C~~~f~ 2015 (2227)
|.|..||..|.
T Consensus 6 y~C~~Cg~~fe 16 (41)
T smart00834 6 YRCEDCGHTFE 16 (41)
T ss_pred EEcCCCCCEEE
Confidence 44444444443
No 90
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=34.44 E-value=16 Score=42.48 Aligned_cols=21 Identities=19% Similarity=0.336 Sum_probs=14.0
Q ss_pred eecccchhhcCCchhHhhhcc
Q psy11585 1485 YECDICKKQVTNRKNMIDHQR 1505 (2227)
Q Consensus 1485 ~~C~~C~~~f~~~~~L~~H~~ 1505 (2227)
+=|..|++.|....-+..|..
T Consensus 239 ~YC~~C~r~f~~~~VFe~Hl~ 259 (470)
T COG5188 239 VYCVKCGREFSRSKVFEYHLE 259 (470)
T ss_pred eeeHhhhhHhhhhHHHHHHHh
Confidence 447777777777666666654
No 91
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=34.24 E-value=23 Score=37.95 Aligned_cols=11 Identities=55% Similarity=1.162 Sum_probs=5.0
Q ss_pred CCCcccCCCcC
Q psy11585 2001 GEKKYVCETCG 2011 (2227)
Q Consensus 2001 ~~~~~~C~~C~ 2011 (2227)
|+.|-+|++||
T Consensus 146 ge~P~~CPiCg 156 (166)
T COG1592 146 GEAPEVCPICG 156 (166)
T ss_pred CCCCCcCCCCC
Confidence 34444444444
No 92
>PRK04023 DNA polymerase II large subunit; Validated
Probab=33.85 E-value=35 Score=46.41 Aligned_cols=10 Identities=20% Similarity=0.740 Sum_probs=6.6
Q ss_pred cccCCCCCcc
Q psy11585 1976 DFQCKVCDRA 1985 (2227)
Q Consensus 1976 ~~~C~~C~~~ 1985 (2227)
...|+.||..
T Consensus 626 ~RfCpsCG~~ 635 (1121)
T PRK04023 626 RRKCPSCGKE 635 (1121)
T ss_pred CccCCCCCCc
Confidence 4567777765
No 93
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=33.31 E-value=29 Score=34.91 Aligned_cols=12 Identities=25% Similarity=0.733 Sum_probs=6.6
Q ss_pred ccCCCcCccccc
Q psy11585 2005 YVCETCGASFVH 2016 (2227)
Q Consensus 2005 ~~C~~C~~~f~~ 2016 (2227)
..|+.||++|.-
T Consensus 10 r~Cp~cg~kFYD 21 (129)
T TIGR02300 10 RICPNTGSKFYD 21 (129)
T ss_pred ccCCCcCccccc
Confidence 456666666543
No 94
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=32.50 E-value=25 Score=29.96 Aligned_cols=35 Identities=26% Similarity=0.437 Sum_probs=23.3
Q ss_pred cccccCCcccccccchhhhhhhhhhcccccCCCcCcccccc
Q psy11585 150 VVKYKCPECGFMVKRFQGLREHIVSVHAQVKDHVCIVCGAA 190 (2227)
Q Consensus 150 ~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~ 190 (2227)
-..|.|+.||.+...+-.--+ ....+|+|+.||+.
T Consensus 25 ~v~F~CPnCGe~~I~Rc~~CR------k~g~~Y~Cp~CGF~ 59 (61)
T COG2888 25 AVKFPCPNCGEVEIYRCAKCR------KLGNPYRCPKCGFE 59 (61)
T ss_pred eeEeeCCCCCceeeehhhhHH------HcCCceECCCcCcc
Confidence 356999999977655432221 12348999999974
No 95
>PHA00626 hypothetical protein
Probab=32.29 E-value=19 Score=30.15 Aligned_cols=12 Identities=33% Similarity=0.609 Sum_probs=5.2
Q ss_pred ccCCCccCCccC
Q psy11585 2060 KSICDDCTKAMS 2071 (2227)
Q Consensus 2060 ~~~C~~C~~~f~ 2071 (2227)
.|.|+.||..|+
T Consensus 23 rYkCkdCGY~ft 34 (59)
T PHA00626 23 DYVCCDCGYNDS 34 (59)
T ss_pred ceEcCCCCCeec
Confidence 344444444443
No 96
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=32.02 E-value=23 Score=29.24 Aligned_cols=12 Identities=33% Similarity=0.971 Sum_probs=6.9
Q ss_pred CcccCCCcCccc
Q psy11585 2003 KKYVCETCGASF 2014 (2227)
Q Consensus 2003 ~~~~C~~C~~~f 2014 (2227)
..|.|..||+.|
T Consensus 5 ~~Y~C~~Cg~~~ 16 (49)
T COG1996 5 MEYKCARCGREV 16 (49)
T ss_pred EEEEhhhcCCee
Confidence 345666666655
No 97
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=30.95 E-value=41 Score=26.02 Aligned_cols=35 Identities=20% Similarity=0.491 Sum_probs=24.3
Q ss_pred cccCCcccccccCHHHHHhhhhccCCCceeeccccccccc
Q psy11585 74 KFQCPDCHTMMKNFAYLKKHVRDNHSGETFSCDECSKSFT 113 (2227)
Q Consensus 74 ~~~C~~C~~~f~~~~~L~~H~~~~~~~~~~~C~~C~~~F~ 113 (2227)
...|+.|+..|.=.+.. +-..+...+|+.|+..|.
T Consensus 2 ~i~Cp~C~~~y~i~d~~-----ip~~g~~v~C~~C~~~f~ 36 (36)
T PF13717_consen 2 IITCPNCQAKYEIDDEK-----IPPKGRKVRCSKCGHVFF 36 (36)
T ss_pred EEECCCCCCEEeCCHHH-----CCCCCcEEECCCCCCEeC
Confidence 46789999888765544 223445688999988773
No 98
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=30.78 E-value=25 Score=29.80 Aligned_cols=11 Identities=27% Similarity=1.186 Sum_probs=5.4
Q ss_pred eecCcCcCcCC
Q psy11585 2032 FVCSFCGNTYK 2042 (2227)
Q Consensus 2032 ~~C~~C~~~f~ 2042 (2227)
|.|+.||..|.
T Consensus 6 y~C~~Cg~~fe 16 (52)
T TIGR02605 6 YRCTACGHRFE 16 (52)
T ss_pred EEeCCCCCEeE
Confidence 44555555444
No 99
>PF15269 zf-C2H2_7: Zinc-finger
Probab=30.64 E-value=14 Score=28.99 Aligned_cols=22 Identities=23% Similarity=0.525 Sum_probs=20.7
Q ss_pred ccchhhhhhhhhcccccccchh
Q psy11585 2173 HHSCQKCEESFDNCNNLWSHMF 2194 (2227)
Q Consensus 2173 ~~~c~~c~~~~~~~~~~~~h~~ 2194 (2227)
+|+|=+|-.+.+.|++|-+||.
T Consensus 20 ~ykcfqcpftc~~kshl~nhmk 41 (54)
T PF15269_consen 20 KYKCFQCPFTCNEKSHLFNHMK 41 (54)
T ss_pred cceeecCCcccchHHHHHHHHH
Confidence 6899999999999999999995
No 100
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=30.59 E-value=35 Score=36.82 Aligned_cols=32 Identities=19% Similarity=0.553 Sum_probs=17.9
Q ss_pred CCCcccCCCcCcccccchhhhhhcccccCCceecCcCcCc
Q psy11585 2001 GEKKYVCETCGASFVHWGSLNIHNYSHINAQFVCSFCGNT 2040 (2227)
Q Consensus 2001 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~ 2040 (2227)
+..-|.|+.|+.+|+.-..+. ..|.||.||..
T Consensus 106 ~~~~Y~Cp~c~~r~tf~eA~~--------~~F~Cp~Cg~~ 137 (158)
T TIGR00373 106 NNMFFICPNMCVRFTFNEAME--------LNFTCPRCGAM 137 (158)
T ss_pred CCCeEECCCCCcEeeHHHHHH--------cCCcCCCCCCE
Confidence 444566666666666555543 14666666643
No 101
>KOG2186|consensus
Probab=29.93 E-value=25 Score=39.48 Aligned_cols=46 Identities=30% Similarity=0.606 Sum_probs=25.0
Q ss_pred ccCCCcCcccccchhhhhhcccccCCceecCcCcCcCCCchHHHhhhc
Q psy11585 2005 YVCETCGASFVHWGSLNIHNYSHINAQFVCSFCGNTYKNPKSLDSHIR 2052 (2227)
Q Consensus 2005 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~ 2052 (2227)
|.|.+||-... +..|.+|+..-.|.-|.|--|++.|.. .+.+.|..
T Consensus 4 FtCnvCgEsvK-Kp~vekH~srCrn~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVK-KPQVEKHMSRCRNAYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhcc-ccchHHHHHhccCCeeEEeeccccccc-chhhhhhh
Confidence 44555555442 334555555555555666666666665 55666654
No 102
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=29.78 E-value=16 Score=36.27 Aligned_cols=36 Identities=17% Similarity=0.579 Sum_probs=21.6
Q ss_pred ccccCCCCCcccChhhHHHHHhhccCC----CcccccccccccC
Q psy11585 243 KFKCPECPRSYGNFSELKKHLAVHTGE----KHFVCSVCQRGFF 282 (2227)
Q Consensus 243 ~~~C~~C~~~f~~~~~L~~H~~~h~~~----~~~~C~~C~~~f~ 282 (2227)
...|+.|+..-.....++ +...+ .-|.|..|+..+.
T Consensus 62 ~~~Cp~Cg~~~a~f~~~Q----~RsadE~~T~fy~C~~C~~~w~ 101 (104)
T TIGR01384 62 RVECPKCGHKEAYYWLLQ----TRRADEPETRFYKCTKCGYVWR 101 (104)
T ss_pred cCCCCCCCCCeeEEEEec----cCCCCCCcEEEEEeCCCCCeeE
Confidence 578999986544333332 22222 2389999998764
No 103
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=29.78 E-value=32 Score=26.85 Aligned_cols=35 Identities=17% Similarity=0.574 Sum_probs=23.7
Q ss_pred ccccCCCCCcccChhhHHHHHhhccCCCcccccccccccC
Q psy11585 243 KFKCPECPRSYGNFSELKKHLAVHTGEKHFVCSVCQRGFF 282 (2227)
Q Consensus 243 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~ 282 (2227)
.+.|+.|+..|.-..+.. ..+.....|+.|+..|.
T Consensus 2 ~~~CP~C~~~~~v~~~~~-----~~~~~~v~C~~C~~~~~ 36 (38)
T TIGR02098 2 RIQCPNCKTSFRVVDSQL-----GANGGKVRCGKCGHVWY 36 (38)
T ss_pred EEECCCCCCEEEeCHHHc-----CCCCCEEECCCCCCEEE
Confidence 367999999887665441 11222588999998774
No 104
>KOG2807|consensus
Probab=28.75 E-value=69 Score=37.49 Aligned_cols=33 Identities=21% Similarity=0.557 Sum_probs=20.4
Q ss_pred cccCCCCCccccChhhhhhhhcccccCCcccCCCCCCcccChhHHHH
Q psy11585 653 PYTCDVCGKSFVAKKHLNRHYNCSHAGFGYQCNICGRVMSDSTNFKD 699 (2227)
Q Consensus 653 ~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~l~~ 699 (2227)
-|.|+.|.. .+-.-|-.|++|+-+......|.+
T Consensus 276 Gy~CP~Cka--------------kvCsLP~eCpiC~ltLVss~hLAR 308 (378)
T KOG2807|consen 276 GYFCPQCKA--------------KVCSLPIECPICSLTLVSSPHLAR 308 (378)
T ss_pred ceeCCcccC--------------eeecCCccCCccceeEecchHHHH
Confidence 467777765 333446667777777666666554
No 105
>PF01096 TFIIS_C: Transcription factor S-II (TFIIS); InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre. TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site []. Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=27.62 E-value=20 Score=28.22 Aligned_cols=35 Identities=17% Similarity=0.512 Sum_probs=22.9
Q ss_pred cchhhhhhhhhcccccccchhcccCCCc----ccCCCCCCCcc
Q psy11585 2174 HSCQKCEESFDNCNNLWSHMFIKHENRD----FVCNLCPPDSK 2212 (2227)
Q Consensus 2174 ~~c~~c~~~~~~~~~~~~h~~~~~~~~~----~~~~~~~~~~~ 2212 (2227)
+.|..|| .+.....-.++..+|-| |+|..||..++
T Consensus 1 ~~Cp~Cg----~~~a~~~~~Q~rsaDE~~T~fy~C~~C~~~wr 39 (39)
T PF01096_consen 1 IKCPKCG----HNEAVFFQIQTRSADEPMTLFYVCCNCGHRWR 39 (39)
T ss_dssp S--SSS-----SSEEEEEEESSSSSSSSSEEEEEESSSTEEEE
T ss_pred CCCcCCC----CCeEEEEEeeccCCCCCCeEEEEeCCCCCeeC
Confidence 4678888 55566666667777766 99999987653
No 106
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=27.47 E-value=43 Score=27.69 Aligned_cols=12 Identities=33% Similarity=0.924 Sum_probs=7.6
Q ss_pred ccCCCcCccccc
Q psy11585 2005 YVCETCGASFVH 2016 (2227)
Q Consensus 2005 ~~C~~C~~~f~~ 2016 (2227)
|+|.+|++.|--
T Consensus 2 y~C~~CgyvYd~ 13 (47)
T PF00301_consen 2 YQCPVCGYVYDP 13 (47)
T ss_dssp EEETTTSBEEET
T ss_pred cCCCCCCEEEcC
Confidence 566677766644
No 107
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=27.36 E-value=52 Score=35.12 Aligned_cols=14 Identities=21% Similarity=0.757 Sum_probs=7.0
Q ss_pred CcccCCCcCccccc
Q psy11585 2003 KKYVCETCGASFVH 2016 (2227)
Q Consensus 2003 ~~~~C~~C~~~f~~ 2016 (2227)
.-|.|+.|+..|+.
T Consensus 98 ~~Y~Cp~C~~~y~~ 111 (147)
T smart00531 98 AYYKCPNCQSKYTF 111 (147)
T ss_pred cEEECcCCCCEeeH
Confidence 34555555555543
No 108
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=27.34 E-value=20 Score=32.50 Aligned_cols=41 Identities=29% Similarity=0.569 Sum_probs=24.8
Q ss_pred cccccCcCccCCcchHHHHHhhhcCCCccccc--cccccccChhh
Q psy11585 1638 YECDTCGHGLSSKKSLDDHYRIHTGEKKYVCQ--QCGASFTQWAS 1680 (2227)
Q Consensus 1638 ~~C~~C~~~f~~~~~L~~H~r~H~~ek~~~C~--~C~k~F~~~~~ 1680 (2227)
+.|+.||.......+-..... +.+..+.|. .||.+|.+..+
T Consensus 2 m~CP~Cg~~a~irtSr~~s~~--~~~~Y~qC~N~eCg~tF~t~es 44 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITDT--TKERYHQCQNVNCSATFITYES 44 (72)
T ss_pred ccCCCCCCccEEEEChhcChh--hheeeeecCCCCCCCEEEEEEE
Confidence 457777766543333222221 456778887 89988877544
No 109
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=27.09 E-value=37 Score=36.25 Aligned_cols=43 Identities=23% Similarity=0.492 Sum_probs=21.4
Q ss_pred ccCCCCCCcccCccccccccccccCccceecCCCCccccchhH
Q psy11585 549 YKCPLCDRIYTSFSETKRHFEVHSGERKYTCSICSKCFFIKNR 591 (2227)
Q Consensus 549 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~k~f~~~~~ 591 (2227)
++||+|+-.++....-..--..-.-.+.++|+.||++|.+-..
T Consensus 1 m~cp~c~~~~~~~~~s~~~~~~~~~~~~~~c~~c~~~f~~~e~ 43 (154)
T PRK00464 1 MRCPFCGHPDTRVIDSRPAEDGNAIRRRRECLACGKRFTTFER 43 (154)
T ss_pred CcCCCCCCCCCEeEeccccCCCCceeeeeeccccCCcceEeEe
Confidence 4688888655321111000000001123788899988876443
No 110
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=26.74 E-value=37 Score=31.57 Aligned_cols=33 Identities=21% Similarity=0.589 Sum_probs=20.5
Q ss_pred ccCCCccCCccCCCCCCCccccccccCCCCCccccCcccccccChh
Q psy11585 2060 KSICDDCTKAMSTPAPSSKSVCIEHSNLIPKCHSCQKCEESFDNCN 2105 (2227)
Q Consensus 2060 ~~~C~~C~~~f~~~~~~~~h~~~~h~~~~~~~~~C~~C~~~f~~~~ 2105 (2227)
+|.||+|++. ... -....+.+|.+|+..|+--.
T Consensus 35 ~~~Cp~C~~~-~Vk------------R~a~GIW~C~kCg~~fAGga 67 (89)
T COG1997 35 KHVCPFCGRT-TVK------------RIATGIWKCRKCGAKFAGGA 67 (89)
T ss_pred CCcCCCCCCc-cee------------eeccCeEEcCCCCCeecccc
Confidence 6778888775 211 12245677888888887543
No 111
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=26.10 E-value=38 Score=28.42 Aligned_cols=25 Identities=28% Similarity=0.760 Sum_probs=16.2
Q ss_pred eecCcCcCcCCCc-----hHHHhhhcccCC
Q psy11585 2032 FVCSFCGNTYKNP-----KSLDSHIRNSHT 2056 (2227)
Q Consensus 2032 ~~C~~C~~~f~~~-----~~l~~H~~~~H~ 2056 (2227)
-.|.+|++.+... +.|.+|++..|+
T Consensus 19 a~C~~C~~~l~~~~~~gTs~L~rHl~~~h~ 48 (50)
T smart00614 19 AKCKYCGKKLSRSSKGGTSNLRRHLRRKHP 48 (50)
T ss_pred EEecCCCCEeeeCCCCCcHHHHHHHHhHCc
Confidence 3566666665444 588888886554
No 112
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=25.75 E-value=34 Score=27.50 Aligned_cols=11 Identities=55% Similarity=1.313 Sum_probs=5.0
Q ss_pred ccCCCcCcccc
Q psy11585 2005 YVCETCGASFV 2015 (2227)
Q Consensus 2005 ~~C~~C~~~f~ 2015 (2227)
|.|..||..|.
T Consensus 6 y~C~~Cg~~fe 16 (42)
T PF09723_consen 6 YRCEECGHEFE 16 (42)
T ss_pred EEeCCCCCEEE
Confidence 44444444443
No 113
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=25.69 E-value=32 Score=31.26 Aligned_cols=46 Identities=11% Similarity=0.267 Sum_probs=31.6
Q ss_pred cchhhhhhhhhcccccccchhcccCCCcccCC--CCCCCcchhheeeeee
Q psy11585 2174 HSCQKCEESFDNCNNLWSHMFIKHENRDFVCN--LCPPDSKIMIKYVHFV 2221 (2227)
Q Consensus 2174 ~~c~~c~~~~~~~~~~~~h~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~ 2221 (2227)
+.|..||...-.+..-+..-. .-++=|.|. +||.+|+|+..++|++
T Consensus 2 m~CP~Cg~~a~irtSr~~s~~--~~~~Y~qC~N~eCg~tF~t~es~s~ti 49 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITDT--TKERYHQCQNVNCSATFITYESVQRYI 49 (72)
T ss_pred ccCCCCCCccEEEEChhcChh--hheeeeecCCCCCCCEEEEEEEEEEEE
Confidence 468888865533333333322 444669999 9999999999998876
No 114
>COG1594 RPB9 DNA-directed RNA polymerase, subunit M/Transcription elongation factor TFIIS [Transcription]
Probab=24.92 E-value=20 Score=36.11 Aligned_cols=106 Identities=17% Similarity=0.334 Sum_probs=0.0
Q ss_pred CCcccccccCHHHHHhhhhccCCCceeecccccccccch-hHHHHHHhhhccccccCchhhhcccccceeeecccccccC
Q psy11585 77 CPDCHTMMKNFAYLKKHVRDNHSGETFSCDECSKSFTTK-KCLREHYKKLHTIRIRSSREENDMKKKTMVYVEGVVKYKC 155 (2227)
Q Consensus 77 C~~C~~~f~~~~~L~~H~~~~~~~~~~~C~~C~~~F~~~-~~l~~H~~~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~C 155 (2227)
|+.|+.+.-. +....+..+.|..|++.+... ..+..+................. ...............|
T Consensus 5 Cp~Cgsll~p--------~~~~~~~~l~C~kCgye~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~C 75 (113)
T COG1594 5 CPKCGSLLYP--------KKDDEGGKLVCRKCGYEEEASNKKVYRYSVKEAVEKKKEVVLVVE-DETQGAKTLPTAKEKC 75 (113)
T ss_pred cCCccCeeEE--------eEcCCCcEEECCCCCcchhccccceeEEEEeeccCCcceeeeeec-ccccCccccccccccC
Q ss_pred CcccccccchhhhhhhhhhcccccCCCcCcccccccc
Q psy11585 156 PECGFMVKRFQGLREHIVSVHAQVKDHVCIVCGAAFG 192 (2227)
Q Consensus 156 ~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~ 192 (2227)
+.||..=.....|+.=-. --+...=|+|..||.+++
T Consensus 76 pkCg~~ea~y~~~QtRsa-DEp~T~Fy~C~~Cg~~wr 111 (113)
T COG1594 76 PKCGNKEAYYWQLQTRSA-DEPETRFYKCTRCGYRWR 111 (113)
T ss_pred CCCCCceeEEEeeehhcc-CCCceEEEEecccCCEee
No 115
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=24.47 E-value=43 Score=36.05 Aligned_cols=23 Identities=35% Similarity=0.881 Sum_probs=17.4
Q ss_pred ceecccchhhcCCchhHhhhccccccCCCcccCCccC
Q psy11585 1484 IYECDICKKQVTNRKNMIDHQRSVHELLKPYECDTCG 1520 (2227)
Q Consensus 1484 ~~~C~~C~~~f~~~~~L~~H~~~~h~~~k~~~C~~C~ 1520 (2227)
.|.|.+||. ++.|+.|-+|++||
T Consensus 134 ~~vC~vCGy--------------~~~ge~P~~CPiCg 156 (166)
T COG1592 134 VWVCPVCGY--------------THEGEAPEVCPICG 156 (166)
T ss_pred EEEcCCCCC--------------cccCCCCCcCCCCC
Confidence 688888876 56667777888887
No 116
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=24.06 E-value=49 Score=36.50 Aligned_cols=31 Identities=23% Similarity=0.615 Sum_probs=16.3
Q ss_pred CCcccCCCcCcccccchhhhhhcccccCCceecCcCcCc
Q psy11585 2002 EKKYVCETCGASFVHWGSLNIHNYSHINAQFVCSFCGNT 2040 (2227)
Q Consensus 2002 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~ 2040 (2227)
..-|.|+.|+.+|+....+. ..|.||.||..
T Consensus 115 ~~~Y~Cp~C~~rytf~eA~~--------~~F~Cp~Cg~~ 145 (178)
T PRK06266 115 NMFFFCPNCHIRFTFDEAME--------YGFRCPQCGEM 145 (178)
T ss_pred CCEEECCCCCcEEeHHHHhh--------cCCcCCCCCCC
Confidence 34466666666665554432 24555555543
No 117
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=23.77 E-value=51 Score=33.29 Aligned_cols=35 Identities=11% Similarity=0.263 Sum_probs=24.2
Q ss_pred ccCCCCCccccCChHHHHHHHHhccCCCccccccCccccCCchhhh
Q psy11585 1919 RFKCPDCPTILQTFRGLKAHLDIHSGEKDYACHICNKVFVRHSTLE 1964 (2227)
Q Consensus 1919 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~ 1964 (2227)
...|+.||+.|--.. ..|.+||.||..|.-...++
T Consensus 9 Kr~Cp~cg~kFYDLn-----------k~p~vcP~cg~~~~~~~~~~ 43 (129)
T TIGR02300 9 KRICPNTGSKFYDLN-----------RRPAVSPYTGEQFPPEEALK 43 (129)
T ss_pred cccCCCcCccccccC-----------CCCccCCCcCCccCcchhhc
Confidence 357888888885432 35778888888887654443
No 118
>KOG4124|consensus
Probab=23.30 E-value=40 Score=39.52 Aligned_cols=24 Identities=29% Similarity=0.682 Sum_probs=12.8
Q ss_pred CceecCc--CcCcCCCchHHHhhhcc
Q psy11585 2030 AQFVCSF--CGNTYKNPKSLDSHIRN 2053 (2227)
Q Consensus 2030 ~~~~C~~--C~~~f~~~~~l~~H~~~ 2053 (2227)
++|+|++ |++.+++...|+.|-..
T Consensus 348 ~~~~~~vp~~~~~~~n~ng~~~~~~~ 373 (442)
T KOG4124|consen 348 KPYKCPVPNCDKAYKNQNGLKYHKLH 373 (442)
T ss_pred CCCCCCCCcchhhcccCcceeecccc
Confidence 3445544 55666655566555443
No 119
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=23.15 E-value=56 Score=44.25 Aligned_cols=9 Identities=22% Similarity=0.807 Sum_probs=4.2
Q ss_pred ccCCCccCC
Q psy11585 2060 KSICDDCTK 2068 (2227)
Q Consensus 2060 ~~~C~~C~~ 2068 (2227)
|..||.||-
T Consensus 475 p~~Cp~Cgs 483 (730)
T COG1198 475 PQSCPECGS 483 (730)
T ss_pred CCCCCCCCC
Confidence 444444443
No 120
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=22.89 E-value=65 Score=25.13 Aligned_cols=8 Identities=38% Similarity=1.082 Sum_probs=3.1
Q ss_pred CCCCCccc
Q psy11585 656 CDVCGKSF 663 (2227)
Q Consensus 656 C~~C~~~f 663 (2227)
|+.|+..|
T Consensus 5 CP~C~~~f 12 (37)
T PF13719_consen 5 CPNCQTRF 12 (37)
T ss_pred CCCCCceE
Confidence 33343333
No 121
>KOG2186|consensus
Probab=22.40 E-value=41 Score=37.88 Aligned_cols=46 Identities=30% Similarity=0.639 Sum_probs=20.7
Q ss_pred eecccccccccCcchhhhhhcccCCCcccccchhhcccChHHHHHHHh
Q psy11585 1752 CVCSYCGNSYANPGNLRTHMVVHSNKNHICEICGKSFKKKDLLREHMI 1799 (2227)
Q Consensus 1752 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~k~f~~~~~l~~H~~ 1799 (2227)
|.|..||.+...+ .|..||-.-.+..|.|-.|++.|.. ..+..|..
T Consensus 4 FtCnvCgEsvKKp-~vekH~srCrn~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVKKP-QVEKHMSRCRNAYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhcccc-chHHHHHhccCCeeEEeeccccccc-chhhhhhh
Confidence 3444444444332 2333444433444555555555554 34444443
No 122
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=22.04 E-value=75 Score=34.36 Aligned_cols=33 Identities=18% Similarity=0.528 Sum_probs=20.6
Q ss_pred CCCCCccCCccccccCChHHHHHHHHHhcCCCCcccCCCCCc
Q psy11585 345 TGEKPYTCEACGKSFPLKRRLNAHYNKWHLGKGYRCHICGST 386 (2227)
Q Consensus 345 ~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~ 386 (2227)
.+..-|.|+.|+..|+....+. ..|.|+.||..
T Consensus 105 ~~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~ 137 (158)
T TIGR00373 105 TNNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAM 137 (158)
T ss_pred cCCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCE
Confidence 3344577777777777666664 24677777664
No 123
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=21.56 E-value=54 Score=27.56 Aligned_cols=11 Identities=36% Similarity=1.014 Sum_probs=6.9
Q ss_pred ccCCCcCcccc
Q psy11585 2005 YVCETCGASFV 2015 (2227)
Q Consensus 2005 ~~C~~C~~~f~ 2015 (2227)
|+|.+||+.|.
T Consensus 2 y~C~~CgyiYd 12 (50)
T cd00730 2 YECRICGYIYD 12 (50)
T ss_pred cCCCCCCeEEC
Confidence 56666666664
No 124
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=21.44 E-value=88 Score=25.42 Aligned_cols=28 Identities=25% Similarity=0.591 Sum_probs=16.7
Q ss_pred CccceecCCCCccccch----hHHHHHHHHhc
Q psy11585 573 GERKYTCSICSKCFFIK----NRLSEHYRRVH 600 (2227)
Q Consensus 573 ~~~~~~C~~C~k~f~~~----~~L~~H~~~~H 600 (2227)
+.....|.+|++.+... ..|..|++..|
T Consensus 13 ~~~~a~C~~C~~~~~~~~~~ts~l~~HL~~~h 44 (45)
T PF02892_consen 13 DKKKAKCKYCGKVIKYSSGGTSNLKRHLKKKH 44 (45)
T ss_dssp CSS-EEETTTTEE-----SSTHHHHHHHHHTT
T ss_pred CcCeEEeCCCCeEEeeCCCcHHHHHHhhhhhC
Confidence 34556788888777653 67888886555
No 125
>KOG2071|consensus
Probab=21.17 E-value=90 Score=40.22 Aligned_cols=26 Identities=27% Similarity=0.712 Sum_probs=20.1
Q ss_pred CcccCCccCCcccCchhhhhhhcccc
Q psy11585 1512 KPYECDTCGHGLSSKKSLDDHYRIHT 1537 (2227)
Q Consensus 1512 k~~~C~~C~k~f~~~~~L~~H~~~H~ 1537 (2227)
.|-+|..||..|........||.+|.
T Consensus 417 ~pnqC~~CG~R~~~~ee~sk~md~H~ 442 (579)
T KOG2071|consen 417 SPNQCKSCGLRFDDSEERSKHMDIHD 442 (579)
T ss_pred CcchhcccccccccchhhhhHhhhhh
Confidence 45778888888888888888877774
No 126
>PF12907 zf-met2: Zinc-binding
Probab=21.14 E-value=24 Score=27.87 Aligned_cols=32 Identities=28% Similarity=0.624 Sum_probs=20.9
Q ss_pred cccccCccccCC---chhhhhchhcccCCCCcccC
Q psy11585 1948 YACHICNKVFVR---HSTLENHMKAVHEKIRDFQC 1979 (2227)
Q Consensus 1948 ~~C~~C~~~f~~---~~~L~~H~~~~H~~~~~~~C 1979 (2227)
++|.+|-..|.. ...|+.|....|.+..+-.|
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~~~~~C 36 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKNTFEEC 36 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCCCHHHc
Confidence 567788766553 35688888877777544333
No 127
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=20.81 E-value=72 Score=22.70 Aligned_cols=19 Identities=37% Similarity=0.702 Sum_probs=14.4
Q ss_pred ccccccccccCChHHHHHHH
Q psy11585 272 FVCSVCQRGFFMKNRLNEHY 291 (2227)
Q Consensus 272 ~~C~~C~~~f~~~~~l~~H~ 291 (2227)
..|++|++.+ ....++.|+
T Consensus 2 v~CPiC~~~v-~~~~in~HL 20 (26)
T smart00734 2 VQCPVCFREV-PENLINSHL 20 (26)
T ss_pred CcCCCCcCcc-cHHHHHHHH
Confidence 3688888887 667788886
No 128
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=20.21 E-value=76 Score=36.57 Aligned_cols=24 Identities=29% Similarity=0.635 Sum_probs=12.8
Q ss_pred ccccccCccccCCHHHHHHHHHHh
Q psy11585 737 MFQCSFCEKKYMSPKTLKEHEQTH 760 (2227)
Q Consensus 737 ~~~C~~C~~~f~~~~~L~~H~~~h 760 (2227)
.|+|+.|...|-..-..-.|...|
T Consensus 388 rY~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 388 RYQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred ceechhhhhhhhhhhHHHHHHHHh
Confidence 455555555555555555554444
Done!