Query psy4681
Match_columns 1930
No_of_seqs 893 out of 7090
Neff 10.1
Searched_HMMs 46136
Date Fri Aug 16 23:06:38 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4681.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4681hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1074|consensus 99.9 3.5E-29 7.6E-34 293.8 2.8 254 124-383 605-933 (958)
2 KOG1074|consensus 99.9 1.6E-27 3.4E-32 280.0 7.1 87 328-417 604-693 (958)
3 KOG2462|consensus 99.9 4.5E-25 9.8E-30 229.7 5.8 133 1212-1346 131-265 (279)
4 KOG2462|consensus 99.9 8.7E-25 1.9E-29 227.6 5.4 135 1181-1318 128-265 (279)
5 KOG3608|consensus 99.9 5E-23 1.1E-27 217.2 9.8 253 1650-1921 69-379 (467)
6 KOG3608|consensus 99.9 2.8E-22 6.1E-27 211.6 13.2 196 1651-1851 178-380 (467)
7 KOG3623|consensus 99.8 9.6E-21 2.1E-25 218.2 8.0 80 1619-1699 891-970 (1007)
8 KOG3623|consensus 99.8 2E-20 4.4E-25 215.5 8.3 79 1266-1344 892-970 (1007)
9 KOG3576|consensus 99.6 2.1E-16 4.6E-21 154.2 2.4 114 1240-1353 117-241 (267)
10 KOG3576|consensus 99.5 3.1E-15 6.6E-20 146.2 2.9 115 1209-1324 115-240 (267)
11 PLN03086 PRLI-interacting fact 99.0 1.1E-09 2.3E-14 132.1 9.8 145 1185-1349 409-565 (567)
12 PLN03086 PRLI-interacting fact 98.9 3.1E-09 6.8E-14 128.2 8.1 144 1155-1318 407-562 (567)
13 KOG1146|consensus 98.7 4.8E-09 1E-13 133.1 3.9 82 1372-1453 765-851 (1406)
14 PHA00733 hypothetical protein 98.6 3.4E-08 7.3E-13 97.0 4.5 81 689-772 37-125 (128)
15 PHA00733 hypothetical protein 98.6 4.4E-08 9.6E-13 96.1 4.3 54 1294-1349 71-124 (128)
16 PHA02768 hypothetical protein; 98.5 4.7E-08 1E-12 76.6 1.8 44 472-517 5-48 (55)
17 PHA02768 hypothetical protein; 98.5 5E-08 1.1E-12 76.4 1.9 40 1269-1310 6-45 (55)
18 KOG3993|consensus 98.4 7.3E-08 1.6E-12 106.8 -0.1 198 1709-1921 267-485 (500)
19 KOG3993|consensus 98.3 1.6E-07 3.4E-12 104.3 1.0 54 1092-1145 268-321 (500)
20 PF13465 zf-H2C2_2: Zinc-finge 98.2 6.6E-07 1.4E-11 60.0 2.2 24 1256-1279 2-25 (26)
21 PF13465 zf-H2C2_2: Zinc-finge 98.2 1E-06 2.2E-11 59.1 2.5 26 1283-1308 1-26 (26)
22 KOG1146|consensus 98.2 2.4E-06 5.2E-11 109.4 7.9 180 810-999 438-641 (1406)
23 PHA00616 hypothetical protein 97.9 4.5E-06 9.8E-11 62.2 1.4 34 1268-1301 1-34 (44)
24 PHA00616 hypothetical protein 97.9 4.2E-06 9.2E-11 62.3 0.7 35 472-506 1-35 (44)
25 PHA00732 hypothetical protein 97.6 4.8E-05 1E-09 67.0 2.7 47 472-524 1-48 (79)
26 PHA00732 hypothetical protein 97.4 9E-05 2E-09 65.4 2.9 44 1269-1318 2-46 (79)
27 PF05605 zf-Di19: Drought indu 97.4 0.00016 3.6E-09 59.2 4.3 52 1794-1848 2-54 (54)
28 PF05605 zf-Di19: Drought indu 97.3 0.00018 3.9E-09 59.0 3.0 37 1680-1717 3-39 (54)
29 PF00096 zf-C2H2: Zinc finger, 97.0 0.00049 1.1E-08 45.0 2.1 22 1269-1290 1-22 (23)
30 PF00096 zf-C2H2: Zinc finger, 96.8 0.00056 1.2E-08 44.8 1.5 23 15-37 1-23 (23)
31 PF12756 zf-C2H2_2: C2H2 type 96.2 0.0027 5.9E-08 61.2 2.4 22 1269-1290 51-72 (100)
32 PF13894 zf-C2H2_4: C2H2-type 96.2 0.0039 8.4E-08 41.3 2.3 23 1269-1291 1-23 (24)
33 PF13894 zf-C2H2_4: C2H2-type 95.9 0.0048 1E-07 40.9 1.9 24 1824-1847 1-24 (24)
34 PF13912 zf-C2H2_6: C2H2-type 95.9 0.0046 1E-07 42.4 1.7 25 472-496 1-25 (27)
35 PF13912 zf-C2H2_6: C2H2-type 95.9 0.0049 1.1E-07 42.2 1.7 25 1091-1115 1-25 (27)
36 COG5189 SFP1 Putative transcri 95.8 0.0054 1.2E-07 66.2 2.4 25 572-596 394-418 (423)
37 KOG2785|consensus 95.7 0.012 2.6E-07 66.6 4.8 80 1375-1454 2-94 (390)
38 COG5189 SFP1 Putative transcri 95.5 0.0055 1.2E-07 66.2 1.2 24 1863-1886 395-418 (423)
39 PF12756 zf-C2H2_2: C2H2 type 95.4 0.0057 1.2E-07 58.9 1.1 76 1378-1454 1-76 (100)
40 KOG2231|consensus 95.0 0.03 6.5E-07 69.7 5.5 124 1595-1744 100-244 (669)
41 PF13909 zf-H2C2_5: C2H2-type 94.8 0.018 3.9E-07 38.1 1.6 23 1867-1890 1-23 (24)
42 KOG2231|consensus 94.7 0.054 1.2E-06 67.5 6.9 70 155-236 184-262 (669)
43 PF09237 GAGA: GAGA factor; I 94.7 0.029 6.2E-07 43.1 2.7 28 1240-1267 24-51 (54)
44 PF13909 zf-H2C2_5: C2H2-type 94.5 0.023 5.1E-07 37.5 1.7 24 1824-1848 1-24 (24)
45 PF09237 GAGA: GAGA factor; I 94.4 0.03 6.5E-07 43.0 2.2 32 715-746 21-52 (54)
46 smart00355 ZnF_C2H2 zinc finge 94.2 0.035 7.7E-07 37.5 2.1 22 1270-1291 2-23 (26)
47 smart00355 ZnF_C2H2 zinc finge 93.9 0.042 9.1E-07 37.1 2.0 24 15-38 1-24 (26)
48 PRK04860 hypothetical protein; 93.7 0.04 8.6E-07 56.6 2.4 36 1240-1279 119-154 (160)
49 PRK04860 hypothetical protein; 93.6 0.039 8.4E-07 56.7 2.0 38 472-513 119-156 (160)
50 COG5236 Uncharacterized conser 93.5 0.087 1.9E-06 57.9 4.7 33 1565-1600 151-185 (493)
51 PF12874 zf-met: Zinc-finger o 93.5 0.04 8.6E-07 36.9 1.3 23 15-37 1-23 (25)
52 COG5236 Uncharacterized conser 93.4 0.13 2.7E-06 56.7 5.6 139 124-322 151-308 (493)
53 PF12874 zf-met: Zinc-finger o 92.9 0.058 1.3E-06 36.1 1.4 20 1270-1289 2-21 (25)
54 KOG2482|consensus 92.7 0.087 1.9E-06 58.1 3.3 73 1378-1452 146-219 (423)
55 KOG2785|consensus 91.4 0.22 4.9E-06 56.7 4.6 58 1866-1923 166-250 (390)
56 KOG2482|consensus 91.4 0.33 7.2E-06 53.8 5.7 212 1224-1452 128-358 (423)
57 PF12171 zf-C2H2_jaz: Zinc-fin 90.4 0.12 2.6E-06 35.3 0.8 22 186-207 2-23 (27)
58 PF12171 zf-C2H2_jaz: Zinc-fin 89.0 0.25 5.4E-06 33.8 1.5 22 358-379 2-23 (27)
59 COG5048 FOG: Zn-finger [Genera 87.9 0.24 5.2E-06 63.7 1.7 145 1183-1328 289-450 (467)
60 COG5048 FOG: Zn-finger [Genera 86.7 0.27 5.9E-06 63.2 1.2 55 717-771 32-88 (467)
61 KOG4173|consensus 86.0 0.39 8.6E-06 49.1 1.6 82 1649-1734 78-171 (253)
62 PF13913 zf-C2HC_2: zinc-finge 81.0 1.1 2.4E-05 29.9 1.6 20 15-35 3-22 (25)
63 KOG2893|consensus 80.7 0.68 1.5E-05 48.3 0.9 41 156-204 13-53 (341)
64 TIGR00622 ssl1 transcription f 80.5 2.8 6E-05 39.8 4.7 78 124-209 15-105 (112)
65 PF13913 zf-C2HC_2: zinc-finge 78.9 1.7 3.7E-05 29.0 2.0 19 1242-1261 4-22 (25)
66 TIGR00622 ssl1 transcription f 78.0 2.3 5.1E-05 40.3 3.4 20 1118-1137 14-33 (112)
67 KOG4173|consensus 76.3 1.2 2.6E-05 45.7 1.1 78 1678-1760 78-168 (253)
68 smart00451 ZnF_U1 U1-like zinc 75.5 1.8 3.8E-05 31.8 1.6 22 14-35 3-24 (35)
69 smart00451 ZnF_U1 U1-like zinc 74.9 2 4.3E-05 31.5 1.7 21 1269-1289 4-24 (35)
70 KOG2893|consensus 73.7 0.92 2E-05 47.4 -0.5 46 1682-1733 13-58 (341)
71 cd00350 rubredoxin_like Rubred 71.4 2.8 6.1E-05 30.3 1.7 24 473-508 2-25 (33)
72 cd00350 rubredoxin_like Rubred 70.4 2.4 5.3E-05 30.6 1.2 8 1296-1303 17-24 (33)
73 PF12013 DUF3505: Protein of u 67.7 5.3 0.00011 38.8 3.3 26 1866-1891 80-109 (109)
74 PF09986 DUF2225: Uncharacteri 62.7 2.1 4.5E-05 47.3 -0.5 55 1821-1879 3-61 (214)
75 PF12013 DUF3505: Protein of u 61.1 19 0.00041 35.0 5.8 26 575-600 79-108 (109)
76 KOG2186|consensus 56.8 5.3 0.00011 43.2 1.2 45 1066-1112 4-49 (276)
77 PF13717 zinc_ribbon_4: zinc-r 54.0 6.8 0.00015 29.0 1.0 32 955-986 4-35 (36)
78 COG4049 Uncharacterized protei 53.5 7.9 0.00017 30.7 1.3 28 1115-1142 13-40 (65)
79 COG4049 Uncharacterized protei 51.8 7.1 0.00015 30.9 0.9 32 1320-1351 13-44 (65)
80 cd00729 rubredoxin_SM Rubredox 49.9 11 0.00025 27.4 1.6 25 472-508 2-26 (34)
81 PF13719 zinc_ribbon_5: zinc-r 48.2 9.1 0.0002 28.5 1.0 32 955-986 4-35 (37)
82 COG2888 Predicted Zn-ribbon RN 47.9 12 0.00026 30.6 1.6 32 953-984 27-58 (61)
83 TIGR02098 MJ0042_CXXC MJ0042 f 47.0 8.4 0.00018 28.9 0.7 32 955-986 4-35 (38)
84 COG2888 Predicted Zn-ribbon RN 44.0 15 0.00032 30.1 1.6 33 1794-1832 27-59 (61)
85 PHA00626 hypothetical protein 42.3 9.5 0.00021 30.6 0.3 12 1012-1023 24-35 (59)
86 KOG2186|consensus 40.1 13 0.00028 40.4 1.0 45 447-493 4-49 (276)
87 PF09538 FYDLN_acid: Protein o 37.8 18 0.00039 34.7 1.4 31 185-226 9-39 (108)
88 PF09538 FYDLN_acid: Protein o 37.5 26 0.00056 33.6 2.4 33 1153-1196 7-39 (108)
89 PF09986 DUF2225: Uncharacteri 37.2 10 0.00023 41.9 -0.2 13 214-226 49-61 (214)
90 PF02892 zf-BED: BED zinc fing 35.8 22 0.00048 27.8 1.5 26 1822-1847 15-44 (45)
91 COG1592 Rubrerythrin [Energy p 34.8 25 0.00054 36.5 2.0 24 472-508 134-157 (166)
92 smart00734 ZnF_Rad18 Rad18-lik 33.5 29 0.00063 23.6 1.5 20 15-35 2-21 (26)
93 PF06524 NOA36: NOA36 protein; 32.1 24 0.00052 38.3 1.5 44 1643-1687 135-179 (314)
94 smart00659 RPOLCX RNA polymera 31.6 35 0.00076 26.7 1.9 26 473-509 3-28 (44)
95 PRK14890 putative Zn-ribbon RN 30.3 29 0.00063 28.7 1.3 33 952-984 24-56 (59)
96 PRK00464 nrdR transcriptional 30.3 26 0.00055 36.2 1.3 14 358-371 29-42 (154)
97 PF15269 zf-C2H2_7: Zinc-finge 30.1 31 0.00067 26.1 1.3 22 1867-1888 21-42 (54)
98 COG5188 PRP9 Splicing factor 3 30.1 63 0.0014 36.8 4.2 22 42-63 238-259 (470)
99 PF03604 DNA_RNApol_7kD: DNA d 29.7 29 0.00063 24.9 1.1 10 358-367 1-10 (32)
100 COG1997 RPL43A Ribosomal prote 29.7 26 0.00056 31.4 1.0 13 77-89 53-65 (89)
101 PRK00398 rpoP DNA-directed RNA 29.3 24 0.00053 27.9 0.7 6 1326-1331 23-28 (46)
102 smart00531 TFIIE Transcription 28.9 32 0.00069 35.6 1.7 33 952-984 98-131 (147)
103 smart00834 CxxC_CXXC_SSSS Puta 28.4 33 0.00072 26.1 1.3 12 330-341 6-17 (41)
104 COG1592 Rubrerythrin [Energy p 28.4 30 0.00066 35.9 1.4 12 468-479 145-156 (166)
105 smart00531 TFIIE Transcription 28.2 59 0.0013 33.6 3.5 40 466-510 93-133 (147)
106 PF12907 zf-met2: Zinc-binding 28.0 32 0.00069 26.2 1.1 27 1895-1921 2-31 (40)
107 TIGR00373 conserved hypothetic 27.8 52 0.0011 34.4 3.1 19 1266-1284 107-125 (158)
108 PF12907 zf-met2: Zinc-binding 27.7 37 0.00079 25.8 1.3 27 1710-1736 2-31 (40)
109 PRK09678 DNA-binding transcrip 26.6 24 0.00051 30.9 0.2 44 1184-1229 2-47 (72)
110 TIGR00373 conserved hypothetic 26.4 58 0.0012 34.1 3.1 37 1084-1129 102-138 (158)
111 smart00614 ZnF_BED BED zinc fi 26.2 45 0.00097 26.9 1.7 21 473-493 19-44 (50)
112 KOG2071|consensus 26.1 51 0.0011 41.2 2.9 27 470-496 416-442 (579)
113 PF04959 ARS2: Arsenite-resist 25.7 33 0.00072 37.5 1.2 30 1821-1850 75-104 (214)
114 PRK09678 DNA-binding transcrip 24.6 33 0.00072 30.0 0.8 39 331-371 3-43 (72)
115 PRK06266 transcription initiat 24.4 57 0.0012 34.9 2.6 18 1267-1284 116-133 (178)
116 COG1996 RPC10 DNA-directed RNA 24.2 45 0.00097 26.6 1.3 29 471-509 5-33 (49)
117 TIGR02605 CxxC_CxxC_SSSS putat 24.0 43 0.00094 27.2 1.3 12 330-341 6-17 (52)
118 COG5112 UFD2 U1-like Zn-finger 23.9 22 0.00047 32.6 -0.5 21 43-63 56-76 (126)
119 PRK00464 nrdR transcriptional 23.3 25 0.00054 36.2 -0.2 34 955-988 2-40 (154)
120 TIGR02300 FYDLN_acid conserved 23.2 63 0.0014 31.5 2.4 33 1153-1196 7-39 (129)
121 PF10571 UPF0547: Uncharacteri 22.8 51 0.0011 22.5 1.2 9 811-819 17-25 (26)
122 PRK06266 transcription initiat 22.6 68 0.0015 34.3 2.8 37 1085-1130 111-147 (178)
123 TIGR01384 TFS_arch transcripti 22.4 36 0.00078 32.7 0.6 10 1184-1193 17-26 (104)
124 PF09416 UPF1_Zn_bind: RNA hel 22.3 72 0.0016 32.5 2.7 30 289-318 8-43 (152)
125 KOG1280|consensus 22.2 62 0.0013 37.2 2.4 37 1679-1715 79-115 (381)
126 PF09723 Zn-ribbon_8: Zinc rib 21.9 18 0.00039 27.9 -1.2 11 977-987 6-16 (42)
127 PF02176 zf-TRAF: TRAF-type zi 21.4 87 0.0019 26.3 2.7 21 1281-1301 23-43 (60)
128 PF02176 zf-TRAF: TRAF-type zi 21.1 49 0.0011 27.8 1.1 21 677-697 23-43 (60)
129 PF05443 ROS_MUCR: ROS/MUCR tr 20.9 36 0.00078 33.9 0.3 27 11-40 69-95 (132)
130 COG3357 Predicted transcriptio 20.6 56 0.0012 29.5 1.3 14 1793-1806 57-70 (97)
No 1
>KOG1074|consensus
Probab=99.95 E-value=3.5e-29 Score=293.81 Aligned_cols=254 Identities=23% Similarity=0.433 Sum_probs=166.8
Q ss_pred CCCCCCCCCccCcchHHHHhhhhhcCCCCCcCCccccCcccChhHHHhHHhhhccCCCCCccccCc---cchhccCCchh
Q psy4681 124 ARKCPICGDRYKSGTDMRRHYRDLHDSTRKCPCEVCGKRFNSIKRVKQHRKVVHMGIKQKKKFECA---HCSKTYLSRVG 200 (1930)
Q Consensus 124 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~k~f~~~~~l~~H~~~~h~~~~~~~~~~C~---~C~k~f~~~~~ 200 (1930)
+..|-+|.+..+-...|++||| .|++||||+|.+||+.|.++.+|+.|+-++.....-+-.|.|+ +|-+.|.....
T Consensus 605 PNqCiiC~rVlSC~saLqmHyr-tHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~ 683 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHYR-THTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVT 683 (958)
T ss_pred ccceeeeeecccchhhhhhhhh-cccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhccccccccc
Confidence 3479999999999999999999 8999999999999999999999999998444333344678999 99999999999
Q ss_pred hhhhhcccccC-Cc------------cccccccccccCHHHHHHHHhhcccccccchhhhhccCCchhHHHHHHhhcccc
Q psy4681 201 LEDHINNHTGE-KG------------HICEICNRDFYSDAMLKRHLVKHSRMIKETSEEFVETGSITREEWYKMVLQRVK 267 (1930)
Q Consensus 201 L~~H~~~H~~~-~~------------~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 267 (1930)
|..|+++|+|. .+ -+|.+|.+.|.....+..++..|............+... +.+ .......
T Consensus 684 lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~-t~~----~~~tp~~ 758 (958)
T KOG1074|consen 684 LPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTE-TEE----LDVTPPP 758 (958)
T ss_pred ccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCccccccccc-ccc----cccCCCc
Confidence 99999999942 11 469999999999999999998774322111111000000 000 0011233
Q ss_pred cCCcchhcccChHHHHHHH----------------------HhhcCCCCCc-cccccccccccHH----HHHHHHHH---
Q psy4681 268 TCPLCKKTYQSAKGMRLHI----------------------REVHSKVRPH-QCKGCGKYFKSQR----HLVQHERR--- 317 (1930)
Q Consensus 268 ~C~~C~k~f~~~~~l~~H~----------------------~~~h~~~~~~-~C~~C~~~f~~~~----~l~~H~~~--- 317 (1930)
.+..|+..+.....+..+- ...++.+++. .+..++..-...- .|..-.-.
T Consensus 759 ~e~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~ 838 (958)
T KOG1074|consen 759 PENSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLN 838 (958)
T ss_pred cccccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhcccccccc
Confidence 4566666665433222211 1123445666 5666654433211 11110000
Q ss_pred ----------hccCc-------------------ccccCCccccccccccccccccccccccccCCCCceeCCCCCcccc
Q psy4681 318 ----------VHLGV-------------------KKIKHSNFECFHCGAKFISRTHIADHMTSHTGIKNHVCSICQSTYT 368 (1930)
Q Consensus 318 ----------~h~~~-------------------~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 368 (1930)
++.+. .........|..||+.|.+.++|..||++|+|+|||.|.+|++.|.
T Consensus 839 eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFt 918 (958)
T KOG1074|consen 839 EGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFT 918 (958)
T ss_pred cccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhh
Confidence 00000 0001123568888888888888888888888888888888888888
Q ss_pred ChhHHhhhhhccccc
Q psy4681 369 TARGLKRHNKNHLRE 383 (1930)
Q Consensus 369 ~~~~L~~H~~~h~~~ 383 (1930)
++.+|+.||.+|++.
T Consensus 919 trgnLKvHMgtH~w~ 933 (958)
T KOG1074|consen 919 TRGNLKVHMGTHMWV 933 (958)
T ss_pred hhhhhhhhhcccccc
Confidence 888888888887655
No 2
>KOG1074|consensus
Probab=99.94 E-value=1.6e-27 Score=280.03 Aligned_cols=87 Identities=26% Similarity=0.464 Sum_probs=78.5
Q ss_pred CccccccccccccccccccccccccCCCCceeCCCCCccccChhHHhhhhhcccccccccCCCCccccC---cccccccC
Q psy4681 328 SNFECFHCGAKFISRTHIADHMTSHTGIKNHVCSICQSTYTTARGLKRHNKNHLREAGVLRADEMYKCD---KCDKLFIE 404 (1930)
Q Consensus 328 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~C~---~C~k~f~~ 404 (1930)
.+-+|-+|-++.+-+++|+.|.++|+||+||+|.+||+.|+++.+|+.||-+|.... ...-+|.|+ +|-+.|.+
T Consensus 604 dPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p---~~R~q~ScP~~~ic~~kftn 680 (958)
T KOG1074|consen 604 DPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKP---PARVQFSCPSTFICQKKFTN 680 (958)
T ss_pred CccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCc---cccccccCCchhhhcccccc
Confidence 478999999999999999999999999999999999999999999999999995442 223678999 99999999
Q ss_pred hhhHHhhhhcccC
Q psy4681 405 QSEMVQHRDWVHG 417 (1930)
Q Consensus 405 ~~~l~~H~~~~~~ 417 (1930)
...|.+|+++|.+
T Consensus 681 ~V~lpQhIriH~~ 693 (958)
T KOG1074|consen 681 AVTLPQHIRIHLG 693 (958)
T ss_pred cccccceEEeecC
Confidence 9999999999874
No 3
>KOG2462|consensus
Probab=99.91 E-value=4.5e-25 Score=229.70 Aligned_cols=133 Identities=32% Similarity=0.633 Sum_probs=76.1
Q ss_pred eeccccccccccchhhhhhhcccCC--CCCcccCCCCCcccCChhhHHhHHHHhcCCceeecccccccccchhhHHHHHH
Q psy4681 1212 FECNICLKTFNFKTSYKRHLKQHDD--SVTYYPCTVCSKNLSSPYRLKTHMLIHANNRVFTCEVCGKGFIQKRYLEEHKR 1289 (1930)
Q Consensus 1212 ~~C~~C~~~f~~~~~l~~H~~~h~~--~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~F~~~~~L~~H~~ 1289 (1930)
|+|+.||+.+.+.++|.+|.++|.. ++..+.|++|+|.+.+..+|+.|+|+|+ -|++|.+|||.|...+.|+.|+|
T Consensus 131 ~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQGHiR 208 (279)
T KOG2462|consen 131 YKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQGHIR 208 (279)
T ss_pred eeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHHhhcccc
Confidence 4444444444445555555544432 2224556666666666666666666654 34566666666666666666666
Q ss_pred HhCCCCceecccccccccChhhHHHHHhhccCCCcccCCcCccccCChhHHHHHHHh
Q psy4681 1290 VHTGYKPYACDLCSKQFTQKSTLNIHRKLHLNIKDFICDLCGAKFYEFNTYVTHVHE 1346 (1930)
Q Consensus 1290 ~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~ 1346 (1930)
+|||||||.|+.|++.|.++++|+.||++|.+.++|+|..|+++|.....|.+|...
T Consensus 209 THTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 209 THTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred cccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence 666666666666666666666666666666666666666666666666555555543
No 4
>KOG2462|consensus
Probab=99.90 E-value=8.7e-25 Score=227.58 Aligned_cols=135 Identities=31% Similarity=0.656 Sum_probs=126.8
Q ss_pred CCCcccCCCCcccCChhhHHHHHhhhcC---CCceeccccccccccchhhhhhhcccCCCCCcccCCCCCcccCChhhHH
Q psy4681 1181 LPPFICEHCSKPFTSKGNLTVHVKYYHA---KTLFECNICLKTFNFKTSYKRHLKQHDDSVTYYPCTVCSKNLSSPYRLK 1257 (1930)
Q Consensus 1181 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~---~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~L~ 1257 (1930)
...|+|+.|||.+.+.++|.+|..+|-. .+.+.|++|+|.|.+-..|+.|+++|+- +..|.+|||.|..+.-|+
T Consensus 128 ~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~l---~c~C~iCGKaFSRPWLLQ 204 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHTL---PCECGICGKAFSRPWLLQ 204 (279)
T ss_pred CCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccCC---CcccccccccccchHHhh
Confidence 4568999999999999999999988863 5789999999999999999999999983 789999999999999999
Q ss_pred hHHHHhcCCceeecccccccccchhhHHHHHHHhCCCCceecccccccccChhhHHHHHhh
Q psy4681 1258 THMLIHANNRVFTCEVCGKGFIQKRYLEEHKRVHTGYKPYACDLCSKQFTQKSTLNIHRKL 1318 (1930)
Q Consensus 1258 ~H~~~H~~~~~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~ 1318 (1930)
-|+|+|||||||.|+.|+|+|+.+++|+.||++|.+.++|+|..|+|.|..++.|..|...
T Consensus 205 GHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 205 GHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred cccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence 9999999999999999999999999999999999999999999999999999999999864
No 5
>KOG3608|consensus
Probab=99.88 E-value=5e-23 Score=217.25 Aligned_cols=253 Identities=22% Similarity=0.448 Sum_probs=201.1
Q ss_pred ceecc--cccccccC-hHHHHHHhhhcCC----------------C-----------------CCccccc--cccccccc
Q psy4681 1650 PHTCP--VCKKIFVN-KFNLTTHKKLHLP----------------M-----------------NRNHQCD--TCGKSFTG 1691 (1930)
Q Consensus 1650 ~~~C~--~C~~~f~~-~~~l~~H~~~h~~----------------~-----------------~~~~~C~--~C~~~f~~ 1691 (1930)
.++|. .|++...+ ...|.+|+-.|-- + +..|.|. .|+..|.+
T Consensus 69 e~qC~w~~C~f~~~~~s~~l~RHvy~H~y~~~l~q~G~~al~~~~dig~c~~~f~~~~~ip~~g~~f~C~WedCe~~F~s 148 (467)
T KOG3608|consen 69 EHQCTWNSCDFRTENSSADLIRHVYFHCYHTKLKQQGKLALDLHPDIGACTAPFRLMEKIPALGQNFRCGWEDCEREFVS 148 (467)
T ss_pred ceeEEeccCCccccchHHHHHhhhhhhhhHHHHHHHHHHHHhcCCCcCcccCCcchhhccccchhhhccChhhcCCcccC
Confidence 47784 48888777 4789999766531 0 1136675 48999999
Q ss_pred chHHHHHHhhhcc----------CCC-Ccccccc--chhhcCCHHHHHhcccccccCCCccccCCCcccccCHHHHHHHH
Q psy4681 1692 NNHLKRHIYSVHL----------KRD-TKFPCRL--CSQEFDTKEQRKKHERKDHETQGLFSCDLCSYTSTQKYYLVKHK 1758 (1930)
Q Consensus 1692 ~~~l~~H~~~~h~----------~~~-~~~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~~~~~~~l~~H~ 1758 (1930)
...|..|+..+-. .++ ..+.|.. |...|.++..|+.|+ +.|.+++...|+.|+..|+++..|-.|+
T Consensus 149 ~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~-r~Hs~eKvvACp~Cg~~F~~~tkl~DH~ 227 (467)
T KOG3608|consen 149 IVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHI-RTHSNEKVVACPHCGELFRTKTKLFDHL 227 (467)
T ss_pred HHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHH-HhcCCCeEEecchHHHHhccccHHHHHH
Confidence 9999988865541 122 4477876 888999999999998 4688888899999999999999999998
Q ss_pred hhcC--CCCccccccccccccccccccccccccccCCCcccccccccccChHHHHHHHHHhCCCCCccccCCCccccCCh
Q psy4681 1759 SRHI--KDYNVFCKICQLGFLSKNELDVHNIKQHDAQPHTCPVCKKIFVNKVTLAAHKKIHLPIDKNCQCDVCGKSFART 1836 (1930)
Q Consensus 1759 ~~h~--~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~C~~C~~~f~~~ 1836 (1930)
++-+ ...+|+|..|.++|.++..|..|+..|- .-|+|+.|+......++|.+||+..|+.++||+|+.|++.|.+.
T Consensus 228 rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHv--n~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~e 305 (467)
T KOG3608|consen 228 RRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHV--NCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRE 305 (467)
T ss_pred HhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhh--hcccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccH
Confidence 7654 3457889999999999999988887653 34889999999999999999999888888999999999999999
Q ss_pred hhHHhHHhhhcchhhhhhhhhccccCCCcccccCC--CCccccchhhhHHHHhhhcc---cccccccccccccCChHHHH
Q psy4681 1837 FHLKSHISSVHLKREQRKKHERKDHETQGLFSCDL--CSYTSTQKYYLVKHKSRHIK---DYNVFCKICQLGFLSKNELD 1911 (1930)
Q Consensus 1837 ~~l~~H~~~~H~~~~~~~~~~~~~~~~~~~~~C~~--C~~~~~~~~~l~~H~~~h~~---~~~~~C~~C~~~f~~~~~l~ 1911 (1930)
+.|.+|+. .|. +..|+|.. |.+++.+...|.+|++.++. .-+|.|-.|+.-|++-.+|.
T Consensus 306 sdL~kH~~-~HS---------------~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~ 369 (467)
T KOG3608|consen 306 SDLAKHVQ-VHS---------------KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLS 369 (467)
T ss_pred HHHHHHHH-hcc---------------ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHH
Confidence 99999987 432 23588888 99999999999999877763 37788999999999999999
Q ss_pred HHHhhhCCCC
Q psy4681 1912 VHNIKQHDAQ 1921 (1930)
Q Consensus 1912 ~H~~~~H~~~ 1921 (1930)
+|+++.|+-+
T Consensus 370 ~HL~kkH~f~ 379 (467)
T KOG3608|consen 370 AHLMKKHGFR 379 (467)
T ss_pred HHHHHhhccc
Confidence 9999988876
No 6
>KOG3608|consensus
Probab=99.88 E-value=2.8e-22 Score=211.60 Aligned_cols=196 Identities=23% Similarity=0.389 Sum_probs=153.1
Q ss_pred eecc--cccccccChHHHHHHhhhcCCCCCcccccccccccccchHHHHHHhhhccCCCCccccccchhhcCCHHHHHhc
Q psy4681 1651 HTCP--VCKKIFVNKFNLTTHKKLHLPMNRNHQCDTCGKSFTGNNHLKRHIYSVHLKRDTKFPCRLCSQEFDTKEQRKKH 1728 (1930)
Q Consensus 1651 ~~C~--~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~C~~C~~~f~~~~~l~~H 1728 (1930)
+.|. .|.+.|.++..|++|++.|.++. ...|+.||..|.++..|-.|+++-.....-+|+|..|.+.|.++..|..|
T Consensus 178 ~~C~W~~Ct~~~~~k~~LreH~r~Hs~eK-vvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~H 256 (467)
T KOG3608|consen 178 TMCNWAMCTKHMGNKYRLREHIRTHSNEK-VVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSH 256 (467)
T ss_pred eeccchhhhhhhccHHHHHHHHHhcCCCe-EEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHH
Confidence 4553 47778888888888888776643 77788888888888888888877765555678888888888888888888
Q ss_pred ccccccCCCccccCCCcccccCHHHHHHHHh-hcCCCCccccccccccccccccccccccccccCCCccccc--cccccc
Q psy4681 1729 ERKDHETQGLFSCDLCSYTSTQKYYLVKHKS-RHIKDYNVFCKICQLGFLSKNELDVHNIKQHDAQPHTCPV--CKKIFV 1805 (1930)
Q Consensus 1729 ~~~~h~~~~~~~C~~C~~~~~~~~~l~~H~~-~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~C~~--C~~~f~ 1805 (1930)
+.++ ..-|+|++|+.+....++|..||+ +|..+++|+|+.|++.+.++.+|.+|+..| ++..|.|.. |.++|.
T Consensus 257 v~rH---vn~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~H-S~~~y~C~h~~C~~s~r 332 (467)
T KOG3608|consen 257 VVRH---VNCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVH-SKTVYQCEHPDCHYSVR 332 (467)
T ss_pred HHHh---hhcccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHhc-cccceecCCCCCcHHHH
Confidence 7554 235788888888888888888876 466777888888888888888888887744 467788877 999999
Q ss_pred ChHHHHHHHHHhCCC--CCccccCCCccccCChhhHHhHHhhhcchhh
Q psy4681 1806 NKVTLAAHKKIHLPI--DKNCQCDVCGKSFARTFHLKSHISSVHLKRE 1851 (1930)
Q Consensus 1806 ~~~~L~~H~~~h~~~--~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~ 1851 (1930)
+...|++|++.++-. +.+|.|..|++.|++..+|.+|+...|..+.
T Consensus 333 ~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~~ 380 (467)
T KOG3608|consen 333 TYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGFRL 380 (467)
T ss_pred HHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhcccC
Confidence 999999999988733 3359999999999999999999999886543
No 7
>KOG3623|consensus
Probab=99.82 E-value=9.6e-21 Score=218.19 Aligned_cols=80 Identities=31% Similarity=0.645 Sum_probs=71.0
Q ss_pred CCcccccccccccccChhhhhhhhcccCCCCceecccccccccChHHHHHHhhhcCCCCCcccccccccccccchHHHHH
Q psy4681 1619 KEYTVFCKKCQLGFLSKNELNVHNIKQHDAQPHTCPVCKKIFVNKFNLTTHKKLHLPMNRNHQCDTCGKSFTGNNHLKRH 1698 (1930)
Q Consensus 1619 ~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~~H 1698 (1930)
.+..|.|.+|++.|.....|.+|.-.|.|.+||+|.+|.+.|..+..|..|++.|.++. ||+|+.|++.|.....+.+|
T Consensus 891 e~gmyaCDqCDK~FqKqSSLaRHKYEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHSGEK-PfQCdKClKRFSHSGSYSQH 969 (1007)
T KOG3623|consen 891 EDGMYACDQCDKAFQKQSSLARHKYEHSGQRPYQCIICKKAFKHKHHLTEHKRLHSGEK-PFQCDKCLKRFSHSGSYSQH 969 (1007)
T ss_pred ccccchHHHHHHHHHhhHHHHHhhhhhcCCCCcccchhhHhhhhhhhhhhhhhhccCCC-cchhhhhhhhcccccchHhh
Confidence 45678899999999999999999888889999999999999999999999999998754 99999999999999999999
Q ss_pred H
Q psy4681 1699 I 1699 (1930)
Q Consensus 1699 ~ 1699 (1930)
|
T Consensus 970 M 970 (1007)
T KOG3623|consen 970 M 970 (1007)
T ss_pred h
Confidence 8
No 8
>KOG3623|consensus
Probab=99.81 E-value=2e-20 Score=215.53 Aligned_cols=79 Identities=35% Similarity=0.792 Sum_probs=69.8
Q ss_pred CceeecccccccccchhhHHHHHHHhCCCCceecccccccccChhhHHHHHhhccCCCcccCCcCccccCChhHHHHHH
Q psy4681 1266 NRVFTCEVCGKGFIQKRYLEEHKRVHTGYKPYACDLCSKQFTQKSTLNIHRKLHLNIKDFICDLCGAKFYEFNTYVTHV 1344 (1930)
Q Consensus 1266 ~~~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~ 1344 (1930)
+.+|.|+.|+|.|...+.|.+|.--|+|.|||+|.+|.|.|..+-.|..|++.|.|++||.|+.|++.|....++.+||
T Consensus 892 ~gmyaCDqCDK~FqKqSSLaRHKYEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHSGSYSQHM 970 (1007)
T KOG3623|consen 892 DGMYACDQCDKAFQKQSSLARHKYEHSGQRPYQCIICKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHSGSYSQHM 970 (1007)
T ss_pred cccchHHHHHHHHHhhHHHHHhhhhhcCCCCcccchhhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccccchHhhh
Confidence 4568899999999989999999888999999999999999999999999999999999999999999998888888887
No 9
>KOG3576|consensus
Probab=99.60 E-value=2.1e-16 Score=154.20 Aligned_cols=114 Identities=34% Similarity=0.682 Sum_probs=85.9
Q ss_pred cccCCCCCcccCChhhHHhHHHHhcCCceeecccccccccchhhHHHHHHHhCCCCceecccccccccChhhHHHHHhhc
Q psy4681 1240 YYPCTVCSKNLSSPYRLKTHMLIHANNRVFTCEVCGKGFIQKRYLEEHKRVHTGYKPYACDLCSKQFTQKSTLNIHRKLH 1319 (1930)
Q Consensus 1240 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h 1319 (1930)
.|.|.+|+|.|.-..-|.+|++-|.+.+.|-|..|||+|.....|.+|+|+|||.+||+|..|++.|+++-+|..|++.-
T Consensus 117 ~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~kv 196 (267)
T KOG3576|consen 117 SFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLKKV 196 (267)
T ss_pred eeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHHHH
Confidence 46677777777777777777777777777777777777777777777777777777777777777777777777777554
Q ss_pred cC-----------CCcccCCcCccccCChhHHHHHHHhhcCCCCc
Q psy4681 1320 LN-----------IKDFICDLCGAKFYEFNTYVTHVHETHAILPR 1353 (1930)
Q Consensus 1320 ~~-----------~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~ 1353 (1930)
+| .+.|.|..||++......+..|+..+|+..|.
T Consensus 197 hgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Spa 241 (267)
T KOG3576|consen 197 HGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSPA 241 (267)
T ss_pred cCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCCHH
Confidence 43 46788999999999999999999999987653
No 10
>KOG3576|consensus
Probab=99.53 E-value=3.1e-15 Score=146.24 Aligned_cols=115 Identities=32% Similarity=0.618 Sum_probs=101.1
Q ss_pred CCceeccccccccccchhhhhhhcccCCCCCcccCCCCCcccCChhhHHhHHHHhcCCceeecccccccccchhhHHHHH
Q psy4681 1209 KTLFECNICLKTFNFKTSYKRHLKQHDDSVTYYPCTVCSKNLSSPYRLKTHMLIHANNRVFTCEVCGKGFIQKRYLEEHK 1288 (1930)
Q Consensus 1209 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~F~~~~~L~~H~ 1288 (1930)
...|.|.+|+|+|.....|.+|++-|.+.+ .|-|..|||.|.....|++|+|+|+|.+||+|..|+|+|+++-+|..|.
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~vk-r~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl 193 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLKCHSDVK-RHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHL 193 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhhhccHHH-HHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHH
Confidence 456889999999999999999999888765 7899999999999999999999999999999999999999999999998
Q ss_pred HHhCC-----------CCceecccccccccChhhHHHHHhhccCCCc
Q psy4681 1289 RVHTG-----------YKPYACDLCSKQFTQKSTLNIHRKLHLNIKD 1324 (1930)
Q Consensus 1289 ~~H~~-----------~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~ 1324 (1930)
+.-+| ++-|.|+.||+.-.....+..|++.|+...|
T Consensus 194 ~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Sp 240 (267)
T KOG3576|consen 194 KKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSP 240 (267)
T ss_pred HHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCCH
Confidence 64333 4679999999999999999999999876543
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.98 E-value=1.1e-09 Score=132.10 Aligned_cols=145 Identities=18% Similarity=0.421 Sum_probs=92.7
Q ss_pred ccCCCCcccCChhhHHHHHhhhcCCCceeccc--cccccccchhhhhhhcccCCCCCcccCCCCCcccCChhhHHhHHHH
Q psy4681 1185 ICEHCSKPFTSKGNLTVHVKYYHAKTLFECNI--CLKTFNFKTSYKRHLKQHDDSVTYYPCTVCSKNLSSPYRLKTHMLI 1262 (1930)
Q Consensus 1185 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~ 1262 (1930)
.|+.|..... ...|..|..... -..-.|+. |+..| .+..+..| +.|+.|++.|. ...|..|+.+
T Consensus 409 ~C~NC~~~i~-l~~l~lHe~~C~-r~~V~Cp~~~Cg~v~-~r~el~~H----------~~C~~Cgk~f~-~s~LekH~~~ 474 (567)
T PLN03086 409 ECRNCKHYIP-SRSIALHEAYCS-RHNVVCPHDGCGIVL-RVEEAKNH----------VHCEKCGQAFQ-QGEMEKHMKV 474 (567)
T ss_pred ECCCCCCccc-hhHHHHHHhhCC-CcceeCCccccccee-eccccccC----------ccCCCCCCccc-hHHHHHHHHh
Confidence 4666655443 334446654332 23445663 77776 33334443 46777777774 5667777777
Q ss_pred hcCCceeecccccccccchhhHHHHHHHhCCCCceecccccccccC----------hhhHHHHHhhccCCCcccCCcCcc
Q psy4681 1263 HANNRVFTCEVCGKGFIQKRYLEEHKRVHTGYKPYACDLCSKQFTQ----------KSTLNIHRKLHLNIKDFICDLCGA 1332 (1930)
Q Consensus 1263 H~~~~~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~----------~~~l~~H~~~h~~~~~~~C~~C~~ 1332 (1930)
|+ +|+.|+ ||+.+ .+..|..|+++|.+++|+.|.+|++.|.. .+.|..|.... |.+++.|..||+
T Consensus 475 ~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk 549 (567)
T PLN03086 475 FH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGR 549 (567)
T ss_pred cC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCC
Confidence 64 677777 77644 55777777777777777777777777741 34677777764 777777777777
Q ss_pred ccCChhHHHHHHHhhcC
Q psy4681 1333 KFYEFNTYVTHVHETHA 1349 (1930)
Q Consensus 1333 ~f~~~~~l~~H~~~~h~ 1349 (1930)
.+.. ..|..|+...|.
T Consensus 550 ~Vrl-rdm~~H~~~~h~ 565 (567)
T PLN03086 550 SVML-KEMDIHQIAVHQ 565 (567)
T ss_pred eeee-hhHHHHHHHhhc
Confidence 6664 566777777665
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.87 E-value=3.1e-09 Score=128.18 Aligned_cols=144 Identities=23% Similarity=0.491 Sum_probs=102.7
Q ss_pred eeecCCCCCcccChhHHHhhhhhhcCCCCcccCC--CCcccCChhhHHHHHhhhcCCCceeccccccccccchhhhhhhc
Q psy4681 1155 TVFCKECNIGFYSSTHLHSHGIKVHGLPPFICEH--CSKPFTSKGNLTVHVKYYHAKTLFECNICLKTFNFKTSYKRHLK 1232 (1930)
Q Consensus 1155 ~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 1232 (1930)
...|+.|........ |..|.....- ..-.|+. ||..|. +..|..| +.|+.|++.|. ...|..|++
T Consensus 407 ~V~C~NC~~~i~l~~-l~lHe~~C~r-~~V~Cp~~~Cg~v~~-r~el~~H---------~~C~~Cgk~f~-~s~LekH~~ 473 (567)
T PLN03086 407 TVECRNCKHYIPSRS-IALHEAYCSR-HNVVCPHDGCGIVLR-VEEAKNH---------VHCEKCGQAFQ-QGEMEKHMK 473 (567)
T ss_pred eEECCCCCCccchhH-HHHHHhhCCC-cceeCCcccccceee-ccccccC---------ccCCCCCCccc-hHHHHHHHH
Confidence 344666555443332 3344433221 2335764 888883 3444444 57999999885 678899998
Q ss_pred ccCCCCCcccCCCCCcccCChhhHHhHHHHhcCCceeeccccccccc----------chhhHHHHHHHhCCCCceecccc
Q psy4681 1233 QHDDSVTYYPCTVCSKNLSSPYRLKTHMLIHANNRVFTCEVCGKGFI----------QKRYLEEHKRVHTGYKPYACDLC 1302 (1930)
Q Consensus 1233 ~h~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~F~----------~~~~L~~H~~~H~~~~~~~C~~C 1302 (1930)
+|+ .++.|+ ||+.+ ....|..|+.+|.+++|+.|+.|++.|. ..+.|..|..+. |.+++.|..|
T Consensus 474 ~~H---kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~C 547 (567)
T PLN03086 474 VFH---EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSC 547 (567)
T ss_pred hcC---CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEcccc
Confidence 876 378999 99755 6689999999999999999999999985 235799998885 8899999999
Q ss_pred cccccChhhHHHHHhh
Q psy4681 1303 SKQFTQKSTLNIHRKL 1318 (1930)
Q Consensus 1303 ~~~f~~~~~l~~H~~~ 1318 (1930)
|+.|.. .++..|+..
T Consensus 548 gk~Vrl-rdm~~H~~~ 562 (567)
T PLN03086 548 GRSVML-KEMDIHQIA 562 (567)
T ss_pred CCeeee-hhHHHHHHH
Confidence 988875 467788754
No 13
>KOG1146|consensus
Probab=98.74 E-value=4.8e-09 Score=133.15 Aligned_cols=82 Identities=15% Similarity=0.183 Sum_probs=54.9
Q ss_pred ccccccccccccccccchhhhhhhhcccccccccc----cccccccccccChhHhhh-hhhccCCccccccCCChhhHHH
Q psy4681 1372 MQSAKSTCVLCKKVFSTRENCTNHIMECHSYDVFE----WKDKGVIKEHINPLFLKK-FAFALNCPVCKLYFDRESDFHS 1446 (1930)
Q Consensus 1372 ~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~----~~~~~~~~~~~~~~~~~~-~~~~~~C~~C~~~f~~~~~l~~ 1446 (1930)
+....+.|.+|.+...++.+++.|.+..-..+... ++..+...+......... ..+.+.|..|++...+..-|+-
T Consensus 765 v~g~~~~c~l~~y~t~~kanfqlh~Ktdkh~qk~~~~~hikegg~a~~~~t~~la~~~~P~~l~cn~cd~~Tns~~~L~l 844 (1406)
T KOG1146|consen 765 VPGDVPSCKLKPYATNTKANFQLHNKTDKHVQKYQLRAHIKEGGPANEYPTKTLASGPNPVHLKCNACDTPTNSPEPLLL 844 (1406)
T ss_pred ccCCCCcceecccccccchhhhhhcccchhhhccchhhhhhccCCccCCcchhhhcCCCccccccccccCCCCCcccccc
Confidence 34556899999999999999999987654433332 233333333333322222 3456789999999999999988
Q ss_pred hhhhccc
Q psy4681 1447 HMQSYHN 1453 (1930)
Q Consensus 1447 H~~~~h~ 1453 (1930)
|-+....
T Consensus 845 h~~~~~h 851 (1406)
T KOG1146|consen 845 HTPPSEH 851 (1406)
T ss_pred CCCCCcc
Confidence 8665444
No 14
>PHA00733 hypothetical protein
Probab=98.60 E-value=3.4e-08 Score=96.96 Aligned_cols=81 Identities=20% Similarity=0.397 Sum_probs=66.5
Q ss_pred CCCcccccccchhhh--------hhhhhhhcccCCCCcccccccccccccchhHHHHhhhccCCCccccCccchhhhchh
Q psy4681 689 GERKYCCHICGKKMR--------GKLKEHMLTHTGERPYACEICGGTFKTKWYLGVHMRKHNGERPYMCSECGQSFAARS 760 (1930)
Q Consensus 689 ~~~~~~C~~C~k~f~--------~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~ 760 (1930)
..+++.|.+|.+.|. +.|..|+.++ +++||.|+.||+.|.+...|..|++.| +.+|.|+.|++.|....
T Consensus 37 ~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~-~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~ 113 (128)
T PHA00733 37 EQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSK-AVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTD 113 (128)
T ss_pred hhhhHHHHHHhhhccChhhhcchHHHHhhcccC-CCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHH
Confidence 456778888877664 2255665554 489999999999999999999999987 46799999999999999
Q ss_pred HHHHHHHHhcCC
Q psy4681 761 AFSLHLKKHAGF 772 (1930)
Q Consensus 761 ~l~~H~~~h~~~ 772 (1930)
.|..|+...+++
T Consensus 114 sL~~H~~~~h~~ 125 (128)
T PHA00733 114 STLDHVCKKHNI 125 (128)
T ss_pred HHHHHHHHhcCc
Confidence 999999887753
No 15
>PHA00733 hypothetical protein
Probab=98.57 E-value=4.4e-08 Score=96.12 Aligned_cols=54 Identities=28% Similarity=0.664 Sum_probs=27.3
Q ss_pred CCceecccccccccChhhHHHHHhhccCCCcccCCcCccccCChhHHHHHHHhhcC
Q psy4681 1294 YKPYACDLCSKQFTQKSTLNIHRKLHLNIKDFICDLCGAKFYEFNTYVTHVHETHA 1349 (1930)
Q Consensus 1294 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~ 1349 (1930)
++||.|+.|++.|.+.+.|..|++.| ..+|.|+.|++.|.....|..|+...|.
T Consensus 71 ~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 71 VSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred CCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhcC
Confidence 44555555555555555555555443 2345555555555555555555555554
No 16
>PHA02768 hypothetical protein; Provisional
Probab=98.50 E-value=4.7e-08 Score=76.58 Aligned_cols=44 Identities=20% Similarity=0.473 Sum_probs=39.5
Q ss_pred ceeccccCcccccchhHHhHhhhcCCCCcccCCCCCccCCCchhHH
Q psy4681 472 PFGCEVCGSTYKYKYYLAVHMRKHTGERPYVCNYCGHSFAARPAFN 517 (1930)
Q Consensus 472 ~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~~~L~ 517 (1930)
-|.|+.||+.|...++|..||++|+ +||+|..|++.|.+.+.|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 4899999999999999999999998 6999999999999877654
No 17
>PHA02768 hypothetical protein; Provisional
Probab=98.49 E-value=5e-08 Score=76.44 Aligned_cols=40 Identities=20% Similarity=0.555 Sum_probs=19.1
Q ss_pred eecccccccccchhhHHHHHHHhCCCCceecccccccccChh
Q psy4681 1269 FTCEVCGKGFIQKRYLEEHKRVHTGYKPYACDLCSKQFTQKS 1310 (1930)
Q Consensus 1269 ~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~ 1310 (1930)
|.|+.||+.|+..++|..||++|+ +||+|..|++.|.+.+
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s 45 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTG 45 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccc
Confidence 444444444444444444444444 3444444444444443
No 18
>KOG3993|consensus
Probab=98.36 E-value=7.3e-08 Score=106.80 Aligned_cols=198 Identities=15% Similarity=0.160 Sum_probs=115.4
Q ss_pred ccccccchhhcCCHHHHHhcccccccCCCccccCCCcccccCHHHHHHHHhhcCCCCcccc--ccc-ccccccccccccc
Q psy4681 1709 KFPCRLCSQEFDTKEQRKKHERKDHETQGLFSCDLCSYTSTQKYYLVKHKSRHIKDYNVFC--KIC-QLGFLSKNELDVH 1785 (1930)
Q Consensus 1709 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~~~~~~~l~~H~~~h~~~~~~~C--~~C-~~~f~~~~~l~~H 1785 (1930)
.|.|..|...|.+.=.|.+|.-.. -.--.|+|+.|++.|+...+|..|.++|.....-.= ..= .+...+..+.+.-
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~R-IV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea 345 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPR-IVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEA 345 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCe-eEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhc
Confidence 489999999999999998885211 011248999999999999999999999864431100 000 0000011111110
Q ss_pred cc--ccccCCCcccccccccccChHHHHHHHHHhCCCCCc----------------cccCCCccccCChhhHHhHHhhhc
Q psy4681 1786 NI--KQHDAQPHTCPVCKKIFVNKVTLAAHKKIHLPIDKN----------------CQCDVCGKSFARTFHLKSHISSVH 1847 (1930)
Q Consensus 1786 ~~--~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~----------------~~C~~C~~~f~~~~~l~~H~~~~H 1847 (1930)
.+ ....+.-|.|.+|++.|.+...|+.|..+|+..... +.|..|.-.+.....--.+...+
T Consensus 346 ~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~- 424 (500)
T KOG3993|consen 346 ERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYV- 424 (500)
T ss_pred cccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeee-
Confidence 00 001123466666666666666666666666422111 11221211111111100000000
Q ss_pred chhhhhhhhhccccCCCcccccCCCCccccchhhhHHHHhhhcccccccccccccccCChHHHHHHHhhhCCCC
Q psy4681 1848 LKREQRKKHERKDHETQGLFSCDLCSYTSTQKYYLVKHKSRHIKDYNVFCKICQLGFLSKNELDVHNIKQHDAQ 1921 (1930)
Q Consensus 1848 ~~~~~~~~~~~~~~~~~~~~~C~~C~~~~~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 1921 (1930)
-..+-.-.|++|+--+.++..-..+.+.-.....|.|.+|.-.|.+...|.+|+.+.|..+
T Consensus 425 -------------a~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hpse 485 (500)
T KOG3993|consen 425 -------------AGSAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHPSE 485 (500)
T ss_pred -------------eccccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcChHH
Confidence 0011134588888888888877778777778889999999999999999999999998654
No 19
>KOG3993|consensus
Probab=98.31 E-value=1.6e-07 Score=104.29 Aligned_cols=54 Identities=22% Similarity=0.507 Sum_probs=42.4
Q ss_pred ccCCcCccccCChhhHHhHhhhccCCCCccCchhhhhccChhHHHHHHhhhhCC
Q psy4681 1092 YACEFCGSSFKDKSYLRIHIRKHNGERPFTCSECGQSFAARSAFSLHLKKHAGS 1145 (1930)
Q Consensus 1092 ~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~ 1145 (1930)
|.|..|...|.+.-.|..|.=.---.-.|+|+.|+|.|.-..+|-.|.++|...
T Consensus 268 yiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR 321 (500)
T KOG3993|consen 268 YICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPR 321 (500)
T ss_pred HHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCc
Confidence 889999888998888988853211123589999999999999999999998643
No 20
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.23 E-value=6.6e-07 Score=60.00 Aligned_cols=24 Identities=38% Similarity=0.899 Sum_probs=12.3
Q ss_pred HHhHHHHhcCCceeeccccccccc
Q psy4681 1256 LKTHMLIHANNRVFTCEVCGKGFI 1279 (1930)
Q Consensus 1256 L~~H~~~H~~~~~~~C~~C~k~F~ 1279 (1930)
|..||++|+|++||.|++|+++|.
T Consensus 2 l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 2 LRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHSSSSSEEESSSSEEES
T ss_pred HHHHhhhcCCCCCCCCCCCcCeeC
Confidence 445555555555555555555543
No 21
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.19 E-value=1e-06 Score=59.08 Aligned_cols=26 Identities=50% Similarity=1.007 Sum_probs=22.6
Q ss_pred hHHHHHHHhCCCCceecccccccccC
Q psy4681 1283 YLEEHKRVHTGYKPYACDLCSKQFTQ 1308 (1930)
Q Consensus 1283 ~L~~H~~~H~~~~~~~C~~C~~~f~~ 1308 (1930)
+|..||++|+|++||.|++|++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 58899999999999999999998863
No 22
>KOG1146|consensus
Probab=98.19 E-value=2.4e-06 Score=109.44 Aligned_cols=180 Identities=17% Similarity=0.189 Sum_probs=106.4
Q ss_pred cCCcchhhcCCcHHHHHHHHhhccCCCccccccCchhcCChHHHHHHHHhhccccccCCCCCccccccCCCcccChhHHH
Q psy4681 810 ICPKCNKEFYSDRTMRRHLKQVHIEIKTFSCEECDKIFATREKLQRHWNYIHQGIRNTGPNQLLECHYCGITKNNKTLLR 889 (1930)
Q Consensus 810 ~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~~C~~C~~~f~~~~~L~ 889 (1930)
.|..|+..+.+...+..|+..++.-.+.|.|+.|+..|.....|..|++.+|...++ .+|-. +.+.-.+.
T Consensus 438 e~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~---------~~c~~-gq~~~~~a 507 (1406)
T KOG1146|consen 438 ELTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQS---------AYCKA-GQNHPRLA 507 (1406)
T ss_pred cccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccch---------hHhHh-cccccccc
Confidence 366777777777778888777888889999999999999999999999877754221 11110 00111111
Q ss_pred HHHhhccCccceecccccccccChhHHHHHHHhh--hcccccccc-cc-------ccccccchhh--------HHHhhhc
Q psy4681 890 DHISAHLGIKPYCCIFCEEKYFSKKSLKRHEAKH--NKVYNKAQY-QD-------YQIQDLSMDQ--------YRELVQS 951 (1930)
Q Consensus 890 ~H~~~H~~~k~~~C~~C~~~f~~~~~L~~H~~~h--~~~~~~~~~-~~-------~~~~~~~~~~--------~~~~~~~ 951 (1930)
+=...-.+.+||.|..|..++..+.+|.+|+..- ......... .. ..+.....++ ..+....
T Consensus 508 rg~~~~~~~~p~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktk 587 (1406)
T KOG1146|consen 508 RGEVYRCPGKPYPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTK 587 (1406)
T ss_pred ccccccCCCCcccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCC
Confidence 1011122457899999999999999999998642 111000000 00 0000000000 0001223
Q ss_pred CCccCcccccccCChHHHhhhcc------CccccccCCCccCCHHHHHHHHHHh
Q psy4681 952 KERKCPKCEKEFSTPRYMRKHLR------KKFKCDVCGNGYTSVKHLKRHKIKH 999 (1930)
Q Consensus 952 ~~~~C~~C~k~f~~~~~l~~H~~------~~~~C~~C~~~f~~~~~L~~H~~~h 999 (1930)
..+.|..|+..-+-..+|+.|+. .+..+..++-.+.....+..+.+.+
T Consensus 588 P~~~C~vc~yetniarnlrihmtss~~s~~p~~~Lq~~it~~l~~~~~~~~~lp 641 (1406)
T KOG1146|consen 588 PSWRCEVCSYETNIARNLRIHMTASPSSSPPSLVLQQNITSSLASLLGGQGRLP 641 (1406)
T ss_pred CCcchhhhcchhhhhhccccccccCCCCCChHHHhhhcchhhccccccCcCCCC
Confidence 45789999999988899999886 2355666666666655555554444
No 23
>PHA00616 hypothetical protein
Probab=97.90 E-value=4.5e-06 Score=62.19 Aligned_cols=34 Identities=21% Similarity=0.349 Sum_probs=22.0
Q ss_pred eeecccccccccchhhHHHHHHHhCCCCceeccc
Q psy4681 1268 VFTCEVCGKGFIQKRYLEEHKRVHTGYKPYACDL 1301 (1930)
Q Consensus 1268 ~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~ 1301 (1930)
||+|+.||+.|..++.|..|++.|+|++||.|++
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 4666666666666666666666666666666654
No 24
>PHA00616 hypothetical protein
Probab=97.86 E-value=4.2e-06 Score=62.34 Aligned_cols=35 Identities=17% Similarity=0.388 Sum_probs=29.9
Q ss_pred ceeccccCcccccchhHHhHhhhcCCCCcccCCCC
Q psy4681 472 PFGCEVCGSTYKYKYYLAVHMRKHTGERPYVCNYC 506 (1930)
Q Consensus 472 ~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C 506 (1930)
||+|..||+.|..++.|..|++.|+|++|+.|++=
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~~ 35 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEYF 35 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeEE
Confidence 68888888889888899999988888888887753
No 25
>PHA00732 hypothetical protein
Probab=97.55 E-value=4.8e-05 Score=67.04 Aligned_cols=47 Identities=32% Similarity=0.606 Sum_probs=38.3
Q ss_pred ceeccccCcccccchhHHhHhhh-cCCCCcccCCCCCccCCCchhHHHHHHhcc
Q psy4681 472 PFGCEVCGSTYKYKYYLAVHMRK-HTGERPYVCNYCGHSFAARPAFNLHLKRHT 524 (1930)
Q Consensus 472 ~~~C~~C~k~F~~~~~L~~H~~~-H~~~~~~~C~~C~k~f~~~~~L~~H~~~h~ 524 (1930)
||.|..||+.|.+...|..|++. |+ ++.|+.||++|. .|..|++++.
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~~~ 48 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYSQY 48 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---ChhhhhcccC
Confidence 68899999999999999999984 65 368999999997 5778876543
No 26
>PHA00732 hypothetical protein
Probab=97.44 E-value=9e-05 Score=65.36 Aligned_cols=44 Identities=34% Similarity=0.644 Sum_probs=21.6
Q ss_pred eecccccccccchhhHHHHHHH-hCCCCceecccccccccChhhHHHHHhh
Q psy4681 1269 FTCEVCGKGFIQKRYLEEHKRV-HTGYKPYACDLCSKQFTQKSTLNIHRKL 1318 (1930)
Q Consensus 1269 ~~C~~C~k~F~~~~~L~~H~~~-H~~~~~~~C~~C~~~f~~~~~l~~H~~~ 1318 (1930)
|.|+.||+.|.+...|..|++. |+ ++.|+.|++.|. .|..|+++
T Consensus 2 y~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~ 46 (79)
T PHA00732 2 FKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYS 46 (79)
T ss_pred ccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhcc
Confidence 4555555555555555555542 33 234555555554 34555543
No 27
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.44 E-value=0.00016 Score=59.21 Aligned_cols=52 Identities=29% Similarity=0.497 Sum_probs=41.9
Q ss_pred CcccccccccccChHHHHHHHHHhCCC-CCccccCCCccccCChhhHHhHHhhhcc
Q psy4681 1794 PHTCPVCKKIFVNKVTLAAHKKIHLPI-DKNCQCDVCGKSFARTFHLKSHISSVHL 1848 (1930)
Q Consensus 1794 ~~~C~~C~~~f~~~~~L~~H~~~h~~~-~~~~~C~~C~~~f~~~~~l~~H~~~~H~ 1848 (1930)
.|.||.|++ ..+...|..|+...|.. .+.+.||+|...+. .+|.+||...|.
T Consensus 2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence 488999999 56678899998876654 45799999998755 489999998873
No 28
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.30 E-value=0.00018 Score=58.99 Aligned_cols=37 Identities=27% Similarity=0.537 Sum_probs=17.1
Q ss_pred ccccccccccccchHHHHHHhhhccCCCCccccccchh
Q psy4681 1680 HQCDTCGKSFTGNNHLKRHIYSVHLKRDTKFPCRLCSQ 1717 (1930)
Q Consensus 1680 ~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~C~~C~~ 1717 (1930)
|.||+|++ .-+...|..|+...|..+...+.|++|..
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~ 39 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSS 39 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchh
Confidence 44555555 33344455555554444333444444443
No 29
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=96.98 E-value=0.00049 Score=45.05 Aligned_cols=22 Identities=41% Similarity=0.922 Sum_probs=12.2
Q ss_pred eecccccccccchhhHHHHHHH
Q psy4681 1269 FTCEVCGKGFIQKRYLEEHKRV 1290 (1930)
Q Consensus 1269 ~~C~~C~k~F~~~~~L~~H~~~ 1290 (1930)
|+|++||+.|.++..|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 4555555555555555555554
No 30
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=96.84 E-value=0.00056 Score=44.79 Aligned_cols=23 Identities=39% Similarity=0.736 Sum_probs=18.6
Q ss_pred ccccccccccCChhhHHHHHhhc
Q psy4681 15 VECHHCAMRYSSKSQLLDHLNSH 37 (1930)
Q Consensus 15 ~~C~~C~~~f~~~~~l~~H~~~h 37 (1930)
|+|++|+++|+++.+|..|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 57888888888888888888764
No 31
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.21 E-value=0.0027 Score=61.16 Aligned_cols=22 Identities=41% Similarity=0.818 Sum_probs=11.0
Q ss_pred eecccccccccchhhHHHHHHH
Q psy4681 1269 FTCEVCGKGFIQKRYLEEHKRV 1290 (1930)
Q Consensus 1269 ~~C~~C~k~F~~~~~L~~H~~~ 1290 (1930)
+.|.+|++.|.+...|..||+.
T Consensus 51 ~~C~~C~~~f~s~~~l~~Hm~~ 72 (100)
T PF12756_consen 51 FRCPYCNKTFRSREALQEHMRS 72 (100)
T ss_dssp EEBSSSS-EESSHHHHHHHHHH
T ss_pred CCCCccCCCCcCHHHHHHHHcC
Confidence 5555555555555555555543
No 32
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.16 E-value=0.0039 Score=41.34 Aligned_cols=23 Identities=48% Similarity=1.003 Sum_probs=10.3
Q ss_pred eecccccccccchhhHHHHHHHh
Q psy4681 1269 FTCEVCGKGFIQKRYLEEHKRVH 1291 (1930)
Q Consensus 1269 ~~C~~C~k~F~~~~~L~~H~~~H 1291 (1930)
|.|++|++.|.+...|..|+++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 34555555555555555554443
No 33
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=95.92 E-value=0.0048 Score=40.88 Aligned_cols=24 Identities=42% Similarity=0.887 Sum_probs=11.4
Q ss_pred cccCCCccccCChhhHHhHHhhhc
Q psy4681 1824 CQCDVCGKSFARTFHLKSHISSVH 1847 (1930)
Q Consensus 1824 ~~C~~C~~~f~~~~~l~~H~~~~H 1847 (1930)
|.|++|++.|.+...|..|+..+|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 345555555555555555555443
No 34
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=95.91 E-value=0.0046 Score=42.36 Aligned_cols=25 Identities=40% Similarity=0.804 Sum_probs=17.6
Q ss_pred ceeccccCcccccchhHHhHhhhcC
Q psy4681 472 PFGCEVCGSTYKYKYYLAVHMRKHT 496 (1930)
Q Consensus 472 ~~~C~~C~k~F~~~~~L~~H~~~H~ 496 (1930)
||.|.+|++.|.+...|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 5677777777777777777776664
No 35
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=95.86 E-value=0.0049 Score=42.23 Aligned_cols=25 Identities=44% Similarity=0.894 Sum_probs=17.1
Q ss_pred cccCCcCccccCChhhHHhHhhhcc
Q psy4681 1091 PYACEFCGSSFKDKSYLRIHIRKHN 1115 (1930)
Q Consensus 1091 ~~~C~~C~k~f~~~~~L~~H~~~H~ 1115 (1930)
||.|++|++.|.+...|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 4667777777777777777776654
No 36
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=95.77 E-value=0.0054 Score=66.24 Aligned_cols=25 Identities=20% Similarity=0.530 Sum_probs=17.4
Q ss_pred CCCCcccccccccccCChhHHHhHh
Q psy4681 572 KRDQKIECNICGALFATKYTLQDHM 596 (1930)
Q Consensus 572 ~~~~~~~C~~C~k~f~~~~~L~~H~ 596 (1930)
..++||.|+.|+|.|.+...|.-|.
T Consensus 394 ~~~KPYrCevC~KRYKNlNGLKYHr 418 (423)
T COG5189 394 AKDKPYRCEVCDKRYKNLNGLKYHR 418 (423)
T ss_pred ccCCceeccccchhhccCccceecc
Confidence 3457777777777777777776664
No 37
>KOG2785|consensus
Probab=95.68 E-value=0.012 Score=66.60 Aligned_cols=80 Identities=18% Similarity=0.275 Sum_probs=53.3
Q ss_pred cccccccccccccchhhhhhhhccc-ccccccccccccc-c-----------ccccChhHhhhhhhccCCccccccCCCh
Q psy4681 1375 AKSTCVLCKKVFSTRENCTNHIMEC-HSYDVFEWKDKGV-I-----------KEHINPLFLKKFAFALNCPVCKLYFDRE 1441 (1930)
Q Consensus 1375 ~~~~C~~C~~~f~~~~~l~~H~~~~-h~~~~~~~~~~~~-~-----------~~~~~~~~~~~~~~~~~C~~C~~~f~~~ 1441 (1930)
..|+|..|...|.+...-+.|+.+. |.+++.......+ + .....+..+......+.|.+|.+.|.+.
T Consensus 2 t~ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~c~k~~~s~ 81 (390)
T KOG2785|consen 2 TGFTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEACNKSFASP 81 (390)
T ss_pred CcceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHHhhccccCh
Confidence 3579999999999999999999765 5554422111110 0 0111122233445678999999999999
Q ss_pred hhHHHhhhhcccc
Q psy4681 1442 SDFHSHMQSYHNS 1454 (1930)
Q Consensus 1442 ~~l~~H~~~~h~~ 1454 (1930)
....+|+.+....
T Consensus 82 ~a~~~hl~Sk~h~ 94 (390)
T KOG2785|consen 82 KAHENHLKSKKHV 94 (390)
T ss_pred hhHHHHHHHhhcc
Confidence 9999998875543
No 38
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=95.47 E-value=0.0055 Score=66.22 Aligned_cols=24 Identities=17% Similarity=0.397 Sum_probs=21.1
Q ss_pred CCcccccCCCCccccchhhhHHHH
Q psy4681 1863 TQGLFSCDLCSYTSTQKYYLVKHK 1886 (1930)
Q Consensus 1863 ~~~~~~C~~C~~~~~~~~~l~~H~ 1886 (1930)
..+||+|.+|++++.+..-|..|+
T Consensus 395 ~~KPYrCevC~KRYKNlNGLKYHr 418 (423)
T COG5189 395 KDKPYRCEVCDKRYKNLNGLKYHR 418 (423)
T ss_pred cCCceeccccchhhccCccceecc
Confidence 457999999999999999999885
No 39
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=95.42 E-value=0.0057 Score=58.88 Aligned_cols=76 Identities=24% Similarity=0.486 Sum_probs=23.3
Q ss_pred ccccccccccchhhhhhhhcccccccccccccccccccccChhHhhhhhhccCCccccccCCChhhHHHhhhhcccc
Q psy4681 1378 TCVLCKKVFSTRENCTNHIMECHSYDVFEWKDKGVIKEHINPLFLKKFAFALNCPVCKLYFDRESDFHSHMQSYHNS 1454 (1930)
Q Consensus 1378 ~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~ 1454 (1930)
.|.+|+..|.+...+..||...|............... ..+.........+.|..|+..|.+...|..||+.++..
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~l~~~~~-~~~~~~~~~~~~~~C~~C~~~f~s~~~l~~Hm~~~~H~ 76 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIPDQKYLVDPNR-LLNYLRKKVKESFRCPYCNKTFRSREALQEHMRSKHHK 76 (100)
T ss_dssp -------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHTTTT
T ss_pred Cccccccccccccccccccccccccccccccccccccc-cccccccccCCCCCCCccCCCCcCHHHHHHHHcCccCC
Confidence 48999999999999999999999876653221111111 11111122233589999999999999999999976543
No 40
>KOG2231|consensus
Probab=94.96 E-value=0.03 Score=69.70 Aligned_cols=124 Identities=27% Similarity=0.591 Sum_probs=83.8
Q ss_pred eeCCCCcccccCHHHHHHHHHhcCCCcccccccccccccChhhhhhhhcccCCCCceeccccc---------ccccChHH
Q psy4681 1595 FSCDLCSYTSTRKYYLVKHKSRHIKEYTVFCKKCQLGFLSKNELNVHNIKQHDAQPHTCPVCK---------KIFVNKFN 1665 (1930)
Q Consensus 1595 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~C~~C~---------~~f~~~~~ 1665 (1930)
+.|.+|+.+|.... ..-.|..| ..|.+...|+.|+...|+ .+.|.+|- ....++..
T Consensus 100 ~~C~~C~~~~~~~~------------~~~~~~~c-~~~~s~~~Lk~H~~~~H~--~~~c~lC~~~~kif~~e~k~Yt~~e 164 (669)
T KOG2231|consen 100 HSCHICDRRFRALY------------NKKECLHC-TEFKSVENLKNHMRDQHK--LHLCSLCLQNLKIFINERKLYTRAE 164 (669)
T ss_pred hhcCccccchhhhc------------ccCCCccc-cchhHHHHHHHHHHHhhh--hhccccccccceeeeeeeehehHHH
Confidence 67888987764221 11247777 777788888888877774 24444442 22235677
Q ss_pred HHHHhhhcCCCCC----cccccccccccccchHHHHHHhhhccCCCCccccccch------hhcCCHHHHHhcccccccC
Q psy4681 1666 LTTHKKLHLPMNR----NHQCDTCGKSFTGNNHLKRHIYSVHLKRDTKFPCRLCS------QEFDTKEQRKKHERKDHET 1735 (1930)
Q Consensus 1666 l~~H~~~h~~~~~----~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~C~~C~------~~f~~~~~l~~H~~~~h~~ 1735 (1930)
|..|++.-.+.++ --.|..|...|-+...|.+|++..| |.|.+|+ ..|...++|..|.+..|
T Consensus 165 l~~h~~~gd~d~~s~rGhp~C~~C~~~fld~~el~rH~~~~h------~~chfC~~~~~~neyy~~~~dLe~HfR~~H-- 236 (669)
T KOG2231|consen 165 LNLHLMFGDPDDESCRGHPLCKFCHERFLDDDELYRHLRFDH------EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH-- 236 (669)
T ss_pred HHHHHhcCCCccccccCCccchhhhhhhccHHHHHHhhccce------eheeecCcccccchhcccchHHHHHhhhcC--
Confidence 8888775444222 2568889999999999999988777 6677774 46777888888888776
Q ss_pred CCccccC--CC
Q psy4681 1736 QGLFSCD--LC 1744 (1930)
Q Consensus 1736 ~~~~~C~--~C 1744 (1930)
|.|. .|
T Consensus 237 ---flCE~~~C 244 (669)
T KOG2231|consen 237 ---FLCEEEFC 244 (669)
T ss_pred ---cccccccc
Confidence 5666 45
No 41
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=94.76 E-value=0.018 Score=38.05 Aligned_cols=23 Identities=39% Similarity=0.833 Sum_probs=11.9
Q ss_pred cccCCCCccccchhhhHHHHhhhc
Q psy4681 1867 FSCDLCSYTSTQKYYLVKHKSRHI 1890 (1930)
Q Consensus 1867 ~~C~~C~~~~~~~~~l~~H~~~h~ 1890 (1930)
|+|+.|+|+.. +..|..|++.|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 45666666555 555666655544
No 42
>KOG2231|consensus
Probab=94.75 E-value=0.054 Score=67.52 Aligned_cols=70 Identities=29% Similarity=0.595 Sum_probs=42.5
Q ss_pred CCccccCcccChhHHHhHHhhhccCCCCCccccCccch------hccCCchhhhhhhcccccCCccccc--cc-cccccC
Q psy4681 155 PCEVCGKRFNSIKRVKQHRKVVHMGIKQKKKFECAHCS------KTYLSRVGLEDHINNHTGEKGHICE--IC-NRDFYS 225 (1930)
Q Consensus 155 ~C~~C~k~f~~~~~l~~H~~~~h~~~~~~~~~~C~~C~------k~f~~~~~L~~H~~~H~~~~~~~C~--~C-~~~f~~ 225 (1930)
.|..|...|-....|..|++..| |.|.+|. .-|.....|..|-+.++ |.|+ .| .+.|..
T Consensus 184 ~C~~C~~~fld~~el~rH~~~~h--------~~chfC~~~~~~neyy~~~~dLe~HfR~~H----flCE~~~C~~~~f~~ 251 (669)
T KOG2231|consen 184 LCKFCHERFLDDDELYRHLRFDH--------EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH----FLCEEEFCRTKKFYV 251 (669)
T ss_pred cchhhhhhhccHHHHHHhhccce--------eheeecCcccccchhcccchHHHHHhhhcC----ccccccccccceeee
Confidence 36777777777777777766444 4555553 45677777777777654 6676 55 344444
Q ss_pred HHHHHHHHhhc
Q psy4681 226 DAMLKRHLVKH 236 (1930)
Q Consensus 226 ~~~l~~H~~~h 236 (1930)
...+..+++.|
T Consensus 252 ~~~~ei~lk~~ 262 (669)
T KOG2231|consen 252 AFELEIELKAH 262 (669)
T ss_pred hhHHHHHHHhh
Confidence 44555555433
No 43
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=94.71 E-value=0.029 Score=43.10 Aligned_cols=28 Identities=18% Similarity=0.444 Sum_probs=10.9
Q ss_pred cccCCCCCcccCChhhHHhHHHHhcCCc
Q psy4681 1240 YYPCTVCSKNLSSPYRLKTHMLIHANNR 1267 (1930)
Q Consensus 1240 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~ 1267 (1930)
|-.|++|+..++...+|++|+.++++.+
T Consensus 24 PatCP~C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 24 PATCPICGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp -EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred CCCCCcchhhccchhhHHHHHHHHhccc
Confidence 4444444444444444444444444333
No 44
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=94.48 E-value=0.023 Score=37.52 Aligned_cols=24 Identities=29% Similarity=0.509 Sum_probs=15.1
Q ss_pred cccCCCccccCChhhHHhHHhhhcc
Q psy4681 1824 CQCDVCGKSFARTFHLKSHISSVHL 1848 (1930)
Q Consensus 1824 ~~C~~C~~~f~~~~~l~~H~~~~H~ 1848 (1930)
|+|+.|+++.. ...|.+|++.+|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 56777777776 6677777776653
No 45
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=94.37 E-value=0.03 Score=43.00 Aligned_cols=32 Identities=28% Similarity=0.647 Sum_probs=20.1
Q ss_pred CCCcccccccccccccchhHHHHhhhccCCCc
Q psy4681 715 GERPYACEICGGTFKTKWYLGVHMRKHNGERP 746 (1930)
Q Consensus 715 ~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~ 746 (1930)
.+.|-.|++|+..+.+..+|++|+.++++.+|
T Consensus 21 S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 21 SEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred cCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 45677788888888888888888877777665
No 46
>smart00355 ZnF_C2H2 zinc finger.
Probab=94.18 E-value=0.035 Score=37.47 Aligned_cols=22 Identities=50% Similarity=0.862 Sum_probs=9.9
Q ss_pred ecccccccccchhhHHHHHHHh
Q psy4681 1270 TCEVCGKGFIQKRYLEEHKRVH 1291 (1930)
Q Consensus 1270 ~C~~C~k~F~~~~~L~~H~~~H 1291 (1930)
.|+.|++.|.....|..|++.|
T Consensus 2 ~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 2 RCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCcchhCCHHHHHHHHHHh
Confidence 3444444444444444444433
No 47
>smart00355 ZnF_C2H2 zinc finger.
Probab=93.87 E-value=0.042 Score=37.09 Aligned_cols=24 Identities=33% Similarity=0.679 Sum_probs=16.8
Q ss_pred ccccccccccCChhhHHHHHhhcC
Q psy4681 15 VECHHCAMRYSSKSQLLDHLNSHT 38 (1930)
Q Consensus 15 ~~C~~C~~~f~~~~~l~~H~~~h~ 38 (1930)
|.|+.|+++|.+...|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 467777777777777777777654
No 48
>PRK04860 hypothetical protein; Provisional
Probab=93.71 E-value=0.04 Score=56.64 Aligned_cols=36 Identities=22% Similarity=0.707 Sum_probs=17.6
Q ss_pred cccCCCCCcccCChhhHHhHHHHhcCCceeeccccccccc
Q psy4681 1240 YYPCTVCSKNLSSPYRLKTHMLIHANNRVFTCEVCGKGFI 1279 (1930)
Q Consensus 1240 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~k~F~ 1279 (1930)
+|.|. |+. ....+.+|.++|+++++|.|..|+..|.
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~ 154 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLV 154 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeE
Confidence 35554 544 4444455555555555555555554443
No 49
>PRK04860 hypothetical protein; Provisional
Probab=93.55 E-value=0.039 Score=56.73 Aligned_cols=38 Identities=29% Similarity=0.723 Sum_probs=29.0
Q ss_pred ceeccccCcccccchhHHhHhhhcCCCCcccCCCCCccCCCc
Q psy4681 472 PFGCEVCGSTYKYKYYLAVHMRKHTGERPYVCNYCGHSFAAR 513 (1930)
Q Consensus 472 ~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~ 513 (1930)
+|.|. |++ ....+.+|.++|+|+++|.|..|+..|...
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEe
Confidence 67887 876 667778888888888888888888777543
No 50
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=93.53 E-value=0.087 Score=57.89 Aligned_cols=33 Identities=33% Similarity=0.714 Sum_probs=21.6
Q ss_pred Cccccc--chhccCCHHHHHhHhhhccCCCCceeCCCC
Q psy4681 1565 KFPCRL--CSQEFGTKKQRKKHERKDHETRGVFSCDLC 1600 (1930)
Q Consensus 1565 ~~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C 1600 (1930)
.|.|+. |...-.....|..|.+..|.. +.|.+|
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~~---~~C~~C 185 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHGF---VLCSEC 185 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcCc---EEhHhh
Confidence 466765 666666677777777766653 566666
No 51
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=93.46 E-value=0.04 Score=36.87 Aligned_cols=23 Identities=35% Similarity=0.642 Sum_probs=18.3
Q ss_pred ccccccccccCChhhHHHHHhhc
Q psy4681 15 VECHHCAMRYSSKSQLLDHLNSH 37 (1930)
Q Consensus 15 ~~C~~C~~~f~~~~~l~~H~~~h 37 (1930)
|.|++|+++|.+...|..|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 56888888888888888888754
No 52
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=93.41 E-value=0.13 Score=56.71 Aligned_cols=139 Identities=29% Similarity=0.562 Sum_probs=84.4
Q ss_pred CCCCCC--CCCccCcchHHHHhhhhhcCCCCCcCCccccCcccChhHHHhHHhhhccCCCCCccccCccchhccCCchhh
Q psy4681 124 ARKCPI--CGDRYKSGTDMRRHYRDLHDSTRKCPCEVCGKRFNSIKRVKQHRKVVHMGIKQKKKFECAHCSKTYLSRVGL 201 (1930)
Q Consensus 124 ~~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~k~f~~~~~l~~H~~~~h~~~~~~~~~~C~~C~k~f~~~~~L 201 (1930)
.|.||. |..+...+..|+.|++..|. .+-|++|-+ .++.|.|++ ..-++..|
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~---~~~C~~C~~--------------------nKk~F~~E~---~lF~~~~L 204 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHG---FVLCSECIG--------------------NKKDFWNEI---RLFRSSTL 204 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcC---cEEhHhhhc--------------------CcccCccce---eeeecccc
Confidence 466764 66666667777777775443 455666621 123344433 23345667
Q ss_pred hhhhcccccCCcc----ccccccccccCHHHHHHHHhhcccccccchhhhhccCCchhHHHHHHhhcccccCCcchh---
Q psy4681 202 EDHINNHTGEKGH----ICEICNRDFYSDAMLKRHLVKHSRMIKETSEEFVETGSITREEWYKMVLQRVKTCPLCKK--- 274 (1930)
Q Consensus 202 ~~H~~~H~~~~~~----~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~k--- 274 (1930)
..|...-..+..| .|.+|...|.....|..|++.-++ .|-+|++
T Consensus 205 r~H~~~G~~e~GFKGHP~C~FC~~~FYdDDEL~~HcR~~HE-----------------------------~ChICD~v~p 255 (493)
T COG5236 205 RDHKNGGLEEEGFKGHPLCIFCKIYFYDDDELRRHCRLRHE-----------------------------ACHICDMVGP 255 (493)
T ss_pred cccccCCccccCcCCCchhhhccceecChHHHHHHHHhhhh-----------------------------hhhhhhccCc
Confidence 7776543333233 488888888888888888774221 3555544
Q ss_pred ----cccChHHHHHHHHhhcCCCCCcccc--cc----ccccccHHHHHHHHHHhccCc
Q psy4681 275 ----TYQSAKGMRLHIREVHSKVRPHQCK--GC----GKYFKSQRHLVQHERRVHLGV 322 (1930)
Q Consensus 275 ----~f~~~~~l~~H~~~~h~~~~~~~C~--~C----~~~f~~~~~l~~H~~~~h~~~ 322 (1930)
-|.+..+|..|.+..| |.|. .| -..|+....|..|+.+.|...
T Consensus 256 ~~~QYFK~Y~~Le~HF~~~h-----y~ct~qtc~~~k~~vf~~~~el~~h~~~~h~~~ 308 (493)
T COG5236 256 IRYQYFKSYEDLEAHFRNAH-----YCCTFQTCRVGKCYVFPYHTELLEHLTRFHKVN 308 (493)
T ss_pred cchhhhhCHHHHHHHhhcCc-----eEEEEEEEecCcEEEeccHHHHHHHHHHHhhcc
Confidence 4667778888877555 4442 23 146888888888887777543
No 53
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=92.89 E-value=0.058 Score=36.07 Aligned_cols=20 Identities=35% Similarity=0.843 Sum_probs=9.4
Q ss_pred ecccccccccchhhHHHHHH
Q psy4681 1270 TCEVCGKGFIQKRYLEEHKR 1289 (1930)
Q Consensus 1270 ~C~~C~k~F~~~~~L~~H~~ 1289 (1930)
.|++|++.|.+...|..|++
T Consensus 2 ~C~~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 2 YCDICNKSFSSENSLRQHLR 21 (25)
T ss_dssp EETTTTEEESSHHHHHHHHT
T ss_pred CCCCCCCCcCCHHHHHHHHC
Confidence 44444444444444444443
No 54
>KOG2482|consensus
Probab=92.75 E-value=0.087 Score=58.14 Aligned_cols=73 Identities=19% Similarity=0.332 Sum_probs=50.4
Q ss_pred cccccccccc-chhhhhhhhcccccccccccccccccccccChhHhhhhhhccCCccccccCCChhhHHHhhhhcc
Q psy4681 1378 TCVLCKKVFS-TRENCTNHIMECHSYDVFEWKDKGVIKEHINPLFLKKFAFALNCPVCKLYFDRESDFHSHMQSYH 1452 (1930)
Q Consensus 1378 ~C~~C~~~f~-~~~~l~~H~~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h 1452 (1930)
.|-.|...+. .++....|+...|..+++-++.--...+ --.-++...-.+.|-.|.+.|..+..|+.||+...
T Consensus 146 qClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvne--lLehLkekL~r~~CLyCekifrdkntLkeHMrkK~ 219 (423)
T KOG2482|consen 146 QCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNE--LLEHLKEKLERLRCLYCEKIFRDKNTLKEHMRKKR 219 (423)
T ss_pred EEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHH--HHHHHHHHHhhheeeeeccccCCcHHHHHHHHhcc
Confidence 6888877664 4788899999999877644332211111 01123334556899999999999999999999643
No 55
>KOG2785|consensus
Probab=91.41 E-value=0.22 Score=56.75 Aligned_cols=58 Identities=21% Similarity=0.253 Sum_probs=48.6
Q ss_pred ccccCCCCccccchhhhHHHHhhhccc-----------------------ccccccccc---cccCChHHHHHHHhh-hC
Q psy4681 1866 LFSCDLCSYTSTQKYYLVKHKSRHIKD-----------------------YNVFCKICQ---LGFLSKNELDVHNIK-QH 1918 (1930)
Q Consensus 1866 ~~~C~~C~~~~~~~~~l~~H~~~h~~~-----------------------~~~~C~~C~---~~f~~~~~l~~H~~~-~H 1918 (1930)
|=.|-+|+..+.+......||..+|+- ..|.|-.|+ +.|.+..++++||.. -|
T Consensus 166 Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~K~H 245 (390)
T KOG2785|consen 166 PTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRDKGH 245 (390)
T ss_pred CcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhhccC
Confidence 567999999999999999999888773 566799999 999999999999975 34
Q ss_pred CCCCc
Q psy4681 1919 DAQPH 1923 (1930)
Q Consensus 1919 ~~~~~ 1923 (1930)
-.-||
T Consensus 246 Ckl~y 250 (390)
T KOG2785|consen 246 CKLPY 250 (390)
T ss_pred cccCC
Confidence 44444
No 56
>KOG2482|consensus
Probab=91.37 E-value=0.33 Score=53.80 Aligned_cols=212 Identities=17% Similarity=0.287 Sum_probs=105.7
Q ss_pred chhhhhhhcccCCCCCcccCCCCCcccCC-hhhHHhHHH-HhcCC--ceeecccccccccchhhHHHHHHHhCCCCceec
Q psy4681 1224 KTSYKRHLKQHDDSVTYYPCTVCSKNLSS-PYRLKTHML-IHANN--RVFTCEVCGKGFIQKRYLEEHKRVHTGYKPYAC 1299 (1930)
Q Consensus 1224 ~~~l~~H~~~h~~~~~~~~C~~C~~~f~~-~~~L~~H~~-~H~~~--~~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C 1299 (1930)
+..|..+++.-.+.....+|-+|...+.. .+..-.|+- +|.-. .| -...--..|..|.+--- ..+.|
T Consensus 128 ~eaLeqqQ~Eredt~fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlp-------DniVyvnelLehLkekL--~r~~C 198 (423)
T KOG2482|consen 128 KEALEQQQKEREDTIFSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLP-------DNIVYVNELLEHLKEKL--ERLRC 198 (423)
T ss_pred HHHHHHHHHHhcCCeeeeEEEEecchhcccHHHHHHHHHHHhhhccCCC-------cceeeHHHHHHHHHHHH--hhhee
Confidence 45566666655554444567777765543 344445543 33210 00 00111234445543211 23778
Q ss_pred ccccccccChhhHHHHHhh--ccCCCcccCCcCcccc--------CChhHHHHHHHhhcCCCCcchhcccccccc--chh
Q psy4681 1300 DLCSKQFTQKSTLNIHRKL--HLNIKDFICDLCGAKF--------YEFNTYVTHVHETHAILPRVIVTKFKVEDF--QFF 1367 (1930)
Q Consensus 1300 ~~C~~~f~~~~~l~~H~~~--h~~~~~~~C~~C~~~f--------~~~~~l~~H~~~~h~~~~~~~~~~~~~~~~--~~~ 1367 (1930)
-+|.+.|..+..|+.||+. |....|-. ..-++.| .+......+. .+.+.......|. ..+
T Consensus 199 LyCekifrdkntLkeHMrkK~HrrinPkn-reYDkfyiINY~ev~ks~t~~~~e~-------dret~~d~~E~D~~wsDw 270 (423)
T KOG2482|consen 199 LYCEKIFRDKNTLKEHMRKKRHRRINPKN-REYDKFYIINYLEVGKSWTIVHSED-------DRETNEDINETDDTWSDW 270 (423)
T ss_pred eeeccccCCcHHHHHHHHhccCcccCCCc-cccceEEEEeHhhcCCccchhhhhh-------hhhhhccccccccchhhh
Confidence 8888888888888888864 43322200 0011111 0111111111 0111111111110 011
Q ss_pred hhccccccccccccccccccchhhhhhhhcccccccccccccc--ccccccc-ChhHhhhhhhccCCccccccCCChhhH
Q psy4681 1368 VCESMQSAKSTCVLCKKVFSTRENCTNHIMECHSYDVFEWKDK--GVIKEHI-NPLFLKKFAFALNCPVCKLYFDRESDF 1444 (1930)
Q Consensus 1368 ~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~~~--~~~~~~~-~~~~~~~~~~~~~C~~C~~~f~~~~~l 1444 (1930)
......+....|-.|....-+...|..||..+|..+....... -..-..+ .--.+.+..-...|-.|+..|.....|
T Consensus 271 ~ed~a~a~~v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l 350 (423)
T KOG2482|consen 271 NEDDAEALSVVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGL 350 (423)
T ss_pred hcCCCCccceEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchh
Confidence 1112223335899999999999999999999998766432211 1110000 001122233456799999999999999
Q ss_pred HHhhhhcc
Q psy4681 1445 HSHMQSYH 1452 (1930)
Q Consensus 1445 ~~H~~~~h 1452 (1930)
+.||..+.
T Consensus 351 ~~hm~e~k 358 (423)
T KOG2482|consen 351 LIHMVEDK 358 (423)
T ss_pred hhhccccc
Confidence 99998644
No 57
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=90.38 E-value=0.12 Score=35.29 Aligned_cols=22 Identities=32% Similarity=0.708 Sum_probs=13.8
Q ss_pred ccCccchhccCCchhhhhhhcc
Q psy4681 186 FECAHCSKTYLSRVGLEDHINN 207 (1930)
Q Consensus 186 ~~C~~C~k~f~~~~~L~~H~~~ 207 (1930)
|.|.+|++.|.+...|+.|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 4566666666666666666653
No 58
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=88.99 E-value=0.25 Score=33.76 Aligned_cols=22 Identities=27% Similarity=0.676 Sum_probs=14.4
Q ss_pred eeCCCCCccccChhHHhhhhhc
Q psy4681 358 HVCSICQSTYTTARGLKRHNKN 379 (1930)
Q Consensus 358 ~~C~~C~~~f~~~~~L~~H~~~ 379 (1930)
|.|..|++.|.+...|..|+++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 5566666666666666666654
No 59
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=87.87 E-value=0.24 Score=63.73 Aligned_cols=145 Identities=23% Similarity=0.375 Sum_probs=72.1
Q ss_pred CcccCCCCcccCChhhHHHHHh--hhcCC--Cceecc--ccccccccchhhhhhhcccCCCCCcccCCC--CCcccCChh
Q psy4681 1183 PFICEHCSKPFTSKGNLTVHVK--YYHAK--TLFECN--ICLKTFNFKTSYKRHLKQHDDSVTYYPCTV--CSKNLSSPY 1254 (1930)
Q Consensus 1183 ~~~C~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~h~~~~~~~~C~~--C~~~f~~~~ 1254 (1930)
++.|..|...|.....|..|.+ .|.++ +++.|+ .|++.|.+...+..|...|.+.. ++.+.. +.+.+....
T Consensus 289 ~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 367 (467)
T COG5048 289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSIS-PAKEKLLNSSSKFSPLL 367 (467)
T ss_pred CCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCC-ccccccccCcccccccc
Confidence 4555555555555555555555 45555 555555 46666666666666655555443 233322 222222222
Q ss_pred hH-----HhHHHHhcCCceeeccc--ccccccchhhHHHHHHHhCCCC--ceecccccccccChhhHHHHHhhccCCCcc
Q psy4681 1255 RL-----KTHMLIHANNRVFTCEV--CGKGFIQKRYLEEHKRVHTGYK--PYACDLCSKQFTQKSTLNIHRKLHLNIKDF 1325 (1930)
Q Consensus 1255 ~L-----~~H~~~H~~~~~~~C~~--C~k~F~~~~~L~~H~~~H~~~~--~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~ 1325 (1930)
.- ......-...+.+.+.. |-..+.....+..|...|-..+ .+.+..|.+.|.....+..|++.|....++
T Consensus 368 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (467)
T COG5048 368 NNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPL 447 (467)
T ss_pred CCCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccccccCCce
Confidence 10 00011112223344432 5555566666666665555544 244555666666666666666666555444
Q ss_pred cCC
Q psy4681 1326 ICD 1328 (1930)
Q Consensus 1326 ~C~ 1328 (1930)
-|.
T Consensus 448 ~~~ 450 (467)
T COG5048 448 LCS 450 (467)
T ss_pred eec
Confidence 443
No 60
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=86.69 E-value=0.27 Score=63.20 Aligned_cols=55 Identities=38% Similarity=0.719 Sum_probs=38.2
Q ss_pred CcccccccccccccchhHHHHhhhccCCCccccCc--cchhhhchhHHHHHHHHhcC
Q psy4681 717 RPYACEICGGTFKTKWYLGVHMRKHNGERPYMCSE--CGQSFAARSAFSLHLKKHAG 771 (1930)
Q Consensus 717 ~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~~ 771 (1930)
+++.|+.|...|.....+..|.+.|++++++.|.. |...|.....+..|...+..
T Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 88 (467)
T COG5048 32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHN 88 (467)
T ss_pred chhhcccccccccccchhhhhcccccccCCccccccccccccCCcchhhhhcccccc
Confidence 45667777777777777777777777777777765 55566666666666666554
No 61
>KOG4173|consensus
Probab=85.96 E-value=0.39 Score=49.10 Aligned_cols=82 Identities=24% Similarity=0.511 Sum_probs=52.8
Q ss_pred Cceeccc--ccccccChHHHHHHhhhcCCCCCcccccccccccccchHHHHHHhhhccC--------CCCcccccc--ch
Q psy4681 1649 QPHTCPV--CKKIFVNKFNLTTHKKLHLPMNRNHQCDTCGKSFTGNNHLKRHIYSVHLK--------RDTKFPCRL--CS 1716 (1930)
Q Consensus 1649 ~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~--------~~~~~~C~~--C~ 1716 (1930)
..|.|++ |...|.+......|...-|+. .|.+|.+.|.+...|..||..-|.. ....|.|-+ |.
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~~----sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt 153 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHGN----SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCT 153 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhcccc----hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhh
Confidence 3466765 777777777777777655443 4888888888888888887776621 123466644 66
Q ss_pred hhcCCHHHHHhccccccc
Q psy4681 1717 QEFDTKEQRKKHERKDHE 1734 (1930)
Q Consensus 1717 ~~f~~~~~l~~H~~~~h~ 1734 (1930)
.+|.+..+.+.|+-..|.
T Consensus 154 ~KFkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 154 EKFKTSRDRKDHMIRMHK 171 (253)
T ss_pred hhhhhhhhhhhHHHHhcc
Confidence 666666666666655553
No 62
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=80.98 E-value=1.1 Score=29.89 Aligned_cols=20 Identities=25% Similarity=0.620 Sum_probs=14.9
Q ss_pred ccccccccccCChhhHHHHHh
Q psy4681 15 VECHHCAMRYSSKSQLLDHLN 35 (1930)
Q Consensus 15 ~~C~~C~~~f~~~~~l~~H~~ 35 (1930)
.+|++||++| ....|..|+.
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHH
Confidence 4688888888 6677777765
No 63
>KOG2893|consensus
Probab=80.69 E-value=0.68 Score=48.33 Aligned_cols=41 Identities=39% Similarity=0.745 Sum_probs=23.7
Q ss_pred CccccCcccChhHHHhHHhhhccCCCCCccccCccchhccCCchhhhhh
Q psy4681 156 CEVCGKRFNSIKRVKQHRKVVHMGIKQKKKFECAHCSKTYLSRVGLEDH 204 (1930)
Q Consensus 156 C~~C~k~f~~~~~l~~H~~~~h~~~~~~~~~~C~~C~k~f~~~~~L~~H 204 (1930)
|-.|++.|....-|.+|++. |-|+|.+|.|..-+--.|..|
T Consensus 13 cwycnrefddekiliqhqka--------khfkchichkkl~sgpglsih 53 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKA--------KHFKCHICHKKLFSGPGLSIH 53 (341)
T ss_pred eeecccccchhhhhhhhhhh--------ccceeeeehhhhccCCCceee
Confidence 55666666666666666552 235666666655555555555
No 64
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=80.52 E-value=2.8 Score=39.75 Aligned_cols=78 Identities=23% Similarity=0.411 Sum_probs=51.6
Q ss_pred CCCCCCCCCccCcchHHHHhhhhhcC-CC------------CCcCCccccCcccChhHHHhHHhhhccCCCCCccccCcc
Q psy4681 124 ARKCPICGDRYKSGTDMRRHYRDLHD-ST------------RKCPCEVCGKRFNSIKRVKQHRKVVHMGIKQKKKFECAH 190 (1930)
Q Consensus 124 ~~~C~~C~~~f~~~~~l~~H~~~~h~-~~------------~~~~C~~C~k~f~~~~~l~~H~~~~h~~~~~~~~~~C~~ 190 (1930)
+-.|++|+-+.-+...|.+-+. |. .- +.-.|-.|.+.|........- .......|+|+.
T Consensus 15 P~~CpiCgLtLVss~HLARSyH--HLfPl~~f~ev~~~~~~~~~~C~~C~~~f~~~~~~~~~------~~~~~~~y~C~~ 86 (112)
T TIGR00622 15 PVECPICGLTLILSTHLARSYH--HLFPLKAFQEIPLEEYNGSRFCFGCQGPFPKPPVSPFD------ELKDSHRYVCAV 86 (112)
T ss_pred CCcCCcCCCEEeccchHHHhhh--ccCCCcccccccccccCCCCcccCcCCCCCCccccccc------ccccccceeCCC
Confidence 4579999999999988888764 21 11 112388888888765321100 011234688888
Q ss_pred chhccCCchhhhhhhcccc
Q psy4681 191 CSKTYLSRVGLEDHINNHT 209 (1930)
Q Consensus 191 C~k~f~~~~~L~~H~~~H~ 209 (1930)
|...|----..-.|..+|.
T Consensus 87 C~~~FC~dCD~fiHe~Lh~ 105 (112)
T TIGR00622 87 CKNVFCVDCDVFVHESLHC 105 (112)
T ss_pred CCCccccccchhhhhhccC
Confidence 8888888888888877775
No 65
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=78.91 E-value=1.7 Score=29.00 Aligned_cols=19 Identities=37% Similarity=1.004 Sum_probs=10.2
Q ss_pred cCCCCCcccCChhhHHhHHH
Q psy4681 1242 PCTVCSKNLSSPYRLKTHML 1261 (1930)
Q Consensus 1242 ~C~~C~~~f~~~~~L~~H~~ 1261 (1930)
+|++||..| .+..|..|+.
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 455555555 4455555543
No 66
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=78.00 E-value=2.3 Score=40.25 Aligned_cols=20 Identities=20% Similarity=0.567 Sum_probs=12.7
Q ss_pred CCccCchhhhhccChhHHHH
Q psy4681 1118 RPFTCSECGQSFAARSAFSL 1137 (1930)
Q Consensus 1118 ~~~~C~~C~~~f~~~~~l~~ 1137 (1930)
-|..|++||........|.+
T Consensus 14 LP~~CpiCgLtLVss~HLAR 33 (112)
T TIGR00622 14 LPVECPICGLTLILSTHLAR 33 (112)
T ss_pred CCCcCCcCCCEEeccchHHH
Confidence 46677777766666655544
No 67
>KOG4173|consensus
Probab=76.25 E-value=1.2 Score=45.75 Aligned_cols=78 Identities=21% Similarity=0.437 Sum_probs=65.6
Q ss_pred Ccccccc--cccccccchHHHHHHhhhccCCCCccccccchhhcCCHHHHHhccccccc---------CCCccccCC--C
Q psy4681 1678 RNHQCDT--CGKSFTGNNHLKRHIYSVHLKRDTKFPCRLCSQEFDTKEQRKKHERKDHE---------TQGLFSCDL--C 1744 (1930)
Q Consensus 1678 ~~~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~---------~~~~~~C~~--C 1744 (1930)
..|.|++ |...|.+.+....|....|+. .|.+|.+.|.+..-|..|+...|+ +...|+|-+ |
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~~-----sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgC 152 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHGN-----SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGC 152 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhcccc-----hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhh
Confidence 4577875 889999999999999888854 699999999999999999988884 345688965 9
Q ss_pred cccccCHHHHHHHHhh
Q psy4681 1745 SYTSTQKYYLVKHKSR 1760 (1930)
Q Consensus 1745 ~~~~~~~~~l~~H~~~ 1760 (1930)
+..|.+...-..||-.
T Consensus 153 t~KFkT~r~RkdH~I~ 168 (253)
T KOG4173|consen 153 TEKFKTSRDRKDHMIR 168 (253)
T ss_pred hhhhhhhhhhhhHHHH
Confidence 9999999999999754
No 68
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=75.54 E-value=1.8 Score=31.80 Aligned_cols=22 Identities=18% Similarity=0.558 Sum_probs=12.0
Q ss_pred cccccccccccCChhhHHHHHh
Q psy4681 14 NVECHHCAMRYSSKSQLLDHLN 35 (1930)
Q Consensus 14 ~~~C~~C~~~f~~~~~l~~H~~ 35 (1930)
+|.|++|++.|.+...+..|+.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 3555555555555555555554
No 69
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=74.86 E-value=2 Score=31.54 Aligned_cols=21 Identities=29% Similarity=0.713 Sum_probs=11.2
Q ss_pred eecccccccccchhhHHHHHH
Q psy4681 1269 FTCEVCGKGFIQKRYLEEHKR 1289 (1930)
Q Consensus 1269 ~~C~~C~k~F~~~~~L~~H~~ 1289 (1930)
|.|++|++.|.....+..|++
T Consensus 4 ~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 4 FYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred eEccccCCccCCHHHHHHHHC
Confidence 445555555555555555543
No 70
>KOG2893|consensus
Probab=73.67 E-value=0.92 Score=47.42 Aligned_cols=46 Identities=24% Similarity=0.508 Sum_probs=23.8
Q ss_pred ccccccccccchHHHHHHhhhccCCCCccccccchhhcCCHHHHHhcccccc
Q psy4681 1682 CDTCGKSFTGNNHLKRHIYSVHLKRDTKFPCRLCSQEFDTKEQRKKHERKDH 1733 (1930)
Q Consensus 1682 C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h 1733 (1930)
|-+|++.|.+...|.+|.+..| |+|.+|.++.-+--.|..|...+|
T Consensus 13 cwycnrefddekiliqhqkakh------fkchichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKAKH------FKCHICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred eeecccccchhhhhhhhhhhcc------ceeeeehhhhccCCCceeehhhhh
Confidence 5555555555555555554443 555555555544444544444333
No 71
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=71.43 E-value=2.8 Score=30.27 Aligned_cols=24 Identities=42% Similarity=1.053 Sum_probs=15.9
Q ss_pred eeccccCcccccchhHHhHhhhcCCCCcccCCCCCc
Q psy4681 473 FGCEVCGSTYKYKYYLAVHMRKHTGERPYVCNYCGH 508 (1930)
Q Consensus 473 ~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~k 508 (1930)
|+|.+||..+.... .|+.|++||.
T Consensus 2 ~~C~~CGy~y~~~~------------~~~~CP~Cg~ 25 (33)
T cd00350 2 YVCPVCGYIYDGEE------------APWVCPVCGA 25 (33)
T ss_pred EECCCCCCEECCCc------------CCCcCcCCCC
Confidence 67777776664322 5777888875
No 72
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=70.38 E-value=2.4 Score=30.57 Aligned_cols=8 Identities=38% Similarity=1.510 Sum_probs=3.2
Q ss_pred ceeccccc
Q psy4681 1296 PYACDLCS 1303 (1930)
Q Consensus 1296 ~~~C~~C~ 1303 (1930)
|+.|++|+
T Consensus 17 ~~~CP~Cg 24 (33)
T cd00350 17 PWVCPVCG 24 (33)
T ss_pred CCcCcCCC
Confidence 33444443
No 73
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=67.74 E-value=5.3 Score=38.85 Aligned_cols=26 Identities=23% Similarity=0.454 Sum_probs=17.4
Q ss_pred cccc----CCCCccccchhhhHHHHhhhcc
Q psy4681 1866 LFSC----DLCSYTSTQKYYLVKHKSRHIK 1891 (1930)
Q Consensus 1866 ~~~C----~~C~~~~~~~~~l~~H~~~h~~ 1891 (1930)
.|.| ..|+|.+.+...|..|++.+|+
T Consensus 80 G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 80 GYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 5677 6677777777777777665553
No 74
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=62.72 E-value=2.1 Score=47.28 Aligned_cols=55 Identities=22% Similarity=0.481 Sum_probs=32.7
Q ss_pred CCccccCCCccccCChhhHHhHHhhhcchhhhhhhhhccccCCCc----ccccCCCCccccch
Q psy4681 1821 DKNCQCDVCGKSFARTFHLKSHISSVHLKREQRKKHERKDHETQG----LFSCDLCSYTSTQK 1879 (1930)
Q Consensus 1821 ~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~~~~~~~----~~~C~~C~~~~~~~ 1879 (1930)
++.+.||+|+..|.++.-+...++......+-. ....+.+. ...|+.|||++...
T Consensus 3 ~k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~----~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 3 DKKITCPVCGKEFKTKKVRSGKIRVIRRDSDFC----PRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CCceECCCCCCeeeeeEEEcCCceEeeecCCCc----cccCCCCCeeeeEEECCCCCCccccc
Confidence 466889999999998876666655432111100 00111111 44799999998855
No 75
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=61.06 E-value=19 Score=34.97 Aligned_cols=26 Identities=23% Similarity=0.503 Sum_probs=17.0
Q ss_pred Ccccc----cccccccCChhHHHhHhhccC
Q psy4681 575 QKIEC----NICGALFATKYTLQDHMNTHT 600 (1930)
Q Consensus 575 ~~~~C----~~C~k~f~~~~~L~~H~~~h~ 600 (1930)
..|.| ..|+..+.+...+.+|++.++
T Consensus 79 ~G~~C~~~~~~C~y~~~~~~~m~~H~~~~H 108 (109)
T PF12013_consen 79 DGYRCQCDPPHCGYITRSKKTMRKHWRKEH 108 (109)
T ss_pred CCeeeecCCCCCCcEeccHHHHHHHHHHhc
Confidence 34667 777777777777777766543
No 76
>KOG2186|consensus
Probab=56.77 E-value=5.3 Score=43.17 Aligned_cols=45 Identities=22% Similarity=0.674 Sum_probs=21.9
Q ss_pred eecCCCChHHHH-HHHHHHhhccCCCcccCCcCccccCChhhHHhHhh
Q psy4681 1066 ICCHICGKKLRG-RLNEHMLTHTGERPYACEFCGSSFKDKSYLRIHIR 1112 (1930)
Q Consensus 1066 ~~C~~C~k~f~~-~l~~H~~~h~~~~~~~C~~C~k~f~~~~~L~~H~~ 1112 (1930)
|.|.+||..... .|..|+-.-++ .-|.|-.|++.|.. .++..|..
T Consensus 4 FtCnvCgEsvKKp~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVKKPQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred EehhhhhhhccccchHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 444455544432 44445444444 34555555555554 44555543
No 77
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=54.00 E-value=6.8 Score=28.98 Aligned_cols=32 Identities=22% Similarity=0.693 Sum_probs=18.5
Q ss_pred cCcccccccCChHHHhhhccCccccccCCCcc
Q psy4681 955 KCPKCEKEFSTPRYMRKHLRKKFKCDVCGNGY 986 (1930)
Q Consensus 955 ~C~~C~k~f~~~~~l~~H~~~~~~C~~C~~~f 986 (1930)
.|+.|+..|.-+.....-.....+|..|+..|
T Consensus 4 ~Cp~C~~~y~i~d~~ip~~g~~v~C~~C~~~f 35 (36)
T PF13717_consen 4 TCPNCQAKYEIDDEKIPPKGRKVRCSKCGHVF 35 (36)
T ss_pred ECCCCCCEEeCCHHHCCCCCcEEECCCCCCEe
Confidence 47777777766665444333445566665554
No 78
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=53.46 E-value=7.9 Score=30.68 Aligned_cols=28 Identities=32% Similarity=0.733 Sum_probs=18.7
Q ss_pred cCCCCccCchhhhhccChhHHHHHHhhh
Q psy4681 1115 NGERPFTCSECGQSFAARSAFSLHLKKH 1142 (1930)
Q Consensus 1115 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h 1142 (1930)
-||.-+.|+.||..|....++.+|....
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKa 40 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNKA 40 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhHH
Confidence 3555667777777777777777776543
No 79
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=51.77 E-value=7.1 Score=30.93 Aligned_cols=32 Identities=28% Similarity=0.514 Sum_probs=24.3
Q ss_pred cCCCcccCCcCccccCChhHHHHHHHhhcCCC
Q psy4681 1320 LNIKDFICDLCGAKFYEFNTYVTHVHETHAIL 1351 (1930)
Q Consensus 1320 ~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~ 1351 (1930)
.|+..+.|+.|+..|....++..|+-..|..+
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~~~ 44 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAHGWL 44 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhHHhhhh
Confidence 35666788888888888888888887777653
No 80
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=49.91 E-value=11 Score=27.39 Aligned_cols=25 Identities=36% Similarity=0.774 Sum_probs=14.7
Q ss_pred ceeccccCcccccchhHHhHhhhcCCCCcccCCCCCc
Q psy4681 472 PFGCEVCGSTYKYKYYLAVHMRKHTGERPYVCNYCGH 508 (1930)
Q Consensus 472 ~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~k 508 (1930)
.|.|.+||..+... +.|..|++||.
T Consensus 2 ~~~C~~CG~i~~g~------------~~p~~CP~Cg~ 26 (34)
T cd00729 2 VWVCPVCGYIHEGE------------EAPEKCPICGA 26 (34)
T ss_pred eEECCCCCCEeECC------------cCCCcCcCCCC
Confidence 36677777554321 24567777765
No 81
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=48.17 E-value=9.1 Score=28.54 Aligned_cols=32 Identities=28% Similarity=0.765 Sum_probs=18.1
Q ss_pred cCcccccccCChHHHhhhccCccccccCCCcc
Q psy4681 955 KCPKCEKEFSTPRYMRKHLRKKFKCDVCGNGY 986 (1930)
Q Consensus 955 ~C~~C~k~f~~~~~l~~H~~~~~~C~~C~~~f 986 (1930)
.||.|+..|.-+.....-.....+|..|+..|
T Consensus 4 ~CP~C~~~f~v~~~~l~~~~~~vrC~~C~~~f 35 (37)
T PF13719_consen 4 TCPNCQTRFRVPDDKLPAGGRKVRCPKCGHVF 35 (37)
T ss_pred ECCCCCceEEcCHHHcccCCcEEECCCCCcEe
Confidence 47777777766655333333355666666555
No 82
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=47.92 E-value=12 Score=30.61 Aligned_cols=32 Identities=22% Similarity=0.513 Sum_probs=20.8
Q ss_pred CccCcccccccCChHHHhhhccCccccccCCC
Q psy4681 953 ERKCPKCEKEFSTPRYMRKHLRKKFKCDVCGN 984 (1930)
Q Consensus 953 ~~~C~~C~k~f~~~~~l~~H~~~~~~C~~C~~ 984 (1930)
.|.||.|++.-..+...-+-+.++|+|+.||.
T Consensus 27 ~F~CPnCGe~~I~Rc~~CRk~g~~Y~Cp~CGF 58 (61)
T COG2888 27 KFPCPNCGEVEIYRCAKCRKLGNPYRCPKCGF 58 (61)
T ss_pred EeeCCCCCceeeehhhhHHHcCCceECCCcCc
Confidence 46777777666655555555556677777664
No 83
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=47.03 E-value=8.4 Score=28.90 Aligned_cols=32 Identities=25% Similarity=0.656 Sum_probs=13.5
Q ss_pred cCcccccccCChHHHhhhccCccccccCCCcc
Q psy4681 955 KCPKCEKEFSTPRYMRKHLRKKFKCDVCGNGY 986 (1930)
Q Consensus 955 ~C~~C~k~f~~~~~l~~H~~~~~~C~~C~~~f 986 (1930)
.||.|+..|.-......-......|+.|+..|
T Consensus 4 ~CP~C~~~~~v~~~~~~~~~~~v~C~~C~~~~ 35 (38)
T TIGR02098 4 QCPNCKTSFRVVDSQLGANGGKVRCGKCGHVW 35 (38)
T ss_pred ECCCCCCEEEeCHHHcCCCCCEEECCCCCCEE
Confidence 45555555554433322111234444444433
No 84
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=43.97 E-value=15 Score=30.09 Aligned_cols=33 Identities=18% Similarity=0.578 Sum_probs=20.4
Q ss_pred CcccccccccccChHHHHHHHHHhCCCCCccccCCCccc
Q psy4681 1794 PHTCPVCKKIFVNKVTLAAHKKIHLPIDKNCQCDVCGKS 1832 (1930)
Q Consensus 1794 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~C~~C~~~ 1832 (1930)
.|.||.||..-..+..--+ ....+|+||.||+.
T Consensus 27 ~F~CPnCGe~~I~Rc~~CR------k~g~~Y~Cp~CGF~ 59 (61)
T COG2888 27 KFPCPNCGEVEIYRCAKCR------KLGNPYRCPKCGFE 59 (61)
T ss_pred EeeCCCCCceeeehhhhHH------HcCCceECCCcCcc
Confidence 3788888865554432211 12478999999863
No 85
>PHA00626 hypothetical protein
Probab=42.28 E-value=9.5 Score=30.57 Aligned_cols=12 Identities=17% Similarity=0.149 Sum_probs=6.2
Q ss_pred ccCccchhccCC
Q psy4681 1012 HKCPTCYKIFTE 1023 (1930)
Q Consensus 1012 ~~C~~C~k~f~~ 1023 (1930)
|.|+.|+..|+.
T Consensus 24 YkCkdCGY~ft~ 35 (59)
T PHA00626 24 YVCCDCGYNDSK 35 (59)
T ss_pred eEcCCCCCeech
Confidence 555555555543
No 86
>KOG2186|consensus
Probab=40.13 E-value=13 Score=40.36 Aligned_cols=45 Identities=24% Similarity=0.648 Sum_probs=20.4
Q ss_pred ccCCcCChHHHh-hHHHHHhhccCCcceeccccCcccccchhHHhHhh
Q psy4681 447 VCCHICGKKLRG-KLKDHMLTHTGERPFGCEVCGSTYKYKYYLAVHMR 493 (1930)
Q Consensus 447 ~~C~~C~k~f~~-~L~~H~~~h~~~k~~~C~~C~k~F~~~~~L~~H~~ 493 (1930)
|.|++||....- .|.+|+..-.+ .-|.|-.|++.|.. .+...|..
T Consensus 4 FtCnvCgEsvKKp~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVKKPQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred EehhhhhhhccccchHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 334444443332 34444444333 34555555555554 44444444
No 87
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=37.80 E-value=18 Score=34.68 Aligned_cols=31 Identities=19% Similarity=0.626 Sum_probs=24.7
Q ss_pred cccCccchhccCCchhhhhhhcccccCCccccccccccccCH
Q psy4681 185 KFECAHCSKTYLSRVGLEDHINNHTGEKGHICEICNRDFYSD 226 (1930)
Q Consensus 185 ~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~ 226 (1930)
+..|+.||+.|-.. +..|.+|+.||..|.-.
T Consensus 9 KR~Cp~CG~kFYDL-----------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 9 KRTCPSCGAKFYDL-----------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred cccCCCCcchhccC-----------CCCCccCCCCCCccCcc
Confidence 46899999999652 44688999999999765
No 88
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=37.52 E-value=26 Score=33.65 Aligned_cols=33 Identities=27% Similarity=0.769 Sum_probs=20.5
Q ss_pred CceeecCCCCCcccChhHHHhhhhhhcCCCCcccCCCCcccCCh
Q psy4681 1153 GYTVFCKECNIGFYSSTHLHSHGIKVHGLPPFICEHCSKPFTSK 1196 (1930)
Q Consensus 1153 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~C~~C~~~f~~~ 1196 (1930)
|....|+.||+.|.... ..|-.|+.||..|.-.
T Consensus 7 GtKR~Cp~CG~kFYDLn-----------k~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 7 GTKRTCPSCGAKFYDLN-----------KDPIVCPKCGTEFPPE 39 (108)
T ss_pred CCcccCCCCcchhccCC-----------CCCccCCCCCCccCcc
Confidence 44566777777776632 2456677777776544
No 89
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=37.21 E-value=10 Score=41.89 Aligned_cols=13 Identities=15% Similarity=0.682 Sum_probs=8.6
Q ss_pred cccccccccccCH
Q psy4681 214 HICEICNRDFYSD 226 (1930)
Q Consensus 214 ~~C~~C~~~f~~~ 226 (1930)
..|+.||.+|...
T Consensus 49 ~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 49 WVCPHCGYAAFEE 61 (214)
T ss_pred EECCCCCCccccc
Confidence 3577777777654
No 90
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=35.81 E-value=22 Score=27.85 Aligned_cols=26 Identities=35% Similarity=0.721 Sum_probs=11.5
Q ss_pred CccccCCCccccCCh----hhHHhHHhhhc
Q psy4681 1822 KNCQCDVCGKSFART----FHLKSHISSVH 1847 (1930)
Q Consensus 1822 ~~~~C~~C~~~f~~~----~~l~~H~~~~H 1847 (1930)
....|.+|++.+... .+|.+|++..|
T Consensus 15 ~~a~C~~C~~~~~~~~~~ts~l~~HL~~~h 44 (45)
T PF02892_consen 15 KKAKCKYCGKVIKYSSGGTSNLKRHLKKKH 44 (45)
T ss_dssp S-EEETTTTEE-----SSTHHHHHHHHHTT
T ss_pred CeEEeCCCCeEEeeCCCcHHHHHHhhhhhC
Confidence 334555565555442 45666665444
No 91
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=34.77 E-value=25 Score=36.55 Aligned_cols=24 Identities=50% Similarity=1.032 Sum_probs=16.5
Q ss_pred ceeccccCcccccchhHHhHhhhcCCCCcccCCCCCc
Q psy4681 472 PFGCEVCGSTYKYKYYLAVHMRKHTGERPYVCNYCGH 508 (1930)
Q Consensus 472 ~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~k 508 (1930)
-|.|++||..+ -|+-|-+||+||.
T Consensus 134 ~~vC~vCGy~~-------------~ge~P~~CPiCga 157 (166)
T COG1592 134 VWVCPVCGYTH-------------EGEAPEVCPICGA 157 (166)
T ss_pred EEEcCCCCCcc-------------cCCCCCcCCCCCC
Confidence 57777777554 3567777888873
No 92
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=33.51 E-value=29 Score=23.58 Aligned_cols=20 Identities=25% Similarity=0.562 Sum_probs=13.3
Q ss_pred ccccccccccCChhhHHHHHh
Q psy4681 15 VECHHCAMRYSSKSQLLDHLN 35 (1930)
Q Consensus 15 ~~C~~C~~~f~~~~~l~~H~~ 35 (1930)
..|++|++.+ +...+..|+-
T Consensus 2 v~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 2 VQCPVCFREV-PENLINSHLD 21 (26)
T ss_pred CcCCCCcCcc-cHHHHHHHHH
Confidence 4677777777 5566666664
No 93
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=32.14 E-value=24 Score=38.32 Aligned_cols=44 Identities=14% Similarity=0.521 Sum_probs=25.2
Q ss_pred cccCCCCceecccccccccChHHHHHHhhh-cCCCCCccccccccc
Q psy4681 1643 IKQHDAQPHTCPVCKKIFVNKFNLTTHKKL-HLPMNRNHQCDTCGK 1687 (1930)
Q Consensus 1643 ~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~~C~~C~~ 1687 (1930)
.+.||.+.|+|..|+.-.-.-..+ .|..+ ...+...|+|.-|++
T Consensus 135 vw~hGGrif~CsfC~~flCEDDQF-EHQAsCQvLe~E~~KC~SCNr 179 (314)
T PF06524_consen 135 VWDHGGRIFKCSFCDNFLCEDDQF-EHQASCQVLESETFKCQSCNR 179 (314)
T ss_pred cccCCCeEEEeecCCCeeeccchh-hhhhhhhhhhccccccccccc
Confidence 456778888888886544433333 34332 222334677777764
No 94
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=31.59 E-value=35 Score=26.67 Aligned_cols=26 Identities=27% Similarity=0.650 Sum_probs=16.9
Q ss_pred eeccccCcccccchhHHhHhhhcCCCCcccCCCCCcc
Q psy4681 473 FGCEVCGSTYKYKYYLAVHMRKHTGERPYVCNYCGHS 509 (1930)
Q Consensus 473 ~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~k~ 509 (1930)
|.|..||..|... ...+..|+.||..
T Consensus 3 Y~C~~Cg~~~~~~-----------~~~~irC~~CG~r 28 (44)
T smart00659 3 YICGECGRENEIK-----------SKDVVRCRECGYR 28 (44)
T ss_pred EECCCCCCEeecC-----------CCCceECCCCCce
Confidence 6777777776633 2345778777764
No 95
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=30.35 E-value=29 Score=28.69 Aligned_cols=33 Identities=18% Similarity=0.435 Sum_probs=16.0
Q ss_pred CCccCcccccccCChHHHhhhccCccccccCCC
Q psy4681 952 KERKCPKCEKEFSTPRYMRKHLRKKFKCDVCGN 984 (1930)
Q Consensus 952 ~~~~C~~C~k~f~~~~~l~~H~~~~~~C~~C~~ 984 (1930)
..|.||.|++.-..+-.-=+-+..+|+|+.||.
T Consensus 24 ~~F~CPnCG~~~I~RC~~CRk~~~~Y~CP~CGF 56 (59)
T PRK14890 24 VKFLCPNCGEVIIYRCEKCRKQSNPYTCPKCGF 56 (59)
T ss_pred CEeeCCCCCCeeEeechhHHhcCCceECCCCCC
Confidence 346677777653333222222334555555553
No 96
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=30.33 E-value=26 Score=36.18 Aligned_cols=14 Identities=29% Similarity=0.617 Sum_probs=8.0
Q ss_pred eeCCCCCccccChh
Q psy4681 358 HVCSICQSTYTTAR 371 (1930)
Q Consensus 358 ~~C~~C~~~f~~~~ 371 (1930)
++|+.||++|.+..
T Consensus 29 ~~c~~c~~~f~~~e 42 (154)
T PRK00464 29 RECLACGKRFTTFE 42 (154)
T ss_pred eeccccCCcceEeE
Confidence 55666666665543
No 97
>PF15269 zf-C2H2_7: Zinc-finger
Probab=30.13 E-value=31 Score=26.09 Aligned_cols=22 Identities=27% Similarity=0.596 Sum_probs=0.0
Q ss_pred cccCCCCccccchhhhHHHHhh
Q psy4681 1867 FSCDLCSYTSTQKYYLVKHKSR 1888 (1930)
Q Consensus 1867 ~~C~~C~~~~~~~~~l~~H~~~ 1888 (1930)
|+|-+|.++..-++.|..||+.
T Consensus 21 ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred ceeecCCcccchHHHHHHHHHH
No 98
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=30.11 E-value=63 Score=36.84 Aligned_cols=22 Identities=23% Similarity=0.501 Sum_probs=15.6
Q ss_pred CccCCcchhhHHhhHhHHHHHH
Q psy4681 42 PYICHICKNSYVAAKGLKRHLK 63 (1930)
Q Consensus 42 ~~~C~~C~~~f~~~~~l~~H~~ 63 (1930)
.+=|..|++.|.+...+..|+.
T Consensus 238 ~~YC~~C~r~f~~~~VFe~Hl~ 259 (470)
T COG5188 238 KVYCVKCGREFSRSKVFEYHLE 259 (470)
T ss_pred ceeeHhhhhHhhhhHHHHHHHh
Confidence 3567777777777777777764
No 99
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=29.74 E-value=29 Score=24.94 Aligned_cols=10 Identities=20% Similarity=0.683 Sum_probs=5.2
Q ss_pred eeCCCCCccc
Q psy4681 358 HVCSICQSTY 367 (1930)
Q Consensus 358 ~~C~~C~~~f 367 (1930)
|.|..||..+
T Consensus 1 Y~C~~Cg~~~ 10 (32)
T PF03604_consen 1 YICGECGAEV 10 (32)
T ss_dssp EBESSSSSSE
T ss_pred CCCCcCCCee
Confidence 4455555554
No 100
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=29.74 E-value=26 Score=31.40 Aligned_cols=13 Identities=23% Similarity=0.838 Sum_probs=7.3
Q ss_pred ccccccccchhhc
Q psy4681 77 MYQCDICSKMFIE 89 (1930)
Q Consensus 77 ~~~C~~C~~~f~~ 89 (1930)
..+|.-|+..|+.
T Consensus 53 IW~C~kCg~~fAG 65 (89)
T COG1997 53 IWKCRKCGAKFAG 65 (89)
T ss_pred eEEcCCCCCeecc
Confidence 3556666666653
No 101
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=29.28 E-value=24 Score=27.86 Aligned_cols=6 Identities=50% Similarity=1.470 Sum_probs=2.3
Q ss_pred cCCcCc
Q psy4681 1326 ICDLCG 1331 (1930)
Q Consensus 1326 ~C~~C~ 1331 (1930)
.|+.||
T Consensus 23 ~Cp~CG 28 (46)
T PRK00398 23 RCPYCG 28 (46)
T ss_pred ECCCCC
Confidence 333333
No 102
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=28.93 E-value=32 Score=35.56 Aligned_cols=33 Identities=24% Similarity=0.677 Sum_probs=15.5
Q ss_pred CCccCcccccccCChHHHhh-hccCccccccCCC
Q psy4681 952 KERKCPKCEKEFSTPRYMRK-HLRKKFKCDVCGN 984 (1930)
Q Consensus 952 ~~~~C~~C~k~f~~~~~l~~-H~~~~~~C~~C~~ 984 (1930)
..|.||.|+..|+....+.. .....|.|+.||.
T Consensus 98 ~~Y~Cp~C~~~y~~~ea~~~~d~~~~f~Cp~Cg~ 131 (147)
T smart00531 98 AYYKCPNCQSKYTFLEANQLLDMDGTFTCPRCGE 131 (147)
T ss_pred cEEECcCCCCEeeHHHHHHhcCCCCcEECCCCCC
Confidence 34556666666554443322 2222355555544
No 103
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=28.37 E-value=33 Score=26.12 Aligned_cols=12 Identities=33% Similarity=0.949 Sum_probs=7.1
Q ss_pred cccccccccccc
Q psy4681 330 FECFHCGAKFIS 341 (1930)
Q Consensus 330 ~~C~~C~~~f~~ 341 (1930)
|.|..||..|..
T Consensus 6 y~C~~Cg~~fe~ 17 (41)
T smart00834 6 YRCEDCGHTFEV 17 (41)
T ss_pred EEcCCCCCEEEE
Confidence 566666666543
No 104
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=28.35 E-value=30 Score=35.91 Aligned_cols=12 Identities=50% Similarity=1.168 Sum_probs=6.1
Q ss_pred cCCcceeccccC
Q psy4681 468 TGERPFGCEVCG 479 (1930)
Q Consensus 468 ~~~k~~~C~~C~ 479 (1930)
-|+-|-+|++||
T Consensus 145 ~ge~P~~CPiCg 156 (166)
T COG1592 145 EGEAPEVCPICG 156 (166)
T ss_pred cCCCCCcCCCCC
Confidence 344455555554
No 105
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=28.21 E-value=59 Score=33.61 Aligned_cols=40 Identities=20% Similarity=0.558 Sum_probs=25.4
Q ss_pred hccCCcceeccccCcccccchhHHh-HhhhcCCCCcccCCCCCccC
Q psy4681 466 THTGERPFGCEVCGSTYKYKYYLAV-HMRKHTGERPYVCNYCGHSF 510 (1930)
Q Consensus 466 ~h~~~k~~~C~~C~k~F~~~~~L~~-H~~~H~~~~~~~C~~C~k~f 510 (1930)
.-.+..-|.|+.|+..|.....+.. . . +..|.|+.||...
T Consensus 93 ~e~~~~~Y~Cp~C~~~y~~~ea~~~~d----~-~~~f~Cp~Cg~~l 133 (147)
T smart00531 93 DETNNAYYKCPNCQSKYTFLEANQLLD----M-DGTFTCPRCGEEL 133 (147)
T ss_pred cccCCcEEECcCCCCEeeHHHHHHhcC----C-CCcEECCCCCCEE
Confidence 3344557888888888875544332 2 1 2348888888764
No 106
>PF12907 zf-met2: Zinc-binding
Probab=28.00 E-value=32 Score=26.15 Aligned_cols=27 Identities=26% Similarity=0.624 Sum_probs=20.9
Q ss_pred ccccccc---cccCChHHHHHHHhhhCCCC
Q psy4681 1895 VFCKICQ---LGFLSKNELDVHNIKQHDAQ 1921 (1930)
Q Consensus 1895 ~~C~~C~---~~f~~~~~l~~H~~~~H~~~ 1921 (1930)
+.|.+|- ....+...|..|..+.|...
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~ 31 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKN 31 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCC
Confidence 5688888 45555678999999888887
No 107
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=27.82 E-value=52 Score=34.40 Aligned_cols=19 Identities=21% Similarity=0.307 Sum_probs=9.7
Q ss_pred CceeecccccccccchhhH
Q psy4681 1266 NRVFTCEVCGKGFIQKRYL 1284 (1930)
Q Consensus 1266 ~~~~~C~~C~k~F~~~~~L 1284 (1930)
..-|.|+.|+..|+.-..+
T Consensus 107 ~~~Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 107 NMFFICPNMCVRFTFNEAM 125 (158)
T ss_pred CCeEECCCCCcEeeHHHHH
Confidence 3445555555555554444
No 108
>PF12907 zf-met2: Zinc-binding
Probab=27.68 E-value=37 Score=25.84 Aligned_cols=27 Identities=22% Similarity=0.560 Sum_probs=13.0
Q ss_pred cccccch---hhcCCHHHHHhcccccccCC
Q psy4681 1710 FPCRLCS---QEFDTKEQRKKHERKDHETQ 1736 (1930)
Q Consensus 1710 ~~C~~C~---~~f~~~~~l~~H~~~~h~~~ 1736 (1930)
++|.+|. ....+...|..|....|+..
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~ 31 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKN 31 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCC
Confidence 3455555 23334455555555555443
No 109
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=26.64 E-value=24 Score=30.92 Aligned_cols=44 Identities=20% Similarity=0.405 Sum_probs=27.3
Q ss_pred cccCCCCcccCChhhHHHHHhhhcCCCceecc--ccccccccchhhhh
Q psy4681 1184 FICEHCSKPFTSKGNLTVHVKYYHAKTLFECN--ICLKTFNFKTSYKR 1229 (1930)
Q Consensus 1184 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~--~C~~~f~~~~~l~~ 1229 (1930)
+.|+.||.......+-..... ..++.++|. .||.+|.....+..
T Consensus 2 m~CP~Cg~~a~irtSr~~s~~--~~~~Y~qC~N~eCg~tF~t~es~s~ 47 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITDT--TKERYHQCQNVNCSATFITYESVQR 47 (72)
T ss_pred ccCCCCCCccEEEEChhcChh--hheeeeecCCCCCCCEEEEEEEEEE
Confidence 467888876644433333322 456678887 88888887765443
No 110
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=26.42 E-value=58 Score=34.10 Aligned_cols=37 Identities=22% Similarity=0.394 Sum_probs=22.3
Q ss_pred hhccCCCcccCCcCccccCChhhHHhHhhhccCCCCccCchhhhhc
Q psy4681 1084 LTHTGERPYACEFCGSSFKDKSYLRIHIRKHNGERPFTCSECGQSF 1129 (1930)
Q Consensus 1084 ~~h~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f 1129 (1930)
....+..-|.|+.|+..|+....+. ..|.|+.||...
T Consensus 102 ~~e~~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~L 138 (158)
T TIGR00373 102 EFETNNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAML 138 (158)
T ss_pred hhccCCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCEe
Confidence 3334445567777777776666664 247777776553
No 111
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=26.21 E-value=45 Score=26.94 Aligned_cols=21 Identities=29% Similarity=0.580 Sum_probs=14.0
Q ss_pred eeccccCcccccc-----hhHHhHhh
Q psy4681 473 FGCEVCGSTYKYK-----YYLAVHMR 493 (1930)
Q Consensus 473 ~~C~~C~k~F~~~-----~~L~~H~~ 493 (1930)
-.|..|++.++.. +.|.+|++
T Consensus 19 a~C~~C~~~l~~~~~~gTs~L~rHl~ 44 (50)
T smart00614 19 AKCKYCGKKLSRSSKGGTSNLRRHLR 44 (50)
T ss_pred EEecCCCCEeeeCCCCCcHHHHHHHH
Confidence 4677777777654 46677766
No 112
>KOG2071|consensus
Probab=26.13 E-value=51 Score=41.21 Aligned_cols=27 Identities=26% Similarity=0.521 Sum_probs=18.4
Q ss_pred CcceeccccCcccccchhHHhHhhhcC
Q psy4681 470 ERPFGCEVCGSTYKYKYYLAVHMRKHT 496 (1930)
Q Consensus 470 ~k~~~C~~C~k~F~~~~~L~~H~~~H~ 496 (1930)
..|-+|..||.+|........||-.|.
T Consensus 416 ~~pnqC~~CG~R~~~~ee~sk~md~H~ 442 (579)
T KOG2071|consen 416 DSPNQCKSCGLRFDDSEERSKHMDIHD 442 (579)
T ss_pred CCcchhcccccccccchhhhhHhhhhh
Confidence 345677777777777777777776663
No 113
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=25.67 E-value=33 Score=37.50 Aligned_cols=30 Identities=27% Similarity=0.629 Sum_probs=21.7
Q ss_pred CCccccCCCccccCChhhHHhHHhhhcchh
Q psy4681 1821 DKNCQCDVCGKSFARTFHLKSHISSVHLKR 1850 (1930)
Q Consensus 1821 ~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 1850 (1930)
+..|.|+.|+|.|....-+++||.+.|.-.
T Consensus 75 ~~K~~C~lc~KlFkg~eFV~KHI~nKH~e~ 104 (214)
T PF04959_consen 75 EDKWRCPLCGKLFKGPEFVRKHIFNKHPEK 104 (214)
T ss_dssp SEEEEE-SSS-EESSHHHHHHHHHHH-HHH
T ss_pred CCEECCCCCCcccCChHHHHHHHhhcCHHH
Confidence 346888888888888888889988888654
No 114
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=24.61 E-value=33 Score=30.02 Aligned_cols=39 Identities=28% Similarity=0.376 Sum_probs=18.4
Q ss_pred ccccccccccccccccccccccCCCCceeCC--CCCccccChh
Q psy4681 331 ECFHCGAKFISRTHIADHMTSHTGIKNHVCS--ICQSTYTTAR 371 (1930)
Q Consensus 331 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~--~C~~~f~~~~ 371 (1930)
.|+.||....-..+-..... ..+.-++|. .||.+|.+..
T Consensus 3 ~CP~Cg~~a~irtSr~~s~~--~~~~Y~qC~N~eCg~tF~t~e 43 (72)
T PRK09678 3 HCPLCQHAAHARTSRYITDT--TKERYHQCQNVNCSATFITYE 43 (72)
T ss_pred cCCCCCCccEEEEChhcChh--hheeeeecCCCCCCCEEEEEE
Confidence 46666555433332222211 334445666 6666665543
No 115
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=24.38 E-value=57 Score=34.88 Aligned_cols=18 Identities=22% Similarity=0.451 Sum_probs=9.6
Q ss_pred ceeecccccccccchhhH
Q psy4681 1267 RVFTCEVCGKGFIQKRYL 1284 (1930)
Q Consensus 1267 ~~~~C~~C~k~F~~~~~L 1284 (1930)
.-|.|+.|+..|+.-..+
T Consensus 116 ~~Y~Cp~C~~rytf~eA~ 133 (178)
T PRK06266 116 MFFFCPNCHIRFTFDEAM 133 (178)
T ss_pred CEEECCCCCcEEeHHHHh
Confidence 445555555555554443
No 116
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=24.25 E-value=45 Score=26.62 Aligned_cols=29 Identities=28% Similarity=0.751 Sum_probs=16.8
Q ss_pred cceeccccCcccccchhHHhHhhhcCCCCcccCCCCCcc
Q psy4681 471 RPFGCEVCGSTYKYKYYLAVHMRKHTGERPYVCNYCGHS 509 (1930)
Q Consensus 471 k~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~k~ 509 (1930)
..|.|..||+.|... .......|+.||..
T Consensus 5 ~~Y~C~~Cg~~~~~~----------~~~~~irCp~Cg~r 33 (49)
T COG1996 5 MEYKCARCGREVELD----------QETRGIRCPYCGSR 33 (49)
T ss_pred EEEEhhhcCCeeehh----------hccCceeCCCCCcE
Confidence 347777777777211 12345677777754
No 117
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=23.96 E-value=43 Score=27.24 Aligned_cols=12 Identities=33% Similarity=1.035 Sum_probs=7.2
Q ss_pred cccccccccccc
Q psy4681 330 FECFHCGAKFIS 341 (1930)
Q Consensus 330 ~~C~~C~~~f~~ 341 (1930)
|.|..||..|..
T Consensus 6 y~C~~Cg~~fe~ 17 (52)
T TIGR02605 6 YRCTACGHRFEV 17 (52)
T ss_pred EEeCCCCCEeEE
Confidence 566666666653
No 118
>COG5112 UFD2 U1-like Zn-finger-containing protein [General function prediction only]
Probab=23.95 E-value=22 Score=32.59 Aligned_cols=21 Identities=29% Similarity=0.569 Sum_probs=12.5
Q ss_pred ccCCcchhhHHhhHhHHHHHH
Q psy4681 43 YICHICKNSYVAAKGLKRHLK 63 (1930)
Q Consensus 43 ~~C~~C~~~f~~~~~l~~H~~ 63 (1930)
+-|-.|.+.|.+...|..|.+
T Consensus 56 hYCieCaryf~t~~aL~~Hkk 76 (126)
T COG5112 56 HYCIECARYFITEKALMEHKK 76 (126)
T ss_pred eeeehhHHHHHHHHHHHHHhc
Confidence 555556666666666666654
No 119
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=23.30 E-value=25 Score=36.23 Aligned_cols=34 Identities=26% Similarity=0.718 Sum_probs=16.0
Q ss_pred cCcccccccC---ChHHHhh--hccCccccccCCCccCC
Q psy4681 955 KCPKCEKEFS---TPRYMRK--HLRKKFKCDVCGNGYTS 988 (1930)
Q Consensus 955 ~C~~C~k~f~---~~~~l~~--H~~~~~~C~~C~~~f~~ 988 (1930)
+||.|+..++ +...+.. .+++.+.|+.||.+|.+
T Consensus 2 ~cp~c~~~~~~~~~s~~~~~~~~~~~~~~c~~c~~~f~~ 40 (154)
T PRK00464 2 RCPFCGHPDTRVIDSRPAEDGNAIRRRRECLACGKRFTT 40 (154)
T ss_pred cCCCCCCCCCEeEeccccCCCCceeeeeeccccCCcceE
Confidence 4666665442 2222222 33334566666666554
No 120
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=23.23 E-value=63 Score=31.53 Aligned_cols=33 Identities=18% Similarity=0.391 Sum_probs=19.6
Q ss_pred CceeecCCCCCcccChhHHHhhhhhhcCCCCcccCCCCcccCCh
Q psy4681 1153 GYTVFCKECNIGFYSSTHLHSHGIKVHGLPPFICEHCSKPFTSK 1196 (1930)
Q Consensus 1153 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~C~~C~~~f~~~ 1196 (1930)
|....|+.|++.|.... ..|-.|+.||..|...
T Consensus 7 GtKr~Cp~cg~kFYDLn-----------k~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 7 GTKRICPNTGSKFYDLN-----------RRPAVSPYTGEQFPPE 39 (129)
T ss_pred CccccCCCcCccccccC-----------CCCccCCCcCCccCcc
Confidence 34556777777766532 2456677777666544
No 121
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=22.81 E-value=51 Score=22.47 Aligned_cols=9 Identities=44% Similarity=1.350 Sum_probs=6.6
Q ss_pred CCcchhhcC
Q psy4681 811 CPKCNKEFY 819 (1930)
Q Consensus 811 C~~C~~~f~ 819 (1930)
|+.||..|.
T Consensus 17 Cp~CG~~F~ 25 (26)
T PF10571_consen 17 CPHCGYDFE 25 (26)
T ss_pred CCCCCCCCc
Confidence 778877774
No 122
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=22.59 E-value=68 Score=34.29 Aligned_cols=37 Identities=19% Similarity=0.510 Sum_probs=24.6
Q ss_pred hccCCCcccCCcCccccCChhhHHhHhhhccCCCCccCchhhhhcc
Q psy4681 1085 THTGERPYACEFCGSSFKDKSYLRIHIRKHNGERPFTCSECGQSFA 1130 (1930)
Q Consensus 1085 ~h~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~ 1130 (1930)
.-....-|.|+.|+..|+....+. ..|.|+.||....
T Consensus 111 ~e~~~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L~ 147 (178)
T PRK06266 111 EEENNMFFFCPNCHIRFTFDEAME---------YGFRCPQCGEMLE 147 (178)
T ss_pred hccCCCEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCCe
Confidence 333445678888888877776653 2588888876643
No 123
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=22.39 E-value=36 Score=32.72 Aligned_cols=10 Identities=20% Similarity=0.833 Sum_probs=4.9
Q ss_pred cccCCCCccc
Q psy4681 1184 FICEHCSKPF 1193 (1930)
Q Consensus 1184 ~~C~~C~~~f 1193 (1930)
+.|+.|+..+
T Consensus 17 ~~C~~C~~~~ 26 (104)
T TIGR01384 17 YVCPSCGYEK 26 (104)
T ss_pred EECcCCCCcc
Confidence 4455555443
No 124
>PF09416 UPF1_Zn_bind: RNA helicase (UPF2 interacting domain); InterPro: IPR018999 UPF1 (or regulator of nonsense transcripts 1 homologue) is an essential RNA helicase that detects mRNAs containing premature stop codons and triggers their degradation. This domain contains 3 zinc binding motifs and forms interactions with another protein (UPF2) that is also involved nonsense-mediated mRNA decay (NMD) []. ; GO: 0003677 DNA binding, 0004386 helicase activity, 0005524 ATP binding, 0008270 zinc ion binding, 0000184 nuclear-transcribed mRNA catabolic process, nonsense-mediated decay, 0005737 cytoplasm; PDB: 2IYK_B 2WJY_A 2WJV_A 2XZL_A.
Probab=22.27 E-value=72 Score=32.54 Aligned_cols=30 Identities=33% Similarity=0.633 Sum_probs=18.0
Q ss_pred hcCCCCCccccccccccc------cHHHHHHHHHHh
Q psy4681 289 VHSKVRPHQCKGCGKYFK------SQRHLVQHERRV 318 (1930)
Q Consensus 289 ~h~~~~~~~C~~C~~~f~------~~~~l~~H~~~~ 318 (1930)
.|...--.+|..|++.|= +.+.+..|+.+.
T Consensus 8 ~~~p~~vv~C~~c~kWFCNg~~~~s~SHIv~HLv~s 43 (152)
T PF09416_consen 8 IHDPSCVVKCNTCNKWFCNGRGNTSGSHIVNHLVRS 43 (152)
T ss_dssp ---CCCEEEETTTTEEEES--TTSSS-HHHHHHHHH
T ss_pred CCCcccEeEcCCCCcEeecCCCCCcccHHHHHHHHc
Confidence 556666678888888773 456677776543
No 125
>KOG1280|consensus
Probab=22.17 E-value=62 Score=37.20 Aligned_cols=37 Identities=27% Similarity=0.538 Sum_probs=19.4
Q ss_pred cccccccccccccchHHHHHHhhhccCCCCccccccc
Q psy4681 1679 NHQCDTCGKSFTGNNHLKRHIYSVHLKRDTKFPCRLC 1715 (1930)
Q Consensus 1679 ~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~~C~~C 1715 (1930)
.|.|++|+..-.+...|..|+...|..-....+|++|
T Consensus 79 SftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c 115 (381)
T KOG1280|consen 79 SFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLC 115 (381)
T ss_pred cccCCcccccccchhHHHHHhhhcCcccCcceeeecc
Confidence 4555555555555555555555555443344445444
No 126
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=21.91 E-value=18 Score=27.92 Aligned_cols=11 Identities=27% Similarity=1.319 Sum_probs=5.5
Q ss_pred cccccCCCccC
Q psy4681 977 FKCDVCGNGYT 987 (1930)
Q Consensus 977 ~~C~~C~~~f~ 987 (1930)
|+|..||..|.
T Consensus 6 y~C~~Cg~~fe 16 (42)
T PF09723_consen 6 YRCEECGHEFE 16 (42)
T ss_pred EEeCCCCCEEE
Confidence 44555555444
No 127
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=21.38 E-value=87 Score=26.31 Aligned_cols=21 Identities=19% Similarity=0.488 Sum_probs=8.3
Q ss_pred hhhHHHHHHHhCCCCceeccc
Q psy4681 1281 KRYLEEHKRVHTGYKPYACDL 1301 (1930)
Q Consensus 1281 ~~~L~~H~~~H~~~~~~~C~~ 1301 (1930)
+..|..|...-=..++-.|++
T Consensus 23 r~~l~~H~~~~C~~~~v~C~~ 43 (60)
T PF02176_consen 23 RKELDDHLENECPKRPVPCPY 43 (60)
T ss_dssp CCCHHHHHHTTSTTSEEE-SS
T ss_pred HHHHHHHHHccCCCCcEECCC
Confidence 344444444333333444444
No 128
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=21.15 E-value=49 Score=27.84 Aligned_cols=21 Identities=19% Similarity=0.350 Sum_probs=9.2
Q ss_pred chhhhhhhhcccCCCcccccc
Q psy4681 677 KGSLKEHMIVHTGERKYCCHI 697 (1930)
Q Consensus 677 ~~~l~~H~~~h~~~~~~~C~~ 697 (1930)
+..|..|....-..++..|+.
T Consensus 23 r~~l~~H~~~~C~~~~v~C~~ 43 (60)
T PF02176_consen 23 RKELDDHLENECPKRPVPCPY 43 (60)
T ss_dssp CCCHHHHHHTTSTTSEEE-SS
T ss_pred HHHHHHHHHccCCCCcEECCC
Confidence 344444444434444445555
No 129
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=20.90 E-value=36 Score=33.95 Aligned_cols=27 Identities=26% Similarity=0.596 Sum_probs=14.2
Q ss_pred hhccccccccccccCChhhHHHHHhhcCCC
Q psy4681 11 RQLNVECHHCAMRYSSKSQLLDHLNSHTGL 40 (1930)
Q Consensus 11 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~ 40 (1930)
.+....|-+||+.|.+. .+|++.|+|-
T Consensus 69 ~~d~i~clecGk~~k~L---krHL~~~~gl 95 (132)
T PF05443_consen 69 TPDYIICLECGKKFKTL---KRHLRTHHGL 95 (132)
T ss_dssp -SS-EE-TBT--EESBH---HHHHHHTT-S
T ss_pred ccCeeEEccCCcccchH---HHHHHHccCC
Confidence 44557777777777654 6777777654
No 130
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=20.59 E-value=56 Score=29.47 Aligned_cols=14 Identities=29% Similarity=0.688 Sum_probs=6.8
Q ss_pred CCcccccccccccC
Q psy4681 1793 QPHTCPVCKKIFVN 1806 (1930)
Q Consensus 1793 ~~~~C~~C~~~f~~ 1806 (1930)
+|-+|..||+.|..
T Consensus 57 ~Pa~CkkCGfef~~ 70 (97)
T COG3357 57 RPARCKKCGFEFRD 70 (97)
T ss_pred cChhhcccCccccc
Confidence 34455555555543
Done!